Senior Deep Learning Research Engineer, Advanced AI Systems

at Nvidia
USD 224,000-356,500 per year
SENIOR
✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Python @ 4 Distributed Systems @ 4 Machine Learning @ 4 Communication @ 7 Networking @ 4 Debugging @ 4 CUDA @ 4

Details

We are now looking for a Senior Deep Learning Research Engineer for Advanced AI Systems!

NVIDIA is seeking a Senior Research Engineer to join our advanced AI strategy team to inform direction with NVIDIA’s DL engineering teams and collaborate with the world’s leading AI companies innovating on the next generation of AI. As a research engineer on the team, you will interact with both internal teams, and key strategic partners to surface, define and help implement the strategy of our products.

If you are a research/software engineer who enjoys working at the forefront of Generative AI, are passionate about fast-evolving core technology - both from AI and systems perspective, and desire to work with external as well as the teams, are driven to unite the latest AI research and future hardware designs in a cohesive, full-stack software strategy, we should talk! We contribute to all steps of the machine learning lifecycle: from conceptualization to applied research, engineering for optimized inference, training and deployment.

Responsibilities

  • Work with the latest in research, industry and closely with our most strategic partners and researchers to surface areas that stretch our hardware and software in unique ways.
  • Working closely with internal teams to identify the key gaps. Be abreast of the latest in AI systems infra - from kernel level optimizations to data center scale deployments.
  • Will include quick prototyping, architecting and crafting new features.
  • Working with engineering, research teams across all of NVIDIA to ensure flawless transition of concepts to the NVIDIA stack.
  • Conceptualize a solution across multiple facets - data center designs, networking, different model architectures, NVIDIA stack and deployment scenarios.

Requirements

  • Understanding of the latest in Deep Neural Networks, Large Language models, Multimodal and Scaling techniques.
  • 10+ years proven experience in Deep Learning systems and infra and NVIDIA GPUs.
  • Excellent C/C++, Python programming and software design skills, including debugging, performance analysis, and test design.
  • Strong foundation in CPU and/or GPU architecture. Knowledge of high-performance computing and distributed programming.
  • Strong communication and interpersonal skills along with the ability to work in a dynamic and distributed team.
  • Doctoral degree in Computer Science, Computer Engineering, related field (or equivalent experience)
  • Ability to envision beyond what’s possible right now.

Ways to stand out from a crowd:

  • Experience architecting or developing large-scale deep learning distributed systems.
  • Knowledge of CPU and GPU architecture.
  • GPU programming (CUDA).

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you!