Used Tools & Technologies
Not specified
Required Skills & Competences ?
Python @ 6 Machine Learning @ 4 Communication @ 4 LLM @ 4 PyTorch @ 6 CUDA @ 4 GPU @ 7Details
We are now looking for an AI Deep Learning Engineer! NVIDIA is seeking world-class researchers and engineers to join our deep learning team focused on building next-generation AI systems for coding. You will work at the cutting edge of large language model (LLM) development and contribute to improving AI's capabilities in understanding, generating, and interacting with code. Your work will directly impact how developers build software with the help of AI.
This role offers the unique opportunity to shape the future of programming by exploring innovations in post-training methods, agent-based workflows, and data synthesis strategies for coding LLMs.
Responsibilities
- Develop intelligent coding agents capable of planning, tool use, and multi-step reasoning for real-world coding workflows.
- Supervised fine-tuning and reinforcement learning to improve LLM performance on programming tasks.
- Generate and curate high-quality synthetic datasets (code, natural language, compiler traces, interaction logs) for LLM training and evaluation.
- Collaborate with cross-functional teams including CUDA teams, developer tools engineers, and foundational model researchers.
- Benchmark models on a range of code generation tasks.
Requirements
- M.S. or Ph.D. in Computer Science, Machine Learning, or related field (or equivalent experience).
- 5+ years of experience in deep learning, with a focus on LLMs or generative models.
- Strong knowledge in one or more of the following:
- Post-training techniques for LLMs (e.g., RL, instruction tuning, alignment).
- Autonomous agents or tool-augmented LLMs.
- Synthetic data generation and scalable data pipelines for machine learning.
- Proficiency in Python, C++, and deep learning frameworks such as PyTorch.
- Ability to work independently and drive projects end-to-end in a research or applied setting.
- Excellent communication and interpersonal skills to articulate complex technical concepts and collaborate effectively with cross-functional teams.
Ways to Stand Out from the Crowd
- Proven contributions in developing AI systems for software, hardware, or large-scale computing environments.
- Expertise in C++, CUDA, or GPU programming, with a deep understanding of chip design and computer architecture.
- Experience with GenAI and cutting-edge LLM technologies, and familiarity with the intersection of AI and hardware design.
Benefits
- The opportunity to work alongside some of the most forward-thinking and hardworking people in the industry, shaping the future of AI.
- A creative, autonomous work environment that encourages innovation.
- The ability to influence long-term opportunities that expand NVIDIA's impact on the datacenter and beyond.
Are you ready to innovate and take on new challenges in AI at NVIDIA? We want to hear from you!