Used Tools & Technologies
Not specified
Required Skills & Competences ?
Software Development @ 4 Python @ 6 Machine Learning @ 4 Communication @ 4 LLM @ 4 PyTorch @ 6 CUDA @ 4 GPU @ 7Details
We are now looking for an AI Deep Learning Engineer! NVIDIA is seeking world-class researchers and engineers to join our deep learning team focused on building next-generation AI systems for coding. You will work at the cutting edge of large language model (LLM) development and contribute to improving AI's capabilities in understanding, generating, and interacting with code. Your work will directly impact how developers build software with the help of AI.
This role offers the unique opportunity to shape the future of programming by exploring innovations in post-training methods, agent-based workflows, and data synthesis strategies for coding LLMs.
Responsibilities
- Develop intelligent coding agents capable of planning, tool use, and multi-step reasoning for real-world coding workflows.
- Apply supervised fine-tuning and reinforcement learning to improve LLM performance on programming tasks.
- Generate and curate high-quality synthetic datasets (code, natural language, compiler traces, interaction logs) for LLM training and evaluation.
- Collaborate with cross-functional teams including CUDA teams, developer tools engineers, and foundational model researchers.
- Benchmark models on a range of code generation tasks and measure improvements.
Requirements
- M.S. or Ph.D. in Computer Science, Machine Learning, or related field (or equivalent experience).
- 5+ years of experience in deep learning, with a focus on LLMs or generative models.
- Strong knowledge in one or more of the following areas:
- Post-training techniques for LLMs (e.g., reinforcement learning, instruction tuning, alignment).
- Autonomous agents or tool-augmented LLMs.
- Synthetic data generation and scalable data pipelines for machine learning.
- Proficiency in Python and C++ and experience with deep learning frameworks such as PyTorch.
- Ability to work independently and drive projects end-to-end in a research or applied setting.
- Excellent communication and interpersonal skills to articulate complex technical concepts and collaborate effectively with cross-functional teams.
Ways to stand out from the crowd
- Proven contributions in developing AI systems for software, hardware, or large-scale computing environments.
- Expertise in C++, CUDA, or GPU programming, with a deep understanding of chip design and computer architecture.
- Experience with GenAI and cutting-edge LLM technologies, and familiarity with the intersection of AI and hardware design.
NVIDIA is a leader in AI and GPU-accelerated computing, with a rapidly growing footprint in the datacenter business. In this role, you will be at the heart of leveraging data-driven approaches to transform hardware design and system software development. Our team collaborates across NVIDIA, engaging with deep learning research teams, CUDA Kernel developers, and Silicon Architecture groups.
Benefits
- The opportunity to work alongside some of the most forward-thinking and hardworking people in the industry, shaping the future of AI.
- A creative, autonomous work environment that encourages innovation.
- The ability to influence long-term opportunities that expand NVIDIA's impact on the datacenter and beyond.
- Eligible for equity and company benefits.
Compensation details: Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.
Applications for this job will be accepted at least until July 29, 2025.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.