Distinguished Artificial Intelligence Algorithms Engineer

at Nvidia

📍 Santa Clara, United States

USD 308,000-471,500 per year

MIDDLE

✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ^?

Python @ 5 Algorithms @ 3 Debugging @ 5 API @ 3 LLM @ 3 PyTorch @ 3 GPU @ 3

Details

NVIDIA is looking for a Distinguished engineer for our core AI Frameworks (Megatron Core and NeMo Framework) team to design, develop and optimize diverse real world workloads. Megatron Core and NeMo Framework are open-source, scalable and cloud-native frameworks built for researchers and developers working on Large Language Models (LLM) and Multimodal (MM) foundation model pretraining and post-training. Our GenAI Frameworks provide end-to-end model training, including pretraining, reasoning, alignment, customization, evaluation, deployment and tooling to optimize performance and user experience.

Responsibilities

Expand Megatron Core and NeMo Framework's capabilities to enable users to develop, train, and optimize models.
Design and implement distributed training algorithms, model parallel paradigms, and model optimizations.
Define robust APIs and expand toolkits and libraries to be more comprehensive and coherent.
Meticulously analyze and tune performance across the software stack.
Collaborate with internal partners, users, and open source community members to analyze, design, and implement highly optimized solutions.
Solve large-scale, end-to-end AI training and inference challenges spanning orchestration, data pre-processing, model training and tuning, and deployment.
Work at the intersection of computer architecture, libraries, frameworks, AI applications and the entire software stack.
Research, prototype, and develop robust and scalable AI tools and pipelines.

Requirements

MS, PhD or equivalent experience in Computer Science, AI, Applied Math, or related fields and 17+ years of industry experience.
Experience with AI frameworks (for example PyTorch, JAX) and/or inference and deployment environments (for example TRTLLM, vLLM, SGLang).
Proficient in Python programming, software design, debugging, performance analysis, test design and documentation.
Consistent record of working effectively across multiple engineering initiatives and improving AI libraries with new innovations.
Strong understanding of AI/Deep-Learning fundamentals and their practical applications.

Ways to stand out

Hands-on experience in large-scale AI training with deep understanding of compute system concepts (latency/throughput bottlenecks, pipelining, multiprocessing) and demonstrated excellence in performance analysis and tuning.
Expertise in distributed computing, model parallelism, and mixed precision training.
Prior experience with Generative AI techniques applied to LLM and Multi-Modal learning (text, image, video).
Knowledge of GPU/CPU architecture and related numerical software.
Contributions to open source deep learning frameworks.

Compensation & Benefits

Base salary range: 308,000 USD - 471,500 USD (will be determined based on location, experience, and pay of employees in similar positions).
Eligible for equity and benefits.

Other details

Applications for this job will be accepted at least until December 16, 2025.
NVIDIA is an equal opportunity employer and is committed to fostering a diverse work environment.