Distinguished Artificial Intelligence Algorithms Engineer

at Nvidia
USD 308,000-471,500 per year
MIDDLE
✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Python @ 5 Algorithms @ 3 Debugging @ 5 API @ 3 LLM @ 3 PyTorch @ 3 GPU @ 3

Details

NVIDIA is looking for a Distinguished engineer for our core AI Frameworks (Megatron Core and NeMo Framework) team to design, develop and optimize diverse real world workloads. Megatron Core and NeMo Framework are open-source, scalable and cloud-native frameworks built for researchers and developers working on Large Language Models (LLM) and Multimodal (MM) foundation model pretraining and post-training. Our GenAI Frameworks provide end-to-end model training, including pretraining, reasoning, alignment, customization, evaluation, deployment and tooling to optimize performance and user experience.

Responsibilities

  • Expand Megatron Core and NeMo Framework's capabilities to enable users to develop, train, and optimize models.
  • Design and implement distributed training algorithms, model parallel paradigms, and model optimizations.
  • Define robust APIs and expand toolkits and libraries to be more comprehensive and coherent.
  • Meticulously analyze and tune performance across the software stack.
  • Collaborate with internal partners, users, and open source community members to analyze, design, and implement highly optimized solutions.
  • Solve large-scale, end-to-end AI training and inference challenges spanning orchestration, data pre-processing, model training and tuning, and deployment.
  • Work at the intersection of computer architecture, libraries, frameworks, AI applications and the entire software stack.
  • Research, prototype, and develop robust and scalable AI tools and pipelines.

Requirements

  • MS, PhD or equivalent experience in Computer Science, AI, Applied Math, or related fields and 17+ years of industry experience.
  • Experience with AI frameworks (for example PyTorch, JAX) and/or inference and deployment environments (for example TRTLLM, vLLM, SGLang).
  • Proficient in Python programming, software design, debugging, performance analysis, test design and documentation.
  • Consistent record of working effectively across multiple engineering initiatives and improving AI libraries with new innovations.
  • Strong understanding of AI/Deep-Learning fundamentals and their practical applications.

Ways to stand out

  • Hands-on experience in large-scale AI training with deep understanding of compute system concepts (latency/throughput bottlenecks, pipelining, multiprocessing) and demonstrated excellence in performance analysis and tuning.
  • Expertise in distributed computing, model parallelism, and mixed precision training.
  • Prior experience with Generative AI techniques applied to LLM and Multi-Modal learning (text, image, video).
  • Knowledge of GPU/CPU architecture and related numerical software.
  • Contributions to open source deep learning frameworks.

Compensation & Benefits

  • Base salary range: 308,000 USD - 471,500 USD (will be determined based on location, experience, and pay of employees in similar positions).
  • Eligible for equity and benefits.

Other details

  • Applications for this job will be accepted at least until December 16, 2025.
  • NVIDIA is an equal opportunity employer and is committed to fostering a diverse work environment.