Senior ML Software Engineer

at Nvidia
USD 184,000-356,500 per year
SENIOR
βœ… On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Python @ 7 Spark @ 4 Algorithms @ 4 Machine Learning @ 4 PyTorch @ 6 GPU @ 3

Details

At NVIDIA, the Cosmos generative AI engineering team pushes the boundaries across multimodal learning, video generation, synthetic data, intelligent simulation, and agentic systems. The team seeks experienced engineers and applied scientists with deep experience in generative modeling to help define the next era of AI computing.

Responsibilities

  • Design, post-train, and optimize foundation models (e.g., LLMs, diffusion video models, VLMs, VLAs) for real-world applications.
  • Contribute to large-scale training infrastructure, high-efficiency inference pipelines, and scalable data pipelines.
  • Work with research, software, and product teams to bring models from idea to deployment.
  • Collaborate on open-source and internal projects; author technical papers or patents; mentor junior engineers.
  • Prototype and iterate rapidly on experiments across agentic systems, reinforcement learning, reasoning, and video generation.
  • Design and implement model distillation algorithms for size reduction and diffusion step optimization.
  • Profile and benchmark training and inference pipelines to meet production-ready performance requirements.

Requirements

  • Minimum 8 years industry or 5+ years research/postdoc experience building and deploying generative AI systems.
  • Proficiency in PyTorch, JAX, or other deep learning frameworks.
  • Expertise in one or more: LLMs, coding agents, diffusion models, autoregressive models, VAE/GAN architectures, retrieval-augmented generation, neural rendering, or multi-agent systems.
  • Intimate familiarity with transformer architectures and variants of attention mechanisms.
  • Hands-on experience with large-scale training techniques and distributed training (e.g., ZeRO, DDP, FSDP, TP, CP) and data processing frameworks (e.g., Ray, Spark).
  • Strong production-quality software engineering skills; the team primarily uses Python and open sources product code.
  • MS or PhD or equivalent experience in Computer Science, Machine Learning, Applied Math, Physics, or a related field.

Ways to stand out

  • Familiarity with high-performance computing and GPU acceleration.
  • Contributions to influential open-source libraries or top-tier conference publications (NeurIPS, ICML, CVPR, ICLR).
  • Experience with multimodal data (vision-language, VLA, audio).
  • Prior work with NVIDIA GPU-based compute clusters or simulation environments.

Compensation & Benefits

  • Base salary ranges by level:
    • Level 4: 184,000 USD - 287,500 USD
    • Level 5: 224,000 USD - 356,500 USD
  • Eligible for equity and additional benefits (see NVIDIA benefits page).

Other details

  • Location: Santa Clara, CA, United States.
  • Employment type: Full time.
  • Applications accepted at least until July 29, 2025.
  • NVIDIA is an equal opportunity employer committed to diversity and inclusion.