Senior Generative AI Research Engineer

at NVIDIA
USD 224,000-425,500 per year
SENIOR
✅ On-site

Required Skills & Competences

Software Development @ 7, Python @ 7, Spark @ 4, Algorithms @ 4, Machine Learning @ 4, PyTorch @ 6, GPU @ 3

Details

At NVIDIA, the Cosmos generative AI engineering team works across multimodal learning, video generation, synthetic data, intelligent simulation, and agentic systems to push the boundaries of generative modeling. The role focuses on designing, post-training, and deploying foundation models and building scalable training and inference infrastructure.

Responsibilities

  • Design and post-train foundation models (LLMs, VLMs, VLAs, DiTs) for real-world applications.
  • Contribute to large-scale training infrastructure, high-efficiency inference pipelines, and scalable data pipelines.
  • Collaborate with research, software, and product teams to bring models from idea to deployment.
  • Contribute to open-source and internal projects, author technical papers or patents, and mentor junior engineers.
  • Prototype and iterate rapidly on experiments across agentic systems, reinforcement learning, reasoning, and video generation.
  • Design and implement model distillation algorithms for size reduction and diffusion step optimization.
  • Profile and benchmark training and inference pipelines to meet production-ready performance requirements.

Requirements

  • Minimum of 8 years of industry experience, or 5+ years of research/postdoc experience, building and deploying generative AI systems; the description also references 12+ years of relevant software development experience.
  • Proficiency in PyTorch, JAX, or other deep learning frameworks.
  • Expertise in one or more of: LLMs, coding agents, diffusion models, autoregressive models, VAE/GAN architectures, retrieval-augmented generation, neural rendering, or multi-agent systems.
  • Deep familiarity with transformer architectures and variants of attention mechanisms.
  • Hands-on experience with large-scale training techniques and frameworks (examples given: ZeRO, DDP, FSDP, TP, CP) and data processing frameworks (examples given: Ray, Spark).
  • Strong production-quality Python software engineering skills.
  • MS or PhD (or equivalent experience) in Computer Science, Machine Learning, Applied Math, Physics, or related field.

Ways to stand out

  • Familiarity with high-performance computing and GPU acceleration.
  • Contributions to influential open-source libraries or publications at major conferences (NeurIPS, ICML, CVPR, ICLR).
  • Experience working with multimodal data (vision-language, VLA, audio).
  • Prior work with NVIDIA GPU-based compute clusters or simulation environments.

Compensation & Benefits

  • Base salary range (Level 5): 224,000 USD - 356,500 USD.
  • Base salary range (Level 6): 272,000 USD - 425,500 USD.
  • You will also be eligible for equity and benefits.

Other details

  • Location: Santa Clara, CA, US.
  • Employment type: Full time.
  • Applications accepted at least until October 19, 2025.
  • NVIDIA is an equal opportunity employer committed to a diverse work environment.