Senior Generative AI Research Engineer

at Nvidia
📍 World
📍 Canada
📍 United States
USD 224,000-425,500 per year
SENIOR
✅ Remote

Used Tools & Technologies

Not specified

Required Skills & Competences

  • Software Development @ 7
  • Python @ 7
  • Spark @ 4
  • Algorithms @ 4
  • Machine Learning @ 4
  • PyTorch @ 6
  • GPU @ 3

Details

At NVIDIA, the Cosmos generative AI engineering team is advancing multimodal learning, video generation, synthetic data, intelligent simulation, and agentic systems. The team seeks experienced engineers and applied scientists with deep generative modeling expertise to drive the next era of AI computing.

Responsibilities

  • Design, post-train, and optimize foundation models (e.g., LLMs, diffusion video models, VLMs, VLAs) for real-world applications.
  • Contribute to collaborative development of large-scale training infrastructure, high-efficiency inference pipelines, and scalable data pipelines.
  • Work with research, software, and product teams to bring world models from idea to deployment.
  • Collaborate on open-source and internal projects; author technical papers or patents; mentor junior engineers.
  • Prototype and iterate rapidly on experiments across agentic systems, reinforcement learning, reasoning, and video generation.
  • Design and implement model distillation algorithms for size reduction and diffusion step optimization.
  • Profile and benchmark training and inference pipelines to meet production-ready performance requirements.

Requirements

  • Minimum of 8 years of industry experience, or 5+ years of research/postdoc experience, building and deploying generative AI systems (note: the posting also lists 12+ years of relevant software development experience).
  • Proficiency in PyTorch, JAX, or other deep learning frameworks is required.
  • Expertise in one or more of: LLMs, coding agents, diffusion models, autoregressive models, VAE/GAN architectures, retrieval-augmented generation, neural rendering, or multi-agent systems.
  • Deep familiarity with transformer architectures and variants of attention mechanisms.
  • Hands-on experience with large-scale training strategies (e.g., ZeRO, DDP, FSDP, TP, CP) and data processing tools (e.g., Ray, Spark).
  • Strong production-quality software engineering skills in Python; the team open-sources its products.
  • MS, PhD, or equivalent experience in Computer Science, Machine Learning, Applied Math, Physics, or a related field.
  • 12+ years of relevant software development experience (as listed in the original posting).

Ways to Stand Out

  • Familiarity with high-performance computing and GPU acceleration.
  • Contributions to influential open-source libraries or top conference publications (NeurIPS, ICML, CVPR, ICLR).
  • Experience with multimodal data (vision-language, VLA, audio).
  • Prior work with NVIDIA GPU-based compute clusters or simulation environments.

Compensation & Benefits

  • Base salary ranges (determined by location, experience, and comparable pay):
    • Level 5: 224,000 USD - 356,500 USD
    • Level 6: 272,000 USD - 425,500 USD
  • Eligibility for equity and company benefits (link to NVIDIA benefits provided in original posting).

Other

  • Applications accepted at least until July 29, 2025.
  • NVIDIA is an equal opportunity employer committed to diversity and non-discrimination.