Manager, Software Engineering - Dynamo

at Nvidia
USD 224,000-356,500 per year
MIDDLE
✅ Hybrid

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Software Development @ 3 Distributed Systems @ 6 Communication @ 3 LLM @ 3 GPU @ 3

Details

We are seeking a Software Engineering Manager to lead the development for the Dynamo engineering team, NVIDIA’s high-performance, low-latency inference platform for serving generative AI and reasoning workloads at scale. The team accelerates deployment of cutting-edge models across diverse engines and architectures, enabling breakthroughs from real-time LLM serving to complex multi-GPU, multi-node pipelines. The ideal candidate is strong in software development, designing and creating fault-tolerant distributed systems, and has the ability to implement well thought out long term maintenance strategy.

Responsibilities

  • Mentor, grow, and develop the Dynamo engineering team and be responsible for planning and execution of projects and workflows.
  • Work across several teams and organizations to build platforms that use the latest developments in LLM inferencing; collaborate with research and development teams and serve a large user base (software teams both internal and external to NVIDIA).
  • Align priorities across collaborators and define metrics for measuring the success of the product and team.
  • Stay updated with the latest trends in AI, ML, and infrastructure; proactively seek opportunities to integrate advancements into NVIDIA's LLM and AI infrastructure solutions.

Requirements

  • Masters or PhD or equivalent experience in Computer Science, computer architecture, or related field.
  • 10+ years of overall experience in developing large distributed systems.
  • 2+ years of experience managing AI and software development teams.
  • Experience in developing and maintaining LLM or GenAI infrastructure.
  • Hands-on experience developing large-scale distributed systems, including fault-tolerant designs.
  • Excellent communication, collaboration, and problem-solving skills, with a dedication to encouraging an inclusive and diverse workplace.

Ways to Stand Out

  • Strong technical background in cloud and distributed systems.
  • Experience working in a globally distributed organization.
  • Good knowledge of CPU and/or GPU hardware architecture.
  • Background in developing LLM inference systems.
  • Experience with LLM frameworks like vLLM and TRT-LLM.

Benefits & Additional Details

  • Employment type: Full time.
  • Office policy: Hybrid (#LI-Hybrid).
  • Location: Santa Clara, CA, United States.
  • Base salary range (determined by location and experience): 224,000 USD - 356,500 USD.
  • You will also be eligible for equity and benefits (see NVIDIA benefits page).
  • Applications accepted at least until September 2, 2025.
  • NVIDIA is an equal opportunity employer and is committed to fostering a diverse work environment.