Used Tools & Technologies
Not specified
Required Skills & Competences ?
Software Development @ 3 Distributed Systems @ 6 Communication @ 3 LLM @ 3 GPU @ 3Details
We are seeking a Software Engineering Manager to lead the development for the Dynamo engineering team, NVIDIA’s high-performance, low-latency inference platform for serving generative AI and reasoning workloads at scale. The team accelerates deployment of cutting-edge models across diverse engines and architectures, enabling breakthroughs from real-time LLM serving to complex multi-GPU, multi-node pipelines. The ideal candidate is strong in software development, designing and creating fault-tolerant distributed systems, and has the ability to implement well thought out long term maintenance strategy.
Responsibilities
- Mentor, grow, and develop the Dynamo engineering team and be responsible for planning and execution of projects and workflows.
- Work across several teams and organizations to build platforms that use the latest developments in LLM inferencing; collaborate with research and development teams and serve a large user base (software teams both internal and external to NVIDIA).
- Align priorities across collaborators and define metrics for measuring the success of the product and team.
- Stay updated with the latest trends in AI, ML, and infrastructure; proactively seek opportunities to integrate advancements into NVIDIA's LLM and AI infrastructure solutions.
Requirements
- Masters or PhD or equivalent experience in Computer Science, computer architecture, or related field.
- 10+ years of overall experience in developing large distributed systems.
- 2+ years of experience managing AI and software development teams.
- Experience in developing and maintaining LLM or GenAI infrastructure.
- Hands-on experience developing large-scale distributed systems, including fault-tolerant designs.
- Excellent communication, collaboration, and problem-solving skills, with a dedication to encouraging an inclusive and diverse workplace.
Ways to Stand Out
- Strong technical background in cloud and distributed systems.
- Experience working in a globally distributed organization.
- Good knowledge of CPU and/or GPU hardware architecture.
- Background in developing LLM inference systems.
- Experience with LLM frameworks like vLLM and TRT-LLM.
Benefits & Additional Details
- Employment type: Full time.
- Office policy: Hybrid (#LI-Hybrid).
- Location: Santa Clara, CA, United States.
- Base salary range (determined by location and experience): 224,000 USD - 356,500 USD.
- You will also be eligible for equity and benefits (see NVIDIA benefits page).
- Applications accepted at least until September 2, 2025.
- NVIDIA is an equal opportunity employer and is committed to fostering a diverse work environment.