Senior Software Engineer, Compute Infrastructure for Robotics Research
at Nvidia
📍 Santa Clara, United States
USD 148,000-287,500 per year
SCRAPED
Used Tools & Technologies
Not specified
Required Skills & Competences ?
Kubernetes @ 7 Python @ 7 SQL @ 4 Spark @ 4 MLOps @ 6 TensorFlow @ 4 PyTorch @ 4 CUDA @ 4 GPU @ 4Details
We are seeking a Senior Software Engineer to join a new team building the foundational infrastructure for Robotics Research. This team will work closely with NVIDIA’s Generalist Embodied Agent Research (GEAR) group. The near-term focus is Project GR00T, NVIDIA’s initiative to build foundation models and full-stack technology for humanoid robots, and this position focuses on compute infrastructure.
Responsibilities
- Develop mechanisms to launch and manage large compute jobs to support multi-modal foundation models for robotics, including data jobs, training jobs, evaluation jobs, and more.
- Optimize GPU and cluster utilization for efficient model training, fine-tuning, and evaluation on massive datasets.
- Develop robust observability tools and procedures for the compute infrastructure to ensure reliability and performance.
- Collaborate with researchers to integrate innovative compute technologies into scalable training and evaluation pipelines.
Requirements
- Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent experience.
- 5+ years of full-time industry experience in large-scale MLOps and AI infrastructure.
- Experience with ML frameworks such as PyTorch, JAX, or TensorFlow.
- Deep understanding of Kubernetes and experience with Ray.
- Experience with data frameworks and standards like SQL, Apache Spark, and LanceDB.
- Experience with GPU acceleration and CUDA programming.
- Strong programming skills in Python and a high-performance language such as C++ for efficient system development.
Ways to stand out
- Master’s or PhD in Computer Science, Robotics, Engineering, or a related field.
- Demonstrated Tech Lead experience, coordinating engineering teams and driving projects from conception to deployment.
- Deep background building and operating large-scale data infrastructure.
- Strong experience and curiosity in frontier AI research.
Compensation & Other Details
- Base salary range (Level 3): 148,000 USD - 235,750 USD.
- Base salary range (Level 4): 184,000 USD - 287,500 USD.
- You will also be eligible for equity and benefits (link to NVIDIA benefits referenced in original posting).
- Applications for this job will be accepted at least until October 12, 2025.
About the Team & Company
You will work with a collaborative research team producing work on multimodal foundation models, large-scale robot learning, embodied AI, and physics simulation. Contributions will impact research projects and product roadmaps. NVIDIA is an equal opportunity employer committed to fostering a diverse work environment.