Senior Software Engineer - VLM Microservices For Neural Reconstruction

at Nvidia

📍 Santa Clara, United States

USD 152,000-287,500 per year

SENIOR

✅ On-site

Used Tools & Technologies

Machine Learning

Required Skills & Competences
Tag name is followed by "@" symbol and proficiency level value. About proficiency levels:

1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;

3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;

7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;

10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.

Security @ 4 Docker @ 4 Kubernetes @ 4 Python @ 6 CI/CD @ 7 Distributed Systems @ 4 Communication @ 4 gRPC @ 4 Helm @ 4 Microservices @ 4 API @ 4 LLM @ 4 CUDA @ 4 AI @ 4 Computer Vision @ 7 vLLM @ 4 SGLang @ 4

Details

NVIDIA's team builds the Omniverse NuRec SDK to enable robotic, healthcare, and AV developers to build better models faster with closed-loop validation and closed-loop training grounded in real-world scenarios. This role focuses on bringing Vision Language Models (VLMs) and 3D reconstruction models into NVIDIA's neural graphics software ecosystem, improving robustness, accuracy, and integration between real-world data and simulations.

Responsibilities

Design, build, and optimize containerized inference execution for the latest 3D VLMs, turning research work into production-grade, highly optimized software (NIMs, NVIDIA Inference Microservices).
Develop benchmarks to validate model accuracy and performance (latency, throughput, scalability).
Release and maintain models and their pipelines throughout their lifecycle (bug fixes, security patches).
Contribute VLM-related features to open-source projects such as vLLM.
Collaborate closely with Research and Product teams and influence shared roadmaps.

Requirements

Master's of Science in Computer Science + 3 years, or Electrical Engineering, Bachelor of Science (or equivalent experience) + 5 years of experience.
History of building, validating, and releasing production-grade AI distributed systems, backend services, microservices, and cloud technologies.
Deep technical expertise in distributed applications using Docker, Kubernetes, endpoints and their APIs (REST, gRPC), and Helm.
Hands-on experience with modern inference platforms (vLLM, SGLang, Torch, TRT, TRT-LLM).
Proficiency with Python and C++.
Strong software engineering fundamentals (source control, CI/CD, testing/validation, packaging, containerization).
Excellent written, visual, and verbal communication skills.
Curiosity and drive to learn new technologies and partner across teams and functions.

Ways To Stand Out

Track record contributing to open-source or production-grade software (for example, vLLM contributions).
Experience with ML model engineering: training, fine-tuning, distillation, quantization.
Experience with low-level optimization of ML models (CUDA kernels).
Strong fundamentals in 3D graphics, 3D computer vision, or neural reconstruction (NeRFs, Gaussian Splats).
History of multidisciplinary creativity and innovation around software engineering in multiple problem domains.

Compensation & Benefits

Base salary ranges provided by level: 152,000 USD - 241,500 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.
Eligible for equity and benefits.

Application Details

Applications for this job will be accepted at least until February 16, 2026.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.

Equal Opportunity

NVIDIA is committed to fostering a diverse work environment and is proud to be an equal opportunity employer. The company does not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or other characteristics protected by law.