Model-as-a-Service Tech Lead

at Nvidia

📍 Santa Clara, United States

USD 248,000-391,000 per year

SENIOR

✅ On-site

SCRAPED

Used Tools & Technologies

Required Skills & Competences ^?

ElasticSearch @ 4 Jenkins @ 4 Kubernetes @ 4 TypeScript @ 6 Python @ 6 SQL @ 7 CI/CD @ 4 Distributed Systems @ 4 Machine Learning @ 4 AWS @ 7 gRPC @ 6 Helm @ 4 Performance Optimization @ 4 Microservices @ 6 API @ 4 System Architecture @ 4 GPU @ 4

Details

NVIDIA is seeking a highly qualified, hands-on Tech Lead to drive the technical vision, architecture, and implementation of a scalable, web-based platform that enables users to configure autonomous driving scenarios and generate synthetic data at scale for model training. This is a hands-on engineering role (not a people-management role); the successful candidate will spend a large portion of time writing production code across the full stack, owning code quality, and designing the platform for portability across the NVIDIA ecosystem.

Responsibilities

Serve as the primary, high-impact contributor on complex features; produce production code across the full stack including UI, APIs, services, and infrastructure.
Lead the code review process and set/implement coding standards, performance benchmarks, and architectural integrity to ensure high-quality, maintainable code.
Define and own long-term technical roadmap, architecture, and design, ensuring deployment pipelines and services are platform-agnostic and not tied to internal proprietary infrastructure.
Lead implementation of web services and efficient batch processing queues to integrate and operationalize foundation models into the customer-facing platform.
Implement standards for production-grade performance, monitoring, and fault tolerance; proactively identify and resolve technical debt and scalability bottlenecks.
Own CI/CD pipelines, container orchestration strategy (Kubernetes/Helm), and operational readiness to ensure scalability and reliability in production.
Mentor and guide the engineering team on full-stack development, distributed systems design, performance optimization, and portable code architecture.
Act as technical liaison translating requirements from Product Managers, ML Engineers, and Data Scientists into robust, portable designs.

Requirements

Minimum 12+ years of hands-on experience developing and deploying scalable full-stack web services in a cloud environment.
Proven Tech Lead or equivalent Senior/Staff level experience with demonstrated ability to define system architecture, mentor engineers, and take end-to-end technical ownership while remaining deeply active in coding and code reviews.
Expert-level proficiency in designing and scaling distributed microservices architectures using gRPC and REST APIs.
Deep expertise in modern frontend frameworks and building highly responsive, data-intensive UIs capable of managing high-frequency data flows.
Direct experience designing and deploying containerized applications that use a GPU (e.g., NVIDIA Container Toolkit).
Experience with MaaS (Model-as-a-Service) patterns and serving large machine learning models as high-throughput endpoints.
Mastery of container orchestration, including Kubernetes and Helm for portable, multi-service production deployments.
Proficiency in backend languages such as Python and/or Go, and TypeScript for frontend development.
Strong practical experience with Cloud Infrastructure (AWS S3) and running complex data storage/access patterns (SQL, key-value stores).
Expertise in CI/CD practices (GitLab, Jenkins) with focus on automation, testing, and deployment velocity/stability.
Bachelor’s degree (B.S.) or equivalent experience in Computer Science, Software Engineering, Electrical Engineering, or a closely related technical field; Master’s degree (M.S.) preferred.

Ways to stand out from the crowd

Experience with data querying platforms such as Apache Druid, ClickHouse, or Elasticsearch.
Familiarity with autonomous vehicle simulation environments (e.g., Carla) and synthetic data generation pipelines using foundation models.

Benefits & Compensation

The base salary range is 248,000 USD - 391,000 USD.
You will also be eligible for equity and benefits. See: https://www.nvidiabenefits.com/

Additional information

Location: Santa Clara, CA, United States.
Employment type: Full time.
Applications for this job will be accepted at least until November 21, 2025.
NVIDIA is an equal opportunity employer and describes a comprehensive benefits package in the posting.