Model-as-a-Service Tech Lead

at Nvidia
USD 248,000-391,000 per year
SENIOR
✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

ElasticSearch @ 4 Go @ 6 Jenkins @ 4 Kubernetes @ 4 TypeScript @ 6 Python @ 6 SQL @ 7 CI/CD @ 4 Distributed Systems @ 4 Machine Learning @ 4 AWS @ 7 gRPC @ 6 Helm @ 4 Performance Optimization @ 4 Microservices @ 6 API @ 4 System Architecture @ 4 GPU @ 4

Details

NVIDIA is seeking a highly qualified, hands-on Tech Lead to drive the technical vision, architecture, and implementation of a scalable, web-based platform that enables users to configure autonomous driving scenarios and generate synthetic data at scale for model training. This is a hands-on engineering role (not a people-management role); the successful candidate will spend a large portion of time writing production code across the full stack, owning code quality, and designing the platform for portability across the NVIDIA ecosystem.

Responsibilities

  • Serve as the primary, high-impact contributor on complex features; produce production code across the full stack including UI, APIs, services, and infrastructure.
  • Lead the code review process and set/implement coding standards, performance benchmarks, and architectural integrity to ensure high-quality, maintainable code.
  • Define and own long-term technical roadmap, architecture, and design, ensuring deployment pipelines and services are platform-agnostic and not tied to internal proprietary infrastructure.
  • Lead implementation of web services and efficient batch processing queues to integrate and operationalize foundation models into the customer-facing platform.
  • Implement standards for production-grade performance, monitoring, and fault tolerance; proactively identify and resolve technical debt and scalability bottlenecks.
  • Own CI/CD pipelines, container orchestration strategy (Kubernetes/Helm), and operational readiness to ensure scalability and reliability in production.
  • Mentor and guide the engineering team on full-stack development, distributed systems design, performance optimization, and portable code architecture.
  • Act as technical liaison translating requirements from Product Managers, ML Engineers, and Data Scientists into robust, portable designs.

Requirements

  • Minimum 12+ years of hands-on experience developing and deploying scalable full-stack web services in a cloud environment.
  • Proven Tech Lead or equivalent Senior/Staff level experience with demonstrated ability to define system architecture, mentor engineers, and take end-to-end technical ownership while remaining deeply active in coding and code reviews.
  • Expert-level proficiency in designing and scaling distributed microservices architectures using gRPC and REST APIs.
  • Deep expertise in modern frontend frameworks and building highly responsive, data-intensive UIs capable of managing high-frequency data flows.
  • Direct experience designing and deploying containerized applications that use a GPU (e.g., NVIDIA Container Toolkit).
  • Experience with MaaS (Model-as-a-Service) patterns and serving large machine learning models as high-throughput endpoints.
  • Mastery of container orchestration, including Kubernetes and Helm for portable, multi-service production deployments.
  • Proficiency in backend languages such as Python and/or Go, and TypeScript for frontend development.
  • Strong practical experience with Cloud Infrastructure (AWS S3) and running complex data storage/access patterns (SQL, key-value stores).
  • Expertise in CI/CD practices (GitLab, Jenkins) with focus on automation, testing, and deployment velocity/stability.
  • Bachelor’s degree (B.S.) or equivalent experience in Computer Science, Software Engineering, Electrical Engineering, or a closely related technical field; Master’s degree (M.S.) preferred.

Ways to stand out from the crowd

  • Experience with data querying platforms such as Apache Druid, ClickHouse, or Elasticsearch.
  • Familiarity with autonomous vehicle simulation environments (e.g., Carla) and synthetic data generation pipelines using foundation models.

Benefits & Compensation

Additional information

  • Location: Santa Clara, CA, United States.
  • Employment type: Full time.
  • Applications for this job will be accepted at least until November 21, 2025.
  • NVIDIA is an equal opportunity employer and describes a comprehensive benefits package in the posting.