Senior Systems Software Engineer – Autonomous Vehicles Infrastructure

at Nvidia
USD 224,000-425,500 per year
SENIOR
āœ… On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Docker @ 7 Grafana @ 3 Kubernetes @ 7 Linux @ 7 Prometheus @ 3 Python @ 6 Distributed Systems @ 7 Leadership @ 4 Communication @ 7 Product Management @ 4 Technical Leadership @ 4

Details

NVIDIA's AV Infrastructure team is building scalable, stable infrastructure to support autonomous vehicle software builds, large-scale simulation testing, and real-time observability. This role focuses on architecting distributed infrastructure solutions, improving developer tooling and automation, designing monitoring/metrics, and partnering with cross-functional AI, software, hardware, safety, and product teams. The position expects technical leadership and ownership of proof-of-concept projects and architectural decisions.

Responsibilities

  • Architect, design, and implement distributed infrastructure solutions to support AV software builds, large-scale simulation testing, and real-time observability.
  • Innovate developer tooling and automation frameworks to mitigate integration challenges, prevent regressions, and uphold quality standards.
  • Design comprehensive metrics to monitor system health, workload quality, and resource utilization across complex compute and storage environments.
  • Collaborate with multi-functional AI and engineering teams to translate requirements into scalable platforms.
  • Serve as a technical leader driving architecture decisions and guiding project execution across the platform.
  • Partner with software, hardware, safety, and product management teams to produce actionable architecture and design documentation.
  • Lead proof-of-concept projects, write technical proposals, and champion novel infrastructure solutions for complex problems.

Requirements

  • BS, MS, or equivalent experience in Computer Science, Computer Architecture, Electrical Engineering, or a related field.
  • 12+ years designing and building distributed systems with a strong foundation in Linux systems and infrastructure engineering.
  • Proficiency in Python.
  • Deep experience with container orchestration and cloud/on-prem environments (Kubernetes, Docker, VMs).
  • Familiarity with monitoring, logging, and alerting tools such as Grafana, Prometheus, and the ELK stack.
  • Proven ability to debug complex systems spanning infrastructure, build processes, and workload execution.
  • Strong communication and collaboration skills; demonstrated history of partnering with engineering teams and business stakeholders.
  • Problem-solving mentality with drive to own solutions end-to-end and thrive in a fast-paced environment.
  • Strong interpersonal skills, capable of leading across geographies and organizational boundaries.

Ways to stand out

  • Demonstrated technical leadership and architectural impact on large-scale distributed systems in production.
  • Experience building developer tools and automation pipelines that improve build reliability and reduce integration times at scale.
  • Proven ability to lead proof-of-concept projects and write compelling technical proposals.
  • Experience in autonomous vehicle or related real-time, safety-critical systems is a plus.

Compensation & Benefits

  • Base salary ranges (location- and level-dependent):
    • Level 5: 224,000 USD - 356,500 USD
    • Level 6: 272,000 USD - 425,500 USD
  • Eligible for equity and company benefits.

Additional info

  • Location: Santa Clara, CA (US).
  • Employment type: Full time.
  • Application window: Applications accepted at least until September 14, 2025.
  • NVIDIA is an equal opportunity employer committed to fostering a diverse work environment.