Senior Systems Software Engineer ā Autonomous Vehicles Infrastructure
at Nvidia
š Santa Clara, United States
USD 224,000-425,500 per year
SCRAPED
Used Tools & Technologies
Not specified
Required Skills & Competences ?
Docker @ 7 Grafana @ 3 Kubernetes @ 7 Linux @ 7 Prometheus @ 3 Python @ 6 Distributed Systems @ 7 Leadership @ 4 Communication @ 7 Product Management @ 4 Technical Leadership @ 4Details
NVIDIA's AV Infrastructure team is building scalable, stable infrastructure to support autonomous vehicle software builds, large-scale simulation testing, and real-time observability. This role focuses on architecting distributed infrastructure solutions, improving developer tooling and automation, designing monitoring/metrics, and partnering with cross-functional AI, software, hardware, safety, and product teams. The position expects technical leadership and ownership of proof-of-concept projects and architectural decisions.
Responsibilities
- Architect, design, and implement distributed infrastructure solutions to support AV software builds, large-scale simulation testing, and real-time observability.
- Innovate developer tooling and automation frameworks to mitigate integration challenges, prevent regressions, and uphold quality standards.
- Design comprehensive metrics to monitor system health, workload quality, and resource utilization across complex compute and storage environments.
- Collaborate with multi-functional AI and engineering teams to translate requirements into scalable platforms.
- Serve as a technical leader driving architecture decisions and guiding project execution across the platform.
- Partner with software, hardware, safety, and product management teams to produce actionable architecture and design documentation.
- Lead proof-of-concept projects, write technical proposals, and champion novel infrastructure solutions for complex problems.
Requirements
- BS, MS, or equivalent experience in Computer Science, Computer Architecture, Electrical Engineering, or a related field.
- 12+ years designing and building distributed systems with a strong foundation in Linux systems and infrastructure engineering.
- Proficiency in Python.
- Deep experience with container orchestration and cloud/on-prem environments (Kubernetes, Docker, VMs).
- Familiarity with monitoring, logging, and alerting tools such as Grafana, Prometheus, and the ELK stack.
- Proven ability to debug complex systems spanning infrastructure, build processes, and workload execution.
- Strong communication and collaboration skills; demonstrated history of partnering with engineering teams and business stakeholders.
- Problem-solving mentality with drive to own solutions end-to-end and thrive in a fast-paced environment.
- Strong interpersonal skills, capable of leading across geographies and organizational boundaries.
Ways to stand out
- Demonstrated technical leadership and architectural impact on large-scale distributed systems in production.
- Experience building developer tools and automation pipelines that improve build reliability and reduce integration times at scale.
- Proven ability to lead proof-of-concept projects and write compelling technical proposals.
- Experience in autonomous vehicle or related real-time, safety-critical systems is a plus.
Compensation & Benefits
- Base salary ranges (location- and level-dependent):
- Level 5: 224,000 USD - 356,500 USD
- Level 6: 272,000 USD - 425,500 USD
- Eligible for equity and company benefits.
Additional info
- Location: Santa Clara, CA (US).
- Employment type: Full time.
- Application window: Applications accepted at least until September 14, 2025.
- NVIDIA is an equal opportunity employer committed to fostering a diverse work environment.