Distinguished Engineer - High Performance AI

at NVIDIA
USD 320,000-488,750 per year
SENIOR
✅ On-site

Used Tools & Technologies

Not specified

Required Skills & Competences

Python @ 7, Leadership @ 4, Mentoring @ 4, Performance Optimization @ 4, API @ 4, Technical Leadership @ 4, CUDA @ 4, GPU @ 4

Details

NVIDIA is building groundbreaking agentic AI systems for the CUDA ecosystem. The team develops full-stack agentic AI platforms spanning multi-agent runtimes and orchestration, data and evaluation pipelines, training and inference stacks, and GPU-accelerated execution, all deeply integrated with NVIDIA's software and hardware stack to advance accelerated computing end to end. As a senior technical leader, you will define technical direction and drive execution across the stack, collaborating closely with internal NVIDIA software and hardware teams to translate research advances into production capabilities and products.

Responsibilities

  • Set strategy and lead execution for agentic AI systems for the CUDA ecosystem; define roadmaps and measurable success metrics (performance, quality, reliability, developer productivity).
  • Co-design agentic system solutions with software, hardware, and algorithm teams; influence and adopt new capabilities as they become available.
  • Develop reproducible, high-fidelity evaluation frameworks covering performance, quality, and developer productivity.
  • Collaborate across the AI stack (hardware, compilers/toolchains, kernels/libraries, frameworks, distributed training, and inference/serving) and with model and research/engineering teams to help drive architecture and key technical decisions.
  • Scale impact through leadership: mentor and grow senior technical talent and lead large cross-team efforts from concept through production.

Requirements

  • Bachelor's degree in Computer Science, Electrical Engineering, or a related field (or equivalent experience); MS or PhD preferred.
  • 17+ years of industry and/or academic experience in AI systems development.
  • Strong exposure to building foundational models, agents, or orchestration frameworks; hands-on experience with deep learning frameworks and modern inference stacks.
  • Strong C/C++ and Python programming skills and solid software engineering fundamentals; ability to set engineering standards and review architecture at scale.
  • Experience with GPU programming and performance optimization (CUDA or equivalent).
  • Proven track record leading large, cross-team efforts from concept through production, including navigating ambiguity, aligning stakeholders, and delivering measurable outcomes.

Ways to Stand Out

  • Track record building/evaluating deep learning models, coding agents and developer tooling, and driving broad adoption across teams or customers.
  • Demonstrated ability to optimize and deploy high-performance models, including on resource-constrained platforms. Deep expertise in GPU performance optimization, evidenced by benchmark wins or published results.
  • Publications or open-source leadership in deep learning, multi-agent systems, reinforcement learning, or AI systems; contributions to widely used repositories or standards.
  • Experience leading projects end-to-end and mentoring small teams; recognized technical leadership (setting platform direction, creating widely used architectures/APIs, or establishing evaluation/benchmarking standards).

Compensation & Benefits

  • Base salary range: 320,000 USD - 488,750 USD (base salary will be determined based on your location, experience, and pay of employees in similar positions).
  • Eligible for equity and additional benefits (see NVIDIA benefits).

Additional Information

  • Location: Santa Clara, CA, United States (on-site role unless otherwise arranged by NVIDIA).
  • Applications accepted at least until January 19, 2026.
  • NVIDIA uses AI tools in its recruiting processes and is an equal opportunity employer committed to diversity and inclusion.