Senior Deep Learning Performance Architect

at Nvidia
USD 184,000-356,500 per year
SENIOR
✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Python @ 7 Algorithms @ 4 Parallel Programming @ 4 LLM @ 7 PyTorch @ 4 CUDA @ 4 GPU @ 4

Details

We are seeking a Senior Deep Learning Performance Architect to analyze and develop next-generation architectures that accelerate AI and high-performance computing applications. The role focuses on performance and energy modeling, architecture simulation, profiling, and translating silicon measurements into architecture features and simulators. This is a full-time role based in Santa Clara, CA.

Responsibilities

  • Develop innovative architectures to extend the state of the art in deep learning performance and efficiency.
  • Prototype key deep learning algorithms and applications.
  • Analyze performance, cost, and energy trade-offs by developing analytical models, simulators, and test suites.
  • Characterize power and performance on silicon parts and translate learnings to architecture features and simulators.
  • Understand and analyze the interplay of hardware and software architectures on future algorithms, programming models and applications.
  • Collaborate with software, product and research teams to guide the direction of deep learning hardware and software.

Requirements

  • Masters degree (or equivalent experience) and 6+ years of relevant experience, or PhD and 3+ years of experience in Computer Science, Electrical Engineering, Computer Engineering, or related field.
  • Strong foundation in deep learning model architectures and workload analysis, with emphasis on LLM decode architectures and performance trade-offs.
  • Experience with performance and energy modeling, power architecture, architecture simulation, profiling, analysis, and visualizations.
  • Strong programming skills in Python and C++.

Ways to stand out (nice to have)

  • Background with GPU computing and parallel programming models such as CUDA.
  • Experience with deep neural network training, inference and optimization in leading frameworks (e.g., PyTorch, JAX).

Benefits

  • Competitive base salary (see ranges below), eligibility for equity and comprehensive benefits.
  • Base salary ranges by level: Level 4: 184,000 USD - 287,500 USD; Level 5: 224,000 USD - 356,500 USD.
  • NVIDIA is an equal opportunity employer committed to diversity and inclusion.

Additional details

  • Location: Santa Clara, CA, United States.
  • Employment type: Full time.
  • Applications accepted at least until July 29, 2025.