Senior Deep Learning Performance Architect

at Nvidia
USD 184,000-356,500 per year
SENIOR
✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Python @ 7 Algorithms @ 4 Parallel Programming @ 4 LLM @ 7 PyTorch @ 4 CUDA @ 4 GPU @ 4

Details

We are looking for a Senior Deep Learning Performance Architect to join the Deep Learning Architecture team to analyze and develop next-generation architectures that accelerate AI and high-performance computing applications. This role focuses on performance and energy modeling, architecture simulation, profiling and translating silicon measurements into architecture features and simulators.

Responsibilities

  • Develop innovative architectures to extend the state of the art in deep learning performance and efficiency.
  • Prototype key deep learning algorithms and applications.
  • Analyze performance, cost and energy trade-offs by developing analytical models, simulators and test suites.
  • Characterize power and performance on silicon parts and translate the learnings to architecture features and simulators.
  • Understand and analyze the interplay of hardware and software architectures on future algorithms, programming models and applications.
  • Actively collaborate with software, product and research teams to guide the direction of deep learning hardware and software.

Requirements

  • Master's degree (or equivalent experience) and 6+ years of relevant experience, or PhD and 3+ years of experience in Computer Science, Electrical Engineering, Computer Engineering, or a related field.
  • Strong foundation in deep learning model architectures and workload analysis, with emphasis on LLM decode architectures and performance trade-offs.
  • Experience with performance and energy modeling, power architecture, architecture simulation, profiling, analysis, and visualizations.
  • Strong programming skills in Python and C++.

Ways to stand out:

  • Background with GPU computing and parallel programming models such as CUDA.
  • Experience with deep neural network training, inference and optimization in leading frameworks (e.g., PyTorch, JAX).

Benefits

  • Base salary (varies by level and location):
    • Level 4: $184,000 - $287,500 USD
    • Level 5: $224,000 - $356,500 USD
  • Eligible for equity and additional benefits.

Additional Information

  • Company: NVIDIA
  • Applications accepted at least until July 29, 2025.
  • NVIDIA is an equal opportunity employer committed to diversity and inclusion.