Senior Deep Learning Performance Architect

at Nvidia
USD 184,000-287,500 per year
SENIOR
βœ… On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Python @ 4 Product Management @ 4 GPU @ 4

Details

We are looking for a Senior Deep Learning Performance Architect to analyze and develop next-generation architectures that accelerate AI and high-performance computing applications. The role focuses on benchmarking and analyzing AI workloads, developing simulators and analysis tools, and evaluating system-level architectural trade-offs (performance, power, area).

Responsibilities

  • Develop innovative hardware architectures to advance parallel computing performance, energy efficiency, and programmability.
  • Benchmark and analyze AI workloads in single-node and multi-node configurations.
  • Develop high-level simulators and analysis tools using C++ and Python.
  • Evaluate PPA (performance, power, area) for hardware features and system-level architectural trade-offs.
  • Collaborate closely with peer architecture teams and product management to guide product development.
  • Keep up to date with emerging trends and research in deep learning.

Requirements

  • MS or PhD in a relevant discipline (Computer Science, Electrical Engineering, Computer Engineering, etc.) or equivalent experience.
  • 4+ years of experience in parallel computing architectures, interconnect fabrics, and deep learning applications.
  • Background in GPU or deep learning ASIC architecture evaluation for training and/or inference.
  • Strong programming skills in Python and C++.

Ways to Stand Out

  • Solid fundamental knowledge in computer architecture and interconnect fabrics.
  • Understanding of modern transformer-based model architectures.
  • Ability to simplify and communicate rich technical concepts to non-technical audiences.
  • Curious demeanor and excellent problem-solving skills.

Benefits & Compensation

  • Base salary range: 184,000 USD - 287,500 USD (final base salary determined based on location, experience, and pay of employees in similar positions).
  • Eligibility for equity and company benefits (see https://www.nvidia.com/en-us/benefits/).

Additional Information

  • Location: Santa Clara, CA, United States.
  • Employment type: Full time.
  • Application window: Applications for this job will be accepted at least until August 5, 2025.
  • Start date (listed): 2025-08-01.