Required Skills & Competences
Python @ 7, Algorithms @ 4, Parallel Programming @ 4, LLM @ 7, PyTorch @ 4, CUDA @ 4, GPU @ 4
Details
We are seeking a Senior Deep Learning Performance Architect to analyze and develop next-generation architectures that accelerate AI and high-performance computing applications. The role focuses on performance and energy modeling, architecture simulation, profiling, and translating silicon measurements into architecture features and simulators. This is a full-time role based in Santa Clara, CA.
Responsibilities
- Develop innovative architectures to extend the state of the art in deep learning performance and efficiency.
- Prototype key deep learning algorithms and applications.
- Analyze performance, cost, and energy trade-offs by developing analytical models, simulators, and test suites.
- Characterize power and performance on silicon parts and translate learnings to architecture features and simulators.
- Understand and analyze how hardware and software architectures interact with future algorithms, programming models, and applications.
- Collaborate with software, product and research teams to guide the direction of deep learning hardware and software.
Requirements
- Master's degree (or equivalent experience) and 6+ years of relevant experience, or PhD and 3+ years of experience, in Computer Science, Electrical Engineering, Computer Engineering, or a related field.
- Strong foundation in deep learning model architectures and workload analysis, with emphasis on LLM decode architectures and performance trade-offs.
- Experience with performance and energy modeling, power architecture, architecture simulation, profiling, analysis, and visualization.
- Strong programming skills in Python and C++.
Ways to stand out (nice to have)
- Background with GPU computing and parallel programming models such as CUDA.
- Experience with deep neural network training, inference and optimization in leading frameworks (e.g., PyTorch, JAX).
Benefits
- Competitive base salary (see ranges below), eligibility for equity and comprehensive benefits.
- Base salary ranges by level: Level 4: 184,000 USD - 287,500 USD; Level 5: 224,000 USD - 356,500 USD.
- NVIDIA is an equal opportunity employer committed to diversity and inclusion.
Additional details
- Location: Santa Clara, CA, United States.
- Employment type: Full time.
- Applications accepted at least until July 29, 2025.