Used Tools & Technologies
Not specified
Required Skills & Competences
Python @ 7, Algorithms @ 4, Parallel Programming @ 4, PyTorch @ 4, CUDA @ 4, GPU @ 4
Details
We are seeking a Senior Deep Learning Performance Architect to join NVIDIA's Deep Learning Architecture team. The role focuses on performance and energy modeling, architecture simulation, profiling, and analysis to help develop next-generation architectures that accelerate AI and high-performance computing applications. The position involves close collaboration with software, product, and research teams to guide the direction of deep learning hardware and software.
Responsibilities
- Develop innovative architectures to extend the state of the art in deep learning performance and efficiency.
- Prototype key deep learning algorithms and applications.
- Analyze performance, cost and energy trade-offs by developing analytical models, simulators and test suites.
- Characterize power and performance on silicon parts and translate the learnings to architecture features and simulators.
- Understand and analyze the interplay of hardware and software architectures on future algorithms, programming models and applications.
- Actively collaborate with software, product and research teams to guide the direction of deep learning hardware and software.
Requirements
- Master’s degree (or equivalent experience) with 6+ years of relevant experience, or PhD with 3+ years of experience, in Computer Science, Electrical Engineering, Computer Engineering, or a related field.
- Strong foundation in deep learning model architectures and performance trade-offs.
- Experience with performance and energy modeling, power architecture, architecture simulation, profiling, analysis, and visualizations.
- Strong programming skills in Python, C, and C++.
- Experience with the architecture of, or workload analysis on, deep learning accelerators.
Ways to stand out
- Background in GPU computing and parallel programming models such as CUDA.
- Experience with deep neural network training, inference and optimization in leading frameworks (for example, PyTorch, JAX).
Compensation
- Base salary range for Level 4: 184,000 USD - 287,500 USD.
- Base salary range for Level 5: 224,000 USD - 356,500 USD.
- You will also be eligible for equity and benefits.
Additional details
- Location: Santa Clara, CA, United States.
- Employment type: Full time.
- Applications accepted at least until July 29, 2025.
- NVIDIA is an equal opportunity employer and values diversity in current and future employees.