Used Tools & Technologies
Not specified
Required Skills & Competences
Tag name is followed by "@" symbol and proficiency level value.
About proficiency levels:
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Software Development @ 6
Python @ 7
Algorithms @ 4
Hiring @ 4
Communication @ 7
Performance Optimization @ 4
Debugging @ 4
GPU @ 4
Deep Learning @ 4
AI @ 4
Profiling @ 4
Performance Analysis @ 7
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Details
NVIDIA’s GPU Architecture Group is hiring architects to design and improve the proprietary profiler subsystem embedded in every GPU that enables profiling and monitoring tools to capture data and provide feedback for performance optimization. The role combines hardware modeling and verification with deep understanding of GPU architecture, operating systems, and application performance analysis. You will work with GPU architects, software engineers and deep learning experts to innovate hardware profiling methods and deliver actionable performance insights.
Responsibilities
- Architect and plan profiling features in collaboration with software, hardware, and verification teams across the globe.
- Build functional and performance models to refine and verify hardware designs.
- Create test plans to validate designed features and contribute to their implementation.
- Continuously develop skills by improving understanding of AI workloads, GPU architecture, and the profiling software stack.
Requirements
- Master’s or PhD in a relevant field (e.g., Computer Science, Computer Engineering, or Electrical Engineering) or equivalent experience.
- 3+ years of relevant experience in computer architecture, ASIC design/verification, or software development.
- Strong programming skills in C++ (or similar) and Python (or similar).
- Solid foundation in computer architecture and hardware performance analysis.
- Experience with performance modeling and hardware simulation, ideally using SystemC.
- Strong communication and interpersonal skills and ability to work with a distributed interdisciplinary team.
Ways to stand out
- Expertise in developing and optimizing parallel algorithms, particularly using GPUs.
- Extensive experience as a user or developer of CPU or GPU profiling tools.
- Background with AI and/or high-performance computing applications.
- Experience contributing to and debugging large codebases with many developers.
Compensation & Benefits
- Base salary ranges: Level 4 — 184000 USD to 287500 USD; Level 5 — 224000 USD to 356500 USD.
- Eligible for equity and benefits.
Other information
- Applications accepted at least until March 13, 2026.
- This posting is for an existing vacancy.
- NVIDIA uses AI tools in its recruiting processes.
- NVIDIA is an equal opportunity employer and committed to fostering a diverse work environment.