Used Tools & Technologies
Not specified
Required Skills & Competences
Each tag name is followed by an "@" symbol and a proficiency level value.
About proficiency levels:
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Software Development @ 6
Python @ 7
Algorithms @ 4
Hiring @ 4
Communication @ 7
Performance Optimization @ 4
Debugging @ 4
GPU @ 4
AI @ 4
Profiling @ 4
Performance Analysis @ 7
Details
NVIDIA’s GPU Architecture Group is hiring architects to help design a proprietary profiler subsystem embedded in GPUs that enables profiling and monitoring tools to capture data and provide feedback for performance optimization. The role combines hardware modeling and verification with a deep understanding of GPU architecture, operating systems, and application performance analysis to innovate new hardware profiling methods.
Responsibilities
- Architect and plan next-generation GPU profiling features in collaboration with software, hardware, and verification teams across the globe.
- Build functional and performance models to refine and verify hardware designs.
- Create test plans to validate designed features and contribute to their implementation.
- Continuously deepen your understanding of AI workloads, GPU architecture, and the profiling software stack to drive practical innovation.
Requirements
- Master's or PhD in a relevant field (e.g., Computer Science, Computer Engineering, Electrical Engineering) or equivalent experience.
- 3+ years of relevant experience in computer architecture, ASIC design/verification, or software development.
- Strong programming skills in C++ (or similar) and Python (or similar).
- Solid foundation in computer architecture and hardware performance analysis.
- Experience with performance modeling and hardware simulation; experience with SystemC is ideal.
- Strong communication and interpersonal skills; ability to work with a distributed interdisciplinary team.
Ways to stand out
- Expertise in developing and optimizing parallel algorithms, particularly using GPUs.
- Extensive experience as a user or developer of CPU or GPU profiling tools.
- Background with AI and/or high-performance computing applications.
- Experience contributing to and debugging large codebases with many developers.
Compensation & Benefits
- Base salary ranges (depending on level and location):
  - Level 4: 184,000 USD - 287,500 USD
  - Level 5: 224,000 USD - 356,500 USD
- Eligible for equity and benefits (link to NVIDIA benefits referenced in the posting).
Additional information
- Applications accepted at least until March 13, 2026.
- This posting is for an existing vacancy.
- NVIDIA uses AI tools in its recruiting processes.
- NVIDIA is an equal opportunity employer committed to a diverse work environment.