Used Tools & Technologies
Not specified
Required Skills & Competences ?
Linux @ 4 Automated Testing @ 4 Hiring @ 4 Communication @ 7 JavaScript @ 4 Perl @ 4 GPU @ 4Details
NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing. NVIDIA is a “learning machine” that constantly evolves by adapting to new opportunities that are hard to solve, that only we can seek, and that matter to the world. This is our life’s work, to amplify human imagination and intelligence.
We are now looking for a Sr. System Validation Engineer in the DGX Server Product Engineering Team. You will collaborate with a team of HW/SW engineers to develop and execute complex automated test plans for our industry leading GPU accelerated computing products. In this role you will gain a deep understanding of datacenter hardware and software as you solve complex technical issues. Your primary responsibility is to ensure outstanding quality GPU server products are brought to market. You should be comfortable in a lab environment and demonstrate a passion towards product system level validation and quality!
Responsibilities
- System level debug and validation of NVIDIA's datacenter products.
- Develop test plans for new features on each product along with automation scripts to execute tests and archive results.
- Review proposed design changes to assess impact to validation plans and tasks. Recommend design changes to improve automated testing.
- System level performance testing using industry leading Deep Learning/AI applications.
- Maintenance and debug of a medium-scale datacenter cluster for larger scale testing of our systems.
- Follow-through on all reported bugs to root cause issues that could impact quality and customer experience with our products.
- Contribute to the development of new system validation processes.
- Debug complex system level hardware and software.
Requirements
- BSEE or BSCE or equivalent experience.
- 8+ years of hands-on hardware/software debug experience.
- Experience with Linux OS and command line operation.
- Deep understanding of computing architectures.
- Coding experience with programming and scripting languages such as C/C++, JavaScript, and Perl.
- Strong verbal and written communication skills.
Ways to stand out from the crowd
- Experience with BMC and server management processors.
- Background with system level validation, IO interconnects, PCI-Express high-speed data switches.
- Understanding of DC/DC power supply operation and sensor feedback control.
- Experience with x86/Arm architectures and accelerated GPU computing.
Compensation & Benefits
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 160,000 USD - 253,000 USD for Level 4, and 192,000 USD - 304,750 USD for Level 5. You will also be eligible for equity and benefits (see NVIDIA benefits page).
Additional information
Applications for this job will be accepted at least until November 29, 2025.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.