Used Tools & Technologies
Not specified
Required Skills & Competences
Security @ 4 Jenkins @ 4 Kubernetes @ 4 Python @ 4 CI/CD @ 4 Communication @ 4 Debugging @ 4 QA @ 4 Compliance @ 4 GPU @ 4
Details
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation fueled by great technology and amazing people. The company is tapping into the unlimited potential of AI to define the next era of computing, where GPUs act as the brains of computers, robots, and self-driving cars that understand the world.
Responsibilities
- Define end-to-end test strategy and validation for firmware across multiple NVIDIA platforms from pre-silicon simulation to production readiness.
- Architect scalable test infrastructure with modular, reusable frameworks supporting functional, integration, stress, regression, power, security, and performance testing.
- Collaborate with firmware developers, hardware architects, and software teams to ensure comprehensive test coverage and optimize testability.
- Own firmware quality metrics such as code coverage, system uptime, bug escape rate, and validation completeness.
- Lead root cause analysis and debugging across firmware, software, and hardware layers.
- Innovate lab automation and CI/CD pipelines, integrating continuous testing into workflows.
- Validate real-world use cases and production scenarios to ensure customer readiness.
- Mentor and coach firmware QA engineers; promote a culture of quality, innovation, and continuous learning.
- Use AI-powered tools to accelerate test development, automate validation workflows, and streamline debugging.
Requirements
- Bachelor’s, Master’s, or PhD in Electrical Engineering, Computer Engineering, Computer Science, or related field.
- 12+ years of experience in software/firmware testing, with a focus on embedded or low-level systems.
- Strong knowledge of system architectures, boot processes, SoCs, I2C/SPI/PCIe interfaces, and embedded controllers.
- Proven experience designing test frameworks and infrastructure in Python, C/C++, or similar languages.
- Expertise with platform standards for security, telemetry, and manageability (NIST, DMTF).
- Hands-on experience configuring and debugging server platforms, networks, storage, and clusters.
- Background in platform telemetry and data center node lifecycle management, including CPU/GPU workloads.
- Skilled in scripting languages such as Python.
- Expertise in administering and configuring Kubernetes and Envoy.
- Proven experience with CI/CD tools such as GitLab and Jenkins, and with GitOps workflows.
- Experience with lab automation, HW-in-the-loop testing, and CI/CD pipelines.
- Strong debugging, problem-solving, and analytical skills.
- Excellent communication and collaboration skills with experience in globally distributed teams.
Ways to Stand Out
- Experience with NVIDIA platforms (DGX, HGX, Grace Hopper systems).
- Exposure to security validation, compliance (FIPS, BMC security), and thermal/power validation.
- Prior role as a test architect or technical lead for large-scale firmware or embedded validation.
- Contributions to open-source testing tools or frameworks related to cloud-scale validation, automation, or virtualization.
- Prior experience using AI tools for test planning, gap identification, automation, and failure analysis.
Benefits
- Competitive salary and benefits package.
- Flexible work environment.
- Opportunity to work with industry-leading experts.
- NVIDIA offers a comprehensive benefits package detailed at www.nvidiabenefits.com/
The base salary range is 200,000 USD - 391,000 USD. Salary depends on location, experience, and market comparisons.
NVIDIA is an equal opportunity employer committed to diversity and does not discriminate based on protected characteristics.