Required Skills & Competences
Security @ 4 Jenkins @ 4 Kubernetes @ 4 DevOps @ 4 Python @ 6 CI/CD @ 7 Communication @ 1 Debugging @ 4 Reporting @ 4 QA @ 4 System Architecture @ 7 Compliance @ 4 GPU @ 4
Details
NVIDIA is seeking a Senior Test Architect to join the Enterprise Software QA team to design, build, optimize, and validate firmware and data center systems. The role focuses on driving quality, reliability, and scalability across firmware stacks used in AI, HPC, and cloud-scale systems. The position requires deep knowledge of firmware, embedded systems, datacenter platforms, test infrastructure, and CI/CD-driven lab automation.
Responsibilities
- Define end-to-end test strategy and validation plans for firmware across multiple NVIDIA platforms (pre-silicon simulation/emulation through post-silicon bring-up and production readiness).
- Architect scalable test infrastructure: design modular, reusable test frameworks and automation harnesses for functional, integration, stress, regression, power, security, and performance testing at scale.
- Collaborate with firmware developers, hardware architects, silicon validation, platform QA, and system software teams to ensure comprehensive test coverage and influence early design decisions for testability.
- Own firmware quality metrics (code coverage, system uptime, bug escape rate, validation completeness) and establish dashboards and reporting.
- Lead root-cause analysis and complex debugging that span firmware, software, and hardware layers; develop and document debug methodologies and tools.
- Innovate in lab automation and CI/CD: partner with DevOps to integrate continuous testing into nightly and pre-merge workflows and improve test automation pipelines.
- Validate real-world use cases, customer configurations, and production scenarios; contribute to release gates and sign-off criteria.
- Mentor and lead firmware QA engineers and junior test developers; champion new tools, methodologies, and best practices.
- Use AI-powered tools and copilots to accelerate test development, automate repetitive validation workflows, and streamline debug and root-cause analysis.
Requirements
- B.S./M.S./Ph.D. in Electrical Engineering, Computer Engineering, Computer Science, or related field.
- 12+ years of experience in software/firmware testing, with a focus on embedded or low-level systems.
- Strong knowledge of system architecture, boot processes, SoCs, I2C/SPI/PCIe interfaces, and embedded controllers.
- Proven experience designing test frameworks and infrastructure in Python, C/C++, or similar languages, including strong Python scripting skills.
- Expertise with platform standards for security, telemetry and manageability (NIST, DMTF); hands-on experience with server platform, network, storage, cluster configuration and debugging.
- Background with platform telemetry and datacenter node lifecycle management/support, including CPU/GPU workloads.
- Expertise in administering, operating, and configuring Kubernetes and Envoy.
- Validated experience in CI/CD tools and models (GitLab, Jenkins, GitOps).
- Experience with lab automation, HW-in-the-loop testing, and CI/CD pipelines.
- Strong debugging, problem-solving, and analytical skills.
- Excellent communication and collaboration skills; experience working in globally distributed teams is a plus.
Ways to Stand Out
- Experience with NVIDIA platforms (e.g., DGX, HGX, Grace Hopper systems).
- Exposure to security validation and compliance (FIPS, BMC security) or thermal/power validation.
- Prior roles as a test architect or technical lead for large-scale firmware or embedded validation programs.
- Contributions to open-source testing tools or frameworks and strong knowledge of cloud-scale validation, infrastructure automation, or virtualization.
- Prior experience using AI tools to design test plans, identify test gaps, automate tests, and perform failure analysis.
Benefits
- Competitive base salary (see ranges below), eligibility for equity, and comprehensive benefits.
- Flexible work environment and opportunity to work with industry-leading experts.
- NVIDIA benefits information: https://www.nvidiabenefits.com/.
Salary
- Base salary ranges by level:
  - Level 5: 200,000 USD - 322,000 USD
  - Level 6: 248,000 USD - 391,000 USD
Applications are accepted at least until July 29, 2025. NVIDIA is an equal opportunity employer committed to diversity and inclusion.