Data Center Firmware Test Architect

at Nvidia
USD 200,000-391,000 per year
SENIOR
✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Security @ 4 Jenkins @ 4 Kubernetes @ 6 DevOps @ 4 Python @ 4 CI/CD @ 4 Hiring @ 4 Communication @ 1 Debugging @ 4 Reporting @ 4 QA @ 4 System Architecture @ 7 Compliance @ 4 GPU @ 4

Details

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work.

We are seeking a highly skilled and hard-working Senior Test Architect to join our multifaceted Enterprise Software QA team. This role offers an outstanding opportunity to leave your mark on the design, construction, optimization and testing of our flagship super computers and data center offerings. If you are a dedicated engineer with a deep understanding of firmware and data center systems, and you thrive in an exciting, innovative environment, this could be the flawless role for you!

Responsibilities

  • Define end-to-end test strategy and own the overall test architecture and validation strategy for firmware across multiple NVIDIA platforms (pre-silicon simulation/emulation through post-silicon bring-up and production readiness). Develop test plans aligned with product deliverables and customer use cases.
  • Architect scalable test infrastructure: design and implement modular, reusable test frameworks and automation harnesses supporting functional, integration, stress, regression, power, security, and performance testing at scale across hundreds of systems.
  • Engage with cross-functional engineering teams (firmware developers, hardware architects, silicon validation, platform QA, system software) to ensure comprehensive test coverage and influence design-for-testability.
  • Own firmware quality metrics (code coverage, system uptime, bug escape rate, validation completeness) and establish dashboards/reporting for data-driven decisions.
  • Lead complex root-cause analysis and cross-layer debugging spanning firmware, software, and hardware; develop debug methodologies and tools.
  • Innovate in lab automation and CI/CD: partner with DevOps and infrastructure teams to integrate continuous testing into nightly and pre-merge workflows for fast, reliable release qualification.
  • Validate real-world use cases, customer configurations, and production scenarios; contribute to release gates and sign-off criteria for deployment readiness.
  • Mentor and coach firmware QA engineers and junior test developers; foster a culture of quality, innovation, and continuous learning.
  • Leverage AI-powered tools and copilots to accelerate test development, automate repetitive validation workflows, and streamline debug and failure analysis.

Requirements

  • B.S./M.S./Ph.D. in Electrical Engineering, Computer Engineering, Computer Science, or related field.
  • 12+ years of experience in software/firmware testing, with a focus on embedded or low-level systems.
  • Strong knowledge of system architecture, boot processes, SoCs, I2C/SPI/PCIe interfaces, and embedded controllers.
  • Proven experience designing test frameworks and infrastructure using Python, C/C++, or similar languages.
  • Expertise with platform standards for security, telemetry and manageability (e.g., NIST, DMTF). Hands-on experience with server platform, network, storage, cluster configuration and debugging.
  • Background with platform telemetry and datacenter node lifecycle management/support including CPU/GPU workloads.
  • Proficiency in scripting (Python) and experience administering, operating, and configuring Kubernetes and Envoy.
  • Validated experience in CI/CD tools and models such as GitLab, Jenkins, and GitOps.
  • Experience with lab automation, hardware-in-the-loop testing, and CI/CD pipelines.
  • Strong debugging, problem-solving, and analytical skills.
  • Excellent communication and collaboration skills; experience working in a globally distributed team is a plus.

Ways To Stand Out

  • Experience with NVIDIA platforms (e.g., DGX, HGX, Grace Hopper systems).
  • Exposure to security validation and compliance (e.g., FIPS, BMC security), or thermal/power validation.
  • Prior role as a test architect or technical lead for large-scale firmware or embedded validation programs.
  • Contributions to open-source testing tools or frameworks; strong knowledge of cloud-scale validation, infrastructure automation, or virtualization.
  • Prior experience using AI tools to design test plans, identify test gaps, automate tests, and support failure analysis.

Benefits & Compensation

By joining our team, you will be part of a forward-thinking company that values innovation and creativity. We offer a competitive salary and benefits package, a flexible work environment, and the opportunity to work with industry-leading experts. You will also be eligible for equity and benefits.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 200,000 USD - 322,000 USD for Level 5, and 248,000 USD - 391,000 USD for Level 6.

Applications for this job will be accepted at least until July 29, 2025.

NVIDIA is committed to fostering a diverse work environment and is an equal opportunity employer. We do not discriminate (including in hiring and promotion) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.