Senior System Software Bringup Engineer

at Nvidia
USD 184,000-356,500 per year
SENIOR
✅ On-site

Used Tools & Technologies

Not specified

Required Skills & Competences

  • Security @ 4
  • Software Development @ 8
  • Python @ 4
  • Networking @ 7
  • Debugging @ 4
  • System Architecture @ 3
  • GPU @ 4

Details

NVIDIA develops GPU-centric AI computing platforms and systems (HGX, MGX, DGX) for large-scale data center and enterprise AI infrastructure. This role leads system bringup for GPU-centric server platforms in factory and data center environments, focusing on firmware, diagnostics, deployment, and automation to ensure efficient and reliable production ramp and data center operation.

Responsibilities

  • Lead and drive system bringup for GPU-centric server platforms in factory and data center environments.
  • Design and implement end-to-end factory workflows, including firmware flashing sequences, security provisioning, and deployment of software mitigations.
  • Collaborate cross-functionally with data center architects, ODMs, and OEMs to define factory and data center requirements for efficient and reliable production ramp.
  • Champion reliability, debuggability and optimization in firmware, diagnostic and deployment tool design.
  • Use AI tools to automate manual tasks and strengthen existing automation.
  • Troubleshoot quickly, working closely with system bring-up teams on next-generation AI systems to debug and resolve issues during bringup and deployment.

Requirements

  • 10+ years of experience in data center firmware/platform software development.
  • BS, MS, or PhD in Electrical Engineering (EE), Computer Science (CS), or a related technical field, or equivalent experience.
  • Deep, hands-on expertise working with ODMs/CSPs, firmware update design, and out-of-band management.
  • Proven track record of architecting and developing server firmware and diagnostic solutions for large-scale data center deployments.
  • Solid knowledge of hardware interfaces such as USB, SMBus/I2C, and PCIe, and of management protocols such as Redfish, MCTP, and PLDM.
  • Solid knowledge of server debugging for early bring-up and production troubleshooting.
  • Advanced skills in C/C++ and Python, with a hands-on approach to coding and debugging during hardware bring-up.
  • Strong communicator, excellent collaborator, and committed team player.
  • Self-starter with a problem-solving mindset who thrives in a fast-paced, complex technical environment.

Ways to stand out

  • Hands-on experience with ODMs/CSPs during system bring-up and volume deployment.
  • Deep familiarity with x86 or ARM system architecture.
  • Strong networking expertise with high-speed NICs, including bring-up and configuration in factory environments.

Compensation & Benefits

  • Base salary ranges by level:
    • Level 4: 184,000 USD - 287,500 USD per year
    • Level 5: 224,000 USD - 356,500 USD per year
  • You will also be eligible for equity and benefits (see NVIDIA benefits page).

Additional details

  • Location: Santa Clara, CA, United States
  • Employment type: Full time
  • Application window: Applications for this job will be accepted at least until August 13, 2025.
  • NVIDIA is an equal opportunity employer committed to diversity and inclusion.