Senior System Software Bringup Engineer
at Nvidia
π Santa Clara, United States
USD 184,000-356,500 per year
SCRAPED
Used Tools & Technologies
Not specified
Required Skills & Competences ?
Security @ 4 Software Development @ 8 Python @ 4 Networking @ 7 Debugging @ 4 System Architecture @ 3 GPU @ 4Details
NVIDIA develops GPU-centric AI computing platforms and systems (HGX, MGX, DGX) for large-scale data center and enterprise AI infrastructure. This role leads system bringup for GPU-centric server platforms in factory and data center environments, focusing on firmware, diagnostics, deployment, and automation to ensure efficient and reliable production ramp and data center operation.
Responsibilities
- Lead and drive system bringup for GPU-centric server platforms in factory and data center environments.
- Design and implement end-to-end factory workflows, including firmware flashing sequences, security provisioning, and deployment of software mitigations.
- Collaborate cross-functionally with data center architects, ODMs, and OEMs to define factory and data center requirements for efficient and reliable production ramp.
- Champion reliability, debuggability and optimization in firmware, diagnostic and deployment tool design.
- Use AI tools to automate functionality and improve automation.
- Troubleshoot quickly, working closely with system bring-up teams on next-generation AI systems to debug and resolve issues during bringup and deployment.
Requirements
- 10+ years of experience in data center firmware/platform software development.
- BS, MS, or PhD in Electrical Engineering (EE), Computer Science (CS), or a related technical field β or equivalent experience.
- Deep, hands-on expertise working with ODMs/CSPs, firmware update design, and out-of-band management.
- Proven track record of architecting and developing server firmware and diagnostic solutions for large-scale data center deployments.
- Solid knowledge of hardware interfaces and protocols, including USB, SMBus/I2C, PCIe, and protocols such as Redfish, MCTP, and PLDM.
- Solid knowledge of debugging servers for early bring up and production troubleshooting.
- Advanced hands-on skills in C/C++ and Python, with a hands-on approach to coding and debugging during hardware bring-up.
- Strong communicator, excellent collaborator, and committed team player.
- Self-starter with a problem-solving mindset who thrives in a fast-paced, complex technical environment.
Ways to stand out
- Hands-on experience with ODMs/CSPs during system bring-up and volume deployment.
- Deep familiarity with x86 or ARM system architecture.
- Strong networking expertise with high-speed NICs, including bring-up and configuration in factory environments.
Compensation & Benefits
- Base salary ranges by level:
- Level 4: 184,000 USD - 287,500 USD per year
- Level 5: 224,000 USD - 356,500 USD per year
- You will also be eligible for equity and benefits (see NVIDIA benefits page).
Additional details
- Location: Santa Clara, CA, United States
- Employment type: Full time
- Application window: Applications for this job will be accepted at least until August 13, 2025.
- NVIDIA is an equal opportunity employer committed to diversity and inclusion.