Used Tools & Technologies
Not specified
Required Skills & Competences
Tag name is followed by "@" symbol and proficiency level value.
About proficiency levels:
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Security @ 4
Linux @ 4
Python @ 4
CI/CD @ 4
Hiring @ 4
Leadership @ 7
Communication @ 7
Git @ 4
Jira @ 4
Debugging @ 4
Technical Leadership @ 7
System Architecture @ 7
GPU @ 4
Deep Learning @ 7
AI @ 4
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Details
NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern deep learning - the next era of computing - with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as "the AI computing company." We're looking to grow our company and establish teams with the most thoughtful people in the world.
We are looking for an excellent Senior Engineering Manager to lead a large firmware engineering organization delivering end-to-end manageability firmware for NVIDIA's next generation Data Center Compute Systems. This role owns HGX product line and OpenBMC-based management firmware and MCU firmware components in data center platforms, including architecture, execution, quality, reliability, telemetry, and customer readiness. We are seeking an experienced senior leader with strong technical depth, broad system perspective, and a proven ability to lead large teams through complex product cycles. This role is onsite in Santa Clara, CA, USA.
Responsibilities
- Lead a large firmware engineering organization delivering OpenBMC based firmware and MCU firmware for next-generation Data Center Compute Systems.
- Own HGX platform as a lead for Firmware and System software readiness working across the organization.
- Define and drive the long-term firmware roadmap, balancing architectural innovation with product execution and delivery milestones.
- Drive architecture strategy across BMC, MCU, platform software, manageability, health management, and data center firmware interfaces.
- Lead execution across multiple programs, coordinating priorities, hiring, managing cross component dependencies, and delivery commitments across a large engineering team.
- Collaborate with data center architects, cloud customers, senior stakeholders, and cross-functional teams to define requirements, scope implementation, and deliver at Speed of Light.
- Partner with hardware, systems, security, validation, manufacturing, field, and customer engineering teams to ensure scalable manageability architecture across data center products.
- Manage customer and executive escalations for complex firmware, platform, and deployment issues.
- Build, mentor, and grow a high-performing engineering organization with strong technical leadership, execution discipline, and quality culture.
Requirements
- BS, MS, or PhD in EE/CS or related field of education or equivalent experience.
- 12+ overall years of proven experience in server firmware, BMC/OpenBMC, MCU firmware, platform software, or data center systems.
- 6+ years of experience managing software/firmware engineering teams.
- Strong technical leadership in data center system architecture, server manageability, telemetry, health management, and reliability at scale.
- Proven record delivering production firmware for large data centers with strong quality, debug, and operational discipline.
- Experience leading architecture and execution across multiple programs, cross-functional teams, and customer-facing deliverables.
- Strong understanding of firmware development lifecycle, validation, release management, issue triage, and production support.
- Excellent communication skills, strong work ethic, sound judgment, and the ability to align teams through complex technical and business tradeoffs.
Ways to Stand Out
- Experience leading large distributed engineering organizations, including multi-team execution and senior technical leaders.
- High level of ownership to deliver products working across matrixed organizations; demonstrated ownership across products.
- Hands-on experience with BMC firmware/software stack, MCU firmware, C/C++, Python, and debugging server platform.
- Expertise with OOB management DMTF protocols and standards such as MCTP, PLDM, SPDM, and Redfish.
- Experience with Embedded Linux, FreeRTOS, Yocto/BitBake, Git, Perforce, Jira, and modern firmware CI/CD practices.
- Proven ability to drive complex architecture, quality, reliability, and customer escalation work across 25+ engineers or similarly large engineering teams.
Compensation & Other Details
- This role is onsite in Santa Clara, CA, USA.
- Base salary range: 272,000 USD - 431,250 USD (will be determined based on location, experience, and pay of employees in similar positions).
- You will also be eligible for equity and benefits.
- Applications for this job will be accepted at least until June 20, 2026.
- NVIDIA uses AI tools in its recruiting processes.
- NVIDIA is an equal opportunity employer and is committed to fostering an inclusive work environment.