Used Tools & Technologies
Not specified
Required Skills & Competences
Each tag name is followed by an "@" symbol and a proficiency level.
About proficiency levels:
- 1-2: basic awareness. Minimal hands-on experience and a rudimentary understanding of the technology's purpose;
- 3-6: daily use. Comfortable, regular usage; capable of handling common tasks and challenges related to the technology;
- 7-9: expert. You can teach others and you know the pitfalls and tricks;
- 10: exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such a level.
Security @ 4
Linux @ 4
Python @ 4
Hiring @ 4
Bash @ 4
Communication @ 4
Debugging @ 4
QA @ 4
GPU @ 4
AI @ 4
Details
NVIDIA is hiring a Senior Software Engineer focused on server manageability for GPU-based AI servers, with emphasis on out-of-band (OOB) management, BMC firmware, server architecture, and enterprise systems delivery. The role involves driving firmware and system software design, collaborating with hardware teams and industry partners, and ensuring software quality and security for AI server platforms.
Responsibilities
- Design, implement, and deliver innovations for managing GPU-based AI servers, with a focus on OOB management, firmware development, server architecture, and enterprise systems.
- Lead BMC firmware design with a global team of engineers.
- Design and develop performance-optimized active monitoring BMC solutions using DMTF standards including MCTP, Redfish, SPDM and PLDM specifications.
- Instrument code for maximum code coverage; write and automate unit tests for each module and maintain detailed unit test case reports.
- Provide software quality reports based on static analysis, code coverage, and CPU load.
- Work with security teams to align developed code with product security goals.
- Work closely with hardware teams to influence hardware design and review hardware architecture and schematics.
- Drive the definition and end-to-end delivery of AI server platforms by collaborating with internal teams, ODMs/OEMs, and industry partners.
- Collaborate with QA/Test architects to develop test tools and automation for qualifying the full system software and firmware stack.
Requirements
- Domain expertise in BMC firmware development on x86 or ARM platforms, including BMC-BIOS communication, thermal management, power management, firmware update, device monitoring, and firmware security.
- Solid experience in end-to-end delivery of high-end enterprise servers from definition to customer deployment.
- Solid understanding of low-level interfaces between SBIOS, BMC and OS such as I2C, SPI, PCIe, JTAG; knowledge of PCIe enumeration and platform-level I/O for enterprise systems.
- Experience working closely with hardware teams, ODMs, and vendors to introduce and support server platforms.
- Experience with C/C++ development, Bash/Python scripting, and debugging in embedded Linux environments.
- Strong written and oral communication skills; proven ability as an individual contributor; self-starter with attention to quality and delivery.
- Bachelor’s, Master’s, or PhD in Electrical Engineering or Computer Science (or equivalent experience), plus 5+ years of relevant experience.
Ways to Stand Out
- Contributions to industry standards such as Open Compute, IPMI, or DMTF, or to the OpenBMC open source project.
- Proven track record of delivering BMC firmware for enterprise servers using the OpenBMC stack.
Compensation & Benefits
- Base salary ranges by level: Level 3: 152,000 USD - 241,500 USD; Level 4: 184,000 USD - 287,500 USD.
- Eligible for equity and company benefits (link to NVIDIA benefits).
Additional Information
- Applications accepted at least until April 13, 2026.
- NVIDIA uses AI tools in its recruiting processes.
- NVIDIA is an equal opportunity employer and values diversity in hiring and promotion practices.