Used Tools & Technologies
HPCRequired Skills & Competences
Tag name is followed by "@" symbol and proficiency level value.
About proficiency levels:
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Security @ 3
Software Development @ 3
Cumulus Linux @ 3
Linux @ 3
Python @ 3
Leadership @ 7
People Management @ 3
Communication @ 5
Networking @ 7
Project Management @ 3
AI @ 3
Data Modeling @ 3
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Details
NVIDIA is looking for a top-tier Software Engineering Leader to join the NVIDIA-Cumulus Linux Team. You will lead Cumulus Linux development, take ownership of features end-to-end, and deliver independently with minimal supervision.
The team develops the Network Operating System software that powers data centers optimized for AI and high-performance computing. In this role you will lead a software development team responsible for defining and implementing Cumulus Linux core infrastructure services, Reliability/Availability/Serviceability features, and telemetry. You will collaborate with multi-functional engineering and product teams.
Responsibilities
- Lead a team developing and delivering Cumulus Linux operating system telemetry and infrastructure features.
- Partner with other engineering teams to scope and develop solutions to improve system security, performance, and reliability.
- Develop and debug C and Python code for system monitoring, reliability, and serviceability features as needed.
- Collaborate with product, architecture, and engineering teams for end-to-end integration of infrastructure features into Linux and the Cumulus Linux distribution.
- Work with project management for effort estimation and feature planning.
- Work with recruiting to expand the team: sourcing, interviewing, participating in conferences/events, and onboarding.
- Mentor engineers, assign projects aligned to career development and strengths.
- Engage with upstream communities as needed and supervise technology trends and emerging standards.
- Proactively guide problem-solving to minimize incidents and prevent recurrence.
Requirements
- Master of Science in Electrical Engineering, Computer Science, Computer Engineering, or Bachelors (or equivalent experience).
- 10+ years of proven leadership in Linux systems and data center networking technologies.
- 2+ years of people management experience in an enterprise environment.
- Familiarity with cloud native concepts.
- Strong background in Linux OS feature development.
- Good knowledge of data modeling concepts, OpenConfig, and streaming telemetry protocols like gNMI.
- Experience driving projects from concept to production.
- Excellent written and verbal communication and interpersonal skills; comfortable articulating value and influencing internal teams.
- Experience with embedded software on network switches.
- Experience with bring-up and troubleshooting of Ethernet interfaces and modules.
- Familiarity with datacenter protocols.
- Ability to work independently with minimal direction.
Ways to stand out
- Strong background in Linux systems and Linux kernel networking.
- Strong hands-on lab experience involving system bring-up and debug.
- Collaborative working style.
Compensation & Benefits
- Base salary range (by level):
- Level 3: 224,000 USD - 356,500 USD
- Level 4: 272,000 USD - 431,250 USD
- You will also be eligible for equity and benefits. Base salary will be determined based on location, experience, and pay of employees in similar positions.
Additional information
- Applications for this job will be accepted at least until June 4, 2026.
- This posting is for an existing vacancy.
- NVIDIA uses AI tools in its recruiting processes.
- NVIDIA is an equal opportunity employer and is committed to fostering an inclusive work environment.