Distinguished Software Engineer - NVLink Fusion Software

at NVIDIA
USD 308,000-471,500 per year
SENIOR
✅ On-site

Used Tools & Technologies

Not specified

Required Skills & Competences

  • Communication @ 4
  • Networking @ 4
  • System Architecture @ 7

Details

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years with groundbreaking technology and innovation. The company is currently advancing AI technology to define the next era of computing, where GPUs act as the brains for computers, robots, and self-driving cars with world understanding capabilities.

NVIDIA's ecosystem includes data center platform and node designs ranging from single-node HGX/DGX systems to large multi-node NVLink domain rack architectures. These form the backbone of NVIDIA's enterprise and cloud businesses, combining NVIDIA GPUs, NVLink, InfiniBand networking, Grace CPUs, and a fully optimized AI and HPC software stack.

NVLink Fusion enables industry-leading AI scale-up and scale-out performance by combining NVIDIA technology with semi-custom ASICs or CPUs. The role involves collaborating with partners to build ASIC hybrid AI infrastructure integrated into NVIDIA's rack-scale architecture.

Responsibilities

  • Define NVLink Fusion architecture leveraging NVIDIA’s scale-up and scale-out technologies
  • Establish software abstraction layers and reference software for NVLink Fusion partners
  • Engage directly with major customers to align their roadmaps with NVIDIA’s
  • Collaborate with business partners and vendors to shape their products to meet NVIDIA’s requirements
  • Mentor architects and engineering teams to grow future leaders
  • Make key technical decisions amid ambiguity and mitigate risk through a shift-left strategy to accelerate time to market

Requirements

  • BS or MS in Computer Engineering, Computer Science, or related field, or equivalent experience
  • 16+ years of experience in system architecture and design
  • Deep expertise in designing scalable, performant server systems at the software/hardware interface
  • Experience with complex system software for accelerators (GPUs, DPUs, FPGAs)
  • Expertise in out-of-band and in-band management architectures
  • Knowledge of device management protocols such as MCTP, PLDM, and RDE
  • Knowledge of system management protocols such as Redfish and IPMI
  • Experience implementing a shift-left strategy to de-risk program execution
  • Excellent written and verbal communication skills

Ways to Stand Out

  • Knowledge of cloud and cluster-level deployment and management systems
  • Contributions to standards bodies like OCP and DMTF
  • Familiarity with CXL, UCIe, and other chip-to-chip technology architectures
  • Knowledge of storage and networking technologies

Benefits

  • Base salary range: 308,000 USD - 471,500 USD
  • Eligibility for equity and benefits
  • Committed to diversity and equal opportunity employment