Senior Platform Software Engineer, PCIe

at NVIDIA
USD 148,000-287,500 per year
SENIOR
On-site

Required Skills & Competences

  • Security @ 4
  • Linux @ 4
  • Communication @ 7
  • Performance Optimization @ 4
  • Debugging @ 4
  • System Architecture @ 4
  • Customer Support @ 4
  • GPU @ 4

Details

NVIDIA pioneered the GPU, revolutionizing computer graphics and parallel computing, and now leads in AI computing with its GH200 superchip for HPC and generative AI workloads. The role focuses on designing rack-level solutions for next-generation scalable AI supercomputing platforms.

Responsibilities

  • Drive innovations for GPU-based AI servers focusing on PCIe architecture, system engineering, and software/firmware changes related to processor and I/O architecture.
  • Define system architecture to optimize I/O performance for various GPU applications.
  • Debug complex system issues involving GPU, I/O bus (PCIe, etc.), and CPU.
  • Architect complex systems with emphasis on I/O error handling, fault management, and resiliency for data center requirements.
  • Identify gaps in platform debuggability and improve issue resolution speed and accuracy.
  • Discover and incorporate new technologies to enhance performance, functionality, uptime, security, and reliability of GPU systems.
  • Work across industry standards to enable new technologies in AI servers efficiently.
  • Contribute to all product development phases, from definition, architecture, design, implementation, debugging, and testing through early customer support.

Requirements

  • Deep knowledge of server architecture, CPU design, PCI Express, and CXL at the platform level for enterprise systems.
  • Expertise in PCI Express, error handling (RAS), performance optimization, and the Linux kernel.
  • Familiarity with PCIe switches, retimers, related firmware, or configuration.
  • Strong understanding of memory architecture focusing on Memory RAS.
  • Proven experience delivering high-end enterprise servers from definition to deployment.
  • Experience modifying UEFI BIOS and Linux kernel source, scripting for debug automation, and C/C++ development and debugging in Linux environments.
  • Excellent communication skills, a strong work ethic, a collaborative and quality-driven approach, and a self-starting attitude with creative problem-solving capabilities.
  • Bachelor’s degree in Electrical Engineering or Computer Science or equivalent experience.
  • At least 7 years of individual contributor experience.

Ways to Stand Out

  • Expertise in debugging complex, time-critical issues in development and production.
  • Experience with both x86 and ARM architectures.

NVIDIA offers a forward-thinking environment with talented professionals and encourages creative and autonomous individuals to apply.

Salary and Benefits

  • Base salary range: 148,000 USD - 287,500 USD per year, dependent on location, experience, and peer pay.
  • Eligibility for equity and benefits.
  • Applications are accepted on an ongoing basis.

NVIDIA is committed to diversity and equal opportunity without discrimination based on protected characteristics.