Senior Software Engineer - CSP Engagements
Required Skills & Competences
Marketing @ 4, Software Development @ 4, Kubernetes @ 7, Linux @ 4, Python @ 7, Communication @ 4, Performance Optimization @ 4, Debugging @ 4, CUDA @ 4, GPU @ 4
Details
NVIDIA is seeking a Senior Software Engineer to join our CSP Engagements team, focusing on system software for Datacenter products such as GB200. This role combines deep technical expertise in embedded firmware, Linux kernel development, and middleware development, with customer-facing responsibilities to enable cloud service providers with next-generation computing platforms. You will work at the intersection of hardware and software, driving technical solutions from concept through deployment.
Responsibilities
- Design and develop software solutions for data center servers including Linux kernel modifications, device drivers, and system optimizations for GB200 and next-gen platforms.
- Lead hardware bring-up activities, BSP development, and hardware-software co-design for Cloud Service Provider deployments.
- Partner directly with CSPs to deliver technical solutions, co-develop & co-debug features and optimizations, and provide support during new product introductions.
- Collaborate with cross-functional teams in designing end-to-end solutions spanning firmware, OS, middleware, and applications with focus on AI/ML and HPC workloads.
- Perform advanced system debugging, root cause analysis, and performance optimization for large-scale data center environments.
- Collaborate with AE, FAE, and Solution Architect teams to deliver integrated customer solutions and technical documentation.
Requirements
- Deep expertise in data center server architectures, HPC systems, and hardware-software co-design.
- Expert knowledge of Linux kernel internals, device drivers, and communication protocols (PCIe, USB, Ethernet).
- Deep understanding of computer architecture, microprocessor concepts, and expert knowledge of ARM (aarch64) and x86 architectures.
- Deep understanding of NUMA architectures including memory topology, processor-memory locality, and performance optimization for multi-CPU systems in data center environments.
- Strong programming skills in C/C++ and Python, plus experience with virtualization, Kubernetes, and cloud-native architectures.
- Skilled in complex system-level debugging, performance analysis, and test design.
- BS or MS in Computer Engineering, Computer Science, or related field (or equivalent experience).
- 8-12 years of system software development experience.
Ways to stand out from the crowd
- Experience with GPU computing (CUDA) and deep learning workloads.
- Expertise in Out of Band and In-band management architectures.
- Knowledge of Memory fabric and CXL architectures.
NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. If you're creative, hardworking and self-motivated, we want to hear from you!
Do you want to join a team of highly motivated and experienced program managers who drive the successful introduction of NVIDIA's next generation GPU/CPU based products? We work closely with internal leaders in Software, Hardware, Firmware, Marketing and Operations to ensure the SW team delivers outstanding products while operating across multiple functional units and all levels of management to achieve Time-To-Market. As part of the team, your knowledge of driver, firmware, diagnostics and the SW stack development processes and priorities will enable you to swiftly make the course adjustments needed to keep these complex projects on track!