Senior Software Architect, AI and HPC

at NVIDIA

📍 Santa Clara, United States

USD 184,000-356,500 per year

SENIOR

✅ On-site

Required Skills & Competences

Software Development @ 7
Parallel Programming @ 6
TensorFlow @ 4
PyTorch @ 4
CUDA @ 4
Networking @ 4
Communication @ 4
Hiring @ 4

Details

NVIDIA is hiring a Senior Software Architect in the field of AI and high-performance computing (HPC). The role is within the software architecture group working on AI, high-performance networking, and system software. The team researches, develops, and deploys solutions in networking hardware, programming environments, and system software to make current and future high-end computer systems more performant, scalable, and usable.

Responsibilities

Create proofs of concept to evaluate and motivate extensions in AI frameworks (PyTorch / NeMo), HPC programming models (MPI, OpenSHMEM, PGAS), new runtime designs, and new network hardware features.
Research, design, and implement features for AI and HPC communication middleware (NCCL, Open MPI, UCX, UCC, NVSHMEM) and deep learning frameworks such as TensorFlow and PyTorch.
Review, design, and implement features to enhance compiler functionality to support the NVIDIA networking ecosystem.
Research, design, and develop hardware features relevant to scientific, deep learning, and data-intensive workloads.
Collaborate across multi-national, multi-time-zone teams and influence design and implementation within a matrix environment.

Requirements

Ph.D. or Master's degree in computer science, computer engineering, or a closely related field, or equivalent experience.
5+ years of experience in parallel programming models and/or network architecture.
Background in algorithm design, system programming, and computer architecture.
Strong programming and software development skills.
Ability to work and communicate effectively in a multi-national, multi-time-zone corporate environment.
Strong collaborative and interpersonal skills, with demonstrated ability to guide and influence in a dynamic matrix organization.
Deep understanding of technology and passion for the domain.

Ways to stand out

Experience designing communication middleware for HPC systems (InfiniBand, DPUs, Ethernet, Shared Memory).
Experience developing compiler features or optimizations (Clang/LLVM and NVIDIA compilers).
Experience implementing communications libraries (MPI, OpenSHMEM, NCCL, NVSHMEM, UCX, UCC, PGAS).
Background with CUDA programming and NVIDIA GPUs.
Experience with programming models for hierarchical heterogeneous memory systems and accelerators.

Compensation & Benefits

Base salary range (Level 4): 184,000 USD - 287,500 USD.
Base salary range (Level 5): 224,000 USD - 356,500 USD.
You will also be eligible for equity and benefits. Applications accepted at least until August 2, 2025.

Location

Santa Clara, California, United States.

Equal Opportunity

NVIDIA is committed to fostering a diverse work environment and is an equal opportunity employer. The company does not discriminate on the basis of protected characteristics.