Senior Software Architect, AI and HPC

at NVIDIA
USD 184,000-356,500 per year
SENIOR
✅ On-site

Required Skills & Competences

  • Software Development @ 7
  • TensorFlow @ 4
  • Hiring @ 4
  • Communication @ 4
  • Networking @ 4
  • Parallel Programming @ 6
  • PyTorch @ 4
  • CUDA @ 4

Details

NVIDIA is hiring a Senior Software Architect in the field of AI and high-performance computing (HPC). The role is within the software architecture group working on AI, high-performance networking, and system software. The team researches, develops, and deploys solutions in networking hardware, programming environments, and system software to make current and future high-end computer systems more performant, scalable, and usable.

Responsibilities

  • Create proofs-of-concept to evaluate and motivate extensions in AI frameworks (PyTorch / NeMo), HPC programming models (MPI, OpenSHMEM, PGAS), new runtime designs, and new network hardware features.
  • Research, design, and implement features for AI and HPC communication middleware (NCCL, Open MPI, UCX, UCC, NVSHMEM) and deep learning frameworks such as TensorFlow and PyTorch.
  • Review, design, and implement features to enhance compiler functionality to support the NVIDIA networking ecosystem.
  • Research, design, and develop hardware features relevant to scientific, deep learning, and data-intensive workloads.
  • Collaborate across multi-national, multi-time-zone teams and influence design and implementation within a matrix environment.

Requirements

  • Ph.D. or Master's degree in computer science, computer engineering, or a closely related field, or equivalent experience.
  • 5+ years of experience in parallel programming models and/or network architecture.
  • Background in algorithm design, system programming, and computer architecture.
  • Strong programming and software development skills.
  • Ability to work and communicate effectively in a multi-national, multi-time-zone corporate environment.
  • Strong collaborative and interpersonal skills, with demonstrated ability to guide and influence in a dynamic matrix organization.
  • Deep understanding of technology and passion for the domain.

Ways to stand out

  • Experience designing communication middleware for HPC systems (InfiniBand, DPUs, Ethernet, Shared Memory).
  • Experience developing compiler features or optimizations (Clang/LLVM and NVIDIA compilers).
  • Experience implementing communications libraries (MPI, OpenSHMEM, NCCL, NVSHMEM, UCX, UCC, PGAS).
  • Background with CUDA programming and NVIDIA GPUs.
  • Experience with programming models for hierarchical heterogeneous memory systems and accelerators.

Compensation & Benefits

  • Base salary range (Level 4): 184,000 USD - 287,500 USD.
  • Base salary range (Level 5): 224,000 USD - 356,500 USD.
  • You will also be eligible for equity and benefits.
  • Applications are accepted at least until August 2, 2025.

Location

  • Santa Clara, California, United States.

Equal Opportunity

NVIDIA is committed to fostering a diverse work environment and is an equal opportunity employer. The company does not discriminate on the basis of protected characteristics.