Senior Software Architect, AI and HPC

at Nvidia
USD 184,000-356,500 per year
SENIOR
✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Software Development @ 7 TensorFlow @ 4 Communication @ 4 Networking @ 4 Parallel Programming @ 6 PyTorch @ 4 CUDA @ 4

Details

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. The software architecture group has openings for software architects in AI, high-performance networking, and system software. The team researches, develops, and deploys solutions in networking hardware, programming environments, and system software to make current and future high-end computer systems more performant, scalable, and usable.

Responsibilities

  • Create proofs-of-concept to evaluate and motivate extensions in AI frameworks (PyTorch / NeMo), HPC programming models (MPI, OpenSHMEM, PGAS), new runtime designs, and new network hardware features.
  • Research, design, and implement features for AI and HPC communication middleware (NCCL, Open MPI, UCX, UCC, NVSHMEM) and deep learning frameworks such as TensorFlow and PyTorch.
  • Review, design, and implement features to enhance compiler capabilities to support the NVIDIA networking ecosystem (including Clang/LLVM and NVIDIA compilers).
  • Research, design, and develop hardware-related features relevant to scientific, deep learning, and data-intensive workloads.
  • Collaborate and communicate effectively across a multi-national, multi-time-zone corporate environment.

Requirements

  • Ph.D. or Master in computer science, computer engineering, or a closely related field, or equivalent experience.
  • 5+ years of experience in parallel programming models and/or network architecture.
  • Background in algorithm design, system programming, and computer architecture.
  • Strong programming and software development skills.
  • Ability and flexibility to work and communicate effectively in a multi-national, multi-time-zone corporate environment.
  • Strong collaborative and interpersonal skills; proven ability to guide and influence within a dynamic matrix environment.
  • Deep understanding of technology and passion for the work.

Ways to Stand Out

  • Experience designing communication middleware for HPC systems, including InfiniBand, DPUs, Ethernet, and shared-memory approaches.
  • Experience developing and implementing compiler features and optimizations, particularly Clang/LLVM and NVIDIA compilers.
  • Experience implementing communications libraries such as MPI, OpenSHMEM, NCCL, NVSHMEM, UCX, UCC, or PGAS.
  • Background with CUDA programming and NVIDIA GPUs.
  • Experience with programming models for emerging architectures including hierarchical heterogeneous memory systems and accelerators.

Compensation and Benefits

  • Base salary ranges (determined by location, experience, and comparison with similar roles):
    • Level 4: 184,000 USD - 287,500 USD
    • Level 5: 224,000 USD - 356,500 USD
  • Eligible for equity and additional benefits (see NVIDIA benefits).

Other

  • Applications for this job will be accepted at least until August 2, 2025.
  • NVIDIA is an equal opportunity employer and committed to fostering a diverse work environment.