Used Tools & Technologies
Not specified
Required Skills & Competences ?
Software Development @ 7 TensorFlow @ 4 Hiring @ 4 Communication @ 4 Networking @ 4 Parallel Programming @ 6 PyTorch @ 4 CUDA @ 4Details
NVIDIA is hiring a Senior Software Architect in the field of AI and high-performance computing (HPC). The role is within the software architecture group working on AI, high-performance networking, and system software. The team researches, develops, and deploys solutions in networking hardware, programming environments, and system software to make current and future high-end computer systems more performant, scalable, and usable.
Responsibilities
- Create proofs-of-concept to evaluate and motivate extensions in AI frameworks (PyTorch / NeMo), HPC programming models (MPI, OpenSHMEM, PGAS), new runtime designs, and new network hardware features.
- Research, design, and implement features for AI and HPC communication middleware (NCCL, Open MPI, UCX, UCC, NVSHMEM) and deep learning frameworks such as TensorFlow and PyTorch.
- Review, design, and implement features to enhance compiler functionality to support the NVIDIA networking ecosystem.
- Research, design, and develop hardware features relevant to scientific, deep learning, and data-intensive workloads.
- Collaborate across multi-national, multi-time-zone teams and influence design and implementation within a matrix environment.
Requirements
- Ph.D. or Master's degree in computer science, computer engineering, or a closely related field, or equivalent experience.
- 5+ years of experience in parallel programming models and/or network architecture.
- Background in algorithm design, system programming, and computer architecture.
- Strong programming and software development skills.
- Ability to work and communicate effectively in a multi-national, multi-time-zone corporate environment.
- Strong collaborative and interpersonal skills, with demonstrated ability to guide and influence in a dynamic matrix organization.
- Deep understanding of technology and passion for the domain.
Ways to stand out
- Experience designing communication middleware for HPC systems (InfiniBand, DPUs, Ethernet, Shared Memory).
- Experience developing compiler features or optimizations (Clang/LLVM and NVIDIA compilers).
- Experience implementing communications libraries (MPI, OpenSHMEM, NCCL, NVSHMEM, UCX, UCC, PGAS).
- Background with CUDA programming and NVIDIA GPUs.
- Experience with programming models for hierarchical heterogeneous memory systems and accelerators.
Compensation & Benefits
- Base salary range (Level 4): 184,000 USD - 287,500 USD.
- Base salary range (Level 5): 224,000 USD - 356,500 USD.
- You will also be eligible for equity and benefits. Applications accepted at least until August 2, 2025.
Location
- Santa Clara, California, United States.
Equal Opportunity
NVIDIA is committed to fostering a diverse work environment and is an equal opportunity employer. The company does not discriminate on the basis of protected characteristics.