Senior Solutions Architect, HPC and Generative AI Deployment

at Nvidia

📍 Santa Clara, United States

USD 184,000-287,500 per year

SENIOR

✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ^?

Kubernetes @ 4 Python @ 7 Communication @ 4 Mathematics @ 4 Parallel Programming @ 4 Debugging @ 4 PyTorch @ 7 CUDA @ 4 GPU @ 4

Details

NVIDIA is seeking outstanding Solutions Architects to assist and support customers that are building solutions with our newest High Performance Computing (HPC) and Artificial Intelligence (AI) technologies. You will become a trusted technical advisor with our customers and work on projects focused on HPC and Generative AI (GenAI), collaborating with scientific researchers and developers at universities and research institutions. This role requires experience with HPC, GenAI and GPU technologies and offers the opportunity to work in an interdisciplinary team at NVIDIA.

Responsibilities

Partner with other solution architects, engineering, product and business teams to understand strategies and technical needs and help define high-value solutions
Dynamically engage with developers, scientific researchers, and data scientists across a range of technical areas
Strategically partner with lighthouse customers and researchers to help them adopt and build creative solutions using NVIDIA technology
Analyze performance and power efficiency of AI workloads on Kubernetes
Travel to conferences and customer sites as required

Requirements

BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering or related fields (or equivalent experience)
8+ years of hands-on experience with accelerated computing and deep learning frameworks such as PyTorch
Experience porting and/or optimizing scientific applications targeting GPUs
Strong fundamentals in programming and software design, especially in Python and C++
Experience with containerization and orchestration technologies, monitoring, and observability solutions for AI deployments
Excellent knowledge of theory and practice of AI at scale
Excellent presentation, communication and collaboration skills

Ways to Stand Out

Experience with NVIDIA GPUs and parallel programming libraries such as CUDA, OpenMP, OpenACC, and communication libraries and runtimes (MPI, NCCL, UCX, NVSHMEM)
Excellent C/C++ programming skills, including debugging, profiling, code optimization, performance analysis, and test design
Experience working with the academic research community supporting HPC or AI
Familiarity with distributed computing platforms, containers and scheduling tools
Prior experience with deep learning training at scale and deploying or optimizing DL inference in production

Compensation & Other Details

Base salary range: 184,000 USD - 287,500 USD (determined based on location, experience, and comparable pay)
Eligible for equity and benefits
Applications accepted at least until July 29, 2025
NVIDIA is an equal opportunity employer and values diversity; does not discriminate based on protected characteristics