Senior Solutions Architect, HPC and Generative AI Deployment
at Nvidia
π Santa Clara, United States
USD 184,000-287,500 per year
SCRAPED
Used Tools & Technologies
Not specified
Required Skills & Competences ?
Kubernetes @ 4 Python @ 7 Communication @ 4 Mathematics @ 4 Parallel Programming @ 4 Debugging @ 4 PyTorch @ 7 CUDA @ 4 GPU @ 4Details
NVIDIA is seeking outstanding Solutions Architects to assist and support customers that are building solutions with our newest High Performance Computing (HPC) and Artificial Intelligence (AI) technologies. You will become a trusted technical advisor with our customers and work on projects focused on HPC and Generative AI (GenAI), collaborating with scientific researchers and developers at universities and research institutions. This role requires experience with HPC, GenAI and GPU technologies and offers the opportunity to work in an interdisciplinary team at NVIDIA.
Responsibilities
- Partner with other solution architects, engineering, product and business teams to understand strategies and technical needs and help define high-value solutions
- Dynamically engage with developers, scientific researchers, and data scientists across a range of technical areas
- Strategically partner with lighthouse customers and researchers to help them adopt and build creative solutions using NVIDIA technology
- Analyze performance and power efficiency of AI workloads on Kubernetes
- Travel to conferences and customer sites as required
Requirements
- BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering or related fields (or equivalent experience)
- 8+ years of hands-on experience with accelerated computing and deep learning frameworks such as PyTorch
- Experience porting and/or optimizing scientific applications targeting GPUs
- Strong fundamentals in programming and software design, especially in Python and C++
- Experience with containerization and orchestration technologies, monitoring, and observability solutions for AI deployments
- Excellent knowledge of theory and practice of AI at scale
- Excellent presentation, communication and collaboration skills
Ways to Stand Out
- Experience with NVIDIA GPUs and parallel programming libraries such as CUDA, OpenMP, OpenACC, and communication libraries and runtimes (MPI, NCCL, UCX, NVSHMEM)
- Excellent C/C++ programming skills, including debugging, profiling, code optimization, performance analysis, and test design
- Experience working with the academic research community supporting HPC or AI
- Familiarity with distributed computing platforms, containers and scheduling tools
- Prior experience with deep learning training at scale and deploying or optimizing DL inference in production
Compensation & Other Details
- Base salary range: 184,000 USD - 287,500 USD (determined based on location, experience, and comparable pay)
- Eligible for equity and benefits
- Applications accepted at least until July 29, 2025
- NVIDIA is an equal opportunity employer and values diversity; does not discriminate based on protected characteristics