Senior Solutions Architect, GPU - Cloud Service Providers

at Nvidia

📍 Santa Clara, United States

USD 184,000-356,500 per year

SENIOR

✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ^?

Docker @ 7 Kubernetes @ 7 DevOps @ 7 GCP @ 4 MLOps @ 7 AWS @ 4 Azure @ 4 Networking @ 4 Debugging @ 4 LLM @ 4 PyTorch @ 4 GPU @ 4

Details

Join our team at NVIDIA and help bring AI solutions to our largest customers. We are seeking an expert Solutions Architect to assist customers in building AI/ML and HPC software solutions at scale. As a member of our Solutions Architecture team, you will collaborate with strategic customers, providing end-to-end technology solutions and technical support based on our product strategy.

Responsibilities

Work with tech giants to develop and demonstrate solutions based on NVIDIA’s software and hardware technologies.
Partner with Sales Account Managers and Developer Relations Managers to identify and secure business opportunities for NVIDIA products and solutions.
Serve as the main technical point of contact for customers engaged in the development of complex AI infrastructure; provide support on performance aspects for large scale LLM training and inference.
Conduct regular technical customer meetings for project/product details, feature discussions, introductions to new technologies, performance advice, and debugging sessions.
Collaborate with customers to build Proof of Concepts (PoCs) to address critical business needs and support cloud service integration for NVIDIA technology on hyperscalers.
Analyze and develop solutions for customer performance issues for both AI and systems performance.

Requirements

BS/MS/PhD in Electrical/Computer Engineering, Computer Science, Physics, or other Engineering fields or equivalent experience.
8+ years of engineering (performance/system/solution) experience.
Hands-on experience building performance benchmarks for data center systems, including large scale AI training and inference.
Understanding of systems architecture including AI accelerators and networking as it relates to application performance.
Effective engineering program management with the capability of balancing multiple tasks.
Ability to communicate ideas clearly through documents, presentations, and in external customer-facing environments.

Preferred / Ways to stand out

Hands-on experience with Deep Learning frameworks (PyTorch, JAX, etc.), compilers (Triton, XLA, etc.), and NVIDIA libraries (TRTLLM, TensorRT, Nemo, NCCL, RAPIDS, etc.).
Familiarity with deep learning architectures and the latest LLM developments.
Background with NVIDIA hardware and software, performance tuning, and error diagnostics.
Hands-on experience with GPU systems including performance testing, performance tuning, and benchmarking.
Experience deploying solutions in cloud environments including AWS, GCP, Azure, or OCI.
Knowledge of DevOps/MLOps technologies such as Docker/containers, Kubernetes, data center deployments, and strong command line proficiency.

Benefits

Base salary determined by location, experience, and pay of employees in similar positions.
Base salary ranges provided: 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.
Eligible for equity and additional benefits (see NVIDIA benefits page).

Applications for this job will be accepted at least until August 25, 2025.

NVIDIA is committed to fostering a diverse work environment and is an equal opportunity employer.