Senior Solutions Architect, GPU - Cloud Service Providers
at Nvidia
π Santa Clara, United States
USD 184,000-356,500 per year
SCRAPED
Used Tools & Technologies
Not specified
Required Skills & Competences ?
Docker @ 4 Kubernetes @ 4 DevOps @ 4 GCP @ 4 MLOps @ 4 AWS @ 4 Azure @ 4 Communication @ 4 Networking @ 7 Debugging @ 4 LLM @ 4 PyTorch @ 4 GPU @ 4Details
Join NVIDIA's Solutions Architecture team to help bring AI solutions to strategic cloud service provider customers. You will work with large-scale customers to design, demonstrate, and support AI/ML and HPC software solutions that leverage NVIDIA hardware and software, focusing on performance, scalability, and cloud integration.
Responsibilities
- Work with hyperscalers and tech giants to develop and demonstrate solutions based on NVIDIA software and hardware.
- Partner with Sales Account Managers and Developer Relations Managers to identify and secure business opportunities for NVIDIA products and solutions.
- Serve as the primary technical point of contact for customers building complex AI infrastructure, supporting performance aspects for large-scale LLM training and inference.
- Run regular technical customer meetings covering project/product details, feature discussions, technology introductions, performance advice, and debugging.
- Collaborate with customers to build Proofs of Concept (PoCs) addressing critical business needs and supporting cloud service integration of NVIDIA technology on hyperscalers.
- Analyze and develop solutions for customer performance issues related to AI workloads and overall systems performance.
Requirements
- BS/MS/PhD in Electrical/Computer Engineering, Computer Science, Physics, or other engineering field, or equivalent experience.
- 8+ years of engineering experience (performance/system/solution engineering).
- Hands-on experience building performance benchmarks for data center systems, including large-scale AI training and inference.
- Strong understanding of systems architecture, including AI accelerators and networking as they relate to application performance.
- Effective engineering program management skills and the ability to balance multiple tasks.
- Excellent written and verbal communication skills, including experience producing documents and presentations for external customer-facing environments.
Preferred / Ways to Stand Out
- Hands-on experience with deep learning frameworks such as PyTorch and JAX.
- Experience with compilers and tooling like Triton and XLA.
- Familiarity with NVIDIA libraries and toolkits (TensorRT, NeMo, NCCL, RAPIDS, and similar).
- Knowledge of deep learning architectures and recent LLM developments.
- Background with NVIDIA hardware/software, performance tuning, and error diagnostics.
- Experience with GPU systems including performance testing, tuning, and benchmarking.
- Experience deploying solutions in cloud environments (AWS, GCP, Azure, OCI) and knowledge of DevOps/MLOps technologies (Docker/containers, Kubernetes, data center deployments).
- Command-line proficiency.
Compensation & Benefits
- Base salary ranges (location and level dependent):
- Level 4: 184,000 USD - 287,500 USD per year
- Level 5: 224,000 USD - 356,500 USD per year
- Eligible for equity and additional benefits (see NVIDIA benefits page).
Other Information
- Location: Santa Clara, CA, United States.
- Employment type: Full time.
- Applications accepted at least until August 25, 2025.
- NVIDIA is an equal opportunity employer and is committed to fostering a diverse work environment.