Senior Solutions Architect, GPU - Cloud Service Providers

at Nvidia
USD 184,000-356,500 per year
SENIOR
βœ… On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Docker @ 4 Kubernetes @ 4 DevOps @ 4 GCP @ 4 MLOps @ 4 AWS @ 4 Azure @ 4 Communication @ 4 Networking @ 7 Debugging @ 4 LLM @ 4 PyTorch @ 4 GPU @ 4

Details

Join NVIDIA's Solutions Architecture team to help bring AI solutions to strategic cloud service provider customers. You will work with large-scale customers to design, demonstrate, and support AI/ML and HPC software solutions that leverage NVIDIA hardware and software, focusing on performance, scalability, and cloud integration.

Responsibilities

  • Work with hyperscalers and tech giants to develop and demonstrate solutions based on NVIDIA software and hardware.
  • Partner with Sales Account Managers and Developer Relations Managers to identify and secure business opportunities for NVIDIA products and solutions.
  • Serve as the primary technical point of contact for customers building complex AI infrastructure, supporting performance aspects for large-scale LLM training and inference.
  • Run regular technical customer meetings covering project/product details, feature discussions, technology introductions, performance advice, and debugging.
  • Collaborate with customers to build Proofs of Concept (PoCs) addressing critical business needs and supporting cloud service integration of NVIDIA technology on hyperscalers.
  • Analyze and develop solutions for customer performance issues related to AI workloads and overall systems performance.

Requirements

  • BS/MS/PhD in Electrical/Computer Engineering, Computer Science, Physics, or other engineering field, or equivalent experience.
  • 8+ years of engineering experience (performance/system/solution engineering).
  • Hands-on experience building performance benchmarks for data center systems, including large-scale AI training and inference.
  • Strong understanding of systems architecture, including AI accelerators and networking as they relate to application performance.
  • Effective engineering program management skills and the ability to balance multiple tasks.
  • Excellent written and verbal communication skills, including experience producing documents and presentations for external customer-facing environments.

Preferred / Ways to Stand Out

  • Hands-on experience with deep learning frameworks such as PyTorch and JAX.
  • Experience with compilers and tooling like Triton and XLA.
  • Familiarity with NVIDIA libraries and toolkits (TensorRT, NeMo, NCCL, RAPIDS, and similar).
  • Knowledge of deep learning architectures and recent LLM developments.
  • Background with NVIDIA hardware/software, performance tuning, and error diagnostics.
  • Experience with GPU systems including performance testing, tuning, and benchmarking.
  • Experience deploying solutions in cloud environments (AWS, GCP, Azure, OCI) and knowledge of DevOps/MLOps technologies (Docker/containers, Kubernetes, data center deployments).
  • Command-line proficiency.

Compensation & Benefits

  • Base salary ranges (location and level dependent):
    • Level 4: 184,000 USD - 287,500 USD per year
    • Level 5: 224,000 USD - 356,500 USD per year
  • Eligible for equity and additional benefits (see NVIDIA benefits page).

Other Information

  • Location: Santa Clara, CA, United States.
  • Employment type: Full time.
  • Applications accepted at least until August 25, 2025.
  • NVIDIA is an equal opportunity employer and is committed to fostering a diverse work environment.