Solutions Architect, AI Hyperscalers

at Nvidia
USD 148,000-287,500 per year
MIDDLE
✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Docker @ 3 Kubernetes @ 3 Linux @ 3 Python @ 5 Machine Learning @ 6 Data Science @ 6 Hiring @ 3 Communication @ 3 Parallel Programming @ 3 PyTorch @ 3 CUDA @ 3 GPU @ 3

Details

NVIDIA is seeking an AI/ML Solutions Architect focused on Hyperscale customers and Cloud Service Providers. The role leads technical customer engagement for AI training, inference, and infrastructure deployed at vast scale. You will work across NVIDIA and customer organizations to ensure successful, production-ready deployments, automation, and large-scale AI infrastructure optimization.

Responsibilities

  • Serve as the primary technical contact for NVIDIA products for internet giants and cloud providers, enabling AI/ML software infrastructure at hyperscale.
  • Work directly with customer engineering teams to secure design wins, address technical challenges, bring solutions to production, and provide lifecycle support.
  • Understand customer environments, constraints, and long-term strategy, and translate those into product requirements and solutions.
  • Provide feedback to NVIDIA for product improvements and help customers enhance the value of NVIDIA technology.
  • Facilitate resolution of customer issues with timely and proactive communications to mitigate risks.
  • Lead workshops, demos, and proof-of-concepts to showcase NVIDIA's AI/ML capabilities.
  • Guide customers on processes for scalable AI model deployment and inference optimization.

Requirements

  • BS/MS in Computer Science, Electrical Engineering, or equivalent experience.
  • At least 5+ years of engineering experience with a proven track record in AI/ML-focused projects or enterprise-grade solutions.
  • Proven understanding of Linux, including troubleshooting, optimization, and customization for AI/ML workloads.
  • Strong understanding of data science and machine learning infrastructure (software and hardware).
  • Professional-level communication skills; ability to tailor messages for varying technical audiences and remain composed under pressure.
  • Excellent follow-up and interpersonal skills; strong problem-solving passion.
  • Proficient in Python for scripting and building custom tools.
  • Experience with parallel programming or GPU acceleration (e.g., CUDA) is helpful.

Ways to stand out

  • Experience with chatbots, RAG pipelines, and vector databases.
  • Experience with distributed training or inference workloads and multi-node GPU clusters.
  • Background in HPC environments for AI/ML applications.
  • Experience developing in cloud and/or virtualized environments and building containerized solutions (Docker, Kubernetes).
  • Experience with common deep learning frameworks such as PyTorch or JAX.

Compensation & Benefits

  • Base salary ranges provided by level: Level 3 — 148,000 USD to 235,750 USD; Level 4 — 184,000 USD to 287,500 USD. Final base salary depends on location, experience, and internal pay equity.
  • Eligible for equity and benefits.

Additional information

  • Location: Santa Clara, CA, United States (onsite).
  • Applications accepted at least until July 29, 2025.
  • NVIDIA is an equal opportunity employer and values diversity in hiring and promotion practices.