Solutions Architect, AI Hyperscalers

at Nvidia

📍 Santa Clara, United States

USD 148,000-287,500 per year

MIDDLE SENIOR

✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ^?

Docker @ 3 Kubernetes @ 3 Linux @ 6 Python @ 5 Machine Learning @ 6 Data Science @ 6 Communication @ 3 Parallel Programming @ 3 PyTorch @ 3 CUDA @ 3 GPU @ 3

Details

NVIDIA is searching for an AI/ML Solutions Architect focused on Hyperscale customers and Cloud Service Providers. The role leads software customer technical engagement for AI training, inference and infrastructure deployed at vast scale. You will work across multiple organizations within NVIDIA as well as at customer sites to ensure successful, trouble-free deployments. The role involves optimization and characterization of customer-specific AI models and pipelines, building automation and management for large-scale AI infrastructure, and guiding customers on scalable deployment and inference optimization.

Responsibilities

Serve as the main technical point of contact for NVIDIA products for internet giants and cloud providers, enabling AI/ML software infrastructure at scale.
Work directly with customer engineering teams to secure design wins, address challenges, bring solutions to production, and support them through lifecycle.
Become a trusted advisor by understanding customer environments, constraints, and long-term strategy; translate insights into product requirements and solutions.
Provide feedback to NVIDIA for future product improvements based on customer deployments.
Facilitate resolution of customer issues with timely and proactive communication to mitigate risks.
Lead workshops, demos, and proof-of-concepts to showcase NVIDIA's AI/ML capabilities.
Guide customers on standard processes for scalable AI model deployment and inference optimization.

Requirements

Minimum of a BS/MS in Computer Science, Electrical Engineering, or equivalent experience.
At least 5+ years of engineering experience with a proven track record in AI/ML-focused projects or enterprise-grade solutions.
Strong understanding of Linux, including troubleshooting, optimization, and customization for AI/ML workloads.
Strong understanding of data science and machine learning infrastructure (both software and hardware).
Professional-level communication skills and the ability to tailor messages for varying technical audiences.
Excellent follow-up and interpersonal skills, with a passion for problem-solving.
Proficient in Python for scripting and building custom tools.
Experience with parallel programming or GPU acceleration (e.g., CUDA) is helpful.

Ways to stand out

Experience with chatbots, RAG (retrieval-augmented generation) pipelines, and vector databases.
Experience with distributed training or inference workloads and multi-node GPU clusters.
Background in HPC (High Performance Computing) environments for AI/ML applications.
Experience developing in cloud/virtualized environments and building containerized solutions (Docker, Kubernetes).
Experience with common deep learning frameworks such as PyTorch or JAX.

Compensation & Benefits

Base salary ranges by level:
- Level 3: 148,000 USD - 235,750 USD
- Level 4: 184,000 USD - 287,500 USD
Eligible for equity and company benefits.

Additional details

Location: Santa Clara, CA (US)
Employment type: Full time
Applications accepted at least until July 29, 2025
NVIDIA is an equal opportunity employer committed to diversity and inclusion.