Solutions Architect, AI Hyperscalers

at NVIDIA
USD 148,000-287,500 per year
MIDDLE SENIOR
✅ On-site

Used Tools & Technologies

Not specified

Required Skills & Competences

  • Docker @ 3
  • Kubernetes @ 3
  • Linux @ 3
  • Python @ 5
  • Machine Learning @ 6
  • Data Science @ 6
  • Communication @ 3
  • Parallel Programming @ 3
  • PyTorch @ 3
  • CUDA @ 3
  • GPU @ 3

Details

NVIDIA is seeking an AI/ML Solutions Architect focusing on Hyperscale customers and Cloud Service Providers. The role leads software customer technical engagement for AI training, inference, and infrastructure deployed at vast scale. You will work across multiple organizations within NVIDIA and with customers to ensure successful deployments, automation, optimization, and characterization of customer-specific AI models and pipelines.

Responsibilities

  • Serve as the main technical point of contact for NVIDIA products for internet giants and cloud providers, enabling AI/ML software infrastructure at scale.
  • Work directly with engineering teams to secure design wins, address challenges, bring solutions to production, and support them throughout the lifecycle.
  • Become a trusted advisor by understanding customer environments, constraints, and long-term strategy; translate insights into product requirements and solutions.
  • Provide feedback to NVIDIA for future product improvements and help customers enhance the value of NVIDIA technology.
  • Facilitate resolution of customer issues with timely and proactive communications to mitigate risks.
  • Lead workshops, demos, and proof-of-concepts to showcase NVIDIA's AI/ML capabilities.
  • Guide customers on standard processes for scalable AI model deployment and inference optimization.

Requirements

  • Minimum of a BS/MS in Computer Science, Electrical Engineering, or equivalent experience.
  • 4+ years of engineering experience with a proven track record in AI/ML-focused projects or enterprise-grade solutions.
  • Proven understanding of Linux, including troubleshooting, optimization, and customization for AI/ML workloads.
  • Strong understanding of data science and machine learning infrastructure (software and hardware).
  • Professional-level communication skills, with the ability to tailor messages for varying technical audiences and to remain composed under pressure.
  • Excellent follow-up and interpersonal skills, with a strong passion for problem-solving.
  • Proficient in Python, able to develop scripts and build custom tools.
  • Experience with parallel programming or GPU acceleration (e.g., CUDA) is helpful.
  • Eagerness to learn and apply new technologies.

Ways to stand out

  • Experience with chatbots, RAG pipelines, and vector databases.
  • Experience with distributed training or inference workloads.
  • Background in HPC (High Performance Computing) environments for AI/ML applications.
  • Familiarity with multi-node GPU clusters and performance tuning for large-scale AI workloads.
  • Experience developing in cloud and/or virtualized environments and containerized solutions; knowledge of Docker and Kubernetes.
  • Background with common deep learning frameworks such as PyTorch or JAX.

Compensation & Benefits

  • Base salary ranges (location/level dependent):
    • Level 3: 148,000 USD - 235,750 USD
    • Level 4: 184,000 USD - 287,500 USD
  • You will also be eligible for equity and benefits.

Application deadline

  • Applications for this job will be accepted at least until October 11, 2025.

Equal opportunity

  • NVIDIA is committed to fostering a diverse work environment and is an equal opportunity employer. They do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.