Solutions Architect, Generative AI

at NVIDIA

📍 Santa Clara, United States

USD 148,000-235,750 per year

MIDDLE

✅ On-site

Used Tools & Technologies

Not specified

Required Skills & Competences

Docker @ 3, Kubernetes @ 3, Linux @ 6, Python @ 6, Machine Learning @ 3, Data Science @ 3, TensorFlow @ 3, Bash @ 6, Communication @ 6, Mathematics @ 3, Microservices @ 3, Debugging @ 6, API @ 3, PyTorch @ 3, GPU @ 3

Details

Do you want to be part of the team that brings Artificial Intelligence (AI) technology to the field? The NVIDIA AI Enterprise (NVAIE) SA Segment team specializes in Machine Learning, Deep Learning, Generative AI, and Cloud. The team guides and enables data-center-scale adoption of NVIDIA AI Enterprise software and works with customers and partners to customize and deploy Generative AI workloads in production.

Responsibilities

Develop end-to-end Generative AI solutions for enterprise use cases, adopting NVIDIA AI SDKs and APIs and designing GPU-accelerated pipelines to optimize compute utilization and performance.
Build solutions using ML and DL technologies including language and multimodal models, information retrieval, domain customization, reasoning, inference optimization, agentic systems, and other Generative AI workloads.
Create reference architectures to deploy and optimize workloads at scale across multiple industries and improve NVIDIA products by addressing scaling challenges.
Contribute to the organization and community through product engineering input, hands-on training, and knowledge sharing.
Act as a trusted technical advisor and customer-facing expert working with Solution Architects, Product, Engineering, and Research teams.

Requirements

BS, MS, or Ph.D. in Engineering, Mathematics, Physics, Computer Science, Data Science, or similar (or equivalent experience).
5+ years of experience with a demonstrated track record in Deep Learning and Machine Learning.
Experience with GPUs and deep learning frameworks such as TensorFlow and PyTorch.
Strong coding, development, and debugging skills, including experience with Python, C/C++, Bash, and Linux.
Real-world development of large-scale Generative AI applications (information retrieval, model pre-/post-training, model and pipeline evaluation, inference optimization, guard-rails, agents, reasoning systems).
Experience with cluster orchestration tools including Docker, Kubernetes, and SLURM across cloud providers and on-premises.
Expertise in optimizing AI training and inference workloads over high-performance networks, including Ethernet and InfiniBand fabrics.
Ability to learn quickly and adapt to change; strong written and oral communication skills for collaboration with executives and engineering teams.

Preferred / Ways to Stand Out

Proven hands-on experience with NVIDIA AI products such as NIM and NVIDIA NeMo (NeMo Retriever, NeMo Microservices, NeMo Framework).
Expertise in NVIDIA Spectrum-X.
Experience with NVIDIA Collective Communication Library (NCCL).
Extensive engineering and customer engagement experience on multi-collaborator projects.
Willingness and ability to dig into unfamiliar territories to solve complex problems.

Other Details & Benefits

Employment type: Full time.
Location: Santa Clara, CA, United States (on-site).
Base salary range: 148,000 USD - 235,750 USD (determined based on location, experience, and comparable pay).
Eligible for equity and company benefits (link to benefits referenced in original posting).
Applications accepted at least until July 29, 2025.
NVIDIA is an equal opportunity employer committed to diversity and non-discrimination.