Solutions Architect, Applied AI

at Nvidia

📍 Santa Clara, United States

USD 148,000-235,800 per year

MIDDLE

✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ^?

Kubernetes @ 3 Linux @ 5 Python @ 5 Data Science @ 3 TensorFlow @ 5 Mathematics @ 3 Debugging @ 5 API @ 3 PyTorch @ 5 GPU @ 3

Details

Do you want to be part of the team that brings Artificial Intelligence (AI) technology to the field? We are looking for a Solution Architect (SA) or Data Scientist to join the Applied AI SA Segment team. We specialize on the newest technology and advances in deep learning, Generative AI, and Cloud. The vision of the AI Segment team is to use our deep expertise to guide and enable the successful adoption at data center scale of NVIDIA AI Enterprise Software!

If you are passionate about AI and how it can be applied to solve real-world problems, we should talk. NVIDIA is the world leader in GPU accelerated computing and AI, and is looking for developers like you to design and build enterprise AI solutions using our newest technology. As a member of the NVAIE Segment Solution Architecture team, you will work closely with customers and partners to tackle hard problems in customizing and deploying AI workloads in production at scale.

Responsibilities

Develop end-to-end AI solutions for enterprise use cases and help customers adopt NVIDIA AI SDKs and APIs.
Design GPU-accelerated pipelines that optimize compute resource utilization and improve workload performance.
Build solutions using deep learning technologies including language and multimodal models, information retrieval, domain customization, reinforcement learning, reasoning, inferencing, agentic systems, and other sophisticated AI workloads.
Create reference architectures for deploying and optimizing workloads at large scale and collaborate across industries to address scaling challenges.
Contribute to product engineering, deliver hands-on training, and share expert knowledge across the organization and community.
Act as a trusted technical advisor to customers and partners, collaborating with Solution Architects, Product, Engineering and Research teams.

Requirements

BS, MS, or Ph.D. in Engineering, Mathematics, Physics, Computer Science, Data Science, or a similar field (or equivalent experience).
5+ years experience using deep learning frameworks and libraries such as PyTorch, TensorFlow/Keras, Hugging Face Transformers, Megatron-LM, and DeepSpeed.
Expertise running deep learning jobs on GPUs using SLURM and Kubernetes.
5+ years experience with Python and Linux; demonstrated coding and debugging skills.
Hands-on experience customizing AI models (distillation, pre-training, supervised fine-tuning, reinforcement learning, reasoning, evaluation, guard railing, data curation).
Demonstrated expertise in accuracy and performance profiling and optimization for AI training and inference workloads.
Ability to learn fast, adapt to change, and communicate clearly (written and oral) to collaborate effectively with executives and engineering teams.

Ways to stand out

Background with NVIDIA AI Enterprise software, with emphasis on NeMo.
Experience training foundational models and working on high-performance NVIDIA GPU computing clusters.
Extensive engineering and customer experience on projects with multiple collaborators.
Willingness and ability to dig into unfamiliar territories to solve complex problems.

Compensation & Benefits

Base salary range: 148,000 USD - 235,750 USD (determined based on location, experience, and pay of employees in similar positions).
Eligible for equity and benefits (see NVIDIA benefits).

Other details

Applications accepted at least until September 6, 2025.
NVIDIA is an equal opportunity employer and committed to fostering a diverse work environment.