Solutions Architect, Generative AI

at Nvidia

📍 Santa Clara, United States

USD 148,000-235,800 per year

MIDDLE

✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ^?

Python @ 6 MLOps @ 2 TensorFlow @ 6 Hiring @ 3 Leadership @ 3 Communication @ 3 Technical Leadership @ 3 LLM @ 3 PyTorch @ 6 CUDA @ 3 GPU @ 3 LLMOps @ 2

Details

NVIDIA is hiring a Solutions Architect focused on Generative AI to enable partners by building proof-of-concept solutions, reference architectures, and production-grade AI workflows. This role combines strategic technical leadership with hands-on development to demonstrate and scale NVIDIA's accelerated Generative AI platforms (GPU systems, CUDA, NeMo, Triton) and to help partners deploy agentic and generative AI applications.

Responsibilities

Serve as the primary technical domain expert for pre- and post-sale partner engagements; embed with partners to design and deploy Generative AI solutions and maintain strong relationships with partner and customer leadership and technical teams.
Accelerate partner/customer time to value by providing repeatable reference architecture guidance, building hands-on prototypes and proofs-of-concept, and advising on methodologies for scaling solutions to production.
Define scope, success metrics, and evaluation criteria for partner-led projects, ensuring standardized and reproducible GPU-accelerated workflows.
Enable strategic partners to launch Professional Services and platforms by tailoring NVIDIA agentic AI blueprints for high-impact workloads and proactively drive deeper adoption of NVIDIA GenAI products.
Codify knowledge and operationalize technical success practices to help partners scale impact across industries and workloads.

Requirements

MSc or PhD in Computer Science, Electrical Engineering, Software Engineering, ML Engineering, or a related field (or equivalent experience).
5+ years of relevant experience developing and deploying AI models at scale as a Software Engineer or Deep Learning Engineer.
Proven track record building enterprise-grade agentic AI systems using open-source models, with a solid foundation in deep learning and generative models.
Hands-on experience with LLM and agentic frameworks such as NeMo Agent Toolkit, LangChain, Semantic Kernel, Crew.ai, AutoGen, and with evaluation and observability platforms; comfortable building prototypes and proofs-of-concept.
Strong coding proficiency in Python and C++ and experience with deep learning frameworks (PyTorch or TensorFlow).
Excellent communication and presentation skills for collaboration with internal executives, partners, and customers.

Ways to stand out

Demonstrated hands-on experience with NVIDIA AI platforms.
Understanding of advanced agent architectures and emerging communication protocols (e.g., MCP, Google A2A).
Practical expertise in Generative AI and LLM development, including ability to train models such as GPT and Megatron.
Familiarity with MLOps lifecycle management and LLMOps workflows.
Experience with CUDA programming, benchmarking, and analyzing performance of foundation models.

Compensation & Benefits

Base salary range: 148,000 USD - 235,750 USD (determined by location, experience, and comparable pay).
Eligible for equity and benefits (see NVIDIA benefits page).
Applications accepted at least until August 14, 2025.

Diversity & Inclusion

NVIDIA is an equal opportunity employer and committed to fostering a diverse work environment; they do not discriminate based on protected characteristics.