Solutions Architect, Generative AI

at Nvidia

📍 Santa Clara, United States

USD 148,000-235,800 per year

MIDDLE

✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ^?

Python @ 6 MLOps @ 3 TensorFlow @ 3 Leadership @ 6 Communication @ 3 LLM @ 3 PyTorch @ 3 CUDA @ 3 GPU @ 3 LLMOps @ 3

Details

NVIDIA is seeking an outstanding AI Engineer or Solutions Architect to join the Generative AI partner enablement team. In this role you will act as a strategic technical expert and a hands-on developer, building proof-of-concept solutions and reference architectures that demonstrate the NVIDIA accelerated Generative AI platforms. You will provide partners with technical blueprints and guidance to architect and deploy applications using NVIDIA's full AI stack, from GPU systems and CUDA to NeMo and Triton.

Responsibilities

Serve as the primary technical domain expert for pre- and post-sale partner engagements, embedding with partners to design and deploy Generative AI solutions.
Build hands-on prototypes and repeatable reference architectures to accelerate partner/customer time to value.
Define scope, success metrics, and evaluation criteria for partner-led customer projects, ensuring standardized, reproducible GPU-accelerated workflows.
Enable strategic partners to launch Professional Services and platforms by tailoring NVIDIA agentic AI blueprints for customer workloads.
Codify knowledge and operationalize technical success practices to help partners scale impact across industries and workloads.
Maintain strong relationships with partner leadership and technical teams to drive adoption and utilization of NVIDIA GenAI platforms.

Requirements

MSc or PhD in Computer Science, Electrical Engineering, or related fields (or equivalent experience).
5+ years of relevant work experience developing and deploying AI models at scale as a Software Engineer or deep learning engineer.
Track record of building enterprise-grade agentic AI systems and strong foundation in deep learning, particularly generative models.
Hands-on experience with LLM and agentic frameworks such as NeMo Agent Toolkit, LangChain, Semantic Kernel, Crew.ai, and AutoGen.
Experience with evaluation and observability platforms and comfortable building prototypes or proofs of concept.
Strong coding proficiency in Python and C++.
Experience with deep learning frameworks (PyTorch or TensorFlow).
Excellent communication and presentation skills to collaborate with internal executives, partners, and customers.

Ways to stand out

Demonstrated expertise and hands-on experience with NVIDIA AI platforms.
Understanding of advanced agent architectures and emerging communication protocols (e.g., MCP or Google A2A).
Practical knowledge of Generative AI and LLM development, including ability to train GPT and Megatron models.
Understanding of MLOps lifecycle management and experience with LLMOps workflows.
Experience with CUDA programming, benchmarking, and performance analysis of foundation models.

Compensation and benefits

Base salary range: 148,000 USD - 235,750 USD (determined by location, experience, and comparable roles).
Eligible for equity and company benefits (link to benefits provided in original posting).

Additional information

Applications for this job will be accepted at least until August 14, 2025.
NVIDIA is an equal opportunity employer and is committed to fostering a diverse work environment.