Solutions Architect, Generative AI
at NVIDIA
Santa Clara, United States
USD 148,000-235,750 per year
Required Skills & Competences
Python @ 6, MLOps @ 2, TensorFlow @ 6, Hiring @ 3, Leadership @ 3, Communication @ 3, Technical Leadership @ 3, LLM @ 3, PyTorch @ 6, CUDA @ 3, GPU @ 3, LLMOps @ 2
Details
NVIDIA is hiring a Solutions Architect focused on Generative AI to enable partners by building proof-of-concept solutions, reference architectures, and production-grade AI workflows. This role combines strategic technical leadership with hands-on development to demonstrate and scale NVIDIA's accelerated Generative AI platforms (GPU systems, CUDA, NeMo, Triton) and to help partners deploy agentic and generative AI applications.
Responsibilities
- Serve as the primary technical domain expert for pre- and post-sale partner engagements; embed with partners to design and deploy Generative AI solutions and maintain strong relationships with partner and customer leadership and technical teams.
- Accelerate partner/customer time to value by providing repeatable reference architecture guidance, building hands-on prototypes and proofs-of-concept, and advising on methodologies for scaling solutions to production.
- Define scope, success metrics, and evaluation criteria for partner-led projects, ensuring standardized and reproducible GPU-accelerated workflows.
- Enable strategic partners to launch professional services and platforms by tailoring NVIDIA agentic AI blueprints for high-impact workloads, and proactively drive deeper adoption of NVIDIA GenAI products.
- Codify knowledge and operationalize technical success practices to help partners scale impact across industries and workloads.
Requirements
- MSc or PhD in Computer Science, Electrical Engineering, Software Engineering, ML Engineering, or a related field (or equivalent experience).
- 5+ years of relevant experience developing and deploying AI models at scale as a Software Engineer or Deep Learning Engineer.
- Proven track record building enterprise-grade agentic AI systems using open-source models, with a solid foundation in deep learning and generative models.
- Hands-on experience with LLM and agentic frameworks such as NeMo Agent Toolkit, LangChain, Semantic Kernel, CrewAI, and AutoGen, and with evaluation and observability platforms; comfortable building prototypes and proofs-of-concept.
- Strong coding proficiency in Python and C++ and experience with deep learning frameworks (PyTorch or TensorFlow).
- Excellent communication and presentation skills for collaboration with internal executives, partners, and customers.
Ways to stand out
- Demonstrated hands-on experience with NVIDIA AI platforms.
- Understanding of advanced agent architectures and emerging communication protocols (e.g., MCP, Google A2A).
- Practical expertise in Generative AI and LLM development, including the ability to train models such as GPT and Megatron.
- Familiarity with MLOps lifecycle management and LLMOps workflows.
- Experience with CUDA programming, benchmarking, and analyzing performance of foundation models.
Compensation & Benefits
- Base salary range: 148,000 USD - 235,750 USD (determined by location, experience, and comparable pay).
- Eligible for equity and benefits (see NVIDIA benefits page).
- Applications accepted at least until August 14, 2025.
Diversity & Inclusion
NVIDIA is an equal opportunity employer and is committed to fostering a diverse work environment; it does not discriminate based on protected characteristics.