Solutions Architect, Generative AI
at Nvidia
Santa Clara, United States
USD 148,000-235,800 per year
Required Skills & Competences
Python @ 6, Machine Learning @ 3, Data Science @ 3, TensorFlow @ 6, Leadership @ 6, Communication @ 3, LLM @ 3, PyTorch @ 6, CUDA @ 3, GPU @ 3
Details
NVIDIA is seeking an outstanding AI Engineer or Solutions Architect to join the Generative AI Partners Enablement Solutions Architecture team. In this role you will act as a strategic technical expert and a hands-on developer, building proof-of-concept solutions and reference architectures for agentic Generative AI applications that demonstrate the NVIDIA full-stack accelerated Generative AI platforms, from GPU systems and CUDA to NeMo and Nemotron. You will provide partners with technical blueprints and guidance to architect and deploy transformative applications using the NVIDIA AI stack.
Responsibilities
- Build end-to-end agentic AI applications that solve real-world enterprise problems across industries.
- Serve as the primary technical domain expert for pre- and post-sale engagements with partners; embed with partners to design and deploy Generative AI solutions at scale and maintain strong relationships with leadership and technical teams.
- Accelerate partner/customer time to value by providing repeatable reference architecture guidance, building hands-on prototypes and proofs of concept, and advising on standard methodologies for scaling solutions to production.
- Establish scope, success metrics, and evaluation criteria for partner-led customer projects, ensuring alignment to standardized and reproducible GPU-accelerated workflows.
- Enable strategic partners to build their own professional services, platforms, and products by integrating and accelerating NVIDIA technologies for high-impact customer workloads.
- Codify knowledge and operationalize technical success practices to help partners scale impact across industries and workloads.
Requirements
- MS or PhD in Computer Science/Engineering, Machine Learning, Data Science, Electrical Engineering or a closely related field, or equivalent experience.
- 5+ years of meaningful work experience deploying AI models at scale as a Software Engineer or Deep Learning engineer.
- Consistent track record of building enterprise-grade agentic AI systems using open-source models; solid foundation in deep learning with emphasis on LLMs and VLMs.
- Hands-on experience with LLM and agentic frameworks such as NeMo Agent Toolkit, LangChain, Semantic Kernel, CrewAI, and AutoGen, plus experience with evaluation and observability platforms.
- Strong coding and development proficiency in Python and C++ and experience with deep learning frameworks (PyTorch or TensorFlow).
- Familiarity with NVIDIA GPUs and system software stacks (CUDA, NCCL) and HPC technologies such as InfiniBand, MPI, NVLink.
- Excellent communication and presentation skills to collaborate effectively with internal executives, partners and customers.
Ways to Stand Out
- Demonstrated expertise building applications and systems using NeMo Framework, Nemotron, Dynamo, TensorRT-LLM, NIMs, and AI Blueprints; active open-source contributions.
- End-to-end ownership of projects and proactive acquisition of new skills.
- Experience managing multiple workstreams in fast-paced environments and prioritizing for highest customer impact.
- Understanding of advanced agent architectures and emerging communication protocols (MCP, OpenAI Agentic SDK, Google A2A).
Compensation & Benefits
- Base salary range: 148,000 USD - 235,750 USD (determined by location, experience, and comparables).
- Eligibility for equity and company benefits (see NVIDIA benefits).
Additional Information
- Location: Santa Clara, CA, United States.
- Applications for this job will be accepted at least until November 13, 2025.
- NVIDIA is an equal opportunity employer and is committed to fostering a diverse work environment. They do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.