Solutions Architect, Generative AI

at Nvidia
USD 148,000-235,800 per year
MIDDLE
βœ… On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Python @ 6 MLOps @ 2 TensorFlow @ 6 Hiring @ 3 Leadership @ 3 Communication @ 3 Technical Leadership @ 3 LLM @ 3 PyTorch @ 6 CUDA @ 3 GPU @ 3 LLMOps @ 2

Details

NVIDIA is hiring a Solutions Architect focused on Generative AI to enable partners by building proof-of-concept solutions, reference architectures, and production-grade AI workflows. This role combines strategic technical leadership with hands-on development to demonstrate and scale NVIDIA's accelerated Generative AI platforms (GPU systems, CUDA, NeMo, Triton) and to help partners deploy agentic and generative AI applications.

Responsibilities

  • Serve as the primary technical domain expert for pre- and post-sale partner engagements; embed with partners to design and deploy Generative AI solutions and maintain strong relationships with partner and customer leadership and technical teams.
  • Accelerate partner/customer time to value by providing repeatable reference architecture guidance, building hands-on prototypes and proofs-of-concept, and advising on methodologies for scaling solutions to production.
  • Define scope, success metrics, and evaluation criteria for partner-led projects, ensuring standardized and reproducible GPU-accelerated workflows.
  • Enable strategic partners to launch Professional Services and platforms by tailoring NVIDIA agentic AI blueprints for high-impact workloads and proactively drive deeper adoption of NVIDIA GenAI products.
  • Codify knowledge and operationalize technical success practices to help partners scale impact across industries and workloads.

Requirements

  • MSc or PhD in Computer Science, Electrical Engineering, Software Engineering, ML Engineering, or a related field (or equivalent experience).
  • 5+ years of relevant experience developing and deploying AI models at scale as a Software Engineer or Deep Learning Engineer.
  • Proven track record building enterprise-grade agentic AI systems using open-source models, with a solid foundation in deep learning and generative models.
  • Hands-on experience with LLM and agentic frameworks such as NeMo Agent Toolkit, LangChain, Semantic Kernel, Crew.ai, AutoGen, and with evaluation and observability platforms; comfortable building prototypes and proofs-of-concept.
  • Strong coding proficiency in Python and C++ and experience with deep learning frameworks (PyTorch or TensorFlow).
  • Excellent communication and presentation skills for collaboration with internal executives, partners, and customers.

Ways to stand out

  • Demonstrated hands-on experience with NVIDIA AI platforms.
  • Understanding of advanced agent architectures and emerging communication protocols (e.g., MCP, Google A2A).
  • Practical expertise in Generative AI and LLM development, including ability to train models such as GPT and Megatron.
  • Familiarity with MLOps lifecycle management and LLMOps workflows.
  • Experience with CUDA programming, benchmarking, and analyzing performance of foundation models.

Compensation & Benefits

  • Base salary range: 148,000 USD - 235,750 USD (determined by location, experience, and comparable pay).
  • Eligible for equity and benefits (see NVIDIA benefits page).
  • Applications accepted at least until August 14, 2025.

Diversity & Inclusion

NVIDIA is an equal opportunity employer and committed to fostering a diverse work environment; they do not discriminate based on protected characteristics.