Senior Solutions Architect, Generative AI

at Nvidia
USD 184,000-287,500 per year
SENIOR
✅ Remote ✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Marketing @ 4 Kubernetes @ 3 Python @ 7 GitHub @ 7 MLOps @ 3 Communication @ 7 Mathematics @ 4 Debugging @ 4 PyTorch @ 4 CUDA @ 4 GPU @ 4

Details

NVIDIA is looking for an AI Solutions Architect with hands-on experience in efficient AI model training and/or deployment for a customer-facing role. The role involves accelerating customer workloads and leading technical engagements around NVIDIA software and technologies with top technology companies, observing emerging industry trends.

Responsibilities

  • Collaborate closely with customers to improve workload performance and reduce infrastructure costs.
  • Lead and develop proofs-of-concept for AI solutions in the Consumer Internet industry, including LLMs and recommenders, and build collateral (notebook/code) as needed.
  • Develop and debug software for NVIDIA and open-source AI frameworks and libraries.
  • Partner with NVIDIA’s software engineering, product, and sales teams to secure design wins and drive development of innovative solutions based on customer feedback.

Requirements

  • BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, or related field, or equivalent experience.
  • 8+ years as an AI/Software Engineer with proven coding in Python and/or C++ with popular AI libraries and GPUs.
  • Experience profiling and optimizing model training/inference on GPUs.
  • Experience developing and optimizing GPU kernels for deep learning, focusing on GEMM and attention kernels.
  • Strong communication skills to convey ideas and code via GitHub, documentation, and presentations.
  • Team player who collaborates with cross-functional teams including Engineering, Research, Sales, Product, and Marketing.
  • Self-starter passionate about growth, continuous learning, and sharing insights.

Ways to Stand Out

  • Full stack experience from DL framework level (e.g., PyTorch/JAX) to lower level (e.g., CUDA/CUTLASS/cuDNN/NCCL).
  • Experience working with enterprise developers and strong customer-facing skills.
  • Familiarity with MLOps technologies like containers, Kubernetes, and data center deployments.
  • Experience with large-scale production data pipelines and AI model training/deployment.
  • Creative problem-solving skills for debugging and resolving complex issues.

The role supports remote work with occasional travel for onsite visits and conferences.

Compensation and Benefits

  • Base salary range: $184,000 - $287,500 USD per year, influenced by location and experience.
  • Eligible for equity and comprehensive benefits.

NVIDIA is an equal opportunity employer fostering diversity and inclusion in its workforce.