Senior Solutions Architect, Generative AI

at NVIDIA
USD 184,000-287,500 per year
SENIOR
✅ Hybrid

Used Tools & Technologies

Not specified

Required Skills & Competences

Marketing @ 4, Kubernetes @ 3, Python @ 7, GitHub @ 4, MLOps @ 3, Hiring @ 4, Communication @ 7, Mathematics @ 4, Performance Optimization @ 4, Debugging @ 4, HTTP @ 4, PyTorch @ 4, CUDA @ 4, GPU @ 4

Details

NVIDIA is hiring an AI Solutions Architect for a customer-facing role focused on accelerating customer workloads and leading technical engagements around NVIDIA software and technologies. The role emphasizes hands-on work in efficient AI model training and/or deployment, performance optimization on GPUs, and building proof-of-concepts for generative AI and recommender solutions in the Consumer Internet industry. Occasional travel for on-site customer visits and conferences is required; remote work is supported.

Responsibilities

  • Collaborate closely with customers to improve workload performance and reduce infrastructure costs.
  • Lead and develop proof-of-concepts for AI solutions (including LLMs and recommenders) and produce supporting collateral (notebooks, code).
  • Develop and debug software for NVIDIA and open-source AI frameworks and libraries.
  • Profile and optimize model training and inference performance on GPUs.
  • Partner with NVIDIA software engineering, product, and sales teams to secure design wins and incorporate customer feedback into solutions.
  • Communicate technical results and code via GitHub, documentation, and presentations.

Requirements

  • BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, or another engineering field, or equivalent experience.
  • 8+ years of experience as an AI/Software Engineer with a proven track record coding in Python and/or C++ using popular AI software libraries and GPUs.
  • Experience profiling and optimizing model training/inference performance on GPUs.
  • Experience developing and optimizing GPU kernels for deep learning, with focus areas such as GEMM and attention kernels.
  • Strong written and verbal communication skills; ability to present and document code and designs.
  • Comfortable collaborating across cross-functional teams (Engineering, Research, Sales, Product, Marketing).
  • Self-starter with a passion for continuous learning and sharing insights.

Ways to stand out

  • Full-stack experience from deep learning framework level (such as PyTorch or JAX) down to low-level CUDA/CUTLASS/cuDNN/NCCL.
  • Experience working with enterprise developers and strong customer-facing skills.
  • Familiarity with MLOps technologies such as containers and Kubernetes, and experience with data center deployments.
  • Experience with large-scale production data pipelines and AI model training/deployment at scale.
  • Creative problem-solving skills for debugging and resolving complex issues.

Benefits and additional information

  • Competitive base salary range: 184,000 USD - 287,500 USD (determined by location, experience, and similar roles).
  • Eligible for equity and benefits. More information: http://www.nvidiabenefits.com/ and https://www.nvidia.com/en-us/benefits/
  • Applications accepted at least until July 29, 2025.
  • Location listed: Santa Clara, CA, United States. Remote work is supported; occasional on-site visits are expected.