Senior Solutions Architect, Generative AI

at Nvidia

📍 Santa Clara, United States

USD 184,000-287,500 per year

SENIOR

✅ Hybrid

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ^?

Marketing @ 4 Kubernetes @ 3 Python @ 7 GitHub @ 4 MLOps @ 3 Hiring @ 4 Communication @ 7 Mathematics @ 4 Performance Optimization @ 4 Debugging @ 4 HTTP @ 4 PyTorch @ 4 CUDA @ 4 GPU @ 4

Details

NVIDIA is hiring an AI Solutions Architect for a customer-facing role focused on accelerating customer workloads and leading technical engagements around NVIDIA software and technologies. The role emphasizes hands-on work in efficient AI model training and/or deployment, performance optimization on GPUs, and building proof-of-concepts for generative AI and recommender solutions in the Consumer Internet industry. Occasional travel for on-site customer visits and conferences is required; remote work is supported.

Responsibilities

Collaborate closely with customers to improve workload performance and reduce infrastructure costs.
Lead and develop proof-of-concepts for AI solutions (including LLMs and recommenders) and produce supporting collateral (notebooks, code).
Develop and debug software for NVIDIA and open-source AI frameworks and libraries.
Profile and optimize model training and inference performance on GPUs.
Partner with NVIDIA software engineering, product, and sales teams to secure design wins and incorporate customer feedback into solutions.
Communicate technical results and code via GitHub, documentation, and presentations.

Requirements

BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, or other engineering fields, or equivalent experience.
8+ years of experience as an AI/Software Engineer with a proven track record coding in Python and/or C++ using popular AI software libraries and GPUs.
Experience profiling and optimizing model training/inference performance on GPUs.
Experience developing and optimizing GPU kernels for deep learning, with focus areas such as GEMM and attention kernels.
Strong written and verbal communication skills; ability to present and document code and designs.
Comfortable collaborating across cross-functional teams (Engineering, Research, Sales, Product, Marketing).
Self-starter with passion for continuous learning and sharing insights.

Ways to stand out

Full-stack experience from deep learning framework level (such as PyTorch or JAX) down to low-level CUDA/CUTLASS/cuDNN/NCCL.
Experience working with enterprise developers and strong customer-facing skills.
Familiarity with MLOps technologies such as containers and Kubernetes, and experience with data center deployments.
Experience with large-scale production data pipelines and AI model training/deployment at scale.
Creative problem-solving skills for debugging and resolving complex issues.

Benefits and additional information

Competitive base salary range: 184,000 USD - 287,500 USD (determined by location, experience, and similar roles).
Eligible for equity and benefits. More information: http://www.nvidiabenefits.com/ and https://www.nvidia.com/en-us/benefits/
Applications accepted at least until July 29, 2025.
Location listed: Santa Clara, CA, United States. Remote work is supported; occasional on-site visits are expected.