Forward Deployed Engineer, Optimization - AI Accelerator

at Nvidia

📍 Santa Clara, United States

USD 200,000-391,000 per year

MIDDLE

✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ^?

Kubernetes @ 3 CI/CD @ 3 MLOps @ 3 Leadership @ 3 Communication @ 3 Performance Monitoring @ 3 Product Management @ 7 Technical Leadership @ 3 CUDA @ 3 GPU @ 3

Details

NVIDIA is seeking a Forward Deployed Architect to provide technical leadership and strategic guidance across multiple AI Accelerator customer engagements. The role focuses on improving customer value through technical mentorship, crafting feedback loops for structured solutions, and helping design and validate architectures for sophisticated AI workloads.

Responsibilities

Provide cross-account technical guidance across multiple strategic customer engagements to ensure alignment to business outcomes and optimal technical approaches.
Collaborate with customers and internal teams to understand customer goals and effective technical strategies.
Provide implementation oversight and guide Forward Deployed Engineers and customer teams on complex technical decisions, system architectures, and implementation strategies.
Identify common technical challenges, use cases, and solution approaches across projects; share findings with internal teams and the external community.
Develop and advocate standardized approaches, guidelines, and structured advice rooted in proven patterns from successful customer implementations.
Hands-on technical leadership: prototype and dive in with hands-on keyboard when needed to solve critical problems, validate architectures, or demonstrate solutions.
Cross-functional collaboration with product, engineering, and customer success teams to ensure customer findings inform internal strategy and capabilities.
Design technical strategies for AI workloads including distributed training, inference optimization, and complex MLOps pipelines applicable across customers.

Requirements

12+ years of experience in technical roles such as solutions architecture, ML engineering, technical product management, or technical consulting working across multiple customers or projects.
Strong technical leadership with proven ability to guide teams and influence technical decisions without direct authority.
Systems thinking with the ability to understand customer business outcomes and translate them into effective technical approaches.
Ability and willingness to prototype and implement solutions hands-on when needed to solve critical problems or validate approaches.
Exceptional communication skills with the ability to engage technical teams, executives, and cross-functional stakeholders.
Bachelor's degree or equivalent experience.

Ways to stand out

Experience with NVIDIA stack: CUDA, NeMo, Triton, TensorRT, NIM.
Deep expertise in AI/ML systems: distributed training, large-scale inference, model optimization, and pipeline automation.
Infrastructure experience: Kubernetes, GPU scheduling, distributed computing frameworks (SLURM, Ray), and multi-cloud environments.
Observability & automation: CI/CD, Infrastructure as Code, and GPU performance monitoring.
Background in solutions architecture or consulting with experience working across multiple customer engagements and creating reusable technical patterns.

Benefits & Compensation

Base salary range (determined by location, experience, and pay of employees in similar positions):
- Level 5: 200,000 USD - 322,000 USD
- Level 6: 248,000 USD - 391,000 USD
Eligible for equity and additional benefits (see NVIDIA benefits pages).
Applications for this job will be accepted at least until November 23, 2025.

NVIDIA is an equal opportunity employer and is committed to fostering a diverse work environment.