Senior Solutions Architect, Generative AI

at Nvidia

📍 Santa Clara, United States

USD 184,000-287,500 per year

SENIOR

✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ^?

Python @ 7 TensorFlow @ 7 Communication @ 4 Mathematics @ 4 Parallel Programming @ 3 Debugging @ 7 LLM @ 4 PyTorch @ 7

Details

NVIDIA is seeking outstanding AI Solutions Architects to assist and support customers that are building solutions with our newest AI technology. As a Solutions Architect you will work across teams to help customers adopt and build solutions using NVIDIA Accelerated Computing and Deep Learning software and hardware platforms. You will become a trusted technical advisor and work on projects and proof-of-concepts focused on Generative AI and Large Language Models (LLMs). You will also collaborate with internal teams on performance analysis and modeling of inference software. Some travel to conferences and customers may be required.

Responsibilities

Partner with other solution architects, engineering, product and business teams to understand strategies and technical needs and help define high-value solutions.
Dynamically engage with developers, scientific researchers and data scientists across a range of technical areas.
Strategically partner with lighthouse customers and industry-specific solution partners targeting NVIDIA's computing platform.
Work closely with customers to help them adopt and build solutions using NVIDIA technology.
Analyze performance and power efficiency of deep learning inference workloads.
Present and communicate technical solutions to customers and internal teams; collaborate on cross-functional projects.

Requirements

BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering or related fields, or equivalent experience.
8+ years of hands-on experience with Deep Learning frameworks such as PyTorch and TensorFlow.
Strong fundamentals in programming, optimizations and software design, especially in Python.
Strong problem-solving and debugging skills.
Excellent knowledge of theory and practice of Large Language Models and Deep Learning inference.
Excellent presentation, communication and collaboration skills.
Desire to be involved in multiple diverse and creative projects.

Ways to Stand Out

Experience with NVIDIA GPUs and software libraries, such as NVIDIA NeMo Framework, NVIDIA Triton Inference Server, TensorRT, TensorRT-LLM.
Excellent C/C++ programming skills, including debugging, profiling, code optimization, performance analysis, and test design.
Familiarity with parallel programming and distributed computing platforms.
Prior experience with DL training at scale, deploying or optimizing DL inference in production.

Compensation and Benefits

Base salary range: 184,000 USD - 287,500 USD (determined based on location, experience, and pay of employees in similar positions).
Eligibility for equity and benefits (see NVIDIA benefits information).

Applications for this job will be accepted at least until December 5, 2025.

NVIDIA is an equal opportunity employer and is committed to fostering a diverse work environment.