Used Tools & Technologies
Not specified
Required Skills & Competences ?
Kubernetes @ 4 Communication @ 7 Networking @ 4 Debugging @ 4 GPU @ 4Details
NVIDIA is seeking outstanding AI Solutions Architects to assist and support customers that are building solutions with our newest AI technology. Solutions Architects at NVIDIA work across teams helping customers with Accelerated Computing and Deep Learning software and hardware platforms. You will become a trusted technical advisor and work on projects and proofs-of-concept focused on Generative AI and Large Language Models (LLMs). The role involves collaboration with internal teams on performance analysis and modeling of inference software and supporting customers across the full lifecycle of GPU cloud infrastructure deployments.
Responsibilities
- Collaborate with NVIDIA Cloud Partners to create, implement, and operate NVIDIA hardware and software solutions.
- Partner with Sales Account Managers and business leads to identify and secure opportunities for NVIDIA products and solutions.
- Act as primary technical support for customers during development, construction, and production of large GPU cloud infrastructure across the customer lifecycle.
- Conduct regular technical customer meetings for project/product details, feature discussions, introductions to new technologies, and debugging sessions.
- Work with customers to build PoCs addressing critical business needs, including building out networking and compute infrastructure.
- Prepare and deliver technical content to customers including presentations, workshops, and demos.
- Analyze and develop joint solutions for customer performance and scaling issues.
Requirements
- BS/MS/PhD in Electrical/Computer Engineering, Computer Science, Physics, or other engineering fields or equivalent experience.
- Motivation and ability to own and drive technical engagements with customers through the full customer life-cycle.
- 8+ years of Solutions Engineering (or similar Sales Engineering, Cloud Engineering) experience working directly with partners and customers.
- Experience crafting and deploying large-scale cluster environments.
- Practical expertise in data center design, development and execution for AI and HPC workloads.
- Efficient time management and ability to balance multiple tasks; strong written and verbal communication skills for documents and presentations.
Preferred / Ways to stand out
- Practical familiarity with NVIDIA hardware (GPUs, Ethernet/InfiniBand networking components, storage) in large AI and HPC cluster environments.
- Practical knowledge of NVIDIA systems technologies such as NCCL, DCGM, UFM, Mission Control, Base Command Manager.
- Familiarity with at-scale GPU systems including performance testing and AI benchmarking.
- Practical involvement in cluster administration and orchestration (SLURM, Kubernetes).
Benefits
- Base salary range: 184,000 USD - 287,500 USD (will be determined based on location, experience, and comparable pay).
- Eligible for equity and company benefits.
- Applications for this job will be accepted at least until December 5, 2025.
NVIDIA is committed to fostering a diverse work environment and is an equal opportunity employer. The company does not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.