Required Skills & Competences
Ansible @ 6, Docker @ 6, Kubernetes @ 6, Linux @ 7, Python @ 7, Bash @ 7, Networking @ 4, Debugging @ 4, GPU @ 4
Details
Want to be part of a team that's revolutionizing the field of AI with data-center-scale solutions? We are looking for a hardworking Solution Architect with experience in designing, building, and maintaining large-scale HPC and AI hybrid computing solutions to join our team at NVIDIA. As a Solution Architect on the NVIDIA Partner Network team, you will help NVIDIA AI Factory solutions bring the benefits of large-scale AI to customers through our partners. You will work closely with customers and partners to address unsolved problems in the industry and help deploy and operationalize AI solutions at scale.
Responsibilities
- Guide partners in their adoption of end-to-end Agentic AI solutions using NVIDIA's compute, networking, and software stacks.
- Design, build, and maintain large-scale HPC and AI hybrid computing solutions in cloud and datacenter environments.
- Use cloud-native methodologies, low-latency networks, and accelerated compute to help build modern AI factories.
- Deliver demos, assist with proof-of-concepts, and produce technical materials such as papers and developer blogs.
- Collaborate with executives and engineering teams to solve complex problems and operationalize AI solutions at scale.
- Provide hands-on technical assistance, including cluster configuration, debugging, and performance tuning.
Requirements
- BS, MS, or PhD in Engineering, Computer Science, or a related field (or equivalent experience).
- Established track record working with AI and HPC clusters, both on-premises and cloud-based.
- 4+ years of proven experience with cluster management and related tools, including Docker containers, Slurm, Kubernetes, and Ansible.
- Hands-on experience with networking, storage, cluster configuration, and debugging.
- Strong analytical and problem-solving skills and the ability to articulate technical concepts to others.
- Ability to multitask efficiently in a dynamic environment.
Preferred / Ways to stand out
- Strong channel sales knowledge and partner co-selling experience.
- Strong coding and debugging skills, including experience with Python, C/C++, Bash, and Linux utilities.
- Demonstrated expertise through projects or open source contributions involving GPU workloads, Kubernetes, InfiniBand, Ethernet, or other areas related to high-performance clusters and hybrid cloud solutions.
- Hands-on experience with NVIDIA AI Enterprise, Base Command Manager, Run:ai, and NVIDIA NIMs.
- Willingness and ability to learn quickly and solve advanced problems.
Benefits
- Base salary (varies by level and location) with equity eligibility and company benefits.
Compensation and other details
- Base salary ranges by level:
  - Level 3: 148,000 USD - 235,750 USD
  - Level 4: 184,000 USD - 287,500 USD
- You will also be eligible for equity and benefits (see NVIDIA benefits).
- Applications for this job will be accepted at least until August 21, 2025.
Equal opportunity
NVIDIA is committed to fostering a diverse work environment and is an equal opportunity employer. The company does not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.