Solutions Architect, AI Infrastructure
at Nvidia
π Santa Clara, United States
USD 148,000-287,500 per year
SCRAPED
Used Tools & Technologies
Not specified
Required Skills & Competences ?
Docker @ 2 Kubernetes @ 2 Linux @ 3 DevOps @ 2 MLOps @ 2 Communication @ 3 Networking @ 3 Debugging @ 3 CUDA @ 3 GPU @ 3Details
As a Solutions Architect, AI Infrastructure at NVIDIA, you will lead deployment of AI solutions and accelerated computing in customer data centers. You will work closely with customers and internal teams to design, deploy, debug, and demonstrate GPU-based data center solutions.
Responsibilities
- Work with NVIDIA AI Native customers on data center GPU server and networking infrastructure deployments.
- Guide customer discussions on network topologies, compute/storage, and support bring-up of server/network/cluster deployments.
- Identify new project opportunities for NVIDIA products and technology solutions in data center and AI applications.
- Conduct regular technical meetings with customers as a trusted advisor, discussing product roadmaps, cluster debugging, and new technology introductions.
- Build custom demonstrations and proofs of concept to address critical business needs.
- Analyze and debug compute and network performance issues.
Requirements
- BS/MS/PhD in Electrical/Computer Engineering, Computer Science, Physics, or related fields, or equivalent experience.
- 5+ years of experience in Solution Engineering or similar roles.
- System-level understanding of server architecture, NICs, Linux, system software, and kernel drivers.
- Practical knowledge of networking including switching & routing for Ethernet and InfiniBand, and familiarity with data center infrastructure (power/cooling).
- Familiarity with DevOps/MLOps technologies such as Docker/containers and Kubernetes.
- Effective time management and ability to balance multiple tasks.
- Excellent communication skills for articulating ideas and code clearly through documents and presentations.
Preferred / Ways to Stand Out
- External customer-facing skills and experience.
- Experience with the bring-up and deployment of large clusters.
- Proficiency in systems engineering, coding, and debugging, including C/C++, Linux kernel, and drivers.
- Hands-on experience with NVIDIA systems/SDKs (e.g., CUDA), NVIDIA networking technologies (e.g., DPU or equivalent experience, RoCE, InfiniBand), and/or ARM CPU solutions.
- Familiarity with virtualization technology concepts.
Compensation & Benefits
- Base salary ranges (determined by location, experience, and comparable employees):
- Level 3: 148,000 USD - 235,750 USD
- Level 4: 184,000 USD - 287,500 USD
- You will also be eligible for equity and benefits (see NVIDIA benefits).
Other
- Applications for this job will be accepted at least until October 3, 2025.
- NVIDIA is an equal opportunity employer and committed to fostering a diverse work environment.