Senior Solutions Architect, AI Infrastructure
at NVIDIA
Santa Clara, United States
USD 224,000-356,500 per year
Required Skills & Competences
Docker @ 4, Kubernetes @ 4, Linux @ 4, DevOps @ 4, MLOps @ 4, Communication @ 7, Networking @ 4, Product Management @ 4, Debugging @ 4, CUDA @ 4, GPU @ 4
Details
NVIDIA is looking for an experienced GPU and network systems Solutions Architect & Engineer to drive deployment of end-to-end AI hardware and software technologies in customer data centers. You will work with strategic customers to integrate NVIDIA technology solutions, provide technical guidance to business and engineering teams, and influence product roadmap features.
Responsibilities
- Work with NVIDIA AI Native, Consumer Internet and Enterprise customers on large data center GPU server and networking system deployments as a Solutions Architect Engineer.
- Guide customer discussions on network design, compute/storage and support bring-up of server/network/cluster deployments; visit customer data centers during bring-up phases.
- Demonstrate subject matter expertise in advanced GPU and network systems and act as a trusted technical advisor to strategic customers.
- Bring customer-specific requirements to product teams to guide product roadmap features.
- Identify new project opportunities for NVIDIA products and technology solutions in data center and AI applications; collaborate closely with GPU/Network Systems Engineering, Product Management and Sales teams.
- Conduct regular technical customer meetings covering product roadmap, cluster debugging, feature discussions and introductions to new technology solutions.
- Build custom product demonstrations and proofs-of-concept (POCs) addressing critical customer business needs.
- Analyze and debug compute/network configuration and performance issues to deliver performant clusters.
- Use conferencing tools extensively; occasional on-site travel (~20%) is required for customer visits and industry events.
Requirements
- BS/MS/PhD in Electrical/Computer Engineering, Computer Science, Physics, or other engineering fields, or equivalent experience.
- Ideally 12+ years of experience in systems/solution engineering or similar engineering roles.
- System-level expertise in CPU/GPU server architecture, NICs, Linux, system software and kernel drivers.
- Experience with networking switches for Ethernet/InfiniBand and data center infrastructure (power/cooling).
- Knowledge of DevOps/MLOps technologies such as Docker/containers and Kubernetes.
- Effective time management and ability to balance multiple tasks.
- Strong verbal and written communication skills; ability to share ideas and code clearly through documents and presentations.
Ways to Stand Out
- External customer-facing background.
- Experience with bring-up and deployment of large clusters.
- Systems engineering, coding, and debugging skills including experience with C/C++, Linux kernel and drivers.
- Hands-on experience with NVIDIA GPU systems/SDKs (e.g., CUDA), NVIDIA networking technologies (e.g., NICs, RoCE, InfiniBand), and/or ARM CPU solutions.
- Familiarity with virtualization technology concepts.
Compensation & Other Details
- Base salary range: 224,000 USD - 356,500 USD (base determined by location, experience, and comparable pay).
- Eligible for equity and company benefits.
- Applications accepted at least until August 14, 2025.
- NVIDIA is an equal opportunity employer committed to a diverse work environment.