Senior Solutions Architect, AI Infrastructure

at Nvidia
USD 184,000-356,500 per year
SENIOR
βœ… Hybrid

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Docker @ 4 Kubernetes @ 4 Linux @ 4 DevOps @ 4 MLOps @ 4 Hiring @ 4 Communication @ 7 Networking @ 6 Product Management @ 4 Debugging @ 4 CUDA @ 4 GPU @ 4

Details

NVIDIA is hiring an experienced GPU and network systems Solutions Architect & Engineer to drive deployment of AI hardware and software technologies into customer data centers. You will work with strategic customers to integrate end-to-end solutions, influence product roadmaps, and support large-scale GPU server and networking deployments.

Responsibilities

  • Work with NVIDIA AI Native and consumer internet customers on large data center GPU server and networking system deployments as a Solutions Architect / Engineer.
  • Guide customer discussions on network design, compute/storage and support bring-up of server/network/cluster deployments. Visit customer data centers during bring-up phases.
  • Serve as a subject matter expert in advanced GPU and network systems and act as a trusted technical advisor to strategic customers.
  • Bring customer-specific requirements to product teams to influence product roadmap features.
  • Identify new project opportunities for NVIDIA products and solutions in data center and AI applications; collaborate with GPU/Network Systems Engineering, Product Management and Sales teams.
  • Conduct regular technical customer meetings for product roadmap discussions, cluster issue debugging, feature discussions and introductions to new solutions.
  • Build custom product demonstrations and proofs-of-concept (POCs) addressing critical customer needs.
  • Analyze and debug compute/network configuration and performance issues to deliver performant clusters.

Requirements

  • BS/MS/PhD in Electrical/Computer Engineering, Computer Science, Physics, or related engineering field, or equivalent experience.
  • Typically 8+ years of Systems/Solutions Engineering or similar engineering experience.
  • System-level expertise in CPU/GPU server architecture, NICs, Linux, system software and kernel drivers.
  • Experience with networking switches for Ethernet/InfiniBand and data center infrastructure (power/cooling).
  • Knowledge of DevOps/MLOps technologies such as Docker/containers and Kubernetes.
  • Systems engineering experience with coding and debugging; familiarity with C/C++ and Linux kernel/drivers is desirable.
  • Strong verbal and written communication skills; effective time management and ability to balance multiple tasks.

Ways to Stand Out

  • External customer-facing background.
  • Experience with bring-up and deployment of large clusters.
  • Strong systems engineering, coding, and debugging skills (including C/C++, Linux kernel and drivers).
  • Hands-on experience with NVIDIA GPU systems/SDKs (e.g., CUDA), NVIDIA networking technologies (NICs, RoCE, InfiniBand), and/or ARM CPU solutions.
  • Familiarity with virtualization concepts.

Location, Travel & Work Policy

  • Primary location listed: Santa Clara, California, United States.
  • Occasional on-site customer visits are required (approximately 20% travel).
  • NVIDIA is open to remote work locations; on-site customer visits and collaboration are expected (role is remote-friendly / hybrid).

Compensation & Benefits

  • Base salary ranges (determined by location, experience, and comparable roles):
    • Level 4: 184,000 USD - 287,500 USD per year
    • Level 5: 224,000 USD - 356,500 USD per year
  • Eligible for equity and benefits (see NVIDIA benefits page for details).

Other

  • Applications accepted at least until December 23, 2025.
  • NVIDIA is an equal opportunity employer and values diversity in its workforce.