Senior Technical Product Manager - DGX Systems and AI Infrastructure
at Nvidia
š Santa Clara, United States
USD 168,000-258,800 per year
SCRAPED
Used Tools & Technologies
Not specified
Required Skills & Competences ?
Kubernetes @ 4 Communication @ 7 Networking @ 4 Performance Optimization @ 4 Product Management @ 4 LLM @ 4Details
NVIDIA is seeking a deeply technical Product Manager to develop the enterprise software stack for next-generation AI factories built on NVIDIA GPUs and purpose-designed CPUs. The role sits in the Enterprise Product Group behind DGX systems and DGX SuperPOD and focuses on delivering full-stack AI platform capabilities including bare-metal orchestration, Kubernetes-native services, and cloud-native integration to power large-scale AI and HPC workloads.
Responsibilities
- Own the product lifecycle for DGX systems ā from concept through launch and lifecycle management.
- Understand and articulate value propositions of different DGX architectures (examples listed: H100, B200, GB200) and map architectures to specific AI and HPC workloads.
- Define and prioritize use cases and reference architectures for key verticals including LLM training, inference at scale, data processing pipelines, and AI research platforms.
- Define system-level requirements (PRDs) including platform software stacks and cloud-native integration.
- Collaborate with engineering, architecture, and go-to-market teams to align product features, roadmaps and messaging with market needs.
- Drive product strategy and roadmap for scale-up and scale-out AI infrastructure.
- Leverage networking architecture knowledge (e.g., InfiniBand, Ethernet, NVLink, NVSwitch) to enable scalable, high-performance AI clusters.
- Collaborate with datacenter infrastructure and deployment teams to understand power, cooling, rack-level integration, and orchestration challenges.
Requirements
- BS or MS degree in Computer Science, Computer Engineering, or similar field, or equivalent experience.
- 8+ years of product management or similar experience at a technology company with a technology-first attitude.
- Deep understanding of full-stack AI infrastructure from hardware to platform software.
- Experience or familiarity with GPUs, purpose-built CPUs, DGX systems and related platforms.
- Experience with bare-metal orchestration and Kubernetes-native services.
- Ability to define system-level requirements and write PRDs.
- Strong communication and interpersonal skills.
- Strong program management skills: very organized with ability to multitask and prioritize in a demanding environment.
- Curiosity and a proactive mindset.
Ways to stand out
- Experience deploying systems at scale in modern data center environments.
- Successful management of technical products throughout their lifecycle in fast-paced environments.
- Understanding of modern AIOps architectures and intuition for performance optimization.
- Agentic AI development and application experience.
- Direct development experience with NVIDIA software, hardware and SDKs.
Benefits
- Base salary range: 168,000 USD - 258,750 USD (final base salary determined by location, experience, and pay of employees in similar positions).
- Eligible for equity and other NVIDIA benefits (see NVIDIA benefits page).
Applications for this job will be accepted at least until October 24, 2025. NVIDIA is an equal opportunity employer committed to a diverse work environment.