Used Tools & Technologies
Not specified
Required Skills & Competences ?
Marketing @ 4 Distributed Systems @ 4 Communication @ 4 Planning @ 4 Swift @ 4 Agile @ 7Details
NVIDIA has become the platform upon which every new AI-powered application is built. From healthcare research applications to autonomous vehicles, or voice-recognition systems, there is a need to simplify and deliver predictability for AI applications and workflows. Businesses are often challenged with balancing the performance and costs of inference workloads. Token economics need to be matched with performance and user experience. NVIDIA technologies simplify model deployment while optimizing cost and performance for AI inference workloads. The role of this product manager is to work across NVIDIA to build the right kind of benchmarks, products, and tools that can help customers understand true performance & TCO of Inference.
Responsibilities
- Serve as a Subject Matter Expert on AI Inference: Maintain a deep understanding of the entire inference stack, including performance, scaling across workloads, and emerging technologies like disaggregated serving, to guide technical and product strategy.
- Drive Product Strategy through Market and User Insights: Steer product evolution and partnership direction by conducting thorough research on market trends, competitor activities, and customer feedback, translating these insights into actionable development plans and innovative ideas.
- Lead Partner Collaboration and Project Execution: Actively manage external partnerships by leading project planning, defining specific tasks and deliverables, serving as the primary liaison for communication, and educating partners on product value to ensure alignment and swift issue resolution.
- Spearhead Cross-Functional Product Introduction: Drive the new product introduction and transition processes by collaborating seamlessly with engineering, design, operations, sales, and marketing teams, ensuring a unified approach from conception to launch.
- Enhance Product and User Experience: Play a key role in the product development lifecycle by contributing to the ideation, design, and testing of user experiences, ensuring the final product meets and exceeds customer and partner expectations.
Requirements
- BS or MS in Computer Science, Computer Engineering, or a related field (or equivalent experience).
- 12+ years of product-management experience in enterprise technology.
- Subject matter expertise in Inference: understand different components of the inference stack, how they perform, scale, and correlate to workloads.
- Ability to articulate trade-offs among latency, throughput, cost, and reliability to both engineering and executive audiences.
- Strong cross-functional execution: writes clear specs and PRDs, produces GTM collateral, and leads agile processes.
Ways to Stand Out
- Masters/PhD or expertise in distributed systems or demonstrated experience in inference.
- Experience with inference (within the stack or ecosystem) and integrating with enterprise platforms; deployments at modern data-center scale.
Benefits & Compensation
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 208,000 USD - 327,750 USD. You will also be eligible for equity and benefits. NVIDIA is committed to fostering a diverse work environment and is an equal opportunity employer.
Additional Information
- Location: Santa Clara, CA, United States (Hybrid) — #LI-Hybrid
- Employment type: Full time
- Applications for this job will be accepted at least until September 9, 2025.
- NVIDIA is at the center of Deep Learning, Artificial Intelligence, and Autonomous Vehicles. The company values candidates who thrive in ambiguous environments and share a passion for technology.