Product Manager - Inference

at Nvidia
USD 144,000-258,750 per year
MIDDLE
✅ On-site

Used Tools & Technologies

Not specified

Required Skills & Competences

Software Development @ 3, GitHub @ 3, Algorithms @ 3, Machine Learning @ 3, Leadership @ 3, Communication @ 6, Performance Optimization @ 3, Product Management @ 5, LLM @ 3, GPU @ 3

Details

Inference is the fastest-growing and most competitive area in Generative AI today. It is where AI models impact our daily lives, and where every bit of accuracy and performance matters for quality, safety, and cost. Inference is also constantly evolving, with new acceleration algorithms, use cases, and deployment techniques. As a Product Manager for AI Platform Inference, you will be responsible for building the tools, SDKs, and libraries that enable developers' inference deployments to thrive on NVIDIA GPUs.

Responsibilities

  • Create products to help developers build better inference deployments.
  • Develop product strategy, roadmaps, and go-to-market plans.
  • Collaborate with internal and external developers to build product-based roadmaps for model optimization software.
  • Work with leadership to align with and drive company strategy.
  • Champion developer needs inside NVIDIA and engage directly with developers to identify improvements and priorities.

Requirements

  • Experience with inference deployment and optimization software (for example, vLLM, SGLang, FlashInfer, TensorRT-LLM, Triton, Dynamo, or TorchAO).
  • Demonstrable knowledge of Generative AI or machine learning concepts, particularly around performance optimization, and software development and delivery.
  • BS or MS degree in Computer Science, Computer Engineering, or a similar field (or equivalent experience).
  • 5+ years of experience in technical product management or a similar role at a technology company.
  • Strong communication and interpersonal skills.

Ways to Stand Out

  • Experience leading optimization products for inference.
  • Experience working on open-source, GitHub-first developer products with deep customer interaction.
  • Knowledge of GPU architecture, HW/SW co-design, and performance profiling.

Compensation & Benefits

  • Base salary ranges (dependent on level, location, and experience):
    • Level 3: 144,000 USD - 218,500 USD
    • Level 4: 168,000 USD - 258,750 USD
  • You will also be eligible for equity and benefits (see NVIDIA benefits page).

Additional Information

  • Applications for this job will be accepted at least until July 29, 2025.
  • NVIDIA is committed to fostering a diverse work environment and is an equal opportunity employer. The company does not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.