Technical Program Manager, Capacity

at NVIDIA
USD 160,000-253,000 per year
MIDDLE
✅ On-site

Used Tools & Technologies

Not specified

Required Skills & Competences

  • Software Development @ 2
  • DevOps @ 2
  • Communication @ 3
  • Prioritization @ 3
  • Project Management @ 3
  • Reporting @ 3
  • Agile @ 3

Details

Hardware Infrastructure is seeking a Technical Program Manager to lead capacity programs that enable scaling how internal compute is managed and allocated across NVIDIA's infrastructure. The role partners with internal Hardware Infrastructure teams and with customer and partner teams to shape capacity tooling, planning, reporting and execution methodologies. This is a fast-paced environment focused on speed, performance and resource efficiency for large-scale AI and EDA workloads.

Responsibilities

  • Work across multiple internal customer teams to identify gaps and challenges in capacity allocation and shape the capacity tooling roadmap.
  • Nurture continuous improvement across tooling, automation and processes to scale capacity management.
  • Define strategies to increase efficiency and utilization of resources across internal clusters to minimize capacity waste.
  • Guide engineering efforts using an agile program methodology across planning, prioritization, design, dependency management, implementation and execution.
  • Bring a data-first approach to programs (metrics, OKRs, KPIs) to measure success and identify improvement areas.
  • Create effective communication channels that give audiences at different levels insight into program status, risks and opportunities.
  • Act as a technical and non-technical liaison between developers, customers and partners to drive alignment across a multi-functional matrixed set of leads.

Requirements

  • B.S. (or equivalent experience) in Computer Science or a related technical field.
  • 10+ years of experience across software engineering and/or technical program management roles with demonstrated mastery of technical and management practices.
  • Prior experience developing processes and programs focused on allocation and management of infrastructure resources that span a diverse and large portfolio (multi-billion dollar scope).
  • Experience leading programs that span multiple teams and engineers (100+ engineers).
  • Experience handling large-scale HPC and/or AI infrastructure deployments that span both hardware and software.
  • Exceptional communication and presentation skills for diverse technical and non-technical audiences.
  • Strong multitasking abilities with a focus on thoroughness and rapid context switching.
  • Knowledge of agile methodologies and project management tools.
  • Proactive in identifying and implementing improvements in software engineering and release management within fast-paced environments.

Ways To Stand Out

  • Prior experience bringing up new datacenter capacity across cloud service providers and on-premises locations.
  • Prior experience working with AI researchers and/or EDA developers.
  • Familiarity with software development, release and support methodology and DevOps practices.

Compensation & Benefits

  • Base salary range: 160,000 USD - 253,000 USD (final base determined by location, experience, and pay of employees in similar positions).
  • Eligible for equity and benefits (link to NVIDIA benefits provided in original posting).

Other Details

  • Applications accepted at least until September 6, 2025.
  • NVIDIA is an equal opportunity employer and committed to fostering a diverse work environment.