Technical Program Manager, Cloud Infrastructure

at Nvidia
USD 192,000-304,800 per year
MIDDLE
βœ… On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Security @ 3 Kubernetes @ 2 Terraform @ 2 Leadership @ 3 Communication @ 3 Jira @ 3 API @ 2 Compliance @ 3 Agile @ 5 GPU @ 3

Details

NVIDIA's deep learning platforms are at the forefront of innovation, widely adopted by academic institutions, startups, and major Internet companies globally. The DGX Cloud team is seeking an accomplished Technical Program Manager (TPM) with extensive experience in cloud infrastructure bring-up and relationship management to partner with companies and internal engineering teams to build AI capacity and infrastructure worldwide.

Responsibilities

  • Partner with Engineering, Infrastructure, and Software teams and leadership to drive critical programs related to AI capacity enablement and management for DGX Cloud.
  • Develop and mature foundational capabilities and processes for DGX Cloud, including cluster/capacity bring-up and maintenance.
  • Collect technical requirements, craft detailed roadmaps, set clear achievements, and ensure compliance with the Product Lifecycle (PLC) process.
  • Establish KPIs and quantitatively demonstrate value and impact delivered by programs.
  • Use Jira and other program management platforms to establish rigor and structure in managing engineering tasks.
  • Collaborate cross-functionally (internally and with external partners) to understand partner capabilities, map to NVIDIA reference architectures, and drive execution.
  • Identify and drive adoption of third-party and in-house solutions for deployments, support, security, compliance, and observability across DGX Cloud.
  • Proactively identify, resolve, and mitigate risks and issues affecting scope, schedule, and quality across programs.
  • Develop and execute communication strategies to ensure organizational visibility on program progress and engineering delivery, including regular presentations to NVIDIA executive leadership.
  • Promote continuous improvement and identify process improvement opportunities within cloud infrastructure operations.

Requirements

  • 12+ years of technical program management experience driving the planning and execution of large-scale engineering programs, with a strong focus on software engineering projects within matrixed organizations.
  • Extensive hands-on experience in cloud infrastructure, preferably from a major Cloud Service Provider (CSP).
  • Expert-level proficiency with Jira, Smartsheet, or similar program management tools and the ability to guide engineering teams on their use within an Agile/Scrum framework.
  • Strong strategic and tactical thinking, consensus building, and program-driving capabilities.
  • Ability to thrive in ambiguous environments.
  • Excellent communication and technical presentation skills, particularly for executive audiences.
  • BS or MS in Electrical Engineering or Computer Science, or equivalent experience.

Ways to stand out

  • Exceptional communication skills and proven ability to work with cross-functional teams across geographies.
  • In-depth knowledge of NVIDIA GPU products, including deployment and bring-up.
  • Experience in high-growth tech, cloud infrastructure, and/or AI/ML organizations.
  • Significant experience with productivity tools and process automation.
  • Deep familiarity with cloud-native services and AI/ML infrastructure and working knowledge of technologies such as Kubernetes, API integration, Terraform, observability tooling, etc.

Compensation and Benefits

  • Base salary range: 192,000 USD - 304,750 USD (final base salary determined by location, experience, and pay of employees in similar positions).
  • Eligible for equity and benefits.

Other details

  • Location: US β€” Santa Clara, CA.
  • Employment type: Full time.
  • Applications accepted at least until August 29, 2025.
  • NVIDIA is an equal opportunity employer and committed to fostering a diverse work environment.