Senior Technical Program Manager, Cloud Infrastructure
at NVIDIA
Santa Clara, United States
USD 160,000-304,800 per year
Required Skills & Competences
Security @ 4, Kubernetes @ 4, Terraform @ 4, Hiring @ 4, Leadership @ 4, Communication @ 4, Jira @ 4, API @ 4, Compliance @ 4, Agile @ 6, GPU @ 4
Details
NVIDIA's deep learning platforms are at the forefront of innovation and are widely adopted by leading academic institutions, startups, and major internet companies. The DGX Cloud team is hiring an experienced Technical Program Manager (TPM) to drive cloud infrastructure bring-up, partner relationship management, and AI capacity enablement across global deployments.
Responsibilities
- Partner with Engineering, Infrastructure, and Software teams and their leadership to drive critical programs related to AI capacity enablement and management.
- Develop and mature foundational capabilities and processes for DGX Cloud, including cluster/capacity bring-up and maintenance.
- Gather technical requirements, develop comprehensive roadmaps, and ensure adherence to the Product Lifecycle (PLC) process.
- Use Jira and other program management platforms to add rigor and structure to engineering deliverables and program tracking.
- Collaborate cross-functionally (internally and externally) to understand partner capabilities and map them to NVIDIA reference architectures.
- Identify and drive adoption of third-party and in-house solutions for deployments, support, security, compliance, and observability across DGX Cloud.
- Establish metrics and KPIs and quantitatively demonstrate program value and impact.
- Proactively identify, resolve, and mitigate risks and issues affecting scope, schedule, and quality.
- Develop and execute communication strategies to ensure organizational visibility on program progress, including regular presentations to NVIDIA executive leadership.
- Encourage continuous improvement and find opportunities to improve cloud infrastructure operations and processes.
Requirements
- 10+ years of technical program management experience driving planning and execution of large-scale engineering programs, with a strong focus on software engineering projects in matrixed organizations.
- Extensive hands-on experience in cloud infrastructure, preferably from a major Cloud Service Provider (CSP) and including AI/ML environments.
- Expert-level proficiency with Jira, Smartsheet, or similar program management tools; ability to guide engineering teams in Agile/Scrum frameworks.
- Exceptional strategic and tactical thinking, with the ability to build consensus and drive program success.
- Comfortable and effective in ambiguous environments.
- Excellent communication and technical presentation skills, especially for executive audiences.
- BS or MS in Electrical Engineering or Computer Science, or equivalent experience.
Ways to stand out
- In-depth knowledge of NVIDIA GPU products, including deployment and bring-up.
- Solid understanding of cloud technologies such as Kubernetes, API integration, and Terraform.
- Significant experience with productivity tools and process automation.
- Deep familiarity with cloud-native product/services environments and AI/ML infrastructure.
Compensation & Benefits
- Base salary ranges by level:
  - Level 4: 160,000 USD - 253,000 USD
  - Level 5: 192,000 USD - 304,750 USD
- Eligible for equity and NVIDIA benefits (link to benefits referenced in original posting).
Additional information
- Location: Santa Clara, CA, United States.
- Role type: Full time.
- Application window: Applications accepted at least until August 21, 2025.
- NVIDIA is an equal opportunity employer committed to diversity and inclusion.