Used Tools & Technologies
Not specified
Required Skills & Competences ?
Security @ 4 Kubernetes @ 7 CI/CD @ 7 Leadership @ 4 Communication @ 4 Networking @ 4 Jira @ 6 Reporting @ 6 QA @ 6 GPU @ 7Details
NVIDIA’s deep learning platforms have made major impact in various fields and are broadly used across leading academic institutions, start-ups, and industry, including the world’s largest Internet companies. We are seeking an experienced and talented technical program manager for NVIDIA's DGX Cloud. We need passionate, hard-working, and creative people to help us deliver value to DGX Cloud customers.
Responsibilities
- Lead the end-to-end execution of NPI programs across engineering, operations, and cloud service provider (CSP) partners.
- Build and manage detailed project plans, milestones, and capacity plans for DGX Cloud hardware and software rollouts.
- Manage complex technical collaborations proactively, identifying and resolving critical issues before they impact deployments.
- Coordinate site readiness and infrastructure bring-up activities, including networking, inventory, corp IT, and security integration.
- Partner with software stack teams to track development, testing, and integration across product phases.
- Define and implement acceptance testing, validation workflows, and readiness gates for new platforms.
- Work closely with stakeholders to develop scalable NPI processes, tools, and dashboards.
- Drive automation efforts for break/fix workflows, telemetry enablement, and system health validation.
- Facilitate regular communication with leadership, engineering, CSP teams, and colo partners and cultivate a culture of continuous improvement and process innovation.
Requirements
- 12+ years of technical program management experience, with a focus on infrastructure, hardware/software integration, or cloud platforms.
- Proven track record of leading NPI or large cross-functional programs in fast-paced environments.
- Experience working with cloud service providers, large-scale data center deployments, or enterprise-scale infrastructure programs.
- Strong understanding of GPU compute, Kubernetes, CI/CD pipelines, and cloud-native services.
- Demonstrated experience building or improving product development processes and team workflows.
- Skilled in tools such as JIRA, Confluence, JAMA, dashboards, and reporting tools.
- Ability to influence cross-functional teams, including hardware, software, QA, Site Ops, and Product.
- Outstanding communication and leadership skills, capable of collaborating effectively with senior collaborators.
- BS/MS in Computer Science, Electrical Engineering, related technical field, or equivalent experience.
Ways to stand out
- Experience in launching cloud infrastructure products or large-scale hardware-software systems.
- Previous involvement in New Product Introduction (NPI), including platform bring-up and validation.
- Familiarity with AI infrastructure or GPU-based cloud platforms.
- Experience with process automation, observability (telemetry/metrics), and health check frameworks.
- Passion for building repeatable systems, tools, and cross-organization efficiency at scale.
Compensation & Benefits
- Base salary range: $192,000 - $304,750 USD for Level 5; $232,000 - $368,000 USD for Level 6. Base salary will be determined based on location, experience, and pay of employees in similar positions.
- Eligible for equity and benefits.
Additional information
- Applications for this job will be accepted at least until October 11, 2025.
- NVIDIA is committed to fostering a diverse work environment and is an equal opportunity employer. They do not discriminate on the basis of protected characteristics.
- #LI-Hybrid