Required Skills & Competences
Security, Kubernetes, CI/CD, Leadership, Communication, Networking, Jira, Reporting, QA, GPU
Details
NVIDIA’s deep learning platforms are used across leading academic institutions, start-ups, and major internet companies. This role is for a DGX Cloud NPI Technical Program Manager to enable the seamless introduction of new GPU platforms into hyperscale and colocation data centers. You will lead cross-functional programs spanning hardware, software, infrastructure, and operations to drive roadmap execution and create scalable processes and tools that accelerate time-to-production for each new GPU generation.
Responsibilities
- Lead end-to-end execution of NPI programs across engineering, operations, and cloud service provider (CSP) partners.
- Build and manage detailed project plans, milestones, and capacity plans for DGX Cloud hardware and software rollouts.
- Manage complex technical collaborations, proactively identifying and resolving critical issues before they impact deployments.
- Coordinate site readiness and infrastructure bring-up activities, including networking, inventory, corporate IT, and security integration.
- Partner with software stack teams to track development, testing, and integration across product phases.
- Define and implement acceptance testing, validation workflows, and readiness gates for new platforms.
- Develop scalable NPI processes, tools, and dashboards; drive automation for break/fix workflows, telemetry enablement, and system health validation.
- Facilitate regular communication with leadership, engineering, CSP teams, and colocation partners; cultivate continuous improvement and process innovation.
Requirements
- 12+ years of technical program management experience, focused on infrastructure, hardware/software integration, or cloud platforms.
- Proven track record of leading NPI or large cross-functional programs in fast-paced environments.
- Experience working with cloud service providers, large-scale data center deployments, or enterprise-scale infrastructure programs.
- Strong understanding of GPU compute, Kubernetes, CI/CD pipelines, and cloud-native services.
- Demonstrated experience building or improving product development processes and team workflows.
- Proficiency with tools such as Jira, Confluence, and Jama, as well as dashboards and reporting tools.
- Ability to influence cross-functional teams, including hardware, software, QA, site operations, and product.
- Outstanding communication and leadership skills; able to collaborate effectively with senior stakeholders.
- BS/MS in Computer Science, Electrical Engineering, a related technical field, or equivalent experience.
Ways to stand out from the crowd
- Experience launching cloud infrastructure products or large-scale hardware-software systems.
- Previous involvement in New Product Introduction (NPI), platform bring-up, and validation.
- Familiarity with AI infrastructure or GPU-based cloud platforms.
- Experience with process automation, observability (telemetry/metrics), and health check frameworks.
- Passion for building repeatable systems, tools, and cross-organization efficiency at scale.
Compensation & Benefits
- Base salary ranges by level:
  - Level 5: 192,000 USD - 304,750 USD
  - Level 6: 232,000 USD - 368,000 USD
- Eligible for equity and company benefits. The final base salary will be determined based on location, experience, and internal pay parity.
Other details
- Employment type: Full time
- Application window: Applications accepted at least until August 29, 2025
- NVIDIA is an equal opportunity employer and values diversity.
- Work model: Hybrid