Used Tools & Technologies
Not specified
Required Skills & Competences
Tag name is followed by "@" symbol and proficiency level value.
About proficiency levels:
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Security @ 4
Kubernetes @ 7
CI/CD @ 7
Leadership @ 4
Communication @ 4
Networking @ 4
Jira @ 4
Reporting @ 6
QA @ 6
GPU @ 7
Deep Learning @ 4
Observability @ 4
AI @ 3
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Details
NVIDIA’s deep learning platforms have made a major impact in various fields and are broadly used across leading academic institutions, start-ups, and industry, including the world’s largest Internet companies. We are seeking an experienced and talented technical program manager for NVIDIA's DGX Cloud. We need passionate, hard-working, and creative people to help us deliver value to DGX Cloud customers.
Responsibilities
- Lead the end-to-end execution of NPI programs across engineering, operations, and cloud service provider (CSP) partners.
- Lead the DGX Cloud NPI Early Access Program — enabling processes that provide engineering teams across DGXC early access to critical NVIDIA systems (e.g., VR / VR Ultra) to develop software and automation.
- Drive PLC process for capacity bring-up into system-based, automated solutions — iteratively improving and codifying processes in tooling (including JIRA).
- Coordinate site readiness and infrastructure bring-up activities, including networking, inventory, corporate IT, and security integration.
- Partner with software stack teams to track development, testing, and integration across product phases.
- Define and implement acceptance testing, validation workflows, and readiness gates for new platforms.
- Work closely with stakeholders to develop scalable NPI processes, tools, and dashboards.
- Drive automation efforts for break/fix workflows, telemetry enablement, and system health validation.
- Facilitate regular communication with leadership, engineering, CSP teams, and colocation partners and cultivate a culture of continuous improvement and process innovation.
Requirements
- 12+ years of technical program management experience, focused on infrastructure, hardware/software integration, or cloud platforms.
- Proven success leading NPI or large cross-functional programs in fast-paced environments.
- Experience working with cloud service providers, large-scale data center deployments, or enterprise-scale infrastructure programs.
- Strong understanding of GPU compute, Kubernetes, CI/CD pipelines, and cloud-native services.
- Demonstrated experience building or improving product development processes and team workflows.
- Skilled in tools such as JIRA, Confluence, dashboards, and reporting tools.
- Ability to influence cross-functional teams, including hardware, software, QA, site operations, and product.
- Outstanding communication and leadership skills; ability to collaborate effectively with senior stakeholders.
- BS/MS in Computer Science, Electrical Engineering, related technical field, or equivalent experience.
Ways to stand out
- Experience in launching cloud infrastructure products or large-scale hardware-software systems.
- Previous involvement in New Product Introduction (NPI), including platform bring-up and validation.
- Familiarity with AI infrastructure or GPU-based cloud platforms.
- Experience with process automation, observability (telemetry/metrics), and health check frameworks.
- Passion for building repeatable systems, tools, and cross-organization efficiency at scale.
Compensation & Benefits
- Base salary range: $168,000 - $258,750 USD for Level 4; $200,000 - $322,000 USD for Level 5.
- Eligible for equity and benefits (see www.nvidiabenefits.com).
Additional information
- Applications for this job will be accepted at least until June 5, 2026.
- This posting is for an existing vacancy.
- NVIDIA uses AI tools in its recruiting processes.
- NVIDIA is an equal opportunity employer and committed to fostering an inclusive work environment.