Senior Technical Program Manager - DGX Cloud Storage

at Nvidia
USD 200,000-322,000 per year
SENIOR
✅ Hybrid

Used Tools & Technologies

Not specified

Required Skills & Competences

Security @ 4 Software Development @ 7 Ceph @ 4 DevOps @ 7 GCP @ 3 AWS @ 3 Azure @ 3 Communication @ 7 Performance Monitoring @ 4 Jira @ 6 Product Management @ 4 Compliance @ 4 Agile @ 7

Details

NVIDIA’s DGX Cloud is redefining how organizations deploy and scale AI infrastructure. This role is a high-impact Technical Program Manager focused on storage-related initiatives across development, operations, and cloud deployment. The position interfaces with engineering, product, operations, finance, and global cloud partners to deliver storage features, deployments, and operational excellence for DGX Cloud.

Responsibilities

  • Lead cross-functional storage programs from requirements gathering through execution and delivery.
  • Drive alignment across NVIDIA storage engineering, operations, cloud service providers, cluster operators, resource governance, and finance.
  • Define project plans, schedules, and milestones for storage features, storage deployments, support, security, compliance, and observability.
  • Partner with engineering and product management to define and deliver the product roadmap.
  • Manage technical risks and resolve blockers that impact quality, scope, and delivery timelines.
  • Coordinate with cross-functional teams to improve workflows, efficiency, and transparency.
  • Ensure program visibility across the organization and maintain strong communication channels with senior stakeholders.
  • Improve organizational efficiency by collaborating with multi-functional leads and optimizing processes.
  • Cultivate a culture of continuous improvement and identify process enhancements.

Requirements

  • 12+ years of experience in program management of large-scale software or infrastructure projects.
  • MS in EE or CS, or equivalent experience.
  • Proven success driving programs across global, distributed teams.
  • Outstanding communication and organizational skills, with the ability to align cross-org stakeholders.
  • Expertise with tools like Jira and Confluence, and the ability to guide teams in their use.
  • Strong foundation in software development, Agile methodologies, and DevOps best practices.
  • Familiarity with cloud platform storage services (AWS, Azure, GCP, OCI) including Block, Object, and File storage.
  • Knowledge of distributed storage systems (SAN, NAS, object storage) and scalable distributed architectures such as Ceph or Lustre.
  • Understanding of storage performance (IOPS, latency, throughput optimization) and capacity planning for large-scale environments.
  • Familiarity with data protection and disaster recovery strategies (snapshots, backups, replication).
  • Understanding storage requirements for AI/ML and HPC workloads (high-throughput training and data pipelines).

Ways to stand out

  • Hands-on experience with storage operations, provisioning, performance monitoring, and troubleshooting.
  • Experience with new product introduction and program managing research teams.

Compensation & Benefits

  • Base salary range: 200,000 USD - 322,000 USD (determined by location, experience, and internal pay equity).
  • Eligible for equity and company benefits.

Additional information

  • #LI-Hybrid
  • Applications for this job will be accepted at least until January 30, 2026.
  • This posting is for an existing vacancy.
  • NVIDIA uses AI tools in its recruiting processes.
  • NVIDIA is an equal opportunity employer and committed to fostering a diverse work environment.