Senior Technical Program Manager - DGX Cloud Storage

at Nvidia
USD 192,000-368,000 per year
SENIOR
✅ Hybrid

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Security @ 4 Software Development @ 7 Ceph @ 4 DevOps @ 7 GCP @ 3 AWS @ 3 Azure @ 3 Communication @ 7 Performance Monitoring @ 4 Jira @ 6 Product Management @ 4 Compliance @ 4 Agile @ 7

Details

NVIDIA’s DGX Cloud is redefining how organizations deploy and scale AI infrastructure. This role is a Senior Technical Program Manager focused on storage-related initiatives across development, operations, and cloud deployment. It is a high-impact position interfacing with engineering, product, operations, finance, and global cloud partners. The role is hybrid (see office policy) and will drive cross-functional storage programs from requirements gathering through execution and delivery.

Responsibilities

  • Lead cross-functional storage programs and act as the connective tissue across teams.
  • Drive alignment across NVIDIA storage engineering, operations, cloud service providers, cluster operators, resource governance, and finance.
  • Define project plans, schedules, and success criteria for storage features, deployments, support, security, compliance, and observability.
  • Partner with engineering and product management to define and deliver the product roadmap.
  • Manage technical risks and resolve blockers impacting quality, scope, and delivery timelines.
  • Coordinate with cross-functional teams to improve workflows, efficiency, and transparency.
  • Ensure program visibility across the organization and maintain strong communication channels with senior stakeholders.
  • Improve organizational efficiency by collaborating with multi-functional leads and optimizing processes.
  • Cultivate a culture of continuous improvement and find opportunities for process enhancements.

Requirements

  • 12+ years of experience in program management of large-scale software or infrastructure projects.
  • MS in Electrical Engineering (EE) or Computer Science (CS), or equivalent experience.
  • Proven success driving programs across global, distributed teams.
  • Outstanding communication and organizational skills, with the ability to align cross-org stakeholders.
  • Expertise with tools like Jira and Confluence, and the ability to guide teams in their use.
  • Strong foundation in software development, Agile methodologies, and DevOps best practices.
  • Familiarity with Cloud Platforms and their storage services (AWS, Azure, GCP, OCI) — Block, Object, File storage.
  • Knowledge of distributed storage systems: SAN, NAS, object storage, and scalable distributed architectures such as Ceph or Lustre.
  • Understanding of storage performance concepts (IOPS, latency, throughput) and capacity planning for large-scale environments.
  • Familiarity with data protection and disaster recovery strategies: snapshots, backups, replication, DR.
  • Understanding of storage requirements for AI/ML and HPC workloads (high-throughput training and data pipelines).

Ways to stand out

  • Hands-on experience with storage operations, provisioning, performance monitoring, and troubleshooting.
  • Experience with new product introduction and program managing research teams.

Compensation & Benefits

  • Base salary range (determined by location, experience, and comparable pay):
    • Level 5: 192,000 USD - 304,750 USD
    • Level 6: 232,000 USD - 368,000 USD
  • Eligible for equity and NVIDIA benefits. See NVIDIA benefits for details.

Additional Information

  • Location: Santa Clara, CA, United States (hybrid).
  • Application window: Applications will be accepted at least until August 23, 2025.
  • NVIDIA is an equal opportunity employer committed to fostering a diverse work environment.