Senior Software Technical Program Manager - GPU Communication Libraries

at NVIDIA
USD 192,000-304,800 per year
SENIOR
✅ On-site

Required Skills & Competences

  • Software Development @ 4
  • Leadership @ 4
  • Communication @ 4
  • Networking @ 8
  • Parallel Programming @ 4
  • Reporting @ 4
  • QA @ 4
  • Engineering Management @ 4
  • Agile @ 6
  • GPU @ 4

Details

We are looking for an experienced, highly motivated Senior Software Technical Program Manager to lead efforts in developing pioneering compute software solutions for critically important environments. This role focuses on leading and managing GPU communication libraries (NCCL, NVSHMEM, UCX, UCC, OpenSHMEM) for Deep Learning and HPC, collaborating across product, engineering, QA, customer program management, and other stakeholders to drive delivery for cloud service providers and OEM customers.

Responsibilities

  • Lead cross-functional planning, scheduling and execution for software development lifecycles, including defining priorities, removing blockers, and driving milestones and releases.
  • Act as primary point-of-contact for customers and internal stakeholders; lead status meetings and proactively address challenges and customer concerns.
  • Strategically plan and partner across teams to drive software objectives while maintaining schedules and formulating risk management strategies across multiple parallel work streams.
  • Lead product development enhancements and software release processes; collaborate with engineering management to optimize development workflow and efficiency.
  • Translate customer requirements into actionable internal tasks and keep customers informed on issue statuses.
  • Drive virtual reviews and continuous feedback loops; communicate benchmarking results and customer insights to product and engineering leadership.
  • Track and report large-scale performance benchmarking across clusters; build performance dashboards and reporting processes to monitor KPIs and surface performance trends.
  • Collaborate with internal teams and third-party partners across time zones to resolve customer issues and oversee customer releases.
  • Partner with Customer Program Managers to address software issues and technical feedback from OEMs, CSPs, and partners.

Requirements

  • 12+ years of overall experience in the software industry with specialization in HPC networking or system software.
  • 6+ years of program management experience in a similar or related role.
  • BS, MS, or Ph.D. in Computer Science, Computer Engineering, Electrical Engineering, or a related technical field; or equivalent experience.
  • Hands-on experience with software development for hardware platforms, communication runtimes, or high-performance networking, with demonstrated success delivering complex products to customers.
  • Proficiency in Agile software development methodologies.
  • Experience with configuration management tools and productivity-enhancing tools and automation processes.
  • Proven ability to creatively resolve technical and resource issues and to build consensus to ensure program success.
  • Exceptional attention to detail, the ability to multitask in dynamic environments with shifting priorities, and strong communication and technical presentation skills.
  • Previous experience coordinating activities between hardware and software organizations.

Preferred / Ways to stand out

  • Solid understanding of the Deep Learning framework ecosystem for training and inference.
  • Solid understanding of operating systems, datacenter servers, and graphics principles and standards.
  • Background with parallel programming models (MPI, SHMEM) and at least one communication runtime (MPI, NCCL, NVSHMEM, OpenSHMEM, UCX, UCC).
  • Knowledge of a modern programming language and depth in HPC and ML/DL fundamentals.
  • Background with RDMA and high-performance networking technologies (InfiniBand, RoCE, Ethernet, EFA), as well as network architecture and topologies.

Compensation and Benefits

  • Base salary range: 192,000 USD - 304,750 USD (final base salary determined by location, experience, and comparable roles).
  • Eligible for equity and company benefits (see company benefits page).

Other

  • Applications accepted at least until August 24, 2025.
  • NVIDIA is an equal opportunity employer and is committed to fostering a diverse work environment.