Senior Software Engineer - DGX Cloud API Services

at NVIDIA
USD 168,000-322,000 per year
SENIOR
✅ On-site

Used Tools & Technologies

Not specified

Required Skills & Competences

Go @ 6, Kubernetes @ 4, Terraform @ 4, GCP @ 4, AWS @ 4, Networking @ 4, API @ 4, GPU @ 4

Details

Join NVIDIA's DGX Cloud Kubernetes API Services team and be at the forefront of building GPU-accelerated Kubernetes clusters supporting NVIDIA AI, robotics, and scientific computing projects. As an API Services Software Engineer, you will work across the stack with partner teams to bring NVIDIA's GPUs to life in a cloud or on-prem environment, ensuring end-to-end performance across compute, storage, and networking.

An API Services engineer is above all responsible for ensuring a good customer and developer experience on the DGX Cloud Kubernetes Platform, working with our Runtime and Cluster Architecture teams, among others, to be the voice of the customer. We serve both customers who want to access their GPU compute via Kubernetes for whatever workloads they wish and developers who want to use our API automation to bring their own services, such as node health monitoring, to all of NVIDIA.

Responsibilities

  • Help build out and scale customer-facing APIs and systems for the DGX Cloud Kubernetes Platform.
  • Work with the Runtime and Cluster Architecture teams to provide complete GPU-accelerated Kubernetes clusters to a wide variety of NVIDIA initiatives.
  • Be the voice of our customers to ensure they have a smooth experience accessing the compute they need for the workloads they want.
  • Build platform services for other NVIDIA developers to bring their services to NVIDIA Kubernetes clusters.

Requirements

  • BS/MS in Computer Science or related field (or equivalent experience).
  • 8+ years of relevant work experience.
  • Experience in building foundational SaaS systems at scale, such as API design, user management, or authentication and authorization flows.
  • Proficiency in Go and building Go services at scale.
  • Experience with deploying and maintaining services atop Kubernetes.
  • Experience writing automation with Kubernetes (e.g., Controllers, CustomResourceDefinitions).
  • Background with AWS or GCP and related technologies like S3, GCS, RDS, etc.
  • Ability to solve issues across multiple layers: infrastructure, Kubernetes, application runtime.
  • Ability to communicate effectively across a large organization, both within and outside the Kubernetes Platform organization.

Ways to stand out

  • Experience working on internal tools and services for large engineering organizations.
  • Experience working across multiple layers of cloud infrastructure such as CSP APIs, Terraform, Kubernetes, and custom controllers and automation atop those layers.
  • Experience working deeply in and with the upstream Kubernetes apiserver code.
  • Background with user-facing APIs with a focus on customer and/or developer experience.

Benefits & Compensation

  • Base salary ranges (location, experience, and level dependent):
    • Level 4: $168,000 - $264,500 USD
    • Level 5: $200,000 - $322,000 USD
  • You will also be eligible for equity and benefits. (Link to NVIDIA benefits page provided in original posting.)

Additional information

  • Applications for this job will be accepted at least until September 30, 2025.
  • NVIDIA is committed to fostering a diverse work environment and is an equal opportunity employer. It does not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.