Senior Software Engineer, DGX Cloud Orchestration

at NVIDIA
USD 152,000-287,500 per year
SENIOR
✅ On-site

Used Tools & Technologies

Not specified

Required Skills & Competences

Docker @ 7, Go @ 6, Grafana @ 4, Kubernetes @ 4, Prometheus @ 4, Python @ 6, GCP @ 7, Java @ 6, Distributed Systems @ 4, AWS @ 7, Azure @ 7, Communication @ 4, JavaScript @ 3, Next.js @ 3, React @ 3, Angular @ 3, Debugging @ 7, API @ 4, GraphQL @ 4, OpenTelemetry @ 4, GPU @ 4

Details

We are looking for a Senior Software Engineer to join the DGX Cloud team and build the foundational systems that drive NVIDIA’s high-performance GPU infrastructure. You will play a critical role in designing scalable automation solutions, integrating diverse systems, and enabling seamless workflows across global cloud operations.

Responsibilities

  • Design and develop APIs (GraphQL / REST) to orchestrate and integrate operational workflows.
  • Build state management and workflow automation systems that streamline infrastructure lifecycle processes.
  • Collaborate across teams to codify business processes into scalable, self-measuring systems.
  • Develop extensible, schema-driven platforms to reduce manual toil and ensure operational consistency.
  • Drive integrations with container orchestration tools like Kubernetes and observability systems such as Prometheus, OpenTelemetry, and Grafana.
  • Optimize reliability and efficiency of cloud operations through automated workflows and telemetry systems.
  • Lead and ship impactful technical projects, ensuring quality and scalability at every stage.

Requirements

  • 5+ years of industry experience with a Bachelor’s or Master’s degree (or equivalent experience), or 2+ years with a PhD.
  • Expertise in building GraphQL and REST APIs.
  • Proficiency in programming languages such as Go, Java, or Python.
  • Familiarity with modern JavaScript frameworks (e.g., React, Angular, Next.js).
  • Strong understanding of cloud infrastructure (AWS, GCP, Azure) and container technologies like Docker and Kubernetes.
  • Experience with high-scale distributed systems, including architectural patterns for APIs and data pipelines.
  • Outstanding communication and collaboration skills, with a focus on solving complex operational challenges.
  • A passion for automating manual processes and driving system efficiency.

Ways to Stand Out

  • A track record of designing workflow orchestration systems for large-scale infrastructure.
  • Proven experience in reducing operational inefficiencies through automation and integration.
  • Strong debugging and problem-solving skills in distributed environments.

Compensation & Benefits

  • Base salary range (by level):
    • Level 3: 152,000 USD - 218,500 USD
    • Level 4: 184,000 USD - 287,500 USD
  • You will also be eligible for equity and benefits (see https://www.nvidia.com/en-us/benefits/).

Additional Information

  • Applications for this job will be accepted at least until January 18, 2026.
  • This posting is for an existing vacancy.
  • NVIDIA uses AI tools in its recruiting processes.
  • NVIDIA is an equal opportunity employer and committed to fostering a diverse work environment.