DGX Cloud Automation Engineer

at Nvidia
USD 168,000-333,500 per year
MIDDLE
βœ… On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Security @ 3 Docker @ 3 Go @ 3 Kubernetes @ 3 DevOps @ 7 Python @ 3 CI/CD @ 3 Distributed Systems @ 3 AWS @ 3 Communication @ 3 OpenStack @ 3 Load Testing @ 3 IaaS @ 3 QA @ 3 OpenShift @ 3

Details

NVIDIA is looking for a Cloud Software Engineer to join the DGX Cloud Engineering Team and help craft the future of AI & GPUs in the Cloud. DGX Cloud is a cloud platform tailored for AI workloads, enabling organizations to move AI projects from development to deployment. The role focuses on building cloud-scale systems, CI/CD and release engineering tools, performance and quality measurement, and automation for on-premises and cloud deployments.

Responsibilities

  • Design, build, and implement scalable cloud-based systems for PaaS/IaaS.
  • Build and improve development and release processes, CI/CD pipelines, and release engineering tools and processes.
  • Create performance and quality measurement and regression management tools.
  • Develop, maintain, and improve CI/CD tools for on-prem and cloud deployment of software.
  • Collaborate with developers, QA, and Product teams to establish, refine, and streamline the software release process.
  • Support, maintain, and document software functionality.
  • Work closely with other teams on new products or feature improvements.

Requirements

  • BS or MS in Computer Science or equivalent (or equivalent experience) with 10+ years of experience in DevOps and deployment and 2+ years of programming in Golang.
  • Demonstrated understanding of cloud design in virtualization, global infrastructure, distributed systems, and security.
  • Expertise in Kubernetes (K8s) and KubeVirt.
  • Expertise in bringing up bare metal in a datacenter (PXE Boot, DHCP, DNS, OS).
  • Experience building RESTful web services.
  • Background with Docker and containers.
  • Experience with Infrastructure as Code.
  • Background with cloud service providers (example: AWS β€” Fargate, EC2, IAM, ECR, EKS, Route53).
  • Experience with Continuous Integration and Continuous Delivery (CI/CD).
  • Excellent interpersonal and written communication skills and a track record of solving complex problems with elegant solutions.

Ways to stand out

  • Expertise in virtualization technologies such as Firecracker, KVM, OpenStack, Nutanix AHV, and Red Hat OpenShift.
  • Prior experience with Go and Python.
  • Demonstrated delivery of complex projects in previous roles.
  • Experience with load testing frameworks.
  • Background with secrets management.

Compensation & Benefits

  • Base salary ranges by level:
    • Level 4: 168,000 USD - 270,250 USD
    • Level 5: 208,000 USD - 333,500 USD
  • You will also be eligible for equity and benefits (see NVIDIA benefits page).

Additional information

  • Location: Santa Clara, CA, United States.
  • Employment type: Full time.
  • Applications for this job will be accepted at least until November 7, 2025.
  • NVIDIA is an equal opportunity employer committed to a diverse work environment.