DGX Cloud Automation Engineer
at Nvidia
π Santa Clara, United States
USD 168,000-333,500 per year
SCRAPED
Used Tools & Technologies
Not specified
Required Skills & Competences ?
Security @ 3 Docker @ 3 Go @ 3 Kubernetes @ 3 DevOps @ 7 Python @ 3 CI/CD @ 3 Distributed Systems @ 3 AWS @ 3 Communication @ 3 OpenStack @ 3 Load Testing @ 3 IaaS @ 3 QA @ 3 OpenShift @ 3Details
NVIDIA is looking for a Cloud Software Engineer to join the DGX Cloud Engineering Team and help craft the future of AI & GPUs in the Cloud. DGX Cloud is a cloud platform tailored for AI workloads, enabling organizations to move AI projects from development to deployment. The role focuses on building cloud-scale systems, CI/CD and release engineering tools, performance and quality measurement, and automation for on-premises and cloud deployments.
Responsibilities
- Design, build, and implement scalable cloud-based systems for PaaS/IaaS.
- Build and improve development and release processes, CI/CD pipelines, and release engineering tools and processes.
- Create performance and quality measurement and regression management tools.
- Develop, maintain, and improve CI/CD tools for on-prem and cloud deployment of software.
- Collaborate with developers, QA, and Product teams to establish, refine, and streamline the software release process.
- Support, maintain, and document software functionality.
- Work closely with other teams on new products or feature improvements.
Requirements
- BS or MS in Computer Science or equivalent (or equivalent experience) with 10+ years of experience in DevOps and deployment and 2+ years of programming in Golang.
- Demonstrated understanding of cloud design in virtualization, global infrastructure, distributed systems, and security.
- Expertise in Kubernetes (K8s) and KubeVirt.
- Expertise in bringing up bare metal in a datacenter (PXE Boot, DHCP, DNS, OS).
- Experience building RESTful web services.
- Background with Docker and containers.
- Experience with Infrastructure as Code.
- Background with cloud service providers (example: AWS β Fargate, EC2, IAM, ECR, EKS, Route53).
- Experience with Continuous Integration and Continuous Delivery (CI/CD).
- Excellent interpersonal and written communication skills and a track record of solving complex problems with elegant solutions.
Ways to stand out
- Expertise in virtualization technologies such as Firecracker, KVM, OpenStack, Nutanix AHV, and Red Hat OpenShift.
- Prior experience with Go and Python.
- Demonstrated delivery of complex projects in previous roles.
- Experience with load testing frameworks.
- Background with secrets management.
Compensation & Benefits
- Base salary ranges by level:
- Level 4: 168,000 USD - 270,250 USD
- Level 5: 208,000 USD - 333,500 USD
- You will also be eligible for equity and benefits (see NVIDIA benefits page).
Additional information
- Location: Santa Clara, CA, United States.
- Employment type: Full time.
- Applications for this job will be accepted at least until November 7, 2025.
- NVIDIA is an equal opportunity employer committed to a diverse work environment.