Senior Systems Software Engineer, GoLang

at Nvidia
USD 148,000-287,500 per year
SENIOR
✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Software Development @ 4 Docker @ 4 Go @ 4 Grafana @ 4 Jenkins @ 4 Kubernetes @ 4 Linux @ 7 Prometheus @ 4 Python @ 4 GitHub @ 4 CI/CD @ 4 Distributed Systems @ 4 Helm @ 4 React @ 4 Node.js @ 4 Rust @ 4 Debugging @ 7 API @ 4 CUDA @ 4 GPU @ 4

Details

NVIDIA is looking for outstanding software engineers to help expand enterprise GPU management and monitoring tools. In this role you will design and build cloud-native management agents, Kubernetes integrations, and end-to-end integration solutions that combine GPUs with the datacenter software management ecosystem. Work will span HPC, cloud, and enterprise on both bare metal and virtualized platforms. Contributions include telemetry and metrics, health checks, diagnostics, configuration, and system management for both passive monitoring and active online management. Code will support single-node developer systems through large clusters with thousands of nodes.

Responsibilities

  • Develop and maintain distributed, robust and scalable Go programs deployed to Kubernetes environments that manage large datacenters.
  • Develop and maintain user-space applications, containers, Go-bindings, and CLI tools.
  • Enable GPU management integration with the open-source ecosystem, including Kubernetes and Docker.
  • Support internal and external users through bug fixes, documentation, and feature improvements.
  • Maintain high-quality products through robust test coverage.

Requirements

  • BS or higher in Computer Science or equivalent experience.
  • 5+ years of meaningful industry experience with a strong Go and Kubernetes development background.
  • Strong Linux background, including user-space development and debugging expertise in Linux environments.
  • Experience with APIs and interface design.
  • Outstanding written and verbal interpersonal skills; business-level English.
  • Strong motivation and commitment to learn new skills.
  • Ability to execute all aspects of the software development lifecycle and manage time in a fast, heavily multitasked environment.
  • Development experience with Rust, Python and/or C, C++.
  • Development experience with distributed systems and concurrent applications, especially in a Kubernetes environment.
  • Experience developing and maintaining enterprise software, and deploying, managing, and debugging applications in Kubernetes.

Ways to stand out

  • Background with containers (e.g., Docker, OCI), orchestration frameworks, and logging/telemetry backends with Kubernetes monitoring stacks such as Prometheus, Loki and Grafana.
  • Experience with modern UI development in React and Node.js or similar frameworks.
  • Experience developing Kubernetes operators or Helm charts.
  • Experience with HPC job schedulers like Slurm or Run.AI.
  • Familiarity with Kubernetes internals.
  • Exposure to GPU programming with CUDA.
  • Experience with Jenkins and GitHub/GitLab CI/CD pipelines.

Compensation & Benefits

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 148,000 USD - 235,750 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4. You will also be eligible for equity and benefits.

Other information

  • Applications for this job will be accepted at least until December 16, 2025.
  • NVIDIA is an equal opportunity employer and is committed to fostering a diverse work environment.