Senior DevOps Engineer - Accelerated Computing

at NVIDIA
USD 184,000-356,500 per year
SENIOR
✅ Hybrid

Required Skills & Competences

Linux @ 7, DevOps @ 4, Python @ 4, CI/CD @ 4, Distributed Systems @ 7, Git @ 4, Perl @ 4, Debugging @ 7, TAG @ 4, Agile @ 4, CUDA @ 4, GPU @ 4

Details

We are the CUDA Math Libraries team at NVIDIA. We build software that finds its way into AI applications, self-driving cars, and some of the world's fastest supercomputers solving challenges in science, medicine, and engineering. We are looking for a Senior DevOps Engineer (equivalent titles include Developer Experience Engineer, Site Reliability Engineer, Build and Release Engineer, and Continuous Integration Engineer). The role requires strong integrity, reliability, persistence, and problem-solving ability, along with skills in Linux, scripting, debugging, and troubleshooting.

Responsibilities

  • Build a world-class CI/CD platform that empowers engineers to create high-performance software for a diverse ecosystem running on revolutionary hardware at large scale.
  • Be part of a team that pushes the boundaries of what's possible on powerful compute platforms.
  • Run builds and tests on many architectures, operating systems, and devices.
  • Collect and analyze large amounts of data; collaborate to design infrastructure and tools to make sense of it.
  • Build strong working relationships to enable effective teamwork.
  • Work in a highly dynamic environment that requires quick thinking and adaptability.

Requirements

  • 6+ years of relevant industry experience.
  • Proficient with Linux.
  • Bachelor's degree in a related area of study or equivalent experience.
  • Expert in scripting with one or more of Python, Perl, shell, Groovy, or similar languages.
  • Strong background with deploying, configuring, and debugging distributed systems.
  • Familiarity with the software build process (compiling C++ code with GNU Make, CMake, Visual Studio, MSBuild, etc.).
  • Background with some form of source control management (preferably git).
  • Familiar with containers.

Ways to stand out

  • Experience with HPC hardware systems such as compute clusters and HPC software performance benchmarking.
  • System administrator level experience with multi-user Linux servers.
  • Background with GPU accelerated systems.
  • Experience working in environments using Agile processes and methodologies.

Compensation & Benefits

  • Base salary range (location-, level- and experience-dependent):
    • Level 4: 184,000 USD - 287,500 USD
    • Level 5: 224,000 USD - 356,500 USD
  • Eligible for equity and company benefits.

Additional information

  • Applications for this job will be accepted at least until November 24, 2025.
  • NVIDIA is an equal opportunity employer committed to fostering a diverse work environment.
  • Job posting includes the tag: #LI-Hybrid