Software DevOps Engineer

at Nvidia
USD 148,000-276,000 per year
MIDDLE / SENIOR
✅ On-site

Used Tools & Technologies

Not specified

Required Skills & Competences

Software Development @ 3, Docker @ 3, Linux @ 6, DevOps @ 3, Python @ 3, R @ 3, CI/CD @ 3, Hiring @ 3, Bash @ 3, Communication @ 3, gRPC @ 3, Networking @ 2, SRE @ 5, JSON @ 3, GPU @ 3

Details

NVIDIA is looking for an outstanding candidate to solve software integration challenges for next-generation data center platforms. You will work with GPU architectures and advanced AI infrastructure projects, ensuring the seamless integration of high-speed communication and virtualization technologies. You will support products that use the Ethernet and InfiniBand protocols and provide first-tier support to R&D teams, acting as the bridge between pioneering hardware and stable software deployments.

Responsibilities

  • Prioritize and fix issues in complex systems during high-stakes bring-ups and Proofs of Concept (PoCs) for next-generation computing architectures.
  • Manage integration of large-scale products involving GPUs, complex network stacks, firmware, and drivers.
  • Create, recreate, and redeploy software artifacts: fix code, update builds, or provide workarounds to unblock development.
  • Serve as the primary technical point of contact for R&D teams to resolve immediate infrastructure and integration blockers.
  • Work closely with R&D, Verification, and DevOps teams to streamline CI/CD pipelines for specialized high-speed interconnect and system management projects.

Requirements

  • Bachelor of Science in Computer Science or similar academic degree, or equivalent experience.
  • Proven software engineering background and a deep understanding of standard software development methodologies.
  • Strong knowledge of modern Linux-based operating systems (Ubuntu/RHEL).
  • 5+ years of experience in DevOps, SRE, or Systems Integration roles.
  • Experience with containerization using Docker.
  • Coding skills in C/C++, Python, and Bash for automation and system-level fixes.
  • Experience with GitLab and GitLab CI for managing complex build pipelines.
  • Ability to multi-task, self-manage in a fast-paced environment, and lead technically during critical system failures.
  • Excellent problem-solving and critical thinking abilities.

Ways to stand out (Preferred / Nice-to-have)

  • In-depth knowledge of high-performance networking (InfiniBand, Ethernet).
  • Practical experience with gRPC, gNMI, REST, and JSON for system management and telemetry.
  • Proven track record of working on large-scale HW+SW converged systems (e.g., rack-scale computing or GPU clusters).

Benefits

  • Competitive base salary (see ranges below) and eligibility for equity and benefits.
  • Generous benefits package (link referenced in original posting).

Compensation and additional information

  • Base salary ranges provided in the posting:
    • Level 3: 148,000 USD - 235,750 USD
    • Level 4: 176,000 USD - 276,000 USD
  • You will also be eligible for equity and benefits.
  • Applications accepted at least until January 24, 2026. This posting is for an existing vacancy.
  • NVIDIA uses AI tools in its recruiting processes.
  • NVIDIA is an equal opportunity employer and values diversity in hiring and promotion practices.