Senior System Software Engineer - Dynamo And Triton Inference Server

at Nvidia
πŸ“ United States
USD 184,000-356,500 per year
SENIOR
βœ… Remote

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Python @ 4 Algorithms @ 7 Distributed Systems @ 4 Hiring @ 4 gRPC @ 4 Protobuf @ 4 Rust @ 4 Debugging @ 4 HTTP @ 4 JSON @ 4 PyTorch @ 7 Agile @ 4 GPU @ 4

Details

We are now looking for a Senior System Software Engineer to work on Dynamo and Triton Inference Server. NVIDIA is hiring software engineers for its GPU-accelerated deep learning software team. Academic and commercial groups around the world are using GPUs to power a revolution in AI, enabling breakthroughs in problems from image classification to speech recognition to natural language processing. We are a fast-paced team building back-end services and software to make design and deployment of new AI models easier and accessible to all users.

Responsibilities

  • Develop open source software to serve inference of trained AI models running on GPUs.
  • Build robust, scalable, high performance software components to support distributed inference workloads.
  • Work with team leads to prioritize features and capabilities.
  • Load-balance asynchronous requests across available resources.
  • Optimize prediction throughput under latency constraints.
  • Integrate the latest open source technology.

Requirements

  • Masters or PhD or equivalent experience.
  • 8+ years in Computer Science, Computer Engineering, or related field.
  • Ability to work in a fast-paced, agile team environment.
  • Excellent Rust, Python, C++ programming and software design skills, including debugging, performance analysis, and test design.
  • Experience with high scale distributed systems and ML systems.

Ways to stand out from the crowd

  • Prior work experience improving performance of AI inference systems.
  • Background with deep learning algorithms and frameworks, especially experience with Large Language Models and frameworks such as PyTorch, TensorRT, and ONNX Runtime.
  • Experience building and deploying cloud services using HTTP REST, gRPC, protobuf, JSON and related technologies.
  • Familiarity with the latest AI research and working knowledge of how these systems are efficiently implemented.

Benefits

  • Base salary range: 184,000 USD - 356,500 USD.
  • Eligibility for equity and benefits.
  • NVIDIA is an equal opportunity employer valuing diversity in hiring and promotion practices.