Senior Deep Learning Software Engineer, Inference

at Nvidia
USD 148,000-287,500 per year
SENIOR
✅ Hybrid

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Software Development @ 6 Python @ 1 Performance Optimization @ 4 LLM @ 4 PyTorch @ 4 Agile @ 1 CUDA @ 1 GPU @ 4

Details

NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor, you will help design, build, and optimize the GPU-accelerated software that powers today’s most sophisticated AI applications.

Responsibilities

  • Performance optimization, analysis, and tuning of DL models in various domains like LLM, Multimodal and Generative AI.
  • Scale performance of DL models across different architectures and types of NVIDIA accelerators.
  • Contribute features and code to NVIDIA’s inference libraries, vLLM and SGLang, FlashInfer and LLM software solutions.
  • Work with cross-collaborative teams across frameworks, NVIDIA libraries and inference optimization innovative solutions.

Requirements

  • Masters or PhD or equivalent experience in relevant field (Computer Engineering, Computer Science, EECS, AI).
  • 5+ years of relevant software development experience.
  • Excellent C/C++ programming and software design skills. SW Agile skills are helpful and Python experience is a plus.
  • Prior experience with training, deploying or optimizing the inference of DL models in production is a plus.
  • Background with performance modeling, profiling, debug, and code optimization or architectural knowledge of CPU and GPU is a plus.
  • GPU programming experience (CUDA, OAI TRITON or CUTLASS) is a plus.

Ways to Stand out from The Crowd

  • Contribute to deep learning software projects, such as PyTorch, vLLM, and SGLang to drive advancements in the field.
  • Experience with Multi GPU Communications (NCCL, NVSHMEM).

With highly competitive salaries and a comprehensive benefits package, NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to outstanding growth, our special engineering teams are growing fast. If you’re a creative and autonomous engineer with a genuine passion for technology, we want to hear from you!