Senior DL Algorithms Engineer - Inference Performance

at NVIDIA
USD 148,000-287,500 per year
SENIOR
✅ On-site

Required Skills & Competences

  • Algorithms @ 4
  • Performance Optimization @ 4
  • Microservices @ 4
  • LLM @ 4
  • PyTorch @ 6
  • CUDA @ 1
  • GPU @ 6

Details

We are seeking a Senior Deep Learning Algorithms Engineer focused on inference performance. You will work across the full hardware/software stack—from GPU architecture to deep learning frameworks—to analyze, profile, and optimize inference workloads and directly influence both software and hardware roadmaps.

Responsibilities

  • Implement language and multimodal model inference as part of NVIDIA Inference Microservices (NIMs).
  • Contribute new features, fix bugs, and deliver production code to TensorRT-LLM (TRT-LLM), NVIDIA's open-source inference serving library.
  • Profile and analyze bottlenecks across the full inference stack to push the boundaries of inference performance.
  • Benchmark state-of-the-art offerings for inference across various DL models and perform competitive analysis for the NVIDIA software/hardware stack.
  • Collaborate heavily with other software/hardware co-design teams to enable the next generation of AI-powered services.

Requirements

  • PhD in Computer Science, Electrical Engineering, CSEE, or equivalent experience.
  • 3+ years of relevant experience.
  • Strong background in deep learning and neural networks, particularly inference.
  • Experience with performance profiling, analysis, and optimization, especially for GPU-based applications.
  • Proficient in C++ and PyTorch (or equivalent frameworks).
  • Deep understanding of computer architecture and familiarity with GPU architecture fundamentals.

Ways to Stand Out

  • Proven experience with processor and system-level performance optimization.
  • Deep understanding of modern LLM architectures.
  • Strong fundamentals in algorithms.
  • GPU programming experience (CUDA or OpenCL) is a strong plus.

Compensation & Benefits

  • Base salary ranges by level:
    • Level 3: 148,000 USD - 235,750 USD
    • Level 4: 184,000 USD - 287,500 USD
  • Eligible for equity and company benefits.

Additional Details

  • Location: Santa Clara, CA, United States.
  • Employment type: Full time.
  • Applications accepted at least until July 29, 2025.
  • NVIDIA is an equal opportunity employer committed to diversity and inclusion.