Senior DL Algorithms Engineer - Inference Performance

at Nvidia
USD 148,000-287,500 per year
SENIOR
βœ… On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Algorithms @ 4 Hiring @ 4 Performance Optimization @ 4 Microservices @ 4 LLM @ 4 PyTorch @ 6 CUDA @ 1 GPU @ 4

Details

We are seeking a Senior DL Algorithms Engineer focused on inference performance. The role requires careful performance analysis and optimization across the full hardware/software stack β€” from GPU architecture to deep learning frameworks β€” to maximize performance of Deep Learning workloads. The position offers direct influence on hardware and software roadmaps at NVIDIA.

Responsibilities

  • Implement language and multimodal model inference as part of NVIDIA Inference Microservices (NIMs).
  • Contribute new features, fix bugs, and deliver production code to TRT-LLM, NVIDIA’s open-source inference serving library.
  • Profile and analyze bottlenecks across the full inference stack to push the boundaries of inference performance.
  • Benchmark state-of-the-art DL model inference offerings and perform competitive analysis for NVIDIA software/hardware stack.
  • Collaborate heavily with software/hardware co-design teams to enable the next generation of AI-powered services.

Requirements

  • PhD in Computer Science, Electrical Engineering, CSEE, or equivalent experience.
  • 3+ years of experience.
  • Strong background in deep learning and neural networks, particularly inference.
  • Experience with performance profiling, analysis, and optimization, especially for GPU-based applications.
  • Proficient in C++ and PyTorch (or equivalent frameworks).
  • Deep understanding of computer architecture and familiarity with GPU architecture fundamentals.

Ways to stand out

  • Proven experience with processor and system-level performance optimization.
  • Deep understanding of modern LLM architectures.
  • Strong fundamentals in algorithms.
  • GPU programming experience (CUDA or OpenCL) is a strong plus.

Compensation & Benefits

  • Base salary range (Level 3): 148,000 USD - 235,750 USD.
  • Base salary range (Level 4): 184,000 USD - 287,500 USD.
  • Your base salary will be determined based on your location, experience, and pay of employees in similar positions.
  • Eligible for equity and benefits (see NVIDIA benefits page).

Other details

  • Location: US, CA, Santa Clara.
  • Employment type: Full time.
  • Applications accepted at least until August 8, 2025.
  • NVIDIA is an equal opportunity employer and values diversity in hiring and promotion practices.