Senior DL Algorithms Engineer - Inference Performance

at Nvidia
πŸ“ Toronto, Canada
CAD 116,200-247,000 per year
SENIOR
βœ… On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Algorithms @ 4 Performance Optimization @ 4 Microservices @ 4 LLM @ 4 PyTorch @ 6 CUDA @ 1 GPU @ 4

Details

We are seeking a Senior Deep Learning (DL) Algorithms Engineer focused on inference performance. You will work across the full hardware/software stack β€” from GPU architecture to deep learning frameworks β€” to profile, analyze, and optimize inference workloads and help shape NVIDIA's hardware and software roadmap.

Responsibilities

  • Implement language and multimodal model inference as part of NVIDIA Inference Microservices (NIMs).
  • Contribute new features, fix bugs, and deliver production code to TRT-LLM (NVIDIA’s open-source inference serving library).
  • Profile and analyze bottlenecks across the full inference stack to maximize inference performance.
  • Benchmark state-of-the-art DL model inference offerings and perform competitive analysis for NVIDIA software/hardware stack.
  • Collaborate with software/hardware co-design teams to enable next-generation AI-powered services.

Requirements

  • PhD in Computer Science, Electrical Engineering, Computer Systems Engineering, or equivalent experience.
  • 3+ years of relevant experience.
  • Strong background in deep learning and neural networks, particularly inference.
  • Experience with performance profiling, analysis, and optimization, especially for GPU-based applications.
  • Proficient in C++ and PyTorch or equivalent deep learning frameworks.
  • Deep understanding of computer architecture and familiarity with GPU architecture fundamentals.

Ways to stand out

  • Proven experience with processor- and system-level performance optimization.
  • Deep understanding of modern large language model (LLM) architectures.
  • Strong fundamentals in algorithms.
  • GPU programming experience (CUDA or OpenCL) is a strong plus.

Compensation & Benefits

  • Base salary ranges (CAD): Level 3 β€” 116,250 CAD to 201,500 CAD; Level 4 β€” 142,500 CAD to 247,000 CAD.
  • Eligible for equity and additional benefits (see NVIDIA benefits page).
  • Applications accepted at least until October 19, 2025.

Additional information

  • Expected time commitment: Full time.
  • Location: Toronto, Canada (on-site role as specified).