Senior DL Algorithms Engineer - Inference Performance
at Nvidia
CAD 116,200-247,000 per year
SCRAPED
Used Tools & Technologies
Not specified
Required Skills & Competences ?
Algorithms @ 4 Performance Optimization @ 4 Microservices @ 4 LLM @ 4 PyTorch @ 6 CUDA @ 1 GPU @ 4Details
We are seeking a Senior Deep Learning (DL) Algorithms Engineer focused on inference performance. You will work across the full hardware/software stack β from GPU architecture to deep learning frameworks β to profile, analyze, and optimize inference workloads and help shape NVIDIA's hardware and software roadmap.
Responsibilities
- Implement language and multimodal model inference as part of NVIDIA Inference Microservices (NIMs).
- Contribute new features, fix bugs, and deliver production code to TRT-LLM (NVIDIAβs open-source inference serving library).
- Profile and analyze bottlenecks across the full inference stack to maximize inference performance.
- Benchmark state-of-the-art DL model inference offerings and perform competitive analysis for NVIDIA software/hardware stack.
- Collaborate with software/hardware co-design teams to enable next-generation AI-powered services.
Requirements
- PhD in Computer Science, Electrical Engineering, Computer Systems Engineering, or equivalent experience.
- 3+ years of relevant experience.
- Strong background in deep learning and neural networks, particularly inference.
- Experience with performance profiling, analysis, and optimization, especially for GPU-based applications.
- Proficient in C++ and PyTorch or equivalent deep learning frameworks.
- Deep understanding of computer architecture and familiarity with GPU architecture fundamentals.
Ways to stand out
- Proven experience with processor- and system-level performance optimization.
- Deep understanding of modern large language model (LLM) architectures.
- Strong fundamentals in algorithms.
- GPU programming experience (CUDA or OpenCL) is a strong plus.
Compensation & Benefits
- Base salary ranges (CAD): Level 3 β 116,250 CAD to 201,500 CAD; Level 4 β 142,500 CAD to 247,000 CAD.
- Eligible for equity and additional benefits (see NVIDIA benefits page).
- Applications accepted at least until October 19, 2025.
Additional information
- Expected time commitment: Full time.
- Location: Toronto, Canada (on-site role as specified).