Senior DL Algorithms Engineer - Inference Performance
at Nvidia
USD 148,000-287,500 per year
SCRAPED
Used Tools & Technologies
Not specified
Required Skills & Competences ?
Algorithms @ 4 Performance Optimization @ 4 Microservices @ 4 LLM @ 4 PyTorch @ 6 CUDA @ 1 GPU @ 6Details
We are seeking a Senior Deep Learning Algorithms Engineer focused on inference performance. You will work across the full hardware/software stack—from GPU architecture to deep learning frameworks—to analyze, profile, and optimize inference workloads and directly influence both software and hardware roadmaps.
Responsibilities
- Implement language and multimodal model inference as part of NVIDIA Inference Microservices (NIMs).
- Contribute new features, fix bugs, and deliver production code to TRT-LLM, NVIDIA's open-source inference serving library.
- Profile and analyze bottlenecks across the full inference stack to push the boundaries of inference performance.
- Benchmark state-of-the-art offerings for various DL model inference and perform competitive analysis for NVIDIA software/hardware stack.
- Collaborate heavily with other software/hardware co-design teams to enable the next generation of AI-powered services.
Requirements
- PhD in Computer Science, Electrical Engineering, CSEE, or equivalent experience.
- 3+ years of relevant experience.
- Strong background in deep learning and neural networks, particularly inference.
- Experience with performance profiling, analysis, and optimization, especially for GPU-based applications.
- Proficient in C++ and PyTorch (or equivalent frameworks).
- Deep understanding of computer architecture and familiarity with GPU architecture fundamentals.
Ways to Stand Out
- Proven experience with processor and system-level performance optimization.
- Deep understanding of modern LLM architectures.
- Strong fundamentals in algorithms.
- GPU programming experience (CUDA or OpenCL) is a strong plus.
Compensation & Benefits
- Base salary ranges by level:
- Level 3: 148,000 USD - 235,750 USD
- Level 4: 184,000 USD - 287,500 USD
- Eligible for equity and company benefits.
Additional Details
- Location: Santa Clara, CA, United States.
- Employment type: Full time.
- Applications accepted at least until July 29, 2025.
- NVIDIA is an equal opportunity employer committed to diversity and inclusion.