Senior DL Algorithms Engineer - Inference Performance
at Nvidia
π Santa Clara, United States
USD 148,000-287,500 per year
SCRAPED
Used Tools & Technologies
Not specified
Required Skills & Competences ?
Algorithms @ 4 Hiring @ 4 Performance Optimization @ 4 Microservices @ 4 LLM @ 4 PyTorch @ 6 CUDA @ 1 GPU @ 4Details
We are seeking a Senior DL Algorithms Engineer focused on inference performance. The role requires careful performance analysis and optimization across the full hardware/software stack β from GPU architecture to deep learning frameworks β to maximize performance of Deep Learning workloads. The position offers direct influence on hardware and software roadmaps at NVIDIA.
Responsibilities
- Implement language and multimodal model inference as part of NVIDIA Inference Microservices (NIMs).
- Contribute new features, fix bugs, and deliver production code to TRT-LLM, NVIDIAβs open-source inference serving library.
- Profile and analyze bottlenecks across the full inference stack to push the boundaries of inference performance.
- Benchmark state-of-the-art DL model inference offerings and perform competitive analysis for NVIDIA software/hardware stack.
- Collaborate heavily with software/hardware co-design teams to enable the next generation of AI-powered services.
Requirements
- PhD in Computer Science, Electrical Engineering, CSEE, or equivalent experience.
- 3+ years of experience.
- Strong background in deep learning and neural networks, particularly inference.
- Experience with performance profiling, analysis, and optimization, especially for GPU-based applications.
- Proficient in C++ and PyTorch (or equivalent frameworks).
- Deep understanding of computer architecture and familiarity with GPU architecture fundamentals.
Ways to stand out
- Proven experience with processor and system-level performance optimization.
- Deep understanding of modern LLM architectures.
- Strong fundamentals in algorithms.
- GPU programming experience (CUDA or OpenCL) is a strong plus.
Compensation & Benefits
- Base salary range (Level 3): 148,000 USD - 235,750 USD.
- Base salary range (Level 4): 184,000 USD - 287,500 USD.
- Your base salary will be determined based on your location, experience, and pay of employees in similar positions.
- Eligible for equity and benefits (see NVIDIA benefits page).
Other details
- Location: US, CA, Santa Clara.
- Employment type: Full time.
- Applications accepted at least until August 8, 2025.
- NVIDIA is an equal opportunity employer and values diversity in hiring and promotion practices.