Senior DL Algorithms Engineer - Inference Performance
at Nvidia
USD 148,000-287,500 per year
SCRAPED
Used Tools & Technologies
Not specified
Required Skills & Competences ?
Python @ 4 CI/CD @ 4 Algorithms @ 4 MLOps @ 4 Microservices @ 4 CUDA @ 1 GPU @ 4Details
NVIDIA is seeking a Senior DL Algorithms Engineer focused on performance analysis and optimization for Deep Learning workloads, working across the hardware/software stack to maximize GPU and Deep Learning Framework efficiency.
Responsibilities
- Deliver hyper-optimized recipes for DL inference as part of NVIDIA Inference Microservices (NIMs).
- Analyze, validate, and debug performance and accuracy characteristics of optimized models.
- Benchmark state-of-the-art offerings in various DL model inference and conduct competitive analysis for NVIDIA software and hardware stacks.
- Develop software, tooling, and processes across multiple stack layers to streamline and scale delivery of hundreds of optimized DL models.
- Collaborate with software/hardware co-design teams to create next-generation AI-powered services.
Requirements
- PhD in CS, EE, or CSEE or equivalent experience.
- 3+ years of relevant experience.
- Ability to deliver results under tight timelines and changing requirements.
- Strong background in deep learning, neural networks, particularly in inference.
- Deep understanding of computer architecture and fundamentals of GPU architecture.
- Programming skills in C++ and Python.
Ways to Stand Out
- Strong fundamentals in algorithms.
- Experience and understanding of LLMs, VLMs, RAG, and drug discovery models.
- Proven experience in processor and system-level performance modeling.
- Experience with MLOps and DLOps including building CI/CD pipelines.
- GPU programming experience (CUDA or OpenCL) is a plus but not required.
Benefits
- Base salary range of 148,000 USD to 287,500 USD.
- Eligibility for equity and benefits.
- Work in a fast-growing tech company leading the AI revolution with a diverse and inclusive environment.