Senior Deep Learning Algorithm Engineer

at NVIDIA
USD 184,000-356,500 per year
SENIOR
✅ On-site

Used Tools & Technologies

Not specified

Required Skills & Competences

Python @ 6, Algorithms @ 4, Communication @ 4, LLM @ 4, CUDA @ 4, GPU @ 7

Details

At NVIDIA, the team focuses on large language models (LLMs) and their application in agentic and reasoning use cases. The role centers on improving algorithmic performance and efficiency of LLM inference systems, developing new inference algorithms and protocols, improving existing models, and integrating improvements into NVIDIA's LLM software stack to handle large-scale, sophisticated tasks.

Responsibilities

  • Research and Development: Explore and incorporate contemporary research on generative AI, agents, and inference systems into the NVIDIA LLM software stack.
  • Workload Analysis and Optimization: Conduct in-depth analysis, profiling, and optimization of agentic LLM workloads to reduce request latency and increase throughput while maintaining workflow fidelity.
  • System Design and Implementation: Design and implement scalable systems to accelerate agentic workflows and efficiently handle datacenter-scale use cases.
  • Collaboration and Communication: Inform future iterations of NVIDIA software, hardware, and systems by engaging with internal teams and external partners and formalizing the strategic requirements their workloads present.

Requirements

  • BS, MS, or PhD in Computer Science, Electrical Engineering, Computer Engineering, or a related field (or equivalent experience).
  • 8+ years of experience in deep learning and deep learning systems design.
  • Proficiency in Python and C++ programming.
  • Strong understanding of computer architecture and GPU/parallel datacenter computing fundamentals.
  • Proven interest and experience in analyzing, modeling, and tuning application performance (profiling and optimization).

Ways to stand out

  • Experience building large-scale LLM inference systems, especially those involving compound AI/agentic workflows.
  • Experience with processor and system-level performance modeling.
  • GPU programming experience with CUDA or OpenCL.

Benefits

  • Base salary range depends on level and location (see Additional information below).
  • Eligibility for equity and other NVIDIA benefits (link referenced in original posting).

Additional information

  • Base salary ranges provided in the posting: 184,000 USD to 287,500 USD for Level 4; 224,000 USD to 356,500 USD for Level 5.
  • Applications accepted through at least July 29, 2025.
  • NVIDIA is an equal opportunity employer committed to diversity and non-discrimination.