Senior Deep Learning Software Engineer, LLM Performance

at Nvidia
USD 184,000-356,500 per year
SENIOR
✅ Hybrid

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Software Development @ 7 Python @ 7 Algorithms @ 4 TensorFlow @ 4 Performance Optimization @ 4 Debugging @ 4 LLM @ 4 PyTorch @ 4 CUDA @ 4 GPU @ 4

Details

NVIDIA is seeking an experienced Deep Learning Engineer passionate about analyzing and improving the performance of LLM (Large Language Model) inference. The role involves research and development for Deep Learning Inference, focusing on GPU-accelerated deep learning software such as TensorRT, DL benchmarking software, and solutions for deploying and serving deep learning models.

Responsibilities

  • Performance optimization, analysis, and tuning of LLM, VLM, and GenAI models for deep learning inference, serving, and deployment.
  • Scaling performance of LLM models across various NVIDIA architectures and accelerators.
  • Optimizing for maximum throughput, minimum latency, and throughput under latency constraints.
  • Contributing features and code to NVIDIA and open-source LLM frameworks, inference benchmarking frameworks, TensorRT, and Triton.
  • Collaborating with cross-functional teams across generative AI, automotive, image understanding, and speech understanding to develop innovative solutions.

Requirements

  • Bachelor’s, Master’s, PhD or equivalent experience in Computer Engineering, Computer Science, EECS, AI or related fields.
  • At least 8 years of relevant software development experience.
  • Excellent programming skills in Python, C, and C++ with strong software design and engineering capabilities.
  • Experience with deep learning frameworks such as PyTorch, JAX, or TensorFlow.

Ways to Stand Out

  • Prior experience with LLM frameworks or deep learning compilers related to inference, deployment, algorithms, or implementation.
  • Experience with performance modeling, profiling, debugging, and optimizing high-performance computing or deep learning applications.
  • Architectural knowledge of CPU and GPU.
  • GPU programming experience using CUDA or OpenCL.

About NVIDIA

NVIDIA has driven breakthroughs in deep learning by providing GPUs that power AI applications including LLM, generative AI, recommenders, and vision. Their architecture integrates AI and computer graphics, becoming the foundation for machines that learn and reason using human language. NVIDIA is recognized as “the AI computing company.”

Benefits

The base salary range is $184,000 to $356,500 USD, depending on location, experience, and pay parity. Equity and additional benefits are available. NVIDIA supports a diverse and inclusive work environment and is an equal opportunity employer.

#LI-Hybrid