Senior Software Engineer, Deep Learning - MLIR TRT

at Nvidia
USD 184,000-356,500 per year
SENIOR
✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Software Development @ 6 GitHub @ 4 Machine Learning @ 4 TensorFlow @ 4 Mentoring @ 4 Performance Optimization @ 4 PyTorch @ 4 CUDA @ 4 GPU @ 4

Details

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. Today, NVIDIA is tapping into the unlimited potential of AI to define the next era of computing. As part of the Solution Engineering - Automotive Machine Learning team, you will develop and productize NVIDIA's deep learning solutions for autonomous driving vehicles, building compiler technology and workflows to optimize large models for NVIDIA hardware.

Responsibilities

  • Develop compiler technologies to accelerate deep learning inference on NVIDIA hardware platforms for Physical AI.
  • Work across a wide range of abstractions from model fine-tuning and quantization to low-level kernel development and performance optimization.
  • Develop workflows that let users leverage frameworks (e.g., PyTorch, JAX) and compiler technologies/tools (e.g., MLIR, Triton) without forgoing performance.
  • Work with customers to help accelerate their workloads on NVIDIA platforms.
  • Stay up to date with the latest research and innovations in deep learning, implement and experiment with new insights to improve NVIDIA's Physical AI DNNs.

Requirements

  • MS or PhD degree in computer science, computer vision, robotics, computer architecture or equivalent experience (or equivalent experience).
  • 5+ years of work experience in software development.
  • 2+ years of experience developing deep learning frameworks (e.g., PyTorch, JAX, TensorFlow, ONNX) or compiler technologies (e.g., LLVM, MLIR, TVM, Triton).
  • Domain experience in GPU programming technologies (e.g., CUDA C++ and/or DSLs like OpenAI Triton) or system-level optimization for deep learning training or inference.
  • Strong C/C++ programming skills.
  • Familiarity with state-of-the-art deep learning techniques for inference and training.
  • Strong analytical skills and willingness to take action.

Ways to stand out

  • Experience with MLIR, LLVM, or similar compiler technologies.
  • Background with low precision inference, quantization, and compression of DNNs.
  • Experience with GPU programming and building DSLs or optimizing compilers (e.g., graph compiler or kernel generator) for GPUs or other accelerated platforms.
  • Open source project ownership or contributions, healthy GitHub repositories, and mentoring experience.

Compensation & Benefits

  • Base salary ranges by level:
    • Level 4: 184,000 USD - 287,500 USD
    • Level 5: 224,000 USD - 356,500 USD
  • You will also be eligible for equity and benefits. See https://www.nvidiabenefits.com/ for more information.

Applications for this job will be accepted at least until December 19, 2025.

NVIDIA is committed to fostering a diverse work environment and is an equal opportunity employer. They do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.