Senior Software Engineer, Deep Learning - MLIR TRT

at Nvidia

📍 Santa Clara, United States

USD 184,000-356,500 per year

SENIOR

✅ On-site

Used Tools & Technologies

Not specified

Required Skills & Competences
Each tag name is followed by an "@" symbol and a proficiency level. About proficiency levels:

1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;

3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;

7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;

10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.

Software Development @ 6, GitHub @ 4, Machine Learning @ 4, TensorFlow @ 4, Mentoring @ 4, Performance Optimization @ 4, PyTorch @ 4, CUDA @ 4, GPU @ 4

Details

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. Today, NVIDIA is tapping into the unlimited potential of AI to define the next era of computing. As part of the Solution Engineering - Automotive Machine Learning team, you will develop and productize NVIDIA's deep learning solutions for autonomous driving vehicles, building compiler technology and workflows to optimize large models for NVIDIA hardware.

Responsibilities

Develop compiler technologies to accelerate deep learning inference on NVIDIA hardware platforms for Physical AI.
Work across a wide range of abstractions from model fine-tuning and quantization to low-level kernel development and performance optimization.
Develop workflows that let users leverage frameworks (e.g., PyTorch, JAX) and compiler technologies/tools (e.g., MLIR, Triton) without forgoing performance.
Work with customers to help accelerate their workloads on NVIDIA platforms.
Stay up to date with the latest research and innovations in deep learning, and implement and experiment with new insights to improve NVIDIA's Physical AI DNNs.

Requirements

MS or PhD degree in computer science, computer vision, robotics, or computer architecture, or equivalent experience.
5+ years of work experience in software development.
2+ years of experience developing deep learning frameworks (e.g., PyTorch, JAX, TensorFlow, ONNX) or compiler technologies (e.g., LLVM, MLIR, TVM, Triton).
Domain experience in GPU programming technologies (e.g., CUDA C++ and/or DSLs like OpenAI Triton) or system-level optimization for deep learning training or inference.
Strong C/C++ programming skills.
Familiarity with state-of-the-art deep learning techniques for inference and training.
Strong analytical skills and willingness to take action.

Ways to stand out

Experience with MLIR, LLVM, or similar compiler technologies.
Background with low precision inference, quantization, and compression of DNNs.
Experience with GPU programming and building DSLs or optimizing compilers (e.g., graph compiler or kernel generator) for GPUs or other accelerated platforms.
Open source project ownership or contributions, healthy GitHub repositories, and mentoring experience.

Compensation & Benefits

Base salary ranges by level:
- Level 4: 184,000 USD - 287,500 USD
- Level 5: 224,000 USD - 356,500 USD
You will also be eligible for equity and benefits. See https://www.nvidiabenefits.com/ for more information.

Applications for this job will be accepted at least until December 19, 2025.

NVIDIA is committed to fostering a diverse work environment and is an equal opportunity employer. NVIDIA does not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.