Senior AI-Native Systems Software Engineer, TensorRT

at Nvidia

📍 Santa Clara, United States

USD 152,000-287,500 per year

SENIOR

✅ Hybrid

Used Tools & Technologies

LLM GenAI

Required Skills & Competences
Tag name is followed by "@" symbol and proficiency level value. About proficiency levels:

1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;

3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;

7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;

10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.

Software Development @ 6 Python @ 4 C @ 4 C++ @ 7 Communication @ 4 Performance Optimization @ 4 CUDA @ 4 GPU @ 4 Deep Learning @ 3 Generative AI @ 4 AI @ 4 Profiling @ 4 TensorRT @ 4 Performance Analysis @ 4

Details

Are you passionate about redefining how software is built in the age of Generative AI? Join NVIDIA’s TensorRT team to help lead a first-of-its-kind, AI-native initiative designed to make TensorRT the default entry point for out-of-framework inference globally. The team is building a framework from the ground up to leverage swarms of AI agents to produce high-performance, high-quality, modern C++ software at scale.

Responsibilities

Architect an AI-native framework that scales beyond human capacity, supporting many AI agents working in parallel to generate, test, and validate production-grade software.
Scale workflows via agentic tooling: build AI-native tools, multi-agent orchestrators, and codebase harnesses that let humans focus on highest-value work.
Rapid prototyping with state-of-the-art models: scout industry and academic breakthroughs (e.g., attention mechanisms, KV cache strategies) and dispatch agent swarms to prototype and integrate capabilities.
Deliver a great user experience: ensure a seamless, high-performance path to production for modern model families (LLMs, Diffusion, Audio, Vision, and multi-modal models).
Extreme performance optimization: work at the intersection of Python orchestration and C++ engine-level optimizations to achieve major latency and throughput gains for critical customer use cases.

Requirements

BS, MS, or PhD in Computer Science, Computer Engineering, AI, or equivalent experience.
4+ years of relevant software development experience.
Strong modern C++ skills: proficiency with C++11/14/17 (or newer) and the STL; emphasis on clean, maintainable, performant code.
Deep learning familiarity: experience with modern inference frameworks and an understanding of architectural nuances of LLMs, Diffusion, and multi-modal models.
Systems thinking: interest in evolving software architecture to support automated, agent-driven development and indefinitely scaling codebases.
End-to-end product sense: ability to translate high-level customer needs into concrete technical requirements and user-centric solutions.
Pragmatic execution: demonstrated ability to deliver production-quality software on tight timelines.
Collaborative mindset: excellent communication skills and comfort working across internal organizations and with customers.

Ways to stand out

Hands-on experience with AI agent orchestrators or multi-agent coding frameworks, or experience building custom agentic coding harnesses for production software.
CUDA and kernel expertise, or exposure to kernel generation / autotuning efforts.
Track record of rapidly turning state-of-the-art papers into working prototypes in days.
Expertise in software performance analysis, profiling, and optimization (CPU and/or GPU), including tooling-driven improvements.

Compensation & Benefits

Base salary ranges (location and level dependent):
- Level 3: 152,000 USD - 241,500 USD
- Level 4: 184,000 USD - 287,500 USD
Eligible for equity and company benefits (see NVIDIA benefits page).

Additional information

#LI-Hybrid (hybrid work arrangement).
Applications accepted at least until April 25, 2026.
NVIDIA uses AI tools in its recruiting processes and is an equal opportunity employer.