Used Tools & Technologies
Not specified
Required Skills & Competences ?
Software Development @ 8 Python @ 4 Hiring @ 4 Communication @ 4 Mathematics @ 4 Debugging @ 4 LLM @ 4 PyTorch @ 4Details
NVIDIA is hiring an experienced Principal Software Engineer for its TensorRT-LLM team. This role focuses on building AI serving software foundational to NVIDIA's product lines and the broader AI industry. The candidate will work on a fast-paced team requiring excellent interpersonal skills and a track record of leading complex software architecture and products.
Responsibilities
- Architect and guide development of robust inferencing software scalable across multiple platforms for performance and functionality
- Perform performance analysis, optimization, and tuning
- Stay current with advances in artificial intelligence and evolve code design accordingly
- Collaborate across software, research, and product teams to guide AI Inferencing direction
Requirements
- Bachelor's, Master’s or higher degree in Computer Engineering, Computer Science, Applied Mathematics, or related computing field (or equivalent experience)
- 15+ years of relevant software development experience with 2+ years in architect or tech lead roles
- Excellent programming and software design skills in Python or C/C++, including debugging, performance analysis, and test design
- Strong understanding of Generative AI serving and latest deep learning developments such as LLMs
- Experience with LLM inference frameworks like vLLM, SGLang
- Experience with deep learning frameworks such as PyTorch, JAX
- Excellent written and oral English communication skills
Benefits
- Competitive base salary range from 272,000 USD to 425,500 USD, determined by location, experience, and peer pay
- Eligibility for equity and additional benefits
- Hybrid work model
NVIDIA values diversity and is an equal opportunity employer committed to fostering an inclusive work environment.