Senior Software Engineer, Deep Learning Inference Workflows
at NVIDIA
USD 184,000-287,500 per year
Required Skills & Competences
Software Development @ 7, DevOps @ 4, Python @ 4, C @ 4, C++ @ 7, Machine Learning @ 7, Communication @ 4, LLM @ 4, PyTorch @ 4, GPU @ 4
Details
We are looking for a Senior Software Engineer to make a significant impact in Deep Learning by building a state-of-the-art inference framework to accelerate Deep Learning models, especially Large Language Models, on NVIDIA GPUs.
Responsibilities
- Develop components of TensorRT, NVIDIA’s SDK for high-performance deep learning inference.
- Use C++ and Python to build graph parsers, optimizers, and tools for effective deployment of trained deep learning models.
- Collaborate with deep learning experts, GPU architects, and DevOps engineers across diverse teams.
Requirements
- Bachelor's, Master's, or PhD degree, or equivalent experience, in Computer Science, Computer Engineering, Electrical Engineering, or a related field.
- 6+ years of software development experience.
- Strong experience with C++11, C++14, and C++17.
- Strong understanding of Machine Learning concepts, especially Natural Language Processing.
- Excellent communication skills and aptitude for collaboration and teamwork.
Ways to Stand Out
- Proficiency in Python.
- Experience in software performance benchmarking, profiling, and optimizations.
- Background in compiler development.
- Experience working with TensorRT, PyTorch, ONNX Runtime, JAX, TRT-LLM, vLLM, SGLang, or other ML frameworks.
- Experience with HuggingFace Diffusers and Transformers libraries.
Benefits
- Competitive salary range of $184,000 to $287,500 USD per year depending on location and experience.
- Eligibility for equity and benefits.
- Work in a creative, autonomous, and challenging environment at NVIDIA, a leading tech employer.
- Commitment to diversity and equal opportunity employment.