Senior Software Engineer, Deep Learning Inference - New HW Enablement

at NVIDIA

📍 Santa Clara, United States

$220,000-$339,200 per year

SENIOR
✅ Hybrid

Required Skills & Competences

Software Development @ 8, Python @ 6, C @ 4, C++ @ 7

Details

Are you passionate about driving innovation in deep learning and eager to work on cutting-edge AI technology?

Join NVIDIA’s TensorRT team as a Senior Software Engineer, and be at the forefront of technology, enabling support in TensorRT for an evolving landscape of ground-breaking hardware capabilities. Your expertise will help shape the performance and functionality of our products, ensuring NVIDIA remains synonymous with innovation. If you're ready to tackle challenging projects, push the boundaries of AI, and make a significant impact in a company that values creativity, excellence, and teamwork, we want to hear from you!

Responsibilities

  • Orchestrate the integration of new hardware functionalities into TensorRT's compiler and runtime.
  • Work closely with teams and stakeholders across the whole hardware and software stack to understand and leverage new features to improve TensorRT’s functionality and performance.
  • Guide the design and implementation of robust, high-quality C++ code in alignment with modern C++ standards.
  • Contribute to the continuous improvement of software practices and processes within the team.

Requirements

  • Master's, PhD, or equivalent experience in a relevant field (Computer Engineering, Computer Science, Electrical Engineering, AI).
  • At least 12 years of relevant software development experience.
  • Strong C++ skills, including knowledge and application of best practices with C++11 and C++14.
  • Familiarity with deep learning concepts and frameworks.
  • A track record of taking initiative and driving projects to completion.
  • Excellent interpersonal skills and a collaborative, pragmatic approach to solving problems.

Ways to Stand Out from the Crowd

  • Proficiency with Python and/or CUDA, ideally with experience in a professional environment.
  • Background in systems programming, embedded systems, and/or compiler development.
  • Experience in software performance benchmarking, profiling, and optimization.
  • Experience with state-of-the-art deep learning models (such as Large Language Models) and inference frameworks.
  • Experience with C++17.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative, autonomous, and love a challenge, come join our team!