Senior Software Engineer - Deep Learning

at Nvidia
USD 184,000-287,500 per year
SENIOR
✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Algorithms @ 4 TensorFlow @ 4 Communication @ 4 PyTorch @ 4 CUDA @ 4

Details

NVIDIA has been transforming computer graphics and accelerated computing for more than 25 years. In the AI era, its legacy of innovation is fueled by advanced technology and talented people. This role sits on the NVIDIA AI for Media team building state-of-the-art AI models for video streaming and broadcasting. Models are deployed on the NVIDIA Maxine platform for real-time video communication and content creation (https://developer.nvidia.com/maxine) and are also used in the NVIDIA Broadcast App to enhance live streaming and conferencing.

Responsibilities

  • Develop highly efficient and low-cost AI models and algorithms for computer vision and video AI.
  • Optimize performance, latency, and power consumption of AI models for low-power processors and deep learning accelerators.
  • Deploy deep learning models and optimize the inference stack for real-time performance.
  • Deliver benefits of NVIDIA’s hardware and platform software innovations to deep learning solutions.
  • Collaborate closely with deep learning software and hardware teams across NVIDIA to influence roadmaps and deliver solutions.

Requirements

  • Strong experience building and optimizing innovative AI model architectures for video use cases.
  • Strong experience developing efficient models using model pruning, knowledge distillation, post-quantization, and quantization-aware training.
  • Experience analyzing and fine-tuning deep learning pipeline performance (profiling, latency analysis, tuning).
  • Experience building real-time AI models for laptop and cloud use cases.
  • Hands-on development skills with deep learning libraries and frameworks such as PyTorch, TensorFlow, ONNX, and inference/runtime tools such as TensorRT, Triton, WinML and other neural processing SDKs.
  • Collaboration skills to define project scope and roadmap and independently drive development with strong self-motivation.
  • 8+ years of relevant engineering or research experience in deep learning and/or computer vision.
  • BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, or related fields (or equivalent experience).

Ways to stand out

  • Experience with AI inference-accelerating hardware and building/optimizing models on them.
  • Background in performance and latency analysis, profiling and tuning of AI workloads.
  • Experience with CUDA programming and passion for optimizing AI system performance.
  • Experience building platforms for computer vision such as real-time tracking of face, gaze, body, and avatar animation/modeling.

Compensation & Benefits

Your base salary will be determined based on location, experience, and pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD. You will also be eligible for equity and benefits (see https://www.nvidia.com/en-us/benefits/).

Other details

  • Location: Santa Clara, CA, United States.
  • Employment type: Full time.
  • Applications for this job will be accepted at least until November 1, 2025.
  • NVIDIA is an equal opportunity employer and is committed to fostering a diverse work environment.

#deeplearning