Deep Learning Architect - New College Grad 2025

at Nvidia
USD 120,000-235,800 per year
JUNIOR MIDDLE
✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Python @ 3 Algorithms @ 3 Machine Learning @ 6 Mathematics @ 3 CUDA @ 3 GPU @ 3

Details

Today, NVIDIA is tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, encouraging environment where everyone is inspired to do their best work.

We are looking for a Deep Learning Inference Performance Architect - New College Graduate. The Inference Architecture team does hardware-software co-design work focused on accelerating AI inference workloads. You will write performance-optimized low-level code on GPUs while helping guide future GPU architecture decisions.

Responsibilities

  • Develop innovative HW, DSP, GPU and system architectures to extend the state of the art in AI inference performance and efficiency.
  • Analyze and prototype key deep learning and data analytics algorithms and applications.
  • Understand and analyze the interplay of hardware and software architectures on future algorithms and applications.
  • Write efficient software for AI inference, including CUDA kernels, framework-level code, and application-level code.
  • Collaborate across the company with software, research, and product teams to guide the direction of AI.

Requirements

  • Recently completed a MS or PhD in Computer Science, Electrical Engineering, Math or related field (or equivalent experience).
  • Strong mathematical foundation in machine learning and deep learning.
  • Expert programming skills in C, C++, or Python.
  • Familiarity with GPU computing (CUDA or similar) and HPC (MPI, OpenMP).
  • Strong knowledge and coursework in computer architecture.

Preferred / Ways to stand out

  • Background with systems-level performance modeling, profiling, and analysis.
  • Experience in characterizing and modeling system-level performance, executing comparison studies, and documenting and publishing results.
  • Experience in optimizing AI inference workloads with CUDA kernel development.

Compensation & Benefits

  • Base salary ranges provided by level: Level 2: 120,000 USD - 189,750 USD; Level 3: 148,000 USD - 235,750 USD.
  • You will also be eligible for equity and benefits.

Additional information

  • This position offers the opportunity to have real impact in a dynamic, technology-focused company.
  • Applications for this job will be accepted at least until October 5, 2025.
  • NVIDIA is an equal opportunity employer and is committed to fostering a diverse work environment.

Technologies & Skills Mentioned

C, C++, Python, CUDA, GPU computing, MPI, OpenMP, CUDA kernel development, computer architecture, machine learning, deep learning, performance modeling, profiling, DSP, hardware-software co-design, HPC, mathematics