Senior Performance Engineer

at Nvidia
USD 184,000-356,500 per year
SENIOR
✅ On-site

Used Tools & Technologies

Not specified

Required Skills & Competences

Linux @ 4, Python @ 4, Bash @ 4, Communication @ 4, Mathematics @ 4, PyTorch @ 4, CUDA @ 4

Details

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. Today the company is focused on AI and accelerated computing platforms. This role is for a Senior Performance Engineer working on at-scale AI system performance and datacenter applications. You will work with accelerated computing and deep learning software and hardware platforms to provide insights on at-scale system design and on tuning mechanisms for large-scale compute runs, and to develop workflows and tools for improved performance.

Responsibilities

  • Lead implementation of performance practices in large-scale infrastructure; deliver tools, methodologies, and flows to validate and improve multiple datacenter products in parallel.
  • Align next-generation AI workloads with next-generation datacenter designs through early engagement with hardware, firmware, software, platform, and customer teams.
  • Deliver engineering solutions for continuous insights into AI workload performance across evolving environments, highlighting improvements and regressions over time.
  • Decompose high-complexity performance or stability issues into minimal reproduction cases and drive them to root cause.
  • Collaborate with SW and FW teams (BMC/SBIOS/OS/drivers) to develop best-in-class practices and tools; analyze, debug, and resolve firmware and software issues for optimal AI workload performance at scale.

Requirements

  • 8+ years of experience using accelerated computing for datacenter container computing solutions.
  • Proven understanding of accelerated computing software stacks and deep learning frameworks (CUDA, PyTorch).
  • Experience with modern cloud and container-based enterprise computing architectures.
  • Programming/scripting experience in C/C++, Python, and Bash.
  • Experience with CPU architecture.
  • Experience with container technology and Linux-based operating systems.
  • Understanding of collective communication patterns in AI workloads.
  • Experience working with engineering or academic research communities supporting high performance computing or deep learning.
  • Strong verbal and written communication and teamwork skills.
  • Action-driven, with strong analytical skills.
  • BS in Engineering, Mathematics, Physics, or Computer Science required; MS or PhD desirable (or equivalent experience).

Ways to Stand Out

  • At-scale deep learning training experience.
  • Deep learning and graph compiling programming skills.
  • Exposure to virtualization techniques and cloud platform solutions.
  • Exposure to scheduling and resource management systems.
  • Experience with large-scale HPC environments.

Compensation & Benefits

  • Base salary ranges by level:
    • Level 4: 184,000 USD - 287,500 USD per year
    • Level 5: 224,000 USD - 356,500 USD per year
  • You will also be eligible for equity and benefits (see NVIDIA benefits).

Additional Information

  • Applications accepted at least until July 29, 2025.
  • NVIDIA is an equal opportunity employer and is committed to fostering a diverse work environment.