Senior Performance Engineer

at Nvidia

📍 Santa Clara, United States

USD 184,000-356,500 per year

SENIOR

✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ^?

Linux @ 4 Python @ 4 Bash @ 4 Communication @ 4 Mathematics @ 4 PyTorch @ 4 CUDA @ 4 GPU @ 4

Details

NVIDIA is seeking a Senior Performance Engineer focused on at-scale AI system performance and datacenter applications. You will work with accelerated computing and deep learning software and hardware platforms to provide insights on at-scale system design and tuning mechanisms for large-scale compute runs. You will collaborate with researchers, developers, customers, and specialists in HPC, OS, CPU and GPU compute to architect, develop and bring up large scale performance platforms.

Responsibilities

Lead implementation of performance practices in large scale infrastructure; deliver tools, methodologies, and flows to validate and improve multiple datacenter products in parallel.
Align next-generation AI workloads on top of next-generation datacenter designs through early engagement with hardware, firmware, software, platform, internal and customer teams.
Deliver engineering solutions that provide continuous insights into AI workload performance across evolving environments, and generate quick insights for improvements and regressions over time.
Decompose high-complexity performance or stability issues into minimal reproduction cases and pursue root cause analysis.
Participate in engagements with software and firmware (BMC/SBIOS/OS/drivers) teams to develop best-in-class practices and tools; analyze, debug, and resolve critical firmware and software issues for optimal AI workload performance at scale.

Requirements

8+ years of experience using accelerated computing for datacenter container computing solutions.
Proven understanding of accelerated computing software stacks and deep learning frameworks (CUDA, PyTorch).
Experience with modern cloud and container-based enterprise computing architectures.
Programming and scripting experience: C/C++, Python, Bash.
Experience with CPU architecture.
Experience with container technology and Linux-based operating systems.
Understanding of collective communication and common patterns in AI workloads.
Experience working with engineering or academic research communities supporting high performance computing or deep learning.
Strong verbal and written communication skills, teamwork, and analytical problem-solving skills.
BS in Engineering, Mathematics, Physics, or Computer Science required; MS or PhD desirable (or equivalent experience).

Ways to Stand Out

At-scale deep learning training experience.
Deep learning and graph compiling programming skills.
Exposure to virtualization techniques and cloud platform solutions.
Exposure to scheduling and resource management systems.
Experience with large scale HPC environments.

Compensation and Benefits

Base salary range for Level 4: 184000 USD - 287500 USD.
Base salary range for Level 5: 224000 USD - 356500 USD.
You will also be eligible for equity and benefits (see https://www.nvidia.com/en-us/benefits/).

Application Information

Applications for this job will be accepted at least until July 29, 2025.

Equal Opportunity

NVIDIA is committed to fostering a diverse work environment and is proud to be an equal opportunity employer. They do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.