Vacancy is archived. Applications are no longer accepted.

Senior Software Engineer - Automated Parallel Programming

at Nvidia

📍 Santa Clara, United States

SENIOR

✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ^?

Machine Learning @ 4 TensorFlow @ 4 Hiring @ 4 Communication @ 4 Parallel Programming @ 4 PyTorch @ 4 CUDA @ 4 GPU @ 4

Details

The PyTorch Team @ NVIDIA is hiring passionate parallel programmers to design and build tools used by millions of AI practitioners deploying AI applications scalable to thousands of GPUs. The team ensures continual delivery of best-in-class experience on NVIDIA's hardware with PyTorch. Collaboration spans multi-disciplinary engineering teams within NVIDIA and the global PyTorch open source community.

Responsibilities

Craft a code generation system to accelerate portions of a graph from machine learning frameworks.
Partner with NVIDIA’s hardware and software teams to improve GPU performance in PyTorch.
Design, build, and support production AI solutions used by enterprise customers and partners.
Optimize performance of modern Deep Learning models from academic and industry research for NVIDIA GPUs and systems.
Collaborate with internal applied researchers to improve their AI tools.
Advise the design of new hardware generations.

Requirements

MS or PhD in Computer Science, Computer Engineering, Electrical Engineering, or related field (or equivalent experience).
Experience in parallel programming with optimized kernels in NVIDIA CUDA Programming Language or similar.
Over 4 years of C++ programming experience.
Demonstrated experience developing large software projects.
Excellent verbal and written communication skills.

Ways to Stand Out

Proven foundation in CPU and GPU architectures, numeric libraries, and modular software design.
Background in deep learning compilers or compiler infrastructure.
Expertise with optimized distributed parallelism techniques, especially including parallelizing Large Language Models.
Knowledge of heuristic generation using cost models, machine learning, or auto-tuning.
Contributions to PyTorch, Numpy, JAX, TensorFlow, OpenAI-Triton, Lightning Thunder, TVM, Halide, or similar systems.

Benefits

Base salary range: $184,000 to $287,500 USD per year, determined by location, experience, and peer pay.
Equity and benefits eligibility.
NVIDIA is an equal opportunity employer committed to diversity and inclusion.