Senior Math Libraries Engineer - Sparsity in AI

at Nvidia

📍 Santa Clara, United States

USD 184,000-356,500 per year

SENIOR

✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ^?

Software Development @ 4 Python @ 4 CI/CD @ 4 Communication @ 7 Parallel Programming @ 7 Performance Optimization @ 4 Jira @ 4 Product Management @ 4 Debugging @ 7 Project Management @ 4 PyTorch @ 4 CUDA @ 7 GPU @ 4

Details

We are looking for software engineers to contribute to the design and development of libraries and tools to simplify and accelerate computing for unstructured sparsity in deep learning (DL) and high-performance computing (HPC). Around the world, leading commercial and academic organizations are revolutionizing AI, data analytics, and scientific and engineering simulations, using data centers powered by GPUs and high-performance linear algebra libraries. Applications include LLMs, computer aided engineering, quantum chemistry, autonomous vehicles, computer vision, and more. The team develops GPU-accelerated libraries and SDKs.

Responsibilities

Design and develop a C++-based system to simplify and accelerate computing for unstructured sparsity in DL and HPC on NVIDIA GPUs.
Enable the system in languages and frameworks commonly used in DL, such as Python and PyTorch.
Evaluate and improve the performance of the system on real-life applications.
Improve library quality, performance, and maintainability by writing effective and well-tested production code.
Work closely with product management and internal/external partners to understand feature and performance requirements and contribute to technical roadmaps.

Requirements

BS, MS or PhD degree in Computer Science, Applied Math, or related field (or equivalent experience).
6+ years of experience developing, debugging, and optimizing high-performance software using C++ and parallel programming; ideally for sparse linear algebra applications and using CUDA, MPI, OpenMP, or equivalent technologies.
Experience with domain-specific language design and compiler optimizations, in particular sparse compilers (MLIR or TACO).
Excellent C++, Python, and CUDA programming skills.
Strong collaboration, communication, and documentation habits; ideally experience working in a globally distributed organization.

Ways to stand out / Preferred Qualifications

Strong understanding of sparse computations, especially sparsity in AI and HPC.
Good understanding of LLMs, deep learning methods and frameworks.
Experience with low-level GPU performance optimization.
Understanding of numerical linear algebra methods such as direct and iterative solvers.
Experience adopting and advancing software development practices such as CI/CD systems and project management tools like JIRA.

About the company

NVIDIA develops GPUs and software that power modern AI and parallel computing. The company emphasizes creativity, autonomy, and working at the forefront of technological advancement.

Compensation & Benefits

Base salary ranges (determined by location, experience, and internal pay bands):
- Level 4: 184,000 USD - 287,500 USD
- Level 5: 224,000 USD - 356,500 USD
You will also be eligible for equity and benefits (see NVIDIA benefits).
Applications for this job will be accepted at least until August 22, 2025.

Equal Opportunity

NVIDIA is committed to fostering a diverse work environment and is an equal opportunity employer. They do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or other characteristics protected by law.