Senior Math Libraries Engineer – Emulation in AI and HPC
Required Skills & Competences
Python @ 4, Algorithms @ 4, Hiring @ 4, Leadership @ 4, Communication @ 4, Performance Optimization @ 1, Product Management @ 4, Technical Leadership @ 4, NLP @ 4, CUDA @ 6, GPU @ 4
Details
We are looking for software engineers to join our math libraries teams for AI and HPC kernel generation, specifically targeting emulation of math operations across different precisions. Around the world, leading commercial and academic organizations are revolutionizing AI, scientific and engineering simulations, and data analytics, using data centers powered by GPUs. These technologies are applied in healthcare, NLP, VR, deep learning, autonomous vehicles, and countless other fields. Our team develops the GPU-accelerated math libraries that make all of this possible. If you enjoy tinkering with bits and precision formats in math operations, and want to apply that knowledge to developing and optimizing algorithms that make an impact around the world, come and join our team!
Responsibilities
- Scope, design, and implement high-quality and high-performance numerical dense linear algebra software on GPUs.
- Provide technical leadership and feedback to the library engineers working with you on projects, and occasionally mentor interns.
- Work closely with product management and other internal and external customers to understand feature and performance requirements and help define the technical roadmaps of libraries.
- Find opportunities to improve library performance and reduce code maintenance overhead through re-architecting.
Requirements
- PhD or Master’s degree in Computer Science, Applied Math, or related science or engineering field (or equivalent experience).
- 5+ years of experience designing, developing, testing, maintaining, and performance-optimizing production software using CUDA and C++.
- Good knowledge of GPU (preferred) or CPU hardware architecture.
- Strong fundamentals in finite-precision arithmetic and numerical methods for linear algebra.
- Great teamwork, communication, and documentation habits.
Ways to stand out / Nice to have
- Experience with CUTLASS or other low-level programming (assembly) for performance optimization is a huge plus.
- Experience with a scripting language, preferably Python.
- Experience working in a globally distributed team.
Compensation & Benefits
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5. You will also be eligible for equity and benefits. Applications for this job will be accepted at least until September 7, 2025.
Company
NVIDIA is committed to fostering a diverse work environment and is an equal opportunity employer. We do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.