Used Tools & Technologies
Not specified
Required Skills & Competences
Algorithms @ 4, Data Structures @ 4, Hiring @ 4, Communication @ 4, Parallel Programming @ 4, Prioritization @ 4, CUDA @ 4, GPU @ 4
Details
We’re currently seeking a Principal Developer Technology Engineer!
Are you interested in developing techniques to accelerate large application workloads on advanced computer architectures? Do you find it rewarding to investigate, find, and eliminate system bottlenecks to achieve the best possible performance on state-of-the-art computer hardware? Would you be thrilled by an opportunity to partner with the developer community, working at the forefront of technology breakthroughs that contribute to the success of an industry leader like NVIDIA? If so, the Developer Technology Team invites you to consider this opportunity.
Responsibilities
- Research and develop techniques to accelerate top cloud service provider (CSP) workloads on NVIDIA’s computing platform, including advanced CPUs, GPUs, and interconnects.
- Work directly with key customers to perform in-depth analysis and optimization of complex workloads to ensure the best possible performance on current and next-generation hardware.
- Collaborate with libraries, tools, system software architecture, hardware, and research teams at NVIDIA to influence the design of next-generation programming models, software, and architectures.
- Investigate performance of applications, design parallel algorithms and implement optimizations in a GPU-accelerated computing environment.
- Publish findings in developer blogs or present at relevant conferences and workshops; act as a visible representative of NVIDIA to customers, industry, and academia.
Requirements
- Master's degree in Computer Science, Computer Engineering, or a related computationally focused science (or equivalent experience).
- 10+ years of relevant work experience or research.
- Programming proficiency in C/C++ with a deep understanding of software design, programming techniques, and algorithms.
- Background that includes parallel programming, ideally CUDA C/C++.
- Hands-on experience doing low-level performance optimizations.
- In-depth expertise with CPU and GPU architecture fundamentals.
- Strong math skills, including linear algebra, for problem-solving and performance modeling.
- Good communication, organization and prioritization skills.
Ways to stand out (Preferred / Additional)
- Designed highly optimized parallel algorithms and data structures for applications with a high bytes-to-compute ratio, using techniques such as processing directly on compressed data and kernel fusion.
- Optimized end-to-end performance of applications spanning many layers of software, from OS to high-level frameworks.
- Influenced hardware feature design leveraging application and domain knowledge.
Compensation & Logistics
- Base salary range (determined by location, experience, and peer pay): 272,000 USD - 425,500 USD.
- Eligible for equity and benefits.
- #LI-Hybrid
- Applications accepted at least until July 29, 2025.
Company & Diversity
NVIDIA is widely considered to be one of technology’s most desirable employers. We have a world-class team with some of the most forward-thinking and hardworking people in the world. Does contributing to and pushing the boundaries of the state of the art in GPU-accelerated computing, HPC, and artificial intelligence excite you? If so, we want to hear from you!
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. We do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.