Used Tools & Technologies
Not specified
Required Skills & Competences
Tag name is followed by "@" symbol and proficiency level value.
About proficiency levels:
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Python @ 7
Leadership @ 4
Communication @ 4
Debugging @ 4
API @ 4
Technical Leadership @ 4
LLM @ 4
CUDA @ 4
GPU @ 4
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Details
At NVIDIA, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work.
As a Developer Technology Engineer, you will be at the forefront of innovation, working with leading industry partners and open-source projects to help them adopt advancements in AI and accelerated computing on NVIDIA RTX. This role offers an opportunity to collaborate with world-class talent and contribute to enterprise and consumer AI.
Responsibilities
- Work closely with internal engineering and product teams and external application developers to solve local end-to-end AI GPU deployment challenges on the NVIDIA RTX AI platform.
- Apply profiling and debugging tools to analyze demanding GPU-accelerated end-to-end AI applications and detect insufficient GPU utilization and suboptimal runtime performance.
- Conduct hands-on training, develop sample code, and host presentations to provide guidance on efficient end-to-end AI deployment targeting optimal runtime performance on NVIDIA ARM-based SoCs.
- Improve Windows LLM & GenAI user experience on NVIDIA RTX by working on feature and performance enhancements of open-source software, including projects like GGML, Llama.cpp, Ollama, and ONNX Runtime.
- Collaborate with GPU driver and architecture teams and NVIDIA research to influence next-generation GPU features by providing real-world workflows and feedback on partner and customer needs.
- Provide technical leadership and mentorship to junior engineers, fostering an inclusive and high-performing team environment.
Requirements
- 8+ years of professional experience in local GPU deployment, profiling, and optimization.
- Bachelor's or Master's degree or equivalent experience in Computer Science, Engineering, or a related field.
- Strong proficiency in C++ and Python, and solid software design and programming techniques.
- Familiarity with and development experience on the Windows operating system.
- Experience working with open-source LLM and GenAI software.
- Experience with CUDA and NVIDIA's Nsight GPU profiling and debugging suite.
- Ability to travel for conferences and on-site visits with external partners.
- Strong problem-solving skills and ability to work independently and collaboratively in a fast-paced environment.
- Excellent interpersonal and communication skills and a passion for following advancements in AI technology.
Ways to Stand Out
- Experience with GPU-accelerated AI inference driven by NVIDIA APIs such as cuDNN, CUTLASS, and TensorRT.
- Confirmed expert knowledge in Vulkan and/or DirectX 12 (DX12).
- Detailed knowledge of the latest-generation GPU architectures.
- Experience with AI deployment on NPUs and ARM architectures.
Compensation & Benefits
- Base salary range (depends on location, experience, and level):
- Level 4: 184,000 USD - 287,500 USD
- Level 5: 224,000 USD - 356,500 USD
- Eligible for equity and benefits (link to NVIDIA benefits referenced in the posting).
Additional Information
- Applications for this job will be accepted at least until January 26, 2026.
- NVIDIA uses AI tools in its recruiting processes.
- NVIDIA is an equal opportunity employer and committed to fostering a diverse work environment.