Senior Developer Technology Engineer - Windows AI Platform

at Nvidia
USD 184,000-356,500 per year
SENIOR
✅ On-site

Used Tools & Technologies

Not specified

Required Skills & Competences

Python @ 7 Leadership @ 4 Communication @ 4 Debugging @ 4 API @ 4 Technical Leadership @ 4 LLM @ 4 CUDA @ 4 GPU @ 4

Details

At NVIDIA, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work.

As a Developer Technology Engineer, you will be at the forefront of innovation, working with leading industry partners and open-source projects to help them adopt advancements in AI and accelerated computing on NVIDIA RTX. This role offers an opportunity to collaborate with world-class talent and contribute to enterprise and consumer AI.

Responsibilities

  • Work closely with internal engineering and product teams and external application developers to solve local end-to-end AI GPU deployment challenges on the NVIDIA RTX AI platform.
  • Apply profiling and debugging tools to analyze demanding GPU-accelerated end-to-end AI applications and detect insufficient GPU utilization and suboptimal runtime performance.
  • Conduct hands-on training, develop sample code, and host presentations to provide guidance on efficient end-to-end AI deployment targeting optimal runtime performance on NVIDIA ARM-based SoCs.
  • Improve Windows LLM & GenAI user experience on NVIDIA RTX by working on feature and performance enhancements of open-source software, including projects like GGML, Llama.cpp, Ollama, and ONNX Runtime.
  • Collaborate with GPU driver and architecture teams and NVIDIA research to influence next-generation GPU features by providing real-world workflows and feedback on partner and customer needs.
  • Provide technical leadership and mentorship to junior engineers, fostering an inclusive and high-performing team environment.

Requirements

  • 8+ years of professional experience in local GPU deployment, profiling, and optimization.
  • Bachelor's or Master's degree or equivalent experience in Computer Science, Engineering, or a related field.
  • Strong proficiency in C++ and Python, and solid software design and programming techniques.
  • Familiarity with and development experience on the Windows operating system.
  • Experience working with open-source LLM and GenAI software.
  • Experience with CUDA and NVIDIA's Nsight GPU profiling and debugging suite.
  • Ability to travel for conferences and on-site visits with external partners.
  • Strong problem-solving skills and ability to work independently and collaboratively in a fast-paced environment.
  • Excellent interpersonal and communication skills and a passion for following advancements in AI technology.

Ways to Stand Out

  • Experience with GPU-accelerated AI inference driven by NVIDIA APIs such as cuDNN, CUTLASS, and TensorRT.
  • Confirmed expert knowledge in Vulkan and/or DirectX 12 (DX12).
  • Detailed knowledge of the latest-generation GPU architectures.
  • Experience with AI deployment on NPUs and ARM architectures.

Compensation & Benefits

  • Base salary range (depends on location, experience, and level):
    • Level 4: 184,000 USD - 287,500 USD
    • Level 5: 224,000 USD - 356,500 USD
  • Eligible for equity and benefits (link to NVIDIA benefits referenced in the posting).

Additional Information

  • Applications for this job will be accepted at least until January 26, 2026.
  • NVIDIA uses AI tools in its recruiting processes.
  • NVIDIA is an equal opportunity employer and committed to fostering a diverse work environment.