Senior High Performance AI Engineer

at Nvidia
USD 184,000-356,500 per year
SENIOR
βœ… On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Python @ 7 Hiring @ 4 Leadership @ 4 Performance Optimization @ 4 CUDA @ 4 GPU @ 4

Details

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. Today, NVIDIA is tapping into the unlimited potential of AI to define the next era of computing. As a member of this team you will help build groundbreaking multi-agent systems for the CUDA ecosystem, developing agent abstractions, GPU-centric runtimes, and compiler- or runtime-driven system solutions to accelerate agent planning, tool-use, code generation, and other AI workloads. You will collaborate closely with internal NVIDIA software and hardware teams to push the latest developments into NVIDIA products.

Responsibilities

  • Design, build, and optimize agentic AI systems for the CUDA ecosystem.
  • Co-design agentic system solutions with software, hardware, and algorithm teams; influence and adopt new capabilities as they become available.
  • Develop reproducible, high-fidelity evaluation frameworks covering performance, quality, and developer productivity.
  • Collaborate across the AI stack β€” from hardware through compilers/toolchains, kernels/libraries, frameworks, distributed training, and inference/serving β€” and with model/agent teams.

Requirements

  • Bachelor’s degree in Computer Science, Electrical Engineering, or related field (or equivalent experience); MS or PhD preferred.
  • 6+ years industry or academia experience with AI systems development; exposure to building foundational models, agents or orchestration frameworks; hands-on experience with deep learning frameworks and modern inference stacks.
  • Strong C/C++ and Python programming skills; solid software engineering fundamentals.
  • Experience with GPU programming and performance optimization (CUDA or equivalent).

Ways to Stand Out

  • Track record building/evaluating deep learning models, coding agents and developer tooling.
  • Demonstrated ability to optimize and deploy high-performance models, including on resource-constrained platforms.
  • Deep expertise in GPU performance optimizations, evidenced by benchmark wins or published results.
  • Publications or open-source leadership in deep learning, multi-agent systems, reinforcement learning, or AI systems; contributions to widely used repos or standards.

Compensation & Benefits

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5. You will also be eligible for equity and benefits.

Other Details

  • Location: Santa Clara, CA, United States.
  • Employment type: Full time.
  • Applications for this job will be accepted at least until December 26, 2025.

Company & Equal Opportunity

NVIDIA is committed to fostering a diverse work environment and is proud to be an equal opportunity employer. They do not discriminate in hiring and promotion practices on the basis of protected characteristics.