Senior Computer Vision, VLM Performance Engineer

at NVIDIA
USD 184,000-356,500 per year
SENIOR
✅ On-site

Required Skills & Competences

  • Marketing @ 4
  • Software Development @ 7
  • Python @ 6
  • Statistics @ 4
  • CI/CD @ 4
  • Communication @ 7
  • gRPC @ 4
  • Data Analysis @ 4
  • Product Management @ 4
  • API @ 4
  • PyTorch @ 4
  • CUDA @ 4

Details

NVIDIA is a world leader in artificial intelligence and computer vision. Our team builds hardware-accelerated computer vision pipelines, cloud services, and SDKs that bring the latest AI innovations to data centers, gaming rigs, cars, robots, buildings, medical devices, and more. We are looking for an engineering expert to help us productize and optimize the latest Vision Language Models (VLMs) and their pipelines. Together, we will democratize the use of these models, unlocking innovative applications the world is barely dreaming of.

Responsibilities

  • Develop, profile and optimize inference pipelines for VLMs and other AI computer vision models: improve throughput and latency, data loading, pre- and post-processing.
  • Improve the efficiency of the VLMs themselves, including kernel optimization in CUDA.
  • Contribute upstream improvements to SDKs and libraries across NVIDIA and beyond to deliver accelerated computer vision at scale.
  • Promote high-performance AI computer vision across NVIDIA teams and functions (Engineering, Product Management, Marketing, and more).

Requirements

  • Master of Science in Computer Science, Electrical Engineering, or equivalent experience.
  • Approximately 8 years of practical experience (or equivalent).
  • Expertise in AI computer vision, including Vision Language Models (VLMs), Vision Transformers, and Diffusion models; a proven track record of using the related software ecosystem (PyTorch, Hugging Face, vLLM) to develop and release production-grade software.
  • Excellent software engineering fundamentals: source control, CI/CD, testing/validation, packaging, containerization, and release processes.
  • Proficiency with Python, C++, and CUDA (including kernel optimization).
  • Experience developing cloud applications and services (REST APIs, gRPC).
  • Strong written, visual, and verbal communication skills to present performance challenges, tradeoffs, and architectural alternatives.
  • Curiosity and drive to learn new technologies and partner across teams and functions.

Ways to Stand Out

  • Expertise in classical, non-ML computer vision.
  • Strong fundamentals in system-level performance: multi-threaded, multi-process, and distributed software development.
  • Grounding in mathematical fundamentals such as linear algebra, numerical methods, statistics, and exploratory data analysis.
  • History of creativity and innovation around performance in multiple problem domains.

Compensation & Benefits

  • Base salary ranges (depending on level and location):
    • Level 4: 184,000 USD - 287,500 USD
    • Level 5: 224,000 USD - 356,500 USD
  • You will also be eligible for equity and benefits.

Additional Information

  • Applications for this job will be accepted at least until September 7, 2025.
  • NVIDIA is committed to fostering a diverse work environment and is an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.