Senior Software Engineer, Metropolis AI NIM

at NVIDIA

📍 Santa Clara, United States

$148,000-276,000 per year

SENIOR
✅ On-site

Required Skills & Competences

Python @ 4, Algorithms @ 4, Machine Learning @ 4, Communication @ 4, Microservices @ 4, Debugging @ 7, API @ 6

Details

We are seeking a senior software engineer for Metropolis AI NIM to develop and deliver state-of-the-art AI models to the world in the form of NVIDIA Inference Microservices (NIM). You will collaborate across the organization to bring the latest flagship models (both CV and vision-language models) from our community and partners, such as VILA and Florence-2, to life as optimized NIMs. This role offers an outstanding opportunity to craft the future of AI at a fast-growing company at the forefront of the AI revolution. Join our team of world-class software engineers and partners to deliver the most advanced models with lightning-fast inference.

Responsibilities

  • Collaborate closely with our partners and the open-source community to deliver their flagship models as highly optimized NVIDIA Inference Microservices (NIM).
  • Research and develop innovative deep learning methodologies to accurately evaluate new model families across diverse domains.
  • Analyze, influence, and enhance AI/DL libraries, frameworks, and APIs, ensuring consistency with engineering best practices.
  • Design and develop accelerated streaming AI pipelines using CV models and VLMs, and lead technical design discussions.
  • Profile and optimize the AI pipelines to ensure scalability, reliability, and efficiency.
  • Take on complex system-level optimization and resource utilization challenges.
  • Participate in a product development lifecycle that values high standards for clear requirements, software quality and performance.
  • Write code in Python and C++.

Requirements

  • BS, MS, or PhD in Computer Science, AI, Applied Math, or a related field, or equivalent experience, with 5+ years of industry experience.
  • 3+ years of hands-on experience in AI for computer vision (CV) and large language models (LLMs).
  • Experience in complex system design and development using Python, C++14/17/20, and object-oriented programming.
  • Strong problem-solving, debugging, performance analysis, test design, and documentation skills.
  • Solid mathematical foundations and expertise in AI/DL algorithms.
  • Excellent written and verbal communication skills, with the ability to work both independently and collaboratively in a fast-paced environment.
  • Passion for expanding your technical knowledge into new areas.
  • Ability to excel in a multinational, multi-time-zone environment: excellent verbal and written communication skills, strong collaboration, and a commitment to our core values.

Ways to stand out from the crowd:

  • Demonstrated implementation of computer vision / machine learning applications, microservices, containers, and cloud-native applications.
  • Experience with cloud-native architecture involving Docker, Kubernetes (k8s), and microservices.
  • Hands-on experience with inference and deployment environments like TensorRT, ONNX, Triton, or vLLM.
  • Experience working with high-availability environments.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and passionate people in the world working for us. Are you a creative problem solver with a passion for solving real-world problems with AI? If so, we want to hear from you.