Senior Software Engineer, Metropolis AI NIM
at Nvidia
📍 Santa Clara, United States
$148,000-276,000 per year
SCRAPED
Used Tools & Technologies
Not specified
Required Skills & Competences ?
Python @ 4 Algorithms @ 4 Machine Learning @ 4 Communication @ 4 Microservices @ 4 Debugging @ 7 API @ 6Details
We are seeking a senior software engineer for Metropolis AI NIM to develop and deliver the state-of-the-art AI models to the world in the form of NVIDIA Inference Microservices (NIM). You will collaborate across the organization to bring the latest flagship models (both CV and Vision-Language Models) from our community and partners—such as VILA and Florence-2—to life as optimized NVIDIA Inference Microservices (NIM). This role offers an outstanding opportunity to craft the future of AI at a fast-growing company at the forefront of the AI revolution. Join our team of world-class software engineers and partners to deliver the most advanced models with lightning-fast inference.
Responsibilities
- Collaborate closely with our partners and the open-source community to deliver their flagship models as highly optimized NVIDIA Inference Microservices (NIM).
- Research and develop innovative deep learning methodologies to accurately evaluate new model families across diverse domains.
- Analyze, influence, and enhance AI/DL libraries, frameworks, and APIs, ensuring consistency with the best engineering practices.
- Design and develop accelerated streaming AI pipelines using CV and VLM models and lead technical design discussions.
- Profile and optimize the AI pipelines to ensure scalability, reliability, and efficiency.
- Take on complex system-level optimization and resource utilization challenges.
- Participate in a product development lifecycle that values high standards for clear requirements, software quality and performance.
- Write code in Python and C++.
Requirements
- BS, MS, or PhD in Computer Science, AI, Applied Math, or a related field, or equivalent experience, with 5+ years of industry experience.
- 3+ years of hands-on experience in AI for computer vision (CV) and large language models (LLMs).
- Complex system design and development using Python, C++ 14/17/20, and object-oriented programming.
- Strong problem-solving, debugging, performance analysis, test design, and documentation skills.
- Solid mathematical foundations and expertise in AI/DL algorithms.
- Excellent written and verbal communication skills, with the ability to work both independently and collaboratively in a fast-paced environment.
- Passion for expanding your technical knowledge into new areas.
- Ability to excel in a multinational, multi-time-zone environment: excellent communication skill (verbal & written), collaborates well, represents our core values.
Ways to stand out from the crowd:
- Demonstrated implementation of computer vision / machine learning applications, microservices, container, and cloud-native application development.
- Experience with cloud-native architecture involving dockers, k8s, microservices.
- Hands-on experience with inference and deployment environments like TensorRT, ONNX, Triton, or vLLM.
- Experience working with High Availability environments.
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and passionate people in the world working for us. Are you a creative problem solver with a passion for solving real-world problems with AI? If so, we want to hear from you.