Senior Manager, Artificial Intelligence - Machine Learning Platform
at NVIDIA
USD 272,000-488,750 per year
Used Tools & Technologies
HPC
Required Skills & Competences
Each tag name is followed by an "@" symbol and a proficiency level value.
About proficiency levels:
- 1-2: basic awareness. Minimal hands-on experience and a rudimentary understanding of the technology's purpose;
- 3-6: daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9: expert. You can teach others and you know all the pitfalls and tricks;
- 10: exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such a level.
Machine Learning @ 4
Leadership @ 7
Team Management @ 7
Communication @ 4
Networking @ 7
AI @ 4
Details
We are seeking a seasoned engineering leader in the area of Artificial Intelligence/Machine Learning Platforms to lead the development and management of a comprehensive suite of tools and services that support the entire lifecycle of AI/ML projects. This role is critical for enabling internal researchers to use very large-scale systems for training foundational models with flexibility and efficiency.
Responsibilities
- Lead the strategic direction, development, and continuous improvement of the AI/ML platform to meet the needs of internal researchers for large-scale model training and deployment.
- Optimize efficiency and resilience of ML workflows, including data ingestion, preprocessing, checkpointing, model training, deployment, and monitoring.
- Lead and mentor a team of highly skilled engineers, fostering a collaborative and high-performance culture.
- Collaborate with data scientists, researchers, and IT to understand needs and ensure the platform provides necessary tools and resources.
- Drive innovation and efficiency by exploring and implementing new technologies and methodologies to enhance the AI/ML ecosystem.
- Maintain a strong customer focus to ensure the platform meets evolving user needs and delivers high value.
Requirements
- 12+ years overall experience in AI/ML infrastructure, with a proven track record managing large-scale AI/ML projects and platforms.
- 6+ years experience leading highly technical, collaborative teams.
- Bachelor's degree or equivalent experience.
- Deep understanding of AI/ML lifecycle management, high-performance computing, niche hardware, storage, and networking.
- Strong leadership and team management skills, with experience leading multi-functional teams and sophisticated projects.
- Ability to develop and implement a strategic vision for an AI/ML platform aligned with organizational and researcher needs.
- Excellent communication and interpersonal skills for effective collaboration with diverse stakeholders.
- Strong analytical and problem-solving abilities with a focus on delivering innovative solutions.
Ways to stand out
- Experience developing and leading AI/ML platforms in a research or academic environment.
- Familiarity with cloud-based AI/ML platforms and infrastructure.
- Proven ability to drive efficiency and innovation within technical teams and projects.
Compensation & Benefits
- Base salary ranges provided by location and level:
  - Level 4: USD 272,000 - 431,250 per year
  - Level 5: USD 320,000 - 488,750 per year
- Eligible for equity and NVIDIA benefits (see company benefits pages referenced in the original posting).
Other information
- Location: Santa Clara, CA, United States.
- Employment type: Full time. Expected hours not specified.
- Applications accepted at least until June 4, 2026.
- NVIDIA uses AI tools in its recruiting processes and is an equal opportunity employer.