Used Tools & Technologies
Not specified
Required Skills & Competences ?
GCP @ 3 Leadership @ 3 AWS @ 3 Azure @ 3 Communication @ 6 Planning @ 3 Reporting @ 3Details
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.
About the role
Anthropic’s Capacity team is looking for an Engineering Manager to own and manage cloud spend across a massively scaled, multi-cloud environment. You’ll work closely with research, engineering, and finance teams to ensure we have scalable systems for capacity management, high-quality data and insights for planning, and engineering roadmaps that deliver efficiency wins.
Responsibilities
- Design, develop, and deliver capacity management systems for AI workloads on heterogeneous infrastructure
- Build and maintain robust attribution of usage and enable in-depth data-driven insights that are actionable
- Build a deep understanding of research and training workloads to accurately forecast infrastructure needs
- Oversee design and implementation of forecasting tools and software systems for managing billions of dollars in spend
- Proactively identify efficiency opportunities and collaborate with teams across the org to increase effective capacity for Anthropic
- Partner closely with Finance and leadership, providing detailed and clear capacity inputs for financial planning and strategic decision making
Requirements
- Experience managing large infrastructure spend (the posting references experience managing $XXXM to $XB in infrastructure spend)
- Experience working with public clouds (AWS, GCP, Azure) and/or hybrid on-prem/cloud environments
- Experience setting up capacity management systems that scale with growing organizations
- Comfortable leveraging data and experience building observability for complex systems
- Strong interpersonal skills enabling you to influence and build cross-organizational support for capacity initiatives
- Familiarity with large language models (LLMs) and a strong interest in learning more about research and model training workloads
- We require at least a Bachelor's degree in a related field or equivalent experience
Strong candidates may also have
- Past experience managing capacity for AI research and production workloads
- Past experience partnering with senior leadership, both technical and non-technical, to drive company-level reporting and decision making
Compensation
Annual Salary: $365,000 - $565,000 USD
The expected base compensation for this position is above. Our total compensation package for full-time employees includes equity, benefits, and may include incentive compensation.
Logistics
- Education requirements: At least a Bachelor's degree in a related field or equivalent experience
- Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.
- Visa sponsorship: We do sponsor visas, though we cannot guarantee sponsorship for every role and candidate. If we make you an offer we will make reasonable efforts to get you a visa and retain an immigration lawyer to help.
How we're different
We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. We value impact and collaboration, and host frequent research discussions to ensure we are pursuing the highest-impact work. We also value strong communication skills.
Application notes
We encourage applicants even if you do not meet every qualification. Research shows that underrepresented groups may be prone to imposter syndrome; please apply if you're interested. Guidance on candidates' AI usage is provided in Anthropic's candidate AI guidance policy (link in the original posting).