Technical Program Manager, Inference Performance

at Anthropic

📍 San Francisco, United States
📍 Seattle, United States

USD 290,000-365,000 per year

MIDDLE

✅ Hybrid

✅ Visa Sponsorship

Used Tools & Technologies

Machine Learning LLM

Required Skills & Competences
Tag name is followed by "@" symbol and proficiency level value. About proficiency levels:

1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;

3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;

7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;

10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.

Distributed Systems @ 3 Leadership @ 3 Communication @ 3 Planning @ 3 AI @ 3

Details

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. The team is a rapidly growing group of researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

About the Role

As a Technical Program Manager for Inference, you'll be the bridge between inference systems and the broader organization. You'll drive strategic initiatives across inference runtime and accelerator performance—coordinating model launches, managing cross-platform dependencies, and ensuring reliability across multiple hardware targets. This role keeps contended infrastructure teams shipping effectively while Research, Product, and Safety depend on their output.

Responsibilities

Systems integration & coordination: lead cross-functional initiatives for new infrastructure integration, establish clear ownership, timelines, and communication channels between teams; drive end-to-end planning for major infrastructure transitions including platform modernization and new tech adoption.
Performance & efficiency: partner with engineering teams to identify optimization opportunities, track performance metrics, and prioritize work that unlocks capacity gains; coordinate across runtime and accelerator layers to deliver efficiency without compromising reliability.
Launch coordination: drive end-to-end readiness for model and feature launches across multiple hardware platforms; establish processes for cross-platform validation, manage launch timelines, and ensure smooth handoffs between runtime, accelerator, and downstream teams.
Strategic planning: own and prioritize the inference deployment roadmap; work closely with engineering leadership to prioritize initiatives and manage dependencies; provide visibility into upcoming changes and organizational impact.
Stakeholder communication: build strong relationships across research, engineering, and product teams to understand requirements and constraints; translate technical complexities into clear updates for leadership and ensure alignment on priorities and timelines.
Process improvement: identify inefficiencies in current workflows and drive systematic improvements; establish metrics and dashboards to track infrastructure health, capacity utilization, and deployment success rates.

Requirements

Several years of experience in technical program management, with proven success delivering complex infrastructure programs—preferably in ML/AI systems or large-scale distributed systems.
Deep technical understanding of inference systems, compilers, or hardware accelerators to engage substantively with engineers and identify technical risks.
Experience creating structure and processes in ambiguous environments and bringing clarity to complex cross-team initiatives.
Strong stakeholder management skills with the ability to build trust with both technical and non-technical partners.
Comfortable navigating competing priorities and using data to drive technical decisions.
Experience with infrastructure scaling initiatives, hardware integrations, or deployment governance.
Comfortable balancing strategic planning with tactical execution; passionate about AI infrastructure and challenges of deploying and scaling large language models.

Logistics

Locations: San Francisco, CA and Seattle, WA.
Location-based hybrid policy: staff are expected to be in one of Anthropic's offices at least 25% of the time (some roles may require more time in offices).
Education requirements: at least a Bachelor's degree in a related field or equivalent experience.
Visa sponsorship: Anthropic states they do sponsor visas and will make reasonable efforts to assist with visas for candidates they make offers to; they retain an immigration lawyer to help.
Deadline to apply: None — applications are received on a rolling basis.

Compensation

Annual salary range: $290,000 - $365,000 USD.

Benefits

Competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and an office space for collaboration.

How We're Different

Anthropic emphasizes "big science" research, working as a single cohesive team on a few large-scale research efforts. They value impact, collaboration, frequent research discussions, and strong communication skills.