Technical Program Manager, Safeguards (Infrastructure & Evals)

at Anthropic

📍 New York City, United States
📍 San Francisco, United States
📍 Seattle, United States

USD 290,000-365,000 per year

MIDDLE

✅ Hybrid

✅ Visa Sponsorship

Used Tools & Technologies

Machine Learning

Required Skills & Competences
Each tag name is followed by an "@" symbol and a proficiency level. The proficiency levels are:

1-2: basic awareness. Minimal hands-on experience and a rudimentary understanding of the technology's purpose;

3-6: daily use. Comfortable, regular usage; capable of handling common tasks and challenges related to the technology;

7-9: expert. You can teach others and you know the pitfalls and tricks;

10: exceptional knowledge. Comprehensive understanding of and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such a level.

Datadog @ 2 SRE @ 3 Reporting @ 3 AI @ 3

Details

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. The Safeguards Engineering team builds and operates infrastructure that keeps Anthropic's AI systems safe in production — classifiers, detection pipelines, evaluation platforms, and monitoring systems that sit between models and the real world. This role focuses on operational health and forward momentum for that stack, driving reliability through incident response, SLO definition and tracking, runbook quality, and cross-team platform projects.

Responsibilities

Own the Safeguards Engineering ops review: run a recurring review that surfaces incidents and reliability trends, and ensure the right participants are present and the right decisions are made.
Drive incident tracking and post-mortem execution: track incidents across the organization (including partner teams like Inference), ensure post-mortems are written, and close out action items.
Establish and maintain SLOs with partner teams: define service-level objectives for safety-critical pipelines and build tracking/reporting to measure them.
Maintain runbook quality and incident-ownership clarity: keep runbooks accurate and ensure incident ownership is unambiguous for safety-critical areas (e.g., account-banning false positives, CSAM detection).
Drive platform migrations and infrastructure projects: program-manage migrations (platform, incident platform, cloud monitoring systems) and other cross-team infrastructure work so systems remain operational during transitions.
Coordinate evals platform improvements: partner with the evals engineering team to scope work, track dependencies, and drive improvements to evaluation and self-serve capabilities.

Requirements

Solid technical program management experience, especially in operational or infrastructure-heavy environments where you own both ongoing operational cadence and discrete projects.
Understanding of how production ML systems operate that is strong enough to triage incidents and hold technical conversations with engineers (you don't need to write the code, but you must be able to follow the technical thread).
Demonstrated ability to close loops: track and drive post-mortem actions, ensure SLOs are checked, and keep runbooks up to date.
Ability to work effectively across team boundaries and coordinate with partner teams where you don't have direct authority.
Comfortable context-switching between "keep the lights on" work and longer-horizon platform projects.
Experience with, or strong interest in, AI safety, and an appreciation for the distinct reliability needs of safety-critical pipelines.

Strongly preferred / helpful experience

Experience with SRE practices, incident management frameworks, or on-call operations at scale.
Experience with evaluation infrastructure for ML systems and understanding of how evals are designed, run, and interpreted.
Experience driving infrastructure migrations in complex, multi-team environments without taking operational systems offline.
Familiarity with monitoring and alerting tooling (PagerDuty, Datadog, or equivalents) and the operational culture around it.

Logistics

Education: At least a Bachelor's degree in a related field or equivalent experience.
Location-based hybrid policy: staff are expected to be in one of Anthropic's offices at least 25% of the time (some roles may require more time in-office).
Visa sponsorship: Anthropic states that it sponsors visas and retains an immigration lawyer to assist, though sponsorship is not guaranteed for every role or candidate.
Deadline to apply: None (applications received on a rolling basis).

Compensation and Benefits

Annual salary range: $290,000 - $365,000 USD.
Anthropic offers competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and office space to collaborate.

Additional notes

The work involves close coordination with partner teams such as Inference and Cloud Inference, and responsibilities include keeping incident ownership unambiguous for safety-sensitive areas such as CSAM detection and account-banning false positives.
Candidates are encouraged to apply even if they do not meet every qualification.