Anthropic Fellows Program — AI Safety
📍 United States
📍 London, United Kingdom
📍 Berkeley, United States
📍 San Francisco, United States
Used Tools & Technologies
Machine Learning, LLM
Required Skills & Competences
Each tag name is followed by an "@" symbol and a proficiency level.
About proficiency levels:
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — expert. You can teach others and know the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such a level.
Security @ 3
Python @ 5
Mathematics @ 6
API @ 3
AI @ 3
Reinforcement Learning @ 3
Details
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. The Anthropic Fellows Program is designed to foster AI research and engineering talent by providing funding and mentorship to promising technical candidates to work on empirical AI safety projects with the goal of producing a public output (e.g., a paper submission). The next cohort starts July 20, 2026; apply by April 26, 2026 for that cohort. Applications are reviewed on a rolling basis.
What to expect
- 4 months of full-time research (with possible extension)
- Direct mentorship from Anthropic researchers
- Access to a shared workspace (Berkeley, California or London, UK)
- Connection to the broader AI safety and security research community
- Weekly stipend of 3,850 USD / 2,310 GBP / 4,300 CAD + benefits (vary by country)
- Funding for compute (~$15k/month) and other research expenses
Interview process
The process includes an initial application & reference check, technical assessments & interviews, and a research discussion. The program encourages applicants who may not meet every listed qualification to apply.
Fellows workstreams
Example workstreams:
- AI Safety Fellows
- AI Security Fellows
- ML Systems & Performance Fellows
- Reinforcement Learning Fellows
- Economics & Societal Impacts Fellows
Fellows will primarily use external infrastructure (open-source models, public APIs) to work on empirical projects aligned with Anthropic research priorities. Projects aim to produce public outputs; in earlier cohorts over 80% of fellows produced papers.
AI Safety specifics (mentors & research areas)
Potential mentors and example research areas include:
- Mentors (sample): Sam Bowman, Sara Price, Alex Tamkin, Nina Panickssery, Trenton Bricken, Logan Graham, Jascha Sohl-Dickstein, Joe Benton, Collin Burns, Fabien Roger, Samuel Marks, Kyle Fish, Ethan Perez
- Research areas: Scalable Oversight, Adversarial Robustness and AI Control, Model Organisms, Model Internals / Mechanistic Interpretability, AI Welfare
- The posting links to past projects and representative posts as examples of prior Fellows work and research directions.
Requirements
- Must be available to work full-time on the Fellows program (expected 40 hours/week)
- Fluent in Python programming
- Strong technical background in computer science, mathematics, or physics (or similarly relevant experience)
- Must have work authorization in the US, UK, or Canada and be located in that country during the program
- Experience with empirical ML research projects and/or large language models is highly relevant
- A track record of open-source contributions is desirable
Logistics
- Workspace locations: shared workspaces in London and Berkeley; Anthropic is also open to remote fellows in the UK, US, or Canada. Fellows will be asked about availability to work from Berkeley or London (full- or part-time) during the program.
- Visa sponsorship: Anthropic is not currently able to sponsor visas for fellows; participants must have or independently obtain full-time work authorization in the UK, the US, or Canada.
- Program duration: 4 months, full-time. Applicants who cannot commit to the full duration should still apply and note their constraints.
Compensation
- Expected base stipend: 3,850 USD / 2,310 GBP / 4,300 CAD per week (40 hours/week for 4 months)
- Funding for compute (~$15,000 per month) and other research expenses
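The posting states weekly figures only, not totals. As a rough back-of-the-envelope sketch, the snippet below multiplies the stated weekly stipend by an assumed program length of 17 weeks (4 months is not given in weeks in the posting, so that number is an assumption) and derives the implied hourly rate from the stated 40 hours/week. The function names are hypothetical, and the results are estimates before tax and benefits, not figures confirmed by Anthropic.

```python
# Rough estimate only: weekly amounts and hours come from the posting;
# the week count is an assumption, since the posting only says "4 months".
WEEKLY_STIPEND = {"USD": 3850, "GBP": 2310, "CAD": 4300}
ASSUMED_WEEKS = 17   # assumption: 4 months taken as roughly 17 weeks
HOURS_PER_WEEK = 40  # stated in the posting


def estimate_total(weekly: int, weeks: int = ASSUMED_WEEKS) -> int:
    """Return the stipend total over the assumed number of weeks."""
    return weekly * weeks


def implied_hourly(weekly: int, hours: int = HOURS_PER_WEEK) -> float:
    """Return the hourly rate implied by the weekly stipend."""
    return weekly / hours


if __name__ == "__main__":
    for currency, weekly in WEEKLY_STIPEND.items():
        total = estimate_total(weekly)
        hourly = implied_hourly(weekly)
        print(f"{currency}: ~{total:,} total, ~{hourly:.2f}/hour")
```

Changing `ASSUMED_WEEKS` (for example to 16 or 18) shows how sensitive the total is to how "4 months" is counted.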
How to apply
To apply, complete the Constellation application form (application links provided in the posting). Constellation is Anthropic’s recruiting partner for this program and manages applications and interviews.
Additional notes
- The program does not guarantee full-time offers, though strong performance may lead to full-time opportunities. In past cohorts, 25–50% of fellows received full-time offers.
- Applicants should be cautious of scams; Anthropic recruiters contact applicants from @anthropic.com addresses and will not ask for money or banking information before the first day.