Research Engineer, Universes

at Anthropic

📍 World
📍 New York City, United States
📍 San Francisco, United States
📍 Seattle, United States

USD 500,000-850,000 per year

MIDDLE

✅ Remote ✅ Hybrid

✅ Visa Sponsorship

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ^?

Distributed Systems @ 3 Communication @ 3

Details

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. The Universes team within Research trains AI models to perform complex, long-horizon agentic tasks in ultra-realistic settings by designing and implementing novel training environments and rigorous evaluations.

About the role

This role blends research and engineering responsibilities: implement novel approaches, contribute to research direction, design training environments and methodologies for reinforcement learning and agentic tasks, and build evaluations that measure genuine capability. You will collaborate with research and infrastructure teams to ship environments into production training, debug and iterate across research and production ML stacks, and contribute to research culture through technical discussions and collaborative problem-solving.

Responsibilities

Build the next generation of agentic training environments
Build rigorous evaluations that measure real capability
Collaborate across research and infrastructure teams to ship environments into production training
Debug and iterate rapidly across research and production ML stacks
Contribute to research culture through technical discussions and collaborative problem-solving

Requirements / Qualifications

At least a Bachelor's degree in a related field or equivalent experience
Strong software engineering skills and ability to build robust infrastructure
Experience or strong interest in reinforcement learning, simulation systems, and designing training environments
Experience building evaluations for ML models and measuring capabilities
Ability to balance research exploration with engineering implementation and operate with high agency

Strong candidates may also have one or more of the following:

Industry experience with large language model training, fine-tuning, or evaluation
Industry experience building RL environments, simulation systems, or large-scale ML infrastructure
Senior experience in a relevant technical field, even if transitioning domains
Deep expertise in sandboxing, containerization, VM infrastructure, or distributed systems
Published influential work in relevant ML areas

Compensation

Annual Salary: $500,000 - $850,000 USD

Total compensation for full-time employees includes equity and benefits.

Logistics

Remote-friendly (travel required); listed locations: San Francisco, CA; Seattle, WA; New York City, NY
Location-based hybrid policy: currently expect all staff to be in one of our offices at least 25% of the time (some roles may require more time in our offices)
Visa sponsorship: Anthropic states they do sponsor visas and retain an immigration lawyer to help with the process (not every role/candidate can be successfully sponsored)
Education requirement: at least a Bachelor's degree in a related field or equivalent experience

Company & culture

Anthropic values high-impact, collaborative research work on a small number of large-scale efforts. The company emphasizes communication, frequent research discussions, and has an emphasis on safety and beneficial AI.

How to apply

Application fields include name, email, phone, resume/CV or LinkedIn, and questions about openness to working in-person at an office 25% of the time. Anthropic encourages candidates from diverse backgrounds to apply and provides guidance on candidate AI usage during the application process.