Research Engineer, Universes

USD 500,000-850,000 per year
MIDDLE
✅ Remote ✅ Hybrid
✅ Visa Sponsorship

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Distributed Systems @ 3 Communication @ 3

Details

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. The Universes team within Research trains AI models to perform complex, long-horizon agentic tasks in ultra-realistic settings by designing and implementing novel training environments and rigorous evaluations.

About the role

This role blends research and engineering responsibilities: implement novel approaches, contribute to research direction, design training environments and methodologies for reinforcement learning and agentic tasks, and build evaluations that measure genuine capability. You will collaborate with research and infrastructure teams to ship environments into production training, debug and iterate across research and production ML stacks, and contribute to research culture through technical discussions and collaborative problem-solving.

Responsibilities

  • Build the next generation of agentic training environments
  • Build rigorous evaluations that measure real capability
  • Collaborate across research and infrastructure teams to ship environments into production training
  • Debug and iterate rapidly across research and production ML stacks
  • Contribute to research culture through technical discussions and collaborative problem-solving

Requirements / Qualifications

  • At least a Bachelor's degree in a related field or equivalent experience
  • Strong software engineering skills and ability to build robust infrastructure
  • Experience or strong interest in reinforcement learning, simulation systems, and designing training environments
  • Experience building evaluations for ML models and measuring capabilities
  • Ability to balance research exploration with engineering implementation and operate with high agency

Strong candidates may also have one or more of the following:

  • Industry experience with large language model training, fine-tuning, or evaluation
  • Industry experience building RL environments, simulation systems, or large-scale ML infrastructure
  • Senior experience in a relevant technical field, even if transitioning domains
  • Deep expertise in sandboxing, containerization, VM infrastructure, or distributed systems
  • Published influential work in relevant ML areas

Compensation

Annual Salary: $500,000 - $850,000 USD

Total compensation for full-time employees includes equity and benefits.

Logistics

  • Remote-friendly (travel required); listed locations: San Francisco, CA; Seattle, WA; New York City, NY
  • Location-based hybrid policy: currently expect all staff to be in one of our offices at least 25% of the time (some roles may require more time in our offices)
  • Visa sponsorship: Anthropic states they do sponsor visas and retain an immigration lawyer to help with the process (not every role/candidate can be successfully sponsored)
  • Education requirement: at least a Bachelor's degree in a related field or equivalent experience

Company & culture

Anthropic values high-impact, collaborative research work on a small number of large-scale efforts. The company emphasizes communication, frequent research discussions, and has an emphasis on safety and beneficial AI.

How to apply

Application fields include name, email, phone, resume/CV or LinkedIn, and questions about openness to working in-person at an office 25% of the time. Anthropic encourages candidates from diverse backgrounds to apply and provides guidance on candidate AI usage during the application process.