Research Engineer, Production Model Post Training

at Anthropic

📍 New York City, United States
📍 San Francisco, United States
📍 Seattle, United States

USD 315,000-340,000 per year

MIDDLE

✅ Hybrid

Required Skills & Competences

Python @ 5
Distributed Systems @ 3
Hiring @ 3
Debugging @ 6

Details

Anthropic’s production models undergo sophisticated post-training processes to enhance their capabilities, alignment, and safety. As a Research Engineer on the Post-Training team, you will develop and optimize systems that transform base models into the refined Claude models that users interact with. This role sits at the intersection of cutting-edge research and production engineering and focuses on implementing, scaling, and improving post-training techniques such as Constitutional AI, RLHF, and other alignment methodologies.

Responsibilities

Implement and optimize post-training techniques at scale on frontier models (e.g., RLHF, Constitutional AI, other alignment methods).
Design, build, and run robust, efficient pipelines for model fine-tuning and evaluation.
Develop tools and metrics to measure and improve model performance across multiple dimensions (capabilities, safety, alignment).
Collaborate with research teams to translate emerging techniques into production-ready implementations.
Debug complex issues in training pipelines and model behavior.
Help establish best practices for reliable, reproducible model post-training and deployment.

Requirements

Strong software engineering skills with experience building complex ML systems.
Comfortable working with large-scale distributed systems and high-performance computing environments.
Experience with training, fine-tuning, or evaluating large language models (LLMs).
Proficiency in Python and deep learning frameworks.
Experience with distributed computing and scaling training/evaluation pipelines.
Ability to balance research exploration with engineering rigor and operational reliability.
Strong debugging and analysis skills for model training processes.
Interest in AI safety and responsible deployment; experience with alignment methodologies is a plus.
Education: At least a Bachelor's degree in a related field or equivalent experience.

Logistics

Location: San Francisco, CA; New York City, NY; Seattle, WA (hybrid policy: staff are expected to be in an office at least ~25% of the time).
Visa sponsorship: Anthropic may sponsor visas and retains immigration counsel. Sponsorship is not guaranteed for every role or candidate, but the company will make reasonable efforts if an offer is made.
The team welcomes candidates at various experience levels, with a preference for senior engineers who have hands-on experience with frontier AI systems.

Benefits

Competitive compensation (see the salary range below) and benefits.
Optional equity donation matching.
Generous vacation and parental leave.
Flexible working hours and collaborative office spaces.
Guidance on appropriate use of AI in the application process and emphasis on a diverse and inclusive hiring process.

Salary

Annual salary range: $315,000 - $340,000 USD.