Research Engineer, Production Model Post Training

USD 315,000-340,000 per year
MIDDLE
✅ Hybrid

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Python @ 5 Distributed Systems @ 3 Hiring @ 3 Debugging @ 6

Details

Anthropic’s production models undergo sophisticated post-training processes to enhance their capabilities, alignment, and safety. As a Research Engineer on the Post-Training team, you will develop and optimize systems that transform base models into the refined Claude models that users interact with. This role sits at the intersection of cutting-edge research and production engineering and focuses on implementing, scaling, and improving post-training techniques such as Constitutional AI, RLHF, and other alignment methodologies.

Responsibilities

  • Implement and optimize post-training techniques at scale on frontier models (e.g., RLHF, Constitutional AI, other alignment methods).
  • Design, build, and run robust, efficient pipelines for model fine-tuning and evaluation.
  • Develop tools and metrics to measure and improve model performance across multiple dimensions (capabilities, safety, alignment).
  • Collaborate with research teams to translate emerging techniques into production-ready implementations.
  • Debug complex issues in training pipelines and model behavior.
  • Help establish best practices for reliable, reproducible model post-training and deployment.

Requirements

  • Strong software engineering skills with experience building complex ML systems.
  • Comfortable working with large-scale distributed systems and high-performance computing environments.
  • Experience with training, fine-tuning, or evaluating large language models (LLMs).
  • Proficiency in Python and deep learning frameworks.
  • Experience with distributed computing and scaling training/evaluation pipelines.
  • Ability to balance research exploration with engineering rigor and operational reliability.
  • Strong debugging and analysis skills for model training processes.
  • Interest in AI safety and responsible deployment; experience with alignment methodologies is a plus.
  • Education: At least a Bachelor's degree in a related field or equivalent experience.

Logistics

  • Location: San Francisco, CA; New York City, NY; Seattle, WA (hybrid policy — staff expected to be in an office at least ~25% of the time).
  • Visa sponsorship: Anthropic may sponsor visas and retains immigration counsel; sponsorship is not guaranteed for every role/candidate but the company will make reasonable efforts if an offer is made.
  • The team welcomes candidates at various experience levels, with a preference for senior engineers who have hands-on experience with frontier AI systems.

Benefits

  • Competitive compensation (listed range below) and benefits.
  • Optional equity donation matching.
  • Generous vacation and parental leave.
  • Flexible working hours and collaborative office spaces.
  • Guidance on appropriate use of AI in the application process and emphasis on a diverse and inclusive hiring process.

Salary

  • Annual salary range: $315,000 - $340,000 USD.