Used Tools & Technologies
Not specified
Required Skills & Competences ?
Python @ 5 Distributed Systems @ 3 Hiring @ 3 Debugging @ 6Details
Anthropic’s production models undergo sophisticated post-training processes to enhance their capabilities, alignment, and safety. As a Research Engineer on the Post-Training team, you will develop and optimize systems that transform base models into the refined Claude models that users interact with. This role sits at the intersection of cutting-edge research and production engineering and focuses on implementing, scaling, and improving post-training techniques such as Constitutional AI, RLHF, and other alignment methodologies.
Responsibilities
- Implement and optimize post-training techniques at scale on frontier models (e.g., RLHF, Constitutional AI, other alignment methods).
- Design, build, and run robust, efficient pipelines for model fine-tuning and evaluation.
- Develop tools and metrics to measure and improve model performance across multiple dimensions (capabilities, safety, alignment).
- Collaborate with research teams to translate emerging techniques into production-ready implementations.
- Debug complex issues in training pipelines and model behavior.
- Help establish best practices for reliable, reproducible model post-training and deployment.
Requirements
- Strong software engineering skills with experience building complex ML systems.
- Comfortable working with large-scale distributed systems and high-performance computing environments.
- Experience with training, fine-tuning, or evaluating large language models (LLMs).
- Proficiency in Python and deep learning frameworks.
- Experience with distributed computing and scaling training/evaluation pipelines.
- Ability to balance research exploration with engineering rigor and operational reliability.
- Strong debugging and analysis skills for model training processes.
- Interest in AI safety and responsible deployment; experience with alignment methodologies is a plus.
- Education: At least a Bachelor's degree in a related field or equivalent experience.
Logistics
- Location: San Francisco, CA; New York City, NY; Seattle, WA (hybrid policy — staff expected to be in an office at least ~25% of the time).
- Visa sponsorship: Anthropic may sponsor visas and retains immigration counsel; sponsorship is not guaranteed for every role/candidate but the company will make reasonable efforts if an offer is made.
- The team welcomes candidates at various experience levels, with a preference for senior engineers who have hands-on experience with frontier AI systems.
Benefits
- Competitive compensation (listed range below) and benefits.
- Optional equity donation matching.
- Generous vacation and parental leave.
- Flexible working hours and collaborative office spaces.
- Guidance on appropriate use of AI in the application process and emphasis on a diverse and inclusive hiring process.
Salary
- Annual salary range: $315,000 - $340,000 USD.