Research Scientist/Engineer, Alignment Finetuning

at Anthropic

📍 San Francisco, United States

USD 315,000-340,000 per year

MIDDLE

✅ Hybrid

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ^?

Python @ 6 Machine Learning @ 3 Communication @ 6 Experimentation @ 3

Details

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems that are safe and beneficial for users and society. The team includes researchers, engineers, policy experts, and business leaders dedicated to building beneficial AI systems.

Responsibilities

Develop and implement novel finetuning techniques using synthetic data generation and advanced training pipelines
Train models to have improved alignment properties such as honesty, character, and harmlessness
Create and maintain evaluation frameworks to measure model alignment properties
Collaborate across teams to integrate alignment improvements into production models
Develop processes to automate and scale team workflows

Requirements

MS/PhD in Computer Science, Machine Learning, or related field, or equivalent experience
Strong programming skills in Python
Experience with ML model training and experimentation
Proven track record of implementing ML research
Strong analytical skills for interpreting experimental results
Experience with ML metrics and evaluation frameworks
Ability to convert research ideas into production code
Problem-solving skills to address practical implementation challenges

Strong candidates may also have

Experience with language model finetuning
Background in AI alignment research
Publications in machine learning or alignment
Experience with synthetic data generation
Familiarity with RLHF, constitutional AI, reward modeling
Experience designing novel training approaches
Experience in model behavior evaluation and improvement

Logistics

Education: at least Bachelor's degree or equivalent experience
Location-based hybrid policy: staff expected to be on-site at least 25% of the time
Visa sponsorship available with legal support

Benefits

Competitive compensation, equity donation matching, generous vacation and parental leave, flexible hours, and collaborative office space in San Francisco.

The team values collaborative, high-impact AI research with strong communication skills and encourages applications from diverse backgrounds.