Research Scientist/Engineer, Alignment Finetuning

USD 315,000-340,000 per year
MIDDLE
✅ Hybrid

Required Skills & Competences

  • Python @ 6
  • Machine Learning @ 3
  • Communication @ 6
  • Experimentation @ 3

Details

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems that are safe and beneficial for users and society. The team includes researchers, engineers, policy experts, and business leaders dedicated to building beneficial AI systems.

Responsibilities

  • Develop and implement novel finetuning techniques using synthetic data generation and advanced training pipelines
  • Train models to have improved alignment properties such as honesty, character, and harmlessness
  • Create and maintain evaluation frameworks to measure model alignment properties
  • Collaborate across teams to integrate alignment improvements into production models
  • Develop processes to automate and scale team workflows

Requirements

  • MS/PhD in Computer Science, Machine Learning, or related field, or equivalent experience
  • Strong programming skills in Python
  • Experience with ML model training and experimentation
  • Proven track record of implementing ML research
  • Strong analytical skills for interpreting experimental results
  • Experience with ML metrics and evaluation frameworks
  • Ability to convert research ideas into production code
  • Problem-solving skills to address practical implementation challenges

Strong candidates may also have

  • Experience with language model finetuning
  • Background in AI alignment research
  • Publications in machine learning or alignment
  • Experience with synthetic data generation
  • Familiarity with RLHF, Constitutional AI, and reward modeling
  • Experience designing novel training approaches
  • Experience in model behavior evaluation and improvement

Logistics

  • Education: at least a Bachelor's degree or equivalent experience
  • Location-based hybrid policy: staff are expected to be on-site at least 25% of the time
  • Visa sponsorship available with legal support

Benefits

Competitive compensation, equity donation matching, generous vacation and parental leave, flexible hours, and collaborative office space in San Francisco.

The team values collaborative, high-impact AI research and strong communication skills, and encourages applications from diverse backgrounds.