Research Engineer, Computer Use

USD 500,000-850,000 per year
MIDDLE
✅ Hybrid
✅ Visa Sponsorship

Used Tools & Technologies

Not specified

Required Skills & Competences

Python @ 5 Machine Learning @ 3 Communication @ 6 AI @ 3 Reinforcement Learning @ 3

Details

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

Role overview

The Computer Use team focuses on teaching Claude to see, use, and understand computer interfaces. As a Research Engineer on the team, you'll work on advancing our models' ability to reliably and safely operate real software. Your work will translate directly into model improvements in our own and our customers' products. You can try Claude's computer use capabilities today through the Claude in Chrome extension and Claude Cowork.

Responsibilities

  • Design and run experiments to improve Claude's perception and agentic capabilities
  • Develop robust, reliable evaluation frameworks for measuring our models' ability to complete complex computer tasks
  • Build and improve computer use and vision reinforcement learning training environments
  • Create pipelines and tools to test and validate complex RL environments
  • Collaborate with teams across the model training and infrastructure stack to improve our production training setup
  • Partner with product teams to bring research advances into production

Minimum Qualifications

  • Software engineering experience and proficiency in Python
  • Experience training, fine-tuning, or evaluating machine learning models
  • Strong communication skills and a collaborative working style
  • Care about the societal impacts and safety of your work
  • Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience

Preferred Qualifications

  • Experience training models for computer use or other agentic capabilities
  • Experience with reinforcement learning, particularly in long-horizon or sparse-reward settings
  • Familiarity with multimodal model training
  • Experience building evaluations or benchmarks for agentic systems
  • Experience building reinforcement learning environments, simulation systems, or large-scale ML infrastructure
  • Experience working closely with product teams to drive model improvements

Compensation

Annual Salary: $500,000 - $850,000 USD

Location & Office Policy

Locations listed: San Francisco, CA; New York City, NY; Seattle, WA

Location-based hybrid policy: currently, staff are expected to be in one of our offices at least 25% of the time.

Logistics & Other Information

  • Years of experience required will correlate with the internal job level requirements for the position
  • Visa sponsorship: We do sponsor visas and retain an immigration lawyer; sponsorship is not guaranteed for every role/candidate but the company will make reasonable efforts if an offer is made.
  • Anthropic offers competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and office space for collaboration.

Company & Culture

Anthropic is a public benefit corporation headquartered in San Francisco. The company emphasizes large-scale empirical AI research, collaboration, and communication skills. They encourage applicants from diverse backgrounds and provide guidance on candidate AI usage during the application process.