Used Tools & Technologies
Not specified
Required Skills & Competences
Each tag name is followed by an "@" symbol and a proficiency level.
About proficiency levels:
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such a level.
Python @ 5
Machine Learning @ 3
Communication @ 6
AI @ 3
Reinforcement Learning @ 3
Details
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.
Role overview
The Computer Use team focuses on teaching Claude to see, use, and understand computer interfaces. As a Research Engineer on the team, you'll work on advancing our models' ability to reliably and safely operate real software. Your work will translate directly into model improvements in our own and our customers' products. You can try Claude's computer use capabilities today through the Claude in Chrome extension and Claude Cowork.
Responsibilities
- Design and run experiments to improve Claude's perception and agentic capabilities
- Develop robust, reliable evaluation frameworks for measuring our models' ability to complete complex computer tasks
- Build and improve computer use and vision reinforcement learning training environments
- Create pipelines and tools to test and validate complex RL environments
- Collaborate with teams across the model training and infrastructure stack to improve our production training setup
- Partner with product teams to bring research advances into production
Minimum Qualifications
- Software engineering experience and proficiency in Python
- Experience training, fine-tuning, or evaluating machine learning models
- Strong communication skills and a collaborative working style
- Concern for the societal impacts and safety of your work
- Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience
Preferred Qualifications
- Experience training models for computer use or other agentic capabilities
- Experience with reinforcement learning, particularly in long-horizon or sparse-reward settings
- Familiarity with multimodal model training
- Experience building evaluations or benchmarks for agentic systems
- Experience building reinforcement learning environments, simulation systems, or large-scale ML infrastructure
- Experience working closely with product teams to drive model improvements
Compensation
Annual Salary: $500,000 - $850,000 USD
Location & Office Policy
Locations listed: San Francisco, CA; New York City, NY; Seattle, WA
Location-based hybrid policy: currently, staff are expected to be in one of our offices at least 25% of the time.
Logistics & Other Information
- Years of experience required will correlate with the internal job level requirements for the position
- Visa sponsorship: We sponsor visas and retain an immigration lawyer. Sponsorship is not guaranteed for every role or candidate, but we will make reasonable efforts if an offer is made.
- Anthropic offers competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and office space for collaboration.
Company & Culture
Anthropic is a public benefit corporation headquartered in San Francisco. The company emphasizes large-scale empirical AI research, collaboration, and communication skills. It encourages applicants from diverse backgrounds and provides guidance on candidates' AI usage during the application process.