Model Quality Software Engineer, Claude Code

at Anthropic

📍 New York City, United States
📍 San Francisco, United States

USD 320,000-485,000 per year

MIDDLE

✅ Hybrid

✅ Visa Sponsorship

Used Tools & Technologies

Not specified

Required Skills & Competences
Tag name is followed by "@" symbol and proficiency level value. About proficiency levels:

1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;

3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;

7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;

10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.

TypeScript @ 3 Python @ 3 Communication @ 6 Claude Code @ 3 AI @ 3 Reinforcement Learning @ 3

Details

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

This role sits at the intersection of engineering and research on the Claude Code team. You will collaborate directly with Anthropic's researchers to improve Claude’s coding capabilities through tooling, infrastructure, and evaluations. You will build systems that help the team understand where Claude Code excels and where it falls short—and then help close those gaps.

Responsibilities

Design and build eval systems that measure model capabilities across diverse coding tasks
Build tooling and infrastructure that enables researchers to run experiments at scale
Develop pipelines for data collection, processing, and analysis
Create internal tools that improve researcher productivity and accelerate iteration cycles
Serve as a bridge between product and research—bring strong product intuition to inform which capabilities matter most
Work closely with researchers to translate research questions into engineering solutions
Own systems end-to-end—from design through production reliability

Requirements / Qualifications

At least a Bachelor's degree in a related field or equivalent experience
At least 5 years of work experience
Experience building and owning complex systems—pipelines, infrastructure, or software that orchestrates many components and handles significant state and logic
Comfortable taking full ownership of problems and driving them to completion independently
Strong focus on correctness and reliability in system design and implementation
Comfortable diving into unfamiliar technical domains and figuring things out quickly
Strong communication skills and ability to collaborate with researchers and product teams

Strong candidates may also have experience with

Writing or maintaining eval/evaluation frameworks
Reinforcement learning systems
Research computing or scientific infrastructure
Working in high-performance, demanding environments (trading firms, quant funds, competitive research labs, or fast-moving startups)
Strong quantitative foundation (math, physics, or related fields)
Python and TypeScript

Compensation

Annual Salary: $320,000 - $485,000 USD

Logistics

Locations listed: San Francisco, CA and New York City, NY
Location-based hybrid policy: staff are expected to be in one of our offices at least 25% of the time; some roles may require more time in office
Education: at least a Bachelor's degree in a related field or equivalent experience
Visa sponsorship: Anthropic does sponsor visas and retains an immigration lawyer to help, though not every role/candidate can be successfully sponsored

How we're different

Anthropic works as a single cohesive team on a few large-scale research efforts and values impact, collaboration, and strong communication. The team treats AI research as an empirical science with connections to physics and biology and hosts frequent research discussions to focus on high-impact work.

Additional notes

The posting encourages applicants from diverse and underrepresented groups and asks candidates to confirm understanding of AI usage guidance during the application process.
The application form asks whether candidates are open to working in-person in one of Anthropic's offices 25% of the time and whether they require visa sponsorship or relocation.