Model Quality Software Engineer, Claude Code

USD 320,000-485,000 per year
MIDDLE
✅ Hybrid
✅ Visa Sponsorship

Used Tools & Technologies

Not specified

Required Skills & Competences

TypeScript @ 3 Python @ 3 Communication @ 6 Claude Code @ 3 AI @ 3 Reinforcement Learning @ 3

Details

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

This role sits at the intersection of engineering and research on the Claude Code team. You will collaborate directly with Anthropic's researchers to improve Claude’s coding capabilities through tooling, infrastructure, and evaluations. You will build systems that help the team understand where Claude Code excels and where it falls short—and then help close those gaps.

Responsibilities

  • Design and build eval systems that measure model capabilities across diverse coding tasks
  • Build tooling and infrastructure that enables researchers to run experiments at scale
  • Develop pipelines for data collection, processing, and analysis
  • Create internal tools that improve researcher productivity and accelerate iteration cycles
  • Serve as a bridge between product and research—bring strong product intuition to inform which capabilities matter most
  • Work closely with researchers to translate research questions into engineering solutions
  • Own systems end-to-end—from design through production reliability

Requirements / Qualifications

  • At least a Bachelor's degree in a related field or equivalent experience
  • At least 5 years of work experience
  • Experience building and owning complex systems—pipelines, infrastructure, or software that orchestrates many components and handles significant state and logic
  • Comfortable taking full ownership of problems and driving them to completion independently
  • Strong focus on correctness and reliability in system design and implementation
  • Comfortable diving into unfamiliar technical domains and figuring things out quickly
  • Strong communication skills and ability to collaborate with researchers and product teams

Strong candidates may also have experience with

  • Writing or maintaining eval/evaluation frameworks
  • Reinforcement learning systems
  • Research computing or scientific infrastructure
  • Working in high-performance, demanding environments (trading firms, quant funds, competitive research labs, or fast-moving startups)
  • Strong quantitative foundation (math, physics, or related fields)
  • Python and TypeScript

Compensation

Annual Salary: $320,000 - $485,000 USD

Logistics

  • Locations listed: San Francisco, CA and New York City, NY
  • Location-based hybrid policy: staff are expected to be in one of our offices at least 25% of the time; some roles may require more time in office
  • Education: at least a Bachelor's degree in a related field or equivalent experience
  • Visa sponsorship: Anthropic does sponsor visas and retains an immigration lawyer to help, though not every role/candidate can be successfully sponsored

How we're different

Anthropic works as a single cohesive team on a few large-scale research efforts and values impact, collaboration, and strong communication. The team treats AI research as an empirical science with connections to physics and biology and hosts frequent research discussions to focus on high-impact work.

Additional notes

  • The posting encourages applicants from diverse and underrepresented groups and asks candidates to confirm understanding of AI usage guidance during the application process.
  • The application form asks whether candidates are open to working in-person in one of Anthropic's offices 25% of the time and whether they require visa sponsorship or relocation.