Privacy Research Engineer, Safeguards

at Anthropic

📍 San Francisco, United States

USD 320,000-485,000 per year

MIDDLE

✅ Hybrid

✅ Visa Sponsorship

Used Tools & Technologies

Not specified

Required Skills & Competences

Security @ 3, Python @ 6, Algorithms @ 3, Machine Learning @ 3, Communication @ 3, PyTorch @ 2

Details

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. The Safeguards team seeks researchers to mitigate risks from model interactions with private user data by designing and implementing privacy-preserving techniques, auditing current approaches, and setting the direction for how Anthropic handles privacy more broadly.

Responsibilities

Lead privacy analysis of frontier models, auditing the use of data and ensuring safety throughout the process
Develop privacy-first training algorithms and techniques
Develop evaluation and auditing techniques to measure the privacy of training algorithms
Work with a small, senior team of engineers and researchers to enact a forward-looking privacy policy
Advocate on behalf of users to ensure responsible handling of all data

Requirements

Experience working on privacy-preserving machine learning
Strong coding skills in Python
Familiarity with ML frameworks such as PyTorch or JAX
Deep familiarity with large language models (how they work and how they are trained)
Experience with privacy-preserving techniques (for example, an understanding of differential privacy and how it differs from k-anonymity, l-diversity, and t-closeness)
Track record of shipping products and features in a fast-moving environment
Experience supporting fast-paced startup engineering teams
Demonstrated ability to bring clarity and ownership to ambiguous technical problems
Proven ability to lead cross-functional security or privacy initiatives and navigate complex organizational dynamics

Strong candidates may also have

Publications on privacy-preserving ML at top academic venues
Prior experience training large language models (dataset collection, pre-training, fine-tuning, RL fine-tuning, running evaluations)
Prior experience developing tooling to support privacy-preserving ML (e.g., differential privacy tooling such as TF-Privacy or Opacus)

Compensation

Annual salary range: $320,000 - $485,000 USD
Total compensation package includes equity and benefits

Logistics & Qualifications

Minimum: Bachelor’s degree in a related field or equivalent experience
Location: San Francisco, CA
Location-based hybrid policy: staff are expected to be in one of Anthropic’s offices at least ~25% of the time (some roles may require more time in the office)
Visa sponsorship: Anthropic sponsors visas and retains an immigration lawyer to assist, though sponsorship is not guaranteed for every role or candidate

Benefits & Culture

Competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and office space in San Francisco
Collaborative research culture valuing high-impact, empirical AI work and communication skills

How to apply

The standard application asks for a resume/CV or LinkedIn profile, whether you require visa sponsorship, and whether you are open to working in person in an office ~25% of the time