Used Tools & Technologies
Not specified
Required Skills & Competences
Each tag name is followed by an "@" symbol and a proficiency level value.
About proficiency levels:
- 1-2: basic awareness. Minimal hands-on experience and a rudimentary understanding of the technology's purpose;
- 3-6: daily use. Comfortable, regular usage; capable of handling common tasks and challenges related to the technology;
- 7-9: expert. You can teach others and you know all the pitfalls and tricks;
- 10: exceptional knowledge. Comprehensive understanding of and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such a level.
Security @ 3
Distributed Systems @ 2
Leadership @ 3
Communication @ 3
AI @ 3
Details
About Anthropic
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. The team includes researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.
Role overview
We are looking for an Incident Response Manager to serve as the operational backbone of how Anthropic handles incidents. When things go wrong, this person makes sure the right people are in the room, the right information is flowing, and nothing falls through the cracks. The role brings structure and rigor to high-volume, high-stakes situations, working across engineering, product, security, legal, go-to-market, and leadership. The person in this role writes runbooks and operates effectively even when no runbook exists yet.
Responsibilities
- Build the incident response management function, establishing processes, tooling, and operational standards to handle incidents at scale
- Serve as an on-call incident commander, driving coordinated response across technical and non-technical stakeholders, including managing multiple active incidents simultaneously
- Engage the right people at the right time with urgency; bring order and direction to fast-moving, ambiguous situations
- Own incident communications end-to-end: internal coordination, external channels like status pages, direct customer outreach, and stakeholder updates
- Participate in blameless incident reviews; provide operational context and drive follow-through on critical remediations
- Partner with engineering teams to develop and maintain incident response policies, procedures, and escalation frameworks that scale with growth
- Partner with engineering, product, security, legal, and go-to-market teams to continuously improve detection, response, and learning from incidents
Requirements
- 5+ years of experience in incident management, with direct experience managing technical product or infrastructure incidents (not exclusively security or trust & safety)
- Experience building or significantly shaping an incident response program, ideally at a high-growth startup or where structure needed to be created rather than inherited
- Strong sense of ownership and urgency; ability to operate independently and make sound decisions under pressure
- Comfortable working in unprecedented situations where processes are still being defined and guidance may be incomplete or conflicting
- Track record of effective cross-functional collaboration (engineering, security, legal, communications, go-to-market, executive leadership)
- Blameless, learning-oriented approach to incident reviews, with a focus on systemic improvement
- Experience with cloud infrastructure incidents and sufficient technical depth across the stack to engage with engineering teams during response, including familiarity with distributed systems, monitoring tools, and logs
- Analytically minded; experience using data (incident metrics, queries, trend analysis) to inform response decisions and drive operational improvements
- Clear and calm communicator under pressure, both in real-time coordination and in post-incident written communications
- Ability to thrive in high-volume, fast-paced environments and bring operational discipline to complex, evolving situations
Logistics
- Annual salary: $290,000 - $365,000 USD
- Minimum education: Bachelor’s degree or equivalent combination of education, training, and/or experience
- Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience
- Location-based hybrid policy: staff are expected to be in one of the offices at least 25% of the time (some roles may require more time in office)
- Visa sponsorship: Anthropic states that it sponsors visas and that, when an offer is made, it will make reasonable efforts to secure one and will retain an immigration lawyer to assist
Why Anthropic / How we're different
Anthropic pursues high-impact AI research as a cohesive team focused on a few large-scale efforts. The organization values impact, collaboration, and communication. Research directions include work related to GPT-3, circuit-based interpretability, multimodal neurons, scaling laws, and AI safety.
Benefits
Anthropic offers competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and office space for collaboration.