Staff Technical Program Manager, Reliability & Observability

at Airbnb
USD 194,000-242,000 per year
SENIOR
✅ Remote ✅ Hybrid

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

A/B Testing @ 3 Leadership @ 4 Communication @ 4

Details

Airbnb is seeking a technical, hands-on, and mission-driven Staff Technical Program Manager (TPM) to lead Reliability & Observability initiatives. You will partner with engineering and product teams, including ML engineers and infrastructure engineers, to build and enhance data-driven decision making across Airbnb. The role focuses on creating and maintaining frameworks and platforms for proactive monitoring, alerting, logging, tracing, and incident management to ensure robust, highly available, and transparent platform operations.

Responsibilities

  • Shape and influence technical direction of projects while meeting stakeholder needs and maintaining high quality standards.
  • Rapidly prototype and validate project ideas through iterative development cycles; adapt based on insights and data.
  • Balance broad, outcome-driven thinking with attention to critical details; prioritize where deep focus is necessary for successful execution.
  • Define and secure stakeholder alignment on clear, measurable success criteria to accelerate AI initiatives.
  • Regularly assess risks and opportunities, and devise proactive mitigation strategies to maintain momentum and project success.
  • Maintain transparent and effective communication channels to keep stakeholders informed of progress, developments, and challenges.
  • Present outcomes and updates to senior leadership, articulating trade-offs, risks, and emerging opportunities.

Requirements

  • At least 10 years of work experience, with at least 8 years as a TPM or relevant experience.
  • Demonstrated ability to work through ambiguity to deliver detailed solutions: frame vague problems, identify a path forward, and drive initiatives to completion.
  • Self-motivated, proactive, and able to adapt to teams with different operating cadences.
  • Strong business judgment, ability to influence others, strong analytical skills, and a track record of ownership and data-driven analyses.
  • Experience with ML models, LLMs, LRMs, feature development, model testing, and resource management to support AI-powered product experiences.
  • Familiarity with A/B testing, incremental delivery, and deployment.
  • Ability to ramp up quickly and learn new technologies with minimal lag time.
  • Excellent written and verbal business communication and people skills; ability to influence stakeholders.

Team & Mission

  • The Reliability & Observability team enables safe, resilient, and transparent operation of critical systems powering Airbnb.
  • The team builds and maintains platforms and frameworks for monitoring, alerting, logging, tracing, and incident management to help teams across Airbnb maintain service health and remediate issues quickly.
  • You will collaborate closely with ML engineers, infrastructure engineers, and product managers to ensure the platform is robust, highly available, and transparent.

Location & Work Model

  • This position is US - Remote Eligible. The role may include occasional work at an Airbnb office or attendance at offsites, as agreed with your manager.
  • You must live in a U.S. state where Airbnb, Inc. has a registered entity (some states may be excluded).

Compensation & Benefits

  • Base pay range: $194,000 — $242,000 USD.
  • The role may also be eligible for bonus, equity, benefits, and Employee Travel Credits.

Inclusion

  • Airbnb is committed to inclusion and encourages all qualified individuals to apply. Reasonable accommodations are available for candidates with disabilities via [email protected].