Staff Software Engineer, GenAI Systems

at Airbnb

📍 United States

USD 204,000-255,000 per year

SENIOR

✅ Hybrid

Required Skills & Competences

Software Development @ 4
Go @ 7
Python @ 7
Scala @ 7
Java @ 7
Machine Learning @ 4
Leadership @ 7
Communication @ 6
Technical Leadership @ 7
Experimentation @ 3
Customer Support @ 4
LLM @ 4

Details

Airbnb Core Machine Learning (Community Support Platform) is adopting Generative AI to enable an intelligent, scalable Customer Support experience. The AI Assistant Product Evaluation team builds reliable, high-quality, and efficient evaluation solutions and a cohesive suite of observability and testability tools to accelerate AI model development and improve the AI Assistant product experience.

Responsibilities

Develop and optimize scalable engineering evaluation frameworks and systems for Airbnb’s Generative AI products.
Work closely with Core Modeling engineers to understand LLM development pain points, and develop scalable, efficient LLM-as-a-judge solutions and models that address metric-related challenges.
Design, productionize, and optimize end-to-end data systems to improve the effectiveness and efficiency of AI evaluation automation frameworks.
Collaborate with ML infrastructure engineering teams to evolve how evaluation frameworks are built and tested for Airbnb Conversational AI products.
Lead all phases of software development including architecture design, implementation, and testing.
Partner with product managers, data scientists, and operations to translate business requirements into scalable engineering solutions, drive engineering decisions, and quantify impact.
Foster engineering excellence by supporting teammates in writing high-quality code, ensuring operational reliability, and sharing knowledge across the team.

Requirements

9+ years of industry experience in applied machine learning with a track record of technical leadership and delivering complex, high-impact AI/ML systems.
MS or PhD in Computer Science, Machine Learning, Artificial Intelligence, or a related technical field.
Deep expertise in Large Language Models (LLMs), including experience with LLM model evaluation methodologies and agent-based applications.
Solid programming skills in Python and at least one other language (e.g., Java, Go, or Scala); strong foundation in software design, testing, and code quality.
Strong AI/ML system design skills and experience building scalable, extensible AI systems.
Familiarity with ML infrastructure and operations, including model deployment, serving, monitoring, and experimentation.
Proven ability to work in cross-functional teams (modeling engineers, product managers, data scientists, operations) to deliver end-to-end solutions.
Excellent communication, mentorship, and technical leadership skills; ability to drive alignment and influence engineering culture across teams.

Location

This position is US - Remote Eligible. The role may include occasional work at an Airbnb office or attendance at offsites, as agreed with your manager. You must live in a U.S. state where Airbnb, Inc. has a registered entity.

Compensation & Benefits

Base pay range: $204,000 to $255,000 USD.
This role may also be eligible for bonus, equity, benefits, and Employee Travel Credits.

Inclusion & Accessibility

Airbnb encourages diverse applicants and provides accommodations for candidates with disabilities during the application and interview process ([email protected]).