Staff Software Engineer, GenAI Systems

at Airbnb
πŸ“ United States
USD 204,000-255,000 per year
SENIOR
✅ Hybrid

Used Tools & Technologies

Not specified

Required Skills & Competences

  • Software Development @ 4
  • Go @ 7
  • Python @ 7
  • Scala @ 7
  • Java @ 7
  • Machine Learning @ 4
  • Leadership @ 7
  • Communication @ 6
  • Technical Leadership @ 7
  • Experimentation @ 3
  • Customer Support @ 4
  • LLM @ 4

Details

Airbnb Core Machine Learning (Community Support Platform) is adopting Generative AI to enable an intelligent, scalable Customer Support experience. The AI Assistant Product Evaluation team builds reliable, high-quality, and efficient evaluation solutions and a cohesive suite of observability and testability tools to accelerate AI model development and improve the AI Assistant product experience.

Responsibilities

  • Develop and optimize scalable engineering evaluation frameworks and systems for Airbnb’s Generative AI products.
  • Work closely with Core Modeling engineers to understand LLM development pain points and develop LLM-as-a-judge solutions and models to address metric-related challenges in a scalable and efficient way.
  • Design, productionize, and optimize end-to-end data systems to improve the effectiveness and efficiency of AI evaluation automation frameworks.
  • Collaborate with ML infrastructure engineering teams to evolve how evaluation frameworks are built and tested for Airbnb Conversational AI products.
  • Lead all phases of software development including architecture design, implementation, and testing.
  • Partner with product managers, data scientists, and operations to translate business requirements into scalable engineering solutions, drive engineering decisions, and quantify impact.
  • Foster engineering excellence by supporting teammates in writing high-quality code, ensuring operational reliability, and sharing knowledge across the team.

Requirements

  • 9+ years of industry experience in applied machine learning with a track record of technical leadership and delivering complex, high-impact AI/ML systems.
  • MS or PhD in Computer Science, Machine Learning, Artificial Intelligence, or a related technical field.
  • Deep expertise in Large Language Models (LLMs), including experience with LLM evaluation methodologies and agent-based applications.
  • Solid programming skills in Python and at least one other language (e.g., Java, Go, or Scala); strong foundation in software design, testing, and code quality.
  • Strong AI/ML system design skills and experience building scalable, extensible AI systems.
  • Familiarity with ML infrastructure and operations, including model deployment, serving, monitoring, and experimentation.
  • Proven ability to work in cross-functional teams (modeling engineers, product managers, data scientists, operations) to deliver end-to-end solutions.
  • Excellent communication, mentorship, and technical leadership skills; ability to drive alignment and influence engineering culture across teams.

Location

  • This position is US - Remote Eligible. The role may include occasional work at an Airbnb office or attendance at offsites, as agreed with your manager. You must live in a U.S. state where Airbnb, Inc. has a registered entity.

Compensation & Benefits

  • Base pay range: $204,000-$255,000 USD.
  • This role may also be eligible for bonus, equity, benefits, and Employee Travel Credits.

Inclusion & Accessibility

  • Airbnb encourages diverse applicants and provides accommodations for candidates with disabilities during the application and interview process ([email protected]).