Used Tools & Technologies
Not specified
Required Skills & Competences ?
Docker @ 4 Kafka @ 4 Kubernetes @ 4 MySQL @ 4 Python @ 7 Scala @ 4 Spark @ 4 ETL @ 4 Java @ 7 Airflow @ 4 Distributed Systems @ 7 Flink @ 4 Machine Learning @ 4 TensorFlow @ 4 AWS @ 4 Communication @ 4 PostgreSQL @ 4 MLFlow @ 4 Hadoop @ 4 PyTorch @ 4Details
Airbnb was born in 2007 and has grown to over 5 million hosts who have welcomed over 2 billion guest arrivals globally. The ML Infrastructure team provides shared foundations for modeling, data, governance and productivity to ensure Airbnb’s AI/ML models and applications are built with high industry standards.
Responsibilities
- Design, build, automate, and maintain robust, scalable data pipelines using SparkSQL, Scala, and Airflow.
- Develop and optimize data models ensuring high-quality, consistent, and accurate data to support AI/ML product feature decisions.
- Collaborate closely with peer ML Infra teams to deliver automated data solutions driving AI/ML acceleration.
- Contribute to scalable Generative AI infrastructure by leveraging foundational language and vision models to create high-quality datasets for GenAI applications.
- Partner with customer teams to deliver high-impact, high-quality datasets core to Airbnb's roadmap.
- Utilize and integrate open-source and infra technologies including Spark, Airflow, Ray, MLflow, TensorFlow, PyTorch, Docker, and Kubernetes.
Requirements / Qualifications
- 5+ years of relevant industry experience (BS/Masters) or 2+ years with a PhD.
- Strong coding skills in Python, Java, or equivalent languages.
- Hands-on experience with distributed processing technologies such as Spark, Kafka, Flink, Hadoop and distributed storage like HDFS and S3.
- Solid knowledge of data warehousing concepts and databases (PostgreSQL, MySQL, Redshift, BigQuery, ClickHouse).
- Expertise building scalable ETL pipelines using schedulers like Airflow, Luigi, Oozie, or AWS Glue.
- Proven ability to analyze large datasets, identify insights, and drive impactful product solutions.
- Experience building end-to-end Machine Learning platforms and deploying ML models.
- Familiarity with Kubernetes, Docker, and modern infrastructure tools.
- Deep understanding of distributed systems and engineering best practices.
- Excellent written and verbal communication skills; comfortable collaborating cross-functionally.
Location & Work Arrangement
- This position is US - Remote Eligible. The role may include occasional work at an Airbnb office or attendance at offsites, as agreed with your manager.
- Candidates must live in a U.S. state where Airbnb, Inc. has a registered entity (some states are excluded).
Compensation & Benefits
- Base pay range: $191,000 — $223,000 USD.
- This role may also be eligible for bonus, equity, benefits, and Employee Travel Credits. Base pay depends on factors such as training, transferable skills, work experience, business needs and market demands.
Inclusion
- Airbnb encourages applications from a diverse talent pool and provides disability-inclusive application and interview processes. Reasonable accommodation requests can be sent to [email protected].