This document describes Localebnb, a contextual recommender system that uses Airbnb listing descriptions to predict neighborhood traits and re-rank search results according to a user's preferences. It aims to increase user satisfaction and booking rates by improving relevance. The system scrapes listing data, cleans the descriptions with NLP techniques (TF-IDF vectorization, word2vec), and trains classifiers such as Naïve Bayes, SVM, and random forests to predict traits. Initial results show 78-82% accuracy with SVM, a 5-point lift over Naïve Bayes. Proposed extensions include scraping more cities, adding more listing information, and giving partial weight to nearby neighborhoods via graph analytics.
16. Methodology
• ETL: Scraped search results & listings; scraped neighborhood traits (Beautiful Soup)
• Prepping: Cleaned documents with NLTK (lemmatization, expand contractions, et al.)
• Modeling: TF-IDF vectorization; Word2Vec / Doc2Vec; Naïve Bayes, SVM, Random Forest / GBC
• Rank/Sort: Implemented custom scoring function (inspired by Google Search CTR by position)
17. Insights
• SVM: 78-82% accuracy, 5 pt lift over Naïve Bayes; infrequent words add value
• Random Forest: Neighborhood names dominate feature importance ('artsy' model key words)
• TF-IDF: Airbnb is for foodies (doc frequency)
18. Extensions
• Scrape more descriptions across more cities
• Include additional listing information in models
• Make neighborhood traits more fluid
• Give partial weight to nearby neighborhoods utilizing graph analytics
How Airbnb could benefit:
• Guide creation of neighborhood guides in new cities
19. Thank You
Go to Localebnb.co to try it for yourself.
@gscottstukey
Editor's Notes
Hello everyone.
My name is G Scott Stukey, and I’d love to share with you my project: Localebnb – An Airbnb Contextual Recommender.
I’m going to go over the background of my project, dive into using my app, and then share the methodology & insights from this project.
[next slide]
The motivation behind the project was driven by the question:
“When booking a private residence, how do you find the perfect neighborhood?”
[next slide]
The problem I found with Airbnb’s search results is that there’s no ability to directly search or filter by neighborhood trait. [click]
They only have the neighborhood names.
Personal Story - when I was trying to book a trip to Montreal, I knew the type of neighborhood I wanted to stay in: somewhere a little more ‘hipster’ with great dining, away from the touristy spots. I ended up having an amazing experience staying at a converted loft in the De Lorimier neighborhood, but only after having to research a multitude of sources.
[next slide]
My hypothesis was that, by using Airbnb listing descriptions, I could predict the traits of a neighborhood, and then customize Airbnb's default search results to a user's preferences.
From a business standpoint, by implementing this, Airbnb could increase user satisfaction by making its search results more relevant, and potentially increase bookings by reducing bounces caused by click fatigue.
[next slide]
The solution came from bubbling up information from the neighborhood guide, which is currently buried on their site. It contains amazing information about neighborhoods of select cities. [click]
I took the listing descriptions as my input features, mapped each listing’s neighborhood to the neighborhood guide, and used the neighborhood traits as my target variables. For Localebnb, I focused on a subset of 4 of the traits – ‘artsy’, ‘dining’, ‘shopping’ & ‘nightlife’.
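That mapping from listing to neighborhood to trait labels can be sketched as below; the guide contents and helper name are illustrative, not the project's actual data or code.

```python
# Hypothetical sketch of target construction: each listing's neighborhood is
# looked up in the scraped neighborhood guide, yielding binary labels for the
# four traits the project focused on (guide entries here are made up).
GUIDE = {  # neighborhood -> traits scraped from Airbnb's neighborhood guide
    "Mission": {"artsy", "dining", "nightlife"},
    "Noe Valley": {"shopping"},
}
TRAITS = ["artsy", "dining", "shopping", "nightlife"]

def trait_targets(neighborhood):
    """Return a 0/1 target per trait for a listing in this neighborhood."""
    tags = GUIDE.get(neighborhood, set())
    return {t: int(t in tags) for t in TRAITS}

print(trait_targets("Mission"))
# → {'artsy': 1, 'dining': 1, 'shopping': 0, 'nightlife': 1}
```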
Now, let's dive into the app.
[next slide]
This is Localebnb. The home page is a simple search page. [click]
Up top - you’re able to put in your search information. [click]
Down below, you’re able to select your preferences for the various neighborhood traits. Here, the user appears to enjoy artsy & shopping neighborhoods, while avoiding nightlife.
When the user clicks search…
[next slide]
…the search results appear. The app scraped Airbnb search results for listings, scraped each of those listings, predicted if each listing has a specific trait, then scored & re-sorted the search results based on the user’s preferences. [click]
When a user hovers over a listing, additional information about the listing pops up. [click]
The app also allows the user to change their preferences. Here we see the user increase their preference for dining. [click] When they do this, the app auto-updates its results.
The user then found this gem in the Castro. Previously, this listing was at number 10, below the fold. Localebnb helped bubble it higher up on the results page. This is a great example of the app's benefits.
Now let's look at the methodology behind the project.
[next slide]
Here you can see my project’s pipeline:
As I mentioned earlier, I scraped ~4000 listings across SF & NYC and mapped them to the neighborhood traits.
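The scraping step might look something like the following Beautiful Soup sketch, run here on an inline HTML snippet; the class names are assumptions for illustration, not Airbnb's actual markup.

```python
# Minimal Beautiful Soup sketch of extracting listing titles & descriptions.
# The HTML structure and class names below are hypothetical.
from bs4 import BeautifulSoup

html = """
<div class="listing"><span class="title">Sunny Mission flat</span>
  <p class="description">Steps from murals, galleries, and great tacos.</p></div>
<div class="listing"><span class="title">Noe Valley home</span>
  <p class="description">Quiet street near boutique shopping.</p></div>
"""

soup = BeautifulSoup(html, "html.parser")
listings = [
    {"title": d.find("span", class_="title").get_text(),
     "description": d.find("p", class_="description").get_text()}
    for d in soup.find_all("div", class_="listing")
]
print(len(listings))  # → 2
```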
I then used various NLP techniques to clean the documents.
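A minimal sketch of that cleaning step is below; the contraction list is a small illustrative subset, and the real pipeline also lemmatized tokens with NLTK (noted in a comment to keep the sketch dependency-free).

```python
# Sketch of the document-prepping step: lowercase, expand contractions,
# strip punctuation/digits, tokenize. Helper name is hypothetical.
import re

CONTRACTIONS = {  # small illustrative subset, not an exhaustive list
    "don't": "do not", "it's": "it is", "you're": "you are", "we're": "we are",
}

def clean_description(text):
    text = text.lower()
    for short, full in CONTRACTIONS.items():
        text = text.replace(short, full)
    text = re.sub(r"[^a-z'\s]", " ", text)  # drop punctuation & digits
    tokens = text.split()
    # ...real pipeline also applies NLTK's WordNetLemmatizer to each token
    return tokens

print(clean_description("It's a cozy loft, 2 blocks from great bars!"))
# → ['it', 'is', 'a', 'cozy', 'loft', 'blocks', 'from', 'great', 'bars']
```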
To model the traits, I vectorized the descriptions & tried a variety of supervised models. What worked best were support vector machines, which are well suited for text classification. I also tested Doc2Vec; however, I found my corpus too small to produce useful results.
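The per-trait setup can be sketched as a TF-IDF-plus-linear-SVM pipeline in scikit-learn; the toy documents, labels, and C value below are illustrative assumptions, not the project's actual data or hyperparameters.

```python
# Sketch of one binary trait classifier: TF-IDF features into a linear SVM.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

docs = [
    "galleries and street art on every corner",
    "quiet block, close to the park",
    "murals, lofts, and an art school nearby",
    "family friendly street near the lake",
]
is_artsy = [1, 0, 1, 0]  # targets from the neighborhood-guide mapping

# High C approximates a hard margin on this tiny separable toy set
artsy_model = make_pipeline(TfidfVectorizer(), LinearSVC(C=10))
artsy_model.fit(docs, is_artsy)
print(artsy_model.predict(["loft next to a gallery and mural tour"]))
```

In practice the project trained one such classifier per trait ('artsy', 'dining', 'shopping', 'nightlife').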
To rank them, I created a custom scoring function. The scores were loosely inspired by Google search’s click-thru rates by position, which tries to solve an analogous relevance problem.
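A ranking function in that spirit might look like this; the CTR-style position weights and the 0.25 preference weight are illustrative assumptions, not the project's actual values.

```python
# Illustrative re-ranking sketch: each listing keeps a position-based weight
# (modeled loosely on published Google CTR-by-position curves), plus a
# bonus/penalty when a predicted trait matches the user's preference
# (+1 like, -1 dislike, 0 neutral). All numbers here are hypothetical.
CTR_BY_POSITION = [0.30, 0.15, 0.10, 0.07, 0.05, 0.04, 0.03, 0.02, 0.02, 0.01]

def score(position, predicted_traits, preferences):
    base = CTR_BY_POSITION[position] if position < len(CTR_BY_POSITION) else 0.01
    bonus = sum(pref for trait, pref in preferences.items()
                if predicted_traits.get(trait))
    return base + 0.25 * bonus

listings = [
    {"id": "A", "position": 0, "traits": {"artsy": 0, "nightlife": 1}},
    {"id": "B", "position": 9, "traits": {"artsy": 1, "nightlife": 0}},
]
prefs = {"artsy": 1, "nightlife": -1}  # likes artsy, avoids nightlife

ranked = sorted(listings,
                key=lambda l: score(l["position"], l["traits"], prefs),
                reverse=True)
print([l["id"] for l in ranked])  # → ['B', 'A'] (the artsy listing bubbles up)
```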
[next slide]
I pulled out a few insights that the various models were able to provide:
From SVM, I saw that infrequent words add value. The models achieved accuracies of ~80%, with a 5-point lift over Naïve Bayes. The big difference is that the Naïve Bayes model used only 2,000 of the 16,000 words found in the descriptions, while SVM used 8,000.
From the Forest model, looking at information gain, I confirmed the intuition that neighborhood names were key predictors of the traits. In addition, we see here the words “bars”, “galleries”, “art” and “loft”, all of which align with our expectations.
From our TF-IDF, I call out that “Airbnb is for foodies”: the words “kitchen” and “restaurants” had document frequencies high enough to rank alongside the stop words, appearing more often than words like “was”, “has”, etc.
[next slide]
If I were to continue my work on Localebnb, I would be interested in:
Scraping more listings across more cities (as we saw, neighborhood names were predictive, but they're city-specific);
Including additional listing information in the models (for example, the amenities or room type);
And making the neighborhood traits more fluid than “yes” or “no”, which could be done by giving partial weight to nearby neighborhoods using graph analytics techniques.
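One simple way that partial weighting might work is to blend a neighborhood's binary trait score with its graph neighbors' scores; the adjacency list, scores, and blend weight below are all hypothetical.

```python
# Hypothetical sketch of "fluid" traits: alpha * own score plus
# (1 - alpha) * mean score of adjacent neighborhoods in the graph.
ADJACENT = {  # toy adjacency list, not real city geography
    "Mission": ["Castro", "SoMa"],
    "Castro": ["Mission"],
    "SoMa": ["Mission"],
}
ARTSY = {"Mission": 1.0, "Castro": 0.0, "SoMa": 0.0}

def fluid_trait(hood, scores, adjacency, alpha=0.7):
    """Blend a neighborhood's own trait score with its neighbors' average."""
    neighbors = adjacency.get(hood, [])
    if not neighbors:
        return scores[hood]
    neighbor_mean = sum(scores[n] for n in neighbors) / len(neighbors)
    return alpha * scores[hood] + (1 - alpha) * neighbor_mean

print(round(fluid_trait("Castro", ARTSY, ADJACENT), 2))  # → 0.3
```

The Castro's hard "not artsy" label softens to 0.3 because it borders the artsy Mission.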
In addition, Airbnb’s content team could leverage this model for neighborhood guide creation or validation.
[next slide]
The app is live and can be seen at Localebnb.co; it's best viewed on desktop.
Thank you.
[End]