SlideShare a Scribd company logo
1 of 19
Localebnb
An Airbnb Contextual Recommender
-G Scott Stukey
(NOTE: best viewed by downloading the PPT)
1
Motivation
When booking a private residence,
how do you find the perfect neighborhood?
2
Problem
No ability to
search or filter
by trait!
3
Airbnb search results
Problem
No ability to
search or filter
by trait!
4
Airbnb search results
Hypothesis
Use Airbnb listing descriptions to predict neighborhood traits &
customize search results to users’ preferences
Why Airbnb should implement this:
1. Increase user satisfaction by increasing relevance
2. Increase booking rate by reducing bounces (click fatigue)
5
Solution
6
Listing Page Neighborhood Guide
Solution
7
Listing Page Neighborhood Guide
Features
Target
8
9
10
11
12
13
14
15
Scraped Search Results
& ListingsETL Scraped Neighborhood Traits
Cleaned Documents
(lemmatization, expand contractions, et al.)Prepping
Modeling Word2Vec /
Doc2Vec
Naïve Bayes Random Forest / GBC
Rank/Sort Implemented Custom Scoring Function
(inspired by Google Search CTR by position)
Methodology
16
Beautiful
Soup
NLTK
Word2Vec
+
SVM
TF-IDF Vectorization
Insights
78-82%
accuracy
5 pt lift
over naïve bayes
17
SVM Forest TF-IDF
Infrequent words
add value
Airbnb
is for foodies
Neighborhood names
dominate feature
importance
‘artsy’ model key words doc frequency
Extensions
• Scrape more descriptions across more cities
• Include additional listing information in models
• Make neighborhood traits more fluid
• Give partial weight to nearby neighborhoods utilizing graph analytics
How Airbnb could benefit:
• Guide creation of neighborhood guides in new cities
18
Thank You
Go to Localebnb.co to try for yourselves.
@gscottstukey
19

More Related Content

Similar to Airbnb Contextual Recommender Predicts Neighborhood Traits

Building Search Systems for the Enterprise
Building Search Systems for the EnterpriseBuilding Search Systems for the Enterprise
Building Search Systems for the EnterpriseYunyao Li
 
Lean UX + DevOps
Lean UX + DevOpsLean UX + DevOps
Lean UX + DevOpsSynerzip
 
Faceted Navigation: (Almost) Everyone is Doing it Wrong
Faceted Navigation: (Almost) Everyone is Doing it WrongFaceted Navigation: (Almost) Everyone is Doing it Wrong
Faceted Navigation: (Almost) Everyone is Doing it WrongBotify
 
richardrodger-vespa-waterford-oct.pdf
richardrodger-vespa-waterford-oct.pdfrichardrodger-vespa-waterford-oct.pdf
richardrodger-vespa-waterford-oct.pdfRichard Rodger
 
2012-Search University 4 - Knewledge-Gerald Claessens & Wouter Schikhof- ...
2012-Search University 4 - Knewledge-Gerald Claessens & Wouter Schikhof- ...2012-Search University 4 - Knewledge-Gerald Claessens & Wouter Schikhof- ...
2012-Search University 4 - Knewledge-Gerald Claessens & Wouter Schikhof- ...Wouter Schikhof
 
Using Compass to Diagnose Performance Problems
Using Compass to Diagnose Performance Problems Using Compass to Diagnose Performance Problems
Using Compass to Diagnose Performance Problems MongoDB
 
Using Compass to Diagnose Performance Problems in Your Cluster
Using Compass to Diagnose Performance Problems in Your ClusterUsing Compass to Diagnose Performance Problems in Your Cluster
Using Compass to Diagnose Performance Problems in Your ClusterMongoDB
 
Competitive SEO Strategies | John Caldwell
Competitive SEO Strategies | John CaldwellCompetitive SEO Strategies | John Caldwell
Competitive SEO Strategies | John CaldwellEnterprise Ireland
 
phrase autocomplete : Context completion auto-suggestor for real estate domain
phrase autocomplete : Context completion auto-suggestor for real estate domainphrase autocomplete : Context completion auto-suggestor for real estate domain
phrase autocomplete : Context completion auto-suggestor for real estate domainDhwaj Raj
 
The PPC Performance Pizza - 8 Powerful Ingredients To Get The Perfect PPC Re...
The PPC Performance Pizza  - 8 Powerful Ingredients To Get The Perfect PPC Re...The PPC Performance Pizza  - 8 Powerful Ingredients To Get The Perfect PPC Re...
The PPC Performance Pizza - 8 Powerful Ingredients To Get The Perfect PPC Re...KlientBoost
 
Think like a developer debugging seo - be wizard 2013 rimini
Think like a developer  debugging seo - be wizard 2013 riminiThink like a developer  debugging seo - be wizard 2013 rimini
Think like a developer debugging seo - be wizard 2013 riminiDavid Sottimano
 
Lessons from SEO split-testing
Lessons from SEO split-testingLessons from SEO split-testing
Lessons from SEO split-testingWill Critchlow
 
SEJ Webinar_How To Supercharge Your Keyword Research with Powerful Topic Clus...
SEJ Webinar_How To Supercharge Your Keyword Research with Powerful Topic Clus...SEJ Webinar_How To Supercharge Your Keyword Research with Powerful Topic Clus...
SEJ Webinar_How To Supercharge Your Keyword Research with Powerful Topic Clus...Search Engine Journal
 
Travel Babble June2012
Travel Babble June2012Travel Babble June2012
Travel Babble June2012Fresh_Egg
 
Behavior Driven Development
Behavior Driven DevelopmentBehavior Driven Development
Behavior Driven DevelopmentNETUserGroupBern
 
Preparing for Peak in Ecommerce | eTail Asia 2020
Preparing for Peak in Ecommerce | eTail Asia 2020Preparing for Peak in Ecommerce | eTail Asia 2020
Preparing for Peak in Ecommerce | eTail Asia 2020Lucidworks
 
Finding Anything: Real-time Search with IndexTank
Finding Anything: Real-time Search with IndexTankFinding Anything: Real-time Search with IndexTank
Finding Anything: Real-time Search with IndexTankYogiWanKenobi
 
Finding Anything: Real-time Search with IndexTank
Finding Anything:  Real-time Search with IndexTankFinding Anything:  Real-time Search with IndexTank
Finding Anything: Real-time Search with IndexTankYogiWanKenobi
 
Geo-Targeted SEO for the Online Retailer
Geo-Targeted SEO for the Online RetailerGeo-Targeted SEO for the Online Retailer
Geo-Targeted SEO for the Online RetailerBenj Arriola
 
Behemoth SEO: Search Strategy for Huge Websites
Behemoth SEO: Search Strategy for Huge WebsitesBehemoth SEO: Search Strategy for Huge Websites
Behemoth SEO: Search Strategy for Huge WebsitesPhilipp Klöckner
 

Similar to Airbnb Contextual Recommender Predicts Neighborhood Traits (20)

Building Search Systems for the Enterprise
Building Search Systems for the EnterpriseBuilding Search Systems for the Enterprise
Building Search Systems for the Enterprise
 
Lean UX + DevOps
Lean UX + DevOpsLean UX + DevOps
Lean UX + DevOps
 
Faceted Navigation: (Almost) Everyone is Doing it Wrong
Faceted Navigation: (Almost) Everyone is Doing it WrongFaceted Navigation: (Almost) Everyone is Doing it Wrong
Faceted Navigation: (Almost) Everyone is Doing it Wrong
 
richardrodger-vespa-waterford-oct.pdf
richardrodger-vespa-waterford-oct.pdfrichardrodger-vespa-waterford-oct.pdf
richardrodger-vespa-waterford-oct.pdf
 
2012-Search University 4 - Knewledge-Gerald Claessens & Wouter Schikhof- ...
2012-Search University 4 - Knewledge-Gerald Claessens & Wouter Schikhof- ...2012-Search University 4 - Knewledge-Gerald Claessens & Wouter Schikhof- ...
2012-Search University 4 - Knewledge-Gerald Claessens & Wouter Schikhof- ...
 
Using Compass to Diagnose Performance Problems
Using Compass to Diagnose Performance Problems Using Compass to Diagnose Performance Problems
Using Compass to Diagnose Performance Problems
 
Using Compass to Diagnose Performance Problems in Your Cluster
Using Compass to Diagnose Performance Problems in Your ClusterUsing Compass to Diagnose Performance Problems in Your Cluster
Using Compass to Diagnose Performance Problems in Your Cluster
 
Competitive SEO Strategies | John Caldwell
Competitive SEO Strategies | John CaldwellCompetitive SEO Strategies | John Caldwell
Competitive SEO Strategies | John Caldwell
 
phrase autocomplete : Context completion auto-suggestor for real estate domain
phrase autocomplete : Context completion auto-suggestor for real estate domainphrase autocomplete : Context completion auto-suggestor for real estate domain
phrase autocomplete : Context completion auto-suggestor for real estate domain
 
The PPC Performance Pizza - 8 Powerful Ingredients To Get The Perfect PPC Re...
The PPC Performance Pizza  - 8 Powerful Ingredients To Get The Perfect PPC Re...The PPC Performance Pizza  - 8 Powerful Ingredients To Get The Perfect PPC Re...
The PPC Performance Pizza - 8 Powerful Ingredients To Get The Perfect PPC Re...
 
Think like a developer debugging seo - be wizard 2013 rimini
Think like a developer  debugging seo - be wizard 2013 riminiThink like a developer  debugging seo - be wizard 2013 rimini
Think like a developer debugging seo - be wizard 2013 rimini
 
Lessons from SEO split-testing
Lessons from SEO split-testingLessons from SEO split-testing
Lessons from SEO split-testing
 
SEJ Webinar_How To Supercharge Your Keyword Research with Powerful Topic Clus...
SEJ Webinar_How To Supercharge Your Keyword Research with Powerful Topic Clus...SEJ Webinar_How To Supercharge Your Keyword Research with Powerful Topic Clus...
SEJ Webinar_How To Supercharge Your Keyword Research with Powerful Topic Clus...
 
Travel Babble June2012
Travel Babble June2012Travel Babble June2012
Travel Babble June2012
 
Behavior Driven Development
Behavior Driven DevelopmentBehavior Driven Development
Behavior Driven Development
 
Preparing for Peak in Ecommerce | eTail Asia 2020
Preparing for Peak in Ecommerce | eTail Asia 2020Preparing for Peak in Ecommerce | eTail Asia 2020
Preparing for Peak in Ecommerce | eTail Asia 2020
 
Finding Anything: Real-time Search with IndexTank
Finding Anything: Real-time Search with IndexTankFinding Anything: Real-time Search with IndexTank
Finding Anything: Real-time Search with IndexTank
 
Finding Anything: Real-time Search with IndexTank
Finding Anything:  Real-time Search with IndexTankFinding Anything:  Real-time Search with IndexTank
Finding Anything: Real-time Search with IndexTank
 
Geo-Targeted SEO for the Online Retailer
Geo-Targeted SEO for the Online RetailerGeo-Targeted SEO for the Online Retailer
Geo-Targeted SEO for the Online Retailer
 
Behemoth SEO: Search Strategy for Huge Websites
Behemoth SEO: Search Strategy for Huge WebsitesBehemoth SEO: Search Strategy for Huge Websites
Behemoth SEO: Search Strategy for Huge Websites
 

Recently uploaded

Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...ttt fff
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
Business Analytics using Microsoft Excel
Business Analytics using Microsoft ExcelBusiness Analytics using Microsoft Excel
Business Analytics using Microsoft Excelysmaelreyes
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一F La
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhThiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhYasamin16
 
MK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxMK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxUnduhUnggah1
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxaleedritatuxx
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样vhwb25kk
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degreeyuu sss
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一F La
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 

Recently uploaded (20)

Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
Business Analytics using Microsoft Excel
Business Analytics using Microsoft ExcelBusiness Analytics using Microsoft Excel
Business Analytics using Microsoft Excel
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhThiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
 
MK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxMK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docx
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 

Airbnb Contextual Recommender Predicts Neighborhood Traits

  • 1. Localebnb An Airbnb Contextual Recommender -G Scott Stukey (NOTE: best viewed by downloading the PPT) 1
  • 2. Motivation When booking a private residence, how do you find the perfect neighborhood? 2
  • 3. Problem No ability to search or filter by trait! 3 Airbnb search results
  • 4. Problem No ability to search or filter by trait! 4 Airbnb search results
  • 5. Hypothesis Use Airbnb listing descriptions to predict neighborhood traits & customize search results to users’ preferences Why Airbnb should implement this: 1. Increase user satisfaction by increasing relevance 2. Increase booking rate by reducing bounces (click fatigue) 5
  • 7. Solution 7 Listing Page Neighborhood Guide Features Target
  • 8. 8
  • 9. 9
  • 10. 10
  • 11. 11
  • 12. 12
  • 13. 13
  • 14. 14
  • 15. 15
  • 16. Scraped Search Results & ListingsETL Scraped Neighborhood Traits Cleaned Documents (lemmatization, expand contractions, et al.)Prepping Modeling Word2Vec / Doc2Vec Naïve Bayes Random Forest / GBC Rank/Sort Implemented Custom Scoring Function (inspired by Google Search CTR by position) Methodology 16 Beautiful Soup NLTK Word2Vec + SVM TF-IDF Vectorization
  • 17. Insights 78-82% accuracy 5 pt lift over naïve bayes 17 SVM Forest TF-IDF Infrequent words add value Airbnb is for foodies Neighborhood names dominate feature importance ‘artsy’ model key words doc frequency
  • 18. Extensions • Scrape more descriptions across more cities • Include additional listing information in models • Make neighborhood traits more fluid • Give partial weight to nearby neighborhoods utilizing graph analytics How Airbnb could benefit: • Guide creation of neighborhood guides in new cities 18
  • 19. Thank You Go to Localebnb.co to try for yourselves. @gscottstukey 19

Editor's Notes

  1. Hello everyone. My name is G Scott Stukey, and I’d love to share with you my project: Localebnb – An Airbnb Contextual Recommender. I’m going to go over the background of my project, dive into using my app, and then share the methodology & insights from this project. [next slide]
  2. The motivation behind the project was driven by the question: “When booking a private residence, how do you find the perfect neighborhood?” [next slide]
  3. The problem I found with Airbnb’s search results is that there’s no ability to directly search or filter by neighborhood trait. [click]
  4. They only have the neighborhood names. Personal Story - when I was trying to book a trip to Montreal, I knew the type of neighborhood I wanted to stay in: somewhere a little more ‘hipster’ with great dining, away from the touristy spots. I ended up having a amazing experience staying at a converted loft in the De Lorimier neighborhood, but only after having to research a multitude of sources. [next slide]
  5. My hypothesis is that, by using Airbnb listing descriptions I could predict the traits of the neighborhood, and then customize the Airbnb default search results to a user’s preference. From a business standpoint, by implementing this Airbnb could increase user satisfaction by making their search results more relevant & potentially increase bookings by reducing bounces that happen from click fatigue. [next slide]
  6. The solution came from bubbling up information from the neighborhood guide, which is currently buried on their site. It contains amazing information about neighborhoods of select cities. [click] I took the listing descriptions as my input features, mapped each listing’s neighborhood to the neighborhood guide, and used the neighborhood traits as my target variables. For Localebnb, I focused on a subset of 4 of the traits – ‘artsy’, ‘dining’, ‘shopping’ & ‘nightlife’. Now, lets dive into the app. [next slide]
  7. I took the listing descriptions as my input features, mapped each listing’s neighborhood to the neighborhood guide, and used the neighborhood traits as my target variables. For Localebnb, I focused on a subset of 4 of the traits – ‘artsy’, ‘dining’, ‘shopping’ & ‘nightlife’. Now, lets dive into the app. [next slide]
  8. This is Localebnb. The home page is a simple search page. [click]
  9. Up top - you’re able to put in your search information. [click]
  10. Down below, you’re able to select your preferences for the various neighborhood traits. Here, the user appears to enjoy artsy & shopping neighborhoods, while avoiding nightlife. When the user clicks search… [next slide]
  11. …the search results appear. The app scraped Airbnb search results for listings, scraped each of those listings, predicted if each listing has a specific trait, then scored & re-sorted the search results based on the user’s preferences. [click]
  12. When a user hovers over a listing, additional information about the listing pops up. [click]
  13. The app also allows the user to change there preferences. Here we see the user increases their preference for dining. [click] When they do this, the app auto-updates its results. The user then found this gem in the Castro. Previously, this listing was at number 10 below the fold. Localebnb helped bubble that listing higher-up on the results page. This is a great example of the app’s benefits. Now lets look at the methodology behind the project. [next slide]
  14. When they do this, the app auto-updates its results. The user then found this gem in the Castro. Previously, this listing was at number 10 below the fold. Localebnb helped bubble that listing higher-up on the results page. This is a great example of the app’s benefits. Now lets look at the methodology behind the project. [next slide]
  15. The user then found this gem in the Castro. Previously, this listing was at number 10 below the fold. Localebnb helped bubble that listing higher-up on the results page. This is a great example of the app’s benefits. Now lets look at the methodology behind the project. [next slide]
  16. Here you can see my project’s pipeline: As I mentioned earlier, I scraped ~4000 listings across SF & NYC and mapped them to the neighborhood traits. I then used various NLP techniques to clean the documents. To model the traits, I vectorized the descriptions & tried a variety of supervised models. What worked best were support vector machines, which is well suited for text classification. I also tested Doc2Vec, however I found my corpus to be too small to have useful results. To rank them, I created a custom scoring function. The scores were loosely inspired by Google search’s click-thru rates by position, which tries to solve an analogous relevance problem. [next slide]
  17. I pulled out a few insights that the various models were able to provide: From SVM, I saw that infrequent words add value. The models achieved accuracies of ~80%, with a 5 point life. The big difference is that the Naïve Bayes model used only 2000 of the words 16000 words found in the descriptions, while SVM used 8000. From the Forest model, looking at information gain I confirmed intuition that the neighborhood names were key predictors of the traits. In addition, we see here the words “bars”, “galleries”, “art” and “loft” – all of which align with our expectations. From our TF-IDF, I call out that “Airbnb is for foodies”. This is because the words “kitchen” and “restaurants” appear amongst the stop words, and were more common than words like “was”, “has”, etc. [next slide]
  18. If I were to continue my work on Localebnb, I would be interested in: Scraping more listings across more cities (as we saw that neighborhood names were predictive, but they’re city specific) Include additional listing information in the models (for example, the amenities or room type) Also, to make the neighborhood traits more fluid that “yes” or “no” – which could be done by giving partial weight to nearby neighborhoods using graph analytics techniques. In addition, Airbnb’s content team could leverage this model for neighborhood guide creation or validation. [next slide]
  19. The app is live and can be seen at Localebnb.co – it’s also best viewed on desktop. Thank you. [End]