SlideShare a Scribd company logo
1 of 41
Download to read offline
Recommendation Architecture
Jeremy Schiff
MLConf 2015
03/27/2015
BEFORE	
   DURING	
  	
   AFTER	
  
DINERS	
  RESTAURANTS	
  
Understanding	
  
&	
  Evolving	
  
A2rac4ng	
  &	
  
Planning	
  
OpenTable: Deliver great experiences at 

every step, based on who you are
Proprietary	
   2	
  
OpenTable in Numbers
• Our network connects diners with more than
32,000 restaurants worldwide.
• Our diners have spent more than $30 billion
at our partner restaurants.
• OpenTable seats more than 16 million diners
each month.
• Every month, OpenTable diners write more
than 450,000 restaurant reviews

3	
  
Recommendations 
>>
Collaborative Filtering
4	
  
So what are recommendations?
5	
  
Building Recommendation Systems
• Importance of A/B
Testing
• Generating
Recommendations
• Recommendation
Explanations


6	
  
What’s the Goal

Minimizing Engineering Time to Improve The
Metric that Matters

• Make it Easy to Measure
• Make it Easy to Iterate
• Reduce Iteration Cycle Times
7	
  
Importance of A/B Testing
• If you don’t measure it,
you can’t improve it

 
 
 

• Metrics Drive Behavior
• Continued Forward
Progress
8	
  
Pick Your Business Metric
Revenue, Conversions 
• OpenTable 
• Amazon 

Engagement
• Netflix
• Pandora
• Spotify
9	
  
Measuring & The Iteration Loop
A/B	
  	
  
Tes4ng	
  
Weeks	
  
Measure	
  
10	
  
Measuring & The Iteration Loop
Op4mize	
  	
  
Models	
  
A/B	
  	
  
Tes4ng	
  
Days	
   Weeks	
  
Predict	
   Measure	
  
11	
  
Measuring & The Iteration Loop
Analyze	
  &	
  	
  
Introspect	
  
Op4mize	
  	
  
Models	
  
A/B	
  	
  
Tes4ng	
  
Hours	
   Days	
   Weeks	
  
Insights	
   Predict	
   Measure	
  
12	
  
Ranking Objectives
Objectives: 
• Training Error
- Minimize Loss Function
§  Often Convex
• Generalization Error
- Precision at K
• A/B Metric
- Conversion / Engagement

13	
  
Training, Generalization, and Online Error
• Training: Train on your specific dataset
- Dealing with Sparseness
• Test/Generalization: How does it generalize
to unseen data?
- Hyper-Parameter Tuning
• Online: How does it perform in the wild
- Model interaction effects between recommend
items (diversity)
Fundamental Differences in Usage

Right now vs. Planning

Cost of Being Wrong

Search vs. Recommendations


15	
  
Recommendation Stack
Query	
  Interpreta4on	
  
Retrieval	
  
Ranking	
  –	
  Item	
  &	
  Explana4on	
  
Index	
  
Building	
  
Context	
  for	
  Query	
  &	
  User	
  	
  
Model	
  
Building	
  
Explana4on	
  
Content	
  
Visualiza4on	
  
Collabora4ve	
  
Filters	
  
Item	
  /	
  User	
  
Metadata	
  
16	
  
Using Context, Frequency & Sentiment
•  Context
- Implicit: Location, Time, Mobile/Web
- Explicit: Query
•  High End Restaurant for Dinner
- Low Frequency, High Sentiment
•  Fast, Mediocre Sushi for Lunch
- High Frequency, Moderate
Sentiment

 

17	
  
How to use this data
• Frequency Data: 
- General: Popularity
- Personalized: Implicit CF
• Sentiment Data: 
- General: Good Experience
- Personalized: Explicit CF
• Good Recommendation
- Use both to drive your Business Metric
18	
  
Ranking
Phase 1: Bootstrap through heuristics

Phase 2: Learn to Rank
•  Many models
- E [ Revenue | Query, Position, Item, User ]
- E [ Engagement | Query, Position, Item, User ]
- Regression, RankSVM, LambdaMart…
•  Modeling Diversity is Important 


19	
  
Training Example
•  Context Free (Collaborative Filtering)
- Train for Content Based and Collaborative Filtering models.
- Create an Ensemble Model
- Perform Hyper-Parameter Tuning for each model
•  With Context (Search)
- Train a model using query (implicit & explicit)
§  Includes Context-Free Model
- Perform Hyper-Parameter Tuning
•  Evaluate Model using A/B
- Change models, objective functions, etc.
Training DataFlow
Collabora4ve	
  Filter	
  
Service	
  
(Real4me)	
  
Collabora4ve	
  Filter	
  
HyperParameter	
  Tuning	
  	
  
(Batch	
  with	
  Spark)	
  
Collabora4ve	
  Filter	
  
Training	
  
(Batch	
  with	
  Spark)	
  
Training DataFlow
Collabora4ve	
  Filter	
  
Service	
  
(Real4me)	
  
Collabora4ve	
  Filter	
  
HyperParameter	
  Tuning	
  	
  
(Batch	
  with	
  Spark)	
  
Collabora4ve	
  Filter	
  
Training	
  
(Batch	
  with	
  Spark)	
  
Search	
  Service	
  
(Real4me)	
  
Search	
  HyperParameter	
  
Tuning	
  	
  
(Batch	
  with	
  Spark)	
  
Search	
  Training	
  
(Batch	
  with	
  Spark)	
  
Training DataFlow
Collabora4ve	
  Filter	
  
Service	
  
(Real4me)	
  
Collabora4ve	
  Filter	
  
HyperParameter	
  Tuning	
  	
  
(Batch	
  with	
  Spark)	
  
Collabora4ve	
  Filter	
  
Training	
  
(Batch	
  with	
  Spark)	
  
Search	
  Service	
  
(Real4me)	
  
Search	
  HyperParameter	
  
Tuning	
  	
  
(Batch	
  with	
  Spark)	
  
Search	
  Training	
  
(Batch	
  with	
  Spark)	
  
User	
  Interac4on	
  Logs	
  
(Ka_a)	
  
A/B	
  Tes4ng	
  Dashboards	
  
Other	
  Services	
  
Compelling Recommendations
24	
  
Recommendation Explanations
•  Amazon

•  Ness
•  Netflix

•  Ness - Social
25	
  
Summarizing Content

• Essential for Mobile
• Balance Utility With Trust?
- Summarize, but surface raw
data

• Example: 
- Initially, read every review
- Later, use average star rating
26	
  
Summarizing Restaurant Attributes
27	
  
Dish Recommendation
• What to try once I have arrived?

28	
  
Edit	
  via	
  the	
  Header	
  &	
  Footer	
  menu	
  in	
  
PowerPoint	
  
29	
  29	
  
Analyzing Review Content
30	
  
The ingredients of a spectacular

dining experience…
31	
  
… and a spectacularly bad one
32	
  
Content Features

Pandora
• Music Genome Project

Natural Language Processing
• Topics & Tags
33	
  
Topic Modeling Methods
We applied two main topic
modeling methods:
• Latent Dirichlet Allocation
(LDA) 
- (Blei et al. 2003)
• Non-negative Matrix
Factorization (NMF) 
- (Aurora et al. 2012)
34	
  
The food was great! I loved the view of the
sailboats.

Bag of Words Model
food	
   great	
   chicken	
   sailboat	
   view	
   service	
  
1	
   1	
   0	
   1	
   1	
   0	
  
35	
  
Topics with NMF using TF-IDF
Word	
  1	
   Word	
  …	
   Word	
  N	
  
Review	
  1	
   0.8	
   0.9	
   0	
  
Review	
  …	
   0.6	
   0	
   0.8	
  
Review	
  N	
   0.9	
   0	
   0.8	
  
Reviews	
  
X	
  
Words	
  
Reviews	
  
X	
  
Topics	
  
Topics	
  
X	
  
Words	
  
36	
  
Describing Restaurants as Topics
Each	
  review	
  for	
  a	
  
given	
  restaurant	
  	
  
has	
  certain	
  topic	
  
distribuCon	
  
Combining	
  them,	
  
we	
  idenCfy	
  the	
  top	
  
topics	
  for	
  that	
  
restaurant.	
  
Topic 01! Topic 02! Topic 03! Topic 04! Topic 05!
Topic 01! Topic 02! Topic 03! Topic 04! Topic 05!
Topic 01! Topic 02! Topic 03! Topic 04! Topic 05!
review	
  1	
  
review	
  2	
  
review	
  N	
  
.	
  
.	
  
.	
  
Topic 01! Topic 02! Topic 03! Topic 04! Topic 05!
Restaurant	
  
37	
  
Examples of Topics
38	
  
Varying Topic By Region
•  San Francisco
•  `

•  London
•  Chicago

•  New York
39	
  
Building Recommendation Systems
• Importance of A/B
Testing
• Generating
Recommendations
• Recommendation
Explanations

40	
  
Thanks!
Jeremy Schiff
jschiff@opentable.com

More Related Content

Similar to Jeremy Schiff, Senior Manager, Data Science, OpenTable at MLconf NYC

Recommendation Architecture - OpenTable - RecSys 2014 - Large Scale Recommend...
Recommendation Architecture - OpenTable - RecSys 2014 - Large Scale Recommend...Recommendation Architecture - OpenTable - RecSys 2014 - Large Scale Recommend...
Recommendation Architecture - OpenTable - RecSys 2014 - Large Scale Recommend...Jeremy Schiff
 
RecSys 2015 - Unifying the Problem of Search and Recommendations at OpenTable
RecSys 2015 - Unifying the Problem of Search and Recommendations at OpenTableRecSys 2015 - Unifying the Problem of Search and Recommendations at OpenTable
RecSys 2015 - Unifying the Problem of Search and Recommendations at OpenTableJeremy Schiff
 
GraphTour: De-cyphering recipes with Neo4j
GraphTour: De-cyphering recipes with Neo4jGraphTour: De-cyphering recipes with Neo4j
GraphTour: De-cyphering recipes with Neo4jNeo4j
 
Modern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in MendeleyModern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in MendeleyMaya Hristakeva
 
Modern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in MendeleyModern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in MendeleyKris Jack
 
Talis Aspire Update - Keji Adedeji | Talis Insight Europe 2016
Talis Aspire Update - Keji Adedeji | Talis Insight Europe 2016Talis Aspire Update - Keji Adedeji | Talis Insight Europe 2016
Talis Aspire Update - Keji Adedeji | Talis Insight Europe 2016Talis
 
Website Best Practices for the High Tech Marketer
Website Best Practices for the High Tech MarketerWebsite Best Practices for the High Tech Marketer
Website Best Practices for the High Tech Marketeredynamic
 
Making your site easier to use, an in-house usability testing case study
Making your site easier to use, an in-house usability testing case studyMaking your site easier to use, an in-house usability testing case study
Making your site easier to use, an in-house usability testing case studyJason Samuels
 
UX Webinar: Always Be Testing
UX Webinar: Always Be TestingUX Webinar: Always Be Testing
UX Webinar: Always Be TestingCharity Dynamics
 
Lets eat presentation_final_20160521
Lets eat presentation_final_20160521Lets eat presentation_final_20160521
Lets eat presentation_final_20160521Lesley Chapman
 
SEO Get up to speed - Williton
SEO  Get up to speed - WillitonSEO  Get up to speed - Williton
SEO Get up to speed - WillitonGet up to Speed
 
Startupmetrics4pirates dave mcclure
Startupmetrics4pirates dave mcclureStartupmetrics4pirates dave mcclure
Startupmetrics4pirates dave mcclureYounghwan Cheon
 
Don't drive your Race car on a dirt track!! - Athresh Krishnappa, Scrum Banga...
Don't drive your Race car on a dirt track!! - Athresh Krishnappa, Scrum Banga...Don't drive your Race car on a dirt track!! - Athresh Krishnappa, Scrum Banga...
Don't drive your Race car on a dirt track!! - Athresh Krishnappa, Scrum Banga...Scrum Bangalore
 
Embrace and Beyond Mobility: Design for the Ideal Dining Experience | 拥抱和超越移...
Embrace and Beyond Mobility:  Design for the Ideal Dining Experience | 拥抱和超越移...Embrace and Beyond Mobility:  Design for the Ideal Dining Experience | 拥抱和超越移...
Embrace and Beyond Mobility: Design for the Ideal Dining Experience | 拥抱和超越移...UX Consulting Pte Ltd
 
Measuring the Quality of Online Service - Jinyoung kim
Measuring the Quality of Online Service - Jinyoung kimMeasuring the Quality of Online Service - Jinyoung kim
Measuring the Quality of Online Service - Jinyoung kimJin Young Kim
 
[UPDATE] Udacity webinar on Recommendation Systems
[UPDATE] Udacity webinar on Recommendation Systems[UPDATE] Udacity webinar on Recommendation Systems
[UPDATE] Udacity webinar on Recommendation SystemsAxel de Romblay
 
Getting Design in Your Company's DNA
Getting Design in Your Company's DNAGetting Design in Your Company's DNA
Getting Design in Your Company's DNABrian Sullivan
 
digital marketing company in pcmc
digital marketing company in pcmcdigital marketing company in pcmc
digital marketing company in pcmcjiyapatsing
 

Similar to Jeremy Schiff, Senior Manager, Data Science, OpenTable at MLconf NYC (20)

Recommendation Architecture - OpenTable - RecSys 2014 - Large Scale Recommend...
Recommendation Architecture - OpenTable - RecSys 2014 - Large Scale Recommend...Recommendation Architecture - OpenTable - RecSys 2014 - Large Scale Recommend...
Recommendation Architecture - OpenTable - RecSys 2014 - Large Scale Recommend...
 
RecSys 2015 - Unifying the Problem of Search and Recommendations at OpenTable
RecSys 2015 - Unifying the Problem of Search and Recommendations at OpenTableRecSys 2015 - Unifying the Problem of Search and Recommendations at OpenTable
RecSys 2015 - Unifying the Problem of Search and Recommendations at OpenTable
 
GraphTour: De-cyphering recipes with Neo4j
GraphTour: De-cyphering recipes with Neo4jGraphTour: De-cyphering recipes with Neo4j
GraphTour: De-cyphering recipes with Neo4j
 
Modern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in MendeleyModern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in Mendeley
 
Modern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in MendeleyModern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in Mendeley
 
Talis Aspire Update - Keji Adedeji | Talis Insight Europe 2016
Talis Aspire Update - Keji Adedeji | Talis Insight Europe 2016Talis Aspire Update - Keji Adedeji | Talis Insight Europe 2016
Talis Aspire Update - Keji Adedeji | Talis Insight Europe 2016
 
Edge Benchmarks
Edge BenchmarksEdge Benchmarks
Edge Benchmarks
 
Website Best Practices for the High Tech Marketer
Website Best Practices for the High Tech MarketerWebsite Best Practices for the High Tech Marketer
Website Best Practices for the High Tech Marketer
 
Making your site easier to use, an in-house usability testing case study
Making your site easier to use, an in-house usability testing case studyMaking your site easier to use, an in-house usability testing case study
Making your site easier to use, an in-house usability testing case study
 
UX Webinar: Always Be Testing
UX Webinar: Always Be TestingUX Webinar: Always Be Testing
UX Webinar: Always Be Testing
 
Omnyscope e245 march 2014 final
Omnyscope e245 march 2014 finalOmnyscope e245 march 2014 final
Omnyscope e245 march 2014 final
 
Lets eat presentation_final_20160521
Lets eat presentation_final_20160521Lets eat presentation_final_20160521
Lets eat presentation_final_20160521
 
SEO Get up to speed - Williton
SEO  Get up to speed - WillitonSEO  Get up to speed - Williton
SEO Get up to speed - Williton
 
Startupmetrics4pirates dave mcclure
Startupmetrics4pirates dave mcclureStartupmetrics4pirates dave mcclure
Startupmetrics4pirates dave mcclure
 
Don't drive your Race car on a dirt track!! - Athresh Krishnappa, Scrum Banga...
Don't drive your Race car on a dirt track!! - Athresh Krishnappa, Scrum Banga...Don't drive your Race car on a dirt track!! - Athresh Krishnappa, Scrum Banga...
Don't drive your Race car on a dirt track!! - Athresh Krishnappa, Scrum Banga...
 
Embrace and Beyond Mobility: Design for the Ideal Dining Experience | 拥抱和超越移...
Embrace and Beyond Mobility:  Design for the Ideal Dining Experience | 拥抱和超越移...Embrace and Beyond Mobility:  Design for the Ideal Dining Experience | 拥抱和超越移...
Embrace and Beyond Mobility: Design for the Ideal Dining Experience | 拥抱和超越移...
 
Measuring the Quality of Online Service - Jinyoung kim
Measuring the Quality of Online Service - Jinyoung kimMeasuring the Quality of Online Service - Jinyoung kim
Measuring the Quality of Online Service - Jinyoung kim
 
[UPDATE] Udacity webinar on Recommendation Systems
[UPDATE] Udacity webinar on Recommendation Systems[UPDATE] Udacity webinar on Recommendation Systems
[UPDATE] Udacity webinar on Recommendation Systems
 
Getting Design in Your Company's DNA
Getting Design in Your Company's DNAGetting Design in Your Company's DNA
Getting Design in Your Company's DNA
 
digital marketing company in pcmc
digital marketing company in pcmcdigital marketing company in pcmc
digital marketing company in pcmc
 

More from MLconf

Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...MLconf
 
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingTed Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingMLconf
 
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...MLconf
 
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushIgor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushMLconf
 
Josh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceJosh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceMLconf
 
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...MLconf
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...MLconf
 
Meghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMeghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMLconf
 
Noam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionNoam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionMLconf
 
June Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLJune Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLMLconf
 
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksSneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksMLconf
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...MLconf
 
Vito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldVito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldMLconf
 
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...MLconf
 
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...MLconf
 
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...MLconf
 
Neel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeNeel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeMLconf
 
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...MLconf
 
Soumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareSoumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareMLconf
 
Roy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesRoy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesMLconf
 

More from MLconf (20)

Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
 
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingTed Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
 
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
 
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushIgor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
 
Josh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceJosh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious Experience
 
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
 
Meghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMeghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the Cheap
 
Noam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionNoam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data Collection
 
June Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLJune Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of ML
 
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksSneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
 
Vito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldVito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI World
 
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
 
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
 
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
 
Neel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeNeel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to code
 
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
 
Soumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareSoumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better Software
 
Roy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesRoy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime Changes
 

Recently uploaded

Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 

Recently uploaded (20)

Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 

Jeremy Schiff, Senior Manager, Data Science, OpenTable at MLconf NYC

  • 2. BEFORE   DURING     AFTER   DINERS  RESTAURANTS   Understanding   &  Evolving   A2rac4ng  &   Planning   OpenTable: Deliver great experiences at 
 every step, based on who you are Proprietary   2  
  • 3. OpenTable in Numbers • Our network connects diners with more than 32,000 restaurants worldwide. • Our diners have spent more than $30 billion at our partner restaurants. • OpenTable seats more than 16 million diners each month. • Every month, OpenTable diners write more than 450,000 restaurant reviews 3  
  • 5. So what are recommendations? 5  
  • 6. Building Recommendation Systems • Importance of A/B Testing • Generating Recommendations • Recommendation Explanations 6  
  • 7. What’s the Goal Minimizing Engineering Time to Improve The Metric that Matters • Make it Easy to Measure • Make it Easy to Iterate • Reduce Iteration Cycle Times 7  
  • 8. Importance of A/B Testing • If you don’t measure it, you can’t improve it • Metrics Drive Behavior • Continued Forward Progress 8  
  • 9. Pick Your Business Metric Revenue, Conversions • OpenTable • Amazon Engagement • Netflix • Pandora • Spotify 9  
  • 10. Measuring & The Iteration Loop A/B     Tes4ng   Weeks   Measure   10  
  • 11. Measuring & The Iteration Loop Op4mize     Models   A/B     Tes4ng   Days   Weeks   Predict   Measure   11  
  • 12. Measuring & The Iteration Loop Analyze  &     Introspect   Op4mize     Models   A/B     Tes4ng   Hours   Days   Weeks   Insights   Predict   Measure   12  
  • 13. Ranking Objectives Objectives: • Training Error - Minimize Loss Function §  Often Convex • Generalization Error - Precision at K • A/B Metric - Conversion / Engagement 13  
  • 14. Training, Generalization, and Online Error • Training: Train on your specific dataset - Dealing with Sparseness • Test/Generalization: How does it generalize to unseen data? - Hyper-Parameter Tuning • Online: How does it perform in the wild - Model interaction effects between recommend items (diversity)
  • 15. Fundamental Differences in Usage Right now vs. Planning Cost of Being Wrong Search vs. Recommendations 15  
  • 16. Recommendation Stack Query  Interpreta4on   Retrieval   Ranking  –  Item  &  Explana4on   Index   Building   Context  for  Query  &  User     Model   Building   Explana4on   Content   Visualiza4on   Collabora4ve   Filters   Item  /  User   Metadata   16  
  • 17. Using Context, Frequency & Sentiment •  Context - Implicit: Location, Time, Mobile/Web - Explicit: Query •  High End Restaurant for Dinner - Low Frequency, High Sentiment •  Fast, Mediocre Sushi for Lunch - High Frequency, Moderate Sentiment 17  
  • 18. How to use this data • Frequency Data: - General: Popularity - Personalized: Implicit CF • Sentiment Data: - General: Good Experience - Personalized: Explicit CF • Good Recommendation - Use both to drive your Business Metric 18  
  • 19. Ranking Phase 1: Bootstrap through heuristics Phase 2: Learn to Rank •  Many models - E [ Revenue | Query, Position, Item, User ] - E [ Engagement | Query, Position, Item, User ] - Regression, RankSVM, LambdaMart… •  Modeling Diversity is Important 19  
  • 20. Training Example •  Context Free (Collaborative Filtering) - Train for Content Based and Collaborative Filtering models. - Create an Ensemble Model - Perform Hyper-Parameter Tuning for each model •  With Context (Search) - Train a model using query (implicit & explicit) §  Includes Context-Free Model - Perform Hyper-Parameter Tuning •  Evaluate Model using A/B - Change models, objective functions, etc.
  • 21. Training DataFlow Collabora4ve  Filter   Service   (Real4me)   Collabora4ve  Filter   HyperParameter  Tuning     (Batch  with  Spark)   Collabora4ve  Filter   Training   (Batch  with  Spark)  
  • 22. Training DataFlow Collabora4ve  Filter   Service   (Real4me)   Collabora4ve  Filter   HyperParameter  Tuning     (Batch  with  Spark)   Collabora4ve  Filter   Training   (Batch  with  Spark)   Search  Service   (Real4me)   Search  HyperParameter   Tuning     (Batch  with  Spark)   Search  Training   (Batch  with  Spark)  
  • 23. Training DataFlow Collabora4ve  Filter   Service   (Real4me)   Collabora4ve  Filter   HyperParameter  Tuning     (Batch  with  Spark)   Collabora4ve  Filter   Training   (Batch  with  Spark)   Search  Service   (Real4me)   Search  HyperParameter   Tuning     (Batch  with  Spark)   Search  Training   (Batch  with  Spark)   User  Interac4on  Logs   (Ka_a)   A/B  Tes4ng  Dashboards   Other  Services  
  • 25. Recommendation Explanations •  Amazon •  Ness •  Netflix •  Ness - Social 25  
  • 26. Summarizing Content • Essential for Mobile • Balance Utility With Trust? - Summarize, but surface raw data • Example: - Initially, read every review - Later, use average star rating 26  
  • 28. Dish Recommendation • What to try once I have arrived? 28  
  • 29. Edit  via  the  Header  &  Footer  menu  in   PowerPoint   29  29  
  • 31. The ingredients of a spectacular
 dining experience… 31  
  • 32. … and a spectacularly bad one 32  
  • 33. Content Features Pandora • Music Genome Project Natural Language Processing • Topics & Tags 33  
  • 34. Topic Modeling Methods We applied two main topic modeling methods: • Latent Dirichlet Allocation (LDA) - (Blei et al. 2003) • Non-negative Matrix Factorization (NMF) - (Aurora et al. 2012) 34  
  • 35. The food was great! I loved the view of the sailboats. Bag of Words Model food   great   chicken   sailboat   view   service   1   1   0   1   1   0   35  
  • 36. Topics with NMF using TF-IDF Word  1   Word  …   Word  N   Review  1   0.8   0.9   0   Review  …   0.6   0   0.8   Review  N   0.9   0   0.8   Reviews   X   Words   Reviews   X   Topics   Topics   X   Words   36  
  • 37. Describing Restaurants as Topics Each  review  for  a   given  restaurant     has  certain  topic   distribuCon   Combining  them,   we  idenCfy  the  top   topics  for  that   restaurant.   Topic 01! Topic 02! Topic 03! Topic 04! Topic 05! Topic 01! Topic 02! Topic 03! Topic 04! Topic 05! Topic 01! Topic 02! Topic 03! Topic 04! Topic 05! review  1   review  2   review  N   .   .   .   Topic 01! Topic 02! Topic 03! Topic 04! Topic 05! Restaurant   37  
  • 39. Varying Topic By Region •  San Francisco •  ` •  London •  Chicago •  New York 39  
  • 40. Building Recommendation Systems • Importance of A/B Testing • Generating Recommendations • Recommendation Explanations 40