SlideShare a Scribd company logo
1 of 23
Better Contextual Suggestions 
by Applying Domain Knowledge 
Thaer Samar, Alejandro Bellogin, 
Arjen P. de Vries
Contextual Suggestions 
 Given a user profile and a context, make 
suggestions 
 AKA Context-aware Recommendation, zero-query 
Information Retrieval, …
“Entertain me” 
 Recommend “things to do”, where 
 User profile consists of opinions about attractions 
 Context consists of a specific geo-location
My Profile
My Context
My Suggestions 
WORM 
Poortgebouw
TREC Contextual Suggestions (1/3) 
 Given a user profile 
 70 – 100 POIs represented by a title, description 
and URL (situated in Chicago / Santa Fe) 
 Rated on a scale 0 – 4 
125, Adler Planetarium & Astronomy Museum, ''Interactive 
exhibits & high-tech sky shows entertain stargazers -- 
lakefront views are a bonus.'', 
http://www.adlerplanetarium.org/ 
131,Lincoln Park Zoo,"Lincoln Park Zoo is a free 35-acre zoo 
located in Lincoln Park in Chicago, Illinois. The zoo was 
founded in 1868, making it one of the oldest zoos in the 
U.S. It is also one of a few free admission zoos in the 
United States.", http://www.lpzoo.org/ 
700, 125, 4, 4 
700, 131, 0, 1
TREC Contextual Suggestions (2/3) 
 … and a context 
 Corresponding to a metropolitan area in the USA, 
e.g., 109, Kalamazoo, MI
TREC Contextual Suggestions (3/3) 
 Suggest Web pages / snippets 
 From the Open Web, or from ClueWeb 
700, 109 ,1,"About KIA History Kalamazoo 
Institute of Arts KIA History","The Kalamazoo Institute 
of Arts is a nonprofit art museum and school. Since , the 
institute has offered art classes and free admission 
programming, including exhibitions, lectures, events, 
activities and a permanent collection. The KIAs mission 
is to cultivate the creation and appreciation of the visual 
arts for the communities",clueweb12-1811wb-14-09165
Approach 
 For a given location, select candidate web 
pages from Clueweb 
 Rank the candidates based on their cosine-similarity 
to the POIs in the user profile 
(separated in a positive and a negative profile)
Snippet Generation 
 Generate POI title: 
 Extract <title> or <header> tags 
 Generate personalized POI description: 
 Extract <description> tag 
 Break documents into sentences, ranked on their 
similarity with the user profile 
 Concatenate until 512 bytes reached
Candidate selection 
 In 2013, the CWI Clueweb based run ranked 
far below all other (Open Web) runs 
 A few issues related to evaluation, see our ECIR 
2014 short paper 
 But, also, the commercial Open Web search 
engines (Google, Bing or Yahoo!) return much 
better candidates for queries derived from the 
context than we did
Geo-Filtering 
 Exact mention of given context 
 Format: {City, ST} e.g., Miami, FL 
 Exclude documents that mention multiple 
contexts 
 E.g., a Wikipedia page about cities in Florida state
Domain Knowledge (1/2) 
 Point-of-Interest heuristic: 
 POIs will be represented on the major tourist 
information sites 
{yelp, tripadvisor, wikitravel, zagat, 
xpedia, orbitz, and travel.yahoo} 
 Extract the Clueweb documents from these 
domains (TouristListFiltered) 
 E.g., http://www.zagat.com/miami 
 Expand with the outlinks also contained in 
ClueWeb12 (TouristOutlinksFiltered)
Domain Knowledge (2/2) 
 Use Foursquare API to identify the URLs of POIs for 
the given context 
50 attractions per context 
Format: attraction name, URL 
e.g., Cortés Restaurant, http://cortesrestaurant.com 
Miami, FL 
 If the POI has no corresponding URL, use Google API with 
a query using foursquare POI + context, i.e., “Cortés 
Restaurant Miami, FL” 
 Extract any document from Clueweb whose host 
matches the (1,454 unique) hosts of the URLs 
identified (AttractionFiltered)
Candidate Selection 
ClueWeb12 
733,019,372 web pages 
“City, ST” 
8,883,068 docs 
TouristListFiltered (175,260) 
TouristOutlinksFiltered (97,678) 
AttractionsFiltered (102,604) 
GeoFiltered 
TouristFiltered
Overall Results 
 Note: 
TouristFiltered 
>> 
GeoFiltered 
 P@5 and MRR consider three dimensions of relevance: 
geographical (geo), description (desc) and document (doc) 
relevance
TouristFiltered vs. GeoFiltered 
(I.e., TouristFiltered suggests 
better POIs for 33.1% of the 
judged topics) 
% topics
Decompose metrics 
 Ignoring geo-relevance: 
GeoFiltered ~ 
TouristFiltered
Decompose metrics 
 Ignore geo-relevance: 
 Geo-relevance only: 
The two runs have 
almost similar 
performance in the 
desc and doc 
dimensions 
TouristFiltered is 
more 
geographically 
appropriate
Type of domain knowledge 
 TouristFiltered consists of three parts: 
 TouristListFiltered (TLF) 
 TouristOutlinksFiltered (TOF) 
 AttractionFiltered (AF) 
Foursquare gives the 
most significant 
improvement in 
performance
Conclusions 
 Domain knowledge about sites that are more 
likely to offer attractions lead to better 
suggestions 
 The best results were obtained when 
identifying attractions through specialized 
services such as Foursquare
Next Steps 
 Improve our recommendation algorithm 
 E.g., weighted candidate selection 
 Understand the remaining difference with 
Open Web based results 
 Our Clueweb results are reproduceable but not 
yet as good

More Related Content

Similar to Better Contextual Suggestions by Applying Domain Knowledge

OLLI Workshop : Beyond The Basics of Google Searching April 2009
OLLI Workshop : Beyond The Basics of Google Searching April 2009OLLI Workshop : Beyond The Basics of Google Searching April 2009
OLLI Workshop : Beyond The Basics of Google Searching April 2009
sheila lafferty
 
4 5 enhancing visibility seo [day 1-module 4- day2-module1]
4 5 enhancing visibility seo [day 1-module 4- day2-module1]4 5 enhancing visibility seo [day 1-module 4- day2-module1]
4 5 enhancing visibility seo [day 1-module 4- day2-module1]
Rania Alahmad
 
Data Feed SEO for Affiliates by Will Critchlow
Data Feed SEO for Affiliates by Will CritchlowData Feed SEO for Affiliates by Will Critchlow
Data Feed SEO for Affiliates by Will Critchlow
auexpo Conference
 
Trekking through the world of information
Trekking through the world of informationTrekking through the world of information
Trekking through the world of information
Kristin Hokanson
 

Similar to Better Contextual Suggestions by Applying Domain Knowledge (20)

Synchronicity: Just-In-Time Discovery of Lost Web Pages
Synchronicity: Just-In-Time Discovery of Lost Web PagesSynchronicity: Just-In-Time Discovery of Lost Web Pages
Synchronicity: Just-In-Time Discovery of Lost Web Pages
 
Liquid Query: Multi-domain Exploratory Search on the Web
Liquid Query: Multi-domain Exploratory Search on the WebLiquid Query: Multi-domain Exploratory Search on the Web
Liquid Query: Multi-domain Exploratory Search on the Web
 
Evolutionary & Swarm Computing for the Semantic Web
Evolutionary & Swarm Computing for the Semantic WebEvolutionary & Swarm Computing for the Semantic Web
Evolutionary & Swarm Computing for the Semantic Web
 
Search Analytics: Conversations with Your Customers
Search Analytics: Conversations with Your CustomersSearch Analytics: Conversations with Your Customers
Search Analytics: Conversations with Your Customers
 
OLLI Workshop : Beyond The Basics of Google Searching April 2009
OLLI Workshop : Beyond The Basics of Google Searching April 2009OLLI Workshop : Beyond The Basics of Google Searching April 2009
OLLI Workshop : Beyond The Basics of Google Searching April 2009
 
Linked Data and Locah, UKSG2011
Linked Data and Locah, UKSG2011 Linked Data and Locah, UKSG2011
Linked Data and Locah, UKSG2011
 
Using Search Analytics to Diagnose What’s Ailing your Information Architecture
Using Search Analytics to Diagnose What’s Ailing your Information ArchitectureUsing Search Analytics to Diagnose What’s Ailing your Information Architecture
Using Search Analytics to Diagnose What’s Ailing your Information Architecture
 
4 5 enhancing visibility seo [day 1-module 4- day2-module1]
4 5 enhancing visibility seo [day 1-module 4- day2-module1]4 5 enhancing visibility seo [day 1-module 4- day2-module1]
4 5 enhancing visibility seo [day 1-module 4- day2-module1]
 
4 5 enhancing visibility seo [day 1-module 4- day2-module1]
4 5 enhancing visibility seo [day 1-module 4- day2-module1]4 5 enhancing visibility seo [day 1-module 4- day2-module1]
4 5 enhancing visibility seo [day 1-module 4- day2-module1]
 
BBC Linked Data Platform (SemTechBiz San Fran 2013)
BBC Linked Data Platform (SemTechBiz San Fran 2013)BBC Linked Data Platform (SemTechBiz San Fran 2013)
BBC Linked Data Platform (SemTechBiz San Fran 2013)
 
Lifting the Lid on Linked Data
Lifting the Lid on Linked DataLifting the Lid on Linked Data
Lifting the Lid on Linked Data
 
(Re-)Discovering Lost Web Pages
(Re-)Discovering Lost Web Pages(Re-)Discovering Lost Web Pages
(Re-)Discovering Lost Web Pages
 
Research power point for students
Research power point for studentsResearch power point for students
Research power point for students
 
Linked Data in Use: Schema.org, JSON-LD and hypermedia APIs - Front in Bahia...
Linked Data in Use: Schema.org, JSON-LD and hypermedia APIs  - Front in Bahia...Linked Data in Use: Schema.org, JSON-LD and hypermedia APIs  - Front in Bahia...
Linked Data in Use: Schema.org, JSON-LD and hypermedia APIs - Front in Bahia...
 
Data Feed SEO for Affiliates by Will Critchlow
Data Feed SEO for Affiliates by Will CritchlowData Feed SEO for Affiliates by Will Critchlow
Data Feed SEO for Affiliates by Will Critchlow
 
Trekking through the world of information
Trekking through the world of informationTrekking through the world of information
Trekking through the world of information
 
Hypermedia APIs and HATEOAS
Hypermedia APIs and HATEOASHypermedia APIs and HATEOAS
Hypermedia APIs and HATEOAS
 
Coding against the Office Graph
Coding against the Office GraphCoding against the Office Graph
Coding against the Office Graph
 
Enrich Search User Experience Using Amazon CloudSearch (SVC302) | AWS re:Inve...
Enrich Search User Experience Using Amazon CloudSearch (SVC302) | AWS re:Inve...Enrich Search User Experience Using Amazon CloudSearch (SVC302) | AWS re:Inve...
Enrich Search User Experience Using Amazon CloudSearch (SVC302) | AWS re:Inve...
 
Understanding Seo At A Glance
Understanding Seo At A GlanceUnderstanding Seo At A Glance
Understanding Seo At A Glance
 

More from Arjen de Vries

The personal search engine
The personal search engineThe personal search engine
The personal search engine
Arjen de Vries
 
ESSIR 2013 - IR and Social Media
ESSIR 2013 - IR and Social MediaESSIR 2013 - IR and Social Media
ESSIR 2013 - IR and Social Media
Arjen de Vries
 
Looking beyond plain text for document representation in the enterprise
Looking beyond plain text for document representation in the enterpriseLooking beyond plain text for document representation in the enterprise
Looking beyond plain text for document representation in the enterprise
Arjen de Vries
 

More from Arjen de Vries (20)

Doing a PhD @ DOSSIER
Doing a PhD @ DOSSIERDoing a PhD @ DOSSIER
Doing a PhD @ DOSSIER
 
Masterclass Big Data (leerlingen)
Masterclass Big Data (leerlingen) Masterclass Big Data (leerlingen)
Masterclass Big Data (leerlingen)
 
Beverwedstrijd Big Data (klas 3/4/5/6)
Beverwedstrijd Big Data (klas 3/4/5/6) Beverwedstrijd Big Data (klas 3/4/5/6)
Beverwedstrijd Big Data (klas 3/4/5/6)
 
Beverwedstrijd Big Data (groep 5/6 en klas 1/2)
Beverwedstrijd Big Data (groep 5/6 en klas 1/2)Beverwedstrijd Big Data (groep 5/6 en klas 1/2)
Beverwedstrijd Big Data (groep 5/6 en klas 1/2)
 
Web Archives and the dream of the Personal Search Engine
Web Archives and the dream of the Personal Search EngineWeb Archives and the dream of the Personal Search Engine
Web Archives and the dream of the Personal Search Engine
 
Information Retrieval and Social Media
Information Retrieval and Social MediaInformation Retrieval and Social Media
Information Retrieval and Social Media
 
Information Retrieval intro TMM
Information Retrieval intro TMMInformation Retrieval intro TMM
Information Retrieval intro TMM
 
ACM SIGIR 2017 - Opening - PC Chairs
ACM SIGIR 2017 - Opening - PC ChairsACM SIGIR 2017 - Opening - PC Chairs
ACM SIGIR 2017 - Opening - PC Chairs
 
Data Science Master Specialisation
Data Science Master SpecialisationData Science Master Specialisation
Data Science Master Specialisation
 
PUC Masterclass Big Data
PUC Masterclass Big DataPUC Masterclass Big Data
PUC Masterclass Big Data
 
Bigdata processing with Spark - part II
Bigdata processing with Spark - part IIBigdata processing with Spark - part II
Bigdata processing with Spark - part II
 
Bigdata processing with Spark
Bigdata processing with SparkBigdata processing with Spark
Bigdata processing with Spark
 
TREC 2016: Looking Forward Panel
TREC 2016: Looking Forward PanelTREC 2016: Looking Forward Panel
TREC 2016: Looking Forward Panel
 
The personal search engine
The personal search engineThe personal search engine
The personal search engine
 
Models for Information Retrieval and Recommendation
Models for Information Retrieval and RecommendationModels for Information Retrieval and Recommendation
Models for Information Retrieval and Recommendation
 
Similarity & Recommendation - CWI Scientific Meeting - Sep 27th, 2013
Similarity & Recommendation - CWI Scientific Meeting - Sep 27th, 2013Similarity & Recommendation - CWI Scientific Meeting - Sep 27th, 2013
Similarity & Recommendation - CWI Scientific Meeting - Sep 27th, 2013
 
ESSIR 2013 - IR and Social Media
ESSIR 2013 - IR and Social MediaESSIR 2013 - IR and Social Media
ESSIR 2013 - IR and Social Media
 
Looking beyond plain text for document representation in the enterprise
Looking beyond plain text for document representation in the enterpriseLooking beyond plain text for document representation in the enterprise
Looking beyond plain text for document representation in the enterprise
 
Recommendation and Information Retrieval: Two Sides of the Same Coin?
Recommendation and Information Retrieval: Two Sides of the Same Coin?Recommendation and Information Retrieval: Two Sides of the Same Coin?
Recommendation and Information Retrieval: Two Sides of the Same Coin?
 
Searching Political Data by Strategy
Searching Political Data by StrategySearching Political Data by Strategy
Searching Political Data by Strategy
 

Recently uploaded

Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptx
Silpa
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
Silpa
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
MohamedFarag457087
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
levieagacer
 

Recently uploaded (20)

Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Velocity and Acceleration PowerPoint.ppt
Velocity and Acceleration PowerPoint.pptVelocity and Acceleration PowerPoint.ppt
Velocity and Acceleration PowerPoint.ppt
 
Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspects
 
An introduction on sequence tagged site mapping
An introduction on sequence tagged site mappingAn introduction on sequence tagged site mapping
An introduction on sequence tagged site mapping
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
 
Exploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdfExploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdf
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptx
 
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICEPATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
 
Introduction of DNA analysis in Forensic's .pptx
Introduction of DNA analysis in Forensic's .pptxIntroduction of DNA analysis in Forensic's .pptx
Introduction of DNA analysis in Forensic's .pptx
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
Stages in the normal growth curve
Stages in the normal growth curveStages in the normal growth curve
Stages in the normal growth curve
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 

Better Contextual Suggestions by Applying Domain Knowledge

  • 1. Better Contextual Suggestions by Applying Domain Knowledge Thaer Samar, Alejandro Bellogin, Arjen P. de Vries
  • 2. Contextual Suggestions  Given a user profile and a context, make suggestions  AKA Context-aware Recommendation, zero-query Information Retrieval, …
  • 3. “Entertain me”  Recommend “things to do”, where  User profile consists of opinions about attractions  Context consists of a specific geo-location
  • 6. My Suggestions WORM Poortgebouw
  • 7. TREC Contextual Suggestions (1/3)  Given a user profile  70 – 100 POIs represented by a title, description and URL (situated in Chicago / Santa Fe)  Rated on a scale 0 – 4 125, Adler Planetarium & Astronomy Museum, ''Interactive exhibits & high-tech sky shows entertain stargazers -- lakefront views are a bonus.'', http://www.adlerplanetarium.org/ 131,Lincoln Park Zoo,"Lincoln Park Zoo is a free 35-acre zoo located in Lincoln Park in Chicago, Illinois. The zoo was founded in 1868, making it one of the oldest zoos in the U.S. It is also one of a few free admission zoos in the United States.", http://www.lpzoo.org/ 700, 125, 4, 4 700, 131, 0, 1
  • 8. TREC Contextual Suggestions (2/3)  … and a context  Corresponding to a metropolitan area in the USA, e.g., 109, Kalamazoo, MI
  • 9. TREC Contextual Suggestions (3/3)  Suggest Web pages / snippets  From the Open Web, or from ClueWeb 700, 109 ,1,"About KIA History Kalamazoo Institute of Arts KIA History","The Kalamazoo Institute of Arts is a nonprofit art museum and school. Since , the institute has offered art classes and free admission programming, including exhibitions, lectures, events, activities and a permanent collection. The KIAs mission is to cultivate the creation and appreciation of the visual arts for the communities",clueweb12-1811wb-14-09165
  • 10. Approach  For a given location, select candidate web pages from Clueweb  Rank the candidates based on their cosine-similarity to the POIs in the user profile (separated in a positive and a negative profile)
  • 11. Snippet Generation  Generate POI title:  Extract <title> or <header> tags  Generate personalized POI description:  Extract <description> tag  Break documents into sentences, ranked on their similarity with the user profile  Concatenate until 512 bytes reached
  • 12. Candidate selection  In 2013, the CWI Clueweb based run ranked far below all other (Open Web) runs  A few issues related to evaluation, see our ECIR 2014 short paper  But, also, the commercial Open Web search engines (Google, Bing or Yahoo!) return much better candidates for queries derived from the context than we did
  • 13. Geo-Filtering  Exact mention of given context  Format: {City, ST} e.g., Miami, FL  Exclude documents that mention multiple contexts  E.g., a Wikipedia page about cities in Florida state
  • 14. Domain Knowledge (1/2)  Point-of-Interest heuristic:  POIs will be represented on the major tourist information sites {yelp, tripadvisor, wikitravel, zagat, xpedia, orbitz, and travel.yahoo}  Extract the Clueweb documents from these domains (TouristListFiltered)  E.g., http://www.zagat.com/miami  Expand with the outlinks also contained in ClueWeb12 (TouristOutlinksFiltered)
  • 15. Domain Knowledge (2/2)  Use Foursquare API to identify the URLs of POIs for the given context 50 attractions per context Format: attraction name, URL e.g., Cortés Restaurant, http://cortesrestaurant.com Miami, FL  If the POI has no corresponding URL, use Google API with a query using foursquare POI + context, i.e., “Cortés Restaurant Miami, FL”  Extract any document from Clueweb whose host matches the (1,454 unique) hosts of the URLs identified (AttractionFiltered)
  • 16. Candidate Selection ClueWeb12 733,019,372 web pages “City, ST” 8,883,068 docs TouristListFiltered (175,260) TouristOutlinksFiltered (97,678) AttractionsFiltered (102,604) GeoFiltered TouristFiltered
  • 17. Overall Results  Note: TouristFiltered >> GeoFiltered  P@5 and MRR consider three dimensions of relevance: geographical (geo), description (desc) and document (doc) relevance
  • 18. TouristFiltered vs. GeoFiltered (I.e., TouristFiltered suggests better POIs for 33.1% of the judged topics) % topics
  • 19. Decompose metrics  Ignoring geo-relevance: GeoFiltered ~ TouristFiltered
  • 20. Decompose metrics  Ignore geo-relevance:  Geo-relevance only: The two runs have almost similar performance in the desc and doc dimensions TouristFiltered is more geographically appropriate
  • 21. Type of domain knowledge  TouristFiltered consists of three parts:  TouristListFiltered (TLF)  TouristOutlinksFiltered (TOF)  AttractionFiltered (AF) Foursquare gives the most significant improvement in performance
  • 22. Conclusions  Domain knowledge about sites that are more likely to offer attractions lead to better suggestions  The best results were obtained when identifying attractions through specialized services such as Foursquare
  • 23. Next Steps  Improve our recommendation algorithm  E.g., weighted candidate selection  Understand the remaining difference with Open Web based results  Our Clueweb results are reproduceable but not yet as good