SlideShare a Scribd company logo
1 of 42
Download to read offline
WHAT TO READ NEXT?
THREE STAGES OF DATA-DRIVEN BOOK DISCOVERY
METTE SKOV
TOINE BOGERS
AALBORG UNIVERSITY
AOIR PANEL ON ‘NETWORKED READING’ OCTOBER 7, 2016
INTRODUCTION
PART 1
BOOKS ARE NOT DEAD (THEY AREN’T EVEN SICK!)
Books remain very popular!
– Slow but steady increase in book sales to
2.7 billion books in the US in 2015
E-books make up 18.9% of that in the US
– Total sales revenue: $29.2 billion in US in
2015
E-book sales revenue was $5.3 billion
So there is definitely a market & need for
discovering (new) interesting books!
3
BOOK DISCOVERY IS NOT THAT EASY...
Readers often struggle with existing systems (search
engines & recommender systems) to discover new books
– Information needs are highly complex
Topical match, complex relevance aspects, personal interests
& preferences, context of use
– Search engines and recommenders are ill-equipped to
address such needs!
4
EXAMPLESOF COMPLEX INFORMATION NEEDS 5
SOCIAL BOOK SEARCH LAB
Series of workshops (2011-2016) with a shared data
challenge using data from LibraryThing and Amazon
Focus is on the design, development & evaluation of
systems that can address complex book requests
1. Detecting complex book requests
2. Analyzing book requests for relevance aspects
3. Developing better algorithms for suggesting relevant
books
4. Exploring interactionswith book search engines
6
OVERVIEW OF DATA SOURCES 7
2.8 million books
944 annotated
requests
Books
Users
Book
requests
ANNOTATED LT TOPIC
8
Group name
Request title
Narrative
BOOK REQUESTS
Forum posts describing realistic book search requests
– Book request narratives can touch upon many different
aspects
Users search for topics, genres, authors, plots, etc.
Users want books that are engaging, funny, well-written,
educational, etc.
Users have different preferences, knowledge, reading level,
etc.
– Book discussion fora contain many such focused requests!
LibraryThing, Goodreads, …
9
OVERVIEW OF DATA SOURCES 10
2.8 million books
944 annotated
requests
5,658 suggestionsSuggestions
Books
Users
Book
requests
ANNOTATED LT TOPIC
11
Recommended
books
Group name
Request title
Narrative
Catalog additions
12
Forum suggestion added
after the request was posted
OVERVIEW OF DATA SOURCES 13
94,000 user
profiles
2.8 million books
944 annotated
requests
5,658 suggestionsSuggestions
Books
Users
Book
requests User profiles
OVERVIEW OF DATA SOURCES 14
Suggestions
Books
Users
Book
requests
Bibliographic
metadata
Curated
metadata
User-generated
content
User profiles
Tags
Reviews
DETECTINGCOMPLEX
BOOK REQUESTS
PART 2
DETECTING BOOK REQUESTS
How common are requests for book recommendations in
the LibraryThing Forums?
– Currently 233,000+ threads in the LibraryThing forums
– Annotated a random sample of 4,000 threads of which
15.1% were book requests
– Means there are potentially over 35,000 book requests on
LibraryThing!
17
DETECTING BOOK REQUESTS
Can we detect such book requests automatically?
– Initial experiments achieved an accuracy of 94.17% on a test
set of 2,000 annotated book requests
– Most predictive characteristics
Words such as any, suggestions, looking, recommendations,
thanks, anyone, read, books, and recommend
No. of sentencesending in a question mark
Degree of expertiseof LibraryThing users replying to the thread
Ratio of suggested books cataloged afterwards by the requester
18
ANALYZING
BOOK REQUESTS
PART 3
ANALYZING BOOK REQUESTS
Book requests contain many elements that could be mined
to benefit search engines & recommendation systems
– Example: Relevance aspects
What makes a suggested book relevant to the user?
Identified eight relevance aspects in book search
requests (Reuter, 2007; Koolen et al., 2015)
20
ANALYZING BOOK REQUESTS
Accessibility
– Accessibility in terms of the language, length, or level of difficulty of a
book.
Content
– Aspects such as topic, plot, genre, style, or comprehensiveness of a book.
Engagement
– Books that fit a particular mood or interest, books that are considered high
quality, or provide a particular reading experience.
Familiarity
– Books that are similar to known books or related to a previous experience.
21
ANALYZING BOOK REQUESTS
Known-item
– Descriptions of known books with the sole purposeof identifying its title and/or
author.
Metadata
– Books with a certain title or by a certain author, editor, illustrator, publisher, in a
particular format, or written or published in certain year or period.
Novelty
– Books with content that is novel to the reader, books that are unusualor quirky.
Socio-Cultural
– Books related to the user’s socio-culturalbackground or values, books that are
popular or obscure, or books that have had a particular culturalor social impact.
22
ANALYZING BOOK REQUESTS
Distribution of relevance aspects & prediction success
24
Aspect % of narratives
(N = 944) Precision
Accessibility 16.1 % 0.31
Content 73.9 % 0.63
Engagement 22.6 % 0.32
Familiarity 35.8 % 0.47
Known-item 21.4 % 0.71
Metadata 28.0 % 0.17
Novelty 3.6 % 0.03
Socio-cultural 14.2 % 0.23
SUGGESTING
RELEVANT BOOKS
PART 4
SUGGESTING RELEVANTBOOKS 26
Suggestions
Books
Users
Book
requests
Bibliographic
metadata
Curated
metadata
User-generated
content
User profiles
Tags
Reviews
SUGGESTING RELEVANTBOOKS 29
- Title
- Publisher
- Editorial
- Creator
- Series
- Award
- Character
- Place
• Different
grou
- Blurb
- Epigraph
- First words
- Last words
- Quotation
- User reviews
• Different
grou
- Dewey
- Thesaurus
- Index terms
- Tags
Bibliographic metadata
Content
Curated metadata
Reviews
Tags
Different types of book metadata fields
SUGGESTING RELEVANTBOOKS 30
Set of metadata fields NDCG@10
Bibliographic metadata 0.2015
Content 0.0115
Curated metadata 0.0691
Tags 0.2056
Reviews 0.2832
All fields combined 0.3058
NDCG@10
0.0 0.1 0.2 0.3 0.4
SUGGESTING RELEVANTBOOKS 31
Set of metadata fields NDCG@10
Bibliographic metadata 0.2015
Content 0.0115
Curated metadata 0.0691
Tags 0.2056
Reviews 0.2832
All fields combined 0.3058
NDCG@10
0.0 0.1 0.2 0.3 0.4
ANALYZING BOOK
SEARCH BEHAVIOR
PART 5
AIM & APPROACH
We aim to contribute to building dedicated book
search and discovery services
Our long-term goal is to investigate book search
behaviour through a range of user tasks and interfaces:
– How should the user interface combine professional,
curated metadata and user-generated metadata?
– How should the user interface adapt itself as the user
progresses through their search task, and if so, how?
– When do users prefer to browse or search?
– How can we best support different types of search
tasks?
USER STUDY OF INTERACTIVE BOOK SEARCH BEHAVIOUR
Comparative user studies with 192 + 111 participants (2015 & 2016)
Welcome
Informed
Consent
Background
Pre-Task
Information
Task
Post-Task
Questions
Experience Thank you
EXPERIMENTAL TASKS
Goal-oriented task: Imagine you participate in an experiment at a desert-
island for one month. There will be no people, no TV, radio or other
distraction. The only things you are allowed to take with you are 5 books:
– On surviving on a desert island
– That will teach you something new
– Highly recommended by other users
– For fun
– About one of your personal hobbies or interests
Non-goal task: Imagine you are waiting to meet a friend in a coffee shop
or pub or the airport or your office. While waiting, you come across this
website and explore it looking for any book that you find interesting, or
engaging or relevant. Explore anything you wish until you are completely
and utterly bored…
BASELINE INTERFACE
MULTISTAGE INTERFACE – BROWSE VIEW
MULTISTAGE INTERFACE – SEARCH VIEW
MULTISTAGE INTERFACE – BOOK-BAG VIEW
WHAT HAVE WE LEARNED SO FAR?
Need for heterogeneous record information (user
generated and curated, professional data)
Multi-stage interface:
– Longer search session
– Less queries issued (more browsing)
– No differences in number of books added to book bag
Clear differences in search behaviour between the
different types of tasks
(Gäde et al. 2015, 2016)
OPEN
QUESTIONS
PART 6
CONCLUSIONS
Tens of thousands of information needs are going unmet
– Just the tip of the iceberg?
– Search engines and recommender systems are ill-equipped
to deal with this!
42
OPEN QUESTIONS
How (dis)similar are relevance aspects for books to those
for other domains?
How do relevance aspects influence the choice of
algorithm(s) & data representation(s)?
How does the combination of data from different sources
(Amazon, LibraryThing, Library of Congress, British Library)
affect the quality of the results and UX?
Decontextualized metadata: What happens when we mix
metadata from different sources?
– Example: reuse of recommendations or tags ‘out of context’
43
QUESTIONS?
“ALWAYS READ SOMETHING THAT
WILL MAKE YOU LOOK GOOD IF
YOU DIE IN THE MIDDLE OF IT.”
P.J. O’Rourke
REFERENCES
Slide 3
– Book sales statistics taken from https://www.statista.com/topics/1177/book-
market/ and https://www.statista.com/topics/1474/e-books/; last visited
October 1, 2016
Slide 6
– Official website of the Social Book Search lab: http://social-book-
search.humanities.uva.nl/
Slide 20
– Reuter, K. (2007). Assessing Aesthetic Relevance: Children’s Book Selection in a
Digital Library. JASIST, 58(12), 1745–1763.
– Koolen, M., Bogers, T., Van den Bosch, A., and Kamps, J. (2015). Looking for
Books in Social Media: An Analysis of Complex Search Requests. Proceedings of
ECIR 2015, Volume 9022 of the series Lecture Notes in Computer Science, pp.
184-196
46
REFERENCES
Slide 40
– Gäde, M., Hall, M., Huurdeman, H., Kamps, J., Koolen, M., Skov, M., Toms, E. &
Walsh, D. (2015). Overview of the SBS 2015 Interactive Track . Working Notes of
CLEF 2015 – Conference and Labs of the Evaluation Forum, CEUR workshop
proceedings, vol. 1391
– Gäde, M., Hall, M., Huurdeman, H., Kamps, J., Koolen, M., Skov, M., Bogers, T. &
Walsh, D. (2015). Overview of the SBS 2016 Interactive Track. Working Notes of
CLEF 2016 – Conference and Labs of the Evaluation Forum, CEUR workshop
proceedings, vol. 1609
47

More Related Content

Viewers also liked

RDA and the semantic Web
RDA and the semantic WebRDA and the semantic Web
RDA and the semantic WebGordon Dunsire
 
BIBFRAME : the future of cataloguing?
BIBFRAME : the future of cataloguing?BIBFRAME : the future of cataloguing?
BIBFRAME : the future of cataloguing?Thomas Meehan
 
Supporting privacy protection in personalized web search
Supporting privacy protection in personalized web search Supporting privacy protection in personalized web search
Supporting privacy protection in personalized web search Adz91 Digital Ads Pvt Ltd
 
Supporting privacy protection in personalized web search
Supporting privacy protection in personalized web searchSupporting privacy protection in personalized web search
Supporting privacy protection in personalized web searchIGEEKS TECHNOLOGIES
 
Personalized search
Personalized searchPersonalized search
Personalized searchToine Bogers
 
Cataloging with RDA: An Overview
Cataloging with RDA: An OverviewCataloging with RDA: An Overview
Cataloging with RDA: An OverviewEmily Nimsakont
 

Viewers also liked (7)

RDA and the semantic Web
RDA and the semantic WebRDA and the semantic Web
RDA and the semantic Web
 
BIBFRAME : the future of cataloguing?
BIBFRAME : the future of cataloguing?BIBFRAME : the future of cataloguing?
BIBFRAME : the future of cataloguing?
 
Supporting privacy protection in personalized web search
Supporting privacy protection in personalized web search Supporting privacy protection in personalized web search
Supporting privacy protection in personalized web search
 
Supporting privacy protection in personalized web search
Supporting privacy protection in personalized web searchSupporting privacy protection in personalized web search
Supporting privacy protection in personalized web search
 
Personalized search
Personalized searchPersonalized search
Personalized search
 
Personalized Web Search
Personalized Web SearchPersonalized Web Search
Personalized Web Search
 
Cataloging with RDA: An Overview
Cataloging with RDA: An OverviewCataloging with RDA: An Overview
Cataloging with RDA: An Overview
 

More from Toine Bogers

"If I like BLANK, what else will I like?": Analyzing a Human Recommendation C...
"If I like BLANK, what else will I like?": Analyzing a Human Recommendation C..."If I like BLANK, what else will I like?": Analyzing a Human Recommendation C...
"If I like BLANK, what else will I like?": Analyzing a Human Recommendation C...Toine Bogers
 
Hands-free but not Eyes-free: A Usability Evaluation of Siri while Driving
Hands-free but not Eyes-free: A Usability Evaluation of Siri while DrivingHands-free but not Eyes-free: A Usability Evaluation of Siri while Driving
Hands-free but not Eyes-free: A Usability Evaluation of Siri while DrivingToine Bogers
 
“Looking for an Amazing Game I Can Relax and Sink Hours into...”: A Study of ...
“Looking for an Amazing Game I Can Relax and Sink Hours into...”: A Study of ...“Looking for an Amazing Game I Can Relax and Sink Hours into...”: A Study of ...
“Looking for an Amazing Game I Can Relax and Sink Hours into...”: A Study of ...Toine Bogers
 
A Study of Usage and Usability of Intelligent Personal Assistants in Denmark
A Study of Usage and Usability of Intelligent Personal Assistants in DenmarkA Study of Usage and Usability of Intelligent Personal Assistants in Denmark
A Study of Usage and Usability of Intelligent Personal Assistants in DenmarkToine Bogers
 
“What was this movie about this chick?”: A Comparative Study of Relevance Asp...
“What was this movie about this chick?”: A Comparative Study of Relevance Asp...“What was this movie about this chick?”: A Comparative Study of Relevance Asp...
“What was this movie about this chick?”: A Comparative Study of Relevance Asp...Toine Bogers
 
"I just scroll through my stuff until I find it or give up": A Contextual Inq...
"I just scroll through my stuff until I find it or give up": A Contextual Inq..."I just scroll through my stuff until I find it or give up": A Contextual Inq...
"I just scroll through my stuff until I find it or give up": A Contextual Inq...Toine Bogers
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingToine Bogers
 
Defining and Supporting Narrative-driven Recommendation
Defining and Supporting Narrative-driven RecommendationDefining and Supporting Narrative-driven Recommendation
Defining and Supporting Narrative-driven RecommendationToine Bogers
 
A Longitudinal Analysis of Search Engine Index Size
A Longitudinal Analysis of Search Engine Index SizeA Longitudinal Analysis of Search Engine Index Size
A Longitudinal Analysis of Search Engine Index SizeToine Bogers
 
Tagging vs. Controlled Vocabulary: Which is More Helpful for Book Search?
Tagging vs. Controlled Vocabulary: Which is More Helpful for Book Search?Tagging vs. Controlled Vocabulary: Which is More Helpful for Book Search?
Tagging vs. Controlled Vocabulary: Which is More Helpful for Book Search?Toine Bogers
 
Measuring System Performance in Cultural Heritage Systems
Measuring System Performance in Cultural Heritage SystemsMeasuring System Performance in Cultural Heritage Systems
Measuring System Performance in Cultural Heritage SystemsToine Bogers
 
How 'Social' are Social News Sites? Exploring the Motivations for Using Reddi...
How 'Social' are Social News Sites? Exploring the Motivations for Using Reddi...How 'Social' are Social News Sites? Exploring the Motivations for Using Reddi...
How 'Social' are Social News Sites? Exploring the Motivations for Using Reddi...Toine Bogers
 
Search & Recommendation: Birds of a Feather?
Search & Recommendation: Birds of a Feather?Search & Recommendation: Birds of a Feather?
Search & Recommendation: Birds of a Feather?Toine Bogers
 
Micro-Serendipity: Meaningful Coincidences in Everyday Life Shared on Twitter
Micro-Serendipity: Meaningful Coincidences in Everyday Life Shared on TwitterMicro-Serendipity: Meaningful Coincidences in Everyday Life Shared on Twitter
Micro-Serendipity: Meaningful Coincidences in Everyday Life Shared on TwitterToine Bogers
 
Benchmarking Domain-specific Expert Search using Workshop Program Committees
Benchmarking Domain-specific Expert Search using Workshop Program CommitteesBenchmarking Domain-specific Expert Search using Workshop Program Committees
Benchmarking Domain-specific Expert Search using Workshop Program CommitteesToine Bogers
 

More from Toine Bogers (15)

"If I like BLANK, what else will I like?": Analyzing a Human Recommendation C...
"If I like BLANK, what else will I like?": Analyzing a Human Recommendation C..."If I like BLANK, what else will I like?": Analyzing a Human Recommendation C...
"If I like BLANK, what else will I like?": Analyzing a Human Recommendation C...
 
Hands-free but not Eyes-free: A Usability Evaluation of Siri while Driving
Hands-free but not Eyes-free: A Usability Evaluation of Siri while DrivingHands-free but not Eyes-free: A Usability Evaluation of Siri while Driving
Hands-free but not Eyes-free: A Usability Evaluation of Siri while Driving
 
“Looking for an Amazing Game I Can Relax and Sink Hours into...”: A Study of ...
“Looking for an Amazing Game I Can Relax and Sink Hours into...”: A Study of ...“Looking for an Amazing Game I Can Relax and Sink Hours into...”: A Study of ...
“Looking for an Amazing Game I Can Relax and Sink Hours into...”: A Study of ...
 
A Study of Usage and Usability of Intelligent Personal Assistants in Denmark
A Study of Usage and Usability of Intelligent Personal Assistants in DenmarkA Study of Usage and Usability of Intelligent Personal Assistants in Denmark
A Study of Usage and Usability of Intelligent Personal Assistants in Denmark
 
“What was this movie about this chick?”: A Comparative Study of Relevance Asp...
“What was this movie about this chick?”: A Comparative Study of Relevance Asp...“What was this movie about this chick?”: A Comparative Study of Relevance Asp...
“What was this movie about this chick?”: A Comparative Study of Relevance Asp...
 
"I just scroll through my stuff until I find it or give up": A Contextual Inq...
"I just scroll through my stuff until I find it or give up": A Contextual Inq..."I just scroll through my stuff until I find it or give up": A Contextual Inq...
"I just scroll through my stuff until I find it or give up": A Contextual Inq...
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Defining and Supporting Narrative-driven Recommendation
Defining and Supporting Narrative-driven RecommendationDefining and Supporting Narrative-driven Recommendation
Defining and Supporting Narrative-driven Recommendation
 
A Longitudinal Analysis of Search Engine Index Size
A Longitudinal Analysis of Search Engine Index SizeA Longitudinal Analysis of Search Engine Index Size
A Longitudinal Analysis of Search Engine Index Size
 
Tagging vs. Controlled Vocabulary: Which is More Helpful for Book Search?
Tagging vs. Controlled Vocabulary: Which is More Helpful for Book Search?Tagging vs. Controlled Vocabulary: Which is More Helpful for Book Search?
Tagging vs. Controlled Vocabulary: Which is More Helpful for Book Search?
 
Measuring System Performance in Cultural Heritage Systems
Measuring System Performance in Cultural Heritage SystemsMeasuring System Performance in Cultural Heritage Systems
Measuring System Performance in Cultural Heritage Systems
 
How 'Social' are Social News Sites? Exploring the Motivations for Using Reddi...
How 'Social' are Social News Sites? Exploring the Motivations for Using Reddi...How 'Social' are Social News Sites? Exploring the Motivations for Using Reddi...
How 'Social' are Social News Sites? Exploring the Motivations for Using Reddi...
 
Search & Recommendation: Birds of a Feather?
Search & Recommendation: Birds of a Feather?Search & Recommendation: Birds of a Feather?
Search & Recommendation: Birds of a Feather?
 
Micro-Serendipity: Meaningful Coincidences in Everyday Life Shared on Twitter
Micro-Serendipity: Meaningful Coincidences in Everyday Life Shared on TwitterMicro-Serendipity: Meaningful Coincidences in Everyday Life Shared on Twitter
Micro-Serendipity: Meaningful Coincidences in Everyday Life Shared on Twitter
 
Benchmarking Domain-specific Expert Search using Workshop Program Committees
Benchmarking Domain-specific Expert Search using Workshop Program CommitteesBenchmarking Domain-specific Expert Search using Workshop Program Committees
Benchmarking Domain-specific Expert Search using Workshop Program Committees
 

Recently uploaded

Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfSumit Kumar yadav
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINsankalpkumarsahoo174
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPirithiRaju
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000Sapana Sha
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencySheetal Arora
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSSLeenakshiTyagi
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 

Recently uploaded (20)

Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSS
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 

What to Read Next? Three Stages of Data-driven Book Discovery

  • 1. WHAT TO READ NEXT? THREE STAGES OF DATA-DRIVEN BOOK DISCOVERY METTE SKOV TOINE BOGERS AALBORG UNIVERSITY AOIR PANEL ON ‘NETWORKED READING’ OCTOBER 7, 2016
  • 3. BOOKS ARE NOT DEAD (THEY AREN’T EVEN SICK!) Books remain very popular! – Slow but steady increase in book sales to 2.7 billion books in the US in 2015 E-books make up 18.9% of that in the US – Total sales revenue: $29.2 billion in US in 2015 E-book sales revenue was $5.3 billion So there is definitely a market & need for discovering (new) interesting books! 3
  • 4. BOOK DISCOVERY IS NOT THAT EASY... Readers often struggle with existing systems (search engines & recommender systems) to discover new books – Information needs are highly complex Topical match, complex relevance aspects, personal interests & preferences, context of use – Search engines and recommenders are ill-equipped to address such needs! 4
  • 6. SOCIAL BOOK SEARCH LAB Series of workshops (2011-2016) with a shared data challenge using data from LibraryThing and Amazon Focus is on the design, development & evaluation of systems that can address complex book requests 1. Detecting complex book requests 2. Analyzing book requests for relevance aspects 3. Developing better algorithms for suggesting relevant books 4. Exploring interactionswith book search engines 6
  • 7. OVERVIEW OF DATA SOURCES 7 2.8 million books 944 annotated requests Books Users Book requests
  • 8. ANNOTATED LT TOPIC 8 Group name Request title Narrative
  • 9. BOOK REQUESTS Forum posts describing realistic book search requests – Book request narratives can touch upon many different aspects Users search for topics, genres, authors, plots, etc. Users want books that are engaging, funny, well-written, educational, etc. Users have different preferences, knowledge, reading level, etc. – Book discussion fora contain many such focused requests! LibraryThing, Goodreads, … 9
  • 10. OVERVIEW OF DATA SOURCES 10 2.8 million books 944 annotated requests 5,658 suggestionsSuggestions Books Users Book requests
  • 11. ANNOTATED LT TOPIC 11 Recommended books Group name Request title Narrative
  • 12. Catalog additions 12 Forum suggestion added after the request was posted
  • 13. OVERVIEW OF DATA SOURCES 13 94,000 user profiles 2.8 million books 944 annotated requests 5,658 suggestionsSuggestions Books Users Book requests User profiles
  • 14. OVERVIEW OF DATA SOURCES 14 Suggestions Books Users Book requests Bibliographic metadata Curated metadata User-generated content User profiles Tags Reviews
  • 16. DETECTING BOOK REQUESTS How common are requests for book recommendations in the LibraryThing Forums? – Currently 233,000+ threads in the LibraryThing forums – Annotated a random sample of 4,000 threads of which 15.1% were book requests – Means there are potentially over 35,000 book requests on LibraryThing! 17
  • 17. DETECTING BOOK REQUESTS Can we detect such book requests automatically? – Initial experiments achieved an accuracy of 94.17% on a test set of 2,000 annotated book requests – Most predictive characteristics Words such as any, suggestions, looking, recommendations, thanks, anyone, read, books, and recommend No. of sentencesending in a question mark Degree of expertiseof LibraryThing users replying to the thread Ratio of suggested books cataloged afterwards by the requester 18
  • 19. ANALYZING BOOK REQUESTS Book requests contain many elements that could be mined to benefit search engines & recommendation systems – Example: Relevance aspects What makes a suggested book relevant to the user? Identified eight relevance aspects in book search requests (Reuter, 2007; Koolen et al., 2015) 20
  • 20. ANALYZING BOOK REQUESTS Accessibility – Accessibility in terms of the language, length, or level of difficulty of a book. Content – Aspects such as topic, plot, genre, style, or comprehensiveness of a book. Engagement – Books that fit a particular mood or interest, books that are considered high quality, or provide a particular reading experience. Familiarity – Books that are similar to known books or related to a previous experience. 21
  • 21. ANALYZING BOOK REQUESTS Known-item – Descriptions of known books with the sole purposeof identifying its title and/or author. Metadata – Books with a certain title or by a certain author, editor, illustrator, publisher, in a particular format, or written or published in certain year or period. Novelty – Books with content that is novel to the reader, books that are unusualor quirky. Socio-Cultural – Books related to the user’s socio-culturalbackground or values, books that are popular or obscure, or books that have had a particular culturalor social impact. 22
  • 22. ANALYZING BOOK REQUESTS Distribution of relevance aspects & prediction success 24 Aspect % of narratives (N = 944) Precision Accessibility 16.1 % 0.31 Content 73.9 % 0.63 Engagement 22.6 % 0.32 Familiarity 35.8 % 0.47 Known-item 21.4 % 0.71 Metadata 28.0 % 0.17 Novelty 3.6 % 0.03 Socio-cultural 14.2 % 0.23
  • 25. SUGGESTING RELEVANTBOOKS 29 - Title - Publisher - Editorial - Creator - Series - Award - Character - Place • Different grou - Blurb - Epigraph - First words - Last words - Quotation - User reviews • Different grou - Dewey - Thesaurus - Index terms - Tags Bibliographic metadata Content Curated metadata Reviews Tags Different types of book metadata fields
  • 26. SUGGESTING RELEVANTBOOKS 30 Set of metadata fields NDCG@10 Bibliographic metadata 0.2015 Content 0.0115 Curated metadata 0.0691 Tags 0.2056 Reviews 0.2832 All fields combined 0.3058 NDCG@10 0.0 0.1 0.2 0.3 0.4
  • 27. SUGGESTING RELEVANTBOOKS 31 Set of metadata fields NDCG@10 Bibliographic metadata 0.2015 Content 0.0115 Curated metadata 0.0691 Tags 0.2056 Reviews 0.2832 All fields combined 0.3058 NDCG@10 0.0 0.1 0.2 0.3 0.4
  • 29. AIM & APPROACH We aim to contribute to building dedicated book search and discovery services Our long-term goal is to investigate book search behaviour through a range of user tasks and interfaces: – How should the user interface combine professional, curated metadata and user-generated metadata? – How should the user interface adapt itself as the user progresses through their search task, and if so, how? – When do users prefer to browse or search? – How can we best support different types of search tasks?
  • 30. USER STUDY OF INTERACTIVE BOOK SEARCH BEHAVIOUR Comparative user studies with 192 + 111 participants (2015 & 2016) Welcome Informed Consent Background Pre-Task Information Task Post-Task Questions Experience Thank you
  • 31. EXPERIMENTAL TASKS Goal-oriented task: Imagine you participate in an experiment at a desert- island for one month. There will be no people, no TV, radio or other distraction. The only things you are allowed to take with you are 5 books: – On surviving on a desert island – That will teach you something new – Highly recommended by other users – For fun – About one of your personal hobbies or interests Non-goal task: Imagine you are waiting to meet a friend in a coffee shop or pub or the airport or your office. While waiting, you come across this website and explore it looking for any book that you find interesting, or engaging or relevant. Explore anything you wish until you are completely and utterly bored…
  • 35. MULTISTAGE INTERFACE – BOOK-BAG VIEW
  • 36. WHAT HAVE WE LEARNED SO FAR? Need for heterogeneous record information (user generated and curated, professional data) Multi-stage interface: – Longer search session – Less queries issued (more browsing) – No differences in number of books added to book bag Clear differences in search behaviour between the different types of tasks (Gäde et al. 2015, 2016)
  • 38. CONCLUSIONS Tens of thousands of information needs are going unmet – Just the tip of the iceberg? – Search engines and recommender systems are ill-equipped to deal with this! 42
  • 39. OPEN QUESTIONS How (dis)similar are relevance aspects for books to those for other domains? How do relevance aspects influence the choice of algorithm(s) & data representation(s)? How does the combination of data from different sources (Amazon, LibraryThing, Library of Congress, British Library) affect the quality of the results and UX? Decontextualized metadata: What happens when we mix metadata from different sources? – Example: reuse of recommendations or tags ‘out of context’ 43
  • 40. QUESTIONS? “ALWAYS READ SOMETHING THAT WILL MAKE YOU LOOK GOOD IF YOU DIE IN THE MIDDLE OF IT.” P.J. O’Rourke
  • 41. REFERENCES Slide 3 – Book sales statistics taken from https://www.statista.com/topics/1177/book- market/ and https://www.statista.com/topics/1474/e-books/; last visited October 1, 2016 Slide 6 – Official website of the Social Book Search lab: http://social-book- search.humanities.uva.nl/ Slide 20 – Reuter, K. (2007). Assessing Aesthetic Relevance: Children’s Book Selection in a Digital Library. JASIST, 58(12), 1745–1763. – Koolen, M., Bogers, T., Van den Bosch, A., and Kamps, J. (2015). Looking for Books in Social Media: An Analysis of Complex Search Requests. Proceedings of ECIR 2015, Volume 9022 of the series Lecture Notes in Computer Science, pp. 184-196 46
  • 42. REFERENCES Slide 40 – Gäde, M., Hall, M., Huurdeman, H., Kamps, J., Koolen, M., Skov, M., Toms, E. & Walsh, D. (2015). Overview of the SBS 2015 Interactive Track . Working Notes of CLEF 2015 – Conference and Labs of the Evaluation Forum, CEUR workshop proceedings, vol. 1391 – Gäde, M., Hall, M., Huurdeman, H., Kamps, J., Koolen, M., Skov, M., Bogers, T. & Walsh, D. (2015). Overview of the SBS 2016 Interactive Track. Working Notes of CLEF 2016 – Conference and Labs of the Evaluation Forum, CEUR workshop proceedings, vol. 1609 47