SlideShare a Scribd company logo
1 of 31
Influence of Timeline and Named-entity Components 
on User Engagement 
Yashar Moshfeghi1, Michael Matthews2, Roi Blanco2, Joemon M. Jose1 
1 School of Computing Science, University of Glasgow, Glasgow, UK 
2 Yahoo! Labs, Barcelona, Spain 
Yashar.Moshfeghi@glasgow.ac.uk 
ECIR 2013, Moscow, Russia
Outline 
• User Engagement 
• Prediction of User-centred metrics 
• Evaluation Methodology 
• Results 
• Conclusions
but also 
engaging
Time Named-Entity 
a Cranfield-style 
paradigm 
user 
engagement
Research Question 
• We aim to answer the following research 
question: 
– “can timeline and named-entity components 
improve user engagement in the context of a 
news retrieval system?”
Multi-faceted 
concept: 
emotional, 
cognitive and 
behavioural 
Subjective measures 
(O’Brien and Toms): 
focused attention, 
aesthetics, 
perceived usability, 
endurability, 
novelty, 
involvement 
Objective measures: 
Subjective Perception 
of Time
An increase of information-rich user experiences in 
the search realm (logged interaction data) 
Prediction of user 
preferences for web 
search results 
Prediction of user-centred 
metrics of 
an IIR system 
Build search applications in which the layout and elements 
displayed adapt to the needs of the user or context
Submit Query 
Retrieved Results 
The News System Anatomy 
Timeline Component 
Entity Component
Experimental Methodology 
• Design 
– A ‘within-subjects’ design was used in this study. 
• The independent variable 
– the system (with two levels: baseline, enriched), 
– controlled by the viewing timeline and named-entity 
components (enriched) or hiding them (baseline). 
• The dependent variables were: 
– (i) user engagement 
• (involvement, novelty, endurability, usability, aesthetics, attention) 
– (ii) system preference
Experimental Methodology - Task 
• We used a simulated information need 
situation. 
• The simulated task was defined as follow: 
– “Imagine you are reading today’s news events and 
one of them is very important or interesting to 
you, and you want to learn more. Find as much 
relevant news information as possible so that you 
can construct an overall (big) picture of the event 
and also cover the important parts of it.”
Experimental Methodology - Task 
• The search task was presented twice to each 
participant with different search topics.
• Advantages: 
• Reduced monetary cost 
• Ease of engaging a large number of users in the study. 
• Disadvantages 
• Low quality data and in turn, the challenge is to improve 
and assure data quality. 
• Need for techniques to minimise 
• spammers, 
• multiple account workers 
• Lazy worker
• Multiple response technique for our questionnaire 
• known to be very effective and cost efficient to improve 
the data quality 
• Browser cookies were used to guard against multiple account 
workers 
• To avoid spammers (as recommended in the literature), 
• Population screening based on location (United States) 
• HIT approval rate greater than 95% 
• To reduce attrition, demographic questions were put at the 
beginning of the experimental procedure.
Experimental Methodology - Procedure 
• Participants were instructed that the experiment 
would take approximately 60 minutes to complete 
• They were informed that they could only participate 
in this study once 
• Payment for study completion was $5 (The total cost 
of the evaluation was $510 ) 
• Each participant had to complete two search tasks, 
one for each level of independent variable (i.e. 
baseline and enriched system)
Experimental Methodology - 
Procedure
Experimental Methodology 
• We considered six dimensions introduced by O’Brien et al.: 
– focused attention, aesthetics, perceived usability, endurability, 
novelty, and involvement 
• The different dimensions were measured through a number 
of forced-choice type questions. 
• A 5-point scale respond (strong disagree to strong agree) 
– “Based on this news retrieval experience, please indicate whether 
you agree or disagree with each statement”. 
• In total, in each post-search questionnaire we have asked 
31 questions related to user engagement 
– adapted from O’Brien et al. 
– randomised its assignment to participants
Experimental Methodology 
• Pilot Studies: 
– We run three pilot studies using 10 participants. 
– Other changes consisted of 
• modifications to the questionnaires to clarify questions, 
• modifications to the system to improve logging capabilities 
• improvements to the training video. 
– After the final pilot, it was determined that 
• the participants were able to complete the user study 
without problems 
• the system was correctly logging the interaction data.
Results Analysis – Data Preprocessing 
• To ensure the availability of relevant documents 
– two evaluators manually calculated 
• the Precision@1, 5, and 10 
• for all the topics 
• a set of queries issued by the participants. 
– Precision@1, 5 and 10 were 0.85, 0.84, and 0.86 
respectively, 
– Judges had a very high inter-annotator agreement with 
Kappa > 0.9. 
– This indicates that the queries the users issued into the 
system had good coverage and the ranking was 
accurate enough.
Results Analysis – Data Preprocessing 
• 63 out of 92 users successfully completed the study. 
• A relatively even split by condition, with 47% in the scenario 
where group 1, and 53% conversely. 
• We removed the 
– incomplete surveys 
– participants who repeated the study 
– participants who completed the survey incorrectly (based on task 
conditions) 
• they had to visit at least three relevant documents for a given topic, and 
• the issued queries should be related to the selected topic 
– identifying suspect attempts by checking 
• the extremely short task durations 
• comments that are repeated verbatim across multiple open-ended questions
Results Analysis – Demographic Info. 
• 126 search sessions that were successfully carried out by 63 participants. 
• The 63 participants 
– female=46%, male=54%, prefer not to say=0% 
– were mainly under the age of 41 (84%) 
• with the largest group between the ages of 24-29 (33.3%). 
• Participants had 
– a high school diploma or equivalent (11.11%), 
– associates degree (15.87%), 
– graduate degree (11.11%), 
– bachelor (31.7%) or 
– some college degree (30.15%). 
• They were 
– primarily employed for a company or organisation (39.68%), 
– though there were a number of self-employed (22.22%), 
– students (11.11%), and 
– not employed (26.98%).
Results Analysis 
Enriched ** 
Baseline 
Enriched * 
Baseline 
Enriched * 
Baseline 
Enriched 
Baseline 
Enriched ** 
Baseline 
Enriched 
Baseline 
1 
2 
3 
4 
5 
User Engagement 
Involvement 
Novelty 
Endurablility 
Usability 
Aesthetics 
Attention
Results Analysis 
• We did not find any statistically significant 
difference between the two systems for 
Subjective Perception of Time metric 
– with mean and standard deviation of 10.03, ± 
5.22, and 10.12, ±4.95, for the baseline and 
enriched system respectively
Results Analysis - System Preference 
• the exit questionnaire posed the question 
– “Please select the system you preferred? (answer: 
1: First System, 2: Second System)” 
– and overall, 76% of the participants preferred the 
enriched system better than the baseline system.
Prediction of User-centred Metrics: 
• The demographic features 
– participants’ age, gender, education, and occupation 
• The search habits features 
– the number of years they have used web search and online news 
systems, 
– the frequency they engaged in different news search intention 
such as browsing, navigating, searching, etc. 
– the news domain they are interested in 
• The interaction features (derived from log information) 
– the total time they spent on each component and to complete a 
task, 
– the number of clicks, retrieved documents, queries, 
– the number of times they used the previous/next button, and 
other functionality of the systems
Prediction of User-centred Metrics: 
• We chose 
– the System Preference question 
– all the user engagement dimensions. 
• For System Preference question, 
– we have a binary class of “−1” indicating the participant did 
not prefer the enriched system and “+1” otherwise. 
• For the user engagement dimensions, 
– we used the final value calculated by aggregating all the 
questions related to each dimension 
– We transformed the values for each dimension to binary by 
mapping 4-5 to “+1” and otherwise to “−1”
Prediction of User-centred Metrics: 
• We learned a model to discriminate between the 
two classes using 
– SVMs trained with a polynomial kernel, 
– based on our analysis in the majority of cases, 
outperformed other SVM kernels (linear, and radial-basis). 
• We also tried other models such as bayesian 
logistic regression and decision trees but they 
underperformed with respect to SVMs.
Prediction of User-centred Metrics: 
• classification performance 
– averaged over the 63 participants of the study 
– using 10-fold cross validation 
 Results indicate that 
◦ for all the user engagement dimensions (excluding focused 
attention), the combination of all features leads to the best 
prediction accuracy 
◦ Regarding the system preference question, user-system 
interaction features determine with high accuracy the 
participants’ preference of a system (over 87%).
Summary 
• Given the competitiveness of the market on the web, applications 
nowadays are designed to be both efficient and engaging. 
• Thus, a new line of research is to identify system features that 
steer user engagement. 
• This work studies the interplay between user engagement and 
retrieval of named-entities and time, in an interactive search 
scenario. 
• We devised an experimental setup that exposed our participants 
on two news systems, one with a timeline and named-entity 
components and one without. 
• Two search tasks were performed by the participants and through 
questionnaires, user engagement was analysed.
Conclusions 
• Overall findings based on user questionnaires, show that substantial user 
engagement improvements can be achieved by integrating time and entity 
information into the system. 
• Further analysis of the results show that the majority of the participants 
preferred the enriched system over the baseline system. 
• We also investigated the hypothesis that user-centred metrics can be 
predicted in an IIR scenario given the participants’ demographics and 
search habits, and/or interaction with the system. 
• The results obtained across all the user engagement dimensions as well as 
System Preference question, supported our hypothesis. 
• As future work, we will continue to study how user interactions can be 
leveraged to predict satisfaction measures and possibly build interfaces 
that adapt based on user interaction patterns.
Acknowledgement: This work was partially supported by the EU FP7 LiMoSINe project 
(288024). 
This work was performed while intern at Yahoo! Research lab in

More Related Content

What's hot

INFORMATION RETRIEVAL ‎AND DISSEMINATION
INFORMATION RETRIEVAL ‎AND DISSEMINATIONINFORMATION RETRIEVAL ‎AND DISSEMINATION
INFORMATION RETRIEVAL ‎AND DISSEMINATIONLibcorpio
 
Model of information retrieval (3)
Model  of information retrieval (3)Model  of information retrieval (3)
Model of information retrieval (3)9866825059
 
CS6007 information retrieval - 5 units notes
CS6007   information retrieval - 5 units notesCS6007   information retrieval - 5 units notes
CS6007 information retrieval - 5 units notesAnandh Arumugakan
 
Tutorial 1 (information retrieval basics)
Tutorial 1 (information retrieval basics)Tutorial 1 (information retrieval basics)
Tutorial 1 (information retrieval basics)Kira
 
Konsep Dasar Information Retrieval - Edi faizal
Konsep Dasar Information Retrieval - Edi faizal Konsep Dasar Information Retrieval - Edi faizal
Konsep Dasar Information Retrieval - Edi faizal EdiFaizal2
 
Information retrieval
Information retrievalInformation retrieval
Information retrievalhplap
 
Information storage and retrieval
Information storage and retrievalInformation storage and retrieval
Information storage and retrievalSadaf Rafiq
 
INFORMATION RETRIEVAL Anandraj.L
INFORMATION RETRIEVAL Anandraj.LINFORMATION RETRIEVAL Anandraj.L
INFORMATION RETRIEVAL Anandraj.Lanujessy
 
Tdm information retrieval
Tdm information retrievalTdm information retrieval
Tdm information retrievalKU Leuven
 
Information retrieval s
Information retrieval sInformation retrieval s
Information retrieval ssilambu111
 
Large-Scale Semantic Search
Large-Scale Semantic SearchLarge-Scale Semantic Search
Large-Scale Semantic SearchRoi Blanco
 
Semantic Search
Semantic SearchSemantic Search
Semantic Searchsssw2012
 
Functions of information retrival system(1)
Functions of information retrival system(1)Functions of information retrival system(1)
Functions of information retrival system(1)silambu111
 
Information Retrieval
Information RetrievalInformation Retrieval
Information Retrievalrchbeir
 
Using “Distant Reading” to Explore Discussion Threads in Online Courses
Using “Distant Reading” to Explore Discussion Threads in Online CoursesUsing “Distant Reading” to Explore Discussion Threads in Online Courses
Using “Distant Reading” to Explore Discussion Threads in Online CoursesShalin Hai-Jew
 
"Mass Surveillance" through Distant Reading
"Mass Surveillance" through Distant Reading"Mass Surveillance" through Distant Reading
"Mass Surveillance" through Distant ReadingShalin Hai-Jew
 
Aggregation for searching complex information spaces
Aggregation for searching complex information spacesAggregation for searching complex information spaces
Aggregation for searching complex information spacesMounia Lalmas-Roelleke
 
Information Storage and Retrieval : A Case Study
Information Storage and Retrieval : A Case StudyInformation Storage and Retrieval : A Case Study
Information Storage and Retrieval : A Case StudyBhojaraju Gunjal
 
Tovek Presentation by Livio Costantini
Tovek Presentation by Livio CostantiniTovek Presentation by Livio Costantini
Tovek Presentation by Livio Costantinimaxfalc
 

What's hot (20)

INFORMATION RETRIEVAL ‎AND DISSEMINATION
INFORMATION RETRIEVAL ‎AND DISSEMINATIONINFORMATION RETRIEVAL ‎AND DISSEMINATION
INFORMATION RETRIEVAL ‎AND DISSEMINATION
 
Model of information retrieval (3)
Model  of information retrieval (3)Model  of information retrieval (3)
Model of information retrieval (3)
 
CS6007 information retrieval - 5 units notes
CS6007   information retrieval - 5 units notesCS6007   information retrieval - 5 units notes
CS6007 information retrieval - 5 units notes
 
Tutorial 1 (information retrieval basics)
Tutorial 1 (information retrieval basics)Tutorial 1 (information retrieval basics)
Tutorial 1 (information retrieval basics)
 
Konsep Dasar Information Retrieval - Edi faizal
Konsep Dasar Information Retrieval - Edi faizal Konsep Dasar Information Retrieval - Edi faizal
Konsep Dasar Information Retrieval - Edi faizal
 
Information retrieval
Information retrievalInformation retrieval
Information retrieval
 
Information storage and retrieval
Information storage and retrievalInformation storage and retrieval
Information storage and retrieval
 
Ir 01
Ir   01Ir   01
Ir 01
 
INFORMATION RETRIEVAL Anandraj.L
INFORMATION RETRIEVAL Anandraj.LINFORMATION RETRIEVAL Anandraj.L
INFORMATION RETRIEVAL Anandraj.L
 
Tdm information retrieval
Tdm information retrievalTdm information retrieval
Tdm information retrieval
 
Information retrieval s
Information retrieval sInformation retrieval s
Information retrieval s
 
Large-Scale Semantic Search
Large-Scale Semantic SearchLarge-Scale Semantic Search
Large-Scale Semantic Search
 
Semantic Search
Semantic SearchSemantic Search
Semantic Search
 
Functions of information retrival system(1)
Functions of information retrival system(1)Functions of information retrival system(1)
Functions of information retrival system(1)
 
Information Retrieval
Information RetrievalInformation Retrieval
Information Retrieval
 
Using “Distant Reading” to Explore Discussion Threads in Online Courses
Using “Distant Reading” to Explore Discussion Threads in Online CoursesUsing “Distant Reading” to Explore Discussion Threads in Online Courses
Using “Distant Reading” to Explore Discussion Threads in Online Courses
 
"Mass Surveillance" through Distant Reading
"Mass Surveillance" through Distant Reading"Mass Surveillance" through Distant Reading
"Mass Surveillance" through Distant Reading
 
Aggregation for searching complex information spaces
Aggregation for searching complex information spacesAggregation for searching complex information spaces
Aggregation for searching complex information spaces
 
Information Storage and Retrieval : A Case Study
Information Storage and Retrieval : A Case StudyInformation Storage and Retrieval : A Case Study
Information Storage and Retrieval : A Case Study
 
Tovek Presentation by Livio Costantini
Tovek Presentation by Livio CostantiniTovek Presentation by Livio Costantini
Tovek Presentation by Livio Costantini
 

Viewers also liked

Pkbm ekonomi 11 01
Pkbm ekonomi 11 01Pkbm ekonomi 11 01
Pkbm ekonomi 11 01Ridwan Gucci
 
Amiel pangilinan how to use vlc
Amiel pangilinan how to use vlcAmiel pangilinan how to use vlc
Amiel pangilinan how to use vlcAmiel Pangilinan
 
A GREATER Halifax: 2011-16 Economic Strategy for Halifax
A GREATER Halifax: 2011-16 Economic Strategy for HalifaxA GREATER Halifax: 2011-16 Economic Strategy for Halifax
A GREATER Halifax: 2011-16 Economic Strategy for HalifaxHalifax Partnership
 
Boris Chan - FITC SCREENS - Becoming Social By Default on Mobile
Boris Chan - FITC SCREENS - Becoming Social By Default on MobileBoris Chan - FITC SCREENS - Becoming Social By Default on Mobile
Boris Chan - FITC SCREENS - Becoming Social By Default on MobileBoris Chan
 
How to use spagepark billing
How to use spagepark billingHow to use spagepark billing
How to use spagepark billingAmiel Pangilinan
 
CESSI en Information Technology - Exportar conocimiento, la clave para crecer
CESSI en Information Technology - Exportar conocimiento, la clave para crecerCESSI en Information Technology - Exportar conocimiento, la clave para crecer
CESSI en Information Technology - Exportar conocimiento, la clave para crecerCESSI ArgenTIna
 
Best of the web ms.hs
Best of the web ms.hsBest of the web ms.hs
Best of the web ms.hsLeah Vestal
 
My name is, max p.
My name is, max p.My name is, max p.
My name is, max p.PearsallMax
 
Deloitte: The Future of Productivity
Deloitte: The Future of Productivity Deloitte: The Future of Productivity
Deloitte: The Future of Productivity Halifax Partnership
 
Beyond xUnit example-based testing: property-based testing with ScalaCheck
Beyond xUnit example-based testing: property-based testing with ScalaCheckBeyond xUnit example-based testing: property-based testing with ScalaCheck
Beyond xUnit example-based testing: property-based testing with ScalaCheckFranklin Chen
 
Mission mercury
Mission mercuryMission mercury
Mission mercuryLisa Baird
 

Viewers also liked (20)

Pkbm ekonomi 11 01
Pkbm ekonomi 11 01Pkbm ekonomi 11 01
Pkbm ekonomi 11 01
 
Tarzan profile 英文
Tarzan profile 英文Tarzan profile 英文
Tarzan profile 英文
 
#myHFXpledge
#myHFXpledge#myHFXpledge
#myHFXpledge
 
The Image Of Den helder
The Image Of Den helderThe Image Of Den helder
The Image Of Den helder
 
Amiel pangilinan how to use vlc
Amiel pangilinan how to use vlcAmiel pangilinan how to use vlc
Amiel pangilinan how to use vlc
 
A GREATER Halifax: 2011-16 Economic Strategy for Halifax
A GREATER Halifax: 2011-16 Economic Strategy for HalifaxA GREATER Halifax: 2011-16 Economic Strategy for Halifax
A GREATER Halifax: 2011-16 Economic Strategy for Halifax
 
MY PROFILE
MY PROFILEMY PROFILE
MY PROFILE
 
Boris Chan - FITC SCREENS - Becoming Social By Default on Mobile
Boris Chan - FITC SCREENS - Becoming Social By Default on MobileBoris Chan - FITC SCREENS - Becoming Social By Default on Mobile
Boris Chan - FITC SCREENS - Becoming Social By Default on Mobile
 
How to use spagepark billing
How to use spagepark billingHow to use spagepark billing
How to use spagepark billing
 
Halifax Index 2012 Presentation
Halifax Index 2012 PresentationHalifax Index 2012 Presentation
Halifax Index 2012 Presentation
 
CESSI en Information Technology - Exportar conocimiento, la clave para crecer
CESSI en Information Technology - Exportar conocimiento, la clave para crecerCESSI en Information Technology - Exportar conocimiento, la clave para crecer
CESSI en Information Technology - Exportar conocimiento, la clave para crecer
 
Gic2012 aula7-ingles
Gic2012 aula7-inglesGic2012 aula7-ingles
Gic2012 aula7-ingles
 
Best of the web ms.hs
Best of the web ms.hsBest of the web ms.hs
Best of the web ms.hs
 
Ludmi y mika
Ludmi y mikaLudmi y mika
Ludmi y mika
 
My name is, max p.
My name is, max p.My name is, max p.
My name is, max p.
 
Deloitte: The Future of Productivity
Deloitte: The Future of Productivity Deloitte: The Future of Productivity
Deloitte: The Future of Productivity
 
Beyond xUnit example-based testing: property-based testing with ScalaCheck
Beyond xUnit example-based testing: property-based testing with ScalaCheckBeyond xUnit example-based testing: property-based testing with ScalaCheck
Beyond xUnit example-based testing: property-based testing with ScalaCheck
 
Mission mercury
Mission mercuryMission mercury
Mission mercury
 
Summit on Youth in NS Economy
Summit on Youth in NS EconomySummit on Youth in NS Economy
Summit on Youth in NS Economy
 
Pres
PresPres
Pres
 

Similar to Influence of Timeline and Named-entity Components on User Engagement

Evaluation in Audio Music Similarity
Evaluation in Audio Music SimilarityEvaluation in Audio Music Similarity
Evaluation in Audio Music SimilarityJulián Urbano
 
Measuring the usefulness of Knowledge Organization Systems in Information Ret...
Measuring the usefulness of Knowledge Organization Systems in Information Ret...Measuring the usefulness of Knowledge Organization Systems in Information Ret...
Measuring the usefulness of Knowledge Organization Systems in Information Ret...GESIS
 
Introduction to Usability Testing for Survey Research
Introduction to Usability Testing for Survey ResearchIntroduction to Usability Testing for Survey Research
Introduction to Usability Testing for Survey ResearchCaroline Jarrett
 
Effect of Computer-Based Testing on Candidates
Effect of Computer-Based Testing on CandidatesEffect of Computer-Based Testing on Candidates
Effect of Computer-Based Testing on CandidatesAssessment Systems
 
evaluation technique uni 2
evaluation technique uni 2evaluation technique uni 2
evaluation technique uni 2vrgokila
 
Mustafa Degerli - 2012 - SEPG EUROPE 2012 - Poster - Factors Influencing the ...
Mustafa Degerli - 2012 - SEPG EUROPE 2012 - Poster - Factors Influencing the ...Mustafa Degerli - 2012 - SEPG EUROPE 2012 - Poster - Factors Influencing the ...
Mustafa Degerli - 2012 - SEPG EUROPE 2012 - Poster - Factors Influencing the ...Dr. Mustafa Değerli
 
Quantitative & Qualitative Data Collection.pptx
Quantitative & Qualitative Data Collection.pptxQuantitative & Qualitative Data Collection.pptx
Quantitative & Qualitative Data Collection.pptxminervainez1
 
Evaluating Systems Change
Evaluating Systems ChangeEvaluating Systems Change
Evaluating Systems ChangeNoel Hatch
 
Usability evaluation methods (part 2) and performance metrics
Usability evaluation methods (part 2) and performance metricsUsability evaluation methods (part 2) and performance metrics
Usability evaluation methods (part 2) and performance metricsAndres Baravalle
 
Analytic emperical Mehods
Analytic emperical MehodsAnalytic emperical Mehods
Analytic emperical MehodsM Surendar
 
Usability of Online Instruction
Usability of Online InstructionUsability of Online Instruction
Usability of Online InstructionMichael Wilder
 
Botor_project_research_methodology_2016
Botor_project_research_methodology_2016Botor_project_research_methodology_2016
Botor_project_research_methodology_2016Shayne Botor
 
Online Learning to Rank
Online Learning to RankOnline Learning to Rank
Online Learning to Rankewhuang3
 
Module-7-Descriptive Research-survey.pdf
Module-7-Descriptive Research-survey.pdfModule-7-Descriptive Research-survey.pdf
Module-7-Descriptive Research-survey.pdfVikramjit Singh
 

Similar to Influence of Timeline and Named-entity Components on User Engagement (20)

Evaluation in Audio Music Similarity
Evaluation in Audio Music SimilarityEvaluation in Audio Music Similarity
Evaluation in Audio Music Similarity
 
Measuring the usefulness of Knowledge Organization Systems in Information Ret...
Measuring the usefulness of Knowledge Organization Systems in Information Ret...Measuring the usefulness of Knowledge Organization Systems in Information Ret...
Measuring the usefulness of Knowledge Organization Systems in Information Ret...
 
Introduction to Usability Testing for Survey Research
Introduction to Usability Testing for Survey ResearchIntroduction to Usability Testing for Survey Research
Introduction to Usability Testing for Survey Research
 
Effect of Computer-Based Testing on Candidates
Effect of Computer-Based Testing on CandidatesEffect of Computer-Based Testing on Candidates
Effect of Computer-Based Testing on Candidates
 
Ai in hrm
Ai in hrmAi in hrm
Ai in hrm
 
evaluation technique uni 2
evaluation technique uni 2evaluation technique uni 2
evaluation technique uni 2
 
Other metrics
Other metricsOther metrics
Other metrics
 
HM404 Ab120916 ch06
HM404 Ab120916 ch06HM404 Ab120916 ch06
HM404 Ab120916 ch06
 
Mustafa Degerli - 2012 - SEPG EUROPE 2012 - Poster - Factors Influencing the ...
Mustafa Degerli - 2012 - SEPG EUROPE 2012 - Poster - Factors Influencing the ...Mustafa Degerli - 2012 - SEPG EUROPE 2012 - Poster - Factors Influencing the ...
Mustafa Degerli - 2012 - SEPG EUROPE 2012 - Poster - Factors Influencing the ...
 
Medlars
MedlarsMedlars
Medlars
 
E3 chap-09
E3 chap-09E3 chap-09
E3 chap-09
 
Quantitative & Qualitative Data Collection.pptx
Quantitative & Qualitative Data Collection.pptxQuantitative & Qualitative Data Collection.pptx
Quantitative & Qualitative Data Collection.pptx
 
Evaluating Systems Change
Evaluating Systems ChangeEvaluating Systems Change
Evaluating Systems Change
 
Usability evaluation methods (part 2) and performance metrics
Usability evaluation methods (part 2) and performance metricsUsability evaluation methods (part 2) and performance metrics
Usability evaluation methods (part 2) and performance metrics
 
Analytic emperical Mehods
Analytic emperical MehodsAnalytic emperical Mehods
Analytic emperical Mehods
 
Usability of Online Instruction
Usability of Online InstructionUsability of Online Instruction
Usability of Online Instruction
 
Botor_project_research_methodology_2016
Botor_project_research_methodology_2016Botor_project_research_methodology_2016
Botor_project_research_methodology_2016
 
Chapter Eight Quantitative Methods
Chapter Eight Quantitative MethodsChapter Eight Quantitative Methods
Chapter Eight Quantitative Methods
 
Online Learning to Rank
Online Learning to RankOnline Learning to Rank
Online Learning to Rank
 
Module-7-Descriptive Research-survey.pdf
Module-7-Descriptive Research-survey.pdfModule-7-Descriptive Research-survey.pdf
Module-7-Descriptive Research-survey.pdf
 

More from Roi Blanco

From Queries to Answers in the Web
From Queries to Answers in the WebFrom Queries to Answers in the Web
From Queries to Answers in the WebRoi Blanco
 
Entity Linking via Graph-Distance Minimization
Entity Linking via Graph-Distance MinimizationEntity Linking via Graph-Distance Minimization
Entity Linking via Graph-Distance MinimizationRoi Blanco
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big DataRoi Blanco
 
Mining Web content for Enhanced Search
Mining Web content for Enhanced Search Mining Web content for Enhanced Search
Mining Web content for Enhanced Search Roi Blanco
 
Searching over the past, present and future
Searching over the past, present and futureSearching over the past, present and future
Searching over the past, present and futureRoi Blanco
 
Beyond document retrieval using semantic annotations
Beyond document retrieval using semantic annotations Beyond document retrieval using semantic annotations
Beyond document retrieval using semantic annotations Roi Blanco
 
Keyword Search over RDF Graphs
Keyword Search over RDF GraphsKeyword Search over RDF Graphs
Keyword Search over RDF GraphsRoi Blanco
 
Extending BM25 with multiple query operators
Extending BM25 with multiple query operatorsExtending BM25 with multiple query operators
Extending BM25 with multiple query operatorsRoi Blanco
 
Energy-Price-Driven Query Processing in Multi-center Web Search Engines
Energy-Price-Driven Query Processing in Multi-center WebSearch EnginesEnergy-Price-Driven Query Processing in Multi-center WebSearch Engines
Energy-Price-Driven Query Processing in Multi-center Web Search EnginesRoi Blanco
 
Effective and Efficient Entity Search in RDF data
Effective and Efficient Entity Search in RDF dataEffective and Efficient Entity Search in RDF data
Effective and Efficient Entity Search in RDF dataRoi Blanco
 
Caching Search Engine Results over Incremental Indices
Caching Search Engine Results over Incremental IndicesCaching Search Engine Results over Incremental Indices
Caching Search Engine Results over Incremental IndicesRoi Blanco
 
Finding support sentences for entities
Finding support sentences for entitiesFinding support sentences for entities
Finding support sentences for entitiesRoi Blanco
 

More from Roi Blanco (12)

From Queries to Answers in the Web
From Queries to Answers in the WebFrom Queries to Answers in the Web
From Queries to Answers in the Web
 
Entity Linking via Graph-Distance Minimization
Entity Linking via Graph-Distance MinimizationEntity Linking via Graph-Distance Minimization
Entity Linking via Graph-Distance Minimization
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Mining Web content for Enhanced Search
Mining Web content for Enhanced Search Mining Web content for Enhanced Search
Mining Web content for Enhanced Search
 
Searching over the past, present and future
Searching over the past, present and futureSearching over the past, present and future
Searching over the past, present and future
 
Beyond document retrieval using semantic annotations
Beyond document retrieval using semantic annotations Beyond document retrieval using semantic annotations
Beyond document retrieval using semantic annotations
 
Keyword Search over RDF Graphs
Keyword Search over RDF GraphsKeyword Search over RDF Graphs
Keyword Search over RDF Graphs
 
Extending BM25 with multiple query operators
Extending BM25 with multiple query operatorsExtending BM25 with multiple query operators
Extending BM25 with multiple query operators
 
Energy-Price-Driven Query Processing in Multi-center Web Search Engines
Energy-Price-Driven Query Processing in Multi-center WebSearch EnginesEnergy-Price-Driven Query Processing in Multi-center WebSearch Engines
Energy-Price-Driven Query Processing in Multi-center Web Search Engines
 
Effective and Efficient Entity Search in RDF data
Effective and Efficient Entity Search in RDF dataEffective and Efficient Entity Search in RDF data
Effective and Efficient Entity Search in RDF data
 
Caching Search Engine Results over Incremental Indices
Caching Search Engine Results over Incremental IndicesCaching Search Engine Results over Incremental Indices
Caching Search Engine Results over Incremental Indices
 
Finding support sentences for entities
Finding support sentences for entitiesFinding support sentences for entities
Finding support sentences for entities
 

Recently uploaded

Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 

Recently uploaded (20)

Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 

Influence of Timeline and Named-entity Components on User Engagement

  • 1. Influence of Timeline and Named-entity Components on User Engagement Yashar Moshfeghi1, Michael Matthews2, Roi Blanco2, Joemon M. Jose1 1 School of Computing Science, University of Glasgow, Glasgow, UK 2 Yahoo! Labs, Barcelona, Spain Yashar.Moshfeghi@glasgow.ac.uk ECIR 2013, Moscow, Russia
  • 2. Outline • User Engagement • Prediction of User-centred metrics • Evaluation Methodology • Results • Conclusions
  • 3.
  • 5. Time Named-Entity a Cranfield-style paradigm user engagement
  • 6. Research Question • We aim to answer the following research question: – “can timeline and named-entity components improve user engagement in the context of a news retrieval system?”
  • 7. Multi-faceted concept: emotional, cognitive and behavioural Subjective measures (O’Brien and Toms): focused attention, aesthetics, perceived usability, endurability, novelty, involvement Objective measures: Subjective Perception of Time
  • 8. An increase of information-rich user experiences in the search realm (logged interaction data) Prediction of user preferences for web search results Prediction of user-centred metrics of an IIR system Build search applications in which the layout and elements displayed adapt to the needs of the user or context
  • 9. Submit Query Retrieved Results The News System Anatomy Timeline Component Entity Component
  • 10. Experimental Methodology • Design – A ‘within-subjects’ design was used in this study. • The independent variable – the system (with two levels: baseline, enriched), – controlled by the viewing timeline and named-entity components (enriched) or hiding them (baseline). • The dependent variables were: – (i) user engagement • (involvement, novelty, endurability, usability, aesthetics, attention) – (ii) system preference
  • 11. Experimental Methodology - Task • We used a simulated information need situation. • The simulated task was defined as follow: – “Imagine you are reading today’s news events and one of them is very important or interesting to you, and you want to learn more. Find as much relevant news information as possible so that you can construct an overall (big) picture of the event and also cover the important parts of it.”
  • 12. Experimental Methodology - Task • The search task was presented twice to each participant with different search topics.
  • 13. • Advantages: • Reduced monetary cost • Ease of engaging a large number of users in the study. • Disadvantages • Low quality data and in turn, the challenge is to improve and assure data quality. • Need for techniques to minimise • spammers, • multiple account workers • Lazy worker
  • 14. • Multiple response technique for our questionnaire • known to be very effective and cost efficient to improve the data quality • Browser cookies were used to guard against multiple account workers • To avoid spammers (as recommended in the literature), • Population screening based on location (United States) • HIT approval rate greater than 95% • To reduce attrition, demographic questions were put at the beginning of the experimental procedure.
  • 15. Experimental Methodology - Procedure • Participants were instructed that the experiment would take approximately 60 minutes to complete • They were informed that they could only participate in this study once • Payment for study completion was $5 (The total cost of the evaluation was $510 ) • Each participant had to complete two search tasks, one for each level of independent variable (i.e. baseline and enriched system)
  • 17. Experimental Methodology • We considered six dimensions introduced by O’Brien et al.: – focused attention, aesthetics, perceived usability, endurability, novelty, and involvement • The different dimensions were measured through a number of forced-choice type questions. • A 5-point scale respond (strong disagree to strong agree) – “Based on this news retrieval experience, please indicate whether you agree or disagree with each statement”. • In total, in each post-search questionnaire we have asked 31 questions related to user engagement – adapted from O’Brien et al. – randomised its assignment to participants
  • 18. Experimental Methodology • Pilot Studies: – We run three pilot studies using 10 participants. – Other changes consisted of • modifications to the questionnaires to clarify questions, • modifications to the system to improve logging capabilities • improvements to the training video. – After the final pilot, it was determined that • the participants were able to complete the user study without problems • the system was correctly logging the interaction data.
  • 19. Results Analysis – Data Preprocessing • To ensure the availability of relevant documents – two evaluators manually calculated • the Precision@1, 5, and 10 • for all the topics • a set of queries issued by the participants. – Precision@1, 5 and 10 were 0.85, 0.84, and 0.86 respectively, – Judges had a very high inter-annotator agreement with Kappa > 0.9. – This indicates that the queries the users issued into the system had good coverage and the ranking was accurate enough.
  • 20. Results Analysis – Data Preprocessing • 63 out of 92 users successfully completed the study. • A relatively even split by condition, with 47% in the scenario where group 1, and 53% conversely. • We removed the – incomplete surveys – participants who repeated the study – participants who completed the survey incorrectly (based on task conditions) • they had to visit at least three relevant documents for a given topic, and • the issued queries should be related to the selected topic – identifying suspect attempts by checking • the extremely short task durations • comments that are repeated verbatim across multiple open-ended questions
  • 21. Results Analysis – Demographic Info. • 126 search sessions that were successfully carried out by 63 participants. • The 63 participants – female=46%, male=54%, prefer not to say=0% – were mainly under the age of 41 (84%) • with the largest group between the ages of 24-29 (33.3%). • Participants had – a high school diploma or equivalent (11.11%), – associates degree (15.87%), – graduate degree (11.11%), – bachelor (31.7%) or – some college degree (30.15%). • They were – primarily employed for a company or organisation (39.68%), – though there were a number of self-employed (22.22%), – students (11.11%), and – not employed (26.98%).
  • 22. Results Analysis Enriched ** Baseline Enriched * Baseline Enriched * Baseline Enriched Baseline Enriched ** Baseline Enriched Baseline 1 2 3 4 5 User Engagement Involvement Novelty Endurablility Usability Aesthetics Attention
  • 23. Results Analysis • We did not find any statistically significant difference between the two systems for Subjective Perception of Time metric – with mean and standard deviation of 10.03, ± 5.22, and 10.12, ±4.95, for the baseline and enriched system respectively
  • 24. Results Analysis - System Preference • the exit questionnaire posed the question – “Please select the system you preferred? (answer: 1: First System, 2: Second System)” – and overall, 76% of the participants preferred the enriched system better than the baseline system.
  • 25. Prediction of User-centred Metrics: • The demographic features – participants’ age, gender, education, and occupation • The search habits features – the number of years they have used web search and online news systems, – the frequency they engaged in different news search intention such as browsing, navigating, searching, etc. – the news domain they are interested in • The interaction features (derived from log information) – the total time they spent on each component and to complete a task, – the number of clicks, retrieved documents, queries, – the number of times they used the previous/next button, and other functionality of the systems
  • 26. Prediction of User-centred Metrics: • We chose – the System Preference question – all the user engagement dimensions. • For System Preference question, – we have a binary class of “−1” indicating the participant did not prefer the enriched system and “+1” otherwise. • For the user engagement dimensions, – we used the final value calculated by aggregating all the questions related to each dimension – We transformed the values for each dimension to binary by mapping 4-5 to “+1” and otherwise to “−1”
  • 27. Prediction of User-centred Metrics: • We learned a model to discriminate between the two classes using – SVMs trained with a polynomial kernel, – based on our analysis in the majority of cases, outperformed other SVM kernels (linear, and radial-basis). • We also tried other models such as bayesian logistic regression and decision trees but they underperformed with respect to SVMs.
  • 28. Prediction of User-centred Metrics: • classification performance – averaged over the 63 participants of the study – using 10-fold cross validation  Results indicate that ◦ for all the user engagement dimensions (excluding focused attention), the combination of all features leads to the best prediction accuracy ◦ Regarding the system preference question, user-system interaction features determine with high accuracy the participants’ preference of a system (over 87%).
  • 29. Summary • Given the competitiveness of the market on the web, applications nowadays are designed to be both efficient and engaging. • Thus, a new line of research is to identify system features that steer user engagement. • This work studies the interplay between user engagement and retrieval of named-entities and time, in an interactive search scenario. • We devised an experimental setup that exposed our participants on two news systems, one with a timeline and named-entity components and one without. • Two search tasks were performed by the participants and through questionnaires, user engagement was analysed.
  • 30. Conclusions • Overall findings based on user questionnaires, show that substantial user engagement improvements can be achieved by integrating time and entity information into the system. • Further analysis of the results show that the majority of the participants preferred the enriched system over the baseline system. • We also investigated the hypothesis that user-centred metrics can be predicted in an IIR scenario given the participants’ demographics and search habits, and/or interaction with the system. • The results obtained across all the user engagement dimensions as well as System Preference question, supported our hypothesis. • As future work, we will continue to study how user interactions can be leveraged to predict satisfaction measures and possibly build interfaces that adapt based on user interaction patterns.
  • 31. Acknowledgement: This work was partially supported by the EU FP7 LiMoSINe project (288024). This work was performed while intern at Yahoo! Research lab in

Editor's Notes

  1. “how and why people develop a relationship with technology and integrate it into their lives.”
  2. Thus, a new line of research is to identify system features that steer user engagement, which has become a key concept in designing user-centred web applications. Given the ubiquity of the choices on the web the competitiveness of the market, applications nowadays are designed to not only be efficient, effective, or satisfying but also engaging.
  3. There has been great attention on retrieving named entities, and using the time dimension for retrieval. Those approaches are evaluated exclusively focusing on a Cranfield-style paradigm, with little or no attention on user input, context and interaction. However, it is difficult to correlate user engagement with traditional retrieval metrics such as MAP. This problem becomes exacerbated when the user has to cope with content-rich user interfaces that include different sources of evidence and information nuggets of a different nature. This work studies the interplay between user engagement and retrieval of named-entities and time, in an interactive search scenario.
  4. User engagement is a multi-faceted concept associated with the emotional, cognitive and behavioural connection of user with a technological resource at any point of interaction period [1]. O’Brien and Toms defined a model characterising the key indicative dimensions of user engagement: focused attention, aesthetics, perceived usability, endurability, novelty, involvement. These factors elaborate the user engagement notion over the emotional, cognitive and behavioural aspects. Subjective and objective measures are proposed to evaluate user engagement [1], the former being considered to be the most for evaluation. We use the subjective measures proposed by O’Brien et al. [3]. Objective measures include subjective perception of time (SPT) and information retrieval metrics among others. SPT is calculated by asking participants to estimate the time taken to complete their searching task, which is compared with the actual time [1].
  5. Given the increase of information-rich user experiences in the search realm, we leverage the amount of logged interaction data. Prediction of user preferences for web search results based on user interaction with the system has been studied previously. In this work, we try to predict user-centred metrics of an IIR system rather than user preferences for its search results. Our positive findings could steer research into building search applications in which the layout and elements displayed adapt to the needs of the user or context.
  6. To provide a use case for our investigation, we experiment with a news search system, which encourages interaction due to the information overload problem associated with the news domain. One way to facilitate user interaction in such scenarios is to develop new methods of accessing such electronic resources. For this purpose, we carefully varied the components of a news retrieval system page. We experimented with a timeline and named-entity component (enriched) or hiding them (baseline), while keeping everything else fixed, and tested whether adding these components can help improve user engagement. To study the predictability of the user centred metrics, we repeat our interactive experiments at two different points in time, with a tightly controlled setting. As an outcome of those experiments, we conclude that the user-centred metrics can be predicted with high accuracy given their interaction with the system and their demographics and search habits are provided as an input.
  7. We introduced a short cover story that helped us describe to our participants the source of their information need, the environment of the situation and the problem to be solved. 2) This facilitated a better understanding of the search objective and, in addition, introduced a layer of realism, while preserving well-defined relevance criteria.
  8. We prepared a number of search topics that covered a variety of contexts, from entertainment and sport to crime and political issues, in order to capture participants’ interests as best as possible.
  9. We make use of Amazon’s Mechanical Turk (M-Turk), as our crowdsourcing platform.
  10. Particular attention was paid in our experimental design to help motivate participants to respond honestly to the self-report questions and take the tasks seriously.
  11. - though they would be given 120 minutes between the time they accepted and submitted the HIT assignment. ------- - they would not be paid if they had participated in any of the previous pilot studies. ------- - Given the findings of Mason and Watts, we expect the increase in wage just to change the rate of incoming workers to accept the HITS, and not affect their performance. - The total cost of the evaluation was $510 including the cost of the pilot studies and some of the rejected participants, which we consider to be cost-effective. ------- - The order in which each participant was introduced to the systems was randomised to soften any bias, e.g. the effect of task and/or fatigue. - Subsequently, participants were assigned to one of two systems (baseline or enriched) by clicking the link to the external survey. -------
  12. At the beginning of the experiment, the participants were introduced to an entry questionnaire, Demographic information, Previous experience with online news, in particular, browsing and search habits to estimate their familiarity with news retrieval systems and their related tasks. At the beginning of each task, the participants completed a Pre-search questionnaire, to understand the reason why a particular topic was selected At the end of each task, the participants completed a post-search questionnaire to elicit subject’s viewpoint on all user engagement dimensions. Finally, an exit questionnaire was introduced at the end of the study. In this questionnaire we gathered information about the user study in general: which system and task they preferred and why and their general comments.
  13. For example, involvement was measured by adapting three questions from [3]: (1) I was really drawn into my news search task. (2) I felt involved in this news search task. (3) This news search experience was fun.
  14. In each iteration, a number of changes were made to the system based on feedback from the pilot study. For example, for each dimension we computed Cronbach’s alpha to evaluate the reliability of the questions adopted for each dimension. We finalised the questions of each dimension by confirming their Cronbach’s alpha value (> 0.8). Cronbach’s alpha is used as a measure of the internal consistency of a psychometric test score for a sample of subjects.
  15. This is further explained by the fact that the topics were timely and most news providers including in the index contained articles related to them.
  16. after either abandoning it part-way through or had completed it once before.
  17. Figures 2 shows the box plot for the user engagement analysis, for the two systems (baseline and enriched), based on the post-study questionnaire. The box plot reports, over the data gathered from 63 participants, five important pieces of information namely: the minimum, first, second (median), third, and maximum quartiles. We performed a paired Wilcoxon Mann-Whitney test between measures obtained for the enriched system for each user to check the significance of the difference with the baseline system. We use (*) and (**) to denote the fact that a dimension had results different from that of the baseline with the confidence levels (p < 0.05) and (p < 0.01) respectively. As shown in Figure 2, the enriched system has a better median and/or mean and lower variance than the baseline system across all dimensions. This shows that substantial user engagement improvements can be achieved by integrating time and entity information into the system. The findings also show that participants are significantly more engaged both from cognition (considering endurability and involvement) and emotion (considering the aesthetics and novelty) aspects when time and entity dimensions of the information space are provided (i.e. enriched system).
  18. (we refer to as System Preference)
  19. We investigate whether user engagement and in a more general sense user-centred metrics can be predicted, given the participants’ demographic and search habits information, and/or their interaction with the system.
  20. taken from exit and post-search questionnaire respectively
  21. Remarkably, the machine learned model is able to predict with a low error all of the user and system metrics. Given these positive findings, it is possible to move towards personalised search applications in which the layout and elements displayed adapt to the needs of the user or context which in turn results in increasing the users’ engagement as well as their preference of the system.