SlideShare a Scribd company logo
1 of 36
Download to read offline
SOCIAL 
SEMANTIC WEB 
AND 
CROWDSOURCING 
ELENA SIMPERL, UNIVERSITY OF 
SOUTHAMPTON, UK 
E.SIMPERL@SOTON.AC.UK, @ESIMPERL 
WITH CONTRIBUTIONS FROM “COMBINING THE SOCIAL AND THE SEMANTIC WEB”, TUTORIAL @ESWC2011 BY 
SHEILA KINSELLA, DERI, IE AND DENNY VRANDECIC, KIT, DE, 
HTTP://SEMANTICWEB.ORG/WIKI/COMBINING_THE_SOCIAL_AND_THE_SEMANTIC_WEB 
03.09.14 
1
FUNDAMENTALS OF THE SOCIAL WEB 
03.09.14 
2
THE SOCIAL WEB 
Software deployed over the Web, designed and developed to 
foster and leverage social interaction 
Users are connected via common digital artifacts they share 
and through social ties 
• Sharing platforms such as flickr 
• Collaborative platforms such Wikipedia, discussion forums, 
Quora, Groupon etc 
• Social networks such as Facebook combining both aspects 
03.09.14 
3
DEVELOPMENT 
Early ‘90s 
• Read-only, static Web sites 
• Interaction via posts on personal Web 
sites possibly responding to someone 
else’s statements 
Mid '90s 
• Read-write applications 
• Gradual increase in interaction between 
site owners and their readers/customers 
through various communication features 
Late ‘90s to present 
• First social sites, increasingly ubiquitous 
deployment of social features in non-social 
Web sites 
4
CHALLENGES OF 
SOCIAL SOFTWARE 
Many isolated communities 
• Data silos 
Challenges 
• Data cannot be shared and reused easily 
• No control over the management and transfer of 
personal data 
• User engagement and incentives engineering 
5 
03.09.14
SEMANTIC SOCIAL SOFTWARE 
03.09.14 
6
THE ROLE OF SEMANTIC 
TECHNOLOGIES 
SEMANTICS FOR… 
Advanced 
information 
management 
Data interoperability 
Data privacy 
Data liberation 
SCENARIOS 
1. Semantic technologies 
as integral part of the 
functionality of social 
software, enabling 
interoperability 
2. Enhanced information 
management through 
the usage of machine-processable 
semantics 
03.09.14 
7
ONTOLOGIES 
FOAF (Friend-of-a-Friend) 
SIOC (Semantically-Interlinked Online Communities) 
OPO (Online Presence Ontology) 
Tagging ontologies 
03.09.14 
8
OPEN GRAPH 
Represent Web content in a social 
graph in an interoperable way 
Design decisions, see 
http://www.scribd.com/doc/ 
30715288/The-Open-Graph-Protocol- 
Design-Decisions 
Used by Facebook (‘stories’), 
Google (snippets), IMDb etc. 
Facebook: actors, apps, objects 
with metadata to create stories 
• Example: Elena has finished 
reading ‘The Economist’, an 
object of type Newspaper 
• Types with attributes, extensions 
allowed 
• Pre-defined and custom actions 
on objects 
9 
http://ogp.me/ 
Image from 
https://developers.facebook.com/docs/ 
opengraph/creating-custom-stories 
03.09.14
SOFTWARE 
PLATFORMS 
• Drupal (RDFa), https://www.drupal.org/ 
• Semantic MediaWiki, https://semantic-mediawiki.org/ 
• Semantic blogs, http://www.zemanta.com/ 
03.09.14 
10
11 
COLLABORATIVE TECHNOLOGIES FOR THE 
SEMANTIC WEB 
03.09.14
BASICS 
Idea: leverage Social Web technologies and platforms to 
create and manage Semantic Web content 
Two classes of approaches 
• Mining user contributions, e.g., turning folksonomies into 
OWL ontologies, not part of this tutorial 
• Crowdsource Semantic Web tasks, e.g., GWAPs, paid 
microtasks, campaigns, contents 
03.09.14 
12
CROWDSOURCING: 
PROBLEM SOLVING VIA 
OPEN CALLS 
"Simply defined, crowdsourcing represents the act of a 
company or institution taking a function once performed by 
employees and outsourcing it to an undefined (and generally 
large) network of people in the form of an open call. This can 
take the form of peer-production (when the job is performed 
collaboratively), but is also often undertaken by sole 
individuals. The crucial prerequisite is the use of the open 
call format and the large network of potential .“ 
[Howe, 2006] 
13 
03.09.14
THE MANY FACES OF 
CROWDSOURCING 
14 
Image from http://03.09.14 dailycrowdsource.com/
RELATED: HUMAN 
COMPUTATION 
Outsourcing tasks that machines find difficult to solve 
to humans 
03.09.14 
15
IN THIS TALK: 
CROWDSOURCING DATA 
CITATION 
‘The USEWOD experiment ‘ 
• Goal: collect information about 
the usage of Linked Data sets 
in research papers 
• Explore different crowdsourcing 
methods 
• Online tool to link publications 
to data sets (and their versions) 
• 1st feasibility study with 10 
researchers in May 2014 
03.09.14 
http://prov.usewod.org/ 
16 
9650 publications
HOW TO 
CROWDSOURCE 
What: Goal 
Who: Contributors 
How: Process 
Why: Motivation and 
incentives 
http:// 
sloanreview.mit.edu/ 
article/the-collective-intelligence- 
genome/ 
17 
Image at 
http:// 
jasonbiv.files.wordpress.com/ 
2012/08/ 
collectiveintelligence_genome 
.jpg
EXAMPLE: CITIZEN SCIENCE 
WHAT IS OUTSOURCED 
• Object recognition, labeling, categorization 
in media content 
WHO IS THE CROWD 
• Anyone 
HOW IS THE TASK OUTSOURCED 
• Highly parallelizable tasks 
• Every item is handled by multiple 
annotators 
• Every annotator provides an answer 
• Consolidated answers solve scientific 
problems 
WHY DO PEOPLE CONTRIBUTE 
• Fun, interest in science, desire to 
contribute to science 
03.09.14 
18 
zooniverse.org
WHY 
Money (rewards) 
Love 
Glory 
Love and glory reduce costs 
Money and glory make the crowd 
move faster 
Intrinsic vs extrinsic motivation 
Rewards/incentives influence 
motivation. 
• They are granted by an external 
‘judge’ 
Successful volunteer 
crowdsourcing is difficult to 
predict or replicate 
• Highly context-specific 
• Not applicable to arbitrary tasks 
Reward models often easier to 
study and control (if performance 
can be reliably measured) 
• Not always easy to abstract from 
social aspects (free-riding, social 
pressure) 
• May undermine intrinsic motivation 
19
WHY (2) 
In our example: 
Who benefits from the results 
Who owns the results 
Money 
• Different models: pay-per-time, pay-per-unit, winner-takes-it-all 
• Define the rewards, analyze trade-offs accuracy vs costs, avoid 
spam 
Love 
• Authors, researchers, Dbpedia community, open access 
advocates, publishers, casual gamers etc. 
Glory 
• Competitions, mentions, awards 
03.09.14 
20 
Image from Nature.com
WHAT 
In general: something you cannot do (fully) automatically 
In our example: annotating research papers with data set 
information 
• Alternative representations of the domain 
• What if the paper is not available? 
• What if the domain is not known in advance or is infinite? 
• Do we know the list of potential answers? 
• Is there only one correct solution to each atomic task? 
• How many people would solve the same task? 
03.09.14 
21
WHO 
In general 
• An open (‘unknown’) crowd 
• Scale helps 
• Qualifications might be a pre-requisite 
In our example 
• People who know the papers or the data sets 
• Experts in the (broader) field 
• Casual gamers 
• Librarians 
• Anyone (knowledgeable of English, with a computer/cell phone 
etc) 
• Combinations thereof 
03.09.14 
22
HOW: PROCESS 
In general: 
• Explicit vs. implicit participation 
• Affects motivation 
See also [Quinn & Bederson, 2012] 
• Tasks broken down into smaller units undertaken in parallel by different 
people 
• Does not apply to all forms of crowdsourcing 
• Coordination required to handle cases with more complex workflows 
• Sometimes using different crowds for workflow parts 
• Example: text summarization 
• Task assignment to match skills, preferences, and contribution history 
• Example: random assignment vs meritocracy vs full autonomy 
• Partial or independent answers consolidated and aggregated into 
complete solution 
• Example: challenges (e.g., Netflix) vs aggregation (e.g., Wikipedia) 
03.09.14 
23
HOW: PROCESS (2) 
In our example: 
• Use the data collected here to train a IE algorithm 
• Use paid microtask workers to go a first screening, then expert crowd to sort 
out challenging cases 
• What if you have very long documents potentially mentioning different/ 
unknown data sets? 
• Competition via Twitter 
• ‘Which version of DBpedia does this paper use?’ 
• One question a day, prizes 
• Needs golden standard to bootstrap and redundancy 
• Involve the authors 
• Use crowdsourcing to find out Twitter accounts, then launch campaign on 
Twitter 
• Write an email to the authors… 
• Change the task 
• Which papers use Dbpedia 3.X? 
• Competition to find all papers 
03.09.14 
24
HOW: VALIDATION 
In general: 
• Solutions space closed vs. open 
• Redundancy vs. iterations/peer-review 
• Performance measurements/ground truth 
• Available and accepted 
• Automated techniques employed to predict accurate solutions 
In our example: 
• Domain is fairly restricted 
• Spam and obvious wrong answers can be detected easily 
• When are two answers the same? 
• Can there be more than one correct answer per question? 
• Redundancy may not be the final answer 
• Most people will be able to identify the data set, but sometimes the 
actual version is not trivial to reproduce 
• Make educated version guess based on time intervals and other features 
03.09.14 
25
03.09.14 
26 
EXAMPLES 
Image courtesy of M. Acosta
TASTE IT! TRY IT! 
Restaurant review Android app developed in the Insemtives project 
Uses Dbpedia concepts to generate structured reviews 
Uses mechanism design/gamification to configure incentives 
User study 
• 2274 reviews by 180 reviewers referring to 900 restaurants, using 5667 DPpedia concepts 
2500 
2000 
1500 
1000 
500 
03.09.14 
27 
https://play.google.com/store/apps/details?id=insemtives.android&hl=en 
0 
CAFE FASTFOOD PUB RESTAURANT 
Numer of reviews 
Number of semantic annotations (type of cuisine) 
Number of semantic annotations (dishes)
ONTOPRONTO 
GWAP ~ Game with a purpose 
Players‘ contributions are used to 
improve the results of technically 
challenging tasks 
OntoPronto 
• Task: map Wikipedia topics to 
classes of an ontology 
• Selection-agreement game: 
players have to agree on answers 
• Step 1: decide whether a topic is a 
class or an instance 
• Step 2: classify topic in the 
ontology 
03.09.14 
1 
2 
28
URBANOPOLY 
03.09.14 
29 
http://www.cefriel.com/en/urbanopoly-game/
See also [Sarasua, Simperl, Noy, ISWC2012] 
CROWDMAP 
Experiments using MTurk, CrowdFlower and established benchmarks 
Enhancing the results of automatic techniques 
Fast, accurate, cost-effective 
30 
03.09.14 
CartP 
301-­‐304 
100R50P 
Edas-­‐Iasted 
100R50P 
Ekaw-­‐Iasted 
100R50P 
Cmt-­‐Ekaw 
100R50P 
ConfOf-­‐Ekaw 
Imp 
301-­‐304 
PRECISION 
0.53 
0.8 
1.0 
1.0 
0.93 
0.73 
RECALL 
1.0 
0.42 
0.7 
0.75 
0.65 
1.0
See also [Acosta et al., ISWC 2013] 
DBPEDIA CURATION 
Find 
Contest 
Linked Data experts 
Difficult task 
Final prize 
Verify 
Microtasks 
Workers 
Easy task 
Micropayments 
TripleCheckMate 
[Kontoskostas2013] 
MTurk 
http://mturk.com 
31 
03.09.14
CROWDQ 
Understand the meaning of a keyword query 
Build a structured (SPARQL) query template 
Answer the query over Linked Open Data 
Gianluca Demartini, Beth Trushkowsky, Tim Kraska, and Michael Franklin. 
CrowdQ: Crowdsourced Query Understanding. In: 6th Biennial Conference 
on Innovative Data Systems Research (CIDR 2013) 03.09.14 
32
LODREFINE 
03.09.14 
33 
http://research.zemanta.com/crowds-to-the-rescue/
PROBLEMS AND 
CHALLENGES 
• What is feasible and how can tasks be optimally crowdsourced, e.g., translated into 
microtasks? 
• Examples: data quality assessment for technical and contextual features; subjective vs 
objective tasks (also in modeling); open-ended questions 
• What to show to users 
• Natural language descriptions of Linked Data/SPARQL 
• How much context 
• What form of rendering 
• How about links? 
• How to combine with automatic tools 
• Which results to validate 
• Low precision (no fun for gamers...) 
• Low recall (vs all possible questions) 
• How to embed crowdsourcing into an existing application 
• Tasks are fine granular, perceived as additional burden to the actual functionality 
• What to do with the resulting data? 
• Integration into existing practices 
• Vocabularies 
03.09.14 
34
FURTHER READING 
03.09.14 
35
E.SIMPERL@SOTON.AC.UK 
@ESIMPERL 
WWW.SOCIAM.ORG 
WWW.PLANET-DATA.EU 
03.09.14 
36

More Related Content

What's hot

The rise of platforms could see off the web
The rise of platforms could see off the webThe rise of platforms could see off the web
The rise of platforms could see off the webKen Chad Consulting Ltd
 
Social Web 2014: Final Presentations (Part II)
Social Web 2014: Final Presentations (Part II)Social Web 2014: Final Presentations (Part II)
Social Web 2014: Final Presentations (Part II)Lora Aroyo
 
Emerging Technologies for Libraries and Librarians, 2013
Emerging Technologies for Libraries and Librarians, 2013Emerging Technologies for Libraries and Librarians, 2013
Emerging Technologies for Libraries and Librarians, 2013Jennifer Baxmeyer
 
Lecture 7: How to STUDY the Social Web? (2014)
Lecture 7: How to STUDY the Social Web? (2014)Lecture 7: How to STUDY the Social Web? (2014)
Lecture 7: How to STUDY the Social Web? (2014)Lora Aroyo
 
Social Media Tools and Mobile Apps for Research and Publishing
Social Media Tools and Mobile Apps for Research and PublishingSocial Media Tools and Mobile Apps for Research and Publishing
Social Media Tools and Mobile Apps for Research and PublishingCheryl Peltier-Davis
 
Lecture 4: How can we MINE, ANALYSE & VISUALISE the Social Web? (2014)
Lecture 4: How can we MINE, ANALYSE & VISUALISE the Social Web? (2014)Lecture 4: How can we MINE, ANALYSE & VISUALISE the Social Web? (2014)
Lecture 4: How can we MINE, ANALYSE & VISUALISE the Social Web? (2014)Lora Aroyo
 
Social Software and Publishers - Gavin Bell - O'Reilly Tools of Change 2007
Social Software and Publishers - Gavin Bell - O'Reilly Tools of Change 2007Social Software and Publishers - Gavin Bell - O'Reilly Tools of Change 2007
Social Software and Publishers - Gavin Bell - O'Reilly Tools of Change 2007Gavin Bell
 
Using Social Media to Enhance Your Research Activities
Using Social Media to Enhance Your Research ActivitiesUsing Social Media to Enhance Your Research Activities
Using Social Media to Enhance Your Research Activitieslisbk
 
NM TIE 09 New Learning Communities: A Theoretical Framework
NM TIE 09 New Learning Communities: A Theoretical FrameworkNM TIE 09 New Learning Communities: A Theoretical Framework
NM TIE 09 New Learning Communities: A Theoretical FrameworkHolly Rae Bemis-Schurtz
 
VU University Amsterdam - The Social Web 2016 - Lecture 5
VU University Amsterdam - The Social Web 2016 - Lecture 5VU University Amsterdam - The Social Web 2016 - Lecture 5
VU University Amsterdam - The Social Web 2016 - Lecture 5Davide Ceolin
 
Metadata in a Crowd: Shared Knowledge Production
Metadata in a Crowd: Shared Knowledge ProductionMetadata in a Crowd: Shared Knowledge Production
Metadata in a Crowd: Shared Knowledge ProductionKevin Rundblad
 
Using Social Media to Promote 'Good News'
Using Social Media to Promote 'Good News'Using Social Media to Promote 'Good News'
Using Social Media to Promote 'Good News'lisbk
 
Social Media Tools for Administrators - CTEC 2016
Social Media Tools for Administrators - CTEC 2016Social Media Tools for Administrators - CTEC 2016
Social Media Tools for Administrators - CTEC 2016Cheryl Peltier-Davis
 
Acuril 2015 suriname davis, ramtahal, dwarika-bhagat
Acuril 2015 suriname  davis, ramtahal, dwarika-bhagatAcuril 2015 suriname  davis, ramtahal, dwarika-bhagat
Acuril 2015 suriname davis, ramtahal, dwarika-bhagatCheryl Peltier-Davis
 
A Rubric for Assessing the UX of Online Museum Collections: Preliminary Findi...
A Rubric for Assessing the UX of Online Museum Collections: Preliminary Findi...A Rubric for Assessing the UX of Online Museum Collections: Preliminary Findi...
A Rubric for Assessing the UX of Online Museum Collections: Preliminary Findi...craigmmacdonald
 
Providing Services to our Remote Users: Open Source Solutions
Providing Services to our Remote Users: Open Source SolutionsProviding Services to our Remote Users: Open Source Solutions
Providing Services to our Remote Users: Open Source SolutionsNicole C. Engard
 
User Experience at the European Bioinformatics Institute
User Experience at the European Bioinformatics Institute User Experience at the European Bioinformatics Institute
User Experience at the European Bioinformatics Institute Jennifer Cham
 
Designing with the user in mind: how user-centred design (UCD) can work for ...
Designing with the user in mind: how user-centred design (UCD) can work for ...Designing with the user in mind: how user-centred design (UCD) can work for ...
Designing with the user in mind: how user-centred design (UCD) can work for ...Jennifer Cham
 
How information spreads on social networks when unexpected events occur
How information spreads on social networks when unexpected events occurHow information spreads on social networks when unexpected events occur
How information spreads on social networks when unexpected events occurFarida Vis
 
Cascon 2016 Keynote: Disrupting Developer Productivity One Bot at a Time
Cascon 2016 Keynote: Disrupting Developer Productivity One Bot at a TimeCascon 2016 Keynote: Disrupting Developer Productivity One Bot at a Time
Cascon 2016 Keynote: Disrupting Developer Productivity One Bot at a TimeMargaret-Anne Storey
 

What's hot (20)

The rise of platforms could see off the web
The rise of platforms could see off the webThe rise of platforms could see off the web
The rise of platforms could see off the web
 
Social Web 2014: Final Presentations (Part II)
Social Web 2014: Final Presentations (Part II)Social Web 2014: Final Presentations (Part II)
Social Web 2014: Final Presentations (Part II)
 
Emerging Technologies for Libraries and Librarians, 2013
Emerging Technologies for Libraries and Librarians, 2013Emerging Technologies for Libraries and Librarians, 2013
Emerging Technologies for Libraries and Librarians, 2013
 
Lecture 7: How to STUDY the Social Web? (2014)
Lecture 7: How to STUDY the Social Web? (2014)Lecture 7: How to STUDY the Social Web? (2014)
Lecture 7: How to STUDY the Social Web? (2014)
 
Social Media Tools and Mobile Apps for Research and Publishing
Social Media Tools and Mobile Apps for Research and PublishingSocial Media Tools and Mobile Apps for Research and Publishing
Social Media Tools and Mobile Apps for Research and Publishing
 
Lecture 4: How can we MINE, ANALYSE & VISUALISE the Social Web? (2014)
Lecture 4: How can we MINE, ANALYSE & VISUALISE the Social Web? (2014)Lecture 4: How can we MINE, ANALYSE & VISUALISE the Social Web? (2014)
Lecture 4: How can we MINE, ANALYSE & VISUALISE the Social Web? (2014)
 
Social Software and Publishers - Gavin Bell - O'Reilly Tools of Change 2007
Social Software and Publishers - Gavin Bell - O'Reilly Tools of Change 2007Social Software and Publishers - Gavin Bell - O'Reilly Tools of Change 2007
Social Software and Publishers - Gavin Bell - O'Reilly Tools of Change 2007
 
Using Social Media to Enhance Your Research Activities
Using Social Media to Enhance Your Research ActivitiesUsing Social Media to Enhance Your Research Activities
Using Social Media to Enhance Your Research Activities
 
NM TIE 09 New Learning Communities: A Theoretical Framework
NM TIE 09 New Learning Communities: A Theoretical FrameworkNM TIE 09 New Learning Communities: A Theoretical Framework
NM TIE 09 New Learning Communities: A Theoretical Framework
 
VU University Amsterdam - The Social Web 2016 - Lecture 5
VU University Amsterdam - The Social Web 2016 - Lecture 5VU University Amsterdam - The Social Web 2016 - Lecture 5
VU University Amsterdam - The Social Web 2016 - Lecture 5
 
Metadata in a Crowd: Shared Knowledge Production
Metadata in a Crowd: Shared Knowledge ProductionMetadata in a Crowd: Shared Knowledge Production
Metadata in a Crowd: Shared Knowledge Production
 
Using Social Media to Promote 'Good News'
Using Social Media to Promote 'Good News'Using Social Media to Promote 'Good News'
Using Social Media to Promote 'Good News'
 
Social Media Tools for Administrators - CTEC 2016
Social Media Tools for Administrators - CTEC 2016Social Media Tools for Administrators - CTEC 2016
Social Media Tools for Administrators - CTEC 2016
 
Acuril 2015 suriname davis, ramtahal, dwarika-bhagat
Acuril 2015 suriname  davis, ramtahal, dwarika-bhagatAcuril 2015 suriname  davis, ramtahal, dwarika-bhagat
Acuril 2015 suriname davis, ramtahal, dwarika-bhagat
 
A Rubric for Assessing the UX of Online Museum Collections: Preliminary Findi...
A Rubric for Assessing the UX of Online Museum Collections: Preliminary Findi...A Rubric for Assessing the UX of Online Museum Collections: Preliminary Findi...
A Rubric for Assessing the UX of Online Museum Collections: Preliminary Findi...
 
Providing Services to our Remote Users: Open Source Solutions
Providing Services to our Remote Users: Open Source SolutionsProviding Services to our Remote Users: Open Source Solutions
Providing Services to our Remote Users: Open Source Solutions
 
User Experience at the European Bioinformatics Institute
User Experience at the European Bioinformatics Institute User Experience at the European Bioinformatics Institute
User Experience at the European Bioinformatics Institute
 
Designing with the user in mind: how user-centred design (UCD) can work for ...
Designing with the user in mind: how user-centred design (UCD) can work for ...Designing with the user in mind: how user-centred design (UCD) can work for ...
Designing with the user in mind: how user-centred design (UCD) can work for ...
 
How information spreads on social networks when unexpected events occur
How information spreads on social networks when unexpected events occurHow information spreads on social networks when unexpected events occur
How information spreads on social networks when unexpected events occur
 
Cascon 2016 Keynote: Disrupting Developer Productivity One Bot at a Time
Cascon 2016 Keynote: Disrupting Developer Productivity One Bot at a TimeCascon 2016 Keynote: Disrupting Developer Productivity One Bot at a Time
Cascon 2016 Keynote: Disrupting Developer Productivity One Bot at a Time
 

Similar to Social Semantic Web and Crowdsourcing

Social machines: theory design and incentives
Social machines: theory design and incentivesSocial machines: theory design and incentives
Social machines: theory design and incentivesElena Simperl
 
Putting Linked Data to Use in a Large Higher-Education Organisation
Putting Linked Data to Use in a Large Higher-Education OrganisationPutting Linked Data to Use in a Large Higher-Education Organisation
Putting Linked Data to Use in a Large Higher-Education OrganisationMathieu d'Aquin
 
Visual and interactive storytelling slides cmg 2015-final
Visual and interactive storytelling slides    cmg 2015-finalVisual and interactive storytelling slides    cmg 2015-final
Visual and interactive storytelling slides cmg 2015-finalKatherine-CWACanada
 
Crowdsourcing for research libraries
Crowdsourcing for research librariesCrowdsourcing for research libraries
Crowdsourcing for research librariesElena Simperl
 
Designing and deploying mobile user studies in the wild: a practical guide
Designing and deploying mobile user studies in the wild: a practical guideDesigning and deploying mobile user studies in the wild: a practical guide
Designing and deploying mobile user studies in the wild: a practical guideKaren Church
 
Towards the Design of Intelligible Object-based Applications for the Web of T...
Towards the Design of Intelligible Object-based Applications for the Web of T...Towards the Design of Intelligible Object-based Applications for the Web of T...
Towards the Design of Intelligible Object-based Applications for the Web of T...Pierrick Thébault
 
Tablet computers for dissemination of digital scholarly editions
Tablet computers for dissemination of digital scholarly editionsTablet computers for dissemination of digital scholarly editions
Tablet computers for dissemination of digital scholarly editionsAodhán Kelly
 
Designing a second generation of open data platforms
Designing a second generation of open data platformsDesigning a second generation of open data platforms
Designing a second generation of open data platformsCharalampos Alexopoulos
 
Requirements Engineering for the Humanities
Requirements Engineering for the HumanitiesRequirements Engineering for the Humanities
Requirements Engineering for the HumanitiesShawn Day
 
Open domain Question Answering System - Research project in NLP
Open domain  Question Answering System - Research project in NLPOpen domain  Question Answering System - Research project in NLP
Open domain Question Answering System - Research project in NLPGVS Chaitanya
 
SciSoftDays Talk - Howison: Spreading the work in software ecosystems
SciSoftDays Talk - Howison: Spreading the work in software ecosystemsSciSoftDays Talk - Howison: Spreading the work in software ecosystems
SciSoftDays Talk - Howison: Spreading the work in software ecosystemsJames Howison
 
Human Computation for Big Data
Human Computation for Big DataHuman Computation for Big Data
Human Computation for Big DataeXascale Infolab
 
talk for HK SME center about web3.0 , AI, mobile apps
talk for HK SME center about web3.0 , AI, mobile appstalk for HK SME center about web3.0 , AI, mobile apps
talk for HK SME center about web3.0 , AI, mobile appsAlex Hung
 
Conducting User Research
Conducting User ResearchConducting User Research
Conducting User ResearchJeremy Horn
 
Multi-Mode Conceptual Clustering Algorithm Based Social Group Identification ...
Multi-Mode Conceptual Clustering Algorithm Based Social Group Identification ...Multi-Mode Conceptual Clustering Algorithm Based Social Group Identification ...
Multi-Mode Conceptual Clustering Algorithm Based Social Group Identification ...inventionjournals
 
Choosing the right crowd. Expert finding in social networks. edbt 2013
Choosing the right crowd. Expert finding in social networks. edbt 2013Choosing the right crowd. Expert finding in social networks. edbt 2013
Choosing the right crowd. Expert finding in social networks. edbt 2013Marco Brambilla
 

Similar to Social Semantic Web and Crowdsourcing (20)

Social machines: theory design and incentives
Social machines: theory design and incentivesSocial machines: theory design and incentives
Social machines: theory design and incentives
 
Putting Linked Data to Use in a Large Higher-Education Organisation
Putting Linked Data to Use in a Large Higher-Education OrganisationPutting Linked Data to Use in a Large Higher-Education Organisation
Putting Linked Data to Use in a Large Higher-Education Organisation
 
Visual and interactive storytelling slides cmg 2015-final
Visual and interactive storytelling slides    cmg 2015-finalVisual and interactive storytelling slides    cmg 2015-final
Visual and interactive storytelling slides cmg 2015-final
 
Crowdsourcing for research libraries
Crowdsourcing for research librariesCrowdsourcing for research libraries
Crowdsourcing for research libraries
 
Howison si2 keynote
Howison si2 keynoteHowison si2 keynote
Howison si2 keynote
 
Designing and deploying mobile user studies in the wild: a practical guide
Designing and deploying mobile user studies in the wild: a practical guideDesigning and deploying mobile user studies in the wild: a practical guide
Designing and deploying mobile user studies in the wild: a practical guide
 
Web 1.0, 2.0 & 3.0
Web 1.0, 2.0 & 3.0Web 1.0, 2.0 & 3.0
Web 1.0, 2.0 & 3.0
 
Web 1.0, 3.0. 3.0 School of Business
Web 1.0, 3.0. 3.0 School of BusinessWeb 1.0, 3.0. 3.0 School of Business
Web 1.0, 3.0. 3.0 School of Business
 
Towards the Design of Intelligible Object-based Applications for the Web of T...
Towards the Design of Intelligible Object-based Applications for the Web of T...Towards the Design of Intelligible Object-based Applications for the Web of T...
Towards the Design of Intelligible Object-based Applications for the Web of T...
 
Tablet computers for dissemination of digital scholarly editions
Tablet computers for dissemination of digital scholarly editionsTablet computers for dissemination of digital scholarly editions
Tablet computers for dissemination of digital scholarly editions
 
Designing a second generation of open data platforms
Designing a second generation of open data platformsDesigning a second generation of open data platforms
Designing a second generation of open data platforms
 
Requirements Engineering for the Humanities
Requirements Engineering for the HumanitiesRequirements Engineering for the Humanities
Requirements Engineering for the Humanities
 
Open domain Question Answering System - Research project in NLP
Open domain  Question Answering System - Research project in NLPOpen domain  Question Answering System - Research project in NLP
Open domain Question Answering System - Research project in NLP
 
SciSoftDays Talk - Howison: Spreading the work in software ecosystems
SciSoftDays Talk - Howison: Spreading the work in software ecosystemsSciSoftDays Talk - Howison: Spreading the work in software ecosystems
SciSoftDays Talk - Howison: Spreading the work in software ecosystems
 
Benoit Visual Only Retrieval
Benoit Visual Only RetrievalBenoit Visual Only Retrieval
Benoit Visual Only Retrieval
 
Human Computation for Big Data
Human Computation for Big DataHuman Computation for Big Data
Human Computation for Big Data
 
talk for HK SME center about web3.0 , AI, mobile apps
talk for HK SME center about web3.0 , AI, mobile appstalk for HK SME center about web3.0 , AI, mobile apps
talk for HK SME center about web3.0 , AI, mobile apps
 
Conducting User Research
Conducting User ResearchConducting User Research
Conducting User Research
 
Multi-Mode Conceptual Clustering Algorithm Based Social Group Identification ...
Multi-Mode Conceptual Clustering Algorithm Based Social Group Identification ...Multi-Mode Conceptual Clustering Algorithm Based Social Group Identification ...
Multi-Mode Conceptual Clustering Algorithm Based Social Group Identification ...
 
Choosing the right crowd. Expert finding in social networks. edbt 2013
Choosing the right crowd. Expert finding in social networks. edbt 2013Choosing the right crowd. Expert finding in social networks. edbt 2013
Choosing the right crowd. Expert finding in social networks. edbt 2013
 

More from eswcsummerschool

Semantic Aquarium - ESWC SSchool 14 - Student project
Semantic Aquarium - ESWC SSchool 14 - Student projectSemantic Aquarium - ESWC SSchool 14 - Student project
Semantic Aquarium - ESWC SSchool 14 - Student projecteswcsummerschool
 
Syrtaki - ESWC SSchool 14 - Student project
Syrtaki  - ESWC SSchool 14 - Student projectSyrtaki  - ESWC SSchool 14 - Student project
Syrtaki - ESWC SSchool 14 - Student projecteswcsummerschool
 
Keep fit (a bit) - ESWC SSchool 14 - Student project
Keep fit (a bit)  - ESWC SSchool 14 - Student projectKeep fit (a bit)  - ESWC SSchool 14 - Student project
Keep fit (a bit) - ESWC SSchool 14 - Student projecteswcsummerschool
 
Arabic Sentiment Lexicon - ESWC SSchool 14 - Student project
Arabic Sentiment Lexicon - ESWC SSchool 14 - Student projectArabic Sentiment Lexicon - ESWC SSchool 14 - Student project
Arabic Sentiment Lexicon - ESWC SSchool 14 - Student projecteswcsummerschool
 
FIT-8BIT An activity music assistant - ESWC SSchool 14 - Student project
FIT-8BIT An activity music assistant - ESWC SSchool 14 - Student projectFIT-8BIT An activity music assistant - ESWC SSchool 14 - Student project
FIT-8BIT An activity music assistant - ESWC SSchool 14 - Student projecteswcsummerschool
 
Personal Tours at the British Museum - ESWC SSchool 14 - Student project
Personal Tours at the British Museum  - ESWC SSchool 14 - Student projectPersonal Tours at the British Museum  - ESWC SSchool 14 - Student project
Personal Tours at the British Museum - ESWC SSchool 14 - Student projecteswcsummerschool
 
Exhibition recommendation using British Museum data and Event Registry - ESWC...
Exhibition recommendation using British Museum data and Event Registry - ESWC...Exhibition recommendation using British Museum data and Event Registry - ESWC...
Exhibition recommendation using British Museum data and Event Registry - ESWC...eswcsummerschool
 
Empowering fishing business using Linked Data - ESWC SSchool 14 - Student pro...
Empowering fishing business using Linked Data - ESWC SSchool 14 - Student pro...Empowering fishing business using Linked Data - ESWC SSchool 14 - Student pro...
Empowering fishing business using Linked Data - ESWC SSchool 14 - Student pro...eswcsummerschool
 
Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014
Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014
Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014eswcsummerschool
 
Hands On: Amazon Mechanical Turk - M. Acosta - ESWC SS 2014
Hands On: Amazon Mechanical Turk - M. Acosta - ESWC SS 2014 Hands On: Amazon Mechanical Turk - M. Acosta - ESWC SS 2014
Hands On: Amazon Mechanical Turk - M. Acosta - ESWC SS 2014 eswcsummerschool
 
Tutorial: Querying a Marine Data Warehouse Using SPARQL - I. Fundulaki - ESWC...
Tutorial: Querying a Marine Data Warehouse Using SPARQL - I. Fundulaki - ESWC...Tutorial: Querying a Marine Data Warehouse Using SPARQL - I. Fundulaki - ESWC...
Tutorial: Querying a Marine Data Warehouse Using SPARQL - I. Fundulaki - ESWC...eswcsummerschool
 
Mon norton tut_publishing01
Mon norton tut_publishing01Mon norton tut_publishing01
Mon norton tut_publishing01eswcsummerschool
 
Mon domingue introduction to the school
Mon domingue introduction to the schoolMon domingue introduction to the school
Mon domingue introduction to the schooleswcsummerschool
 
Mon norton tut_querying cultural heritage data
Mon norton tut_querying cultural heritage dataMon norton tut_querying cultural heritage data
Mon norton tut_querying cultural heritage dataeswcsummerschool
 
Tue acosta hands_on_providinglinkeddata
Tue acosta hands_on_providinglinkeddataTue acosta hands_on_providinglinkeddata
Tue acosta hands_on_providinglinkeddataeswcsummerschool
 
Thu bernstein key_warp_speed
Thu bernstein key_warp_speedThu bernstein key_warp_speed
Thu bernstein key_warp_speedeswcsummerschool
 
Fri schreiber key_knowledge engineering
Fri schreiber key_knowledge engineeringFri schreiber key_knowledge engineering
Fri schreiber key_knowledge engineeringeswcsummerschool
 
Mon norton tut_queryinglinkeddata02
Mon norton tut_queryinglinkeddata02Mon norton tut_queryinglinkeddata02
Mon norton tut_queryinglinkeddata02eswcsummerschool
 
Mon fundulaki tut_querying linked data
Mon fundulaki tut_querying linked dataMon fundulaki tut_querying linked data
Mon fundulaki tut_querying linked dataeswcsummerschool
 
Mon domingue key_introduction to semantic
Mon domingue key_introduction to semanticMon domingue key_introduction to semantic
Mon domingue key_introduction to semanticeswcsummerschool
 

More from eswcsummerschool (20)

Semantic Aquarium - ESWC SSchool 14 - Student project
Semantic Aquarium - ESWC SSchool 14 - Student projectSemantic Aquarium - ESWC SSchool 14 - Student project
Semantic Aquarium - ESWC SSchool 14 - Student project
 
Syrtaki - ESWC SSchool 14 - Student project
Syrtaki  - ESWC SSchool 14 - Student projectSyrtaki  - ESWC SSchool 14 - Student project
Syrtaki - ESWC SSchool 14 - Student project
 
Keep fit (a bit) - ESWC SSchool 14 - Student project
Keep fit (a bit)  - ESWC SSchool 14 - Student projectKeep fit (a bit)  - ESWC SSchool 14 - Student project
Keep fit (a bit) - ESWC SSchool 14 - Student project
 
Arabic Sentiment Lexicon - ESWC SSchool 14 - Student project
Arabic Sentiment Lexicon - ESWC SSchool 14 - Student projectArabic Sentiment Lexicon - ESWC SSchool 14 - Student project
Arabic Sentiment Lexicon - ESWC SSchool 14 - Student project
 
FIT-8BIT An activity music assistant - ESWC SSchool 14 - Student project
FIT-8BIT An activity music assistant - ESWC SSchool 14 - Student projectFIT-8BIT An activity music assistant - ESWC SSchool 14 - Student project
FIT-8BIT An activity music assistant - ESWC SSchool 14 - Student project
 
Personal Tours at the British Museum - ESWC SSchool 14 - Student project
Personal Tours at the British Museum  - ESWC SSchool 14 - Student projectPersonal Tours at the British Museum  - ESWC SSchool 14 - Student project
Personal Tours at the British Museum - ESWC SSchool 14 - Student project
 
Exhibition recommendation using British Museum data and Event Registry - ESWC...
Exhibition recommendation using British Museum data and Event Registry - ESWC...Exhibition recommendation using British Museum data and Event Registry - ESWC...
Exhibition recommendation using British Museum data and Event Registry - ESWC...
 
Empowering fishing business using Linked Data - ESWC SSchool 14 - Student pro...
Empowering fishing business using Linked Data - ESWC SSchool 14 - Student pro...Empowering fishing business using Linked Data - ESWC SSchool 14 - Student pro...
Empowering fishing business using Linked Data - ESWC SSchool 14 - Student pro...
 
Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014
Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014
Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014
 
Hands On: Amazon Mechanical Turk - M. Acosta - ESWC SS 2014
Hands On: Amazon Mechanical Turk - M. Acosta - ESWC SS 2014 Hands On: Amazon Mechanical Turk - M. Acosta - ESWC SS 2014
Hands On: Amazon Mechanical Turk - M. Acosta - ESWC SS 2014
 
Tutorial: Querying a Marine Data Warehouse Using SPARQL - I. Fundulaki - ESWC...
Tutorial: Querying a Marine Data Warehouse Using SPARQL - I. Fundulaki - ESWC...Tutorial: Querying a Marine Data Warehouse Using SPARQL - I. Fundulaki - ESWC...
Tutorial: Querying a Marine Data Warehouse Using SPARQL - I. Fundulaki - ESWC...
 
Mon norton tut_publishing01
Mon norton tut_publishing01Mon norton tut_publishing01
Mon norton tut_publishing01
 
Mon domingue introduction to the school
Mon domingue introduction to the schoolMon domingue introduction to the school
Mon domingue introduction to the school
 
Mon norton tut_querying cultural heritage data
Mon norton tut_querying cultural heritage dataMon norton tut_querying cultural heritage data
Mon norton tut_querying cultural heritage data
 
Tue acosta hands_on_providinglinkeddata
Tue acosta hands_on_providinglinkeddataTue acosta hands_on_providinglinkeddata
Tue acosta hands_on_providinglinkeddata
 
Thu bernstein key_warp_speed
Thu bernstein key_warp_speedThu bernstein key_warp_speed
Thu bernstein key_warp_speed
 
Fri schreiber key_knowledge engineering
Fri schreiber key_knowledge engineeringFri schreiber key_knowledge engineering
Fri schreiber key_knowledge engineering
 
Mon norton tut_queryinglinkeddata02
Mon norton tut_queryinglinkeddata02Mon norton tut_queryinglinkeddata02
Mon norton tut_queryinglinkeddata02
 
Mon fundulaki tut_querying linked data
Mon fundulaki tut_querying linked dataMon fundulaki tut_querying linked data
Mon fundulaki tut_querying linked data
 
Mon domingue key_introduction to semantic
Mon domingue key_introduction to semanticMon domingue key_introduction to semantic
Mon domingue key_introduction to semantic
 

Social Semantic Web and Crowdsourcing

  • 1. SOCIAL SEMANTIC WEB AND CROWDSOURCING ELENA SIMPERL, UNIVERSITY OF SOUTHAMPTON, UK E.SIMPERL@SOTON.AC.UK, @ESIMPERL WITH CONTRIBUTIONS FROM “COMBINING THE SOCIAL AND THE SEMANTIC WEB”, TUTORIAL @ESWC2011 BY SHEILA KINSELLA, DERI, IE AND DENNY VRANDECIC, KIT, DE, HTTP://SEMANTICWEB.ORG/WIKI/COMBINING_THE_SOCIAL_AND_THE_SEMANTIC_WEB 03.09.14 1
  • 2. FUNDAMENTALS OF THE SOCIAL WEB 03.09.14 2
  • 3. THE SOCIAL WEB Software deployed over the Web, designed and developed to foster and leverage social interaction Users are connected via common digital artifacts they share and through social ties • Sharing platforms such as flickr • Collaborative platforms such Wikipedia, discussion forums, Quora, Groupon etc • Social networks such as Facebook combining both aspects 03.09.14 3
  • 4. DEVELOPMENT Early ‘90s • Read-only, static Web sites • Interaction via posts on personal Web sites possibly responding to someone else’s statements Mid '90s • Read-write applications • Gradual increase in interaction between site owners and their readers/customers through various communication features Late ‘90s to present • First social sites, increasingly ubiquitous deployment of social features in non-social Web sites 4
  • 5. CHALLENGES OF SOCIAL SOFTWARE Many isolated communities • Data silos Challenges • Data cannot be shared and reused easily • No control over the management and transfer of personal data • User engagement and incentives engineering 5 03.09.14
  • 7. THE ROLE OF SEMANTIC TECHNOLOGIES SEMANTICS FOR… Advanced information management Data interoperability Data privacy Data liberation SCENARIOS 1. Semantic technologies as integral part of the functionality of social software, enabling interoperability 2. Enhanced information management through the usage of machine-processable semantics 03.09.14 7
  • 8. ONTOLOGIES FOAF (Friend-of-a-Friend) SIOC (Semantically-Interlinked Online Communities) OPO (Online Presence Ontology) Tagging ontologies 03.09.14 8
  • 9. OPEN GRAPH Represent Web content in a social graph in an interoperable way Design decisions, see http://www.scribd.com/doc/ 30715288/The-Open-Graph-Protocol- Design-Decisions Used by Facebook (‘stories’), Google (snippets), IMDb etc. Facebook: actors, apps, objects with metadata to create stories • Example: Elena has finished reading ‘The Economist’, an object of type Newspaper • Types with attributes, extensions allowed • Pre-defined and custom actions on objects 9 http://ogp.me/ Image from https://developers.facebook.com/docs/ opengraph/creating-custom-stories 03.09.14
  • 10. SOFTWARE PLATFORMS • Drupal (RDFa), https://www.drupal.org/ • Semantic MediaWiki, https://semantic-mediawiki.org/ • Semantic blogs, http://www.zemanta.com/ 03.09.14 10
  • 11. 11 COLLABORATIVE TECHNOLOGIES FOR THE SEMANTIC WEB 03.09.14
  • 12. BASICS Idea: leverage Social Web technologies and platforms to create and manage Semantic Web content Two classes of approaches • Mining user contributions, e.g., turning folksonomies into OWL ontologies, not part of this tutorial • Crowdsource Semantic Web tasks, e.g., GWAPs, paid microtasks, campaigns, contents 03.09.14 12
  • 13. CROWDSOURCING: PROBLEM SOLVING VIA OPEN CALLS "Simply defined, crowdsourcing represents the act of a company or institution taking a function once performed by employees and outsourcing it to an undefined (and generally large) network of people in the form of an open call. This can take the form of peer-production (when the job is performed collaboratively), but is also often undertaken by sole individuals. The crucial prerequisite is the use of the open call format and the large network of potential .“ [Howe, 2006] 13 03.09.14
  • 14. THE MANY FACES OF CROWDSOURCING 14 Image from http://03.09.14 dailycrowdsource.com/
  • 15. RELATED: HUMAN COMPUTATION Outsourcing tasks that machines find difficult to solve to humans 03.09.14 15
  • 16. IN THIS TALK: CROWDSOURCING DATA CITATION ‘The USEWOD experiment ‘ • Goal: collect information about the usage of Linked Data sets in research papers • Explore different crowdsourcing methods • Online tool to link publications to data sets (and their versions) • 1st feasibility study with 10 researchers in May 2014 03.09.14 http://prov.usewod.org/ 16 9650 publications
  • 17. HOW TO CROWDSOURCE What: Goal Who: Contributors How: Process Why: Motivation and incentives http:// sloanreview.mit.edu/ article/the-collective-intelligence- genome/ 17 Image at http:// jasonbiv.files.wordpress.com/ 2012/08/ collectiveintelligence_genome .jpg
  • 18. EXAMPLE: CITIZEN SCIENCE WHAT IS OUTSOURCED • Object recognition, labeling, categorization in media content WHO IS THE CROWD • Anyone HOW IS THE TASK OUTSOURCED • Highly parallelizable tasks • Every item is handled by multiple annotators • Every annotator provides an answer • Consolidated answers solve scientific problems WHY DO PEOPLE CONTRIBUTE • Fun, interest in science, desire to contribute to science 03.09.14 18 zooniverse.org
  • 19. WHY Money (rewards) Love Glory Love and glory reduce costs Money and glory make the crowd move faster Intrinsic vs extrinsic motivation Rewards/incentives influence motivation. • They are granted by an external ‘judge’ Successful volunteer crowdsourcing is difficult to predict or replicate • Highly context-specific • Not applicable to arbitrary tasks Reward models often easier to study and control (if performance can be reliably measured) • Not always easy to abstract from social aspects (free-riding, social pressure) • May undermine intrinsic motivation 19
  • 20. WHY (2) In our example: Who benefits from the results Who owns the results Money • Different models: pay-per-time, pay-per-unit, winner-takes-it-all • Define the rewards, analyze trade-offs accuracy vs costs, avoid spam Love • Authors, researchers, Dbpedia community, open access advocates, publishers, casual gamers etc. Glory • Competitions, mentions, awards 03.09.14 20 Image from Nature.com
  • 21. WHAT In general: something you cannot do (fully) automatically In our example: annotating research papers with data set information • Alternative representations of the domain • What if the paper is not available? • What if the domain is not known in advance or is infinite? • Do we know the list of potential answers? • Is there only one correct solution to each atomic task? • How many people would solve the same task? 03.09.14 21
  • 22. WHO In general • An open (‘unknown’) crowd • Scale helps • Qualifications might be a pre-requisite In our example • People who know the papers or the data sets • Experts in the (broader) field • Casual gamers • Librarians • Anyone (knowledgeable of English, with a computer/cell phone etc) • Combinations thereof 03.09.14 22
  • 23. HOW: PROCESS In general: • Explicit vs. implicit participation • Affects motivation See also [Quinn & Bederson, 2012] • Tasks broken down into smaller units undertaken in parallel by different people • Does not apply to all forms of crowdsourcing • Coordination required to handle cases with more complex workflows • Sometimes using different crowds for workflow parts • Example: text summarization • Task assignment to match skills, preferences, and contribution history • Example: random assignment vs meritocracy vs full autonomy • Partial or independent answers consolidated and aggregated into complete solution • Example: challenges (e.g., Netflix) vs aggregation (e.g., Wikipedia) 03.09.14 23
  • 24. HOW: PROCESS (2) In our example: • Use the data collected here to train a IE algorithm • Use paid microtask workers to go a first screening, then expert crowd to sort out challenging cases • What if you have very long documents potentially mentioning different/ unknown data sets? • Competition via Twitter • ‘Which version of DBpedia does this paper use?’ • One question a day, prizes • Needs golden standard to bootstrap and redundancy • Involve the authors • Use crowdsourcing to find out Twitter accounts, then launch campaign on Twitter • Write an email to the authors… • Change the task • Which papers use Dbpedia 3.X? • Competition to find all papers 03.09.14 24
  • 25. HOW: VALIDATION In general: • Solutions space closed vs. open • Redundancy vs. iterations/peer-review • Performance measurements/ground truth • Available and accepted • Automated techniques employed to predict accurate solutions In our example: • Domain is fairly restricted • Spam and obvious wrong answers can be detected easily • When are two answers the same? • Can there be more than one correct answer per question? • Redundancy may not be the final answer • Most people will be able to identify the data set, but sometimes the actual version is not trivial to reproduce • Make educated version guess based on time intervals and other features 03.09.14 25
  • 26. 03.09.14 26 EXAMPLES Image courtesy of M. Acosta
  • 27. TASTE IT! TRY IT! Restaurant review Android app developed in the Insemtives project Uses Dbpedia concepts to generate structured reviews Uses mechanism design/gamification to configure incentives User study • 2274 reviews by 180 reviewers referring to 900 restaurants, using 5667 DPpedia concepts 2500 2000 1500 1000 500 03.09.14 27 https://play.google.com/store/apps/details?id=insemtives.android&hl=en 0 CAFE FASTFOOD PUB RESTAURANT Numer of reviews Number of semantic annotations (type of cuisine) Number of semantic annotations (dishes)
  • 28. ONTOPRONTO GWAP ~ Game with a purpose Players‘ contributions are used to improve the results of technically challenging tasks OntoPronto • Task: map Wikipedia topics to classes of an ontology • Selection-agreement game: players have to agree on answers • Step 1: decide whether a topic is a class or an instance • Step 2: classify topic in the ontology 03.09.14 1 2 28
  • 29. URBANOPOLY 03.09.14 29 http://www.cefriel.com/en/urbanopoly-game/
  • 30. See also [Sarasua, Simperl, Noy, ISWC2012] CROWDMAP Experiments using MTurk, CrowdFlower and established benchmarks Enhancing the results of automatic techniques Fast, accurate, cost-effective 30 03.09.14 CartP 301-­‐304 100R50P Edas-­‐Iasted 100R50P Ekaw-­‐Iasted 100R50P Cmt-­‐Ekaw 100R50P ConfOf-­‐Ekaw Imp 301-­‐304 PRECISION 0.53 0.8 1.0 1.0 0.93 0.73 RECALL 1.0 0.42 0.7 0.75 0.65 1.0
  • 31. See also [Acosta et al., ISWC 2013] DBPEDIA CURATION Find Contest Linked Data experts Difficult task Final prize Verify Microtasks Workers Easy task Micropayments TripleCheckMate [Kontoskostas2013] MTurk http://mturk.com 31 03.09.14
  • 32. CROWDQ Understand the meaning of a keyword query Build a structured (SPARQL) query template Answer the query over Linked Open Data Gianluca Demartini, Beth Trushkowsky, Tim Kraska, and Michael Franklin. CrowdQ: Crowdsourced Query Understanding. In: 6th Biennial Conference on Innovative Data Systems Research (CIDR 2013) 03.09.14 32
  • 33. LODREFINE 03.09.14 33 http://research.zemanta.com/crowds-to-the-rescue/
  • 34. PROBLEMS AND CHALLENGES • What is feasible and how can tasks be optimally crowdsourced, e.g., translated into microtasks? • Examples: data quality assessment for technical and contextual features; subjective vs objective tasks (also in modeling); open-ended questions • What to show to users • Natural language descriptions of Linked Data/SPARQL • How much context • What form of rendering • How about links? • How to combine with automatic tools • Which results to validate • Low precision (no fun for gamers...) • Low recall (vs all possible questions) • How to embed crowdsourcing into an existing application • Tasks are fine granular, perceived as additional burden to the actual functionality • What to do with the resulting data? • Integration into existing practices • Vocabularies 03.09.14 34
  • 36. E.SIMPERL@SOTON.AC.UK @ESIMPERL WWW.SOCIAM.ORG WWW.PLANET-DATA.EU 03.09.14 36