LOTUS:
Adaptive Text Search
for Big Linked Data
F. Ilievski | W. Beek | M. van Erp | L. Rietveld | S. Schlobach
INTRODUCTION
A wealth of information is potentially available in Linked Open
Data sources.
This information could be exploited by researchers and
developers for tools and evaluations on a LOD-scale.
But accessing Big Linked Data is not trivial.
▸ No centralized query service for Linked Data
▸ Limited natural language access to Linked Data
▸ Text-based retrieval is not customizable
“The lack of a global entry point to resources
through a flexible text index is a serious
obstacle for linked data consumption”
by researchers and developers.
REQUIREMENTS
for a global text-based entry point to LOD
1. Text-based queries
2. Resilience (of text search)
3. Findability (of authoritative and non-authoritative statements)
4. Availability
5. Scalability
6. Serviceability (for both machines and humans)
7. Customizability
The LOTUS approach
LOD Laundromat
A centralized Linked Data
cleaning and publishing architecture
Allows access to a big subset of the LOD Cloud
38 billion statements
LOTUS!
lotus.lodlaundromat.org
LOTUS
Linguistic entry
point
LOTUS is a linguistic
entry point to the LOD
Laundromat data
collection.
Approximate
matching
Allows statements to
be findable based on
approximate string
matching on
associated literals.
Adaptive
Framework
LOTUS allows the
resource retrieval to be
tailored to fit various
use cases.
4 matchings × 8 rankings = 32 retrieval options
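The count of 32 is simply the Cartesian product of the matching options and the ranking algorithms named on the slides. A minimal sketch of that arithmetic follows; the string labels are taken from the slides, while the `MATCHERS`, `RANKERS`, and `retrieval_options` names are illustrative assumptions and not identifiers from the LOTUS API.

```python
from itertools import product

# Labels copied from the slides; the variable names are hypothetical.
MATCHERS = [
    "phrase",
    "disjunctive token",
    "conjunctive token",
    "conjunctive token with character edit distance",
]
RANKERS = [
    "length normalization",
    "practical scoring function",
    "phrase proximity",
    "terminological richness",
    "semantic richness",
    "recency",
    "degree popularity",
    "appearance popularity",
]


def retrieval_options():
    """Return every (matcher, ranker) pair as a list of tuples."""
    return list(product(MATCHERS, RANKERS))


print(len(retrieval_options()))  # 4 * 8 = 32
```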
Customizability of retrieval
Retrieval options
Matching options
▸ Phrase matching
▸ Disjunctive token matching
▸ Conjunctive token matching
▸ Conjunctive token matching with character edit distance
Ranking algorithms
▸ Length normalization
▸ Practical scoring function
▸ Phrase proximity
▸ Terminological richness
▸ Semantic richness
▸ Recency
▸ Degree popularity
▸ Appearance popularity
Content-based | Document-based | Resource-based
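To make the difference between the four matching options concrete, here is a small in-memory sketch. It is not the LOTUS implementation: the function names, the simple word tokenizer, and the `max_edits=1` default are assumptions chosen for illustration, and the edit distance is a plain Levenshtein distance computed per token.

```python
import re


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def edit_distance(a, b):
    """Levenshtein distance between two strings (row-by-row dynamic programming)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def phrase_match(query, literal):
    """All query tokens occur in the literal, adjacent and in the same order."""
    q, t = tokenize(query), tokenize(literal)
    n = len(q)
    return n > 0 and any(t[i:i + n] == q for i in range(len(t) - n + 1))


def disjunctive_match(query, literal):
    """At least one query token occurs in the literal."""
    return bool(set(tokenize(query)) & set(tokenize(literal)))


def conjunctive_match(query, literal):
    """Every query token occurs in the literal, in any order."""
    q = set(tokenize(query))
    return bool(q) and q <= set(tokenize(literal))


def fuzzy_conjunctive_match(query, literal, max_edits=1):
    """Every query token is within max_edits character edits of some literal token."""
    q, t = tokenize(query), tokenize(literal)
    return bool(q) and all(
        any(edit_distance(qt, lt) <= max_edits for lt in t) for qt in q
    )
```

With the literal "Linked Open Data cloud", the query "data open" fails phrase matching but passes conjunctive matching, and the misspelled query "linkd dta" passes only the edit-distance variant.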
Implementation
Web Interface & API
lotus.lodlaundromat.org
Distributed architecture
4,334,672,073 Indexed literals
Scaled horizontally over 5 servers
Data replication to ensure high
runtime availability of LOTUS
Usage scenarios and performance
Scaling and performance
We used 18k queries to benchmark 18 retrieval combinations
of LOTUS
Conclusions
A Centralized
linguistic entry to
big linked data
LOTUS indexes over 4
billion literals from the
LOD Laundromat
An Adaptive
Retrieval
framework
LOTUS allows its
retrieval to be
customized to fit users’
needs by offering 32
matching+ranking
options.
“connecting the
dots”
LOTUS relies heavily
on 2 existing systems
(LOD Laundromat &
ES), but fills the gap by
offering a much
needed tool for
scientific evaluation.
LOTUS is
Vision: Scaling applications and evaluations at LOD
scale with LOD Lab
Future work
The precision and recall of LOTUS should be
evaluated on concrete applications, such as
Entity Linking and Network Analysis.
Context-dependent ranking could be added in the
future to take the query context into account in
order to improve the ranking accuracy.
Thanks!
Any questions?
You can find me at
@earthling91 / f.ilievski@vu.nl
lotus.lodlaundromat.org

LOTUS: Adaptive Text Search for Big Linked Data

  • 1. LOTUS: Adaptive Text Search for Big Linked Data F. Ilievski | W. Beek | M. van Erp | L. Rietveld | S. Schlobach
  • 2. INTRODUCTION A wealth of information is potentially available in Linked Open Data sources. This information could be exploited by researchers and developers for tools and evaluations at LOD scale. But accessing Big Linked Data is not trivial.
  • 3. INTRODUCTION No centralized query service for Linked Data. A wealth of information is potentially available in Linked Open Data sources. This information could be exploited by researchers and developers for tools and evaluations at LOD scale. But accessing Big Linked Data is not trivial.
  • 4. INTRODUCTION Limited natural language access to Linked Data. No centralized query service for Linked Data. A wealth of information is potentially available in Linked Open Data sources. This information could be exploited by researchers and developers for tools and evaluations at LOD scale. But accessing Big Linked Data is not trivial.
  • 5. INTRODUCTION Limited natural language access to Linked Data. No centralized query service for Linked Data. Text-based retrieval is not customizable. A wealth of information is potentially available in Linked Open Data sources. This information could be exploited by researchers and developers for tools and evaluations at LOD scale. But accessing Big Linked Data is not trivial.
  • 6. “The lack of a global entry point to resources through a flexible text index is a serious obstacle for linked data consumption.”
  • 7. “The lack of a global entry point to resources through a flexible text index is a serious obstacle for linked data consumption” by researchers and developers.
  • 8. REQUIREMENTS for a global text-based entry point to LOD
  • 9. REQUIREMENTS 1. Text-based queries 2. Resilience (of text search) 3. Findability (of authoritative and non-authoritative statements) 4. Availability 5. Scalability 6. Serviceability (for both machines and humans) 7. Customizability
  • 11. LOD Laundromat: a centralized Linked Data cleaning and publishing architecture. Allows access to a big subset of the LOD Cloud: 38 billion statements.
  • 13. LOTUS. Linguistic entry point: LOTUS is a linguistic entry point to the LOD Laundromat data collection. Approximate matching: allows statements to be findable based on approximate string matching on associated literals. Adaptive framework: LOTUS allows the resource retrieval to be tailored to fit various use cases.
  • 14. Customizability of retrieval: 4 matchings × 8 rankings = 32 retrieval options
  • 15. Retrieval options. Matching options: ▸ Phrase matching ▸ Disjunctive token matching ▸ Conjunctive token matching ▸ Conjunctive token matching with character edit distance
  • 16. Retrieval options. Matching options: ▸ Phrase matching ▸ Disjunctive token matching ▸ Conjunctive token matching ▸ Conjunctive token matching with character edit distance. Ranking algorithms: ▸ Length normalization ▸ Practical scoring function ▸ Phrase proximity ▸ Terminological richness ▸ Semantic richness ▸ Recency ▸ Degree popularity ▸ Appearance popularity. Ranking categories: content-based, document-based, resource-based
  • 19. Web Interface & API: lotus.lodlaundromat.org
  • 20. Distributed architecture: 4,334,672,073 indexed literals. Scaled horizontally over 5 servers. Data replication to ensure high runtime availability of LOTUS.
  • 22. Scaling and performance: We used 18k queries to benchmark 18 retrieval combinations of LOTUS.
  • 23. Conclusions. LOTUS is: (1) A centralized linguistic entry to big linked data: LOTUS indexes over 4 billion literals from the LOD Laundromat. (2) An adaptive retrieval framework: LOTUS allows its retrieval to be customized to fit users’ needs by offering 32 matching + ranking options. (3) “Connecting the dots”: LOTUS relies heavily on 2 existing systems (LOD Laundromat & ES), but fills the gap by offering a much needed tool for scientific evaluation.
  • 24. Vision: Scaling applications and evaluations at LOD scale with LOD Lab
  • 25. Future work: The precision and recall of LOTUS should be evaluated on concrete applications, such as Entity Linking and Network Analysis.
  • 26. Future work: The precision and recall of LOTUS should be evaluated on concrete applications, such as Entity Linking and Network Analysis. Context-dependent ranking could be added in the future to take the query context into account in order to improve the ranking accuracy.
  • 27. Thanks! Any questions? You can find me at @earthling91 / f.ilievski@vu.nl lotus.lodlaundromat.org
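
The four matching options named on slides 15 and 16 can be illustrated with a small in-memory sketch. This is not the LOTUS implementation (the slides only say that LOTUS builds on LOD Laundromat and ES); the function names, the whitespace tokenizer, the lowercase option labels, and the default edit-distance budget of 1 are all assumptions made for illustration. The last lines enumerate the 4 × 8 = 32 matcher/ranker pairs from slide 14, using the ranking names from slide 16 purely as labels.

```python
from itertools import product

# Labels taken from the slides; the snake_case spellings are illustrative only.
MATCHERS = ["phrase", "disjunctive", "conjunctive", "conjunctive_fuzzy"]
RANKERS = [
    "length_normalization", "practical_scoring_function", "phrase_proximity",
    "terminological_richness", "semantic_richness", "recency",
    "degree_popularity", "appearance_popularity",
]


def tokenize(text):
    """Assumed tokenizer: lowercase and split on whitespace."""
    return text.lower().split()


def edit_distance(a, b):
    """Levenshtein distance between two strings (row-by-row dynamic programming)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # deletion
                           cur[j - 1] + 1,             # insertion
                           prev[j - 1] + (ca != cb)))  # substitution
        prev = cur
    return prev[-1]


def matches(query, literal, mode, max_edits=1):
    """Return True if `literal` matches `query` under the given matching mode."""
    q, lit = tokenize(query), tokenize(literal)
    if mode == "phrase":
        # Query tokens must appear contiguously and in order.
        n = len(q)
        return any(lit[i:i + n] == q for i in range(len(lit) - n + 1))
    if mode == "disjunctive":
        # At least one query token appears in the literal.
        return any(t in lit for t in q)
    if mode == "conjunctive":
        # Every query token appears, in any order.
        return all(t in lit for t in q)
    if mode == "conjunctive_fuzzy":
        # Every query token is within `max_edits` character edits of some literal token.
        return all(any(edit_distance(t, w) <= max_edits for w in lit) for t in q)
    raise ValueError(f"unknown matching mode: {mode}")


# Every matcher can be paired with every ranker.
RETRIEVAL_OPTIONS = list(product(MATCHERS, RANKERS))

if __name__ == "__main__":
    literal = "Linked Open Data cloud"
    for mode in MATCHERS:
        print(mode, matches("linkd dta", literal, mode))
    print(len(RETRIEVAL_OPTIONS), "retrieval options")
```

In this sketch the modes run from strict to lenient on word order and spelling: a misspelled query such as "linkd dta" fails the three exact-token modes and succeeds only when character edit distance is allowed.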