SlideShare a Scribd company logo
1 of 38
Optimized Index Structures
for Querying RDF from the
Web
Presented by : Mahdi Atawna
1
About the Paper
 The paper was published at the Third Latin
American Web Congress in 2005
 Have 56 citation.
Andreas Harth
National University
of Galway, Ireland
Prof. Stefan Decker
National University of
Galway, Ireland
2
Outline
 Overview of Semantic Web.
 Overview of Indexes.
 Paper motivation.
 Methodology.
 Experiment & Result.
 Conclusion.
3
Semantic Web
 Also called :
 Web 3.0.
 the Linked Data Web.
 the Web of Data…whatever you call it.
 the next major evolution in connecting information.
4
Why semantic web?
 It enables data to be linked from a source to any
other source.
 It can be understood by computers so that they can
perform increasingly sophisticated tasks on our
behalf.
5
6
Source: http://lod-cloud.net
Semantic Web Standards
 RDF (Resource Description Framework): The data modeling
language for the Semantic Web (like UML). All Semantic Web
information is stored and represented in the RDF.
 SPARQL : The query language of the Semantic Web.
 OWL (Web Ontology Language) The schema language, or
knowledge representation (KR) language, of the Semantic Web.
7
What is RDF?
 RDF is the data model of the Semantic Web.
 That means that all data in Semantic Web
technologies is represented as RDF.
 If you store Semantic Web data, it's in RDF.
 If you query Semantic Web data (typically using
SPARQL), it's RDF data. If you send Semantic Web
data to your friend, it's RDF.
8
9
10
Source : http://www.w3.org/TR/rdf11-primer/
RDF triples
 are representations of graph edges.
11
Subject Object
Predicate
Mahdi Hebron
born in
RDF example
@prefix foaf: <http://xmlns.com/foaf/0.1/ .
<http://example.org/bob#me>
foaf:topic_interest
<http://wikidata.org/entity/Q12418> .
12
RDF example
@prefix foaf: <http://xmlns.com/foaf/0.1/ .
<http://example.org/bob#me>
foaf:topic_interest
<http://wikidata.org/entity/Q12418> .
Subject
predicate
Object
13
Sparql query language
SELECT ?p ?o
{
<http://nasa.dataincubator.org/spacecraft/1968-089A> ?p ?o
}
14
What is database index?
 A database index is a data structure that improves the
speed of data retrieval operations on a database table at
the cost of additional writes and storage space to maintain
the index data structure.
 Index goal : The index structure enables fast retrieval of
data
15
16
Index example
key articles
Leonardo [104,70,12,98]
Mona Lisa [2,201,7,20,12]
Francesco [1,8,900,104]
17
Paper motivation
 Previous Systems provide a storage infrastructure for RDF data, but
index structure which do not support typical query scenarios
for data from the Web which results in poor query answering
performance in some cases.
18
Methodology
 The researchers present a new index structure that handle the
data from the Web .
 Implemented the index structure in a lightweight software called
YARS
19
RDF Index structures
 The authors suggested an index structure that contains two
sets:
1. Lexicon : covers the string representation of RDF graph
(r,l,b)
2. Quad indexes : cover the quads (triples).
20
1. Lexicon indexes
 NodeOID and OIDNode Index
 Keyword Index
21
1. Lexicon indexes
 NodeOID and OIDNode Index :
22
Key value
<http://www.harth.org/andreas/#me> 3
<http://decker.cn/stefan/> 14
<http://sw.deri.org/ aharth/foaf.rdf> 11
<http://www.deri.org/> 1
1. Lexicon indexes
 Keyword Index (Popular in search engines)
23
Key No of hits List of hits
“Andreas” 1 3
“Decker” 1 11
”Harth” 1 3
“Stefan” 2 11,13
2. Quad indexes
 Access Patterns
 Combined Indexes
 Occurrence Counts
24
2. Quad indexes
A- Access Patterns
25
No Access pattern No Access pattern
1 (?:?:?:?) 9 (s:?:o:c)
2 (s:?:?:?) 10 (?:?:o:c)
3 (s:p:?:?) 11 (?:?:o:?)
4 (s:p:o:?) 12 (?:?:?:c)
5 (s:p:o:c) 13 (s:?:?:c)
6 (?:p:?:?) 14 (s:p:?:c)
7 (?:p:o:?) 15 (?:p:?:c)
8 (?:p:o:c) 16 (s:?:o:?)
2. Quad indexes
A- Access Patterns
26
No Access pattern values
1 (?:?:?:?) [1,5,3]
2 (s:?:?:?) [2]
3 (s:p:?:?) [9,8,2,3]
4 (s:p:o:?) [1,3]
5 (s:p:o:c) [1]
6 (?:p:?:?) [76,9]
7 (?:p:o:?) [2,3]
2. Quad indexes
B- Combined Indexes
27
spoc poc osc csp cp os
(?:?:?:?) (?:p:?:?) (?:?:o:?)
(?:?:?:c) (?:p:?:c) (s:?:o:?)
(s:?:?:?) (?:p:o:?) (?:?:o:c)
(s:?:?:c)
(s:p:?:?) (?:p:o:c) (s:?:o:c)
(s:p:?:c)
(s:p:o:?)
(s:p:o:c)
2. Quad indexes
C- Occurrence Counts
28
No
Access
pattern
values count
1 (?:?:?:?) [1,5,3] 3
2 (s:?:?:?) [2] 1
3 (s:p:?:?) [9,8,2,3] 4
4 (s:p:o:?) [1,3] 2
5 (s:p:o:c) [1] 1
6 (?:p:?:?) [76,9] 2
7 (?:p:o:?) [2,3] 2
YARS
 Web application that built in JAVA.
 Has two parts:
 a storage component that handles both persistent and in-
memory indexes.
 a query handler to perform query processing and evaluation.
29
Experiment
 They evaluated the performance based on a dataset of 2.8
million triples (293 MB).
 The testing server has :
 Pentium-4 2.4 GHz
 4 GB RAM
 running Debian Sarge .
30
Experiment
They considered the following RDF stores for evaluation:
 Sesame.
 Kowari (failed to get a running version).
 Redland.
 Jena2. ([9] shows that Sesame generally supersedes Jena in
performance results)
31
Experiment
Experiment parts:
 Index Construction.
 Queries.
32
Result – index construction
System Index size (bytes)
Redland 2.164.019.200
Sesame MySQL 340.381.636
Sesame native 39.997.992
YARS 1.090.002.944
33
Table 8: Index size for the synthetic Univ20 dataset.
Result – index construction
34
Result - queries
No Query
1 ?x rdf:type univ:UndergraduateStudent
2 ?x ?p ”UndergraduateStudent0”
3 <http://www.University965.edu> ?p ?o
4 ?x univ:worksFor ?y
35
Query Redland Sesame MySQL Sesame Native YARS
1 0:10.48 0:18.87 1:05.16 0:18.41
2 0:44.14 0:00.73 0:00.55 0:00.49
3 0:44.15 0:00.46 0:00.47 0:00.32
4 3:04.21 0:03.42 0:01.95 0:00.47
Performance results for quad queries.
Conclusion
 The auther introduced query processing for RDF which is an I
portant issue in sematic web.
 YARS has some overhead for resolving the dependencies and
order in comparison with others.
36
Criticism
 - In experiment , the researchers removed “Kowari “ engine
because the cannot install it.
37
Questions?
Thank you
38

More Related Content

What's hot

14. Files - Data Structures using C++ by Varsha Patil
14. Files - Data Structures using C++ by Varsha Patil14. Files - Data Structures using C++ by Varsha Patil
14. Files - Data Structures using C++ by Varsha Patilwidespreadpromotion
 
6. Linked list - Data Structures using C++ by Varsha Patil
6. Linked list - Data Structures using C++ by Varsha Patil6. Linked list - Data Structures using C++ by Varsha Patil
6. Linked list - Data Structures using C++ by Varsha Patilwidespreadpromotion
 
3. Stack - Data Structures using C++ by Varsha Patil
3. Stack - Data Structures using C++ by Varsha Patil3. Stack - Data Structures using C++ by Varsha Patil
3. Stack - Data Structures using C++ by Varsha Patilwidespreadpromotion
 
LDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked DataLDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked DataOlaf Hartig
 
5. Queue - Data Structures using C++ by Varsha Patil
5. Queue - Data Structures using C++ by Varsha Patil5. Queue - Data Structures using C++ by Varsha Patil
5. Queue - Data Structures using C++ by Varsha Patilwidespreadpromotion
 
R Programming Language
R Programming LanguageR Programming Language
R Programming LanguageNareshKarela1
 
R programming groundup-basic-section-i
R programming groundup-basic-section-iR programming groundup-basic-section-i
R programming groundup-basic-section-iDr. Awase Khirni Syed
 
Introduction to the language R
Introduction to the language RIntroduction to the language R
Introduction to the language Rfbenault
 
Rdf conjunctive query selectivity estimation
Rdf conjunctive query selectivity estimationRdf conjunctive query selectivity estimation
Rdf conjunctive query selectivity estimationINRIA-OAK
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...Olaf Hartig
 
final_copy_camera_ready_paper (7)
final_copy_camera_ready_paper (7)final_copy_camera_ready_paper (7)
final_copy_camera_ready_paper (7)Ankit Rathi
 
R programming Fundamentals
R programming  FundamentalsR programming  Fundamentals
R programming FundamentalsRagia Ibrahim
 
Positional Data Organization and Compression in Web Inverted Indexes
Positional Data Organization and Compression in Web Inverted IndexesPositional Data Organization and Compression in Web Inverted Indexes
Positional Data Organization and Compression in Web Inverted IndexesLeonidas Akritidis
 
Introduction to Data Mining with R and Data Import/Export in R
Introduction to Data Mining with R and Data Import/Export in RIntroduction to Data Mining with R and Data Import/Export in R
Introduction to Data Mining with R and Data Import/Export in RYanchang Zhao
 
Why R? A Brief Introduction to the Open Source Statistics Platform
Why R? A Brief Introduction to the Open Source Statistics PlatformWhy R? A Brief Introduction to the Open Source Statistics Platform
Why R? A Brief Introduction to the Open Source Statistics PlatformSyracuse University
 
R programming Language , Rahul Singh
R programming Language , Rahul SinghR programming Language , Rahul Singh
R programming Language , Rahul SinghRavi Basil
 

What's hot (20)

14. Files - Data Structures using C++ by Varsha Patil
14. Files - Data Structures using C++ by Varsha Patil14. Files - Data Structures using C++ by Varsha Patil
14. Files - Data Structures using C++ by Varsha Patil
 
6. Linked list - Data Structures using C++ by Varsha Patil
6. Linked list - Data Structures using C++ by Varsha Patil6. Linked list - Data Structures using C++ by Varsha Patil
6. Linked list - Data Structures using C++ by Varsha Patil
 
3. Stack - Data Structures using C++ by Varsha Patil
3. Stack - Data Structures using C++ by Varsha Patil3. Stack - Data Structures using C++ by Varsha Patil
3. Stack - Data Structures using C++ by Varsha Patil
 
LDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked DataLDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked Data
 
Getting Started with R
Getting Started with RGetting Started with R
Getting Started with R
 
5. Queue - Data Structures using C++ by Varsha Patil
5. Queue - Data Structures using C++ by Varsha Patil5. Queue - Data Structures using C++ by Varsha Patil
5. Queue - Data Structures using C++ by Varsha Patil
 
R Programming Language
R Programming LanguageR Programming Language
R Programming Language
 
R programming groundup-basic-section-i
R programming groundup-basic-section-iR programming groundup-basic-section-i
R programming groundup-basic-section-i
 
Triple Stores
Triple StoresTriple Stores
Triple Stores
 
Introduction to the language R
Introduction to the language RIntroduction to the language R
Introduction to the language R
 
Rdf conjunctive query selectivity estimation
Rdf conjunctive query selectivity estimationRdf conjunctive query selectivity estimation
Rdf conjunctive query selectivity estimation
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
 
final_copy_camera_ready_paper (7)
final_copy_camera_ready_paper (7)final_copy_camera_ready_paper (7)
final_copy_camera_ready_paper (7)
 
Data Structure
Data StructureData Structure
Data Structure
 
R programming Fundamentals
R programming  FundamentalsR programming  Fundamentals
R programming Fundamentals
 
Positional Data Organization and Compression in Web Inverted Indexes
Positional Data Organization and Compression in Web Inverted IndexesPositional Data Organization and Compression in Web Inverted Indexes
Positional Data Organization and Compression in Web Inverted Indexes
 
Efficient RDF Interchange (ERI) Format for RDF Data Streams
Efficient RDF Interchange (ERI) Format for RDF Data StreamsEfficient RDF Interchange (ERI) Format for RDF Data Streams
Efficient RDF Interchange (ERI) Format for RDF Data Streams
 
Introduction to Data Mining with R and Data Import/Export in R
Introduction to Data Mining with R and Data Import/Export in RIntroduction to Data Mining with R and Data Import/Export in R
Introduction to Data Mining with R and Data Import/Export in R
 
Why R? A Brief Introduction to the Open Source Statistics Platform
Why R? A Brief Introduction to the Open Source Statistics PlatformWhy R? A Brief Introduction to the Open Source Statistics Platform
Why R? A Brief Introduction to the Open Source Statistics Platform
 
R programming Language , Rahul Singh
R programming Language , Rahul SinghR programming Language , Rahul Singh
R programming Language , Rahul Singh
 

Similar to Optimized index structures for querying rdf from the web

Semantic Web and Related Work at W3C
Semantic Web and Related Work at W3CSemantic Web and Related Work at W3C
Semantic Web and Related Work at W3CIvan Herman
 
Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.Enrico Daga
 
Validating statistical Index Data represented in RDF using SPARQL Queries: Co...
Validating statistical Index Data represented in RDF using SPARQL Queries: Co...Validating statistical Index Data represented in RDF using SPARQL Queries: Co...
Validating statistical Index Data represented in RDF using SPARQL Queries: Co...Jose Emilio Labra Gayo
 
Modelling and Querying Lists in RDF. A Pragmatic Study
Modelling and Querying Lists in RDF. A Pragmatic StudyModelling and Querying Lists in RDF. A Pragmatic Study
Modelling and Querying Lists in RDF. A Pragmatic StudyAlbert Meroño-Peñuela
 
Towards efficient processing of RDF data streams
Towards efficient processing of RDF data streamsTowards efficient processing of RDF data streams
Towards efficient processing of RDF data streamsAlejandro Llaves
 
Towards efficient processing of RDF data streams
Towards efficient processing of RDF data streamsTowards efficient processing of RDF data streams
Towards efficient processing of RDF data streamsAlejandro Llaves
 
Strata NYC 2015 - What's coming for the Spark community
Strata NYC 2015 - What's coming for the Spark communityStrata NYC 2015 - What's coming for the Spark community
Strata NYC 2015 - What's coming for the Spark communityDatabricks
 
Ontology mapping for the semantic web
Ontology mapping for the semantic webOntology mapping for the semantic web
Ontology mapping for the semantic webWorawith Sangkatip
 
2021 04-20 apache arrow and its impact on the database industry.pptx
2021 04-20  apache arrow and its impact on the database industry.pptx2021 04-20  apache arrow and its impact on the database industry.pptx
2021 04-20 apache arrow and its impact on the database industry.pptxAndrew Lamb
 
ACS 248th Paper 108 NIST-IUPAC Solubility Data
ACS 248th Paper 108 NIST-IUPAC Solubility DataACS 248th Paper 108 NIST-IUPAC Solubility Data
ACS 248th Paper 108 NIST-IUPAC Solubility DataStuart Chalk
 
RDF Stream Processing: Let's React
RDF Stream Processing: Let's ReactRDF Stream Processing: Let's React
RDF Stream Processing: Let's ReactJean-Paul Calbimonte
 
What's new in spark 2.0?
What's new in spark 2.0?What's new in spark 2.0?
What's new in spark 2.0?Örjan Lundberg
 
معرفی کاربردهای یادگیری عمیق و چالش های آن در کلان داده
معرفی کاربردهای یادگیری عمیق و چالش های آن در کلان دادهمعرفی کاربردهای یادگیری عمیق و چالش های آن در کلان داده
معرفی کاربردهای یادگیری عمیق و چالش های آن در کلان دادهWeb Standards School
 
Nutch and lucene_framework
Nutch and lucene_frameworkNutch and lucene_framework
Nutch and lucene_frameworksamuelhard
 
CMPT470-usask-guest-lecture
CMPT470-usask-guest-lectureCMPT470-usask-guest-lecture
CMPT470-usask-guest-lectureMasud Rahman
 
State of the Semantic Web
State of the Semantic WebState of the Semantic Web
State of the Semantic WebIvan Herman
 
Spark Application Carousel: Highlights of Several Applications Built with Spark
Spark Application Carousel: Highlights of Several Applications Built with SparkSpark Application Carousel: Highlights of Several Applications Built with Spark
Spark Application Carousel: Highlights of Several Applications Built with SparkDatabricks
 
RDF APIs for .NET Framework
RDF APIs for .NET FrameworkRDF APIs for .NET Framework
RDF APIs for .NET FrameworkAdriana Ivanciu
 

Similar to Optimized index structures for querying rdf from the web (20)

Semantic Web and Related Work at W3C
Semantic Web and Related Work at W3CSemantic Web and Related Work at W3C
Semantic Web and Related Work at W3C
 
Democratizing Big Semantic Data management
Democratizing Big Semantic Data managementDemocratizing Big Semantic Data management
Democratizing Big Semantic Data management
 
Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.
 
Validating statistical Index Data represented in RDF using SPARQL Queries: Co...
Validating statistical Index Data represented in RDF using SPARQL Queries: Co...Validating statistical Index Data represented in RDF using SPARQL Queries: Co...
Validating statistical Index Data represented in RDF using SPARQL Queries: Co...
 
Modelling and Querying Lists in RDF. A Pragmatic Study
Modelling and Querying Lists in RDF. A Pragmatic StudyModelling and Querying Lists in RDF. A Pragmatic Study
Modelling and Querying Lists in RDF. A Pragmatic Study
 
Towards efficient processing of RDF data streams
Towards efficient processing of RDF data streamsTowards efficient processing of RDF data streams
Towards efficient processing of RDF data streams
 
Towards efficient processing of RDF data streams
Towards efficient processing of RDF data streamsTowards efficient processing of RDF data streams
Towards efficient processing of RDF data streams
 
Strata NYC 2015 - What's coming for the Spark community
Strata NYC 2015 - What's coming for the Spark communityStrata NYC 2015 - What's coming for the Spark community
Strata NYC 2015 - What's coming for the Spark community
 
Ontology mapping for the semantic web
Ontology mapping for the semantic webOntology mapping for the semantic web
Ontology mapping for the semantic web
 
2021 04-20 apache arrow and its impact on the database industry.pptx
2021 04-20  apache arrow and its impact on the database industry.pptx2021 04-20  apache arrow and its impact on the database industry.pptx
2021 04-20 apache arrow and its impact on the database industry.pptx
 
ACS 248th Paper 108 NIST-IUPAC Solubility Data
ACS 248th Paper 108 NIST-IUPAC Solubility DataACS 248th Paper 108 NIST-IUPAC Solubility Data
ACS 248th Paper 108 NIST-IUPAC Solubility Data
 
RDF Stream Processing: Let's React
RDF Stream Processing: Let's ReactRDF Stream Processing: Let's React
RDF Stream Processing: Let's React
 
What's new in spark 2.0?
What's new in spark 2.0?What's new in spark 2.0?
What's new in spark 2.0?
 
معرفی کاربردهای یادگیری عمیق و چالش های آن در کلان داده
معرفی کاربردهای یادگیری عمیق و چالش های آن در کلان دادهمعرفی کاربردهای یادگیری عمیق و چالش های آن در کلان داده
معرفی کاربردهای یادگیری عمیق و چالش های آن در کلان داده
 
Nutch and lucene_framework
Nutch and lucene_frameworkNutch and lucene_framework
Nutch and lucene_framework
 
Metadata crosswalks
Metadata crosswalksMetadata crosswalks
Metadata crosswalks
 
CMPT470-usask-guest-lecture
CMPT470-usask-guest-lectureCMPT470-usask-guest-lecture
CMPT470-usask-guest-lecture
 
State of the Semantic Web
State of the Semantic WebState of the Semantic Web
State of the Semantic Web
 
Spark Application Carousel: Highlights of Several Applications Built with Spark
Spark Application Carousel: Highlights of Several Applications Built with SparkSpark Application Carousel: Highlights of Several Applications Built with Spark
Spark Application Carousel: Highlights of Several Applications Built with Spark
 
RDF APIs for .NET Framework
RDF APIs for .NET FrameworkRDF APIs for .NET Framework
RDF APIs for .NET Framework
 

More from Mahdi Atawneh

Improving ip geolocation using query logs
Improving ip geolocation using query logsImproving ip geolocation using query logs
Improving ip geolocation using query logsMahdi Atawneh
 
Improvement of shortest path algorithms using subgraphs heuristics
Improvement of shortest path algorithms using subgraphs heuristicsImprovement of shortest path algorithms using subgraphs heuristics
Improvement of shortest path algorithms using subgraphs heuristicsMahdi Atawneh
 
SILT: A Memory-Efficient, High-Performance Key-Value Store
SILT: A Memory-Efficient, High-Performance Key-Value StoreSILT: A Memory-Efficient, High-Performance Key-Value Store
SILT: A Memory-Efficient, High-Performance Key-Value StoreMahdi Atawneh
 
OWL reasoning with WebPIE: calculating the closer of 100 billion triples
OWL reasoning with WebPIE: calculating the closer of 100 billion triplesOWL reasoning with WebPIE: calculating the closer of 100 billion triples
OWL reasoning with WebPIE: calculating the closer of 100 billion triplesMahdi Atawneh
 
Bat algorithm explained. slides ppt pptx
Bat algorithm explained. slides ppt pptxBat algorithm explained. slides ppt pptx
Bat algorithm explained. slides ppt pptxMahdi Atawneh
 
01 nosql and multi model database
01   nosql and multi model database01   nosql and multi model database
01 nosql and multi model databaseMahdi Atawneh
 

More from Mahdi Atawneh (6)

Improving ip geolocation using query logs
Improving ip geolocation using query logsImproving ip geolocation using query logs
Improving ip geolocation using query logs
 
Improvement of shortest path algorithms using subgraphs heuristics
Improvement of shortest path algorithms using subgraphs heuristicsImprovement of shortest path algorithms using subgraphs heuristics
Improvement of shortest path algorithms using subgraphs heuristics
 
SILT: A Memory-Efficient, High-Performance Key-Value Store
SILT: A Memory-Efficient, High-Performance Key-Value StoreSILT: A Memory-Efficient, High-Performance Key-Value Store
SILT: A Memory-Efficient, High-Performance Key-Value Store
 
OWL reasoning with WebPIE: calculating the closer of 100 billion triples
OWL reasoning with WebPIE: calculating the closer of 100 billion triplesOWL reasoning with WebPIE: calculating the closer of 100 billion triples
OWL reasoning with WebPIE: calculating the closer of 100 billion triples
 
Bat algorithm explained. slides ppt pptx
Bat algorithm explained. slides ppt pptxBat algorithm explained. slides ppt pptx
Bat algorithm explained. slides ppt pptx
 
01 nosql and multi model database
01   nosql and multi model database01   nosql and multi model database
01 nosql and multi model database
 

Recently uploaded

Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceSamikshaHamane
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPCeline George
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parentsnavabharathschool99
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxAshokKarra1
 
Science 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxScience 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxMaryGraceBautista27
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Mark Reed
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...Postal Advocate Inc.
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17Celine George
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersSabitha Banu
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Celine George
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfSpandanaRallapalli
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfphamnguyenenglishnb
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptxmary850239
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Celine George
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxDr.Ibrahim Hassaan
 

Recently uploaded (20)

Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in Pharmacovigilance
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERP
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parents
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptx
 
Science 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxScience 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptx
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptxFINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginners
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdf
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptx
 
Raw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptxRaw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptx
 

Optimized index structures for querying rdf from the web

  • 1. Optimized Index Structures for Querying RDF from the Web Presented by : Mahdi Atawna 1
  • 2. About the Paper  The paper was published at the Third Latin American Web Congress in 2005  Have 56 citation. Andreas Harth National University of Galway, Ireland Prof. Stefan Decker National University of Galway, Ireland 2
  • 3. Outline  Overview of Semantic Web.  Overview of Indexes.  Paper motivation.  Methodology.  Experiment & Result.  Conclusion. 3
  • 4. Semantic Web  Also called :  Web 3.0.  the Linked Data Web.  the Web of Data…whatever you call it.  the next major evolution in connecting information. 4
  • 5. Why semantic web?  It enables data to be linked from a source to any other source.  It can be understood by computers so that they can perform increasingly sophisticated tasks on our behalf. 5
  • 7. Semantic Web Standards  RDF (Resource Description Framework): The data modeling language for the Semantic Web (like UML). All Semantic Web information is stored and represented in the RDF.  SPARQL : The query language of the Semantic Web.  OWL (Web Ontology Language) The schema language, or knowledge representation (KR) language, of the Semantic Web. 7
  • 8. What is RDF?  RDF is the data model of the Semantic Web.  That means that all data in Semantic Web technologies is represented as RDF.  If you store Semantic Web data, it's in RDF.  If you query Semantic Web data (typically using SPARQL), it's RDF data. If you send Semantic Web data to your friend, it's RDF. 8
  • 9. 9
  • 11. RDF triples  are representations of graph edges. 11 Subject Object Predicate Mahdi Hebron born in
  • 12. RDF example @prefix foaf: <http://xmlns.com/foaf/0.1/ . <http://example.org/bob#me> foaf:topic_interest <http://wikidata.org/entity/Q12418> . 12
  • 13. RDF example @prefix foaf: <http://xmlns.com/foaf/0.1/ . <http://example.org/bob#me> foaf:topic_interest <http://wikidata.org/entity/Q12418> . Subject predicate Object 13
  • 14. Sparql query language SELECT ?p ?o { <http://nasa.dataincubator.org/spacecraft/1968-089A> ?p ?o } 14
  • 15. What is database index?  A database index is a data structure that improves the speed of data retrieval operations on a database table at the cost of additional writes and storage space to maintain the index data structure.  Index goal : The index structure enables fast retrieval of data 15
  • 16. 16
  • 17. Index example key articles Leonardo [104,70,12,98] Mona Lisa [2,201,7,20,12] Francesco [1,8,900,104] 17
  • 18. Paper motivation  Previous Systems provide a storage infrastructure for RDF data, but index structure which do not support typical query scenarios for data from the Web which results in poor query answering performance in some cases. 18
  • 19. Methodology  The researchers present a new index structure that handle the data from the Web .  Implemented the index structure in a lightweight software called YARS 19
  • 20. RDF Index structures  The authors suggested an index structure that contains two sets: 1. Lexicon : covers the string representation of RDF graph (r,l,b) 2. Quad indexes : cover the quads (triples). 20
  • 21. 1. Lexicon indexes  NodeOID and OIDNode Index  Keyword Index 21
  • 22. 1. Lexicon indexes  NodeOID and OIDNode Index : 22 Key value <http://www.harth.org/andreas/#me> 3 <http://decker.cn/stefan/> 14 <http://sw.deri.org/ aharth/foaf.rdf> 11 <http://www.deri.org/> 1
  • 23. 1. Lexicon indexes  Keyword Index (Popular in search engines) 23 Key No of hits List of hits “Andreas” 1 3 “Decker” 1 11 ”Harth” 1 3 “Stefan” 2 11,13
  • 24. 2. Quad indexes  Access Patterns  Combined Indexes  Occurrence Counts 24
  • 25. 2. Quad indexes A- Access Patterns 25 No Access pattern No Access pattern 1 (?:?:?:?) 9 (s:?:o:c) 2 (s:?:?:?) 10 (?:?:o:c) 3 (s:p:?:?) 11 (?:?:o:?) 4 (s:p:o:?) 12 (?:?:?:c) 5 (s:p:o:c) 13 (s:?:?:c) 6 (?:p:?:?) 14 (s:p:?:c) 7 (?:p:o:?) 15 (?:p:?:c) 8 (?:p:o:c) 16 (s:?:o:?)
  • 26. 2. Quad indexes A- Access Patterns 26 No Access pattern values 1 (?:?:?:?) [1,5,3] 2 (s:?:?:?) [2] 3 (s:p:?:?) [9,8,2,3] 4 (s:p:o:?) [1,3] 5 (s:p:o:c) [1] 6 (?:p:?:?) [76,9] 7 (?:p:o:?) [2,3]
  • 27. 2. Quad indexes B- Combined Indexes 27 spoc poc osc csp cp os (?:?:?:?) (?:p:?:?) (?:?:o:?) (?:?:?:c) (?:p:?:c) (s:?:o:?) (s:?:?:?) (?:p:o:?) (?:?:o:c) (s:?:?:c) (s:p:?:?) (?:p:o:c) (s:?:o:c) (s:p:?:c) (s:p:o:?) (s:p:o:c)
  • 28. 2. Quad indexes C- Occurrence Counts 28 No Access pattern values count 1 (?:?:?:?) [1,5,3] 3 2 (s:?:?:?) [2] 1 3 (s:p:?:?) [9,8,2,3] 4 4 (s:p:o:?) [1,3] 2 5 (s:p:o:c) [1] 1 6 (?:p:?:?) [76,9] 2 7 (?:p:o:?) [2,3] 2
  • 29. YARS  Web application that built in JAVA.  Has two parts:  a storage component that handles both persistent and in- memory indexes.  a query handler to perform query processing and evaluation. 29
  • 30. Experiment  They evaluated the performance based on a dataset of 2.8 million triples (293 MB).  The testing server has :  Pentium-4 2.4 GHz  4 GB RAM  running Debian Sarge . 30
  • 31. Experiment They considered the following RDF stores for evaluation:  Sesame.  Kowari (failed to get a running version).  Redland.  Jena2. ([9] shows that Sesame generally supersedes Jena in performance results) 31
  • 32. Experiment Experiment parts:  Index Construction.  Queries. 32
  • 33. Result – index construction System Index size (bytes) Redland 2.164.019.200 Sesame MySQL 340.381.636 Sesame native 39.997.992 YARS 1.090.002.944 33 Table 8: Index size for the synthetic Univ20 dataset.
  • 34. Result – index construction 34
  • 35. Result - queries No Query 1 ?x rdf:type univ:UndergraduateStudent 2 ?x ?p ”UndergraduateStudent0” 3 <http://www.University965.edu> ?p ?o 4 ?x univ:worksFor ?y 35 Query Redland Sesame MySQL Sesame Native YARS 1 0:10.48 0:18.87 1:05.16 0:18.41 2 0:44.14 0:00.73 0:00.55 0:00.49 3 0:44.15 0:00.46 0:00.47 0:00.32 4 3:04.21 0:03.42 0:01.95 0:00.47 Performance results for quad queries.
  • 36. Conclusion  The auther introduced query processing for RDF which is an I portant issue in sematic web.  YARS has some overhead for resolving the dependencies and order in comparison with others. 36
  • 37. Criticism  - In experiment , the researchers removed “Kowari “ engine because the cannot install it. 37