SlideShare a Scribd company logo
1 of 34
Javier D. Fernández, Miguel A. Martínez-Prieto, Claudio Gutiérrez Compact Representation of Large RDF Data Sets for Publishing and Exchange
The Motivation ,[object Object]
Syntaxes oriented mainly to represent documents
RDF/XML, N3, Turtle, JSON, etc.
Document-centric data-centricview
Redundancy
No structure(chunks)
Lackof metadata
sequentiality of theinformation
Use?
examples:
Billion Triple 2010 (~3200M triples, 318 gzippedchunks, ~27GB)
Uniprot (~845M, 12 gzippedchunks, ~23GB)Pag 2 Image: renjithkrishnan / FreeDigitalPhotos.net
Real World example: Billion Triple 2010 Where is the metadata? Who did publish this? Do I have all the data? ? [318] PUBLICATION EXCHANGE RDF RDF RDF gzip RDF RDF RDF gzip [318] basicoperations Pag 3
Needs Theaims of theformat are:  ,[object Object]
Metadata
Compactness
Efficient exchange
RDF compression
Basic data operations Pag 4 Image: jscreationzs / FreeDigitalPhotos.net
HDT Overview HDT ,[object Object]
Phylosophy of publication and exchange,
Compact RDF representation
basedon 3 maincomponents:  Header, Dictionary and TriplesPag 5
HDT Overview Pag 6
Header Metadatainformationaboutthe RDF collection ,[object Object]
Source and providerinformation
Publication data
Data set statistics
Otherinformation
Information required to retrieve and process the represented data
Location/s, format/s, encoding/s, etc.Pag 7
Header use ? Header Header [318] HDT HDT RDF RDF RDF HDT HDT RDF RDF RDF HDT HDT Dictionary &Triples [318] Dictionary &Triples Pag 8
Header in Practice http://purl.org/HDT/hdt# SWP SCOVO, SDMX, hdt Void, DublinCore, etc hdt Pag 9

More Related Content

What's hot

Rdf In A Nutshell V1
Rdf In A Nutshell V1Rdf In A Nutshell V1
Rdf In A Nutshell V1Fabien Gandon
 
Getting Started with Knowledge Graphs
Getting Started with Knowledge GraphsGetting Started with Knowledge Graphs
Getting Started with Knowledge GraphsPeter Haase
 
Towards an Open Research Knowledge Graph
Towards an Open Research Knowledge GraphTowards an Open Research Knowledge Graph
Towards an Open Research Knowledge GraphSören Auer
 
Knowledge Graph Introduction
Knowledge Graph IntroductionKnowledge Graph Introduction
Knowledge Graph IntroductionSören Auer
 
Linked Data and Knowledge Graphs -- Constructing and Understanding Knowledge ...
Linked Data and Knowledge Graphs -- Constructing and Understanding Knowledge ...Linked Data and Knowledge Graphs -- Constructing and Understanding Knowledge ...
Linked Data and Knowledge Graphs -- Constructing and Understanding Knowledge ...Jeff Z. Pan
 
A Primer on Entity Resolution
A Primer on Entity ResolutionA Primer on Entity Resolution
A Primer on Entity ResolutionBenjamin Bengfort
 
Java programming pdf
Java programming pdfJava programming pdf
Java programming pdfJi Hoon Lee
 
Programming the Semantic Web
Programming the Semantic WebProgramming the Semantic Web
Programming the Semantic WebLuigi De Russis
 
Building an Enterprise Knowledge Graph @Uber: Lessons from Reality
Building an Enterprise Knowledge Graph @Uber: Lessons from RealityBuilding an Enterprise Knowledge Graph @Uber: Lessons from Reality
Building an Enterprise Knowledge Graph @Uber: Lessons from RealityJoshua Shinavier
 
ESWC 2017 Tutorial Knowledge Graphs
ESWC 2017 Tutorial Knowledge GraphsESWC 2017 Tutorial Knowledge Graphs
ESWC 2017 Tutorial Knowledge GraphsPeter Haase
 
BLOGIC. (ISWC 2009 Invited Talk)
BLOGIC.  (ISWC 2009 Invited Talk)BLOGIC.  (ISWC 2009 Invited Talk)
BLOGIC. (ISWC 2009 Invited Talk)Pat Hayes
 
Data Modeling with Neo4j
Data Modeling with Neo4jData Modeling with Neo4j
Data Modeling with Neo4jNeo4j
 
Introduction to Knowledge Graphs and Semantic AI
Introduction to Knowledge Graphs and Semantic AIIntroduction to Knowledge Graphs and Semantic AI
Introduction to Knowledge Graphs and Semantic AISemantic Web Company
 
Κανονικοποίηση βάσης δεδομένων
Κανονικοποίηση βάσης δεδομένωνΚανονικοποίηση βάσης δεδομένων
Κανονικοποίηση βάσης δεδομένωνNikos Mpalatsoukas
 
Training Series - Intro to Neo4j
Training Series - Intro to Neo4jTraining Series - Intro to Neo4j
Training Series - Intro to Neo4jNeo4j
 
RDF 개념 및 구문 소개
RDF 개념 및 구문 소개RDF 개념 및 구문 소개
RDF 개념 및 구문 소개Dongbum Kim
 

What's hot (20)

Rdf In A Nutshell V1
Rdf In A Nutshell V1Rdf In A Nutshell V1
Rdf In A Nutshell V1
 
Getting Started with Knowledge Graphs
Getting Started with Knowledge GraphsGetting Started with Knowledge Graphs
Getting Started with Knowledge Graphs
 
Towards an Open Research Knowledge Graph
Towards an Open Research Knowledge GraphTowards an Open Research Knowledge Graph
Towards an Open Research Knowledge Graph
 
Knowledge Graph Introduction
Knowledge Graph IntroductionKnowledge Graph Introduction
Knowledge Graph Introduction
 
Linked Data and Knowledge Graphs -- Constructing and Understanding Knowledge ...
Linked Data and Knowledge Graphs -- Constructing and Understanding Knowledge ...Linked Data and Knowledge Graphs -- Constructing and Understanding Knowledge ...
Linked Data and Knowledge Graphs -- Constructing and Understanding Knowledge ...
 
A Primer on Entity Resolution
A Primer on Entity ResolutionA Primer on Entity Resolution
A Primer on Entity Resolution
 
Metadata Workshop
Metadata WorkshopMetadata Workshop
Metadata Workshop
 
Java programming pdf
Java programming pdfJava programming pdf
Java programming pdf
 
Programming the Semantic Web
Programming the Semantic WebProgramming the Semantic Web
Programming the Semantic Web
 
Building an Enterprise Knowledge Graph @Uber: Lessons from Reality
Building an Enterprise Knowledge Graph @Uber: Lessons from RealityBuilding an Enterprise Knowledge Graph @Uber: Lessons from Reality
Building an Enterprise Knowledge Graph @Uber: Lessons from Reality
 
ESWC 2017 Tutorial Knowledge Graphs
ESWC 2017 Tutorial Knowledge GraphsESWC 2017 Tutorial Knowledge Graphs
ESWC 2017 Tutorial Knowledge Graphs
 
BLOGIC. (ISWC 2009 Invited Talk)
BLOGIC.  (ISWC 2009 Invited Talk)BLOGIC.  (ISWC 2009 Invited Talk)
BLOGIC. (ISWC 2009 Invited Talk)
 
Data Modeling with Neo4j
Data Modeling with Neo4jData Modeling with Neo4j
Data Modeling with Neo4j
 
SHACL by example
SHACL by exampleSHACL by example
SHACL by example
 
RDF 해설서
RDF 해설서RDF 해설서
RDF 해설서
 
OWL and OBO
OWL and OBOOWL and OBO
OWL and OBO
 
Introduction to Knowledge Graphs and Semantic AI
Introduction to Knowledge Graphs and Semantic AIIntroduction to Knowledge Graphs and Semantic AI
Introduction to Knowledge Graphs and Semantic AI
 
Κανονικοποίηση βάσης δεδομένων
Κανονικοποίηση βάσης δεδομένωνΚανονικοποίηση βάσης δεδομένων
Κανονικοποίηση βάσης δεδομένων
 
Training Series - Intro to Neo4j
Training Series - Intro to Neo4jTraining Series - Intro to Neo4j
Training Series - Intro to Neo4j
 
RDF 개념 및 구문 소개
RDF 개념 및 구문 소개RDF 개념 및 구문 소개
RDF 개념 및 구문 소개
 

Viewers also liked

F14 101 syllabus
F14 101 syllabusF14 101 syllabus
F14 101 syllabusGale Pooley
 
Lecture 5 handout
Lecture 5 handoutLecture 5 handout
Lecture 5 handoutGale Pooley
 
Creative Writing Projects at the National Aspergillosis Centre
Creative Writing Projects at the National Aspergillosis CentreCreative Writing Projects at the National Aspergillosis Centre
Creative Writing Projects at the National Aspergillosis CentreGraham Atherton
 
Oracle Day 2013 ~ MySQL Replication
Oracle Day 2013 ~ MySQL Replication Oracle Day 2013 ~ MySQL Replication
Oracle Day 2013 ~ MySQL Replication Manuel Contreras
 
ソーシャル+動画 セミナー
ソーシャル+動画 セミナーソーシャル+動画 セミナー
ソーシャル+動画 セミナーYuichi Morito
 
BrandZ Top 50 Most Valuable Latin American Brands 2014
BrandZ Top 50 Most Valuable Latin American Brands 2014BrandZ Top 50 Most Valuable Latin American Brands 2014
BrandZ Top 50 Most Valuable Latin American Brands 2014Kantar
 
Brand Ministry - Cas Client Mercialys
Brand Ministry - Cas Client MercialysBrand Ministry - Cas Client Mercialys
Brand Ministry - Cas Client MercialysBrand Ministry
 
Get visible on google+
Get visible on google+Get visible on google+
Get visible on google+Bill Layton
 
Artical. list. report
Artical. list. reportArtical. list. report
Artical. list. reportsadia213
 
Millward Brown AdReaction 2012: Kenya
Millward Brown AdReaction 2012: KenyaMillward Brown AdReaction 2012: Kenya
Millward Brown AdReaction 2012: KenyaKantar
 
Yard shed designs
Yard shed designsYard shed designs
Yard shed designsJim Young
 
clean tech Industry Analysis
clean tech Industry Analysisclean tech Industry Analysis
clean tech Industry AnalysisManvindra Singh
 

Viewers also liked (20)

F14 101 syllabus
F14 101 syllabusF14 101 syllabus
F14 101 syllabus
 
Lecture 5 handout
Lecture 5 handoutLecture 5 handout
Lecture 5 handout
 
Creative Writing Projects at the National Aspergillosis Centre
Creative Writing Projects at the National Aspergillosis CentreCreative Writing Projects at the National Aspergillosis Centre
Creative Writing Projects at the National Aspergillosis Centre
 
Oracle Day 2013 ~ MySQL Replication
Oracle Day 2013 ~ MySQL Replication Oracle Day 2013 ~ MySQL Replication
Oracle Day 2013 ~ MySQL Replication
 
ソーシャル+動画 セミナー
ソーシャル+動画 セミナーソーシャル+動画 セミナー
ソーシャル+動画 セミナー
 
Headache
HeadacheHeadache
Headache
 
BrandZ Top 50 Most Valuable Latin American Brands 2014
BrandZ Top 50 Most Valuable Latin American Brands 2014BrandZ Top 50 Most Valuable Latin American Brands 2014
BrandZ Top 50 Most Valuable Latin American Brands 2014
 
Brand Ministry - Cas Client Mercialys
Brand Ministry - Cas Client MercialysBrand Ministry - Cas Client Mercialys
Brand Ministry - Cas Client Mercialys
 
I session short
I session shortI session short
I session short
 
Get visible on google+
Get visible on google+Get visible on google+
Get visible on google+
 
How do I TWEET?
How do I TWEET?How do I TWEET?
How do I TWEET?
 
Artical. list. report
Artical. list. reportArtical. list. report
Artical. list. report
 
590 2
590 2590 2
590 2
 
367 lecture 4
367 lecture 4367 lecture 4
367 lecture 4
 
Millward Brown AdReaction 2012: Kenya
Millward Brown AdReaction 2012: KenyaMillward Brown AdReaction 2012: Kenya
Millward Brown AdReaction 2012: Kenya
 
Yard shed designs
Yard shed designsYard shed designs
Yard shed designs
 
Sigue tuestrella
Sigue tuestrellaSigue tuestrella
Sigue tuestrella
 
374 2
374 2374 2
374 2
 
Text 1-113
Text 1-113Text 1-113
Text 1-113
 
clean tech Industry Analysis
clean tech Industry Analysisclean tech Industry Analysis
clean tech Industry Analysis
 

Similar to Compact Representation of Large RDF Data Sets for Publishing and Exchange

Inductive Triple Graphs: A purely functional approach to represent RDF
Inductive Triple Graphs: A purely functional approach to represent RDFInductive Triple Graphs: A purely functional approach to represent RDF
Inductive Triple Graphs: A purely functional approach to represent RDFJose Emilio Labra Gayo
 
Getty Vocabulary Program LOD: Ontologies and Semantic Representation
Getty Vocabulary Program LOD: Ontologies and Semantic RepresentationGetty Vocabulary Program LOD: Ontologies and Semantic Representation
Getty Vocabulary Program LOD: Ontologies and Semantic RepresentationVladimir Alexiev, PhD, PMP
 
Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...
Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...
Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...Jinho Choi
 
Making Use of the Linked Data Cloud: The Role of Index Structures
Making Use of the Linked Data Cloud: The Role of Index StructuresMaking Use of the Linked Data Cloud: The Role of Index Structures
Making Use of the Linked Data Cloud: The Role of Index StructuresThomas Gottron
 
A middleware for storing massive RDF graphs into NoSQL
A middleware for storing massive RDF graphs into NoSQLA middleware for storing massive RDF graphs into NoSQL
A middleware for storing massive RDF graphs into NoSQLLuiz Henrique Zambom Santana
 
Image Similarity Detection at Scale Using LSH and Tensorflow with Andrey Gusev
Image Similarity Detection at Scale Using LSH and Tensorflow with Andrey GusevImage Similarity Detection at Scale Using LSH and Tensorflow with Andrey Gusev
Image Similarity Detection at Scale Using LSH and Tensorflow with Andrey GusevDatabricks
 
Graph Analytics with ArangoDB
Graph Analytics with ArangoDBGraph Analytics with ArangoDB
Graph Analytics with ArangoDBArangoDB Database
 
introtorandrstudio.ppt
introtorandrstudio.pptintrotorandrstudio.ppt
introtorandrstudio.pptMalkaParveen3
 
R Brown-bag seminars : Seminar-8
R Brown-bag seminars : Seminar-8R Brown-bag seminars : Seminar-8
R Brown-bag seminars : Seminar-8Muhammad Nabi Ahmad
 
THoSP: an Algorithm for Nesting Property Graphs
THoSP: an Algorithm for Nesting Property GraphsTHoSP: an Algorithm for Nesting Property Graphs
THoSP: an Algorithm for Nesting Property GraphsGiacomo Bergami
 
Map-Side Merge Joins for Scalable SPARQL BGP Processing
Map-Side Merge Joins for Scalable SPARQL BGP ProcessingMap-Side Merge Joins for Scalable SPARQL BGP Processing
Map-Side Merge Joins for Scalable SPARQL BGP ProcessingAlexander Schätzle
 
Rstudio is an integrated development environment for R that allows users to i...
Rstudio is an integrated development environment for R that allows users to i...Rstudio is an integrated development environment for R that allows users to i...
Rstudio is an integrated development environment for R that allows users to i...SWAROOP KUMAR K
 
From Data to Knowledge thru Grailog Visualization
From Data to Knowledge thru Grailog VisualizationFrom Data to Knowledge thru Grailog Visualization
From Data to Knowledge thru Grailog Visualizationgiurca
 
Learning spark ch04 - Working with Key/Value Pairs
Learning spark ch04 - Working with Key/Value PairsLearning spark ch04 - Working with Key/Value Pairs
Learning spark ch04 - Working with Key/Value Pairsphanleson
 
Supplementary material for my following paper: Infinite Latent Process Decomp...
Supplementary material for my following paper: Infinite Latent Process Decomp...Supplementary material for my following paper: Infinite Latent Process Decomp...
Supplementary material for my following paper: Infinite Latent Process Decomp...Tomonari Masada
 

Similar to Compact Representation of Large RDF Data Sets for Publishing and Exchange (20)

Inductive Triple Graphs: A purely functional approach to represent RDF
Inductive Triple Graphs: A purely functional approach to represent RDFInductive Triple Graphs: A purely functional approach to represent RDF
Inductive Triple Graphs: A purely functional approach to represent RDF
 
Democratizing Big Semantic Data management
Democratizing Big Semantic Data managementDemocratizing Big Semantic Data management
Democratizing Big Semantic Data management
 
Getty Vocabulary Program LOD: Ontologies and Semantic Representation
Getty Vocabulary Program LOD: Ontologies and Semantic RepresentationGetty Vocabulary Program LOD: Ontologies and Semantic Representation
Getty Vocabulary Program LOD: Ontologies and Semantic Representation
 
Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...
Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...
Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...
 
Unit3 slides
Unit3 slidesUnit3 slides
Unit3 slides
 
Making Use of the Linked Data Cloud: The Role of Index Structures
Making Use of the Linked Data Cloud: The Role of Index StructuresMaking Use of the Linked Data Cloud: The Role of Index Structures
Making Use of the Linked Data Cloud: The Role of Index Structures
 
An Introduction To Map-Reduce
An Introduction To Map-ReduceAn Introduction To Map-Reduce
An Introduction To Map-Reduce
 
A middleware for storing massive RDF graphs into NoSQL
A middleware for storing massive RDF graphs into NoSQLA middleware for storing massive RDF graphs into NoSQL
A middleware for storing massive RDF graphs into NoSQL
 
3DRepo
3DRepo3DRepo
3DRepo
 
Image Similarity Detection at Scale Using LSH and Tensorflow with Andrey Gusev
Image Similarity Detection at Scale Using LSH and Tensorflow with Andrey GusevImage Similarity Detection at Scale Using LSH and Tensorflow with Andrey Gusev
Image Similarity Detection at Scale Using LSH and Tensorflow with Andrey Gusev
 
Graph Analytics with ArangoDB
Graph Analytics with ArangoDBGraph Analytics with ArangoDB
Graph Analytics with ArangoDB
 
introtorandrstudio.ppt
introtorandrstudio.pptintrotorandrstudio.ppt
introtorandrstudio.ppt
 
R Brown-bag seminars : Seminar-8
R Brown-bag seminars : Seminar-8R Brown-bag seminars : Seminar-8
R Brown-bag seminars : Seminar-8
 
THoSP: an Algorithm for Nesting Property Graphs
THoSP: an Algorithm for Nesting Property GraphsTHoSP: an Algorithm for Nesting Property Graphs
THoSP: an Algorithm for Nesting Property Graphs
 
Map-Side Merge Joins for Scalable SPARQL BGP Processing
Map-Side Merge Joins for Scalable SPARQL BGP ProcessingMap-Side Merge Joins for Scalable SPARQL BGP Processing
Map-Side Merge Joins for Scalable SPARQL BGP Processing
 
Rstudio is an integrated development environment for R that allows users to i...
Rstudio is an integrated development environment for R that allows users to i...Rstudio is an integrated development environment for R that allows users to i...
Rstudio is an integrated development environment for R that allows users to i...
 
Big Data & Hadoop
Big Data & HadoopBig Data & Hadoop
Big Data & Hadoop
 
From Data to Knowledge thru Grailog Visualization
From Data to Knowledge thru Grailog VisualizationFrom Data to Knowledge thru Grailog Visualization
From Data to Knowledge thru Grailog Visualization
 
Learning spark ch04 - Working with Key/Value Pairs
Learning spark ch04 - Working with Key/Value PairsLearning spark ch04 - Working with Key/Value Pairs
Learning spark ch04 - Working with Key/Value Pairs
 
Supplementary material for my following paper: Infinite Latent Process Decomp...
Supplementary material for my following paper: Infinite Latent Process Decomp...Supplementary material for my following paper: Infinite Latent Process Decomp...
Supplementary material for my following paper: Infinite Latent Process Decomp...
 

Compact Representation of Large RDF Data Sets for Publishing and Exchange