SlideShare a Scribd company logo
1 of 57
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Herbert Van de Sompel
@hvdsomp
Los Alamos National Laboratory
Acknowledgments: Lyudmila Balakireva, Harihar Shankar, Ruben Verborgh
Access to DBpedia Versions using
Memento and Triple Pattern Fragments
Miel Vander Sande
@Miel_vds
Ghent University
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Outline
• Prelude: Memento and Linked Data
• First Generation DBpedia Archive
• Devising Affordable/Useful Linked Data Archives
• Intermezzo: Triple Pattern Fragments (TPF)
• Intermezzo: Binary RDF Representation (HDT)
• Devising Affordable/Useful Linked Data Archives
• Second Generation DBpedia Archive
• Try this At Home
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Outline
• Prelude: Memento and Linked Data
• First Generation DBpedia Archive
• Devising Affordable/Useful Linked Data Archives
• Intermezzo: Triple Pattern Fragments (TPF)
• Intermezzo: Binary RDF Representation (HDT)
• Devising Affordable/Useful Linked Data Archives
• Second Generation DBpedia Archive
• Try this At Home
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Memento Framework
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Memento LDOW 2010 Submission
Herbert Van de Sompel et al. (2010) An HTTP-Based Versioning Mechanism for Linked Data
http://arxiv.org/abs/1003.3661
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Memento and Linked Data
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Memento and Linked Data
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Time-Series Analysis across DBpedia Versions
Data collected through “follow your nose” HTTP Navigation
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Outline
• Prelude: Memento and Linked Data
• First Generation DBpedia Archive
• Devising Affordable/Useful Linked Data Archives
• Intermezzo: Triple Pattern Fragments (TPF)
• Intermezzo: Binary RDF Representation (HDT)
• Devising Affordable/Useful Linked Data Archives
• Second Generation DBpedia Archive
• Try this At Home
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
First Generation DBpedia Archive: Storage
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
First Generation DBpedia Archive: Storage
Characteristics
upload software
custom
upload time
~ 24 hours per version
storage software
MongoDB
storage space
383 Gb for 10 versions
DBpedia versions
10 versions: 2.0 through 3.9
number of triples
~ 3 billion
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
First Generation DBpedia Archive: Subject-URI Access
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
First Generation DBpedia Archive: Subject-URI Access
http://dbpedia.mementodepot.org/memento/2009052/http://dbpedia.org/page/Oaxaca
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
First Generation DBpedia Archive: Subject-URI Access
Characteristics
TimeGate software
custom
access type
Subject URI & datetime
external integration
current DBpedia
clients
• all clients: direct access to
Memento Subject-URI
• Memento clients: datetime
negotiation with Subject-URI
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
DBpedia Archive @ LANL Since 2010
• Access based on Subject-URI (DBpedia Topic URI) only
• MongoDB storage
• A blob per Subject-URI per version
• Dynamically transformed to other RDF serializations
• No updates since version 3.9 (2013) of DBpedia as a result of
scalability problems
!!!
!!!
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Outline
• Prelude: Memento and Linked Data
• First Generation DBpedia Archive
• Devising Affordable/Useful Linked Data Archives
• Intermezzo: Triple Pattern Fragments (TPF)
• Intermezzo: Binary RDF Representation (HDT)
• Devising Affordable/Useful Linked Data Archives
• Second Generation DBpedia Archive
• Try this At Home
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Affordable & Useful Linked Data Archives
• A Linked Data Archive consists of temporal snapshots of one or
more Linked Data sets, whereby each temporal snapshot reflects
the state of a Linked Data set at a specific moment or interval in
time.
• How to make Linked Data Archives accessible in a manner that is
• affordable/sustainable for the publisher
• useful for the consumer
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Linked Data Archive: Characteristics
General Characteristics Publisher Consumer
Availability
Bandwidth
Cost
Functionality
Interface Expressiveness
LOD Integration
Memento Support
Cross Time/Data
Verdict:
• Publication perspective: $$$$
• Access perspective: ++++
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Linked Data Publishing
• The typical ways of publishing Linked Data on the Web:
• Subject URI access
• Data dump
• SPARQL endpoint
Let’s consider these from the perspective of Linked Data Archives,
i.e. archival storage and access
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Linked Data Archive with Subject-URI Access
• For each temporal snapshot of a Linked Data set, and for each
Subject in that snapshot, publish an RDF description (of the Subject)
at a URI that is specific per snapshot/subject
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Linked Data Archive with Subject-URI Access: Characteristics
General Characteristics Publisher Consumer
Availability rather high rather high
Bandwidth ~ description ~ description
Cost rather low rather high
Functionality
Interface Expressiveness rather low
LOD Integration yes
Memento Support possible
Cross Time/Data follow your nose
Verdict:
• Publication perspective: $$$$
• Access perspective: ++++
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Linked Data Archive Using Dumps
• Renders each temporal snapshot of a Linked Data set as a data
dump that places all temporal dataset triples (as they were at a
specific moment in time) into one or more files
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Linked Data Archive Using Dumps: Characteristics
General Characteristics Publisher Consumer
Availability high high
Bandwidth high high
Cost low high
Functionality
Interface Expressiveness download dataset
LOD Integration no
Memento Support not possible
Cross Time/Data download various datasets
Verdict:
• Publication perspective: $$$$
• Access perspective: ++++
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Linked Data Archive with SPARQL Endpoint(s)
• For each temporal snapshot of a Linked Data set, supports arbitrary
SPARQL queries.
• Different architectural set-ups possible; no standard approach
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Linked Data Archive Using SPARQL Endpoint(s): Characteristics
General Characteristics Publisher Consumer
Availability problematic problematic
Bandwidth ~ query ~ query
Cost high low
Functionality
Interface Expressiveness highly expressive
LOD Integration no
Memento Support hard
Cross Time/Data custom distributed queries
Verdict:
• Publication perspective: $$$$
• Access perspective: ++++
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Affordable & Useful Linked Data Archives
Linked Data Archive Type Publishing Consuming
Data Dump $$$$ ++++
SPARQL Endpoint(s) $$$$ ++++
Subject URI Access $$$$ ++++
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Outline
• Prelude: Memento and Linked Data
• First Generation DBpedia Archive
• Devising Affordable/Useful Linked Data Archives
• Intermezzo: Triple Pattern Fragments (TPF)
• Intermezzo: Binary RDF Representation (HDT)
• Devising Affordable/Useful Linked Data Archives
• Second Generation DBpedia Archive
• Try this At Home
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Linked Data Fragments (Ghent U)
• Every Linked Data interface offers specific fragments of a Linked
Data set
• A fragment is described by
• Selector: what questions can I ask?
• Controls: how do I get more fragments?
• Metadata: helpful information for consumption?
• Each interface type comes with tradeoffs
• cf. the analysis thus far
http://linkeddatafragments.org
Verborgh, R. et al. (2014) Querying datsets on the web with high availability. ISWC 2014
http://ruben.verborgh.org/publications/verborgh_iswc_2014/
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Triple Pattern Fragments (Ghent U)
• Triple Pattern Fragments is a new interface with a different set of
tradeoffs that are attractive from an archival perspective
http://www.hydra-cg.com/spec/latest/triple-pattern-fragments/
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Triple Pattern Fragments (Ghent U)
• Allows querying a Linked Data set according to
?Subject ?Predicate ?Object
patterns
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Triple Pattern Fragments (Ghent U)
Controls: Responses provide navigational help for clients
• Based on emerging Hydra vocabulary for self-describing
Hypermedia-Driven Web APIs
Metadata: dataset info, estimated count (to aid client applications)
http://www.hydra-cg.com/spec/latest/core/
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Outline
• Prelude: Memento and Linked Data
• First Generation DBpedia Archive
• Devising Affordable/Useful Linked Data Archives
• Intermezzo: Triple Pattern Fragments (TPF)
• Intermezzo: Binary RDF Representation (HDT)
• Devising Affordable/Useful Linked Data Archives
• Second Generation DBpedia Archive
• Try this At Home
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Binary RDF Representation for Publication and Exchange (HDT)
http://www.w3.org/Submission/HDT/
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Binary RDF Representation for Publication and Exchange (HDT)
http://www.w3.org/Submission/HDT/
• Header-Dictionary-Triple (HDT) is a compact, binary representation
of RDF datasets.
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Binary RDF Representation for Publication and Exchange (HDT)
http://www.w3.org/Submission/HDT/
• Able to represent massive data sets
• Dictionary/Triples structure achieves
• rapid search for ?subject ?predicate ?object pattern
• high compression rates
• Header provides metadata about the dataset
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Outline
• Prelude: Memento and Linked Data
• First Generation DBpedia Archive
• Devising Affordable/Useful Linked Data Archives
• Intermezzo: Triple Pattern Fragments (TPF)
• Intermezzo: Binary RDF Representation (HDT)
• Devising Affordable/Useful Linked Data Archives
• Second Generation DBpedia Archive
• Try this At Home
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
HDT Linked Data Archive with TPF Support
• For each temporal snapshot of a Linked Data set, generate an HDT
serialization that provides access according to
?subject ?predicate ?object
patterns
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Linked Data Archive with ?s?p?o Access: Characteristics
General Characteristics Publisher Consumer
Availability high high
Bandwidth ~ query ~ query
Cost low medium
Functionality
Interface Expressiveness better than subject-URI only
LOD Integration yes
Memento Support possible
Cross Time/Data follow your nose
Verdict:
• Publication perspective: $$$$
• Access perspective: ++++
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Affordable & Useful Linked Data Archives
Linked Data Archive Type Publishing Consuming
Data Dump $$$$ ++++
SPARQL Endpoint(s) $$$$ ++++
Subject URI Access $$$$ ++++
HDT & TPF $$$$ ++++
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Outline
• Prelude: Memento and Linked Data
• First Generation DBpedia Archive
• Devising Affordable/Useful Linked Data Archives
• Intermezzo: Triple Pattern Fragments (TPF)
• Intermezzo: Binary RDF Representation (HDT)
• Devising Affordable/Useful Linked Data Archives
• Second Generation DBpedia Archive
• Try this At Home
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Second Generation DBpedia Archive: Storage
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Second Generation DBpedia Archive: Storage
Characteristics
upload software
HDT-CPP
upload time
~ 4 hours per version
storage software
HDT binary files
storage space
70 Gb for 12 versions
DBpedia versions
12 versions: 2.0 through 2015
number of triples
~ 5 billion
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Second Generation DBpedia Archive: ?s?p?o Query-URI Access
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Second Generation DBpedia Archive: ?s?p?o Query-URI Access
http://fragments.mementodepot.org/dbpedia_3_8?subject=&predicate=http://dbpedia.org/ontology/b
irthPlace&object=http://dbpedia.org/resource/Ghent
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Second Generation DBpedia Archive: ?s?p?o Query-URI Access
?s?p?o Query-URI Access
TimeGate URI http://fragments.mementodepot.org/timegate/dbpedia?
subject={DBpediaURI}&predicate={DBpediaURI}&object={DBpediaURI}
http://fragments.mementodepot.org/timegate/dbpedia?
subject=&predicate=&object=http://dbpedia.org/resource/Ghent
TimeMap URI not supported
Memento URI http://fragments.mementodepot.org/{DBpediaVersion}?subject={DBpediaURI
}&predicate={DBpediaURI}&object={DBpediaURI}
http://fragments.mementodepot.org/dbpedia_3_0?
subject=&predicate=&object=http://dbpedia.org/resource/Ghent
Further info http://mementoweb.org/depot/native/fragments/
Try it with Memento for Chrome – http://bit.ly/memento-for-chrome
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Second Generation DBpedia Archive: Subject-URI Access
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Second Generation DBpedia Archive: Subject-URI Access
Subject-URI Access
TimeGate URI http://dbpedia.mementodepot.org/timegate/{DBpediaURI}
http://dbpedia.mementodepot.org/timegate/http://dbpedia.org/data/Ghent
TimeMap URI http://dbpedia.mementodepot.org/timemap/link/{DBpediaURI}
http://dbpedia.mementodepot.org/timemap/link/http://dbpedia.org/data/Ghent
Memento URI http://dbpedia.mementodepot.org/{yyyymmdd}/{DBpediaURI}
http://dbpedia.mementodepot.org/20080103/http://dbpedia.org/data/Ghent
Further info http://mementoweb.org/depot/native/dbpedia/
Try it with Memento for Chrome – http://bit.ly/memento-for-chrome
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Second Generation DBpedia Archive: Access
Characteristics
TimeGate software
① node.js LDF server 2.0.0
② LDF js client
access type
① ?s?p?o Query-URI & datetime
② Subject-URI & datetime
external integration
① DBpedia LDF server
② current DBpedia
clients
• all clients: direct access to
Mementos of Subject-URI and
?s?p?o Query-URI
• Memento clients: datetime
negotiation with Subject-URI and
?s?p?o Query-URI
1
2
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Outline
• Prelude: Memento and Linked Data
• First Generation DBpedia Archive
• Devising Affordable/Useful Linked Data Archives
• Intermezzo: Triple Pattern Fragments (TPF)
• Intermezzo: Binary RDF Representation (HDT)
• Devising Affordable/Useful Linked Data Archives
• Second Generation DBpedia Archive
• Try this At Home
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Building a Linked Data Archive
• Convert the archival data set(s) to HDT using HDT-CPP
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
HDT Software (C++)
https://github.com/rdfhdt/hdt-cpp
• input data requires cleaning
before processing, especially
regarding URI characters
• DBpedia data not clean
• DBpedia v3.5 was not
successfully processed
• No meaningful error
messages to help locate
problems
• memory intensive
• Kyoto Cabinet was used
to optimize storage
requirement and speed
during processing
• Java version exists but has
memory problems
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Building a Linked Data Archive
• Convert the archival data set(s) to HDT using HDT-CPP
• Download the Triple Fragment Server code
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Linked Data Fragment Server (Node.js)
https://github.com/LinkedDataFragments/Server.js
• provides ?s?p?o access to
local and/or remote Linked
Data sets
• supports HDT, Turtle files, N-
Triple files, JSON-LD files,
SPARQL endpoints, in-
memory store, and
BlazeGraph Linked Data sets
• version 2.0.0 (released March
31 2016) has built-in Memento
support
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Building a Linked Data Archive
• Convert the archival data set(s) to HDT using HDT-CPP
• Download the Triple Fragment Server code
• Create the JSON config file for Memento
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Linked Data Fragment Server, Memento Configuration
https://github.com/LinkedDataFragments/Server.js/wiki/Configuring-Memento
• declare archival data set(s)
• add datetime ranges for the
archival data set(s)
• add a TimeGate
• list the archival data set(s) for
which the TimeGate should
support datetime negotiation
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Building a Linked Data Archive
• Convert the archival data set(s) to HDT using HDT-CPP
• Download the Triple Fragment Server code
• Create the JSON config file for Memento
• Run the server
Herbert Van de Sompel & Miel Vander Sande
CNI Spring Meeting, San Antonio, TX, April 5 2016
Herbert Van de Sompel
@hvdsomp
Los Alamos National Laboratory
Acknowledgments: Lyudmila Balakireva, Harihar Shankar, Ruben Verborgh
Access to DBpedia Versions using
Memento and Triple Pattern Fragments
Miel Vander Sande
@Miel_vds
Ghent University

More Related Content

What's hot

Linked Open Data for Libraries
Linked Open Data for LibrariesLinked Open Data for Libraries
Linked Open Data for LibrariesLukas Koster
 
SWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic WebSWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic WebPascal-Nicolas Becker
 
towards interoperable archives: the Universal Preprint Service initiative
towards interoperable archives:  the Universal Preprint Service initiativetowards interoperable archives:  the Universal Preprint Service initiative
towards interoperable archives: the Universal Preprint Service initiativeHerbert Van de Sompel
 
Intro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & MuseumsIntro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & MuseumsJon Voss
 
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...What is #LODLAM?! Understanding linked open data in libraries, archives [and ...
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...Alison Hitchens
 
FAIR Signposting: A KISS Approach to a Burning Issue
FAIR Signposting: A KISS Approach to a Burning IssueFAIR Signposting: A KISS Approach to a Burning Issue
FAIR Signposting: A KISS Approach to a Burning IssueHerbert Van de Sompel
 
Interoperability for web based scholarship
Interoperability for web based scholarshipInteroperability for web based scholarship
Interoperability for web based scholarshipHerbert Van de Sompel
 
Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.Jon Voss
 
Web Data Management with RDF
Web Data Management with RDFWeb Data Management with RDF
Web Data Management with RDFM. Tamer Özsu
 
Linked Data at ISAW: How and Why
Linked Data at ISAW: How and WhyLinked Data at ISAW: How and Why
Linked Data at ISAW: How and Whyparegorios
 
The SFX Framework for Context-Sensitive Reference Linking
The SFX Framework for  Context-Sensitive Reference LinkingThe SFX Framework for  Context-Sensitive Reference Linking
The SFX Framework for Context-Sensitive Reference LinkingHerbert Van de Sompel
 
Linked Data in Libraries
Linked Data in LibrariesLinked Data in Libraries
Linked Data in LibrariesCarl Hess
 
Linked open data and libraries
Linked open data and librariesLinked open data and libraries
Linked open data and librariesAlison Hitchens
 
What is #LODLAM?! (revised January 2015)
What is #LODLAM?! (revised January 2015)What is #LODLAM?! (revised January 2015)
What is #LODLAM?! (revised January 2015)Alison Hitchens
 
Web Data Management in the RDF Age
Web Data Management in the RDF AgeWeb Data Management in the RDF Age
Web Data Management in the RDF AgeM. Tamer Özsu
 
Why do they call it Linked Data when they want to say...?
Why do they call it Linked Data when they want to say...?Why do they call it Linked Data when they want to say...?
Why do they call it Linked Data when they want to say...?Oscar Corcho
 

What's hot (20)

Linked Open Data for Libraries
Linked Open Data for LibrariesLinked Open Data for Libraries
Linked Open Data for Libraries
 
SWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic WebSWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic Web
 
towards interoperable archives: the Universal Preprint Service initiative
towards interoperable archives:  the Universal Preprint Service initiativetowards interoperable archives:  the Universal Preprint Service initiative
towards interoperable archives: the Universal Preprint Service initiative
 
Intro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & MuseumsIntro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & Museums
 
ResourceSync Quick Overview
ResourceSync Quick OverviewResourceSync Quick Overview
ResourceSync Quick Overview
 
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...What is #LODLAM?! Understanding linked open data in libraries, archives [and ...
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...
 
A Clean Slate?
A Clean Slate?A Clean Slate?
A Clean Slate?
 
FAIR Signposting: A KISS Approach to a Burning Issue
FAIR Signposting: A KISS Approach to a Burning IssueFAIR Signposting: A KISS Approach to a Burning Issue
FAIR Signposting: A KISS Approach to a Burning Issue
 
Interoperability for web based scholarship
Interoperability for web based scholarshipInteroperability for web based scholarship
Interoperability for web based scholarship
 
Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.
 
Web Data Management with RDF
Web Data Management with RDFWeb Data Management with RDF
Web Data Management with RDF
 
Linked Data at ISAW: How and Why
Linked Data at ISAW: How and WhyLinked Data at ISAW: How and Why
Linked Data at ISAW: How and Why
 
The SFX Framework for Context-Sensitive Reference Linking
The SFX Framework for  Context-Sensitive Reference LinkingThe SFX Framework for  Context-Sensitive Reference Linking
The SFX Framework for Context-Sensitive Reference Linking
 
The aDORe Federation Architecture
The aDORe Federation ArchitectureThe aDORe Federation Architecture
The aDORe Federation Architecture
 
DBpedia InsideOut
DBpedia InsideOutDBpedia InsideOut
DBpedia InsideOut
 
Linked Data in Libraries
Linked Data in LibrariesLinked Data in Libraries
Linked Data in Libraries
 
Linked open data and libraries
Linked open data and librariesLinked open data and libraries
Linked open data and libraries
 
What is #LODLAM?! (revised January 2015)
What is #LODLAM?! (revised January 2015)What is #LODLAM?! (revised January 2015)
What is #LODLAM?! (revised January 2015)
 
Web Data Management in the RDF Age
Web Data Management in the RDF AgeWeb Data Management in the RDF Age
Web Data Management in the RDF Age
 
Why do they call it Linked Data when they want to say...?
Why do they call it Linked Data when they want to say...?Why do they call it Linked Data when they want to say...?
Why do they call it Linked Data when they want to say...?
 

Viewers also liked

The bX project: Federating and Mining Usage Logs from Linking Servers
The bX project: Federating and Mining Usage Logs from Linking ServersThe bX project: Federating and Mining Usage Logs from Linking Servers
The bX project: Federating and Mining Usage Logs from Linking ServersHerbert Van de Sompel
 
Open Archives Initiative Object Re-Use & Exchange
Open Archives Initiative Object Re-Use & ExchangeOpen Archives Initiative Object Re-Use & Exchange
Open Archives Initiative Object Re-Use & ExchangeHerbert Van de Sompel
 
Augmenting interoperability across scholarly repositories
Augmenting interoperability across scholarly repositoriesAugmenting interoperability across scholarly repositories
Augmenting interoperability across scholarly repositoriesHerbert Van de Sompel
 
Attempts at innovation in scholarly communication
Attempts at innovation in scholarly communicationAttempts at innovation in scholarly communication
Attempts at innovation in scholarly communicationHerbert Van de Sompel
 
An Overview of the OAI Object Reuse and Exchange Interoperability Framework
An Overview of the OAI Object Reuse and Exchange Interoperability FrameworkAn Overview of the OAI Object Reuse and Exchange Interoperability Framework
An Overview of the OAI Object Reuse and Exchange Interoperability FrameworkHerbert Van de Sompel
 
MESUR: Making sense and use of usage data
MESUR: Making sense and use of usage dataMESUR: Making sense and use of usage data
MESUR: Making sense and use of usage dataHerbert Van de Sompel
 
An HTTP-Based Versioning Mechanism for Linked Data
An HTTP-Based Versioning Mechanism for Linked DataAn HTTP-Based Versioning Mechanism for Linked Data
An HTTP-Based Versioning Mechanism for Linked DataHerbert Van de Sompel
 
The Web as infrastructure for scholarly research and communication
The Web as infrastructure for scholarly research and communicationThe Web as infrastructure for scholarly research and communication
The Web as infrastructure for scholarly research and communicationHerbert Van de Sompel
 
Motivation, inspiration and innovation from frustration
Motivation, inspiration and innovation from frustrationMotivation, inspiration and innovation from frustration
Motivation, inspiration and innovation from frustrationHerbert Van de Sompel
 
Memento: Big Leaps Towards Seamless Navigation of the Web of the Past
Memento: Big Leaps Towards Seamless Navigation of the Web of the PastMemento: Big Leaps Towards Seamless Navigation of the Web of the Past
Memento: Big Leaps Towards Seamless Navigation of the Web of the PastHerbert Van de Sompel
 
Towards a Machine-Actionable Scholarly Communication System
Towards a Machine-Actionable Scholarly Communication SystemTowards a Machine-Actionable Scholarly Communication System
Towards a Machine-Actionable Scholarly Communication SystemHerbert Van de Sompel
 
The OAI-ORE Interoperability Framework in the Context of the Current Scholarl...
The OAI-ORE Interoperability Framework in the Context of the Current Scholarl...The OAI-ORE Interoperability Framework in the Context of the Current Scholarl...
The OAI-ORE Interoperability Framework in the Context of the Current Scholarl...Herbert Van de Sompel
 
Time travelling through DBpedia
Time travelling through DBpediaTime travelling through DBpedia
Time travelling through DBpediaMiel Vander Sande
 
Persistent Identifiers and the Web: The Need for an Unambiguous Mapping
Persistent Identifiers and the Web: The Need for an Unambiguous MappingPersistent Identifiers and the Web: The Need for an Unambiguous Mapping
Persistent Identifiers and the Web: The Need for an Unambiguous MappingHerbert Van de Sompel
 

Viewers also liked (19)

The bX project: Federating and Mining Usage Logs from Linking Servers
The bX project: Federating and Mining Usage Logs from Linking ServersThe bX project: Federating and Mining Usage Logs from Linking Servers
The bX project: Federating and Mining Usage Logs from Linking Servers
 
The Roof is on Fire
The Roof is on FireThe Roof is on Fire
The Roof is on Fire
 
Open Archives Initiative Object Re-Use & Exchange
Open Archives Initiative Object Re-Use & ExchangeOpen Archives Initiative Object Re-Use & Exchange
Open Archives Initiative Object Re-Use & Exchange
 
The djatoka Image Server
The djatoka Image ServerThe djatoka Image Server
The djatoka Image Server
 
Augmenting interoperability across scholarly repositories
Augmenting interoperability across scholarly repositoriesAugmenting interoperability across scholarly repositories
Augmenting interoperability across scholarly repositories
 
Attempts at innovation in scholarly communication
Attempts at innovation in scholarly communicationAttempts at innovation in scholarly communication
Attempts at innovation in scholarly communication
 
An Overview of the OAI Object Reuse and Exchange Interoperability Framework
An Overview of the OAI Object Reuse and Exchange Interoperability FrameworkAn Overview of the OAI Object Reuse and Exchange Interoperability Framework
An Overview of the OAI Object Reuse and Exchange Interoperability Framework
 
the UPS protoproto project
the UPS protoproto projectthe UPS protoproto project
the UPS protoproto project
 
MESUR: Making sense and use of usage data
MESUR: Making sense and use of usage dataMESUR: Making sense and use of usage data
MESUR: Making sense and use of usage data
 
An HTTP-Based Versioning Mechanism for Linked Data
An HTTP-Based Versioning Mechanism for Linked DataAn HTTP-Based Versioning Mechanism for Linked Data
An HTTP-Based Versioning Mechanism for Linked Data
 
The Web as infrastructure for scholarly research and communication
The Web as infrastructure for scholarly research and communicationThe Web as infrastructure for scholarly research and communication
The Web as infrastructure for scholarly research and communication
 
Motivation, inspiration and innovation from frustration
Motivation, inspiration and innovation from frustrationMotivation, inspiration and innovation from frustration
Motivation, inspiration and innovation from frustration
 
Memento: Time Travel for the Web
Memento: Time Travel for the WebMemento: Time Travel for the Web
Memento: Time Travel for the Web
 
Memento: Big Leaps Towards Seamless Navigation of the Web of the Past
Memento: Big Leaps Towards Seamless Navigation of the Web of the PastMemento: Big Leaps Towards Seamless Navigation of the Web of the Past
Memento: Big Leaps Towards Seamless Navigation of the Web of the Past
 
Towards a Machine-Actionable Scholarly Communication System
Towards a Machine-Actionable Scholarly Communication SystemTowards a Machine-Actionable Scholarly Communication System
Towards a Machine-Actionable Scholarly Communication System
 
The OAI-ORE Interoperability Framework in the Context of the Current Scholarl...
The OAI-ORE Interoperability Framework in the Context of the Current Scholarl...The OAI-ORE Interoperability Framework in the Context of the Current Scholarl...
The OAI-ORE Interoperability Framework in the Context of the Current Scholarl...
 
Untitled I: Challenges ahead
Untitled I: Challenges aheadUntitled I: Challenges ahead
Untitled I: Challenges ahead
 
Time travelling through DBpedia
Time travelling through DBpediaTime travelling through DBpedia
Time travelling through DBpedia
 
Persistent Identifiers and the Web: The Need for an Unambiguous Mapping
Persistent Identifiers and the Web: The Need for an Unambiguous MappingPersistent Identifiers and the Web: The Need for an Unambiguous Mapping
Persistent Identifiers and the Web: The Need for an Unambiguous Mapping
 

Similar to DBpedia Archive using Memento, Triple Pattern Fragments, and HDT

Illuminating DSpace's Linked Data Support
Illuminating DSpace's Linked Data SupportIlluminating DSpace's Linked Data Support
Illuminating DSpace's Linked Data SupportPascal-Nicolas Becker
 
Annotating Scholarly Resources
Annotating Scholarly ResourcesAnnotating Scholarly Resources
Annotating Scholarly ResourcesRobert Sanderson
 
Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)Anja Jentzsch
 
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...Cory Lampert
 
First steps towards publishing library data on the semantic web
First steps towards publishing library data on the semantic webFirst steps towards publishing library data on the semantic web
First steps towards publishing library data on the semantic webhorvadam
 
Open Science Days 2014 - Becker - Repositories and Linked Data
Open Science Days 2014 - Becker - Repositories and Linked DataOpen Science Days 2014 - Becker - Repositories and Linked Data
Open Science Days 2014 - Becker - Repositories and Linked DataPascal-Nicolas Becker
 
Publishing and Using Linked Open Data - Day 1
Publishing and Using Linked Open Data - Day 1 Publishing and Using Linked Open Data - Day 1
Publishing and Using Linked Open Data - Day 1 Richard Urban
 
ResourceSync: Web-Based Resource Synchronization
ResourceSync: Web-Based Resource SynchronizationResourceSync: Web-Based Resource Synchronization
ResourceSync: Web-Based Resource SynchronizationHerbert Van de Sompel
 
An Introduction to Semantic Web Technology
An Introduction to Semantic Web TechnologyAn Introduction to Semantic Web Technology
An Introduction to Semantic Web TechnologyAnkur Biswas
 
RDF and Open Linked Data, a first approach
RDF and Open Linked Data, a first approachRDF and Open Linked Data, a first approach
RDF and Open Linked Data, a first approachhorvadam
 
Data Archiving and Sharing
Data Archiving and SharingData Archiving and Sharing
Data Archiving and SharingC. Tobin Magle
 
Materials Data Facility: Streamlined and automated data sharing, discovery, ...
Materials Data Facility: Streamlined and automated data sharing,  discovery, ...Materials Data Facility: Streamlined and automated data sharing,  discovery, ...
Materials Data Facility: Streamlined and automated data sharing, discovery, ...Ian Foster
 
Linked data presentation for libraries (COMO)
Linked data presentation for libraries (COMO)Linked data presentation for libraries (COMO)
Linked data presentation for libraries (COMO)robin fay
 

Similar to DBpedia Archive using Memento, Triple Pattern Fragments, and HDT (20)

NISO Forum, Denver, September 24, 2012: ResourceSync: Web-Based Resource Sync...
NISO Forum, Denver, September 24, 2012: ResourceSync: Web-Based Resource Sync...NISO Forum, Denver, September 24, 2012: ResourceSync: Web-Based Resource Sync...
NISO Forum, Denver, September 24, 2012: ResourceSync: Web-Based Resource Sync...
 
Illuminating DSpace's Linked Data Support
Illuminating DSpace's Linked Data SupportIlluminating DSpace's Linked Data Support
Illuminating DSpace's Linked Data Support
 
Annotating Scholarly Resources
Annotating Scholarly ResourcesAnnotating Scholarly Resources
Annotating Scholarly Resources
 
Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)
 
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
 
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early AdoptersApril 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
 
First steps towards publishing library data on the semantic web
First steps towards publishing library data on the semantic webFirst steps towards publishing library data on the semantic web
First steps towards publishing library data on the semantic web
 
Open Science Days 2014 - Becker - Repositories and Linked Data
Open Science Days 2014 - Becker - Repositories and Linked DataOpen Science Days 2014 - Becker - Repositories and Linked Data
Open Science Days 2014 - Becker - Repositories and Linked Data
 
Linked Data
Linked DataLinked Data
Linked Data
 
Publishing and Using Linked Open Data - Day 1
Publishing and Using Linked Open Data - Day 1 Publishing and Using Linked Open Data - Day 1
Publishing and Using Linked Open Data - Day 1
 
ResourceSync: Web-Based Resource Synchronization
ResourceSync: Web-Based Resource SynchronizationResourceSync: Web-Based Resource Synchronization
ResourceSync: Web-Based Resource Synchronization
 
Web of Data Usage Mining
Web of Data Usage MiningWeb of Data Usage Mining
Web of Data Usage Mining
 
An Introduction to Semantic Web Technology
An Introduction to Semantic Web TechnologyAn Introduction to Semantic Web Technology
An Introduction to Semantic Web Technology
 
RDF and Open Linked Data, a first approach
RDF and Open Linked Data, a first approachRDF and Open Linked Data, a first approach
RDF and Open Linked Data, a first approach
 
Linked Open Data
Linked Open DataLinked Open Data
Linked Open Data
 
Data Archiving and Sharing
Data Archiving and SharingData Archiving and Sharing
Data Archiving and Sharing
 
ResourceSync Overview
ResourceSync OverviewResourceSync Overview
ResourceSync Overview
 
Materials Data Facility: Streamlined and automated data sharing, discovery, ...
Materials Data Facility: Streamlined and automated data sharing,  discovery, ...Materials Data Facility: Streamlined and automated data sharing,  discovery, ...
Materials Data Facility: Streamlined and automated data sharing, discovery, ...
 
Linked data presentation for libraries (COMO)
Linked data presentation for libraries (COMO)Linked data presentation for libraries (COMO)
Linked data presentation for libraries (COMO)
 
NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...
NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...
NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...
 

More from Herbert Van de Sompel

The web is rotting and what to do about it
The web is rotting and what to do about itThe web is rotting and what to do about it
The web is rotting and what to do about itHerbert Van de Sompel
 
Researcher Pod: Scholarly Communication Using the Decentralized Web
Researcher Pod: Scholarly Communication Using the Decentralized WebResearcher Pod: Scholarly Communication Using the Decentralized Web
Researcher Pod: Scholarly Communication Using the Decentralized WebHerbert Van de Sompel
 
Persistent Identification: Easier Said than Done
Persistent Identification: Easier Said than DonePersistent Identification: Easier Said than Done
Persistent Identification: Easier Said than DoneHerbert Van de Sompel
 
Registration / Certification Interoperability Architecture (overlay peer-review)
Registration / Certification Interoperability Architecture (overlay peer-review)Registration / Certification Interoperability Architecture (overlay peer-review)
Registration / Certification Interoperability Architecture (overlay peer-review)Herbert Van de Sompel
 
Collecting the organizational scholarly record
Collecting the organizational scholarly recordCollecting the organizational scholarly record
Collecting the organizational scholarly recordHerbert Van de Sompel
 
Achieving Link Integrity for Managed Collections
Achieving Link Integrity for Managed CollectionsAchieving Link Integrity for Managed Collections
Achieving Link Integrity for Managed CollectionsHerbert Van de Sompel
 
Signposting Overview (Version November 2017)
Signposting Overview (Version November 2017)Signposting Overview (Version November 2017)
Signposting Overview (Version November 2017)Herbert Van de Sompel
 
Paint-Yourself-In-The-Corner Infrastructure
Paint-Yourself-In-The-Corner InfrastructurePaint-Yourself-In-The-Corner Infrastructure
Paint-Yourself-In-The-Corner InfrastructureHerbert Van de Sompel
 
ResourceSync: Conceptual and Technical Problem Perspective
ResourceSync: Conceptual and Technical Problem PerspectiveResourceSync: Conceptual and Technical Problem Perspective
ResourceSync: Conceptual and Technical Problem PerspectiveHerbert Van de Sompel
 

More from Herbert Van de Sompel (15)

The web is rotting and what to do about it
The web is rotting and what to do about itThe web is rotting and what to do about it
The web is rotting and what to do about it
 
Researcher Pod: Scholarly Communication Using the Decentralized Web
Researcher Pod: Scholarly Communication Using the Decentralized WebResearcher Pod: Scholarly Communication Using the Decentralized Web
Researcher Pod: Scholarly Communication Using the Decentralized Web
 
Persistent Identification: Easier Said than Done
Persistent Identification: Easier Said than DonePersistent Identification: Easier Said than Done
Persistent Identification: Easier Said than Done
 
Registration / Certification Interoperability Architecture (overlay peer-review)
Registration / Certification Interoperability Architecture (overlay peer-review)Registration / Certification Interoperability Architecture (overlay peer-review)
Registration / Certification Interoperability Architecture (overlay peer-review)
 
Collecting the organizational scholarly record
Collecting the organizational scholarly recordCollecting the organizational scholarly record
Collecting the organizational scholarly record
 
To the Rescue of Scholarly Orphans
To the Rescue of Scholarly OrphansTo the Rescue of Scholarly Orphans
To the Rescue of Scholarly Orphans
 
Almost two decades at LANL
Almost two decades at LANLAlmost two decades at LANL
Almost two decades at LANL
 
Perseverance on Persistence
Perseverance on PersistencePerseverance on Persistence
Perseverance on Persistence
 
Paul Evan Peters Lecture
Paul Evan Peters LecturePaul Evan Peters Lecture
Paul Evan Peters Lecture
 
Achieving Link Integrity for Managed Collections
Achieving Link Integrity for Managed CollectionsAchieving Link Integrity for Managed Collections
Achieving Link Integrity for Managed Collections
 
Signposting Overview (Version November 2017)
Signposting Overview (Version November 2017)Signposting Overview (Version November 2017)
Signposting Overview (Version November 2017)
 
Signposting Overview
Signposting OverviewSignposting Overview
Signposting Overview
 
ResourceSync tutorial OAI8
ResourceSync tutorial OAI8ResourceSync tutorial OAI8
ResourceSync tutorial OAI8
 
Paint-Yourself-In-The-Corner Infrastructure
Paint-Yourself-In-The-Corner InfrastructurePaint-Yourself-In-The-Corner Infrastructure
Paint-Yourself-In-The-Corner Infrastructure
 
ResourceSync: Conceptual and Technical Problem Perspective
ResourceSync: Conceptual and Technical Problem PerspectiveResourceSync: Conceptual and Technical Problem Perspective
ResourceSync: Conceptual and Technical Problem Perspective
 

Recently uploaded

SCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is prediSCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is predieusebiomeyer
 
PHP-based rendering of TYPO3 Documentation
PHP-based rendering of TYPO3 DocumentationPHP-based rendering of TYPO3 Documentation
PHP-based rendering of TYPO3 DocumentationLinaWolf1
 
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)Christopher H Felton
 
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一Fs
 
Film cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasaFilm cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasa494f574xmv
 
Font Performance - NYC WebPerf Meetup April '24
Font Performance - NYC WebPerf Meetup April '24Font Performance - NYC WebPerf Meetup April '24
Font Performance - NYC WebPerf Meetup April '24Paul Calvano
 
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书rnrncn29
 
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一Fs
 
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一Fs
 
Git and Github workshop GDSC MLRITM
Git and Github  workshop GDSC MLRITMGit and Github  workshop GDSC MLRITM
Git and Github workshop GDSC MLRITMgdsc13
 
Blepharitis inflammation of eyelid symptoms cause everything included along w...
Blepharitis inflammation of eyelid symptoms cause everything included along w...Blepharitis inflammation of eyelid symptoms cause everything included along w...
Blepharitis inflammation of eyelid symptoms cause everything included along w...Excelmac1
 
Elevate Your Business with Our IT Expertise in New Orleans
Elevate Your Business with Our IT Expertise in New OrleansElevate Your Business with Our IT Expertise in New Orleans
Elevate Your Business with Our IT Expertise in New Orleanscorenetworkseo
 
Contact Rya Baby for Call Girls New Delhi
Contact Rya Baby for Call Girls New DelhiContact Rya Baby for Call Girls New Delhi
Contact Rya Baby for Call Girls New Delhimiss dipika
 
办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一
办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一
办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一z xss
 
Top 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxTop 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxDyna Gilbert
 
Q4-1-Illustrating-Hypothesis-Testing.pptx
Q4-1-Illustrating-Hypothesis-Testing.pptxQ4-1-Illustrating-Hypothesis-Testing.pptx
Q4-1-Illustrating-Hypothesis-Testing.pptxeditsforyah
 
Call Girls Near The Suryaa Hotel New Delhi 9873777170
Call Girls Near The Suryaa Hotel New Delhi 9873777170Call Girls Near The Suryaa Hotel New Delhi 9873777170
Call Girls Near The Suryaa Hotel New Delhi 9873777170Sonam Pathan
 
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书rnrncn29
 
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作ys8omjxb
 

Recently uploaded (20)

SCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is prediSCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is predi
 
PHP-based rendering of TYPO3 Documentation
PHP-based rendering of TYPO3 DocumentationPHP-based rendering of TYPO3 Documentation
PHP-based rendering of TYPO3 Documentation
 
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
 
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
 
Film cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasaFilm cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasa
 
Font Performance - NYC WebPerf Meetup April '24
Font Performance - NYC WebPerf Meetup April '24Font Performance - NYC WebPerf Meetup April '24
Font Performance - NYC WebPerf Meetup April '24
 
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
 
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
 
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
 
Git and Github workshop GDSC MLRITM
Git and Github  workshop GDSC MLRITMGit and Github  workshop GDSC MLRITM
Git and Github workshop GDSC MLRITM
 
young call girls in Uttam Nagar🔝 9953056974 🔝 Delhi escort Service
young call girls in Uttam Nagar🔝 9953056974 🔝 Delhi escort Serviceyoung call girls in Uttam Nagar🔝 9953056974 🔝 Delhi escort Service
young call girls in Uttam Nagar🔝 9953056974 🔝 Delhi escort Service
 
Blepharitis inflammation of eyelid symptoms cause everything included along w...
Blepharitis inflammation of eyelid symptoms cause everything included along w...Blepharitis inflammation of eyelid symptoms cause everything included along w...
Blepharitis inflammation of eyelid symptoms cause everything included along w...
 
Elevate Your Business with Our IT Expertise in New Orleans
Elevate Your Business with Our IT Expertise in New OrleansElevate Your Business with Our IT Expertise in New Orleans
Elevate Your Business with Our IT Expertise in New Orleans
 
Contact Rya Baby for Call Girls New Delhi
Contact Rya Baby for Call Girls New DelhiContact Rya Baby for Call Girls New Delhi
Contact Rya Baby for Call Girls New Delhi
 
办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一
办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一
办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一
 
Top 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxTop 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptx
 
Q4-1-Illustrating-Hypothesis-Testing.pptx
Q4-1-Illustrating-Hypothesis-Testing.pptxQ4-1-Illustrating-Hypothesis-Testing.pptx
Q4-1-Illustrating-Hypothesis-Testing.pptx
 
Call Girls Near The Suryaa Hotel New Delhi 9873777170
Call Girls Near The Suryaa Hotel New Delhi 9873777170Call Girls Near The Suryaa Hotel New Delhi 9873777170
Call Girls Near The Suryaa Hotel New Delhi 9873777170
 
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
 
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
 

DBpedia Archive using Memento, Triple Pattern Fragments, and HDT

  • 1. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Herbert Van de Sompel @hvdsomp Los Alamos National Laboratory Acknowledgments: Lyudmila Balakireva, Harihar Shankar, Ruben Verborgh Access to DBpedia Versions using Memento and Triple Pattern Fragments Miel Vander Sande @Miel_vds Ghent University
  • 2. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Outline • Prelude: Memento and Linked Data • First Generation DBpedia Archive • Devising Affordable/Useful Linked Data Archives • Intermezzo: Triple Pattern Fragments (TPF) • Intermezzo: Binary RDF Representation (HDT) • Devising Affordable/Useful Linked Data Archives • Second Generation DBpedia Archive • Try this At Home
  • 3. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Outline • Prelude: Memento and Linked Data • First Generation DBpedia Archive • Devising Affordable/Useful Linked Data Archives • Intermezzo: Triple Pattern Fragments (TPF) • Intermezzo: Binary RDF Representation (HDT) • Devising Affordable/Useful Linked Data Archives • Second Generation DBpedia Archive • Try this At Home
  • 4. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Memento Framework
  • 5. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Memento LDOW 2010 Submission Herbert Van de Sompel et al. (2010) An HTTP-Based Versioning Mechanism for Linked Data http://arxiv.org/abs/1003.3661
  • 6. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Memento and Linked Data
  • 7. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Memento and Linked Data
  • 8. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Time-Series Analysis across DBpedia Versions Data collected through “follow your nose” HTTP Navigation
  • 9. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Outline • Prelude: Memento and Linked Data • First Generation DBpedia Archive • Devising Affordable/Useful Linked Data Archives • Intermezzo: Triple Pattern Fragments (TPF) • Intermezzo: Binary RDF Representation (HDT) • Devising Affordable/Useful Linked Data Archives • Second Generation DBpedia Archive • Try this At Home
  • 10. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 First Generation DBpedia Archive: Storage
  • 11. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 First Generation DBpedia Archive: Storage Characteristics upload software custom upload time ~ 24 hours per version storage software MongoDB storage space 383 Gb for 10 versions DBpedia versions 10 versions: 2.0 through 3.9 number of triples ~ 3 billion
  • 12. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 First Generation DBpedia Archive: Subject-URI Access
  • 13. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 First Generation DBpedia Archive: Subject-URI Access http://dbpedia.mementodepot.org/memento/2009052/http://dbpedia.org/page/Oaxaca
  • 14. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 First Generation DBpedia Archive: Subject-URI Access Characteristics TimeGate software custom access type Subject URI & datetime external integration current DBpedia clients • all clients: direct access to Memento Subject-URI • Memento clients: datetime negotiation with Subject-URI
  • 15. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 DBpedia Archive @ LANL Since 2010 • Access based on Subject-URI (DBpedia Topic URI) only • MongoDB storage • A blob per Subject-URI per version • Dynamically transformed to other RDF serializations • No updates since version 3.9 (2013) of DBpedia as a result of scalability problems !!! !!!
  • 16. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Outline • Prelude: Memento and Linked Data • First Generation DBpedia Archive • Devising Affordable/Useful Linked Data Archives • Intermezzo: Triple Pattern Fragments (TPF) • Intermezzo: Binary RDF Representation (HDT) • Devising Affordable/Useful Linked Data Archives • Second Generation DBpedia Archive • Try this At Home
  • 17. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Affordable & Useful Linked Data Archives • A Linked Data Archive consists of temporal snapshots of one or more Linked Data sets, whereby each temporal snapshot reflects the state of a Linked Data set at a specific moment or interval in time. • How to make Linked Data Archives accessible in a manner that is • affordable/sustainable for the publisher • useful for the consumer
  • 18. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Linked Data Archive: Characteristics General Characteristics Publisher Consumer Availability Bandwidth Cost Functionality Interface Expressiveness LOD Integration Memento Support Cross Time/Data Verdict: • Publication perspective: $$$$ • Access perspective: ++++
  • 19. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Linked Data Publishing • The typical ways of publishing Linked Data on the Web: • Subject URI access • Data dump • SPARQL endpoint Let’s consider these from the perspective of Linked Data Archives, i.e. archival storage and access
  • 20. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Linked Data Archive with Subject-URI Access • For each temporal snapshot of a Linked Data set, and for each Subject in that snapshot, publish an RDF description (of the Subject) at a URI that is specific per snapshot/subject
  • 21. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Linked Data Archive with Subject-URI Access: Characteristics General Characteristics Publisher Consumer Availability rather high rather high Bandwidth ~ description ~ description Cost rather low rather high Functionality Interface Expressiveness rather low LOD Integration yes Memento Support possible Cross Time/Data follow your nose Verdict: • Publication perspective: $$$$ • Access perspective: ++++
  • 22. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Linked Data Archive Using Dumps • Renders each temporal snapshot of a Linked Data set as a data dump that places all temporal dataset triples (as they were at a specific moment in time) into one or more files
  • 23. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Linked Data Archive Using Dumps: Characteristics General Characteristics Publisher Consumer Availability high high Bandwidth high high Cost low high Functionality Interface Expressiveness download dataset LOD Integration no Memento Support not possible Cross Time/Data download various datasets Verdict: • Publication perspective: $$$$ • Access perspective: ++++
  • 24. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Linked Data Archive with SPARQL Endpoint(s) • For each temporal snapshot of a Linked Data set, supports arbitrary SPARQL queries. • Different architectural set-ups possible; no standard approach
  • 25. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Linked Data Archive Using SPARQL Endpoint(s): Characteristics General Characteristics Publisher Consumer Availability problematic problematic Bandwidth ~ query ~ query Cost high low Functionality Interface Expressiveness highly expressive LOD Integration no Memento Support hard Cross Time/Data custom distributed queries Verdict: • Publication perspective: $$$$ • Access perspective: ++++
  • 26. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Affordable & Useful Linked Data Archives Linked Data Archive Type Publishing Consuming Data Dump $$$$ ++++ SPARQL Endpoint(s) $$$$ ++++ Subject URI Access $$$$ ++++
  • 27. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Outline • Prelude: Memento and Linked Data • First Generation DBpedia Archive • Devising Affordable/Useful Linked Data Archives • Intermezzo: Triple Pattern Fragments (TPF) • Intermezzo: Binary RDF Representation (HDT) • Devising Affordable/Useful Linked Data Archives • Second Generation DBpedia Archive • Try this At Home
  • 28. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Linked Data Fragments (Ghent U) • Every Linked Data interface offers specific fragments of a Linked Data set • A fragment is described by • Selector: what questions can I ask? • Controls: how do I get more fragments? • Metadata: helpful information for consumption? • Each interface type comes with tradeoffs • cf. the analysis thus far http://linkeddatafragments.org Verborgh, R. et al. (2014) Querying datsets on the web with high availability. ISWC 2014 http://ruben.verborgh.org/publications/verborgh_iswc_2014/
  • 29. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Triple Pattern Fragments (Ghent U) • Triple Pattern Fragments is a new interface with a different set of tradeoffs that are attractive from an archival perspective http://www.hydra-cg.com/spec/latest/triple-pattern-fragments/
  • 30. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Triple Pattern Fragments (Ghent U) • Allows querying a Linked Data set according to ?Subject ?Predicate ?Object patterns
  • 31. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Triple Pattern Fragments (Ghent U) Controls: Responses provide navigational help for clients • Based on emerging Hydra vocabulary for self-describing Hypermedia-Driven Web APIs Metadata: dataset info, estimated count (to aid client applications) http://www.hydra-cg.com/spec/latest/core/
  • 32. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Outline • Prelude: Memento and Linked Data • First Generation DBpedia Archive • Devising Affordable/Useful Linked Data Archives • Intermezzo: Triple Pattern Fragments (TPF) • Intermezzo: Binary RDF Representation (HDT) • Devising Affordable/Useful Linked Data Archives • Second Generation DBpedia Archive • Try this At Home
  • 33. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Binary RDF Representation for Publication and Exchange (HDT) http://www.w3.org/Submission/HDT/
  • 34. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Binary RDF Representation for Publication and Exchange (HDT) http://www.w3.org/Submission/HDT/ • Header-Dictionary-Triple (HDT) is a compact, binary representation of RDF datasets.
  • 35. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Binary RDF Representation for Publication and Exchange (HDT) http://www.w3.org/Submission/HDT/ • Able to represent massive data sets • Dictionary/Triples structure achieves • rapid search for ?subject ?predicate ?object pattern • high compression rates • Header provides metadata about the dataset
  • 36. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Outline • Prelude: Memento and Linked Data • First Generation DBpedia Archive • Devising Affordable/Useful Linked Data Archives • Intermezzo: Triple Pattern Fragments (TPF) • Intermezzo: Binary RDF Representation (HDT) • Devising Affordable/Useful Linked Data Archives • Second Generation DBpedia Archive • Try this At Home
  • 37. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 HDT Linked Data Archive with TPF Support • For each temporal snapshot of a Linked Data set, generate an HDT serialization that provides access according to ?subject ?predicate ?object patterns
  • 38. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Linked Data Archive with ?s?p?o Access: Characteristics General Characteristics Publisher Consumer Availability high high Bandwidth ~ query ~ query Cost low medium Functionality Interface Expressiveness better than subject-URI only LOD Integration yes Memento Support possible Cross Time/Data follow your nose Verdict: • Publication perspective: $$$$ • Access perspective: ++++
  • 39. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Affordable & Useful Linked Data Archives Linked Data Archive Type Publishing Consuming Data Dump $$$$ ++++ SPARQL Endpoint(s) $$$$ ++++ Subject URI Access $$$$ ++++ HDT & TPF $$$$ ++++
  • 40. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Outline • Prelude: Memento and Linked Data • First Generation DBpedia Archive • Devising Affordable/Useful Linked Data Archives • Intermezzo: Triple Pattern Fragments (TPF) • Intermezzo: Binary RDF Representation (HDT) • Devising Affordable/Useful Linked Data Archives • Second Generation DBpedia Archive • Try this At Home
  • 41. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Second Generation DBpedia Archive: Storage
  • 42. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Second Generation DBpedia Archive: Storage Characteristics upload software HDT-CPP upload time ~ 4 hours per version storage software HDT binary files storage space 70 Gb for 12 versions DBpedia versions 12 versions: 2.0 through 2015 number of triples ~ 5 billion
  • 43. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Second Generation DBpedia Archive: ?s?p?o Query-URI Access
  • 44. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Second Generation DBpedia Archive: ?s?p?o Query-URI Access http://fragments.mementodepot.org/dbpedia_3_8?subject=&predicate=http://dbpedia.org/ontology/b irthPlace&object=http://dbpedia.org/resource/Ghent
  • 45. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Second Generation DBpedia Archive: ?s?p?o Query-URI Access ?s?p?o Query-URI Access TimeGate URI http://fragments.mementodepot.org/timegate/dbpedia? subject={DBpediaURI}&predicate={DBpediaURI}&object={DBpediaURI} http://fragments.mementodepot.org/timegate/dbpedia? subject=&predicate=&object=http://dbpedia.org/resource/Ghent TimeMap URI not supported Memento URI http://fragments.mementodepot.org/{DBpediaVersion}?subject={DBpediaURI }&predicate={DBpediaURI}&object={DBpediaURI} http://fragments.mementodepot.org/dbpedia_3_0? subject=&predicate=&object=http://dbpedia.org/resource/Ghent Further info http://mementoweb.org/depot/native/fragments/ Try it with Memento for Chrome – http://bit.ly/memento-for-chrome
  • 46. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Second Generation DBpedia Archive: Subject-URI Access
  • 47. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Second Generation DBpedia Archive: Subject-URI Access Subject-URI Access TimeGate URI http://dbpedia.mementodepot.org/timegate/{DBpediaURI} http://dbpedia.mementodepot.org/timegate/http://dbpedia.org/data/Ghent TimeMap URI http://dbpedia.mementodepot.org/timemap/link/{DBpediaURI} http://dbpedia.mementodepot.org/timemap/link/http://dbpedia.org/data/Ghent Memento URI http://dbpedia.mementodepot.org/{yyyymmdd}/{DBpediaURI} http://dbpedia.mementodepot.org/20080103/http://dbpedia.org/data/Ghent Further info http://mementoweb.org/depot/native/dbpedia/ Try it with Memento for Chrome – http://bit.ly/memento-for-chrome
  • 48. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Second Generation DBpedia Archive: Access Characteristics TimeGate software ① node.js LDF server 2.0.0 ② LDF js client access type ① ?s?p?o Query-URI & datetime ② Subject-URI & datetime external integration ① DBpedia LDF server ② current DBpedia clients • all clients: direct access to Mementos of Subject-URI and ?s?p?o Query-URI • Memento clients: datetime negotiation with Subject-URI and ?s?p?o Query-URI 1 2
  • 49. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Outline • Prelude: Memento and Linked Data • First Generation DBpedia Archive • Devising Affordable/Useful Linked Data Archives • Intermezzo: Triple Pattern Fragments (TPF) • Intermezzo: Binary RDF Representation (HDT) • Devising Affordable/Useful Linked Data Archives • Second Generation DBpedia Archive • Try this At Home
  • 50. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Building a Linked Data Archive • Convert the archival data set(s) to HDT using HDT-CPP
  • 51. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 HDT Software (C++) https://github.com/rdfhdt/hdt-cpp • input data requires cleaning before processing, especially regarding URI characters • DBpedia data not clean • DBpedia v3.5 was not successfully processed • No meaningful error messages to help locate problems • memory intensive • Kyoto Cabinet was used to optimize storage requirement and speed during processing • Java version exists but has memory problems
  • 52. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Building a Linked Data Archive • Convert the archival data set(s) to HDT using HDT-CPP • Download the Triple Fragment Server code
  • 53. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Linked Data Fragment Server (Node.js) https://github.com/LinkedDataFragments/Server.js • provides ?s?p?o access to local and/or remote Linked Data sets • supports HDT, Turtle files, N- Triple files, JSON-LD files, SPARQL endpoints, in- memory store, and BlazeGraph Linked Data sets • version 2.0.0 (released March 31 2016) has built-in Memento support
  • 54. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Building a Linked Data Archive • Convert the archival data set(s) to HDT using HDT-CPP • Download the Triple Fragment Server code • Create the JSON config file for Memento
  • 55. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Linked Data Fragment Server, Memento Configuration https://github.com/LinkedDataFragments/Server.js/wiki/Configuring-Memento • declare archival data set(s) • add datetime ranges for the archival data set(s) • add a TimeGate • list the archival data set(s) for which the TimeGate should support datetime negotiation
  • 56. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Building a Linked Data Archive • Convert the archival data set(s) to HDT using HDT-CPP • Download the Triple Fragment Server code • Create the JSON config file for Memento • Run the server
  • 57. Herbert Van de Sompel & Miel Vander Sande CNI Spring Meeting, San Antonio, TX, April 5 2016 Herbert Van de Sompel @hvdsomp Los Alamos National Laboratory Acknowledgments: Lyudmila Balakireva, Harihar Shankar, Ruben Verborgh Access to DBpedia Versions using Memento and Triple Pattern Fragments Miel Vander Sande @Miel_vds Ghent University