Producing, Publishing and Consuming
Linked Data
Three Lessons from the Bio2RDF Project
François Belleau
Centre de recherche du CHUQ, Laval University
Québec, Canada
@bio2rdf
• Looking backward to 2004
• Lessons :
1) How to produce RDF
2) How to publish Linked Data
3) How to consume SPARQL endpoints
• Looking forward to the next decade
The story of two images
or Bio2RDF fairy tale
2004 vision 2011 reality
Rdfizer inspiration
Data Integration problem in
bioinformatics
Where Bio2RDF got its name
Mashup !
FungalWeb
from Christopher Baker
YeastHub
from Kei-Hoi Cheung
ISMB 2005 Birds of a Feather
W3C conference in 2007
46 million documents in SESAME
DILS conference in 2008
63 million triples in Virtuoso
ISMB conference in 2008
65 million triples in Virtuoso
March 2009 Linked Data cloud is
published
Bio2RDF's 2.3 billion
triples represent 54%
of the global graph
W3C-HCLS F2F Meeting in 2009
41 in Virtuoso endpoints
CSHALS conference in 2013
1 billion triples in 19 Virtuoso endpoints with
Bio2RDF release 2 and still adding…
Bio2RDF is not alone anymore !
How to produce RDF
• The Bio2RDF project transforms existing public
databases into RDF;
• Transforming a data format to RDF triples is
simple to do;
• Transformations need to be done from many
kinds of formats (CSV, XML, JSON, HTML,
relational databases) to RDF.
Methods
• 2006 – Converting XML and HTML documents
from the web using the JSP JSTL library
• 2007-2010 – Perl scripts, JSP web pages
• 2012 – Release 2.0 rdfizers are written in PHP
• 2013 – Use Talend ETL jobs
ETL definition from Wikipedia
In computing, Extract, Transform and Load
(ETL) refers to a process in database usage
and especially in data warehousing that
involves:
• Extracting data from outside sources
• Transforming it to fit operational needs (which can
include quality levels)
• Loading it into the end target (database, more
specifically, operational data store, data mart
or data warehouse)
http://en.wikipedia.org/wiki/Extract,_transform,_load
Why not use ETL software
to rdfize existing data ?
Talend Open Studio for Data Integration
a free, open source ETL software built with Eclipse
http://www.talend.com/
HGNC 2 Bio2RDF example
EXTRACT from the web
TRANSFORM to RDF
LOAD into triplestore
HGNC 2 Bio2RDF : EXTRACT
HGNC 2 Bio2RDF : TRANSFORM
HGNC 2 Bio2RDF : LOAD
This rdfizer is available on myExperiment
http://www.myexperiment.org/workflows/3420.html
Lesson #1
• Use an existing ETL tool, like Talend, to do fast
and efficient transformation to the RDF N-Triples
format.
• Talend could be extended with new Semantic
Web components to ease RDF transformation
and simplify SPARQL query submission.
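The EXTRACT / TRANSFORM / LOAD steps of the lesson above can be sketched in a few lines of Python. This is only a minimal illustration, not the Talend job or the Bio2RDF rdfizer: the column names (`hgnc_id`, `symbol`, `name`), the `hgnc` namespace and the vocabulary URI are assumptions made for the example.

```python
import csv
import io

# Assumptions for illustration only: the column names, the "hgnc" namespace
# and the vocabulary URI below are hypothetical, not the Bio2RDF rdfizer's.
BASE = "http://bio2rdf.org/"
VOCAB = "http://example.org/hgnc_vocabulary#"
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"

# One sample row standing in for a downloaded tab-separated file.
SAMPLE_TSV = "hgnc_id\tsymbol\tname\n5\tA1BG\talpha-1-B glycoprotein\n"


def escape_literal(text):
    """Escape the characters N-Triples does not allow raw inside a literal."""
    return (text.replace("\\", "\\\\")
                .replace('"', '\\"')
                .replace("\n", "\\n")
                .replace("\r", "\\r"))


def extract(tsv_text):
    """EXTRACT: parse tab-separated text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(tsv_text), delimiter="\t"))


def transform(rows):
    """TRANSFORM: turn each row into N-Triples lines (one triple per line)."""
    lines = []
    for row in rows:
        subject = "<" + BASE + "hgnc:" + row["hgnc_id"] + ">"
        symbol = escape_literal(row["symbol"])
        name = escape_literal(row["name"])
        lines.append(f'{subject} <{RDFS_LABEL}> "{symbol}" .')
        lines.append(f'{subject} <{VOCAB}name> "{name}" .')
    return lines


def load(lines, out):
    """LOAD: write the triples to a file-like object that a triplestore
    bulk loader could read afterwards."""
    for line in lines:
        out.write(line + "\n")


if __name__ == "__main__":
    buffer = io.StringIO()
    load(transform(extract(SAMPLE_TSV)), buffer)
    print(buffer.getvalue(), end="")
```

Writing one triple per line keeps the output streamable, which is one reason N-Triples suits bulk loading.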
How to publish
Linked Data
• Design your URI pattern;
• Publish a SPARQL endpoint on the Internet;
• Offer a search engine and a browser;
• Register it in an official registry like CKAN;
• Advertise it in a SPARQL endpoint list;
• Describe your triples with an ontology or the way
Bio2RDF does;
• Publish SPARQL query examples;
• Index your data in a semantic search service like Sindice.
Design your URI pattern
• Bio2RDF uses Banff Manifesto URIs
• http://sourceforge.net/apps/mediawiki/bio2rdf/index.php?title=Banff_Manifesto
• Example : http://bio2rdf.org/geneid:15275
• Apply the four linked data rules
• http://www.w3.org/DesignIssues/LinkedData.html
• Be polite with other URIs
• http://hackathon3.dbcls.jp/wiki/URI
• Example : http://purl.uniprot.org/uniprot/P05067
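A URI pattern like the `http://bio2rdf.org/geneid:15275` example can be generated by a small helper. The sketch below assumes the pattern is `http://bio2rdf.org/<namespace>:<identifier>`, as the example suggests; the function name, the lowercasing of the namespace and the percent-encoding of the identifier are illustrative choices, not rules quoted from the Banff Manifesto.

```python
from urllib.parse import quote

BIO2RDF_BASE = "http://bio2rdf.org/"


def bio2rdf_uri(namespace, identifier):
    """Build a URI of the form http://bio2rdf.org/<namespace>:<identifier>.

    Hypothetical helper: lowercasing and percent-encoding are assumptions
    made here so that the same record always maps to the same URI.
    """
    ns = namespace.strip().lower()
    ident = quote(identifier.strip(), safe="")
    if not ns or not ident:
        raise ValueError("namespace and identifier must both be non-empty")
    return BIO2RDF_BASE + ns + ":" + ident


if __name__ == "__main__":
    print(bio2rdf_uri("geneid", "15275"))  # http://bio2rdf.org/geneid:15275
```

Keeping URI construction in one function means every rdfizer produces identical URIs for the same record, which is what lets triples from different sources link up.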
Publish SPARQL endpoint on the
Internet
• Choose a triplestore technology
• http://en.wikipedia.org/wiki/Triplestore
Offer a search engine and a browser
Register it in an official registry like
CKAN
Advertise it in a SPARQL endpoint list
http://www.freebase.com/view/base/politeuri/sparql_endpoint
http://beta.bio2rdf.org/
Describe your triples
Publish SPARQL query example
http://sourceforge.net/apps/mediawiki/bio2rdf/index.php?title=Essential_SPARQL_queries
Index your data in semantic search
service
Lesson #2
• To be present in the Linked Data cloud, just
publish your data through a SPARQL
endpoint.
• Register it with public resources, describe its
content and suggest SPARQL queries.
• We have used the OpenLink Virtuoso free edition
since 2007. Without this first-class triplestore
software there would be no Bio2RDF
service.
How to consume
SPARQL endpoints
Two principles :
1. To answer a specific question first build a
mashup using public or private SPARQL
endpoints.
2. Then, ask your questions to the mashup.
How to build a semantic mashup
• 2005 - Import RDF files into Protégé.
• 2006 - Use the ELMO RDF crawler to import RDF
data into the SESAME triplestore.
• 2007 - We implemented an import function in
SESAME based on dereferenceable URIs.
• 2008 - Use the Virtuoso sponge option and Perl
scripts.
• 2009 - Use the Taverna workflow engine to fetch
triples from SPARQL endpoints.
• 2012 - Use a Talend workflow consuming
SPARQL endpoints.
Who is influential at CSHALS ?
http://cshals.mashup.bio2rdf.org/relfinder/
http://cshals.mashup.bio2rdf.org/sparql
Talend workflow to create the
needed semantic mashup
• Do a full text search for each author (~80)
who talked at CSHALS since 2007 and get
their publications;
• For each publication get its XML
description (~1000) and rdfize it;
• For each publication get its citation list;
• For each publication citing a previous one
get its description (~10 000).
Global workflow in 3 steps
Full text search
Describe publication
Describe citing
publication
Full text search using ncbi/esearch
Describe publication, pubmed rdfizer for
ncbi/efetch and ncbi/elink service
Describe citing publication using
ncbi/elink
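The three workflow steps call the NCBI E-utilities services esearch, efetch and elink. As a rough sketch of what each call could look like outside Talend, the helpers below only build the request URLs and parse an esearch response. The function names are hypothetical, the PubMed IDs in the sample XML are made up, and the use of the `pubmed_pubmed_citedin` link name for the citation step is an assumption about how the original workflow was configured.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

EUTILS = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/"


def esearch_url(author, retmax=100):
    """Step 1: full text search in PubMed for an author name."""
    params = {"db": "pubmed", "term": author, "retmax": retmax}
    return EUTILS + "esearch.fcgi?" + urlencode(params)


def efetch_url(pmid):
    """Step 2: fetch the XML description of one publication."""
    params = {"db": "pubmed", "id": pmid, "retmode": "xml"}
    return EUTILS + "efetch.fcgi?" + urlencode(params)


def elink_citedin_url(pmid):
    """Step 3: list the publications citing a given one (assumed link name)."""
    params = {"dbfrom": "pubmed", "linkname": "pubmed_pubmed_citedin", "id": pmid}
    return EUTILS + "elink.fcgi?" + urlencode(params)


def parse_esearch_ids(xml_text):
    """Extract the PubMed IDs from an esearch XML response."""
    root = ET.fromstring(xml_text)
    return [node.text for node in root.findall("./IdList/Id")]


# Made-up response fragment, shaped like an esearch result.
SAMPLE_ESEARCH = (
    "<eSearchResult><Count>2</Count>"
    "<IdList><Id>111</Id><Id>222</Id></IdList></eSearchResult>"
)

if __name__ == "__main__":
    print(esearch_url("Belleau F", retmax=5))
    for pmid in parse_esearch_ids(SAMPLE_ESEARCH):
        print(efetch_url(pmid))
        print(elink_citedin_url(pmid))
```

Each publication's XML would then go through an rdfizer before being loaded into the mashup triplestore.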
Then query the mashup
• What is CSHALS conference about ?
• Who are the most influential researchers in
the community ?
• Which articles in semantics have been most
cited ?
What is CSHALS conference about ?
select ?label2 as ?mesh count(*) as ?count
where {
?s <http://bio2rdf.org/pubmed_vocabulary#xFoundIn> ?pubmed .
?pubmed <http://bio2rdf.org/pubmed_vocabulary#xMesh> ?xMesh .
?xMesh rdfs:label "Semantics" .
?pubmed <http://bio2rdf.org/pubmed_vocabulary#xMesh> ?xMesh2 .
?xMesh2 rdfs:label ?label2 .
}
order by desc(2)
Who are the most influential
researchers in the community ?
select ?l3 as ?author count(distinct ?pubmed ) as ?citation
where {
?s a <http://bio2rdf.org/pubmed_vocabulary#searchResults> .
?s rdfs:label ?l .
?s <http://bio2rdf.org/pubmed_vocabulary#xFoundIn> ?pubmed .
?pubmed <http://bio2rdf.org/pubmed_vocabulary:xCitedIn> ?xCitedIn .
?pubmed rdfs:label ?l2 .
?pubmed <http://bio2rdf.org/pubmed_vocabulary#xMesh> ?xMesh .
?xMesh rdfs:label "Semantics" .
?pubmed <http://bio2rdf.org/pubmed_vocabulary#xPerson> ?xPerson .
?xPerson rdfs:label ?l3 .
}
order by desc(2)
Which articles in semantics have
been most cited ?
select ?l2 as ?title count(?xCitedIn) as ?count
where {
?s a <http://bio2rdf.org/pubmed_vocabulary#searchResults> .
?s rdfs:label ?l .
?s <http://bio2rdf.org/pubmed_vocabulary#xFoundIn> ?pubmed .
?pubmed <http://bio2rdf.org/pubmed_vocabulary:xCitedIn> ?xCitedIn .
?pubmed rdfs:label ?l2 .
?pubmed <http://bio2rdf.org/pubmed_vocabulary#xMesh> ?xMesh .
?xMesh rdfs:label "Semantics" .
} order by desc(2)
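Queries like the three above can also be sent to the mashup endpoint from a script through the standard SPARQL protocol (an HTTP GET with a `query` parameter). The sketch below is a minimal example using only the Python standard library. The helper names are hypothetical, the endpoint URL is the one given in the slides and may no longer be online, and the sample result is made up to show the shape of the SPARQL JSON results format.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

# Endpoint from the slides; it may no longer be reachable.
ENDPOINT = "http://cshals.mashup.bio2rdf.org/sparql"


def build_request(endpoint, query):
    """Build a SPARQL protocol GET request asking for JSON results."""
    url = endpoint + "?" + urlencode({"query": query})
    return Request(url, headers={"Accept": "application/sparql-results+json"})


def rows_from_results(results_json):
    """Flatten a SPARQL JSON result document into a list of dicts."""
    doc = json.loads(results_json)
    names = doc["head"]["vars"]
    return [
        {name: binding[name]["value"] for name in names if name in binding}
        for binding in doc["results"]["bindings"]
    ]


def run_query(endpoint, query, timeout=30):
    """Send the query over HTTP and return the flattened rows."""
    with urlopen(build_request(endpoint, query), timeout=timeout) as response:
        return rows_from_results(response.read().decode("utf-8"))


# Made-up response illustrating the SPARQL JSON results structure.
SAMPLE_RESULTS = json.dumps({
    "head": {"vars": ["title", "count"]},
    "results": {"bindings": [
        {"title": {"type": "literal", "value": "Example article"},
         "count": {"type": "literal", "value": "3"}},
    ]},
})

if __name__ == "__main__":
    print(rows_from_results(SAMPLE_RESULTS))
```

Note that the queries in the slides use Virtuoso's dialect (for example `count(*) as ?count` without parentheses and `order by desc(2)`), so they may need adjusting before being sent to another triplestore.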
What is the relation between François
Belleau and Michel Dumontier ?
Using RelFinder
http://www.visualdataweb.org/relfinder.php
http://cshals.mashup.bio2rdf.org/relfinder
Using Sentient Knowledge Explorer
http://www.io-informatics.com/
Gruff for AllegroGraph
http://www.franz.com/agraph/gruff/
Lesson #3
• To answer a specific question build a mashup
from SPARQL endpoints and query it.
• To build your semantic mashup, use a
workflow which can be created with an ETL
like Talend.
• Explore the mashup with semantic software
like Virtuoso faceted browser, RelFinder,
Gruff or Sentient.
Projects
• Add new data source to Bio2RDF collection
of SPARQL endpoints;
• Develop Talend ETL Semantic web extension
to ease rdfizing and SPARQL endpoint
consumption needed to build mashup;
• Create a mobile application to browse
Bio2RDF or other SPARQL data sources.
Looking forward to the next decade
• More data providers will expose their data as SPARQL endpoints,
but Bio2RDF is still needed.
• Now that data has been converted to RDF (a dirty job) we need
to ask useful questions of the Linked Data cloud (a hard one).
SPARQL queries will not be sufficient and reasoners will be
essential.
• Semantic software for browsing, visualisation and editing will be
created and SPARQL federated query engines will become
available. This will be the next game changer.
• Intuitive mobile applications will give access to Semantic Web
data in a user friendly manner.
• The data integration experience will be successful for scientist users if
our enthusiastic community gets organized, so governance for Linked
Data in Life Science is a major issue.
LSSEC - Life Science
SPARQL Endpoint Club
https://groups.google.com/d/forum/life-science-sparql-endpoint-club
A private club for SPARQL endpoint
publishers to gather and discuss their
concerns about Linked Data, ontologies
and promotion of the Semantic Web in
the Life Science community.
To become a member you need to
publish RDF or host a SPARQL endpoint
of interest for the Life Science
community.
Acknowledgements
• Bio2RDF is a community project available at http://bio2rdf.org
• The community can be joined at
https://groups.google.com/forum/?fromgroups#!forum/bio2rdf
• This work was done under the supervision of Dr Arnaud Droit,
assistant professor and director of the Centre de Biologie
Computationnelle du CRCHUQ at Laval University, where Bio2RDF
is hosted.
• Michel Dumontier, from the Dumontier Lab at Carleton University,
also hosts a Bio2RDF server, and his team created the new release 2.
• Thanks to all members of the Bio2RDF community, and
especially Marc-Alexandre Nolin and Peter Ansell, initial developers.
• This work was supported by Ministère du Développement
Economique, Innovation Exportation (MDEIE).

More Related Content

What's hot

JBrowse within the Arabidopsis Information Portal - PAG XXIII
JBrowse within the Arabidopsis Information Portal - PAG XXIIIJBrowse within the Arabidopsis Information Portal - PAG XXIII
JBrowse within the Arabidopsis Information Portal - PAG XXIIIVivek Krishnakumar
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...Olaf Hartig
 
Vaughn aip walkthru_pag2015
Vaughn aip walkthru_pag2015Vaughn aip walkthru_pag2015
Vaughn aip walkthru_pag2015Araport
 
Developing Apps: Exposing Your Data Through Araport
Developing Apps: Exposing Your Data Through AraportDeveloping Apps: Exposing Your Data Through Araport
Developing Apps: Exposing Your Data Through AraportMatthew Vaughn
 
HRGRN: enabling graph search and integrative analysis of Arabidopsis signalin...
HRGRN: enabling graph search and integrative analysis of Arabidopsis signalin...HRGRN: enabling graph search and integrative analysis of Arabidopsis signalin...
HRGRN: enabling graph search and integrative analysis of Arabidopsis signalin...Araport
 
Approaching Join Index: Presented by Mikhail Khludnev, Grid Dynamics
Approaching Join Index: Presented by Mikhail Khludnev, Grid DynamicsApproaching Join Index: Presented by Mikhail Khludnev, Grid Dynamics
Approaching Join Index: Presented by Mikhail Khludnev, Grid DynamicsLucidworks
 
PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...
PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...
PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...Araport
 
The Galaxy bioinformatics workflow environment
The Galaxy bioinformatics workflow environmentThe Galaxy bioinformatics workflow environment
The Galaxy bioinformatics workflow environmentRutger Vos
 
A guided tour of Araport
A guided tour of AraportA guided tour of Araport
A guided tour of AraportAraport
 
Plant ontology web services on Araport
Plant ontology web services on AraportPlant ontology web services on Araport
Plant ontology web services on AraportAraport
 
GNU R in Clinical Research and Evidence-Based Medicine
GNU R in Clinical Research and Evidence-Based MedicineGNU R in Clinical Research and Evidence-Based Medicine
GNU R in Clinical Research and Evidence-Based MedicineAdrian Olszewski
 
Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...
Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...
Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...VHIR Vall d’Hebron Institut de Recerca
 
Scientific data curation and processing with Apache Tika
Scientific data curation and processing with Apache TikaScientific data curation and processing with Apache Tika
Scientific data curation and processing with Apache TikaChris Mattmann
 
ICAR 2015 Plenary - Chris Town
ICAR 2015 Plenary - Chris TownICAR 2015 Plenary - Chris Town
ICAR 2015 Plenary - Chris TownAraport
 
i5k Workspace Workshop - AGS2017
i5k Workspace Workshop - AGS2017i5k Workspace Workshop - AGS2017
i5k Workspace Workshop - AGS2017Monica Poelchau
 
Querying Linked Data with SPARQL
Querying Linked Data with SPARQLQuerying Linked Data with SPARQL
Querying Linked Data with SPARQLOlaf Hartig
 
Solr search engine with multiple table relation
Solr search engine with multiple table relationSolr search engine with multiple table relation
Solr search engine with multiple table relationJay Bharat
 
Approaching Join Index - Lucene/Solr Revolution 2014
Approaching Join Index - Lucene/Solr Revolution 2014Approaching Join Index - Lucene/Solr Revolution 2014
Approaching Join Index - Lucene/Solr Revolution 2014Grid Dynamics
 
Apache Spark - Intro to Large-scale recommendations with Apache Spark and Python
Apache Spark - Intro to Large-scale recommendations with Apache Spark and PythonApache Spark - Intro to Large-scale recommendations with Apache Spark and Python
Apache Spark - Intro to Large-scale recommendations with Apache Spark and PythonChristian Perone
 

What's hot (20)

JBrowse within the Arabidopsis Information Portal - PAG XXIII
JBrowse within the Arabidopsis Information Portal - PAG XXIIIJBrowse within the Arabidopsis Information Portal - PAG XXIII
JBrowse within the Arabidopsis Information Portal - PAG XXIII
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
 
Vaughn aip walkthru_pag2015
Vaughn aip walkthru_pag2015Vaughn aip walkthru_pag2015
Vaughn aip walkthru_pag2015
 
Developing Apps: Exposing Your Data Through Araport
Developing Apps: Exposing Your Data Through AraportDeveloping Apps: Exposing Your Data Through Araport
Developing Apps: Exposing Your Data Through Araport
 
HRGRN: enabling graph search and integrative analysis of Arabidopsis signalin...
HRGRN: enabling graph search and integrative analysis of Arabidopsis signalin...HRGRN: enabling graph search and integrative analysis of Arabidopsis signalin...
HRGRN: enabling graph search and integrative analysis of Arabidopsis signalin...
 
Approaching Join Index: Presented by Mikhail Khludnev, Grid Dynamics
Approaching Join Index: Presented by Mikhail Khludnev, Grid DynamicsApproaching Join Index: Presented by Mikhail Khludnev, Grid Dynamics
Approaching Join Index: Presented by Mikhail Khludnev, Grid Dynamics
 
Bio2RDF@BH2010
Bio2RDF@BH2010Bio2RDF@BH2010
Bio2RDF@BH2010
 
PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...
PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...
PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...
 
The Galaxy bioinformatics workflow environment
The Galaxy bioinformatics workflow environmentThe Galaxy bioinformatics workflow environment
The Galaxy bioinformatics workflow environment
 
A guided tour of Araport
A guided tour of AraportA guided tour of Araport
A guided tour of Araport
 
Plant ontology web services on Araport
Plant ontology web services on AraportPlant ontology web services on Araport
Plant ontology web services on Araport
 
GNU R in Clinical Research and Evidence-Based Medicine
GNU R in Clinical Research and Evidence-Based MedicineGNU R in Clinical Research and Evidence-Based Medicine
GNU R in Clinical Research and Evidence-Based Medicine
 
Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...
Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...
Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...
 
Scientific data curation and processing with Apache Tika
Scientific data curation and processing with Apache TikaScientific data curation and processing with Apache Tika
Scientific data curation and processing with Apache Tika
 
ICAR 2015 Plenary - Chris Town
ICAR 2015 Plenary - Chris TownICAR 2015 Plenary - Chris Town
ICAR 2015 Plenary - Chris Town
 
i5k Workspace Workshop - AGS2017
i5k Workspace Workshop - AGS2017i5k Workspace Workshop - AGS2017
i5k Workspace Workshop - AGS2017
 
Querying Linked Data with SPARQL
Querying Linked Data with SPARQLQuerying Linked Data with SPARQL
Querying Linked Data with SPARQL
 
Solr search engine with multiple table relation
Solr search engine with multiple table relationSolr search engine with multiple table relation
Solr search engine with multiple table relation
 
Approaching Join Index - Lucene/Solr Revolution 2014
Approaching Join Index - Lucene/Solr Revolution 2014Approaching Join Index - Lucene/Solr Revolution 2014
Approaching Join Index - Lucene/Solr Revolution 2014
 
Apache Spark - Intro to Large-scale recommendations with Apache Spark and Python
Apache Spark - Intro to Large-scale recommendations with Apache Spark and PythonApache Spark - Intro to Large-scale recommendations with Apache Spark and Python
Apache Spark - Intro to Large-scale recommendations with Apache Spark and Python
 

Viewers also liked

NGS analysis of micro-RNA
NGS analysis of micro-RNANGS analysis of micro-RNA
NGS analysis of micro-RNAcursoNGS
 
Inteligencia Artificial en Bioinformática. Algunas Aplicaciones.
Inteligencia Artificial en Bioinformática. Algunas Aplicaciones.Inteligencia Artificial en Bioinformática. Algunas Aplicaciones.
Inteligencia Artificial en Bioinformática. Algunas Aplicaciones.Alberto Labarga
 
Métodos y Resultados Actuales en Bioinformática: know-how y know-what de las ...
Métodos y Resultados Actuales en Bioinformática: know-how y know-what de las ...Métodos y Resultados Actuales en Bioinformática: know-how y know-what de las ...
Métodos y Resultados Actuales en Bioinformática: know-how y know-what de las ...Mikel Egaña Aranguren, Ph.D.
 
Linux for bioinformatics
Linux for bioinformaticsLinux for bioinformatics
Linux for bioinformaticscursoNGS
 
Towards an understanding of diversity in biological and biomedical systems
Towards an understanding of diversity in biological and biomedical systemsTowards an understanding of diversity in biological and biomedical systems
Towards an understanding of diversity in biological and biomedical systemscursoNGS
 
Linked Data in Healthcare and Life Sciences
Linked Data in Healthcare and Life SciencesLinked Data in Healthcare and Life Sciences
Linked Data in Healthcare and Life SciencesJames G. Boram Kim
 
SPARQL - Basic and Federated Queries
SPARQL - Basic and Federated QueriesSPARQL - Basic and Federated Queries
SPARQL - Basic and Federated QueriesKnud Möller
 
SEO y Web Semántica en Congreso Web
SEO y Web Semántica en Congreso WebSEO y Web Semántica en Congreso Web
SEO y Web Semántica en Congreso WebLakil Essady
 
Link Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked DataLink Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked DataMichel Dumontier
 

Viewers also liked (14)

Introduction of Linked Data for Science
Introduction of Linked Data for ScienceIntroduction of Linked Data for Science
Introduction of Linked Data for Science
 
NGS analysis of micro-RNA
NGS analysis of micro-RNANGS analysis of micro-RNA
NGS analysis of micro-RNA
 
Building DBpedia Japanese and Linked Data Cloud in Japanese
Building DBpedia Japanese and Linked Data Cloud in JapaneseBuilding DBpedia Japanese and Linked Data Cloud in Japanese
Building DBpedia Japanese and Linked Data Cloud in Japanese
 
Inteligencia Artificial en Bioinformática. Algunas Aplicaciones.
Inteligencia Artificial en Bioinformática. Algunas Aplicaciones.Inteligencia Artificial en Bioinformática. Algunas Aplicaciones.
Inteligencia Artificial en Bioinformática. Algunas Aplicaciones.
 
Applying Ontology Design Patterns in bio-ontologies
Applying Ontology Design Patterns in bio-ontologiesApplying Ontology Design Patterns in bio-ontologies
Applying Ontology Design Patterns in bio-ontologies
 
Aplicación de la Web Semántica en Bioinformática
Aplicación de la Web Semántica en BioinformáticaAplicación de la Web Semántica en Bioinformática
Aplicación de la Web Semántica en Bioinformática
 
Métodos y Resultados Actuales en Bioinformática: know-how y know-what de las ...
Métodos y Resultados Actuales en Bioinformática: know-how y know-what de las ...Métodos y Resultados Actuales en Bioinformática: know-how y know-what de las ...
Métodos y Resultados Actuales en Bioinformática: know-how y know-what de las ...
 
Life Sciences Linked Data
Life Sciences Linked DataLife Sciences Linked Data
Life Sciences Linked Data
 
Linux for bioinformatics
Linux for bioinformaticsLinux for bioinformatics
Linux for bioinformatics
 
Towards an understanding of diversity in biological and biomedical systems
Towards an understanding of diversity in biological and biomedical systemsTowards an understanding of diversity in biological and biomedical systems
Towards an understanding of diversity in biological and biomedical systems
 
Linked Data in Healthcare and Life Sciences
Linked Data in Healthcare and Life SciencesLinked Data in Healthcare and Life Sciences
Linked Data in Healthcare and Life Sciences
 
SPARQL - Basic and Federated Queries
SPARQL - Basic and Federated QueriesSPARQL - Basic and Federated Queries
SPARQL - Basic and Federated Queries
 
SEO y Web Semántica en Congreso Web
SEO y Web Semántica en Congreso WebSEO y Web Semántica en Congreso Web
SEO y Web Semántica en Congreso Web
 
Link Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked DataLink Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked Data
 

Similar to Producing, publishing and consuming linked data - CSHALS 2013

Data FAIRport Prototype & Demo - Presentation to Elsevier, Jul 10, 2015
Data FAIRport Prototype & Demo - Presentation to Elsevier, Jul 10, 2015Data FAIRport Prototype & Demo - Presentation to Elsevier, Jul 10, 2015
Data FAIRport Prototype & Demo - Presentation to Elsevier, Jul 10, 2015Mark Wilkinson
 
Consuming Linked Data 4/5 Semtech2011
Consuming Linked Data 4/5 Semtech2011Consuming Linked Data 4/5 Semtech2011
Consuming Linked Data 4/5 Semtech2011Juan Sequeda
 
Data Integration And Visualization
Data Integration And VisualizationData Integration And Visualization
Data Integration And VisualizationIvan Ermilov
 
Data FAIRport Skunkworks: Common Repository Access Via Meta-Metadata Descript...
Data FAIRport Skunkworks: Common Repository Access Via Meta-Metadata Descript...Data FAIRport Skunkworks: Common Repository Access Via Meta-Metadata Descript...
Data FAIRport Skunkworks: Common Repository Access Via Meta-Metadata Descript...datascienceiqss
 
State of the Semantic Web
State of the Semantic WebState of the Semantic Web
State of the Semantic WebIvan Herman
 
FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...
FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...
FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...Mark Wilkinson
 
Usage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application ScenariosUsage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application ScenariosEUCLID project
 
Publishing "5 star" data: the case for RDF
Publishing "5 star" data: the case for RDFPublishing "5 star" data: the case for RDF
Publishing "5 star" data: the case for RDFPeterWinstanley1
 
2009 0807 Lod Gmod
2009 0807 Lod Gmod2009 0807 Lod Gmod
2009 0807 Lod GmodJun Zhao
 
Overview of the SPARQL-Generate language and latest developments
Overview of the SPARQL-Generate language and latest developmentsOverview of the SPARQL-Generate language and latest developments
Overview of the SPARQL-Generate language and latest developmentsMaxime Lefrançois
 
Tech. session : Interoperability and Data FAIRness emerges from a novel combi...
Tech. session : Interoperability and Data FAIRness emerges from a novel combi...Tech. session : Interoperability and Data FAIRness emerges from a novel combi...
Tech. session : Interoperability and Data FAIRness emerges from a novel combi...Mark Wilkinson
 
LDP4j: A framework for the development of interoperable read-write Linked Da...
LDP4j: A framework for the development of interoperable read-write Linked Da...LDP4j: A framework for the development of interoperable read-write Linked Da...
LDP4j: A framework for the development of interoperable read-write Linked Da...Nandana Mihindukulasooriya
 
Intro to-technologies-Green-City-Hackathon-Athens
Intro to-technologies-Green-City-Hackathon-AthensIntro to-technologies-Green-City-Hackathon-Athens
Intro to-technologies-Green-City-Hackathon-AthensStoitsis Giannis
 
RDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival dataRDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival dataGiorgos Santipantakis
 
GDG Meets U event - Big data & Wikidata - no lies codelab
GDG Meets U event - Big data & Wikidata -  no lies codelabGDG Meets U event - Big data & Wikidata -  no lies codelab
GDG Meets U event - Big data & Wikidata - no lies codelabCAMELIA BOBAN
 

Similar to Producing, publishing and consuming linked data - CSHALS 2013 (20)

Bio2RDF @ W3C HCLS2009
Bio2RDF @ W3C HCLS2009Bio2RDF @ W3C HCLS2009
Bio2RDF @ W3C HCLS2009
 
Data FAIRport Prototype & Demo - Presentation to Elsevier, Jul 10, 2015
Data FAIRport Prototype & Demo - Presentation to Elsevier, Jul 10, 2015Data FAIRport Prototype & Demo - Presentation to Elsevier, Jul 10, 2015
Data FAIRport Prototype & Demo - Presentation to Elsevier, Jul 10, 2015
 
Consuming Linked Data 4/5 Semtech2011
Consuming Linked Data 4/5 Semtech2011Consuming Linked Data 4/5 Semtech2011
Consuming Linked Data 4/5 Semtech2011
 
Data Integration And Visualization
Data Integration And VisualizationData Integration And Visualization
Data Integration And Visualization
 
Data FAIRport Skunkworks: Common Repository Access Via Meta-Metadata Descript...
Data FAIRport Skunkworks: Common Repository Access Via Meta-Metadata Descript...Data FAIRport Skunkworks: Common Repository Access Via Meta-Metadata Descript...
Data FAIRport Skunkworks: Common Repository Access Via Meta-Metadata Descript...
 
Publishing Linked Data from RDB
Publishing Linked Data from RDBPublishing Linked Data from RDB
Publishing Linked Data from RDB
 
State of the Semantic Web
State of the Semantic WebState of the Semantic Web
State of the Semantic Web
 
Bio2RDF should we do it
Bio2RDF should we do itBio2RDF should we do it
Bio2RDF should we do it
 
FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...
FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...
FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...
 
Usage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application ScenariosUsage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application Scenarios
 
Publishing "5 star" data: the case for RDF
Publishing "5 star" data: the case for RDFPublishing "5 star" data: the case for RDF
Publishing "5 star" data: the case for RDF
 
2009 0807 Lod Gmod
2009 0807 Lod Gmod2009 0807 Lod Gmod
2009 0807 Lod Gmod
 
Overview of the SPARQL-Generate language and latest developments
Overview of the SPARQL-Generate language and latest developmentsOverview of the SPARQL-Generate language and latest developments
Overview of the SPARQL-Generate language and latest developments
 
NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...
NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...
NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...
 
Semantic Web talk TEMPLATE
Semantic Web talk TEMPLATESemantic Web talk TEMPLATE
Semantic Web talk TEMPLATE
 
Tech. session : Interoperability and Data FAIRness emerges from a novel combi...
Tech. session : Interoperability and Data FAIRness emerges from a novel combi...Tech. session : Interoperability and Data FAIRness emerges from a novel combi...
Tech. session : Interoperability and Data FAIRness emerges from a novel combi...
 
LDP4j: A framework for the development of interoperable read-write Linked Da...
LDP4j: A framework for the development of interoperable read-write Linked Da...LDP4j: A framework for the development of interoperable read-write Linked Da...
LDP4j: A framework for the development of interoperable read-write Linked Da...
 
Intro to-technologies-Green-City-Hackathon-Athens
Intro to-technologies-Green-City-Hackathon-AthensIntro to-technologies-Green-City-Hackathon-Athens
Intro to-technologies-Green-City-Hackathon-Athens
 
RDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival dataRDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival data
 
GDG Meets U event - Big data & Wikidata - no lies codelab
GDG Meets U event - Big data & Wikidata -  no lies codelabGDG Meets U event - Big data & Wikidata -  no lies codelab
GDG Meets U event - Big data & Wikidata - no lies codelab
 

Producing, publishing and consuming linked data - CSHALS 2013

  • 1. Producing, Publishing and Consuming Linked Data Three Lessons from the Bio2RDF Project François Belleau Centre de recherche du CHUQ, Laval University Québec, Canada @bio2rdf
  • 2. • Looking backward to 2004 • Lessons : 1) How to produce RDF 2) How to publish Linked Data 3) How to consume SPARQL endpoints • Looking forward to the next decade
  • 3. The story of two images or Bio2RDF fairy tale 2004 vision 2011 reality
  • 5. Data Integration problem in bioinformatics
  • 6. Where Bio2RDF got its name
  • 7. Mashup ! FungalWeb from Christopher Baker YeastHub from Kei-Hoi Cheung
  • 8. ISMB 2005 Birds of a Feather
  • 9. W3C conference in 2007 46 million documents in SESAME
  • 10. DILS conference in 2008 63 million triples in Virtuoso
  • 11. ISMB conference in 2008 65 million triples in Virtuoso
  • 12. March 2009: the Linked Data cloud is published. Bio2RDF's 2.3 billion triples represent 54% of the global graph
  • 13. W3C-HCLS F2F Meeting in 2009 41 in Virtuoso endpoints
  • 14. CSHALS conference in 2013 1 billion triples in 19 Virtuoso endpoints with Bio2RDF release 2, and still adding…
  • 15. Bio2RDF is not alone anymore !
  • 16. How to produce RDF • The Bio2RDF project transforms existing public databases into RDF; • Transforming a data format into RDF triples is simple to do; • Transformations need to be done from many kinds of formats (CSV, XML, JSON, HTML, relational databases) to RDF.
  • 17. Methods • 2006 - Converting XML and HTML documents from the web using the JSP JSTL library • 2007-2010 - Perl scripts, JSP web pages • 2012 - Release 2.0 rdfizers are written in PHP • 2013 - Use Talend ETL jobs
  • 18. ETL definition from Wikipedia In computing, Extract, Transform and Load (ETL) refers to a process in database usage and especially in data warehousing that involves: • Extracting data from outside sources • Transforming it to fit operational needs (which can include quality levels) • Loading it into the end target (database, more specifically, operational data store, data mart or data warehouse) http://en.wikipedia.org/wiki/Extract,_transform,_load
  • 19. Why not use ETL software to rdfize existing data ?
  • 20. Talend Open Studio for Data Integration: free, open source ETL software built with Eclipse http://www.talend.com/
  • 21. HGNC 2 Bio2RDF example EXTRACT from the web TRANSFORM to RDF LOAD into triplestore
  • 22. HGNC 2 Bio2RDF : EXTRACT
  • 23. HGNC 2 Bio2RDF : TRANSFORM
  • 24. HGNC 2 Bio2RDF : LOAD
  • 25. This rdfizer is available on myExperiment http://www.myexperiment.org/workflows/3420.html
  • 26. Lesson #1 • Use an existing ETL tool, like Talend, for fast and efficient transformation to the RDF N-Triples format. • Talend could be extended with new Semantic Web components to ease RDF transformation and simplify SPARQL query submission.
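The extract, transform and load steps named in Lesson #1 can be illustrated outside of Talend. The following is a minimal Python sketch, not the Bio2RDF rdfizer itself: the column names (`hgnc_id`, `symbol`, `name`), the `hgnc_vocabulary:symbol` predicate and the sample row are assumptions made for illustration. It turns tab-separated rows into N-Triples lines that a triplestore bulk loader could then ingest.

```python
import csv
import io

BASE = "http://bio2rdf.org/"
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
# Hypothetical predicate URI, used here for illustration only.
SYMBOL = "http://bio2rdf.org/hgnc_vocabulary:symbol"


def escape_literal(value):
    """Escape the characters that N-Triples requires escaping in a literal."""
    return (
        value.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )


def row_to_ntriples(row):
    """TRANSFORM: turn one record (a dict) into a list of N-Triples lines."""
    subject = f"<{BASE}hgnc:{row['hgnc_id']}>"
    return [
        f'{subject} <{SYMBOL}> "{escape_literal(row["symbol"])}" .',
        f'{subject} <{RDFS_LABEL}> "{escape_literal(row["name"])}" .',
    ]


def rdfize(tsv_text):
    """EXTRACT rows from tab-separated text and TRANSFORM each into triples."""
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    triples = []
    for row in reader:
        triples.extend(row_to_ntriples(row))
    return triples


# Assumed sample input; a real job would download the source file instead.
SAMPLE = "hgnc_id\tsymbol\tname\n5\tA1BG\talpha-1-B glycoprotein\n"
```

Calling `rdfize(SAMPLE)` returns two N-Triples lines for the sample record. The LOAD step is left out on purpose, since it depends on the triplestore in use.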
  • 27. How to publish Linked Data • Design your URI pattern; • Publish a SPARQL endpoint on the Internet; • Offer a search engine and a browser; • Register it in an official registry like CKAN; • Advertise it in a SPARQL endpoint list; • Describe your triples with an ontology or the way Bio2RDF does; • Publish SPARQL query examples; • Index your data in a semantic search service like Sindice;
  • 28. Design your URI pattern • Bio2RDF uses Banff Manifesto URIs • http://sourceforge.net/apps/mediawiki/bio2rdf/index.php?title=Banff_Manifesto • Example : http://bio2rdf.org/geneid:15275 • Apply the four linked data rules • http://www.w3.org/DesignIssues/LinkedData.html • Be polite with other URIs • http://hackathon3.dbcls.jp/wiki/URI • Example : http://purl.uniprot.org/uniprot/P05067
  • 29. Publish a SPARQL endpoint on the Internet • Choose a triplestore technology • http://en.wikipedia.org/wiki/Triplestore
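The example URI above, http://bio2rdf.org/geneid:15275, follows a `namespace:identifier` pattern. The Python sketch below shows one way such URIs could be built and split. The helper names are hypothetical, and lowercasing the namespace and percent-encoding the identifier are assumptions of this sketch rather than rules quoted from the Banff Manifesto.

```python
from urllib.parse import quote, unquote

BIO2RDF_BASE = "http://bio2rdf.org/"


def bio2rdf_uri(namespace, identifier):
    """Build a namespace:identifier URI under the Bio2RDF base."""
    ns = str(namespace).strip().lower()  # assumption: namespaces are lowercase
    if not ns or ":" in ns or "/" in ns:
        raise ValueError(f"invalid namespace: {namespace!r}")
    # Percent-encode the identifier so it cannot break the URI structure.
    return f"{BIO2RDF_BASE}{ns}:{quote(str(identifier), safe='')}"


def split_bio2rdf_uri(uri):
    """Return (namespace, identifier) for a URI built by bio2rdf_uri."""
    if not uri.startswith(BIO2RDF_BASE):
        raise ValueError(f"not a Bio2RDF URI: {uri!r}")
    namespace, sep, identifier = uri[len(BIO2RDF_BASE):].partition(":")
    if not sep or not namespace or not identifier:
        raise ValueError(f"no namespace:identifier part in {uri!r}")
    return namespace, unquote(identifier)
```

With these helpers, `bio2rdf_uri("geneid", 15275)` reproduces the example URI from the slide, and `split_bio2rdf_uri` recovers the two parts.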
  • 30. Offer a search engine and a browser
  • 31. Register it in an official registry like CKAN
  • 32. Advertise it in a SPARQL endpoint list http://www.freebase.com/view/base/politeuri/sparql_endpoint http://beta.bio2rdf.org/
  • 34. Publish SPARQL query examples http://sourceforge.net/apps/mediawiki/bio2rdf/index.php?title=Essential_SPARQL_queries
  • 35. Index your data in a semantic search service
  • 36. Lesson #2 • To be present in the Linked Data cloud, just publish your data through a SPARQL endpoint. • Register it with public resources, describe its content and suggest SPARQL queries. • We have used OpenLink Virtuoso free edition since 2007. Without this first-class triplestore software there would not be a Bio2RDF service.
  • 37. How to consume SPARQL endpoints Two principles : 1. To answer a specific question, first build a mashup using public or private SPARQL endpoints. 2. Then, put your questions to the mashup.
  • 38. How to build a semantic mashup • 2005 - Import RDF files into Protégé. • 2006 - Use the ELMO RDF crawler to import RDF data into a SESAME triplestore. • 2007 - We implement an import function in SESAME based on dereferenceable URIs. • 2008 - Use the Virtuoso sponge option and Perl scripts. • 2009 - Use the Taverna workflow engine to fetch triples from SPARQL endpoints. • 2012 - Use a Talend workflow consuming SPARQL endpoints.
  • 39. Who is influential at CSHALS ? http://cshals.mashup.bio2rdf.org/relfinder/ http://cshals.mashup.bio2rdf.org/sparql
  • 40. Talend workflow to create the needed semantic mashup • Do a full text search for each author (~80) who has spoken at CSHALS since 2007 and get their publications; • For each publication get its XML description (~1000) and rdfize it; • For each publication get its citation list; • For each publication citing a previous one get its description (~10 000).
  • 41. Global workflow in 3 steps Full text search Describe publication Describe citing publication
  • 42. Full text search using ncbi/esearch
  • 43. Describe publication, pubmed rdfizer for the ncbi/efetch and ncbi/elink services
  • 44. Describe citing publication using ncbi/elink
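The three workflow steps above rely on the NCBI E-utilities services esearch, efetch and elink. The Talend jobs themselves are not reproduced in the slides, so the following Python sketch only shows how the three request URLs could be built. The function names are hypothetical, and the `retmax` value and the `[Author]` search field are assumptions about how the author search might be phrased.

```python
from urllib.parse import urlencode

# Base URL of the NCBI E-utilities web services.
EUTILS = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/"


def esearch_url(author, retmax=100):
    """Step 1, full text search: find PubMed IDs for one author."""
    params = {"db": "pubmed", "term": f"{author}[Author]", "retmax": retmax}
    return EUTILS + "esearch.fcgi?" + urlencode(params)


def efetch_url(pmids):
    """Step 2, describe publication: fetch XML records for a list of PubMed IDs."""
    params = {"db": "pubmed", "id": ",".join(pmids), "retmode": "xml"}
    return EUTILS + "efetch.fcgi?" + urlencode(params)


def elink_citedin_url(pmid):
    """Step 3, citing publications: list PubMed records that cite one PubMed ID."""
    params = {"dbfrom": "pubmed", "linkname": "pubmed_pubmed_citedin", "id": pmid}
    return EUTILS + "elink.fcgi?" + urlencode(params)
```

Each URL returns XML that an rdfizer would then convert to triples. Fetching and parsing are left out so that the sketch stays independent of network access.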
  • 45. Then query the mashup • What is the CSHALS conference about ? • Who are the most influential researchers in the community ? • Which articles in semantics have been most cited ?
  • 46. What is the CSHALS conference about ?
      select ?label2 as ?mesh count(*) as ?count
      where {
        ?s <http://bio2rdf.org/pubmed_vocabulary#xFoundIn> ?pubmed .
        ?pubmed <http://bio2rdf.org/pubmed_vocabulary#xMesh> ?xMesh .
        ?xMesh rdfs:label "Semantics" .
        ?pubmed <http://bio2rdf.org/pubmed_vocabulary#xMesh> ?xMesh2 .
        ?xMesh2 rdfs:label ?label2 .
      }
      order by desc(2)
  • 47. Who are the most influential researchers in the community ?
      select ?l3 as ?author count(distinct ?pubmed ) as ?citation
      where {
        ?s a <http://bio2rdf.org/pubmed_vocabulary#searchResults> .
        ?s rdfs:label ?l .
        ?s <http://bio2rdf.org/pubmed_vocabulary#xFoundIn> ?pubmed .
        ?pubmed <http://bio2rdf.org/pubmed_vocabulary:xCitedIn> ?xCitedIn .
        ?pubmed rdfs:label ?l2 .
        ?pubmed <http://bio2rdf.org/pubmed_vocabulary#xMesh> ?xMesh .
        ?xMesh rdfs:label "Semantics" .
        ?pubmed <http://bio2rdf.org/pubmed_vocabulary#xPerson> ?xPerson .
        ?xPerson rdfs:label ?l3 .
      }
      order by desc(2)
  • 48. Which articles in semantics have been most cited ?
      select ?l2 as ?title count(?xCitedIn) as ?count
      where {
        ?s a <http://bio2rdf.org/pubmed_vocabulary#searchResults> .
        ?s rdfs:label ?l .
        ?s <http://bio2rdf.org/pubmed_vocabulary#xFoundIn> ?pubmed .
        ?pubmed <http://bio2rdf.org/pubmed_vocabulary:xCitedIn> ?xCitedIn .
        ?pubmed rdfs:label ?l2 .
        ?pubmed <http://bio2rdf.org/pubmed_vocabulary#xMesh> ?xMesh .
        ?xMesh rdfs:label "Semantics" .
      }
      order by desc(2)
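Queries such as the three above are sent to an endpoint over HTTP using the SPARQL protocol. The Python sketch below uses only the standard library and shows one way to submit a query with GET and read the SPARQL JSON results format. The function names are hypothetical, and whether the endpoint address given in the slides is still reachable is not something this sketch assumes.

```python
import json
import urllib.parse
import urllib.request

# Endpoint address quoted in the slides; it may no longer be online.
ENDPOINT = "http://cshals.mashup.bio2rdf.org/sparql"


def build_request(endpoint, query):
    """Prepare a SPARQL protocol GET request that asks for JSON results."""
    url = endpoint + "?" + urllib.parse.urlencode({"query": query})
    return urllib.request.Request(
        url, headers={"Accept": "application/sparql-results+json"}
    )


def rows_from_results(document):
    """Flatten a SPARQL JSON results document into a list of plain dicts."""
    data = json.loads(document)
    variables = data["head"]["vars"]
    return [
        {name: binding[name]["value"] for name in variables if name in binding}
        for binding in data["results"]["bindings"]
    ]


def run_query(endpoint, query, timeout=30):
    """Send the query and return its rows; needs network access to the endpoint."""
    request = build_request(endpoint, query)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return rows_from_results(response.read().decode("utf-8"))
```

Keeping request building and result parsing in separate functions means both can be checked without contacting a live endpoint. `run_query(ENDPOINT, query_text)` would then return one dict per result row.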
  • 49. What is the relation between François Belleau and Michel Dumontier ?
  • 51. Using Sentient Knowledge Explorer http://www.io-informatics.com/
  • 53. Lesson #3 • To answer a specific question, build a mashup from SPARQL endpoints and query it. • To build your semantic mashup, use a workflow, which can be created with an ETL tool like Talend. • Explore the mashup with semantic software like the Virtuoso faceted browser, RelFinder, Gruff or Sentient.
  • 54. Projects • Add new data sources to the Bio2RDF collection of SPARQL endpoints; • Develop a Talend ETL Semantic Web extension to ease the rdfizing and SPARQL endpoint consumption needed to build mashups; • Create a mobile application to browse Bio2RDF or other SPARQL data sources.
  • 55. Looking forward to the next decade • More data providers will expose their data as SPARQL endpoints, but Bio2RDF is still needed. • Now that data has been converted to RDF (a dirty job), we need to ask useful questions of the Linked Data cloud (a hard one). SPARQL queries will not be sufficient and reasoners will be essential. • Semantic software for browsing, visualisation and editing will be created, and SPARQL federated query engines will become available. This will be the next game changer. • Intuitive mobile applications will give access to Semantic Web data in a user-friendly manner. • The data integration experience will be successful for scientist users if our enthusiastic community gets organized, so governance for Linked Data in Life Science is a major issue.
  • 56. LSSEC - Life Science SPARQL Endpoint Club https://groups.google.com/d/forum/life-science-sparql-endpoint-club A private club for SPARQL endpoint publishers to gather and discuss their concerns about Linked Data, Ontology and promotion of the Semantic Web in the Life Science community. To become a member you need to publish RDF or host a SPARQL endpoint of interest to the Life Science community.
  • 57. Acknowledgements • Bio2RDF is a community project available at http://bio2rdf.org • The community can be joined at https://groups.google.com/forum/?fromgroups#!forum/bio2rdf • This work was done under the supervision of Dr Arnaud Droit, assistant professor and director of the Centre de Biologie Computationnelle du CRCHUQ at Laval University, where Bio2RDF is hosted. • Michel Dumontier, from the Dumontier Lab at Carleton University, also hosts a Bio2RDF server, and his team created the new release 2. • Thanks to all the members of the Bio2RDF community, and especially Marc-Alexandre Nolin and Peter Ansell, initial developers. • This work was supported by Ministère du Développement Economique, Innovation Exportation (MDEIE).