SlideShare a Scribd company logo
1 of 116
Download to read offline
x
Towards a Linked Open Data Infrastructure
for Science, Technology & Innovation
Studies
Ali Khalili, PhD
Department of Computer Science/Artificial Intelligence
Knowledge Representation & Reasoning Research Group
Outline
• Linked (Open) Data
• RISIS Project
• Semantically Mapping Science (SMS) Platform
• Workflow
• Use Cases
• Adaptive Functional Urban Areas (FUAs) to Study Innovative Activities
• Gendered Dimensions in Grant Selection
Evolution of the Web
https://mcgratha.wordpress.com/
Evolution of the Web
https://mcgratha.wordpress.com/
Linked (Open) Data
• A set of best practices for publishing data on the Web.
• Follows 4 simple principles:
https://www.ted.com/talks/tim_berners_lee_on_the_next_web
• Use HTTP URIs so that users can look up (dereference) those names.
• When someone looks up a URI, provide useful information, using the
open standards.
• Include links to other URIs, so that users can discover more things.
• Use URIs as names (identifiers) for conceptual things.
Linked (Open) Data: Principles
WWW World
Linked (Open) Data: Principles
WWW World
Linked (Open) Data: Principles
WWW World
Linked (Open) Data: Principles
WWW World
Linked (Open) Data: Principles
WWW World
5 Open Data
make your stuff available on the Web (whatever format)
under an open license
make it available as structured data
(e.g., Excel instead of image scan of a table)
make it available in a non-proprietary open format
(e.g., CSV instead of Excel)
use Linked Data format
(URIs to identify things, RDF to represent data)
link your data to other people’s data to provide context
http://5stardata.info/
5 Open Data
Linked Open Data Cloud
8Linked Open DataAli Khalili
http://lod-cloud.net/
Linked Open Data Cloud
8Linked Open DataAli Khalili
http://lod-cloud.net/
Linked Open Data Cloud
8Linked Open DataAli Khalili
http://lod-cloud.net/
Linked Open Data Cloud
8Linked Open DataAli Khalili
http://lod-cloud.net/
Linked Open Data Cloud
8Linked Open DataAli Khalili
http://lod-cloud.net/
Linked Open Data Cloud
8Linked Open DataAli Khalili
http://lod-cloud.net/
Linked Open Data Cloud
8Linked Open DataAli Khalili
http://lod-cloud.net/
Linked Open Data Cloud
8Linked Open DataAli Khalili
http://lod-cloud.net/
Linked Open Data Cloud
http://lod-cloud.net/
Linked Open Data: Statistics
http://lodlaundromat.org/
http://stats.lod2.eu/
more than 3426 datasets
Linked Open Data: Examples
https://en.wikipedia.org/wiki/Paris
Linked Open Data: Examples
Resource Property Value
https://en.wikipedia.org/wiki/Paris
Linked Open Data: Examples
http://dbpedia.org/resource/Paris
Linked Open Data: Examples
• Give me a list of capital cities in Europe with population more than 500,000
• Who are mayors of central European towns elevated more than 1000m?
• Which movies are starring both Brad Pitt and Angelina Jolie?
• All soccer players, who played as goalkeeper for a club that has a stadium with
more than 40.000 seats and who are born in a country with more than 10 million
inhabitants
• …
Linked Open Data: Examples
• Give me a list of capital cities in Europe with population more than 500,000
• Who are mayors of central European towns elevated more than 1000m?
• Which movies are starring both Brad Pitt and Angelina Jolie?
• All soccer players, who played as goalkeeper for a club that has a stadium with
more than 40.000 seats and who are born in a country with more than 10 million
inhabitants
• …
Linked Open Data: Examples
Linked Open Data: Examples
Linked Open Data: Examples
https://www.google.com/cse/
Linked Open Data: Examples
http://www.wolframalpha.com/
http://risis.eu
RISIS EU Project (http://risis.eu)
http://datasets.risis.eu/
RISIS Datasets: Entity Types
Organization Product Agreement
Person Policy
Policy
Evaluation Location
CIB ETER EUPRO JOREP Leiden-Ranking
MORE I Nano Profile SIPER VICO
Higher
Education
Firm
Funding
Body
Publication
Patent
Project
Investment
Funding
Program
RISIS Datasets: Entity Types
Organization Product Agreement
Person Policy
Policy
Evaluation Location
CIB ETER EUPRO JOREP Leiden-Ranking
MORE I Nano Profile SIPER VICO
Higher
Education
Firm
Funding
Body
Publication
Patent
Project
Investment
Funding
Program
Semantically Mapping Science (SMS) Platform
http://sms.risis.eu
RISIS WP9 Vision
	Proposing	S&T	map	of	Europe
Functional Urban Areas (FUAs)
Functional Urban Areas (FUAs)
defined by OECD in collaboration with EC/Eurostat
consider factors beyond the predefined city boundaries to better
reflect the economic geography of where people live and work
Functional Urban Areas (FUAs)
OECD Metropolitan eXplorer: http://measuringurban.oecd.org
defined by OECD in collaboration with EC/Eurostat
consider factors beyond the predefined city boundaries to better
reflect the economic geography of where people live and work
population
area
GDP
environment (CO2 emissions and air pollution)
labour market (employment and unemployment growth)
innovation (patent intensity)
urban form and territorial organization
Functional Urban Areas (FUAs)
OECD Metropolitan eXplorer: http://measuringurban.oecd.org
Functional Urban Areas (FUAs)
FUAs Example: Netherlands
FUAs: Building Blocks
Municipalities
Problem
Address FUA
?
Problem
Address FUA
?
• Vrije Universiteit Amsterdam
• De Boelelaan 1105, 1081 HV Amsterdam
Amsterdam (NL002)
Problem
Address FUA
?
• Vrije Universiteit Amsterdam
• De Boelelaan 1105, 1081 HV Amsterdam
Amsterdam (NL002)
OECD FUAs List
Problem
Address FUA
?
• Vrije Universiteit Amsterdam
• De Boelelaan 1105, 1081 HV Amsterdam
Amsterdam (NL002)
- Geocode to LAU (municipality)
OECD FUAs List
Problem
Address FUA
?
• Vrije Universiteit Amsterdam
• De Boelelaan 1105, 1081 HV Amsterdam
Amsterdam (NL002)
- Geocode to LAU (municipality)
OECD FUAs List
Problem
Address FUA
?
• Vrije Universiteit Amsterdam
• De Boelelaan 1105, 1081 HV Amsterdam
Amsterdam (NL002)
- Geocode to LAU (municipality)
- Shapefiles for FUAs or LAUs?
OECD FUAs List
Problem
Address FUA
?
• Vrije Universiteit Amsterdam
• De Boelelaan 1105, 1081 HV Amsterdam
Amsterdam (NL002)
- Geocode to LAU (municipality)
- Shapefiles for FUAs or LAUs?
OECD FUAs List
Linked Open Data 28Ali Khalili
Linked Open Data
Interlinking
Enrichment
Quality
Analysis
Evolution
Exploration
Extraction
Storage/
Querying
Authoring
Linked (Open) Data
Lifecycle
http://stack.linkeddata.org/
Linked Open Data 28Ali Khalili
Linked Open Data
Interlinking
Enrichment
Quality
Analysis
Evolution
Exploration
Extraction
Storage/
Querying
Authoring
Linked (Open) Data
Lifecycle
http://stack.linkeddata.org/
Linked Open Data 29Ali Khalili
Linked Open Data Lifecycle
Exploration
Linked Open Data 29Ali Khalili
Linked Open Data Lifecycle
• Search
• Browse
• Visualize
Exploration
Search for Linked Data
Linked Open Data 30Ali Khalili
Linked Open Data Lifecycle Exploration
http://lov.okfn.org/
Search for Linked Data
Linked Open Data 31Ali Khalili
Linked Open Data Lifecycle Exploration
http://schema.org/
http://bl.ocks.org/danbri/1c121ea8bd2189cf411c
Search for Linked Data
Linked Open Data 32Ali Khalili
Linked Open Data Lifecycle
Data hub http://datahub.io
search for data, register published datasets, create and manage groups of datasets…
Exploration
Search for Linked Data
Linked Open Data 33Ali Khalili
Linked Open Data Lifecycle Exploration
http://lotus.lodlaundromat.org
Search for Linked Data
Linked Open Data 34Ali Khalili
Linked Open Data Lifecycle Exploration
• OpenStreepMap (OSM)
• Database of Global Administrative Areas (GADM)
• Flickr Shapefiles Dataset
• Published Shapefiles for Individual Countries
• Published Geospatial RDF Datasets
Example
OpenStreetMap (OSM)
• https://www.openstreetmap.org
• built by a community of mappers that contribute and
maintain data about roads, trails, cafés, railway
stations, and much more, all over the world.
• Administrative Boundaries
• Level 1: super-national administrations e.g. European Union.
• Level 2: country borders based on the political entities listed on
the ISO 3166 standard.
• Level 3 to 11: subnational borders such as ``state'', ``province'',
``region'' and ``district''.
• Data Access
• Nominatim Web API for querying OSM
• The Overpass API for fetching specific OSM data
• Planet.osm Data (over 617GB uncompressed!)
OSM: Nominatim Web API
• a tool to search OSM data by name and address and to
generate synthetic addresses of OSM points (reverse
geocoding)
• Several companies provide hosted instances of Nominatim
query API, e.g MapQuest Open Initiative, PickPoint or the
OpenCage Geocoder
• API documentation
• Example usage:
• http://nominatim.openstreetmap.org/search.php?
q=amsterdam&polygon=1&country=Netherlands&format=
json&addressdetails=1
• MapQuest API
OSM Data: Example
GADM (Global Administrative Areas)
• http://www.gadm.org
• GADM is developed by University of California, Berkeley
Museum of Vertebrate Zoology, the International Rice
Research Institute and the University of California, Davis, and
with contributions of many others.
• uses other existing sources: http://www.gadm.org/links
• Administrative Boundaries
• Level 0: countries.
• Level 1 to 5: lower level subdivisions such as provinces, departments,
counties, etc. depending on the size and availability of data for the
underlying country.
• Data Access
• data is available globally and for each individual country, in different
formats: geopackage,R SpatialPolygonsDataFrame, ESRI file geodatabase, Google Earth
Flickr geo-tagged pictures
• Data from 190M geo-tagged photos on Flickr
• new smart phone do not only have a camera but also the ability to capture
location information.
• plotted all the geotagged photos associated with a particular place to
generate a mostly accurate contour of that place (something more fine-
grained than a bounding box!).
• Where On Earth (WOE) IDs
• correspond to the hierarchy of places where a photo was taken: from
country (level 1), region (level 2) county (level 3), locality (level 4) to
neighborhood (level 5).
• for a given WOE entity, approximate shape of that place is inferred.
• shapes in GeoJSON format
• view shapes at http://polymaps.org/ex/flickr.html
• download at http://www.flickr.com/services/shapefiles/2.0.1/
• more info: http://code.flickr.net/2012/10/24/2273/
Published Shapefiles for Individual Countries
• Local administrative offices or Geo-related research
centres might provide shape files specific to a country.
• E.g. for the Netherlands, shapefiles are provided by
Centraal Bureau voor de Statistiek (CBS)
• Data collection needs to be done by a group of people
in contact with Geo-related organization in countries.
• Current status
Published Geospatial RDF Datasets
•http://linkedgeodata.org and http://geoknow.eu
•a large spatial knowledge base (>400m geo elements)
which has been derived from OpenStreetMap.
•provides unique URIs and has Mappings to DBpedia.
•GeoVocab.org
• GADM-RDF: Global Administrative Areas
• NUTS-RDF: EU's Nomenclature of Territorial Units for
Statistics
Published Geospatial RDF Datasets
•http://linkedgeodata.org and http://geoknow.eu
•a large spatial knowledge base (>400m geo elements)
which has been derived from OpenStreetMap.
•provides unique URIs and has Mappings to DBpedia.
•GeoVocab.org
• GADM-RDF: Global Administrative Areas
• NUTS-RDF: EU's Nomenclature of Territorial Units for
Statistics Outdated!
No Shapefiles!
Extraction
Linked Open Data 42Ali Khalili
Linked Open Data Lifecycle
from Semi-structured sources
Linked Open Data 43Ali Khalili
Linked Open Data Lifecycle Extraction
Resource Property Value
Linked Open Data 44Ali Khalili
Linked Open Data Lifecycle Extraction DBpedia
Linked Open Data 44Ali Khalili
Linked Open Data Lifecycle Extraction DBpedia
Persian DBpedia?
Persian DBpedia (mapping Wiki)
Linked Open Data 45Ali Khalili
Linked Open Data Lifecycle Extraction DBpedia
Linked Open Data 46Ali Khalili
Linked Open Data Lifecycle Extraction
• Ad-hoc
• DBpedia extraction framework
• Generic
• OpenRefine
from Semi-structured sources
from Unstructured sources
Linked Open Data 47Ali Khalili
Linked Open Data Lifecycle Extraction
…After leaving Apple, Jobs took a few of its members with him to
found NeXT, a computer platform development company based in
Redwood City, specializing in state-of-the-art computers for higher-
education and business markets. In addition, Jobs helped to initiate
the development of the visual effects industry when he funded the
spinout of the computer graphics division of George Lucas's
company Lucasfilm in 1986. The new company, Pixar, would
eventually produce the first fully computer-animated film, Toy Story…
NLP, Text mining, Annotation
from Unstructured sources
Linked Open Data 47Ali Khalili
Linked Open Data Lifecycle Extraction
…After leaving Apple, Jobs took a few of its members with him to
found NeXT, a computer platform development company based in
Redwood City, specializing in state-of-the-art computers for higher-
education and business markets. In addition, Jobs helped to initiate
the development of the visual effects industry when he funded the
spinout of the computer graphics division of George Lucas's
company Lucasfilm in 1986. The new company, Pixar, would
eventually produce the first fully computer-animated film, Toy Story…
NLP, Text mining, Annotation
Named Entity Recognition
from Unstructured sources
Linked Open Data 47Ali Khalili
Linked Open Data Lifecycle Extraction
…After leaving Apple, Jobs took a few of its members with him to
found NeXT, a computer platform development company based in
Redwood City, specializing in state-of-the-art computers for higher-
education and business markets. In addition, Jobs helped to initiate
the development of the visual effects industry when he funded the
spinout of the computer graphics division of George Lucas's
company Lucasfilm in 1986. The new company, Pixar, would
eventually produce the first fully computer-animated film, Toy Story…
NLP, Text mining, Annotation
Named Entity Recognition
foundedBy
Relation Extraction
Named Entity Recognition
Linked Open Data 48Ali Khalili
Linked Open Data Lifecycle Extraction
http://spotlight.dbpedia.org
http://bioportal.bioontology.org/annotator
from Structured sources: Triplification
Linked Open Data 49Ali Khalili
Linked Open Data Lifecycle Extraction
• Relational Database to RDF
R2RML: RDB to RDF Mapping Language
http://www.w3.org/TR/r2rml/
• D2R Server: Accessing databases with SPARQL &
as Linked Data
http://d2rq.org/
• Sparqlify
defining RDF views on relational databases
http://sparqlify.org/
DATA EXTRACTION & CONVERSION
GeoJSON
Enrichment
Functions
Mapping
Configurations
OSM XML
PBF
ESRI shapes
triplify
mapshaper
osmtogeojson
osmosis
DATA EXTRACTION & CONVERSION
Metadata about different levels provided by OSM
http://wiki.openstreetmap.org/wiki/Tag:boundary%3Dadministrative
Storage & Querying
Linked Open Data 52Ali Khalili
Linked Open Data Lifecycle
Relational Databases vs. Triple Stores
Linked Open Data 53Ali Khalili
Linked Open Data Lifecycle Storage/Querying
• A relational databases’ (e.g. MySQL, PostgreSQL, Oracle)
natural representation is a collection interlinked tables.
• A triple stores’ (e.g. OpenSesame, AllegroGraph, Neo4j)
natural representation is a multi-relational network, or graph.
* Triple Store: it is called a triple store because in RDF, the facts
are represented in form of a triple (Subject-Predicate-Object).
Existing Triple Stores
Linked Open Data 54Ali Khalili
Linked Open Data Lifecycle Storage/Querying
• Native triple stores
4Store, AllegroGraph, BigData, Jena TDB, Sesame,
Stardog, OWLIM and uRiKa
• RDBMS-backed triple stores
Jena SDB, IBM DB2 and OpenLink Virtuoso
• NoSQL triplestores
CumulusRDF
DATA STORAGE & QUERYING
Virtuoso Geo Spatial
Geometry as SMS
internal representation
for Geo-data in RDF
SPARQL – SQL for the Linked Data
Linked Open Data 56Ali Khalili
Linked Open Data Lifecycle Storage/Querying
What can be done with SPARQL that can't be done with SQL?
SPARQL – SQL for the Linked Data
Linked Open Data 56Ali Khalili
Linked Open Data Lifecycle Storage/Querying
What can be done with SPARQL that can't be done with SQL?
• SPARQL queries are considerably better aligned with users’ mental
models of a domain.
SPARQL – SQL for the Linked Data
Linked Open Data 56Ali Khalili
Linked Open Data Lifecycle Storage/Querying
What can be done with SPARQL that can't be done with SQL?
• SPARQL queries are considerably better aligned with users’ mental
models of a domain.
SPARQL – SQL for the Linked Data
Linked Open Data 57Ali Khalili
Linked Open Data Lifecycle Storage/Querying
• SPARQL allows the conceptual data model to be fully explored
through queries.
SPARQL – SQL for the Linked Data
Linked Open Data 57Ali Khalili
Linked Open Data Lifecycle Storage/Querying
• SPARQL allows the conceptual data model to be fully explored
through queries.
- example:workPhone rdfs:subPropertyOf example:phone

- example:cellPhone rdfs:subPropertyOf example:phone

- example:homePhone rdfs:subPropertyOf example:phone
SPARQL – SQL for the Linked Data
Linked Open Data 58Ali Khalili
Linked Open Data Lifecycle Storage/Querying
• Queries that have to traverse a chain of connections are
particularly complex in SQL while very simple in SPARQL.
SPARQL – SQL for the Linked Data
Linked Open Data 58Ali Khalili
Linked Open Data Lifecycle Storage/Querying
• Queries that have to traverse a chain of connections are
particularly complex in SQL while very simple in SPARQL.
SPARQL – SQL for the Linked Data
Linked Open Data 59Ali Khalili
Linked Open Data Lifecycle Storage/Querying
• In addition to SELECT, INSERT and DELETE, SPARQL supports
ASK queries.
• SPARQL includes syntax (i.e. SERVICE) to call two or more data
sources within a single query.
• …
SPARQL Query Interface
Linked Open Data 60Ali Khalili
Linked Open Data Lifecycle Storage/Querying
http://yasgui.org/
Interlinking
Linked Open Data 61Ali Khalili
Linked Open Data Lifecycle
Interlinking
Linked Open Data 62Ali Khalili
Linked Open Data Lifecycle
• The degree to which entities that represent the same
concepts are linked to each other.
• “Connecting things that are somehow related”
• Methods
• Automatic, Semi-automatic, Manual
• Universal, Domain-specific
<http://dbpedia.org/resource/VU_University_Amsterdam>
<https://www.wikidata.org/entity/Q1065414>
SameAs
Interlinking Methods
Linked Open Data 63Ali Khalili
Linked Open Data Lifecycle
• Ontology Matching
• establish links between ontologies underlying two
data sources.
• Instance Matching (Link Discovery)
• discover links between instances contained in two
data sources.
DATA LINKAGE
- Query on metadata about the
administrative boundaries
- Find the alignment between levels
in different datasets
DATA LINKAGE
- used the possible mappings between datasets at different levels.
- check the overlaps of areas at the similar level, and for the matching areas apply
string matching to make sure that they refer to the same administrative boundary.
DATA LINKAGE
OECD
FUAs
DBpedia
GeoNames
WikiData
GADM
Flickr
Shapes
OpenStreetMap
Administrative
Boundaries
DATA LINKAGE
OECD
FUAs
DBpedia
GeoNames
WikiData
GADM
Flickr
Shapes
OpenStreetMap
Administrative
Boundaries
DATA LINKAGE
OECD
FUAs
DBpedia
GeoNames
WikiData
GADM
Flickr
Shapes
OpenStreetMap
Administrative
Boundaries
Scientific Lenses
DBpedia
Wikidata
OrgRef
GRID FundRef
Geoname
ISNI
VIAF
Cordis
?
Semantically Mapping Science (SMS) Platform
http://sms.risis.eu
Linked Data Services
http://api.sms.risis.eu/
SERVICE TO APPLICATION
http://sms.risis.eu/demos
SERVICE TO APPLICATION
https://docs.google.com/spreadsheets/d/1XhXzdAf-veqHPj0kIaeZoXE3AJa8nwNugW_nOHH1jtk/edit?usp=sharing
Use Cases
https://hyperir.cartodb.com/viz/13b5f3da-4356-11e6-a365-0e5db1731f59/public_map
(research) and innovation subsidies for organizations and
companies in the Netherlands
Use Cases
(research) and innovation subsidies for organizations and companies in the Netherlands
People Hybrid OECD FUAsBusinesses
People Hybrid OECD FUAsBusinesses
Use Cases
Universities + Companies + Projects + Boundaries
Properties of container
administrative boundaries
Collaboration between
Universities and Companies
Properties of Universities
and Companies
Use Cases
Universities + Companies + Projects + Boundaries
RVO-NL
DBpedia OpenStreetMap
GADM
Flickr
OECD FUAs
CBS-NL
Properties of container
administrative boundaries
Collaboration between
Universities and Companies
Properties of Universities
and Companies
Use Cases
Universities + Companies + Projects + Boundaries
RVO-NL
DBpedia
Leiden-Ranking
ETER
OrgRef Cordis OpenStreetMap
GADM
Flickr
OECD FUAs
Grid
CBS-NL
Properties of container
administrative boundaries
Collaboration between
Universities and Companies
Properties of Universities
and Companies
Eurostat
Summary of the Use Case
Address
FUA
Administrative Boundaries
Coordinates
geocode
Summary of the Use Case
Address
FUA
Administrative Boundaries
Coordinates
geocode
References
Linked Open Data 77Ali Khalili
Linked Open Data
• http://slidewiki.org/deck/11936_semantic-data-web-lecture-series
• Introduction to linked data and its lifecycle on the web
• http://euclid-project.eu/
• http://videolectures.net/wims2011_auer_interlinked/
• https://vimeo.com/76257120
• http://www.slideshare.net/slidarko/evolving-the-web-into-a-giant-global-
database-3880018
• http://www.dataversity.net/introduction-to-triplestores/
• http://www.topquadrant.com/2014/05/05/comparing-sparql-with-sql/

More Related Content

Viewers also liked

UNICEF Innovation: Innovation Lab Do-It-Yourself Guide
UNICEF Innovation: Innovation Lab Do-It-Yourself GuideUNICEF Innovation: Innovation Lab Do-It-Yourself Guide
UNICEF Innovation: Innovation Lab Do-It-Yourself Guide
Christopher Fabian
 

Viewers also liked (17)

Use of big data technologies in capital markets
Use of big data technologies in capital marketsUse of big data technologies in capital markets
Use of big data technologies in capital markets
 
Data Science Connect, July 22nd 2014 @IBM Innovation Center Zurich
Data Science Connect, July 22nd 2014 @IBM Innovation Center ZurichData Science Connect, July 22nd 2014 @IBM Innovation Center Zurich
Data Science Connect, July 22nd 2014 @IBM Innovation Center Zurich
 
Linked Open Data Principles, Technologies and Examples
Linked Open Data Principles, Technologies and ExamplesLinked Open Data Principles, Technologies and Examples
Linked Open Data Principles, Technologies and Examples
 
Data juice
Data juiceData juice
Data juice
 
The DATALAB - building a world-class innovation centre in data science
The DATALAB - building a world-class innovation centre in data scienceThe DATALAB - building a world-class innovation centre in data science
The DATALAB - building a world-class innovation centre in data science
 
Big data Summit
Big data SummitBig data Summit
Big data Summit
 
The Complete Guide to Capital Markets for Quantitative Professionals - Summary
The Complete Guide to Capital Markets for Quantitative Professionals - SummaryThe Complete Guide to Capital Markets for Quantitative Professionals - Summary
The Complete Guide to Capital Markets for Quantitative Professionals - Summary
 
Data Science at Atlassian: 
The transition towards a data-driven organisation
Data Science at Atlassian: 
The transition towards a data-driven organisationData Science at Atlassian: 
The transition towards a data-driven organisation
Data Science at Atlassian: 
The transition towards a data-driven organisation
 
UNICEF Innovation: Innovation Lab Do-It-Yourself Guide
UNICEF Innovation: Innovation Lab Do-It-Yourself GuideUNICEF Innovation: Innovation Lab Do-It-Yourself Guide
UNICEF Innovation: Innovation Lab Do-It-Yourself Guide
 
The Epidemiology of Innovation
The Epidemiology of InnovationThe Epidemiology of Innovation
The Epidemiology of Innovation
 
Big Data and Data Science @ BNL - D. Morgagni & L. Dell'Anna
Big Data and Data Science @ BNL - D. Morgagni & L. Dell'AnnaBig Data and Data Science @ BNL - D. Morgagni & L. Dell'Anna
Big Data and Data Science @ BNL - D. Morgagni & L. Dell'Anna
 
Open Innovation
Open Innovation Open Innovation
Open Innovation
 
Innovation can be Trained
Innovation can be TrainedInnovation can be Trained
Innovation can be Trained
 
Understand Innovation in 5 Minutes
Understand Innovation in 5 MinutesUnderstand Innovation in 5 Minutes
Understand Innovation in 5 Minutes
 
Innovation Strategy
Innovation StrategyInnovation Strategy
Innovation Strategy
 
Booz Allen Field Guide to Data Science
Booz Allen Field Guide to Data Science Booz Allen Field Guide to Data Science
Booz Allen Field Guide to Data Science
 
What is Big Data?
What is Big Data?What is Big Data?
What is Big Data?
 

More from Ali Khalili

Human-Linked Data Interaction
Human-Linked Data InteractionHuman-Linked Data Interaction
Human-Linked Data Interaction
Ali Khalili
 
LD-R Presentation at ESWC2016 Developers Hackshop
LD-R Presentation at ESWC2016 Developers HackshopLD-R Presentation at ESWC2016 Developers Hackshop
LD-R Presentation at ESWC2016 Developers Hackshop
Ali Khalili
 
conTEXT -- Lightweight Text Analytics using Linked Data
conTEXT -- Lightweight Text Analytics using Linked DataconTEXT -- Lightweight Text Analytics using Linked Data
conTEXT -- Lightweight Text Analytics using Linked Data
Ali Khalili
 
SlideWiki: Elicitation and Sharing of Knowledge using Presentations
SlideWiki: Elicitation and Sharing of Knowledge using PresentationsSlideWiki: Elicitation and Sharing of Knowledge using Presentations
SlideWiki: Elicitation and Sharing of Knowledge using Presentations
Ali Khalili
 

More from Ali Khalili (14)

FERASAT: A Serendipity-Fostering Faceted Browser for Linked Data
FERASAT: A Serendipity-Fostering Faceted Browser for Linked DataFERASAT: A Serendipity-Fostering Faceted Browser for Linked Data
FERASAT: A Serendipity-Fostering Faceted Browser for Linked Data
 
An introduction to Linked Open Data
An introduction to Linked Open DataAn introduction to Linked Open Data
An introduction to Linked Open Data
 
Human-Linked Data Interaction
Human-Linked Data InteractionHuman-Linked Data Interaction
Human-Linked Data Interaction
 
WYSIWYQ -- What You See Is What You Query
WYSIWYQ -- What You See Is What You QueryWYSIWYQ -- What You See Is What You Query
WYSIWYQ -- What You See Is What You Query
 
Semantically Mapping Science (SMS) Platform
Semantically Mapping Science (SMS) PlatformSemantically Mapping Science (SMS) Platform
Semantically Mapping Science (SMS) Platform
 
ERSA 2017: A linked open data based system for flexible delineation of geogra...
ERSA 2017: A linked open data based system for flexible delineation of geogra...ERSA 2017: A linked open data based system for flexible delineation of geogra...
ERSA 2017: A linked open data based system for flexible delineation of geogra...
 
Semantically Mapping Science (SMS)
Semantically Mapping Science (SMS)Semantically Mapping Science (SMS)
Semantically Mapping Science (SMS)
 
Adaptive Linked Data-driven Web Components: Building Flexible and Reusable Se...
Adaptive Linked Data-driven Web Components: Building Flexible and Reusable Se...Adaptive Linked Data-driven Web Components: Building Flexible and Reusable Se...
Adaptive Linked Data-driven Web Components: Building Flexible and Reusable Se...
 
LD-R Presentation at ESWC2016 Developers Hackshop
LD-R Presentation at ESWC2016 Developers HackshopLD-R Presentation at ESWC2016 Developers Hackshop
LD-R Presentation at ESWC2016 Developers Hackshop
 
Web of Data and its Status on Persian Web Data Space
Web of Data and its Status on Persian Web Data SpaceWeb of Data and its Status on Persian Web Data Space
Web of Data and its Status on Persian Web Data Space
 
An introduction to Linked (Open) Data
An introduction to Linked (Open) DataAn introduction to Linked (Open) Data
An introduction to Linked (Open) Data
 
A Semantics-based User Interface Model for Content Annotation, Authoring and ...
A Semantics-based User Interface Model for Content Annotation, Authoring and ...A Semantics-based User Interface Model for Content Annotation, Authoring and ...
A Semantics-based User Interface Model for Content Annotation, Authoring and ...
 
conTEXT -- Lightweight Text Analytics using Linked Data
conTEXT -- Lightweight Text Analytics using Linked DataconTEXT -- Lightweight Text Analytics using Linked Data
conTEXT -- Lightweight Text Analytics using Linked Data
 
SlideWiki: Elicitation and Sharing of Knowledge using Presentations
SlideWiki: Elicitation and Sharing of Knowledge using PresentationsSlideWiki: Elicitation and Sharing of Knowledge using Presentations
SlideWiki: Elicitation and Sharing of Knowledge using Presentations
 

Recently uploaded

Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptx
chadhar227
 
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
wsppdmt
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
nirzagarg
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
gajnagarg
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Bertram Ludäscher
 
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
vexqp
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
gajnagarg
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
nirzagarg
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
nirzagarg
 
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling ManjurJual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
ptikerjasaptiker
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Klinik kandungan
 
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
vexqp
 
PLE-statistics document for primary schs
PLE-statistics document for primary schsPLE-statistics document for primary schs
PLE-statistics document for primary schs
cnajjemba
 

Recently uploaded (20)

Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptx
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubai
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
 
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
 
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling ManjurJual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
 
Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........
 
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowVadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
 
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
 
PLE-statistics document for primary schs
PLE-statistics document for primary schsPLE-statistics document for primary schs
PLE-statistics document for primary schs
 

Towards a Linked Open Data Infrastructure for Science, Technology & Innovation Studies

  • 1. x Towards a Linked Open Data Infrastructure for Science, Technology & Innovation Studies Ali Khalili, PhD Department of Computer Science/Artificial Intelligence Knowledge Representation & Reasoning Research Group
  • 2. Outline • Linked (Open) Data • RISIS Project • Semantically Mapping Science (SMS) Platform • Workflow • Use Cases • Adaptive Functional Urban Areas (FUAs) to Study Innovative Activities • Gendered Dimensions in Grant Selection
  • 3. Evolution of the Web https://mcgratha.wordpress.com/
  • 4. Evolution of the Web https://mcgratha.wordpress.com/
  • 5. Linked (Open) Data • A set of best practices for publishing data on the Web. • Follows 4 simple principles: https://www.ted.com/talks/tim_berners_lee_on_the_next_web • Use HTTP URIs so that users can look up (dereference) those names. • When someone looks up a URI, provide useful information, using the open standards. • Include links to other URIs, so that users can discover more things. • Use URIs as names (identifiers) for conceptual things.
  • 6. Linked (Open) Data: Principles WWW World
  • 7. Linked (Open) Data: Principles WWW World
  • 8. Linked (Open) Data: Principles WWW World
  • 9. Linked (Open) Data: Principles WWW World
  • 10. Linked (Open) Data: Principles WWW World
  • 11. 5 Open Data make your stuff available on the Web (whatever format) under an open license make it available as structured data (e.g., Excel instead of image scan of a table) make it available in a non-proprietary open format (e.g., CSV instead of Excel) use Linked Data format (URIs to identify things, RDF to represent data) link your data to other people’s data to provide context http://5stardata.info/
  • 13. Linked Open Data Cloud 8Linked Open DataAli Khalili http://lod-cloud.net/
  • 14. Linked Open Data Cloud 8Linked Open DataAli Khalili http://lod-cloud.net/
  • 15. Linked Open Data Cloud 8Linked Open DataAli Khalili http://lod-cloud.net/
  • 16. Linked Open Data Cloud 8Linked Open DataAli Khalili http://lod-cloud.net/
  • 17. Linked Open Data Cloud 8Linked Open DataAli Khalili http://lod-cloud.net/
  • 18. Linked Open Data Cloud 8Linked Open DataAli Khalili http://lod-cloud.net/
  • 19. Linked Open Data Cloud 8Linked Open DataAli Khalili http://lod-cloud.net/
  • 20. Linked Open Data Cloud 8Linked Open DataAli Khalili http://lod-cloud.net/
  • 21. Linked Open Data Cloud http://lod-cloud.net/
  • 22. Linked Open Data: Statistics http://lodlaundromat.org/ http://stats.lod2.eu/ more than 3426 datasets
  • 23. Linked Open Data: Examples https://en.wikipedia.org/wiki/Paris
  • 24. Linked Open Data: Examples Resource Property Value https://en.wikipedia.org/wiki/Paris
  • 25. Linked Open Data: Examples http://dbpedia.org/resource/Paris
  • 26. Linked Open Data: Examples • Give me a list of capital cities in Europe with population more than 500,000 • Who are mayors of central European towns elevated more than 1000m? • Which movies are starring both Brad Pitt and Angelina Jolie? • All soccer players, who played as goalkeeper for a club that has a stadium with more than 40.000 seats and who are born in a country with more than 10 million inhabitants • …
  • 27. Linked Open Data: Examples • Give me a list of capital cities in Europe with population more than 500,000 • Who are mayors of central European towns elevated more than 1000m? • Which movies are starring both Brad Pitt and Angelina Jolie? • All soccer players, who played as goalkeeper for a club that has a stadium with more than 40.000 seats and who are born in a country with more than 10 million inhabitants • …
  • 28. Linked Open Data: Examples
  • 29. Linked Open Data: Examples
  • 30. Linked Open Data: Examples https://www.google.com/cse/
  • 31. Linked Open Data: Examples http://www.wolframalpha.com/
  • 33. RISIS EU Project (http://risis.eu) http://datasets.risis.eu/
  • 34. RISIS Datasets: Entity Types Organization Product Agreement Person Policy Policy Evaluation Location CIB ETER EUPRO JOREP Leiden-Ranking MORE I Nano Profile SIPER VICO Higher Education Firm Funding Body Publication Patent Project Investment Funding Program
  • 35. RISIS Datasets: Entity Types Organization Product Agreement Person Policy Policy Evaluation Location CIB ETER EUPRO JOREP Leiden-Ranking MORE I Nano Profile SIPER VICO Higher Education Firm Funding Body Publication Patent Project Investment Funding Program
  • 36. Semantically Mapping Science (SMS) Platform http://sms.risis.eu
  • 40. defined by OECD in collaboration with EC/Eurostat consider factors beyond the predefined city boundaries to better reflect the economic geography of where people live and work Functional Urban Areas (FUAs) OECD Metropolitan eXplorer: http://measuringurban.oecd.org
  • 41. defined by OECD in collaboration with EC/Eurostat consider factors beyond the predefined city boundaries to better reflect the economic geography of where people live and work population area GDP environment (CO2 emissions and air pollution) labour market (employment and unemployment growth) innovation (patent intensity) urban form and territorial organization Functional Urban Areas (FUAs) OECD Metropolitan eXplorer: http://measuringurban.oecd.org
  • 46. Problem Address FUA ? • Vrije Universiteit Amsterdam • De Boelelaan 1105, 1081 HV Amsterdam Amsterdam (NL002)
  • 47. Problem Address FUA ? • Vrije Universiteit Amsterdam • De Boelelaan 1105, 1081 HV Amsterdam Amsterdam (NL002) OECD FUAs List
  • 48. Problem Address FUA ? • Vrije Universiteit Amsterdam • De Boelelaan 1105, 1081 HV Amsterdam Amsterdam (NL002) - Geocode to LAU (municipality) OECD FUAs List
  • 49. Problem Address FUA ? • Vrije Universiteit Amsterdam • De Boelelaan 1105, 1081 HV Amsterdam Amsterdam (NL002) - Geocode to LAU (municipality) OECD FUAs List
  • 50. Problem Address FUA ? • Vrije Universiteit Amsterdam • De Boelelaan 1105, 1081 HV Amsterdam Amsterdam (NL002) - Geocode to LAU (municipality) - Shapefiles for FUAs or LAUs? OECD FUAs List
  • 51. Problem Address FUA ? • Vrije Universiteit Amsterdam • De Boelelaan 1105, 1081 HV Amsterdam Amsterdam (NL002) - Geocode to LAU (municipality) - Shapefiles for FUAs or LAUs? OECD FUAs List
  • 52. Linked Open Data 28Ali Khalili Linked Open Data Interlinking Enrichment Quality Analysis Evolution Exploration Extraction Storage/ Querying Authoring Linked (Open) Data Lifecycle http://stack.linkeddata.org/
  • 53. Linked Open Data 28Ali Khalili Linked Open Data Interlinking Enrichment Quality Analysis Evolution Exploration Extraction Storage/ Querying Authoring Linked (Open) Data Lifecycle http://stack.linkeddata.org/
  • 54. Linked Open Data 29Ali Khalili Linked Open Data Lifecycle Exploration
  • 55. Linked Open Data 29Ali Khalili Linked Open Data Lifecycle • Search • Browse • Visualize Exploration
  • 56. Search for Linked Data Linked Open Data 30Ali Khalili Linked Open Data Lifecycle Exploration http://lov.okfn.org/
  • 57. Search for Linked Data Linked Open Data 31Ali Khalili Linked Open Data Lifecycle Exploration http://schema.org/ http://bl.ocks.org/danbri/1c121ea8bd2189cf411c
  • 58. Search for Linked Data Linked Open Data 32Ali Khalili Linked Open Data Lifecycle Data hub http://datahub.io search for data, register published datasets, create and manage groups of datasets… Exploration
  • 59. Search for Linked Data Linked Open Data 33Ali Khalili Linked Open Data Lifecycle Exploration http://lotus.lodlaundromat.org
  • 60. Search for Linked Data Linked Open Data 34Ali Khalili Linked Open Data Lifecycle Exploration • OpenStreepMap (OSM) • Database of Global Administrative Areas (GADM) • Flickr Shapefiles Dataset • Published Shapefiles for Individual Countries • Published Geospatial RDF Datasets Example
  • 61. OpenStreetMap (OSM) • https://www.openstreetmap.org • built by a community of mappers that contribute and maintain data about roads, trails, cafés, railway stations, and much more, all over the world. • Administrative Boundaries • Level 1: super-national administrations e.g. European Union. • Level 2: country borders based on the political entities listed on the ISO 3166 standard. • Level 3 to 11: subnational borders such as ``state'', ``province'', ``region'' and ``district''. • Data Access • Nominatim Web API for querying OSM • The Overpass API for fetching specific OSM data • Planet.osm Data (over 617GB uncompressed!)
  • 62. OSM: Nominatim Web API • a tool to search OSM data by name and address and to generate synthetic addresses of OSM points (reverse geocoding) • Several companies provide hosted instances of Nominatim query API, e.g MapQuest Open Initiative, PickPoint or the OpenCage Geocoder • API documentation • Example usage: • http://nominatim.openstreetmap.org/search.php? q=amsterdam&polygon=1&country=Netherlands&format= json&addressdetails=1 • MapQuest API
  • 64. GADM (Global Administrative Areas) • http://www.gadm.org • GADM is developed by University of California, Berkeley Museum of Vertebrate Zoology, the International Rice Research Institute and the University of California, Davis, and with contributions of many others. • uses other existing sources: http://www.gadm.org/links • Administrative Boundaries • Level 0: countries. • Level 1 to 5: lower level subdivisions such as provinces, departments, counties, etc. depending on the size and availability of data for the underlying country. • Data Access • data is available globally and for each individual country, in different formats: geopackage,R SpatialPolygonsDataFrame, ESRI file geodatabase, Google Earth
  • 65. Flickr geo-tagged pictures • Data from 190M geo-tagged photos on Flickr • new smart phone do not only have a camera but also the ability to capture location information. • plotted all the geotagged photos associated with a particular place to generate a mostly accurate contour of that place (something more fine- grained than a bounding box!). • Where On Earth (WOE) IDs • correspond to the hierarchy of places where a photo was taken: from country (level 1), region (level 2) county (level 3), locality (level 4) to neighborhood (level 5). • for a given WOE entity, approximate shape of that place is inferred. • shapes in GeoJSON format • view shapes at http://polymaps.org/ex/flickr.html • download at http://www.flickr.com/services/shapefiles/2.0.1/ • more info: http://code.flickr.net/2012/10/24/2273/
  • 66. Published Shapefiles for Individual Countries • Local administrative offices or Geo-related research centres might provide shape files specific to a country. • E.g. for the Netherlands, shapefiles are provided by Centraal Bureau voor de Statistiek (CBS) • Data collection needs to be done by a group of people in contact with Geo-related organization in countries. • Current status
  • 67. Published Geospatial RDF Datasets •http://linkedgeodata.org and http://geoknow.eu •a large spatial knowledge base (>400m geo elements) which has been derived from OpenStreetMap. •provides unique URIs and has Mappings to DBpedia. •GeoVocab.org • GADM-RDF: Global Administrative Areas • NUTS-RDF: EU's Nomenclature of Territorial Units for Statistics
  • 68. Published Geospatial RDF Datasets •http://linkedgeodata.org and http://geoknow.eu •a large spatial knowledge base (>400m geo elements) which has been derived from OpenStreetMap. •provides unique URIs and has Mappings to DBpedia. •GeoVocab.org • GADM-RDF: Global Administrative Areas • NUTS-RDF: EU's Nomenclature of Territorial Units for Statistics Outdated! No Shapefiles!
  • 69. Extraction Linked Open Data 42Ali Khalili Linked Open Data Lifecycle
  • 70. from Semi-structured sources Linked Open Data 43Ali Khalili Linked Open Data Lifecycle Extraction Resource Property Value
  • 71. Linked Open Data 44Ali Khalili Linked Open Data Lifecycle Extraction DBpedia
  • 72. Linked Open Data 44Ali Khalili Linked Open Data Lifecycle Extraction DBpedia Persian DBpedia?
  • 73. Persian DBpedia (mapping Wiki) Linked Open Data 45Ali Khalili Linked Open Data Lifecycle Extraction DBpedia
  • 74. Linked Open Data 46Ali Khalili Linked Open Data Lifecycle Extraction • Ad-hoc • DBpedia extraction framework • Generic • OpenRefine from Semi-structured sources
  • 75. from Unstructured sources Linked Open Data 47Ali Khalili Linked Open Data Lifecycle Extraction …After leaving Apple, Jobs took a few of its members with him to found NeXT, a computer platform development company based in Redwood City, specializing in state-of-the-art computers for higher- education and business markets. In addition, Jobs helped to initiate the development of the visual effects industry when he funded the spinout of the computer graphics division of George Lucas's company Lucasfilm in 1986. The new company, Pixar, would eventually produce the first fully computer-animated film, Toy Story… NLP, Text mining, Annotation
  • 76. from Unstructured sources Linked Open Data 47Ali Khalili Linked Open Data Lifecycle Extraction …After leaving Apple, Jobs took a few of its members with him to found NeXT, a computer platform development company based in Redwood City, specializing in state-of-the-art computers for higher- education and business markets. In addition, Jobs helped to initiate the development of the visual effects industry when he funded the spinout of the computer graphics division of George Lucas's company Lucasfilm in 1986. The new company, Pixar, would eventually produce the first fully computer-animated film, Toy Story… NLP, Text mining, Annotation Named Entity Recognition
  • 77. from Unstructured sources Linked Open Data 47Ali Khalili Linked Open Data Lifecycle Extraction …After leaving Apple, Jobs took a few of its members with him to found NeXT, a computer platform development company based in Redwood City, specializing in state-of-the-art computers for higher- education and business markets. In addition, Jobs helped to initiate the development of the visual effects industry when he funded the spinout of the computer graphics division of George Lucas's company Lucasfilm in 1986. The new company, Pixar, would eventually produce the first fully computer-animated film, Toy Story… NLP, Text mining, Annotation Named Entity Recognition foundedBy Relation Extraction
  • 78. Named Entity Recognition Linked Open Data 48Ali Khalili Linked Open Data Lifecycle Extraction http://spotlight.dbpedia.org http://bioportal.bioontology.org/annotator
  • 79. from Structured sources: Triplification Linked Open Data 49Ali Khalili Linked Open Data Lifecycle Extraction • Relational Database to RDF R2RML: RDB to RDF Mapping Language http://www.w3.org/TR/r2rml/ • D2R Server: Accessing databases with SPARQL & as Linked Data http://d2rq.org/ • Sparqlify defining RDF views on relational databases http://sparqlify.org/
  • 80. DATA EXTRACTION & CONVERSION GeoJSON Enrichment Functions Mapping Configurations OSM XML PBF ESRI shapes triplify mapshaper osmtogeojson osmosis
  • 81. DATA EXTRACTION & CONVERSION Metadata about different levels provided by OSM http://wiki.openstreetmap.org/wiki/Tag:boundary%3Dadministrative
  • 82. Storage & Querying Linked Open Data 52Ali Khalili Linked Open Data Lifecycle
  • 83. Relational Databases vs. Triple Stores Linked Open Data 53Ali Khalili Linked Open Data Lifecycle Storage/Querying • A relational databases’ (e.g. MySQL, PostgreSQL, Oracle) natural representation is a collection interlinked tables. • A triple stores’ (e.g. OpenSesame, AllegroGraph, Neo4j) natural representation is a multi-relational network, or graph. * Triple Store: it is called a triple store because in RDF, the facts are represented in form of a triple (Subject-Predicate-Object).
  • 84. Existing Triple Stores Linked Open Data 54Ali Khalili Linked Open Data Lifecycle Storage/Querying • Native triple stores 4Store, AllegroGraph, BigData, Jena TDB, Sesame, Stardog, OWLIM and uRiKa • RDBMS-backed triple stores Jena SDB, IBM DB2 and OpenLink Virtuoso • NoSQL triplestores CumulusRDF
  • 85. DATA STORAGE & QUERYING Virtuoso Geo Spatial Geometry as SMS internal representation for Geo-data in RDF
  • 86. SPARQL – SQL for the Linked Data Linked Open Data 56Ali Khalili Linked Open Data Lifecycle Storage/Querying What can be done with SPARQL that can't be done with SQL?
  • 87. SPARQL – SQL for the Linked Data Linked Open Data 56Ali Khalili Linked Open Data Lifecycle Storage/Querying What can be done with SPARQL that can't be done with SQL? • SPARQL queries are considerably better aligned with users’ mental models of a domain.
  • 88. SPARQL – SQL for the Linked Data Linked Open Data 56Ali Khalili Linked Open Data Lifecycle Storage/Querying What can be done with SPARQL that can't be done with SQL? • SPARQL queries are considerably better aligned with users’ mental models of a domain.
  • 89. SPARQL – SQL for the Linked Data Linked Open Data 57Ali Khalili Linked Open Data Lifecycle Storage/Querying • SPARQL allows the conceptual data model to be fully explored through queries.
  • 90. SPARQL – SQL for the Linked Data Linked Open Data 57Ali Khalili Linked Open Data Lifecycle Storage/Querying • SPARQL allows the conceptual data model to be fully explored through queries. - example:workPhone rdfs:subPropertyOf example:phone - example:cellPhone rdfs:subPropertyOf example:phone - example:homePhone rdfs:subPropertyOf example:phone
  • 91. SPARQL – SQL for the Linked Data Linked Open Data 58Ali Khalili Linked Open Data Lifecycle Storage/Querying • Queries that have to traverse a chain of connections are particularly complex in SQL while very simple in SPARQL.
  • 92. SPARQL – SQL for the Linked Data Linked Open Data 58Ali Khalili Linked Open Data Lifecycle Storage/Querying • Queries that have to traverse a chain of connections are particularly complex in SQL while very simple in SPARQL.
  • 93. SPARQL – SQL for the Linked Data Linked Open Data 59Ali Khalili Linked Open Data Lifecycle Storage/Querying • In addition to SELECT, INSERT and DELETE, SPARQL supports ASK queries. • SPARQL includes syntax (i.e. SERVICE) to call two or more data sources within a single query. • …
  • 94. SPARQL Query Interface Linked Open Data 60Ali Khalili Linked Open Data Lifecycle Storage/Querying http://yasgui.org/
  • 95. Interlinking Linked Open Data 61Ali Khalili Linked Open Data Lifecycle
  • 96. Interlinking Linked Open Data 62Ali Khalili Linked Open Data Lifecycle • The degree to which entities that represent the same concepts are linked to each other. • “Connecting things that are somehow related” • Methods • Automatic, Semi-automatic, Manual • Universal, Domain-specific <http://dbpedia.org/resource/VU_University_Amsterdam> <https://www.wikidata.org/entity/Q1065414> SameAs
  • 97. Interlinking Methods Linked Open Data 63Ali Khalili Linked Open Data Lifecycle • Ontology Matching • establish links between ontologies underlying two data sources. • Instance Matching (Link Discovery) • discover links between instances contained in two data sources.
  • 98. DATA LINKAGE - Query on metadata about the administrative boundaries - Find the alignment between levels in different datasets
  • 99. DATA LINKAGE - used the possible mappings between datasets at different levels. - check the overlaps of areas at the similar level, and for the matching areas apply string matching to make sure that they refer to the same administrative boundary.
  • 104. Semantically Mapping Science (SMS) Platform http://sms.risis.eu
  • 108. Use Cases https://hyperir.cartodb.com/viz/13b5f3da-4356-11e6-a365-0e5db1731f59/public_map (research) and innovation subsidies for organizations and companies in the Netherlands
  • 109. Use Cases (research) and innovation subsidies for organizations and companies in the Netherlands People Hybrid OECD FUAsBusinesses People Hybrid OECD FUAsBusinesses
  • 110. Use Cases Universities + Companies + Projects + Boundaries Properties of container administrative boundaries Collaboration between Universities and Companies Properties of Universities and Companies
  • 111. Use Cases Universities + Companies + Projects + Boundaries RVO-NL DBpedia OpenStreetMap GADM Flickr OECD FUAs CBS-NL Properties of container administrative boundaries Collaboration between Universities and Companies Properties of Universities and Companies
  • 112. Use Cases Universities + Companies + Projects + Boundaries RVO-NL DBpedia Leiden-Ranking ETER OrgRef Cordis OpenStreetMap GADM Flickr OECD FUAs Grid CBS-NL Properties of container administrative boundaries Collaboration between Universities and Companies Properties of Universities and Companies Eurostat
  • 113. Summary of the Use Case Address FUA Administrative Boundaries Coordinates geocode
  • 114. Summary of the Use Case Address FUA Administrative Boundaries Coordinates geocode
  • 115.
  • 116. References Linked Open Data 77Ali Khalili Linked Open Data • http://slidewiki.org/deck/11936_semantic-data-web-lecture-series • Introduction to linked data and its lifecycle on the web • http://euclid-project.eu/ • http://videolectures.net/wims2011_auer_interlinked/ • https://vimeo.com/76257120 • http://www.slideshare.net/slidarko/evolving-the-web-into-a-giant-global- database-3880018 • http://www.dataversity.net/introduction-to-triplestores/ • http://www.topquadrant.com/2014/05/05/comparing-sparql-with-sql/