SlideShare a Scribd company logo
1 of 27
Download to read offline
Designing a Multilingual Knowledge
Graph as a Service for Cultural
Heritage
Some Challenges and Solutions
Valentine Charles, Hugo Manguinhas, Antoine Isaac - Europeana
Foundation
Nuno Freire - INESC-ID
Sergiu Gordea - AIT
DCMI Conference 2018
What is Europeana?
CC BY-SA
We aggregate metadata:
•  From all EU countries
•  ~3,700 galleries, libraries,
archives and museums
•  More than 58M objects
•  In more than 40 languages
•  Huge amount of references to
places, agents, concepts, time
periods
Europeana aggregation infrastructure
Europeana| CC BY-SA
The Platform for Europe’s Digital Cultural Heritage
Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
Europeana Linked Data Strategy
Our lines of work
CC BY-SA
•  The Europeana Data Model (EDM) offers a base for linking
metadata
•  We apply automatic enrichment to link object metadata to
reference datasets
•  We encourage data providers to contribute their own links to
vocabularies
•  We encourage alignment activities between domain
vocabularies
Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
Europeana Linked Data Strategy
LOD Vocabularies currently recognized by Europeana in providers'
metadata
CC BY-SA
Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
Vocabulary URL
MIMO Concepts http://www.mimo-db.eu/
MIMO Instrument makers http://www.mimo-db.eu/
The Getty - Art & Architecture Thesaurus (AAT) http://vocab.getty.edu/
The Getty - Union List of Artist Names (ULAN) http://vocab.getty.edu/
Virtual International Authority File (VIAF) http://viaf.org/viaf/
Geonames http://sws.geonames.org/
IconClass http://iconclass.org/
Gemeinsame Normdatei (GND) http://d-nb.info/gnd
Israel Museum Jerusalem Concepts http://www.imj.org.il/imagine/thesaurus/objects/
Partage Plus concepts http://partage.vocnet.org/
data.europeana.eu WWI Concepts from Library of Congress
Subject Headings (LCSH) http://data.europeana.eu/concept/loc
Europeana Sounds Genres http://data.europeana.eu/concept/soundgenres/
EAGLE Material & Object Type http://www.eagle-network.eu/voc/
DISMARC Formats & Genres http://purl.org/dismarc/ns/
UDC http://udcdata.info/rdf/
UNESCO Thesaurus http://vocabularies.unesco.org/thesaurus/
Europeana Linked Data Strategy
A strategy for Entities
CC BY-SA
We are building an "Entity Collection"
•  A service that acts as a centralized point of reference and access to
data about contextual entities: places, agents (persons and
organizations), concepts...
•  Caching and curating data from the wider Linked Open Data cloud
•  A sort of Europeana "knowledge graph" with an API
•  A service can be re-used by everyone in our community
Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
Uses cases for the Entity Collection (1/2)
CC BY-SA
Improve user experience on Europeana services
●  Findability: users can search with and for people, places and subjects, not only objects. In many
more languages, and with less ambiguity
●  Contextualization: users see contextual information about cultural heritage objects. Entity Pages
group and present all assertions about an entity
●  Exploration: Browsing along relationships between objects and entities and between entities
Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
Semantic auto-
completion
Entity Pages Entity based facets
Europeana Food & Drink
Project
Uses cases for the Entity Collection (2/2)
CC BY-SA
Crowdsourcing
●  Objects can be annotated with references to
entities of their context
Automatic enrichment of providers' metadata
●  A controlled vocabulary to help recognize references to entities
Republication for Re-use
●  Entities can be republished as an open source to the community
Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
Semantic and
Metadata annotations
Pundit Annotation Client
from Digital Manuscripts to
Euiropeana (DM2E)
Related work
CC BY-SA
•  Knowledge graph creation and maintenance: Google's Knowledge
Graph, DBpedia, Wikidata, BabelNet, VIAF, Entity Facts, SNAC,
Europeana Food and Drinks
•  (Vocabulary) web services: STW, DigitalNZ's Concept API...
•  W3C's best practices for publishing (linked) data on the Web
•  Data alignment tools and methods
•  Semantic discovery services: Worldcat Identities, etc.
Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
In this presentation
CC BY-SA
•  Challenges, decisions and results for
•  Building a knowledge graph for Europeana
•  Accessing it for exploitation
•  Disclaimers:
•  The focus is operational, this is not groundbreaking research!
Sometimes we will state the obvious...
•  Europeana's Entity Collection is still work in progress!
Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
The things we'll be talking about
Entity Collection processes in Europeana
CC BY-SA
Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
Building a knowledge graph
for Europeana
France, Public Domain
1914, National Library of France
Agence de presse Meurisse
Concours de cycles nautiques sur le lac
d’Enghien : Berregent piloté par Austerling
Selecting data sources
CC BY-SA
An intellectual effort by data experts, leveraging the following criteria:
•  Availability and access: open license, published on the web as linked
data
•  Granularity and coverage: similar or complementary sets of entities,
multilingual data, helping to answer key user needs for Europeana's CH
collections. Too generic datasets can create too much ambiguity for the
simple processes we have (e.g. enrichment)
•  Size: larger vocabularies are useful, but sometimes create too much
ambiguity
•  Quality: intrinsic aspects like correctness of representation (data
structures)
•  Connectivity: good data sources are well-connected internally and
externally to other datasets
An approach based on pivot (e.g. DBpedia, Wikidata) and specialized data
sources (e.g. AAT) is likely to work well if links can be made between them
Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
Statement selection and mappings
CC BY-SA
Conceptually distinct steps, which may be grouped at implementation time:
•  Selection of entities within one dataset
•  E.g. selecting artists (excluding pop stars) from DBpedia
•  Mappings between model of data source and model of KG
•  Made easier if both use standards, e.g. SKOS or FOAF - both re-used
by the Europeana Data Model
•  Selection of relevant statements
•  E.g. filtering unwanted languages larger vocabularies are useful, but
sometimes create too much ambiguity
Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
Example: integrating DBpedia resources
CC BY-SA
Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
Data integration and reconciliation
CC BY-SA
•  Import of entities needs integrating old statements with new ones if
entities are redundant recognized to be the same (via available sameAs-
like equivalence statements)
•  Several options for data integration on entities:
a.  unification: lumping all statements together - possibly leading to
inconsistencies or cardinality constraints
b.  first come, first serve: only adding statements from most recent sources
when it's possible, i.e. not violating cardinality constraints
c.  most representative: possibly replacing old statements by new ones if
they come from a "prefered" source
d.  differentiated most representative: grouping data sources in sets that
have different level of preference, so as to apply c between these
sets. b and c may be applied within the sets.
Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
Alignment and curation
CC BY-SA
•  Alignment aims at recognizing more equivalent entities in the data
sources to be integrated
•  It can use automatic or semi-automatic tools like Wikidata
Mix'n'Match or CultuurLink, both experimented in Europeana's
context
•  It requires a lot of effort/expertise at the scale we're considering!
•  Curators should be able to edit data to maintain integrity in the Entity
Collection or elsewhere downstream
•  Removing, editing, adding statements or deprecating entities, e.g. to
prevent ambiguities that lead to wrong enrichment of object
metadata
Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
Data currently in the Entity Collection
CC BY-SA
Mostly corresponding to a selection made for Europeana's
Semantic Enrichment
•  Places
a subset of Geonames, corresponding to places which are part of
European countries and of some specific feature classes.
•  Agents
a subset of DBpedia corresponding to most of the instances of dbp:Artist
with some exceptions, and integrated from 49 DBpedia language editions.
•  Concepts
a subset of DBpedia corresponding to a selection concepts matching the
needs from Europeana Collections (e.g., WWI battles).
Europeana Sounds music genres (obtained from Wikidata)
Photo Consortium's photography vocabulary
•  Organizations
Extracted from Europeana's CRM and aligned to Wikidata when possible
216,302
resources
1,572
resources
165,005
resources
1,077
resources
Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
Multilingual coverage of the Entity Collection
And its contribution to automatic enrichment
CC BY-SA
Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
Entities effectively used to enrich Europeana Objects
Entities present in the Entity Collection
https://docs.google.com/document/d/1Nek_SPDtIR3waYwwdgRHY1YiTCj3SK8Eh9xYpnSkFkY
France, Public Domain
1932, National Library of France
Agence de presse Mondial Photo-Presse.
Tournoi royal de motos à Londres :
changement d'une roue de side-car en marche
Accessing and
exploiting the data in
the Entity Collection
Entity URIs
CC BY-SA
•  Step 0 for publishing our Linked Data on the web: minting URIs for our
entities
•  It's always a difficult choice between length, ease of persistence, human-
friendliness...
•  We've looked at best practices and consulted our community
•  Chosen pattern is http://data.europeana.eu/{entity_class}/
{scheme}/{localID}
•  Where localID is a sequential identifier
•  For example, the agent Leonardo da Vinci: http://data.europeana.eu/
agent/base/146741
Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
The Entity API
CC BY-SA
•  Still on alpha state
•  Looking at best practices for Linked Data and JSON-LD
•  Available documentation for this API at:
https://pro.europeana.eu/resources/apis/entity
Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
The Entity API - entity look-up/resolving
CC BY-SA
•  Linked Data content negotiation at data.europeana.eu (for JSON-LD and
HTML)
•  API call for JSON-LD via entity identifiers
•  https://www.europeana.eu/api/entities/[entity_class]/base/
[ID].jsonld
•  resolve method for getting data on an Entity for an external URI (that
appear in sameAs-like equivalence statement)
•  https://www.europeana.eu/api/entities/resolve?uri=[URI]
•  We created a specific JSON-LD context to make the data easier to
consume for web developers, e.g. hiding RDF namespace abbreviations
Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
The Entity Collection
DBpedia resource for “Mozart” in our data
CC BY-SA
Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
Coreference links to 6 other
datasets
(e.g. Freebase, Wikidata)
Inter-linking information… still
need to switch references to link
to Europeana Entities
Preferred labels for 48
languages
The Entity API - discovery
CC BY-SA
•  Suggestion of entities for a type string, used the auto-complete function in
Europeana's search box
•  https://www.europeana.eu/api/entities/suggest
•  For example: https://www.europeana.eu/api/entities/suggest?
wskey=apidemo&text=leo&type=agent
•  Ranking of suggestions is based on:
•  Europeana relevance: number of Europeana objects whose
description contains one of the entity's labels
•  Popularity as computed in the Wikidata pagerank (Diefenbach &
Thalhammer, 2018)
•  Coming soon: general entity search, based on free (Solr-style) querying on
metadata fields for entities
Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
Entity API - suggest method
CC BY-SA
Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
/entities/suggest.json?text=neo
Conclusions
CC BY-SA
•  We've made enough progress to release a first version of the
Entity Collection and its API, used in Europeana's production
services.
•  But there are still challenges and decisions to ensure
consistency and relevance over time:
•  Expand data coverage (and test extensibility) with new data
sources for, e.g., events
•  Continue elaborating and testing data integration and filtration
strategies
•  Employ the EC to better enrich Europeana object metadata
•  Enhance discoverability, especially for search engines, e.g. via
Schema.org publication
Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
Thank you!
antoine.isaac@europeana.eu
@antoine_isaac

More Related Content

What's hot

AAC Education Session
AAC Education Session AAC Education Session
AAC Education Session Antoine Isaac
 
W3C Library Linked Data Incubator Group - 2011
W3C Library Linked Data Incubator Group  - 2011W3C Library Linked Data Incubator Group  - 2011
W3C Library Linked Data Incubator Group - 2011Antoine Isaac
 
Europeana - American Art Collaborative LOD Meeting
Europeana - American Art Collaborative LOD MeetingEuropeana - American Art Collaborative LOD Meeting
Europeana - American Art Collaborative LOD MeetingAntoine Isaac
 
European databases in cultural heritage: making connections
European databases in cultural heritage: making connectionsEuropean databases in cultural heritage: making connections
European databases in cultural heritage: making connectionsCARARE
 
Europeana, more than data aggregation?
Europeana, more than data aggregation?Europeana, more than data aggregation?
Europeana, more than data aggregation?Antoine Isaac
 
Linking data for Europeana
Linking data for EuropeanaLinking data for Europeana
Linking data for EuropeanaAntoine Isaac
 
Europeana and the relevance of the DM2E results
Europeana and the relevance of the DM2E resultsEuropeana and the relevance of the DM2E results
Europeana and the relevance of the DM2E resultsAntoine Isaac
 
EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015Antoine Isaac
 
A portrait of Europeana as a Linked Open Data case
A portrait of Europeana as a Linked Open Data caseA portrait of Europeana as a Linked Open Data case
A portrait of Europeana as a Linked Open Data caseAntoine Isaac
 
EIFL 2014 - Linked Open Data
EIFL 2014 - Linked Open DataEIFL 2014 - Linked Open Data
EIFL 2014 - Linked Open DataAntoine Isaac
 
The Europeana Datamodel: A semantic layer on top of Cultural Heritage Objects
The Europeana Datamodel: A semantic layer on top of Cultural Heritage ObjectsThe Europeana Datamodel: A semantic layer on top of Cultural Heritage Objects
The Europeana Datamodel: A semantic layer on top of Cultural Heritage ObjectsLIBIS
 
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...DeVonne Parks, CEM
 
Data modelling at Europeana and DM2E - SMW13
Data modelling at Europeana and DM2E - SMW13Data modelling at Europeana and DM2E - SMW13
Data modelling at Europeana and DM2E - SMW13Antoine Isaac
 
Europeana vision - Web as Literature 2013
Europeana vision - Web as Literature 2013Europeana vision - Web as Literature 2013
Europeana vision - Web as Literature 2013Antoine Isaac
 
Europeana DSI - LT-Accelerate 14
Europeana DSI -  LT-Accelerate 14Europeana DSI -  LT-Accelerate 14
Europeana DSI - LT-Accelerate 14Antoine Isaac
 
Europeana and Schema.org - DC2013
Europeana and Schema.org - DC2013Europeana and Schema.org - DC2013
Europeana and Schema.org - DC2013Antoine Isaac
 
Europeana Research Panel DH Benelux 2017
Europeana Research Panel DH Benelux 2017Europeana Research Panel DH Benelux 2017
Europeana Research Panel DH Benelux 2017Europeana
 
Semantic Web, Linked Data: the Europeana case(s)
Semantic Web, Linked Data: the Europeana case(s)Semantic Web, Linked Data: the Europeana case(s)
Semantic Web, Linked Data: the Europeana case(s)Antoine Isaac
 
Europeana @ NISO Bibliographic Roadmap Meeting
Europeana @ NISO Bibliographic Roadmap MeetingEuropeana @ NISO Bibliographic Roadmap Meeting
Europeana @ NISO Bibliographic Roadmap MeetingAntoine Isaac
 
EDM - American Art Collaborative LOD Meeting
EDM - American Art Collaborative LOD MeetingEDM - American Art Collaborative LOD Meeting
EDM - American Art Collaborative LOD MeetingAntoine Isaac
 

What's hot (20)

AAC Education Session
AAC Education Session AAC Education Session
AAC Education Session
 
W3C Library Linked Data Incubator Group - 2011
W3C Library Linked Data Incubator Group  - 2011W3C Library Linked Data Incubator Group  - 2011
W3C Library Linked Data Incubator Group - 2011
 
Europeana - American Art Collaborative LOD Meeting
Europeana - American Art Collaborative LOD MeetingEuropeana - American Art Collaborative LOD Meeting
Europeana - American Art Collaborative LOD Meeting
 
European databases in cultural heritage: making connections
European databases in cultural heritage: making connectionsEuropean databases in cultural heritage: making connections
European databases in cultural heritage: making connections
 
Europeana, more than data aggregation?
Europeana, more than data aggregation?Europeana, more than data aggregation?
Europeana, more than data aggregation?
 
Linking data for Europeana
Linking data for EuropeanaLinking data for Europeana
Linking data for Europeana
 
Europeana and the relevance of the DM2E results
Europeana and the relevance of the DM2E resultsEuropeana and the relevance of the DM2E results
Europeana and the relevance of the DM2E results
 
EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015
 
A portrait of Europeana as a Linked Open Data case
A portrait of Europeana as a Linked Open Data caseA portrait of Europeana as a Linked Open Data case
A portrait of Europeana as a Linked Open Data case
 
EIFL 2014 - Linked Open Data
EIFL 2014 - Linked Open DataEIFL 2014 - Linked Open Data
EIFL 2014 - Linked Open Data
 
The Europeana Datamodel: A semantic layer on top of Cultural Heritage Objects
The Europeana Datamodel: A semantic layer on top of Cultural Heritage ObjectsThe Europeana Datamodel: A semantic layer on top of Cultural Heritage Objects
The Europeana Datamodel: A semantic layer on top of Cultural Heritage Objects
 
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
 
Data modelling at Europeana and DM2E - SMW13
Data modelling at Europeana and DM2E - SMW13Data modelling at Europeana and DM2E - SMW13
Data modelling at Europeana and DM2E - SMW13
 
Europeana vision - Web as Literature 2013
Europeana vision - Web as Literature 2013Europeana vision - Web as Literature 2013
Europeana vision - Web as Literature 2013
 
Europeana DSI - LT-Accelerate 14
Europeana DSI -  LT-Accelerate 14Europeana DSI -  LT-Accelerate 14
Europeana DSI - LT-Accelerate 14
 
Europeana and Schema.org - DC2013
Europeana and Schema.org - DC2013Europeana and Schema.org - DC2013
Europeana and Schema.org - DC2013
 
Europeana Research Panel DH Benelux 2017
Europeana Research Panel DH Benelux 2017Europeana Research Panel DH Benelux 2017
Europeana Research Panel DH Benelux 2017
 
Semantic Web, Linked Data: the Europeana case(s)
Semantic Web, Linked Data: the Europeana case(s)Semantic Web, Linked Data: the Europeana case(s)
Semantic Web, Linked Data: the Europeana case(s)
 
Europeana @ NISO Bibliographic Roadmap Meeting
Europeana @ NISO Bibliographic Roadmap MeetingEuropeana @ NISO Bibliographic Roadmap Meeting
Europeana @ NISO Bibliographic Roadmap Meeting
 
EDM - American Art Collaborative LOD Meeting
EDM - American Art Collaborative LOD MeetingEDM - American Art Collaborative LOD Meeting
EDM - American Art Collaborative LOD Meeting
 

Similar to Designing a multilingual knowledge graph - DCMI2018

The Europeana Community: Semantics and Cultural Heritage Data
The Europeana Community: Semantics and Cultural Heritage DataThe Europeana Community: Semantics and Cultural Heritage Data
The Europeana Community: Semantics and Cultural Heritage DataNuno Freire
 
Europeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) caseEuropeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) caseAntoine Isaac
 
Linked Data for EuropeanaCultural Heritage: the Europeana approach
Linked Data for EuropeanaCultural Heritage: the Europeana approachLinked Data for EuropeanaCultural Heritage: the Europeana approach
Linked Data for EuropeanaCultural Heritage: the Europeana approachValentine Charles
 
Valentine Charles: Linking cultural heritage with KOS: the Europeana example
Valentine Charles: Linking cultural heritage with KOS: the Europeana example Valentine Charles: Linking cultural heritage with KOS: the Europeana example
Valentine Charles: Linking cultural heritage with KOS: the Europeana example COST Action TD1210
 
Building a Framework for Semantic Cultural Heritage Data
Building a Framework for Semantic Cultural Heritage DataBuilding a Framework for Semantic Cultural Heritage Data
Building a Framework for Semantic Cultural Heritage DataValentine Charles
 
When Semantics support Multilingual Access to Digital Cultural Heritage - the...
When Semantics support Multilingual Access to Digital Cultural Heritage - the...When Semantics support Multilingual Access to Digital Cultural Heritage - the...
When Semantics support Multilingual Access to Digital Cultural Heritage - the...Valentine Charles
 
Links, languages and semantics: linked data approaches in The European Libra...
Links, languages and semantics: linked data approaches in The European Libra...Links, languages and semantics: linked data approaches in The European Libra...
Links, languages and semantics: linked data approaches in The European Libra...Valentine Charles
 
The Europeana Data Model - TPDL2018
The Europeana Data Model - TPDL2018The Europeana Data Model - TPDL2018
The Europeana Data Model - TPDL2018Antoine Isaac
 
Building an ecosystem of networked references
Building an ecosystem of networked referencesBuilding an ecosystem of networked references
Building an ecosystem of networked referencesHugo Manguinhas
 
Data scale and diversity issues at Europeana
Data scale and diversity issues at EuropeanaData scale and diversity issues at Europeana
Data scale and diversity issues at EuropeanaAntoine Isaac
 
Alexandria winer20100623
Alexandria winer20100623Alexandria winer20100623
Alexandria winer20100623Dov Winer
 
Eun lre brussels_winer20100616
Eun lre brussels_winer20100616Eun lre brussels_winer20100616
Eun lre brussels_winer20100616Dov Winer
 
The Europeana Strategy and Linked Open Data
The Europeana Strategy and Linked Open DataThe Europeana Strategy and Linked Open Data
The Europeana Strategy and Linked Open DataDavid Haskiya
 
Europeana and open data
Europeana and open dataEuropeana and open data
Europeana and open dataRobinaClayphan
 
Challenges on modeling annotations in the Europeana Sounds project
Challenges on modeling annotations in the Europeana Sounds projectChallenges on modeling annotations in the Europeana Sounds project
Challenges on modeling annotations in the Europeana Sounds projectHugo Manguinhas
 
Challenges on modeling annotations in the europeana sounds project
Challenges on modeling annotations in the europeana sounds projectChallenges on modeling annotations in the europeana sounds project
Challenges on modeling annotations in the europeana sounds projectEuropeana_Sounds
 
Fondly Collisions: Archival hierarchy and the Europeana Data Model
Fondly Collisions: Archival hierarchy and the Europeana Data Model   Fondly Collisions: Archival hierarchy and the Europeana Data Model
Fondly Collisions: Archival hierarchy and the Europeana Data Model Valentine Charles
 
LDBC 19 November 2013
LDBC 19 November 2013  LDBC 19 November 2013
LDBC 19 November 2013 Europeana
 
3D content in Europeana: the challenges of providing access
3D content in Europeana: the challenges of providing access3D content in Europeana: the challenges of providing access
3D content in Europeana: the challenges of providing accessCARARE
 
Wikidata, a target for Europeana’s semantic strategy (Glam-Wiki 2015)
Wikidata, a target for Europeana’s semantic strategy (Glam-Wiki 2015)Wikidata, a target for Europeana’s semantic strategy (Glam-Wiki 2015)
Wikidata, a target for Europeana’s semantic strategy (Glam-Wiki 2015)Vladimir Alexiev, PhD, PMP
 

Similar to Designing a multilingual knowledge graph - DCMI2018 (20)

The Europeana Community: Semantics and Cultural Heritage Data
The Europeana Community: Semantics and Cultural Heritage DataThe Europeana Community: Semantics and Cultural Heritage Data
The Europeana Community: Semantics and Cultural Heritage Data
 
Europeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) caseEuropeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) case
 
Linked Data for EuropeanaCultural Heritage: the Europeana approach
Linked Data for EuropeanaCultural Heritage: the Europeana approachLinked Data for EuropeanaCultural Heritage: the Europeana approach
Linked Data for EuropeanaCultural Heritage: the Europeana approach
 
Valentine Charles: Linking cultural heritage with KOS: the Europeana example
Valentine Charles: Linking cultural heritage with KOS: the Europeana example Valentine Charles: Linking cultural heritage with KOS: the Europeana example
Valentine Charles: Linking cultural heritage with KOS: the Europeana example
 
Building a Framework for Semantic Cultural Heritage Data
Building a Framework for Semantic Cultural Heritage DataBuilding a Framework for Semantic Cultural Heritage Data
Building a Framework for Semantic Cultural Heritage Data
 
When Semantics support Multilingual Access to Digital Cultural Heritage - the...
When Semantics support Multilingual Access to Digital Cultural Heritage - the...When Semantics support Multilingual Access to Digital Cultural Heritage - the...
When Semantics support Multilingual Access to Digital Cultural Heritage - the...
 
Links, languages and semantics: linked data approaches in The European Libra...
Links, languages and semantics: linked data approaches in The European Libra...Links, languages and semantics: linked data approaches in The European Libra...
Links, languages and semantics: linked data approaches in The European Libra...
 
The Europeana Data Model - TPDL2018
The Europeana Data Model - TPDL2018The Europeana Data Model - TPDL2018
The Europeana Data Model - TPDL2018
 
Building an ecosystem of networked references
Building an ecosystem of networked referencesBuilding an ecosystem of networked references
Building an ecosystem of networked references
 
Data scale and diversity issues at Europeana
Data scale and diversity issues at EuropeanaData scale and diversity issues at Europeana
Data scale and diversity issues at Europeana
 
Alexandria winer20100623
Alexandria winer20100623Alexandria winer20100623
Alexandria winer20100623
 
Eun lre brussels_winer20100616
Eun lre brussels_winer20100616Eun lre brussels_winer20100616
Eun lre brussels_winer20100616
 
The Europeana Strategy and Linked Open Data
The Europeana Strategy and Linked Open DataThe Europeana Strategy and Linked Open Data
The Europeana Strategy and Linked Open Data
 
Europeana and open data
Europeana and open dataEuropeana and open data
Europeana and open data
 
Challenges on modeling annotations in the Europeana Sounds project
Challenges on modeling annotations in the Europeana Sounds projectChallenges on modeling annotations in the Europeana Sounds project
Challenges on modeling annotations in the Europeana Sounds project
 
Challenges on modeling annotations in the europeana sounds project
Challenges on modeling annotations in the europeana sounds projectChallenges on modeling annotations in the europeana sounds project
Challenges on modeling annotations in the europeana sounds project
 
Fondly Collisions: Archival hierarchy and the Europeana Data Model
Fondly Collisions: Archival hierarchy and the Europeana Data Model   Fondly Collisions: Archival hierarchy and the Europeana Data Model
Fondly Collisions: Archival hierarchy and the Europeana Data Model
 
LDBC 19 November 2013
LDBC 19 November 2013  LDBC 19 November 2013
LDBC 19 November 2013
 
3D content in Europeana: the challenges of providing access
3D content in Europeana: the challenges of providing access3D content in Europeana: the challenges of providing access
3D content in Europeana: the challenges of providing access
 
Wikidata, a target for Europeana’s semantic strategy (Glam-Wiki 2015)
Wikidata, a target for Europeana’s semantic strategy (Glam-Wiki 2015)Wikidata, a target for Europeana’s semantic strategy (Glam-Wiki 2015)
Wikidata, a target for Europeana’s semantic strategy (Glam-Wiki 2015)
 

More from Antoine Isaac

Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021Antoine Isaac
 
Le Cadre de publication d'Europeana
Le Cadre de publication d'EuropeanaLe Cadre de publication d'Europeana
Le Cadre de publication d'EuropeanaAntoine Isaac
 
Metadata aggregation of IIIF Resources at Europeana: status and plans
Metadata aggregation of IIIF Resources at Europeana: status and plansMetadata aggregation of IIIF Resources at Europeana: status and plans
Metadata aggregation of IIIF Resources at Europeana: status and plansAntoine Isaac
 
IIIF and the Europeana mission
IIIF and the Europeana missionIIIF and the Europeana mission
IIIF and the Europeana missionAntoine Isaac
 
Lightweight rights modeling and linked data publication for online cultural h...
Lightweight rights modeling and linked data publication for online cultural h...Lightweight rights modeling and linked data publication for online cultural h...
Lightweight rights modeling and linked data publication for online cultural h...Antoine Isaac
 
Isaac - W3C Data on the Web Best Practices - Data Vocabularies
Isaac - W3C Data on the Web Best Practices - Data VocabulariesIsaac - W3C Data on the Web Best Practices - Data Vocabularies
Isaac - W3C Data on the Web Best Practices - Data VocabulariesAntoine Isaac
 
Modelling and exchanging annotations
Modelling and exchanging annotationsModelling and exchanging annotations
Modelling and exchanging annotationsAntoine Isaac
 
Modelling annotations for Europeana and related projects - DARIAH-EU WS
Modelling annotations for Europeana and related projects - DARIAH-EU WSModelling annotations for Europeana and related projects - DARIAH-EU WS
Modelling annotations for Europeana and related projects - DARIAH-EU WSAntoine Isaac
 
Classification schemes, thesauri and other Knowledge Organization Systems - a...
Classification schemes, thesauri and other Knowledge Organization Systems - a...Classification schemes, thesauri and other Knowledge Organization Systems - a...
Classification schemes, thesauri and other Knowledge Organization Systems - a...Antoine Isaac
 
Multilingual challenges for accessing digitized culture online - Riga Summit 15
Multilingual challenges for accessing digitized culture online - Riga Summit 15Multilingual challenges for accessing digitized culture online - Riga Summit 15
Multilingual challenges for accessing digitized culture online - Riga Summit 15Antoine Isaac
 
Enrichment and Europeana
Enrichment and EuropeanaEnrichment and Europeana
Enrichment and EuropeanaAntoine Isaac
 
Challenges for the Language Technology Industry
Challenges for the Language Technology IndustryChallenges for the Language Technology Industry
Challenges for the Language Technology IndustryAntoine Isaac
 

More from Antoine Isaac (14)

Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021
 
Le Cadre de publication d'Europeana
Le Cadre de publication d'EuropeanaLe Cadre de publication d'Europeana
Le Cadre de publication d'Europeana
 
Metadata aggregation of IIIF Resources at Europeana: status and plans
Metadata aggregation of IIIF Resources at Europeana: status and plansMetadata aggregation of IIIF Resources at Europeana: status and plans
Metadata aggregation of IIIF Resources at Europeana: status and plans
 
IIIF and the Europeana mission
IIIF and the Europeana missionIIIF and the Europeana mission
IIIF and the Europeana mission
 
Lightweight rights modeling and linked data publication for online cultural h...
Lightweight rights modeling and linked data publication for online cultural h...Lightweight rights modeling and linked data publication for online cultural h...
Lightweight rights modeling and linked data publication for online cultural h...
 
Europeana et IIIF
Europeana et IIIFEuropeana et IIIF
Europeana et IIIF
 
Isaac - W3C Data on the Web Best Practices - Data Vocabularies
Isaac - W3C Data on the Web Best Practices - Data VocabulariesIsaac - W3C Data on the Web Best Practices - Data Vocabularies
Isaac - W3C Data on the Web Best Practices - Data Vocabularies
 
Europeana APIs
Europeana APIsEuropeana APIs
Europeana APIs
 
Modelling and exchanging annotations
Modelling and exchanging annotationsModelling and exchanging annotations
Modelling and exchanging annotations
 
Modelling annotations for Europeana and related projects - DARIAH-EU WS
Modelling annotations for Europeana and related projects - DARIAH-EU WSModelling annotations for Europeana and related projects - DARIAH-EU WS
Modelling annotations for Europeana and related projects - DARIAH-EU WS
 
Classification schemes, thesauri and other Knowledge Organization Systems - a...
Classification schemes, thesauri and other Knowledge Organization Systems - a...Classification schemes, thesauri and other Knowledge Organization Systems - a...
Classification schemes, thesauri and other Knowledge Organization Systems - a...
 
Multilingual challenges for accessing digitized culture online - Riga Summit 15
Multilingual challenges for accessing digitized culture online - Riga Summit 15Multilingual challenges for accessing digitized culture online - Riga Summit 15
Multilingual challenges for accessing digitized culture online - Riga Summit 15
 
Enrichment and Europeana
Enrichment and EuropeanaEnrichment and Europeana
Enrichment and Europeana
 
Challenges for the Language Technology Industry
Challenges for the Language Technology IndustryChallenges for the Language Technology Industry
Challenges for the Language Technology Industry
 

Recently uploaded

How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integrationmarketing932765
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...itnewsafrica
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observabilityitnewsafrica
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxfnnc6jmgwh
 

Recently uploaded (20)

How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
 

Designing a multilingual knowledge graph - DCMI2018

  • 1. Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage Some Challenges and Solutions Valentine Charles, Hugo Manguinhas, Antoine Isaac - Europeana Foundation Nuno Freire - INESC-ID Sergiu Gordea - AIT DCMI Conference 2018
  • 2. What is Europeana? CC BY-SA We aggregate metadata: •  From all EU countries •  ~3,700 galleries, libraries, archives and museums •  More than 58M objects •  In more than 40 languages •  Huge amount of references to places, agents, concepts, time periods Europeana aggregation infrastructure Europeana| CC BY-SA The Platform for Europe’s Digital Cultural Heritage Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
  • 3. Europeana Linked Data Strategy Our lines of work CC BY-SA •  The Europeana Data Model (EDM) offers a base for linking metadata •  We apply automatic enrichment to link object metadata to reference datasets •  We encourage data providers to contribute their own links to vocabularies •  We encourage alignment activities between domain vocabularies Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
  • 4. Europeana Linked Data Strategy LOD Vocabularies currently recognized by Europeana in providers' metadata CC BY-SA Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage Vocabulary URL MIMO Concepts http://www.mimo-db.eu/ MIMO Instrument makers http://www.mimo-db.eu/ The Getty - Art & Architecture Thesaurus (AAT) http://vocab.getty.edu/ The Getty - Union List of Artist Names (ULAN) http://vocab.getty.edu/ Virtual International Authority File (VIAF) http://viaf.org/viaf/ Geonames http://sws.geonames.org/ IconClass http://iconclass.org/ Gemeinsame Normdatei (GND) http://d-nb.info/gnd Israel Museum Jerusalem Concepts http://www.imj.org.il/imagine/thesaurus/objects/ Partage Plus concepts http://partage.vocnet.org/ data.europeana.eu WWI Concepts from Library of Congress Subject Headings (LCSH) http://data.europeana.eu/concept/loc Europeana Sounds Genres http://data.europeana.eu/concept/soundgenres/ EAGLE Material & Object Type http://www.eagle-network.eu/voc/ DISMARC Formats & Genres http://purl.org/dismarc/ns/ UDC http://udcdata.info/rdf/ UNESCO Thesaurus http://vocabularies.unesco.org/thesaurus/
  • 5. Europeana Linked Data Strategy A strategy for Entities CC BY-SA We are building an "Entity Collection" •  A service that acts as a centralized point of reference and access to data about contextual entities: places, agents (persons and organizations), concepts... •  Caching and curating data from the wider Linked Open Data cloud •  A sort of Europeana "knowledge graph" with an API •  A service can be re-used by everyone in our community Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
  • 6. Uses cases for the Entity Collection (1/2) CC BY-SA Improve user experience on Europeana services ●  Findability: users can search with and for people, places and subjects, not only objects. In many more languages, and with less ambiguity ●  Contextualization: users see contextual information about cultural heritage objects. Entity Pages group and present all assertions about an entity ●  Exploration: Browsing along relationships between objects and entities and between entities Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage Semantic auto- completion Entity Pages Entity based facets Europeana Food & Drink Project
  • 7. Uses cases for the Entity Collection (2/2) CC BY-SA Crowdsourcing ●  Objects can be annotated with references to entities of their context Automatic enrichment of providers' metadata ●  A controlled vocabulary to help recognize references to entities Republication for Re-use ●  Entities can be republished as an open source to the community Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage Semantic and Metadata annotations Pundit Annotation Client from Digital Manuscripts to Euiropeana (DM2E)
  • 8. Related work CC BY-SA •  Knowledge graph creation and maintenance: Google's Knowledge Graph, DBpedia, Wikidata, BabelNet, VIAF, Entity Facts, SNAC, Europeana Food and Drinks •  (Vocabulary) web services: STW, DigitalNZ's Concept API... •  W3C's best practices for publishing (linked) data on the Web •  Data alignment tools and methods •  Semantic discovery services: Worldcat Identities, etc. Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
  • 9. In this presentation CC BY-SA •  Challenges, decisions and results for •  Building a knowledge graph for Europeana •  Accessing it for exploitation •  Disclaimers: •  The focus is operational, this is not groundbreaking research! Sometimes we will state the obvious... •  Europeana's Entity Collection is still work in progress! Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
  • 10. The things we'll be talking about Entity Collection processes in Europeana CC BY-SA Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
  • 11. Building a knowledge graph for Europeana France, Public Domain 1914, National Library of France Agence de presse Meurisse Concours de cycles nautiques sur le lac d’Enghien : Berregent piloté par Austerling
  • 12. Selecting data sources CC BY-SA An intellectual effort by data experts, leveraging the following criteria: •  Availability and access: open license, published on the web as linked data •  Granularity and coverage: similar or complementary sets of entities, multilingual data, helping to answer key user needs for Europeana's CH collections. Too generic datasets can create too much ambiguity for the simple processes we have (e.g. enrichment) •  Size: larger vocabularies are useful, but sometimes create too much ambiguity •  Quality: intrinsic aspects like correctness of representation (data structures) •  Connectivity: good data sources are well-connected internally and externally to other datasets An approach based on pivot (e.g. DBpedia, Wikidata) and specialized data sources (e.g. AAT) is likely to work well if links can be made between them Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
  • 13. Statement selection and mappings CC BY-SA Conceptually distinct steps, which may be grouped at implementation time: •  Selection of entities within one dataset •  E.g. selecting artists (excluding pop stars) from DBpedia •  Mappings between model of data source and model of KG •  Made easier if both use standards, e.g. SKOS or FOAF - both re-used by the Europeana Data Model •  Selection of relevant statements •  E.g. filtering unwanted languages larger vocabularies are useful, but sometimes create too much ambiguity Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
  • 14. Example: integrating DBpedia resources CC BY-SA Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
  • 15. Data integration and reconciliation CC BY-SA •  Import of entities needs integrating old statements with new ones if entities are redundant recognized to be the same (via available sameAs- like equivalence statements) •  Several options for data integration on entities: a.  unification: lumping all statements together - possibly leading to inconsistencies or cardinality constraints b.  first come, first serve: only adding statements from most recent sources when it's possible, i.e. not violating cardinality constraints c.  most representative: possibly replacing old statements by new ones if they come from a "prefered" source d.  differentiated most representative: grouping data sources in sets that have different level of preference, so as to apply c between these sets. b and c may be applied within the sets. Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
  • 16. Alignment and curation CC BY-SA •  Alignment aims at recognizing more equivalent entities in the data sources to be integrated •  It can use automatic or semi-automatic tools like Wikidata Mix'n'Match or CultuurLink, both experimented in Europeana's context •  It requires a lot of effort/expertise at the scale we're considering! •  Curators should be able to edit data to maintain integrity in the Entity Collection or elsewhere downstream •  Removing, editing, adding statements or deprecating entities, e.g. to prevent ambiguities that lead to wrong enrichment of object metadata Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
  • 17. Data currently in the Entity Collection CC BY-SA Mostly corresponding to a selection made for Europeana's Semantic Enrichment •  Places a subset of Geonames, corresponding to places which are part of European countries and of some specific feature classes. •  Agents a subset of DBpedia corresponding to most of the instances of dbp:Artist with some exceptions, and integrated from 49 DBpedia language editions. •  Concepts a subset of DBpedia corresponding to a selection concepts matching the needs from Europeana Collections (e.g., WWI battles). Europeana Sounds music genres (obtained from Wikidata) Photo Consortium's photography vocabulary •  Organizations Extracted from Europeana's CRM and aligned to Wikidata when possible 216,302 resources 1,572 resources 165,005 resources 1,077 resources Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
  • 18. Multilingual coverage of the Entity Collection And its contribution to automatic enrichment CC BY-SA Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage Entities effectively used to enrich Europeana Objects Entities present in the Entity Collection https://docs.google.com/document/d/1Nek_SPDtIR3waYwwdgRHY1YiTCj3SK8Eh9xYpnSkFkY
  • 19. France, Public Domain 1932, National Library of France Agence de presse Mondial Photo-Presse. Tournoi royal de motos à Londres : changement d'une roue de side-car en marche Accessing and exploiting the data in the Entity Collection
  • 20. Entity URIs CC BY-SA •  Step 0 for publishing our Linked Data on the web: minting URIs for our entities •  It's always a difficult choice between length, ease of persistence, human- friendliness... •  We've looked at best practices and consulted our community •  Chosen pattern is http://data.europeana.eu/{entity_class}/ {scheme}/{localID} •  Where localID is a sequential identifier •  For example, the agent Leonardo da Vinci: http://data.europeana.eu/ agent/base/146741 Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
  • 21. The Entity API CC BY-SA •  Still on alpha state •  Looking at best practices for Linked Data and JSON-LD •  Available documentation for this API at: https://pro.europeana.eu/resources/apis/entity Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
  • 22. The Entity API - entity look-up/resolving CC BY-SA •  Linked Data content negotiation at data.europeana.eu (for JSON-LD and HTML) •  API call for JSON-LD via entity identifiers •  https://www.europeana.eu/api/entities/[entity_class]/base/ [ID].jsonld •  resolve method for getting data on an Entity for an external URI (that appear in sameAs-like equivalence statement) •  https://www.europeana.eu/api/entities/resolve?uri=[URI] •  We created a specific JSON-LD context to make the data easier to consume for web developers, e.g. hiding RDF namespace abbreviations Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
  • 23. The Entity Collection DBpedia resource for “Mozart” in our data CC BY-SA Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage Coreference links to 6 other datasets (e.g. Freebase, Wikidata) Inter-linking information… still need to switch references to link to Europeana Entities Preferred labels for 48 languages
  • 24. The Entity API - discovery CC BY-SA •  Suggestion of entities for a type string, used the auto-complete function in Europeana's search box •  https://www.europeana.eu/api/entities/suggest •  For example: https://www.europeana.eu/api/entities/suggest? wskey=apidemo&text=leo&type=agent •  Ranking of suggestions is based on: •  Europeana relevance: number of Europeana objects whose description contains one of the entity's labels •  Popularity as computed in the Wikidata pagerank (Diefenbach & Thalhammer, 2018) •  Coming soon: general entity search, based on free (Solr-style) querying on metadata fields for entities Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage
  • 25. Entity API - suggest method CC BY-SA Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage /entities/suggest.json?text=neo
  • 26. Conclusions CC BY-SA •  We've made enough progress to release a first version of the Entity Collection and its API, used in Europeana's production services. •  But there are still challenges and decisions to ensure consistency and relevance over time: •  Expand data coverage (and test extensibility) with new data sources for, e.g., events •  Continue elaborating and testing data integration and filtration strategies •  Employ the EC to better enrich Europeana object metadata •  Enhance discoverability, especially for search engines, e.g. via Schema.org publication Designing a Multilingual Knowledge Graph as a Service for Cultural Heritage