Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.
Entitifying Europeana: Building an
ecosystem of networked references
for Cultural Objects
Hugo Manguinhas, Valentine Charl...
What is Europeana?
CC BY-SA
We aggregate metadata:
• From all EU countries
• ~3,500 galleries, libraries,
archives and mus...
Europeana Linked Data Strategy
Our efforts and lines of work
CC BY-SA
• Europeana Data Model (EDM) offers a base for linki...
Europeana Linked Data Strategy
A strategy for Entities
CC BY-SA
As a cornerstone for our strategy we are building an
"Enti...
Europeana Linked Data Strategy
Motivation
CC BY-SA
• Improve user experience
• Support better ways of searching and naviga...
The Entity Collection
Use Cases
CC BY-SA
Europeana Collections Portal
● Findability: users can look for entities, not
only...
The Entity Collection
What can it enable?
CC BY-SA
Semantic auto-
completion
Semantic and
Metadata annotations
Entity Page...
The Entity Collection
How do we choose our target vocabularies?
CC BY-SA
As defined in the recent Europeana Tech Task Forc...
The Entity Collection
Which target vocabularies are we using?
CC BY-SA
For historical reasons, the target vocabularies cor...
The Entity Collection
Contribution to multilingual coverage
CC BY-SA
SWIB16 - Entitifying Europeana: Building an ecosystem...
The Entity Collection
Are these target vocabularies enough?
CC BY-SA
• Not enough coreferencing information to other vocab...
The Entity Collection
Challenges
CC BY-SA
Investigate and design strategies for:
• Integrating new vocabularies that can f...
The Entity Collection
Our roadmap for the next years
CC BY-SA
• Mint Europeana URIs for Entities and update internal
refer...
The Entity Collection
Alpha release of our new Entity API
CC BY-SA
SWIB16 - Entitifying Europeana: Building an ecosystem o...
The Entity Collection
DBpedia resource for “Mozart” in our data
CC BY-SA
SWIB16 - Entitifying Europeana: Building an ecosy...
The Entity Collection
Entity API - suggest method
CC BY-SA
SWIB16 - Entitifying Europeana: Building an ecosystem of networ...
Conclusion
CC BY-SA
• A Strategy for Entities is a “must” for Europeana
• There is no “one fits all” vocabulary
• We have ...
Thank you!
hugo.manguinhas@europeana.eu
Upcoming SlideShare
Loading in …5
×

Entitifying europeana - Building an ecosystem of networked references for cultural objects

905 views

Published on

In the past years, the number of references to places, peoples, concepts and time in Europeana’s metadata has grown considerably and with it new challenges have arisen. These contextual entities are provided as references as part of the metadata delivered to Europeana or selected by Europeana for semantic enrichment or crowdsourcing. However their diversity in terms of semantic and multilingual coverage and their very variable quality make it difficult for Europeana to fully exploit this rich information. Pursuing its efforts towards the creation of a semantic network around cultural heritage objects and intending in this way to further enhance its data and retrieval across languages, Europeana is now working on a long term strategy for entities. The cornerstone of this strategy is a “semantic entity collection” that acts as a centralised point of reference and access to data about contextual entities, which is based on the cached and curated data from the wider Linked Open Data cloud. While Europeana will have to address the technical challenges of integration and representation of the various sources, it will also have to define a content and curation plan for its maintenance. This presentation will highlight the design principles of the Europeana Entity Collection and its challenges. We will detail our plans regarding its curation and maintenance while providing the first examples of its use in Europeana users' services. We will also reflect on how our goals can fit our partners' processes and how can organizations like national cultural heritage portals and smaller institutions contribute to (and benefit from) such a project as a network.

Published in: Technology
  • Login to see the comments

Entitifying europeana - Building an ecosystem of networked references for cultural objects

  1. 1. Entitifying Europeana: Building an ecosystem of networked references for Cultural Objects Hugo Manguinhas, Valentine Charles, Antoine Isaac, Tim Hill| Europeana Foundation
  2. 2. What is Europeana? CC BY-SA We aggregate metadata: • From all EU countries • ~3,500 galleries, libraries, archives and museums • More than 53M objects • In about 50 languages • Huge amount of references to places, agents, concepts, time Europeana aggregation infrastructure Europeana| CC BY-SA The Platform for Europe’s Digital Cultural Heritage SWIB16 - Entitifying Europeana: Building an ecosystem of networked references for Cultural Objects
  3. 3. Europeana Linked Data Strategy Our efforts and lines of work CC BY-SA • Europeana Data Model (EDM) offers a base for linking data • We apply automatic enrichment to link source data to reference data • We encourage data providers to contribute their own vocabularies so that we can benefit from data links made at data providers’ level • We encourage alignment activities between domain vocabularies SWIB16 - Entitifying Europeana: Building an ecosystem of networked references for Cultural Objects Significant progress have been made, most of it presented in past SWIB!
  4. 4. Europeana Linked Data Strategy A strategy for Entities CC BY-SA As a cornerstone for our strategy we are building an "Entity Collection" • A service that acts as a centralized point of reference and access to data about contextual entities • Caching and curating data from the wider Linked Open Data cloud • A sort of Europeana "knowledge graph" SWIB16 - Entitifying Europeana: Building an ecosystem of networked references for Cultural Objects
  5. 5. Europeana Linked Data Strategy Motivation CC BY-SA • Improve user experience • Support better ways of searching and navigating through the collections, eliminating ambiguity and clarifying the meaning of descriptions • Adapt better to the language of the user • by improving the interlinking of data • Brings more context to the objects • Alleviates polysemy issues • Expands language coverage • Contributes to build a web of data ('knowledge graph') that third parties can use to improve their users' experience SWIB16 - Entitifying Europeana: Building an ecosystem of networked references for Cultural Objects
  6. 6. The Entity Collection Use Cases CC BY-SA Europeana Collections Portal ● Findability: users can look for entities, not only records (Entity-Based Search) ● Understandability: Entity Pages group and present all assertions about an entity ● Exploration: Navigation along relationships becomes possible Crowdsourcing ● Objects can be annotated with references to entities ● A controlled vocabulary for client applications Enrichment of Provider’s Data ● A controlled vocabulary to help identify named references to entities Republication for Re-use ● Entities can be republished as an open source to the community Entity Collection SWIB16 - Entitifying Europeana: Building an ecosystem of networked references for Cultural Objects
  7. 7. The Entity Collection What can it enable? CC BY-SA Semantic auto- completion Semantic and Metadata annotations Entity Pages Entity based facets SWIB16 - Entitifying Europeana: Building an ecosystem of networked references for Cultural Objects Google Knowledge CardPundit Annotation ClientFood & Drink Project
  8. 8. The Entity Collection How do we choose our target vocabularies? CC BY-SA As defined in the recent Europeana Tech Task Force on enrichment and evaluation (presented last year), we consider the following criteria when selecting a vocabulary: • Properly documented and supported by a community • Technically available on the web according to the Linked Data best practices and recipes • Available under an open licence • Multilingual • Abide to a minimal ontological commitment principle • Apply the best practices and standards for the representation, structure and description of vocabularies • Well-connected internally and externally to other vocabularies (preferably spine vocabularies) SWIB16 - Entitifying Europeana: Building an ecosystem of networked references for Cultural Objects
  9. 9. The Entity Collection Which target vocabularies are we using? CC BY-SA For historical reasons, the target vocabularies correspond to the ones being used for Semantic Enrichment (as of November 2016): • Places a subset of Geonames, corresponding to places which are part of European countries and of some specific feature classes. • Agents a subset of DBpedia corresponding to most of the instances of dbp:Artist with some exceptions, and integrated from 49 DBpedia language editions. • Concepts a subset of DBpedia corresponding to a handful of concepts matching the needs from Europeana Collections. • Time Spans The chronological periods from SemiumTime. 214,307 resources 274 resources 165,008 resources 2,566 resources SWIB16 - Entitifying Europeana: Building an ecosystem of networked references for Cultural Objects
  10. 10. The Entity Collection Contribution to multilingual coverage CC BY-SA SWIB16 - Entitifying Europeana: Building an ecosystem of networked references for Cultural Objects Entities effectively used to enrich Europeana Objects Entities present in the Entity Collection
  11. 11. The Entity Collection Are these target vocabularies enough? CC BY-SA • Not enough coreferencing information to other vocabularies • particularly to the ones we receive from data providers (e.g. musical instruments, MIMO) • Labels and values are not always accurate and normalized • need for better reference data (e.g. VIAF) • Missing relevant information • e.g. roles and professions • Need to expand coverage to other types of entities • namely Works and Events SWIB16 - Entitifying Europeana: Building an ecosystem of networked references for Cultural Objects
  12. 12. The Entity Collection Challenges CC BY-SA Investigate and design strategies for: • Integrating new vocabularies that can further improve • entity descriptions and multilingual coverage (e.g. VIAF) • linking between entities (e.g. Wikidata) • Integrating alignments, in particular: • links between local/domain vocabularies to pivot vocabularies • Supporting manual curation of existing and new entities • Keeping up-to-date the information collected from external sources SWIB16 - Entitifying Europeana: Building an ecosystem of networked references for Cultural Objects
  13. 13. The Entity Collection Our roadmap for the next years CC BY-SA • Mint Europeana URIs for Entities and update internal references • Make entity services and data available via an API • Make use of the API in the Collections Portal • Implement support for new vocabularies and entity types SWIB16 - Entitifying Europeana: Building an ecosystem of networked references for Cultural Objects ✔ ✔
  14. 14. The Entity Collection Alpha release of our new Entity API CC BY-SA SWIB16 - Entitifying Europeana: Building an ecosystem of networked references for Cultural Objects More methods will come, for: Creation, Update and Delete; URI resolution to Europeana Entities
  15. 15. The Entity Collection DBpedia resource for “Mozart” in our data CC BY-SA SWIB16 - Entitifying Europeana: Building an ecosystem of networked references for Cultural Objects Coreference links to 6 other datasets (e.g. Freebase, Wikidata) Inter-linking information… still need to switch references to link to Europeana Entities Preferred labels for 48 languages
  16. 16. The Entity Collection Entity API - suggest method CC BY-SA SWIB16 - Entitifying Europeana: Building an ecosystem of networked references for Cultural Objects /entity/suggest.json?text=neo&lang=en&rows=6
  17. 17. Conclusion CC BY-SA • A Strategy for Entities is a “must” for Europeana • There is no “one fits all” vocabulary • We have a long way to go… ...but we are making progress SWIB16 - Entitifying Europeana: Building an ecosystem of networked references for Cultural Objects
  18. 18. Thank you! hugo.manguinhas@europeana.eu

×