SlideShare a Scribd company logo
1 of 25
Download to read offline
ZBW is member of the Leibniz Association
Wikidata as opportunity for special collections:
the 20th Century Press Archives use case
Joachim Neubert
ZBW – Leibniz Information Centre for Economics, Kiel/Hamburg
LIBER 2019, Linked Open Data Working Group
26.06.2019, Dublin (Ireland)
Agenda
1. What are we dealing with?
2. Why Wikidata?
3. Transfering metadata
a. Link to existing items
b. Create missing items
c. Add metadata to the items
4. Using the data
5. Future work
Page 2
https://commons.wikimedia.org/wiki/File:ZBW-Personenarchiv_2015.jpg by Max-Michael Wannags
What are we dealing with?
Page 4
What are we dealing with?
Historic Press Archives, founded in 1909 (Hamburg) and 1914 (Kiel)
• Some material dating back to 1826
• Collections closed in 2005
Thematic dossiers covering
 Persons
• Companies
• Products
• General subjects and events
Page 5
Current state
Former DFG funded project, resulting in
• Digitized roll films (material before 1949)
• Relational database about dossiers, often with GND ID
• Big filesystem (containing more than 2m pages)
• Accessible via
• custom application “Pressemappe 20. Jahrhundert” and
• DFG-Viewer (METS/MODS files, per dossier)
All metadata available under CC 0 license
Long term sustainability?
Specialized
application for
discovery and
access,
architecturally
outdated and
expensive to
maintain
Page 6
http://webopac.hwwa.de/pressemappe20
Why Wikidata?
Page 7
Wikidata basics
• Knowledge base for Wikimedia projects
• All kinds of entities: concepts, places, people, works …
• Editable and extensible by everyone
• Data available under CC0
• http://query.wikidata.org/ (SPARQL)
• JSON API & database dumps
• Sustainable foundation for long-term available data
Page 8
Wikidata statements
Page 9
Page 10
Linking mechanism: external identifiers
• Property value: unique IDs from external database
• + URL stub in the property definition („formatter URL“)
• Almost 4,000 external identifier properties
• Examples:
• GND
• proteins
• African plants
• Swedish cultural heritage objects
Page 11
Transfering collection metadata to Wikidata
1. What are we dealing with?
2. Why Wikidata?
3. Transfering metadata
a. Link to existing items
b. Create missing items
c. Add metadata to the items
4. Using the data
5. Future work
Page 12
Wikidata property P4293 (PM20 folder ID)
• Property proposal and discussion within the community
Additional prerequisite:
• RDF representation of PM20 contents and a SPARQL endpoint,
allowing federated queries with the Wikidata endpoint
Page 13
Link to existing items
• Automatically inserted links derived from GND IDs
• Tool-supported manual linking
• Wikidata‘s Mix-n-match (great for persons, crowd-sourced)
• custom tools (like this)
• others (OpenRefine, …)
 ~ 95% of PM20 person folders linked by mid-June 2019!
Page 14
Checking proposed matches in Mix‘n‘match
Seite 15
Add missing items to Wikidata - automatically
Recommendations for item creation:
• Pay attention to Wikidata’s notability criteria
• Explain your plan and ask for feedback in the Wikidata project chat
• Apply for a bot account to make mass edits (example)
• Source every statement
Process:
• Transform query results to QuickStatements input file
• Copy & paste into QuickStatements
Page 16
QuickStatements input from PM20
• using a federated query to exclude existing Wikidata items
• query output transformed by a script
Page 17
Added Wikidata item
Page 18
 all 5200 PM20 person folders now linked from Wikidata!
Add metadata to Wikidata items
e.g., for all persons in Wikidata with PM20 ID and the PM20 “field of
activity”: “economics” or “business economics”, insert the according
occupation into the WD person item (script, query)
Page 19
Using the data on Wikidata
Page 20
„Proof of concept“ example: Map of economists
Page 21
Query link (on Wikidata SPARQL endpoint – see also list of all PM20 economists)
Display via DFG Viewer link
Page 22
Future work
• Build community support for further extension of the PM20 metadata
• Create an item structure for the subject and ware archives, and link
the folders (~ 12,000)
• Link/create items for company folders (~ 8,000)
• Create a static HTML site with one page per folder (+ additional
navigation pages) on the PM20 web site which hosts the digitized
images (= permanent reference)
• Optionally, create additional Wikidata-based searching/browsing
facilities
• Retire the present ColdFusion application
Page 23
Wikiproject 20th Century Press Archives
Page 24
https://www.wikidata.org/wiki/Wikidata:WikiProject_20th_Century_Press_Archives
Page 25
Thanks for listening!
Joachim Neubert
ZBW – Leibniz Information Centre for Economics
j.neubert@zbw.eu
http://zbw.eu/labs
https://www.wikidata.org/wiki/User:Jneubert

More Related Content

What's hot

DMP exercise: linking data management activities to services - EUDAT Summer ...
DMP exercise: linking data management activities to services  - EUDAT Summer ...DMP exercise: linking data management activities to services  - EUDAT Summer ...
DMP exercise: linking data management activities to services - EUDAT Summer ...EUDAT
 
Mantas Zimnickas - How Open is Lithuanian Government data? atviriduomenys.lt
Mantas Zimnickas - How Open is Lithuanian Government data? atviriduomenys.lt Mantas Zimnickas - How Open is Lithuanian Government data? atviriduomenys.lt
Mantas Zimnickas - How Open is Lithuanian Government data? atviriduomenys.lt Aidis Stukas
 
Linked Open Data (LOD) Pilot Austria
Linked Open Data (LOD) Pilot AustriaLinked Open Data (LOD) Pilot Austria
Linked Open Data (LOD) Pilot AustriaMartin Kaltenböck
 
TRAINING OBJECTIVES
TRAINING OBJECTIVESTRAINING OBJECTIVES
TRAINING OBJECTIVESFAO
 
Linked data activities in the Deutsche Nationalbibliothek
Linked data activities in the Deutsche NationalbibliothekLinked data activities in the Deutsche Nationalbibliothek
Linked data activities in the Deutsche NationalbibliothekLars G. Svensson
 
What can libraries do for researchers?
What can libraries do for researchers?What can libraries do for researchers?
What can libraries do for researchers?Michael Day
 
Linked Data - Overview and Potentials
Linked Data - Overview and PotentialsLinked Data - Overview and Potentials
Linked Data - Overview and PotentialsTobias Bürger
 
Vassilios Peristeras: From Open to Linked Government Data: (European Commissi...
Vassilios Peristeras: From Open to Linked Government Data: (European Commissi...Vassilios Peristeras: From Open to Linked Government Data: (European Commissi...
Vassilios Peristeras: From Open to Linked Government Data: (European Commissi...FIA2010
 
Building a Collection of the Historical UK Web for scholarly use
Building a Collection of the Historical UK Web for scholarly useBuilding a Collection of the Historical UK Web for scholarly use
Building a Collection of the Historical UK Web for scholarly useALISS
 
Web at 25 - W3C/Ontos Event on May 22, 2014. Agenda of the day
Web at 25 - W3C/Ontos Event on May 22, 2014. Agenda of the dayWeb at 25 - W3C/Ontos Event on May 22, 2014. Agenda of the day
Web at 25 - W3C/Ontos Event on May 22, 2014. Agenda of the dayAI4BD GmbH
 
20yrs:1998 Society of Archivists Conference
20yrs:1998 Society of Archivists Conference20yrs:1998 Society of Archivists Conference
20yrs:1998 Society of Archivists ConferenceNeil Beagrie
 
digitalHusserl. A Platform for digital access to the Husserl archives. (Roxan...
digitalHusserl. A Platform for digital access to the Husserl archives. (Roxan...digitalHusserl. A Platform for digital access to the Husserl archives. (Roxan...
digitalHusserl. A Platform for digital access to the Husserl archives. (Roxan...LIBIS
 
Bingham, De Wild & Aasman Presentation
Bingham, De Wild & Aasman PresentationBingham, De Wild & Aasman Presentation
Bingham, De Wild & Aasman PresentationWARCnet
 
Presenting /ch/open, Parldigi and RCDS at first European Software Freedom Pol...
Presenting /ch/open, Parldigi and RCDS at first European Software Freedom Pol...Presenting /ch/open, Parldigi and RCDS at first European Software Freedom Pol...
Presenting /ch/open, Parldigi and RCDS at first European Software Freedom Pol...Matthias Stürmer
 
Copy of a presentation given by Professor Sir Nigel Shadbolt
Copy of a presentation given by Professor Sir Nigel Shadbolt Copy of a presentation given by Professor Sir Nigel Shadbolt
Copy of a presentation given by Professor Sir Nigel Shadbolt Ben Proctor
 
From WG2 Datathon to AWAC2. Exploring IIPC special COVID collection thanks to...
From WG2 Datathon to AWAC2. Exploring IIPC special COVID collection thanks to...From WG2 Datathon to AWAC2. Exploring IIPC special COVID collection thanks to...
From WG2 Datathon to AWAC2. Exploring IIPC special COVID collection thanks to...WARCnet
 

What's hot (18)

DMP exercise: linking data management activities to services - EUDAT Summer ...
DMP exercise: linking data management activities to services  - EUDAT Summer ...DMP exercise: linking data management activities to services  - EUDAT Summer ...
DMP exercise: linking data management activities to services - EUDAT Summer ...
 
Approach to Open Data in Vienna
Approach to Open Data in ViennaApproach to Open Data in Vienna
Approach to Open Data in Vienna
 
Mantas Zimnickas - How Open is Lithuanian Government data? atviriduomenys.lt
Mantas Zimnickas - How Open is Lithuanian Government data? atviriduomenys.lt Mantas Zimnickas - How Open is Lithuanian Government data? atviriduomenys.lt
Mantas Zimnickas - How Open is Lithuanian Government data? atviriduomenys.lt
 
Linked Open Data (LOD) Pilot Austria
Linked Open Data (LOD) Pilot AustriaLinked Open Data (LOD) Pilot Austria
Linked Open Data (LOD) Pilot Austria
 
TRAINING OBJECTIVES
TRAINING OBJECTIVESTRAINING OBJECTIVES
TRAINING OBJECTIVES
 
Linked data activities in the Deutsche Nationalbibliothek
Linked data activities in the Deutsche NationalbibliothekLinked data activities in the Deutsche Nationalbibliothek
Linked data activities in the Deutsche Nationalbibliothek
 
V discoverdrupal
V discoverdrupalV discoverdrupal
V discoverdrupal
 
What can libraries do for researchers?
What can libraries do for researchers?What can libraries do for researchers?
What can libraries do for researchers?
 
Linked Data - Overview and Potentials
Linked Data - Overview and PotentialsLinked Data - Overview and Potentials
Linked Data - Overview and Potentials
 
Vassilios Peristeras: From Open to Linked Government Data: (European Commissi...
Vassilios Peristeras: From Open to Linked Government Data: (European Commissi...Vassilios Peristeras: From Open to Linked Government Data: (European Commissi...
Vassilios Peristeras: From Open to Linked Government Data: (European Commissi...
 
Building a Collection of the Historical UK Web for scholarly use
Building a Collection of the Historical UK Web for scholarly useBuilding a Collection of the Historical UK Web for scholarly use
Building a Collection of the Historical UK Web for scholarly use
 
Web at 25 - W3C/Ontos Event on May 22, 2014. Agenda of the day
Web at 25 - W3C/Ontos Event on May 22, 2014. Agenda of the dayWeb at 25 - W3C/Ontos Event on May 22, 2014. Agenda of the day
Web at 25 - W3C/Ontos Event on May 22, 2014. Agenda of the day
 
20yrs:1998 Society of Archivists Conference
20yrs:1998 Society of Archivists Conference20yrs:1998 Society of Archivists Conference
20yrs:1998 Society of Archivists Conference
 
digitalHusserl. A Platform for digital access to the Husserl archives. (Roxan...
digitalHusserl. A Platform for digital access to the Husserl archives. (Roxan...digitalHusserl. A Platform for digital access to the Husserl archives. (Roxan...
digitalHusserl. A Platform for digital access to the Husserl archives. (Roxan...
 
Bingham, De Wild & Aasman Presentation
Bingham, De Wild & Aasman PresentationBingham, De Wild & Aasman Presentation
Bingham, De Wild & Aasman Presentation
 
Presenting /ch/open, Parldigi and RCDS at first European Software Freedom Pol...
Presenting /ch/open, Parldigi and RCDS at first European Software Freedom Pol...Presenting /ch/open, Parldigi and RCDS at first European Software Freedom Pol...
Presenting /ch/open, Parldigi and RCDS at first European Software Freedom Pol...
 
Copy of a presentation given by Professor Sir Nigel Shadbolt
Copy of a presentation given by Professor Sir Nigel Shadbolt Copy of a presentation given by Professor Sir Nigel Shadbolt
Copy of a presentation given by Professor Sir Nigel Shadbolt
 
From WG2 Datathon to AWAC2. Exploring IIPC special COVID collection thanks to...
From WG2 Datathon to AWAC2. Exploring IIPC special COVID collection thanks to...From WG2 Datathon to AWAC2. Exploring IIPC special COVID collection thanks to...
From WG2 Datathon to AWAC2. Exploring IIPC special COVID collection thanks to...
 

Similar to ZBW links historic press archives to Wikidata

ESWC 2017 Tutorial Knowledge Graphs
ESWC 2017 Tutorial Knowledge GraphsESWC 2017 Tutorial Knowledge Graphs
ESWC 2017 Tutorial Knowledge GraphsPeter Haase
 
Wikidata Conference 2019 GLAM Panel - 20191025
Wikidata Conference 2019 GLAM Panel - 20191025Wikidata Conference 2019 GLAM Panel - 20191025
Wikidata Conference 2019 GLAM Panel - 20191025Beat Estermann
 
Linked Open Data Publications through Wikidata & Persistent Identification in...
Linked Open Data Publications through Wikidata & Persistent Identification in...Linked Open Data Publications through Wikidata & Persistent Identification in...
Linked Open Data Publications through Wikidata & Persistent Identification in...meemoo, Vlaams instituut voor het archief
 
Open Culture - How Wiki loves art and data - Packed
 Open Culture - How Wiki loves art and data - Packed Open Culture - How Wiki loves art and data - Packed
Open Culture - How Wiki loves art and data - PackedOpen Knowledge Belgium
 
Linked Open Data Publications through Wikidata & Persistent Identification...
Linked Open Data  Publications through  Wikidata &  Persistent Identification...Linked Open Data  Publications through  Wikidata &  Persistent Identification...
Linked Open Data Publications through Wikidata & Persistent Identification...PACKED vzw
 
20th Century Press Archives goes Wikidata
20th Century Press Archives goes Wikidata20th Century Press Archives goes Wikidata
20th Century Press Archives goes WikidataJoachim Neubert
 
Estermann Wikidata GLAM Example Projects 20170914
Estermann Wikidata GLAM Example Projects 20170914Estermann Wikidata GLAM Example Projects 20170914
Estermann Wikidata GLAM Example Projects 20170914Beat Estermann
 
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...Peter Löwe
 
LD4 conference 2020 The Use of Linked Data at the ISSN International Centre
LD4 conference 2020 The Use of Linked Data at the ISSN International CentreLD4 conference 2020 The Use of Linked Data at the ISSN International Centre
LD4 conference 2020 The Use of Linked Data at the ISSN International CentreISSN International Centre
 
Smart Data Applications powered by the Wikidata Knowledge Graph
Smart Data Applications powered by the Wikidata Knowledge GraphSmart Data Applications powered by the Wikidata Knowledge Graph
Smart Data Applications powered by the Wikidata Knowledge GraphPeter Haase
 
20th Century Press Archives goes Wikidata
20th Century Press Archives goes Wikidata20th Century Press Archives goes Wikidata
20th Century Press Archives goes WikidataJoachim Neubert
 
07 reusable padfield
07 reusable padfield07 reusable padfield
07 reusable padfieldShareCareX
 
Evolving the Web into a Global Database - Advances and Applications.
Evolving the Web into a Global Database - Advances and Applications. Evolving the Web into a Global Database - Advances and Applications.
Evolving the Web into a Global Database - Advances and Applications. Chris Bizer
 
Wikidata and performing_arts_20180116
Wikidata and performing_arts_20180116Wikidata and performing_arts_20180116
Wikidata and performing_arts_20180116Beat Estermann
 
Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)Anja Jentzsch
 
ALIADA Project. AtCult
ALIADA Project. AtCultALIADA Project. AtCult
ALIADA Project. AtCultaliada project
 
Wikidata Introductory Workshop
Wikidata Introductory WorkshopWikidata Introductory Workshop
Wikidata Introductory WorkshopBeat Estermann
 
BESOCIAL A Knowledge Graph for Social Media Archiving
BESOCIAL A Knowledge Graph for Social Media ArchivingBESOCIAL A Knowledge Graph for Social Media Archiving
BESOCIAL A Knowledge Graph for Social Media ArchivingSven Lieber
 

Similar to ZBW links historic press archives to Wikidata (20)

ESWC 2017 Tutorial Knowledge Graphs
ESWC 2017 Tutorial Knowledge GraphsESWC 2017 Tutorial Knowledge Graphs
ESWC 2017 Tutorial Knowledge Graphs
 
Wikidata Conference 2019 GLAM Panel - 20191025
Wikidata Conference 2019 GLAM Panel - 20191025Wikidata Conference 2019 GLAM Panel - 20191025
Wikidata Conference 2019 GLAM Panel - 20191025
 
Linked Open Data Publications through Wikidata & Persistent Identification in...
Linked Open Data Publications through Wikidata & Persistent Identification in...Linked Open Data Publications through Wikidata & Persistent Identification in...
Linked Open Data Publications through Wikidata & Persistent Identification in...
 
Open Culture - How Wiki loves art and data - Packed
 Open Culture - How Wiki loves art and data - Packed Open Culture - How Wiki loves art and data - Packed
Open Culture - How Wiki loves art and data - Packed
 
Linked Open Data Publications through Wikidata & Persistent Identification...
Linked Open Data  Publications through  Wikidata &  Persistent Identification...Linked Open Data  Publications through  Wikidata &  Persistent Identification...
Linked Open Data Publications through Wikidata & Persistent Identification...
 
20th Century Press Archives goes Wikidata
20th Century Press Archives goes Wikidata20th Century Press Archives goes Wikidata
20th Century Press Archives goes Wikidata
 
Estermann Wikidata GLAM Example Projects 20170914
Estermann Wikidata GLAM Example Projects 20170914Estermann Wikidata GLAM Example Projects 20170914
Estermann Wikidata GLAM Example Projects 20170914
 
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
 
LD4 conference 2020 The Use of Linked Data at the ISSN International Centre
LD4 conference 2020 The Use of Linked Data at the ISSN International CentreLD4 conference 2020 The Use of Linked Data at the ISSN International Centre
LD4 conference 2020 The Use of Linked Data at the ISSN International Centre
 
Hahn "Wikidata as a hub to library linked data re-use"
Hahn "Wikidata as a hub to library linked data re-use"Hahn "Wikidata as a hub to library linked data re-use"
Hahn "Wikidata as a hub to library linked data re-use"
 
Smart Data Applications powered by the Wikidata Knowledge Graph
Smart Data Applications powered by the Wikidata Knowledge GraphSmart Data Applications powered by the Wikidata Knowledge Graph
Smart Data Applications powered by the Wikidata Knowledge Graph
 
20th Century Press Archives goes Wikidata
20th Century Press Archives goes Wikidata20th Century Press Archives goes Wikidata
20th Century Press Archives goes Wikidata
 
07 reusable padfield
07 reusable padfield07 reusable padfield
07 reusable padfield
 
Evolving the Web into a Global Database - Advances and Applications.
Evolving the Web into a Global Database - Advances and Applications. Evolving the Web into a Global Database - Advances and Applications.
Evolving the Web into a Global Database - Advances and Applications.
 
Wikidata and performing_arts_20180116
Wikidata and performing_arts_20180116Wikidata and performing_arts_20180116
Wikidata and performing_arts_20180116
 
Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)
 
KEDL DBpedia 2019
KEDL DBpedia  2019KEDL DBpedia  2019
KEDL DBpedia 2019
 
ALIADA Project. AtCult
ALIADA Project. AtCultALIADA Project. AtCult
ALIADA Project. AtCult
 
Wikidata Introductory Workshop
Wikidata Introductory WorkshopWikidata Introductory Workshop
Wikidata Introductory Workshop
 
BESOCIAL A Knowledge Graph for Social Media Archiving
BESOCIAL A Knowledge Graph for Social Media ArchivingBESOCIAL A Knowledge Graph for Social Media Archiving
BESOCIAL A Knowledge Graph for Social Media Archiving
 

More from Joachim Neubert

Exploring and mapping the category system of the world‘s largest public press...
Exploring and mapping the category system of the world‘s largest public press...Exploring and mapping the category system of the world‘s largest public press...
Exploring and mapping the category system of the world‘s largest public press...Joachim Neubert
 
Wikidata as a hub for the linked data cloud
Wikidata as a hub for the linked data cloudWikidata as a hub for the linked data cloud
Wikidata as a hub for the linked data cloudJoachim Neubert
 
Chancen und Herausforderungen einer komplementären Nutzung von GND und Wikidata
Chancen und Herausforderungen einer komplementären Nutzung von GND und WikidataChancen und Herausforderungen einer komplementären Nutzung von GND und Wikidata
Chancen und Herausforderungen einer komplementären Nutzung von GND und WikidataJoachim Neubert
 
Pressemappe 20. Jahrhundert: Personen- und Firmendossiers
Pressemappe 20. Jahrhundert: Personen- und FirmendossiersPressemappe 20. Jahrhundert: Personen- und Firmendossiers
Pressemappe 20. Jahrhundert: Personen- und FirmendossiersJoachim Neubert
 
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)Joachim Neubert
 
Making Wikidata fit as a Linking Hub for Knowledge Organization Systems
Making Wikidata fit as a Linking Hub for Knowledge Organization SystemsMaking Wikidata fit as a Linking Hub for Knowledge Organization Systems
Making Wikidata fit as a Linking Hub for Knowledge Organization SystemsJoachim Neubert
 
Linking authorities through Wikidata
Linking authorities through WikidataLinking authorities through Wikidata
Linking authorities through WikidataJoachim Neubert
 
Wikidata as a linking hub for knowledge organization systems? Integrating an ...
Wikidata as a linking hub for knowledge organization systems? Integrating an ...Wikidata as a linking hub for knowledge organization systems? Integrating an ...
Wikidata as a linking hub for knowledge organization systems? Integrating an ...Joachim Neubert
 
Wikidata as authority linking hub
Wikidata as authority linking hubWikidata as authority linking hub
Wikidata as authority linking hubJoachim Neubert
 
EconBiz Research Dataset (SWIB16 Lightning Talk)
EconBiz Research Dataset (SWIB16 Lightning Talk)EconBiz Research Dataset (SWIB16 Lightning Talk)
EconBiz Research Dataset (SWIB16 Lightning Talk)Joachim Neubert
 
Change Tracking in Knowledge Organization Systems with skos-history
Change Tracking in Knowledge Organization Systems with skos-historyChange Tracking in Knowledge Organization Systems with skos-history
Change Tracking in Knowledge Organization Systems with skos-historyJoachim Neubert
 
Anforderungen an Thesauri im Semantic Web
Anforderungen an Thesauri im Semantic WebAnforderungen an Thesauri im Semantic Web
Anforderungen an Thesauri im Semantic WebJoachim Neubert
 
Leveraging SKOS to trace the overhaul of the STW Thesaurus for Economics
Leveraging SKOS to trace the overhaul of the STW Thesaurus for EconomicsLeveraging SKOS to trace the overhaul of the STW Thesaurus for Economics
Leveraging SKOS to trace the overhaul of the STW Thesaurus for EconomicsJoachim Neubert
 
skos-history: Tracking the evolution of Knowledge Organization Systems
skos-history: Tracking the evolution of Knowledge Organization Systemsskos-history: Tracking the evolution of Knowledge Organization Systems
skos-history: Tracking the evolution of Knowledge Organization SystemsJoachim Neubert
 
KOS evolution in Linked Data
KOS evolution in Linked DataKOS evolution in Linked Data
KOS evolution in Linked DataJoachim Neubert
 
Exploiting the version history of SKOS files: skos-history (SWIB13 Lightning ...
Exploiting the version history of SKOS files: skos-history (SWIB13 Lightning ...Exploiting the version history of SKOS files: skos-history (SWIB13 Lightning ...
Exploiting the version history of SKOS files: skos-history (SWIB13 Lightning ...Joachim Neubert
 
Linked Data Publishing with Drupal (SWIB13 workshop)
Linked Data Publishing with Drupal (SWIB13 workshop)Linked Data Publishing with Drupal (SWIB13 workshop)
Linked Data Publishing with Drupal (SWIB13 workshop)Joachim Neubert
 
Constantly Under Construction: STW Thesaurus for Economics Linked Data Maint...
Constantly Under Construction: STW Thesaurus for Economics Linked Data Maint...Constantly Under Construction: STW Thesaurus for Economics Linked Data Maint...
Constantly Under Construction: STW Thesaurus for Economics Linked Data Maint...Joachim Neubert
 
Linked data enhanced publishing for special collections (with Drupal)
Linked data enhanced publishing for special collections (with Drupal)Linked data enhanced publishing for special collections (with Drupal)
Linked data enhanced publishing for special collections (with Drupal)Joachim Neubert
 

More from Joachim Neubert (20)

Exploring and mapping the category system of the world‘s largest public press...
Exploring and mapping the category system of the world‘s largest public press...Exploring and mapping the category system of the world‘s largest public press...
Exploring and mapping the category system of the world‘s largest public press...
 
Wikidata (für Archive)
Wikidata (für Archive)Wikidata (für Archive)
Wikidata (für Archive)
 
Wikidata as a hub for the linked data cloud
Wikidata as a hub for the linked data cloudWikidata as a hub for the linked data cloud
Wikidata as a hub for the linked data cloud
 
Chancen und Herausforderungen einer komplementären Nutzung von GND und Wikidata
Chancen und Herausforderungen einer komplementären Nutzung von GND und WikidataChancen und Herausforderungen einer komplementären Nutzung von GND und Wikidata
Chancen und Herausforderungen einer komplementären Nutzung von GND und Wikidata
 
Pressemappe 20. Jahrhundert: Personen- und Firmendossiers
Pressemappe 20. Jahrhundert: Personen- und FirmendossiersPressemappe 20. Jahrhundert: Personen- und Firmendossiers
Pressemappe 20. Jahrhundert: Personen- und Firmendossiers
 
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
 
Making Wikidata fit as a Linking Hub for Knowledge Organization Systems
Making Wikidata fit as a Linking Hub for Knowledge Organization SystemsMaking Wikidata fit as a Linking Hub for Knowledge Organization Systems
Making Wikidata fit as a Linking Hub for Knowledge Organization Systems
 
Linking authorities through Wikidata
Linking authorities through WikidataLinking authorities through Wikidata
Linking authorities through Wikidata
 
Wikidata as a linking hub for knowledge organization systems? Integrating an ...
Wikidata as a linking hub for knowledge organization systems? Integrating an ...Wikidata as a linking hub for knowledge organization systems? Integrating an ...
Wikidata as a linking hub for knowledge organization systems? Integrating an ...
 
Wikidata as authority linking hub
Wikidata as authority linking hubWikidata as authority linking hub
Wikidata as authority linking hub
 
EconBiz Research Dataset (SWIB16 Lightning Talk)
EconBiz Research Dataset (SWIB16 Lightning Talk)EconBiz Research Dataset (SWIB16 Lightning Talk)
EconBiz Research Dataset (SWIB16 Lightning Talk)
 
Change Tracking in Knowledge Organization Systems with skos-history
Change Tracking in Knowledge Organization Systems with skos-historyChange Tracking in Knowledge Organization Systems with skos-history
Change Tracking in Knowledge Organization Systems with skos-history
 
Anforderungen an Thesauri im Semantic Web
Anforderungen an Thesauri im Semantic WebAnforderungen an Thesauri im Semantic Web
Anforderungen an Thesauri im Semantic Web
 
Leveraging SKOS to trace the overhaul of the STW Thesaurus for Economics
Leveraging SKOS to trace the overhaul of the STW Thesaurus for EconomicsLeveraging SKOS to trace the overhaul of the STW Thesaurus for Economics
Leveraging SKOS to trace the overhaul of the STW Thesaurus for Economics
 
skos-history: Tracking the evolution of Knowledge Organization Systems
skos-history: Tracking the evolution of Knowledge Organization Systemsskos-history: Tracking the evolution of Knowledge Organization Systems
skos-history: Tracking the evolution of Knowledge Organization Systems
 
KOS evolution in Linked Data
KOS evolution in Linked DataKOS evolution in Linked Data
KOS evolution in Linked Data
 
Exploiting the version history of SKOS files: skos-history (SWIB13 Lightning ...
Exploiting the version history of SKOS files: skos-history (SWIB13 Lightning ...Exploiting the version history of SKOS files: skos-history (SWIB13 Lightning ...
Exploiting the version history of SKOS files: skos-history (SWIB13 Lightning ...
 
Linked Data Publishing with Drupal (SWIB13 workshop)
Linked Data Publishing with Drupal (SWIB13 workshop)Linked Data Publishing with Drupal (SWIB13 workshop)
Linked Data Publishing with Drupal (SWIB13 workshop)
 
Constantly Under Construction: STW Thesaurus for Economics Linked Data Maint...
Constantly Under Construction: STW Thesaurus for Economics Linked Data Maint...Constantly Under Construction: STW Thesaurus for Economics Linked Data Maint...
Constantly Under Construction: STW Thesaurus for Economics Linked Data Maint...
 
Linked data enhanced publishing for special collections (with Drupal)
Linked data enhanced publishing for special collections (with Drupal)Linked data enhanced publishing for special collections (with Drupal)
Linked data enhanced publishing for special collections (with Drupal)
 

Recently uploaded

SCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is prediSCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is predieusebiomeyer
 
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一Fs
 
Blepharitis inflammation of eyelid symptoms cause everything included along w...
Blepharitis inflammation of eyelid symptoms cause everything included along w...Blepharitis inflammation of eyelid symptoms cause everything included along w...
Blepharitis inflammation of eyelid symptoms cause everything included along w...Excelmac1
 
Q4-1-Illustrating-Hypothesis-Testing.pptx
Q4-1-Illustrating-Hypothesis-Testing.pptxQ4-1-Illustrating-Hypothesis-Testing.pptx
Q4-1-Illustrating-Hypothesis-Testing.pptxeditsforyah
 
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作ys8omjxb
 
Font Performance - NYC WebPerf Meetup April '24
Font Performance - NYC WebPerf Meetup April '24Font Performance - NYC WebPerf Meetup April '24
Font Performance - NYC WebPerf Meetup April '24Paul Calvano
 
Elevate Your Business with Our IT Expertise in New Orleans
Elevate Your Business with Our IT Expertise in New OrleansElevate Your Business with Our IT Expertise in New Orleans
Elevate Your Business with Our IT Expertise in New Orleanscorenetworkseo
 
Magic exist by Marta Loveguard - presentation.pptx
Magic exist by Marta Loveguard - presentation.pptxMagic exist by Marta Loveguard - presentation.pptx
Magic exist by Marta Loveguard - presentation.pptxMartaLoveguard
 
PHP-based rendering of TYPO3 Documentation
PHP-based rendering of TYPO3 DocumentationPHP-based rendering of TYPO3 Documentation
PHP-based rendering of TYPO3 DocumentationLinaWolf1
 
Film cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasaFilm cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasa494f574xmv
 
Top 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxTop 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxDyna Gilbert
 
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一Fs
 
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书rnrncn29
 
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一Fs
 
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)Christopher H Felton
 
NSX-T and Service Interfaces presentation
NSX-T and Service Interfaces presentationNSX-T and Service Interfaces presentation
NSX-T and Service Interfaces presentationMarko4394
 
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书rnrncn29
 
Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170
Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170
Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170Sonam Pathan
 

Recently uploaded (20)

SCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is prediSCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is predi
 
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
 
Blepharitis inflammation of eyelid symptoms cause everything included along w...
Blepharitis inflammation of eyelid symptoms cause everything included along w...Blepharitis inflammation of eyelid symptoms cause everything included along w...
Blepharitis inflammation of eyelid symptoms cause everything included along w...
 
Q4-1-Illustrating-Hypothesis-Testing.pptx
Q4-1-Illustrating-Hypothesis-Testing.pptxQ4-1-Illustrating-Hypothesis-Testing.pptx
Q4-1-Illustrating-Hypothesis-Testing.pptx
 
young call girls in Uttam Nagar🔝 9953056974 🔝 Delhi escort Service
young call girls in Uttam Nagar🔝 9953056974 🔝 Delhi escort Serviceyoung call girls in Uttam Nagar🔝 9953056974 🔝 Delhi escort Service
young call girls in Uttam Nagar🔝 9953056974 🔝 Delhi escort Service
 
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
 
Font Performance - NYC WebPerf Meetup April '24
Font Performance - NYC WebPerf Meetup April '24Font Performance - NYC WebPerf Meetup April '24
Font Performance - NYC WebPerf Meetup April '24
 
Elevate Your Business with Our IT Expertise in New Orleans
Elevate Your Business with Our IT Expertise in New OrleansElevate Your Business with Our IT Expertise in New Orleans
Elevate Your Business with Our IT Expertise in New Orleans
 
Magic exist by Marta Loveguard - presentation.pptx
Magic exist by Marta Loveguard - presentation.pptxMagic exist by Marta Loveguard - presentation.pptx
Magic exist by Marta Loveguard - presentation.pptx
 
PHP-based rendering of TYPO3 Documentation
PHP-based rendering of TYPO3 DocumentationPHP-based rendering of TYPO3 Documentation
PHP-based rendering of TYPO3 Documentation
 
Film cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasaFilm cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasa
 
Top 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxTop 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptx
 
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
 
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
 
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
 
Hot Sexy call girls in Rk Puram 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in  Rk Puram 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in  Rk Puram 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Rk Puram 🔝 9953056974 🔝 Delhi escort Service
 
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
 
NSX-T and Service Interfaces presentation
NSX-T and Service Interfaces presentationNSX-T and Service Interfaces presentation
NSX-T and Service Interfaces presentation
 
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
 
Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170
Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170
Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170
 

ZBW links historic press archives to Wikidata

  • 1. ZBW is member of the Leibniz Association Wikidata as opportunity for special collections: the 20th Century Press Archives use case Joachim Neubert ZBW – Leibniz Information Centre for Economics, Kiel/Hamburg LIBER 2019, Linked Open Data Working Group 26.06.2019, Dublin (Ireland)
  • 2. Agenda 1. What are we dealing with? 2. Why Wikidata? 3. Transfering metadata a. Link to existing items b. Create missing items c. Add metadata to the items 4. Using the data 5. Future work Page 2
  • 4. Page 4 What are we dealing with? Historic Press Archives, founded in 1909 (Hamburg) and 1914 (Kiel) • Some material dating back to 1826 • Collections closed in 2005 Thematic dossiers covering  Persons • Companies • Products • General subjects and events
  • 5. Page 5 Current state Former DFG funded project, resulting in • Digitized roll films (material before 1949) • Relational database about dossiers, often with GND ID • Big filesystem (containing more than 2m pages) • Accessible via • custom application “Pressemappe 20. Jahrhundert” and • DFG-Viewer (METS/MODS files, per dossier) All metadata available under CC 0 license
  • 6. Long term sustainability? Specialized application for discovery and access, architecturally outdated and expensive to maintain Page 6 http://webopac.hwwa.de/pressemappe20
  • 8. Wikidata basics • Knowledge base for Wikimedia projects • All kinds of entities: concepts, places, people, works … • Editable and extensible by everyone • Data available under CC0 • http://query.wikidata.org/ (SPARQL) • JSON API & database dumps • Sustainable foundation for long-term available data Page 8
  • 11. Linking mechanism: external identifiers • Property value: unique IDs from external database • + URL stub in the property definition („formatter URL“) • Almost 4,000 external identifier properties • Examples: • GND • proteins • African plants • Swedish cultural heritage objects Page 11
  • 12. Transfering collection metadata to Wikidata 1. What are we dealing with? 2. Why Wikidata? 3. Transfering metadata a. Link to existing items b. Create missing items c. Add metadata to the items 4. Using the data 5. Future work Page 12
  • 13. Wikidata property P4293 (PM20 folder ID) • Property proposal and discussion within the community Additional prerequisite: • RDF representation of PM20 contents and a SPARQL endpoint, allowing federated queries with the Wikidata endpoint Page 13
  • 14. Link to existing items • Automatically inserted links derived from GND IDs • Tool-supported manual linking • Wikidata‘s Mix-n-match (great for persons, crowd-sourced) • custom tools (like this) • others (OpenRefine, …)  ~ 95% of PM20 person folders linked by mid-June 2019! Page 14
  • 15. Checking proposed matches in Mix‘n‘match Seite 15
  • 16. Add missing items to Wikidata - automatically Recommendations for item creation: • Pay attention to Wikidata’s notability criteria • Explain your plan and ask for feedback in the Wikidata project chat • Apply for a bot account to make mass edits (example) • Source every statement Process: • Transform query results to QuickStatements input file • Copy & paste into QuickStatements Page 16
  • 17. QuickStatements input from PM20 • using a federated query to exclude existing Wikidata items • query output transformed by a script Page 17
  • 18. Added Wikidata item Page 18  all 5200 PM20 person folders now linked from Wikidata!
  • 19. Add metadata to Wikidata items e.g., for all persons in Wikidata with PM20 ID and the PM20 “field of activity”: “economics” or “business economics”, insert the according occupation into the WD person item (script, query) Page 19
  • 20. Using the data on Wikidata Page 20
  • 21. „Proof of concept“ example: Map of economists Page 21 Query link (on Wikidata SPARQL endpoint – see also list of all PM20 economists)
  • 22. Display via DFG Viewer link Page 22
  • 23. Future work • Build community support for further extension of the PM20 metadata • Create an item structure for the subject and ware archives, and link the folders (~ 12,000) • Link/create items for company folders (~ 8,000) • Create a static HTML site with one page per folder (+ additional navigation pages) on the PM20 web site which hosts the digitized images (= permanent reference) • Optionally, create additional Wikidata-based searching/browsing facilities • Retire the present ColdFusion application Page 23
  • 24. Wikiproject 20th Century Press Archives Page 24 https://www.wikidata.org/wiki/Wikidata:WikiProject_20th_Century_Press_Archives
  • 25. Page 25 Thanks for listening! Joachim Neubert ZBW – Leibniz Information Centre for Economics j.neubert@zbw.eu http://zbw.eu/labs https://www.wikidata.org/wiki/User:Jneubert