Linked Data – the Future for Open Repositories?


                               Kultivate Linked Data Workshop
                                                   London, UK
                                           12th December 2011


                                                         Adrian Stevenson
                                UKOLN, University of Bath, UK (until end Dec 2011)
 Mimas, Libraries and Archives Team, University of Manchester, UK (from Jan 2012)
LOCAH and Linking Lives Projects

• Linked Open Copac and Archives Hub
   – Funded by #JiscEXPO 2/10 ‘Expose’ call
      • 1 year project. Started August 2010
   – Partners & Consultants:
      • UKOLN, Mimas, Eduserv, Talis, OCLC, Ed Summers
   – http://blogs.ukoln.ac.uk/locah/


• Linking Lives
   – JISC funded for ‘Mimas Enhancements’
   – 11 month project. Started Sept 2011
   – http://archiveshub.ac.uk/linkinglives/
Archives Hub and Copac
• UK National Data Services based at Mimas
• Archives Hub is an aggregation of archival
  descriptions from archive repositories
  across the UK
  – http://archiveshub.ac.uk
• Copac provides access to the merged library
  catalogues of libraries throughout the
  UK, including all national libraries
  – http://copac.ac.uk
LOCAH Outputs
• Expose Archives Hub & Copac
  data as Linked Data
• Create a prototype visualisation
• Report on opportunities and
  barriers
How do we expose the Linked Data?

1.   Model our ‘things’ into RDF
2.   Transform the existing data into RDF/XML
3.   Enhance the data
4.   Load the RDF/XML into a triple store
5.   Create Linked Data Views
6.   Document the process, opportunities and
     barriers on LOCAH Blog
Modelling ‘things’ into RDF
• Archives Hub data in ‘Encoded Archival
  Description’ EAD XML form
  – http://www.loc.gov/ead/

• Copac data in ‘Metadata Object Description
  Schema’ MODS XML form
  – http://www.loc.gov/standards/mods/

• Take a step back from the data format
  – What is an EAD or MODS document “saying” about
    “things”?
  – What questions do we want to answer about those
    “things”?
URI Patterns
• Need to decide on patterns for URIs we generate
• Following guidance from W3C ‘Cool URIs for the Semantic
  Web’ and UK Cabinet Office ‘Designing URI Sets for the UK
  Public Sector’
   http://data.archiveshub.ac.uk/id/findingaid/gb1086skinner
      ‘thing’ URI
      redirects to …
   http://data.archiveshub.ac.uk/doc/findingaid/gb1086skinner
      document URI


    http://www.w3.org/TR/cooluris/
    http://www.cabinetoffice.gov.uk/resource-library/designing-uri-sets-uk-public-sector
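
The ‘thing’ URI to document URI pattern above can be sketched as a small helper. This is an illustration only: it assumes the /id/ and /doc/ path segments shown in the example URIs, the function names are hypothetical, and the 303 status code follows the redirect approach described in ‘Cool URIs for the Semantic Web’ rather than anything stated on the slide.

```python
from urllib.parse import urlsplit, urlunsplit


def document_uri(thing_uri):
    """Map a 'thing' URI (/id/...) to its document URI (/doc/...)."""
    parts = urlsplit(thing_uri)
    if not parts.path.startswith("/id/"):
        raise ValueError("not a 'thing' URI: " + thing_uri)
    # Swap only the leading path segment; the rest of the path is kept.
    doc_path = "/doc/" + parts.path[len("/id/"):]
    return urlunsplit(
        (parts.scheme, parts.netloc, doc_path, parts.query, parts.fragment)
    )


def redirect_response(thing_uri):
    """Return (status, headers) for a hypothetical 303 See Other redirect."""
    return 303, {"Location": document_uri(thing_uri)}


if __name__ == "__main__":
    print(redirect_response(
        "http://data.archiveshub.ac.uk/id/findingaid/gb1086skinner"
    ))
```

Keeping the mapping in one function means the URI pattern is decided once and reused wherever URIs are generated.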
Vocabularies

• Using existing RDF vocabularies
  – DC, SKOS, FOAF, BIBO, WGS84 Geo, Lexvo, ORE,
    LODE, Event and Time Ontologies
• Define additional RDF terms where required
  – hub:ArchivalResource
  – copac:Creator
• It can be hard to know where to find and how
  to use vocabularies and ontologies
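
One practical aid when working with several vocabularies is a prefix table that expands short names such as foaf:focus into full URIs. The sketch below covers only some of the vocabularies listed above, using their published namespace URIs. Reading ‘DC’ as DC Terms is an assumption, and the hub: and copac: prefixes are left out because the slides do not give their namespace URIs.

```python
# Published namespace URIs for some of the vocabularies named on the slide.
PREFIXES = {
    "dcterms": "http://purl.org/dc/terms/",
    "skos": "http://www.w3.org/2004/02/skos/core#",
    "foaf": "http://xmlns.com/foaf/0.1/",
    "bibo": "http://purl.org/ontology/bibo/",
    "geo": "http://www.w3.org/2003/01/geo/wgs84_pos#",
    "ore": "http://www.openarchives.org/ore/terms/",
}


def expand(curie):
    """Expand a prefixed name like 'foaf:focus' into a full URI."""
    prefix, sep, local = curie.partition(":")
    if not sep or prefix not in PREFIXES:
        raise KeyError("unknown prefix in " + repr(curie))
    return PREFIXES[prefix] + local


if __name__ == "__main__":
    for name in ("foaf:focus", "skos:inScheme", "dcterms:title"):
        print(name, "->", expand(name))
```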
Archives Hub Model
[Diagram: Archives Hub data model]
• Entities: Finding Aid, EAD Document, Repository (Agent), Place, Postcode Unit, Archival Resource, Biographical History, Level, Language, Creation, Temporal Entity, Extent, Object, Book, Genre, Function, Concept, Concept Scheme, Agent (Person, Family, Organisation), Birth, Death
• Relationships: maintainedBy/maintains, administeredBy/administers, encodedAs/encodes, hasPart/partOf, accessProvidedBy/providesAccessTo, hasBiogHist/isBiogHistFor, topic/page, origination, associatedWith, inScheme, foaf:focus, representedBy, level, language, extent, product of, participates in, at time, in, is-a
Copac Model
Transforming into RDF/XML

• Transform EAD and MODS to RDF/XML based
  on our models
  – Hub: XSLT Stylesheet
  – Copac: created in-house Java transformation
    program


• Load RDF/XML into a triple store
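
The transformation step can be illustrated with a very small sketch. Note the substitutions: the Hub used an XSLT stylesheet and produced RDF/XML, whereas this example uses Python's standard xml.etree module and emits N-Triples lines, purely to keep the illustration short. The sample EAD fragment, its title, the base URI, the vocabulary namespace and the slug rule are all assumptions made for the example, not the project's actual data or rules.

```python
import re
import xml.etree.ElementTree as ET

RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
DCT_TITLE = "http://purl.org/dc/terms/title"
HUB_NS = "http://example.org/hub/def/"            # assumed namespace
BASE = "http://example.org/id/archivalresource/"  # assumed URI pattern

# Made-up, heavily simplified EAD fragment (no namespace, no header).
SAMPLE_EAD = """<ead>
  <archdesc level="fonds">
    <did>
      <unitid>GB 1086 Skinner</unitid>
      <unittitle>Skinner Family Papers</unittitle>
    </did>
  </archdesc>
</ead>"""


def slug(unitid):
    """Assumed rule: lower-case the identifier and drop non-alphanumerics."""
    return re.sub(r"[^a-z0-9]", "", unitid.lower())


def escape_literal(text):
    """Escape backslashes, quotes and newlines for an N-Triples literal."""
    return (text.replace("\\", "\\\\")
                .replace('"', '\\"')
                .replace("\n", "\\n"))


def ead_to_ntriples(ead_xml):
    """Turn the top-level description of an EAD document into triples."""
    root = ET.fromstring(ead_xml)
    did = root.find("./archdesc/did")
    subject = BASE + slug(did.findtext("unitid"))
    title = did.findtext("unittitle").strip()
    return [
        "<%s> <%s> <%sArchivalResource> ." % (subject, RDF_TYPE, HUB_NS),
        '<%s> <%s> "%s" .' % (subject, DCT_TITLE, escape_literal(title)),
    ]


if __name__ == "__main__":
    for line in ead_to_ntriples(SAMPLE_EAD):
        print(line)
```

A real stylesheet would walk every level of the hierarchy; this sketch handles only the top-level description.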
We’re Linking Data!
• If something is identified, it can be linked to
• We take items from our datasets and link them
  to items from other datasets


[Diagram: linked datasets: BBC, Copac, VIAF, DBPedia, GeoNames, Archives Hub]
Enhancing our data
• Already have some links:
   – Time - reference.data.gov.uk URIs
   – Location - UK Postcodes URIs and Ordnance
     Survey URIs
   – Names - Virtual International Authority File
      • VIAF matches and links widely-used authority
        files - http://viaf.org/
   – Names - DBPedia
• Also looking at:
   – Subjects - Library of Congress Subject Headings
     and DBPedia
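
Name linking of the kind listed above can be sketched as a lookup against an external authority list. This is a simplified illustration, not the project's matching process: it assumes exact matching on a normalised name, assumes owl:sameAs as the linking property (the slides do not say which property is used), and uses a placeholder example.org URI in place of a real authority identifier.

```python
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"


def normalise(name):
    """Lower-case a name, drop commas and collapse whitespace."""
    return " ".join(name.lower().replace(",", " ").split())


def link_names(local, authority):
    """Emit sameAs triples for local names found in the authority list.

    Both arguments map a name string to a URI.
    """
    index = {normalise(name): uri for name, uri in authority.items()}
    triples = []
    for name, local_uri in local.items():
        match = index.get(normalise(name))
        if match is not None:
            triples.append("<%s> <%s> <%s> ." % (local_uri, OWL_SAME_AS, match))
    return triples


if __name__ == "__main__":
    local = {
        "Webb, Martha Beatrice": "http://data.archiveshub.ac.uk/id/person/"
        "nra/webbmarthabeatrice1858-1943socialreformer",
    }
    # Placeholder authority entry; not a real VIAF identifier.
    authority = {"Webb Martha Beatrice": "http://example.org/authority/0001"}
    for line in link_names(local, authority):
        print(line)
```

Exact matching is deliberately conservative. Name variants and ‘dirty’ data would need fuzzier matching and manual review.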
http://data.archiveshub.ac.uk/
http://data.archiveshub.ac.uk/id/person/nra/webbmarthabeatrice1858-1943socialreformer
http://data.copac.ac.uk/ (to be released very soon!)
Visualisation Prototype
• Using Timemap
  – Google Maps and Simile
  – http://code.google.com/p/timemap/
• Early stages with this
• Will give location and ‘extent’ of archive
• Will link through to Archives Hub
Linking Lives Project




               http://archiveshub.ac.uk/linkinglives/
BBC Music




http://www.bbc.co.uk/music/artists/f0ed72a3-ae8f-4cf7-b51d-2696a2330230
Key Benefit of Linked Data
• API-based mashups work against a fixed
  set of data sources
   • Hand crafted by humans
   • Don’t integrate well
• Linked Data promises an unbound global
  data space
   • Easy dataset integration
   • Generic ‘mesh-up’ tools
Challenges
Data Modelling
• Steep learning curve
  – RDF terminology “confusing”
  – Lack of archival examples
• Complexity
  – Archival description is hierarchical and
    multi-level
• ‘Dirty’ Data
Linking Subjects
Linking Places
Scalability / Provenance




• Same issue with attribution
• Solutions: Named graphs? Quads?
• Best Practice
Example by Bradley Allen, Elsevier, at LOD LAM Summit, SF, USA, June 2011
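
The ‘quads’ idea can be shown in a few lines: each triple gains a fourth term naming the graph it came from, so provenance travels with the statement. The sketch assumes triples are already serialised as N-Triples lines, and the graph URI is a made-up example; it is not a description of how either project handles provenance.

```python
def to_nquads(triples, graph_uri):
    """Append a graph name to each N-Triples line, giving N-Quads lines."""
    quads = []
    for line in triples:
        statement = line.rstrip()
        if not statement.endswith("."):
            raise ValueError("not a complete N-Triples statement: " + repr(line))
        # Drop the final '.', add the graph term, then close the statement.
        body = statement[:-1].rstrip()
        quads.append("%s <%s> ." % (body, graph_uri))
    return quads


if __name__ == "__main__":
    triples = ['<http://example.org/s> <http://example.org/p> "o" .']
    for quad in to_nquads(triples, "http://example.org/graph/hub"):
        print(quad)
```

With the source recorded per statement, a consumer can filter or attribute data by graph after datasets have been merged.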
Licensing

• Ownership of data often not clear
• Difficult to track attribution and
  provenance
• CC0 for Archives Hub and Copac
  test datasets
Sustainability
• Can you rely on data sources long-term?
• Ed Summers at the Library of Congress
  created http://lcsh.info
• Linked Data interface for LOC subject
  headings
• People started using it
Library of Congress Subject Headings
Linked Data – the Future for Open Repositories?
• Enables ‘straightforward’ integration of
  wide variety of data sources
• Repository data can ‘work harder’
• New channels into your data
• Researchers are more likely to discover
  sources
• ‘Hidden’ collections become part of the Web
Attribution and CC License
• Sections of this presentation adapted from materials created
  by other members of the LOCAH & Linking Lives Projects
• This presentation is available under Creative Commons
  Non-Commercial Share-Alike:

  http://creativecommons.org/licenses/by-nc/2.0/uk/

 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 

Linked Data Future

  • 1. Linked Data – the Future for Open Repositories? Kultivate Linked Data Workshop London, UK 12th December 2011 Adrian Stevenson UKOLN, University of Bath, UK (until end Dec 2011) Mimas, Libraries and Archives Team, University of Manchester, UK (from Jan 2012)
  • 2. LOCAH and Linking Lives Projects • Linked Open Copac and Archives Hub – Funded by #JiscEXPO 2/10 ‘Expose’ call • 1 year project. Started August 2010 – Partners & Consultants: • UKOLN, Mimas, Eduserv, Talis, OCLC, Ed Summers – http://blogs.ukoln.ac.uk/locah/ • Linking Lives – JISC funded for ‘Mimas Enhancements’ – 11 month project. Started Sept 2011 – http://archiveshub.ac.uk/linkinglives/
  • 3. Archives Hub and Copac • UK National Data Services based at Mimas • Archives Hub is an aggregation of archival descriptions from archive repositories across the UK – http://archiveshub.ac.uk • Copac provides access to the merged library catalogues of libraries throughout the UK, including all national libraries – http://copac.ac.uk
  • 4. LOCAH Outputs • Expose Archives Hub & Copac data as Linked Data • Create a prototype visualisation • Report on opportunities and barriers
  • 5. How do we expose the Linked Data? 1. Model our ‘things’ into RDF 2. Transform the existing data into RDF/XML 3. Enhance the data 4. Load the RDF/XML into a triple store 5. Create Linked Data Views 6. Document the process, opportunities and barriers on LOCAH Blog
  • 6. Modelling ‘things’ into RDF • Archives Hub data in ‘Encoded Archival Description’ EAD XML form – http://www.loc.gov/ead/ • Copac data in ‘Metadata Object Description Schema’ MODS XML form – http://www.loc.gov/standards/mods/ • Take a step back from the data format – What is an EAD or MODS document “saying” about “things”? – What questions do we want to answer about those “things”?
  • 7. URI Patterns • Need to decide on patterns for URIs we generate • Following guidance from W3C ‘Cool URIs for the Semantic Web’ and UK Cabinet Office ‘Designing URI Sets for the UK Public Sector’ http://data.archiveshub.ac.uk/id/findingaid/gb1086skinner ‘thing’ URI redirects to … http://data.archiveshub.ac.uk/doc/findingaid/gb1086skinner document URI http://www.w3.org/TR/cooluris/ http://www.cabinetoffice.gov.uk/resource-library/designing-uri-sets-uk-public-sector
  • 8. Vocabularies • Using existing RDF vocabularies – DC, SKOS, FOAF, BIBO, WGS84 Geo, Lexvo, ORE, LODE, Event and Time Ontologies • Define additional RDF terms where required – hub:ArchivalResource – copac:Creator • It can be hard to know where to find and how to use vocabularies and ontologies
  • 9. Archives Hub Model [Diagram: entity-relationship model of Archives Hub data. Entities: Finding Aid, Repository (Agent), Place, Postcode Unit, EAD Document, Archival Resource, Biographical History, Level, Language, Extent, Creation, Temporal Entity, Agent (Person, Family, Organisation), Concept, Concept Scheme, Object, Book, Birth, Death, Genre, Function. Relationships: maintainedBy/maintains, administeredBy/administers, hasPart/partOf, encodedAs/encodes, accessProvidedBy/providesAccessTo, hasBiogHist/isBiogHistFor, topic/page, level, language, origination, product of, at time, associatedWith, extent, inScheme, representedBy, is-a, foaf:focus, participates in.]
  • 11. Transforming into RDF/XML • Transform EAD and MODS to RDF/XML based on our models – Hub: XSLT Stylesheet – Copac: created in-house Java transformation program • Load RDF/XML into a triple store
  • 12. We’re Linking Data! • If something is identified, it can be linked to • We take items from our datasets and link them to items from other datasets BBC Copac VIAF DBPedia GeoNames Archives Hub
  • 13. Enhancing our data • Already have some links: – Time - reference.data.gov.uk URIs – Location - UK Postcodes URIs and Ordnance Survey URIs – Names - Virtual International Authority File • VIAF matches and links widely-used authority files - http://viaf.org/ – Names - DBPedia • Also looking at: – Subjects - Library of Congress Subject Headings and DBPedia
  • 16. http://data.copac.ac.uk/ (to be released very soon!)
  • 17. Visualisation Prototype • Using Timemap – Google Maps and Simile – http://code.google.com/p/timemap/ • Early stages with this • Will give location and ‘extent’ of archive • Will link through to Archives Hub
  • 18. Linking Lives Project http://archiveshub.ac.uk/linkinglives/
  • 20. Key Benefit of Linked Data • API based mashups work against a fixed set of data sources • Hand crafted by humans • Don’t integrate well • Linked Data promises an unbound global data space • Easy dataset integration • Generic ‘mesh-up’ tools
  • 22. Data Modelling • Steep learning curve – RDF terminology “confusing” – Lack of archival examples • Complexity – Archival description is hierarchical and multi-level • ‘Dirty’ Data
  • 25. Scalability / Provenance • Same issue with attribution • Solutions: Named graphs? Quads? • Best Practice (Example by Bradley Allen, Elsevier, at LOD LAM Summit, SF, USA, June 2011)
  • 26. Licensing • Ownership of data often not clear • Difficult to track attribution and provenance • CC0 for Archives Hub and Copac test datasets
  • 27. Sustainability • Can you rely on data sources long-term? • Ed Summers at the Library of Congress created http://lcsh.info • Linked Data interface for LOC subject headings • People started using it
  • 28. Library of Congress Subject Headings
  • 29. Linked Data the Future for Open Repositories? • Enables ‘straightforward’ integration of wide variety of data sources • Repository data can ‘work harder’ • New channels into your data • Researchers are more likely to discover sources • ‘Hidden’ collections become of the Web
  • 30.
  • 31. Attribution and CC License • Sections of this presentation adapted from materials created by other members of the LOCAH & Linking Lives Projects • This presentation available under Creative Commons Non Commercial-Share Alike: http://creativecommons.org/licenses/by-nc/2.0/uk/
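
The URI pattern on slide 7 (a ‘thing’ URI under /id/ that redirects to a document URI under /doc/) can be sketched roughly as below. This is a minimal illustration only, not the LOCAH implementation: the function names `document_uri` and `respond` are hypothetical, and it assumes the only difference between the two URIs is the /id/ versus /doc/ path prefix, as in the gb1086skinner example.

```python
from urllib.parse import urlsplit, urlunsplit


def document_uri(thing_uri: str) -> str:
    """Map a 'thing' URI (/id/...) to its document URI (/doc/...).

    Assumption: the two URIs differ only in the leading path segment.
    """
    parts = urlsplit(thing_uri)
    if not parts.path.startswith("/id/"):
        raise ValueError(f"not a 'thing' URI: {thing_uri}")
    doc_path = "/doc/" + parts.path[len("/id/"):]
    return urlunsplit(
        (parts.scheme, parts.netloc, doc_path, parts.query, parts.fragment)
    )


def respond(thing_uri: str) -> tuple[int, dict[str, str]]:
    """Return the (status, headers) pair a server might send for a thing URI.

    303 See Other tells the client that the thing itself is not a web
    document and points it at a document describing the thing.
    """
    return 303, {"Location": document_uri(thing_uri)}


status, headers = respond(
    "http://data.archiveshub.ac.uk/id/findingaid/gb1086skinner"
)
print(status, headers["Location"])
```

Keeping the mapping in one small function means the identifier for the thing and the address of its description stay distinct, which is the point of the ‘Cool URIs’ guidance the slide cites.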

Editor's Notes

  1. Copac is a union catalogue. Both are successful JISC services that have been running for many years now. LOCAH is a research project; we will have to see whether it goes into service with a Linked Data interface.
  2. Encoded Archival Description (EAD) is an XML standard for encoding archival finding aids. The Metadata Object Description Schema (MODS) is an XML-based schema for a bibliographic element set that may be used for a variety of purposes, and particularly for library applications. “Things” include concepts and abstractions as well as material objects. We want location: archives are physical things, so location is important. Also wanted event data, partly steered by the visualisation prototype. Also ‘extent’ data, e.g. number of boxes.
  3. 303 redirects and content negotiation, from ‘Cool URIs for the Semantic Web’.
  4. Open Data Commons Public Domain Dedication; Creative Commons CC0 license.
  5. In hypertext web sites it is generally considered rather bad etiquette not to link to related external material. The value of your own information is very much a function of what it links to, as well as the inherent value of the information within the web page. So it is also in the Semantic Web. Remember, this is about machines linking. Machines need identifiers; humans generally know when something is a place or when it is a person. BBC + DBPedia + GeoNames + Archives Hub + Copac + VIAF = the Web as an exploratory space. Users are very interested in related materials, according to Terry Catapano at SAA 2011. Linked Data can really help with this.
  6. Can get XSLT stylesheet here too!
  7. Note that it is a machine-readable interface as well as the human interface. Currently have a few hundred in LOCAH. There are 25,000 EAD records on the Hub service. We’re intending to put about 2,000 up for the Linking Lives Project.
  8. More aggregation
  9. Data can be integrated from many different sources. Users are very interested in related materials, according to Terry Catapano at SAA 2011. Linked Data can really help with this.
  10. Steep learning curve: RDF and Linked Data modelling terminology; lack of archive domain examples (though you now have LOCAH!); a certain level of expertise is needed. Dirty data: e.g. ‘Joe Bloggs and others’ rather than just a name, or access points that do not have rules or a source associated with them; extent data is highly variable. Complexity: “lower level” units are interpreted in the context of the higher levels of description, and are arguably “incomplete” without the contextual data. Relations are asserted, e.g. member-of/component-of, but there is no requirement or expectation that data consumers will follow the links describing the relations.
  11. Ex