Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

Persistent Identifiers in EUDAT services| www.eudat.eu |

949 views

Published on

| www.eudat.eu | The EUDAT data domain handles registered data. Each digital object should have a persistent identifier. This persistent identifier is used for: Replica identification; Identification of the repository of record (in the case of replication); Querying of additional information; Checksum (time stamped)...

Published in: Technology
  • Login to see the comments

  • Be the first to like this

Persistent Identifiers in EUDAT services| www.eudat.eu |

  1. 1. www.eudat.euEUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No. 654065 Persistent Identifiers in EUDAT services PIDs in EUDAT Version 2 June 2017 This work is licensed under the Creative Commons CC-BY 4.0 licence
  2. 2. The EUDAT data domain handles registered data Each digital object should have a persistent identifier This persistent identifier is used for: Replica identification Identification of the repository of record (in the case of replication) Querying of additional information Checksum (time stamped) Actionable PIDs: Of the form http://<resolver>/<prefix>/<suffix> PIDs in EUDAT
  3. 3. The EUDAT Service Suite + PIDs http://www.eudat.eu/services Supports living objects  no PIDs PIDs (collections, files)  referable PIDs (collection, files)  long-term preservation User Access no PIDs PIDs  fetch data PIDs  refer to data PID management
  4. 4. B2SHARE A user-friendly, reliable and trustworthy tool for researchers, scientific communities and citizen scientists to store and share small-scale research data coming from diverse contexts. PIDs to every data collection, to make them referable, and to every file to ease automatic downloads
  5. 5. B2SHARE: The process - Assigns PIDs to every data collection to allow citation - Assigns PIDs to every file and also stores the checksum to allow automatic download and integrity checks
  6. 6. B2SHARE The persistent identifier for the collection
  7. 7. B2SHARE The persistent identifier for files
  8. 8. B2SAFE A robust, safe and highly available service which allows community and departmental repositories to implement data management policies on research data across multiple administrative domains, in a trustworthy manner PIDs at file level, for long-term preservation and linking replicas and their originals
  9. 9. B2SAFE: What happens step by step? iRods PID Data Center Store 1 Community repository Digital Object (DO) unique identifier (PID) to the DO PID Data ingestion Data replication own PID system OR iRODS rules iRods CommunityCentre iRods PID Data Center Store 2 Based on community policy PID assignment
  10. 10. B2SAFE: Original DO and replicas
  11. 11. B2STAGE A reliable, efficient, light-weight and easy-to-use service to transfer research data sets between EUDAT storage resources and high- performance computing (HPC) workspaces. PIDs, to fetch data Transfer large data collections In conjunction with B2SAFE, replicate community data sets, ingesting them onto EUDAT storage resources for long-term preservation Ingest computation results into the EUDAT infrastructure
  12. 12. B2STAGE
  13. 13. B2STAGE
  14. 14. B2FIND a discovery service offering a simple, user-friendly metadata catalogue of research data collections stored in EUDAT data centres and other repositories. PIDs, as source identifier Find collections of scientific data quickly and easily, irrespective of their origin, discipline or community Get quick overviews of available data Browse through collections using standardised facets
  15. 15. B2FIND Metadata are harvested from various research community repositories spanning a wide scope of research disciplines. The benefit for the communities publishing metadata in EUDAT is improved visibility and discoverability of their research data in an interdisciplinary, pan-European scope.
  16. 16. B2FIND – B2SHARE Community PID Training The Source is an identifier, therefore a unique string that identifies the resource. It may link to the data resource itself or to a landing page that points to the data. You may also find PID as an alternate identifier. B2FIND uses B2SHARE PIDs
  17. 17. B2FIND – SDL Community PID Training The SDL community supports DOI as alternate identifier B2FIND uses PID and DOI from the SDL Community.
  18. 18. B2HANDLE EUDAT has adopted Handle-based persistent identifiers based on a solution combining the Handle technology and the EPIC federation. B2HANDLE is a central service for managing persistent identifiers at EUDAT. PID management Why Handles? Stable globally unique IDs, stable cross-Links Simple Integration
  19. 19. PIDs created with B2HANDLE provide the abstraction layer between a globally unique persistent identifier and physical location of data objects Follows policies to register data and make it long term referable and citable Assignment of prefix via one of the EUDAT partners Hosting of PIDs, i.e. operation and maintenance of Handle servers and technical services Benefits of the B2HANDLE service
  20. 20. Replication for reliability and safe-keeping of PIDs via the EPIC federation Resolution mechanism based on Handle Easy maintenance and programmatic resolving of PIDs by the B2HANDLE Python library for general interaction with Handle servers Benefits of the B2HANDLE service
  21. 21. B2HANDLE – The Python library b2handle: A Python library for interaction with the EUDAT B2HANDLE service setuptools-enabled Python package; easy to deploy Requires access to one of the EUDAT Handle server sites Technical documentation: http://eudat- b2safe.github.io/B2HANDLE
  22. 22. B2HANDLE – B2SAFE example Where: Offers integration into iRODS via a script. This comes out of the box with a dedicated script employing the B2HANDLE python library How: The script takes credentials as input Supplied on the command line (or) Stored in a configuration file (iRODS or local fs) What: The script supports the following actions Searching Resolving Creation of PIDs with metadata specific to B2SAFE Modification
  23. 23. Conclusions PIDs run through the EUDAT services B2HANDLE aids the creation and management of PIDs, through web and programmatic interfaces The B2SHARE, B2SAFE and B2STAGE services create PIDs for digital objects created within EUDAT. B2FIND lists PIDs together with the rest of the metadata it collects EUDAT data can me accessed through the use of PIDs. PID Training
  24. 24. Thanks
  25. 25. www.eudat.eu Authors Contributors This work is licensed under the Creative Commons CC-BY 4.0 licence EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No. 654065 Themis Zamani, GRNET Willem Elbers, CLARIN Christine Staiger, SURFsara Ellen Leenarts, DANS Kostas Kavoussanakis, EPCC Thank you

×