SlideShare a Scribd company logo
1 of 27
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No. 654065 www.eudat.eu
Introduction to metadata
Version 2
August 2016
This work is licensed under the Creative
Commons CC-BY 4.0 licence
What is metadata and why do we need it?
How to produce good quality metadata?
EUDAT and metadata
Overview
WHAT IS
METADATA?
Image CC-BY ‘Metadata is a love note to the future’ by
Cea+ www.flickr.com/photos/ centralasian/8071729256
Commonly defined as ‘data about data’, metadata helps to
make data findable and understandable
Metadata can be:
Descriptive: information about the content and context
of the data
Structural: information about the structure of the data
Administrative: information about the file type, rights
management and preservation processes
What is metadata?
Comprehensive metadata will:
Facilitate data discovery
Help users determine the applicability of the data
Enable interpretation and reuse
Allow any limitations to be understood
Clarify ownership and restrictions on reuse
Offer permanence as it transcends people and time
Provide interoperability
Why use metadata?
Metadata and documentation
Think about what will be needed in order to find, evaluate,
understand, and reuse the data.
Have you documented what you did and how?
Did you develop code to run analyses? If so, this should
be kept and shared too.
Is it clear what each bit of your dataset means? Make
sure the units are labelled and abbreviations explained.
Record all the information needed for you and others to
understand the data in the future
Information entropy
The Loss of Information about Data (Metadata) Over Time, Michener et al, 1997
Create metadata at the time of data creation
Information will be forgotten and there won’t be time or
effort left to capture it later.
Metadata benefits from quality control at an early stage
too.
Time matters!
Image CC-BY-SA ‘egg timer – hour glass running out’ by OpenDemocracy
www.flickr.com/photos/opendemocracy/523438942
GOOD QUALITY METADATA
Image CC-BY ‘Quality’ by Elizabeth Hahn www.flickr.com/photos/128185330@N03/17517769750
Use of standards
Controlled vocabularies for unambiguous keywords
Simple, complete and consistent information
Appropriate description
Explanation of limitations to support reuse
Avoid special characters e.g. !@<~ etc...
Provide persistent identifiers such as DOIs
What makes metadata good?
The good and the bad
Metres / seconds
2015-09-10T15:00:01+01:00
Longitudinal wind speed
PDF 1.7
2008 US Population statistics
Barcelona, Venezuela
Furlongs and fortnight
10th Sept. 2015 15:00:01
U
PDF
Population statistics
Barcelona
More precise and
standardised Ambiguous
Metadata standards
Metadata standards provide a structured way to describe
the data
Information is presented in a reliable and predictable
format which allows for computer interpretation
Use of standards enables data interoperability
Metadata Standards Directory
Catalogue initiated by the Digital Curation Centre (DCC)
now maintained as a community initiative via the
Research Data Alliance
www.dcc.ac.uk/resources/metadata-standards
There are a number of factors to consider:
Data type – look for standards to suit your data
Community norms – what is accepted and common
practice in your field?
Organisational policies – is one recommended?
Instruments being used – any automated metadata?
What resources are available? – there are tools to create
metadata in certain standards, more instructional
materials and support
How to choose a metadata standard?
How to write quality metadata
Organise your information and reuse where possible e.g.
project abstracts, lab notebooks, citations
Write your metadata using a metadata tool
Review for accuracy and completeness
Have someone else read your record
Revise based on comments from your reviewer
Review once more before you publish Draft
ReviewRevise
Review
Tips to follow when creating metadata
Do not use jargon
Define technical terms and acronyms:
– CA, LA, GPS, GIS : what do these mean?
Clearly state data limitations
– E.g. data set omissions, completeness of data
– Express considerations for appropriate re-use
Use “none” or “unknown” meaningfully
– None usually means that you knew about data and nothing
existed (e.g., a “0” cubic feet per second discharge value)
– Unknown means that you don’t know whether that data
existed or not (e.g., a null value)
Dataset titles
Titles are critical in helping readers find your data
– While individuals are searching for the most appropriate
data sets, they are most likely going to use the title as the
first criteria to determine if a dataset meets their needs.
– Treat the title as the opportunity to sell your dataset.
A complete title includes: What, Where, When, Who, and
Scale
An informative title includes: topic, timeliness of the data,
specific information about place and geography
Which is the better title?
Rivers
OR
Greater Yellowstone Rivers from 1:126,700 U.S. Forest
Service Visitor Maps (1961-1983)
Greater Yellowstone (where) Rivers (what) from 1:126,700
(scale) U.S. Forest Service (who) Visitor Maps (1961-
1983) (when)
Write for machines, not just humans
Remember: a computer will read your metadata
Do not use symbols that could be misinterpreted:
Examples: ! @ # % { } | /  < > ~
Don’t use tabs, indents, or line feeds/carriage returns
When copying and pasting from other sources, use a
text editor (e.g., Notepad) to eliminate hidden characters
Could someone use an automatic search to locate the
data?
Can others assess the usefulness of the data?
Could a novice understand it?
Is the metadata specific enough?
Is there enough information to re-use the data?
Is the information unambiguous – are all codes,
abbreviations and variables explained?
Remember to review your metadata!
EUDAT AND METADATA
Image CC-BY ‘University of Michigan Library Card Catalog’ by David Fulmer
www.flickr.com/photos/annarbor/4350629792
B2FIND is based on a comprehensive joint metadata
catalogue of research data collections stored in EUDAT
data centres and other repositories
It allows researchers or data users to find relevant data,
and supports communities and data providers to increase
visibility of their data
B2FIND provides a simple and user-friendly discovery
service on metadata steadily harvested from a wide
range of research communities
The B2FIND service
b2find.eudat.eu
The same term can be used by different disciplines
Species for chemists and zoologists
Andromeda for astronomers and historians
Some domain knowledge is therefore necessary
The EUDAT B2FIND service needs to suit a wide range of
different communities
The interdisciplinary problem
Metadata is harvested from different communities,
usually using the OAI-PMH protocol
The metadata (in a wide variety of standards) are
processed to map and transform them to the B2FIND
schema
How the B2FIND service works
INPUT
Metadata in community
standards e.g. DDI,
Dublin Core, CMDI, ISO
19115
OUTPUT
Homogenised metadata
in the B2FIND schema
Metadata records in B2FIND
http://b2find.eudat.eu/dataset/3a063891-6952-5bcf-a5ed-46f8a681c1c9
For more info: https://eudat.eu/services/b2find
User documentation: https://www.eudat.eu/services/userdoc/b2find-
integration
b2find.eudat.eu
www.eudat.eu
Authors Contributors
This work is licensed under the Creative Commons CC-BY 4.0 licence
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures.
Contract No. 654065
Sarah Jones, Digital Curation Centre
Shaun de Witt, STFC
Sara Garavelli, Trust-IT
Thank you
Content has also been repurposed from the DataONE Educational
modules, ‘Metadata’ and ‘How to Write Good Quality Metadata’ Retrieved
from https://www.dataone.org/education-modules

More Related Content

What's hot

Introduction to Metadata
Introduction to MetadataIntroduction to Metadata
Introduction to MetadataJenn Riley
 
Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Seerat Malik
 
Lec 1 indexing and hashing
Lec 1 indexing and hashing Lec 1 indexing and hashing
Lec 1 indexing and hashing Md. Mashiur Rahman
 
METS(Metadata Encoding and Transmission Standard )
METS(Metadata Encoding and Transmission Standard )METS(Metadata Encoding and Transmission Standard )
METS(Metadata Encoding and Transmission Standard )Manu K M
 
Information retrieval system
Information retrieval systemInformation retrieval system
Information retrieval systemLeslie Vargas
 
Database indexing techniques
Database indexing techniquesDatabase indexing techniques
Database indexing techniquesahmadmughal0312
 
Model of information retrieval (3)
Model  of information retrieval (3)Model  of information retrieval (3)
Model of information retrieval (3)9866825059
 
Dimensional Modeling
Dimensional ModelingDimensional Modeling
Dimensional ModelingSunita Sahu
 
Tdm information retrieval
Tdm information retrievalTdm information retrieval
Tdm information retrievalKU Leuven
 
Taxonomies & folksonomies
Taxonomies  & folksonomiesTaxonomies  & folksonomies
Taxonomies & folksonomiesAparna Sane
 
Data management principles
Data management principlesData management principles
Data management principlesFiddy Prasetiya
 
Controlled Vocabulary
Controlled VocabularyControlled Vocabulary
Controlled Vocabularyguest118a9a
 
Subject Indexing & Techniques
Subject Indexing  & TechniquesSubject Indexing  & Techniques
Subject Indexing & TechniquesDr. Utpal Das
 
Introduction to metadata management
Introduction to metadata managementIntroduction to metadata management
Introduction to metadata managementOpen Data Support
 
Spatial Data Models
Spatial Data Models Spatial Data Models
Spatial Data Models RajalakshmiS34
 

What's hot (20)

Introduction to Metadata
Introduction to MetadataIntroduction to Metadata
Introduction to Metadata
 
Knowledge organization system
Knowledge organization systemKnowledge organization system
Knowledge organization system
 
Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Data Mining: What is Data Mining?
Data Mining: What is Data Mining?
 
Lec 1 indexing and hashing
Lec 1 indexing and hashing Lec 1 indexing and hashing
Lec 1 indexing and hashing
 
Temporal databases
Temporal databasesTemporal databases
Temporal databases
 
METS(Metadata Encoding and Transmission Standard )
METS(Metadata Encoding and Transmission Standard )METS(Metadata Encoding and Transmission Standard )
METS(Metadata Encoding and Transmission Standard )
 
Information retrieval system
Information retrieval systemInformation retrieval system
Information retrieval system
 
Database indexing techniques
Database indexing techniquesDatabase indexing techniques
Database indexing techniques
 
Model of information retrieval (3)
Model  of information retrieval (3)Model  of information retrieval (3)
Model of information retrieval (3)
 
Dimensional Modeling
Dimensional ModelingDimensional Modeling
Dimensional Modeling
 
Tdm information retrieval
Tdm information retrievalTdm information retrieval
Tdm information retrieval
 
Taxonomies & folksonomies
Taxonomies  & folksonomiesTaxonomies  & folksonomies
Taxonomies & folksonomies
 
Data management principles
Data management principlesData management principles
Data management principles
 
Ddc
Ddc Ddc
Ddc
 
Controlled Vocabulary
Controlled VocabularyControlled Vocabulary
Controlled Vocabulary
 
Subject Indexing & Techniques
Subject Indexing  & TechniquesSubject Indexing  & Techniques
Subject Indexing & Techniques
 
Information Retrieval
Information RetrievalInformation Retrieval
Information Retrieval
 
Introduction to metadata management
Introduction to metadata managementIntroduction to metadata management
Introduction to metadata management
 
Metadata crosswalks
Metadata crosswalksMetadata crosswalks
Metadata crosswalks
 
Spatial Data Models
Spatial Data Models Spatial Data Models
Spatial Data Models
 

Viewers also liked

Geonode 2.0
Geonode 2.0Geonode 2.0
Geonode 2.0Paolo Corti
 
Status of WorldMap, 2016
Status of WorldMap, 2016Status of WorldMap, 2016
Status of WorldMap, 2016Paolo Corti
 
SEO bij Marketingfacts - 16 september 2014 Marketingfacts Updates
SEO bij Marketingfacts - 16 september 2014 Marketingfacts UpdatesSEO bij Marketingfacts - 16 september 2014 Marketingfacts Updates
SEO bij Marketingfacts - 16 september 2014 Marketingfacts UpdatesDanny Oosterveer
 
Metadata gebruiken, wat komt er bij kijken
Metadata gebruiken, wat komt er bij kijkenMetadata gebruiken, wat komt er bij kijken
Metadata gebruiken, wat komt er bij kijkenovonder
 
Dublin Core Wereldwijd; Interoperabiliteit als een visie
Dublin Core Wereldwijd; Interoperabiliteit als een visieDublin Core Wereldwijd; Interoperabiliteit als een visie
Dublin Core Wereldwijd; Interoperabiliteit als een visieplatform meta-informatie
 
Spatial Data Infrastructure Best Practices with GeoNode
Spatial Data Infrastructure Best Practices with GeoNodeSpatial Data Infrastructure Best Practices with GeoNode
Spatial Data Infrastructure Best Practices with GeoNodeSebastian Benthall
 
metadata & open source #osgeonl dag 2012
metadata & open source #osgeonl dag 2012 metadata & open source #osgeonl dag 2012
metadata & open source #osgeonl dag 2012 pvangenuchten
 
Optimaal inzetten van touchtables in ruimtelijke planvorming, Geodan
Optimaal inzetten van touchtables in ruimtelijke planvorming, GeodanOptimaal inzetten van touchtables in ruimtelijke planvorming, Geodan
Optimaal inzetten van touchtables in ruimtelijke planvorming, GeodanEsriGISConferentie
 
Het gemak van een Geoportaal, Esri Nederland
Het gemak van een Geoportaal, Esri NederlandHet gemak van een Geoportaal, Esri Nederland
Het gemak van een Geoportaal, Esri NederlandEsriGISConferentie
 
Geonode Presentation (ppt)
Geonode Presentation (ppt)Geonode Presentation (ppt)
Geonode Presentation (ppt)Iwl Pcu
 
Metadata & Google: a love story
Metadata & Google: a love storyMetadata & Google: a love story
Metadata & Google: a love storyArne van Elk
 
Micro services and Containers
Micro services and ContainersMicro services and Containers
Micro services and ContainersRichard Harvey
 
Metadata an overview
Metadata an overviewMetadata an overview
Metadata an overviewrobin fay
 

Viewers also liked (15)

Geonode 2.0
Geonode 2.0Geonode 2.0
Geonode 2.0
 
Meta made in Gelderland
Meta made in GelderlandMeta made in Gelderland
Meta made in Gelderland
 
2 Ine De Visser Geonovum
2 Ine De Visser Geonovum2 Ine De Visser Geonovum
2 Ine De Visser Geonovum
 
Status of WorldMap, 2016
Status of WorldMap, 2016Status of WorldMap, 2016
Status of WorldMap, 2016
 
SEO bij Marketingfacts - 16 september 2014 Marketingfacts Updates
SEO bij Marketingfacts - 16 september 2014 Marketingfacts UpdatesSEO bij Marketingfacts - 16 september 2014 Marketingfacts Updates
SEO bij Marketingfacts - 16 september 2014 Marketingfacts Updates
 
Metadata gebruiken, wat komt er bij kijken
Metadata gebruiken, wat komt er bij kijkenMetadata gebruiken, wat komt er bij kijken
Metadata gebruiken, wat komt er bij kijken
 
Dublin Core Wereldwijd; Interoperabiliteit als een visie
Dublin Core Wereldwijd; Interoperabiliteit als een visieDublin Core Wereldwijd; Interoperabiliteit als een visie
Dublin Core Wereldwijd; Interoperabiliteit als een visie
 
Spatial Data Infrastructure Best Practices with GeoNode
Spatial Data Infrastructure Best Practices with GeoNodeSpatial Data Infrastructure Best Practices with GeoNode
Spatial Data Infrastructure Best Practices with GeoNode
 
metadata & open source #osgeonl dag 2012
metadata & open source #osgeonl dag 2012 metadata & open source #osgeonl dag 2012
metadata & open source #osgeonl dag 2012
 
Optimaal inzetten van touchtables in ruimtelijke planvorming, Geodan
Optimaal inzetten van touchtables in ruimtelijke planvorming, GeodanOptimaal inzetten van touchtables in ruimtelijke planvorming, Geodan
Optimaal inzetten van touchtables in ruimtelijke planvorming, Geodan
 
Het gemak van een Geoportaal, Esri Nederland
Het gemak van een Geoportaal, Esri NederlandHet gemak van een Geoportaal, Esri Nederland
Het gemak van een Geoportaal, Esri Nederland
 
Geonode Presentation (ppt)
Geonode Presentation (ppt)Geonode Presentation (ppt)
Geonode Presentation (ppt)
 
Metadata & Google: a love story
Metadata & Google: a love storyMetadata & Google: a love story
Metadata & Google: a love story
 
Micro services and Containers
Micro services and ContainersMicro services and Containers
Micro services and Containers
 
Metadata an overview
Metadata an overviewMetadata an overview
Metadata an overview
 

Similar to Introduction to Metadata

GBIF and reuse of research data, Bergen (2016-12-14)
GBIF and reuse of research data, Bergen (2016-12-14)GBIF and reuse of research data, Bergen (2016-12-14)
GBIF and reuse of research data, Bergen (2016-12-14)Dag Endresen
 
Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)
Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)
Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)EUDAT
 
L07 metadata
L07 metadataL07 metadata
L07 metadatathplayer127
 
EUDAT Research Data Management | www.eudat.eu |
EUDAT Research Data Management | www.eudat.eu | EUDAT Research Data Management | www.eudat.eu |
EUDAT Research Data Management | www.eudat.eu | EUDAT
 
Geospatial metadata and spatial data workshop: 19 June 2014
Geospatial metadata and spatial data workshop: 19 June 2014Geospatial metadata and spatial data workshop: 19 June 2014
Geospatial metadata and spatial data workshop: 19 June 2014EDINA, University of Edinburgh
 
What is a DMP
What is a DMPWhat is a DMP
What is a DMPSarah Jones
 
U - 2 Emerging.pptx
U - 2 Emerging.pptxU - 2 Emerging.pptx
U - 2 Emerging.pptxMulukenTamrat2
 
Dats nih-dccpc-kc7-april2018-prs-uoxf
Dats  nih-dccpc-kc7-april2018-prs-uoxfDats  nih-dccpc-kc7-april2018-prs-uoxf
Dats nih-dccpc-kc7-april2018-prs-uoxfPhilippe Rocca-Serra
 
Gettingstartedwithdigitalcollectionsweb[1]
Gettingstartedwithdigitalcollectionsweb[1]Gettingstartedwithdigitalcollectionsweb[1]
Gettingstartedwithdigitalcollectionsweb[1]guest410707c
 
Reference Model for an Open Archival Information Systems (OAIS): Overview and...
Reference Model for an Open Archival Information Systems (OAIS): Overview and...Reference Model for an Open Archival Information Systems (OAIS): Overview and...
Reference Model for an Open Archival Information Systems (OAIS): Overview and...faflrt
 
Dissemination Documentation
Dissemination DocumentationDissemination Documentation
Dissemination Documentationannegrete
 
Urm concept for sharing information inside of communities
Urm concept for sharing information inside of communitiesUrm concept for sharing information inside of communities
Urm concept for sharing information inside of communitiesKarel Charvat
 
A Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical DataA Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical DataStuart Chalk
 
The Data Lifecycle - EUDAT Summer School (Yann Le Franc)
The Data Lifecycle - EUDAT Summer School (Yann Le Franc)The Data Lifecycle - EUDAT Summer School (Yann Le Franc)
The Data Lifecycle - EUDAT Summer School (Yann Le Franc)EUDAT
 
FAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceCarole Goble
 
The Big Metadata
The Big MetadataThe Big Metadata
The Big MetadataDaniela Tomova
 
Modeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROVModeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROVEUDAT
 
Data Management Planning - 02/21/13
Data Management Planning - 02/21/13Data Management Planning - 02/21/13
Data Management Planning - 02/21/13Lizzy_Rolando
 
DataONE Education Module 07: Metadata
DataONE Education Module 07: MetadataDataONE Education Module 07: Metadata
DataONE Education Module 07: MetadataDataONE
 

Similar to Introduction to Metadata (20)

GBIF and reuse of research data, Bergen (2016-12-14)
GBIF and reuse of research data, Bergen (2016-12-14)GBIF and reuse of research data, Bergen (2016-12-14)
GBIF and reuse of research data, Bergen (2016-12-14)
 
Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)
Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)
Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)
 
L07 metadata
L07 metadataL07 metadata
L07 metadata
 
EUDAT Research Data Management | www.eudat.eu |
EUDAT Research Data Management | www.eudat.eu | EUDAT Research Data Management | www.eudat.eu |
EUDAT Research Data Management | www.eudat.eu |
 
Geospatial metadata and spatial data workshop: 19 June 2014
Geospatial metadata and spatial data workshop: 19 June 2014Geospatial metadata and spatial data workshop: 19 June 2014
Geospatial metadata and spatial data workshop: 19 June 2014
 
What is a DMP
What is a DMPWhat is a DMP
What is a DMP
 
U - 2 Emerging.pptx
U - 2 Emerging.pptxU - 2 Emerging.pptx
U - 2 Emerging.pptx
 
Dats nih-dccpc-kc7-april2018-prs-uoxf
Dats  nih-dccpc-kc7-april2018-prs-uoxfDats  nih-dccpc-kc7-april2018-prs-uoxf
Dats nih-dccpc-kc7-april2018-prs-uoxf
 
Full Erdmann Ruttenberg Community Approaches to Open Data at Scale
Full Erdmann Ruttenberg Community Approaches to Open Data at ScaleFull Erdmann Ruttenberg Community Approaches to Open Data at Scale
Full Erdmann Ruttenberg Community Approaches to Open Data at Scale
 
Gettingstartedwithdigitalcollectionsweb[1]
Gettingstartedwithdigitalcollectionsweb[1]Gettingstartedwithdigitalcollectionsweb[1]
Gettingstartedwithdigitalcollectionsweb[1]
 
Reference Model for an Open Archival Information Systems (OAIS): Overview and...
Reference Model for an Open Archival Information Systems (OAIS): Overview and...Reference Model for an Open Archival Information Systems (OAIS): Overview and...
Reference Model for an Open Archival Information Systems (OAIS): Overview and...
 
Dissemination Documentation
Dissemination DocumentationDissemination Documentation
Dissemination Documentation
 
Urm concept for sharing information inside of communities
Urm concept for sharing information inside of communitiesUrm concept for sharing information inside of communities
Urm concept for sharing information inside of communities
 
A Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical DataA Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical Data
 
The Data Lifecycle - EUDAT Summer School (Yann Le Franc)
The Data Lifecycle - EUDAT Summer School (Yann Le Franc)The Data Lifecycle - EUDAT Summer School (Yann Le Franc)
The Data Lifecycle - EUDAT Summer School (Yann Le Franc)
 
FAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practice
 
The Big Metadata
The Big MetadataThe Big Metadata
The Big Metadata
 
Modeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROVModeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROV
 
Data Management Planning - 02/21/13
Data Management Planning - 02/21/13Data Management Planning - 02/21/13
Data Management Planning - 02/21/13
 
DataONE Education Module 07: Metadata
DataONE Education Module 07: MetadataDataONE Education Module 07: Metadata
DataONE Education Module 07: Metadata
 

More from EUDAT

EUDAT_Brochure_Generica_Jan_UPDATED(5).pdf
EUDAT_Brochure_Generica_Jan_UPDATED(5).pdfEUDAT_Brochure_Generica_Jan_UPDATED(5).pdf
EUDAT_Brochure_Generica_Jan_UPDATED(5).pdfEUDAT
 
EUDAT Booklet Mar22 (2).pdf
EUDAT Booklet Mar22 (2).pdfEUDAT Booklet Mar22 (2).pdf
EUDAT Booklet Mar22 (2).pdfEUDAT
 
EUDAT_Brochure_Generica_Jan_UPDATED (1).pdf
EUDAT_Brochure_Generica_Jan_UPDATED (1).pdfEUDAT_Brochure_Generica_Jan_UPDATED (1).pdf
EUDAT_Brochure_Generica_Jan_UPDATED (1).pdfEUDAT
 
EUDAT Brochure - B2HANDLE.pdf
EUDAT Brochure - B2HANDLE.pdfEUDAT Brochure - B2HANDLE.pdf
EUDAT Brochure - B2HANDLE.pdfEUDAT
 
EUDAT Brochure - B2DROP.pdf
EUDAT Brochure - B2DROP.pdfEUDAT Brochure - B2DROP.pdf
EUDAT Brochure - B2DROP.pdfEUDAT
 
EUDAT Brochure - B2SHARE.pdf
EUDAT Brochure - B2SHARE.pdfEUDAT Brochure - B2SHARE.pdf
EUDAT Brochure - B2SHARE.pdfEUDAT
 
EUDAT Brochure - B2SAFE.pdf
EUDAT Brochure - B2SAFE.pdfEUDAT Brochure - B2SAFE.pdf
EUDAT Brochure - B2SAFE.pdfEUDAT
 
EUDAT Brochure - B2FIND(1).pdf
EUDAT Brochure - B2FIND(1).pdfEUDAT Brochure - B2FIND(1).pdf
EUDAT Brochure - B2FIND(1).pdfEUDAT
 
EUDAT Brochure - B2ACCESS.pdf
EUDAT Brochure - B2ACCESS.pdfEUDAT Brochure - B2ACCESS.pdf
EUDAT Brochure - B2ACCESS.pdfEUDAT
 
Rob Carrillo - Writing effective service documentation for EUDAT services
Rob Carrillo - Writing effective service documentation for EUDAT servicesRob Carrillo - Writing effective service documentation for EUDAT services
Rob Carrillo - Writing effective service documentation for EUDAT servicesEUDAT
 
Ariyo - EUDAT CDI B2 services documentation
Ariyo - EUDAT CDI B2 services documentationAriyo - EUDAT CDI B2 services documentation
Ariyo - EUDAT CDI B2 services documentationEUDAT
 
Introduction to eudat and its services
Introduction to eudat and its servicesIntroduction to eudat and its services
Introduction to eudat and its servicesEUDAT
 
Using B2NOTE: The U.Porto Pilot
Using B2NOTE: The U.Porto PilotUsing B2NOTE: The U.Porto Pilot
Using B2NOTE: The U.Porto PilotEUDAT
 
OpenAIRE Advance - Kick off last week
OpenAIRE Advance - Kick off last weekOpenAIRE Advance - Kick off last week
OpenAIRE Advance - Kick off last weekEUDAT
 
European Open Science Cloud - Skills workshop
European Open Science Cloud - Skills workshopEuropean Open Science Cloud - Skills workshop
European Open Science Cloud - Skills workshopEUDAT
 
Linking service capabilities to data stweardship competences for professional...
Linking service capabilities to data stweardship competences for professional...Linking service capabilities to data stweardship competences for professional...
Linking service capabilities to data stweardship competences for professional...EUDAT
 
FAIRness of training materials
FAIRness of training materialsFAIRness of training materials
FAIRness of training materialsEUDAT
 
Training by EOSC-hub - Integrating and Managing services for the European Ope...
Training by EOSC-hub - Integrating and Managing services for the European Ope...Training by EOSC-hub - Integrating and Managing services for the European Ope...
Training by EOSC-hub - Integrating and Managing services for the European Ope...EUDAT
 
Draft Governance Framework for the EOSC
Draft Governance Framework for the EOSCDraft Governance Framework for the EOSC
Draft Governance Framework for the EOSCEUDAT
 
Building Interoperable AAI for Researchers
Building Interoperable AAI for ResearchersBuilding Interoperable AAI for Researchers
Building Interoperable AAI for ResearchersEUDAT
 

More from EUDAT (20)

EUDAT_Brochure_Generica_Jan_UPDATED(5).pdf
EUDAT_Brochure_Generica_Jan_UPDATED(5).pdfEUDAT_Brochure_Generica_Jan_UPDATED(5).pdf
EUDAT_Brochure_Generica_Jan_UPDATED(5).pdf
 
EUDAT Booklet Mar22 (2).pdf
EUDAT Booklet Mar22 (2).pdfEUDAT Booklet Mar22 (2).pdf
EUDAT Booklet Mar22 (2).pdf
 
EUDAT_Brochure_Generica_Jan_UPDATED (1).pdf
EUDAT_Brochure_Generica_Jan_UPDATED (1).pdfEUDAT_Brochure_Generica_Jan_UPDATED (1).pdf
EUDAT_Brochure_Generica_Jan_UPDATED (1).pdf
 
EUDAT Brochure - B2HANDLE.pdf
EUDAT Brochure - B2HANDLE.pdfEUDAT Brochure - B2HANDLE.pdf
EUDAT Brochure - B2HANDLE.pdf
 
EUDAT Brochure - B2DROP.pdf
EUDAT Brochure - B2DROP.pdfEUDAT Brochure - B2DROP.pdf
EUDAT Brochure - B2DROP.pdf
 
EUDAT Brochure - B2SHARE.pdf
EUDAT Brochure - B2SHARE.pdfEUDAT Brochure - B2SHARE.pdf
EUDAT Brochure - B2SHARE.pdf
 
EUDAT Brochure - B2SAFE.pdf
EUDAT Brochure - B2SAFE.pdfEUDAT Brochure - B2SAFE.pdf
EUDAT Brochure - B2SAFE.pdf
 
EUDAT Brochure - B2FIND(1).pdf
EUDAT Brochure - B2FIND(1).pdfEUDAT Brochure - B2FIND(1).pdf
EUDAT Brochure - B2FIND(1).pdf
 
EUDAT Brochure - B2ACCESS.pdf
EUDAT Brochure - B2ACCESS.pdfEUDAT Brochure - B2ACCESS.pdf
EUDAT Brochure - B2ACCESS.pdf
 
Rob Carrillo - Writing effective service documentation for EUDAT services
Rob Carrillo - Writing effective service documentation for EUDAT servicesRob Carrillo - Writing effective service documentation for EUDAT services
Rob Carrillo - Writing effective service documentation for EUDAT services
 
Ariyo - EUDAT CDI B2 services documentation
Ariyo - EUDAT CDI B2 services documentationAriyo - EUDAT CDI B2 services documentation
Ariyo - EUDAT CDI B2 services documentation
 
Introduction to eudat and its services
Introduction to eudat and its servicesIntroduction to eudat and its services
Introduction to eudat and its services
 
Using B2NOTE: The U.Porto Pilot
Using B2NOTE: The U.Porto PilotUsing B2NOTE: The U.Porto Pilot
Using B2NOTE: The U.Porto Pilot
 
OpenAIRE Advance - Kick off last week
OpenAIRE Advance - Kick off last weekOpenAIRE Advance - Kick off last week
OpenAIRE Advance - Kick off last week
 
European Open Science Cloud - Skills workshop
European Open Science Cloud - Skills workshopEuropean Open Science Cloud - Skills workshop
European Open Science Cloud - Skills workshop
 
Linking service capabilities to data stweardship competences for professional...
Linking service capabilities to data stweardship competences for professional...Linking service capabilities to data stweardship competences for professional...
Linking service capabilities to data stweardship competences for professional...
 
FAIRness of training materials
FAIRness of training materialsFAIRness of training materials
FAIRness of training materials
 
Training by EOSC-hub - Integrating and Managing services for the European Ope...
Training by EOSC-hub - Integrating and Managing services for the European Ope...Training by EOSC-hub - Integrating and Managing services for the European Ope...
Training by EOSC-hub - Integrating and Managing services for the European Ope...
 
Draft Governance Framework for the EOSC
Draft Governance Framework for the EOSCDraft Governance Framework for the EOSC
Draft Governance Framework for the EOSC
 
Building Interoperable AAI for Researchers
Building Interoperable AAI for ResearchersBuilding Interoperable AAI for Researchers
Building Interoperable AAI for Researchers
 

Recently uploaded

TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...ssifa0344
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsSumit Kumar yadav
 
❀Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💩✅.
❀Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💩✅.❀Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💩✅.
❀Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💩✅.Nitya salvi
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencySheetal Arora
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSSLeenakshiTyagi
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptxRajatChauhan518211
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPirithiRaju
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINsankalpkumarsahoo174
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bSĂ©rgio Sacani
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSĂ©rgio Sacani
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSĂ©rgio Sacani
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...SĂ©rgio Sacani
 

Recently uploaded (20)

TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questions
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
❀Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💩✅.
❀Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💩✅.❀Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💩✅.
❀Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💩✅.
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSS
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 

Introduction to Metadata

  • 1. EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No. 654065 www.eudat.eu Introduction to metadata Version 2 August 2016 This work is licensed under the Creative Commons CC-BY 4.0 licence
  • 2. What is metadata and why do we need it? How to produce good quality metadata? EUDAT and metadata Overview
  • 3. WHAT IS METADATA? Image CC-BY ‘Metadata is a love note to the future’ by Cea+ www.flickr.com/photos/ centralasian/8071729256
  • 4. Commonly defined as ‘data about data’, metadata helps to make data findable and understandable Metadata can be: Descriptive: information about the content and context of the data Structural: information about the structure of the data Administrative: information about the file type, rights management and preservation processes What is metadata?
  • 5. Comprehensive metadata will: Facilitate data discovery Help users determine the applicability of the data Enable interpretation and reuse Allow any limitations to be understood Clarify ownership and restrictions on reuse Offer permanence as it transcends people and time Provide interoperability Why use metadata?
  • 6. Metadata and documentation Think about what will be needed in order to find, evaluate, understand, and reuse the data. Have you documented what you did and how? Did you develop code to run analyses? If so, this should be kept and shared too. Is it clear what each bit of your dataset means? Make sure the units are labelled and abbreviations explained. Record all the information needed for you and others to understand the data in the future
  • 7. Information entropy The Loss of Information about Data (Metadata) Over Time, Michener et al, 1997
  • 8. Create metadata at the time of data creation Information will be forgotten and there won’t be time or effort left to capture it later. Metadata benefits from quality control at an early stage too. Time matters! Image CC-BY-SA ‘egg timer – hour glass running out’ by OpenDemocracy www.flickr.com/photos/opendemocracy/523438942
  • 9. GOOD QUALITY METADATA Image CC-BY ‘Quality’ by Elizabeth Hahn www.flickr.com/photos/128185330@N03/17517769750
  • 10. Use of standards Controlled vocabularies for unambiguous keywords Simple, complete and consistent information Appropriate description Explanation of limitations to support reuse Avoid special characters e.g. !@<~ etc... Provide persistent identifiers such as DOIs What makes metadata good?
  • 11. The good and the bad Metres / seconds 2015-09-10T15:00:01+01:00 Longitudinal wind speed PDF 1.7 2008 US Population statistics Barcelona, Venezuela Furlongs and fortnight 10th Sept. 2015 15:00:01 U PDF Population statistics Barcelona More precise and standardised Ambiguous
  • 12. Metadata standards Metadata standards provide a structured way to describe the data Information is presented in a reliable and predictable format which allows for computer interpretation Use of standards enables data interoperability
  • 13. Metadata Standards Directory Catalogue initiated by the Digital Curation Centre (DCC) now maintained as a community initiative via the Research Data Alliance www.dcc.ac.uk/resources/metadata-standards
  • 14. There are a number of factors to consider: Data type – look for standards to suit your data Community norms – what is accepted and common practice in your field? Organisational policies – is one recommended? Instruments being used – any automated metadata? What resources are available? – there are tools to create metadata in certain standards, more instructional materials and support How to choose a metadata standard?
  • 15. How to write quality metadata Organise your information and reuse where possible e.g. project abstracts, lab notebooks, citations Write your metadata using a metadata tool Review for accuracy and completeness Have someone else read your record Revise based on comments from your reviewer Review once more before you publish Draft ReviewRevise Review
  • 16. Tips to follow when creating metadata Do not use jargon Define technical terms and acronyms: – CA, LA, GPS, GIS : what do these mean? Clearly state data limitations – E.g. data set omissions, completeness of data – Express considerations for appropriate re-use Use “none” or “unknown” meaningfully – None usually means that you knew about data and nothing existed (e.g., a “0” cubic feet per second discharge value) – Unknown means that you don’t know whether that data existed or not (e.g., a null value)
  • 17. Dataset titles Titles are critical in helping readers find your data – While individuals are searching for the most appropriate data sets, they are most likely going to use the title as the first criteria to determine if a dataset meets their needs. – Treat the title as the opportunity to sell your dataset. A complete title includes: What, Where, When, Who, and Scale An informative title includes: topic, timeliness of the data, specific information about place and geography
  • 18. Which is the better title? Rivers OR Greater Yellowstone Rivers from 1:126,700 U.S. Forest Service Visitor Maps (1961-1983) Greater Yellowstone (where) Rivers (what) from 1:126,700 (scale) U.S. Forest Service (who) Visitor Maps (1961- 1983) (when)
  • 19. Write for machines, not just humans Remember: a computer will read your metadata Do not use symbols that could be misinterpreted: Examples: ! @ # % { } | / < > ~ Don’t use tabs, indents, or line feeds/carriage returns When copying and pasting from other sources, use a text editor (e.g., Notepad) to eliminate hidden characters
  • 20. Could someone use an automatic search to locate the data? Can others assess the usefulness of the data? Could a novice understand it? Is the metadata specific enough? Is there enough information to re-use the data? Is the information unambiguous – are all codes, abbreviations and variables explained? Remember to review your metadata!
  • 21. EUDAT AND METADATA Image CC-BY ‘University of Michigan Library Card Catalog’ by David Fulmer www.flickr.com/photos/annarbor/4350629792
  • 22. B2FIND is based on a comprehensive joint metadata catalogue of research data collections stored in EUDAT data centres and other repositories It allows researchers or data users to find relevant data, and supports communities and data providers to increase visibility of their data B2FIND provides a simple and user-friendly discovery service on metadata steadily harvested from a wide range of research communities The B2FIND service b2find.eudat.eu
  • 23. The same term can be used by different disciplines Species for chemists and zoologists Andromeda for astronomers and historians Some domain knowledge is therefore necessary The EUDAT B2FIND service needs to suit a wide range of different communities The interdisciplinary problem
  • 24. Metadata is harvested from different communities, usually using the OAI-PMH protocol The metadata (in a wide variety of standards) are processed to map and transform them to the B2FIND schema How the B2FIND service works INPUT Metadata in community standards e.g. DDI, Dublin Core, CMDI, ISO 19115 OUTPUT Homogenised metadata in the B2FIND schema
  • 25. Metadata records in B2FIND http://b2find.eudat.eu/dataset/3a063891-6952-5bcf-a5ed-46f8a681c1c9
  • 26. For more info: https://eudat.eu/services/b2find User documentation: https://www.eudat.eu/services/userdoc/b2find- integration b2find.eudat.eu
  • 27. www.eudat.eu Authors Contributors This work is licensed under the Creative Commons CC-BY 4.0 licence EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No. 654065 Sarah Jones, Digital Curation Centre Shaun de Witt, STFC Sara Garavelli, Trust-IT Thank you Content has also been repurposed from the DataONE Educational modules, ‘Metadata’ and ‘How to Write Good Quality Metadata’ Retrieved from https://www.dataone.org/education-modules

Editor's Notes

  1. This presentation will give an introduction to the concept of metadata, why it is important and how to address this in research projects.
  2. There are three main topics that we will discuss: What is metadata and why is it important. Here we will think about the benefits that metadata can bring to you and others. Secondly, we will think about how to produce good quality metadata and offer some advice and tips To close, we’ll explain how EUDAT uses metadata in the B2FIND service
  3. So let’s begin by thinking about what metadata is. The quote in the image here that ‘metadata is a love note to the future’ really gets at the meaning. Metadata is critical to ensure data can be found, understood and reused. If you don’t create metadata, it’s unlikely you will still understand your data in a few years time. The act of creating metadata opens up possibilities for future use.
  4. Metadata is commonly defined as ‘data about data’. By creating metadata, you will ensure that others can find and understand your data. There are different types of metadata: Descriptive metadata includes things like the title, author, date, location, coverage and subjects. It’s all the basic information people would need to find the data and understand the main content and context. Structural metadata explains how data interrelate. For example, if a book has been digitised, you want to understand which set of images form each chapter. Administrative metadata may include information added by others e.g. preservation metadata added by a repository to note what processes have been performed on the data
  5. There are lots of reasons to create metadata. It: Facilitates data discovery so others can find your data and your research gains more recognition and impact Good metadata helps potential users determine whether the data meet their needs, and enables them to interpret and reuse the data Metadata should outline any limitations and clarify data ownership and restrictions on reuse. This ensures others use the data appropriately Without metadata, your data will become meaningless over time as others can’t understand and reuse them. Providing associated metadata will give your data permanence and ensure they live on. By creating high quality metadata and using standards, you can also make your data interoperable
  6. When you create metadata, it is useful to think broadly. You may come across the concept of ‘documentation’ to explain all the details you need to capture and share too. You should aim to provide all the information a third-party would need to understand and reuse your data. This may include a description of what you did, your workflows, any code created and data dictionaries or clarification of all terms and abbreviations.
  7. This diagram from Bill Michener from DataOne in the United States shows how much information is lost over time. Metadata is a way to formalise this knowledge so your data retain meaning.
  8. Time really matters when creating metadata – you should create metadata at the time of data creation as information is forgotten quickly. This also gives you an opportunity to do quality control early on.
  9. We have explained why metadata is so important. Let’s now think about how to create good quality metadata.
  10. There are lots of things you can do to improve the quality of your metadata: Primary among these is to use standards. There are lots out there so look for something relevant to your data type and discipline. Metadata standards don’t always prescribe how the information should be completed. For this you want to use controlled vocabularies or thesauri for keywords, and recognised ISO standards for common elements like languages or dates Be consistent in the information provided and ensure the description is appropriate – enough information to avoid being ambiguous but also simple and concise Any limitations with the dataset should be explained to ensure others reuse it appropriately and don’t make false assumptions You should avoid using special characters, particularly in file names or column headers in spreadsheets as some software may interpret these symbols as an operator Also provide persistent identifiers so others can reliably link to and locate your data. This helps with citation and tracking impact too.
  11. Let’s look at some good and bad examples. You can see what we’re looking for in terms of metadata is more precise and standardised entries rather than information that could be ambiguous. Metres and seconds are universally accepted units of measurement as opposed to furlongs or fortnights For clarity, provide dates and times in the ISO standard, specifying the timezone In the third example we can see a properly described variable as opposed to an abbreviation which others may not understand When stating file formats, it is always useful to specify the version too The final two examples show the need to be specific so others can understand the coverage properly
  12. It is highly recommended to use metadata standards. It enables interoperability and ensures the information is presented in a predictable way to allow it to be processed by computers.
  13. There are lots of standards that can be used and you can search for them by discipline. The DCC started a catalogue of disciplinary metadata standards which is now being taken forward as an international initiative via an RDA working group
  14. When choosing a metadata standard, you should consider: Your data type What is accepted practice for your field Whether your organisation or the tools and instruments being used suggest using one format over another (for example if one is recommended or if some metadata is created automatically in a given standard) Also think about what resources you have available. Some standards have associated tools and more comprehensive instructional materials, so they may be preferred.
  15. When writing metadata, reuse information where possible rather than starting from scratch. For example, you may be able to use a project abstract written for your proposal, information from lab notebooks or citations for data you are reusing. Where possible, write your metadata using a tool to make the processes easier and more consistent. Think about metadata creation as an iterative process. It’s best to ask somebody else to read your record to make sure it makes sense to others and then revise and review it again before publishing.
  16. Some general tips to follow include: Avoid using jargon Define any abbreviations, acronyms or technical terms Clearly state limitations and express considerations for appropriate reuse Use terms like ‘none’ or ‘unknown’ properly
  17. Be comprehensive when writing your dataset title as this is how others will determine whether to look into your data further. A complete title should explain what the data relates to, a location, time period, subject and scale.
  18. This example illustrates the importance of descriptive titles in metadata records. The second title gives enough detail for a reader to discern whether they might like more information about your data.
  19. When you are writing your metadata, remember that it will be read by machines as well as people. You should avoid using symbols that could be misinterpreted, and tabs/indents/breaks that may be stripped out. Using a text editor for copying data will ensure hidden characters and formatting are removed.
  20. The final point to reiterate is the need to review your metadata. It’s always useful to get a second opinion to make sure others can understand it and feel it’s clear and specific enough.
  21. To close we want to explain how EUDAT is approaching metadata and how services like B2FIND can help you
  22. B2FIND provides a simple and user friendly service that allows the users to discover a wide range of metadata from a variety of research communities. It is based upon a comprehensive metadata catalogue of data collections stored in the EUDAT data centres and harvested from other data repositories. The B2FIND service helps researchers to find relevant data to reuse, and helps data providers to increase the visibility of their data.
  23. Since EUDAT is a pan-European infrastructure supporting a wide range of disciplines, we have to think about how terms are used differently by different communities. Chemical species for examples are atoms, molecules, ions etc, whereas for zoologists, ‘species’ denotes different families of animals
  24. The B2FIND service works by harvesting metadata from different communities. This is done on a regular and incremental basis, usually using the OAI-PMH protocol. The metadata is provided in a range of community standards. It is then processed to transform it to the generic B2FIND schema to allow cross-search.
  25. This is what a record looks like in the B2FIND catalogue. There’s a basic description, a number of keyword tags and some additional information to note the source, creator, language etc
  26. To find out more about B2FIND or use the service, please follow the links provided.