SlideShare a Scribd company logo
1 of 46
www.geant.org
1 |
Click to edit Master title style
• Click to edit Master text styles
• Second level
• Third level
• Fourth level
• Fifth level
01/07/2021 1
Data Management Planning
for researchers
www.geant.org
Sarah Jones
EOSC Engagement Manager
sarah.jones@geant.org
Twitter: @sarahroams
Indonesian RDM webinar series
Friday 2nd July 2021
What is a DMP?
Image CC-BY-NC-SA by Leo Reynolds www.flickr.com/photos/lwr/13442910354
All manner of things that you produce in
the course of your research
What is research data?
“the active management and
appraisal of data over the lifecycle
of scholarly and scientific interest”
Data management is part of
good research practice
What is research data management?
Create
Document
Use
Store
Share
Preserve
A short plan that outlines:
• what data will be created and how
• how it will be managed (storage, back-up, access…)
• plans for data sharing and preservation
DMPs are often submitted as part of grant applications,
but are useful whenever researchers are creating data
What is a DMP?
1. Description of data to be collected / created
(i.e. content, type, format, volume...)
2. Standards / methodologies for data collection & management
3. Ethics and Intellectual Property
(highlight any restrictions on data sharing e.g. embargoes, confidentiality)
4. Plans for data sharing and access
(i.e. how, when, to whom)
5. Strategy for long-term preservation
Five common themes / questions in DMPs
Why create a DMP?
Image CC-BY by Ian Dooley https://unsplash.com/photos/DuBNA1QMpPA
www.geant.org
Many global funders ask for DMPs
Not comprehensive!
www.geant.org
Indonesian requirements
• No requirement for a DMP
• Regulation in law to encourage researchers to
deposit data
• Publishing and sharing is optional
9 |
www.geant.org
What do research funders want?
• A brief plan usually submitted in grant applications
• Some funders may want multiple stages of plans e.g. pre-
award, in-project, final report…
• 1-4 sides of A4 as attachment or a section in application
• Typically a prose statement covering suggested themes
• An outline of data management and sharing plans, justifying
decisions and any limitations
www.geant.org
Trend for DMPs to cover more than data
• Wellcome Trust issued new guidelines in 2017 that ask for an
Outputs Management Plan covering:
– datasets generated by your research
– original software created in the course of your research
– new materials you create – like antibodies, cell lines and reagents
– IP such as patents, copyright, design rights and confidential know-how
• The EPSRC has a requirement for Software Management Plans
www.geant.org
Why write a DMP / manage your data?
NON PECUNIAE INVESTIGATIONIS CURATORE
SED VITAE FACIMUS PROGRAMMAS DATORUM PROCURATIONIS
(Not for the research funder, but for life we make data management plans)
• Make your research easier
• Stop yourself drowning in irrelevant stuff
• Save data for later
• Avoid accusations of fraud or bad science
• Write a data paper
• Share your data for re-use
• Get credit for it
www.geant.org
Don’t undervalue research data
How can we make a good DMP?
14 |
Image CC-BY by Kelly Sikkema https://unsplash.com/photos/v9FQR4tbIq8
www.geant.org
Planning trick 1: think backwards
What data organisation would a re-user like?
CREATING
DATA
PROCESSING
DATA
PRESERVING
DATA
GIVING
ACCESS TO
DATA
RE-USING
DATA
www.geant.org
Data organisation
https://datasupport.researchdata.nl/en/start-the-course/iii-research-phase/organising-data
www.geant.org
Planning trick 2: include RDM stakeholders
Institution
RDM policy
Facilities
€$£
Research funders
Publishers
Data Availability
policy
Commercial partners
www.openaire.eu/briefpaper-rdm-infonoads
www.geant.org
Use the DMP as a talking point
Consulting, supporting and networking with
researchers & all other interest groups
Slide content courtesy of Mari Elisa
Kuusniemi (MEK), University of
Helsinki Library
www.geant.org
Planning trick 3: ground your plan in reality
Base plans on available skills, support and good practice
for the field – show it’s feasible to implement
www.geant.org
Planning trick 4: plan to share from the outset
Decisions made early on affect what you can do later
• Negotiation on licenses and consent agreement may preclude
later sharing if not careful
• Costings can’t be included retrospectively
• Useful to consider data issues at the consortium negotiation
stage to make sure potential issues are identified and sorted asap
Key tools and support
21 |
Image CC-BY by Barn Images https://unsplash.com/photos/t5YUoHW6zRo
www.geant.org
DCC support on DMPs
• Webinars and training materials
• How-to guides and other advisory documents
• Checklist on what to cover in DMPs
• Example DMPs
• DMPonline
https://www.dcc.ac.uk/dmps
www.geant.org
What is DMPonline?
A web-based tool to help researchers write DMPs
https://dmponline.dcc.ac.uk
www.geant.org
How does DMPonline work?
Select options to get tailored guidance and support
Guidance and examples from
funders, unis, research
disciplines and others
DMP
Requirements from
funders, institutions
and others
Create Share Review Export Update …..
www.geant.org
Many DMP tools available…
Platform Organisation(s) Resource link(s)
DMPRoadmap CDL| DCC | Portage Network | INIST CNRS https://github.com/DMPRoadmap/roadmap
University of Queensland
Research Data Manager
University of Queensland https://research.uq.edu.au/project/research-data-
manager-uqrdm
ReDBox DLC QCIF https://www.redboxresearchdata.com.au/rbdlc.html
RDMOrganiser (RDMO) AIP | FHP | KIT http://rdmorganiser.github.io/en
Data Stewardship Wizard ELIXIR | DTL https://github.com/DataStewardshipPortal
ezDMP IEDA https://www.iedadata.org
Data planning tool UNINETT Sigma2 https://www.sigma2.no/content/data-planning-tool
And more….
Please update at: https://activedmps.org
www.geant.org
Managing and sharing data:
a best practice guide
• How to write a DMP
• Formatting your data
• Documentation
• Ethics and consent
• Copyright
• Data sharing
• …
http://data-archive.ac.uk/media/2894/managingsharing.pdf
Questions and worked examples
Image Israel Palacio https://unsplash.com/photos/P6FgiDNe6W4
www.geant.org
1. Describing data to be collected
• What type of data will you produce?
• What file format(s) will your data be in?
• How much data will be produced?
• How will you create your data?
www.geant.org
Data description examples
The final dataset will include self-reported demographic and behavioural data from
interviews with the subjects and laboratory data from urine specimens provided.
From NIH data sharing statements
Every two days, we will subsample E. affinis populations growing under our
treatment conditions. We will use a microscope to identify the life stage and sex of
the subsampled individuals. We will document the information first in a laboratory
notebook and then copy the data into an Excel spreadsheet. The Excel spreadsheet
will be saved as a comma separated value (.csv) file.
From DataOne – E. affinis DMP example
www.geant.org
Some formats are better for long-term
It’s preferable to opt for formats that are:
• Uncompressed
• Non-proprietary
• Open, documented
• Standard representation (ASCII, Unicode)
Data centres may have preferred formats for deposit e.g.
Type Recommended Non-preferred
Tabular data CSV, TSV, SPSS portable Excel
Text Plain text, HTML, RTF
PDF/A only if layout matters
Word
Media Container: MP4, Ogg
Codec: Theora, Dirac, FLAC
Quicktime
H264
Images TIFF, JPEG2000, PNG GIF, JPG
Structured data XML, RDF RDBMS
Further examples: https://www.ukdataservice.ac.uk/manage-data/format/recommended-formats.aspx
www.geant.org
2. Standards and methodologies
• What metadata and documentation will you record?
• What standards are used in your field?
• How will your data be organised?
• Where will it be stored and backed-up?
www.geant.org
Metadata examples
Metadata will be tagged in XML using the Data Documentation Initiative (DDI) format.
The codebook will contain information on study design, sampling methodology,
fieldwork, variable-level detail, and all information necessary for a secondary analyst
to use the data accurately and effectively.
From ICPSR Framework for Creating a DMP
We will first document our metadata by taking careful notes in the laboratory notebook that
refer to specific data files and describe all columns, units, abbreviations, and missing value
identifiers. These notes will be transcribed into a .txt document that will be stored with the
data file. After all of the data are collected, we will then use EML (Ecological Metadata
Language) to digitize our metadata. EML is one of the accepted formats used in ecology, and
works well for the types of data we will be producing. We will create these metadata using
Morpho software, available through KNB. The metadata will fully describe the data files and the
context of the measurements.
From DataOne – E. affinis DMP example
www.geant.org
Where to find relevant standards?
Metadata Standards Directory
Broad, disciplinary listing of standards
and tools. Maintained by RDA group
https://rd-alliance.github.io/metadata-
directory
FAIRsharing
A portal of data standards,
databases, and policies
Focused on life, environmental and
biomedical sciences, but expanding
to other disciplines
https://fairsharing.org
www.geant.org
3. Ethical and IPR implications
• Are you seeking consent from participants?
• Are you re-using other people’s data?
• Who owns your data or has rights in it?
• Are restrictions on sharing needed?
www.geant.org
Examples restrictions
Because the STDs being studied are reportable diseases, we will be collecting identifying
information. Even though the final dataset will be stripped of identifiers prior to release
for sharing, we believe that there remains the possibility of deductive disclosure of
subjects with unusual characteristics. Thus, we will make the data and associated
documentation available to users only under a data-sharing agreement.
From NIH data sharing statements
1. Share data privately within 1 year.
Data will be held in Private Repository, but metadata will be public
2. Release data to public within 2 years.
Encouraged after one year to release data for public access.
3. Request, in writing, data privacy up to 4 years.
Extensions beyond 3 years will only be granted for compelling cases.
4. Consult with creators of private CZO datasets prior to use.
Pis required to seek consent before using private data they can access
From Boulder Creek Critical Zone Observatory DMP
www.geant.org
Seek consent for data sharing & preservation
•If you don’t ask, data centres won’t be able to accept
your data – regardless of any conditions on the original
grant or your desire for the data to be shared.
www.geant.org
4. Data sharing and reuse
• Are you allowed to share your data?
• Who will you share with and how?
• When and where will you make the data available?
• Do you need to impose conditions on reuse?
• How will you license the data for clarity?
www.geant.org
Data sharing examples
We will make the data and associated documentation available to users under a data-sharing
agreement that provides for: (1) a commitment to using the data only for research purposes and not
to identify any individual participant; (2) a commitment to securing the data using appropriate
computer technology; and (3) a commitment to destroying or returning the data after analyses are
completed.
From NIH data sharing statements
The videos will be made available via the bristol.ac.uk website (both as streaming media and downloads) HD and
SD versions will be provided to accommodate those with lower bandwidth. Videos will also be made available via
Vimeo, a platform that is already well used by research students at Bristol. Appropriate metadata will also be
provided to the existing Vimeo standard.
All video will also be available for download and re-editing by third parties. To facilitate this Creative Commons
licenses will be assigned to each item. In order to ensure this usage is possible, the required permissions will be
gathered from participants (using a suitable release form) before recording commences.
From University of Bristol Kitchen Cosmology DMP
www.geant.org
Dataset licensing
Horizon 2020
guidelines point to:
or
www.geant.org
5. Preservation
• Which data do you need to keep?
• Will you deposit your data in a repository?
• Do you need to prepare it for deposit?
www.geant.org
Archiving examples
Data will be provided in file formats considered appropriate for long-term access, as
recommended by the UK Data Service. For example, SPSS Portal format and tab-
delimited text for qualitative tabular data and RTF and PDF/A for interview
transcripts. Appropriate documentation necessary to understand the data will
also be provided. Anonymised data will be held for a minimum of 10 years
following project completion, in compliance with LSHTM’s Records Retention and
Disposal Schedule. Biological samples (output 3) will be deposited with the UK
BioBank for future use.
From Writing a Wellcome Trust Data Management and Sharing Plan
The investigators will work with staff at the UKDA to determine what to archive and
how long the deposited data should be retained. Future long-term use of the data
will be ensured by placing a copy of the data into the repository.
From ICPSR Framework for Creating a DMP
www.geant.org
Lists of repositories to choose from
http://databib.org
http://service.re3data.org/search
Zenodo
• OpenAIRE-CERN joint effort
• Multidisciplinary repository
• Multiple data types
– Publications
– Long tail of research data
• Citable data (DOI)
• Links to funder, publications, data
& software
www.zenodo.org
www.geant.org
Indonesian data repository
http://rin.lipi.go.id
43 |
www.geant.org
Example DMPs
• Public plans on DMPonline
https://dmponline.dcc.ac.uk/public_plans
• Plans from several funders and disciplines via DCC
www.dcc.ac.uk/resources/data-management-plans/guidance-examples
• 108 DMPs from the National Endowment for the Humanities
https://www.neh.gov/sites/default/files/inline-files/dmp_from_successful_grants.zip
• LIBER DMP catalogue in Zenodo
• https://libereurope.eu/working-group/research-data-management/plans
• DMPs published in RIO journal
• http://riojournal.com/browse_user_collection_documents.php?collection_id=3&journal_id=17
www.geant.org
Key messages
• Data management is part of good practice whether you
plan to make the data open or not
– it benefits you!
• Seek advice when developing your DMP - consider good
practice for your field
• Base plans on available skills & support so
implementation is feasible
• Justify decisions – particularly restrictions or costs
www.geant.org
Click to edit Master title style
• Click to edit Master text styles
• Second level
• Third level
• Fourth level
• Fifth level
01/07/2021 46
Thank you
www.geant.org
Any questions?
© GÉANT Association on behalf of the GN4 Phase 2 project (GN4-2).
The research leading to these results has received funding from
the European Union’s Horizon 2020 research and innovation
programme under Grant Agreement No. 731122 (GN4-2). 46 |

More Related Content

What's hot

Open, FAIR data and RDM
Open, FAIR data and RDMOpen, FAIR data and RDM
Open, FAIR data and RDMSarah Jones
 
RDM and DMP intro
RDM and DMP introRDM and DMP intro
RDM and DMP introSarah Jones
 
Managing and sharing data
Managing and sharing dataManaging and sharing data
Managing and sharing dataSarah Jones
 
LIBER Webinar: 23 Things About Research Data Management
LIBER Webinar: 23 Things About Research Data ManagementLIBER Webinar: 23 Things About Research Data Management
LIBER Webinar: 23 Things About Research Data ManagementLIBER Europe
 
Managing and sharing data
Managing and sharing dataManaging and sharing data
Managing and sharing dataSarah Jones
 
Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Historic Environment Scotland
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data ManagementSarah Jones
 
FAIR Data Experiences - Kees van Bochove - The Hyve
FAIR Data Experiences - Kees van Bochove - The HyveFAIR Data Experiences - Kees van Bochove - The Hyve
FAIR Data Experiences - Kees van Bochove - The HyveKees van Bochove
 
H2020 open-data-pilot
H2020 open-data-pilotH2020 open-data-pilot
H2020 open-data-pilotSarah Jones
 
D4Science Data infrastructure: a facilitator for a FAIR data management
D4Science Data infrastructure: a facilitator for a FAIR data managementD4Science Data infrastructure: a facilitator for a FAIR data management
D4Science Data infrastructure: a facilitator for a FAIR data managementResearch Data Alliance
 
Intro to Data Management Plans
Intro to Data Management PlansIntro to Data Management Plans
Intro to Data Management PlansSarah Jones
 
Open Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon HodsonOpen Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon HodsonAfrican Open Science Platform
 

What's hot (20)

RDM & ELNs @ Edinburgh
RDM & ELNs @ EdinburghRDM & ELNs @ Edinburgh
RDM & ELNs @ Edinburgh
 
Open, FAIR data and RDM
Open, FAIR data and RDMOpen, FAIR data and RDM
Open, FAIR data and RDM
 
RDM and DMP intro
RDM and DMP introRDM and DMP intro
RDM and DMP intro
 
Fair data vs 5 star open data final
Fair data vs 5 star open data finalFair data vs 5 star open data final
Fair data vs 5 star open data final
 
Managing and sharing data
Managing and sharing dataManaging and sharing data
Managing and sharing data
 
"Cool" metadata for FAIR data
"Cool" metadata for FAIR data"Cool" metadata for FAIR data
"Cool" metadata for FAIR data
 
Research Data Management: Why is it important?
Research Data Management: Why is it  important?Research Data Management: Why is it  important?
Research Data Management: Why is it important?
 
LIBER Webinar: 23 Things About Research Data Management
LIBER Webinar: 23 Things About Research Data ManagementLIBER Webinar: 23 Things About Research Data Management
LIBER Webinar: 23 Things About Research Data Management
 
Managing and sharing data
Managing and sharing dataManaging and sharing data
Managing and sharing data
 
Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data Management
 
NISO Working Group Connection Live! Research Data Metrics Landscape: An Updat...
NISO Working Group Connection Live! Research Data Metrics Landscape: An Updat...NISO Working Group Connection Live! Research Data Metrics Landscape: An Updat...
NISO Working Group Connection Live! Research Data Metrics Landscape: An Updat...
 
RDM for trainee physicians
RDM for trainee physiciansRDM for trainee physicians
RDM for trainee physicians
 
FAIR Data Experiences - Kees van Bochove - The Hyve
FAIR Data Experiences - Kees van Bochove - The HyveFAIR Data Experiences - Kees van Bochove - The Hyve
FAIR Data Experiences - Kees van Bochove - The Hyve
 
H2020 open-data-pilot
H2020 open-data-pilotH2020 open-data-pilot
H2020 open-data-pilot
 
D4Science Data infrastructure: a facilitator for a FAIR data management
D4Science Data infrastructure: a facilitator for a FAIR data managementD4Science Data infrastructure: a facilitator for a FAIR data management
D4Science Data infrastructure: a facilitator for a FAIR data management
 
Intro to Data Management Plans
Intro to Data Management PlansIntro to Data Management Plans
Intro to Data Management Plans
 
Tijerina-RDA-NISO-Task Groups-sept11
Tijerina-RDA-NISO-Task Groups-sept11Tijerina-RDA-NISO-Task Groups-sept11
Tijerina-RDA-NISO-Task Groups-sept11
 
Open Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon HodsonOpen Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon Hodson
 
Borgman - Privacy, Policy and Data Governance in the University
Borgman - Privacy, Policy and Data Governance in the UniversityBorgman - Privacy, Policy and Data Governance in the University
Borgman - Privacy, Policy and Data Governance in the University
 

Similar to Data Management Planning for researchers

The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...Projeto RCAAP
 
What is-rdm
What is-rdmWhat is-rdm
What is-rdmSarah Jones
 
Introduction to Data Management Planning
Introduction to Data Management PlanningIntroduction to Data Management Planning
Introduction to Data Management PlanningSarah Jones
 
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATTony Ross-Hellauer
 
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATOpenAIRE
 
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu | Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu | EUDAT
 
FAIRDOM data management support for ERACoBioTech Proposals
FAIRDOM data management support for ERACoBioTech ProposalsFAIRDOM data management support for ERACoBioTech Proposals
FAIRDOM data management support for ERACoBioTech ProposalsFAIRDOM
 
Data Management for Undergraduate Researchers
Data Management for Undergraduate ResearchersData Management for Undergraduate Researchers
Data Management for Undergraduate ResearchersRebekah Cummings
 
Research Lifecycles and RDM
Research Lifecycles and RDMResearch Lifecycles and RDM
Research Lifecycles and RDMMarieke Guy
 
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...EUDAT
 
Planning for Research Data Management
Planning for Research Data ManagementPlanning for Research Data Management
Planning for Research Data Managementdancrane_open
 
Data Management and Horizon 2020
Data Management and Horizon 2020Data Management and Horizon 2020
Data Management and Horizon 2020Sarah Jones
 
Data management plans (dmp) for nsf
Data management plans (dmp) for nsfData management plans (dmp) for nsf
Data management plans (dmp) for nsfBrad Houston
 
Data management plans (dmp) for nsf
Data management plans (dmp) for nsfData management plans (dmp) for nsf
Data management plans (dmp) for nsfBrad Houston
 
Planning for Research Data Management: 26th January 2016
Planning for Research Data Management: 26th January 2016Planning for Research Data Management: 26th January 2016
Planning for Research Data Management: 26th January 2016IzzyChad
 
Data management plans and planning - a gentle introduction
Data management plans and planning - a gentle introductionData management plans and planning - a gentle introduction
Data management plans and planning - a gentle introductionMartin Donnelly
 
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation Research Data Alliance
 

Similar to Data Management Planning for researchers (20)

The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
 
Managing your research data
Managing your research dataManaging your research data
Managing your research data
 
What is-rdm
What is-rdmWhat is-rdm
What is-rdm
 
Introduction to Data Management Planning
Introduction to Data Management PlanningIntroduction to Data Management Planning
Introduction to Data Management Planning
 
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
 
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
 
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu | Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
 
FAIRDOM data management support for ERACoBioTech Proposals
FAIRDOM data management support for ERACoBioTech ProposalsFAIRDOM data management support for ERACoBioTech Proposals
FAIRDOM data management support for ERACoBioTech Proposals
 
Data Management for Undergraduate Researchers
Data Management for Undergraduate ResearchersData Management for Undergraduate Researchers
Data Management for Undergraduate Researchers
 
DC101 UWE
DC101 UWEDC101 UWE
DC101 UWE
 
Research Lifecycles and RDM
Research Lifecycles and RDMResearch Lifecycles and RDM
Research Lifecycles and RDM
 
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
 
Planning for Research Data Management
Planning for Research Data ManagementPlanning for Research Data Management
Planning for Research Data Management
 
Data Management and Horizon 2020
Data Management and Horizon 2020Data Management and Horizon 2020
Data Management and Horizon 2020
 
Data management plans (dmp) for nsf
Data management plans (dmp) for nsfData management plans (dmp) for nsf
Data management plans (dmp) for nsf
 
Data management plans (dmp) for nsf
Data management plans (dmp) for nsfData management plans (dmp) for nsf
Data management plans (dmp) for nsf
 
Introduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD StudentsIntroduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD Students
 
Planning for Research Data Management: 26th January 2016
Planning for Research Data Management: 26th January 2016Planning for Research Data Management: 26th January 2016
Planning for Research Data Management: 26th January 2016
 
Data management plans and planning - a gentle introduction
Data management plans and planning - a gentle introductionData management plans and planning - a gentle introduction
Data management plans and planning - a gentle introduction
 
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
 

More from Sarah Jones

Data training tips and tricks
Data training tips and tricksData training tips and tricks
Data training tips and tricksSarah Jones
 
EOSC and libraries
EOSC and librariesEOSC and libraries
EOSC and librariesSarah Jones
 
EOSC Association priorities and activities
EOSC Association priorities and activitiesEOSC Association priorities and activities
EOSC Association priorities and activitiesSarah Jones
 
Managing and sharing data: lessons from the European context
Managing and sharing data: lessons from the European contextManaging and sharing data: lessons from the European context
Managing and sharing data: lessons from the European contextSarah Jones
 
Reflections on Open Science
Reflections on Open ScienceReflections on Open Science
Reflections on Open ScienceSarah Jones
 
MAR comments analysis
MAR comments analysisMAR comments analysis
MAR comments analysisSarah Jones
 
Introduction to Open Science and EOSC
Introduction to Open Science and EOSCIntroduction to Open Science and EOSC
Introduction to Open Science and EOSCSarah Jones
 
EOSC-MAR-update.pptx
EOSC-MAR-update.pptxEOSC-MAR-update.pptx
EOSC-MAR-update.pptxSarah Jones
 
Intro-EOSC.pptx
Intro-EOSC.pptxIntro-EOSC.pptx
Intro-EOSC.pptxSarah Jones
 
Why is EOSC so hard?
Why is EOSC so hard?Why is EOSC so hard?
Why is EOSC so hard?Sarah Jones
 
Is Europe ready for Open Science
Is Europe ready for Open ScienceIs Europe ready for Open Science
Is Europe ready for Open ScienceSarah Jones
 
DMPonline: 10 years, 10 lessons
DMPonline: 10 years, 10 lessonsDMPonline: 10 years, 10 lessons
DMPonline: 10 years, 10 lessonsSarah Jones
 
Do & don't of supporting Open Science
Do & don't of supporting Open ScienceDo & don't of supporting Open Science
Do & don't of supporting Open ScienceSarah Jones
 
Why institutions need to raise their capabilities to support FAIR
Why institutions need to raise their capabilities to support FAIRWhy institutions need to raise their capabilities to support FAIR
Why institutions need to raise their capabilities to support FAIRSarah Jones
 
It takes more than a village: lessons on building global research commons
It takes more than a village: lessons on building global research commonsIt takes more than a village: lessons on building global research commons
It takes more than a village: lessons on building global research commonsSarah Jones
 
DMPTuuli - what's new?
DMPTuuli - what's new?DMPTuuli - what's new?
DMPTuuli - what's new?Sarah Jones
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDMSarah Jones
 
Reflections on EOSC through the mirror of ARDC
Reflections on EOSC through the mirror of ARDCReflections on EOSC through the mirror of ARDC
Reflections on EOSC through the mirror of ARDCSarah Jones
 
Future EOSC roadmap
Future EOSC roadmapFuture EOSC roadmap
Future EOSC roadmapSarah Jones
 
Global Open Research Commons IG
Global Open Research Commons IGGlobal Open Research Commons IG
Global Open Research Commons IGSarah Jones
 

More from Sarah Jones (20)

Data training tips and tricks
Data training tips and tricksData training tips and tricks
Data training tips and tricks
 
EOSC and libraries
EOSC and librariesEOSC and libraries
EOSC and libraries
 
EOSC Association priorities and activities
EOSC Association priorities and activitiesEOSC Association priorities and activities
EOSC Association priorities and activities
 
Managing and sharing data: lessons from the European context
Managing and sharing data: lessons from the European contextManaging and sharing data: lessons from the European context
Managing and sharing data: lessons from the European context
 
Reflections on Open Science
Reflections on Open ScienceReflections on Open Science
Reflections on Open Science
 
MAR comments analysis
MAR comments analysisMAR comments analysis
MAR comments analysis
 
Introduction to Open Science and EOSC
Introduction to Open Science and EOSCIntroduction to Open Science and EOSC
Introduction to Open Science and EOSC
 
EOSC-MAR-update.pptx
EOSC-MAR-update.pptxEOSC-MAR-update.pptx
EOSC-MAR-update.pptx
 
Intro-EOSC.pptx
Intro-EOSC.pptxIntro-EOSC.pptx
Intro-EOSC.pptx
 
Why is EOSC so hard?
Why is EOSC so hard?Why is EOSC so hard?
Why is EOSC so hard?
 
Is Europe ready for Open Science
Is Europe ready for Open ScienceIs Europe ready for Open Science
Is Europe ready for Open Science
 
DMPonline: 10 years, 10 lessons
DMPonline: 10 years, 10 lessonsDMPonline: 10 years, 10 lessons
DMPonline: 10 years, 10 lessons
 
Do & don't of supporting Open Science
Do & don't of supporting Open ScienceDo & don't of supporting Open Science
Do & don't of supporting Open Science
 
Why institutions need to raise their capabilities to support FAIR
Why institutions need to raise their capabilities to support FAIRWhy institutions need to raise their capabilities to support FAIR
Why institutions need to raise their capabilities to support FAIR
 
It takes more than a village: lessons on building global research commons
It takes more than a village: lessons on building global research commonsIt takes more than a village: lessons on building global research commons
It takes more than a village: lessons on building global research commons
 
DMPTuuli - what's new?
DMPTuuli - what's new?DMPTuuli - what's new?
DMPTuuli - what's new?
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 
Reflections on EOSC through the mirror of ARDC
Reflections on EOSC through the mirror of ARDCReflections on EOSC through the mirror of ARDC
Reflections on EOSC through the mirror of ARDC
 
Future EOSC roadmap
Future EOSC roadmapFuture EOSC roadmap
Future EOSC roadmap
 
Global Open Research Commons IG
Global Open Research Commons IGGlobal Open Research Commons IG
Global Open Research Commons IG
 

Recently uploaded

Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajanpragatimahajan3
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Disha Kariya
 
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 đź’ž Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 đź’ž Full Nigh...Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 đź’ž Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 đź’ž Full Nigh...Pooja Nehwal
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Dr. Mazin Mohamed alkathiri
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 

Recently uploaded (20)

INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 đź’ž Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 đź’ž Full Nigh...Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 đź’ž Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 đź’ž Full Nigh...
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 

Data Management Planning for researchers

  • 1. www.geant.org 1 | Click to edit Master title style • Click to edit Master text styles • Second level • Third level • Fourth level • Fifth level 01/07/2021 1 Data Management Planning for researchers www.geant.org Sarah Jones EOSC Engagement Manager sarah.jones@geant.org Twitter: @sarahroams Indonesian RDM webinar series Friday 2nd July 2021
  • 2. What is a DMP? Image CC-BY-NC-SA by Leo Reynolds www.flickr.com/photos/lwr/13442910354
  • 3. All manner of things that you produce in the course of your research What is research data?
  • 4. “the active management and appraisal of data over the lifecycle of scholarly and scientific interest” Data management is part of good research practice What is research data management? Create Document Use Store Share Preserve
  • 5. A short plan that outlines: • what data will be created and how • how it will be managed (storage, back-up, access…) • plans for data sharing and preservation DMPs are often submitted as part of grant applications, but are useful whenever researchers are creating data What is a DMP?
  • 6. 1. Description of data to be collected / created (i.e. content, type, format, volume...) 2. Standards / methodologies for data collection & management 3. Ethics and Intellectual Property (highlight any restrictions on data sharing e.g. embargoes, confidentiality) 4. Plans for data sharing and access (i.e. how, when, to whom) 5. Strategy for long-term preservation Five common themes / questions in DMPs
  • 7. Why create a DMP? Image CC-BY by Ian Dooley https://unsplash.com/photos/DuBNA1QMpPA
  • 8. www.geant.org Many global funders ask for DMPs Not comprehensive!
  • 9. www.geant.org Indonesian requirements • No requirement for a DMP • Regulation in law to encourage researchers to deposit data • Publishing and sharing is optional 9 |
  • 10. www.geant.org What do research funders want? • A brief plan usually submitted in grant applications • Some funders may want multiple stages of plans e.g. pre- award, in-project, final report… • 1-4 sides of A4 as attachment or a section in application • Typically a prose statement covering suggested themes • An outline of data management and sharing plans, justifying decisions and any limitations
  • 11. www.geant.org Trend for DMPs to cover more than data • Wellcome Trust issued new guidelines in 2017 that ask for an Outputs Management Plan covering: – datasets generated by your research – original software created in the course of your research – new materials you create – like antibodies, cell lines and reagents – IP such as patents, copyright, design rights and confidential know-how • The EPSRC has a requirement for Software Management Plans
  • 12. www.geant.org Why write a DMP / manage your data? NON PECUNIAE INVESTIGATIONIS CURATORE SED VITAE FACIMUS PROGRAMMAS DATORUM PROCURATIONIS (Not for the research funder, but for life we make data management plans) • Make your research easier • Stop yourself drowning in irrelevant stuff • Save data for later • Avoid accusations of fraud or bad science • Write a data paper • Share your data for re-use • Get credit for it
  • 14. How can we make a good DMP? 14 | Image CC-BY by Kelly Sikkema https://unsplash.com/photos/v9FQR4tbIq8
  • 15. www.geant.org Planning trick 1: think backwards What data organisation would a re-user like? CREATING DATA PROCESSING DATA PRESERVING DATA GIVING ACCESS TO DATA RE-USING DATA
  • 17. www.geant.org Planning trick 2: include RDM stakeholders Institution RDM policy Facilities €$ÂŁ Research funders Publishers Data Availability policy Commercial partners www.openaire.eu/briefpaper-rdm-infonoads
  • 18. www.geant.org Use the DMP as a talking point Consulting, supporting and networking with researchers & all other interest groups Slide content courtesy of Mari Elisa Kuusniemi (MEK), University of Helsinki Library
  • 19. www.geant.org Planning trick 3: ground your plan in reality Base plans on available skills, support and good practice for the field – show it’s feasible to implement
  • 20. www.geant.org Planning trick 4: plan to share from the outset Decisions made early on affect what you can do later • Negotiation on licenses and consent agreement may preclude later sharing if not careful • Costings can’t be included retrospectively • Useful to consider data issues at the consortium negotiation stage to make sure potential issues are identified and sorted asap
  • 21. Key tools and support 21 | Image CC-BY by Barn Images https://unsplash.com/photos/t5YUoHW6zRo
  • 22. www.geant.org DCC support on DMPs • Webinars and training materials • How-to guides and other advisory documents • Checklist on what to cover in DMPs • Example DMPs • DMPonline https://www.dcc.ac.uk/dmps
  • 23. www.geant.org What is DMPonline? A web-based tool to help researchers write DMPs https://dmponline.dcc.ac.uk
  • 24. www.geant.org How does DMPonline work? Select options to get tailored guidance and support Guidance and examples from funders, unis, research disciplines and others DMP Requirements from funders, institutions and others Create Share Review Export Update …..
  • 25. www.geant.org Many DMP tools available… Platform Organisation(s) Resource link(s) DMPRoadmap CDL| DCC | Portage Network | INIST CNRS https://github.com/DMPRoadmap/roadmap University of Queensland Research Data Manager University of Queensland https://research.uq.edu.au/project/research-data- manager-uqrdm ReDBox DLC QCIF https://www.redboxresearchdata.com.au/rbdlc.html RDMOrganiser (RDMO) AIP | FHP | KIT http://rdmorganiser.github.io/en Data Stewardship Wizard ELIXIR | DTL https://github.com/DataStewardshipPortal ezDMP IEDA https://www.iedadata.org Data planning tool UNINETT Sigma2 https://www.sigma2.no/content/data-planning-tool And more…. Please update at: https://activedmps.org
  • 26. www.geant.org Managing and sharing data: a best practice guide • How to write a DMP • Formatting your data • Documentation • Ethics and consent • Copyright • Data sharing • … http://data-archive.ac.uk/media/2894/managingsharing.pdf
  • 27. Questions and worked examples Image Israel Palacio https://unsplash.com/photos/P6FgiDNe6W4
  • 28. www.geant.org 1. Describing data to be collected • What type of data will you produce? • What file format(s) will your data be in? • How much data will be produced? • How will you create your data?
  • 29. www.geant.org Data description examples The final dataset will include self-reported demographic and behavioural data from interviews with the subjects and laboratory data from urine specimens provided. From NIH data sharing statements Every two days, we will subsample E. affinis populations growing under our treatment conditions. We will use a microscope to identify the life stage and sex of the subsampled individuals. We will document the information first in a laboratory notebook and then copy the data into an Excel spreadsheet. The Excel spreadsheet will be saved as a comma separated value (.csv) file. From DataOne – E. affinis DMP example
  • 30. www.geant.org Some formats are better for long-term It’s preferable to opt for formats that are: • Uncompressed • Non-proprietary • Open, documented • Standard representation (ASCII, Unicode) Data centres may have preferred formats for deposit e.g. Type Recommended Non-preferred Tabular data CSV, TSV, SPSS portable Excel Text Plain text, HTML, RTF PDF/A only if layout matters Word Media Container: MP4, Ogg Codec: Theora, Dirac, FLAC Quicktime H264 Images TIFF, JPEG2000, PNG GIF, JPG Structured data XML, RDF RDBMS Further examples: https://www.ukdataservice.ac.uk/manage-data/format/recommended-formats.aspx
  • 31. www.geant.org 2. Standards and methodologies • What metadata and documentation will you record? • What standards are used in your field? • How will your data be organised? • Where will it be stored and backed-up?
  • 32. www.geant.org Metadata examples Metadata will be tagged in XML using the Data Documentation Initiative (DDI) format. The codebook will contain information on study design, sampling methodology, fieldwork, variable-level detail, and all information necessary for a secondary analyst to use the data accurately and effectively. From ICPSR Framework for Creating a DMP We will first document our metadata by taking careful notes in the laboratory notebook that refer to specific data files and describe all columns, units, abbreviations, and missing value identifiers. These notes will be transcribed into a .txt document that will be stored with the data file. After all of the data are collected, we will then use EML (Ecological Metadata Language) to digitize our metadata. EML is one of the accepted formats used in ecology, and works well for the types of data we will be producing. We will create these metadata using Morpho software, available through KNB. The metadata will fully describe the data files and the context of the measurements. From DataOne – E. affinis DMP example
  • 33. www.geant.org Where to find relevant standards? Metadata Standards Directory Broad, disciplinary listing of standards and tools. Maintained by RDA group https://rd-alliance.github.io/metadata- directory FAIRsharing A portal of data standards, databases, and policies Focused on life, environmental and biomedical sciences, but expanding to other disciplines https://fairsharing.org
  • 34. www.geant.org 3. Ethical and IPR implications • Are you seeking consent from participants? • Are you re-using other people’s data? • Who owns your data or has rights in it? • Are restrictions on sharing needed?
  • 35. www.geant.org Examples restrictions Because the STDs being studied are reportable diseases, we will be collecting identifying information. Even though the final dataset will be stripped of identifiers prior to release for sharing, we believe that there remains the possibility of deductive disclosure of subjects with unusual characteristics. Thus, we will make the data and associated documentation available to users only under a data-sharing agreement. From NIH data sharing statements 1. Share data privately within 1 year. Data will be held in Private Repository, but metadata will be public 2. Release data to public within 2 years. Encouraged after one year to release data for public access. 3. Request, in writing, data privacy up to 4 years. Extensions beyond 3 years will only be granted for compelling cases. 4. Consult with creators of private CZO datasets prior to use. Pis required to seek consent before using private data they can access From Boulder Creek Critical Zone Observatory DMP
  • 36. www.geant.org Seek consent for data sharing & preservation •If you don’t ask, data centres won’t be able to accept your data – regardless of any conditions on the original grant or your desire for the data to be shared.
  • 37. www.geant.org 4. Data sharing and reuse • Are you allowed to share your data? • Who will you share with and how? • When and where will you make the data available? • Do you need to impose conditions on reuse? • How will you license the data for clarity?
  • 38. www.geant.org Data sharing examples We will make the data and associated documentation available to users under a data-sharing agreement that provides for: (1) a commitment to using the data only for research purposes and not to identify any individual participant; (2) a commitment to securing the data using appropriate computer technology; and (3) a commitment to destroying or returning the data after analyses are completed. From NIH data sharing statements The videos will be made available via the bristol.ac.uk website (both as streaming media and downloads) HD and SD versions will be provided to accommodate those with lower bandwidth. Videos will also be made available via Vimeo, a platform that is already well used by research students at Bristol. Appropriate metadata will also be provided to the existing Vimeo standard. All video will also be available for download and re-editing by third parties. To facilitate this Creative Commons licenses will be assigned to each item. In order to ensure this usage is possible, the required permissions will be gathered from participants (using a suitable release form) before recording commences. From University of Bristol Kitchen Cosmology DMP
  • 40. www.geant.org 5. Preservation • Which data do you need to keep? • Will you deposit your data in a repository? • Do you need to prepare it for deposit?
  • 41. www.geant.org Archiving examples Data will be provided in file formats considered appropriate for long-term access, as recommended by the UK Data Service. For example, SPSS Portal format and tab- delimited text for qualitative tabular data and RTF and PDF/A for interview transcripts. Appropriate documentation necessary to understand the data will also be provided. Anonymised data will be held for a minimum of 10 years following project completion, in compliance with LSHTM’s Records Retention and Disposal Schedule. Biological samples (output 3) will be deposited with the UK BioBank for future use. From Writing a Wellcome Trust Data Management and Sharing Plan The investigators will work with staff at the UKDA to determine what to archive and how long the deposited data should be retained. Future long-term use of the data will be ensured by placing a copy of the data into the repository. From ICPSR Framework for Creating a DMP
  • 42. www.geant.org Lists of repositories to choose from http://databib.org http://service.re3data.org/search Zenodo • OpenAIRE-CERN joint effort • Multidisciplinary repository • Multiple data types – Publications – Long tail of research data • Citable data (DOI) • Links to funder, publications, data & software www.zenodo.org
  • 44. www.geant.org Example DMPs • Public plans on DMPonline https://dmponline.dcc.ac.uk/public_plans • Plans from several funders and disciplines via DCC www.dcc.ac.uk/resources/data-management-plans/guidance-examples • 108 DMPs from the National Endowment for the Humanities https://www.neh.gov/sites/default/files/inline-files/dmp_from_successful_grants.zip • LIBER DMP catalogue in Zenodo • https://libereurope.eu/working-group/research-data-management/plans • DMPs published in RIO journal • http://riojournal.com/browse_user_collection_documents.php?collection_id=3&journal_id=17
  • 45. www.geant.org Key messages • Data management is part of good practice whether you plan to make the data open or not – it benefits you! • Seek advice when developing your DMP - consider good practice for your field • Base plans on available skills & support so implementation is feasible • Justify decisions – particularly restrictions or costs
  • 46. www.geant.org Click to edit Master title style • Click to edit Master text styles • Second level • Third level • Fourth level • Fifth level 01/07/2021 46 Thank you www.geant.org Any questions? © GÉANT Association on behalf of the GN4 Phase 2 project (GN4-2). The research leading to these results has received funding from the European Union’s Horizon 2020 research and innovation programme under Grant Agreement No. 731122 (GN4-2). 46 |