SlideShare a Scribd company logo
1 of 32
Managing research data
and Horizon 2020
Sarah Jones
Digital Curation Centre, Glasgow
sarah.jones@glasgow.ac.uk
Twitter: @sjDCC
Consorcio Madroño conference on Data Management Plans and Horizon 2020,
ETSI Industriales, Madrid, 25th February 2015
Funded by:
Benefits and drivers
WHY MANAGE RESEARCH DATA?
What is Research Data Management?
The active management of data throughout the lifecycle
Create
Document
Use
Store
Share
Preserve
• Data Management Planning
• Creating data
• Documenting data
• Accessing / using data
• Storage and backup
• Selecting what to keep
• Sharing data
• Data licensing and citation
• Preserving data
• …
Why manage research data?
Direct benefits for you
• To make your research
easier!
• Stop yourself drowning
in irrelevant stuff
• Have data organised so
you know which versions
are most up-to-date
• Make sure you can
understand and reuse
your data again later
Research integrity
• To avoid accusations of
fraud or bad science
• Evidence findings and
enable validation of
research methods
• Codes of practice on
good research conduct
• Many research funders
worldwide now require
Data Management and
Sharing Plans
Potential to share
• So others can reuse and
build on your data
• To gain credit – several
studies have shown
higher citation rates
when data are shared
• For greater visibility,
impact and new
research collaborations
• Promote innovation and
allow research in your
field to advance faster
It’s part of good research practice
Science as an open enterprise
https://royalsociety.org/policy/projects/science-public-enterprise/Report
“Much of the remarkable growth of scientific
understanding in recent centuries is due to open
practices; open communication and deliberation
sit at the heart of scientific practice.”
The Royal Society report calls for ‘intelligent
openness’ whereby data are accessible,
intelligible, assessable and usable.
More scientific breakthroughs
www.nytimes.com/2010/08/13/health/research/13alzheimer.html?pagewanted=all&_r=0
“It was unbelievable. Its not science
the way most of us have practiced in
our careers. But we all realised that
we would never get biomarkers unless
all of us parked our egos and
intellectual property noses outside the
door and agreed that all of our data
would be public immediately.”
Dr John Trojanowski, University of Pennsylvania
Increased use and economic benefit
UP TO 2008
Sold through the US Geological Survey
for US$600 per scene
Sales of 19,000 scenes per year
Annual revenue of $11.4 million
SINCE 2009
Freely available over the internet
Google Earth now uses the images
Transmission of 2,100,000 scenes per year.
Estimated to have created value for the
environmental management industry of $935
million, with direct benefit of more than
$100 million per year to the US economy
Has stimulated the development of
applications from a large number of
companies worldwide
The case of NASA Landsat satellite imagery of the Earth’s surface:
http://earthobservatory.nasa.gov/IOTD/view.php?id=83394&src=ve
But there are also opportunity costs
By Emilio Bruna
http://brunalab.org/blog/2014/09/04/the-opportunity-cost-
of-my-openscience-was-35-hours-690
For his most recent paper:
1. Double checking the main dataset and
reformatting to submit to Dryad: 5 hours
2. Creating complementary file and preparing
metadata: 3 hours
3. Submission of these two files and the
metadata to Dryad: 45 minutes
4. Preparing a map of the locations: 1 hour
5. Submission of map to Figshare: 15 minutes
6. Cleaning up and documenting the code,
uploading it to GitHub: 25 hours
7. Cost of archiving in Dryad: US$90
8. Page Charges: $600
So what needs to change?
Conclusions from Emilio Bruna:
• Develop a better system of incentives from the
community for archiving data and code
• Teach our students how to do this NOW - it’s much easier
if you develop good habits early
• Minimise the actual and opportunity costs
We need to stop telling people “You should” and get
better at telling people “Here’s how”
10 things to think about
HOW TO MANAGE DATA?
Image CC-BY-NC-SA by Leo Reynolds www.flickr.com/photos/lwr/13442910354
1. What file formats are most appropriate?
• Do you have a choice or do the instruments you use only export in certain formats?
• What is common in your field? Try to use something that is accepted and widespread.
• Does your data centre recommend formats? If so it’s best to use these.
If you want your data to be re-used and sustainable in the long-term, you
typically want to opt for open, non-proprietary formats.
Type Recommended Avoid for data sharing
Tabular data CSV, TSV, SPSS portable Excel
Text Plain text, HTML, RTF
PDF/A only if layout matters
Word
Media Container: MP4, Ogg
Codec: Theora, Dirac, FLAC
Quicktime
H264
Images TIFF, JPEG2000, PNG GIF, JPG
Structured data XML, RDF RDBMS
Further examples: http://www.data-archive.ac.uk/create-manage/format/formats-table
2. How will you name your files?
• Keep file and folder names short, but meaningful
• Agree a method for versioning
• Include dates in a set format e.g. YYYYMMDD
• Avoid using non-alphanumeric characters in file names
• Use hyphens or underscores not spaces e.g. day-sheet, day_sheet
• Order the elements in the most appropriate way to retrieve the record
www.jiscdigitalmedia.ac.uk/guide/choosing-a-file-name
Example from ARM Climate
Research Facility
www.arm.gov/data/docs/plan
3. Can others understand the data?
Think about what is needed in order to find, evaluate,
understand, and reuse the data.
• Have you documented what you did and how?
• Did you develop code to run analyses? If so, this should
be kept and shared too.
• Is it clear what each bit of your dataset means? Make
sure the units are labelled and abbreviations explained.
• Record metadata so others can find your work e.g. title,
date, creator(s), subject, format, rights…,
4. What metadata standards will you use?
Use relevant standards for interoperability
www.dcc.ac.uk/resources/metadata-standards
5. Where will you store the data?
• Your own device (laptop, flash drive, server etc.)
– And if you lose it? Or it breaks?
• Departmental drives or university servers
• “Cloud” storage
– Do they care as much about your data as you do?
The decision will be based on how sensitive your data are, how robust
you need the storage to be, and who needs access to the data and when
6. Who will do the backup?
Use managed services where possible (e.g. University
filestores rather than local or external hard drives), so
backup is done automatically
3… 2… 1… backup!
at least 3 copies of a file
on at least 2 different media
with at least 1 offsite
Ask central IT team for advice
7. Can you publish / share your data?
• Who owns the data?
• Have you got consent for sharing?
• Do any licences you’ve signed permit sharing?
• Is the data in suitable formats?
• Is there enough documentation?
www.dcc.ac.uk/resources/how-guides/license-research-data
8. How will you license your data?
• Can your data be made available
openly? e.g. CC0 or CC-BY
• Do you need to place certain
restrictions on who can use the data
or how?
This DCC how-to guide outlines pros
and cons of each approach and gives
practical advice on how to implement
your licence.
9. Which data need to be kept?
Five steps to follow
① Could this data be re-used
② Must it be kept as evidence or for legal reasons
③ Should it be kept for its potential value
④ Consider costs – do benefits outweigh cost?
⑤ Evaluate criteria to decide what to keep
5 steps to decide what data to keep
www.dcc.ac.uk/resources/how-guides/five-steps-decide-what-data-keep
10. Who will share & preserve the data?
http://databib.org
http://service.re3data.org/search
Zenodo
• Joint effort by OpenAIRE-CERN
• Multidisciplinary repository
• Multiple data types
– Publications
– Long tail of research data
• Citable data (DOI)
• Links funding, publications, data
& software
www.zenodo.org
• Does your publisher or funder suggest a repository?
• Are there data centres or community databases for your discipline?
• Does your university offer support for long-term preservation?
Managing and sharing data:
a best practice guide
http://data-archive.ac.uk/media/2894/managingsharing.pdf
Guidance and support to meet the EC requirements
HORIZON 2020 OPEN DATA PILOT
Image CC-BY-NC-SA by Tom Magllery www.flickr.com/photos/lwr/13442910354
Why open access and open data?
“The European Commission’s vision is
that information already paid for by the
public purse should not be paid for again
each time it is accessed or used, and that
it should benefit European companies and
citizens to the full.”
http://ec.europa.eu/research/participants/data/
ref/h2020/grants_manual/hi/oa_pilot/h2020-hi-oa-
pilot-guide_en.pdf
What is open data?
 make your stuff available on the Web (whatever format) under an open licence
 make it available as structured data (e.g. Excel instead of a scan of a table)
 use non-proprietary formats (e.g. CSV instead of Excel)
 use URIs to denote things, so that people can point at your stuff
 link your data to other data to provide context
Tim Berners-Lee’s proposal for five star open data - http://5stardata.info
“Open data and content can be freely used, modified
and shared by anyone for any purpose”
http://opendefinition.org
How to make data open?
1. Choose your dataset(s)
What can you may open? You may need to revisit this step if
you encounter problems later.
2. Apply an open license
Determine what IP exists. Apply a suitable licence e.g. CC-BY
3. Make the data available
Provide the data in a suitable format. Use repositories.
4. Make it discoverable
Post on the web, register in catalogues…
https://okfn.org
Support on Data Management Plans
What to cover in DMPs:
1. Description of data to be collected / created
2. Standards and methodologies for data collection & management
3. Any issues or restrictions due to ethics and Intellectual Property
4. Plans for data sharing and access
5. Strategy for long-term preservation
Example DMPs, guidance, tools and support at:
www.dcc.ac.uk/resources/data-management-plans
OpenAIRE
http://vimeo.com/108790101
Open Access Infrastructure for research in Europe
• aggregates data on OA publications
• mines & enriches it content by linking thing together
• provides services & APIs e.g.
to generate publication lists
www.openaire.eu
EUDAT
Data Infrastructure project offering various services:
• B2DROP: sync & exchange data
• B2SHARE: store & share
• B2STAGE: get data to computation
• B2SAFE: replicate data safely
• B2FIND: find data
They also offer a licensing wizard:
http://ufal.github.io/lindat-license-selector
www.eudat.eu
Discipline-specific infrastructure
FOSTER open science
• Open access and open data training across Europe
• Portal with access to reusable learning objects
• e-learning courses forthcoming
• Check out the training programme
www.fosteropenscience.eu/events
Sharing European Research Outcomes: Raising Awareness
on Open Access, Open Research Data and Open Science
May 13 (Madrid) – 14 (Valencia) – 28 (Gijón)
Gracias por su atención
DCC guidance, tools & case studies:
www.dcc.ac.uk/resources
Follow us on twitter:
@digitalcuration and #ukdcc

More Related Content

What's hot

Data Access & Storage @ UWA - UWA Research Week September 2017
Data Access & Storage @ UWA - UWA Research Week September 2017Data Access & Storage @ UWA - UWA Research Week September 2017
Data Access & Storage @ UWA - UWA Research Week September 2017Katina Toufexis
 
Data Management Planning
Data Management PlanningData Management Planning
Data Management PlanningSarah Jones
 
Data Management Planning for researchers
Data Management Planning for researchersData Management Planning for researchers
Data Management Planning for researchersSarah Jones
 
20160414 23 Research Data Things
20160414 23 Research Data Things20160414 23 Research Data Things
20160414 23 Research Data ThingsKatina Toufexis
 
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATTony Ross-Hellauer
 
LEARN Conference - How to cost
LEARN Conference - How to costLEARN Conference - How to cost
LEARN Conference - How to costJisc RDM
 
NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...
NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...
NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...Kristin Briney
 
Case Study Big Data: Socio-Technical Issues of HathiTrust Digital Texts
Case Study Big Data: Socio-Technical Issues of HathiTrust Digital TextsCase Study Big Data: Socio-Technical Issues of HathiTrust Digital Texts
Case Study Big Data: Socio-Technical Issues of HathiTrust Digital TextsBeth Plale
 
Practical Data Management - ACRL DCIG Webinar
Practical Data Management - ACRL DCIG WebinarPractical Data Management - ACRL DCIG Webinar
Practical Data Management - ACRL DCIG WebinarKristin Briney
 
Data management plans
Data management plansData management plans
Data management plansBrad Houston
 
Do & don't of supporting Open Science
Do & don't of supporting Open ScienceDo & don't of supporting Open Science
Do & don't of supporting Open ScienceSarah Jones
 
Scott Edmunds at OASP Asia: Open (and Big) Data – the next challenge
Scott Edmunds at OASP Asia: Open (and Big) Data – the next challengeScott Edmunds at OASP Asia: Open (and Big) Data – the next challenge
Scott Edmunds at OASP Asia: Open (and Big) Data – the next challengeGigaScience, BGI Hong Kong
 
DATA MANAGEMENT – WHAT DOES IT MEAN FOR RESEARCHERS?
DATA MANAGEMENT – WHAT DOES IT MEAN FOR RESEARCHERS?DATA MANAGEMENT – WHAT DOES IT MEAN FOR RESEARCHERS?
DATA MANAGEMENT – WHAT DOES IT MEAN FOR RESEARCHERS?Incremental Project
 
Data Citation Implementation Guidelines By Tim Clark
Data Citation Implementation Guidelines By Tim ClarkData Citation Implementation Guidelines By Tim Clark
Data Citation Implementation Guidelines By Tim Clarkdatascienceiqss
 
Implementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research DataImplementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research DataMartin Hamilton
 
Overcoming obstacles to sharing data about human subjects
Overcoming obstacles to sharing data about human subjectsOvercoming obstacles to sharing data about human subjects
Overcoming obstacles to sharing data about human subjectsRobin Rice
 

What's hot (20)

Data Access & Storage @ UWA - UWA Research Week September 2017
Data Access & Storage @ UWA - UWA Research Week September 2017Data Access & Storage @ UWA - UWA Research Week September 2017
Data Access & Storage @ UWA - UWA Research Week September 2017
 
Research Data Management: Policy Development
Research Data Management: Policy DevelopmentResearch Data Management: Policy Development
Research Data Management: Policy Development
 
Data Management Planning
Data Management PlanningData Management Planning
Data Management Planning
 
Data Management Planning for researchers
Data Management Planning for researchersData Management Planning for researchers
Data Management Planning for researchers
 
20160414 23 Research Data Things
20160414 23 Research Data Things20160414 23 Research Data Things
20160414 23 Research Data Things
 
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
 
LEARN Conference - How to cost
LEARN Conference - How to costLEARN Conference - How to cost
LEARN Conference - How to cost
 
NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...
NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...
NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...
 
Case Study Big Data: Socio-Technical Issues of HathiTrust Digital Texts
Case Study Big Data: Socio-Technical Issues of HathiTrust Digital TextsCase Study Big Data: Socio-Technical Issues of HathiTrust Digital Texts
Case Study Big Data: Socio-Technical Issues of HathiTrust Digital Texts
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 
Practical Data Management - ACRL DCIG Webinar
Practical Data Management - ACRL DCIG WebinarPractical Data Management - ACRL DCIG Webinar
Practical Data Management - ACRL DCIG Webinar
 
Data management plans
Data management plansData management plans
Data management plans
 
Do & don't of supporting Open Science
Do & don't of supporting Open ScienceDo & don't of supporting Open Science
Do & don't of supporting Open Science
 
Scott Edmunds at OASP Asia: Open (and Big) Data – the next challenge
Scott Edmunds at OASP Asia: Open (and Big) Data – the next challengeScott Edmunds at OASP Asia: Open (and Big) Data – the next challenge
Scott Edmunds at OASP Asia: Open (and Big) Data – the next challenge
 
DATA MANAGEMENT – WHAT DOES IT MEAN FOR RESEARCHERS?
DATA MANAGEMENT – WHAT DOES IT MEAN FOR RESEARCHERS?DATA MANAGEMENT – WHAT DOES IT MEAN FOR RESEARCHERS?
DATA MANAGEMENT – WHAT DOES IT MEAN FOR RESEARCHERS?
 
Data Citation Implementation Guidelines By Tim Clark
Data Citation Implementation Guidelines By Tim ClarkData Citation Implementation Guidelines By Tim Clark
Data Citation Implementation Guidelines By Tim Clark
 
Implementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research DataImplementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research Data
 
Overcoming obstacles to sharing data about human subjects
Overcoming obstacles to sharing data about human subjectsOvercoming obstacles to sharing data about human subjects
Overcoming obstacles to sharing data about human subjects
 
Digital Destiny
Digital DestinyDigital Destiny
Digital Destiny
 
Levine - Data Curation; Ethics and Legal Considerations
Levine - Data Curation; Ethics and Legal ConsiderationsLevine - Data Curation; Ethics and Legal Considerations
Levine - Data Curation; Ethics and Legal Considerations
 

Viewers also liked

DMPonline roadmap 2015
DMPonline roadmap 2015DMPonline roadmap 2015
DMPonline roadmap 2015Sarah Jones
 
Open science and its advocacy
Open science and its advocacyOpen science and its advocacy
Open science and its advocacySarah Jones
 
DMPonline new features
DMPonline new featuresDMPonline new features
DMPonline new featuresSarah Jones
 
DMPonline user group
DMPonline user groupDMPonline user group
DMPonline user groupSarah Jones
 
Research support-challenges
Research support-challengesResearch support-challenges
Research support-challengesSarah Jones
 
H2020 Open Research Data pilot
H2020 Open Research Data pilotH2020 Open Research Data pilot
H2020 Open Research Data pilotSarah Jones
 
Research data policy
Research data policyResearch data policy
Research data policySarah Jones
 
Data Management for Quantitative Biology - Lecture 1, Apr 16, 2015
Data Management for Quantitative Biology - Lecture 1, Apr 16, 2015Data Management for Quantitative Biology - Lecture 1, Apr 16, 2015
Data Management for Quantitative Biology - Lecture 1, Apr 16, 2015QBiC_Tue
 
Introduction to Data Management Planning
Introduction to Data Management PlanningIntroduction to Data Management Planning
Introduction to Data Management PlanningSarah Jones
 
Clinical data management india as a hub
Clinical data management india as a hubClinical data management india as a hub
Clinical data management india as a hubBhaswat Chakraborty
 
Horizon 2020 and the open research data pilot
Horizon 2020 and the open research data pilotHorizon 2020 and the open research data pilot
Horizon 2020 and the open research data pilotSarah Jones
 
The Imperative of Linking Clinical and Financial Data to Improve Outcomes - H...
The Imperative of Linking Clinical and Financial Data to Improve Outcomes - H...The Imperative of Linking Clinical and Financial Data to Improve Outcomes - H...
The Imperative of Linking Clinical and Financial Data to Improve Outcomes - H...Health Catalyst
 
The ABCs of Clinical Trial Management Systems
The ABCs of Clinical Trial Management SystemsThe ABCs of Clinical Trial Management Systems
The ABCs of Clinical Trial Management SystemsPerficient, Inc.
 
H2020 Open Data Pilot
H2020 Open Data PilotH2020 Open Data Pilot
H2020 Open Data PilotSarah Jones
 
Clinical trials ppt. Dr. Zubair Ali
Clinical trials ppt. Dr. Zubair AliClinical trials ppt. Dr. Zubair Ali
Clinical trials ppt. Dr. Zubair AliDr. Zubair Ali
 
Introduction to Oracle Clinical Data Model
Introduction to Oracle Clinical Data ModelIntroduction to Oracle Clinical Data Model
Introduction to Oracle Clinical Data ModelPerficient
 
Clinical data-management-overview
Clinical data-management-overviewClinical data-management-overview
Clinical data-management-overviewAcri India
 
Clinical data management process setup
Clinical data management process  setupClinical data management process  setup
Clinical data management process setupDr.K Pati
 

Viewers also liked (20)

DMPonline roadmap 2015
DMPonline roadmap 2015DMPonline roadmap 2015
DMPonline roadmap 2015
 
Open science and its advocacy
Open science and its advocacyOpen science and its advocacy
Open science and its advocacy
 
DMPonline new features
DMPonline new featuresDMPonline new features
DMPonline new features
 
Open data pilot
Open data pilotOpen data pilot
Open data pilot
 
DMPonline user group
DMPonline user groupDMPonline user group
DMPonline user group
 
Research support-challenges
Research support-challengesResearch support-challenges
Research support-challenges
 
H2020 Open Research Data pilot
H2020 Open Research Data pilotH2020 Open Research Data pilot
H2020 Open Research Data pilot
 
What is a DMP
What is a DMPWhat is a DMP
What is a DMP
 
Research data policy
Research data policyResearch data policy
Research data policy
 
Data Management for Quantitative Biology - Lecture 1, Apr 16, 2015
Data Management for Quantitative Biology - Lecture 1, Apr 16, 2015Data Management for Quantitative Biology - Lecture 1, Apr 16, 2015
Data Management for Quantitative Biology - Lecture 1, Apr 16, 2015
 
Introduction to Data Management Planning
Introduction to Data Management PlanningIntroduction to Data Management Planning
Introduction to Data Management Planning
 
Clinical data management india as a hub
Clinical data management india as a hubClinical data management india as a hub
Clinical data management india as a hub
 
Horizon 2020 and the open research data pilot
Horizon 2020 and the open research data pilotHorizon 2020 and the open research data pilot
Horizon 2020 and the open research data pilot
 
The Imperative of Linking Clinical and Financial Data to Improve Outcomes - H...
The Imperative of Linking Clinical and Financial Data to Improve Outcomes - H...The Imperative of Linking Clinical and Financial Data to Improve Outcomes - H...
The Imperative of Linking Clinical and Financial Data to Improve Outcomes - H...
 
The ABCs of Clinical Trial Management Systems
The ABCs of Clinical Trial Management SystemsThe ABCs of Clinical Trial Management Systems
The ABCs of Clinical Trial Management Systems
 
H2020 Open Data Pilot
H2020 Open Data PilotH2020 Open Data Pilot
H2020 Open Data Pilot
 
Clinical trials ppt. Dr. Zubair Ali
Clinical trials ppt. Dr. Zubair AliClinical trials ppt. Dr. Zubair Ali
Clinical trials ppt. Dr. Zubair Ali
 
Introduction to Oracle Clinical Data Model
Introduction to Oracle Clinical Data ModelIntroduction to Oracle Clinical Data Model
Introduction to Oracle Clinical Data Model
 
Clinical data-management-overview
Clinical data-management-overviewClinical data-management-overview
Clinical data-management-overview
 
Clinical data management process setup
Clinical data management process  setupClinical data management process  setup
Clinical data management process setup
 

Similar to Data Management and Horizon 2020

Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATOpenAIRE
 
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu | Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu | EUDAT
 
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...EUDAT
 
Intro to Data Management Plans
Intro to Data Management PlansIntro to Data Management Plans
Intro to Data Management PlansSarah Jones
 
DMP health sciences
DMP health sciencesDMP health sciences
DMP health sciencesSarah Jones
 
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...EUDAT
 
DataONE Education Module 02: Data Sharing
DataONE Education Module 02: Data SharingDataONE Education Module 02: Data Sharing
DataONE Education Module 02: Data SharingDataONE
 
Managing data throughout the research lifecycle
Managing data throughout the research lifecycleManaging data throughout the research lifecycle
Managing data throughout the research lifecycleMarieke Guy
 
Open Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon HodsonOpen Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon HodsonAfrican Open Science Platform
 
Creating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant ApplicationCreating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant ApplicationHistoric Environment Scotland
 
Creating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant ApplicationCreating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant ApplicationEDINA, University of Edinburgh
 
Research Lifecycles and RDM
Research Lifecycles and RDMResearch Lifecycles and RDM
Research Lifecycles and RDMMarieke Guy
 
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...ICPSR
 
University of Hertfordshire researcher development - research data management
University of Hertfordshire researcher development - research data management University of Hertfordshire researcher development - research data management
University of Hertfordshire researcher development - research data management Bill Worthington
 

Similar to Data Management and Horizon 2020 (20)

Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
 
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu | Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
 
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
 
Research-Data-Management-and-your-PhD
Research-Data-Management-and-your-PhDResearch-Data-Management-and-your-PhD
Research-Data-Management-and-your-PhD
 
Intro to Data Management Plans
Intro to Data Management PlansIntro to Data Management Plans
Intro to Data Management Plans
 
DMP health sciences
DMP health sciencesDMP health sciences
DMP health sciences
 
Research Data Management and your PhD
Research Data Management and your PhDResearch Data Management and your PhD
Research Data Management and your PhD
 
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
 
DataONE Education Module 02: Data Sharing
DataONE Education Module 02: Data SharingDataONE Education Module 02: Data Sharing
DataONE Education Module 02: Data Sharing
 
Managing data throughout the research lifecycle
Managing data throughout the research lifecycleManaging data throughout the research lifecycle
Managing data throughout the research lifecycle
 
Open Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon HodsonOpen Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon Hodson
 
Creating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant ApplicationCreating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant Application
 
Creating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant ApplicationCreating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant Application
 
Research Lifecycles and RDM
Research Lifecycles and RDMResearch Lifecycles and RDM
Research Lifecycles and RDM
 
What is-rdm
What is-rdmWhat is-rdm
What is-rdm
 
DC101 UWE
DC101 UWEDC101 UWE
DC101 UWE
 
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
 
University of Hertfordshire researcher development - research data management
University of Hertfordshire researcher development - research data management University of Hertfordshire researcher development - research data management
University of Hertfordshire researcher development - research data management
 
Introduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD StudentsIntroduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD Students
 
How to elaborate a data management plan
How to elaborate a data management planHow to elaborate a data management plan
How to elaborate a data management plan
 

More from Sarah Jones

Data training tips and tricks
Data training tips and tricksData training tips and tricks
Data training tips and tricksSarah Jones
 
EOSC and libraries
EOSC and librariesEOSC and libraries
EOSC and librariesSarah Jones
 
EOSC Association priorities and activities
EOSC Association priorities and activitiesEOSC Association priorities and activities
EOSC Association priorities and activitiesSarah Jones
 
Managing and sharing data: lessons from the European context
Managing and sharing data: lessons from the European contextManaging and sharing data: lessons from the European context
Managing and sharing data: lessons from the European contextSarah Jones
 
Reflections on Open Science
Reflections on Open ScienceReflections on Open Science
Reflections on Open ScienceSarah Jones
 
MAR comments analysis
MAR comments analysisMAR comments analysis
MAR comments analysisSarah Jones
 
Introduction to Open Science and EOSC
Introduction to Open Science and EOSCIntroduction to Open Science and EOSC
Introduction to Open Science and EOSCSarah Jones
 
EOSC-MAR-update.pptx
EOSC-MAR-update.pptxEOSC-MAR-update.pptx
EOSC-MAR-update.pptxSarah Jones
 
Why is EOSC so hard?
Why is EOSC so hard?Why is EOSC so hard?
Why is EOSC so hard?Sarah Jones
 
The future of FAIR
The future of FAIRThe future of FAIR
The future of FAIRSarah Jones
 
Is Europe ready for Open Science
Is Europe ready for Open ScienceIs Europe ready for Open Science
Is Europe ready for Open ScienceSarah Jones
 
DMPonline: 10 years, 10 lessons
DMPonline: 10 years, 10 lessonsDMPonline: 10 years, 10 lessons
DMPonline: 10 years, 10 lessonsSarah Jones
 
Why institutions need to raise their capabilities to support FAIR
Why institutions need to raise their capabilities to support FAIRWhy institutions need to raise their capabilities to support FAIR
Why institutions need to raise their capabilities to support FAIRSarah Jones
 
It takes more than a village: lessons on building global research commons
It takes more than a village: lessons on building global research commonsIt takes more than a village: lessons on building global research commons
It takes more than a village: lessons on building global research commonsSarah Jones
 
DMPTuuli - what's new?
DMPTuuli - what's new?DMPTuuli - what's new?
DMPTuuli - what's new?Sarah Jones
 
DCC and FAIR initiatives
DCC and FAIR initiativesDCC and FAIR initiatives
DCC and FAIR initiativesSarah Jones
 
Reflections on EOSC through the mirror of ARDC
Reflections on EOSC through the mirror of ARDCReflections on EOSC through the mirror of ARDC
Reflections on EOSC through the mirror of ARDCSarah Jones
 
Future EOSC roadmap
Future EOSC roadmapFuture EOSC roadmap
Future EOSC roadmapSarah Jones
 
Global Open Research Commons IG
Global Open Research Commons IGGlobal Open Research Commons IG
Global Open Research Commons IGSarah Jones
 

More from Sarah Jones (20)

Data training tips and tricks
Data training tips and tricksData training tips and tricks
Data training tips and tricks
 
EOSC and libraries
EOSC and librariesEOSC and libraries
EOSC and libraries
 
EOSC Association priorities and activities
EOSC Association priorities and activitiesEOSC Association priorities and activities
EOSC Association priorities and activities
 
Managing and sharing data: lessons from the European context
Managing and sharing data: lessons from the European contextManaging and sharing data: lessons from the European context
Managing and sharing data: lessons from the European context
 
Reflections on Open Science
Reflections on Open ScienceReflections on Open Science
Reflections on Open Science
 
MAR comments analysis
MAR comments analysisMAR comments analysis
MAR comments analysis
 
Introduction to Open Science and EOSC
Introduction to Open Science and EOSCIntroduction to Open Science and EOSC
Introduction to Open Science and EOSC
 
EOSC-MAR-update.pptx
EOSC-MAR-update.pptxEOSC-MAR-update.pptx
EOSC-MAR-update.pptx
 
Intro-EOSC.pptx
Intro-EOSC.pptxIntro-EOSC.pptx
Intro-EOSC.pptx
 
Why is EOSC so hard?
Why is EOSC so hard?Why is EOSC so hard?
Why is EOSC so hard?
 
The future of FAIR
The future of FAIRThe future of FAIR
The future of FAIR
 
Is Europe ready for Open Science
Is Europe ready for Open ScienceIs Europe ready for Open Science
Is Europe ready for Open Science
 
DMPonline: 10 years, 10 lessons
DMPonline: 10 years, 10 lessonsDMPonline: 10 years, 10 lessons
DMPonline: 10 years, 10 lessons
 
Why institutions need to raise their capabilities to support FAIR
Why institutions need to raise their capabilities to support FAIRWhy institutions need to raise their capabilities to support FAIR
Why institutions need to raise their capabilities to support FAIR
 
It takes more than a village: lessons on building global research commons
It takes more than a village: lessons on building global research commonsIt takes more than a village: lessons on building global research commons
It takes more than a village: lessons on building global research commons
 
DMPTuuli - what's new?
DMPTuuli - what's new?DMPTuuli - what's new?
DMPTuuli - what's new?
 
DCC and FAIR initiatives
DCC and FAIR initiativesDCC and FAIR initiatives
DCC and FAIR initiatives
 
Reflections on EOSC through the mirror of ARDC
Reflections on EOSC through the mirror of ARDCReflections on EOSC through the mirror of ARDC
Reflections on EOSC through the mirror of ARDC
 
Future EOSC roadmap
Future EOSC roadmapFuture EOSC roadmap
Future EOSC roadmap
 
Global Open Research Commons IG
Global Open Research Commons IGGlobal Open Research Commons IG
Global Open Research Commons IG
 

Recently uploaded

Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdfChristopherTHyatt
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 

Recently uploaded (20)

Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 

Data Management and Horizon 2020

  • 1. Managing research data and Horizon 2020 Sarah Jones Digital Curation Centre, Glasgow sarah.jones@glasgow.ac.uk Twitter: @sjDCC Consorcio Madroño conference on Data Management Plans and Horizon 2020, ETSI Industriales, Madrid, 25th February 2015 Funded by:
  • 2. Benefits and drivers WHY MANAGE RESEARCH DATA?
  • 3. What is Research Data Management? The active management of data throughout the lifecycle Create Document Use Store Share Preserve • Data Management Planning • Creating data • Documenting data • Accessing / using data • Storage and backup • Selecting what to keep • Sharing data • Data licensing and citation • Preserving data • …
  • 4. Why manage research data? Direct benefits for you • To make your research easier! • Stop yourself drowning in irrelevant stuff • Have data organised so you know which versions are most up-to-date • Make sure you can understand and reuse your data again later Research integrity • To avoid accusations of fraud or bad science • Evidence findings and enable validation of research methods • Codes of practice on good research conduct • Many research funders worldwide now require Data Management and Sharing Plans Potential to share • So others can reuse and build on your data • To gain credit – several studies have shown higher citation rates when data are shared • For greater visibility, impact and new research collaborations • Promote innovation and allow research in your field to advance faster
  • 5. It’s part of good research practice
  • 6. Science as an open enterprise https://royalsociety.org/policy/projects/science-public-enterprise/Report “Much of the remarkable growth of scientific understanding in recent centuries is due to open practices; open communication and deliberation sit at the heart of scientific practice.” The Royal Society report calls for ‘intelligent openness’ whereby data are accessible, intelligible, assessable and usable.
  • 7. More scientific breakthroughs www.nytimes.com/2010/08/13/health/research/13alzheimer.html?pagewanted=all&_r=0 “It was unbelievable. Its not science the way most of us have practiced in our careers. But we all realised that we would never get biomarkers unless all of us parked our egos and intellectual property noses outside the door and agreed that all of our data would be public immediately.” Dr John Trojanowski, University of Pennsylvania
  • 8. Increased use and economic benefit UP TO 2008 Sold through the US Geological Survey for US$600 per scene Sales of 19,000 scenes per year Annual revenue of $11.4 million SINCE 2009 Freely available over the internet Google Earth now uses the images Transmission of 2,100,000 scenes per year. Estimated to have created value for the environmental management industry of $935 million, with direct benefit of more than $100 million per year to the US economy Has stimulated the development of applications from a large number of companies worldwide The case of NASA Landsat satellite imagery of the Earth’s surface: http://earthobservatory.nasa.gov/IOTD/view.php?id=83394&src=ve
  • 9. But there are also opportunity costs By Emilio Bruna http://brunalab.org/blog/2014/09/04/the-opportunity-cost- of-my-openscience-was-35-hours-690 For his most recent paper: 1. Double checking the main dataset and reformatting to submit to Dryad: 5 hours 2. Creating complementary file and preparing metadata: 3 hours 3. Submission of these two files and the metadata to Dryad: 45 minutes 4. Preparing a map of the locations: 1 hour 5. Submission of map to Figshare: 15 minutes 6. Cleaning up and documenting the code, uploading it to GitHub: 25 hours 7. Cost of archiving in Dryad: US$90 8. Page Charges: $600
  • 10. So what needs to change? Conclusions from Emilio Bruna: • Develop a better system of incentives from the community for archiving data and code • Teach our students how to do this NOW - it’s much easier if you develop good habits early • Minimise the actual and opportunity costs We need to stop telling people “You should” and get better at telling people “Here’s how”
  • 11. 10 things to think about HOW TO MANAGE DATA? Image CC-BY-NC-SA by Leo Reynolds www.flickr.com/photos/lwr/13442910354
  • 12. 1. What file formats are most appropriate? • Do you have a choice or do the instruments you use only export in certain formats? • What is common in your field? Try to use something that is accepted and widespread. • Does your data centre recommend formats? If so it’s best to use these. If you want your data to be re-used and sustainable in the long-term, you typically want to opt for open, non-proprietary formats. Type Recommended Avoid for data sharing Tabular data CSV, TSV, SPSS portable Excel Text Plain text, HTML, RTF PDF/A only if layout matters Word Media Container: MP4, Ogg Codec: Theora, Dirac, FLAC Quicktime H264 Images TIFF, JPEG2000, PNG GIF, JPG Structured data XML, RDF RDBMS Further examples: http://www.data-archive.ac.uk/create-manage/format/formats-table
  • 13. 2. How will you name your files? • Keep file and folder names short, but meaningful • Agree a method for versioning • Include dates in a set format e.g. YYYYMMDD • Avoid using non-alphanumeric characters in file names • Use hyphens or underscores not spaces e.g. day-sheet, day_sheet • Order the elements in the most appropriate way to retrieve the record www.jiscdigitalmedia.ac.uk/guide/choosing-a-file-name Example from ARM Climate Research Facility www.arm.gov/data/docs/plan
  • 14. 3. Can others understand the data? Think about what is needed in order to find, evaluate, understand, and reuse the data. • Have you documented what you did and how? • Did you develop code to run analyses? If so, this should be kept and shared too. • Is it clear what each bit of your dataset means? Make sure the units are labelled and abbreviations explained. • Record metadata so others can find your work e.g. title, date, creator(s), subject, format, rights…,
  • 15. 4. What metadata standards will you use? Use relevant standards for interoperability www.dcc.ac.uk/resources/metadata-standards
  • 16. 5. Where will you store the data? • Your own device (laptop, flash drive, server etc.) – And if you lose it? Or it breaks? • Departmental drives or university servers • “Cloud” storage – Do they care as much about your data as you do? The decision will be based on how sensitive your data are, how robust you need the storage to be, and who needs access to the data and when
  • 17. 6. Who will do the backup? Use managed services where possible (e.g. University filestores rather than local or external hard drives), so backup is done automatically 3… 2… 1… backup! at least 3 copies of a file on at least 2 different media with at least 1 offsite Ask central IT team for advice
  • 18. 7. Can you publish / share your data? • Who owns the data? • Have you got consent for sharing? • Do any licences you’ve signed permit sharing? • Is the data in suitable formats? • Is there enough documentation?
  • 19. www.dcc.ac.uk/resources/how-guides/license-research-data 8. How will you license your data? • Can your data be made available openly? e.g. CC0 or CC-BY • Do you need to place certain restrictions on who can use the data or how? This DCC how-to guide outlines pros and cons of each approach and gives practical advice on how to implement your licence.
  • 20. 9. Which data need to be kept? Five steps to follow ① Could this data be re-used ② Must it be kept as evidence or for legal reasons ③ Should it be kept for its potential value ④ Consider costs – do benefits outweigh cost? ⑤ Evaluate criteria to decide what to keep 5 steps to decide what data to keep www.dcc.ac.uk/resources/how-guides/five-steps-decide-what-data-keep
  • 21. 10. Who will share & preserve the data? http://databib.org http://service.re3data.org/search Zenodo • Joint effort by OpenAIRE-CERN • Multidisciplinary repository • Multiple data types – Publications – Long tail of research data • Citable data (DOI) • Links funding, publications, data & software www.zenodo.org • Does your publisher or funder suggest a repository? • Are there data centres or community databases for your discipline? • Does your university offer support for long-term preservation?
  • 22. Managing and sharing data: a best practice guide http://data-archive.ac.uk/media/2894/managingsharing.pdf
  • 23. Guidance and support to meet the EC requirements HORIZON 2020 OPEN DATA PILOT Image CC-BY-NC-SA by Tom Magllery www.flickr.com/photos/lwr/13442910354
  • 24. Why open access and open data? “The European Commission’s vision is that information already paid for by the public purse should not be paid for again each time it is accessed or used, and that it should benefit European companies and citizens to the full.” http://ec.europa.eu/research/participants/data/ ref/h2020/grants_manual/hi/oa_pilot/h2020-hi-oa- pilot-guide_en.pdf
  • 25. What is open data?  make your stuff available on the Web (whatever format) under an open licence  make it available as structured data (e.g. Excel instead of a scan of a table)  use non-proprietary formats (e.g. CSV instead of Excel)  use URIs to denote things, so that people can point at your stuff  link your data to other data to provide context Tim Berners-Lee’s proposal for five star open data - http://5stardata.info “Open data and content can be freely used, modified and shared by anyone for any purpose” http://opendefinition.org
  • 26. How to make data open? 1. Choose your dataset(s) What can you may open? You may need to revisit this step if you encounter problems later. 2. Apply an open license Determine what IP exists. Apply a suitable licence e.g. CC-BY 3. Make the data available Provide the data in a suitable format. Use repositories. 4. Make it discoverable Post on the web, register in catalogues… https://okfn.org
  • 27. Support on Data Management Plans What to cover in DMPs: 1. Description of data to be collected / created 2. Standards and methodologies for data collection & management 3. Any issues or restrictions due to ethics and Intellectual Property 4. Plans for data sharing and access 5. Strategy for long-term preservation Example DMPs, guidance, tools and support at: www.dcc.ac.uk/resources/data-management-plans
  • 28. OpenAIRE http://vimeo.com/108790101 Open Access Infrastructure for research in Europe • aggregates data on OA publications • mines & enriches it content by linking thing together • provides services & APIs e.g. to generate publication lists www.openaire.eu
  • 29. EUDAT Data Infrastructure project offering various services: • B2DROP: sync & exchange data • B2SHARE: store & share • B2STAGE: get data to computation • B2SAFE: replicate data safely • B2FIND: find data They also offer a licensing wizard: http://ufal.github.io/lindat-license-selector www.eudat.eu
  • 31. FOSTER open science • Open access and open data training across Europe • Portal with access to reusable learning objects • e-learning courses forthcoming • Check out the training programme www.fosteropenscience.eu/events Sharing European Research Outcomes: Raising Awareness on Open Access, Open Research Data and Open Science May 13 (Madrid) – 14 (Valencia) – 28 (Gijón)
  • 32. Gracias por su atención DCC guidance, tools & case studies: www.dcc.ac.uk/resources Follow us on twitter: @digitalcuration and #ukdcc

Editor's Notes

  1. The Royal Society report ‘Science as an Open Enterprise’ makes a similar point. This emphasises that much of the growth of scientific understanding is due to open practices. Being open about your work and encouraging feedback from others is at the heart of scientific practice. The report calls for ‘intelligent openness’ – data shouldn’t just be accessible, they need to be intelligible by others so they can assess and reuse them.
  2. Certain research communities have also seen the benefit of sharing data as it speeds up the process of discovery. This article shows how researchers in the field of Alzheimer’s research have agreed as a community to share data immediately to make scientific breakthroughs.
  3. There’s also an economic benefit, as seen by the case of the NASA landsat satellite images. These were sold until 2008 for $600 a scene. Now they’re freely available and used by Google Earth. Previously they sold 19,000 images a year, whereas now they transmit 2.1 million. The revenue has gone up incredibly too from $11.4 million to an estimated value of $935 million with direct benefit of more than $100 million. The release has also stimulated the development of applications from companies worldwide. This case study comes from the Royal Society Report on Science as an Open Enterprise.
  4. It’s not all positive though – otherwise why isn’t everyone already doing this. There is a certain amount of effort and cost to open science, which this blog post by Emilio Bruna highlights. He calculated the cost of sharing his data for one paper and came to a total of 35 hours and $690. He breaks this down into the cost of preparing the dataset, creating complementary metadata and associated files, cleaning up and documenting the code (which involves a big mental leap), and the charges applied.
  5. He suggests what needs to change and I think this is quite pertinent for today. We need a better system of incentives so you get more tangible returns from openness – better chances on grant applications, more likelihood of promotion… This is likely to evolve during your careers. If we teach people what to do now, it will become part of your processes and be easier in the long run. The learning curve won’t be as steep. We also need to focus not on why you should be open, but how – and that’s what today’s presentations should give you.
  6. Guidance from the DCC can also help researchers to understand data licensing. This guide outlines the pros and cons of each approach e.g. the limitations of some CC options
  7. The background to this is about making the most of the data that has been created through publicly funded research. The guidelines speak of: Improved quality of results Greater efficiency Faster to market = faster growth Improved transparency of the scientific process
  8. Open science is also about making data available. Open data is data that can be used, modified and shared by anyone for any purpose. So for example it can be mined, exploited, reproduced, commercialised... Data under a non-commercial license wouldn’t be open as you’re restricting how it can be used and by whom. There are different levels of openness. Making content available on the web, in any format, under an open license is open data. It’s not necessarily that reusable though. By making it available as structured data, in non-proprietary formats, using URIs and linking to other data for context, makes it much more valuable.
  9. The Open Knowledge Foundation suggests four simple steps to make data open. First off you need to decide what to share. Not all data can be made openly available due to commercial restrictions or sensitivities. Once you’ve decided what to share, determine what IP exists and apply a suitable licence. You should then make the data available in a suitable format so others can bulk download it. Remember that for it to be useful you want to share appropriate metadata and documentation too. Using repositories is useful to make sure your data are properly managed and preserved for the long-term. The final step is to make your data discoverable. Put it online, tell others about it and add details to various registries so it gets found.
  10. OpenAIRE is also worth checking out. This is an EC-funded project to provide infrastructure for open access. They’ve recently released a short video that tells you how they can help. Essentially OpenAIRE aggregates metadata from different repositories to compile a complete list of publications and related outputs. They mine and enrich the content, de-duplicating entries and linking together publications with data, details about the project, authors, funders etc. OpenAIRE also provides a number of useful services & APIs, for example you can embed a publication list for your project in your website that is automatically updated whenever someone adds a new paper to a repository (this is harvested into OpenAIRE and pushed out to your list).