SlideShare a Scribd company logo
1 of 58
Download to read offline
Digital preservation
caring for our data to foster
 knowledge discovery and
       dissemination

     Claudia Bauzer Medeiros
      Institute of Computing
             UNICAMP
Pre-Saervare
 (Before) – (Save)
= save before disappears
Maintain
    Manu-tenere

= being able to get/find it
Dec 2008




Feb 2010
Data deluge
• At end of 2011 – info created and replicated > 1.8 zettabytes

• 90% data created in the last 2 years

• 5 hour flight – 240 Tbytes

• Facebook – 200 million users, >70 languages

• Each person in England is filmed 300 times/day

• Teenagers in the US send average 110 phone text messages a day

=> We need to build arks during the deluge - PRESERVATION
Outline
•   Why preserve?
•   What to preserve?
•   How to preserve?
•   Where to preserve?

And a few associated challenges
Outline
•   Why preserve?
•   What to preserve?
•   How to preserve?
•   Where to preserve?

And a few associated challenges
WHY PRESERVE
• Costly to produce

• Contribute to progress of science

• Intrinsic value
  culture/science/sustainability
WHY PRESERVE
• Costly to produce
   – Infrastructure, power, software, models, visualization,
     people
   – Hardware, Software, Peopleware
• Contribute to progress of science
   – Reproducibility and reusability
   – Publication and sharing
   – Quality
• Intrinsic value culture/science/sustainability
   – Digital humanities
   – Domesday project
   – Fonoteca Neotropical Jacques Vieillard
WHY PRESERVE
• Costly to produce
   – Infrastructure, power, software, models, visualization,
     people
   – Hardware, Software, Peopleware
• Contribute to progress of science
   – Reproducibility and reusability
   – Publication and sharing
   – Quality
• Intrinsic value culture/science/sustainability
   – Digital humanities
   – Domesday project
   – Fonoteca Neotropical Jacques Vieillard
WHY PRESERVE
• Costly to produce
   – Infrastructure, power, software, models, visualization,
     people
   – Hardware, Software, Peopleware
• Contribute to progress of science
   – Reproducibility and reusability
   – Publication and sharing
   – Quality
• Intrinsic value culture/science/sustainability
   – Digital humanities
   – Domesday project
   – Fonoteca Neotropical Jacques Vieillard
The Domesday Project 1086-1986
• Digital decay
• Equipment obsolescence
• Software obsolescence
Domesday reloaded
Fonoteca
Neotropical
Jacques
Vieillard
Outline
• Why preserve?

• What to preserve?
• How to preserve?

And associated challenges
What to preserve?
• Data

• BUT what is “data”?



• Only data?
What to preserve?
• Data
• BUT what is “data”?
  – Files and records
  – Models, documentation, annotations, sketches,
    experiments, recordings
• Only data?
What to preserve?
• Data
• BUT what is “data”?
  – Files and records
  – Models, documentation, annotations, sketches,
    experiments, recordings
• Only data?
  – How produced it – workflows, devices,
    methodologies, materials and methods,
    reasonings, logs --- provenance
What to preserve?
• Data
• Environment in which was produced

• Data needed to preserve occupies more space
  than the data itself
• Preservation means storing more than object
  itself
What about our research data?
               (slide adapted from Jim Gray)
Experiments
 Instruments

  Files                           Questions

  Papers                          Answers

   Simulations
          Models


             DATA



Data-driven science                    “Collaboratory”


                                                         23/10000
Data sources?
    Table of Product Characteristics
   id        Property name Value
 MilkProd     productsrep     MilkA
 MilkProd       quantity      10000
 MilkProd     validity date 10/06/2006
CheeseProd productsr          Minas
CheeseProd    epquantity      2000
CheeseProd validity date 12/02/2006
CheeseProd      shape        Circular




                                                         24/10000
eEnvironmental Science
• Direct and indirect observations




                                     25/10000
Data sources




               26/10000
27/10000
We are
 DATASCOPE
 engineers


Software is the
      device/tool
Outline
• Why preserve?
• What to preserve?

• How to preserve?

And associated challenges
How to preserve?

How to construct the ark during the
             deluge?

Presaervare, Manutenere and Share
How to preserve?
• To ensure retrievability and sharing
  – Index structures
  – Ontologies, metadata, keywords, standards
  – Workflows
• To ensure longevity
  – Media decay, software decay, hardware decay

• To ensure quality
  – Curation procedures
• To afford maintenance costs
  – Cloud? CAP theorem?
How to preserve?
• To ensure retrievability and sharing
  – Index structures
  – Ontologies, metadata, keywords, standards
  – Workflows
• To ensure longevity
  – Media decay, software decay, hardware decay

• To ensure quality
  – Curation procedures
• To afford maintenance costs
  – Cloud? CAP theorem?
How to preserve?
• To ensure retrievability and sharing
  – Index structures
  – Ontologies, metadata, keywords, standards
  – Workflows
• To ensure longevity
  – Media decay, software decay, hardware decay

• To ensure quality
  – Curation procedures
• To afford maintenance costs
  – Cloud? CAP theorem?
How to preserve?
• To ensure retrievability and sharing
  – Index structures
  – Ontologies, metadata, keywords, standards
  – Workflows
• To ensure longevity
  – Media decay, software decay, hardware decay

• To ensure quality
  – Curation procedures, metadata,standards
• To afford maintenance costs
  – Cloud? CAP theorem?
How to preserve?
• To ensure retrievability and sharing
  – Index structures
  – Ontologies, metadata, keywords, standards
  – Workflows
• To ensure longevity
  – Media decay, software decay, hardware decay

• To ensure quality
  – Curation procedures,metadata, standards
• To afford maintenance costs
  – Cloud? CAP theorem? =======     WHERE
How to preserve?
• To ensure retrievability and sharing
  – Index structures
  – Ontologies, metadata, keywords, standards
  – Workflows
• To ensure longevity
  – Media decay, software decay, hardware decay
  – PEOPLE DECAY
• To ensure quality
  – Curation procedures,metadata, standards
• To afford maintenance costs
  – Cloud? CAP theorem? =======     WHERE
Sharing and open access

NSF Data Management Policy

 Paper and data publication
Sharing of Data Leads to Progress on Alzheimer’s
                                        By GINA KOLATA
                                   Published: August 12, 2010
                                      = NEW YORK TIMES

In 2003, a group of scientists and executives from the National Institutes of Health, the Food and
Drug Administration, the drug and medical-imaging industries, universities and nonprofit groups
joined in a project that experts say had no precedent: a collaborative effort to find the biological
          markers that show the progression of Alzheimer’s disease in the human brain.



   share all the data, making every single
  finding public immediately, available to
 anyone with a computer anywhere in the
                    world
        => AVAILABILITY and REUSE
• Data must be properly curated throughout its
  life-cycle and released with the appropriate
  high-quality metadata.
• Medical Research Council UK




                                           40/10000
• Research data should be made available for
  use by other researchers. Researchers must
  retain research data, including electronic data,
  in a durable, indexed and retrievable form.
• Australian Govnmt National Health and
  Medical Research Council



                                              41/10000
Microsoft Academic Search
40M publications
19M authors
75 publishers (Wiley, Springer, ACM, IEEE …)




                                               42/10000
Google Scholar Citations




                      43/10000
• Citing data is as important as citing papers
• For researchers, publishers, data centers
• Over 1M DOI, several major national research
  libraries
  – Germany, France, Korea, Netherlands, Australia,
    USA...
• Present manager – German National Library of
  Science and Technology

                                                 44/10000
Publish on the Cloud
Add metadata
Pre-print sharing




                       45/10000
FNJV
       proj.lis.ic.unicamp.br/fnjv
• Sharing by publishing on the Web
• Retrievability by extending metadata




                                         46/10000
CURATION AND USE OF STANDARDS
Workflows and model preservation
Workflows and model preservation
         Comb-e-Chem
                   Video
                                                    Simulation

                                                                 Properties

                           Analysis
  Diffractometer




                                           Structures
                                           Database




X-Ray                                                                   Properties
e-Lab                                                                   e-Lab

                                      Grid Middleware

                                                                          52/10000
The cloud and CAP
Outline
•   Why preserve?
•   What to preserve?
•   How to preserve?
•   Where to preserve?

And a few associated challenges
PRE-SAVE and MANU-TENERE
Outline
• Why preserve?
  – Costly to produce (hardware, software, peopleware)
  – Contribute to progress of science
  – Value – culture, science, sustainability
• What to preserve?
  – Data [WHAT IS DATA?]
  – Context of production and use
• How to preserve?
  – Accessibility and sharing – standards, metadata,
    ontologies
  – Integrity and quality – context to use (hw, sw),
    standards
References
•




             56/10000
References
NSF – CISE Data management policy
The Domesday Project
http://www.atsf.co.uk/dottext/domesday.html
The CLARIN Project (languages)
Eigenfactor.org
Altmetrics movement
Thank you!!!!

More Related Content

What's hot

Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science iWf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science iJose Enrique Ruiz
 
If we build it will they come? BOSC2012 Keynote Goble
If we build it will they come? BOSC2012 Keynote GobleIf we build it will they come? BOSC2012 Keynote Goble
If we build it will they come? BOSC2012 Keynote GobleCarole Goble
 
If we build it will they come?
If we build it will they come?If we build it will they come?
If we build it will they come?myGrid team
 
Status of Alien Invasive Species Information in Canada
Status of Alien Invasive Species Information in CanadaStatus of Alien Invasive Species Information in Canada
Status of Alien Invasive Species Information in CanadaHans Herrmann
 
Saving private data, sharing Open Data? Role of libraries and institutional r...
Saving private data, sharing Open Data? Role of libraries and institutional r...Saving private data, sharing Open Data? Role of libraries and institutional r...
Saving private data, sharing Open Data? Role of libraries and institutional r...Chris Rusbridge
 
Challenges in setting up an RDM Support Service
Challenges in setting up an RDM Support ServiceChallenges in setting up an RDM Support Service
Challenges in setting up an RDM Support ServiceGarethKnight
 
Federation and Interoperability in the Nectar Research Cloud
Federation and Interoperability in the Nectar Research CloudFederation and Interoperability in the Nectar Research Cloud
Federation and Interoperability in the Nectar Research CloudOpenStack
 

What's hot (8)

Just Digitise It - Daniel Wilksch - 2015
Just Digitise It - Daniel Wilksch - 2015Just Digitise It - Daniel Wilksch - 2015
Just Digitise It - Daniel Wilksch - 2015
 
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science iWf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
 
If we build it will they come? BOSC2012 Keynote Goble
If we build it will they come? BOSC2012 Keynote GobleIf we build it will they come? BOSC2012 Keynote Goble
If we build it will they come? BOSC2012 Keynote Goble
 
If we build it will they come?
If we build it will they come?If we build it will they come?
If we build it will they come?
 
Status of Alien Invasive Species Information in Canada
Status of Alien Invasive Species Information in CanadaStatus of Alien Invasive Species Information in Canada
Status of Alien Invasive Species Information in Canada
 
Saving private data, sharing Open Data? Role of libraries and institutional r...
Saving private data, sharing Open Data? Role of libraries and institutional r...Saving private data, sharing Open Data? Role of libraries and institutional r...
Saving private data, sharing Open Data? Role of libraries and institutional r...
 
Challenges in setting up an RDM Support Service
Challenges in setting up an RDM Support ServiceChallenges in setting up an RDM Support Service
Challenges in setting up an RDM Support Service
 
Federation and Interoperability in the Nectar Research Cloud
Federation and Interoperability in the Nectar Research CloudFederation and Interoperability in the Nectar Research Cloud
Federation and Interoperability in the Nectar Research Cloud
 

Viewers also liked

Small Solutions for Small Institutions – Steps Towards Archiving and Preserva...
Small Solutions for Small Institutions – Steps Towards Archiving and Preserva...Small Solutions for Small Institutions – Steps Towards Archiving and Preserva...
Small Solutions for Small Institutions – Steps Towards Archiving and Preserva...ariadnenetwork
 
Legal Hold and Data Preservation Best Practices
Legal Hold and Data Preservation Best PracticesLegal Hold and Data Preservation Best Practices
Legal Hold and Data Preservation Best PracticesZapproved
 
Research bites: Digital Preservation for Research Data
Research bites: Digital Preservation for Research DataResearch bites: Digital Preservation for Research Data
Research bites: Digital Preservation for Research DataLancaster University Library
 
D.3.1: State of the Art - Linked Data and Digital Preservation
D.3.1: State of the Art - Linked Data and Digital PreservationD.3.1: State of the Art - Linked Data and Digital Preservation
D.3.1: State of the Art - Linked Data and Digital PreservationPRELIDA Project
 
Digital Preservation
Digital PreservationDigital Preservation
Digital Preservationsmtcd
 
Digital preservation
Digital preservationDigital preservation
Digital preservationSarika Sawant
 
Basic of Human Resource Management
Basic of Human Resource ManagementBasic of Human Resource Management
Basic of Human Resource ManagementAshit Jain
 
Introduction to human resource management
Introduction to human resource managementIntroduction to human resource management
Introduction to human resource managementTanuj Poddar
 
Human resource management ppt
Human resource management ppt Human resource management ppt
Human resource management ppt Babasab Patil
 
Human Resource Management
Human Resource ManagementHuman Resource Management
Human Resource Managementgumbhir singh
 

Viewers also liked (13)

Data preservation
Data preservationData preservation
Data preservation
 
Small Solutions for Small Institutions – Steps Towards Archiving and Preserva...
Small Solutions for Small Institutions – Steps Towards Archiving and Preserva...Small Solutions for Small Institutions – Steps Towards Archiving and Preserva...
Small Solutions for Small Institutions – Steps Towards Archiving and Preserva...
 
Legal Hold and Data Preservation Best Practices
Legal Hold and Data Preservation Best PracticesLegal Hold and Data Preservation Best Practices
Legal Hold and Data Preservation Best Practices
 
Research bites: Digital Preservation for Research Data
Research bites: Digital Preservation for Research DataResearch bites: Digital Preservation for Research Data
Research bites: Digital Preservation for Research Data
 
D.3.1: State of the Art - Linked Data and Digital Preservation
D.3.1: State of the Art - Linked Data and Digital PreservationD.3.1: State of the Art - Linked Data and Digital Preservation
D.3.1: State of the Art - Linked Data and Digital Preservation
 
Data preservation 101
Data preservation 101Data preservation 101
Data preservation 101
 
Is Violent Crime Increasing or Decreasing?
Is Violent Crime Increasing or Decreasing?Is Violent Crime Increasing or Decreasing?
Is Violent Crime Increasing or Decreasing?
 
Digital Preservation
Digital PreservationDigital Preservation
Digital Preservation
 
Digital preservation
Digital preservationDigital preservation
Digital preservation
 
Basic of Human Resource Management
Basic of Human Resource ManagementBasic of Human Resource Management
Basic of Human Resource Management
 
Introduction to human resource management
Introduction to human resource managementIntroduction to human resource management
Introduction to human resource management
 
Human resource management ppt
Human resource management ppt Human resource management ppt
Human resource management ppt
 
Human Resource Management
Human Resource ManagementHuman Resource Management
Human Resource Management
 

Similar to Ensuring access to valuable digital data

ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012Lee Dirks
 
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...Bonnie Hurwitz
 
The Data Management Ecosystem
The Data Management EcosystemThe Data Management Ecosystem
The Data Management EcosystemJohn Kunze
 
RDAP13 John Kunze: The Data Management Ecosystem
RDAP13 John Kunze: The Data Management EcosystemRDAP13 John Kunze: The Data Management Ecosystem
RDAP13 John Kunze: The Data Management EcosystemASIS&T
 
Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12ASIS&T
 
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...Ardan Patwardhan
 
Preserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of ScholarshipPreserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of Scholarshiptsbbbu
 
Graham Pryor
Graham PryorGraham Pryor
Graham PryorEduserv
 
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...BigData_Europe
 
Collaboration and Sharing
Collaboration and SharingCollaboration and Sharing
Collaboration and SharingJisc
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 Scott Edmunds
 
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...Ola Spjuth
 
An Oz Mammals Bioinformatics and Data Resource
An Oz Mammals Bioinformatics and Data ResourceAn Oz Mammals Bioinformatics and Data Resource
An Oz Mammals Bioinformatics and Data ResourcePhilippa Griffin
 
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Vince Smith
 
10th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v210th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v2Alex Hardisty
 
Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10Jeroen Rombouts
 
Deroure Repo3
Deroure Repo3Deroure Repo3
Deroure Repo3guru122
 

Similar to Ensuring access to valuable digital data (20)

ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
 
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
 
The Data Management Ecosystem
The Data Management EcosystemThe Data Management Ecosystem
The Data Management Ecosystem
 
RDAP13 John Kunze: The Data Management Ecosystem
RDAP13 John Kunze: The Data Management EcosystemRDAP13 John Kunze: The Data Management Ecosystem
RDAP13 John Kunze: The Data Management Ecosystem
 
Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12
 
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...
 
Preserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of ScholarshipPreserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of Scholarship
 
Graham Pryor
Graham PryorGraham Pryor
Graham Pryor
 
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
 
Researh data management
Researh data managementResearh data management
Researh data management
 
Collaboration and Sharing
Collaboration and SharingCollaboration and Sharing
Collaboration and Sharing
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9
 
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
 
An Oz Mammals Bioinformatics and Data Resource
An Oz Mammals Bioinformatics and Data ResourceAn Oz Mammals Bioinformatics and Data Resource
An Oz Mammals Bioinformatics and Data Resource
 
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
 
10th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v210th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v2
 
Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10
 
Big Data
Big Data Big Data
Big Data
 
Deroure Repo3
Deroure Repo3Deroure Repo3
Deroure Repo3
 
Deroure Repo3
Deroure Repo3Deroure Repo3
Deroure Repo3
 

More from Beniamino Murgante

Analyzing and assessing ecological transition in building sustainable cities
Analyzing and assessing ecological transition in building sustainable citiesAnalyzing and assessing ecological transition in building sustainable cities
Analyzing and assessing ecological transition in building sustainable citiesBeniamino Murgante
 
Smart Cities: New Science for the Cities
Smart Cities: New Science for the CitiesSmart Cities: New Science for the Cities
Smart Cities: New Science for the CitiesBeniamino Murgante
 
The evolution of spatial analysis and modeling in decision processes
The evolution of spatial analysis and modeling in decision processesThe evolution of spatial analysis and modeling in decision processes
The evolution of spatial analysis and modeling in decision processesBeniamino Murgante
 
Involving citizens in smart energy approaches: the experience of an energy pa...
Involving citizens in smart energy approaches: the experience of an energy pa...Involving citizens in smart energy approaches: the experience of an energy pa...
Involving citizens in smart energy approaches: the experience of an energy pa...Beniamino Murgante
 
Programmazione per la governance territoriale in tema di tutela della biodive...
Programmazione per la governance territoriale in tema di tutela della biodive...Programmazione per la governance territoriale in tema di tutela della biodive...
Programmazione per la governance territoriale in tema di tutela della biodive...Beniamino Murgante
 
Involving Citizens in a Participation Process for Increasing Walkability
Involving Citizens in a Participation Process for Increasing WalkabilityInvolving Citizens in a Participation Process for Increasing Walkability
Involving Citizens in a Participation Process for Increasing WalkabilityBeniamino Murgante
 
Presentation of ICCSA 2019 at the University of Saint petersburg
Presentation of ICCSA 2019 at the University of Saint petersburg Presentation of ICCSA 2019 at the University of Saint petersburg
Presentation of ICCSA 2019 at the University of Saint petersburg Beniamino Murgante
 
RISCHIO TERRITORIALE NEL GOVERNO DEL TERRITORIO: Ricerca e formazione nelle s...
RISCHIO TERRITORIALE NEL GOVERNO DEL TERRITORIO: Ricerca e formazione nelle s...RISCHIO TERRITORIALE NEL GOVERNO DEL TERRITORIO: Ricerca e formazione nelle s...
RISCHIO TERRITORIALE NEL GOVERNO DEL TERRITORIO: Ricerca e formazione nelle s...Beniamino Murgante
 
Presentation of ICCSA 2017 at the University of trieste
Presentation of ICCSA 2017 at the University of triestePresentation of ICCSA 2017 at the University of trieste
Presentation of ICCSA 2017 at the University of triesteBeniamino Murgante
 
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...Beniamino Murgante
 
Focussing Energy Consumers’ Behaviour Change towards Energy Efficiency and Lo...
Focussing Energy Consumers’ Behaviour Change towards Energy Efficiency and Lo...Focussing Energy Consumers’ Behaviour Change towards Energy Efficiency and Lo...
Focussing Energy Consumers’ Behaviour Change towards Energy Efficiency and Lo...Beniamino Murgante
 
Socio-Economic Planning profiles: Sciences VS Daily activities in public sector 
Socio-Economic Planning profiles: Sciences VS Daily activities in public sector Socio-Economic Planning profiles: Sciences VS Daily activities in public sector 
Socio-Economic Planning profiles: Sciences VS Daily activities in public sector Beniamino Murgante
 
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...Beniamino Murgante
 
Garden in motion. An experience of citizens involvement in public space regen...
Garden in motion. An experience of citizens involvement in public space regen...Garden in motion. An experience of citizens involvement in public space regen...
Garden in motion. An experience of citizens involvement in public space regen...Beniamino Murgante
 
Planning and Smartness: the true challenge
Planning and Smartness: the true challengePlanning and Smartness: the true challenge
Planning and Smartness: the true challengeBeniamino Murgante
 
GeoSDI: una piattaforma social di dati geografici basata sui principi di INSP...
GeoSDI: una piattaforma social di dati geografici basata sui principi di INSP...GeoSDI: una piattaforma social di dati geografici basata sui principi di INSP...
GeoSDI: una piattaforma social di dati geografici basata sui principi di INSP...Beniamino Murgante
 
Informazione Geografica, Città, Smartness
Informazione Geografica, Città, Smartness Informazione Geografica, Città, Smartness
Informazione Geografica, Città, Smartness Beniamino Murgante
 
Tecnologie, Territorio, Smartness
Tecnologie, Territorio, SmartnessTecnologie, Territorio, Smartness
Tecnologie, Territorio, SmartnessBeniamino Murgante
 

More from Beniamino Murgante (20)

Analyzing and assessing ecological transition in building sustainable cities
Analyzing and assessing ecological transition in building sustainable citiesAnalyzing and assessing ecological transition in building sustainable cities
Analyzing and assessing ecological transition in building sustainable cities
 
Smart Cities: New Science for the Cities
Smart Cities: New Science for the CitiesSmart Cities: New Science for the Cities
Smart Cities: New Science for the Cities
 
The evolution of spatial analysis and modeling in decision processes
The evolution of spatial analysis and modeling in decision processesThe evolution of spatial analysis and modeling in decision processes
The evolution of spatial analysis and modeling in decision processes
 
Smart City or Urban Science?
Smart City or Urban Science?Smart City or Urban Science?
Smart City or Urban Science?
 
Involving citizens in smart energy approaches: the experience of an energy pa...
Involving citizens in smart energy approaches: the experience of an energy pa...Involving citizens in smart energy approaches: the experience of an energy pa...
Involving citizens in smart energy approaches: the experience of an energy pa...
 
Programmazione per la governance territoriale in tema di tutela della biodive...
Programmazione per la governance territoriale in tema di tutela della biodive...Programmazione per la governance territoriale in tema di tutela della biodive...
Programmazione per la governance territoriale in tema di tutela della biodive...
 
Involving Citizens in a Participation Process for Increasing Walkability
Involving Citizens in a Participation Process for Increasing WalkabilityInvolving Citizens in a Participation Process for Increasing Walkability
Involving Citizens in a Participation Process for Increasing Walkability
 
Presentation of ICCSA 2019 at the University of Saint petersburg
Presentation of ICCSA 2019 at the University of Saint petersburg Presentation of ICCSA 2019 at the University of Saint petersburg
Presentation of ICCSA 2019 at the University of Saint petersburg
 
RISCHIO TERRITORIALE NEL GOVERNO DEL TERRITORIO: Ricerca e formazione nelle s...
RISCHIO TERRITORIALE NEL GOVERNO DEL TERRITORIO: Ricerca e formazione nelle s...RISCHIO TERRITORIALE NEL GOVERNO DEL TERRITORIO: Ricerca e formazione nelle s...
RISCHIO TERRITORIALE NEL GOVERNO DEL TERRITORIO: Ricerca e formazione nelle s...
 
Presentation of ICCSA 2017 at the University of trieste
Presentation of ICCSA 2017 at the University of triestePresentation of ICCSA 2017 at the University of trieste
Presentation of ICCSA 2017 at the University of trieste
 
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
 
Focussing Energy Consumers’ Behaviour Change towards Energy Efficiency and Lo...
Focussing Energy Consumers’ Behaviour Change towards Energy Efficiency and Lo...Focussing Energy Consumers’ Behaviour Change towards Energy Efficiency and Lo...
Focussing Energy Consumers’ Behaviour Change towards Energy Efficiency and Lo...
 
Socio-Economic Planning profiles: Sciences VS Daily activities in public sector 
Socio-Economic Planning profiles: Sciences VS Daily activities in public sector Socio-Economic Planning profiles: Sciences VS Daily activities in public sector 
Socio-Economic Planning profiles: Sciences VS Daily activities in public sector 
 
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
 
Garden in motion. An experience of citizens involvement in public space regen...
Garden in motion. An experience of citizens involvement in public space regen...Garden in motion. An experience of citizens involvement in public space regen...
Garden in motion. An experience of citizens involvement in public space regen...
 
Planning and Smartness: the true challenge
Planning and Smartness: the true challengePlanning and Smartness: the true challenge
Planning and Smartness: the true challenge
 
GeoSDI: una piattaforma social di dati geografici basata sui principi di INSP...
GeoSDI: una piattaforma social di dati geografici basata sui principi di INSP...GeoSDI: una piattaforma social di dati geografici basata sui principi di INSP...
GeoSDI: una piattaforma social di dati geografici basata sui principi di INSP...
 
Murgante smart energy
Murgante smart energyMurgante smart energy
Murgante smart energy
 
Informazione Geografica, Città, Smartness
Informazione Geografica, Città, Smartness Informazione Geografica, Città, Smartness
Informazione Geografica, Città, Smartness
 
Tecnologie, Territorio, Smartness
Tecnologie, Territorio, SmartnessTecnologie, Territorio, Smartness
Tecnologie, Territorio, Smartness
 

Recently uploaded

Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room servicediscovermytutordmt
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...Sapna Thakur
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingTeacherCyreneCayanan
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfchloefrazer622
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 

Recently uploaded (20)

Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 

Ensuring access to valuable digital data

  • 1. Digital preservation caring for our data to foster knowledge discovery and dissemination Claudia Bauzer Medeiros Institute of Computing UNICAMP
  • 2. Pre-Saervare (Before) – (Save) = save before disappears
  • 3. Maintain Manu-tenere = being able to get/find it
  • 4.
  • 6. Data deluge • At end of 2011 – info created and replicated > 1.8 zettabytes • 90% data created in the last 2 years • 5 hour flight – 240 Tbytes • Facebook – 200 million users, >70 languages • Each person in England is filmed 300 times/day • Teenagers in the US send average 110 phone text messages a day => We need to build arks during the deluge - PRESERVATION
  • 7. Outline • Why preserve? • What to preserve? • How to preserve? • Where to preserve? And a few associated challenges
  • 8. Outline • Why preserve? • What to preserve? • How to preserve? • Where to preserve? And a few associated challenges
  • 9. WHY PRESERVE • Costly to produce • Contribute to progress of science • Intrinsic value culture/science/sustainability
  • 10. WHY PRESERVE • Costly to produce – Infrastructure, power, software, models, visualization, people – Hardware, Software, Peopleware • Contribute to progress of science – Reproducibility and reusability – Publication and sharing – Quality • Intrinsic value culture/science/sustainability – Digital humanities – Domesday project – Fonoteca Neotropical Jacques Vieillard
  • 11. WHY PRESERVE • Costly to produce – Infrastructure, power, software, models, visualization, people – Hardware, Software, Peopleware • Contribute to progress of science – Reproducibility and reusability – Publication and sharing – Quality • Intrinsic value culture/science/sustainability – Digital humanities – Domesday project – Fonoteca Neotropical Jacques Vieillard
  • 12. WHY PRESERVE • Costly to produce – Infrastructure, power, software, models, visualization, people – Hardware, Software, Peopleware • Contribute to progress of science – Reproducibility and reusability – Publication and sharing – Quality • Intrinsic value culture/science/sustainability – Digital humanities – Domesday project – Fonoteca Neotropical Jacques Vieillard
  • 13. The Domesday Project 1086-1986 • Digital decay • Equipment obsolescence • Software obsolescence
  • 16.
  • 17.
  • 18. Outline • Why preserve? • What to preserve? • How to preserve? And associated challenges
  • 19. What to preserve? • Data • BUT what is “data”? • Only data?
  • 20. What to preserve? • Data • BUT what is “data”? – Files and records – Models, documentation, annotations, sketches, experiments, recordings • Only data?
  • 21. What to preserve? • Data • BUT what is “data”? – Files and records – Models, documentation, annotations, sketches, experiments, recordings • Only data? – How produced it – workflows, devices, methodologies, materials and methods, reasonings, logs --- provenance
  • 22. What to preserve? • Data • Environment in which was produced • Data needed to preserve occupies more space than the data itself • Preservation means storing more than object itself
  • 23. What about our research data? (slide adapted from Jim Gray) Experiments Instruments Files Questions Papers Answers Simulations Models DATA Data-driven science “Collaboratory” 23/10000
  • 24. Data sources? Table of Product Characteristics id Property name Value MilkProd productsrep MilkA MilkProd quantity 10000 MilkProd validity date 10/06/2006 CheeseProd productsr Minas CheeseProd epquantity 2000 CheeseProd validity date 12/02/2006 CheeseProd shape Circular 24/10000
  • 25. eEnvironmental Science • Direct and indirect observations 25/10000
  • 26. Data sources 26/10000
  • 28. We are DATASCOPE engineers Software is the device/tool
  • 29. Outline • Why preserve? • What to preserve? • How to preserve? And associated challenges
  • 30. How to preserve? How to construct the ark during the deluge? Presaervare, Manutenere and Share
  • 31. How to preserve? • To ensure retrievability and sharing – Index structures – Ontologies, metadata, keywords, standards – Workflows • To ensure longevity – Media decay, software decay, hardware decay • To ensure quality – Curation procedures • To afford maintenance costs – Cloud? CAP theorem?
  • 32. How to preserve? • To ensure retrievability and sharing – Index structures – Ontologies, metadata, keywords, standards – Workflows • To ensure longevity – Media decay, software decay, hardware decay • To ensure quality – Curation procedures • To afford maintenance costs – Cloud? CAP theorem?
  • 33. How to preserve? • To ensure retrievability and sharing – Index structures – Ontologies, metadata, keywords, standards – Workflows • To ensure longevity – Media decay, software decay, hardware decay • To ensure quality – Curation procedures • To afford maintenance costs – Cloud? CAP theorem?
  • 34. How to preserve? • To ensure retrievability and sharing – Index structures – Ontologies, metadata, keywords, standards – Workflows • To ensure longevity – Media decay, software decay, hardware decay • To ensure quality – Curation procedures, metadata,standards • To afford maintenance costs – Cloud? CAP theorem?
  • 35. How to preserve? • To ensure retrievability and sharing – Index structures – Ontologies, metadata, keywords, standards – Workflows • To ensure longevity – Media decay, software decay, hardware decay • To ensure quality – Curation procedures,metadata, standards • To afford maintenance costs – Cloud? CAP theorem? ======= WHERE
  • 36. How to preserve? • To ensure retrievability and sharing – Index structures – Ontologies, metadata, keywords, standards – Workflows • To ensure longevity – Media decay, software decay, hardware decay – PEOPLE DECAY • To ensure quality – Curation procedures,metadata, standards • To afford maintenance costs – Cloud? CAP theorem? ======= WHERE
  • 37. Sharing and open access NSF Data Management Policy Paper and data publication
  • 38.
  • 39. Sharing of Data Leads to Progress on Alzheimer’s By GINA KOLATA Published: August 12, 2010 = NEW YORK TIMES In 2003, a group of scientists and executives from the National Institutes of Health, the Food and Drug Administration, the drug and medical-imaging industries, universities and nonprofit groups joined in a project that experts say had no precedent: a collaborative effort to find the biological markers that show the progression of Alzheimer’s disease in the human brain. share all the data, making every single finding public immediately, available to anyone with a computer anywhere in the world => AVAILABILITY and REUSE
  • 40. • Data must be properly curated throughout its life-cycle and released with the appropriate high-quality metadata. • Medical Research Council UK 40/10000
  • 41. • Research data should be made available for use by other researchers. Researchers must retain research data, including electronic data, in a durable, indexed and retrievable form. • Australian Govnmt National Health and Medical Research Council 41/10000
  • 42. Microsoft Academic Search 40M publications 19M authors 75 publishers (Wiley, Springer, ACM, IEEE …) 42/10000
  • 44. • Citing data is as important as citing papers • For researchers, publishers, data centers • Over 1M DOI, several major national research libraries – Germany, France, Korea, Netherlands, Australia, USA... • Present manager – German National Library of Science and Technology 44/10000
  • 45. Publish on the Cloud Add metadata Pre-print sharing 45/10000
  • 46. FNJV proj.lis.ic.unicamp.br/fnjv • Sharing by publishing on the Web • Retrievability by extending metadata 46/10000
  • 47.
  • 48.
  • 49. CURATION AND USE OF STANDARDS
  • 50. Workflows and model preservation
  • 51.
  • 52. Workflows and model preservation Comb-e-Chem Video Simulation Properties Analysis Diffractometer Structures Database X-Ray Properties e-Lab e-Lab Grid Middleware 52/10000
  • 54. Outline • Why preserve? • What to preserve? • How to preserve? • Where to preserve? And a few associated challenges PRE-SAVE and MANU-TENERE
  • 55. Outline • Why preserve? – Costly to produce (hardware, software, peopleware) – Contribute to progress of science – Value – culture, science, sustainability • What to preserve? – Data [WHAT IS DATA?] – Context of production and use • How to preserve? – Accessibility and sharing – standards, metadata, ontologies – Integrity and quality – context to use (hw, sw), standards
  • 56. References • 56/10000
  • 57. References NSF – CISE Data management policy The Domesday Project http://www.atsf.co.uk/dottext/domesday.html The CLARIN Project (languages) Eigenfactor.org Altmetrics movement