SlideShare a Scribd company logo
1 of 30
Download to read offline
DIGITAL PRESERVATION IN
       THE WILD

     Tim Donohue
     Research Programmer - IDEALS
     University of Illinois
     (with many thanks to Sarah Shreeves)




     CARLI Digital Preservation Forum – July 21, 2009
It all starts with a small story




             a dream of sorts
IDEALS: the “dream”
“Create a reliable and easy to use repository service
to preserve, manage, and provide persistent and
widespread access to the digital scholarship faculty
and students now produce…”
IDEALS: the “dream”
“Create a reliable and easy to use repository service
to preserve, manage, and provide persistent and
widespread access to the digital scholarship faculty
and students now produce…”
                                                   Can we
  What’s it mean
                            BUT…                  preserve
  to preserve this                               everything?
       stuff?                 What kind of
                             infrastructure?

              What kind                        What kind of
             of expertise                       resources?
             do we need?
IDEALS: the initial reality



                              Backup tapes stored
                              next to the server!




Not Really Our Server Room!
1.   Brought in
     Preservation Librarian

2.   Training and self
     education

3.   Assessment of where
     we were and where
     we needed to go
The Foundation
      Open Archival Information System (OAIS) Model




                       http://public.ccsds.org/publications/archive/650x0b1.pdf



Image borrowed from the ICPSR Digital Preservation Tutorial:
http://www.icpsr.umich.edu/dpm/
The Foundation II
TRAC (Trustworthy Repositories Audit & Certification)
              http://www.crl.edu/PDF/trac.pdf

                                       Documentation

 Organizational Infr.
                                        Transparency
 Digital Object Mgmt
 Technical Infr. & Security
                                         Adequacy


                                        Measurability?
The Digital Preservation Platform




Image borrowed from the ICPSR Digital Preservation Tutorial:
http://www.icpsr.umich.edu/dpm/
From Dorothea Salo. 2009. Institutional repositories for the digital arts and
Humanities. Humanities Digital Curation Institute. Champaign IL. May 2009.
http://www.slideshare.net/cavlec/digital-preservation-and-institutional-repositories
   “Preservation” needs to be
    unpacked.

   Not about the technology.


   Explicitness is key.

   You don’t have to preserve
    everything to the fullest
    extent if you say you aren’t.
The 5 Stages of Preservation
   Denial / Ignorance
   Anger / Fear
   Bargaining
   Depression
   Acceptance & Hope




            Based on the Kübler-Ross five stages of grief:
            http://en.wikipedia.org/wiki/K%C3%BCbler-Ross_model
Denial / Ignorance
                           backups**
    ** - This service is entirely fictional




Again, Not Really Our Server Room!
Anger / Fear
                     Obsolescence
Data Loss
Bargaining
Please, just let me get this data migrated elsewhere
Depression
                      How can we
 This is too         ever preserve
   hard.              everything?
                                     Why even
We don’t have                          try?
the resources
   for this.
Acceptance & Hope
We can take small steps to…
 Preserve some things locally
 Develop policies (say what you do)
 Enact policies via procedures (do what you say)
 Work with others on best practices to preserve the rest
The Principles of Preservation

(1) Say what you do…

      (2) Do what you say…

  Based on: Sarah Shreeves. 2009. Saying what we do – Doing what we say: Preservation
  Issues (Metadata and Otherwise) in Institutional Repositories. American Library Association
  Conference. Chicago IL. July 2009.
IDEALS - Saying what we do

   Secured explicit administrative support and commitment
    for digital preservation management program in IDEALS.
    http://hdl.handle.net/2142/135

   Developed high level preservation policy:
    http://hdl.handle.net/2142/2383

   Developed actionable procedures and policies that can
    be reassessed and changed as needed

   Began next stage of identifying & documenting gaps
IDEALS Preservation Support Policy

   Format-based,
                                                     Low Confidence (gray area)
    “Categories of Support”
                                                             Openly Documented
      High Confidence
       Full Support
                                            No Embedded
      Medium Confidence                                                          Widely Adopted
                                            Content or DRM
       No migration promised

      Low Confidence
       “Bit-level” support only                Uncompressed or
                                                                          Widely Supported
                                              Lossless Compression




       https://services.ideals.uiuc.edu/wiki/bin/view/IDEALS/PreservationSupportPolicy
IDEALS Format Support Matrix
         Compilation of “known” formats
         Concentration on textual formats
                Microsoft Office                    OpenOffice.org, HTML
  Proprietary                                                                Open


     Limited    OpenOffice.org                      Microsoft Office, HTML
    Adoption                                                                 Widely Adopted

      Limited   Microsoft Office                        Adobe PDF, HTML
                                                                             Widely Supported
      Support

   Embedded     MS Powerpoint (w/ Audio or Video)         MS Powerpoint
                                                                             Nothing Embedded
Content / DRM

       Lossy    JPEG                                      TIFF, JPEG 2000    No/Lossless
 Compression                                                                 Compression
IDEALS Format Recommendations

Textual                                        Images
  CSV, Text, PDF/A, XML,                           TIFF, JPEG 2000
  Open Document Format
  RTF, MS Office, PDF, HTML                        GIF, JPEG, PNG

Audio                                          Video
  AIFF, WAVE, Ogg Vorbis                           AVI, Motion JPEG 2000

  AAC, MP3, Real, WMA                              MP2, MP4, Quicktime, WMV

                          High Confidence / Preference
                          Medium Confidence / Preference
        https://services.ideals.uiuc.edu/wiki/bin/view/IDEALS/FormatRecommendations
IDEALS – Doing what we say
   Basic Activities (All Items:              )
     Regular  Virus Scans, Checksum verification
     Nightly off-campus backups
     Refresh storage media
     Preservation Metadata (minimal)
       Format,   checksum, file size, etc.
     Permanent  Identifiers (Handles)
     Always keep the original document
     Monitoring and reassessment of formats
IDEALS – Doing what we say
   Intermediate Activities (        )
     Additional monitoring, more frequent reassessment
     When possible, attempt to migrate formats to preserve
      content and style (hopefully)
       No  promises that functionality will be preserved
       (e.g.) Powerpoint  PDF (possible functionality loss)
       (e.g.) PDF 1.4  PDF/A (possible style loss)
IDEALS – Doing what we say
   Full Support Activities (      )
     Additional monitoring, more frequent reassessment
     When necessary, migrate document to successive
      format.
     Attempt to preserve content, style and functionality
       (e.g.)   PDF/A  successor to PDF/A
Our First Preservation Problem…

   Character issues in Word
    (and PDF)
   Found by chance
   Consultation with
    submitter
   Caused by conversion to
    Word (from Wordperfect)
   Resubmitted as RTF
We Acknowledge our Gaps

              Not checking format
               validity (yet)
              Minimal metadata
               collection
              Not checking files for
               problems (besides
               viruses)
              Not checking every
               automated conversion
Back to that “dream”?
“Create a reliable and easy to use repository service
to preserve, manage, and provide persistent and
widespread access to the digital scholarship faculty
and students now produce…”




      Total Items: 11,500    Total Downloads: 870,000+
Credits
   Slide 2 (Book image): http://www.flickr.com/photos/riot/100006656/
   Slide 3 (MS office): http://www.flickr.com/photos/niallkennedy/374272762/
   Slide 5/13 (Server room): http://www.flickr.com/photos/sylvar/31436961/
   Slide 6 (Get your act…): http://www.flickr.com/photos/dreamsjung/3595425744/
   Slide 11 (I know this…): http://www.flickr.com/photos/ali_kat_xx/1373989245/
   Slide 13 (Disk backups): http://www.flickr.com/photos/tonyaustin/2355186770/
   Slide 14 (Disk lost): http://www.flickr.com/photos/mag3737/2415681602/
   Slide 14 (Broken cd): http://www.flickr.com/photos/rickheath/72041533/
   Slide 15 (VHS to DVD): http://www.flickr.com/photos/28910181@N05/3085532220/
   Slide 15 (Mac transfer): http://www.flickr.com/photos/cyprien/6173244/
   Slide 16 (Depression): http://www.flickr.com/photos/dbarefoot/2652496167/
   Slide 17 (Hope): http://www.flickr.com/photos/livenature/259458056/
   Slide 17 (Dioum Quote): http://www.flickr.com/photos/wallyg/469808717/
   Slide 27 (Gaps): http://www.flickr.com/photos/aduki/2416528101/
Contact Info


      Tim Donohue
      University of Illinois
      tdonohue@illinois.edu
      http://www.ideals.uiuc.edu/
      http://www.ideals.uiuc.edu/wiki/



       This work is licensed under a Creative Commons Attribution-
       Noncommercial 3.0 United States License

More Related Content

What's hot

Digital preservation: an introduction
Digital preservation: an introductionDigital preservation: an introduction
Digital preservation: an introductionPublicLibraryServices
 
Intro to Digital Preservation
Intro to Digital PreservationIntro to Digital Preservation
Intro to Digital PreservationBen Fino-radin
 
Digital preservation
Digital preservationDigital preservation
Digital preservationMichael Day
 
Digital Preservation
Digital PreservationDigital Preservation
Digital Preservationtrbeck
 
Preparation, Proceed and Review of preservation of Digital Library
Preparation, Proceed and Review of preservation of Digital Library Preparation, Proceed and Review of preservation of Digital Library
Preparation, Proceed and Review of preservation of Digital Library Asheesh Kamal
 
Digital preservation: an introduction
Digital preservation: an introductionDigital preservation: an introduction
Digital preservation: an introductionMichael Day
 
Digital Preservation
Digital PreservationDigital Preservation
Digital PreservationMichael Day
 
Dataverse at Cariniana network
Dataverse at Cariniana networkDataverse at Cariniana network
Dataverse at Cariniana networkCariniana Rede
 
DATA MANAGEMENT – WHAT DOES IT MEAN FOR RESEARCHERS?
DATA MANAGEMENT – WHAT DOES IT MEAN FOR RESEARCHERS?DATA MANAGEMENT – WHAT DOES IT MEAN FOR RESEARCHERS?
DATA MANAGEMENT – WHAT DOES IT MEAN FOR RESEARCHERS?Incremental Project
 
A strategic view of document and digital object management
A strategic view of document and digital object managementA strategic view of document and digital object management
A strategic view of document and digital object managementDerek Keats
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...Projeto RCAAP
 
Digital Preservation
Digital PreservationDigital Preservation
Digital PreservationMichael Day
 
Digital preservation from a records management perspective
Digital preservation from a records management perspectiveDigital preservation from a records management perspective
Digital preservation from a records management perspectiveMichael Day
 
University of Bath Research Data Management training for researchers
University of Bath Research Data Management training for researchersUniversity of Bath Research Data Management training for researchers
University of Bath Research Data Management training for researchersJez Cope
 

What's hot (20)

Digital preservation: an introduction
Digital preservation: an introductionDigital preservation: an introduction
Digital preservation: an introduction
 
Intro to Digital Preservation
Intro to Digital PreservationIntro to Digital Preservation
Intro to Digital Preservation
 
Digital preservation
Digital preservationDigital preservation
Digital preservation
 
Digital Preservation
Digital PreservationDigital Preservation
Digital Preservation
 
Preparation, Proceed and Review of preservation of Digital Library
Preparation, Proceed and Review of preservation of Digital Library Preparation, Proceed and Review of preservation of Digital Library
Preparation, Proceed and Review of preservation of Digital Library
 
Digital preservation: an introduction
Digital preservation: an introductionDigital preservation: an introduction
Digital preservation: an introduction
 
Digital Preservation
Digital PreservationDigital Preservation
Digital Preservation
 
Dataverse at Cariniana network
Dataverse at Cariniana networkDataverse at Cariniana network
Dataverse at Cariniana network
 
Dig c curr
Dig c currDig c curr
Dig c curr
 
Your Digital Preservation Cookbook
Your Digital Preservation CookbookYour Digital Preservation Cookbook
Your Digital Preservation Cookbook
 
Trm Introduction
Trm IntroductionTrm Introduction
Trm Introduction
 
Digital Libray
Digital LibrayDigital Libray
Digital Libray
 
DATA MANAGEMENT – WHAT DOES IT MEAN FOR RESEARCHERS?
DATA MANAGEMENT – WHAT DOES IT MEAN FOR RESEARCHERS?DATA MANAGEMENT – WHAT DOES IT MEAN FOR RESEARCHERS?
DATA MANAGEMENT – WHAT DOES IT MEAN FOR RESEARCHERS?
 
A strategic view of document and digital object management
A strategic view of document and digital object managementA strategic view of document and digital object management
A strategic view of document and digital object management
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
 
Digital Preservation
Digital PreservationDigital Preservation
Digital Preservation
 
Digital Content Creation
Digital Content CreationDigital Content Creation
Digital Content Creation
 
Digital library
Digital libraryDigital library
Digital library
 
Digital preservation from a records management perspective
Digital preservation from a records management perspectiveDigital preservation from a records management perspective
Digital preservation from a records management perspective
 
University of Bath Research Data Management training for researchers
University of Bath Research Data Management training for researchersUniversity of Bath Research Data Management training for researchers
University of Bath Research Data Management training for researchers
 

Similar to Digital Preservation in the Wild: An IDEALS Perspective

KeepIt Course 3: preservation workflow
KeepIt Course 3: preservation workflowKeepIt Course 3: preservation workflow
KeepIt Course 3: preservation workflowJISC KeepIt project
 
Everyone's A Mechanic
Everyone's A MechanicEveryone's A Mechanic
Everyone's A MechanicBrad Houston
 
File Formats for Preservation
File Formats for PreservationFile Formats for Preservation
File Formats for PreservationStephen Gray
 
EPrints and the Cloud
EPrints and the CloudEPrints and the Cloud
EPrints and the CloudLeslie Carr
 
Document Archiving & Sharing System
Document Archiving & Sharing SystemDocument Archiving & Sharing System
Document Archiving & Sharing SystemAshik Iqbal
 
DevoxxUK 2016: "DevOps: Microservices, containers, platforms, tooling... Oh y...
DevoxxUK 2016: "DevOps: Microservices, containers, platforms, tooling... Oh y...DevoxxUK 2016: "DevOps: Microservices, containers, platforms, tooling... Oh y...
DevoxxUK 2016: "DevOps: Microservices, containers, platforms, tooling... Oh y...Daniel Bryant
 
Unified characterisation, please
Unified characterisation, pleaseUnified characterisation, please
Unified characterisation, pleaseAndy Jackson
 
EPrints Preservation: Why we need Preservation Planning
EPrints Preservation: Why we need Preservation PlanningEPrints Preservation: Why we need Preservation Planning
EPrints Preservation: Why we need Preservation PlanningJISC KeepIt project
 
Understanding Big Data And Hadoop
Understanding Big Data And HadoopUnderstanding Big Data And Hadoop
Understanding Big Data And HadoopEdureka!
 
JAXLondon 2015 "DevOps and the Cloud: All Hail the (Developer) King"
JAXLondon 2015 "DevOps and the Cloud: All Hail the (Developer) King"JAXLondon 2015 "DevOps and the Cloud: All Hail the (Developer) King"
JAXLondon 2015 "DevOps and the Cloud: All Hail the (Developer) King"Daniel Bryant
 
An Introduction to AtoM, Archivematica, and Artefactual Systems
An Introduction to AtoM, Archivematica, and Artefactual SystemsAn Introduction to AtoM, Archivematica, and Artefactual Systems
An Introduction to AtoM, Archivematica, and Artefactual SystemsArtefactual Systems - AtoM
 
Bodleian Library's DAMS system
Bodleian Library's DAMS systemBodleian Library's DAMS system
Bodleian Library's DAMS systembenosteen
 
DevOps and the cloud: all hail the (developer) king - Daniel Bryant, Steve Poole
DevOps and the cloud: all hail the (developer) king - Daniel Bryant, Steve PooleDevOps and the cloud: all hail the (developer) king - Daniel Bryant, Steve Poole
DevOps and the cloud: all hail the (developer) king - Daniel Bryant, Steve PooleJAXLondon_Conference
 
Information Management in a Web 2.0 World May 2009
Information Management in a Web 2.0 World May 2009Information Management in a Web 2.0 World May 2009
Information Management in a Web 2.0 World May 2009Collabor8now Ltd
 
[Webinar Slides] Developing a Successful Data Retention Policy
[Webinar Slides] Developing a Successful Data Retention Policy [Webinar Slides] Developing a Successful Data Retention Policy
[Webinar Slides] Developing a Successful Data Retention Policy AIIM International
 
Analytics with unified file and object
Analytics with unified file and object Analytics with unified file and object
Analytics with unified file and object Sandeep Patil
 

Similar to Digital Preservation in the Wild: An IDEALS Perspective (20)

KeepIt Course 3: preservation workflow
KeepIt Course 3: preservation workflowKeepIt Course 3: preservation workflow
KeepIt Course 3: preservation workflow
 
Everyone's A Mechanic
Everyone's A MechanicEveryone's A Mechanic
Everyone's A Mechanic
 
Resource space
Resource spaceResource space
Resource space
 
File Formats for Preservation
File Formats for PreservationFile Formats for Preservation
File Formats for Preservation
 
EPrints and the Cloud
EPrints and the CloudEPrints and the Cloud
EPrints and the Cloud
 
Electronic Records
Electronic RecordsElectronic Records
Electronic Records
 
Document Archiving & Sharing System
Document Archiving & Sharing SystemDocument Archiving & Sharing System
Document Archiving & Sharing System
 
DevoxxUK 2016: "DevOps: Microservices, containers, platforms, tooling... Oh y...
DevoxxUK 2016: "DevOps: Microservices, containers, platforms, tooling... Oh y...DevoxxUK 2016: "DevOps: Microservices, containers, platforms, tooling... Oh y...
DevoxxUK 2016: "DevOps: Microservices, containers, platforms, tooling... Oh y...
 
Unified characterisation, please
Unified characterisation, pleaseUnified characterisation, please
Unified characterisation, please
 
SHAREmodule2
SHAREmodule2SHAREmodule2
SHAREmodule2
 
EPrints Preservation: Why we need Preservation Planning
EPrints Preservation: Why we need Preservation PlanningEPrints Preservation: Why we need Preservation Planning
EPrints Preservation: Why we need Preservation Planning
 
Understanding Big Data And Hadoop
Understanding Big Data And HadoopUnderstanding Big Data And Hadoop
Understanding Big Data And Hadoop
 
JAXLondon 2015 "DevOps and the Cloud: All Hail the (Developer) King"
JAXLondon 2015 "DevOps and the Cloud: All Hail the (Developer) King"JAXLondon 2015 "DevOps and the Cloud: All Hail the (Developer) King"
JAXLondon 2015 "DevOps and the Cloud: All Hail the (Developer) King"
 
An Introduction to AtoM, Archivematica, and Artefactual Systems
An Introduction to AtoM, Archivematica, and Artefactual SystemsAn Introduction to AtoM, Archivematica, and Artefactual Systems
An Introduction to AtoM, Archivematica, and Artefactual Systems
 
Bodleian Library's DAMS system
Bodleian Library's DAMS systemBodleian Library's DAMS system
Bodleian Library's DAMS system
 
DevOps and the cloud: all hail the (developer) king - Daniel Bryant, Steve Poole
DevOps and the cloud: all hail the (developer) king - Daniel Bryant, Steve PooleDevOps and the cloud: all hail the (developer) king - Daniel Bryant, Steve Poole
DevOps and the cloud: all hail the (developer) king - Daniel Bryant, Steve Poole
 
Information Management in a Web 2.0 World May 2009
Information Management in a Web 2.0 World May 2009Information Management in a Web 2.0 World May 2009
Information Management in a Web 2.0 World May 2009
 
[Webinar Slides] Developing a Successful Data Retention Policy
[Webinar Slides] Developing a Successful Data Retention Policy [Webinar Slides] Developing a Successful Data Retention Policy
[Webinar Slides] Developing a Successful Data Retention Policy
 
Analytics with unified file and object
Analytics with unified file and object Analytics with unified file and object
Analytics with unified file and object
 
Cyverse: Extensible Cyberinfrastructure for Life Science
Cyverse: Extensible Cyberinfrastructure for Life ScienceCyverse: Extensible Cyberinfrastructure for Life Science
Cyverse: Extensible Cyberinfrastructure for Life Science
 

More from Tim Donohue

On the Road to DSpace 7: Angular UI + REST
On the Road to DSpace 7: Angular UI + RESTOn the Road to DSpace 7: Angular UI + REST
On the Road to DSpace 7: Angular UI + RESTTim Donohue
 
Introducing the New DSpace User Interface
Introducing the New DSpace User InterfaceIntroducing the New DSpace User Interface
Introducing the New DSpace User InterfaceTim Donohue
 
DSpace UI Prototype Challenge: Spring Boot + Thymeleaf
DSpace UI Prototype Challenge: Spring Boot + ThymeleafDSpace UI Prototype Challenge: Spring Boot + Thymeleaf
DSpace UI Prototype Challenge: Spring Boot + ThymeleafTim Donohue
 
Discussion on DSpace's Two UIs : DuraSpace 2015 Summit
Discussion on DSpace's Two UIs : DuraSpace 2015 SummitDiscussion on DSpace's Two UIs : DuraSpace 2015 Summit
Discussion on DSpace's Two UIs : DuraSpace 2015 SummitTim Donohue
 
How to "Hack" the DSpace Community
How to "Hack" the DSpace CommunityHow to "Hack" the DSpace Community
How to "Hack" the DSpace CommunityTim Donohue
 
DSpace Overview / Roadmap 2014
DSpace Overview / Roadmap 2014DSpace Overview / Roadmap 2014
DSpace Overview / Roadmap 2014Tim Donohue
 
DSpace RoadMap & Vision 2013 (OR13)
DSpace RoadMap & Vision 2013 (OR13)DSpace RoadMap & Vision 2013 (OR13)
DSpace RoadMap & Vision 2013 (OR13)Tim Donohue
 
DSpace RoadMap 2012
DSpace RoadMap 2012DSpace RoadMap 2012
DSpace RoadMap 2012Tim Donohue
 
DSpace RoadMap and Vision (at 2013 OAI8 DSpace User Group)
DSpace RoadMap and Vision (at 2013 OAI8 DSpace User Group)DSpace RoadMap and Vision (at 2013 OAI8 DSpace User Group)
DSpace RoadMap and Vision (at 2013 OAI8 DSpace User Group)Tim Donohue
 
Future Trends for Repositories
Future Trends for RepositoriesFuture Trends for Repositories
Future Trends for RepositoriesTim Donohue
 
DSpace & DuraCloud Integrations
DSpace & DuraCloud IntegrationsDSpace & DuraCloud Integrations
DSpace & DuraCloud IntegrationsTim Donohue
 
DSpace RoadMap 2011
DSpace RoadMap 2011DSpace RoadMap 2011
DSpace RoadMap 2011Tim Donohue
 
DSpace RoadMap 2010
DSpace RoadMap 2010DSpace RoadMap 2010
DSpace RoadMap 2010Tim Donohue
 
Improving DSpace Backups, Restores & Migrations
Improving DSpace Backups, Restores & MigrationsImproving DSpace Backups, Restores & Migrations
Improving DSpace Backups, Restores & MigrationsTim Donohue
 
BibApp 1.0 : Information In, Impact Out
BibApp 1.0 : Information In, Impact OutBibApp 1.0 : Information In, Impact Out
BibApp 1.0 : Information In, Impact OutTim Donohue
 
Making DSpace XMLUI Your Own
Making DSpace XMLUI Your OwnMaking DSpace XMLUI Your Own
Making DSpace XMLUI Your OwnTim Donohue
 

More from Tim Donohue (16)

On the Road to DSpace 7: Angular UI + REST
On the Road to DSpace 7: Angular UI + RESTOn the Road to DSpace 7: Angular UI + REST
On the Road to DSpace 7: Angular UI + REST
 
Introducing the New DSpace User Interface
Introducing the New DSpace User InterfaceIntroducing the New DSpace User Interface
Introducing the New DSpace User Interface
 
DSpace UI Prototype Challenge: Spring Boot + Thymeleaf
DSpace UI Prototype Challenge: Spring Boot + ThymeleafDSpace UI Prototype Challenge: Spring Boot + Thymeleaf
DSpace UI Prototype Challenge: Spring Boot + Thymeleaf
 
Discussion on DSpace's Two UIs : DuraSpace 2015 Summit
Discussion on DSpace's Two UIs : DuraSpace 2015 SummitDiscussion on DSpace's Two UIs : DuraSpace 2015 Summit
Discussion on DSpace's Two UIs : DuraSpace 2015 Summit
 
How to "Hack" the DSpace Community
How to "Hack" the DSpace CommunityHow to "Hack" the DSpace Community
How to "Hack" the DSpace Community
 
DSpace Overview / Roadmap 2014
DSpace Overview / Roadmap 2014DSpace Overview / Roadmap 2014
DSpace Overview / Roadmap 2014
 
DSpace RoadMap & Vision 2013 (OR13)
DSpace RoadMap & Vision 2013 (OR13)DSpace RoadMap & Vision 2013 (OR13)
DSpace RoadMap & Vision 2013 (OR13)
 
DSpace RoadMap 2012
DSpace RoadMap 2012DSpace RoadMap 2012
DSpace RoadMap 2012
 
DSpace RoadMap and Vision (at 2013 OAI8 DSpace User Group)
DSpace RoadMap and Vision (at 2013 OAI8 DSpace User Group)DSpace RoadMap and Vision (at 2013 OAI8 DSpace User Group)
DSpace RoadMap and Vision (at 2013 OAI8 DSpace User Group)
 
Future Trends for Repositories
Future Trends for RepositoriesFuture Trends for Repositories
Future Trends for Repositories
 
DSpace & DuraCloud Integrations
DSpace & DuraCloud IntegrationsDSpace & DuraCloud Integrations
DSpace & DuraCloud Integrations
 
DSpace RoadMap 2011
DSpace RoadMap 2011DSpace RoadMap 2011
DSpace RoadMap 2011
 
DSpace RoadMap 2010
DSpace RoadMap 2010DSpace RoadMap 2010
DSpace RoadMap 2010
 
Improving DSpace Backups, Restores & Migrations
Improving DSpace Backups, Restores & MigrationsImproving DSpace Backups, Restores & Migrations
Improving DSpace Backups, Restores & Migrations
 
BibApp 1.0 : Information In, Impact Out
BibApp 1.0 : Information In, Impact OutBibApp 1.0 : Information In, Impact Out
BibApp 1.0 : Information In, Impact Out
 
Making DSpace XMLUI Your Own
Making DSpace XMLUI Your OwnMaking DSpace XMLUI Your Own
Making DSpace XMLUI Your Own
 

Recently uploaded

Textual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHSTextual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHSMae Pangan
 
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...DhatriParmar
 
CLASSIFICATION OF ANTI - CANCER DRUGS.pptx
CLASSIFICATION OF ANTI - CANCER DRUGS.pptxCLASSIFICATION OF ANTI - CANCER DRUGS.pptx
CLASSIFICATION OF ANTI - CANCER DRUGS.pptxAnupam32727
 
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxBIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxSayali Powar
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfPatidar M
 
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...Nguyen Thanh Tu Collection
 
Indexing Structures in Database Management system.pdf
Indexing Structures in Database Management system.pdfIndexing Structures in Database Management system.pdf
Indexing Structures in Database Management system.pdfChristalin Nelson
 
CHEST Proprioceptive neuromuscular facilitation.pptx
CHEST Proprioceptive neuromuscular facilitation.pptxCHEST Proprioceptive neuromuscular facilitation.pptx
CHEST Proprioceptive neuromuscular facilitation.pptxAneriPatwari
 
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvRicaMaeCastro1
 
MS4 level being good citizen -imperative- (1) (1).pdf
MS4 level   being good citizen -imperative- (1) (1).pdfMS4 level   being good citizen -imperative- (1) (1).pdf
MS4 level being good citizen -imperative- (1) (1).pdfMr Bounab Samir
 
Mythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWMythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWQuiz Club NITW
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptxmary850239
 
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptxDIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptxMichelleTuguinay1
 
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDecoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDhatriParmar
 
Oppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmOppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmStan Meyer
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operationalssuser3e220a
 
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Association for Project Management
 
4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptxmary850239
 

Recently uploaded (20)

Textual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHSTextual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHS
 
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
 
CLASSIFICATION OF ANTI - CANCER DRUGS.pptx
CLASSIFICATION OF ANTI - CANCER DRUGS.pptxCLASSIFICATION OF ANTI - CANCER DRUGS.pptx
CLASSIFICATION OF ANTI - CANCER DRUGS.pptx
 
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxBIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdf
 
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...
 
Indexing Structures in Database Management system.pdf
Indexing Structures in Database Management system.pdfIndexing Structures in Database Management system.pdf
Indexing Structures in Database Management system.pdf
 
Paradigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTAParadigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTA
 
CHEST Proprioceptive neuromuscular facilitation.pptx
CHEST Proprioceptive neuromuscular facilitation.pptxCHEST Proprioceptive neuromuscular facilitation.pptx
CHEST Proprioceptive neuromuscular facilitation.pptx
 
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
 
MS4 level being good citizen -imperative- (1) (1).pdf
MS4 level   being good citizen -imperative- (1) (1).pdfMS4 level   being good citizen -imperative- (1) (1).pdf
MS4 level being good citizen -imperative- (1) (1).pdf
 
Mythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWMythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITW
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx
 
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptxDIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
 
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDecoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
 
Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"
 
Oppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmOppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and Film
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operational
 
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
 
4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx
 

Digital Preservation in the Wild: An IDEALS Perspective

  • 1. DIGITAL PRESERVATION IN THE WILD Tim Donohue Research Programmer - IDEALS University of Illinois (with many thanks to Sarah Shreeves) CARLI Digital Preservation Forum – July 21, 2009
  • 2. It all starts with a small story a dream of sorts
  • 3. IDEALS: the “dream” “Create a reliable and easy to use repository service to preserve, manage, and provide persistent and widespread access to the digital scholarship faculty and students now produce…”
  • 4. IDEALS: the “dream” “Create a reliable and easy to use repository service to preserve, manage, and provide persistent and widespread access to the digital scholarship faculty and students now produce…” Can we What’s it mean BUT… preserve to preserve this everything? stuff? What kind of infrastructure? What kind What kind of of expertise resources? do we need?
  • 5. IDEALS: the initial reality Backup tapes stored next to the server! Not Really Our Server Room!
  • 6. 1. Brought in Preservation Librarian 2. Training and self education 3. Assessment of where we were and where we needed to go
  • 7. The Foundation Open Archival Information System (OAIS) Model http://public.ccsds.org/publications/archive/650x0b1.pdf Image borrowed from the ICPSR Digital Preservation Tutorial: http://www.icpsr.umich.edu/dpm/
  • 8. The Foundation II TRAC (Trustworthy Repositories Audit & Certification) http://www.crl.edu/PDF/trac.pdf Documentation  Organizational Infr. Transparency  Digital Object Mgmt  Technical Infr. & Security Adequacy Measurability?
  • 9. The Digital Preservation Platform Image borrowed from the ICPSR Digital Preservation Tutorial: http://www.icpsr.umich.edu/dpm/
  • 10. From Dorothea Salo. 2009. Institutional repositories for the digital arts and Humanities. Humanities Digital Curation Institute. Champaign IL. May 2009. http://www.slideshare.net/cavlec/digital-preservation-and-institutional-repositories
  • 11. “Preservation” needs to be unpacked.  Not about the technology.  Explicitness is key.  You don’t have to preserve everything to the fullest extent if you say you aren’t.
  • 12. The 5 Stages of Preservation  Denial / Ignorance  Anger / Fear  Bargaining  Depression  Acceptance & Hope Based on the Kübler-Ross five stages of grief: http://en.wikipedia.org/wiki/K%C3%BCbler-Ross_model
  • 13. Denial / Ignorance backups** ** - This service is entirely fictional Again, Not Really Our Server Room!
  • 14. Anger / Fear Obsolescence Data Loss
  • 15. Bargaining Please, just let me get this data migrated elsewhere
  • 16. Depression How can we This is too ever preserve hard. everything? Why even We don’t have try? the resources for this.
  • 17. Acceptance & Hope We can take small steps to…  Preserve some things locally  Develop policies (say what you do)  Enact policies via procedures (do what you say)  Work with others on best practices to preserve the rest
  • 18. The Principles of Preservation (1) Say what you do… (2) Do what you say… Based on: Sarah Shreeves. 2009. Saying what we do – Doing what we say: Preservation Issues (Metadata and Otherwise) in Institutional Repositories. American Library Association Conference. Chicago IL. July 2009.
  • 19. IDEALS - Saying what we do  Secured explicit administrative support and commitment for digital preservation management program in IDEALS. http://hdl.handle.net/2142/135  Developed high level preservation policy: http://hdl.handle.net/2142/2383  Developed actionable procedures and policies that can be reassessed and changed as needed  Began next stage of identifying & documenting gaps
  • 20. IDEALS Preservation Support Policy  Format-based, Low Confidence (gray area) “Categories of Support” Openly Documented High Confidence  Full Support No Embedded Medium Confidence Widely Adopted Content or DRM  No migration promised Low Confidence  “Bit-level” support only Uncompressed or Widely Supported Lossless Compression https://services.ideals.uiuc.edu/wiki/bin/view/IDEALS/PreservationSupportPolicy
  • 21. IDEALS Format Support Matrix  Compilation of “known” formats  Concentration on textual formats Microsoft Office OpenOffice.org, HTML Proprietary Open Limited OpenOffice.org Microsoft Office, HTML Adoption Widely Adopted Limited Microsoft Office Adobe PDF, HTML Widely Supported Support Embedded MS Powerpoint (w/ Audio or Video) MS Powerpoint Nothing Embedded Content / DRM Lossy JPEG TIFF, JPEG 2000 No/Lossless Compression Compression
  • 22. IDEALS Format Recommendations Textual Images CSV, Text, PDF/A, XML, TIFF, JPEG 2000 Open Document Format RTF, MS Office, PDF, HTML GIF, JPEG, PNG Audio Video AIFF, WAVE, Ogg Vorbis AVI, Motion JPEG 2000 AAC, MP3, Real, WMA MP2, MP4, Quicktime, WMV High Confidence / Preference Medium Confidence / Preference https://services.ideals.uiuc.edu/wiki/bin/view/IDEALS/FormatRecommendations
  • 23. IDEALS – Doing what we say  Basic Activities (All Items: )  Regular Virus Scans, Checksum verification  Nightly off-campus backups  Refresh storage media  Preservation Metadata (minimal)  Format, checksum, file size, etc.  Permanent Identifiers (Handles)  Always keep the original document  Monitoring and reassessment of formats
  • 24. IDEALS – Doing what we say  Intermediate Activities ( )  Additional monitoring, more frequent reassessment  When possible, attempt to migrate formats to preserve content and style (hopefully)  No promises that functionality will be preserved  (e.g.) Powerpoint  PDF (possible functionality loss)  (e.g.) PDF 1.4  PDF/A (possible style loss)
  • 25. IDEALS – Doing what we say  Full Support Activities ( )  Additional monitoring, more frequent reassessment  When necessary, migrate document to successive format.  Attempt to preserve content, style and functionality  (e.g.) PDF/A  successor to PDF/A
  • 26. Our First Preservation Problem…  Character issues in Word (and PDF)  Found by chance  Consultation with submitter  Caused by conversion to Word (from Wordperfect)  Resubmitted as RTF
  • 27. We Acknowledge our Gaps  Not checking format validity (yet)  Minimal metadata collection  Not checking files for problems (besides viruses)  Not checking every automated conversion
  • 28. Back to that “dream”? “Create a reliable and easy to use repository service to preserve, manage, and provide persistent and widespread access to the digital scholarship faculty and students now produce…” Total Items: 11,500 Total Downloads: 870,000+
  • 29. Credits  Slide 2 (Book image): http://www.flickr.com/photos/riot/100006656/  Slide 3 (MS office): http://www.flickr.com/photos/niallkennedy/374272762/  Slide 5/13 (Server room): http://www.flickr.com/photos/sylvar/31436961/  Slide 6 (Get your act…): http://www.flickr.com/photos/dreamsjung/3595425744/  Slide 11 (I know this…): http://www.flickr.com/photos/ali_kat_xx/1373989245/  Slide 13 (Disk backups): http://www.flickr.com/photos/tonyaustin/2355186770/  Slide 14 (Disk lost): http://www.flickr.com/photos/mag3737/2415681602/  Slide 14 (Broken cd): http://www.flickr.com/photos/rickheath/72041533/  Slide 15 (VHS to DVD): http://www.flickr.com/photos/28910181@N05/3085532220/  Slide 15 (Mac transfer): http://www.flickr.com/photos/cyprien/6173244/  Slide 16 (Depression): http://www.flickr.com/photos/dbarefoot/2652496167/  Slide 17 (Hope): http://www.flickr.com/photos/livenature/259458056/  Slide 17 (Dioum Quote): http://www.flickr.com/photos/wallyg/469808717/  Slide 27 (Gaps): http://www.flickr.com/photos/aduki/2416528101/
  • 30. Contact Info Tim Donohue University of Illinois tdonohue@illinois.edu http://www.ideals.uiuc.edu/ http://www.ideals.uiuc.edu/wiki/ This work is licensed under a Creative Commons Attribution- Noncommercial 3.0 United States License