SlideShare a Scribd company logo
1 of 54
Download to read offline
Dorothea Salo
University of Wisconsin
salo@wisc.edu
Institutional repositories
for the digital arts and
humanities
Dorothea Salo
University of Wisconsin
salo@wisc.edu
Preservation for the
digital arts and
humanities
Dorothea Salo
University of Wisconsin
salo@wisc.edu
Dorothea Salo
University of Wisconsin
salo@wisc.edu
Preservation and
institutional repositories
for the digital arts and
humanities
And I said...
... you’re giving me how
much time for this?
Environment
•As several of you are intimately aware, higher ed
is trying to figure out What To Do About Data.
•This spells opportunity... IF you can get a seat at
the table, and IF you know what to ask for!
• Humanists will not be the first people they think of, sadly.
•Serious (insoluble?) problem: data diversity
• Expect compromise “solutions.”
•Do not let IT pros intimidate you.
• They do not know everything they think they know.
PICK
SOFTWARE
LAST.
Friendly word
of advice:
Photo: “Briana Calderon; future educator of america.”
http://www.flickr.com/photos/46132085@N03/4703617843/
Arielle Calderon / CC-BY 2.0
IT’S WHAT
THE SOFTWARE
WON’T DO.
It’s not what the
software does
that’ll kill you.
Photo: “Briana Calderon; future educator of america.”
http://www.flickr.com/photos/46132085@N03/4703617843/
Arielle Calderon / CC-BY 2.0
DON’T CHASE
THE SHINY.
Another friendly word of
advice:
Photo: “Sparkle Texture”
http://www.flickr.com/photos/abbylanes/3214921616/
Abby Lane / CC-BY 2.0
it’s much less
shiny.
In five years...
Photo: “Sparkle Texture”
http://www.flickr.com/photos/abbylanes/3214921616/
Abby Lane / CC-BY 2.0
it’s not shiny
at all.
In ten years...
Photo: “Sparkle Texture”
http://www.flickr.com/photos/abbylanes/3214921616/
Abby Lane / CC-BY 2.0
In twenty years...
it’s probably
useless.
NOT A SOLUTION:
your graduate students
•You have a bright, tech-savvy grad student.
•She builds an Awesome Tech Thing.
•You have no idea how it works.
•She graduates. You’re hosed.
• Because she didn’t (know how to) build it sustainably...
• Because you don’t have any documentation...
• Because nobody made contingency plans for it...
•I have seen this pattern over and over again.
• It’s killed more digital culture and research materials than
anything I can think of in academe.
Am I saying “don’t experiment?”
•Nah, of course not.
•I’m saying “know what an experiment means.”
•I’m saying “don’t mistake an experiment for an
archive.”
•I’m saying “don’t experiment and then expect
everybody else to pick up your pieces because
you didn’t plan for metadata or preservation.”
That said?
•You gotta do what you gotta do.
•Some friendly advice:
• Know where the exits are. (Can you export your data? In a
reusable format?)
• When you finish a project, USE that export.
• Triply true if you’re relying on the cloud!
•Your overriding goal, while a project is in
progress: keep your eventual options open!
•Long-term... is a totally other kettle of fish.
Your best strategy
•The single best strategy for a digital humanist
concerned about long-term preservation...
•... is to figure out how to make it Somebody
Else’s Problem.
• Right now, this is hard. I do believe it will get easier.
•It’s a lot easier to figure this out from the
start than at the end.
• Different Somebody Elses will have different things that
they want. If you know that from the get-go, you’re much
better off.
Institution-internal solutions
Institution-internal solutions
•Rolling your own
Institution-internal solutions
•Rolling your own
• Please don’t, if you can possibly avoid it.
Institution-internal solutions
•Rolling your own
• Please don’t, if you can possibly avoid it.
•Adopting open-source software
Institution-internal solutions
•Rolling your own
• Please don’t, if you can possibly avoid it.
•Adopting open-source software
• e.g. Omeka, Dataverse, ArchivesSpace...
Institution-internal solutions
•Rolling your own
• Please don’t, if you can possibly avoid it.
•Adopting open-source software
• e.g. Omeka, Dataverse, ArchivesSpace...
• Better, but not foolproof. Upgrades? Security? Backups?
Institution-internal solutions
•Rolling your own
• Please don’t, if you can possibly avoid it.
•Adopting open-source software
• e.g. Omeka, Dataverse, ArchivesSpace...
• Better, but not foolproof. Upgrades? Security? Backups?
• Writing plugins/mods = rolling your own. Avoid if possible.
Institution-internal solutions
•Rolling your own
• Please don’t, if you can possibly avoid it.
•Adopting open-source software
• e.g. Omeka, Dataverse, ArchivesSpace...
• Better, but not foolproof. Upgrades? Security? Backups?
• Writing plugins/mods = rolling your own. Avoid if possible.
•Adopting institutional infrastructure
Institution-internal solutions
•Rolling your own
• Please don’t, if you can possibly avoid it.
•Adopting open-source software
• e.g. Omeka, Dataverse, ArchivesSpace...
• Better, but not foolproof. Upgrades? Security? Backups?
• Writing plugins/mods = rolling your own. Avoid if possible.
•Adopting institutional infrastructure
• Make sure it’ll survive your departure from the institution!
Outside the institution
Outside the institution
•Lists of data repositories
Outside the institution
•Lists of data repositories
• Databib: http://databib.org/
Outside the institution
•Lists of data repositories
• Databib: http://databib.org/
• re3data: http://re3data.org/
Outside the institution
•Lists of data repositories
• Databib: http://databib.org/
• re3data: http://re3data.org/
• N.b. you will find less here on the humanities than you
would probably prefer. Long story.
Outside the institution
•Lists of data repositories
• Databib: http://databib.org/
• re3data: http://re3data.org/
• N.b. you will find less here on the humanities than you
would probably prefer. Long story.
•Figshare
Outside the institution
•Lists of data repositories
• Databib: http://databib.org/
• re3data: http://re3data.org/
• N.b. you will find less here on the humanities than you
would probably prefer. Long story.
•Figshare
• ... and other web services springing up, e.g. omeka.net
You will be limited by...
•Infrastructure your library/IT has already
committed to
• this is why you want to be in on ground-floor discussions!
•Their willingness and ability to tweak, rewrite,
or replace it with something suiting your needs
•Your willingness and ability to evaluate, install,
and maintain a software stack that suits you
• ... perhaps indefinitely!
•The availability of hosted solutions, and your
ability to pay for them (perhaps indefinitely!)
You need to know
what the options are like.
•Your library and IT folks may well need
guidance. At minimum, they need clearly-
expressed requirements.
•The requirements you give them need to go
beyond end-user access, use, and UI.
• Back end: getting material in as efficiently as possible,
allowing for additions/changes/deletions
• Preservation requirements
• Data and metadata purity, clarity, preservability,
reusability, mashuppability, migratability, standards
Institutional
repositories
What’s an IR?
•“‘[A]ttics’ (and often fairly empty ones), with
random assortments of content of
questionable importance”
• Brown, Griffiths, Rascoff, “University publishing in a digital
age.” Ithaka 2007. http://www.sr.ithaka.org/research-
publications/university-publishing-digital-age
•A basic digital preservation-and-access
platform designed to allow faculty to deposit
and describe single PDFs.
•Quite commonly available in research libraries
or through library consortia.
• You probably have one available to you!
IR software
•Open source
• Fedora Commons: http://fedora-commons.info/ (you’ll
need a layer on top of this)
• DSpace: http://dspace.org/
• EPrints: http://eprints.org/
•Commercial
• ContentDM: http://contentdm.com/
• DigiTool: http://www.exlibrisgroup.com/category/
DigiToolOverview
•Hosted
• ContentDM: http://contentdm.com/
• BePress: http://bepress.com/
Two minutes!
•Find an IR available to you for depositing
content.
You can typically expect...
•To get in touch with someone in the library to
get an account set up, and a space for you to
deposit into
• Have a collection name and description ready.
• Default descriptors, if you have any, also a good idea.
• Need access controls? To delegate deposit? Talk about this.
•To be able to put materials in on your own,
through web forms
• To find the deposit process fiddly and annoying
•To have material appear on the web right after
deposit.
IRs work for...
•Small(ish), discrete files that never change
• So an Excel-using researcher is just fine with an IR.
•Documentation for data held elsewhere
•Some IRs can handle static website captures.
•Files with uncomplicated IP lives
• ... which complicates the “static website” question.
•Access restriction may be possible, as may
dark archiving; it depends on the IR platform.
Expect it to be annoying to implement, though.
IRs don’t work for
•Really Big Data
• including, sometimes, audio and video
• This is less a reflection on IR software than of most IRs being
horribly underprovisioned with storage and bandwidth.
•Work in progress; files that may change or be
updated
•Complex digital objects (except static websites)
•Digital objects that need interactivity
• Even something as simple as video streaming. IRs can’t.
•Anything that needs a DOI. (You’ll get a
permanent identifier, but it won’t be a DOI.)
•Datasets where the researcher wants to vet any
potential reusers
Digital libraries
Digital-library software
• Omeka, Greenstone (aging), ContentDM...
• Again, chances are your library already has some kind
of digital-collections software.
• Go ask a librarian what it is, and whether you can add material to it!
• Also ask if it’s attached to any kind of digitization or metadata-help
service. It may not be, but you never know.
• If not, there are hosted options
• if you’re prepared to pay for them indefinitely.
• Designed for image exhibitions
• May extend to audio and video, but UI won’t be ideal.
• May extend to page-scanned books, but may not. (Omeka is terrible
at these.)
Be aware
•The digital-preservation underpinnings of this
class of software are weak to nonexistent.
• It’s designed for exhibiting, not for archiving!
• It may also entice you into poor sustainability decisions,
such as using web-friendly but lossy JPG as your master
image format. Or not making backups.
•On the plus side, though, if it’s a library service,
the library feels an institutional commitment
to the materials in it.
• That’s a lot of the preservation battle won, right there.
Emerging solutions
•Often involve combining software to attack
different parts of the problem
• Preservation underlayers: Fedora Commons, microservices
• Deposit and management UI: Hydra, Islandora
• End-user UI: Hydra, Islandora, Omeka, plugins, mods, etc.
•Are still pretty DIY at this point
• If your library is doing active development, you’re one of
the lucky ones.
• The rest of you may have to wait. And lobby.
Archives platforms
•Designed for coping with an undifferentiated
mess of random digital stuff.
• I know, right? Nobody has that problem...
•Not usually designed to help other people use
or interact with that stuff.
• Also, designed for archivists’ ways of thinking. Archivists are
humanists, but not all humanists are archivists.
• Worth getting a software tour from an archivist!
•Archivematica, ArchivesSpace (in beta), Duke
Data Accessioner, CollectiveAccess, BitCurator
Data-management platforms
•Usually designed for the sciences, not the
humanities!
• But that doesn’t necessarily mean they won’t work for what
you have in mind.
• (“E-lab notebooks” will probably feel pretty foreign, though.)
•Look at Dataverse Network, http://thedata.org/
•Gigantor lists of everything ever:
• http://www.dcc.ac.uk/resources/external/tools-services/
archiving-and-preserving-information
• http://dirt.projectbamboo.org/categories/publishing
• http://foss4lib.org/packages
• Less helpful than you might think; there’s rarely any decision
apparatus alongside.
Dorothea’s cantankerous, crabby,
cynical, crude, choleric, churlish,
other-words-beginning-with-C
take on digital humanists working
with librarians and IT pros
Neil Gaiman on George R.R.
Martin and his eager fans
CENSORED
CENSORED
From: http://journal.neilgaiman.com/2009/05/entitlement-issues.html
Digital humanists:
Librarians and IT
professionals are
not your bitch.CENSORED
Not entirely your fault!
•$$$ is a consideration, unfortunately
• The sciences have it. Unfortunately.
•Your colleagues may have poisoned the well by
being prima donnas, even though you’re not!
•Different professional-advancement
infrastructure
•We may just. not. be. ready.
• Or the infrastructure we rely on may not be.
DH and libraries should be friends
•Involve the library from the outset.
• Please do NOT ask us to pick up your messes at the end!
• Expect us to have work for you to do, and quality expectations.
• Yes, I know that’s how it used to work with analog materials
and archivists. Digital is different.
•Come to us in groups.
• We serve all of campus. We cannot afford to move heaven and
earth for any one person. Please don’t be a prima donna!
• At minimum, have an idea how what you’re asking will
concretely benefit other campus constituencies.
•Offer quid pro quo. What’s in it for us?
• (Library advocacy in high places is always a good trade.)
•Be patient, please.
• We don’t turn on a dime.
Will this always work?
•Sadly, no.
•A good many libraries are just not ready to
take digital preservation and DH support
seriously.
• The presence of a “DH center” in the library is not always
proof of serious intent.
•Others have been burned before.
•Still others are skeptics.
•I can’t promise you’ll find help in the library, or
with campus IT. I can promise you won’t if you
don’t seriously approach them.
• (Miriam Posner’s article is a must-read!)
Thanks!
Questions?

More Related Content

What's hot

Conservation's Digital Landscape: one conservator's perspective
Conservation's Digital Landscape: one conservator's perspectiveConservation's Digital Landscape: one conservator's perspective
Conservation's Digital Landscape: one conservator's perspectiveNancie Ravenel
 
Sarah Callaghan Research Data Overview
Sarah Callaghan Research Data OverviewSarah Callaghan Research Data Overview
Sarah Callaghan Research Data OverviewOpenAIRE
 
OSFair2017 Training | Best practice in Open Science
OSFair2017 Training | Best practice in Open ScienceOSFair2017 Training | Best practice in Open Science
OSFair2017 Training | Best practice in Open ScienceOpen Science Fair
 
Data Science Folk Knowledge
Data Science Folk KnowledgeData Science Folk Knowledge
Data Science Folk KnowledgeKrishna Sankar
 
Information Extraction from Text, presented @ Deloitte
Information Extraction from Text, presented @ DeloitteInformation Extraction from Text, presented @ Deloitte
Information Extraction from Text, presented @ DeloitteDeep Kayal
 
Personal Information Management Strategies and Tools
Personal Information Management Strategies and ToolsPersonal Information Management Strategies and Tools
Personal Information Management Strategies and ToolsRobin D
 
DMTM 2015 - 02 Data Mining
DMTM 2015 - 02 Data MiningDMTM 2015 - 02 Data Mining
DMTM 2015 - 02 Data MiningPier Luca Lanzi
 
Towards An Improvement Community Platform for Service Innovation
Towards An Improvement Community Platform for Service InnovationTowards An Improvement Community Platform for Service Innovation
Towards An Improvement Community Platform for Service InnovationJack Park
 
Publishing and Using Linked Open Data - Day 1
Publishing and Using Linked Open Data - Day 1 Publishing and Using Linked Open Data - Day 1
Publishing and Using Linked Open Data - Day 1 Richard Urban
 
Provenance Management to Enable Data Sharing
Provenance Management to Enable Data SharingProvenance Management to Enable Data Sharing
Provenance Management to Enable Data SharingUniversity of Arizona
 
DMTM 2015 - 01 Course Introduction
DMTM 2015 - 01 Course IntroductionDMTM 2015 - 01 Course Introduction
DMTM 2015 - 01 Course IntroductionPier Luca Lanzi
 
ENGL 1221 Writing Seminar
ENGL 1221 Writing SeminarENGL 1221 Writing Seminar
ENGL 1221 Writing SeminarTraciwm
 
Comm 1130 technical_communication_march2012-alcock
Comm 1130 technical_communication_march2012-alcockComm 1130 technical_communication_march2012-alcock
Comm 1130 technical_communication_march2012-alcockMelanie Parlette-Stewart
 
ENGL 1221 Writing Seminar Putt
ENGL 1221 Writing Seminar PuttENGL 1221 Writing Seminar Putt
ENGL 1221 Writing Seminar PuttTraciwm
 
User-centered assessment: Leveraging what you know and filling in the gaps.
User-centered assessment: Leveraging what you know and filling in the gaps. User-centered assessment: Leveraging what you know and filling in the gaps.
User-centered assessment: Leveraging what you know and filling in the gaps. Lynn Connaway
 
R, Data Wrangling & Kaggle Data Science Competitions
R, Data Wrangling & Kaggle Data Science CompetitionsR, Data Wrangling & Kaggle Data Science Competitions
R, Data Wrangling & Kaggle Data Science CompetitionsKrishna Sankar
 
Web 2.0 for Biologists–Are any of the current tools worth using?
Web 2.0 for Biologists–Are any of the current tools worth using?Web 2.0 for Biologists–Are any of the current tools worth using?
Web 2.0 for Biologists–Are any of the current tools worth using?dacrotty
 
Writing Seminar Babbitt Spring 2012
Writing Seminar Babbitt Spring 2012Writing Seminar Babbitt Spring 2012
Writing Seminar Babbitt Spring 2012Traciwm
 
Empowering tools for Neuroimaging
Empowering tools for NeuroimagingEmpowering tools for Neuroimaging
Empowering tools for NeuroimagingAndreas Horn
 

What's hot (20)

Conservation's Digital Landscape: one conservator's perspective
Conservation's Digital Landscape: one conservator's perspectiveConservation's Digital Landscape: one conservator's perspective
Conservation's Digital Landscape: one conservator's perspective
 
Sarah Callaghan Research Data Overview
Sarah Callaghan Research Data OverviewSarah Callaghan Research Data Overview
Sarah Callaghan Research Data Overview
 
OSFair2017 Training | Best practice in Open Science
OSFair2017 Training | Best practice in Open ScienceOSFair2017 Training | Best practice in Open Science
OSFair2017 Training | Best practice in Open Science
 
Data Science Folk Knowledge
Data Science Folk KnowledgeData Science Folk Knowledge
Data Science Folk Knowledge
 
Information Extraction from Text, presented @ Deloitte
Information Extraction from Text, presented @ DeloitteInformation Extraction from Text, presented @ Deloitte
Information Extraction from Text, presented @ Deloitte
 
Personal Information Management Strategies and Tools
Personal Information Management Strategies and ToolsPersonal Information Management Strategies and Tools
Personal Information Management Strategies and Tools
 
DMTM 2015 - 02 Data Mining
DMTM 2015 - 02 Data MiningDMTM 2015 - 02 Data Mining
DMTM 2015 - 02 Data Mining
 
Towards An Improvement Community Platform for Service Innovation
Towards An Improvement Community Platform for Service InnovationTowards An Improvement Community Platform for Service Innovation
Towards An Improvement Community Platform for Service Innovation
 
Publishing and Using Linked Open Data - Day 1
Publishing and Using Linked Open Data - Day 1 Publishing and Using Linked Open Data - Day 1
Publishing and Using Linked Open Data - Day 1
 
Searching tech2
Searching tech2Searching tech2
Searching tech2
 
Provenance Management to Enable Data Sharing
Provenance Management to Enable Data SharingProvenance Management to Enable Data Sharing
Provenance Management to Enable Data Sharing
 
DMTM 2015 - 01 Course Introduction
DMTM 2015 - 01 Course IntroductionDMTM 2015 - 01 Course Introduction
DMTM 2015 - 01 Course Introduction
 
ENGL 1221 Writing Seminar
ENGL 1221 Writing SeminarENGL 1221 Writing Seminar
ENGL 1221 Writing Seminar
 
Comm 1130 technical_communication_march2012-alcock
Comm 1130 technical_communication_march2012-alcockComm 1130 technical_communication_march2012-alcock
Comm 1130 technical_communication_march2012-alcock
 
ENGL 1221 Writing Seminar Putt
ENGL 1221 Writing Seminar PuttENGL 1221 Writing Seminar Putt
ENGL 1221 Writing Seminar Putt
 
User-centered assessment: Leveraging what you know and filling in the gaps.
User-centered assessment: Leveraging what you know and filling in the gaps. User-centered assessment: Leveraging what you know and filling in the gaps.
User-centered assessment: Leveraging what you know and filling in the gaps.
 
R, Data Wrangling & Kaggle Data Science Competitions
R, Data Wrangling & Kaggle Data Science CompetitionsR, Data Wrangling & Kaggle Data Science Competitions
R, Data Wrangling & Kaggle Data Science Competitions
 
Web 2.0 for Biologists–Are any of the current tools worth using?
Web 2.0 for Biologists–Are any of the current tools worth using?Web 2.0 for Biologists–Are any of the current tools worth using?
Web 2.0 for Biologists–Are any of the current tools worth using?
 
Writing Seminar Babbitt Spring 2012
Writing Seminar Babbitt Spring 2012Writing Seminar Babbitt Spring 2012
Writing Seminar Babbitt Spring 2012
 
Empowering tools for Neuroimaging
Empowering tools for NeuroimagingEmpowering tools for Neuroimaging
Empowering tools for Neuroimaging
 

Viewers also liked

Open Sesame (and other open movements)
Open Sesame (and other open movements)Open Sesame (and other open movements)
Open Sesame (and other open movements)Dorothea Salo
 
Manufacturing Serendipity
Manufacturing SerendipityManufacturing Serendipity
Manufacturing SerendipityDorothea Salo
 
So are we winning yet?
So are we winning yet?So are we winning yet?
So are we winning yet?Dorothea Salo
 
Soylent Semantic Web Is People! (with notes)
Soylent Semantic Web Is People! (with notes)Soylent Semantic Web Is People! (with notes)
Soylent Semantic Web Is People! (with notes)Dorothea Salo
 
RDF, RDA, and other TLAs
RDF, RDA, and other TLAsRDF, RDA, and other TLAs
RDF, RDA, and other TLAsDorothea Salo
 
I own copyright, so I pwn you!
I own copyright, so I pwn you!I own copyright, so I pwn you!
I own copyright, so I pwn you!Dorothea Salo
 
So are we winning yet?
So are we winning yet?So are we winning yet?
So are we winning yet?Dorothea Salo
 
A Successful Failure: Community Requirements Gathering for DSpace
A Successful Failure: Community Requirements Gathering for DSpaceA Successful Failure: Community Requirements Gathering for DSpace
A Successful Failure: Community Requirements Gathering for DSpaceDorothea Salo
 
Even the Loons are Licensed
Even the Loons are LicensedEven the Loons are Licensed
Even the Loons are LicensedDorothea Salo
 
Solving Problems with Web 2.0
Solving Problems with Web 2.0Solving Problems with Web 2.0
Solving Problems with Web 2.0Dorothea Salo
 
Lipstick on a Pig: Integrated Library Systems
Lipstick on a Pig: Integrated Library SystemsLipstick on a Pig: Integrated Library Systems
Lipstick on a Pig: Integrated Library SystemsDorothea Salo
 
What's Driving Open Access?
What's Driving Open Access?What's Driving Open Access?
What's Driving Open Access?Dorothea Salo
 
Grab a bucket! It's raining data!
Grab a bucket! It's raining data!Grab a bucket! It's raining data!
Grab a bucket! It's raining data!Dorothea Salo
 
So you think you know libraries
So you think you know librariesSo you think you know libraries
So you think you know librariesDorothea Salo
 
Grab a bucket! It's raining data!
Grab a bucket! It's raining data!Grab a bucket! It's raining data!
Grab a bucket! It's raining data!Dorothea Salo
 
Save the Cows! Cyberinfrastructure for the Rest of Us
Save the Cows! Cyberinfrastructure for the Rest of UsSave the Cows! Cyberinfrastructure for the Rest of Us
Save the Cows! Cyberinfrastructure for the Rest of UsDorothea Salo
 

Viewers also liked (20)

Occupy Copyright!
Occupy Copyright!Occupy Copyright!
Occupy Copyright!
 
Open Sesame (and other open movements)
Open Sesame (and other open movements)Open Sesame (and other open movements)
Open Sesame (and other open movements)
 
Manufacturing Serendipity
Manufacturing SerendipityManufacturing Serendipity
Manufacturing Serendipity
 
So are we winning yet?
So are we winning yet?So are we winning yet?
So are we winning yet?
 
Soylent Semantic Web Is People! (with notes)
Soylent Semantic Web Is People! (with notes)Soylent Semantic Web Is People! (with notes)
Soylent Semantic Web Is People! (with notes)
 
Metadata
MetadataMetadata
Metadata
 
RDF, RDA, and other TLAs
RDF, RDA, and other TLAsRDF, RDA, and other TLAs
RDF, RDA, and other TLAs
 
I own copyright, so I pwn you!
I own copyright, so I pwn you!I own copyright, so I pwn you!
I own copyright, so I pwn you!
 
Encryption
EncryptionEncryption
Encryption
 
So are we winning yet?
So are we winning yet?So are we winning yet?
So are we winning yet?
 
A Successful Failure: Community Requirements Gathering for DSpace
A Successful Failure: Community Requirements Gathering for DSpaceA Successful Failure: Community Requirements Gathering for DSpace
A Successful Failure: Community Requirements Gathering for DSpace
 
Who owns our work?
Who owns our work?Who owns our work?
Who owns our work?
 
Even the Loons are Licensed
Even the Loons are LicensedEven the Loons are Licensed
Even the Loons are Licensed
 
Solving Problems with Web 2.0
Solving Problems with Web 2.0Solving Problems with Web 2.0
Solving Problems with Web 2.0
 
Lipstick on a Pig: Integrated Library Systems
Lipstick on a Pig: Integrated Library SystemsLipstick on a Pig: Integrated Library Systems
Lipstick on a Pig: Integrated Library Systems
 
What's Driving Open Access?
What's Driving Open Access?What's Driving Open Access?
What's Driving Open Access?
 
Grab a bucket! It's raining data!
Grab a bucket! It's raining data!Grab a bucket! It's raining data!
Grab a bucket! It's raining data!
 
So you think you know libraries
So you think you know librariesSo you think you know libraries
So you think you know libraries
 
Grab a bucket! It's raining data!
Grab a bucket! It's raining data!Grab a bucket! It's raining data!
Grab a bucket! It's raining data!
 
Save the Cows! Cyberinfrastructure for the Rest of Us
Save the Cows! Cyberinfrastructure for the Rest of UsSave the Cows! Cyberinfrastructure for the Rest of Us
Save the Cows! Cyberinfrastructure for the Rest of Us
 

Similar to Preservation and institutional repositories for the digital arts and humanities

POWRR Tools: Lessons learned from an IMLS National Leadership Grant
POWRR Tools: Lessons learned from an IMLS National Leadership GrantPOWRR Tools: Lessons learned from an IMLS National Leadership Grant
POWRR Tools: Lessons learned from an IMLS National Leadership GrantLynne Thomas
 
Write a better FM
Write a better FMWrite a better FM
Write a better FMRich Bowen
 
A collaborative approach to "filling the digital preservation gap" for Resear...
A collaborative approach to "filling the digital preservation gap" for Resear...A collaborative approach to "filling the digital preservation gap" for Resear...
A collaborative approach to "filling the digital preservation gap" for Resear...Jenny Mitcham
 
A collaborative approach to filling the digital preservation gap for RDM
A collaborative approach to filling the digital preservation gap for RDMA collaborative approach to filling the digital preservation gap for RDM
A collaborative approach to filling the digital preservation gap for RDMnortherncollaboration
 
PyData Texas 2015 Keynote
PyData Texas 2015 KeynotePyData Texas 2015 Keynote
PyData Texas 2015 KeynotePeter Wang
 
“Filling the digital preservation gap” an update from the Jisc Research Data ...
“Filling the digital preservation gap”an update from the Jisc Research Data ...“Filling the digital preservation gap”an update from the Jisc Research Data ...
“Filling the digital preservation gap” an update from the Jisc Research Data ...Jenny Mitcham
 
2010 AIRI Petabyte Challenge - View From The Trenches
2010 AIRI Petabyte Challenge - View From The Trenches2010 AIRI Petabyte Challenge - View From The Trenches
2010 AIRI Petabyte Challenge - View From The TrenchesGeorge Ang
 
"Startups, comment gérer une équipe de développeurs" par Laurent Cerveau
"Startups, comment gérer une équipe de développeurs" par Laurent Cerveau"Startups, comment gérer une équipe de développeurs" par Laurent Cerveau
"Startups, comment gérer une équipe de développeurs" par Laurent CerveauTheFamily
 
"Filling the Digital Preservation Gap" with Archivematica
"Filling the Digital Preservation Gap" with Archivematica"Filling the Digital Preservation Gap" with Archivematica
"Filling the Digital Preservation Gap" with ArchivematicaJenny Mitcham
 
Write A Better FM - Ohio Linux 2011
Write A Better FM - Ohio Linux 2011Write A Better FM - Ohio Linux 2011
Write A Better FM - Ohio Linux 2011Rich Bowen
 
Intro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & MuseumsIntro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & MuseumsJon Voss
 

Similar to Preservation and institutional repositories for the digital arts and humanities (20)

POWRR Tools: Lessons learned from an IMLS National Leadership Grant
POWRR Tools: Lessons learned from an IMLS National Leadership GrantPOWRR Tools: Lessons learned from an IMLS National Leadership Grant
POWRR Tools: Lessons learned from an IMLS National Leadership Grant
 
Write a better FM
Write a better FMWrite a better FM
Write a better FM
 
A collaborative approach to "filling the digital preservation gap" for Resear...
A collaborative approach to "filling the digital preservation gap" for Resear...A collaborative approach to "filling the digital preservation gap" for Resear...
A collaborative approach to "filling the digital preservation gap" for Resear...
 
A collaborative approach to filling the digital preservation gap for RDM
A collaborative approach to filling the digital preservation gap for RDMA collaborative approach to filling the digital preservation gap for RDM
A collaborative approach to filling the digital preservation gap for RDM
 
Carl idigpres
Carl idigpresCarl idigpres
Carl idigpres
 
CARLIdigpres
CARLIdigpresCARLIdigpres
CARLIdigpres
 
PyData Texas 2015 Keynote
PyData Texas 2015 KeynotePyData Texas 2015 Keynote
PyData Texas 2015 Keynote
 
“Filling the digital preservation gap” an update from the Jisc Research Data ...
“Filling the digital preservation gap”an update from the Jisc Research Data ...“Filling the digital preservation gap”an update from the Jisc Research Data ...
“Filling the digital preservation gap” an update from the Jisc Research Data ...
 
2010 AIRI Petabyte Challenge - View From The Trenches
2010 AIRI Petabyte Challenge - View From The Trenches2010 AIRI Petabyte Challenge - View From The Trenches
2010 AIRI Petabyte Challenge - View From The Trenches
 
The future of the DCC
The future of the DCCThe future of the DCC
The future of the DCC
 
"Startups, comment gérer une équipe de développeurs" par Laurent Cerveau
"Startups, comment gérer une équipe de développeurs" par Laurent Cerveau"Startups, comment gérer une équipe de développeurs" par Laurent Cerveau
"Startups, comment gérer une équipe de développeurs" par Laurent Cerveau
 
Can you Cope
Can you CopeCan you Cope
Can you Cope
 
"Filling the Digital Preservation Gap" with Archivematica
"Filling the Digital Preservation Gap" with Archivematica"Filling the Digital Preservation Gap" with Archivematica
"Filling the Digital Preservation Gap" with Archivematica
 
Library Linked Data
Library Linked DataLibrary Linked Data
Library Linked Data
 
Data Storage & Preservation
Data Storage & PreservationData Storage & Preservation
Data Storage & Preservation
 
Write A Better FM - Ohio Linux 2011
Write A Better FM - Ohio Linux 2011Write A Better FM - Ohio Linux 2011
Write A Better FM - Ohio Linux 2011
 
Binary crosswords
Binary crosswordsBinary crosswords
Binary crosswords
 
Storing and sharing
Storing and sharingStoring and sharing
Storing and sharing
 
SHAREmodule1
SHAREmodule1SHAREmodule1
SHAREmodule1
 
Intro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & MuseumsIntro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & Museums
 

More from Dorothea Salo

Soylent SemanticWeb Is People!
Soylent SemanticWeb Is People!Soylent SemanticWeb Is People!
Soylent SemanticWeb Is People!Dorothea Salo
 
Privacy and libraries
Privacy and librariesPrivacy and libraries
Privacy and librariesDorothea Salo
 
The Canonically Bad (Digital) Humanities Proposal (and how to avoid it)
The Canonically Bad (Digital) Humanities Proposal (and how to avoid it)The Canonically Bad (Digital) Humanities Proposal (and how to avoid it)
The Canonically Bad (Digital) Humanities Proposal (and how to avoid it)Dorothea Salo
 
Is this BIG DATA which I see before me?
Is this BIG DATA which I see before me?Is this BIG DATA which I see before me?
Is this BIG DATA which I see before me?Dorothea Salo
 
Research Data and Scholarly Communication
Research Data and Scholarly CommunicationResearch Data and Scholarly Communication
Research Data and Scholarly CommunicationDorothea Salo
 
Research Data and Scholarly Communication (with notes)
Research Data and Scholarly Communication (with notes)Research Data and Scholarly Communication (with notes)
Research Data and Scholarly Communication (with notes)Dorothea Salo
 
Librarians love data!
Librarians love data!Librarians love data!
Librarians love data!Dorothea Salo
 
Taming the Monster: Digital Preservation Planning and Implementation Tools
Taming the Monster: Digital Preservation Planning and Implementation ToolsTaming the Monster: Digital Preservation Planning and Implementation Tools
Taming the Monster: Digital Preservation Planning and Implementation ToolsDorothea Salo
 
Avoiding the Heron's Way
Avoiding the Heron's WayAvoiding the Heron's Way
Avoiding the Heron's WayDorothea Salo
 
Manufacturing Serendipity
Manufacturing SerendipityManufacturing Serendipity
Manufacturing SerendipityDorothea Salo
 
Databases, Markup, and Regular Expressions
Databases, Markup, and Regular ExpressionsDatabases, Markup, and Regular Expressions
Databases, Markup, and Regular ExpressionsDorothea Salo
 

More from Dorothea Salo (14)

Soylent SemanticWeb Is People!
Soylent SemanticWeb Is People!Soylent SemanticWeb Is People!
Soylent SemanticWeb Is People!
 
Privacy and libraries
Privacy and librariesPrivacy and libraries
Privacy and libraries
 
Paying for it
Paying for itPaying for it
Paying for it
 
The Canonically Bad (Digital) Humanities Proposal (and how to avoid it)
The Canonically Bad (Digital) Humanities Proposal (and how to avoid it)The Canonically Bad (Digital) Humanities Proposal (and how to avoid it)
The Canonically Bad (Digital) Humanities Proposal (and how to avoid it)
 
Is this BIG DATA which I see before me?
Is this BIG DATA which I see before me?Is this BIG DATA which I see before me?
Is this BIG DATA which I see before me?
 
FRBR and RDA
FRBR and RDAFRBR and RDA
FRBR and RDA
 
Research Data and Scholarly Communication
Research Data and Scholarly CommunicationResearch Data and Scholarly Communication
Research Data and Scholarly Communication
 
Research Data and Scholarly Communication (with notes)
Research Data and Scholarly Communication (with notes)Research Data and Scholarly Communication (with notes)
Research Data and Scholarly Communication (with notes)
 
Librarians love data!
Librarians love data!Librarians love data!
Librarians love data!
 
Taming the Monster: Digital Preservation Planning and Implementation Tools
Taming the Monster: Digital Preservation Planning and Implementation ToolsTaming the Monster: Digital Preservation Planning and Implementation Tools
Taming the Monster: Digital Preservation Planning and Implementation Tools
 
Avoiding the Heron's Way
Avoiding the Heron's WayAvoiding the Heron's Way
Avoiding the Heron's Way
 
Manufacturing Serendipity
Manufacturing SerendipityManufacturing Serendipity
Manufacturing Serendipity
 
Open Content
Open ContentOpen Content
Open Content
 
Databases, Markup, and Regular Expressions
Databases, Markup, and Regular ExpressionsDatabases, Markup, and Regular Expressions
Databases, Markup, and Regular Expressions
 

Recently uploaded

New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????blackmambaettijean
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 

Recently uploaded (20)

New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 

Preservation and institutional repositories for the digital arts and humanities

  • 1. Dorothea Salo University of Wisconsin salo@wisc.edu
  • 2. Institutional repositories for the digital arts and humanities Dorothea Salo University of Wisconsin salo@wisc.edu
  • 3. Preservation for the digital arts and humanities Dorothea Salo University of Wisconsin salo@wisc.edu
  • 4. Dorothea Salo University of Wisconsin salo@wisc.edu Preservation and institutional repositories for the digital arts and humanities
  • 5. And I said... ... you’re giving me how much time for this?
  • 6. Environment •As several of you are intimately aware, higher ed is trying to figure out What To Do About Data. •This spells opportunity... IF you can get a seat at the table, and IF you know what to ask for! • Humanists will not be the first people they think of, sadly. •Serious (insoluble?) problem: data diversity • Expect compromise “solutions.” •Do not let IT pros intimidate you. • They do not know everything they think they know.
  • 7. PICK SOFTWARE LAST. Friendly word of advice: Photo: “Briana Calderon; future educator of america.” http://www.flickr.com/photos/46132085@N03/4703617843/ Arielle Calderon / CC-BY 2.0
  • 8. IT’S WHAT THE SOFTWARE WON’T DO. It’s not what the software does that’ll kill you. Photo: “Briana Calderon; future educator of america.” http://www.flickr.com/photos/46132085@N03/4703617843/ Arielle Calderon / CC-BY 2.0
  • 9. DON’T CHASE THE SHINY. Another friendly word of advice: Photo: “Sparkle Texture” http://www.flickr.com/photos/abbylanes/3214921616/ Abby Lane / CC-BY 2.0
  • 10. it’s much less shiny. In five years... Photo: “Sparkle Texture” http://www.flickr.com/photos/abbylanes/3214921616/ Abby Lane / CC-BY 2.0
  • 11. it’s not shiny at all. In ten years... Photo: “Sparkle Texture” http://www.flickr.com/photos/abbylanes/3214921616/ Abby Lane / CC-BY 2.0
  • 12. In twenty years... it’s probably useless.
  • 13. NOT A SOLUTION: your graduate students •You have a bright, tech-savvy grad student. •She builds an Awesome Tech Thing. •You have no idea how it works. •She graduates. You’re hosed. • Because she didn’t (know how to) build it sustainably... • Because you don’t have any documentation... • Because nobody made contingency plans for it... •I have seen this pattern over and over again. • It’s killed more digital culture and research materials than anything I can think of in academe.
  • 14. Am I saying “don’t experiment?” •Nah, of course not. •I’m saying “know what an experiment means.” •I’m saying “don’t mistake an experiment for an archive.” •I’m saying “don’t experiment and then expect everybody else to pick up your pieces because you didn’t plan for metadata or preservation.”
  • 15. That said? •You gotta do what you gotta do. •Some friendly advice: • Know where the exits are. (Can you export your data? In a reusable format?) • When you finish a project, USE that export. • Triply true if you’re relying on the cloud! •Your overriding goal, while a project is in progress: keep your eventual options open! •Long-term... is a totally other kettle of fish.
  • 16. Your best strategy •The single best strategy for a digital humanist concerned about long-term preservation... •... is to figure out how to make it Somebody Else’s Problem. • Right now, this is hard. I do believe it will get easier. •It’s a lot easier to figure this out from the start than at the end. • Different Somebody Elses will have different things that they want. If you know that from the get-go, you’re much better off.
  • 19. Institution-internal solutions •Rolling your own • Please don’t, if you can possibly avoid it.
  • 20. Institution-internal solutions •Rolling your own • Please don’t, if you can possibly avoid it. •Adopting open-source software
  • 21. Institution-internal solutions •Rolling your own • Please don’t, if you can possibly avoid it. •Adopting open-source software • e.g. Omeka, Dataverse, ArchivesSpace...
  • 22. Institution-internal solutions •Rolling your own • Please don’t, if you can possibly avoid it. •Adopting open-source software • e.g. Omeka, Dataverse, ArchivesSpace... • Better, but not foolproof. Upgrades? Security? Backups?
  • 23. Institution-internal solutions •Rolling your own • Please don’t, if you can possibly avoid it. •Adopting open-source software • e.g. Omeka, Dataverse, ArchivesSpace... • Better, but not foolproof. Upgrades? Security? Backups? • Writing plugins/mods = rolling your own. Avoid if possible.
  • 24. Institution-internal solutions •Rolling your own • Please don’t, if you can possibly avoid it. •Adopting open-source software • e.g. Omeka, Dataverse, ArchivesSpace... • Better, but not foolproof. Upgrades? Security? Backups? • Writing plugins/mods = rolling your own. Avoid if possible. •Adopting institutional infrastructure
  • 25. Institution-internal solutions •Rolling your own • Please don’t, if you can possibly avoid it. •Adopting open-source software • e.g. Omeka, Dataverse, ArchivesSpace... • Better, but not foolproof. Upgrades? Security? Backups? • Writing plugins/mods = rolling your own. Avoid if possible. •Adopting institutional infrastructure • Make sure it’ll survive your departure from the institution!
  • 27. Outside the institution •Lists of data repositories
  • 28. Outside the institution •Lists of data repositories • Databib: http://databib.org/
  • 29. Outside the institution •Lists of data repositories • Databib: http://databib.org/ • re3data: http://re3data.org/
  • 30. Outside the institution •Lists of data repositories • Databib: http://databib.org/ • re3data: http://re3data.org/ • N.b. you will find less here on the humanities than you would probably prefer. Long story.
  • 31. Outside the institution •Lists of data repositories • Databib: http://databib.org/ • re3data: http://re3data.org/ • N.b. you will find less here on the humanities than you would probably prefer. Long story. •Figshare
  • 32. Outside the institution •Lists of data repositories • Databib: http://databib.org/ • re3data: http://re3data.org/ • N.b. you will find less here on the humanities than you would probably prefer. Long story. •Figshare • ... and other web services springing up, e.g. omeka.net
  • 33. You will be limited by... •Infrastructure your library/IT has already committed to • this is why you want to be in on ground-floor discussions! •Their willingness and ability to tweak, rewrite, or replace it with something suiting your needs •Your willingness and ability to evaluate, install, and maintain a software stack that suits you • ... perhaps indefinitely! •The availability of hosted solutions, and your ability to pay for them (perhaps indefinitely!)
  • 34. You need to know what the options are like. •Your library and IT folks may well need guidance. At minimum, they need clearly- expressed requirements. •The requirements you give them need to go beyond end-user access, use, and UI. • Back end: getting material in as efficiently as possible, allowing for additions/changes/deletions • Preservation requirements • Data and metadata purity, clarity, preservability, reusability, mashuppability, migratability, standards
  • 36. What’s an IR? •“‘[A]ttics’ (and often fairly empty ones), with random assortments of content of questionable importance” • Brown, Griffiths, Rascoff, “University publishing in a digital age.” Ithaka 2007. http://www.sr.ithaka.org/research- publications/university-publishing-digital-age •A basic digital preservation-and-access platform designed to allow faculty to deposit and describe single PDFs. •Quite commonly available in research libraries or through library consortia. • You probably have one available to you!
  • 37. IR software •Open source • Fedora Commons: http://fedora-commons.info/ (you’ll need a layer on top of this) • DSpace: http://dspace.org/ • EPrints: http://eprints.org/ •Commercial • ContentDM: http://contentdm.com/ • DigiTool: http://www.exlibrisgroup.com/category/ DigiToolOverview •Hosted • ContentDM: http://contentdm.com/ • BePress: http://bepress.com/
  • 38. Two minutes! •Find an IR available to you for depositing content.
  • 39. You can typically expect... •To get in touch with someone in the library to get an account set up, and a space for you to deposit into • Have a collection name and description ready. • Default descriptors, if you have any, also a good idea. • Need access controls? To delegate deposit? Talk about this. •To be able to put materials in on your own, through web forms • To find the deposit process fiddly and annoying •To have material appear on the web right after deposit.
  • 40. IRs work for... •Small(ish), discrete files that never change • So an Excel-using researcher is just fine with an IR. •Documentation for data held elsewhere •Some IRs can handle static website captures. •Files with uncomplicated IP lives • ... which complicates the “static website” question. •Access restriction may be possible, as may dark archiving; it depends on the IR platform. Expect it to be annoying to implement, though.
  • 41. IRs don’t work for •Really Big Data • including, sometimes, audio and video • This is less a reflection on IR software than of most IRs being horribly underprovisioned with storage and bandwidth. •Work in progress; files that may change or be updated •Complex digital objects (except static websites) •Digital objects that need interactivity • Even something as simple as video streaming. IRs can’t. •Anything that needs a DOI. (You’ll get a permanent identifier, but it won’t be a DOI.) •Datasets where the researcher wants to vet any potential reusers
  • 43. Digital-library software • Omeka, Greenstone (aging), ContentDM... • Again, chances are your library already has some kind of digital-collections software. • Go ask a librarian what it is, and whether you can add material to it! • Also ask if it’s attached to any kind of digitization or metadata-help service. It may not be, but you never know. • If not, there are hosted options • if you’re prepared to pay for them indefinitely. • Designed for image exhibitions • May extend to audio and video, but UI won’t be ideal. • May extend to page-scanned books, but may not. (Omeka is terrible at these.)
  • 44. Be aware •The digital-preservation underpinnings of this class of software are weak to nonexistent. • It’s designed for exhibiting, not for archiving! • It may also entice you into poor sustainability decisions, such as using web-friendly but lossy JPG as your master image format. Or not making backups. •On the plus side, though, if it’s a library service, the library feels an institutional commitment to the materials in it. • That’s a lot of the preservation battle won, right there.
  • 45. Emerging solutions •Often involve combining software to attack different parts of the problem • Preservation underlayers: Fedora Commons, microservices • Deposit and management UI: Hydra, Islandora • End-user UI: Hydra, Islandora, Omeka, plugins, mods, etc. •Are still pretty DIY at this point • If your library is doing active development, you’re one of the lucky ones. • The rest of you may have to wait. And lobby.
  • 46. Archives platforms •Designed for coping with an undifferentiated mess of random digital stuff. • I know, right? Nobody has that problem... •Not usually designed to help other people use or interact with that stuff. • Also, designed for archivists’ ways of thinking. Archivists are humanists, but not all humanists are archivists. • Worth getting a software tour from an archivist! •Archivematica, ArchivesSpace (in beta), Duke Data Accessioner, CollectiveAccess, BitCurator
  • 47. Data-management platforms •Usually designed for the sciences, not the humanities! • But that doesn’t necessarily mean they won’t work for what you have in mind. • (“E-lab notebooks” will probably feel pretty foreign, though.) •Look at Dataverse Network, http://thedata.org/ •Gigantor lists of everything ever: • http://www.dcc.ac.uk/resources/external/tools-services/ archiving-and-preserving-information • http://dirt.projectbamboo.org/categories/publishing • http://foss4lib.org/packages • Less helpful than you might think; there’s rarely any decision apparatus alongside.
  • 48. Dorothea’s cantankerous, crabby, cynical, crude, choleric, churlish, other-words-beginning-with-C take on digital humanists working with librarians and IT pros
  • 49. Neil Gaiman on George R.R. Martin and his eager fans CENSORED CENSORED From: http://journal.neilgaiman.com/2009/05/entitlement-issues.html
  • 50. Digital humanists: Librarians and IT professionals are not your bitch.CENSORED
  • 51. Not entirely your fault! •$$$ is a consideration, unfortunately • The sciences have it. Unfortunately. •Your colleagues may have poisoned the well by being prima donnas, even though you’re not! •Different professional-advancement infrastructure •We may just. not. be. ready. • Or the infrastructure we rely on may not be.
  • 52. DH and libraries should be friends •Involve the library from the outset. • Please do NOT ask us to pick up your messes at the end! • Expect us to have work for you to do, and quality expectations. • Yes, I know that’s how it used to work with analog materials and archivists. Digital is different. •Come to us in groups. • We serve all of campus. We cannot afford to move heaven and earth for any one person. Please don’t be a prima donna! • At minimum, have an idea how what you’re asking will concretely benefit other campus constituencies. •Offer quid pro quo. What’s in it for us? • (Library advocacy in high places is always a good trade.) •Be patient, please. • We don’t turn on a dime.
  • 53. Will this always work? •Sadly, no. •A good many libraries are just not ready to take digital preservation and DH support seriously. • The presence of a “DH center” in the library is not always proof of serious intent. •Others have been burned before. •Still others are skeptics. •I can’t promise you’ll find help in the library, or with campus IT. I can promise you won’t if you don’t seriously approach them. • (Miriam Posner’s article is a must-read!)