6. Challenge for Institutions
Data is expensive
Time, instrumentation, inability to reproduce
Increasing regulation
Granting agencies and journals require submission
Inadequate training
No formal data management curriculum
Preservation is not a priority
Where is the Institutional Repository?
Data management plans (DMPs) and Policies
A challenge for research administration and
compliance, but really just the tip of the iceberg
7. Roles for Libraries
“What role do the research and academic
libraries envision for themselves and do
scientists envision for librarians in a digital
data framework…?”
- To Stand the Test of Time…(2006) ARL. p.24
8. In the Beginning: Purdue University
2004 initiative for Libraries to
collaborate with faculty across
campus—apply library science
knowledge and expertise to
research problems: manage,
organize, describe, disseminate,
preserve information.
Particular emphasis on addressing
Jim Mullins data curation issues.
Dean of Libraries
Purdue University
9. The Data Research Scientist
Help the Libraries move research
strategically forward.
Help ramp up interaction with
research faculty on campus.
Leverage interdisciplinary
research collaborations.
Address the social, cultural and
organizational aspects of data
curation.
10. Trends in Academic Libraries
Data Research Scientist Research Data Management Librarian
Lorry I. Lokey Science Data Services Librarian Science Data Librarian
11. Why the Library?
Neutral: works across the entire institution
Strong in relationship building: has experience
fostering discussion and relationships, and
cultivates an existing support network
Intellectual Property experts: has dealt with
copyright, can translate to data
Service-oriented: uniquely positioned as an
intellectual service unit within the institution
12. Access. Knowledge. Success
Research traditionally moves in this direction
“published” unpublished “published” published secondary
data/ research research research tertiary
datasets traditional/non non-traditional traditional resources
analyzed
data/ Library service traditionally moves in this direction
datasets
processed Given the changing nature of research and
data/ scholarly communication (i.e., e-research),
datasets how do librarians change to adapt to new
ideas regarding what they collect to support
“raw” research & education and how they provide
data/ access to it?
datasets Modified from: Brandt, D.S. “Scholarly Communication” (in To Stand the Test of Time: Long-Term Stewardship
of Digital Data Sets in Science and Engineering.: Final Report of Workshop New Collaborative Relationships:
Academic Libraries in the Digital Data Universe. ARL, Washington, DC, September 2006.)
13. Research Life Cycle
Data Re- Data Deposit
Discovery Use
Archive
Proposal Project Data Data Data End of
Planning Start Up Collection Analysis Sharing Project
Writing
Re-
Purpose
Data Life Cycle
14. Questions?
Sherry Lake
Senior Scientific Data Consultant, UVA Library
shlake@virginia.edu
Twitter: shlakeuva
Web: http://www.lib.virginia.edu/brown/data
15. References
US National Science Board. Long-Lived Digital Data
Collections Enabling Research and Education in the 21st
Century. Septem- ber 2005. http://www.nsf.gov/pubs/2005/
nsb0540/nsb0540_1.pdf
Association of Research Libraries. To Stand the Test of Time:
Long-term Stewardship of Digital Data Sets in Science and
Engineering. A report to the National Science Foundation from
the ARL Workshop on New Collaborative Relationships: the
Role of Academic Libraries in the Digital Data Universe.
September 26-27, 2006, Arlington, VA. http://www.arl.org/
bm~doc/digdatarpt.pdf
16. Image References
Title Slide
1. http://www.flickr.com/photos/grantloy/4577061495/ By grant_loy
2. http://www.flickr.com/photos/catchesthelight/3187651801/ By catchesthelight
3. http://www.flickr.com/photos/stevenm_61/2673806520/ By StevenM_61
4. http://www.flickr.com/photos/gregory-moine/4302464123/ By Gregory Moine
5. http://www.flickr.com/photos/54485353@N05/5784339186/ By janna487
Slide 2
http://www.guardian.co.uk/news/datablog/2011/jul/07/government-transparency-data-releases
http://www.osc.edu/research/csd/
http://blogs.ischool.utexas.edu/digitalcuration/2010/09/26/a-survey-of-scientific-data-repositories/
Slide 5
http://s184.photobucket.com/albums/x311/mdreamweaver/?action=view¤t=under_pressure.jpg&
http://digiex.net/attachments/general-discussions/computer-technology-discussion/1174d1231879288-post-your-
desktops-pc-desktop.jpg
http://faculty.virginia.edu/brodie/butch.html
http://amazon.evsc.virginia.edu/gallery23/main.php/v/equipment/PHCK_Met_2004/PC170012.jpg.html
http://www.cardcow.com/269117/national-radio-astronomy-observatory-300-foot-transit-telescope-green-bank-west-
virginia/
http://www.azoquantum.com/equipment-details.aspx?EquipID=103
http://grenkblog.com/wp-content/uploads/2011/01/e-waste1.jpg
Slide 8
http://www.krannert.purdue.edu/konline/2012s/viewpoints/mullins-p1.html
Editor's Notes
Extending our Reach: Librarians Supporting the Data Life CycleCC BY-SA 3.0This work is licensed under a Creative Commons Attribution-ShareAlike 3.0 Unported License
Open data: growing governmental interest in making publicly-funded research more transparent and more available (NIH, NSF)The Office of Management and Budget (OMB) Circular A-110 provides the federal administrative requirements for grants and agreements with institutions of higher education, hospitals and other non-profit organizations. In1999, revised to provide public access under some circumstances to research data through the Freedom of Information Act (FOIA). Funding agencies have implemented the OMB requirement in various ways. Broader critical review: greater interest evaluating original research data (Nature)Technological advances: sharing of research results easier and faster (Repositories, Web 2.0)Reuse/preservation of research data: increased consideration of the cost and value of research data and need to ensure its longevityhttp://www.guardian.co.uk/news/datablog/2011/jul/07/government-transparency-data-releasesScience (December 2, 2011, Volume: 334, Issue: 6060) with a special section focusing on data replication and reproducibility in the sciences http://pineda-krch.com/2011/12/16/special-section-in-science-on-reproducible-research/Cyberinfrastructure and Software Development Researchhttp://www.osc.edu/research/csd/http://www.anneofcarversville.com/fp/liberals-have-different-dopamine-gene-more-friends-say-scien.htmlhttp://blogs.ischool.utexas.edu/digitalcuration/2010/09/26/a-survey-of-scientific-data-repositories/
Publishers are beginning to require authors to submit datasets with their completed manuscripts Sustainable Digital Data Preservation and Access Network Partners (DataNet)NSF DataNET Awards: to create a set of national data research infrastructure organizations Major DataONE grant ($20 million) Global data access and preservation network for environmental sciences Now seeking Librarians/IT people to work on the implementation, coordination, collaboration aspectsDC Lead by Library at Johns Hopkins Univ.Funding agencies also require faculty to include a data management plan as a component of grant applications/NIH since 2003 Data Sharing Requirement for awards over $500,000NSF require that grantee share their data within a reasonable length of time ( since 2002)Reduces cost if share data.
National Science Board’s Report to the National Science Foundation (Sept. 2005)Long-Lived Digital Data Collections: Enabling Research and Education in the 21st CenturyDetailed Roles and Responsibilities of researchers and institutions. Purdue’s Data Research Scientist was based on the Data ScientistsDescribed in the report as those who are crucial to the successful management of digital data collections (described as librarians)From 2006, ARL Workshop:To Stand the Test of Time: Long-term Stewardship of Digital Data Sets in Science and EngineeringResearch libraries should provide access both to the body of scientific literature and to the dataupon which this literature is based.November 2007, ARL Task Force on Library Support for E-Science final Report was published. Charge of the Task Force focused on raising awareness and positioning researchlibraries to be players in this new arenaOUTCOME 4: Research libraries as active participants in the conceptualization anddevelopment of research infrastructure, including systems and services to support theprocesses of research and the full life cycle of research assets.
Pressure to abide by regulations by granting agencies and publishers/journalsChanging regulations, confusing regulations. Pressure to bring in the money, need grantsLack of formal data management training/understandingMany scientists do not know how to mange their data, now they have labs with more students, working collaborative across institutions.Not aware of problems with Data Management until someone leaves. Or until someone asks for their data. Or worse, their data is destroyed/lost!Not a priority for most researchers (takes time away from the work that is rewarded) Data Management takes time, away from what they like doing… the research, the science. They would rather be doing this than managing their dataData is expensive (time, instrumentation, inability to reproduce)Instrumentation is expensive, when you buy time on a telescope, that data is irriplaceable. Expensive, Research data is being lost because of poor data management and lack of institutional repositoriesResearchers have collected all this “data” during their project and now what do they do with it. Where will they put it, even if they want to. More often they create webpages. URL breaks.
If we think about it Institutions, like UVa, face the same challenges that researchers face.ProblemResearch is becoming more data driven.Increasingly, data has research value beyond the project it was generated for. Data Deluge – the amount of data being produced is increasing exponentially, and the systems, tools and infrastructures to organize, manage, disseminate and preserve data have not kept pace.
Early 2009, We saw Data Services (Research … diff. variations) an emerging service at a variety of peer institutions. These institutions set priorities with data services consistent with their research library missions.Other Institutions used their existing librarians/liaisons and created services and web pages to help support Data Management.These new services were behind the UK (creation of its Digital Curation Center to help support HE) and Australia (National Data Service).
Researchers?VPR? (Vice President for Research)CIO? (Chief Information Officer/IT)OSP? (Office of Sponsored Programs)UL? (University Library)
To date, Subject Librarians have supported the needs of researchers through relatively traditional services revolving around information discovery, collection development and some elements of information management. publication process tailored to deliver the peer-reviewed scholarly article shifting from a focus on managing information in its published form to managing the digital data sets on which findings may be published.Think about traditional libraries being involved in the downstream (the publication). Preserving the data is an “Upstream” role – to have a say (input) into what is preservered and how. We can have a greater influence here so it would be easier to handle downstream.Libraries have long played critical roles within this ecosystem, in the pre-digital world of knowledge-making and knowledge-sharing. Libraries’ role as custodians of “downstream” knowledge – the reports of research communicated in articles and books – is well established. Libraries also contribute to the knowledge lifecycle through less visible but crucial interventions “upstream,” by advising researchers and teaching new scholars how to use the communicative apparatus of their field.
Literature shows currently: evidence for a shift in the nature of support and services, which are becoming more diverse and extensive, and moving beyond purely information-‐related activities and towards a greater emphasis on research data management in particular. clear trend towards providing support for research that is driven more by the requirements of researchers than it has been in the recent past, and a movement in some institutions towards a more proactive model of engagement with researchers.