Hot Topics Web Seminar Series: Research
Data in Repositories
The UC San Diego Experience

Third Webinar: The Researcher Perspective
Reminder: General Series Info
• First webinar: Intro and Framing: UC San Diego decisions and planning
• Second Webinar: Deep dive into technology and metadata
• Third Webinar: The perspective from researchers, next steps
Reminder: General Series Info

Slides and presentations from previous
webinars are available for download!
http://www.duraspace.org/hot-topics
Your esteemed presenters …
First webinar:

David Minor – Program Director, Research Data Curation
Declan Fleming - Chief Technology Strategist

Second webinar:
Declan Fleming - Chief Technology Strategist
Arwen Hutt - Metadata Librarian
Matt Critchlow - Manager of Development and Web Services

Third webinar:
David Minor – Program Director, Research Data Curation
Dick Norris – Professor, Scripps Institution of Oceanography
Rick Wagner – Data Scientist at San Diego Supercomputer Center
Today we will …

Discuss how researchers have approached
curation and data management
Reminder: UCSD Research Data Curation Pilots
• The Brain Observatory

• NSF OpenTopography Facility
• Levantine Archaeology Laboratory
• Scripps Institution of Oceanography
Geological Collections
• The Laboratory for Computational
Astrophysics
Richard Norris

Professor at Scripps Institution
of Oceanography
Rick Wagner
High Performance Computing
Manager at the San Diego
Supercomputer Center
Ph.D. Candidate within
The Laboratory for
Computational Astrophysics
SIO Geological Collections
Curator: Dick Norris
Collections Manager: Alexandra Hangsterfer
Part of the International Marine and Lacustrine Geological Collections
With collections at Columbia, Oregon State, Woods Hole, USGS and more
Our Collection: Sediment cores and rocks
recovered from the oceans & long-lived lakes
Reef sediment-Panama

Salton Sea-CA
How we get them….
Mostly by Sea
(Ship, Cruise, Leg)

But also by Land
Country, Locality, Lat/Long
Collection events
Recovering a Gravity Core to collect
seafloor sediments

Deploying a Dredge to
collect seafloor rocks
A collection event is an Object
and includes:
• Specimen(s) (Latitude/Longitude)
• Ship name and cruise number
• Text descriptions
• Thin-sections
• Images, field notes, publications
• Location in the repository
• International Geo Sample Number (IGSN)
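As a rough illustration of treating a collection event as a single object, the fields listed above could be modeled along these lines. This is only a sketch: the class name, field names, and the example values are all hypothetical and are not taken from the UC San Diego system.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class CollectionEvent:
    """One sampling event treated as a single object (hypothetical field names)."""
    igsn: str                      # International Geo Sample Number
    ship: str
    cruise: str
    latitude: float                # decimal degrees, north positive
    longitude: float               # decimal degrees, east positive
    description: str = ""
    repository_location: str = ""
    specimens: List[str] = field(default_factory=list)
    thin_sections: List[str] = field(default_factory=list)
    attachments: List[str] = field(default_factory=list)  # images, field notes, publications

    def __post_init__(self) -> None:
        # Reject coordinates that cannot be a real position.
        if not -90.0 <= self.latitude <= 90.0:
            raise ValueError(f"latitude out of range: {self.latitude}")
        if not -180.0 <= self.longitude <= 180.0:
            raise ValueError(f"longitude out of range: {self.longitude}")


# Made-up example values, for illustration only.
event = CollectionEvent(
    igsn="EXAMPLE0001",
    ship="Example Ship",
    cruise="EX-01",
    latitude=32.87,
    longitude=-117.25,
    description="Gravity core, seafloor sediment",
    specimens=["section 1", "section 2"],
)
```

Keeping the geographic position and the IGSN as required fields reflects the emphasis on geo-referencing and identifiers; everything else is optional so that sparse historical records can still be represented.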
The Sediment Core
Collection
Archive and Working
halves of ~7000 cores
from the world’s oceans
Typically 3-5
sections/core
+ core photos, chemical
data and sampling
history
The IODP Core collection, Bremen, Germany
The Marine Rock collection…
• ~4000 dredge sites worldwide
• In an 8000 sq ft building
• Volcanic rocks, manganese nodules, reef rock
Our data resides with NGDC…
• NOAA’s National Geophysical Data Center
• And IGSNs with Lamont’s SESAR
NGDC searches on ships, repositories,
sampling systems, and locations

But no keyword search, automated data input, ways to link associated
data, returns on nearest search terms, sampling history, etc….
What the Community Wants
• A unified National geo-referenced system
• Exploratory search by nearest word and map-based system
• Links to associated data types (images, text,
data, references…)
• All data types linked by IGSNs
• Data entry through web forms with
publication by curators
What we did with RCI
• Identified one type of object
– Based on sampling events
– Ship-Cruise-Sampling device-Sample number
– Geo-referenced
– Includes associated materials: text description,
images, chemical data, references, records of
sampling event, sampling records, storage location

• NGDC records imported into UC Library
system
• Records searchable by any word in a record
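"Searchable by any word in a record" can be pictured as an inverted index built over every field of every record. The sketch below is an assumption about how such a search might work in general, not a description of the UC Library system; the record ids and field values are made up.

```python
import re
from collections import defaultdict
from typing import Dict, List, Set


def tokenize(text: str) -> List[str]:
    """Lowercase a string and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(records: Dict[str, Dict[str, str]]) -> Dict[str, Set[str]]:
    """Map each word found in any field to the ids of the records containing it."""
    index: Dict[str, Set[str]] = defaultdict(set)
    for record_id, fields in records.items():
        for value in fields.values():
            for word in tokenize(value):
                index[word].add(record_id)
    return index


def search(index: Dict[str, Set[str]], query: str) -> Set[str]:
    """Return ids of records that contain every word in the query."""
    words = tokenize(query)
    if not words:
        return set()
    result = set(index.get(words[0], set()))
    for word in words[1:]:
        result &= index.get(word, set())
    return result


# Made-up records, for illustration only.
records = {
    "rec-1": {"ship": "Example Ship A", "device": "gravity core",
              "description": "carbonate ooze with foraminifera"},
    "rec-2": {"ship": "Example Ship B", "device": "dredge",
              "description": "manganese nodules and basalt"},
}
index = build_index(records)
matches = search(index, "dredge basalt")
```

Because the index ignores which field a word came from, a query can mix terms from the ship name, the sampling device, and the free-text description.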
What’s next?
• NSF-sponsored SESAR (System for Earth
Sample Registration)
– Created the International GeoSample Number
– http://www.geosamples.org/

• NSF-sponsored workshop:
– Digital Environment for Sample Curation (June
2013)
– http://www.geosamples.org/news/descwebinarmaterials

• NSF “EarthCube” initiative
CyberInfrastructure needs (from DESC)
• Offline data entry at sea or in the field
• DESC should respect data moratoriums (typically 2
years, if collected with NSF grants)
• Automated release to public at close of moratorium
• Secure login-based data serving for project scientists
• Flexible search and access for users to view public
archive (view by location name, type, bounding
region) and associated data
• Flexible sample request submission
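The moratorium and automated-release items above amount to a date calculation. A minimal sketch follows, assuming the moratorium is counted from the collection date and lasts two years; both are assumptions for illustration, since the slide says only "typically 2 years". The function names are hypothetical.

```python
from datetime import date

MORATORIUM_YEARS = 2  # assumption: "typically 2 years" for NSF-funded collection


def release_date(collected: date, years: int = MORATORIUM_YEARS) -> date:
    """Return the first day on which the data may be made public."""
    try:
        return collected.replace(year=collected.year + years)
    except ValueError:
        # Collected on Feb 29 and the target year is not a leap year:
        # fall back to Mar 1 of the target year.
        return date(collected.year + years, 3, 1)


def is_public(collected: date, today: date, years: int = MORATORIUM_YEARS) -> bool:
    """True once the moratorium has closed."""
    return today >= release_date(collected, years)


opens_on = release_date(date(2013, 6, 15))
```

A scheduled job could call `is_public` for each record and flip its visibility, which would give the automated release described above without curator intervention.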
More cyberInfrastructure needs
• Display stored datasets and images hosted on other
servers (as in other repositories)
• Connections with standard visualization tools such
as Corelyzer, Correlator, PSICAT, CoreRef, GMT,
GeoMapApp
• Sampling database should be easily accessible by
researchers to submit requests
• Automatically updated by repository (personnel) to
reflect samples sent to the researchers
• Way of entering historical sampling information
These are general issues for
Natural History Collections
• Most museums have similar issues to us
– Geo-Referenced collections
– Mix of physical specimens, images, text
descriptions, sampling data, and affiliated data
files
– Many have home-grown databases that are not
interoperable with other museums

Fish from the SIO Marine
Vertebrates Collection
Natural History Collections
• Need controlled vocabularies but flexibility to
search on variants
– Since nobody agrees on common vocabularies

• Value in cross-referencing to related
collections
– Such as samples (geology, biology, water)
collected on a cruise with ship track, sea floor
maps…
– Presently working on “Rolling deck to Repository”
NSF project
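One way to combine a controlled vocabulary with flexible searching on variants is a lookup table that maps each known variant to a preferred term. The sketch below assumes a small, made-up vocabulary; the terms and variants are illustrative, not an agreed community standard.

```python
from typing import Dict, List, Optional

# Hypothetical controlled vocabulary: preferred term -> known variants.
VOCABULARY: Dict[str, List[str]] = {
    "gravity core": ["gravity corer", "grav core", "gc"],
    "dredge": ["rock dredge", "dredge haul"],
}


def normalize(term: str) -> str:
    """Lowercase a term and collapse runs of whitespace."""
    return " ".join(term.lower().split())


def build_lookup(vocabulary: Dict[str, List[str]]) -> Dict[str, str]:
    """Map every preferred term and every variant to its preferred term."""
    lookup: Dict[str, str] = {}
    for preferred, variants in vocabulary.items():
        lookup[normalize(preferred)] = preferred
        for variant in variants:
            lookup[normalize(variant)] = preferred
    return lookup


LOOKUP = build_lookup(VOCABULARY)


def to_preferred(term: str) -> Optional[str]:
    """Return the preferred term for a search word, or None if it is unknown."""
    return LOOKUP.get(normalize(term))
```

Records are stored under the preferred term only, while searches pass through `to_preferred` first, so a user typing "Gravity Corer" still finds records catalogued as "gravity core".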
LCA
PI: Mike Norman
Current and Past
Students: Many
Research group focusing on numerical modeling of complex astrophysical
processes: cosmology, galaxy formation, turbulence, radiation hydrodynamics,
magneto-hydrodynamics, …

Image credit: NASA, IoA, A. Fabian et al.
Our simulations are large, based on the current definition of “large” (we grow
with the technology). Typical results are 1-100 TB.
This work is costly in terms of both the computer time and human effort, and we
see a benefit to the science community in sharing. (Citations are nice, too.)
http://bit.ly/sB30f1
http://bit.ly/IzTVV2
http://bit.ly/IE4iFd
http://bit.ly/HFYLQJ
Prior Sharing Efforts
Participation in the Virtual
Observatory
• Standards for simulation
metadata, search, and retrieval
• An odd fit beside the “pure”
astronomy projects and data
centers
• But, it meant we weren’t starting
from scratch in terms of describing
our data
Started the curation effort very
curious about how much of this
previous work would translate to
the library space
Also wanted a stable platform for data
hosting (e.g., not a closet server)
Curation Process

By E.gordienko (Own work) [CC-BY-SA-3.0
(http://creativecommons.org/licenses/by-sa/3.0) or GFDL
(http://www.gnu.org/copyleft/fdl.html)], via Wikimedia Commons

Several steps:
• Choosing the pilot dataset
• Cleaning up simulation cruft
• Identifying related publications
• Adding historical documents
(proposals, reports, etc.)
• Organizing various data groups
• Simulations are a collection of
datasets from various points in
time, so we needed a description for
each type of digital object in each
dataset
• Bundle, checksum, and handoff
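The "bundle, checksum, and handoff" step can be sketched as writing a checksum manifest next to the data and then packing everything into one archive. The slides do not say which tools or formats were actually used, so the SHA-256 digests, the manifest file name, and the tar format below are assumptions.

```python
import hashlib
import tarfile
from pathlib import Path
from typing import Dict

MANIFEST_NAME = "manifest-sha256.txt"  # hypothetical file name


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large simulation outputs need not fit in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def write_manifest(data_dir: Path) -> Dict[str, str]:
    """Write '<digest>  <relative path>' lines for every file under data_dir."""
    manifest_path = data_dir / MANIFEST_NAME
    checksums: Dict[str, str] = {}
    for path in sorted(data_dir.rglob("*")):
        if path.is_file() and path != manifest_path:
            checksums[path.relative_to(data_dir).as_posix()] = sha256_of(path)
    lines = [f"{digest}  {name}" for name, digest in checksums.items()]
    manifest_path.write_text("\n".join(lines) + "\n")
    return checksums


def bundle(data_dir: Path, archive: Path) -> Path:
    """Write the manifest, then pack data_dir into an uncompressed tar archive.

    The archive path should sit outside data_dir so the bundle does not
    try to include itself.
    """
    write_manifest(data_dir)
    with tarfile.open(archive, "w") as tar:
        tar.add(data_dir, arcname=data_dir.name)
    return archive
```

On the receiving side, the same `sha256_of` function can be run over the unpacked files and compared with the manifest to confirm that the handoff was complete and uncorrupted.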

Decided near the end to replicate
the metadata record to a second site
as a test of its portability
http://bit.ly/17yTc1n
Final result:
• Datasets from a high-resolution
cosmology simulation held at
UCSD
• Viewable both at UCSD, and via
the Online Archive of California
• Raw simulation data and various
analysis results accessible over
HTTP
Some thoughts:
• When it comes to metadata formats, libraries are like any other science
domain and speak their own language
• If you have a highly-specialized domain-specific metadata dialect or
language, you may need an additional discovery service
• If not, it’s a good starting point
• We’re working on repeating this process on our own for another simulation
Next steps at UC San Diego

Move from pilot services to a scalable series of processes.
Work with additional researchers in same domains.
Work with new domains.
Broaden lifecycle management
mindset on campus.
Questions?

Rick Wagner - rpwagner@sdsc.edu
Richard Norris - rnorris@ucsd.edu
David Minor - dminor@ucsd.edu
http://www.duraspace.org/hot-topics

3.28.18 "Open Source Repository Upgrades: Top Advice from Practitioners" Pres...3.28.18 "Open Source Repository Upgrades: Top Advice from Practitioners" Pres...
3.28.18 "Open Source Repository Upgrades: Top Advice from Practitioners" Pres...
 
2.28.18 Getting Started with Fedora presentation slides
2.28.18 Getting Started with Fedora presentation slides2.28.18 Getting Started with Fedora presentation slides
2.28.18 Getting Started with Fedora presentation slides
 
Digital Preservation in Production (DPN and DuraCloud Vault)
Digital Preservation in Production (DPN and DuraCloud Vault)Digital Preservation in Production (DPN and DuraCloud Vault)
Digital Preservation in Production (DPN and DuraCloud Vault)
 
3.15.17 DSpace: How to Contribute Webinar Slides
3.15.17 DSpace: How to Contribute Webinar Slides3.15.17 DSpace: How to Contribute Webinar Slides
3.15.17 DSpace: How to Contribute Webinar Slides
 
3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides
3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides
3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides
 
2.28.17 Introducing DSpace 7 Webinar Slides
2.28.17 Introducing DSpace 7 Webinar Slides2.28.17 Introducing DSpace 7 Webinar Slides
2.28.17 Introducing DSpace 7 Webinar Slides
 
DuraSpace is OPEN, OR2016
DuraSpace is OPEN, OR2016DuraSpace is OPEN, OR2016
DuraSpace is OPEN, OR2016
 
DuraSpace and LYRASIS CEO Town Hall Meeting -- April 29, 2016
DuraSpace and LYRASIS CEO Town Hall Meeting -- April 29, 2016DuraSpace and LYRASIS CEO Town Hall Meeting -- April 29, 2016
DuraSpace and LYRASIS CEO Town Hall Meeting -- April 29, 2016
 
DuraSpace and LYRASIS CEO Town Hall Meeting -- April 21, 2016
DuraSpace and LYRASIS CEO Town Hall Meeting -- April 21, 2016DuraSpace and LYRASIS CEO Town Hall Meeting -- April 21, 2016
DuraSpace and LYRASIS CEO Town Hall Meeting -- April 21, 2016
 
How to Get Started Tracking Scholarly Activity with VIVO and SHARE
How to Get Started Tracking Scholarly Activity with VIVO and SHAREHow to Get Started Tracking Scholarly Activity with VIVO and SHARE
How to Get Started Tracking Scholarly Activity with VIVO and SHARE
 
3.11.16 Slides, “Institutional Perspectives on the Impact of SHARE and VIVO T...
3.11.16 Slides, “Institutional Perspectives on the Impact of SHARE and VIVO T...3.11.16 Slides, “Institutional Perspectives on the Impact of SHARE and VIVO T...
3.11.16 Slides, “Institutional Perspectives on the Impact of SHARE and VIVO T...
 
2.24.16 Slides, “VIVO plus SHARE: Closing the Loop on Tracking Scholarly Acti...
2.24.16 Slides, “VIVO plus SHARE: Closing the Loop on Tracking Scholarly Acti...2.24.16 Slides, “VIVO plus SHARE: Closing the Loop on Tracking Scholarly Acti...
2.24.16 Slides, “VIVO plus SHARE: Closing the Loop on Tracking Scholarly Acti...
 
The "Cloud" Services - DuraCloud and DuraCloud Vault
The "Cloud" Services - DuraCloud and DuraCloud VaultThe "Cloud" Services - DuraCloud and DuraCloud Vault
The "Cloud" Services - DuraCloud and DuraCloud Vault
 

Recently uploaded

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 

Recently uploaded (20)

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 

10-31-13 “Researcher Perspectives of Data Curation” Presentation Slides

  • 1. Hot Topics Web Seminar Series: Research Data in Repositories The UC San Diego Experience Third Webinar: The Researcher Perspective
  • 2. Reminder: General Series Info • First webinar: Intro and Framing: UC San Diego decisions and planning • Second Webinar: Deep dive into technology and metadata • Third Webinar: The perspective from researchers, next steps
  • 3. Reminder: General Series Info Slides and presentations from previous webinars are available for download! http://www.duraspace.org/hot-topics
  • 4. Your esteemed presenters … First webinar: David Minor – Program Director, Research Data Curation Declan Fleming - Chief Technology Strategist Second webinar: Declan Fleming - Chief Technology Strategist Arwen Hutt - Metadata Librarian Matt Critchlow - Manager of Development and Web Services Third webinar: David Minor – Program Director, Research Data Curation Dick Norris – Professor, Scripps Institution of Oceanography Rick Wagner – Data Scientist at San Diego Supercomputer Center
  • 5. Today we will … Discuss how researchers have approached curation and data management
  • 6. Reminder: UCSD Research Data Curation Pilots • The Brain Observatory • NSF OpenTopography Facility • Levantine Archaeology Laboratory • Scripps Institution of Oceanography Geological Collections • The Laboratory for Computational Astrophysics
  • 7. Reminder: UCSD Research Data Curation Pilots • The Brain Observatory • NSF OpenTopography Facility • Levantine Archaeology Laboratory • Scripps Institution of Oceanography Geological Collections • The Laboratory for Computational Astrophysics
  • 8. Richard Norris Professor at Scripps Institution of Oceanography
  • 9. Rick Wagner High Performance Computing Manager at the San Diego Supercomputer Center Ph.D. Candidate within The Laboratory for Computational Astrophysics
  • 10. SIO Geological Collections • Curator: Dick Norris • Collections Manager: Alexandra Hangsterfer • Part of the International Marine and Lacustrine Geological Collections • With collections at Columbia, Oregon State, Woods Hole, USGS and more
  • 11. Our Collection: Sediment cores and rocks recovered from the oceans & long-lived lakes Reef sediment-Panama Salton Sea-CA
  • 12. How we get them…. Mostly by Sea (Ship, Cruise, Leg) But also by Land Country, Locality, Lat/Long
  • 13. Collection events Recovering a Gravity Core to collect seafloor sediments Deploying a Dredge to collect seafloor rocks
  • 14. A collection event is an Object and includes: • Specimen(s) • Latitude/Longitude • Ship name and cruise number • Text descriptions • Thin-sections • Images, field notes, publications • Location in the repository • International Geological Sample Number
  • 15. The Sediment Core Collection Archive and Working halves of ~7000 cores from the world’s oceans Typically 3-5 sections/core + core photos, chemical data and sampling history The IODP Core collection, Bremen Germany
  • 16. The Marine Rock collection… • ~4000 dredge sites worldwide • In an 8000 sq ft building • Volcanic rocks, manganese nodules, reef rock
  • 17. Our data resides with NGDC… • NOAA’s National Geophysical Data Center • And IGSNs with Lamont’s SESAR
  • 18. NGDC searches on ships, repositories, sampling systems, and locations But no keyword search, automated data input, ways to link associated data, returns on nearest search terms, sampling history, etc….
  • 19. What the Community Wants • A unified National geo-referenced system • Exploratory search by nearest word and map-based system • Links to associated data types (images, text, data, references…) • All data types linked by IGSNs • Data entry through web forms with publication by curators
  • 20. What we did with RCI • Identified one type of object – Based in sampling events – Ship-Cruise-Sampling device-Sample number – Geo-referenced – Includes associated materials: text description, images, chemical data, references, records of sampling event, sampling records, storage location • NGDC records imported into UC Library system • Records searchable by any word in a record
  • 21. What’s next? • NSF-sponsored SESAR (System for Earth Sample Registration) – Created the International GeoSample Number – http://www.geosamples.org/ • NSF-sponsored workshop: – Digital Environment for Sample Curation (June 2013) – http://www.geosamples.org/news/descwebinarmaterials • NSF “EarthCube” initiative
  • 22. CyberInfrastructure needs (from DESC) • Offline data entry at sea or in the field • DESC should respect data moratoriums (typically 2 years, if collected with NSF grants) • Automated release to public at close of moratorium • Secure login-based data serving for project scientists • Flexible search and access for users to view public archive (view by location name, type, bounding region) and associated data • Flexible sample request submission
  • 23. More cyberInfrastructure needs • Display stored datasets and images hosted on other servers (as in other repositories) • Connections with Standard Visualization Tools Such as Corelyzer, Correlator, PSICAT, CoreRef, GMT, GeoMapApp • Sampling database should be easily accessible by researchers to submit requests • Automatically updated by repository (personnel) to reflect samples sent to the researchers • Way of entering historical sampling information
  • 24. These are general issues for Natural History Collections • Most museums have similar issues to us – Geo-Referenced collections – Mix of physical specimens, images, text descriptions, sampling data, and affiliated data files – Many have home-grown databases that are not interoperable with other museums Fish from the SIO Marine Vertebrates Collection
  • 25. Natural History Collections • Need controlled vocabularies but flexibility to search on variants – Since nobody agrees on common vocabularies • Value in cross-referencing to related collections – Such as samples (geology, biology, water) collected on a cruise with ship track, sea floor maps… – Presently working on “Rolling deck to Repository” NSF project
  • 26. LCA PI: Mike Norman Current and Past Students: Many
  • 27. Research group focusing on numerical modeling of complex astrophysical processes: cosmology, galaxy formation, turbulence, radiation hydrodynamics, magneto-hydrodynamics, … Image credit: NASA, IoA, A. Fabian et al.
  • 28. Our simulations are large, based on the current definition of “large” (we grow with the technology). Typical results are 1-100 TB.
  • 29. This work is costly in terms of both the computer time and human effort, and we see a benefit to the science community in sharing. (Citations are nice, too.) http://bit.ly/sB30f1 http://bit.ly/IzTVV2 http://bit.ly/IE4iFd http://bit.ly/HFYLQJ
  • 30. Prior Sharing Efforts Participation in the Virtual Observatory • Standards for simulation metadata, search, and retrieval • An odd fit beside the “pure” astronomy projects and data centers • But, it meant we weren’t starting from scratch in terms of describing our data Started the curation effort very curious about how much of this previous work would translate to library space Also wanted stable platform for data hosting (e.g., not a closet server)
  • 31. Curation Process Several steps: • Choosing the pilot dataset • Cleaning up simulation cruft • Identifying related publications • Adding historical documents (proposals, reports, etc.) • Organizing various data groups • Simulations are a collection of datasets from various points in time, so we needed a description for each type of digital object in each dataset • Bundle, checksum, and handoff Decided near the end to replicate the metadata record to a second site as a test of its portability. Image credit: E.gordienko (Own work) [CC-BY-SA-3.0 (http://creativecommons.org/licenses/by-sa/3.0) or GFDL (http://www.gnu.org/copyleft/fdl.html)], via Wikimedia Commons
  • 33. Final result: • Datasets from a high-resolution cosmology simulation held at UCSD • Viewable both at UCSD, and via the Online Archive of California • Raw simulation data and various analysis results accessible over HTTP
  • 34. Some thoughts: • When it comes to metadata formats, libraries are like any other science domain and speak their own language • If you have a highly-specialized domain-specific metadata dialect or language, you may need an additional discovery service • If not, it’s a good starting point • We’re working on repeating this process on our own for another simulation
  • 35. Next steps at UC San Diego Move from pilot services to a scalable series of processes. Work with additional researchers in same domains. Work with new domains. Broaden lifecycle management mindset on campus.
  • 36. Questions? Rick Wagner - rpwagner@sdsc.edu Richard Norris - rnorris@ucsd.edu David Minor - dminor@ucsd.edu http://www.duraspace.org/hot-topics
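
The "Bundle, checksum, and handoff" step on slide 31 is not described in detail in the slides. As a rough illustration only, one way to do the checksum part is to record a digest for every file before handoff and re-check the digests on the receiving side. The sketch below assumes SHA-256 and a plain directory tree; the function names (`sha256_of`, `build_manifest`, `verify_manifest`) are hypothetical and are not taken from the UCSD or LCA workflow.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so multi-gigabyte outputs never sit in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(root: Path) -> dict[str, str]:
    """Map each file's path (relative to root, POSIX style) to its digest."""
    return {
        p.relative_to(root).as_posix(): sha256_of(p)
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }


def verify_manifest(root: Path, manifest: dict[str, str]) -> list[str]:
    """Return the relative paths that are missing or whose digest differs."""
    problems = []
    for rel, expected in manifest.items():
        target = root / rel
        if not target.is_file() or sha256_of(target) != expected:
            problems.append(rel)
    return problems
```

In this sketch the data producer would call `build_manifest` on the bundle before transfer, and the receiving repository would call `verify_manifest` on its copy; an empty list means every file arrived intact.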