1. Carly
Strasser
California
Digital
Library
@carlystrasser
April
2013
@
DataUp:
Helping
manage
&
archive
data
From
Flickr
by
Spatial
Mongrel
2. C.
Strasser
C.
Strasser
C.
Strasser
C.
Strasser
Courtesy
of
WHOI
3. Why
don’t
people
share
data?
Is
data
management
being
taught?
Do
attitudes
about
sharing
differ
among
disciplines?
What
role
can
libraries
play
in
data
education?
How
can
we
promote
storing
data
in
repositories?
What
barriers
to
sharing
can
we
eliminate?
4.
5. Why
is
data
management
a
hot
topic?
From
Flickr
by
Velo
Steve
6. Back in the day…
Da
Vinci
Curie
Newton
classicalschool.blogspot.com
Darwin
7. Digital
data
From
Flickr
by
Flickmor
From
Flickr
by
US
Army
Environmental
Command
From
Flickr
by
DW0825
C.
Strasser
Courtesey
of
WHOI
From
Flickr
by
deltaMike
22. Asked
~200
scientists
What
does
your
data
look
like?
How
do
you
capture
metadata?
Plans
for
saving
&
sharing
data?
Repositories?
23. What
the
tool
should
do:
Best
practices
check
Generate
metadata
(EML)
Get
identifier
+
citation
Post
data
to
repository
From
Flickr
by
Rennett
Stowe
24. Open
Source
Tool
Add-‐in
&
Web
Application
csv
&
xlsx
dataup.cdlib.org
Free
?
25. Add-‐in
• Software
you
download
&
install
• Appears
as
“ribbon”
in
Excel
• Works
for
Windows
Excel
2007+
Web-‐based
application
• Upload
file
to
website
• Works
for
any
platform
• But…
new
user
interface
VS
26. DataUp
Features
Best
practices
check
Generate
metadata
Get
identifier
&
citation
Post
data
to
repository
From
Flickr
by
SoulRider.222
27. Best
Practices
Check
• Embedded
charts,
tables,
pictures
• Embedded
comments
• Commas
• Special
characters
• Color-‐coded
text
&
cell
shading
• Columns
with
mixed
data
types
• Non-‐contiguous
data
• Merged
cells
• Blank
cells
• No
header
row
• Multiple
sheets
From
Flickr
by
ex.libris
28. DataUp
Features
Best
practices
check
Generate
metadata
Get
identifier
&
citation
Post
data
to
repository
From
Flickr
by
SoulRider.222
29. • Digital
context
• Name
of
the
data
set
• The
name(s)
of
the
data
file(s)
in
the
data
set
• Date
the
data
set
was
last
modified
• Example
data
file
records
for
each
data
type
file
• Pertinent
companion
files
• List
of
related
or
ancillary
data
sets
• Software
(including
version
number)
used
to
prepare/read
the
data
set
• Data
processing
that
was
performed
• Personnel
&
stakeholders
• Who
collected
• Who
to
contact
with
questions
• Funders
• Scientific
context
• Scientific
reason
why
the
data
were
collected
• What
data
were
collected
• What
instruments
(including
model
&
serial
number)
were
used
• Environmental
conditions
during
collection
• Where
collected
&
spatial
resolution
When
collected
&
temporal
resolution
• Standards
or
calibrations
used
• Information
about
parameters
• How
each
was
measured
or
produced
• Units
of
measure
• Format
used
in
the
data
set
• Precision
&
accuracy
if
known
• Information
about
data
• Definitions
of
codes
used
• Quality
assurance
&
control
measures
• Known
problems
that
limit
data
use
(e.g.
uncertainty,
sampling
problems)
• How
to
cite
the
data
set
Holy
Metadata!
30. ~45
elements
included
7
required
Creator
details
Title
Date
Keywords
Abstract
File-‐level
Metadata
31. Name
Definition
Type
(text,
date/time,
numeric)
Unit
Location
(sheet)
Attribute
Metadata
32. DataUp
Features
Best
practices
check
Generate
metadata
Get
identifier
&
citation
Post
data
to
repository
From
Flickr
by
SoulRider.222
33. Identifier
+
Citation
Allows
readers
to
find
data
products
Get
credit
for
data
and
publications
Promotes
reproducibility
Better
measure
of
research
impact
Example:
Sidlauskas,
B.
2007.
Data
from:
Testing
for
unequal
rates
of
morphological
diversification
in
the
absence
of
a
detailed
phylogeny:
a
case
study
from
characiform
fishes.
Dryad
Digital
Repository.
doi:10.5061/dryad.20
Persistent
Unique
Identifier
From
Flickr
by
maybeemily
34. DataUp
Features
Best
practices
check
Generate
metadata
Get
identifier
&
citation
Post
data
to
repository
From
Flickr
by
SoulRider.222
46. Add-‐In
• Windows
PC
2007+
• No
log-‐in
required
• Offline
&
online
• Can
view
metadata
via
tab
• See
check
alongside
data
• Select
header
row
Web
app
• Any
platform
• Log-‐in
required
• Online
only
• Can’t
view
metadata
once
generated
• Get
locations
for
check
• Manual
header
row
entry
VS
50. • New
language
• Focus
on
web
app
• Emphasize
Best
Practices
Check
• Leverage
existing
tools
• Enable
Customization
From
animationresources.org
51. dataup.cdlib.org
bitbucket.org/dataup/main
Website
Code
site
My
website
Email
me
Tweet
me
My
slides
CDL
Blog
carlystrasser.net
carlystrasser@gmail.com
@carlystrasser
slideshare.net/carlystrasser
datapub.cdlib.org