SlideShare a Scribd company logo
1 of 56
http://www.niso.org/news/events/2011/nisowebinars/semanticweb/



   Managing Data for
Scholarly Communications
   PART 2: Technical Management

           October 19, 2011

Speakers: Joan Starr, Mark McFarland,
       and MacKenzie Smith
Dataset	
  Iden*fica*on	
  &	
  Cita*on:	
  
             DataCite	
  and	
  EZID	
  


                     Joan	
  Starr	
  
            California	
  Digital	
  Library	
  
                  October,	
  2011	
  
Dataset	
  Iden*fica*on	
  &	
  Cita*on	
  
Introduc*on	
  
The	
  Researchers’	
  Challenge	
  
           Iden*fiers	
  are	
  a	
  tool	
  for	
  researchers	
  
DataCite	
  
           “Helping	
  you	
  find,	
  access	
  and	
  reuse	
  data.”	
  
EZID	
  
           Easy	
  crea*on	
  and	
  management	
  of	
  DataCite	
  DOIs	
  and	
  other	
  
              iden*fiers.	
  
Next	
  steps	
  
 	
   	
  For	
  DataCite,	
  EZID	
  and	
  you!	
  
California	
  Digital	
  Library	
  (CDL)	
  
The	
  Researchers’	
  Challenge	
  
Early	
  in	
  the	
  research	
  life	
  cycle	
  
Data-­‐intensive	
  research	
                                              +	
                 Wri*ng	
  up	
  the	
  results	
  



                          Where’s	
  
                          the	
  data?	
                              What	
  if	
  	
  I	
  
                                                                      move	
  it?	
  


                                                                                                    PERSISTENT	
  IDENTIFIERS	
  
                                                                                                      make	
  the	
  difference	
  


        by	
  Dave	
  Rogers	
  hWp://www.flickr.com/photos/dave-­‐rogers/2815036285/	
  
Working	
  on	
  a	
  federated	
  team	
  
           Data-­‐intensive	
  research	
  
                                                                                                           +	
   Regional	
  research	
  center	
  
                                                                                                           +	
   Aging	
  infrastructure	
  
                                                Where’s	
  
                                                                                                                            We	
  have	
  to	
  
                                                the	
  data?	
  
                                                                                                                            move	
  it!	
  



                                                                                                                                          PERSISTENT	
  IDENTIFIERS	
  
                                                                                                                                            make	
  the	
  difference	
  


©All	
  rights	
  reserved	
  by	
  University	
  of	
  California,	
  hWp://www.flickr.com/photos/universityofcalifornia/5405812887	
  
Making	
  a	
  career	
  move	
  
•  Data-­‐intensive	
  research	
                                                 +	
     •  Researcher(s)	
  on	
  the	
  
                                                                                             move	
  


                           I	
  know	
  
                           where	
  my	
  
                           data	
  is	
                  and	
  I’m	
  
                                                         taking	
  it	
  
                                                         with	
  me!	
  
                                                                                           PERSISTENT	
  IDENTIFIERS	
  
                                                                                             make	
  the	
  difference	
  



    ©All	
  rights	
  reserved	
  by	
  University	
  of	
  California,	
  	
  
    hWp://www.flickr.com/photos/universityofcalifornia/5406308654	
  
Mee*ng	
  funder	
  requirements	
  
•  Data-­‐intensive	
  research	
                                      +	
                   •  Grantor	
  requirements	
  
                                                                                                for	
  data	
  management	
  
         What	
  do	
  we	
                                                                     plan	
  
         put	
  here?	
  
                                                                               How	
  do	
  we	
  
                                                                               track	
  the	
  data?	
  




                                                                                                   PERSISTENT	
  IDENTIFIERS	
  
                                                                                                     make	
  the	
  difference	
  


    By	
  David	
  Mellis,	
  hWp://www.flickr.com/photos/mellis/7675610/	
  
DataCite	
  
German	
  Na8onal	
  Library	
  of	
  Economics	
  (ZBW)	
  	
  	
                            Canada	
  Ins8tute	
  for	
  Scien8fic	
  and	
  Technical	
  Informa8on	
  
German	
  Na8onal	
  Library	
  of	
  Science	
  and	
  Technology	
  (TIB)	
  	
                    (CISTI)	
  

German	
  Na8onal	
  Library	
  of	
  Medicine	
  (ZB	
  MED)	
                               Technical	
  Informa8on	
  Center	
  of	
  Denmark	
  

GESIS	
  -­‐	
  Leibniz	
  Ins8tute	
  for	
  the	
  Social	
  Sciences,	
  Germany	
  	
     Ins8tute	
  for	
  Scien8fic	
  &	
  Technical	
  Informa8on	
  (INIST-­‐

Australian	
  Na8onal	
  Data	
  Service	
  (ANDS)	
                                                 CNRS),	
  France	
  	
  

ETH	
  Zurich,	
  Switzerland	
                                                               TU	
  DelS	
  Library,	
  The	
  Netherlands	
  	
  
                                                                                              The	
  Swedish	
  Na8onal	
  Data	
  Service	
  (SNDS)	
  

                                                                                              The	
  Bri8sh	
  Library	
  ,	
  UK	
  

                                                                                              California	
  Digital	
  Library	
  (CDL),	
  USA	
  	
  

                                                                                              Office	
  of	
  Scien8fic	
  &	
  Technical	
  Informa8on	
  (OSTI),	
  USA	
  	
  

                                                                                              Purdue	
  University	
  Library	
  
DataCite	
  Metadata	
  V.	
  2.2	
  
•  Small	
  required	
  set	
  =	
  cita*on	
  elements	
  
•  Op*onal	
  descrip*ve	
  set:	
  
    –  extendable	
  lists	
  
    –  can	
  refer	
  to	
  other	
  standards,	
  schemes	
  
    –  domain-­‐neutral	
  
    –  rich	
  ability	
  to	
  describe	
  rela*onships	
  to	
  other	
  
       digital	
  objects	
  
•  Metadata	
  Search	
  (MDS)	
  is	
  full-­‐text	
  indexed	
  	
  
DataCite	
  Metadata	
  V.	
  2.2	
  
  Required	
  proper8es	
                                         Op8onal	
  proper8es	
  

1.    Iden8fier	
  (with	
  type	
  aWribute)	
             6.     Subject	
  (with	
  schema	
  aWribute)	
  
2.    Creator	
  (with	
  name	
  iden*fier	
               7.     Contributor	
  (with	
  type	
  &	
  name	
  iden*fier	
  
       aWributes)	
                                               aWributes)	
  
3.    Title	
  (with	
  op*onal	
  type	
  aWribute)	
     8.     Date	
  (with	
  type	
  aWribute)	
  
4.    Publisher	
                                          9.     Language	
  	
  	
  
5.    Publica8onYear	
                                     10.    ResourceType	
  (with	
  descrip*on	
  aWribute)	
  
                                                           11.    AlternateIden*fier	
  (with	
  type	
  aWribute)	
  
                                                           12.    RelatedIden*fier	
  (with	
  type	
  &rela*on	
  
                                                                  type	
  aWributes)	
  
                                                           13.    Size	
  	
  	
  
                                                           14.    Format	
  	
  	
  
                                                           15.    Version	
  
                                                           16.    Rights	
  
                                                           17.    Descrip*on	
  (with	
  type	
  aWribute)	
  
•    Get	
  iden*fiers	
  
•    Add	
  loca*on	
  
•    Add	
  metadata	
  
•    Update	
  loca*on	
  
•    Update	
  metadata	
  
hWp://n2t.net/ezid	
  
hWp://n2t.net/ezid	
  
hWp://n2t.net/ezid	
  
hWp://n2t.net/ezid	
  
hWp://n2t.net/ezid	
  
hWp://n2t.net/ezid	
  
hWp://n2t.net/ezid	
  
What	
  this	
  means…	
  
What	
  this	
  means…	
  
Next	
  Steps	
  
DataCite	
  
• 	
  Dublin	
  Core	
  applica*on	
  profile	
  
• 	
  Content	
  Service	
  
• 	
  Metadata	
  v.	
  2.3	
  
EZID	
  
• UI	
  redesign	
  
• Automated	
  link	
  checking	
  
• Exposure	
  for	
  cita*ons	
  


                                         By	
  Nicola	
  Whitaker	
  hWp://www.flickr.com/photos/nicolawhitaker/111009156/	
  
Next	
  Steps	
  for	
  you	
  
•  Get	
  more	
  informa*on,	
  and	
  
•  Try	
  EZID	
  for	
  yourself!	
  




                    By	
  Nicola	
  Whitaker	
  hWp://www.flickr.com/photos/nicolawhitaker/111009156/	
  
For	
  more	
  informa*on	
  
EZID	
  
EZID	
  applica*on:	
  hWp://n2t.net/ezid/	
  	
  
EZID	
  website:	
  hWp://www.cdlib.org/services/uc3/ezid/	
  
UC3	
  website:	
  hWp://www.cdlib.org/services/uc3/	
  


DataCite	
  
DataCite	
  Home:	
  hWp://datacite.org/	
  
DataCite	
  Metadata	
  Schema:	
  
   hWp://schema.datacite.org/meta/kernel-­‐2.2/index.html	
  
DataCite	
  Metadata	
  Search:	
  hWp://search.datacite.org	
  



Contact	
  Joan	
  Starr	
  at	
  uc3@ucop.edu	
  
Ques*ons?	
  




 by	
  Horia	
  Varlan	
  	
  
 hWp://www.flickr.com/photos/horiavarlan/4273168957/in/photostream/	
  
Digital	
  Library	
  Services	
  in	
  the	
  Cloud	
  
                         Mark	
  McFarland	
  
                Director,	
  Texas	
  Digital	
  Library	
  
Outline	
  

•       Who:	
  Texas	
  Digital	
  Library	
  
•       Where:	
  on	
  the	
  cloud	
  
•       Why:	
  mo*va*ons	
  
•       When:	
  late	
  2010	
  
•       What:	
  lessons	
  learned	
  



June	
  2011	
                                    30	
  
Who:	
  Texas	
  Digital	
  Library	
  

•  Consor*um	
  of	
  higher	
  educa*on	
  ins*tu*ons	
  in	
  Texas	
  
•  Current	
  services	
  include:	
  
           –  Ins*tu*on:	
  IR	
  (DSpace),	
  ETD	
  system	
  
           –  Faculty:	
  OJS,	
  OCS,	
  blogs,	
  wikis	
  
           –  Approximately	
  70	
  customer-­‐facing	
  service	
  instances	
  
•  Legacy	
  hardware	
  included	
  
           –  Compute	
  servers	
  
           –  Storage	
  servers	
  
           –  Network	
  support	
  devices	
  

June	
  2011	
                                                                       31	
  
Where:	
  on	
  the	
  cloud	
  

•  Migrated	
  customer-­‐facing	
  services	
  to	
  AWS	
  
           –  50	
  AWS	
  VM	
  instances	
  
•  Maintained	
  some	
  services	
  on	
  local	
  hardware	
  
•  Simplified	
  and	
  consolidated	
  system	
  
   architecture	
  




June	
  2011	
                                                     32	
  
Why:	
  mo*va*ons	
  /	
  When:	
  late	
  2010	
  

•  Disaster	
  recovery	
  plan	
  
           –  Prepare	
  for	
  data	
  center	
  move	
  
•  Elas*c	
  capacity	
  
           –  New	
  members,	
  collec*ons	
  
•  Personnel	
  savings	
  
           –  Fewer	
  competencies,	
  responsibili*es	
  
•  Began	
  Oct	
  2010	
  

June	
  2011	
                                                33	
  
What:	
  lessons	
  learned	
  

•  The	
  Good	
  
           –  Elas*c	
  capacity;	
  customers	
  did	
  not	
  no*ce	
  change	
  
           –  No	
  hardware	
  purchase	
  cycle	
  
•  The	
  Mixed	
  
           –  Lower	
  personnel	
  costs;	
  failover	
  
•  The	
  Unexpected	
  
           –  Development	
  tools;	
  concerns	
  about	
  AWS	
  being	
  in	
  
              U.S.;	
  excellent	
  management	
  console	
  
June	
  2011	
                                                                    34	
  
Future	
  

•  Preserva*on	
  
           –  DuraCloud	
  
•  Con*nue	
  to	
  evaluate	
  
           –  AWS	
  is	
  flexible	
  and	
  feature	
  rich,	
  but	
  may	
  s*ll	
  not	
  
              be	
  cost	
  effec*ve	
  




June	
  2011	
                                                                                   35	
  
For	
  more	
  informa*on	
  about	
  the	
  TDL,	
  please	
  visit	
  the	
  Texas	
  
         Digital	
  Library	
  website	
  at	
  hWp://www.tdl.org	
  	
  
                               or	
  contact	
  us	
  at	
  	
  
                                info@tdl.org.	
  	
  
Data	
  Governance	
  and	
  
                             Legal	
  Interoperability	
  


                        MacKenzie	
  Smith,	
  Science	
  Fellow	
  


©	
  Crea*ve	
  Commons,	
  2011.	
  This	
  work	
  is	
  licensed	
  under	
  a	
  Crea*ve	
  Commons	
  AWribu*on	
  3.0	
  United	
  States	
  License.	
  
Why	
  Data	
  Sharing	
  is	
  Good	
  	
  
•  research	
  reproducibility	
  
•  fiscal	
  responsibility	
  
•  broadest	
  possible	
  impact	
  
•  large-­‐scale	
  data	
  interoperability	
  
   –  Includes	
  technical,	
  social,	
  legal	
  and	
  policy	
  aspects	
  
   –  usual	
  focus	
  on	
  technical/social	
  
   –  focus	
  here	
  on	
  legal/policy	
  aspects	
  
Why	
  Data	
  Sharing	
  is	
  Hard	
  
•  No	
  incen*ves	
  to	
  improve	
  data	
  quality,	
  provide	
  
   missing	
  documenta*on	
  
•  Confiden*ality	
  and	
  privacy	
  concerns	
  
    (e.g.	
  HIPAA,	
  endangered	
  species)	
  
•  Patents	
  and	
  commercial	
  poten*al	
  
•  Closed	
  Access	
  to	
  journal	
  ar*cles	
  (i.e.	
  results)	
  
•  IP	
  issues	
  very	
  complicated	
  
Defini*ons	
  
Data	
  governance	
  is	
  the	
  system	
  of	
  decision	
  rights	
  and	
  
  accountabili8es	
  that	
  describe	
  who	
  can	
  take	
  what	
  ac8ons	
  
  with	
  what	
  data,	
  and	
  when,	
  under	
  what	
  circumstances,	
  using	
  
  what	
  methods	
  
•  strategies	
  for	
  data	
  quality	
  control	
  and	
  management,	
  and	
  processes	
  that	
  
   insure	
  important	
  data	
  assets	
  are	
  formally	
  managed	
  throughout	
  an	
  
   organiza*on;	
  
      –  organiza*ons	
  can	
  be	
  legal	
  en**es	
  like	
  universi*es,	
  or	
  virtual	
  organiza=ons	
  
         (e.g.	
  distributed	
  research	
  collabora*ons)	
  
      –  Includes	
  business	
  processes	
  and	
  risk	
  management;	
  
•  laws	
  and	
  policies	
  associated	
  with	
  data;	
  
•  ensures	
  that	
  data	
  can	
  be	
  trusted	
  and	
  that	
  people	
  are	
  accountable	
  for	
  
   ac*ons	
  affec*ng	
  the	
  data	
  
Defini*ons	
  
•  A"ribu'on	
  is	
  legally-­‐imposed,	
  remedy	
  is	
  lawsuit	
  
•  Credit	
  is	
  what	
  researchers	
  want	
  	
  
•  Cita'on	
  is	
  the	
  norm	
  in	
  scholarly	
  communica*on,	
  
   to	
  provide	
  suppor*ng	
  evidence,	
  now	
  proxy	
  for	
  
   credit	
  
AWribu*on	
  does	
  not	
  insure	
  credit	
  or	
  cita*on.	
  	
  
Legal	
  Mechanisms	
  for	
  Sharing	
  Data	
  

1. 	
  licenses	
  
                         Require	
  aWribu*on	
  
2. 	
  contracts	
  

3. 	
  waivers	
  	
     No	
  aWribu*on	
  
                          requirement	
  
Copyright	
  for	
  Data	
  
•  Does	
  not	
  apply	
  to	
  facts,	
  e.g.,	
  most	
  scien*fic	
  
   data	
  

•  Can	
  apply	
  to	
  a	
  collec=on	
  of	
  facts,	
  but	
  only	
  to	
  
   original	
  aspects,	
  not	
  facts	
  themselves	
  

•  Can	
  extract	
  facts	
  from	
  a	
  copyrighted	
  database	
  
   without	
  infringing	
  
Licenses	
  
•  Licenses	
  are	
  not	
  contracts	
  
    –  depend	
  on	
  underlying	
  rights,	
  e.g.	
  copyright	
  or	
  sui	
  generis	
  
       rights	
  
    –  Copyright	
  is	
  a	
  bundle	
  of	
  rights,	
  automa*c	
  when	
  fixed,	
  
       limited	
  in	
  scope	
  and	
  dura*on	
  


•  US	
  and	
  EU	
  differ	
  (EU	
  has	
  sui	
  generis	
  data	
  rights)	
  
   so	
  different	
  licenses	
  cover	
  copyright,	
  sui	
  generis	
  
   rights,	
  or	
  both	
  
Licenses	
  
•  Crea*ve	
  Commons	
  (CC-­‐BY)	
  example	
  

   –  applies	
  to	
  data	
  and	
  databases	
  to	
  the	
  extent	
  they’re	
  
      copyrightable	
  

   –  Only	
  data	
  uses	
  that	
  implicate	
  copyright	
  trigger	
  
      aWribu*on	
  requirement	
  

   –  uses	
  of	
  data	
  that	
  do	
  not	
  implicate	
  copyright,	
  e.g.	
  is	
  in	
  
      the	
  public	
  domain,	
  do	
  not	
  trigger	
  aWribu*on	
  
Licenses	
  
•  Hard	
  to	
  assess	
  copyright	
  for	
  par*cular	
  data	
  
   and	
  databases	
  

•  Hard	
  to	
  know	
  when	
  license	
  applies,	
  creates	
  
   risks:	
  
    –  data	
  provider	
  be	
  misled	
  
    –  data	
  user	
  will	
  under	
  or	
  over	
  comply	
  
Licenses	
  
•  AWribu*on	
  requirements	
  are	
  inflexible,	
  
   causing	
  absurd	
  situa*ons	
  
       –  e.g.	
  providing	
  aWribu*on	
  to	
  1,000	
  providers	
  	
  	
  
          in	
  1,000	
  different	
  ways	
  
       –  known	
  as	
  ‘aWribu*on	
  stacking’	
  	
  

•  Could	
  provide	
  aWribu*on	
  and	
  s*ll	
  not	
  sa*sfy	
  
   norms	
  or	
  expecta*ons	
  
Contracts	
  
Contracts	
  
•  Do	
  not	
  require	
  underlying	
  right	
  	
  
    –  rely	
  on	
  offer/acceptance,	
  click	
  through,	
  terms	
  of	
  use	
  
    –  require	
  formali*es,	
  e.g.	
  aWribu*on	
  

•  Downsides	
  
    –  confusing	
  obliga*ons,	
  no	
  standardiza*on,	
  each	
  user	
  
       agreement	
  can	
  have	
  different	
  requirements	
  

•  Researchers	
  may	
  avoid	
  data	
  if	
  they	
  can’t	
  
   understand	
  the	
  terms	
  of	
  use	
  
Contracts	
  
Unlike	
  licenses,	
  contracts	
  only	
  binds	
  par=es	
  

•  If	
  someone	
  obtains	
  licensed	
  data	
  and	
  shares	
  it,	
  anyone	
  
   who	
  obtains	
  data	
  from	
  that	
  user	
  is	
  s*ll	
  bound	
  by	
  the	
  
   license	
  

•  If	
  data	
  had	
  been	
  shared	
  by	
  contract,	
  anyone	
  obtaining	
  
   data	
  from	
  the	
  second	
  party	
  is	
  not	
  bound	
  by	
  the	
  
   contract	
  since	
  they	
  aren’t	
  a	
  party	
  to	
  the	
  contract	
  

•  In	
  this	
  respect,	
  contracts	
  are	
  more	
  limited	
  than	
  licenses	
  
Contracts	
  
•  Have	
  broader	
  reach	
  than	
  licenses	
  
   –  not	
  *ed	
  to	
  a	
  legal	
  right	
  
   –  can	
  take	
  away	
  rights	
  of	
  public	
  
Example	
  
Waivers	
  
•  Provide	
  legal	
  certainty	
  
    –  No	
  need	
  to	
  decipher	
  copyright	
  protec*on	
  or	
  six	
  through	
  confusing	
  
       legalese	
  
    –  BeWer	
  than	
  silence,	
  to	
  avoid	
  forcing	
  people	
  to	
  guess	
  what	
  their	
  risks	
  
       are	
  	
  

•  Mean	
  loss	
  of	
  control	
  
    –  Can’t	
  require	
  aWribu*on	
  or	
  other	
  terms	
  

•  Avoid	
  problems	
  and	
  rely	
  on	
  scholarly	
  norms	
  
    –  no	
  aWribu*on	
  stacking	
  or	
  inappropriate	
  obliga*ons	
  
3	
  levels:	
  Waiver,	
  Fall-­‐back	
  license,	
  Non-­‐asser*on	
  pledge	
  
Summary	
  
•  Law	
  is	
  messy,	
  each	
  approach	
  has	
  consequences	
  

•  Licenses	
  –	
  (1)	
  legal	
  uncertainty	
  about	
  scope,	
  (2)	
  
   requirements	
  can	
  be	
  inconsistent	
  with	
  norms	
  

•  Contracts	
  –	
  (1)	
  burdensome	
  requirements	
  with	
  custom	
  
   terms,	
  (2)	
  exceed	
  scope	
  of	
  rights	
  with	
  requirements	
  that	
  
   take	
  away	
  normal	
  rights	
  

•  Waivers	
  –	
  (1)	
  avoid	
  problems,	
  but	
  (2)	
  lose	
  control	
  and	
  
   rely	
  on	
  norms	
  
Summary	
  
•  Each	
  approach	
  requires	
  loss	
  of	
  control	
  

•  No	
  mechanism	
  imposes	
  legally-­‐binding	
  obliga*ons	
  in	
  
   way	
  that	
  perfectly	
  maps	
  to	
  scholarly	
  credit,	
  e.g.	
  
   cita*on	
  

•  Ideal	
  solu*on	
  creates	
  the	
  least	
  fric*on	
  to	
  scien*fic	
  
   progress	
  while	
  giving	
  credit	
  where	
  due,	
  i.e.,	
  waivers	
  
   and	
  norms	
  (the	
  community	
  governs	
  itself)	
  

More Related Content

Similar to NISO Webinar: Part 2: Managing Data for Scholarly Communications

Carmen O'Dell and Barbara Sen JIBS-RLUK event July 2012
Carmen O'Dell and Barbara Sen JIBS-RLUK event July 2012Carmen O'Dell and Barbara Sen JIBS-RLUK event July 2012
Carmen O'Dell and Barbara Sen JIBS-RLUK event July 2012sherif user group
 
3 tu.dc 5min nordbib jp rombouts
3 tu.dc 5min nordbib jp rombouts3 tu.dc 5min nordbib jp rombouts
3 tu.dc 5min nordbib jp romboutsJeroen Rombouts
 
Partnering for Research Data
Partnering for Research DataPartnering for Research Data
Partnering for Research DataLiz Lyon
 
Repository Federation: Towards Data Interoperability
Repository Federation: Towards Data InteroperabilityRepository Federation: Towards Data Interoperability
Repository Federation: Towards Data InteroperabilityRobert H. McDonald
 
Data Science: An Emerging Field for Future Jobs
Data Science: An Emerging Field for Future JobsData Science: An Emerging Field for Future Jobs
Data Science: An Emerging Field for Future JobsJian Qin
 
Research Data Management: What is it and why is the Library & Archives Servic...
Research Data Management: What is it and why is the Library & Archives Servic...Research Data Management: What is it and why is the Library & Archives Servic...
Research Data Management: What is it and why is the Library & Archives Servic...GarethKnight
 
Data mining - GDi Techno Solutions
Data mining - GDi Techno SolutionsData mining - GDi Techno Solutions
Data mining - GDi Techno SolutionsGDi Techno Solutions
 
Managing and Sharing Research Data
Managing and Sharing Research DataManaging and Sharing Research Data
Managing and Sharing Research DataMartin Donnelly
 
Libby Bishop, Ethics Of Data Sharing Ncess Jun 09 Final
Libby Bishop, Ethics Of Data Sharing Ncess Jun 09 FinalLibby Bishop, Ethics Of Data Sharing Ncess Jun 09 Final
Libby Bishop, Ethics Of Data Sharing Ncess Jun 09 Finala.carusi
 
Data Management for Citizen Science
Data Management for Citizen ScienceData Management for Citizen Science
Data Management for Citizen ScienceAndrea Wiggins
 
Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Big Data [sorry] & Data Science: What Does a Data Scientist Do?Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Big Data [sorry] & Data Science: What Does a Data Scientist Do?Data Science London
 
Pistoia Alliance debates AI in life science
Pistoia Alliance debates AI in life sciencePistoia Alliance debates AI in life science
Pistoia Alliance debates AI in life sciencePistoia Alliance
 
RDAP13 Jared Lyle: Domain Repositories and Institutional Repositories Partn…
RDAP13 Jared Lyle: Domain Repositories and Institutional Repositories Partn…RDAP13 Jared Lyle: Domain Repositories and Institutional Repositories Partn…
RDAP13 Jared Lyle: Domain Repositories and Institutional Repositories Partn…ASIS&T
 
TNGIC 2011 Keynote Managing Mountains of Data
TNGIC 2011 Keynote Managing Mountains of DataTNGIC 2011 Keynote Managing Mountains of Data
TNGIC 2011 Keynote Managing Mountains of DataZsoltNC
 
Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...
Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...
Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...TERN Australia
 
Will We Command Our Data? From the Petascale to the Personal
Will We Command Our Data?  From the Petascale to the PersonalWill We Command Our Data?  From the Petascale to the Personal
Will We Command Our Data? From the Petascale to the PersonalRichard Akerman
 

Similar to NISO Webinar: Part 2: Managing Data for Scholarly Communications (20)

Carmen O'Dell and Barbara Sen JIBS-RLUK event July 2012
Carmen O'Dell and Barbara Sen JIBS-RLUK event July 2012Carmen O'Dell and Barbara Sen JIBS-RLUK event July 2012
Carmen O'Dell and Barbara Sen JIBS-RLUK event July 2012
 
3 tu.dc 5min nordbib jp rombouts
3 tu.dc 5min nordbib jp rombouts3 tu.dc 5min nordbib jp rombouts
3 tu.dc 5min nordbib jp rombouts
 
Partnering for Research Data
Partnering for Research DataPartnering for Research Data
Partnering for Research Data
 
Repository Federation: Towards Data Interoperability
Repository Federation: Towards Data InteroperabilityRepository Federation: Towards Data Interoperability
Repository Federation: Towards Data Interoperability
 
Data Science: An Emerging Field for Future Jobs
Data Science: An Emerging Field for Future JobsData Science: An Emerging Field for Future Jobs
Data Science: An Emerging Field for Future Jobs
 
Research Data Management: What is it and why is the Library & Archives Servic...
Research Data Management: What is it and why is the Library & Archives Servic...Research Data Management: What is it and why is the Library & Archives Servic...
Research Data Management: What is it and why is the Library & Archives Servic...
 
Data mining - GDi Techno Solutions
Data mining - GDi Techno SolutionsData mining - GDi Techno Solutions
Data mining - GDi Techno Solutions
 
Managing and Sharing Research Data
Managing and Sharing Research DataManaging and Sharing Research Data
Managing and Sharing Research Data
 
Libby Bishop, Ethics Of Data Sharing Ncess Jun 09 Final
Libby Bishop, Ethics Of Data Sharing Ncess Jun 09 FinalLibby Bishop, Ethics Of Data Sharing Ncess Jun 09 Final
Libby Bishop, Ethics Of Data Sharing Ncess Jun 09 Final
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
Data Management for Citizen Science
Data Management for Citizen ScienceData Management for Citizen Science
Data Management for Citizen Science
 
Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Big Data [sorry] & Data Science: What Does a Data Scientist Do?Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Big Data [sorry] & Data Science: What Does a Data Scientist Do?
 
Pistoia Alliance debates AI in life science
Pistoia Alliance debates AI in life sciencePistoia Alliance debates AI in life science
Pistoia Alliance debates AI in life science
 
RDAP13 Jared Lyle: Domain Repositories and Institutional Repositories Partn…
RDAP13 Jared Lyle: Domain Repositories and Institutional Repositories Partn…RDAP13 Jared Lyle: Domain Repositories and Institutional Repositories Partn…
RDAP13 Jared Lyle: Domain Repositories and Institutional Repositories Partn…
 
METRO RDM Webinar
METRO RDM WebinarMETRO RDM Webinar
METRO RDM Webinar
 
Michener Plenary PPSR2012
Michener Plenary PPSR2012Michener Plenary PPSR2012
Michener Plenary PPSR2012
 
TNGIC 2011 Keynote Managing Mountains of Data
TNGIC 2011 Keynote Managing Mountains of DataTNGIC 2011 Keynote Managing Mountains of Data
TNGIC 2011 Keynote Managing Mountains of Data
 
The field-guide-to-data-science
The field-guide-to-data-scienceThe field-guide-to-data-science
The field-guide-to-data-science
 
Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...
Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...
Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...
 
Will We Command Our Data? From the Petascale to the Personal
Will We Command Our Data?  From the Petascale to the PersonalWill We Command Our Data?  From the Petascale to the Personal
Will We Command Our Data? From the Petascale to the Personal
 

More from National Information Standards Organization (NISO)

More from National Information Standards Organization (NISO) (20)

Bazargan "NISO Webinar, Sustainability in Publishing"
Bazargan "NISO Webinar, Sustainability in Publishing"Bazargan "NISO Webinar, Sustainability in Publishing"
Bazargan "NISO Webinar, Sustainability in Publishing"
 
Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"
 
Compton "NISO Webinar, Sustainability in Publishing"
Compton "NISO Webinar, Sustainability in Publishing"Compton "NISO Webinar, Sustainability in Publishing"
Compton "NISO Webinar, Sustainability in Publishing"
 
Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"
 
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
 
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
 
Mattingly "Text and Data Mining: Building Data Driven Applications"
Mattingly "Text and Data Mining: Building Data Driven Applications"Mattingly "Text and Data Mining: Building Data Driven Applications"
Mattingly "Text and Data Mining: Building Data Driven Applications"
 
Mattingly "Text and Data Mining: Searching Vectors"
Mattingly "Text and Data Mining: Searching Vectors"Mattingly "Text and Data Mining: Searching Vectors"
Mattingly "Text and Data Mining: Searching Vectors"
 
Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"
 
Mattingly "Text Processing for Library Data: Representing Text as Data"
Mattingly "Text Processing for Library Data: Representing Text as Data"Mattingly "Text Processing for Library Data: Representing Text as Data"
Mattingly "Text Processing for Library Data: Representing Text as Data"
 
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
 
Ross and Clark "Strategic Planning"
Ross and Clark "Strategic Planning"Ross and Clark "Strategic Planning"
Ross and Clark "Strategic Planning"
 
Mattingly "Data Mining Techniques: Classification and Clustering"
Mattingly "Data Mining Techniques: Classification and Clustering"Mattingly "Data Mining Techniques: Classification and Clustering"
Mattingly "Data Mining Techniques: Classification and Clustering"
 
Straza "Global collaboration towards equitable and open science: UNESCO Recom...
Straza "Global collaboration towards equitable and open science: UNESCO Recom...Straza "Global collaboration towards equitable and open science: UNESCO Recom...
Straza "Global collaboration towards equitable and open science: UNESCO Recom...
 
Lippincott "Beyond access: Accelerating discovery and increasing trust throug...
Lippincott "Beyond access: Accelerating discovery and increasing trust throug...Lippincott "Beyond access: Accelerating discovery and increasing trust throug...
Lippincott "Beyond access: Accelerating discovery and increasing trust throug...
 
Kriegsman "Integrating Open and Equitable Research into Open Science"
Kriegsman "Integrating Open and Equitable Research into Open Science"Kriegsman "Integrating Open and Equitable Research into Open Science"
Kriegsman "Integrating Open and Equitable Research into Open Science"
 
Mattingly "Ethics and Cleaning Data"
Mattingly "Ethics and Cleaning Data"Mattingly "Ethics and Cleaning Data"
Mattingly "Ethics and Cleaning Data"
 
Mercado-Lara "Open & Equitable Program"
Mercado-Lara "Open & Equitable Program"Mercado-Lara "Open & Equitable Program"
Mercado-Lara "Open & Equitable Program"
 
Ratner "Enhancing Open Science: Assessing Tools & Charting Progress"
Ratner "Enhancing Open Science: Assessing Tools & Charting Progress"Ratner "Enhancing Open Science: Assessing Tools & Charting Progress"
Ratner "Enhancing Open Science: Assessing Tools & Charting Progress"
 
Pfeiffer "Enhancing Open Science: Assessing Tools & Charting Progress"
Pfeiffer "Enhancing Open Science: Assessing Tools & Charting Progress"Pfeiffer "Enhancing Open Science: Assessing Tools & Charting Progress"
Pfeiffer "Enhancing Open Science: Assessing Tools & Charting Progress"
 

Recently uploaded

How to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseHow to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseCeline George
 
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQ-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQuiz Club NITW
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfJemuel Francisco
 
Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxGrade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxkarenfajardo43
 
4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptx4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptxmary850239
 
Mythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWMythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWQuiz Club NITW
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfVanessa Camilleri
 
Measures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped dataMeasures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped dataBabyAnnMotar
 
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
Unraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptxUnraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptx
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptxDhatriParmar
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxVanesaIglesias10
 
ClimART Action | eTwinning Project
ClimART Action    |    eTwinning ProjectClimART Action    |    eTwinning Project
ClimART Action | eTwinning Projectjordimapav
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operationalssuser3e220a
 
Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1GloryAnnCastre1
 
Narcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfNarcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfPrerana Jadhav
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptxmary850239
 
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...DhatriParmar
 
week 1 cookery 8 fourth - quarter .pptx
week 1 cookery 8  fourth  -  quarter .pptxweek 1 cookery 8  fourth  -  quarter .pptx
week 1 cookery 8 fourth - quarter .pptxJonalynLegaspi2
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptxmary850239
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationdeepaannamalai16
 

Recently uploaded (20)

Paradigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTAParadigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTA
 
How to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseHow to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 Database
 
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQ-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
 
Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxGrade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
 
4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptx4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptx
 
Mythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWMythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITW
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdf
 
Measures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped dataMeasures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped data
 
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
Unraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptxUnraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptx
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptx
 
ClimART Action | eTwinning Project
ClimART Action    |    eTwinning ProjectClimART Action    |    eTwinning Project
ClimART Action | eTwinning Project
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operational
 
Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1
 
Narcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfNarcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdf
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx
 
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
 
week 1 cookery 8 fourth - quarter .pptx
week 1 cookery 8  fourth  -  quarter .pptxweek 1 cookery 8  fourth  -  quarter .pptx
week 1 cookery 8 fourth - quarter .pptx
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentation
 

NISO Webinar: Part 2: Managing Data for Scholarly Communications

  • 1. http://www.niso.org/news/events/2011/nisowebinars/semanticweb/ Managing Data for Scholarly Communications PART 2: Technical Management October 19, 2011 Speakers: Joan Starr, Mark McFarland, and MacKenzie Smith
  • 2. Dataset  Iden*fica*on  &  Cita*on:   DataCite  and  EZID   Joan  Starr   California  Digital  Library   October,  2011  
  • 3. Dataset  Iden*fica*on  &  Cita*on   Introduc*on   The  Researchers’  Challenge   Iden*fiers  are  a  tool  for  researchers   DataCite   “Helping  you  find,  access  and  reuse  data.”   EZID   Easy  crea*on  and  management  of  DataCite  DOIs  and  other   iden*fiers.   Next  steps      For  DataCite,  EZID  and  you!  
  • 5.
  • 7. Early  in  the  research  life  cycle   Data-­‐intensive  research   +   Wri*ng  up  the  results   Where’s   the  data?   What  if    I   move  it?   PERSISTENT  IDENTIFIERS   make  the  difference   by  Dave  Rogers  hWp://www.flickr.com/photos/dave-­‐rogers/2815036285/  
  • 8. Working  on  a  federated  team   Data-­‐intensive  research   +   Regional  research  center   +   Aging  infrastructure   Where’s   We  have  to   the  data?   move  it!   PERSISTENT  IDENTIFIERS   make  the  difference   ©All  rights  reserved  by  University  of  California,  hWp://www.flickr.com/photos/universityofcalifornia/5405812887  
  • 9. Making  a  career  move   •  Data-­‐intensive  research   +   •  Researcher(s)  on  the   move   I  know   where  my   data  is   and  I’m   taking  it   with  me!   PERSISTENT  IDENTIFIERS   make  the  difference   ©All  rights  reserved  by  University  of  California,     hWp://www.flickr.com/photos/universityofcalifornia/5406308654  
  • 10. Mee*ng  funder  requirements   •  Data-­‐intensive  research   +   •  Grantor  requirements   for  data  management   What  do  we   plan   put  here?   How  do  we   track  the  data?   PERSISTENT  IDENTIFIERS   make  the  difference   By  David  Mellis,  hWp://www.flickr.com/photos/mellis/7675610/  
  • 11.
  • 12. DataCite   German  Na8onal  Library  of  Economics  (ZBW)       Canada  Ins8tute  for  Scien8fic  and  Technical  Informa8on   German  Na8onal  Library  of  Science  and  Technology  (TIB)     (CISTI)   German  Na8onal  Library  of  Medicine  (ZB  MED)   Technical  Informa8on  Center  of  Denmark   GESIS  -­‐  Leibniz  Ins8tute  for  the  Social  Sciences,  Germany     Ins8tute  for  Scien8fic  &  Technical  Informa8on  (INIST-­‐ Australian  Na8onal  Data  Service  (ANDS)   CNRS),  France     ETH  Zurich,  Switzerland   TU  DelS  Library,  The  Netherlands     The  Swedish  Na8onal  Data  Service  (SNDS)   The  Bri8sh  Library  ,  UK   California  Digital  Library  (CDL),  USA     Office  of  Scien8fic  &  Technical  Informa8on  (OSTI),  USA     Purdue  University  Library  
  • 13. DataCite  Metadata  V.  2.2   •  Small  required  set  =  cita*on  elements   •  Op*onal  descrip*ve  set:   –  extendable  lists   –  can  refer  to  other  standards,  schemes   –  domain-­‐neutral   –  rich  ability  to  describe  rela*onships  to  other   digital  objects   •  Metadata  Search  (MDS)  is  full-­‐text  indexed    
  • 14. DataCite  Metadata  V.  2.2   Required  proper8es   Op8onal  proper8es   1.  Iden8fier  (with  type  aWribute)   6.  Subject  (with  schema  aWribute)   2.  Creator  (with  name  iden*fier   7.  Contributor  (with  type  &  name  iden*fier   aWributes)   aWributes)   3.  Title  (with  op*onal  type  aWribute)   8.  Date  (with  type  aWribute)   4.  Publisher   9.  Language       5.  Publica8onYear   10.  ResourceType  (with  descrip*on  aWribute)   11.  AlternateIden*fier  (with  type  aWribute)   12.  RelatedIden*fier  (with  type  &rela*on   type  aWributes)   13.  Size       14.  Format       15.  Version   16.  Rights   17.  Descrip*on  (with  type  aWribute)  
  • 15. •  Get  iden*fiers   •  Add  loca*on   •  Add  metadata   •  Update  loca*on   •  Update  metadata  
  • 25. Next  Steps   DataCite   •   Dublin  Core  applica*on  profile   •   Content  Service   •   Metadata  v.  2.3   EZID   • UI  redesign   • Automated  link  checking   • Exposure  for  cita*ons   By  Nicola  Whitaker  hWp://www.flickr.com/photos/nicolawhitaker/111009156/  
  • 26. Next  Steps  for  you   •  Get  more  informa*on,  and   •  Try  EZID  for  yourself!   By  Nicola  Whitaker  hWp://www.flickr.com/photos/nicolawhitaker/111009156/  
  • 27. For  more  informa*on   EZID   EZID  applica*on:  hWp://n2t.net/ezid/     EZID  website:  hWp://www.cdlib.org/services/uc3/ezid/   UC3  website:  hWp://www.cdlib.org/services/uc3/   DataCite   DataCite  Home:  hWp://datacite.org/   DataCite  Metadata  Schema:   hWp://schema.datacite.org/meta/kernel-­‐2.2/index.html   DataCite  Metadata  Search:  hWp://search.datacite.org   Contact  Joan  Starr  at  uc3@ucop.edu  
  • 28. Ques*ons?   by  Horia  Varlan     hWp://www.flickr.com/photos/horiavarlan/4273168957/in/photostream/  
  • 29. Digital  Library  Services  in  the  Cloud   Mark  McFarland   Director,  Texas  Digital  Library  
  • 30. Outline   •  Who:  Texas  Digital  Library   •  Where:  on  the  cloud   •  Why:  mo*va*ons   •  When:  late  2010   •  What:  lessons  learned   June  2011   30  
  • 31. Who:  Texas  Digital  Library   •  Consor*um  of  higher  educa*on  ins*tu*ons  in  Texas   •  Current  services  include:   –  Ins*tu*on:  IR  (DSpace),  ETD  system   –  Faculty:  OJS,  OCS,  blogs,  wikis   –  Approximately  70  customer-­‐facing  service  instances   •  Legacy  hardware  included   –  Compute  servers   –  Storage  servers   –  Network  support  devices   June  2011   31  
  • 32. Where:  on  the  cloud   •  Migrated  customer-­‐facing  services  to  AWS   –  50  AWS  VM  instances   •  Maintained  some  services  on  local  hardware   •  Simplified  and  consolidated  system   architecture   June  2011   32  
  • 33. Why:  mo*va*ons  /  When:  late  2010   •  Disaster  recovery  plan   –  Prepare  for  data  center  move   •  Elas*c  capacity   –  New  members,  collec*ons   •  Personnel  savings   –  Fewer  competencies,  responsibili*es   •  Began  Oct  2010   June  2011   33  
  • 34. What:  lessons  learned   •  The  Good   –  Elas*c  capacity;  customers  did  not  no*ce  change   –  No  hardware  purchase  cycle   •  The  Mixed   –  Lower  personnel  costs;  failover   •  The  Unexpected   –  Development  tools;  concerns  about  AWS  being  in   U.S.;  excellent  management  console   June  2011   34  
  • 35. Future   •  Preserva*on   –  DuraCloud   •  Con*nue  to  evaluate   –  AWS  is  flexible  and  feature  rich,  but  may  s*ll  not   be  cost  effec*ve   June  2011   35  
  • 36. For  more  informa*on  about  the  TDL,  please  visit  the  Texas   Digital  Library  website  at  hWp://www.tdl.org     or  contact  us  at     info@tdl.org.    
  • 37. Data  Governance  and   Legal  Interoperability   MacKenzie  Smith,  Science  Fellow   ©  Crea*ve  Commons,  2011.  This  work  is  licensed  under  a  Crea*ve  Commons  AWribu*on  3.0  United  States  License.  
  • 38. Why  Data  Sharing  is  Good     •  research  reproducibility   •  fiscal  responsibility   •  broadest  possible  impact   •  large-­‐scale  data  interoperability   –  Includes  technical,  social,  legal  and  policy  aspects   –  usual  focus  on  technical/social   –  focus  here  on  legal/policy  aspects  
  • 39. Why  Data  Sharing  is  Hard   •  No  incen*ves  to  improve  data  quality,  provide   missing  documenta*on   •  Confiden*ality  and  privacy  concerns   (e.g.  HIPAA,  endangered  species)   •  Patents  and  commercial  poten*al   •  Closed  Access  to  journal  ar*cles  (i.e.  results)   •  IP  issues  very  complicated  
  • 40. Defini*ons   Data  governance  is  the  system  of  decision  rights  and   accountabili8es  that  describe  who  can  take  what  ac8ons   with  what  data,  and  when,  under  what  circumstances,  using   what  methods   •  strategies  for  data  quality  control  and  management,  and  processes  that   insure  important  data  assets  are  formally  managed  throughout  an   organiza*on;   –  organiza*ons  can  be  legal  en**es  like  universi*es,  or  virtual  organiza=ons   (e.g.  distributed  research  collabora*ons)   –  Includes  business  processes  and  risk  management;   •  laws  and  policies  associated  with  data;   •  ensures  that  data  can  be  trusted  and  that  people  are  accountable  for   ac*ons  affec*ng  the  data  
  • 41. Defini*ons   •  A"ribu'on  is  legally-­‐imposed,  remedy  is  lawsuit   •  Credit  is  what  researchers  want     •  Cita'on  is  the  norm  in  scholarly  communica*on,   to  provide  suppor*ng  evidence,  now  proxy  for   credit   AWribu*on  does  not  insure  credit  or  cita*on.    
  • 42. Legal  Mechanisms  for  Sharing  Data   1.   licenses   Require  aWribu*on   2.   contracts   3.   waivers     No  aWribu*on   requirement  
  • 43. Copyright  for  Data   •  Does  not  apply  to  facts,  e.g.,  most  scien*fic   data   •  Can  apply  to  a  collec=on  of  facts,  but  only  to   original  aspects,  not  facts  themselves   •  Can  extract  facts  from  a  copyrighted  database   without  infringing  
  • 44. Licenses   •  Licenses  are  not  contracts   –  depend  on  underlying  rights,  e.g.  copyright  or  sui  generis   rights   –  Copyright  is  a  bundle  of  rights,  automa*c  when  fixed,   limited  in  scope  and  dura*on   •  US  and  EU  differ  (EU  has  sui  generis  data  rights)   so  different  licenses  cover  copyright,  sui  generis   rights,  or  both  
  • 45. Licenses   •  Crea*ve  Commons  (CC-­‐BY)  example   –  applies  to  data  and  databases  to  the  extent  they’re   copyrightable   –  Only  data  uses  that  implicate  copyright  trigger   aWribu*on  requirement   –  uses  of  data  that  do  not  implicate  copyright,  e.g.  is  in   the  public  domain,  do  not  trigger  aWribu*on  
  • 46. Licenses   •  Hard  to  assess  copyright  for  par*cular  data   and  databases   •  Hard  to  know  when  license  applies,  creates   risks:   –  data  provider  be  misled   –  data  user  will  under  or  over  comply  
  • 47. Licenses   •  AWribu*on  requirements  are  inflexible,   causing  absurd  situa*ons   –  e.g.  providing  aWribu*on  to  1,000  providers       in  1,000  different  ways   –  known  as  ‘aWribu*on  stacking’     •  Could  provide  aWribu*on  and  s*ll  not  sa*sfy   norms  or  expecta*ons  
  • 49. Contracts   •  Do  not  require  underlying  right     –  rely  on  offer/acceptance,  click  through,  terms  of  use   –  require  formali*es,  e.g.  aWribu*on   •  Downsides   –  confusing  obliga*ons,  no  standardiza*on,  each  user   agreement  can  have  different  requirements   •  Researchers  may  avoid  data  if  they  can’t   understand  the  terms  of  use  
  • 50. Contracts   Unlike  licenses,  contracts  only  binds  par=es   •  If  someone  obtains  licensed  data  and  shares  it,  anyone   who  obtains  data  from  that  user  is  s*ll  bound  by  the   license   •  If  data  had  been  shared  by  contract,  anyone  obtaining   data  from  the  second  party  is  not  bound  by  the   contract  since  they  aren’t  a  party  to  the  contract   •  In  this  respect,  contracts  are  more  limited  than  licenses  
  • 51. Contracts   •  Have  broader  reach  than  licenses   –  not  *ed  to  a  legal  right   –  can  take  away  rights  of  public  
  • 53. Waivers   •  Provide  legal  certainty   –  No  need  to  decipher  copyright  protec*on  or  six  through  confusing   legalese   –  BeWer  than  silence,  to  avoid  forcing  people  to  guess  what  their  risks   are     •  Mean  loss  of  control   –  Can’t  require  aWribu*on  or  other  terms   •  Avoid  problems  and  rely  on  scholarly  norms   –  no  aWribu*on  stacking  or  inappropriate  obliga*ons  
  • 54. 3  levels:  Waiver,  Fall-­‐back  license,  Non-­‐asser*on  pledge  
  • 55. Summary   •  Law  is  messy,  each  approach  has  consequences   •  Licenses  –  (1)  legal  uncertainty  about  scope,  (2)   requirements  can  be  inconsistent  with  norms   •  Contracts  –  (1)  burdensome  requirements  with  custom   terms,  (2)  exceed  scope  of  rights  with  requirements  that   take  away  normal  rights   •  Waivers  –  (1)  avoid  problems,  but  (2)  lose  control  and   rely  on  norms  
  • 56. Summary   •  Each  approach  requires  loss  of  control   •  No  mechanism  imposes  legally-­‐binding  obliga*ons  in   way  that  perfectly  maps  to  scholarly  credit,  e.g.   cita*on   •  Ideal  solu*on  creates  the  least  fric*on  to  scien*fic   progress  while  giving  credit  where  due,  i.e.,  waivers   and  norms  (the  community  governs  itself)