SlideShare a Scribd company logo
1 of 24
Download to read offline
UNDERSTANDING THE
BIG PICTURE OF E-SCIENCE

Andrew Sallans
Head of Strategic Data Initiatives
University of Virginia Library

E-Science Bootcamp
Claude Moore Health Sciences Library, University of Virginia
4 March 2011
OUTLINE
 What it‟s all about
 Examples

 Implications

 UVA Libraries Response (Round 1)




                                     2
WHAT IT‟S ALL ABOUT (AROUND 1999)
"e-Science is about global collaboration in key areas
  of science, and the next generation of
  infrastructure that will enable it."

"e-Science will change the dynamic of the way
  science is undertaken."

                    Dr Sir John Taylor
                    Director General of Research Councils,
                    Office of Science and Technology
                    United Kingdom
                                                                  3
                    Source: http://webscience.org/person/8.html
WHAT MADE THIS POSSIBLE?
 Internet/World Wide Web
 Faster networking (fiber, special research
  networks, advances in grids)
 Better storage (higher capacity, faster access,
  better reliability)
 Cheap storage (costs keep decreasing)

 Major funding initiatives

 Broader interest in collaboration




                                                    4
SOME COMMON TERMS
 Computational science
 Scientific computing

 Research computing

 High-performance computing

 Cyberscience

 Cyberinfrastructure




                               5
CLIMATOLOGY RESEARCH




Sources:
1) Climate Simulation on Cray XT5 “Jaguar” supercomputer, ORNL                  6
   (http://www.ornl.gov/info/ornlreview/v42_3_09/article02.shtml)
2) Cray XT5 “Jaguar” supercomputer, ORNL
   (http://www.ornl.gov/info/ornlreview/v42_1_09/images/a05_p04_xt5_full.jpg)
LARGE HADRON COLLIDER AT CERN
                                               Circumference: 26,659 meters
                                               Magnets: 9,300
                                               Speed: protons move at
                                                99.9999991% speed of light)
                                               Collisions/second: 600 million
                                               Data produced: equivalent to
                                                100,000 dual layer DVDs per
                                                year
                                               LHC Grid: tens of thousands
                                                of computers around the world
                                                used collectively to analyze
                                                data (will take 15 years)

                                                                                                   7
Source: CERN website (http://cdsweb.cern.ch/record/975468/files/its-2006-003.gif?subformat=icon)
BIOMEDICAL INFORMATICS GRID (CABIG)
 Launched as test in 2004
 Adopted by over 50 NCI-designated cancer centers

 Focused on:
       Connecting scientists and practitioners through a
        shareable and interoperable infrastructure
       Development of standard rules and a common
        language to more easily share information
       Building or adapting tools for collecting, analyzing,
        integrating, and disseminating information associated
        with cancer research and care

    Source: caBIG website, National Cancer Institute (https://cabig.nci.nih.gov/)   8
CITIZEN SCIENCE…THE SOCIAL SIDE




   34,617,406 clicks done by 82,931 users!

 Source: Zooniverse, Real Science Online (http://www.zooniverse.org/home)   9
IMPLICATIONS FOR RESEARCH
 Greater emphasis on technology
 Increase in interdisciplinary research and
  collaboration
 Often bigger data, with far more complex
  associated issues (storage, access, expertise,
  funding, preservation, etc.)
 Need for innovative approaches and integration
  into education/curriculum



                                                   10
DATA TSUNAMI




     IDC estimate of about 1.7 zetabytes (1 trillion terabytes) around 2011
     ….twice the available space
Source:                                                                                                11
1) The Great Wave off Kanagawa, Katsushika Hokusai. Found on Wikipedia.
2) The Diverse and Exploding Digital Universe, IDC, May 2010 (http://www.emc.com/collateral/analyst-
   reports/diverse-exploding-digital-universe.pdf)
BUT, NOT ALL DATA IS EQUAL….




 Source: Long Tail, Wikipedia (http://en.wikipedia.org/wiki/The_Long_Tail)   12
CASE STUDY: UVA LIBRARIES RESPONSE
(ROUND 1)
 Collaboration established around 2005 through
  discussions between ITC and Library, and
  impetus of Frye Institute capstones.
 Research Computing Support services in need of
  greater visibility, Library seeking ways to
  support changes in scientific research, collocation
  provides mutual benefits.
 In 2006, staff moved to Library locations
  (Research Computing Lab & Scholars‟ Lab),
  setup new service points and services.

                                                        13
RESEARCH IN THE E-SCIENCE WORLD
 Heavy use of electronic information resources
 Work is predominantly done from a lab/office, not
  in the Library
 Collaboration is fundamental, but don‟t always
  know people in other domains
 Grad students are usually bringing new
  technology/methods into the team (learning more
  about grad students in a research study now)



                                                      14
IDENTIFIED E-SCIENCE TRENDS
   Various components
     Computationally intensive science
     IT/software/infrastructure
     Collaboration
     Data

   Often intertwined with Open Access initiatives




                                                     15
E-SCIENCE IN         OTHER LIBRARIES
   Purdue University
     Focus on data curation
     IATUL Conference, June 2010

   University of Illinois – Urbana Champaign
     Focus on data curation
     Summer Institute on Data Curation

   Cornell University
       Metadata consulting services
   University of New Mexico
       Major DataONE grant
                                                16
RESEARCH COMPUTING LAB RESPONSE
 Aiming to provide support across the entire
  scientific research data lifecycle
 Staff with expertise in:
     Data
     Quantitative data, statistics
     Modeling, visualization
     Scientific publishing

 Emphasis on consulting, not drop-off services
 Partnership with traditional librarians to help
  ease transition to new support models
                                                    17
RCL OUTREACH
University Community
 Speaker series 2006, 2007, 2008
 Research 2.0 Symposium
 Partnerships with courses, other units (ie.
  MLBS)
 Short course series each semester


Library Community
 Panel at the ACCS Conference in 2007
 Poster at ARL/CNI Forum in 2008
 Poster at STS Section of ALA in 2009
                                                18
 Journal article in JLA in 2009
SAMPLE RCL CONSULTATIONS
   STS Undergrad Environmental Justice (2008)
     Development of technology solutions for empowering the
      citizen scientist
     Web 2.0 tools, data collection/management
     Data analysis
   Economics Graduate Student (2008/2009)
     Airline flight price modeling
     Screen scraping, data collection/management
     Data analysis
   Mountain Lake Beetle Project (2009)
     Mobile data acquisition/collection solution
     Database development/management, programming
     Data analysis
   Archiving of dissertation data (2009)
     EVSC student, ModelMaker 4.0 data
     Biology student, IDL, Matlab, R code                     19
SPECIFICS FOR MEDICAL CENTER
 At least 600 RCL support requests from Medical
  Center from October „07 through December „09
 Medical Center patrons are heavy users of
  computational software like Matlab, SAS,
  LabView
 Increasing emphasis on collaboration
  (translational research)
 Greater attention to open access (NIH policy)

 Growing interest in areas like image integrity



                                                   20
TAKE-AWAYS
 This is the future
 Heavily growing space, lots of opportunity

 Requires big investment and commitment, the
  biggest being training and priority alignment
 Libraries and institutions need to make decisions
  on what to do and what not to do
 It‟s a culture change for both libraries,
  institutions, and researchers



                                                      21
COMING LATER….(ROUND 2)
   “Practical Applications of e-Science” in UVA
    Libraries today




                                                   22
QUESTIONS?
   Please feel free to contact me with questions:
     als9q@virginia.edu
     434-243-2180
     Twitter: asallans




                                                     23
ADDITIONAL INFORMATION
   E-Science Talking Points for ARL Deans and
    Directors, Elisabeth Jones, University of
    Washington, October 2008
    (http://www.arl.org/rtl/escience/)




                                                 24

More Related Content

What's hot

ICSTI Annual Meeting 2014 Tokyo Y. Murayama
ICSTI Annual Meeting 2014 Tokyo Y. MurayamaICSTI Annual Meeting 2014 Tokyo Y. Murayama
ICSTI Annual Meeting 2014 Tokyo Y. MurayamaYasuhiro Murayama
 
Federation and Interoperability in the Nectar Research Cloud
Federation and Interoperability in the Nectar Research CloudFederation and Interoperability in the Nectar Research Cloud
Federation and Interoperability in the Nectar Research CloudOpenStack
 
Science Engagement: A Non-Technical Approach to the Technical Divide
Science Engagement: A Non-Technical Approach to the Technical DivideScience Engagement: A Non-Technical Approach to the Technical Divide
Science Engagement: A Non-Technical Approach to the Technical DivideCybera Inc.
 
Making Small Data BIG (UT Austin, March 2016)
Making Small Data BIG (UT Austin, March 2016)Making Small Data BIG (UT Austin, March 2016)
Making Small Data BIG (UT Austin, March 2016)Kerstin Lehnert
 
Citizen science
Citizen scienceCitizen science
Citizen sciencesamar1407
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8Scott Edmunds
 
Mind the Gap: Reflections on Data Policies and Practice
Mind the Gap: Reflections on Data Policies and PracticeMind the Gap: Reflections on Data Policies and Practice
Mind the Gap: Reflections on Data Policies and PracticeLizLyon
 
Rogan esip overview
Rogan esip overviewRogan esip overview
Rogan esip overviewRebreid
 
BeSTGRID OpenGridForum 29 GIN session
BeSTGRID OpenGridForum 29 GIN sessionBeSTGRID OpenGridForum 29 GIN session
BeSTGRID OpenGridForum 29 GIN sessionNick Jones
 
The OAI-ORE Interoperability Framework in the Context of the Current Scholarl...
The OAI-ORE Interoperability Framework in the Context of the Current Scholarl...The OAI-ORE Interoperability Framework in the Context of the Current Scholarl...
The OAI-ORE Interoperability Framework in the Context of the Current Scholarl...Herbert Van de Sompel
 
Mapping The Escience (27 Oct2009)
Mapping The Escience (27 Oct2009)Mapping The Escience (27 Oct2009)
Mapping The Escience (27 Oct2009)Han Woo PARK
 
Science20brussels osimo april2013
Science20brussels osimo april2013Science20brussels osimo april2013
Science20brussels osimo april2013osimod
 

What's hot (20)

Data-Intensive Research
Data-Intensive ResearchData-Intensive Research
Data-Intensive Research
 
ICSTI Annual Meeting 2014 Tokyo Y. Murayama
ICSTI Annual Meeting 2014 Tokyo Y. MurayamaICSTI Annual Meeting 2014 Tokyo Y. Murayama
ICSTI Annual Meeting 2014 Tokyo Y. Murayama
 
Federation and Interoperability in the Nectar Research Cloud
Federation and Interoperability in the Nectar Research CloudFederation and Interoperability in the Nectar Research Cloud
Federation and Interoperability in the Nectar Research Cloud
 
Science Engagement: A Non-Technical Approach to the Technical Divide
Science Engagement: A Non-Technical Approach to the Technical DivideScience Engagement: A Non-Technical Approach to the Technical Divide
Science Engagement: A Non-Technical Approach to the Technical Divide
 
E scidocdays review
E scidocdays reviewE scidocdays review
E scidocdays review
 
Making Small Data BIG (UT Austin, March 2016)
Making Small Data BIG (UT Austin, March 2016)Making Small Data BIG (UT Austin, March 2016)
Making Small Data BIG (UT Austin, March 2016)
 
Citizen science
Citizen scienceCitizen science
Citizen science
 
Nicole Nogoy at the Auckland BMC RoadShow
Nicole Nogoy at the Auckland BMC RoadShowNicole Nogoy at the Auckland BMC RoadShow
Nicole Nogoy at the Auckland BMC RoadShow
 
Cyberistructure
CyberistructureCyberistructure
Cyberistructure
 
E research overview gahegan bioinformatics workshop 2010
E research overview gahegan bioinformatics workshop 2010E research overview gahegan bioinformatics workshop 2010
E research overview gahegan bioinformatics workshop 2010
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8
 
NewsletterIMS2015_w
NewsletterIMS2015_wNewsletterIMS2015_w
NewsletterIMS2015_w
 
Mind the Gap: Reflections on Data Policies and Practice
Mind the Gap: Reflections on Data Policies and PracticeMind the Gap: Reflections on Data Policies and Practice
Mind the Gap: Reflections on Data Policies and Practice
 
Cyberinfrastructure for Einstein's Equations and Beyond
Cyberinfrastructure for Einstein's Equations and BeyondCyberinfrastructure for Einstein's Equations and Beyond
Cyberinfrastructure for Einstein's Equations and Beyond
 
Rogan esip overview
Rogan esip overviewRogan esip overview
Rogan esip overview
 
BeSTGRID OpenGridForum 29 GIN session
BeSTGRID OpenGridForum 29 GIN sessionBeSTGRID OpenGridForum 29 GIN session
BeSTGRID OpenGridForum 29 GIN session
 
The OAI-ORE Interoperability Framework in the Context of the Current Scholarl...
The OAI-ORE Interoperability Framework in the Context of the Current Scholarl...The OAI-ORE Interoperability Framework in the Context of the Current Scholarl...
The OAI-ORE Interoperability Framework in the Context of the Current Scholarl...
 
Collins seattle-2014-final
Collins seattle-2014-finalCollins seattle-2014-final
Collins seattle-2014-final
 
Mapping The Escience (27 Oct2009)
Mapping The Escience (27 Oct2009)Mapping The Escience (27 Oct2009)
Mapping The Escience (27 Oct2009)
 
Science20brussels osimo april2013
Science20brussels osimo april2013Science20brussels osimo april2013
Science20brussels osimo april2013
 

Viewers also liked

Our changing state: the realities of austerity and devolution
Our changing state: the realities of austerity and devolutionOur changing state: the realities of austerity and devolution
Our changing state: the realities of austerity and devolutionBrowne Jacobson LLP
 
F.Blin IFLA Trend Report English_dk
F.Blin IFLA Trend Report English_dkF.Blin IFLA Trend Report English_dk
F.Blin IFLA Trend Report English_dkFrederic Blin
 
Parvat Pradesh Mein Pavas
Parvat Pradesh Mein PavasParvat Pradesh Mein Pavas
Parvat Pradesh Mein Pavaszainul2002
 
Brief Encounter: London Zoo
Brief Encounter: London ZooBrief Encounter: London Zoo
Brief Encounter: London ZooEarnest
 
Alfred day hershy
Alfred day hershyAlfred day hershy
Alfred day hershykimmygee_
 
5 of the Biggest Myths about Growing Your Business
5 of the Biggest Myths about Growing Your Business5 of the Biggest Myths about Growing Your Business
5 of the Biggest Myths about Growing Your BusinessVolaris Group
 
Navigating Uncertainty when Launching New Ideas
Navigating Uncertainty when Launching New IdeasNavigating Uncertainty when Launching New Ideas
Navigating Uncertainty when Launching New Ideashopperomatic
 
Start Writing Groovy
Start Writing GroovyStart Writing Groovy
Start Writing GroovyEvgeny Goldin
 
The Clientshare Academy Briefing - Gold Membership - by Practice Paradox
The Clientshare Academy Briefing - Gold Membership - by Practice ParadoxThe Clientshare Academy Briefing - Gold Membership - by Practice Paradox
The Clientshare Academy Briefing - Gold Membership - by Practice ParadoxPractice Paradox
 
LA Chef for OpenStack Hackday
LA Chef for OpenStack HackdayLA Chef for OpenStack Hackday
LA Chef for OpenStack HackdayMatt Ray
 
BioBankCloud: Machine Learning on Genomics + GA4GH @ Med at Scale
BioBankCloud: Machine Learning on Genomics + GA4GH  @ Med at ScaleBioBankCloud: Machine Learning on Genomics + GA4GH  @ Med at Scale
BioBankCloud: Machine Learning on Genomics + GA4GH @ Med at ScaleAndy Petrella
 
Icsi transformation 11-13 sept - agra
Icsi transformation   11-13 sept - agraIcsi transformation   11-13 sept - agra
Icsi transformation 11-13 sept - agraPavan Kumar Vijay
 
How to Improve Your Website
How to Improve Your WebsiteHow to Improve Your Website
How to Improve Your WebsiteBizSmart Select
 
Guía taller 2 a padres de familia ie medellin
Guía taller 2 a padres de familia ie medellinGuía taller 2 a padres de familia ie medellin
Guía taller 2 a padres de familia ie medellinCarlos Ríos Lemos
 
De la aldea a los recintos ceremoniales en la sociedad andina del periodo ini...
De la aldea a los recintos ceremoniales en la sociedad andina del periodo ini...De la aldea a los recintos ceremoniales en la sociedad andina del periodo ini...
De la aldea a los recintos ceremoniales en la sociedad andina del periodo ini...Gusstock Concha Flores
 
Créer et afficher une tag list sur scoop.it
Créer et afficher une tag list sur scoop.itCréer et afficher une tag list sur scoop.it
Créer et afficher une tag list sur scoop.itThierry Zenou
 

Viewers also liked (20)

Our changing state: the realities of austerity and devolution
Our changing state: the realities of austerity and devolutionOur changing state: the realities of austerity and devolution
Our changing state: the realities of austerity and devolution
 
F.Blin IFLA Trend Report English_dk
F.Blin IFLA Trend Report English_dkF.Blin IFLA Trend Report English_dk
F.Blin IFLA Trend Report English_dk
 
Parvat Pradesh Mein Pavas
Parvat Pradesh Mein PavasParvat Pradesh Mein Pavas
Parvat Pradesh Mein Pavas
 
Brief Encounter: London Zoo
Brief Encounter: London ZooBrief Encounter: London Zoo
Brief Encounter: London Zoo
 
Presentación taller 1
Presentación taller 1Presentación taller 1
Presentación taller 1
 
Italy weddings
Italy weddingsItaly weddings
Italy weddings
 
Alfred day hershy
Alfred day hershyAlfred day hershy
Alfred day hershy
 
Ngan hang-thuong-mai 2
Ngan hang-thuong-mai 2Ngan hang-thuong-mai 2
Ngan hang-thuong-mai 2
 
5 of the Biggest Myths about Growing Your Business
5 of the Biggest Myths about Growing Your Business5 of the Biggest Myths about Growing Your Business
5 of the Biggest Myths about Growing Your Business
 
Navigating Uncertainty when Launching New Ideas
Navigating Uncertainty when Launching New IdeasNavigating Uncertainty when Launching New Ideas
Navigating Uncertainty when Launching New Ideas
 
Start Writing Groovy
Start Writing GroovyStart Writing Groovy
Start Writing Groovy
 
The Clientshare Academy Briefing - Gold Membership - by Practice Paradox
The Clientshare Academy Briefing - Gold Membership - by Practice ParadoxThe Clientshare Academy Briefing - Gold Membership - by Practice Paradox
The Clientshare Academy Briefing - Gold Membership - by Practice Paradox
 
LA Chef for OpenStack Hackday
LA Chef for OpenStack HackdayLA Chef for OpenStack Hackday
LA Chef for OpenStack Hackday
 
Simplifying life
Simplifying lifeSimplifying life
Simplifying life
 
BioBankCloud: Machine Learning on Genomics + GA4GH @ Med at Scale
BioBankCloud: Machine Learning on Genomics + GA4GH  @ Med at ScaleBioBankCloud: Machine Learning on Genomics + GA4GH  @ Med at Scale
BioBankCloud: Machine Learning on Genomics + GA4GH @ Med at Scale
 
Icsi transformation 11-13 sept - agra
Icsi transformation   11-13 sept - agraIcsi transformation   11-13 sept - agra
Icsi transformation 11-13 sept - agra
 
How to Improve Your Website
How to Improve Your WebsiteHow to Improve Your Website
How to Improve Your Website
 
Guía taller 2 a padres de familia ie medellin
Guía taller 2 a padres de familia ie medellinGuía taller 2 a padres de familia ie medellin
Guía taller 2 a padres de familia ie medellin
 
De la aldea a los recintos ceremoniales en la sociedad andina del periodo ini...
De la aldea a los recintos ceremoniales en la sociedad andina del periodo ini...De la aldea a los recintos ceremoniales en la sociedad andina del periodo ini...
De la aldea a los recintos ceremoniales en la sociedad andina del periodo ini...
 
Créer et afficher une tag list sur scoop.it
Créer et afficher une tag list sur scoop.itCréer et afficher une tag list sur scoop.it
Créer et afficher une tag list sur scoop.it
 

Similar to Understanding the Big Picture of e-Science

Coupling Australia’s Researchers to the Global Innovation Economy
Coupling Australia’s Researchers to the Global Innovation EconomyCoupling Australia’s Researchers to the Global Innovation Economy
Coupling Australia’s Researchers to the Global Innovation EconomyLarry Smarr
 
Coupling Australia’s Researchers to the Global Innovation Economy
Coupling Australia’s Researchers to the Global Innovation EconomyCoupling Australia’s Researchers to the Global Innovation Economy
Coupling Australia’s Researchers to the Global Innovation EconomyLarry Smarr
 
Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...
Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...
Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...EarthCube
 
High Performance Cyberinfrastructure to Support Data-Intensive Biomedical Res...
High Performance Cyberinfrastructure to Support Data-Intensive Biomedical Res...High Performance Cyberinfrastructure to Support Data-Intensive Biomedical Res...
High Performance Cyberinfrastructure to Support Data-Intensive Biomedical Res...Larry Smarr
 
Codes, Clouds & Constellations: Open Science in the Data Decade
Codes, Clouds & Constellations: Open Science in the Data DecadeCodes, Clouds & Constellations: Open Science in the Data Decade
Codes, Clouds & Constellations: Open Science in the Data DecadeLizLyon
 
Biodiversity Informatics: An Interdisciplinary Challenge
Biodiversity Informatics: An Interdisciplinary ChallengeBiodiversity Informatics: An Interdisciplinary Challenge
Biodiversity Informatics: An Interdisciplinary ChallengeBryan Heidorn
 
The eCrystals Federation
The eCrystals FederationThe eCrystals Federation
The eCrystals FederationManjulaPatel
 
Disciplinary and institutional perspectives on digital curation
Disciplinary and institutional perspectives on digital curationDisciplinary and institutional perspectives on digital curation
Disciplinary and institutional perspectives on digital curationMichael Day
 
Curation of Research Data
Curation of Research DataCuration of Research Data
Curation of Research DataMichael Day
 
Open Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | FutureOpen Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | FutureRoss Mounce
 
Living in the Future
Living in the FutureLiving in the Future
Living in the FutureLarry Smarr
 
Genomic Research: The Jump to Light Speed
Genomic Research: The Jump to Light SpeedGenomic Research: The Jump to Light Speed
Genomic Research: The Jump to Light SpeedLarry Smarr
 
Open Science Data Cloud - CCA 11
Open Science Data Cloud - CCA 11Open Science Data Cloud - CCA 11
Open Science Data Cloud - CCA 11Robert Grossman
 
Building a Global Collaboration System for Data-Intensive Discovery
Building a Global Collaboration System for Data-Intensive DiscoveryBuilding a Global Collaboration System for Data-Intensive Discovery
Building a Global Collaboration System for Data-Intensive DiscoveryLarry Smarr
 
06 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.201406 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.2014VinothkumaR Ramu
 
WOW13_RPITWC_Web Observatories
WOW13_RPITWC_Web ObservatoriesWOW13_RPITWC_Web Observatories
WOW13_RPITWC_Web Observatoriesgloriakt
 
High Performance Collaboration
High Performance CollaborationHigh Performance Collaboration
High Performance CollaborationLarry Smarr
 
British Library Datasets Programme 2010
British Library Datasets Programme 2010British Library Datasets Programme 2010
British Library Datasets Programme 2010ALISS
 

Similar to Understanding the Big Picture of e-Science (20)

Coupling Australia’s Researchers to the Global Innovation Economy
Coupling Australia’s Researchers to the Global Innovation EconomyCoupling Australia’s Researchers to the Global Innovation Economy
Coupling Australia’s Researchers to the Global Innovation Economy
 
Coupling Australia’s Researchers to the Global Innovation Economy
Coupling Australia’s Researchers to the Global Innovation EconomyCoupling Australia’s Researchers to the Global Innovation Economy
Coupling Australia’s Researchers to the Global Innovation Economy
 
Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...
Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...
Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...
 
High Performance Cyberinfrastructure to Support Data-Intensive Biomedical Res...
High Performance Cyberinfrastructure to Support Data-Intensive Biomedical Res...High Performance Cyberinfrastructure to Support Data-Intensive Biomedical Res...
High Performance Cyberinfrastructure to Support Data-Intensive Biomedical Res...
 
British Library Datasets Programme Feb 2011
British Library Datasets Programme Feb 2011British Library Datasets Programme Feb 2011
British Library Datasets Programme Feb 2011
 
Codes, Clouds & Constellations: Open Science in the Data Decade
Codes, Clouds & Constellations: Open Science in the Data DecadeCodes, Clouds & Constellations: Open Science in the Data Decade
Codes, Clouds & Constellations: Open Science in the Data Decade
 
Biodiversity Informatics: An Interdisciplinary Challenge
Biodiversity Informatics: An Interdisciplinary ChallengeBiodiversity Informatics: An Interdisciplinary Challenge
Biodiversity Informatics: An Interdisciplinary Challenge
 
The eCrystals Federation
The eCrystals FederationThe eCrystals Federation
The eCrystals Federation
 
Disciplinary and institutional perspectives on digital curation
Disciplinary and institutional perspectives on digital curationDisciplinary and institutional perspectives on digital curation
Disciplinary and institutional perspectives on digital curation
 
Curation of Research Data
Curation of Research DataCuration of Research Data
Curation of Research Data
 
Open Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | FutureOpen Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | Future
 
Living in the Future
Living in the FutureLiving in the Future
Living in the Future
 
Genomic Research: The Jump to Light Speed
Genomic Research: The Jump to Light SpeedGenomic Research: The Jump to Light Speed
Genomic Research: The Jump to Light Speed
 
Open Science Data Cloud - CCA 11
Open Science Data Cloud - CCA 11Open Science Data Cloud - CCA 11
Open Science Data Cloud - CCA 11
 
Building a Global Collaboration System for Data-Intensive Discovery
Building a Global Collaboration System for Data-Intensive DiscoveryBuilding a Global Collaboration System for Data-Intensive Discovery
Building a Global Collaboration System for Data-Intensive Discovery
 
06 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.201406 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.2014
 
WOW13_RPITWC_Web Observatories
WOW13_RPITWC_Web ObservatoriesWOW13_RPITWC_Web Observatories
WOW13_RPITWC_Web Observatories
 
11 7 2007 EVA
11 7 2007  EVA11 7 2007  EVA
11 7 2007 EVA
 
High Performance Collaboration
High Performance CollaborationHigh Performance Collaboration
High Performance Collaboration
 
British Library Datasets Programme 2010
British Library Datasets Programme 2010British Library Datasets Programme 2010
British Library Datasets Programme 2010
 

More from Andrew Sallans

Improving Integrity, Transparency, and Reproducibility Through Connection of ...
Improving Integrity, Transparency, and Reproducibility Through Connection of ...Improving Integrity, Transparency, and Reproducibility Through Connection of ...
Improving Integrity, Transparency, and Reproducibility Through Connection of ...Andrew Sallans
 
Badges to Acknowledge Open Practices
Badges to Acknowledge Open PracticesBadges to Acknowledge Open Practices
Badges to Acknowledge Open PracticesAndrew Sallans
 
Open Science Framework (OSF)
Open Science Framework (OSF)Open Science Framework (OSF)
Open Science Framework (OSF)Andrew Sallans
 
Open Science Framework (OSF): Presentation and Training
Open Science Framework (OSF): Presentation and TrainingOpen Science Framework (OSF): Presentation and Training
Open Science Framework (OSF): Presentation and TrainingAndrew Sallans
 
DMPTool: Integration with other open science software
DMPTool:  Integration with other open science softwareDMPTool:  Integration with other open science software
DMPTool: Integration with other open science softwareAndrew Sallans
 
Aligning library services with emerging research data needs
Aligning library services with emerging research data needsAligning library services with emerging research data needs
Aligning library services with emerging research data needsAndrew Sallans
 
Hands-On Data Management Planning for Life Sciences
Hands-On Data Management Planning for Life SciencesHands-On Data Management Planning for Life Sciences
Hands-On Data Management Planning for Life SciencesAndrew Sallans
 
DMVitals: A Data Management Assessment Recommendations Tool - IASSIST 2012
DMVitals: A Data Management Assessment Recommendations Tool - IASSIST 2012DMVitals: A Data Management Assessment Recommendations Tool - IASSIST 2012
DMVitals: A Data Management Assessment Recommendations Tool - IASSIST 2012Andrew Sallans
 
DataONE User's Group Lifecycle Management: Planning
DataONE User's Group Lifecycle Management:  PlanningDataONE User's Group Lifecycle Management:  Planning
DataONE User's Group Lifecycle Management: PlanningAndrew Sallans
 
DMPTool: a community tool
DMPTool: a community toolDMPTool: a community tool
DMPTool: a community toolAndrew Sallans
 
Data Management Plan Advising? A New Business Venture for Libraries
Data Management Plan Advising?  A New Business Venture for LibrariesData Management Plan Advising?  A New Business Venture for Libraries
Data Management Plan Advising? A New Business Venture for LibrariesAndrew Sallans
 
UVa Library Scientific Data Consulting Group (SciDaC): New Partnerships and...
UVa Library Scientific Data Consulting Group (SciDaC):  New Partnerships and...UVa Library Scientific Data Consulting Group (SciDaC):  New Partnerships and...
UVa Library Scientific Data Consulting Group (SciDaC): New Partnerships and...Andrew Sallans
 
NSF Data Management Plan Case Study: UVa’s Response.
NSF Data Management Plan Case Study:  UVa’s Response.NSF Data Management Plan Case Study:  UVa’s Response.
NSF Data Management Plan Case Study: UVa’s Response.Andrew Sallans
 
Practical Applications of e-Science
Practical Applications of e-SciencePractical Applications of e-Science
Practical Applications of e-ScienceAndrew Sallans
 
NSF Data Management Plan - Implications for Librarians
NSF Data Management Plan - Implications for LibrariansNSF Data Management Plan - Implications for Librarians
NSF Data Management Plan - Implications for LibrariansAndrew Sallans
 

More from Andrew Sallans (15)

Improving Integrity, Transparency, and Reproducibility Through Connection of ...
Improving Integrity, Transparency, and Reproducibility Through Connection of ...Improving Integrity, Transparency, and Reproducibility Through Connection of ...
Improving Integrity, Transparency, and Reproducibility Through Connection of ...
 
Badges to Acknowledge Open Practices
Badges to Acknowledge Open PracticesBadges to Acknowledge Open Practices
Badges to Acknowledge Open Practices
 
Open Science Framework (OSF)
Open Science Framework (OSF)Open Science Framework (OSF)
Open Science Framework (OSF)
 
Open Science Framework (OSF): Presentation and Training
Open Science Framework (OSF): Presentation and TrainingOpen Science Framework (OSF): Presentation and Training
Open Science Framework (OSF): Presentation and Training
 
DMPTool: Integration with other open science software
DMPTool:  Integration with other open science softwareDMPTool:  Integration with other open science software
DMPTool: Integration with other open science software
 
Aligning library services with emerging research data needs
Aligning library services with emerging research data needsAligning library services with emerging research data needs
Aligning library services with emerging research data needs
 
Hands-On Data Management Planning for Life Sciences
Hands-On Data Management Planning for Life SciencesHands-On Data Management Planning for Life Sciences
Hands-On Data Management Planning for Life Sciences
 
DMVitals: A Data Management Assessment Recommendations Tool - IASSIST 2012
DMVitals: A Data Management Assessment Recommendations Tool - IASSIST 2012DMVitals: A Data Management Assessment Recommendations Tool - IASSIST 2012
DMVitals: A Data Management Assessment Recommendations Tool - IASSIST 2012
 
DataONE User's Group Lifecycle Management: Planning
DataONE User's Group Lifecycle Management:  PlanningDataONE User's Group Lifecycle Management:  Planning
DataONE User's Group Lifecycle Management: Planning
 
DMPTool: a community tool
DMPTool: a community toolDMPTool: a community tool
DMPTool: a community tool
 
Data Management Plan Advising? A New Business Venture for Libraries
Data Management Plan Advising?  A New Business Venture for LibrariesData Management Plan Advising?  A New Business Venture for Libraries
Data Management Plan Advising? A New Business Venture for Libraries
 
UVa Library Scientific Data Consulting Group (SciDaC): New Partnerships and...
UVa Library Scientific Data Consulting Group (SciDaC):  New Partnerships and...UVa Library Scientific Data Consulting Group (SciDaC):  New Partnerships and...
UVa Library Scientific Data Consulting Group (SciDaC): New Partnerships and...
 
NSF Data Management Plan Case Study: UVa’s Response.
NSF Data Management Plan Case Study:  UVa’s Response.NSF Data Management Plan Case Study:  UVa’s Response.
NSF Data Management Plan Case Study: UVa’s Response.
 
Practical Applications of e-Science
Practical Applications of e-SciencePractical Applications of e-Science
Practical Applications of e-Science
 
NSF Data Management Plan - Implications for Librarians
NSF Data Management Plan - Implications for LibrariansNSF Data Management Plan - Implications for Librarians
NSF Data Management Plan - Implications for Librarians
 

Understanding the Big Picture of e-Science

  • 1. UNDERSTANDING THE BIG PICTURE OF E-SCIENCE Andrew Sallans Head of Strategic Data Initiatives University of Virginia Library E-Science Bootcamp Claude Moore Health Sciences Library, University of Virginia 4 March 2011
  • 2. OUTLINE  What it‟s all about  Examples  Implications  UVA Libraries Response (Round 1) 2
  • 3. WHAT IT‟S ALL ABOUT (AROUND 1999) "e-Science is about global collaboration in key areas of science, and the next generation of infrastructure that will enable it." "e-Science will change the dynamic of the way science is undertaken." Dr Sir John Taylor Director General of Research Councils, Office of Science and Technology United Kingdom 3 Source: http://webscience.org/person/8.html
  • 4. WHAT MADE THIS POSSIBLE?  Internet/World Wide Web  Faster networking (fiber, special research networks, advances in grids)  Better storage (higher capacity, faster access, better reliability)  Cheap storage (costs keep decreasing)  Major funding initiatives  Broader interest in collaboration 4
  • 5. SOME COMMON TERMS  Computational science  Scientific computing  Research computing  High-performance computing  Cyberscience  Cyberinfrastructure 5
  • 6. CLIMATOLOGY RESEARCH Sources: 1) Climate Simulation on Cray XT5 “Jaguar” supercomputer, ORNL 6 (http://www.ornl.gov/info/ornlreview/v42_3_09/article02.shtml) 2) Cray XT5 “Jaguar” supercomputer, ORNL (http://www.ornl.gov/info/ornlreview/v42_1_09/images/a05_p04_xt5_full.jpg)
  • 7. LARGE HADRON COLLIDER AT CERN  Circumference: 26,659 meters  Magnets: 9,300  Speed: protons move at 99.9999991% speed of light)  Collisions/second: 600 million  Data produced: equivalent to 100,000 dual layer DVDs per year  LHC Grid: tens of thousands of computers around the world used collectively to analyze data (will take 15 years) 7 Source: CERN website (http://cdsweb.cern.ch/record/975468/files/its-2006-003.gif?subformat=icon)
  • 8. BIOMEDICAL INFORMATICS GRID (CABIG)  Launched as test in 2004  Adopted by over 50 NCI-designated cancer centers  Focused on:  Connecting scientists and practitioners through a shareable and interoperable infrastructure  Development of standard rules and a common language to more easily share information  Building or adapting tools for collecting, analyzing, integrating, and disseminating information associated with cancer research and care Source: caBIG website, National Cancer Institute (https://cabig.nci.nih.gov/) 8
  • 9. CITIZEN SCIENCE…THE SOCIAL SIDE 34,617,406 clicks done by 82,931 users! Source: Zooniverse, Real Science Online (http://www.zooniverse.org/home) 9
  • 10. IMPLICATIONS FOR RESEARCH  Greater emphasis on technology  Increase in interdisciplinary research and collaboration  Often bigger data, with far more complex associated issues (storage, access, expertise, funding, preservation, etc.)  Need for innovative approaches and integration into education/curriculum 10
  • 11. DATA TSUNAMI IDC estimate of about 1.7 zetabytes (1 trillion terabytes) around 2011 ….twice the available space Source: 11 1) The Great Wave off Kanagawa, Katsushika Hokusai. Found on Wikipedia. 2) The Diverse and Exploding Digital Universe, IDC, May 2010 (http://www.emc.com/collateral/analyst- reports/diverse-exploding-digital-universe.pdf)
  • 12. BUT, NOT ALL DATA IS EQUAL…. Source: Long Tail, Wikipedia (http://en.wikipedia.org/wiki/The_Long_Tail) 12
  • 13. CASE STUDY: UVA LIBRARIES RESPONSE (ROUND 1)  Collaboration established around 2005 through discussions between ITC and Library, and impetus of Frye Institute capstones.  Research Computing Support services in need of greater visibility, Library seeking ways to support changes in scientific research, collocation provides mutual benefits.  In 2006, staff moved to Library locations (Research Computing Lab & Scholars‟ Lab), setup new service points and services. 13
  • 14. RESEARCH IN THE E-SCIENCE WORLD  Heavy use of electronic information resources  Work is predominantly done from a lab/office, not in the Library  Collaboration is fundamental, but don‟t always know people in other domains  Grad students are usually bringing new technology/methods into the team (learning more about grad students in a research study now) 14
  • 15. IDENTIFIED E-SCIENCE TRENDS  Various components  Computationally intensive science  IT/software/infrastructure  Collaboration  Data  Often intertwined with Open Access initiatives 15
  • 16. E-SCIENCE IN OTHER LIBRARIES  Purdue University  Focus on data curation  IATUL Conference, June 2010  University of Illinois – Urbana Champaign  Focus on data curation  Summer Institute on Data Curation  Cornell University  Metadata consulting services  University of New Mexico  Major DataONE grant 16
  • 17. RESEARCH COMPUTING LAB RESPONSE  Aiming to provide support across the entire scientific research data lifecycle  Staff with expertise in:  Data  Quantitative data, statistics  Modeling, visualization  Scientific publishing  Emphasis on consulting, not drop-off services  Partnership with traditional librarians to help ease transition to new support models 17
  • 18. RCL OUTREACH University Community  Speaker series 2006, 2007, 2008  Research 2.0 Symposium  Partnerships with courses, other units (ie. MLBS)  Short course series each semester Library Community  Panel at the ACCS Conference in 2007  Poster at ARL/CNI Forum in 2008  Poster at STS Section of ALA in 2009 18  Journal article in JLA in 2009
  • 19. SAMPLE RCL CONSULTATIONS  STS Undergrad Environmental Justice (2008)  Development of technology solutions for empowering the citizen scientist  Web 2.0 tools, data collection/management  Data analysis  Economics Graduate Student (2008/2009)  Airline flight price modeling  Screen scraping, data collection/management  Data analysis  Mountain Lake Beetle Project (2009)  Mobile data acquisition/collection solution  Database development/management, programming  Data analysis  Archiving of dissertation data (2009)  EVSC student, ModelMaker 4.0 data  Biology student, IDL, Matlab, R code 19
  • 20. SPECIFICS FOR MEDICAL CENTER  At least 600 RCL support requests from Medical Center from October „07 through December „09  Medical Center patrons are heavy users of computational software like Matlab, SAS, LabView  Increasing emphasis on collaboration (translational research)  Greater attention to open access (NIH policy)  Growing interest in areas like image integrity 20
  • 21. TAKE-AWAYS  This is the future  Heavily growing space, lots of opportunity  Requires big investment and commitment, the biggest being training and priority alignment  Libraries and institutions need to make decisions on what to do and what not to do  It‟s a culture change for both libraries, institutions, and researchers 21
  • 22. COMING LATER….(ROUND 2)  “Practical Applications of e-Science” in UVA Libraries today 22
  • 23. QUESTIONS?  Please feel free to contact me with questions:  als9q@virginia.edu  434-243-2180  Twitter: asallans 23
  • 24. ADDITIONAL INFORMATION  E-Science Talking Points for ARL Deans and Directors, Elisabeth Jones, University of Washington, October 2008 (http://www.arl.org/rtl/escience/) 24