SlideShare a Scribd company logo
1 of 19
Download to read offline
Adoption of Cloud Computing
   in Scientific Research

              Yehia El-khatib
     School of Computing & Communications
               Lancaster University
Obligatory cloud image…
Outline
• Cloud Computing in Business
• Cloud Computing in Research
  – What does it offer
  – Comparison with other distributed paradigms
  – Different solutions
  – Examples
  – Challenges
• Conclusions
Cloud Computing
• Computational and storage resources provided in an
  on-demand fashion by large clusters of commodity
  computers.
• Offers opportunities:
   – Customised and isolated computing resources are
     obtained as and when required to handle user demand.
   – Pay per use model allows feasibility and sustainability
     through harnessing economies of scale.
   – Management via web service APIs.
   – Universal Internet-based access (all you need is / / /  / … ).
Cloud Computing in Business
        • Used to curb computing expenses without
          restricting the business.
             – Scale  to meet user demand.
             – Dynamically mitigate system failures.
             – Seamlessly roll out new capabilities.
        • Numerous users:



        • Cloud computing market
             – Worth $40.7bn in 2010
             – Expected $177bn in 2015
             – Expected $241bn in 2020

         http://www.forrester.com/rb/Research/sizing_cloud/q/id/58161/t/2
         http://www.gartner.com/it/page.jsp?id=1735214
Academic Research
• Researchers do not spend their
  entire time in the lab, field, etc.
• Collected data needs to be
  processed in order to distil some
  meaning.
• Such analysis processes range from
  scripts and spreadsheets to very
  complex computationally-intensive
  workflows.
• More data is being gathered using
  innovative methods (e.g. remote
  sensing).
Cloud in Academia
• People in academic circles are slowly adopting
  cloud computing for particular applications.
• What does the cloud offer?
  – ‘Everything as a service’ promotes integration and
    relatively easy collaboration across institutions,
    communities and disciplines.
  – Customised environments.
  – Elastic computing infrastructure.
  – More load off the users, i.e. scientists.
     More time to focus on their scientific processes.
Distributed Computing Paradigms
                    HPC              Grid               P2P             Cloud
  Ownership
                My university   Our universities    Our partners       3rd party
(management)
                                                      Trust in
    Trust        Very High           High                                 ?
                                                     partners
                                                     Depends on
  Reliability       High             High                             Very high
                                                   size & partners
                                 Individual &
                 Individual
 Accounting                     Organisational       Difficult…      Pay per use
                   Quotas
                                    quotas

Customisation     Very bad            Bad          Fairly flexible   Very flexible

   Access           Easy         Complicated        Complicated          Easy

                    Local           Remote         Local/Remote
   Support                                                           24x7 support
                  sysadmin         sysadmin          sysadmin
What solutions do clouds offer?
• Generic solutions:                 Research Support
  – Infrastructure (e.g. EmuLab)
  – Analysis (e.g. Biocep-R, CloudNumbers)
  – Space to discover (e.g. Academia.edu), share (e.g.
    myExperiment) and collaborate (e.g. Mendeley)
• Domain-driven solutions:
                                        Research
  – Workflow execution
  – Data normalisation
  – Data discovery, based on content rather than
    problem area
Domain-driven Cloud Solutions
• Environmental Virtual Observatory pilot
  (EVOp)
               http://www.EnvironmentalVirtualObservatory.org
  – To help:
     • Environmental scientists solve ‘big questions’.
     • Policy makers understand implications of decisions.
     • Raise awareness in and interact with local communities.
  – Use case for pilot phase: hydrology.
  – Deal with both geospatial and time series data.
  – Customisable modelling workflows for scientists.
  – Predefined analysis tools for non-specialists.
Domain-driven Cloud Solutions
• Penn State Integrated Hydrologic Model (PIHM)
                                http://slidesha.re/pFFMWp

  – Terrestrial watershed modelling in order to predict
    water distribution.
  – Data is sourced through a repository.
  – Cloud offers seamless access to abundant
    resources to carry out modelling workflows and
    simulations.
  – Results are delivered using bespoke visualisation
    (SaaS).
Domain-driven Cloud Solutions
• Coaddition of SDSS Astronomical Images
                                http://arxiv.org/abs/1010.1015
  – Using Apache Hadoop for coaddition of images from
    the Sloan Digital Sky Survey. (Coaddition increases the
    signal-to-noise ratio).
  – Runs over NSF cloud maintained by Google and IBM.
  – Experimented different approaches to coaddition
    using the MapReduce framework.
  – Improved performance was achieved by reducing job
    initialisation overhead using index files.
  – 300 million pixels processed in 3 minutes.
Domain-driven Cloud Solutions
• Cell structure analysis http://books.google.co.uk/books?id=C_aQqAa6rEoC
    – Hadoop jobs to analyse videos of single cell structures
      under varying conditions.
• European Space Agency                            http://www.esa.int
    – Uses AWS EC2 & S3 to deliver data about the current state
      of the planet to scientists, governmental agencies and other
      organizations worldwide.
• MD Anderson Cancer Center                        http://bit.ly/o0zDwl
    – Large private cloud (8,000 processors) maintained by The
      University of Texas.
    – Used to execute genomic processes against large clinical
      datasets (~1.4PB) on cancer.
Domain-driven Cloud Solutions
• NSF        http://www.nsf.gov/news/news_summ.jsp?cntn_id=119248

  – Approx. $4.5m to fund 13 research projects.
  – Mostly CS, but also bioinformatics & earth sciences.
• VENUS-C                                     http://www.venus-c.eu

  – 15 year-long pilots in different disciplines: architecture,
    biology, bioinformatics, chemistry, earth sciences, healthcare,
    maritime surveillance, mathematics, physics and social media.
• Masters @ SCC Lancaster
  – Corpus linguistics
  – Hydrological modelling
  – 3D imaging (volcanology)
Challenges
• Trust: security and privacy (even by law in some
  circumstances).
• Great divide between different disciplines.
• Data ownership.
   – Most data producers don’t mind sharing as long as they
     retain ownership.
• Software licenses.
• Belief that cloud/grid/etc is only for certain app’s.
• Investment into delivering cloud-based solutions
  to scientists.
   – Legacy applications & infrastructures.
Challenges
• Trust: security and privacy (even by law in some
  circumstances).
• Great divide between different disciplines.
• Data ownership.
   – Most data producers don’t mind sharing as long as they
     retain ownership.
• Software licenses.
• Belief that cloud/grid/etc is only for certain app’s.
• Investment into delivering cloud-based solutions
  to scientists.
   – Legacy applications & infrastructures.
Conclusions
• Need for cloud computing for scientific research:
   – Mainly: “I need more number crunching!”
   – Also: “I need to bridge data/discipline gaps.”
• Overall adoption is still relatively limited.
   – Various reasons, including trust. But also cloud-unrelated
     problems such as data ownership and software licensing.
• Investment into cloud-enabled research is important.
   – Not to browse articles via a mobile app while on the tube…
   – But for the added value of building and nurturing
     relationships.
   – And the economic model (less up front costs).
• Impact:
   – Better scientific tools, with less overhead on the scientists.
   – Potential for more integration.
Thank you!
                         Questions
Flickr credits:
• theaucitron        • stacylynn
• theplanetdotcom    • bpamerica
• Pnnl               • soilscience


                         http://www.comp.lancs.ac.uk/~elkhatib/
       Yehia El-khatib   @yelkhatib

                         http://www.EnvironmentalVirtualObservatory.org
                         @EVOpilot
Discussion
• Trust is not the problem; it is the perception of trust.
• Different academic communities have varying attitudes
  towards new technologies such as the cloud.
• More examples of funding to adopt cloud computing:
   o research: http://www.jisc.ac.uk/news/stories/2011/02/umf.aspx
   o Gov’t: http://www.cabinetoffice.gov.uk/content/government-ict-strategy

More Related Content

What's hot

Grid is Dead ? Nimrod on the Cloud
Grid is Dead ? Nimrod on the CloudGrid is Dead ? Nimrod on the Cloud
Grid is Dead ? Nimrod on the CloudAdianto Wibisono
 
云计算及其应用
云计算及其应用云计算及其应用
云计算及其应用lantianlcdx
 
Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...Ola Spjuth
 
2017 bio it world
2017 bio it world2017 bio it world
2017 bio it worldChris Dwan
 
2016 05 sanger
2016 05 sanger2016 05 sanger
2016 05 sangerChris Dwan
 
Why manage research data?
Why manage research data?Why manage research data?
Why manage research data?Graham Pryor
 
2016 09 cxo forum
2016 09 cxo forum2016 09 cxo forum
2016 09 cxo forumChris Dwan
 
Above the Clouds: A Berkeley View of Cloud Computing: Paper Review
Above the Clouds: A Berkeley View of Cloud Computing:  Paper Review Above the Clouds: A Berkeley View of Cloud Computing:  Paper Review
Above the Clouds: A Berkeley View of Cloud Computing: Paper Review Mala Deep Upadhaya
 
Federation and Interoperability in the Nectar Research Cloud
Federation and Interoperability in the Nectar Research CloudFederation and Interoperability in the Nectar Research Cloud
Federation and Interoperability in the Nectar Research CloudOpenStack
 
UC-Wide Cyberinfrastructure for Data-Intensive Research
UC-Wide Cyberinfrastructure for Data-Intensive ResearchUC-Wide Cyberinfrastructure for Data-Intensive Research
UC-Wide Cyberinfrastructure for Data-Intensive ResearchLarry Smarr
 
An End-to-End Campus-Scale High Performance Cyberinfrastructure for Data-Inte...
An End-to-End Campus-Scale High Performance Cyberinfrastructure for Data-Inte...An End-to-End Campus-Scale High Performance Cyberinfrastructure for Data-Inte...
An End-to-End Campus-Scale High Performance Cyberinfrastructure for Data-Inte...Larry Smarr
 
Xiaoxin_Resume_combined_NCL
Xiaoxin_Resume_combined_NCLXiaoxin_Resume_combined_NCL
Xiaoxin_Resume_combined_NCLXiaoxin Ren
 
Grid'5000: Running a Large Instrument for Parallel and Distributed Computing ...
Grid'5000: Running a Large Instrument for Parallel and Distributed Computing ...Grid'5000: Running a Large Instrument for Parallel and Distributed Computing ...
Grid'5000: Running a Large Instrument for Parallel and Distributed Computing ...Frederic Desprez
 
Cloud and grid computing by Leen Blom, Centric
Cloud and grid computing by Leen Blom, CentricCloud and grid computing by Leen Blom, Centric
Cloud and grid computing by Leen Blom, CentricCentric
 
UK e-Infrastructure: Widening Access, Increasing Participation
UK e-Infrastructure: Widening Access, Increasing ParticipationUK e-Infrastructure: Widening Access, Increasing Participation
UK e-Infrastructure: Widening Access, Increasing ParticipationNeil Chue Hong
 
Azure Brain: 4th paradigm, scientific discovery & (really) big data
Azure Brain: 4th paradigm, scientific discovery & (really) big dataAzure Brain: 4th paradigm, scientific discovery & (really) big data
Azure Brain: 4th paradigm, scientific discovery & (really) big dataMicrosoft Technet France
 
Cloud computing - Terena 2011
Cloud computing - Terena 2011Cloud computing - Terena 2011
Cloud computing - Terena 2011Jisc
 
Rpi talk foster september 2011
Rpi talk foster september 2011Rpi talk foster september 2011
Rpi talk foster september 2011Ian Foster
 

What's hot (20)

Grid is Dead ? Nimrod on the Cloud
Grid is Dead ? Nimrod on the CloudGrid is Dead ? Nimrod on the Cloud
Grid is Dead ? Nimrod on the Cloud
 
云计算及其应用
云计算及其应用云计算及其应用
云计算及其应用
 
Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...
 
2017 bio it world
2017 bio it world2017 bio it world
2017 bio it world
 
2016 05 sanger
2016 05 sanger2016 05 sanger
2016 05 sanger
 
From IoT Devices to Cloud
From IoT Devices to CloudFrom IoT Devices to Cloud
From IoT Devices to Cloud
 
Why manage research data?
Why manage research data?Why manage research data?
Why manage research data?
 
2016 09 cxo forum
2016 09 cxo forum2016 09 cxo forum
2016 09 cxo forum
 
Above the Clouds: A Berkeley View of Cloud Computing: Paper Review
Above the Clouds: A Berkeley View of Cloud Computing:  Paper Review Above the Clouds: A Berkeley View of Cloud Computing:  Paper Review
Above the Clouds: A Berkeley View of Cloud Computing: Paper Review
 
Federation and Interoperability in the Nectar Research Cloud
Federation and Interoperability in the Nectar Research CloudFederation and Interoperability in the Nectar Research Cloud
Federation and Interoperability in the Nectar Research Cloud
 
UC-Wide Cyberinfrastructure for Data-Intensive Research
UC-Wide Cyberinfrastructure for Data-Intensive ResearchUC-Wide Cyberinfrastructure for Data-Intensive Research
UC-Wide Cyberinfrastructure for Data-Intensive Research
 
An End-to-End Campus-Scale High Performance Cyberinfrastructure for Data-Inte...
An End-to-End Campus-Scale High Performance Cyberinfrastructure for Data-Inte...An End-to-End Campus-Scale High Performance Cyberinfrastructure for Data-Inte...
An End-to-End Campus-Scale High Performance Cyberinfrastructure for Data-Inte...
 
Xiaoxin_Resume_combined_NCL
Xiaoxin_Resume_combined_NCLXiaoxin_Resume_combined_NCL
Xiaoxin_Resume_combined_NCL
 
Grid'5000: Running a Large Instrument for Parallel and Distributed Computing ...
Grid'5000: Running a Large Instrument for Parallel and Distributed Computing ...Grid'5000: Running a Large Instrument for Parallel and Distributed Computing ...
Grid'5000: Running a Large Instrument for Parallel and Distributed Computing ...
 
Cloud and grid computing by Leen Blom, Centric
Cloud and grid computing by Leen Blom, CentricCloud and grid computing by Leen Blom, Centric
Cloud and grid computing by Leen Blom, Centric
 
UK e-Infrastructure: Widening Access, Increasing Participation
UK e-Infrastructure: Widening Access, Increasing ParticipationUK e-Infrastructure: Widening Access, Increasing Participation
UK e-Infrastructure: Widening Access, Increasing Participation
 
Cloud pres3
Cloud pres3Cloud pres3
Cloud pres3
 
Azure Brain: 4th paradigm, scientific discovery & (really) big data
Azure Brain: 4th paradigm, scientific discovery & (really) big dataAzure Brain: 4th paradigm, scientific discovery & (really) big data
Azure Brain: 4th paradigm, scientific discovery & (really) big data
 
Cloud computing - Terena 2011
Cloud computing - Terena 2011Cloud computing - Terena 2011
Cloud computing - Terena 2011
 
Rpi talk foster september 2011
Rpi talk foster september 2011Rpi talk foster september 2011
Rpi talk foster september 2011
 

Similar to Adoption of Cloud Computing in Scientific Research

Utilising Cloud Computing for Research through Infrastructure, Software and D...
Utilising Cloud Computing for Research through Infrastructure, Software and D...Utilising Cloud Computing for Research through Infrastructure, Software and D...
Utilising Cloud Computing for Research through Infrastructure, Software and D...David Wallom
 
Cloud computing
Cloud computingCloud computing
Cloud computingAmit Kumar
 
Speaker Presention by Irena Bojanova of the University of Maryland University...
Speaker Presention by Irena Bojanova of the University of Maryland University...Speaker Presention by Irena Bojanova of the University of Maryland University...
Speaker Presention by Irena Bojanova of the University of Maryland University...Tim Harvey
 
The wonders of Cloud Computing.pptx
The wonders of Cloud Computing.pptxThe wonders of Cloud Computing.pptx
The wonders of Cloud Computing.pptxOmSatpathy
 
key research challenges in cloud computing
key research challenges in cloud computingkey research challenges in cloud computing
key research challenges in cloud computingIgnacio M. Llorente
 
Concepts of Distributed Computing & Cloud Computing
Concepts of Distributed Computing & Cloud Computing Concepts of Distributed Computing & Cloud Computing
Concepts of Distributed Computing & Cloud Computing Hitesh Kumar Markam
 
Clould Computing and its application in Libraries
Clould Computing and its application in LibrariesClould Computing and its application in Libraries
Clould Computing and its application in LibrariesAmit Shaw
 
g-Social - Enhancing e-Science Tools with Social Networking Functionality
g-Social - Enhancing e-Science Tools with Social Networking Functionalityg-Social - Enhancing e-Science Tools with Social Networking Functionality
g-Social - Enhancing e-Science Tools with Social Networking FunctionalityNicholas Loulloudes
 
cloudcomputingdistributedcomputing-171208050503 (1).pdf
cloudcomputingdistributedcomputing-171208050503 (1).pdfcloudcomputingdistributedcomputing-171208050503 (1).pdf
cloudcomputingdistributedcomputing-171208050503 (1).pdfArchanaPandiyan
 
Federated Cloud Computing
Federated Cloud ComputingFederated Cloud Computing
Federated Cloud ComputingDavid Wallom
 
The e-Ciber Superfacility Project
The e-Ciber Superfacility ProjectThe e-Ciber Superfacility Project
The e-Ciber Superfacility ProjectLeandro Ciuffo
 
The Effectiveness, Efficiency and Legitimacy of Outsourcing Your Data
The Effectiveness, Efficiency and Legitimacy of Outsourcing Your Data The Effectiveness, Efficiency and Legitimacy of Outsourcing Your Data
The Effectiveness, Efficiency and Legitimacy of Outsourcing Your Data DataCentred
 
Cloud and Grid Computing
Cloud and Grid ComputingCloud and Grid Computing
Cloud and Grid ComputingLeen Blom
 
Cloud Computing and Virtualization Overview by Amr Ali
Cloud Computing and Virtualization Overview by Amr AliCloud Computing and Virtualization Overview by Amr Ali
Cloud Computing and Virtualization Overview by Amr AliAmr Ali
 
Cloud computing for education: A new dawn?
Cloud computing for education: A new dawn?Cloud computing for education: A new dawn?
Cloud computing for education: A new dawn?Sikder Tahsin Al-Amin
 
Supporting Research through "Desktop as a Service" models of e-infrastructure...
Supporting Research through "Desktop as a Service" models of e-infrastructure...Supporting Research through "Desktop as a Service" models of e-infrastructure...
Supporting Research through "Desktop as a Service" models of e-infrastructure...David Wallom
 
Declare Victory with Big Data
Declare Victory with Big DataDeclare Victory with Big Data
Declare Victory with Big DataJ On The Beach
 

Similar to Adoption of Cloud Computing in Scientific Research (20)

Utilising Cloud Computing for Research through Infrastructure, Software and D...
Utilising Cloud Computing for Research through Infrastructure, Software and D...Utilising Cloud Computing for Research through Infrastructure, Software and D...
Utilising Cloud Computing for Research through Infrastructure, Software and D...
 
Cloud computing
Cloud computingCloud computing
Cloud computing
 
Speaker Presention by Irena Bojanova of the University of Maryland University...
Speaker Presention by Irena Bojanova of the University of Maryland University...Speaker Presention by Irena Bojanova of the University of Maryland University...
Speaker Presention by Irena Bojanova of the University of Maryland University...
 
The wonders of Cloud Computing.pptx
The wonders of Cloud Computing.pptxThe wonders of Cloud Computing.pptx
The wonders of Cloud Computing.pptx
 
key research challenges in cloud computing
key research challenges in cloud computingkey research challenges in cloud computing
key research challenges in cloud computing
 
Concepts of Distributed Computing & Cloud Computing
Concepts of Distributed Computing & Cloud Computing Concepts of Distributed Computing & Cloud Computing
Concepts of Distributed Computing & Cloud Computing
 
Clould Computing and its application in Libraries
Clould Computing and its application in LibrariesClould Computing and its application in Libraries
Clould Computing and its application in Libraries
 
g-Social - Enhancing e-Science Tools with Social Networking Functionality
g-Social - Enhancing e-Science Tools with Social Networking Functionalityg-Social - Enhancing e-Science Tools with Social Networking Functionality
g-Social - Enhancing e-Science Tools with Social Networking Functionality
 
Networked 3-D Virtual Collaboration in Science and Education: Towards 'Web 3....
Networked 3-D Virtual Collaboration in Science and Education: Towards 'Web 3....Networked 3-D Virtual Collaboration in Science and Education: Towards 'Web 3....
Networked 3-D Virtual Collaboration in Science and Education: Towards 'Web 3....
 
Bertenthal
BertenthalBertenthal
Bertenthal
 
cloudcomputingdistributedcomputing-171208050503 (1).pdf
cloudcomputingdistributedcomputing-171208050503 (1).pdfcloudcomputingdistributedcomputing-171208050503 (1).pdf
cloudcomputingdistributedcomputing-171208050503 (1).pdf
 
Cloud Computing & Distributed Computing
Cloud Computing & Distributed ComputingCloud Computing & Distributed Computing
Cloud Computing & Distributed Computing
 
Federated Cloud Computing
Federated Cloud ComputingFederated Cloud Computing
Federated Cloud Computing
 
The e-Ciber Superfacility Project
The e-Ciber Superfacility ProjectThe e-Ciber Superfacility Project
The e-Ciber Superfacility Project
 
The Effectiveness, Efficiency and Legitimacy of Outsourcing Your Data
The Effectiveness, Efficiency and Legitimacy of Outsourcing Your Data The Effectiveness, Efficiency and Legitimacy of Outsourcing Your Data
The Effectiveness, Efficiency and Legitimacy of Outsourcing Your Data
 
Cloud and Grid Computing
Cloud and Grid ComputingCloud and Grid Computing
Cloud and Grid Computing
 
Cloud Computing and Virtualization Overview by Amr Ali
Cloud Computing and Virtualization Overview by Amr AliCloud Computing and Virtualization Overview by Amr Ali
Cloud Computing and Virtualization Overview by Amr Ali
 
Cloud computing for education: A new dawn?
Cloud computing for education: A new dawn?Cloud computing for education: A new dawn?
Cloud computing for education: A new dawn?
 
Supporting Research through "Desktop as a Service" models of e-infrastructure...
Supporting Research through "Desktop as a Service" models of e-infrastructure...Supporting Research through "Desktop as a Service" models of e-infrastructure...
Supporting Research through "Desktop as a Service" models of e-infrastructure...
 
Declare Victory with Big Data
Declare Victory with Big DataDeclare Victory with Big Data
Declare Victory with Big Data
 

Recently uploaded

"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 

Recently uploaded (20)

"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 

Adoption of Cloud Computing in Scientific Research

  • 1. Adoption of Cloud Computing in Scientific Research Yehia El-khatib School of Computing & Communications Lancaster University
  • 3. Outline • Cloud Computing in Business • Cloud Computing in Research – What does it offer – Comparison with other distributed paradigms – Different solutions – Examples – Challenges • Conclusions
  • 4. Cloud Computing • Computational and storage resources provided in an on-demand fashion by large clusters of commodity computers. • Offers opportunities: – Customised and isolated computing resources are obtained as and when required to handle user demand. – Pay per use model allows feasibility and sustainability through harnessing economies of scale. – Management via web service APIs. – Universal Internet-based access (all you need is / / / / … ).
  • 5. Cloud Computing in Business • Used to curb computing expenses without restricting the business. – Scale  to meet user demand. – Dynamically mitigate system failures. – Seamlessly roll out new capabilities. • Numerous users: • Cloud computing market – Worth $40.7bn in 2010 – Expected $177bn in 2015 – Expected $241bn in 2020  http://www.forrester.com/rb/Research/sizing_cloud/q/id/58161/t/2  http://www.gartner.com/it/page.jsp?id=1735214
  • 6. Academic Research • Researchers do not spend their entire time in the lab, field, etc. • Collected data needs to be processed in order to distil some meaning. • Such analysis processes range from scripts and spreadsheets to very complex computationally-intensive workflows. • More data is being gathered using innovative methods (e.g. remote sensing).
  • 7. Cloud in Academia • People in academic circles are slowly adopting cloud computing for particular applications. • What does the cloud offer? – ‘Everything as a service’ promotes integration and relatively easy collaboration across institutions, communities and disciplines. – Customised environments. – Elastic computing infrastructure. – More load off the users, i.e. scientists.  More time to focus on their scientific processes.
  • 8. Distributed Computing Paradigms HPC Grid P2P Cloud Ownership My university Our universities Our partners 3rd party (management)  Trust in Trust Very High High ? partners Depends on Reliability High High Very high size & partners Individual & Individual Accounting Organisational Difficult… Pay per use Quotas quotas Customisation Very bad Bad Fairly flexible Very flexible Access Easy Complicated Complicated Easy Local Remote Local/Remote Support 24x7 support sysadmin sysadmin sysadmin
  • 9. What solutions do clouds offer? • Generic solutions: Research Support – Infrastructure (e.g. EmuLab) – Analysis (e.g. Biocep-R, CloudNumbers) – Space to discover (e.g. Academia.edu), share (e.g. myExperiment) and collaborate (e.g. Mendeley) • Domain-driven solutions: Research – Workflow execution – Data normalisation – Data discovery, based on content rather than problem area
  • 10. Domain-driven Cloud Solutions • Environmental Virtual Observatory pilot (EVOp) http://www.EnvironmentalVirtualObservatory.org – To help: • Environmental scientists solve ‘big questions’. • Policy makers understand implications of decisions. • Raise awareness in and interact with local communities. – Use case for pilot phase: hydrology. – Deal with both geospatial and time series data. – Customisable modelling workflows for scientists. – Predefined analysis tools for non-specialists.
  • 11. Domain-driven Cloud Solutions • Penn State Integrated Hydrologic Model (PIHM) http://slidesha.re/pFFMWp – Terrestrial watershed modelling in order to predict water distribution. – Data is sourced through a repository. – Cloud offers seamless access to abundant resources to carry out modelling workflows and simulations. – Results are delivered using bespoke visualisation (SaaS).
  • 12. Domain-driven Cloud Solutions • Coaddition of SDSS Astronomical Images http://arxiv.org/abs/1010.1015 – Using Apache Hadoop for coaddition of images from the Sloan Digital Sky Survey. (Coaddition increases the signal-to-noise ratio). – Runs over NSF cloud maintained by Google and IBM. – Experimented different approaches to coaddition using the MapReduce framework. – Improved performance was achieved by reducing job initialisation overhead using index files. – 300 million pixels processed in 3 minutes.
  • 13. Domain-driven Cloud Solutions • Cell structure analysis http://books.google.co.uk/books?id=C_aQqAa6rEoC – Hadoop jobs to analyse videos of single cell structures under varying conditions. • European Space Agency http://www.esa.int – Uses AWS EC2 & S3 to deliver data about the current state of the planet to scientists, governmental agencies and other organizations worldwide. • MD Anderson Cancer Center http://bit.ly/o0zDwl – Large private cloud (8,000 processors) maintained by The University of Texas. – Used to execute genomic processes against large clinical datasets (~1.4PB) on cancer.
  • 14. Domain-driven Cloud Solutions • NSF http://www.nsf.gov/news/news_summ.jsp?cntn_id=119248 – Approx. $4.5m to fund 13 research projects. – Mostly CS, but also bioinformatics & earth sciences. • VENUS-C http://www.venus-c.eu – 15 year-long pilots in different disciplines: architecture, biology, bioinformatics, chemistry, earth sciences, healthcare, maritime surveillance, mathematics, physics and social media. • Masters @ SCC Lancaster – Corpus linguistics – Hydrological modelling – 3D imaging (volcanology)
  • 15. Challenges • Trust: security and privacy (even by law in some circumstances). • Great divide between different disciplines. • Data ownership. – Most data producers don’t mind sharing as long as they retain ownership. • Software licenses. • Belief that cloud/grid/etc is only for certain app’s. • Investment into delivering cloud-based solutions to scientists. – Legacy applications & infrastructures.
  • 16. Challenges • Trust: security and privacy (even by law in some circumstances). • Great divide between different disciplines. • Data ownership. – Most data producers don’t mind sharing as long as they retain ownership. • Software licenses. • Belief that cloud/grid/etc is only for certain app’s. • Investment into delivering cloud-based solutions to scientists. – Legacy applications & infrastructures.
  • 17. Conclusions • Need for cloud computing for scientific research: – Mainly: “I need more number crunching!” – Also: “I need to bridge data/discipline gaps.” • Overall adoption is still relatively limited. – Various reasons, including trust. But also cloud-unrelated problems such as data ownership and software licensing. • Investment into cloud-enabled research is important. – Not to browse articles via a mobile app while on the tube… – But for the added value of building and nurturing relationships. – And the economic model (less up front costs). • Impact: – Better scientific tools, with less overhead on the scientists. – Potential for more integration.
  • 18. Thank you! Questions Flickr credits: • theaucitron • stacylynn • theplanetdotcom • bpamerica • Pnnl • soilscience http://www.comp.lancs.ac.uk/~elkhatib/ Yehia El-khatib @yelkhatib http://www.EnvironmentalVirtualObservatory.org @EVOpilot
  • 19. Discussion • Trust is not the problem; it is the perception of trust. • Different academic communities have varying attitudes towards new technologies such as the cloud. • More examples of funding to adopt cloud computing: o research: http://www.jisc.ac.uk/news/stories/2011/02/umf.aspx o Gov’t: http://www.cabinetoffice.gov.uk/content/government-ict-strategy