© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Adrian White, Head of Scientific Computing, APAC
Amazon Web Services
Dr Peter Blain, Integrated Marine Observing System
University of Tasmania
Scaling Ideas: Accelerating Research with AWS
Technical 301
“It is the tension between creativity and skepticism that has produced the stunning and unexpected findings of science.”
Carl Sagan (1934-1996)
Why AWS for Research?
• Time to Science: Access research infrastructure in minutes
• Low Cost: Pay-as-you-go pricing
• Elastic: Easily add or remove capacity
• Globally Accessible: Easily collaborate with researchers around the world
• Secure: A collection of tools to protect data and privacy
• Scalable: Access to effectively limitless capacity
Perhaps the Top Three Reasons?
• Time to Science
• Scalable
• Elastic
Researchers Are Using AWS For…
• Life Sciences & Genomics
• Space Research & Astronomy
• High Energy Physics
• Open Data, e.g. Satellite Imagery
• HPC & Grid computing
… and much more.
Walter and Eliza Hall Institute of Medical Research
DiUS & WEHI have built two medical research solutions:
• Building an auto-scaling R cluster using CfnCluster
• Scientific image processing in the cloud with Fiji/ImageJ
These provide on-demand, templated research services to do science.
Sequencing the Koala Genome
• The Australian Museum, University of New South Wales, and others are sequencing and assembling the Koala Genome using AWS.
• They are running large clusters using Spot instances.
• They have used over 300,000 core hours and can use new processing techniques previously unavailable to them.
Solving the Mysteries of the Universe
• Fermilab is one of the Tier 1 data centers for the CMS experiment.
• Looking for the Higgs Boson to understand mass.
• Launched the HEP Cloud Project in June 2015.
• Recently added 58,000 cores (a 4x increase in Fermilab capacity) to simulate 500 million events over 10 days.
Source: https://aws.amazon.com/blogs/aws/experiment-that-discovered-the-higgs-boson-uses-aws-to-probe-nature/
Seals with Sensors
Dr Peter Blain
IMOS is a national collaborative research infrastructure, supported by the Australian Government. It is led by the University of Tasmania in partnership with the Australian marine & climate science community.
IMOS Observations are Taken by Sensors on Multiple Platforms...
• Argo Floats
• Ships
• Moorings
• Gliders
• Autonomous Vehicles
• Radar
• Animals
• Satellite
… and others
Glider Attacked by Shark
Seals
End Users of IMOS Data
IMOS data is targeted at the scientific community.
Research themes:
• Long-term ocean change.
• Climate variability and weather extremes.
• Boundary currents.
• Continental shelf and coastal processes.
• Ecosystem responses.
Thousands of publications have been based on IMOS data.
IMOS data is also available to government, industry and the general public.
IMOS Portal Step 1 – Select a Data Collection
IMOS Portal Step 2 – Create a Subset
IMOS Portal Step 3 – Download
Backend Systems:
● Ingest Tens of Millions of Fragmented and Highly Heterogeneous Data Files.
● Check each file for Conformance.
● Store and Index each file (spatial and temporal).
● Extract, Transform, Load.
● Publish via Standardised (OGC) Web Services.
● Provide Graphical User Interfaces.
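The "store and index each file (spatial and temporal)" step, together with the portal's subsetting step, can be illustrated with a minimal Python sketch. Everything named here (`FileRecord`, `subset`, the sample keys and coordinates) is hypothetical and not taken from the IMOS systems; a production index would more likely live in a spatial database than in memory.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import List


@dataclass(frozen=True)
class FileRecord:
    """Hypothetical index entry: one data file with its bounding box and time span."""
    key: str
    min_lon: float
    min_lat: float
    max_lon: float
    max_lat: float
    start: datetime
    end: datetime


def subset(index: List[FileRecord], west: float, south: float,
           east: float, north: float, t0: datetime, t1: datetime) -> List[str]:
    """Return keys of files whose bounding box and time span overlap the query."""
    hits = []
    for rec in index:
        overlaps_space = (rec.min_lon <= east and rec.max_lon >= west
                          and rec.min_lat <= north and rec.max_lat >= south)
        overlaps_time = rec.start <= t1 and rec.end >= t0
        if overlaps_space and overlaps_time:
            hits.append(rec.key)
    return hits


# Made-up sample entries, for illustration only.
INDEX = [
    FileRecord("glider/a.nc", 146.0, -44.0, 148.0, -42.0,
               datetime(2015, 1, 1), datetime(2015, 1, 31)),
    FileRecord("mooring/b.nc", 115.0, -33.0, 115.5, -32.5,
               datetime(2015, 6, 1), datetime(2015, 6, 30)),
]
```

A query is then a bounding box plus a time window, for example `subset(INDEX, 145, -45, 149, -41, datetime(2015, 1, 10), datetime(2015, 1, 20))`.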
Architecture Running at AWS
Why We Migrated to AWS
1. Data Durability
2. System Reliability
3. Freedom to Innovate
4. Cost Effectiveness
The Challenges
1. Making the Case
2. Data Migration
3. Modify Applications to Serve Data from S3
4. Redesign Data Ingest Process
[Diagram: THREDDS, S3, Web Services (WMS, WCS, OPeNDAP, etc.)]
IMOS Enhancements to THREDDS
• THREDDS is an Open Source Application that is popular in the Geospatial Community.
• It serves Scientific Datasets through a Variety of Standards Compliant Web Services.
• IMOS Enhancements to Support S3 will be Merged into Core.
The Future
• Continue to Innovate
  • E.g. improve processing of large gridded datasets (like satellite data) using MapReduce.
• Build the Australian Ocean Data Network (AODN)
  • Starting in July the IMOS infrastructure running at AWS will become the foundation of the AODN, which is a collaboration between government agencies including:
    • Royal Australian Navy
    • Geoscience Australia
    • CSIRO
    • Bureau of Meteorology
    • Australian Institute of Marine Science
    • Australian Antarctic Division
Dr Peter Blain
Contact: peter.blain@utas.edu.au
How About the Technology?
CfnCluster is Familiar
• An HPC Head Node
• Shared NFS Storage
• Compute Nodes
• Common Schedulers
  • SGE, Torque, Slurm etc.
• Bootstrap Mechanism
But CfnCluster is Also Different…
• Elastic Compute Nodes
• Amazon S3 Integration
• Scheduler is Integrated with Auto Scaling
Takes 15 minutes to build new clusters.
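To make "Scheduler is Integrated with Auto Scaling" concrete, here is a minimal Python sketch of the kind of decision an elastic cluster has to make: how many compute nodes the pending jobs call for, within configured limits. This is an illustration of the idea only, not CfnCluster's actual logic; the function name and all parameters are hypothetical.

```python
import math


def desired_compute_nodes(pending_slots: int, slots_per_node: int,
                          min_nodes: int, max_nodes: int) -> int:
    """Nodes needed to run all pending job slots, clamped to the allowed range."""
    if slots_per_node <= 0:
        raise ValueError("slots_per_node must be positive")
    # Round up: a partly filled node is still a whole node.
    needed = math.ceil(pending_slots / slots_per_node)
    return max(min_nodes, min(max_nodes, needed))
```

With an empty queue the result falls back to `min_nodes`, which is what lets an idle cluster shrink; `max_nodes` bounds the spend when the queue is very long.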
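Bootstrapping Our Own Software on CfnCluster
#!/bin/bash
# Refresh the package index before installing anything
sudo apt-get -y update
# Python tooling and native libraries installed ahead of landsat-util
sudo apt-get -y install python-pip python-numpy \
    python-scipy libgdal-dev libatlas-base-dev \
    gfortran libfreetype6-dev
# Landsat imagery tools used in the demo
sudo pip install landsat-util
# ImageMagick command-line image tools
sudo apt-get -y install imagemagick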
• CfnCluster supports Chef to configure the master and compute nodes
• Or we can write simple scripts and hook into different lifecycle events
• This is how we bootstrap our demo cluster with Landsat imagery tools – we’ll see this later
AWS Enables New Architectures
Scientific Computing with AWS Lambda
• Approaches to solve common data processing patterns can be simplified, made more scalable, and run at much lower cost.
• AWS Lambda can simplify the compute architecture used for scientific computing as well.
Data Processing with MARVEL
MARVEL is the Mars Australian Remote Virtual Experiment Laboratory.
It processes radar data from the Mars Express satellite.
No servers are used in this solution.
www.marvelstem.org
• S3 event fires on PUT
• Ingest data
• Process the payload using Lambda
• Store secondary data product in S3 for consumption
• Send status to an SNS topic
• Send event to an SNS topic
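The event-driven flow on this slide (S3 PUT event, Lambda processing, product stored in S3, status sent to SNS) could look roughly like the Python sketch below. It is not the MARVEL code: `process_payload`, the `products/` key prefix, the output bucket and the topic ARN are all placeholders. The `s3` and `sns` arguments are expected to behave like boto3 clients (`get_object`, `put_object`, `publish`), and are passed in so the function can be exercised without AWS access.

```python
import json
from urllib.parse import unquote_plus


def process_payload(raw: bytes) -> bytes:
    """Placeholder for the science step (hypothetical); here it only upper-cases bytes."""
    return raw.upper()


def handle_s3_event(event, s3, sns, out_bucket, topic_arn):
    """Process every object named in an S3 event and publish a status message for each."""
    processed = []
    for record in event["Records"]:
        bucket = record["s3"]["bucket"]["name"]
        # Object keys arrive URL-encoded in S3 event notifications.
        key = unquote_plus(record["s3"]["object"]["key"])
        raw = s3.get_object(Bucket=bucket, Key=key)["Body"].read()
        # Store the secondary data product under an assumed "products/" prefix.
        s3.put_object(Bucket=out_bucket, Key="products/" + key,
                      Body=process_payload(raw))
        sns.publish(TopicArn=topic_arn,
                    Message=json.dumps({"key": key, "status": "ok"}))
        processed.append(key)
    return processed
```

Inside Lambda, a thin `lambda_handler(event, context)` would call `handle_s3_event` with `boto3.client("s3")` and `boto3.client("sns")`; keeping the clients as parameters is a design choice for testability, not a requirement of the service.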
Scientific Computing with AWS Lambda
• CRISPR Cas9 allows researchers to edit a genome.
• An important technique to model disease, and cure genetic disorders.
• This is a large search problem, looking for a 20 base string or “guide”.
• Benchling run C++ in Lambda to search multiple genomes at once.
• $60/month vs thousands of dollars using traditional architectures.
http://benchling.engineering/crispr-aws-lambda/
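The search problem described here (finding where a 20-base guide occurs in a genome) can be sketched as a naive scan in Python. This is an assumption-laden illustration, not Benchling's implementation, which the slide says is C++ running in Lambda; the function name and the mismatch budget are hypothetical, and details such as the reverse strand are left out.

```python
def find_guide(genome: str, guide: str, max_mismatches: int = 0):
    """Return start offsets where `guide` matches `genome` within the mismatch budget."""
    n, k = len(genome), len(guide)
    hits = []
    for start in range(n - k + 1):
        mismatches = 0
        for a, b in zip(genome[start:start + k], guide):
            if a != b:
                mismatches += 1
                if mismatches > max_mismatches:
                    break  # too many differences, abandon this offset
        else:
            # Loop finished without exceeding the budget: record a hit.
            hits.append(start)
    return hits
```

Because each offset is checked independently, the genome can be cut into overlapping chunks and every chunk handed to a separate function invocation, which is what makes this kind of search a natural fit for running many Lambda functions in parallel.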
Open Data with AWS
Sharing data on AWS makes it accessible to a large and growing community of researchers who use the AWS cloud.
The Big Data Challenge
It’s typically time-consuming and expensive to acquire, store, and analyse large data sets.
Accessing the full historical archive on demand has been almost impossible.
Our Solution – Shared Open Data on AWS
Public Data Sets on AWS
Several high-value datasets are available for anyone to access for free on AWS.
• Landsat on AWS
• 3K Rice Genome
• NEXRAD on AWS
Demo: landsat-util on CfnCluster
Image source: https://developmentseed.org/blog/2014/08/29/landsat-util/
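For context on what a tool like landsat-util fetches, the Python sketch below builds the public URL of one band of a Landsat 8 scene. It assumes the `landsat-pds` bucket layout (`L8/<path>/<row>/<scene ID>/`) and the 21-character pre-collection scene ID format as documented for Landsat on AWS around the time of this talk; both may have changed since, and the helper names are hypothetical.

```python
def landsat8_scene_prefix(scene_id: str) -> str:
    """Object prefix for a pre-collection Landsat 8 scene ID (assumed layout)."""
    # Scene ID layout: sensor+satellite (3), WRS path (3), WRS row (3),
    # year (4), day of year (3), ground station (3), version (2).
    if len(scene_id) != 21 or not scene_id[3:9].isdigit():
        raise ValueError("expected a 21-character Landsat 8 scene ID")
    path, row = scene_id[3:6], scene_id[6:9]
    return f"L8/{path}/{row}/{scene_id}/"


def band_url(scene_id: str, band: int) -> str:
    """HTTPS URL of one band's GeoTIFF in the assumed public bucket."""
    prefix = landsat8_scene_prefix(scene_id)
    return f"https://landsat-pds.s3.amazonaws.com/{prefix}{scene_id}_B{band}.TIF"
```

For example, `band_url("LC81390452014295LGN00", 4)` refers to band 4 of the scene at WRS path 139, row 045.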
Research Data Egress Waiver
• Why? Researchers strongly need Predictable Budgets
• Who? Available to Degree-granting / Research Institutions in APAC (and elsewhere)
• What? Waives data egress charges from Qualified Accounts, capped at 15% of Total Spend
• How? Contract Addendum Required. Talk to your Account Team.
All qualifying research customers should use this!
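The cap can be read as simple arithmetic. The Python sketch below assumes that "Total Spend" means the account's total bill for the period and that the waiver covers egress charges up to 15% of it; the exact terms are set by the contract addendum, so treat the function and its parameter names as a hypothetical budgeting aid only.

```python
def waived_egress(total_spend: float, egress_charges: float,
                  cap_fraction: float = 0.15) -> float:
    """Egress charges waived, assuming the cap is a fraction of total spend."""
    if total_spend < 0 or egress_charges < 0:
        raise ValueError("amounts must be non-negative")
    return min(egress_charges, cap_fraction * total_spend)
```

Under this reading, an account with a total spend of 10,000 and 1,000 of egress has all of its egress waived, while the same account with 2,000 of egress has 1,500 waived and pays the remaining 500.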
Spike Your Ideas with Research Credits
AWS Cloud Credits for Research supports researchers who:
• Run proofs of concept or benchmark tests
• Contribute results, code, solutions
• Train the broader community
Apply via https://aws.amazon.com/research-credits/
In Review…
• Come and Talk to us About your Research Ideas.
• We Want to Help you Experiment on the AWS Platform.
Test your Ideas at Scale!
AWS Training & Certification
• Intro Videos & Labs: Free videos and labs to help you learn to work with 30+ AWS services – in minutes!
• Training Classes: In-person and online courses to build technical skills – taught by accredited AWS instructors
• Online Labs: Practice working with AWS services in a live environment – learn how related services work together
• AWS Certification: Validate technical skills and expertise – identify qualified IT talent or show you are AWS cloud ready
Learn more: aws.amazon.com/training
Your Training Next Steps:
✓ Visit the AWS Training & Certification pod to discuss your training plan & AWS Summit training offer
✓ Register & attend AWS instructor led training
✓ Get Certified
AWS Certified? Visit the AWS Summit Certification Lounge to pick up your swag
Thank You!

More Related Content

What's hot

February 2016 Webinar Series - 451 Research and AWS
February 2016 Webinar Series - 451 Research and AWSFebruary 2016 Webinar Series - 451 Research and AWS
February 2016 Webinar Series - 451 Research and AWSAmazon Web Services
 
ENT314 Automate Best Practices and Operational Health for Your AWS Resources
ENT314 Automate Best Practices and Operational Health for Your AWS ResourcesENT314 Automate Best Practices and Operational Health for Your AWS Resources
ENT314 Automate Best Practices and Operational Health for Your AWS ResourcesAmazon Web Services
 
Database and Analytics on the AWS Cloud - AWS Innovate Toronto
Database and Analytics on the AWS Cloud - AWS Innovate TorontoDatabase and Analytics on the AWS Cloud - AWS Innovate Toronto
Database and Analytics on the AWS Cloud - AWS Innovate TorontoAmazon Web Services
 
Best Practices Using Big Data on AWS | AWS Public Sector Summit 2017
Best Practices Using Big Data on AWS | AWS Public Sector Summit 2017Best Practices Using Big Data on AWS | AWS Public Sector Summit 2017
Best Practices Using Big Data on AWS | AWS Public Sector Summit 2017Amazon Web Services
 
AWS Innovate Montreal Keynote - by Chris Munns
AWS Innovate Montreal Keynote - by Chris MunnsAWS Innovate Montreal Keynote - by Chris Munns
AWS Innovate Montreal Keynote - by Chris MunnsAmazon Web Services
 
Faster Time to Science - Scaling BioMedical Research in the Cloud with SciOps...
Faster Time to Science - Scaling BioMedical Research in the Cloud with SciOps...Faster Time to Science - Scaling BioMedical Research in the Cloud with SciOps...
Faster Time to Science - Scaling BioMedical Research in the Cloud with SciOps...Amazon Web Services
 
Welcome Keynote - AWS Summit Stockholm
Welcome Keynote - AWS Summit Stockholm Welcome Keynote - AWS Summit Stockholm
Welcome Keynote - AWS Summit Stockholm Amazon Web Services
 
AWS re:Invent 2016: How to Build a Big Data Analytics Data Lake (LFS303)
AWS re:Invent 2016: How to Build a Big Data Analytics Data Lake (LFS303)AWS re:Invent 2016: How to Build a Big Data Analytics Data Lake (LFS303)
AWS re:Invent 2016: How to Build a Big Data Analytics Data Lake (LFS303)Amazon Web Services
 
February 2016 Webinar Series - Introduction to AWS Database Migration Service
February 2016 Webinar Series - Introduction to AWS Database Migration ServiceFebruary 2016 Webinar Series - Introduction to AWS Database Migration Service
February 2016 Webinar Series - Introduction to AWS Database Migration ServiceAmazon Web Services
 
SAP Workloads on the AWS Cloud - AWS Innovate Toronto
SAP Workloads on the AWS Cloud - AWS Innovate TorontoSAP Workloads on the AWS Cloud - AWS Innovate Toronto
SAP Workloads on the AWS Cloud - AWS Innovate TorontoAmazon Web Services
 
Optimizing Data Management Using AWS Storage and Data Migration Products | AW...
Optimizing Data Management Using AWS Storage and Data Migration Products | AW...Optimizing Data Management Using AWS Storage and Data Migration Products | AW...
Optimizing Data Management Using AWS Storage and Data Migration Products | AW...Amazon Web Services
 
Big Data Architectural Patterns and Best Practices on AWS
Big Data Architectural Patterns and Best Practices on AWSBig Data Architectural Patterns and Best Practices on AWS
Big Data Architectural Patterns and Best Practices on AWSAmazon Web Services
 
The State of Serverless Computing | AWS Public Sector Summit 2017
The State of Serverless Computing | AWS Public Sector Summit 2017The State of Serverless Computing | AWS Public Sector Summit 2017
The State of Serverless Computing | AWS Public Sector Summit 2017Amazon Web Services
 
AWS Compute Overview: Servers, Containers, Serverless, and Batch | AWS Public...
AWS Compute Overview: Servers, Containers, Serverless, and Batch | AWS Public...AWS Compute Overview: Servers, Containers, Serverless, and Batch | AWS Public...
AWS Compute Overview: Servers, Containers, Serverless, and Batch | AWS Public...Amazon Web Services
 
Fast Track to Your Data Lake on AWS
Fast Track to Your Data Lake on AWSFast Track to Your Data Lake on AWS
Fast Track to Your Data Lake on AWSAmazon Web Services
 
Big Data Architectural Patterns and Best Practices
Big Data Architectural Patterns and Best PracticesBig Data Architectural Patterns and Best Practices
Big Data Architectural Patterns and Best PracticesAmazon Web Services
 
Deep Dive on AWS Cloud Data Migration Services
Deep Dive on AWS Cloud Data Migration ServicesDeep Dive on AWS Cloud Data Migration Services
Deep Dive on AWS Cloud Data Migration ServicesAmazon Web Services
 

What's hot (20)

February 2016 Webinar Series - 451 Research and AWS
February 2016 Webinar Series - 451 Research and AWSFebruary 2016 Webinar Series - 451 Research and AWS
February 2016 Webinar Series - 451 Research and AWS
 
ENT314 Automate Best Practices and Operational Health for Your AWS Resources
ENT314 Automate Best Practices and Operational Health for Your AWS ResourcesENT314 Automate Best Practices and Operational Health for Your AWS Resources
ENT314 Automate Best Practices and Operational Health for Your AWS Resources
 
Database and Analytics on the AWS Cloud - AWS Innovate Toronto
Database and Analytics on the AWS Cloud - AWS Innovate TorontoDatabase and Analytics on the AWS Cloud - AWS Innovate Toronto
Database and Analytics on the AWS Cloud - AWS Innovate Toronto
 
Best Practices Using Big Data on AWS | AWS Public Sector Summit 2017
Best Practices Using Big Data on AWS | AWS Public Sector Summit 2017Best Practices Using Big Data on AWS | AWS Public Sector Summit 2017
Best Practices Using Big Data on AWS | AWS Public Sector Summit 2017
 
AWS Innovate Montreal Keynote - by Chris Munns
AWS Innovate Montreal Keynote - by Chris MunnsAWS Innovate Montreal Keynote - by Chris Munns
AWS Innovate Montreal Keynote - by Chris Munns
 
Faster Time to Science - Scaling BioMedical Research in the Cloud with SciOps...
Faster Time to Science - Scaling BioMedical Research in the Cloud with SciOps...Faster Time to Science - Scaling BioMedical Research in the Cloud with SciOps...
Faster Time to Science - Scaling BioMedical Research in the Cloud with SciOps...
 
AWS Storage and Data Migration
AWS Storage and Data MigrationAWS Storage and Data Migration
AWS Storage and Data Migration
 
Welcome Keynote - AWS Summit Stockholm
Welcome Keynote - AWS Summit Stockholm Welcome Keynote - AWS Summit Stockholm
Welcome Keynote - AWS Summit Stockholm
 
AWS Big Data Solution Days
AWS Big Data Solution DaysAWS Big Data Solution Days
AWS Big Data Solution Days
 
AWS re:Invent 2016: How to Build a Big Data Analytics Data Lake (LFS303)
AWS re:Invent 2016: How to Build a Big Data Analytics Data Lake (LFS303)AWS re:Invent 2016: How to Build a Big Data Analytics Data Lake (LFS303)
AWS re:Invent 2016: How to Build a Big Data Analytics Data Lake (LFS303)
 
February 2016 Webinar Series - Introduction to AWS Database Migration Service
February 2016 Webinar Series - Introduction to AWS Database Migration ServiceFebruary 2016 Webinar Series - Introduction to AWS Database Migration Service
February 2016 Webinar Series - Introduction to AWS Database Migration Service
 
SAP Workloads on the AWS Cloud - AWS Innovate Toronto
SAP Workloads on the AWS Cloud - AWS Innovate TorontoSAP Workloads on the AWS Cloud - AWS Innovate Toronto
SAP Workloads on the AWS Cloud - AWS Innovate Toronto
 
Optimizing Data Management Using AWS Storage and Data Migration Products | AW...
Optimizing Data Management Using AWS Storage and Data Migration Products | AW...Optimizing Data Management Using AWS Storage and Data Migration Products | AW...
Optimizing Data Management Using AWS Storage and Data Migration Products | AW...
 
Big Data Architectural Patterns and Best Practices on AWS
Big Data Architectural Patterns and Best Practices on AWSBig Data Architectural Patterns and Best Practices on AWS
Big Data Architectural Patterns and Best Practices on AWS
 
The State of Serverless Computing | AWS Public Sector Summit 2017
The State of Serverless Computing | AWS Public Sector Summit 2017The State of Serverless Computing | AWS Public Sector Summit 2017
The State of Serverless Computing | AWS Public Sector Summit 2017
 
AWS Compute Overview: Servers, Containers, Serverless, and Batch | AWS Public...
AWS Compute Overview: Servers, Containers, Serverless, and Batch | AWS Public...AWS Compute Overview: Servers, Containers, Serverless, and Batch | AWS Public...
AWS Compute Overview: Servers, Containers, Serverless, and Batch | AWS Public...
 
Fast Track to Your Data Lake on AWS
Fast Track to Your Data Lake on AWSFast Track to Your Data Lake on AWS
Fast Track to Your Data Lake on AWS
 
Securing Your Big Data on AWS
Securing Your Big Data on AWSSecuring Your Big Data on AWS
Securing Your Big Data on AWS
 
Big Data Architectural Patterns and Best Practices
Big Data Architectural Patterns and Best PracticesBig Data Architectural Patterns and Best Practices
Big Data Architectural Patterns and Best Practices
 
Deep Dive on AWS Cloud Data Migration Services
Deep Dive on AWS Cloud Data Migration ServicesDeep Dive on AWS Cloud Data Migration Services
Deep Dive on AWS Cloud Data Migration Services
 

Viewers also liked

WE2.L10.1: LANDSAT DATA PRODUCTS, FREE AND CLEAR
WE2.L10.1: LANDSAT DATA PRODUCTS, FREE AND CLEARWE2.L10.1: LANDSAT DATA PRODUCTS, FREE AND CLEAR
WE2.L10.1: LANDSAT DATA PRODUCTS, FREE AND CLEARgrssieee
 
NYAI - Understanding Music Through Machine Learning by Brian McFee
NYAI - Understanding Music Through Machine Learning by Brian McFeeNYAI - Understanding Music Through Machine Learning by Brian McFee
NYAI - Understanding Music Through Machine Learning by Brian McFeeRizwan Habib
 
Data Processing without Servers | AWS Public Sector Summit 2016
Data Processing without Servers | AWS Public Sector Summit 2016Data Processing without Servers | AWS Public Sector Summit 2016
Data Processing without Servers | AWS Public Sector Summit 2016Amazon Web Services
 
Unlocking Open Data in the Cloud
Unlocking Open Data in the CloudUnlocking Open Data in the Cloud
Unlocking Open Data in the CloudAmazon Web Services
 
AWS re:Invent 2016: Earth on AWS—Next-Generation Open Data Platforms (STG203)
AWS re:Invent 2016: Earth on AWS—Next-Generation Open Data Platforms (STG203)AWS re:Invent 2016: Earth on AWS—Next-Generation Open Data Platforms (STG203)
AWS re:Invent 2016: Earth on AWS—Next-Generation Open Data Platforms (STG203)Amazon Web Services
 
Big Open Data Transformation Through Public Data Sets - AWS Washington D.C. S...
Big Open Data Transformation Through Public Data Sets - AWS Washington D.C. S...Big Open Data Transformation Through Public Data Sets - AWS Washington D.C. S...
Big Open Data Transformation Through Public Data Sets - AWS Washington D.C. S...Amazon Web Services
 
The NOAA Big Data Project: Public-Private Partnerships at Scale
The NOAA Big Data Project: Public-Private Partnerships at ScaleThe NOAA Big Data Project: Public-Private Partnerships at Scale
The NOAA Big Data Project: Public-Private Partnerships at ScaleAmazon Web Services
 

Viewers also liked (11)

Jz 201 t
Jz 201 tJz 201 t
Jz 201 t
 
WE2.L10.1: LANDSAT DATA PRODUCTS, FREE AND CLEAR
WE2.L10.1: LANDSAT DATA PRODUCTS, FREE AND CLEARWE2.L10.1: LANDSAT DATA PRODUCTS, FREE AND CLEAR
WE2.L10.1: LANDSAT DATA PRODUCTS, FREE AND CLEAR
 
Earth Observation in the Cloud
Earth Observation in the CloudEarth Observation in the Cloud
Earth Observation in the Cloud
 
Open GIS Data
Open GIS DataOpen GIS Data
Open GIS Data
 
NYAI - Understanding Music Through Machine Learning by Brian McFee
NYAI - Understanding Music Through Machine Learning by Brian McFeeNYAI - Understanding Music Through Machine Learning by Brian McFee
NYAI - Understanding Music Through Machine Learning by Brian McFee
 
Data Processing without Servers | AWS Public Sector Summit 2016
Data Processing without Servers | AWS Public Sector Summit 2016Data Processing without Servers | AWS Public Sector Summit 2016
Data Processing without Servers | AWS Public Sector Summit 2016
 
Unlocking Open Data in the Cloud
Unlocking Open Data in the CloudUnlocking Open Data in the Cloud
Unlocking Open Data in the Cloud
 
AWS re:Invent 2016: Earth on AWS—Next-Generation Open Data Platforms (STG203)
AWS re:Invent 2016: Earth on AWS—Next-Generation Open Data Platforms (STG203)AWS re:Invent 2016: Earth on AWS—Next-Generation Open Data Platforms (STG203)
AWS re:Invent 2016: Earth on AWS—Next-Generation Open Data Platforms (STG203)
 
Real-Time Event Processing
Real-Time Event ProcessingReal-Time Event Processing
Real-Time Event Processing
 
Big Open Data Transformation Through Public Data Sets - AWS Washington D.C. S...
Big Open Data Transformation Through Public Data Sets - AWS Washington D.C. S...Big Open Data Transformation Through Public Data Sets - AWS Washington D.C. S...
Big Open Data Transformation Through Public Data Sets - AWS Washington D.C. S...
 
The NOAA Big Data Project: Public-Private Partnerships at Scale
The NOAA Big Data Project: Public-Private Partnerships at ScaleThe NOAA Big Data Project: Public-Private Partnerships at Scale
The NOAA Big Data Project: Public-Private Partnerships at Scale
 

Similar to Scaling Ideas: Accelerating Research with AWS - Technical 301

Accelerating Time to Science: Transforming Research in the Cloud
Accelerating Time to Science: Transforming Research in the CloudAccelerating Time to Science: Transforming Research in the Cloud
Accelerating Time to Science: Transforming Research in the CloudJamie Kinney
 
Time to Science/Time to Results: Transforming Research in the Cloud
Time to Science/Time to Results: Transforming Research in the CloudTime to Science/Time to Results: Transforming Research in the Cloud
Time to Science/Time to Results: Transforming Research in the CloudAmazon Web Services
 
Public Sector Case Studies - AWS Summit 2012 - NYC
Public Sector Case Studies - AWS Summit 2012 - NYCPublic Sector Case Studies - AWS Summit 2012 - NYC
Public Sector Case Studies - AWS Summit 2012 - NYCAmazon Web Services
 
Amazon Cloud Resources as Part of Scientific Workflows & HPC - Kevin Jorissen
Amazon Cloud Resources as Part of Scientific Workflows & HPC - Kevin JorissenAmazon Cloud Resources as Part of Scientific Workflows & HPC - Kevin Jorissen
Amazon Cloud Resources as Part of Scientific Workflows & HPC - Kevin JorissenLab San Isidro
 
HPC Clusters in the (almost) Infinite Cloud
HPC Clusters in the (almost) Infinite CloudHPC Clusters in the (almost) Infinite Cloud
HPC Clusters in the (almost) Infinite CloudAmazon Web Services
 
Using the Open Science Data Cloud for Data Science Research
Using the Open Science Data Cloud for Data Science ResearchUsing the Open Science Data Cloud for Data Science Research
Using the Open Science Data Cloud for Data Science ResearchRobert Grossman
 
Transforming Research in Collaboration with Funding Agencies
Transforming Research in Collaboration with Funding AgenciesTransforming Research in Collaboration with Funding Agencies
Transforming Research in Collaboration with Funding AgenciesAmazon Web Services
 
What is a Data Commons and Why Should You Care?
What is a Data Commons and Why Should You Care? What is a Data Commons and Why Should You Care?
What is a Data Commons and Why Should You Care? Robert Grossman
 
#EarthOnAWS: How the Cloud Is Transforming Earth Observation | AWS Public Sec...
#EarthOnAWS: How the Cloud Is Transforming Earth Observation | AWS Public Sec...#EarthOnAWS: How the Cloud Is Transforming Earth Observation | AWS Public Sec...
#EarthOnAWS: How the Cloud Is Transforming Earth Observation | AWS Public Sec...Amazon Web Services
 
Cloud Native, Cloud First and Hybrid: How Different Organizations are Approac...
Cloud Native, Cloud First and Hybrid: How Different Organizations are Approac...Cloud Native, Cloud First and Hybrid: How Different Organizations are Approac...
Cloud Native, Cloud First and Hybrid: How Different Organizations are Approac...Amazon Web Services
 
Overview of AWS Services for Data Storage and Migration - SRV205 - Anaheim AW...
Overview of AWS Services for Data Storage and Migration - SRV205 - Anaheim AW...Overview of AWS Services for Data Storage and Migration - SRV205 - Anaheim AW...
Overview of AWS Services for Data Storage and Migration - SRV205 - Anaheim AW...Amazon Web Services
 
교육의 진화, 클라우드는 어떤 역할을 하는가 :: Vincent Quah :: AWS Summit Seoul 2016
교육의 진화, 클라우드는 어떤 역할을 하는가 :: Vincent Quah :: AWS Summit Seoul 2016교육의 진화, 클라우드는 어떤 역할을 하는가 :: Vincent Quah :: AWS Summit Seoul 2016
교육의 진화, 클라우드는 어떤 역할을 하는가 :: Vincent Quah :: AWS Summit Seoul 2016Amazon Web Services Korea
 
Accelerating Discovery via Science Services
Accelerating Discovery via Science ServicesAccelerating Discovery via Science Services
Accelerating Discovery via Science ServicesIan Foster
 
Aktuelle Referenzen im Life Sciences Umfeld
Aktuelle Referenzen im Life Sciences UmfeldAktuelle Referenzen im Life Sciences Umfeld
Aktuelle Referenzen im Life Sciences UmfeldAWS Germany
 
Accelerating Science with Cloud Technologies in the ABoVE Science Cloud
Accelerating Science with Cloud Technologies in the ABoVE Science CloudAccelerating Science with Cloud Technologies in the ABoVE Science Cloud
Accelerating Science with Cloud Technologies in the ABoVE Science CloudGlobus
 
(BDT201) Big Data and HPC State of the Union | AWS re:Invent 2014
(BDT201) Big Data and HPC State of the Union | AWS re:Invent 2014(BDT201) Big Data and HPC State of the Union | AWS re:Invent 2014
(BDT201) Big Data and HPC State of the Union | AWS re:Invent 2014Amazon Web Services
 
Transitioning Geoscience Research to the Cloud: Opportunities and Challenges
Transitioning Geoscience Research to the Cloud: Opportunities and ChallengesTransitioning Geoscience Research to the Cloud: Opportunities and Challenges
Transitioning Geoscience Research to the Cloud: Opportunities and ChallengesAmazon Web Services
 
Science Services and Science Platforms: Using the Cloud to Accelerate and Dem...
Science Services and Science Platforms: Using the Cloud to Accelerate and Dem...Science Services and Science Platforms: Using the Cloud to Accelerate and Dem...
Science Services and Science Platforms: Using the Cloud to Accelerate and Dem...Ian Foster
 
re:Invent 2013-foster-madduri
re:Invent 2013-foster-maddurire:Invent 2013-foster-madduri
re:Invent 2013-foster-madduriRavi Madduri
 

Similar to Scaling Ideas: Accelerating Research with AWS - Technical 301 (20)

Accelerating Time to Science: Transforming Research in the Cloud
Accelerating Time to Science: Transforming Research in the CloudAccelerating Time to Science: Transforming Research in the Cloud
Accelerating Time to Science: Transforming Research in the Cloud
 
Time to Science/Time to Results: Transforming Research in the Cloud
Time to Science/Time to Results: Transforming Research in the CloudTime to Science/Time to Results: Transforming Research in the Cloud
Time to Science/Time to Results: Transforming Research in the Cloud
 
Public Sector Case Studies - AWS Summit 2012 - NYC
Public Sector Case Studies - AWS Summit 2012 - NYCPublic Sector Case Studies - AWS Summit 2012 - NYC
Public Sector Case Studies - AWS Summit 2012 - NYC
 
Amazon Cloud Resources as Part of Scientific Workflows & HPC - Kevin Jorissen
Amazon Cloud Resources as Part of Scientific Workflows & HPC - Kevin JorissenAmazon Cloud Resources as Part of Scientific Workflows & HPC - Kevin Jorissen
Amazon Cloud Resources as Part of Scientific Workflows & HPC - Kevin Jorissen
 
HPC Clusters in the (almost) Infinite Cloud
HPC Clusters in the (almost) Infinite CloudHPC Clusters in the (almost) Infinite Cloud
HPC Clusters in the (almost) Infinite Cloud
 
Using the Open Science Data Cloud for Data Science Research
Using the Open Science Data Cloud for Data Science ResearchUsing the Open Science Data Cloud for Data Science Research
Using the Open Science Data Cloud for Data Science Research
 
Transforming Research in Collaboration with Funding Agencies





Scaling Ideas: Accelerating Research with AWS - Technical 301

  • 1. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Adrian White, Head of Scientific Computing, APAC, Amazon Web Services. Dr Peter Blain, Integrated Marine Observing System, University of Tasmania. Scaling Ideas: Accelerating Research with AWS. Technical 301
  • 2. “It is the tension between creativity and skepticism that has produced the stunning and unexpected findings of science.” Carl Sagan (1934-1996)
  • 3. Why AWS for Research? Time to Science: Access research infrastructure in minutes. Low Cost: Pay-as-you-go pricing. Elastic: Easily add or remove capacity. Globally Accessible: Easily collaborate with researchers around the world. Secure: A collection of tools to protect data and privacy. Scalable: Access to effectively limitless capacity.
  • 4. Perhaps the Top Three Reasons? Time to Science, Scalable, Elastic
  • 5. Researchers Are Using AWS For… Life Sciences & Genomics; Space Research & Astronomy; High Energy Physics; Open Data, e.g. Satellite Imagery; HPC & Grid computing
  • 6. Researchers Are Using AWS For… Life Sciences & Genomics; Space Research & Astronomy; High Energy Physics; Open Data, e.g. Satellite Imagery; HPC & Grid computing; and much more…
  • 7. Walter and Eliza Hall Institute of Medical Research. DiUS & WEHI have built two medical research solutions: • Building an auto-scaling R cluster using CfnCluster • Scientific image processing in the cloud with Fiji/ImageJ. This provides on-demand, templated research services to do science.
  • 8. Sequencing the Koala Genome • The Australian Museum, University of New South Wales, and others are sequencing and assembling the Koala Genome using AWS • They are running large clusters using Spot instances • They have used over 300,000 core hours and can use new processing techniques previously unavailable to them.
  • 9. Solving the Mysteries of the Universe • Fermilab is one of the Tier 1 data centers for the CMS experiment • Looking for the Higgs Boson to understand mass • Launched the HEP Cloud Project in June 2015 • Recently added 58,000 cores (a 4x increase in Fermilab capacity) to simulate 500 million events over 10 days. Source: https://aws.amazon.com/blogs/aws/experiment-that-discovered-the-higgs-boson-uses-aws-to-probe-nature/
  • 10. Seals with Sensors. Dr Peter Blain. IMOS is a national collaborative research infrastructure, supported by the Australian Government. It is led by the University of Tasmania in partnership with the Australian marine & climate science community.
  • 11. IMOS Observations are Taken by Sensors on Multiple Platforms... Argo Floats, Ships, Moorings, Gliders, Autonomous Vehicles, Radar, Animals, Satellite … and others
  • 13. Seals
  • 14. End Users of IMOS Data. IMOS data is targeted at the scientific community. Research themes: • Long-term ocean change. • Climate variability and weather extremes. • Boundary currents. • Continental shelf and coastal processes. • Ecosystem responses. Thousands of publications have been based on IMOS data. IMOS data is also available to government, industry and the general public.
  • 15. IMOS Portal Step 1 – Select a Data Collection
  • 16. IMOS Portal Step 2 – Create a Subset
  • 17. IMOS Portal Step 3 – Download
  • 18. Backend Systems: ● Ingest Tens of Millions of Fragmented and Highly Heterogeneous Data Files. ● Check each File for Conformance. ● Store and Index each File (spatial and temporal). ● Extract, Transform, Load. ● Publish via Standardised (OGC) Web Services. ● Provide Graphical User Interfaces.
  • 20. Why We Migrated to AWS: 1. Data Durability 2. System Reliability 3. Freedom to Innovate 4. Cost Effectiveness
  • 21. The Challenges: 1. Making the Case 2. Data Migration 3. Modify Applications to Serve Data from S3 4. Redesign Data Ingest Process
  • 22. IMOS Enhancements to THREDDS. Diagram labels: THREDDS; S3; Web Services (WMS, WCS, OPeNDAP, etc.) • THREDDS is an Open Source Application that is popular in the Geospatial Community. • It serves Scientific Datasets through a Variety of Standards Compliant Web Services. • IMOS Enhancements to Support S3 will be Merged into Core.
  • 23. The Future • Continue to Innovate • E.g. improve processing of large gridded datasets (like satellite data) using MapReduce. • Build the Australian Ocean Data Network (AODN) • Starting in July, the IMOS infrastructure running at AWS will become the foundation of the AODN, which is a collaboration between government agencies including: • Royal Australian Navy • Geoscience Australia • CSIRO • Bureau of Meteorology • Australian Institute of Marine Science • Australian Antarctic Division
  • 24. Dr Peter Blain. Contact: peter.blain@utas.edu.au. IMOS is a national collaborative research infrastructure, supported by the Australian Government. It is led by the University of Tasmania in partnership with the Australian marine & climate science community.
  • 25. How About the Technology?
  • 26. CfnCluster is Familiar • An HPC Head Node • Shared NFS Storage • Compute Nodes • Common Schedulers (SGE, Torque, Slurm, etc.) • Bootstrap Mechanism
  • 27. But CfnCluster is Also Different… • Elastic Compute Nodes • Amazon S3 Integration • Scheduler is Integrated with Auto Scaling. Takes 15 minutes to build new clusters.
  • 28. Bootstrapping Our Own Software on CfnCluster • CfnCluster supports Chef to configure the master and compute nodes • Or we can write simple scripts and hook into different lifecycle events • This is how we bootstrap our demo cluster with Landsat imagery tools – we’ll see this later. Bootstrap script:
      #!/bin/bash
      sudo apt-get -y update
      sudo apt-get -y install python-pip python-numpy python-scipy libgdal-dev libatlas-base-dev gfortran libfreetype6-dev
      sudo pip install landsat-util
      sudo apt-get -y install imagemagick
  • 29. AWS Enables New Architectures
  • 30. Scientific Computing with AWS Lambda • Common data processing patterns can be made simpler, more scalable, and much cheaper to run • AWS Lambda can simplify the compute architecture used for scientific computing as well.
  • 31. Data Processing with MARVEL. MARVEL is the Mars Australian Remote Virtual Experiment Laboratory. It processes radar data from the Mars Express satellite. No servers are used in this solution. www.marvelstem.org. Diagram labels: S3 event fires on PUT; Ingest data; Process the payload using Lambda; Store secondary data product in S3 for consumption; Send status to an SNS topic; Send event to an SNS topic
  • 32. Scientific Computing with AWS Lambda. CRISPR Cas9 allows researchers to edit a genome. It is an important technique for modelling disease and curing genetic disorders. This is a large search problem, looking for a 20-base string or “guide”. Benchling runs C++ in Lambda to search multiple genomes at once. $60/month vs thousands of dollars using traditional architectures. http://benchling.engineering/crispr-aws-lambda/
  • 33. Open Data with AWS. The Big Data Challenge: It’s typically time-consuming and expensive to acquire, store, and analyse large data sets. Accessing the full historical archive on demand has been almost impossible. Our Solution – Shared Open Data on AWS: Sharing data on AWS makes it accessible to a large and growing community of researchers who use the AWS cloud.
  • 34. Public Data Sets on AWS. Several high-value datasets are available for anyone to access for free on AWS: Landsat on AWS, 3K Rice Genome, NEXRAD on AWS
  • 35. Demo: Landsat-util on CfnCluster. Image source: https://developmentseed.org/blog/2014/08/29/landsat-util/
  • 36. Research Data Egress Waiver. Why? Researchers strongly need predictable budgets. Who? Available to degree-granting / research institutions in APAC (and elsewhere). What? Waives data egress charges from qualified accounts, capped at 15% of total spend. How? Contract addendum required. Talk to your account team. All qualifying research customers should use this!
  • 37. Spike Your Ideas with Research Credits. AWS Cloud Credits for Research Supports Researchers: • Proof of concepts or benchmark tests • Contribute results, code, solutions • Train the broader community. Apply via https://aws.amazon.com/research-credits/
  • 38. In Review… • Come and Talk to us About your Research Ideas. • We Want to Help you Experiment on the AWS Platform. Test your Ideas at Scale!
  • 39. AWS Training & Certification. Intro Videos & Labs: Free videos and labs to help you learn to work with 30+ AWS services – in minutes! Training Classes: In-person and online courses to build technical skills – taught by accredited AWS instructors. Online Labs: Practice working with AWS services in a live environment – learn how related services work together. AWS Certification: Validate technical skills and expertise – identify qualified IT talent or show you are AWS cloud ready. Learn more: aws.amazon.com/training
  • 40. Your Training Next Steps: ✓ Visit the AWS Training & Certification pod to discuss your training plan & AWS Summit training offer ✓ Register & attend AWS instructor led training ✓ Get Certified. AWS Certified? Visit the AWS Summit Certification Lounge to pick up your swag. Learn more: aws.amazon.com/training
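
The serverless pattern on slide 31 (an S3 PUT event triggers Lambda, which writes a secondary data product back to S3 and sends a status message to SNS) could be sketched roughly as follows. This is a minimal illustration assuming Python and boto3, not MARVEL's actual code, which the talk does not show. The bucket name, topic ARN, output key layout, and the process_payload step are all hypothetical placeholders.

```python
import json
from urllib.parse import unquote_plus

# Hypothetical resource names, chosen for illustration only.
OUTPUT_BUCKET = "example-secondary-products"
STATUS_TOPIC_ARN = "arn:aws:sns:ap-southeast-2:123456789012:example-status"


def process_payload(raw: bytes) -> bytes:
    """Placeholder for the science step; here it only reports the input size."""
    return json.dumps({"input_bytes": len(raw)}).encode("utf-8")


def handle_event(event, s3, sns):
    """Process each S3 record in the event and publish one status message per object."""
    written = []
    for record in event.get("Records", []):
        bucket = record["s3"]["bucket"]["name"]
        # Object keys arrive URL-encoded in S3 event notifications.
        key = unquote_plus(record["s3"]["object"]["key"])
        raw = s3.get_object(Bucket=bucket, Key=key)["Body"].read()
        out_key = "processed/" + key + ".json"
        s3.put_object(Bucket=OUTPUT_BUCKET, Key=out_key, Body=process_payload(raw))
        sns.publish(
            TopicArn=STATUS_TOPIC_ARN,
            Message=json.dumps({"source": key, "product": out_key, "status": "ok"}),
        )
        written.append(out_key)
    return written


def lambda_handler(event, context):
    """Lambda entry point; boto3 is imported here so the logic above can be tested offline."""
    import boto3
    return handle_event(event, boto3.client("s3"), boto3.client("sns"))
```

Passing the S3 and SNS clients into handle_event is a design choice made for this sketch: it lets the logic run locally against stand-in clients before anything is deployed.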