1
Scaling Galaxy on GCP
LynnLangit
Cloud and Data Architect
Google Developer Cloud Expert, AWS Community Hero, Microsoft Data Platform MVP
2
Agenda
• Scaling Up
• Virtual Machines
• Hello Galaxy
• Adding Tools to Galaxy
• Genomic Data on GCP
• Scaling Out
• Docker Containers
• Google Persistent Disks
• Pipelines
• Google Genomics APIs
• BigQuery
Galaxy on Google Cloud Platform
3
Google Cloud in Australia
Data center here in 2017
4
Galaxy on GCP – Scale Up
5
Demo 1 - Hello Galaxy on Google Cloud
6
Demo 2 - Adding Galaxy Tools
7
GCP Virtual Machine Services
• Cloud Storage (file) buckets
• Source data
• Compute Engine Virtual Machines
• Virtual Machine Image files
• External VM persistent hard disks with your source data
Key Concepts:
-- VM configuration as code
-- Fast, cheap, scalable VMs
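One way to read "VM configuration as code" is to keep a VM's settings in a small data structure and generate the `gcloud` command from it. The sketch below is illustrative only: the instance name, zone, machine type, image, and disk size are hypothetical values, and the deck does not say which settings its demos used.

```python
import shlex
from dataclasses import dataclass


@dataclass(frozen=True)
class VmConfig:
    """Settings for one Compute Engine VM (all values used below are hypothetical)."""
    name: str
    zone: str
    machine_type: str
    image_family: str
    image_project: str
    boot_disk_size_gb: int


def create_command(cfg: VmConfig) -> list[str]:
    """Build the argument list for `gcloud compute instances create`."""
    return [
        "gcloud", "compute", "instances", "create", cfg.name,
        f"--zone={cfg.zone}",
        f"--machine-type={cfg.machine_type}",
        f"--image-family={cfg.image_family}",
        f"--image-project={cfg.image_project}",
        f"--boot-disk-size={cfg.boot_disk_size_gb}GB",
    ]


# Hypothetical configuration, chosen only to show the generated command.
galaxy_vm = VmConfig(
    name="galaxy-demo",
    zone="australia-southeast1-a",
    machine_type="n1-standard-4",
    image_family="ubuntu-2204-lts",
    image_project="ubuntu-os-cloud",
    boot_disk_size_gb=100,
)

# Print the command instead of running it, so the sketch has no side effects.
print(shlex.join(create_command(galaxy_vm)))
```

Keeping the settings in a versioned file like this makes it possible to review and repeat a VM build rather than clicking through the console each time.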
8
Scale Up Patterns
• Resize Virtual Machines
• Attach more persistent disks
• Update base image
• Monitor with Stackdriver
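The first two patterns above can be sketched as `gcloud` commands generated from Python. This is a minimal sketch, not the deck's demo: the instance, zone, machine type, and disk names are hypothetical. It relies on the fact that a Compute Engine VM has to be stopped before its machine type can be changed.

```python
import shlex


def resize_commands(instance: str, zone: str, new_type: str) -> list[list[str]]:
    """Commands to change a VM's machine type: stop, resize, start."""
    base = ["gcloud", "compute", "instances"]
    zone_flag = f"--zone={zone}"
    return [
        base + ["stop", instance, zone_flag],
        base + ["set-machine-type", instance, zone_flag, f"--machine-type={new_type}"],
        base + ["start", instance, zone_flag],
    ]


def attach_disk_command(instance: str, zone: str, disk: str) -> list[str]:
    """Command to attach an existing persistent disk to a VM."""
    return ["gcloud", "compute", "instances", "attach-disk", instance,
            f"--disk={disk}", f"--zone={zone}"]


# Hypothetical names, used only to show the generated commands.
steps = resize_commands("galaxy-demo", "australia-southeast1-a", "n1-highmem-8")
steps.append(attach_disk_command("galaxy-demo", "australia-southeast1-a", "galaxy-data-2"))
for step in steps:
    print(shlex.join(step))
```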
9
10
Genomic Data
• Files at GCS
• gs://genomics-public-data
• Query via BigQuery
• https://bigquery.cloud.google.com/queries/genomics-public-data
• Code via Genomics API
• Implements Global Alliance for Genomics and Health APIs
• Genome browser - https://gabrowse.appspot.com
• Google Genomics example code on GitHub
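Files in Cloud Storage are addressed as `gs://<bucket>/<object>`. As a small illustration, the sketch below splits such a path into its bucket and object parts using only the Python standard library. The bucket name is the one on the slide; the object path is a made-up placeholder, not a real file in that bucket.

```python
from urllib.parse import urlparse


def split_gcs_uri(uri: str) -> tuple[str, str]:
    """Split gs://bucket/object into (bucket, object_name)."""
    parsed = urlparse(uri)
    if parsed.scheme != "gs" or not parsed.netloc:
        raise ValueError(f"not a Cloud Storage URI: {uri!r}")
    return parsed.netloc, parsed.path.lstrip("/")


# The bucket name comes from the slide; the object path is hypothetical.
bucket, obj = split_gcs_uri("gs://genomics-public-data/example-folder/example.vcf")
print(bucket)
print(obj)
```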
11
12
Galaxy on GCP – Scale Out
13
GCP Docker Container Services
• Cloud Storage
• Container Engine / Docker
Key Concepts:
-- Container configuration as code
-- Fast, cheap, scalable Docker containers
14
15
Scale Out Patterns
• Docker Container Cluster
• Kubernetes manager
• Container orchestration
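To make "container orchestration" concrete, the sketch below builds a Kubernetes Deployment manifest as a Python dictionary and prints it as JSON, which `kubectl apply -f` accepts alongside YAML. The image name `bgruening/galaxy-stable` and container port 80 are assumptions based on the docker-galaxy-stable project linked in the Editor's Notes; the deck does not show its own manifest. Whether Galaxy can actually run as several identical replicas depends on shared storage and database setup that this sketch ignores.

```python
import json


def galaxy_deployment(replicas: int, image: str = "bgruening/galaxy-stable") -> dict:
    """Return a minimal apps/v1 Deployment manifest (image and port are assumptions)."""
    labels = {"app": "galaxy"}
    return {
        "apiVersion": "apps/v1",
        "kind": "Deployment",
        "metadata": {"name": "galaxy", "labels": labels},
        "spec": {
            "replicas": replicas,
            # The selector must match the pod template labels below.
            "selector": {"matchLabels": labels},
            "template": {
                "metadata": {"labels": labels},
                "spec": {
                    "containers": [
                        {
                            "name": "galaxy",
                            "image": image,
                            "ports": [{"containerPort": 80}],
                        }
                    ]
                },
            },
        },
    }


manifest = galaxy_deployment(replicas=3)
print(json.dumps(manifest, indent=2))
```

Scaling out then becomes a one-field change (`replicas`) that the cluster manager reconciles, rather than a manual VM resize.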
16
GCP Serverless Services
• Cloud Functions
• Microservices
Key Concepts:
-- Function configuration as code
-- Fast, cheap, scalable microservices
17
Galaxy on GCP – Advanced Pipelines
18
Demo 3 – Using the Google Genomics API & BigQuery
19
BigQuery
• ANSI SQL Queries
• Query-as-a-service
Key Concepts:
-- SQL query configuration as code
-- Fast, cheap, scalable SQL queries
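As an illustration of "SQL query configuration as code", the sketch below keeps a query in a Python function so it can be versioned and parameterized. The table ID `genomics-public-data.1000_genomes.variants` and the `reference_name` column are assumptions about the public dataset's layout, not something the slides state, so check the schema before relying on them. The function only builds the SQL text; running it would need a BigQuery client and a billing-enabled project.

```python
def variants_per_chromosome_sql(table: str) -> str:
    """Return a standard-SQL query counting variant rows per reference name."""
    return (
        "SELECT reference_name, COUNT(*) AS variant_count\n"
        f"FROM `{table}`\n"
        "GROUP BY reference_name\n"
        "ORDER BY variant_count DESC"
    )


# Assumed table ID and column name; verify against the dataset's schema.
sql = variants_per_chromosome_sql("genomics-public-data.1000_genomes.variants")
print(sql)
```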
20
[Diagram: Variant Analysis. Sources: scientist, high-throughput genome sequencers. Data ingest: Genomics (BAM, FASTQ). Private and public datasets in Cloud Storage: MSSNG Autism, 1000 Genomes, Patient Data, Illumina Platform, Ref Genomes, TCGA. Analytics: online analytics (BigQuery), batch analytics (Cloud Dataflow), lab notebooks (Cloud Datalab).]
21
[Diagram: Genomics, Secondary Analysis. Sources: scientist, high-throughput genome sequencers. Ingest: Carrier Interconnect, ingest server (Compute Engine). Elastic cluster: HPC cluster (Compute Engine, 10 nodes). Storage: raw data files (Cloud Storage), processed data (Cloud Storage), metadata (Cloud SQL). Analytics: online analytics (BigQuery), lab notebooks (Cloud Datalab). Also shown: Cloud Load Balancing, Cloud Network.]
22
Advanced GCP Pipelines Core Products
• Cloud Storage / Public datasets on GCP
• BigQuery
• Cloud Dataflow
• Genomics API
Key Concepts:
-- Pipeline configuration as code
-- Fast, cheap, scalable cloud services
23
Resources
24
Resources
• Cloud Storage (files) -- here
• Compute Engine (VMs) -- here
• Container Engine (Docker) -- here
• BigQuery (SQL) -- here
• Cloud Dataflow (pipelines) -- here
• Genomics API -- here
• Genomics Cookbook -- here
• Public datasets on GCP -- here
• Google’s Genomic code samples -- here
• Lynn’s GitHub code samples -- here
25
More about Google Cloud Services
26
Google Cloud Platform Services Part One
Compute: Compute Engine, App Engine, Container Engine, Container Registry, Cloud Functions
Networking: Cloud Virtual Network, Cloud Load Balancing, Cloud CDN, Cloud Interconnect, Cloud DNS
Big Data: BigQuery, Cloud Dataflow, Cloud Dataproc, Cloud Datalab, Cloud Pub/Sub, Genomics
Identity & Security: Cloud IAM, Cloud Resource Manager, Cloud Security Scanner, Cloud Platform Security
Storage and Databases: Cloud Storage, Cloud Bigtable, Cloud Datastore, Cloud SQL, Persistent Disk
Machine Learning: Cloud Machine Learning, Vision API, Speech API, Natural Language API, Translation API, Jobs API
27
Google Cloud Platform Services Part Two
Management Tools: Stackdriver, Monitoring, Logging, Error Reporting, Trace, Debugger, Deployment Manager, Cloud Endpoints, Cloud Console, Cloud Shell, Cloud Mobile App, Billing App, Cloud APIs
Developer Tools: Cloud SDK, Deployment Manager, Cloud Source Repositories, Cloud Tools for Android Studio, Cloud Tools for IntelliJ, Cloud Tools for PowerShell, Cloud Tools for Visual Studio, Google Plug-in for Eclipse, Cloud Test Lab
28
29
30
GCE Persistence Options – Disks, etc.
Option | Created From | Notes
Image | GCS file or disk | File path <bucket>/<folder>/<file>; disk must be detached from VM
Snapshot | Disk or instance (boot) | Can create an instance from a snapshot
Persistent Disk | Image, snapshot, or blank | Blank disk must be formatted; can create an instance or snapshot from a disk
Bucket | GCS console for file | Access via path gs://<bucketName>/<fileName>
VM Instance Boot Disk | Image, snapshot, or disk | Image: OS, application, or custom image; snapshot: N/A; disk: from saved disk
VM Instance Additional Disk | Local scratch, standard persistent, or SSD persistent | Local scratch: max 8 at 375 GB each; 500 GB to 64 TB; read/write or read-only; attach up to 16 disks* per VM
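The limits on this slide lend themselves to a quick capacity check. The sketch below encodes only the figures given here and in the Editor's Notes (8 local scratch disks of 375 GB each, 16 attached disks for most VM types, 4 for a VM with 3.75 GB RAM). Treating "3.75 GB or less" as the small-VM case is an assumption, and the numbers reflect the deck's date rather than current GCP quotas.

```python
LOCAL_SCRATCH_MAX_COUNT = 8      # from the slide: "Max 8 at 375 GB each"
LOCAL_SCRATCH_SIZE_GB = 375
MAX_DISKS_MOST_VMS = 16          # from the slide: "Attach up to 16 Disks* per VM"
MAX_DISKS_SMALL_VM = 4           # from Editor's Note 10
SMALL_VM_RAM_GB = 3.75


def max_local_scratch_gb() -> int:
    """Total local scratch capacity when every slot is used."""
    return LOCAL_SCRATCH_MAX_COUNT * LOCAL_SCRATCH_SIZE_GB


def max_attached_disks(vm_ram_gb: float) -> int:
    """Disk attachment limit for a VM (the <= threshold is an assumption)."""
    if vm_ram_gb <= SMALL_VM_RAM_GB:
        return MAX_DISKS_SMALL_VM
    return MAX_DISKS_MOST_VMS


def can_attach(vm_ram_gb: float, already_attached: int, extra: int) -> bool:
    """True if `extra` more disks fit under the VM's attachment limit."""
    return already_attached + extra <= max_attached_disks(vm_ram_gb)


print(max_local_scratch_gb())
print(can_attach(vm_ram_gb=3.75, already_attached=3, extra=2))
print(can_attach(vm_ram_gb=15.0, already_attached=3, extra=2))
```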
31


Editor's Notes

  1. Documentation to share an image across multiple GCP projects -- https://cloud.google.com/compute/docs/images/sharing-images-across-projects Documentation to export an image as a file -- https://cloud.google.com/compute/docs/images/export-image
  2. http://googlegenomics.readthedocs.io/en/latest/use_cases/discover_public_data/index.html https://cloud.google.com/genomics/v1/public-data
  3. https://gabrowse.appspot.com
  4. https://kubernetes.io/docs/user-guide/
  5. https://github.com/bgruening/docker-galaxy-stable
  6. BigQuery demo -- https://cloud.google.com/genomics/v1/analyze-variants
  7. Icon and sample diagrams landing page https://cloud.google.com/icons
  8. Sample of Cloud ML and Genomics -- https://github.com/deflaux/cloudml-examples
  9. https://cloud.google.com/solutions/image-management-best-practices
  10. *for most VM instance types (if 3.75 GB RAM on your VM, then max disks to attach is 4)