Data Storage & Database Services in AWS and Azure
A Comparison
As delivered in tech meetup #01, organized by www.edYoda.com on 22nd April 2018 at zekeLabs Bangalore
About me:
Ashish Pandey
Corporate Trainer - Cloud & DevOps
11+ years experience
Ex-Guavus, Ex-Aricent, Ex-Cadence
Agenda
Comparative study of -
Services in Data and Big Data Paradigm including…
• Data Storage services
• Database services
… on AWS and Azure cloud
Data Services (AWS)
Stages: Collect, Store, Analyze/Process, Visualization
Services shown: S3, Snowball, Storage Gateway, EMR, Athena, Redshift, QuickSight, DynamoDB, ElastiCache, RDS, Data Pipeline, Glue, Amazon Kinesis Firehose, Kinesis, Kinesis Streams, DMS, Lambda, Amazon Glacier, Amazon EFS
Data Services (Azure)
Stages: Collect, Store, Analyze/Process, Visualization
Services shown: Blob, Data Box, StorSimple, HDInsight, Data Lake Analytics, SQL Data Warehouse, Power BI, Cosmos DB, Redis Cache, Database, Data Factory, Event Hubs, Stream Analytics, Azure Functions, Data Lake, Data Sync
Data Storage Services
Object based storage services
AWS: S3
▶ Supports replication based on storage class
▶ 4 storage classes:
▶ Standard
▶ Standard-Infrequent Access (Standard-IA)
▶ One Zone-IA
▶ Reduced Redundancy
▶ Versioning can be enabled
Azure: Block Blobs
▶ Replication depends on the storage account
▶ 2 storage classes (access tiers):
▶ Hot
▶ Cool
▶ Replication strategy:
▶ Locally redundant storage (LRS)
▶ Zone-redundant storage (ZRS)
▶ Geo-redundant storage (GRS)
▶ Read-access geo-redundant storage (RA-GRS)
Block based storage services
AWS: EBS
▶ Volume for OS and Data disks
▶ Max Size 16 TB
▶ Replication within AZ
Azure: Page Blobs
▶ Volume for OS and Data disks
▶ Max Size 8 TB
▶ Replication depends on the redundancy strategy
Shared File storage services
AWS: Elastic File System EFS
▶ Uses NFSv4
▶ Max size: unlimited (scales automatically)
▶ Multiple on-premises servers as well as cloud servers can access it simultaneously
Azure: Azure Files
▶ Uses SMB 3.0 and HTTPS
▶ Max size: 5 TB
▶ Multiple on-premises servers as well as cloud servers can access it simultaneously
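The size limits quoted on the block and shared file storage slides can be turned into a quick feasibility check. The sketch below is only an illustration: the table copies the figures from the slides (2018 values, which may have changed since), and the helper names `fits` and `candidates` are hypothetical.

```python
from typing import List

# Maximum sizes in TB as quoted on the slides (2018 figures; verify against
# current provider documentation). None means the slide lists the service
# as unlimited / automatically scaling.
MAX_SIZE_TB = {
    ("AWS", "EBS"): 16,
    ("Azure", "Page Blobs"): 8,
    ("AWS", "EFS"): None,
    ("Azure", "Azure Files"): 5,
}


def fits(provider: str, service: str, requested_tb: float) -> bool:
    """Return True if one volume or share of requested_tb fits the quoted limit."""
    limit = MAX_SIZE_TB[(provider, service)]
    return limit is None or requested_tb <= limit


def candidates(requested_tb: float) -> List[str]:
    """List the services whose quoted limit can hold requested_tb in one volume or share."""
    return [f"{p} {s}" for (p, s) in MAX_SIZE_TB if fits(p, s, requested_tb)]
```

For example, `candidates(10)` keeps only the AWS entries, because the 8 TB and 5 TB limits quoted for Page Blobs and Azure Files are below the requested size.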
Extra storage services
AWS
▶ Glacier
▶ Lifecycle management rules
▶ Multiple types of EBS volumes
▶ Simple Queue Service (SQS)
Azure
▶ Archive storage tier in Blob storage
▶ Data lifecycle management using Data Factory
▶ Storage account:
▶ Standard
▶ Premium
▶ Queue
▶ Table
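Both providers pair an archive tier (Glacier, the Blob archive tier) with some form of lifecycle management. The idea behind an age-based lifecycle rule can be sketched in a few lines. This is not either provider's API: the `LifecycleRule` class, the tier names, and the day thresholds are assumptions made up for illustration.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class LifecycleRule:
    """Hypothetical rule: move objects to colder tiers as they age."""
    days_to_infrequent: int  # age at which an object moves to an infrequent-access tier
    days_to_archive: int     # age at which an object moves to an archive tier


def tier_for(last_modified: date, today: date, rule: LifecycleRule) -> str:
    """Return the tier an object should sit in, given its age in days."""
    age_days = (today - last_modified).days
    if age_days >= rule.days_to_archive:
        return "archive"
    if age_days >= rule.days_to_infrequent:
        return "infrequent"
    return "standard"
```

With `LifecycleRule(days_to_infrequent=30, days_to_archive=90)`, an object last modified on 1 January 2018 would be placed in the archive tier by 22 April 2018, while one modified two days earlier would stay in the standard tier.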
Database Services
Database services in AWS
▶ SQL Server on EC2: SQL Server on a virtual machine
▶ Relational Database Service (RDS): managed database compatible with MySQL, PostgreSQL, Oracle, etc.
▶ Redshift: elastic data warehouse as a service at petabyte scale
▶ DynamoDB: highly distributed NoSQL database for any scale
▶ ElastiCache: powers applications with high-throughput, low-latency data access
▶ Glue: machine-learning-enabled ETL tool
Database services in Azure
▶ SQL Server [IaaS]: SQL Server on a virtual machine
▶ Azure Databases: managed databases compatible with [SQL, MySQL, PostgreSQL]
▶ SQL Data Warehouse: elastic data warehouse as a service with enterprise-class features
▶ SQL Server Stretch Database: dynamically stretch on-premises SQL Server databases to Azure
▶ Azure Cosmos DB: globally distributed, multi-model database for any scale
▶ Table Storage: NoSQL key-value store using semi-structured datasets
Azure: Legacy Advantages
AWS: Legacy Advantages
▶ Pioneer in IaaS.
▶ Amazon Aurora: a cloud-native, in-house, fully managed database compatible with MySQL and PostgreSQL.
▶ Redshift data warehouse.
▶ Third-party compatibility with no baggage
Managed Database
AWS: RDS
▶ Supports: MySQL, MSSQL, PostgreSQL, MariaDB, Oracle, Aurora.
▶ Multi-AZ replication
▶ Cross-region read replicas
▶ Aurora [high performance and throughput]
Azure: SQL Databases
▶ Supports: MSSQL, MySQL, PostgreSQL, MariaDB
▶ Other versions are supported on virtual machines (IaaS)
▶ Stretch databases
▶ Elastic database pools
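Read replicas, listed above for RDS, are typically used to take read traffic off the primary instance. The sketch below shows one way an application might route statements. It is an assumption-laden illustration, not an RDS feature: the `ReplicaRouter` class and the endpoint names are hypothetical, and deciding by the `SELECT` prefix is a deliberate simplification.

```python
import itertools


class ReplicaRouter:
    """Hypothetical router: writes go to the primary, reads rotate over replicas."""

    def __init__(self, primary, replicas):
        self.primary = primary
        # Fall back to the primary when no replicas are configured.
        self._readers = itertools.cycle(replicas or [primary])

    def endpoint_for(self, statement):
        """Pick an endpoint name for a SQL statement (naive SELECT-prefix check)."""
        is_read = statement.lstrip().upper().startswith("SELECT")
        return next(self._readers) if is_read else self.primary
```

A router built as `ReplicaRouter("primary", ["replica-1", "replica-2"])` alternates reads between the two replicas and sends everything else to the primary. Replicas may lag the primary, so reads that must see the latest write would still need to go to the primary.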
Non-Relational Database
AWS: DynamoDB
▶ Automatically Scales
▶ 6x cross-region replication
Azure: CosmosDB
▶ Multi-model, globally distributed
▶ Low-latency NoSQL database.
Data warehouses
AWS: Redshift
▶ Based on PostgreSQL
▶ Columnar storage structure
▶ ELT techniques used
▶ High query performance
▶ Integrates well with other AWS services.
Azure: SQL Data Warehouse
▶ Based on MSSQL
▶ Columnar storage
▶ Unlimited scale
▶ Can be paused when no queries are expected.
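Both warehouses are described as columnar. A toy comparison of row and column layouts shows why that suits analytic queries that aggregate a single field. The sample data and the `to_columns` helper are invented for illustration and say nothing about how Redshift or SQL Data Warehouse store data internally.

```python
# Row layout: each record keeps all of its fields together.
rows = [
    {"order_id": 1, "region": "south", "amount": 120.0},
    {"order_id": 2, "region": "north", "amount": 80.0},
    {"order_id": 3, "region": "south", "amount": 50.0},
]


def to_columns(records):
    """Pivot a list of row dicts into one list of values per column."""
    columns = {}
    for record in records:
        for name, value in record.items():
            columns.setdefault(name, []).append(value)
    return columns


columns = to_columns(rows)

# Row layout: every whole record is visited to sum a single field.
total_from_rows = sum(record["amount"] for record in rows)

# Column layout: only the "amount" column is read.
total_from_columns = sum(columns["amount"])
```

Both totals are equal; the difference is how much data has to be scanned to compute them, which is where columnar engines gain on wide tables.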
ETL Tools
AWS: Glue
▶ Based on Spark
▶ Enabled with Machine Learning
▶ Crawlers
▶ Uses the Data Catalog
▶ Auto-generates Spark code
Azure: Data Factory
▶ Orchestrates data pipeline activities
▶ Comprises activities and data stores
▶ Triggers
▶ May use HDInsight, Spark, Cosmos DB, etc.
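The extract, transform, and load stages that both tools orchestrate can be sketched with the standard library alone. This is a generic illustration, not Glue- or Data Factory-specific code: the sample CSV, the function names, and the in-memory list standing in for a data store are all assumptions.

```python
import csv
import io

# Hypothetical raw input with stray whitespace and one missing amount.
RAW_CSV = "name,amount\nalice, 10\nbob,\ncarol,7\n"


def extract(text):
    """Extract: parse CSV text into a list of row dicts."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(records):
    """Transform: drop rows with no amount and convert amounts to integers."""
    cleaned = []
    for record in records:
        amount = (record["amount"] or "").strip()
        if amount:
            cleaned.append({"name": record["name"].strip(), "amount": int(amount)})
    return cleaned


def load(records, store):
    """Load: append cleaned rows to the target store (a plain list here)."""
    store.extend(records)
    return len(records)


def run_pipeline(text, store):
    """Run the three activities in order and return the number of rows loaded."""
    return load(transform(extract(text)), store)
```

Calling `run_pipeline(RAW_CSV, store)` with an empty list as `store` loads the two rows that have an amount and skips the incomplete one. In a managed service, a trigger or schedule would start the run and the store would be a real database or object store.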
Other Related Services
AWS
▶ Database Migration Service (DMS)
▶ Schema Conversion Tool
▶ RDS Cluster
Azure
▶ SQL Data Sync
▶ SQL Server Management Studio (SSMS)
▶ Database pools
Other factors to select your Cloud
▶ Pricing
▶ Legacy software in use
▶ Availability of skills
▶ Client Requirement
▶ Customer support
▶ Third party tool integration
▶ Integration with Existing Infrastructure
Challenges in Moving to Cloud
▶ Making the correct choice: SaaS, IaaS, PaaS
▶ Loss of Control
▶ Vendor Lock-in
▶ Security and Compliance
▶ Availability and Reliability
▶ Performance and Bandwidth Cost
▶ Integration with Existing Infrastructure
▶ Lack of Skills, Knowledge and Expertise
Thank You
Visit www.edYoda.com for more free tech videos about cutting-edge technologies like cloud, DevOps, Machine Learning, AI, Blockchain and many more.
You are more than welcome to upload your own tech videos on edYoda.
