SlideShare a Scribd company logo
1 of 32
Download to read offline
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Rick Houlihan
Senior Practice Manager, DBS NoSQL
SRV307
Matching the Database to the Workload
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Agenda
• Database workload classifications
• Traditional approaches to scaling RDBMS
• How NoSQL databases compare
• The flavors of NoSQL on AWS
• What database to use when
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Why did you choose this database for your app?
“Because we heard X is the best new thing.”
“Because we have a site license for X.”
“Because X is what we know how to use.”
“Because this database is purpose built to support
what my application is designed to do.”
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Types of Database Workloads
• Online Transaction Processing (OLTP)
• Most common type of app
• Online Analytics Processing (OLAP)
• BI and ad-hoc data projections
• Decision Support Systems (DSS)
• Long running query aggregations and projections
Operations
Analytics
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Sizing the Workload
Unbounded problems are harder to solve
“I need a root cause analysis engine to correlate transaction level events
to trading patterns across global markets.”
Problems with limited scope are easier to solve
“I need a system to manage inventory in my store.”
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Sizing the Database
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Scaling Relational DBs
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Sharded Relational DBs?
A B C D
?
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
NoSQL Databases
• Denormalize and
shard to provide
horizontal scale
• Near unbounded
throughput and
storage
Collection 1
1 TB
Shard A
500 GB
Shard B
500 GB
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
00 55 A954 FFAA00 FF
Partition Keys in NoSQL
Id = 1
Name = Jim
Hash (1) = 7B
Id = 2
Name = Andy
Dept = Eng
Hash (2) = 48
Id = 3
Name = Kim
Dept = Ops
Hash (3) = CD
Key Space
Partition Key uniquely identifies an item
Partition Key is used for building an unordered hash index
Allows table to be partitioned for scale
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
The Iron Triangle of Data - All About CAP
C
A
PConsistency:
all clients always have
the same view of data
Partition tolerance:
the system works well despite physical
network partitions
Availability:
all clients can always
read and write
CA
MSSQL
Oracle
DB2
MySQL
Aster Data
Greenplum
Postgres
CP
Big Table
Hypertable
HBase
MongoDB
Terastore
Couchbase
Scalaris
DynamoDB
BerkeleyDB
Memcached
Redis
Pick Two
AP
Voldemort
Tokyo Cabinet
KAI
Amazon
DynamoDB
Cassandra
SimpleDB
CouchDB
Riak
Data Models:
Relational
Wide Column
Document
Key/Value
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Technology adoption and the hype curve
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
DW | Big Data Processing | Ad hoc
AWS Databases and Analytics
Broadest and deepest portfolio purpose-built for builders
Business Intelligence & Machine Learning
Data Movement
AWS Database Migration Service | AWS Snowball | AWS Snowmobile | Amazon Kinesis Data Firehose | Amazon Kinesis Data Streams
Amazon QuickSight
Relational Databases
Amazon
RDS
Amazon
Aurora
Data lake (Batch/ETL)
Amazon S3 / Amazon
Glacier
(Storage)
AWS Glue
(ETL & Data Catalog)
Machine Learning
Amazon Macie
(Data Protection)
Denormalized Databases
Analytics (OLAP/DSS)
DynamoDB
(Wide Column/Document)
Amazon ElastiCache
(Indexed Key Value)
Amazon
Redshift
Amazon
EMR
Amazon
Athena
Amazon
Kinesis Data
Analytics
Amazon
Elasticsearch
Service
Real-time
Opertional (OLTP)
Amazon
Neptune
(Graph)
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon RDS
Managed relational database service with a choice of six popular database engines
Easy to administer Highly flexible Available &
durable
Fast
No need for infrastructure
provisioning, installing and
maintaining database software.
Scale database compute and
storage with a few mouse
clicks and zero downtime.
Multi-AZ: Automatically
replicates data.
Automated backup,
snapshots, failover.
Choose between dual SSD-
backed storage for high-
performance OLTP.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon DynamoDB
Fast and flexible NoSQL database service for any scale
Key-value NoSQL database that supports both document and wide column structures
Fast, consistent performance
Highly scalable Fully managed
Business-critical
reliability
Consistent single-digit
millisecond latencies at any
scale. DAX speeds up times
to microseconds.
Auto-scaling tables serving
millions of requests per second,
storing hundreds of terabytes of
data.
Automatic provisioning
and infrastructure
management.
Data replicated across
multiple AZs and
accessed with regionally
available APIs.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
DynamoDB Schema
Table
Items
Attributes
Partition
Key
Sort
Key
Mandatory
Key-value access pattern
Determines data distribution
Optional
Model 1:N relationships
Enables rich query capabilities
All items for key
==, <, >, >=, <=
“begins with”
“between”
“contains”
“in”
sorted results
counts
top/bottom N values
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
SQL vs. NoSQL Design Pattern
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Neptune
Fully managed graph database
Fast Reliable Open
Query billions of
relationships with
millisecond latency
Six replicas of your data
across three AZs with full
backup and restore
Build powerful queries
easily with Gremlin and
SPARQL
Supports Apache
TinkerPop & W3C RDF
graph models
Easy
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Neptune High-Level Architecture
Bulk load
from S3
Database
Mgmt.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Graph Workloads
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Normalized Graph Design Pattern
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
De-normalized Graph Design Pattern
Node Relationship Endpoint
Bill
is a Person
visited Eiffel Tower
Alice
is a Person
visited Eiffel Tower
friend of Bob
Bob
is a Person
born on 7/14/90
friend of Alice
interested in Mona Lisa
Leonardo daVinci is a Person
La Jaconde a Washington
is a Video
is about Mona Lisa
Eiffel Tower
is a Place
located in Paris
7/14/90 is a Date
Paris is a City
The Louvre
is a Museum
located in Paris
Mona Lisa
is a Painting
created by Leonardo daVinci
exhibited at The Louvre
Nodes are the
vertices of a graph
Relationships are the
edges of a graph
Select nodes to get
edges for an entity
Index Relationship and
Endpoint for edge type and
target aggregations
Follow the edges to
traverse the graph
Bob wants to see the Mona
Lisa. While he in Paris he
would like to see other things
his friends have enjoyed.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Node Query (Primary)
• What entities are in the graph?
Edge Query (Index)
• What relationships do graph entities have?
Hybrid Query (Traversal)
• How are entities related through each other?
Graph Query Types
RDBMS, NoSQL,
GraphDB
GraphDB
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Redshift – Data Warehousing
Fast, powerful, and simple data warehousing at 1/10 the cost
Massively parallel, petabyte scale
Fast Inexpensive Scalable Secure
Columnar storage technology
to improve I/O efficiency and
parallelize queries. Data load
scales linearly.
As low as $1,000 per
terabyte per year, 1/10th
the cost of traditional data
warehouse solutions.
Resize your cluster up and
down as your performance
and capacity needs
change.
Data encrypted at rest and
transit. Isolate clusters with
VPC. Manage your own keys
with AWS KMS.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Athena – Interactive Analysis
Interactive query service to analyze data in Amazon S3 using standard SQL
No infrastructure to set up or manage and no data to load
Ability to run SQL queries on data archived in Amazon Glacier (Coming soon)
Serverless
Zero setup cost. Just
point to Amazon S3,
and start querying.
Pay per query
Pay only for queries run.
Save 30–90% on per-
query costs through
compression.
Open
ANSI SQL interface,
JDBC/ODBC drivers, multiple
formats, compression types,
and complex Joins and data
types.
Easy
Serverless. Zero
infrastructure. Zero
administration.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Categories of Database
Optimized for Storage Optimized for Compute
Optimized for
Relationships
Normalized relational or
dimensional DW
Denormalized document,
wide column or key value
Denormalized entity
relationship
Ad hoc queries and
aggregations
Instantiated views and
computed aggregations
Ad hoc entity/relationship
aggregations
Scale vertically Scale horizontally Hybrid
Great for OLAP and DSS Built for OLTP or DSS at scale
Designed for graph
traversals
SQL NoSQL Graph
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Infinite Scale:
The database can gracefully increase size
and throughput without practical limits
The Iron Triangle of Purpose (The PIE Theorem)
I
P
E Efficiency:
The database will deliver required query
latency for the workload at all times
Pattern Flexibility:
The database supports random access
patterns and ad hoc queries
PI
Amazon RDS
Elasticsearch
Aurora Serverless
Neptune
IE
Pick Two
PE
Data Models:
Relational
Wide Column
Document
Graph
Columnar
Unstructured
Amazon DynamoDB
Amazon Redshift
Athena
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Hundreds of thousands of customers use DynamoDB
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Hundreds of thousands more use Amazon RDS
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Purpose Built Database Solutions from AWS
Provisioning
Capacity planning
Monitoring
OS patching
Hardware upgrades
Database upgrades
Security patches
Scaling
Monitoring
Performance tuning
Replication across data centers
Re-replicate on server failureProvision new regions
Infrastructure Software
With Zero Unplanned Downtime
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Submit session feedback
1. Tap the Schedule icon.
2. Select the session you attended.
3. Tap Session Evaluation to submit your
feedback.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Thank you!

More Related Content

What's hot

Master the Multi-Clustered Data Warehouse - Snowflake
Master the Multi-Clustered Data Warehouse - SnowflakeMaster the Multi-Clustered Data Warehouse - Snowflake
Master the Multi-Clustered Data Warehouse - SnowflakeMatillion
 
ADV Slides: Strategies for Fitting a Data Lake into a Modern Data Architecture
ADV Slides: Strategies for Fitting a Data Lake into a Modern Data ArchitectureADV Slides: Strategies for Fitting a Data Lake into a Modern Data Architecture
ADV Slides: Strategies for Fitting a Data Lake into a Modern Data ArchitectureDATAVERSITY
 
Customer Data Platform 101
Customer Data Platform 101Customer Data Platform 101
Customer Data Platform 101Kiyoto Tamura
 
Demystifying Data Warehousing as a Service - DFW
Demystifying Data Warehousing as a Service - DFWDemystifying Data Warehousing as a Service - DFW
Demystifying Data Warehousing as a Service - DFWKent Graziano
 
Enterprise Data Architecture Deliverables
Enterprise Data Architecture DeliverablesEnterprise Data Architecture Deliverables
Enterprise Data Architecture DeliverablesLars E Martinsson
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceDATAVERSITY
 
Introducing Databricks Delta
Introducing Databricks DeltaIntroducing Databricks Delta
Introducing Databricks DeltaDatabricks
 
Data Mesh for Dinner
Data Mesh for DinnerData Mesh for Dinner
Data Mesh for DinnerKent Graziano
 
Get Intelligent with Metabase
Get Intelligent with MetabaseGet Intelligent with Metabase
Get Intelligent with MetabaseAnant Corporation
 
Data lineage and observability with Marquez - subsurface 2020
Data lineage and observability with Marquez - subsurface 2020Data lineage and observability with Marquez - subsurface 2020
Data lineage and observability with Marquez - subsurface 2020Julien Le Dem
 
Data and Analytics at Holland & Barrett: Building a '3-Michelin-star' Data Pl...
Data and Analytics at Holland & Barrett: Building a '3-Michelin-star' Data Pl...Data and Analytics at Holland & Barrett: Building a '3-Michelin-star' Data Pl...
Data and Analytics at Holland & Barrett: Building a '3-Michelin-star' Data Pl...Dobo Radichkov
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)James Serra
 
Modern Data Stack for Game Analytics / Dmitry Anoshin (Microsoft Gaming, The ...
Modern Data Stack for Game Analytics / Dmitry Anoshin (Microsoft Gaming, The ...Modern Data Stack for Game Analytics / Dmitry Anoshin (Microsoft Gaming, The ...
Modern Data Stack for Game Analytics / Dmitry Anoshin (Microsoft Gaming, The ...DevGAMM Conference
 
The Case for Graphs in Supply Chains
The Case for Graphs in Supply ChainsThe Case for Graphs in Supply Chains
The Case for Graphs in Supply ChainsNeo4j
 
Data Stewards – Defining and Assigning
Data Stewards – Defining and AssigningData Stewards – Defining and Assigning
Data Stewards – Defining and AssigningDATAVERSITY
 

What's hot (20)

Master the Multi-Clustered Data Warehouse - Snowflake
Master the Multi-Clustered Data Warehouse - SnowflakeMaster the Multi-Clustered Data Warehouse - Snowflake
Master the Multi-Clustered Data Warehouse - Snowflake
 
ADV Slides: Strategies for Fitting a Data Lake into a Modern Data Architecture
ADV Slides: Strategies for Fitting a Data Lake into a Modern Data ArchitectureADV Slides: Strategies for Fitting a Data Lake into a Modern Data Architecture
ADV Slides: Strategies for Fitting a Data Lake into a Modern Data Architecture
 
Customer Data Platform 101
Customer Data Platform 101Customer Data Platform 101
Customer Data Platform 101
 
Demystifying Data Warehousing as a Service - DFW
Demystifying Data Warehousing as a Service - DFWDemystifying Data Warehousing as a Service - DFW
Demystifying Data Warehousing as a Service - DFW
 
Enterprise Data Architecture Deliverables
Enterprise Data Architecture DeliverablesEnterprise Data Architecture Deliverables
Enterprise Data Architecture Deliverables
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and Governance
 
Introducing Databricks Delta
Introducing Databricks DeltaIntroducing Databricks Delta
Introducing Databricks Delta
 
Dremio introduction
Dremio introductionDremio introduction
Dremio introduction
 
Data Mesh for Dinner
Data Mesh for DinnerData Mesh for Dinner
Data Mesh for Dinner
 
Get Intelligent with Metabase
Get Intelligent with MetabaseGet Intelligent with Metabase
Get Intelligent with Metabase
 
Data lineage and observability with Marquez - subsurface 2020
Data lineage and observability with Marquez - subsurface 2020Data lineage and observability with Marquez - subsurface 2020
Data lineage and observability with Marquez - subsurface 2020
 
Data and Analytics at Holland & Barrett: Building a '3-Michelin-star' Data Pl...
Data and Analytics at Holland & Barrett: Building a '3-Michelin-star' Data Pl...Data and Analytics at Holland & Barrett: Building a '3-Michelin-star' Data Pl...
Data and Analytics at Holland & Barrett: Building a '3-Michelin-star' Data Pl...
 
Accelerating Data Warehouse Modernization
Accelerating Data Warehouse ModernizationAccelerating Data Warehouse Modernization
Accelerating Data Warehouse Modernization
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)
 
Elastic Data Warehousing
Elastic Data WarehousingElastic Data Warehousing
Elastic Data Warehousing
 
Snowflake Datawarehouse Architecturing
Snowflake Datawarehouse ArchitecturingSnowflake Datawarehouse Architecturing
Snowflake Datawarehouse Architecturing
 
Modern Data Stack for Game Analytics / Dmitry Anoshin (Microsoft Gaming, The ...
Modern Data Stack for Game Analytics / Dmitry Anoshin (Microsoft Gaming, The ...Modern Data Stack for Game Analytics / Dmitry Anoshin (Microsoft Gaming, The ...
Modern Data Stack for Game Analytics / Dmitry Anoshin (Microsoft Gaming, The ...
 
The Case for Graphs in Supply Chains
The Case for Graphs in Supply ChainsThe Case for Graphs in Supply Chains
The Case for Graphs in Supply Chains
 
Modern Data Architecture
Modern Data ArchitectureModern Data Architecture
Modern Data Architecture
 
Data Stewards – Defining and Assigning
Data Stewards – Defining and AssigningData Stewards – Defining and Assigning
Data Stewards – Defining and Assigning
 

Similar to Applying AWS Purpose-Built Database Strategy - SRV307 - Toronto AWS Summit

SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
 SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ... SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...Amazon Web Services
 
Applying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS Summit
Applying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS SummitApplying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS Summit
Applying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS SummitAmazon Web Services
 
Building with AWS Databases: Match Your Workload to the Right Database (DAT30...
Building with AWS Databases: Match Your Workload to the Right Database (DAT30...Building with AWS Databases: Match Your Workload to the Right Database (DAT30...
Building with AWS Databases: Match Your Workload to the Right Database (DAT30...Amazon Web Services
 
AWS Data Lake: data analysis @ scale
AWS Data Lake: data analysis @ scaleAWS Data Lake: data analysis @ scale
AWS Data Lake: data analysis @ scaleAmazon Web Services
 
Database Freedom. Database migration approaches to get to the Cloud - Marcus ...
Database Freedom. Database migration approaches to get to the Cloud - Marcus ...Database Freedom. Database migration approaches to get to the Cloud - Marcus ...
Database Freedom. Database migration approaches to get to the Cloud - Marcus ...Amazon Web Services
 
Analyze your Data Lake, Fast @ Any Scale - AWS Online Tech Talks
Analyze your Data Lake, Fast @ Any Scale - AWS Online Tech TalksAnalyze your Data Lake, Fast @ Any Scale - AWS Online Tech Talks
Analyze your Data Lake, Fast @ Any Scale - AWS Online Tech TalksAmazon Web Services
 
Immersion Day - Como simplificar o acesso ao seu ambiente analítico
Immersion Day - Como simplificar o acesso ao seu ambiente analíticoImmersion Day - Como simplificar o acesso ao seu ambiente analítico
Immersion Day - Como simplificar o acesso ao seu ambiente analíticoAmazon Web Services LATAM
 
Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018
Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018
Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018Amazon Web Services
 
21st Century Analytics with Zopa
21st Century Analytics with Zopa21st Century Analytics with Zopa
21st Century Analytics with ZopaAmazon Web Services
 
Build Data Lakes & Analytics on AWS: Patterns & Best Practices
Build Data Lakes & Analytics on AWS: Patterns & Best PracticesBuild Data Lakes & Analytics on AWS: Patterns & Best Practices
Build Data Lakes & Analytics on AWS: Patterns & Best PracticesAmazon Web Services
 
Build Data Lakes and Analytics on AWS: Patterns & Best Practices
Build Data Lakes and Analytics on AWS: Patterns & Best PracticesBuild Data Lakes and Analytics on AWS: Patterns & Best Practices
Build Data Lakes and Analytics on AWS: Patterns & Best PracticesAmazon Web Services
 
Using AWS Purpose-Built Databases to Modernize your Applications
Using AWS Purpose-Built Databases to Modernize your ApplicationsUsing AWS Purpose-Built Databases to Modernize your Applications
Using AWS Purpose-Built Databases to Modernize your ApplicationsAmazon Web Services
 
Building a Modern Data Warehouse - Deep Dive on Amazon Redshift
Building a Modern Data Warehouse - Deep Dive on Amazon RedshiftBuilding a Modern Data Warehouse - Deep Dive on Amazon Redshift
Building a Modern Data Warehouse - Deep Dive on Amazon RedshiftAmazon Web Services
 
STG206_Big Data Data Lakes and Data Oceans
STG206_Big Data Data Lakes and Data OceansSTG206_Big Data Data Lakes and Data Oceans
STG206_Big Data Data Lakes and Data OceansAmazon Web Services
 
Builders' Day - Building Data Lakes for Analytics On AWS LC
Builders' Day - Building Data Lakes for Analytics On AWS LCBuilders' Day - Building Data Lakes for Analytics On AWS LC
Builders' Day - Building Data Lakes for Analytics On AWS LCAmazon Web Services LATAM
 
Preparing Your Data for Cloud Analytics & AI/ML
Preparing Your Data for Cloud Analytics & AI/MLPreparing Your Data for Cloud Analytics & AI/ML
Preparing Your Data for Cloud Analytics & AI/MLAmazon Web Services
 
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...Amazon Web Services
 

Similar to Applying AWS Purpose-Built Database Strategy - SRV307 - Toronto AWS Summit (20)

SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
 SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ... SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
 
Applying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS Summit
Applying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS SummitApplying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS Summit
Applying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS Summit
 
Building with AWS Databases: Match Your Workload to the Right Database (DAT30...
Building with AWS Databases: Match Your Workload to the Right Database (DAT30...Building with AWS Databases: Match Your Workload to the Right Database (DAT30...
Building with AWS Databases: Match Your Workload to the Right Database (DAT30...
 
AWS Data Lake: data analysis @ scale
AWS Data Lake: data analysis @ scaleAWS Data Lake: data analysis @ scale
AWS Data Lake: data analysis @ scale
 
Database Freedom. Database migration approaches to get to the Cloud - Marcus ...
Database Freedom. Database migration approaches to get to the Cloud - Marcus ...Database Freedom. Database migration approaches to get to the Cloud - Marcus ...
Database Freedom. Database migration approaches to get to the Cloud - Marcus ...
 
Analyze your Data Lake, Fast @ Any Scale - AWS Online Tech Talks
Analyze your Data Lake, Fast @ Any Scale - AWS Online Tech TalksAnalyze your Data Lake, Fast @ Any Scale - AWS Online Tech Talks
Analyze your Data Lake, Fast @ Any Scale - AWS Online Tech Talks
 
Immersion Day - Como simplificar o acesso ao seu ambiente analítico
Immersion Day - Como simplificar o acesso ao seu ambiente analíticoImmersion Day - Como simplificar o acesso ao seu ambiente analítico
Immersion Day - Como simplificar o acesso ao seu ambiente analítico
 
Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018
Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018
Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018
 
Data Warehouses and Data Lakes
Data Warehouses and Data LakesData Warehouses and Data Lakes
Data Warehouses and Data Lakes
 
Data Warehouses and Data Lakes
Data Warehouses and Data LakesData Warehouses and Data Lakes
Data Warehouses and Data Lakes
 
Data Warehouses and Data Lakes
Data Warehouses and Data LakesData Warehouses and Data Lakes
Data Warehouses and Data Lakes
 
21st Century Analytics with Zopa
21st Century Analytics with Zopa21st Century Analytics with Zopa
21st Century Analytics with Zopa
 
Build Data Lakes & Analytics on AWS: Patterns & Best Practices
Build Data Lakes & Analytics on AWS: Patterns & Best PracticesBuild Data Lakes & Analytics on AWS: Patterns & Best Practices
Build Data Lakes & Analytics on AWS: Patterns & Best Practices
 
Build Data Lakes and Analytics on AWS: Patterns & Best Practices
Build Data Lakes and Analytics on AWS: Patterns & Best PracticesBuild Data Lakes and Analytics on AWS: Patterns & Best Practices
Build Data Lakes and Analytics on AWS: Patterns & Best Practices
 
Using AWS Purpose-Built Databases to Modernize your Applications
Using AWS Purpose-Built Databases to Modernize your ApplicationsUsing AWS Purpose-Built Databases to Modernize your Applications
Using AWS Purpose-Built Databases to Modernize your Applications
 
Building a Modern Data Warehouse - Deep Dive on Amazon Redshift
Building a Modern Data Warehouse - Deep Dive on Amazon RedshiftBuilding a Modern Data Warehouse - Deep Dive on Amazon Redshift
Building a Modern Data Warehouse - Deep Dive on Amazon Redshift
 
STG206_Big Data Data Lakes and Data Oceans
STG206_Big Data Data Lakes and Data OceansSTG206_Big Data Data Lakes and Data Oceans
STG206_Big Data Data Lakes and Data Oceans
 
Builders' Day - Building Data Lakes for Analytics On AWS LC
Builders' Day - Building Data Lakes for Analytics On AWS LCBuilders' Day - Building Data Lakes for Analytics On AWS LC
Builders' Day - Building Data Lakes for Analytics On AWS LC
 
Preparing Your Data for Cloud Analytics & AI/ML
Preparing Your Data for Cloud Analytics & AI/MLPreparing Your Data for Cloud Analytics & AI/ML
Preparing Your Data for Cloud Analytics & AI/ML
 
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
 

More from Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Applying AWS Purpose-Built Database Strategy - SRV307 - Toronto AWS Summit

  • 1. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Rick Houlihan Senior Practice Manager, DBS NoSQL SRV307 Matching the Database to the Workload
  • 2. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Agenda • Database workload classifications • Traditional approaches to scaling RDBMS • How NoSQL databases compare • The flavors of NoSQL on AWS • What database to use when
  • 3. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Why did you choose this database for your app? “Because we heard X is the best new thing.” “Because we have a site license for X.” “Because X is what we know how to use.” “Because this database is purpose built to support what my application is designed to do.”
  • 4. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Types of Database Workloads • Online Transaction Processing (OLTP) • Most common type of app • Online Analytics Processing (OLAP) • BI and ad-hoc data projections • Decision Support Systems (DSS) • Long running query aggregations and projections Operations Analytics
  • 5. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Sizing the Workload Unbounded problems are harder to solve “I need a root cause analysis engine to correlate transaction level events to trading patterns across global markets.” Problems with limited scope are easier to solve “I need a system to manage inventory in my store.”
  • 6. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Sizing the Database
  • 7. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Scaling Relational DBs
  • 8. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Sharded Relational DBs? A B C D ?
  • 9. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. NoSQL Databases • Denormalize and shard to provide horizontal scale • Near unbounded throughput and storage Collection 1 1 TB Shard A 500 GB Shard B 500 GB
  • 10. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. 00 55 A954 FFAA00 FF Partition Keys in NoSQL Id = 1 Name = Jim Hash (1) = 7B Id = 2 Name = Andy Dept = Eng Hash (2) = 48 Id = 3 Name = Kim Dept = Ops Hash (3) = CD Key Space Partition Key uniquely identifies an item Partition Key is used for building an unordered hash index Allows table to be partitioned for scale
  • 11. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. The Iron Triangle of Data - All About CAP C A PConsistency: all clients always have the same view of data Partition tolerance: the system works well despite physical network partitions Availability: all clients can always read and write CA MSSQL Oracle DB2 MySQL Aster Data Greenplum Postgres CP Big Table Hypertable HBase MongoDB Terastore Couchbase Scalaris DynamoDB BerkeleyDB Memcached Redis Pick Two AP Voldemort Tokyo Cabinet KAI Amazon DynamoDB Cassandra SimpleDB CouchDB Riak Data Models: Relational Wide Column Document Key/Value
  • 12. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Technology adoption and the hype curve
  • 13. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. DW | Big Data Processing | Ad hoc AWS Databases and Analytics Broadest and deepest portfolio purpose-built for builders Business Intelligence & Machine Learning Data Movement AWS Database Migration Service | AWS Snowball | AWS Snowmobile | Amazon Kinesis Data Firehose | Amazon Kinesis Data Streams Amazon QuickSight Relational Databases Amazon RDS Amazon Aurora Data lake (Batch/ETL) Amazon S3 / Amazon Glacier (Storage) AWS Glue (ETL & Data Catalog) Machine Learning Amazon Macie (Data Protection) Denormalized Databases Analytics (OLAP/DSS) DynamoDB (Wide Column/Document) Amazon ElastiCache (Indexed Key Value) Amazon Redshift Amazon EMR Amazon Athena Amazon Kinesis Data Analytics Amazon Elasticsearch Service Real-time Opertional (OLTP) Amazon Neptune (Graph)
  • 14. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon RDS Managed relational database service with a choice of six popular database engines Easy to administer Highly flexible Available & durable Fast No need for infrastructure provisioning, installing and maintaining database software. Scale database compute and storage with a few mouse clicks and zero downtime. Multi-AZ: Automatically replicates data. Automated backup, snapshots, failover. Choose between dual SSD- backed storage for high- performance OLTP.
  • 15. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon DynamoDB Fast and flexible NoSQL database service for any scale Key-value NoSQL database that supports both document and wide column structures Fast, consistent performance Highly scalable Fully managed Business-critical reliability Consistent single-digit millisecond latencies at any scale. DAX speeds up times to microseconds. Auto-scaling tables serving millions of requests per second, storing hundreds of terabytes of data. Automatic provisioning and infrastructure management. Data replicated across multiple AZs and accessed with regionally available APIs.
  • 16. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. DynamoDB Schema Table Items Attributes Partition Key Sort Key Mandatory Key-value access pattern Determines data distribution Optional Model 1:N relationships Enables rich query capabilities All items for key ==, <, >, >=, <= “begins with” “between” “contains” “in” sorted results counts top/bottom N values
  • 17. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. SQL vs. NoSQL Design Pattern
  • 18. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Neptune Fully managed graph database Fast Reliable Open Query billions of relationships with millisecond latency Six replicas of your data across three AZs with full backup and restore Build powerful queries easily with Gremlin and SPARQL Supports Apache TinkerPop & W3C RDF graph models Easy
  • 19. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Neptune High-Level Architecture Bulk load from S3 Database Mgmt.
  • 20. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Graph Workloads
  • 21. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Normalized Graph Design Pattern
  • 22. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. De-normalized Graph Design Pattern Node Relationship Endpoint Bill is a Person visited Eiffel Tower Alice is a Person visited Eiffel Tower friend of Bob Bob is a Person born on 7/14/90 friend of Alice interested in Mona Lisa Leonardo daVinci is a Person La Jaconde a Washington is a Video is about Mona Lisa Eiffel Tower is a Place located in Paris 7/14/90 is a Date Paris is a City The Louvre is a Museum located in Paris Mona Lisa is a Painting created by Leonardo daVinci exhibited at The Louvre Nodes are the vertices of a graph Relationships are the edges of a graph Select nodes to get edges for an entity Index Relationship and Endpoint for edge type and target aggregations Follow the edges to traverse the graph Bob wants to see the Mona Lisa. While he in Paris he would like to see other things his friends have enjoyed.
  • 23. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Node Query (Primary) • What entities are in the graph? Edge Query (Index) • What relationships do graph entities have? Hybrid Query (Traversal) • How are entities related through each other? Graph Query Types RDBMS, NoSQL, GraphDB GraphDB
  • 24. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Redshift – Data Warehousing Fast, powerful, and simple data warehousing at 1/10 the cost Massively parallel, petabyte scale Fast Inexpensive Scalable Secure Columnar storage technology to improve I/O efficiency and parallelize queries. Data load scales linearly. As low as $1,000 per terabyte per year, 1/10th the cost of traditional data warehouse solutions. Resize your cluster up and down as your performance and capacity needs change. Data encrypted at rest and transit. Isolate clusters with VPC. Manage your own keys with AWS KMS.
  • 25. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Athena – Interactive Analysis Interactive query service to analyze data in Amazon S3 using standard SQL No infrastructure to set up or manage and no data to load Ability to run SQL queries on data archived in Amazon Glacier (Coming soon) Serverless Zero setup cost. Just point to Amazon S3, and start querying. Pay per query Pay only for queries run. Save 30–90% on per- query costs through compression. Open ANSI SQL interface, JDBC/ODBC drivers, multiple formats, compression types, and complex Joins and data types. Easy Serverless. Zero infrastructure. Zero administration.
  • 26. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Categories of Database Optimized for Storage Optimized for Compute Optimized for Relationships Normalized relational or dimensional DW Denormalized document, wide column or key value Denormalized entity relationship Ad hoc queries and aggregations Instantiated views and computed aggregations Ad hoc entity/relationship aggregations Scale vertically Scale horizontally Hybrid Great for OLAP and DSS Built for OLTP or DSS at scale Designed for graph traversals SQL NoSQL Graph
  • 27. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Infinite Scale: The database can gracefully increase size and throughput without practical limits The Iron Triangle of Purpose (The PIE Theorem) I P E Efficiency: The database will deliver required query latency for the workload at all times Pattern Flexibility: The database supports random access patterns and ad hoc queries PI Amazon RDS Elasticsearch Aurora Serverless Neptune IE Pick Two PE Data Models: Relational Wide Column Document Graph Columnar Unstructured Amazon DynamoDB Amazon Redshift Athena
  • 28. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Hundreds of thousands of customers use DynamoDB
  • 29. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Hundreds of thousands more use Amazon RDS
  • 30. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Purpose Built Database Solutions from AWS Provisioning Capacity planning Monitoring OS patching Hardware upgrades Database upgrades Security patches Scaling Monitoring Performance tuning Replication across data centers Re-replicate on server failureProvision new regions Infrastructure Software With Zero Unplanned Downtime
  • 31. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Submit session feedback 1. Tap the Schedule icon. 2. Select the session you attended. 3. Tap Session Evaluation to submit your feedback.
  • 32. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Thank you!