SlideShare a Scribd company logo
1 of 48
.©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
John  Chang
Ecosystem  Solutions  Architect
April  2016
大數據運算
媒體業案例分享
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
What  is  big  data?
Big  data  on  AWS
Northbay customer  case  studies
Best  practices
APN  resources
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
What  Is  Big  Data  &  Why  Do  We  Care?
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
GB
TB
PB
ZB
EB
Big  Data:  Unconstrained  Growth
95%  of  the  1.2  
zettabytes of  data  in  the  
digital  universe  is  
unstructured
70%  of  this  data  is  user-­
generated  content  
Unstructured  data  
growth  is  explosive
Machine  data/IoT will  
only  steepen  the  curve
Source:  IDC
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Data  Gap
1990 2000 2010 2020
Generated  Data
Available  for  Analysis
Data  Volume
Sources:  
Gartner:  User  Survey  Analysis:  Key  Trends  Shaping  the  Future  of  Data  Center  Infrastructure  Through  2011  
IDC:  Worldwide  Business  Analytics  Software  2012–2016  Forecast  and  2011  Vendor  Shares  
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Big  Data  Evolution
Batch
Report
Real-­‐time  
Alerts
Prediction
Forecast
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Plethora  of  Tools
Amazon  
Glacier
S3 DynamoDB  
RDS
EMR
Amazon  
Redshift
Data  PipelineAmazon  Kinesis  
Cassandra
CloudSearch
Kinesis-­
enabled  
app
Lambda ML
SQS
ElastiCache
DynamoDB
Streams  
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
A  complete  platform  for  big  data  &  analytics
Retrospective
analysis  and  
reporting
Here-­and-­now
real-­time  processing  
and  dashboards
Predictions
to  enable  smart  
applications
Amazon  Kinesis  
Amazon  EC2  
Amazon  Redshift  
Amazon  EMR
Amazon  ML
Amazon  EMR
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Is  there  a  reference  architecture  ?
What  tools  should  I  use  ?
How  ?  
Why  ?
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
http://aws.amazon.com/marketplace
Big  Data  Case  Studies
Learn  from  other  AWS  customers
aws.amazon.com/solutions/case-­studies/big-­data
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Simplify  Big  Data  Processing
ingest  /
collect
store
process  /
analyze
consume  /  
visualize
Time  to  Answer  (Latency)
Throughput
Cost
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Collect  /
Ingest  
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Types  of  Data
• Transactional
• Database  reads  &  writes  (OLTP)
• Cache  
• Search
• Logs
• Streams
• File
• Log  files  (/var/log)
• Log  collectors  &  frameworks
• Stream
• Log  records
• Sensors  &  IoT data
Database
File
Storage
Stream
Storage
A
iOS Android
Web  Apps
Logstash
LoggingIoTApplications
Transactional Data
File Data
Stream Data
Mobile  
Apps
Search Data
Search
Collect Store
LoggingIoT
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Store
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Stream  
Storage
A
iOS Android
Web  Apps
Logstash
Amazon
RDS
Amazon
DynamoDB
Amazon
ES
Amazon
S3
Apache
Kafka
Amazon
Glacier
Amazon
Kinesis
Amazon
DynamoDB
Amazon
ElastiCache
SearchSQLNoSQLCacheStreamStorageFileStorage
Transactional Data
File Data
Stream Data
Mobile  
Apps
Search Data
Database
File
Storage
Search
Collect Store
LoggingIoTApplications
ü
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Why  Is  Amazon  S3  Good  for  Big  Data?
• Natively  supported  by  big  data  frameworks (Spark,  Hive,  Presto,  etc.)  
• No  need  to  run  compute  clusters  for  storage  (unlike  HDFS)
• Can  run  transient  Hadoop  clusters  &  Amazon  EC2  Spot  instances
• Multiple  distinct  (Spark,  Hive,  Presto)  clusters  can  use  the  same  data
• Unlimited  number  of  objects  
• Very  high  bandwidth    – no  aggregate  throughput  limit
• Highly  available  – can  tolerate  AZ  failure
• Designed  for  99.999999999%  durability
• Tired-­storage  (Standard,  IA,  Amazon  Glacier)  via  life-­cycle  policy
• Secure  – SSL,  client/server-­side  encryption  at  rest
• Low  cost
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
What  about  HDFS  &  Amazon  Glacier?
• Use  HDFS  for  very  frequently  
accessed  (hot)  data
• Use  Amazon  S3  Standard  for  
frequently  accessed  data  
• Use  Amazon  S3  Standard  –
IA  for  infrequently  accessed  
data
• Use  Amazon  Glacier  for  
archiving  cold  data  
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Database  +  
Search  
Tier
A
iOS Android
Web  Apps
Logstash
Amazon
RDS
Amazon
DynamoDB
Amazon
ES
Amazon
S3
Apache
Kafka
Amazon
Glacier
Amazon
Kinesis
Amazon
DynamoDB
Amazon
ElastiCache
SearchSQLNoSQLCacheStreamStorageFileStorage
Transactional Data
File Data
Stream Data
Mobile  
Apps
Search Data
Collect Store
ü
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Database  +  Search  Tier  Anti-­pattern
Database  +  Search  Tier
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Best  Practice  — Use  the  Right  Tool  for  the  Job
Data  Tier
Search
Amazon  
Elasticsearch
Service
Amazon  
CloudSearch
Cache
Redis
Memcached
SQL
Amazon  Aurora
MySQL
PostgreSQL
Oracle
SQL  Server
NoSQL
Cassandra
Amazon  
DynamoDB
HBase
MongoDB
Database  +  Search  Tier
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
What  Data  Store  Should  I  Use?
• Data  structure  →  Fixed  schema,  JSON,  key-­value
• Access  patterns  →  Store  data  in  the  format  you  will  
access  it
• Data  /  access  characteristics  →  Hot,  warm,  cold
• Cost  →  Right  cost
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Data  Structure  and  Access  Patterns
Access  Patterns What  to  use?
Put/Get  (Key, Value) Cache,  NoSQL
Simple relationships  →  1:N, M:N NoSQL
Cross table  joins,  transaction,  SQL SQL
Faceting,  Search   Search
Data Structure What  to  use?
Fixed  schema SQL,  NoSQL
Schema-­free (JSON) NoSQL,  Search
(Key, Value) Cache,  NoSQL
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Data  /  Access  Characteristics:  Hot,  Warm,  Cold
Hot Warm Cold
Volume MB–GB GB–TB PB
Item  size B–KB KB–MB KB–TB
Latency ms ms,  sec min,  hrs
Durability Low–High High Very  High
Request  rate Very  High High Low
Cost/GB $$-­$ $-­¢¢ ¢
Hot  Data Warm  Data Cold  Data
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
What  Data  Store  Should  I  Use?
Amazon  
ElastiCache
Amazon
DynamoDB
Amazon
Aurora
Amazon
Elasticsearch
Amazon  
EMR  (HDFS)
Amazon  S3 Amazon Glacier
Average  
latency
ms ms ms,  sec ms,sec sec,min,hrs ms,sec,min
(~  size)
hrs
Data  volume GB GB–TBs
(no limit)
GB–TB
(64  TB  
Max)
GB–TB GB–PB
(~nodes)
MB–PB
(no limit)
GB–PB
(no limit)
Item  size B-­KB KB
(400  KB  
max)
KB
(64  KB)
KB
(1  MB  max)
MB-­GB KB-­GB
(5  TB max)
GB
(40  TB  max)
Request  rate High  -­
Very  High
Very  High
(no  limit)
High High Low  – Very  
High
Low  –
Very  High
(no limit)
Very  Low
Storage  cost
GB/month
$$ ¢¢ ¢¢ ¢¢ ¢ ¢ ¢/10
Durability Low  -­
Moderate
Very  High Very  High High High Very  High Very  High
Hot  Data Warm  Data Cold  Data
Hot  Data Warm  Data Cold  Data
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Process  /
Analyze
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
AnalyzeA
iOS Android
Web  Apps
Logstash
Amazon
RDS
Amazon
DynamoDB
Amazon
ES
Amazon
S3
Apache
Kafka
Amazon
Glacier
Amazon
Kinesis
Amazon
DynamoDB
Amazon
Redshift
Impala
Pig
Amazon ML
Streaming
Amazon
Kinesis
AWS
Lambda
AmazonElasticMapReduce
Amazon
ElastiCache
SearchSQLNoSQLCache
StreamProcessingBatchInteractive
Logging
StreamStorage
IoTApplications
FileStorage
Hot
Cold
War
m
Hot
Hot
ML
Transactional Data
File Data
Stream Data
Mobile  
Apps
Search Data
Collect Store Analyze
ü ü
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Process  /  Analyze
Analysis  of  data is  a  process  of  inspecting,  cleaning,  
transforming,  and  modeling data with  the  goal  of  discovering  
useful information,  suggesting  conclusions,  and  supporting  
decision-­making.
Examples
• Interactive  dashboards  → Interactive  analytics
• Daily/weekly/monthly  reports  →  Batch  analytics
• Billing/fraud  alerts,  1  minute  metrics  →  Real-­time  analytics
• Sentiment  analysis,  prediction  models  →  Machine  learning
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Spark  Streaming  
Apache  Storm
AWS  Lambda
KCL
Amazon  
Redshift Spark  
Impala  
Presto
Hive
Amazon
Redshift
Hive
Spark  
Presto
Impala
Amazon   Kinesis
Apache  Kafka
Amazon  
DynamoDB
Amazon  S3data
Hot Cold
Data  Temperature
Processing  Latency
Low
High Answers
Amazon  EMR  
(HDFS)
Hive
Native
KCL
AWS  Lambda
Data  Temperature  vs  Processing  Latency
Batch
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Interactive  Analytics
Takes  large  amount  of  (warm/cold)  data
Takes  seconds to  get  answers  back
Example:  Self-­service  dashboards
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Batch  Analytics
Takes  large  amount  of  (warm/cold)  data
Takes  minutes  or  hours to  get  answers  back
Example:  Generating  daily,  weekly,  or  monthly  reports
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Real-­Time  Analytics
Take  small  amount  of  hot  data  and  ask  questions  
Takes  short  amount  of  time  (milliseconds  or  seconds)  to  
get  your  answer  back
• Real-­time  (event)
• Real-­time  response  to  events  in  data  streams
• Example:  Billing/Fraud  Alerts  
• Near  real-­time  (micro-­batch)
• Near  real-­time  operations  on  small  batches  of  events  in  data  
streams
• Example:  1  Minute  Metrics
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Predictions  via  Machine  Learning
ML  gives  computers  the  ability  to  learn  without  being  explicitly  
programmed
Machine  Learning  Algorithms:
-­ Supervised  Learning  ←  “teach”  program
-­ Classification  ← Is  this  transaction  fraud?  (Yes/No)  
-­ Regression  ← Customer  Life-­time  value?  
-­ Unsupervised  Learning  ←  let  it  learn  by  itself
-­ Clustering  ←  Market  Segmentation
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Analysis  Tools  and  Frameworks
Machine  Learning
• Mahout,  Spark  ML,  Amazon  ML
Interactive  Analytics
• Amazon  Redshift,  Presto,  Impala,  Spark
Batch  Processing
• MapReduce,  Hive,  Pig,  Spark
Stream  Processing
• Micro-­batch:  Spark  Streaming,  KCL,  Hive,  Pig
• Real-­time:  Storm,  AWS  Lambda,  KCL
Amazon
Redshift
Impala
Pig
Amazon Machine
Learning
Streaming
Amazon
Kinesis
AWS
Lambda
AmazonElasticMapReduce
StreamProcessingBatchInteractiveML
Analyze
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Real-­time  Analytics
Producer
Apache
Kafka
KCL
AWS  Lambda
Spark
Streaming
Apache  
Storm
Amazon  
SNS
Amazon
ML
Notifications
Amazon
ElastiCache
(Redis)
Amazon
DynamoDB
Amazon
RDS
Amazon
ES
Alert
App  state
Real-­time  Prediction
KPI
process
store
DynamoDB
Streams
Amazon  
Kinesis
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Interactive  &  
Batch
Analytics
Producer Amazon  S3
Amazon  EMR
Hive
Pig
Spark
Amazon
ML
process
store
Consume
Amazon  
Redshift
Amazon  EMR
Presto
Impala
Spark
Batch
Interactive
Batch  Prediction
Real-­time  Prediction
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Batch  Layer
Amazon
Kinesis
data
process
store
Lambda  Architecture
Amazon  
Kinesis  S3  
Connector  
Amazon  S3
A
p
p
l
i
c
a
t
i
o
n
s
Amazon  
Redshift
Amazon  EMR
Presto
Hive
Pig
Spark
answer
Speed  Layer
answer
Serving  
Layer
Amazon
ElastiCache
Amazon
DynamoDB
Amazon
RDS
Amazon
ES
answer
Amazon
ML
KCL
AWS  Lambda
Spark  Streaming
Storm
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Consume  /  
Visualize
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Collect Store Analyze Consume
A
iOS Android
Web  Apps
Logstash
Amazon
RDS
Amazon
DynamoDB
Amazon
ES
Amazon
S3
Apache
Kafka
Amazon
Glacier
Amazon
Kinesis
Amazon
DynamoDB
Amazon
Redshift
Impala
Pig
Amazon ML
Streaming
Amazon
Kinesis
AWS
Lambda
AmazonElasticMapReduce
Amazon
ElastiCache
SearchSQLNoSQLCache
StreamProcessingBatchInteractive
Logging
StreamStorage
IoTApplications
FileStorage
Analysis&Visualization
Hot
Cold
War
m
Hot
Slow
Hot
ML
Fast
Fast
Transactional Data
File Data
Stream Data
Notebook
s
Predictions
Apps & APIs
Mobile  
Apps
IDE
Search Data
ETL
Amazon  
QuickSight
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Consume
• Predictions  
• Analysis  and  Visualization
• Notebooks
• IDE
• Applications  &  API
Consume
Analysis&Visualization
Amazon  
QuickSight
Notebook
s
Predictions
Apps & APIs
IDE
Store Analyze ConsumeETL
Business  
users
Data  Scientist,  
Developers
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Putting  It  All  Together
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Collect Store Analyze Consume
A
iOS Android
Web  Apps
Logstash
Amazon
RDS
Amazon
DynamoDB
Amazon
ES
Amazon
S3
Apache
Kafka
Amazon
Glacier
Amazon
Kinesis
Amazon
DynamoDB
Amazon
Redshift
Impala
Pig
Amazon ML
Streaming
Amazon
Kinesis
AWS
Lambda
AmazonElasticMapReduce
Amazon
ElastiCache
SearchSQLNoSQLCache
StreamProcessingBatchInteractive
Logging
StreamStorage
IoTApplications
FileStorage
Analysis&Visualization
Hot
Cold
War
m
Hot
Slow
Hot
ML
Fast
Fast
Amazon  
QuickSight
Transactional Data
File Data
Stream Data
Notebook
s
Predictions
Apps & APIs
Mobile  
Apps
IDE
Search Data
ETL
Reference  Architecture
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Problem  Statement:
• Need  massive  scalability  and  elasticity
Use  of  AWS:
• Nearly  100%  of  its  online  video  service  on  AWS
• Global  use  of  Amazon  EC2,  Amazon  S3,  Amazon  SQS,  
Amazon  EMR,  Lambda,  etc.
• 30-­50K  EC2  instances
Business  Benefits:  
• Application  achieves  near  zero  downtime
• Massive  scalability  and  elasticity
• Transcoding  entire  library  to  ~60  output  renditions
“AWS  is  the  market  leader  and  has  been  able  to  create  a  continuous  and  virtuous  cycle.”  
– Kevin  McEntee,  VP  Content  Engineering,  Netflix
Customer:  Netflix
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
AdRoll Builds  Bidding  Platform  on  AWS  and  Cuts  Costs  by  83%
AdRoll is  a  global  leader  in  digital  advertising  
retargeting  products.
We’ve  been  able  to  
seamlessly  scale  our  
infrastructure  and  reduce  our  
fixed  costs  by  75%  and  
operational  costs  by  83%.”
Valentino  Volonghi
CTO,  AdRoll
”
“ • AdRoll manages  its  Real-­Time  Bidding  platform  using  
Amazon  EC2,  Amazon  Dynmo DB,  and  Amazon  S3
• Reduced  annual  operational  costs  by  83%
• Reduced  fixed  costs  by  75%
• Staff  now  95%  focused  on  new  product  development  
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Problem  Statement:
• Needed  scalable,  high  performance,  and  highly  available  
storage  and  big  data  solutions
Use  of  AWS:
• Direct  Connect,  S3,  EMR,  other  AWS  services
• Went  from  ~5GB  of  logs  per  day  to  ~1300GB/day
Business  Benefits:  
• By  moving  to  AWS,  went  from  spending  $50K/mo to  
$13K/mo on  big  data  solutions
Xfinity X1  Set  Top  Box  Platform
Customer:  Comcast
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
MLB  Advanced  Media
「消費者行為正在改變。他們從行動裝置上網購物,這種技
術對於球賽的進化非常重要。」
「我們的努力中最令人興奮的事,就是 AWS  支援的
Statcast。我們首次可以測量以前無法測量的資料。」
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Partnering  with  AWS
Thank  you!
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.
Questions?
Thank  you!
©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.

More Related Content

What's hot

AWSome Day 2016 - Module 4: Databases: Amazon DynamoDB and Amazon RDS
AWSome Day 2016 - Module 4: Databases: Amazon DynamoDB and Amazon RDSAWSome Day 2016 - Module 4: Databases: Amazon DynamoDB and Amazon RDS
AWSome Day 2016 - Module 4: Databases: Amazon DynamoDB and Amazon RDSAmazon Web Services
 
AWS re:Invent 2016: Deep Dive: Amazon EMR Best Practices & Design Patterns (B...
AWS re:Invent 2016: Deep Dive: Amazon EMR Best Practices & Design Patterns (B...AWS re:Invent 2016: Deep Dive: Amazon EMR Best Practices & Design Patterns (B...
AWS re:Invent 2016: Deep Dive: Amazon EMR Best Practices & Design Patterns (B...Amazon Web Services
 
Amazon RDS: Deep Dive - SRV310 - Chicago AWS Summit
Amazon RDS: Deep Dive - SRV310 - Chicago AWS SummitAmazon RDS: Deep Dive - SRV310 - Chicago AWS Summit
Amazon RDS: Deep Dive - SRV310 - Chicago AWS SummitAmazon Web Services
 
A Data Culture with Embedded Analytics in Action
A Data Culture with Embedded Analytics in ActionA Data Culture with Embedded Analytics in Action
A Data Culture with Embedded Analytics in ActionAmazon Web Services
 
Introduction to Amazon Relational Database Service
Introduction to Amazon Relational Database ServiceIntroduction to Amazon Relational Database Service
Introduction to Amazon Relational Database ServiceAmazon Web Services
 
Backup and Recovery with Cloud-Native Deduplication and Use Cases from the Fi...
Backup and Recovery with Cloud-Native Deduplication and Use Cases from the Fi...Backup and Recovery with Cloud-Native Deduplication and Use Cases from the Fi...
Backup and Recovery with Cloud-Native Deduplication and Use Cases from the Fi...Amazon Web Services
 
Architectures for HPC and HTC Workloads on AWS | AWS Public Sector Summit 2017
Architectures for HPC and HTC Workloads on AWS | AWS Public Sector Summit 2017Architectures for HPC and HTC Workloads on AWS | AWS Public Sector Summit 2017
Architectures for HPC and HTC Workloads on AWS | AWS Public Sector Summit 2017Amazon Web Services
 
Operations: Cost Optimization - Don't Overspend on Infrastructure
Operations: Cost Optimization - Don't Overspend on Infrastructure Operations: Cost Optimization - Don't Overspend on Infrastructure
Operations: Cost Optimization - Don't Overspend on Infrastructure Amazon Web Services
 
Accelerate your Business with SAP on AWS - AWS Summit Cape Town 2017
Accelerate your Business with SAP on AWS - AWS Summit Cape Town 2017 Accelerate your Business with SAP on AWS - AWS Summit Cape Town 2017
Accelerate your Business with SAP on AWS - AWS Summit Cape Town 2017 Amazon Web Services
 
Module 2 - Datalake
Module 2 - DatalakeModule 2 - Datalake
Module 2 - DatalakeLam Le
 
Migrating Large Scale Data Sets to the Cloud
Migrating Large Scale Data Sets to the CloudMigrating Large Scale Data Sets to the Cloud
Migrating Large Scale Data Sets to the CloudAmazon Web Services
 
BDA302 Deep Dive on Migrating Big Data Workloads to Amazon EMR
BDA302 Deep Dive on Migrating Big Data Workloads to Amazon EMRBDA302 Deep Dive on Migrating Big Data Workloads to Amazon EMR
BDA302 Deep Dive on Migrating Big Data Workloads to Amazon EMRAmazon Web Services
 
AWS re:Invent 2016: Big Data Architectural Patterns and Best Practices on AWS...
AWS re:Invent 2016: Big Data Architectural Patterns and Best Practices on AWS...AWS re:Invent 2016: Big Data Architectural Patterns and Best Practices on AWS...
AWS re:Invent 2016: Big Data Architectural Patterns and Best Practices on AWS...Amazon Web Services
 
Bursting on-premise analytic workloads to Amazon EMR using Alluxio
Bursting on-premise analytic workloads to Amazon EMR using AlluxioBursting on-premise analytic workloads to Amazon EMR using Alluxio
Bursting on-premise analytic workloads to Amazon EMR using AlluxioAlluxio, Inc.
 
Deep Dive on Amazon RDS (May 2016)
Deep Dive on Amazon RDS (May 2016)Deep Dive on Amazon RDS (May 2016)
Deep Dive on Amazon RDS (May 2016)Julien SIMON
 
ENT305 Migrating Your Databases to AWS: Deep Dive on Amazon Relational Databa...
ENT305 Migrating Your Databases to AWS: Deep Dive on Amazon Relational Databa...ENT305 Migrating Your Databases to AWS: Deep Dive on Amazon Relational Databa...
ENT305 Migrating Your Databases to AWS: Deep Dive on Amazon Relational Databa...Amazon Web Services
 
Building with AWS Databases: Match Your Workload to the Right Database (DAT30...
Building with AWS Databases: Match Your Workload to the Right Database (DAT30...Building with AWS Databases: Match Your Workload to the Right Database (DAT30...
Building with AWS Databases: Match Your Workload to the Right Database (DAT30...Amazon Web Services
 
Migrating to Amazon RDS with Database Migration Service
Migrating to Amazon RDS with Database Migration ServiceMigrating to Amazon RDS with Database Migration Service
Migrating to Amazon RDS with Database Migration ServiceAmazon Web Services
 

What's hot (20)

AWSome Day 2016 - Module 4: Databases: Amazon DynamoDB and Amazon RDS
AWSome Day 2016 - Module 4: Databases: Amazon DynamoDB and Amazon RDSAWSome Day 2016 - Module 4: Databases: Amazon DynamoDB and Amazon RDS
AWSome Day 2016 - Module 4: Databases: Amazon DynamoDB and Amazon RDS
 
AWS re:Invent 2016: Deep Dive: Amazon EMR Best Practices & Design Patterns (B...
AWS re:Invent 2016: Deep Dive: Amazon EMR Best Practices & Design Patterns (B...AWS re:Invent 2016: Deep Dive: Amazon EMR Best Practices & Design Patterns (B...
AWS re:Invent 2016: Deep Dive: Amazon EMR Best Practices & Design Patterns (B...
 
Amazon RDS: Deep Dive - SRV310 - Chicago AWS Summit
Amazon RDS: Deep Dive - SRV310 - Chicago AWS SummitAmazon RDS: Deep Dive - SRV310 - Chicago AWS Summit
Amazon RDS: Deep Dive - SRV310 - Chicago AWS Summit
 
AWSome Day Leeds
AWSome Day Leeds AWSome Day Leeds
AWSome Day Leeds
 
A Data Culture with Embedded Analytics in Action
A Data Culture with Embedded Analytics in ActionA Data Culture with Embedded Analytics in Action
A Data Culture with Embedded Analytics in Action
 
Introduction to Amazon Relational Database Service
Introduction to Amazon Relational Database ServiceIntroduction to Amazon Relational Database Service
Introduction to Amazon Relational Database Service
 
Backup and Recovery with Cloud-Native Deduplication and Use Cases from the Fi...
Backup and Recovery with Cloud-Native Deduplication and Use Cases from the Fi...Backup and Recovery with Cloud-Native Deduplication and Use Cases from the Fi...
Backup and Recovery with Cloud-Native Deduplication and Use Cases from the Fi...
 
Architectures for HPC and HTC Workloads on AWS | AWS Public Sector Summit 2017
Architectures for HPC and HTC Workloads on AWS | AWS Public Sector Summit 2017Architectures for HPC and HTC Workloads on AWS | AWS Public Sector Summit 2017
Architectures for HPC and HTC Workloads on AWS | AWS Public Sector Summit 2017
 
Operations: Cost Optimization - Don't Overspend on Infrastructure
Operations: Cost Optimization - Don't Overspend on Infrastructure Operations: Cost Optimization - Don't Overspend on Infrastructure
Operations: Cost Optimization - Don't Overspend on Infrastructure
 
Create cloud service on AWS
Create cloud service on AWSCreate cloud service on AWS
Create cloud service on AWS
 
Accelerate your Business with SAP on AWS - AWS Summit Cape Town 2017
Accelerate your Business with SAP on AWS - AWS Summit Cape Town 2017 Accelerate your Business with SAP on AWS - AWS Summit Cape Town 2017
Accelerate your Business with SAP on AWS - AWS Summit Cape Town 2017
 
Module 2 - Datalake
Module 2 - DatalakeModule 2 - Datalake
Module 2 - Datalake
 
Migrating Large Scale Data Sets to the Cloud
Migrating Large Scale Data Sets to the CloudMigrating Large Scale Data Sets to the Cloud
Migrating Large Scale Data Sets to the Cloud
 
BDA302 Deep Dive on Migrating Big Data Workloads to Amazon EMR
BDA302 Deep Dive on Migrating Big Data Workloads to Amazon EMRBDA302 Deep Dive on Migrating Big Data Workloads to Amazon EMR
BDA302 Deep Dive on Migrating Big Data Workloads to Amazon EMR
 
AWS re:Invent 2016: Big Data Architectural Patterns and Best Practices on AWS...
AWS re:Invent 2016: Big Data Architectural Patterns and Best Practices on AWS...AWS re:Invent 2016: Big Data Architectural Patterns and Best Practices on AWS...
AWS re:Invent 2016: Big Data Architectural Patterns and Best Practices on AWS...
 
Bursting on-premise analytic workloads to Amazon EMR using Alluxio
Bursting on-premise analytic workloads to Amazon EMR using AlluxioBursting on-premise analytic workloads to Amazon EMR using Alluxio
Bursting on-premise analytic workloads to Amazon EMR using Alluxio
 
Deep Dive on Amazon RDS (May 2016)
Deep Dive on Amazon RDS (May 2016)Deep Dive on Amazon RDS (May 2016)
Deep Dive on Amazon RDS (May 2016)
 
ENT305 Migrating Your Databases to AWS: Deep Dive on Amazon Relational Databa...
ENT305 Migrating Your Databases to AWS: Deep Dive on Amazon Relational Databa...ENT305 Migrating Your Databases to AWS: Deep Dive on Amazon Relational Databa...
ENT305 Migrating Your Databases to AWS: Deep Dive on Amazon Relational Databa...
 
Building with AWS Databases: Match Your Workload to the Right Database (DAT30...
Building with AWS Databases: Match Your Workload to the Right Database (DAT30...Building with AWS Databases: Match Your Workload to the Right Database (DAT30...
Building with AWS Databases: Match Your Workload to the Right Database (DAT30...
 
Migrating to Amazon RDS with Database Migration Service
Migrating to Amazon RDS with Database Migration ServiceMigrating to Amazon RDS with Database Migration Service
Migrating to Amazon RDS with Database Migration Service
 

Viewers also liked

Google和apple招聘的那些事
Google和apple招聘的那些事Google和apple招聘的那些事
Google和apple招聘的那些事杰丰 余
 
大数据人才招聘
大数据人才招聘大数据人才招聘
大数据人才招聘杰丰 余
 
《重新定义团队》书籍介绍
《重新定义团队》书籍介绍《重新定义团队》书籍介绍
《重新定义团队》书籍介绍杰丰 余
 
理解素质 人才管理的基础
理解素质 人才管理的基础理解素质 人才管理的基础
理解素质 人才管理的基础杰丰 余
 
5个简单技巧学创新
5个简单技巧学创新5个简单技巧学创新
5个简单技巧学创新杰丰 余
 
24个经典心理学实验总结
24个经典心理学实验总结24个经典心理学实验总结
24个经典心理学实验总结杰丰 余
 
驱动力 蜜蜂笔记(全集)
驱动力 蜜蜂笔记(全集)驱动力 蜜蜂笔记(全集)
驱动力 蜜蜂笔记(全集)杰丰 余
 
如何利用 Amazon EMR 及Athena 打造高成本效益的大數據環境
如何利用 Amazon EMR 及Athena 打造高成本效益的大數據環境如何利用 Amazon EMR 及Athena 打造高成本效益的大數據環境
如何利用 Amazon EMR 及Athena 打造高成本效益的大數據環境Amazon Web Services
 
Dev-Ops与Docker的最佳实践 QCon2016 北京站演讲
Dev-Ops与Docker的最佳实践 QCon2016 北京站演讲Dev-Ops与Docker的最佳实践 QCon2016 北京站演讲
Dev-Ops与Docker的最佳实践 QCon2016 北京站演讲ChinaNetCloud
 
Building microservices in python @ pycon2017
Building microservices in python @ pycon2017Building microservices in python @ pycon2017
Building microservices in python @ pycon2017Jonas Cheng
 
以AWS Lambda與Amazon API Gateway打造無伺服器後端
以AWS Lambda與Amazon API Gateway打造無伺服器後端以AWS Lambda與Amazon API Gateway打造無伺服器後端
以AWS Lambda與Amazon API Gateway打造無伺服器後端Amazon Web Services
 
初探 AWS 平台上的 Docker 服務
初探 AWS 平台上的 Docker 服務初探 AWS 平台上的 Docker 服務
初探 AWS 平台上的 Docker 服務Amazon Web Services
 
如何規劃與執行大型資料中心遷移和案例分享
如何規劃與執行大型資料中心遷移和案例分享如何規劃與執行大型資料中心遷移和案例分享
如何規劃與執行大型資料中心遷移和案例分享Amazon Web Services
 
AWS電商和零售業解決方案介紹
AWS電商和零售業解決方案介紹AWS電商和零售業解決方案介紹
AWS電商和零售業解決方案介紹Amazon Web Services
 
應用程式迅速開發與串連廣大用戶要素
應用程式迅速開發與串連廣大用戶要素應用程式迅速開發與串連廣大用戶要素
應用程式迅速開發與串連廣大用戶要素Amazon Web Services
 
零到千万可扩展架构 AWS Architecture Overview
零到千万可扩展架构 AWS Architecture Overview零到千万可扩展架构 AWS Architecture Overview
零到千万可扩展架构 AWS Architecture OverviewLeon Li
 
客戶導入雲端的經驗分享 [Panel Discussion]
客戶導入雲端的經驗分享 [Panel Discussion]客戶導入雲端的經驗分享 [Panel Discussion]
客戶導入雲端的經驗分享 [Panel Discussion]Amazon Web Services
 
(SDD422) Amazon VPC Deep Dive | AWS re:Invent 2014
(SDD422) Amazon VPC Deep Dive | AWS re:Invent 2014(SDD422) Amazon VPC Deep Dive | AWS re:Invent 2014
(SDD422) Amazon VPC Deep Dive | AWS re:Invent 2014Amazon Web Services
 
Serverless api gateway + lambda
Serverless api gateway + lambdaServerless api gateway + lambda
Serverless api gateway + lambdaLeon Li
 

Viewers also liked (20)

Google和apple招聘的那些事
Google和apple招聘的那些事Google和apple招聘的那些事
Google和apple招聘的那些事
 
大数据人才招聘
大数据人才招聘大数据人才招聘
大数据人才招聘
 
《重新定义团队》书籍介绍
《重新定义团队》书籍介绍《重新定义团队》书籍介绍
《重新定义团队》书籍介绍
 
理解素质 人才管理的基础
理解素质 人才管理的基础理解素质 人才管理的基础
理解素质 人才管理的基础
 
5个简单技巧学创新
5个简单技巧学创新5个简单技巧学创新
5个简单技巧学创新
 
24个经典心理学实验总结
24个经典心理学实验总结24个经典心理学实验总结
24个经典心理学实验总结
 
驱动力 蜜蜂笔记(全集)
驱动力 蜜蜂笔记(全集)驱动力 蜜蜂笔记(全集)
驱动力 蜜蜂笔记(全集)
 
如何利用 Amazon EMR 及Athena 打造高成本效益的大數據環境
如何利用 Amazon EMR 及Athena 打造高成本效益的大數據環境如何利用 Amazon EMR 及Athena 打造高成本效益的大數據環境
如何利用 Amazon EMR 及Athena 打造高成本效益的大數據環境
 
Dev-Ops与Docker的最佳实践 QCon2016 北京站演讲
Dev-Ops与Docker的最佳实践 QCon2016 北京站演讲Dev-Ops与Docker的最佳实践 QCon2016 北京站演讲
Dev-Ops与Docker的最佳实践 QCon2016 北京站演讲
 
Building microservices in python @ pycon2017
Building microservices in python @ pycon2017Building microservices in python @ pycon2017
Building microservices in python @ pycon2017
 
以AWS Lambda與Amazon API Gateway打造無伺服器後端
以AWS Lambda與Amazon API Gateway打造無伺服器後端以AWS Lambda與Amazon API Gateway打造無伺服器後端
以AWS Lambda與Amazon API Gateway打造無伺服器後端
 
初探 AWS 平台上的 Docker 服務
初探 AWS 平台上的 Docker 服務初探 AWS 平台上的 Docker 服務
初探 AWS 平台上的 Docker 服務
 
如何規劃與執行大型資料中心遷移和案例分享
如何規劃與執行大型資料中心遷移和案例分享如何規劃與執行大型資料中心遷移和案例分享
如何規劃與執行大型資料中心遷移和案例分享
 
AWS電商和零售業解決方案介紹
AWS電商和零售業解決方案介紹AWS電商和零售業解決方案介紹
AWS電商和零售業解決方案介紹
 
應用程式迅速開發與串連廣大用戶要素
應用程式迅速開發與串連廣大用戶要素應用程式迅速開發與串連廣大用戶要素
應用程式迅速開發與串連廣大用戶要素
 
深入探討雲端安全
深入探討雲端安全深入探討雲端安全
深入探討雲端安全
 
零到千万可扩展架构 AWS Architecture Overview
零到千万可扩展架构 AWS Architecture Overview零到千万可扩展架构 AWS Architecture Overview
零到千万可扩展架构 AWS Architecture Overview
 
客戶導入雲端的經驗分享 [Panel Discussion]
客戶導入雲端的經驗分享 [Panel Discussion]客戶導入雲端的經驗分享 [Panel Discussion]
客戶導入雲端的經驗分享 [Panel Discussion]
 
(SDD422) Amazon VPC Deep Dive | AWS re:Invent 2014
(SDD422) Amazon VPC Deep Dive | AWS re:Invent 2014(SDD422) Amazon VPC Deep Dive | AWS re:Invent 2014
(SDD422) Amazon VPC Deep Dive | AWS re:Invent 2014
 
Serverless api gateway + lambda
Serverless api gateway + lambdaServerless api gateway + lambda
Serverless api gateway + lambda
 

Similar to AWS Big Data Analytics Solutions

Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...Amazon Web Services
 
Data Warehouses & Data Lakes: Data Analytics Week SF
Data Warehouses & Data Lakes: Data Analytics Week SFData Warehouses & Data Lakes: Data Analytics Week SF
Data Warehouses & Data Lakes: Data Analytics Week SFAmazon Web Services
 
Building a Data Lake in Amazon S3 & Amazon Glacier (STG401-R1) - AWS re:Inven...
Building a Data Lake in Amazon S3 & Amazon Glacier (STG401-R1) - AWS re:Inven...Building a Data Lake in Amazon S3 & Amazon Glacier (STG401-R1) - AWS re:Inven...
Building a Data Lake in Amazon S3 & Amazon Glacier (STG401-R1) - AWS re:Inven...Amazon Web Services
 
AWS Lake Formation Deep Dive
AWS Lake Formation Deep DiveAWS Lake Formation Deep Dive
AWS Lake Formation Deep DiveCobus Bernard
 
Data Warehouses & Data Lakes: Data Analytics Week at the SF Loft
Data Warehouses & Data Lakes: Data Analytics Week at the SF LoftData Warehouses & Data Lakes: Data Analytics Week at the SF Loft
Data Warehouses & Data Lakes: Data Analytics Week at the SF LoftAmazon Web Services
 
ABD201-Big Data Architectural Patterns and Best Practices on AWS
ABD201-Big Data Architectural Patterns and Best Practices on AWSABD201-Big Data Architectural Patterns and Best Practices on AWS
ABD201-Big Data Architectural Patterns and Best Practices on AWSAmazon Web Services
 
Building Data Lake on AWS | AWS Floor28
Building Data Lake on AWS | AWS Floor28Building Data Lake on AWS | AWS Floor28
Building Data Lake on AWS | AWS Floor28Amazon Web Services
 
AWS Floor 28 - Building Data lake on AWS
AWS Floor 28 - Building Data lake on AWSAWS Floor 28 - Building Data lake on AWS
AWS Floor 28 - Building Data lake on AWSAdir Sharabi
 
Leveraging Cloud Analytics to Support Data-Driven Decisions
Leveraging Cloud Analytics to Support Data-Driven DecisionsLeveraging Cloud Analytics to Support Data-Driven Decisions
Leveraging Cloud Analytics to Support Data-Driven DecisionsAmazon Web Services
 
Everything You Need to Know About Big Data: From Architectural Principles to ...
Everything You Need to Know About Big Data: From Architectural Principles to ...Everything You Need to Know About Big Data: From Architectural Principles to ...
Everything You Need to Know About Big Data: From Architectural Principles to ...Amazon Web Services
 
Preparing Your Data for Cloud Analytics & AI/ML
Preparing Your Data for Cloud Analytics & AI/ML Preparing Your Data for Cloud Analytics & AI/ML
Preparing Your Data for Cloud Analytics & AI/ML Amazon Web Services
 
AWS Data Lake: data analysis @ scale
AWS Data Lake: data analysis @ scaleAWS Data Lake: data analysis @ scale
AWS Data Lake: data analysis @ scaleAmazon Web Services
 
Applying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS Summit
Applying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS SummitApplying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS Summit
Applying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS SummitAmazon Web Services
 
Builders' Day - Building Data Lakes for Analytics On AWS LC
Builders' Day - Building Data Lakes for Analytics On AWS LCBuilders' Day - Building Data Lakes for Analytics On AWS LC
Builders' Day - Building Data Lakes for Analytics On AWS LCAmazon Web Services LATAM
 

Similar to AWS Big Data Analytics Solutions (20)

Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
 
Data Warehouses and Data Lakes
Data Warehouses and Data LakesData Warehouses and Data Lakes
Data Warehouses and Data Lakes
 
Data Warehouses and Data Lakes
Data Warehouses and Data LakesData Warehouses and Data Lakes
Data Warehouses and Data Lakes
 
Data Warehouses and Data Lakes
Data Warehouses and Data LakesData Warehouses and Data Lakes
Data Warehouses and Data Lakes
 
Data Warehouses and Data Lakes
Data Warehouses and Data LakesData Warehouses and Data Lakes
Data Warehouses and Data Lakes
 
Data Warehouses & Data Lakes: Data Analytics Week SF
Data Warehouses & Data Lakes: Data Analytics Week SFData Warehouses & Data Lakes: Data Analytics Week SF
Data Warehouses & Data Lakes: Data Analytics Week SF
 
Building a Data Lake in Amazon S3 & Amazon Glacier (STG401-R1) - AWS re:Inven...
Building a Data Lake in Amazon S3 & Amazon Glacier (STG401-R1) - AWS re:Inven...Building a Data Lake in Amazon S3 & Amazon Glacier (STG401-R1) - AWS re:Inven...
Building a Data Lake in Amazon S3 & Amazon Glacier (STG401-R1) - AWS re:Inven...
 
AWS Lake Formation Deep Dive
AWS Lake Formation Deep DiveAWS Lake Formation Deep Dive
AWS Lake Formation Deep Dive
 
Data Warehouses and Data Lakes
Data Warehouses and Data LakesData Warehouses and Data Lakes
Data Warehouses and Data Lakes
 
Data Warehouses & Data Lakes: Data Analytics Week at the SF Loft
Data Warehouses & Data Lakes: Data Analytics Week at the SF LoftData Warehouses & Data Lakes: Data Analytics Week at the SF Loft
Data Warehouses & Data Lakes: Data Analytics Week at the SF Loft
 
ABD201-Big Data Architectural Patterns and Best Practices on AWS
ABD201-Big Data Architectural Patterns and Best Practices on AWSABD201-Big Data Architectural Patterns and Best Practices on AWS
ABD201-Big Data Architectural Patterns and Best Practices on AWS
 
Building Data Lake on AWS | AWS Floor28
Building Data Lake on AWS | AWS Floor28Building Data Lake on AWS | AWS Floor28
Building Data Lake on AWS | AWS Floor28
 
AWS Floor 28 - Building Data lake on AWS
AWS Floor 28 - Building Data lake on AWSAWS Floor 28 - Building Data lake on AWS
AWS Floor 28 - Building Data lake on AWS
 
Data_Analytics_and_AI_ML
Data_Analytics_and_AI_MLData_Analytics_and_AI_ML
Data_Analytics_and_AI_ML
 
Leveraging Cloud Analytics to Support Data-Driven Decisions
Leveraging Cloud Analytics to Support Data-Driven DecisionsLeveraging Cloud Analytics to Support Data-Driven Decisions
Leveraging Cloud Analytics to Support Data-Driven Decisions
 
Everything You Need to Know About Big Data: From Architectural Principles to ...
Everything You Need to Know About Big Data: From Architectural Principles to ...Everything You Need to Know About Big Data: From Architectural Principles to ...
Everything You Need to Know About Big Data: From Architectural Principles to ...
 
Preparing Your Data for Cloud Analytics & AI/ML
Preparing Your Data for Cloud Analytics & AI/ML Preparing Your Data for Cloud Analytics & AI/ML
Preparing Your Data for Cloud Analytics & AI/ML
 
AWS Data Lake: data analysis @ scale
AWS Data Lake: data analysis @ scaleAWS Data Lake: data analysis @ scale
AWS Data Lake: data analysis @ scale
 
Applying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS Summit
Applying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS SummitApplying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS Summit
Applying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS Summit
 
Builders' Day - Building Data Lakes for Analytics On AWS LC
Builders' Day - Building Data Lakes for Analytics On AWS LCBuilders' Day - Building Data Lakes for Analytics On AWS LC
Builders' Day - Building Data Lakes for Analytics On AWS LC
 

More from Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Recently uploaded

A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 

Recently uploaded (20)

A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 

AWS Big Data Analytics Solutions

  • 1. .©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. John  Chang Ecosystem  Solutions  Architect April  2016 大數據運算 媒體業案例分享
  • 2. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. What  is  big  data? Big  data  on  AWS Northbay customer  case  studies Best  practices APN  resources
  • 3. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. What  Is  Big  Data  &  Why  Do  We  Care?
  • 4. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. GB TB PB ZB EB Big  Data:  Unconstrained  Growth 95%  of  the  1.2   zettabytes of  data  in  the   digital  universe  is   unstructured 70%  of  this  data  is  user-­ generated  content   Unstructured  data   growth  is  explosive Machine  data/IoT will   only  steepen  the  curve Source:  IDC
  • 5. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Data  Gap 1990 2000 2010 2020 Generated  Data Available  for  Analysis Data  Volume Sources:   Gartner:  User  Survey  Analysis:  Key  Trends  Shaping  the  Future  of  Data  Center  Infrastructure  Through  2011   IDC:  Worldwide  Business  Analytics  Software  2012–2016  Forecast  and  2011  Vendor  Shares  
  • 6. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Big  Data  Evolution Batch Report Real-­‐time   Alerts Prediction Forecast
  • 7. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Plethora  of  Tools Amazon   Glacier S3 DynamoDB   RDS EMR Amazon   Redshift Data  PipelineAmazon  Kinesis   Cassandra CloudSearch Kinesis-­ enabled   app Lambda ML SQS ElastiCache DynamoDB Streams  
  • 8. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. A  complete  platform  for  big  data  &  analytics Retrospective analysis  and   reporting Here-­and-­now real-­time  processing   and  dashboards Predictions to  enable  smart   applications Amazon  Kinesis   Amazon  EC2   Amazon  Redshift   Amazon  EMR Amazon  ML Amazon  EMR
  • 9. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Is  there  a  reference  architecture  ? What  tools  should  I  use  ? How  ?   Why  ?
  • 10. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. http://aws.amazon.com/marketplace Big  Data  Case  Studies Learn  from  other  AWS  customers aws.amazon.com/solutions/case-­studies/big-­data
  • 11. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Simplify  Big  Data  Processing ingest  / collect store process  / analyze consume  /   visualize Time  to  Answer  (Latency) Throughput Cost
  • 12. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Collect  / Ingest  
  • 13. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Types  of  Data • Transactional • Database  reads  &  writes  (OLTP) • Cache   • Search • Logs • Streams • File • Log  files  (/var/log) • Log  collectors  &  frameworks • Stream • Log  records • Sensors  &  IoT data Database File Storage Stream Storage A iOS Android Web  Apps Logstash LoggingIoTApplications Transactional Data File Data Stream Data Mobile   Apps Search Data Search Collect Store LoggingIoT
  • 14. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Store
  • 15. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Stream   Storage A iOS Android Web  Apps Logstash Amazon RDS Amazon DynamoDB Amazon ES Amazon S3 Apache Kafka Amazon Glacier Amazon Kinesis Amazon DynamoDB Amazon ElastiCache SearchSQLNoSQLCacheStreamStorageFileStorage Transactional Data File Data Stream Data Mobile   Apps Search Data Database File Storage Search Collect Store LoggingIoTApplications ü
  • 16. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Why  Is  Amazon  S3  Good  for  Big  Data? • Natively  supported  by  big  data  frameworks (Spark,  Hive,  Presto,  etc.)   • No  need  to  run  compute  clusters  for  storage  (unlike  HDFS) • Can  run  transient  Hadoop  clusters  &  Amazon  EC2  Spot  instances • Multiple  distinct  (Spark,  Hive,  Presto)  clusters  can  use  the  same  data • Unlimited  number  of  objects   • Very  high  bandwidth    – no  aggregate  throughput  limit • Highly  available  – can  tolerate  AZ  failure • Designed  for  99.999999999%  durability • Tired-­storage  (Standard,  IA,  Amazon  Glacier)  via  life-­cycle  policy • Secure  – SSL,  client/server-­side  encryption  at  rest • Low  cost
  • 17. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. What  about  HDFS  &  Amazon  Glacier? • Use  HDFS  for  very  frequently   accessed  (hot)  data • Use  Amazon  S3  Standard  for   frequently  accessed  data   • Use  Amazon  S3  Standard  – IA  for  infrequently  accessed   data • Use  Amazon  Glacier  for   archiving  cold  data  
  • 18. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Database  +   Search   Tier A iOS Android Web  Apps Logstash Amazon RDS Amazon DynamoDB Amazon ES Amazon S3 Apache Kafka Amazon Glacier Amazon Kinesis Amazon DynamoDB Amazon ElastiCache SearchSQLNoSQLCacheStreamStorageFileStorage Transactional Data File Data Stream Data Mobile   Apps Search Data Collect Store ü
  • 19. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Database  +  Search  Tier  Anti-­pattern Database  +  Search  Tier
  • 20. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Best  Practice  — Use  the  Right  Tool  for  the  Job Data  Tier Search Amazon   Elasticsearch Service Amazon   CloudSearch Cache Redis Memcached SQL Amazon  Aurora MySQL PostgreSQL Oracle SQL  Server NoSQL Cassandra Amazon   DynamoDB HBase MongoDB Database  +  Search  Tier
  • 21. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. What  Data  Store  Should  I  Use? • Data  structure  →  Fixed  schema,  JSON,  key-­value • Access  patterns  →  Store  data  in  the  format  you  will   access  it • Data  /  access  characteristics  →  Hot,  warm,  cold • Cost  →  Right  cost
  • 22. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Data  Structure  and  Access  Patterns Access  Patterns What  to  use? Put/Get  (Key, Value) Cache,  NoSQL Simple relationships  →  1:N, M:N NoSQL Cross table  joins,  transaction,  SQL SQL Faceting,  Search   Search Data Structure What  to  use? Fixed  schema SQL,  NoSQL Schema-­free (JSON) NoSQL,  Search (Key, Value) Cache,  NoSQL
  • 23. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Data  /  Access  Characteristics:  Hot,  Warm,  Cold Hot Warm Cold Volume MB–GB GB–TB PB Item  size B–KB KB–MB KB–TB Latency ms ms,  sec min,  hrs Durability Low–High High Very  High Request  rate Very  High High Low Cost/GB $$-­$ $-­¢¢ ¢ Hot  Data Warm  Data Cold  Data
  • 24. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. What  Data  Store  Should  I  Use? Amazon   ElastiCache Amazon DynamoDB Amazon Aurora Amazon Elasticsearch Amazon   EMR  (HDFS) Amazon  S3 Amazon Glacier Average   latency ms ms ms,  sec ms,sec sec,min,hrs ms,sec,min (~  size) hrs Data  volume GB GB–TBs (no limit) GB–TB (64  TB   Max) GB–TB GB–PB (~nodes) MB–PB (no limit) GB–PB (no limit) Item  size B-­KB KB (400  KB   max) KB (64  KB) KB (1  MB  max) MB-­GB KB-­GB (5  TB max) GB (40  TB  max) Request  rate High  -­ Very  High Very  High (no  limit) High High Low  – Very   High Low  – Very  High (no limit) Very  Low Storage  cost GB/month $$ ¢¢ ¢¢ ¢¢ ¢ ¢ ¢/10 Durability Low  -­ Moderate Very  High Very  High High High Very  High Very  High Hot  Data Warm  Data Cold  Data Hot  Data Warm  Data Cold  Data
  • 25. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Process  / Analyze
  • 26. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. AnalyzeA iOS Android Web  Apps Logstash Amazon RDS Amazon DynamoDB Amazon ES Amazon S3 Apache Kafka Amazon Glacier Amazon Kinesis Amazon DynamoDB Amazon Redshift Impala Pig Amazon ML Streaming Amazon Kinesis AWS Lambda AmazonElasticMapReduce Amazon ElastiCache SearchSQLNoSQLCache StreamProcessingBatchInteractive Logging StreamStorage IoTApplications FileStorage Hot Cold War m Hot Hot ML Transactional Data File Data Stream Data Mobile   Apps Search Data Collect Store Analyze ü ü
  • 27. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Process  /  Analyze Analysis  of  data is  a  process  of  inspecting,  cleaning,   transforming,  and  modeling data with  the  goal  of  discovering   useful information,  suggesting  conclusions,  and  supporting   decision-­making. Examples • Interactive  dashboards  → Interactive  analytics • Daily/weekly/monthly  reports  →  Batch  analytics • Billing/fraud  alerts,  1  minute  metrics  →  Real-­time  analytics • Sentiment  analysis,  prediction  models  →  Machine  learning
  • 28. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Spark  Streaming   Apache  Storm AWS  Lambda KCL Amazon   Redshift Spark   Impala   Presto Hive Amazon Redshift Hive Spark   Presto Impala Amazon   Kinesis Apache  Kafka Amazon   DynamoDB Amazon  S3data Hot Cold Data  Temperature Processing  Latency Low High Answers Amazon  EMR   (HDFS) Hive Native KCL AWS  Lambda Data  Temperature  vs  Processing  Latency Batch
  • 29. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Interactive  Analytics Takes  large  amount  of  (warm/cold)  data Takes  seconds to  get  answers  back Example:  Self-­service  dashboards
  • 30. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Batch  Analytics Takes  large  amount  of  (warm/cold)  data Takes  minutes  or  hours to  get  answers  back Example:  Generating  daily,  weekly,  or  monthly  reports
  • 31. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Real-­Time  Analytics Take  small  amount  of  hot  data  and  ask  questions   Takes  short  amount  of  time  (milliseconds  or  seconds)  to   get  your  answer  back • Real-­time  (event) • Real-­time  response  to  events  in  data  streams • Example:  Billing/Fraud  Alerts   • Near  real-­time  (micro-­batch) • Near  real-­time  operations  on  small  batches  of  events  in  data   streams • Example:  1  Minute  Metrics
  • 32. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Predictions  via  Machine  Learning ML  gives  computers  the  ability  to  learn  without  being  explicitly   programmed Machine  Learning  Algorithms: -­ Supervised  Learning  ←  “teach”  program -­ Classification  ← Is  this  transaction  fraud?  (Yes/No)   -­ Regression  ← Customer  Life-­time  value?   -­ Unsupervised  Learning  ←  let  it  learn  by  itself -­ Clustering  ←  Market  Segmentation
  • 33. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Analysis  Tools  and  Frameworks Machine  Learning • Mahout,  Spark  ML,  Amazon  ML Interactive  Analytics • Amazon  Redshift,  Presto,  Impala,  Spark Batch  Processing • MapReduce,  Hive,  Pig,  Spark Stream  Processing • Micro-­batch:  Spark  Streaming,  KCL,  Hive,  Pig • Real-­time:  Storm,  AWS  Lambda,  KCL Amazon Redshift Impala Pig Amazon Machine Learning Streaming Amazon Kinesis AWS Lambda AmazonElasticMapReduce StreamProcessingBatchInteractiveML Analyze
  • 34. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Real-­time  Analytics Producer Apache Kafka KCL AWS  Lambda Spark Streaming Apache   Storm Amazon   SNS Amazon ML Notifications Amazon ElastiCache (Redis) Amazon DynamoDB Amazon RDS Amazon ES Alert App  state Real-­time  Prediction KPI process store DynamoDB Streams Amazon   Kinesis
  • 35. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Interactive  &   Batch Analytics Producer Amazon  S3 Amazon  EMR Hive Pig Spark Amazon ML process store Consume Amazon   Redshift Amazon  EMR Presto Impala Spark Batch Interactive Batch  Prediction Real-­time  Prediction
  • 36. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Batch  Layer Amazon Kinesis data process store Lambda  Architecture Amazon   Kinesis  S3   Connector   Amazon  S3 A p p l i c a t i o n s Amazon   Redshift Amazon  EMR Presto Hive Pig Spark answer Speed  Layer answer Serving   Layer Amazon ElastiCache Amazon DynamoDB Amazon RDS Amazon ES answer Amazon ML KCL AWS  Lambda Spark  Streaming Storm
  • 37. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Consume  /   Visualize
  • 38. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Collect Store Analyze Consume A iOS Android Web  Apps Logstash Amazon RDS Amazon DynamoDB Amazon ES Amazon S3 Apache Kafka Amazon Glacier Amazon Kinesis Amazon DynamoDB Amazon Redshift Impala Pig Amazon ML Streaming Amazon Kinesis AWS Lambda AmazonElasticMapReduce Amazon ElastiCache SearchSQLNoSQLCache StreamProcessingBatchInteractive Logging StreamStorage IoTApplications FileStorage Analysis&Visualization Hot Cold War m Hot Slow Hot ML Fast Fast Transactional Data File Data Stream Data Notebook s Predictions Apps & APIs Mobile   Apps IDE Search Data ETL Amazon   QuickSight
  • 39. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Consume • Predictions   • Analysis  and  Visualization • Notebooks • IDE • Applications  &  API Consume Analysis&Visualization Amazon   QuickSight Notebook s Predictions Apps & APIs IDE Store Analyze ConsumeETL Business   users Data  Scientist,   Developers
  • 40. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Putting  It  All  Together
  • 41. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Collect Store Analyze Consume A iOS Android Web  Apps Logstash Amazon RDS Amazon DynamoDB Amazon ES Amazon S3 Apache Kafka Amazon Glacier Amazon Kinesis Amazon DynamoDB Amazon Redshift Impala Pig Amazon ML Streaming Amazon Kinesis AWS Lambda AmazonElasticMapReduce Amazon ElastiCache SearchSQLNoSQLCache StreamProcessingBatchInteractive Logging StreamStorage IoTApplications FileStorage Analysis&Visualization Hot Cold War m Hot Slow Hot ML Fast Fast Amazon   QuickSight Transactional Data File Data Stream Data Notebook s Predictions Apps & APIs Mobile   Apps IDE Search Data ETL Reference  Architecture
  • 42. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Problem  Statement: • Need  massive  scalability  and  elasticity Use  of  AWS: • Nearly  100%  of  its  online  video  service  on  AWS • Global  use  of  Amazon  EC2,  Amazon  S3,  Amazon  SQS,   Amazon  EMR,  Lambda,  etc. • 30-­50K  EC2  instances Business  Benefits:   • Application  achieves  near  zero  downtime • Massive  scalability  and  elasticity • Transcoding  entire  library  to  ~60  output  renditions “AWS  is  the  market  leader  and  has  been  able  to  create  a  continuous  and  virtuous  cycle.”   – Kevin  McEntee,  VP  Content  Engineering,  Netflix Customer:  Netflix
  • 43. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. AdRoll Builds  Bidding  Platform  on  AWS  and  Cuts  Costs  by  83% AdRoll is  a  global  leader  in  digital  advertising   retargeting  products. We’ve  been  able  to   seamlessly  scale  our   infrastructure  and  reduce  our   fixed  costs  by  75%  and   operational  costs  by  83%.” Valentino  Volonghi CTO,  AdRoll ” “ • AdRoll manages  its  Real-­Time  Bidding  platform  using   Amazon  EC2,  Amazon  Dynmo DB,  and  Amazon  S3 • Reduced  annual  operational  costs  by  83% • Reduced  fixed  costs  by  75% • Staff  now  95%  focused  on  new  product  development  
  • 44. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Problem  Statement: • Needed  scalable,  high  performance,  and  highly  available   storage  and  big  data  solutions Use  of  AWS: • Direct  Connect,  S3,  EMR,  other  AWS  services • Went  from  ~5GB  of  logs  per  day  to  ~1300GB/day Business  Benefits:   • By  moving  to  AWS,  went  from  spending  $50K/mo to   $13K/mo on  big  data  solutions Xfinity X1  Set  Top  Box  Platform Customer:  Comcast
  • 45. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. MLB  Advanced  Media 「消費者行為正在改變。他們從行動裝置上網購物,這種技 術對於球賽的進化非常重要。」 「我們的努力中最令人興奮的事,就是 AWS  支援的 Statcast。我們首次可以測量以前無法測量的資料。」
  • 46. ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Partnering  with  AWS
  • 47. Thank  you! ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved. Questions?
  • 48. Thank  you! ©  2015,  Amazon  Web  Services,  Inc.  or  its  Affiliates.  All  rights  reserved.