Level 200 Hands-on Workshop
Visualize your data in Data Lake with
AWS Athena and AWS Quicksight
Jeff Ng, Solutions Architect
eCloudvalley 27 July, 2017
Agenda
• About eCloudvalley
• Overview of Amazon Athena
• Overview of Amazon QuickSight
• Athena + QuickSight vs. ELK
• Demo
• Lab
About eCloudvalley
Create a New Cloud Standard by
becoming
TRUE CLOUD EXPERT.
10 X All 5-Cert Engineers
90+ AWS Certifications
AWS Authorized Instructor
Microsoft Certified Trainer
100% Focus on the
Largest Cloud- AWS
Born-in-the-cloud
System Integrator
https://aws.amazon.com/partners/premier/
The 1st and the Only Premier Partner in GCR
AWS China
Region
Consulting
Partner
AWS Premier
Consulting
Partner
(1st and the Only
Premier Partner in
GCR)
Marketing &
Commerce
Competency
AWS Audited
Managed
Services
Provider
Mobile
Competency
Our Offering
Official
AWS Training
DevOps/
Optimization
Next Generation
Managed Services
Fully Managed
AWS
Migration
DataOps/
Big Data/ BI
Overview of Data Analytics
and Visualization
Athena, S3, QuickSight
Challenges Customers Faced
Significant amount of work required to analyze data in
Amazon S3
Users often only have access to aggregated data sets
Managing a Hadoop cluster or data warehouse requires
expertise
Introducing Amazon Athena
Amazon Athena is an interactive query service that
makes it easy to analyze data directly from Amazon
S3 using Standard SQL
Athena is Serverless
No Infrastructure or
administration
Zero Spin up time
Transparent upgrades
Amazon Athena is Easy To Use
Log into the Console
Create a table
Type in a Hive DDL Statement
Use the console Add Table wizard
Start querying
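The "Type in a Hive DDL Statement" step can be sketched as follows. This is a minimal illustration that only builds the statement text; the database, table, column names and the S3 location are hypothetical placeholders, and the helper `build_create_table_ddl` is not part of any AWS SDK.

```python
def build_create_table_ddl(database, table, columns, location, delimiter=","):
    """Return a Hive-style CREATE EXTERNAL TABLE statement for delimited text files."""
    column_lines = ",\n".join(f"  `{name}` {col_type}" for name, col_type in columns)
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS {database}.{table} (\n"
        f"{column_lines}\n"
        ")\n"
        f"ROW FORMAT DELIMITED FIELDS TERMINATED BY '{delimiter}'\n"
        f"LOCATION '{location}'"
    )


# Hypothetical names, for illustration only.
ddl = build_create_table_ddl(
    database="workshop_db",
    table="web_logs",
    columns=[("request_time", "string"), ("status", "int"), ("bytes_sent", "bigint")],
    location="s3://example-bucket/web-logs/",
)
print(ddl)
```

The printed statement could be pasted into the query editor. The table is only metadata pointing at the S3 location, so no data is loaded.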
Amazon Athena is Highly Available
You connect to a service endpoint or log into the console
Athena uses warm compute pools across multiple
Availability Zones
Your data is in Amazon S3, which is also highly available
and designed for 99.999999999% durability
Query Data Directly from Amazon S3
No loading of data
Query data in its raw format
Text, CSV, JSON, weblogs, AWS service logs
Convert to an optimized form like ORC or Parquet for the best
performance and lowest cost
No ETL required
Stream data directly from Amazon S3
Take advantage of Amazon S3 durability and availability
Use ANSI SQL
Start writing ANSI SQL
Support for complex joins, nested
queries & window functions
Support for complex data types
(arrays, structs)
Support for partitioning of data by
any key
(date, time, custom keys)
e.g., Year, Month, Day, Hour or
Customer Key, Date
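Partitioning by Year, Month, Day, Hour is commonly laid out in S3 as Hive-style `key=value` prefixes. The sketch below shows one way to derive such a prefix from a timestamp; the bucket name and the helper `partition_prefix` are assumptions for illustration, not something the workshop prescribes.

```python
from datetime import datetime


def partition_prefix(base, event_time):
    """Return a Hive-style key=value S3 prefix for an hourly partition."""
    return (
        f"{base.rstrip('/')}/"
        f"year={event_time:%Y}/month={event_time:%m}/"
        f"day={event_time:%d}/hour={event_time:%H}/"
    )


# Hypothetical bucket and timestamp.
prefix = partition_prefix("s3://example-bucket/web-logs", datetime(2017, 7, 27, 14, 5))
print(prefix)  # s3://example-bucket/web-logs/year=2017/month=07/day=27/hour=14/
```

With a layout like this, a query that filters on the partition keys only needs to read the matching prefixes, which reduces the amount of data scanned.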
Familiar Technologies Under the Covers
Presto
Used for SQL queries
In-memory distributed query engine
ANSI-SQL compatible with extensions
Apache Hive
Used for DDL functionality
Complex data types
Multitude of formats
Supports data partitioning
Amazon Athena Supports Multiple Data
Formats
Text files, e.g., CSV, raw logs
Apache Web Logs, TSV files
JSON (simple, nested)
Compressed files
Columnar formats such as Apache Parquet & Apache ORC
Avro support
Amazon Athena is Fast
Tuned for performance
Automatically parallelizes
queries
Results are streamed to console
Results also stored in S3
Improve Query performance
Compress your data
Use columnar formats
Amazon Athena is Cost Effective
Pay per query
$5 per TB scanned from S3
DDL Queries and failed queries are free
Save by using compression, columnar formats, partitions
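The pay-per-query model can be worked through with a small calculation. The $5 per TB figure comes from the slide; the data size and the reduction factor are hypothetical values chosen only to show how compression or columnar formats lower the bill.

```python
PRICE_PER_TB_USD = 5.0  # price per TB scanned, as quoted on the slide


def query_cost_usd(tb_scanned):
    """Cost of a single query that scans `tb_scanned` terabytes."""
    return tb_scanned * PRICE_PER_TB_USD


raw_tb = 2.0              # hypothetical: the query scans 2 TB of raw text
assumed_reduction = 0.25  # hypothetical: the optimized copy scans one quarter as much

cost_raw = query_cost_usd(raw_tb)
cost_optimized = query_cost_usd(raw_tb * assumed_reduction)
print(f"raw: ${cost_raw:.2f}, optimized: ${cost_optimized:.2f}")
```

The actual reduction depends on the data and the format chosen, so the ratio above should be measured rather than assumed.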
A Sample Pipeline
Ad-hoc access to raw data using SQL
Athena can query aggregated datasets as well
Summary
No ETL required. No loading of data. Query data where it lives
Query data at whatever latitude and longitude you want
No infrastructure to manage
Accessing Amazon Athena
Simple Query
editor with key
bindings
Can also see a detailed view
in the catalog tab
You can also check the
properties. Note the location.
Use the JDBC Driver
Using Amazon Athena with Amazon QuickSight
Amazon S3
Amazon RDS
Amazon Redshift
Amazon Athena
QuickSight allows you to connect to data from a wide variety of AWS, third-party, and on-premises sources, including Amazon Athena
Overview of Amazon QuickSight
What are the big data challenges
our customers face?
Who are my top customers and what are they buying?
Which devices are due for maintenance?
What is my product profitability by region?
Why is my most profitable region not growing?
How much inventory do I have?
Has my fraud account expense
increased?
How is my marketing campaign performing?
How is my employee satisfaction
trending?
Lots of data
Lots and lots of questions
Few insights
Old-guard BI
Costs too much
Pay $ million before seeing first analysis
3 year TCO $150 to $250 per user per month
Takes too long
Spend 6 to 12 months of consulting
and software implementation time
Introducing
Amazon QuickSight
A fast, cloud-powered, BI service for
1/10 the cost of traditional BI software
$9
per user per month
With 1 year commitment
First analysis in about 60 seconds
Sign-in
[Architecture diagram: Amazon QuickSight]
Users: business users via the QuickSight UI (mobile devices, web browsers); partner BI products via the QuickSight API
QuickSight: data prep, metadata, suggestions, connectors, SPICE
AWS data sources: Amazon S3, Amazon Kinesis, Amazon DynamoDB, Amazon EMR, Amazon Redshift, Amazon RDS, Athena
Other sources: files, apps, on-premises data (direct connect, JDBC/ODBC)
I have multiple datasets both on-premises and on
AWS from different sources, and I need to make
data available and enable access by using
Amazon QuickSight.
How do I do this?
1. Data made available in “data lakes” using Amazon S3 or Amazon Redshift
2. Data access managed with bucket- or schema-level policies
3. Data enabled by using Amazon QuickSight
[Diagram: data lake access with Amazon QuickSight]
Sources: Amazon EMR or Apache Hadoop; log files and application API extracts; on-premises data; Amazon Redshift; Amazon DynamoDB or EC2-based MongoDB, Cassandra; Amazon S3
Data made available in data lakes
Data access managed at the data lake: bucket- or schema-level permissions by user and data access needs
Data enabled by user in data marts: QuickSight (mobile devices, web browsers)
Easy exploration of AWS data
Securely discover and connect to AWS data
Quickly explore AWS data sources
• Relational databases
• NoSQL databases
• Amazon EMR, Amazon S3, files
• Streaming data sources
Easily import data from any table or file
Automatic detection of data types
Intuitive visualizations with AutoGraph
• Automatic detection of data types
• Optimal query generation
• Appropriate graph type selection
• Ability to customize the graph type
• Very fast response
Native mobile experience
• iOS, Android
• Full experience on tablets
• Consumption experience on smart phones
• Very fast response
Tell a story with your data
• Capture the critical snapshot of analysis
• Build a sequence of analysis
• Share it securely
• Enable interactive exploration
• Very fast response
Advantage of Amazon QuickSight
• Fast to get started
• Easily explore any AWS data
• Fast insights with SPICE
• Easy to use and share
• Effortless scale
• Low cost
Amazon QuickSight pricing
| | Standard edition (annual) | Standard edition (monthly) | Enterprise edition (annual) | Enterprise edition (monthly) |
|---|---|---|---|---|
| Price per user per month | $9 | $12 | $18 | $24 |
| SPICE capacity (GB)* | 10 | 10 | 10 | 10 |
| Additional SPICE, per GB-month | $0.25 | $0.25 | $0.38 | $0.38 |

* Per-user SPICE capacity is pooled across all users in an account. As an example, a customer
with 100 user subscriptions will get 1,000 GB of SPICE capacity for the account.
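The pooled-capacity footnote and the per-user prices can be checked with a short calculation. The figures are the ones listed on this slide; the function names are illustrative only.

```python
SPICE_GB_PER_USER = 10  # SPICE capacity included with each user subscription

# Price per user per month in USD, keyed by (edition, subscription).
PRICE_PER_USER_MONTH = {
    ("standard", "annual"): 9,
    ("standard", "monthly"): 12,
    ("enterprise", "annual"): 18,
    ("enterprise", "monthly"): 24,
}


def pooled_spice_gb(users):
    """Total SPICE capacity shared by all users in the account."""
    return users * SPICE_GB_PER_USER


def monthly_user_cost_usd(users, edition, subscription):
    """Monthly subscription cost, excluding any additional SPICE capacity."""
    return users * PRICE_PER_USER_MONTH[(edition, subscription)]


print(pooled_spice_gb(100))                               # 1000
print(monthly_user_cost_usd(100, "standard", "annual"))   # 900
```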
Integrate Amazon Athena
and QuickSight
ELK Dashboard Sample
Athena + QuickSight vs. ELK Stack

| | Athena (serverless) | ELK Stack |
|---|---|---|
| Response time | Interactive, partial results | Few seconds |
| Pre-process time | Almost 0 | Few seconds to minutes |
| Query string | SQL | Structured queries |
| Infrastructure | No infra | Logstash & Kibana: EC2; Elasticsearch: managed service |
| Management effort | Low | Medium |
| Input format | CSV, JSON, ORC, Apache Parquet, and Avro | JSON or XML |
| Price model | Charge by scanned data | Charge by infrastructure, index doc |
| Front-end | QuickSight | Kibana |
Please fill in feedback form to get
AWS credit
Thank you

AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Visualize your data in Data Lake with AWS Athena and AWS Quicksight Hands-on Workshop

  • 1. Level 200 Hands-on Workshop Visualize your data in Data Lake with AWS Athena and AWS Quicksight Jeff Ng, Solutions Architect eCloudvalley 27 July, 2017
  • 2. Agenda: About eCloudvalley; Overview of Amazon Athena; Overview of Amazon QuickSight; Athena+QuickSight vs ELK; Demo; Lab
  • 4. Create a New Cloud Standard by becoming TRUE CLOUD EXPERT. 10 X All 5-Cert Engineers 90+ AWS Certifications AWS Authorized Instructor Microsoft Certified Trainer 100% Focus on the Largest Cloud- AWS Born-in-the-cloud System Integrator
  • 6. The 1st and the Only Premier Partner in GCR AWS China Region Consulting Partner AWS Premier Consulting Partner (1st and the Only Premier Partner in GCR) Marketing & Commerce Competency AWS Audited Managed Services Provider Mobile Competency
  • 7. Our Offering Official AWS Training DevOps/ Optimization Next Generation Managed Services Fully Managed AWS Migration DataOps/ Big Data/ BI
  • 8. Overview of Data Analytics and Visualization
  • 11. Challenges Customers Faced Significant amount of work required to analyze data in Amazon S3 Users often only have access to aggregated data sets Managing a Hadoop cluster or data warehouse requires expertise
  • 12. Introducing Amazon Athena Amazon Athena is an interactive query service that makes it easy to analyze data directly from Amazon S3 using Standard SQL
  • 13. Athena is Serverless No Infrastructure or administration Zero Spin up time Transparent upgrades
  • 14. Amazon Athena is Easy To Use Log into the Console Create a table Type in a Hive DDL Statement Use the console Add Table wizard Start querying
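As a rough illustration of the "type in a Hive DDL statement" step, the sketch below assembles a `CREATE EXTERNAL TABLE` statement for delimited text files in S3. The table name, columns, and bucket path are hypothetical and not taken from the workshop; the resulting string would be pasted into the Athena query editor.

```python
def build_create_table_ddl(table, columns, s3_location, delimiter=","):
    """Return a Hive-style DDL string for a delimited-text table stored in S3."""
    # Each column is a (name, hive_type) pair, e.g. ("status", "int").
    column_defs = ",\n  ".join(f"`{name}` {hive_type}" for name, hive_type in columns)
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS {table} (\n"
        f"  {column_defs}\n"
        f")\n"
        f"ROW FORMAT DELIMITED FIELDS TERMINATED BY '{delimiter}'\n"
        f"LOCATION '{s3_location}'"
    )


# Hypothetical table and bucket, for illustration only.
ddl = build_create_table_ddl(
    "workshop_logs",
    [("request_time", "string"), ("status", "int"), ("bytes_sent", "bigint")],
    "s3://example-bucket/logs/",
)
print(ddl)
```

The table is only metadata: the files under the `LOCATION` prefix stay where they are and are read at query time.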
  • 15. Amazon Athena is Highly Available You connect to a service endpoint or log into the console Athena uses warm compute pools across multiple Availability Zones Your data is in Amazon S3, which is also highly available and designed for 99.999999999% durability
  • 16. Query Data Directly from Amazon S3 No loading of data Query data in its raw format Text, CSV, JSON, weblogs, AWS service logs Convert to an optimized form like ORC or Parquet for the best performance and lowest cost No ETL required Stream data directly from Amazon S3 Take advantage of Amazon S3 durability and availability
  • 17. Use ANSI SQL Start writing ANSI SQL Support for complex joins, nested queries & window functions Support for complex data types (arrays, structs) Support for partitioning of data by any key (date, time, custom keys) e.g., Year, Month, Day, Hour or Customer Key, Date
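To make the window-function support concrete, here is a small sketch that runs an ANSI-style running total. It uses Python's built-in `sqlite3` as a local stand-in for Athena (an assumption made purely so the example runs without AWS; it needs SQLite 3.25 or later); the `requests` table and its rows are invented.

```python
import sqlite3

# Local stand-in for an Athena table; data is made up for illustration.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE requests (customer TEXT, day TEXT, bytes INTEGER)")
conn.executemany(
    "INSERT INTO requests VALUES (?, ?, ?)",
    [
        ("acme", "2017-07-25", 100),
        ("acme", "2017-07-26", 250),
        ("globex", "2017-07-25", 40),
        ("globex", "2017-07-26", 60),
    ],
)

# Running total of bytes per customer, ordered by day.
query = """
SELECT customer, day,
       SUM(bytes) OVER (PARTITION BY customer ORDER BY day) AS running_bytes
FROM requests
ORDER BY customer, day
"""
rows = conn.execute(query).fetchall()
for row in rows:
    print(row)
```

Note that `PARTITION BY` inside a window function groups rows within a query; it is unrelated to the table partitioning by date or customer key that the slide also mentions.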
  • 18. Familiar Technologies Under the Covers Used for SQL Queries In-memory distributed query engine ANSI-SQL compatible with extensions Used for DDL functionality Complex data types Multitude of formats Supports data partitioning
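Data partitioning is usually expressed through the S3 key layout. The sketch below assumes a Hive-style `key=value` layout by year, month, day, and hour, and builds both the S3 prefix and the matching `ALTER TABLE ... ADD PARTITION` statement. The bucket, table name, and partition keys are hypothetical.

```python
from datetime import datetime


def partition_prefix(base, ts):
    """Hive-style S3 prefix for the hour containing ts."""
    return f"{base.rstrip('/')}/year={ts:%Y}/month={ts:%m}/day={ts:%d}/hour={ts:%H}/"


def add_partition_ddl(table, base, ts):
    """DDL that registers one hourly partition with the table's metadata."""
    return (
        f"ALTER TABLE {table} ADD IF NOT EXISTS "
        f"PARTITION (year='{ts:%Y}', month='{ts:%m}', day='{ts:%d}', hour='{ts:%H}') "
        f"LOCATION '{partition_prefix(base, ts)}'"
    )


# Hypothetical bucket and table, for illustration only.
ts = datetime(2017, 7, 27, 9)
prefix = partition_prefix("s3://example-bucket/logs", ts)
statement = add_partition_ddl("workshop_logs", "s3://example-bucket/logs", ts)
print(prefix)
print(statement)
```

A query that filters on the partition columns only needs to read the matching prefixes, which is where the scan and cost savings come from.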
  • 19. Amazon Athena Supports Multiple Data Formats Text files, e.g., CSV, raw logs Apache Web Logs, TSV files JSON (simple, nested) Compressed files Columnar formats such as Apache Parquet & Apache ORC Avro support
  • 20. Amazon Athena is Fast Tuned for performance Automatically parallelizes queries Results are streamed to console Results also stored in S3 Improve Query performance Compress your data Use columnar formats
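Outside the console, the "results also stored in S3" behaviour can be seen by running a query through the API. The sketch below assumes a boto3-style Athena client (`start_query_execution`, `get_query_execution`) passed in by the caller; `FakeAthenaClient` is a made-up in-memory stand-in so the example runs without AWS credentials, and the bucket path is hypothetical.

```python
import time


def run_query(client, sql, database, output_location, poll_seconds=1.0):
    """Start a query, wait until it finishes, and return (state, result path in S3)."""
    started = client.start_query_execution(
        QueryString=sql,
        QueryExecutionContext={"Database": database},
        ResultConfiguration={"OutputLocation": output_location},
    )
    query_id = started["QueryExecutionId"]
    while True:
        execution = client.get_query_execution(QueryExecutionId=query_id)["QueryExecution"]
        state = execution["Status"]["State"]
        if state in ("SUCCEEDED", "FAILED", "CANCELLED"):
            return state, execution["ResultConfiguration"]["OutputLocation"]
        time.sleep(poll_seconds)


class FakeAthenaClient:
    """Hypothetical stand-in that mimics the two calls used above."""

    def __init__(self):
        self._polls = 0
        self._output = ""

    def start_query_execution(self, **kwargs):
        self._output = kwargs["ResultConfiguration"]["OutputLocation"]
        return {"QueryExecutionId": "example-id"}

    def get_query_execution(self, QueryExecutionId):
        # Report RUNNING once, then SUCCEEDED, to exercise the polling loop.
        self._polls += 1
        state = "RUNNING" if self._polls < 2 else "SUCCEEDED"
        return {
            "QueryExecution": {
                "Status": {"State": state},
                "ResultConfiguration": {
                    "OutputLocation": f"{self._output}{QueryExecutionId}.csv"
                },
            }
        }


state, result_path = run_query(
    FakeAthenaClient(), "SELECT 1", "default", "s3://example-bucket/results/", poll_seconds=0
)
print(state, result_path)
```

With real credentials, the stand-in would be replaced by `boto3.client("athena")`; the rest of the function stays the same.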
  • 21. Amazon Athena is Cost Effective Pay per query $5 per TB scanned from S3 DDL Queries and failed queries are free Save by using compression, columnar formats, partitions
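The pay-per-query model on this slide can be turned into a back-of-the-envelope estimate. The sketch below uses the $5 per TB figure from the slide and treats DDL and failed queries as free; the choice of 10^12 bytes per TB and the sample scan sizes are assumptions, and any minimum charge or rounding applied by the service is ignored.

```python
PRICE_PER_TB_USD = 5.0      # from the slide
BYTES_PER_TB = 10**12       # assumption: decimal terabytes


def estimate_query_cost(bytes_scanned, succeeded=True, is_ddl=False):
    """Approximate charge in USD for a single query."""
    if is_ddl or not succeeded:
        return 0.0  # DDL queries and failed queries are free
    return bytes_scanned / BYTES_PER_TB * PRICE_PER_TB_USD


# Hypothetical scan sizes: the same dataset as raw CSV, and after conversion
# to a compressed columnar format where the query reads far fewer bytes.
raw_cost = estimate_query_cost(2 * 10**12)
columnar_cost = estimate_query_cost(2 * 10**11)
print(raw_cost, columnar_cost)
```

Because the charge depends only on bytes scanned, compression, columnar formats, and partition pruning lower the bill directly.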
  • 24. A Sample Pipeline Ad-hoc access to raw data using SQL
  • 25. A Sample Pipeline Ad-hoc access to raw data using SQL Athena can query aggregated datasets as well
  • 26. Summary No ETL required. No loading of data. Query data where it lives Query data at whatever latitude and longitude you want No infrastructure to manage
  • 28. Simple Query editor with key bindings
  • 29. Can also see a detailed view in the catalog tab
  • 30. You can also check the properties. Note the location.
  • 32. Use the JDBC Driver
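For SQL clients that connect over JDBC, the connection needs a regional endpoint and an S3 location for query results. The sketch below collects those settings in Python; the URL pattern, driver class name, and `s3_staging_dir` property reflect the Athena JDBC driver as it was documented around the time of this workshop and should be treated as assumptions, since later driver versions use different names. The region and bucket are hypothetical.

```python
def athena_jdbc_settings(region, s3_staging_dir):
    """Connection settings for a JDBC client (assumed 2017-era driver conventions)."""
    return {
        "driver_class": "com.amazonaws.athena.jdbc.AthenaDriver",
        "url": f"jdbc:awsathena://athena.{region}.amazonaws.com:443",
        # Query results are written under this S3 prefix.
        "properties": {"s3_staging_dir": s3_staging_dir},
    }


# Hypothetical region and bucket, for illustration only.
settings = athena_jdbc_settings("us-east-1", "s3://example-bucket/athena-results/")
print(settings["url"])
```

Check the documentation for the driver version you download before relying on these values.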
  • 33. Using Amazon Athena with Amazon QuickSight Amazon S3 Amazon RDS Amazon Redshift Amazon Athena QuickSight allows you to connect to data from a wide variety of AWS, third-party, and on-premises sources including Amazon Athena
  • 34. Overview of Amazon QuickSight
  • 35. What are the big data challenges our customers face?
  • 36. Who are my top customers and what are they buying? Which devices are showing time for maintenance? What is my product profitability by region? Why is my most profitable region not growing? How much inventory do I have? Has my fraud account expense increased? How is my marketing campaign performing? How is my employee satisfaction trending? Lots of data Lots and lots of questions Few insights
  • 37. Old-guard BI Costs too much Pay $ million before seeing first analysis 3 year TCO $150 to $250 per user per month Takes too long Spend 6 to 12 months of consulting and software implementation time
  • 39. A fast, cloud-powered, BI service for 1/10 the cost of traditional BI software
  • 40. $9 per user per month With 1 year commitment
  • 41. First analysis in about 60 seconds Sign-in Business user
  • 42. QuickSight API Data prep Metadata Suggestions Connectors SPICE Business User QuickSight UI Mobile devices Web browsers Partner BI products Amazon S3 Amazon Kinesis Amazon DynamoDB Amazon EMR Amazon Redshift Amazon RDS Files Apps Direct connect JDBC/ODBC On-premises data Athena
  • 43. I have multiple datasets both on-premises and on AWS from different sources, and I need to make data available and enable access by using Amazon QuickSight. How do I do this?
  • 44. 1. Data made available in “data lakes” using Amazon S3 or Amazon Redshift 2. Data access managed with bucket- or schema-level policies 3. Data enabled by using Amazon QuickSight
  • 45. Amazon EMR or Apache Hadoop Log files, application API extracts On-premises data Amazon Redshift Amazon DynamoDB or EC2 based MongoDB, Cassandra Amazon S3 Data made available in data lakes QuickSight Mobile devices Web browsers Bucket- or schema-level permissions by user and data access needs Data access managed at the data lake Data enabled by user in data marts
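As one possible shape for the bucket-level policies in step 2, the sketch below builds an S3 bucket policy that gives a single principal read-only access to one prefix of the data lake. The bucket name, prefix, and role ARN are hypothetical, and a real deployment would need to match the policy to its own users and data layout.

```python
import json


def read_only_prefix_policy(bucket, prefix, principal_arn):
    """Bucket policy allowing one principal to list and read a single prefix."""
    prefix = prefix.strip("/")
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Sid": "ListPrefix",
                "Effect": "Allow",
                "Principal": {"AWS": principal_arn},
                "Action": "s3:ListBucket",
                "Resource": f"arn:aws:s3:::{bucket}",
                # Restrict listing to keys under the prefix.
                "Condition": {"StringLike": {"s3:prefix": [f"{prefix}/*"]}},
            },
            {
                "Sid": "ReadObjects",
                "Effect": "Allow",
                "Principal": {"AWS": principal_arn},
                "Action": "s3:GetObject",
                "Resource": f"arn:aws:s3:::{bucket}/{prefix}/*",
            },
        ],
    }


# Hypothetical names, for illustration only.
policy = read_only_prefix_policy(
    "example-data-lake", "sales/", "arn:aws:iam::123456789012:role/example-analyst"
)
print(json.dumps(policy, indent=2))
```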
  • 46. Easy exploration of AWS data Securely discover and connect to AWS data Quickly explore AWS data sources • Relational databases • NoSQL databases • Amazon EMR, Amazon S3, files • Streaming data sources Easily import data from any table or file Automatic detection of data types
  • 47. Intuitive visualizations with AutoGraph • Automatic detection of data types • Optimal query generation • Appropriate graph type selection • Ability to customize the graph type • Very fast response
  • 48. Native mobile experience • iOS, Android • Full experience on tablets • Consumption experience on smart phones • Very fast response
  • 49. Tell a story with your data • Capture the critical snapshot of analysis • Build a sequence of analysis • Share it securely • Enable interactive exploration • Very fast response
  • 50. Advantages of Amazon QuickSight Fast to get started Fast insights with SPICE Easily explore any AWS data Easy to use and share Effortless scale Low cost
  • 51. Amazon QuickSight pricing. Standard edition: $9 per user per month with an annual subscription, $12 with a monthly subscription. Enterprise edition: $18 annual, $24 monthly. SPICE capacity*: 10 GB per user in every plan. Additional SPICE: $0.25 per GB-month (Standard), $0.38 per GB-month (Enterprise). * Per-user SPICE capacity is pooled across all users in an account. As an example, a customer with 100 user subscriptions will get 1,000 GB of SPICE capacity for the account.
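The pooled-capacity arithmetic on this slide can be sketched as follows. Prices and the 10 GB per-user allowance are taken from the slide; reading $0.25 as the Standard rate and $0.38 as the Enterprise rate for additional SPICE is an interpretation of the flattened table, and the function name is invented.

```python
# Per-user monthly prices from the slide, keyed by (edition, subscription).
PRICE_PER_USER = {
    ("standard", "annual"): 9,
    ("standard", "monthly"): 12,
    ("enterprise", "annual"): 18,
    ("enterprise", "monthly"): 24,
}
SPICE_GB_PER_USER = 10
# Assumed mapping of the two extra-capacity prices to editions.
EXTRA_SPICE_PER_GB_MONTH = {"standard": 0.25, "enterprise": 0.38}


def monthly_bill(users, edition, subscription, spice_gb_needed=0):
    """Return (pooled SPICE GB, extra GB to buy, total USD per month)."""
    pooled_gb = users * SPICE_GB_PER_USER
    extra_gb = max(0, spice_gb_needed - pooled_gb)
    total = (
        users * PRICE_PER_USER[(edition, subscription)]
        + extra_gb * EXTRA_SPICE_PER_GB_MONTH[edition]
    )
    return pooled_gb, extra_gb, total


# The slide's example: 100 users pool 1,000 GB of SPICE capacity.
print(monthly_bill(100, "standard", "annual"))
# Same account needing 1,200 GB buys the extra 200 GB.
print(monthly_bill(100, "standard", "annual", spice_gb_needed=1200))
```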
  • 60. Athena+QuickSight vs. ELK stack, comparing Athena (serverless) with the ELK stack. Response time: interactive, partial results vs. a few seconds. Pre-processing time: almost 0 vs. a few seconds to minutes. Query string: SQL vs. structured queries. Infrastructure: no infrastructure vs. Logstash and Kibana on EC2 with Elasticsearch as a managed service. Management effort: low vs. medium. Input format: CSV, JSON, ORC, Apache Parquet, and Avro vs. JSON or XML. Price model: charged by data scanned vs. charged by infrastructure (index docs). Front end: QuickSight vs. Kibana
  • 61. Please fill in the feedback form to get AWS credit