SlideShare a Scribd company logo
1 of 50
Download to read offline
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Roy Hasson
Sr. Mgr, Business Development – Amazon Athena
AWS
Shane Andrade
Principal Engineer I - Email Infra Data Team
SendGrid
Amazon Athena: What’s new and how
SendGrid innovates using Athena
A N T 3 2 4
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Agenda
• Customer trends
• Workload isolation and cost controls
• How SendGrid built email replay using Amazon Athena
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Athena Customers
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Understanding our users
Data Consumer
• Easily discover data
• Choice of tools
• Performance
Data Engineer
• Security
• Maintainability & Scale
• Performance
• Cost
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Querying Your Data Lake
Devices
Web
Sensors
Social
EDW
Amazon Kinesis Data
Firehose writes
partitioned optimized
data
Ingest streaming
events in real time
with Amazon Kinesis
- Ingestion
S3://bucket/year/month/day/hour/file.parquet
S3://bucket/year/month/day/hour/file.orc
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Ingestion: Database and Data Warehouse
Devices
Web
Sensors
Social
EDW
Move snapshots and
incremental DB and
DWH tables
S3://bucket/table/LOAD001.csv
S3://bucket/table/20181127-1134010000.csv
S3://bucket/year/month/day/hour/file.parquet
S3://bucket/year/month/day/hour/file.orc
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Querying Your Data Lake – Transform & Automate
Devices
Web
Sensors
Social
EDW
Automate routine
tasks such as data
cleansing
Perform unique data
transformations and
ML
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Querying Your Data Lake – Catalog
Devices
Web
Sensors
Social
EDW
AWS Glue
Data Catalog
Permissions
Store transformed
data, crawl and
catalog its schema
Restrict access by
defining permissions
on databases and
tables
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Catalog: Access Control
AWS Glue
Data Catalog
{
"Effect": "Allow",
"Action": [
"glue:GetTables”,
"glue:GetTable”,
],
"Resource": [
"arn:aws:glue:us-east-1:123456789012:catalog",
"arn:aws:glue:us-east-1:123456789012:database/example_db",
"arn:aws:glue:us-east-1:123456789012:table/example_db/*"
]
}
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Querying Your Data Lake – Consume
Devices
Web
Sensors
Social
EDW
AWS Glue
Data Catalog
Data ConsumerData Engineer
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Data consumption
Permissions
Data Lake
AWS Cloud
AWS Cloud
Reporting &
Analytics
Machine
Learning
AWS Cloud
Custom
Applications
AWS Glue
Data Catalog
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Visualize your data with your favorite tools
Featured Athena Partners
Amazon QuickSight
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Data consumption – Data Analyst
AWS Glue
Data Catalog
JDBC/ODBC drivers
connect common BI
and SQL tools
Now 2-5x faster
Create optimized
tables on-demand
using Create Table
As Select
Abstract complex
queries & expose
only needed data
with Athena Views
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Data consumption – Data Analyst
JDBC/ODBC driver
integration with
Microsoft Active
Directory
jdbc:awsathena://AwsRegion=us-east-1;
S3OutputLocation=s3://bucket/path;
AwsCredentialsProviderClass=com.simba.athena.iamsupport.plugin.AdfsCredentialsProvider;
idp_host=example.adfs.server;
idp_port=233;
UID=HOMEjsmith;
PWD=simba12345;
preferred_role=arn:aws:iam::123456789123: role/JSMITH;
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Data consumption – Automated Reporting
athena.startQueryExecution("SELECT * FROM business_view”)
Query_ID
1
2
3 4
Email
notification
5
1. Schedule query
2. Track QueryID for status
3. Query results to Amazon S3
4. New file trigger
5. Job complete notification
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Data consumption – Data Scientist
AWS Glue
Data Catalog
Use PyAthena to query
Athena tables directly
from Amazon SageMaker
notebooks
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Data consumption – Custom Applications
AWS Glue
Data Catalog
Integrate with AWS
AppSync for easy access
to data, on and off-line
Get data to your
applications using AWS
SDK and Athena API
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Inspecting AWS service logs
Service logs are
written directly to
Amazon S3
- Ingestion
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Inspecting AWS service logs – Optimization
Optimize and
partition to improve
performance and
cost
Data stored
partitioned in Apache
Parquet format
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Optimization: Small Files
It is recommend to
merge many small
files into fewer larger
ones Improves performance by up to 5x
when accessing tables containing
large number of small files
Size (Bytes) File name
14408 kfhconnectblog-parquet-1-2018-05-11-16-01-9e7cc0b631d8.parquet
14408 kfhconnectblog-parquet-1-2018-05-11-16-01-206cf7098588.parquet
14408 kfhconnectblog-parquet-1-2018-05-11-16-03-6a3fa4c14e22.parquet
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Inspecting AWS service logs – Catalog
AWS Glue
Data Catalog
Permissions
AWS Glue crawler
catalogs data schema
and partitions
Restrict access by
defining permissions
on databases and
tables
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Inspecting AWS service logs – Consume
AWS Glue
Data Catalog
Data ConsumerData Engineer
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Athena Workgroups
Athena Workgroups are used to isolate queries
between different teams, workloads or applications,
and to set limits on amount of data each query or the
entire workgroup can process
Workload Isolation Query Metrics Cost Controls
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Workgroups – Workload Isolation
Unique query output
location per
Workgroup
Encrypt results with
unique AWS KMS key
per Workgroup
Collect and publish
aggregated metrics
per Workgroup to
AWS CloudWatch
Use Workgroup
settings eliminating
need to configure
individual users
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Workgroups – Metric Reporting
Total bytes scanned
per Workgroup
Total failed queries
per Workgroup
Total successful
queries per
Workgroup
Total query execution
time per Workgroup
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Workgroups – Cost Controls
• Per query data scanned threshold; exceeding, will cancel query
• Trigger alarms to notify of increasing usage and cost
• Disable Workgroup when all queries exceed a maximum threshold
Any Athena metric: successful/failed & total queries, query run time, etc.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Workgroups – Usage Notifications
Define a hierarchy of
alarms to be alerted
as usage increases
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Denver, Colorado | San Francisco, California | Irvine, California | London, England
78,000 customers in 100+ countries | 45B emails monthly | 4 offices
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
OUR CUSTOMERS
Powering the customer engagement for the world’s
leading digital brands
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
SMTP/API
Transactional
Marketing
Campaigns
Promotional
Email ActivityParse API
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Originally built entirely on prem
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Customers were limited to 7 days of data
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
High volume customers were more constrained
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
New customer base had different needs
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Provisioning Risks
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Serverless Elasticity API Integration
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Initial architecture during internal beta
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Controlled access to Athena
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
We tested Athena query times vs file counts
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Historical data needed to be handle separately
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Athena architecture V3
AWS
GlueDynamoDB
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Benefits from using Athena
Scalability
Enable us to support
increased Email Activity data
in the future with little to no
additional cost.
Reduced variable
costs and ops
tickets
Improved customer
satisfaction
regarding access
to email data
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Serverless Elasticity API Integration
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Thank you!
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Roy Hasson
royon@amazon.com
Shane Andrade
shane.andrade@sendgrid.com

More Related Content

What's hot

Migrate Your Hadoop/Spark Workload to Amazon EMR and Architect It for Securit...
Migrate Your Hadoop/Spark Workload to Amazon EMR and Architect It for Securit...Migrate Your Hadoop/Spark Workload to Amazon EMR and Architect It for Securit...
Migrate Your Hadoop/Spark Workload to Amazon EMR and Architect It for Securit...Amazon Web Services
 
How to Build HR Lakes on AWS to Unlock New Business Insights (DAT367) - AWS r...
How to Build HR Lakes on AWS to Unlock New Business Insights (DAT367) - AWS r...How to Build HR Lakes on AWS to Unlock New Business Insights (DAT367) - AWS r...
How to Build HR Lakes on AWS to Unlock New Business Insights (DAT367) - AWS r...Amazon Web Services
 
Considerations for Building Your First Streaming Application (ANT359) - AWS r...
Considerations for Building Your First Streaming Application (ANT359) - AWS r...Considerations for Building Your First Streaming Application (ANT359) - AWS r...
Considerations for Building Your First Streaming Application (ANT359) - AWS r...Amazon Web Services
 
Build Your Own Log Analytics Solutions on AWS (ANT323-R) - AWS re:Invent 2018
Build Your Own Log Analytics Solutions on AWS (ANT323-R) - AWS re:Invent 2018Build Your Own Log Analytics Solutions on AWS (ANT323-R) - AWS re:Invent 2018
Build Your Own Log Analytics Solutions on AWS (ANT323-R) - AWS re:Invent 2018Amazon Web Services
 
Query in Place with AWS (STG315-R1) - AWS re:Invent 2018
Query in Place with AWS (STG315-R1) - AWS re:Invent 2018Query in Place with AWS (STG315-R1) - AWS re:Invent 2018
Query in Place with AWS (STG315-R1) - AWS re:Invent 2018Amazon Web Services
 
Extending Analytics Beyond the Data Warehouse, ft. Warner Bros. Analytics (AN...
Extending Analytics Beyond the Data Warehouse, ft. Warner Bros. Analytics (AN...Extending Analytics Beyond the Data Warehouse, ft. Warner Bros. Analytics (AN...
Extending Analytics Beyond the Data Warehouse, ft. Warner Bros. Analytics (AN...Amazon Web Services
 
Data Lake Implementation: Processing and Querying Data in Place (STG204-R1) -...
Data Lake Implementation: Processing and Querying Data in Place (STG204-R1) -...Data Lake Implementation: Processing and Querying Data in Place (STG204-R1) -...
Data Lake Implementation: Processing and Querying Data in Place (STG204-R1) -...Amazon Web Services
 
Visualization with Amazon QuickSight
Visualization with Amazon QuickSightVisualization with Amazon QuickSight
Visualization with Amazon QuickSightAmazon Web Services
 
Amazon EMR: Optimize Transient Clusters for Data Processing & ETL (ANT341) - ...
Amazon EMR: Optimize Transient Clusters for Data Processing & ETL (ANT341) - ...Amazon EMR: Optimize Transient Clusters for Data Processing & ETL (ANT341) - ...
Amazon EMR: Optimize Transient Clusters for Data Processing & ETL (ANT341) - ...Amazon Web Services
 
Serverless Data Prep with AWS Glue (ANT313) - AWS re:Invent 2018
Serverless Data Prep with AWS Glue (ANT313) - AWS re:Invent 2018Serverless Data Prep with AWS Glue (ANT313) - AWS re:Invent 2018
Serverless Data Prep with AWS Glue (ANT313) - AWS re:Invent 2018Amazon Web Services
 
Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...
Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...
Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...Amazon Web Services
 
What's New with Amazon Redshift ft. Dow Jones (ANT350-R) - AWS re:Invent 2018
What's New with Amazon Redshift ft. Dow Jones (ANT350-R) - AWS re:Invent 2018What's New with Amazon Redshift ft. Dow Jones (ANT350-R) - AWS re:Invent 2018
What's New with Amazon Redshift ft. Dow Jones (ANT350-R) - AWS re:Invent 2018Amazon Web Services
 
Building a Data Lake for Your Enterprise, ft. Sysco (STG309) - AWS re:Invent ...
Building a Data Lake for Your Enterprise, ft. Sysco (STG309) - AWS re:Invent ...Building a Data Lake for Your Enterprise, ft. Sysco (STG309) - AWS re:Invent ...
Building a Data Lake for Your Enterprise, ft. Sysco (STG309) - AWS re:Invent ...Amazon Web Services
 
Success has Many Query Engines- Tel Aviv Summit 2018
Success has Many Query Engines- Tel Aviv Summit 2018Success has Many Query Engines- Tel Aviv Summit 2018
Success has Many Query Engines- Tel Aviv Summit 2018Amazon Web Services
 
Building Advanced Workflows with AWS Glue (ANT372) - AWS re:Invent 2018
Building Advanced Workflows with AWS Glue (ANT372) - AWS re:Invent 2018Building Advanced Workflows with AWS Glue (ANT372) - AWS re:Invent 2018
Building Advanced Workflows with AWS Glue (ANT372) - AWS re:Invent 2018Amazon Web Services
 
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...Amazon Web Services
 

What's hot (20)

Migrate Your Hadoop/Spark Workload to Amazon EMR and Architect It for Securit...
Migrate Your Hadoop/Spark Workload to Amazon EMR and Architect It for Securit...Migrate Your Hadoop/Spark Workload to Amazon EMR and Architect It for Securit...
Migrate Your Hadoop/Spark Workload to Amazon EMR and Architect It for Securit...
 
How to Build HR Lakes on AWS to Unlock New Business Insights (DAT367) - AWS r...
How to Build HR Lakes on AWS to Unlock New Business Insights (DAT367) - AWS r...How to Build HR Lakes on AWS to Unlock New Business Insights (DAT367) - AWS r...
How to Build HR Lakes on AWS to Unlock New Business Insights (DAT367) - AWS r...
 
Considerations for Building Your First Streaming Application (ANT359) - AWS r...
Considerations for Building Your First Streaming Application (ANT359) - AWS r...Considerations for Building Your First Streaming Application (ANT359) - AWS r...
Considerations for Building Your First Streaming Application (ANT359) - AWS r...
 
Build Your Own Log Analytics Solutions on AWS (ANT323-R) - AWS re:Invent 2018
Build Your Own Log Analytics Solutions on AWS (ANT323-R) - AWS re:Invent 2018Build Your Own Log Analytics Solutions on AWS (ANT323-R) - AWS re:Invent 2018
Build Your Own Log Analytics Solutions on AWS (ANT323-R) - AWS re:Invent 2018
 
Query in Place with AWS (STG315-R1) - AWS re:Invent 2018
Query in Place with AWS (STG315-R1) - AWS re:Invent 2018Query in Place with AWS (STG315-R1) - AWS re:Invent 2018
Query in Place with AWS (STG315-R1) - AWS re:Invent 2018
 
Extending Analytics Beyond the Data Warehouse, ft. Warner Bros. Analytics (AN...
Extending Analytics Beyond the Data Warehouse, ft. Warner Bros. Analytics (AN...Extending Analytics Beyond the Data Warehouse, ft. Warner Bros. Analytics (AN...
Extending Analytics Beyond the Data Warehouse, ft. Warner Bros. Analytics (AN...
 
Data Lake Implementation: Processing and Querying Data in Place (STG204-R1) -...
Data Lake Implementation: Processing and Querying Data in Place (STG204-R1) -...Data Lake Implementation: Processing and Querying Data in Place (STG204-R1) -...
Data Lake Implementation: Processing and Querying Data in Place (STG204-R1) -...
 
Preparing Data for the Lake
Preparing Data for the LakePreparing Data for the Lake
Preparing Data for the Lake
 
Visualization with Amazon QuickSight
Visualization with Amazon QuickSightVisualization with Amazon QuickSight
Visualization with Amazon QuickSight
 
Amazon EMR: Optimize Transient Clusters for Data Processing & ETL (ANT341) - ...
Amazon EMR: Optimize Transient Clusters for Data Processing & ETL (ANT341) - ...Amazon EMR: Optimize Transient Clusters for Data Processing & ETL (ANT341) - ...
Amazon EMR: Optimize Transient Clusters for Data Processing & ETL (ANT341) - ...
 
Serverless Data Prep with AWS Glue (ANT313) - AWS re:Invent 2018
Serverless Data Prep with AWS Glue (ANT313) - AWS re:Invent 2018Serverless Data Prep with AWS Glue (ANT313) - AWS re:Invent 2018
Serverless Data Prep with AWS Glue (ANT313) - AWS re:Invent 2018
 
Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...
Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...
Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...
 
Analyzing Streams
Analyzing StreamsAnalyzing Streams
Analyzing Streams
 
Preparing Data for the Lake
Preparing Data for the LakePreparing Data for the Lake
Preparing Data for the Lake
 
What's New with Amazon Redshift ft. Dow Jones (ANT350-R) - AWS re:Invent 2018
What's New with Amazon Redshift ft. Dow Jones (ANT350-R) - AWS re:Invent 2018What's New with Amazon Redshift ft. Dow Jones (ANT350-R) - AWS re:Invent 2018
What's New with Amazon Redshift ft. Dow Jones (ANT350-R) - AWS re:Invent 2018
 
Building a Data Lake for Your Enterprise, ft. Sysco (STG309) - AWS re:Invent ...
Building a Data Lake for Your Enterprise, ft. Sysco (STG309) - AWS re:Invent ...Building a Data Lake for Your Enterprise, ft. Sysco (STG309) - AWS re:Invent ...
Building a Data Lake for Your Enterprise, ft. Sysco (STG309) - AWS re:Invent ...
 
Success has Many Query Engines- Tel Aviv Summit 2018
Success has Many Query Engines- Tel Aviv Summit 2018Success has Many Query Engines- Tel Aviv Summit 2018
Success has Many Query Engines- Tel Aviv Summit 2018
 
Building Advanced Workflows with AWS Glue (ANT372) - AWS re:Invent 2018
Building Advanced Workflows with AWS Glue (ANT372) - AWS re:Invent 2018Building Advanced Workflows with AWS Glue (ANT372) - AWS re:Invent 2018
Building Advanced Workflows with AWS Glue (ANT372) - AWS re:Invent 2018
 
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
 
Log Analytics with AWS
Log Analytics with AWSLog Analytics with AWS
Log Analytics with AWS
 

Similar to Amazon Athena: What's New and How SendGrid Innovates (ANT324) - AWS re:Invent 2018

Operational Excellence with Containerized Workloads Using AWS Fargate (CON320...
Operational Excellence with Containerized Workloads Using AWS Fargate (CON320...Operational Excellence with Containerized Workloads Using AWS Fargate (CON320...
Operational Excellence with Containerized Workloads Using AWS Fargate (CON320...Amazon Web Services
 
Česko-Slovenský AWS Webinář 07 - Optimalizace nákladů v AWS
Česko-Slovenský AWS Webinář 07 - Optimalizace nákladů v AWSČesko-Slovenský AWS Webinář 07 - Optimalizace nákladů v AWS
Česko-Slovenský AWS Webinář 07 - Optimalizace nákladů v AWSVladimir Simek
 
Business Intelligence in Minutes with Amazon Athena and Amazon QuickSight - A...
Business Intelligence in Minutes with Amazon Athena and Amazon QuickSight - A...Business Intelligence in Minutes with Amazon Athena and Amazon QuickSight - A...
Business Intelligence in Minutes with Amazon Athena and Amazon QuickSight - A...Amazon Web Services
 
Business Intelligence in Minutes with Amazon Athena and Amazon QuickSight
Business Intelligence in Minutes with Amazon Athena and Amazon QuickSightBusiness Intelligence in Minutes with Amazon Athena and Amazon QuickSight
Business Intelligence in Minutes with Amazon Athena and Amazon QuickSightAmazon Web Services
 
Using Amazon VPC Flow Logs for Predictive Security Analytics (NET319) - AWS r...
Using Amazon VPC Flow Logs for Predictive Security Analytics (NET319) - AWS r...Using Amazon VPC Flow Logs for Predictive Security Analytics (NET319) - AWS r...
Using Amazon VPC Flow Logs for Predictive Security Analytics (NET319) - AWS r...Amazon Web Services
 
AWS Public Datasets: Learnings from Staging Petabytes of Data for Analysis in...
AWS Public Datasets: Learnings from Staging Petabytes of Data for Analysis in...AWS Public Datasets: Learnings from Staging Petabytes of Data for Analysis in...
AWS Public Datasets: Learnings from Staging Petabytes of Data for Analysis in...Amazon Web Services
 
在 AWS 上構建無服務器分析
在 AWS 上構建無服務器分析在 AWS 上構建無服務器分析
在 AWS 上構建無服務器分析Amazon Web Services
 
Log Analytics with AWS
Log Analytics with AWSLog Analytics with AWS
Log Analytics with AWSAWS Germany
 
Accelerate Analytics at Scale with Amazon EMR - AWS Summit Sydney 2018
Accelerate Analytics at Scale with Amazon EMR - AWS Summit Sydney 2018Accelerate Analytics at Scale with Amazon EMR - AWS Summit Sydney 2018
Accelerate Analytics at Scale with Amazon EMR - AWS Summit Sydney 2018Amazon Web Services
 
Using Tableau and AWS for Fearless Reporting at UMD
Using Tableau and AWS for Fearless Reporting at UMDUsing Tableau and AWS for Fearless Reporting at UMD
Using Tableau and AWS for Fearless Reporting at UMDAmazon Web Services
 
What Can Your Logs Tell You? (ANT215) - AWS re:Invent 2018
What Can Your Logs Tell You? (ANT215) - AWS re:Invent 2018What Can Your Logs Tell You? (ANT215) - AWS re:Invent 2018
What Can Your Logs Tell You? (ANT215) - AWS re:Invent 2018Amazon Web Services
 
Instrumenting Kubernetes for Observability Using AWS X-Ray and Amazon CloudWa...
Instrumenting Kubernetes for Observability Using AWS X-Ray and Amazon CloudWa...Instrumenting Kubernetes for Observability Using AWS X-Ray and Amazon CloudWa...
Instrumenting Kubernetes for Observability Using AWS X-Ray and Amazon CloudWa...Amazon Web Services
 
Scaling from zero to millions of users
Scaling from zero to millions of usersScaling from zero to millions of users
Scaling from zero to millions of usersAmazon Web Services
 
Using Search with a Database - Peter Dachnowicz
Using Search with a Database - Peter DachnowiczUsing Search with a Database - Peter Dachnowicz
Using Search with a Database - Peter DachnowiczAmazon Web Services
 
Building a Data Lake in Amazon S3 & Amazon Glacier (STG401-R1) - AWS re:Inven...
Building a Data Lake in Amazon S3 & Amazon Glacier (STG401-R1) - AWS re:Inven...Building a Data Lake in Amazon S3 & Amazon Glacier (STG401-R1) - AWS re:Inven...
Building a Data Lake in Amazon S3 & Amazon Glacier (STG401-R1) - AWS re:Inven...Amazon Web Services
 
Virtual AWSome Day October 2018 - Amazon Web Services
Virtual AWSome Day October 2018 - Amazon Web ServicesVirtual AWSome Day October 2018 - Amazon Web Services
Virtual AWSome Day October 2018 - Amazon Web ServicesAmazon Web Services
 
Have Your Front End and Monitor It, Too (ANT303) - AWS re:Invent 2018
Have Your Front End and Monitor It, Too (ANT303) - AWS re:Invent 2018Have Your Front End and Monitor It, Too (ANT303) - AWS re:Invent 2018
Have Your Front End and Monitor It, Too (ANT303) - AWS re:Invent 2018Amazon Web Services
 
Serverless Stream Processing Pipeline Best Practices (SRV316-R1) - AWS re:Inv...
Serverless Stream Processing Pipeline Best Practices (SRV316-R1) - AWS re:Inv...Serverless Stream Processing Pipeline Best Practices (SRV316-R1) - AWS re:Inv...
Serverless Stream Processing Pipeline Best Practices (SRV316-R1) - AWS re:Inv...Amazon Web Services
 

Similar to Amazon Athena: What's New and How SendGrid Innovates (ANT324) - AWS re:Invent 2018 (20)

Operational Excellence with Containerized Workloads Using AWS Fargate (CON320...
Operational Excellence with Containerized Workloads Using AWS Fargate (CON320...Operational Excellence with Containerized Workloads Using AWS Fargate (CON320...
Operational Excellence with Containerized Workloads Using AWS Fargate (CON320...
 
Česko-Slovenský AWS Webinář 07 - Optimalizace nákladů v AWS
Česko-Slovenský AWS Webinář 07 - Optimalizace nákladů v AWSČesko-Slovenský AWS Webinář 07 - Optimalizace nákladů v AWS
Česko-Slovenský AWS Webinář 07 - Optimalizace nákladů v AWS
 
Business Intelligence in Minutes with Amazon Athena and Amazon QuickSight - A...
Business Intelligence in Minutes with Amazon Athena and Amazon QuickSight - A...Business Intelligence in Minutes with Amazon Athena and Amazon QuickSight - A...
Business Intelligence in Minutes with Amazon Athena and Amazon QuickSight - A...
 
Business Intelligence in Minutes with Amazon Athena and Amazon QuickSight
Business Intelligence in Minutes with Amazon Athena and Amazon QuickSightBusiness Intelligence in Minutes with Amazon Athena and Amazon QuickSight
Business Intelligence in Minutes with Amazon Athena and Amazon QuickSight
 
Using Amazon VPC Flow Logs for Predictive Security Analytics (NET319) - AWS r...
Using Amazon VPC Flow Logs for Predictive Security Analytics (NET319) - AWS r...Using Amazon VPC Flow Logs for Predictive Security Analytics (NET319) - AWS r...
Using Amazon VPC Flow Logs for Predictive Security Analytics (NET319) - AWS r...
 
BI & Analytics
BI & AnalyticsBI & Analytics
BI & Analytics
 
AWS Public Datasets: Learnings from Staging Petabytes of Data for Analysis in...
AWS Public Datasets: Learnings from Staging Petabytes of Data for Analysis in...AWS Public Datasets: Learnings from Staging Petabytes of Data for Analysis in...
AWS Public Datasets: Learnings from Staging Petabytes of Data for Analysis in...
 
Log Analytics with AWS
Log Analytics with AWSLog Analytics with AWS
Log Analytics with AWS
 
在 AWS 上構建無服務器分析
在 AWS 上構建無服務器分析在 AWS 上構建無服務器分析
在 AWS 上構建無服務器分析
 
Log Analytics with AWS
Log Analytics with AWSLog Analytics with AWS
Log Analytics with AWS
 
Accelerate Analytics at Scale with Amazon EMR - AWS Summit Sydney 2018
Accelerate Analytics at Scale with Amazon EMR - AWS Summit Sydney 2018Accelerate Analytics at Scale with Amazon EMR - AWS Summit Sydney 2018
Accelerate Analytics at Scale with Amazon EMR - AWS Summit Sydney 2018
 
Using Tableau and AWS for Fearless Reporting at UMD
Using Tableau and AWS for Fearless Reporting at UMDUsing Tableau and AWS for Fearless Reporting at UMD
Using Tableau and AWS for Fearless Reporting at UMD
 
What Can Your Logs Tell You? (ANT215) - AWS re:Invent 2018
What Can Your Logs Tell You? (ANT215) - AWS re:Invent 2018What Can Your Logs Tell You? (ANT215) - AWS re:Invent 2018
What Can Your Logs Tell You? (ANT215) - AWS re:Invent 2018
 
Instrumenting Kubernetes for Observability Using AWS X-Ray and Amazon CloudWa...
Instrumenting Kubernetes for Observability Using AWS X-Ray and Amazon CloudWa...Instrumenting Kubernetes for Observability Using AWS X-Ray and Amazon CloudWa...
Instrumenting Kubernetes for Observability Using AWS X-Ray and Amazon CloudWa...
 
Scaling from zero to millions of users
Scaling from zero to millions of usersScaling from zero to millions of users
Scaling from zero to millions of users
 
Using Search with a Database - Peter Dachnowicz
Using Search with a Database - Peter DachnowiczUsing Search with a Database - Peter Dachnowicz
Using Search with a Database - Peter Dachnowicz
 
Building a Data Lake in Amazon S3 & Amazon Glacier (STG401-R1) - AWS re:Inven...
Building a Data Lake in Amazon S3 & Amazon Glacier (STG401-R1) - AWS re:Inven...Building a Data Lake in Amazon S3 & Amazon Glacier (STG401-R1) - AWS re:Inven...
Building a Data Lake in Amazon S3 & Amazon Glacier (STG401-R1) - AWS re:Inven...
 
Virtual AWSome Day October 2018 - Amazon Web Services
Virtual AWSome Day October 2018 - Amazon Web ServicesVirtual AWSome Day October 2018 - Amazon Web Services
Virtual AWSome Day October 2018 - Amazon Web Services
 
Have Your Front End and Monitor It, Too (ANT303) - AWS re:Invent 2018
Have Your Front End and Monitor It, Too (ANT303) - AWS re:Invent 2018Have Your Front End and Monitor It, Too (ANT303) - AWS re:Invent 2018
Have Your Front End and Monitor It, Too (ANT303) - AWS re:Invent 2018
 
Serverless Stream Processing Pipeline Best Practices (SRV316-R1) - AWS re:Inv...
Serverless Stream Processing Pipeline Best Practices (SRV316-R1) - AWS re:Inv...Serverless Stream Processing Pipeline Best Practices (SRV316-R1) - AWS re:Inv...
Serverless Stream Processing Pipeline Best Practices (SRV316-R1) - AWS re:Inv...
 

More from Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Amazon Athena: What's New and How SendGrid Innovates (ANT324) - AWS re:Invent 2018

  • 1.
  • 2. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Roy Hasson Sr. Mgr, Business Development – Amazon Athena AWS Shane Andrade Principal Engineer I - Email Infra Data Team SendGrid Amazon Athena: What’s new and how SendGrid innovates using Athena A N T 3 2 4
  • 3. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Agenda • Customer trends • Workload isolation and cost controls • How SendGrid built email replay using Amazon Athena
  • 4. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Athena Customers
  • 5. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 6. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Understanding our users Data Consumer • Easily discover data • Choice of tools • Performance Data Engineer • Security • Maintainability & Scale • Performance • Cost
  • 7. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Querying Your Data Lake Devices Web Sensors Social EDW Amazon Kinesis Data Firehose writes partitioned optimized data Ingest streaming events in real time with Amazon Kinesis - Ingestion S3://bucket/year/month/day/hour/file.parquet S3://bucket/year/month/day/hour/file.orc
  • 8. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Ingestion: Database and Data Warehouse Devices Web Sensors Social EDW Move snapshots and incremental DB and DWH tables S3://bucket/table/LOAD001.csv S3://bucket/table/20181127-1134010000.csv S3://bucket/year/month/day/hour/file.parquet S3://bucket/year/month/day/hour/file.orc
  • 9. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Querying Your Data Lake – Transform & Automate Devices Web Sensors Social EDW Automate routine tasks such as data cleansing Perform unique data transformations and ML
  • 10. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Querying Your Data Lake – Catalog Devices Web Sensors Social EDW AWS Glue Data Catalog Permissions Store transformed data, crawl and catalog its schema Restrict access by defining permissions on databases and tables
  • 11. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Catalog: Access Control AWS Glue Data Catalog { "Effect": "Allow", "Action": [ "glue:GetTables”, "glue:GetTable”, ], "Resource": [ "arn:aws:glue:us-east-1:123456789012:catalog", "arn:aws:glue:us-east-1:123456789012:database/example_db", "arn:aws:glue:us-east-1:123456789012:table/example_db/*" ] }
  • 12. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Querying Your Data Lake – Consume Devices Web Sensors Social EDW AWS Glue Data Catalog Data ConsumerData Engineer
  • 13. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Data consumption Permissions Data Lake AWS Cloud AWS Cloud Reporting & Analytics Machine Learning AWS Cloud Custom Applications AWS Glue Data Catalog
  • 14. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Visualize your data with your favorite tools Featured Athena Partners Amazon QuickSight
  • 15. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Data consumption – Data Analyst AWS Glue Data Catalog JDBC/ODBC drivers connect common BI and SQL tools Now 2-5x faster Create optimized tables on-demand using Create Table As Select Abstract complex queries & expose only needed data with Athena Views
  • 16. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Data consumption – Data Analyst JDBC/ODBC driver integration with Microsoft Active Directory jdbc:awsathena://AwsRegion=us-east-1; S3OutputLocation=s3://bucket/path; AwsCredentialsProviderClass=com.simba.athena.iamsupport.plugin.AdfsCredentialsProvider; idp_host=example.adfs.server; idp_port=233; UID=HOMEjsmith; PWD=simba12345; preferred_role=arn:aws:iam::123456789123: role/JSMITH;
  • 17. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Data consumption – Automated Reporting athena.startQueryExecution("SELECT * FROM business_view”) Query_ID 1 2 3 4 Email notification 5 1. Schedule query 2. Track QueryID for status 3. Query results to Amazon S3 4. New file trigger 5. Job complete notification
  • 18. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Data consumption – Data Scientist AWS Glue Data Catalog Use PyAthena to query Athena tables directly from Amazon SageMaker notebooks
  • 19. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Data consumption – Custom Applications AWS Glue Data Catalog Integrate with AWS AppSync for easy access to data, on and off-line Get data to your applications using AWS SDK and Athena API
  • 20. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Inspecting AWS service logs Service logs are written directly to Amazon S3 - Ingestion
  • 21. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Inspecting AWS service logs – Optimization Optimize and partition to improve performance and cost Data stored partitioned in Apache Parquet format
  • 22. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Optimization: Small Files It is recommend to merge many small files into fewer larger ones Improves performance by up to 5x when accessing tables containing large number of small files Size (Bytes) File name 14408 kfhconnectblog-parquet-1-2018-05-11-16-01-9e7cc0b631d8.parquet 14408 kfhconnectblog-parquet-1-2018-05-11-16-01-206cf7098588.parquet 14408 kfhconnectblog-parquet-1-2018-05-11-16-03-6a3fa4c14e22.parquet
  • 23. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Inspecting AWS service logs – Catalog AWS Glue Data Catalog Permissions AWS Glue crawler catalogs data schema and partitions Restrict access by defining permissions on databases and tables
  • 24. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Inspecting AWS service logs – Consume AWS Glue Data Catalog Data ConsumerData Engineer
  • 25. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 26. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Athena Workgroups Athena Workgroups are used to isolate queries between different teams, workloads or applications, and to set limits on amount of data each query or the entire workgroup can process Workload Isolation Query Metrics Cost Controls
  • 27. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Workgroups – Workload Isolation Unique query output location per Workgroup Encrypt results with unique AWS KMS key per Workgroup Collect and publish aggregated metrics per Workgroup to AWS CloudWatch Use Workgroup settings eliminating need to configure individual users
  • 28. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Workgroups – Metric Reporting Total bytes scanned per Workgroup Total failed queries per Workgroup Total successful queries per Workgroup Total query execution time per Workgroup
  • 29. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Workgroups – Cost Controls • Per query data scanned threshold; exceeding, will cancel query • Trigger alarms to notify of increasing usage and cost • Disable Workgroup when all queries exceed a maximum threshold Any Athena metric: successful/failed & total queries, query run time, etc.
  • 30. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Workgroups – Usage Notifications Define a hierarchy of alarms to be alerted as usage increases
  • 31. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 32. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Denver, Colorado | San Francisco, California | Irvine, California | London, England 78,000 customers in 100+ countries | 45B emails monthly | 4 offices
  • 33. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. OUR CUSTOMERS Powering the customer engagement for the world’s leading digital brands
  • 34. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. SMTP/API Transactional Marketing Campaigns Promotional Email ActivityParse API
  • 35. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Originally built entirely on prem
  • 36. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Customers were limited to 7 days of data
  • 37. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. High volume customers were more constrained
  • 38. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. New customer base had different needs
  • 39. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Provisioning Risks
  • 40. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 41. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Serverless Elasticity API Integration
  • 42. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Initial architecture during internal beta
  • 43. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Controlled access to Athena
  • 44. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. We tested Athena query times vs file counts
  • 45. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Historical data needed to be handle separately
  • 46. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Athena architecture V3 AWS GlueDynamoDB
  • 47. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Benefits from using Athena Scalability Enable us to support increased Email Activity data in the future with little to no additional cost. Reduced variable costs and ops tickets Improved customer satisfaction regarding access to email data
  • 48. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Serverless Elasticity API Integration
  • 49. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 50. Thank you! © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Roy Hasson royon@amazon.com Shane Andrade shane.andrade@sendgrid.com