In this session, you will learn how to designed clickstream analytics application and how you can use the same architecture to build your own and be ready to handle the changing world of clickstream data. Dive into how to perform advanced user retention and cohort analysis to make near–real time product and marketing decisions. Learn how to build infrastructure that is fast, easy, and cost-effective with AWS resources such as Amazon Kinesis, Spark on Amazon EMR, Amazon S3, Amazon Redshift, and Amazon Elasticsearch.
8. Answer
• User retention
• High spending customer
navigation pattern
• Product recommendation
• User journey in the shop
• UX improvement
• What deal/ad to try
next
Use case
Data source
• Page
• Click event
• Web log
• Thing event
17. Logstash
AWS SDK
Ingest Store
Bot AWS SDK
App
Crawlers
AWS SDK
Amazon
Kinesis
Firehose
Store
Amazon S3
Data Lake
ElasticSearch
Last 120mins
Analysts
AWS SDK
18.
19. Why do we need machine learning for this?
The social media stream is high-volume, and most of the
messages are not CS-actionable
20. Logstash
AWS SDK
Ingest Store
Bot AWS SDK
App
Crawlers
AWS SDK
Amazon
Kinesis
Process
Amazon
Lambda
Analyze
AWS SDK
Machine
learning
Notification
Action
Support
issue
Database
Feature
request
Keep training the ML model with new data
Action
Amazon S3
21. AWS SDK
Ingest Store
Bot AWS SDK
Messenger
Amazon
Kinesis
Process
Amazon
Lambda
Analysts
Machine
learning
Action
Bot
App
Get prediction
Keep training the ML model with new data
Amazon S3
26. Our Big Data Scale
Total ~25 PB DW on Amazon S3
Read ~10% DW daily
Write ~10% of read data daily
~ 550 billion events daily
~ 350 active platform users