SlideShare a Scribd company logo
1 of 109
Download to read offline
Using visualizations 
to monitor changes and harvest insights 
from log data at Twitter 
Krist Wongsuphasawat & Jimmy Lin 
@kristw 
@lintool
Logging user activities 
& data analysis
Twitter Use Users
Use Users 
Curious 
Twitter 
Product Managers
Use Users 
Curious 
Log data 
in Hadoop Write Twitter 
Instrument 
Engineers 
Product Managers
What are being logged? 
activities 
tweet
What are being logged? 
activities 
tweet from home timeline on twitter.com 
tweet from search page on iPhone
What are being logged? 
activities 
tweet from home timeline on twitter.com 
tweet from search page on iPhone 
sign up 
log in 
retweet 
etc.
Organize?
log event a.k.a. “client event” 
[Lee et al. 2012]
log event a.k.a. “client event” 
1) User ID 
2) Timestamp 
3) Event name 
client : page : section : component : element : action 
web : home : timeline : tweet_box : button : tweet 
4) Event detail 
[Lee et al. 2012]
Log data
Use Users 
Curious 
Log data 
in Hadoop Twitter 
Instrument 
Engineers 
Write 
Product Managers 
bigger than 
Tweet data
Use Users 
Curious 
Engineers 
Log data 
in Hadoop 
Data Scientists 
Ask 
Twitter 
Instrument 
Write 
Product Managers
Use Users 
Curious 
Engineers 
Log data 
in Hadoop 
Find 
Data Scientists 
Ask 
Twitter 
Instrument 
Write 
Product Managers
Log data
Use Users 
Curious 
Engineers 
Log data 
in Hadoop 
Find, Clean 
Data Scientists 
Ask 
Twitter 
Instrument 
Write 
Product Managers
Use Users 
Curious 
Engineers 
Log data 
in Hadoop 
Find, Clean 
Data Scientists 
Monitor 
Ask 
Twitter 
Instrument 
Write 
Product Managers
Use Users 
Curious 
Engineers 
Log data 
in Hadoop 
Find, Clean, Analyze 
Data Scientists 
Monitor 
Ask 
Twitter 
Instrument 
Write 
Product Managers
Log data 
Users in Hadoop 
Find, Clean, Analyze 
Data Scientists Engineers 
Use 
Monitor 
Ask 
Curious 
1 2 
Twitter 
Instrument 
Write 
Product Managers
Part I 
Find & Monitor 
Client Events
Motivation
Log data 
in Hadoop 
Engineers & Data Scientists 
billions of rows
Log data 
in Hadoop 
Aggregate 
Client event collection 
10,000+ event types 
date client page section comp. elem. action count 
20141011 web home home - - impression 100 
20141011 web home wtf - - click 20 
Engineers & Data Scientists
Log data 
in Hadoop 
Aggregate 
Client event collection 
10,000+ event types 
date client page section comp. elem. action count 
20141011 web home home - - impression 100 
20141011 web home wtf - - click 20 
(Who-to-Follow) 
Engineers & Data Scientists
Log data 
in Hadoop 
Aggregate 
Client event collection 
Engineers & Data Scientists
Log data 
in Hadoop 
Aggregate 
Client event collection 
client page section component element action 
Find 
Search 
Engineers & Data Scientists
Log data 
in Hadoop 
Aggregate 
Client event collection 
client page section component element action 
Find 
Search 
Engineers & Data Scientists
section? 
component? 
element?
Client event collection 
Search 
client page section component element action 
Find 
Log data 
in Hadoop 
Aggregate 
web home * * * impression 
Engineers & Data Scientists
Client event collection 
Search 
Query 
client page section component element action 
Find 
Aggregate 
Return 
Log data 
in Hadoop 
Results 
web : home : home : - : - : impression 
web : home : wtf : - : - : impression 
web home * * * impression 
Engineers & Data Scientists
Client event collection 
Search 
Query 
client page section component element action 
Find 
Aggregate 
Return 
Log data 
in Hadoop 
Results 
web : home : home : - : - : impression 
web : home : wtf : - : - : impression 
search can be better 
Engineers & Data Scientists
Client event collection 
Search 
Query 
client page section component element action 
Find 
Aggregate 
Return 
Log data 
in Hadoop 
Results 
web : home : home : - : - : impression 
web : home : wtf : - : - : impression 
10,000+ event types 
search can be better 
Engineers & Data Scientists
Client event collection 
10,000+ event types 
What are all sections under web:home? 
Search 
Query 
not everybody knows 
client page section component element action 
Find 
Aggregate 
Return 
Log data 
in Hadoop 
Results 
web : home : home : - : - : impression 
web : home : wtf : - : - : impression 
search can be better 
Engineers & Data Scientists
Client event collection 
Search 
Query 
client page section component element action 
Find 
Aggregate 
Return 
Log data 
in Hadoop 
Results 
web : home : home : - : - : impression 
search can be better 
one graph / event 
10,000+ event types 
not everybody knows 
What are all sections under web:home? 
Engineers & Data Scientists
Client event collection 
Search 
Query 
client page section component element action 
Find 
Aggregate 
Return 
Log data 
in Hadoop 
Results 
web : home : home : - : - : impression 
search can be better 
one graph / event 
x 10,000 
10,000+ event types 
not everybody knows 
What are all sections under web:home? 
Engineers & Data Scientists
! 
Goals 
• Search for client events 
• Explore client event collection 
• Monitor changes
• Session analysis 
[Lam et al. 2007, Shen et al. 2013] 
! 
Related work 
• Monitor network logs, not user activity logs 
[Ghoniem et al. 2013]
Design
Client event collection 
Engineers & Data Scientists
Client event collection 
See 
Engineers & Data Scientists
narrow down 
See 
Interactions 
search box => filter 
Client event collection 
Engineers & Data Scientists
See 
How to visualize? 
narrow down 
Client event collection 
Engineers & Data Scientists 
Interactions 
search box => filter
Interactions client : page : section : component : element : action 
search box => filter 
See 
How to visualize? 
narrow down 
Client event collection 
Engineers & Data Scientists
Client event hierarchy 
iphone:home:-:-:-:impression 
iphone:home:-:tweet:tweet:click 
iphone home - 
- - impression 
tweet tweet click
Detect changes 
iphone home - 
- - impression 
tweet tweet click 
iphone home - 
- - impression 
tweet tweet click 
TODAY 
7 DAYS AGO 
compared to
Calculate changes 
+5% +5% +5% 
+10% +10% +10% 
-5% -5% -5% 
DIFF
Display changes 
iphone home - 
- - impression 
tweet tweet click 
Map of the Market [Wattenberg 1999], StemView [Guerra-Gomez et al. 2013]
Display changes 
home - 
- - impression 
tweet tweet click 
iphone
Demo 
Scribe Radar
Twitter for Banana
• Since Dec 2013 
• 500 unique users, 10 users / day 
! 
• No training 
Deployment
Use cases 
Users: PMs, Data Scientists, Engineers 
• Search 
• Monitor 
• See effects after major product launch 
read the paper :)
Part II 
Analysis
Count page visits 
home page 
banana : home : - : - : - : impression
Funnel 
home page 
profile page
Funnel analysis 
banana : home : - : - : - : impression 
banana : profile : - : - : - : impression 
home page 1 job 
profile page 
1 hour
Funnel analysis 
home page 
banana : home : - : - : - : impression 
profile page search page 
2 jobs 
2 hours 
banana : profile : - : - : - : impression banana : search : - : - : - : impression
Funnel analysis 
home page 
banana : home : - : - : - : impression 
profile page search page 
banana : profile : - : - : - : impression banana : search : - : - : - : impression 
Specify all funnels manually! 
n jobs 
n hours
Goal 
home page 
banana : home : - : - : - : impression 
… … … 
1 job => all funnels, visualized
• Visualize an overview of event sequences 
! 
Related work 
[Wongsuphasawat et al. 2011, Monroe et al. 2013, …]
Related work 
• Visualize an overview of event sequences 
[Wongsuphasawat et al. 2011, Monroe et al. 2013, …] 
! 
• Big data? eBay checkout sequences 
[Shen et al. 2013] 
! 
One funnel at a time 
Checkout > Payment > Confirm > Success
LifeFlow 
[CHI2011] 
! 
(simplified)
User sessions 
Session#1 
start 
A 
B 
end 
Session#4 
start 
A 
end 
Session#2 
start 
A 
B 
end 
Session#3 
start 
A 
C 
end
Aggregate 
4 sessions 
A 
start 
A A 
B B C 
end end end 
A 
end
Aggregate 
start 
A 
B B C 
end end end 
end 
4 sessions
Aggregate 
C 
start 
end end end 
end 
A 
B 
4 sessions
Aggregate 
C 
start 
end end end 
end 
A 
B 
4 sessions
Aggregate 
4 sessions 
B C 
end 
start 
A 
end end end
Aggregate 
4 sessions 
B C 
end 
start 
A 
end end
Aggregate 
4 sessions 
B C 
end 
start 
A 
end end
Aggregate 
4 sessions 
start 
A 
B C end 
end end
Aggregate 
4,000,000 sessions 
start 
A 
B C end 
end end
try with sample data 
(~millions sessions, 10,000+ event types) 
! 
original paper 
(100,000 sessions, ~10 event types)
not meaningful 
! 
small slice of data 
but huge file
How to make it work?
# of unique sequences
Reduce # of unique sequences 
1. Reduce event types
Reduce # of unique sequences 
1. Reduce event types 
10,000 types select 
tweet 
sign up 
log out
Reduce # of unique sequences 
1. Reduce event types 
10,000 types select 
tweet 
sign up 
log out
Reduce # of unique sequences 
1. Reduce event types 
10,000 types select merge 
tweet from home timeline 
tweet from search page 
tweet … 
= tweet
Reduce # of unique sequences 
1. Reduce event types 
2. Reduce sequence length
Reduce # of unique sequences 
1. Reduce event types 
2. Reduce sequence length 
session 
1000 events
Reduce # of unique sequences 
1. Reduce event types 
2. Reduce sequence length 
session 
10 events after (window size & direction) 
1000 events 
visit home page (alignment)
Reduce # of unique sequences 
1. Reduce event types 
2. Reduce sequence length 
} Ask users for input
Reduce # of unique sequences 
} Ask users for input 
1. Reduce event types 
2. Reduce sequence length 
3. More aggregation on Hadoop
Collapse events 
Sequence 
ABBBCCCC 
ABBCC 
ABC 
ABCCCC 
e.g. 
ABCD 
ABCCCD 
ABCCE 
ABCDF 
ABCDG 
ABCDH 
tweet, tweet, tweet, … = tweet
Collapse events 
Sequence 
ABC 
ABC 
ABC 
ABC 
ABCD 
ABCD 
ABCE 
ABCDF 
ABCDG 
ABCDH
Group & Count 
Sequence 
Count 
ABC 
2000 
ABCD 
80 
ABCE 
20 
ABCDF 
1 
ABCDG 
1 
ABCDH 
1 
… 
…
Group & Count 
Sequence 
Count 
ABC 
2000 
ABCD 
80 
ABCE 
20 
ABCDF 
1 
ABCDG 
1 
ABCDH 
1 
ABCDI 
1 
ABCDJK 
1 
ABCDJL 
1 
rare sequences 
(count < threshold)
Truncate 
Sequence 
ABC 
ABCD 
ABCE 
ABCDx 
ABCDx 
ABCDx 
ABCDx 
ABCDJx 
ABCDJx 
Count 
2000 
80 
20 
1 
1 
1 
1 
1 
1 
Replace last event with x (…)
Group & Count 
Sequence 
ABC 
ABCD 
ABCE 
ABCDx 
ABCDJx 
Count 
2000 
80 
20 
4 
2
Truncate more 
Sequence 
ABC 
ABCD 
ABCE 
ABCDx 
ABCDx 
Count 
2000 
80 
20 
4 
2
Group & Count 
Sequence 
Count 
ABC 
2000 
ABCD 
80 
ABCE 
20 
ABCDx 
6
Final process 
1. Define set of events 
2. Pick alignment, direction and window size 
3. Run Hadoop job (with more aggregation) 
4. Wait for it… (2+ hrs) 
5. Visualize 
gazillion patterns (TBs) 
~100,000 patterns (10MB)
Demo 
Flying Sessions
Deployment 
• Since Jan 2013 
• Fewer users, but more in-depth ad-hoc analysis 
• Initial meeting to provide support
Case studies 
• What did users do when they visit Twitter? (in demo) 
• Where did users give up in the sign up process? 
• more in the paper
Case studies 
click on “sign up” 
fill personal info 
import address book 
etc. 
• What did users do when they visit Twitter? (in demo) 
• Where did users give up in the sign up process? 
• more in the paper
Case studies 
• What did users do when they visit Twitter? (in demo) 
• Where did users give up in the sign up process? 
• more in the paper 
read the paper :)
Conclusions & Future work 
• Large-scale User Activity Logs + Visual Analytics
Conclusions & Future work 
• Large-scale User Activity Logs + Visual Analytics 
• Find, Monitor & Explore 
+ Anomaly detection & automatic alert 
• Funnel Analysis 
+ More interactivity & data / reduce wait time / latency study? 
• Used in day-to-day operations at Twitter
Conclusions & Future work 
Challenge 
big data 
small data 
visualize & interact 
• Large-scale User Activity Logs + Visual Analytics 
• Find, Monitor & Explore 
+ Anomaly detection & automatic alert 
• Funnel Analysis 
+ More interactivity & data / reduce wait time / latency study? 
• Used in day-to-day operations at Twitter 
aggregate 
& sacrifice
Conclusions & Future work 
• Large-scale User Activity Logs + Visual Analytics 
• Find, Monitor & Explore 
+ Anomaly detection & automatic alert 
• Funnel Analysis 
+ More interactivity & data / reduce wait time / latency study? 
• Used in day-to-day operations at Twitter 
• Generalize to smaller systems 
Challenge 
big data 
aggregate 
& sacrifice 
small data 
visualize & interact
Acknowledgement 
• Data Scientists & Engineers @Twitter — Linus Lee, Chuang Liu 
• Feedback from reviewers, Ben Shneiderman & Catherine Plaisant
Conclusions & Future work 
• Large-scale User Activity Logs + Visual Analytics 
• Find, Monitor & Explore 
+ Anomaly detection & automatic alert 
• Funnel Analysis 
+ More interactivity & data / reduce wait time / latency study? 
• Used in day-to-day operations at Twitter 
• Generalize to smaller systems 
Challenge 
big data 
aggregate 
& sacrifice 
small data 
visualize & interact 
kristw@twitter.com / @kristw
Questions?
Thank you

More Related Content

What's hot

Fe Colleges Conference
Fe Colleges ConferenceFe Colleges Conference
Fe Colleges ConferenceAlun John
 
Alexis max-Creating a bot experience as good as your user experience - Alexis...
Alexis max-Creating a bot experience as good as your user experience - Alexis...Alexis max-Creating a bot experience as good as your user experience - Alexis...
Alexis max-Creating a bot experience as good as your user experience - Alexis...WeLoveSEO
 
Redefining technical SEO & how we should be thinking about it as an industry ...
Redefining technical SEO & how we should be thinking about it as an industry ...Redefining technical SEO & how we should be thinking about it as an industry ...
Redefining technical SEO & how we should be thinking about it as an industry ...WeLoveSEO
 
Conducting Twitter Reserch
Conducting Twitter ReserchConducting Twitter Reserch
Conducting Twitter ReserchKim Holmberg
 
Sourcing Candidates Using Twitter and Google+
Sourcing Candidates Using Twitter and Google+Sourcing Candidates Using Twitter and Google+
Sourcing Candidates Using Twitter and Google+HM Revenue & Customs
 
Branko Rihtman Linklove London 2012
Branko Rihtman Linklove London 2012 Branko Rihtman Linklove London 2012
Branko Rihtman Linklove London 2012 Branko Rihtman
 
Linking media, data, and services
Linking media, data, and servicesLinking media, data, and services
Linking media, data, and servicesRuben Verborgh
 
600+ SEARCHABLE Sourcing Tools compiled by Susanna Frazier @ohsusannamarie
600+ SEARCHABLE Sourcing Tools compiled by Susanna Frazier @ohsusannamarie600+ SEARCHABLE Sourcing Tools compiled by Susanna Frazier @ohsusannamarie
600+ SEARCHABLE Sourcing Tools compiled by Susanna Frazier @ohsusannamarieSusanna Frazier
 
Using Scientific Method in SEO
Using Scientific Method in SEOUsing Scientific Method in SEO
Using Scientific Method in SEOBranko Rihtman
 
Dangerous Google searching for secrets
Dangerous Google searching for secretsDangerous Google searching for secrets
Dangerous Google searching for secretsPim Piepers
 
Unleashing Twitter Data for Fun and Insight
Unleashing Twitter Data for Fun and InsightUnleashing Twitter Data for Fun and Insight
Unleashing Twitter Data for Fun and InsightMatthew Russell
 
When there is no Vendor: Statistics for Free Clickthroughs via the Online Cat...
When there is no Vendor: Statistics for Free Clickthroughs via the Online Cat...When there is no Vendor: Statistics for Free Clickthroughs via the Online Cat...
When there is no Vendor: Statistics for Free Clickthroughs via the Online Cat...Christopher Brown
 
Open Source Search Tools for www2010 conferencesourcesearchtoolswww20100426dA...
Open Source Search Tools for www2010 conferencesourcesearchtoolswww20100426dA...Open Source Search Tools for www2010 conferencesourcesearchtoolswww20100426dA...
Open Source Search Tools for www2010 conferencesourcesearchtoolswww20100426dA...Ted Drake
 

What's hot (16)

Fe Colleges Conference
Fe Colleges ConferenceFe Colleges Conference
Fe Colleges Conference
 
Alexis max-Creating a bot experience as good as your user experience - Alexis...
Alexis max-Creating a bot experience as good as your user experience - Alexis...Alexis max-Creating a bot experience as good as your user experience - Alexis...
Alexis max-Creating a bot experience as good as your user experience - Alexis...
 
Redefining technical SEO & how we should be thinking about it as an industry ...
Redefining technical SEO & how we should be thinking about it as an industry ...Redefining technical SEO & how we should be thinking about it as an industry ...
Redefining technical SEO & how we should be thinking about it as an industry ...
 
Conducting Twitter Reserch
Conducting Twitter ReserchConducting Twitter Reserch
Conducting Twitter Reserch
 
Sourcing Candidates Using Twitter and Google+
Sourcing Candidates Using Twitter and Google+Sourcing Candidates Using Twitter and Google+
Sourcing Candidates Using Twitter and Google+
 
Branko Rihtman Linklove London 2012
Branko Rihtman Linklove London 2012 Branko Rihtman Linklove London 2012
Branko Rihtman Linklove London 2012
 
Linking media, data, and services
Linking media, data, and servicesLinking media, data, and services
Linking media, data, and services
 
600+ SEARCHABLE Sourcing Tools compiled by Susanna Frazier @ohsusannamarie
600+ SEARCHABLE Sourcing Tools compiled by Susanna Frazier @ohsusannamarie600+ SEARCHABLE Sourcing Tools compiled by Susanna Frazier @ohsusannamarie
600+ SEARCHABLE Sourcing Tools compiled by Susanna Frazier @ohsusannamarie
 
Using Scientific Method in SEO
Using Scientific Method in SEOUsing Scientific Method in SEO
Using Scientific Method in SEO
 
Google Searchology
Google SearchologyGoogle Searchology
Google Searchology
 
Dangerous Google searching for secrets
Dangerous Google searching for secretsDangerous Google searching for secrets
Dangerous Google searching for secrets
 
Unleashing Twitter Data for Fun and Insight
Unleashing Twitter Data for Fun and InsightUnleashing Twitter Data for Fun and Insight
Unleashing Twitter Data for Fun and Insight
 
BD-ACA Week6
BD-ACA Week6BD-ACA Week6
BD-ACA Week6
 
Information updates january 2011
Information updates january 2011Information updates january 2011
Information updates january 2011
 
When there is no Vendor: Statistics for Free Clickthroughs via the Online Cat...
When there is no Vendor: Statistics for Free Clickthroughs via the Online Cat...When there is no Vendor: Statistics for Free Clickthroughs via the Online Cat...
When there is no Vendor: Statistics for Free Clickthroughs via the Online Cat...
 
Open Source Search Tools for www2010 conferencesourcesearchtoolswww20100426dA...
Open Source Search Tools for www2010 conferencesourcesearchtoolswww20100426dA...Open Source Search Tools for www2010 conferencesourcesearchtoolswww20100426dA...
Open Source Search Tools for www2010 conferencesourcesearchtoolswww20100426dA...
 

Viewers also liked

6 things to expect when you are visualizing
6 things to expect when you are visualizing6 things to expect when you are visualizing
6 things to expect when you are visualizingKrist Wongsuphasawat
 
Dazzing Data Depiction with D3.JS
Dazzing Data Depiction with D3.JSDazzing Data Depiction with D3.JS
Dazzing Data Depiction with D3.JSEric Carlisle
 
Democratising data by igniting a crowd powered movement.
Democratising data by igniting a crowd powered movement.Democratising data by igniting a crowd powered movement.
Democratising data by igniting a crowd powered movement.Steve Jennings
 
New Tools and Old Friends: Data journalism at #BBCdataday
New Tools and Old Friends: Data journalism at #BBCdatadayNew Tools and Old Friends: Data journalism at #BBCdataday
New Tools and Old Friends: Data journalism at #BBCdatadayPaul Bradshaw
 
Outflow: Exploring Flow, Factors and Outcome of Temporal Event Sequences
Outflow: Exploring Flow, Factors and Outcome of Temporal Event SequencesOutflow: Exploring Flow, Factors and Outcome of Temporal Event Sequences
Outflow: Exploring Flow, Factors and Outcome of Temporal Event SequencesKrist Wongsuphasawat
 
Krist Wongsuphasawat's Dissertation Defense: Interactive Exploration of Tempo...
Krist Wongsuphasawat's Dissertation Defense: Interactive Exploration of Tempo...Krist Wongsuphasawat's Dissertation Defense: Interactive Exploration of Tempo...
Krist Wongsuphasawat's Dissertation Defense: Interactive Exploration of Tempo...Krist Wongsuphasawat
 
LifeFlow: Understanding Millions of Event Sequences in a Million Pixels
LifeFlow: Understanding Millions of Event Sequences in a Million PixelsLifeFlow: Understanding Millions of Event Sequences in a Million Pixels
LifeFlow: Understanding Millions of Event Sequences in a Million PixelsKrist Wongsuphasawat
 
Information Visualization for Knowledge Discovery: An Introduction
Information Visualization for Knowledge Discovery: An IntroductionInformation Visualization for Knowledge Discovery: An Introduction
Information Visualization for Knowledge Discovery: An IntroductionKrist Wongsuphasawat
 
Semantic Search in E-Discovery
Semantic Search in E-DiscoverySemantic Search in E-Discovery
Semantic Search in E-DiscoveryDavid Graus
 
Between Minds
Between Minds Between Minds
Between Minds Mindjet
 
Collaboration: A hands-on demo using Confluence wiki
Collaboration: A hands-on demo using Confluence wikiCollaboration: A hands-on demo using Confluence wiki
Collaboration: A hands-on demo using Confluence wikiSarah Maddox
 
Confluence: Collaboration for the Enterprise
Confluence: Collaboration for the EnterpriseConfluence: Collaboration for the Enterprise
Confluence: Collaboration for the EnterpriseClearvision
 
Introduction To Confluence
Introduction To ConfluenceIntroduction To Confluence
Introduction To ConfluenceHua Soon Sim
 
14 steps to build a professional reseller partner program
14 steps to build a professional reseller partner program14 steps to build a professional reseller partner program
14 steps to build a professional reseller partner programDaniel Nilsson
 

Viewers also liked (16)

6 things to expect when you are visualizing
6 things to expect when you are visualizing6 things to expect when you are visualizing
6 things to expect when you are visualizing
 
Dazzing Data Depiction with D3.JS
Dazzing Data Depiction with D3.JSDazzing Data Depiction with D3.JS
Dazzing Data Depiction with D3.JS
 
Democratising data by igniting a crowd powered movement.
Democratising data by igniting a crowd powered movement.Democratising data by igniting a crowd powered movement.
Democratising data by igniting a crowd powered movement.
 
New Tools and Old Friends: Data journalism at #BBCdataday
New Tools and Old Friends: Data journalism at #BBCdatadayNew Tools and Old Friends: Data journalism at #BBCdataday
New Tools and Old Friends: Data journalism at #BBCdataday
 
Outflow: Exploring Flow, Factors and Outcome of Temporal Event Sequences
Outflow: Exploring Flow, Factors and Outcome of Temporal Event SequencesOutflow: Exploring Flow, Factors and Outcome of Temporal Event Sequences
Outflow: Exploring Flow, Factors and Outcome of Temporal Event Sequences
 
Krist Wongsuphasawat's Dissertation Defense: Interactive Exploration of Tempo...
Krist Wongsuphasawat's Dissertation Defense: Interactive Exploration of Tempo...Krist Wongsuphasawat's Dissertation Defense: Interactive Exploration of Tempo...
Krist Wongsuphasawat's Dissertation Defense: Interactive Exploration of Tempo...
 
LifeFlow: Understanding Millions of Event Sequences in a Million Pixels
LifeFlow: Understanding Millions of Event Sequences in a Million PixelsLifeFlow: Understanding Millions of Event Sequences in a Million Pixels
LifeFlow: Understanding Millions of Event Sequences in a Million Pixels
 
Information Visualization for Knowledge Discovery: An Introduction
Information Visualization for Knowledge Discovery: An IntroductionInformation Visualization for Knowledge Discovery: An Introduction
Information Visualization for Knowledge Discovery: An Introduction
 
Semantic Search in E-Discovery
Semantic Search in E-DiscoverySemantic Search in E-Discovery
Semantic Search in E-Discovery
 
Between Minds
Between Minds Between Minds
Between Minds
 
Collaboration: A hands-on demo using Confluence wiki
Collaboration: A hands-on demo using Confluence wikiCollaboration: A hands-on demo using Confluence wiki
Collaboration: A hands-on demo using Confluence wiki
 
Confluence: Collaboration for the Enterprise
Confluence: Collaboration for the EnterpriseConfluence: Collaboration for the Enterprise
Confluence: Collaboration for the Enterprise
 
d3Kit
d3Kitd3Kit
d3Kit
 
Art of Visual Thinking
Art of Visual ThinkingArt of Visual Thinking
Art of Visual Thinking
 
Introduction To Confluence
Introduction To ConfluenceIntroduction To Confluence
Introduction To Confluence
 
14 steps to build a professional reseller partner program
14 steps to build a professional reseller partner program14 steps to build a professional reseller partner program
14 steps to build a professional reseller partner program
 

Similar to Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Growing into a proactive Data Platform
Growing into a proactive Data PlatformGrowing into a proactive Data Platform
Growing into a proactive Data PlatformLivePerson
 
Automate Your Business with Atlassian
Automate Your Business with AtlassianAutomate Your Business with Atlassian
Automate Your Business with AtlassianAtlassian
 
AnDevCon - Tracking User Behavior Creatively
AnDevCon - Tracking User Behavior CreativelyAnDevCon - Tracking User Behavior Creatively
AnDevCon - Tracking User Behavior CreativelyKiana Tennyson
 
Why And When Should We Consider Stream Processing In Our Solutions Teqnation ...
Why And When Should We Consider Stream Processing In Our Solutions Teqnation ...Why And When Should We Consider Stream Processing In Our Solutions Teqnation ...
Why And When Should We Consider Stream Processing In Our Solutions Teqnation ...Soroosh Khodami
 
Developer friendly open data
Developer friendly open dataDeveloper friendly open data
Developer friendly open dataAlbert O'Connor
 
Best Practices: What to Track with Your Analytics
Best Practices: What to Track with Your AnalyticsBest Practices: What to Track with Your Analytics
Best Practices: What to Track with Your AnalyticsKissmetrics on SlideShare
 
Building the BI system and analytics capabilities at the company based on Rea...
Building the BI system and analytics capabilities at the company based on Rea...Building the BI system and analytics capabilities at the company based on Rea...
Building the BI system and analytics capabilities at the company based on Rea...GameCamp
 
Building Client-side Search Applications with Solr
Building Client-side Search Applications with SolrBuilding Client-side Search Applications with Solr
Building Client-side Search Applications with Solrlucenerevolution
 
Data Visualization: A Quick Tour for Data Science Enthusiasts
Data Visualization: A Quick Tour for Data Science EnthusiastsData Visualization: A Quick Tour for Data Science Enthusiasts
Data Visualization: A Quick Tour for Data Science EnthusiastsKrist Wongsuphasawat
 
QA Fest 2018. Slavik Pashanin. Analytics Data Driven Testing
QA Fest 2018. Slavik Pashanin. Analytics Data Driven TestingQA Fest 2018. Slavik Pashanin. Analytics Data Driven Testing
QA Fest 2018. Slavik Pashanin. Analytics Data Driven TestingQAFest
 
Backstage 2019 - The Atlassian Journey with Amplitude - Itzik Feldman
Backstage 2019 - The Atlassian Journey with Amplitude - Itzik FeldmanBackstage 2019 - The Atlassian Journey with Amplitude - Itzik Feldman
Backstage 2019 - The Atlassian Journey with Amplitude - Itzik FeldmanAmplitude
 
Designing Evidence - Planning how to capture specific user behaviour as reada...
Designing Evidence - Planning how to capture specific user behaviour as reada...Designing Evidence - Planning how to capture specific user behaviour as reada...
Designing Evidence - Planning how to capture specific user behaviour as reada...Andrew Lewis
 
Me and my importers
Me and my importersMe and my importers
Me and my importersDonny Wals
 
You Put *What* in Your Stream?! Patterns and Practices for Event Design with ...
You Put *What* in Your Stream?! Patterns and Practices for Event Design with ...You Put *What* in Your Stream?! Patterns and Practices for Event Design with ...
You Put *What* in Your Stream?! Patterns and Practices for Event Design with ...HostedbyConfluent
 
DataXDay - Building a Real Time Analytics API at Scale
DataXDay - Building a Real Time Analytics API at ScaleDataXDay - Building a Real Time Analytics API at Scale
DataXDay - Building a Real Time Analytics API at ScaleDataXDay Conference by Xebia
 
An Architecture for Agile Machine Learning in Real-Time Applications
An Architecture for Agile Machine Learning in Real-Time ApplicationsAn Architecture for Agile Machine Learning in Real-Time Applications
An Architecture for Agile Machine Learning in Real-Time ApplicationsJohann Schleier-Smith
 
PredictionIO - Building Applications That Predict User Behavior Through Big D...
PredictionIO - Building Applications That Predict User Behavior Through Big D...PredictionIO - Building Applications That Predict User Behavior Through Big D...
PredictionIO - Building Applications That Predict User Behavior Through Big D...predictionio
 

Similar to Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter (20)

Tactical Information Gathering
Tactical Information GatheringTactical Information Gathering
Tactical Information Gathering
 
Growing into a proactive Data Platform
Growing into a proactive Data PlatformGrowing into a proactive Data Platform
Growing into a proactive Data Platform
 
Automate Your Business with Atlassian
Automate Your Business with AtlassianAutomate Your Business with Atlassian
Automate Your Business with Atlassian
 
AnDevCon - Tracking User Behavior Creatively
AnDevCon - Tracking User Behavior CreativelyAnDevCon - Tracking User Behavior Creatively
AnDevCon - Tracking User Behavior Creatively
 
Why And When Should We Consider Stream Processing In Our Solutions Teqnation ...
Why And When Should We Consider Stream Processing In Our Solutions Teqnation ...Why And When Should We Consider Stream Processing In Our Solutions Teqnation ...
Why And When Should We Consider Stream Processing In Our Solutions Teqnation ...
 
Developer friendly open data
Developer friendly open dataDeveloper friendly open data
Developer friendly open data
 
Best Practices: What to Track with Your Analytics
Best Practices: What to Track with Your AnalyticsBest Practices: What to Track with Your Analytics
Best Practices: What to Track with Your Analytics
 
Building the BI system and analytics capabilities at the company based on Rea...
Building the BI system and analytics capabilities at the company based on Rea...Building the BI system and analytics capabilities at the company based on Rea...
Building the BI system and analytics capabilities at the company based on Rea...
 
Building Client-side Search Applications with Solr
Building Client-side Search Applications with SolrBuilding Client-side Search Applications with Solr
Building Client-side Search Applications with Solr
 
Data Visualization: A Quick Tour for Data Science Enthusiasts
Data Visualization: A Quick Tour for Data Science EnthusiastsData Visualization: A Quick Tour for Data Science Enthusiasts
Data Visualization: A Quick Tour for Data Science Enthusiasts
 
Data Visualization
Data VisualizationData Visualization
Data Visualization
 
QA Fest 2018. Slavik Pashanin. Analytics Data Driven Testing
QA Fest 2018. Slavik Pashanin. Analytics Data Driven TestingQA Fest 2018. Slavik Pashanin. Analytics Data Driven Testing
QA Fest 2018. Slavik Pashanin. Analytics Data Driven Testing
 
Backstage 2019 - The Atlassian Journey with Amplitude - Itzik Feldman
Backstage 2019 - The Atlassian Journey with Amplitude - Itzik FeldmanBackstage 2019 - The Atlassian Journey with Amplitude - Itzik Feldman
Backstage 2019 - The Atlassian Journey with Amplitude - Itzik Feldman
 
Refactoring a web application with Python
Refactoring a web application with PythonRefactoring a web application with Python
Refactoring a web application with Python
 
Designing Evidence - Planning how to capture specific user behaviour as reada...
Designing Evidence - Planning how to capture specific user behaviour as reada...Designing Evidence - Planning how to capture specific user behaviour as reada...
Designing Evidence - Planning how to capture specific user behaviour as reada...
 
Me and my importers
Me and my importersMe and my importers
Me and my importers
 
You Put *What* in Your Stream?! Patterns and Practices for Event Design with ...
You Put *What* in Your Stream?! Patterns and Practices for Event Design with ...You Put *What* in Your Stream?! Patterns and Practices for Event Design with ...
You Put *What* in Your Stream?! Patterns and Practices for Event Design with ...
 
DataXDay - Building a Real Time Analytics API at Scale
DataXDay - Building a Real Time Analytics API at ScaleDataXDay - Building a Real Time Analytics API at Scale
DataXDay - Building a Real Time Analytics API at Scale
 
An Architecture for Agile Machine Learning in Real-Time Applications
An Architecture for Agile Machine Learning in Real-Time ApplicationsAn Architecture for Agile Machine Learning in Real-Time Applications
An Architecture for Agile Machine Learning in Real-Time Applications
 
PredictionIO - Building Applications That Predict User Behavior Through Big D...
PredictionIO - Building Applications That Predict User Behavior Through Big D...PredictionIO - Building Applications That Predict User Behavior Through Big D...
PredictionIO - Building Applications That Predict User Behavior Through Big D...
 

More from Krist Wongsuphasawat

What I tell myself before visualizing
What I tell myself before visualizingWhat I tell myself before visualizing
What I tell myself before visualizingKrist Wongsuphasawat
 
Navigating the Wide World of Data Visualization Libraries
Navigating the Wide World of Data Visualization LibrariesNavigating the Wide World of Data Visualization Libraries
Navigating the Wide World of Data Visualization LibrariesKrist Wongsuphasawat
 
Encodable: Configurable Grammar for Visualization Components
Encodable: Configurable Grammar for Visualization ComponentsEncodable: Configurable Grammar for Visualization Components
Encodable: Configurable Grammar for Visualization ComponentsKrist Wongsuphasawat
 
6 things to expect when you are visualizing (2020 Edition)
6 things to expect when you are visualizing (2020 Edition)6 things to expect when you are visualizing (2020 Edition)
6 things to expect when you are visualizing (2020 Edition)Krist Wongsuphasawat
 
What to expect when you are visualizing (v.2)
What to expect when you are visualizing (v.2)What to expect when you are visualizing (v.2)
What to expect when you are visualizing (v.2)Krist Wongsuphasawat
 
ร้อยเรื่องราวจากข้อมูล / Storytelling with Data
ร้อยเรื่องราวจากข้อมูล / Storytelling with Dataร้อยเรื่องราวจากข้อมูล / Storytelling with Data
ร้อยเรื่องราวจากข้อมูล / Storytelling with DataKrist Wongsuphasawat
 
Reveal the talking points of every episode of Game of Thrones from fans' conv...
Reveal the talking points of every episode of Game of Thrones from fans' conv...Reveal the talking points of every episode of Game of Thrones from fans' conv...
Reveal the talking points of every episode of Game of Thrones from fans' conv...Krist Wongsuphasawat
 
From Data to Visualization, what happens in between?
From Data to Visualization, what happens in between?From Data to Visualization, what happens in between?
From Data to Visualization, what happens in between?Krist Wongsuphasawat
 
A Narrative Display for Sports Tournament Recap
A Narrative Display for Sports Tournament RecapA Narrative Display for Sports Tournament Recap
A Narrative Display for Sports Tournament RecapKrist Wongsuphasawat
 
Visualization for Event Sequences Exploration
Visualization for Event Sequences ExplorationVisualization for Event Sequences Exploration
Visualization for Event Sequences ExplorationKrist Wongsuphasawat
 
Krist Wongsuphasawat's Dissertation Proposal Slides: Interactive Exploration ...
Krist Wongsuphasawat's Dissertation Proposal Slides: Interactive Exploration ...Krist Wongsuphasawat's Dissertation Proposal Slides: Interactive Exploration ...
Krist Wongsuphasawat's Dissertation Proposal Slides: Interactive Exploration ...Krist Wongsuphasawat
 
Information Visualization for Knowledge Discovery
Information Visualization for Knowledge DiscoveryInformation Visualization for Knowledge Discovery
Information Visualization for Knowledge DiscoveryKrist Wongsuphasawat
 
Information Visualization for Health Care
Information Visualization for Health CareInformation Visualization for Health Care
Information Visualization for Health CareKrist Wongsuphasawat
 
Finding Comparable Temporal Categorical Records: A Similarity Measure with an...
Finding Comparable Temporal Categorical Records: A Similarity Measure with an...Finding Comparable Temporal Categorical Records: A Similarity Measure with an...
Finding Comparable Temporal Categorical Records: A Similarity Measure with an...Krist Wongsuphasawat
 
Outflow: Visualizing Patients Flow by Symptoms & Outcome
Outflow: Visualizing Patients Flow by Symptoms & OutcomeOutflow: Visualizing Patients Flow by Symptoms & Outcome
Outflow: Visualizing Patients Flow by Symptoms & OutcomeKrist Wongsuphasawat
 

More from Krist Wongsuphasawat (17)

What I tell myself before visualizing
What I tell myself before visualizingWhat I tell myself before visualizing
What I tell myself before visualizing
 
Navigating the Wide World of Data Visualization Libraries
Navigating the Wide World of Data Visualization LibrariesNavigating the Wide World of Data Visualization Libraries
Navigating the Wide World of Data Visualization Libraries
 
Encodable: Configurable Grammar for Visualization Components
Encodable: Configurable Grammar for Visualization ComponentsEncodable: Configurable Grammar for Visualization Components
Encodable: Configurable Grammar for Visualization Components
 
6 things to expect when you are visualizing (2020 Edition)
6 things to expect when you are visualizing (2020 Edition)6 things to expect when you are visualizing (2020 Edition)
6 things to expect when you are visualizing (2020 Edition)
 
What to expect when you are visualizing (v.2)
What to expect when you are visualizing (v.2)What to expect when you are visualizing (v.2)
What to expect when you are visualizing (v.2)
 
ร้อยเรื่องราวจากข้อมูล / Storytelling with Data
ร้อยเรื่องราวจากข้อมูล / Storytelling with Dataร้อยเรื่องราวจากข้อมูล / Storytelling with Data
ร้อยเรื่องราวจากข้อมูล / Storytelling with Data
 
Reveal the talking points of every episode of Game of Thrones from fans' conv...
Reveal the talking points of every episode of Game of Thrones from fans' conv...Reveal the talking points of every episode of Game of Thrones from fans' conv...
Reveal the talking points of every episode of Game of Thrones from fans' conv...
 
From Data to Visualization, what happens in between?
From Data to Visualization, what happens in between?From Data to Visualization, what happens in between?
From Data to Visualization, what happens in between?
 
A Narrative Display for Sports Tournament Recap
A Narrative Display for Sports Tournament RecapA Narrative Display for Sports Tournament Recap
A Narrative Display for Sports Tournament Recap
 
Visualization for Event Sequences Exploration
Visualization for Event Sequences ExplorationVisualization for Event Sequences Exploration
Visualization for Event Sequences Exploration
 
Krist Wongsuphasawat's Dissertation Proposal Slides: Interactive Exploration ...
Krist Wongsuphasawat's Dissertation Proposal Slides: Interactive Exploration ...Krist Wongsuphasawat's Dissertation Proposal Slides: Interactive Exploration ...
Krist Wongsuphasawat's Dissertation Proposal Slides: Interactive Exploration ...
 
Usability of Google Docs
Usability of Google DocsUsability of Google Docs
Usability of Google Docs
 
Information Visualization for Knowledge Discovery
Information Visualization for Knowledge DiscoveryInformation Visualization for Knowledge Discovery
Information Visualization for Knowledge Discovery
 
Information Visualization for Health Care
Information Visualization for Health CareInformation Visualization for Health Care
Information Visualization for Health Care
 
Finding Comparable Temporal Categorical Records: A Similarity Measure with an...
Finding Comparable Temporal Categorical Records: A Similarity Measure with an...Finding Comparable Temporal Categorical Records: A Similarity Measure with an...
Finding Comparable Temporal Categorical Records: A Similarity Measure with an...
 
Outflow: Visualizing Patients Flow by Symptoms & Outcome
Outflow: Visualizing Patients Flow by Symptoms & OutcomeOutflow: Visualizing Patients Flow by Symptoms & Outcome
Outflow: Visualizing Patients Flow by Symptoms & Outcome
 
Finding Patterns in Temporal Data
Finding Patterns in Temporal DataFinding Patterns in Temporal Data
Finding Patterns in Temporal Data
 

Recently uploaded

RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhThiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhYasamin16
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectBoston Institute of Analytics
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...ssuserf63bd7
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...limedy534
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...GQ Research
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024Timothy Spann
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesTimothy Spann
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 

Recently uploaded (20)

RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhThiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 

Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

  • 1. Using visualizations to monitor changes and harvest insights from log data at Twitter Krist Wongsuphasawat & Jimmy Lin @kristw @lintool
  • 2. Logging user activities & data analysis
  • 4. Use Users Curious Twitter Product Managers
  • 5. Use Users Curious Log data in Hadoop Write Twitter Instrument Engineers Product Managers
  • 6. What are being logged? activities tweet
  • 7. What are being logged? activities tweet from home timeline on twitter.com tweet from search page on iPhone
  • 8. What are being logged? activities tweet from home timeline on twitter.com tweet from search page on iPhone sign up log in retweet etc.
  • 10. log event a.k.a. “client event” [Lee et al. 2012]
  • 11. log event a.k.a. “client event” 1) User ID 2) Timestamp 3) Event name client : page : section : component : element : action web : home : timeline : tweet_box : button : tweet 4) Event detail [Lee et al. 2012]
  • 13. Use Users Curious Log data in Hadoop Twitter Instrument Engineers Write Product Managers bigger than Tweet data
  • 14. Use Users Curious Engineers Log data in Hadoop Data Scientists Ask Twitter Instrument Write Product Managers
  • 15. Use Users Curious Engineers Log data in Hadoop Find Data Scientists Ask Twitter Instrument Write Product Managers
  • 17. Use Users Curious Engineers Log data in Hadoop Find, Clean Data Scientists Ask Twitter Instrument Write Product Managers
  • 18. Use Users Curious Engineers Log data in Hadoop Find, Clean Data Scientists Monitor Ask Twitter Instrument Write Product Managers
  • 19. Use Users Curious Engineers Log data in Hadoop Find, Clean, Analyze Data Scientists Monitor Ask Twitter Instrument Write Product Managers
  • 20. Log data Users in Hadoop Find, Clean, Analyze Data Scientists Engineers Use Monitor Ask Curious 1 2 Twitter Instrument Write Product Managers
  • 21. Part I Find & Monitor Client Events
  • 23. Log data in Hadoop Engineers & Data Scientists billions of rows
  • 24. Log data in Hadoop Aggregate Client event collection 10,000+ event types date client page section comp. elem. action count 20141011 web home home - - impression 100 20141011 web home wtf - - click 20 Engineers & Data Scientists
  • 25. Log data in Hadoop Aggregate Client event collection 10,000+ event types date client page section comp. elem. action count 20141011 web home home - - impression 100 20141011 web home wtf - - click 20 (Who-to-Follow) Engineers & Data Scientists
  • 26. Log data in Hadoop Aggregate Client event collection Engineers & Data Scientists
  • 27. Log data in Hadoop Aggregate Client event collection client page section component element action Find Search Engineers & Data Scientists
  • 28. Log data in Hadoop Aggregate Client event collection client page section component element action Find Search Engineers & Data Scientists
  • 30. Client event collection Search client page section component element action Find Log data in Hadoop Aggregate web home * * * impression Engineers & Data Scientists
  • 31. Client event collection Search Query client page section component element action Find Aggregate Return Log data in Hadoop Results web : home : home : - : - : impression web : home : wtf : - : - : impression web home * * * impression Engineers & Data Scientists
  • 32. Client event collection Search Query client page section component element action Find Aggregate Return Log data in Hadoop Results web : home : home : - : - : impression web : home : wtf : - : - : impression search can be better Engineers & Data Scientists
  • 33. Client event collection Search Query client page section component element action Find Aggregate Return Log data in Hadoop Results web : home : home : - : - : impression web : home : wtf : - : - : impression 10,000+ event types search can be better Engineers & Data Scientists
  • 34. Client event collection 10,000+ event types What are all sections under web:home? Search Query not everybody knows client page section component element action Find Aggregate Return Log data in Hadoop Results web : home : home : - : - : impression web : home : wtf : - : - : impression search can be better Engineers & Data Scientists
  • 35. Client event collection Search Query client page section component element action Find Aggregate Return Log data in Hadoop Results web : home : home : - : - : impression search can be better one graph / event 10,000+ event types not everybody knows What are all sections under web:home? Engineers & Data Scientists
  • 36. Client event collection Search Query client page section component element action Find Aggregate Return Log data in Hadoop Results web : home : home : - : - : impression search can be better one graph / event x 10,000 10,000+ event types not everybody knows What are all sections under web:home? Engineers & Data Scientists
  • 37. ! Goals • Search for client events • Explore client event collection • Monitor changes
  • 38. • Session analysis [Lam et al. 2007, Shen et al. 2013] ! Related work • Monitor network logs, not user activity logs [Ghoniem et al. 2013]
  • 40. Client event collection Engineers & Data Scientists
  • 41. Client event collection See Engineers & Data Scientists
  • 42. narrow down See Interactions search box => filter Client event collection Engineers & Data Scientists
  • 43. See How to visualize? narrow down Client event collection Engineers & Data Scientists Interactions search box => filter
  • 44. Interactions client : page : section : component : element : action search box => filter See How to visualize? narrow down Client event collection Engineers & Data Scientists
  • 45. Client event hierarchy iphone:home:-:-:-:impression iphone:home:-:tweet:tweet:click iphone home - - - impression tweet tweet click
  • 46. Detect changes iphone home - - - impression tweet tweet click iphone home - - - impression tweet tweet click TODAY 7 DAYS AGO compared to
  • 47. Calculate changes +5% +5% +5% +10% +10% +10% -5% -5% -5% DIFF
  • 48. Display changes iphone home - - - impression tweet tweet click Map of the Market [Wattenberg 1999], StemView [Guerra-Gomez et al. 2013]
  • 49. Display changes home - - - impression tweet tweet click iphone
  • 52.
  • 53. • Since Dec 2013 • 500 unique users, 10 users / day ! • No training Deployment
  • 54. Use cases Users: PMs, Data Scientists, Engineers • Search • Monitor • See effects after major product launch read the paper :)
  • 56. Count page visits home page banana : home : - : - : - : impression
  • 57. Funnel home page profile page
  • 58. Funnel analysis banana : home : - : - : - : impression banana : profile : - : - : - : impression home page 1 job profile page 1 hour
  • 59. Funnel analysis home page banana : home : - : - : - : impression profile page search page 2 jobs 2 hours banana : profile : - : - : - : impression banana : search : - : - : - : impression
  • 60. Funnel analysis home page banana : home : - : - : - : impression profile page search page banana : profile : - : - : - : impression banana : search : - : - : - : impression Specify all funnels manually! n jobs n hours
  • 61. Goal home page banana : home : - : - : - : impression … … … 1 job => all funnels, visualized
  • 62. • Visualize an overview of event sequences ! Related work [Wongsuphasawat et al. 2011, Monroe et al. 2013, …]
  • 63. Related work • Visualize an overview of event sequences [Wongsuphasawat et al. 2011, Monroe et al. 2013, …] ! • Big data? eBay checkout sequences [Shen et al. 2013] ! One funnel at a time Checkout > Payment > Confirm > Success
  • 64. LifeFlow [CHI2011] ! (simplified)
  • 65. User sessions Session#1 start A B end Session#4 start A end Session#2 start A B end Session#3 start A C end
  • 66. Aggregate 4 sessions A start A A B B C end end end A end
  • 67. Aggregate start A B B C end end end end 4 sessions
  • 68. Aggregate C start end end end end A B 4 sessions
  • 69. Aggregate C start end end end end A B 4 sessions
  • 70. Aggregate 4 sessions B C end start A end end end
  • 71. Aggregate 4 sessions B C end start A end end
  • 72. Aggregate 4 sessions B C end start A end end
  • 73. Aggregate 4 sessions start A B C end end end
  • 74. Aggregate 4,000,000 sessions start A B C end end end
  • 75. try with sample data (~millions sessions, 10,000+ event types) ! original paper (100,000 sessions, ~10 event types)
  • 76. not meaningful ! small slice of data but huge file
  • 77. How to make it work?
  • 78. # of unique sequences
  • 79. Reduce # of unique sequences 1. Reduce event types
  • 80. Reduce # of unique sequences 1. Reduce event types 10,000 types select tweet sign up log out
  • 81. Reduce # of unique sequences 1. Reduce event types 10,000 types select tweet sign up log out
  • 82. Reduce # of unique sequences 1. Reduce event types 10,000 types select merge tweet from home timeline tweet from search page tweet … = tweet
  • 83. Reduce # of unique sequences 1. Reduce event types 2. Reduce sequence length
  • 84. Reduce # of unique sequences 1. Reduce event types 2. Reduce sequence length session 1000 events
  • 85. Reduce # of unique sequences 1. Reduce event types 2. Reduce sequence length session 10 events after (window size & direction) 1000 events visit home page (alignment)
  • 86. Reduce # of unique sequences 1. Reduce event types 2. Reduce sequence length } Ask users for input
  • 87. Reduce # of unique sequences } Ask users for input 1. Reduce event types 2. Reduce sequence length 3. More aggregation on Hadoop
  • 88. Collapse events Sequence ABBBCCCC ABBCC ABC ABCCCC e.g. ABCD ABCCCD ABCCE ABCDF ABCDG ABCDH tweet, tweet, tweet, … = tweet
  • 89. Collapse events Sequence ABC ABC ABC ABC ABCD ABCD ABCE ABCDF ABCDG ABCDH
  • 90. Group & Count Sequence Count ABC 2000 ABCD 80 ABCE 20 ABCDF 1 ABCDG 1 ABCDH 1 … …
  • 91. Group & Count Sequence Count ABC 2000 ABCD 80 ABCE 20 ABCDF 1 ABCDG 1 ABCDH 1 ABCDI 1 ABCDJK 1 ABCDJL 1 rare sequences (count < threshold)
  • 92. Truncate Sequence ABC ABCD ABCE ABCDx ABCDx ABCDx ABCDx ABCDJx ABCDJx Count 2000 80 20 1 1 1 1 1 1 Replace last event with x (…)
  • 93. Group & Count Sequence ABC ABCD ABCE ABCDx ABCDJx Count 2000 80 20 4 2
  • 94. Truncate more Sequence ABC ABCD ABCE ABCDx ABCDx Count 2000 80 20 4 2
  • 95. Group & Count Sequence Count ABC 2000 ABCD 80 ABCE 20 ABCDx 6
  • 96. Final process 1. Define set of events 2. Pick alignment, direction and window size 3. Run Hadoop job (with more aggregation) 4. Wait for it… (2+ hrs) 5. Visualize gazillion patterns (TBs) ~100,000 patterns (10MB)
  • 98. Deployment • Since Jan 2013 • Fewer users, but more in-depth ad-hoc analysis • Initial meeting to provide support
  • 99. Case studies • What did users do when they visit Twitter? (in demo) • Where did users give up in the sign up process? • more in the paper
  • 100. Case studies click on “sign up” fill personal info import address book etc. • What did users do when they visit Twitter? (in demo) • Where did users give up in the sign up process? • more in the paper
  • 101. Case studies • What did users do when they visit Twitter? (in demo) • Where did users give up in the sign up process? • more in the paper read the paper :)
  • 102. Conclusions & Future work • Large-scale User Activity Logs + Visual Analytics
  • 103. Conclusions & Future work • Large-scale User Activity Logs + Visual Analytics • Find, Monitor & Explore + Anomaly detection & automatic alert • Funnel Analysis + More interactivity & data / reduce wait time / latency study? • Used in day-to-day operations at Twitter
  • 104. Conclusions & Future work Challenge big data small data visualize & interact • Large-scale User Activity Logs + Visual Analytics • Find, Monitor & Explore + Anomaly detection & automatic alert • Funnel Analysis + More interactivity & data / reduce wait time / latency study? • Used in day-to-day operations at Twitter aggregate & sacrifice
  • 105. Conclusions & Future work • Large-scale User Activity Logs + Visual Analytics • Find, Monitor & Explore + Anomaly detection & automatic alert • Funnel Analysis + More interactivity & data / reduce wait time / latency study? • Used in day-to-day operations at Twitter • Generalize to smaller systems Challenge big data aggregate & sacrifice small data visualize & interact
  • 106. Acknowledgement • Data Scientists & Engineers @Twitter — Linus Lee, Chuang Liu • Feedback from reviewers, Ben Shneiderman & Catherine Plaisant
  • 107. Conclusions & Future work • Large-scale User Activity Logs + Visual Analytics • Find, Monitor & Explore + Anomaly detection & automatic alert • Funnel Analysis + More interactivity & data / reduce wait time / latency study? • Used in day-to-day operations at Twitter • Generalize to smaller systems Challenge big data aggregate & sacrifice small data visualize & interact kristw@twitter.com / @kristw