SlideShare a Scribd company logo
1 of 21
Beyond The Pretty Charts
Analytics for the rest of us
Toufic Boubez, Ph.D.
Co-Founder, CTO
Metafor Software
2
Toufic intro – who I am
• Co-Founder/CTO Metafor Software
• Co-Founder/CTO Layer 7 Technologies
– API Management
– Acquired by Computer Associates in 2013
• I escaped 
• Building large scale software systems for 20
years (I’m older than I look, I know!)
3
Why this talk?
• DevOps Days Austin: Open Space talk
– Blog: http://metaforsoftware.com/beyond-the-
pretty-charts-a-report-from-devopsdays-in-austin/
• Five major discussion points/lessons learned
• Note: no labels on charts – on purpose!!
• Note: real data
4
Wall of charts
5
1. We’ve moved beyond static thresholds
• Most current monitoring tools assume that
the underlying system is relatively static so we
can surround it with static thresholds and
rules. BUT:
– So what if my unicorn usage is at 91%, and has
been stable at 91% for a while?
– I’d much rather know if it’s at 60% and has been
rapidly increasing over the last few hours.
6
Need more better analytics
• Thresholds won’t help you in this case
• Need some more dynamic analytics
7
2. Context is really important
– Do I really want to be alerted when I know someone is
performing maintenance or backups?
– Is there an event that caused the change in behaviour (e.g. new
deploy)?
– Correlate your event line with your monitoring
Down for maintenance?
8
3. Know your data!!
– You need to understand the statistical properties
of your data, and where it comes from, in order to
determine what kind of analytics to use.
• For example, it’s important to know if your data is
normally distributed.
• http://codeascraft.com/2013/06/11/introducing-kale/
• https://github.com/etsy/skyline/blob/master/src/analy
zer/algorithms.py
– Three-sigma, Grubbs and other algorithms assume normal
distribution
9
What’s normal?
10
What’s my distribution?
11
Another common distribution
12
4. Is all data important to collect?
– Two camps:
• Data is data, let’s collect and analyze everything and
figure out the trends.
• Not all data is important, so let’s figure out what’s
important first and understand the underlying model so
we don’t waste resources on the rest.
– Similar to the very public bun fight between Noam
Chomsky and Peter Norvig
• http://norvig.com/chomsky.html
– Unresolved as far as I know 
13
Do we need both metrics?
14
5. We all want to automate
• Having humans in the way of detecting and
solving DevOps issues doesn’t scale.
• At some point, we need systems that can
detect anomalies before problems become
critical, and take appropriate action.
15
Open Loop Control System:
Heating your house – the wrong way!
• Steps:
– Tweak heater input
– Get to ideal temperature
– Lock gas valve
– Hope nothing changes
Controller
(gas valve)
System
(heater)
Sensor
(thermometer)
16
Controller
(gas valve)
System
(heater)
Sensor
(thermometer)
+
-
delta
desired
temperature
current
temperature
Open Loop Control System:
Heating your house – the right way
• Steps:
– Set the desired temperature
– Sit back and let the system deal with changes
17
Controller System
Sensor
+
-
Puppet
Chef
CFEngine
…
My
Infrastrucutre
Nagios
Cacti
Zabbix
…
?desired
state
current
state
What’s missing to get to self-healing systems
delta
• We have most of the tools already
• Need to add:
– Error tracking (anomaly detection)
– Corrective action
18
How much data do we need?
• Trend towards higher and higher sampling
rates in data collection
• Reminds me of Jorge Luis Borges’ story about
Funes the Memorious
– Perfect recollection of the slightest details of every
instant of his life, but lost the ability for
abstraction
• Our brain works on abstraction
– We notice patterns BECAUSE we can abstract
19
The danger of over-abstraction
+
= comfortable?
20
So, how much data DO you need?
– You don’t need more resolution that twice your
highest frequency (Nyquist-Shanon sampling
theorem)
– Most of the algorithms for analytics will smooth,
average, filter, and pre-process the data.
– Watch out for correlated metrics (e.g. used vs.
available memory)
21
More?
• I want to talk more about analytics, in more
depth, but time’s up!!
– (Actually John won’t let me)
• Come talk to me during the breaks!
• Thank you!

More Related Content

Viewers also liked

Enhance your Agility with DevOps
Enhance your Agility with DevOpsEnhance your Agility with DevOps
Enhance your Agility with DevOpsEdureka!
 
Agile analytics : An exploratory study of technical complexity management
Agile analytics : An exploratory study of technical complexity managementAgile analytics : An exploratory study of technical complexity management
Agile analytics : An exploratory study of technical complexity managementAgnirudra Sikdar
 
Why Use Analytics on Your Software
Why Use Analytics on Your SoftwareWhy Use Analytics on Your Software
Why Use Analytics on Your SoftwareDeskMetrics
 
Analytics for Software Development
Analytics for Software DevelopmentAnalytics for Software Development
Analytics for Software DevelopmentRay Buse
 
Information Needs for Software Development Analytics
Information Needs for Software Development AnalyticsInformation Needs for Software Development Analytics
Information Needs for Software Development AnalyticsRay Buse
 
Analytics for software development
Analytics for software developmentAnalytics for software development
Analytics for software developmentThomas Zimmermann
 
Listen to Your Machines: DevOps Analytics for Better Feedback Loops
Listen to Your Machines: DevOps Analytics for Better Feedback LoopsListen to Your Machines: DevOps Analytics for Better Feedback Loops
Listen to Your Machines: DevOps Analytics for Better Feedback LoopsSplunk
 
Visualization for Software Analytics
Visualization for Software AnalyticsVisualization for Software Analytics
Visualization for Software AnalyticsMargaret-Anne Storey
 
Simple math for anomaly detection toufic boubez - metafor software - monito...
Simple math for anomaly detection   toufic boubez - metafor software - monito...Simple math for anomaly detection   toufic boubez - metafor software - monito...
Simple math for anomaly detection toufic boubez - metafor software - monito...tboubez
 
The Importance of Analytics in Projects and Project Management by Issam Chalouhi
The Importance of Analytics in Projects and Project Management by Issam ChalouhiThe Importance of Analytics in Projects and Project Management by Issam Chalouhi
The Importance of Analytics in Projects and Project Management by Issam ChalouhiPMILebanonChapter
 
Agile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric ApproachAgile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric ApproachSoftServe
 
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...Thoughtworks
 
07. Analytics & Reporting Requirements Template
07. Analytics & Reporting Requirements Template07. Analytics & Reporting Requirements Template
07. Analytics & Reporting Requirements TemplateAlan D. Duncan
 
Open Development Analytics, a step beyond in project transparency
Open Development Analytics, a step beyond in project transparencyOpen Development Analytics, a step beyond in project transparency
Open Development Analytics, a step beyond in project transparencyOW2
 
The Data on DevOps: Making the Case for Awesome
The Data on DevOps: Making the Case for AwesomeThe Data on DevOps: Making the Case for Awesome
The Data on DevOps: Making the Case for AwesomeNicole Forsgren
 

Viewers also liked (17)

Enhance your Agility with DevOps
Enhance your Agility with DevOpsEnhance your Agility with DevOps
Enhance your Agility with DevOps
 
Agile analytics : An exploratory study of technical complexity management
Agile analytics : An exploratory study of technical complexity managementAgile analytics : An exploratory study of technical complexity management
Agile analytics : An exploratory study of technical complexity management
 
GUBI: Agile Analytics [pt-br]
GUBI: Agile Analytics [pt-br]GUBI: Agile Analytics [pt-br]
GUBI: Agile Analytics [pt-br]
 
Why Use Analytics on Your Software
Why Use Analytics on Your SoftwareWhy Use Analytics on Your Software
Why Use Analytics on Your Software
 
Analytics for Software Development
Analytics for Software DevelopmentAnalytics for Software Development
Analytics for Software Development
 
Information Needs for Software Development Analytics
Information Needs for Software Development AnalyticsInformation Needs for Software Development Analytics
Information Needs for Software Development Analytics
 
Analytics for software development
Analytics for software developmentAnalytics for software development
Analytics for software development
 
Listen to Your Machines: DevOps Analytics for Better Feedback Loops
Listen to Your Machines: DevOps Analytics for Better Feedback LoopsListen to Your Machines: DevOps Analytics for Better Feedback Loops
Listen to Your Machines: DevOps Analytics for Better Feedback Loops
 
Visualization for Software Analytics
Visualization for Software AnalyticsVisualization for Software Analytics
Visualization for Software Analytics
 
Simple math for anomaly detection toufic boubez - metafor software - monito...
Simple math for anomaly detection   toufic boubez - metafor software - monito...Simple math for anomaly detection   toufic boubez - metafor software - monito...
Simple math for anomaly detection toufic boubez - metafor software - monito...
 
The Importance of Analytics in Projects and Project Management by Issam Chalouhi
The Importance of Analytics in Projects and Project Management by Issam ChalouhiThe Importance of Analytics in Projects and Project Management by Issam Chalouhi
The Importance of Analytics in Projects and Project Management by Issam Chalouhi
 
Agile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric ApproachAgile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric Approach
 
Lean analytics
Lean analyticsLean analytics
Lean analytics
 
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
 
07. Analytics & Reporting Requirements Template
07. Analytics & Reporting Requirements Template07. Analytics & Reporting Requirements Template
07. Analytics & Reporting Requirements Template
 
Open Development Analytics, a step beyond in project transparency
Open Development Analytics, a step beyond in project transparencyOpen Development Analytics, a step beyond in project transparency
Open Development Analytics, a step beyond in project transparency
 
The Data on DevOps: Making the Case for Awesome
The Data on DevOps: Making the Case for AwesomeThe Data on DevOps: Making the Case for Awesome
The Data on DevOps: Making the Case for Awesome
 

Similar to Beyond pretty charts, Analytics for the rest of us. Toufic Boubez DevOps Days Silicon Valley 2013-06-22

Velocity Europe 2013: Beyond Pretty Charts: Analytics for the cloud infrastru...
Velocity Europe 2013: Beyond Pretty Charts: Analytics for the cloud infrastru...Velocity Europe 2013: Beyond Pretty Charts: Analytics for the cloud infrastru...
Velocity Europe 2013: Beyond Pretty Charts: Analytics for the cloud infrastru...tboubez
 
Data centre analytics toufic boubez-metafor-dev ops days vancouver-2013-10-25
Data centre analytics toufic boubez-metafor-dev ops days vancouver-2013-10-25Data centre analytics toufic boubez-metafor-dev ops days vancouver-2013-10-25
Data centre analytics toufic boubez-metafor-dev ops days vancouver-2013-10-25tboubez
 
Five Things I Learned While Building Anomaly Detection Tools - Toufic Boubez ...
Five Things I Learned While Building Anomaly Detection Tools - Toufic Boubez ...Five Things I Learned While Building Anomaly Detection Tools - Toufic Boubez ...
Five Things I Learned While Building Anomaly Detection Tools - Toufic Boubez ...tboubez
 
Data science and Hadoop
Data science and HadoopData science and Hadoop
Data science and HadoopDonald Miner
 
Analyzing Census Data: Large databases and challenges to statistical softwares
Analyzing Census Data: Large databases and challenges to statistical softwaresAnalyzing Census Data: Large databases and challenges to statistical softwares
Analyzing Census Data: Large databases and challenges to statistical softwaresRogério Barbosa
 
Super Computers
Super ComputersSuper Computers
Super ComputersTHEFPS
 
What does "monitoring" mean? (FOSDEM 2017)
What does "monitoring" mean? (FOSDEM 2017)What does "monitoring" mean? (FOSDEM 2017)
What does "monitoring" mean? (FOSDEM 2017)Brian Brazil
 
Skynet project: Monitor, analyze, scale, and maintain a system in the Cloud
Skynet project: Monitor, analyze, scale, and maintain a system in the CloudSkynet project: Monitor, analyze, scale, and maintain a system in the Cloud
Skynet project: Monitor, analyze, scale, and maintain a system in the CloudSylvain Kalache
 
Big Data Analysis : Deciphering the haystack
Big Data Analysis : Deciphering the haystack Big Data Analysis : Deciphering the haystack
Big Data Analysis : Deciphering the haystack Srinath Perera
 
The Panda Experiment - evolution of DevOps culture at HolidayCheck
The Panda Experiment - evolution of DevOps culture at HolidayCheckThe Panda Experiment - evolution of DevOps culture at HolidayCheck
The Panda Experiment - evolution of DevOps culture at HolidayCheckŁukasz Przybył
 
MongoDB.local Atlanta: MongoDB @ Sensus: Xylem IoT and MongoDB
MongoDB.local Atlanta: MongoDB @ Sensus: Xylem IoT and MongoDBMongoDB.local Atlanta: MongoDB @ Sensus: Xylem IoT and MongoDB
MongoDB.local Atlanta: MongoDB @ Sensus: Xylem IoT and MongoDBMongoDB
 
Anomaly Detection Using the CLA
Anomaly Detection Using the CLAAnomaly Detection Using the CLA
Anomaly Detection Using the CLANumenta
 
Vulnerability, exploit to metasploit
Vulnerability, exploit to metasploitVulnerability, exploit to metasploit
Vulnerability, exploit to metasploitTiago Henriques
 
Management by data
Management by dataManagement by data
Management by dataLuca Foresti
 
MeetBSD2014 Performance Analysis
MeetBSD2014 Performance AnalysisMeetBSD2014 Performance Analysis
MeetBSD2014 Performance AnalysisBrendan Gregg
 
BioIT Trends - 2014 Internet2 Technology Exchange
BioIT Trends - 2014 Internet2 Technology ExchangeBioIT Trends - 2014 Internet2 Technology Exchange
BioIT Trends - 2014 Internet2 Technology ExchangeChris Dagdigian
 
Presentation on Big Data Analytics
Presentation on Big Data AnalyticsPresentation on Big Data Analytics
Presentation on Big Data AnalyticsS P Sajjan
 
From the Benchtop to the Datacenter: HPC Requirements in Life Science Research
From the Benchtop to the Datacenter: HPC Requirements in Life Science ResearchFrom the Benchtop to the Datacenter: HPC Requirements in Life Science Research
From the Benchtop to the Datacenter: HPC Requirements in Life Science ResearchAri Berman
 

Similar to Beyond pretty charts, Analytics for the rest of us. Toufic Boubez DevOps Days Silicon Valley 2013-06-22 (20)

Velocity Europe 2013: Beyond Pretty Charts: Analytics for the cloud infrastru...
Velocity Europe 2013: Beyond Pretty Charts: Analytics for the cloud infrastru...Velocity Europe 2013: Beyond Pretty Charts: Analytics for the cloud infrastru...
Velocity Europe 2013: Beyond Pretty Charts: Analytics for the cloud infrastru...
 
Data centre analytics toufic boubez-metafor-dev ops days vancouver-2013-10-25
Data centre analytics toufic boubez-metafor-dev ops days vancouver-2013-10-25Data centre analytics toufic boubez-metafor-dev ops days vancouver-2013-10-25
Data centre analytics toufic boubez-metafor-dev ops days vancouver-2013-10-25
 
Five Things I Learned While Building Anomaly Detection Tools - Toufic Boubez ...
Five Things I Learned While Building Anomaly Detection Tools - Toufic Boubez ...Five Things I Learned While Building Anomaly Detection Tools - Toufic Boubez ...
Five Things I Learned While Building Anomaly Detection Tools - Toufic Boubez ...
 
Data science and Hadoop
Data science and HadoopData science and Hadoop
Data science and Hadoop
 
Analyzing the census
Analyzing the censusAnalyzing the census
Analyzing the census
 
Analyzing Census Data: Large databases and challenges to statistical softwares
Analyzing Census Data: Large databases and challenges to statistical softwaresAnalyzing Census Data: Large databases and challenges to statistical softwares
Analyzing Census Data: Large databases and challenges to statistical softwares
 
Super Computers
Super ComputersSuper Computers
Super Computers
 
What does "monitoring" mean? (FOSDEM 2017)
What does "monitoring" mean? (FOSDEM 2017)What does "monitoring" mean? (FOSDEM 2017)
What does "monitoring" mean? (FOSDEM 2017)
 
Skynet project: Monitor, analyze, scale, and maintain a system in the Cloud
Skynet project: Monitor, analyze, scale, and maintain a system in the CloudSkynet project: Monitor, analyze, scale, and maintain a system in the Cloud
Skynet project: Monitor, analyze, scale, and maintain a system in the Cloud
 
Big Data Analysis : Deciphering the haystack
Big Data Analysis : Deciphering the haystack Big Data Analysis : Deciphering the haystack
Big Data Analysis : Deciphering the haystack
 
The Panda Experiment - evolution of DevOps culture at HolidayCheck
The Panda Experiment - evolution of DevOps culture at HolidayCheckThe Panda Experiment - evolution of DevOps culture at HolidayCheck
The Panda Experiment - evolution of DevOps culture at HolidayCheck
 
MongoDB.local Atlanta: MongoDB @ Sensus: Xylem IoT and MongoDB
MongoDB.local Atlanta: MongoDB @ Sensus: Xylem IoT and MongoDBMongoDB.local Atlanta: MongoDB @ Sensus: Xylem IoT and MongoDB
MongoDB.local Atlanta: MongoDB @ Sensus: Xylem IoT and MongoDB
 
Anomaly Detection Using the CLA
Anomaly Detection Using the CLAAnomaly Detection Using the CLA
Anomaly Detection Using the CLA
 
Vulnerability, exploit to metasploit
Vulnerability, exploit to metasploitVulnerability, exploit to metasploit
Vulnerability, exploit to metasploit
 
Management by data
Management by dataManagement by data
Management by data
 
MeetBSD2014 Performance Analysis
MeetBSD2014 Performance AnalysisMeetBSD2014 Performance Analysis
MeetBSD2014 Performance Analysis
 
Analyzing social media with Python and other tools (1/4)
Analyzing social media with Python and other tools (1/4)Analyzing social media with Python and other tools (1/4)
Analyzing social media with Python and other tools (1/4)
 
BioIT Trends - 2014 Internet2 Technology Exchange
BioIT Trends - 2014 Internet2 Technology ExchangeBioIT Trends - 2014 Internet2 Technology Exchange
BioIT Trends - 2014 Internet2 Technology Exchange
 
Presentation on Big Data Analytics
Presentation on Big Data AnalyticsPresentation on Big Data Analytics
Presentation on Big Data Analytics
 
From the Benchtop to the Datacenter: HPC Requirements in Life Science Research
From the Benchtop to the Datacenter: HPC Requirements in Life Science ResearchFrom the Benchtop to the Datacenter: HPC Requirements in Life Science Research
From the Benchtop to the Datacenter: HPC Requirements in Life Science Research
 

Recently uploaded

Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 

Recently uploaded (20)

Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 

Beyond pretty charts, Analytics for the rest of us. Toufic Boubez DevOps Days Silicon Valley 2013-06-22

  • 1. Beyond The Pretty Charts Analytics for the rest of us Toufic Boubez, Ph.D. Co-Founder, CTO Metafor Software
  • 2. 2 Toufic intro – who I am • Co-Founder/CTO Metafor Software • Co-Founder/CTO Layer 7 Technologies – API Management – Acquired by Computer Associates in 2013 • I escaped  • Building large scale software systems for 20 years (I’m older than I look, I know!)
  • 3. 3 Why this talk? • DevOps Days Austin: Open Space talk – Blog: http://metaforsoftware.com/beyond-the- pretty-charts-a-report-from-devopsdays-in-austin/ • Five major discussion points/lessons learned • Note: no labels on charts – on purpose!! • Note: real data
  • 5. 5 1. We’ve moved beyond static thresholds • Most current monitoring tools assume that the underlying system is relatively static so we can surround it with static thresholds and rules. BUT: – So what if my unicorn usage is at 91%, and has been stable at 91% for a while? – I’d much rather know if it’s at 60% and has been rapidly increasing over the last few hours.
  • 6. 6 Need more better analytics • Thresholds won’t help you in this case • Need some more dynamic analytics
  • 7. 7 2. Context is really important – Do I really want to be alerted when I know someone is performing maintenance or backups? – Is there an event that caused the change in behaviour (e.g. new deploy)? – Correlate your event line with your monitoring Down for maintenance?
  • 8. 8 3. Know your data!! – You need to understand the statistical properties of your data, and where it comes from, in order to determine what kind of analytics to use. • For example, it’s important to know if your data is normally distributed. • http://codeascraft.com/2013/06/11/introducing-kale/ • https://github.com/etsy/skyline/blob/master/src/analy zer/algorithms.py – Three-sigma, Grubbs and other algorithms assume normal distribution
  • 12. 12 4. Is all data important to collect? – Two camps: • Data is data, let’s collect and analyze everything and figure out the trends. • Not all data is important, so let’s figure out what’s important first and understand the underlying model so we don’t waste resources on the rest. – Similar to the very public bun fight between Noam Chomsky and Peter Norvig • http://norvig.com/chomsky.html – Unresolved as far as I know 
  • 13. 13 Do we need both metrics?
  • 14. 14 5. We all want to automate • Having humans in the way of detecting and solving DevOps issues doesn’t scale. • At some point, we need systems that can detect anomalies before problems become critical, and take appropriate action.
  • 15. 15 Open Loop Control System: Heating your house – the wrong way! • Steps: – Tweak heater input – Get to ideal temperature – Lock gas valve – Hope nothing changes Controller (gas valve) System (heater) Sensor (thermometer)
  • 16. 16 Controller (gas valve) System (heater) Sensor (thermometer) + - delta desired temperature current temperature Open Loop Control System: Heating your house – the right way • Steps: – Set the desired temperature – Sit back and let the system deal with changes
  • 17. 17 Controller System Sensor + - Puppet Chef CFEngine … My Infrastrucutre Nagios Cacti Zabbix … ?desired state current state What’s missing to get to self-healing systems delta • We have most of the tools already • Need to add: – Error tracking (anomaly detection) – Corrective action
  • 18. 18 How much data do we need? • Trend towards higher and higher sampling rates in data collection • Reminds me of Jorge Luis Borges’ story about Funes the Memorious – Perfect recollection of the slightest details of every instant of his life, but lost the ability for abstraction • Our brain works on abstraction – We notice patterns BECAUSE we can abstract
  • 19. 19 The danger of over-abstraction + = comfortable?
  • 20. 20 So, how much data DO you need? – You don’t need more resolution that twice your highest frequency (Nyquist-Shanon sampling theorem) – Most of the algorithms for analytics will smooth, average, filter, and pre-process the data. – Watch out for correlated metrics (e.g. used vs. available memory)
  • 21. 21 More? • I want to talk more about analytics, in more depth, but time’s up!! – (Actually John won’t let me) • Come talk to me during the breaks! • Thank you!