So you want to switch off?
Time to say goodbye to your Nagios-based setup!
@olivjan - ojan@monitoring-fr.org © 2014 - Olivier Jan - Check my Website
About me 
❖ System admin and architect 
❖ Co-founder of « Communauté Francophone de la Supervision Libre » 
❖ Writer of the book « Nagios 3 au coeur de la supervision Open Source » 
❖ Co-founder of Check my Website, a SaaS service for remote monitoring of 
websites and applications (current)
Content 
❖ Why switch off? The good, and maybe not so good, reasons to do so!
❖ Which way to take ? 
❖ Building a monitoring solution without Nagios : 
❖ Tools available 
❖ A personal work in progress 
❖ Migrating from Nagios to this kind of solution
Some reasons to switch off… 
❖ The godfather of OSS monitoring is dead as an 
Open Source project ? 
❖ Can’t do better with it 
❖ Cool new kids out there 
❖ Better « cloud » support 
❖ Clear states, metrics and messages monitoring 
distinction 
❖ Better charting solution 
❖ Near realtime monitoring 
❖ Routing, aggregation, correlation… 
❖ YOUR reasons ;)
Which way to take ? 
❖ The « 4 musketeers »
❖ Naemon 
❖ Icinga 2 
❖ Shinken 
❖ Centreon 
❖ Reboot from building blocks 
❖ Collect 
❖ Store 
❖ Visualize 
❖ Alert
Tools : Collecting metrics and 
messages 
❖ Packetbeat (metrics & messages) 
❖ Rsyslog, NXLog, Syslog-ng (messages)
❖ sFlow Toolkit, Host sFlow 
❖ Logstash-forwarder (messages) 
❖ Collectd (metrics) 
❖ Diamond (metrics) 
❖ OSquery, WMI (metrics) 
❖ Network level (sFlow) 
❖ System Level 
❖ Application Level
Tools : External collecting 
❖ End user perspective 
❖ Checks run as close as possible to the end user
❖ Application behavior 
❖ Real User Monitoring 
❖ Webpagetest 
❖ Selenium 
❖ PhantomasJS 
❖ Boomerang 
❖ Bucky
Tools : Routing metrics and messages 
❖ Messages : Logstash, Flume, Fluentd 
❖ Metrics : StatsD 
❖ Metrics : Carbon Relay NG 
One or more messages can fire an event
Tools : Databases 
❖ Graphite : The most used. 
❖ OpenTSDB : HBase 
❖ KairosDB : Cassandra 
❖ InfluxDB : The most promising ? 
❖ Elasticsearch : Index database
Tools : Visualizing 
metrics and messages 
❖ Kibana 
❖ Grafana 
❖ Dashboards collection
Tools : Alerting 
❖ Seyren : Alerting dashboard for 
Graphite. 
❖ Cabot : Get alerted when services 
go down or metrics go crazy 
❖ Bosun : An advanced, open-source 
monitoring and alerting system 
❖ Skyline : Real-time anomaly 
detection system 
❖ Oculus : Anomaly correlation 
component of Etsy's Kale system 
❖ Esper : Complex Event Processing
The French Monitoring Community 
Xperience 
❖ Reboot from building blocks 
❖ Collect 
❖ Store 
❖ Visualize 
❖ Alert
The French Monitoring Community 
Xperience 
Is it working ? What is not working ?
Collecting metrics : Collectd 
❖ InfluxDB Collectd proxy 
❖ In Golang like InfluxDB 
❖ Temporary solution 
❖ Native Collectd plugin 
LoadPlugin network
<Plugin network>
  # proxy address
  Server "127.0.0.1" "8096"
</Plugin>
❖ PHP5-FPM metrics 
❖ Nginx metrics 
❖ MariaDB metrics 
❖ System metrics 
❖ <metricname>:<value>|<type>
Collecting messages : Rsyslog 
❖ Nearly ready log consumption 
❖ Native distribution package 
❖ Nginx Log, MySQL slow query 
log 
template(name="ls_json"
         type="list"
         option.json="on") {
  constant(value="{")
  constant(value="\"@timestamp\":\"")     property(name="timereported" dateFormat="rfc3339")
  constant(value="\",\"@version\":\"1")
  constant(value="\",\"message\":\"")     property(name="msg")
  constant(value="\",\"host\":\"")        property(name="hostname")
  constant(value="\",\"severity\":\"")    property(name="syslogseverity-text")
  constant(value="\",\"facility\":\"")    property(name="syslogfacility-text")
  constant(value="\",\"programname\":\"") property(name="programname")
  constant(value="\",\"procid\":\"")      property(name="procid")
  constant(value="\"}\n")
}
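To make the shape of the emitted log line easier to see, here is a minimal Python sketch that builds the same JSON document the template produces. The function name `ls_json` mirrors the template name; the sample field values used in the usage note are invented for illustration and do not come from the talk.

```python
import json


def ls_json(timereported, msg, hostname, severity, facility, programname, procid):
    """Build one JSON log line with the same keys as the rsyslog template."""
    doc = {
        "@timestamp": timereported,   # RFC 3339 timestamp, as in dateFormat="rfc3339"
        "@version": "1",
        "message": msg,
        "host": hostname,
        "severity": severity,         # syslogseverity-text, e.g. "info"
        "facility": facility,         # syslogfacility-text, e.g. "daemon"
        "programname": programname,
        "procid": procid,
    }
    # The template ends each record with a newline.
    return json.dumps(doc) + "\n"
```

Calling `ls_json("2014-11-19T10:00:00+01:00", "GET / 200", "web01", "info", "daemon", "nginx", "1234")` yields a single line that a JSON codec on the receiving side can decode directly.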
Collecting @ network level : Packetbeat 
❖ Specific agent 
❖ Collect traffic for 
❖ HTTP 
❖ MySQL 
❖ PostgreSQL 
❖ Redis
Routing messages : Logstash 
❖ Inputs 
❖ Codecs/filters 
❖ Outputs 
input {
  udp {
    port => 10514
    codec => "json"
    type => "syslog"
  }
}
filter {
  # This replaces the host field with the host that generated the message (sysloghost)
  if [sysloghost] {
    mutate {
      replace => [ "host", "%{sysloghost}" ]
      remove_field => "sysloghost"
    }
  }
}
output {
  elasticsearch { host => localhost }
}
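The effect of the filter stage can be sketched in plain Python. This is only an illustration of the logic on a dictionary-shaped event, not how Logstash implements it; the function name `apply_filter` and the sample event are hypothetical.

```python
def apply_filter(event):
    """Mimic the filter above: if sysloghost is set, it replaces host and is removed."""
    out = dict(event)  # work on a copy so the caller's event is untouched
    if out.get("sysloghost"):
        out["host"] = out.pop("sysloghost")
    return out


# Hypothetical event as it might arrive on the UDP input.
sample = {"host": "127.0.0.1", "sysloghost": "web01", "message": "GET / 200"}
filtered = apply_filter(sample)
```

Here `filtered` carries `web01` in `host` and no longer has a `sysloghost` key; events without `sysloghost` pass through unchanged.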
Routing metrics : 
StatsD 
❖ Is now a protocol implemented 
in all languages 
❖ InfluxDB plugin 
❖ Collectd can behave as a StatsD daemon (statsd plugin)
❖ Very easy to push metrics 
echo "foo:1|c" | nc -u -w0 127.0.0.1 8125
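The same push can be done from application code. The sketch below assumes a StatsD-compatible daemon listening on UDP 127.0.0.1:8125, as in the `nc` command; the helper names `format_metric` and `send_metric` are made up for this example.

```python
import socket


def format_metric(name, value, metric_type="c"):
    """Render one metric in the StatsD line format name:value|type."""
    return f"{name}:{value}|{metric_type}"


def send_metric(name, value, metric_type="c", host="127.0.0.1", port=8125):
    """Send a single metric as one UDP datagram (fire and forget)."""
    payload = format_metric(name, value, metric_type).encode("ascii")
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.sendto(payload, (host, port))
    return payload
```

`send_metric("foo", 1)` sends the same `foo:1|c` counter increment as the shell one-liner. Because the transport is UDP, the call does not wait for, or confirm, delivery.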
Storing metrics : InfluxDB 
❖ Make it behave like Graphite 
❖ graphite-api 
❖ carbon-relay-ng 
❖ graphite-influxdb 
❖ Cluster, cluster, cluster 
❖ Designed for events and metrics
Storing messages : Elasticsearch 
❖ Index database 
❖ Cluster, cluster, cluster 
❖ Full text search
Visualizing @ network level : Packetbeat 
❖ Modified version of Kibana 3
❖ Dashboards ready out of the box
Visualizing metrics : Grafana 
❖ Compatible 
❖ Graphite 
❖ InfluxDB 
❖ OpenTSDB 
❖ Built on Kibana 3
Visualizing messages : Kibana 4 
❖ Easy install 
❖ Interactive dashboards 
❖ Multiple indices
What's missing ? Wishes 
❖ Alerting 
❖ External monitoring 
❖ Repository for dashboards… 
❖ Giving sense to metrics and 
messages
Alerting reboot 
❖ Alert only on end user problems from an end 
user perspective 
❖ IRC, Chat channel… 
❖ Alert thresholds based on history vs static 
thresholds 
❖ Statistics functions 
❖ Boolean conditions 
❖ Dynamic thresholds 
❖ Anomaly detection 
❖ Standard deviation
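As a rough illustration of a history-based threshold, the sketch below flags a value that deviates from the mean of recent history by more than `k` standard deviations. The function names, the default `k=3.0`, and the sample numbers are assumptions chosen for the example; none of the tools listed in the talk is claimed to work exactly this way.

```python
from statistics import mean, pstdev


def dynamic_bounds(history, k=3.0):
    """Return (low, high) bounds at k standard deviations around the historical mean."""
    center = mean(history)
    spread = k * pstdev(history)
    return center - spread, center + spread


def is_anomalous(history, value, k=3.0):
    """True when value falls outside the bounds derived from history."""
    low, high = dynamic_bounds(history, k)
    return value < low or value > high


# Hypothetical response times in milliseconds.
history = [100, 102, 98, 101, 99]
```

With this history, a reading of 150 is flagged while 103 is not, whereas a static threshold would need manual retuning whenever the normal level shifts.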
Coming from Nagios 
❖ Graphios injects perfdata into Graphite or InfluxDB
❖ check_graphite can query the Graphite API from Nagios to alert based on history
❖ Logstash can send events to NSCA
❖ Nagios log in Kibana with Grok %{NAGIOSLINE} 
❖ Keep Nagios for states ?
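To show what injecting perfdata into Graphite amounts to, here is a minimal sketch that turns a Nagios perfdata string into lines of Graphite's plaintext protocol (`path value timestamp`). It is not Graphios' own code: the function name, the regular expression, and the `nagios.web01.http` prefix are assumptions for illustration, and thresholds and units are simply dropped.

```python
import re
import time

# Matches label=value at the start of each perfdata item; labels may be single-quoted.
PERF_RE = re.compile(
    r"(?:'(?P<quoted>[^']+)'|(?P<bare>[^\s=']+))=(?P<value>-?\d+(?:\.\d+)?)"
)


def perfdata_to_graphite(prefix, perfdata, timestamp=None):
    """Convert a Nagios perfdata string into Graphite plaintext lines."""
    ts = int(time.time()) if timestamp is None else int(timestamp)
    lines = []
    for match in PERF_RE.finditer(perfdata):
        label = match.group("quoted") or match.group("bare")
        # Replace characters that would break the dotted metric path.
        safe = re.sub(r"[^A-Za-z0-9_-]", "_", label)
        lines.append(f"{prefix}.{safe} {match.group('value')} {ts}")
    return lines


# Hypothetical perfdata from an HTTP check.
lines = perfdata_to_graphite(
    "nagios.web01.http", "time=0.012s;1.0;2.0;0 size=1024B;;;0", 1400000000
)
```

Each resulting line can be written, newline-terminated, to a Carbon listener to store the value.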
Questions ? 
@olivjan 
ojan@monitoring-fr.org

More Related Content

What's hot

Welcome Icinga Camp San Francisco 2014
Welcome Icinga Camp San Francisco 2014Welcome Icinga Camp San Francisco 2014
Welcome Icinga Camp San Francisco 2014Icinga
 
Icinga 2012 Development at 6th TF-NOC Meeting
Icinga 2012 Development at 6th TF-NOC MeetingIcinga 2012 Development at 6th TF-NOC Meeting
Icinga 2012 Development at 6th TF-NOC MeetingIcinga
 
Icinga Camp Antwerp - Current State of Icinga
Icinga Camp Antwerp - Current State of IcingaIcinga Camp Antwerp - Current State of Icinga
Icinga Camp Antwerp - Current State of IcingaIcinga
 
Icinga 2 at Icinga Camp San Francisco
Icinga 2 at Icinga Camp San FranciscoIcinga 2 at Icinga Camp San Francisco
Icinga 2 at Icinga Camp San FranciscoIcinga
 
Icinga 2011 at Nagios Workshop
Icinga 2011 at Nagios WorkshopIcinga 2011 at Nagios Workshop
Icinga 2011 at Nagios WorkshopIcinga
 
Icinga 1, Icinga 2 @ FrOSCon 2014
Icinga 1, Icinga 2 @ FrOSCon 2014Icinga 1, Icinga 2 @ FrOSCon 2014
Icinga 1, Icinga 2 @ FrOSCon 2014Icinga
 
Icinga 2 API @ Icinga Camp Portland at Puppetlabs
Icinga 2 API @ Icinga Camp Portland at PuppetlabsIcinga 2 API @ Icinga Camp Portland at Puppetlabs
Icinga 2 API @ Icinga Camp Portland at PuppetlabsIcinga
 
Why favour Icinga over Nagios @ OSDC 2015
Why favour Icinga over Nagios @ OSDC 2015Why favour Icinga over Nagios @ OSDC 2015
Why favour Icinga over Nagios @ OSDC 2015Icinga
 
Icinga Camp Barcelona - Current State of Icinga
Icinga Camp Barcelona - Current State of IcingaIcinga Camp Barcelona - Current State of Icinga
Icinga Camp Barcelona - Current State of IcingaIcinga
 
Icinga Camp Antwerp - Icinga2 Cluster
Icinga Camp Antwerp - Icinga2 ClusterIcinga Camp Antwerp - Icinga2 Cluster
Icinga Camp Antwerp - Icinga2 ClusterIcinga
 
Icinga Camp Barcelona - Icinga
Icinga Camp Barcelona - IcingaIcinga Camp Barcelona - Icinga
Icinga Camp Barcelona - IcingaIcinga
 
Why favor Icinga over Nagios @ DebConf15
Why favor Icinga over Nagios @ DebConf15Why favor Icinga over Nagios @ DebConf15
Why favor Icinga over Nagios @ DebConf15Icinga
 
Icinga @CLT 2013
Icinga @CLT 2013Icinga @CLT 2013
Icinga @CLT 2013Icinga
 
Icinga lsm 2015 copy
Icinga lsm 2015 copyIcinga lsm 2015 copy
Icinga lsm 2015 copyNETWAYS
 
Icinga @ OSMC 2014
Icinga @ OSMC 2014Icinga @ OSMC 2014
Icinga @ OSMC 2014Icinga
 
Why favour Icinga over Nagios - Rootconf 2015
Why favour Icinga over Nagios - Rootconf 2015Why favour Icinga over Nagios - Rootconf 2015
Why favour Icinga over Nagios - Rootconf 2015Icinga
 
Icinga 2010 at CeBIT
Icinga 2010 at CeBITIcinga 2010 at CeBIT
Icinga 2010 at CeBITIcinga
 
Icinga 2 @ SIG-NOC Meeting
Icinga 2 @ SIG-NOC MeetingIcinga 2 @ SIG-NOC Meeting
Icinga 2 @ SIG-NOC MeetingIcinga
 
Monitor OpenStack Environments from the bottom up and front to back
Monitor OpenStack Environments from the bottom up and front to backMonitor OpenStack Environments from the bottom up and front to back
Monitor OpenStack Environments from the bottom up and front to backIcinga
 
Why favour Icinga over Nagios @ FrOSCon 2015
Why favour Icinga over Nagios @ FrOSCon 2015Why favour Icinga over Nagios @ FrOSCon 2015
Why favour Icinga over Nagios @ FrOSCon 2015Icinga
 

What's hot (20)

Welcome Icinga Camp San Francisco 2014
Welcome Icinga Camp San Francisco 2014Welcome Icinga Camp San Francisco 2014
Welcome Icinga Camp San Francisco 2014
 
Icinga 2012 Development at 6th TF-NOC Meeting
Icinga 2012 Development at 6th TF-NOC MeetingIcinga 2012 Development at 6th TF-NOC Meeting
Icinga 2012 Development at 6th TF-NOC Meeting
 
Icinga Camp Antwerp - Current State of Icinga
Icinga Camp Antwerp - Current State of IcingaIcinga Camp Antwerp - Current State of Icinga
Icinga Camp Antwerp - Current State of Icinga
 
Icinga 2 at Icinga Camp San Francisco
Icinga 2 at Icinga Camp San FranciscoIcinga 2 at Icinga Camp San Francisco
Icinga 2 at Icinga Camp San Francisco
 
Icinga 2011 at Nagios Workshop
Icinga 2011 at Nagios WorkshopIcinga 2011 at Nagios Workshop
Icinga 2011 at Nagios Workshop
 
Icinga 1, Icinga 2 @ FrOSCon 2014
Icinga 1, Icinga 2 @ FrOSCon 2014Icinga 1, Icinga 2 @ FrOSCon 2014
Icinga 1, Icinga 2 @ FrOSCon 2014
 
Icinga 2 API @ Icinga Camp Portland at Puppetlabs
Icinga 2 API @ Icinga Camp Portland at PuppetlabsIcinga 2 API @ Icinga Camp Portland at Puppetlabs
Icinga 2 API @ Icinga Camp Portland at Puppetlabs
 
Why favour Icinga over Nagios @ OSDC 2015
Why favour Icinga over Nagios @ OSDC 2015Why favour Icinga over Nagios @ OSDC 2015
Why favour Icinga over Nagios @ OSDC 2015
 
Icinga Camp Barcelona - Current State of Icinga
Icinga Camp Barcelona - Current State of IcingaIcinga Camp Barcelona - Current State of Icinga
Icinga Camp Barcelona - Current State of Icinga
 
Icinga Camp Antwerp - Icinga2 Cluster
Icinga Camp Antwerp - Icinga2 ClusterIcinga Camp Antwerp - Icinga2 Cluster
Icinga Camp Antwerp - Icinga2 Cluster
 
Icinga Camp Barcelona - Icinga
Icinga Camp Barcelona - IcingaIcinga Camp Barcelona - Icinga
Icinga Camp Barcelona - Icinga
 
Why favor Icinga over Nagios @ DebConf15
Why favor Icinga over Nagios @ DebConf15Why favor Icinga over Nagios @ DebConf15
Why favor Icinga over Nagios @ DebConf15
 
Icinga @CLT 2013
Icinga @CLT 2013Icinga @CLT 2013
Icinga @CLT 2013
 
Icinga lsm 2015 copy
Icinga lsm 2015 copyIcinga lsm 2015 copy
Icinga lsm 2015 copy
 
Icinga @ OSMC 2014
Icinga @ OSMC 2014Icinga @ OSMC 2014
Icinga @ OSMC 2014
 
Why favour Icinga over Nagios - Rootconf 2015
Why favour Icinga over Nagios - Rootconf 2015Why favour Icinga over Nagios - Rootconf 2015
Why favour Icinga over Nagios - Rootconf 2015
 
Icinga 2010 at CeBIT
Icinga 2010 at CeBITIcinga 2010 at CeBIT
Icinga 2010 at CeBIT
 
Icinga 2 @ SIG-NOC Meeting
Icinga 2 @ SIG-NOC MeetingIcinga 2 @ SIG-NOC Meeting
Icinga 2 @ SIG-NOC Meeting
 
Monitor OpenStack Environments from the bottom up and front to back
Monitor OpenStack Environments from the bottom up and front to backMonitor OpenStack Environments from the bottom up and front to back
Monitor OpenStack Environments from the bottom up and front to back
 
Why favour Icinga over Nagios @ FrOSCon 2015
Why favour Icinga over Nagios @ FrOSCon 2015Why favour Icinga over Nagios @ FrOSCon 2015
Why favour Icinga over Nagios @ FrOSCon 2015
 

Viewers also liked

Open Source Monitoring in 2014, from #monitoringssucks to #monitoringlove and...
Open Source Monitoring in 2014, from #monitoringssucks to #monitoringlove and...Open Source Monitoring in 2014, from #monitoringssucks to #monitoringlove and...
Open Source Monitoring in 2014, from #monitoringssucks to #monitoringlove and...Kris Buytaert
 
Stop using Nagios (so it can die peacefully)
Stop using Nagios (so it can die peacefully)Stop using Nagios (so it can die peacefully)
Stop using Nagios (so it can die peacefully)Andy Sykes
 
Monitoring Workshop Kiel 2016 - Performancedaten Visualisierung mit Grafana /...
Monitoring Workshop Kiel 2016 - Performancedaten Visualisierung mit Grafana /...Monitoring Workshop Kiel 2016 - Performancedaten Visualisierung mit Grafana /...
Monitoring Workshop Kiel 2016 - Performancedaten Visualisierung mit Grafana /...Philip Griesbacher
 
Open Source Monitoring Tools Shootout
Open Source Monitoring Tools ShootoutOpen Source Monitoring Tools Shootout
Open Source Monitoring Tools Shootouttomdc
 
Grafana and MySQL - Benefits and Challenges
Grafana and MySQL - Benefits and ChallengesGrafana and MySQL - Benefits and Challenges
Grafana and MySQL - Benefits and ChallengesPhilip Wernersbach
 
Alerting in Grafana, Grafanacon 2015
Alerting in Grafana, Grafanacon 2015Alerting in Grafana, Grafanacon 2015
Alerting in Grafana, Grafanacon 2015Dieter Plaetinck
 
Velocity 2015 linux perf tools
Velocity 2015 linux perf toolsVelocity 2015 linux perf tools
Velocity 2015 linux perf toolsBrendan Gregg
 
SPOF - Single "Person" of Failure
SPOF - Single "Person" of FailureSPOF - Single "Person" of Failure
SPOF - Single "Person" of FailureSasha Rosenbaum
 
Chaos patterns - architecting for failure in distributed systems
Chaos patterns - architecting for failure in distributed systemsChaos patterns - architecting for failure in distributed systems
Chaos patterns - architecting for failure in distributed systemsJos Boumans
 
Un-broken Logging - Operability.io 2015 - Matthew Skelton
Un-broken Logging - Operability.io 2015 - Matthew SkeltonUn-broken Logging - Operability.io 2015 - Matthew Skelton
Un-broken Logging - Operability.io 2015 - Matthew SkeltonSkelton Thatcher Consulting Ltd
 
Monitoring Is Never Done
Monitoring Is Never DoneMonitoring Is Never Done
Monitoring Is Never DoneMelanie Cey
 
Devops and Immutable infrastructure - Cloud Expo 2015 NYC
Devops and Immutable infrastructure  - Cloud Expo 2015 NYCDevops and Immutable infrastructure  - Cloud Expo 2015 NYC
Devops and Immutable infrastructure - Cloud Expo 2015 NYCJohn Willis
 
What is Nagios XI and how is it different from Nagios Core
What is Nagios XI and how is it different from Nagios CoreWhat is Nagios XI and how is it different from Nagios Core
What is Nagios XI and how is it different from Nagios CoreSanjay Willie
 
Open Source Monitoring Tools
Open Source Monitoring ToolsOpen Source Monitoring Tools
Open Source Monitoring Toolsm_richardson
 
Production testing through monitoring
Production testing through monitoringProduction testing through monitoring
Production testing through monitoringLeon Fayer
 
Continuous Delivery Tools Collaboration Conways Law - QCon London - Matthew S...
Continuous Delivery Tools Collaboration Conways Law - QCon London - Matthew S...Continuous Delivery Tools Collaboration Conways Law - QCon London - Matthew S...
Continuous Delivery Tools Collaboration Conways Law - QCon London - Matthew S...Skelton Thatcher Consulting Ltd
 
DevOps Kaizen: Practical Steps to Start & Sustain a Transformation
DevOps Kaizen: Practical Steps to Start & Sustain a TransformationDevOps Kaizen: Practical Steps to Start & Sustain a Transformation
DevOps Kaizen: Practical Steps to Start & Sustain a Transformationdev2ops
 
Monitoring with Nagios and Ganglia
Monitoring with Nagios and GangliaMonitoring with Nagios and Ganglia
Monitoring with Nagios and GangliaMaciej Lasyk
 
Primum non nocere - Ethical Obligations in Internet Operations
Primum non nocere - Ethical Obligations in Internet OperationsPrimum non nocere - Ethical Obligations in Internet Operations
Primum non nocere - Ethical Obligations in Internet OperationsJan Schaumann
 

Viewers also liked (20)

Open Source Monitoring in 2014, from #monitoringssucks to #monitoringlove and...
Open Source Monitoring in 2014, from #monitoringssucks to #monitoringlove and...Open Source Monitoring in 2014, from #monitoringssucks to #monitoringlove and...
Open Source Monitoring in 2014, from #monitoringssucks to #monitoringlove and...
 
Stop using Nagios (so it can die peacefully)
Stop using Nagios (so it can die peacefully)Stop using Nagios (so it can die peacefully)
Stop using Nagios (so it can die peacefully)
 
Monitoring Workshop Kiel 2016 - Performancedaten Visualisierung mit Grafana /...
Monitoring Workshop Kiel 2016 - Performancedaten Visualisierung mit Grafana /...Monitoring Workshop Kiel 2016 - Performancedaten Visualisierung mit Grafana /...
Monitoring Workshop Kiel 2016 - Performancedaten Visualisierung mit Grafana /...
 
Open Source Monitoring Tools Shootout
Open Source Monitoring Tools ShootoutOpen Source Monitoring Tools Shootout
Open Source Monitoring Tools Shootout
 
Grafana and MySQL - Benefits and Challenges
Grafana and MySQL - Benefits and ChallengesGrafana and MySQL - Benefits and Challenges
Grafana and MySQL - Benefits and Challenges
 
Alerting in Grafana, Grafanacon 2015
Alerting in Grafana, Grafanacon 2015Alerting in Grafana, Grafanacon 2015
Alerting in Grafana, Grafanacon 2015
 
Velocity 2015 linux perf tools
Velocity 2015 linux perf toolsVelocity 2015 linux perf tools
Velocity 2015 linux perf tools
 
SPOF - Single "Person" of Failure
SPOF - Single "Person" of FailureSPOF - Single "Person" of Failure
SPOF - Single "Person" of Failure
 
Chaos patterns - architecting for failure in distributed systems
Chaos patterns - architecting for failure in distributed systemsChaos patterns - architecting for failure in distributed systems
Chaos patterns - architecting for failure in distributed systems
 
Un-broken Logging - Operability.io 2015 - Matthew Skelton
Un-broken Logging - Operability.io 2015 - Matthew SkeltonUn-broken Logging - Operability.io 2015 - Matthew Skelton
Un-broken Logging - Operability.io 2015 - Matthew Skelton
 
Monitoring Is Never Done
Monitoring Is Never DoneMonitoring Is Never Done
Monitoring Is Never Done
 
Devops and Immutable infrastructure - Cloud Expo 2015 NYC
Devops and Immutable infrastructure  - Cloud Expo 2015 NYCDevops and Immutable infrastructure  - Cloud Expo 2015 NYC
Devops and Immutable infrastructure - Cloud Expo 2015 NYC
 
What is Nagios XI and how is it different from Nagios Core
What is Nagios XI and how is it different from Nagios CoreWhat is Nagios XI and how is it different from Nagios Core
What is Nagios XI and how is it different from Nagios Core
 
Open Source Monitoring Tools
Open Source Monitoring ToolsOpen Source Monitoring Tools
Open Source Monitoring Tools
 
Production testing through monitoring
Production testing through monitoringProduction testing through monitoring
Production testing through monitoring
 
Continuous Delivery Tools Collaboration Conways Law - QCon London - Matthew S...
Continuous Delivery Tools Collaboration Conways Law - QCon London - Matthew S...Continuous Delivery Tools Collaboration Conways Law - QCon London - Matthew S...
Continuous Delivery Tools Collaboration Conways Law - QCon London - Matthew S...
 
Monitoring the #DevOps way
Monitoring the #DevOps wayMonitoring the #DevOps way
Monitoring the #DevOps way
 
DevOps Kaizen: Practical Steps to Start & Sustain a Transformation
DevOps Kaizen: Practical Steps to Start & Sustain a TransformationDevOps Kaizen: Practical Steps to Start & Sustain a Transformation
DevOps Kaizen: Practical Steps to Start & Sustain a Transformation
 
Monitoring with Nagios and Ganglia
Monitoring with Nagios and GangliaMonitoring with Nagios and Ganglia
Monitoring with Nagios and Ganglia
 
Primum non nocere - Ethical Obligations in Internet Operations
Primum non nocere - Ethical Obligations in Internet OperationsPrimum non nocere - Ethical Obligations in Internet Operations
Primum non nocere - Ethical Obligations in Internet Operations
 

Similar to Time to say goodbye to your Nagios based setup

OSMC 2014 | Time to say goodbye to your Nagios based setup? by Oliver Jan
OSMC 2014 | Time to say goodbye to your Nagios based setup? by Oliver JanOSMC 2014 | Time to say goodbye to your Nagios based setup? by Oliver Jan
OSMC 2014 | Time to say goodbye to your Nagios based setup? by Oliver JanNETWAYS
 
Experiences with Microservices at Tuenti
Experiences with Microservices at TuentiExperiences with Microservices at Tuenti
Experiences with Microservices at TuentiAndrés Viedma Peláez
 
Kentik Network@Scale (Dan Ellis)
Kentik Network@Scale (Dan Ellis)Kentik Network@Scale (Dan Ellis)
Kentik Network@Scale (Dan Ellis)gvillain
 
H2020 finsec-ibm- aidan-shribman-finsec-skydive 260820
H2020 finsec-ibm- aidan-shribman-finsec-skydive 260820H2020 finsec-ibm- aidan-shribman-finsec-skydive 260820
H2020 finsec-ibm- aidan-shribman-finsec-skydive 260820innov-acts-ltd
 
Streaming Data Ingest and Processing with Apache Kafka
Streaming Data Ingest and Processing with Apache KafkaStreaming Data Ingest and Processing with Apache Kafka
Streaming Data Ingest and Processing with Apache KafkaAttunity
 
Thinking DevOps in the era of the Cloud - Demi Ben-Ari
Thinking DevOps in the era of the Cloud - Demi Ben-AriThinking DevOps in the era of the Cloud - Demi Ben-Ari
Thinking DevOps in the era of the Cloud - Demi Ben-AriDemi Ben-Ari
 
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Milan 2017 - D...
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Milan 2017 - D...Monitoring Big Data Systems Done "The Simple Way" - Codemotion Milan 2017 - D...
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Milan 2017 - D...Demi Ben-Ari
 
Demi Ben-Ari - Monitoring Big Data Systems Done "The Simple Way" - Codemotion...
Demi Ben-Ari - Monitoring Big Data Systems Done "The Simple Way" - Codemotion...Demi Ben-Ari - Monitoring Big Data Systems Done "The Simple Way" - Codemotion...
Demi Ben-Ari - Monitoring Big Data Systems Done "The Simple Way" - Codemotion...Codemotion
 
Cloud lunch and learn real-time streaming in azure
Cloud lunch and learn real-time streaming in azureCloud lunch and learn real-time streaming in azure
Cloud lunch and learn real-time streaming in azureTimothy Spann
 
Distributed tracing with erlang/elixir
Distributed tracing with erlang/elixirDistributed tracing with erlang/elixir
Distributed tracing with erlang/elixirIvan Glushkov
 
QNIBTerminal: Understand your datacenter by overlaying multiple information l...
QNIBTerminal: Understand your datacenter by overlaying multiple information l...QNIBTerminal: Understand your datacenter by overlaying multiple information l...
QNIBTerminal: Understand your datacenter by overlaying multiple information l...QNIB Solutions
 
Music city data Hail Hydrate! from stream to lake
Music city data Hail Hydrate! from stream to lakeMusic city data Hail Hydrate! from stream to lake
Music city data Hail Hydrate! from stream to lakeTimothy Spann
 
DC612 Day - Hands on Penetration Testing 101
DC612 Day - Hands on Penetration Testing 101DC612 Day - Hands on Penetration Testing 101
DC612 Day - Hands on Penetration Testing 101dc612
 
OSMC 2014: Interesting use cases of Zabbix improvements in latest versions | ...
OSMC 2014: Interesting use cases of Zabbix improvements in latest versions | ...OSMC 2014: Interesting use cases of Zabbix improvements in latest versions | ...
OSMC 2014: Interesting use cases of Zabbix improvements in latest versions | ...NETWAYS
 
Logs aggregation and analysis
Logs aggregation and analysisLogs aggregation and analysis
Logs aggregation and analysisDivante
 
Fluentd at HKOScon
Fluentd at HKOSconFluentd at HKOScon
Fluentd at HKOSconN Masahiro
 
Rock Solid Deployment of Web Applications
Rock Solid Deployment of Web ApplicationsRock Solid Deployment of Web Applications
Rock Solid Deployment of Web ApplicationsPablo Godel
 
Delivering the power of data using Spring Cloud DataFlow and DataStax Enterpr...
Delivering the power of data using Spring Cloud DataFlow and DataStax Enterpr...Delivering the power of data using Spring Cloud DataFlow and DataStax Enterpr...
Delivering the power of data using Spring Cloud DataFlow and DataStax Enterpr...VMware Tanzu
 
Using Event Streams in Serverless Applications
Using Event Streams in Serverless ApplicationsUsing Event Streams in Serverless Applications
Using Event Streams in Serverless ApplicationsJonathan Dee
 

Similar to Time to say goodbye to your Nagios based setup (20)

OSMC 2014 | Time to say goodbye to your Nagios based setup? by Oliver Jan
OSMC 2014 | Time to say goodbye to your Nagios based setup? by Oliver JanOSMC 2014 | Time to say goodbye to your Nagios based setup? by Oliver Jan
OSMC 2014 | Time to say goodbye to your Nagios based setup? by Oliver Jan
 
Experiences with Microservices at Tuenti
Experiences with Microservices at TuentiExperiences with Microservices at Tuenti
Experiences with Microservices at Tuenti
 
Apache Spark Streaming
Apache Spark StreamingApache Spark Streaming
Apache Spark Streaming
 
Kentik Network@Scale (Dan Ellis)
Kentik Network@Scale (Dan Ellis)Kentik Network@Scale (Dan Ellis)
Kentik Network@Scale (Dan Ellis)
 
H2020 finsec-ibm- aidan-shribman-finsec-skydive 260820
H2020 finsec-ibm- aidan-shribman-finsec-skydive 260820H2020 finsec-ibm- aidan-shribman-finsec-skydive 260820
H2020 finsec-ibm- aidan-shribman-finsec-skydive 260820
 
Streaming Data Ingest and Processing with Apache Kafka
Streaming Data Ingest and Processing with Apache KafkaStreaming Data Ingest and Processing with Apache Kafka
Streaming Data Ingest and Processing with Apache Kafka
 
Thinking DevOps in the era of the Cloud - Demi Ben-Ari
Thinking DevOps in the era of the Cloud - Demi Ben-AriThinking DevOps in the era of the Cloud - Demi Ben-Ari
Thinking DevOps in the era of the Cloud - Demi Ben-Ari
 
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Milan 2017 - D...
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Milan 2017 - D...Monitoring Big Data Systems Done "The Simple Way" - Codemotion Milan 2017 - D...
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Milan 2017 - D...
 
Demi Ben-Ari - Monitoring Big Data Systems Done "The Simple Way" - Codemotion...
Demi Ben-Ari - Monitoring Big Data Systems Done "The Simple Way" - Codemotion...Demi Ben-Ari - Monitoring Big Data Systems Done "The Simple Way" - Codemotion...
Demi Ben-Ari - Monitoring Big Data Systems Done "The Simple Way" - Codemotion...
 
Cloud lunch and learn real-time streaming in azure
Cloud lunch and learn real-time streaming in azureCloud lunch and learn real-time streaming in azure
Cloud lunch and learn real-time streaming in azure
 
Distributed tracing with erlang/elixir
Distributed tracing with erlang/elixirDistributed tracing with erlang/elixir
Distributed tracing with erlang/elixir
 
QNIBTerminal: Understand your datacenter by overlaying multiple information l...
QNIBTerminal: Understand your datacenter by overlaying multiple information l...QNIBTerminal: Understand your datacenter by overlaying multiple information l...
QNIBTerminal: Understand your datacenter by overlaying multiple information l...
 
Music city data Hail Hydrate! from stream to lake
Music city data Hail Hydrate! from stream to lakeMusic city data Hail Hydrate! from stream to lake
Music city data Hail Hydrate! from stream to lake
 
DC612 Day - Hands on Penetration Testing 101
DC612 Day - Hands on Penetration Testing 101DC612 Day - Hands on Penetration Testing 101
DC612 Day - Hands on Penetration Testing 101
 
OSMC 2014: Interesting use cases of Zabbix improvements in latest versions | ...
OSMC 2014: Interesting use cases of Zabbix improvements in latest versions | ...OSMC 2014: Interesting use cases of Zabbix improvements in latest versions | ...
OSMC 2014: Interesting use cases of Zabbix improvements in latest versions | ...
 
Logs aggregation and analysis
Logs aggregation and analysisLogs aggregation and analysis
Logs aggregation and analysis
 
Fluentd at HKOScon
Fluentd at HKOSconFluentd at HKOScon
Fluentd at HKOScon
 
Rock Solid Deployment of Web Applications
Rock Solid Deployment of Web ApplicationsRock Solid Deployment of Web Applications
Rock Solid Deployment of Web Applications
 
Delivering the power of data using Spring Cloud DataFlow and DataStax Enterpr...
Delivering the power of data using Spring Cloud DataFlow and DataStax Enterpr...Delivering the power of data using Spring Cloud DataFlow and DataStax Enterpr...
Delivering the power of data using Spring Cloud DataFlow and DataStax Enterpr...
 
Using Event Streams in Serverless Applications
Using Event Streams in Serverless ApplicationsUsing Event Streams in Serverless Applications
Using Event Streams in Serverless Applications
 

Recently uploaded

EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 

Time to say goodbye to your Nagios based setup

  • 1. So you want to switch off ? Time to say goodbye to your Nagios based setup! @olivjan - ojan@monitoring-fr.org © 2014 - Olivier Jan - Check my Website
  • 2. About me ❖ System admin and architect ❖ Co-founder of « Communauté Francophone de la Supervision Libre » ❖ Writer of the book « Nagios 3 au coeur de la supervision Open Source » ❖ Co-founder of Check my Website, a SaaS service for remote monitoring of websites and applications (current)
  • 3. Content ❖ Why switch off ? the good and maybe not so good reasons to do so ! ❖ Which way to take ? ❖ Building a monitoring solution without Nagios : ❖ Tools available ❖ A personal work in progress ❖ Migrating from Nagios to this kind of solution
  • 4. Some reasons to switch off… ❖ The godfather of OSS monitoring is dead as an Open Source project ? ❖ Can’t do better with it ❖ Cool new kids out there ❖ Better « cloud » support ❖ Clear states, metrics and messages monitoring distinction ❖ Better charting solution ❖ Near realtime monitoring ❖ Routing, aggregation, correlation… ❖ YOUR reasons ;)
  • 5. Which way to take ? ❖ The « 4 mousquetaires » ❖ Naemon ❖ Icinga 2 ❖ Shinken ❖ Centreon ❖ Reboot from building blocks ❖ Collect ❖ Store ❖ Visualize ❖ Alert
  • 6. Tools : Collecting metrics and messages ❖ Packetbeat (metrics & messages) ❖ Rsyslog, NXLog, Syslog-ng (messages) ❖ sFlow Toolkit, Host sFlow ❖ Logstash-forwarder (messages) ❖ Collectd (metrics) ❖ Diamond (metrics) ❖ osquery, WMI (metrics) ❖ Network level (sFlow) ❖ System Level ❖ Application Level
  • 7. Tools : External collecting ❖ End user perspective ❖ Controls done closest to the end-user ❖ Application behavior ❖ Real User Monitoring ❖ Webpagetest ❖ Selenium ❖ PhantomasJS ❖ Boomerang ❖ Bucky
  • 8. Tools : Routing metrics and messages ❖ Messages : Logstash, Flume, Fluentd ❖ Metrics : StatsD ❖ Metrics : Carbon Relay NG One or more messages can fire an event
  • 9. Tools : Databases ❖ Graphite : The most used. ❖ OpenTSDB : HBase ❖ KairosDB : Cassandra ❖ InfluxDB : The most promising ? ❖ Elasticsearch : Index database
  • 10. Tools : Visualizing metrics and messages ❖ Kibana ❖ Grafana ❖ Dashboards collection
  • 11. Tools : Alerting ❖ Seyren : Alerting dashboard for Graphite. ❖ Cabot : Get alerted when services go down or metrics go crazy ❖ Bosun : An advanced, open-source monitoring and alerting system ❖ Skyline : Real-time anomaly detection system ❖ Oculus : Anomaly correlation component of Etsy's Kale system ❖ Esper : Complex Event Processing
  • 12. The French Monitoring Community Xperience ❖ Reboot from building blocks ❖ Collect ❖ Store ❖ Visualize ❖ Alert
  • 13. The French Monitoring Community Xperience Is it working ? What is not working ?
  • 14. Collecting metrics : Collectd ❖ InfluxDB Collectd proxy ❖ In Golang like InfluxDB ❖ Temporary solution ❖ Native Collectd plugin
        LoadPlugin network
        <Plugin network>
          # proxy address
          Server "127.0.0.1" "8096"
        </Plugin>
        ❖ PHP5-FPM metrics ❖ Nginx metrics ❖ MariaDB metrics ❖ System metrics ❖ <metricname>:<value>|<type>
  • 15. Collecting messages : Rsyslog ❖ Nearly ready log consumption ❖ Native distribution package ❖ Nginx Log, MySQL slow query log
        template(name="ls_json" type="list" option.json="on") {
          constant(value="{")
          constant(value="\"@timestamp\":\"")
          property(name="timereported" dateFormat="rfc3339")
          constant(value="\",\"@version\":\"1")
          constant(value="\",\"message\":\"")
          property(name="msg")
          constant(value="\",\"host\":\"")
          property(name="hostname")
          constant(value="\",\"severity\":\"")
          property(name="syslogseverity-text")
          constant(value="\",\"facility\":\"")
          property(name="syslogfacility-text")
          constant(value="\",\"programname\":\"")
          property(name="programname")
          constant(value="\",\"procid\":\"")
          property(name="procid")
          constant(value="\"}\n")
        }
  • 16. Collecting @ network level : Packetbeat ❖ Specific agent ❖ Collect traffic for ❖ HTTP ❖ MySQL ❖ PostgreSQL ❖ Redis
  • 17. Routing messages : Logstash ❖ Inputs ❖ Codecs/filters ❖ Outputs
        input {
          udp {
            port => 10514
            codec => "json"
            type => "syslog"
          }
        }
        filter {
          # This replaces the host field with the host that generated the message (sysloghost)
          if [sysloghost] {
            mutate {
              replace => [ "host", "%{sysloghost}" ]
              remove_field => "sysloghost"
            }
          }
        }
        output {
          elasticsearch { host => localhost }
        }
  • 18. Routing metrics : StatsD ❖ Is now a protocol implemented in all languages ❖ InfluxDB plugin ❖ Collectd can behave as a StatsD daemon (plugin) ❖ Very easy to push metrics
        echo "foo:1|c" | nc -u -w0 127.0.0.1 8125
  • 19. Storing metrics : InfluxDB ❖ Make it behave like Graphite ❖ graphite-api ❖ carbon-relay-ng ❖ graphite-influxdb ❖ Cluster, cluster, cluster ❖ Designed for events and metrics
  • 20. Storing messages : Elasticsearch ❖ Index database ❖ Cluster, cluster, cluster ❖ Full text search
  • 21. Visualizing @ network level : Packetbeat ❖ Kibana 3 modified version ❖ Dashboards ready out of the box
  • 22. Visualizing metrics : Grafana ❖ Compatible ❖ Graphite ❖ InfluxDB ❖ OpenTSDB ❖ Built on Kibana 3
  • 23. Visualizing messages : Kibana 4 ❖ Easy install ❖ Interactive dashboards ❖ Multiple indices
  • 24. What's missing ? Wishes ❖ Alerting ❖ External monitoring ❖ Repository for dashboards… ❖ Giving sense to metrics and messages
  • 25. Alerting reboot ❖ Alert only on end user problems from an end user perspective ❖ IRC, Chat channel… ❖ Alert thresholds based on history vs static thresholds ❖ Statistics functions ❖ Boolean conditions ❖ Dynamic thresholds ❖ Anomaly detection ❖ Standard deviation
  • 26. Coming from Nagios ❖ Graphios will inject perfdata into Graphite or InfluxDB ❖ Check_graphite can query the Graphite API from Nagios for alerts based on history ❖ Logstash will send events to NSCA ❖ Nagios log in Kibana with Grok %{NAGIOSLOGLINE} ❖ Keep Nagios for states ?
  • 27. Questions ? @olivjan ojan@monitoring-fr.org
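
Slide 18 pushes a counter with `echo "foo:1|c" | nc -u -w0 127.0.0.1 8125`, using the `<metricname>:<value>|<type>` line format shown on slide 14. The same push can be sketched in Python with only the standard library. The function names `format_statsd` and `send_statsd` are illustrative and not part of any StatsD client library; the host and port defaults are taken from the slide and assume a StatsD-compatible daemon listening there.

```python
import socket


def format_statsd(name: str, value, metric_type: str) -> bytes:
    """Build one StatsD datagram in the <metricname>:<value>|<type> format."""
    return f"{name}:{value}|{metric_type}".encode("ascii")


def send_statsd(payload: bytes, host: str = "127.0.0.1", port: int = 8125) -> None:
    """Send the datagram over UDP (fire and forget, as with `nc -u`)."""
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.sendto(payload, (host, port))


if __name__ == "__main__":
    # Same counter increment as the shell one-liner on slide 18.
    send_statsd(format_statsd("foo", 1, "c"))
```

Because UDP is connectionless, the sender does not learn whether a daemon received the metric, which is part of why pushing metrics this way is cheap for the application.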

Editor's Notes

  1. Three years older since the last time you saw me here! Still that typical French accent in English.
  2. So these reasons, or your own, have convinced you to switch off. Welcome to the club! Graph from the dataloop.io blog.
  3. A question I asked myself. You can continue with the same godfather logic, but with much more community-friendly and powerful software. Athos, Porthos, Aramis, D'Artagnan. Heka, Riemann.io: metrics and messages. Sensu.
  4. I wanted to collect metrics, states and messages on all possible levels. The standard is the Graphite metric format. osquery: SQL-powered operating system instrumentation and analytics.
  5. Facette, Tasseo, Cubism, Gdash, Descartes, Giraffe, Grafene
  6. I wanted to monitor the monitoring-fr.org infrastructure in a new way. This is the classical website stack. Let's now see what I wanted to do and the tools that might fit. Internal and external monitoring; alert me when it's not available or degraded for end users; push first, poll if needed; relative monitoring; better « cloud » support; trends made 3 years ago; data-driven architecture.
  7. I wanted to monitor the monitoring-fr.org infrastructure in a new way. This is the classical website stack. Let's now see what I wanted to do and the tools that might fit. Internal and external monitoring; alert me when it's not available or degraded for end users; push first, poll if needed; relative monitoring; better « cloud » support; trends made 3 years ago; data-driven architecture.
  8. In fact, I also use a Packetbeat agent. PHP5-FPM metrics, Nginx metrics.
  9. Packetbeat reads and parses network protocols on the fly, correlating the requests with the responses and measuring the response times.
  10. A network daemon that runs on the Node.js platform and listens for statistics, like counters and timers, sent over UDP or TCP and sends aggregates to one or more pluggable backend services (e.g., Graphite).
  11. I wanted to collect metrics, states and events on all possible levels. The standard is the Graphite metric format. http://dieter.plaetinck.be/on-graphite-whisper-and-influxdb.html
  12. I wanted to collect metrics, states and events on all possible levels. The standard is the Graphite metric format.
  13. I wanted to monitor the monitoring-fr.org infrastructure in a new way. This is the classical website stack. Let's now see what I wanted to do and the tools that might fit. Having a repository to share dashboards, Logstash configurations…
  14. • Alert thresholds based on history vs static thresholds (or both combined) • Statistics functions: Min, Percentile, Median, Deviations, Forecasting. You can change the duration that these evaluate over (i.e. 5 minutes, 1 hour, 1 week?) • Scope-aware: How should components in your environment be grouped? By Host, subsystem, cluster, a combination of those things • Boolean conditions: The interaction of multiple components
  15. Not convinced to switch off? Run both side by side to explore.
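
The notes on alerting contrast static thresholds with thresholds derived from history, using statistics functions such as median, percentile and deviation over a chosen window. A minimal sketch of that idea follows, assuming the window is already available as a list of numeric samples. The function names, the nearest-rank percentile method and the default factor `k = 3` are illustrative choices, not something prescribed by the talk or by any of the tools it lists.

```python
import math
import statistics


def dynamic_threshold(history, k=3.0):
    """Threshold derived from history: mean plus k population standard deviations."""
    return statistics.fmean(history) + k * statistics.pstdev(history)


def is_anomalous(value, history, k=3.0):
    """True when the new sample exceeds the history-based threshold."""
    return value > dynamic_threshold(history, k)


def percentile(history, p):
    """Nearest-rank percentile of the window (p between 0 and 100)."""
    ordered = sorted(history)
    rank = max(1, math.ceil(p / 100 * len(ordered)))
    return ordered[rank - 1]


if __name__ == "__main__":
    # Hypothetical response times (ms) over the evaluation window.
    window = [100, 102, 98, 101, 99]
    print("median:", statistics.median(window))
    print("p95:", percentile(window, 95))
    print("threshold:", dynamic_threshold(window))
    print("150 ms anomalous?", is_anomalous(150, window))
```

Changing the length of `window` corresponds to changing the evaluation duration mentioned in the notes (5 minutes, 1 hour, 1 week); a static threshold can still be combined with the dynamic one through a simple boolean condition.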