Submit Search
Upload
Мониторинг. Опять, rootconf 2016
•
2 likes
•
587 views
Vsevolod Polyakov
Follow
Слайды с доклада на РИТ++ 2016, секция рутконф.
Read less
Read more
Engineering
Report
Share
Report
Share
1 of 113
Download now
Download to read offline
Recommended
Путь мониторинга 2.0 всё стало другим / Всеволод Поляков (Grammarly)
Путь мониторинга 2.0 всё стало другим / Всеволод Поляков (Grammarly)
Ontico
Всеволод Поляков (DevOps Team Lead в Grammarly)
Всеволод Поляков (DevOps Team Lead в Grammarly)
Provectus
"Metrics: Where and How", Vsevolod Polyakov
"Metrics: Where and How", Vsevolod Polyakov
Yulia Shcherbachova
Developing High Performance Application with Aerospike & Go
Developing High Performance Application with Aerospike & Go
Chris Stivers
Jvm & Garbage collection tuning for low latencies application
Jvm & Garbage collection tuning for low latencies application
Quentin Ambard
Gnocchi v4 (preview)
Gnocchi v4 (preview)
Gordon Chung
Gnocchi Profiling 2.1.x
Gnocchi Profiling 2.1.x
Gordon Chung
Gnocchi v3 brownbag
Gnocchi v3 brownbag
Gordon Chung
Recommended
Путь мониторинга 2.0 всё стало другим / Всеволод Поляков (Grammarly)
Путь мониторинга 2.0 всё стало другим / Всеволод Поляков (Grammarly)
Ontico
Всеволод Поляков (DevOps Team Lead в Grammarly)
Всеволод Поляков (DevOps Team Lead в Grammarly)
Provectus
"Metrics: Where and How", Vsevolod Polyakov
"Metrics: Where and How", Vsevolod Polyakov
Yulia Shcherbachova
Developing High Performance Application with Aerospike & Go
Developing High Performance Application with Aerospike & Go
Chris Stivers
Jvm & Garbage collection tuning for low latencies application
Jvm & Garbage collection tuning for low latencies application
Quentin Ambard
Gnocchi v4 (preview)
Gnocchi v4 (preview)
Gordon Chung
Gnocchi Profiling 2.1.x
Gnocchi Profiling 2.1.x
Gordon Chung
Gnocchi v3 brownbag
Gnocchi v3 brownbag
Gordon Chung
Gnocchi Profiling v2
Gnocchi Profiling v2
Gordon Chung
Declarative Infrastructure Tools
Declarative Infrastructure Tools
Yulia Shcherbachova
Exactly once with spark streaming
Exactly once with spark streaming
Quentin Ambard
MongoUK 2011 - Rplacing RabbitMQ with MongoDB
MongoUK 2011 - Rplacing RabbitMQ with MongoDB
Boxed Ice
Thanos - Prometheus on Scale
Thanos - Prometheus on Scale
Bartłomiej Płotka
JVM performance options. How it works
JVM performance options. How it works
Dmitriy Dumanskiy
(JVM) Garbage Collection - Brown Bag Session
(JVM) Garbage Collection - Brown Bag Session
Jens Hadlich
opentsdb in a real enviroment
opentsdb in a real enviroment
Chen Robert
Go Profiling - John Graham-Cumming
Go Profiling - John Graham-Cumming
Cloudflare
ELK: Moose-ively scaling your log system
ELK: Moose-ively scaling your log system
Avleen Vig
Go debugging and troubleshooting tips - from real life lessons at SignalFx
Go debugging and troubleshooting tips - from real life lessons at SignalFx
SignalFx
Handling 20 billion requests a month
Handling 20 billion requests a month
Dmitriy Dumanskiy
Go Memory
Go Memory
Cloudflare
Tweaking performance on high-load projects
Tweaking performance on high-load projects
Dmitriy Dumanskiy
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Geoffrey Anderson
Ceph Object Storage Performance Secrets and Ceph Data Lake Solution
Ceph Object Storage Performance Secrets and Ceph Data Lake Solution
Karan Singh
HBaseCon 2013: OpenTSDB at Box
HBaseCon 2013: OpenTSDB at Box
Cloudera, Inc.
Gnocchi v4 - past and present
Gnocchi v4 - past and present
Gordon Chung
On heap cache vs off-heap cache
On heap cache vs off-heap cache
rgrebski
Java 어플리케이션 성능튜닝 Part1
Java 어플리케이션 성능튜닝 Part1
상욱 송
Monitoring base, golang meetup, kyiv
Monitoring base, golang meetup, kyiv
Vsevolod Polyakov
Путь мониторинга, DevOps club в Grammarly
Путь мониторинга, DevOps club в Grammarly
Vsevolod Polyakov
More Related Content
What's hot
Gnocchi Profiling v2
Gnocchi Profiling v2
Gordon Chung
Declarative Infrastructure Tools
Declarative Infrastructure Tools
Yulia Shcherbachova
Exactly once with spark streaming
Exactly once with spark streaming
Quentin Ambard
MongoUK 2011 - Rplacing RabbitMQ with MongoDB
MongoUK 2011 - Rplacing RabbitMQ with MongoDB
Boxed Ice
Thanos - Prometheus on Scale
Thanos - Prometheus on Scale
Bartłomiej Płotka
JVM performance options. How it works
JVM performance options. How it works
Dmitriy Dumanskiy
(JVM) Garbage Collection - Brown Bag Session
(JVM) Garbage Collection - Brown Bag Session
Jens Hadlich
opentsdb in a real enviroment
opentsdb in a real enviroment
Chen Robert
Go Profiling - John Graham-Cumming
Go Profiling - John Graham-Cumming
Cloudflare
ELK: Moose-ively scaling your log system
ELK: Moose-ively scaling your log system
Avleen Vig
Go debugging and troubleshooting tips - from real life lessons at SignalFx
Go debugging and troubleshooting tips - from real life lessons at SignalFx
SignalFx
Handling 20 billion requests a month
Handling 20 billion requests a month
Dmitriy Dumanskiy
Go Memory
Go Memory
Cloudflare
Tweaking performance on high-load projects
Tweaking performance on high-load projects
Dmitriy Dumanskiy
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Geoffrey Anderson
Ceph Object Storage Performance Secrets and Ceph Data Lake Solution
Ceph Object Storage Performance Secrets and Ceph Data Lake Solution
Karan Singh
HBaseCon 2013: OpenTSDB at Box
HBaseCon 2013: OpenTSDB at Box
Cloudera, Inc.
Gnocchi v4 - past and present
Gnocchi v4 - past and present
Gordon Chung
On heap cache vs off-heap cache
On heap cache vs off-heap cache
rgrebski
Java 어플리케이션 성능튜닝 Part1
Java 어플리케이션 성능튜닝 Part1
상욱 송
What's hot
(20)
Gnocchi Profiling v2
Gnocchi Profiling v2
Declarative Infrastructure Tools
Declarative Infrastructure Tools
Exactly once with spark streaming
Exactly once with spark streaming
MongoUK 2011 - Rplacing RabbitMQ with MongoDB
MongoUK 2011 - Rplacing RabbitMQ with MongoDB
Thanos - Prometheus on Scale
Thanos - Prometheus on Scale
JVM performance options. How it works
JVM performance options. How it works
(JVM) Garbage Collection - Brown Bag Session
(JVM) Garbage Collection - Brown Bag Session
opentsdb in a real enviroment
opentsdb in a real enviroment
Go Profiling - John Graham-Cumming
Go Profiling - John Graham-Cumming
ELK: Moose-ively scaling your log system
ELK: Moose-ively scaling your log system
Go debugging and troubleshooting tips - from real life lessons at SignalFx
Go debugging and troubleshooting tips - from real life lessons at SignalFx
Handling 20 billion requests a month
Handling 20 billion requests a month
Go Memory
Go Memory
Tweaking performance on high-load projects
Tweaking performance on high-load projects
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Ceph Object Storage Performance Secrets and Ceph Data Lake Solution
Ceph Object Storage Performance Secrets and Ceph Data Lake Solution
HBaseCon 2013: OpenTSDB at Box
HBaseCon 2013: OpenTSDB at Box
Gnocchi v4 - past and present
Gnocchi v4 - past and present
On heap cache vs off-heap cache
On heap cache vs off-heap cache
Java 어플리케이션 성능튜닝 Part1
Java 어플리케이션 성능튜닝 Part1
Viewers also liked
Monitoring base, golang meetup, kyiv
Monitoring base, golang meetup, kyiv
Vsevolod Polyakov
Путь мониторинга, DevOps club в Grammarly
Путь мониторинга, DevOps club в Grammarly
Vsevolod Polyakov
Chef wtf
Chef wtf
Vsevolod Polyakov
Путь мониторинга: модульность, гибкость, devops
Путь мониторинга: модульность, гибкость, devops
Vsevolod Polyakov
Metrics: where and how
Metrics: where and how
Vsevolod Polyakov
Infrastructure as code might be literally impossible part 2
Infrastructure as code might be literally impossible part 2
ice799
Nginx Workshop Aftermath
Nginx Workshop Aftermath
Denis Zhdanov
Scaling graphite for application metrics
Scaling graphite for application metrics
Jim Plush
Graphite
Graphite
David Lutz
Мониторинг в высоконагруженных (и не только) проектах: сравнительный анализ с...
Мониторинг в высоконагруженных (и не только) проектах: сравнительный анализ с...
Ontico
PostgreSQL on EXT4, XFS, BTRFS and ZFS
PostgreSQL on EXT4, XFS, BTRFS and ZFS
Tomas Vondra
Linux tuning to improve PostgreSQL performance
Linux tuning to improve PostgreSQL performance
PostgreSQL-Consulting
Нейронечёткая классификация слабо формализуемых данных | Тимур Гильмуллин
Нейронечёткая классификация слабо формализуемых данных | Тимур Гильмуллин
Positive Hack Days
Viewers also liked
(13)
Monitoring base, golang meetup, kyiv
Monitoring base, golang meetup, kyiv
Путь мониторинга, DevOps club в Grammarly
Путь мониторинга, DevOps club в Grammarly
Chef wtf
Chef wtf
Путь мониторинга: модульность, гибкость, devops
Путь мониторинга: модульность, гибкость, devops
Metrics: where and how
Metrics: where and how
Infrastructure as code might be literally impossible part 2
Infrastructure as code might be literally impossible part 2
Nginx Workshop Aftermath
Nginx Workshop Aftermath
Scaling graphite for application metrics
Scaling graphite for application metrics
Graphite
Graphite
Мониторинг в высоконагруженных (и не только) проектах: сравнительный анализ с...
Мониторинг в высоконагруженных (и не только) проектах: сравнительный анализ с...
PostgreSQL on EXT4, XFS, BTRFS and ZFS
PostgreSQL on EXT4, XFS, BTRFS and ZFS
Linux tuning to improve PostgreSQL performance
Linux tuning to improve PostgreSQL performance
Нейронечёткая классификация слабо формализуемых данных | Тимур Гильмуллин
Нейронечёткая классификация слабо формализуемых данных | Тимур Гильмуллин
Similar to Мониторинг. Опять, rootconf 2016
Garbage First Garbage Collector (G1 GC) - Migration to, Expectations and Adva...
Garbage First Garbage Collector (G1 GC) - Migration to, Expectations and Adva...
Monica Beckwith
Am I reading GC logs Correctly?
Am I reading GC logs Correctly?
Tier1 App
Couchbase live 2016
Couchbase live 2016
Pierre Mavro
Tweaking perfomance on high-load projects_Думанский Дмитрий
Tweaking perfomance on high-load projects_Думанский Дмитрий
GeeksLab Odessa
Kafka to the Maxka - (Kafka Performance Tuning)
Kafka to the Maxka - (Kafka Performance Tuning)
DataWorks Summit
InfluxDB IOx Tech Talks: Intro to the InfluxDB IOx Read Buffer - A Read-Optim...
InfluxDB IOx Tech Talks: Intro to the InfluxDB IOx Read Buffer - A Read-Optim...
InfluxData
Hadoop Meetup Jan 2019 - Dynamometer and a Case Study in NameNode GC
Hadoop Meetup Jan 2019 - Dynamometer and a Case Study in NameNode GC
Erik Krogen
osdi20-slides_zhao.pptx
osdi20-slides_zhao.pptx
Cive1971
Basics of JVM Tuning
Basics of JVM Tuning
Vladislav Gangan
Presentation_Parallel GRASP algorithm for job shop scheduling
Presentation_Parallel GRASP algorithm for job shop scheduling
Antonio Maria Fiscarelli
ELK stack at weibo.com
ELK stack at weibo.com
琛琳 饶
[Outdated] Secrets of Performance Tuning Java on Kubernetes
[Outdated] Secrets of Performance Tuning Java on Kubernetes
Bruno Borges
JVM memory management & Diagnostics
JVM memory management & Diagnostics
Dhaval Shah
(DAT402) Amazon RDS PostgreSQL:Lessons Learned & New Features
(DAT402) Amazon RDS PostgreSQL:Lessons Learned & New Features
Amazon Web Services
Achieving 100k Queries per Hour on Hive on Tez
Achieving 100k Queries per Hour on Hive on Tez
DataWorks Summit/Hadoop Summit
Accelerating HBase with NVMe and Bucket Cache
Accelerating HBase with NVMe and Bucket Cache
Nicolas Poggi
Adaptive Linear Solvers and Eigensolvers
Adaptive Linear Solvers and Eigensolvers
inside-BigData.com
A G1GC Saga-KCJUG.pptx
A G1GC Saga-KCJUG.pptx
Monica Beckwith
Accumulo Summit 2014: Benchmarking Accumulo: How Fast Is Fast?
Accumulo Summit 2014: Benchmarking Accumulo: How Fast Is Fast?
Accumulo Summit
Hadoop performance optimization tips
Hadoop performance optimization tips
Subhas Kumar Ghosh
Similar to Мониторинг. Опять, rootconf 2016
(20)
Garbage First Garbage Collector (G1 GC) - Migration to, Expectations and Adva...
Garbage First Garbage Collector (G1 GC) - Migration to, Expectations and Adva...
Am I reading GC logs Correctly?
Am I reading GC logs Correctly?
Couchbase live 2016
Couchbase live 2016
Tweaking perfomance on high-load projects_Думанский Дмитрий
Tweaking perfomance on high-load projects_Думанский Дмитрий
Kafka to the Maxka - (Kafka Performance Tuning)
Kafka to the Maxka - (Kafka Performance Tuning)
InfluxDB IOx Tech Talks: Intro to the InfluxDB IOx Read Buffer - A Read-Optim...
InfluxDB IOx Tech Talks: Intro to the InfluxDB IOx Read Buffer - A Read-Optim...
Hadoop Meetup Jan 2019 - Dynamometer and a Case Study in NameNode GC
Hadoop Meetup Jan 2019 - Dynamometer and a Case Study in NameNode GC
osdi20-slides_zhao.pptx
osdi20-slides_zhao.pptx
Basics of JVM Tuning
Basics of JVM Tuning
Presentation_Parallel GRASP algorithm for job shop scheduling
Presentation_Parallel GRASP algorithm for job shop scheduling
ELK stack at weibo.com
ELK stack at weibo.com
[Outdated] Secrets of Performance Tuning Java on Kubernetes
[Outdated] Secrets of Performance Tuning Java on Kubernetes
JVM memory management & Diagnostics
JVM memory management & Diagnostics
(DAT402) Amazon RDS PostgreSQL:Lessons Learned & New Features
(DAT402) Amazon RDS PostgreSQL:Lessons Learned & New Features
Achieving 100k Queries per Hour on Hive on Tez
Achieving 100k Queries per Hour on Hive on Tez
Accelerating HBase with NVMe and Bucket Cache
Accelerating HBase with NVMe and Bucket Cache
Adaptive Linear Solvers and Eigensolvers
Adaptive Linear Solvers and Eigensolvers
A G1GC Saga-KCJUG.pptx
A G1GC Saga-KCJUG.pptx
Accumulo Summit 2014: Benchmarking Accumulo: How Fast Is Fast?
Accumulo Summit 2014: Benchmarking Accumulo: How Fast Is Fast?
Hadoop performance optimization tips
Hadoop performance optimization tips
Recently uploaded
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
roncy bisnoi
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
BhangaleSonal
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
roncy bisnoi
chapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineering
mulugeta48
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01
KreezheaRecto
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
Call Girls in Nagpur High Profile Call Girls
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
ssuser89054b
Work-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptx
JuliansyahHarahap1
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Call Girls in Nagpur High Profile
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leap
RishantSharmaFr
Block diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.ppt
NANDHAKUMARA10
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
dharasingh5698
Unit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdf
RagavanV2
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Er. Suman Jyoti
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
ankushspencer015
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
DineshKumar4165
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
Unit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdf
RagavanV2
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
DineshKumar4165
Recently uploaded
(20)
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
chapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineering
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Work-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptx
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leap
Block diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.ppt
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Unit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Unit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdf
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
Мониторинг. Опять, rootconf 2016
1.
МОНИТОРИНГ. ОПЯТЬ. Всеволод Поляков
2.
Platform Engineer .
Grammarly ctrlok.com
3.
Что такое метрики?
4.
5.
6.
Успешность
7.
Количество
8.
Время
9.
10.
Взаимодействие
11.
Внутренние процессы
12.
Системные метрики
13.
Зачем нужны метрики?
14.
Алерты
15.
Аналитика
16.
17.
Graphite
18.
Default graphite architecture
19.
what? • RRD-like (gram.ly/gfsx) •
so.it.is.my.metric → /so/it/is/my/metric.wsp • Fixed retention (by namepattern) • Fixed size (actually no)
20.
Retention and size •
1s:1d → 1 036 828 bytes • 10s:10d → 1 036 828 bytes • 1s:365d → 378 432 028 bytes (1 TB ~ 3 000) • 10s:365d → 37 843 228 bytes (1 TB ~ 30 000) whisper calc
21.
Retention and size •
10s:30d,1m:120d,10m:365d → 4 564 864 bytes • 240 864 metrics in 1 TB • aggregation: average, sum, min, max, and last. • can be assign per metric
22.
How • terraform (https://www.terraform.io/) •
docker (https://www.docker.com/) • ansible (https://www.ansible.com/) • rocker (https://github.com/grammarly/rocker) • rocker-compose (https://github.com/grammarly/rocker-compose)
23.
Default graphite architecture
24.
carbon-cache.py • single-core • many
options in config file • default link
25.
architecture carbon-cache.py
26.
Start load testing •
m4.xlarge instance (4 CPU, 16 GB ram, 256 GB disk EBS gp2) • retentions = 1s:1d • MAX_CACHE_SIZE, MAX_UPDATES_PER_SECOND, MAX_CREATES_PER_MINUTE = inf • defaults • almost 1.5h to get limit :(
27.
carbon-cache.py cache size
→ 75k ms
28.
29.
30.
results • 75 000
ms max • 60 000 ms flagman speed • IO :(
31.
Try to tune! •
WHISPER_SPARSE_CREATE = true (don’t allocate space on creation) non-linear IO load. • CACHE_WRITE_STRATEGY = sorted (default)
32.
cache size 1k
→ 195k ms
33.
results • 120 000
ms flagman speed • cache flush problem :(
34.
Try to tune! •
CACHE_WRITE_STRATEGY = max will give a strong flush preference to frequently updated metrics and will also reduce random file-io.
35.
from 1k to
150k
36.
results • 90 000
ms flagman speed • cache flush problem :(
37.
Try to tune! •
CACHE_WRITE_STRATEGY = naive just flush. Better with random IO.
38.
from 45k to
135k
39.
results • 120 000
ms flagman speed • still CPU
40.
sorted max naive
41.
• Maybe it’s
IO EBS limitation? → 512 GB disk. • No.
42.
go-carbon • multi-core single
daemon • written in golang • not many options to tune :( link
43.
Start load testing •
m4.xlarge instance (4 CPU, 16 GB ram, 256 GB disk EBS gp2) • retentions = 1s:1d • max-size = 0 • max-updates-per-second = 0 • almost 1h to get limit :(
44.
1k → 130k
ms ~3k/min
45.
46.
results • 120 000
ms flagman speed • but it’s without sparse. • try to implement
47.
try to tune! remaining
:= whisper.Size() - whisper.MetadataSize() whisper.file.Seek(int64(remaining-1), 0) whisper.file.Write([]byte{0}) chunkSize := 16384 zeros := make([]byte, chunkSize) for remaining > chunkSize { // if _, err = whisper.file.Write(zeros); err != nil { // return nil, err // } remaining -= chunkSize } if _, err = whisper.file.Write(zeros[:remaining]); err != nil { return nil, err }
48.
Уже есть в
go-carbon
49.
180 000 ms
!
50.
51.
try to tune! •
max update operation = 1500
52.
results • TLDR 210
000 - 240 000 ms flagman speed • 31 000 000 cache size!
53.
54.
try to tune! •
max update operation = 0 • input-buffer = 400 000
55.
results • 270 000
ms flagman speed • 10-20kk cache size!
56.
57.
try to tune! •
vm.dirty_background_ratio=40 • vm.dirty_ratio=60
58.
300 000 reqs
59.
results • 300 000
ms flagman speed • 180k+ ms ±without cache
60.
Re:Lays
61.
Default graphite architecture
62.
arch forward
63.
arch namedregexp
64.
arch hash
65.
arch hash replicafactor:
2
66.
carbon-relay.py • twisted based •
native
67.
Start load testing •
c4.xlarge instance (4 CPU, 7.5 GB ram) • ~1 Gb lan • default parameters • hashing • 10 connections
68.
WTF!
69.
carbon-relay-ng • golang-based • web-panel •
live-updates • aggregators • spooling link
70.
<150 000 reqs
71.
carbon-c-relay • написан на
C • advanced cluster management
72.
from 100 000
to 1 600 000 reqs
73.
1 400 000
flagman speed. Or not?
74.
Итак… go-carbon + carbon-c-relay
= ♡
75.
76.
77.
78.
79.
80.
Контейнеры
81.
Всё перепутано
82.
Различия • Окружение • Роль •
Трек (Модификатор) • IP • Датацентр • Что-угодно
83.
Теги
84.
TSDB с тегами •
influxDB • openTSDB (hbase) • cyanite (cassandra) • newTS (cassandra) • Prometheus
85.
(cluster) influx, 130k
metrics увеличить график
86.
87.
openTSDB single instance +
hbase cluster = upto 150k metrics
88.
89.
90.
91.
92.
Compaction
93.
94.
Graphite
95.
96.
97.
98.
99.
Найти уникальное
100.
101.
102.
103.
104.
105.
106.
107.
Работает с Grafana
108.
Zipper • https://github.com/grobian/carbonserver • https://github.com/dgryski/carbonzipper •
https://github.com/dgryski/carbonapi
109.
110.
111.
ALSO • https://github.com/jssjr/carbonate • https://github.com/jjneely/buckytools •
https://github.com/dgryski/carbonmem • https://github.com/grobian/carbonwriter
112.
Планы • Патч statsd
→ ES • Патч carbonserver → carbonlink
113.
feel free to
ask • Vsevolod Polyakov • ctrlok@gmail.com • skype: ctrlok1987 • github.com/ctrlok • twitter.com/ctrlok • slack: HangOps • Gitter: dev_ua/devops • skype: DevOps from Ukraine • slack.ukrops.club Мы хайрим!
Download now