Anti-entropy repairs are a notoriously tricky maintenance operation for Cassandra clusters. They are problematic mostly because of their potential to degrade cluster performance, and because managing them carefully enough to avoid that degradation is hard.
After long-term pain managing the repairs of nearly 100 Cassandra clusters, and failing to find an existing solution that met our needs, we went ahead and developed an open-source tool, named Cassandra Reaper [1], for easy management of Cassandra repairs.
Cassandra Reaper automates the management of anti-entropy repairs of Cassandra clusters in a rather smart, efficient and careful manner, while requiring minimal Cassandra expertise.
I will first cover some basics of Cassandra's eventual consistency mechanisms, then focus on the features of Cassandra Reaper and on our six months of experience letting the tool manage the repairs of our production clusters.
Repair gone wild
Eats a lot of disk IO
Saturates the network
● because of streaming a lot of data around
Fills up the disk
● because of receiving all replicas, possibly from all other data centers
Causes a ton of compactions
● because of having to merge the received data
… one better be careful
Careful repair
Start & end tokens
● nodetool repair -pr -st <start_token> -et <end_token>
● repairs a part of the token ring only
Requires splitting the ring into smaller intervals
Smaller intervals mean less data
Less data means fewer repairs gone wild
Smaller intervals also mean more intervals
More intervals mean more actual repairs
Repairs need to be babysat :(
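The ring-splitting idea above can be sketched in a few lines. This is a hypothetical illustration, not Cassandra Reaper's actual implementation; it assumes the Murmur3 partitioner's token range of [-2^63, 2^63 - 1] and emits one bounded repair command per segment.

```python
# Sketch: split the full Murmur3 token ring into equal-size repair segments.
# Hypothetical illustration only, not Cassandra Reaper's actual code.

MIN_TOKEN = -(2 ** 63)       # Murmur3Partitioner minimum token
MAX_TOKEN = 2 ** 63 - 1      # Murmur3Partitioner maximum token

def split_ring(segment_count):
    """Return (start_token, end_token) pairs covering the whole ring."""
    total = MAX_TOKEN - MIN_TOKEN + 1
    step = total // segment_count
    segments = []
    start = MIN_TOKEN
    for i in range(segment_count):
        # The last segment absorbs any rounding remainder.
        end = MAX_TOKEN if i == segment_count - 1 else start + step - 1
        segments.append((start, end))
        start = end + 1
    return segments

# Each segment maps to one bounded repair invocation:
#   nodetool repair -pr -st <start> -et <end>
for st, et in split_ring(4):
    print(f"nodetool repair -pr -st {st} -et {et}")
```

More segments mean less data streamed per repair, at the cost of more invocations to babysit; this trade-off is exactly what motivates automating the process.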
The Spotify way
Feature teams are meant to build features
Not waste time operating their C* clusters
Cron-ing nodetool repair is no good
● mostly due to no feedback loop
This all led to the creation of the Reaper
Reaper’s features
Carefulness - doesn’t kill a node
Resilience - retries when things break
● because things break all the time
Parallelism - no idle nodes
● multiple small intervals in parallel
Scheduling - set up things only once
● regular full-ring repairs
Persistence - state saved somewhere
● a bit of extra resilience
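The resilience and parallelism features above can be sketched as follows. This is a hypothetical illustration of the retry-plus-bounded-parallelism idea, not Reaper's actual code; the names `run_segment`, `run_repair`, and `repair_fn` are assumptions for the example.

```python
# Sketch: run repair segments in parallel, retrying failed ones.
# Hypothetical illustration, not Cassandra Reaper's actual implementation.
from concurrent.futures import ThreadPoolExecutor

MAX_ATTEMPTS = 3  # segments may fail transiently; retry a few times

def run_segment(segment, repair_fn):
    """Attempt one segment, retrying because things break all the time."""
    for attempt in range(1, MAX_ATTEMPTS + 1):
        try:
            repair_fn(segment)  # e.g. trigger a subrange repair via JMX
            return True
        except Exception:
            if attempt == MAX_ATTEMPTS:
                return False  # give up on this segment after max attempts
    return False

def run_repair(segments, repair_fn, parallelism=4):
    """Repair all segments with bounded parallelism; return the failures."""
    with ThreadPoolExecutor(max_workers=parallelism) as pool:
        results = list(pool.map(lambda s: run_segment(s, repair_fn), segments))
    return [s for s, ok in zip(segments, results) if not ok]
```

A real scheduler also has to be careful about which segments may run concurrently (only non-overlapping replica sets, to avoid killing a node), which this sketch leaves out.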
What we reaped
First repair done 2015-01-28
1,700 repairs since then, recently 90 per week
176,000 (16%) segments failed at least once
60 repair failures
Greatest benefit
Cassandra Reaper automates a very tedious maintenance operation of Cassandra clusters in a rather smart, efficient and careful manner, while requiring minimal Cassandra expertise.
github.com/spotify/cassandra-reaper
#CassandraSummit