KafkaConsumer - Decoupling Consumption and Processing for Better Resource Utilization (Igor Buzatović, Inovativni trendovi d.o.o) Kafka Summit 2020

Decoupling consumption and processing
for better resource utilization
Igor Buzatović
@buzzgor
beegor

Thread per consumer model
poll
★ getting records
★ processing records
★ committing offsets
★ handling consumer group rebalance
process
Single thread is:
If subscription model with default conﬁguration is used,
everything on this list happens inside the poll method,
except processing

Key considerations
★ Offset committing
★ Group rebalancing
By default, both handled automatically
... when implementing threading model

Key considerations
★ Automatic
○ executed periodically
○ commit interval speciﬁed by auto.commit.interval.ms conﬁg (default value = 5000)
★ Manually
○ executed by calling commitSync/commitAsync method
○ commit offsets of the last processed record
Committing offsets

Key considerations
★ Each consumer group is coordinated by one of the brokers (group coordinator)
○ It manages group joining and leaving
○ One broker can be coordinator for many groups
★ One of the consumers in the group is the group leader
○ Responsible for partition assignment
★ Two versions of protocol
○ Eager rebalancing (old)
○ Incremental cooperative rebalancing (new)
Group rebalancing quick overview

Key considerations
★ Rebalancing is triggered by group coordinator when:
○ New consumer joins or existing consumer leaves the group
○ New partitions are added to a topic that is part of subscription
○ Consumer subscription changes
★ Coordinator notifies consumers about rebalance through heartbeat responses
★ Consumers must finish processing for revoked partitions before they can be reassigned
★ Consumers confirms partitions release by sending group join request on next poll
Group rebalancing quick overview

Motivation for multi-threaded model
Why don’t we just add more consumers ?
★ More consumers = more TCP connections to the cluster.
○ Kafka handles connections very efﬁciently so this is generally a small cost.
★ More consumers = more requests being sent to the server
○ causes slightly less batching of data which can cause some drop in I/O throughput.
★ Long record processing
○ can cause group rebalance due to poll interval timeout
○ can cause long rebalancing periods -waiting consumers to re-join group

Multi-threaded implementations
Disclaimer :)
Implementations show in this presentation are not meant to be production
ready.
The goal is to point out the challenges we face while implementing
multithreaded model and only suggest way to overcome them.
There are many more ways to implement multithreaded model.

“Fork join” threading model
poll process process process ...
★ Use multiple threads for processing
but wait for all of them to ﬁnish
before next poll call
★ To keep processing order
guaranties, process records from
same partition by same thread
★ Offset commiting and rebalancing
handled the same way as with single
thread model (by the poll method)

“Fork Join” demo implementation
1. Get records collection from poll method
2. Group records by partition (results in multiple collections of records)
3. Create Task (Callable implementation) for each collection from step 2
4. Use ExecutorService.invokeAll() to submit tasks from step 3
How it works

“FORK JOIN” DEMO

“Fully decoupled” multi-threaded model
poll submit tasks
process
process
process
★ Use multiple threads for processing
★ To keep processing order
guaranties, process records from
same partition by same thread
★ Do not wait for processing threads
to ﬁnish, instead continue to poll
★ Offset must be committed manually
★ Use ConsumerRebalanceListener
so you can ﬁnish processing for
revoked partitions

Decoupled multi-threaded model
★ Poll method is called in parallel with processing - automatic offset committing is not an option
★ Offsets must be committed manually in order to keep at-least-once delivery semantics
★ Thread synchronization is required in order to commit offsets only after processing is done
Committing offsets

★ Records from an individual partition must be processed by only one thread at the time
★ This can be achieved by :
○ Pausing the partition after submitting its records for processing
○ Resuming the partition after records processing is ﬁnished
○ KafkaConsumer is not thread safe, so thread synchronization is required when resuming
Preserving processing order guarantees

★ Consumer joins group rebalance during poll method execution
★ Some of the partitions assigned to the consumer can be revoked while its records are still being
processed by another thread
★ Registering ConsumerRebalanceListener gives us the ability to react to partition revocation
★ Upon receiving partition revoked event we can:
○ Do nothing - leads to unnecessary duplicate record processing and resource consumption
○ Wait for processing task to ﬁnish
○ Stop processing task (wait only current record processing to ﬁnish)
Reacting to group rebalance

“Fully decoupled” demo implementation
1. Get records collection from poll method
2. Group records by partition (results in multiple collections of records)
3. Create Task (Runnable implementation) for each collection from step 2
4. Use ExecutorService.submit() to submit tasks from step 3
5. Pause partitions for submitted tasks
6. Store reference to submitted tasks so we can access them later
Main consumer thread

public void run() {
while (!stopped.get()) {
ConsumerRecords<String, String> records = consumer.poll(...);
handleFetchedRecords(records);
checkActiveTasks();
commitOffsets();
}
}

void handleFetchedRecords(ConsumerRecords<...> allRecords) {
records.partitions().forEach(partition -> {
List<ConsumerRecord> partitionRecords = allRecords.records(partition);
Task task = new Task(partitionRecords);
executor.submit(task);
activeTasks.put(partition, task);
});
consumer.pause(records.partitions());
}

1. Commit offsets from task processing thread
○ Requires thread synchronization
○ Can cause too frequent commits
2. Commit offsets from main thread
○ Ask task for current offset on each main loop iteration and store them to a buffer
○ Commit offsets from buffer periodically
Offset committing - two approaches

private void checkActiveTasks() {
...
activeTasks.forEach((partition, task) -> {
...
long offset = task.getCurrentOffset();
if (offset > 0)
offsetsToCommit.put(partition, new OffsetAndMetadata(offset));
});
...
}
Offset commiting - from the main thread

private void commitOffsets() {
try {
long currentTimeMillis = System.currentTimeMillis();
if (currentTimeMillis - lastCommitTime > 5000) {
consumer.commitAsync(offsetsToCommit);
offsetsToCommit.clear();
lastCommitTime = currentTimeMillis;
}
} catch (Exception e) {
log.error("Failed to commit offsets!", e);
}
}

★ Stopping tasks for revoked partitions
○ Send stop signal to a task, so it can break out its processing loop
○ Wait for task completion using CompletableFuture
○ Commit offsets for revoked partitions (commitSync)
Group rebalance

“FULLY DECOUPLED” DEMO

Conclusion
★ Multi-threading is generally hard
★ KafkaConsumer (Java clients) is not thread-safe
★ Key considerations:
○ Offset committing
○ Reacting to group rebalance

The end
THANK YOU!
Demo app source code
https://github.com/beegor/ksl20-ib-demo-app
https://github.com/beegor/ksl20-ib-demo-frontend

KafkaConsumer - Decoupling Consumption and Processing for Better Resource Utilization (Igor Buzatović, Inovativni trendovi d.o.o) Kafka Summit 2020

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to KafkaConsumer - Decoupling Consumption and Processing for Better Resource Utilization (Igor Buzatović, Inovativni trendovi d.o.o) Kafka Summit 2020

Similar to KafkaConsumer - Decoupling Consumption and Processing for Better Resource Utilization (Igor Buzatović, Inovativni trendovi d.o.o) Kafka Summit 2020 (20)

More from confluent

More from confluent (20)

Recently uploaded

Recently uploaded (20)

KafkaConsumer - Decoupling Consumption and Processing for Better Resource Utilization (Igor Buzatović, Inovativni trendovi d.o.o) Kafka Summit 2020