Kafka Summit NYC 2017 - Introducing Exactly Once Semantics in Apache Kafka

confluent

Presentation by Apurva Mehta, Jason Gustafson, Guozhang Wang, Engineers, Confluent

1
Introducing Exactly Once Semantics in Apache Kafka
Jason Gustafson, Guozhang Wang, Sriram Subramaniam, and Apurva Mehta

2
On deck..
• Kafka’s existing delivery semantics.
• Why did we improve them?
• What’s new?
• How do you use it?
• Summary.

13
TL;DR – What we have today
• At-least-once, in-order delivery per partition.
• Producer retries can introduce duplicates.

15
Why improve?
• Stream processing is becoming an ever bigger part of the data landscape.
• Apache Kafka is the heart of the streams platform.
• Strengthening Kafka’s semantics expands the universe of streaming applications.

16
A motivating example..
A peer-to-peer lending platform that processes micro-loans between users.

23
What’s new
• Exactly-once, in-order delivery per partition
• Atomic writes across multiple partitions
• Performance considerations

24
What’s new, Part 1
Exactly once, in order, delivery per partition

33
TL;DR
• Sequence numbers and producer ids:
  • enable de-dup
  • are in the log.
• Hence de-dup works transparently across leader changes.
• Will not de-dup application-level resends.
• Works transparently – no API changes.
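
The de-dup rule above can be illustrated with a toy model. This is only a sketch of the idea, not Kafka's broker code: the class `DedupPartitionLog` and its methods are hypothetical names, and it assumes a single partition where each producer id sends sequence numbers starting at 0.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Toy model of one partition that de-duplicates by (producer id, sequence number). */
class DedupPartitionLog {
    private final List<String> log = new ArrayList<>();
    // Highest sequence number appended so far for each producer id.
    private final Map<Long, Integer> lastSequence = new HashMap<>();

    /** Returns true if the record was appended, false if it was dropped as a duplicate retry. */
    boolean append(long producerId, int sequence, String value) {
        int last = lastSequence.getOrDefault(producerId, -1);
        if (sequence <= last) {
            return false; // already appended: this is a retry of an earlier send
        }
        if (sequence != last + 1) {
            throw new IllegalStateException(
                "out-of-order sequence " + sequence + ", expected " + (last + 1));
        }
        log.add(value);
        lastSequence.put(producerId, sequence);
        return true;
    }

    /** Returns a copy of the appended values in offset order. */
    List<String> records() {
        return new ArrayList<>(log);
    }

    public static void main(String[] args) {
        DedupPartitionLog partition = new DedupPartitionLog();
        partition.append(7L, 0, "transfer-1");
        partition.append(7L, 0, "transfer-1"); // producer retry, same sequence: dropped
        partition.append(7L, 1, "transfer-1"); // application-level resend, new sequence: kept
        System.out.println(partition.records());
    }
}
```

The last call shows why application-level resends are not de-duplicated: the resend gets a fresh sequence number, so the model treats it as a new record.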

34
What’s new, part 2
Multi partition writes.

35
Introducing ‘transactions’
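producer.initTransactions();
try {
    producer.beginTransaction();
    producer.send(record0);
    producer.send(record1);
    producer.sendOffsetsToTransaction(…);
    producer.commitTransaction();
} catch (ProducerFencedException e) {
    // Another producer with the same transactional.id has taken over; this instance must stop.
    producer.close();
} catch (KafkaException e) {
    // Any other error: abort so that none of the writes become visible to read_committed consumers.
    producer.abortTransaction();
}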

44
Let’s review the APIs
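producer.initTransactions();
try {
    producer.beginTransaction();
    producer.send(record0);
    producer.send(record1);
    producer.sendOffsetsToTransaction(…);
    producer.commitTransaction();
} catch (ProducerFencedException e) {
    // Another producer with the same transactional.id has taken over; this instance must stop.
    producer.close();
} catch (KafkaException e) {
    // Any other error: abort so that none of the writes become visible to read_committed consumers.
    producer.abortTransaction();
}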
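
The control flow of the snippet above can be exercised without a broker. The sketch below is an assumption-laden stand-in, not the Kafka client: `TxProducer`, `FencedException`, and `RecordingProducer` are hypothetical types that only mirror the method names on the slide (offset sending is left out), so the commit, abort, and close paths can be observed in memory.

```java
import java.util.ArrayList;
import java.util.List;

class TransactionFlowSketch {
    /** Hypothetical stand-in that mirrors the transactional method names from the slide. */
    interface TxProducer {
        void initTransactions();
        void beginTransaction();
        void send(String record);
        void commitTransaction();
        void abortTransaction();
        void close();
    }

    /** Hypothetical stand-in for Kafka's ProducerFencedException. */
    static class FencedException extends RuntimeException {}

    /**
     * Runs one transaction on a producer whose initTransactions() was already called.
     * Returns "committed", "aborted", or "closed".
     */
    static String runOnce(TxProducer producer, List<String> records) {
        try {
            producer.beginTransaction();
            for (String record : records) {
                producer.send(record);
            }
            producer.commitTransaction();
            return "committed";
        } catch (FencedException e) {
            producer.close();            // fenced: this instance cannot continue
            return "closed";
        } catch (RuntimeException e) {
            producer.abortTransaction(); // other failures: discard the whole transaction
            return "aborted";
        }
    }

    /** In-memory producer that records every call and can be told to fail on send. */
    static class RecordingProducer implements TxProducer {
        final List<String> calls = new ArrayList<>();
        private final RuntimeException failureOnSend;

        RecordingProducer(RuntimeException failureOnSend) {
            this.failureOnSend = failureOnSend;
        }

        public void initTransactions() { calls.add("init"); }
        public void beginTransaction() { calls.add("begin"); }
        public void send(String record) {
            if (failureOnSend != null) throw failureOnSend;
            calls.add("send:" + record);
        }
        public void commitTransaction() { calls.add("commit"); }
        public void abortTransaction() { calls.add("abort"); }
        public void close() { calls.add("close"); }
    }

    public static void main(String[] args) {
        RecordingProducer producer = new RecordingProducer(null);
        producer.initTransactions(); // once per producer instance, before the first transaction
        System.out.println(runOnce(producer, List.of("record0", "record1")));
        System.out.println(producer.calls);
    }
}
```

Keeping the fenced case separate from other errors matters because a fenced producer must not abort or retry: a newer instance with the same transactional.id now owns the work.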

49
Consumer returns only committed messages

50
Some notes on consuming transactions
• Two ‘isolation levels’: read_committed and read_uncommitted.
• Messages are read in offset order.
• read_committed consumers read up to the point where there are no open transactions.
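
A toy model may help make the two isolation levels concrete. It is a simplification under stated assumptions, not the consumer's real implementation: `ReadCommittedSketch`, `Entry`, and `Kind` are hypothetical names, list positions stand in for offsets, and each transaction id appears only once in the log.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

class ReadCommittedSketch {
    enum Kind { DATA, COMMIT, ABORT }

    /** One log entry; txnId is null for non-transactional data. */
    static final class Entry {
        final Kind kind;
        final String txnId;
        final String value;

        Entry(Kind kind, String txnId, String value) {
            this.kind = kind;
            this.txnId = txnId;
            this.value = value;
        }
    }

    /** Returns the values an application would see, in offset order. */
    static List<String> read(List<Entry> log, boolean readCommitted) {
        // Outcome of every transaction that already has a marker in the log.
        Map<String, Kind> outcome = new HashMap<>();
        for (Entry e : log) {
            if (e.kind != Kind.DATA) outcome.put(e.txnId, e.kind);
        }

        List<String> visible = new ArrayList<>();
        for (Entry e : log) {
            if (e.kind != Kind.DATA) continue;          // markers never reach the application
            if (!readCommitted || e.txnId == null) {    // read_uncommitted, or plain data
                visible.add(e.value);
                continue;
            }
            Kind result = outcome.get(e.txnId);
            if (result == null) break;                  // first open transaction: stop reading here
            if (result == Kind.COMMIT) visible.add(e.value); // aborted data is skipped
        }
        return visible;
    }

    static List<Entry> sampleLog() {
        List<Entry> log = new ArrayList<>();
        log.add(new Entry(Kind.DATA, "t1", "a"));
        log.add(new Entry(Kind.DATA, "t2", "b"));
        log.add(new Entry(Kind.COMMIT, "t1", null));
        log.add(new Entry(Kind.DATA, null, "c"));
        log.add(new Entry(Kind.ABORT, "t2", null));
        log.add(new Entry(Kind.DATA, "t3", "d"));       // t3 is still open
        log.add(new Entry(Kind.DATA, null, "e"));
        return log;
    }

    public static void main(String[] args) {
        System.out.println(read(sampleLog(), true));    // committed view
        System.out.println(read(sampleLog(), false));   // uncommitted view
    }
}
```

In this model the committed view stops at the first record of the open transaction t3, so the later non-transactional record "e" is held back until t3 is resolved. That is the cost of reading in offset order.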

51
TL;DR
• Transaction coordinator and transaction log maintain transaction state.
• Use the new producer APIs for transactions.
• Consumers can read only committed messages.

53
What’s new, part 3: Performance boost!
• Up to +20% producer throughput
• Up to +50% consumer throughput
• Up to -20% disk utilization
• Savings start when you batch
• Details: https://bit.ly/kafka-eos-perf

54
Too good to be true?
Let’s understand how!

60
A visual comparison with 7 records, 10 bytes each

61
TL;DR
• With a batch size of 2, the new format starts saving space.
• Savings are maximal for large batches of small messages.
• Hence higher throughput when IO bound.
• Works as soon as you upgrade to the new format.

63
Producer Configs
• enable.idempotence = true
• max.in.flight.requests.per.connection = 1
• acks = “all”
• retries > 0 (preferably MAX_INT)
• transactional.id = ‘some unique id’ (transactions only; requires enable.idempotence = true)
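
As a rough illustration, the producer settings can be collected in `java.util.Properties` objects like the ones a Kafka producer is constructed from. The class and method names here are hypothetical, the bootstrap address and transactional id are placeholders, and the keys are written as the Kafka client spells them (`max.in.flight.requests.per.connection`). A real producer also needs `key.serializer` and `value.serializer`, which this sketch omits.

```java
import java.util.Properties;

class EosProducerConfigSketch {
    /** Settings for an idempotent (non-transactional) producer. */
    static Properties idempotentProducer(String bootstrapServers) {
        Properties props = new Properties();
        props.setProperty("bootstrap.servers", bootstrapServers);
        props.setProperty("enable.idempotence", "true");
        props.setProperty("acks", "all");
        props.setProperty("retries", Integer.toString(Integer.MAX_VALUE));
        props.setProperty("max.in.flight.requests.per.connection", "1");
        return props;
    }

    /** Adds a transactional.id on top of the idempotent settings. */
    static Properties transactionalProducer(String bootstrapServers, String transactionalId) {
        Properties props = idempotentProducer(bootstrapServers);
        props.setProperty("transactional.id", transactionalId);
        return props;
    }

    public static void main(String[] args) {
        // "localhost:9092" and "loan-processor-1" are placeholder values.
        Properties props = transactionalProducer("localhost:9092", "loan-processor-1");
        props.forEach((key, value) -> System.out.println(key + " = " + value));
    }
}
```

Building the transactional settings on top of the idempotent ones reflects the dependency between them: a producer with a `transactional.id` also runs with idempotence enabled.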

64
Consumer configs
• isolation.level:
  • “read_committed”, or
  • “read_uncommitted”
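
A consumer that should see only committed messages might be configured along these lines. This is a sketch with hypothetical class and method names and placeholder values. Deserializer settings are omitted, and turning off auto-commit is an assumption that fits applications which commit offsets through the producer's transaction.

```java
import java.util.Properties;

class EosConsumerConfigSketch {
    /** Settings for a consumer that only returns committed transactional messages. */
    static Properties readCommittedConsumer(String bootstrapServers, String groupId) {
        Properties props = new Properties();
        props.setProperty("bootstrap.servers", bootstrapServers);
        props.setProperty("group.id", groupId);
        // The default is read_uncommitted, so this has to be set explicitly.
        props.setProperty("isolation.level", "read_committed");
        // Assumption: offsets are committed as part of the transaction, not automatically.
        props.setProperty("enable.auto.commit", "false");
        return props;
    }

    public static void main(String[] args) {
        // "localhost:9092" and "loan-processors" are placeholder values.
        Properties props = readCommittedConsumer("localhost:9092", "loan-processors");
        props.forEach((key, value) -> System.out.println(key + " = " + value));
    }
}
```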

65
Streams config
• processing.guarantee = “exactly_once”
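
For Kafka Streams the switch is a single property. In the released Streams configuration the key is spelled `processing.guarantee`. The sketch below uses hypothetical class and method names and placeholder values, and only assembles the properties a Streams application would be started with.

```java
import java.util.Properties;

class EosStreamsConfigSketch {
    /** Settings for a Streams application that asks for exactly-once processing. */
    static Properties exactlyOnceStreams(String bootstrapServers, String applicationId) {
        Properties props = new Properties();
        props.setProperty("bootstrap.servers", bootstrapServers);
        props.setProperty("application.id", applicationId);
        props.setProperty("processing.guarantee", "exactly_once");
        return props;
    }

    public static void main(String[] args) {
        // "localhost:9092" and "loan-app" are placeholder values.
        Properties props = exactlyOnceStreams("localhost:9092", "loan-app");
        props.forEach((key, value) -> System.out.println(key + " = " + value));
    }
}
```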

66
Putting it together
• We understood Kafka’s existing delivery semantics
• Understood why we want to improve them
• Learned how these have been strengthened
• Learned how the new semantics work

67
When is it available?
Available to try in Kafka 0.11, June 2017.

Kafka Summit NYC 2017 - Introducing Exactly Once Semantics in Apache Kafka

1. 1 Introducing Exactly Once Semantics in Apache Kafka Jason Gustafson, Guozhang Wang, Sriram Subramaniam, and Apurva Mehta

2. 2 On deck.. • Kafka’s existing delivery semantics. • Why did we improve them? • What’s new? • How do you use it? • Summary.

3. 3 Apache Kafka’s existing semantics

4. 4 Existing Semantics

5. 5 Existing Semantics

6. 6 Existing Semantics

7. 7 Existing Semantics

8. 8 Existing Semantics

9. 9 Existing Semantics

10. 10 Existing Semantics

11. 11 Existing Semantics

12. 12 Existing Semantics

13. 13 TL;DR – What we have today • At least once in order delivery per partition. • Producer retries can introduce duplicates.

14. 14 Why improve?

15. 15 Why improve? • Stream processing is becoming an ever bigger part of the data landscape. • Apache Kafka is the heart of the streams platform. • Strengthening Kafka’s semantics expands the universe of streaming applications.

16. 16 A motivating example.. A peer to peer lending platform which processes micro- loans between users.

17. 17 A Peer to Peer Lender

18. 18 The Basic Flow

19. 19 Offset commits

20. 20 Reprocessed transfer, eek!

21. 21 Lost money! Eek eek!

22. 22 What’s new?

23. 23 What’s new • Exactly once in order delivery per partition • Atomic writes across multiple partitions • Performance considerations

24. 24 What’s new, Part 1 Exactly once, in order, delivery per partition

25. 25 The idempotent producer

26. 26 The idempotent producer

27. 27 The idempotent producer

28. 28 The idempotent producer

29. 29 The idempotent producer

30. 30 The idempotent producer

31. 31 The idempotent producer

32. 32 The idempotent producer

33. 33 TL;DR • Sequence numbers and producer ids: • enable de-dup • are in the log. • Hence de-dup works transparently across leader changes. • Will not de-dup application-level resends. • Works transparently – no API changes.

34. 34 What’s new, part 2 Multi partition writes.

35. 35 Introducing ‘transactions’ producer.initTransactions(); try { producer.beginTransaction(); producer.send(record0); producer.send(record1); producer.sendOffsetsToTransaction(…); producer.commitTransaction(); } catch (ProducerFencedException e) { producer.close(); } catch (KafkaException e) { producer.abortTransaction(); }

36. 36 Introducing ‘transactions’

37. 37 Initializing ‘transactions’

38. 38 Transactional sends – part 1

39. 39 Transactional sends – part 2

40. 40 Commit – phase 1

41. 41 Commit – phase 2

42. 42 Commit – phase 2

43. 43 Success!

44. 44 Let’s review the APIs producer.initTransactions(); try { producer.beginTransaction(); producer.send(record0); producer.send(record1); producer.sendOffsetsToTransaction(…); producer.commitTransaction(); } catch (ProducerFencedException e) { producer.close(); } catch (KafkaException e) { producer.abortTransaction(); }

49. 49 Consumer returns only committed messages

50. 50 Some notes on consuming transactions • Two ‘isolation levels’ : read_committed, and read_uncommitted. • Messages read in offset order. • read_committed consumers read to the point where there are no open transactions.

51. 51 TL;DR • Transaction coordinator and transaction log maintain transaction state. • Use the new producer APIs for transactions. • Consumers can read only committed messages.

52. 52 Part 3 Performance!

53. 53 What’s new, part 3: Performance boost! • Up to +20% producer throughput • Up to +50% consumer throughput • Up to -20% disk utilization • Savings start when you batch • Details: https://bit.ly/kafka-eos-perf

54. 54 Too good to be true? Let’s understand how!

55. 55 The old message format

56. 56 The new format

57. 57 The new format -> new fields

58. 58 The new format -> new fields

59. 59 The new format -> delta encoding

60. 60 A visual comparison with 7 records, 10 bytes each

61. 61 TL;DR • With a batch size of 2, the new format starts saving space. • Savings are maximal for large batches of small messages. • Hence higher throughput when IO bound. • Works as soon as you upgrade to the new format.

62. 62 Cool! But how do I use this?

63. 63 Producer Configs • enable.idempotence = true • max.in.flight.requests.per.connection = 1 • acks = “all” • retries > 0 (preferably MAX_INT) • transactional.id = ‘some unique id’ (transactions only; requires enable.idempotence = true)

64. 64 Consumer configs • isolation.level: • “read_committed”, or • “read_uncommitted”

65. 65 Streams config • processing.guarantee = “exactly_once”

66. 66 Putting it together • We understood Kafka’s existing delivery semantics • Understood why we want to improve them • Learned how these have been strengthened • Learned how the new semantics work

67. 67 When is it available? Available to try in Kafka 0.11, June 2017.

68. 68 Thank You!
