This is the slide deck which was used for the talk 'Change Data Capture using Kafka' at the Kafka Meetup at LinkedIn (Bangalore) held on 11th June 2016.
The talk describes the need for CDC and why it's a good use case for Kafka.
22. Kafka has it all
▪ Horizontally Scalable
▪ Durable – Replication at Partition level
▪ Low latency
▪ High throughput
▪ Data is kept on disk
▪ Log compaction
Imagine you are the owner of a new web-based company with a simple web application. As you are just beginning, you start small, and the webapp probably has the stereotypical three-tier architecture. You have some clients (which may be web browsers, or mobile apps, or both), which make requests to a web application running on your servers. The web application is where your application code or “business logic” lives. Whenever the application wants to remember something for the future, it stores it in a database. And whenever the application wants to look up something that it stored previously, it queries the database. This approach is simple to understand and works pretty well.
Let’s say your business flourishes and you are now attracting a lot of new customers. You ask your end users for feedback and realize that most people complain about slow performance and the lack of rich search functionality. You set up a cache to store pre-rendered HTML pages and other content to speed things up for the end users. You also realize that the basic search functionality of the DB is not good enough for the types of searches that are required, so you set up a separate indexing service.
Perhaps you need to move some expensive operations out of the web request flow, and into an asynchronous background process, so you add a message queue which lets you send jobs to your background workers.
Next, you want to send notifications, such as emails or push notifications, to your users, so you chain a notification system off the side of the job queue for background workers, and it perhaps needs a database of its own to keep track of its state. At this point, you are generating a lot of data that needs to be analyzed, and you can’t have your business analysts running big, expensive queries on your main database, so you add Hadoop and load the data from the database into it. Now you realize that since you have all the data in HDFS anyway, you could actually build your search indexes in Hadoop and push them out to the search servers. All the steps you have taken to improve the system have worked rather well; however, a flurry of issues has accumulated over time.
There are many instances where a single action at the web application triggers multiple concurrent writes to various data stores.
This approach of “dual writes” has a couple of problems associated with it, as we will see in the following slides.
The first and most obvious issue with dual writes is the race condition involved. In this slide, we are looking at two different operations triggered by the webapp, each making its way to two data stores.
Given that there is no coordination between the processes/threads on the webapp that issued these two operations, they might arrive in an order that leaves the data stores in inconsistent states, as shown in the slide. These are the worst kind of inconsistencies, as they don’t produce any error or exception anywhere! They are introduced silently and cause your data stores to diverge from each other.
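To make the race concrete, here is a minimal sketch (the store names and values are hypothetical, not from the talk) showing one possible interleaving of two concurrent dual writes that leaves two stores disagreeing without any error being raised:

```python
# Hypothetical sketch: two concurrent operations ("set x=1" from thread A,
# "set x=2" from thread B) reach two data stores in different orders.
db, cache = {}, {}

db["x"] = 1      # A's write reaches the DB first
db["x"] = 2      # B's write reaches the DB second -- B "wins" in the DB
cache["x"] = 2   # B's write reaches the cache first
cache["x"] = 1   # A's write reaches the cache second -- A "wins" in the cache

print(db["x"], cache["x"])  # 2 1 -- the stores silently diverge
```

No exception was thrown at any point, yet a reader hitting the DB and a reader hitting the cache now see different values for the same key.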
But I’d say you don’t even have to go that far to find a problem with the dual-writes approach.
This slide shows a single write operation being carried out by the webapp against two data stores. One data store gets the write and sends an ack back; however, the ack from the other data store is never received. It could be due to a network issue, an issue at the data store’s end, an issue with the webapp itself, etc., but the end result is that the operation went through to some data stores while we cannot say for certain that all data stores received it. At this point, we either add retry logic to the webapp, or to the data stores, or to both, to try to recover from this situation. The underlying problem is the attempt to atomically perform an operation on multiple data stores: either the operation should be performed on all the data stores at once, or on none of them.
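A small sketch of this partial-failure case, using a made-up `FlakyStore` class to stand in for a data store whose ack is lost:

```python
# Hypothetical sketch: a dual write where one store never acknowledges.
class FlakyStore:
    def __init__(self, fail=False):
        self.data, self.fail = {}, fail

    def write(self, key, value):
        if self.fail:
            # Simulated network issue: the ack never comes back.
            raise TimeoutError("ack never received")
        self.data[key] = value

db = FlakyStore()
search_index = FlakyStore(fail=True)

db.write("user:1", "alice")                 # succeeds
try:
    search_index.write("user:1", "alice")   # ack never received
except TimeoutError:
    pass  # the webapp is now stuck: retry? roll back the DB? give up?

print("user:1" in db.data, "user:1" in search_index.data)  # True False
```

The webapp cannot tell whether the second store applied the write and lost the ack, or never applied it at all, which is exactly why ad-hoc retry logic gets messy.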
“The Answer to the Great Question... Of Life, the Universe and Everything... Is... Forty-two,' said Deep Thought, with infinite majesty and calm.”
― Douglas Adams, The Hitchhiker's Guide to the Galaxy
The answer to our problem is a log. A log is perhaps the simplest possible storage abstraction: an append-only, totally-ordered sequence of records ordered by time.
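The abstraction is small enough to sketch in a few lines; here is a minimal in-memory version (an illustration only, not Kafka's implementation):

```python
# A minimal append-only, totally-ordered log.
class Log:
    def __init__(self):
        self._records = []

    def append(self, record):
        self._records.append(record)
        return len(self._records) - 1   # offset of the new record

    def read(self, offset):
        return self._records[offset:]   # all records from `offset` onward

log = Log()
log.append({"op": "set", "key": "x", "value": 1})
log.append({"op": "set", "key": "x", "value": 2})
print(log.read(0))  # records come back in exactly the order they were appended
```

Writers can only append at the tail, and every reader sees the same records in the same order, identified by their offsets.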
Rather than describing how exactly the log solves our problems, let’s look at an already solved problem and see what we can learn from it. This slide shows how database replication happens. Every database maintains a log of all the transactions that happen on the DB, which the follower then uses to reach the same state as the leader DB. One key thing to note here is that even though the leader DB is itself subjected to multiple concurrent writes, when the writes do happen on the DB they follow a particular order, which is recorded in the transaction log of the DB. Hence, the log effectively removes the concurrency from the writes.
The leader DB appends transactions to the transaction log, and the follower then applies them in order. The reason this works despite failures is that the follower maintains its current position in the transaction log. Let’s say the follower DB suffers a failure. Whenever it comes back up, it reads the log position it was at before the failure and resumes log consumption from that position onwards. Hence, disaster recovery on the consumers is fairly straightforward.
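The position-tracking idea can be sketched like this (a toy `Follower` class for illustration; a real follower would persist its position durably):

```python
# Hypothetical sketch: a follower applies log records in order and remembers
# its position, so after a crash it resumes where it left off, not from zero.
class Follower:
    def __init__(self):
        self.state, self.position = {}, 0

    def consume(self, log):
        for record in log[self.position:]:
            self.state[record["key"]] = record["value"]  # apply in log order
            self.position += 1                           # persisted in real life

log = [
    {"key": "x", "value": 1},
    {"key": "x", "value": 2},
    {"key": "y", "value": 3},
]

f = Follower()
f.consume(log[:2])  # follower crashes after consuming two records...
f.consume(log)      # ...and on restart resumes from position 2, not 0
print(f.state, f.position)  # {'x': 2, 'y': 3} 3
```

Because records are applied in log order and each one is applied exactly once past the saved position, the follower converges to the leader's state even across restarts.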
Equipped with the insights we gained from the use of the log in DB replication, we are now ready to propose a solution architecture.
In this proposed solution, we let the webapp write to a log. This log can then be consumed by the various data stores, including the DB itself. Given that the data is written to only one place (the log), we don’t see the race condition issue we saw with dual writes.
Also, as the webapp only has to write to a single data store (the log), we avoid the problem of writing concurrently to multiple data stores that we saw with the dual-write strategy.
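The proposed arrangement can be sketched as follows (store names are hypothetical): the webapp has a single write path, and every store derives its state from the same ordered log.

```python
# Sketch of the proposed solution: the webapp appends every write to one log,
# and each data store builds its own state by consuming that same log.
log = []

def webapp_write(op):
    log.append(op)   # the only write path -- no dual writes, no races

webapp_write({"key": "x", "value": 1})
webapp_write({"key": "x", "value": 2})

# Every consumer replays the same totally-ordered log, so all of them
# converge to the same state regardless of when they start reading.
db, cache = {}, {}
for store in (db, cache):
    for op in log:
        store[op["key"]] = op["value"]

print(db == cache)  # True
```

A brand-new store added later would simply replay the log from offset 0 and end up identical to the others.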
We realize that the log-based queue we require needs to have some basic qualities, as described in this slide. One major requirement is that we should be able to add new data stores that bootstrap by consuming from the log and then consistently maintain the same state as the other data stores by continuously consuming all writes arriving at the log.
The specific quality that makes Kafka stand out from other message queues for this use case is that Kafka stores the data on disk while still providing performance comparable to in-memory message queues.
The ability to keep data on disk allows new consumers to start reading from the oldest message in the log and consume messages in order until they are completely caught up with the change stream. Hence, bootstrapping a new consumer is simple and straightforward. Log compaction is the mechanism Kafka uses to intelligently expire data from the log. More details on the next slide.
Kafka topics consist of partitions, which are just logs. For simplicity, we will talk about a single partition and see how log compaction works for the key-value records that make their way into this partition. Kafka expires all the messages with a certain key except the latest message with that key in the partition, as can be seen on the slide. This allows Kafka to always retain the latest value for every key that has been written to it at least once, instead of expiring messages purely on a time basis.
More details at: http://kafka.apache.org/documentation.html#compaction
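The effect of compaction on a single partition can be simulated in a few lines (illustrative keys and values; this mimics the outcome, not Kafka's actual segment-by-segment cleaner):

```python
# Hypothetical sketch of log compaction on one partition: keep only the
# latest record per key, preserving the relative order of the survivors.
partition = [("k1", "a"), ("k2", "b"), ("k1", "c"), ("k3", "d"), ("k2", "e")]

def compact(records):
    # Last offset at which each key appears.
    latest = {key: i for i, (key, _) in enumerate(records)}
    # Keep a record only if it is the latest occurrence of its key.
    return [(k, v) for i, (k, v) in enumerate(records) if latest[k] == i]

print(compact(partition))  # [('k1', 'c'), ('k3', 'd'), ('k2', 'e')]
```

Note that every key written to the partition still appears after compaction; only its superseded older values are gone, which is exactly what a bootstrapping consumer needs.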
Let’s revisit the proposed solution now. There are still some issues that we observe with this arrangement:
If the webapp needs to perform multiple operations as a single atomic transaction, the responsibility of maintaining atomicity falls on the webapp, as Kafka doesn’t support atomically producing a set of messages.
The system of record in this arrangement is the log rather than the DB, and using a fairly new technology such as Kafka as the system of record may not appeal to people who trust a conventional DB.
The validation checks performed before transactions are written to the log all live in the webapp.
In this revised arrangement, all the writes from the webapp go to the DB, whose transaction log is emitted into Kafka for the other data stores to read. All the other data stores, including the database followers, consume from this stream of changes. This takes care of all the discussed issues with the previously proposed solution:
1) A conventional DB, which provides ACID properties out of the box, can be utilized; this takes care of the atomic-transaction responsibility that fell on the webapp in the previous arrangement
2) The DB is the system of record, which is trusted
3) The DB can enforce constraints on incoming writes
4) The DB can also serve one-off use cases that require “immediate consistency” reads
This approach of streaming the changes from a DB to the downstream consumers is what is referred to as Change Data Capture.
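The whole revised pipeline can be summarized in a short sketch (the CDC process, store names, and changelog format are all simplified stand-ins for a real DB transaction log and Kafka topic):

```python
# Hypothetical end-to-end sketch of Change Data Capture: the webapp writes
# only to the DB; the DB's transaction log becomes the change stream that
# downstream stores consume.
db, changelog = {}, []

def db_write(key, value):
    db[key] = value                  # the DB remains the system of record
    changelog.append((key, value))   # its transaction log feeds the stream

db_write("user:1", "alice")
db_write("user:1", "alicia")

# Downstream consumers (cache, search index, ...) replay the changelog.
cache = {}
for key, value in changelog:
    cache[key] = value

print(cache == db)  # True
```

The webapp never performs a dual write: the DB validates and orders every change, and everything downstream is derived from the DB's change stream.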