introduction-to-apache-kafka
Yifeng Jiang
Introduction to Apache Kafka. Kafka and real-time system.
1.
© Hortonworks Inc. 2011 – 2017. All Rights Reserved
Getting Started! Real-Time Data Processing with Apache Kafka
Yifeng Jiang, Solutions Engineering Lead
September 6, 2017
2.
About Me
• 蒋 逸峰 (しょう いつほう / Yifeng Jiang)
• Solutions Engineering Lead, NAPAC, Hortonworks
  – Hadooper since 2009
  – HBase book author
  – Software engineer, cloud, PaaS, DevOps
• Jogger, hiker
• Twitter: @uprush
3.
Hortonworks Delivers Connected Data Platforms
[Diagram labels: Internet of Anything; Data in Motion (Hortonworks DataFlow, perishable insights); Data at Rest (Hortonworks Data Platform, historical insights); Actionable Intelligence; Modern IoT Data Applications]
4.
Introduction to Apache Kafka
5.
Apache Kafka
• Distributed messaging system
  – Real-time
  – Scalable to handle large data volumes
  – Low latency
  – Fault tolerant
• Originated at LinkedIn
  – Aimed at solving data movement across systems
  – Written in Scala and Java
  – Open source (Apache License 2.0)
  – Adopted at many companies
6.
Key Concepts and Terminology
7.
Kafka: Anatomy of a Topic
[Figure: a topic with three partitions (Partition 0, Partition 1, Partition 2). Each partition is an ordered log of messages numbered from offset 0, running from old to new; writes are appended at the new end.]
• Messages (logs) are stored on the broker's local disk
• Messages are appended to the log file
• Log retention – time and size based
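The append-only log and its retention rules can be sketched in a few lines of Python. This is a toy in-memory model, not Kafka's implementation: the class `PartitionLog`, its method names, and the per-message retention check are assumptions made for illustration (a real broker stores the log in segment files on disk and expires whole segments).

```python
import time


class PartitionLog:
    """Toy append-only log for one partition (hypothetical, in-memory)."""

    def __init__(self, retention_seconds, retention_bytes):
        self.retention_seconds = retention_seconds
        self.retention_bytes = retention_bytes
        self.base_offset = 0   # offset of the oldest message still retained
        self.records = []      # list of (timestamp, payload), oldest first

    def append(self, payload, now=None):
        """Append a message at the new end and return its offset."""
        timestamp = time.time() if now is None else now
        self.records.append((timestamp, payload))
        return self.base_offset + len(self.records) - 1

    def read(self, offset):
        """Return the payload at an offset, if it is still retained."""
        index = offset - self.base_offset
        if index < 0 or index >= len(self.records):
            raise IndexError("offset %d is not in the log" % offset)
        return self.records[index][1]

    def size_bytes(self):
        return sum(len(payload) for _, payload in self.records)

    def enforce_retention(self, now=None):
        """Drop the oldest messages that violate the time or size limit."""
        now = time.time() if now is None else now
        while self.records and now - self.records[0][0] > self.retention_seconds:
            self._drop_oldest()
        while self.records and self.size_bytes() > self.retention_bytes:
            self._drop_oldest()

    def _drop_oldest(self):
        self.records.pop(0)
        self.base_offset += 1
```

Offsets are never reused: after retention removes old messages, `base_offset` moves forward and the remaining messages keep their original offsets.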
8.
Kafka Replication
• A partition has replicas – a leader replica and follower replicas
• Replicas are distributed across multiple brokers
• The leader maintains the set of in-sync replicas (ISR)
https://www.slideshare.net/junrao/kafka-replication-apachecon2013
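As a rough illustration of how a leader might track its in-sync replicas, the sketch below models one partition in memory. The class `PartitionLeader`, the broker names, and the message-count lag threshold `max_lag` are all assumptions for this example; real brokers decide ISR membership differently (for instance with the time-based `replica.lag.time.max.ms` setting).

```python
class PartitionLeader:
    """Toy leader replica that tracks which replicas are in sync (hypothetical)."""

    def __init__(self, leader_id, follower_ids, max_lag):
        self.leader_id = leader_id
        self.log = []
        # Next offset each follower still has to replicate.
        self.next_offset = {follower: 0 for follower in follower_ids}
        self.max_lag = max_lag  # assumed threshold, counted in messages

    def append(self, message):
        """Append a message to the leader's log."""
        self.log.append(message)

    def follower_fetch(self, follower_id, next_offset):
        """Record that a follower has replicated everything below next_offset."""
        self.next_offset[follower_id] = min(next_offset, len(self.log))

    def in_sync_replicas(self):
        """Return the leader plus every follower within max_lag messages of it."""
        log_end = len(self.log)
        followers = [
            follower
            for follower, position in self.next_offset.items()
            if log_end - position <= self.max_lag
        ]
        return sorted([self.leader_id] + followers)
```

A follower that falls too far behind drops out of the ISR and rejoins once it has caught up again.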
9.
Kafka Producer
• Creates a new message and publishes it to a topic and partition
• Original messages are partitioned and then split into batches
• Each batch is sent to the leader broker (and then replicated to the ISR)
• Each send is acknowledged by the leader broker and/or all in-sync replicas
[Figure: an application hands messages m1–m5 to the producer library, whose partitioner assigns them to partitions (p1, p2, p3) and splits them into batches. The topic has 3 partitions and a replication factor of 2: Broker-0 holds P0.R0 (leader) and P1.R0; Broker-1 holds P0.R1 and P2.R1 (leader); Broker-2 holds P1.R2 (leader) and P2.R2.]
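The partition-then-batch step can be sketched as follows. This is a simplified model, not the Kafka client: `choose_partition`, `split_into_batches`, the `batch_size` parameter, and the use of CRC32 as the key hash are assumptions for illustration (the real client's default partitioner uses a different hash). The `LEADERS` table mirrors the broker layout in the figure above.

```python
import zlib
from collections import defaultdict

# Leader broker per partition, as drawn on the slide (3 partitions, 2 replicas each).
LEADERS = {0: "Broker-0", 1: "Broker-2", 2: "Broker-1"}


def choose_partition(key, num_partitions):
    """Map a message key to a partition; CRC32 is a stand-in hash."""
    return zlib.crc32(key) % num_partitions


def split_into_batches(messages, num_partitions, batch_size):
    """Group (key, value) messages by partition, then cut each group into batches."""
    per_partition = defaultdict(list)
    for key, value in messages:
        per_partition[choose_partition(key, num_partitions)].append((key, value))

    batches = []
    for partition in sorted(per_partition):
        records = per_partition[partition]
        for start in range(0, len(records), batch_size):
            batches.append((partition, records[start:start + batch_size]))
    return batches


def route(batches):
    """Pair each batch with the broker that leads its partition."""
    return [(LEADERS[partition], partition, records) for partition, records in batches]
```

Because the partition depends only on the key, all messages with the same key land in the same partition and keep their relative order there.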
10.
Kafka Consumer
• Consumers pull data from brokers
• Consumer apps have to keep track of the topic-partition offsets they have read
• Consumer groups: allow multiple hosts to form a group to access a topic
  – Max parallelism is determined by the number of topic partitions
[Figure: Broker-0 holds P0 and P3; Broker-1 holds P1 and P2. Consumer Group 1 has consumers C1 and C2; Consumer Group 2 has consumers C3, C4, C5, and C6.]
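The limit on parallelism follows from each partition being read by at most one consumer within a group. The sketch below shows this with a simple round-robin assignment; the function name `assign_partitions` and the round-robin strategy are assumptions for illustration, and a real consumer group may use a different assignment strategy.

```python
def assign_partitions(partitions, consumers):
    """Spread partitions over the consumers of one group, round-robin."""
    assignment = {consumer: [] for consumer in consumers}
    for index, partition in enumerate(sorted(partitions)):
        consumer = consumers[index % len(consumers)]
        assignment[consumer].append(partition)
    return assignment


def idle_consumers(assignment):
    """Consumers that received no partition and therefore do no work."""
    return sorted(c for c, assigned in assignment.items() if not assigned)
```

With the four partitions from the figure, a group of two consumers gets two partitions each and a group of four gets one each. A fifth consumer in the same group would stay idle, which is why adding consumers beyond the partition count does not increase throughput.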
11.
Kafka – Why Kafka Is Fast
Fast Writes
• Writes are appends to the file system
• Partitions improve performance and throughput
• Uses the OS buffer cache
• Lots of memory on the machine helps
Fast Reads
• Hot data sits in memory; most of the time data is served without disk I/O
• Efficient transfer from file descriptor to socket descriptor
• Linux sendfile(), JVM transferTo() implementation
Why Performance?
• Disk flushes are delayed
• Durability is guaranteed via replication
• When consumers read the latest data, they read from the page cache
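The file-descriptor-to-socket transfer can be tried from Python, whose `socket.sendfile()` method uses `os.sendfile()` where the platform supports it and falls back to ordinary `send()` calls otherwise. The sketch below is only an analogy for the broker's read path: the function names, the temporary file standing in for a log segment, and the local socket pair standing in for a broker-to-consumer connection are all assumptions.

```python
import os
import socket
import tempfile


def send_log_segment(path, sock):
    """Send a whole file over a connected socket and return the bytes sent."""
    with open(path, "rb") as segment:
        # The kernel can copy file data to the socket without passing it
        # through a user-space buffer when os.sendfile() is available.
        return sock.sendfile(segment)


def demo():
    payload = b"m1m2m3m4m5" * 100
    with tempfile.NamedTemporaryFile(delete=False) as tmp:
        tmp.write(payload)
        path = tmp.name

    broker_side, consumer_side = socket.socketpair()
    try:
        with broker_side, consumer_side:
            sent = send_log_segment(path, broker_side)
            broker_side.shutdown(socket.SHUT_WR)  # signal end of data
            received = b""
            while True:
                chunk = consumer_side.recv(4096)
                if not chunk:
                    break
                received += chunk
    finally:
        os.unlink(path)
    return sent, received == payload
```

The payload is kept small so that it fits in the socket buffer and the single-threaded demo does not block.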
12.
Kafka & Real-time System
13.
Kafka at Scale
http://events.linuxfoundation.jp/sites/events/files/slides/Kafka%20At%20Scale.pdf
14.
15.
Use Case: Connected Car
https://azure.microsoft.com/ja-jp/blog/announcing-public-preview-of-apache-kafka-on-hdinsight-with-azure-managed-disks/
16.
Real-time System Building Blocks
• Streams
  – An unbounded sequence of messages, events, information packets, or tuples (named lists of values)
• Data Pipe
  – Message/information bus
  – Decouples publishers (providers) and consumers (subscribers)
  – Scalability, centralized, distributed
• Stream Processing
  – Semantics (operations and processing primitives)
  – Stateless or with state
• Low Latency Storage
  – NoSQL database
17.
Real-time System Building Blocks
• Streams -- NiFi, Fluentd, etc.
  – An unbounded sequence of messages, events, information packets, or tuples (named lists of values)
• Data Pipe
  – Message/information bus -- Kafka
  – Decouples publishers (providers) and consumers (subscribers)
  – Scalability, centralized, distributed
• Stream Processing -- Storm, Spark Streaming, etc.
  – Semantics (operations and processing primitives)
  – Stateless or with state
• Low Latency Storage -- HBase, Redis, Druid, etc.
  – NoSQL database
18.
Apache Metron: Real-time Big Data Cyber Security Powered by Kafka
19.
Cyber Security Stream Processing Pipeline
[Diagram labels, Apache Metron:]
• Telemetry Data Sources
  – Machine-generated logs (AD, app / web server, firewall, VPN, etc.)
  – Security endpoint devices (FireEye, Palo Alto, BlueCoat, etc.)
  – Network data (PCAP, NetFlow, Bro, etc.)
  – IDS (Suricata, Snort, etc.)
  – Threat intelligence feeds (Soltra, OpenTAXII, third-party feeds)
• Telemetry Data Collectors
  – Performance network ingest probes / other
• Real-time Processing Cyber Security Engine
  – Telemetry Ingest Buffer, Telemetry Parsers, Enrichment, Threat Intelligence, Alert Triage, Indexers and Writer, Profiler
  – Real-time Enrich / Threat Intel Streams
• Data Services and Integration Layer, Modules
  – Data Vault, Real-Time Search, Evidentiary Store, Threat Intelligence Platform, Model as a Service, Community Models, Data Science Workbench, PCAP Forensics
20.
Metron Architecture – Real-time System Built on Kafka
[Figure labels: Parsers; Kafka enrichments topic; Kafka indexing topic]
21.
Metron Architecture – Real-time System Built on Kafka
[Figure labels: Kafka indexing topic; Metron UI and dashboards]
22.
Understood, but that looks difficult…
23.
Hortonworks DataFlow (HDF) 3.0
24.
25.
Introducing Hortonworks Streaming Analytics Manager (SAM)
Streaming Analytics Manager: design, develop, deploy, and manage streaming analytics apps with a drag-and-drop paradigm
26.
Introducing Hortonworks Schema Registry
A shared repository for schemas, allowing applications to save, retrieve, and reuse schemas and flexibly interact with each other
27.
Centralized Security with Apache Ranger
28.
Build Your First Streaming Analytics App in Under 30 Minutes
29.
Trucking company with a large fleet of international trucks
A truck generates millions of events for a given route; an event could be:
• 'Normal' events: starting / stopping of the vehicle
• 'Violation' events: speeding, excessive acceleration and braking, unsafe tail distance
• 'Speed' events: the speed of a driver, which comes in every minute
The company uses an application that monitors truck locations and violations from the truck/driver in real time.
Route? Truck? Driver?
Analysts query a broad history to understand whether today's violations are part of a larger problem with specific routes, trucks, or drivers.
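One analysis such an application might run on the 'Speed' events is an average speed per driver over a batch of recent readings. The sketch below is a hypothetical illustration, not the demo application: the event shape `(driver, speed)`, the function names, and the threshold `SPEED_LIMIT` are all assumptions.

```python
from collections import defaultdict

SPEED_LIMIT = 80  # assumed threshold; the unit is whatever the events report


def average_speeds(speed_events):
    """Average the reported speed per driver over one window of events."""
    totals = defaultdict(lambda: [0, 0])  # driver -> [sum of speeds, count]
    for driver, speed in speed_events:
        totals[driver][0] += speed
        totals[driver][1] += 1
    return {driver: total / count for driver, (total, count) in totals.items()}


def speeding_drivers(speed_events, limit=SPEED_LIMIT):
    """Drivers whose average speed in the window exceeds the limit."""
    averages = average_speeds(speed_events)
    return sorted(driver for driver, average in averages.items() if average > limit)
```

In a streaming setting the same computation would run once per time window on the events consumed from a Kafka topic.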
30.
31.
DEMO
32.
Questions?
33.
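Data Platform Conference Tokyo 2017
Toward the full-scale realization of data-driven business using big data x IoT / cloud / AI (artificial intelligence)
Held on October 10, 2017
Organizer: Impress Corporation
Co-organizer: Hortonworks Japan K.K.
Registration and details: dataplatform.jp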
34.
THANK YOU
Yifeng Jiang @uprush