Copyright(C) 2019 KDDI Research, Inc. All Rights Reserved.
Kafka Streams VS Spark
~How close can Kafka Streams get to Spark?~
KDDI Research, Inc. (KDDI総合研究所)
Connected Network Division
森澤 雄太
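Company and self-introduction
◼ KDDI Research, Inc. (KDDI総合研究所)
⚫ A subsidiary of KDDI
• R&D in line with the parent company's business policy
• "Creating innovation for the 5G era", "Integrating telecommunications and life design", "Leveraging big data", "Further expanding global business", "Expanding financial business", "Growing as a group", "Sustainability"
⚫ Connected Network Division
• Networks, connected cars, autonomous driving, remote driving, IoT, operations automation, and more
◼ About me
⚫ Big data platforms and stream data, remote driving, edge computing, GPUs
⚫ Did a little English proofreading of the Apache Flink documentation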
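Goal: define the architecture of an IoT data integration platform
[Diagram labels: Cars, Security cameras, Sensors; Data; Data platform; Great services]
Service platform that uses IoT data
Service areas: Safety, Infotainment, Agent, Car-life support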
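Goal: define the architecture of an IoT data integration platform
[Diagram labels: Cars, Security cameras, Sensors; Data; Data platform (ETL); Great services (Service logic)]
Every great service is preceded by ETL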
Idea
[Diagram labels: The ETL we developed; New ETL]
Could Kafka Streams replace the ETL?
But will the performance hold up?
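Performance benchmark
[Diagram labels: Device1, Device2, Device3; REST; Proxy; Kafka; Hadoop (information extraction, type conversion, data granularity adjustment, forwarding to the next stage); Kafka; Sink. A region of the diagram is marked "Comparison scope".]
・Three servers (Devices) send binary data over HTTP (120 Mbps and up)
・Kafka-REST (Proxy) converts the data into Kafka messages
・The Hadoop cluster performs information extraction, type conversion, and granularity adjustment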
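The three ETL steps named on this slide (information extraction, type conversion, granularity adjustment) can be sketched roughly as follows. This is only an illustration in plain Python, not the implementation that was benchmarked: the binary record layout, the field names, and the one-second averaging bucket are all assumptions, since the slides do not describe the payload or what "granularity adjustment" does exactly.

```python
import struct
from collections import defaultdict

# Hypothetical wire format (not from the slides): big-endian
# uint16 device id, uint64 timestamp in ms, float32 sensor reading.
RECORD = struct.Struct(">HQf")


def extract(payload: bytes) -> tuple:
    """Information extraction: pull the raw fields out of the binary payload."""
    return RECORD.unpack(payload)


def convert(fields: tuple) -> dict:
    """Type conversion: turn the raw tuple into a typed, named record."""
    device_id, ts_ms, value = fields
    return {"device": f"device{device_id}", "ts_ms": int(ts_ms), "value": float(value)}


def adjust_granularity(records, bucket_ms: int = 1000) -> list:
    """Granularity adjustment: average the readings per device and time bucket."""
    sums = defaultdict(lambda: [0.0, 0])
    for r in records:
        key = (r["device"], r["ts_ms"] // bucket_ms * bucket_ms)
        sums[key][0] += r["value"]
        sums[key][1] += 1
    return [
        {"device": dev, "bucket_start_ms": start, "mean": total / n}
        for (dev, start), (total, n) in sorted(sums.items())
    ]


def etl(payloads, bucket_ms: int = 1000) -> list:
    """Run the three steps; the result is what would be forwarded downstream."""
    return adjust_granularity((convert(extract(p)) for p in payloads), bucket_ms)
```

Calling `etl` on a list of packed payloads returns one averaged record per device and bucket, which stands in for the data forwarded to the next stage.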
Environment
◼ Versions
⚫ Hadoop 2.9.2
⚫ Spark 2.4.3
⚫ Kafka 2.2.0
◼ Allocated resources
⚫ Master memory: 2 GB
⚫ Worker memory: 3 GB
⚫ CPU: 1 core per node
⚫ 5 nodes
◼ Evaluation method
⚫ Processing latency is computed from the Kafka timestamps of the input and output records
◼ Implementation
⚫ Spark
• Structured Streaming
⚫ Kafka
• Scala
• Streams DSL (the Processor API is used only to obtain timestamps)
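The evaluation method above boils down to subtracting each record's input timestamp from its output timestamp. A minimal sketch of that arithmetic, assuming the timestamps have already been collected into dictionaries keyed by a record id (the id is a hypothetical correlation key; the slides do not say how input and output records were matched, nor whether the topics used `CreateTime` or `LogAppendTime`):

```python
from statistics import mean


def processing_latencies_ms(input_ts_ms: dict, output_ts_ms: dict) -> list:
    """Return output timestamp minus input timestamp for each record seen on both topics.

    Both arguments map a record id to a Kafka record timestamp in milliseconds.
    The record id is a hypothetical correlation key, not something the slides specify.
    """
    return [
        output_ts_ms[rid] - input_ts_ms[rid]
        for rid in sorted(input_ts_ms)
        if rid in output_ts_ms
    ]


def summarize(latencies_ms: list) -> dict:
    """Summarize the latencies of one benchmark run."""
    return {
        "count": len(latencies_ms),
        "mean_ms": mean(latencies_ms),
        "max_ms": max(latencies_ms),
    }
```

Records that never reach the output topic are skipped here, so a real measurement would also want to report how many were dropped.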
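Latency comparison
[Figure: latency in ms (y-axis, 0 to 3500) versus number of nodes (1, 3, 5) for kafka and spark]
Kafka Streams has the lower latency
The gap widens as the number of nodes decreases, which suggests that per-node processing performance is also higher than Spark's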
Kafka Streams VS Spark
~How close can Kafka Streams get to Spark?~
It got so close that it overtook Spark!
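Discussion and conclusion
◼ Discussion
⚫ Spark is a distributed platform: it only shows its strength on memory-intensive processing in a large-scale environment
⚫ This ETL is embarrassingly parallel and ran on a small cluster, so Spark's strengths could not be exploited
◼ Conclusion
⚫ Kafka Streams is (possibly) very useful for ETL
• Because its overhead is small
• Fault tolerance and similar properties have not been evaluated
⚫ Simple and convenient
• It covers things that are tedious to build yourself, such as fault tolerance and scalability
Discussion of the implementation and "that can't be right!" objections are welcome!
Let's talk one on one!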
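To illustrate what "embarrassingly parallel" means for an ETL like this one, here is a small sketch in which each partition is processed using only its own records, so the result does not depend on how many workers run. The `transform` function is a hypothetical stand-in for a stateless per-record step; nothing here reflects the actual benchmark code.

```python
from concurrent.futures import ThreadPoolExecutor


def transform(record: int) -> int:
    """Stand-in for a stateless per-record ETL step (hypothetical)."""
    return record * 2


def process_partition(partition: list) -> list:
    """Process one partition using only its own records."""
    return [transform(r) for r in partition]


def process_all(partitions: list, workers: int) -> list:
    """Process every partition independently and return results in partition order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(process_partition, partitions))
```

Because no partition needs data from another, there is no shuffle or shared state to coordinate, which is the situation in which a heavyweight distributed engine has little to add.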