SlideShare a Scribd company logo
1 of 71
IoT Architectures for a Digital Twin
with Apache Kafka and Event Streaming
A Digital Replica of Things - Open, Scalable and Reliable
Kai Waehner
Technology Evangelist
contact@kai-waehner.de
LinkedIn
@KaiWaehner
www.confluent.io
www.kai-waehner.de
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Abstract
IoT Architectures for a Digital Twin with Apache Kafka and Event Streaming
A digital twin is a digital replica of a living or non-living physical entity. This session discusses the benefits and IoT architectures of a Digital Twin in Industrial IoT (IIoT)
and its relation to Apache Kafka and other IoT frameworks. Kafka is often used as central event streaming platform to build a scalable and reliable digital twin for real
time streaming sensor data. A live demo shows a scalable digital twin infrastructure for condition monitoring and predictive maintenance in real time for a connected
car infrastructure leveraging Kafka, MQTT and TensorFlow.
Key Take-Aways:
● Learn about use cases and characteristics of a digital twin in various industries
● Understand how to build a digital twin for every single (of tens of thousands) IoT device or machine
● See different IoT architectures with Kafka and other IoT technologies and products
● Bring added value to your IoT infrastructure by enabling use cases like predictive maintenance
● Understand how the Apache Kafka enables scalable and flexible end-to-end integration processing from IIoT data to various backend applications
● Watch a live demo of an end-to-end integration, real time processing and analytics of thousands of IoT devices
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Key Takeaways
• A Digital Twin merges the physical and the digital world
• Apache Kafka enables an open, scalable and reliable infrastructure for a Digital Twin
• Event Streaming complements IoT platforms and other backend applications / databases.
+
3
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Agenda
• Digital Twin - Merging the Physical and the Digital World
• Real World Challenges
• IoT Platforms
• Apache Kafka as Event Streaming Solution for IoT
• Spoilt for Choice for a Digital Twin
• Global IoT Architectures
• A Digital Twin for 100000 Connected Cars
4
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Agenda
• Digital Twin - Merging the Physical and the Digital World
• Real World Challenges
• IoT Platforms
• Apache Kafka as Event Streaming Solution for IoT
• Spoilt for Choice for a Digital Twin
• Global IoT Architectures
• A Digital Twin for 100000 Connected Cars
5
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Software and Digital Services become the Key Differentiator
6
https://www.mckinsey.com/industries/advanced-electronics/our-insights/iiot-platforms-the-technology-stack-as-value-driver-in-industrial-equipment-and-machinery
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Digital Twin – Merging the Physical and the Digital World
7
• Downtime reduction
• Inventory management
• Fleet management
• What-if simulations
• Operational planning
• Servitization
• Product development
• Healthcare
• Customer experience
“Virtual representation of something else (Physical thing, process, service)”
“A living model that drives a business outcome”
https://www.youtube.com/watch?v=Ri0TD7kYsIQ
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Granularity of Digital Twins
8
https://www.youtube.com/watch?v=cfbKR48nSyQ
Remaining Useful Life
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Digital Thread
9
Digital Twin vs. Digital Thread?
I only use the term Digital Twin in the following slides.
Both terms overlap, often meaning the same.
Span
the
entire
lifecycle
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Virtual Singapore:
A Digital Twin of the (Smart) City
10
Design, Monitor and Manage Cities
• Urban Planning (e.g. Crowd Simulation)
• Collaboration and Decision-Making
• Communication and Visualisation
• Improved Accessibility
• Analysis on Potential for Solar Energy
Production
• …
https://www.nrf.gov.sg/programmes/virtual-singapore
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Smart Infrastructure:
Digital Solutions for Entire Building Lifecycle
11
https://new.siemens.com/global/en/products/buildings/digitalization/digital-building-lifecycle.html
• Safer, more secure, more efficient and resilient buildings
• Continuously interaction, learning and adaption to create environments that care
• Follow the entire digital building lifecycle
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Connected Car Infrastructure
12
https://www.youtube.com/watch?v=yGLKi3TMJv8
• Real Time Data Analysis
• Swarm Intelligence
• Collaboration with Partners
• Predictive AI
• …
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Twinning the Human Body to Enhance Medical Care
13
• Monitoring and evaluation without being in close proximity
• Testing the impact of changes on the performance of a system
• Smart machines are more advanced than humans
• Determine what actions to take
• Modelling an individual’s genomic makeup, physiological characteristics, and lifestyle to create personalized medicine
• Capturing the human body will have multiple benefits for doctors such as discovering undeveloped illnesses,
experimenting with treatments, and improving preparation for surgeries
https://www.challenge.org/insights/digital-twin-in-healthcare/
https://youtu.be/H6JzPCbyVSM
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Market
2020
14
https://www.youtube.com/watch?v=w4TPeVO3dwo (Q1, 2020 Webinar IoT Analytics Research)
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Digital Twin and Artificial Intelligence (AI) / Machine Learning
• Complementary Concepts
• Continuous Learning, Monitoring and Acting
• (Good) data is key for success
15
https://towardsdatascience.com/understanding-feature-engineering-part-1-continuous-numeric-data-da4e47099a7b
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Digital Twin Applied…
16
https://www.youtube.com/watch?v=cfbKR48nSyQ
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Digital Twin Applied…
17
https://www.youtube.com/watch?v=cfbKR48nSyQ
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Digital Twin Applied…
18
https://www.youtube.com/watch?v=cfbKR48nSyQ
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Digital Twin Applied…
19
https://www.youtube.com/watch?v=cfbKR48nSyQ
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Agenda
• Digital Twin - Merging the Physical and the Digital World
• Real World Challenges
• IoT Platforms
• Apache Kafka as Event Streaming Solution for IoT
• Spoilt for Choice for a Digital Twin
• Global IoT Architectures
• A Digital Twin for 100000 Connected Cars
20
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
History of Automation Industry vs. Big Data and Cloud
https://foss-backstage.de/sites/foss-backstage.de/files/2018-07/Revolutionizing%20Industrial%20IoT%20with%20Apache%20PLC4X.pdf
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Challenges in Automation Industry
IoT != IIoT != Buildings != Healthcare …
• IoT = Connected cars, smart home, … à Large scale, secure, scalable, open,
modern technologies
• IIoT / Buildings = Not connected at all or slow, insecure, not scalable,
proprietary
• Healthcare = Often not connected at all yet (huge security requirements)
Legacy / Proprietary IIoT Technologies
• Usually incompatible protocols, typically proprietary
• Usually serial connections (very low latency, nanoseconds) - with TCP /
UDP wrapper around it to integrate with “external world”
• Siemens S7, Modbus, Beckhoff, Profinet, Allen Bradley, etc.
• OPC-UA (required machine update + license cost)
Product Lifecycles
• Long lifecycle (tens of years)
• Factories and buildings cost millions, no simple changes / upgrades
• Still using Windows 7 without Service Packs => Usability and security issues
• Mantra: “Stay with your well-known vendor forever”
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Challenges in Automation Industry
Monoliths
• No scalability
• No extendibility
• No real failover (start your backup machine)
Missing Security Capabilities
• Security in software development ==
Authentication, Authorization, Antivirus, SSL,
SASL, Kerberos
• Security in automation industry == Safety
• “if you press the red button, the machine stops
immediately”
• Insecure by nature => No Authentication /
Authorization / Encryption
• Mantra: “Our factory building and network is
secure, no access from outside”
• Contradicts with “move to cloud and big data
analytics”
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Trends: Evolution of Convergence between IT and Industrial Automation
https://iot-analytics.com/5-industrial-connectivity-trends-driving-the-it-ot-convergence
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Complexity, Cost and Scalability are Main Blockers
25
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Huge demand to build an open, flexible, scalable platform
• Real time
• Scalability
• High availability
• Decoupling
• Cost reduction
• Flexibility
• Standards-based
• Extendibility
• Security
• Infrastructure-independent
• Multi-region / global
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Agenda
• Digital Twin - Merging the Physical and the Digital World
• Real World Challenges
• IoT Platforms
• Apache Kafka as Event Streaming Solution for IoT
• Spoilt for Choice for a Digital Twin
• Global IoT Architectures
• A Digital Twin for 100000 Connected Cars
27
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
600+ IoT Platforms
28
https://iot-analytics.com/iot-platform-companies-landscape-2020/
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Proprietary IoT Platforms
• Sophisticated integration for related IIoT protocols (like Siemens S7, Modbus, etc.) and standards
(like OPC-UA)
• Not a single product (plenty of acquisitions, OEMs and different code bases are typically the
foundation)
• Typically very expensive
• Proprietary (just open interfaces)
• Limited scalability
29
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
IoT Offerings from Cloud Providers
• Sophisticated tools for IoT management (devices, shadowing, …)
• Good integration with other cloud services (storage, analytics, …)
• Vendor lock-in
• No focus on hybrid and edge (but on prem products)
• Limited scalability
• Often high cost (beyond ’hello world’)
30
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Standards-based / Open Source IoT Platforms
• Open and standards-based (e.g. MQTT)
• Open source / open core business model
• Infrastructure-independent
• Different vendors behind the core
technologies
• Sometimes less mature or non-existent
connectivity (especially to legacy and
proprietary protocols)
• Trade-off: Solid offering for one standard
(e.g. HiveMQ for MQTT) or diversity but
not for mission-critical scale (e.g. Node-
RED)
31
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Agenda
• Digital Twin - Merging the Physical and the Digital World
• Real World Challenges
• IoT Platforms
• Apache Kafka as Event Streaming Solution for IoT
• Spoilt for Choice for a Digital Twin
• Global IoT Architectures
• A Digital Twin for 100000 Connected Cars
32
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
The Log ConnectorsConnectors
Producer Consumer
Streaming Engine
Apache Kafka - The Rise of an Event Streaming Platform
33
=
Messaging
+
Storage
+
Integration
+
Processing
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Apache Kafka at Scale at Tech Giants
> 7 trillion messages / day > 6 Petabytes / day
“You name it”
* Kafka is not just used for big data
** Kafka Is not just used by tech giants
34
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Improve
Customer
Experience
(CX)
Increase
Revenue
(make money)
Business
Value
Decrease
Costs
(save money)
Core Business
Platform
Increase
Operational
Efficiency
Migrate to
Cloud
Mitigate Risk
(protect money)
Key Drivers
Strategic Objectives
(sample)
Fraud
Detection
IoT sensor
ingestion
Digital
replatforming/
Mainframe Offload
Connected Car: Navigation & improved in-
car experience: Audi
Customer 360
Simplifying Omni-channel Retail at Scale:
Target
Faster transactional
processing / analysis
incl. Machine Learning / AI
Mainframe Offload: RBC
Microservices
Architecture
Online Fraud Detection
Online Security
(syslog, log aggregation,
Splunk replacement)
Middleware
replacement
Regulatory
Digital
Transformation
Application Modernization: Multiple
Examples
Website / Core
Operations
(Central Nervous System)
The [Silicon Valley] Digital Natives;
LinkedIn, Netflix, Uber, Yelp...
Predictive Maintenance: Audi
Streaming Platform in a regulated
environment (e.g. Electronic Medical
Records): Celmatix
Real-time app
updates
Real Time Streaming Platform for
Communications and Beyond: Capital One
Developer Velocity - Building Stateful
Financial Applications with Kafka Streams:
Funding Circle
Detect Fraud & Prevent Fraud in Real Time:
PayPal
Kafka as a Service - A Tale of Security and
Multi-Tenancy: Apple
Example Use Cases
$↑
$↓
$↔
Example Case Studies
(of many)
Confluent - Business Value per Use Case
35
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
P
Decoupling of Producers and Consumers
Time
C2 C3C1
36
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Schema Enforcement (Client and Server Side)
37
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Kafka Connect
Kafka Cluster
MQTT
Integration
Domain-Driven Design (DDD) for your Event Steaming Platform
OPC-UA
Integration
Real Time
Predictions
IoT Platform
Connector
Java / Python /
”you-name-it”
Schema
Registry
Event Streaming Platform
IoT Domain Legacy Domain Analytics Domain
è Independent and loosely coupled, but scalable, highly available and reliable!
38
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
10 Reasons for Event Streaming with Apache Kafka
Real Time
Scalable
Cost Reduction
24/7 – Zero downtime, zero data loss
Decoupling – Storage, Domain-driven Design
Data (re-)processing and stateful client applications
Integration – Connectivity to IoT, legacy, big data, everything
Hybrid Architecture – On Premises, multi cloud, edge computing
Fully managed cloud
No vendor locking
39
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Digital Twin and AI / Machine Learning (with Kafka)
• Complementary Concepts
• Continuous Learning, Monitoring and Acting à Real time, scalable
• (Good) data is key for success à Integration, data processing
40
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Apache Kafka and Machine Learning – More Details
41
Blog: How to Build and Deploy Scalable Machine Learning in Production with Apache Kafka:
https://www.confluent.io/blog/build-deploy-scalable-machine-learning-production-apache-kafka/
Blog: Using Apache Kafka to Drive Cutting-Edge Machine Learning:
https://www.confluent.io/blog/using-apache-kafka-drive-cutting-edge-machine-learning/
Blog: Machine Learning and Real-Time Analytics in Apache Kafka Applications:
https://www.confluent.io/blog/machine-learning-real-time-analytics-models-in-kafka-applications/
Blog: Streaming Machine Learning with Tiered Storage and Without a Data Lake:
https://www.confluent.io/blog/streaming-machine-learning-with-tiered-storage/
Slides + Video: Event-driven Model Scoring vs. RPC with Model Server:
https://www.confluent.io/kafka-summit-san-francisco-2019/event-driven-model-serving-stream-processing-vs-rpc-with-kafka-
and-tensorflow/
Few examples for real time scoring with Kafka Steams and TensorFlow, H2O, etc.:
https://github.com/kaiwaehner/kafka-streams-machine-learning-examples
Demo: Anomaly Detection with KSQL:
https://github.com/kaiwaehner/ksql-udf-deep-learning-mqtt-iot
Demo + Video: Connected Car demo with Kafka + Streaming ML
https://github.com/kaiwaehner/hivemq-mqtt-tensorflow-kafka-realtime-iot-machine-learning-training-inference
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Hold on…
Kafka is NOT
an IoT Platform!
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Device management
Unreliable networks
Connectivity beyond standards
Edge hardware
…
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
(De facto) Standards for Processing IoT Data
A Match Made In Heaven
+ =
IoT
Platform
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Agenda
• Digital Twin - Merging the Physical and the Digital World
• Real World Challenges
• IoT Platforms
• Apache Kafka as Event Streaming Solution for IoT
• Spoilt for Choice for a Digital Twin
• Global IoT Architectures
• A Digital Twin for 100000 Connected Cars
45
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Characteristics of Digital Twin Technology
• Connectivity
• Physical assets, enterprise software, customers
• Bidirectional communication to ingest, command and control
• Homogenization
• Decoupling and standardization
• Virtualization of information
• Shared with multiple agents, unconstrained by physical location or time
• Lower cost and easier testing, development and predictions
• Reprogrammable and smart
• Adjust and improve characteristics and develop new version of a product
• Digital traces
• Go back in time and analyse historical events to diagnose problems
• Modularity
• Design and customization of products and production modules
• Tweak modules of models and machines
46
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Digital Twin Mapped to Apache Kafka
• Connectivity – Kafka Connect provides connectivity as scale in real time to IoT interfaces, big data
solutions and cloud services. The Kafka ecosystem is complementary, NOT competitive to other
Middleware and IoT Platforms.
• Homogenization – Real decoupling between clients (i.e. producers and consumers) is one of the
key strengths of Kafka. Schema management and enforcement leveraging different technologies
(JSON Schema, Avro, Profobuf, etc.) enables data awareness and standardization.
• Reprogrammable and smart – Kafka is the de facto standard for microservices for exactly this
reason: Separation of concerns and domain-driven design (DDD). Deploy new decoupled
applications and versions, do A/B testing, canarying.
• Digital traces – Kakfa is a distributed commit log. Events are appended, stored as long as you
want (potentially forever with rentention time = -1) and immutable. Seriously, what other
technology could be used better to build a digital trace for a digital twin?
• Modularity – The Kafka infrastructure itself is modular and scalable. This includes components like
Kafka brokers, Connect, Schema Registry, REST Proxy and client applications in different
languages like Java, Scala, Python, Go, .NET, C++ and others. With this modularity, you can easily
build the right Digital Twin architecture your your edge, hybrid or global scenarios and also
combine the Kafka components with any other IoT solutions.
47
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Simplified Pipeline for a Digital Twin
48
Siemens S7, Modbus, Allen Bradley, Beckhoff ADS
IoT
Platform
Digital
Twin
Real
Time
App
Batch
App
Request
Response
App
Connectivity
Homogenization
Reprogrammable and smart
Digital traces
Modularity
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Scenario 1: Digital Twin Monolith
49
Siemens S7, Modbus, Allen Bradley, Beckhoff ADS
IoT
Platform
Digital
Twin
Device Mgt.
Analytics
Connectivity
Homogenization
Reprogrammable and smart
Digital traces
Modularity
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Scenario 2: Digital Twin as External Database
50
Siemens S7, Modbus, Allen Bradley, Beckhoff ADS
IoT
Platform
Digital
Twin
Device Mgt.
Database
XYZAnalytics
Connectivity
Homogenization
Reprogrammable and smart
Digital traces
Modularity
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Apache
Kafka
Scenario 3: Kafka as Backbone for the
Digital Twin and the Rest of the Enterprise
51
Siemens S7, Modbus, Allen Bradley, Beckhoff ADS
IoT
Platform
Digital
Twin
Database
XYZ
Real
Time
App
Batch
App
Request
Response
App
Kafka
Connect
Connectivity
Homogenization
Reprogrammable and smart
Digital traces
Modularity
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Apache Kafka
Scenario 4: Kafka as IoT Platform
52
Siemens S7, Modbus, Allen Bradley, Beckhoff ADS
IoT Cloud
Platform
Digital
Twin
Real
Time
App
Batch
App
Request
Response
App
Kafka Connect
Connectivity
Homogenization
Reprogrammable and smart
Digital traces
Modularity
Storage Processing
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Apache Kafka
Scenario 5: Kafka as Digital Twin
53
Siemens S7, Modbus, Allen Bradley, Beckhoff ADS
Digital Twin
Real
Time
App
Batch
App
Request
Response
App
Kafka Connect
Storage Processing
Connectivity
Homogenization
Reprogrammable and smart
Digital traces
Modularity
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Kafka as
Database?
Seriously?
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
P
Kafka’s Storage – A Distributed Commit Log
Time
C2 C3C1
55
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Tiered Storage for Kafka
Object Store
Processing Storage
Transactions,
auth, quota
enforcement,
compaction, ...
Local
Remote
Kafka
Apps
(Only available in Confluent Platform)
www.kai-waehner.de | @KaiWaehner
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Distributed System with Replication and High Availability
on Server and Client Side
read,
write
Kafka Client Kafka Server Side
Materialized View
in the Client App
(In-memory, RocksDB)
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Stateful Kafka Client Applications
58
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Kafka as Data Storage?
59
https://www.kai-waehner.de/blog/2020/03/12/can-apache-kafka-replace-database-acid-storage-transactions-sql-nosql-data-lake/
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Agenda
• Digital Twin - Merging the Physical and the Digital World
• Real World Challenges
• IoT Platforms
• Apache Kafka as Event Streaming Solution for IoT
• Spoilt for Choice for a Digital Twin
• Global IoT Architectures
• A Digital Twin for 100000 Connected Cars
60
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
No matter which Digital Twin
Architecture I use…
Most Architectures are Hybrid
(Edge, Data Center, Cloud)
and sometimes even Global!
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Edge Digital Twin
Single
Kafka Broker
(or Cluster)
Digital Twin
Self-managed or
certified OEM Hardware
Kafka
Cluster
in DC /
Cloud
Replicator
Siemens S7, Modbus, Allen Bradley, Beckhoff ADS
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Centralized Digital Twin
Digital Twin
Single
Kafka Broker
(or Cluster)
Self-managed or
certified OEM Hardware
Siemens S7, Modbus, Allen Bradley, Beckhoff ADS
Single
Kafka Broker
(or Cluster)
Self-managed or
certified OEM Hardware
Siemens S7, Modbus, Allen Bradley, Beckhoff ADS
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Global Digital Twin Architecture
Multiple Clusters and Aggregation
Factories à Analytics Cluster
Multi-Region Cluster
High Availability (Disaster Recovery)
Global Data Streaming
Outsourced
Development
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Example of a Multi-Region Digital Twin Deployment
Order
Sensor
Order
Sensor
Logs Logs
synchronous
asynchronous
● Automate Disaster
Recovery
● Sync or Async Replication
per Topic
● Offset Preserving
● Automated Client Failover
with No Custom Code
Zero downtime, zero data loss
(even in cases of data center outage)
US West US East
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Architecture patterns for distributed, hybrid,
edge and global Apache Kafka deployments
www.kai-waehner.de | @KaiWaehner
https://www.kai-waehner.de/blog/2020/01/29/deployment-patterns-distributed-hybrid-edge-global-multi-data-center-kafka-architecture/
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Agenda
• Digital Twin - Merging the Physical and the Digital World
• Real World Challenges
• IoT Platforms
• Apache Kafka as Event Streaming Solution for IoT
• Spoilt for Choice for a Digital Twin
• Global IoT Architectures
• A Digital Twin for 100000 Connected Cars
67
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
A Digital Twin with Kafka and TensorFlow
68
MQTT
Proxy
Elastic Grafana
Kafka
Cluster
Kafka
Connect
Car Sensors
Kafka Ecosystem
TensorFlow
Other Components
Kafka
Streams
(Java)
All
Data
Critical
Data
Ingest
Data
Potential
Detect
KSQL
TensorFlow
Train
Analytic
Model
Consume
Data
Preprocess
Data
Analytic
Model
Deploy
Analytic
Model
Python
https://github.com/kaiwaehner/hivemq-mqtt-tensorflow-kafka-realtime-iot-machine-learning-training-inference
Connectivity
Homogenization
Reprogrammable and smart
Digital traces
Modularity
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Architecture for 100000 Connected Cars
Kafka + KSQL + MQTT + TensorFlow + Kubernetes
69
https://www.kai-waehner.de/blog/2019/11/08/live-demo-iot-100-000-connected-cars-kubernetes-kafka-mqtt-tensorflow/
IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de
Key Takeaways
• A Digital Twin merges the physical and the digital world
• Apache Kafka enables an open, scalable and reliable infrastructure for a Digital Twin
• Event Streaming complements IoT platforms and other backend applications / databases.
+
70
Kai Waehner
Technology Evangelist
contact@kai-waehner.de
@KaiWaehner
www.kai-waehner.de
www.confluent.io
LinkedIn
Questions? Feedback?
Let’s connect!

More Related Content

What's hot

Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshJeffrey T. Pollock
 
The Heart of the Data Mesh Beats in Real-Time with Apache Kafka
The Heart of the Data Mesh Beats in Real-Time with Apache KafkaThe Heart of the Data Mesh Beats in Real-Time with Apache Kafka
The Heart of the Data Mesh Beats in Real-Time with Apache KafkaKai Wähner
 
The Modern Data Team for the Modern Data Stack: dbt and the Role of the Analy...
The Modern Data Team for the Modern Data Stack: dbt and the Role of the Analy...The Modern Data Team for the Modern Data Stack: dbt and the Role of the Analy...
The Modern Data Team for the Modern Data Stack: dbt and the Role of the Analy...Databricks
 
Databricks Fundamentals
Databricks FundamentalsDatabricks Fundamentals
Databricks FundamentalsDalibor Wijas
 
Evolution from EDA to Data Mesh: Data in Motion
Evolution from EDA to Data Mesh: Data in MotionEvolution from EDA to Data Mesh: Data in Motion
Evolution from EDA to Data Mesh: Data in Motionconfluent
 
Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)James Serra
 
Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019)
Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019)Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019)
Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019)Cathrine Wilhelmsen
 
MLOps Virtual Event: Automating ML at Scale
MLOps Virtual Event: Automating ML at ScaleMLOps Virtual Event: Automating ML at Scale
MLOps Virtual Event: Automating ML at ScaleDatabricks
 
Microsoft Power BI Technical Overview
Microsoft Power BI Technical OverviewMicrosoft Power BI Technical Overview
Microsoft Power BI Technical OverviewDavid J Rosenthal
 
Metaverse and Digital Twins on Enterprise-Public.pdf
Metaverse and Digital Twins on Enterprise-Public.pdfMetaverse and Digital Twins on Enterprise-Public.pdf
Metaverse and Digital Twins on Enterprise-Public.pdf湯米吳 Tommy Wu
 
Modern Data architecture Design
Modern Data architecture DesignModern Data architecture Design
Modern Data architecture DesignKujambu Murugesan
 
Building Lakehouses on Delta Lake with SQL Analytics Primer
Building Lakehouses on Delta Lake with SQL Analytics PrimerBuilding Lakehouses on Delta Lake with SQL Analytics Primer
Building Lakehouses on Delta Lake with SQL Analytics PrimerDatabricks
 
Data Mesh using Microsoft Fabric
Data Mesh using Microsoft FabricData Mesh using Microsoft Fabric
Data Mesh using Microsoft FabricNathan Bijnens
 
How to build an ETL pipeline with Apache Beam on Google Cloud Dataflow
How to build an ETL pipeline with Apache Beam on Google Cloud DataflowHow to build an ETL pipeline with Apache Beam on Google Cloud Dataflow
How to build an ETL pipeline with Apache Beam on Google Cloud DataflowLucas Arruda
 
Differentiate Big Data vs Data Warehouse use cases for a cloud solution
Differentiate Big Data vs Data Warehouse use cases for a cloud solutionDifferentiate Big Data vs Data Warehouse use cases for a cloud solution
Differentiate Big Data vs Data Warehouse use cases for a cloud solutionJames Serra
 
Industry 4.0 and Internet of Things (IoT)- The Emerging Marketing Trends
Industry 4.0 and Internet of Things (IoT)- The Emerging Marketing TrendsIndustry 4.0 and Internet of Things (IoT)- The Emerging Marketing Trends
Industry 4.0 and Internet of Things (IoT)- The Emerging Marketing TrendsSuyati Technologies
 

What's hot (20)

Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
 
The Heart of the Data Mesh Beats in Real-Time with Apache Kafka
The Heart of the Data Mesh Beats in Real-Time with Apache KafkaThe Heart of the Data Mesh Beats in Real-Time with Apache Kafka
The Heart of the Data Mesh Beats in Real-Time with Apache Kafka
 
Webinar Data Mesh - Part 3
Webinar Data Mesh - Part 3Webinar Data Mesh - Part 3
Webinar Data Mesh - Part 3
 
The Modern Data Team for the Modern Data Stack: dbt and the Role of the Analy...
The Modern Data Team for the Modern Data Stack: dbt and the Role of the Analy...The Modern Data Team for the Modern Data Stack: dbt and the Role of the Analy...
The Modern Data Team for the Modern Data Stack: dbt and the Role of the Analy...
 
Databricks Fundamentals
Databricks FundamentalsDatabricks Fundamentals
Databricks Fundamentals
 
Evolution from EDA to Data Mesh: Data in Motion
Evolution from EDA to Data Mesh: Data in MotionEvolution from EDA to Data Mesh: Data in Motion
Evolution from EDA to Data Mesh: Data in Motion
 
Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)
 
Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019)
Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019)Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019)
Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019)
 
MLOps for production-level machine learning
MLOps for production-level machine learningMLOps for production-level machine learning
MLOps for production-level machine learning
 
MLOps Virtual Event: Automating ML at Scale
MLOps Virtual Event: Automating ML at ScaleMLOps Virtual Event: Automating ML at Scale
MLOps Virtual Event: Automating ML at Scale
 
Digital Twin: Starting the journey
Digital Twin: Starting the journeyDigital Twin: Starting the journey
Digital Twin: Starting the journey
 
Microsoft Power BI Technical Overview
Microsoft Power BI Technical OverviewMicrosoft Power BI Technical Overview
Microsoft Power BI Technical Overview
 
Metaverse and Digital Twins on Enterprise-Public.pdf
Metaverse and Digital Twins on Enterprise-Public.pdfMetaverse and Digital Twins on Enterprise-Public.pdf
Metaverse and Digital Twins on Enterprise-Public.pdf
 
Machine Learning Operations & Azure
Machine Learning Operations & AzureMachine Learning Operations & Azure
Machine Learning Operations & Azure
 
Modern Data architecture Design
Modern Data architecture DesignModern Data architecture Design
Modern Data architecture Design
 
Building Lakehouses on Delta Lake with SQL Analytics Primer
Building Lakehouses on Delta Lake with SQL Analytics PrimerBuilding Lakehouses on Delta Lake with SQL Analytics Primer
Building Lakehouses on Delta Lake with SQL Analytics Primer
 
Data Mesh using Microsoft Fabric
Data Mesh using Microsoft FabricData Mesh using Microsoft Fabric
Data Mesh using Microsoft Fabric
 
How to build an ETL pipeline with Apache Beam on Google Cloud Dataflow
How to build an ETL pipeline with Apache Beam on Google Cloud DataflowHow to build an ETL pipeline with Apache Beam on Google Cloud Dataflow
How to build an ETL pipeline with Apache Beam on Google Cloud Dataflow
 
Differentiate Big Data vs Data Warehouse use cases for a cloud solution
Differentiate Big Data vs Data Warehouse use cases for a cloud solutionDifferentiate Big Data vs Data Warehouse use cases for a cloud solution
Differentiate Big Data vs Data Warehouse use cases for a cloud solution
 
Industry 4.0 and Internet of Things (IoT)- The Emerging Marketing Trends
Industry 4.0 and Internet of Things (IoT)- The Emerging Marketing TrendsIndustry 4.0 and Internet of Things (IoT)- The Emerging Marketing Trends
Industry 4.0 and Internet of Things (IoT)- The Emerging Marketing Trends
 

Similar to IoT Architectures for a Digital Twin with Apache Kafka, IoT Platforms and Machine Learning

Kai Waehner [Confluent] | Real-Time Streaming Analytics with 100,000 Cars Usi...
Kai Waehner [Confluent] | Real-Time Streaming Analytics with 100,000 Cars Usi...Kai Waehner [Confluent] | Real-Time Streaming Analytics with 100,000 Cars Usi...
Kai Waehner [Confluent] | Real-Time Streaming Analytics with 100,000 Cars Usi...InfluxData
 
IoT and Event Streaming at Scale with Apache Kafka
IoT and Event Streaming at Scale with Apache KafkaIoT and Event Streaming at Scale with Apache Kafka
IoT and Event Streaming at Scale with Apache Kafkaconfluent
 
IoT Architectures for Apache Kafka and Event Streaming - Industry 4.0, Digita...
IoT Architectures for Apache Kafka and Event Streaming - Industry 4.0, Digita...IoT Architectures for Apache Kafka and Event Streaming - Industry 4.0, Digita...
IoT Architectures for Apache Kafka and Event Streaming - Industry 4.0, Digita...Kai Wähner
 
Apache Kafka for Automotive Industry, Mobility Services & Smart City
Apache Kafka for Automotive Industry, Mobility Services & Smart CityApache Kafka for Automotive Industry, Mobility Services & Smart City
Apache Kafka for Automotive Industry, Mobility Services & Smart CityKai Wähner
 
Apache Kafka® and Analytics in a Connected IoT World
Apache Kafka® and Analytics in a Connected IoT WorldApache Kafka® and Analytics in a Connected IoT World
Apache Kafka® and Analytics in a Connected IoT Worldconfluent
 
Set Your Data In Motion - CTO Roundtable
Set Your Data In Motion - CTO RoundtableSet Your Data In Motion - CTO Roundtable
Set Your Data In Motion - CTO Roundtableconfluent
 
Event Streaming CTO Roundtable for Cloud-native Kafka Architectures
Event Streaming CTO Roundtable for Cloud-native Kafka ArchitecturesEvent Streaming CTO Roundtable for Cloud-native Kafka Architectures
Event Streaming CTO Roundtable for Cloud-native Kafka ArchitecturesKai Wähner
 
Oracle Digital Business Transformation and Internet of Things by Ermin Prašović
Oracle Digital Business Transformation and Internet of Things by Ermin PrašovićOracle Digital Business Transformation and Internet of Things by Ermin Prašović
Oracle Digital Business Transformation and Internet of Things by Ermin PrašovićBosnia Agile
 
Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...
Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...
Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...Kai Wähner
 
Enabling Smarter Cities and Connected Vehicles with an Event Streaming Platfo...
Enabling Smarter Cities and Connected Vehicles with an Event Streaming Platfo...Enabling Smarter Cities and Connected Vehicles with an Event Streaming Platfo...
Enabling Smarter Cities and Connected Vehicles with an Event Streaming Platfo...Kai Wähner
 
FI Workshop Sesión Inaugural TID Chile
FI Workshop Sesión Inaugural TID ChileFI Workshop Sesión Inaugural TID Chile
FI Workshop Sesión Inaugural TID ChileTIDChile
 
FIWARE Tech Summit - FIWARE Overview and Description of GEs
FIWARE Tech Summit - FIWARE Overview and Description of GEsFIWARE Tech Summit - FIWARE Overview and Description of GEs
FIWARE Tech Summit - FIWARE Overview and Description of GEsFIWARE
 
Iot presentation JustPeople-ago2015
Iot presentation JustPeople-ago2015Iot presentation JustPeople-ago2015
Iot presentation JustPeople-ago2015TIDChile
 
A Real-Time Version of the Truth
 A Real-Time Version of the Truth A Real-Time Version of the Truth
A Real-Time Version of the TruthEric Kavanagh
 
Fast Data – Fast Cars: Wie Apache Kafka die Datenwelt revolutioniert
Fast Data – Fast Cars: Wie Apache Kafka die Datenwelt revolutioniertFast Data – Fast Cars: Wie Apache Kafka die Datenwelt revolutioniert
Fast Data – Fast Cars: Wie Apache Kafka die Datenwelt revolutioniertconfluent
 
Apache Kafka for Smart Grid, Utilities and Energy Production
Apache Kafka for Smart Grid, Utilities and Energy ProductionApache Kafka for Smart Grid, Utilities and Energy Production
Apache Kafka for Smart Grid, Utilities and Energy ProductionKai Wähner
 
Viele Autos, noch mehr Daten: IoT-Daten-Streaming mit MQTT & Kafka (Kai Waehn...
Viele Autos, noch mehr Daten: IoT-Daten-Streaming mit MQTT & Kafka (Kai Waehn...Viele Autos, noch mehr Daten: IoT-Daten-Streaming mit MQTT & Kafka (Kai Waehn...
Viele Autos, noch mehr Daten: IoT-Daten-Streaming mit MQTT & Kafka (Kai Waehn...confluent
 
Best Practices for Streaming IoT Data with MQTT and Apache Kafka
Best Practices for Streaming IoT Data with MQTT and Apache KafkaBest Practices for Streaming IoT Data with MQTT and Apache Kafka
Best Practices for Streaming IoT Data with MQTT and Apache KafkaKai Wähner
 
Internet of Things (IoT) - in the cloud or rather on-premises?
Internet of Things (IoT) - in the cloud or rather on-premises?Internet of Things (IoT) - in the cloud or rather on-premises?
Internet of Things (IoT) - in the cloud or rather on-premises?Guido Schmutz
 

Similar to IoT Architectures for a Digital Twin with Apache Kafka, IoT Platforms and Machine Learning (20)

Kai Waehner [Confluent] | Real-Time Streaming Analytics with 100,000 Cars Usi...
Kai Waehner [Confluent] | Real-Time Streaming Analytics with 100,000 Cars Usi...Kai Waehner [Confluent] | Real-Time Streaming Analytics with 100,000 Cars Usi...
Kai Waehner [Confluent] | Real-Time Streaming Analytics with 100,000 Cars Usi...
 
IoT and Event Streaming at Scale with Apache Kafka
IoT and Event Streaming at Scale with Apache KafkaIoT and Event Streaming at Scale with Apache Kafka
IoT and Event Streaming at Scale with Apache Kafka
 
IoT Architectures for Apache Kafka and Event Streaming - Industry 4.0, Digita...
IoT Architectures for Apache Kafka and Event Streaming - Industry 4.0, Digita...IoT Architectures for Apache Kafka and Event Streaming - Industry 4.0, Digita...
IoT Architectures for Apache Kafka and Event Streaming - Industry 4.0, Digita...
 
Apache Kafka for Automotive Industry, Mobility Services & Smart City
Apache Kafka for Automotive Industry, Mobility Services & Smart CityApache Kafka for Automotive Industry, Mobility Services & Smart City
Apache Kafka for Automotive Industry, Mobility Services & Smart City
 
Apache Kafka® and Analytics in a Connected IoT World
Apache Kafka® and Analytics in a Connected IoT WorldApache Kafka® and Analytics in a Connected IoT World
Apache Kafka® and Analytics in a Connected IoT World
 
Set Your Data In Motion - CTO Roundtable
Set Your Data In Motion - CTO RoundtableSet Your Data In Motion - CTO Roundtable
Set Your Data In Motion - CTO Roundtable
 
Event Streaming CTO Roundtable for Cloud-native Kafka Architectures
Event Streaming CTO Roundtable for Cloud-native Kafka ArchitecturesEvent Streaming CTO Roundtable for Cloud-native Kafka Architectures
Event Streaming CTO Roundtable for Cloud-native Kafka Architectures
 
Oracle Digital Business Transformation and Internet of Things by Ermin Prašović
Oracle Digital Business Transformation and Internet of Things by Ermin PrašovićOracle Digital Business Transformation and Internet of Things by Ermin Prašović
Oracle Digital Business Transformation and Internet of Things by Ermin Prašović
 
Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...
Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...
Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...
 
Enabling Smarter Cities and Connected Vehicles with an Event Streaming Platfo...
Enabling Smarter Cities and Connected Vehicles with an Event Streaming Platfo...Enabling Smarter Cities and Connected Vehicles with an Event Streaming Platfo...
Enabling Smarter Cities and Connected Vehicles with an Event Streaming Platfo...
 
Fiware overview3
Fiware overview3Fiware overview3
Fiware overview3
 
FI Workshop Sesión Inaugural TID Chile
FI Workshop Sesión Inaugural TID ChileFI Workshop Sesión Inaugural TID Chile
FI Workshop Sesión Inaugural TID Chile
 
FIWARE Tech Summit - FIWARE Overview and Description of GEs
FIWARE Tech Summit - FIWARE Overview and Description of GEsFIWARE Tech Summit - FIWARE Overview and Description of GEs
FIWARE Tech Summit - FIWARE Overview and Description of GEs
 
Iot presentation JustPeople-ago2015
Iot presentation JustPeople-ago2015Iot presentation JustPeople-ago2015
Iot presentation JustPeople-ago2015
 
A Real-Time Version of the Truth
 A Real-Time Version of the Truth A Real-Time Version of the Truth
A Real-Time Version of the Truth
 
Fast Data – Fast Cars: Wie Apache Kafka die Datenwelt revolutioniert
Fast Data – Fast Cars: Wie Apache Kafka die Datenwelt revolutioniertFast Data – Fast Cars: Wie Apache Kafka die Datenwelt revolutioniert
Fast Data – Fast Cars: Wie Apache Kafka die Datenwelt revolutioniert
 
Apache Kafka for Smart Grid, Utilities and Energy Production
Apache Kafka for Smart Grid, Utilities and Energy ProductionApache Kafka for Smart Grid, Utilities and Energy Production
Apache Kafka for Smart Grid, Utilities and Energy Production
 
Viele Autos, noch mehr Daten: IoT-Daten-Streaming mit MQTT & Kafka (Kai Waehn...
Viele Autos, noch mehr Daten: IoT-Daten-Streaming mit MQTT & Kafka (Kai Waehn...Viele Autos, noch mehr Daten: IoT-Daten-Streaming mit MQTT & Kafka (Kai Waehn...
Viele Autos, noch mehr Daten: IoT-Daten-Streaming mit MQTT & Kafka (Kai Waehn...
 
Best Practices for Streaming IoT Data with MQTT and Apache Kafka
Best Practices for Streaming IoT Data with MQTT and Apache KafkaBest Practices for Streaming IoT Data with MQTT and Apache Kafka
Best Practices for Streaming IoT Data with MQTT and Apache Kafka
 
Internet of Things (IoT) - in the cloud or rather on-premises?
Internet of Things (IoT) - in the cloud or rather on-premises?Internet of Things (IoT) - in the cloud or rather on-premises?
Internet of Things (IoT) - in the cloud or rather on-premises?
 

More from Kai Wähner

Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)
Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)
Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)Kai Wähner
 
When NOT to use Apache Kafka?
When NOT to use Apache Kafka?When NOT to use Apache Kafka?
When NOT to use Apache Kafka?Kai Wähner
 
Kafka for Live Commerce to Transform the Retail and Shopping Metaverse
Kafka for Live Commerce to Transform the Retail and Shopping MetaverseKafka for Live Commerce to Transform the Retail and Shopping Metaverse
Kafka for Live Commerce to Transform the Retail and Shopping MetaverseKai Wähner
 
Apache Kafka vs. Cloud-native iPaaS Integration Platform Middleware
Apache Kafka vs. Cloud-native iPaaS Integration Platform MiddlewareApache Kafka vs. Cloud-native iPaaS Integration Platform Middleware
Apache Kafka vs. Cloud-native iPaaS Integration Platform MiddlewareKai Wähner
 
Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?
Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?
Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?Kai Wähner
 
Serverless Kafka and Spark in a Multi-Cloud Lakehouse Architecture
Serverless Kafka and Spark in a Multi-Cloud Lakehouse ArchitectureServerless Kafka and Spark in a Multi-Cloud Lakehouse Architecture
Serverless Kafka and Spark in a Multi-Cloud Lakehouse ArchitectureKai Wähner
 
Data Streaming with Apache Kafka in the Defence and Cybersecurity Industry
Data Streaming with Apache Kafka in the Defence and Cybersecurity IndustryData Streaming with Apache Kafka in the Defence and Cybersecurity Industry
Data Streaming with Apache Kafka in the Defence and Cybersecurity IndustryKai Wähner
 
Apache Kafka in the Healthcare Industry
Apache Kafka in the Healthcare IndustryApache Kafka in the Healthcare Industry
Apache Kafka in the Healthcare IndustryKai Wähner
 
Apache Kafka in the Healthcare Industry
Apache Kafka in the Healthcare IndustryApache Kafka in the Healthcare Industry
Apache Kafka in the Healthcare IndustryKai Wähner
 
Apache Kafka for Real-time Supply Chain in the Food and Retail Industry
Apache Kafka for Real-time Supply Chainin the Food and Retail IndustryApache Kafka for Real-time Supply Chainin the Food and Retail Industry
Apache Kafka for Real-time Supply Chain in the Food and Retail IndustryKai Wähner
 
Kafka for Real-Time Replication between Edge and Hybrid Cloud
Kafka for Real-Time Replication between Edge and Hybrid CloudKafka for Real-Time Replication between Edge and Hybrid Cloud
Kafka for Real-Time Replication between Edge and Hybrid CloudKai Wähner
 
Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0
Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0
Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0Kai Wähner
 
Apache Kafka Landscape for Automotive and Manufacturing
Apache Kafka Landscape for Automotive and ManufacturingApache Kafka Landscape for Automotive and Manufacturing
Apache Kafka Landscape for Automotive and ManufacturingKai Wähner
 
Kappa vs Lambda Architectures and Technology Comparison
Kappa vs Lambda Architectures and Technology ComparisonKappa vs Lambda Architectures and Technology Comparison
Kappa vs Lambda Architectures and Technology ComparisonKai Wähner
 
The Top 5 Apache Kafka Use Cases and Architectures in 2022
The Top 5 Apache Kafka Use Cases and Architectures in 2022The Top 5 Apache Kafka Use Cases and Architectures in 2022
The Top 5 Apache Kafka Use Cases and Architectures in 2022Kai Wähner
 
Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...
Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...
Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...Kai Wähner
 
Apache Kafka in the Transportation and Logistics
Apache Kafka in the Transportation and LogisticsApache Kafka in the Transportation and Logistics
Apache Kafka in the Transportation and LogisticsKai Wähner
 
Apache Kafka for Cybersecurity and SIEM / SOAR Modernization
Apache Kafka for Cybersecurity and SIEM / SOAR ModernizationApache Kafka for Cybersecurity and SIEM / SOAR Modernization
Apache Kafka for Cybersecurity and SIEM / SOAR ModernizationKai Wähner
 
Apache Kafka in the Automotive Industry (Connected Vehicles, Manufacturing 4....
Apache Kafka in the Automotive Industry (Connected Vehicles, Manufacturing 4....Apache Kafka in the Automotive Industry (Connected Vehicles, Manufacturing 4....
Apache Kafka in the Automotive Industry (Connected Vehicles, Manufacturing 4....Kai Wähner
 
Serverless Kafka on AWS as Part of a Cloud-native Data Lake Architecture
Serverless Kafka on AWS as Part of a Cloud-native Data Lake ArchitectureServerless Kafka on AWS as Part of a Cloud-native Data Lake Architecture
Serverless Kafka on AWS as Part of a Cloud-native Data Lake ArchitectureKai Wähner
 

More from Kai Wähner (20)

Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)
Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)
Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)
 
When NOT to use Apache Kafka?
When NOT to use Apache Kafka?When NOT to use Apache Kafka?
When NOT to use Apache Kafka?
 
Kafka for Live Commerce to Transform the Retail and Shopping Metaverse
Kafka for Live Commerce to Transform the Retail and Shopping MetaverseKafka for Live Commerce to Transform the Retail and Shopping Metaverse
Kafka for Live Commerce to Transform the Retail and Shopping Metaverse
 
Apache Kafka vs. Cloud-native iPaaS Integration Platform Middleware
Apache Kafka vs. Cloud-native iPaaS Integration Platform MiddlewareApache Kafka vs. Cloud-native iPaaS Integration Platform Middleware
Apache Kafka vs. Cloud-native iPaaS Integration Platform Middleware
 
Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?
Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?
Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?
 
Serverless Kafka and Spark in a Multi-Cloud Lakehouse Architecture
Serverless Kafka and Spark in a Multi-Cloud Lakehouse ArchitectureServerless Kafka and Spark in a Multi-Cloud Lakehouse Architecture
Serverless Kafka and Spark in a Multi-Cloud Lakehouse Architecture
 
Data Streaming with Apache Kafka in the Defence and Cybersecurity Industry
Data Streaming with Apache Kafka in the Defence and Cybersecurity IndustryData Streaming with Apache Kafka in the Defence and Cybersecurity Industry
Data Streaming with Apache Kafka in the Defence and Cybersecurity Industry
 
Apache Kafka in the Healthcare Industry
Apache Kafka in the Healthcare IndustryApache Kafka in the Healthcare Industry
Apache Kafka in the Healthcare Industry
 
Apache Kafka in the Healthcare Industry
Apache Kafka in the Healthcare IndustryApache Kafka in the Healthcare Industry
Apache Kafka in the Healthcare Industry
 
Apache Kafka for Real-time Supply Chain in the Food and Retail Industry
Apache Kafka for Real-time Supply Chainin the Food and Retail IndustryApache Kafka for Real-time Supply Chainin the Food and Retail Industry
Apache Kafka for Real-time Supply Chain in the Food and Retail Industry
 
Kafka for Real-Time Replication between Edge and Hybrid Cloud
Kafka for Real-Time Replication between Edge and Hybrid CloudKafka for Real-Time Replication between Edge and Hybrid Cloud
Kafka for Real-Time Replication between Edge and Hybrid Cloud
 
Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0
Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0
Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0
 
Apache Kafka Landscape for Automotive and Manufacturing
Apache Kafka Landscape for Automotive and ManufacturingApache Kafka Landscape for Automotive and Manufacturing
Apache Kafka Landscape for Automotive and Manufacturing
 
Kappa vs Lambda Architectures and Technology Comparison
Kappa vs Lambda Architectures and Technology ComparisonKappa vs Lambda Architectures and Technology Comparison
Kappa vs Lambda Architectures and Technology Comparison
 
The Top 5 Apache Kafka Use Cases and Architectures in 2022
The Top 5 Apache Kafka Use Cases and Architectures in 2022The Top 5 Apache Kafka Use Cases and Architectures in 2022
The Top 5 Apache Kafka Use Cases and Architectures in 2022
 
Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...
Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...
Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...
 
Apache Kafka in the Transportation and Logistics
Apache Kafka in the Transportation and LogisticsApache Kafka in the Transportation and Logistics
Apache Kafka in the Transportation and Logistics
 
Apache Kafka for Cybersecurity and SIEM / SOAR Modernization
Apache Kafka for Cybersecurity and SIEM / SOAR ModernizationApache Kafka for Cybersecurity and SIEM / SOAR Modernization
Apache Kafka for Cybersecurity and SIEM / SOAR Modernization
 
Apache Kafka in the Automotive Industry (Connected Vehicles, Manufacturing 4....
Apache Kafka in the Automotive Industry (Connected Vehicles, Manufacturing 4....Apache Kafka in the Automotive Industry (Connected Vehicles, Manufacturing 4....
Apache Kafka in the Automotive Industry (Connected Vehicles, Manufacturing 4....
 
Serverless Kafka on AWS as Part of a Cloud-native Data Lake Architecture
Serverless Kafka on AWS as Part of a Cloud-native Data Lake ArchitectureServerless Kafka on AWS as Part of a Cloud-native Data Lake Architecture
Serverless Kafka on AWS as Part of a Cloud-native Data Lake Architecture
 

Recently uploaded

Sending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdfSending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdf31events.com
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWave PLM
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureDinusha Kumarasiri
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Velvetech LLC
 
Machine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringMachine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringHironori Washizaki
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作qr0udbr0
 
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Natan Silnitsky
 
Precise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalPrecise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalLionel Briand
 
PREDICTING RIVER WATER QUALITY ppt presentation
PREDICTING  RIVER  WATER QUALITY  ppt presentationPREDICTING  RIVER  WATER QUALITY  ppt presentation
PREDICTING RIVER WATER QUALITY ppt presentationvaddepallysandeep122
 
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...OnePlan Solutions
 
Software Coding for software engineering
Software Coding for software engineeringSoftware Coding for software engineering
Software Coding for software engineeringssuserb3a23b
 
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024StefanoLambiase
 
MYjobs Presentation Django-based project
MYjobs Presentation Django-based projectMYjobs Presentation Django-based project
MYjobs Presentation Django-based projectAnoyGreter
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsAhmed Mohamed
 
SensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving CarsSensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving CarsChristian Birchler
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesPhilip Schwarz
 
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Angel Borroy López
 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesŁukasz Chruściel
 
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanySuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanyChristoph Pohl
 

Recently uploaded (20)

Sending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdfSending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdf
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need It
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with Azure
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...
 
Machine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringMachine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their Engineering
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作
 
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
 
Precise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalPrecise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive Goal
 
PREDICTING RIVER WATER QUALITY ppt presentation
PREDICTING  RIVER  WATER QUALITY  ppt presentationPREDICTING  RIVER  WATER QUALITY  ppt presentation
PREDICTING RIVER WATER QUALITY ppt presentation
 
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
 
Software Coding for software engineering
Software Coding for software engineeringSoftware Coding for software engineering
Software Coding for software engineering
 
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
 
MYjobs Presentation Django-based project
MYjobs Presentation Django-based projectMYjobs Presentation Django-based project
MYjobs Presentation Django-based project
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML Diagrams
 
SensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving CarsSensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving Cars
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a series
 
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New Features
 
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanySuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
 
Odoo Development Company in India | Devintelle Consulting Service
Odoo Development Company in India | Devintelle Consulting ServiceOdoo Development Company in India | Devintelle Consulting Service
Odoo Development Company in India | Devintelle Consulting Service
 

IoT Architectures for a Digital Twin with Apache Kafka, IoT Platforms and Machine Learning

  • 1. IoT Architectures for a Digital Twin with Apache Kafka and Event Streaming A Digital Replica of Things - Open, Scalable and Reliable Kai Waehner Technology Evangelist contact@kai-waehner.de LinkedIn @KaiWaehner www.confluent.io www.kai-waehner.de
  • 2. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Abstract IoT Architectures for a Digital Twin with Apache Kafka and Event Streaming A digital twin is a digital replica of a living or non-living physical entity. This session discusses the benefits and IoT architectures of a Digital Twin in Industrial IoT (IIoT) and its relation to Apache Kafka and other IoT frameworks. Kafka is often used as central event streaming platform to build a scalable and reliable digital twin for real time streaming sensor data. A live demo shows a scalable digital twin infrastructure for condition monitoring and predictive maintenance in real time for a connected car infrastructure leveraging Kafka, MQTT and TensorFlow. Key Take-Aways: ● Learn about use cases and characteristics of a digital twin in various industries ● Understand how to build a digital twin for every single (of tens of thousands) IoT device or machine ● See different IoT architectures with Kafka and other IoT technologies and products ● Bring added value to your IoT infrastructure by enabling use cases like predictive maintenance ● Understand how the Apache Kafka enables scalable and flexible end-to-end integration processing from IIoT data to various backend applications ● Watch a live demo of an end-to-end integration, real time processing and analytics of thousands of IoT devices
  • 3. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Key Takeaways • A Digital Twin merges the physical and the digital world • Apache Kafka enables an open, scalable and reliable infrastructure for a Digital Twin • Event Streaming complements IoT platforms and other backend applications / databases. + 3
  • 4. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Agenda • Digital Twin - Merging the Physical and the Digital World • Real World Challenges • IoT Platforms • Apache Kafka as Event Streaming Solution for IoT • Spoilt for Choice for a Digital Twin • Global IoT Architectures • A Digital Twin for 100000 Connected Cars 4
  • 5. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Agenda • Digital Twin - Merging the Physical and the Digital World • Real World Challenges • IoT Platforms • Apache Kafka as Event Streaming Solution for IoT • Spoilt for Choice for a Digital Twin • Global IoT Architectures • A Digital Twin for 100000 Connected Cars 5
  • 6. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Software and Digital Services become the Key Differentiator 6 https://www.mckinsey.com/industries/advanced-electronics/our-insights/iiot-platforms-the-technology-stack-as-value-driver-in-industrial-equipment-and-machinery
  • 7. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Digital Twin – Merging the Physical and the Digital World 7 • Downtime reduction • Inventory management • Fleet management • What-if simulations • Operational planning • Servitization • Product development • Healthcare • Customer experience “Virtual representation of something else (Physical thing, process, service)” “A living model that drives a business outcome” https://www.youtube.com/watch?v=Ri0TD7kYsIQ
  • 8. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Granularity of Digital Twins 8 https://www.youtube.com/watch?v=cfbKR48nSyQ Remaining Useful Life
  • 9. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Digital Thread 9 Digital Twin vs. Digital Thread? I only use the term Digital Twin in the following slides. Both terms overlap, often meaning the same. Span the entire lifecycle
  • 10. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Virtual Singapore: A Digital Twin of the (Smart) City 10 Design, Monitor and Manage Cities • Urban Planning (e.g. Crowd Simulation) • Collaboration and Decision-Making • Communication and Visualisation • Improved Accessibility • Analysis on Potential for Solar Energy Production • … https://www.nrf.gov.sg/programmes/virtual-singapore
  • 11. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Smart Infrastructure: Digital Solutions for Entire Building Lifecycle 11 https://new.siemens.com/global/en/products/buildings/digitalization/digital-building-lifecycle.html • Safer, more secure, more efficient and resilient buildings • Continuously interaction, learning and adaption to create environments that care • Follow the entire digital building lifecycle
  • 12. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Connected Car Infrastructure 12 https://www.youtube.com/watch?v=yGLKi3TMJv8 • Real Time Data Analysis • Swarm Intelligence • Collaboration with Partners • Predictive AI • …
  • 13. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Twinning the Human Body to Enhance Medical Care 13 • Monitoring and evaluation without being in close proximity • Testing the impact of changes on the performance of a system • Smart machines are more advanced than humans • Determine what actions to take • Modelling an individual’s genomic makeup, physiological characteristics, and lifestyle to create personalized medicine • Capturing the human body will have multiple benefits for doctors such as discovering undeveloped illnesses, experimenting with treatments, and improving preparation for surgeries https://www.challenge.org/insights/digital-twin-in-healthcare/ https://youtu.be/H6JzPCbyVSM
  • 14. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Market 2020 14 https://www.youtube.com/watch?v=w4TPeVO3dwo (Q1, 2020 Webinar IoT Analytics Research)
  • 15. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Digital Twin and Artificial Intelligence (AI) / Machine Learning • Complementary Concepts • Continuous Learning, Monitoring and Acting • (Good) data is key for success 15 https://towardsdatascience.com/understanding-feature-engineering-part-1-continuous-numeric-data-da4e47099a7b
  • 16. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Digital Twin Applied… 16 https://www.youtube.com/watch?v=cfbKR48nSyQ
  • 17. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Digital Twin Applied… 17 https://www.youtube.com/watch?v=cfbKR48nSyQ
  • 18. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Digital Twin Applied… 18 https://www.youtube.com/watch?v=cfbKR48nSyQ
  • 19. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Digital Twin Applied… 19 https://www.youtube.com/watch?v=cfbKR48nSyQ
  • 20. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Agenda • Digital Twin - Merging the Physical and the Digital World • Real World Challenges • IoT Platforms • Apache Kafka as Event Streaming Solution for IoT • Spoilt for Choice for a Digital Twin • Global IoT Architectures • A Digital Twin for 100000 Connected Cars 20
  • 21. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de History of Automation Industry vs. Big Data and Cloud https://foss-backstage.de/sites/foss-backstage.de/files/2018-07/Revolutionizing%20Industrial%20IoT%20with%20Apache%20PLC4X.pdf
  • 22. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Challenges in Automation Industry IoT != IIoT != Buildings != Healthcare … • IoT = Connected cars, smart home, … à Large scale, secure, scalable, open, modern technologies • IIoT / Buildings = Not connected at all or slow, insecure, not scalable, proprietary • Healthcare = Often not connected at all yet (huge security requirements) Legacy / Proprietary IIoT Technologies • Usually incompatible protocols, typically proprietary • Usually serial connections (very low latency, nanoseconds) - with TCP / UDP wrapper around it to integrate with “external world” • Siemens S7, Modbus, Beckhoff, Profinet, Allen Bradley, etc. • OPC-UA (required machine update + license cost) Product Lifecycles • Long lifecycle (tens of years) • Factories and buildings cost millions, no simple changes / upgrades • Still using Windows 7 without Service Packs => Usability and security issues • Mantra: “Stay with your well-known vendor forever”
  • 23. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Challenges in Automation Industry Monoliths • No scalability • No extendibility • No real failover (start your backup machine) Missing Security Capabilities • Security in software development == Authentication, Authorization, Antivirus, SSL, SASL, Kerberos • Security in automation industry == Safety • “if you press the red button, the machine stops immediately” • Insecure by nature => No Authentication / Authorization / Encryption • Mantra: “Our factory building and network is secure, no access from outside” • Contradicts with “move to cloud and big data analytics”
  • 24. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Trends: Evolution of Convergence between IT and Industrial Automation https://iot-analytics.com/5-industrial-connectivity-trends-driving-the-it-ot-convergence
  • 25. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Complexity, Cost and Scalability are Main Blockers 25
  • 26. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Huge demand to build an open, flexible, scalable platform • Real time • Scalability • High availability • Decoupling • Cost reduction • Flexibility • Standards-based • Extendibility • Security • Infrastructure-independent • Multi-region / global
  • 27. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Agenda • Digital Twin - Merging the Physical and the Digital World • Real World Challenges • IoT Platforms • Apache Kafka as Event Streaming Solution for IoT • Spoilt for Choice for a Digital Twin • Global IoT Architectures • A Digital Twin for 100000 Connected Cars 27
  • 28. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de 600+ IoT Platforms 28 https://iot-analytics.com/iot-platform-companies-landscape-2020/
  • 29. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Proprietary IoT Platforms • Sophisticated integration for related IIoT protocols (like Siemens S7, Modbus, etc.) and standards (like OPC-UA) • Not a single product (plenty of acquisitions, OEMs and different code bases are typically the foundation) • Typically very expensive • Proprietary (just open interfaces) • Limited scalability 29
  • 30. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de IoT Offerings from Cloud Providers • Sophisticated tools for IoT management (devices, shadowing, …) • Good integration with other cloud services (storage, analytics, …) • Vendor lock-in • No focus on hybrid and edge (but on prem products) • Limited scalability • Often high cost (beyond ’hello world’) 30
  • 31. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Standards-based / Open Source IoT Platforms • Open and standards-based (e.g. MQTT) • Open source / open core business model • Infrastructure-independent • Different vendors behind the core technologies • Sometimes less mature or non-existent connectivity (especially to legacy and proprietary protocols) • Trade-off: Solid offering for one standard (e.g. HiveMQ for MQTT) or diversity but not for mission-critical scale (e.g. Node- RED) 31
  • 32. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Agenda • Digital Twin - Merging the Physical and the Digital World • Real World Challenges • IoT Platforms • Apache Kafka as Event Streaming Solution for IoT • Spoilt for Choice for a Digital Twin • Global IoT Architectures • A Digital Twin for 100000 Connected Cars 32
  • 33. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de The Log ConnectorsConnectors Producer Consumer Streaming Engine Apache Kafka - The Rise of an Event Streaming Platform 33 = Messaging + Storage + Integration + Processing
  • 34. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Apache Kafka at Scale at Tech Giants > 7 trillion messages / day > 6 Petabytes / day “You name it” * Kafka is not just used for big data ** Kafka Is not just used by tech giants 34
  • 35. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Improve Customer Experience (CX) Increase Revenue (make money) Business Value Decrease Costs (save money) Core Business Platform Increase Operational Efficiency Migrate to Cloud Mitigate Risk (protect money) Key Drivers Strategic Objectives (sample) Fraud Detection IoT sensor ingestion Digital replatforming/ Mainframe Offload Connected Car: Navigation & improved in- car experience: Audi Customer 360 Simplifying Omni-channel Retail at Scale: Target Faster transactional processing / analysis incl. Machine Learning / AI Mainframe Offload: RBC Microservices Architecture Online Fraud Detection Online Security (syslog, log aggregation, Splunk replacement) Middleware replacement Regulatory Digital Transformation Application Modernization: Multiple Examples Website / Core Operations (Central Nervous System) The [Silicon Valley] Digital Natives; LinkedIn, Netflix, Uber, Yelp... Predictive Maintenance: Audi Streaming Platform in a regulated environment (e.g. Electronic Medical Records): Celmatix Real-time app updates Real Time Streaming Platform for Communications and Beyond: Capital One Developer Velocity - Building Stateful Financial Applications with Kafka Streams: Funding Circle Detect Fraud & Prevent Fraud in Real Time: PayPal Kafka as a Service - A Tale of Security and Multi-Tenancy: Apple Example Use Cases $↑ $↓ $↔ Example Case Studies (of many) Confluent - Business Value per Use Case 35
  • 36. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de P Decoupling of Producers and Consumers Time C2 C3C1 36
  • 37. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Schema Enforcement (Client and Server Side) 37
  • 38. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Kafka Connect Kafka Cluster MQTT Integration Domain-Driven Design (DDD) for your Event Steaming Platform OPC-UA Integration Real Time Predictions IoT Platform Connector Java / Python / ”you-name-it” Schema Registry Event Streaming Platform IoT Domain Legacy Domain Analytics Domain è Independent and loosely coupled, but scalable, highly available and reliable! 38
  • 39. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de 10 Reasons for Event Streaming with Apache Kafka Real Time Scalable Cost Reduction 24/7 – Zero downtime, zero data loss Decoupling – Storage, Domain-driven Design Data (re-)processing and stateful client applications Integration – Connectivity to IoT, legacy, big data, everything Hybrid Architecture – On Premises, multi cloud, edge computing Fully managed cloud No vendor locking 39
  • 40. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Digital Twin and AI / Machine Learning (with Kafka) • Complementary Concepts • Continuous Learning, Monitoring and Acting à Real time, scalable • (Good) data is key for success à Integration, data processing 40
  • 41. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Apache Kafka and Machine Learning – More Details 41 Blog: How to Build and Deploy Scalable Machine Learning in Production with Apache Kafka: https://www.confluent.io/blog/build-deploy-scalable-machine-learning-production-apache-kafka/ Blog: Using Apache Kafka to Drive Cutting-Edge Machine Learning: https://www.confluent.io/blog/using-apache-kafka-drive-cutting-edge-machine-learning/ Blog: Machine Learning and Real-Time Analytics in Apache Kafka Applications: https://www.confluent.io/blog/machine-learning-real-time-analytics-models-in-kafka-applications/ Blog: Streaming Machine Learning with Tiered Storage and Without a Data Lake: https://www.confluent.io/blog/streaming-machine-learning-with-tiered-storage/ Slides + Video: Event-driven Model Scoring vs. RPC with Model Server: https://www.confluent.io/kafka-summit-san-francisco-2019/event-driven-model-serving-stream-processing-vs-rpc-with-kafka- and-tensorflow/ Few examples for real time scoring with Kafka Steams and TensorFlow, H2O, etc.: https://github.com/kaiwaehner/kafka-streams-machine-learning-examples Demo: Anomaly Detection with KSQL: https://github.com/kaiwaehner/ksql-udf-deep-learning-mqtt-iot Demo + Video: Connected Car demo with Kafka + Streaming ML https://github.com/kaiwaehner/hivemq-mqtt-tensorflow-kafka-realtime-iot-machine-learning-training-inference
  • 42. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Hold on… Kafka is NOT an IoT Platform!
  • 43. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Device management Unreliable networks Connectivity beyond standards Edge hardware …
  • 44. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de (De facto) Standards for Processing IoT Data A Match Made In Heaven + = IoT Platform
  • 45. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Agenda • Digital Twin - Merging the Physical and the Digital World • Real World Challenges • IoT Platforms • Apache Kafka as Event Streaming Solution for IoT • Spoilt for Choice for a Digital Twin • Global IoT Architectures • A Digital Twin for 100000 Connected Cars 45
  • 46. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Characteristics of Digital Twin Technology • Connectivity • Physical assets, enterprise software, customers • Bidirectional communication to ingest, command and control • Homogenization • Decoupling and standardization • Virtualization of information • Shared with multiple agents, unconstrained by physical location or time • Lower cost and easier testing, development and predictions • Reprogrammable and smart • Adjust and improve characteristics and develop new version of a product • Digital traces • Go back in time and analyse historical events to diagnose problems • Modularity • Design and customization of products and production modules • Tweak modules of models and machines 46
  • 47. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Digital Twin Mapped to Apache Kafka • Connectivity – Kafka Connect provides connectivity as scale in real time to IoT interfaces, big data solutions and cloud services. The Kafka ecosystem is complementary, NOT competitive to other Middleware and IoT Platforms. • Homogenization – Real decoupling between clients (i.e. producers and consumers) is one of the key strengths of Kafka. Schema management and enforcement leveraging different technologies (JSON Schema, Avro, Profobuf, etc.) enables data awareness and standardization. • Reprogrammable and smart – Kafka is the de facto standard for microservices for exactly this reason: Separation of concerns and domain-driven design (DDD). Deploy new decoupled applications and versions, do A/B testing, canarying. • Digital traces – Kakfa is a distributed commit log. Events are appended, stored as long as you want (potentially forever with rentention time = -1) and immutable. Seriously, what other technology could be used better to build a digital trace for a digital twin? • Modularity – The Kafka infrastructure itself is modular and scalable. This includes components like Kafka brokers, Connect, Schema Registry, REST Proxy and client applications in different languages like Java, Scala, Python, Go, .NET, C++ and others. With this modularity, you can easily build the right Digital Twin architecture your your edge, hybrid or global scenarios and also combine the Kafka components with any other IoT solutions. 47
  • 48. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Simplified Pipeline for a Digital Twin 48 Siemens S7, Modbus, Allen Bradley, Beckhoff ADS IoT Platform Digital Twin Real Time App Batch App Request Response App Connectivity Homogenization Reprogrammable and smart Digital traces Modularity
  • 49. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Scenario 1: Digital Twin Monolith 49 Siemens S7, Modbus, Allen Bradley, Beckhoff ADS IoT Platform Digital Twin Device Mgt. Analytics Connectivity Homogenization Reprogrammable and smart Digital traces Modularity
  • 50. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Scenario 2: Digital Twin as External Database 50 Siemens S7, Modbus, Allen Bradley, Beckhoff ADS IoT Platform Digital Twin Device Mgt. Database XYZAnalytics Connectivity Homogenization Reprogrammable and smart Digital traces Modularity
  • 51. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Apache Kafka Scenario 3: Kafka as Backbone for the Digital Twin and the Rest of the Enterprise 51 Siemens S7, Modbus, Allen Bradley, Beckhoff ADS IoT Platform Digital Twin Database XYZ Real Time App Batch App Request Response App Kafka Connect Connectivity Homogenization Reprogrammable and smart Digital traces Modularity
  • 52. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Apache Kafka Scenario 4: Kafka as IoT Platform 52 Siemens S7, Modbus, Allen Bradley, Beckhoff ADS IoT Cloud Platform Digital Twin Real Time App Batch App Request Response App Kafka Connect Connectivity Homogenization Reprogrammable and smart Digital traces Modularity Storage Processing
  • 53. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Apache Kafka Scenario 5: Kafka as Digital Twin 53 Siemens S7, Modbus, Allen Bradley, Beckhoff ADS Digital Twin Real Time App Batch App Request Response App Kafka Connect Storage Processing Connectivity Homogenization Reprogrammable and smart Digital traces Modularity
  • 54. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Kafka as Database? Seriously?
  • 55. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de P Kafka’s Storage – A Distributed Commit Log Time C2 C3C1 55
  • 56. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Tiered Storage for Kafka Object Store Processing Storage Transactions, auth, quota enforcement, compaction, ... Local Remote Kafka Apps (Only available in Confluent Platform) www.kai-waehner.de | @KaiWaehner
  • 57. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Distributed System with Replication and High Availability on Server and Client Side read, write Kafka Client Kafka Server Side Materialized View in the Client App (In-memory, RocksDB)
  • 58. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Stateful Kafka Client Applications 58
  • 59. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Kafka as Data Storage? 59 https://www.kai-waehner.de/blog/2020/03/12/can-apache-kafka-replace-database-acid-storage-transactions-sql-nosql-data-lake/
  • 60. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Agenda • Digital Twin - Merging the Physical and the Digital World • Real World Challenges • IoT Platforms • Apache Kafka as Event Streaming Solution for IoT • Spoilt for Choice for a Digital Twin • Global IoT Architectures • A Digital Twin for 100000 Connected Cars 60
  • 61. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de No matter which Digital Twin Architecture I use… Most Architectures are Hybrid (Edge, Data Center, Cloud) and sometimes even Global!
  • 62. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Edge Digital Twin Single Kafka Broker (or Cluster) Digital Twin Self-managed or certified OEM Hardware Kafka Cluster in DC / Cloud Replicator Siemens S7, Modbus, Allen Bradley, Beckhoff ADS
  • 63. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Centralized Digital Twin Digital Twin Single Kafka Broker (or Cluster) Self-managed or certified OEM Hardware Siemens S7, Modbus, Allen Bradley, Beckhoff ADS Single Kafka Broker (or Cluster) Self-managed or certified OEM Hardware Siemens S7, Modbus, Allen Bradley, Beckhoff ADS
  • 64. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Global Digital Twin Architecture Multiple Clusters and Aggregation Factories à Analytics Cluster Multi-Region Cluster High Availability (Disaster Recovery) Global Data Streaming Outsourced Development
  • 65. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Example of a Multi-Region Digital Twin Deployment Order Sensor Order Sensor Logs Logs synchronous asynchronous ● Automate Disaster Recovery ● Sync or Async Replication per Topic ● Offset Preserving ● Automated Client Failover with No Custom Code Zero downtime, zero data loss (even in cases of data center outage) US West US East
  • 66. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Architecture patterns for distributed, hybrid, edge and global Apache Kafka deployments www.kai-waehner.de | @KaiWaehner https://www.kai-waehner.de/blog/2020/01/29/deployment-patterns-distributed-hybrid-edge-global-multi-data-center-kafka-architecture/
  • 67. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Agenda • Digital Twin - Merging the Physical and the Digital World • Real World Challenges • IoT Platforms • Apache Kafka as Event Streaming Solution for IoT • Spoilt for Choice for a Digital Twin • Global IoT Architectures • A Digital Twin for 100000 Connected Cars 67
  • 68. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de A Digital Twin with Kafka and TensorFlow 68 MQTT Proxy Elastic Grafana Kafka Cluster Kafka Connect Car Sensors Kafka Ecosystem TensorFlow Other Components Kafka Streams (Java) All Data Critical Data Ingest Data Potential Detect KSQL TensorFlow Train Analytic Model Consume Data Preprocess Data Analytic Model Deploy Analytic Model Python https://github.com/kaiwaehner/hivemq-mqtt-tensorflow-kafka-realtime-iot-machine-learning-training-inference Connectivity Homogenization Reprogrammable and smart Digital traces Modularity
  • 69. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Architecture for 100000 Connected Cars Kafka + KSQL + MQTT + TensorFlow + Kubernetes 69 https://www.kai-waehner.de/blog/2019/11/08/live-demo-iot-100-000-connected-cars-kubernetes-kafka-mqtt-tensorflow/
  • 70. IoT, Digital Twin and Event Streaming – @KaiWaehner - www.kai-waehner.de Key Takeaways • A Digital Twin merges the physical and the digital world • Apache Kafka enables an open, scalable and reliable infrastructure for a Digital Twin • Event Streaming complements IoT platforms and other backend applications / databases. + 70