2. What’s a “distributed system”?
You know you have a distributed system when the crash of a computer you’ve never heard of
stops you from getting any work done. —LESLIE LAMPORT
3. Your mission, should you choose to accept it:
• Read data from one “place”
• Write it to another “place”
5. System Event | Actual Latency | Scaled Latency
One CPU cycle | 0.4 ns | 1 s
Level 1 cache access | 0.9 ns | 2 s
Level 2 cache access | 2.8 ns | 7 s
Level 3 cache access | 28 ns | 1 min
Main memory access (DDR DIMM) | ~100 ns | 4 min
Intel® Optane™ DC persistent memory access | ~350 ns | 15 min
Intel® Optane™ DC SSD I/O | <10 μs | 7 hrs
NVMe SSD I/O | ~25 μs | 17 hrs
SSD I/O | 50–150 μs | 1.5–4 days
Rotational disk I/O | 1–10 ms | 1–9 months
Internet call: San Francisco to New York City | 65 ms | 5 years
Internet call: San Francisco to Hong Kong | 141 ms | 11 years
Source: Systems Performance: Enterprise and the Cloud, Brendan Gregg
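To read the scaled column: one second stands for a single 0.4 ns CPU cycle, i.e. every actual latency is multiplied by roughly 2.5 × 10⁹.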
mov eax, [ebx]    ; read: load the value at the address in ebx into eax
mov [ecx], eax    ; write: store eax to the address in ecx
(try
  (let [[partitioner msg] (cha
    (kp/send-message @pr
      (kp/message topic (.getBytes partitioner) (.getBytes ^String
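The snippet above is cut off in the export, but the point stands: a one-line "send" hides serialization, partition-leader lookup, and a network round trip to the broker. A minimal sketch of the same idea, assuming kp aliases the clj-kafka.producer namespace and a broker on localhost:9092 (topic name and config values here are illustrative, not the original code):

(require '[clj-kafka.producer :as kp])

(def p (kp/producer {"metadata.broker.list" "localhost:9092"
                     "serializer.class"     "kafka.serializer.DefaultEncoder"
                     "partitioner.class"    "kafka.producer.DefaultPartitioner"}))

;; Looks like a local call, but it performs network I/O to the Kafka broker.
(kp/send-message p (kp/message "events" (.getBytes "hello, distributed world")))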
32. • Don’t take distributed actions lightly
• Be careful when using abstractions that hide distributed calls
• Big data means low-probability problems are daily occurrences
33. Read more
• Fallacies of distributed computing
• Vector clocks
• CRDTs - https://www.serverless.com/blog/crdt-explained-supercharge-serverless-at-edge
• https://bartoszsypytkowski.com/the-state-of-a-state-based-crdts/
• Google Spanner
https://static.googleusercontent.com/media/research.google.com/en//archive/spanner-osdi2012.pdf
• https://research.google/pubs/pub45855/
Editor's Notes
8 fallacies
Formulated by Peter Deutsch and James Gosling (father of Java) in 1994-97
SKB – Linux socket buffer (fundamental structure that handles any packet sent or received )
[31334587.454365] xennet: skb rides the rocket: 21 slots
[31334772.157791] xennet: skb rides the rocket: 20 slots
[31335254.431489] xennet: skb rides the rocket: 19 slots
http://vger.kernel.org/~davem/skb.html
Anyway - not just the infrastructure; there are also other things that can affect reliability, like DDoS attacks
Switches have an MTBF of ~50K hours (just told Yaniv: Ericsson achieved nine 9s availability with their AXD301 switch)
Aggravated by microservices
99.99%^30 ≈ 99.7% uptime. 0.3% of 1 billion requests = 3,000,000 failures. 2+ hours of downtime/month even if all dependencies have excellent uptime.
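A quick REPL check of that compound figure (my own throwaway arithmetic, assuming 30 dependencies at 99.99% each):

;; availability of a request that touches 30 services, each 99.99% available
(Math/pow 0.9999 30)   ;=> ~0.997, i.e. roughly 99.7% end-to-end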
Retry, circuit breakers, caching, alerts
Bandwidth keeps getting better and better, but latencies don’t: light has a fast but finite speed, so a ping from Europe to the US and back is 30 ms even if everything is perfect
We’ve seen the numbers
Bandwidth gets higher – but we also send much more data
Generally we can get the bandwidth -> but it comes with a $ cost, so we need to keep in mind that we have to work within limitations
I don’t think that anyone is really likely to make this false assumption these days
We all know we need to deal with security – but are we doing enough? (checkmarx, whitesmoke)
But we’re just starting to move service-to-service traffic to SSL (Kafka, Spark still TBD)
The requirements for K8s security have changed significantly between the time I set up AKS and now
Build, runtime (kubei)
Same as the previous one – not likely that anyone believes that
That’s why we’re using configuration, discovery and such
The fact is no single person understands all aspects of the system
DevOps culture -> passing some responsibility to dev (you build it, you own it)
Monitoring – who is going to wake up?
Again config
Opex –
But more than that, serialization, encryption, …
Even my home has iOS, macOS, Windows, Android (phones, streamer), a printer (embedded), smart TVs
We’re *mostly* C#
We have “BIG DATA” technologies – we can *just* add more instances
Audit – running on Hadoop, so NameNodes, so ZooKeeper
TCO - think operational complexity
Choose the right tool for the job – if it fits in memory, don’t use needless technologies. I’ve answered countless times on Stack Overflow: “Why is Spark slow?”
Doing things during the pipeline vs. adding machines to deal with queries
Cattle not pets
Bandwidth keeps getting better and better, but latencies don’t: light has a fast but finite speed, so a ping from Europe to the US and back is 30 ms even if everything is perfect
We’ve seen the numbers
No ordering!
Time
Clock drift
Getting the time: NTP / PTP (Precision Time Protocol)
TrueTime
Leslie Lamport is a famous distributed computing researcher
Suppose that event A occurs in a data center, and then later event B.
Did A “cause” B to happen?
What if A was at 10am, and B at 11:30pm. Does knowing time help?
What if A was a command to register a new student, and B was an internal action that creates her “meal card” account?
What if A was an email from the department asking me about my teaching preferences, and B was my reply?
For Leslie, event A causes event B if there was a computation that somehow was triggered by A, and B was part of it. Inspired by physics!
But this is hard to discover automatically.
Instead, Leslie focused on potential causality: A “might” have caused B.
Under what conditions is this possible?
Somehow, information must flow from A to B.
Let’s use LogicalClock(X) to denote the relevant LogicalClock value for event X. We can time-stamp events and messages.
If A -> B, then LogicalClock(A) < LogicalClock(B)
But… if LogicalClock(A) < LogicalClock(B), perhaps A didn’t happen before B!
We can overcome that if we use a VectorClock
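A minimal Clojure sketch of both mechanisms (my own illustration of the standard algorithms, not code from the talk, with hypothetical node keys :a/:b/:c): a Lamport clock only gives the one-way implication above, while comparing vector clocks detects when A could not have happened before B.

;; Lamport logical clock: tick on every local event,
;; and take max(local, received) + 1 when a message arrives.
(defn lamport-tick [clock] (inc clock))
(defn lamport-receive [local-clock msg-clock] (inc (max local-clock msg-clock)))

;; Vector clock: a map of node -> counter.
(defn vc-tick [vc node] (update vc node (fnil inc 0)))
(defn vc-merge [a b] (merge-with max a b))   ;; on receive: merge, then tick own entry
(defn vc-happened-before? [a b]
  (and (every? (fn [[node n]] (<= n (get b node 0))) a)
       (some   (fn [[node n]] (< (get a node 0) n)) b)))

;; B has seen A's event, so A -> B; C's event is concurrent with A's.
(vc-happened-before? {:a 1} {:a 1 :b 1})   ;=> true
(vc-happened-before? {:a 1} {:c 1})        ;=> false (concurrent)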
Conflict-free Replicated Data Type
No meaning for ordering
Can be a base implementation for logical clocks (and vector clocks)
Growing only – (always increasing)
Can handle multi invocation
Still a problem around “zero” (ordering) -> effectively it is only a constructor
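The grow-only counter is small enough to sketch. A minimal Clojure illustration (my own, with hypothetical node keys :n1/:n2, not code from the talk): one non-decreasing slot per node, merged element-wise with max.

;; G-Counter: per-node slots that only grow; the value is the sum of all slots.
(defn g-inc   [counter node] (update counter node (fnil inc 0)))
(defn g-merge [a b] (merge-with max a b))   ;; commutative, associative, idempotent
(defn g-value [counter] (reduce + 0 (vals counter)))

;; Replicas increment independently and converge after merging in any order.
(g-value (g-merge {:n1 2} {:n1 1 :n2 3}))   ;=> 5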
Any idea why we’d want 2 counters?
The max operation will not work with a single counter – we can’t handle duplicate messages
Allowing max values
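Hence the two counters: a grow-only counter for increments and another for decrements, so the max-based merge never loses a decrement. A minimal sketch along the same lines as the G-Counter above (my illustration):

;; PN-Counter: a pair of G-Counters; value = sum(increments) - sum(decrements).
(defn pn-inc   [c node] (update-in c [:p node] (fnil inc 0)))
(defn pn-dec   [c node] (update-in c [:n node] (fnil inc 0)))
(defn pn-merge [a b] {:p (merge-with max (:p a) (:p b))
                      :n (merge-with max (:n a) (:n b))})
(defn pn-value [{:keys [p n]}]
  (- (reduce + 0 (vals p)) (reduce + 0 (vals n))))

(pn-value (pn-merge {:p {:n1 3} :n {:n1 1}} {:p {:n2 2} :n {}}))   ;=> 4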
Need causal ordering (remove => add => remove != remove => remove => add)
2 sets
Need ordering
Will also need 2 sets to support removes
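That is the two-phase set: a grow-only “added” set plus a grow-only “removed” (tombstone) set, with the caveat that a removed element can never be re-added. A minimal sketch (my illustration):

;; 2P-Set: an element is present if it is in :added and not in :removed.
;; A remove wins permanently, regardless of merge order (hence OR-Set for re-adds).
(defn tp-add       [s x] (update s :added conj x))
(defn tp-remove    [s x] (update s :removed conj x))
(defn tp-merge     [a b] {:added   (into (:added a) (:added b))
                          :removed (into (:removed a) (:removed b))})
(defn tp-contains? [{:keys [added removed]} x]
  (and (contains? added x) (not (contains? removed x))))

(def s0 {:added #{} :removed #{}})
(tp-contains? (tp-merge (tp-add s0 :x) (-> s0 (tp-add :x) (tp-remove :x))) :x)   ;=> false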