So you think you can scale

•Download as PPTX, PDF•

10 likes•26,842 views

- Layer 7 load balancing (L7 LB) is needed for modern applications as architectures and users have changed from monolithic to microservices and APIs, and from only humans to humans, systems, and things. - Scaling modern applications requires choosing an architecture like horizontal duplication, functional decomposition, or data partitioning along the X, Y, and Z axes rather than just choosing a load balancing algorithm. - Architecting for scalability involves variations of L7 LB techniques like data partitioning, URL/HTTP dispatch, dynamic routing, functional decomposition, and API metering and versioning. Key considerations for scaling include API design, user/entity identification, API versioning, and monitoring.

Technology

SO YOU THINK YOU CAN
S C A L E
@lmacvittie @F5Networks

100 Milliseconds Slower
-1% SALES -0.2% SEARCHES -2% CONVERSION
$660M $45M $244M
H/T James Urquhart, SOASTA Data: Gartner, Walmart

1 Minute of Downtime
Data: Emerson Power
Costs an average of $7300
Average total cost of downtime per year across industries
PRODUCTIVITY IT PRODUCTIVITY LOST REVENUE
$53,608$140,543 $183,724

POLB
(Plain Old Load Balancing)
We throw POLB in front of everything

But architectures and apps are changing
THEN
MONOLITHIC MICROSERVICES & APIs
NOW

And so are “users”
THEN
Humans
NOW
Humans, Systems, Things

L7 LB
Layer 7 Load Balancing
Modern Apps Need Modern Scale

Scaling Modern Apps
THEN
CHOOSING AN
ALGORITHM
NOW
• Round Robin
• Least Connections
• Fastest Response
CHOOSING AN
ARCHITECTURE
• Horizontal
• Functional
• Partitioning


Growth and Innovation
Architectural Debt
Software has technical Debt. Networks have architectural debt.

L7 LB
Scalability architectures based on Scale Cube
Horizontal Duplication
Functional
Decomposition
Data Partitioning
(Sharding)
X-AXIS Y-AXIS Z-AXIS

• Like clustering
• Leverages cloning (horizontal duplication)
• Typically operates at layer 4 (TCP)
• This is POLB
X-AXIS

Y-AXIS
• Like routing / URL dispatch
• Uses identifiable variables (usually from HTTP headers)
• Operates at layer 7 (HTTP)
/login/
/checkout/
/search/

Z-AXIS
/car/[A-M]/
/car/[N-U]/
/car/[V-Z]/
• Like sharding
• Uses one or more identifiable variables (usually from HTTP headers)
• Operates at layer 7 (HTTP)

API
APIAPIAPI
q
API Dispatch
q
POLB
API Façade

• Data Partitioning (Sharding)
Architectures
• URL/HTTP dispatch
• Dynamic routing based on backend
data
• Scaling by Functional Decomposition
• API metering
• App versioning / migration
• API deprecation
L7 LB
Architecting
Scalability

Things to Consider Sooner Rather than Later
Is your API design well-suited to scaling in any way other than POLB?
How do you identify people, systems, and things?
How do you distinguish between API versions?
Multi-tenant or single tenant microservices?
The answers to these questions will impact your scaling architectural options*.
Emacs or vi?
*Maybe not that last one.
How are you monitoring (and what are you measuring with it)?

THANK YOU!
@lmacvittie @F5Networks
l.macvittie@f5.com

What's hot

Api gateway in microservicesKunal Hire

Magento Developer Talk. Microservice Architecture and Actor ModelIgor Miniailo

Scaling from 1 to 10 million users - HailoBoyan Dimitrov

Cultivating an eco system of success Kemp

Microservices: next-stepsBoyan Dimitrov

Github Projects Overview and IBM Streams V4.1lisanl

Moving to microservices – a technology and organisation transformational journeyBoyan Dimitrov

apidays LIVE Australia - Leveraging DevOps to visualize your digital ecosyste...apidays

The Most Common Errors That Aren’t CaughtNordic APIs

Introduction to the Spark MLLib Toolkit in IBM Streams V4.1lisanl

Continuous Integration and Delivery at Shapeways (Matt Boyle)Nordic APIs

Will ServerLess kill containers and OperationsStephane Woillez

Apache Flink Online TrainingLearntek1

Reducing MTTR and False Escalations: Event Correlation at LinkedInMichael Kehoe

How do async ap is survive in a rest world Luca Mattia Ferrari

Apache flinkJanu Jahnavi

REST vs. GraphQL: Critical LookNordic APIs

Let's Talk ProIV and AlexaAndrew Turner

Multi Team ArchitectureSigma Software

Apache flinkJanu Jahnavi

What's hot (20)

Api gateway in microservices

Magento Developer Talk. Microservice Architecture and Actor Model

Scaling from 1 to 10 million users - Hailo

Cultivating an eco system of success

Microservices: next-steps

Github Projects Overview and IBM Streams V4.1

Moving to microservices – a technology and organisation transformational journey

apidays LIVE Australia - Leveraging DevOps to visualize your digital ecosyste...

The Most Common Errors That Aren’t Caught

Introduction to the Spark MLLib Toolkit in IBM Streams V4.1

Continuous Integration and Delivery at Shapeways (Matt Boyle)

Will ServerLess kill containers and Operations

Apache Flink Online Training

Reducing MTTR and False Escalations: Event Correlation at LinkedIn

How do async ap is survive in a rest world

Apache flink

REST vs. GraphQL: Critical Look

Let's Talk ProIV and Alexa

Multi Team Architecture

Apache flink

Viewers also liked

State of Application Delivery 2017 - DevOps Insights Lori MacVittie

Rancher 快速打造叢集的解決方案Miles Chou

F5 Networks BIG-IP LTM Virtual EditionDSorensenCPR

HTTP by Hand: Exploring HTTP/1.0, 1.1 and 2.0Cory Forsyth

LTM essentialsbharadwajv

You are valuableolusegun victor Mamora

Csfpt ppcr mars 2017Dominique Gayraud

C All 2008 7 22Donna Davidson

Missing vin (1)Osopher

Debugging Javascript - 0 to HeisenbergChris Morrow

GeoMapFish, the Open Source WebGISCamptocamp

Dossier Pedagogique Festival Electrochoc 12 Les Abattoirs SMAC - Scène de Musiques Actuelles

Maar wat is Sosiale Media?KuberKat SwartBelt

Moving beyond Blackboard: The VLE journey at DundeeNatalie Lafferty

A Measurement Study of 4chan’s Politically Incorrect Forum and Its Effects on...Emiliano De Cristofaro

Viewers also liked (15)

State of Application Delivery 2017 - DevOps Insights

Rancher 快速打造叢集的解決方案

F5 Networks BIG-IP LTM Virtual Edition

HTTP by Hand: Exploring HTTP/1.0, 1.1 and 2.0

LTM essentials

You are valuable

Csfpt ppcr mars 2017

C All 2008 7 22

Missing vin (1)

Debugging Javascript - 0 to Heisenberg

GeoMapFish, the Open Source WebGIS

Dossier Pedagogique Festival Electrochoc 12

Maar wat is Sosiale Media?

Moving beyond Blackboard: The VLE journey at Dundee

A Measurement Study of 4chan’s Politically Incorrect Forum and Its Effects on...

Similar to So you think you can scale

REST - What's It All About? (SAP TechEd 2012, CD110)Sascha Wenninger

Future of SOA & Modern APIsRam Lakshmanan

Connect js nodejs_api_shubhraShubhra Kar

Building for, perceiving and measuring performance for mobile webRobin Glen

Data Summer Conf 2018, “Building unified Batch and Stream processing pipeline...Provectus

What is API - Understanding API SimplifiedJubin Aghara

Mmc Lsmw Intropeach9613

Role of Rest vs. Web Services and EIWSO2

Data Pipelines - Big Data meets Salesforceagarciaodeian

Moving existing apps to the cloudTiera Fann, MBA

Dubbo and Weidian's practice on micro-service architectureHuxing Zhang

Node.js Frameworks & Design Patterns WebinarShubhra Kar

Data Pipelines: Big Data Meets SalesforceSalesforce Developers

Angular jS Introduction by GoogleASG

Picking the Right Node.js Framework for Your Use CaseJimmy Guerrero

(ATS6-DEV02) Web Application StrategiesBIOVIA

Data Pipelines -Big Data Meets SalesforceCarolEnLaNube

APEX Alpe Adria Mike Hichwa Keynote April 11th 2019- ZagrebMichael Hichwa

[Luca/Van Campenhoudt] Microsoft Flow Beyone the Limits: Tips, Pitfalls, Patt...European Collaboration Summit

Microsoft Flow best practices European Collaboration Summit 2018serge luca

Similar to So you think you can scale (20)

REST - What's It All About? (SAP TechEd 2012, CD110)

Future of SOA & Modern APIs

Connect js nodejs_api_shubhra

Building for, perceiving and measuring performance for mobile web

Data Summer Conf 2018, “Building unified Batch and Stream processing pipeline...

What is API - Understanding API Simplified

Mmc Lsmw Intro

Role of Rest vs. Web Services and EI

Data Pipelines - Big Data meets Salesforce

Moving existing apps to the cloud

Dubbo and Weidian's practice on micro-service architecture

Node.js Frameworks & Design Patterns Webinar

Data Pipelines: Big Data Meets Salesforce

Angular jS Introduction by Google

Picking the Right Node.js Framework for Your Use Case

(ATS6-DEV02) Web Application Strategies

Data Pipelines -Big Data Meets Salesforce

APEX Alpe Adria Mike Hichwa Keynote April 11th 2019- Zagreb

[Luca/Van Campenhoudt] Microsoft Flow Beyone the Limits: Tips, Pitfalls, Patt...

Microsoft Flow best practices European Collaboration Summit 2018

Recently uploaded

SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero

Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge

Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely

Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi

Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren

E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxnull - The Open Security Community

Powerpoint exploring the locations used in television show Time Clashcharlottematthew16

Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited

"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays

CloudStudio User manual (basic edition):comworks

What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett

SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal

The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech

Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University

Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity

New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada

Commit 2024 - Secret Management made easyAlfredo García Lavilla

Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz

Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst

Search Engine Optimization SEO PDF for 2024.pdfRankYa

Recently uploaded (20)

SIP trunking in Janus @ Kamailio World 2024

Designing IA for AI - Information Architecture Conference 2024

Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf

Vertex AI Gemini Prompt Engineering Tips

Advanced Test Driven-Development @ php[tek] 2024

E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx

Powerpoint exploring the locations used in television show Time Clash

Ensuring Technical Readiness For Copilot in Microsoft 365

"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack

CloudStudio User manual (basic edition):

What's New in Teams Calling, Meetings and Devices March 2024

SAP Build Work Zone - Overview L2-L3.pptx

The Ultimate Guide to Choosing WordPress Pros and Cons

Nell’iperspazio con Rocket: il Framework Web di Rust!

Dev Dives: Streamline document processing with UiPath Studio Web

New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024

Commit 2024 - Secret Management made easy

Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost

Human Factors of XR: Using Human Factors to Design XR Systems

Search Engine Optimization SEO PDF for 2024.pdf

So you think you can scale

1. SO YOU THINK YOU CAN S C A L E @lmacvittie @F5Networks

2.    Why do we scale?

3. 100 Milliseconds Slower -1% SALES -0.2% SEARCHES -2% CONVERSION $660M $45M $244M H/T James Urquhart, SOASTA Data: Gartner, Walmart

4. 1 Minute of Downtime Data: Emerson Power Costs an average of $7300 Average total cost of downtime per year across industries PRODUCTIVITY IT PRODUCTIVITY LOST REVENUE $53,608$140,543 $183,724

5. =

6. UP OUT How do we scale today?

7. POLB (Plain Old Load Balancing) We throw POLB in front of everything

8. But architectures and apps are changing THEN MONOLITHIC MICROSERVICES & APIs NOW

9. And so are “users” THEN Humans NOW Humans, Systems, Things

10. L7 LB Layer 7 Load Balancing Modern Apps Need Modern Scale

11. Scaling Modern Apps THEN CHOOSING AN ALGORITHM NOW • Round Robin • Least Connections • Fastest Response CHOOSING AN ARCHITECTURE • Horizontal • Functional • Partitioning 

12. Growth and Innovation Architectural Debt Software has technical Debt. Networks have architectural debt.

13. L7 LB Scalability architectures based on Scale Cube Horizontal Duplication Functional Decomposition Data Partitioning (Sharding) X-AXIS Y-AXIS Z-AXIS

14. • Like clustering • Leverages cloning (horizontal duplication) • Typically operates at layer 4 (TCP) • This is POLB X-AXIS

15. Y-AXIS • Like routing / URL dispatch • Uses identifiable variables (usually from HTTP headers) • Operates at layer 7 (HTTP) /login/ /checkout/ /search/

16. Z-AXIS /car/[A-M]/ /car/[N-U]/ /car/[V-Z]/ • Like sharding • Uses one or more identifiable variables (usually from HTTP headers) • Operates at layer 7 (HTTP)

17. API APIAPIAPI q API Dispatch q POLB API Façade

18. L7 q Device Driven (Dispatch) L7 L7 L7 APIAPIAPI APIAPIAPIAPI Purpose Driven (functional decomposition) /status /update /status /update /billing /profile /update Capacity Driven (POLB) Variations on a Theme

19. • Data Partitioning (Sharding) Architectures • URL/HTTP dispatch • Dynamic routing based on backend data • Scaling by Functional Decomposition • API metering • App versioning / migration • API deprecation L7 LB Architecting Scalability

20. Things to Consider Sooner Rather than Later Is your API design well-suited to scaling in any way other than POLB? How do you identify people, systems, and things? How do you distinguish between API versions? Multi-tenant or single tenant microservices? The answers to these questions will impact your scaling architectural options*. Emacs or vi? *Maybe not that last one. How are you monitoring (and what are you measuring with it)?

21. THANK YOU! @lmacvittie @F5Networks l.macvittie@f5.com

Editor's Notes

Performance and availability. Business. Engagement. If we don’t scale, the business doesn’t grow, we impede productivity. We lose. We don’t just scale today for technical reasons. Availability and its related metric of performance are tied to business metrics today like conversion rates, abandonment, bounce. There are real dollars tied to these operational metrics. And because in many cases – regardless of industry – apps are the business, there are real business metrics tied to them, as well, such as productivity and revenue growth.
Three hundred million dollars was spent to lay a high-speed fiber optic cable from the futures market in Chicago to the exchanges in New Jersey to improve the speed of stock trading (particularly high- frequency trading — HFT) by 3 milliseconds. That’s 100 million dollars per millisecond. Most organizations work in bigger slices of time, like 100 milliseconds. A quick blink of your eye is 100 milliseconds. Google considers page load latency in scoring ad relevance. https://theeconomiccontrarian.wordpress.com/2014/04/01/what-is-worth-100-million-per-millisecond/
IT productivity is something we don’t often consider as a “thing” to be lost. But in the spring of 2000 I was a technical architect for a global transportation firm (they have big orange trucks ;-)) and we lost the primary application used to schedule and route shipments. For nearly 6 hours everyone not working to restore that system was diverted as runners between fax machines and the dispatch floor, trying to keep business running via sneaker net. That’s IT productivity lost right there, because I wasn’t coding – I was a messenger. http://www.emersonnetworkpower.com/documentation/en-us/brands/liebert/infographics/documents/ponemon-infographic-cost%20of%20downtime-r11-13-final.pdf
We scale because
How do we scale (everything) today? Up or out.
How do we scale out? This is the monolithic approach.
We need the ability to route, direct, and distribute load based on modern architectures that include highly decomposed applications (microservices), multiple versions of the same app and APIs and an increasingly diverse set of devices and things accessing those applications. That’s where L7 LB comes in. It has the visibility necessary and the location in an architecture (up front) to make the kinds of decisions required to scale modern, distributed architectures.
And that’s exactly what we need – architectures. Scalability today is not just about throwing an LB in front of that API or service. It’s not even about how it fits operationally into the architecture. Every LB out there can be shoved into a VM or container and automated using APIs or scripts today. That’s the easy part. The harder part is about how to enable the deployment of scalable architectures – either by being flexible enough to route in a way that’s needed or by being a part of the architecture itself. You want to be able to scale like Pinterest: from one little MySQL server to 180 Web Engines, 240 API Engines, 88 MySQL DBs (cc2.8xlarge) + 1 slave each, 110 Redis Instances, and 200 Memcache Instances. The secret? Architecture. Architecting for scale doesn’t mean just choosing stateless over stateful services. It doesn’t just mean choosing a data sharding strategy or whether to decompose by function or object. It means considering how the scale fits into the architecture what it can do for you to help when it’s needed most.
This is why you should think about architecting scale, not just deploying it. Complexity. Breaking up apps into microservices and APIs means we’re simplifying development but complexifying deployment. Operationally, we’re adding more and more components – like load balancing – to the mix. Guys from Pinterest: ““Sharding makes schema design harder · Waiting too long makes the transition harder” It’s important to think about the entire architecture – including the LB – before it’s deployed. How it scales and participates and even monitors is critical for keeping both technical and architectural debt down.
At the application tier, eBay segments different functions into separate application pools. Selling functionality is served by one set of application servers, bidding by another, search by yet another. In total, they organize roughly 16,000 application servers into 220 different pools. This allows them to scale each pool independently of one another, according to the demands and resource consumption of its function. It further allows them to isolate and rationalize resource dependencies - the selling pool only needs to talk to a relatively small subset of backend resources, for example.
API Façade: Interpolated in Network The façade provides the level of abstraction necessary for microservices-based apps today. Because each microservice are likely changing on different schedules, it’s highly disruptive to update the client for each and every one – especially early on. By virtualizing the API using a façade in the network, the API (and thus the client) can remain the same. Significant changes to any of the composite APIs can be reflected in the API façade without impacting everything else.
Variations on a Theme Now we’ve got to consider the rise of things and how they impact scalability architectures. How do we efficiently and effectively design an architecture that scales while taking into consideration performance and security for a whole bunch of different “things” and “users” at the same time? Monitoring at the first tier can also provide insight into usage patterns of consumers and devices, which means better planning around upgrades, patches, etc… that may need to be accessed. Monitoring at the purpose-driven layer gives you insight into specific usage patterns of the API/application, which can help give a better picture of how to segment future versions or new applications. Monitoring at the last tier is all about speed and capacity; improving performance and ensuring availability. This is where algorithms and monitoring are most important, because both have a serious impact operationally and on the business.
You’re architecting scale rather than bolting it on to the front.
Well designed APIs are consistent and able to be parsed into their respective scalability domains. Clients should provide some indication (in headers, in data, in URL) of what they are to assist in security and performance and scaling. API versioning is a significant source of frustration. Are versions clearly identifiable such that upstream dispatching can assist in a more seamless upgrade/migration experience? Tenancy of microservices has a big impact on the network and scalability. Decide early which way you’re going to go – and whether you’re going to leverage the network to do it. Right answer is always vi.

So you think you can scale

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Viewers also liked

Viewers also liked (15)

Similar to So you think you can scale

Similar to So you think you can scale (20)

More from Lori MacVittie

More from Lori MacVittie (13)

Recently uploaded

Recently uploaded (20)

So you think you can scale

Editor's Notes