Alluxio Monthly Webinar | Why a Multi-Cloud Strategy Matters for Your AI Platform

Alluxio Monthly Webinar, Feb. 27, 2024
For more Alluxio Events: https://www.alluxio.io/events/

Speaker: Tarik Bennett (Senior Solutions Engineer, Alluxio)

As GenAI and AI continue to transform businesses, scaling these workloads requires optimized underlying infrastructure. A multi-cloud architecture allows organizations to leverage different cloud services to meet diverse workload demands while maximizing efficiency, reducing costs, and avoiding vendor lock-in. However, achieving a multi-cloud vision can be challenging. In this webinar, Tarik will share how an agnostic data layer, like Alluxio, allows you to embrace the separation of storage from compute and simplify the adoption of multi-cloud for AI.

- Learn why leveraging multiple cloud providers is critical for balancing the performance, scalability, and cost of your AI platform
- Discover how an agnostic data layer like Alluxio provides seamless multi-cloud data access that bridges storage and compute without data replication
- Gain insights into real-world examples and best practices for deploying AI across on-prem, hybrid, and multi-cloud environments

Webinar:
Why a Multi-Cloud Strategy
Matters for Your AI Platform
Tarik Bennett
tarik.bennett@alluxio.com
February 27th, 2024

Senior Solutions Engineer
@ Alluxio
Tarik Bennett

Balancing performance, scalability, and cost
Agnostic data layer
Best practices for hybrid and multi-cloud
Agenda

Managing Costs
Cloud Agility or Resource Availability
Training Efficiency
Primary Scenarios Addressed

Source: Gartner 2023
1. By 2028, the adoption of AI will culminate in over
50% of cloud compute resources… up from less
than 10% in 2023.
2. Global spending on public cloud services is forecast to increase 20.4% in 2024… the
source of growth will be combination of cloud vendor price increases and increased
utilization.
3. Deep learning models fed by images, internet-scale applications or even telemetry data
have ever growing data requirements.
AI Adoption is Ballooning Cloud Costs

● Efficient distributed computing
● Workload scheduling
● Modernizing or reducing legacy storage
● Minimizing data movement
● Improving data access
● Increasing scalability
Efficiencies via Platform Improvements

Source: Gartner 2023
According to the survey, almost half (47%) of C-suite
executives don’t feel prepared for the accelerating rate
of technological change.
Further, only 27% claim their organizations are ready to scale up generative AI, and 44% say it
will take more than six months to do so and take advantage of the potential benefits.
Scalability and Cloud Agility

Technical
● Improves scalability
● Enables hybrid cloud
● Expanded access to GPUs
● Best-of-breed AI tools available
Non-Technical
● Leverage in cloud negotiations
● Security and governance, privacy, etc
● Service resilience
● Flexible access to the most cost-effective resources
Why Multi-Cloud?

Agility Comes with Some Overhead
● Data replication between DCs or regions
Multi-Cloud Challenges
Source: Alluxio

Agility Comes with Some Overhead
● Data replication between DCs or regions
● Disruptive, costly or prolonged migrations to upgrade
HDFS
Object Store
Multi-Cloud Challenges

Agility Comes with Some Overhead
● Data replication between DCs or regions
● Disruptive, costly or prolonged migrations to upgrade
● Overlapping resources in cloud + on-prem
compute compute compute
Multi-Cloud Challenges

Given Multi-Cloud Benefits for AI, You Can Optimize
● Simplify wherever possible
● Reduce replication wherever possible
● Find cost efficiencies via caching or other means
● Increase data locality
● Unify data access
● Increase throughput of commodity storage
● Reduce bandwidth congestion
Best Practices
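The "unify data access" and "reduce replication" practices can be illustrated with a small sketch: applications address data through one logical path, and a mount table decides which backing store serves it, so moving a dataset between clouds does not require changing application code. This is a minimal illustration, not Alluxio's implementation or API; the class `MountTable`, its methods, and the bucket and host names are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class MountTable:
    """Maps logical path prefixes to backing-store URIs (hypothetical sketch)."""

    mounts: dict[str, str] = field(default_factory=dict)

    def mount(self, logical_prefix: str, backend_uri: str) -> None:
        # Normalize trailing slashes so "/data/train/" and "/data/train" match.
        self.mounts[logical_prefix.rstrip("/")] = backend_uri.rstrip("/")

    def resolve(self, logical_path: str) -> str:
        # The longest matching prefix wins, so nested mounts override parents.
        candidates = [
            prefix
            for prefix in self.mounts
            if logical_path == prefix or logical_path.startswith(prefix + "/")
        ]
        if not candidates:
            raise KeyError(f"no mount covers {logical_path}")
        best = max(candidates, key=len)
        return self.mounts[best] + logical_path[len(best):]


# Example mounts; the bucket and namenode names are made up for illustration.
table = MountTable()
table.mount("/data/train", "s3://example-train-bucket/v1")
table.mount("/data/archive", "hdfs://example-namenode:8020/archive")

print(table.resolve("/data/train/images/0001.jpg"))
```

Remounting `/data/train` onto a different store changes where reads go while every caller keeps using the same logical path, which is the property that makes a migration less disruptive.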

● Multi-Cloud architecture
○ Google Cloud Platform (GCP)
○ Oracle® Cloud Infrastructure (OCI)
● Data orchestration and caching
Uber Multi-Cloud Architecture (Future)
Source: Uber Jing Zhao 2024

Alluxio Data Platform
High Performance data access, unified global view

Portability via Alluxio Kubernetes Operator

Reduced Data Replication
Source: Alluxio

Some data cannot be persisted in the cloud. Security teams will often
approve ephemeral cache, while other options will be denied.
High Performance Data Access
Sensitive model training data
Data evicted from the cache
Benefits of Caching for Sensitive Data
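The idea on this slide, that sensitive data is held only temporarily and then evicted, can be sketched as a read-through cache that keeps entries in memory, drops them after a time-to-live, and evicts the least recently used entry when full. This is a simplified illustration under assumptions of my own, not how Alluxio implements caching; `EphemeralCache`, `load_from_on_prem`, and the capacity and TTL values are hypothetical.

```python
import time
from collections import OrderedDict
from typing import Callable


class EphemeralCache:
    """In-memory read-through cache; nothing is written to persistent storage."""

    def __init__(self, capacity: int, ttl_seconds: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.capacity = capacity
        self.ttl = ttl_seconds
        self.clock = clock
        # key -> (time cached, data), ordered from least to most recently used
        self._entries: "OrderedDict[str, tuple[float, bytes]]" = OrderedDict()

    def get(self, key: str, load: Callable[[str], bytes]) -> bytes:
        now = self.clock()
        hit = self._entries.get(key)
        if hit is not None and now - hit[0] < self.ttl:
            self._entries.move_to_end(key)  # mark as most recently used
            return hit[1]
        data = load(key)  # miss or expired: read from the source of record
        self._entries[key] = (now, data)
        self._entries.move_to_end(key)
        while len(self._entries) > self.capacity:
            self._entries.popitem(last=False)  # evict least recently used
        return data

    def purge_expired(self) -> None:
        now = self.clock()
        for key in [k for k, (t, _) in self._entries.items() if now - t >= self.ttl]:
            del self._entries[key]

    def __contains__(self, key: str) -> bool:
        return key in self._entries


# Usage with a fake clock so the behavior is deterministic.
fake_now = [0.0]
loads: list[str] = []


def load_from_on_prem(key: str) -> bytes:
    loads.append(key)  # record each trip to the (hypothetical) on-prem store
    return key.encode()


cache = EphemeralCache(capacity=2, ttl_seconds=60, clock=lambda: fake_now[0])
cache.get("a", load_from_on_prem)
cache.get("b", load_from_on_prem)
cache.get("a", load_from_on_prem)  # served from cache, no new load
cache.get("c", load_from_on_prem)  # cache is full, so "b" is evicted
fake_now[0] = 61.0
cache.purge_expired()              # "a" and "c" are now past their TTL
```

Because the entries live only in process memory and are bounded by both size and age, the cloud side never holds a durable copy of the data.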

Standalone Cluster
High Performance Data Access Layer
Data from multiple sources served to GPU nodes
Virtual Caching Across Local GPU Storage
Data source synced to Virtual Alluxio Storage and shared between GPU nodes
Alluxio Deployment Options for AI

BUSINESS BENEFIT: 2 - 4X faster time-to-market
TECH BENEFIT: Increase GPU utilization from 50% to 93%
Before Alluxio: (1) Low GPU Utilization, (2) Overloaded Storage, (3) Network Congestion & Slow Model Refresh
Diagram: Training Clouds, Offline Cloud, Online Cloud (File System, Training Data, Models, Model Training, Model Deployment, Model Inference, Downstream Applications, Model Update)
APAC Quora CASE STUDY: High Performance AI Platform for LLM
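To put the 50% and 93% GPU utilization figures in perspective, the arithmetic below compares useful GPU time for the same allocation. It is a back-of-the-envelope sketch: the 1,000 allocated GPU-hours is a made-up number for illustration, and the calculation is not a reconstruction of how the case study measured its results.

```python
def useful_gpu_hours(allocated_hours: float, utilization: float) -> float:
    """GPU-hours spent computing rather than waiting on data."""
    return allocated_hours * utilization


ALLOCATED_HOURS = 1_000.0  # hypothetical allocation, not from the slides

before = useful_gpu_hours(ALLOCATED_HOURS, 0.50)
after = useful_gpu_hours(ALLOCATED_HOURS, 0.93)

# Ratio of useful work for the same allocation: 0.93 / 0.50 = 1.86
gain = after / before

# Allocation needed at 93% utilization to match the useful work done at 50%
hours_for_same_work = before / 0.93

print(f"useful work gain: {gain:.2f}x")
print(f"hours needed for the same work: {hours_for_same_work:.0f}")
```

At the same spend, the higher utilization yields about 1.86 times the useful GPU time; equivalently, the same work needs a little over half the allocated hours.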

Alluxio Monthly Webinar | Why a Multi-Cloud Strategy Matters for Your AI Platform

1. Webinar: Why a Multi-Cloud Strategy Matters for Your AI Platform Tarik Bennett tarik.bennett@alluxio.com February 27th, 2024

2. Senior Solutions Engineer @ Alluxio Tarik Bennett

3. Balancing performance, scalability, and cost Agnostic data layer Best practices for hybrid and multi-cloud Agenda

4. Managing Costs Cloud Agility or Resource Availability Training Efficiency Primary Scenarios Addressed

5. Source: Gartner 2023 1. By 2028, the adoption of AI will culminate in over 50% of cloud compute resources… up from less than 10% in 2023. 2. Global spending on public cloud services is forecast to increase 20.4% in 2024… the source of growth will be combination of cloud vendor price increases and increased utilization. 3. Deep learning models fed by images, internet-scale applications or even telemetry data have ever growing data requirements. AI Adoption is Ballooning Cloud Costs

6. ● Efficient distributed computing ● Workload scheduling ● Modernizing or reducing legacy storage ● Minimizing data movement ● Improving data access ● Increasing scalability Efficiencies via Platform Improvements

7. Source: Gartner 2023 According to the survey, almost half (47%) of C-suite executives don’t feel prepared for the accelerating rate of technological change. Further, only 27% claim their organizations are ready to scale up generative AI, and 44% say it will take more than six months to do so and take advantage of the potential benefits. Scalability and Cloud Agility

8. Technical ● Improves scalability ● Enables hybrid cloud ● Expanded access to GPUs ● Best-of-breed AI tools available Non-Technical ● Leverage in cloud negotiations ● Security and governance, privacy, etc ● Service resilience ● Flexible access to the most cost-effective resources Why Multi-Cloud?

9. Agility Comes with Some Overhead ● Data replication between DCs or regions Multi-Cloud Challenges Source: Alluxio

10. Agility Comes with Some Overhead ● Data replication between DCs or regions ● Disruptive, costly or prolonged migrations to upgrade HDFS Object Store Multi-Cloud Challenges

11. Agility Comes with Some Overhead ● Data replication between DCs or regions ● Disruptive, costly or prolonged migrations to upgrade ● Overlapping resources in cloud + on-prem compute compute compute Multi-Cloud Challenges

12. Agility Comes with Some Overhead ● Data replication between DCs or regions ● Disruptive, costly or prolonged migrations to upgrade ● Overlapping resources in cloud + on-prem ● Need to address non-technical requirements within CSPs Multi-Cloud Challenges

13. Given Multi-Cloud Benefits for AI, You Can Optimize ● Simplify wherever possible ● Reduce replication wherever possible ● Find cost efficiencies via caching or other means ● Increase data locality ● Unify data access ● Increase throughput of commodity storage ● Reduce bandwidth congestion Best Practices

14. ● Multi-Cloud architecture ○ Google Cloud Platform (GCP) ○ Oracle® Cloud Infrastructure (OCI) ● Data orchestration and caching Uber Multi-Cloud Architecture (Future) Source: Uber Jing Zhao 2024

15. Alluxio Intro

16. Alluxio Data Platform High Performance data access, unified global view

17.

18. Portability via Alluxio Kubernetes Operator

19. Reduced Data Replication Source: Alluxio

20. Some data cannot be persisted in the cloud. Security teams will often approve ephemeral cache, while other options will be denied. High Performance Data Access Sensitive model training data Data evicted from the cache Benefits of Caching for Sensitive Data

21. Standalone Cluster High Performance Data Access Layer Data from multiple sources served to GPU nodes Virtual Caching Across Local GPU Storage Data source synced to Virtual Alluxio Storage and shared between GPU nodes Alluxio Deployment Options for AI

22. Case Study

23. BUSINESS BENEFIT: TECH BENEFIT: Increase GPU utilization 50% 93% File System Training Data Training Data Models Training Data Models Model Training Model Training Model Deployment Model Inference Downstream Applications Model Update Training Clouds Offline Cloud Online Cloud APAC Quora CASE STUDY: High Performance AI Platform for LLM 2 - 4X faster time-to-market Before Alluxio: (1) Low GPU Utilization, (2) Overloaded Storage, (3) Network Congestion & Slow Model Refresh

24. Thank You!
