Submit Search
Upload
Enterprise machine learning on k8s lessons learned and the road ahead
•
0 likes
•
174 views
Timothy Chen
Follow
Kubecon 2018 Seattle presentation
Read less
Read more
Software
Report
Share
Report
Share
1 of 27
Download now
Download to read offline
Recommended
Why AIOps Matters For Kubernetes
Why AIOps Matters For Kubernetes
Timothy Chen
Keeping your Kubernetes Cluster Secure
Keeping your Kubernetes Cluster Secure
Gene Gotimer
Can I Contain This?
Can I Contain This?
Eficode
Welcome - Kubernetes for the Enterprise - London
Welcome - Kubernetes for the Enterprise - London
VMware Tanzu
DCSF 19 Developing Apps with Containers, Functions and Cloud Services
DCSF 19 Developing Apps with Containers, Functions and Cloud Services
Docker, Inc.
GitOps, Jenkins X &Future of CI/CD
GitOps, Jenkins X &Future of CI/CD
Rakuten Group, Inc.
Episode 1: Building Kubernetes-as-a-Service
Episode 1: Building Kubernetes-as-a-Service
Mesosphere Inc.
Cloud monitoring
Cloud monitoring
Gang Tao
Recommended
Why AIOps Matters For Kubernetes
Why AIOps Matters For Kubernetes
Timothy Chen
Keeping your Kubernetes Cluster Secure
Keeping your Kubernetes Cluster Secure
Gene Gotimer
Can I Contain This?
Can I Contain This?
Eficode
Welcome - Kubernetes for the Enterprise - London
Welcome - Kubernetes for the Enterprise - London
VMware Tanzu
DCSF 19 Developing Apps with Containers, Functions and Cloud Services
DCSF 19 Developing Apps with Containers, Functions and Cloud Services
Docker, Inc.
GitOps, Jenkins X &Future of CI/CD
GitOps, Jenkins X &Future of CI/CD
Rakuten Group, Inc.
Episode 1: Building Kubernetes-as-a-Service
Episode 1: Building Kubernetes-as-a-Service
Mesosphere Inc.
Cloud monitoring
Cloud monitoring
Gang Tao
Breaking the Monolith
Breaking the Monolith
VMware Tanzu
Intro to Google Cloud Platform Data Engineering.- Endpoints
Intro to Google Cloud Platform Data Engineering.- Endpoints
Joseph Holbrook, Chief Learning Officer (CLO)
Kubernetes data science and machine learning
Kubernetes data science and machine learning
Kublr
High-Precision GPS Positioning for Spring Developers
High-Precision GPS Positioning for Spring Developers
VMware Tanzu
Secure Infrastructure Provisioning with Terraform Cloud, Vault + GitLab CI
Secure Infrastructure Provisioning with Terraform Cloud, Vault + GitLab CI
Mitchell Pronschinske
Manage thousands of k8s applications with minimal efforts using kube carrier
Manage thousands of k8s applications with minimal efforts using kube carrier
LibbySchulze
The Good, the Bad and the Ugly of Migrating Hundreds of Legacy Applications ...
The Good, the Bad and the Ugly of Migrating Hundreds of Legacy Applications ...
Josef Adersberger
Webinar: Operating Kubernetes at Scale
Webinar: Operating Kubernetes at Scale
Mesosphere Inc.
An Open, Open source way to enable your Cloud Native Journey
An Open, Open source way to enable your Cloud Native Journey
inwin stack
Top Considerations For Operating a Kubernetes Environment at Scale
Top Considerations For Operating a Kubernetes Environment at Scale
SignalFx
The Evolution of your Kubernetes Cluster
The Evolution of your Kubernetes Cluster
Kublr
Distributed architecture in a cloud native microservices ecosystem
Distributed architecture in a cloud native microservices ecosystem
Zhenzhong Xu
Kirin User Story: Migrating Mission Critical Applications to OpenStack Privat...
Kirin User Story: Migrating Mission Critical Applications to OpenStack Privat...
Motoki Kakinuma
Overseeing Ship's Surveys and Surveyors Globally Using IoT and Docker by Jay ...
Overseeing Ship's Surveys and Surveyors Globally Using IoT and Docker by Jay ...
Docker, Inc.
Continuous Everything in a Multi-cloud and Multi-platform Environment
Continuous Everything in a Multi-cloud and Multi-platform Environment
VMware Tanzu
Storage os kubernetes clusters need persistent data
Storage os kubernetes clusters need persistent data
LibbySchulze
Building cloud native apps
Building cloud native apps
Swaminathan Vetri
運用高效、敏捷全新平台極速落實雲原生開發
運用高效、敏捷全新平台極速落實雲原生開發
inwin stack
CF Summit North America 2017 - Technical Keynote
CF Summit North America 2017 - Technical Keynote
Chip Childers
PaaS application in Heroku
PaaS application in Heroku
Dileepa Jayakody
Cloud-Native Machine Learning: Emerging Trends and the Road Ahead
Cloud-Native Machine Learning: Emerging Trends and the Road Ahead
DataWorks Summit
Edge to AI: Analytics from Edge to Cloud with Efficient Movement of Machine ...
Edge to AI: Analytics from Edge to Cloud with Efficient Movement of Machine ...
Timothy Spann
More Related Content
What's hot
Breaking the Monolith
Breaking the Monolith
VMware Tanzu
Intro to Google Cloud Platform Data Engineering.- Endpoints
Intro to Google Cloud Platform Data Engineering.- Endpoints
Joseph Holbrook, Chief Learning Officer (CLO)
Kubernetes data science and machine learning
Kubernetes data science and machine learning
Kublr
High-Precision GPS Positioning for Spring Developers
High-Precision GPS Positioning for Spring Developers
VMware Tanzu
Secure Infrastructure Provisioning with Terraform Cloud, Vault + GitLab CI
Secure Infrastructure Provisioning with Terraform Cloud, Vault + GitLab CI
Mitchell Pronschinske
Manage thousands of k8s applications with minimal efforts using kube carrier
Manage thousands of k8s applications with minimal efforts using kube carrier
LibbySchulze
The Good, the Bad and the Ugly of Migrating Hundreds of Legacy Applications ...
The Good, the Bad and the Ugly of Migrating Hundreds of Legacy Applications ...
Josef Adersberger
Webinar: Operating Kubernetes at Scale
Webinar: Operating Kubernetes at Scale
Mesosphere Inc.
An Open, Open source way to enable your Cloud Native Journey
An Open, Open source way to enable your Cloud Native Journey
inwin stack
Top Considerations For Operating a Kubernetes Environment at Scale
Top Considerations For Operating a Kubernetes Environment at Scale
SignalFx
The Evolution of your Kubernetes Cluster
The Evolution of your Kubernetes Cluster
Kublr
Distributed architecture in a cloud native microservices ecosystem
Distributed architecture in a cloud native microservices ecosystem
Zhenzhong Xu
Kirin User Story: Migrating Mission Critical Applications to OpenStack Privat...
Kirin User Story: Migrating Mission Critical Applications to OpenStack Privat...
Motoki Kakinuma
Overseeing Ship's Surveys and Surveyors Globally Using IoT and Docker by Jay ...
Overseeing Ship's Surveys and Surveyors Globally Using IoT and Docker by Jay ...
Docker, Inc.
Continuous Everything in a Multi-cloud and Multi-platform Environment
Continuous Everything in a Multi-cloud and Multi-platform Environment
VMware Tanzu
Storage os kubernetes clusters need persistent data
Storage os kubernetes clusters need persistent data
LibbySchulze
Building cloud native apps
Building cloud native apps
Swaminathan Vetri
運用高效、敏捷全新平台極速落實雲原生開發
運用高效、敏捷全新平台極速落實雲原生開發
inwin stack
CF Summit North America 2017 - Technical Keynote
CF Summit North America 2017 - Technical Keynote
Chip Childers
PaaS application in Heroku
PaaS application in Heroku
Dileepa Jayakody
What's hot
(20)
Breaking the Monolith
Breaking the Monolith
Intro to Google Cloud Platform Data Engineering.- Endpoints
Intro to Google Cloud Platform Data Engineering.- Endpoints
Kubernetes data science and machine learning
Kubernetes data science and machine learning
High-Precision GPS Positioning for Spring Developers
High-Precision GPS Positioning for Spring Developers
Secure Infrastructure Provisioning with Terraform Cloud, Vault + GitLab CI
Secure Infrastructure Provisioning with Terraform Cloud, Vault + GitLab CI
Manage thousands of k8s applications with minimal efforts using kube carrier
Manage thousands of k8s applications with minimal efforts using kube carrier
The Good, the Bad and the Ugly of Migrating Hundreds of Legacy Applications ...
The Good, the Bad and the Ugly of Migrating Hundreds of Legacy Applications ...
Webinar: Operating Kubernetes at Scale
Webinar: Operating Kubernetes at Scale
An Open, Open source way to enable your Cloud Native Journey
An Open, Open source way to enable your Cloud Native Journey
Top Considerations For Operating a Kubernetes Environment at Scale
Top Considerations For Operating a Kubernetes Environment at Scale
The Evolution of your Kubernetes Cluster
The Evolution of your Kubernetes Cluster
Distributed architecture in a cloud native microservices ecosystem
Distributed architecture in a cloud native microservices ecosystem
Kirin User Story: Migrating Mission Critical Applications to OpenStack Privat...
Kirin User Story: Migrating Mission Critical Applications to OpenStack Privat...
Overseeing Ship's Surveys and Surveyors Globally Using IoT and Docker by Jay ...
Overseeing Ship's Surveys and Surveyors Globally Using IoT and Docker by Jay ...
Continuous Everything in a Multi-cloud and Multi-platform Environment
Continuous Everything in a Multi-cloud and Multi-platform Environment
Storage os kubernetes clusters need persistent data
Storage os kubernetes clusters need persistent data
Building cloud native apps
Building cloud native apps
運用高效、敏捷全新平台極速落實雲原生開發
運用高效、敏捷全新平台極速落實雲原生開發
CF Summit North America 2017 - Technical Keynote
CF Summit North America 2017 - Technical Keynote
PaaS application in Heroku
PaaS application in Heroku
Similar to Enterprise machine learning on k8s lessons learned and the road ahead
Cloud-Native Machine Learning: Emerging Trends and the Road Ahead
Cloud-Native Machine Learning: Emerging Trends and the Road Ahead
DataWorks Summit
Edge to AI: Analytics from Edge to Cloud with Efficient Movement of Machine ...
Edge to AI: Analytics from Edge to Cloud with Efficient Movement of Machine ...
Timothy Spann
The Edge to AI Deep Dive Barcelona Meetup March 2019
The Edge to AI Deep Dive Barcelona Meetup March 2019
Timothy Spann
Part 2: A Visual Dive into Machine Learning and Deep Learning
Part 2: A Visual Dive into Machine Learning and Deep Learning
Cloudera, Inc.
Edge to ai analytics from edge to cloud with efficient movement of machine data
Edge to ai analytics from edge to cloud with efficient movement of machine data
Timothy Spann
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Cloudera, Inc.
Machine Learning in the Enterprise 2019
Machine Learning in the Enterprise 2019
Timothy Spann
How to go into production your machine learning models? #CWT2017
How to go into production your machine learning models? #CWT2017
Cloudera Japan
One Hadoop, Multiple Clouds - NYC Big Data Meetup
One Hadoop, Multiple Clouds - NYC Big Data Meetup
Andrei Savu
One Hadoop, Multiple Clouds
One Hadoop, Multiple Clouds
Cloudera, Inc.
Meetup Streaming Data Pipeline Development
Meetup Streaming Data Pipeline Development
Timothy Spann
Future of Data Milwaukee Meetup Streaming Data Pipeline Development 28 June 2023
Future of Data Milwaukee Meetup Streaming Data Pipeline Development 28 June 2023
ssuser73434e
Cloudera Analytics and Machine Learning Platform - Optimized for Cloud
Cloudera Analytics and Machine Learning Platform - Optimized for Cloud
Stefan Lipp
Big Data Fundamentals 6.6.18
Big Data Fundamentals 6.6.18
Cloudera, Inc.
Big Data Fundamentals
Big Data Fundamentals
Cloudera, Inc.
Hadoop on Cloud: Why and How?
Hadoop on Cloud: Why and How?
Cloudera, Inc.
26Oct2023_Adding Generative AI to Real-Time Streaming Pipelines_ NYC Meetup
26Oct2023_Adding Generative AI to Real-Time Streaming Pipelines_ NYC Meetup
Timothy Spann
PartnerSkillUp_Enable a Streaming CDC Solution
PartnerSkillUp_Enable a Streaming CDC Solution
Timothy Spann
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
Timothy Spann
NOVA Data Science Meetup 2-21-2018 Presentation Cloudera Data Science Workbench
NOVA Data Science Meetup 2-21-2018 Presentation Cloudera Data Science Workbench
NOVA DATASCIENCE
Similar to Enterprise machine learning on k8s lessons learned and the road ahead
(20)
Cloud-Native Machine Learning: Emerging Trends and the Road Ahead
Cloud-Native Machine Learning: Emerging Trends and the Road Ahead
Edge to AI: Analytics from Edge to Cloud with Efficient Movement of Machine ...
Edge to AI: Analytics from Edge to Cloud with Efficient Movement of Machine ...
The Edge to AI Deep Dive Barcelona Meetup March 2019
The Edge to AI Deep Dive Barcelona Meetup March 2019
Part 2: A Visual Dive into Machine Learning and Deep Learning
Part 2: A Visual Dive into Machine Learning and Deep Learning
Edge to ai analytics from edge to cloud with efficient movement of machine data
Edge to ai analytics from edge to cloud with efficient movement of machine data
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Machine Learning in the Enterprise 2019
Machine Learning in the Enterprise 2019
How to go into production your machine learning models? #CWT2017
How to go into production your machine learning models? #CWT2017
One Hadoop, Multiple Clouds - NYC Big Data Meetup
One Hadoop, Multiple Clouds - NYC Big Data Meetup
One Hadoop, Multiple Clouds
One Hadoop, Multiple Clouds
Meetup Streaming Data Pipeline Development
Meetup Streaming Data Pipeline Development
Future of Data Milwaukee Meetup Streaming Data Pipeline Development 28 June 2023
Future of Data Milwaukee Meetup Streaming Data Pipeline Development 28 June 2023
Cloudera Analytics and Machine Learning Platform - Optimized for Cloud
Cloudera Analytics and Machine Learning Platform - Optimized for Cloud
Big Data Fundamentals 6.6.18
Big Data Fundamentals 6.6.18
Big Data Fundamentals
Big Data Fundamentals
Hadoop on Cloud: Why and How?
Hadoop on Cloud: Why and How?
26Oct2023_Adding Generative AI to Real-Time Streaming Pipelines_ NYC Meetup
26Oct2023_Adding Generative AI to Real-Time Streaming Pipelines_ NYC Meetup
PartnerSkillUp_Enable a Streaming CDC Solution
PartnerSkillUp_Enable a Streaming CDC Solution
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
NOVA Data Science Meetup 2-21-2018 Presentation Cloudera Data Science Workbench
NOVA Data Science Meetup 2-21-2018 Presentation Cloudera Data Science Workbench
Recently uploaded
8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students
HimanshiGarg82
The Top App Development Trends Shaping the Industry in 2024-25 .pdf
The Top App Development Trends Shaping the Industry in 2024-25 .pdf
ayushiqss
%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand
masabamasaba
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
Jittipong Loespradit
Chinsurah Escorts ☎️8617697112 Starting From 5K to 15K High Profile Escorts ...
Chinsurah Escorts ☎️8617697112 Starting From 5K to 15K High Profile Escorts ...
Nitya salvi
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
Delhi Call girls
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learn
AmarnathKambale
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
masabamasaba
Software Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
Arshad QA
10 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 2024
Mind IT Systems
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
VishalKumarJha10
Direct Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension Aid
Philip Schwarz
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
ICS
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
kalichargn70th171
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
kalichargn70th171
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
kalichargn70th171
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
Presentation.STUDIO
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
Willy Marroquin (WillyDevNET)
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
Delhi Call girls
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
Health
Recently uploaded
(20)
8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students
The Top App Development Trends Shaping the Industry in 2024-25 .pdf
The Top App Development Trends Shaping the Industry in 2024-25 .pdf
%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
Chinsurah Escorts ☎️8617697112 Starting From 5K to 15K High Profile Escorts ...
Chinsurah Escorts ☎️8617697112 Starting From 5K to 15K High Profile Escorts ...
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learn
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
Software Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
10 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 2024
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
Direct Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension Aid
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
Enterprise machine learning on k8s lessons learned and the road ahead
1.
ENTERPRISE MACHINE LEARNING
ON K8S: LESSONS LEARNED AND THE ROAD AHEAD Tim Chen, Cloudera Tristan Zajonc, Cloudera
2.
2 © Cloudera,
Inc. All rights reserved. Tim Chen Cloudera @tnachen WHO ARE WE? Lead for Cloudera Machine Learning Former Hyperpilot CEO/Cofounder Apache PMC for Drill, Mesos Tristan Zajonc Cloudera @tristanzajonc CTO for Machine Learning at Cloudera Former Sense CEO/Cofounder
3.
3 © Cloudera,
Inc. All rights reserved. DISCLAIMER The information in this document is proprietary to Cloudera. No part of this document may be reproduced or disclosed without the express prior written permission of Cloudera. This document is not binding upon Cloudera in any way (including with respect to Cloudera’s course of business, product strategy, future releases, or development commitments), and is not a part of or subject to any license agreement or other agreement you may have with Cloudera. Cloudera makes no commitments about any future developments or functionality in any Cloudera product. The development, release, and timing of release of any software features or functionality described in this document remains at the discretion of Cloudera and you should not rely on any statements about development plans or anticipated functionality in this document when making any purchasing decisions. Cloudera assumes no responsibility for errors or omissions in this document. Cloudera does not warrant the accuracy or completeness of the information, text, graphics, links or other items contained within this material. This document is provided without a warranty of any kind, either express or implied, including but not limited to the implied warranties of merchantability, fitness for a particular purpose or non-infringement. Cloudera shall have no liability for damages of any kind including without limitation direct, special, indirect or consequential damages that may result from the use of these materials.”
4.
4 © Cloudera,
Inc. All rights reserved. +
5.
5 © Cloudera,
Inc. All rights reserved. CLOUDERA DATA SCIENCE WORKBENCH Collaborative data science experience powered by Kubernetes • Built with Docker and Kubernetes • Isolated, reproducible user environments • Supports both big and small data • Local Python, R, Scala runtimes • Connect to any data source • Schedule & share GPU resources • Scale to CDH with Spark, Impala, Hive • Secure and governed by default • Easy, audited access to Kerberized clusters • Leverages shared platform services • Deployed with Cloudera Manager CDH CDH Cloudera Manager gateway node(s) CDH nodes Hive, HDFS, ... CDSW CDSW ... Master ... Engine EngineEngine EngineEngine
6.
6 © Cloudera,
Inc. All rights reserved. CLOUDERA MACHINE LEARNING (PREVIEW) Cloud-native machine learning platform for the enterprise DATA SCIENCE DATA ENGINEERING MODEL OPERATIONS CLOUDERA ML RUNTIME Python/R, Spark, TensorFlow, etc, optimized for CPU/GPU Interactive Development Batch Pipelines Predictive APIs End-to-end ML lifecycle for teams to build at scale Rapid provisioning and elastic autoscaling with multi-cloud portability powered by Kubernetes Containerized workloads for open data science with easy dependency management Distributed GPU training for accelerated deep learning Secure data access to HDFS or cloud object storage with shared schema, governance KUBERNETES Amazon EKS, Microsoft AKS, Google GKE, Red Hat OpenShift DATA CATALOG GOVERNANCESECURITY LIFECYCLEWORKLOAD XM STORAGE Amazon S3 Microsoft ADLS HDFS KUDU
7.
7 © Cloudera,
Inc. All rights reserved. SOME LESSONS LEARNED
8.
8 © Cloudera,
Inc. All rights reserved. LESSON 1: ENTERPRISE ML REQUIRES BIGGER PICTURE Streaming Ingest Batch Ingest Machine Learning Tools BI Tools and SQL Editors Data Products DATA, METADATA, SECURITY, GOVERNANCE, WORKLOAD MANAGEMENT MACHINE LEARNING DATA ENGINEERING DATA WAREHOUSE OPERATIONAL DATABASE
9.
9 © Cloudera,
Inc. All rights reserved. UBER Michelangelo: Uber’s Machine Learning Platform Source: https://eng.uber.com/michelangelo/
10.
10 © Cloudera,
Inc. All rights reserved. NETFLIX Netflix recommendation infrastructure Source: https://medium.com/netflix-techblog/netflix-at-spark-ai-summit-2018-5304749ed7fa
11.
11 © Cloudera,
Inc. All rights reserved. FACEBOOK Facebook’s AI infrastructure Source: https://www.slideshare.net/KarthikMurugesan2/facebook-machine-learning-infrastructure-2018-slides
12.
12 © Cloudera,
Inc. All rights reserved. EMERGING CONSENSUS FOR ENTERPRISE ML AT SCALE Cloud Native Infrastructure Big Data Platform ML/AI Frameworks
13.
13 © Cloudera,
Inc. All rights reserved. Data scientists want to use every library under the sun, platform should support that Workbench (interactive) ANALYZ E Experiments (batch, ad-hoc) TRAIN Reports (shared output) SHARE Jobs (batch, scheduled) REPEAT Models (RPC APIs) SERVE LOOSELY COUPLED, ML/AI FRAMEWORK AGNOSTIC LESSON 2: FOCUS ON WORKFLOWS NOT FRAMEWORKS
14.
14 © Cloudera,
Inc. All rights reserved. Cloudera ML is built around one Operator and four “CRDs”: • Session (Interactive) • Experiment (Batch) • Job (Scheduled) • Model (Online API) No long-lived operators for particular libraries: TFJob, SparkJob, etc Kubernetes is not exposed to data scientist. Goal is serverless experience. WHAT DOES THIS LOOK LIKE IN K8S? Let’s talk operators and CRDs….
15.
15 © Cloudera,
Inc. All rights reserved. cldr ml run --cmd “python train.py”
16.
16 © Cloudera,
Inc. All rights reserved. from pyspark.sql import SparkSession spark = SparkSession.builder.getOrCreate() results = sc.parallelize(xrange(0, 1000)) .map(f).collect()
17.
17 © Cloudera,
Inc. All rights reserved. ROAD AHEAD
18.
18 © Cloudera,
Inc. All rights reserved. • Unified workflow for distributed data engineering and machine learning • Simple dependency management, particularly for ML workloads • python: pip install, R: install.packages, etc • Elastic compute (CPU / GPU) in cloud • Improved utilization and multi-tenancy WHY SPARK ON KUBERNETES?
19.
19 © Cloudera,
Inc. All rights reserved. SPARK ON KUBERNETES • Spark has native integration with Kubernetes since 2.3+ • Runs Spark fully containerized on K8s • Allows Spark to add / remove executors with user specified Spark configuration • Able to leverages Kubernetes features to facilitate launching Spark (e.g: Node Affinities, RBAC, Namespaces, etc).
20.
20 © Cloudera,
Inc. All rights reserved. Kubernetes CMLCML Engine Spark Driver (Client Mode) Spark Executor Fluentd Fluentd Project Volume Project Volume Spark Executor Project Volume Hive, HDFS, HMS, Sentry, Navigator- Create service account and namespace for quota - Configure Spark driver / executor configuration to connect to k8s. Also populate kerberos related information for accessing HDFS and other CDH services. - Populate dependency management related configurations using pod template: volume, image, user information, etc. - Fluentd configured to pick up logging and metric information to external storage SPARK ON KUBERNETES IN CML
21.
21 © Cloudera,
Inc. All rights reserved. DEMO
22.
22 © Cloudera,
Inc. All rights reserved. • Kerberos support • Dynamic allocation • GPU support • Scale and automation tests • Advanced scheduling (FairScheduler, Hierarchical Queues, etc) SPARK AND K8S UPSTREAM ONGOING WORK
23.
23 © Cloudera,
Inc. All rights reserved. Proactively optimize Workloads, Application Performance, and Infrastructure Capacity for Data Warehousing, Data Engineering, and Machine Learning Environments Migrate Analyze Optimize Manage AIOPS - AI-DRIVEN OPTIMIZATION AT SCALE
24.
24 © Cloudera,
Inc. All rights reserved. AIOPS ON K8S - NODE BOTTLENECK ANALYSIS
25.
25 © Cloudera,
Inc. All rights reserved. AIOPS ON K8S - CLUSTER AUTOMATION
26.
26 © Cloudera,
Inc. All rights reserved. Questions? Want to play around with this stuff? tiny.cloudera.com/CML Relevant Sigs sig-big-data, sig-ml, sig-scheduling
27.
THANK YOU
Download now