SlideShare a Scribd company logo
1 of 31
Download to read offline
What is MLOps?🚀
Machine Learning Operations
Leonardo Moraes
Speaker
www.amaris.com
Leonardo Moraes
● Team Leader &
Experienced Consultant
Amaris Consulting, 2023
● PhD in Computer Science
USP, Brazil, in progress
● MSc in Computer Science
USP, Brazil, 2020
● Bachelor in Computer Science
UFMS, Brasil, 2017
Agenda
● How does Data Science work?
● How does MLOps support Data Science?
● How can I join this wave?
1. What is Data Science?
4
5
New tech world
Without realizing it, people generate data all the time
on the Internet
● Marketplaces - Site, payment method, search...
● Media player - Search, recommendations, advertisements...
in offline
● Supermarket - What do we buy, how to organize the rows…
● Payment - Debit card, credit card, PIX, TED, DOC...
6
According to Gartner Research
● in 2020, an average of 44 trillion of
gigabytes (zettabytes) of data in the world
● 2.2 million terabytes generated per day
New tech world
How can we analyze it?
1. Business Specialist
2. Data Analyst
3. Data Scientist
7
Example - Borrowing money
Traditional design
● If you are 25 years old or older
and have an income of R$3,000.00
● I can borrow R$ 15,000.00 to pay in 5 years
Scientific design
● Considering a risk (e.g., 5% risk),
for the bank not to lose money
● Input: Age, gender, gross salary,...
● Output: Value and financing period.
What is Data Science?
8
Data science is a multidisciplinary
field that uses scientific processes,
algorithms, and systems to extract
knowledge and insights from data.
by Google Bard
Logo generated by ideogram
2. How does Data
Science work?
9
10
Knowledge Discovery in Databases
Data Science Procedure
Advances in Knowledge Discovery and Data Mining.
Fayyad, U. M., et. al (1996)
Procedure executed by
● Data Scientist
● Data Engineer
● ML Engineer
11
Knowledge Discovery in Databases
Data Science Procedure
Advances in Knowledge Discovery and Data Mining.
Fayyad, U. M., et. al (1996)
Selection
● Identification of the subset
of data that should be
considered in the process
12
Knowledge Discovery in Databases
Data Science Procedure
Advances in Knowledge Discovery and Data Mining.
Fayyad, U. M., et. al (1996)
Preprocessing
● Data cleaning; covers any
processing about data
quality and data integrity
13
Knowledge Discovery in Databases
Data Science Procedure
Advances in Knowledge Discovery and Data Mining.
Fayyad, U. M., et. al (1996)
Transformation
● Data aggregation,
transformation; encode
data into inputs recognized
by algorithms
14
Knowledge Discovery in Databases
Data Science Procedure
Advances in Knowledge Discovery and Data Mining.
Fayyad, U. M., et. al (1996)
Data Mining
● Practice of analyzing large
databases in order to
generate new information
● Tasks: classification,
clustering, image
recognition, etc..
15
Knowledge Discovery in Databases
Data Science Procedure
Advances in Knowledge Discovery and Data Mining.
Fayyad, U. M., et. al (1996)
Interpretation
● Transform identified/inferred
patterns into knowledge
● Make new knowledge
available to customers
16
Knowledge Discovery in Databases
Data Science Procedure
Advances in Knowledge Discovery and Data Mining.
Fayyad, U. M., et. al (1996)
Data Engineer
Data Scientist
M
L
E
n
g
i
n
e
e
r
● Data Engineer - Prepare
and organize the data
● Data Scientist - Generate
knowledge and insights
● ML Engineer - Automation
and delivery a product
3. How does MLOps
support Data Science?
17
18
MLOps
In the world of data science and machine
learning, the process of developing, deploying,
and maintaining models can be complex and
challenging. MLOps, short for Machine Learning
Operations, has emerged as a crucial discipline
that aims to streamline this process and make it
more manageable, efficient, and effective.
Essential MLOps (2023),
by Data Science Horizons
19
MLOps in Data Science
MLOps: Continuous delivery and
automation pipelines in machine learning
(Google, NIPS 2014 Workshop)
Challenge
● Technical debt
● High monetary risk factors
Goal
● apply DevOps principles to
ML systems (MLOps)
● automation and monitoring
20
MLOps x DevOps
● CI is no longer only about testing and validating code, but
also testing and validating data, data schemas, and models.
● CT is a new property, unique to ML systems, that's concerned
with automatically retraining and serving the models.
● CD is no longer about a single software package or a service,
but an ML training pipeline that should automatically deploy
another service (model prediction service).
21
MLOps Maturity Levels
MLOps: Continuous delivery and automation
pipelines in machine learning (Google)
MLOps Maturity Model with
Azure Machine Learning (Azure)
Maturity Level Training Process Release Process Technology
Level 0 - No MLOps Untracked file Manual, hand-off ● Manual builds and deployments
● Manual testing of model and application
● No tracking of model performance
Level 1 - DevOps no
MLOps
Untracked file Semi-automatized ● Automated builds
● Automated tests for application code
Level 2 - Automated
Training
Tracked, run results and
model artifacts
Automated release, code
is version controlled
● Automated model training
● Tracking of model training performance
● Model orchestration and management
Level 3 - Full MLOps
Automated Retraining
Tracked, run results and
model artifacts, retraining
set up based on metrics
Automated, CI/CD
pipeline set up, A/B testing
has been added
● Automated model training and testing
● Centralized metrics from deployed model
● Automated tests for all code
22
Maturity L.0 - no MLOps
Blogpost - MLOps Maturity Model with Azure Machine Learning (Azure)
23
Maturity L.1 - DevOps no MLOps
Blogpost - MLOps Maturity Model with Azure Machine Learning (Azure)
24
Maturity L.2 - Automated Training
Blogpost - MLOps Maturity Model with Azure Machine Learning (Azure)
25
Blogpost - MLOps Maturity Model with Azure Machine Learning (Azure)
Maturity L.3 - Full MLOps
4. How can I
join this wave?
26
27
1. Learn the basics of Data Science
2. High programming Skills, in Python, CI/CD, Git, Linux
Machine Learning Engineer
General
1. Build a Portfolio, like personal projects, contributions
2. Networking - Participation in Communities and Events
3. Stay updated - Lifelong Learning!
How To Become MLOps Engineer in 2024
by Asad iqbal
28
MLOps challenge 🎯
Driving Innovation: Leveraging Machine Learning Platforms
by Leonardo Moraes, 2024
5.
Conclusion
29
End!
30
MLOps: ML more efficient, reliable, and scalable.
● Observability: The ability to monitor models helps
identify degraded performance or anomalies quickly.
● Optimization: enhance the performance, efficiency,
and scalability of machine learning systems.
● Automation: allows for smooth and lower-risk
rollouts of new ML model versions.
Team Leader &
Experienced Consultant
Leonardo Moraes
Thank you for
your attention!

More Related Content

Similar to What is Machine Learning Operations (MLOps)?

DSDT Meetup April 2021
DSDT Meetup April 2021DSDT Meetup April 2021
DSDT Meetup April 2021
DSDT_MTL
 
Building Powerful and Intelligent Applications with Azure Machine Learning
Building Powerful and Intelligent Applications with Azure Machine LearningBuilding Powerful and Intelligent Applications with Azure Machine Learning
Building Powerful and Intelligent Applications with Azure Machine Learning
David Walker, CSM,CSD,MCP,MCAD,MCSD,MVP
 

Similar to What is Machine Learning Operations (MLOps)? (20)

2016 03-16 digital energy luncheon
2016 03-16 digital energy luncheon2016 03-16 digital energy luncheon
2016 03-16 digital energy luncheon
 
Credit card fraud detection using python machine learning
Credit card fraud detection using python machine learningCredit card fraud detection using python machine learning
Credit card fraud detection using python machine learning
 
Career in Python and data science
Career in Python and data science Career in Python and data science
Career in Python and data science
 
Machine learning at scale - Webinar By zekeLabs
Machine learning at scale - Webinar By zekeLabsMachine learning at scale - Webinar By zekeLabs
Machine learning at scale - Webinar By zekeLabs
 
From DevOps to MLOps: practical steps for a smooth transition
From DevOps to MLOps: practical steps for a smooth transitionFrom DevOps to MLOps: practical steps for a smooth transition
From DevOps to MLOps: practical steps for a smooth transition
 
Concepts, use cases and principles to build big data systems (1)
Concepts, use cases and principles to build big data systems (1)Concepts, use cases and principles to build big data systems (1)
Concepts, use cases and principles to build big data systems (1)
 
Experimentation to Industrialization: Implementing MLOps
Experimentation to Industrialization: Implementing MLOpsExperimentation to Industrialization: Implementing MLOps
Experimentation to Industrialization: Implementing MLOps
 
Building Data Products with Python (Georgetown)
Building Data Products with Python (Georgetown)Building Data Products with Python (Georgetown)
Building Data Products with Python (Georgetown)
 
The Machine Learning Audit. MIS ITAC 2017 Keynote
The Machine Learning Audit. MIS ITAC 2017 KeynoteThe Machine Learning Audit. MIS ITAC 2017 Keynote
The Machine Learning Audit. MIS ITAC 2017 Keynote
 
Machine Learning for Finance Master Class
Machine Learning for Finance Master Class Machine Learning for Finance Master Class
Machine Learning for Finance Master Class
 
Introduction to ml ops in daily apps
Introduction to ml ops in daily appsIntroduction to ml ops in daily apps
Introduction to ml ops in daily apps
 
AI hype or reality
AI  hype or realityAI  hype or reality
AI hype or reality
 
DSDT Meetup April 2021
DSDT Meetup April 2021DSDT Meetup April 2021
DSDT Meetup April 2021
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
The Simple 5-Step Process for Creating a Winning Data Pipeline.pdf
The Simple 5-Step Process for Creating a Winning Data Pipeline.pdfThe Simple 5-Step Process for Creating a Winning Data Pipeline.pdf
The Simple 5-Step Process for Creating a Winning Data Pipeline.pdf
 
Operationalizing Machine Learning
Operationalizing Machine LearningOperationalizing Machine Learning
Operationalizing Machine Learning
 
Digicrome Data Science & AI 11 Month Course PDF.pdf
Digicrome Data Science & AI 11 Month Course PDF.pdfDigicrome Data Science & AI 11 Month Course PDF.pdf
Digicrome Data Science & AI 11 Month Course PDF.pdf
 
Data-X-Sparse-v2
Data-X-Sparse-v2Data-X-Sparse-v2
Data-X-Sparse-v2
 
Building Powerful and Intelligent Applications with Azure Machine Learning
Building Powerful and Intelligent Applications with Azure Machine LearningBuilding Powerful and Intelligent Applications with Azure Machine Learning
Building Powerful and Intelligent Applications with Azure Machine Learning
 
Why do the majority of Data Science projects never make it to production?
Why do the majority of Data Science projects never make it to production?Why do the majority of Data Science projects never make it to production?
Why do the majority of Data Science projects never make it to production?
 

Recently uploaded

Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider  Progress from Awareness to Implementation.pptxTales from a Passkey Provider  Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
FIDO Alliance
 
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
Muhammad Subhan
 

Recently uploaded (20)

Microsoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - QuestionnaireMicrosoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - Questionnaire
 
Google I/O Extended 2024 Warsaw
Google I/O Extended 2024 WarsawGoogle I/O Extended 2024 Warsaw
Google I/O Extended 2024 Warsaw
 
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider  Progress from Awareness to Implementation.pptxTales from a Passkey Provider  Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
 
Generative AI Use Cases and Applications.pdf
Generative AI Use Cases and Applications.pdfGenerative AI Use Cases and Applications.pdf
Generative AI Use Cases and Applications.pdf
 
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptxCyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
 
(Explainable) Data-Centric AI: what are you explaininhg, and to whom?
(Explainable) Data-Centric AI: what are you explaininhg, and to whom?(Explainable) Data-Centric AI: what are you explaininhg, and to whom?
(Explainable) Data-Centric AI: what are you explaininhg, and to whom?
 
How we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdfHow we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdf
 
ChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps Productivity
 
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
 
How to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfHow to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cf
 
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
 
1111 ChatGPT Prompts PDF Free Download - Prompts for ChatGPT
1111 ChatGPT Prompts PDF Free Download - Prompts for ChatGPT1111 ChatGPT Prompts PDF Free Download - Prompts for ChatGPT
1111 ChatGPT Prompts PDF Free Download - Prompts for ChatGPT
 
The Metaverse: Are We There Yet?
The  Metaverse:    Are   We  There  Yet?The  Metaverse:    Are   We  There  Yet?
The Metaverse: Are We There Yet?
 
Overview of Hyperledger Foundation
Overview of Hyperledger FoundationOverview of Hyperledger Foundation
Overview of Hyperledger Foundation
 
ERP Contender Series: Acumatica vs. Sage Intacct
ERP Contender Series: Acumatica vs. Sage IntacctERP Contender Series: Acumatica vs. Sage Intacct
ERP Contender Series: Acumatica vs. Sage Intacct
 
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
 
The Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightThe Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and Insight
 
Top 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development CompaniesTop 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development Companies
 
WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024
 
TopCryptoSupers 12thReport OrionX May2024
TopCryptoSupers 12thReport OrionX May2024TopCryptoSupers 12thReport OrionX May2024
TopCryptoSupers 12thReport OrionX May2024
 

What is Machine Learning Operations (MLOps)?

  • 1. What is MLOps?🚀 Machine Learning Operations Leonardo Moraes
  • 2. Speaker www.amaris.com Leonardo Moraes ● Team Leader & Experienced Consultant Amaris Consulting, 2023 ● PhD in Computer Science USP, Brazil, in progress ● MSc in Computer Science USP, Brazil, 2020 ● Bachelor in Computer Science UFMS, Brasil, 2017
  • 3. Agenda ● How does Data Science work? ● How does MLOps support Data Science? ● How can I join this wave?
  • 4. 1. What is Data Science? 4
  • 5. 5 New tech world Without realizing it, people generate data all the time on the Internet ● Marketplaces - Site, payment method, search... ● Media player - Search, recommendations, advertisements... in offline ● Supermarket - What do we buy, how to organize the rows… ● Payment - Debit card, credit card, PIX, TED, DOC...
  • 6. 6 According to Gartner Research ● in 2020, an average of 44 trillion of gigabytes (zettabytes) of data in the world ● 2.2 million terabytes generated per day New tech world How can we analyze it? 1. Business Specialist 2. Data Analyst 3. Data Scientist
  • 7. 7 Example - Borrowing money Traditional design ● If you are 25 years old or older and have an income of R$3,000.00 ● I can borrow R$ 15,000.00 to pay in 5 years Scientific design ● Considering a risk (e.g., 5% risk), for the bank not to lose money ● Input: Age, gender, gross salary,... ● Output: Value and financing period.
  • 8. What is Data Science? 8 Data science is a multidisciplinary field that uses scientific processes, algorithms, and systems to extract knowledge and insights from data. by Google Bard Logo generated by ideogram
  • 9. 2. How does Data Science work? 9
  • 10. 10 Knowledge Discovery in Databases Data Science Procedure Advances in Knowledge Discovery and Data Mining. Fayyad, U. M., et. al (1996) Procedure executed by ● Data Scientist ● Data Engineer ● ML Engineer
  • 11. 11 Knowledge Discovery in Databases Data Science Procedure Advances in Knowledge Discovery and Data Mining. Fayyad, U. M., et. al (1996) Selection ● Identification of the subset of data that should be considered in the process
  • 12. 12 Knowledge Discovery in Databases Data Science Procedure Advances in Knowledge Discovery and Data Mining. Fayyad, U. M., et. al (1996) Preprocessing ● Data cleaning; covers any processing about data quality and data integrity
  • 13. 13 Knowledge Discovery in Databases Data Science Procedure Advances in Knowledge Discovery and Data Mining. Fayyad, U. M., et. al (1996) Transformation ● Data aggregation, transformation; encode data into inputs recognized by algorithms
  • 14. 14 Knowledge Discovery in Databases Data Science Procedure Advances in Knowledge Discovery and Data Mining. Fayyad, U. M., et. al (1996) Data Mining ● Practice of analyzing large databases in order to generate new information ● Tasks: classification, clustering, image recognition, etc..
  • 15. 15 Knowledge Discovery in Databases Data Science Procedure Advances in Knowledge Discovery and Data Mining. Fayyad, U. M., et. al (1996) Interpretation ● Transform identified/inferred patterns into knowledge ● Make new knowledge available to customers
  • 16. 16 Knowledge Discovery in Databases Data Science Procedure Advances in Knowledge Discovery and Data Mining. Fayyad, U. M., et. al (1996) Data Engineer Data Scientist M L E n g i n e e r ● Data Engineer - Prepare and organize the data ● Data Scientist - Generate knowledge and insights ● ML Engineer - Automation and delivery a product
  • 17. 3. How does MLOps support Data Science? 17
  • 18. 18 MLOps In the world of data science and machine learning, the process of developing, deploying, and maintaining models can be complex and challenging. MLOps, short for Machine Learning Operations, has emerged as a crucial discipline that aims to streamline this process and make it more manageable, efficient, and effective. Essential MLOps (2023), by Data Science Horizons
  • 19. 19 MLOps in Data Science MLOps: Continuous delivery and automation pipelines in machine learning (Google, NIPS 2014 Workshop) Challenge ● Technical debt ● High monetary risk factors Goal ● apply DevOps principles to ML systems (MLOps) ● automation and monitoring
  • 20. 20 MLOps x DevOps ● CI is no longer only about testing and validating code, but also testing and validating data, data schemas, and models. ● CT is a new property, unique to ML systems, that's concerned with automatically retraining and serving the models. ● CD is no longer about a single software package or a service, but an ML training pipeline that should automatically deploy another service (model prediction service).
  • 21. 21 MLOps Maturity Levels MLOps: Continuous delivery and automation pipelines in machine learning (Google) MLOps Maturity Model with Azure Machine Learning (Azure) Maturity Level Training Process Release Process Technology Level 0 - No MLOps Untracked file Manual, hand-off ● Manual builds and deployments ● Manual testing of model and application ● No tracking of model performance Level 1 - DevOps no MLOps Untracked file Semi-automatized ● Automated builds ● Automated tests for application code Level 2 - Automated Training Tracked, run results and model artifacts Automated release, code is version controlled ● Automated model training ● Tracking of model training performance ● Model orchestration and management Level 3 - Full MLOps Automated Retraining Tracked, run results and model artifacts, retraining set up based on metrics Automated, CI/CD pipeline set up, A/B testing has been added ● Automated model training and testing ● Centralized metrics from deployed model ● Automated tests for all code
  • 22. 22 Maturity L.0 - no MLOps Blogpost - MLOps Maturity Model with Azure Machine Learning (Azure)
  • 23. 23 Maturity L.1 - DevOps no MLOps Blogpost - MLOps Maturity Model with Azure Machine Learning (Azure)
  • 24. 24 Maturity L.2 - Automated Training Blogpost - MLOps Maturity Model with Azure Machine Learning (Azure)
  • 25. 25 Blogpost - MLOps Maturity Model with Azure Machine Learning (Azure) Maturity L.3 - Full MLOps
  • 26. 4. How can I join this wave? 26
  • 27. 27 1. Learn the basics of Data Science 2. High programming Skills, in Python, CI/CD, Git, Linux Machine Learning Engineer General 1. Build a Portfolio, like personal projects, contributions 2. Networking - Participation in Communities and Events 3. Stay updated - Lifelong Learning! How To Become MLOps Engineer in 2024 by Asad iqbal
  • 28. 28 MLOps challenge 🎯 Driving Innovation: Leveraging Machine Learning Platforms by Leonardo Moraes, 2024
  • 30. End! 30 MLOps: ML more efficient, reliable, and scalable. ● Observability: The ability to monitor models helps identify degraded performance or anomalies quickly. ● Optimization: enhance the performance, efficiency, and scalability of machine learning systems. ● Automation: allows for smooth and lower-risk rollouts of new ML model versions.
  • 31. Team Leader & Experienced Consultant Leonardo Moraes Thank you for your attention!