SlideShare a Scribd company logo
1 of 117
AI Orange Belt
Session 2 - Harness the power of AI abilities
1
AI ORANGE BELT SKILLS © PROPERTY OF AI BLACK BELT
ORANGE
BELT
The prerequisites : what is AI, how
does it work in real life
How to manage and implement an
artificial intelligence project
DEFINITION
PROJECT
2
• AI is a form of advanced computer science, that learns from data in order to expand
its generalization abilities on narrow tasks, as opposed to regular software
hardcoded instructions
• AI can be subdivided into supervised learning - the bulk of modern applications,
unsupervised learning - grouping for visualization and exploration purpose mainly,
and reinforcement learning - difficult to implement but powerful in some optimization
with actions cases
• The list of tasks AI can solve can broadly be divided into : classification, prediction,
clustering, outlier detection, recommandation, data generation
• The different subdomain of applications can be determined by the data input/output
types : vision, NLP (text&speech), structured classic, robotics
What have we seen last time?
3
Supervised Learning
4
if color == "green":
return "apple"
elif color == "orange":
return "orange"
else return "banana"
“apple”
“orange”
“banana”
“apple”
5
Can you write a computer program that does that ?
“apple”
“orange”
“banana”
“banana”
if shape == "round":
if color == "green"
return "apple"
else color == "orange"
return "orange"
else return "banana"
6
“apple”
“orange”
“banana”
“banana”
if shape == "round":
if color == "green"
return "apple"
elif color == "orange"
return "orange"
else return "banana"
7
“apple”
“orange”
“banana”
“banana”
?
“apple”
8
Input (A) Output (B)
Object
Detection
Applications
email Intent detection Text
Classification
audio Text transcript
Speech
Recognition
French Dutch Machine
Translation
Object
verification or
Identification
Anomaly
Detection
9
Unsupervised Learning
10
Reinforcement Learning
11
• AI is a form of advanced computer science, that learns from data in order to expand
its generalisation abilities on narrow tasks, as opposed to regular software
hardcoded instructions
• AI can be subdivided into supervised learning - the bulk of modern applications,
unsupervised learning - grouping for visualisation and exploration purpose mainly,
and reinforcement learning - difficult to implement but powerful in some optimisation
with actions cases
• The list of tasks AI can solve can broadly be divided into : classification, prediction,
clustering, outlier detection, recommandation, data generation
• The different subdomain of applications can be determined by the data input/output
types : vision, NLP (text&speech), structured classic, robotics
What have we seen last time?
12
•Understand the limits of AI and the main biases when it comes to
create intelligent machines in real life
•Lifecycle of an AI application, and how it differs from regular
workflows
•How to detect opportunities / use cases, and evaluate their impact on
the revenue of the company. Cost per task, revenue per task
•Team management, project management (create and deploy) and data
management
Our plan for today - the real world
13
14
What barriers are faced at work ?
15
01
03
02
06
04
05
MaintenanceIdentify
DeployData
EvaluateModel
Applied AI Lifecycle © PROPERTY OF AI BLACK BELT
16
Key steps of a machine learning project
Echo / Alexa
1. Collect data
2. Train model
Iterate many times until good enough
3. Deploy model
Get data back
Maintain / update the model
01
03
02
06
04
05
MaintenanceIdentify
DeployData
EvaluateModel
Source : deeplearning.ai
17
Key steps of a machine learning project
Self-driving car
1. Collect data image position of other cars
2. Train model
3. Deploy model
Get data back
Maintain model
01
03
02
06
04
05
MaintenanceIdentify
DeployData
EvaluateModel
Source : deeplearning.ai
18
01
Select the right question
Choose the performance metric
Decide the level of explainability
Identify
Applied AI Lifecycle © PROPERTY OF AI BLACK BELT
19
Discover what’s possible
What would be helpful for your business?
20
Anything you can do with 1 second of
thought, can probably be automated today
21
“The toy arrived two days late, so I wasn’t able to give it to
my nephew for his birthday.
Can I return it ?”
“Refund request”
Refund/Shipping/OrderInput text
“Oh sorry to hear that!
I hope you nephew had a good birthday.
Yes, we can help with ...
Complex personalised
empathetic response
Input text
“Yes you can. The refund procedure is ...”
Simple responseInput text
22
Diagnose pneumonia on ~ 10.000
images
Diagnose pneumonia from 10
images of a medical textbook
Ask to perform on new type of
data
23
Take a (deep) look at your work
Break down your workflow and your business unit
24
25
Baby food ingredient: safe or spoiled?
Patient: ideal medication dosage?
Email: spam or ham?
Recorded phone call to call center: issue topic?
Bottle of wine: will I like it or not?
Steering wheel: left or right?
Photo: which animal?
Game piece: which location on the board?
Start of a sentence: end of that sentence?
Stock: tomorrow’s price?
Transaction: legitimate or fraudulent?
Data center cooling system: warmer or cooler?
Machine: when will it need maintenance?
Inventory: when to restock?
Scene description: pixels in a visual rendering?
Today’s temperature: tomorrow’s temperature?
Auction: how much to bid?
Movie: will you like it or not?
Live lecture: text captions?
Poem: what does it sound like out loud?
Image of an invoice: total amount in dollars?
Service request: waiting time?
Expense report: budget category?
Sound recording: correct text captions?
Song lyrics: language?
Sentence in English: same meaning in Chinese?
Form incorrectly filled out: correct fields?
Clothing item: skirt or blouse or …?
Video: which actors?
Video game: joystick motion?
Toilet user: did they wash their hands?
Idea 1
Ask simple guesswork labelling question
26
27
Idea 2
find the ROI of (cheap) prediction
Level 1: as an optimisation tool
Level 2: as an improvement / help / recommandation
Level 3: as a new feature / product
28
29
Discover Opportunities-
Brainstorming framework
1. Think about automating tasks rather
than jobs!
2. What are the main drivers of
business value?
3. What are the main pain points in
your business ?
4. How much data is needed ? Is my
data clean ? Are we mature in terms
of data ?
30
What AI can do
Valuable cases
for your business
AI experts Domain experts
Cross-functional team 31
32
32
33
33
34
34
Real life case studies
Fromcorebusinesstolow-hangingfruits
35
35
Recommandations
“35 percent of what consumers purchase on Amazon come from product recommendations”
https://www.mckinsey.com/industries/retail/our-insights/how-retailers-can-keep-up-with-consumers
36
36
Amazon Go
“Amazon.com Inc. may open up to 3,000 Amazon Go outlets by 2021”
https://www.bloomberg.com/news/articles/2018-09-19/amazon-is-said-to-plan-up-to-3-000-cashierless-stores-by-2021
37
37
Amazon Echo
38
Amazon Robotics
39
Amazon Prime Air
40
41
41
Choose the performance metric
What are you willing to lose?
42
North star metric?
43
Optimizing vs satisficing metric
Possible metrics
Cost = accuracy – 0.5 x running time
Or
Maximize accuracy
Subject to running time <= 100 ms
Use case 1 : Cat classifier
Use case 2 :
Detect trigger words for Amazon Alexa Device
Possible metrics
Accuracy
Or
Maximize accuracy
Subject to <= 1 FP for every 24 hours
44
01
02
Select the right question
Choose the performance metric
Decide the level of explainability
Identify
Find the right data
Structure annotate data
Clean Data
Data
Applied AI Lifecycle © PROPERTY OF AI BLACK BELT
45
46
What data looks like?
47
48
49
50
51
Data exploration
52
53
54
Most applications require lots of data
55
Where will you get it?
Then prioritise by availability, accessibility & cost
- existing data sources
- data enrichment (feature engineering)
- data augmentation
- data generation
- manual data labeling
- create new data sources (e.g. sensors)
- Public data, scraping, etc
56
Data labeling
57
If the benefit of the performance
increase outweighs the cost of
acquiring more data, get more
data!
Diminishing returns mean that
more data won’t help
What amount of data ?
58
Hire, Crowdsource, Service
Three choices
59
Data flywheel strategy
use users to gather more (noisy) data
60
Example : reCAPTCHA
OCR Image Classifier
61
Synthetic data
62
Could you think of other features that you could
obtain that would help with this task?
Feature engineering
63
Increased execution risk
if data of not good quality
64
Garbage in, garbage out
65
What do you see here ?
66
Bias in a typical ML paradigm
67
Bias in a typical ML paradigm
68
Selection bias
69
69
Selection bias
70
70
Outgroup homogeneity bias
They are alike, we are diverse
71
71
Confirmation bias
72
72
Correlation fallacy
73
73
Automation bias
Don’t trust the machine
74
74
Design with Fairness in mind
! Consider the problem
! Ask experts
! Train the models to account for bias
! Interpret outcomes
! Publish with context
75
⊙ Big data is always better, but not
necessary
⊙ Clean data better than a lot of messy
data
⊙ Small data is almost always enough to
make progress (activate feedback loop)
⊙ If no data, don’t give up, if can be
generated or augmented!
⊙ Design ML model with fairness in mind
Go talk to a ML Engineer to figure it out
Data
Consideratio
ns
76
Exercice: find your data
77
01
03
02
Select the right question
Choose the performance metric
Decide the level of explainability
Identify
Find the right data
Structure annotate data
Clean Data
Data
Select the right algorithm
Tune the model
Model
Applied AI Lifecycle © PROPERTY OF AI BLACK BELT
78
Main steps for model training
1. Select your model family (and your performance metric)
2. Split you dataset into Train/Dev/Test set
3. Train model on training set
4. Take care of overfitting vs underfitting
5. Tuning hyper parameters
6. Select best model
79
1) Select the appropriate family of models
80
81
2) Dataset splitting into Train/Dev/Test
82
3) Train model on training set (Fit & tune model)
83
4) Take care of overfitting vs underfitting
84
5) Tuning hyperparameters (with cross-validation)
85
5) Tuning hyperparameters (with cross-validation)
86
6) Selecting best model
For each algorithm (i.e. regularized regression, random forest, etc.):
For each set of hyperparameter values to try:
Perform cross-validation using the training set.
Calculate cross-validated score.
87
Checkpoint quizz
! Pick one: better data or fancier algorithms ?
! When should you split your dataset into training and test sets, and why?
! What's the key difference between model parameters and hyperparameters?
! Explain how cross-validation helps you "tune" your models?
88
89
90
01
03
02
04
Select the right question
Choose the performance metric
Decide the level of explainability
Identify
Find the right data
Structure annotate data
Clean Data
Data
Decide on an acceptable error
Test on the right scope
Evaluate
Select the right algorithm
Tune the model
Model
Applied AI Lifecycle © PROPERTY OF AI BLACK BELT
91
North star metric?
92
92
Acceptable rate (comparting to human level performance)
93
Explainability of the model
● Depending on the machine learning model used, the results could be :
● Very simple to interpret: Like decision trees
● Very difficult to interpret: Like deep-learning neural networks
94
Explainability of the model
On a decision tree, set of rules are well defined
95
Explainability of the model
On a deep-learning neural network, interpretability of weights is difficult.
96
Explainability of the model
We could still use more sophisticated technique to partially understand
their predictions. This is an example on logo detection algorithms
Image Grad-cam Image Grad-cam
97
Performance of the model
TP
TNFP
FN
YES NO
YES
NO
Predicted
Actual
Confusion Matrix
98
Performance of the model
1.
4.3.
2.
YES NO
YES
NO
Predicted
Actual
How confusion matrix can help understand the model performance
Imagine you have a medical problem, do you go see your doctor?
1. If you should and you did, the fee is 25€
2. If you should and you didn’t, it gets worse and you will see a specialist, the fee is
70 €
3. If you shouldn't and you did, you still pay 25€
4. If you shouldn't and you didn’t, you do not pay anything
OK
OK
Loose 25 €
Loose 45 €
99
Performance of the model
200
10020
40
YES NO
YES
NO
Predicted
Actual
How confusion matrix can help understand the model performance
Which ML model is better, according to confusion matrices ?
Loose 25 €
Loose 45 €
210
8535
30
YES NO
YES
NO
Predicted
Actual
Loose 25 €
Loose 45 €
Loose 20 * 25 € + 40 * 45 € = 2 300 € Loose 35 * 25 € + 30 * 45 € = 2 225 € 100
Accuracy metric
Let us speak in terms of seeing your doctor:
● Accuracy: Over all the choices (see or not your doctor)
you make, how many of them were correct?
!""#$%"& =
() + (+
() + ,+ + ,) + (+
TP
TNFP
FN
YES NO
YES
NO
Predicted
Actual
101
Precision & Recall metrics
Let us speak in terms of seeing your doctor:
● Recall: Over all the times you should go see your doctor, how
many times you really went?
!"#$%% =
'(
'( + *+
● Precision: Over all the times you did go see your doctor, how
many of times you really needed to see him?
(,"#-.-/0 =
'(
'( + *(
TP
TNFP
FN
YES NO
YES
NO
Predicted
Actual
102
Accuracy VS Precision & Recall
● The accuracy is not used when the problem is not balanced.
● If 99% of your data are just one class
● An accuracy of 99% is just a majority vote
● Precision and recall are more useful in this case since you can focus on each class
individually
103
Which ML method is preferred ?
Use Case : Customer Churn
Target action A : phone call to potential churning customer
Target action B : send generous discount to potential churners
Which method is preferred for each target ?
104
105
Performance of the model
75% Precision
64% Volume
52% Volume
49% Volume
75% Volume
80% Precision
85% Precision
90% Precision
106
Performance of the model
Manual labeling cost /
cost of no action
Cost of error
107
Payout as a function of Threshold
108
Exercice - regression / classification
109
01
03
02
04
05
Select the right question
Choose the performance metric
Decide the level of explainability
Identify
Use the right architecture
Have the talents in place
Deploy
Find the right data
Structure annotate data
Clean Data
Data
Decide on an acceptable error
Test on the right scope
Evaluate
Select the right algorithm
Tune the model
Model
Applied AI Lifecycle © PROPERTY OF AI BLACK BELT
110
Basics of data engineering
A crash course
111
112
・Identifying fraudulent claims so that they can select claims for
deeper manual investigation; they have a business goal of
reducing fraud by 5% this year.
・Predicting weather patterns so that they can advise
customers to protect their vehicles by bringing them inside when
there’s a high chance of storms — thereby reducing vehicle
damage claims by 2%.
・Upselling other insurance products to the customer based on
the products they already have. The goal is to increase the
conversion rate for online upselling by 3%.
Still think vertical – 3 use cases
113
114
01
03
02
06
04
05
Monitoring & Updates
Have the right talents & solutions
Maintenance
Select the right question
Choose the performance metric
Decide the level of explainability
Identify
Use the right architecture
Have the talents in place
Deploy
Find the right data
Structure annotate data
Clean Data
Data
Decide on an acceptable error
Test on the right scope
Evaluate
Select the right algorithm
Tune the model
Model
Applied AI Lifecycle © PROPERTY OF AI BLACK BELT
115
A refined way of seeing it..
116
117

More Related Content

What's hot

Barga Data Science lecture 1
Barga Data Science lecture 1Barga Data Science lecture 1
Barga Data Science lecture 1Roger Barga
 
How to Interview a Data Scientist
How to Interview a Data ScientistHow to Interview a Data Scientist
How to Interview a Data ScientistDaniel Tunkelang
 
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...Edureka!
 
Intro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsIntro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsSri Ambati
 
Machine Learning for Statisticians - Introduction
Machine Learning for Statisticians - IntroductionMachine Learning for Statisticians - Introduction
Machine Learning for Statisticians - IntroductionDr Ganesh Iyer
 
"Introduction to Machine Learning and its Applications" at sapthgiri engineer...
"Introduction to Machine Learning and its Applications" at sapthgiri engineer..."Introduction to Machine Learning and its Applications" at sapthgiri engineer...
"Introduction to Machine Learning and its Applications" at sapthgiri engineer...Sachin Nagargoje
 
Machine Learning: Opening the Pandora's Box - Dhiana Deva @ QCon São Paulo 2019
Machine Learning: Opening the Pandora's Box - Dhiana Deva @ QCon São Paulo 2019Machine Learning: Opening the Pandora's Box - Dhiana Deva @ QCon São Paulo 2019
Machine Learning: Opening the Pandora's Box - Dhiana Deva @ QCon São Paulo 2019Dhiana Deva
 
Artificial Intelligence with Python | Edureka
Artificial Intelligence with Python | EdurekaArtificial Intelligence with Python | Edureka
Artificial Intelligence with Python | EdurekaEdureka!
 
Machine Learning Introduction
Machine Learning IntroductionMachine Learning Introduction
Machine Learning IntroductionYounesCharfaoui
 
Artificial Intelligence Overview PowerPoint Presentation Slides
Artificial Intelligence Overview PowerPoint Presentation Slides Artificial Intelligence Overview PowerPoint Presentation Slides
Artificial Intelligence Overview PowerPoint Presentation Slides SlideTeam
 
What your employees need to learn to work with data in the 21 st century
What your employees need to learn to work with data in the 21 st century What your employees need to learn to work with data in the 21 st century
What your employees need to learn to work with data in the 21 st century Human Capital Media
 
Barga Data Science lecture 4
Barga Data Science lecture 4Barga Data Science lecture 4
Barga Data Science lecture 4Roger Barga
 
Artificial Intelligence And Machine Learning PowerPoint Presentation Slides C...
Artificial Intelligence And Machine Learning PowerPoint Presentation Slides C...Artificial Intelligence And Machine Learning PowerPoint Presentation Slides C...
Artificial Intelligence And Machine Learning PowerPoint Presentation Slides C...SlideTeam
 
Guide to end end machine learning projects
Guide to end end machine learning projectsGuide to end end machine learning projects
Guide to end end machine learning projectsSkyl.ai
 
IBM Watson Question-Answering System and Cognitive Computing
IBM Watson Question-Answering System and Cognitive ComputingIBM Watson Question-Answering System and Cognitive Computing
IBM Watson Question-Answering System and Cognitive ComputingRakuten Group, Inc.
 
Data Science vs Machine Learning – What’s The Difference? | Data Science Cour...
Data Science vs Machine Learning – What’s The Difference? | Data Science Cour...Data Science vs Machine Learning – What’s The Difference? | Data Science Cour...
Data Science vs Machine Learning – What’s The Difference? | Data Science Cour...Edureka!
 
Artificial Intelligence PowerPoint Presentation Slide Template Complete Deck
Artificial Intelligence PowerPoint Presentation Slide Template Complete DeckArtificial Intelligence PowerPoint Presentation Slide Template Complete Deck
Artificial Intelligence PowerPoint Presentation Slide Template Complete DeckSlideTeam
 

What's hot (20)

Barga Data Science lecture 1
Barga Data Science lecture 1Barga Data Science lecture 1
Barga Data Science lecture 1
 
How to Interview a Data Scientist
How to Interview a Data ScientistHow to Interview a Data Scientist
How to Interview a Data Scientist
 
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
 
Intro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsIntro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data Scientists
 
Machine Learning for Statisticians - Introduction
Machine Learning for Statisticians - IntroductionMachine Learning for Statisticians - Introduction
Machine Learning for Statisticians - Introduction
 
"Introduction to Machine Learning and its Applications" at sapthgiri engineer...
"Introduction to Machine Learning and its Applications" at sapthgiri engineer..."Introduction to Machine Learning and its Applications" at sapthgiri engineer...
"Introduction to Machine Learning and its Applications" at sapthgiri engineer...
 
Machine Learning: Opening the Pandora's Box - Dhiana Deva @ QCon São Paulo 2019
Machine Learning: Opening the Pandora's Box - Dhiana Deva @ QCon São Paulo 2019Machine Learning: Opening the Pandora's Box - Dhiana Deva @ QCon São Paulo 2019
Machine Learning: Opening the Pandora's Box - Dhiana Deva @ QCon São Paulo 2019
 
Artificial Intelligence with Python | Edureka
Artificial Intelligence with Python | EdurekaArtificial Intelligence with Python | Edureka
Artificial Intelligence with Python | Edureka
 
Machine Learning Introduction
Machine Learning IntroductionMachine Learning Introduction
Machine Learning Introduction
 
Artificial Intelligence Overview PowerPoint Presentation Slides
Artificial Intelligence Overview PowerPoint Presentation Slides Artificial Intelligence Overview PowerPoint Presentation Slides
Artificial Intelligence Overview PowerPoint Presentation Slides
 
Data science - An Introduction
Data science - An IntroductionData science - An Introduction
Data science - An Introduction
 
What your employees need to learn to work with data in the 21 st century
What your employees need to learn to work with data in the 21 st century What your employees need to learn to work with data in the 21 st century
What your employees need to learn to work with data in the 21 st century
 
Barga Data Science lecture 4
Barga Data Science lecture 4Barga Data Science lecture 4
Barga Data Science lecture 4
 
Artificial Intelligence And Machine Learning PowerPoint Presentation Slides C...
Artificial Intelligence And Machine Learning PowerPoint Presentation Slides C...Artificial Intelligence And Machine Learning PowerPoint Presentation Slides C...
Artificial Intelligence And Machine Learning PowerPoint Presentation Slides C...
 
Guide to end end machine learning projects
Guide to end end machine learning projectsGuide to end end machine learning projects
Guide to end end machine learning projects
 
IBM Watson Question-Answering System and Cognitive Computing
IBM Watson Question-Answering System and Cognitive ComputingIBM Watson Question-Answering System and Cognitive Computing
IBM Watson Question-Answering System and Cognitive Computing
 
Managing machine learning
Managing machine learningManaging machine learning
Managing machine learning
 
Data Science vs Machine Learning – What’s The Difference? | Data Science Cour...
Data Science vs Machine Learning – What’s The Difference? | Data Science Cour...Data Science vs Machine Learning – What’s The Difference? | Data Science Cour...
Data Science vs Machine Learning – What’s The Difference? | Data Science Cour...
 
Artificial intelligence
Artificial intelligenceArtificial intelligence
Artificial intelligence
 
Artificial Intelligence PowerPoint Presentation Slide Template Complete Deck
Artificial Intelligence PowerPoint Presentation Slide Template Complete DeckArtificial Intelligence PowerPoint Presentation Slide Template Complete Deck
Artificial Intelligence PowerPoint Presentation Slide Template Complete Deck
 

Similar to AI Orange Belt - Session 2

Whats Next for Machine Learning
Whats Next for Machine LearningWhats Next for Machine Learning
Whats Next for Machine LearningOgilvy Consulting
 
FROM BI TO APPLIED AI
FROM BI TO APPLIED AIFROM BI TO APPLIED AI
FROM BI TO APPLIED AILior Sidi
 
What are the Assumptions About Data Products by Hiya.com Lead PM
What are the Assumptions About Data Products by Hiya.com Lead PMWhat are the Assumptions About Data Products by Hiya.com Lead PM
What are the Assumptions About Data Products by Hiya.com Lead PMProduct School
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial IntelligenceEnes Bolfidan
 
(In)convenient truths about applied machine learning
(In)convenient truths about applied machine learning(In)convenient truths about applied machine learning
(In)convenient truths about applied machine learningMax Pagels
 
Future of data science as a profession
Future of data science as a professionFuture of data science as a profession
Future of data science as a professionJose Quesada
 
Discussion - Weeks 1–2COLLAPSETop of FormShared Practice—Rol.docx
Discussion - Weeks 1–2COLLAPSETop of FormShared Practice—Rol.docxDiscussion - Weeks 1–2COLLAPSETop of FormShared Practice—Rol.docx
Discussion - Weeks 1–2COLLAPSETop of FormShared Practice—Rol.docxcuddietheresa
 
Artificial Intelligence beyond the hype: Local (Belgian) Machine Learning suc...
Artificial Intelligence beyond the hype: Local (Belgian) Machine Learning suc...Artificial Intelligence beyond the hype: Local (Belgian) Machine Learning suc...
Artificial Intelligence beyond the hype: Local (Belgian) Machine Learning suc...Patrick Van Renterghem
 
Putting data science in your business a first utility feedback
Putting data science in your business a first utility feedbackPutting data science in your business a first utility feedback
Putting data science in your business a first utility feedbackPeculium Crypto
 
Counting the World with AI Models
Counting the World with AI ModelsCounting the World with AI Models
Counting the World with AI ModelsGanes Kesari
 
Why analytics projects fail
Why analytics projects failWhy analytics projects fail
Why analytics projects failDr. Bülent Dal
 
WHY DO SO MANY ANALYTICS PROJECTS STILL FAIL?
WHY DO SO MANY ANALYTICS PROJECTS STILL FAIL?WHY DO SO MANY ANALYTICS PROJECTS STILL FAIL?
WHY DO SO MANY ANALYTICS PROJECTS STILL FAIL?Haluk Demirkan
 
Lunch and Learn Artificial intelligence
Lunch and Learn Artificial intelligence Lunch and Learn Artificial intelligence
Lunch and Learn Artificial intelligence The PNR
 
Where have all the data entry candidates gone?
Where have all the data entry candidates gone?Where have all the data entry candidates gone?
Where have all the data entry candidates gone?Infrrd
 
Understanding the New World of Cognitive Computing
Understanding the New World of Cognitive ComputingUnderstanding the New World of Cognitive Computing
Understanding the New World of Cognitive ComputingDATAVERSITY
 
Artificial Intelligence Primer
Artificial Intelligence PrimerArtificial Intelligence Primer
Artificial Intelligence PrimerImam Hoque
 
Fontys Eric van Tol
Fontys Eric van TolFontys Eric van Tol
Fontys Eric van TolTalentEvent
 
Operationalizing Machine Learning
Operationalizing Machine LearningOperationalizing Machine Learning
Operationalizing Machine LearningAgileThought
 
Project professionals: Ready for the future? AI and Change Management, James ...
Project professionals: Ready for the future? AI and Change Management, James ...Project professionals: Ready for the future? AI and Change Management, James ...
Project professionals: Ready for the future? AI and Change Management, James ...APMDonotuse
 

Similar to AI Orange Belt - Session 2 (20)

Whats Next for Machine Learning
Whats Next for Machine LearningWhats Next for Machine Learning
Whats Next for Machine Learning
 
FROM BI TO APPLIED AI
FROM BI TO APPLIED AIFROM BI TO APPLIED AI
FROM BI TO APPLIED AI
 
What are the Assumptions About Data Products by Hiya.com Lead PM
What are the Assumptions About Data Products by Hiya.com Lead PMWhat are the Assumptions About Data Products by Hiya.com Lead PM
What are the Assumptions About Data Products by Hiya.com Lead PM
 
Intro to ai application emeritus uob-final
Intro to ai application emeritus uob-finalIntro to ai application emeritus uob-final
Intro to ai application emeritus uob-final
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligence
 
(In)convenient truths about applied machine learning
(In)convenient truths about applied machine learning(In)convenient truths about applied machine learning
(In)convenient truths about applied machine learning
 
Future of data science as a profession
Future of data science as a professionFuture of data science as a profession
Future of data science as a profession
 
Discussion - Weeks 1–2COLLAPSETop of FormShared Practice—Rol.docx
Discussion - Weeks 1–2COLLAPSETop of FormShared Practice—Rol.docxDiscussion - Weeks 1–2COLLAPSETop of FormShared Practice—Rol.docx
Discussion - Weeks 1–2COLLAPSETop of FormShared Practice—Rol.docx
 
Artificial Intelligence beyond the hype: Local (Belgian) Machine Learning suc...
Artificial Intelligence beyond the hype: Local (Belgian) Machine Learning suc...Artificial Intelligence beyond the hype: Local (Belgian) Machine Learning suc...
Artificial Intelligence beyond the hype: Local (Belgian) Machine Learning suc...
 
Putting data science in your business a first utility feedback
Putting data science in your business a first utility feedbackPutting data science in your business a first utility feedback
Putting data science in your business a first utility feedback
 
Counting the World with AI Models
Counting the World with AI ModelsCounting the World with AI Models
Counting the World with AI Models
 
Why analytics projects fail
Why analytics projects failWhy analytics projects fail
Why analytics projects fail
 
WHY DO SO MANY ANALYTICS PROJECTS STILL FAIL?
WHY DO SO MANY ANALYTICS PROJECTS STILL FAIL?WHY DO SO MANY ANALYTICS PROJECTS STILL FAIL?
WHY DO SO MANY ANALYTICS PROJECTS STILL FAIL?
 
Lunch and Learn Artificial intelligence
Lunch and Learn Artificial intelligence Lunch and Learn Artificial intelligence
Lunch and Learn Artificial intelligence
 
Where have all the data entry candidates gone?
Where have all the data entry candidates gone?Where have all the data entry candidates gone?
Where have all the data entry candidates gone?
 
Understanding the New World of Cognitive Computing
Understanding the New World of Cognitive ComputingUnderstanding the New World of Cognitive Computing
Understanding the New World of Cognitive Computing
 
Artificial Intelligence Primer
Artificial Intelligence PrimerArtificial Intelligence Primer
Artificial Intelligence Primer
 
Fontys Eric van Tol
Fontys Eric van TolFontys Eric van Tol
Fontys Eric van Tol
 
Operationalizing Machine Learning
Operationalizing Machine LearningOperationalizing Machine Learning
Operationalizing Machine Learning
 
Project professionals: Ready for the future? AI and Change Management, James ...
Project professionals: Ready for the future? AI and Change Management, James ...Project professionals: Ready for the future? AI and Change Management, James ...
Project professionals: Ready for the future? AI and Change Management, James ...
 

Recently uploaded

EMBODO Lesson Plan Grade 9 Law of Sines.docx
EMBODO Lesson Plan Grade 9 Law of Sines.docxEMBODO Lesson Plan Grade 9 Law of Sines.docx
EMBODO Lesson Plan Grade 9 Law of Sines.docxElton John Embodo
 
The Contemporary World: The Globalization of World Politics
The Contemporary World: The Globalization of World PoliticsThe Contemporary World: The Globalization of World Politics
The Contemporary World: The Globalization of World PoliticsRommel Regala
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfTechSoup
 
Presentation Activity 2. Unit 3 transv.pptx
Presentation Activity 2. Unit 3 transv.pptxPresentation Activity 2. Unit 3 transv.pptx
Presentation Activity 2. Unit 3 transv.pptxRosabel UA
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxHumphrey A Beña
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parentsnavabharathschool99
 
Activity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationActivity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationRosabel UA
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...Nguyen Thanh Tu Collection
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONHumphrey A Beña
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPCeline George
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfVanessa Camilleri
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSJoshuaGantuangco2
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4JOYLYNSAMANIEGO
 

Recently uploaded (20)

EMBODO Lesson Plan Grade 9 Law of Sines.docx
EMBODO Lesson Plan Grade 9 Law of Sines.docxEMBODO Lesson Plan Grade 9 Law of Sines.docx
EMBODO Lesson Plan Grade 9 Law of Sines.docx
 
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 
The Contemporary World: The Globalization of World Politics
The Contemporary World: The Globalization of World PoliticsThe Contemporary World: The Globalization of World Politics
The Contemporary World: The Globalization of World Politics
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
 
Presentation Activity 2. Unit 3 transv.pptx
Presentation Activity 2. Unit 3 transv.pptxPresentation Activity 2. Unit 3 transv.pptx
Presentation Activity 2. Unit 3 transv.pptx
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
 
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptxFINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parents
 
Activity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationActivity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translation
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
 
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptxYOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERP
 
Paradigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTAParadigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTA
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdf
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
 
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4
 

AI Orange Belt - Session 2

  • 1. AI Orange Belt Session 2 - Harness the power of AI abilities 1
  • 2. AI ORANGE BELT SKILLS © PROPERTY OF AI BLACK BELT ORANGE BELT The prerequisites : what is AI, how does it work in real life How to manage and implement an artificial intelligence project DEFINITION PROJECT 2
  • 3. • AI is a form of advanced computer science, that learns from data in order to expand its generalization abilities on narrow tasks, as opposed to regular software hardcoded instructions • AI can be subdivided into supervised learning - the bulk of modern applications, unsupervised learning - grouping for visualization and exploration purpose mainly, and reinforcement learning - difficult to implement but powerful in some optimization with actions cases • The list of tasks AI can solve can broadly be divided into : classification, prediction, clustering, outlier detection, recommandation, data generation • The different subdomain of applications can be determined by the data input/output types : vision, NLP (text&speech), structured classic, robotics What have we seen last time? 3
  • 5. if color == "green": return "apple" elif color == "orange": return "orange" else return "banana" “apple” “orange” “banana” “apple” 5 Can you write a computer program that does that ?
  • 6. “apple” “orange” “banana” “banana” if shape == "round": if color == "green" return "apple" else color == "orange" return "orange" else return "banana" 6
  • 7. “apple” “orange” “banana” “banana” if shape == "round": if color == "green" return "apple" elif color == "orange" return "orange" else return "banana" 7
  • 9. Input (A) Output (B) Object Detection Applications email Intent detection Text Classification audio Text transcript Speech Recognition French Dutch Machine Translation Object verification or Identification Anomaly Detection 9
  • 12. • AI is a form of advanced computer science, that learns from data in order to expand its generalisation abilities on narrow tasks, as opposed to regular software hardcoded instructions • AI can be subdivided into supervised learning - the bulk of modern applications, unsupervised learning - grouping for visualisation and exploration purpose mainly, and reinforcement learning - difficult to implement but powerful in some optimisation with actions cases • The list of tasks AI can solve can broadly be divided into : classification, prediction, clustering, outlier detection, recommandation, data generation • The different subdomain of applications can be determined by the data input/output types : vision, NLP (text&speech), structured classic, robotics What have we seen last time? 12
  • 13. •Understand the limits of AI and the main biases when it comes to create intelligent machines in real life •Lifecycle of an AI application, and how it differs from regular workflows •How to detect opportunities / use cases, and evaluate their impact on the revenue of the company. Cost per task, revenue per task •Team management, project management (create and deploy) and data management Our plan for today - the real world 13
  • 14. 14
  • 15. What barriers are faced at work ? 15
  • 17. Key steps of a machine learning project Echo / Alexa 1. Collect data 2. Train model Iterate many times until good enough 3. Deploy model Get data back Maintain / update the model 01 03 02 06 04 05 MaintenanceIdentify DeployData EvaluateModel Source : deeplearning.ai 17
  • 18. Key steps of a machine learning project Self-driving car 1. Collect data image position of other cars 2. Train model 3. Deploy model Get data back Maintain model 01 03 02 06 04 05 MaintenanceIdentify DeployData EvaluateModel Source : deeplearning.ai 18
  • 19. 01 Select the right question Choose the performance metric Decide the level of explainability Identify Applied AI Lifecycle © PROPERTY OF AI BLACK BELT 19
  • 20. Discover what’s possible What would be helpful for your business? 20
  • 21. Anything you can do with 1 second of thought, can probably be automated today 21
  • 22. “The toy arrived two days late, so I wasn’t able to give it to my nephew for his birthday. Can I return it ?” “Refund request” Refund/Shipping/OrderInput text “Oh sorry to hear that! I hope you nephew had a good birthday. Yes, we can help with ... Complex personalised empathetic response Input text “Yes you can. The refund procedure is ...” Simple responseInput text 22
  • 23. Diagnose pneumonia on ~ 10.000 images Diagnose pneumonia from 10 images of a medical textbook Ask to perform on new type of data 23
  • 24. Take a (deep) look at your work Break down your workflow and your business unit 24
  • 25. 25
  • 26. Baby food ingredient: safe or spoiled? Patient: ideal medication dosage? Email: spam or ham? Recorded phone call to call center: issue topic? Bottle of wine: will I like it or not? Steering wheel: left or right? Photo: which animal? Game piece: which location on the board? Start of a sentence: end of that sentence? Stock: tomorrow’s price? Transaction: legitimate or fraudulent? Data center cooling system: warmer or cooler? Machine: when will it need maintenance? Inventory: when to restock? Scene description: pixels in a visual rendering? Today’s temperature: tomorrow’s temperature? Auction: how much to bid? Movie: will you like it or not? Live lecture: text captions? Poem: what does it sound like out loud? Image of an invoice: total amount in dollars? Service request: waiting time? Expense report: budget category? Sound recording: correct text captions? Song lyrics: language? Sentence in English: same meaning in Chinese? Form incorrectly filled out: correct fields? Clothing item: skirt or blouse or …? Video: which actors? Video game: joystick motion? Toilet user: did they wash their hands? Idea 1 Ask simple guesswork labelling question 26
  • 27. 27
  • 28. Idea 2 find the ROI of (cheap) prediction Level 1: as an optimisation tool Level 2: as an improvement / help / recommandation Level 3: as a new feature / product 28
  • 29. 29
  • 30. Discover Opportunities- Brainstorming framework 1. Think about automating tasks rather than jobs! 2. What are the main drivers of business value? 3. What are the main pain points in your business ? 4. How much data is needed ? Is my data clean ? Are we mature in terms of data ? 30
  • 31. What AI can do Valuable cases for your business AI experts Domain experts Cross-functional team 31
  • 32. 32 32
  • 33. 33 33
  • 34. 34 34
  • 35. Real life case studies Fromcorebusinesstolow-hangingfruits 35 35
  • 36. Recommandations “35 percent of what consumers purchase on Amazon come from product recommendations” https://www.mckinsey.com/industries/retail/our-insights/how-retailers-can-keep-up-with-consumers 36 36
  • 37. Amazon Go “Amazon.com Inc. may open up to 3,000 Amazon Go outlets by 2021” https://www.bloomberg.com/news/articles/2018-09-19/amazon-is-said-to-plan-up-to-3-000-cashierless-stores-by-2021 37 37
  • 41. 41 41
  • 42. Choose the performance metric What are you willing to lose? 42
  • 44. Optimizing vs satisficing metric Possible metrics Cost = accuracy – 0.5 x running time Or Maximize accuracy Subject to running time <= 100 ms Use case 1 : Cat classifier Use case 2 : Detect trigger words for Amazon Alexa Device Possible metrics Accuracy Or Maximize accuracy Subject to <= 1 FP for every 24 hours 44
  • 45. 01 02 Select the right question Choose the performance metric Decide the level of explainability Identify Find the right data Structure annotate data Clean Data Data Applied AI Lifecycle © PROPERTY OF AI BLACK BELT 45
  • 46. 46
  • 47. What data looks like? 47
  • 48. 48
  • 49. 49
  • 50. 50
  • 51. 51
  • 53. 53
  • 54. 54
  • 55. Most applications require lots of data 55
  • 56. Where will you get it? Then prioritise by availability, accessibility & cost - existing data sources - data enrichment (feature engineering) - data augmentation - data generation - manual data labeling - create new data sources (e.g. sensors) - Public data, scraping, etc 56
  • 58. If the benefit of the performance increase outweighs the cost of acquiring more data, get more data! Diminishing returns mean that more data won’t help What amount of data ? 58
  • 60. Data flywheel strategy use users to gather more (noisy) data 60
  • 61. Example : reCAPTCHA OCR Image Classifier 61
  • 63. Could you think of other features that you could obtain that would help with this task? Feature engineering 63
  • 64. Increased execution risk if data of not good quality 64
  • 66. What do you see here ? 66
  • 67. Bias in a typical ML paradigm 67
  • 68. Bias in a typical ML paradigm 68
  • 71. Outgroup homogeneity bias They are alike, we are diverse 71 71
  • 74. Automation bias Don’t trust the machine 74 74
  • 75. Design with Fairness in mind ! Consider the problem ! Ask experts ! Train the models to account for bias ! Interpret outcomes ! Publish with context 75
  • 76. ⊙ Big data is always better, but not necessary ⊙ Clean data better than a lot of messy data ⊙ Small data is almost always enough to make progress (activate feedback loop) ⊙ If no data, don’t give up, if can be generated or augmented! ⊙ Design ML model with fairness in mind Go talk to a ML Engineer to figure it out Data Consideratio ns 76
  • 78. 01 03 02 Select the right question Choose the performance metric Decide the level of explainability Identify Find the right data Structure annotate data Clean Data Data Select the right algorithm Tune the model Model Applied AI Lifecycle © PROPERTY OF AI BLACK BELT 78
  • 79. Main steps for model training 1. Select your model family (and your performance metric) 2. Split you dataset into Train/Dev/Test set 3. Train model on training set 4. Take care of overfitting vs underfitting 5. Tuning hyper parameters 6. Select best model 79
  • 80. 1) Select the appropriate family of models 80
  • 81. 81
  • 82. 2) Dataset splitting into Train/Dev/Test 82
  • 83. 3) Train model on training set (Fit & tune model) 83
  • 84. 4) Take care of overfitting vs underfitting 84
  • 85. 5) Tuning hyperparameters (with cross-validation) 85
  • 86. 5) Tuning hyperparameters (with cross-validation) 86
  • 87. 6) Selecting best model For each algorithm (i.e. regularized regression, random forest, etc.): For each set of hyperparameter values to try: Perform cross-validation using the training set. Calculate cross-validated score. 87
  • 88. Checkpoint quizz ! Pick one: better data or fancier algorithms ? ! When should you split your dataset into training and test sets, and why? ! What's the key difference between model parameters and hyperparameters? ! Explain how cross-validation helps you "tune" your models? 88
  • 89. 89
  • 90. 90
  • 91. 01 03 02 04 Select the right question Choose the performance metric Decide the level of explainability Identify Find the right data Structure annotate data Clean Data Data Decide on an acceptable error Test on the right scope Evaluate Select the right algorithm Tune the model Model Applied AI Lifecycle © PROPERTY OF AI BLACK BELT 91
  • 93. Acceptable rate (comparting to human level performance) 93
  • 94. Explainability of the model ● Depending on the machine learning model used, the results could be : ● Very simple to interpret: Like decision trees ● Very difficult to interpret: Like deep-learning neural networks 94
  • 95. Explainability of the model On a decision tree, set of rules are well defined 95
  • 96. Explainability of the model On a deep-learning neural network, interpretability of weights is difficult. 96
  • 97. Explainability of the model We could still use more sophisticated technique to partially understand their predictions. This is an example on logo detection algorithms Image Grad-cam Image Grad-cam 97
  • 98. Performance of the model TP TNFP FN YES NO YES NO Predicted Actual Confusion Matrix 98
  • 99. Performance of the model 1. 4.3. 2. YES NO YES NO Predicted Actual How confusion matrix can help understand the model performance Imagine you have a medical problem, do you go see your doctor? 1. If you should and you did, the fee is 25€ 2. If you should and you didn’t, it gets worse and you will see a specialist, the fee is 70 € 3. If you shouldn't and you did, you still pay 25€ 4. If you shouldn't and you didn’t, you do not pay anything OK OK Loose 25 € Loose 45 € 99
  • 100. Performance of the model 200 10020 40 YES NO YES NO Predicted Actual How confusion matrix can help understand the model performance Which ML model is better, according to confusion matrices ? Loose 25 € Loose 45 € 210 8535 30 YES NO YES NO Predicted Actual Loose 25 € Loose 45 € Loose 20 * 25 € + 40 * 45 € = 2 300 € Loose 35 * 25 € + 30 * 45 € = 2 225 € 100
  • 101. Accuracy metric Let us speak in terms of seeing your doctor: ● Accuracy: Over all the choices (see or not your doctor) you make, how many of them were correct? !""#$%"& = () + (+ () + ,+ + ,) + (+ TP TNFP FN YES NO YES NO Predicted Actual 101
  • 102. Precision & Recall metrics Let us speak in terms of seeing your doctor: ● Recall: Over all the times you should go see your doctor, how many times you really went? !"#$%% = '( '( + *+ ● Precision: Over all the times you did go see your doctor, how many of times you really needed to see him? (,"#-.-/0 = '( '( + *( TP TNFP FN YES NO YES NO Predicted Actual 102
  • 103. Accuracy VS Precision & Recall ● The accuracy is not used when the problem is not balanced. ● If 99% of your data are just one class ● An accuracy of 99% is just a majority vote ● Precision and recall are more useful in this case since you can focus on each class individually 103
  • 104. Which ML method is preferred ? Use Case : Customer Churn Target action A : phone call to potential churning customer Target action B : send generous discount to potential churners Which method is preferred for each target ? 104
  • 105. 105
  • 106. Performance of the model 75% Precision 64% Volume 52% Volume 49% Volume 75% Volume 80% Precision 85% Precision 90% Precision 106
  • 107. Performance of the model Manual labeling cost / cost of no action Cost of error 107
  • 108. Payout as a function of Threshold 108
  • 109. Exercice - regression / classification 109
  • 110. 01 03 02 04 05 Select the right question Choose the performance metric Decide the level of explainability Identify Use the right architecture Have the talents in place Deploy Find the right data Structure annotate data Clean Data Data Decide on an acceptable error Test on the right scope Evaluate Select the right algorithm Tune the model Model Applied AI Lifecycle © PROPERTY OF AI BLACK BELT 110
  • 111. Basics of data engineering A crash course 111
  • 112. 112
  • 113. ・Identifying fraudulent claims so that they can select claims for deeper manual investigation; they have a business goal of reducing fraud by 5% this year. ・Predicting weather patterns so that they can advise customers to protect their vehicles by bringing them inside when there’s a high chance of storms — thereby reducing vehicle damage claims by 2%. ・Upselling other insurance products to the customer based on the products they already have. The goal is to increase the conversion rate for online upselling by 3%. Still think vertical – 3 use cases 113
  • 114. 114
  • 115. 01 03 02 06 04 05 Monitoring & Updates Have the right talents & solutions Maintenance Select the right question Choose the performance metric Decide the level of explainability Identify Use the right architecture Have the talents in place Deploy Find the right data Structure annotate data Clean Data Data Decide on an acceptable error Test on the right scope Evaluate Select the right algorithm Tune the model Model Applied AI Lifecycle © PROPERTY OF AI BLACK BELT 115
  • 116. A refined way of seeing it.. 116
  • 117. 117