SlideShare a Scribd company logo
1 of 23
Download to read offline
Classification Labels in a Fast Moving Environment
Classification Labels in a Fast Moving
Environment
Alessandro Magnani
@WalmartLabs, Walmart Global eCommerce
California, USA
Friday 13th November, 2015
Classification Labels in a Fast Moving Environment
Classification Model Performance
Items Classifier
Editor
N sampled items true label yi
estimate ˜yi
accuracyEvaluation
◮ correctly evaluating classification models is critical and
requires labels
◮ labeling products is expensive
◮ need to correctly and optimally use labels
Classification Labels in a Fast Moving Environment
Classification Model Performance
Items Classifier
Editor
N sampled items true label yi
estimate ˜yi
accuracyEvaluation
Measure accuracy common approach:
◮ sample uniformly at random N items
◮ compute accuracy 1
N
N
i=1 ½{˜yi =yi }
Classification Labels in a Fast Moving Environment
Practical challenges
Items Classifier
Editor
N sampled items true label yi
estimate ˜yi
accuracyEvaluation
◮ items change over time
Classification Labels in a Fast Moving Environment
Practical challenges
Items Classifier
Editor
N sampled items true label yi
estimate ˜yi
accuracyEvaluation
◮ items change over time
◮ evaluation required over multiple subsets
Classification Labels in a Fast Moving Environment
Practical challenges
Items Classifier
Editor
N sampled items true label yi
estimate ˜yi
accuracyEvaluation
◮ items change over time
◮ evaluation required over multiple subsets
◮ existing labels potentially hard to reuse
Classification Labels in a Fast Moving Environment
A motivating example
compute accuracy over 1M items
1K labels budget
◮ sample 1K items and get
labels yi
◮ measure accuracy
1
1K
1K
i=1 ½{˜yi =yi }
1M
p
1
1K
Classification Labels in a Fast Moving Environment
A motivating example
500K items added, compute accuracy on all 1.5M items
◮ use previous accuracy
measure
◮ most likely inaccurate
1M 1.5M
p
1
1K
Classification Labels in a Fast Moving Environment
A motivating example
500K items added, compute accuracy on all 1.5M items
500 labels extra budget
◮ sample 500 items from the
1.5M
◮ compute accuracy on new
500 labels
◮ previous 1K labels “wasted”
1M 1.5M
p
1
3K
Classification Labels in a Fast Moving Environment
A motivating example
500K items added, compute accuracy on all 1.5M items
500 labels extra budget, better approach
◮ sample 500 items from new
items
◮ compute accuracy on all 1.5K
labels
◮ no label “wasted”
1M 1.5M
p
1
1K
Classification Labels in a Fast Moving Environment
A motivating example
500K items added, compute accuracy on all 1.5M items
only 250 labels extra budget?
◮ sample 250 items from new
items
◮ need to account for difference
in sampling
◮ accuracy:
1M 1.5M
p
1
2K
1
1.5K
1K
i=1 ½{˜yi =yi } + 2 250
i=1 ½{˜ynew
i =ynew
i }
Classification Labels in a Fast Moving Environment
A motivating example
What are the challenges?
◮ sampling new test labels for every measure is generally
expensive
Classification Labels in a Fast Moving Environment
A motivating example
What are the challenges?
◮ sampling new test labels for every measure is generally
expensive
◮ knowing how previous labels were sampled required to
optimally sample new items for test
Classification Labels in a Fast Moving Environment
A motivating example
What are the challenges?
◮ sampling new test labels for every measure is generally
expensive
◮ knowing how previous labels were sampled required to
optimally sample new items for test
◮ computing accuracy using all labels requires knowledge of
sampling profile
Classification Labels in a Fast Moving Environment
A motivating example
What are the challenges?
◮ sampling new test labels for every measure is generally
expensive
◮ knowing how previous labels were sampled required to
optimally sample new items for test
◮ computing accuracy using all labels requires knowledge of
sampling profile
◮ overtime reusing labels can become very tricky
Classification Labels in a Fast Moving Environment
Evaluation framework
◮ pi is probability of item i to be selected for test (Bernoulli)
◮ each item carries pi and is marked if selected (store the
sampling profile)
◮ accuracy:
1
i selected
1
pi i selected
1
pi
½{˜yi =yi }
Classification Labels in a Fast Moving Environment
Evaluation framework
◮ pi is probability of item i to be selected for test (Bernoulli)
◮ each item carries pi and is marked if selected (store the
sampling profile)
◮ accuracy:
1
i selected
1
pi i selected
1
pi
½{˜yi =yi }
◮ for evaluation to be possible pj > 0 for all j labeled/unlabeled
Classification Labels in a Fast Moving Environment
Evaluation framework
◮ pi is probability of item i to be selected for test (Bernoulli)
◮ each item carries pi and is marked if selected (store the
sampling profile)
◮ accuracy:
1
i selected
1
pi i selected
1
pi
½{˜yi =yi }
◮ for evaluation to be possible pj > 0 for all j labeled/unlabeled
◮ all labels are used
Classification Labels in a Fast Moving Environment
Evaluation framework
◮ pi is probability of item i to be selected for test (Bernoulli)
◮ each item carries pi and is marked if selected (store the
sampling profile)
◮ accuracy:
1
i selected
1
pi i selected
1
pi
½{˜yi =yi }
◮ for evaluation to be possible pj > 0 for all j labeled/unlabeled
◮ all labels are used
◮ with uniform sampling this is simply “standard” accuracy
Classification Labels in a Fast Moving Environment
Evaluation framework
◮ pi is probability of item i to be selected for test (Bernoulli)
◮ each item carries pi and is marked if selected (store the
sampling profile)
◮ accuracy:
1
i selected
1
pi i selected
1
pi
½{˜yi =yi }
◮ for evaluation to be possible pj > 0 for all j labeled/unlabeled
◮ all labels are used
◮ with uniform sampling this is simply “standard” accuracy
◮ very closely related to importance sampling
Classification Labels in a Fast Moving Environment
Evaluation framework
given existing sampling pi and extra budget
how do we sample?
◮ minimize accuracy variance with budget constraint
◮ can be formulated as an optimization problem
◮ easy to solve
Classification Labels in a Fast Moving Environment
Evaluation framework
it works as you’d expect as budget grows:
p p
◮ new budget (blue) used more where pi is smaller
◮ given enough budget we obtain uniform sampling
Classification Labels in a Fast Moving Environment
Extensions
◮ framework works more generally for supervised learning
◮ framework can work with a wide range of different metrics
◮ optimal sampling can use model posterior to reduce variance
◮ this framework can be used on the training side together with
active learning

More Related Content

What's hot

What's hot (19)

Root Cause Analysis | 5 whys | Tools of accident investigation I Gaurav Singh...
Root Cause Analysis | 5 whys | Tools of accident investigation I Gaurav Singh...Root Cause Analysis | 5 whys | Tools of accident investigation I Gaurav Singh...
Root Cause Analysis | 5 whys | Tools of accident investigation I Gaurav Singh...
 
Experiences with Semi-Scripted Exploratory Testing
Experiences with Semi-Scripted Exploratory TestingExperiences with Semi-Scripted Exploratory Testing
Experiences with Semi-Scripted Exploratory Testing
 
Lifelong Analysis Skills for Explorers and Process Junkies alike!
Lifelong Analysis Skills for Explorers and Process Junkies alike!Lifelong Analysis Skills for Explorers and Process Junkies alike!
Lifelong Analysis Skills for Explorers and Process Junkies alike!
 
A Guide to the Five Whys Technique
A Guide to the Five Whys TechniqueA Guide to the Five Whys Technique
A Guide to the Five Whys Technique
 
HR Analysis: Really Cool Analytical Tools
HR Analysis: Really Cool Analytical ToolsHR Analysis: Really Cool Analytical Tools
HR Analysis: Really Cool Analytical Tools
 
5 whys
5 whys5 whys
5 whys
 
Failing: The Very Human Side of Testing
Failing: The Very Human Side of TestingFailing: The Very Human Side of Testing
Failing: The Very Human Side of Testing
 
5 why analysis training presentaion
5 why analysis training presentaion5 why analysis training presentaion
5 why analysis training presentaion
 
Testing for everyone agile yorkshire
Testing for everyone agile yorkshireTesting for everyone agile yorkshire
Testing for everyone agile yorkshire
 
Asking Questions and Writing Effectively
Asking Questions and Writing EffectivelyAsking Questions and Writing Effectively
Asking Questions and Writing Effectively
 
5 why analysis
5 why analysis5 why analysis
5 why analysis
 
5 whys
5 whys5 whys
5 whys
 
5 why analysis
5 why analysis5 why analysis
5 why analysis
 
Why why analysis
Why why analysisWhy why analysis
Why why analysis
 
User Research @ Bitspiration2013
User Research @ Bitspiration2013User Research @ Bitspiration2013
User Research @ Bitspiration2013
 
4YFN 2016 Guerrilla UX
4YFN 2016 Guerrilla UX4YFN 2016 Guerrilla UX
4YFN 2016 Guerrilla UX
 
Root cause analysis using 5 whys
Root cause analysis using 5 whysRoot cause analysis using 5 whys
Root cause analysis using 5 whys
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
 
Beyond the (Survey) Monkey Business
Beyond the (Survey) Monkey BusinessBeyond the (Survey) Monkey Business
Beyond the (Survey) Monkey Business
 

Viewers also liked

Ramaciotti digital media marketing 2012 9
Ramaciotti digital media marketing 2012 9Ramaciotti digital media marketing 2012 9
Ramaciotti digital media marketing 2012 9
Max Ramaciotti
 
Geetu Ambwani, Principal Data Scientist, Huffington Post at MLconf NYC - 4/15/16
Geetu Ambwani, Principal Data Scientist, Huffington Post at MLconf NYC - 4/15/16Geetu Ambwani, Principal Data Scientist, Huffington Post at MLconf NYC - 4/15/16
Geetu Ambwani, Principal Data Scientist, Huffington Post at MLconf NYC - 4/15/16
MLconf
 

Viewers also liked (12)

Ramaciotti digital media marketing 2012 9
Ramaciotti digital media marketing 2012 9Ramaciotti digital media marketing 2012 9
Ramaciotti digital media marketing 2012 9
 
Geetu Ambwani, Principal Data Scientist, Huffington Post at MLconf NYC - 4/15/16
Geetu Ambwani, Principal Data Scientist, Huffington Post at MLconf NYC - 4/15/16Geetu Ambwani, Principal Data Scientist, Huffington Post at MLconf NYC - 4/15/16
Geetu Ambwani, Principal Data Scientist, Huffington Post at MLconf NYC - 4/15/16
 
Starting a Taxonomy Project (Presented at SLA 2013 Conference)
Starting a Taxonomy Project (Presented at SLA 2013 Conference)Starting a Taxonomy Project (Presented at SLA 2013 Conference)
Starting a Taxonomy Project (Presented at SLA 2013 Conference)
 
Animashree Anandkumar, Electrical Engineering and CS Dept, UC Irvine at MLcon...
Animashree Anandkumar, Electrical Engineering and CS Dept, UC Irvine at MLcon...Animashree Anandkumar, Electrical Engineering and CS Dept, UC Irvine at MLcon...
Animashree Anandkumar, Electrical Engineering and CS Dept, UC Irvine at MLcon...
 
Narayanan Sundaram, Research Scientist, Intel Labs at MLconf SF - 11/13/15
Narayanan Sundaram, Research Scientist, Intel Labs at MLconf SF - 11/13/15Narayanan Sundaram, Research Scientist, Intel Labs at MLconf SF - 11/13/15
Narayanan Sundaram, Research Scientist, Intel Labs at MLconf SF - 11/13/15
 
Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16
Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16
Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16
 
Melanie Warrick, Deep Learning Engineer, Skymind.io at MLconf SF - 11/13/15
Melanie Warrick, Deep Learning Engineer, Skymind.io at MLconf SF - 11/13/15Melanie Warrick, Deep Learning Engineer, Skymind.io at MLconf SF - 11/13/15
Melanie Warrick, Deep Learning Engineer, Skymind.io at MLconf SF - 11/13/15
 
Justin Basilico, Research/ Engineering Manager at Netflix at MLconf SF - 11/1...
Justin Basilico, Research/ Engineering Manager at Netflix at MLconf SF - 11/1...Justin Basilico, Research/ Engineering Manager at Netflix at MLconf SF - 11/1...
Justin Basilico, Research/ Engineering Manager at Netflix at MLconf SF - 11/1...
 
Shop vertical classification - Meetup Presentation
Shop vertical classification - Meetup PresentationShop vertical classification - Meetup Presentation
Shop vertical classification - Meetup Presentation
 
Alex Smola, Professor in the Machine Learning Department, Carnegie Mellon Uni...
Alex Smola, Professor in the Machine Learning Department, Carnegie Mellon Uni...Alex Smola, Professor in the Machine Learning Department, Carnegie Mellon Uni...
Alex Smola, Professor in the Machine Learning Department, Carnegie Mellon Uni...
 
Taxonomies for E-commerce
Taxonomies for E-commerceTaxonomies for E-commerce
Taxonomies for E-commerce
 
7 Machine Learning techniques in pratice in a Startup (Robson Motta - WIAMS U...
7 Machine Learning techniques in pratice in a Startup (Robson Motta - WIAMS U...7 Machine Learning techniques in pratice in a Startup (Robson Motta - WIAMS U...
7 Machine Learning techniques in pratice in a Startup (Robson Motta - WIAMS U...
 

Similar to Alessandro Magnani, Data Scientist, @WalmartLabs at MLconf SF - 11/13/15

Understanding the Validity of a LCA
Understanding the Validity of a LCAUnderstanding the Validity of a LCA
Understanding the Validity of a LCA
Laurel McEwen
 
Lecture9 conjoint analysis
Lecture9 conjoint analysisLecture9 conjoint analysis
Lecture9 conjoint analysis
Jameson Watts
 

Similar to Alessandro Magnani, Data Scientist, @WalmartLabs at MLconf SF - 11/13/15 (20)

Intro to A/B Testing by Ever's Senior Product Manager
Intro to A/B Testing by Ever's Senior Product ManagerIntro to A/B Testing by Ever's Senior Product Manager
Intro to A/B Testing by Ever's Senior Product Manager
 
A/B Testing: Common Pitfalls and How to Avoid Them
A/B Testing: Common Pitfalls and How to Avoid ThemA/B Testing: Common Pitfalls and How to Avoid Them
A/B Testing: Common Pitfalls and How to Avoid Them
 
Your A/B Tests are Lying to You
Your A/B Tests are Lying to YouYour A/B Tests are Lying to You
Your A/B Tests are Lying to You
 
Your A/B Tests are Lying to You
Your A/B Tests are Lying to YouYour A/B Tests are Lying to You
Your A/B Tests are Lying to You
 
Anton Muzhailo - Practical Test Process Improvement using ISTQB
Anton Muzhailo - Practical Test Process Improvement using ISTQBAnton Muzhailo - Practical Test Process Improvement using ISTQB
Anton Muzhailo - Practical Test Process Improvement using ISTQB
 
Understanding the Validity of a LCA
Understanding the Validity of a LCAUnderstanding the Validity of a LCA
Understanding the Validity of a LCA
 
Predicting Helpfulness of User-Generated Product Reviews Through Analytical M...
Predicting Helpfulness of User-Generated Product Reviews Through Analytical M...Predicting Helpfulness of User-Generated Product Reviews Through Analytical M...
Predicting Helpfulness of User-Generated Product Reviews Through Analytical M...
 
Lecture9 conjoint analysis
Lecture9 conjoint analysisLecture9 conjoint analysis
Lecture9 conjoint analysis
 
Meta-Analyses in Experimentation: The Whats and Hows
Meta-Analyses in Experimentation: The Whats and HowsMeta-Analyses in Experimentation: The Whats and Hows
Meta-Analyses in Experimentation: The Whats and Hows
 
"How we killed 80% of features and increased outcomes of a/b testing by 100%"...
"How we killed 80% of features and increased outcomes of a/b testing by 100%"..."How we killed 80% of features and increased outcomes of a/b testing by 100%"...
"How we killed 80% of features and increased outcomes of a/b testing by 100%"...
 
Automated solutions for product and pricing research
Automated solutions for product and pricing researchAutomated solutions for product and pricing research
Automated solutions for product and pricing research
 
Revisiting the Experimental Design Choices for Approaches for the Automated R...
Revisiting the Experimental Design Choices for Approaches for the Automated R...Revisiting the Experimental Design Choices for Approaches for the Automated R...
Revisiting the Experimental Design Choices for Approaches for the Automated R...
 
Product Experimentation Pitfalls & How to Avoid Them
Product Experimentation Pitfalls & How to Avoid Them Product Experimentation Pitfalls & How to Avoid Them
Product Experimentation Pitfalls & How to Avoid Them
 
Product Experimentation Pitfalls & How to Avoid Them
Product Experimentation Pitfalls & How to Avoid Them Product Experimentation Pitfalls & How to Avoid Them
Product Experimentation Pitfalls & How to Avoid Them
 
2 anton muzhailo - formal test process improvement. how to invest to the te...
2   anton muzhailo - formal test process improvement. how to invest to the te...2   anton muzhailo - formal test process improvement. how to invest to the te...
2 anton muzhailo - formal test process improvement. how to invest to the te...
 
slide->title; ?>
slide->title; ?>slide->title; ?>
slide->title; ?>
 
KDD
KDDKDD
KDD
 
Data-Driven Decision Making by Expedia Sr PM
Data-Driven Decision Making by Expedia Sr PMData-Driven Decision Making by Expedia Sr PM
Data-Driven Decision Making by Expedia Sr PM
 
PREP Webinar June 18th 2015 06-18
PREP Webinar June 18th 2015 06-18PREP Webinar June 18th 2015 06-18
PREP Webinar June 18th 2015 06-18
 
Can I Test More Than One Variable at a Time? Statisticians answer some of th...
Can I Test More Than One Variable at a  Time? Statisticians answer some of th...Can I Test More Than One Variable at a  Time? Statisticians answer some of th...
Can I Test More Than One Variable at a Time? Statisticians answer some of th...
 

More from MLconf

Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingTed Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
MLconf
 
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
MLconf
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
MLconf
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
MLconf
 
Vito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldVito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI World
MLconf
 

More from MLconf (20)

Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
 
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingTed Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
 
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
 
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushIgor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
 
Josh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceJosh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious Experience
 
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
 
Meghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMeghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the Cheap
 
Noam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionNoam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data Collection
 
June Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLJune Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of ML
 
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksSneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
 
Vito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldVito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI World
 
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
 
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
 
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
 
Neel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeNeel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to code
 
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
 
Soumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareSoumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better Software
 
Roy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesRoy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime Changes
 

Recently uploaded

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Recently uploaded (20)

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 

Alessandro Magnani, Data Scientist, @WalmartLabs at MLconf SF - 11/13/15

  • 1. Classification Labels in a Fast Moving Environment Classification Labels in a Fast Moving Environment Alessandro Magnani @WalmartLabs, Walmart Global eCommerce California, USA Friday 13th November, 2015
  • 2. Classification Labels in a Fast Moving Environment Classification Model Performance Items Classifier Editor N sampled items true label yi estimate ˜yi accuracyEvaluation ◮ correctly evaluating classification models is critical and requires labels ◮ labeling products is expensive ◮ need to correctly and optimally use labels
  • 3. Classification Labels in a Fast Moving Environment Classification Model Performance Items Classifier Editor N sampled items true label yi estimate ˜yi accuracyEvaluation Measure accuracy common approach: ◮ sample uniformly at random N items ◮ compute accuracy 1 N N i=1 ½{˜yi =yi }
  • 4. Classification Labels in a Fast Moving Environment Practical challenges Items Classifier Editor N sampled items true label yi estimate ˜yi accuracyEvaluation ◮ items change over time
  • 5. Classification Labels in a Fast Moving Environment Practical challenges Items Classifier Editor N sampled items true label yi estimate ˜yi accuracyEvaluation ◮ items change over time ◮ evaluation required over multiple subsets
  • 6. Classification Labels in a Fast Moving Environment Practical challenges Items Classifier Editor N sampled items true label yi estimate ˜yi accuracyEvaluation ◮ items change over time ◮ evaluation required over multiple subsets ◮ existing labels potentially hard to reuse
  • 7. Classification Labels in a Fast Moving Environment A motivating example compute accuracy over 1M items 1K labels budget ◮ sample 1K items and get labels yi ◮ measure accuracy 1 1K 1K i=1 ½{˜yi =yi } 1M p 1 1K
  • 8. Classification Labels in a Fast Moving Environment A motivating example 500K items added, compute accuracy on all 1.5M items ◮ use previous accuracy measure ◮ most likely inaccurate 1M 1.5M p 1 1K
  • 9. Classification Labels in a Fast Moving Environment A motivating example 500K items added, compute accuracy on all 1.5M items 500 labels extra budget ◮ sample 500 items from the 1.5M ◮ compute accuracy on new 500 labels ◮ previous 1K labels “wasted” 1M 1.5M p 1 3K
  • 10. Classification Labels in a Fast Moving Environment A motivating example 500K items added, compute accuracy on all 1.5M items 500 labels extra budget, better approach ◮ sample 500 items from new items ◮ compute accuracy on all 1.5K labels ◮ no label “wasted” 1M 1.5M p 1 1K
  • 11. Classification Labels in a Fast Moving Environment A motivating example 500K items added, compute accuracy on all 1.5M items only 250 labels extra budget? ◮ sample 250 items from new items ◮ need to account for difference in sampling ◮ accuracy: 1M 1.5M p 1 2K 1 1.5K 1K i=1 ½{˜yi =yi } + 2 250 i=1 ½{˜ynew i =ynew i }
  • 12. Classification Labels in a Fast Moving Environment A motivating example What are the challenges? ◮ sampling new test labels for every measure is generally expensive
  • 13. Classification Labels in a Fast Moving Environment A motivating example What are the challenges? ◮ sampling new test labels for every measure is generally expensive ◮ knowing how previous labels were sampled required to optimally sample new items for test
  • 14. Classification Labels in a Fast Moving Environment A motivating example What are the challenges? ◮ sampling new test labels for every measure is generally expensive ◮ knowing how previous labels were sampled required to optimally sample new items for test ◮ computing accuracy using all labels requires knowledge of sampling profile
  • 15. Classification Labels in a Fast Moving Environment A motivating example What are the challenges? ◮ sampling new test labels for every measure is generally expensive ◮ knowing how previous labels were sampled required to optimally sample new items for test ◮ computing accuracy using all labels requires knowledge of sampling profile ◮ overtime reusing labels can become very tricky
  • 16. Classification Labels in a Fast Moving Environment Evaluation framework ◮ pi is probability of item i to be selected for test (Bernoulli) ◮ each item carries pi and is marked if selected (store the sampling profile) ◮ accuracy: 1 i selected 1 pi i selected 1 pi ½{˜yi =yi }
  • 17. Classification Labels in a Fast Moving Environment Evaluation framework ◮ pi is probability of item i to be selected for test (Bernoulli) ◮ each item carries pi and is marked if selected (store the sampling profile) ◮ accuracy: 1 i selected 1 pi i selected 1 pi ½{˜yi =yi } ◮ for evaluation to be possible pj > 0 for all j labeled/unlabeled
  • 18. Classification Labels in a Fast Moving Environment Evaluation framework ◮ pi is probability of item i to be selected for test (Bernoulli) ◮ each item carries pi and is marked if selected (store the sampling profile) ◮ accuracy: 1 i selected 1 pi i selected 1 pi ½{˜yi =yi } ◮ for evaluation to be possible pj > 0 for all j labeled/unlabeled ◮ all labels are used
  • 19. Classification Labels in a Fast Moving Environment Evaluation framework ◮ pi is probability of item i to be selected for test (Bernoulli) ◮ each item carries pi and is marked if selected (store the sampling profile) ◮ accuracy: 1 i selected 1 pi i selected 1 pi ½{˜yi =yi } ◮ for evaluation to be possible pj > 0 for all j labeled/unlabeled ◮ all labels are used ◮ with uniform sampling this is simply “standard” accuracy
  • 20. Classification Labels in a Fast Moving Environment Evaluation framework ◮ pi is probability of item i to be selected for test (Bernoulli) ◮ each item carries pi and is marked if selected (store the sampling profile) ◮ accuracy: 1 i selected 1 pi i selected 1 pi ½{˜yi =yi } ◮ for evaluation to be possible pj > 0 for all j labeled/unlabeled ◮ all labels are used ◮ with uniform sampling this is simply “standard” accuracy ◮ very closely related to importance sampling
  • 21. Classification Labels in a Fast Moving Environment Evaluation framework given existing sampling pi and extra budget how do we sample? ◮ minimize accuracy variance with budget constraint ◮ can be formulated as an optimization problem ◮ easy to solve
  • 22. Classification Labels in a Fast Moving Environment Evaluation framework it works as you’d expect as budget grows: p p ◮ new budget (blue) used more where pi is smaller ◮ given enough budget we obtain uniform sampling
  • 23. Classification Labels in a Fast Moving Environment Extensions ◮ framework works more generally for supervised learning ◮ framework can work with a wide range of different metrics ◮ optimal sampling can use model posterior to reduce variance ◮ this framework can be used on the training side together with active learning