Alessandro Magnani, Data Scientist, @WalmartLabs at MLconf SF - 11/13/15


- Classifying products in a fast-changing environment makes it hard to evaluate classification models accurately over time. Labeling products is expensive, so labels must be used optimally.
- The proposed evaluation framework stores the sampling profile (the probability $p_i$ of each item $i$ being selected) for labeled items. Accuracy is computed from these sampling probabilities to account for non-uniform sampling.
- Given an existing sampling and extra budget, new items can be sampled optimally to minimize accuracy variance while satisfying budget constraints. This allows better reuse of existing labels over time as products and models change.


Classification Labels in a Fast Moving Environment
Alessandro Magnani
@WalmartLabs, Walmart Global eCommerce
California, USA
Friday 13th November, 2015

Classification Model Performance
[Diagram: items go through the classifier, which produces an estimate $\tilde{y}_i$; an editor provides the true label $y_i$ for N sampled items; evaluation compares the two to measure accuracy]
◮ correctly evaluating classification models is critical and requires labels
◮ labeling products is expensive
◮ need to correctly and optimally use labels

Practical challenges
[Diagram: same evaluation pipeline as on the previous slide (items, classifier estimate $\tilde{y}_i$, editor labels $y_i$ for N sampled items, accuracy evaluation)]
◮ items change over time

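A motivating example
compute accuracy over 1M items
1K labels budget
◮ sample 1K items and get labels $y_i$
◮ measure accuracy
\[ \frac{1}{1\mathrm{K}} \sum_{i=1}^{1\mathrm{K}} \mathbb{1}\{\tilde{y}_i = y_i\} \]
[Plot: sampling probability $p$ across the 1M items, at level 1/1K]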
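
The estimate on this slide can be sketched in a few lines of Python. This is only an illustration: the function name `sample_accuracy` and the toy labels are made up here, and the standard error uses the usual binomial approximation, which assumes a simple random sample and ignores the finite-population correction (negligible for 1K labels out of 1M items).

```python
import math


def sample_accuracy(y_true, y_pred):
    """Plain accuracy over a uniformly sampled, labeled test set.

    Returns (accuracy, approximate standard error).
    """
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("need two non-empty sequences of equal length")
    n = len(y_true)
    correct = sum(1 for y, y_hat in zip(y_true, y_pred) if y == y_hat)
    acc = correct / n
    # Binomial approximation of the standard error of a proportion.
    stderr = math.sqrt(acc * (1.0 - acc) / n)
    return acc, stderr


# Hypothetical toy data: editor labels vs. classifier estimates.
labels = ["shoe", "tv", "tv", "toy"]
estimates = ["shoe", "tv", "toy", "toy"]
acc, stderr = sample_accuracy(labels, estimates)
print(f"accuracy={acc:.3f} stderr={stderr:.3f}")
```

With 1K labels and an accuracy near 0.9, the same approximation gives a standard error of a little under 0.01, which is the kind of precision a 1K budget buys under uniform sampling.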

A motivating example
500K items added, compute accuracy on all 1.5M items
◮ use previous accuracy measure
◮ most likely inaccurate
[Plot: sampling probability $p$ over the item range, with ticks at 1M and 1.5M and level 1/1K]

A motivating example
500K items added, compute accuracy on all 1.5M items
500 labels extra budget
◮ sample 500 items from the 1.5M
◮ compute accuracy on new 500 labels
◮ previous 1K labels “wasted”
[Plot: sampling probability $p$ over the item range, with ticks at 1M and 1.5M and level 1/3K]

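A motivating example
500K items added, compute accuracy on all 1.5M items
only 250 labels extra budget?
◮ sample 250 items from new items
◮ need to account for difference in sampling
◮ accuracy:
\[ \frac{1}{1.5\mathrm{K}} \left( \sum_{i=1}^{1\mathrm{K}} \mathbb{1}\{\tilde{y}_i = y_i\} + 2 \sum_{i=1}^{250} \mathbb{1}\{\tilde{y}^{\mathrm{new}}_i = y^{\mathrm{new}}_i\} \right) \]
[Plot: sampling probability $p$ over the item range, with ticks at 1M and 1.5M and level 1/2K]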
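
The factor 2 and the 1/1.5K normalization follow from how many items each label represents: an old label stands for 1M / 1K = 1,000 items and a new label for 500K / 250 = 2,000 items. The sketch below makes that arithmetic explicit; the function name `combined_accuracy` and the example counts of correct predictions are hypothetical.

```python
def combined_accuracy(correct_old, correct_new,
                      n_old=1_000, n_new=250,
                      pop_old=1_000_000, pop_new=500_000):
    """Accuracy over old and new items from two separately sampled label sets.

    correct_old / correct_new: number of sampled items classified correctly.
    n_old / n_new: number of labels in each sample.
    pop_old / pop_new: number of items each sample was drawn from.
    """
    w_old = pop_old / n_old   # items represented by one old label (1,000)
    w_new = pop_new / n_new   # items represented by one new label (2,000)
    weighted_correct = w_old * correct_old + w_new * correct_new
    weighted_total = w_old * n_old + w_new * n_new   # 1.5M items in total
    return weighted_correct / weighted_total


# Hypothetical counts: 900 of the 1K old labels and 200 of the 250 new
# labels are classified correctly.
print(combined_accuracy(900, 200))  # same as (900 + 2 * 200) / 1500
```

Dividing numerator and denominator by 1,000 recovers the slide's form: the old indicators get weight 1, the new ones weight 2, and the total is 1.5K.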

A motivating example
What are the challenges?
◮ sampling new test labels for every measure is generally
expensive

Evaluation framework
◮ $p_i$ is the probability that item $i$ is selected for the test set (Bernoulli)
◮ each item carries $p_i$ and is marked if selected (store the sampling profile)
◮ accuracy:
\[ \frac{1}{\sum_{i\ \mathrm{selected}} \frac{1}{p_i}} \sum_{i\ \mathrm{selected}} \frac{1}{p_i}\, \mathbb{1}\{\tilde{y}_i = y_i\} \]
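
A minimal sketch of this bookkeeping, assuming each selected item is stored as a `(p_i, true_label, estimate)` tuple. The helper names `bernoulli_select` and `weighted_accuracy` and the toy records are illustrative, not from the talk. The estimator is the self-normalized inverse-probability form shown above, often called a Hájek estimator.

```python
import random


def bernoulli_select(probs, seed=0):
    """Independently select item i with probability probs[i].

    Returns the indices of the selected items; the caller stores probs[i]
    alongside each selected item (the "sampling profile").
    """
    rng = random.Random(seed)
    return [i for i, p in enumerate(probs) if rng.random() < p]


def weighted_accuracy(records):
    """Inverse-probability-weighted accuracy over the selected items.

    records: iterable of (p_i, true_label, estimate) for selected items.
    """
    numerator = 0.0
    denominator = 0.0
    for p, y, y_hat in records:
        if not 0.0 < p <= 1.0:
            raise ValueError("selected items must have 0 < p_i <= 1")
        weight = 1.0 / p
        denominator += weight
        if y == y_hat:
            numerator += weight
    if denominator == 0.0:
        raise ValueError("no selected items")
    return numerator / denominator


# Hypothetical records: two old items sampled at 1/1K, one new item at 1/2K.
records = [
    (0.001, "tv", "tv"),
    (0.001, "toy", "shoe"),
    (0.0005, "shoe", "shoe"),
]
print(weighted_accuracy(records))
```

When every selected item has the same $p_i$, the weights cancel and the function reduces to plain sample accuracy, which is the uniform-sampling case from the motivating example.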

Evaluation framework
given existing sampling $p_i$ and extra budget, how do we sample?
◮ minimize accuracy variance with budget constraint
◮ can be formulated as an optimization problem
◮ easy to solve
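
The slides do not spell out the optimization problem, so the following is one plausible formulation, stated as an assumption rather than as the talk's method. Let $\pi_i \ge p_i$ be the inclusion probability of item $i$ after the extra sampling. If unlabeled items are sampled independently, the expected number of new labels is $\sum_i (\pi_i - p_i)$, and with no prior knowledge about which items are classified correctly, a common proxy for the estimator's variance is $\sum_i 1/\pi_i$. Minimizing that proxy under the budget has a "water-filling" solution: raise every $p_i$ below a common level $c$ up to $c$. The function name `allocate_budget` is hypothetical.

```python
def allocate_budget(p, budget, iterations=100):
    """Water-filling allocation of an extra labeling budget.

    p: existing inclusion probabilities p_i (0 for never-sampled items).
    budget: expected number of additional labels.
    Returns target inclusion probabilities pi_i = max(p_i, c), where the
    level c is chosen so that sum(pi_i - p_i) matches the budget.
    """
    if budget < 0:
        raise ValueError("budget must be non-negative")
    if budget >= sum(1.0 - pi for pi in p):
        return [1.0] * len(p)          # enough budget to label everything
    lo, hi = min(p), 1.0
    for _ in range(iterations):        # bisection on the common level c
        c = (lo + hi) / 2.0
        spend = sum(max(c - pi, 0.0) for pi in p)
        if spend > budget:
            hi = c
        else:
            lo = c
    return [max(pi, lo) for pi in p]


# Scaled-down version of the motivating example: 1,000 old items already
# sampled at 1/1K and 500 new items never sampled.
existing = [0.001] * 1000 + [0.0] * 500
small = allocate_budget(existing, budget=0.25)
large = allocate_budget(existing, budget=0.5)
print(small[0], small[-1])   # old items unchanged, new items raised
print(large[0], large[-1])   # enough budget: all items at the same level
```

Under these assumptions an unlabeled item would be drawn with probability $(\pi_i - p_i) / (1 - p_i)$. The scaled-down run reproduces the pattern of the motivating example: a small budget goes entirely to the new items, and a large enough budget brings all items to the same sampling probability.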

Evaluation framework
it works as you’d expect as budget grows:
[Plots: sampling probability $p$ across items, shown in two panels]
◮ new budget (blue) used more where $p_i$ is smaller
◮ given enough budget we obtain uniform sampling

Extensions
◮ framework works more generally for supervised learning
◮ framework can work with a wide range of different metrics
◮ optimal sampling can use model posterior to reduce variance
◮ this framework can be used on the training side together with active learning
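
As an illustration of the "different metrics" point, the same inverse-probability weights can be applied to per-class precision and recall. This is a sketch under the same assumptions as the accuracy estimator (selected items stored with their $p_i$); the function name `weighted_precision_recall` and the toy records are hypothetical.

```python
def weighted_precision_recall(records, label):
    """Inverse-probability-weighted precision and recall for one class.

    records: iterable of (p_i, true_label, estimate) for selected items.
    label: the class to evaluate.
    Returns (precision, recall); a value is NaN when its denominator is zero.
    """
    tp = fp = fn = 0.0
    for p, y, y_hat in records:
        if not 0.0 < p <= 1.0:
            raise ValueError("selected items must have 0 < p_i <= 1")
        weight = 1.0 / p
        if y_hat == label and y == label:
            tp += weight          # predicted the class, and it was right
        elif y_hat == label:
            fp += weight          # predicted the class, but it was wrong
        elif y == label:
            fn += weight          # missed an item of the class
    precision = tp / (tp + fp) if tp + fp > 0 else float("nan")
    recall = tp / (tp + fn) if tp + fn > 0 else float("nan")
    return precision, recall


# Hypothetical records with non-uniform sampling probabilities.
records = [
    (0.5, "tv", "tv"),
    (0.25, "tv", "toy"),
    (0.5, "toy", "tv"),
    (0.25, "toy", "toy"),
]
print(weighted_precision_recall(records, "tv"))
```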


Vito Ostuni - The Voice: New Challenges in a Zero UI World

MLconf

Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...

MLconf

Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...

MLconf

Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...

MLconf

Neel Sundaresan - Teaching a machine to code

MLconf

Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...

MLconf

Soumith Chintala - Increasing the Impact of AI Through Better Software

MLconf

Roy Lowrance - Predicting Bond Prices: Regime Changes

MLconf

More from MLconf (20)

Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...

Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding

Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...

Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush

Josh Wills - Data Labeling as Religious Experience

Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...

Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...

Meghana Ravikumar - Optimized Image Classification on the Cheap

Noam Finkelstein - The Importance of Modeling Data Collection

June Andrews - The Uncanny Valley of ML

Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks

Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...

Vito Ostuni - The Voice: New Challenges in a Zero UI World

Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...

Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...

Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...

Neel Sundaresan - Teaching a machine to code

Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...

Soumith Chintala - Increasing the Impact of AI Through Better Software

Roy Lowrance - Predicting Bond Prices: Regime Changes

Alessandro Magnani, Data Scientist, @WalmartLabs at MLconf SF - 11/13/15

1. Classification Labels in a Fast Moving Environment
Alessandro Magnani, @WalmartLabs, Walmart Global eCommerce, California, USA
Friday 13th November, 2015

2. Classification Model Performance
[Diagram: the classifier produces an estimate $\tilde{y}_i$ for each item; an editor supplies the true label $y_i$ for N sampled items; evaluation compares the two to give accuracy]
◮ correctly evaluating classification models is critical and requires labels
◮ labeling products is expensive
◮ need to correctly and optimally use labels

3. Classification Model Performance: measuring accuracy
Common approach:
◮ sample N items uniformly at random
◮ compute accuracy $\frac{1}{N} \sum_{i=1}^{N} \mathbb{1}\{\tilde{y}_i = y_i\}$
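The common approach above can be sketched in a few lines of Python. This is a minimal illustration, not code from the talk: the function name `estimate_accuracy_uniform` and the `get_label` callback (standing in for the editor who labels only the sampled items) are assumptions made for the example.

```python
import random


def estimate_accuracy_uniform(predictions, get_label, n, seed=0):
    """Estimate accuracy from n items drawn uniformly at random.

    predictions: list of classifier outputs, one per item.
    get_label:   hypothetical callback returning the true label of item i;
                 it is only called for the n sampled items, mirroring the
                 fact that labels are expensive.
    """
    rng = random.Random(seed)
    # Uniform sampling without replacement over item indices.
    sampled = rng.sample(range(len(predictions)), n)
    correct = sum(1 for i in sampled if predictions[i] == get_label(i))
    return correct / n
```

The estimate is only meaningful for the item set that was sampled from, which is the limitation the following slides address.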

6. Practical challenges
◮ items change over time
◮ evaluation required over multiple subsets
◮ existing labels potentially hard to reuse

7. A motivating example
Compute accuracy over 1M items with a budget of 1K labels.
◮ sample 1K items and get labels $y_i$
◮ measure accuracy $\frac{1}{1\mathrm{K}} \sum_{i=1}^{1\mathrm{K}} \mathbb{1}\{\tilde{y}_i = y_i\}$
[Figure: sampling probability p versus items, with a tick at 1M and level 1/1K]

8. A motivating example
500K items added; compute accuracy on all 1.5M items.
◮ use previous accuracy measure
◮ most likely inaccurate
[Figure: sampling probability p versus items, with ticks at 1M and 1.5M and level 1/1K]

9. A motivating example
500K items added; compute accuracy on all 1.5M items, with an extra budget of 500 labels.
◮ sample 500 items from the 1.5M
◮ compute accuracy on new 500 labels
◮ previous 1K labels "wasted"
[Figure: sampling probability p versus items, with ticks at 1M and 1.5M and level 1/3K]

10. A motivating example
500K items added; compute accuracy on all 1.5M items, with an extra budget of 500 labels: a better approach.
◮ sample 500 items from new items
◮ compute accuracy on all 1.5K labels
◮ no label "wasted"
[Figure: sampling probability p versus items, with ticks at 1M and 1.5M and level 1/1K]

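11. A motivating example
500K items added; compute accuracy on all 1.5M items. What if the extra budget is only 250 labels?
◮ sample 250 items from new items
◮ need to account for difference in sampling
◮ accuracy: $\frac{1}{1.5\mathrm{K}} \left( \sum_{i=1}^{1\mathrm{K}} \mathbb{1}\{\tilde{y}_i = y_i\} + 2 \sum_{i=1}^{250} \mathbb{1}\{\tilde{y}^{\mathrm{new}}_i = y^{\mathrm{new}}_i\} \right)$
[Figure: sampling probability p versus items, with ticks at 1M and 1.5M and level 1/2K]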
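The factor of 2 and the 1/1.5K normalizer in the slide 11 formula follow from weighting each label by the inverse of its sampling probability. The short derivation below spells this out; the weight symbols $w_{\mathrm{old}}$ and $w_{\mathrm{new}}$ are notation introduced here for illustration and do not appear on the slides.

```latex
% Old items: 1K labels drawn from 1M items; new items: 250 labels from 500K.
p_{\mathrm{old}} = \frac{1\mathrm{K}}{1\mathrm{M}} = \frac{1}{1\mathrm{K}}, \quad
w_{\mathrm{old}} = \frac{1}{p_{\mathrm{old}}} = 1\mathrm{K}, \qquad
p_{\mathrm{new}} = \frac{250}{500\mathrm{K}} = \frac{1}{2\mathrm{K}}, \quad
w_{\mathrm{new}} = \frac{1}{p_{\mathrm{new}}} = 2\mathrm{K}

% Sum of all weights (the number of items the labels stand for):
1\mathrm{K} \cdot w_{\mathrm{old}} + 250 \cdot w_{\mathrm{new}}
  = 1\mathrm{M} + 0.5\mathrm{M} = 1.5\mathrm{M}

% Weighted accuracy, then divide numerator and denominator by 1K:
\frac{1\mathrm{K} \sum_{i=1}^{1\mathrm{K}} \mathbb{1}\{\tilde{y}_i = y_i\}
      + 2\mathrm{K} \sum_{i=1}^{250} \mathbb{1}\{\tilde{y}^{\mathrm{new}}_i = y^{\mathrm{new}}_i\}}
     {1.5\mathrm{M}}
= \frac{1}{1.5\mathrm{K}} \left( \sum_{i=1}^{1\mathrm{K}} \mathbb{1}\{\tilde{y}_i = y_i\}
      + 2 \sum_{i=1}^{250} \mathbb{1}\{\tilde{y}^{\mathrm{new}}_i = y^{\mathrm{new}}_i\} \right)
```

Each new label stands for twice as many items as an old one, so it counts twice.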

15. A motivating example: what are the challenges?
◮ sampling new test labels for every measure is generally expensive
◮ knowing how previous labels were sampled is required to optimally sample new items for test
◮ computing accuracy using all labels requires knowledge of the sampling profile
◮ over time, reusing labels can become very tricky

20. Evaluation framework
◮ $p_i$ is the probability that item i is selected for test (Bernoulli)
◮ each item carries $p_i$ and is marked if selected (store the sampling profile)
◮ accuracy: $\frac{1}{\sum_{i\ \mathrm{selected}} \frac{1}{p_i}} \sum_{i\ \mathrm{selected}} \frac{1}{p_i} \mathbb{1}\{\tilde{y}_i = y_i\}$
◮ for evaluation to be possible, $p_j > 0$ for all j, labeled or unlabeled
◮ all labels are used
◮ with uniform sampling this is simply "standard" accuracy
◮ very closely related to importance sampling
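A minimal Python sketch of the framework on slide 20 follows. It assumes the sampling profile is held as a plain list of probabilities and a list of selection flags; the function names `bernoulli_sample` and `weighted_accuracy` are illustrative and not taken from the talk.

```python
import random


def bernoulli_sample(probs, seed=0):
    """Mark each item as selected independently with its own probability p_i."""
    rng = random.Random(seed)
    return [rng.random() < p for p in probs]


def weighted_accuracy(selected, probs, predictions, labels):
    """Accuracy with each selected item weighted by 1 / p_i.

    labels[i] is only read where selected[i] is True, so unlabeled
    items may hold any placeholder value.
    """
    weighted_correct = 0.0
    total_weight = 0.0
    for i, is_selected in enumerate(selected):
        if not is_selected:
            continue
        if probs[i] <= 0.0:
            raise ValueError("a selected item must have p_i > 0")
        weight = 1.0 / probs[i]
        total_weight += weight
        if predictions[i] == labels[i]:
            weighted_correct += weight
    if total_weight == 0.0:
        raise ValueError("no items were selected")
    return weighted_correct / total_weight
```

When every $p_i$ is equal, the weights cancel and the function returns the plain fraction of correct labels, matching the "standard" accuracy remark on the slide.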

21. Evaluation framework
Given existing sampling $p_i$ and extra budget, how do we sample?
◮ minimize accuracy variance with budget constraint
◮ can be formulated as an optimization problem
◮ easy to solve
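The slides do not state the optimization problem explicitly, so the sketch below is one plausible formulation rather than the author's method. It assumes independent Bernoulli sampling, treats every item as contributing equally to the variance, and therefore minimizes $\sum_i 1/q_i$ over new inclusion probabilities $q_i \ge p_i$ subject to $\sum_i (q_i - p_i) \le B$, where $B$ is the expected number of extra labels. Under those assumptions the solution has a water-filling form, $q_i = \max(p_i, \tau)$, with the level $\tau$ found by bisection. The name `allocate_extra_budget` is hypothetical.

```python
def allocate_extra_budget(probs, extra_budget, iterations=100):
    """Raise inclusion probabilities to a common level tau within a budget.

    probs:        existing inclusion probabilities p_i, each in [0, 1].
    extra_budget: expected number of additional labels (assumed cost model:
                  sum of q_i - p_i over all items).
    Returns the new inclusion probabilities q_i = max(p_i, tau).
    """
    def cost(tau):
        # Expected extra labels needed to lift every p_i below tau up to tau.
        return sum(max(tau - p, 0.0) for p in probs)

    if extra_budget >= cost(1.0):
        # Enough budget to label everything.
        return [1.0] * len(probs)

    low, high = 0.0, 1.0
    for _ in range(iterations):
        mid = (low + high) / 2.0
        if cost(mid) > extra_budget:
            high = mid
        else:
            low = mid
    tau = low
    return [max(p, tau) for p in probs]
```

To realize the new profile, an item not yet selected would be sampled in the second round with probability $(q_i - p_i) / (1 - p_i)$, which makes its overall inclusion probability $q_i$; the stored $p_i$ is then replaced by $q_i$. With this formulation the budget goes first to the items with the smallest $p_i$, and a large enough budget lifts every item to the same level.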

22. Evaluation framework
It works as you'd expect as the budget grows:
[Figure: two plots of sampling probability p, with the new budget shown in blue]
◮ new budget (blue) used more where $p_i$ is smaller
◮ given enough budget we obtain uniform sampling

23. Extensions
◮ framework works more generally for supervised learning
◮ framework can work with a wide range of different metrics
◮ optimal sampling can use model posterior to reduce variance
◮ this framework can be used on the training side together with active learning
