SlideShare a Scribd company logo
1 of 28
Review Analysis:
An Approach to Leveraging User-Generated Content in
the Context of Retail
Jennifer Prendki, Principal Data Scientist
Walmart Global e-Commerce
California, USA
The Machine Learning Conference, San Francisco, CA
11/11/2016
Outline
• Business motivation
• Algorithm Pipeline
• Feature Space Computation
• Sentiment Capture
• Real-Life Examples and Results
• Future Work and Conclusions
2
 Business
Motivation
 Algorithm
 Feature Space
Computation
 Sentiment
Capture
 Real-Life
Examples
 Future Work and
Conclusions
Business Motivation
3
 Business
Motivation
 Algorithm
 Feature Space
Computation
 Sentiment
Capture
 Real-Life
Examples
 Future Work and
Conclusions
Business Motivation
4
 Business
Motivation
 Algorithm
 Feature Space
Computation
 Sentiment
Capture
 Real-Life
Examples
 Future Work and
Conclusions
Business Motivation
5
 Business
Motivation
 Algorithm
 Feature Space
Computation
 Sentiment
Capture
 Real-Life
Examples
 Future Work and
Conclusions
Business Motivation
6
 Business
Motivation
 Algorithm
 Feature Space
Computation
 Sentiment
Capture
 Real-Life
Examples
 Future Work and
Conclusions
Business Motivation
“ I bought this for my daughter to
do her college work on, it's been
great, no problems so far. “
[SuperMom72]
“ Works like a charm, would
definitely recommend to anyone
on a budget. “
[Vamsy]
“ Fast CPU but slow disk drive
slows everything down. ”
[TalonBay]
“ I don't do gaming or downloading movies or music, so for those
folks I can't speak to the performance. But for surfing the web,
checking email, etc., this computer will save you time for watching
the little ball spin!”
[Anonymous]
7
 Business
Motivation
 Algorithm
 Feature Space
Computation
 Sentiment
Capture
 Real-Life
Examples
 Future Work and
Conclusions
Review Analysis: A Current Landscape
• Sentiment analysis
• Best known use case: Social Media Analysis/Tweets
Why tweets?  shorter, condensed, highly sentimental content
• Movie review analysis:
Kaggle: Analysis of the ‘Rotten Tomatoes’ Dataset
• Regarding product review analysis
• Little to no papers regarding product review analysis at commercial scale
• Shortage of work regarding combination of topic modeling and sentiment
analysis
8
Our research: Combine feature computation and sentiment analysis to
summarize reviewers’ opinions about a specific product
 Business
Motivation
 Algorithm
 Feature Space
Computation
 Sentiment
Capture
 Real-Life
Examples
 Future Work and
Conclusions
Algorithm Pipeline
9
Product 𝛼
Product 𝛽
Product 𝛾
Review
Review
Review
Review
Review
Review
Fc
Category C
Feature
Space
Computation
F 𝛼
F 𝛽
F 𝛾
Feature
Space
Reduction
 Business
Motivation
 Algorithm
 Feature Space
Computation
 Sentiment
Capture
 Real-Life
Examples
 Future Work and
Conclusions
Algorithm Pipeline
10
Product 𝛼
Product 𝛽
Product 𝛾
Review
Review
Review
Review
Review
Review
Category C
F 𝛼
F 𝛽
F 𝛾
Sentiment
Sentiment
Sentiment
Sentiment
Sentiment
Sentiment
Sentiment
Computation
For Each Review
… Sentiment
Computed For
Relevant Features
 Business
Motivation
 Algorithm
 Feature Space
Computation
 Sentiment
Capture
 Real-Life
Examples
 Future Work and
Conclusions
Algorithm Pipeline
11
Product 𝛼
Product 𝛽
Product 𝛾
Review
Review
Review
Review
Review
Review
Category C
Sentiment
Sentiment
Sentiment
Sentiment
Sentiment
Sentiment
𝜎𝑡, 𝛼, 𝑓
𝜎𝑡, 𝛽, 𝑓
𝜎𝑡, 𝛾, 𝑓
∀ 𝑡 ∈ 𝜏
∀ 𝑓 ∈ F 𝛼
∀ 𝑡 ∈ 𝜏
∀ 𝑓 ∈ Fβ
∀ 𝑡 ∈ 𝜏
∀ 𝑓 ∈ F 𝛾
 Business
Motivation
 Algorithm
 Feature Space
Computation
 Sentiment
Capture
 Real-Life
Examples
 Future Work and
Conclusions
Sentiment
Computation
For Each Review
… Sentiment
Computed For
Relevant Features
Feature Space Computation
• Textual reviews go through a careful process:
• TF/TF-Idf transform on documents
• Stop words removal, stemming, part-of-speech selection
• Spell-checking
• etc.
• ‘Synonym’ computation
• Can be done using Word Embedding (glove, word2vec)
• Can be done building synonym graph using dictionary/Wikipedia
• Is complex and tricky, context-sensitive, unsupervised
12
In short: Creating synonym sets is difficult, and challenging as an online algorithm
In short: Preprocessing crucial to extracting relevant features
 Business
Motivation
 Algorithm
 Feature Space
Computation
 Sentiment
Capture
 Real-Life
Examples
 Future Work and
Conclusions
[0-1]
Intensity of negativity
in sentence
{'neg': 0.0, 'neu': 0.58, 'pos': 0.42, 'compound': 0.4404}
[0-1]
Intensity
of neutrality
in sentence
[0-1]
Intensity
of positivity
in sentence
[-1,1]
Combination of positive and
negative sentiments.
Allows positive and negative to
‘compensate’ one another
Sentiment Capture with Vader
VADER: Valence Aware Dictionary and sEntiment
Reasoner
• Is a Python sub-module found of the nltk module
• Is a lexicon and rule-based sentiment analysis tool
• Is specifically attuned to sentiments expressed in social media
• Is fully open-sourced, developed and licensed by MIT
13
Sentiment is not boolean
posneg neu
Sentiment as
a PDF
 Business
Motivation
 Algorithm
 Feature Space
Computation
 Sentiment
Capture
 Real-Life
Examples
 Future Work and
Conclusions
Sentiment Capture with Vader
14
”This computer is good deal.” “This computer is a bad deal.”
pos 0.42 0.0
neu 0.58 0.533
neg 0.0 0.467
compound 0.4404 -0.5423
”This computer is
not powerful.”
“This computer is
not that powerful.”
“This computer is not
powerful, but I like it
anyways.”
“This computer is not
that powerful, but I
like it anyways.”
pos 0.0 0.0 0.0 0.252
neu 0.632 0.682 0.618 0.619
neg 0.368 0.318 0.382 0.129
compound -0.3252 -0.3252 -0.5157 0.3786
 Vader is sensitive to adverbs, punctuation, case, emoticons and nuances…
 Business
Motivation
 Algorithm
 Feature Space
Computation
 Sentiment
Capture
 Real-Life
Examples
 Future Work and
Conclusions
Sentiment Capture with Vader
15
Product A
“The design and picture quality are amazing!”
I love it! Just perfect for people on a budget. And it is
beautifully designed!!
“Pretty good, but I am not a fan of the design.”
“I don’t think it’s possible to find better for the price”
design
Product B
“I just HATE the design!!”
“Okay computer. Wish I read the other reviews first.”
design
picture
quality
picture
quality
+ 0.39 + 0.39
+ 0.56
~ 0.48
- 0.62
 Business
Motivation
 Algorithm
 Feature Space
Computation
 Sentiment
Capture
 Real-Life
Examples
 Future Work and
Conclusions
Sentiment Capture with Vader
16
Product A
design
Product B
design
picture
quality
picture
quality
design
picture quality
battery
value
design
battery
CPU
processor
0.48
0.39
0.46 0.62
0.49 NA
NA NA
0.75
0.87NANA
0.43 NA NA
3
1
4
2
1
0.39
0.60
NA
NA
1
1
1
NA
NA
NA
NA
0.62
✍
Scraping Summarizing Sentiment Intensity
+ 0.39
+ 0.56
~ 0.49
- 0.62
+ 0.39 Business
Motivation
 Algorithm
 Feature Space
Computation
 Sentiment
Capture
 Real-Life
Examples
 Future Work and
Conclusions
Results Discussion: Real Life Example
17
Product BProduct A
 Business
Motivation
 Algorithm
 Feature Space
Computation
 Sentiment
Capture
 Real-Life
Examples
 Future Work and
Conclusions
Results Discussion: Real Life Example
18
screen
design
quality
performance
NEG NEU POSscreen
design
performance
quality
screen
design
quality
performance
Some dissatisfaction with overall quality
Reviewers are rather
happy with keyboard
Weight is better for product
A than for product B
Customers satisfied with keyboard,
display, screen, design, …Product B’s weakness is battery life The product’s features
are well documented
Product A
Product B
 Business
Motivation
 Algorithm
 Feature Space
Computation
 Sentiment
Capture
 Real-Life
Examples
 Future Work and
Conclusions
Results Discussion: Real Life Example
19
This laptop exceeds my expectations. It's fast,
it's powerful, it's compact and great to travel
with.
“The screen is amazing and the keyboard
too. the weight is so light, it's become my
portable.”
It’s durable, good keyboard, decent screen,
and a good battery life.
Plasticky build quality but holds up with my
rough and tough handling. Is is surprisingly
light. Keyboard is the best but it tales a bit of
getting used to[…]
Very light to carry and the carbon color gives
an elegant finishing touch. Business
Motivation
 Algorithm
 Feature Space
Computation
 Sentiment
Capture
 Real-Life
Examples
 Future Work and
Conclusions
Product A
11 reviews, ~ 53 words per review
Results Discussion: Real Life Example
The design is what caught my eye. Everything
about this laptop is okay, except the battery
life.
Overall a great laptop with good display and
build quality, solid performance and sleek
design the only major concern is battery life.
The touch screen is absolutely first rate […],
and the back-lit keyboard has just the right
feel.
This is the best computer I've ever owned.
[…]. I love the backlit keyboard, the easily
adjustable resolution and the long battery life.
Pros: great screen, keyboard feels nice, best
touchpad, very fast, extremely light, built
durable
Cons: battery life is less than competitors
[…].
It's really light weight yet really durable. I love
the keyboard and mouse pad.
28
 Business
Motivation
 Algorithm
 Feature Space
Computation
 Sentiment
Capture
 Real-Life
Examples
 Future Work and
Conclusions
Product B
40 reviews, ~ 83 words per review
Conclusion and Future Work
Work in Progress
• Synonym computation: work in progress
• Observed bias in sentiment, needs particular attention
• Alternative when no/little reviews exist?
Potential future applications
• Offer a snapshot of product reviews to customers
• Assist customers in finding similar items with enhanced feature(s)
• Process seller satisfaction information/rating
• Customer email processing, determine subject of request automatically
21
 Business
Motivation
 Algorithm
 Feature Space
Computation
 Sentiment
Capture
 Real-Life
Examples
 Future Work and
Conclusions
References
[1] Gensim
https://radimrehurek.com/gensim/models/word2vec.html
[2] GloVe
http://nlp.stanford.edu/projects/glove/
[3] Wordnet
https://wordnet.princeton.edu/
[4] nltk.stem
http://www.nltk.org/api/nltk.stem.html
[5] nltk.vader
Paper: VADER: A Parsimonious Rule-based Model for Sentiment
Analysis of Social Media Text, C.J. Hutto, Eric Gilbert
Code: http://www.nltk.org/_modules/nltk/sentiment/vader.html
22
 Business
Motivation
 Algorithm
 Feature Space
Computation
 Sentiment
Capture
 Real-Life
Examples
 Future Work and
Conclusions
Questions?
23
Back-Up Slides
Sentiment Capture with Vader
25
>>> sentence1 = ”This computer is a good deal."
{'neg':0.0,'neu':0.58,'pos':0.42,'compound':0.4404}
>>> sentence2 = “This computer is a very good deal.”
{'neg':0.0,'neu':0.61,'pos':0.39,'compound':0.4927}
>>> sentence3 = “This computer is a very good deal!!”
{'neg':0.0,'neu':0.57,'pos':0.43,'compound':0.5827}
>>> sentence4 = “This computer is a very good deal!! :-)”
{'neg':0.0,'neu':0.441,'pos':0.559,'compound':0.7462}
>>> sentence5 = “This computer is a VERY good deal!! :-)”
{'neg':0.0,'neu':0.393,'pos':0.607,'compound':0.8287}
>>> sentence6 = “This computer is a very bad deal!! :-(”
{'neg':0.588,'neu':0.412,'pos':0.0,'compound':-0.7987}
Adverb
addition
Punctuation
addition
Emoticon addition
Case
enhancement
Inverse polarity
 Business
Motivation
 Algorithm
 Feature Space
Computation
 Sentiment
Capture
 Real-Life
Examples
 Future Work and
Conclusions
Ratings vs Sentiment Analysis
26
Ratings (number of stars)
Average sentiment from
text review
negativeneutralpositive
 Business
Motivation
 Algorithm
 Feature Space
Computation
 Sentiment
Capture
 Real-Life
Examples
 Future Work and
Conclusions
Ratings vs Sentiment Analysis
27
Good reviews
Bad reviews
user bias =
𝑛 𝑝𝑜𝑠 − 𝑛(𝑛𝑒𝑔)
𝑛 𝑝𝑜𝑠 + 𝑛(𝑛𝑒𝑔)
where:
pos = number of prior good reviews
neg = number of prior bad reviews
 Business
Motivation
 Algorithm
 Feature Space
Computation
 Sentiment
Capture
 Real-Life
Examples
 Future Work and
Conclusions
Review Bias
28
• Where is subjectivity coming from?
• Language bias / gender bias / etc.
• Vader package biases due to development specificity?
(remember: originally developed for social media)
• Incentivized customers/reviewers
• Why is it important to correct for it?
• Filtering/sorting with ratings doesn’t work as well as expected
• Possible options
• Filter reviews with large bias
• Weight results
• Re-center the output of Vader to fit our definition of ’neutrality’
In short: Biases in both ratings and textual sentiment, both need attention
 Business
Motivation
 Algorithm
 Feature Space
Computation
 Sentiment
Capture
 Real-Life
Examples
 Future Work and
Conclusions

More Related Content

Viewers also liked

Josh Patterson, Advisor, Skymind – Deep learning for Industry at MLconf ATL 2016
Josh Patterson, Advisor, Skymind – Deep learning for Industry at MLconf ATL 2016Josh Patterson, Advisor, Skymind – Deep learning for Industry at MLconf ATL 2016
Josh Patterson, Advisor, Skymind – Deep learning for Industry at MLconf ATL 2016MLconf
 
Brian Lucena, Senior Data Scientist, Metis at MLconf SF 2016
Brian Lucena, Senior Data Scientist, Metis at MLconf SF 2016Brian Lucena, Senior Data Scientist, Metis at MLconf SF 2016
Brian Lucena, Senior Data Scientist, Metis at MLconf SF 2016MLconf
 
Ben Lau, Quantitative Researcher, Hobbyist, at MLconf NYC 2017
Ben Lau, Quantitative Researcher, Hobbyist, at MLconf NYC 2017Ben Lau, Quantitative Researcher, Hobbyist, at MLconf NYC 2017
Ben Lau, Quantitative Researcher, Hobbyist, at MLconf NYC 2017MLconf
 
Stephanie deWet, Software Engineer, Pinterest at MLconf SF 2016
Stephanie deWet, Software Engineer, Pinterest at MLconf SF 2016Stephanie deWet, Software Engineer, Pinterest at MLconf SF 2016
Stephanie deWet, Software Engineer, Pinterest at MLconf SF 2016MLconf
 
Scott Clark, Co-Founder and CEO, SigOpt at MLconf SF 2016
Scott Clark, Co-Founder and CEO, SigOpt at MLconf SF 2016Scott Clark, Co-Founder and CEO, SigOpt at MLconf SF 2016
Scott Clark, Co-Founder and CEO, SigOpt at MLconf SF 2016MLconf
 
Mayur Thakur, Managing Director, Goldman Sachs, at MLconf NYC 2017
Mayur Thakur, Managing Director, Goldman Sachs, at MLconf NYC 2017Mayur Thakur, Managing Director, Goldman Sachs, at MLconf NYC 2017
Mayur Thakur, Managing Director, Goldman Sachs, at MLconf NYC 2017MLconf
 
Ashirth Barthur, Security Scientist, H2O, at MLconf Seattle 2017
Ashirth Barthur, Security Scientist, H2O, at MLconf Seattle 2017Ashirth Barthur, Security Scientist, H2O, at MLconf Seattle 2017
Ashirth Barthur, Security Scientist, H2O, at MLconf Seattle 2017MLconf
 
Sanjeev Satheesj, Research Scientist, Baidu at The AI Conference 2017
Sanjeev Satheesj, Research Scientist, Baidu at The AI Conference 2017Sanjeev Satheesj, Research Scientist, Baidu at The AI Conference 2017
Sanjeev Satheesj, Research Scientist, Baidu at The AI Conference 2017MLconf
 
Scott Clark, CEO, SigOpt, at The AI Conference 2017
Scott Clark, CEO, SigOpt, at The AI Conference 2017Scott Clark, CEO, SigOpt, at The AI Conference 2017
Scott Clark, CEO, SigOpt, at The AI Conference 2017MLconf
 
Alexandra Johnson, Software Engineer, SigOpt, at MLconf NYC 2017
Alexandra Johnson, Software Engineer, SigOpt, at MLconf NYC 2017Alexandra Johnson, Software Engineer, SigOpt, at MLconf NYC 2017
Alexandra Johnson, Software Engineer, SigOpt, at MLconf NYC 2017MLconf
 
Layla El Asri, Research Scientist, Maluuba
Layla El Asri, Research Scientist, Maluuba Layla El Asri, Research Scientist, Maluuba
Layla El Asri, Research Scientist, Maluuba MLconf
 
Aaron Roth, Associate Professor, University of Pennsylvania, at MLconf NYC 2017
Aaron Roth, Associate Professor, University of Pennsylvania, at MLconf NYC 2017Aaron Roth, Associate Professor, University of Pennsylvania, at MLconf NYC 2017
Aaron Roth, Associate Professor, University of Pennsylvania, at MLconf NYC 2017MLconf
 
Daniel Shank, Data Scientist, Talla at MLconf SF 2016
Daniel Shank, Data Scientist, Talla at MLconf SF 2016Daniel Shank, Data Scientist, Talla at MLconf SF 2016
Daniel Shank, Data Scientist, Talla at MLconf SF 2016MLconf
 
Caroline Sinders, Online Harassment Researcher, Wikimedia at The AI Conferenc...
Caroline Sinders, Online Harassment Researcher, Wikimedia at The AI Conferenc...Caroline Sinders, Online Harassment Researcher, Wikimedia at The AI Conferenc...
Caroline Sinders, Online Harassment Researcher, Wikimedia at The AI Conferenc...MLconf
 
Jeff Bradshaw, Founder, Adaptris
Jeff Bradshaw, Founder, AdaptrisJeff Bradshaw, Founder, Adaptris
Jeff Bradshaw, Founder, AdaptrisMLconf
 
Jean-François Puget, Distinguished Engineer, Machine Learning and Optimizatio...
Jean-François Puget, Distinguished Engineer, Machine Learning and Optimizatio...Jean-François Puget, Distinguished Engineer, Machine Learning and Optimizatio...
Jean-François Puget, Distinguished Engineer, Machine Learning and Optimizatio...MLconf
 
Andrew Musselman, Committer and PMC Member, Apache Mahout, at MLconf Seattle ...
Andrew Musselman, Committer and PMC Member, Apache Mahout, at MLconf Seattle ...Andrew Musselman, Committer and PMC Member, Apache Mahout, at MLconf Seattle ...
Andrew Musselman, Committer and PMC Member, Apache Mahout, at MLconf Seattle ...MLconf
 
Hanie Sedghi, Research Scientist at Allen Institute for Artificial Intelligen...
Hanie Sedghi, Research Scientist at Allen Institute for Artificial Intelligen...Hanie Sedghi, Research Scientist at Allen Institute for Artificial Intelligen...
Hanie Sedghi, Research Scientist at Allen Institute for Artificial Intelligen...MLconf
 
Scott Clark, CEO, SigOpt, at MLconf Seattle 2017
Scott Clark, CEO, SigOpt, at MLconf Seattle 2017Scott Clark, CEO, SigOpt, at MLconf Seattle 2017
Scott Clark, CEO, SigOpt, at MLconf Seattle 2017MLconf
 

Viewers also liked (19)

Josh Patterson, Advisor, Skymind – Deep learning for Industry at MLconf ATL 2016
Josh Patterson, Advisor, Skymind – Deep learning for Industry at MLconf ATL 2016Josh Patterson, Advisor, Skymind – Deep learning for Industry at MLconf ATL 2016
Josh Patterson, Advisor, Skymind – Deep learning for Industry at MLconf ATL 2016
 
Brian Lucena, Senior Data Scientist, Metis at MLconf SF 2016
Brian Lucena, Senior Data Scientist, Metis at MLconf SF 2016Brian Lucena, Senior Data Scientist, Metis at MLconf SF 2016
Brian Lucena, Senior Data Scientist, Metis at MLconf SF 2016
 
Ben Lau, Quantitative Researcher, Hobbyist, at MLconf NYC 2017
Ben Lau, Quantitative Researcher, Hobbyist, at MLconf NYC 2017Ben Lau, Quantitative Researcher, Hobbyist, at MLconf NYC 2017
Ben Lau, Quantitative Researcher, Hobbyist, at MLconf NYC 2017
 
Stephanie deWet, Software Engineer, Pinterest at MLconf SF 2016
Stephanie deWet, Software Engineer, Pinterest at MLconf SF 2016Stephanie deWet, Software Engineer, Pinterest at MLconf SF 2016
Stephanie deWet, Software Engineer, Pinterest at MLconf SF 2016
 
Scott Clark, Co-Founder and CEO, SigOpt at MLconf SF 2016
Scott Clark, Co-Founder and CEO, SigOpt at MLconf SF 2016Scott Clark, Co-Founder and CEO, SigOpt at MLconf SF 2016
Scott Clark, Co-Founder and CEO, SigOpt at MLconf SF 2016
 
Mayur Thakur, Managing Director, Goldman Sachs, at MLconf NYC 2017
Mayur Thakur, Managing Director, Goldman Sachs, at MLconf NYC 2017Mayur Thakur, Managing Director, Goldman Sachs, at MLconf NYC 2017
Mayur Thakur, Managing Director, Goldman Sachs, at MLconf NYC 2017
 
Ashirth Barthur, Security Scientist, H2O, at MLconf Seattle 2017
Ashirth Barthur, Security Scientist, H2O, at MLconf Seattle 2017Ashirth Barthur, Security Scientist, H2O, at MLconf Seattle 2017
Ashirth Barthur, Security Scientist, H2O, at MLconf Seattle 2017
 
Sanjeev Satheesj, Research Scientist, Baidu at The AI Conference 2017
Sanjeev Satheesj, Research Scientist, Baidu at The AI Conference 2017Sanjeev Satheesj, Research Scientist, Baidu at The AI Conference 2017
Sanjeev Satheesj, Research Scientist, Baidu at The AI Conference 2017
 
Scott Clark, CEO, SigOpt, at The AI Conference 2017
Scott Clark, CEO, SigOpt, at The AI Conference 2017Scott Clark, CEO, SigOpt, at The AI Conference 2017
Scott Clark, CEO, SigOpt, at The AI Conference 2017
 
Alexandra Johnson, Software Engineer, SigOpt, at MLconf NYC 2017
Alexandra Johnson, Software Engineer, SigOpt, at MLconf NYC 2017Alexandra Johnson, Software Engineer, SigOpt, at MLconf NYC 2017
Alexandra Johnson, Software Engineer, SigOpt, at MLconf NYC 2017
 
Layla El Asri, Research Scientist, Maluuba
Layla El Asri, Research Scientist, Maluuba Layla El Asri, Research Scientist, Maluuba
Layla El Asri, Research Scientist, Maluuba
 
Aaron Roth, Associate Professor, University of Pennsylvania, at MLconf NYC 2017
Aaron Roth, Associate Professor, University of Pennsylvania, at MLconf NYC 2017Aaron Roth, Associate Professor, University of Pennsylvania, at MLconf NYC 2017
Aaron Roth, Associate Professor, University of Pennsylvania, at MLconf NYC 2017
 
Daniel Shank, Data Scientist, Talla at MLconf SF 2016
Daniel Shank, Data Scientist, Talla at MLconf SF 2016Daniel Shank, Data Scientist, Talla at MLconf SF 2016
Daniel Shank, Data Scientist, Talla at MLconf SF 2016
 
Caroline Sinders, Online Harassment Researcher, Wikimedia at The AI Conferenc...
Caroline Sinders, Online Harassment Researcher, Wikimedia at The AI Conferenc...Caroline Sinders, Online Harassment Researcher, Wikimedia at The AI Conferenc...
Caroline Sinders, Online Harassment Researcher, Wikimedia at The AI Conferenc...
 
Jeff Bradshaw, Founder, Adaptris
Jeff Bradshaw, Founder, AdaptrisJeff Bradshaw, Founder, Adaptris
Jeff Bradshaw, Founder, Adaptris
 
Jean-François Puget, Distinguished Engineer, Machine Learning and Optimizatio...
Jean-François Puget, Distinguished Engineer, Machine Learning and Optimizatio...Jean-François Puget, Distinguished Engineer, Machine Learning and Optimizatio...
Jean-François Puget, Distinguished Engineer, Machine Learning and Optimizatio...
 
Andrew Musselman, Committer and PMC Member, Apache Mahout, at MLconf Seattle ...
Andrew Musselman, Committer and PMC Member, Apache Mahout, at MLconf Seattle ...Andrew Musselman, Committer and PMC Member, Apache Mahout, at MLconf Seattle ...
Andrew Musselman, Committer and PMC Member, Apache Mahout, at MLconf Seattle ...
 
Hanie Sedghi, Research Scientist at Allen Institute for Artificial Intelligen...
Hanie Sedghi, Research Scientist at Allen Institute for Artificial Intelligen...Hanie Sedghi, Research Scientist at Allen Institute for Artificial Intelligen...
Hanie Sedghi, Research Scientist at Allen Institute for Artificial Intelligen...
 
Scott Clark, CEO, SigOpt, at MLconf Seattle 2017
Scott Clark, CEO, SigOpt, at MLconf Seattle 2017Scott Clark, CEO, SigOpt, at MLconf Seattle 2017
Scott Clark, CEO, SigOpt, at MLconf Seattle 2017
 

More from MLconf

Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...MLconf
 
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingTed Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingMLconf
 
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...MLconf
 
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushIgor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushMLconf
 
Josh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceJosh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceMLconf
 
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...MLconf
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...MLconf
 
Meghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMeghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMLconf
 
Noam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionNoam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionMLconf
 
June Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLJune Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLMLconf
 
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksSneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksMLconf
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...MLconf
 
Vito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldVito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldMLconf
 
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...MLconf
 
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...MLconf
 
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...MLconf
 
Neel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeNeel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeMLconf
 
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...MLconf
 
Soumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareSoumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareMLconf
 
Roy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesRoy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesMLconf
 

More from MLconf (20)

Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
 
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingTed Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
 
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
 
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushIgor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
 
Josh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceJosh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious Experience
 
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
 
Meghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMeghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the Cheap
 
Noam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionNoam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data Collection
 
June Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLJune Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of ML
 
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksSneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
 
Vito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldVito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI World
 
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
 
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
 
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
 
Neel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeNeel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to code
 
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
 
Soumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareSoumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better Software
 
Roy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesRoy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime Changes
 

Recently uploaded

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKJago de Vreede
 

Recently uploaded (20)

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 

Jennifer Prendki, Principal Data Scientist, @WalmartLabs at MLconf SF 2016

  • 1. Review Analysis: An Approach to Leveraging User-Generated Content in the Context of Retail Jennifer Prendki, Principal Data Scientist Walmart Global e-Commerce California, USA The Machine Learning Conference, San Francisco, CA 11/11/2016
  • 2. Outline • Business motivation • Algorithm Pipeline • Feature Space Computation • Sentiment Capture • Real-Life Examples and Results • Future Work and Conclusions 2  Business Motivation  Algorithm  Feature Space Computation  Sentiment Capture  Real-Life Examples  Future Work and Conclusions
  • 3. Business Motivation 3  Business Motivation  Algorithm  Feature Space Computation  Sentiment Capture  Real-Life Examples  Future Work and Conclusions
  • 4. Business Motivation 4  Business Motivation  Algorithm  Feature Space Computation  Sentiment Capture  Real-Life Examples  Future Work and Conclusions
  • 5. Business Motivation 5  Business Motivation  Algorithm  Feature Space Computation  Sentiment Capture  Real-Life Examples  Future Work and Conclusions
  • 6. Business Motivation 6  Business Motivation  Algorithm  Feature Space Computation  Sentiment Capture  Real-Life Examples  Future Work and Conclusions
  • 7. Business Motivation “ I bought this for my daughter to do her college work on, it's been great, no problems so far. “ [SuperMom72] “ Works like a charm, would definitely recommend to anyone on a budget. “ [Vamsy] “ Fast CPU but slow disk drive slows everything down. ” [TalonBay] “ I don't do gaming or downloading movies or music, so for those folks I can't speak to the performance. But for surfing the web, checking email, etc., this computer will save you time for watching the little ball spin!” [Anonymous] 7  Business Motivation  Algorithm  Feature Space Computation  Sentiment Capture  Real-Life Examples  Future Work and Conclusions
  • 8. Review Analysis: A Current Landscape • Sentiment analysis • Best known use case: Social Media Analysis/Tweets Why tweets?  shorter, condensed, highly sentimental content • Movie review analysis: Kaggle: Analysis of the ‘Rotten Tomatoes’ Dataset • Regarding product review analysis • Little to no papers regarding product review analysis at commercial scale • Shortage of work regarding combination of topic modeling and sentiment analysis 8 Our research: Combine feature computation and sentiment analysis to summarize reviewers’ opinions about a specific product  Business Motivation  Algorithm  Feature Space Computation  Sentiment Capture  Real-Life Examples  Future Work and Conclusions
  • 9. Algorithm Pipeline 9 Product 𝛼 Product 𝛽 Product 𝛾 Review Review Review Review Review Review Fc Category C Feature Space Computation F 𝛼 F 𝛽 F 𝛾 Feature Space Reduction  Business Motivation  Algorithm  Feature Space Computation  Sentiment Capture  Real-Life Examples  Future Work and Conclusions
  • 10. Algorithm Pipeline 10 Product 𝛼 Product 𝛽 Product 𝛾 Review Review Review Review Review Review Category C F 𝛼 F 𝛽 F 𝛾 Sentiment Sentiment Sentiment Sentiment Sentiment Sentiment Sentiment Computation For Each Review … Sentiment Computed For Relevant Features  Business Motivation  Algorithm  Feature Space Computation  Sentiment Capture  Real-Life Examples  Future Work and Conclusions
  • 11. Algorithm Pipeline 11 Product 𝛼 Product 𝛽 Product 𝛾 Review Review Review Review Review Review Category C Sentiment Sentiment Sentiment Sentiment Sentiment Sentiment 𝜎𝑡, 𝛼, 𝑓 𝜎𝑡, 𝛽, 𝑓 𝜎𝑡, 𝛾, 𝑓 ∀ 𝑡 ∈ 𝜏 ∀ 𝑓 ∈ F 𝛼 ∀ 𝑡 ∈ 𝜏 ∀ 𝑓 ∈ Fβ ∀ 𝑡 ∈ 𝜏 ∀ 𝑓 ∈ F 𝛾  Business Motivation  Algorithm  Feature Space Computation  Sentiment Capture  Real-Life Examples  Future Work and Conclusions Sentiment Computation For Each Review … Sentiment Computed For Relevant Features
  • 12. Feature Space Computation • Textual reviews go through a careful process: • TF/TF-Idf transform on documents • Stop words removal, stemming, part-of-speech selection • Spell-checking • etc. • ‘Synonym’ computation • Can be done using Word Embedding (glove, word2vec) • Can be done building synonym graph using dictionary/Wikipedia • Is complex and tricky, context-sensitive, unsupervised 12 In short: Creating synonym sets is difficult, and challenging as an online algorithm In short: Preprocessing crucial to extracting relevant features  Business Motivation  Algorithm  Feature Space Computation  Sentiment Capture  Real-Life Examples  Future Work and Conclusions
  • 13. [0-1] Intensity of negativity in sentence {'neg': 0.0, 'neu': 0.58, 'pos': 0.42, 'compound': 0.4404} [0-1] Intensity of neutrality in sentence [0-1] Intensity of positivity in sentence [-1,1] Combination of positive and negative sentiments. Allows positive and negative to ‘compensate’ one another Sentiment Capture with Vader VADER: Valence Aware Dictionary and sEntiment Reasoner • Is a Python sub-module found of the nltk module • Is a lexicon and rule-based sentiment analysis tool • Is specifically attuned to sentiments expressed in social media • Is fully open-sourced, developed and licensed by MIT 13 Sentiment is not boolean posneg neu Sentiment as a PDF  Business Motivation  Algorithm  Feature Space Computation  Sentiment Capture  Real-Life Examples  Future Work and Conclusions
  • 14. Sentiment Capture with Vader 14 ”This computer is good deal.” “This computer is a bad deal.” pos 0.42 0.0 neu 0.58 0.533 neg 0.0 0.467 compound 0.4404 -0.5423 ”This computer is not powerful.” “This computer is not that powerful.” “This computer is not powerful, but I like it anyways.” “This computer is not that powerful, but I like it anyways.” pos 0.0 0.0 0.0 0.252 neu 0.632 0.682 0.618 0.619 neg 0.368 0.318 0.382 0.129 compound -0.3252 -0.3252 -0.5157 0.3786  Vader is sensitive to adverbs, punctuation, case, emoticons and nuances…  Business Motivation  Algorithm  Feature Space Computation  Sentiment Capture  Real-Life Examples  Future Work and Conclusions
  • 15. Sentiment Capture with Vader 15 Product A “The design and picture quality are amazing!” I love it! Just perfect for people on a budget. And it is beautifully designed!! “Pretty good, but I am not a fan of the design.” “I don’t think it’s possible to find better for the price” design Product B “I just HATE the design!!” “Okay computer. Wish I read the other reviews first.” design picture quality picture quality + 0.39 + 0.39 + 0.56 ~ 0.48 - 0.62  Business Motivation  Algorithm  Feature Space Computation  Sentiment Capture  Real-Life Examples  Future Work and Conclusions
  • 16. Sentiment Capture with Vader 16 Product A design Product B design picture quality picture quality design picture quality battery value design battery CPU processor 0.48 0.39 0.46 0.62 0.49 NA NA NA 0.75 0.87NANA 0.43 NA NA 3 1 4 2 1 0.39 0.60 NA NA 1 1 1 NA NA NA NA 0.62 ✍ Scraping Summarizing Sentiment Intensity + 0.39 + 0.56 ~ 0.49 - 0.62 + 0.39 Business Motivation  Algorithm  Feature Space Computation  Sentiment Capture  Real-Life Examples  Future Work and Conclusions
  • 17. Results Discussion: Real Life Example 17 Product BProduct A  Business Motivation  Algorithm  Feature Space Computation  Sentiment Capture  Real-Life Examples  Future Work and Conclusions
  • 18. Results Discussion: Real Life Example 18 screen design quality performance NEG NEU POSscreen design performance quality screen design quality performance Some dissatisfaction with overall quality Reviewers are rather happy with keyboard Weight is better for product A than for product B Customers satisfied with keyboard, display, screen, design, …Product B’s weakness is battery life The product’s features are well documented Product A Product B  Business Motivation  Algorithm  Feature Space Computation  Sentiment Capture  Real-Life Examples  Future Work and Conclusions
  • 19. Results Discussion: Real Life Example 19 This laptop exceeds my expectations. It's fast, it's powerful, it's compact and great to travel with. “The screen is amazing and the keyboard too. the weight is so light, it's become my portable.” It’s durable, good keyboard, decent screen, and a good battery life. Plasticky build quality but holds up with my rough and tough handling. Is is surprisingly light. Keyboard is the best but it tales a bit of getting used to[…] Very light to carry and the carbon color gives an elegant finishing touch. Business Motivation  Algorithm  Feature Space Computation  Sentiment Capture  Real-Life Examples  Future Work and Conclusions Product A 11 reviews, ~ 53 words per review
  • 20. Results Discussion: Real Life Example The design is what caught my eye. Everything about this laptop is okay, except the battery life. Overall a great laptop with good display and build quality, solid performance and sleek design the only major concern is battery life. The touch screen is absolutely first rate […], and the back-lit keyboard has just the right feel. This is the best computer I've ever owned. […]. I love the backlit keyboard, the easily adjustable resolution and the long battery life. Pros: great screen, keyboard feels nice, best touchpad, very fast, extremely light, built durable Cons: battery life is less than competitors […]. It's really light weight yet really durable. I love the keyboard and mouse pad. 28  Business Motivation  Algorithm  Feature Space Computation  Sentiment Capture  Real-Life Examples  Future Work and Conclusions Product B 40 reviews, ~ 83 words per review
  • 21. Conclusion and Future Work Work in Progress • Synonym computation: work in progress • Observed bias in sentiment, needs particular attention • Alternative when no/little reviews exist? Potential future applications • Offer a snapshot of product reviews to customers • Assist customers in finding similar items with enhanced feature(s) • Process seller satisfaction information/rating • Customer email processing, determine subject of request automatically 21  Business Motivation  Algorithm  Feature Space Computation  Sentiment Capture  Real-Life Examples  Future Work and Conclusions
  • 22. References [1] Gensim https://radimrehurek.com/gensim/models/word2vec.html [2] GloVe http://nlp.stanford.edu/projects/glove/ [3] Wordnet https://wordnet.princeton.edu/ [4] nltk.stem http://www.nltk.org/api/nltk.stem.html [5] nltk.vader Paper: VADER: A Parsimonious Rule-based Model for Sentiment Analysis of Social Media Text, C.J. Hutto, Eric Gilbert Code: http://www.nltk.org/_modules/nltk/sentiment/vader.html 22  Business Motivation  Algorithm  Feature Space Computation  Sentiment Capture  Real-Life Examples  Future Work and Conclusions
  • 25. Sentiment Capture with Vader 25 >>> sentence1 = ”This computer is a good deal." {'neg':0.0,'neu':0.58,'pos':0.42,'compound':0.4404} >>> sentence2 = “This computer is a very good deal.” {'neg':0.0,'neu':0.61,'pos':0.39,'compound':0.4927} >>> sentence3 = “This computer is a very good deal!!” {'neg':0.0,'neu':0.57,'pos':0.43,'compound':0.5827} >>> sentence4 = “This computer is a very good deal!! :-)” {'neg':0.0,'neu':0.441,'pos':0.559,'compound':0.7462} >>> sentence5 = “This computer is a VERY good deal!! :-)” {'neg':0.0,'neu':0.393,'pos':0.607,'compound':0.8287} >>> sentence6 = “This computer is a very bad deal!! :-(” {'neg':0.588,'neu':0.412,'pos':0.0,'compound':-0.7987} Adverb addition Punctuation addition Emoticon addition Case enhancement Inverse polarity  Business Motivation  Algorithm  Feature Space Computation  Sentiment Capture  Real-Life Examples  Future Work and Conclusions
  • 26. Ratings vs Sentiment Analysis 26 Ratings (number of stars) Average sentiment from text review negativeneutralpositive  Business Motivation  Algorithm  Feature Space Computation  Sentiment Capture  Real-Life Examples  Future Work and Conclusions
  • 27. Ratings vs Sentiment Analysis 27 Good reviews Bad reviews user bias = 𝑛 𝑝𝑜𝑠 − 𝑛(𝑛𝑒𝑔) 𝑛 𝑝𝑜𝑠 + 𝑛(𝑛𝑒𝑔) where: pos = number of prior good reviews neg = number of prior bad reviews  Business Motivation  Algorithm  Feature Space Computation  Sentiment Capture  Real-Life Examples  Future Work and Conclusions
  • 28. Review Bias 28 • Where is subjectivity coming from? • Language bias / gender bias / etc. • Vader package biases due to development specificity? (remember: originally developed for social media) • Incentivized customers/reviewers • Why is it important to correct for it? • Filtering/sorting with ratings doesn’t work as well as expected • Possible options • Filter reviews with large bias • Weight results • Re-center the output of Vader to fit our definition of ’neutrality’ In short: Biases in both ratings and textual sentiment, both need attention  Business Motivation  Algorithm  Feature Space Computation  Sentiment Capture  Real-Life Examples  Future Work and Conclusions