SlideShare a Scribd company logo
1 of 71
Download to read offline
The Science and the Magic
       of User Feedback
  for Recommender Systems




      Xavier Amatriain




                         Bay Area, March '11
But first...




About Telefonica and Telefonica R&D
Telefonica is a fast-growing Telecom


                            1989                   2000                    2008
  Clients                        About 12          About 68          About 260
                                  million           million           million
                                subscribers       customers          customers
 Services                        Basic        Wireline and mobile    Integrated ICT
                            telephone and       voice, data and     solutions for all
                             data services     Internet services       customers
Geographies
                                                 Operations in      Operations in
                              Spain                                 25 countries
                                                 16 countries

   Staff
                     About 71,000                About 149,000         About 257,000
                     professionals                professionals         professionals

 Finances                   Rev: 4,273 M€       Rev: 28,485 M€      Rev: 57,946 M€
                            EPS(1): 0.45 €      EPS(1): 0.67 €        EPS: 1.63 €
            (1) EPS: Earnings per share
Currently among the largest in the world
       Telco sector worldwide ranking by market cap (US$ bn)




                                                         Source: Bloomberg, 06/12/09




       Just announced 2010 results: record net earnings, 
       first Spanish company ever to make > 10B €
Leader in South America

Data as of March ‘09



 1      2 Argentina: 20.9 million                                                            Wireline market rank 
 2      1 Brazil: 61.4 million                                                               Mobile market rank
        2 Central America: 6.1 million
 1      2 Colombia: 12.6 million
 1      1 Chile: 10.1 million
        2 Ecuador: 3.3 million
        2 Mexico: 15.7 million
 1      1 Peru: 15.2 million
        1
          Uruguay: 1.5 million
        2 Venezuela: 12.0 million
                                                                      Total Accesses (as of March ‘09)
                                                                                       159.5 million

Notes:
- Central America includes Guatemala, Panama, El Salvador and Nicaragua
- Total accesses figure includes Narrowband Internet accesses of Terra Brasil and Terra Colombia, and Broadband
Internet accesses of Terra Brasil, Telefónica de Argentina, Terra Guatemala and Terra México.
And a significant footprint in Europe

                                      Wireline market rank
                                      Mobile market rank
Data as of March ‘09



                             1 1   Spain: 47.2 million
                                 1 UK: 20.8 million
                                 4 Germany: 16.0 million
                                 2
                                   Ireland: 1.7 million
                                   Czech Republic: 7.7 million
                             1   2
                                   Slovakia: 0.4 million
                                 3


                                     Total Accesses (as of March ’09)
                                              93.8 million
Scientific Research
                  Mobile and Ubicomp
Multimedia Core                                    User Modelling &
                                                     Data Mining



                                            HCIR

                                                   DATA MINING



                                                        Wireless Systems
Content Distribution & P2P
                                Social Networks
Projects
      Recommendation 
      Algorithms
                             Tourist routes

                        Social 
                       contacts     Music


User Analysis                 Movies
& Modeling                                                 Contextual
                           The Wisdom of the 
          Noise in users’        Few
                                            Mobile
         ratings 
                            Tourist 
                           behavior        Microprofiles
           Implicit user 
          feedback
                                       Multiverse 
                                      Tensor 
            IPTV viewing habits
                                      Factorization
Projects
      Recommendation 
      Algorithms
                             Tourist routes

                        Social 
                       contacts     Music


User Analysis                 Movies
& Modeling                                                 Contextual
                           The Wisdom of the 
          Noise in users’        Few
                                            Mobile
         ratings 
                            Tourist 
                           behavior        Microprofiles
           Implicit user 
          feedback
                                       Multiverse 
                                      Tensor 
            IPTV viewing habits
                                      Factorization
And about the world we live in...
Information Overload
More is Less
                         W
                          or
                            se
                                 D
                                  ec
                                    is
         ns
                                      io
       io

                                        ns
     is
   ec
  D
   s
 es
L
Analysis Paralysis is making
         headlines
Search engines don’t always hold the answer
What about discovery?
What about curiosity?
What about information to help take decisions?
The Age of Search has come to
                           an end

... long live the Age of Recommendation!
●


●
    Chris Anderson in “The Long Tail”
    ●
        “We are leaving the age of information and entering the age of
        recommendation”
●
    CNN Money, “The race to create a 'smart' Google”:
    ●
        “The Web, they say, is leaving the era of search and entering
        one of discovery. What's the difference? Search is what you do
        when you're looking for something. Discovery is when
        something wonderful that you didn't know existed, or didn't
        know how to ask for, finds you.”
Recommender
                Systems
            Recommendations



Read this

            Attend this conference
Data mining +
           all those other things
●   User Interface
●   User modeling
●   System requirements (efficiency, scalability,
    privacy....)
●   Business Logic
●   Serendipity
●   ....
Approaches to
                   Recommendation
Collaborative Filtering
●

    ●
        Recommend items based only on the users past behavior

Content-based
●

    ●
        Recommend based on features inherent to the items

Social recommendations (trust-based)
●
What works

●
    It depends on the domain and particular problem
    ●
        As a general rule, it is usually a good idea to combine:
        Hybrid Recommender Systems
●
 However, in the general case it has been
demonstrated that (currently) the best isolated
approach is CF.
    ●
        Item-based in general more efficient and better but
        mixing CF approaches can improve result
    ●
        Other approaches can improve results in specific
        cases (cold-start problem...)
The CF Ingredients

● List of m Users and a list of n Items
● Each user has a list of items with associated opinion

  ● Explicit opinion - a rating score (numerical scale)


  ● Implicit feedback – purchase records or listening

    history
● Active user for whom the prediction task is performed


● A metric for measuring similarity between users


● A method for selecting a subset of neighbors


● A method for predicting a rating for items not rated by

the active user.

                                                        24
The Netflix Prize

●   500K users x 17K movie
    titles = 100M ratings = $1M
    (if you “only” improve
    existing system by 10%!
    From 0.95 to 0.85 RMSE)
    ●   49K contestants on 40K teams from
        184 countries.
    ●   41K valid submissions from 5K
        teams; 64 submissions per day
    ●   Wining approach uses hundreds of
        predictors from several teams
But ...
User Feedback is Noisy




                        DID YOU HEAR WHAT
                            I LIKE??!!




...and limits Our Prediction
          Accuracy
The Magic Barrier

●   Magic Barrier = Limit on prediction accuracy
    due to noise in original data
●   Natural Noise = involuntary noise introduced by
    users when giving feedback
    ●   Due to (a) mistakes, and (b) lack of resolution in
        personal rating scale
●   Magic Barrier >= Natural Noise Threshold
    ●   Our prediction error cannot be smaller than the
        error in the original data
Our related research questions
X. Amatriain, J.M. Pujol, N. Oliver (2009) "I like It... I like It Not: Measuring Users
 Ratings Noise in Recommender Systems", in UMAP 09


    ●   Q1. Are users inconsistent when providing
        explicit feedback to Recommender Systems via
        the common Rating procedure?
    ●   Q2. How large is the prediction error due to
        these inconsistencies?
    ●   Q3. What factors affect user inconsistencies?
Experimental Setup

●   100 Movies selected from Netflix dataset doing
    a stratified random sampling on popularity
●   Ratings on a 1 to 5 star scale
    ●   Special “not seen” symbol.
●   Trial 1 and 3 = random order; trial 2 = ordered
    by popularity
User Feedback is Noisy

●   Users are inconsistent
●   Inconsistencies are not
    random and depend on
    many factors
User Feedback is Noisy

●   Users are inconsistent
●   Inconsistencies are not
    random and depend on
    many factors
    ●   More inconsistencies for mild
        opinions
User Feedback is Noisy

●   Users are inconsistent
●   Inconsistencies are not
    random and depend on
    many factors
    ●   More inconsistencies for mild
        opinions
    ●   More inconsistencies for
        negative opinions
User’s ratings are far from
           ground truth
                    #Ti       #Tj          #              RMSE


                                                                  
      T1, T2        2185      1961      1838     2308     0.573    0.707


      T1, T3        2185      1909      1774     2320     0.637    0.765


      T2, T3        1969      1909      1730     2140     0.557    0.694



Pairwise comparison between trials, RMSE is already > 0.55 or > 0.69 (Netflix Prize
  was to get below 0.85 !!!)
Algorithm Robustness to NN
Trial 2 is 
       Alg./Trial
consistently the       <T1       T2       T3      Tworst /Tbest
least noisy
       User            1.2011   1.1469   1.1945       4.7%
       Average
         Item          1.0555   1.0361   1.0776        4%
         Average
         User­based    0.9990   0.9640   1.0171       5.5%
         kNN
         Item­based    1.0429   1.0031   1.0417        4%
         kNN
         SVD           1.0244   0.9861   1.0285       4.3%


      RMSE for different Recommendation algorithms 
     ●


     when predicting each of the trials
Rate it Again
X. Amatriain et al. (2009)"Rate it Again: Increasing Recommendation
 Accuracy by User re-Rating", 2009 ACM RecSys
   ●
       Given that users are noisy… can we benefit from
       asking to rate the same movie more than once?

   ●
       We propose an algorithm to allow for multiple ratings of
       the same <user,item> tuple.
       ●
           The algorithm is subjected to two fairness conditions:
            – Algorithm should remove as few ratings as possible (i.e.
              only when there is some certainty that the rating is only
              adding noise)
            – Algorithm should not make up new ratings but decide on
              which of the existing ones are valid.
Re-rating Algorithm
• One source re­rating case:

                                             Examples:
                                             {3, 1}    →Ø
                                             {4}       →4
                                             {3, 4}    →3

                                             (2 source)
                                             {3, 4, 5}  →3


• Given the following milding function:   
Results

●    One-source re-rating (Denoised⊚Denoising)
                           T1⊚T2    ΔT1        T1⊚T3    ΔT1        T2⊚T3    ΔT2
    User­based kNN         0.8861   11.3%   0.8960      10.3%     0.8984    6.8%


    SVD                    0.9121   11.0%      0.9274   9.5%       0.9159   7.1%



●    Two-source re-rating (Denoising T1with the other 2)
          Datasets                  T1⊚(T2, T3)                  ΔT1
          User­based kNN              0.8647                    13.4%
          SVD                         0.8800                    14.1%
Rate it again

●   By asking users to rate items again we can
    remove noise in the dataset
    ●   Improvements of up to 14% in accuracy!
●   Because we don't want all users to re-rate all
    items we design ways to do partial denoising
    ●   Data-dependent: only denoise extreme ratings
    ●   User-dependent: detect “noisy” users
Denoising only noisy users




●    Improvement in RMSE when doing one­source as a function of 
the percentage of denoised ratings and users: selecting only noisy 
users and extreme ratings
The value or a re-rating




                Adding new ratings
                increases performance
                of the CF algorithm
The value or a re-rating



                But you are better off
                doing re-rating than
                new ratings !!
The value or a re-rating



         And much better if you
         know which ratings to
         re-rate!!
Let's recap

●   Users are inconsistent
●   Inconsistencies can depend on many things including
    how the items are presented
●   Inconsistencies produce natural noise
●   Natural noise reduces our prediction accuracy
    independently of the algorithm
●   By asking (some) users to re-rate (some) items again
    we can remove noise and improve accuracy
●   Having users repeat existing ratings may have more
    value than adding new ones
Crowds are not always wise




                           ●   Diversity of opinion
Conditions that are        ●   Independence
needed to guarantee the    ●   Decentralization
Wisdom in a Crowd          ●   Aggregation
Crowds are not always wise




                vs.




        Who  won?
The Wisdom of the Few
    X. Amatriain et al. "The wisdom of the few: a collaborative filtering
     approach based on expert opinions from the web", SIGIR '09
“It is really only experts 
who can reliably account 
   for their reactions”
Expert-based CF
●   expert = individual that we can trust to have produced
    thoughtful, consistent and reliable evaluations (ratings) of
    items in a given domain
●   Expert-based Collaborative Filtering
    ●   Find neighbors from a reduced set of experts instead of
        regular users.
         1. Identify domain experts with reliable ratings
         2. For each user, compute “expert neighbors”
         3. Compute recommendations similar to standard kNN CF
User Study
●   57 participants, only 14.5 ratings/participant
●   50% of the users consider Expert-based CF to be
    good or very good
●   Expert-based CF: only algorithm with an average
    rating over 3 (on a 0-4 scale)
Advantages of the Approach

●   Noise                          ●   Cold Start problem
    ●   Experts introduce less         ●   Experts rate items as
        natural noise                      soon as they are
●   Malicious Ratings                      available
    ●   Dataset can be monitored
                                   ●   Scalability
        to avoid shilling              ●   Dataset is several order of
●   Data Sparsity                          magnitudes smaller
    ●   Reduced set of domain
                                   ●   Privacy
        experts can be motivated       ●   Recommendations can be
        to rate items                      computed locally
So...

●   Can we generate meaningful and personalized
    recommendations ensuring 100% privacy?
    ●   YES!
●   Can we have a recommendation algorithm that
    is so efficient to run on a phone?
    ●   YES!
●   Can we have a recommender system that
    works even if there is only one user?
    ●   YES!
Architecture of the approach
Some implementations

●   A distributed Music Recommendation engine
Some implementations (II)

●   A geo-localized Mobile Movie Recommender
    iPhone App
Geo-localized Expert Movie
   Recommendations




                             0




       Powered by...
Expert CF...

●   Recreates the old paradigm of manually finding
    your favorite experts in magazines but in a fully
    automatic non-supervised manner.
What if we don't have ratings?


The fascinating world of implicit user feedback



 Examples of implicit feedback:
 ● Movies you watched

 ● Links you visited

 ● Songs you listened to

 ● Items you bought

 ● ....
Main features of implicit
               feedback
●   Our starting hypothesis are different from those
    in previous works:
    1.Implicit feedback can contain negative feedback –
      given the right granularity and diversity, low
      feedback = negative feedback
    2.Numerical value of implicit feedback can be
      mapped to preference given the appropriate
      mapping
    3.Once we have a trustworthy mapping, we can
      evaluate implicit feedback predictions same way as
      with explicit feedback.
Our questions

●   Q1. Is it possible to predict ratings a user would
    give to items given their implicit feedback?
●   Q2. Are there other variables that affect this
    mapping?
Experimental setup

●   Online user study on the music domain
●   Users required to have a music profile in lastfm
●   Goal: Compare explicit ratings with their
    listening history taking to account a number of
    controlled variables
Results. Do explicit ratings relate
      to implicit feedback?




                                Almost perfect linear 
                              relation between ratings 
                               and quantized implicit 
                                      feedback
Results. Do explicit ratings relate
          to implicit feedback?




Extreme ratings have clear 
  ascending/descending 
  trend, but mild ratings 
 respond more to changes 
      in one direction
Results. Do other variables affect?




             Albums listened to more 
              recently tend to receive 
               more positive ratings
Results. Do other variables affect?




       Contrary to our expectations, 
       global album popularity does 
             not affect ratings
Results. What about user
                  variables?

●   We obtained many demographic (age, sex, location...)
    and usage variables (hours of music per week,
    concerts, music magazines, ways of buying music...)
    in the study.
●   We performed an ANOVA analysis on the data to
    understand which variables explained some of its
    variance.
●   Only one of the usage variables, contributed (Sig.
    Value < 0.05) → “Listening Style” encoded whether the
    user listened preferably to tracks, full albums, or both.
Results. Regression
                   Analysis
           –   Model   1:   riu   =   β0   +   β1   ·   ifiu
           –   Model   2:   riu   =   β0   +   β1   ·   ifiu + β2 · reiu
           –   Model   3:   riu   =   β0   +   β1   ·   ifiu + β2 · reiu + β3 · gpi
           –   Model   4:   riu   =   β0   +   β1   ·   ifiu + β2 · reiu + β3 · ifiu · reiu

Model    R2                 F-value                         p-value        β0        β1       β2       β3

  1      0.125    F (1, 10120) = 1146                   < 2.2 · 10−16      2.726    0.499

  2     0.1358    F (2, 10019) = 794.8                  < 2.2 · 10−16      2.491    0.484     0.133

  3     0.1362    F (3, 10018) = 531.8                  < 2.2 · 10−16      2.435    0.486     0.134   0.0285

  4     0.1368    F (3, 10018) = 534.7                  < 2.2 · 10−16      2.677    0.379     0.038    0.053



All models meaningfully explain the data. Introducing “recentness” 
improves 10% but “global popularity” or interaction between variables do 
not make much difference
Results. Predictive power
               Model RMSE –
                     Excluding non-rated items


        User Average                    1.131

                   1                    1.026

                   2                    1.017

                   3                    1.016

                   4                    1.016




Error in predicting 20% of the ratings, having trained our 
regression model on the other 80%
Conclusions
●   Recommender systems and similar applications
    usually focus on having more data
●   But... many times is not about having more but rather
    better data
●   User feedback can not always be treated as ground
    truth and needs to be processed
●   Crowds are not always wise and sometimes we are
    better off using experts
●   Implicit feedback represents a good alternative to
    understand users but mapping is not trivial
Colleagues
●   Josep M. Pujol and Nuria Oliver (Telefonica)
    worked on Natural Noise and Wisdom of the
    Few projects
●   Nava Tintarev (Telefonica) worked on
    Natural Noise


          External Collaborators
●   Neal Lathia (UCL, London), Haewook Ahn
    (KAIST, Korea), Jaewook Ahn (Pittsburgh
    Univ.), and Josep Bachs (UPF, Barcelona)
    on Wisdom of the Few
●   Denis Parra (Pittsburgh Univ.) worked on
    implicit-explicit
Thanks!


        Questions?

    Xavier Amatriain
          xar@tid.es
      http://xavier.amatriain.net
http://technocalifornia.blogspot.com
            @xamat

More Related Content

What's hot

Collaborative Filtering using KNN
Collaborative Filtering using KNNCollaborative Filtering using KNN
Collaborative Filtering using KNNŞeyda Hatipoğlu
 
Recommendation System
Recommendation SystemRecommendation System
Recommendation SystemAnamta Sayyed
 
Collaborative Filtering Recommendation System
Collaborative Filtering Recommendation SystemCollaborative Filtering Recommendation System
Collaborative Filtering Recommendation SystemMilind Gokhale
 
Recommender system introduction
Recommender system   introductionRecommender system   introduction
Recommender system introductionLiang Xiang
 
Recommendation system (1).pptx
Recommendation system (1).pptxRecommendation system (1).pptx
Recommendation system (1).pptxprathammishra28
 
Recommendation system
Recommendation systemRecommendation system
Recommendation systemDing Li
 
Matrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender SystemsMatrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender SystemsLei Guo
 
Recent advances in deep recommender systems
Recent advances in deep recommender systemsRecent advances in deep recommender systems
Recent advances in deep recommender systemsNAVER Engineering
 
Movie recommendation system using collaborative filtering system
Movie recommendation system using collaborative filtering system Movie recommendation system using collaborative filtering system
Movie recommendation system using collaborative filtering system Mauryasuraj98
 
Recommendation system
Recommendation systemRecommendation system
Recommendation systemRishabh Mehta
 
Movie Recommendation engine
Movie Recommendation engineMovie Recommendation engine
Movie Recommendation engineJayesh Lahori
 
Collaborative Filtering 1: User-based CF
Collaborative Filtering 1: User-based CFCollaborative Filtering 1: User-based CF
Collaborative Filtering 1: User-based CFYusuke Yamamoto
 
Recommender systems for E-commerce
Recommender systems for E-commerceRecommender systems for E-commerce
Recommender systems for E-commerceAlexander Konduforov
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender SystemsLior Rokach
 

What's hot (20)

Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
Collaborative Filtering using KNN
Collaborative Filtering using KNNCollaborative Filtering using KNN
Collaborative Filtering using KNN
 
Recommender system
Recommender systemRecommender system
Recommender system
 
Recommendation System
Recommendation SystemRecommendation System
Recommendation System
 
Collaborative Filtering Recommendation System
Collaborative Filtering Recommendation SystemCollaborative Filtering Recommendation System
Collaborative Filtering Recommendation System
 
Recommender system introduction
Recommender system   introductionRecommender system   introduction
Recommender system introduction
 
Recommendation system (1).pptx
Recommendation system (1).pptxRecommendation system (1).pptx
Recommendation system (1).pptx
 
Recommendation system
Recommendation systemRecommendation system
Recommendation system
 
Matrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender SystemsMatrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender Systems
 
Recent advances in deep recommender systems
Recent advances in deep recommender systemsRecent advances in deep recommender systems
Recent advances in deep recommender systems
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
Collaborative filtering
Collaborative filteringCollaborative filtering
Collaborative filtering
 
Content based filtering
Content based filteringContent based filtering
Content based filtering
 
Movie recommendation system using collaborative filtering system
Movie recommendation system using collaborative filtering system Movie recommendation system using collaborative filtering system
Movie recommendation system using collaborative filtering system
 
Recommendation system
Recommendation systemRecommendation system
Recommendation system
 
Movie Recommendation engine
Movie Recommendation engineMovie Recommendation engine
Movie Recommendation engine
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
Collaborative Filtering 1: User-based CF
Collaborative Filtering 1: User-based CFCollaborative Filtering 1: User-based CF
Collaborative Filtering 1: User-based CF
 
Recommender systems for E-commerce
Recommender systems for E-commerceRecommender systems for E-commerce
Recommender systems for E-commerce
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 

Viewers also liked

Key Lessons Learned Building Recommender Systems for Large-Scale Social Netw...
 Key Lessons Learned Building Recommender Systems for Large-Scale Social Netw... Key Lessons Learned Building Recommender Systems for Large-Scale Social Netw...
Key Lessons Learned Building Recommender Systems for Large-Scale Social Netw...Christian Posse
 
BIG2016- Lessons Learned from building real-life user-focused Big Data systems
BIG2016- Lessons Learned from building real-life user-focused Big Data systemsBIG2016- Lessons Learned from building real-life user-focused Big Data systems
BIG2016- Lessons Learned from building real-life user-focused Big Data systemsXavier Amatriain
 
Machine Learning for Q&A Sites: The Quora Example
Machine Learning for Q&A Sites: The Quora ExampleMachine Learning for Q&A Sites: The Quora Example
Machine Learning for Q&A Sites: The Quora ExampleXavier Amatriain
 
Past, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspectivePast, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspectiveXavier Amatriain
 
Past present and future of Recommender Systems: an Industry Perspective
Past present and future of Recommender Systems: an Industry PerspectivePast present and future of Recommender Systems: an Industry Perspective
Past present and future of Recommender Systems: an Industry PerspectiveXavier Amatriain
 
Machine Learning to Grow the World's Knowledge
Machine Learning to Grow  the World's KnowledgeMachine Learning to Grow  the World's Knowledge
Machine Learning to Grow the World's KnowledgeXavier Amatriain
 
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorial
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorialBuilding Large-scale Real-world Recommender Systems - Recsys2012 tutorial
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorialXavier Amatriain
 
Setting up Kubernetes with tectonic
Setting up Kubernetes with tectonicSetting up Kubernetes with tectonic
Setting up Kubernetes with tectonicVishal Biyani
 
Kdd 2014 Tutorial - the recommender problem revisited
Kdd 2014 Tutorial -  the recommender problem revisitedKdd 2014 Tutorial -  the recommender problem revisited
Kdd 2014 Tutorial - the recommender problem revisitedXavier Amatriain
 
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...Xavier Amatriain
 
Using AWS to Build a Graph-Based Product Recommendation System (BDT303) | AWS...
Using AWS to Build a Graph-Based Product Recommendation System (BDT303) | AWS...Using AWS to Build a Graph-Based Product Recommendation System (BDT303) | AWS...
Using AWS to Build a Graph-Based Product Recommendation System (BDT303) | AWS...Amazon Web Services
 
Approximate nearest neighbor methods and vector models – NYC ML meetup
Approximate nearest neighbor methods and vector models – NYC ML meetupApproximate nearest neighbor methods and vector models – NYC ML meetup
Approximate nearest neighbor methods and vector models – NYC ML meetupErik Bernhardsson
 
Recommender Systems with Apache Spark's ALS Function
Recommender Systems with Apache Spark's ALS FunctionRecommender Systems with Apache Spark's ALS Function
Recommender Systems with Apache Spark's ALS FunctionWill Johnson
 
Recommender system algorithm and architecture
Recommender system algorithm and architectureRecommender system algorithm and architecture
Recommender system algorithm and architectureLiang Xiang
 
Collaborative Filtering with Spark
Collaborative Filtering with SparkCollaborative Filtering with Spark
Collaborative Filtering with SparkChris Johnson
 
The Universal Recommender
The Universal RecommenderThe Universal Recommender
The Universal RecommenderPat Ferrel
 
Recsys 2014 Tutorial - The Recommender Problem Revisited
Recsys 2014 Tutorial - The Recommender Problem RevisitedRecsys 2014 Tutorial - The Recommender Problem Revisited
Recsys 2014 Tutorial - The Recommender Problem RevisitedXavier Amatriain
 
Music Recommendations at Scale with Spark
Music Recommendations at Scale with SparkMusic Recommendations at Scale with Spark
Music Recommendations at Scale with SparkChris Johnson
 
How to Build a Recommendation Engine on Spark
How to Build a Recommendation Engine on SparkHow to Build a Recommendation Engine on Spark
How to Build a Recommendation Engine on SparkCaserta
 

Viewers also liked (20)

Discovery
DiscoveryDiscovery
Discovery
 
Key Lessons Learned Building Recommender Systems for Large-Scale Social Netw...
 Key Lessons Learned Building Recommender Systems for Large-Scale Social Netw... Key Lessons Learned Building Recommender Systems for Large-Scale Social Netw...
Key Lessons Learned Building Recommender Systems for Large-Scale Social Netw...
 
BIG2016- Lessons Learned from building real-life user-focused Big Data systems
BIG2016- Lessons Learned from building real-life user-focused Big Data systemsBIG2016- Lessons Learned from building real-life user-focused Big Data systems
BIG2016- Lessons Learned from building real-life user-focused Big Data systems
 
Machine Learning for Q&A Sites: The Quora Example
Machine Learning for Q&A Sites: The Quora ExampleMachine Learning for Q&A Sites: The Quora Example
Machine Learning for Q&A Sites: The Quora Example
 
Past, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspectivePast, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspective
 
Past present and future of Recommender Systems: an Industry Perspective
Past present and future of Recommender Systems: an Industry PerspectivePast present and future of Recommender Systems: an Industry Perspective
Past present and future of Recommender Systems: an Industry Perspective
 
Machine Learning to Grow the World's Knowledge
Machine Learning to Grow  the World's KnowledgeMachine Learning to Grow  the World's Knowledge
Machine Learning to Grow the World's Knowledge
 
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorial
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorialBuilding Large-scale Real-world Recommender Systems - Recsys2012 tutorial
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorial
 
Setting up Kubernetes with tectonic
Setting up Kubernetes with tectonicSetting up Kubernetes with tectonic
Setting up Kubernetes with tectonic
 
Kdd 2014 Tutorial - the recommender problem revisited
Kdd 2014 Tutorial -  the recommender problem revisitedKdd 2014 Tutorial -  the recommender problem revisited
Kdd 2014 Tutorial - the recommender problem revisited
 
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
 
Using AWS to Build a Graph-Based Product Recommendation System (BDT303) | AWS...
Using AWS to Build a Graph-Based Product Recommendation System (BDT303) | AWS...Using AWS to Build a Graph-Based Product Recommendation System (BDT303) | AWS...
Using AWS to Build a Graph-Based Product Recommendation System (BDT303) | AWS...
 
Approximate nearest neighbor methods and vector models – NYC ML meetup
Approximate nearest neighbor methods and vector models – NYC ML meetupApproximate nearest neighbor methods and vector models – NYC ML meetup
Approximate nearest neighbor methods and vector models – NYC ML meetup
 
Recommender Systems with Apache Spark's ALS Function
Recommender Systems with Apache Spark's ALS FunctionRecommender Systems with Apache Spark's ALS Function
Recommender Systems with Apache Spark's ALS Function
 
Recommender system algorithm and architecture
Recommender system algorithm and architectureRecommender system algorithm and architecture
Recommender system algorithm and architecture
 
Collaborative Filtering with Spark
Collaborative Filtering with SparkCollaborative Filtering with Spark
Collaborative Filtering with Spark
 
The Universal Recommender
The Universal RecommenderThe Universal Recommender
The Universal Recommender
 
Recsys 2014 Tutorial - The Recommender Problem Revisited
Recsys 2014 Tutorial - The Recommender Problem RevisitedRecsys 2014 Tutorial - The Recommender Problem Revisited
Recsys 2014 Tutorial - The Recommender Problem Revisited
 
Music Recommendations at Scale with Spark
Music Recommendations at Scale with SparkMusic Recommendations at Scale with Spark
Music Recommendations at Scale with Spark
 
How to Build a Recommendation Engine on Spark
How to Build a Recommendation Engine on SparkHow to Build a Recommendation Engine on Spark
How to Build a Recommendation Engine on Spark
 

Similar to The Science and the Magic of User Feedback for Recommender Systems

Media, data, context... and the Holy Grail of User Taste Prediction
Media, data, context... and the Holy Grail of User Taste PredictionMedia, data, context... and the Holy Grail of User Taste Prediction
Media, data, context... and the Holy Grail of User Taste PredictionXavier Amatriain
 
User-driven Approaches to Recsys
User-driven Approaches to RecsysUser-driven Approaches to Recsys
User-driven Approaches to RecsysXavier Amatriain
 
The Beauty of Computing with People
The Beauty of Computing with PeopleThe Beauty of Computing with People
The Beauty of Computing with PeopleXavier Amatriain
 
Presentatie Big Data Forum 22 januari 2013 - Big Data en Big Society
Presentatie Big Data Forum 22 januari 2013 - Big Data en Big SocietyPresentatie Big Data Forum 22 januari 2013 - Big Data en Big Society
Presentatie Big Data Forum 22 januari 2013 - Big Data en Big SocietySURFnet
 
Girardin lift france10_notes
Girardin lift france10_notesGirardin lift france10_notes
Girardin lift france10_notesFing
 
Beyond the Internet: Seamless Global Communication
Beyond the Internet: Seamless Global CommunicationBeyond the Internet: Seamless Global Communication
Beyond the Internet: Seamless Global CommunicationJerry Fishenden
 
IT: Current Stage and Future Trend
IT: Current Stage and Future Trend IT: Current Stage and Future Trend
IT: Current Stage and Future Trend Widy Widyawan
 
Internet of Things - The Tip of the Iceberg or The Tipping Point
Internet of Things - The Tip of the Iceberg or The Tipping PointInternet of Things - The Tip of the Iceberg or The Tipping Point
Internet of Things - The Tip of the Iceberg or The Tipping PointDr. Mazlan Abbas
 
Introducing the Internet of Things: lecture @IULM University
Introducing the Internet of Things: lecture @IULM UniversityIntroducing the Internet of Things: lecture @IULM University
Introducing the Internet of Things: lecture @IULM UniversityLeandro Agro'
 
Mobile Monday Malta Launch - Jari Tammisto
Mobile Monday Malta Launch - Jari TammistoMobile Monday Malta Launch - Jari Tammisto
Mobile Monday Malta Launch - Jari TammistoMobile Monday Malta
 
MICROSTRATEGY - Sessione introduttiva sulla piattaforma di Business Intelligence
MICROSTRATEGY - Sessione introduttiva sulla piattaforma di Business IntelligenceMICROSTRATEGY - Sessione introduttiva sulla piattaforma di Business Intelligence
MICROSTRATEGY - Sessione introduttiva sulla piattaforma di Business IntelligenceTwinergy
 
Towards user co-creation of value on the Internet-of-Things (IoT)
Towards user co-creation of value on the Internet-of-Things (IoT)Towards user co-creation of value on the Internet-of-Things (IoT)
Towards user co-creation of value on the Internet-of-Things (IoT)trappenl
 
Arti e Culture Digitali Lezione 1, Luca Simeone
Arti e Culture Digitali Lezione 1, Luca SimeoneArti e Culture Digitali Lezione 1, Luca Simeone
Arti e Culture Digitali Lezione 1, Luca Simeonemeetingsalongtheedge
 
PLA 2012: Librarian's Field Guide to Near Field Communication
PLA 2012: Librarian's Field Guide to Near Field CommunicationPLA 2012: Librarian's Field Guide to Near Field Communication
PLA 2012: Librarian's Field Guide to Near Field CommunicationKristen Yarmey
 
Ross Rubin's Presentation
Ross Rubin's PresentationRoss Rubin's Presentation
Ross Rubin's PresentationMediabistro
 
ChaCha ThinkLA Mobile Breakfast Keynote - Scott A. Jones
ChaCha ThinkLA Mobile Breakfast Keynote - Scott A. JonesChaCha ThinkLA Mobile Breakfast Keynote - Scott A. Jones
ChaCha ThinkLA Mobile Breakfast Keynote - Scott A. JonesChaCha
 

Similar to The Science and the Magic of User Feedback for Recommender Systems (20)

Media, data, context... and the Holy Grail of User Taste Prediction
Media, data, context... and the Holy Grail of User Taste PredictionMedia, data, context... and the Holy Grail of User Taste Prediction
Media, data, context... and the Holy Grail of User Taste Prediction
 
User-driven Approaches to Recsys
User-driven Approaches to RecsysUser-driven Approaches to Recsys
User-driven Approaches to Recsys
 
Being Social
Being SocialBeing Social
Being Social
 
The Beauty of Computing with People
The Beauty of Computing with PeopleThe Beauty of Computing with People
The Beauty of Computing with People
 
Presentatie Big Data Forum 22 januari 2013 - Big Data en Big Society
Presentatie Big Data Forum 22 januari 2013 - Big Data en Big SocietyPresentatie Big Data Forum 22 januari 2013 - Big Data en Big Society
Presentatie Big Data Forum 22 januari 2013 - Big Data en Big Society
 
Now & Next - Mediacom
Now & Next - MediacomNow & Next - Mediacom
Now & Next - Mediacom
 
Girardin lift france10_notes
Girardin lift france10_notesGirardin lift france10_notes
Girardin lift france10_notes
 
Beyond the Internet: Seamless Global Communication
Beyond the Internet: Seamless Global CommunicationBeyond the Internet: Seamless Global Communication
Beyond the Internet: Seamless Global Communication
 
IT: Current Stage and Future Trend
IT: Current Stage and Future Trend IT: Current Stage and Future Trend
IT: Current Stage and Future Trend
 
Internet of Things - The Tip of the Iceberg or The Tipping Point
Internet of Things - The Tip of the Iceberg or The Tipping PointInternet of Things - The Tip of the Iceberg or The Tipping Point
Internet of Things - The Tip of the Iceberg or The Tipping Point
 
Introducing the Internet of Things: lecture @IULM University
Introducing the Internet of Things: lecture @IULM UniversityIntroducing the Internet of Things: lecture @IULM University
Introducing the Internet of Things: lecture @IULM University
 
Mobile Monday Malta Launch - Jari Tammisto
Mobile Monday Malta Launch - Jari TammistoMobile Monday Malta Launch - Jari Tammisto
Mobile Monday Malta Launch - Jari Tammisto
 
MICROSTRATEGY - Sessione introduttiva sulla piattaforma di Business Intelligence
MICROSTRATEGY - Sessione introduttiva sulla piattaforma di Business IntelligenceMICROSTRATEGY - Sessione introduttiva sulla piattaforma di Business Intelligence
MICROSTRATEGY - Sessione introduttiva sulla piattaforma di Business Intelligence
 
Towards user co-creation of value on the Internet-of-Things (IoT)
Towards user co-creation of value on the Internet-of-Things (IoT)Towards user co-creation of value on the Internet-of-Things (IoT)
Towards user co-creation of value on the Internet-of-Things (IoT)
 
Arti e Culture Digitali Lezione 1, Luca Simeone
Arti e Culture Digitali Lezione 1, Luca SimeoneArti e Culture Digitali Lezione 1, Luca Simeone
Arti e Culture Digitali Lezione 1, Luca Simeone
 
Web². The Internet of everything
Web². The Internet of everythingWeb². The Internet of everything
Web². The Internet of everything
 
CAPS at infoday
CAPS at infodayCAPS at infoday
CAPS at infoday
 
PLA 2012: Librarian's Field Guide to Near Field Communication
PLA 2012: Librarian's Field Guide to Near Field CommunicationPLA 2012: Librarian's Field Guide to Near Field Communication
PLA 2012: Librarian's Field Guide to Near Field Communication
 
Ross Rubin's Presentation
Ross Rubin's PresentationRoss Rubin's Presentation
Ross Rubin's Presentation
 
ChaCha ThinkLA Mobile Breakfast Keynote - Scott A. Jones
ChaCha ThinkLA Mobile Breakfast Keynote - Scott A. JonesChaCha ThinkLA Mobile Breakfast Keynote - Scott A. Jones
ChaCha ThinkLA Mobile Breakfast Keynote - Scott A. Jones
 

More from Xavier Amatriain

Data/AI driven product development: from video streaming to telehealth
Data/AI driven product development: from video streaming to telehealthData/AI driven product development: from video streaming to telehealth
Data/AI driven product development: from video streaming to telehealthXavier Amatriain
 
AI-driven product innovation: from Recommender Systems to COVID-19
AI-driven product innovation: from Recommender Systems to COVID-19AI-driven product innovation: from Recommender Systems to COVID-19
AI-driven product innovation: from Recommender Systems to COVID-19Xavier Amatriain
 
AI for COVID-19 - Q42020 update
AI for COVID-19 - Q42020 updateAI for COVID-19 - Q42020 update
AI for COVID-19 - Q42020 updateXavier Amatriain
 
AI for COVID-19: An online virtual care approach
AI for COVID-19: An online virtual care approachAI for COVID-19: An online virtual care approach
AI for COVID-19: An online virtual care approachXavier Amatriain
 
Lessons learned from building practical deep learning systems
Lessons learned from building practical deep learning systemsLessons learned from building practical deep learning systems
Lessons learned from building practical deep learning systemsXavier Amatriain
 
AI for healthcare: Scaling Access and Quality of Care for Everyone
AI for healthcare: Scaling Access and Quality of Care for EveryoneAI for healthcare: Scaling Access and Quality of Care for Everyone
AI for healthcare: Scaling Access and Quality of Care for EveryoneXavier Amatriain
 
Towards online universal quality healthcare through AI
Towards online universal quality healthcare through AITowards online universal quality healthcare through AI
Towards online universal quality healthcare through AIXavier Amatriain
 
From one to zero: Going smaller as a growth strategy
From one to zero: Going smaller as a growth strategyFrom one to zero: Going smaller as a growth strategy
From one to zero: Going smaller as a growth strategyXavier Amatriain
 
Learning to speak medicine
Learning to speak medicineLearning to speak medicine
Learning to speak medicineXavier Amatriain
 
Recommender Systems In Industry
Recommender Systems In IndustryRecommender Systems In Industry
Recommender Systems In IndustryXavier Amatriain
 
Medical advice as a Recommender System
Medical advice as a Recommender SystemMedical advice as a Recommender System
Medical advice as a Recommender SystemXavier Amatriain
 
Staying Shallow & Lean in a Deep Learning World
Staying Shallow & Lean in a Deep Learning WorldStaying Shallow & Lean in a Deep Learning World
Staying Shallow & Lean in a Deep Learning WorldXavier Amatriain
 
Strata 2016 - Lessons Learned from building real-life Machine Learning Systems
Strata 2016 -  Lessons Learned from building real-life Machine Learning SystemsStrata 2016 -  Lessons Learned from building real-life Machine Learning Systems
Strata 2016 - Lessons Learned from building real-life Machine Learning SystemsXavier Amatriain
 
Barcelona ML Meetup - Lessons Learned
Barcelona ML Meetup - Lessons LearnedBarcelona ML Meetup - Lessons Learned
Barcelona ML Meetup - Lessons LearnedXavier Amatriain
 
10 more lessons learned from building Machine Learning systems - MLConf
10 more lessons learned from building Machine Learning systems - MLConf10 more lessons learned from building Machine Learning systems - MLConf
10 more lessons learned from building Machine Learning systems - MLConfXavier Amatriain
 
10 more lessons learned from building Machine Learning systems
10 more lessons learned from building Machine Learning systems10 more lessons learned from building Machine Learning systems
10 more lessons learned from building Machine Learning systemsXavier Amatriain
 
MLConf Seattle 2015 - ML@Quora
MLConf Seattle 2015 - ML@QuoraMLConf Seattle 2015 - ML@Quora
MLConf Seattle 2015 - ML@QuoraXavier Amatriain
 
Lean DevOps - Lessons Learned from Innovation-driven Companies
Lean DevOps - Lessons Learned from Innovation-driven CompaniesLean DevOps - Lessons Learned from Innovation-driven Companies
Lean DevOps - Lessons Learned from Innovation-driven CompaniesXavier Amatriain
 
10 Lessons Learned from Building Machine Learning Systems
10 Lessons Learned from Building Machine Learning Systems10 Lessons Learned from Building Machine Learning Systems
10 Lessons Learned from Building Machine Learning SystemsXavier Amatriain
 

More from Xavier Amatriain (20)

Data/AI driven product development: from video streaming to telehealth
Data/AI driven product development: from video streaming to telehealthData/AI driven product development: from video streaming to telehealth
Data/AI driven product development: from video streaming to telehealth
 
AI-driven product innovation: from Recommender Systems to COVID-19
AI-driven product innovation: from Recommender Systems to COVID-19AI-driven product innovation: from Recommender Systems to COVID-19
AI-driven product innovation: from Recommender Systems to COVID-19
 
AI for COVID-19 - Q42020 update
AI for COVID-19 - Q42020 updateAI for COVID-19 - Q42020 update
AI for COVID-19 - Q42020 update
 
AI for COVID-19: An online virtual care approach
AI for COVID-19: An online virtual care approachAI for COVID-19: An online virtual care approach
AI for COVID-19: An online virtual care approach
 
Lessons learned from building practical deep learning systems
Lessons learned from building practical deep learning systemsLessons learned from building practical deep learning systems
Lessons learned from building practical deep learning systems
 
AI for healthcare: Scaling Access and Quality of Care for Everyone
AI for healthcare: Scaling Access and Quality of Care for EveryoneAI for healthcare: Scaling Access and Quality of Care for Everyone
AI for healthcare: Scaling Access and Quality of Care for Everyone
 
Towards online universal quality healthcare through AI
Towards online universal quality healthcare through AITowards online universal quality healthcare through AI
Towards online universal quality healthcare through AI
 
From one to zero: Going smaller as a growth strategy
From one to zero: Going smaller as a growth strategyFrom one to zero: Going smaller as a growth strategy
From one to zero: Going smaller as a growth strategy
 
Learning to speak medicine
Learning to speak medicineLearning to speak medicine
Learning to speak medicine
 
ML to cure the world
ML to cure the worldML to cure the world
ML to cure the world
 
Recommender Systems In Industry
Recommender Systems In IndustryRecommender Systems In Industry
Recommender Systems In Industry
 
Medical advice as a Recommender System
Medical advice as a Recommender SystemMedical advice as a Recommender System
Medical advice as a Recommender System
 
Staying Shallow & Lean in a Deep Learning World
Staying Shallow & Lean in a Deep Learning WorldStaying Shallow & Lean in a Deep Learning World
Staying Shallow & Lean in a Deep Learning World
 
Strata 2016 - Lessons Learned from building real-life Machine Learning Systems
Strata 2016 -  Lessons Learned from building real-life Machine Learning SystemsStrata 2016 -  Lessons Learned from building real-life Machine Learning Systems
Strata 2016 - Lessons Learned from building real-life Machine Learning Systems
 
Barcelona ML Meetup - Lessons Learned
Barcelona ML Meetup - Lessons LearnedBarcelona ML Meetup - Lessons Learned
Barcelona ML Meetup - Lessons Learned
 
10 more lessons learned from building Machine Learning systems - MLConf
10 more lessons learned from building Machine Learning systems - MLConf10 more lessons learned from building Machine Learning systems - MLConf
10 more lessons learned from building Machine Learning systems - MLConf
 
10 more lessons learned from building Machine Learning systems
10 more lessons learned from building Machine Learning systems10 more lessons learned from building Machine Learning systems
10 more lessons learned from building Machine Learning systems
 
MLConf Seattle 2015 - ML@Quora
MLConf Seattle 2015 - ML@QuoraMLConf Seattle 2015 - ML@Quora
MLConf Seattle 2015 - ML@Quora
 
Lean DevOps - Lessons Learned from Innovation-driven Companies
Lean DevOps - Lessons Learned from Innovation-driven CompaniesLean DevOps - Lessons Learned from Innovation-driven Companies
Lean DevOps - Lessons Learned from Innovation-driven Companies
 
10 Lessons Learned from Building Machine Learning Systems
10 Lessons Learned from Building Machine Learning Systems10 Lessons Learned from Building Machine Learning Systems
10 Lessons Learned from Building Machine Learning Systems
 

Recently uploaded

What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 

Recently uploaded (20)

What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 

The Science and the Magic of User Feedback for Recommender Systems

  • 1. The Science and the Magic of User Feedback for Recommender Systems Xavier Amatriain Bay Area, March '11
  • 2. But first... About Telefonica and Telefonica R&D
  • 3. Telefonica is a fast-growing Telecom 1989 2000 2008 Clients About 12 About 68 About 260 million million million subscribers customers customers Services Basic Wireline and mobile Integrated ICT telephone and voice, data and solutions for all data services Internet services customers Geographies Operations in Operations in Spain 25 countries 16 countries Staff About 71,000 About 149,000 About 257,000 professionals professionals professionals Finances Rev: 4,273 M€ Rev: 28,485 M€ Rev: 57,946 M€ EPS(1): 0.45 € EPS(1): 0.67 € EPS: 1.63 € (1) EPS: Earnings per share
  • 4. Currently among the largest in the world Telco sector worldwide ranking by market cap (US$ bn) Source: Bloomberg, 06/12/09 Just announced 2010 results: record net earnings,  first Spanish company ever to make > 10B €
  • 5. Leader in South America Data as of March ‘09 1 2 Argentina: 20.9 million Wireline market rank  2 1 Brazil: 61.4 million Mobile market rank 2 Central America: 6.1 million 1 2 Colombia: 12.6 million 1 1 Chile: 10.1 million 2 Ecuador: 3.3 million 2 Mexico: 15.7 million 1 1 Peru: 15.2 million 1 Uruguay: 1.5 million 2 Venezuela: 12.0 million Total Accesses (as of March ‘09) 159.5 million Notes: - Central America includes Guatemala, Panama, El Salvador and Nicaragua - Total accesses figure includes Narrowband Internet accesses of Terra Brasil and Terra Colombia, and Broadband Internet accesses of Terra Brasil, Telefónica de Argentina, Terra Guatemala and Terra México.
  • 6. And a significant footprint in Europe Wireline market rank Mobile market rank Data as of March ‘09 1 1 Spain: 47.2 million 1 UK: 20.8 million 4 Germany: 16.0 million 2 Ireland: 1.7 million Czech Republic: 7.7 million 1 2 Slovakia: 0.4 million 3 Total Accesses (as of March ’09) 93.8 million
  • 7. Scientific Research Mobile and Ubicomp Multimedia Core User Modelling & Data Mining HCIR DATA MINING Wireless Systems Content Distribution & P2P Social Networks
  • 8. Projects Recommendation  Algorithms Tourist routes Social  contacts Music User Analysis  Movies & Modeling Contextual The Wisdom of the  Noise in users’  Few Mobile ratings  Tourist  behavior Microprofiles Implicit user  feedback Multiverse  Tensor  IPTV viewing habits Factorization
  • 9. Projects Recommendation  Algorithms Tourist routes Social  contacts Music User Analysis  Movies & Modeling Contextual The Wisdom of the  Noise in users’  Few Mobile ratings  Tourist  behavior Microprofiles Implicit user  feedback Multiverse  Tensor  IPTV viewing habits Factorization
  • 10. And about the world we live in...
  • 12. More is Less W or se D ec is ns io io ns is ec D s es L
  • 13. Analysis Paralysis is making headlines
  • 14. Search engines don’t always hold the answer
  • 15.
  • 18. What about information to help take decisions?
  • 19. The Age of Search has come to an end ... long live the Age of Recommendation! ● ● Chris Anderson in “The Long Tail” ● “We are leaving the age of information and entering the age of recommendation” ● CNN Money, “The race to create a 'smart' Google”: ● “The Web, they say, is leaving the era of search and entering one of discovery. What's the difference? Search is what you do when you're looking for something. Discovery is when something wonderful that you didn't know existed, or didn't know how to ask for, finds you.”
  • 20. Recommender Systems Recommendations Read this Attend this conference
  • 21. Data mining + all those other things ● User Interface ● User modeling ● System requirements (efficiency, scalability, privacy....) ● Business Logic ● Serendipity ● ....
  • 22. Approaches to Recommendation Collaborative Filtering ● ● Recommend items based only on the users past behavior Content-based ● ● Recommend based on features inherent to the items Social recommendations (trust-based) ●
  • 23. What works ● It depends on the domain and particular problem ● As a general rule, it is usually a good idea to combine: Hybrid Recommender Systems ● However, in the general case it has been demonstrated that (currently) the best isolated approach is CF. ● Item-based in general more efficient and better but mixing CF approaches can improve result ● Other approaches can improve results in specific cases (cold-start problem...)
  • 24. The CF Ingredients ● List of m Users and a list of n Items ● Each user has a list of items with associated opinion ● Explicit opinion - a rating score (numerical scale) ● Implicit feedback – purchase records or listening history ● Active user for whom the prediction task is performed ● A metric for measuring similarity between users ● A method for selecting a subset of neighbors ● A method for predicting a rating for items not rated by the active user. 24
  • 25. The Netflix Prize ● 500K users x 17K movie titles = 100M ratings = $1M (if you “only” improve existing system by 10%! From 0.95 to 0.85 RMSE) ● 49K contestants on 40K teams from 184 countries. ● 41K valid submissions from 5K teams; 64 submissions per day ● Wining approach uses hundreds of predictors from several teams
  • 27. User Feedback is Noisy DID YOU HEAR WHAT I LIKE??!! ...and limits Our Prediction Accuracy
  • 28. The Magic Barrier ● Magic Barrier = Limit on prediction accuracy due to noise in original data ● Natural Noise = involuntary noise introduced by users when giving feedback ● Due to (a) mistakes, and (b) lack of resolution in personal rating scale ● Magic Barrier >= Natural Noise Threshold ● Our prediction error cannot be smaller than the error in the original data
  • 29. Our related research questions X. Amatriain, J.M. Pujol, N. Oliver (2009) "I like It... I like It Not: Measuring Users Ratings Noise in Recommender Systems", in UMAP 09 ● Q1. Are users inconsistent when providing explicit feedback to Recommender Systems via the common Rating procedure? ● Q2. How large is the prediction error due to these inconsistencies? ● Q3. What factors affect user inconsistencies?
  • 30. Experimental Setup ● 100 Movies selected from Netflix dataset doing a stratified random sampling on popularity ● Ratings on a 1 to 5 star scale ● Special “not seen” symbol. ● Trial 1 and 3 = random order; trial 2 = ordered by popularity
  • 31. User Feedback is Noisy ● Users are inconsistent ● Inconsistencies are not random and depend on many factors
  • 32. User Feedback is Noisy ● Users are inconsistent ● Inconsistencies are not random and depend on many factors ● More inconsistencies for mild opinions
  • 33. User Feedback is Noisy ● Users are inconsistent ● Inconsistencies are not random and depend on many factors ● More inconsistencies for mild opinions ● More inconsistencies for negative opinions
  • 34. User’s ratings are far from ground truth #Ti #Tj # RMSE     T1, T2 2185 1961 1838 2308 0.573 0.707 T1, T3 2185 1909 1774 2320 0.637 0.765 T2, T3 1969 1909 1730 2140 0.557 0.694 Pairwise comparison between trials, RMSE is already > 0.55 or > 0.69 (Netflix Prize was to get below 0.85 !!!)
  • 35. Algorithm Robustness to NN Trial 2 is  Alg./Trial consistently the  <T1 T2 T3 Tworst /Tbest least noisy User  1.2011 1.1469 1.1945 4.7% Average Item  1.0555 1.0361 1.0776 4% Average User­based  0.9990 0.9640 1.0171 5.5% kNN Item­based  1.0429 1.0031 1.0417 4% kNN SVD 1.0244 0.9861 1.0285 4.3%  RMSE for different Recommendation algorithms  ● when predicting each of the trials
  • 36. Rate it Again X. Amatriain et al. (2009)"Rate it Again: Increasing Recommendation Accuracy by User re-Rating", 2009 ACM RecSys ● Given that users are noisy… can we benefit from asking to rate the same movie more than once? ● We propose an algorithm to allow for multiple ratings of the same <user,item> tuple. ● The algorithm is subjected to two fairness conditions: – Algorithm should remove as few ratings as possible (i.e. only when there is some certainty that the rating is only adding noise) – Algorithm should not make up new ratings but decide on which of the existing ones are valid.
  • 37. Re-rating Algorithm • One source re­rating case: Examples: {3, 1} →Ø {4} →4 {3, 4} →3 (2 source) {3, 4, 5} →3 • Given the following milding function:   
  • 38. Results ● One-source re-rating (Denoised⊚Denoising) T1⊚T2 ΔT1 T1⊚T3 ΔT1 T2⊚T3 ΔT2 User­based kNN 0.8861 11.3% 0.8960 10.3% 0.8984 6.8% SVD 0.9121 11.0% 0.9274 9.5% 0.9159 7.1% ● Two-source re-rating (Denoising T1with the other 2) Datasets T1⊚(T2, T3) ΔT1 User­based kNN 0.8647 13.4% SVD 0.8800 14.1%
  • 39. Rate it again ● By asking users to rate items again we can remove noise in the dataset ● Improvements of up to 14% in accuracy! ● Because we don't want all users to re-rate all items we design ways to do partial denoising ● Data-dependent: only denoise extreme ratings ● User-dependent: detect “noisy” users
  • 40. Denoising only noisy users ●  Improvement in RMSE when doing one­source as a function of  the percentage of denoised ratings and users: selecting only noisy  users and extreme ratings
  • 41. The value or a re-rating Adding new ratings increases performance of the CF algorithm
  • 42. The value or a re-rating But you are better off doing re-rating than new ratings !!
  • 43. The value or a re-rating And much better if you know which ratings to re-rate!!
  • 44. Let's recap ● Users are inconsistent ● Inconsistencies can depend on many things including how the items are presented ● Inconsistencies produce natural noise ● Natural noise reduces our prediction accuracy independently of the algorithm ● By asking (some) users to re-rate (some) items again we can remove noise and improve accuracy ● Having users repeat existing ratings may have more value than adding new ones
  • 45. Crowds are not always wise ● Diversity of opinion Conditions that are  ● Independence needed to guarantee the  ● Decentralization Wisdom in a Crowd ● Aggregation
  • 46. Crowds are not always wise vs. Who  won?
  • 47. The Wisdom of the Few X. Amatriain et al. "The wisdom of the few: a collaborative filtering approach based on expert opinions from the web", SIGIR '09
  • 49. Expert-based CF ● expert = individual that we can trust to have produced thoughtful, consistent and reliable evaluations (ratings) of items in a given domain ● Expert-based Collaborative Filtering ● Find neighbors from a reduced set of experts instead of regular users. 1. Identify domain experts with reliable ratings 2. For each user, compute “expert neighbors” 3. Compute recommendations similar to standard kNN CF
  • 50. User Study ● 57 participants, only 14.5 ratings/participant ● 50% of the users consider Expert-based CF to be good or very good ● Expert-based CF: only algorithm with an average rating over 3 (on a 0-4 scale)
  • 51. Advantages of the Approach ● Noise ● Cold Start problem ● Experts introduce less ● Experts rate items as natural noise soon as they are ● Malicious Ratings available ● Dataset can be monitored ● Scalability to avoid shilling ● Dataset is several order of ● Data Sparsity magnitudes smaller ● Reduced set of domain ● Privacy experts can be motivated ● Recommendations can be to rate items computed locally
  • 52. So... ● Can we generate meaningful and personalized recommendations ensuring 100% privacy? ● YES! ● Can we have a recommendation algorithm that is so efficient to run on a phone? ● YES! ● Can we have a recommender system that works even if there is only one user? ● YES!
  • 54. Some implementations ● A distributed Music Recommendation engine
  • 55. Some implementations (II) ● A geo-localized Mobile Movie Recommender iPhone App
  • 56. Geo-localized Expert Movie Recommendations 0 Powered by...
  • 57. Expert CF... ● Recreates the old paradigm of manually finding your favorite experts in magazines but in a fully automatic non-supervised manner.
  • 58. What if we don't have ratings? The fascinating world of implicit user feedback Examples of implicit feedback: ● Movies you watched ● Links you visited ● Songs you listened to ● Items you bought ● ....
  • 59. Main features of implicit feedback ● Our starting hypothesis are different from those in previous works: 1.Implicit feedback can contain negative feedback – given the right granularity and diversity, low feedback = negative feedback 2.Numerical value of implicit feedback can be mapped to preference given the appropriate mapping 3.Once we have a trustworthy mapping, we can evaluate implicit feedback predictions same way as with explicit feedback.
  • 60. Our questions ● Q1. Is it possible to predict ratings a user would give to items given their implicit feedback? ● Q2. Are there other variables that affect this mapping?
  • 61. Experimental setup ● Online user study on the music domain ● Users required to have a music profile in lastfm ● Goal: Compare explicit ratings with their listening history taking to account a number of controlled variables
  • 62. Results. Do explicit ratings relate to implicit feedback? Almost perfect linear  relation between ratings  and quantized implicit  feedback
  • 63. Results. Do explicit ratings relate to implicit feedback? Extreme ratings have clear  ascending/descending  trend, but mild ratings  respond more to changes  in one direction
  • 64. Results. Do other variables affect? Albums listened to more  recently tend to receive  more positive ratings
  • 65. Results. Do other variables affect? Contrary to our expectations,  global album popularity does  not affect ratings
  • 66. Results. What about user variables? ● We obtained many demographic (age, sex, location...) and usage variables (hours of music per week, concerts, music magazines, ways of buying music...) in the study. ● We performed an ANOVA analysis on the data to understand which variables explained some of its variance. ● Only one of the usage variables, contributed (Sig. Value < 0.05) → “Listening Style” encoded whether the user listened preferably to tracks, full albums, or both.
  • 67. Results. Regression Analysis – Model 1: riu = β0 + β1 · ifiu – Model 2: riu = β0 + β1 · ifiu + β2 · reiu – Model 3: riu = β0 + β1 · ifiu + β2 · reiu + β3 · gpi – Model 4: riu = β0 + β1 · ifiu + β2 · reiu + β3 · ifiu · reiu Model R2 F-value p-value β0 β1 β2 β3 1 0.125 F (1, 10120) = 1146 < 2.2 · 10−16 2.726 0.499 2 0.1358 F (2, 10019) = 794.8 < 2.2 · 10−16 2.491 0.484 0.133 3 0.1362 F (3, 10018) = 531.8 < 2.2 · 10−16 2.435 0.486 0.134 0.0285 4 0.1368 F (3, 10018) = 534.7 < 2.2 · 10−16 2.677 0.379 0.038 0.053 All models meaningfully explain the data. Introducing “recentness”  improves 10% but “global popularity” or interaction between variables do  not make much difference
  • 68. Results. Predictive power Model RMSE – Excluding non-rated items User Average 1.131 1 1.026 2 1.017 3 1.016 4 1.016 Error in predicting 20% of the ratings, having trained our  regression model on the other 80%
  • 69. Conclusions ● Recommender systems and similar applications usually focus on having more data ● But... many times is not about having more but rather better data ● User feedback can not always be treated as ground truth and needs to be processed ● Crowds are not always wise and sometimes we are better off using experts ● Implicit feedback represents a good alternative to understand users but mapping is not trivial
  • 70. Colleagues ● Josep M. Pujol and Nuria Oliver (Telefonica) worked on Natural Noise and Wisdom of the Few projects ● Nava Tintarev (Telefonica) worked on Natural Noise External Collaborators ● Neal Lathia (UCL, London), Haewook Ahn (KAIST, Korea), Jaewook Ahn (Pittsburgh Univ.), and Josep Bachs (UPF, Barcelona) on Wisdom of the Few ● Denis Parra (Pittsburgh Univ.) worked on implicit-explicit
  • 71. Thanks! Questions? Xavier Amatriain xar@tid.es http://xavier.amatriain.net http://technocalifornia.blogspot.com @xamat