SlideShare a Scribd company logo
1 of 1
Download to read offline
Intent-Oriented Diversity in Recommender Systems
                                                                                                                                                                                              Saúl Vargas, Pablo Castells and David Vallet
                                                                                                                                                                                                    Universidad Autónoma de Madrid
                                                                                                                                                                                             {saul.vargas,pablo.castells,david.vallet}@uam.es
                                                                                                                                                                                                                                                                                                                     IRG IR Group @ UAM

                         Motivation
                                                       Diversity in Information Retrieval                                                                                                    Recommendation accuracy vs. diversity
                  Addressed as an issue of uncertainty in queries: ambiguity, underspecifi-                                                                         Research in the Recommender Systems field has focused on accuracy
                   cation (Zhai 2003, Chen 2006, Agrawal 2008, Clarke 2009, Santos 2010)                                                                              in matching user tastes
                  Maximize the probability of returning at least some relevant doc                                                                                  Yet diversity is as important or more in recommendation than in search
                  Revision of document relevance independence assumption                                                                                                –         Accuracy alone is not enough to achieve useful recommendations in real scenarios
                                                                                                                                                                         –         Novelty and diversity are key dimensions of utility, along with accuracy
                  Formulated/solved in terms of query aspects / subtopics / subqueries /
                                                                                                                                                                         –         Raising awareness in the field –diversity has become an important emerging research
                   nuggets / intents / categories…                                                                                                                                 topic in Recommender Systems

                  Diversification algorithms: MMR, IA-Select, xQuAD, risk/return, etc.                                                                              The rationale for search diversity also applies in recommendation
                  Metrics: -nDCG, nDCG-IA, ERR-IA, NRBP, subtopic recall, etc.                                                                                         –         User needs are conveyed implicitly  they involve even more uncertainty than queries
                                                                                                                                                                         –         A diverse recommendation increases the chances that at least some item is liked by
                  Diversity is also studied in many other fields: biology, ecology, genetics,                                                                                     avoiding a too narrow array of choice
                   demographics, telecommunications, finance… recommender systems                                                                                        –         The utility of recommended items is not mutually independent                                                                Research on diversity in Recommender Systems
                                                                                                                                                                     Further reasons to diversify                                                                                              Diversification approaches
                                                              The recommendation task                                                                                    –         Diverse recommendations enrich the user experience                                                              –    Optimization of diversity/accuracy tradeoff
                      Given:                                                                                                                                            –         The chances that the user discovers new interests is increased                                                  –    Objective functions, greedy optimization, promotion of long-tail items in the ranking, etc.
                         –      A set of users , a set of items                                                                                                        –         The business is enhanced by increasing the diversity of sales
                                                                                                                                                                                                                                                                                                Metrics
                         –      A history of observed interaction (evidence of user preference for items)                                                            Diversity and accuracy understood as separate, opposing goals                                                                –    Average intra-list diversity (Ziegler 2005, Zhang 2008)                        IL D 
                                                                                                                                                                                                                                                                                                                                                                                                     2
                                                                                                                                                                                                                                                                                                                                                                                                                             d  i j , ik   
                                     Users accessing items:         = set of timestamps (e.g. Last.fm, Amazon)                                                                                                                                                                                                                                                                          R   R    1   i j ,ik  R
                                                                                                                                                                                                                                                                                                                                                                                                                   j k
                                                                                                                                                                                                                  Items                                                                           –    Aggregate diversity (Adomavicius 2011)                    AD             Ru
                                     Or users rating items: r :                        = set of rating values (e.g. Netflix, Amazon)                                                                                                                                                                                                                                   u


                         –      No query                                                                                                                                                                                                                                                           –    Novelty (not the focus of our present work –which is on diversity)
                                                                                                                                                                                                                                                                                                   –    Diversity metric framework, rank and relevance sensitiveness (Vargas 2011)
                      Predict:                                                                                                                                                                     1    3         2         4                       3
                         –      A personalized ranking of items that each user may like                                                                                                  1     2    5         4         1            2        4             5
                                                                                                                                                                                                                                                                                                Some gaps
                                      For each u    R = <i1, i2, …, in> ik                                                                                                                                                                                                                   –    Problem statement and formalization not quite the same as in search diversity
                                                                                                                                                                                         4          ?    3    5              5                       2
                                                                                                                                                                       Users 




                                                                                                                                                                                                                                                                                                   –    Not the same level of methodological consensus/convergence as in search diversity
                         –      Equivalently, define a retrieval function to rank items                                                                                                        2              5    4    4            5                      4
                                                                                                                                                                                                                                                                                                   –    Metrics are rank-insensitive (except Vargas 2011)
                                      f :      (often stated as rating prediction)
                                                                                                                                                                                               3    4         5              4       3        5             4                                      –    Diversity addressed independently from relevance: complementary accuracy metrics
                      Common methods: content-based, CF, kNN, matrix factorization…                                                                                                     3          2         1    5         3                       5                                             –    Room for further studies on metric properties in general
                         –      Focused on accuracy, driven by similarity                                                                                                                3               2              3            5               1

                      Evaluation: split user preference data into training and test                                                                                                   User-item preference data in the recommendation task                                             Since recommendation can be stated as an IR task, is it possible to find
                                                                                                                                                                                                                                                                                         convergence of diversity theories, methods, and metrics from both fields?


                         From search diversity to recommendation diversity
                                                          Mapping search diversity to recommendation diversity                                                                                                                                                                      Adaptation of search diversity techniques to recommender systems
                  Search diversity is based on query uncertainty, intent… –but no query in the recommendation task!                                                                                                                                                                               Diversification algorithms
                  Notion of user profile aspect as an analogous to query aspect                                                                                                                                             Reranking baseline recommendation by greedy maximization of objective function
                        –      A natural idea: the interests of a user may have many different sides and subareas: professional, politics, movies, travel, etc.
                                                                                                                                                                                                                             IA-Select (Agrawal 2008), objective function:                                                                 u         Target user for recommendation
                        –      Different user preference aspects can be relevant or totally irrelevant at different times
                                                                                                                                                                                                                                                                                                                                            R         Baseline recommendation
                                uncertainty about what user interest area should play in a given context
                                                                                                                                                                                                                                              p  f u  rn o rm  u , i  p  f i  
                                                                                                                                                                                                                                                         ˆ                                       1  p  f j  rˆn o rm
                                                                                                                                                                                                                                                                                                                           u , j         S         Reranked recommendation
                                                                                                                                                                                                                                         f                                              j S                                                        The feature space (explicit or implicit)
                Search task                                                                                    Recommendation task
                                                                                                                                                                                                                                                                                                                                            ˆ
                                                                                                                                                                                                                                                                                                                                            rn o rm   The rating prediction function (i.e. retrieval function)
                Query (representation of information need)                                                     User profile (evidence of broad, implicit user need)                                                                                             i    u f  i                                 i  f                               of the recommender, normalized to [0,1]
                                                                                                                                                                                                                                         p f u                                                  p f i
                Document                                                                                       Item (movie, book, music track)                                                                                                                                                                                              u         The set of items accessed by user u (his user profile)
                                                                                                                                                                                                                                                            i        u g  i                                     i
                                                                                                                                                                                                                                                                                                                                            i         The set of features of item i
                                                                                                                                                                                                                                                           g
                Document content (words)                                                                       Item features (movie genre / director / cast, track artist, etc.)
                                                                                                                                                                                                                                                                                5                                                                       1 if f  i
                                                                                                                                                                                                                                                                                                                                                       
                Relevance judgment                                                                             User preference data (rating, purchase / access records)                                                                  E x a m p le : p  C o m e d y u                                                                  i  f   
                                                                                                                                                                                                                                                                               20
                                                                                                                                                                                                                                                                                                                                                        0 o th e r w is e
                                                                                                                                                                                                                                                                                                                                                       
                Subtopic, query aspect, intent, category                                                       User profile aspect                                                                                           Maximum Marginal Relevance (MMR, Carbonell 1998),
                                                                                                                                                                                                                              objective function:
                                                                    Problem: how to extract user profile aspects?                                                                                                                            ˆn o r m  u , i    a v g  1  s i m  i , j  
                                                                                                                                                                                                                                     1    r                                                                                             sim An item similarity function (cosine on feature vectors
                                                                                                                                                                                                                                                                                j S                                                            in our experiments)

                       Explicit approach: available item features                                                                        Implicit approach: matrix factorization                                             Two scenarios are considered:
                                                                                                                                                                                       Items                                     –         Feature data is explicit and known to the diversification method                            the above algorithms use the explicit features as binary vectors
                                                  Explicit user profile aspects
                                                                                                                                                                                                                                 –         No feature knowledge, only user-item preference data is available                           latent feature vectors are extracted by rating matrix factorization
                               0.2                     0.25                  0.35                     0.15                                                                                                                                                                                                                              (binarized for probability estimations in IA-Select)
                                                                                                                                  Latent      f1
                                                                     Tous les matins du monde




                                                                                                                                                    f2                                                                                                                                                    Diversity metrics
                                               Broadway Danny Rose




                                                                                                  2001: A space odyssey




                                                                                                                                 features
                                                                                                                                                         f3
                         Lawrence of Arabia




                                                                                                                                                                                                                             -nDCG (Clarke 2008)
                                               American beauty




                                                                     Ordinary People
                         Seven Samurai




                                                                                                                                                                                                                                 –         User aspects from explicit features play the role of query “nuggets”                             Always use the explicit features
                                               Delicatessen




                                                                                                                                                                                   Implicit user
                         Dersu Uzala




                                                                     The 7th seal
                                                                     Taxi Driver
                                               Caro diario




                                                                                                  The Matrix
                                                                     Rashomon
                         Spartacus




                                                                                                                                                                                  profile aspects                                                                                                                                          in the metrics
                                                                     Interiors




          u                                                                                                                                                                                                                   Intent-aware metrics (Agrawal 2008)
                                                                     Ghandi




                                                                                                  Avatar
                                               Elling




                                                                                                                                 Users




                                                                                                                                                                                                                                 –         User aspects from explicit features play the role of “categories”, for instance:
                                                                                                                                                                                      f1 f2 f3
                         Adventure                Comedy                    Drama                  Sci-Fi                                                                                                                                     n D C G -I A                  p  f u  n D C G u f           where nDCG(u|f) counts as relevant the items that u likes and have the feature f
                                                                                                                                                                                                                                                                      f
                                                                                                                                                                                 Feature space 

                                                         Feature space                                                                                                                                                                                                                                                         Conclusions and advantages
                                                                                                                                     Preliminary experiments
                                                   = 0.5                                                                                                                                                                                                                                      Adaptation of search diversity techniques to recommender systems
                                               -nDCG@50               ERR-IA@50                nDCG-IA@50                        ILD@50                         Dataset: Movielens 100K ratings, ~1K users, ~1.6K items                                                                      New rationale for diversity in recommender systems (theory and models)
                                              kNN             MF       kNN          MF          kNN                       MF   kNN          MF                   Recommender baselines: user-based kNN, matrix factorization (MF)                                                             New diversity metrics for recommender systems: -nDCG, IA metrics
                                                                                                                                                                 Diversification algorithms: rerank top 500 recommended items
              IA-Sel




                        Explicit              0.1589     0.1838      0.0409      0.0516         0.0604              0.0755     0.8659      0.8734
                                                                                                                                                                                                                                                                                               Introduction of rank sensitiveness in diversity metrics
                        Latent                0.1596     0.1597      0.0465      0.0458         0.0618              0.0637     0.7951      0.7817                Explicit features: movie genres       Relevant items  rating > 3                                                           Introduction of relevance and diversity in single metrics
                                                                                                                                                              
 = 0.5




                        Explicit              0.1334     0.1652      0.0367      0.0431         0.0461              0.0555     0.8601      0.8761
              MMR




                                                                                                                                                                  Metrics based on explicit features (ILD taking Jaccard distance)                                                             Moving towards shared consensus and common evaluation methodologies
                        Latent                0.1320     0.1742      0.0373      0.0528         0.0492              0.0705     0.7906      0.7740                5-fold 80% – 20% training-test splits provided in the dataset                                                                Further diversification algorithms (e.g. xQuAD, Santos 2010), further uni-
              Baseline RS                     0.1213     0.1451      0.0352      0.0425         0.0440              0.0561     0.7787      0.7655                Consistent behavior of diversification, improving baselines                                                                   fication of recommendation novelty and diversity metrics (Vargas 2011)
              All differences to baseline are statistically significant (Wilcoxon p < 0.005), except gray cells                                                  Diversification on latent profile aspects competitive w.r.t. explicit!                                                       Future direction: user profile aspect extraction is a rich research problem


                                                                                                                                                                                                             References
                               (Adomavicius 2011)                       G. Adomavicius and Y. Kwon. Improving Aggregate Recommendation Diversity Using Ranking-Based Techniques. IEEE Transactions on Knowledge and Data Engineering, In press.
                               (Agrawal 2008)                           R. Agrawal, S. Gollapudi, A. Halverson, and S. Ieong. Diversifying search results. WSDM 2009, Barcelona, Spain, 2009, pp. 5-14.
                               (Carbonell 1998)                         J. G. Carbonell and J. Goldstein. The Use of MMR, Diversity-Based Reranking for Reordering Documents and Producing Summaries. SIGIR 1998, Melbourne, Australia, 1998, pp. 335-336.
                               (Chen 2006)                              H. Chen and D. R. Karger. Less is More. SIGIR 2006, Seattle, WA, USA, 2006, pp. 429-436.
                               (Clarke 2008)                            C. L. A. Clarke et al. Novelty and diversity in information retrieval evaluation. SIGIR 2008, Singapore, 2008, pp. 659-666.
                               (Santos 2010)                            R. L. T. Santos, C. Macdonald, and I. Ounis. Exploiting Query Reformulation for Web Search Result Diversification. WWW 2010, Raleigh, NC, USA, April 2010, pp. 881-890.
                               (Vargas 2011)                            S. Vargas and P. Castells. Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems. RecSys 2011, Chicago, IL, USA, October 2011.
                               (Wang 2009)                              J. Wang and J. Zhu. Portfolio theory of information retrieval. SIGIR 2009, Boston, MA, USA, pp. 115-122.
                               (Zhai 2003)                              C. Zhai, W. W. Cohen, and J. Lafferty. Beyond independent relevance: Methods and evaluation metrics for subtopic retrieval. SIGIR 2003,Toronto, Canada, 2003, pp. 10-17.
                               (Zhang 2008)
                               (Ziegler 2005)
                                                                        M. Zhang and N. Hurley. Avoiding Monotony: Improving the Diversity of Recommendation Lists. RecSys 2008, Lausanne, Switzerland, October 2008, pp. 123-130.
                                                                        C-N. Ziegler, S. M. McNee, J. A. Konstan, and G. Lausen. Improving recommendation lists through topic diversification. WWW 2005, Chiba, Japan, May 2005, pp. 22-32.                                                                                                                                                                                                      1
                                                                                                                               34th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2011)

More Related Content

Similar to SIGIR 2011 Poster - Intent-Oriented Diversity in Recommender Systems

MOVIE RECOMMENDATION SYSTEM
MOVIE RECOMMENDATION SYSTEMMOVIE RECOMMENDATION SYSTEM
MOVIE RECOMMENDATION SYSTEMIRJET Journal
 
Anthony J brookes
Anthony J brookesAnthony J brookes
Anthony J brookesEduserv
 
Chi1417 Richardson
Chi1417 RichardsonChi1417 Richardson
Chi1417 Richardsonmerichar
 
Using data for impact_Pelletier_5.1.12
Using data for impact_Pelletier_5.1.12Using data for impact_Pelletier_5.1.12
Using data for impact_Pelletier_5.1.12CORE Group
 
The Mezzanine Group Improving Decisions With The Power Of Interview Research ...
The Mezzanine Group Improving Decisions With The Power Of Interview Research ...The Mezzanine Group Improving Decisions With The Power Of Interview Research ...
The Mezzanine Group Improving Decisions With The Power Of Interview Research ...Milena (Nazaruk) Doroszuk
 
The Mezzanine Group Improving Decisions With The Power Of Interview Research ...
The Mezzanine Group Improving Decisions With The Power Of Interview Research ...The Mezzanine Group Improving Decisions With The Power Of Interview Research ...
The Mezzanine Group Improving Decisions With The Power Of Interview Research ...meredithlow
 
McNeill 2007 Fractal Frameworks
McNeill 2007 Fractal FrameworksMcNeill 2007 Fractal Frameworks
McNeill 2007 Fractal FrameworksJeff Mcneill
 
FAIR and metadata standards - FAIRsharing and Neuroscience
FAIR and metadata standards - FAIRsharing and NeuroscienceFAIR and metadata standards - FAIRsharing and Neuroscience
FAIR and metadata standards - FAIRsharing and NeuroscienceSusanna-Assunta Sansone
 
CDE personalised learning
CDE personalised learningCDE personalised learning
CDE personalised learningJames Ballard
 
e-Healthcare Infrastructural Research
e-Healthcare Infrastructural Researche-Healthcare Infrastructural Research
e-Healthcare Infrastructural ResearchDavid Paschane, Ph.D.
 
Advancing Transformational Science Doebbeling Cpmrc 1.22.11
Advancing Transformational  Science Doebbeling Cpmrc 1.22.11Advancing Transformational  Science Doebbeling Cpmrc 1.22.11
Advancing Transformational Science Doebbeling Cpmrc 1.22.11Brad Doebbeling
 
CHAPTER 8 - TOPIC 5- REPORTER 11- BANSIL.pptx
CHAPTER 8 - TOPIC 5- REPORTER 11- BANSIL.pptxCHAPTER 8 - TOPIC 5- REPORTER 11- BANSIL.pptx
CHAPTER 8 - TOPIC 5- REPORTER 11- BANSIL.pptxxyrieltolentino
 
Research methods - qual or quant
Research methods - qual or quantResearch methods - qual or quant
Research methods - qual or quantTracy Harwood
 
Scan Community Narrative To Understand Your Consumer
Scan Community Narrative To Understand Your ConsumerScan Community Narrative To Understand Your Consumer
Scan Community Narrative To Understand Your Consumerjheylmun
 
Empowering user participation with converged semantic services
Empowering user participation with converged semantic servicesEmpowering user participation with converged semantic services
Empowering user participation with converged semantic servicesAnna Fensel
 

Similar to SIGIR 2011 Poster - Intent-Oriented Diversity in Recommender Systems (20)

MOVIE RECOMMENDATION SYSTEM
MOVIE RECOMMENDATION SYSTEMMOVIE RECOMMENDATION SYSTEM
MOVIE RECOMMENDATION SYSTEM
 
Anthony J brookes
Anthony J brookesAnthony J brookes
Anthony J brookes
 
Chi1417 Richardson
Chi1417 RichardsonChi1417 Richardson
Chi1417 Richardson
 
Bv31491493
Bv31491493Bv31491493
Bv31491493
 
Exploring session search
Exploring session searchExploring session search
Exploring session search
 
Using data for impact_Pelletier_5.1.12
Using data for impact_Pelletier_5.1.12Using data for impact_Pelletier_5.1.12
Using data for impact_Pelletier_5.1.12
 
The Mezzanine Group Improving Decisions With The Power Of Interview Research ...
The Mezzanine Group Improving Decisions With The Power Of Interview Research ...The Mezzanine Group Improving Decisions With The Power Of Interview Research ...
The Mezzanine Group Improving Decisions With The Power Of Interview Research ...
 
The Mezzanine Group Improving Decisions With The Power Of Interview Research ...
The Mezzanine Group Improving Decisions With The Power Of Interview Research ...The Mezzanine Group Improving Decisions With The Power Of Interview Research ...
The Mezzanine Group Improving Decisions With The Power Of Interview Research ...
 
McNeill 2007 Fractal Frameworks
McNeill 2007 Fractal FrameworksMcNeill 2007 Fractal Frameworks
McNeill 2007 Fractal Frameworks
 
UX Research
UX ResearchUX Research
UX Research
 
FAIR and metadata standards - FAIRsharing and Neuroscience
FAIR and metadata standards - FAIRsharing and NeuroscienceFAIR and metadata standards - FAIRsharing and Neuroscience
FAIR and metadata standards - FAIRsharing and Neuroscience
 
CDE personalised learning
CDE personalised learningCDE personalised learning
CDE personalised learning
 
e-Healthcare Infrastructural Research
e-Healthcare Infrastructural Researche-Healthcare Infrastructural Research
e-Healthcare Infrastructural Research
 
The EDGE Advantage Diversity Training Series
The EDGE Advantage Diversity Training SeriesThe EDGE Advantage Diversity Training Series
The EDGE Advantage Diversity Training Series
 
Advancing Transformational Science Doebbeling Cpmrc 1.22.11
Advancing Transformational  Science Doebbeling Cpmrc 1.22.11Advancing Transformational  Science Doebbeling Cpmrc 1.22.11
Advancing Transformational Science Doebbeling Cpmrc 1.22.11
 
CHAPTER 8 - TOPIC 5- REPORTER 11- BANSIL.pptx
CHAPTER 8 - TOPIC 5- REPORTER 11- BANSIL.pptxCHAPTER 8 - TOPIC 5- REPORTER 11- BANSIL.pptx
CHAPTER 8 - TOPIC 5- REPORTER 11- BANSIL.pptx
 
A Framework for Health IT Evaluation
A Framework for Health IT EvaluationA Framework for Health IT Evaluation
A Framework for Health IT Evaluation
 
Research methods - qual or quant
Research methods - qual or quantResearch methods - qual or quant
Research methods - qual or quant
 
Scan Community Narrative To Understand Your Consumer
Scan Community Narrative To Understand Your ConsumerScan Community Narrative To Understand Your Consumer
Scan Community Narrative To Understand Your Consumer
 
Empowering user participation with converged semantic services
Empowering user participation with converged semantic servicesEmpowering user participation with converged semantic services
Empowering user participation with converged semantic services
 

More from Pablo Castells

Rational and irrational bias in recommendation
Rational and irrational bias in recommendationRational and irrational bias in recommendation
Rational and irrational bias in recommendationPablo Castells
 
Bias in recommendation: avoid it or embrace it?
Bias in recommendation: avoid it or embrace it?Bias in recommendation: avoid it or embrace it?
Bias in recommendation: avoid it or embrace it?Pablo Castells
 
RecSys 2020 - On Target Item Sampling in Offline Recommender System Evaluation
RecSys 2020 - On Target Item Sampling in Offline Recommender System EvaluationRecSys 2020 - On Target Item Sampling in Offline Recommender System Evaluation
RecSys 2020 - On Target Item Sampling in Offline Recommender System EvaluationPablo Castells
 
REVEAL @ RecSys 2018 - Characterization of Fair Experiments for Recommender S...
REVEAL @ RecSys 2018 - Characterization of Fair Experiments for Recommender S...REVEAL @ RecSys 2018 - Characterization of Fair Experiments for Recommender S...
REVEAL @ RecSys 2018 - Characterization of Fair Experiments for Recommender S...Pablo Castells
 
SIGIR 2018 - Should I Follow the Crowd? A Probabilistic Analysis of the Effec...
SIGIR 2018 - Should I Follow the Crowd? A Probabilistic Analysis of the Effec...SIGIR 2018 - Should I Follow the Crowd? A Probabilistic Analysis of the Effec...
SIGIR 2018 - Should I Follow the Crowd? A Probabilistic Analysis of the Effec...Pablo Castells
 
SIGIR 2017 - A Probabilistic Reformulation of Memory-Based Collaborative Filt...
SIGIR 2017 - A Probabilistic Reformulation of Memory-Based Collaborative Filt...SIGIR 2017 - A Probabilistic Reformulation of Memory-Based Collaborative Filt...
SIGIR 2017 - A Probabilistic Reformulation of Memory-Based Collaborative Filt...Pablo Castells
 
RSWeb @ ACM RecSys 2014 - Exploring social network effects on popularity bias...
RSWeb @ ACM RecSys 2014 - Exploring social network effects on popularity bias...RSWeb @ ACM RecSys 2014 - Exploring social network effects on popularity bias...
RSWeb @ ACM RecSys 2014 - Exploring social network effects on popularity bias...Pablo Castells
 
SIGIR 2012 - Explicit Relevance Models in Intent-Oriented Information Retrie...
SIGIR 2012 - Explicit Relevance Models in Intent-Oriented  Information Retrie...SIGIR 2012 - Explicit Relevance Models in Intent-Oriented  Information Retrie...
SIGIR 2012 - Explicit Relevance Models in Intent-Oriented Information Retrie...Pablo Castells
 
ACM RecSys 2011 - Rank and Relevance in Novelty and Diversity Metrics for Rec...
ACM RecSys 2011 - Rank and Relevance in Novelty and Diversity Metrics for Rec...ACM RecSys 2011 - Rank and Relevance in Novelty and Diversity Metrics for Rec...
ACM RecSys 2011 - Rank and Relevance in Novelty and Diversity Metrics for Rec...Pablo Castells
 

More from Pablo Castells (9)

Rational and irrational bias in recommendation
Rational and irrational bias in recommendationRational and irrational bias in recommendation
Rational and irrational bias in recommendation
 
Bias in recommendation: avoid it or embrace it?
Bias in recommendation: avoid it or embrace it?Bias in recommendation: avoid it or embrace it?
Bias in recommendation: avoid it or embrace it?
 
RecSys 2020 - On Target Item Sampling in Offline Recommender System Evaluation
RecSys 2020 - On Target Item Sampling in Offline Recommender System EvaluationRecSys 2020 - On Target Item Sampling in Offline Recommender System Evaluation
RecSys 2020 - On Target Item Sampling in Offline Recommender System Evaluation
 
REVEAL @ RecSys 2018 - Characterization of Fair Experiments for Recommender S...
REVEAL @ RecSys 2018 - Characterization of Fair Experiments for Recommender S...REVEAL @ RecSys 2018 - Characterization of Fair Experiments for Recommender S...
REVEAL @ RecSys 2018 - Characterization of Fair Experiments for Recommender S...
 
SIGIR 2018 - Should I Follow the Crowd? A Probabilistic Analysis of the Effec...
SIGIR 2018 - Should I Follow the Crowd? A Probabilistic Analysis of the Effec...SIGIR 2018 - Should I Follow the Crowd? A Probabilistic Analysis of the Effec...
SIGIR 2018 - Should I Follow the Crowd? A Probabilistic Analysis of the Effec...
 
SIGIR 2017 - A Probabilistic Reformulation of Memory-Based Collaborative Filt...
SIGIR 2017 - A Probabilistic Reformulation of Memory-Based Collaborative Filt...SIGIR 2017 - A Probabilistic Reformulation of Memory-Based Collaborative Filt...
SIGIR 2017 - A Probabilistic Reformulation of Memory-Based Collaborative Filt...
 
RSWeb @ ACM RecSys 2014 - Exploring social network effects on popularity bias...
RSWeb @ ACM RecSys 2014 - Exploring social network effects on popularity bias...RSWeb @ ACM RecSys 2014 - Exploring social network effects on popularity bias...
RSWeb @ ACM RecSys 2014 - Exploring social network effects on popularity bias...
 
SIGIR 2012 - Explicit Relevance Models in Intent-Oriented Information Retrie...
SIGIR 2012 - Explicit Relevance Models in Intent-Oriented  Information Retrie...SIGIR 2012 - Explicit Relevance Models in Intent-Oriented  Information Retrie...
SIGIR 2012 - Explicit Relevance Models in Intent-Oriented Information Retrie...
 
ACM RecSys 2011 - Rank and Relevance in Novelty and Diversity Metrics for Rec...
ACM RecSys 2011 - Rank and Relevance in Novelty and Diversity Metrics for Rec...ACM RecSys 2011 - Rank and Relevance in Novelty and Diversity Metrics for Rec...
ACM RecSys 2011 - Rank and Relevance in Novelty and Diversity Metrics for Rec...
 

Recently uploaded

Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 

Recently uploaded (20)

Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 

SIGIR 2011 Poster - Intent-Oriented Diversity in Recommender Systems

  • 1. Intent-Oriented Diversity in Recommender Systems Saúl Vargas, Pablo Castells and David Vallet Universidad Autónoma de Madrid {saul.vargas,pablo.castells,david.vallet}@uam.es IRG IR Group @ UAM Motivation Diversity in Information Retrieval Recommendation accuracy vs. diversity  Addressed as an issue of uncertainty in queries: ambiguity, underspecifi-  Research in the Recommender Systems field has focused on accuracy cation (Zhai 2003, Chen 2006, Agrawal 2008, Clarke 2009, Santos 2010) in matching user tastes  Maximize the probability of returning at least some relevant doc  Yet diversity is as important or more in recommendation than in search  Revision of document relevance independence assumption – Accuracy alone is not enough to achieve useful recommendations in real scenarios – Novelty and diversity are key dimensions of utility, along with accuracy  Formulated/solved in terms of query aspects / subtopics / subqueries / – Raising awareness in the field –diversity has become an important emerging research nuggets / intents / categories… topic in Recommender Systems  Diversification algorithms: MMR, IA-Select, xQuAD, risk/return, etc.  The rationale for search diversity also applies in recommendation  Metrics: -nDCG, nDCG-IA, ERR-IA, NRBP, subtopic recall, etc. – User needs are conveyed implicitly  they involve even more uncertainty than queries – A diverse recommendation increases the chances that at least some item is liked by  Diversity is also studied in many other fields: biology, ecology, genetics, avoiding a too narrow array of choice demographics, telecommunications, finance… recommender systems – The utility of recommended items is not mutually independent Research on diversity in Recommender Systems  Further reasons to diversify  Diversification approaches The recommendation task – Diverse recommendations enrich the user experience – Optimization of diversity/accuracy tradeoff  Given: – The chances that the user discovers new interests is increased – Objective functions, greedy optimization, promotion of long-tail items in the ranking, etc. – A set of users , a set of items  – The business is enhanced by increasing the diversity of sales  Metrics – A history of observed interaction (evidence of user preference for items)  Diversity and accuracy understood as separate, opposing goals – Average intra-list diversity (Ziegler 2005, Zhang 2008) IL D  2  d  i j , ik  Users accessing items:         = set of timestamps (e.g. Last.fm, Amazon) R R  1 i j ,ik  R j k Items  – Aggregate diversity (Adomavicius 2011) AD  Ru Or users rating items: r :       = set of rating values (e.g. Netflix, Amazon) u – No query – Novelty (not the focus of our present work –which is on diversity) – Diversity metric framework, rank and relevance sensitiveness (Vargas 2011)  Predict: 1 3 2 4 3 – A personalized ranking of items that each user may like 1 2 5 4 1 2 4 5  Some gaps For each u    R = <i1, i2, …, in> ik   – Problem statement and formalization not quite the same as in search diversity 4 ? 3 5 5 2 Users  – Not the same level of methodological consensus/convergence as in search diversity – Equivalently, define a retrieval function to rank items 2 5 4 4 5 4 – Metrics are rank-insensitive (except Vargas 2011) f :      (often stated as rating prediction) 3 4 5 4 3 5 4 – Diversity addressed independently from relevance: complementary accuracy metrics  Common methods: content-based, CF, kNN, matrix factorization… 3 2 1 5 3 5 – Room for further studies on metric properties in general – Focused on accuracy, driven by similarity 3 2 3 5 1  Evaluation: split user preference data into training and test User-item preference data in the recommendation task  Since recommendation can be stated as an IR task, is it possible to find convergence of diversity theories, methods, and metrics from both fields? From search diversity to recommendation diversity Mapping search diversity to recommendation diversity Adaptation of search diversity techniques to recommender systems  Search diversity is based on query uncertainty, intent… –but no query in the recommendation task! Diversification algorithms  Notion of user profile aspect as an analogous to query aspect  Reranking baseline recommendation by greedy maximization of objective function – A natural idea: the interests of a user may have many different sides and subareas: professional, politics, movies, travel, etc.  IA-Select (Agrawal 2008), objective function: u Target user for recommendation – Different user preference aspects can be relevant or totally irrelevant at different times R Baseline recommendation  uncertainty about what user interest area should play in a given context  p  f u  rn o rm  u , i  p  f i   ˆ  1  p  f j  rˆn o rm u , j   S Reranked recommendation f j S  The feature space (explicit or implicit) Search task Recommendation task ˆ rn o rm The rating prediction function (i.e. retrieval function) Query (representation of information need) User profile (evidence of broad, implicit user need) i  u f  i i  f  of the recommender, normalized to [0,1] p f u  p f i Document Item (movie, book, music track) u The set of items accessed by user u (his user profile)  i  u g  i i i The set of features of item i g Document content (words) Item features (movie genre / director / cast, track artist, etc.) 5  1 if f  i  Relevance judgment User preference data (rating, purchase / access records) E x a m p le : p  C o m e d y u  i  f    20  0 o th e r w is e  Subtopic, query aspect, intent, category User profile aspect  Maximum Marginal Relevance (MMR, Carbonell 1998), objective function:  Problem: how to extract user profile aspects? ˆn o r m  u , i    a v g  1  s i m  i , j   1    r sim An item similarity function (cosine on feature vectors j S in our experiments) Explicit approach: available item features Implicit approach: matrix factorization  Two scenarios are considered: Items – Feature data is explicit and known to the diversification method  the above algorithms use the explicit features as binary vectors Explicit user profile aspects – No feature knowledge, only user-item preference data is available  latent feature vectors are extracted by rating matrix factorization 0.2 0.25 0.35 0.15 (binarized for probability estimations in IA-Select) Latent f1 Tous les matins du monde f2 Diversity metrics Broadway Danny Rose 2001: A space odyssey features f3 Lawrence of Arabia  -nDCG (Clarke 2008) American beauty Ordinary People Seven Samurai – User aspects from explicit features play the role of query “nuggets” Always use the explicit features Delicatessen Implicit user Dersu Uzala The 7th seal Taxi Driver Caro diario The Matrix Rashomon Spartacus profile aspects  in the metrics Interiors u Intent-aware metrics (Agrawal 2008) Ghandi Avatar Elling Users – User aspects from explicit features play the role of “categories”, for instance: f1 f2 f3 Adventure Comedy Drama Sci-Fi n D C G -I A   p  f u  n D C G u f  where nDCG(u|f) counts as relevant the items that u likes and have the feature f f Feature space  Feature space  Conclusions and advantages Preliminary experiments  = 0.5  Adaptation of search diversity techniques to recommender systems -nDCG@50 ERR-IA@50 nDCG-IA@50 ILD@50  Dataset: Movielens 100K ratings, ~1K users, ~1.6K items  New rationale for diversity in recommender systems (theory and models) kNN MF kNN MF kNN MF kNN MF  Recommender baselines: user-based kNN, matrix factorization (MF)  New diversity metrics for recommender systems: -nDCG, IA metrics  Diversification algorithms: rerank top 500 recommended items IA-Sel Explicit 0.1589 0.1838 0.0409 0.0516 0.0604 0.0755 0.8659 0.8734  Introduction of rank sensitiveness in diversity metrics Latent 0.1596 0.1597 0.0465 0.0458 0.0618 0.0637 0.7951 0.7817  Explicit features: movie genres  Relevant items  rating > 3  Introduction of relevance and diversity in single metrics   = 0.5 Explicit 0.1334 0.1652 0.0367 0.0431 0.0461 0.0555 0.8601 0.8761 MMR Metrics based on explicit features (ILD taking Jaccard distance)  Moving towards shared consensus and common evaluation methodologies Latent 0.1320 0.1742 0.0373 0.0528 0.0492 0.0705 0.7906 0.7740  5-fold 80% – 20% training-test splits provided in the dataset  Further diversification algorithms (e.g. xQuAD, Santos 2010), further uni- Baseline RS 0.1213 0.1451 0.0352 0.0425 0.0440 0.0561 0.7787 0.7655  Consistent behavior of diversification, improving baselines fication of recommendation novelty and diversity metrics (Vargas 2011) All differences to baseline are statistically significant (Wilcoxon p < 0.005), except gray cells  Diversification on latent profile aspects competitive w.r.t. explicit!  Future direction: user profile aspect extraction is a rich research problem References (Adomavicius 2011) G. Adomavicius and Y. Kwon. Improving Aggregate Recommendation Diversity Using Ranking-Based Techniques. IEEE Transactions on Knowledge and Data Engineering, In press. (Agrawal 2008) R. Agrawal, S. Gollapudi, A. Halverson, and S. Ieong. Diversifying search results. WSDM 2009, Barcelona, Spain, 2009, pp. 5-14. (Carbonell 1998) J. G. Carbonell and J. Goldstein. The Use of MMR, Diversity-Based Reranking for Reordering Documents and Producing Summaries. SIGIR 1998, Melbourne, Australia, 1998, pp. 335-336. (Chen 2006) H. Chen and D. R. Karger. Less is More. SIGIR 2006, Seattle, WA, USA, 2006, pp. 429-436. (Clarke 2008) C. L. A. Clarke et al. Novelty and diversity in information retrieval evaluation. SIGIR 2008, Singapore, 2008, pp. 659-666. (Santos 2010) R. L. T. Santos, C. Macdonald, and I. Ounis. Exploiting Query Reformulation for Web Search Result Diversification. WWW 2010, Raleigh, NC, USA, April 2010, pp. 881-890. (Vargas 2011) S. Vargas and P. Castells. Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems. RecSys 2011, Chicago, IL, USA, October 2011. (Wang 2009) J. Wang and J. Zhu. Portfolio theory of information retrieval. SIGIR 2009, Boston, MA, USA, pp. 115-122. (Zhai 2003) C. Zhai, W. W. Cohen, and J. Lafferty. Beyond independent relevance: Methods and evaluation metrics for subtopic retrieval. SIGIR 2003,Toronto, Canada, 2003, pp. 10-17. (Zhang 2008) (Ziegler 2005) M. Zhang and N. Hurley. Avoiding Monotony: Improving the Diversity of Recommendation Lists. RecSys 2008, Lausanne, Switzerland, October 2008, pp. 123-130. C-N. Ziegler, S. M. McNee, J. A. Konstan, and G. Lausen. Improving recommendation lists through topic diversification. WWW 2005, Chiba, Japan, May 2005, pp. 22-32. 1 34th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2011)