SlideShare a Scribd company logo
1 of 28
Download to read offline
EPG content recommendation in
large scale: a case study on
interactive TV platform
D. Zibriczky, Z. Petres, M. Waszlavik, D. Tikk
ICMLA 2013 - Machine Learning with Multimedia Data
7th December 2013. Miami. United States
Outline
• Introduction
• Problem
• Solution
• Offline results
• Online results
• Conclusion
Enter date in master2 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Introduction / Consumption trends
3 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Introduction / Electronic Program Guide
4 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Introduction / Goal
• SaskTel
• Finding relevant contents with minimal effort
• Time-shifting
• Multiple devices per household
• Graphical User Interface
• Increasing content consumption / watching length
• Increasing click through rate (CTR) using Gravity’s GUI
Enter date in master5 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Problem / Recommendation concept
• User: Device
 Users cannot be distinguished explicitly
 More than one device per household
• Item: Scheduled contents (time, program id, channel id)
 Typically series or programs without episodes
 Metadata: Information about the items
• Event: Remote controller / set-top-box based implicit feedbacks
 Switching channel, set to record, rewind, replay, stop, pause
 Next schedule, watching duration
• Recommendable items
 Set of series or programs that are broadcasted at the moment of
recommendation request or later (on now, on later scenario)
• Recommendation
 Sorting recommendable items by prediction values
 Other recommendation logic (randomization, mixing, etc..)
Enter date in master6 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Problem / Difficulties
• Implicit feedbacks only (no explicit data)
• Huge but noisy data set (zapping, leave-on, irrelevant events, …)
• Cold start problem (new items, short lifetime)
• Small recommendable set at a time
• Context dependency (time, multiple users per household)
• Difference between offline and online optimization
Enter date in master7 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Solution / Baselines
• Most popular channels
• Most popular contents (series or programs)
• Users’ favourite channels
• Users’ favourite contents (series or programs)
Enter date in master8 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Solution / Content-based filtering
• Cosine Similarity
• User model: Weighted average of meta vectors
• Prediction: Cosine similarity of vectors
• Improvement: Term frequency based weighting (TFIDF)
Enter date in master9 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
M The Simpsons
How I Met
Your Mother
Futurama …
Genre = Animation 1 0 1 …
Genre = Comedy 1 1 1 …
… … … … …
Director = Matt Groening 1 0 1 …
Director = Carter Bays 0 1 0 …
Actor = Dan Castellaneta 1 0 0 …
Actor = Billy West 0 0 1 …
… … … … …
User 1
0.53
0.81
…
0.18
0.00
0.18
0.00
…
Solution / Collaborative Filtering
• Matrix Factorization
• User model: User factors
• Prediction: Dot product of latent factors
• Solver: Alternating Least Squares with Coordinate Descent (IALS1)
Enter date in master10 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
R The Simpsons
How I Met
Your Mother
Futurama …
User 1 1 …
User 2 1 1 u2*i3 …
User 3 1 …
… … … … …
Item factors
i11 i21 i31 …
i21 i22 i32 …
User factors
u11 u12
u21 u22
u31 u32
… …
Solution / Hybrid filtering
Enter date in master11 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
R* The Simpsons
How I Met
Your Mother
Futurama …
User 1 1 0 0 …
User 2 1 1 0 …
User 3 0 1 0 …
… … … … …
Genre = Animation 1 0 1 …
Genre = Comedy 1 1 1 …
… … … … …
Director = Matt Groening 1 0 1 …
Director = Carter Bays 0 1 0 …
Actor = Dan Castellaneta 1 0 0 …
Actor = Billy West 0 0 1 …
… … … … …
User factors
u11 u12
u21 u22
u31 u32
… …
pu11 pu12
pu22 pu22
… …
… …
… …
… …
… …
… …
Item factors
i11 i21 I31 …
i21 i22 I32 …
• Hybrid IALS1
TP
factors
Solution / Channel recommendation
• Tensor factorization (ITALS1)
• Prediction: Hadamard product of latent factors
• Improvement: Watching duration based weighting
Enter date in master12 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
R (4:00-12:00)
Channel
Sports 1
Channel
Sports 2
Channel
News 1
…
User 1 1 1 …
User 2 1 …
User 3 1 …
… … … … …
R (12:00-20:00)
Channel
Sports 1
Channel
Sports 2
Channel
News 1
…
User 1 1 …
User 2 1 1 …
User 3 1 …
… … … … …
R (20:00-4:00)
Channel
Sports 1
Channel
Sports 2
Channel
News 1
…
User 1 1 …
User 2 1 …
User 3 u3°i2°t3 …
… … … … …
User factors
u11 u12
u21 u22
u31 u32
… …
Item factors
i11 i21 i31 …
i21 i22 i32 …
t11
t12
t21
t22
t31
t32
Solution / Item grouping
Enter date in master13 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Solution / Preprocessing
Enter date in master14 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Original set
308M
82M
23M
Train set
22M
Test set
676K
Event type based filtering by significance
1
2
3 3
1
2
3
Filtering by leave-on and short duration
Splitting by time
Offline results / Measurement
Enter date in master15 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
• Metrics:
 Recall@N
 Mean Reciprocial Rank (MRR)
• Item splits:
 Having events on training set or not
o old items
o new items
 Popularity 20-80 split
o popular items
o tail items
 Episode of a series or not
o series
o non-series
• Recall@15
* Items are grouped by series ids or program ids
** Items are grouped by channel ids
*** Blend: Combination of CosineSim, IALS1, ITALS1, HybridIALS1 and favourite programs/series
Offline results / Comparison
Enter date in master16 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Algorithm Type All items
Old
items
New
items
Popular
items
Tail
items
Series
Non-
series
Most popular channels BL 17.45% 17.81% 10.59% 19.95% 3.90% 18.58% 2.91%
Most popular series BL 25.68% 27.06% 0.00% 30.35% 0.44% 27.69% 0.00%
Favourite channels BL 30.98% 31.61% 19.20% 32.68% 21.80% 32.52% 11.28%
Favourite series / programs BL 48.58% 51.13% 0.00% 53.55% 21.83% 52.34% 1.09%
CosineSim CBF 52.02% 52.92% 34.94% 53.58% 43.65% 53.69% 30.93%
IALS1* CF 46.75% 49.26% 0.00% 52.30% 16.78% 50.28% 1.65%
ITALS1** CF 41.68% 42.60% 24.48% 44.53% 26.26% 43.84% 14.06%
Hybrid IALS1* HF 51.08% 53.82% 6.78% 56.46% 22.01% 54.95% 1.63%
Blend*** 55.48% 56.98% 26.15% 57.64% 43.61% 57.91% 24.41%
Blend*** (MRR) 0.1038 0.1070 0.0405 0.1097 0.0712 0.1094 0.0322
• Recall@15
* Items are grouped by series ids or program ids
** Items are grouped by channel ids
*** Blend: Combination of CosineSim, IALS1, ITALS1, HybridIALS1 and favourite programs/series
Offline results / Comparison
Enter date in master17 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Algorithm Type All items
Old
items
New
items
Popular
items
Tail
items
Series
Non-
series
Most popular channels BL 17.45% 17.81% 10.59% 19.95% 3.90% 18.58% 2.91%
Most popular series BL 25.68% 27.06% 0.00% 30.35% 0.44% 27.69% 0.00%
Favourite channels BL 30.98% 31.61% 19.20% 32.68% 21.80% 32.52% 11.28%
Favourite series / programs BL 48.58% 51.13% 0.00% 53.55% 21.83% 52.34% 1.09%
CosineSim CBF 52.02% 52.92% 34.94% 53.58% 43.65% 53.69% 30.93%
IALS1* CF 46.75% 49.26% 0.00% 52.30% 16.78% 50.28% 1.65%
ITALS1** CF 41.68% 42.60% 24.48% 44.53% 26.26% 43.84% 14.06%
Hybrid IALS1* HF 51.08% 53.82% 6.78% 56.46% 22.01% 54.95% 1.63%
Blend*** 55.48% 56.98% 26.15% 57.64% 43.61% 57.91% 24.41%
Blend*** (MRR) 0.1038 0.1070 0.0405 0.1097 0.0712 0.1094 0.0322
Online results / User Interface
Enter date in master18 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Online results / Measurement
Enter date in master19 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
• Metrics:
 Click-through rate (CTR)
 Watching Length Ratio (WR): The average watching length of the contents
that were watched at least 1 minutes by the user.
 Completed Watched Ratio (CWR): The average ratio of the events in the
content was watched at least 90% of it’s remaining length.
• Methods:
 EPG-Z: Standard consumption method (EPG and channel zapping)
 R4U: Recommended 4 U
Online results / Measurement
Enter date in master20 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
• Metrics:
 Click-through rate (CTR)
 Watching Length Ratio (WR): The average watching length of the contents
that were watched at least 1 minutes by the user.
 Completed Watched Ratio (CWR): The average ratio of the events in the
content was watched at least 90% of it’s remaining length.
• Methods:
 EPG-Z: Standard consumption method (EPG and channel zapping)
 R4U: Recommended 4 U
EPG-Z vs. R4U?
Online results / Clicks
Enter date in master21 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
0%
10%
20%
30%
40%
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
Distribution of clicks by position
• Users like to click on the first item.
• 80% of the clicks comes from one of the Top5 positions.
• More clicks in the 15th position (2.2%) than in the 14th (1.3%).
Online results / CTR
Enter date in master22 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
33,20%
35,30%
33,12%
26,49%
39,05%
52,07%
33,16%
All items
Non-series
Series
Tail items
Popular items
New items
Old items
Click-through rate by different item splits
Online results / CTR by usage
Enter date in master23 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
0%
10%
20%
30%
40%
50%
60%
70%
80%
1 10 100
Average CTR vs. # of rec. requests from the first use of R4U
Online results / Watching behavior
Enter date in master24 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Watching Length Ratio Completed Watching Ratio
Item splits EPG-Z R4U EPG-Z R4U
Old items 30.02% 42.04% 16.02% 31.03%
New items 21.11% 35.51% 8.01% 23.12%
Popular items 30.81% 44.19% 16.30% 32.27%
Long-tail items 28.01% 38.43% 15.11% 27.66%
Series 31.04% 43.00% 16.92% 31.51%
Non-series 17.94% 15.26% 5.31% 7.22%
All items 29.90% 42.02% 15.91% 30.53%
Online results / Watching behavior
• Contents selected via R4U are watched 40% longer and completed
with almost twice more probability than in standard way.
Enter date in master25 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Watching Length Ratio Completed Watching Ratio
Item splits EPG-Z R4U EPG-Z R4U
Old items 30.02% 42.04% 16.02% 31.03%
New items 21.11% 35.51% 8.01% 23.12%
Popular items 30.81% 44.19% 16.30% 32.27%
Long-tail items 28.01% 38.43% 15.11% 27.66%
Series 31.04% 43.00% 16.92% 31.51%
Non-series 17.94% 15.26% 5.31% 7.22%
All items 29.90% 42.02% 15.91% 30.53%<< <<
Online results / Offline vs. Online metrics
• High correlation between Recall/MRR and Completed Waching Ratio
Enter date in master26 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
Offline Online
Recall@15 MRR CWR
Old items 56.98% 0.1070 31.03%
New items 26.15% 0.0405 23.12%
Popular items 57.64% 0.1097 32.27%
Long-tail items 43.61% 0.0712 27.66%
Series 57.91% 0.1094 31.51%
Non-series 24.41% 0.0322 7.22%
All items 55.48% 0.1038 30.53%
Conclusion
• Linear recommendation difficulties.
• Metadata based item modeling (CBF) is quite effective, additional
improvement by combining with CF.
• Users prefer first items, they don’t do much effort.
• High click-through rate, especially for new items.
• R4U affects user behavior and satisfaction.
• Contents selected via R4U are watched 40% longer and completed
with almost twice more probability than in standard way.
• High correlation between the proposed offline and online metrics.
Enter date in master27 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States

More Related Content

Viewers also liked

Case american airlines and the sabre system
Case   american airlines and the sabre systemCase   american airlines and the sabre system
Case american airlines and the sabre systemMarshal Richard
 
Rohit Talwar- Broadening the Ancillary Revenue Stream for the Airline Retail ...
Rohit Talwar- Broadening the Ancillary Revenue Stream for the Airline Retail ...Rohit Talwar- Broadening the Ancillary Revenue Stream for the Airline Retail ...
Rohit Talwar- Broadening the Ancillary Revenue Stream for the Airline Retail ...Rohit Talwar
 
Understanding Product Recommendations: Value, Functionality & Best Practices
Understanding Product Recommendations: Value, Functionality & Best PracticesUnderstanding Product Recommendations: Value, Functionality & Best Practices
Understanding Product Recommendations: Value, Functionality & Best PracticesDynamic Yield
 
How Comcast uses Data Science to Improve the Customer Experience
How Comcast uses Data Science to Improve the Customer ExperienceHow Comcast uses Data Science to Improve the Customer Experience
How Comcast uses Data Science to Improve the Customer ExperienceTuri, Inc.
 
The Revolution to Stay Relevant in Travel
The Revolution to Stay Relevant in TravelThe Revolution to Stay Relevant in Travel
The Revolution to Stay Relevant in TravelSabre Corporation
 
Personalization - 10 Lessons Learned from Netflix
Personalization - 10 Lessons Learned from NetflixPersonalization - 10 Lessons Learned from Netflix
Personalization - 10 Lessons Learned from NetflixPancrazio Auteri
 

Viewers also liked (6)

Case american airlines and the sabre system
Case   american airlines and the sabre systemCase   american airlines and the sabre system
Case american airlines and the sabre system
 
Rohit Talwar- Broadening the Ancillary Revenue Stream for the Airline Retail ...
Rohit Talwar- Broadening the Ancillary Revenue Stream for the Airline Retail ...Rohit Talwar- Broadening the Ancillary Revenue Stream for the Airline Retail ...
Rohit Talwar- Broadening the Ancillary Revenue Stream for the Airline Retail ...
 
Understanding Product Recommendations: Value, Functionality & Best Practices
Understanding Product Recommendations: Value, Functionality & Best PracticesUnderstanding Product Recommendations: Value, Functionality & Best Practices
Understanding Product Recommendations: Value, Functionality & Best Practices
 
How Comcast uses Data Science to Improve the Customer Experience
How Comcast uses Data Science to Improve the Customer ExperienceHow Comcast uses Data Science to Improve the Customer Experience
How Comcast uses Data Science to Improve the Customer Experience
 
The Revolution to Stay Relevant in Travel
The Revolution to Stay Relevant in TravelThe Revolution to Stay Relevant in Travel
The Revolution to Stay Relevant in Travel
 
Personalization - 10 Lessons Learned from Netflix
Personalization - 10 Lessons Learned from NetflixPersonalization - 10 Lessons Learned from Netflix
Personalization - 10 Lessons Learned from Netflix
 

Similar to EPG content recommendation in large scale: a case study on interactive TV platform

[CS570] Machine Learning Team Project (I know what items really are)
[CS570] Machine Learning Team Project (I know what items really are)[CS570] Machine Learning Team Project (I know what items really are)
[CS570] Machine Learning Team Project (I know what items really are)Kunwoo Park
 
Event Mining in Social Multimedia
Event Mining in Social MultimediaEvent Mining in Social Multimedia
Event Mining in Social MultimediaSymeon Papadopoulos
 
Cikm 2013 - Beyond Data From User Information to Business Value
Cikm 2013 - Beyond Data From User Information to Business ValueCikm 2013 - Beyond Data From User Information to Business Value
Cikm 2013 - Beyond Data From User Information to Business ValueXavier Amatriain
 
Profiler for Smartphone Users Interests Using Modified Hierarchical Agglomera...
Profiler for Smartphone Users Interests Using Modified Hierarchical Agglomera...Profiler for Smartphone Users Interests Using Modified Hierarchical Agglomera...
Profiler for Smartphone Users Interests Using Modified Hierarchical Agglomera...Lippo Group Digital
 
Qcon SF 2013 - Machine Learning & Recommender Systems @ Netflix Scale
Qcon SF 2013 - Machine Learning & Recommender Systems @ Netflix ScaleQcon SF 2013 - Machine Learning & Recommender Systems @ Netflix Scale
Qcon SF 2013 - Machine Learning & Recommender Systems @ Netflix ScaleXavier Amatriain
 
Scalable Topic-Specific Influence Analysis on Microblogs
Scalable Topic-Specific Influence Analysis on MicroblogsScalable Topic-Specific Influence Analysis on Microblogs
Scalable Topic-Specific Influence Analysis on MicroblogsYuanyuan Tian
 
18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting wit...
18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting wit...18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting wit...
18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting wit...LINE Corp.
 
Data Summer Conf 2018, “Architecting IoT system with Machine Learning (ENG)” ...
Data Summer Conf 2018, “Architecting IoT system with Machine Learning (ENG)” ...Data Summer Conf 2018, “Architecting IoT system with Machine Learning (ENG)” ...
Data Summer Conf 2018, “Architecting IoT system with Machine Learning (ENG)” ...Provectus
 
Architecting IoT with Machine Learning
Architecting IoT with Machine LearningArchitecting IoT with Machine Learning
Architecting IoT with Machine LearningRudradeb Mitra
 
The Data Distribution Service Tutorial
The Data Distribution Service TutorialThe Data Distribution Service Tutorial
The Data Distribution Service TutorialAngelo Corsaro
 
Synthetic Data Generation with DoppelGanger
Synthetic Data Generation with DoppelGangerSynthetic Data Generation with DoppelGanger
Synthetic Data Generation with DoppelGangerQuantUniversity
 
Analysis random org nist2005
Analysis random org nist2005Analysis random org nist2005
Analysis random org nist2005eliecerherrera
 
VSSML17 L6. Time Series and Deepnets
VSSML17 L6. Time Series and DeepnetsVSSML17 L6. Time Series and Deepnets
VSSML17 L6. Time Series and DeepnetsBigML, Inc
 
Digital transformation: Highly resilient streaming architecture and strategie...
Digital transformation: Highly resilient streaming architecture and strategie...Digital transformation: Highly resilient streaming architecture and strategie...
Digital transformation: Highly resilient streaming architecture and strategie...HostedbyConfluent
 
Master's Thesis Presentation
Master's Thesis PresentationMaster's Thesis Presentation
Master's Thesis PresentationWajdi Khattel
 
[Paper Review] Personalized Top-N Sequential Recommendation via Convolutional...
[Paper Review] Personalized Top-N Sequential Recommendation via Convolutional...[Paper Review] Personalized Top-N Sequential Recommendation via Convolutional...
[Paper Review] Personalized Top-N Sequential Recommendation via Convolutional...Jihoo Kim
 
Test Tool for Industrial Ethernet Network Performance (June 2009)
Test Tool for Industrial Ethernet Network Performance (June 2009)Test Tool for Industrial Ethernet Network Performance (June 2009)
Test Tool for Industrial Ethernet Network Performance (June 2009)Jim Gilsinn
 
Big Data - Big Insights - Waze @Google
Big Data - Big Insights - Waze @GoogleBig Data - Big Insights - Waze @Google
Big Data - Big Insights - Waze @GoogleDaniel Marcous
 
Scalable Distributed Real-Time Clustering for Big Data Streams
Scalable Distributed Real-Time Clustering for Big Data StreamsScalable Distributed Real-Time Clustering for Big Data Streams
Scalable Distributed Real-Time Clustering for Big Data StreamsAntonio Severien
 

Similar to EPG content recommendation in large scale: a case study on interactive TV platform (20)

[CS570] Machine Learning Team Project (I know what items really are)
[CS570] Machine Learning Team Project (I know what items really are)[CS570] Machine Learning Team Project (I know what items really are)
[CS570] Machine Learning Team Project (I know what items really are)
 
Event Mining in Social Multimedia
Event Mining in Social MultimediaEvent Mining in Social Multimedia
Event Mining in Social Multimedia
 
Cikm 2013 - Beyond Data From User Information to Business Value
Cikm 2013 - Beyond Data From User Information to Business ValueCikm 2013 - Beyond Data From User Information to Business Value
Cikm 2013 - Beyond Data From User Information to Business Value
 
Profiler for Smartphone Users Interests Using Modified Hierarchical Agglomera...
Profiler for Smartphone Users Interests Using Modified Hierarchical Agglomera...Profiler for Smartphone Users Interests Using Modified Hierarchical Agglomera...
Profiler for Smartphone Users Interests Using Modified Hierarchical Agglomera...
 
Qcon SF 2013 - Machine Learning & Recommender Systems @ Netflix Scale
Qcon SF 2013 - Machine Learning & Recommender Systems @ Netflix ScaleQcon SF 2013 - Machine Learning & Recommender Systems @ Netflix Scale
Qcon SF 2013 - Machine Learning & Recommender Systems @ Netflix Scale
 
Scalable Topic-Specific Influence Analysis on Microblogs
Scalable Topic-Specific Influence Analysis on MicroblogsScalable Topic-Specific Influence Analysis on Microblogs
Scalable Topic-Specific Influence Analysis on Microblogs
 
18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting wit...
18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting wit...18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting wit...
18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting wit...
 
Data Summer Conf 2018, “Architecting IoT system with Machine Learning (ENG)” ...
Data Summer Conf 2018, “Architecting IoT system with Machine Learning (ENG)” ...Data Summer Conf 2018, “Architecting IoT system with Machine Learning (ENG)” ...
Data Summer Conf 2018, “Architecting IoT system with Machine Learning (ENG)” ...
 
Architecting IoT with Machine Learning
Architecting IoT with Machine LearningArchitecting IoT with Machine Learning
Architecting IoT with Machine Learning
 
The Data Distribution Service Tutorial
The Data Distribution Service TutorialThe Data Distribution Service Tutorial
The Data Distribution Service Tutorial
 
Synthetic Data Generation with DoppelGanger
Synthetic Data Generation with DoppelGangerSynthetic Data Generation with DoppelGanger
Synthetic Data Generation with DoppelGanger
 
Analysis random org nist2005
Analysis random org nist2005Analysis random org nist2005
Analysis random org nist2005
 
VSSML17 L6. Time Series and Deepnets
VSSML17 L6. Time Series and DeepnetsVSSML17 L6. Time Series and Deepnets
VSSML17 L6. Time Series and Deepnets
 
Digital transformation: Highly resilient streaming architecture and strategie...
Digital transformation: Highly resilient streaming architecture and strategie...Digital transformation: Highly resilient streaming architecture and strategie...
Digital transformation: Highly resilient streaming architecture and strategie...
 
Master's Thesis Presentation
Master's Thesis PresentationMaster's Thesis Presentation
Master's Thesis Presentation
 
[Paper Review] Personalized Top-N Sequential Recommendation via Convolutional...
[Paper Review] Personalized Top-N Sequential Recommendation via Convolutional...[Paper Review] Personalized Top-N Sequential Recommendation via Convolutional...
[Paper Review] Personalized Top-N Sequential Recommendation via Convolutional...
 
Machine Learning Impact on IoT - Part 2
Machine Learning Impact on IoT - Part 2Machine Learning Impact on IoT - Part 2
Machine Learning Impact on IoT - Part 2
 
Test Tool for Industrial Ethernet Network Performance (June 2009)
Test Tool for Industrial Ethernet Network Performance (June 2009)Test Tool for Industrial Ethernet Network Performance (June 2009)
Test Tool for Industrial Ethernet Network Performance (June 2009)
 
Big Data - Big Insights - Waze @Google
Big Data - Big Insights - Waze @GoogleBig Data - Big Insights - Waze @Google
Big Data - Big Insights - Waze @Google
 
Scalable Distributed Real-Time Clustering for Big Data Streams
Scalable Distributed Real-Time Clustering for Big Data StreamsScalable Distributed Real-Time Clustering for Big Data Streams
Scalable Distributed Real-Time Clustering for Big Data Streams
 

More from David Zibriczky

Highlights from the 8th ACM Conference on Recommender Systems (RecSys 2014)
Highlights from the 8th ACM Conference on Recommender Systems (RecSys 2014)Highlights from the 8th ACM Conference on Recommender Systems (RecSys 2014)
Highlights from the 8th ACM Conference on Recommender Systems (RecSys 2014)David Zibriczky
 
Predictive Solutions and Analytics for TV & Entertainment Businesses
Predictive Solutions and Analytics for TV & Entertainment BusinessesPredictive Solutions and Analytics for TV & Entertainment Businesses
Predictive Solutions and Analytics for TV & Entertainment BusinessesDavid Zibriczky
 
Improving the TV User Experience by Algorithms: Personalized Content Recommen...
Improving the TV User Experience by Algorithms: Personalized Content Recommen...Improving the TV User Experience by Algorithms: Personalized Content Recommen...
Improving the TV User Experience by Algorithms: Personalized Content Recommen...David Zibriczky
 
Recommender Systems meet Finance - A literature review
Recommender Systems meet Finance - A literature reviewRecommender Systems meet Finance - A literature review
Recommender Systems meet Finance - A literature reviewDavid Zibriczky
 
A Combination of Simple Models by Forward Predictor Selection for Job Recomme...
A Combination of Simple Models by Forward Predictor Selection for Job Recomme...A Combination of Simple Models by Forward Predictor Selection for Job Recomme...
A Combination of Simple Models by Forward Predictor Selection for Job Recomme...David Zibriczky
 
Fast ALS-Based Matrix Factorization for Recommender Systems
Fast ALS-Based Matrix Factorization for Recommender SystemsFast ALS-Based Matrix Factorization for Recommender Systems
Fast ALS-Based Matrix Factorization for Recommender SystemsDavid Zibriczky
 
An introduction to Recommender Systems
An introduction to Recommender SystemsAn introduction to Recommender Systems
An introduction to Recommender SystemsDavid Zibriczky
 
Data Modeling in IPTV and OTT Recommender Systems
Data Modeling in IPTV and OTT Recommender SystemsData Modeling in IPTV and OTT Recommender Systems
Data Modeling in IPTV and OTT Recommender SystemsDavid Zibriczky
 
Entropy based asset pricing
Entropy based asset pricingEntropy based asset pricing
Entropy based asset pricingDavid Zibriczky
 

More from David Zibriczky (9)

Highlights from the 8th ACM Conference on Recommender Systems (RecSys 2014)
Highlights from the 8th ACM Conference on Recommender Systems (RecSys 2014)Highlights from the 8th ACM Conference on Recommender Systems (RecSys 2014)
Highlights from the 8th ACM Conference on Recommender Systems (RecSys 2014)
 
Predictive Solutions and Analytics for TV & Entertainment Businesses
Predictive Solutions and Analytics for TV & Entertainment BusinessesPredictive Solutions and Analytics for TV & Entertainment Businesses
Predictive Solutions and Analytics for TV & Entertainment Businesses
 
Improving the TV User Experience by Algorithms: Personalized Content Recommen...
Improving the TV User Experience by Algorithms: Personalized Content Recommen...Improving the TV User Experience by Algorithms: Personalized Content Recommen...
Improving the TV User Experience by Algorithms: Personalized Content Recommen...
 
Recommender Systems meet Finance - A literature review
Recommender Systems meet Finance - A literature reviewRecommender Systems meet Finance - A literature review
Recommender Systems meet Finance - A literature review
 
A Combination of Simple Models by Forward Predictor Selection for Job Recomme...
A Combination of Simple Models by Forward Predictor Selection for Job Recomme...A Combination of Simple Models by Forward Predictor Selection for Job Recomme...
A Combination of Simple Models by Forward Predictor Selection for Job Recomme...
 
Fast ALS-Based Matrix Factorization for Recommender Systems
Fast ALS-Based Matrix Factorization for Recommender SystemsFast ALS-Based Matrix Factorization for Recommender Systems
Fast ALS-Based Matrix Factorization for Recommender Systems
 
An introduction to Recommender Systems
An introduction to Recommender SystemsAn introduction to Recommender Systems
An introduction to Recommender Systems
 
Data Modeling in IPTV and OTT Recommender Systems
Data Modeling in IPTV and OTT Recommender SystemsData Modeling in IPTV and OTT Recommender Systems
Data Modeling in IPTV and OTT Recommender Systems
 
Entropy based asset pricing
Entropy based asset pricingEntropy based asset pricing
Entropy based asset pricing
 

Recently uploaded

1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样vhwb25kk
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...GQ Research
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...ssuserf63bd7
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxBoston Institute of Analytics
 
While-For-loop in python used in college
While-For-loop in python used in collegeWhile-For-loop in python used in college
While-For-loop in python used in collegessuser7a7cd61
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...limedy534
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...Amil Baba Dawood bangali
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 

Recently uploaded (20)

1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
Call Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort ServiceCall Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort Service
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
 
While-For-loop in python used in college
While-For-loop in python used in collegeWhile-For-loop in python used in college
While-For-loop in python used in college
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 

EPG content recommendation in large scale: a case study on interactive TV platform

  • 1. EPG content recommendation in large scale: a case study on interactive TV platform D. Zibriczky, Z. Petres, M. Waszlavik, D. Tikk ICMLA 2013 - Machine Learning with Multimedia Data 7th December 2013. Miami. United States
  • 2. Outline • Introduction • Problem • Solution • Offline results • Online results • Conclusion Enter date in master2 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
  • 3. Introduction / Consumption trends 3 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
  • 4. Introduction / Electronic Program Guide 4 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
  • 5. Introduction / Goal • SaskTel • Finding relevant contents with minimal effort • Time-shifting • Multiple devices per household • Graphical User Interface • Increasing content consumption / watching length • Increasing click through rate (CTR) using Gravity’s GUI Enter date in master5 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
  • 6. Problem / Recommendation concept • User: Device  Users cannot be distinguished explicitly  More than one device per household • Item: Scheduled contents (time, program id, channel id)  Typically series or programs without episodes  Metadata: Information about the items • Event: Remote controller / set-top-box based implicit feedbacks  Switching channel, set to record, rewind, replay, stop, pause  Next schedule, watching duration • Recommendable items  Set of series or programs that are broadcasted at the moment of recommendation request or later (on now, on later scenario) • Recommendation  Sorting recommendable items by prediction values  Other recommendation logic (randomization, mixing, etc..) Enter date in master6 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
  • 7. Problem / Difficulties • Implicit feedbacks only (no explicit data) • Huge but noisy data set (zapping, leave-on, irrelevant events, …) • Cold start problem (new items, short lifetime) • Small recommendable set at a time • Context dependency (time, multiple users per household) • Difference between offline and online optimization Enter date in master7 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
  • 8. Solution / Baselines • Most popular channels • Most popular contents (series or programs) • Users’ favourite channels • Users’ favourite contents (series or programs) Enter date in master8 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
  • 9. Solution / Content-based filtering • Cosine Similarity • User model: Weighted average of meta vectors • Prediction: Cosine similarity of vectors • Improvement: Term frequency based weighting (TFIDF) Enter date in master9 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States M The Simpsons How I Met Your Mother Futurama … Genre = Animation 1 0 1 … Genre = Comedy 1 1 1 … … … … … … Director = Matt Groening 1 0 1 … Director = Carter Bays 0 1 0 … Actor = Dan Castellaneta 1 0 0 … Actor = Billy West 0 0 1 … … … … … … User 1 0.53 0.81 … 0.18 0.00 0.18 0.00 …
  • 10. Solution / Collaborative Filtering • Matrix Factorization • User model: User factors • Prediction: Dot product of latent factors • Solver: Alternating Least Squares with Coordinate Descent (IALS1) Enter date in master10 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States R The Simpsons How I Met Your Mother Futurama … User 1 1 … User 2 1 1 u2*i3 … User 3 1 … … … … … … Item factors i11 i21 i31 … i21 i22 i32 … User factors u11 u12 u21 u22 u31 u32 … …
  • 11. Solution / Hybrid filtering Enter date in master11 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States R* The Simpsons How I Met Your Mother Futurama … User 1 1 0 0 … User 2 1 1 0 … User 3 0 1 0 … … … … … … Genre = Animation 1 0 1 … Genre = Comedy 1 1 1 … … … … … … Director = Matt Groening 1 0 1 … Director = Carter Bays 0 1 0 … Actor = Dan Castellaneta 1 0 0 … Actor = Billy West 0 0 1 … … … … … … User factors u11 u12 u21 u22 u31 u32 … … pu11 pu12 pu22 pu22 … … … … … … … … … … … … Item factors i11 i21 I31 … i21 i22 I32 … • Hybrid IALS1
  • 12. TP factors Solution / Channel recommendation • Tensor factorization (ITALS1) • Prediction: Hadamard product of latent factors • Improvement: Watching duration based weighting Enter date in master12 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States R (4:00-12:00) Channel Sports 1 Channel Sports 2 Channel News 1 … User 1 1 1 … User 2 1 … User 3 1 … … … … … … R (12:00-20:00) Channel Sports 1 Channel Sports 2 Channel News 1 … User 1 1 … User 2 1 1 … User 3 1 … … … … … … R (20:00-4:00) Channel Sports 1 Channel Sports 2 Channel News 1 … User 1 1 … User 2 1 … User 3 u3°i2°t3 … … … … … … User factors u11 u12 u21 u22 u31 u32 … … Item factors i11 i21 i31 … i21 i22 i32 … t11 t12 t21 t22 t31 t32
  • 13. Solution / Item grouping Enter date in master13 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
  • 14. Solution / Preprocessing Enter date in master14 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States Original set 308M 82M 23M Train set 22M Test set 676K Event type based filtering by significance 1 2 3 3 1 2 3 Filtering by leave-on and short duration Splitting by time
  • 15. Offline results / Measurement Enter date in master15 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States • Metrics:  Recall@N  Mean Reciprocial Rank (MRR) • Item splits:  Having events on training set or not o old items o new items  Popularity 20-80 split o popular items o tail items  Episode of a series or not o series o non-series
  • 16. • Recall@15 * Items are grouped by series ids or program ids ** Items are grouped by channel ids *** Blend: Combination of CosineSim, IALS1, ITALS1, HybridIALS1 and favourite programs/series Offline results / Comparison Enter date in master16 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States Algorithm Type All items Old items New items Popular items Tail items Series Non- series Most popular channels BL 17.45% 17.81% 10.59% 19.95% 3.90% 18.58% 2.91% Most popular series BL 25.68% 27.06% 0.00% 30.35% 0.44% 27.69% 0.00% Favourite channels BL 30.98% 31.61% 19.20% 32.68% 21.80% 32.52% 11.28% Favourite series / programs BL 48.58% 51.13% 0.00% 53.55% 21.83% 52.34% 1.09% CosineSim CBF 52.02% 52.92% 34.94% 53.58% 43.65% 53.69% 30.93% IALS1* CF 46.75% 49.26% 0.00% 52.30% 16.78% 50.28% 1.65% ITALS1** CF 41.68% 42.60% 24.48% 44.53% 26.26% 43.84% 14.06% Hybrid IALS1* HF 51.08% 53.82% 6.78% 56.46% 22.01% 54.95% 1.63% Blend*** 55.48% 56.98% 26.15% 57.64% 43.61% 57.91% 24.41% Blend*** (MRR) 0.1038 0.1070 0.0405 0.1097 0.0712 0.1094 0.0322
  • 17. • Recall@15 * Items are grouped by series ids or program ids ** Items are grouped by channel ids *** Blend: Combination of CosineSim, IALS1, ITALS1, HybridIALS1 and favourite programs/series Offline results / Comparison Enter date in master17 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States Algorithm Type All items Old items New items Popular items Tail items Series Non- series Most popular channels BL 17.45% 17.81% 10.59% 19.95% 3.90% 18.58% 2.91% Most popular series BL 25.68% 27.06% 0.00% 30.35% 0.44% 27.69% 0.00% Favourite channels BL 30.98% 31.61% 19.20% 32.68% 21.80% 32.52% 11.28% Favourite series / programs BL 48.58% 51.13% 0.00% 53.55% 21.83% 52.34% 1.09% CosineSim CBF 52.02% 52.92% 34.94% 53.58% 43.65% 53.69% 30.93% IALS1* CF 46.75% 49.26% 0.00% 52.30% 16.78% 50.28% 1.65% ITALS1** CF 41.68% 42.60% 24.48% 44.53% 26.26% 43.84% 14.06% Hybrid IALS1* HF 51.08% 53.82% 6.78% 56.46% 22.01% 54.95% 1.63% Blend*** 55.48% 56.98% 26.15% 57.64% 43.61% 57.91% 24.41% Blend*** (MRR) 0.1038 0.1070 0.0405 0.1097 0.0712 0.1094 0.0322
  • 18. Online results / User Interface Enter date in master18 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
  • 19. Online results / Measurement Enter date in master19 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States • Metrics:  Click-through rate (CTR)  Watching Length Ratio (WR): The average watching length of the contents that were watched at least 1 minutes by the user.  Completed Watched Ratio (CWR): The average ratio of the events in the content was watched at least 90% of it’s remaining length. • Methods:  EPG-Z: Standard consumption method (EPG and channel zapping)  R4U: Recommended 4 U
  • 20. Online results / Measurement Enter date in master20 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States • Metrics:  Click-through rate (CTR)  Watching Length Ratio (WR): The average watching length of the contents that were watched at least 1 minutes by the user.  Completed Watched Ratio (CWR): The average ratio of the events in the content was watched at least 90% of it’s remaining length. • Methods:  EPG-Z: Standard consumption method (EPG and channel zapping)  R4U: Recommended 4 U EPG-Z vs. R4U?
  • 21. Online results / Clicks Enter date in master21 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States 0% 10% 20% 30% 40% 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 Distribution of clicks by position • Users like to click on the first item. • 80% of the clicks comes from one of the Top5 positions. • More clicks in the 15th position (2.2%) than in the 14th (1.3%).
  • 22. Online results / CTR Enter date in master22 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States 33,20% 35,30% 33,12% 26,49% 39,05% 52,07% 33,16% All items Non-series Series Tail items Popular items New items Old items Click-through rate by different item splits
  • 23. Online results / CTR by usage Enter date in master23 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States 0% 10% 20% 30% 40% 50% 60% 70% 80% 1 10 100 Average CTR vs. # of rec. requests from the first use of R4U
  • 24. Online results / Watching behavior Enter date in master24 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States Watching Length Ratio Completed Watching Ratio Item splits EPG-Z R4U EPG-Z R4U Old items 30.02% 42.04% 16.02% 31.03% New items 21.11% 35.51% 8.01% 23.12% Popular items 30.81% 44.19% 16.30% 32.27% Long-tail items 28.01% 38.43% 15.11% 27.66% Series 31.04% 43.00% 16.92% 31.51% Non-series 17.94% 15.26% 5.31% 7.22% All items 29.90% 42.02% 15.91% 30.53%
  • 25. Online results / Watching behavior • Contents selected via R4U are watched 40% longer and completed with almost twice more probability than in standard way. Enter date in master25 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States Watching Length Ratio Completed Watching Ratio Item splits EPG-Z R4U EPG-Z R4U Old items 30.02% 42.04% 16.02% 31.03% New items 21.11% 35.51% 8.01% 23.12% Popular items 30.81% 44.19% 16.30% 32.27% Long-tail items 28.01% 38.43% 15.11% 27.66% Series 31.04% 43.00% 16.92% 31.51% Non-series 17.94% 15.26% 5.31% 7.22% All items 29.90% 42.02% 15.91% 30.53%<< <<
  • 26. Online results / Offline vs. Online metrics • High correlation between Recall/MRR and Completed Waching Ratio Enter date in master26 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States Offline Online Recall@15 MRR CWR Old items 56.98% 0.1070 31.03% New items 26.15% 0.0405 23.12% Popular items 57.64% 0.1097 32.27% Long-tail items 43.61% 0.0712 27.66% Series 57.91% 0.1094 31.51% Non-series 24.41% 0.0322 7.22% All items 55.48% 0.1038 30.53%
  • 27. Conclusion • Linear recommendation difficulties. • Metadata based item modeling (CBF) is quite effective, additional improvement by combining with CF. • Users prefer first items, they don’t do much effort. • High click-through rate, especially for new items. • R4U affects user behavior and satisfaction. • Contents selected via R4U are watched 40% longer and completed with almost twice more probability than in standard way. • High correlation between the proposed offline and online metrics. Enter date in master27 ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States
  • 28. ICMLA 2013 - Machine Learning with Multimedia Data. 2013. Miami. United States