Collaborative Filtering & Content-Based Recommending CS 293S. - PowerPoint PPT Presentation

Collaborative Filtering & Content-Based Recommending CS 293S. T. Yang Slides based on R. Mooney at UT Austin 1

Recommendation Systems • Systems for recommending items (e.g. books, movies, music, web pages, newsgroup messages) to users based on examples of their preferences. – Amazon, Netflix. Increase sales at on-line stores. • Basic approaches to recommending: – Collaborative Filtering (a.k.a. social filtering) – Content-based • Instances of personalization software. – adapting to the individual needs, interests, and preferences of each user with recommending, filtering, & predicting 2

Process of Book Recommendation Red Mars Found ation Juras- Machine User sic Park Learning Profile Lost World 2001 Neuro- 2010 mancer Differ- ence Engine 3

Collaborative Filtering • Maintain a database of many users’ ratings of a variety of items. • For a given user, find other similar users whose ratings strongly correlate with the current user. • Recommend items rated highly by these similar users, but not rated by the current user. • Almost all existing commercial recommenders use this approach (e.g. Amazon). User rating? User rating User rating User rating User rating User rating Item recommendation 4

Collaborative Filtering A 9 A A 5 A A 6 A 10 User B 3 B B 3 B B 4 B 4 C C 9 C C 8 C C 8 Database : : : : : : : : : : . . Z 5 Z 10 Z 7 Z Z Z 1 A 9 A 10 B 3 B 4 Correlation C C 8 Match : : . . Z 5 Z 1 A 9 Extract Active C B 3 C Recommendations User . . Z 5 5

Collaborative Filtering Method 1. Weight all users with respect to similarity with the active user. 2. Select a subset of the users ( neighbors ) to use as predictors. 3. Normalize ratings and compute a prediction from a weighted combination of the selected neighbors’ ratings. 4. Present items with highest predicted ratings as recommendations. 6

Find users with similar ratings/interests A 9 A A 5 A A 6 A 10 User B 3 B B 3 B B 4 B 4 C C 9 C C 8 C C 8 Database : : : : : : : : : : . . Z 5 Z 10 Z 7 Z Z Z 1 r u Which users have similar ratings? A 9 r a Active B 3 C User . . Z 5 7

Similarity Weighting • Similarity of two rating vectors for active user, a , and another user, u . covar ( r , r ) , = a u c – Pearson correlation coefficient a u s s r r – a cosine similarity formula a u r a and r u are the ratings vectors for the m items rated by both a and u A 9 A A 5 A A 6 A 10 User B 3 B B 3 B B 4 B 4 C C 9 C C 8 C C 8 Database : : : : : : : : : : . . 8 Z 5 Z 10 Z 7 Z Z Z 1

Definition: Covariance and Standard Deviation • Covariance: m å - - ( r r )( r r ) a , i a u , i u = = i 1 covar ( r , r ) a u m m å r x , i = = r i 1 m å x - 2 m ( r r ) x , i x s = = i 1 r x m • Standard Deviation: • Pearson correlation coefficient covar ( r , r ) = = - - a u c Cosine ( r r , r r ) a , u a a u u s s r r a u 9

Neighbor Selection • For a given active user, a , select correlated users to serve as source of predictions. – Standard approach is to use the most similar n users, u , based on similarity weights, w a,u – Alternate approach is to include all users whose similarity weight is above a given threshold. Sim( r a , r u )> t a 10

Significance Weighting • Important not to trust correlations based on very few co-rated items. • Include significance weights , s a,u , based on number of co-rated items, m . , = w s c a u a , u a , u > ì ü 1 if m 50 ï ï = m s í ý £ if m 50 a , u ï ï î þ 50 11

Rating Prediction (Version 0) • Predict a rating, p a,i , for each item i , for active user, a , by using the n selected neighbor users, u Î {1,2,… n }. • Weight users’ ratings contribution by their similarity to the active user. n å w r a , u u , i User a = = p u 1 a , i n å w a , u = u 1 Item i 12

Rating Prediction (Version 1) • Predict a rating, p a,i , for each item i , for active user, a , by using the n selected neighbor users, u Î {1,2,… n }. • To account for users different ratings levels, base predictions on differences from a user’s average rating. • Weight users’ ratings contribution by their similarity to the active user. User a n å - w ( r r ) a , u u , i u = + = p r u 1 a , i a n å w a , u = u 1 Item i 13

Problems with Collaborative Filtering • Cold Start : There needs to be enough other users already in the system to find a match. • Sparsity : If there are many items to be recommended, even if there are many users, the user/ratings matrix is sparse, and it is hard to find users that have rated the same items. • First Rater : Cannot recommend an item that has not been previously rated. – New items, esoteric items • Popularity Bias : Cannot recommend items to someone with unique tastes. – Tends to recommend popular items. 14

Recommendation vs Web Ranking User click data Text Content User rating Link popularity Content Item recommendation Web page ranking 15

Content-Based Recommendation • Recommendations are based on information on the content of items rather than on other users’ opinions. – Less dependence for data on other users. • Able to recommend to users with unique tastes. • Able to recommend new and unpopular items – No first-rater problem. – No cold-start or sparsity problems.. 16

Example: LIBRA System Amazon Book Pages LIBRA Database Information Extraction Uses information Rated Author Examples Machine Learning Title Editorial Reviews Learner Customer Comments Subject terms Recommendations Related authors Related titles 1.~~~~~~ User Profile 2.~~~~~~~ 3.~~~~~ : : Predictor : 17

Combining Content and Collaboration • Content-based and collaborative methods have complementary strengths and weaknesses. • Combine methods to obtain the best of both. • Various hybrid approaches: – Apply both methods and combine recommendations. – Use collaborative data as content. – Use content-based predictor as another collaborator. – Use content-based predictor to complete collaborative data. 18

Content-Boosted Collaborative Filtering EachMovie Web Crawler IMDb Movie Content Database User Ratings Full User Matrix (Sparse) Ratings Matrix Content-based Predictor Collaborative Active Filtering User Ratings Recommendations 19

Content-Boosted Collaborative Filtering User-ratings Vector Training Examples Content-Based Predictor Pseudo User-ratings Vector User-rated Items Unrated Items Items with Predicted Ratings 20

Content-Boosted Collaborative Filtering Content-Based User Ratings Pseudo User Predictor Matrix Ratings Matrix • Compute pseudo user ratings matrix – Full matrix – approximates actual full user ratings matrix • Perform collaborative filtering – Using Pearson corr. between pseudo user-rating vectors 21

Conclusions • Recommending and personalization are important approaches to combating information over-load. • Machine Learning is an important part of systems for these tasks. • Collaborative filtering has problems. • Content-based methods address these problems (but have problems of their own). • Integrating both is best. 22

Collaborative Filtering & Content-Based Recommending CS 293S. - PowerPoint PPT Presentation

Collaborative Filtering & Content-Based Recommending CS 293S. T. Yang Slides based on R. Mooney at UT Austin 1 Recommendation Systems Systems for recommending items (e.g. books, movies, music, web pages, newsgroup messages) to users

CS490W: What is Collaborative Filtering? Collaborative Filtering (CF): Making recommendation

Filtering Cubemaps Filtering Cubemaps Angular Extent Filtering and Edge Seam Fixup Methods

Traffic Control Mechanisms Filtering Source address filtering Other forms of filtering

Lesson 7 Rate Conversion Filtering and Downsampling interchange Filtering and Upsampling

Part 14: Content-Based Filtering and Hybrid Systems Francesco Ricci Content p Typologies of

A Case Study on Recommending Software Components using Collaborative Filtering Mel Cinnide

Collaborative Filtering Yun-Ta Tsai 1 , Markus Steinberger 2 , Dawid Pajk 3 , Kari Pulli 4 1

Collaborative Filtering Presentation by Alex Hugger Filtering Documents Mittwoch, 28. April 2010

Recommending and Targeting Gabrielle Demange Paris School of Economics July 9, 2015 Gabrielle

Collaborative Filtering Practical Machine Learning, CS 294-34 Lester Mackey Based on slides by

1 Collaborative Filtering Collaborative Filtering Maintain a database of many users

aHomestake Array and Wiener Filtering Array Coherence Wiener Filtering Velocity Measurements

Least-Action Filtering L. C. G. Rogers Statistical Laboratory, University of Cambridge

The Filtering Matrix Interrogating Internet Filtering and Surveillance Practices Worldwide Nart

Statistical Filtering and Control for AI and Robotics Part I. Bayes filtering Riccardo Muradore

1 An Filtering System that Monitors Document Search Engines Can Help, But Not Enough!

On the design of UAS horizontal separation maneuvers Enric Pastor Marc Prez-Batlle Pablo Royo

Implementing Raft protocol by coroutines and Ktor Framework Andrii Rodionov @AndriiRodionov

Simple objects on Z Fix any i : Z X closed and complementary j : U X open for

Cartesian Cubical Computational Type Theory Favonia U of Minnesota Joint work with Carlo

Lets get a federated identity Do you have access to your email? Youll need a valid email

QCrypt 2018: On the possibility of classical client blind quantum computing Alexandru Cojocaru,

bias ASD98b s d1 d2 d3 disc lvds d diffamp/shaper stages s d1 d2 d3 disc lvds d 1

Renewable Energy in Alaska Kathleen Davenport & Laurel Stevens University of Alaska

Collaborative Filtering & Content-Based Recommending CS 293S. - PowerPoint PPT Presentation

Collaborative Filtering & Content-Based Recommending CS 293S. T. Yang Slides based on R. Mooney at UT Austin 1 Recommendation Systems Systems for recommending items (e.g. books, movies, music, web pages, newsgroup messages) to users

CS490W: What is Collaborative Filtering? Collaborative Filtering (CF): Making recommendation

Filtering Cubemaps Filtering Cubemaps Angular Extent Filtering and Edge Seam Fixup Methods

Traffic Control Mechanisms Filtering Source address filtering Other forms of filtering

Lesson 7 Rate Conversion Filtering and Downsampling interchange Filtering and Upsampling

Part 14: Content-Based Filtering and Hybrid Systems Francesco Ricci Content p Typologies of

A Case Study on Recommending Software Components using Collaborative Filtering Mel Cinnide

Collaborative Filtering Yun-Ta Tsai 1 , Markus Steinberger 2 , Dawid Pajk 3 , Kari Pulli 4 1

Collaborative Filtering Presentation by Alex Hugger Filtering Documents Mittwoch, 28. April 2010

Recommending and Targeting Gabrielle Demange Paris School of Economics July 9, 2015 Gabrielle

Collaborative Filtering Practical Machine Learning, CS 294-34 Lester Mackey Based on slides by

1 Collaborative Filtering Collaborative Filtering Maintain a database of many users

aHomestake Array and Wiener Filtering Array Coherence Wiener Filtering Velocity Measurements

Least-Action Filtering L. C. G. Rogers Statistical Laboratory, University of Cambridge

The Filtering Matrix Interrogating Internet Filtering and Surveillance Practices Worldwide Nart

Statistical Filtering and Control for AI and Robotics Part I. Bayes filtering Riccardo Muradore

1 An Filtering System that Monitors Document Search Engines Can Help, But Not Enough!

On the design of UAS horizontal separation maneuvers Enric Pastor Marc Prez-Batlle Pablo Royo

Implementing Raft protocol by coroutines and Ktor Framework Andrii Rodionov @AndriiRodionov

Simple objects on Z Fix any i : Z X closed and complementary j : U X open for

Cartesian Cubical Computational Type Theory Favonia U of Minnesota Joint work with Carlo

Lets get a federated identity Do you have access to your email? Youll need a valid email

QCrypt 2018: On the possibility of classical client blind quantum computing Alexandru Cojocaru,

bias ASD98b s d1 d2 d3 disc lvds d diffamp/shaper stages s d1 d2 d3 disc lvds d 1

Renewable Energy in Alaska Kathleen Davenport &amp; Laurel Stevens University of Alaska

Renewable Energy in Alaska Kathleen Davenport & Laurel Stevens University of Alaska