Understanding Similarity Metrics in Neighbour-based Recommender Systems
Alejandro Bellogín, Arjen de Vries
Information Access, CWI
ICTIR, October 2013
Motivation
Why do some recommendation methods perform better than others?
Focus: nearest-neighbour recommenders
• Which aspects of the similarity function matter most?
• How can we exploit that information?
Context
Recommender systems
• Users interact (rate, purchase, click) with items
• Which items will the user like?
Context
Nearest-neighbour recommendation methods
• The item prediction is based on “similar” users
Different similarity metrics – different neighbours
Different similarity metrics – different recommendations
[figure: similarity scores sim(u, v) between user pairs determine which recommendations each user receives]
Research question
How does the choice of a similarity metric determine the quality of the recommendations?
Problem: sparsity
With so many items, not enough ratings are available
A user’s neighbourhood is likely to include not-so-similar users
Different similarity metrics – which one is better?
Consider Cosine vs Pearson similarity
Most existing studies report that Pearson correlation leads to superior recommendation accuracy
Different similarity metrics – which one is better?
Consider Cosine vs Pearson similarity
Common variations to deal with sparsity (a code sketch follows this list):
• Thresholding: threshold to filter out similarities (no observed difference)
• Item selection: use full profiles or only the overlap
• Imputation: default value for unrated items
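A minimal sketch of the two metrics and the item-selection, imputation, and thresholding variants, assuming user profiles stored as item-to-rating dicts. This is illustrative code under those assumptions, not the implementation used in the paper:

```python
import numpy as np

def user_similarity(ratings_u, ratings_v, metric="cosine",
                    item_selection="overlap", default=0.0,
                    threshold=None):
    """Similarity between two users, each given as an {item: rating} dict.

    item_selection="overlap" uses only co-rated items; "full" uses the
    union of both profiles, imputing `default` for unrated items (the
    imputation variant). `threshold`, if set, zeroes out similarities
    below it (the thresholding variant).
    """
    if item_selection == "overlap":
        items = set(ratings_u) & set(ratings_v)
    else:  # "full"
        items = set(ratings_u) | set(ratings_v)
    if not items:
        return 0.0
    u = np.array([ratings_u.get(i, default) for i in items])
    v = np.array([ratings_v.get(i, default) for i in items])
    if metric == "pearson":  # Pearson = cosine of the mean-centred vectors
        u, v = u - u.mean(), v - v.mean()
    norm = np.linalg.norm(u) * np.linalg.norm(v)
    sim = float(u @ v / norm) if norm > 0 else 0.0
    return sim if threshold is None or sim >= threshold else 0.0
```

With `item_selection="full"` and `default=0.0`, the cosine variant presumably corresponds to the “Cosine Full0” configuration mentioned later in the deck.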
Different similarity metrics – which one is better?
Which similarity metric is better?
• Cosine is not superior for every variation
Which variation is better?
• The variations do not show consistent results
Why do some variations improve or degrade performance?
→ Analysis of similarity features
Analysis of similarity metrics
Based on
• Distance/similarity distribution
• Nearest-neighbour graph
Analysis of similarity metrics
Distance distribution
In high dimensions, a nearest-neighbour query is unstable if the distance from the query point to most data points is less than (1 + ε) times the distance from the query point to its nearest neighbour
Beyer et al., When Is “Nearest Neighbor” Meaningful?, ICDT 1999
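Stated as a formula (a paraphrase of Beyer et al.’s definition; the slide leaves “most data points” informal, so the fraction α is an illustrative placeholder):

```latex
% A nearest-neighbour query q over a dataset D is epsilon-unstable if
% most points lie almost as close to q as its nearest neighbour NN(q):
\left|\{\, x \in D : d(q, x) \le (1 + \varepsilon)\, d(q, \mathrm{NN}(q)) \,\}\right|
\;\ge\; \alpha \,\lvert D \rvert, \qquad \text{e.g. } \alpha = \tfrac{1}{2}
```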
Analysis of similarity metrics
Distance distribution
• Quality q(n, f): the fraction of users for which the similarity function has ranked at least a fraction n of the whole community within a factor f of the nearest neighbour’s similarity value (sketched below)
• Other features of the similarity distribution are also considered
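A sketch of how q(n, f) could be computed from a user–user similarity matrix. It assumes non-negative similarities and reads “within a factor f of the nearest neighbour” as sim ≥ nn_sim / f; that reading is an assumption, not spelled out on the slide:

```python
import numpy as np

def quality(sims, n=0.5, f=2.0):
    """Beyer-style quality q(n, f).

    sims: 2D array with sims[u, v] = similarity of user u to user v
    (the diagonal self-similarities are ignored).
    Returns the fraction of users whose similarity function places at
    least a fraction n of the community within a factor f of the
    nearest neighbour's similarity value.
    """
    num_users = sims.shape[0]
    hits = 0
    for u in range(num_users):
        others = np.delete(sims[u], u)        # exclude the user itself
        nn_sim = others.max()                 # nearest neighbour's similarity
        close = np.sum(others >= nn_sim / f)  # within a factor f of the NN
        if close >= n * len(others):
            hits += 1
    return hits / num_users
```

With n = 0.5 and f = 2, for instance, q counts the users for whom half the community is at least half as similar as their nearest neighbour.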
Analysis of similarity metrics
Nearest-neighbour graph (NN_k)
• A binary relation capturing whether or not a user belongs to another user’s neighbourhood (constructed as sketched below)
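A sketch of the NN_k graph construction. The paper used the JUNG library (Java) for the graph metrics; networkx is used here purely for illustration:

```python
import numpy as np
import networkx as nx

def nn_graph(sims, k=10):
    """Directed NN_k graph: an edge u -> v iff v is among the k most
    similar users to u under the given similarity matrix."""
    g = nx.DiGraph()
    num_users = sims.shape[0]
    g.add_nodes_from(range(num_users))
    for u in range(num_users):
        scores = sims[u].copy()
        scores[u] = -np.inf              # a user is not its own neighbour
        top_k = np.argsort(scores)[-k:]  # indices of the k largest sims
        g.add_edges_from((u, int(v)) for v in top_k)
    return g

# Example graph features one might extract from NN_k:
# nx.average_clustering(g.to_undirected())
# nx.degree_assortativity_coefficient(g)
```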
Experimental setup
Dataset
• MovieLens 1M: 6K users, 4K items, 1M ratings
• Random 5-fold training/test split
JUNG library for graph-related metrics
Evaluation
• For each relevant item, generate a ranking containing that item plus 100 non-relevant items
• Metric: mean reciprocal rank (MRR), computed as sketched below
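Under that protocol, each relevant test item is ranked against 100 sampled non-relevant items, and MRR averages the reciprocal of the relevant item’s rank. A minimal sketch (the sampling details are assumptions):

```python
def rank_of_relevant(score_relevant, scores_nonrelevant):
    """1-based rank of the relevant item when scored together with the
    sampled non-relevant items (higher score = better)."""
    return 1 + sum(s > score_relevant for s in scores_nonrelevant)

def mrr(ranks):
    """Mean reciprocal rank over the 1-based ranks of all relevant items."""
    return sum(1.0 / r for r in ranks) / len(ranks)

# Example: relevant items ranked 1st, 3rd, and 10th in three test cases
print(mrr([1, 3, 10]))  # ~0.478
```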
Performance analysis
Correlations between performance and features of each similarity (and its variations)
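The analysis amounts to correlating, across similarity variants, a feature value with the MRR that variant achieves. The slide does not state which correlation coefficient was used; Pearson’s r via scipy is shown here as one plausible choice:

```python
from scipy.stats import pearsonr

def feature_performance_correlation(feature_values, mrr_values):
    """Correlation between a similarity feature (e.g. q(n, f)) and
    recommendation performance, computed across similarity variants."""
    r, p_value = pearsonr(feature_values, mrr_values)
    return r, p_value
```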
Performance analysis – quality
Correlations between performance and characteristics of each similarity (and its variations)
For a user
• If most of the user population is far away, low quality correlates with effectiveness (the similarity is discriminative)
• If most of the user population is close, high quality correlates with ineffectiveness (not discriminative enough)
(Recall: quality q(n, f) is the fraction of users for which the similarity function has ranked at least a fraction n of the whole community within a factor f of the nearest neighbour’s similarity value)
Performance analysis – examples
Conclusions (so far)
We have found similarity features that correlate with final performance
• They are global properties, in contrast with query performance predictors
• Results compatible with those in the database literature: the stability of a metric is related to its ability to discriminate between good and bad neighbours
Application
Transform “bad” similarity metrics into better-performing ones
• Adjusting their values according to the correlations found
Transform their distributions
• Using a distribution-based normalisation [Fernández, Vallet, Castells, ECIR 06]
• Take as ideal distribution F the best-performing similarity (Cosine Full0), as sketched below
Results
• The rest of the characteristics are not (necessarily) inherited
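A sketch of one way to realise such a distribution-based normalisation, by quantile matching against the ideal distribution F (in the spirit of Fernández, Vallet & Castells, ECIR 2006, but not necessarily their exact formulation):

```python
import numpy as np

def distribution_normalise(values, ideal_values):
    """Map each similarity value onto the 'ideal' distribution by
    quantile matching: a value at quantile p in the source distribution
    is replaced by the value at quantile p of the ideal
    (best-performing) similarity's distribution."""
    src = np.asarray(values, dtype=float)
    ideal_sorted = np.sort(np.asarray(ideal_values, dtype=float))
    # Empirical quantile of each source value within its own distribution
    quantiles = src.argsort().argsort() / max(len(src) - 1, 1)
    # Look up the same quantile in the ideal distribution
    idx = (quantiles * (len(ideal_sorted) - 1)).round().astype(int)
    return ideal_sorted[idx]
```

Applying this to a poorly performing similarity reshapes its value distribution to match Cosine Full0’s; as the slide notes, the other characteristics are not necessarily inherited.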
Conclusions
We have found similarity features that correlate with final performance
• They are global properties, in contrast with query performance predictors
• Results compatible with those in the database literature: the stability of a metric is related to its ability to discriminate between good and bad neighbours
Results are not conclusive when transforming bad-performing similarities based on distribution normalisations
• We want to explore (and adapt to) other features, e.g., graph distance
• We aim to develop other applications based on these results, e.g., hybrid recommendation
Thank you
Understanding Similarity Metrics in Neighbour-based Recommender Systems
Alejandro Bellogín, Arjen de Vries
Information Access, CWI
ICTIR, October 2013
Different similarity metrics – all the results
Performance results for variations of two metrics
• Cosine
• Pearson
Variations
• Thresholding: threshold to filter out similarities (no observed difference)
• Imputation: default value for unrated items
Beyer’s “quality”