cross domain recommendation via clustering on multi layer
play

Cross-Domain Recommendation via Clustering on Multi-Layer Graphs Al - PowerPoint PPT Presentation

Cross-Domain Recommendation via Clustering on Multi-Layer Graphs Al Aleksandr Fa Farseev, Ivan Samborskii, Andrey Filchenkov, Tat-Seng Chua By AleksandrFarseev http://farseev .com Aug 8 th , 2017 Venue Category Recommendation Collaborative


  1. Cross-Domain Recommendation via Clustering on Multi-Layer Graphs Al Aleksandr Fa Farseev, Ivan Samborskii, Andrey Filchenkov, Tat-Seng Chua By AleksandrFarseev http://farseev .com Aug 8 th , 2017

  2. Venue Category Recommendation Collaborative Venue Category Recommendation – recommendation of venue categories (i.e. restaurant, cinema) to user using information about his/her profile (i.e. past visits) and/or information about users from the same domain. Venue categories: Clothing Store Hotel Venue categories: Ice Cream Shop Total 764 different categories

  3. Idea 1: Utilization of Individual And Group Knowledge for Better Recommendation

  4. User Community-Based Collaborative Recommendation We perform venue category recommendation based on both individual and group knowledge => naturally models the impact of society on an individual's behavior during the selection of a new place to go: βˆ‘ 𝑀𝑓𝑑 0 0∈2 3 𝑠𝑓𝑑 𝑣 = 𝑑𝑝𝑠𝑒 𝛿 * 𝑀𝑓𝑑 , + πœ„ 𝐷 , +

  5. What do we need user communities for? + Users from the same community (extracted from multi-source data) may have similar location preferences + Search within user community significantly reduces search space during the recommendation process

  6. Example of User Communities (1) Community 1: Gingers Community K: Darker Hair

  7. User Relation and Community Representations One way to find user communities is to model users' relationships in the form of a graph so that dense subgraphs are considered to be user communities.

  8. Community Detection based on a single data source One of the commonly formulations is MinCut problem. For a given number k of subsets, the MinCut involves choosing a partition 𝐷 ; ,…, 𝐷 > such that it minimizes the expression: > 𝑑𝑣𝑒 𝐷 ; ,… ,𝐷 > = ? 𝑋(𝐷 B ,𝐷̅ B ) BE; *W is the sum of weights of edges attached to vertices in 𝐷 B

  9. How to solve MinCut problem? Approximation of MinCut as standard tr st trace mi minimi mization problem: m: H∈I JΓ—L tr 𝑉 O 𝑀𝑉 ,s.t. 𝑉 O 𝑉 = 𝐽 min which can be solved by Sp Spectral Clu lusterin ing: Calculates Laplacian matrix 𝑀 ∈ 𝑆 UΓ—U 1. 2. Builds matrix of the first 𝑙 eigenvectors 𝑉 ∈ 𝑆 UΓ—> correspond to the smallest eigenvalues of 𝑀 3. Clusters data in a new space 𝑉 using i.e. 𝑙 -means algorithm

  10. Idea 2: Utilization of Multi-Source Data

  11. Most of user actively use β‰ˆ 3 social networks Accounts Ac ~6 registered social network ~6 accounts per person* 5 Ac Active Usage 4 6 People actively use ~3 ~3 social platforms simultaneously* 3 7 2 8 1 9 0 10 * GlobalWebIndex. 2016. GWI Social report. http://www.globalwebindex.net/blog/internet-users-have-average-of-5-social-media-accounts

  12. Multi-source data describe user from multiple views

  13. Cross-Domain Venue Category Recommendation Cr Cross Domain - Ve Venue ca category reco commendation – recommendation of venue categories (i.e. restaurant, cinema) using information about his/her profile (i.e. past visits) and/or information about users from other sources (i.e. images, texts, location types). Venue categories: Clothing Store Hotel Ice Cream Shop Multi-Source Data:

  14. Community Detection must performed in a Cross-Source Manner… Problems: β€’ Data source integration β€’ Community detection

  15. How to represent multi-source data? Mu Multi-la layer graph – graph 𝐻 , where 𝐻 = 𝐻 B , 𝐻 B = π‘Š,𝐹 B

  16. Extending definition of spectral clustering [ H∈I JΓ—L ? tr 𝑉 O 𝑀 B 𝑉 , s.t.𝑉 O 𝑉 = 𝐽 min BE; [ H∈I JΓ—L tr 𝑉 O 𝑀 \,] 𝑉 , where 𝑀 \,] = ? 𝑀 B min BE; Such approximation could suffer from poor poor ge gene neralization on abi bility.

  17. Regularized Clustering on Multi-layer Graph -1 Use Gr Grassman Ma Manifolds to keep final latent representation β€œclose” to all layers of multi-layer graph*. Where projected distance between two spaces 𝑍 ; and 𝑍 b : b = 1 b ,where 𝐡 k is the Frobenius norm O βˆ’ 𝑍 b O 𝑒 defg 𝑍 ; ,𝑍 2 𝑍 ; 𝑍 b 𝑍 ; b k [ = 𝑙𝑁 βˆ’ ?tr(𝑇𝑇 O βˆ’ 𝑇 B 𝑇 B b [ O ) 𝑒 defg 𝑇, 𝑇 B BE; BE; * X. Dong, P. Frossard, P. Vandergheynst, and N. Nefedov. Clustering on multi-layer graphs via subspace analysis on grassmann manifolds. IEEE Transactions on Signal Processing, 2014.

  18. Regularized Clustering on Multi-layer Graph -2 Extends the objective function to introduce the subspace analysis regularization [ [ O 𝑉 O 𝑀 B 𝑉 + 𝛽 𝑉𝑉 O 𝑉 B 𝑉 B ,s.t. 𝑉 O 𝑉 = 𝐽 Hβˆˆβ„ JΓ—L ? tr min 𝑙𝑁 βˆ’ ? tr BE; BE; Hβˆˆβ„ JΓ—L tr 𝑉 O 𝑀 ]ft 𝑉 min [ O ) 𝑀 ]ft = ?(𝑀 B βˆ’ 𝛽𝑉 B 𝑉 B BE;

  19. Idea 4: Making use of Inter-Layer (Inter-Source) Relations

  20. Incorporating inter-layer relationship (1) By using distance on Grassman Manifolds, we present the new objective function for the 𝑗 th layer: [ v B O 𝑀 B 𝑉 v B + 𝛾 B v B 𝑉 v B O 𝑉 O v w βˆˆβ„ JΓ—L tr 𝑉 min 𝑙𝑁 βˆ’ ? π‘₯ B,g tr 𝑉 g 𝑉 g H gE;,gzB O 𝑀 v B { B 𝑉 v B v w βˆˆβ„ JΓ—L tr 𝑉 min H [ { B = 𝑀 B βˆ’ 𝛾 B O 𝑀 ? π‘₯ B,g tr 𝑉 g 𝑉 g gE;,gzB

  21. But how can we determine w |,} when computing i-th layer ? O 𝑀 v B { B 𝑉 v B v w βˆˆβ„ JΓ—L tr 𝑉 min H [ { B = 𝑀 B βˆ’ 𝛾 B O 𝑀 ? π‘₯ B,g tr 𝑉 g 𝑉 g gE;,gzB In Inter-la layer rela latio ionship ip graph 𝑺(𝑾,𝑭) – weighted graph which represents the similarity between layers. 𝑁 B,> βˆ’ 𝑁 g,> β€ž βˆ‘ 1 βˆ’ >Eb 𝑂 𝑂 βˆ’ 1 βˆ€ 𝑗,π‘˜ ∈ 𝐹, π‘₯ B,g = 𝐿 βˆ’ 1 where 𝑁 B,> is clustering co-occurrence matrix of layer 𝑗 , 𝑛 ‑,Λ† = 1, if users 𝑏 and 𝑐 assigned to the same cluster , and 0 otherwise.

  22. Final objective function Let’s combine equations from previous slides to define the final objective function: [ [ { B 𝑉 + 𝛽 v B 𝑉 v B O 𝑉 O 𝑀 𝑉𝑉 O 𝑉 min βˆˆβ„ JΓ—L ?tr 𝑙𝑁 βˆ’ ? tr = H BE; BE; [ βˆˆβ„ JΓ—L tr 𝑉 O ?(𝑀 { B βˆ’ 𝛽𝑉 v B 𝑉 v B O ) = min 𝑉 H BE;

  23. Problems β€’ Community detection β€’ Data source integration

  24. Recall: Community-Based Cross-Domain Recommendation We perform venue category recommendation based on both individual and group knowledge, where group knowledge is obtained from multiple sources: βˆ‘ 𝑀𝑓𝑑 0 0∈2 3 𝑠𝑓𝑑 𝑣 = 𝑑𝑝𝑠𝑒 𝛿 * 𝑀𝑓𝑑 , + πœ„ 𝐷 , +

  25. Foursquare Instagram NUS-MSS Dataset Dataset* is presented as a set of features, extracted from user-generated data in three social networks: - text based fromTwitter (LDA, LIWC, text features) - image based from Instagram (concepts) - location based from Foursquare (LDA, categories, Mobility Features) Foursquare categories is splited into two parts: 3 months data (train) and 2 months (test). Twitter * A. Farseev, N. Liqiang, M. Akbari, and T.-S. Chua. Ha Harvesting multiple so sources s for use ser profile learning: a Big data st study. ACM International Conference on Multimedia Retrieval (ICMR). China. June 23-26, 2015.

  26. Data Sources Text Features: Linguistic features: LIWC; Latent Topics Heuristic features: Writing behavior LIWC LDA Location Features: Location Semantics: Venue Category Distribution Mobility Location Type Mobility Features: Areas of Interest (AOI) Preferences Image Features Image Google Net Concepts Image Concept Distribution (Image Net) Images

  27. Evaluation Baselines Re Recommender Systems Co Community Detection Approaches β€’ 𝐣 β€” C ’ R recommendation without inter-layer 𝐃 πŸ’ 𝐒 βˆ’ 𝐌 Po Popular (PO POP) P) β€”recommendation based on user’s past regularization experience β€’ 𝐣 - 𝐌 β€’ 𝐍𝐩𝐞 β€” C ’ R recommendation without inter-layer 𝐃 πŸ’ 𝐒 βˆ’ 𝐌 Popular Al All (POP Al All) ) β€”recommendation based on experience of regularization and sub-space regularization all users 𝐃 πŸ’ 𝐒 βˆ’ 𝑫𝒑𝒏𝒏 β€” C ’ R recommendation without user Mu Multi-So Source Re-Ra Ranking (MSRR) RR) β€” linearly combines community extraction recommendation results from all data modalities 𝐃 πŸ’ 𝐒 (DB ) β€” C ’ R recommendation, where user Nearest Ne Ne Neighbor Collaborative Filtering (CF) β€” DBScan) recommendation based on top k most similar Foursquare users communities are detected by Density-Based clustering (DBScan) Ea Early Fusion (EF EF) β€” fuses multi-source data into a single feature 𝐃 πŸ’ 𝐒 (x means) β€” C ’ R recommendation, where user vector (x-me communities are detected by x-means clustering SV SVD++ β€” makes use of the β€œimplicit feedback” information 𝐃 πŸ’ 𝐒 (H (Hierarchical) β€” C ’ R recommendation, where user FMβ€” brings together the advantages of different factorization- FM communities are detected by Hierarchical Clustering based models via regularization. 𝐃 πŸ’ 𝐒 β€” Our Ap Approach

  28. Evaluation against other recommender systems

  29. Evaluation against other community detection approaches + Incorporation of group knowledge is is important + Multi-modal clustering performs better than single-source clustering + Incorporation of Inter-Source relationshipis crucial.

  30. Evaluation against source combinations + In different geo regions, different data sources are of different importance + Location data is more powerful than other data modalities

Recommend


More recommend