Edge-Weighted Personalized PageRank: Breaking a Decade-Old Performance Barrier (PowerPoint PPT Presentation)



  1. Edge-Weighted Personalized PageRank: Breaking a Decade-Old Performance Barrier. W. Xie, D. Bindel, A. Demers, J. Gehrke. KDD 2015, 12 Aug 2015.

  2. PageRank Model: unweighted, node weighted, edge weighted
     - Random surfer model: x^(t+1) = α P x^(t) + (1 − α) v, where P = A D^(−1)
     - Stationary distribution: M x = b, where M = I − α P and b = (1 − α) v
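To make the random-surfer system concrete, here is a minimal sketch of solving the stationary equation directly with a sparse linear solve; the tiny 4-node graph, the column-stochastic convention P = A D^(−1), and α = 0.85 are illustrative assumptions, not taken from the paper.

```python
# Minimal sketch: personalized PageRank as the linear system
# (I - alpha*P) x = (1 - alpha) v, with P = A D^{-1}.
import numpy as np
import scipy.sparse as sp
import scipy.sparse.linalg as spla

def personalized_pagerank(A, v, alpha=0.85):
    """Solve M x = b with M = I - alpha*P, b = (1 - alpha)*v, P = A D^{-1}."""
    n = A.shape[0]
    d = np.asarray(A.sum(axis=0)).ravel()   # out-degrees (column sums, column-stochastic convention)
    P = A @ sp.diags(1.0 / d)               # transition matrix
    M = sp.identity(n) - alpha * P
    b = (1.0 - alpha) * v
    return spla.spsolve(M.tocsc(), b)

A = sp.csr_matrix(np.array([[0, 1, 1, 0],
                            [1, 0, 0, 1],
                            [0, 1, 0, 1],
                            [1, 0, 1, 0]], dtype=float))
v = np.array([1.0, 0.0, 0.0, 0.0])          # personalization concentrated on node 0
x = personalized_pagerank(A, v)
print(x, x.sum())                           # entries sum to ~1
```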

  3. Edge Weight vs Node Weight Personalization
     - Introduce personalization parameters w ∈ R^d in two ways:
       - Node weights: v_i = v_i(w), giving M x(w) = b(w)
       - Edge weights: p_ij = p_ij(w), giving M(w) x(w) = b
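A minimal sketch of the distinction. The edge-weight case follows the linear parameterization used later in the talk; the per-topic teleportation matrix V in the node-weight case is a hypothetical construction for illustration only.

```python
# Illustrative sketch (not the paper's code) of how the parameters w enter.
import numpy as np

alpha = 0.85

def node_weighted_system(P, V, w):
    """Node weights: M is fixed; only the right-hand side b(w) changes.
    Here b(w) = (1 - alpha) * V @ w, where V's columns are candidate
    teleportation distributions (an assumed construction)."""
    n = P.shape[0]
    M = np.eye(n) - alpha * P
    b = (1.0 - alpha) * (V @ w)
    return M, b

def edge_weighted_system(P_list, v, w):
    """Edge weights: b is fixed; the matrix M(w) = I - alpha * sum_i w_i P^(i)
    changes (the linear edge-weight case from the talk)."""
    n = P_list[0].shape[0]
    P_w = sum(w_i * P_i for w_i, P_i in zip(w, P_list))
    M = np.eye(n) - alpha * P_w
    return M, (1.0 - alpha) * v
```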

  4. Edge Weight vs Node Weight Personalization
     - Node weight personalization is well studied:
       - Topic-sensitive PageRank: fast methods based on linearity
       - Localized PageRank: fast methods based on sparsity
     - Some work on edge weight personalization:
       - ObjectRank/ScaleRank: personalize weights for different edge types
     - But lots of work incorporates edge weights without personalization
     - Our goal: general, fast methods for edge weight personalization

  5. Model Reduction
     - Expensive full model (M x = b) ≈ U × reduced model (M̃ y = b̃): U is the reduced basis, x̂ = U y the approximation ansatz
     - Model reduction procedure from the physical simulation world:
       - Offline: construct a reduced basis U ∈ R^(n×k)
       - Offline: choose ≥ k equations that determine the approximation x̂ = U y
       - Online: solve for y(w) given w and reconstruct x̂

  6. Reduced Basis Construction: SVD (aka POD/PCA/KL)
     - Collect snapshots x_1, x_2, ..., x_r of the PageRank solution at sample points w_1, w_2, ..., w_r
     - Truncated SVD of the snapshot matrix, [x_1 x_2 ... x_r] ≈ U Σ V^T, gives the reduced basis U
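A short sketch of this offline step, assuming a caller-provided full solver (for example the personalized_pagerank sketch above) and a list of sampled parameter vectors; the function name and workflow are assumptions consistent with the slide, not the paper's code.

```python
# Offline: sample parameters, solve the full system at each sample, and keep the
# dominant left singular vectors of the snapshot matrix as the reduced basis U.
import numpy as np

def build_reduced_basis(solve_full, sample_params, k):
    """solve_full(w) -> full PageRank vector x(w); sample_params: r parameter
    vectors; k: reduced dimension."""
    X = np.column_stack([solve_full(w) for w in sample_params])  # n x r snapshot matrix
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    return U[:, :k], s        # basis plus singular values, to check the decay justifying small k
```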

  7. Approximation Ansatz
     - Want the residual r = M U y − b ≈ 0. Consider two approximation conditions:
       - Bubnov-Galerkin: ansatz U^T r = 0; good accuracy empirically; fast when P(w) is linear
       - DEIM: ansatz min ‖r_I‖; fast even for nonlinear P(w); more complex cost/accuracy tradeoff
     - Similar error analysis framework for both (see paper): Consistency + Stability = Accuracy
       - Consistency: does the subspace contain good approximants?
       - Stability: is the approximation subproblem far from singular?

  8. Bubnov-Galerkin Method
     - Impose U^T (M U y − b) = 0
     - Linear case, with w_i = probability of transition along an edge of type i:
       M(w) = I − α Σ_i w_i P^(i)  and  M̃(w) = I − α Σ_i w_i P̃^(i),
       where P̃^(i) = U^T P^(i) U can be precomputed
     - Nonlinear case: the cost to form M̃(w) is comparable to the cost of PageRank!
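A minimal sketch of the offline/online split for the linear edge-weight case, assuming U has orthonormal columns (as it does when taken from an SVD); function names are illustrative, not from the paper.

```python
# Bubnov-Galerkin, linear edge-weight case: precompute P_tilde^(i) = U^T P^(i) U
# offline; online, assemble and solve only a small k x k system.
import numpy as np

def galerkin_offline(U, P_list):
    """Precompute the reduced operators P_tilde^(i) = U^T P^(i) U (k x k each)."""
    return [U.T @ (Pi @ U) for Pi in P_list]

def galerkin_online(U, P_tilde_list, v, w, alpha=0.85):
    """Assemble M_tilde(w) = I - alpha * sum_i w_i P_tilde^(i), solve for y,
    and reconstruct the approximate PageRank vector x_hat = U y."""
    k = U.shape[1]
    M_tilde = np.eye(k) - alpha * sum(wi * Pt for wi, Pt in zip(w, P_tilde_list))
    b_tilde = (1.0 - alpha) * (U.T @ v)
    y = np.linalg.solve(M_tilde, b_tilde)
    return U @ y
```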

  9. Discrete Empirical Interpolation Method (DEIM)
     - Enforce only the equations indexed by a chosen set I: (M U y − b)_I = 0
     - Ansatz: minimize ‖r_I‖ over the chosen indices I
     - Only need a few rows of M (and the associated rows of U)
     - Difference from physics applications: high-degree nodes!
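A small sketch of the corresponding online step, assuming the selected rows of M(w) and the matching entries of b have already been formed; this is a least-squares reading of the "minimize ‖r_I‖" ansatz, not the paper's implementation.

```python
# DEIM-style online solve: min_y || M(w)[I, :] @ U @ y - b[I] ||_2, then x_hat = U @ y.
import numpy as np

def deim_online(U, M_rows_I, b_I):
    """M_rows_I: the |I| x n block of M(w) for the chosen indices I (only these
    rows need to be formed online); b_I: the corresponding entries of b."""
    A_small = M_rows_I @ U                         # |I| x k reduced system
    y, *_ = np.linalg.lstsq(A_small, b_I, rcond=None)
    return U @ y
```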

  10. Interpolation Costs
     - Figure: the subgraph relevant to one interpolation equation i ∈ I, i.e. node i and its incoming neighbors (example edge weights 1/3, 1/50)
     - Really care about the weights of edges incident on I
     - Need more edges to normalize (unless A(w) is linear)
     - High in/out-degree nodes are expensive but informative
     - Key question: how to choose I to balance cost vs accuracy?

  11. Interpolation Accuracy
     - Key: keep M_{I,:} far from singular
     - If |I| = k, this is subset selection over the rows of M U; standard techniques apply (e.g. pivoted QR)
     - Want to pick I once, so look at the rows of Z = [ M(w^(1)) U   M(w^(2)) U   ... ] for sample parameters w^(i)
     - Helps to explicitly enforce Σ_i x̂_i = 1
     - Several heuristics for the cost/accuracy tradeoff (see paper)
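A sketch of the pivoted-QR option named on the slide, assuming Z is built by stacking the blocks M(w^(j)) U for a handful of sampled parameters; selecting well-conditioned rows of Z corresponds to column-pivoted QR on Z^T.

```python
# Row subset selection for the interpolation indices I via pivoted QR.
import numpy as np
from scipy.linalg import qr

def select_interpolation_rows(M_of_w, U, sample_params, k):
    """Pick k row indices I by column-pivoted QR on Z^T, where
    Z = [M(w^(1)) U, M(w^(2)) U, ...] is stacked column-wise."""
    Z = np.hstack([M_of_w(w) @ U for w in sample_params])   # n x (k * num_samples)
    _, _, piv = qr(Z.T, pivoting=True, mode='economic')     # pivots = well-conditioned rows of Z
    return np.sort(piv[:k])
```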

  12. Online Costs
     - With ℓ = number of PageRank components needed, the online costs are:
       - Form M̃: O(d k^2) for Bubnov-Galerkin; more complex for DEIM
       - Factor M̃: O(k^3)
       - Solve for y: O(k^2)
       - Form U y: O(k ℓ)
     - Online costs do not depend on the graph size (unless you want the whole PageRank vector)

  13. Example Networks
     - DBLP (citation network): 3.5M nodes / 18.5M edges; seven edge types ⇒ seven parameters; P(w) linear; competition: ScaleRank
     - Weibo (micro-blogging): 1.9M nodes / 50.7M edges; edges weighted by topical similarity of posts; number of parameters = number of topics (5, 10, 20)
     - (Studied global and local PageRank; see paper for the latter.)

  14. Singular Value Decay
     - Plot: i-th largest singular value (log scale, roughly 10^6 down to 10^-1 over the first 200 indices) for DBLP-L, Weibo-S5, Weibo-S10, and Weibo-S20
     - r = 1000 samples, k = 100

  15. DBLP Accuracy
     - Bar chart (log scale, 10^0 down to 10^-5): Kendall@100 and normalized L1 error for Galerkin, the DEIM variants (including DEIM-100 and DEIM-200), and ScaleRank

  16. DBLP Running Times (All Nodes)
     - Bar chart: running time in seconds (0 to 0.7), split into coefficient solve and construction, for Galerkin, the DEIM variants (including DEIM-100 and DEIM-200), and ScaleRank

  17. Weibo Accuracy
     - Bar chart (log scale, 10^-1 down to 10^-4): Kendall@100 and normalized L1 error vs number of parameters (5, 10, 20)

  18. Weibo Running Times (All Nodes)
     - Bar chart: running time in seconds (0 to 0.5), split into coefficient solve and construction, vs number of parameters (5, 10, 20)

  19. Application: Learning to Rank
     - Goal: given training pairs T = {(i_q, j_q)}, q = 1, ..., |T|, find w that mostly ranks i_q above j_q (cf. Backstrom and Leskovec, WSDM 2011)
     - Standard approach: gradient descent on the full problem
       - One PageRank computation for the objective
       - One PageRank computation for each gradient component
       - Costs d + 1 PageRank computations per step
     - With model reduction (see the sketch below):
       - Rephrase the objective in the reduced coordinate space
       - Use a factorization to solve the reduced PageRank system for the objective
       - Re-use the same factorization for the gradient
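A small sketch of why one factorization suffices, assuming the reduced linear edge-weight case from the Bubnov-Galerkin slide: differentiating M̃(w) y = b̃ gives M̃(w) (∂y/∂w_i) = α P̃^(i) y, so the same LU factors serve the objective and every gradient component. Function and variable names are illustrative, not from the paper.

```python
# Reduced solve plus all d gradient directions from a single k x k LU factorization.
import numpy as np
from scipy.linalg import lu_factor, lu_solve

def reduced_solution_and_gradient(P_tilde_list, b_tilde, w, alpha=0.85):
    k = b_tilde.shape[0]
    M_tilde = np.eye(k) - alpha * sum(wi * Pt for wi, Pt in zip(w, P_tilde_list))
    lu = lu_factor(M_tilde)                                    # factor once ...
    y = lu_solve(lu, b_tilde)                                  # ... solve for y(w) (objective)
    dy = [lu_solve(lu, alpha * (Pt @ y)) for Pt in P_tilde_list]   # ... reuse for each dy/dw_i
    return y, dy
```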

  20. DBLP Learning Task
     - Plot: objective function value (about 100 to 400) vs iteration (0 to 20) for Standard, Galerkin, and DEIM-200
     - (8 papers for training + 7 params)

  21. The Punchline
     - Test case: DBLP, 3.5M nodes, 18.5M edges, 7 params
     - Cost per iteration:
       Method      Standard    Bubnov-Galerkin    DEIM-200
       Time (sec)  159.3       0.002              0.033

  22. Roads Not Taken
     - In the paper (but not the talk):
       - Selecting interpolation equations for DEIM
       - Localized PageRank experiments (Weibo and DBLP)
       - Comparison to BCA for localized PageRank
       - Quasi-optimality framework for error analysis
     - Room for future work: analysis, applications, systems, ...

  23. Questions?
     - Edge-Weighted Personalized PageRank: Breaking a Decade-Old Performance Barrier. Wenlei Xie, David Bindel, Johannes Gehrke, and Al Demers. KDD 2015, paper 117.
     - Sponsors: NSF (IIS-0911036 and IIS-1012593); iAd Project from the National Research Council of Norway
