sparse random graphs with exchangeable point processes
play

Sparse random graphs with exchangeable point processes Fran cois - PowerPoint PPT Presentation

Sparse random graphs with exchangeable point processes Fran cois Caron Department of Statistics, Oxford Statistics Seminar, Bocconi University March 26, 2015 Joint work with Emily Fox (U. Washington) F. Caron 1 / 57 Introduction


  1. Sparse random graphs with exchangeable point processes Fran¸ cois Caron Department of Statistics, Oxford Statistics Seminar, Bocconi University March 26, 2015 Joint work with Emily Fox (U. Washington) F. Caron 1 / 57

  2. Introduction Exchangeable matrices and their limitations Statistical network models using exchangeable random measures Exchangeability and sparsity properties Special case: Generalized gamma process Posterior characterization & Inference Experimental results F. Caron 2 / 57

  3. Outline Introduction Exchangeable matrices and their limitations Statistical network models using exchangeable random measures Exchangeability and sparsity properties Special case: Generalized gamma process Posterior characterization & Inference Experimental results F. Caron 3 / 57

  4. Introduction 4 2 3 2 3 1 2 3 1 1 ◮ Multi-edges directed graphs ◮ Emails ◮ Citations ◮ WWW ◮ Simple graphs ◮ Social network ◮ Protein-protein interaction F. Caron 4 / 57

  5. Introduction Readers/Customers A 1 A 2 B 1 B 2 B 3 B 4 ◮ Bipartite graphs ◮ Scientists authoring papers ◮ Readers reading books ◮ Internet users posting messages on forums ◮ Customers buying items F. Caron 5 / 57

  6. Introduction ◮ Build a statistical model of the network to ◮ Find interpretable structure in the network ◮ Predict missing edges ◮ Predict connections of new nodes F. Caron 6 / 57

  7. Introduction ◮ Properties of real world networks ◮ Sparsity Dense graph: n e = Θ( n 2 ) Sparse graph: n e = o ( n 2 ) with n e the number of edges and n the number of nodes ◮ Power-law degree distributions [Newman, 2009, Clauset et al., 2009] F. Caron 7 / 57

  8. Book-crossing community network 5 000 readers, 36 000 books, 50 000 edges F. Caron 8 / 57

  9. Book-crossing community network Degree distributions on log-log scale 0 0 10 10 −1 −1 10 10 −2 −2 10 10 Distribution Distribution −3 −3 10 10 −4 −4 10 10 −5 −5 10 10 −6 −6 10 10 −7 −7 10 10 0 1 2 3 4 0 1 2 10 10 10 10 10 10 10 10 Degree Degree (a) Readers (b) Books F. Caron 9 / 57

  10. Outline Introduction Exchangeable matrices and their limitations Statistical network models using exchangeable random measures Exchangeability and sparsity properties Special case: Generalized gamma process Posterior characterization & Inference Experimental results F. Caron 10 / 57

  11. Introduction ◮ Statistical network modeling ◮ Probabilistic symmetry: exchangeability ◮ Ordering of the nodes is irrelevant 2 3 1 F. Caron 11 / 57

  12. Introduction ◮ Statistical network modeling ◮ Probabilistic symmetry: exchangeability ◮ Ordering of the nodes is irrelevant 3 1 2 F. Caron 12 / 57

  13. Introduction ◮ Graphs usually represented by a discrete structure ◮ Adjacency matrix X ij ∈ { 0 , 1 } , ( i, j ) ∈ N 2 ◮ Joint exchangeability d ( X ij ) = ( X π ( i ) π ( j ) ) for any permutation π of N          π         � �� � π F. Caron 13 / 57

  14. Introduction ◮ Aldous-Hoover representation theorem ( X ij ) = ( F ( U i , U j , U { ij } )) where U i , U { ij } are uniform random variables and F is a random function from [0 , 1] 3 to { 0 , 1 } ◮ Several network models fit in this framework (e.g. stochastic blockmodel, infinite relational model, etc.) [Hoover, 1979, Aldous, 1981, Lloyd et al., 2012] F. Caron 14 / 57

  15. Introduction ◮ Corollary of A-H theorem Exchangeable random graphs are either empty or dense ◮ To quote the survey paper of Orbanz and Roy “the theory [...] clarifies the limitations of exchangeable models. It shows, for example, that most Bayesian models of network data are inherently misspecified” ◮ Give up exchangeability for sparsity? e.g. preferential attachment model [Barab´ asi and Albert, 1999, Orbanz and Roy, 2015] F. Caron 15 / 57

  16. Outline Introduction Exchangeable matrices and their limitations Statistical network models using exchangeable random measures Exchangeability and sparsity properties Special case: Generalized gamma process Posterior characterization & Inference Experimental results F. Caron 16 / 57

  17. Point process representation ◮ Representation of a graph as a (marked) point process over R 2 + ◮ Representation theorem by Kallenberg for jointly exchangeable point processes on the plane ◮ Construction based on a completely random measure ◮ Properties of the model ◮ Exchangeability ◮ Sparsity ◮ Power-law degree distributions (with exponential cut-off) ◮ Interpretable parameters and hyperparameters ◮ Reinforced urn process construction ◮ Posterior characterization ◮ Scalable inference [Kallenberg, 2005, Caron and Fox, 2014] F. Caron 17 / 57

  18. Point process representation ◮ Undirected graph represented as a point process on R 2 + � Z = z ij δ ( θ i ,θ j ) i,j with θ i ∈ R , z ij ∈ { 0 , 1 } with z ij = z ji 0 F. Caron 18 / 57

  19. Point process representation Joint exchangeability Let A i = [ h ( i − 1) , hi ] for i ∈ N then d ( Z ( A i × A j )) = ( Z ( A π ( i ) × A π ( j ) )) for any permutation π of N and any h > 0 0 F. Caron 19 / 57

  20. Point process representation ◮ Kallenberg derived a de Finetti style representation theorem for jointly and separately exchangeable point processes on the plane ◮ Representation via random transformations of unit rate Poisson processes and uniform variables ◮ Continuous-time equivalent of Aldous-Hoover for binary variables ◮ Our construction will fit into this framework [Kallenberg, 1990, Kallenberg, 2005] F. Caron 20 / 57

  21. Completely random measures ◮ Nodes are embedded at some location θ i ∈ R + ◮ To each node is associated some sociability parameter w i ◮ Homogeneous completely random measure on R + ∞ � W = w i δ θ i W ∼ CRM ( ρ, λ ) . i =1 w i 0 θ i ◮ L´ evy measure ν ( dw, dθ ) = ρ ( dw ) λ ( dθ ) with λ the Lebesgue measure [Kingman, 1967] F. Caron 21 / 57

  22. Completely random measures ◮ L´ evy measure ν ( dw, dθ ) = ρ ( dw ) λ ( dθ ) with λ the Lebesgue measure ◮ ρ is a measure on R + such that � ∞ (1 − e − w ) ρ ( dw ) < ∞ . (1) 0 which implies that W ([0 , T ]) < ∞ for any T < ∞ . � ∞ ρ ( dw ) = ∞ = ⇒ Infinite number of jumps in any interval [0 , T ] 0 “Infinite activity CRM” � ∞ ρ ( dw ) < ∞ = ⇒ Finite number of jumps in any interval [0 , T ] 0 “Finite activity CRM” F. Caron 22 / 57

  23. Model for multi-edges directed graphs We represent the integer-weighted directed graph using an atomic measure on R 2 + ∞ ∞ � � D = n ij δ ( θ i ,θ j ) , i =1 j =1 where n ij counts the number of directed edges from node θ i to node θ j . 4 3 Counts 2 θ 2 θ 3 1 4 1 0 θ 2 θ 1 θ 2 2 3 θ 1 θ 1 θ 3 θ 3 F. Caron 23 / 57

  24. Model for multi-edges directed graphs ◮ Conditional Poisson process with intensity measure � W = W × W on the product space R 2 + : D | W ∼ PP ( W × W ) 18 18 4.5 16 16 4 14 14 12 12 3.5 10 10 3 8 8 6 6 2.5 4 4 2 2 2 0 0 1.5 1 1 0.8 0.8 1 1 1 0.6 0.8 0.6 0.8 0.6 0.6 0.4 0.4 0.5 0.4 0.4 0.2 0.2 0.2 0.2 0 0 0 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 (d) Intensity measure � (c) CRM W W (e) Poisson process D F. Caron 24 / 57

  25. Model for multi-edges directed graphs ◮ By construction, for any bounded intervals A and B of R + , � W ( A × B ) = W ( A ) W ( B ) < ∞ ◮ Finite number of counts over A × B ⊂ R 2 + D ( A × B ) < ∞ F. Caron 25 / 57

  26. Model for undirected graphs ◮ Point process ∞ ∞ � � Z = z ij δ ( θ i ,θ j ) , i =1 j =1 with the convention z ij = z ji ∈ { 0 , 1 } ◮ Constructed from D by setting z ij = z ji = 1 if n ij + n ji > 0 and z ij = z ji = 0 otherwise 4 3 Counts 2 θ 2 1 θ 3 θ 2 θ 3 4 1 0 θ 2 θ 1 θ 1 θ 1 2 3 θ 2 θ 1 θ 3 θ 3 (a) D (b) Integer-valued (c) Undirected graph directed graph F. Caron 26 / 57

  27. Model for undirected graphs ◮ Hierarchical model W = � ∞ i =1 w i δ θ i W ∼ CRM ( ρ, λ ) D = � ij n ij δ ( θ i ,θ j ) D ∼ PP ( W × W ) Z = � ij min( n ij + n ji , 1) δ ( θ i ,θ j ) F. Caron 27 / 57

  28. Model for undirected graphs ◮ Equivalent direct formulation for i ≤ j � 1 − exp( − 2 w i w j ) i � = j Pr( z ij = 1 | w ) = 1 − exp( − w 2 i ) i = j and z ji = z ij 0 F. Caron 28 / 57

  29. Outline Introduction Exchangeable matrices and their limitations Statistical network models using exchangeable random measures Exchangeability and sparsity properties Special case: Generalized gamma process Posterior characterization & Inference Experimental results F. Caron 29 / 57

  30. Properties: Exchangeability Exchangeability Let h > 0 and A i = [ h ( i − 1) , hi ] , i ∈ N . By construction, d ( Z ( A i × A j )) = ( Z ( A π ( i ) × A π ( j ) )) for any permutation π of N . F. Caron 30 / 57

  31. Properties: Sparsity ◮ W ( R + ) = ∞ , so infinite number of edges on R 2 + ◮ Restrictions D α and Z α of D and Z , respectively, to the box [0 , α ] 2 . ◮ N α number of nodes, and N ( e ) number of edges α α 0 α F. Caron 31 / 57

  32. Properties: Sparsity N α N ( e ) 6 α 4 2 0 α 0 0 F. Caron 32 / 57

Recommend


More recommend