of graph embeddings
play

of Graph Embeddings Aleksandar Bojchevski Technical University of - PowerPoint PPT Presentation

Uncertainty and Robustness of Graph Embeddings Aleksandar Bojchevski Technical University of Munich, Germany Graph Embedding Day 2018 - Lyon Neglected aspects of graph embeddings Capturing uncertainty Robustness to noise Robustness to


  1. Uncertainty and Robustness of Graph Embeddings Aleksandar Bojchevski Technical University of Munich, Germany Graph Embedding Day 2018 - Lyon

  2. Neglected aspects of graph embeddings Capturing uncertainty Robustness to noise Robustness to adversarial attacks Uncertainty and Robustness of Graph Embeddings - Bojchevski 2

  3. Neglected aspects of graph embeddings Capturing uncertainty Robustness to noise Robustness to adversarial attacks Uncertainty and Robustness of Graph Embeddings - Bojchevski 3

  4. Nodes are points in a low-dimensional space Uncertainty and Robustness of Graph Embeddings - Bojchevski 4

  5. Nodes are distributions Uncertainty and Robustness of Graph Embeddings - Bojchevski 5

  6. Graph2Gauss - 3 key modeling ideas 1. Uncertainty 2. Personalized ranking 3. Inductiveness 𝑦 𝑗 𝑙 = 2 𝑔 πœ„ (𝑦 𝑗 ) deep 𝑙 = 1 encoder π’ͺ( 𝜈 𝑗 , Ξ£ 𝑗 ) Uncertainty and Robustness of Graph Embeddings - Bojchevski 6

  7. Uncertainty Embed nodes as (Gaussian) distributions Sources of uncertainty: β€’ Conflicting structure and attributes β€’ Heterogenous neighborhood β€’ Noise, outliers, anomalies, …. Uncertainty and Robustness of Graph Embeddings - Bojchevski 7

  8. Personalized ranking 𝑙 = 2 For each node 𝑗 : nodes in its (𝑙) -hop neighborhood 𝑙 = 1 should be closer to 𝑗 compared to nodes in its (𝑙 + 1) -hop neighborhood Uncertainty and Robustness of Graph Embeddings - Bojchevski 8

  9. Personalized ranking 𝑙 = 2 For each node 𝑗 : nodes in its (𝑙) -hop neighborhood 𝑙 = 1 should be closer to 𝑗 compared to nodes in its (𝑙 + 1) -hop neighborhood Uncertainty and Robustness of Graph Embeddings - Bojchevski 9

  10. Personalized ranking For each node 𝑗 : nodes in its (𝑙) -hop neighborhood 𝑙 = 2 should be closer to 𝑗 compared to nodes in its (𝑙 + 1) -hop neighborhood 𝑙 = 1 Example: closer in terms of the KL Diveregence KL is asymmetric β‡’ handles directed graphs Uncertainty and Robustness of Graph Embeddings - Bojchevski 10

  11. Personalized ranking 𝑙 = 2 Personalized ranking implies pairwise constraints for node 𝑗 𝑙 = 1 D 𝐿𝑀 (π’ͺ π‘˜ ||π’ͺ 𝑗 ) < D 𝐿𝑀 (π’ͺ π‘˜ β€² ||π’ͺ 𝑗 ) (𝑙 β€² ) , βˆ€π‘™ < 𝑙′ (𝑙) , βˆ€π‘˜ β€² ∈ 𝑂 𝑗 βˆ€π‘˜ ∈ 𝑂 𝑗 set of nodes in the 𝑙 -hop neighborhood of node 𝑗 Uncertainty and Robustness of Graph Embeddings - Bojchevski 11

  12. Inductiveness 𝑦 𝑗 𝑔 πœ„ (𝑦 𝑗 ) deep Generalize to unseen nodes by learning encoder a mapping from features to embeddings π’ͺ( 𝜈 𝑗 , Ξ£ 𝑗 ) Uncertainty and Robustness of Graph Embeddings - Bojchevski 12

  13. Graph2Gauss - 3 key modeling ideas 1. Uncertainty 2. Personalized ranking 3. Inductiveness 𝑦 𝑗 𝑙 = 2 𝑔 πœ„ (𝑦 𝑗 ) deep 𝑙 = 1 encoder π’ͺ( 𝜈 𝑗 , Ξ£ 𝑗 ) Uncertainty and Robustness of Graph Embeddings - Bojchevski 13

  14. Learning with energy-based loss 2 + exp βˆ’πΉ π‘—π‘˜β€² ) β„’ = Οƒ 𝑗,π‘˜,π‘˜ β€² (𝐹 π‘—π‘˜ 𝐹 π‘—π‘˜ = D 𝐿𝑀 (π’ͺ π‘˜ | π’ͺ 𝑗 Closer nodes should have lower energy Naively: 𝑃(𝑂 3 ) complexity Node-anchored sampling strategy: β€’ For each node same one another node from every neighborhood β€’ Less than 4.2% triplets seen to match performance β€’ Lower gradient variance Uncertainty and Robustness of Graph Embeddings - Bojchevski 14

  15. Graph2Gauss is parameter/data efficient Uncertainty and Robustness of Graph Embeddings - Bojchevski 15

  16. Graph2Gauss captures uncertainty Uncertainty correlates with diversity Diversity: number of distinct classes in a node’s k -hop neighborhood Uncertainty and Robustness of Graph Embeddings - Bojchevski 16

  17. Graph2Gauss captures uncertainty Uncertainty reveals the intrinsic latent dimensionality of the graph Detected latent dimensions β‰ˆ number ground-truth communities Uncertainty and Robustness of Graph Embeddings - Bojchevski 17

  18. Uncertainty and link prediction Prune dimensions with high uncertainty Maintaining link prediction performance Uncertainty and Robustness of Graph Embeddings - Bojchevski 18

  19. Graph2Gauss is effective for visualization Uncertainty and Robustness of Graph Embeddings - Bojchevski 19

  20. Neglected aspects of graph embeddings Capturing uncertainty Robustness to noise Robustness to adversarial attacks Uncertainty and Robustness of Graph Embeddings - Bojchevski 20

  21. Why spectral embedding https://www.semanticscholar.org Uncertainty and Robustness of Graph Embeddings - Bojchevski 21

  22. What is spectral clustering 𝐸 = 5 4 4 3 3 2 2 1 1 𝑂 = 9 Spectral Similarity 0 0 Embedding Graph -4 -2 0 2 4 -4 -2 0 2 4 -1 -1 -2 -2 -3 -3 -4 -4 -5 -5 Graph clustering β€’ Maximize within-cluster edges β€’ Minimize between cluster edges Uncertainty and Robustness of Graph Embeddings - Bojchevski 22

  23. The minimum cut Partition V into two sets 𝐷 1 and 𝐷 2 , such that the sum of the inter-cluster edge weights cut 𝐷 1 , 𝐷 2 = Οƒ 𝑀 1 ∈𝐷 1 ,𝑀 2 ∈𝐷 2 π‘₯(𝑀 1 , 𝑀 2 ) is minimized 1 2 4 2 4 0 5 2 3 1 4 2 3 4 2 Drawbacks: β€’ Tends to cut small vertex sets from the rest of the graph β€’ Considers only inter-cluster edges, no intra-cluster edges Uncertainty and Robustness of Graph Embeddings - Bojchevski 23

  24. The normalized cut 𝑑𝑣𝑒(𝐷 1 ,𝐷 2 ) 𝑑𝑣𝑒(𝐷 2 ,𝐷 1 ) Ratio Cut: Minimize + |𝐷 1 | |𝐷 2 | 𝑑𝑣𝑒(𝐷 1 ,𝐷 2 ) 𝑑𝑣𝑒(𝐷 1 ,𝐷 2 ) Normalized Cut: Minimize vol(𝐷 1 ) + vol(𝐷 2 ) 1 2 4 1 2 4 2 4 2 4 0 5 2 3 0 5 2 3 1 1 4 2 4 2 3 4 3 4 2 2 Uncertainty and Robustness of Graph Embeddings - Bojchevski 24

  25. Multi-way graph partitioning Generalization to 𝑙 β‰₯ 2 clusters Partition V into disjoint clusters 𝐷 1 , … , 𝐷 𝑙 such that 𝑙 β€’ Cut: min Οƒ 𝑗=1 𝑑𝑣𝑒(𝐷 i , V\𝐷 i ) 1 2 4 C 1 ,…,C k 2 4 𝑑𝑣𝑒(𝐷 i ,V\𝐷 i ) 𝑙 β€’ Ratio Cut: min Οƒ 𝑗=1 0 5 2 3 |𝐷 i | C 1 ,…,C k 1 4 2 𝑑𝑣𝑒(𝐷 i ,V\𝐷 i ) 𝑙 β€’ Normalized Cut: min Οƒ 𝑗=1 3 4 2 vol(𝐷 𝑗 ) C 1 ,…,C k Minimum Cut for 𝑙 = 3 Finding the optimal solution is NP-hard How to compute an approximate solution efficiently? Uncertainty and Robustness of Graph Embeddings - Bojchevski 25

  26. Graph Laplacian Laplacian matrix 𝑀 = 𝐸 βˆ’ 𝐡 β€’ 𝐡 = (weighted) adjacency matrix, 𝐸 = degree matrix Observation: For any vector 𝑔 we have 𝑔 π‘ˆ β‹… 𝑀 β‹… 𝑔 = 1 𝑀 2 2 β‹… Οƒ 𝑣,𝑀 ∈ 𝐹 𝑋 𝑣𝑀 𝑔 𝑣 βˆ’ 𝑔 Normalized Laplacian 𝑀 𝑑𝑧𝑛 = 𝐸 βˆ’ 1 2 𝑀𝐸 βˆ’ 1 2 = 𝐽 βˆ’ 𝐸 βˆ’ 1 2 𝐡𝐸 βˆ’ 1 2 Uncertainty and Robustness of Graph Embeddings - Bojchevski 26

  27. Physical interpretation of the Laplacian (I) Let f be a heat distribution over a graph with 𝑔 𝑗 = the heat at node 𝑀 𝑗 The heat transferred between 𝑀 𝑗 and 𝑀 π‘˜ is prop. to (𝑔 𝑗 βˆ’π‘” π‘˜ ) if 𝑗, π‘˜ ∈ 𝐹 https://en.wikipedia.org/wiki/Laplacian_matrix#/media/ File:Graph_Laplacian_Diffusion_Example.gif Uncertainty and Robustness of Graph Embeddings - Bojchevski 27

  28. Physical interpretation of the Laplacian (I) Graph is viewed as an electrical circuit with edges as wires (resistors) Apply voltage at some nodes and measure induced voltage at other nodes Induced voltages minimizes Οƒ 𝑣,𝑀 ∈ 𝐹 𝑦 𝑣 βˆ’ 𝑦 𝑀 2 We can find the voltage by minimizing 𝑦 π‘ˆ 𝑀x Uncertainty and Robustness of Graph Embeddings - Bojchevski 28

  29. Properties of the Graph Laplacian L is symmetric and positive semi-definite The number of eigenvectors of 𝑀 with eigenvalue 0 corresponds to the number of connected components Algebraic connectivity of a graph is πœ‡ 2 (𝑀) β€’ The magnitude reflects how well connected the graph overall is The spectrum of 𝑀 encodes useful information about the graph β€’ Unfortunately, there exist co-spectral graphs Uncertainty and Robustness of Graph Embeddings - Bojchevski 29

  30. Minimum cut and the graph Laplacian 1 1 2 4 𝑗𝑔 𝑀 𝑗 ∈ 𝐷 𝑙 Define indicator vector: : β„Ž 𝐷 𝑙 𝑗 = ቐ 2 4 |C 𝑗 | 0 5 2 3 0 π‘“π‘šπ‘‘π‘“ 1 4 2 Let H = [β„Ž 𝐷 1 ; β„Ž 𝐷 2 ; … ; β„Ž 𝐷 𝑙 ] 3 4 2 Observations: 𝐼 π‘ˆ 𝐼 = 𝐽𝑒 is orthonormal π‘ˆ β‹… 𝑀 β‹… β„Ž 𝑑 𝑗 = 𝑑𝑣𝑒 𝐷 𝑗 ,π‘Š\𝐷 𝑗 π‘ˆ β‹… 𝑀 β‹… β„Ž 𝑑 𝑗 = (𝐼 π‘ˆ 𝑀𝐼) 𝑗𝑗 β„Ž 𝐷 𝑗 and β„Ž 𝐷 𝑗 𝐷 𝑗 𝑙 (𝐼 π‘ˆ 𝑀𝐼) 𝑗𝑗 = 𝑒𝑠𝑏𝑑𝑓(𝐼 π‘ˆ 𝑀𝐼) 𝑑𝑣𝑒 𝐷 𝑗 ,π‘Š\𝐷 𝑗 𝑙 𝑆𝑏𝑒𝑗𝑝𝐷𝑣𝑒(𝐷 1 , … , 𝐷 𝑙 ) = Οƒ 𝑗=1 = Οƒ 𝑗=1 𝐷 𝑗 NetGAN: Generating Graphs via Random Walks - Bojchevski, Shchur, ZΓΌgner, GΓΌnnemann. 30

  31. Minimum cut and the graph Laplacian Minimizing ratio-cut (normalized cut with 𝑀 𝑑𝑧𝑛 ) is equivalent to 𝐷 1 ,…,𝐷 𝑙 𝑒𝑠𝑏𝑑𝑓(𝐼 π‘ˆ 𝑀𝐼) subject to 𝐼 π‘ˆ 𝐼 = 𝐽𝑒 min Constraint relaxation: allow arbitrary values for H πΌβˆˆπ‘† π‘ŠΓ—πΏ 𝑒𝑠𝑏𝑑𝑓(𝐼 π‘ˆ 𝑀𝐼) subject to 𝐼 π‘ˆ 𝐼 = 𝐽𝑒 min Standard trace minimization problem Optimal 𝐼 = First 𝐿 smallest eigenvectors of 𝑀 Uncertainty and Robustness of Graph Embeddings - Bojchevski 31

Recommend


More recommend