Uncertainty and Robustness of Graph Embeddings Aleksandar Bojchevski Technical University of Munich, Germany Graph Embedding Day 2018 - Lyon
Neglected aspects of graph embeddings Capturing uncertainty Robustness to noise Robustness to adversarial attacks Uncertainty and Robustness of Graph Embeddings - Bojchevski 2
Neglected aspects of graph embeddings Capturing uncertainty Robustness to noise Robustness to adversarial attacks Uncertainty and Robustness of Graph Embeddings - Bojchevski 3
Nodes are points in a low-dimensional space Uncertainty and Robustness of Graph Embeddings - Bojchevski 4
Nodes are distributions Uncertainty and Robustness of Graph Embeddings - Bojchevski 5
Graph2Gauss - 3 key modeling ideas 1. Uncertainty 2. Personalized ranking 3. Inductiveness π¦ π π = 2 π π (π¦ π ) deep π = 1 encoder πͺ( π π , Ξ£ π ) Uncertainty and Robustness of Graph Embeddings - Bojchevski 6
Uncertainty Embed nodes as (Gaussian) distributions Sources of uncertainty: β’ Conflicting structure and attributes β’ Heterogenous neighborhood β’ Noise, outliers, anomalies, β¦. Uncertainty and Robustness of Graph Embeddings - Bojchevski 7
Personalized ranking π = 2 For each node π : nodes in its (π) -hop neighborhood π = 1 should be closer to π compared to nodes in its (π + 1) -hop neighborhood Uncertainty and Robustness of Graph Embeddings - Bojchevski 8
Personalized ranking π = 2 For each node π : nodes in its (π) -hop neighborhood π = 1 should be closer to π compared to nodes in its (π + 1) -hop neighborhood Uncertainty and Robustness of Graph Embeddings - Bojchevski 9
Personalized ranking For each node π : nodes in its (π) -hop neighborhood π = 2 should be closer to π compared to nodes in its (π + 1) -hop neighborhood π = 1 Example: closer in terms of the KL Diveregence KL is asymmetric β handles directed graphs Uncertainty and Robustness of Graph Embeddings - Bojchevski 10
Personalized ranking π = 2 Personalized ranking implies pairwise constraints for node π π = 1 D πΏπ (πͺ π ||πͺ π ) < D πΏπ (πͺ π β² ||πͺ π ) (π β² ) , βπ < πβ² (π) , βπ β² β π π βπ β π π set of nodes in the π -hop neighborhood of node π Uncertainty and Robustness of Graph Embeddings - Bojchevski 11
Inductiveness π¦ π π π (π¦ π ) deep Generalize to unseen nodes by learning encoder a mapping from features to embeddings πͺ( π π , Ξ£ π ) Uncertainty and Robustness of Graph Embeddings - Bojchevski 12
Graph2Gauss - 3 key modeling ideas 1. Uncertainty 2. Personalized ranking 3. Inductiveness π¦ π π = 2 π π (π¦ π ) deep π = 1 encoder πͺ( π π , Ξ£ π ) Uncertainty and Robustness of Graph Embeddings - Bojchevski 13
Learning with energy-based loss 2 + exp βπΉ ππβ² ) β = Ο π,π,π β² (πΉ ππ πΉ ππ = D πΏπ (πͺ π | πͺ π Closer nodes should have lower energy Naively: π(π 3 ) complexity Node-anchored sampling strategy: β’ For each node same one another node from every neighborhood β’ Less than 4.2% triplets seen to match performance β’ Lower gradient variance Uncertainty and Robustness of Graph Embeddings - Bojchevski 14
Graph2Gauss is parameter/data efficient Uncertainty and Robustness of Graph Embeddings - Bojchevski 15
Graph2Gauss captures uncertainty Uncertainty correlates with diversity Diversity: number of distinct classes in a nodeβs k -hop neighborhood Uncertainty and Robustness of Graph Embeddings - Bojchevski 16
Graph2Gauss captures uncertainty Uncertainty reveals the intrinsic latent dimensionality of the graph Detected latent dimensions β number ground-truth communities Uncertainty and Robustness of Graph Embeddings - Bojchevski 17
Uncertainty and link prediction Prune dimensions with high uncertainty Maintaining link prediction performance Uncertainty and Robustness of Graph Embeddings - Bojchevski 18
Graph2Gauss is effective for visualization Uncertainty and Robustness of Graph Embeddings - Bojchevski 19
Neglected aspects of graph embeddings Capturing uncertainty Robustness to noise Robustness to adversarial attacks Uncertainty and Robustness of Graph Embeddings - Bojchevski 20
Why spectral embedding https://www.semanticscholar.org Uncertainty and Robustness of Graph Embeddings - Bojchevski 21
What is spectral clustering πΈ = 5 4 4 3 3 2 2 1 1 π = 9 Spectral Similarity 0 0 Embedding Graph -4 -2 0 2 4 -4 -2 0 2 4 -1 -1 -2 -2 -3 -3 -4 -4 -5 -5 Graph clustering β’ Maximize within-cluster edges β’ Minimize between cluster edges Uncertainty and Robustness of Graph Embeddings - Bojchevski 22
The minimum cut Partition V into two sets π· 1 and π· 2 , such that the sum of the inter-cluster edge weights cut π· 1 , π· 2 = Ο π€ 1 βπ· 1 ,π€ 2 βπ· 2 π₯(π€ 1 , π€ 2 ) is minimized 1 2 4 2 4 0 5 2 3 1 4 2 3 4 2 Drawbacks: β’ Tends to cut small vertex sets from the rest of the graph β’ Considers only inter-cluster edges, no intra-cluster edges Uncertainty and Robustness of Graph Embeddings - Bojchevski 23
The normalized cut ππ£π’(π· 1 ,π· 2 ) ππ£π’(π· 2 ,π· 1 ) Ratio Cut: Minimize + |π· 1 | |π· 2 | ππ£π’(π· 1 ,π· 2 ) ππ£π’(π· 1 ,π· 2 ) Normalized Cut: Minimize vol(π· 1 ) + vol(π· 2 ) 1 2 4 1 2 4 2 4 2 4 0 5 2 3 0 5 2 3 1 1 4 2 4 2 3 4 3 4 2 2 Uncertainty and Robustness of Graph Embeddings - Bojchevski 24
Multi-way graph partitioning Generalization to π β₯ 2 clusters Partition V into disjoint clusters π· 1 , β¦ , π· π such that π β’ Cut: min Ο π=1 ππ£π’(π· i , V\π· i ) 1 2 4 C 1 ,β¦,C k 2 4 ππ£π’(π· i ,V\π· i ) π β’ Ratio Cut: min Ο π=1 0 5 2 3 |π· i | C 1 ,β¦,C k 1 4 2 ππ£π’(π· i ,V\π· i ) π β’ Normalized Cut: min Ο π=1 3 4 2 vol(π· π ) C 1 ,β¦,C k Minimum Cut for π = 3 Finding the optimal solution is NP-hard How to compute an approximate solution efficiently? Uncertainty and Robustness of Graph Embeddings - Bojchevski 25
Graph Laplacian Laplacian matrix π = πΈ β π΅ β’ π΅ = (weighted) adjacency matrix, πΈ = degree matrix Observation: For any vector π we have π π β π β π = 1 π€ 2 2 β Ο π£,π€ β πΉ π π£π€ π π£ β π Normalized Laplacian π π‘π§π = πΈ β 1 2 ππΈ β 1 2 = π½ β πΈ β 1 2 π΅πΈ β 1 2 Uncertainty and Robustness of Graph Embeddings - Bojchevski 26
Physical interpretation of the Laplacian (I) Let f be a heat distribution over a graph with π π = the heat at node π€ π The heat transferred between π€ π and π€ π is prop. to (π π βπ π ) if π, π β πΉ https://en.wikipedia.org/wiki/Laplacian_matrix#/media/ File:Graph_Laplacian_Diffusion_Example.gif Uncertainty and Robustness of Graph Embeddings - Bojchevski 27
Physical interpretation of the Laplacian (I) Graph is viewed as an electrical circuit with edges as wires (resistors) Apply voltage at some nodes and measure induced voltage at other nodes Induced voltages minimizes Ο π£,π€ β πΉ π¦ π£ β π¦ π€ 2 We can find the voltage by minimizing π¦ π πx Uncertainty and Robustness of Graph Embeddings - Bojchevski 28
Properties of the Graph Laplacian L is symmetric and positive semi-definite The number of eigenvectors of π with eigenvalue 0 corresponds to the number of connected components Algebraic connectivity of a graph is π 2 (π) β’ The magnitude reflects how well connected the graph overall is The spectrum of π encodes useful information about the graph β’ Unfortunately, there exist co-spectral graphs Uncertainty and Robustness of Graph Embeddings - Bojchevski 29
Minimum cut and the graph Laplacian 1 1 2 4 ππ π€ π β π· π Define indicator vector: : β π· π π = α 2 4 |C π | 0 5 2 3 0 πππ‘π 1 4 2 Let H = [β π· 1 ; β π· 2 ; β¦ ; β π· π ] 3 4 2 Observations: πΌ π πΌ = π½π is orthonormal π β π β β π π = ππ£π’ π· π ,π\π· π π β π β β π π = (πΌ π ππΌ) ππ β π· π and β π· π π· π π (πΌ π ππΌ) ππ = π’π πππ(πΌ π ππΌ) ππ£π’ π· π ,π\π· π π πππ’πππ·π£π’(π· 1 , β¦ , π· π ) = Ο π=1 = Ο π=1 π· π NetGAN: Generating Graphs via Random Walks - Bojchevski, Shchur, ZΓΌgner, GΓΌnnemann. 30
Minimum cut and the graph Laplacian Minimizing ratio-cut (normalized cut with π π‘π§π ) is equivalent to π· 1 ,β¦,π· π π’π πππ(πΌ π ππΌ) subject to πΌ π πΌ = π½π min Constraint relaxation: allow arbitrary values for H πΌβπ πΓπΏ π’π πππ(πΌ π ππΌ) subject to πΌ π πΌ = π½π min Standard trace minimization problem Optimal πΌ = First πΏ smallest eigenvectors of π Uncertainty and Robustness of Graph Embeddings - Bojchevski 31
Recommend
More recommend