Identifying Undirected Network Structure via Semidefinite Relaxation - PowerPoint PPT Presentation

Identifying Undirected Network Structure via Semidefinite Relaxation Rasoul Shafipour, Santiago Segarra , Antonio G. Marques and Gonzalo Mateos Institute for Data, Systems, and Society Massachusetts Institute of Technology segarra@mit.edu http://www.mit.edu/~segarra/ ICASSP, April 20, 2018 Santiago Segarra 1 / 18

Network Science analytics Online social media Internet Clean energy and grid analy,cs ◮ Desiderata: Process, analyze and learn from network data [Kolaczyk09] Santiago Segarra 2 / 18

Network Science analytics Online social media Internet Clean energy and grid analy,cs ◮ Desiderata: Process, analyze and learn from network data [Kolaczyk09] ◮ Network as graph G : encode pairwise relationships ◮ Sometimes both G and data at the nodes are available ⇒ Leverage G to process network data ⇒ Graph Signal Processing Santiago Segarra 2 / 18

Network Science analytics Online social media Internet Clean energy and grid analy,cs ◮ Desiderata: Process, analyze and learn from network data [Kolaczyk09] ◮ Network as graph G : encode pairwise relationships ◮ Sometimes both G and data at the nodes are available ⇒ Leverage G to process network data ⇒ Graph Signal Processing ◮ Sometimes we have access to network data but not to G itself ⇒ Leverage the relation between them to infer G from the data Santiago Segarra 2 / 18

Graph signal processing (GSP) x 2 x 4 ◮ Undirected G with adjacency matrix A 2 4 ⇒ A ij = Proximity between i and j x 1 1 ◮ Define a signal x on top of the graph ⇒ x i = Signal value at node i 3 5 x 3 x 5 Santiago Segarra 3 / 18

Graph signal processing (GSP) x 2 x 4 ◮ Undirected G with adjacency matrix A 2 4 ⇒ A ij = Proximity between i and j x 1 1 ◮ Define a signal x on top of the graph ⇒ x i = Signal value at node i 3 5 x 3 x 5 ◮ Associated with G is the graph-shift operator S = VΛV T ∈ M N ⇒ S ij = 0 for i � = j and ( i , j ) �∈ E (local structure in G ) ⇒ Ex: adjacency A and Laplacian L = D − A matrices Santiago Segarra 3 / 18

Graph signal processing (GSP) x 2 x 4 ◮ Undirected G with adjacency matrix A 2 4 ⇒ A ij = Proximity between i and j x 1 1 ◮ Define a signal x on top of the graph ⇒ x i = Signal value at node i 3 5 x 3 x 5 ◮ Associated with G is the graph-shift operator S = VΛV T ∈ M N ⇒ S ij = 0 for i � = j and ( i , j ) �∈ E (local structure in G ) ⇒ Ex: adjacency A and Laplacian L = D − A matrices ◮ Graph filters H : R N → R N are maps between graph signals ⇒ Polynomial in S with coefficients h ∈ R L +1 ⇒ H := � L l =0 h l S l ◮ How to use GSP to infer the graph topology? Santiago Segarra 3 / 18

Topology inference: Motivation and context ◮ Network topology inference from nodal observations [Kolaczyk09] ◮ Partial correlations and conditional dependence [Dempster74] ◮ Sparsity [Friedman07] and consistency [Meinshausen06] ◮ [Banerjee08], [Lake10], [Slawski15], [Karanikolas16] ◮ Key in neuroscience [Sporns10] ⇒ Functional net inferred from activity Santiago Segarra 4 / 18

Topology inference: Motivation and context ◮ Network topology inference from nodal observations [Kolaczyk09] ◮ Partial correlations and conditional dependence [Dempster74] ◮ Sparsity [Friedman07] and consistency [Meinshausen06] ◮ [Banerjee08], [Lake10], [Slawski15], [Karanikolas16] ◮ Key in neuroscience [Sporns10] ⇒ Functional net inferred from activity ◮ Noteworthy GSP-based approaches ◮ Gaussian graphical models [Egilmez16] ◮ Smooth signals [Dong15], [Kalofolias16] ◮ Stationary signals [Pasdeloup15], [Segarra16] ◮ Directed graphs [Mei15], [Shen16] ◮ Low-rank excitation [Wai18] ◮ Our contribution: topology inference from non-stationary graph signals Santiago Segarra 4 / 18

Problem formulation ◮ Underlying graph G with undirected unknown GSO S ◮ Observe signals { y i } K i =1 defined on the unknown graph Setup y 1 y 2 y 3 Santiago Segarra 5 / 18

Problem formulation ◮ Underlying graph G with undirected unknown GSO S ◮ Observe signals { y i } K i =1 defined on the unknown graph Setup y 1 y 2 y 3 Problem statement Given observations { y i } K i =1 , determine the network S knowing that: { y i } K i =1 are outputs of a diffusion process on S . Santiago Segarra 5 / 18

Problem formulation ◮ Consider an arbitrary linear network process on the GSO S ⇒ Every realization corresponds to a different input x i � L � � h l S l y i = x i = Hx i , i = 1 , . . . , K l =0 ◮ Goal : Recover S from the observation of K signals { y i } K i =1 ◮ Additional unknowns ⇒ The degree of the filter L ⇒ The filter coefficients { h l } L l =0 ⇒ The specific inputs x i ; but we know that x i ∼ N ( 0 , C x ) Santiago Segarra 6 / 18

Blueprint of our solution % STEP%1:%Es5mate% { y i } K the%eigenvectors%of% i =1 % S % STEP%2:%Find% ˆ S eigenvalues%via% op5miza5on% A%priori%info%and% desirable%features%% Santiago Segarra 7 / 18

Blueprint of our solution % STEP%1:%Es5mate% { y i } K the%eigenvectors%of% i =1 % ˆ V :%noisy% S % STEP%2:%Find% ˆ S eigenvalues%via% op5miza5on% Sparsity%and% A%priori%info%and% desirable%features%% GSO%feasibility% Santiago Segarra 7 / 18

Step 1: Estimating the eigenvectors of S ◮ y is the output of a local diffusion process on the graph ∞ � N − 1 � � � h l S l y = α 0 ( I − α l S ) x = x := Hx l =1 l =0 ◮ Whenever the input x is white ⇒ graph stationary process on S [Marques17, Girault15, Perraudin17] Santiago Segarra 8 / 18

Step 1: Estimating the eigenvectors of S ◮ y is the output of a local diffusion process on the graph ∞ � N − 1 � � � h l S l y = α 0 ( I − α l S ) x = x := Hx l =1 l =0 ◮ Whenever the input x is white ⇒ graph stationary process on S [Marques17, Girault15, Perraudin17] Stationary case ◮ The covariance C y of y shares V with S C y = H 2 = h 2 1 S 2 + ... 0 I + 2 h 0 h 1 S + h 2 C y ⇒ Diagonalize ⇒ Obtain ˆ i =1 as ˆ ◮ Estimate covariance from { y i } K V Santiago Segarra 8 / 18

Non-stationary graph signals ◮ Q: What if the signal y = Hx is not stationary (i.e., x colored)? ⇒ Matrices S and C y no longer simultaneously diagonalizable since C y = HC x H Santiago Segarra 9 / 18

Non-stationary graph signals ◮ Q: What if the signal y = Hx is not stationary (i.e., x colored)? ⇒ Matrices S and C y no longer simultaneously diagonalizable since C y = HC x H l =0 h l S l diagonalized by the eigenvectors V of S ◮ Key: still H = � L − 1 ⇒ Infer V by estimating the unknown diffusion (graph) filter H ⇒ Step 1 boils down to system identification + eigendecomposition ˆ % H % System% { y i } K ˆ Eigendecomposi5on% V Iden5fica5on% i =1 % % Santiago Segarra 9 / 18

System identification Define C xyx := C 1 / 2 C y C 1 / 2 , with eigenvectors V xyx . If C x is non- x x singular then all admissible symmetric filters H are of the form H = C − 1 / 2 C 1 / 2 xyx V xyx diag( b ) V T xyx C − 1 / 2 , x x where b ∈ {− 1 , 1 } N is a binary (signed) vector. Santiago Segarra 10 / 18

System identification Define C xyx := C 1 / 2 C y C 1 / 2 , with eigenvectors V xyx . If C x is non- x x singular then all admissible symmetric filters H are of the form H = C − 1 / 2 C 1 / 2 xyx V xyx diag( b ) V T xyx C − 1 / 2 , x x where b ∈ {− 1 , 1 } N is a binary (signed) vector. ◮ Even if we get C y exactly, H is not identifiable ⇒ Not surprising since we only have second moment info ◮ Consider having access to multiple input distributions { C x , m } M m =1 Santiago Segarra 10 / 18

Multiple input processes ◮ Define A m := ( C − 1 / 2 x , m V xyx , m ) ⊙ ( C − 1 / 2 x , m C 1 / 2 xyx , m V xyx , m )  A 1 − A 2 0 · · · 0 0  0 A 2 − A 3 · · · 0 0   Ψ :=  . . . . .  ... . . . . .   . . . . .   0 0 0 · · · A M − 1 − A M ◮ b m ∈ {− 1 , 1 } N and b = [ b T M ] T , then Ψb ∗ = 0 1 , b T 2 , . . . , b T Santiago Segarra 11 / 18

Multiple input processes ◮ Define A m := ( C − 1 / 2 x , m V xyx , m ) ⊙ ( C − 1 / 2 x , m C 1 / 2 xyx , m V xyx , m )  A 1 − A 2 0 · · · 0 0  0 A 2 − A 3 · · · 0 0   Ψ :=  . . . . .  ... . . . . .   . . . . .   0 0 0 · · · A M − 1 − A M ◮ b m ∈ {− 1 , 1 } N and b = [ b T M ] T , then Ψb ∗ = 0 1 , b T 2 , . . . , b T C y , m are available, we can estimate b ∗ as Whenever only estimates ˆ T ˆ b ∗ = b ∈{− 1 , 1 } NM b T ˆ ˆ argmin Ψ Ψb , obtaining our estimate for the filter H as M H = 1 ˆ x , m ˆ xyx , m ˆ V xyx , m diag(ˆ m )ˆ � C − 1 / 2 C 1 / 2 b ∗ V T xyx , m C − 1 / 2 x , m M m =1 Santiago Segarra 11 / 18

Boolean quadratic program ◮ Our problem then reduces to solving the BQP T ˆ b ∗ = b ∈{− 1 , 1 } NM b T ˆ ˆ argmin Ψ Ψb T ˆ ◮ Define ˆ W = ˆ Ψ and B = bb T Ψ B � 0 tr( ˆ min WB ) s. to rank( B ) = 1 , B ii = 1 , i = 1 , . . . , NM ◮ Drop source of non-convexity to obtain the semi-definite relaxation B ∗ = argmin tr( ˆ WB ) s. to B ii = 1 , i = 1 , . . . , NM B � 0 Santiago Segarra 12 / 18

Performance guarantee ◮ For l = 1 , . . . , L , draw z l ∼ N ( 0 , B ∗ ), round ˜ b l = sign ( z l ), to obtain l ∗ = argmin ˜ l ˆ W ˜ b T b l l =1 ,..., L Santiago Segarra 13 / 18

Identifying Undirected Network Structure via Semidefinite Relaxation - PowerPoint PPT Presentation

Identifying Undirected Network Structure via Semidefinite Relaxation Rasoul Shafipour, Santiago Segarra , Antonio G. Marques and Gonzalo Mateos Institute for Data, Systems, and Society Massachusetts Institute of Technology segarra@mit.edu

Undirected Graphical Models Aaron Courville, Universit de Montral 2 (UNDIRECTED) GRAPHICAL

Lecture 3: Semidefinite Programming Lecture Outline Part I: Semidefinite programming,

Semidefinite Programming Pekka Orponen T-79.7001 Postgraduate Course on Theoretical Computer

QoS-Guaranteed User Association in Sokun, Gohary, HetNets via Semidefinite Relaxation

Undirected Graphical Models: Markov Random Fields Probabilistic Graphical Models Sharif

Identifying Network Traffic Activity Via Flow Sizes Overview Motivation identifying

Chapter 3 - Graphs Undirected Graphs Undirected graph. G = (V, E) V = nodes. E = edges

Camden Unweighted undirected k-spanners Peleg and Ullman 1987 Input: An undirected graph

Undirected Graphical Models Undirected Graphs Chris Williams, School of Informatics, University

Undirected graphical models Graph G : arbitrary undirected graph Useful when variables interact

Undirected graphs anhtt-fit@mail.hut.edu.vn dungct@it-hut.edu.vn Undirected graphs A graph

Sparsity and decomposition in semidefinite optimization Lieven Vandenberghe ECE Department, UCLA

Sparsity and decomposition in semidefinite optimization Lieven Vandenberghe Electrical and

13. Cones and semidefinite constraints Geometry of cones Second order cone programs

Completely positive semidefinite matrices: conic approximations and matrix factorization ranks

Semidefinite programming bounds for codes and anticodes in Cayley graphs Frank Vallentin

Advanced Methods for MILP Marco Chiarandini Department of Mathematics & Computer Science

Steiner Forest (Approximation Algorithms by V. Vazirani, Chapter 22) - Problem statement;

Planning and Optimization C2. Delete Relaxation: Finding Relaxed Plans Malte Helmert and Gabriele

From obfuscation to white-box crypto: relaxation and security notions Matthieu Rivain WhibOx

Computer Science & Engineering 423/823 Design and Analysis of Algorithms Lecture 06

Scalable Laplacian K-modes Imtiaz Masud Ziko, Eric Granger and Ismail Ben Ayed Laplacian K-modes

An Improved DOA Estimator Based On Partial Relaxation Approach Minh Trinh Hoang, Mats Viberg and

Robust guarantees for networks with flexible routing Sanjay Rao* Joint work with Yiyang Chang*

Identifying Undirected Network Structure via Semidefinite Relaxation - PowerPoint PPT Presentation

Identifying Undirected Network Structure via Semidefinite Relaxation Rasoul Shafipour, Santiago Segarra , Antonio G. Marques and Gonzalo Mateos Institute for Data, Systems, and Society Massachusetts Institute of Technology segarra@mit.edu

Undirected Graphical Models Aaron Courville, Universit de Montral 2 (UNDIRECTED) GRAPHICAL

Lecture 3: Semidefinite Programming Lecture Outline Part I: Semidefinite programming,

Semidefinite Programming Pekka Orponen T-79.7001 Postgraduate Course on Theoretical Computer

QoS-Guaranteed User Association in Sokun, Gohary, HetNets via Semidefinite Relaxation

Undirected Graphical Models: Markov Random Fields Probabilistic Graphical Models Sharif

Identifying Network Traffic Activity Via Flow Sizes Overview Motivation identifying

Chapter 3 - Graphs Undirected Graphs Undirected graph. G = (V, E) V = nodes. E = edges

Camden Unweighted undirected k-spanners Peleg and Ullman 1987 Input: An undirected graph

Undirected Graphical Models Undirected Graphs Chris Williams, School of Informatics, University

Undirected graphical models Graph G : arbitrary undirected graph Useful when variables interact

Undirected graphs anhtt-fit@mail.hut.edu.vn dungct@it-hut.edu.vn Undirected graphs A graph

Sparsity and decomposition in semidefinite optimization Lieven Vandenberghe ECE Department, UCLA

Sparsity and decomposition in semidefinite optimization Lieven Vandenberghe Electrical and

13. Cones and semidefinite constraints Geometry of cones Second order cone programs

Completely positive semidefinite matrices: conic approximations and matrix factorization ranks

Semidefinite programming bounds for codes and anticodes in Cayley graphs Frank Vallentin

Advanced Methods for MILP Marco Chiarandini Department of Mathematics &amp; Computer Science

Steiner Forest (Approximation Algorithms by V. Vazirani, Chapter 22) - Problem statement;

Planning and Optimization C2. Delete Relaxation: Finding Relaxed Plans Malte Helmert and Gabriele

From obfuscation to white-box crypto: relaxation and security notions Matthieu Rivain WhibOx

Computer Science &amp; Engineering 423/823 Design and Analysis of Algorithms Lecture 06

Scalable Laplacian K-modes Imtiaz Masud Ziko, Eric Granger and Ismail Ben Ayed Laplacian K-modes

An Improved DOA Estimator Based On Partial Relaxation Approach Minh Trinh Hoang, Mats Viberg and

Robust guarantees for networks with flexible routing Sanjay Rao* Joint work with Yiyang Chang*

Advanced Methods for MILP Marco Chiarandini Department of Mathematics & Computer Science

Computer Science & Engineering 423/823 Design and Analysis of Algorithms Lecture 06