latent factor analysis of gaussian distributions under
play

Latent Factor Analysis of Gaussian Distributions under Graphical - PowerPoint PPT Presentation

Latent Factor Analysis of Gaussian Distributions under Graphical Constraints Md Mahmudul Hasan, Shuangqing Wei, Ali Moharrer School of Electrical Engineering and Computer Science, Louisiana State University Md Mahmudul Hasan, S Wei, Ali Moharrer


  1. Latent Factor Analysis of Gaussian Distributions under Graphical Constraints Md Mahmudul Hasan, Shuangqing Wei, Ali Moharrer School of Electrical Engineering and Computer Science, Louisiana State University Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 1 / 23

  2. Outline Motivation 1 Introduction 2 Main results 3 Building a Gaussian Tree 4 Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 2 / 23

  3. Motivation Outline Motivation 1 Introduction 2 Main results 3 Building a Gaussian Tree 4 Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 3 / 23

  4. Motivation Motivation • Here the primary aim is to do dimension reduction using rank of a matrix as the object function, which is hard to achieve. Hence we use trace as the object function, which is effectively almost as good as rank minimization. • Unlike traditional ways to do factor analysis i.e. numerically, providing algorithms or providing convergence proof, we are trying to do factor analysis under graphical constrainsts. Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 4 / 23

  5. Introduction Outline Motivation 1 Introduction 2 Main results 3 Building a Gaussian Tree 4 Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 5 / 23

  6. Introduction Definitions The well-known factor analytic decomposition of an nxn population covariance matrix Σ , Σ = (Σ − D ) + D (1) • MTFA- matrix Σ − D is Gramian and matrix D is diagonal • CMTFA- matrix Σ − D and matrix D is both are Gramian. Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 6 / 23

  7. Introduction A brief on CMTFA CMTFA seeks a minimum trace Σ t that solves the following decomposition problem. Σ x = Σ t + D (2) such that, Σ t is low rank (rank < n ), D is diagonal, both Σ t and D are Gramian matrices. It was shown in [1] that, the above decomposition problem is equivalent to solving the following convex optimization problem. min − tr ( D ) D s.t. − λ min ( D ) ≤ 0 and − D i,i ≤ 0 , i = 1 , . . . , n (3) where, λ min ( D ) is the minimum eigenvalue of the matrix D . 1 Giacomo Della Riccia and Alexander Shapiro, ”Minimum Rank and Minimum Trace of Covariance Matrices”, Psychometrika, vol. 47, No. 4, December, 1982. Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 7 / 23

  8. Introduction A brief on CMTFA (cont’d) The following Theorem given in the same paper, sets the ground rules for a matrix D ∗ to be the solution for the optimization problem given by (3). Theorem The matrix D ∗ is a solution of the CMTFA problem if and only if D ∗ i,i ≥ 0 , 1 ≤ i ≤ n , λ min (Σ x − D ∗ ) = 0 , and there exists n × r matrix T such that � t ∗ ,i ∈ N (Σ x − D ∗ ) , i = 1 , ...., r and the following holds, r � � t 2 � µ j � � 1 = ∗ ,i − ξ j (4) i =1 j ∈ I ( D ∗ ) where r ≤ n indicating the number of columns of the matrix T , I ( D ∗ ) = { i : D ∗ j ∈ I ( D ∗ ) } are non-negative numbers and i,i = 0 , 1 ≤ i ≤ n } , { µ j , ξ j , j ∈ I ( D ∗ ) } are column vectors in R n with all the components equal to 0 except for { � the j th component which is equal to 1 . Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 8 / 23

  9. Introduction Problem Statement Our goal is to characterize the solution space of CMTFA, when CMTFA is applied to a specially generated Σ x i.e. generated from the following model.  x 1   α 1   z 1  . . . .  = . � � + . (5) y       . . .      x n α n z n where, y ∼ N (0 , 1) , 0 ≤ | α j | ≤ 1 , j = 1 , 2 , . . . , n . and { z i } are independent Gausian random varables with z i ∼ N (0 , 1 − α 2 i ) The above model makes the following star topology with y being the latent variable. Figure: A star topology generative model. Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 9 / 23

  10. Introduction State of the Art In [2], a necessary and sufficient condition was found on the subspace of Σ x for MTFA solution of Σ x to recover a star structure, when Σ x is equipped with a latent star graphical constraint. The main differences between their work and ours are, • We found the same condtion for CMTFA as they did for MTFA. • Even more importantly, we also characterized the solution for the case when CMTFA fails to recover a star structure. • We characterized the solution for all possible situations. 2 J. SAUNDERSON, V. CHANDRASEKARAN, P. A. PARRILO , AND A. S. WILLSKY, ”DIAGONAL AND LOW-RANK MATRIX DECOMPOSITIONS, CORRELATION MATRICES, AND ELLIPSOID FITTING” SIAM J. MATRIX ANAL. APPL., vol. 33, no. 4, pp. 1395-1416, 2015. Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 10 / 23

  11. Introduction Our contributions α = [ α 1 , α 2 , α 3 , ....., α n ] ′ , and without the loss of generality we have We define, vector � assumed | α 1 | ≥ | α 2 | ≥ · · · ≥ | α n | . We apply CMTFA to the following Σ x , aiming to characterize the solution space. 1  . . . α 1 α n  . . ... . . Σ x = (6)   . .   1 α n α 1 . . . Our contributions can be summarized by the following two theorems. Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 11 / 23

  12. Main results Outline Motivation 1 Introduction 2 Main results 3 Building a Gaussian Tree 4 Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 12 / 23

  13. Main results Theorem α is non-dominant, i.e. | α 1 | ≤ � n i =2 | α i | . CMTFA solution of Σ x is Σ t,ND if and only if, � α 2  α 1 α 2 . . . α 1 α n  1 α 2 α 2 α 1 . . . α 2 α n 2   Σ t,ND = (7)  . . .  ... . . .   . . .   α 2 α n α 1 α n α 2 . . . n Theorem α is dominant, i.e. | α 1 | > � n CMTFA solution of Σ x is Σ t,DM if and only if, � i =2 | α i | . | α 1 | ( � n i =2 | α i | )  α 1 α 2 . . . α 1 α n  | α 2 | ( | α 1 | − � i � =1 , 2 | α i | ) α 2 α 1 . . . α 2 α n   Σ t,DM =  . . .  ... . . .   . . .   α n α 1 α 1 α 2 . . . | α n | ( | α 1 | − � i � =1 ,n | α i | ) (8) Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 13 / 23

  14. Main results Proof of the non-dominant case The following Lemma will help us prove our first theorem. Lemma α given by (9) is a necessary condition for the existence of such Non-dominance of vector � t j, ∗ || 2 = 1 , n × r matrix T that � 1 ≤ i ≤ r and || � t ∗ ,i ∈ N (Σ t,ND ) , 1 ≤ j ≤ n . n � | α 1 | ≤ | α i | (9) i =2 Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 14 / 23

  15. Main results Proof of the non-dominant case (cont’d) Referring to the necessary and sufficient condtion for CMTFA solution. • Σ t,ND is rank 1 , its minimum eigenvalue is 0 . • We only need to show the existance of such n × r matrix T that t j, ∗ || 2 = 1 , � 1 ≤ i ≤ r and || � t ∗ ,i ∈ N (Σ t,ND ) , 1 ≤ j ≤ n where 1 ≤ r ≤ n . • the Lemma has already stated that, for the existence of such T non-dominance of � α is a necessary condition. • We next show that for the existence of such a T , non-dominance of � α is also a sufficient condition. Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 15 / 23

  16. Main results Proof of the non-dominant case (cont’d) It is straightforward to find the following basis vectors for the null space of Σ t,ND , − α 2 − α 3 − α n       α 1 α 1 α 1 1 0 0        0   1   0  v 1 = v 2 = v n − 1 = (10) � ,� , . . . , �        .   .   .  . . .       . . .       0 0 1 v n − 1 , � n − 1 v i ] , where c i ∈ { 1 , − 1 } , We define, V = [ � v 1 , . . .� i =1 c i +1 � i = 2 , . . . , n . Let, T n × n = V n × n · B n × n , where B n × n is a diagonal matrix. We have, TT T = V BB T V T = V βV T (11) The diagonal matrix β = BB T can have only non-negative entries. Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 16 / 23

  17. Main results Proof of non-dominant case (cont’d) t j, ∗ || 2 = 1 , Because of || � 1 ≤ j ≤ n condition on T , we get the following n equations, � n � 2 n − 1 α 2 c j α j � i +1 � β ii + β nn = 1 (12) α 2 α 1 1 i =1 j =2 β ii + c 2 i = 1 , . . . , n − 1 i +1 β nn = 1 , (13) If we select c i α i = | α i | , i = 2 , . . . , n and solve the above equations under such selections, we get β ii ≥ 0 , 1 ≤ i ≤ n . That completes the proof of our first Theorem . Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 17 / 23

  18. Main results Proof of the dominant case The following two Lemmas play pivotal role in proving the second theorem we proposed. Lemma Σ t,DM is a rank n − 1 matrix. Lemma There exists a column vector Φ = [Φ 1 , Φ 2 , ...., Φ n ] ′ such that Σ t,DM Φ = 0 , where Φ i ∈ {− 1 , 1 } , 1 ≤ i ≤ n . Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 18 / 23

Recommend


More recommend