positive semidefinite rank
play

Positive semidefinite rank Pablo A. Parrilo Laboratory for - PowerPoint PPT Presentation

Positive semidefinite rank Pablo A. Parrilo Laboratory for Information and Decision Systems Electrical Engineering and Computer Science Massachusetts Institute of Technology Based on joint work with Hamza Fawzi (MIT), Jo ao Gouveia (U.


  1. Positive semidefinite rank Pablo A. Parrilo Laboratory for Information and Decision Systems Electrical Engineering and Computer Science Massachusetts Institute of Technology Based on joint work with Hamza Fawzi (MIT), Jo˜ ao Gouveia (U. Coimbra), James Saunderson (MIT), Richard Robinson and Rekha Thomas (U. Washington) Cargese 2014 Parrilo (MIT) Positive semidefinite rank Cargese 2014 1 / 32

  2. Question: representability of convex sets Existence and efficiency: When is a convex set representable by conic optimization? How to quantify the number of additional variables that are needed? Given a convex set C , is it possible to repre- sent it as C = π ( K ∩ L ) where K is a cone, L is an affine subspace, and π is a linear map? Parrilo (MIT) Positive semidefinite rank Cargese 2014 2 / 32

  3. Factorizations Factorizations Given a matrix M ∈ R m × n , can factorize it as M = AB , i.e., B A R n → R k → R m − − Ideally, k is small (matrix M is low-rank), so we’re factorizing through a “small subspace.” Why is this useful? Realization theory (e.g., factorization of a Hankel matrix) Principal component analysis (e.g., factorization of covariance of a Gaussian process) And many others... Standard notion in linear algebra. Parrilo (MIT) Positive semidefinite rank Cargese 2014 3 / 32

  4. Factorizations Factorizations Given a matrix M ∈ R m × n , can factorize it as M = AB , i.e., B A R n → R k → R m − − Ideally, k is small (matrix M is low-rank), so we’re factorizing through a “small subspace.” Why is this useful? Realization theory (e.g., factorization of a Hankel matrix) Principal component analysis (e.g., factorization of covariance of a Gaussian process) And many others... Standard notion in linear algebra. Parrilo (MIT) Positive semidefinite rank Cargese 2014 3 / 32

  5. Factorizations More Factorizations... However, often we need further conditions on M = AB ... Norm conditions on the factors A , B : Want factors A , B to be “small” in some norm Well-studied topic in Banach space theory, through the notion of factorization norms 1 2( � A � 2 F + � B � 2 For instance, the nuclear norm � M � ⋆ := min F ) A , B : M = AB Nonnegativity conditions: Matrix M is (componentwise) nonnegative, and so must be the factors. This is the nonnegative factorization problem. Many applications, e.g., in probability (conditional independence) and machine learning (additive features). Parrilo (MIT) Positive semidefinite rank Cargese 2014 4 / 32

  6. Factorizations More Factorizations... However, often we need further conditions on M = AB ... Norm conditions on the factors A , B : Want factors A , B to be “small” in some norm Well-studied topic in Banach space theory, through the notion of factorization norms 1 2( � A � 2 F + � B � 2 For instance, the nuclear norm � M � ⋆ := min F ) A , B : M = AB Nonnegativity conditions: Matrix M is (componentwise) nonnegative, and so must be the factors. This is the nonnegative factorization problem. Many applications, e.g., in probability (conditional independence) and machine learning (additive features). Parrilo (MIT) Positive semidefinite rank Cargese 2014 4 / 32

  7. Factorizations Nonnegative factorization and hidden variables Let X , Y be discrete random variables, with joint distribution P [ X = i , Y = j ] = P ij . The nonnegative rank of P is the smallest support of a random variable Z , such that X and Y are conditionally independent given Z (i.e., X − Z − Y is Markov): � P [ X = i , Y = j ] = P [ Z = s ] · P [ X = i | Z = s ] · P [ Y = j | Z = s ] . s =1 ,..., k Relations with information theory, “correlation generation,” communication complexity, etc. As we’ll see, also fundamental in optimization . . . Parrilo (MIT) Positive semidefinite rank Cargese 2014 5 / 32

  8. Factorizations Nonnegative factorization and hidden variables Let X , Y be discrete random variables, with joint distribution P [ X = i , Y = j ] = P ij . The nonnegative rank of P is the smallest support of a random variable Z , such that X and Y are conditionally independent given Z (i.e., X − Z − Y is Markov): � P [ X = i , Y = j ] = P [ Z = s ] · P [ X = i | Z = s ] · P [ Y = j | Z = s ] . s =1 ,..., k Relations with information theory, “correlation generation,” communication complexity, etc. As we’ll see, also fundamental in optimization . . . Parrilo (MIT) Positive semidefinite rank Cargese 2014 5 / 32

  9. Factorizations Conic factorizations Conic factorizations We’re interested in a different class: conic factorizations [GPT11] Let M ∈ R m × n be a nonnegative matrix, and K be a convex cone in R k . + Then, we want M = AB , where B A R n → R m − → K − + + M maps the nonnegative orthant into the nonnegative orthant. For K = R k + , this is a standard nonnegative factorization. In general, factorize a linear map through a “small cone” Important special case: K is the cone of psd matrices... Parrilo (MIT) Positive semidefinite rank Cargese 2014 6 / 32

  10. Factorizations Conic factorizations Conic factorizations We’re interested in a different class: conic factorizations [GPT11] Let M ∈ R m × n be a nonnegative matrix, and K be a convex cone in R k . + Then, we want M = AB , where B A R n → R m − → K − + + M maps the nonnegative orthant into the nonnegative orthant. For K = R k + , this is a standard nonnegative factorization. In general, factorize a linear map through a “small cone” Important special case: K is the cone of psd matrices... Parrilo (MIT) Positive semidefinite rank Cargese 2014 6 / 32

  11. Factorizations Conic factorizations Conic factorizations We’re interested in a different class: conic factorizations [GPT11] Let M ∈ R m × n be a nonnegative matrix, and K be a convex cone in R k . + Then, we want M = AB , where B A R n → R m − → K − + + M maps the nonnegative orthant into the nonnegative orthant. For K = R k + , this is a standard nonnegative factorization. In general, factorize a linear map through a “small cone” Important special case: K is the cone of psd matrices... Parrilo (MIT) Positive semidefinite rank Cargese 2014 6 / 32

  12. Factorizations Positive semidefinite rank PSD rank of a nonnegative matrix Let M ∈ R m × n be a nonnegative matrix. Definition [GPT11]: The PSD rank of M , denoted rank psd , is the smallest r for which there exists r × r PSD matrices { A 1 , . . . , A m } and { B 1 , . . . , B n } such that M ij = trace A i B j , i = 1 , . . . , m j = 1 , . . . , n . (The maps are then given by x �→ � i x i A i , and Y �→ trace YB j .) Natural definition, generalization of nonnegative rank. Parrilo (MIT) Positive semidefinite rank Cargese 2014 7 / 32

  13. Factorizations Positive semidefinite rank PSD rank of a nonnegative matrix Let M ∈ R m × n be a nonnegative matrix. Definition [GPT11]: The PSD rank of M , denoted rank psd , is the smallest r for which there exists r × r PSD matrices { A 1 , . . . , A m } and { B 1 , . . . , B n } such that M ij = trace A i B j , i = 1 , . . . , m j = 1 , . . . , n . (The maps are then given by x �→ � i x i A i , and Y �→ trace YB j .) Natural definition, generalization of nonnegative rank. Parrilo (MIT) Positive semidefinite rank Cargese 2014 7 / 32

  14. Factorizations Positive semidefinite rank PSD rank of a nonnegative matrix Let M ∈ R m × n be a nonnegative matrix. Definition [GPT11]: The PSD rank of M , denoted rank psd , is the smallest r for which there exists r × r PSD matrices { A 1 , . . . , A m } and { B 1 , . . . , B n } such that M ij = trace A i B j , i = 1 , . . . , m j = 1 , . . . , n . (The maps are then given by x �→ � i x i A i , and Y �→ trace YB j .) Natural definition, generalization of nonnegative rank. Parrilo (MIT) Positive semidefinite rank Cargese 2014 7 / 32

  15. Factorizations Positive semidefinite rank Example (I)   0 1 1  . M = 1 0 1  1 1 0 M admits a psd factorization of size 2: � 1 � 0 � � � � 0 0 1 − 1 A 1 = A 2 = A 3 = 0 0 0 1 − 1 1 � 0 � 1 � 1 0 � 0 � 1 � B 1 = B 2 = B 3 = . 0 1 0 0 1 1 One can easily check that the matrices A i and B j are positive semidefinite, and that M ij = � A i , B j � . This factorization shows that rank psd ( M ) ≤ 2, and in fact rank psd ( M ) = 2. Parrilo (MIT) Positive semidefinite rank Cargese 2014 8 / 32

  16. Factorizations Positive semidefinite rank Example (I)   0 1 1  . M = 1 0 1  1 1 0 M admits a psd factorization of size 2: � 1 � 0 � � � � 0 0 1 − 1 A 1 = A 2 = A 3 = 0 0 0 1 − 1 1 � 0 � 1 � 1 0 � 0 � 1 � B 1 = B 2 = B 3 = . 0 1 0 0 1 1 One can easily check that the matrices A i and B j are positive semidefinite, and that M ij = � A i , B j � . This factorization shows that rank psd ( M ) ≤ 2, and in fact rank psd ( M ) = 2. Parrilo (MIT) Positive semidefinite rank Cargese 2014 8 / 32

  17. Factorizations Positive semidefinite rank Example (II) 4 Consider the matrix 3   a b c c 2  . M ( a , b , c ) = c a b  1 b c a 0 0 1 2 3 4 b Usual rank of M ( a , b , c ) is 3, unless a = b = c (then, rank is 1). One can show that a 2 + b 2 + c 2 ≤ 2( ab + bc + ac ) . rank psd ( M ( a , b , c )) ≤ 2 ⇐ ⇒ Parrilo (MIT) Positive semidefinite rank Cargese 2014 9 / 32

Recommend


More recommend