SLIDE 1

Semidefinite Programming

Pekka Orponen

T-79.7001 Postgraduate Course on Theoretical Computer Science

24.4.2008

T–79.7001 Postgraduate Course on Theoretical Computer Science 24.4.2008

SLIDE 2

Outline

◮ 1. Strict Quadratic Programs and Vector Programs
  ◮ Strict quadratic programs
  ◮ The vector program relaxation
◮ 2. Semidefinite Programs
  ◮ Vector programs as matrix linear programs
  ◮ Properties of semidefinite matrices
  ◮ From vector programs to semidefinite programs
  ◮ Notes on computation
◮ 3. Randomised Rounding of Vector Programs
  ◮ Randomised rounding for MAX CUT

SLIDE 3

  • 1. Strict Quadratic Programs and Vector Programs

◮ A quadratic program concerns optimising a quadratic function of integer variables, with quadratic constraints.

◮ A quadratic program is strict if it contains no linear terms, i.e. each monomial appearing in it is either constant or of degree 2.

◮ E.g. a strict quadratic program for weighted MAX CUT:

◮ Given a weighted graph G = (N,E,w), N = [n] = {1,...,n}.

◮ Associate to each vertex i ∈ N a variable yi ∈ {+1,−1}. A cut (S, S̄) is determined as S = {i | yi = +1}, S̄ = {i | yi = −1}.

◮ The program:

    max (1/2) ∑_{1≤i<j≤n} wij (1 − yi yj)
    s.t. yi^2 = 1, i ∈ N
         yi ∈ Z, i ∈ N.
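As a quick numerical illustration (not part of the original slides; assumes NumPy, and the 4-cycle graph below is made-up example data), the strict quadratic objective can be evaluated directly for a ±1 assignment:

```python
import numpy as np

def maxcut_qp_objective(w, y):
    """Value of (1/2) * sum_{i<j} w[i][j] * (1 - y[i]*y[j]) for y in {+1,-1}^n."""
    n = len(y)
    assert all(yi in (+1, -1) for yi in y)  # the constraints y_i^2 = 1, y_i integer
    return 0.5 * sum(w[i][j] * (1 - y[i] * y[j])
                     for i in range(n) for j in range(i + 1, n))

# Toy instance: unit-weight 4-cycle 0-1-2-3-0.
w = np.zeros((4, 4))
for i, j in [(0, 1), (1, 2), (2, 3), (0, 3)]:
    w[i][j] = 1.0

# The cut S = {0, 2} separates every edge of the 4-cycle.
print(maxcut_qp_objective(w, [+1, -1, +1, -1]))  # 4.0
```

Each separated edge contributes wij (since 1 − yi yj = 2 there), and each non-separated edge contributes 0, so the objective is exactly the cut weight.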

SLIDE 4

The vector program relaxation

◮ Given a strict quadratic program on n variables yi, relax the variables into n-dimensional vectors vi ∈ R^n, and replace quadratic terms by inner products of these.

◮ E.g. the MAX CUT vector program:

    max (1/2) ∑_{1≤i<j≤n} wij (1 − vi^T vj)
    s.t. vi^T vi = 1, i ∈ N
         vi ∈ R^n, i ∈ N.

◮ Feasible solutions correspond to families of points on the n-dimensional unit sphere S^{n−1}.

◮ The original program is obtained by restriction to 1-dimensional solutions, e.g. all points along the x-axis: vi = (yi, 0, ..., 0).

◮ We shall see that vector programs can in fact be solved in polynomial time, and projections to 1 dimension yield nice approximations for the original problem.
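The 1-dimensional restriction can be checked numerically (a sketch assuming NumPy; the 4-cycle weights are illustration data): embedding a ±1 solution on the x-axis is feasible for the vector program and reproduces the quadratic objective.

```python
import numpy as np

def vp_objective(w, V):
    """(1/2) * sum_{i<j} w[i][j] * (1 - v_i^T v_j) for the rows v_i of V."""
    n = V.shape[0]
    G = V @ V.T  # matrix of inner products v_i^T v_j
    return 0.5 * sum(w[i][j] * (1 - G[i][j])
                     for i in range(n) for j in range(i + 1, n))

n = 4
w = np.zeros((n, n))
for i, j in [(0, 1), (1, 2), (2, 3), (0, 3)]:
    w[i][j] = 1.0

# Embed y = (+1,-1,+1,-1) as v_i = (y_i, 0, ..., 0).
y = np.array([+1, -1, +1, -1])
V = np.zeros((n, n))
V[:, 0] = y

print(np.allclose(np.diag(V @ V.T), 1.0))  # True: v_i^T v_i = 1 holds
print(vp_objective(w, V))                  # 4.0, same as the quadratic program
```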

SLIDE 5

  • 2. Semidefinite Programming

◮ A vector program on n n-dimensional vectors {v1,...,vn} can also be viewed as a linear program on the n × n matrix Y of their inner products, Y = [vi^T vj]_ij.

◮ However there is a “structural” constraint on the respective matrix linear program: the feasible solutions must specifically be inner product matrices.

◮ This turns out to imply (cf. later) that the feasible solution matrices Y are symmetric and positive semidefinite, i.e. x^T Y x ≥ 0 for all x ∈ R^n.

◮ Thus vector programming problems can be reformulated as semidefinite programming problems.

SLIDE 6

◮ Define the Frobenius (inner) product of two n × n matrices A, B ∈ R^{n×n} as

    A • B = ∑_{i=1}^n ∑_{j=1}^n aij bij = tr(A^T B).

◮ Denote the family of symmetric n × n real matrices by Mn, and the condition that Y ∈ Mn be positive semidefinite by Y ⪰ 0.

◮ Let C, D1,...,Dk ∈ Mn and d1,...,dk ∈ R. Then the general semidefinite programming problem is

    max/min C • Y
    s.t. Di • Y = di, i = 1,...,k
         Y ⪰ 0, Y ∈ Mn.
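The two expressions for the Frobenius product are easy to confirm numerically (a sketch assuming NumPy; the random matrices are illustration data, not from the slides):

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((5, 5))
B = rng.standard_normal((5, 5))

frob_elementwise = float(np.sum(A * B))    # sum_ij a_ij * b_ij
frob_trace = float(np.trace(A.T @ B))      # tr(A^T B)
print(abs(frob_elementwise - frob_trace) < 1e-9)  # True: the two forms agree
```

The elementwise form is the one to use in practice, since it avoids the O(n^3) matrix product hidden in tr(A^T B).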

SLIDE 7

◮ E.g. the MAX CUT semidefinite program relaxation:

    max (1/2) ∑_{1≤i<j≤n} wij (1 − yij)
    s.t. yii = 1, i ∈ N
         Y ⪰ 0, Y ∈ Mn.

◮ Or, equivalently:

    min W • Y
    s.t. Di • Y = 1, i ∈ N
         Y ⪰ 0, Y ∈ Mn,

where Y = [yij]_ij, W = [wij]_ij, and Di is the matrix whose only nonzero entry is a 1 in position (i,i).

SLIDE 8

Properties of positive semidefinite matrices

Let A be a real, symmetric n × n matrix. Then A has n (not necessarily distinct) real eigenvalues, and n associated linearly independent eigenvectors.

Theorem 1. Let A ∈ Mn. Then the following are equivalent:

  • 1. x^T A x ≥ 0 for all x ∈ R^n.
  • 2. All eigenvalues of A are nonnegative.
  • 3. A = W^T W for some W ∈ R^{n×n}.
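The three conditions can be exercised numerically (a sketch assuming NumPy; the random matrix is illustration data): a matrix built as W^T W passes both the eigenvalue test and the quadratic-form test.

```python
import numpy as np

rng = np.random.default_rng(1)
W = rng.standard_normal((4, 4))
A = W.T @ W  # condition (3): A is a matrix of inner products of W's columns

# Condition (2): all eigenvalues nonnegative (eigvalsh is for symmetric input).
eigs = np.linalg.eigvalsh(A)
print(eigs.min() >= -1e-10)  # True

# Condition (1): x^T A x >= 0, spot-checked on random vectors.
xs = rng.standard_normal((100, 4))
print(all(x @ A @ x >= -1e-10 for x in xs))  # True
```

The small tolerance -1e-10 absorbs floating-point round-off; in exact arithmetic the bounds are exactly 0.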

SLIDE 9

Proof (1 ⇒ 2).

◮ Let λ be an eigenvalue of A, and v a corresponding eigenvector.

◮ Then Av = λv and v^T A v = λ v^T v.

◮ By assumption (1), v^T A v ≥ 0, and since v^T v > 0, necessarily λ ≥ 0.

SLIDE 10

Proof (2 ⇒ 3).

◮ Decompose A as A = QΛQ^T, where Q is orthogonal and Λ = diag(λ1,...,λn), with λ1,...,λn the n eigenvalues of A.

◮ Since by assumption (2), λi ≥ 0 for each i, we can further decompose Λ = DD^T, where D = diag(√λ1,...,√λn).

◮ Denote W = (QD)^T. Then A = QΛQ^T = QDD^T Q^T = W^T W.
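The construction in this proof is directly computable (a sketch assuming NumPy; the random PSD matrix is illustration data):

```python
import numpy as np

rng = np.random.default_rng(2)
M = rng.standard_normal((4, 4))
A = M @ M.T  # a symmetric positive semidefinite matrix to decompose

# Spectral decomposition A = Q Lambda Q^T (eigh returns ascending eigenvalues).
lam, Q = np.linalg.eigh(A)
D = np.diag(np.sqrt(np.clip(lam, 0.0, None)))  # clip guards tiny negative round-off
W = (Q @ D).T  # W = (QD)^T, as in the proof

print(np.allclose(W.T @ W, A))  # True: A = Q D D^T Q^T = W^T W
```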

SLIDE 11

Proof (3 ⇒ 1).

◮ By assumption (3), A can be decomposed as A = W^T W.

◮ Then for any x ∈ R^n:

    x^T A x = x^T W^T W x = (Wx)^T (Wx) ≥ 0.

SLIDE 12

From vector programs to semidefinite programs

Given a vector program V, define a corresponding semidefinite program S on the inner product matrix of the vector variables, as described earlier.

Corollary 2. Vector program V and semidefinite program S are equivalent (have essentially the same feasible solutions).

Proof. Let v1,...,vn be a feasible solution to V. Let W be a matrix with columns v1,...,vn. Then Y = W^T W is a feasible solution to S with the same objective function value as v1,...,vn.

Conversely, let Y be a feasible solution to S. By Theorem 1 (iii), Y can be decomposed as Y = W^T W. Let v1,...,vn be the columns of W. Then v1,...,vn is a feasible solution to V with the same objective function value as Y.

SLIDE 13

Notes on computation

◮ Using Cholesky decomposition, a matrix A ∈ Mn can be decomposed in polynomial time as A = UΛU^T, where Λ is a diagonal matrix whose entries are the eigenvalues of A.

◮ By Theorem 1 (ii), this gives a polynomial time test for positive semidefiniteness.

◮ The decomposition of Theorem 1 (iii), A = W^T W, is not in general polynomial time computable, because W may contain irrational entries. It may however be approximated efficiently to arbitrary precision. In the following this slight inaccuracy is ignored.

◮ Note also that any convex combination of positive semidefinite matrices is again positive semidefinite.
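Both remarks can be spot-checked with a numerical PSD test (a sketch assuming NumPy; in floating point this stands in for the exact polynomial-time test of the slide, and the random matrices are illustration data):

```python
import numpy as np

def is_psd(A, tol=1e-10):
    """Symmetry plus a nonnegative spectrum (Theorem 1 (ii)) as a PSD test."""
    return bool(np.allclose(A, A.T) and np.linalg.eigvalsh(A).min() >= -tol)

rng = np.random.default_rng(3)
M1, M2 = rng.standard_normal((4, 4)), rng.standard_normal((4, 4))
A1, A2 = M1 @ M1.T, M2 @ M2.T  # two PSD matrices

# Any convex combination of PSD matrices is again PSD.
t = 0.3
print(is_psd(t * A1 + (1 - t) * A2))  # True
print(is_psd(-A1))                    # False: -A1 has negative eigenvalues
```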

SLIDE 14

◮ Semidefinite programs can be solved (to arbitrary accuracy) by the ellipsoid algorithm.

◮ To validate this, it suffices to show the existence of a polynomial time separation oracle.

Theorem 3. Let S be a semidefinite program and A ∈ R^{n×n}. One can determine in polynomial time whether A is feasible for S and, if not, find a separating hyperplane.

SLIDE 15

Proof. A is feasible for S if it is symmetric, positive semidefinite, and satisfies all of the linear constraints. Each of these conditions can be tested in polynomial time. In the case of an infeasible A, a separating hyperplane can be determined as follows:

◮ If A is not symmetric, then aij > aji for some i, j. Then yij ≤ yji is a separating hyperplane.

◮ If A is not positive semidefinite, then it has a negative eigenvalue, say λ. Let v be a corresponding eigenvector. Then (vv^T) • Y = v^T Y v ≥ 0 is a separating hyperplane.

◮ If any of the linear constraints is violated, it directly yields a separating hyperplane.
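The case analysis of the proof can be sketched as code (assuming NumPy; this floating-point version only illustrates the structure of the oracle, and the function name, the `(D, d)` constraint encoding, and the returned hyperplane descriptions are all hypothetical):

```python
import numpy as np

def separation_oracle(A, constraints, tol=1e-9):
    """Case analysis from the proof of Theorem 3. `constraints` is a list of
    (D_i, d_i) pairs. Returns None if A is feasible, else a hyperplane description."""
    # Case 1: not symmetric -> some a_ij > a_ji gives the hyperplane y_ij <= y_ji.
    if not np.allclose(A, A.T, atol=tol):
        i, j = np.unravel_index(np.argmax(A - A.T), A.shape)
        return ("y[%d,%d] <= y[%d,%d]" % (i, j, j, i),)
    # Case 2: negative eigenvalue -> the hyperplane (v v^T) . Y >= 0.
    lam, Q = np.linalg.eigh(A)
    if lam[0] < -tol:
        return ("(v v^T) . Y >= 0", Q[:, 0])
    # Case 3: a violated linear constraint D_i . Y = d_i is itself a hyperplane.
    for D, d in constraints:
        if abs(np.sum(D * A) - d) > tol:
            return ("violated: D . Y = %g" % d, D)
    return None  # feasible

# Diagonal constraints y_ii = 1, as in the MAX CUT relaxation.
cons = []
for i in range(3):
    D = np.zeros((3, 3)); D[i, i] = 1.0
    cons.append((D, 1.0))

print(separation_oracle(np.eye(3), cons))           # None: identity is feasible
print(separation_oracle(-np.eye(3), cons) is None)  # False: -I is not PSD
```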

SLIDE 16

  • 3. Randomised Rounding of Vector Programs

◮ Recall the outline of the present approximation scheme:

  • 1. Formulate the problem of interest as a strict quadratic program P.
  • 2. Relax P into a vector program V.
  • 3. Reformulate V as a semidefinite program S and solve (approximately) using the ellipsoid method.
  • 4. Round the solution of V back into P by projecting it on some 1-dimensional subspace.

◮ We shall now address the fourth task, using the MAX CUT program as an example.

SLIDE 17

Randomised rounding for MAX CUT

◮ Let v1,...,vn ∈ S^{n−1} be an optimal solution to the MAX CUT vector program, and let OPTv be its objective function value. We want to obtain a cut (S, S̄) whose weight is a large fraction of OPTv.

◮ The contribution of a pair of vectors vi, vj (i < j) to OPTv is (wij/2)(1 − cos θij), where θij denotes the (unsigned) angle between vi and vj.

◮ We would like vertices i, j to be separated by the cut if the angle θij is large (close to π).

SLIDE 18

Here is an idea: pick a vector r on the unit sphere S^{n−1} uniformly at random, and define the cut by:

    S = {i | vi^T r ≥ 0},  S̄ = {i | vi^T r < 0}.

Theorem 4. For any pair of vertices i, j:

    Pr[i and j are separated by the cut] = θij / π.

Proof. Let r′ be the projection of r onto the plane containing vectors vi and vj. Vertices i and j are separated iff vi and vj have “different orientation” w.r.t. r′, i.e. are on opposite sides of the normal line determined by r′, i.e. the normal line falls in the angle of width θij between vi and vj. Since r has been picked from a spherically symmetric distribution, r′ determines a random direction in the plane. The theorem follows.
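Theorem 4 can be checked empirically (a sketch assuming NumPy; the angle and sample size are illustration choices, not from the slides). For two unit vectors in the plane at angle θ, the fraction of random hyperplanes separating them should approach θ/π:

```python
import numpy as np

rng = np.random.default_rng(4)
theta = 2.0  # an arbitrary angle in (0, pi)
vi = np.array([1.0, 0.0])
vj = np.array([np.cos(theta), np.sin(theta)])

trials = 200_000
r = rng.standard_normal((trials, 2))  # only the direction of r matters
separated = np.sign(r @ vi) != np.sign(r @ vj)
estimate = separated.mean()
print(abs(estimate - theta / np.pi) < 0.01)  # True, up to sampling error
```

Note the Gaussian r need not be normalised here: the sign of vi^T r is unchanged by scaling r.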

SLIDE 19

A technical issue: how to generate n-dimensional unit vectors u.a.r.?

Lemma 5. Let x1,...,xn be independent N(0,1) distributed random variables, and let d = (x1^2 + ··· + xn^2)^{1/2}. Then the random vector r = (x1/d,...,xn/d) has uniform distribution on S^{n−1}.

Proof. Random vector x = (x1,...,xn) has density

    f(x1,...,xn) = ∏_{i=1}^n (1/√(2π)) e^{−xi^2/2} = (2π)^{−n/2} e^{−(1/2) ∑i xi^2}.

Since the density depends only on the distance from the origin, the distribution of x is spherically symmetric. Hence, dividing by the length of x, i.e. d, yields a uniformly distributed random vector on S^{n−1}.
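Lemma 5 translates into a two-line generator (a sketch assuming NumPy; the function name is ours):

```python
import numpy as np

def random_unit_vector(n, rng):
    """Lemma 5: normalise a vector of n independent N(0,1) coordinates."""
    x = rng.standard_normal(n)
    return x / np.linalg.norm(x)

rng = np.random.default_rng(5)
r = random_unit_vector(10, rng)
print(abs(np.linalg.norm(r) - 1.0) < 1e-12)  # True: r lies on S^{n-1}
```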

SLIDE 20

◮ Now let us consider how close to OPTv the weight of our random cut is likely to be.

◮ Let W be a random variable denoting the weight of the cut, i.e.

    W = ∑_{1≤i<j≤n} wij · I[i and j are separated by the cut].

◮ Also, denote

    α = (2/π) · min_{0<θ≤π} θ / (1 − cos θ).

By elementary calculus, α > 0.87856.

Theorem 6. E[W] ≥ α · OPTv.
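The “elementary calculus” claim about α can be sketched numerically by minimising over a fine grid (assuming NumPy; the grid resolution is an illustration choice):

```python
import numpy as np

# alpha = (2/pi) * min_{0 < theta <= pi} theta / (1 - cos theta).
# Near theta = 0 the ratio blows up like 2/theta, so the minimum is interior.
theta = np.linspace(1e-6, np.pi, 1_000_000)
alpha = ((2.0 / np.pi) * theta / (1.0 - np.cos(theta))).min()
print(0.87856 < alpha < 0.87858)  # True: the Goemans-Williamson constant
```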

SLIDE 21

Proof. By the definition of α,

    θ/π ≥ α · (1 − cos θ)/2

for any θ, 0 < θ ≤ π. Thus, by Theorem 4:

    E[W] = ∑_{1≤i<j≤n} wij · Pr[i and j are separated by the cut]
         = ∑_{1≤i<j≤n} wij · θij/π
         ≥ α · ∑_{1≤i<j≤n} wij · (1/2)(1 − cos θij)
         = α · OPTv.

SLIDE 22

By using repeated trials, this result can be strengthened:

Theorem 7. There is a randomised approximation algorithm for MAX CUT that with “arbitrarily high probability” achieves approximation factor > 0.87856.
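The repeated-trials rounding step can be sketched end to end on a toy instance (assuming NumPy, and assuming the optimal vector-program solution is known rather than computed by an SDP solver: for the unit-weight triangle, three plane vectors at mutual angle 2π/3 are optimal, with OPTv = 2.25):

```python
import numpy as np

angles = np.array([0.0, 2 * np.pi / 3, 4 * np.pi / 3])
V = np.stack([np.cos(angles), np.sin(angles)], axis=1)  # rows v_1, v_2, v_3
w = np.zeros((3, 3))
for i, j in [(0, 1), (0, 2), (1, 2)]:
    w[i][j] = 1.0

def cut_weight(w, side):
    """Weight of the cut defined by the boolean membership vector `side`."""
    return sum(w[i][j] for i in range(3) for j in range(i + 1, 3)
               if side[i] != side[j])

rng = np.random.default_rng(6)
best = 0.0
for _ in range(50):  # repeated trials: keep the heaviest cut found
    r = rng.standard_normal(2)  # random hyperplane normal (Lemma 5, unnormalised)
    side = (V @ r >= 0)
    best = max(best, cut_weight(w, side))

opt_v = 0.5 * sum(w[i][j] * (1 - V[i] @ V[j])
                  for i in range(3) for j in range(i + 1, 3))
print(best, best >= 0.87856 * opt_v)  # 2.0 True: the optimum cut of the triangle
```

For this instance every hyperplane through the origin yields a cut of weight 2 (the triangle's maximum), comfortably above α · OPTv ≈ 1.977.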
