Talk 1: my review of nonlinear nonconvex optimization Back to the - PowerPoint PPT Presentation

Talk 1: my review of nonlinear nonconvex optimization

Back to the pooling problem We are given a directed, acyclic graph with three classes of vertices inputs pools, (mixing units) outputs

inputs pools, (mixing units) outputs 1. We have K commodities (’specs’) present at the inputs in different amounts. 2. Flows have to be routed to the outputs subject to flow conservation and capacity constraints. 3. Flows that reach a pool become mixed , and the proportion of each spec is upper- and lower-bounded. 4. Optimize a linear function of the flows. Usual version: capacity constraints and costs are on total flows, not per-spec

Formulation • I = set of inputs, M = set of pools, • λ ik = fraction of spec k at input i (data) � ← y ij = total flow on ij min c ij y ij ij ∈ A s.t. flow conservation , capacity constraints on y ij and for all spec k , pool j , � i ∈ I λ ik y ij + � m ∈ M p mk y mj ← p jk = fraction of spec k in pool j p jk = � i ∈ I ∪ M y ij p min ≤ p jk ≤ p max jk jk

Problem 2: AC-PF and -OPF problems on power grids generators demands (loads) • Graph is undirected • Each power line has a (complex) admittance • Send power from generators to loads, subject to laws of physics and equip- ment constraints

Physics • Each bus (node) k has a complex voltage V k . Voltage = potential energy • Line (directed version of edge) km → complex current I km I km = y km ( V k − V m ) (y = admittance) • Line (directed version of edge) km → complex power S km S km = V k I ∗ km = y ∗ km V k ( V k − V m ) ∗ this is the complex power injected into km at k • Generators produce current at a certain voltage • Demands (loads) expressed in units of complex power • This is a time-averaged (steady-state) representation

Formulation • Must choose voltage V k at every bus k • Network constraints: total net power injected by each bus is constrained km V k ( V k − V m ) ∗ ≤ S max � S min y ∗ ≤ k k km ∈ δ ( k ) (two ranged inequalities) 1. At a generator, this says that total generated complex power is upper and lower bounded 2. At a load, S min = S max = − (complex) demand k k • Line constraints: e.g. | y ∗ km V k ( V k − V m ) ∗ | ≤ L km • Voltage constraints: U min ≤ | V k | ≤ U max k k

• V k = voltage bus k • Network constraints: total net power injected by each bus is constrained ≤ S k . km V k ( V k − V m ) ∗ ≤ S max � S min y ∗ = k k km ∈ δ ( k ) • Line constraints: | y ∗ km V k ( V k − V m ) ∗ | ≤ L km • Voltage constraints: U min ≤ | V k | ≤ U max k k 1. Feasibility version: PF or power flow problem 2. Optimization version, or OPF : � min c g ( R e( S g )) g ∈ G ( G = set of generator nodes) Each function c g is convex quadratic. Want to minimize total cost of generation.

A generalization - network polynomial problems Both the pooling problem and ACOPF are special cases of a general problem • We are given an undirected graph G • For each node u ∈ G there is an associated set of variables, X u . Assume pairwise-disjoint. • Likewise each constraint is associated with some node. A constraint associated with u takes the form: � p u,v ( X u ∪ X v ) ≥ 0 { u,v }∈ δ ( u ) where each p u,v is a polynomial function. u v polynomial depends on X u and X v

How to solve QCQPs?

How to solve QCQPs? → IPOPT? (W¨ achter, Biegler, Laird)

How to solve QCQPs? → IPOPT? (W¨ achter, Biegler, Laird) min f ( x ) s.t. g ( x ) = 0 x ≥ 0

How to solve QCQPs? → IPOPT? (W¨ achter, Biegler, Laird) min f ( x ) s.t. g ( x ) = 0 x ≥ 0 � → min f ( x ) − µ log( x i ) (3a) i g ( x ) = 0 s.t. (3b) Here µ > 0 is the barrier parameter, and we want µ → 0 .

How to solve QCQPs? → IPOPT? (W¨ achter, Biegler, Laird) min f ( x ) s.t. g ( x ) = 0 x ≥ 0 � → min f ( x ) − µ log( x i ) (4a) i g ( x ) = 0 s.t. (4b) Here µ > 0 is the barrier parameter, and we want µ → 0 . Algorithm 1. For given µ approximately solve problem (4a), (4b). 2. Effectively, attempt to find a solution to the first-order optimality condi- tions for (4a), (4b): (damped) Newton method 3. Then decrease µ and go to 1. 4. But a lot of cleverness employed in Step 3 (filter method).

How to solve QCQPs? → IPOPT? (W¨ achter, Biegler, Laird) optimum sequence produced by algorithm

How to solve QCQPs? → IPOPT? (W¨ achter, Biegler, Laird) optimum sequence produced by algorithm Claim: IPOPT globally solves all ACOPF instances

How to solve QCQPs? → IPOPT? (W¨ achter, Biegler, Laird) optimum sequence produced by algorithm Claim: IPOPT globally solves all ACOPF instances What does this mean?

Three basic techniques 1. McCormick relaxation 2. Spatial branch-and-bound 3. RLT: lifting to higher-dimensional representation

McCormick relaxation: a very widely used technique McCormick (1976), Al-Khayal and Falk (1983) given: x ∈ [ ℓ x , u x ] , y ∈ [ ℓ y , u y ] , z = xy The convex hull of ( x, y, z ) in this set is given by z ≥ max { u y x + u x y − u y u x , ℓ y x + ℓ x y − ℓ y ℓ x } z ≤ min { u y x + ℓ x y − u y ℓ x , ℓ y x + u x y − ℓ y u x } . • Can be used directly to reformulate any polynomial optimization problem • But some codes avoid this so as to not introduce the variables w • And the quality of the relaxation is in general poor • Unless the bounds ℓ x , u x or ℓ y , u y are tight

Spatial Branch-and-Bound: a very widely used technique Tuy, 1998 • Used in many codes, e.g. BARON • Directly applicable to McCormick relaxations Example: approximate sin( x ) for 0 ≤ x ≤ π/ 2

Branch at x = π/ 4 : 0 ≤ x ≤ π/ 4 π/ 4 ≤ x ≤ π/ 2

RLT: another very widely used technique Sherali and Adams (1992) Example: Suppose 5 x 2 1 + 2 x 2 − 4 ≥ 0 and 0 ≤ x 3 ≤ 10 are valid inequalities Then: (5 x 2 1 + 2 x 2 − 4) x 3 ≥ 0 and (5 x 2 1 + 2 x 2 − 4)(10 − x 3 ) ≥ 0 also valid x 2 • Any nonlinear terms, e.g. 1 x 3 are linearized via McCormick • It may be the case that the nonlinear terms are already found elsewhere • General idea: multiplication of valid inequalities • Which inequalities: using all is too expensive • (Misener): scan possible products, keep if estimate of relaxation improves Back to McCormick: x ∈ [ ℓ x , u x ] , y ∈ [ ℓ y , u y ] , z = xy e.g. can do ( x − ℓ x )( u y − y ) ≥ 0 or u y x + ℓ x y − ℓ x u y ≥ xy

Hierarchies min x T Qx + 2 c T x (QCQP): x T A i x + 2 b T s.t. i x + r i ≥ 0 i = 1 , . . . , m x ∈ R n . → form the semidefinite relaxation � 0 c T � (SR): min • X c Q � r i b T � i s.t. • X ≥ 0 i = 1 , . . . , m b i A i X � 0 , X 00 = 1 . Here, for symmetric matrices M , N , � M • N = M hk N hk h,k So if SR has a rank-1 solution , the lower bound is exact . Unfortunately, SR typically does not have a rank-1 solution. Why? • → Lavaei and Low (2010): on ACOPF , the semidefinite relaxation is often strong • And it may even have a rank-1 solution. • There remains the issue of solving the d***n SDP

Moment relaxations and polynomial optimization Consider the polynomial optimization problem . f ∗ x ∈ R n } , = min { f 0 ( x ) : f i ( x ) ≥ 0 , 1 ≤ i ≤ m, 0 π ∈ S ( i ) a i,π x π . f i ( x ) = � where each f i ( x ) is a polynomial i.e. . = x π 1 1 x π 2 • Each π is a tuple π 1 , π 2 , . . . , π n of nonnegative integers , and x π . . . x π n 2 n • Each S ( i ) is a finite set of tuples , and the a i,π are reals. inf µ E µ f 0 ( x ) , over all measures µ over K . = { x ∈ R n : f i ( x ) ≥ 0 , 1 ≤ i ≤ m } . We know f ∗ = 0 � � � f ∗ i.e. = inf π ∈ S (0) a 0 ,π y π : y is a K -moment 0 Here, y is a K -moment if there is a measure µ over K with y π = E µ x π for each tuple π

Polynomial optimization Consider the polynomial optimization problem . f ∗ x ∈ R n } , = min { f 0 ( x ) : f i ( x ) ≥ 0 , 1 ≤ i ≤ m, 0 π ∈ S ( i ) a i,π x π . f i ( x ) = � where each f i ( x ) is a polynomial i.e. . = x π 1 1 x π 2 • Each π is a tuple π 1 , π 2 , . . . , π n of nonnegative integers , and x π . . . x π n 2 n • Each S ( i ) is a finite set of tuples , and the a i,π are reals. inf µ E µ f 0 ( x ) , over all measures µ over K . = { x ∈ R n : f i ( x ) ≥ 0 , 1 ≤ i ≤ m } . We know f ∗ = 0 � � � f ∗ i.e. = inf π ∈ S (0) a 0 ,π y π : y is a K -moment 0 Here, y is a K -moment if there is a measure µ over K with y π = E µ x π for each tuple π (Cough! Here, y is an infinite-dimensional vector ) .

Polynomial optimization Consider the polynomial optimization problem . f ∗ x ∈ R n } , = min { f 0 ( x ) : f i ( x ) ≥ 0 , 1 ≤ i ≤ m, 0 π ∈ S ( i ) a i,π x π . f i ( x ) = � where each f i ( x ) is a polynomial i.e. . = x π 1 1 x π 2 • Each π is a tuple π 1 , π 2 , . . . , π n of nonnegative integers , and x π . . . x π n 2 n • Each S ( i ) is a finite set of tuples , and the a i,π are reals. inf µ E µ f 0 ( x ) , over all measures µ over K . = { x ∈ R n : f i ( x ) ≥ 0 , 1 ≤ i ≤ m } . We know f ∗ = 0 � � � f ∗ i.e. = inf π ∈ S (0) a 0 ,π y π : y is a K -moment 0 Here, y is a K -moment if there is a measure µ over K with y π = E µ x π for each tuple π (Cough! Here, y is an infinite-dimensional vector ) . Can we make an easier statement?

Talk 1: my review of nonlinear nonconvex optimization Back to the - PowerPoint PPT Presentation

Talk 1: my review of nonlinear nonconvex optimization Back to the pooling problem We are given a directed, acyclic graph with three classes of vertices inputs pools, (mixing units) outputs inputs pools, (mixing units) outputs 1. We have K

Stochastic Cubic Regularization for Fast Nonconvex Optimization Nilesh Tripuraneni, Mitchell

Nonlinear Control Lecture # 31 Nonlinear Observers Nonlinear Control Lecture # 31 Nonlinear

Nonlinear Control Lecture # 22 Special nonlinear Forms Nonlinear Control Lecture # 22 Special

Nonlinear Control Lecture # 21 Special nonlinear Forms Nonlinear Control Lecture # 21 Special

Interior point method for nonlinear nonconvex optimization Ladislav Lukan, Ctirad Matonoha, Jan

Welcome back... Welcome back... ..to me. Welcome back... ..to me. Test out Welcome back...

Nonconvex Demixing from Bilinear Measurements Yuanming Shi 1 Outline Motivations Blind

Nonlinear Control Lecture # 8 Special nonlinear Forms Nonlinear Control Lecture # 8 Special

Nonlinear Control Lecture # 12 Nonlinear Observers and Output Feedback Stabilization Nonlinear

Nonlinear Control Lecture # 20 Special nonlinear Forms Nonlinear Control Lecture # 20 Special

A Lagrangean Based Branch-and-Cut Algorithm for Global Optimization of Nonconvex Mixed-Integer

Implicit Regularization in Nonconvex Statistical Estimation Yuxin Chen Electrical Engineering,

PRACTICAL AUGMENTED LAGRANGIAN METHODS FOR NONCONVEX PROBLEMS Jos e Mario Mart nez

Pathwise Coordinate Optimization for Nonconvex Sparse Learning Tuo Zhao

Accelerated Douglas-Rachford splitting and ADMM for structured nonconvex optimization Panos

Nonlinear Optimization: Discrete optimization INSEAD, Spring 2006 Jean-Philippe Vert Ecole des

Dejeans conjecture and letter frequency Jrmie Chalopin Pascal Ochem LaBRI, Universit

The Price of Routing Unsplittable Flow Baruch Awerbuch Yossi Azar Amir Epstein presented by

Frequency Downshift in a Viscous Fluid John D. Carter carterj1@seattleu.edu Frequency Downshift

Council 3 April 2019 Welcoming Communities Journey Implementation Councils and communities

Cost sharing and allowance allocation in a nutrient trading system for the Lake Rotorua

Geometric View to Deep Learning David Xianfeng Gu 1 1 Computer Science & Applied Mathematics

Discrete planes: an arithmetic and dynamical approach V. Berth e LIAFA-CNRS-Paris-France

Plosives 0.9 0 -0.5012 0 4.1054 Time (s) 3000 0 0 4.1054 Time (s) [aba] [ada] [aa]

Talk 1: my review of nonlinear nonconvex optimization Back to the - PowerPoint PPT Presentation

Talk 1: my review of nonlinear nonconvex optimization Back to the pooling problem We are given a directed, acyclic graph with three classes of vertices inputs pools, (mixing units) outputs inputs pools, (mixing units) outputs 1. We have K

Stochastic Cubic Regularization for Fast Nonconvex Optimization Nilesh Tripuraneni, Mitchell

Nonlinear Control Lecture # 31 Nonlinear Observers Nonlinear Control Lecture # 31 Nonlinear

Nonlinear Control Lecture # 22 Special nonlinear Forms Nonlinear Control Lecture # 22 Special

Nonlinear Control Lecture # 21 Special nonlinear Forms Nonlinear Control Lecture # 21 Special

Interior point method for nonlinear nonconvex optimization Ladislav Lukan, Ctirad Matonoha, Jan

Welcome back... Welcome back... ..to me. Welcome back... ..to me. Test out Welcome back...

Nonconvex Demixing from Bilinear Measurements Yuanming Shi 1 Outline Motivations Blind

Nonlinear Control Lecture # 8 Special nonlinear Forms Nonlinear Control Lecture # 8 Special

Nonlinear Control Lecture # 12 Nonlinear Observers and Output Feedback Stabilization Nonlinear

Nonlinear Control Lecture # 20 Special nonlinear Forms Nonlinear Control Lecture # 20 Special

A Lagrangean Based Branch-and-Cut Algorithm for Global Optimization of Nonconvex Mixed-Integer

Implicit Regularization in Nonconvex Statistical Estimation Yuxin Chen Electrical Engineering,

PRACTICAL AUGMENTED LAGRANGIAN METHODS FOR NONCONVEX PROBLEMS Jos e Mario Mart nez

Pathwise Coordinate Optimization for Nonconvex Sparse Learning Tuo Zhao

Accelerated Douglas-Rachford splitting and ADMM for structured nonconvex optimization Panos

Nonlinear Optimization: Discrete optimization INSEAD, Spring 2006 Jean-Philippe Vert Ecole des

Dejeans conjecture and letter frequency Jrmie Chalopin Pascal Ochem LaBRI, Universit

The Price of Routing Unsplittable Flow Baruch Awerbuch Yossi Azar Amir Epstein presented by

Frequency Downshift in a Viscous Fluid John D. Carter carterj1@seattleu.edu Frequency Downshift

Council 3 April 2019 Welcoming Communities Journey Implementation Councils and communities

Cost sharing and allowance allocation in a nutrient trading system for the Lake Rotorua

Geometric View to Deep Learning David Xianfeng Gu 1 1 Computer Science &amp; Applied Mathematics

Discrete planes: an arithmetic and dynamical approach V. Berth e LIAFA-CNRS-Paris-France

Plosives 0.9 0 -0.5012 0 4.1054 Time (s) 3000 0 0 4.1054 Time (s) [aba] [ada] [aa]

Geometric View to Deep Learning David Xianfeng Gu 1 1 Computer Science & Applied Mathematics