ALADIN—An Algorithm for Distributed Non-Convex Optimization and Control

Boris Houska, Yuning Jiang, Janick Frasch, Rien Quirynen, Dimitris Kouzoupis, Moritz Diehl

ShanghaiTech University, University of Magdeburg, University of Freiburg
Motivation: sensor network localization

Decoupled case: each sensor takes a measurement $\eta_i$ of its position $\chi_i$ and solves

$$\min_{\chi_i} \; \|\chi_i - \eta_i\|_2^2 \qquad \forall i \in \{1, \dots, 7\}.$$
Motivation: sensor network localization

Coupled case: the sensors additionally measure the distances $\bar\eta_i$ to their neighbors:

$$\min_{\chi} \; \sum_{i=1}^{7} \left\{ \|\chi_i - \eta_i\|_2^2 + \big( \|\chi_i - \chi_{i+1}\|_2 - \bar\eta_i \big)^2 \right\} \qquad \text{with } \chi_8 = \chi_1.$$
Motivation: sensor network localization

Equivalent formulation: set $x_1 = (\chi_1, \zeta_1)$ with $\zeta_1 = \chi_2$, set $x_2 = (\chi_2, \zeta_2)$ with $\zeta_2 = \chi_3$, and so on.
Motivation: sensor network localization

Equivalent formulation (cont.):
• new variables $x_i = (\chi_i, \zeta_i)$
• separable non-convex objectives
$$f_i(x_i) = \tfrac{1}{2} \|\chi_i - \eta_i\|_2^2 + \tfrac{1}{2} \|\zeta_i - \eta_{i+1}\|_2^2 + \tfrac{1}{2} \big( \|\chi_i - \zeta_i\|_2 - \bar\eta_i \big)^2$$
• affine coupling $\zeta_i = \chi_{i+1}$, which can be written as $\sum_{i=1}^{7} A_i x_i = 0$.
Motivation: sensor network localization

Optimization problem:

$$\min_{x} \; \sum_{i=1}^{7} f_i(x_i) \qquad \text{s.t.} \qquad \sum_{i=1}^{7} A_i x_i = 0.$$
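For concreteness, this problem is small enough to set up directly. Below is a minimal sketch assuming synthetic measurement data; the names `eta` and `eta_bar` and the block layout of the `A_i` are illustrative choices, not fixed by the talk:

```python
import numpy as np

N, d = 7, 2        # 7 sensors with planar positions
n = 2 * d          # x_i = (chi_i, zeta_i) lives in R^4
m = N * d          # one R^2 coupling block per sensor

# Synthetic measurements (illustrative): eta[i] is sensor i's position
# measurement, eta_bar[i] its measured distance to sensor i+1.
rng = np.random.default_rng(0)
eta = rng.standard_normal((N, d))
eta_bar = 1.0 + rng.random(N)

def f(i, x):
    """Separable non-convex objective f_i(x_i) with x_i = (chi_i, zeta_i)."""
    chi, zeta = x[:d], x[d:]
    return (0.5 * np.sum((chi - eta[i]) ** 2)
            + 0.5 * np.sum((zeta - eta[(i + 1) % N]) ** 2)
            + 0.5 * (np.linalg.norm(chi - zeta) - eta_bar[i]) ** 2)

# Coupling zeta_i = chi_{i+1} (with chi_8 = chi_1) as sum_i A_i x_i = 0;
# constraint block i encodes zeta_i - chi_{i+1} = 0.
A = [np.zeros((m, n)) for _ in range(N)]
for i in range(N):
    A[i][i * d:(i + 1) * d, d:] = np.eye(d)       # +zeta_i in block i
    j = (i - 1) % N                               # chi_i appears in block i-1
    A[i][j * d:(j + 1) * d, :d] = -np.eye(d)      # -chi_i in block i-1

# Sanity check: any point with zeta_i = chi_{i+1} satisfies the coupling.
chi = rng.standard_normal((N, d))
x_blocks = [np.concatenate([chi[i], chi[(i + 1) % N]]) for i in range(N)]
print(np.allclose(sum(Ai @ xi for Ai, xi in zip(A, x_blocks)), 0))  # True
```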
Aim of distributed optimization algorithms

Find local minimizers of

$$\min_{x} \; \sum_{i=1}^{N} f_i(x_i) \qquad \text{s.t.} \qquad \sum_{i=1}^{N} A_i x_i = b.$$

• Functions $f_i : \mathbb{R}^n \to \mathbb{R}$ potentially non-convex.
• Matrices $A_i \in \mathbb{R}^{m \times n}$ and vector $b \in \mathbb{R}^m$ given.
• Problem: $N$ is large.
Overview

• Theory
  - Distributed optimization algorithms
  - ALADIN
• Applications
  - Sensor network localization
  - MPC with long horizons
Distributed optimization problem

Find local minimizers of

$$\min_{x} \; \sum_{i=1}^{N} f_i(x_i) \qquad \text{s.t.} \qquad \sum_{i=1}^{N} A_i x_i = b.$$

• Functions $f_i : \mathbb{R}^n \to \mathbb{R}$ potentially non-convex.
• Matrices $A_i \in \mathbb{R}^{m \times n}$ and vector $b \in \mathbb{R}^m$ given.
• Problem: $N$ is large.
Dual decomposition

Main idea: solve the dual problem

$$\max_{\lambda} \; d(\lambda) \qquad \text{with} \qquad d(\lambda) = \sum_{i=1}^{N} \min_{x_i} \left\{ f_i(x_i) + \lambda^T A_i x_i \right\} - \lambda^T b.$$

• Evaluation of $d$ can be parallelized.
• Applicable if the $f_i$ are (strictly) convex.
• For non-convex $f_i$: a duality gap is possible.

H. Everett. Generalized Lagrange multiplier method for solving problems of optimum allocation of resources, 1963.
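As an illustration, a minimal dual-decomposition sketch follows. Using SciPy's general-purpose `minimize` for the inner problems, the fixed step size `alpha`, and the iteration budget are assumptions of this sketch; as noted above, for non-convex $f_i$ the inner minimizers need not be global and the method may not converge:

```python
import numpy as np
from scipy.optimize import minimize

def dual_decomposition(f_list, A_list, b, lam0, alpha=0.1, iters=200):
    """Gradient ascent on the dual function d(lambda) defined above."""
    lam = lam0.copy()
    n = A_list[0].shape[1]
    xs = None
    for _ in range(iters):
        # Decoupled inner problems (one per agent, parallelisable):
        #   min_{x_i}  f_i(x_i) + lam^T A_i x_i
        xs = [minimize(lambda x, i=i: f_list[i](x) + lam @ (A_list[i] @ x),
                       np.zeros(n)).x
              for i in range(len(f_list))]
        # A (sub)gradient of d at lam is the coupling residual.
        residual = sum(Ai @ xi for Ai, xi in zip(A_list, xs)) - b
        lam = lam + alpha * residual
    return lam, xs
```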
ADMM (consensus variant)

Alternating Direction Method of Multipliers

Input: initial guesses $x_i \in \mathbb{R}^n$ and $\lambda_i \in \mathbb{R}^m$; $\rho > 0$, $\epsilon > 0$.

Repeat:
1. Solve the decoupled NLPs
$$\min_{y_i} \; f_i(y_i) + \lambda_i^T A_i y_i + \frac{\rho}{2} \| A_i (y_i - x_i) \|_2^2.$$
2. Implement the dual gradient steps
$$\lambda_i^+ = \lambda_i + \rho A_i (y_i - x_i).$$
3. Solve the coupled QP
$$\min_{x^+} \; \sum_{i=1}^{N} \left\{ \frac{\rho}{2} \| A_i (y_i - x_i^+) \|_2^2 - (\lambda_i^+)^T A_i x_i^+ \right\} \qquad \text{s.t.} \qquad \sum_{i=1}^{N} A_i x_i^+ = b.$$
4. Update the iterates $x \leftarrow x^+$ and $\lambda \leftarrow \lambda^+$.

D. Gabay, B. Mercier. A dual algorithm for the solution of nonlinear variational problems via finite element approximations, 1976.
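The four steps translate almost line by line into code. The sketch below uses SciPy's `minimize` for the decoupled NLPs in step 1 and solves the coupled QP of step 3 through its KKT system; the small Tikhonov term `eps` (guarding against rank-deficient $A_i^T A_i$ blocks) and the assumption that the stacked coupling matrix $[A_1, \dots, A_N]$ has full row rank are choices of this sketch, not part of the algorithm statement:

```python
import numpy as np
from scipy.optimize import minimize

def consensus_admm(f_list, A_list, b, x_init, lam_init, rho=1.0, iters=50):
    """Sketch of steps 1-4 above; convergence is not guaranteed."""
    N = len(f_list)
    n = A_list[0].shape[1]
    m = b.size
    xs = [x.copy() for x in x_init]
    lams = [l.copy() for l in lam_init]
    for _ in range(iters):
        # 1. Decoupled NLPs (parallelisable across agents).
        ys = [minimize(lambda y, i=i: f_list[i](y)
                       + lams[i] @ (A_list[i] @ y)
                       + 0.5 * rho * np.sum((A_list[i] @ (y - xs[i])) ** 2),
                       xs[i]).x
              for i in range(N)]
        # 2. Dual gradient steps.
        lams = [lams[i] + rho * A_list[i] @ (ys[i] - xs[i]) for i in range(N)]
        # 3. Coupled QP: stationarity plus primal feasibility assembled as
        #    one KKT system in (x_1^+, ..., x_N^+, mu).
        eps = 1e-8
        K = np.zeros((N * n + m, N * n + m))
        rhs = np.zeros(N * n + m)
        for i in range(N):
            AtA = A_list[i].T @ A_list[i]
            K[i*n:(i+1)*n, i*n:(i+1)*n] = rho * AtA + eps * np.eye(n)
            K[i*n:(i+1)*n, N*n:] = A_list[i].T
            K[N*n:, i*n:(i+1)*n] = A_list[i]
            rhs[i*n:(i+1)*n] = rho * AtA @ ys[i] + A_list[i].T @ lams[i]
        rhs[N*n:] = b
        sol = np.linalg.solve(K, rhs)
        # 4. Update the iterates.
        xs = [sol[i*n:(i+1)*n] for i in range(N)]
    return xs, lams
```

On the sensor problem sketched earlier, `f_list` would collect the functions `f(i, ·)` and `A_list` the matrices `A`, with `b = np.zeros(m)`.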
Limitations of ADMM

1) The convergence rate of ADMM is very scaling dependent.
2) ADMM may diverge if the $f_i$ are non-convex.

Example: the problem

$$\min_{x} \; x_1 x_2 \qquad \text{s.t.} \qquad x_1 - x_2 = 0$$

has a unique and regular minimizer at $x_1^* = x_2^* = \lambda^* = 0$. For $\rho = \tfrac{3}{4}$ all sub-problems are strictly convex, yet ADMM diverges: $\lambda^+ = -2\lambda$.

This talk addresses Problem 2) and mitigates Problem 1).
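The recursion $\lambda^+ = -2\lambda$ is easy to check numerically. In the sketch below, the decoupled NLP of step 1 is solved exactly (it is a strictly convex QP for $\rho > 1/2$), so no NLP solver is needed:

```python
import numpy as np

# Divergence example: min x1*x2  s.t.  x1 - x2 = 0,
# i.e. f(y) = y1*y2 with A = [1, -1] and b = 0.
rho = 0.75                      # sub-problems strictly convex for rho > 1/2
A = np.array([[1.0, -1.0]])
lam = np.array([0.1])           # any nonzero multiplier guess
x = np.zeros(2)                 # feasible iterate (x1 = x2), kept fixed below

for k in range(6):
    # Step 1: min_y y1*y2 + lam*(y1 - y2) + rho/2*(y1 - y2 - (x1 - x2))^2
    # has Hessian [[rho, 1-rho], [1-rho, rho]]; solve its optimality system.
    H = np.array([[rho, 1.0 - rho], [1.0 - rho, rho]])
    g = A.T @ lam - rho * A.T @ (A @ x)
    y = np.linalg.solve(H, -g)
    # Step 2: dual gradient step.
    lam = lam + rho * A @ (y - x)
    # Step 3: the coupled QP only enforces x1^+ = x2^+, and its objective is
    # constant on that set, so keeping x = 0 is a valid update.
    print(k, lam)               # magnitude doubles, sign flips: lam^+ = -2*lam
```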