The Mixed-integer Conic Optimizer in MOSEK 23rd International - PowerPoint PPT Presentation

The Mixed-integer Conic Optimizer in MOSEK 23rd International Symposium on Mathematical Programming, July 2nd 2018, Bordeaux Sven Wiese www.mosek.com

Mixed-Integer Conic Optimization We consider problems of the form c T x minimize subject to Ax = b Z p × R n − p � � x ∈ K ∩ , where K is a convex cone. Typically, K = K 1 × K 2 × · · · × K K is a product of lower-dimensional cones - so-called conic building blocks. 1 / 29

What is MOSEK ? MOSEK is a Copenhagen-based company developing the homonymous software package since 1997. exponential Conic Optimization LP LP convex (MI)NLP (Mixed-integer) cones M general I P MOSEK convex power version 9 SOCP SOCP cones (convex QCP) (convex QCP) SDP SDP 2 / 29

What is MOSEK ? (cont.) MOSEK at ISMP 2018: • Henrik A. Friberg, Projection and presolve in MOSEK: exponential and power cones , Tue, 8:30AM • Joachim Dahl, o p e k t s o i m m R Extending MOSEK i z e r with exponential cones , A x R C Wed, 8:30AM o P b l o I o Matlab Julia T MOSEK • Erling D. Andersen, APIs MOSEK version 9 , C++ Python Wed, 3:15PM Java .NET • Micha� l Adamaszek, Exponential cone in MOSEK: F u overview and applications , s i o n Fri, 3:15PM 3 / 29

Symmetric cones (supported by MOSEK 8) • the nonnegative orthant + := { x ∈ R n | x j ≥ 0 , j = 1 , . . . , n } , R n • the quadratic cone Q n = { x ∈ R n | x 1 ≥ � 1 / 2 } , x 2 2 + · · · + x 2 � n • the rotated quadratic cone r = { x ∈ R n | 2 x 1 x 2 ≥ x 2 Q n 3 + . . . x 2 n , x 1 , x 2 ≥ 0 } . • the semidefinite matrix cone S n = { x ∈ R n ( n +1) / 2 | z T mat ( x ) z ≥ 0 , ∀ z } , √ √  x 2 / 2 . . . x n / 2  x 1 √ √ x 2 / 2 x n +1 . . . x 2 n − 1 / 2   with mat ( x ) :=  .   . . . . . .   . . .  √ √ x n / 2 x 2 n − 1 / 2 . . . x n ( n +1) / 2 4 / 29

Quadratic cones in dimension 3 x 1 x 1 x 3 x 3 x 2 x 2 5 / 29

Examples of quadratic cones • Absolute value: ( t , x ) ∈ Q 2 . | x | ≤ t ⇐ ⇒ • Euclidean norm: ( t , x ) ∈ Q n +1 , � x � 2 ≤ t ⇐ ⇒ • Second-order cone inequality: � Ax + b � 2 ≤ c T x + d ( c T x + d , Ax + b ) ∈ Q m +1 . ⇐ ⇒ 6 / 29

Examples of rotated quadratic cones • Squared Euclidean norm: � x � 2 (1 / 2 , t , x ) ∈ Q n +2 2 ≤ t ⇐ ⇒ . r • Convex quadratic inequality: (1 / 2) x T Qx ≤ c T x + d (1 / 2 , c T x + d , F T x ) ∈ Q k +2 ⇐ ⇒ r with Q = F T F , F ∈ R n × k . 7 / 29

Examples of rotated quadratic cones (cont.) • Convex hyperbolic function: √ 1 2) ∈ Q 3 x ≤ t , x > 0 ⇐ ⇒ ( x , t , r . • Convex negative rational power: √ 1 ( t , 1 2) ∈ Q 3 x 2 ≤ t , x > 0 ⇐ ⇒ 2 , s ) , ( x , s , r . • Square roots: √ x ≥ t , x ≥ 0 (1 2 , x , t ) ∈ Q 3 ⇐ ⇒ r . • Convex positive rational power: x 3 / 2 ≤ t , x ≥ 0 ( s , t , x ) , ( x , 1 / 8 , s ) ∈ Q 3 ⇐ ⇒ r . 8 / 29

Non-symmetric cones (in next MOSEK release) • the three-dimensional exponential cone K exp = cl { x ∈ R 3 | x 1 ≥ x 2 exp( x 3 / x 2 ) , x 2 > 0 } . • the three-dimensional power cone P α = { x ∈ R 3 | x α 1 x (1 − α ) ≥ | x 3 | , x 1 , x 2 ≥ 0 } , 2 for 0 < α < 1. Interior-point methods for non-symmetric cones are less studied, and less mature. 9 / 29

The exponential cone x 1 x 3 x 2 10 / 29

Examples of exponential cones • Expontial: e x ≤ t ⇐ ⇒ ( t , 1 , x ) ∈ K exp . • Logarithm: log x ≥ t ⇐ ⇒ ( x , 1 , t ) ∈ K exp . • Entropy: − x log x ≥ t ⇐ ⇒ (1 , x , t ) ∈ K exp . • Softplus function: log(1+ e x ) ≤ t ⇐ ⇒ ( u , 1 , x − t ) , ( v , 1 , − t ) ∈ K exp , u + v ≤ 1 . • Log-sum-exp: � � e x i ) ≤ t ⇐ log( ⇒ u i ≤ 1 , ( u i , 1 , x i − t ) ∈ K exp , i = 1 , . . . , n . i 11 / 29

Examples of power cones The power cone models many quadratic cone examples more succinctly. • Powers: t ≥ | x | p ( t , 1 , x ) ∈ P 1 / p ⇐ ⇒ • p -norm cones ( p > 1): � r i = t , ( r i , t , x i ) ∈ P 1 / p , i = 1 , . . . , n . t ≥ � x � p ⇐ ⇒ 12 / 29

A logistic regression example Given n binary training-points { ( x i , y i ) } in R d +1 , we want to determine the classifier 1 h θ ( x ) = 1 + exp( − θ T x ) . Training with 2 n exponential cones: � minimize t i + F · |{ j | θ j � = 0 }| i t i ≥ log(1 + exp( − θ T x i )) , subject to y i = 1 , t i ≥ log(1 + exp( θ T x i )) , y i = 0 , Some authors consider simultaneous Feature selection [9], giving rise to additional d binary variables! 13 / 29

A logistic regression example (cont.) # t >= log(1 + exp(x)) def softplus(M, t, x): aux = M.variable(2) M.constraint(Expr.sum(aux), Domain.lessThan(1.0)) M.constraint(Expr.hstack(aux, Expr.constTerm(2, 1.0), Expr.vstack(Expr.sub(x,t), Expr.neg(t))), Domain.inPExpCone()) # Model logistic regression def logisticRegression(X, y, F=1.0, bigM=100): n, d = X.shape M = Model() theta = M.variable(d) t = M.variable(n) z = M.variable(d, Domain.binary()) # objective M.objective(ObjectiveSense.Minimize, Expr.add(Expr.sum(t), Expr.mul(F, Expr.sum(z)))) for i in range(n): # 2n cone constraints dot = Expr.dot(X[i], theta) softplus(M, t.index(i), Expr.neg(dot)) if y[i] == 1 else softplus(M, t.index(i), dot) for j in range(d): # 2d bigM constraints M.constraint(Expr.dot([1.0, bigM], Expr.vstack(theta.index(j), z.index(j))), Domain.greaterThan(0.0)) M.constraint(Expr.dot([-1.0, bigM], Expr.vstack(theta.index(j), z.index(j))), Domain.greaterThan(0.0)) return M, theta, z 14 / 29

A logistic regression example (cont.) Problem Objective sense : min Type : CONIC (conic optimization problem) Constraints : 882 Cones : 236 Scalar variables : 1118 Matrix variables : 0 Integer variables : 28 Optimizer started. Mixed integer optimizer started. Threads used: 20 Presolve started. Presolve terminated. Time = 0.02 Presolved problem: 764 variables, 292 constraints, 3885 non-zeros Presolved problem: 0 general integer, 28 binary, 736 continuous Clique table size: 0 BRANCHES RELAXS ACT_NDS DEPTH BEST_INT_OBJ BEST_RELAX_OBJ REL_GAP(%) TIME 0 1 1 0 1.2123260449e+02 9.8928494362e+01 18.40 0.1 0 1 1 0 1.1848950471e+02 9.8928494362e+01 16.51 0.4 8 12 7 3 1.1848950471e+02 1.0134750080e+02 14.47 0.9 13 17 10 4 1.1669250047e+02 1.0195462270e+02 12.63 1.0 24 28 17 5 1.1669250047e+02 1.0510431665e+02 9.93 1.1 37 41 26 7 1.1669250047e+02 1.0510431665e+02 9.93 1.2 57 61 34 6 1.1669250047e+02 1.0510431665e+02 9.93 1.4 71 75 28 3 1.1669250047e+02 1.0604068619e+02 9.13 1.5 84 88 33 7 1.1606141255e+02 1.0604068619e+02 8.63 1.5 110 109 25 9 1.1606141255e+02 1.0604068619e+02 8.63 1.6 122 121 19 8 1.1589020619e+02 1.0604068619e+02 8.50 1.7 131 130 14 9 1.1428084164e+02 1.0604068619e+02 7.21 1.7 144 137 7 4 1.1370049054e+02 1.0963644001e+02 3.57 1.8 152 144 3 5 1.1174946324e+02 1.1131570072e+02 0.39 1.9 An optimal solution satisfying the relative gap tolerance of 1.00e-02(%) has been located. The relative gap is 0.00e+00(%). An optimal solution satisfying the absolute gap tolerance of 0.00e+00 has been located. The absolute gap is 0.00e+00. Objective of best integer solution : 1.117494632384e+02 Best objective bound : 1.117494632384e+02 Construct solution objective : Not employed Construct solution # roundings : 0 User objective cut value : 0 Number of cuts generated : 0 Number of branches : 155 Number of relaxations solved : 145 Number of interior point iterations: 2268 Number of simplex iterations : 0 Time spend presolving the root : 0.02 15 / 29 Mixed integer optimizer terminated. Time: 1.98

A logistic regression example (cont.) No IC, selected 28 out of 28 features Akaike IC, selected 9 out of 28 features 1.0 1.0 0.5 0.5 0.0 0.0 0.5 0.5 1.0 1.0 1.0 0.5 0.0 0.5 1.0 1.0 0.5 0.0 0.5 1.0 Bayes IC, selected 6 out of 28 features 1.0 0.5 0.0 0.5 1.0 1.0 0.5 0.0 0.5 1.0 Decision regions for different information criteria. Data lifted to the space of degree 6 polynomials. 16 / 29

The beauty of Conic Optimization In continuous optimization, conic (re-)formulations have been highly advocated for quite some time, e.g., by Nemirovski [8]. • Separation of data and structure: • Data: c , A and b . • Structure: K . • Structural convexity. • Duality (almost...). • No issues with smoothness and differentiability. We call modeling with the aforementioned 5 cones extremely disciplined convex programming : “Almost all convex constraints which arise in practice are representable by using these cones.” 17 / 29

Cones in Mixed-Integer Optimization Lubin et al. [6] show that all convex instances (333) in MINLPLIB2 are conic representable using only 4 types of cones. The exploitation of conic structures in the mixed-integer case is slightly newer, but nonetheless an active research area: • MISOCP: • Extended Formulations: Vielma et al. [10]. • Cutting planes: Andersen and Jensen [1], Kılın¸ c-Karzan and Yıldız [4], Belotti et al. [2], ... • Primal heuristics: C ¸ay, P´ olik and Terlaky [3]. • Duality: Mor´ an, Dey and Vielma [7]. • Outer approximation: Lubin [5]. • ... 18 / 29

The Mixed-integer Conic Optimizer in MOSEK 23rd International - PowerPoint PPT Presentation

The Mixed-integer Conic Optimizer in MOSEK 23rd International Symposium on Mathematical Programming, July 2nd 2018, Bordeaux Sven Wiese www.mosek.com Mixed-Integer Conic Optimization We consider problems of the form c T x minimize subject

Mixed-integer conic optimization and MOSEK Dagstuhl seminar on MINLP, February 20th 2018 Sven

Mixed-integer conic optimization and MOSEK Dagstuhl seminar on MINLP, February 20th 2018 Sven

On the Linear Algebra Employed in the MOSEK Conic Optimizer Monday Jul 13 Erling D. Andersen

On recent improvements in MOSEK Erling D. Andersen MOSEK ApS, Fruebjergvej 3, Box 16, 2100

Tour de MOSEK 7: The short version Erling D. Andersen MOSEK ApS, Fruebjergvej 3, Box 16, 2100

Solving conic optimization problems using MOSEK December 16th 2017 e.d.andersen@mosek.com

On Using MOSEK to Solve The MOSEK solvers Large-Scale Linear and Conic Optimization Problems

An overview of MOSEK Erling D. Andersen MOSEK ApS Fruebjergvej 3, Symbion Science park, Box 16

Extending MOSEK with exponential cones ISMP Bordeaux 2018 joachim.dahl@mosek.com www.mosek.com

What is Mosek up to January 15, 2019 Erling D. Andersen www.mosek.com Mosek A software

MOSEK version 9 (work in progress) July 4, 2018 Erling D. Andersen www.mosek.com Mosek A

The Feasibility Pump heuristic for Mixed-Integer Conic Programming Workshop on Discrepancy Theory

On Recent Improvements in the Interior-Point Optimizer in MOSEK ISMP2015 14 July 2015

Recent Advances In The New Mosek 8 16-November-2016 Erling D. Andersen www.mosek.com

Exponential cone in MOSEK ISMP2018, Relative Entropy Optimization, 6 July 2018 Micha l

The R-to-MOSEK Optimization Interface Henrik Alsing Friberg MOSEK ApS, Fruebjergvej 3, Box 16,

Symbolic Model Checking Binary Decision Diagrams 2 Combinatorial Circuits 3 Eight Queen

Chapter 27 Entropy, Randomness, and Information CS 573: Algorithms, Fall 2013 December 5, 2013

Binary Classification Many slides attributable to: Prof. Mike Hughes Erik Sudderth (UCI) Finale

An introduction to R: Organisation and Basics of Algorithmics emie Becker, Sonja Grath & Dirk

R04 - Regression with Categorical Explanatory Variables STAT 587 (Engineering) Iowa State

Measuring Inequality by Asset Indices: The case of South Africa Martin Wittenberg and Murray

Semantics of Probabilistic and Differential Programming Workshop on program transformations at

Geometric Whitney problem and inverse problems Matti Lassas in collaboration with Charles

The Mixed-integer Conic Optimizer in MOSEK 23rd International - PowerPoint PPT Presentation

The Mixed-integer Conic Optimizer in MOSEK 23rd International Symposium on Mathematical Programming, July 2nd 2018, Bordeaux Sven Wiese www.mosek.com Mixed-Integer Conic Optimization We consider problems of the form c T x minimize subject

Mixed-integer conic optimization and MOSEK Dagstuhl seminar on MINLP, February 20th 2018 Sven

Mixed-integer conic optimization and MOSEK Dagstuhl seminar on MINLP, February 20th 2018 Sven

On the Linear Algebra Employed in the MOSEK Conic Optimizer Monday Jul 13 Erling D. Andersen

On recent improvements in MOSEK Erling D. Andersen MOSEK ApS, Fruebjergvej 3, Box 16, 2100

Tour de MOSEK 7: The short version Erling D. Andersen MOSEK ApS, Fruebjergvej 3, Box 16, 2100

Solving conic optimization problems using MOSEK December 16th 2017 e.d.andersen@mosek.com

On Using MOSEK to Solve The MOSEK solvers Large-Scale Linear and Conic Optimization Problems

An overview of MOSEK Erling D. Andersen MOSEK ApS Fruebjergvej 3, Symbion Science park, Box 16

Extending MOSEK with exponential cones ISMP Bordeaux 2018 joachim.dahl@mosek.com www.mosek.com

What is Mosek up to January 15, 2019 Erling D. Andersen www.mosek.com Mosek A software

MOSEK version 9 (work in progress) July 4, 2018 Erling D. Andersen www.mosek.com Mosek A

The Feasibility Pump heuristic for Mixed-Integer Conic Programming Workshop on Discrepancy Theory

On Recent Improvements in the Interior-Point Optimizer in MOSEK ISMP2015 14 July 2015

Recent Advances In The New Mosek 8 16-November-2016 Erling D. Andersen www.mosek.com

Exponential cone in MOSEK ISMP2018, Relative Entropy Optimization, 6 July 2018 Micha l

The R-to-MOSEK Optimization Interface Henrik Alsing Friberg MOSEK ApS, Fruebjergvej 3, Box 16,

Symbolic Model Checking Binary Decision Diagrams 2 Combinatorial Circuits 3 Eight Queen

Chapter 27 Entropy, Randomness, and Information CS 573: Algorithms, Fall 2013 December 5, 2013

Binary Classification Many slides attributable to: Prof. Mike Hughes Erik Sudderth (UCI) Finale

An introduction to R: Organisation and Basics of Algorithmics emie Becker, Sonja Grath &amp; Dirk

R04 - Regression with Categorical Explanatory Variables STAT 587 (Engineering) Iowa State

Measuring Inequality by Asset Indices: The case of South Africa Martin Wittenberg and Murray

Semantics of Probabilistic and Differential Programming Workshop on program transformations at

Geometric Whitney problem and inverse problems Matti Lassas in collaboration with Charles

An introduction to R: Organisation and Basics of Algorithmics emie Becker, Sonja Grath & Dirk