
CS675: Convex and Combinatorial Optimization, Fall 2014: Convex Functions



  1. CS675: Convex and Combinatorial Optimization, Fall 2014. Convex Functions. Instructor: Shaddin Dughmi

  2. Outline
     (1) Convex Functions
     (2) Examples of Convex and Concave Functions
     (3) Convexity-Preserving Operations

  7. Convex Functions
     A function $f : \mathbb{R}^n \to \mathbb{R}$ is convex if the line segment between any two points on the graph of $f$ lies on or above the graph. That is, if $x, y \in \mathbb{R}^n$ and $\theta \in [0, 1]$, then
     $$f(\theta x + (1-\theta) y) \le \theta f(x) + (1-\theta) f(y)$$
     - This inequality is called Jensen's inequality (basic form).
     - $f$ is convex iff its restriction to any line $\{x + tv : t \in \mathbb{R}\}$ is convex.
     - $f$ is strictly convex if the inequality is strict whenever $x \ne y$ and $\theta \in (0, 1)$.
     - The definition is analogous when the domain of $f$ is a convex subset $D$ of $\mathbb{R}^n$.
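The definition suggests a simple numerical sanity check: sample random pairs of points and weights, and look for a violation of the basic Jensen inequality. The following is a minimal sketch using only the Python standard library; the helper name `violates_convexity` and the two test functions are illustrative and do not come from the slides. Finding no violation is only evidence of convexity, not a proof, while a single violation does disprove it.

```python
import random

def violates_convexity(f, dim, trials=10_000, scale=5.0, tol=1e-9, seed=0):
    """Search random (x, y, theta) for a violation of
    f(theta*x + (1-theta)*y) <= theta*f(x) + (1-theta)*f(y).
    Returns a violating triple, or None if none was found."""
    rng = random.Random(seed)
    for _ in range(trials):
        x = [rng.uniform(-scale, scale) for _ in range(dim)]
        y = [rng.uniform(-scale, scale) for _ in range(dim)]
        theta = rng.random()
        z = [theta * a + (1 - theta) * b for a, b in zip(x, y)]
        if f(z) > theta * f(x) + (1 - theta) * f(y) + tol:
            return x, y, theta
    return None

def sum_of_squares(v):
    """Convex: squared Euclidean norm."""
    return sum(t * t for t in v)

def neg_sum_of_squares(v):
    """Concave, hence not convex."""
    return -sum(t * t for t in v)

print(violates_convexity(sum_of_squares, dim=3) is None)          # no violation found
print(violates_convexity(neg_sum_of_squares, dim=3) is not None)  # violation found
```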

  9. Concave and Affine Functions
     A function $f : \mathbb{R}^n \to \mathbb{R}$ is concave if $-f$ is convex. Equivalently:
     - The line segment between any two points on the graph of $f$ lies on or below the graph.
     - If $x, y \in \mathbb{R}^n$ and $\theta \in [0, 1]$, then $f(\theta x + (1-\theta) y) \ge \theta f(x) + (1-\theta) f(y)$.
     $f : \mathbb{R}^n \to \mathbb{R}$ is affine if it is both concave and convex. Equivalently:
     - The line segment between any two points on the graph of $f$ lies on the graph of $f$.
     - $f(x) = a^\top x + b$ for some $a \in \mathbb{R}^n$ and $b \in \mathbb{R}$.
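As a short worked check, a function of the form $f(x) = a^\top x + b$ satisfies Jensen's inequality with equality, so it is both convex and concave. The derivation uses only $b = \theta b + (1-\theta) b$:

```latex
\begin{aligned}
f(\theta x + (1-\theta) y)
  &= a^\top\bigl(\theta x + (1-\theta) y\bigr) + b \\
  &= \theta\,(a^\top x + b) + (1-\theta)\,(a^\top y + b) \\
  &= \theta f(x) + (1-\theta) f(y).
\end{aligned}
```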

  11. We will now look at some equivalent definitions of convex functions.
     First Order Definition
     A differentiable $f : \mathbb{R}^n \to \mathbb{R}$ is convex if and only if the first-order approximation centered at any point $x$ underestimates $f$ everywhere: for all $x, y \in \mathbb{R}^n$,
     $$f(y) \ge f(x) + \nabla f(x)^\top (y - x)$$
     - Local information → global information.
     - If $\nabla f(x) = 0$, then $x$ is a global minimizer of $f$.
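The first-order condition can be probed numerically by comparing $f(y)$ against the tangent-plane estimate built at $x$. The sketch below assumes the test function $f(v) = \sum_i e^{v_i}$, chosen here only because its gradient is easy to write by hand; the names `first_order_gap` and `grad_f` are illustrative.

```python
import math
import random

def f(v):
    """Convex test function: sum of exponentials."""
    return sum(math.exp(t) for t in v)

def grad_f(v):
    """Gradient of f: the i-th partial derivative is exp(v_i)."""
    return [math.exp(t) for t in v]

def first_order_gap(x, y):
    """f(y) minus the first-order estimate f(x) + grad f(x)^T (y - x).
    Nonnegative for every x, y when f is convex."""
    linear = f(x) + sum(g * (b - a) for g, a, b in zip(grad_f(x), x, y))
    return f(y) - linear

rng = random.Random(1)
gaps = []
for _ in range(1000):
    x = [rng.uniform(-3, 3) for _ in range(4)]
    y = [rng.uniform(-3, 3) for _ in range(4)]
    gaps.append(first_order_gap(x, y))

print(min(gaps) >= -1e-9)  # the tangent plane never overestimates f
```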

  13. Second Order Definition
     A twice differentiable $f : \mathbb{R}^n \to \mathbb{R}$ is convex if and only if its derivative is nondecreasing in all directions. Formally, $\nabla^2 f(x) \succeq 0$ for all $x$.
     Interpretation
     - Recall the definition of PSD: $z^\top \nabla^2 f(x)\, z \ge 0$ for all $z \in \mathbb{R}^n$.
     - At $x + \delta z$, the infinitesimal change in the gradient is roughly in the same direction as $z$ (it has nonnegative inner product with $z$).
     - The graph of $f$ curves upwards along any line.
     - When $n = 1$, this is $f''(x) \ge 0$.
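For a function of two variables the Hessian test is easy to carry out by hand or in code. The sketch below relies on the fact that a symmetric 2x2 matrix is PSD exactly when its trace and determinant are both nonnegative. The two quadratics are examples chosen for illustration, not taken from the slides.

```python
def is_psd_2x2(h, tol=1e-12):
    """A symmetric 2x2 matrix is PSD iff trace >= 0 and determinant >= 0
    (the trace is the sum of the eigenvalues, the determinant their product)."""
    (a, b), (c, d) = h
    assert abs(b - c) <= tol, "matrix must be symmetric"
    return a + d >= -tol and a * d - b * c >= -tol

def quad_form(h, z):
    """Compute z^T h z for a 2x2 matrix h and a 2-vector z."""
    (a, b), (c, d) = h
    return a * z[0] * z[0] + (b + c) * z[0] * z[1] + d * z[1] * z[1]

hessian_f = [[2.0, 1.0], [1.0, 2.0]]  # Hessian of x^2 + x*y + y^2 (constant in x, y)
hessian_g = [[2.0, 3.0], [3.0, 2.0]]  # Hessian of x^2 + 3*x*y + y^2

print(is_psd_2x2(hessian_f))              # True: the first quadratic is convex
print(is_psd_2x2(hessian_g))              # False: the second is not
print(quad_form(hessian_g, [1.0, -1.0]))  # -2.0, a direction of negative curvature
```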

  15. Epigraph
     The epigraph of $f$ is the set of points on or above the graph of $f$. Formally,
     $$\operatorname{epi}(f) = \{ (x, t) : t \ge f(x) \}$$
     Epigraph Definition
     $f$ is a convex function if and only if its epigraph is a convex set.

  17. Jensen's Inequality (General Form)
     $f : \mathbb{R}^n \to \mathbb{R}$ is convex if and only if
     - For every $x_1, \ldots, x_k$ in the domain of $f$, and $\theta_1, \ldots, \theta_k \ge 0$ such that $\sum_i \theta_i = 1$, we have
       $$f\Bigl(\sum_i \theta_i x_i\Bigr) \le \sum_i \theta_i f(x_i)$$
     - Given a probability measure $\mathcal{D}$ on the domain of $f$, and $x \sim \mathcal{D}$,
       $$f(\mathbb{E}[x]) \le \mathbb{E}[f(x)]$$
     Adding zero-mean noise to $x$ can only increase $f(x)$ in expectation.
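The probabilistic form can be illustrated with an empirical distribution: the sample mean plays the role of $\mathbb{E}[x]$, and the uniform weights $1/k$ play the role of the $\theta_i$. This is a small sketch assuming $f = \exp$ and standard normal samples, both chosen only for illustration.

```python
import math
import random

rng = random.Random(2)
samples = [rng.gauss(0.0, 1.0) for _ in range(5000)]

def mean(values):
    """Arithmetic mean, i.e. expectation under the empirical distribution."""
    return sum(values) / len(values)

f = math.exp  # convex on the real line

lhs = f(mean(samples))               # f(E[x])
rhs = mean([f(s) for s in samples])  # E[f(x)]
print(lhs <= rhs)                    # Jensen: f(E[x]) <= E[f(x)]
```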

  19. Local and Global Optimality
     Local minimum
     $x$ is a local minimum of $f$ if there is an open ball $B$ containing $x$ such that $f(y) \ge f(x)$ for all $y \in B$.
     Local and Global Optimality
     When $f$ is convex, $x$ is a local minimum of $f$ if and only if it is a global minimum.
     This fact underlies much of the tractability of convex optimization.
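A short argument for the nontrivial direction, sketched here since the slide states the result without proof: suppose $x$ is a local minimum on the ball $B$, yet some $y$ has $f(y) < f(x)$. Move a small step from $x$ toward $y$ and apply the basic Jensen inequality:

```latex
\begin{aligned}
z_\theta &= (1-\theta)\,x + \theta\,y, \qquad \theta \in (0, 1], \\
f(z_\theta) &\le (1-\theta)\,f(x) + \theta\,f(y) \;<\; f(x).
\end{aligned}
```

For sufficiently small $\theta$ the point $z_\theta$ lies in $B$, which contradicts $f(z_\theta) \ge f(x)$. Hence no such $y$ exists and $x$ is a global minimum.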

  22. Sub-level sets
     [Figure: level sets of $f(x, y) = \sqrt{x^2 + y^2}$]
     Sublevel set
     The $\alpha$-sublevel set of $f$ is $\{ x \in \operatorname{domain}(f) : f(x) \le \alpha \}$.
     Fact
     Every sub-level set of a convex function is a convex set.
     - This fact also underlies the tractability of convex optimization.
     - Note: the converse is false, but this is nevertheless a useful check: a function with a non-convex sub-level set cannot be convex.
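To see why the converse fails, consider $f(x) = \sqrt{|x|}$ on the real line, a counterexample chosen here for illustration. Each of its sub-level sets is an interval (or empty), hence convex, but the function violates Jensen's inequality. A minimal sketch:

```python
import math

def f(x):
    """sqrt(|x|): every sublevel set is an interval, yet f is not convex."""
    return math.sqrt(abs(x))

def sublevel_interval(alpha):
    """The alpha-sublevel set {x : sqrt(|x|) <= alpha} equals [-alpha^2, alpha^2]
    for alpha >= 0 and is empty (None) otherwise; both are convex sets."""
    if alpha < 0:
        return None
    return (-alpha ** 2, alpha ** 2)

x, y, theta = 0.0, 4.0, 0.5
midpoint_value = f(theta * x + (1 - theta) * y)  # f(2) = sqrt(2)
chord_value = theta * f(x) + (1 - theta) * f(y)  # 0.5 * 0 + 0.5 * 2 = 1

print(sublevel_interval(2.0))        # (-4.0, 4.0)
print(midpoint_value > chord_value)  # True: Jensen's inequality fails
```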

  24. Other Basic Properties
     Continuity
     A convex function $f : \mathbb{R}^n \to \mathbb{R}$ is continuous. More generally, a convex function is continuous on the interior of its domain.
     Extended-value extension
     If a function $f : D \to \mathbb{R}$ is convex on its domain, and $D$ is convex, then it can be extended to a convex function on $\mathbb{R}^n$ by setting $f(x) = \infty$ whenever $x \notin D$.
     - This simplifies notation.
     - The resulting function $\tilde{f} : \mathbb{R}^n \to \mathbb{R} \cup \{\infty\}$ is "convex" with respect to the ordering on $\mathbb{R} \cup \{\infty\}$.
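Floating-point infinity makes the extended-value convention easy to try out. The sketch below assumes the example $f(x) = -\log x$ on $D = (0, \infty)$, which is not from the slides, and restricts $\theta$ to the open interval $(0, 1)$ so that the undefined product $0 \cdot \infty$ never arises.

```python
import math

def neg_log_extended(x):
    """Extended-value extension of -log x: finite on D = (0, inf), +inf elsewhere."""
    return -math.log(x) if x > 0 else math.inf

def jensen_holds(f, x, y, theta):
    """Check the basic Jensen inequality for theta strictly inside (0, 1),
    which avoids the undefined product 0 * inf."""
    assert 0.0 < theta < 1.0
    lhs = f(theta * x + (1 - theta) * y)
    rhs = theta * f(x) + (1 - theta) * f(y)
    return lhs <= rhs + 1e-12

print(jensen_holds(neg_log_extended, 1.0, 4.0, 0.3))    # both points in D
print(jensen_holds(neg_log_extended, -2.0, 4.0, 0.3))   # right-hand side is +inf
print(jensen_holds(neg_log_extended, -2.0, -1.0, 0.5))  # inf <= inf
```

All three checks succeed: whenever an endpoint lies outside $D$, the right-hand side is $+\infty$ and the inequality holds trivially, which is why no separate case analysis on the domain is needed.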

  25. Outline
     (1) Convex Functions
     (2) Examples of Convex and Concave Functions
     (3) Convexity-Preserving Operations

  26. Functions on the reals
     - Affine: $ax + b$
     - Exponential: $e^{ax}$ is convex for any $a \in \mathbb{R}$
     - Powers: $x^a$ is convex on $\mathbb{R}_{++}$ when $a \ge 1$ or $a \le 0$, and concave for $0 \le a \le 1$
     - Logarithm: $\log x$ is concave on $\mathbb{R}_{++}$
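Each of these one-dimensional claims follows from the second-order condition $f''(x) \ge 0$ (or $\le 0$ for concavity). A brief worked check:

```latex
\begin{aligned}
\frac{d^2}{dx^2}\, e^{ax} &= a^2 e^{ax} \;\ge\; 0
  \quad \text{for all } a, x \in \mathbb{R}, \\[4pt]
\frac{d^2}{dx^2}\, x^{a} &= a(a-1)\, x^{a-2}
  \quad \text{for } x > 0, \text{ which is } \ge 0 \text{ iff } a \ge 1 \text{ or } a \le 0,
  \text{ and } \le 0 \text{ iff } 0 \le a \le 1, \\[4pt]
\frac{d^2}{dx^2}\, \log x &= -\frac{1}{x^2} \;<\; 0
  \quad \text{for } x > 0.
\end{aligned}
```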

  28. Norms
     Norms are convex:
     $$\|\theta x + (1-\theta) y\| \le \|\theta x\| + \|(1-\theta) y\| = \theta \|x\| + (1-\theta) \|y\|$$
     - Uses both norm axioms: the triangle inequality and homogeneity.
     - Applies to matrix norms, such as the spectral norm (radius of the induced ellipsoid).
     Max
     $\max_i x_i$ is convex:
     $$\max_i\, (\theta x + (1-\theta) y)_i = \max_i\, (\theta x_i + (1-\theta) y_i) \le \max_i \theta x_i + \max_i\, (1-\theta) y_i = \theta \max_i x_i + (1-\theta) \max_i y_i$$
     If I am allowed to pick the maximum entry of $\theta x$ and of $(1-\theta) y$ independently, I can only do better.
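Both derivations can be spot-checked by measuring the Jensen gap, the chord value minus the function value, on random inputs. The sketch below uses the $\ell_1$ norm as a representative norm and Python's built-in `max`; the helper `jensen_gap` is an illustrative name.

```python
import random

def l1_norm(v):
    """Sum of absolute values of the entries."""
    return sum(abs(t) for t in v)

def jensen_gap(f, x, y, theta):
    """Chord value minus function value; nonnegative when f is convex."""
    z = [theta * a + (1 - theta) * b for a, b in zip(x, y)]
    return theta * f(x) + (1 - theta) * f(y) - f(z)

rng = random.Random(3)
worst = {"l1_norm": float("inf"), "max": float("inf")}
for _ in range(2000):
    x = [rng.uniform(-10, 10) for _ in range(5)]
    y = [rng.uniform(-10, 10) for _ in range(5)]
    theta = rng.random()
    worst["l1_norm"] = min(worst["l1_norm"], jensen_gap(l1_norm, x, y, theta))
    worst["max"] = min(worst["max"], jensen_gap(max, x, y, theta))

print(all(gap >= -1e-9 for gap in worst.values()))        # no violation found
print(jensen_gap(max, [1.0, 0.0], [0.0, 1.0], 0.5))       # 0.5
```

The last line shows a case where the inequality for max is strict: the maxima of the two vectors sit in different coordinates, so picking them independently does strictly better than picking one coordinate of the average.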

  30. More examples
     - Log-sum-exp: $\log(e^{x_1} + e^{x_2} + \ldots + e^{x_n})$ is convex
     - Geometric mean: $\left(\prod_{i=1}^n x_i\right)^{1/n}$ is concave (on $\mathbb{R}^n_{++}$)
     - Log-determinant: $\log \det X$ is concave (on positive definite matrices)
     - Quadratic form: $x^\top A x$ is convex iff $A \succeq 0$ (for symmetric $A$)
     - Other examples in book
     [Figure: $f(x, y) = \log(e^x + e^y)$]
     Proving convexity often comes down to case-by-case reasoning, involving:
     - Definition: restrict to a line and check Jensen's inequality
     - Write down the Hessian and prove it is PSD
     - Express as a combination of other convex functions through convexity-preserving operations (next)
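As an illustration of the first strategy, the sketch below evaluates log-sum-exp in a numerically stable way (subtracting the maximum before exponentiating, a standard trick that is not part of the slides) and checks Jensen's inequality on random segments. A passing check is supporting evidence only; the proof goes through the Hessian.

```python
import math
import random

def log_sum_exp(v):
    """Numerically stable log(sum_i exp(v_i)): subtract the max before exponentiating."""
    m = max(v)
    return m + math.log(sum(math.exp(t - m) for t in v))

rng = random.Random(4)
min_gap = float("inf")
for _ in range(2000):
    x = [rng.uniform(-20, 20) for _ in range(3)]
    y = [rng.uniform(-20, 20) for _ in range(3)]
    theta = rng.random()
    z = [theta * a + (1 - theta) * b for a, b in zip(x, y)]
    gap = theta * log_sum_exp(x) + (1 - theta) * log_sum_exp(y) - log_sum_exp(z)
    min_gap = min(min_gap, gap)

print(min_gap >= -1e-9)                # no Jensen violation found
print(log_sum_exp([1000.0, 1000.0]))   # finite, close to 1000 + log 2
```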
