Optimization
Optimization Goal: Find the minimizer $x^*$ that minimizes the objective (cost) function $f(x): \mathbb{R}^n \rightarrow \mathbb{R}$

Unconstrained Optimization
$f(x^*) = \min_x f(x)$

Constrained Optimization
$f(x^*) = \min_x f(x)$
s.t. $g(x) = 0$ (equality constraints)
     $h(x) \le 0$ (inequality constraints)
Optimization
• What if we are looking for a maximizer $x^*$, i.e. $f(x^*) = \max_x f(x)$?
  We can instead solve the minimization problem $\min_x (-f(x))$, whose minimizer is the same $x^*$.
• What if a constraint is $g(x) > 0$?
• What if the method only handles inequality constraints?
Calculus problem: maximize the rectangle area subject to a perimeter constraint
$\max_{x \in \mathbb{R}^2} \text{area}(x)$  s.t. the perimeter is fixed
Demo: Constrained-Problem-2D
$\text{area} = x_1 x_2$
$\text{perimeter} = 2(x_1 + x_2)$
[Figure: rectangle with side lengths $x_1$ and $x_2$.]
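A minimal sketch of this constrained problem using scipy.optimize.minimize (not the course demo); the perimeter value P = 20 is an arbitrary choice for illustration:

```python
# Minimal sketch (not the course demo): maximize the rectangle area x1*x2
# subject to a fixed perimeter 2*(x1 + x2) = P, by minimizing the negative area.
# The value P = 20 is an arbitrary choice for illustration.
from scipy.optimize import minimize

P = 20.0

def neg_area(x):
    return -(x[0] * x[1])        # maximize area <=> minimize -area

perimeter_eq = {"type": "eq", "fun": lambda x: 2 * (x[0] + x[1]) - P}

res = minimize(neg_area, x0=[1.0, 9.0], constraints=[perimeter_eq])
print(res.x)                     # expected: a square, x1 = x2 = P/4 = 5
```

The solver returns the square $x_1 = x_2 = P/4$, matching the calculus solution.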
Does the solution exist? Is it a local or a global solution?
Types of optimization problems
$f(x^*) = \min_x f(x)$, where $f(x)$ is nonlinear, continuous and smooth

Gradient-free methods: evaluate $f(x)$
Gradient (first-derivative) methods: evaluate $f(x)$, $\nabla f(x)$
Second-derivative methods: evaluate $f(x)$, $\nabla f(x)$, $\nabla^2 f(x)$
Taking derivatives…
What is the optimal solution? $f(x^*) = \min_x f(x)$

(First-order) Necessary condition:
1D: $f'(x) = 0$    ND: $\nabla f(x) = 0$

(Second-order) Sufficient condition:
1D: $f''(x) > 0$    ND: the Hessian $\nabla^2 f(x) = H_f(x)$ is positive definite
Example (1D)
Consider the function $f(x) = \frac{x^4}{4} - \frac{x^3}{3} - 11x^2 + 40x$
Find the stationary points and check the sufficient condition.
[Plot of $f(x)$ for $x \in [-6, 6]$, with $f$ ranging from roughly $-200$ to $100$.]
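A quick sketch (assuming the polynomial above was recovered correctly) that finds the stationary points symbolically and applies the second-derivative test:

```python
# Sketch: find the stationary points of f(x) = x^4/4 - x^3/3 - 11x^2 + 40x
# and classify them with the second-derivative test.
import sympy as sp

x = sp.symbols('x')
f = x**4/4 - x**3/3 - 11*x**2 + 40*x

df = sp.diff(f, x)        # f'(x)  = x^3 - x^2 - 22x + 40
d2f = sp.diff(f, x, 2)    # f''(x) = 3x^2 - 2x - 22

for xs in sp.solve(sp.Eq(df, 0), x):   # stationary points: -5, 2, 4
    print(xs, d2f.subs(x, xs))         # f'' > 0 -> local minimum, f'' < 0 -> local maximum
```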
Example (ND)
Consider the function $f(x_1, x_2) = 2x_1^3 + 4x_2^2 + 2x_2 - 24x_1$
Find the stationary points and check the sufficient condition.
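A similar sketch for the ND case, assuming the function reads as reconstructed above; it computes the gradient and Hessian symbolically and checks positive definiteness at each stationary point:

```python
# Sketch for the ND example, assuming f(x1, x2) = 2*x1**3 + 4*x2**2 + 2*x2 - 24*x1.
import sympy as sp

x1, x2 = sp.symbols('x1 x2')
f = 2*x1**3 + 4*x2**2 + 2*x2 - 24*x1

grad = [sp.diff(f, v) for v in (x1, x2)]   # gradient components
H = sp.hessian(f, (x1, x2))                # Hessian matrix

# Stationary points: solve grad f = 0, then check the Hessian there
for sol in sp.solve(grad, (x1, x2), dict=True):
    Hs = H.subs(sol)
    print(sol, Hs.is_positive_definite)    # PD Hessian -> local minimum
```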
Optimization in 1D: Golden Section Search
• Similar idea to the bisection method for root finding
• Needs to bracket the minimum inside an interval
• Requires the function to be unimodal

A function $f: \mathbb{R} \rightarrow \mathbb{R}$ is unimodal on an interval $[a, b]$ if:
• There is a unique $x^* \in [a, b]$ such that $f(x^*)$ is the minimum in $[a, b]$
• For any $x_1, x_2 \in [a, b]$ with $x_1 < x_2$:
  - $x_2 < x^* \Rightarrow f(x_1) > f(x_2)$
  - $x_1 > x^* \Rightarrow f(x_1) < f(x_2)$
[Figure: bracketing the minimum with interior points $x_1$ and $x_2$. If $f(x_1) < f(x_2)$, then $x^* \in [a, x_2]$; if $f(x_1) > f(x_2)$, then $x^* \in [x_1, b]$.]
Such a method would in general require 2 new function evaluations per iteration.
How can we select the points $x_1, x_2$ such that only one new function evaluation is required?
Golden Section Search
Golden Section Search
Demo: Golden Section Proportions
What happens to the length of the interval after one iteration?
$h_1 = \tau h_0$
Or in general: $h_{k+1} = \tau h_k$
Hence the interval gets reduced by a factor $\tau$ (for the bisection method for nonlinear equations, $\tau = 0.5$).
For the recursion (reusing one interior point):
$\tau h_1 = (1 - \tau) h_0$
$\tau (\tau h_0) = (1 - \tau) h_0$
$\tau^2 = 1 - \tau \;\Rightarrow\; \tau = \frac{\sqrt{5} - 1}{2} \approx 0.618$
Golden Section Search
• Derivative-free method!
• Slow convergence: $\lim_{k \to \infty} \frac{|e_{k+1}|}{|e_k|} = 0.618$, $r = 1$ (linear convergence)
• Only one function evaluation per iteration
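A minimal golden section search sketch (not the course demo) that keeps the bracket $[a, b]$ and reuses one interior point, so only one new function evaluation is needed per iteration:

```python
# Minimal golden section search sketch: brackets a minimum of a unimodal f
# on [a, b]; one interior point is reused each iteration.
import math

def golden_section_search(f, a, b, tol=1e-8):
    tau = (math.sqrt(5) - 1) / 2          # ~0.618
    x1 = a + (1 - tau) * (b - a)
    x2 = a + tau * (b - a)
    f1, f2 = f(x1), f(x2)
    while (b - a) > tol:
        if f1 < f2:                       # minimum lies in [a, x2]
            b, x2, f2 = x2, x1, f1
            x1 = a + (1 - tau) * (b - a)
            f1 = f(x1)
        else:                             # minimum lies in [x1, b]
            a, x1, f1 = x1, x2, f2
            x2 = a + tau * (b - a)
            f2 = f(x2)
    return 0.5 * (a + b)

# Example: minimum of (x - 1.5)^2 on [0, 4]
print(golden_section_search(lambda x: (x - 1.5)**2, 0.0, 4.0))
```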
Iclicker question
Newton's Method
Using a Taylor expansion, we can approximate the function $f$ with a quadratic function about $x_0$:
$f(x) \approx f(x_0) + f'(x_0)(x - x_0) + \frac{1}{2} f''(x_0)(x - x_0)^2$
We want to find the minimum of this quadratic, so we apply the first-order necessary condition $f'(x) = 0$:
$f'(x_0) + f''(x_0)(x - x_0) = 0$
$h = x - x_0 = -\frac{f'(x_0)}{f''(x_0)}$
Note that this is the same as the step of Newton's method for solving the nonlinear equation $f'(x) = 0$.
Newton's Method
• Algorithm:
  $x_0$ = starting guess
  $x_{k+1} = x_k - f'(x_k) / f''(x_k)$
• Convergence:
  • Typical quadratic convergence
  • Local convergence (starting guess must be close to the solution)
  • May fail to converge, or converge to a maximum or point of inflection
Demo: "Newton's method in 1D" and "Newton's method Initial Guess"
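A short sketch of the 1D Newton iteration above; the example derivatives assume the reconstructed 1D polynomial from the earlier slide:

```python
# Sketch of Newton's method for 1D minimization: x_{k+1} = x_k - f'(x_k)/f''(x_k).
def newton_1d(df, d2f, x0, tol=1e-10, maxiter=50):
    x = x0
    for _ in range(maxiter):
        step = df(x) / d2f(x)     # Newton step for solving f'(x) = 0
        x = x - step
        if abs(step) < tol:
            break
    return x

# Example using the earlier 1D example f(x) = x^4/4 - x^3/3 - 11x^2 + 40x:
df  = lambda x: x**3 - x**2 - 22*x + 40
d2f = lambda x: 3*x**2 - 2*x - 22
print(newton_1d(df, d2f, x0=5.0))   # converges to the stationary point x = 4
```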
Newton’s Method (Graphical Representation)
Iclicker question
Consider the function $f(x) = 4x^3 + 2x^2 + 5x + 40$.
If we use the initial guess $x_0 = 2$, what would be the value of $x$ after one iteration of Newton's method?
A) $x_1 = 2.852$
B) $x_1 = 1.147$
C) $x_1 = 3.173$
D) $x_1 = 0.827$
E) NOTA
Optimization in ND: Steepest Descent Method
Example: $f(x_1, x_2) = (x_1 - 1)^2 + (x_2 - 1)^2$
Given a function $f(x): \mathbb{R}^n \rightarrow \mathbb{R}$ at a point $x$, the function will decrease its value in the direction of steepest descent: $-\nabla f(x)$
Iclicker question: What is the steepest descent direction?
Steepest Descent Method
$f(x_1, x_2) = (x_1 - 1)^2 + (x_2 - 1)^2$
Start with the initial guess: $x_0 = (3, 3)$
Check the update: $x_1 = x_0 - \nabla f(x_0)$, with $\nabla f(x) = \left(2(x_1 - 1),\; 2(x_2 - 1)\right)$
$x_1 = (3, 3) - (4, 4) = (-1, -1)$
How far along the gradient direction should we go?
Steepest Descent Method
$f(x_1, x_2) = (x_1 - 1)^2 + (x_2 - 1)^2$
Update the variable with: $x_{k+1} = x_k - \alpha_k \nabla f(x_k)$
How far along the gradient should we go? What is the "best size" for $\alpha_k$?
A) 0
B) 0.5
C) 1
D) 2
E) Cannot be determined
Steepest Descent Method
Algorithm:
  Initial guess: $x_0$
  Evaluate: $s_k = -\nabla f(x_k)$
  Perform a line search to obtain $\alpha_k$ (for example, Golden Section Search):
    $\alpha_k = \arg\min_{\alpha > 0} f(x_k + \alpha\, s_k)$
  Update: $x_{k+1} = x_k + \alpha_k s_k$
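A sketch of the algorithm above; here scipy's bounded scalar minimizer stands in for the golden section line search, and the search interval (0, 1) is an arbitrary choice for illustration:

```python
# Sketch of steepest descent with a line search for the step size alpha_k.
import numpy as np
from scipy.optimize import minimize_scalar

def steepest_descent(f, grad, x0, tol=1e-8, maxiter=500):
    x = np.asarray(x0, dtype=float)
    for _ in range(maxiter):
        s = -grad(x)                           # steepest descent direction
        if np.linalg.norm(s) < tol:
            break
        # line search: alpha_k = argmin_{alpha > 0} f(x_k + alpha * s_k)
        alpha = minimize_scalar(lambda a: f(x + a * s),
                                bounds=(0.0, 1.0), method='bounded').x
        x = x + alpha * s
    return x

# Example from the slides: f(x1, x2) = (x1 - 1)^2 + (x2 - 1)^2, x0 = (3, 3)
f = lambda x: (x[0] - 1)**2 + (x[1] - 1)**2
grad = lambda x: np.array([2 * (x[0] - 1), 2 * (x[1] - 1)])
print(steepest_descent(f, grad, [3.0, 3.0]))   # converges to [1, 1]
```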
Steepest Descent Method
Convergence: linear
Demo: "Steepest Descent"
Iclicker question:
Consider minimizing the function $f(x_1, x_2) = 10 x_1^3 - x_2^2 + x_1 - 1$
Given the initial guess $x_1 = 2$, $x_2 = 2$, what is the direction of the first step of gradient descent?
A) $(-61, 4)$
B) $(-61, 2)$
C) $(-120, 4)$
D) $(-121, 4)$
Newton's Method
Using a Taylor expansion, we build the approximation:
$f(x + s) \approx f(x) + \nabla f(x)^T s + \frac{1}{2} s^T H_f(x)\, s \equiv \hat{f}(s)$
We want to find the minimum of $\hat{f}(s)$, so we enforce the first-order necessary condition $\nabla \hat{f}(s) = 0$:
$\nabla \hat{f}(s) = \nabla f(x) + H_f(x)\, s = 0 \;\Rightarrow\; H_f(x)\, s = -\nabla f(x)$
which becomes a system of linear equations that we solve for the Newton step $s$.
Newton's Method
Algorithm:
  Initial guess: $x_0$
  Solve: $H_f(x_k)\, s_k = -\nabla f(x_k)$
  Update: $x_{k+1} = x_k + s_k$
Note that the Hessian is related to the curvature and therefore contains the information about how large the step should be.
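A sketch of the n-dimensional Newton iteration above, demonstrated on the ND example function as reconstructed earlier (so the gradient and Hessian below depend on that assumption):

```python
# Sketch of Newton's method in n dimensions: solve H_f(x_k) s_k = -grad f(x_k)
# (an O(n^3) linear solve), then update x_{k+1} = x_k + s_k.
import numpy as np

def newton_nd(grad, hess, x0, tol=1e-10, maxiter=50):
    x = np.asarray(x0, dtype=float)
    for _ in range(maxiter):
        s = np.linalg.solve(hess(x), -grad(x))   # Newton step s_k
        x = x + s
        if np.linalg.norm(s) < tol:
            break
    return x

# Demonstration on the (reconstructed) ND example f = 2*x1^3 + 4*x2^2 + 2*x2 - 24*x1
grad = lambda x: np.array([6 * x[0]**2 - 24, 8 * x[1] + 2])
hess = lambda x: np.array([[12 * x[0], 0.0], [0.0, 8.0]])
print(newton_nd(grad, hess, [1.0, 0.0]))         # -> approximately [2, -0.25]
```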
Iclicker question
To find a minimum of the function $f(x, y) = 3x^2 + 2y^2$, which is the expression for one step of Newton's method?
A) $\begin{pmatrix} x_{k+1} \\ y_{k+1} \end{pmatrix} = \begin{pmatrix} x_k \\ y_k \end{pmatrix} - \begin{pmatrix} 6 & 0 \\ 0 & 4 \end{pmatrix}^{-1} \begin{pmatrix} 6x_k \\ 4y_k \end{pmatrix}$
B) $\begin{pmatrix} x_{k+1} \\ y_{k+1} \end{pmatrix} = - \begin{pmatrix} 6 & 0 \\ 0 & 4 \end{pmatrix}^{-1} \begin{pmatrix} 6x_k \\ 4y_k \end{pmatrix}$
C) $\begin{pmatrix} x_{k+1} \\ y_{k+1} \end{pmatrix} = \begin{pmatrix} 6 & 0 \\ 0 & 4 \end{pmatrix} \begin{pmatrix} 6x_k \\ 4y_k \end{pmatrix}$
D) $\begin{pmatrix} x_{k+1} \\ y_{k+1} \end{pmatrix} = \begin{pmatrix} x_k \\ y_k \end{pmatrix} - \begin{pmatrix} 6 & 0 \\ 0 & 4 \end{pmatrix} \begin{pmatrix} 6x_k \\ 4y_k \end{pmatrix}$
Iclicker question:
$f(x, y) = 0.5x^2 + 2.5y^2$
When using Newton's method to find the minimizer of this function, estimate the number of iterations it would take to converge.
A) 1
B) 2-5
C) 5-10
D) More than 10
E) Depends on the initial guess
Newton's Method Summary
Algorithm:
  Initial guess: $x_0$
  Solve: $H_f(x_k)\, s_k = -\nabla f(x_k)$
  Update: $x_{k+1} = x_k + s_k$
About the method...
• Typical quadratic convergence ☺
• Needs second derivatives ☹
• Local convergence (starting guess must be close to the solution)
• Works poorly when the Hessian is nearly indefinite
• Cost per iteration: $O(n^3)$
Demo: "Newton's method in n dimensions"
Demo: ”Newton’s method in n dimensions” Example: https://en.wikipedia.org/wiki/Rosenbrock_function
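A short sketch of a second-derivative method on the Rosenbrock function via scipy (Newton-CG is used here as a stand-in for the course demo's plain Newton iteration):

```python
# Sketch: a second-derivative method on the Rosenbrock function, using scipy's
# built-in Rosenbrock helpers and the Newton-CG minimizer (gradient + Hessian).
import numpy as np
from scipy.optimize import minimize, rosen, rosen_der, rosen_hess

x0 = np.array([-1.2, 1.0])
res = minimize(rosen, x0, method='Newton-CG', jac=rosen_der, hess=rosen_hess)
print(res.x)   # converges close to the known minimizer [1, 1]
```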
Iclicker question:
Recall Newton's method and the steepest descent method for minimizing a function $f(x): \mathbb{R}^n \rightarrow \mathbb{R}$. How many of the statements below describe Newton's method only (not both methods)?
1. Convergence is linear
2. Requires a line search at each iteration
3. Evaluates the gradient of $f(x)$ at each iteration
4. Evaluates the Hessian of $f(x)$ at each iteration
5. Computational cost per iteration is $O(n^3)$
A) 1  B) 2  C) 3  D) 4  E) 5