Structure of the Hessian Graham C. Goodwin September 2004 Centre - PowerPoint PPT Presentation

Structure of the Hessian Graham C. Goodwin September 2004 Centre for Complex Dynamic Systems and Control

1. Introduction We saw in earlier lectures that a core ingredient in quadratic constrained optimisation problems is the Hessian matrix H . So far we have simply given an “in principle” approach to the evaluation of this matrix. That is, for a system with state equations x k + 1 = Ax k + Bu k , we can compute the Hessian as H = Γ  Q Γ + R , where Γ , Q , R . Centre for Complex Dynamic Systems and Control

This will be satisfactory for simple problems. However, for more complex problems (for example, high order systems or problems having mixed stable and unstable modes) this “brute force” approach may fail. A hint as to the source of the difficulties is that the direct way of computing the Hessian depends on powers of the system matrix A . Clearly, if the system has unstable modes, then some entries of Γ will diverge as N increases. Centre for Complex Dynamic Systems and Control

We will show in this lecture that this problem can be resolved by focusing attention on the stable and unstable parts of the system separately. Centre for Complex Dynamic Systems and Control

2. Separating Stable and Unstable Modes Our eventual goal in this lecture is to gain a better understanding of the structure of the Hessian matrix particularly for large optimisation horizons. However, as mentioned above, the straightforward approach to evaluating the Hessian will often meet difficulties for open loop unstable plants due to exponential divergence of the system impulse response. Centre for Complex Dynamic Systems and Control

One way of addressing this problem is to recognise that there is an intimate connection between “stability” and “causality.” In particular, a system having all modes unstable becomes stable if viewed in reverse time, that is, as an anti-causal system. This line of reasoning leads to an alternative viewpoint in which unstable modes are treated differently. We show below that this leads to an equivalent problem formulation with a different Hessian having different properties. Centre for Complex Dynamic Systems and Control

Consider a discrete time linear system and suppose that it has no eigenvalues on the unit circle. We can then partition the state vector x k ∈ R n as � � x s k x k = , x u k where the states x s k and x u k are associated with the stable and unstable modes, respectively. Centre for Complex Dynamic Systems and Control

We can then factor the state equations into stable and unstable parts as follows: � � � � � � � � x s x s A s 0 B s k + 1 k = + u k , (1) x u x u 0 A u B u k + 1 k � � � x s � k y k = C x k = C s C u , x u k where u k ∈ R m and y k ∈ R p ( p ≥ m ) . The eigenvalues of A s have moduli less than one, and the eigenvalues of A u have moduli greater than one. Centre for Complex Dynamic Systems and Control

We can then express the solution of the system equations as k − 1 � A k − 1 − j x s k = A k s x s 0 + B s u j for k = 1 , . . . , N , (2) s j = 0 N − 1 � k = A − ( N − k ) A k − 1 − j x u µ − B u u j for k = 0 , . . . , N − 1 . (3) u u j = k x u N � µ, Centre for Complex Dynamic Systems and Control

We then need an equality constraint that both µ and the sequence of control signals { u 0 , . . . , u N − 1 } need to satisfy in order to bring the unstable states back to their correct initial values, that is, N − 1 � A − j − 1 A − N B u u j = x u u µ − 0 . (4) u j = 0 Centre for Complex Dynamic Systems and Control

We are thus led to the following equivalent statement of the optimisation problem. P N ( x ) : V  N ( x ) � min V N ( { x k } , { u k } , µ ) , (5) subject to: � � x s k x k = for k = 0 , . . . , N , x u k k − 1 � A k − 1 − j x s k = A k s x s 0 + B s u j for k = 1 , . . . , N , s j = 0 N − 1 � x u k = A − ( N − k ) A k − 1 − j µ − B u u j for k = 0 , . . . , N − 1 , u u j = k (6) Centre for Complex Dynamic Systems and Control

� � x s x 0 = 0 = x , x u 0 u k ∈ U for k = 0 , . . . , N − 1 , x k ∈ X for k = 0 , . . . , N − 1 , � � x s N x N = ∈ X f ⊂ X , µ where N − 1 � �  � � x s x s V N ( { x k } , { u k } , µ ) � 1 + 1 � N N ( x  k Qx k + u  P k Ru k ) . (7) 2 µ µ 2 k = 0 Centre for Complex Dynamic Systems and Control

The above formulation of the problem, at the expense of the introduction of the additional optimisation variable µ , avoids exponentially diverging terms in the computation of the Hessian matrix. Thus, at least intuitively, it would seem to be more apposite for studying the structure of the problem for large horizons. Centre for Complex Dynamic Systems and Control

3. Numerical Issues in the Computation of the Hessian We represent the time evolution of the system output using the usual vector notation. Thus, let �  , � y = y  y  . . . y  1 2 N (8) �  . � u = u  u  u  . . . 0 1 N − 1 Centre for Complex Dynamic Systems and Control

y = ( Γ s + Γ u ) u + Ω s x s 0 + Ω u µ, (9) � �� ¯ Γ where     C s A s C s B s 0 · · · 0             C s A 2    C s A s B s C s B s 0  · · ·      s        Ω s =   , Γ s =   , . . . .    ...      . . . .      .   . . .                  C s A N C s A N − 1 C s A N − 2 B s B s · · · C s B s s s s Centre for Complex Dynamic Systems and Control

 C u A − ( N − 1 )   C u A − ( N − 1 )  C u A − 1 C u A − 2 0 u B u u B u . . . B u     u u          C u A − ( N − 2 )   C u A − ( N − 2 )  C u A − 1     0 0 u B u B u . . .         u u          .   . . .  ... ...     Ω u = . , Γ u = − . . .         . . . .                 C u A − 1 C u A − 1     0 0 · · · 0 u B u         u             C u 0 0 0 0 · · · Centre for Complex Dynamic Systems and Control

The matrix ¯ Γ � Γ s + Γ u has the form  ¯ ¯ ¯ ¯  h 0 h − 1 h − 2 h − ( N − 1 ) . . . . . .       ¯ ¯ ¯ ¯   h 1 h 0 h − 1 . . . . . . h − ( N − 2 )          . . .  ... ...   . . .    . . .        ¯ Γ � Γ s + Γ u =   , (10)  . . .  ... ...   . . .    . . .          . .    . .  ¯ ¯   h 0 h − 1 . .          ¯ ¯ ¯ ¯  h N − 1 h N − 2 . . . . . . h 1 h 0 Centre for Complex Dynamic Systems and Control

where { ¯ h k : k = − ( N − 1 ) , . . . , N − 1 } , is a finite subsequence of the infinite sequence { ¯ h k : k = −∞ , . . . , ∞} defined by { ¯ h k : k = 0 , . . . , ∞} � { C s B s , C s A s B s , C s A 2 s B s , . . . } , (11) { ¯ h k : k = − 1 , . . . , −∞} � {− C u A − 1 u B u , − C u A − 2 u B u , − C u A − 3 u B u , . . . } . (12) Centre for Complex Dynamic Systems and Control

Consider, P = Q and Q = C  C and R = ρ I m > 0 . V N as follows 1 : V N ( x , u , y , µ ) = 1 2 ( x  Qx + y  y + ρ u  u ) . 1 We keep the function V N but change its arguments as appropriate. Centre for Complex Dynamic Systems and Control

V N ( x , u , µ ) = 1 � � ¯ �  � ¯ � � Γ u + Ω s x s Γ u + Ω s x s + ρ u  u x  Qx + 0 + Ω u µ 0 + Ω u µ 2 = 1 Γ  � � Γ  ¯ Γ + ρ I ] u + u  ¯ 2 u  [¯ Ω s x s 0 + Ω u µ + 1 � �  � � Ω s x s Ω s x s 0 + Ω u µ 0 + Ω u µ . (13) 2 Centre for Complex Dynamic Systems and Control

With respect to the new optimisation variables ( u , µ ) , the modified Hessian of the quadratic objective function is � ¯ Γ  ¯ � ¯ Γ + ρ I Γ  Ω u H ′ � . (14) u ¯ Ω  Γ Ω  u Ω u Centre for Complex Dynamic Systems and Control

For future use, we extract the left-upper submatrix, which we call the “regularised sub-Hessian”: Γ  ¯ H N � ¯ ¯ Γ + ρ I . (15) Centre for Complex Dynamic Systems and Control

The above problem formulation has the ability to ameliorate the numerical difficulties encountered when dealing with unstable plants. Indeed, we see that all terms depend only on exponentially decaying quantities. Centre for Complex Dynamic Systems and Control

Structure of the Hessian Graham C. Goodwin September 2004 Centre - PowerPoint PPT Presentation

Structure of the Hessian Graham C. Goodwin September 2004 Centre for Complex Dynamic Systems and Control 1. Introduction We saw in earlier lectures that a core ingredient in quadratic constrained optimisation problems is the Hessian matrix H .

Twisted Hessian curves cr.yp.to/papers.html#hessian Daniel J. Bernstein University of Illinois

Twisted Hessian curves 1986 ChudnovskyChudnovsky, Sequences of numbers

HESSIAN vs OFFSET method PDF4LHC F b PDF4LHC February 2008 2008 A M Cooper-Sarkar Comparisons

Reduced-Hessian Methods for Constrained Optimization Philip E. Gill University of California,

c cientific The Hessian Module omputing Current status and future perspectives

Koszul/Souriau Fisher Metric Spaces & Optimization by Maximum Entropy: Hessian Information

EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis Chaoqi Wang , Roger Grosse,

A Hybrid Variational-Ensemble Data Assimilation Method with an Implicit Optimal Hessian

RESILIENCE IN CIVIL SOCIETY: Chris Hessian, RSW Art Fisher ACHIEVING STRUCTURAL CHANGE THROUGH

Games for eigenvalues of the Hessian and concave/convex envelopes. Julio D. Rossi (joint work

An Investigation into Neural Net Optimization via Hessian Eigenvalue Density Behrooz Ghorbani

Partial-Hessian Strategies for Fast Learning of Nonlinear Embeddings Max Vladymyrov and Miguel

Hessian Aided Policy Gradient Z. Shen 1 , A. Ribeiro 2 , H. Hassani 2 , H. Qian 1 , C. Mi 1 1

Partial-Hessian Strategies for Fast Learning of Nonlinear Embeddings Max Vladymyrov and Miguel

Hessian-based sampling in high dimensions for goal-oriented model order reduction Peng Chen Omar

STRUCTURE STRUCTURE Highlight the structure of Highlight the structure of material material

Inverse Kinematics (part 2) CSE169: Computer Animation Instructor: Steve Rotenberg UCSD, Winter

Elements of differential calculus and optimization. Joan Alexis Glaun` es October 24, 2019 1/29

Local Universality of Repulsive Particle Systems and Random Matrices Friedrich Gtze joint with

Interpolating Curves Intro to curve interpolation & approximation Polynomial

Numerical Computation Sargur N. Srihari srihari@cedar.buffalo.edu This is part of lecture slides

Computational Optimization Convexity and Unconstrained Optimization 1/29/08 and 2/1(revised)

Survival Analysis Objective : to establish a connection between a set of features and the time

Basics of Numerical Optimization: Preliminaries Ju Sun Computer Science & Engineering

Sambuz

Useful Links

Newsletter

Mail Us

Structure of the Hessian Graham C. Goodwin September 2004 Centre - PowerPoint PPT Presentation

Structure of the Hessian Graham C. Goodwin September 2004 Centre for Complex Dynamic Systems and Control 1. Introduction We saw in earlier lectures that a core ingredient in quadratic constrained optimisation problems is the Hessian matrix H .

Twisted Hessian curves cr.yp.to/papers.html#hessian Daniel J. Bernstein University of Illinois

Twisted Hessian curves 1986 ChudnovskyChudnovsky, Sequences of numbers

HESSIAN vs OFFSET method PDF4LHC F b PDF4LHC February 2008 2008 A M Cooper-Sarkar Comparisons

Reduced-Hessian Methods for Constrained Optimization Philip E. Gill University of California,

c cientific The Hessian Module omputing Current status and future perspectives

Koszul/Souriau Fisher Metric Spaces &amp; Optimization by Maximum Entropy: Hessian Information

EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis Chaoqi Wang , Roger Grosse,

A Hybrid Variational-Ensemble Data Assimilation Method with an Implicit Optimal Hessian

RESILIENCE IN CIVIL SOCIETY: Chris Hessian, RSW Art Fisher ACHIEVING STRUCTURAL CHANGE THROUGH

Games for eigenvalues of the Hessian and concave/convex envelopes. Julio D. Rossi (joint work

An Investigation into Neural Net Optimization via Hessian Eigenvalue Density Behrooz Ghorbani

Partial-Hessian Strategies for Fast Learning of Nonlinear Embeddings Max Vladymyrov and Miguel

Hessian Aided Policy Gradient Z. Shen 1 , A. Ribeiro 2 , H. Hassani 2 , H. Qian 1 , C. Mi 1 1

Partial-Hessian Strategies for Fast Learning of Nonlinear Embeddings Max Vladymyrov and Miguel

Hessian-based sampling in high dimensions for goal-oriented model order reduction Peng Chen Omar

STRUCTURE STRUCTURE Highlight the structure of Highlight the structure of material material

Inverse Kinematics (part 2) CSE169: Computer Animation Instructor: Steve Rotenberg UCSD, Winter

Elements of differential calculus and optimization. Joan Alexis Glaun` es October 24, 2019 1/29

Local Universality of Repulsive Particle Systems and Random Matrices Friedrich Gtze joint with

Interpolating Curves Intro to curve interpolation &amp; approximation Polynomial

Numerical Computation Sargur N. Srihari srihari@cedar.buffalo.edu This is part of lecture slides

Computational Optimization Convexity and Unconstrained Optimization 1/29/08 and 2/1(revised)

Survival Analysis Objective : to establish a connection between a set of features and the time

Basics of Numerical Optimization: Preliminaries Ju Sun Computer Science &amp; Engineering

Sambuz

Useful Links

Newsletter

Mail Us

Koszul/Souriau Fisher Metric Spaces & Optimization by Maximum Entropy: Hessian Information

Interpolating Curves Intro to curve interpolation & approximation Polynomial

Basics of Numerical Optimization: Preliminaries Ju Sun Computer Science & Engineering