transporting iterative algorithms from euclidean space to
play

Transporting iterative algorithms from Euclidean space to manifolds - PowerPoint PPT Presentation

Transporting iterative algorithms from Euclidean space to manifolds Jochen Trumpf Jochen.Trumpf@anu.edu.au Department of Information Engineering Research School of Information Sciences and Engineering The Australian National University and


  1. Transporting iterative algorithms from Euclidean space to manifolds Jochen Trumpf Jochen.Trumpf@anu.edu.au Department of Information Engineering Research School of Information Sciences and Engineering The Australian National University and National ICT Australia Ltd. Transporting iterative algorithms from Euclidean space to manifolds – p. 1/18

  2. overview the Newton iteration Transporting iterative algorithms from Euclidean space to manifolds – p. 2/18

  3. overview the Newton iteration parametrisations Transporting iterative algorithms from Euclidean space to manifolds – p. 2/18

  4. overview the Newton iteration parametrisations the new algorithm Transporting iterative algorithms from Euclidean space to manifolds – p. 2/18

  5. overview the Newton iteration parametrisations the new algorithm convergence properties Transporting iterative algorithms from Euclidean space to manifolds – p. 2/18

  6. overview the Newton iteration parametrisations the new algorithm convergence properties an example Transporting iterative algorithms from Euclidean space to manifolds – p. 2/18

  7. overview the Newton iteration parametrisations the new algorithm convergence properties an example general iterates Transporting iterative algorithms from Euclidean space to manifolds – p. 2/18

  8. overview the Newton iteration parametrisations the new algorithm convergence properties an example general iterates joint work with J. Manton Transporting iterative algorithms from Euclidean space to manifolds – p. 2/18

  9. the Newton iteration Newton’s method x k +1 = x k − { Hess f ( x k ) } − 1 grad f ( x k ) , x 0 ∈ R n is an iteration x 0 ∈ R n x k +1 = N( f )( x k ) , which is defined for any twice differentiable function f : R n − → R . Transporting iterative algorithms from Euclidean space to manifolds – p. 3/18

  10. the Newton iteration The sequence x k = { N( f ) } k ( x 0 ) it generates converges locally quadratic to non-degenerate critical points of f . Transporting iterative algorithms from Euclidean space to manifolds – p. 4/18

  11. the Newton iteration The sequence x k = { N( f ) } k ( x 0 ) it generates converges locally quadratic to non-degenerate critical points of f . In particular, it converges locally to any (isolated) strict local maximum of f . Transporting iterative algorithms from Euclidean space to manifolds – p. 4/18

  12. parametrisations Sometimes a to be maximised function is not naturally defined on an R n but rather on some smooth manifold (curved space), e.g. the sphere. Transporting iterative algorithms from Euclidean space to manifolds – p. 5/18

  13. parametrisations Sometimes a to be maximised function is not naturally defined on an R n but rather on some smooth manifold (curved space), e.g. the sphere. One description of manifolds is that they look locally like an R n . This means that the manifold can be covered by a collection of subsets for each of which there is a homeomorphism (coordinate chart) onto an open set in R n . Transporting iterative algorithms from Euclidean space to manifolds – p. 5/18

  14. parametrisations Sometimes a to be maximised function is not naturally defined on an R n but rather on some smooth manifold (curved space), e.g. the sphere. One description of manifolds is that they look locally like an R n . This means that the manifold can be covered by a collection of subsets for each of which there is a homeomorphism (coordinate chart) onto an open set in R n . The whole atlas has to fit nicely together, i.e. via diffeomorphisms in overlapping regions. Transporting iterative algorithms from Euclidean space to manifolds – p. 5/18

  15. parametrisations This implies that for each point p of the manifold M there exists a local parametrisation , i.e. a smooth injective map µ p : R n − → M, µ p (0) = p Transporting iterative algorithms from Euclidean space to manifolds – p. 6/18

  16. parametrisations This implies that for each point p of the manifold M there exists a local parametrisation , i.e. a smooth injective map µ p : R n − → M, µ p (0) = p We consider the special case where µ p varies locally smoothly with the base point, which might only be possible in a small neighborhood of a given point p ∗ (hedgehog theorem). Transporting iterative algorithms from Euclidean space to manifolds – p. 6/18

  17. parametrisations Take e.g. the sphere and the operation of the special orthogonal group on it φ : SO ( n + 1) × S n − → S n , ( Q, p ) �→ Qp and consider the exponential map exp : so ( n + 1) − → SO ( n + 1) , Ω �→ exp Ω . Transporting iterative algorithms from Euclidean space to manifolds – p. 7/18

  18. parametrisations It can be shown that φ (exp( . ) , p ∗ ) : so ( n + 1) − → S n is locally injective around 0 when restricted to the subspace     0 Z  | Z ∈ R k × ( n − k )    . − Z ⊤  0  This defines a local parametrisation µ p ∗ which can be “moved around” S n by applying φ . Transporting iterative algorithms from Euclidean space to manifolds – p. 8/18

  19. the new algorithm Let µ p and ν p be two families of local parametrisations and consider the iteration p k +1 = ν p k (N( f ◦ µ p k )(0)) , p 0 ∈ M which is defined for every twice differentiable function f : M − → R . Transporting iterative algorithms from Euclidean space to manifolds – p. 9/18

  20. the new algorithm Let µ p and ν p be two families of local parametrisations and consider the iteration p k +1 = ν p k (N( f ◦ µ p k )(0)) , p 0 ∈ M which is defined for every twice differentiable function f : M − → R . Note that for M = R n and ν p = µ p the obvious parametrisation x �→ p + x this is the standard Newton method. Transporting iterative algorithms from Euclidean space to manifolds – p. 9/18

  21. convergence properties Theorem: If µ p and ν p are smooth around a non-degenerate critical point p ∗ of f and if moreover µ ′ p ∗ (0) = ν ′ p ∗ (0) then our algorithm converges locally quadratic to p ∗ . Transporting iterative algorithms from Euclidean space to manifolds – p. 10/18

  22. convergence properties Theorem: If µ p and ν p are smooth around a non-degenerate critical point p ∗ of f and if moreover µ ′ p ∗ (0) = ν ′ p ∗ (0) then our algorithm converges locally quadratic to p ∗ . In general, nothing is said (and known) about global convergence. Transporting iterative algorithms from Euclidean space to manifolds – p. 10/18

  23. an example Consider a real symmetric n × n matrix N with eigenvalues λ 1 ≥ · · · ≥ λ k > λ k +1 ≥ · · · ≥ λ n . Its k -dimensional principal eigenspace is the subspace spanned by the eigenvectors to λ 1 , . . . , λ k . Transporting iterative algorithms from Euclidean space to manifolds – p. 11/18

  24. an example Consider a real symmetric n × n matrix N with eigenvalues λ 1 ≥ · · · ≥ λ k > λ k +1 ≥ · · · ≥ λ n . Its k -dimensional principal eigenspace is the subspace spanned by the eigenvectors to λ 1 , . . . , λ k . Consider the function (generalised Rayleigh quotient) → R , [ X ] �→ tr X ⊤ NX f : Grass( k, n ) − Transporting iterative algorithms from Euclidean space to manifolds – p. 11/18

  25. an example µ p is given by      I p =  Q   0       0 Z  I µ p ( Z ) =  Q exp − Z ⊤     0 0 where Q ∈ O ( n ) and Z is k × ( n − k ) . Transporting iterative algorithms from Euclidean space to manifolds – p. 12/18

  26. an example Then        I 0  0 − N 12  =  Q ⊤ NQ, grad( f ◦ µ p )(0) = N ⊤   0 0 0 12 Transporting iterative algorithms from Euclidean space to manifolds – p. 13/18

  27. an example Then        I 0  0 − N 12  =  Q ⊤ NQ, grad( f ◦ µ p )(0) = N ⊤   0 0 0 12 and   0 ZN 22 − N 11 Z Hess( f ◦ µ p )(0) Z = Z ⊤ N 11 − N 12 Z ⊤   0 Transporting iterative algorithms from Euclidean space to manifolds – p. 13/18

  28. an example So computing N( f ◦ µ p )(0) amounts to solving the Sylvester equation N 11 Z − ZN 22 = − N 12 Transporting iterative algorithms from Euclidean space to manifolds – p. 14/18

  29. an example So computing N( f ◦ µ p )(0) amounts to solving the Sylvester equation N 11 Z − ZN 22 = − N 12 This Z could than be plugged into       0 Z  I ν p ( Z ) =  Q exp − Z ⊤     0 0 to get a new Q . Transporting iterative algorithms from Euclidean space to manifolds – p. 14/18

  30. an example It’s much better though to use an orthogonal projection onto O ( n ) instead by computing a QR -decomposition of    I  = Q Z R − Z ⊤ and to use QQ Z as the new Q . Transporting iterative algorithms from Euclidean space to manifolds – p. 15/18

  31. general iterates Replacing the Newton iteration N( f ) : R n − → R n by any other iteration G( f ) : R n − → R n that is locally order q converging to non-degenerate critical points of f , we can derive sufficient conditions on a family µ p of local parametrisations that guarantee local order q convergence of the “transported algorithm” p k +1 = µ p k (G( f ◦ µ p k )(0)) , p 0 ∈ M Transporting iterative algorithms from Euclidean space to manifolds – p. 16/18

Recommend


More recommend