optimal control lqr trajectory optimization
play

Optimal Control, LQR, Trajectory Optimization Lecture 13 What will - PowerPoint PPT Presentation

Optimal Control, LQR, Trajectory Optimization Lecture 13 What will you take home today? Intro Optimal Control Principle of Optimality Bellman Equation Deriving LQR Trajectory Optimization Paper Optimal Control and Reinforcement Learning


  1. Optimal Control, LQR, Trajectory Optimization Lecture 13

  2. What will you take home today? Intro Optimal Control Principle of Optimality Bellman Equation Deriving LQR Trajectory Optimization Paper

  3. Optimal Control and Reinforcement Learning from a unified point of view Optimal Control Problem

  4. Principle of Optimality – Example: Graph Search problem

  5. Quiz

  6. Forward Search

  7. Backward Search

  8. Principle of Optimality

  9. Problem setup System Dynamics Cost function

  10. Goal

  11. Formalize Cost-to-Go / Value function

  12. Optimal Value function = V with lowest cost

  13. Deriving the Bellman Equation

  14. Comparing Optimal Bellman and Value Function

  15. Infinite time horizon, deterministic system

  16. Infinite time horizon, deterministic system

  17. Infinite time horizon, deterministic system

  18. Finite Horizon, Stochastic system Cost function Stochastic System Dynamics

  19. Finite Horizon, Stochastic system Value function Optimal Value function Optimal Policy

  20. Finite Horizon, Stochastic system Bellman Equation Optimal Bellman Equation

  21. Infinite Horizon, Stochastic system - Combining formulation from infinite horizon - discrete system with stochastic system derivation - See lecture notes on webpage

  22. Continuous time systems Hamilton-Jacobi-Bellman Equation

  23. How do you solve these equations?

  24. Sequential Quadratic Programming

  25. Sequential Quadratic Programming

  26. Example – Newton-Raphson Method

  27. Sequential Linear Quadratic Programming

  28. SLQ Algorithm

  29. Iterative Linear Quadratic Controller: ILQC

  30. ILQC – Linearizing Dynamics

  31. ILQC - Quadratizing the cost

  32. Deriving the Value function and Bellman Equation

  33. Incremental Updates

  34. Linear Dynamical Systems, Quadratic cost – L inear Q uadratic R egulator (LQR)

  35. Linear Dynamical Systems, Quadratic cost – L inear Q uadratic R egulator (LQR)

  36. Trajectory Optimization

Recommend


More recommend