

  1. Linear Optimal Control (LQR). Robert Platt, Northeastern University

  2. The linear control problem. Given: System: x_{t+1} = A x_t + B u_t

  3. The linear control problem. Given: System: x_{t+1} = A x_t + B u_t. Cost function: J(X,U) = sum_t ( x_t^T Q x_t + u_t^T R u_t ), where: X = (x_1, ..., x_T) and U = (u_1, ..., u_{T-1}) collect the states and controls.

  4. The linear control problem. Given: System: x_{t+1} = A x_t + B u_t. Cost function: J(X,U) = sum_t ( x_t^T Q x_t + u_t^T R u_t ), where: Q = Q^T >= 0 weights the state cost and R = R^T > 0 weights the control cost.

  5. The linear control problem. Given: System: x_{t+1} = A x_t + B u_t. Cost function: J(X,U) = sum_t ( x_t^T Q x_t + u_t^T R u_t ), where: Q = Q^T >= 0 and R = R^T > 0. Initial state: x_1. Calculate: U that minimizes J(X,U).

  6. The linear control problem. Given: System: x_{t+1} = A x_t + B u_t. Cost function: J(X,U) = sum_t ( x_t^T Q x_t + u_t^T R u_t ). Initial state: x_1. Calculate: U that minimizes J(X,U). This is an important problem! How do we solve it?

  7. One solution: least squares. Roll out the dynamics from the initial state: x_2 = A x_1 + B u_1.

  8. One solution: least squares. x_3 = A x_2 + B u_2 = A^2 x_1 + A B u_1 + B u_2, and so on for each subsequent time step.

  9. One solution: least squares. Stacking the rollout gives X = G U + H x_1, where G is a block lower-triangular matrix with blocks A^{i-j} B and H stacks the powers A, A^2, ..., A^T.

  10. One solution: least squares. In stacked form the cost is J = X^T Qbar X + U^T Rbar U, where: Qbar = diag(Q, ..., Q) and Rbar = diag(R, ..., R).

  11-12. One solution: least squares. Restate the problem in stacked form. Given: System: X = G U + H x_1. Cost function: J(X,U) = X^T Qbar X + U^T Rbar U. Initial state: x_1. Calculate: U that minimizes J(X,U).

  13. One solution: least squares. Substitute X into J: J(U) = (G U + H x_1)^T Qbar (G U + H x_1) + U^T Rbar U. Minimize by setting dJ/dU = 0: G^T Qbar (G U + H x_1) + Rbar U = 0. Solve for U: U = -(G^T Qbar G + Rbar)^{-1} G^T Qbar H x_1.
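
As a concrete sketch of this derivation in code (the helper name lqr_least_squares is not from the slides; G, H, Qbar, Rbar follow the stacked definitions above):

    import numpy as np

    def lqr_least_squares(A, B, Q, R, x1, T):
        """Finite-horizon LQR by least squares: stack the dynamics into
        X = G U + H x1, then minimize J over U in closed form."""
        n, m = B.shape
        # H stacks A, A^2, ..., A^T, so that X = H x1 when U = 0.
        H = np.vstack([np.linalg.matrix_power(A, t) for t in range(1, T + 1)])
        # G is block lower-triangular: block (i, j) = A^(i-j) B for j <= i.
        G = np.zeros((n * T, m * T))
        for i in range(T):
            for j in range(i + 1):
                G[n*i:n*(i+1), m*j:m*(j+1)] = np.linalg.matrix_power(A, i - j) @ B
        Qbar = np.kron(np.eye(T), Q)   # block-diagonal state cost
        Rbar = np.kron(np.eye(T), R)   # block-diagonal control cost
        # dJ/dU = 0  =>  U = -(G^T Qbar G + Rbar)^{-1} G^T Qbar H x1
        U = np.linalg.solve(G.T @ Qbar @ G + Rbar, -G.T @ Qbar @ H @ x1)
        return U.reshape(T, m)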

  14. What can this do? Solve for the optimal trajectory from a start state to an end state reached at time T. [Image: an optimal trajectory; van den Berg, 2015]

  15. What can this do? This is cool, but... – only works for finite horizon problems – doesn't account for noise – requires you to invert a big matrix

  16. Bellman solution Cost-to-go function: V(x) – the cost that we have yet to experience if we travel along the minimum cost path. – given the cost-to-go function, you can calculate the optimal path/policy Example: The number in each cell describes the number of steps “to-go” before reaching the goal state
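
The steps-to-go table in this example can be generated by a breadth-first sweep backward from the goal. A minimal sketch (the 4x4 grid size and goal cell are illustrative, not from the slide):

    from collections import deque

    def steps_to_go(rows, cols, goal):
        """Sweep backward from the goal: each reachable cell gets the number
        of steps remaining on the minimum-cost path (4-connected moves)."""
        V = {goal: 0}
        frontier = deque([goal])
        while frontier:
            r, c = frontier.popleft()
            for cell in ((r+1, c), (r-1, c), (r, c+1), (r, c-1)):
                if 0 <= cell[0] < rows and 0 <= cell[1] < cols and cell not in V:
                    V[cell] = V[(r, c)] + 1
                    frontier.append(cell)
        return V

    V = steps_to_go(4, 4, goal=(0, 3))
    print(V[(3, 0)])   # steps-to-go from the far corner: 6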

  17-19. Bellman solution. Bellman optimality principle: V_t(x) = min_u [ x^T Q x + u^T R u + V_{t+1}(A x + B u) ], i.e. (cost of this time step) + (cost of future time steps). Here V_t(x) is the cost-to-go from state x at time t, and V_{t+1}(A x + B u) is the cost-to-go from state (A x + B u) at time t+1; x^T Q x + u^T R u is the cost incurred on this time step, and V_{t+1}(A x + B u) is the cost incurred after this time step.

  20-22. Bellman solution. For the sake of argument, suppose that the cost-to-go is always a quadratic function like this: V_{t+1}(x) = x^T P_{t+1} x, where: P_{t+1} = P_{t+1}^T >= 0. Then: V_t(x) = min_u [ x^T Q x + u^T R u + (A x + B u)^T P_{t+1} (A x + B u) ]. How do we minimize this term? – take the derivative and set it to zero.

  23-24. Bellman solution. How do we minimize this term? – take the derivative and set it to zero: dV/du = 2 R u + 2 B^T P_{t+1} (A x + B u) = 0. This gives the optimal control as a function of state: u = -(R + B^T P_{t+1} B)^{-1} B^T P_{t+1} A x. But it depends on P_{t+1}... how do we solve for P_{t+1}?

  25-28. Bellman solution. Substitute u = -(R + B^T P_{t+1} B)^{-1} B^T P_{t+1} A x into V_t(x). Expanding and collecting terms: V_t(x) = x^T [ Q + A^T P_{t+1} A - A^T P_{t+1} B (R + B^T P_{t+1} B)^{-1} B^T P_{t+1} A ] x. So the cost-to-go at time t is again quadratic, V_t(x) = x^T P_t x, with:

  29. Bellman solution. P_t = Q + A^T P_{t+1} A - A^T P_{t+1} B (R + B^T P_{t+1} B)^{-1} B^T P_{t+1} A. This is the Dynamic Riccati Equation: iterate it backward from the horizon to recover every P_t.
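
In code, the recursion is a short backward loop. A minimal sketch (the terminal condition P_T = Q is one common choice; the slides do not spell theirs out):

    import numpy as np

    def riccati_recursion(A, B, Q, R, T):
        """Run the dynamic Riccati equation backward from the horizon,
        returning cost-to-go matrices P_t and feedback gains K_t."""
        P = [None] * (T + 1)
        K = [None] * T
        P[T] = Q.copy()   # terminal cost-to-go (assumed)
        for t in range(T - 1, -1, -1):
            S = R + B.T @ P[t + 1] @ B
            K[t] = np.linalg.solve(S, B.T @ P[t + 1] @ A)   # u_t = -K_t x_t
            P[t] = Q + A.T @ P[t + 1] @ A - A.T @ P[t + 1] @ B @ K[t]
        return P, K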

  30. Example: planar double integrator. Build the LQR controller for a puck sliding on an air hockey table: mass m = 1, friction coefficient b = 0.1, u = the force applied to the puck, goal position = the origin. Initial state: the initial position and velocity of the puck. Time horizon: T = 100. Cost fn: quadratic in state and control, as above.
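
One way to write this model down in code (m = 1, b = 0.1, and the 100-step horizon come from the slides; the time step dt, the cost weights, and the exact initial state are assumptions):

    import numpy as np

    dt, m_mass, b = 0.1, 1.0, 0.1   # dt is assumed; m and b are from the slide
    # State x = (px, py, vx, vy); control u = (fx, fy), the force on the puck.
    A = np.array([[1, 0, dt,               0],
                  [0, 1, 0,               dt],
                  [0, 0, 1 - b*dt/m_mass,  0],
                  [0, 0, 0,  1 - b*dt/m_mass]])
    B = np.array([[0,            0],
                  [0,            0],
                  [dt/m_mass,    0],
                  [0,   dt/m_mass]])
    Q = np.eye(4)         # assumed: penalize distance from the goal (origin)
    R = 0.1 * np.eye(2)   # assumed: penalize applied force
    T = 100               # horizon from the slides
    x1 = np.array([1.0, 0.2, 0.0, 0.0])   # assumed initial position/velocity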

  31. Example: planar double integrator. Step 1: Calculate P backward from T: P_100, P_99, P_98, ..., P_1. How?

  32-35. Example: planar double integrator. Step 1, continued: apply the dynamic Riccati equation repeatedly, starting from the horizon: initialize P_100, then compute P_99, P_98, ..., all the way down to P_1.

  36. Example: planar double integrator. Step 2: Calculate u starting at t = 1 and going forward to t = T-1: u_t = -(R + B^T P_{t+1} B)^{-1} B^T P_{t+1} A x_t, then x_{t+1} = A x_t + B u_t.
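
Steps 1 and 2 together, as a sketch reusing riccati_recursion and the model sketched above:

    P, K = riccati_recursion(A, B, Q, R, T)   # step 1: backward sweep

    x, xs, us = x1.copy(), [x1.copy()], []
    for t in range(T - 1):                    # step 2: forward rollout
        u = -K[t] @ x          # optimal control as a function of state
        x = A @ x + B @ u      # simulate the puck one step forward
        us.append(u)
        xs.append(x)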

  37. Example: planar double integrator. [Plot: the puck's optimal trajectory in the plane, running from the initial position to the origin.]

  38. Example: planar double integrator. [Plot: the controls u_x and u_y as a function of time t.]

  39-41. Example: planar double integrator. [Plots: the resulting closed-loop behavior; the puck's trajectory converges to the origin.]

  42. The infinite horizon case. So far we have optimized cost over a fixed horizon, T. – optimal if you only have T time steps to do the job. But what if time doesn't end in T steps? One idea: – at each time step, assume that you always have T more time steps to go. – this is called a receding horizon controller (sketched below).
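
A minimal sketch of that idea, built on riccati_recursion above (since the dynamics here are time-invariant, the gain could be computed once and cached):

    def receding_horizon_control(x, A, B, Q, R, T=100):
        """Pretend T steps always remain and apply the first gain."""
        _, K = riccati_recursion(A, B, Q, R, T)
        return -K[0] @ x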

  43-44. The infinite horizon case. [Plot: the elements of the P matrix vs. time step, converging toward a fixed P.] Notice that the elements of P stop changing (much) more than 20 or 30 time steps prior to the horizon. – what does this imply about the infinite horizon case?

  45. The infinite horizon case. We can solve for the infinite horizon P exactly: P = Q + A^T P A - A^T P B (R + B^T P B)^{-1} B^T P A. This is the Discrete Time Algebraic Riccati Equation.
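
SciPy solves this equation directly; a minimal sketch, reusing A, B, Q, R from the example above:

    import numpy as np
    from scipy.linalg import solve_discrete_are

    P_inf = solve_discrete_are(A, B, Q, R)   # fixed point of the recursion
    K_inf = np.linalg.solve(R + B.T @ P_inf @ B, B.T @ P_inf @ A)
    # u = -K_inf @ x is the stationary infinite-horizon LQR policy.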

  46. So, what are we optimizing for now? Given: System: x_{t+1} = A x_t + B u_t. Cost function: J(X,U) = sum over an infinite horizon of ( x_t^T Q x_t + u_t^T R u_t ), where: Q = Q^T >= 0 and R = R^T > 0. Initial state: x_1. Calculate: U that minimizes J(X,U).

  47. Controllability. A system is controllable if it is possible to reach any goal state from any other start state in a finite period of time. When is a linear system controllable? It's a property of the system dynamics...

  48. Controllability. A system is controllable if it is possible to reach any goal state from any other start state in a finite period of time. When is a linear system controllable? Remember this? Rolling out the dynamics gives x_{t+1} = A^t x_1 + [B, A B, ..., A^{t-1} B] (u_t; u_{t-1}; ...; u_1).

  49. Controllability. What property must this matrix, [B, A B, ..., A^{t-1} B], have?

  50. Controllability. This submatrix must be full rank. – i.e. the rank must equal the dimension of the state space, n. Equivalently, the controllability matrix C = [B, A B, ..., A^{n-1} B] must have rank n (a numeric check is sketched below).
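
The rank condition takes a few lines of NumPy to check (a sketch; the helper name is_controllable is not from the slides):

    import numpy as np

    def is_controllable(A, B):
        """Build C = [B, AB, ..., A^(n-1) B] and check rank(C) == n."""
        n = A.shape[0]
        C = np.hstack([np.linalg.matrix_power(A, k) @ B for k in range(n)])
        return np.linalg.matrix_rank(C) == n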
