  1. Trajectory Optimization (this is a draft, to be updated before lecture)
     McGill COMP 765, Oct 5th, 2017

  2. Recall: LQR
     • Provided a globally optimal control solution in a single pass, under fairly restrictive assumptions (linear dynamics, quadratic cost)
     • As with the KF/EKF, no practical robot meets these assumptions, but we can still make use of the math through clever approximations
     • We will consider several of these approaches today; a sketch of the LQR recursion follows below
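As a refresher, here is a minimal sketch of the finite-horizon, discrete-time LQR backward (Riccati) recursion the slide recalls. The matrices and the double-integrator example at the bottom are illustrative placeholders, not from the slides:

```python
import numpy as np

def lqr_backward_pass(A, B, Q, R, Qf, T):
    """Finite-horizon discrete-time LQR via the backward Riccati recursion.

    Dynamics: x_{t+1} = A x_t + B u_t
    Cost:     sum_t (x_t^T Q x_t + u_t^T R u_t) + x_T^T Qf x_T
    Returns the time-varying gains K_t, with the control law u_t = -K_t x_t.
    """
    P = Qf        # value-function Hessian at the terminal time
    gains = []
    for _ in range(T):
        # Gain from minimizing the quadratic cost-to-go w.r.t. u
        K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
        # Riccati update for the value-function Hessian one step earlier
        P = Q + A.T @ P @ (A - B @ K)
        gains.append(K)
    return gains[::-1]  # reorder so gains[t] applies at time t

# Toy double integrator (dt = 0.1), purely illustrative:
A = np.array([[1.0, 0.1], [0.0, 1.0]])
B = np.array([[0.0], [0.1]])
K0 = lqr_backward_pass(A, B, np.eye(2), np.eye(1), np.eye(2), T=100)[0]
```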

  3. Non-linear Extensions
     • Trajectory optimization: solve locally about a path, which is jointly improved:
       • Dynamic programming with local linearization
       • Constrained optimization: cost is the objective, dynamics are constraints
       • "Direct" methods that search in the space of policies
     • Approximate the value function with learning approaches

  4. Differential Dynamic Programming
     • A "shooting" local trajectory optimization method that builds upon LQR ideas
     • Approximate the familiar value function with a 2nd-order expansion (reconstructed below)
     • Computed around a reference trajectory obtained from a forward pass integrating the current policy
     • δx and δu are deviations from that reference, where we can assume local linearity
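The slide's equation was an image that did not survive extraction; in standard DDP notation (a reconstruction, not the slide's own rendering), the cost-to-go is expanded to second order about the reference pair (x̄_t, ū_t), with δx = x − x̄_t and δu = u − ū_t:

$$
Q(\delta x, \delta u) \;\approx\; \frac{1}{2}
\begin{bmatrix} 1 \\ \delta x \\ \delta u \end{bmatrix}^{\!\top}
\begin{bmatrix}
0 & Q_x^{\top} & Q_u^{\top} \\
Q_x & Q_{xx} & Q_{xu} \\
Q_u & Q_{ux} & Q_{uu}
\end{bmatrix}
\begin{bmatrix} 1 \\ \delta x \\ \delta u \end{bmatrix}
$$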

  5. DDP Backwards Pass
     • Solving for the optimizing control relative to the current forward "rollout" is called a backwards pass
     • The math follows the LQR pattern (written out below):
       • Expand
       • Take the derivative w.r.t. u
       • Compute the control and best value
       • Iterate
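Concretely, one backward-pass step in the iLQR form looks like the following (DDP proper adds second-derivative terms of the dynamics; this is a standard-notation reconstruction, with ℓ the stage cost, f the dynamics, and V' the value function at the next time step):

$$
\begin{aligned}
Q_x &= \ell_x + f_x^{\top} V'_x, \qquad
Q_u = \ell_u + f_u^{\top} V'_x, \\
Q_{xx} &= \ell_{xx} + f_x^{\top} V'_{xx} f_x, \qquad
Q_{uu} = \ell_{uu} + f_u^{\top} V'_{xx} f_u, \qquad
Q_{ux} = \ell_{ux} + f_u^{\top} V'_{xx} f_x, \\
\delta u^{*} &= k + K\,\delta x, \qquad
k = -Q_{uu}^{-1} Q_u, \qquad
K = -Q_{uu}^{-1} Q_{ux},
\end{aligned}
$$

after which the value terms are updated as $V_x = Q_x + K^{\top} Q_{uu} k + K^{\top} Q_u + Q_{ux}^{\top} k$ and $V_{xx} = Q_{xx} + K^{\top} Q_{uu} K + K^{\top} Q_{ux} + Q_{ux}^{\top} K$, and the recursion steps back one time index.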

  6. DDP Analysis
     • What can DDP do?
       • Swing up pendulums from rest (rapidly!)
       • Grasping motions for robot arms
       • Full-body motions for humanoid robots (and animated humans)
     • What are the limitations?
       • No guarantees about global solution quality
       • Sensitive to the starting controller
       • Model knowledge is still needed
     • Posted papers for examples of use in recent research:
       • "Probabilistic Differential Dynamic Programming"
       • "Control-Limited Differential Dynamic Programming"
       • "Guided Policy Search" (states that it uses iLQR … what is the difference?)

  7. What about under-actuation?
     • So far our optimal control formulation admits solutions with u = ±∞, and this will often be the solution:
       • For example, in our time-optimal 2nd-order linear actuator problem, just accelerate to infinite speed and reach the goal in zero time.
       • While this may be the objective of some cab drivers in Montreal, passengers may prefer a limited acceleration.
     • Many methods exist to represent control limits (a soft-penalty form is sketched below):
       • Penalize large control values using the reward function
       • Form hard constraints and solve a constrained optimization
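One standard soft-limit construction (notation mine, not from the slide) simply augments the stage cost with a quadratic control penalty:

$$
\ell(x_t, u_t) = \|x_t - x_{\text{goal}}\|^2 + \lambda \|u_t\|^2, \qquad \lambda > 0,
$$

where a larger λ trades tracking accuracy for smaller controls. Note this only discourages, and never strictly enforces, a limit, which motivates the hard-constrained formulation on the next slide.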

  8. Constrained Optimal Control

     max_π Σ_t r(x_t)
     s.t.  x_t = f(x_{t−1}, π(x_{t−1}))
           π(x_t) < c   ∀t

     • This can be solved easily for certain classes of reward and constraint
     • Example: sequential quadratic programs (SQP) used in walking control; a toy SQP sketch follows below
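As a concrete toy instance, here is a minimal single-shooting sketch using SciPy's SLSQP solver (an SQP-type method) on a hypothetical double-integrator reach task with hard control bounds. The dynamics, cost weights, and limits are illustrative assumptions, not the walking-control formulation from the slide:

```python
import numpy as np
from scipy.optimize import minimize

# Toy double integrator: state x = [position, velocity], control u = acceleration.
dt, T = 0.1, 30
goal = np.array([1.0, 0.0])

def rollout(u):
    """Integrate the dynamics forward under the control sequence u."""
    x, xs = np.zeros(2), []
    for ut in u:
        x = x + dt * np.array([x[1], ut])  # x' = v, v' = u
        xs.append(x)
    return np.array(xs)

def cost(u):
    xs = rollout(u)
    # Distance to the goal along the path, plus a small control-effort term
    return np.sum((xs - goal) ** 2) + 1e-2 * np.sum(u ** 2)

# Hard control limits |u| <= 2 enter as box bounds on the decision variables.
res = minimize(cost, np.zeros(T), method="SLSQP", bounds=[(-2.0, 2.0)] * T)
print("final state:", rollout(res.x)[-1])
```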

  9. What to do if our models are not precise?
     • Recall: this is not an "if" but a "when"!
     • Model error will quickly invalidate the results of our extensive computations as they are recursively applied over time
     • Many exciting solutions exist for this, but two of the most classical are:
       • Computing control policies while "knowing what we don't know": robust control
       • Limiting the horizon and re-computing our controls often: model-predictive control (MPC), sketched below
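Continuing the toy double integrator above, a minimal receding-horizon MPC sketch (all constants and the noise model are illustrative assumptions): plan over a short horizon from the measured state, apply only the first control, then re-plan.

```python
import numpy as np
from scipy.optimize import minimize

dt, goal = 0.1, np.array([1.0, 0.0])  # same toy double integrator as above

def mpc_step(x0, horizon=10):
    """Plan from the *measured* state x0 and return only the first control."""
    def cost(u):
        x, c = x0.copy(), 0.0
        for ut in u:
            x = x + dt * np.array([x[1], ut])  # x' = v, v' = u
            c += np.sum((x - goal) ** 2) + 1e-2 * ut ** 2
        return c
    res = minimize(cost, np.zeros(horizon), method="SLSQP",
                   bounds=[(-2.0, 2.0)] * horizon)
    return res.x[0]

# Receding-horizon loop: frequent re-planning from the true state is what
# absorbs model error, at the price of a shorter (suboptimal) lookahead.
x = np.zeros(2)
for _ in range(50):
    x = x + dt * np.array([x[1], mpc_step(x)]) + 0.005 * np.random.randn(2)
```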
