Learning based Robust Control: Guaranteeing Stability while - PowerPoint PPT Presentation

Learning ‐ based Robust Control: Guaranteeing Stability while Improving Performance Felix Berkenkamp and Angela P. Schoellig IROS 2014 – Machine Learning in Planning and Control of Robot Motion Workshop (14. Sep.)

Why is model ‐ based control not always sufficient? • Model inaccuracies limit achievable performance! • Surface Material • Topography • Unknown dynamics Felix Berkenkamp 2

How these problems have been tackled so far Robust Robust control control Onl Online ne le learning • Specify prior uncertainty • Learn from online data in model • Improve the model • Guarantee stability and performance for all possible models Online learning Nominal model Nominal model True model True model Set of possible models Felix Berkenkamp 3

The missing link Nominal model True model Learning- Robust Online based Robust Control learning Control Set of possible models    Models uncertainty Online    learning Guarantees stability    True model Improves Nominal model online Set of possible models S. Schaal and C. G. Atkeson, “Learning control in robotics,” IEEE Robotics & Automation Magazine, vol. 17, no. 2, pp. 20–29, 2010. Felix Berkenkamp 4

Model for our approach • Stabilization of an operating point • Robust Control: Guaranteed stability / performance • Gaussian Process: Online learning Felix Berkenkamp 5

Framework: linear Robust Control • Extended linear plant • Output • Uncertainty • Disturbance • Error signal Goal: Find K such that • Stab ability ility: stable for all allowed • Perform Performance: minimized  Convex optimization problem Felix Berkenkamp 6

Gaussian Process (GP) learning • Objective: learn the model error with uncertainties from input/output data • Assumption: Correlation of data given by a kernel function Similar inputs will lead to similar outputs • Hyperparameters (noise and scaling): learned from data Felix Berkenkamp 7

Gaussian Process learning Felix Berkenkamp 8

Gaussian Process in our context • Model: Nominal model • GP inputs: • GP outputs: True model Set of possible models Felix Berkenkamp 9

Example: simple linear/affine system • True model: • Without prior knowledge, use GP to learn system dynamics Felix Berkenkamp 10

Parameters estimated using GP Felix Berkenkamp 11

Bringing GPs and Robust Control together This generalizes to non-linear, non-scalar, MIMO systems using the same process for element- wise uncertainty Felix Berkenkamp 12

The complete process Nominal model True model True model Nominal model Set of possible models Set of possible models 13 Felix Berkenkamp

Experimental results: https://youtu.be/YqhLnCm0KXY?list=PLC12E387419CEAFF2 step response step response step response 0.2 0.2 0.2 800 800 800 0 0 0 1000 1000 1000 -0.2 -0.2 -0.2 desired 2000 2000 x position [m] x position [m] x position [m] 3000 desired -0.4 -0.4 -0.4 desired -0.6 -0.6 -0.6 -0.8 -0.8 -0.8 -1 -1 -1 0 0 0 2 2 2 4 4 4 6 6 6 8 8 8 10 10 10 12 12 12 14 14 14 time [s] time [s] time [s] 0.2 0.2 0.2 800 800 800 0 0 0 1000 1000 1000 -0.2 -0.2 -0.2 2000 desired 2000 y position [m] y position [m] y position [m] 3000 desired -0.4 -0.4 -0.4 desired -0.6 -0.6 -0.6 -0.8 -0.8 -0.8 -1 -1 -1 0 0 0 2 2 2 4 4 4 6 6 6 8 8 8 10 10 10 12 12 12 14 14 14 time [s] time [s] time [s] Felix Berkenkamp 14

Summary • Combined Combined Gaussian Process lear arni ning ng with Linear Robust Robust Cont Control rol • Enables contro controlle ller perform performance to to im improve onlin online while providing stab stability ility guaran guarantees Nominal model True model True model Nominal model Set of possible models Set of possible models Felix Berkenkamp 15

Thank you! Felix Berkenkamp befelix@ethz.ch

Learning based Robust Control: Guaranteeing Stability while - PowerPoint PPT Presentation

Learning based Robust Control: Guaranteeing Stability while Improving Performance Felix Berkenkamp and Angela P. Schoellig IROS 2014 Machine Learning in Planning and Control of Robot Motion Workshop (14. Sep.) Why is model based control

Guaranteeing High Prices by Guaranteeing the Lowest Price M ATTHEW C. C ORCORAN * Many businesses

Bessel inequality for robust stability analysis of time-delay system F. Gouaisbaut, Y. Ariba, A.

Outlier Outlier Outlier- Outlier - -robust - robust robust robust identification

Short Course in Supervised Learning Robust Optimization and Machine Learning Robust Supervised

Robust stability analysis of uncertain Linear Positive Systems via Integral Linear Constraints: L 1

Fragility of controllers Robust and non-fragile control systems Robust and non-fragile control

Action Robust Reinforcement Learning and Applications in Continuous Control Chen Tessler *,

Guaranteeing Performance in High- Density OpenStack Environments Endre Sara VP of Dev

A tour on Bridgeland stability Paolo Stellari Hamburg, June 2015 Paolo Stellari A tour on

Robust Deep Learning Based on Meta-learning Deyu Meng Xian Jiaotong University

Stability, Networks: Stability, Networks: Control, and Optimality Control, and Optimality

Nonlinear Control Lecture # 18 Stability of Feedback Systems Nonlinear Control Lecture # 18

Nonlinear Control Lecture # 14 Input-Output Stability Nonlinear Control Lecture # 14 Input-Output

, , Weakly Supervised Classification Robust Learning and More: Robust Learning and More:

SmartMill Technology Guaranteeing Particle Size, Quality and Production with On-site Sorbent

SmartMill Technology Guaranteeing Particle Size, Quality and Production with On-site Sorbent

Robust Estimation and Generative Adversarial Networks Weizhi ZHU Hong Kong University of Science

Subjectivity of Autonomous August 28, 2012 Agents. Some Philosophical and Legal Remarks

Intelligent Vehicle Systems Southwest Research Ins6tute Mapviz 2D

Be A Developer Experience Super Hero: Robust Dev Scripts For Peace and Joy 1 Hi, Im Dustin

COMPSTAT 2010 Robust forecasting of non-stationary time series Koen Mahieu K.U.Leuven

Robust Scene Categorization by Learning Image Statistics in Context for BBC rushes Jan van

Robust Monopoly Regulation Yingni Guo, Eran Shmaya Northwestern University CCET, Sep 2019

jVPFS: Adding Robustness to a Secure Stacked File System with Untrusted Local Storage Components

Sambuz

Useful Links

Newsletter

Mail Us