Safe Learning of Regions of Attraction for Uncertain, Nonlinear - PowerPoint PPT Presentation

Safe Learning of Regions of Attraction for Uncertain, Nonlinear Systems with Gaussian Processes Felix Berkenkamp, Riccardo Moriconi, Angela P. Schoellig, Andreas Krause @CDC, December 2016

What is control? Modelling Model Control theory Implement Felix Berkenkamp 2

One small assumption… Model Degraded performance ce Instability Felix Berkenkamp 3

What is control? Modelling Model Control theory Implement Felix Berkenkamp 4

Why is learning not commonly used? Because safety matters!

What can go wrong? Modelling Model Control theory Feedback ck Implement Exci citation? Stability? Felix Berkenkamp 6

Problem definition Can we learn about dynamics cs while remaining stable? with Lipschitz continuous Bounded RKHS norm Where is this control policy safe to use? You can experiment, but no system failures! Felix Berkenkamp 7

Challenges with Bayesian learning Exploration (excitation) Stability certifi ficates (robustness) ✓ ✓ Linear systems [L. Jung, SAP’98] Linear controllers [F.Berkenkamp et al, ECC’15] ✓ ? Finite domains [R.I.Brafman et al, JMLR‘02] Nonlinear systems [A.K.Akametalu et al, CDC’14] ? Nonlinear, continuous This paper: Use ideas from sensor placement Lyapunov stability (nonlinear, unce certain systems) with high probability Felix Berkenkamp 8

Region of attraction Felix Berkenkamp 9

Lyapunov functions [A.M. Lyapunov 1966] Felix Berkenkamp 10

What about unknown dynamics? known systems: [R. Bobiti, M. Lazar, CDC 2016] Felix Berkenkamp 11

Gaussian process models high probability confidence intervals Lipschitz continuous Felix Berkenkamp 12

What about unknown dynamics? True system is stable within with high probability! Felix Berkenkamp 13

Exploring the safe set Felix Berkenkamp 14

Challenges with Bayesian learning Exploration (excitation) Stability certifi ficates (robustness) ✓ ✓ Linear systems [L. Jung, SAP’98] Linear controllers [F.Berkenkamp et al, ECC’15] ✓ ? Finite domains [R.I.Brafman et al, JMLR‘02] Nonlinear systems [A.K.Akametalu et al, CDC’14] ? Nonlinear, continuous This paper: Use ideas from sensor placement Lyapunov stability (nonlinear, unce certain systems) with high probability Felix Berkenkamp 15

How to explore? How to actively explore? Do we converge to maximum safe set? The policy is safe: keeps us in Apply Felix Berkenkamp 16

Theoretical result Close-to-optimal measurements: [A.Krause, C.Guestrin , UAI’05] Theorem: Theorem: Theorem: Theorem: Guaranteed to converge to the maximum safe levelset up to a certain accuracy after a Guaranteed to converge to the maximum safe levelset up to a certain accuracy after a Guaranteed to converge to the maximum safe levelset Guaranteed to converge to the maximum safe levelset up to a certain accuracy finite number of data points – without leaving this safe levelset with high probability. finite number of data points Bound depends on • Size of the maximum safe levelset • Information capacity of the Gaussian process model • Accuracy Felix Berkenkamp 17

Inverted pendulum Maximum torque limited! Safe exploration so that the pendulum doesn’t fall. Controller: LQR with prior mean model Quadratic Lyapunov function Felix Berkenkamp 18

Safe learning for an inverted pendulum Felix Berkenkamp 19

Conclusion Can simultaneously learn system dynamics and give stability guarantees Lyapunovstability for nonlinear, unce certain systems (with high probability, discretization) Convergence ce guarantees There is hope for safe fe reinfo force cement learning! Code is open source Example notebooks More safe learning at http://berkenkamp.me Felix Berkenkamp 20

Safe Learning of Regions of Attraction for Uncertain, Nonlinear - PowerPoint PPT Presentation

Safe Learning of Regions of Attraction for Uncertain, Nonlinear Systems with Gaussian Processes Felix Berkenkamp, Riccardo Moriconi, Angela P. Schoellig, Andreas Krause @CDC, December 2016 What is control? Modelling Model Control theory

COVID-19 VIRTUAL FORUM STRATEGY IN UNCERTAIN TIMES COVID-19: STRATEGY IN UNCERTAIN TIMES APRIL

Uncertain< T > A First-Order Type for Uncertain Data James Bornholt Australian National

Electronegativity is defined as the elements attraction for electrons and is based on a scale of

Rationality and Traffic Attraction Rationality and Traffic Attraction Incentives for Honest Path

Uncertain Centroid based Partitional Clustering of Uncertain Data Francesco Gullo Andrea

Top-k Queries over Uncertain Scores Qing Liu, Debabrota Basu, Talel Abdessalem, St ephane

regions and cities the role of the European Committee of the Regions Startup Europe Regions

Evaluating regions of attraction of LTI systems with saturation in IQS framework Dimitri

DESIGNING ROBUST SYSTEMS DESIGNING ROBUST SYSTEMS with with UNCERTAIN INFORMATION UNCERTAIN

$810 capital investment Average salary $41,282 FY2014 Q1 Client Activity Business Attraction and

www.WheelhouseCounseling.com Dating 101: Social Assessment and Personal Presentation

Decco closes Fruit Attraction with the presentation of its new fungicides Decco Ibrica has

Attraction and Avoidance Detection from Movements Zhenhui Jessie Li (with Bolin Ding, Fei Wu,

Polarization in Attraction-Repulsion Models Elisabetta Cornacchia, Neta Singer, Emmanuel Abbe

Health Care the Danish Model Janet Samuel, Danish Regions Danish Regions The Danish Health

ICANN s s geographical geographical ICANN II : the sequel : the sequel Regions II

Representations of classical Lie groups: two regimes of growth Alexey Bufetov University of Bonn

Asymptotics of representations of classical Lie groups Alexey Bufetov Department of Mathematics,

Generalization theory Daniel Hsu Columbia TRIPODS Bootcamp 1 Motivation 2 Support vector

Poisson Convergence Will Perkins February 28, 2013 Back to the Birthday Problem On HW # 2, you

Effectiveness of Freq Pat Mining Too many patterns! A pattern a 1 a 2 a n contains 2 n

Reflexive Tactics From: Introduction to the COQ Proof-Assistant for Practical Software

Multiagent Systems: Spring 2006 Ulle Endriss Institute for Logic, Language and Computation

Feature Selection & the Shapley-Folkman Theorem. Alexandre dAspremont , CNRS & D.I.,

Sambuz

Useful Links

Newsletter

Mail Us

Safe Learning of Regions of Attraction for Uncertain, Nonlinear - PowerPoint PPT Presentation

Safe Learning of Regions of Attraction for Uncertain, Nonlinear Systems with Gaussian Processes Felix Berkenkamp, Riccardo Moriconi, Angela P. Schoellig, Andreas Krause @CDC, December 2016 What is control? Modelling Model Control theory

COVID-19 VIRTUAL FORUM STRATEGY IN UNCERTAIN TIMES COVID-19: STRATEGY IN UNCERTAIN TIMES APRIL

Uncertain&lt; T &gt; A First-Order Type for Uncertain Data James Bornholt Australian National

Electronegativity is defined as the elements attraction for electrons and is based on a scale of

Rationality and Traffic Attraction Rationality and Traffic Attraction Incentives for Honest Path

Uncertain Centroid based Partitional Clustering of Uncertain Data Francesco Gullo Andrea

Top-k Queries over Uncertain Scores Qing Liu, Debabrota Basu, Talel Abdessalem, St ephane

regions and cities the role of the European Committee of the Regions Startup Europe Regions

Evaluating regions of attraction of LTI systems with saturation in IQS framework Dimitri

DESIGNING ROBUST SYSTEMS DESIGNING ROBUST SYSTEMS with with UNCERTAIN INFORMATION UNCERTAIN

$810 capital investment Average salary $41,282 FY2014 Q1 Client Activity Business Attraction and

www.WheelhouseCounseling.com Dating 101: Social Assessment and Personal Presentation

Decco closes Fruit Attraction with the presentation of its new fungicides Decco Ibrica has

Attraction and Avoidance Detection from Movements Zhenhui Jessie Li (with Bolin Ding, Fei Wu,

Polarization in Attraction-Repulsion Models Elisabetta Cornacchia, Neta Singer, Emmanuel Abbe

Health Care the Danish Model Janet Samuel, Danish Regions Danish Regions The Danish Health

ICANN s s geographical geographical ICANN II : the sequel : the sequel Regions II

Representations of classical Lie groups: two regimes of growth Alexey Bufetov University of Bonn

Asymptotics of representations of classical Lie groups Alexey Bufetov Department of Mathematics,

Generalization theory Daniel Hsu Columbia TRIPODS Bootcamp 1 Motivation 2 Support vector

Poisson Convergence Will Perkins February 28, 2013 Back to the Birthday Problem On HW # 2, you

Effectiveness of Freq Pat Mining Too many patterns! A pattern a 1 a 2 a n contains 2 n

Reflexive Tactics From: Introduction to the COQ Proof-Assistant for Practical Software

Multiagent Systems: Spring 2006 Ulle Endriss Institute for Logic, Language and Computation

Feature Selection &amp; the Shapley-Folkman Theorem. Alexandre dAspremont , CNRS &amp; D.I.,

Sambuz

Useful Links

Newsletter

Mail Us

Uncertain< T > A First-Order Type for Uncertain Data James Bornholt Australian National

Feature Selection & the Shapley-Folkman Theorem. Alexandre dAspremont , CNRS & D.I.,