safe reinforcement learning in robotics with bayesian
play

Safe Reinforcement Learning in Robotics with Bayesian Models Feli - PowerPoint PPT Presentation

Safe Reinforcement Learning in Robotics with Bayesian Models Feli lix Berk rkenkamp, Matteo Turchetta, Angela P. Schoellig, Andreas Krause @Workshop on Reliable AI, October 2017 A new era of autonomy Images: rethink robotics, Waymob, iRobot


  1. Safe Reinforcement Learning in Robotics with Bayesian Models Feli lix Berk rkenkamp, Matteo Turchetta, Angela P. Schoellig, Andreas Krause @Workshop on Reliable AI, October 2017

  2. A new era of autonomy Images: rethink robotics, Waymob, iRobot Felix Berkenkamp 2

  3. Reinforcement learning Explo loration Policy Poli licy update Image: Plainicon, https://flaticon.com Felix Berkenkamp 3

  4. Dangers of autonomous learning Safety despite uncertain inty Safe exp xploration Image: Freepik, https://flaticon.com Felix Berkenkamp 4

  5. Safe reinforcement learning Bayesian models for safety Model-free Model-based Exploration Policy Policy update Image: Plainicon, https://flaticon.com Felix Berkenkamp 5

  6. Model-free reinforcement learning Tracking performance Few experiments Safety constraint Sa Safety for r all ll experiments Felix Berkenkamp 6

  7. Gaussian process Felix Berkenkamp 7

  8. Constrained Bayesian optimization Felix Berkenkamp 8

  9. Vid ideo avail ilable at http:/ ://t /tiny.cc/ic icra16_video 9 Felix Berkenkamp

  10. 10 Felix Berkenkamp

  11. Safe reinforcement learning Bayesian models for safety Model-free Model-based Exploration Policy Policy update Image: Plainicon, https://flaticon.com Felix Berkenkamp 11

  12. Model-based reinforcement learning Modelling Model Control Theory Implement Felix Berkenkamp 12

  13. Approximate dynamic programming Dynamics Expected cost Poli licy update Felix Berkenkamp 13

  14. Uncertain dynamics Dynamics model Safety-critical Felix Berkenkamp 14

  15. Approximate dynamic programming Dynamics Felix Berkenkamp 15

  16. Reinforcement learning Sa Safe exploration Explo loration Policy Sa Safe poli licy update Poli licy update Image: Plainicon, https://flaticon.com Felix Berkenkamp 16

  17. Region of attraction Felix Berkenkamp 17

  18. Lyapunov functions [A.M. Lyapunov 1892] Felix Berkenkamp 18

  19. Safe policy optimization (NIPS 2017) Optimize policy for performance Determine safe region Poli licy update Felix Berkenkamp 19

  20. Policy optimization Policy Felix Berkenkamp 20

  21. Policy optimization Need to explore! Felix Berkenkamp 21

  22. Obtaining data Felix Berkenkamp 22

  23. Experimental results Felix Berkenkamp 23

  24. Policy performance Felix Berkenkamp 24

  25. Conclusion Sa Safe fe re rein info forcement lea learnin ing! Can use st statis istic ical models to give high-probability safety guarantees Theoretical guarantees in the paper Code at github.com/befelix More safe learning at http://berkenkamp.me Felix Berkenkamp 25

Recommend


More recommend