Safe Reinforcement Learning in Robotics with Bayesian Models Feli - PowerPoint PPT Presentation

Dec 19, 2023 •121 likes •383 views

Safe Reinforcement Learning in Robotics with Bayesian Models Feli lix Berk rkenkamp, Matteo Turchetta, Angela P. Schoellig, Andreas Krause @Workshop on Reliable AI, October 2017 A new era of autonomy Images: rethink robotics, Waymob, iRobot

Safe Reinforcement Learning in Robotics with Bayesian Models Feli lix Berk rkenkamp, Matteo Turchetta, Angela P. Schoellig, Andreas Krause @Workshop on Reliable AI, October 2017
A new era of autonomy Images: rethink robotics, Waymob, iRobot Felix Berkenkamp 2
Reinforcement learning Explo loration Policy Poli licy update Image: Plainicon, https://flaticon.com Felix Berkenkamp 3
Dangers of autonomous learning Safety despite uncertain inty Safe exp xploration Image: Freepik, https://flaticon.com Felix Berkenkamp 4
Safe reinforcement learning Bayesian models for safety Model-free Model-based Exploration Policy Policy update Image: Plainicon, https://flaticon.com Felix Berkenkamp 5
Model-free reinforcement learning Tracking performance Few experiments Safety constraint Sa Safety for r all ll experiments Felix Berkenkamp 6
Gaussian process Felix Berkenkamp 7
Constrained Bayesian optimization Felix Berkenkamp 8
Vid ideo avail ilable at http:/ ://t /tiny.cc/ic icra16_video 9 Felix Berkenkamp
10 Felix Berkenkamp
Safe reinforcement learning Bayesian models for safety Model-free Model-based Exploration Policy Policy update Image: Plainicon, https://flaticon.com Felix Berkenkamp 11
Model-based reinforcement learning Modelling Model Control Theory Implement Felix Berkenkamp 12
Approximate dynamic programming Dynamics Expected cost Poli licy update Felix Berkenkamp 13
Uncertain dynamics Dynamics model Safety-critical Felix Berkenkamp 14
Approximate dynamic programming Dynamics Felix Berkenkamp 15
Reinforcement learning Sa Safe exploration Explo loration Policy Sa Safe poli licy update Poli licy update Image: Plainicon, https://flaticon.com Felix Berkenkamp 16
Region of attraction Felix Berkenkamp 17
Lyapunov functions [A.M. Lyapunov 1892] Felix Berkenkamp 18
Safe policy optimization (NIPS 2017) Optimize policy for performance Determine safe region Poli licy update Felix Berkenkamp 19
Policy optimization Policy Felix Berkenkamp 20
Policy optimization Need to explore! Felix Berkenkamp 21
Obtaining data Felix Berkenkamp 22
Experimental results Felix Berkenkamp 23
Policy performance Felix Berkenkamp 24
Conclusion Sa Safe fe re rein info forcement lea learnin ing! Can use st statis istic ical models to give high-probability safety guarantees Theoretical guarantees in the paper Code at github.com/befelix More safe learning at http://berkenkamp.me Felix Berkenkamp 25

Recommend

CSC2621 Topics in Robotics Reinforcement Learning in Robotics Week 11: Hierarchical Reinforcement

CSC2621 Topics in Robotics Reinforcement Learning in Robotics Week 11: Hierarchical Reinforcement Learning Animesh Garg Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning Richard S. Sutton , Doina

2.19k views • 29 slides

Safe Reinforcement Learning Philip S. Thomas Stanford CS234: Reinforcement Learning, Guest

Safe Reinforcement Learning Philip S. Thomas Stanford CS234: Reinforcement Learning, Guest Lecture May 24, 2017 Lecture overview What makes a reinforcement learning algorithm safe ? Notation Creating a safe reinforcement learning

1.42k views • 88 slides

Reinforcement Learning AIMA Chapters: 21.1, 21.2, 21.3. Sutton and Barto, Reinforcement Learning:

Reinforcement Learning Reinforcement Learning AIMA Chapters: 21.1, 21.2, 21.3. Sutton and Barto, Reinforcement Learning: an Introduction, 2nd Edition: Chapters 6 (6.1 6.5) Outline Reinforcement Learning Reinforcement Learning: the

587 views • 27 slides

Reinforcement Learning Timothy Chou Charlie Tong Vincent Zhuang April 19, 2016 Reinforcement

Reinforcement Learning Q-Learning Deep Q-Learning on Atari Reinforcement Learning Timothy Chou Charlie Tong Vincent Zhuang April 19, 2016 Reinforcement Learning Q-Learning Deep Q-Learning on Atari Table of Contents Reinforcement Learning

939 views • 63 slides

Mobile & Service Robotics Mobile & Service Robotics Sensors for Robotics Sensors for

Mobile & Service Robotics Mobile & Service Robotics Sensors for Robotics Sensors for Robotics 3 Sensors for Robotics Sensors for Robotics 3 Laser sensors Rays are transmitted and received coaxially Rays are transmitted and

618 views • 50 slides

Mobile & Service Robotics Mobile & Service Robotics Sensors for Robotics Sensors for

Mobile & Service Robotics Mobile & Service Robotics Sensors for Robotics Sensors for Robotics 2 Sensors for Robotics Sensors for Robotics 2 Sensors for mobile robots Objectives: perceive, analyze and understand the environment

562 views • 42 slides

Mobile & Service Robotics Mobile & Service Robotics Sensors for Sensors for Robotics

Mobile & Service Robotics Mobile & Service Robotics Sensors for Sensors for Robotics Sensors for Sensors for Robotics Robotics 1 Robotics 1 An Example of robots with their sensors 2 Basilio Bona Robotica 03CFIOR 2011 Another

584 views • 20 slides

RL Overview of topics About Reinforcement Learning The Reinforcement Learning Problem

Introduction to Reinforcement Learning RL Overview of topics About Reinforcement Learning The Reinforcement Learning Problem Inside an RL agent Temporal difference learning Many faces of Reinforcement Learning What is

552 views • 35 slides

Reinforcement Learning UMaine COS 470/570 Introduction to AI Why reinforcement learning?

Reinforcement Learning<br/><br/> 4/25/19, 8*06 PM Reinforcement Learning<br/><br/> 4/25/19, 8*06 PM Reinforcement Learning UMaine COS 470/570 Introduction to AI Why reinforcement learning? Spring 2019 Created:

371 views • 15 slides

Reinforcement Learning and Simulation-Based Search David Silver Reinforcement Learning and

Reinforcement Learning and Simulation-Based Search Reinforcement Learning and Simulation-Based Search David Silver Reinforcement Learning and Simulation-Based Search Outline 1 Reinforcement Learning 2 Simulation-Based Search 3 Planning Under

425 views • 20 slides

Reinforcement Learning Reinforcement Learning Reinforcement Learning in a nutshell g Imagine

Reinforcement Learning Reinforcement Learning Reinforcement Learning in a nutshell g Imagine playing a new game whose rules you dont know; after a hundred or so moves your don t know; after a hundred or so moves, your opponent announces, You

512 views • 30 slides

CSC2621 Topics in Robotics Reinforcement Learning in Robotics Week 4: Q-Value based RL Animesh

CSC2621 Topics in Robotics Reinforcement Learning in Robotics Week 4: Q-Value based RL Animesh Garg Deep Reinforcement Learning with Double Q-learning Hado van Hasselt, Arthur Guez, David Silver Dueling Network Architectures for Deep

536 views • 24 slides

Outline Intro to RL and Bayesian Learning History of Bayesian RL Model-based Bayesian

Outline Intro to RL and Bayesian Learning History of Bayesian RL Model-based Bayesian RL Prior knowledge, policy optimization, discussion, Bayesian approaches for other RL variants Model-free Bayesian RL Gaussian

1.23k views • 63 slides

Human-Oriented Robotics Octave/Matlab Tutorial Kai Arras Social Robotics Lab, University of

Human-Oriented Robotics Prof. Kai Arras Social Robotics Lab Human-Oriented Robotics Octave/Matlab Tutorial Kai Arras Social Robotics Lab, University of Freiburg 1 Human-Oriented Robotics Contents Prof. Kai Arras Social Robotics Lab

1.78k views • 121 slides

Robotics Engineering Prof. Michael Gennert Robotics Engineering Program Director Fall 2016

Robotics Engineering Prof. Michael Gennert Robotics Engineering Program Director Fall 2016 Robotics Education Gap Robotics Research PhD @ Large Research Robotics University Engineering Industrial Robotics Technology AA, AS @ Making

945 views • 22 slides

LEGO Develops a new LEGO Develops a new robotics platform - WeDo robotics platform - WeDo

LEGO Develops a new LEGO Develops a new robotics platform - WeDo robotics platform - WeDo 1. LEGO Robotics and the 1. LEGO Robotics and the Robotics continuum Robotics continuum 2.

408 views • 18 slides

Robot ics J uly 26, 2005 CS 486/ 686 Universit y of Wat erloo Out line Robot ics

598 views • 22 slides

CS 188: Artificial Intelligence Advanced Applications: Robotics Pieter Abbeel UC Berkeley A

CS 188: Artificial Intelligence Advanced Applications: Robotics Pieter Abbeel UC Berkeley A few slides from Sebastian Thrun, Dan Klein 2 So Far Mostly Foundational Methods 3 1 Advanced Applications 4 [DEMO: Race, Short] Autonomous

412 views • 13 slides

Dependency Dependency- -Based Automatic Evaluation Based Automatic Evaluation Dependency

Dependency Dependency- -Based Automatic Evaluation Based Automatic Evaluation Dependency Dependency - - Based Automatic Evaluation Based Automatic Evaluation for Machine Translation for Machine Translation for Machine Translation for

589 views • 11 slides

ICTIR 2016 Slides - The Impact of Fixed-Cost Pooling Strategies on Test Collection Bias

See discussions, stats, and author profiles for this publication at: https://www.researchgate.net/publication/308120269 ICTIR 2016 Slides - The Impact of Fixed-Cost Pooling Strategies on Test Collection Bias Presentation September 2016

728 views • 53 slides

CS 354 Autonomous Robotics Particle Filters Instructors: Dr. Kevin Molloy and Dr. Nathan

CS 354 Autonomous Robotics Particle Filters Instructors: Dr. Kevin Molloy and Dr. Nathan Sprague SA-1 Objectives Process of determining where a mobile Localization robot is located with respect to its environment. Methods we know so far:

352 views • 31 slides

Slide 1 / 38 Slide 2 / 38 1 A student throws a ball upward where the initial potential energy

Slide 1 / 38 Slide 2 / 38 1 A student throws a ball upward where the initial potential energy is 0. At a height of 15 meters the ball has a potential energy of 60 joules and is moving upward with a kinetic energy of 40 joules. AP Physics C

199 views • 7 slides

Branding on a Budget Public Health Communications Webinar Series June 17, 2019 Webinar

Branding on a Budget Public Health Communications Webinar Series June 17, 2019 Webinar Objectives Understand the importance of a strong brand Discuss basic principles of branding and tips for defining your brand Share recommendations

627 views • 59 slides

Background memCellsF09 Allen Tanner built an SRAM/ROM generator program back in 2004 Single-

Background memCellsF09 Allen Tanner built an SRAM/ROM generator program back in 2004 Single- and Double-port SRAM the ROM seems to work fine there are building blocks fabricated examples that work the SRAM isnt as good

301 views • 6 slides

Safe Reinforcement Learning in Robotics with Bayesian Models Feli - PowerPoint PPT Presentation

Safe Reinforcement Learning in Robotics with Bayesian Models Feli lix Berk rkenkamp, Matteo Turchetta, Angela P. Schoellig, Andreas Krause @Workshop on Reliable AI, October 2017 A new era of autonomy Images: rethink robotics, Waymob, iRobot

CSC2621 Topics in Robotics Reinforcement Learning in Robotics Week 11: Hierarchical Reinforcement

Safe Reinforcement Learning Philip S. Thomas Stanford CS234: Reinforcement Learning, Guest

Reinforcement Learning AIMA Chapters: 21.1, 21.2, 21.3. Sutton and Barto, Reinforcement Learning:

Reinforcement Learning Timothy Chou Charlie Tong Vincent Zhuang April 19, 2016 Reinforcement

Mobile &amp; Service Robotics Mobile &amp; Service Robotics Sensors for Robotics Sensors for

Mobile &amp; Service Robotics Mobile &amp; Service Robotics Sensors for Robotics Sensors for

Mobile &amp; Service Robotics Mobile &amp; Service Robotics Sensors for Sensors for Robotics

RL Overview of topics About Reinforcement Learning The Reinforcement Learning Problem

Reinforcement Learning UMaine COS 470/570 Introduction to AI Why reinforcement learning?

Reinforcement Learning and Simulation-Based Search David Silver Reinforcement Learning and

Reinforcement Learning Reinforcement Learning Reinforcement Learning in a nutshell g Imagine

CSC2621 Topics in Robotics Reinforcement Learning in Robotics Week 4: Q-Value based RL Animesh

Outline Intro to RL and Bayesian Learning History of Bayesian RL Model-based Bayesian

Human-Oriented Robotics Octave/Matlab Tutorial Kai Arras Social Robotics Lab, University of

Robotics Engineering Prof. Michael Gennert Robotics Engineering Program Director Fall 2016

LEGO Develops a new LEGO Develops a new robotics platform - WeDo robotics platform - WeDo

Robot ics J uly 26, 2005 CS 486/ 686 Universit y of Wat erloo Out line Robot ics

CS 188: Artificial Intelligence Advanced Applications: Robotics Pieter Abbeel UC Berkeley A

Dependency Dependency- -Based Automatic Evaluation Based Automatic Evaluation Dependency

ICTIR 2016 Slides - The Impact of Fixed-Cost Pooling Strategies on Test Collection Bias

CS 354 Autonomous Robotics Particle Filters Instructors: Dr. Kevin Molloy and Dr. Nathan

Slide 1 / 38 Slide 2 / 38 1 A student throws a ball upward where the initial potential energy

Branding on a Budget Public Health Communications Webinar Series June 17, 2019 Webinar

Background memCellsF09 Allen Tanner built an SRAM/ROM generator program back in 2004 Single-

Mobile & Service Robotics Mobile & Service Robotics Sensors for Robotics Sensors for

Mobile & Service Robotics Mobile & Service Robotics Sensors for Robotics Sensors for

Mobile & Service Robotics Mobile & Service Robotics Sensors for Sensors for Robotics