Interactive Reinforcement Learning Human Generated Reward

Mar 12, 2024

Interactive Reinforcement Learning Human Generated Reward Presentation for Summer Camp 2015 May 25 2015
Reinforcement Learning • Trial and error learning • Explore and exploit Sutton and Barto 1988 • Represent, predict and control • Connect actions with rewards • Maximize future reward
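The bullets above (explore and exploit, connect actions with rewards, maximize future reward) can be illustrated with a minimal sketch of tabular Q-learning with epsilon-greedy action selection. The five-state chain environment, the function names and every parameter value are assumptions made for illustration; none of them come from the slides.

```python
import random


def q_learning_chain(n_states=5, episodes=500, alpha=0.1, gamma=0.9,
                     epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 1-D chain.

    The agent starts in state 0 and receives reward 1 only when it
    reaches the last state. Actions: 0 = move left, 1 = move right.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]
    for _ in range(episodes):
        s = 0
        for _ in range(100):  # step cap so an episode cannot run forever
            if rng.random() < epsilon:
                a = rng.randrange(2)  # explore: random action
            else:
                best = max(q[s])  # exploit: best known action, random tie-break
                a = rng.choice([i for i in range(2) if q[s][i] == best])
            s_next = max(0, s - 1) if a == 0 else s + 1
            done = s_next == n_states - 1
            r = 1.0 if done else 0.0
            # Connect the action with reward: move Q(s, a) toward the
            # immediate reward plus the discounted best future value.
            target = r if done else r + gamma * max(q[s_next])
            q[s][a] += alpha * (target - q[s][a])
            s = s_next
            if done:
                break
    return q


def greedy_policy(q):
    """Return the highest-valued action for each state."""
    return [max(range(2), key=lambda a: row[a]) for row in q]


if __name__ == "__main__":
    q_table = q_learning_chain()
    print(greedy_policy(q_table)[:-1])  # policy for the non-terminal states
```

The final row of the table belongs to the terminal state and is never updated, so only the first `n_states - 1` entries of the policy are meaningful.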
Interactive Machine Learning Fails and Olsen Jr. 2003
Human Generated Reward • Humans know more! • Shaping systems to adapt Knox and Stone 2012 • Effectively reward learning • Transfer learning through collaboration • How can RL harness human reward?
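One possible answer to the closing question on this slide is to let the agent learn a model of the reward a person gives and act greedily on that model. The sketch below is only a toy illustration of that idea and is not the method of any paper cited on the slides. The `simulated_trainer` function is a hypothetical stand-in for a real person, and the chain environment, names and parameter values are all assumptions.

```python
import random


def simulated_trainer(action):
    """Hypothetical stand-in for a human: approves moving right (+1),
    disapproves moving left (-1)."""
    return 1.0 if action == 1 else -1.0


def learn_from_human_reward(n_states=5, steps=200, alpha=0.2,
                            epsilon=0.1, seed=0):
    """Learn an estimate h[s][a] of the human's reward on a 1-D chain.

    Actions: 0 = move left, 1 = move right. The agent restarts at
    state 0 whenever it reaches the last state.
    """
    rng = random.Random(seed)
    h = [[0.0, 0.0] for _ in range(n_states)]
    s = 0
    for _ in range(steps):
        if rng.random() < epsilon:
            a = rng.randrange(2)  # occasional exploration
        else:
            best = max(h[s])  # otherwise follow predicted human reward
            a = rng.choice([i for i in range(2) if h[s][i] == best])
        feedback = simulated_trainer(a)
        # Move the estimate toward the feedback that was just received.
        h[s][a] += alpha * (feedback - h[s][a])
        s = max(0, s - 1) if a == 0 else s + 1
        if s == n_states - 1:
            s = 0  # goal reached, restart
    return h


if __name__ == "__main__":
    model = learn_from_human_reward()
    print([max(range(2), key=lambda a: row[a]) for row in model[:-1]])
```

Unlike the environment reward in standard reinforcement learning, the feedback here already tells the agent whether an action was good, so this sketch does not propagate value back from future states.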
Learning from Shaping Learning from Advice Kuhlmann et al. 2004 Blumberg et al. 2002 Learning from Demonstration Left: Argall et al. 2010 Right: Koenemann et al. 2014 Thomaz et al. 2006
Learning from Trial and Error Learning from Refinement Levine et al. 2015 Cakmak et al. 2012
Application • Shared control • Augmented representation • Integrate human and non-human interaction • Autonomous prosthetics
