Meta Reinforcement Learning Chelsea Finn Why are humans so good at - PowerPoint PPT Presentation

Jan 06, 2024 •615 likes •1.04k views

Meta Reinforcement Learning Chelsea Finn Why are humans so good at RL? People have prior experience. People have an existing representation of the world. Can we learn a representation under which RL is fast? Key idea : Explicitly optimize for

Meta Reinforcement Learning Chelsea Finn
Why are humans so good at RL? People have prior experience. People have an existing representation of the world. Can we learn a representation under which RL is fast? Key idea : Explicitly optimize for such a representation “Learn how to reinforcement learn”
Outline Meta-RL Problem Formulation & Examples Method Classes: Recurrent Models, Gradient-Based Models Challenges & Latest Developments
The Meta-Learning Problem Supervised Learning: Inputs: Outputs: Data: Meta Supervised Learning: Inputs: Outputs: Data: { Why is this view useful? Reduces the problem to the design & optimization of f . Finn & Levine. Meta-learning and Universality: Deep Representation… ICLR 2018
Example: Few-Shot Classification Given 1 example of 5 classes: Classify new examples test set training data meta-training training classes … … diagram adapted from Ravi & Larochelle ‘17
Meta-RL Example: Maze Navigation Given a small amount of experience Learn to solve the task By learning how to learn many other tasks: … diagram adapted from Duan et al. ‘17
The Meta Reinforcement Learning Problem Reinforcement Learning: Inputs: Outputs: Data: Meta Reinforcement Learning: Inputs: Outputs: Data: { dataset of datasets k rollouts from collected for each task *and* collecting appropriate data Design & optimization of f (learning to explore) Finn. Learning to Learn with Gradients. PhD Thesis 2018
Meta-RL Example: Maze Navigation Given a small amount of experience Learn to solve the task [meta] test time [meta] train time By learning how to learn many other tasks: meta-training … tasks diagram adapted from Duan et al. ‘17
The Meta Reinforcement Learning Problem Meta Reinforcement Learning: Inputs: Outputs: { Episodic Variant k rollouts from Inputs: Outputs: { Online Variant 1…k timesteps from
Outline Meta-RL Problem Formulation & Examples Method Classes: Recurrent Models, Gradient-Based Models Challenges & Latest Developments

Recommend

Reinforcement Learning AIMA Chapters: 21.1, 21.2, 21.3. Sutton and Barto, Reinforcement Learning:

Reinforcement Learning Reinforcement Learning AIMA Chapters: 21.1, 21.2, 21.3. Sutton and Barto, Reinforcement Learning: an Introduction, 2nd Edition: Chapters 6 (6.1 6.5) Outline Reinforcement Learning Reinforcement Learning: the

589 views • 27 slides

Meta- Meta -Programming with Programming with Modelica Modelica for Meta- for Meta

Meta- Meta -Programming with Programming with Modelica Modelica for Meta- for Meta -Modeling and Modeling and Model Transformations Model Transformations Peter Fritzson, Adrian Pop Peter Fritzson, Adrian Pop OpenModelica Course, 2007

382 views • 14 slides

Reinforcement Learning Timothy Chou Charlie Tong Vincent Zhuang April 19, 2016 Reinforcement

Reinforcement Learning Q-Learning Deep Q-Learning on Atari Reinforcement Learning Timothy Chou Charlie Tong Vincent Zhuang April 19, 2016 Reinforcement Learning Q-Learning Deep Q-Learning on Atari Table of Contents Reinforcement Learning

939 views • 63 slides

RL Overview of topics About Reinforcement Learning The Reinforcement Learning Problem

Introduction to Reinforcement Learning RL Overview of topics About Reinforcement Learning The Reinforcement Learning Problem Inside an RL agent Temporal difference learning Many faces of Reinforcement Learning What is

552 views • 35 slides

Reinforcement Learning UMaine COS 470/570 Introduction to AI Why reinforcement learning?

Reinforcement Learning<br/><br/> 4/25/19, 8*06 PM Reinforcement Learning<br/><br/> 4/25/19, 8*06 PM Reinforcement Learning UMaine COS 470/570 Introduction to AI Why reinforcement learning? Spring 2019 Created:

372 views • 15 slides

Reinforcement Learning and Simulation-Based Search David Silver Reinforcement Learning and

Reinforcement Learning and Simulation-Based Search Reinforcement Learning and Simulation-Based Search David Silver Reinforcement Learning and Simulation-Based Search Outline 1 Reinforcement Learning 2 Simulation-Based Search 3 Planning Under

425 views • 20 slides

Reinforcement Learning Reinforcement Learning Reinforcement Learning in a nutshell g Imagine

Reinforcement Learning Reinforcement Learning Reinforcement Learning in a nutshell g Imagine playing a new game whose rules you dont know; after a hundred or so moves your don t know; after a hundred or so moves, your opponent announces, You

512 views • 30 slides

Safe Reinforcement Learning Philip S. Thomas Stanford CS234: Reinforcement Learning, Guest

Safe Reinforcement Learning Philip S. Thomas Stanford CS234: Reinforcement Learning, Guest Lecture May 24, 2017 Lecture overview What makes a reinforcement learning algorithm safe ? Notation Creating a safe reinforcement learning

1.43k views • 88 slides

Meta Reinforcement Learning as Task Inference Jan Humplik, Alexandre Galashov, Leonard

Meta Reinforcement Learning as Task Inference Jan Humplik, Alexandre Galashov, Leonard Hasenclever, Pedro A.Ortega, Yee Whye Teh, Nicholas Heess Topic: Bayesian RL Presenter: Ram Ananth Why meta Reinforcement Learning? First Wave of Deep

912 views • 63 slides

Meta Reinforcement Learning Kate Rakelly 11/13/19 Questions we seek to answer Motivation : What

Meta Reinforcement Learning Kate Rakelly 11/13/19 Questions we seek to answer Motivation : What problem is meta-RL trying to solve? Context : What is the connection to other problems in RL? Solutions : What are solution methods for meta-RL and

788 views • 52 slides

Bayesian Model-Agnostic Meta-Learning Taesup Kim* (presenter), Jaesik Yoon* Ousmane Dia,

Bayesian Model-Agnostic Meta-Learning Taesup Kim* (presenter), Jaesik Yoon* Ousmane Dia, Sungwoong Kim, Yoshua Bengio, Sungjin Ahn Model-Agnostic Meta-learning (MAML) gradient-based meta-learning framework meta-update task adaptation

887 views • 22 slides

META Seal of Recognition and META Prize Award Ceremony Georg Rehm (DFKI) on behalf of the

META Seal of Recognition and META Prize Award Ceremony Georg Rehm (DFKI) on behalf of the META Technology Council and the META-NET Executive Board

811 views • 28 slides

CS885 Reinforcement Learning Module 2: June 6, 2020 Maximum Entropy Reinforcement Learning

CS885 Reinforcement Learning Module 2: June 6, 2020 Maximum Entropy Reinforcement Learning Haarnoja, Tang et al. (2017) Reinforcement Learning with Deep Energy Based Policies, ICML . Haarnoja, Zhou et al. (2018) Soft Actor-Critic: Off-Policy

684 views • 24 slides

Introduction to Reinforcement Learning Kevin Chen and Zack Khan Lecture 1: Introduction to

Lecture 1: Introduction to Reinforcement Learning Introduction to Reinforcement Learning Kevin Chen and Zack Khan Lecture 1: Introduction to Reinforcement Learning Outline 1. Course Logistics 2. What is Reinforcement Learning? 3.

930 views • 67 slides

Meta Learning Shengchao Liu Background Meta Learning (AKA Learning to Learn) A

Meta Learning Shengchao Liu Background Meta Learning (AKA Learning to Learn) A fast-learning algorithm: quickly adapted from the source tasks to the target tasks Key terminologies Support Set & Query Set C-Way K-Shot

2k views • 41 slides

Introduction to Reinforcement Learning and Q-Learning Skyler Seto (ss3349) May 2, 2016 Skyler

Reinforcement Learning and Markov Decision Process Q-Learning Q-Learning Convergence Introduction to Reinforcement Learning and Q-Learning Skyler Seto (ss3349) May 2, 2016 Skyler Seto (ss3349) Introduction to Reinforcement Learning and

565 views • 27 slides

Slides for Lecture 27 ENEL 353: Digital Circuits Fall 2013 Term Steve Norman, PhD, PEng

Slides for Lecture 27 ENEL 353: Digital Circuits Fall 2013 Term Steve Norman, PhD, PEng Electrical & Computer Engineering Schulich School of Engineering University of Calgary 13 November, 2013 slide 2/19 ENEL 353 F13 Section 02

333 views • 19 slides

D Answer Answer D .003 m/s D .003 m/s [This object is a pull tab] Slide 4 / 38 Slide 4

Slide 1 / 38 Slide 2 / 38 New Jersey Center for Teaching and Learning Progressive Science Initiative Forces & Motion This material is made freely available at www.njctl.org and is intended for the non-commercial use of students and

494 views • 13 slides

Matsya Systems Next generation Underwater Ship Hull Cleaning Customer Pain Points Biofouling

Matsya Systems Next generation Underwater Ship Hull Cleaning Customer Pain Points Biofouling occurs on ships, despite advances like anti-biofouling paint Causes increase in fuel consumption by 5-10% (mild growth) 10-25% (within 4-6 months)

1.19k views • 63 slides

Evolutionary Search Knowledge Reservoir Set of possible solutions Gleaning a reservoir of

global maximum local maxima local maxima Evolutionary Search Knowledge Reservoir Set of possible solutions Gleaning a reservoir of knowledge om interactions ith the environment. Selection Fitness dependent number of o

483 views • 8 slides

Snail Express Conquering the Chaos of Mail User Feedback Retrieving Mail = Easy Opening

Red B Snail Express Conquering the Chaos of Mail User Feedback Retrieving Mail = Easy Opening Mail = 50% Unopened Bills and Clutter = Stress How It Works Scan Open Extract Unfold Scan Extract Vacuum + Shake

473 views • 10 slides

Snail deterrent properties of a soot based flexible superhydrophobic surface Nicasio Geraldi,

Snail deterrent properties of a soot based flexible superhydrophobic surface Nicasio Geraldi, Robert Morris, Glen McHale and Michael Newton Introduction Snails enjoying eating the leaves of many garden plants including food crops.

425 views • 13 slides

2020 MS. KELLYS SIXTH GRADE GLOBAL THINKERS STUDENT OF THE WEEK: STUDENT OF THE WEEK FOR

INSTRUCTION WEEK OF MAY 4 TH 2020 MS. KELLYS SIXTH GRADE GLOBAL THINKERS STUDENT OF THE WEEK: STUDENT OF THE WEEK FOR APRIL 27 TH 2020: SAMANTA! FOR ALWAYS ATTENDING ZOOM SESSIONS AND WORKING HARD TO SUBMIT WORK EACH DAY LAST WEEK!

763 views • 30 slides

D Answer 600 m/s C 0.003 m/s D [This object is a pull tab] Slide 4 / 40 2 A blimp travels

Slide 1 / 40 Slide 2 / 40 8th Grade Forces and Motion Study Guide www.njctl.org Slide 3 / 40 1 A snail travels 10 m in 3000 seconds. What is the snail's average speed? 60000 m/s A 0.02 m/s B 600 m/s C 0.003 m/s D Slide 3 (Answer) /

474 views • 26 slides