Statistics and Samples in Distributional Reinforcement Learning

Mark Rowland, Robert Dadashi, Saurabh Kumar, Rémi Munos, Marc G. Bellemare, Will Dabney
ICML 2019
Google Research, Brain team

Distributional Reinforcement Learning
Distributional RL aims to learn full return distributions.
Key objects: the return distribution, the distributional Bellman operator, and the distributional Bellman equation [Bellemare et al., 2017].
Statistics and Samples in Distributional Reinforcement Learning — MARK ROWLAND
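For reference, the three objects named on this slide are usually written as follows in the distributional RL literature [Bellemare et al., 2017]. The notation is an assumption made here, not taken from the slide: $\pi$ is the policy, $\gamma$ the discount factor, $R_t$ the reward at step $t$, $(X', A')$ the random next state-action pair, and $\eta^\pi(x, a)$ the law of the random return $Z^\pi(x, a)$.

```latex
% Random return and return distribution
Z^\pi(x, a) = \sum_{t \ge 0} \gamma^t R_t \quad \text{given } X_0 = x,\ A_0 = a,
\qquad
\eta^\pi(x, a) = \mathrm{Law}\big(Z^\pi(x, a)\big)

% Distributional Bellman equation (equality in distribution)
Z^\pi(x, a) \overset{D}{=} R(x, a) + \gamma\, Z^\pi(X', A')

% Distributional Bellman operator, with \eta^\pi as its fixed point
(\mathcal{T}^\pi \eta)(x, a) = \mathrm{Law}\big(R(x, a) + \gamma\, Z(X', A')\big),
\quad Z(x', a') \sim \eta(x', a'),
\qquad
\mathcal{T}^\pi \eta^\pi = \eta^\pi
```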

Distributional Reinforcement Learning
In practice, we often work with parametric approximate distributions.
● Non-parametric
● Categorical [Bellemare et al., 2017]
● Dirac deltas [Dabney et al., 2018]

Main Contribution: An Alternative Perspective
Distributional RL algorithms learn statistical functionals of the return distribution.
● Moments, tail probabilities, expectations, etc.
Theory: What properties of return distributions can be learnt through dynamic programming?
Algorithmic: A general framework for approximate learning of statistics.

A General Framework for Distributional RL Algorithms
[Diagram: current statistics → imputation strategy → imputed samples → Bellman-updated distribution → Bellman-updated statistics]
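The pipeline named on this slide can be illustrated with a minimal sketch. It assumes the statistics are quantiles at midpoint levels and the transition is a single deterministic one with a known reward and discount. All function names are hypothetical and chosen for illustration. This is not the authors' implementation, only one simple instance of the impute, update, re-extract loop.

```python
def impute_samples(quantile_values):
    # Imputation strategy (assumed here): for quantile statistics, treat each
    # quantile value as one equally weighted sample (a mixture of Diracs).
    return list(quantile_values)


def bellman_update(samples, reward, gamma):
    # Apply the sample-wise Bellman backup z -> reward + gamma * z.
    return [reward + gamma * z for z in samples]


def extract_quantiles(samples, k):
    # Read off k quantiles at midpoint levels tau_i = (2i + 1) / (2k)
    # from equally weighted samples.
    ordered = sorted(samples)
    n = len(ordered)
    stats = []
    for i in range(k):
        tau = (2 * i + 1) / (2 * k)
        index = min(int(tau * n), n - 1)
        stats.append(ordered[index])
    return stats


def framework_step(next_state_stats, reward, gamma):
    # current statistics -> imputed samples -> Bellman-updated samples
    # -> Bellman-updated statistics
    samples = impute_samples(next_state_stats)
    updated = bellman_update(samples, reward, gamma)
    return extract_quantiles(updated, len(next_state_stats))


if __name__ == "__main__":
    print(framework_step([0.0, 1.0, 2.0], reward=1.0, gamma=0.5))
```

For quantiles the imputation step is trivial, because the statistics can themselves be read as samples. For other statistics, such as expectiles, the imputation step has to find samples consistent with the current statistics, which is where the choice of imputation strategy matters.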

Application: Expectiles
We apply this framework to learn expectiles of return distributions.
New deep RL agent: Expectile Regression DQN (ER-DQN), with improved mean performance on Atari-57 relative to QR-DQN.
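As background, the τ-expectile of a distribution is the value e that minimises the asymmetric squared loss E[|τ − 1{Z < e}| (Z − e)²], and the 1/2-expectile is the mean. The sketch below computes an expectile of a finite, equally weighted sample by iterating the first-order condition. The function name and its parameters are hypothetical. It illustrates the statistic only and is not ER-DQN training code.

```python
def expectile(samples, tau, tol=1e-12, max_iter=1000):
    """Return the tau-expectile of equally weighted samples, for 0 < tau < 1."""
    if not 0.0 < tau < 1.0:
        raise ValueError("tau must lie strictly between 0 and 1")
    if not samples:
        raise ValueError("samples must be non-empty")

    # Start from the mean, which is exactly the 0.5-expectile.
    e = sum(samples) / len(samples)
    for _ in range(max_iter):
        # First-order condition: sum_i w_i * (x_i - e) = 0, where
        # w_i = tau for samples above e and 1 - tau otherwise.
        weights = [tau if x > e else 1.0 - tau for x in samples]
        new_e = sum(w * x for w, x in zip(weights, samples)) / sum(weights)
        if abs(new_e - e) <= tol:
            return new_e
        e = new_e
    return e


if __name__ == "__main__":
    data = [0.0, 1.0, 2.0, 3.0, 4.0]
    for level in (0.1, 0.5, 0.9):
        print(level, expectile(data, level))
```

Larger values of τ weight the samples above e more heavily, so the expectile moves toward the upper tail as τ increases.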

Summary
● A new perspective on distributional RL
● Theoretical progress on what it is possible to learn
● A general framework for distributional RL algorithms

THANK YOU Poster #113
