Making Deep Q-learning Approaches Robust to Time Discretization


  1. Making Deep Q-learning Approaches Robust to Time Discretization. Corentin Tallec, Léonard Blier, Yann Ollivier (Université Paris-Sud, Facebook AI Research). June 4, 2019. C. Tallec et al. (UPSUD, FAIR), Framerate Robust DQ Learning, June 4, 2019, slide 1 / 4

  2. Reinforcement Learning in Near-Continuous Time. What happens when standard RL methods are used with a small time discretization, i.e. a high framerate? Usual RL algorithm + high framerate → failure. Scalability is limited by the algorithms: better hardware, sensors, and actuators lead to worse performance. This contributes to the lack of robustness of deep RL: a new environment means a different framerate, which means new hyperparameters. [Figure: agent behavior at low FPS vs. high FPS]

  3. Why does near-continuous Q-learning fail? There is no continuous-time Q-learning: as δt → 0, Q^π(s, a) → V^π(s), so Q^π no longer depends on the action in the limit, and therefore cannot be used to select actions. Likewise, there is no continuous-time ε-greedy exploration. [Figure: ε-greedy trajectories with ε = 1 on the pendulum, for δt = 0.05 and δt = 0.0001]
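The exploration half of this slide can be illustrated numerically. With ε = 1, the agent takes i.i.d. random actions, so over a fixed physical horizon the state performs a random walk whose per-step displacement shrinks with δt: the standard deviation of the final state scales as √(horizon · δt) and vanishes as δt → 0. The sketch below is a hypothetical 1-D toy model (not the paper's pendulum), with made-up parameters, written only to show the scaling:

```python
import math
import random

def explore_spread(dt, horizon=1.0, n_runs=200, seed=0):
    """Simulate epsilon = 1 exploration on a 1-D toy state: each step
    applies a random action a in {-1, +1} for duration dt, moving the
    state by a * dt. Returns the empirical standard deviation of the
    final state after a fixed physical time `horizon`."""
    rng = random.Random(seed)
    n_steps = int(horizon / dt)
    finals = []
    for _ in range(n_runs):
        x = 0.0
        for _ in range(n_steps):
            x += rng.choice((-1.0, 1.0)) * dt
        finals.append(x)
    mean = sum(finals) / n_runs
    return math.sqrt(sum((x - mean) ** 2 for x in finals) / n_runs)

# Theoretical spread is sqrt(horizon * dt): coarse time steps explore,
# fine time steps barely move the state at all.
coarse = explore_spread(dt=0.05)    # theory: sqrt(0.05) ~ 0.22
fine = explore_spread(dt=0.0001)    # theory: sqrt(0.0001) = 0.01
print(coarse, fine)
```

The same collapse explains why Q-values stop discriminating between actions: a single action held for a duration δt changes the return by an O(δt) amount, so the gap Q^π(s, a) − V^π(s) shrinks to zero along with the exploration spread.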

  4. Can we solve this? YES. To find out how: Poster #32 this evening. [Figure: agent behavior at low FPS vs. high FPS]
