Deep Reinforcement Learning with a Natural Language Action Space

Aug 17, 2022

Authors: Ji He, Jianshu Chen, Xiaodong He, Jianfeng Gao, Lihong Li, Li Deng and Mari Ostendorf
Presented by: Victor Ge
Background
Motivation
● How to do credit assignment when the action space is discrete and potentially unbounded.
● E.g., human-computer dialog systems, tutoring systems, and text-based games.
Q-learning architectures
Deep Reinforcement Relevance Network (DRRN)
● Factorize the DQN into a state representation and an action representation.
● Interaction function: can be an inner product, a bilinear operation, a nonlinear function, etc.
● In experiments, the inner product and the bilinear operation give similar results.
● Using a nonlinear function (e.g., a DNN) as the interaction function degrades performance.
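The factorization above can be illustrated with a minimal sketch. It is not the authors' implementation: the layer sizes, the tanh activation, the random weights, and every name in it (`dense_tanh`, `W_state`, `W_action`, `B`) are assumptions chosen only to show how separate state and action embeddings are combined by an inner product or a bilinear form into one Q-value per candidate action.

```python
import math
import random


def make_matrix(rows, cols, rng):
    """Small random weight matrix (illustrative initialization only)."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def dense_tanh(weights, vec):
    """One hidden layer: tanh(W @ vec). The activation is an assumption."""
    return [math.tanh(sum(w * x for w, x in zip(row, vec))) for row in weights]


def inner_product(state_vec, action_vec):
    """Interaction function 1: Q(s, a) = <h_s, h_a>."""
    return sum(s * a for s, a in zip(state_vec, action_vec))


def bilinear(state_vec, matrix, action_vec):
    """Interaction function 2: Q(s, a) = h_s^T B h_a."""
    return sum(
        s * sum(b * a for b, a in zip(row, action_vec))
        for s, row in zip(state_vec, matrix)
    )


rng = random.Random(0)
VOCAB, HIDDEN = 6, 4                       # hypothetical sizes
W_state = make_matrix(HIDDEN, VOCAB, rng)   # state-side network
W_action = make_matrix(HIDDEN, VOCAB, rng)  # action-side network (separate weights)
B = make_matrix(HIDDEN, HIDDEN, rng)        # bilinear interaction matrix

# Bag-of-words counts for one state text and three candidate action texts.
state_bow = [1, 0, 2, 0, 0, 1]
action_bows = [
    [0, 1, 0, 0, 1, 0],
    [1, 0, 0, 1, 0, 0],
    [0, 0, 1, 0, 0, 1],
]

state_vec = dense_tanh(W_state, state_bow)
action_vecs = [dense_tanh(W_action, a) for a in action_bows]

# One Q-value per candidate action; the number of candidates can vary per state.
q_inner = [inner_product(state_vec, a) for a in action_vecs]
q_bilinear = [bilinear(state_vec, B, a) for a in action_vecs]
print(q_inner)
print(q_bilinear)
```

Because the action text is embedded by its own network, the same model can score any number of candidate actions per state, which is the point of factorizing state and action.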
Details
● Bag-of-words text embedding
● 1-2 hidden layers
● Experience replay buffer
● Softmax action selection
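Softmax action selection can be sketched as follows. The slide's formula is not preserved in this transcript, so this is a generic softmax over Q-values rather than the presenters' exact expression; the scaling factor `alpha` and the function names `softmax_probs` and `select_action` are assumptions made for illustration.

```python
import math
import random


def softmax_probs(q_values, alpha=1.0):
    """Turn Q-values into action probabilities: p_i proportional to exp(alpha * Q_i).

    alpha is an assumed scaling factor: larger values make the policy
    greedier, alpha = 0 gives uniform exploration.
    """
    scaled = [alpha * q for q in q_values]
    m = max(scaled)                      # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def select_action(q_values, alpha=1.0, rng=None):
    """Sample an action index according to the softmax probabilities."""
    rng = rng or random.Random()
    probs = softmax_probs(q_values, alpha)
    return rng.choices(range(len(q_values)), weights=probs, k=1)[0]


q_values = [1.0, 2.0, 0.5]               # hypothetical Q-values for three actions
probs = softmax_probs(q_values, alpha=1.0)
action = select_action(q_values, alpha=1.0, rng=random.Random(0))
print(probs, action)
```

Sampling from the softmax, instead of always taking the highest Q-value, keeps some exploration among the candidate text actions while still favoring those with higher estimated value.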
Experiments – text-based games
● Parser-based games can be reduced to choice-based games if the parser accepts only a finite number of phrases.
● Human baselines:
  ● "Saving John": -5.5
  ● "Machine of Death": 16.0
Experiments – paraphrased actions
● Question: Is DRRN just memorizing the right action?
● State space is small (<1000).
● Replace 81.4% of action descriptions with human-paraphrased descriptions.
● Standard 4-gram BLEU score between paraphrased and original actions is 0.325.
● DRRN gets an average reward of 10.5 on the paraphrased game vs. 11.2 on the original "Machine of Death" game.
