Evolution Strategies using TensorForce LSDPO (2017/2018) Project

Aug 16, 2023

Evolution Strategies using TensorForce LSDPO (2017/2018) Project Presentation Tudor Tiplea (tpt26)

What is TensorForce?
● Open-Source Reinforcement Learning Library
● Built on top of TensorFlow
● Provides a strict separation of agents, environments and update logic
● A number of out-of-the-box state-of-the-art RL algorithms already implemented:
  ○ A3C, DQN, Double-DQN, etc.

Why is it useful?
● Suppose you want to employ deep RL to control some aspect of your system
● Lots of resources and introductions to theoretical RL
● Also, lots of starter agents and their applications available online
● However, much of the existing code has several disadvantages. E.g.:
  ○ Tight integration with simulation platforms
  ○ Fixed network architectures
● TensorForce provides the out-of-the-box agents, but they are highly configurable
● It also employs a shift of paradigm: the environment calls out to the agent when it needs a decision rather than the other way around
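
The shift of paradigm in the last point can be illustrated with a small sketch. It is plain Python and does not use TensorForce's API: `CountingAgent`, `act`, `observe` and `run_system` are hypothetical names chosen for illustration. The point is only that the controlled system owns the loop and asks the agent for a decision when it needs one.

```python
class CountingAgent:
    """Hypothetical agent for illustration; this is not TensorForce's API."""

    def __init__(self):
        self.total_reward = 0.0
        self.decisions = 0

    def act(self, state):
        # Trivial fixed policy: push the state towards zero.
        self.decisions += 1
        return 1 if state < 0 else -1

    def observe(self, reward):
        # A learning agent would update its model here; this one only tallies.
        self.total_reward += reward


def run_system(agent, state=5, steps=10):
    """The system drives the loop and calls out to the agent for each decision."""
    for _ in range(steps):
        action = agent.act(state)    # the system asks for a decision
        state += action              # the system applies it itself
        agent.observe(-abs(state))   # the system reports the outcome
    return state


agent = CountingAgent()
final_state = run_system(agent)
```

Because the agent never steps the system itself, the same agent object could in principle be attached to a simulator or to a live system without changes.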

Evolution Strategies
● An alternative to MDP-based RL techniques such as Q-learning or Policy Gradient
● A heuristic search procedure inspired by natural evolution
● At each iteration (generation):
  ○ Perturb a population of parameter vectors
  ○ Evaluate the objective function for each
  ○ Best performing ones are recombined to form the population at the next step
● Can be scaled and parallelised between multiple workers, with limited intercommunication

Non-parallelised algorithm
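
The figure for this slide did not survive extraction. As a stand-in, here is a minimal sketch of a non-parallelised ES loop, assuming the Gaussian-perturbation, reward-weighted update from reference [2] rather than whatever exact variant the slide showed. The function name, the hyperparameter defaults and the toy objective are all assumptions made for illustration.

```python
import random


def evolution_strategy(objective, theta, sigma=0.1, alpha=0.1,
                       population=50, generations=200, seed=0):
    """Single-process ES sketch: perturb, evaluate, recombine."""
    rng = random.Random(seed)
    theta = list(theta)
    dim = len(theta)
    for _ in range(generations):
        # 1. Perturb: one Gaussian noise vector per population member.
        noise = [[rng.gauss(0.0, 1.0) for _ in range(dim)]
                 for _ in range(population)]
        # 2. Evaluate the objective at every perturbed parameter vector.
        rewards = [objective([t + sigma * e for t, e in zip(theta, eps)])
                   for eps in noise]
        # 3. Recombine: move theta along the reward-weighted sum of the noise.
        #    Subtracting the mean reward is a common variance-reduction step.
        baseline = sum(rewards) / population
        for j in range(dim):
            step = sum((r - baseline) * eps[j] for r, eps in zip(rewards, noise))
            theta[j] += alpha * step / (population * sigma)
    return theta


# Toy objective (an assumption): negative squared distance to a fixed target.
target = [0.5, -0.3]
best = evolution_strategy(
    lambda x: -sum((a - b) ** 2 for a, b in zip(x, target)), [0.0, 0.0])
```

Note that this recombination step weights every perturbation by its reward instead of selecting only the best performers; a truncation-selection ES would replace step 3.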

Work plan
● Connect the existing weight update part of the simple ES algorithm to a model, producing the first agent
● Implement the parallelised ES agent to run in multi-threaded manner on my laptop
● Evaluate the two on simple environments (due to long training time) from OpenAI Gym
● Compare against already implemented agents such as A3C and DQN
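
One possible shape for the multi-threaded agent in the work plan is sketched below. It assumes the shared-seed scheme described in reference [2]: each worker rebuilds its noise vector from a seed and returns only a scalar reward, which keeps intercommunication small. All names and defaults are hypothetical, and this is not the project's actual implementation.

```python
import random
from concurrent.futures import ThreadPoolExecutor


def noise_from_seed(seed, dim):
    """Rebuild a Gaussian noise vector from its seed alone."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, 1.0) for _ in range(dim)]


def evaluate(objective, theta, sigma, seed):
    """Worker task: perturb theta with seeded noise, return a scalar reward."""
    eps = noise_from_seed(seed, len(theta))
    return objective([t + sigma * e for t, e in zip(theta, eps)])


def parallel_es(objective, theta, sigma=0.1, alpha=0.1, population=50,
                generations=200, workers=4, seed=0):
    theta = list(theta)
    master = random.Random(seed)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        for _ in range(generations):
            seeds = [master.getrandbits(32) for _ in range(population)]
            snapshot = list(theta)  # workers read a fixed copy of theta
            rewards = list(pool.map(
                lambda s: evaluate(objective, snapshot, sigma, s), seeds))
            baseline = sum(rewards) / population
            # Only (seed, reward) pairs are needed to apply the update.
            for r, s in zip(rewards, seeds):
                scale = alpha * (r - baseline) / (population * sigma)
                for j, e in enumerate(noise_from_seed(s, len(theta))):
                    theta[j] += scale * e
    return theta


target = [0.5, -0.3]
best = parallel_es(
    lambda x: -sum((a - b) ** 2 for a, b in zip(x, target)), [0.0, 0.0])
```

In CPython, threads do not speed up a CPU-bound pure-Python objective because of the global interpreter lock, so the threads here mainly show the communication pattern. The same (seed, reward) exchange would carry over to processes or to separate machines.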

Possible extensions
● First, set up an EC2 instance using a student account
● Evaluate the implemented agents in more complex environments, such as Atari 2600 games
● Extend the parallelised ES agent to run in a distributed manner, across multiple machines
● Evaluate the distributed ES agent

Questions
Thank you!

References
[1] TensorForce: https://github.com/reinforceio/tensorforce
[2] Evolution Strategies as a Scalable Alternative to Reinforcement Learning: https://arxiv.org/abs/1703.03864
