Symbolic Plans as High-Level Instructions for Reinforcement Learning - PowerPoint PPT Presentation

May 23, 2023 •446 likes •600 views

Symbolic Plans as High-Level Instructions for Reinforcement Learning Len Illanes , Xi Yan, Rodrigo Toro Icarte, Sheila A. McIlraith ICAPS 2020 1 What is this presentation about? We want to tell an RL agent to do a specific task We

Symbolic Plans as High-Level Instructions for Reinforcement Learning León Illanes , Xi Yan, Rodrigo Toro Icarte, Sheila A. McIlraith ICAPS 2020 1
What is this presentation about? ● We want to tell an RL agent to do a specific task ● We want declarative task specification... ○ like planning! ● ...without having a full description of the environment. ○ like RL! Combine them? 2
Why use RL? ● Impressive results in low-level control problems ○ e.g., Rubik’s cube manipulated by a robot hand ● Applicable without a given model ○ and without trying to learn one ...and why avoid it? ● Can be extremely inefficient ○ will need millions of training steps ● Is hard to use correctly! ○ specifying a reward is hard ○ value alignment problem 3
Why use AI Planning? ● It’s very efficient! ● Given a model, specifying new tasks is easy ...and why avoid it? ● Needs a model 4
A simple idea ● Use high-level model to define a task ○ Construct a high-level plan ○ Let RL deal with the low-level details ● Best of both worlds? 5
Our contributions ● Defined a new type of RL problem: Taskable RL ○ augments RL environments with high-level propositional symbols ○ this allows for easy representation of final-state goal problems ● Built a system to leverage symbolic models ○ high-level actions are used to identify options for hierarchical RL ○ learned option policies can be immediately transferred to new tasks ○ high-level plans are used as instructions, improving sample efficiency ● Showed that the approach is sound ○ Theoretically; when models are built properly ○ Empirically on some simple RL environments 6
Taskable RL Environments ● 〈 S , A , r , p , 𝛿 〉 is an MDP ● P is a set of propositions ● L : S → 2 P is a labelling function ● R ∈ ℝ is the goal reward parameter 7
Plans as High-Level Instructions ● Given a model, we can find plans ● Given a plan, we can try to execute it ○ Learn low-level policies for planning actions ● Issues: ○ Suboptimality ■ Dealt with by partial-order planning ○ Unexpected outcomes (bad models, bad policies, etc.) ■ Execution monitoring 8
Experiments and results - The Office World 9
10
11
12
13
Other experiments - The Minecraft World 14
Summary ● Defined Taskable RL , a new type of RL problem ● Built a system that leverage symbolic models ● Showed that the approach is sound and effective 15

Recommend

Decidability Decidability and Symbolic Symbolic Verification Symbolic Symbolic Verification

Decidability Decidability and Symbolic Symbolic Verification Symbolic Symbolic Verification Verification Verification Kim G. Larsen Kim G. Larsen Aalborg Aalborg University Aalborg Aalborg University University DENMARK University, ,

682 views • 46 slides

UN High UN High UN High UN High- - - -Level Meeting on TB Level Meeting on TB Level Meeting

UN High UN High UN High UN High- - - -Level Meeting on TB Level Meeting on TB Level Meeting on TB Level Meeting on TB Berlin, 18 May 2017 Berlin, 18 May 2017 Berlin, 18 May 2017 Berlin, 18 May 2017 Lucica Ditiu and Greg Paton, Stop TB

668 views • 20 slides

Chapter 17 Employee Benefits: Retirement Plans Fundamentals of Private Retirement Plans

3/27/2015 Chapter 17 Employee Benefits: Retirement Plans Fundamentals of Private Retirement Plans Defined Contribution Plans Defined Benefit Plans Section 401(k) Plans Section 403(b) Plans Profit-sharing Plans

538 views • 12 slides

Hierarchical Exact Symbolic Analysis y y of Large Analog Integrated Circuits By Symbolic Stamps

Hierarchical Exact Symbolic Analysis y y of Large Analog Integrated Circuits By Symbolic Stamps Symbolic Stamps Hui Xu, Guoyong Shi and Xiaopeng Li School of Microelectronics, Shanghai Jiao Tong Univ. Shanghai, China Presentation at Asia

655 views • 36 slides

Lazy Heap Analysis with Symbolic Memory Graphs Alexander Driemeyer Outline 1. Motivation 2.

Lazy Heap Analysis with Symbolic Memory Graphs Alexander Driemeyer Outline 1. Motivation 2. CPAchecker and Symbolic Memory Graphs 3. Abstractions of Symbolic Memory Graphs 4. Using counterexample guided abstraction refinement with Symbolic

412 views • 24 slides

Symbolic data analysis Symbolic data analysis Clustering of large data sets of mixed units

Symbolic data analysis V. Batagelj Symbolic data analysis Symbolic data analysis Clustering of large data sets of mixed units Clustering and optimization Leaders method Vladimir Batagelj Agglomerative method Examples IMFM Ljubljana

871 views • 33 slides

CS 478 - Tools for Machine Learning and Data Mining Symbolic Clustering - COBWEB Symbolic

COBWEB CS 478 - Tools for Machine Learning and Data Mining Symbolic Clustering - COBWEB Symbolic Clustering - COBWEB CS 478 - Tools for Machine Learning and Data Mining COBWEB COBWEB Overview Symbolic approach to category formation.

250 views • 7 slides

Neural-Symbolic Integration Strategies Neural-Symbolic Integration Unification Hybrid

Neural-Symbolic Integration Strategies Neural-Symbolic Integration Unification Hybrid Strategies Systems Neuronal Connectionist Hybrid Hybrid by Modeling Logic Systems by Translation Function Neural-Symbolic Learning Systems CILP:

670 views • 37 slides

Symbolic Execution of Linux binaries About Symbolic Execution Dynamically explore all

A tool for the Symbolic Execution of Linux binaries About Symbolic Execution Dynamically explore all program branches. Inputs are considered symbolic variables. Symbols remain uninstantiated and become constrained at execution

451 views • 24 slides

Cognitive Modeling Symbolic School Lecture 2: Approaches Symbolic Models 2 Symbolic

Approaches to Cognitive Modeling Approaches to Cognitive Modeling Symbolic Models Symbolic Models Connectionist Models Connectionist Models Hybrid Models Hybrid Models Cognitive Architectures Cognitive Architectures Approaches to Cognitive

534 views • 7 slides

Formal Verification Methods 2: Symbolic Simulation John Harrison Intel Corporation

Formal Verification Methods 2: Symbolic Simulation Formal Verification Methods 2: Symbolic Simulation John Harrison Intel Corporation Simulation Symbolic and ternary simulation BDDs Quaternary lattice Symbolic trajectory

335 views • 14 slides

Symbolic Execution: Applications Symbolic execution is widely used in practice. Tools based on

Symbolic Execution: Applications Symbolic execution is widely used in practice. Tools based on symbolic execution have found serious errors and security vulnerabilities in various systems: Network servers File systems Device drivers

733 views • 40 slides

Symbolic Mathematics Dr. Mihail November 20, 2018 (Dr. Mihail) Symbolic November 20, 2018 1 /

Symbolic Mathematics Dr. Mihail November 20, 2018 (Dr. Mihail) Symbolic November 20, 2018 1 / 16 Overview Symbolic So far in this course we dealt with MATLAB variables that were placeholders for numeric types (e.g., scalars, vectors,

953 views • 17 slides

Symbolic execution as search, and the rise of solvers Search and SMT Symbolic execution is

Symbolic execution as search, and the rise of solvers Search and SMT Symbolic execution is appealingly simple and useful , but computationally expensive ! We will see how the effective use of symbolic execution boils down to a kind of

494 views • 24 slides

Outline 2.1 Assembly language program structure 2.2 Data transfer instructions 2.3 Arithmetic

Outline 2.1 Assembly language program structure 2.2 Data transfer instructions 2.3 Arithmetic instructions 2.4 Branch and loop instructions 2.5 Shift and rotate instructions 2.6 Boolean logic instructions 2.7 Bit test and manipulate

337 views • 23 slides

Symbolic execution for binary-level security / 50 3 A number of shades of symbolic execution /

Symbolic execution for binary-level security / 50 3 A number of shades of symbolic execution / / Sbastien Bardin & Richard Bonichon 20180409 CEA LIST 1 1,1,L Model Source qb int foo (int t ) { 0,1,L 0,1,L int y = t * t - 4 * t ;

568 views • 34 slides

Smart and Adaptive Cyber-Physical Systems Chapters 1,2 Cyber-Physical Systems Smart mobility

Smart and Adaptive Cyber-Physical Systems Chapters 1,2 Cyber-Physical Systems Smart mobility Smart factory Smart grid Smart XX Smart health care Smart city But what does it mean to be smart ?

1.79k views • 102 slides

Paraconsistent Relational Model: A Quasi-Classic Logic Approach 1 Authors: Badrinath Jayakumar*

Paraconsistent Relational Model: A Quasi-Classic Logic Approach 1 Authors: Badrinath Jayakumar* and Rajshekhar Sunderraman Institute: Georgia State University, Georgia, USA Corresponding author: bjayakumar2@cs.gsu.edu Overview 2

318 views • 30 slides

s r r r

s r r r trt t ts r

706 views • 58 slides

Disclosures Advisory board for healthfinch (HIT start-up) 1 10/26/2015 Agenda

10/26/2015 Joy in Practice: Reconnecting with the Meaning and Mission of our Work 19 th Annual Management of the Hospitalized Patient UCSF Christine A Sinsky, MD, FACP Oct 15., 2015 Disclosures Advisory board for healthfinch (HIT

693 views • 34 slides

Learning Prof. Kuan-Ting Lai 2019/7/2 Deep Learning a new Buzzword 2 AI Papers 3

Introduction to Deep Learning Prof. Kuan-Ting Lai 2019/7/2 Deep Learning a new Buzzword 2 AI Papers 3 Registration of NIPS 4 AL/ML Investement 5 Source: Sand Hill Econometrics 6 Source: Sand Hill Econometrics 7 AlphaGo 8 So,

1.15k views • 81 slides

Type Synonyms What if I want to call int * int *

Type Synonyms What if I want to call int * int * int a date? Sec$on 2 CSE341 type date = int * int * int Type Synonyms Type Synonym

365 views • 3 slides

Lesson 8 Vocabulary & Anti synonym Different words with synonym similar meanings

Lesson 8 Vocabulary & Anti synonym Different words with synonym similar meanings Different words with synonym similar meanings antonym Different words with synonym similar meanings Different words with

1.31k views • 55 slides

Design = To plan or organize Synonym = plan Design is essentially the opposite of chance.

DESIGN BASICS ChAptEr 1 - DESIGN proCESS Design = To plan or organize Synonym = plan Design is essentially the opposite of chance. The design process involves seeking visual solutions to problems. Creative Visual Problem Solving =

343 views • 19 slides