Learning Domain-Independent Heuristics over Hypergraphs
William Shen, Felipe Trevizan, Sylvie Thiébaux
The Australian National University
Learn domain-independent heuristics
● Learn entirely from scratch
● Do not use hand-crafted features
  ○ e.g. Learning Generalized Reactive Policies using Deep Neural Networks [Groshev et al. 2018]
● Do not rely on existing heuristics as input features
  ○ e.g. Action Schema Networks: Generalised Policies with Deep Learning [Toyer et al. 2017]
● Do not learn an improvement for an existing heuristic
  ○ e.g. Learning heuristic functions from relaxed plans [Yoon et al. 2006]
Learn domain-independent heuristics
Generalise to:
● different initial states, goals
● different number of objects
● different domains
  ○ domains unseen during training: domain-independent!
STRIPS
● F is the set of propositions
● A is the set of actions
  ○ Each action has preconditions, add-effects & delete-effects
● I ⊆ F is the initial state
● G ⊆ F is the goal
● c is the cost function

Example: unstack(1, 2)
PRE: on(1, 2), clear(1), ...
EFF: holding(1), clear(2), ¬on(1, 2), ...
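The STRIPS components above can be sketched directly in code. This is an illustrative encoding, not from the paper: propositions are strings, and the `Action` dataclass and `apply` helper are our own names. The extra `handempty` literals follow the standard Blocksworld encoding that the slide's "..." elides.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Action:
    name: str
    pre: frozenset       # preconditions
    add: frozenset       # add-effects
    delete: frozenset    # delete-effects
    cost: float = 1.0

# The unstack(1, 2) action from the slide, standard Blocksworld encoding.
unstack_1_2 = Action(
    name="unstack(1, 2)",
    pre=frozenset({"on(1, 2)", "clear(1)", "handempty"}),
    add=frozenset({"holding(1)", "clear(2)"}),
    delete=frozenset({"on(1, 2)", "clear(1)", "handempty"}),
)

def apply(state, action):
    """Apply an action to a state (a frozenset of propositions)."""
    assert action.pre <= state, "preconditions not satisfied"
    return (state - action.delete) | action.add

state = frozenset({"on(1, 2)", "clear(1)", "handempty", "ontable(2)"})
successor = apply(state, unstack_1_2)
# successor contains holding(1) and clear(2), and no longer on(1, 2)
```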
Hypergraph for the delete relaxation
● Hyperedge: an edge that joins any number of vertices
● Delete relaxation: ignore the delete-effects of each action
● The delete relaxation P+ of a problem P can be represented by a hypergraph
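One way to sketch this representation: vertices are propositions, and each action contributes a hyperedge from its preconditions to its add-effects, with the delete-effects simply dropped. The tuple layout below is our own illustrative choice.

```python
def delete_relaxation_hypergraph(actions):
    """actions: iterable of (name, pre, add, delete, cost) tuples.
    Returns hyperedges (name, tail, head, cost): tail = preconditions,
    head = add-effects; the delete-effects are ignored (the relaxation)."""
    return [(name, frozenset(pre), frozenset(add), cost)
            for name, pre, add, _delete, cost in actions]

actions = [
    ("unstack(1, 2)",
     {"on(1, 2)", "clear(1)"},          # preconditions
     {"holding(1)", "clear(2)"},        # add-effects
     {"on(1, 2)", "clear(1)"},          # delete-effects (dropped)
     1.0),
]
edges = delete_relaxation_hypergraph(actions)
```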
h^add heuristic
● Estimates the cost of the goal as the sum of the costs of each goal proposition
● Assumes achieving each proposition is independent
  ○ Overcounting
  ○ Non-admissible!
h^max heuristic
● Estimates the cost of the goal as the cost of the most expensive goal proposition
● Admissible, but not as informative as h^add
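Both heuristics can be computed by the same fixed-point sketch over the delete relaxation; only the combination function differs (sum for h^add, max for h^max). This is a minimal illustration, assuming actions as (pre, add, cost) triples.

```python
import math

def relaxed_heuristic(state, goal, actions, combine):
    """Fixed-point proposition costs over the delete relaxation.
    combine=sum gives h^add (inadmissible); combine=max gives h^max (admissible)."""
    h = {p: 0.0 for p in state}          # propositions in the state cost 0
    changed = True
    while changed:
        changed = False
        for pre, add, cost in actions:
            if all(p in h for p in pre):                 # action reachable
                c = cost + (combine(h[p] for p in pre) if pre else 0.0)
                for p in add:
                    if c < h.get(p, math.inf):
                        h[p] = c
                        changed = True
    return combine(h.get(p, math.inf) for p in goal)
```

For a chain a -> b -> {c, d} with unit costs and goal {c, d}, h^add counts the shared step to b twice (overcounting), while h^max keeps only the most expensive goal proposition.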
Learning Heuristics over Hypergraphs
● Learn a function ⊕ which better approximates shortest paths
Learning Heuristics over Hypergraphs
● Learn a function h : hypergraph → R
Hypergraph Networks (HGN)
● Our generalisation of Graph Networks [Battaglia et al. 2018] to hypergraphs
● Hypergraph Network (HGN) Block
  ○ A powerful and flexible building block
  ○ A hypergraph-to-hypergraph mapping
  ○ Uses message passing to aggregate and update features with update/aggregation functions
Hypergraph Networks (HGN)
● Analogous to message passing
(Figure from Battaglia et al. 2018)
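A hedged sketch of one such message-passing step, following the Graph Networks formulation of Battaglia et al. 2018 adapted to hyperedges. Everything concrete here is an assumption for illustration: mean aggregation is one common choice, and single linear layers with tanh stand in for the learned update MLPs.

```python
import numpy as np

def hgn_block(vertex_feats, edge_feats, edges, W_e, W_v):
    """One message-passing step over a hypergraph.
    vertex_feats: (V, dv); edge_feats: (E, de);
    edges: list of sets of vertex indices (one set per hyperedge);
    W_e: (de, de + dv); W_v: (dv, dv + de)."""
    # 1. Each hyperedge updates from its own feature plus the
    #    aggregated features of the vertices it joins.
    new_edges = np.empty_like(edge_feats)
    for i, vs in enumerate(edges):
        agg = vertex_feats[list(vs)].mean(axis=0)
        new_edges[i] = np.tanh(W_e @ np.concatenate([edge_feats[i], agg]))
    # 2. Each vertex updates from its own feature plus the
    #    aggregated features of its incident (updated) hyperedges.
    new_verts = np.empty_like(vertex_feats)
    for v in range(len(vertex_feats)):
        inc = [new_edges[i] for i, vs in enumerate(edges) if v in vs]
        agg = np.mean(inc, axis=0) if inc else np.zeros(edge_feats.shape[1])
        new_verts[v] = np.tanh(W_v @ np.concatenate([vertex_feats[v], agg]))
    return new_verts, new_edges
```

Because the block maps a hypergraph's features to same-shaped features, blocks can be stacked or applied recurrently.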
STRIPS-HGN (figure walkthrough)
● Input: the input features and the hypergraph structure
● Encoder block: multilayer perceptrons map the input features to initial latent proposition and action features
● Core block: a recurrent message-passing block that propagates information through the hypergraph; it takes the initial and current latent features and outputs updated proposition and action features, together with a latent heuristic value
● Repeat: the core block is applied again to the updated latent features
● Decoder block: decodes the latent heuristic value into a real number
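The encode-process-decode recurrence above can be summarised in a few lines. The `encode`, `core` and `decode` arguments are placeholders for the learned blocks; feeding the initial latent features back into every core step follows the recurrent scheme sketched in the slides.

```python
def strips_hgn_forward(hypergraph, encode, core, decode, M=10):
    """Encode once, apply the core block M times, decode at every step.
    Returns the list of intermediate heuristic estimates."""
    latent0 = encode(hypergraph)          # initial latent features
    latent = latent0
    outputs = []
    for _ in range(M):
        latent = core(latent0, latent)    # message passing; also sees the
                                          # initial latents at every step
        outputs.append(decode(latent))    # intermediate heuristic value
    return outputs
```

Decoding at every step (rather than only after the last) makes it possible to supervise all intermediate outputs during training.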
Training a STRIPS-HGN
● Input features (learning from scratch)
  ○ Proposition: [proposition in current state, proposition in goal state]
  ○ Action: [cost, #preconditions, #add-effects]
● Generating training data
  ○ Run an optimal planner on a set of training problems
  ○ Use the states encountered in the optimal plans
  ○ Aim to learn the optimal heuristic value
● Train using gradient descent, treated as a regression problem
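The training setup reduces to regression by gradient descent: each state from an optimal plan is paired with its optimal cost-to-go h*, and the squared error is minimised. As a hedged illustration of that loop only, a one-dimensional linear model stands in for the network below; the real model is the STRIPS-HGN.

```python
def train(samples, lr=0.05, epochs=500):
    """samples: list of (feature, h_star) pairs.
    Minimises 0.5 * (prediction - h_star)**2 by stochastic gradient descent."""
    w, b = 0.0, 0.0
    for _ in range(epochs):
        for x, h_star in samples:
            err = (w * x + b) - h_star
            w -= lr * err * x    # d(0.5 * err**2)/dw = err * x
            b -= lr * err        # d(0.5 * err**2)/db = err
    return w, b

# Illustrative data: states whose optimal cost-to-go is h* = 2x.
samples = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]
w, b = train(samples)
```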
Experimental Results
● Evaluate using A* search
● Baseline heuristics
  ○ h^add (inadmissible); h^max, blind and Landmark Cut (admissible)
● STRIPS-HGN: h^HGN
  ○ Train and evaluate on a single CPU core
  ○ Run the core block 10 times (i.e., M = 10)
  ○ Powerful generalisation, but slower to compute
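For reference, the evaluation search is plain A*. The sketch below runs over an explicit graph with an arbitrary heuristic callable; in the actual experiments the successors come from the planning state space and h is one of the heuristics listed above. Returned costs are optimal when h is admissible (e.g. h^max, Landmark Cut), but not in general for h^add or h^HGN.

```python
import heapq
import math

def astar(start, goal, successors, h):
    """successors(s) -> iterable of (s', cost). Returns the cost of the
    cheapest path found from start to goal, or None if none exists."""
    frontier = [(h(start), 0.0, start)]      # (f, g, state)
    best_g = {start: 0.0}
    while frontier:
        f, g, s = heapq.heappop(frontier)
        if s == goal:
            return g
        if g > best_g.get(s, math.inf):
            continue                          # stale queue entry
        for t, c in successors(s):
            if g + c < best_g.get(t, math.inf):
                best_g[t] = g + c
                heapq.heappush(frontier, (g + c + h(t), g + c, t))
    return None
```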
Evaluation on domains we trained on
Training:
● Zenotravel: 10 small training problems (2-3 cities)
● Gripper: 3 small training problems (1-3 balls)
● Blocksworld: 10 small training problems (4-5 blocks)
Testing:
● Gripper: 18 larger testing problems (4-20 balls)
● Blocksworld: 100 larger testing problems (6-10 blocks)
● Train and evaluate a single network on 3 domains
● Training time: 15 min
Blocksworld (trained on)
Train on Zenotravel, Gripper & Blocksworld.
95% confidence interval shown for h^HGN over 10 repeated experiments.
Gripper (trained on)
Train on Zenotravel, Gripper & Blocksworld.
Evaluation on domains we did not train on
Training:
● Zenotravel: 10 small training problems (2-3 cities)
● Gripper: 3 small training problems (1-3 balls)
Testing:
● Blocksworld: 50 testing problems (4-8 blocks)
● Train a single network on 2 domains; evaluate on a new, unseen domain
● Training time: 10 min
Blocksworld (not trained on)
Train on Zenotravel and Gripper only.
Future Work
● Speeding up STRIPS-HGN
  ○ Slow to evaluate: the main bottleneck
  ○ Optimise the Hypergraph Networks implementation
  ○ Take advantage of multiple cores or use GPUs for parallelisation
● Improving generalisation performance
  ○ Use a richer set of input features
  ○ Careful study of the hyperparameter space, similar to [Ferber et al. 2020]
Thanks!