landmarks revisited
play

Landmarks Revisited Silvia Richter 1 Malte Helmert 2 Matthias - PowerPoint PPT Presentation

Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Landmarks Revisited Silvia Richter 1 Malte Helmert 2 Matthias Westphal 2 1 Griffith University & NICTA, Australia 2


  1. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Landmarks Revisited Silvia Richter 1 Malte Helmert 2 Matthias Westphal 2 1 Griffith University & NICTA, Australia 2 Albert-Ludwigs-Universit¨ at Freiburg, Germany AAAI 2008

  2. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Outline Introduction to SAS + Planning 1 Landmarks in Previous Work 2 Using Landmarks as Pseudo-Heuristic 3 Extended Landmark Generation 4

  3. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Outline Introduction to SAS + Planning 1 Landmarks in Previous Work 2 Using Landmarks as Pseudo-Heuristic 3 Extended Landmark Generation 4

  4. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation SAS + planning task SAS + planning task: Π = �V , A , s 0 , s ⋆ � V : state variables with finite domain D v Fact: variable-value pair v �→ d ( v ∈ V , d ∈ D v ) State: variable assignment for all v ∈ V A : actions � pre , eff � , with pre , eff fact sets Action a = � pre , eff � applicable in state s if pre ⊆ s Applying a in s updates s s 0 : initial state s ⋆ : partial variable assignment called the goal Sequence of actions π a plan iff s ⋆ ⊆ s 0 [ π ].

  5. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Encoding of example task A p B C E o D t V = { v o , v t , v p } D v o = { A , B , C , D , E , t , p } D v t = { A , B , C , D } , D v p = { C , E } A = { drive-t-D-B , load-o-t-B , . . . } = { v o �→ B , v t �→ D , v p �→ E } s 0 s ⋆ = { v o �→ E }

  6. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Encoding of example task A p B C E o D t V = { v o , v t , v p } D v o = { A , B , C , D , E , t , p } D v t = { A , B , C , D } , D v p = { C , E } A = { drive-t-D-B , load-o-t-B , . . . } = { v o �→ B , v t �→ D , v p �→ E } s 0 s ⋆ = { v o �→ E }

  7. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Encoding of example task A p B C E o D t V = { v o , v t , v p } D v o = { A , B , C , D , E , t , p } D v t = { A , B , C , D } , D v p = { C , E } A = { drive-t-D-B , load-o-t-B , . . . } = { v o �→ B , v t �→ D , v p �→ E } s 0 s ⋆ = { v o �→ E }

  8. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Encoding of example task A p B C E o D t V = { v o , v t , v p } D v o = { A , B , C , D , E , t , p } D v t = { A , B , C , D } , D v p = { C , E } A = { drive-t-D-B , load-o-t-B , . . . } = { v o �→ B , v t �→ D , v p �→ E } s 0 s ⋆ = { v o �→ E }

  9. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Encoding of example task A p B C E o D t V = { v o , v t , v p } D v o = { A , B , C , D , E , t , p } D v t = { A , B , C , D } , D v p = { C , E } A = { drive-t-D-B , load-o-t-B , . . . } = { v o �→ B , v t �→ D , v p �→ E } s 0 s ⋆ = { v o �→ E }

  10. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Encoding of example task A p B C E o D t V = { v o , v t , v p } D v o = { A , B , C , D , E , t , p } D v t = { A , B , C , D } , D v p = { C , E } A = { drive-t-D-B , load-o-t-B , . . . } = { v o �→ B , v t �→ D , v p �→ E } s 0 s ⋆ = { v o �→ E } o-at-E

  11. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Encoding of example task cont’d A p B C E o D t load-o-t-B : � Pre = { v o �→ B , v t �→ B } , Eff = { v o �→ t }�

  12. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Preferred Operators t-at-D o-at-B ... drive-t-D-B t-at-B ... o-at-B ... load-o t-at-B o-in-t ... Improvement of heuristic search approaches (Helmert 2006) Idea: prefer actions that are likely to improve heuristic value E. g. those which are part of plan for simplified problem

  13. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Outline Introduction to SAS + Planning 1 Landmarks in Previous Work 2 Using Landmarks as Pseudo-Heuristic 3 Extended Landmark Generation 4

  14. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Landmarks in Previous Work Facts that must be true in every plan (Porteous et al. 2001 & 2002; Hoffmann et al. 2004) Intuitively helpful to direct seach Automatically found, incl. orderings o-at-B t-at-B o-in-t A p t-at-C B C E p-at-C o-at-C o D o-in-p t o-at-E

  15. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Landmarks in Previous Work cont’d o-at-B t-at-B Find landmarks by backchaining o-in-t Every goal is a landmark If B is landmark and all actions that first t-at-C achieve B have A as precondition, then p-at-C o-at-C A is a landmark Approximation with RPGs: consider o-in-p all achievers “possibly before” B o-at-E (Porteous et al. 2002) Disjunctive landmarks also possible: (o-in-p 1 ∨ o-in-p 2 )

  16. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Landmarks in Previous Work cont’d o-at-B t-at-B Use as subgoals, then simply concatenate plans of subtasks o-in-t (“LM-local”) t-at-C Greatly speeds up search p-at-C o-at-C in many domains But: bad-quality plans, incomplete o-in-p (dead ends) o-at-E Any base planner possible for subtasks

  17. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Outline Introduction to SAS + Planning 1 Landmarks in Previous Work 2 Using Landmarks as Pseudo-Heuristic 3 Extended Landmark Generation 4

  18. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Landmark Heuristic + Preferred Operators Novel usage of landmarks Pseudo-Heuristic = #landmarks that still need to be achieved Take orderings into account (see paper for details) Preferred operators = landmark-achieving operators or operators in relaxed plan to nearest landmark Combination with other heuristics through multi-heuristic BFS (Helmert 2006) Experiments with several heuristics (FF, CG, blind) on all tasks from past planning competitions

  19. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Results: %Tasks solved (Average) Algorithm Base Heuristic base LM-local LM-heur FF heuristic 87 82 88 CG heuristic 74 66 87 blind heuristic 25 52 84 Note: updated results for LM-local With all 3 heuristics, LM-heur dominates other approaches LM-local worse than base with CG and blind heuristic (dead ends in 8 domains) FF-heuristic: base and LM-local are close. . .

  20. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Results: %Tasks solved (Average) Algorithm Base Heuristic base LM-local LM-heur FF heuristic 82 87 88 CG heuristic 74 66 87 blind heuristic 25 52 84 With all 3 heuristics, LM-heur dominates other approaches LM-local worse than base with CG and blind heuristic (dead ends in 8 domains) FF-heuristic: base and LM-local are close. . .

  21. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Results: #Tasks solved exclusively (FF heuristic) FF heuristic Domain base LM-heur Airport (50) 2 6 Depot (22) 0 2 Freecell (80) 1 3 Logistics-1998 (35) 0 2 Miconic-FullADL (150) 2 0 MPrime (35) 0 3 Mystery (30) 0 1 Pathways (30) 1 2 Philosophers (48) 0 2 Pipesworld-NoTankage (50) 0 2 Pipesworld-Tankage (50) 1 5 Schedule (150) 0 1 Storage (30) 1 0 Total 12 25 LM-heur solves twice as many tasks exclusively as base

  22. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Outline Introduction to SAS + Planning 1 Landmarks in Previous Work 2 Using Landmarks as Pseudo-Heuristic 3 Extended Landmark Generation 4

  23. Introduction to SAS + Planning Landmarks in Previous Work Using Landmarks as Pseudo-Heuristic Extended Landmark Generation Extended Landmark Generation Adapted previous procedures to SAS + planning Admit disjunctive landmarks Find additional landmarks through DTGs

Recommend


More recommend