CSC2542 Bernhard Nebel, and Jussi Rintanen. SAT-Based Planning For - PDF document

Acknowledgements Some of the slides used in this course are modifications of Dana Nau’s lecture slides for the textbook Automated Planning, licensed under the Creative Commons Attribution-NonCommercial-ShareAlike License: http://creativecommons.org/licenses/by-nc-sa/2.0/ Other slides are modifications of slides developed by Malte Helmert, CSC2542 Bernhard Nebel, and Jussi Rintanen. SAT-Based Planning For this topic, some slides come from Henry Kautz, Ulrich Scholz, and Yiqiao Wang. Sheila McIlraith I have also used some material prepared by Dan Weld, P@trick Haslum Department of Computer Science and Rao Kambhampati. University of Toronto I would like to gratefully acknowledge the contributions of these Fall 2010 researchers, and thank them for generously permitting me to use aspects of their presentation material. 1 2 Segue Motivation � The problem of finding a valid plan from the planning graph � Propositional satisfiability (SAT): can be encoded on any combinatorial substrate Given a boolean formula � Alternatives: e.g., (P ∨ Q) ∧ ( ¬ Q ∨ R ∨ S) ∧ ( ¬ R ∨ ¬ P) , � CSP [GP-CSP – Do & Kambhampati, 2000] Does there exist a model � SAT [Blackbox; SATPLAN – Kautz & Selman, 1996+] i.e., an assignment of truth values to the propositions � ASP [Son et al] that makes the formula true? � IP [Vossen et al] � This was the first problem shown to be NP-complete. � Lots of research on algorithms for solving SAT. � This is the notion of “Translation to General Problem Solver” � Key idea behind SAT-based planning: that we discussed in our first technical lecture. � Translate classical planning problems into satisfiability problems, and solving them using a highly optimized SAT Here we discuss SAT as the combinatorial substrate. solver. 3 4

Basic Approach Evolution of SAT-based planners � Suppose a plan of length n exists � The success of this approach has largely been the result of impressive advances in the proficiency of � Encode this hypothesis in SAT SAT solvers. � Initial state is true at t 0 � A continued limiting factor to this approach is the size of � Goal is true at t n the CNF encoding of some problems. � Actions imply effects, etc � Thus, a key challenge to this approach has been how to � Look for satisfying assignment encode the planning problem effectively. Such � Decode into plan encodings have marked the evolution of SAT-based planners. 5 6 History History… …History (enter SAT-based planners)… 1992 Satplan “approach” (Kautz & Selman ECAI-92) 1969 Plan synthesis as theorem proving (Green IJCAI-69) � � � convention for encoding STRIPS-style linear planning in axiom 1971 STRIPS (Fikes & Nilsson AIJ-71) � schema Decades of work on “specialized theorem provers” � � Didn’t appear practical . . . Rapid progress on SAT solving � 1996 (Kautz & Selman AAAI-96) (Kautz, McAllester & Selman KR-96) � � Electrifying results (on hand coded formulae) � Key technical advance: parallel encodings where noninterfering actions could occur at the same time (i.e., Graphplan ideas) (but no compiler) 1997 MEDIC (Ernst et al. IJCAI-97) � � First complete implementation of Satplan (with compiler) 1998 Blackbox (Kautz & Selman AIPS98 workshop) � � Also performed mutex propagation before generating encoding . . . 7 8

…History (IPC)…. Outline 1998 IPC-1 Blackbox performance comparable to the best � Encoding planning problems as satisfiability problems � 2000 IPC-2 Blackbox performance abysmal (Graphplan-style planners � � Extracting plans from truth values dominated) � Satisfiability algorithms 2002 IPC-3 No SAT-based planners entered � � Combining satisfiability with planning graphs 2004 IPC-4 Satplan04 was clear winner of “optimal propositional planners” � � Blackbox & SatPlan 2006 IPC-5 Satplan06 & Maxplan* (Chen Xing & Zhang IJCAI-07) dominated** � What accounts for the success in 2004 and 2006? 1) Huge advances in SAT solvers 2000-2004 (e.g., Seige, ZChaff) (indeed in 2004 they ran out of time and didn’t include mutex propagation) 2) New competition problems that were “intrinsically hard” * Also a SAT-based planner ** dominated the “optimal planners” track. Note however that in the so-called “satisficing planners” track, e.g. the heuristic-search based planners that could not guarantee optimal length, satificing planners were able to solve much larger problems! 9 10 The SATPLAN Approach* The SATPLAN Approach* Overall Approach � A bounded planning problem is a pair ( P,n ): � P is a planning problem; n is a positive integer axiom problem � Any solution for P of length n is a solution for ( P,n ) instantiate schemas description instantiated propositional clauses � Planning algorithm: length � Do iterative deepening as we did with Graphplan: mapping � for n = 0, 1, 2, …, � encode ( P,n ) as a satisfiability problem Φ � if Φ is satisfiable, then satisfying SAT plan interpret � From the set of truth values that satisfies Φ , a model engine(s) solution plan can be constructed, return it and exit. * Terminology: “SATPLAN approach” (circa 1992) vs. the SATPLAN planner of 2004, 2006 etc., the successor of Blackbox. 11 12

Notation Fluents � For satisfiability problems we need to use propositional logic � If π = 〈 a 0 , a 1 , …, a n –1 〉 is a solution for ( P,n ), it generates these states: � Need to encode ground atoms into propositions s 0 , s 1 = γ ( s 0 , a 0 ), s 2 = γ ( s 1 , a 1 ), …, s n = γ ( s n –1 , a n –1 ) � For set-theoretic planning we encoded atoms into propositions by rewriting them as shown here: � Fluent: proposition saying a particular atom is true in a particular � Atom: at(r1,loc1) state, e.g., � Proposition: at-r1-loc1 � at(r1,loc1, i ) is a fluent that’s true iff at(r1,loc1) is in s i � For planning as satisfiability we’ll do the same thing � But we won’t bother to do a syntactic rewrite � We’ll use l i to denote the fluent for literal l in state s i � Just use at(r1,loc1) itself as the proposition � e.g., if l = at(r1,loc1) then l i = at(r1,loc1, i ) � Also, we’ll write plans starting at a 0 rather than a 1 � π = 〈 a 0 , a 1 , …, a n –1 〉 � a i is a fluent saying that a is the i’ th step of π � e.g., if a = move(r1,loc2,loc1) then a i = move(r1,loc2,loc1, i ) 13 14 Formulae in Φ Encoding Planning Problems � Encode ( P,n ) as a formula Φ such that Formula describing the initial state : � . { l 0 | l ∈ s 0 } ∧ . { ¬ l 0 | l ∈ L – s 0 } π = 〈 a 0 , a 1 , …, a n –1 〉 is a solution for ( P,n ) if and only if There is a satisfying assignment for Φ such that fluents Formula describing the goal : � a 0 , …, a n –1 are true . { l n | l ∈ g + } ∧ . { ¬ l n | l ∈ g – } For every action a in A , formulae describing what changes a would make � � Let if it were the i ’th step of the plan: � a i ⇒ . { p i | p ∈ Precond( a )} ∧ . { e i +1 | e ∈ Effects( a )} � A = {all actions in the planning domain} � S = {all states in the planning domain} Complete exclusion axiom: � � L = {all literals in the language} � For all actions a and b , formulas saying they can’t occur at the same time ¬ a i ∨ ¬ b i Φ is the conjunct of many other formulas … � � this guarantees there can be only one action at a time (i.e., a sequential plan. This is revisted in the blackbox encoding later. Is this enough? � 15 16

Frame Axioms Example � Planning domain: � Frame axioms : � one robot r1 � Formulas describing what doesn’t change between steps i and i +1 � two adjacent locations l1, l2 � Several ways to write these � one operator (move the robot) � Encode ( P,n ) where n = 1 � One way: explanatory frame axioms � One axiom for every literal l � Initial state: {at(r1,l1)} � Says that if l changes between s i and s i +1 , at(r1,l1,0) ∧ ¬ at(r1,l2,0) Encoding: then the action at step i must be responsible: � Goal: {at(r1,l2)} ( ¬ l i ∧ l i +1 ⇒ V a in A { a i | l ∈ effects + ( a )}) at(r1,l2,1) ∧ ¬ at(r1,l1,1) Encoding: ∧ ( l i ∧ ¬ l i +1 ⇒ V a in A { a i | l ∈ effects – ( a )}) � Operator: see next slide 17 18 Example (continued) Example (continued) Operator: move(r,l,l’) � precond: at(r,l) Solution: Add typing of parameters effects: at(r,l’), ¬ at(r,l) Encoding: Locations: l1, l2 � move(r1,l1,l2,0) ⇒ at(r1,l1,0) ∧ at(r1,l2,1) ∧ ¬ at(r1,l1,1) Robots: r1 � move(r1,l2,l1,0) ⇒ at(r1,l2,0) ∧ at(r1,l1,1) ∧ ¬ at(r1,l2,1) move(r1,l1,l1,0) ⇒ at(r1,l1,0) ∧ at(r1,l1,1) ∧ ¬ at(r1,l1,1) contradictions Operator: move(r : robot, l : location, l’ : location) � move(r1,l2,l2,0) ⇒ at(r1,l2,0) ∧ at(r1,l2,1) ∧ ¬ at(r1,l2,1) (easy to detect) precond: at(r,l) move(l1,r1,l2,0) ⇒ … effects: at(r,l’), ¬ at(r,l) move(l2,l1,r1,0) ⇒ … nonsensical move(l1,l2,r1,0) ⇒ … Encoding: move(l2,l1,r1,0) ⇒ … move(r1,l1,l2,0) ⇒ at(r1,l1,0) ∧ at(r1,l2,1) ∧ ¬ at(r1,l1,1) move(r1,l2,l1,0) ⇒ at(r1,l2,0) ∧ at(r1,l1,1) ∧ ¬ at(r1,l2,1) How to avoid generating the last four actions? � � Assign data types to the constant symbols 19 20

CSC2542 Bernhard Nebel, and Jussi Rintanen. SAT-Based Planning For - PDF document

Acknowledgements Some of the slides used in this course are modifications of Dana Naus lecture slides for the textbook Automated Planning, licensed under the Creative Commons Attribution-NonCommercial-ShareAlike License:

CSC2542 Representations for (Classical) Planning Sheila McIlraith Department of Computer

CSC2542 Planning-Graph Techniques Sheila McIlraith Department of Computer Science University of

CSC2542 Introduction to Planning Sheila McIlraith Department of Computer Science University of

CSC2542 Paper Presentations and Critiques Sheila McIlraith Department of Computer Science

CSC2542 SAT-Based Planning Sheila McIlraith Department of Computer Science University of

CSC2542 Paper Presentations and Critiques Sheila McIlraith Department of Computer Science

CSC2542 Introduction to Planning Sheila McIlraith Department of Computer Science University of

CSC2542 Domain-Customized Planning Sheila McIlraith Department of Computer Science University

CSC2542 State-Space Planning Sheila McIlraith Department of Computer Science University of

CSC2542 SAT-Based Planning Sheila McIlraith Department of Computer Science University of

CSC2542 State-Space Planning Sheila McIlraith Department of Computer Science University of

CSC2542 Class Project Sheila McIlraith Department of Computer Science University of Toronto

CSC2542 Topics in Knowledge Representation & Reasoning: Automated Planning & Reasoning

CSC2542 Planning-Graph Techniques The lecture in 2 weeks will be given by our TA, Christian

CSC2542 Domain-Customized Planning Sheila McIlraith Department of Computer Science University

CSC2542 Bernhard Nebel, and Jussi Rintanen. Introduction to Planning I have also used some

Summary of What We Know Definition (Polynomial Time Reducibility) We write A P B iff there is

Recursion and Induction: Boolean Satisfiability; Davis-Putnam Procedure Greg Plaxton Theory in

Prolog, Resolution and Logic Programming Alan Williams Room 2.107 email: alanw@cs.man.ac.uk

Decision Procedures An Algorithmic Point of View Part I Decision Procedures for Propositional

Co Combining the k-CN CNF and XOR Phase se-Transitions Jeffrey M. Dudek , Kuldeep S. Meel,

NP -completeness Lecture in INF4130 Department of Informatics November 1st, 2018 Recap from

Elimination Techniques In Modern Propositional Logic Reasoning Norbert Manthey

SAT Preprocessing for MUS Extraction and MaxSAT Anton Belov Joint work with: Matti J arvisalo,

CSC2542 Bernhard Nebel, and Jussi Rintanen. SAT-Based Planning For - PDF document

Acknowledgements Some of the slides used in this course are modifications of Dana Naus lecture slides for the textbook Automated Planning, licensed under the Creative Commons Attribution-NonCommercial-ShareAlike License:

CSC2542 Representations for (Classical) Planning Sheila McIlraith Department of Computer

CSC2542 Planning-Graph Techniques Sheila McIlraith Department of Computer Science University of

CSC2542 Introduction to Planning Sheila McIlraith Department of Computer Science University of

CSC2542 Paper Presentations and Critiques Sheila McIlraith Department of Computer Science

CSC2542 SAT-Based Planning Sheila McIlraith Department of Computer Science University of

CSC2542 Paper Presentations and Critiques Sheila McIlraith Department of Computer Science

CSC2542 Introduction to Planning Sheila McIlraith Department of Computer Science University of

CSC2542 Domain-Customized Planning Sheila McIlraith Department of Computer Science University

CSC2542 State-Space Planning Sheila McIlraith Department of Computer Science University of

CSC2542 SAT-Based Planning Sheila McIlraith Department of Computer Science University of

CSC2542 State-Space Planning Sheila McIlraith Department of Computer Science University of

CSC2542 Class Project Sheila McIlraith Department of Computer Science University of Toronto

CSC2542 Topics in Knowledge Representation &amp; Reasoning: Automated Planning &amp; Reasoning

CSC2542 Planning-Graph Techniques The lecture in 2 weeks will be given by our TA, Christian

CSC2542 Domain-Customized Planning Sheila McIlraith Department of Computer Science University

CSC2542 Bernhard Nebel, and Jussi Rintanen. Introduction to Planning I have also used some

Summary of What We Know Definition (Polynomial Time Reducibility) We write A P B iff there is

Recursion and Induction: Boolean Satisfiability; Davis-Putnam Procedure Greg Plaxton Theory in

Prolog, Resolution and Logic Programming Alan Williams Room 2.107 email: alanw@cs.man.ac.uk

Decision Procedures An Algorithmic Point of View Part I Decision Procedures for Propositional

Co Combining the k-CN CNF and XOR Phase se-Transitions Jeffrey M. Dudek , Kuldeep S. Meel,

NP -completeness Lecture in INF4130 Department of Informatics November 1st, 2018 Recap from

Elimination Techniques In Modern Propositional Logic Reasoning Norbert Manthey

SAT Preprocessing for MUS Extraction and MaxSAT Anton Belov Joint work with: Matti J arvisalo,

CSC2542 Topics in Knowledge Representation & Reasoning: Automated Planning & Reasoning