Incremental and Non-incremental Learning of Control Knowledge for - PowerPoint PPT Presentation

1 Incremental and Non-incremental Learning of Control Knowledge for Planning Daniel Borrajo Mill´ an joint work with Manuela Veloso, Ricardo Aler, and Susana Fern´ andez Universidad Carlos III de Madrid Avda. de la Universidad, 30. 28911 Madrid, SPAIN Web: http://scalab.uc3m.es/ ∼ dborrajo

2 Incremental and Non-incremental Learning of Control Knowledge for Planning 1. Motivation 2. Incremental learning. hamlet 3. Learning by genetic programming. evock 4. Discussion

3 Motivation Motivation for hamlet Control knowledge learning techniques that worked well for linear planning, had problems in nonlinear planning

3 Motivation Motivation for hamlet Control knowledge learning techniques that worked well for linear planning, had problems in nonlinear planning ebl generated over-general or over-specific control knowledge sometimes they required domain axioms utility and expensive chunk problems

3 Motivation Motivation for hamlet Control knowledge learning techniques that worked well for linear planning, had problems in nonlinear planning ebl generated over-general or over-specific control knowledge sometimes they required domain axioms utility and expensive chunk problems Pure inductive techniques did not use available domain knowledge: difficulty to focus on what is important required powerful representation mechanisms beyond attribute-value: predicate logic ( ilp ) huge hypothesis spaces very difficult to search without the use of learning heuristics

4 Motivation Our solution Incremental approach Learning task: Given: a domain theory, a set of training problems (it might be empty), a set of initial control rules (usually empty), and a set of parameters (quality metric, learning time bound, modes, . . . ) Output: a set of control rules that “efficiently” solves test problems generating “good quality” solutions

4 Motivation Our solution Incremental approach Learning task: Given: a domain theory, a set of training problems (it might be empty), a set of initial control rules (usually empty), and a set of parameters (quality metric, learning time bound, modes, . . . ) Output: a set of control rules that “efficiently” solves test problems generating “good quality” solutions Main idea: Uses ebl for acquiring control rules from problem solving traces Uses relational induction (in the spirit of version spaces) to generalize and specialize control rules

5 Incremental and Non-incremental Learning of Control Knowledge for Planning 1. Motivation 2. Incremental learning. hamlet 3. Learning by genetic programming. evock 4. Discussion

6 Hybrid Learning. hamlet Planning architecture. prodigy Integrated architecture for non-linear problem solving and learning Means-ends analysis with bidirectional search Control knowledge learning for efficiency Prodigy/EBL Static Dynamic Alpine Prodigy/Analogy Hamlet Observe Planner Quality Experiment Apprentice Control knowledge learning Domain knowledge acquisition for quality

7 Hybrid Learning. hamlet prodigy search tree 1 Choose a goal1 goal g goal Choose an operator operator 1 o operator Choose binding binding bindings 1 b Decide to reduce differences (apply) 2 or continue exploring apply operator subgoal (subgoal) apply operator subgoal goal1 goal g 3 4

8 Hybrid Learning. hamlet Incremental learning. hamlet Quality HAMLET Metric Analytical Learning Learning Mode Inductive Optimality Learning parameter Learned heuristics Problems (control rules) Control Domain PRODIGY

9 Hybrid Learning. hamlet Example of control rule (control-rule select-operators-unload-airplane (if (current-goal (at < object > < location1 > )) (true-in-state (at < object > < location2 > )) (true-in-state (loc-at < location1 > < city1 > )) (true-in-state (loc-at < location2 > < city2 > )) (type-of-object < object > object) (type-of-object < location1 > location)) (then select operator unload-airplane))

9 Hybrid Learning. hamlet Example of control rule (control-rule select-operators-unload-airplane (if (current-goal (at < object > < location1 > )) (true-in-state (at < object > < location2 > )) (true-in-state (loc-at < location1 > < city1 > )) (true-in-state (loc-at < location2 > < city2 > )) (type-of-object < object > object) (type-of-object < location1 > location)) (then select operator unload-airplane)) Difficulties: variables have to be bound to different values (cities) constants have to be of a specific type ( object and location1 ) there are conditions that might not relate to the goal regression ( loc-at )

10 Hybrid Learning. hamlet Target concepts representation (control-rule name (control-rule name (if (and (current-operator operator-name ) (if (current-goal goal-name ) (current-goal goal-name ) [(prior-goals ( literal ∗ ))] [(prior-goals ( literal ∗ ))] (true-in-state literal ) ∗ (true-in-state literal ) ∗ (other-goals ( literal ∗ )) (other-goals ( literal ∗ )) (type-of-object object type ) ∗ ) (type-of-object object type ) ∗ )) (then select operators operator-name )) (then select bindings bindings )) (control-rule name (control-rule name (if (and (applicable-op operator ) (if (and (target-goal literal ) [(prior-goals ( literal ∗ ))] [(prior-goals ( literal ∗ ))] (true-in-state literal ) ∗ (true-in-state literal ) ∗ (other-goals ( literal ∗ )) (other-goals ( literal ∗ )) (type-of-object object type ) ∗ )) (type-of-object object type ) ∗ )) (then decide { apply | sub-goal } )) (then select goals literal ))

11 Hybrid Learning. hamlet Analytical learning The Bounded Explanation module ( ebl ) extracts positive examples of the decisions made from the search trees generates control rules from them selecting their preconditions

11 Hybrid Learning. hamlet Analytical learning The Bounded Explanation module ( ebl ) extracts positive examples of the decisions made from the search trees generates control rules from them selecting their preconditions Target concepts: select an unachieved goal select an operator to achieve some goal select bindings for an operator when trying to achieve a goal decide to apply an operator for achieving a goal or subgoal on an unachieved goal

11 Hybrid Learning. hamlet Analytical learning The Bounded Explanation module ( ebl ) extracts positive examples of the decisions made from the search trees generates control rules from them selecting their preconditions Target concepts: select an unachieved goal select an operator to achieve some goal select bindings for an operator when trying to achieve a goal decide to apply an operator for achieving a goal or subgoal on an unachieved goal hamlet considers multiple target concepts, each one being a disjunction of conjunctions (partially solves the utility problem)

12 Hybrid Learning. hamlet Example of logistics problem C1 C3 A PL1 PL2 C2

13 Hybrid Learning. hamlet Example of search tree done *finish* *finish*() at−object(A,C2) unload−airplane unload−truck unload−airplane(A,PL1,C2) unload−airplane(A,PL2,C2) inside−airplane(A,PL1) inside−airplane(A,PL2) load−airplane load−airplane load−airplane(A,PL1,C1) load−airplane(A,PL2,C1) at−airplane(PL1,C1) LOAD−AIRPLANE(A,PL2,C1) fly−airplane at−airplane(PL2,C2) fly−airplane(PL1,C3,C1) fly−airplane FLY−AIRPLANE(PL1,C3,C1) fly−airplane(PL2,C1,C2) LOAD−AIRPLANE(A,PL1,C1) FLY−AIRPLANE(PL2,C1,C2) at−airplane(PL1,C2) UNLOAD−AIRPLANE(A,PL2,C1) fly−airplane fly−airplane(PL1,C1,C2) FLY−AIRPLANE(PL1,C1,C2) UNLOAD−AIRPLANE(A,PL1,C1)

14 Hybrid Learning. hamlet Learning for plan length done *finish* *finish*() at−object(A,C2) unload−airplane unload−truck unload−airplane(A,PL1,C2) unload−airplane(A,PL2,C2) inside−airplane(A,PL1) inside−airplane(A,PL2) load−airplane load−airplane load−airplane(A,PL1,C1) load−airplane(A,PL2,C1) at−airplane(PL1,C1) LOAD−AIRPLANE(A,PL2,C1) fly−airplane at−airplane(PL2,C2) fly−airplane(PL1,C3,C1) fly−airplane FLY−AIRPLANE(PL1,C3,C1) fly−airplane(PL2,C1,C2) LOAD−AIRPLANE(A,PL1,C1) FLY−AIRPLANE(PL2,C1,C2) at−airplane(PL1,C2) UNLOAD−AIRPLANE(A,PL2,C1) fly−airplane fly−airplane(PL1,C1,C2) FLY−AIRPLANE(PL1,C1,C2) UNLOAD−AIRPLANE(A,PL1,C1)

15 Hybrid Learning. hamlet Learning for quality done *finish* *finish*() at−object(A,C2) unload−airplane unload−truck unload−airplane(A,PL1,C2) unload−airplane(A,PL2,C2) inside−airplane(A,PL1) inside−airplane(A,PL2) load−airplane load−airplane load−airplane(A,PL1,C1) load−airplane(A,PL2,C1) at−airplane(PL1,C1) LOAD−AIRPLANE(A,PL2,C1) 20 fly−airplane at−airplane(PL2,C2) fly−airplane(PL1,C3,C1) fly−airplane 300 FLY−AIRPLANE(PL1,C3,C1) fly−airplane(PL2,C1,C2) LOAD−AIRPLANE(A,PL1,C1) 20 FLY−AIRPLANE(PL2,C1,C2) 600 at−airplane(PL1,C2) UNLOAD−AIRPLANE(A,PL2,C1) 20 fly−airplane 640 fly−airplane(PL1,C1,C2) 200 FLY−AIRPLANE(PL1,C1,C2) 20 UNLOAD−AIRPLANE(A,PL1,C1) 540

Incremental and Non-incremental Learning of Control Knowledge for - PowerPoint PPT Presentation

1 Incremental and Non-incremental Learning of Control Knowledge for Planning Daniel Borrajo Mill an joint work with Manuela Veloso, Ricardo Aler, and Susana Fern andez Universidad Carlos III de Madrid Avda. de la Universidad, 30. 28911

Incremental Garbage Collection Part II Roland Schatz Incremental Garbage Collection p.1/22

Batch-Incremental vs. Instance-Incremental Learning in Dynamic and Evolving Data Jesse Read 1 ,

Incremental Classification: First Step into Lifelong Learning PAN Xinyu MMLab, Department of IE

Industrial Robots Industrial Robots Control Control Part 1 Control Control Part 1 Part 1

Extent- -based Incremental Identification based Incremental Identification Extent of Reaction

Efficient Incremental Dynamic Invariant Detection Jeff Perkins and Michael Ernst MIT CSAIL Page

ENTSOG: 5 th Stakeholder Joint Working Session for the Incremental Proposal 8 April 2014 5th SJWS

RCGC is naturally incremental, how about making it concurrent Review Incremental mark-sweep

Incremental Construction Cost Incremental Construction Cost Analysis for New Homes Robin Snyder,

Incremental SAT Library Integration using Abstract Stobjs Sol Swords Centaur Technology, Inc.

Incremental Consistency Guarantees For Replicated Objects Rachid Guerraoui, Matej Pavlovic,

Incremental Change of Software Taxonomy of Evolution Changes Incremental change (IC)

Incremental Event Calculus for Run-Time Reasoning Efthimis Tsilionis, Alexander Artikis, Georgios

Riemannian Walk for Incremental Learning: Understanding Forgetting and Intransigence Arslan

Incremental learning of motion primitives for full body motions Dana Kuli c Department of

Paper Presentation Insights for incremental learning Zheng Shou 03-04-2015 1 Overview 1.

Apprent entices iceship hip Ass sses essme sment nt: what training providers need to know

What is a Degree or Higher Apprenticeship? Huw Morgan, Keele University Jane Tunnicliff,

Apprenticeship Funding Content Context to apprenticeship reforms Reforms and the apprenticeship

Early Years Workforce & Qualifications The

AB 86: Adult Education Los Angeles Regional Adult Education Consortium All Districts Faculty

Mary ary Kay ay Shaf afer Local Conservation Corps Grant Program Supervisor, CalRecycle J im

The Modern Apprenticeship Route Linda Clinton East Renfrewshire Council Developing Young

Title Apprentice Transition Plan Staff Briefing Subtitle Subtitle What is the Apprentice

Sambuz

Useful Links

Newsletter

Mail Us

Incremental and Non-incremental Learning of Control Knowledge for - PowerPoint PPT Presentation

1 Incremental and Non-incremental Learning of Control Knowledge for Planning Daniel Borrajo Mill an joint work with Manuela Veloso, Ricardo Aler, and Susana Fern andez Universidad Carlos III de Madrid Avda. de la Universidad, 30. 28911

Incremental Garbage Collection Part II Roland Schatz Incremental Garbage Collection p.1/22

Batch-Incremental vs. Instance-Incremental Learning in Dynamic and Evolving Data Jesse Read 1 ,

Incremental Classification: First Step into Lifelong Learning PAN Xinyu MMLab, Department of IE

Industrial Robots Industrial Robots Control Control Part 1 Control Control Part 1 Part 1

Extent- -based Incremental Identification based Incremental Identification Extent of Reaction

Efficient Incremental Dynamic Invariant Detection Jeff Perkins and Michael Ernst MIT CSAIL Page

ENTSOG: 5 th Stakeholder Joint Working Session for the Incremental Proposal 8 April 2014 5th SJWS

RCGC is naturally incremental, how about making it concurrent Review Incremental mark-sweep

Incremental Construction Cost Incremental Construction Cost Analysis for New Homes Robin Snyder,

Incremental SAT Library Integration using Abstract Stobjs Sol Swords Centaur Technology, Inc.

Incremental Consistency Guarantees For Replicated Objects Rachid Guerraoui, Matej Pavlovic,

Incremental Change of Software Taxonomy of Evolution Changes Incremental change (IC)

Incremental Event Calculus for Run-Time Reasoning Efthimis Tsilionis, Alexander Artikis, Georgios

Riemannian Walk for Incremental Learning: Understanding Forgetting and Intransigence Arslan

Incremental learning of motion primitives for full body motions Dana Kuli c Department of

Paper Presentation Insights for incremental learning Zheng Shou 03-04-2015 1 Overview 1.

Apprent entices iceship hip Ass sses essme sment nt: what training providers need to know

What is a Degree or Higher Apprenticeship? Huw Morgan, Keele University Jane Tunnicliff,

Apprenticeship Funding Content Context to apprenticeship reforms Reforms and the apprenticeship

Early Years Workforce &amp; Qualifications The

AB 86: Adult Education Los Angeles Regional Adult Education Consortium All Districts Faculty

Mary ary Kay ay Shaf afer Local Conservation Corps Grant Program Supervisor, CalRecycle J im

The Modern Apprenticeship Route Linda Clinton East Renfrewshire Council Developing Young

Title Apprentice Transition Plan Staff Briefing Subtitle Subtitle What is the Apprentice

Sambuz

Useful Links

Newsletter

Mail Us

Early Years Workforce & Qualifications The