best first utility guided search
play

Best-first Utility-guided Search Wheeler Ruml and Minh B. Do - PowerPoint PPT Presentation

Best-first Utility-guided Search Wheeler Ruml and Minh B. Do Embedded Reasoning Area Palo Alto Research Center { ruml , minhdo } @parc.com Wheeler Ruml (PARC) Best-first Utility-guided Search 1 / 10 Overview Overview Anytime Algs


  1. Best-first Utility-guided Search Wheeler Ruml and Minh B. Do Embedded Reasoning Area Palo Alto Research Center { ruml , minhdo } @parc.com Wheeler Ruml (PARC) Best-first Utility-guided Search – 1 / 10

  2. Overview ■ Overview ■ Anytime Algs ■ Bugsy ■ Properties ■ Evaluation A* takes too long: we must trade cost for time. ■ Gridworld ■ MSA ■ Planning ■ Summary Wheeler Ruml (PARC) Best-first Utility-guided Search – 2 / 10

  3. Overview ■ Overview ■ Anytime Algs ■ Bugsy ■ Properties ■ Evaluation A* takes too long: we must trade cost for time. ■ Gridworld ■ MSA ■ Planning ■ Summary U ( s, t ) = − w f · f ( s ) − w t · t Wheeler Ruml (PARC) Best-first Utility-guided Search – 2 / 10

  4. Overview ■ Overview ■ Anytime Algs ■ Bugsy ■ Properties ■ Evaluation A* takes too long: we must trade cost for time. ■ Gridworld ■ MSA ■ Planning ■ Summary U ( s, t ) = − w f · f ( s ) − w t · t Anytime algorithms are annoying to use and to design. Utility-guided search is a promising alternative. Wheeler Ruml (PARC) Best-first Utility-guided Search – 2 / 10

  5. The Anytime Approach ■ Overview ■ Anytime Algs ■ Bugsy ■ Properties ■ Evaluation ■ Gridworld ■ MSA ■ Planning ■ Summary Wheeler Ruml (PARC) Best-first Utility-guided Search – 3 / 10

  6. The Anytime Approach Requires a termination policy, assuming: ■ Overview ■ Anytime Algs relevant features for predicting progress are known ■ ■ Bugsy ■ Properties training data available ■ ■ Evaluation new instance is similar in relevant aspects to training ■ ■ Gridworld ■ MSA relevant aspects are known ■ ■ Planning ■ Summary Wheeler Ruml (PARC) Best-first Utility-guided Search – 3 / 10

  7. The Anytime Approach Requires a termination policy, assuming: ■ Overview ■ Anytime Algs relevant features for predicting progress are known ■ ■ Bugsy ■ Properties training data available ■ ■ Evaluation new instance is similar in relevant aspects to training ■ ■ Gridworld ■ MSA relevant aspects are known ■ ■ Planning ■ Summary Impossible to design optimally: f = 5 f = 4 d = 1 d = 2 Must know the user’s utility function! Wheeler Ruml (PARC) Best-first Utility-guided Search – 3 / 10

  8. Best-first Utility-guided Search, Yes! Want best-first search according to: ■ ■ Overview ■ Anytime Algs ■ Bugsy U ( n ) = s under n ( − w f · f ( s ) − w t · t ( s )) max ■ Properties ■ Evaluation ■ Gridworld ■ MSA ■ Planning ■ Summary Wheeler Ruml (PARC) Best-first Utility-guided Search – 4 / 10

  9. Best-first Utility-guided Search, Yes! Want best-first search according to: ■ ■ Overview ■ Anytime Algs ■ Bugsy U ( n ) = s under n ( − w f · f ( s ) − w t · t ( s )) max ■ Properties ■ Evaluation ■ Gridworld ■ MSA Approximate s under n by cheapest and nearest ■ ■ Planning ■ Summary ◆ f ( cheapest ) = f ( n ) = g ( n ) + h ( n ) ◆ f ( nearest ) seems straightforward in many domains Estimate d ( n ) and convert to t ( n ) . ■ ◆ d ( n ) seems straightforward in many domains Wheeler Ruml (PARC) Best-first Utility-guided Search – 4 / 10

  10. Properties Different from anytime algorithms ■ Overview ■ Anytime Algs no need for termination policy, training data ■ ■ Bugsy ■ Properties can spend all effort pursuing one solution ■ ■ Evaluation no fixed trade-off ■ ■ Gridworld ■ MSA ■ Planning cost ■ Summary Bugsy anytime solutions utility time Wheeler Ruml (PARC) Best-first Utility-guided Search – 5 / 10

  11. Properties Different from anytime algorithms ■ Overview ■ Anytime Algs no need for termination policy, training data ■ ■ Bugsy ■ Properties can spend all effort pursuing one solution ■ ■ Evaluation no fixed trade-off ■ ■ Gridworld ■ MSA ■ Planning cost ■ Summary anytime solutions utility Bugsy time Wheeler Ruml (PARC) Best-first Utility-guided Search – 5 / 10

  12. Properties Different from anytime algorithms ■ Overview ■ Anytime Algs no need for termination policy, training data ■ ■ Bugsy ■ Properties can spend all effort pursuing one solution ■ ■ Evaluation no fixed trade-off ■ ■ Gridworld ■ MSA ■ Planning ■ Summary Similar to weighted A* iff h = d . but intuitive meaning for weight ■ otherwise, exploits additional information ■ Reasonable properties Complete if h and d are reasonable ■ Optimal if h and d are perfect ■ Wheeler Ruml (PARC) Best-first Utility-guided Search – 5 / 10

  13. Empirical Evaluation Algorithms: ■ ■ Overview ■ Anytime Algs ◆ Bugsy ■ Bugsy ■ Properties Anytime Replanning A* (ARA*), Likhachev et al. (2004) ◆ ■ Evaluation ◆ Anytime A* (AA*), Hansen et al. (1997) ■ Gridworld ■ MSA ◆ Greedy (Gr), Doran and Michie (1966) ■ Planning ◆ A*, Hart et al. (1968) ■ Summary Wide variety of utility functions. ■ Record CPU time and solution quality for every solution. ■ Assume clairvoyant termination for anytime algorithms. ■ Normalize utilities from 0–100. ■ See paper for full results. ■ (These results are conservative.) Wheeler Ruml (PARC) Best-first Utility-guided Search – 6 / 10

  14. Gridworld Pathfinding U () ARA* AA* Gr A* Bugsy ■ Overview ■ Anytime Algs time only 59 100 100 100 100 ■ Bugsy 500 microsec 99 99 99 59 100 ■ Properties ■ Evaluation 1 msec 98 98 59 99 99 ■ Gridworld 5 msec 91 93 90 59 99 ■ MSA ■ Planning 10 msec 82 86 80 59 99 ■ Summary 50 msec 97 25 54 19 65 0.1 sec 97 60 63 19 82 cost only 19 98 98 98 98 Wheeler Ruml (PARC) Best-first Utility-guided Search – 7 / 10

  15. Multiple Sequence Alignment U () ARA* AA* Gr A* Bugsy ■ Overview ■ Anytime Algs time only 54 100 100 100 100 ■ Bugsy 0.1 sec 97 98 96 54 99 ■ Properties ■ Evaluation 0.5 sec 83 88 76 52 92 ■ Gridworld 1 sec 68 79 54 51 80 ■ MSA ■ Planning 5 sec 68 71 25 73 75 ■ Summary 10 secs 78 75 74 25 78 cost only 82 82 82 24 82 Wheeler Ruml (PARC) Best-first Utility-guided Search – 8 / 10

  16. Temporal Planning U () ARA* AA* A* Bugsy ■ Overview ■ Anytime Algs zenotravel-7 ■ Bugsy 500 microsec 69 0 81 100 ■ Properties ■ Evaluation 1 msec 71 0 83 100 ■ Gridworld 5 msec 74 0 85 100 ■ MSA ■ Planning 10 msec 84 0 96 100 ■ Summary 50 msec 91 91 0 100 0.5 sec 97 97 0 100 5 sec 99 99 0 100 rovers-5 500 microsec 67 0 62 100 1 msec 72 0 66 100 5 msec 77 0 71 100 10 msec 92 0 93 100 50 msec 78 0 93 100 Wheeler Ruml (PARC) Best-first Utility-guided Search – 9 / 10

  17. Summary Anytime algorithms are annoying to use and to design. ■ ■ Overview ■ Anytime Algs ■ Bugsy Utility-based search is a promising and practical alternative. ■ ■ Properties ■ Evaluation Extendable to solving deadlines and plan execution deadlines. ■ ■ Gridworld ■ MSA ■ Planning ■ Summary There’s information beyond g ( n ) and h ( n ) , namely d ( n ) . ■ Wheeler Ruml (PARC) Best-first Utility-guided Search – 10 / 10

Recommend


More recommend