Proving the Convergence of Monte Carlo Tree Search to Brownian - PowerPoint PPT Presentation

Jul 27, 2023 •40 likes •180 views

Proving the Convergence of Monte Carlo Tree Search to Brownian Motion Elana Kozak United States Naval Academy Motivation- Machine Learning Have you ever played a game against a computer? Have you ever talked to Siri or Alexa? Have you ever

Proving the Convergence of Monte Carlo Tree Search to Brownian Motion Elana Kozak United States Naval Academy
Motivation- Machine Learning Have you ever played a game against a computer? Have you ever talked to Siri or Alexa? Have you ever used GPS to estimate travel time? Has Facebook ever suggested new friends for you? Has Amazon ever suggested a new product for you?
Military Applications ➢ Autonomous warfare platforms ➢ Cybersecurity programs ➢ Logistics and transportation ➢ Target recognition ➢ Combat simulation and training ➢ ISR missions ➢ Data processing ➢ Search and rescue From MarketResearch.com
AI Decision Methods ➢ Random ➢ Cheat ➢ Script ➢ Monte Carlo Tree Search From oreilly.com
“Game” or Decision Tree Game state Root node (v) Child nodes (v i ) Terminal node Generic Tree Tic-Tac-Toe Example
MCTS Steps From Kelly and Churchill, 2017
Upper Confidence Bound (UCB1) aka Upper Confidence Bound for Trees (UCT) V i : node V: parent node Q: win count N: visit count C: exploration constant From int8.io
Current Applications and Advantages ➢ Artificial Intelligence (AI) game players ○ Chess ○ Go ○ Tic-Tac-Toe ○ And more … ➢ Adjustable Computation ○ No initial strategy ○ Only stores end state ○ Set time limit But … not always accurate ➢ ○ Inherent randomness ○ Doesn’t cover all paths
Can we apply MCTS to search and detection? YES! Imagine a game … Moves = up, down, left, right Goal = find the target Our question: how does this method behave?
Theorem 1 A 2-D Monte Carlo Tree Search that uses the UCT selection policy and a uniformly random, unknown target will converge to a symmetric random walk as M, the size of the search lattice, goes to infinity.
Proof ● Let ε>0 and choose K(ε) such that (1/K(ε)) < ε as the radius of a region E around the origin ○ Thus K(ε) is the minimum number of steps required to exit this region ● Choose M as the dimension of the square grid such that P(dist(T, S(0))> K(ε)) = 1- δ ● Q = 1/k represents the success rate On average, k >> K(ε) so Q < 1/K(ε) < ε ○ Recall:
Proof (continued) V 1 ● N(v) is the same for all v i 1. First four trials pick i randomly, then UCT is equal for all i V 2 2. Visited nodes have a lower UCT, so next move is chosen randomly from remaining nodes V 4 3. Process repeats, randomly cycling through the moves since UCT is always equal V 3 Recall:
Future Work ❖ Theorem 2: When a stationary target is known, a 2-D Monte Carlo Tree Search will converge to an optimal “straight” line path as the number of iterations goes to infinity. ❖ Test MCTS in more complex scenarios ➢ More targets ➢ More searchers ➢ Different distributions ❖ How does MCTS compare to other search methods? ➢ Time, accuracy, computational complexity, etc. ❖ What real-world scenarios can we apply MCTS to? ➢ Search and rescue ➢ Animal foraging ➢ Submarine detection
Thank You

Recommend

Monte Carlo Generators Monte Carlo Generators Monte Carlo Generators QCD Lecture III P .

P . Skands QCD Lecture III Monte Carlo Generators Monte Carlo Generators Monte Carlo Generators QCD Lecture III P . Skands 1 P . Skands QCD Lecture III A Monte Carlo technique: is any technique making use of random numbers to solve a

2.37k views • 105 slides

Monte-Carlo tree search for Monte-Carlo tree search for multi-player, no-limit multi-player,

Monte-Carlo tree search for Monte-Carlo tree search for multi-player, no-limit multi-player, no-limit Texas hold'em poker Texas hold'em poker Guy Van den Broeck Should I bluff? Deceptive play Should I bluff? Is he bluffing? Opponent

1.14k views • 112 slides

Monte Carlo Tree Search 2-15-16 Reading Quiz What is the relationship between Monte Carlo tree

Monte Carlo Tree Search 2-15-16 Reading Quiz What is the relationship between Monte Carlo tree search and upper confidence bound applied to trees? a) MCTS is a type of UCB b) UCB is a type of MCTS c) both (they are the same algorithm) d)

331 views • 17 slides

Monte Carlo Methods Guojin Chen Christopher Cprek Chris Rambicure Monte Carlo Methods 1.

Monte Carlo Methods Guojin Chen Christopher Cprek Chris Rambicure Monte Carlo Methods 1. Introduction 2. History 3. Examples Introduction Monte Carlo methods are stochastic techniques. Monte Carlo method is very

1.27k views • 68 slides

Monte Carlo Approximation of Monte Carlo Filters Adam M. Johansen et al. Collaborators Include:

Monte Carlo Approximation of Monte Carlo Filters Adam M. Johansen et al. Collaborators Include: Arnaud Doucet, Axel Finke, Anthony Lee, Nick Whiteley 7th January 2014 Introduction Monte Carlo Approximationof Monte Carlo Filters Approximating

1.46k views • 31 slides

BROCHURE 2019 TETRA JUICES DEL MONTE DEL MONTE 6 x 1L GOLD PINEAPPLE 6 x 1L 6 x 1L 6 x 1L

BROCHURE 2019 TETRA JUICES DEL MONTE DEL MONTE 6 x 1L GOLD PINEAPPLE 6 x 1L 6 x 1L 6 x 1L 6 x 1L 6 x 1L 6 x 1L PINEAPPLE- GOLD DEL MONTE DEL MONTE COCCO DEL MONTE DEL MONTE DEL MONTE DEL MONTE DEL MONTE 8x1 lt PINEAPPLE

531 views • 12 slides

Modern Monte Carlo Tree Search Andrew Li, John Chen, Keiran Paster 1 Outline Motivation

Modern Monte Carlo Tree Search Andrew Li, John Chen, Keiran Paster 1 Outline Motivation Optimistic Exploration and Bandits Monte Carlo Tree Search (MCTS) Learning to Search in MCTS Thinking Fast and Slow with Deep Learning

567 views • 35 slides

Balanced Search Trees Binary Search Trees Binary Search Tree Binary Search Tree A binary tree is

Balanced Search Trees Binary Search Trees Binary Search Tree Binary Search Tree A binary tree is a binary search tree if each element in the left subtree is smaller than the root, each element in the right subtree is larger than the root,

757 views • 51 slides

Chapter 5: Monte Carlo Methods Monte Carlo methods are learning methods Experience

Chapter 5: Monte Carlo Methods Monte Carlo methods are learning methods Experience values, policy Monte Carlo methods can be used in two ways: ! model-free: No model necessary and still attains optimality ! Simulated: Needs only a

2.17k views • 32 slides

Draft Introduction to (randomized) quasi-Monte Carlo Pierre LEcuyer MCQMC Conference,

1 Draft Introduction to (randomized) quasi-Monte Carlo Pierre LEcuyer MCQMC Conference, Stanford University, August 2016 2 Draft Program Monte Carlo, Quasi-Monte Carlo, Randomized quasi-Monte Carlo QMC point sets and

1.98k views • 148 slides

Monte Carlo Estimation 7 January 2019 OSU CSE 1 Monte Carlo Methods Class of computational

Monte Carlo Estimation 7 January 2019 OSU CSE 1 Monte Carlo Methods Class of computational methods that use random sampling to estimate results Named after the famous Monte Carlo Casino 7 January 2019 OSU CSE 2 Throwing Darts 3 5

392 views • 12 slides

Monte Carlo Localization Ximing Yu March 24, 2009 Ximing Yu Monte Carlo Localization 1

Outline Introduction MCL Mixture-MCL End Monte Carlo Localization Ximing Yu March 24, 2009 Ximing Yu Monte Carlo Localization 1 Outline Introduction MCL Mixture-MCL End Introduction 1 Localization Problem Bayes Filter Monte Carlo

414 views • 23 slides

Monte Carlo Control CMPUT 366: Intelligent Systems S&B 5.3-5.5, 5.7 Lecture Outline 1.

Monte Carlo Control CMPUT 366: Intelligent Systems S&B 5.3-5.5, 5.7 Lecture Outline 1. Recap 2. Estimating Action Values 3. Monte Carlo Control 4. Importance Sampling 5. Off-Policy Monte Carlo Control Recap: Monte Carlo vs.

263 views • 22 slides

4. THE MONTE CARLO METHOD 4.1 I ntroduction This chapter is aimed at describing the Monte Carlo

C hapter 4: Monte Carlo Modeling of Grain Growth and Recrystallization, A.D. Rollett & P. Manohar 4. THE MONTE CARLO METHOD 4.1 I ntroduction This chapter is aimed at describing the Monte Carlo method for the simulation of grain growth and

2.05k views • 37 slides

CS171: Artificial Intelligence Monte Carlo Tree Search and Alpha Go Jia Chen Dec 5, 2017 1

CS171: Artificial Intelligence Monte Carlo Tree Search and Alpha Go Jia Chen Dec 5, 2017 1 Schedule Introduction Monte-Carlo Tree Search Policy and Value Networks Results 2 Introduction Go originated 2,500+ years ago

882 views • 47 slides

Monte Carlo Tree Search for Algorithm Configuration: MOSAIC Herilalaina Rakotoarison and Mich`

Monte Carlo Tree Search for Algorithm Configuration: MOSAIC Herilalaina Rakotoarison and Mich` ele Sebag TAU CNRS INRIA LRI Universit e Paris-Sud NeurIPS MetaLearning Wshop Dec. 8, 2018 1 / 14 Monte Carlo Tree Search for

574 views • 28 slides

identification of potential sewer mining locations . K. Tsoukalas, C. K. Makropoulos and S. N.

13th IWA Specialized Conference on Small Water and Wastewater Systems 14 - 16 September 2016, Athens, Greece Session: Small Scale and Decentralized Wastewater Treatment and Management A Monte-Carlo based method for the identification of

390 views • 17 slides

Refining Estimates of Sampling Variability for the Planning Databases Low Response Score Luke

A Simulation-Based Approach to Refining Estimates of Sampling Variability for the Planning Databases Low Response Score Luke J. Larsen U.S. Census Bureau July 29 August 3, 2018 2018 JSM Conference Vancouver, BC This presentation is

596 views • 23 slides

Alpha Presentation Surge xOS: Visualization of Automated Underwriting The Capstone Experience

Alpha Presentation Surge xOS: Visualization of Automated Underwriting The Capstone Experience Team Surge Solutions Pawel Babkowski Dakota Klatt Prudhvi Kuchipudi Erika Lustig Drew Rutt YuanYuan Zhou Department of Computer Science and

388 views • 9 slides

ENFORSING A SYSTEM APPROACH TO COMPOSITE FAILURE CRITERIA FOR RELIABILITY ANALYSIS N. Dimitrov 1*

18 TH INTERNATIONAL CONFERENCE ON COMPOSITE MATERIALS ENFORSING A SYSTEM APPROACH TO COMPOSITE FAILURE CRITERIA FOR RELIABILITY ANALYSIS N. Dimitrov 1* , P. Friis-Hansen 2 C. Berggreen 3 , 1 Structure and Mechanics Department, Siemens Wind Power

529 views • 5 slides

Monte Carlo simulation for a doubly nonlinear problem in finance Lokman Abbas-Turki First part

Monte Carlo simulation for a doubly nonlinear problem in finance Lokman Abbas-Turki First part from a joint work with M. A. Mikou Last part from a joint work with S. Graillat UPMC, LPMA 10 June 2015 Lokman (UPMC, LPMA) Journes inaugurales

712 views • 22 slides

Kevin McLaughlin Outline Advance of Fab technologies and the evolution of raw materials for

Quality of Semiconductor Raw Materials: Evolution and Challenges Yongqiang Lu Kevin McLaughlin Outline Advance of Fab technologies and the evolution of raw materials for ever higher quality Challenges: metrology Case study--ICP MS

577 views • 18 slides

An Agent-Based Boom-Bust Business Cycle Model with Search-for-Yield and Heterogeneous

An Agent-Based Boom-Bust Business Cycle Model with Search-for-Yield and Heterogeneous Expectations in the Bond Market Carl Chiarella (UTS) Corrado di Guilmi (UTS) Timo Henckel (ANU) October 2013 Chiarella, Di Guilmi & Henckel () Boom-Bust

524 views • 41 slides

Portable Monte Carlo Transport Performance Evaluation in the PATMOS Prototype Tao CHANG 1

Portable Monte Carlo Transport Performance Evaluation in the PATMOS Prototype Tao CHANG 1 DEN-Service dEtudes des R eacteurs et de Math ematiques Appliqu ees (SERMA) November 27, 2019 Portable Monte Carlo Transport Performance

921 views • 36 slides