  1. Directed Reading Program, Spring 2019. Markov Chains & Random Walks. Zifan Yu, Department of Mathematics, University of Maryland. Mentored by Pranav Jayanti. (1/18)

  2. Weather model with three states: Sunny, Cloudy, Storm (transition diagram among the three states). ❖ Questions to consider, given that the probability distribution of the weather today is [a, b, c]: • How do we predict the weather for tomorrow, if the day-to-day transition probabilities are the same for each day? • Is it possible that after a thousand years, the chances of each kind of weather remain unchanged? (See the sketch below.) (2/18)
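A minimal numerical sketch of both questions, assuming a made-up 3×3 transition matrix (the slide does not give the actual probabilities): tomorrow's distribution is the vector-matrix product $\lambda P$, and iterating the product long enough shows the distribution settling down.

```python
import numpy as np

# Hypothetical transition matrix for the weather model (illustrative numbers
# only; the slide does not give them). Rows = today's weather, columns =
# tomorrow's, in the order [Sunny, Cloudy, Storm]; each row sums to 1.
P = np.array([
    [0.7, 0.2, 0.1],   # Sunny  -> Sunny / Cloudy / Storm
    [0.3, 0.4, 0.3],   # Cloudy -> ...
    [0.2, 0.5, 0.3],   # Storm  -> ...
])

today = np.array([0.5, 0.3, 0.2])   # [a, b, c]: today's distribution

tomorrow = today @ P                                         # one-step prediction
far_future = today @ np.linalg.matrix_power(P, 365 * 1000)   # ~a thousand years

print("tomorrow  :", tomorrow)
print("far future:", far_future)   # stops changing: an equilibrium
```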

  3. Markov Chains - what is it? ❖ Formally, a Markov chain is defined to be a sequence of random variables $(X_n)_{n \ge 0}$, taking values in a set of states, which we denote by $S$, with initial distribution $\lambda$ and transition matrix $P$, if • $X_0$ has distribution $\lambda = \{\lambda_i \mid i \in S\}$; • $P = (p_{ij})_{i,j \in S}$, and the Markov property holds: $P(X_n = i_n \mid X_{n-1} = i_{n-1}, \ldots, X_0 = i_0) = P(X_n = i_n \mid X_{n-1} = i_{n-1}) = p_{i_{n-1} i_n}$. ❖ Probability distributions: $P(X_n = j) = (\lambda P^n)_j$ and $P_i(X_n = j) = P(X_{n+m} = j \mid X_m = i) = p^{(n)}_{ij}$. (3/18)
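As a small illustration of the definition (a sketch, not from the slides): a chain can be simulated by sampling $X_0$ from $\lambda$ and then each $X_n$ from row $X_{n-1}$ of $P$, which is exactly the Markov property; the n-step distribution can also be computed directly as $\lambda P^n$.

```python
import numpy as np

rng = np.random.default_rng(0)

def simulate_chain(lam, P, n_steps):
    """Sample X_0 ~ lambda, then X_n from row X_{n-1} of P: the next state
    depends only on the current state (the Markov property)."""
    x = rng.choice(len(lam), p=lam)
    path = [x]
    for _ in range(n_steps):
        x = rng.choice(P.shape[0], p=P[x])
        path.append(x)
    return path

def n_step_distribution(lam, P, n):
    """P(X_n = j) = (lambda P^n)_j, computed without simulation."""
    return lam @ np.linalg.matrix_power(P, n)
```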

  4. Markov Chains - communicating classes and irreducibility. We say that a state i communicates with a state j if one can get from i to j, as well as from j to i, in finitely many steps. We denote this relation by i <—> j. (Diagram: states A, B, C, D.) Note: i —> j if and only if $p_{i k_1} p_{k_1 k_2} \cdots p_{k_{n-1} j} > 0$ for some finite sequence of states $k_1, \ldots, k_{n-1}$. Also note that <—> is an equivalence relation: (1) symmetric: if i <—> j then j <—> i; (2) reflexive: i <—> i; (3) transitive: i <—> j and j <—> k imply i <—> k. (4/18)

  5. Markov Chains - communicating classes and irreducibility. The equivalence classes of states under this relation are called communicating classes; hence we can partition the set S into communicating classes with respect to this equivalence relation. (Diagram: states A, B, C, D.) Definition: A Markov chain is irreducible if its set of states S forms a single communicating class. (5/18)
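The partition can be computed mechanically. Below is a sketch (not from the slides) that groups the states of a finite chain into communicating classes by checking mutual reachability, and declares the chain irreducible when a single class remains.

```python
import numpy as np

def communicating_classes(P):
    """Partition states into communicating classes: i <-> j iff each is
    reachable from the other in finitely many steps."""
    n = P.shape[0]
    reach = (P > 0) | np.eye(n, dtype=bool)   # one-step reachability, plus i -> i
    for _ in range(n):                        # transitive closure
        reach = reach | ((reach.astype(int) @ reach.astype(int)) > 0)
    classes, seen = [], set()
    for i in range(n):
        if i not in seen:
            cls = [j for j in range(n) if reach[i, j] and reach[j, i]]
            classes.append(cls)
            seen.update(cls)
    return classes

def is_irreducible(P):
    return len(communicating_classes(P)) == 1
```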

  6. Markov Chains-communicating classes and irreducibility Illustration of irreducible and reducible Markov chains: Note: Irreducibility of a Markov chain prepares us to study the equilibrium state of this chain. (6/18)

  7. Markov Chains - aperiodicity of Markov chains. ❖ Definition: A state i is called aperiodic if there exists a positive integer N such that $p^{(n)}_{ii} > 0$ for all $n \ge N$. ❖ Theorem: If P is irreducible and has an aperiodic state i, then for all states j and k, $p^{(n)}_{jk} > 0$ for all sufficiently large n (therefore all states are aperiodic). Sketch of the proof: $p^{(r+n+s)}_{jk} = \sum_{i_1, \ldots, i_n} p^{(r)}_{j i_1} p_{i_1 i_2} \cdots p_{i_{n-1} i_n} p^{(s)}_{i_n k} \ge p^{(r)}_{ji} \, p^{(n)}_{ii} \, p^{(s)}_{ik} > 0$. ❖ Definition: We call a Markov chain aperiodic if all its states are aperiodic. Now, recall the question: after sufficiently many evolution steps, will the distribution of states reach an equilibrium? (7/18)
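The definition can be probed numerically. The sketch below (an illustration, not a proof) computes the diagonal entries of $P^n$ up to a finite horizon and reports the first N from which $p^{(n)}_{ii} > 0$ holds for every n up to that horizon.

```python
import numpy as np

def first_aperiodic_N(P, i, n_max=60, eps=1e-12):
    """Search for the smallest N (up to horizon n_max) such that
    p_ii^(n) > 0 for all N <= n <= n_max. Heuristic evidence of
    aperiodicity only, since the horizon is finite."""
    positive = []
    Pn = np.eye(P.shape[0])
    for n in range(1, n_max + 1):
        Pn = Pn @ P
        positive.append(Pn[i, i] > eps)
    for N in range(n_max):
        if all(positive[N:]):
            return N + 1          # powers N+1, ..., n_max are all positive
    return None                   # state looks periodic up to the horizon
```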

  8. Markov Chains - invariant distributions. ❖ A measure on a Markov chain is any vector $\lambda = \{\lambda_i \ge 0 \mid i \in S\}$. ❖ In addition, $\lambda$ is a distribution if $\sum_{i \in S} \lambda_i = 1$. ❖ We say a measure $\lambda$ is invariant if $\lambda = \lambda P$. ❖ Theorem: Suppose that $(X_n)_{n \ge 0}$ is a Markov chain with transition matrix P and initial distribution $\lambda$. If P is both irreducible and aperiodic, and has an invariant distribution $\pi$, then $P(X_n = j) = (\lambda P^n)_j \to \pi_j$ as $n \to \infty$ for all j. In particular, $p^{(n)}_{ij} \to \pi_j$ for all i, j. (8/18)
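A minimal sketch of the theorem in action (illustrative, not from the slides): start from any initial distribution and repeatedly multiply by P; for an irreducible aperiodic chain the iterates $\lambda P^n$ stop changing, and the limit is the invariant $\pi$.

```python
import numpy as np

def invariant_distribution(P, tol=1e-12, max_iter=100_000):
    """Left power iteration: lambda <- lambda P until it stabilizes.
    The fixed point satisfies pi = pi P, i.e. it is invariant."""
    lam = np.full(P.shape[0], 1.0 / P.shape[0])   # arbitrary start: uniform
    for _ in range(max_iter):
        nxt = lam @ P
        if np.abs(nxt - lam).max() < tol:
            return nxt
        lam = nxt
    raise RuntimeError("no convergence; is P irreducible and aperiodic?")
```

The starting distribution does not matter: by the theorem, every $\lambda$ is driven to the same $\pi$.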

  9. Markov Chains - invariant distributions. (Weather photos; picture credits: Seattle Refined, BBC News, smithsonian.com.) (9/18)

  10. Markov Chains - invariant distributions. By assuming that the finite-state Markov chain is irreducible and aperiodic, we can apply the Perron-Frobenius Theorem. ❖ The Perron-Frobenius Theorem: Let A be a positive square matrix. Then • A has a largest eigenvalue in absolute value, $\rho(A)$, and it has a positive eigenvector. • $\rho(A)$ has geometric multiplicity 1. • $\rho(A)$ has algebraic multiplicity 1. Note: this also holds for nonnegative A such that $A^m$ is positive for some power m. By applying the Perron-Frobenius Theorem to P: • $\rho(P) = 1$, with unique positive left eigenvector $\pi$: $\pi P = \pi$. • All other eigenvalues have absolute value < 1. (10/18)
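This can be checked numerically with an eigendecomposition (a sketch, not from the slides): left eigenvectors of P are right eigenvectors of $P^T$, so one expects a single eigenvalue of absolute value 1 whose eigenvector normalizes to $\pi$, with every other eigenvalue strictly inside the unit circle.

```python
import numpy as np

def perron_check(P):
    """Return the eigenvalue magnitudes of P (sorted descending) and the
    left eigenvector for the top eigenvalue, normalized to a distribution."""
    vals, vecs = np.linalg.eig(P.T)        # right eigenvectors of P.T
    order = np.argsort(-np.abs(vals))
    vals, vecs = vals[order], vecs[:, order]
    pi = np.real(vecs[:, 0])
    pi = pi / pi.sum()                     # fixes sign and scale
    return np.abs(vals), pi                # expect [1.0, <1, <1, ...] and pi > 0
```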

  11. Markov Chains-Invariant distributions (11/18)

  12. Markov Chains - recurrence and transience. ❖ Let $(X_n)_{n \ge 0}$ be a Markov chain with transition matrix P. Then a state $i \in S$ is recurrent if $P_i(X_n = i \text{ for infinitely many } n) = 1$. ❖ We say that i is transient if $P_i(X_n = i \text{ for infinitely many } n) = 0$. Now we are ready to see one concrete instance of abstract Markov chains: random walks. (12/18)

  13. Simple random walks - one dimension. We start by studying the simple random walk on the integer lattice. At each time step, the random walker flips a fair coin to decide its next move. Let $S_n$ denote the position at time n, and x the position it starts at. At each time step j, $X_j = 1$ if a head appears on the j-th throw, and $X_j = -1$ otherwise; we have $P(X_j = 1) = P(X_j = -1) = 1/2$ and $S_n = x + X_1 + \ldots + X_n$. Questions: • On average, how far is the walker from the starting point? • Does the walker keep returning to the origin, or does it eventually leave forever? (See the simulation sketch below.) (13/18)
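A direct simulation sketch (illustrative; any fair coin-flip source works):

```python
import numpy as np

rng = np.random.default_rng(0)

def random_walk(n_steps, x=0):
    """S_0 = x, S_n = x + X_1 + ... + X_n with X_j = +1 or -1,
    each with probability 1/2."""
    steps = rng.choice([1, -1], size=n_steps)
    return x + np.concatenate(([0], np.cumsum(steps)))

path = random_walk(1000)
print("final position  :", path[-1])
print("returns to start:", int((path[1:] == path[0]).sum()))
```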

  14. Simple random walks - one dimension. It's easy to check that $E(S_n) = x + E(X_1) + \ldots + E(X_n) = x + 0 + \ldots + 0 = x$; and since $Var(X) = E(X^2) - E(X)^2 = E(X^2) = 1$ (assume the walker starts from 0), we have $Var(S_n) = 0 + Var(X_1) + \ldots + Var(X_n) = n$, so $\sigma_{S_n} = \sqrt{n}$ (the typical distance from the origin). ❖ What does this tell us? In one dimension, there are at most a constant times $\sqrt{n}$ integers within the typical distance of the mean. So the chance of lying on a particular integer should shrink as a constant times $n^{-1/2}$: $P(S_n = j) \sim C \, n^{-1/2}$. (14/18)
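A quick Monte Carlo check of the $\sqrt{n}$ scaling (a sketch; the sample sizes are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(1)

# Simulate many length-n walks from 0 and compare the root-mean-square
# displacement with sqrt(n); the two should nearly agree, since Var(S_n) = n.
n, trials = 10_000, 2_000
S_n = rng.choice([1, -1], size=(trials, n)).sum(axis=1)
print("RMS displacement:", np.sqrt((S_n.astype(float) ** 2).mean()))
print("sqrt(n)         :", np.sqrt(n))
```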

  15. Simple random walks - one dimension. We may notice that after an odd number of steps the walker must be at an odd integer; similarly, to be at an even integer, it needs an even number of steps. So we compute the return probability $P(S_{2n} = 0) = \binom{2n}{n} \left(\frac{1}{2}\right)^n \left(\frac{1}{2}\right)^n = \frac{(2n)!}{n! \, n!} \left(\frac{1}{2}\right)^{2n}$. Stirling's formula states that, as $n \to \infty$, $n! \sim \sqrt{2\pi} \, n^{n + 1/2} e^{-n}$. Then $P(S_{2n} = 0) = \frac{(2n)!}{n! \, n!} \left(\frac{1}{2}\right)^{2n} \sim C_0 \, n^{-1/2}$, with $C_0 = 1/\sqrt{\pi}$. (15/18)
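The asymptotic is easy to verify numerically (a sketch comparing the exact probability with the Stirling approximation):

```python
import math

# Exact return probability P(S_{2n} = 0) = C(2n, n) / 4^n versus the
# Stirling asymptotic 1 / sqrt(pi * n); the ratio tends to 1.
for n in (10, 100, 1000):
    exact = math.comb(2 * n, n) / 4 ** n
    approx = 1 / math.sqrt(math.pi * n)
    print(f"n={n:5d}  exact={exact:.6f}  stirling={approx:.6f}")
```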

  16. Simple random walks - one dimension. ❖ Define V to be a random variable that denotes the number of times the walker returns to 0; then $V = \sum_{n=0}^{\infty} I\{S_{2n} = 0\}$ (where $I\{A\}$ is an indicator function). ❖ Consider the mean of the number of visits: $E(V) = \sum_{n=0}^{\infty} E(I\{S_{2n} = 0\}) = 1 + \sum_{n=1}^{\infty} P(S_{2n} = 0) \sim 1 + \sum_{n=1}^{\infty} C_0 \, n^{-1/2} = \infty$. (Recall that the sum $\sum_{n=1}^{\infty} n^{-1/2}$ diverges since $1/2 < 1$.) If we let q = P(the walker ever returns to 0), then we can show that q = 1 by supposing q < 1 and drawing the contradiction that E(V) would actually be finite. (16/18)
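A numerical illustration of the divergence (a sketch; it uses the recursion $P(S_{2n} = 0) = P(S_{2n-2} = 0) \cdot \frac{2n-1}{2n}$ to avoid huge factorials):

```python
# Partial sums of E(V) = 1 + sum_{n>=1} P(S_{2n} = 0): they grow without
# bound, roughly like 2 * sqrt(n / pi), so E(V) = infinity.
total, p = 1.0, 1.0
for n in range(1, 1_000_001):
    p *= (2 * n - 1) / (2 * n)    # p is now P(S_{2n} = 0)
    total += p
    if n in (10, 1_000, 100_000, 1_000_000):
        print(f"n={n:8d}  partial sum={total:9.2f}")
```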

  17. Simple random walks - higher dimensions. ❖ What will happen if the random walker takes action in higher dimensions, say $\mathbb{Z}^d$? • In each direction, the random walk behaves as in one dimension. • In 2n steps, we expect about 2n/d steps to be taken in each of the d directions. • $P(S_{2n} = \text{any particular lattice point}) \sim c_d \, n^{-d/2}$. • Return to origin: since $P(S_{2n} = 0) \sim c_d \, n^{-d/2}$, $E(V) = \sum_{n=0}^{\infty} P(S_{2n} = 0) \sim \sum_{n=1}^{\infty} \frac{c_d}{n^{d/2}} \begin{cases} < \infty, & d \ge 3 \\ = \infty, & d = 1, 2 \end{cases}$ ❖ These results correspond to the facts that for the simple symmetric random walk on $\mathbb{Z}$ or $\mathbb{Z}^2$ all states are recurrent, while on $\mathbb{Z}^d$ with $d \ge 3$ all states are transient. (See the Monte Carlo sketch below.) (17/18)
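A Monte Carlo sketch of the dimension dependence (finite horizons only, so the numbers are proxies for the true return probabilities, which are 1 for d = 1, 2 and about 0.34 for d = 3):

```python
import numpy as np

rng = np.random.default_rng(2)

def return_frequency(d, n_steps=2_000, trials=400):
    """Fraction of d-dimensional simple symmetric walks that revisit the
    origin within n_steps: each step moves +-1 along one random coordinate."""
    hits = 0
    for _ in range(trials):
        axes = rng.integers(d, size=n_steps)
        signs = rng.choice([1, -1], size=n_steps)
        pos = np.zeros(d, dtype=int)
        for a, s in zip(axes, signs):
            pos[a] += s
            if not pos.any():     # back at the origin
                hits += 1
                break
    return hits / trials

for d in (1, 2, 3):
    print(f"d={d}: returned within horizon in {return_frequency(d):.0%} of runs")
```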

  18. References: ❖ Cameron, M. (n.d.). Discrete time Markov chains. ❖ Lawler, G. F. (2011). Random Walk and the Heat Equation. Providence, RI: American Mathematical Society. ❖ Cairns, H. (2014). A short proof of Perron's theorem. (18/18)
