Convergence of Random Processes
DS-GA 1002 Probability and Statistics for Data Science
http://www.cims.nyu.edu/~cfgranda/pages/DSGA1002_fall17
Carlos Fernandez-Granda


  1. Convergence of Random Processes. DS-GA 1002 Probability and Statistics for Data Science (http://www.cims.nyu.edu/~cfgranda/pages/DSGA1002_fall17). Carlos Fernandez-Granda

  2. Aim. Define convergence for random processes. Describe two convergence phenomena: the law of large numbers and the central limit theorem.

  3. Types of convergence Law of Large Numbers Central Limit Theorem Monte Carlo simulation

  4. Convergence of deterministic sequences. A deterministic sequence of real numbers $x_1, x_2, \ldots$ converges to $x \in \mathbb{R}$, written $\lim_{i \to \infty} x_i = x$, if $x_i$ is arbitrarily close to $x$ as $i$ grows: for any $\epsilon > 0$ there is an $i_0$ such that for all $i > i_0$, $|x_i - x| < \epsilon$. Problem: random sequences do not have fixed values.

  5. Convergence with probability one. Consider a discrete random process $\tilde{X}$ and a random variable $X$ defined on the same probability space. If we fix the outcome $\omega$, then $\tilde{X}(i, \omega)$ is a deterministic sequence and $X(\omega)$ is a constant, so we can determine whether $\lim_{i \to \infty} \tilde{X}(i, \omega) = X(\omega)$ for that particular $\omega$.

  6. Convergence with probability one. $\tilde{X}$ converges with probability one to $X$ if $\mathrm{P}\left( \left\{ \omega \mid \omega \in \Omega, \ \lim_{i \to \infty} \tilde{X}(\omega, i) = X(\omega) \right\} \right) = 1$, i.e., deterministic convergence occurs with probability one.

  7. Puddle. The initial amount of water is uniform between 0 and 1 gallon. After $i$ time intervals there is $i$ times less water: $\tilde{D}(\omega, i) := \omega / i$, $i = 1, 2, \ldots$

  8. Puddle. [Figure: $\tilde{D}(\omega, i)$ versus $i = 1, \ldots, 10$ for the outcomes $\omega = 0.31$, $\omega = 0.52$, $\omega = 0.89$.]

  9. Puddle. If we fix $\omega \in (0, 1)$, then $\lim_{i \to \infty} \tilde{D}(\omega, i) = \lim_{i \to \infty} \omega / i = 0$, so $\tilde{D}$ converges to zero with probability one.

  10. Puddle. [Figure: $\tilde{D}(\omega, i)$ versus $i = 1, \ldots, 50$ for several outcomes.]
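The puddle example is easy to check numerically. A minimal NumPy sketch (not from the slides; the number of outcomes, horizon, and tolerance are arbitrary choices):

```python
import numpy as np

rng = np.random.default_rng(0)

# A few outcomes omega ~ Uniform(0, 1); for each fixed omega,
# D(omega, i) = omega / i is a deterministic sequence tending to 0.
omegas = rng.uniform(0, 1, size=5)
i_vals = np.arange(1, 51)
paths = omegas[:, None] / i_vals[None, :]   # one row per outcome

# Every path falls below any tolerance eps once i is large enough.
eps = 0.05
tail_below_eps = np.all(paths[:, -1] < eps)
```

Each row of `paths` is one realization of the deterministic sequence obtained by fixing $\omega$, exactly as on the previous slides.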

  11. Alternative idea. Instead of fixing $\omega$ and checking deterministic convergence: 1. Measure how close $\tilde{X}(i)$ and $X$ are for a fixed $i$ using a deterministic quantity. 2. Check whether that quantity tends to zero.

  12. Convergence in mean square. The mean square of $Y - X$ measures how close $X$ and $Y$ are. If $\mathrm{E}\left[(X - Y)^2\right] = 0$ then $X = Y$ with probability one. Proof: by Markov's inequality, for any $\epsilon > 0$, $\mathrm{P}\left((Y - X)^2 > \epsilon\right) \le \mathrm{E}\left[(X - Y)^2\right] / \epsilon = 0$.

  13. Convergence in mean square. $\tilde{X}$ converges to $X$ in mean square if $\lim_{i \to \infty} \mathrm{E}\left[\left(X - \tilde{X}(i)\right)^2\right] = 0$.
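For the puddle process this quantity has a closed form: $\mathrm{E}[\tilde{D}(i)^2] = \mathrm{E}[\omega^2]/i^2 = 1/(3i^2) \to 0$, since $\mathrm{E}[\omega^2] = 1/3$ for $\omega$ uniform on $(0,1)$. A NumPy sketch checking this empirically (the sample size is arbitrary):

```python
import numpy as np

rng = np.random.default_rng(1)
omega = rng.uniform(0, 1, size=100_000)

# Empirical mean square E[(D(i) - 0)^2] for increasing i.  The closed
# form is E[omega^2] / i^2 = 1 / (3 i^2), which tends to zero.
ms = {i: np.mean((omega / i) ** 2) for i in (1, 10, 100)}
```

`ms[1]` estimates $\mathrm{E}[\omega^2] = 1/3$, and the values shrink like $1/i^2$, so $\tilde{D}$ converges to zero in mean square as well.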

  14. Convergence in probability. Alternative measure: the probability that $|Y - X| > \epsilon$ for small $\epsilon$. $\tilde{X}$ converges to $X$ in probability if for any $\epsilon > 0$, $\lim_{i \to \infty} \mathrm{P}\left(\left|X - \tilde{X}(i)\right| > \epsilon\right) = 0$.

  15. Conv. in mean square implies conv. in probability. We study $\lim_{i \to \infty} \mathrm{P}\left(\left|X - \tilde{X}(i)\right| > \epsilon\right)$.

  16. Squaring inside the probability, $\lim_{i \to \infty} \mathrm{P}\left(\left|X - \tilde{X}(i)\right| > \epsilon\right) = \lim_{i \to \infty} \mathrm{P}\left(\left(X - \tilde{X}(i)\right)^2 > \epsilon^2\right)$.

  17. By Markov's inequality, this is bounded by $\lim_{i \to \infty} \mathrm{E}\left[\left(X - \tilde{X}(i)\right)^2\right] / \epsilon^2$.

  18. If $\tilde{X}$ converges to $X$ in mean square, this limit equals $0$.

  19. Putting it together: $\lim_{i \to \infty} \mathrm{P}\left(\left|X - \tilde{X}(i)\right| > \epsilon\right) = \lim_{i \to \infty} \mathrm{P}\left(\left(X - \tilde{X}(i)\right)^2 > \epsilon^2\right) \le \lim_{i \to \infty} \frac{\mathrm{E}\left[\left(X - \tilde{X}(i)\right)^2\right]}{\epsilon^2} = 0$. Convergence with probability one also implies convergence in probability.

  20. Convergence in distribution. The distribution of $\tilde{X}(i)$ converges to the distribution of $X$: $\tilde{X}$ converges in distribution to $X$ if $\lim_{i \to \infty} F_{\tilde{X}(i)}(x) = F_X(x)$ for all $x$ at which $F_X$ is continuous.

  21. Convergence in distribution. Convergence in distribution does not imply that $\tilde{X}(i)$ and $X$ are close as $i \to \infty$! Convergence in probability does imply convergence in distribution.

  22. Binomial tends to Poisson. Let $\tilde{X}(i)$ be binomial with parameters $i$ and $p := \lambda / i$, and let $X$ be a Poisson random variable with parameter $\lambda$. Then $\tilde{X}(i)$ converges to $X$ in distribution: $\lim_{i \to \infty} p_{\tilde{X}(i)}(x) = \lim_{i \to \infty} \binom{i}{x} p^x (1 - p)^{i - x} = \frac{\lambda^x e^{-\lambda}}{x!} = p_X(x)$.

  23. [Figure: probability mass function of $\tilde{X}(40)$ for $k = 0, \ldots, 40$.]

  24. [Figure: probability mass function of $\tilde{X}(80)$.]

  25. [Figure: probability mass function of $\tilde{X}(400)$.]

  26. [Figure: probability mass function of $X$.]
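The limit can be checked numerically by comparing the two pmfs. A sketch using only the Python standard library ($\lambda = 20$ is an assumed value chosen to match the plotted range $k = 0, \ldots, 40$; the slides do not state it):

```python
import math

lam = 20.0   # assumed rate; chosen only to match the plotted range

def binom_pmf(k, i, p):
    # P(X = k) for X ~ Binomial(i, p)
    return math.comb(i, k) * p**k * (1 - p) ** (i - k)

def poisson_pmf(k, l):
    # P(X = k) for X ~ Poisson(l)
    return l**k * math.exp(-l) / math.factorial(k)

def sup_dist(i):
    # largest pointwise gap between the two pmfs on k = 0, ..., 40
    p = lam / i
    return max(abs(binom_pmf(k, i, p) - poisson_pmf(k, lam)) for k in range(41))

d40, d400 = sup_dist(40), sup_dist(400)   # the gap shrinks as i grows
```

The shrinking sup-distance mirrors the sequence of pmf plots above: by $i = 400$ the binomial pmf is visually indistinguishable from the Poisson pmf.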

  27. Types of convergence Law of Large Numbers Central Limit Theorem Monte Carlo simulation

  28. Moving average. The moving average $\tilde{A}$ of a discrete random process $\tilde{X}$ is $\tilde{A}(i) := \frac{1}{i} \sum_{j=1}^{i} \tilde{X}(j)$.

  29. Weak law of large numbers. Let $\tilde{X}$ be an iid discrete random process with mean $\mu_{\tilde{X}} := \mu$ and bounded variance $\sigma^2$. The average $\tilde{A}$ of $\tilde{X}$ converges in mean square to $\mu$.
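The weak law is easy to visualize in code: the empirical mean-square error of the moving average decays like $\sigma^2 / i$, exactly the rate the proof below derives. A NumPy sketch (the run counts and lengths are arbitrary choices):

```python
import numpy as np

rng = np.random.default_rng(2)

# 2000 independent runs of an iid standard Gaussian process
# (mu = 0, sigma^2 = 1), each of length 1000.
runs = rng.standard_normal((2000, 1000))

# Moving average A(i) along each run, for i = 1, ..., 1000.
avg = np.cumsum(runs, axis=1) / np.arange(1, 1001)

# Empirical mean-square error E[(A(i) - mu)^2] across the runs;
# the theory predicts sigma^2 / i.
mse = np.mean(avg**2, axis=0)
```

Plotting `mse` against $i$ on log axes should give a line of slope $-1$, i.e. the $\sigma^2 / i$ decay.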

  30. Proof. First compute $\mathrm{E}\left[\tilde{A}(i)\right]$.

  31. By definition, $\mathrm{E}\left[\tilde{A}(i)\right] = \mathrm{E}\left[ \frac{1}{i} \sum_{j=1}^{i} \tilde{X}(j) \right]$.

  32. By linearity of expectation, this equals $\frac{1}{i} \sum_{j=1}^{i} \mathrm{E}\left[\tilde{X}(j)\right]$.

  33. Since each $\tilde{X}(j)$ has mean $\mu$, $\mathrm{E}\left[\tilde{A}(i)\right] = \mu$.

  34. Proof. Next compute $\mathrm{Var}\left[\tilde{A}(i)\right]$.

  35. By definition, $\mathrm{Var}\left[\tilde{A}(i)\right] = \mathrm{Var}\left[ \frac{1}{i} \sum_{j=1}^{i} \tilde{X}(j) \right]$.

  36. Since the $\tilde{X}(j)$ are independent, the variance of the sum is the sum of the variances: $\mathrm{Var}\left[\tilde{A}(i)\right] = \frac{1}{i^2} \sum_{j=1}^{i} \mathrm{Var}\left[\tilde{X}(j)\right]$.

  37. Since each $\tilde{X}(j)$ has variance $\sigma^2$, $\mathrm{Var}\left[\tilde{A}(i)\right] = \frac{\sigma^2}{i}$.

  38. Proof. Finally, consider $\lim_{i \to \infty} \mathrm{E}\left[\left(\tilde{A}(i) - \mu\right)^2\right]$.

  39. Since $\mathrm{E}\left[\tilde{A}(i)\right] = \mu$, $\lim_{i \to \infty} \mathrm{E}\left[\left(\tilde{A}(i) - \mu\right)^2\right] = \lim_{i \to \infty} \mathrm{E}\left[\left(\tilde{A}(i) - \mathrm{E}\left[\tilde{A}(i)\right]\right)^2\right]$.

  40. This is exactly $\lim_{i \to \infty} \mathrm{Var}\left[\tilde{A}(i)\right]$.

  41. By the variance computation, $\lim_{i \to \infty} \mathrm{Var}\left[\tilde{A}(i)\right] = \lim_{i \to \infty} \frac{\sigma^2}{i}$.

  42. The limit is $0$, so $\tilde{A}$ converges to $\mu$ in mean square.

  43. Strong law of large numbers. Let $\tilde{X}$ be an iid discrete random process with mean $\mu_{\tilde{X}} := \mu$ and bounded variance $\sigma^2$. The average $\tilde{A}$ of $\tilde{X}$ converges with probability one to $\mu$.

  44. [Figure: moving average of an iid standard Gaussian sequence and the mean of the iid sequence, $i$ up to 50.]

  45. [Figure: same, $i$ up to 500.]

  46. [Figure: same, $i$ up to 5000.]

  47. [Figure: moving average of an iid geometric sequence with $p = 0.4$ and the mean of the iid sequence, $i$ up to 50.]

  48. [Figure: same, $i$ up to 500.]

  49. [Figure: same, $i$ up to 5000.]

  50. [Figure: moving average of an iid Cauchy sequence and the median of the iid sequence, $i$ up to 50; the average does not settle down.]

  51. [Figure: same, $i$ up to 500.]

  52. [Figure: same, $i$ up to 5000.]
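The Cauchy plots illustrate why the hypotheses of the law of large numbers matter: a Cauchy random variable has no mean or variance, and in fact the average of $n$ iid standard Cauchy variables is again standard Cauchy, so the moving average never concentrates. A NumPy sketch of this failure (run counts are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(3)

# 5000 independent runs of an iid standard Cauchy process, length 1000.
runs = rng.standard_cauchy((5000, 1000))
avg = np.cumsum(runs, axis=1) / np.arange(1, 1001)

# Spread of A(i) across runs: the interquartile range stays near the
# standard Cauchy IQR of 2 for every i, instead of shrinking like 1/sqrt(i).
iqr_10 = np.subtract(*np.percentile(avg[:, 9], [75, 25]))
iqr_1000 = np.subtract(*np.percentile(avg[:, 999], [75, 25]))
```

Contrast this with the Gaussian sketch above, where the spread of $\tilde{A}(i)$ shrinks like $\sigma / \sqrt{i}$.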

  53. Types of convergence Law of Large Numbers Central Limit Theorem Monte Carlo simulation

  54. Central limit theorem. Let $\tilde{X}$ be an iid discrete random process with mean $\mu_{\tilde{X}} := \mu$ and bounded variance $\sigma^2$. Then $\sqrt{i}\left(\tilde{A}(i) - \mu\right)$ converges in distribution to a Gaussian random variable with mean $0$ and variance $\sigma^2$. Equivalently, for large $i$ the average $\tilde{A}(i)$ is approximately Gaussian with mean $\mu$ and variance $\sigma^2 / i$.
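The CLT statement can be checked by simulation. A NumPy sketch with iid Uniform(0, 1) variables, for which $\mu = 1/2$ and $\sigma^2 = 1/12$ (the sample sizes are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(4)

# iid Uniform(0, 1): mu = 1/2, sigma^2 = 1/12.
i, trials = 1000, 5000
a = rng.uniform(0, 1, (trials, i)).mean(axis=1)   # A(i) in each trial

# By the CLT, sqrt(i) * (A(i) - mu) is approximately N(0, sigma^2).
z = np.sqrt(i) * (a - 0.5)
```

The sample mean of `z` is near $0$ and its sample variance near $1/12$; a histogram of `z` overlaid with the $\mathcal{N}(0, 1/12)$ pdf should match closely, as in the height-data figure below.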

  55. Height data. Example: data from a population of 25,000 people. We compare the histogram of the heights to the pdf of a Gaussian random variable fitted to the data.

  56. [Figure: histogram of the height data (60 to 76 inches) overlaid with the fitted Gaussian pdf.]

  57. Sketch of proof. The pdf of the sum of two independent random variables is the convolution of their pdfs: $f_{X+Y}(z) = \int_{-\infty}^{\infty} f_X(z - y)\, f_Y(y)\, \mathrm{d}y$. Repeated convolutions of any pdf with bounded variance result in a Gaussian!
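The convolution argument can be previewed numerically: convolving a discretized Uniform(0, 1) pdf with itself a few times already produces a near-Gaussian shape (the resulting density is the Irwin-Hall density). A NumPy sketch (the grid resolution and the choice $n = 5$ are arbitrary):

```python
import numpy as np

# Uniform(0, 1) pdf sampled on a grid of width dx.
dx = 0.01
f = np.ones(100)
g = f.copy()
n = 5                                 # number of summed uniforms
for _ in range(n - 1):
    g = np.convolve(g, f) * dx        # pdf of the running sum

# Grid for the sum (offset so the grid mean matches the true mean n/2).
x = (np.arange(g.size) + n * 0.5) * dx
mu, var = n * 0.5, n / 12.0           # mean and variance of the sum
gauss = np.exp(-((x - mu) ** 2) / (2 * var)) / np.sqrt(2 * np.pi * var)
max_gap = np.max(np.abs(g - gauss))   # already small after 4 convolutions
```

This is the picture on the next slide: with each convolution the density looks more like the Gaussian with the matching mean and variance.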

  58. [Figure: repeated convolutions of a pdf for $i = 1, \ldots, 5$, approaching a Gaussian shape.]
