A Primer on Asymptotics Eric Zivot Department of Economics - PDF document

A Primer on Asymptotics Eric Zivot Department of Economics University of Washington September 30, 2003 Revised: January 7, 2013 1 Introduction The two main concepts in asymptotic theory covered in these notes are • Consistency • Asymptotic Normality Intuition • consistency: as we get more and more data, we eventually know the truth • asymptotic normality: as we get more and more data, averages of random variables behave like normally distributed random variables 1.1 Motivating Example Let  1        denote an independent and identically distributed (iid ) random sample with  [   ] =  and var (   ) =  2  We don’t know the probability density function (pdf)  (    θ )  but we know the value of  2  The goal is to estimate the mean value  from the random sample of data. A natural estimate is the sample mean X   = 1   = ¯ ˆ    =1 Using the iid assumption, straightforward calculations show that  ) =  2  [ˆ  ] =  var(ˆ   1

Since we don’t know  (    θ ) we don’t know the pdf of ˆ  All we know about the pdf  ) =  2  ) =  2 of  is that  [ˆ  ] =  and var (ˆ   However, as  → ∞  var (ˆ  → 0 and the pdf of ˆ  collapses at  Intuitively, as  → ∞  ˆ  converges in some sense to  In other words, the estimator ˆ  is consistent for  Furthermore, consider the standardized random variable µ ˆ ¶ = √   −  ˆ = ˆ  −   −  p q  =   var(ˆ  )  2  For any value of   [  ] = 0 and var (  ) = 1  but we don’t know the pdf of  since we don’t know  (    )  Asymptotic normality says that as  gets large, the pdf of  is well approximated by the standard normal density. We use the short-hand notation µ ˆ ¶  = √   −   ∼  (0  1) (1)   to represent this approximation. The symbol “ ∼ ” denotes “asymptotically distributed as”, and represents the asymptotic normality approximation. Dividing both sides of (1) by √  and adding  the asymptotic approximation may be re-written as µ ¶   2  =  +   √  ˆ ∼   (2)  The above is interpreted as follows: the pdf of the estimate ˆ  is asymptotically distributed as a normal random variable with mean  and variance  2   The quantity  2  is often referred to as the asymptotic variance of ˆ  and is denoted avar (ˆ  )  The square root of avar (ˆ  ) is called the asymptotic standard error of ˆ  and is denoted ASE( ˆ  )  With this notation, (2) may be re-expressed as ¡  ) 2 ¢   ˆ  ∼  (  avar(ˆ  )) or ˆ  ∼   ASE(ˆ  (3) The quantity  2 in (2) is sometimes referred to as the asymptotic variance of √  (ˆ  −  )  The asymptotic normality result (2) is commonly used to construct a con fi dence interval for  For example, an asymptotic 95% con fi dence interval for  has the form p ˆ  ± 1  96 × avar(ˆ  ) = 1  96 × ASE(ˆ  )  This con fi dence interval is asymptotically valid in that, for large enough samples, the probability that the interval covers  is approximately 95%. The asymptotic normality result (3) is also commonly used for hypothesis testing. For example, consider testing the hypothesis  0 :  =  0 against the hypothesis  1 :  6 =  0  A commonly used test statistic is the t-ratio µ ˆ ¶  ) = √    =  0 = ˆ  −  0  −  0  (4) ASE(ˆ  2

If the null hypothesis  0 :  =  0 is true then the asymptotic normality result (1) shows that the t-ratio (4) is asymptotically distributed as a standard normal random variable. Hence, for  ∈ (0  1) we can reject  0 :  =  0 at the  × 100% level if ¯ ¯ ¯   =  0 ¯   1 −  2  where  1 −  2 is the (1 −  2) × 100% quantile of the standard normal distribution. For example, if  = 0  05 then  1 −  2 =  0  975 = 1  96  Remarks 1. A natural question is: how large does  have to be in order for the asymptotic distribution to be accurate? Unfortunately, there is no general answer. In some cases (e.g.,   ∼  Bernoulli), the asymptotic approximation is accurate for  as small as 15  In other cases (e.g.,   follows a near unit root process),  may have to be over 1000  If we know the exact fi nite sample distribution of ˆ  then, for example, we can evaluate the accuracy of the asymptotic normal approximation for a given  by comparing the quantiles of the exact distribution with those from the asymptotic approximation. 2. The asymptotic normality result is based on the Central Limit Theorem . This type of asymptotic result is called fi rst-order because it can be derived from a fi rst-order Taylor series type expansion. More accurate results can be derived, in certain circumstances, using so-called higher-order expansions or approximations. The most common higher-order expansion is the Edgeworth expansion . Another related approximation is called the saddlepoint approximation . 3. We can use Monte Carlo simulation experiments to evaluate the asymptotic approximations for particular cases. Monte Carlo simulation involves using the computer to generate pseudo random observations from  (    θ ) and using these observations to approximate the exact fi nite sample distribution of ˆ  for a given sample size  The error in the Monte Carlo approximation depends on the number of simulations and can be made very small by using a very large number of simulations. 4. We can often use bootstrap techniques to provide numerical estimates for avar(ˆ  ) and asymptotic con fi dence intervals. These are alternatives to the analytic formulas derived from asymptotic theory. An advantage of the bootstrap is that under certain conditions, it can provide more accurate approximations than the asymptotic normal approximations. Bootstrapping, in contrast to Monte Carlo simulation, does not require specifying the distribution of  In particular, nonparametric bootstrapping relies on resampling from the observed data. Parametric bootstrapping relies on using the computer to generate pseudo random observations from  (    ˆ θ )  where ˆ θ is the sample estimate of θ  3

 2 is a consistent estimate 5. If we don’t know  2  we have to estimate avar (ˆ  )  If ˆ for  2  then we can compute a consistent estimate for the asymptotic variance of √  (ˆ  2 for  2 in (2) and compute an estimate for avar (ˆ  −  ) by plugging in ˆ  )  2  ) = ˆ d avar(ˆ   This gives rise to the asymptotic approximation ³  ) 2 ´    d ∼  (  d ˆ  avar(ˆ  )) or ˆ  ∼  ASE(ˆ  (5) which is typically less accurate than the approximation (2) because of the estimate error in d avar(ˆ  )  2 Probability Theory Tools The main statistical tool for establishing consistency of estimators is the Law of Large Numbers (LLN). The main tool for establishing asymptotic normality is the Central Limit Theorem (CLT). There are several versions of the LLN and CLT, that are based on various assumptions. In most textbooks, the simplest versions of these theorems are given to build intuition. However, these simple versions often do not technically cover the cases of interest. An excellent compilation of LLN and CLT results that are applicable for proving general results in econometrics is provided in White (1984). 2.1 Laws of Large Numbers Let  1        be a iid random variables with pdf  (  θ )  For a given function  de fi ne the sequence of random variables based on the sample  1 =  (  1 )   2 =  (  1   2 )  . . .   =  (  1        )  P  For example, let  ∼  (   2 ) so that θ = (   2 ) and de fi ne   = ¯   = 1  =1     This notation emphasizes that sample statistics are functions of the sample size and can be treated as a sequence of random variables. De fi nition 1 Convergence in Probability Let  1        be a sequence of random variables. We say that   converges in probability to a constant, or random variable,  and write    →  4

A Primer on Asymptotics Eric Zivot Department of Economics - PDF document

A Primer on Asymptotics Eric Zivot Department of Economics University of Washington September 30, 2003 Revised: January 7, 2013 1 Introduction The two main concepts in asymptotic theory covered in these notes are Consistency

Asymptotics of symmetric functions with applications to Setup Asymptotics of statistical

Table S2. Gene-specific PCR primer pairs for all validated SBSs. Forward primer Reverse primer

Asymptotics Will Perkins January 22, 2013 Asymptotics In many theorems and questions in

Statistical mechanics via Answers: GUE asymptotics of symmetric functions Probability via Schur

Foundations of Computer Science Lecture 9 Sums And Asymptotics Computing Sums Asymptotics:

Tariff Primer: A Graphic Presentation of the Fordney- Tariff Primer: A Graphic Presentation of the

Linac Simulation Linac Simulation Primer Primer J.-F. Ostiguy APC ostiguy@fnal.gov September

Wald Test Asymptotics of LRT Lecture 21 Biostatistics 602 - Statistical Inference . . . .

What is this talk about? Applied Asymptotics in R an R package bundle Examples of the use of

Asymptotics of radiation fields in asymptotically Minkowski spacetimes Dean Baskin joint with

Cauchy-Riemann (CR) Manifolds, Szeg o kernel asymptotics and Morse inequalities on CR-manifolds

Data Asymptotics Dr. Jarad Niemi STAT 544 - Iowa State University February 7, 2018 Jarad Niemi

Sum of matrix entries of representations of the symmetric group and its asymptotics Dario De

On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I 2 R)

From the master equation to mean field game asymptotics Daniel Lacker Division of Applied

M-Estimation under High-Dimensional Asymptotics DLD, Andrea Montanari 2014-05-01 DLD, Andrea

A National Web Conference on the Purpose and Demonstration of the Health IT Hazard Manager and

CrIMSS Error Modeling with ATMS Proxy Data Bill Blackwell, Laura Jairam, Vince Leslie, Michael

Comparison of Bayesian and Frequentisot Inference 18.05 Spring 2014 Jeremy Orloff and Jonathan

Recent Advances in Post-Selection Statistical Inference Robert Tibshirani, Stanford University

Day 4: Resampling Methods Lucas Leemann Essex Summer School Introduction to Statistical Learning

MARKOV CHAIN MONTE CARLO METHODS MARKOV CHAIN MONTE CARLO METHODS MARKO LAINE, FMI MARKO LAINE,

State-of-the-Art ! 30-85 errors are made per 1000 lines of source CS 619 Introduction to OO Design

Lesson 3 Approximating Fourier series 1 Last lecture, we saw that the trapezoidal rule was

Sambuz

Useful Links

Newsletter

Mail Us