variance estimation for survey weighted data using
play

Variance Estimation for Survey-Weighted Data Using Bootstrap - PowerPoint PPT Presentation

Variance Estimation for Survey-Weighted Data Using Bootstrap Resampling Methods: 2013 Methods-of-Payment Survey Questionnaire Heng Chen and Q. Rallye Shen 2017 STATA The views expressed in this presentation are those of the author. No


  1. Variance Estimation for Survey-Weighted Data Using Bootstrap Resampling Methods: 2013 Methods-of-Payment Survey Questionnaire Heng Chen and Q. Rallye Shen 2017 STATA The views expressed in this presentation are those of the author. No responsibility for them should be attributed to the Bank of Canada. June 9, 2017 Heng Chen Variance Estimation for Survey-Weighted Data Using Bootstrap Resampling Methods: 2013 (The views expressed in this presentation are those of the author. No re June 9, 2017 1 / 10

  2. Outline Basic concepts Calibration estimator in survey: related to incomplete data estimator in econometrics Variance estimation of the calibration estimator Result

  3. Basic Concepts Let U be a finite population of size N and the population total be T y = ∑ i ∈ U y i . A sample s is selected according to a sampling design p ( s ) with inclusion probabilities, π 1 , ..., π N . Calibration estimator: T y = ∑ � w i y i i ∈ s where w i is the calibrated weight. � � � We want to estimate Var T y .

  4. Calibrated Weight Each unit i is assigned a design weight : � � d i = π − 1 ; i ∈ s . i Calibration adjustment : The weight d i are adjusted to match known(imputed) population counts. calibration d i → w i where w i from minimizing �� w i � � w i � � log ( w i ∑ ) − + 1 d i d i d i d i i ∈ s subject to ∑ i ∈ s w i x i = X .

  5. Problem Recall � � Var ( � ∑ T y ) = Var w i y i i ∈ s where there are two sources of randomness in � T y : Sampling variation in the s ; 1 Sampling variation in the calibrated weight w i . 2 However, people usually ignore the sampling variation in w i , and this is problematic.

  6. Problem (cont.) The estimated variance can be written as y i − � y i − � d ij − d i d j β x i β x i T y ) ≈ ∑ Var ( � � d ij d i d j i , j ∈ s i ] − 1 [ ∑ s y i x � where � β is [ ∑ s x i x � i ] . Compare to w ij − w i w j y i y j ∗ ( � T y ) ≈ ∑ � Var w ij w i w j i , j ∈ s which ignores the sampling variation in w i .

  7. Bootstrap in STATA Bootstrap is easy to implement. Use the ipfraking and bsweights commands in Stata (Kolenikov, 1 2010, 2014). Generate replicate calibrated weights instead of recomputing the 2 statistics for each resample.

  8. Implementation in Stata Step 1: Input the initial weight ( d ). Step 2: Generate the calibrated weight ( w ) using the ipfraking command. calibration Steps: d i → w i Step 3: Generate the replicate raking weights using the bsweights command. Step 4: Declare the bootstrap survey environment in Stata: svyset [pw=], vce(bootstrap) bsrw()

  9. Result Ignoring w Considering w Cash on Hand Mean 1.03 1.03 Variance 1.51 0.85 Usage of CTC Mean 0.93 0.93 Variance 2.10 1.24 Note: The numbers in the second and third columns are divided by the numbers in the first column. CTC stands for the contactless feature of a credit card.

Recommend


More recommend