Implementing Quantile Selection Models in Stata Mariel Siravegna - PowerPoint PPT Presentation

Implementing Quantile Selection Models in Stata Mariel Siravegna Ercio Munoz Georgetown University The Graduate Center, CUNY July 30, 2020 1 / 20

Non-random sample selection is a major issue in empirical work - A simple sample selection model can be written as the latent model Y ∗ = X ′ β + µ but Y ∗ is only observed if S=1 S = 1 ( Z ′ γ + ν ≥ 0 ) - Since the seminal work of Heckman (1979), much progress has been made in methods that extend the original model or relax some of its assumptions - And recently Arellano and Bonhomme (2017) proposed a copula-based method to correct for sample selection in quantile regression 2 / 20

Two Recent Applications 3 / 20

Maasoumi and Wang (JPE 2019) - In this paper the authors use the CPS between 1976-2013 to see how the gender wage gap vary across the wage distribution - They assess how selective participation of individuals in the labor market affects the gender gap 4 / 20

Comparison of Female and Male Wage CDF (Without correction) (Correcting for Selection) 5 / 20

Bollinger et al. (JPE 2019) - Survey earnings response is not random - In this paper the authors match the survey earnings responses to administrative records to see how response vary across the earnings distribution - They find that non-response rate follows an U shape across earnings and this produces an underestimation of inequality, which can be corrected using this copula-based approach 6 / 20

Bollinger et al. (JPE 2019) 7 / 20

Estimation 8 / 20

Three-step Algorithm of Arellano and Bohnomme (2017) Given an i.i.d sample ( Y i , Z i , S i ) , i = 1 , ..., N where Z i = ( X i , W i ) and assuming that quantile functions are linear: q ( τ , x ) = x ′ β τ , for all τ ∈ ( 0 , 1 ) and x ∈ X (3) the algorithm is as follows: 1. Estimation of the propensity score p ( z ) 2. Estimation of the dependence parameter or degree of selection ρ using this moment restriction: E [ I ( Y ≤ X ′ ˆ β τ ) − G ( τ , p ( z ) ; ρ ) | S = 1 , Z = z ] = 0 9 / 20

Second Step Taken to the sample by choosing a ρ that minimizes the following objective function: N L S i ϕ τ l ( z i )[ I { Y i ≤ X ′ β τ l ( ρ ) } − G ( τ l , p ( z ′ ∑ ∑ i ˜ ρ = argmin ρ � i ) ; ρ )] � ˆ i = 1 l = 1 where � . � is the Euclidean norm, τ 1 < τ 2 < · · · < τ L is a finite grid on ( 0 , 1 ) , and the instrument functions are defined as ϕ τ l ( z i ) , G ( τ l , p ( z ′ i ) ; ρ ) is the conditional copula indexed by a parameter ρ , and: N i β ) + + ( 1 − G τ , i ( Y i − X ′ S i [ G τ i ( Y i − X ′ i β ) − ] ˜ ∑ β τ ( ρ ) = argmin β i = 1 where a + = max { a , 0 } , a − = max {− a , 0 } , and G τ , i = G ( τ , p ( z ) ; ρ ) . 10 / 20

Third Step ρ , ˆ 3. Given the estimated ˆ β τ can be estimated by minimizing a rotated check function of the form: N i β ) + + ( 1 − ˆ S i [ ˆ G τ , i ( Y i − X ′ G τ , i )( Y i − X ′ ˆ i β ) − ] ∑ β τ = argmin β i = 1 where ˆ β τ will be a consistent estimator of the τ -th quantile regression coefficient. Note that this step is unnecessary if the researcher is interested on the quantiles included in the finite grid of step 2. 11 / 20

Implementing the method in Stata 12 / 20

Syntax � � � � � � � � depvar S = varlist S ) qregsel depvar indepvars if in , select( quantile( # ) grid min(grid minvalue) grid max(grid maxvalue) � � grid length(grid lengthvalue) copula( copula ) noconstant plot 13 / 20

Empirical Example 14 / 20

Wages of women used in Heckman command 15 / 20

Grid for minimization 16 / 20

Counterfactual distribution: Corrected versus uncorrected quantiles 17 / 20

Conclusions 18 / 20

Conclusions - We have introduced a new Stata command that implements a copula-based method to correct for sample selection in quantile regressions proposed in Arellano and Bonhomme (2017) - This command may be useful for Stata users doing empirical work, as we have illustrated with the case of two recently published papers - The code is for now only available in our github repo - Questions, comments, and suggestions are welcome 19 / 20

References - Arellano, M., and S. Bonhomme (2017), “Quantile Selection Models with an Application to Understanding Changes in Wage Inequality.” Econometrica 85(1) - Bollinger, C., B. Hirsch, C. Hokayem, and J. Ziliak (2019), “Trouble in the Tails? What We Know about Earnings Nonresponse Thirty Years after Lillard, Smith, and Welch.” Journal of Political Economy 127(5). - Maasoumi, E., and L. Wang (2019), “The Gender Gap between Earnings Distributions.” Journal of Political Economy 127(5). - Munoz, E., and M. Siravegna (2020), “Implementing Quantile Selection Models in Stata.” 20 / 20

Implementing Quantile Selection Models in Stata Mariel Siravegna - PowerPoint PPT Presentation

Implementing Quantile Selection Models in Stata Mariel Siravegna Ercio Munoz Georgetown University The Graduate Center, CUNY July 30, 2020 1 / 20 Non-random sample selection is a major issue in empirical work - A simple sample selection

Generalized Quantile Regression in Stata Matthew Baker, Hunter College David Powell, RAND Travis

Bayesian hierarchical models in Stata Nikolay Balov StataCorp LP 2016 Stata Conference Nikolay

QUANTILE AUTOREGRESSION ROGER KOENKER AND ZHIJIE XIAO Abstract. We consider quantile

Quantile plots: New planks in an old campaign Nicholas J. Cox Department of Geography 1

Quantile Regression in R: For Fin and Fun Roger Koenker University of Illinois at

Applications of Normal Quantile Plots David Rose June 13, 2011 David Rose () Applications of

) Quantile Estimation Peter J. Haas CS 590M: Simulation Spring Semester 2020 1 / 20 Quantile

Checking Assumptions Normal distributions: use probability plot (or quantile-quantile plot);

Python applications in Stata 16 BPLIM 2020 Portuguese Stata Conference BPLIM Python

Bayesian Analysis using Stata Bill Rising StataCorp LP 2016 Brazilian Stata Users Group Meeting

Estimating dynamic stochastic general equilibrium models in Stata David Schenck Senior

Nonlinear dynamic stochastic general equilibrium models in Stata 16 David Schenck Senior

Implementing machine learning methods in Stata Austin Nichols 6 September 2018 Austin Nichols

Quantile regression: Basics and recent advances J. M.C. Santos Silva University of Surrey 2019

Nonlinear dynamic stochastic general equilibrium models in Stata 16 David Schenck Senior

Stata: Basics, Shortcuts, and Integration with Introduction LaTeX Stata Syntax and Shortcuts

Selective inference: a conditional perspective Xiaoying Tian Harris Joint work with Jonathan

MOVING THE WORLD AT WORK Wilson R. Jones President and Chief Executive Officer Oshkosh

s r rt

Family Advisors Webinar 2: Identifying and Training Advisors Pam Dardess, MPH Principal

Policy Exploration for JITDs (Java) By Team Datum Cracking Results from Paper vs. Observed

Selection Detection and Two-Sample-Testing: Generalized Greenwood Statistics and their

Introduction to Machine Learning Active Learning Barnabs Pczos 1 Credits Some of the

From selective inference to adaptive data analysis Xiaoying Tian Harris December 9, 2016

Sambuz

Useful Links

Newsletter

Mail Us