Multivariate Sarmanov Count Data Models Eugenio J. Miravete - PowerPoint PPT Presentation

Introduction Sarmanov Empirical Summary Multivariate Sarmanov Count Data Models Eugenio J. Miravete University of Texas at Austin & Centre for Economic Policy Research February 12, 2009 Eugenio J. Miravete Multivariate Sarmanov Count Data Models

Introduction Sarmanov Empirical Summary Disclaimer Info Motivation Literature Disclaimer: Despite what you are about to see, I do not consider myself an econometrician but rather an empirical IO economist who happen to run into obscure econometric problems. Eugenio J. Miravete Multivariate Sarmanov Count Data Models

Introduction Sarmanov Empirical Summary Disclaimer Info Motivation Literature Fact: I first encountered the Sarmanov distribution when searching for arguments to convince an NSF reviewer of a proposal dealing with the estimation of a structural model of nonlinear pricing competition. Eugenio J. Miravete Multivariate Sarmanov Count Data Models

Introduction Sarmanov Empirical Summary Disclaimer Info Motivation Literature Refreshing Your Memory: Andrews, Econometrica’02, pp. 119-162: k -step bootstrap: Under very general conditions, we can approximate the parameter estimates of each bootstrap sample by taking k iterations of a Newton-Raphson procedure starting from the ML parameter estimate using the full sample (it also works with (Gauss-Newton). Eugenio J. Miravete Multivariate Sarmanov Count Data Models

Introduction Sarmanov Empirical Summary Disclaimer Info Motivation Literature More Memory Refreshing: Andrews, Econometrica’00, pp. 399-405: Rescaled bootstrap (a variation of subsampling) is appropriate to obtain consistent inference when parameters may be on the boundary defined by a set of linear or nonlinear inequality constraints. Eugenio J. Miravete Multivariate Sarmanov Count Data Models

Introduction Sarmanov Empirical Summary Disclaimer Info Motivation Literature A Request: Questions: General: Do you see any concerns in combining k -step bootstrap with subsampling or rescaled bootstrap? Particular: Is there anything in my model that makes the combination of these two methods questionable? Eugenio J. Miravete Multivariate Sarmanov Count Data Models

Introduction Sarmanov Empirical Summary Disclaimer Info Motivation Literature Application: Competing with Menus of Tariff Options Do firms offer a similar number of tariff options? If they do beyond what the heterogeneity of consumers and cost of offering these options justifies, we should expect them to be positively correlated. ⇒ The number of tariff options are strategic complements. Do firms, on the contrary, try to differentiate themselves by offering a very different menu of tariff options? In this case, the number of tariff options should be negatively correlated. ⇒ The number of tariff options are strategic substitutes. Eugenio J. Miravete Multivariate Sarmanov Count Data Models

Introduction Sarmanov Empirical Summary Disclaimer Info Motivation Literature Empirical Challenge We need an econometric model that allows for the possibility that counts are negatively correlated. The model also needs to accommodate for the possibility of underdispersion of counts that, while less frequent, it appears to be present in the data. Eugenio J. Miravete Multivariate Sarmanov Count Data Models

Introduction Sarmanov Empirical Summary Disclaimer Info Motivation Literature Over and Underdispersion: I am aware of only two univariate models can accommodate both features (ignore hurdle and zero-inflated models): Efron (1986): Double Poisson model. Winkelmann (1995): Count data model based on a gamma distributed renewal process. Eugenio J. Miravete Multivariate Sarmanov Count Data Models

Introduction Sarmanov Empirical Summary Disclaimer Info Motivation Literature Existing Multivariate Count Data Regression Models: There are very few: Kocherlakota-Kocherlakota (1993); Marshall-Olkin (1990); Gourieroux-Monfort-Trognon (1984). They are all restricted to the case where the correlation coefficient is positive and of limited range. Correlation is modeled as a consequence of the same unobserved heterogeneity that explains over or underdispersion. Most of the times the over/underdispersion of all counts and the correlation among them is modeled as a function of a single parameter. Eugenio J. Miravete Multivariate Sarmanov Count Data Models

Introduction Sarmanov Empirical Summary General DPS GS Features of the Present Model It can accommodate both over and underdispersion of the distribution of each count separately. Both positive and negative correlations are possible. Dispersion and correlation depends on different parameters of the model. The model can easily be extended beyond the bivariate case. The estimation is not particularly time consuming (bootstraping is a different matter): the likelihood function can always be written in closed form and thus, simulation is not needed to obtain the parameter estimates. The range of the correlation coefficient may be smaller than [ − 1 , 1] and is effectively bounded by the estimates of the rest of parameters of the model. Eugenio J. Miravete Multivariate Sarmanov Count Data Models

Introduction Sarmanov Empirical Summary General DPS GS The Sarmanov Family of Distributions Let y k , k =1 , 2 denote two random variables with univariate probability density function f k ( y k ) on A k � R and with mean and variance: ∞ ∞ � � ( y k − µ k ) 2 f k ( y k ) dy k . σ 2 µ k = y k f k ( y k ) dy k and k = −∞ −∞ This bivariate Sarmanov probability density function is written as: f 12 ( y 1 , y 2 ) = f 1 ( y 1 ) f 2 ( y 2 ) × [1 + ω 12 ψ 1 ( y 1 ) ψ 2 ( y 2 )] , where ψ k ( y k ) , k = 1 , 2 are bounded and nonconstant mixing functions such as: ∞ � ψ k ( y k ) f k ( y k ) dy k = 0 . −∞ Eugenio J. Miravete Multivariate Sarmanov Count Data Models

Introduction Sarmanov Empirical Summary General DPS GS Sarmanov Distributions on Positive Orthants Assume that marginal distributions have support on R + . Lee (1996) shows that the mixing functions are then given by: ψ k ( y k ) = exp( − y k ) − L k (1) , ∀ y k ≥ 0 , where: ∞ � L k ( ζ ) = exp( − ζy k ) f k ( y k ) dy k . 0 is the Laplace transform of the assumed marginal distribution evaluated at ζ = 1 . Eugenio J. Miravete Multivariate Sarmanov Count Data Models

Introduction Sarmanov Empirical Summary General DPS GS Constraints For the Sarmanov distribution to be properly defined we need: ω 12 ∈ R : 1 + ω 12 ψ 1 ( y 1 ) ψ 2 ( y 2 ) ≥ 0 ∀ y 1 , y 2 , or equivalently: ω 12 ≤ ω 12 ≤ ω 12 , where: − 1 max { L 1 (1) L 2 (1) , [1 − L 1 (1)][1 − L 2 (1)] } , ω 12 = 1 max { L 1 (1)[1 − L 2 (1)] , [1 − L 1 (1)] L 2 (1) } . ω 12 = Eugenio J. Miravete Multivariate Sarmanov Count Data Models

Introduction Sarmanov Empirical Summary General DPS GS Correlation Additionally: E [ y 1 y 2 ] = µ 1 µ 2 + ω 12 ν 1 ν 2 , ∞ � y k ψ k ( y k ) f k ( y k ) dy k = − L ′ ν k = k (1) − L k (1) µ k , −∞ ρ 12 = ω 12 ν 1 ν 2 . σ 1 σ 2 Eugenio J. Miravete Multivariate Sarmanov Count Data Models

Introduction Sarmanov Empirical Summary General DPS GS Double Poisson Distribution Let y k = 0 , 1 , 2 , . . . be distributed according to a double Poisson distribution with parameters µ k and θ k , conditional on a set of regressors x k in a sample with i = 1 , 2 , . . . , n observations. The probability function of a double Poisson distribution is: ˜ f k ( y k | µ k , θ k ) = c ( µ k , θ k ) f k ( y k | µ k , θ k ) , � θ k y k θ k exp( − θ k µ k ) exp( − y k ) y ky k � eµ k � f k ( y k | µ k , θ k ) = , y k ! y k ∞ f k ( y k | µ k , θ k ) ≃ 1 + 1 − θ k � 1 � 1 � 1 + , c ( µ k , θ k ) = 12 θ k µ k θ k µ k y k =0 Eugenio J. Miravete Multivariate Sarmanov Count Data Models

Introduction Sarmanov Empirical Summary General DPS GS Over and Underdispersion E[ y ki | x ki ] ≃ µ ki , ki = Var[ y ki | x ki ] ≃ µ ki σ 2 , θ k x ′ � � µ ki = exp ki β k . Eugenio J. Miravete Multivariate Sarmanov Count Data Models

Introduction Sarmanov Empirical Summary General DPS GS Approximations √ 2 πz · z z · exp( − z ) for z = y k and Use Stirling’s formula z ! ≃ z = θ k y k , respectively to approximate the double Poisson frequency function: f k ( y k | µ k , θ k ) ≃ θ k exp( − θ k µ k ) ( θ k µ k ) θ k y k Γ( θ k y k + 1) , so that the approximation to the corresponding Laplace transform evaluated at ζ = 1 is: ∞ ( θ k µ k ) θ k y k exp( − y k ) � L k (1 | µ k , θ k ) ≃ c ( µ k , θ k ) θ k exp( − θ k µ k ) . Γ( θ k y k + 1) y k =0 Eugenio J. Miravete Multivariate Sarmanov Count Data Models

Introduction Sarmanov Empirical Summary General DPS GS More Approximations The approximate mixing function for the double Poisson-Sarmanov distribution ψ k ( y k | µ k , θ k ) is: ∞ ( θ k µ k ) θ k y k exp( − y k ) � exp( − y k ) − c ( µ k , θ k ) θ k exp( − θ k µ k ) , Γ( θ k y k + 1) y k =0 and the approximate mixing function weighted mean ν k ( µ k , θ k ) is: ∞ ( θ k µ k ) θ k y k exp( − y k ) � c ( µ k , θ k ) θ k exp( − θ k µ k ) ( y k − µ k ) . Γ( θ k y k + 1) y k =0 Eugenio J. Miravete Multivariate Sarmanov Count Data Models

Multivariate Sarmanov Count Data Models Eugenio J. Miravete - PowerPoint PPT Presentation

Introduction Sarmanov Empirical Summary Multivariate Sarmanov Count Data Models Eugenio J. Miravete University of Texas at Austin & Centre for Economic Policy Research February 12, 2009 Eugenio J. Miravete Multivariate Sarmanov Count

Outline Multivariate Data 1 Multivariate Parametric Methods Multivariate Normal Distribution 2

Making Every Contact Count (MECC) Content What is Making Every Contact Count? Who is

Recitation 4 Question 3: Flying off the handle Parent Child fork() count++; print(count); 1

Reading multivariate data Surajit Ray Reader, University of Glasgow DataCamp Multivariate

Multivariate t-distributions Surajit Ray Reader, University of Glasgow DataCamp Multivariate

Characteristics of cosymmetric association rules Jana Michal Burda Marian Mindek Sarmanov

Multivariate and Partially observed models Erik Lindstrm n T Briefly on multivariate models N

Multivariate Ordination Analyses: Principal Component Analysis Dilys Vela Tatiana Boza Tatiana

Regression Diagnostics and the Forward Search 3. A Single Multivariate Sample Anthony Atkinson,

Count 2020 2020 Count The 2020 Everybody Counts Point-in-Time Count effort is one way the City of

Multivariate Linear Regression Max Turgeon STAT 4690Applied Multivariate Analysis

Robust Statistics Part 2: Multivariate location and scatter Peter Rousseeuw LARS-IASC School,

Advanced PHP Dr. Steven Bitner A/B and Multivariate testing Why use multivariate testing If

Multivariate normal distribution Surajit Ray Reader, University of Glasgow DataCamp

Multivariate Normal Distribution Max Turgeon STAT 4690Applied Multivariate Analysis Building

Tests for Multivariate Linear Models with the car Package John Fox McMaster University Hamilton,

Lecture 3 - Cosmological parameter dependence of the temperature power spectrum - Polarisation of

Alfvn Eigenmodes in Spherical Tokamaks S.E.Sharapov, M.P.Gryaznevich, and the MAST Team

Implications of dedicated seismometer cancellation for Advanced LIGO M. Coughlin measurements

GNSS Ocean Reflected Signals Per Heg DTU Space Technical University of Denmark Content

Engineering Notebook What is an Engineering Notebook? An engineering notebook helps a team to

Indigenous Names for South African Frogs and Aspects of Folk Taxonomy, A Zululand Case Study

The Ubiquitous Research Paper: A Two-Part Series Research Writing Roundtable Part 1: The

The Relationship Between Insects and Toads with Water Level Thesis Paragraph In our experiment,