Mixed models for binary and count data



1. Mixed models for binary and count data. Rasmus Waagepetersen, Department of Mathematics, Aalborg University, Denmark. October 9, 2017.

2. Variance for binomial and Poisson. For binomial and Poisson variables, the variance is determined by the mean.
Y binomial b(n, p): E Y = np, Var Y = np(1 − p).
Binary case, n = 1: E Y = p, Var Y = p(1 − p).
Y Poisson with mean λ: E Y = λ, Var Y = λ.
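
As a quick illustration (my own sketch, not from the slides), the variance formulas can be checked by simulation in R:

set.seed(1)
n <- 10; p <- 0.3; lambda <- 4
yb <- rbinom(1e5, size = n, prob = p)                  # binomial draws
yp <- rpois(1e5, lambda = lambda)                      # Poisson draws
c(empirical = var(yb), theoretical = n * p * (1 - p))  # should agree closely
c(empirical = var(yp), theoretical = lambda)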

3. Overdispersion. Binomial and Poisson are the default models for binary and count data. In some applications we see larger variability in the data than predicted by the variance formulas for the binomial or Poisson. This is called overdispersion and can be due to correlation in the data, latent factors, biological heterogeneity, genetics, etc. Latent factors can be modeled explicitly using random effects, i.e. mixed models for binary and count data.
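
A minimal simulation sketch (my own illustration; the intercept 0.5 and standard deviation 1 are assumed values) of how a latent factor produces overdispersion: the counts are Poisson given a subject-specific effect, but marginally the variance exceeds the mean.

set.seed(1)
u <- rnorm(1e5, mean = 0, sd = 1)          # latent factor
y <- rpois(1e5, lambda = exp(0.5 + u))     # Poisson given the latent factor
c(mean = mean(y), variance = var(y))       # variance clearly larger than the mean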

4. Wheezing data. The wheezing (Ohio) data has variables resp (binary indicator of wheezing status), id, age (of child) and smoke (binary, mother smoker or not).
[Figure: aggregated wheeze proportion plotted against age; black = smoke, red = no smoke.]

5. Let Y_ij denote the wheezing status of the i-th child at the j-th age. Assuming Y_ij is b(1, p_ij), we try the logistic regression
logit(p_ij) = β0 + β1 age_j + β2 smoke_i.
Assuming independence between observations from the same child, and letting Y_i· be the sum of the observations from the i-th child,
Var Y_i· = Var(Y_i1 + Y_i2 + Y_i3 + Y_i4) = Var Y_i1 + Var Y_i2 + Var Y_i3 + Var Y_i4 = p_i1(1 − p_i1) + p_i2(1 − p_i2) + p_i3(1 − p_i3) + p_i4(1 − p_i4).
Note: the variance of Y_i· is the same for all children with the same value of smoke. We can calculate the above theoretical variance from the fitted model and compare it with the empirical variances.
Smoke = 0: theoretical 0.58, empirical 1.22. Smoke = 1: theoretical 0.48, empirical 0.975.
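
A sketch of how this comparison could be computed in R (my own code, assuming the ohio data frame and the glm fit shown on slide 7; the slide itself does not give the code):

p.hat   <- fitted(fit)                                # fitted p_ij from the logistic regression
theo    <- tapply(p.hat * (1 - p.hat), ohio$id, sum)  # sum_j p_ij(1 - p_ij) per child
ysum    <- tapply(ohio$resp, ohio$id, sum)            # Y_i. per child
smoke.i <- tapply(ohio$smoke, ohio$id, max)           # smoking status per child
tapply(theo, smoke.i, mean)                           # theoretical variances by smoke
tapply(ysum, smoke.i, var)                            # empirical variances by smoke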

6. Issue: observations from the same child are correlated; if we know the first observation is non-wheeze, then the three remaining observations are very likely non-wheeze too. The correlation can be due to genetics or to the child's environment (more or less polluted). We model these effects explicitly using a random effect:
logit(p_ij) = β0 + β1 age_j + β2 smoke_i + U_i,
where the U_i are N(0, τ²) and independent among children. Such a model can be fitted by the R procedure glmer, with syntax closely related to lmer and glm.

7. Logistic regression
> fit=glm(resp~age+smoke,family=binomial,data=ohio)
Coefficients:
            Estimate Std. Error z value Pr(>|z|)
(Intercept) -1.88373    0.08384 -22.467   <2e-16 ***
age         -0.11341    0.05408  -2.097   0.0360 *
smoke        0.27214    0.12347   2.204   0.0275 *
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
According to the above results, age and smoke are both significant at the 5% level.

8. Mixed model analysis
> fiter=glmer(resp~age+smoke+(1|id),family=binomial,data=ohio)
> summary(fiter)
Random effects:
 Groups Name        Variance Std.Dev.
 id     (Intercept) 5.491    2.343
Number of obs: 2148, groups: id, 537
Fixed effects:
            Estimate Std. Error z value Pr(>|z|)
(Intercept) -3.37396    0.27496 -12.271   <2e-16 ***
age         -0.17677    0.06797  -2.601   0.0093 **
smoke        0.41478    0.28705   1.445   0.1485
Now only age is significant at the 5% level. Note the large variance 5.491 for the U_i.

9. The variance 5.491 corresponds to standard deviation 2.343. This means an approximate 95% probability interval for U_i is [−4.686, 4.686]. A large part of the variation is thus explained by the U_i relative to the fixed effects.
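
As a worked check (my own illustration), the interval is just plus or minus two standard deviations, and on the probability scale it moves the fitted wheeze probability from essentially zero to almost 0.8:

sd.u <- sqrt(5.491)                            # 2.343
c(-2, 2) * sd.u                                # approximately [-4.686, 4.686]
eta <- -3.37396 - 0.17677 * 0 + 0.41478 * 0    # fixed-effects part at age = 0, smoke = 0
plogis(eta + c(-2, 0, 2) * sd.u)               # wheeze probability across the U_i range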

10. Poisson regression with random effects. For count data we can also add random effects to the model. Recall that in this case we model the logarithm of the mean λ_i for the i-th observation Y_i:
log λ_i = α + β z_i + U_i.
Again we use glmer, but now with family poisson.
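
A hedged sketch of the call (the data frame dat and the variables y, z and subject are placeholders, not from the slides):

library(lme4)
fitpois <- glmer(y ~ z + (1 | subject), family = poisson, data = dat)  # random intercept per subject
summary(fitpois)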

11. Interpretation of variance components. For the linear mixed model we can directly interpret the variances of the random effects in terms of proportions of variance and intra-class correlation for the response variable. This is not possible for logistic and Poisson mixed models. E.g. for logistic regression, the variance is
Var Y_i = E[p_i(1 − p_i)] + Var(p_i),
where the expectation and variance are with respect to U_i in
p_i = exp(α + β z_i + U_i) / (1 + exp(α + β z_i + U_i)).
There is no simple formula for this variance. Here p_i(1 − p_i) is the conditional variance of Y_i given U_i, but this cannot be evaluated since U_i is unobserved.
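
Even without a closed form, the marginal variance can be approximated by Monte Carlo integration over U_i; a minimal sketch (my own, with assumed values alpha = -1, beta z_i = 0.5 and tau = 2):

set.seed(1)
alpha <- -1; bz <- 0.5; tau <- 2
u <- rnorm(1e6, mean = 0, sd = tau)
p <- plogis(alpha + bz + u)              # p_i as a function of U_i
mean(p * (1 - p)) + var(p)               # E[p_i(1 - p_i)] + Var(p_i)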

12. For the Poisson case with λ_i = exp(α + β z_i + U_i) we have (complicated) formulae for the mean,
E Y_i = exp(α + β z_i + τ²/2),
and the variance,
Var Y_i = E Y_i [ exp(α + β z_i + 3τ²/2) − exp(α + β z_i + τ²/2) + 1 ].
Note: τ² is not a simple proportion of the total variance. The formula indeed shows that τ² > 0 gives overdispersion:
Var Y_i / E Y_i = exp(α + β z_i + 3τ²/2) − exp(α + β z_i + τ²/2) + 1 > 1
if τ² > 0.
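
The formulae can be checked by simulation (my own sketch, with assumed values alpha = 0.5, beta z_i = 0.3 and tau² = 1):

set.seed(1)
alpha <- 0.5; bz <- 0.3; tau2 <- 1
u <- rnorm(1e6, mean = 0, sd = sqrt(tau2))
y <- rpois(1e6, lambda = exp(alpha + bz + u))
c(simulated = mean(y), formula = exp(alpha + bz + tau2/2))
c(simulated = var(y),
  formula = exp(alpha + bz + tau2/2) * (exp(alpha + bz + 3*tau2/2) - exp(alpha + bz + tau2/2) + 1))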

13. Computation. Due to the non-linear relation between the mean of the observations and the random effects, computation of the likelihood is not straightforward. There is a huge statistical literature on how to compute good approximations of the likelihood. glmer uses numerical integration (adaptive Gaussian quadrature), and the accuracy is controlled using the argument nAGQ (default is nAGQ=1). SPSS uses so-called penalized quasi-likelihood, based on a (very crude) approximation of the likelihood. For the wheeze data set the R and SPSS estimates differ, but we get qualitatively similar results regarding the significance of the fixed effects.

14. Wheeze results with different values of nAGQ. 5 quadrature points:
> fiter5=glmer(resp~age+smoke+(1|id),family=binomial,data=ohio,nAGQ=5)
Random effects:
 Groups Name        Variance Std.Dev.
 id     (Intercept) 4.198    2.049
Number of obs: 2148, groups: id, 537
Fixed effects:
            Estimate Std. Error z value Pr(>|z|)
(Intercept) -3.02398    0.20353 -14.857  < 2e-16 ***
age         -0.17319    0.06718  -2.578  0.00994 **
smoke        0.39448    0.26305   1.500  0.13371

15. 10 quadrature points:
> fiter10=glmer(resp~age+smoke+(1|id),family=binomial,data=ohio,nAGQ=10)
Random effects:
 Groups Name        Variance Std.Dev.
 id     (Intercept) 4.614    2.148
Number of obs: 2148, groups: id, 537
Fixed effects:
            Estimate Std. Error z value Pr(>|z|)
(Intercept) -3.08959    0.21557 -14.332  < 2e-16 ***
age         -0.17533    0.06762  -2.593  0.00952 **
smoke        0.39799    0.27167   1.465  0.14293
There is some sensitivity regarding the variance estimate, but the fixed effects results are quite stable. Results with 20 quadrature points are very similar to those with 10 quadrature points.

16. Summary
◮ logistic and Poisson regression are very useful for binary and count data where linear normal models are not appropriate.
◮ in some applications there is evidence of overdispersion (extra variance).
◮ it is easy to add random effects to model sources of overdispersion and thereby correctly model correlation between observations, e.g. for the same subject.
◮ thereby we get more trustworthy standard deviations for the fixed effects estimates.
◮ disadvantage: it is not easy to interpret the random effects variances in terms of variances and correlations of the response variable Y_i.

17. Exercises
1. Consider again the epilepsy data. Introduce subject-specific random intercepts. What is the fitted variance for the random intercepts? Compare the results regarding the fixed effects with those of the previous analysis.
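
A hedged starting point for the exercise (the response, covariate and grouping names below are placeholders; use the names in your epilepsy data frame):

library(lme4)
fitepi <- glmer(y ~ trt + (1 | subject), family = poisson, data = epilepsy)
summary(fitepi)   # the random-intercept variance is reported under "Random effects"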
