Outline Introduction and motivation Bootstrap methods for Mixed - PowerPoint PPT Presentation

The R User Conference 2009 July 8-10, Agrocampus-Ouest, Rennes, France An R implementation of bootstrap procedures for mixed models José A. Sánchez-Espigares Universitat Politècnica de Catalunya Jordi Ocaña Universitat de Barcelona

Outline • Introduction and motivation • Bootstrap methods for Mixed Models • Implementation details • Some examples • Conclusions

(Generalized) Linear Mixed Models • Repeated measures or Longitudinal data: Response vector Y i for i th subject Y = Y Y ( ,..., )' i i in 1 i Observations on the same unit can be correlated • Conditional / Hierarchical approach: Between-subject variability explained by Random-effects b i b i N D ~ ( 0 , ) usually with Normal distribution = µ E Y b ( | ) ij i ij µ = θ = β + g X Z b ( ) ij ij ij ij i

Estimation in (G)LMM • Random-effects are not directly observed • Estimation of parameters based on Marginal Likelihood, after integration of Random-effects n ∏ β φ = β φ = L D Y f Y D ( , , | ) ( | , , ) i i i = 1 n n i ∏ ∫ ∏ β φ f Y b f b D db ( | , , ) ( | ) ij ij i i i = = i j 1 1 • MLE: – Analytic solution in the Normal case (Linear Mixed Models) – Approximations are needed in the general case. • lme4 package: common framework for L-GL-NL/MM – Fast and efficient estimation for ML and REML criteria via Laplace Approximation/Adaptative Gaussian Quadrature for GLMM.

Inference (G)LMM • Wald-type and F-tests ( summary ) – Asymptotic standard errors for the fixed effects parameters • Likelihood ratio test ( anova ) – Comparison of likelihood of two models • Bayesian Inference ( mcmcsamp ) – MCMC sampling procedure for posteriors on parameters • Some drawbacks: – Asymptotic results – Degrees of freedom of the reference distribution in F-test – Likelihood Ratio test can be conservative under some conditions – Tests on Variance components close to the boundary of the parameter space.

Motivation • Inference based on bootstrap for LMM and GLMM • Inference on functions of parameters i.e. confidence intervals and hypothesis test for ratio of variance components • Robust approaches i.e. in presence of influential data and outliers • Effect of misspecification i.e. non-gaussian random effects and/or residuals

Extension of the package lmer merBoot provides methods for Monte Carlo and • bootstrap techniques in generalized and linear mixed- effects models • The implementation is object-oriented • It takes profit of specificities of the applied algorithms to enhance efficiency, using less time and memory. • It has a flexible interface to design complex experiments.

Bootstrap in linear models • For (Generalized) linear models (without random effects) there is only one random component � generation of the response variable according to the conditional mean. µ = β µ σ X Y N ~ ( , ) − µ = β g X Y F 1 ( ) ~ µ • Residual resampling: – Estimate parameters for the systematic part of the model – Resample random part of the model (parametric or empirical) – Some variants to deal with heterocedasticity (Wild bootstrap)

Bootstrap in Mixed Models • In Mixed models, the systematic part has a random component � generation of the response variable in two steps: – Bootstrap of the conditional mean (function of the linear predictor) – Bootstrap of the response variable µ = β + θ µ σ X Zb b N Y N ~ ( 0 , ) ~ ( , ) − µ = β + θ g X Zb b N Y F 1 ( ) ~ ( 0 , ) ~ µ Two objects in the merBoot implementation: • – BGP : Set-up for the Bootstrap Generation Process – merBoot : Coefficients for the resamples and methods for analysis

Implementation details �� BGP �� merBoot

Bootstrap Generation Process �� • Fixed parameters β • Design matrices X i Z i • Random effects generator ( b i * ) * from a multivariate gaussian distribution – Parametric: generating b i ˆ * from – Semiparametric/Empirical (from a fitted object): sampling b i b i with replacement. – User-defined: any other distribution/criteria to generate b i * η = β + X Z b * * ij ij ij i

Bootstrap Generation Process �� • Family (distribution F + link function g ) • Response generator ( Y ij * ) – Parametric: µ ij * = g -1 ( η ι j * ) , sample Y ij * ~F ( .; µ ij * ) – Semiparametric/Empirical (from a fitted object): * like in linear heterocedastic models, • Residual-based: builds Y ij depending on type of residuals = µ + ε Y * * * ij ij i • Distribution-based: resamples estimated quantiles

Residuals in GLM − µ Y ˆ • Raw residuals: ij ij − µ Y ˆ ij ij = e • Pearson residuals: ij µ a V ˆ ( ) ij ij e = ij r • Standardized Pearson residuals: ij − µ h ˆ 1 ( ) ij − µ g Y g ˆ ( ) ( ) • Standardized residuals ij ij = l ij µ µ − on the linear predictor scale: a g V h 2 ˆ ˆ ' ( ) ( )( 1 ) ij ij ij ij = − µ r sign Y d • Deviance residuals: ˆ ( ) ij ij ij ij

Empirical residual-based • Standardized Pearson residuals: – Resample e ij * from centered e ij   ˆ φ   = µ + µ Y V e * * * * ( ) – Calculate   ij ij ij ij a   i • Standardized Pearson residuals on the linear predictor scale: – Resample l ij * from l ij     ˆ φ     − = η + η µ Y g g V l * 1 * * * * ' ( ) ( ) – Calculate     ij ij ij ij ij a     i

Empirical distribution-based • Randomized Quantile residuals (Dunn & Smyth,1996): – inverting the estimated distribution function for each observation Φ − to obtain exactly uniform ( q ij ) or standard normal residuals q 1 ( ( )) ij for diagnostics. Randomization needed for discrete distributions. • Resampling scheme with Quantiles Residuals for (G)LMM: = µ q F Y ˆ ˆ ( ; ) – Calculate ij ij ij q – Sample q ij * with replacement from ˆ ij − = µ Y F q * 1 * * ˆ ( ; ) – Generate ij ij ij

Response generation • For Normal family and identity link function, all three strategies (pearson, linear predictor and quantile residuals) are the same. • In all the schemes, response is rounded to the nearest valid value, according to the family considered. • For discrete variables, randomization of the quantiles allows for continuous uniform residuals. • Transformation of the random effects in order to have the first and second moments equal to the parameters (adjusted bootstrap). • For all the schemes, if resample of residuals/quantiles is restricted to the subject obtained in the linear predictor level, a nested bootstrap is performed.

Bootstrap Generation Process ��

BGP Methods – generateLinpred * ~ η = β + b F X Z b * • BGPparam: i ij ij ij i θ ˆ ˆ η = β + b F b X Z b * * ~ (., ) • BGPsemipar: i n i ij ij ij i ˆ η = β + b F b w W X Z w b • BGPsemiparWild: * * * * ~ (., ) ~ i n i i ij ij ij i i – generate (lmer) � generateLinpred +Residual-based * ~ ε = µ + ε F Y * * * • BGPparam: θ ij ij ij ij ε ˆ = µ + ε F e Y * * * * • BGPsemipar: ~ (., ) ij n ij ij ij ij ε ˆ = µ + ε F e w W Y w * * * * * * ~ (., ) ~ • BGPsemiparWild: ij n ij i ij ij i ij ε ˆ = µ + ε F e Y * * * * ~ (., ) • BGPsemiparNested ij n i j ij ij ij * – generate (glmer) � generateLinpred +Distribution-based • BGPparam: − = q F Y F q * * 1 * • BGPsemipar: ~ ( ) µ ij ij ij µ * ij ij • BGPsemiparWild: • BGPSemiparNested

Object merBoot ��

Outline Introduction and motivation Bootstrap methods for Mixed - PowerPoint PPT Presentation

The R User Conference 2009 July 8-10, Agrocampus-Ouest, Rennes, France An R implementation of bootstrap procedures for mixed models Jos A. Snchez-Espigares Universitat Politcnica de Catalunya Jordi Ocaa Universitat de Barcelona

Ins Domingues Breast Cancer Workshop April 7th 2015 Outline Outline Outline Outline

Presentation Preparation Outline Speech Outline Template ***Use this outline to guide you in

Outline for St Outline for St Outline for

Beob Kyun Kim, S oonwook Hwang {kyun, hwang}@ kisti.re.kr KIS TI, Korea Outline Outline

Catherine Revels, World Bank November 2009 Presentation outline Presentation outline

Battlestar Galactica Battlestar Galactica Galactica Battlestar Outline Outline Outline

Outline 2 Outline 2 ZSim core simulation techniques Outline 2 ZSim core simulation

Appendix J: Capstone Presentation Outline Revised Spring 2016 CAPSTONE PRESENTATION OUTLINE This

PT1 TMP Presentation Outline 1 Group Members: ___________________________________ Use this outline

Broverview Outline 2 Outline Philosophy and Architecture A framework for network traffic

Xingqian Peng, Huaqiao University, China Presented by Zhen Wu Presented by Zhen Wu October 30,2011

1 Web Application Development 2 3 Web Application Development CSS Outline An outline is a

Lecture Outline Strengthening Induction Hypothesis. Lecture Outline Strengthening Induction

STAT 213 Simple Linear Regression I Colin Reimer Dawson Oberlin College 5 October 2016 Outline

High Dimensional Approximation - Outline Background and Sources Wolfgang Dahmen Seminar: USC,

Outline Outline Deaf and Hearing Impaired Deaf and Hearing Impaired Physical Structures of

A Consequence of Scale Interaction? De-Zheng Sun CIRES, University of Colorado & Earth

If it is in the EHR it must be true Using EHR data for research Keith Marsolo, PhD Jareen

Characterizing and Synthesizing Task Dependencies of Data-Parallel Jobs in Alibaba Cloud

. logit

The Confusing Conundrum of Capillary Blood Specimen Collection and Analysis Disclosures

In this work, we aim to render participating media in a manner that is robust to media

CS 147: Computer Systems Performance Analysis Selecting Techniques 1 / 37 Overview CS147

Statistics and Data Analysis Descriptive Statistics (2): Summarization Ling-Chieh Kung

Outline Introduction and motivation Bootstrap methods for Mixed - PowerPoint PPT Presentation

The R User Conference 2009 July 8-10, Agrocampus-Ouest, Rennes, France An R implementation of bootstrap procedures for mixed models Jos A. Snchez-Espigares Universitat Politcnica de Catalunya Jordi Ocaa Universitat de Barcelona

Ins Domingues Breast Cancer Workshop April 7th 2015 Outline Outline Outline Outline

Presentation Preparation Outline Speech Outline Template ***Use this outline to guide you in

Outline for St Outline for St Outline for

Beob Kyun Kim, S oonwook Hwang {kyun, hwang}@ kisti.re.kr KIS TI, Korea Outline Outline

Catherine Revels, World Bank November 2009 Presentation outline Presentation outline

Battlestar Galactica Battlestar Galactica Galactica Battlestar Outline Outline Outline

Outline 2 Outline 2 ZSim core simulation techniques Outline 2 ZSim core simulation

Appendix J: Capstone Presentation Outline Revised Spring 2016 CAPSTONE PRESENTATION OUTLINE This

PT1 TMP Presentation Outline 1 Group Members: ___________________________________ Use this outline

Broverview Outline 2 Outline Philosophy and Architecture A framework for network traffic

Xingqian Peng, Huaqiao University, China Presented by Zhen Wu Presented by Zhen Wu October 30,2011

1 Web Application Development 2 3 Web Application Development CSS Outline An outline is a

Lecture Outline Strengthening Induction Hypothesis. Lecture Outline Strengthening Induction

STAT 213 Simple Linear Regression I Colin Reimer Dawson Oberlin College 5 October 2016 Outline

High Dimensional Approximation - Outline Background and Sources Wolfgang Dahmen Seminar: USC,

Outline Outline Deaf and Hearing Impaired Deaf and Hearing Impaired Physical Structures of

A Consequence of Scale Interaction? De-Zheng Sun CIRES, University of Colorado &amp; Earth

If it is in the EHR it must be true Using EHR data for research Keith Marsolo, PhD Jareen

Characterizing and Synthesizing Task Dependencies of Data-Parallel Jobs in Alibaba Cloud

. logit

The Confusing Conundrum of Capillary Blood Specimen Collection and Analysis Disclosures

In this work, we aim to render participating media in a manner that is robust to media

CS 147: Computer Systems Performance Analysis Selecting Techniques 1 / 37 Overview CS147

Statistics and Data Analysis Descriptive Statistics (2): Summarization Ling-Chieh Kung

A Consequence of Scale Interaction? De-Zheng Sun CIRES, University of Colorado & Earth