Checking model assumptions with regression diagnostics Graeme L. - - PowerPoint PPT Presentation

checking model assumptions with regression diagnostics
SMART_READER_LITE
LIVE PREVIEW

Checking model assumptions with regression diagnostics Graeme L. - - PowerPoint PPT Presentation

@graemeleehickey www.glhickey.com graeme.hickey@liverpool.ac.uk Checking model assumptions with regression diagnostics Graeme L. Hickey University of Liverpool Co Confl flicts s of f interest None Assistant Editor (Statistical


slide-1
SLIDE 1

Checking model assumptions with regression diagnostics

Graeme L. Hickey University of Liverpool

@graemeleehickey www.glhickey.com graeme.hickey@liverpool.ac.uk

slide-2
SLIDE 2

Co Confl flicts s of f interest

  • None
  • Assistant Editor (Statistical Consultant) for EJCTS and ICVTS
slide-3
SLIDE 3
slide-4
SLIDE 4

Question: who routinely checks model assumptions when analyzing data?

(raise your hand if the answer is Yes)

slide-5
SLIDE 5

Ou Outline

  • Illustrate with multiple linear regression
  • Plethora of residuals and diagnostics for other model types
  • Focus is not to “what to do if you detect a problem”, but “how to

diagnose (potential) problems”

slide-6
SLIDE 6

My My personal experi rience*

  • Reviewer of EJCTS and ICVTS for 5-years
  • Authors almost never report if they assessed model assumptions
  • Example: only one paper submitted where authors considered

sphericity in RM-ANOVA at first submission

  • Usually one or more comment is sent to authors regarding model

assumptions

* My views do not reflect those of the EJCTS, ICVTS, or of other statistical reviewers

slide-7
SLIDE 7

Li Linear r regression mo modelling

  • Collect some data
  • 𝑧": the observed continuous outcome for subject 𝑗 (e.g. biomarker)
  • 𝑦%", 𝑦'", … , 𝑦)": p covariates (e.g. age, male, …)
  • Want to fit the model
  • 𝑧" = 𝛾, + 𝛾%𝑦%" + 𝛾'𝑦"' + ⋯ + 𝛾)𝑦)" + 𝜁"
  • Estimate the regression coefficients
  • 𝛾

0,, 𝛾 0%, 𝛾 0', … , 𝛾 0)

  • Report the coefficients and make inference, e.g. report 95% CIs
  • But we do not stop there…
slide-8
SLIDE 8

Re Residuals

  • For a linear regression model, the residual for the 𝑗-th observation is

𝑠" = 𝑧" − 𝑧 3"

  • where 𝑧

3" is the predicted value given by 𝑧 3" = 𝛾 0, + 𝛾 0%𝑦%" + 𝛾 0'𝑦"' + ⋯ + 𝛾 0)𝑦)"

  • Lots of useful diagnostics are based on residuals
slide-9
SLIDE 9

Li Lineari rity of functional form rm

  • Assumption: scatterplot of (𝑦", 𝑠") should not show any systematic

trends

  • Trends imply that higher-order terms are required, e.g. quadratic,

cubic, etc.

slide-10
SLIDE 10
  • 20

40 60 80 5 10 15 20

X Y

A

  • −10

−5 5 10 5 10 15 20

X Residual

B

  • 20

40 60 80 5 10 15 20

X Y

C

  • −4

4 8 5 10 15 20

X Residual

D

Fitted model:

𝑍 = 𝛾, + 𝛾%𝑌 + 𝜁 𝑍 = 𝛾, + 𝛾%𝑌 + 𝛾'𝑌' + 𝜁

slide-11
SLIDE 11

Ho Homogeneity eneity

  • We often assume assume that 𝜁" ∼ 𝑂 0, 𝜏'
  • The assumption here is that the variance is constant, i.e.

homogeneous

  • Estimates and predictions are robust to violation, but not inferences

(e.g. F-tests, confidence intervals)

  • We should not see any pattern in a scatterplot of 𝑧

3", 𝑠"

  • Residuals should be symmetric about 0
slide-12
SLIDE 12

Homoscedastic residuals Heteroscedastic residuals

  • −10

−5 5 5 10 15 20 25

Fitted value Residual

A

  • −10

−5 5 5 10 15 20 25

Fitted value Residual

B

slide-13
SLIDE 13

No Normality

  • If we want to make inferences, we generally assume 𝜁" ∼ 𝑂 0, 𝜏'
  • Not always a critical assumption, e.g.:
  • Want to estimate the ‘best fit’ line
  • Want to make predictions
  • The sample size is quite large and the other assumptions are met
  • We can assess graphically using a Q-Q plot, histogram
  • Note: the assumption is about the errors, not the outcomes 𝑧"
slide-14
SLIDE 14
  • −2

−1 1 2 −6 −2 2 4 6

Normal residuals

Theoretical Quantiles Sample Quantiles

  • −2

−1 1 2 5 10 15

Skewed residuals

Theoretical Quantiles Sample Quantiles Residuals Frequency −6 −4 −2 2 4 6 8 5 10 15 20 25 Residuals Frequency 5 10 15 5 10 20 30

slide-15
SLIDE 15

Independenc Independence

  • We assume the errors are independent
  • Usually able to identify this assumption from the study design and

analysis plan

  • E.g. if repeated measures, we should not treat each measurement as

independent

  • If independence holds, plotting the residuals against the time (or
  • rder of the observations) should show no pattern
slide-16
SLIDE 16
  • −60

−30 30 25 50 75 100

X Residual

A

  • −150

−100 −50 50 100 25 50 75 100

X Residual

B

Independent Non-independent

slide-17
SLIDE 17

Mu Multicollineari rity

  • Correlation among the predictors (independent variables) is known as

collinearity (multicollinearity when >2 predictors)

  • If aim is inference, can lead to
  • Inflated standard errors (in some cases very large)
  • Nonsensical parameter estimates (e.g. wrong signs or extremely large)
  • If aim is prediction, it tends not to be a problem
  • Standard diagnostic is the variance inflation factor (VIF)

𝑊𝐽𝐺 𝑌

? =

1 1 − 𝑆?

'

Rule of thumb: VIF > 10 indicates multicollinearity

slide-18
SLIDE 18

Ou Outliers s & infl fluential points

  • r = 0.82
  • r = 0.82
  • r = 0.82
  • r = 0.82

Dataset 1 Dataset 2 Dataset 3 Dataset 4 4 8 12 4 8 12 5 10 15 5 10 15

Measurement 1 Measurement 2

y = 3.00 + 0.500x y = 3.00 + 0.500x y = 3.00 + 0.500x y = 3.00 + 0.500x x y

Outlier High leverage point

slide-19
SLIDE 19

Diag Diagnostics ics t to d detect ct in influ luential p ial poin ints

  • DFBETA (or Δβ)
  • Leave out i-th observation out and refit the model
  • Get estimates of 𝛾

0, C" , 𝛾 0% C" , 𝛾 0' −𝑗 , … , 𝛾 0) C"

  • Repeat for 𝑗 = 1, 2, … , 𝑜
  • Cook’s distance D-statistic
  • A measure of how influential each data point is
  • Automatically computer / visualized in modern software
  • Rule of thumb: 𝐸" > 1 implies point is influential
slide-20
SLIDE 20

Re Residuals from other models

GLMs (incl. logistic regression)

  • Deviance
  • Pearson
  • Response
  • Partial
  • Δβ

Cox regression

  • Martingale
  • Deviance
  • Score
  • Schoenfeld
  • Δβ

Useful for exploring the influence of individual observations and model fit

slide-21
SLIDE 21

Tw Two scenarios

Statistical methods routinely submitted to EJCTS / ICVTS include:

  • 1. Repeated measures ANOVA
  • 2. Cox proportional hazards regression

Each has very important assumptions

slide-22
SLIDE 22

Re Repeated measures ANOVA

  • Assumptions: those used for classical ANOVA + sphericity
  • Sphericity: the variances of the differences of all pairs of the within

subject conditions (e.g. time) are equal

  • It’s a questionable a priori assumption for longitudinal data

Patient T0 T1 T2 T0 – T1 T0 – T2 T1 – T2 1 30 27 20 3 10 7 2 35 30 28 5 7 2 3 25 30 20 −5 5 10 4 15 15 12 3 3 5 9 12 7 −3 2 5 Variance 17.0 10.3 10.3

slide-23
SLIDE 23

Ma Mauchly's 's te test

  • A popular test (but criticized due to power and robustness)
  • H0: sphericity satisfied (i.e. 𝜏H

ICH J

'

= 𝜏H

ICH K

'

= 𝜏H

JCH K

'

)

  • H1: non-sphericity (at least one variance is different)
  • If rejected, it is usual to apply a correction to the degrees of freedom

(df) in the RM-ANOVA F-test

  • The correction is 𝜗 x df, where 𝜗 = epsilon statistic (either

Greenhouse-Geisser or Huynh-Feldt)

  • Software (e.g. SPSS) will automatically report 𝜗 and the corrected

tests

slide-24
SLIDE 24

Pr Proportionality assumption

  • Cox regression assumes proportional hazards:
  • Equivalently, the hazard ratio must be constant over time
  • There are many ways to assess this assumption, including two using

residual diagnostics:

  • Graphical inspection of the (scaled) Schoenfeld residuals
  • A test* based on the Schoenfeld residuals

* Grambsch & Therneau. Biometrika. 1994; 81: 515-26.

slide-25
SLIDE 25
  • −0.3

−0.2 −0.1 0.0 0.1 0.2 0.3 56 150 200 280 350 450 570 730

Time Beta(t) for age

Schoenfeld Individual Test p: 0.5385

  • ● ●

−2 −1 1 2 3 56 150 200 280 350 450 570 730

Time Beta(t) for sex

Schoenfeld Individual Test p: 0.1253

  • −0.2

0.0 0.2 56 150 200 280 350 450 570 730

Time Beta(t) for wt.loss

Schoenfeld Individual Test p: 0.8769

Global Schoenfeld Test p: 0.416

  • Simple Cox model fitted to the North

Central Cancer Treatment Group lung cancer data set*

  • If proportionality is valid, then we should

not see any association between the residuals and time

  • Can formally test the correlation for each

covariate

  • Can also formally test the “global”

proportionality

*Loprinzi CL et al. Journal of Clinical Oncology. 12(3) :601-7, 1994.

slide-26
SLIDE 26

Co Conclusi sions

  • Residuals are incredibly powerful for diagnosing issues in regression

models

  • If a model doesn’t satisfy the required assumptions, don’t expect

subsequent inferences to be correct

  • Assumptions can usually be assessed using methods other than (or in

combination with) residuals

  • Always report in manuscript
  • What diagnostics were used, even if they are absent from the Results section
  • Any corrections or adjustments made as a result of diagnostics
slide-27
SLIDE 27

Slides available (shortly) from: www.glhickey.com

Thanks for listening Any questions?

Statistical Primer article to be published soon!