Lecture 10. Modeling Process and Model Diagnostics
Nan Ye
School of Mathematics and Physics
University of Queensland
This Lecture
• Modeling process
• Goodness of fit
• Residuals
Modeling Process

Some key modeling activities
[Diagram: data → choose model class → fit model → validate model → use model]
Choosing a model class
• The choice of a model class is often driven by many factors, including data characteristics, expressiveness, interpretability, and computational efficiency.
• If predictive performance (expressiveness) is the main concern:
  • try deep neural networks for image/text/speech data;
  • try random forests when high-level features are available.
• GLMs can be a good choice when interpretability matters.
Data
• More data is often better.
• With the right features, even simple models can work well.
• Exploratory analysis can suggest useful features and models.
Fitting the model
• Fitting is usually formulated as an optimization problem.
• MLE is often used to learn a statistical model.
• If predictive performance is the main concern, optimize the performance measure directly.
• Sophisticated optimization algorithms may be needed.
• For GLMs, Fisher scoring often works well for MLE, as in the sketch below.
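A minimal sketch of this in R, on simulated Poisson data (the data and model here are illustrative assumptions, not from the lecture): glm() maximizes the likelihood by IRLS/Fisher scoring, and trace = TRUE prints the deviance at each scoring iteration.

# Simulated data for illustration only
set.seed(1)
x <- runif(50)
y <- rpois(50, lambda = exp(1 + 2 * x))
# glm() fits by IRLS / Fisher scoring; trace = TRUE shows the iterations
fit <- glm(y ~ x, family = poisson(link = 'log'),
           control = glm.control(trace = TRUE))
coef(fit)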
Validating the model
• Check the model assumptions
  • Check goodness of fit, residual plots, etc. on the training set.
  • Beware: a good fit on the training set may be a sign of overfitting.
• Check predictive performance
  • Check the cross-validation score or validation set performance, as in the sketch below.
• Reconsider the model class or the data if the checks are not satisfactory.
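One way to estimate predictive performance is k-fold cross-validation; here is a minimal sketch (the data frame dat, its response y, and the Poisson model are hypothetical assumptions for illustration):

# Hypothetical data frame `dat` with response column `y`
k <- 5
folds <- sample(rep(1:k, length.out = nrow(dat)))
cv.err <- numeric(k)
for (i in 1:k) {
  # Fit on all folds except fold i, evaluate on fold i
  fit <- glm(y ~ ., family = poisson, data = dat[folds != i, ])
  pred <- predict(fit, newdata = dat[folds == i, ], type = 'response')
  cv.err[i] <- mean((dat$y[folds == i] - pred)^2)  # held-out squared error
}
mean(cv.err)  # cross-validation estimate of prediction error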
Using the model
• After these checks, the model can be used to make predictions or draw conclusions (such as the significance of variables, or variable importance).
Goodness of Fit

Deviance
• Null model
  • Includes only the intercept term in the GLM.
  • Variation in the $y_i$'s comes from the random component only.
• Full model (saturated model)
  • Fits an exponential family distribution to each example.
  • The exponential family distribution for $(x_i, y_i)$ is $f(y \mid \mathrm{mean} = y_i)$.
  • Variation in the $y_i$'s comes from the systematic component only.
• GLM
  • Summarizes the data with a few parameters.
  • The exponential family distribution for $(x_i, y_i)$ is $f(y \mid \mathrm{mean} = \hat\mu_i)$, where $\hat\mu_i = g^{-1}(x_i^\top \hat\beta)$.
• Scaled deviance
$$D^*(y; \hat\mu) = 2 \sum_i \ln f(y_i \mid \mathrm{mean} = y_i) - 2 \sum_i \ln f(y_i \mid \mathrm{mean} = \hat\mu_i).$$
This is twice the difference between the log-likelihood of the full model and the maximum log-likelihood achievable by the GLM.
• Deviance
$$D(y; \hat\mu) = b(\phi) D^*(y; \hat\mu).$$
The deviance is thus the scaled deviance with the nuisance parameter removed.
Example. Gaussian
The scaled deviance is
$$D^*(y; \hat\mu) = 2 \sum_i \ln\left( \frac{1}{\sqrt{2\pi}\sigma} e^{-\frac{(y_i - y_i)^2}{2\sigma^2}} \right) - 2 \sum_i \ln\left( \frac{1}{\sqrt{2\pi}\sigma} e^{-\frac{(y_i - \hat\mu_i)^2}{2\sigma^2}} \right) = \sum_i \frac{(y_i - \hat\mu_i)^2}{\sigma^2}.$$
The deviance is
$$D(y; \hat\mu) = \sigma^2 D^*(y; \hat\mu) = \sum_i (y_i - \hat\mu_i)^2.$$
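This identity is easy to check numerically: for a Gaussian GLM, deviance() should return the residual sum of squares (a sketch on simulated data; the data are assumptions for illustration).

set.seed(2)
x <- rnorm(30)
y <- 1 + 2 * x + rnorm(30)
fit <- glm(y ~ x, family = gaussian)
deviance(fit)             # D(y; mu.hat)
sum((y - fitted(fit))^2)  # residual sum of squares; same value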
distribution       deviance
normal             $\sum (y - \hat\mu)^2$
Poisson            $2 \sum \left( y \ln \frac{y}{\hat\mu} - (y - \hat\mu) \right)$
binomial           $2 \sum \left( y \ln \frac{y}{\hat\mu} + (m - y) \ln \frac{m - y}{m - \hat\mu} \right)$
Gamma              $2 \sum \left( -\ln \frac{y}{\hat\mu} + \frac{y - \hat\mu}{\hat\mu} \right)$
inverse Gaussian   $\sum (y - \hat\mu)^2 / (\hat\mu^2 y)$
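As a check on the Poisson row, the table formula can be computed by hand and compared with deviance() (a sketch on simulated data; by convention the term $y \ln(y/\hat\mu)$ is taken to be 0 when $y = 0$):

set.seed(3)
x <- runif(40)
y <- rpois(40, exp(1 + x))
fit <- glm(y ~ x, family = poisson)
mu <- fitted(fit)
term <- ifelse(y > 0, y * log(y / mu), 0)  # y log(y/mu), with 0 log 0 = 0
2 * sum(term - (y - mu))                   # table formula
deviance(fit)                              # same value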
Recall
> logLik(fit.ig.inv)
'log Lik.' -25.33805 (df=5)
> logLik(fit.ig.invquad)
'log Lik.' -50.26075 (df=5)
> logLik(fit.ig.log)
'log Lik.' -45.55859 (df=5)

Inverse Gaussian regression with the inverse link has the best fit (much better than the other two).
> summary(fit.ig.inv)
Null deviance:     0.24788404 on 17 degrees of freedom
Residual deviance: 0.00097459 on 14 degrees of freedom
> summary(fit.ig.invquad)
Null deviance:     0.24788 on 17 degrees of freedom
Residual deviance: 0.01554 on 14 degrees of freedom
> summary(fit.ig.log)
Null deviance:     0.2478840 on 17 degrees of freedom
Residual deviance: 0.0092164 on 14 degrees of freedom

• The inverse link has the best fit (smallest residual deviance).
• This is the same conclusion as obtained by comparing the log-likelihoods.
• The summary function thus provides a comparison with both the full model and the null model.
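The null deviance reported by summary() is just the deviance of the intercept-only model; assuming the fitted object fit.ig.inv from the earlier slides is available, this can be checked directly:

fit0 <- update(fit.ig.inv, . ~ 1)  # refit with only the intercept
deviance(fit0)                     # should match the reported null deviance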
Generalized Pearson $X^2$ statistic
• Recall: $\mathrm{var}(Y) = b(\phi) A''(\eta)$ for a natural exponential family.
• $\mathrm{var}(Y)/b(\phi)$ depends only on $\eta$, and thus only on $\mu$.
• $\mathrm{var}(Y)/b(\phi)$ is often called the variance function $V(\mu)$.
• The Pearson $X^2$ statistic is
$$X^2 = \sum (y - \hat\mu)^2 / V(\hat\mu),$$
where $V(\hat\mu)$ is the estimated variance function.
• The scaled version is $X^2 / b(\phi)$.
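In R, the statistic can be computed directly from the formula, or as the sum of squared Pearson residuals (a sketch on simulated Poisson data, where $V(\mu) = \mu$):

set.seed(4)
x <- runif(40)
y <- rpois(40, exp(1 + x))
fit <- glm(y ~ x, family = poisson)
mu <- fitted(fit)
sum((y - mu)^2 / mu)                     # X^2 with V(mu) = mu
sum(residuals(fit, type = 'pearson')^2)  # same value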
distribution       $X^2$
normal             $\sum (y - \hat\mu)^2$
Poisson            $\sum (y - \hat\mu)^2 / \hat\mu$
binomial           $\sum (y - \hat\mu)^2 / (\hat\mu (1 - \hat\mu))$
Gamma              $\sum (y - \hat\mu)^2 / \hat\mu^2$
inverse Gaussian   $\sum (y - \hat\mu)^2 / \hat\mu^3$
Asymptotic distribution
• If the model is true, then the scaled deviance and the scaled Pearson $X^2$ statistic asymptotically follow $\chi^2_{n-p}$, where $n$ is the number of examples and $p$ is the number of parameters estimated.
• In principle, this can be used to test goodness of fit, but in practice the approximation is often poor.
• In particular, a test on the scaled deviance or the scaled Pearson $X^2$ statistic cannot be used to justify that the model is correct.
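For a Poisson model, $b(\phi) = 1$, so the deviance is already scaled, and the (rough) test looks as follows (a sketch on the same simulated setup as the previous sketch; as noted above, a large p-value here does not establish that the model is correct):

set.seed(4)
x <- runif(40)
y <- rpois(40, exp(1 + x))
fit <- glm(y ~ x, family = poisson)
# P(chi^2_{n-p} >= observed deviance); small values suggest lack of fit
pchisq(deviance(fit), df = df.residual(fit), lower.tail = FALSE)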
Residuals

Response residual
• This is the difference between the observed output and the fitted mean:
$$r_R = y - \hat\mu.$$
• It measures the deviation from the systematic effect on an absolute scale.
Pearson residuals
• This is the normalized response residual:
$$r_P = \frac{y - \hat\mu}{\sqrt{V(\hat\mu)}}.$$
• Pearson residuals have approximately zero mean and constant variance if the model is correct.
distribution       Pearson residual
normal             $y - \hat\mu$
Poisson            $(y - \hat\mu) / \sqrt{\hat\mu}$
binomial           $(y - \hat\mu) / \sqrt{\hat\mu (1 - \hat\mu)}$
Gamma              $(y - \hat\mu) / \hat\mu$
inverse Gaussian   $(y - \hat\mu) / \hat\mu^{3/2}$
Working residuals
• Recall: in the IRLS interpretation of Fisher scoring, at each iteration we fit the adjusted response vector
$$z = G y - G \mu + X \beta,$$
where $G = \mathrm{diag}(g'(\mu_1), \ldots, g'(\mu_n))$.
• The adjusted response for $(x, y)$ is
$$z = g'(\mu)(y - \mu) + x^\top \beta.$$
• The working residual is
$$r_W = z - \hat\xi = (y - \hat\mu) \left. \frac{\partial \xi}{\partial \mu} \right|_{\mu = \hat\mu} = (y - \hat\mu) g'(\hat\mu),$$
where $\xi = x^\top \beta$.
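For a log-link Poisson fit, $g(\mu) = \ln \mu$ and $g'(\mu) = 1/\mu$, so the working residual should be $(y - \hat\mu)/\hat\mu$; this matches what R returns (a sketch on simulated data):

set.seed(5)
x <- runif(40)
y <- rpois(40, exp(1 + x))
fit <- glm(y ~ x, family = poisson(link = 'log'))
mu <- fitted(fit)
head((y - mu) / mu)                     # (y - mu.hat) * g'(mu.hat)
head(residuals(fit, type = 'working'))  # same values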
Deviance residuals
• This is the signed contribution of each example to the deviance:
$$r_D = \mathrm{sign}(y - \hat\mu) \sqrt{d},$$
where $d$ is the example's contribution to the deviance, so $\sum_i d_i = D$.
• Deviance residuals are closer to a normal distribution (less skewed) than Pearson residuals.
• They are often better for spotting outliers.
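By this definition, the squared deviance residuals must sum to the deviance, which is easy to verify (a sketch on simulated Poisson data):

set.seed(6)
x <- runif(40)
y <- rpois(40, exp(1 + x))
fit <- glm(y ~ x, family = poisson)
sum(residuals(fit, type = 'deviance')^2)  # equals...
deviance(fit)                             # ...the deviance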
distribution       deviance residual
normal             $y - \hat\mu$
Poisson            $\mathrm{sign}(y - \hat\mu) \sqrt{2 \left( y \ln \frac{y}{\hat\mu} - (y - \hat\mu) \right)}$
binomial           $\mathrm{sign}(y - \hat\mu) \sqrt{2 \left( y \ln \frac{y}{\hat\mu} + (m - y) \ln \frac{m - y}{m - \hat\mu} \right)}$
Gamma              $\mathrm{sign}(y - \hat\mu) \sqrt{2 \left( -\ln \frac{y}{\hat\mu} + \frac{y - \hat\mu}{\hat\mu} \right)}$
inverse Gaussian   $(y - \hat\mu) / (\hat\mu \sqrt{y})$
Computing residuals in R
> resid(fit.ig.inv, 'response')
> resid(fit.ig.inv, 'pearson')
> resid(fit.ig.inv, 'working')
> resid(fit.ig.inv, 'deviance')
What You Need to Know
• Modeling process
• Goodness of fit: deviance and the Pearson $X^2$ statistic
• Response, working, Pearson, and deviance residuals