CROSS VALIDATION Jeff Goldsmith, PhD Department of Biostatistics 1 - PowerPoint PPT Presentation

Oct 03, 2023 •332 likes •463 views

CROSS VALIDATION Jeff Goldsmith, PhD Department of Biostatistics 1 Model selection When you have lots of possible variables, you have you choose which ones will go in your model In the best case, you have a clear hypothesis you want

CROSS VALIDATION Jeff Goldsmith, PhD Department of Biostatistics � 1
Model selection • When you have lots of possible variables, you have you choose which ones will go in your model • In the best case, you have a clear hypothesis you want to test in the context of known confounders • (Always keep in mind that no model is “true”) � 2
Model selection is hard • Lots of times you’re not in the best case, but still have to do something • This isn’t an easy thing to do • For nested models, you have tests – You have to be worried about multiple comparisons and “fishing” • For non-nested models, you don’t have tests – AIC / BIC / etc are traditional tools – Balance goodness of fit with “complexity” � 3
Questioning fit • These are basically the same question: – Is my model not complex enough? Too complex? – Am I underfitting? Overfitting? – Do I have high bias? High variance? • Another way to think of this is out-of-sample goodness of fit: – Will my model generalize to future datasets? � 4
Flexibility vs fit � 5
Prediction accuracy • Ideally, you could – Build your model given a dataset – Go out and get new data – Confirm that your model “works” for the new data • That doesn’t really happen • So maybe just act like it does? � 6
Cross validation • � 7
Cross validation Training Full data Build model Split RMSE Testing Apply model � 8
Refinements and variations • Individual training / testing splits are subject to randomness • Repeating the process – Illustrates variability in prediction accuracy – Can indicate whether differences in models are consistent across splits • I usually repeat the training / testing split • Folding (5-fold, 10-fold, k-fold, LOOCV) partitions data into equally-sized subsets – One fold is used as testing, with remaining folds as training – Repeated for each fold as testing • I don’t do this as often � 9
Cross validation is general • Can use to compare candidate models that are all “traditional” • Comes up a lot in “modern” methods – Automated variable selection (e.g. lasso) – Additive models – Regression trees � 10
Prediction as a goal • In the best case, you have a clear hypothesis you want to test in the context of known confounders – I know I already said this, but it’s important • Prediction accuracy matters as well – Different goal than statistical significance – Models that make poor predictions probably don’t adequately describe the data generating mechanism, and that’s bad � 11
Tools for CV • Lots of helpful functions in modelr – add_predictions() and add_residuals() – rmse() – crossv_mc() • Since repeating the process can help, list columns and map come in handy a lot too :-) � 12

Recommend

Cross-validation and the Bootstrap In the section we discuss two resampling methods:

Cross-validation and the Bootstrap In the section we discuss two resampling methods: cross-validation and the bootstrap. 1 / 44 Cross-validation and the Bootstrap In the section we discuss two resampling methods: cross-validation and the

1.13k views • 66 slides

STAT 213 Cross-Validation (and Multifactor ANOVA?) Colin Reimer Dawson Oberlin College 12

Outline Last Time Cross-Validation STAT 213 Cross-Validation (and Multifactor ANOVA?) Colin Reimer Dawson Oberlin College 12 April 2016 Outline Last Time Cross-Validation Outline Last Time Cross-Validation Outline Last Time

381 views • 25 slides

Progress to Date in A3: Method Transfer, Partial Validation and Cross validation A3: Method

Progress to Date in A3: Method Transfer, Partial Validation and Cross validation A3: Method Transfer, partial and cross validation Team members: In scope Life cycle of a method after first full validation or relation Team

252 views • 10 slides

Introduction to Data Science: Classifier n 1 n 1 k k Suppose you want to compare two

Classier evaluation Classier evaluation Leave-one-out Cross-Validation Leave-one-out Cross-Validation Leave-one-out Cross-Validation Classier evaluation Classier evaluation Leave-one-out Cross-Validation Resampled validation set

984 views • 51 slides

02 | 27 SOUTHERN CROSS 23.04 03 | 27 SOUTHERN CROSS 23.04 04 | 27 SOUTHERN CROSS 23.04 06

302 views • 27 slides

The Shadow of the Cross The Cross of Jesus part 1B The Shadow of the Cross Hebrews 10:1-14 The

The Shadow of the Cross The Cross of Jesus part 1B The Shadow of the Cross Hebrews 10:1-14 The Shadow of the Cross The Shadow of the Cross OT Glimpses of the Cross OT Glimpses of the Cross Heb 8:5 & 10:1 Heb 8:5 & 10:1 OT Glimpses

359 views • 35 slides

Validation of National Burn Severity Validation of National Burn Severity Validation of National

Validation of National Burn Severity Validation of National Burn Severity Validation of National Burn Severity Validation of National Burn Severity Mapping Project Techniques Within Mapping Project Techniques Within the Apalachicola National

212 views • 17 slides

Form Validation 1 CS380 What is form validation? 2 validation: ensuring that form's values

Form Validation 1 CS380 What is form validation? 2 validation: ensuring that form's values are correct some types of validation: preventing blank values (email address) ensuring the type of values integer, real number,

284 views • 24 slides

Stratified Cross-Validation in Multi-Label Classification Using Genetic Algorithms 7-8/02/2013

Stratified Cross-Validation in Multi-Label Classification Using Genetic Algorithms 7-8/02/2013 TIN2010-20900-C04 Albacete Index Introduction Multilabel Classification Cross-Validation and Stratified Cross-Validation Methods and

682 views • 43 slides

Holdout and Cross- -Validation Validation Holdout and Cross Methods Overfitting Avoidance

Holdout and Cross- -Validation Validation Holdout and Cross Methods Overfitting Avoidance Methods Overfitting Avoidance Decision Trees Decision Trees Reduce error pruning Reduce error pruning Cost Cost- -complexity pruning

699 views • 17 slides

Criticality experiments and benchmarks for for validation of cross validation of cross sections:

Criticality experiments and benchmarks Criticality experiments and benchmarks for for validation of cross validation of cross sections: the neptunium sections: the neptunium case case L.S.Leong, L.Tassan-Got, L.Audouin, C. Paradela,

464 views • 21 slides

Importance-Weighted Cross- Importance-Weighted Cross- Validation for Covariate Shift Validation

Importance-Weighted Cross- Importance-Weighted Cross- Validation for Covariate Shift Validation for Covariate Shift (1) (2) Masashi Sugiyama , Benjamin Blankertz , (2,3) (2) Matthias Krauledat , Guido Dornhege , (3,2) Klaus-Robert

623 views • 26 slides

Data Mining II Model Validation Heiko Paulheim Why Model Validation? We have seen so far

Data Mining II Model Validation Heiko Paulheim Why Model Validation? We have seen so far Various metrics (e.g., accuracy, F-measure, RMSE, ) Evaluation protocol setups Split Validation Cross Validation Special

1.04k views • 55 slides

in Spark Using GPU Minsik Cho, Rajesh Bordawekar IBM TJW Research 1 Cross-Validation 101

Accelerating Cross-Validation in Spark Using GPU Minsik Cho, Rajesh Bordawekar IBM TJW Research 1 Cross-Validation 101 [Wikipedia] Popular Model Validation Technique to avoid overfitting, for better generalization useful when not

430 views • 20 slides

LaGov LaGov Version 2.2 Updated: 12/17/08 Visit our website for Blueprint Presentations,

Inventory and Warehouse Management Inventory and Warehouse Management Inventory and Warehouse Management Validation Session Validation Session Validation Session LOG- -IM/WM IM/WM- -Validation Validation LOG LOG-IM/WM-Validation January

1.59k views • 146 slides

LaGov LaGov Validation Session Agenda Validation Session Agenda Purpose Work Session

Validation Session Validation Session Validation Session Budget Prep Budget Prep Budget Prep Operating Budget Operating Budget Operating Budget December 01, 2008 December 01, 2008 December 01, 2008 LaGov LaGov Validation Session Agenda

725 views • 55 slides

Combinatorial Methods for Modelling Composed Software Systems Ludwig Kampel, Bernhard Garn and

Combinatorial Methods for Modelling Composed Software Systems Ludwig Kampel, Bernhard Garn and Dimitris E. Simos SBA Research, Austria 6th International Workshop on Combinatorial Testing (IWCT 2017) Waseda University, Nishiwaseda Campus,

968 views • 69 slides

Humans, Machine, and the Future of Work Moshe Y. Vardi Rice University Houston, TX, USA

Humans, Machine, and the Future of Work Moshe Y. Vardi Rice University Houston, TX, USA vardi@cs.rice.edu Follow me on social media! Where Are The Jobs? The Great Coupling The Great Decoupling The Neoluddites J. Sachs and L. Kotlikoff

873 views • 25 slides

Choice theory Michel Bierlaire michel.bierlaire@epfl.ch Transport and Mobility Laboratory

Choice theory Michel Bierlaire michel.bierlaire@epfl.ch Transport and Mobility Laboratory Choice theory p. 1/26 Framework Choice: outcome of a sequential decision-making process Definition of the choice problem: How do I get to EPFL?

494 views • 26 slides

Joining data: a real- world necessity PAN DAS JOIN S F OR S P READS H EET US ERS John Miller

Joining data: a real- world necessity PAN DAS JOIN S F OR S P READS H EET US ERS John Miller Principal Data Scientist Pandas for spreadsheet users Learn based on similarities to spreadsheets Understand the power and exibility of pandas

621 views • 24 slides

What is a hierarchical model? Richard Erickson Quantitative Ecologist DataCamp Hierarchical

DataCamp Hierarchical and Mixed Effects Models in R HIERARCHICAL AND MIXED EFFECTS MODELS IN R What is a hierarchical model? Richard Erickson Quantitative Ecologist DataCamp Hierarchical and Mixed Effects Models in R Why do we use a

718 views • 25 slides

Network meta-analysis using integrated nested Laplace approximations (INLA) Burak Krsad Gnhan

Network meta-analysis using integrated nested Laplace approximations (INLA) Burak Krsad Gnhan 1 Tim Friede 1 Leonhard Held 2 1 Department of Medical Statistics, University Medical Center Gttingen, Gttingen, Germany 2 Epidemiology,

396 views • 29 slides

Models for QBFs Uwe Bubeck and Hans Kleine Bning University of Paderborn July 11, 2013

SAT 2013 Nested Boolean Functions as Models for QBFs Uwe Bubeck and Hans Kleine Bning University of Paderborn July 11, 2013 Outline Introduction: QBF and (Counter-)Models Free Variables and Models NBF Representation

691 views • 27 slides

Functors in Computable Model Theory Russell Miller Queens College & CUNY Graduate Center

Functors in Computable Model Theory Russell Miller Queens College & CUNY Graduate Center Infinity Workshop Kurt G odel Research Center Vienna, Austria 10 July 2014 (Joint work with many researchers.) Russell Miller (CUNY) Functors

398 views • 23 slides