Cross validation (COMS 4721)
The model selection problem

Objective
◮ Often necessary to consider many different models (e.g., types of classifiers) for a given problem.
◮ Sometimes “model” simply means a particular setting of hyper-parameters (e.g., k in k-NN, the number of nodes in a decision tree).

Terminology
The problem of choosing a good model is called model selection.
Model selection by hold-out validation

(Henceforth, use h to denote a particular setting of hyper-parameters / model choice.)

Hold-out validation

Model selection:
1. Randomly split the data into three sets: Training, Validation, and Test.
   [ Training | Validation | Test ]
2. Train classifier f̂_h on the Training data for different values of h.
3. Compute the Validation (“hold-out”) error for each f̂_h: err(f̂_h, Validation).
4. Selection: ĥ = the value of h with the lowest Validation error.
5. Train classifier f̂ using ĥ on the Training and Validation data together.

Model assessment:
6. Finally: estimate the true error rate of f̂ using the Test data.
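As a concrete illustration, here is a minimal Python sketch of steps 1 to 6, using scikit-learn’s k-NN classifier and treating k as the hyper-parameter h. The synthetic dataset, the candidate values of k, and the 60/20/20 split proportions are illustrative assumptions, not part of the slides.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# Illustrative synthetic data (assumption: any labeled dataset would do).
X, y = make_classification(n_samples=1000, n_features=20, random_state=0)

# 1. Randomly split into Training / Validation / Test (60% / 20% / 20%).
X_trainval, X_test, y_trainval, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0)
X_train, X_val, y_train, y_val = train_test_split(
    X_trainval, y_trainval, test_size=0.25, random_state=0)

# 2-4. Train f_h on the Training data for each h, then pick the h
#      with the lowest Validation ("hold-out") error.
candidate_h = [1, 3, 5, 9, 15]          # assumed candidate values of k
val_errors = {}
for h in candidate_h:
    f_h = KNeighborsClassifier(n_neighbors=h).fit(X_train, y_train)
    val_errors[h] = 1.0 - f_h.score(X_val, y_val)   # err(f_h, Validation)
h_hat = min(val_errors, key=val_errors.get)

# 5. Retrain with the selected h on Training + Validation data.
f_hat = KNeighborsClassifier(n_neighbors=h_hat).fit(X_trainval, y_trainval)

# 6. Model assessment: estimate the true error rate on the Test data.
test_error = 1.0 - f_hat.score(X_test, y_test)
print(h_hat, test_error)
```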
Main idea behind hold-out validation

[ Training | Validation | Test ]
Classifier f̂_h trained on the Training data → err(f̂_h, Validation).

[ Training and Validation | Test ]
Classifier f̂_h trained on the Training and Validation data → err(f̂_h, Test).

The hope is that these quantities are similar!
(Making this rigorous is actually rather tricky.)
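Continuing the hypothetical sketch above, one can at least compare the two quantities empirically; the variable names below refer to that earlier sketch and are assumptions of this illustration.

```python
# err(f_h trained on Training, Validation): computed during selection.
val_error_selected = val_errors[h_hat]

# err(f trained on Training + Validation, Test): computed in step 6.
print(f"Validation error of selected h: {val_error_selected:.3f}")
print(f"Test error after retraining:    {test_error:.3f}")
# The hope: these two numbers are close (no guarantee in general).
```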
Beyond simple hold-out validation

Standard hold-out validation:
[ Training | Validation | Test ]
Classifier f̂_h trained on the Training data → err(f̂_h, Validation).

Could also swap the roles of Validation and Training:
◮ train f̂_h using the Validation data, and
◮ evaluate f̂_h using the Training data.
[ Training | Validation | Test ]
Classifier f̂_h trained on the Validation data → err(f̂_h, Training).

Idea: Do both, and average the results as the overall validation error rate for h.
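A minimal sketch of this “do both and average” idea (essentially 2-fold cross validation), assuming the same k-NN setup, imports, and data splits as the earlier hypothetical sketch:

```python
def swapped_holdout_error(h, X_a, y_a, X_b, y_b):
    """Train on one part, evaluate on the other, in both directions, then average."""
    err_ab = 1.0 - KNeighborsClassifier(n_neighbors=h).fit(X_a, y_a).score(X_b, y_b)
    err_ba = 1.0 - KNeighborsClassifier(n_neighbors=h).fit(X_b, y_b).score(X_a, y_a)
    return 0.5 * (err_ab + err_ba)

# Overall validation error rate for each candidate h
# (Training and Validation swap roles).
avg_errors = {h: swapped_holdout_error(h, X_train, y_train, X_val, y_val)
              for h in candidate_h}
```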
Model selection by K-fold cross validation

Model selection:
1. Set aside some test data.
2. Split the remaining data into K parts (“folds”) S_1, S_2, ..., S_K.
3. For each value of h:
   ◮ For each k ∈ {1, 2, ..., K}:
     ◮ Train classifier f̂_{h,k} using all S_i except S_k.
     ◮ Evaluate classifier f̂_{h,k} using S_k: err(f̂_{h,k}, S_k).
     (Example with K = 5 and k = 4: [ Training | Training | Training | Validation | Training ])
   ◮ K-fold cross-validation error rate for h: (1/K) ∑_{k=1}^{K} err(f̂_{h,k}, S_k).
4. Set ĥ to the value of h with the lowest K-fold cross-validation error rate.
5. Train classifier f̂ using the selected ĥ on all of S_1, S_2, ..., S_K.

Model assessment:
6. Finally: estimate the true error rate of f̂ using the test data.
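The procedure maps directly onto a short Python sketch; the loop below mirrors steps 2 to 6, again with k-NN as the hypothetical model family, scikit-learn’s KFold providing the folds S_1, ..., S_K, and X_trainval, y_trainval, candidate_h reused from the earlier hold-out sketch.

```python
from sklearn.model_selection import KFold
from sklearn.neighbors import KNeighborsClassifier

K = 5
kfold = KFold(n_splits=K, shuffle=True, random_state=0)

# 3. K-fold cross-validation error rate for each candidate h.
cv_errors = {}
for h in candidate_h:
    fold_errors = []
    for train_idx, val_idx in kfold.split(X_trainval):
        # Train f_{h,k} on all folds except S_k, evaluate on S_k.
        f_hk = KNeighborsClassifier(n_neighbors=h).fit(
            X_trainval[train_idx], y_trainval[train_idx])
        fold_errors.append(1.0 - f_hk.score(X_trainval[val_idx], y_trainval[val_idx]))
    cv_errors[h] = sum(fold_errors) / K    # (1/K) * sum_k err(f_{h,k}, S_k)

# 4-5. Select h with the lowest CV error and retrain on all folds.
h_hat = min(cv_errors, key=cv_errors.get)
f_hat = KNeighborsClassifier(n_neighbors=h_hat).fit(X_trainval, y_trainval)

# 6. Model assessment on the held-out test data.
test_error = 1.0 - f_hat.score(X_test, y_test)
```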
How to choose K?

Argument for small K
Better simulates the “variation” between different training samples drawn from the underlying distribution.
(Figure: example fold layouts for K = 2 and K = 4, with a different fold serving as Validation in each round.)

Argument for large K
Some learning algorithms exhibit phase-transition behavior (e.g., the output is complete rubbish until the sample size is sufficiently large). Using a large K best simulates training on all the data (except the test data, of course).

In practice: usually K = 5 or K = 10.
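To see how the choice of K affects the estimate in practice, one can compare a few values of K on the same data. The snippet below is a sketch using scikit-learn’s cross_val_score convenience wrapper, with the same hypothetical data and a fixed choice of k assumed for the comparison.

```python
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier

model = KNeighborsClassifier(n_neighbors=5)   # assumed fixed h for the comparison
for K in (2, 5, 10):
    scores = cross_val_score(model, X_trainval, y_trainval, cv=K)
    print(f"K = {K:2d}: cross-validation error = {1.0 - scores.mean():.3f}")
```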
Recap

◮ Model selection: the goal is to pick the best model (e.g., hyper-parameter settings) to achieve low true error.
◮ Two common methods: hold-out validation and K-fold cross validation (with K = 5 or K = 10).
◮ Caution: considering too many different models can lead to overfitting, even with hold-out / cross-validation.
  (Sometimes “averaging” the models in some way can help.)
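One common instance of such “averaging” is a simple voting ensemble over a few candidate models. A minimal sketch using scikit-learn’s VotingClassifier; the choice of base models is purely illustrative, and X_trainval, y_trainval, X_test, y_test are reused from the earlier hypothetical sketches.

```python
from sklearn.ensemble import VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

# Average (by majority vote) several candidate models instead of committing to one.
ensemble = VotingClassifier(estimators=[
    ("knn", KNeighborsClassifier(n_neighbors=5)),
    ("tree", DecisionTreeClassifier(max_depth=5)),
    ("logreg", LogisticRegression(max_iter=1000)),
])
ensemble.fit(X_trainval, y_trainval)
print(1.0 - ensemble.score(X_test, y_test))   # test error of the averaged model
```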