Week 2, Video 5: Cross-Validation and Over-Fitting
Over-Fitting
• I’ve mentioned over-fitting a few times during the last few weeks
• Fitting to the noise as well as the signal
Over-Fitting
[Figure: two scatter plots of the same data with fitted curves; left panel labeled "Good fit", right panel labeled "Over fit"]
Reducing Over-Fitting
• Use simpler models
  ◦ Fewer variables (BIC, AIC, Occam’s Razor)
  ◦ Less complex functions (MDL)
Eliminating Over-Fitting?
• Every model is over-fit in some fashion
• The questions are:
  ◦ How bad?
  ◦ What is it over-fit to?
Assessing Generalizability
• Does your model transfer to new contexts?
• Or is it over-fit to a specific context?
Training Set/Test Set
• Split your data into a training set and a test set
Notes
• Model is tested on unseen data
• But the data is used unevenly: each point serves only for training or only for testing
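A minimal sketch of the training set/test set split, using Python and scikit-learn rather than the course tools; the features, labels, and logistic regression model below are synthetic placeholders, not course data.

```python
# Minimal sketch of a training set/test set split (scikit-learn assumed).
# X and y are synthetic placeholders standing in for real features/labels.
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression

rng = np.random.RandomState(0)
X = rng.normal(size=(200, 5))                          # placeholder features
y = (X[:, 0] + rng.normal(size=200) > 0).astype(int)   # placeholder labels

# Hold out 20% of the data; the model never sees the test rows during fitting.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0)

model = LogisticRegression().fit(X_train, y_train)
print("accuracy on unseen test data:", model.score(X_test, y_test))
```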
Cross-validation
• Split data points into N equal-size groups
Cross-validation
• Train on all groups but one, test on the remaining group
• Repeat for each possible combination
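As a concrete illustration of the train-on-all-groups-but-one loop, here is a minimal sketch in Python with scikit-learn; the data and the choice of five folds are placeholder assumptions.

```python
# Minimal sketch of N-fold cross-validation: train on all groups but one,
# test on the remaining group, for each possible combination.
import numpy as np
from sklearn.model_selection import KFold
from sklearn.linear_model import LogisticRegression

rng = np.random.RandomState(0)
X = rng.normal(size=(200, 5))                          # placeholder features
y = (X[:, 0] + rng.normal(size=200) > 0).astype(int)   # placeholder labels

scores = []
for train_idx, test_idx in KFold(n_splits=5, shuffle=True,
                                 random_state=0).split(X):
    model = LogisticRegression().fit(X[train_idx], y[train_idx])
    scores.append(model.score(X[test_idx], y[test_idx]))

print("mean accuracy across the 5 held-out groups:", np.mean(scores))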
You can do both!
• Use cross-validation to tune algorithm parameters or select algorithms
• Use a held-out test set to get a less over-fit final estimate of model goodness
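A hedged sketch of doing both, assuming scikit-learn: cross-validation on the training portion tunes a parameter, and the held-out test set gives the final estimate. The logistic regression model and the grid of C values are illustrative assumptions, not part of the lecture.

```python
# Minimal sketch of "doing both": cross-validation tunes a parameter on the
# training data, and a held-out test set gives the final estimate.
# The model and the grid of C values are illustrative assumptions.
import numpy as np
from sklearn.model_selection import train_test_split, GridSearchCV
from sklearn.linear_model import LogisticRegression

rng = np.random.RandomState(0)
X = rng.normal(size=(200, 5))
y = (X[:, 0] + rng.normal(size=200) > 0).astype(int)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0)

# 5-fold cross-validation on the training portion to pick a parameter.
search = GridSearchCV(LogisticRegression(),
                      param_grid={"C": [0.01, 0.1, 1.0, 10.0]}, cv=5)
search.fit(X_train, y_train)

print("parameter chosen by cross-validation:", search.best_params_)
print("less over-fit final estimate:", search.score(X_test, y_test))
```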
How many groups?
• K-fold
  ◦ Pick a number K, split into this number of groups
  ◦ Quicker; preferred by some theoreticians
• Leave-one-out
  ◦ Every data point is a fold
  ◦ More stable
  ◦ Avoids the issue of how to select folds (stratification issues)
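The two fold choices map directly onto scikit-learn's KFold and LeaveOneOut splitters; the sketch below contrasts them on placeholder data, with K = 10 as an illustrative choice.

```python
# Minimal sketch contrasting K-fold with leave-one-out cross-validation.
import numpy as np
from sklearn.model_selection import KFold, LeaveOneOut, cross_val_score
from sklearn.linear_model import LogisticRegression

rng = np.random.RandomState(0)
X = rng.normal(size=(100, 5))
y = (X[:, 0] + rng.normal(size=100) > 0).astype(int)

# K-fold: pick a number K (here 10) and split into that many groups.
kfold = cross_val_score(LogisticRegression(), X, y,
                        cv=KFold(n_splits=10, shuffle=True, random_state=0))
# Leave-one-out: every data point is its own fold (no fold selection needed).
loo = cross_val_score(LogisticRegression(), X, y, cv=LeaveOneOut())

print("10-fold mean accuracy:", kfold.mean())
print("leave-one-out mean accuracy:", loo.mean())
```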
Cross-validation variants
• Flat cross-validation
  ◦ Each point has an equal chance of being placed into each fold
• Stratified cross-validation
  ◦ Biases fold selection so that some variable is equally represented in each fold
  ◦ The variable you’re trying to predict
  ◦ Or some variable that is thought to be an important context
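A minimal sketch of stratified cross-validation, assuming scikit-learn's StratifiedKFold; the point is only that each held-out fold preserves the balance of the variable being predicted. Data are placeholders.

```python
# Minimal sketch of stratified cross-validation: fold assignment is biased so
# the predicted variable y is (roughly) equally represented in every fold.
import numpy as np
from sklearn.model_selection import StratifiedKFold

rng = np.random.RandomState(0)
X = rng.normal(size=(200, 5))
y = (X[:, 0] + rng.normal(size=200) > 0).astype(int)

for train_idx, test_idx in StratifiedKFold(n_splits=5, shuffle=True,
                                           random_state=0).split(X, y):
    # Each held-out fold preserves the overall class balance of y.
    print("class counts in this fold:", np.bincount(y[test_idx]))
```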
Student-level cross-validation
• Folds are selected so that no student’s data is represented in two folds
• Allows you to test model generalizability to new students
• As opposed to testing model generalizability to new data from the same students
Student-level cross-validation
• Usually seen as the minimum cross-validation needed at the EDM conference
• Papers that don’t pay attention to this issue are usually rejected
  ◦ OK to explicitly choose something else and discuss that choice
  ◦ Not OK to just ignore the issue and do what’s easiest
Student-level cross-validation
• Easy to do with Batch X-Validation in RapidMiner
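Outside RapidMiner, one way to approximate student-level cross-validation is scikit-learn's GroupKFold, which keeps all rows sharing a group id in a single fold; the student ids and data below are placeholders, not an endorsement of a specific implementation.

```python
# Minimal sketch of student-level cross-validation with scikit-learn's
# GroupKFold (an alternative to RapidMiner's Batch X-Validation): all rows
# sharing a student id land in the same fold. Student ids are placeholders.
import numpy as np
from sklearn.model_selection import GroupKFold
from sklearn.linear_model import LogisticRegression

rng = np.random.RandomState(0)
X = rng.normal(size=(200, 5))
y = (X[:, 0] + rng.normal(size=200) > 0).astype(int)
students = rng.randint(0, 20, size=200)      # placeholder student ids

scores = []
for train_idx, test_idx in GroupKFold(n_splits=5).split(X, y, groups=students):
    model = LogisticRegression().fit(X[train_idx], y[train_idx])
    scores.append(model.score(X[test_idx], y[test_idx]))

print("accuracy on data from unseen students:", np.mean(scores))
```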
Other Levels Sometimes Used for Cross-Validation
• Lesson/Content
• School
• Demographic (Urban/Rural/Suburban, Race, Gender)
• Software Package
• Session (in MOOCs, behavior in later sessions differs from behavior in earlier sessions; Whitehill et al., 2017)
Important Consideration
• Where do you want to be able to use your model?
  ◦ New students?
  ◦ New schools?
  ◦ New populations?
  ◦ New software content?
• Make sure to cross-validate at that level
Next Lecture
• More on Generalization and Validity