

  1. Overfitting, Cross-Validation Recommended reading: • Neural nets: Mitchell Chapter 4 • Decision trees: Mitchell Chapter 3 Machine Learning 10-701 Tom M. Mitchell Carnegie Mellon University

  2. Overview • Followup on neural networks – Example: Face classification • Cross validation – Training error – Test error – True error • Decision trees – ID3, C4.5 – Trees and rules

  3–5. [Figures: error curves plotted against # of gradient descent steps]

  6. Cognitive Neuroscience Models Based on ANNs [McClelland & Rogers, Nature 2003]

  7. How should we choose the number of weight updates?

  8. How should we choose the number of weight updates? • How should we allocate the N examples between training and validation sets? • How will the curves change if we double the training set size? • How will the curves change if we double the validation set size? • What is our best unbiased estimate of the true network error?
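One standard answer to the first question is early stopping: keep taking gradient steps, track error on the held-out validation set, and keep the weights from the step where validation error was lowest. The sketch below is mine, not the lecture's; the `model` interface (gradient_step, error, get_weights, set_weights) is hypothetical.

```python
def train_with_early_stopping(model, train_set, val_set,
                              max_steps=10_000, eval_every=100):
    """Choose the number of weight updates by monitoring validation error.

    `model` is a hypothetical object exposing gradient_step(data),
    error(data), get_weights(), and set_weights(weights).
    """
    best_val_error = float("inf")
    best_weights = model.get_weights()
    for step in range(1, max_steps + 1):
        model.gradient_step(train_set)          # one gradient descent step
        if step % eval_every == 0:
            val_error = model.error(val_set)    # error on held-out validation set
            if val_error < best_val_error:
                best_val_error = val_error
                best_weights = model.get_weights()
    # Restore the weights with the lowest validation error, not the final
    # ones: training error keeps falling, but validation error turns back
    # up once the network starts overfitting.
    model.set_weights(best_weights)
    return model
```

Note that a validation set used to pick the stopping point no longer gives an unbiased estimate of the true error; that requires a separate, untouched test set (see slides 10 and 11).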

  9. Overfitting and Cross Validation Overfitting: a learning algorithm overfits the training data if it outputs a hypothesis h ∈ H when there exists h′ ∈ H such that error_train(h) < error_train(h′) and error_true(h′) < error_true(h), where error_train(·) is the error measured on the training sample and error_true(·) is the error over the whole distribution of examples.
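As a concrete illustration of this definition (my example, not from the slides): fitting polynomials of increasing degree to a few noisy points, the high-degree fit plays the role of h, with lower training error than a modest-degree h′ but typically higher error on fresh data.

```python
import numpy as np

rng = np.random.default_rng(0)

# Noisy samples of a simple target function f(x) = sin(x).
x_train = rng.uniform(0, 3, size=10)
y_train = np.sin(x_train) + rng.normal(scale=0.3, size=10)
x_test = rng.uniform(0, 3, size=1000)
y_test = np.sin(x_test) + rng.normal(scale=0.3, size=1000)

for degree in (1, 3, 9):
    coeffs = np.polyfit(x_train, y_train, degree)  # hypothesis of this degree
    train_mse = np.mean((np.polyval(coeffs, x_train) - y_train) ** 2)
    test_mse = np.mean((np.polyval(coeffs, x_test) - y_test) ** 2)
    print(f"degree {degree}: train MSE {train_mse:.3f}, test MSE {test_mse:.3f}")
```

With 10 training points, the degree-9 polynomial can drive training error to nearly zero while its test error typically exceeds that of the lower-degree fits.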

  10. Three types of error • True error: error_true(h) = Pr_{x∼D}[ h(x) ≠ f(x) ], the probability that h misclassifies an example drawn from the instance distribution D with target function f • Train set error: error_train(h) = (1/|train|) Σ_{x∈train} δ(h(x) ≠ f(x)) • Test set error: error_test(h) = (1/|test|) Σ_{x∈test} δ(h(x) ≠ f(x))
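Train and test error are the same empirical quantity computed on different samples; the true error is its expectation under D. A minimal helper makes this concrete (the naming is mine):

```python
import numpy as np

def error_rate(h, X, y):
    """Empirical error: the fraction of examples on which hypothesis h
    disagrees with the labels y (h is assumed to predict a label vector)."""
    return float(np.mean(h(X) != y))

# error_rate(h, X_train, y_train)  -> train set error
# error_rate(h, X_test,  y_test)   -> test set error, an estimate of true error
```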

  11. Bias in estimates • error_train(h) gives a biased (optimistically low) estimate of error_true(h), because h was chosen to fit exactly those training examples • error_test(h) gives an unbiased estimate of error_true(h), provided the test examples were not used during training

  12. Leave one out cross validation Method for estimating the true error of h′: • e = 0 • For each training example z: – Train on {data − z} – Test on the single held-out example z; if the hypothesis misclassifies z, then e ← e + 1 • Final error estimate (for training on a sample of size |data| − 1) is e / |data|
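A direct transcription of this procedure into Python (a sketch; `train_fn` is a hypothetical learner that returns a fitted hypothesis):

```python
import numpy as np

def loocv_error(train_fn, X, y):
    """Leave-one-out estimate of true error, following the procedure above."""
    e = 0
    n = len(X)
    for z in range(n):
        keep = np.arange(n) != z          # train on {data - z}
        h = train_fn(X[keep], y[keep])    # hypothetical learner
        if h(X[z]) != y[z]:               # test on the single held-out example z
            e += 1
    return e / n  # estimate for training on samples of size |data| - 1
```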

  13. Leave one out cross validation The leave-one-out error, e / |data|, gives an almost unbiased estimate of the true error of the hypothesis trained on the full data set.

  14. Leave one out cross validation In fact, the e / |data| estimate of leave-one-out cross validation is a slightly pessimistic estimate of that true error, since each leave-one-out run trains on only |data| − 1 examples, one fewer than the final hypothesis sees.

  15. How should we choose the number of weight updates? • How should we allocate the N examples between training and validation sets? • How will the curves change if we double the training set size? • How will the curves change if we double the validation set size? • What is our best unbiased estimate of the true network error?

  16. What you should know: • Neural networks – Hidden layer representations • Cross validation – Training error, test error, true error – Cross validation as low-bias estimator
