Introduction to Machine Learning Evaluation: Measures for Regression - PowerPoint PPT Presentation

Introduction to Machine Learning Evaluation: Measures for Regression Learning goals Know the definitions of mean squared error (MSE) and mean absolute error (MAE) Understand the connections of MSE and MAE to L2 and L1 loss Know the definitions of R 2 and generalized R 2

MEAN SQUARED ERROR n ( y ( i ) − ˆ y ( i ) ) 2 ∈ [ 0 ; ∞ ) MSE = 1 � → L2 loss. n i = 1 Single observations with a large prediction error heavily influence the MSE , as they enter quadratically. 7 15 6 5 10 ^ , y ) 4 L ( y y 6.65 6.65 1.15 5 3 2 1.15 0 1 −4 −2 0 2 4 0 2 4 Residuals = y − y ^ x Similar measures: sum of squared errors (SSE), root mean squared error (RMSE, brings measurement back to the original scale of the outcome). � c Introduction to Machine Learning – 1 / 4

MEAN ABSOLUTE ERROR n | y ( i ) − ˆ MAE = 1 y ( i ) | ∈ [ 0 ; ∞ ) � → L1 loss. n i = 1 Less influenced by large errors and maybe more intuitive than the MSE. 7 15 6 5 10 ^ , y ) 4 L ( y y 2.58 6.65 1.07 5 3 2.58 2 1.07 1.15 0 1 −4 −2 0 2 4 1 2 3 4 5 Residuals = y − y ^ x Similar measures: median absolute error (for even more robustness). � c Introduction to Machine Learning – 2 / 4

R 2 Well-known measure from statistics. n ( y ( i ) − ˆ y ( i ) ) 2 � = 1 − SSE LinMod R 2 = 1 − i = 1 n SSE Intercept ( y ( i ) − ¯ y ) 2 � i = 1 Usually introduced as fraction of variance explained by the model Simpler: compares SSE of constant model (baseline) with complex model (LM) R 2 = 1: all residuals are 0, we predict perfectly, R 2 = 0: we predict as badly as the constant model If measured on the training data, R 2 ∈ [ 0 ; 1 ] (LM must be at least as good as the constant) On other data R 2 can even be negative as there is no guarantee that the LM generalizes better than a constant (overfitting) � c Introduction to Machine Learning – 3 / 4

GENERALIZED R 2 FOR ML A simple generalization of R 2 for ML seems to be: 1 − Loss ComplexModel Loss SimplerModel Works for arbitrary measures (not only SSE), for arbitrary models, on any data set of interest E.g. model vs constant, LM vs non-linear model, tree vs forest, model without some features vs model with them included Fairly unknown; our terminology (generalized R 2 ) is non-standard � c Introduction to Machine Learning – 4 / 4

Introduction to Machine Learning Evaluation: Measures for Regression - PowerPoint PPT Presentation

Introduction to Machine Learning Evaluation: Measures for Regression Learning goals Know the definitions of mean squared error (MSE) and mean absolute error (MAE) Understand the connections of MSE and MAE to L2 and L1 loss Know the

Introduction to Machine Learning Introduction to Machine Learning Introduction to Machine

Quantum Machine Learning Adam Brown, HEP-AI Quantum Computing Machine Learning Quantum

MICROSOFT AZURE MACHINE LEARNING Oscar Naim Microsoft Microsoft Azure Machine Learning What is

MACHINE LEARNING Overview 1 1 APPLIED MACHINE LEARNING 2011-2012 APPLIED MACHINE LEARNING

MACHINE LEARNING kernels 1 MACHINE LEARNING 2012 MACHINE LEARNING Kernels: Intuition How

A Machine Learning Approach A Machine Learning Approach A Machine Learning Approach A Machine

Introduction to Machine Learning COMPSCI 371D Machine Learning COMPSCI 371D Machine

INTRODUCTION TO MACHINE LEARNING Joseph C. Osborn CS 51A Spring 2020 Machine Learning is

Introduction to Machine Learning Evaluation: Measures for Binary Classification: ROC Measures

Introduction to Machine Learning COMPSCI 371D Machine Learning COMPSCI 371D Machine

Chapter 12. Evaluation Research Chapter 12. Evaluation Research evaluation research? evaluation

User Interface Evaluation Empirical evaluation Heuristic evaluation 1 CS 349 - UI evaluation

Welcome to the Machine Learning Toolbox! Machine Learning Toolbox Supervised learning caret

Human and Machine Learning Tom Mitchell Machine Learning Department Carnegie Mellon University

Machine Learning Algorithms for Classification Machine Learning Algorithms for Classification

Machine Learning - Intro Aarti Singh Machine Learning 10-701/15-781 Sept 8, 2010 You tell me

Lecture 2 Diagnostics and Model Evaluation Colin Rundel 1/23/2017 1 From last time 2 Linear

10703 Deep Reinforcement Learning Policy Gradient Methods Part 3 Tom Mitchell October 8, 2018

Introduction to Machine Learning Milan Straka October 07, 2019 Charles University in Prague

MY FIRST STEPS IN SLIT SPECTROSCOPY BAAVSS Spectroscopy Workshop Norman Lockyer Observatory

CSE 158 Lecture 2 Web Mining and Recommender Systems Supervised learning Regression

Day 6: Model Selection II Lucas Leemann Essex Summer School Introduction to Statistical Learning

( Y n a bX n ) 2 . n = 1 Thus, Note that E [ X ] = 0 and E [ Y ] = 0 in these

The Paradox of Overfitting Volker Nannen February 1, 2003 Artificial Intelligence

Explore More Topics

Sambuz

Useful Links

Newsletter

Mail Us