Regression
Albert Bifet, May 2012

COMP423A/COMP523A Data Stream Mining

Outline
1. Introduction
2. Stream Algorithmics
3. Concept drift
4. Evaluation
5. Classification
6. Ensemble Methods
7. Regression
8. Clustering
9. Frequent Pattern Mining
10. Distributed Streaming

Data Streams Big Data & Real Time

Regression

Definition: Given a numeric class attribute, a regression algorithm builds a model that predicts, for every unlabelled instance I, a numeric value with accuracy.

$y = f(x)$

Example: Stock-market price prediction
Example: Airplane delays

Evaluation

Evaluation Framework
1. Error estimation: Hold-out or Prequential
2. Evaluation performance measures: MSE or MAE
3. Statistical significance validation: Nemenyi test
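As an illustration of prequential error estimation, the sketch below tests on each incoming instance first and only then trains on it, accumulating the mean absolute error (MAE). The `RunningMean` model and the `predict`/`learn_one` method names are hypothetical stand-ins chosen for this example; they are not taken from the slides.

```python
class RunningMean:
    """Toy regressor (an assumption for this sketch): predicts the mean target seen so far."""

    def __init__(self):
        self.n = 0
        self.total = 0.0

    def predict(self, x):
        # Before any training instance arrives, fall back to 0.0.
        return self.total / self.n if self.n else 0.0

    def learn_one(self, x, y):
        self.n += 1
        self.total += y


def prequential_mae(model, stream):
    """Interleaved test-then-train evaluation returning the mean absolute error."""
    abs_error = 0.0
    n = 0
    for x, y in stream:
        abs_error += abs(model.predict(x) - y)  # test on the instance first
        model.learn_one(x, y)                   # then use it for training
        n += 1
    return abs_error / n if n else float("nan")


# Attribute values are irrelevant for the toy model, so they are set to None.
stream = [(None, 2.0), (None, 4.0), (None, 6.0)]
mae = prequential_mae(RunningMean(), stream)
```

Every instance is used for testing before the model has seen its label, so no separate hold-out set is needed.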

2. Performance Measures

Regression mean measures
◮ Mean square error: $\mathrm{MSE} = \sum_i (f(x_i) - y_i)^2 / N$
◮ Root mean square error: $\mathrm{RMSE} = \sqrt{\mathrm{MSE}} = \sqrt{\sum_i (f(x_i) - y_i)^2 / N}$

Forgetting mechanism for estimating measures: sliding window of size w with the most recent observations
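A minimal sketch of the sliding-window forgetting mechanism for MSE and RMSE, assuming the squared errors of the w most recent observations are simply kept in a bounded queue. The class name `WindowedSquaredError` is hypothetical.

```python
import math
from collections import deque


class WindowedSquaredError:
    """MSE and RMSE over the w most recent (prediction, target) pairs."""

    def __init__(self, w):
        # A deque with maxlen=w drops the oldest squared error automatically.
        self.window = deque(maxlen=w)

    def add(self, prediction, target):
        self.window.append((prediction - target) ** 2)

    def mse(self):
        if not self.window:
            return float("nan")
        return sum(self.window) / len(self.window)

    def rmse(self):
        return math.sqrt(self.mse())


estimator = WindowedSquaredError(w=3)
for prediction, target in [(1.0, 2.0), (2.0, 2.0), (5.0, 3.0), (4.0, 4.0)]:
    estimator.add(prediction, target)
# Only the last three squared errors (0, 4, 0) remain in the window.
```

Here N in the formulas becomes the number of observations currently in the window, which is at most w.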

2. Performance Measures

Regression relative measures
◮ Relative square error: $\mathrm{RSE} = \sum_i (f(x_i) - y_i)^2 / \sum_i (\bar{y} - y_i)^2$
◮ Root relative square error: $\mathrm{RRSE} = \sqrt{\mathrm{RSE}} = \sqrt{\sum_i (f(x_i) - y_i)^2 / \sum_i (\bar{y} - y_i)^2}$

Forgetting mechanism for estimating measures: sliding window of size w with the most recent observations
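The relative measures compare the model with a baseline that always predicts the mean target, so a value of 1 means "no better than the mean". A short sketch, with hypothetical function names, computed over whatever observations are currently retained (for example the contents of a sliding window):

```python
import math


def relative_squared_error(predictions, targets):
    """RSE: squared error of the model divided by that of the mean predictor."""
    mean_y = sum(targets) / len(targets)
    numerator = sum((p - y) ** 2 for p, y in zip(predictions, targets))
    denominator = sum((mean_y - y) ** 2 for y in targets)
    return numerator / denominator  # undefined if all targets are equal


def root_relative_squared_error(predictions, targets):
    """RRSE: square root of RSE."""
    return math.sqrt(relative_squared_error(predictions, targets))


targets = [1.0, 2.0, 3.0]
rse_mean_predictor = relative_squared_error([2.0, 2.0, 2.0], targets)
rse_one_miss = relative_squared_error([1.0, 2.0, 4.0], targets)
```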

Linear Methods for Regression

Linear Least Squares fitting
◮ Linear regression model: $f(x) = \beta_0 + \sum_{j=1}^{p} \beta_j x_j = X\beta$
◮ Minimize the residual sum of squares: $\mathrm{RSS}(\beta) = \sum_{i=1}^{N} (y_i - f(x_i))^2 = (y - X\beta)'(y - X\beta)$
◮ Solution: $\hat{\beta} = (X'X)^{-1} X'y$
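For a single input attribute (p = 1) the normal equations have a closed form that only needs running sums, so the fit can be updated one instance at a time. The sketch below covers that special case only; the class name and the choice to keep the sums incrementally are assumptions made for illustration, not something stated in the slides.

```python
class StreamingSimpleLinearRegression:
    """Least-squares fit of y = beta0 + beta1 * x from running sums (p = 1 only)."""

    def __init__(self):
        self.n = 0
        self.sum_x = 0.0
        self.sum_y = 0.0
        self.sum_xx = 0.0
        self.sum_xy = 0.0

    def update(self, x, y):
        self.n += 1
        self.sum_x += x
        self.sum_y += y
        self.sum_xx += x * x
        self.sum_xy += x * y

    def coefficients(self):
        # Solving X'X beta = X'y for p = 1; needs at least two distinct x values.
        denominator = self.n * self.sum_xx - self.sum_x ** 2
        beta1 = (self.n * self.sum_xy - self.sum_x * self.sum_y) / denominator
        beta0 = (self.sum_y - beta1 * self.sum_x) / self.n
        return beta0, beta1


model = StreamingSimpleLinearRegression()
for x in [0.0, 1.0, 2.0, 3.0]:
    model.update(x, 1.0 + 2.0 * x)  # noiseless points on the line y = 1 + 2x
beta0, beta1 = model.coefficients()
```

With more than one attribute the same idea applies to the matrices X'X and X'y, but solving the resulting linear system then needs a linear-algebra routine.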

Perceptron

[Figure: perceptron with inputs Attribute 1 to Attribute 5, weights w_1 to w_5, and output $h_{\vec{w}}(\vec{x}_i)$]

◮ Data stream: $\langle \vec{x}_i, y_i \rangle$
◮ Classical perceptron: $h_{\vec{w}}(\vec{x}_i) = \vec{w}^T \vec{x}_i$
◮ Minimize mean-square error: $J(\vec{w}) = \frac{1}{2} \sum (y_i - h_{\vec{w}}(\vec{x}_i))^2$

Perceptron

◮ Minimize mean-square error: $J(\vec{w}) = \frac{1}{2} \sum (y_i - h_{\vec{w}}(\vec{x}_i))^2$
◮ Stochastic gradient descent: $\vec{w} = \vec{w} - \eta \nabla J \, \vec{x}_i$
◮ Gradient of the error function: $\nabla J = -\sum_i (y_i - h_{\vec{w}}(\vec{x}_i))$
◮ Weight update rule: $\vec{w} = \vec{w} + \eta \sum_i (y_i - h_{\vec{w}}(\vec{x}_i)) \, \vec{x}_i$
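The weight update rule can be sketched for the streaming case, where the sum reduces to the single most recent instance. The class below is a minimal illustration under two assumptions that are not in the slides: a constant input of 1.0 is appended so that one weight acts as a bias, and the learning rate η is fixed.

```python
class PerceptronRegressor:
    """Linear perceptron trained with one stochastic gradient step per instance."""

    def __init__(self, n_features, eta=0.1):
        self.w = [0.0] * (n_features + 1)  # last weight multiplies the constant bias input
        self.eta = eta

    def _with_bias(self, x):
        return list(x) + [1.0]

    def predict(self, x):
        # h_w(x) = w^T x
        return sum(wj * xj for wj, xj in zip(self.w, self._with_bias(x)))

    def learn_one(self, x, y):
        # w = w + eta * (y - h_w(x)) * x, applied to the current instance only
        error = y - self.predict(x)
        self.w = [wj + self.eta * error * xj
                  for wj, xj in zip(self.w, self._with_bias(x))]


def target(x):
    # Hypothetical noiseless linear concept used only for this demonstration.
    return 2.0 * x[0] - 1.0 * x[1] + 0.5


grid = [(a, b) for a in (0.0, 0.5, 1.0) for b in (0.0, 0.5, 1.0)]
model = PerceptronRegressor(n_features=2, eta=0.1)
for _ in range(500):       # replay the small grid as if it were a stream
    for x in grid:
        model.learn_one(x, target(x))
```

Because each update touches only the current instance, the memory cost is one weight per attribute regardless of stream length.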

Fast Incremental Model Tree with Drift Detection (FIMT-DD)

FIMT-DD differences with HT (Hoeffding Tree):
1. Splitting criterion
2. Numeric attribute handling using BINTREE
3. Linear model at the leaves
4. Concept drift handling: Page-Hinckley
5. Alternate tree adaptation strategy

Splitting Criterion

Standard Deviation Reduction Measure
◮ Classification
  Information Gain = Entropy(before split) − Entropy(after split)
  $\mathrm{Entropy} = -\sum_{i=1}^{c} p_i \cdot \log p_i$
  $\mathrm{Gini\ Index} = \sum_{i=1}^{c} p_i (1 - p_i) = 1 - \sum_{i=1}^{c} p_i^2$
◮ Regression
  Gain = SD(before split) − SD(after split)
  $\mathrm{SD} = \sqrt{\sum (\bar{y} - y_i)^2 / N}$
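A small sketch of the regression gain for a binary split on one numeric attribute. The slide does not say how SD(after split) combines the two branches; this example assumes the common choice of weighting each branch's SD by its share of the instances. Function names are hypothetical.

```python
import math


def standard_deviation(values):
    """SD = sqrt(sum((mean - y_i)^2) / N)."""
    n = len(values)
    mean = sum(values) / n
    return math.sqrt(sum((mean - v) ** 2 for v in values) / n)


def sd_reduction(xs, ys, threshold):
    """Gain = SD(before split) - SD(after split) for the test x <= threshold."""
    left = [y for x, y in zip(xs, ys) if x <= threshold]
    right = [y for x, y in zip(xs, ys) if x > threshold]
    if not left or not right:
        return 0.0  # a split that sends everything one way gains nothing
    # Assumption: branch SDs are weighted by the fraction of instances in each branch.
    after = (len(left) * standard_deviation(left)
             + len(right) * standard_deviation(right)) / len(ys)
    return standard_deviation(ys) - after


xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 1.0, 5.0, 5.0]
gain_clean_split = sd_reduction(xs, ys, threshold=1.5)
gain_poor_split = sd_reduction(xs, ys, threshold=0.5)
```

The split at 1.5 separates the two target levels exactly, so it removes all of the spread and is preferred over the split at 0.5.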

Numeric Handling Methods

Exhaustive Binary Tree (BINTREE; Gama et al., 2003)
◮ Closest implementation of a batch method
◮ Incrementally update a binary tree as data is observed
◮ Issues: high memory cost, high cost of split search, data order

Page Hinckley Test

◮ The CUSUM test
  $g_0 = 0, \quad g_t = \max(0, g_{t-1} + \epsilon_t - \upsilon)$
  if $g_t > h$ then alarm and $g_t = 0$
◮ The Page Hinckley test
  $g_0 = 0, \quad g_t = g_{t-1} + (\epsilon_t - \upsilon)$
  $G_t = \min(g_t)$
  if $g_t - G_t > h$ then alarm and $g_t = 0$
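Both tests can be sketched as small stateful detectors that consume one error value ε_t at a time. Two details are assumptions of this sketch rather than statements from the slides: the running minimum G_t includes g_0 = 0, and after an alarm the Page Hinckley minimum is reset together with g_t.

```python
class Cusum:
    """CUSUM test: g_t = max(0, g_{t-1} + eps_t - upsilon); alarm when g_t > h."""

    def __init__(self, upsilon, h):
        self.upsilon = upsilon
        self.h = h
        self.g = 0.0

    def update(self, eps):
        self.g = max(0.0, self.g + eps - self.upsilon)
        if self.g > self.h:
            self.g = 0.0
            return True
        return False


class PageHinckley:
    """Page Hinckley test: alarm when g_t - min(g) > h."""

    def __init__(self, upsilon, h):
        self.upsilon = upsilon
        self.h = h
        self.g = 0.0
        self.g_min = 0.0  # assumption: the minimum starts at g_0 = 0

    def update(self, eps):
        self.g += eps - self.upsilon
        self.g_min = min(self.g_min, self.g)
        if self.g - self.g_min > self.h:
            self.g = 0.0
            self.g_min = 0.0  # assumption: restart the minimum after an alarm
            return True
        return False


def first_alarm(detector, stream):
    """Index of the first alarm, or None if the detector never fires."""
    for t, eps in enumerate(stream):
        if detector.update(eps):
            return t
    return None


# Errors are 0 for 20 steps, then jump to 1: a simulated concept change.
errors = [0.0] * 20 + [1.0] * 10
cusum_alarm = first_alarm(Cusum(upsilon=0.1, h=2.0), errors)
ph_alarm = first_alarm(PageHinckley(upsilon=0.1, h=2.0), errors)
```

In both detectors υ sets how much change is tolerated per step and h trades detection delay against false alarms.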

Lazy Methods

kNN Nearest Neighbours:
1. Mean value of the k nearest neighbours: $\hat{f}(x_q) = \sum_{i=1}^{k} f(x_i) / k$
2. Depends on distance function
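A minimal sketch of kNN regression, assuming Euclidean distance as the distance function and a plain list of stored instances. Both choices are illustrative; the slides leave the distance function open.

```python
import math


def knn_predict(stored, query, k):
    """Mean target of the k stored instances closest to the query point.

    stored: list of (x, y) pairs, where x is a sequence of numbers.
    """
    # Assumption: Euclidean distance; any other distance function could be swapped in.
    nearest = sorted(stored, key=lambda pair: math.dist(pair[0], query))[:k]
    return sum(y for _, y in nearest) / len(nearest)


stored = [([0.0], 0.0), ([1.0], 1.0), ([2.0], 4.0), ([10.0], 100.0)]
prediction_k1 = knn_predict(stored, [1.2], k=1)
prediction_k2 = knn_predict(stored, [1.2], k=2)
```

Changing the distance function changes which instances count as neighbours, and therefore the prediction, which is the dependence noted in point 2.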
