Lecture 8: Regression Trees Instructor: Saravanan Thirumuruganathan - PowerPoint PPT Presentation

Lecture 8: Regression Trees Instructor: Saravanan Thirumuruganathan CSE 5334 Saravanan Thirumuruganathan

Outline 1 Regression 2 Linear Regression 3 Regression Trees CSE 5334 Saravanan Thirumuruganathan

Regression and Linear Regression CSE 5334 Saravanan Thirumuruganathan

Supervised Learning Dataset: Training (labeled) data: D = { ( x i , y i ) } x i ∈ R d Test (unlabeled) data: x 0 ∈ R d Tasks: Classification: y i ∈ { 1 , 2 , . . . , C } Regression: y i ∈ R Objective: Given x 0 , predict y 0 Supervised learning as y i was given during training CSE 5334 Saravanan Thirumuruganathan

Regression Predict cost of house from details Predict job salary from job description Predict SAT, GRE scores Predict future price of Petrol from past prices Predict future GDP of a country, valuation of a company CSE 5334 Saravanan Thirumuruganathan

Linear Regression : One-dimensional Case CSE 5334 Saravanan Thirumuruganathan

Linear Regression: Poverty vs HS Graduation Rate CSE 5334 Saravanan Thirumuruganathan

Residuals CSE 5334 Saravanan Thirumuruganathan

A measure for the best line CSE 5334 Saravanan Thirumuruganathan

Least Squares Line CSE 5334 Saravanan Thirumuruganathan

Prediction CSE 5334 Saravanan Thirumuruganathan

Linear Regression in Higher Dimensions CSE 5334 Saravanan Thirumuruganathan

Linear Regression: Objective Function CSE 5334 Saravanan Thirumuruganathan

Linear Regression: Gradient Descent based Solution CSE 5334 Saravanan Thirumuruganathan

Regression Trees CSE 5334 Saravanan Thirumuruganathan

Predicting Baseball salary data Salary is color-coded from low (blue, green) to high (yellow,red) CSE 5334 Saravanan Thirumuruganathan

Decision tree for Baseball Salary Prediction CSE 5334 Saravanan Thirumuruganathan

Interpreting the Decision Tree CSE 5334 Saravanan Thirumuruganathan

Interpreting the Decision Tree Years is the most important factor in determining Salary, and players with less experience earn lower salaries than more experienced players. Given that a player is less experienced, the number of Hits that he made in the previous year seems to play little role in his Salary . But among players who have been in the major leagues for five or more years, the number of Hits made in the previous year does affect Salary , and players who made more Hits last year tend to have higher salaries. Surely an over-simplification, but compared to a regression model, it is easy to display, interpret and explain CSE 5334 Saravanan Thirumuruganathan

High Level Idea Classification Tree: Quality of split measured by general “Impurity measure” Regression Tree: Quality of split measured by “Squared error” CSE 5334 Saravanan Thirumuruganathan

High Level Idea We divide the feature space into J distinct and non-overlapping regions R 1 , R 2 , . . . , R J For every observation that falls into the region R i , we make same prediction, which is simply the mean of the response values for the training observations in R i Objective: Find boxes R 1 , R 2 , . . . , R J that minimizes Residual Sum of Square (RSS) � J � y R i ) 2 ( y j − � RSS = i =1 j ∈ R i where � y R i is the mean response for the training in the i -th box. CSE 5334 Saravanan Thirumuruganathan

Building Regression Trees We first select the feature X i and the cutpoint s such that splitting the feature space into the regions { X | X i < s } and { X | X i ≥ s } leads to the greatest possible reduction in RSS. Next, we repeat the process, looking for the best attribute and best cutpoint in order to split the data further so as to minimize the RSS within each of the resulting regions. The process continues until a stopping criterion is reached; for instance, we may continue until no region contains more than five observations. CSE 5334 Saravanan Thirumuruganathan

Summary Major Concepts: Geometric interpretation of Classification Decision trees CSE 5334 Saravanan Thirumuruganathan

Slide Material References Slides from ISLR book Slides by Piyush Rai Slides from OpenIntro Statistics book ( http://www.webpages.uidaho.edu/~stevel/251/ slides/os2_slides_07.pdf ) See also the footnotes CSE 5334 Saravanan Thirumuruganathan

Lecture 8: Regression Trees Instructor: Saravanan Thirumuruganathan - PowerPoint PPT Presentation

Lecture 8: Regression Trees Instructor: Saravanan Thirumuruganathan CSE 5334 Saravanan Thirumuruganathan Outline 1 Regression 2 Linear Regression 3 Regression Trees CSE 5334 Saravanan Thirumuruganathan Regression and Linear Regression CSE

Trees Trees CSE, IIT KGP Trees and Spanning Trees Trees and Spanning Trees A graph having

( ( ) ) ( ) ( ) = = Work = h log t n B- B -Trees Trees B B- -Trees

Trees Chapter 11 Chapter Summary Introduction to Trees Applications of Trees Tree

Trees Eric McCreath Overview In this lecture we will explore: general trees, binary trees,

Trees Applied Multivariate Statistics Spring 2012 Overview Intuition for Trees

Outline Univariate Trees 1 Decision Trees Classification Regression Pruning Steven J Zeil

Regression trees DAAG Chapter 11 Learning objectives In this section, we will learn about

Splay Trees and B-Trees CSE 373 Data Structures Lecture 9 Readings Reading Sections

2-3-4 Trees and Red- Black Trees 204 erm CS 16: Balanced Trees 2-3-4 Trees Revealed Nodes

/ + - * * 5 3 2 6 5 2 Examples Binary Trees BSTs Augmenting BinExpr General Trees

Decision Trees Lecture 23 To left or to right 1 Decision Trees 2 Decision Trees A different

Decision Trees Lecture 22 To left or to right 1 Decision Trees 2 Decision Trees A different

Computational Geometry Lecture 8: Range trees 1 Computational Geometry Lecture 8: Range trees

Computational Geometry Lecture 7: Range searching and kd-trees Computational Geometry Lecture 7:

Computational Geometry Lecture 7: Range searching and kd-trees Computational Geometry Lecture 7:

Regression 3: Logistic Regression Marco Baroni Practical Statistics in R Outline Logistic

MA/CSSE 473 Day 35 Greedy Algorithms MA/CSSE 473 Day 35 HW 13 due tomorrow HW 14

The Short Introduction to Imbalanced Classification Zeyu Qin 07.02.2020 Overview Reference

Class 15: Calculation of natural frequency Class 15: Calculation of natural frequency Old Slide

Kindergarten students to another location). Overflow does become an added cost to the district

Decision Tree Learning Based on Machine Learning, T. Mitchell, McGRAW Hill, 1997, ch. 3

1 Real-valued features Non-binary class variable Noise and overfjtting 1.1

Introduction to Artificial Intelligence Decision Trees, Random Forest Janyl Jumadinova October

Ensemble Learning 4/10/17 Ensemble Learning Hypothesis Space: Supervised learning (data has