Decision Trees
TJ Machine Learning Club
Classification vs. Regression

Classification:
- Classifying photos of fruits
- Determining whether a tumor is benign or malignant

Regression:
- Predicting COVID-19 cases given demographic data
- Predicting house prices given house features

Source: https://medium.com/datasoc/whats-the-problem-1ff8b338094b
Features vs. Labels

Features (like x): characteristics of the input. In the picture, the features are whether or not the patient smokes (smoke), consumes alcohol (alco), and performs physical activity (active).

Label (like y): the prediction or classification of the input. Here, whether or not the patient has cardiovascular disease (cardio).
Training and Testing Datasets

- Training data has both features and labels
- Testing data only has the features; we need to predict cardio
What is a Decision Tree?

- A decision tree is just a series of questions
- The key to creating a decision tree is asking the right questions
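The "series of questions" idea can be sketched as nested comparisons. This is a minimal hand-written tree using the feature names from the earlier slide; the particular questions and leaf labels are made up for illustration, not taken from the slides:

```python
def predict_cardio(patient: dict) -> int:
    """A hand-written decision tree: each internal node is a yes/no
    question about a feature, and each leaf is a predicted label
    (1 = has cardiovascular disease, 0 = does not).
    The tree structure here is illustrative only."""
    if patient["smoke"]:          # Question 1: does the patient smoke?
        if patient["active"]:     # Question 2: physically active?
            return 0
        return 1
    if patient["alco"]:           # Question 2': consumes alcohol?
        return 1
    return 0

# A non-smoking, non-drinking, active patient reaches the final leaf.
print(predict_cardio({"smoke": 0, "alco": 0, "active": 1}))  # -> 0
```

Learning a good tree amounts to choosing which question to ask at each node, which is what the impurity and information-gain machinery below is for.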
Gini Impurity

A measure of how "messy" some collection of data is:

Gini(i) = 1 - Σ p(k|i)²   (summing over classes k = 1, ..., c)

- i = some data
- k = class index
- c = total number of classes
- p(k|i) = probability of randomly selecting an item of class k from data i
Ex. Gini Impurity

Calculating the Gini impurity for three groups of data, where the two possible classes are blue and red, gives 0.444, 0.5, and 0. For example, a group that is 1/3 one class and 2/3 the other has impurity 1 - (1/3)² - (2/3)² ≈ 0.444. An impurity of 0 is the minimum possible (a pure group), and 0.5 is the maximum possible with two classes (an even split).
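The three impurities above can be checked with a direct implementation of the formula (a sketch; the example groups are one composition consistent with the slide's values):

```python
def gini(labels):
    """Gini impurity: 1 - sum over classes k of p(k|i)^2."""
    n = len(labels)
    if n == 0:
        return 0.0
    counts = {}
    for label in labels:
        counts[label] = counts.get(label, 0) + 1
    return 1.0 - sum((c / n) ** 2 for c in counts.values())

print(round(gini(["blue", "red", "red"]), 3))  # -> 0.444 (1/3 vs 2/3 split)
print(gini(["blue", "red", "blue", "red"]))    # -> 0.5   (maximum for 2 classes)
print(gini(["blue", "blue", "blue"]))          # -> 0.0   (pure group, minimum)
```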
Information Gain

IG(D_p, f) = I(D_p) - (N_left / N_p) · I(D_left) - (N_right / N_p) · I(D_right)

- D_p, D_left, D_right are the parent node, left node, and right node datasets respectively
- I is a measure of impurity (like Gini impurity)
- N_p, N_left, and N_right are the number of items in the parent, left, and right nodes respectively
- f is the question you are asking to create the split
Let's figure out which question is better to ask to split six athletes according to sport (T = Tennis Player, B = Basketball Player; three of each): "Age > 27?" or "Height > 6'4''?"

Age > 27?
- Parent group (3 T, 3 B): impurity = 1 - (1/2)² - (1/2)² = 1/2
- "Yes" group of 3 (2 of one class, 1 of the other): impurity = 1 - (1/3)² - (2/3)² = 4/9
- "No" group of 3 (likewise mixed): impurity = 4/9
- Information gain = 1/2 - (3/6)(4/9) - (3/6)(4/9) = 1/18 ≈ 0.055556

Height > 6'4''?
- Parent group: impurity = 1/2
- "Yes" group (T, T): impurity = 0
- "No" group (B, B, B, T): impurity = 1 - (3/4)² - (1/4)² = 3/8
- Information gain = 1/2 - (2/6)(0) - (4/6)(3/8) = 0.25

Since its information gain is higher, "Height > 6'4''?" is the better question to ask to classify our athletes.
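Both gains can be verified with a small information-gain function built on Gini impurity (a sketch; `gini` redefined here so the snippet is self-contained):

```python
def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    return 1.0 - sum((labels.count(c) / n) ** 2 for c in set(labels))

def information_gain(parent, left, right):
    """IG = I(D_p) - (N_left/N_p) * I(D_left) - (N_right/N_p) * I(D_right)."""
    n = len(parent)
    return (gini(parent)
            - len(left) / n * gini(left)
            - len(right) / n * gini(right))

athletes = ["T", "B", "T", "B", "B", "T"]  # 3 tennis, 3 basketball

# "Age > 27?" produces two mixed groups of 3:
print(round(information_gain(athletes, ["T", "B", "B"], ["B", "T", "T"]), 6))  # -> 0.055556

# "Height > 6'4''?" isolates the two tall tennis players:
print(round(information_gain(athletes, ["T", "T"], ["B", "B", "B", "T"]), 4))  # -> 0.25
```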
How to Come Up with Values for the Questions?

- The most straightforward way: try out different values taken from the items in your training dataset as candidate thresholds
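Trying values from the training data itself can be sketched as follows, using Gini impurity to score each candidate threshold. The heights below are illustrative numbers invented for the athlete example (6'4'' = 76 inches), not data from the slides:

```python
def gini(labels):
    n = len(labels)
    return 1.0 - sum((labels.count(c) / n) ** 2 for c in set(labels))

def best_threshold(values, labels):
    """Try each value appearing in the training data as a candidate
    threshold and return the one with the highest information gain."""
    parent_impurity = gini(labels)
    n = len(labels)
    best = (None, 0.0)  # (threshold, information gain)
    for t in sorted(set(values)):
        left = [lab for v, lab in zip(values, labels) if v > t]
        right = [lab for v, lab in zip(values, labels) if v <= t]
        if not left or not right:   # split puts everything on one side
            continue
        gain = (parent_impurity
                - len(left) / n * gini(left)
                - len(right) / n * gini(right))
        if gain > best[1]:
            best = (t, gain)
    return best

# Hypothetical heights (inches) for the six athletes and their sports:
heights = [78, 77, 72, 74, 73, 70]
sports  = ["T", "T", "B", "B", "B", "T"]
t, g = best_threshold(heights, sports)
print(t, round(g, 4))  # -> 74 0.25
```

With these numbers the search recovers the "tall players are tennis players" split from the example, with the same information gain of 0.25.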
Overfitting

Techniques to prevent overfitting in decision trees:
- Continue recursively generating nodes only if the information gain is larger than some threshold (e.g. 0.1)
- After creating the tree, prune all nodes that are at a depth greater than some threshold
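The first technique (stop splitting once the best information gain falls below a threshold) can be sketched as a recursive builder for a single numeric feature. Everything here is an illustrative skeleton under the slide's assumptions, not code from the lecture:

```python
def gini(labels):
    n = len(labels)
    return 1.0 - sum((labels.count(c) / n) ** 2 for c in set(labels))

def majority(labels):
    """Most common label: used as the prediction at a leaf."""
    return max(set(labels), key=labels.count)

def build_tree(values, labels, min_gain=0.1):
    """Recursively split one numeric feature, but make a leaf
    whenever the best available split's information gain is
    below min_gain (the pre-pruning threshold from the slide)."""
    parent_impurity = gini(labels)
    n = len(labels)
    best = None  # (gain, threshold, left pairs, right pairs)
    for t in sorted(set(values)):
        left = [(v, lab) for v, lab in zip(values, labels) if v > t]
        right = [(v, lab) for v, lab in zip(values, labels) if v <= t]
        if not left or not right:
            continue
        gain = (parent_impurity
                - len(left) / n * gini([lab for _, lab in left])
                - len(right) / n * gini([lab for _, lab in right]))
        if best is None or gain > best[0]:
            best = (gain, t, left, right)
    if best is None or best[0] < min_gain:  # gain too small: stop here
        return majority(labels)             # leaf node
    gain, t, left, right = best
    return {"threshold": t,
            "left": build_tree([v for v, _ in left], [lab for _, lab in left], min_gain),
            "right": build_tree([v for v, _ in right], [lab for _, lab in right], min_gain)}

# Same hypothetical athlete data as before; the root split lands at
# height > 74, and low-gain branches collapse into leaves.
heights = [78, 77, 72, 74, 73, 70]
sports  = ["T", "T", "B", "B", "B", "T"]
print(build_tree(heights, sports))
```

Raising `min_gain` makes the tree shallower (more aggressive pre-pruning); setting it to 0 lets the tree grow until no split improves purity at all.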