Info 1301 Linear Regression (cont.) 14 November 2016 Prof. - PowerPoint PPT Presentation

Info ¡1301 Linear ¡Regression ¡(cont.) 14 ¡November ¡2016 Prof. ¡Michael ¡Paul Prof. ¡William ¡Aspray

Least ¡Squares ¡Regression • We ¡just ¡eyeballed ¡the ¡linear ¡regression ¡in ¡the ¡past. • Now ¡we ¡want ¡to ¡give ¡a ¡rigorous ¡approach ¡for ¡finding ¡the ¡line. • Want ¡the ¡line ¡to ¡fit ¡the ¡data ¡as ¡well ¡as ¡possible. • This ¡means ¡reducing ¡the ¡residuals ¡as ¡much ¡as ¡possible • Could ¡minimize ¡the ¡sum ¡of ¡the ¡absolute ¡values ¡of ¡the ¡residuals. • More ¡commonly ¡minimize ¡the ¡sum ¡of ¡the ¡squares ¡of ¡the ¡residuals. • Common, ¡emphasizes ¡the ¡problems ¡with ¡individual ¡large ¡residuals, ¡easier ¡to ¡ calculate

Root ¡mean ¡square ¡error • RMSE ¡= ¡ [Σe 2 /n] .5 • Do ¡an ¡example: ¡residuals ¡are ¡1,2,-‑1,2,-‑2 • RMSE ¡= ¡[[(1) 2 +(2) 2 +(-‑1) 2 +(2) 2 +(-‑2) 2 ]/5] .5 = ¡[[1+4+1+4+4]/5] .5 = ¡[14/5] .5 = ¡1.67 ¡approx • Generally, ¡68% ¡of ¡residuals ¡are ¡within ¡1 ¡RSME ¡of ¡regression ¡line. • Generally ¡, ¡95% ¡of ¡residuals ¡are ¡within ¡2 ¡RSME ¡of ¡regression ¡line. • In ¡this ¡example, ¡2/5 ¡within ¡1 ¡RSME, ¡5/5 ¡within ¡2 ¡RMSE. • Look ¡familiar? ¡ • In ¡nice ¡cases, ¡residuals ¡form ¡a ¡normal ¡distribution ¡around ¡the ¡least ¡square ¡line ¡ • Can ¡use ¡Z-‑table.

Nice ¡Conditions ¡for ¡Least ¡Squared ¡Line • Eyeball ¡that ¡a ¡line ¡fits ¡the ¡data ¡(as ¡we ¡have ¡done ¡before) • Random ¡residuals ¡not ¡too ¡far ¡from ¡line ¡(outliers ¡can ¡be ¡a ¡problem) • The ¡size ¡of ¡the ¡residuals ¡roughly ¡constant ¡(not ¡those ¡that ¡get ¡larger ¡at ¡ one ¡end) • No ¡repeating ¡patterns ¡in ¡the ¡data ¡(e.g. ¡time ¡series)[Look ¡at ¡examples ¡ on ¡p. ¡342 ¡(Fig. ¡7.13)

Formulas ¡for ¡the ¡line ¡y=mx+b for ¡linear ¡ regression • b = ¡[( Σy)(Σx 2 ) ¡– (Σx) (Σxy)]/[n(Σx 2 ) ¡–(Σx) 2 ] • m ¡= ¡[n(Σxy) ¡– (Σx)(Σy)]/[n(Σx 2 ) ¡– (Σx) 2 ] • n = ¡number ¡of ¡points

Diabetes ¡Example ¡– Relationship ¡of ¡Age ¡to ¡ Glucose ¡Level • Frederic ¡Grant ¡Banting ¡(1891 ¡– 1941) ¡born ¡on ¡this ¡day ¡– Nobel ¡Prize ¡for ¡treating ¡diabetes ¡ with ¡insulin • Diabetes ¡is ¡one ¡of ¡the ¡most ¡common ¡diseases ¡– 8% ¡of ¡world’s ¡population • Serious ¡– doubles ¡your ¡risk ¡of ¡early ¡death; ¡damages ¡eyes, ¡kidneys, ¡blood ¡vessels, ¡etc. • High ¡blood ¡sugar ¡either ¡because ¡pancreas ¡does ¡not ¡create ¡enough ¡insulin ¡to ¡control ¡ glucose ¡(type ¡1) ¡or ¡insulin ¡resistance ¡where ¡cells ¡do ¡not ¡respond ¡properly ¡to ¡insulin ¡(type ¡ 2) • Risk ¡of ¡developing ¡diabetes ¡increases ¡with ¡age • Observe ¡the ¡data ¡set ¡of ¡6 ¡points: (43,99), ¡(21,65), ¡(25,79), ¡(42,75), ¡(57,87), ¡(59,81) Plot ¡the ¡points. Sketch ¡in ¡the ¡line ¡by ¡sight. Eyeball ¡the ¡correlation.

Diabetes ¡Example ¡(cont.) subject Age (x) Glucose ¡(y) xy xx yy 1 43 99 4257 1849 9801 2 21 65 1365 441 4225 3 25 79 1975 625 6241 4 42 75 3150 1764 5625 5 57 87 4959 3249 7569 6 59 81 4779 3481 6561 Σ 247 486 20485 11409 40022

Plugging ¡in ¡to ¡the ¡formulas • b ¡= ¡[(486)(11409) ¡– (247)(20485)]/[(6)(11409) ¡– (247) 2 ] = ¡484947/7445 = ¡65.14 m ¡= ¡[(6)(20485) ¡– (247)(486)]/[(6)(11409) ¡– (247) 2 ] = ¡2868/7445 = ¡.385225 Y ¡=.385225x ¡+ ¡65.14 What ¡is ¡the ¡meaning ¡of ¡this ¡equation?

Homework ¡Problem • Use ¡Minitab ¡Express ¡to ¡calculate ¡the ¡line ¡and ¡the ¡root ¡mean ¡square ¡ error ¡for ¡the ¡diabetes ¡example ¡that ¡we ¡just ¡did

The ¡word ¡‘regression’ • Root ¡is ¡‘regress’ • 1550s ¡– return ¡to ¡a ¡former ¡state ¡– from ¡Latin ¡ regressus • 1823 ¡– to ¡move ¡backward • 1926 ¡– to ¡return ¡to ¡an ¡earlier ¡(and ¡usually ¡worse ¡or ¡less ¡developed) ¡ state ¡of ¡life • Statistics ¡– move ¡away ¡from ¡the ¡random ¡variation ¡in ¡a ¡sample ¡to ¡its ¡ primitive ¡state • Let’s ¡consider ¡a ¡different ¡place ¡where ¡’regression’ ¡appears ¡in ¡ statistical ¡discussion • Use ¡a ¡baseball ¡example

Info 1301 Linear Regression (cont.) 14 November 2016 Prof. - PowerPoint PPT Presentation

Info 1301 Linear Regression (cont.) 14 November 2016 Prof. Michael Paul Prof. William Aspray Least Squares Regression We just eyeballed the linear regression in the

Bayesian linear regression (cont.) Dr. Jarad Niemi STAT 544 - Iowa State University April 20,

Regression 1: Linear Regression Marco Baroni Practical Statistics in R Outline Classic linear

Why Transformers Work. More info blablabla More info blablabla More info blablabla More

Linear Regression Part 2: Residuals and Errors INFO-1301, Quantitative Reasoning 1 University of

Linear regression Linear regression is a simple approach to supervised learning. It assumes

Linear regression Linear regression is a simple approach to supervised learning. It assumes

Regression Methods 1. Linear Regression and Logistic Regression: definitions, and a common

Linear Regression Part 1: Fitting Lines INFO-1301, Quantitative Reasoning 1 University of

Linear regression How to measure the accuracy of linear regression models Linear Regression

Linear Models for Regression Greg Mori - CMPT 419/726 Bishop PRML Ch. 3 Regression Linear Basis

STAT 213 Simple Linear Regression I Colin Reimer Dawson Oberlin College 5 October 2016 Outline

Why LINEX Our Explanation (cont-d) Our Explanation (cont-d) (Linear Exponential) Our

Linear regression Linear regression is a simple approach to supervised learning. It assumes

Logistic regression CS 446 1. Linear classifiers Linear regression Last two lectures, we studied

LINEAR REGRESSION LINEAR REGRESSION - FROM A MACHINE LEARNING POINT OF VIEW 25 SIMPLE LINEAR

Notes on the Non-linear Regression The model Non-linear regression models, like ordinary linear

Discrepancies in Self-Report Diabetes Survey Questions y Q using NHANES, NHIS, and CHIS data

Removing Hidden Confounding by Experimental Grounding Nathan Kallus Aahlad Manas Puli Uri

Part D Senior Savings Model Overview March 12, 2020 CMS Innovation Center 2 CMS Innovation

Attention Deficit/Hyperactivity Disorder: Medication Therapy Management Kristen Hillebrand,

Disclosure Financial relationships: 1 Employee of New Mexico State University Coordinator of the

complex lesions Johannes Waltenberger, MD, F.E.S.C. Uniklinik Mnster Mnster, Germany

Appendix A: Analysis slides of each case study a. Type 2 Diabetes b. Stroke c. Hip Osteoarthritis

CHAMPION PHOENIX Deepak L. Bhatt, MD, MPH, Gregg W. Stone, MD, Kenneth W. Mahaffey, MD, C.

Info 1301 Linear Regression (cont.) 14 November 2016 Prof. - PowerPoint PPT Presentation

Info 1301 Linear Regression (cont.) 14 November 2016 Prof. Michael Paul Prof. William Aspray Least Squares Regression We just eyeballed the linear regression in the

Bayesian linear regression (cont.) Dr. Jarad Niemi STAT 544 - Iowa State University April 20,

Regression 1: Linear Regression Marco Baroni Practical Statistics in R Outline Classic linear

Why Transformers Work. *More info blablabla *More info blablabla *More info blablabla *More

Linear Regression Part 2: Residuals and Errors INFO-1301, Quantitative Reasoning 1 University of

Linear regression Linear regression is a simple approach to supervised learning. It assumes

Linear regression Linear regression is a simple approach to supervised learning. It assumes

Regression Methods 1. Linear Regression and Logistic Regression: definitions, and a common

Linear Regression Part 1: Fitting Lines INFO-1301, Quantitative Reasoning 1 University of

Linear regression How to measure the accuracy of linear regression models Linear Regression

Linear Models for Regression Greg Mori - CMPT 419/726 Bishop PRML Ch. 3 Regression Linear Basis

STAT 213 Simple Linear Regression I Colin Reimer Dawson Oberlin College 5 October 2016 Outline

Why LINEX Our Explanation (cont-d) Our Explanation (cont-d) (Linear Exponential) Our

Linear regression Linear regression is a simple approach to supervised learning. It assumes

Logistic regression CS 446 1. Linear classifiers Linear regression Last two lectures, we studied

LINEAR REGRESSION LINEAR REGRESSION - FROM A MACHINE LEARNING POINT OF VIEW 25 SIMPLE LINEAR

Notes on the Non-linear Regression The model Non-linear regression models, like ordinary linear

Discrepancies in Self-Report Diabetes Survey Questions y Q using NHANES, NHIS, and CHIS data

Removing Hidden Confounding by Experimental Grounding Nathan Kallus Aahlad Manas Puli Uri

Part D Senior Savings Model Overview March 12, 2020 CMS Innovation Center 2 CMS Innovation

Attention Deficit/Hyperactivity Disorder: Medication Therapy Management Kristen Hillebrand,

Disclosure Financial relationships: 1 Employee of New Mexico State University Coordinator of the

complex lesions Johannes Waltenberger, MD, F.E.S.C. Uniklinik Mnster Mnster, Germany

Appendix A: Analysis slides of each case study a. Type 2 Diabetes b. Stroke c. Hip Osteoarthritis

CHAMPION PHOENIX Deepak L. Bhatt, MD, MPH, Gregg W. Stone, MD, Kenneth W. Mahaffey, MD, C.

Why Transformers Work. More info blablabla More info blablabla More info blablabla More