Logistic Regression
CS60010: Deep Learning
Abir Das, IIT Kharagpur
Jan 22, 23 and 24, 2020
Some Logistics Related Information
§ This Friday (Jan 24), no paper will be presented. It will be a regular lecture.
§ The first surprise quiz is today!!
Surprise Quiz 1
§ The duration of the test is 10 minutes.
§ Question 1: Find the eigenvalues of the following matrix $A$. Clearly mention if you are making any assumption. [2 Marks]
$$A = \begin{pmatrix} 2 & 0 & 0 \\ 1 & 3 & 0 \\ -1 & 0 & 1 \end{pmatrix}$$
§ Question 2: Consider the half-space given by the set of points $S = \{x \in \mathbb{R}^d \mid a^T x \le b\}$. Prove that the half-space is convex. [3 Marks]
Surprise Quiz 1: Answer Keys
§ Question 1: Find the eigenvalues of the following matrix $A$. Clearly mention if you are making any assumption.
$$A = \begin{pmatrix} 2 & 0 & 0 \\ 1 & 3 & 0 \\ -1 & 0 & 1 \end{pmatrix}$$
Use the property of eigenvalues of a triangular matrix: the eigenvalues of a triangular matrix are its diagonal entries, so the eigenvalues are 2, 3 and 1.
§ Question 2: Consider the half-space given by the set of points $S = \{x \in \mathbb{R}^d \mid a^T x \le b\}$. Prove that the half-space is convex.
If $x, y$ belong to $S$, then $a^T x \le b$ and $a^T y \le b$. Now, for $0 \le \theta \le 1$,
$$a^T\{\theta x + (1-\theta)y\} = \theta\, a^T x + (1-\theta)\, a^T y \le \theta b + (1-\theta) b = b,$$
so the convex combination $\theta x + (1-\theta)y$ also belongs to $S$.
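As a quick numerical sanity check (not part of the original slides), the sketch below verifies the eigenvalues with NumPy, assuming the quiz matrix is read row-wise as the lower-triangular matrix shown above:

```python
import numpy as np

# Quiz matrix, read row-wise as lower triangular (assumption).
A = np.array([[ 2, 0, 0],
              [ 1, 3, 0],
              [-1, 0, 1]])

# For a triangular matrix the eigenvalues are the diagonal entries,
# so we expect 2, 3 and 1 (in some order).
print(np.linalg.eigvals(A))
```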
Agenda
§ Understand regression and classification with linear models.
§ Brush up on the concepts of maximum likelihood and its use in understanding linear regression.
§ Use the logistic function for binary classification and estimate logistic regression parameters.
Resources
§ The Elements of Statistical Learning by T. Hastie, R. Tibshirani and J. Friedman. [Link] [Chapters 3 and 4]
§ Artificial Intelligence: A Modern Approach by S. Russell and P. Norvig. [Link] [Chapter 18]
Linear Regression
§ In a regression problem we want to find the relation between some input variables $x$ and output variables $y$, where $x \in \mathbb{R}^d$ and $y \in \mathbb{R}$.
§ Inputs are also often referred to as covariates, predictors and features, while outputs are known as variates, targets and labels.
§ Examples of such input-output pairs can be
◮ { Outside temperature, People inside classroom, Target room temperature | Energy requirement }
◮ { Size, Number of Bedrooms, Number of Floors, Age of the Home | Price }
§ We have a set of N observations of $y$ as $\{y_1, y_2, \cdots, y_N\}$ and the corresponding input variables $\{x_1, x_2, \cdots, x_N\}$.
[Figure: scatter plot of the training pairs $(x^{(i)}, y^{(i)})$.]
Linear Regression
§ The input and output variables are assumed to be related via a relation, known as the hypothesis: $\hat{y} = h_\theta(x)$, where $\theta$ is the parameter vector.
§ The goal is to predict the output variable $\hat{y}^* = f(x^*)$ for an arbitrary value of the input variable $x^*$.
§ Let us start with scalar inputs ($x$) and scalar outputs ($y$).
Univariate Linear Regression
§ Hypothesis: $h_\theta(x) = \theta_0 + \theta_1 x$.
§ Cost Function: Sum of squared errors.
$$J(\theta_0, \theta_1) = \frac{1}{2N} \sum_{i=1}^{N} \left( h_\theta(x^{(i)}) - y^{(i)} \right)^2$$
§ Optimization objective: find model parameters $(\theta_0, \theta_1)$ that will minimize the sum of squared errors.
§ Gradient of the cost function w.r.t. $\theta_0$:
$$\frac{\partial}{\partial \theta_0} J(\theta_0, \theta_1) = \frac{1}{N} \sum_{i=1}^{N} \left( h_\theta(x^{(i)}) - y^{(i)} \right)$$
§ Gradient of the cost function w.r.t. $\theta_1$:
$$\frac{\partial}{\partial \theta_1} J(\theta_0, \theta_1) = \frac{1}{N} \sum_{i=1}^{N} \left( h_\theta(x^{(i)}) - y^{(i)} \right) x^{(i)}$$
§ Apply your favorite gradient-based optimization algorithm.
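A minimal NumPy sketch of these update rules follows. The step size alpha, the iteration count, and the toy data are illustrative choices, not from the slides:

```python
import numpy as np

def gradient_descent(x, y, alpha=0.05, iters=5000):
    """Fit h(x) = theta0 + theta1 * x by batch gradient descent,
    using the two gradients given on the slide."""
    theta0, theta1 = 0.0, 0.0
    N = len(x)
    for _ in range(iters):
        err = theta0 + theta1 * x - y          # h_theta(x^(i)) - y^(i)
        theta0 -= alpha * err.sum() / N        # dJ/dtheta0
        theta1 -= alpha * (err * x).sum() / N  # dJ/dtheta1
    return theta0, theta1

# Toy data around y = 1 + 2x (hypothetical example).
rng = np.random.default_rng(0)
x = rng.uniform(0, 5, size=100)
y = 1.0 + 2.0 * x + rng.normal(0, 0.1, size=100)
print(gradient_descent(x, y))   # roughly (1.0, 2.0)
```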
Univariate Linear Regression
§ These being linear equations in $\theta$, they also have a unique closed-form solution.
$$\theta_1 = \frac{N \sum\limits_{i=1}^{N} x^{(i)} y^{(i)} - \left( \sum\limits_{i=1}^{N} x^{(i)} \right) \left( \sum\limits_{i=1}^{N} y^{(i)} \right)}{N \sum\limits_{i=1}^{N} \left( x^{(i)} \right)^2 - \left( \sum\limits_{i=1}^{N} x^{(i)} \right)^2}$$
$$\theta_0 = \frac{1}{N} \left( \sum_{i=1}^{N} y^{(i)} - \theta_1 \sum_{i=1}^{N} x^{(i)} \right)$$
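The two closed-form expressions translate directly to NumPy; this sketch simply transcribes the formulas above (function name is mine):

```python
import numpy as np

def closed_form_univariate(x, y):
    """Closed-form least-squares fit from the two formulas above."""
    N = len(x)
    theta1 = (N * (x * y).sum() - x.sum() * y.sum()) \
             / (N * (x ** 2).sum() - x.sum() ** 2)
    theta0 = (y.sum() - theta1 * x.sum()) / N
    return theta0, theta1
```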
Multivariate Linear Regression
§ We can easily extend to multivariate linear regression problems, where $x \in \mathbb{R}^d$.
§ Hypothesis: $h_\theta(x) = \theta_0 + \theta_1 x_1 + \theta_2 x_2 + \cdots + \theta_d x_d$. For convenience of notation, define $x_0 = 1$.
§ Thus $h$ is simply the dot product of the parameters and the input vector: $h_\theta(x) = \theta^T x$.
§ Cost Function: Sum of squared errors.
$$J(\theta) = J(\theta_0, \theta_1, \cdots, \theta_d) = \frac{1}{2N} \sum_{i=1}^{N} \left( \theta^T x^{(i)} - y^{(i)} \right)^2 \qquad (1)$$
§ We will use the following to write the cost function in a compact matrix-vector notation: $h_\theta(x) = \theta^T x = x^T \theta$.
Multivariate Linear Regression
$$\begin{bmatrix} \hat{y}^{(1)} \\ \hat{y}^{(2)} \\ \vdots \\ \hat{y}^{(N)} \end{bmatrix} = \begin{bmatrix} h_\theta(x^{(1)}) \\ h_\theta(x^{(2)}) \\ \vdots \\ h_\theta(x^{(N)}) \end{bmatrix} = \begin{bmatrix} x_0^{(1)} & x_1^{(1)} & x_2^{(1)} & \cdots & x_d^{(1)} \\ x_0^{(2)} & x_1^{(2)} & x_2^{(2)} & \cdots & x_d^{(2)} \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ x_0^{(N)} & x_1^{(N)} & x_2^{(N)} & \cdots & x_d^{(N)} \end{bmatrix} \begin{bmatrix} \theta_0 \\ \theta_1 \\ \theta_2 \\ \vdots \\ \theta_d \end{bmatrix} \qquad (2)$$
$$\hat{y} = X\theta$$
Here, $X$ is an $N \times (d+1)$ matrix with each row an input vector, and $\hat{y}$ is an $N$-length vector of the outputs in the training set.
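Building $X$ and evaluating the hypothesis for all examples then reduces to a single matrix product; a small sketch (function and variable names are mine, not the slides'):

```python
import numpy as np

def design_matrix(X_raw):
    """Prepend the x_0 = 1 column so that X is N x (d+1)."""
    N = X_raw.shape[0]
    return np.hstack([np.ones((N, 1)), X_raw])

# One matrix product computes h_theta(x^(i)) for every training example:
# y_hat = design_matrix(X_raw) @ theta
```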
Multivariate Linear Regression
§ Eqn. (1) gives,
$$\begin{aligned} J(\theta) &= \frac{1}{2N} \sum_{i=1}^{N} \left( \theta^T x^{(i)} - y^{(i)} \right)^2 = \frac{1}{2N} \sum_{i=1}^{N} \left( \hat{y}^{(i)} - y^{(i)} \right)^2 \qquad (3) \\ &= \frac{1}{2N} \| \hat{y} - y \|_2^2 = \frac{1}{2N} \left( \hat{y} - y \right)^T \left( \hat{y} - y \right) \\ &= \frac{1}{2N} \left( X\theta - y \right)^T \left( X\theta - y \right) = \frac{1}{2N} \left( \theta^T X^T X \theta - \theta^T X^T y - y^T X \theta + y^T y \right) \\ &= \frac{1}{2N} \left( \theta^T X^T X \theta - \left( X^T y \right)^T \theta - \left( X^T y \right)^T \theta + y^T y \right) \\ &= \frac{1}{2N} \left( \theta^T X^T X \theta - 2 \left( X^T y \right)^T \theta + y^T y \right) \end{aligned}$$
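A quick check (illustrative, with random data) that the final matrix form agrees with the per-example sum in Eqn. (1):

```python
import numpy as np

rng = np.random.default_rng(1)
N, d = 50, 3
X = np.hstack([np.ones((N, 1)), rng.normal(size=(N, d))])  # x_0 = 1 column
y = rng.normal(size=N)
theta = rng.normal(size=d + 1)

# Per-example sum of squared errors, Eqn. (1).
J_sum = sum((theta @ X[i] - y[i]) ** 2 for i in range(N)) / (2 * N)

# Compact matrix form from the last line of the derivation.
J_mat = (theta @ X.T @ X @ theta - 2 * (X.T @ y) @ theta + y @ y) / (2 * N)

assert np.isclose(J_sum, J_mat)
```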
Multivariate Linear Regression
§ Equating the gradient of the cost function to 0,
$$\nabla_\theta J(\theta) = \frac{1}{2N} \left( 2 X^T X \theta - 2 X^T y \right) = 0$$
$$X^T X \theta - X^T y = 0$$
$$\theta = \left( X^T X \right)^{-1} X^T y \qquad (4)$$
§ This gives a closed-form solution, but another option is to use an iterative solution (just like the univariate case).
$$\frac{\partial J(\theta)}{\partial \theta_j} = \frac{1}{N} \sum_{i=1}^{N} \left( h_\theta(x^{(i)}) - y^{(i)} \right) x_j^{(i)}$$
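Eq. (4) in NumPy: solving the linear system $X^T X \theta = X^T y$ directly is numerically preferable to forming the explicit inverse. A minimal sketch, with the function name being mine:

```python
import numpy as np

def normal_equation(X, y):
    """Least-squares parameters via Eq. (4): X^T X theta = X^T y.
    np.linalg.solve avoids explicitly inverting X^T X."""
    return np.linalg.solve(X.T @ X, X.T @ y)

# np.linalg.lstsq(X, y, rcond=None) is a more robust alternative
# when X^T X is ill-conditioned.
```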
Multivariate Linear Regression
§ Iterative gradient descent needs to perform many iterations, and the step-size parameter must be chosen judiciously. But it works well even when the number of features ($d$) is large.
§ For the least-squares solution, there is no need to choose a step-size parameter or to iterate. But evaluating $\left( X^T X \right)^{-1}$ can be slow if $d$ is large.