  1. Classification: K-Nearest Neighbors 3/27/17

  2. Recall: Machine Learning Taxonomy Supervised Learning • For each input, we know the right output. • Regression • Outputs are continuous. • Classification • Outputs come from a (relatively small) discrete set. Unsupervised Learning • We just have a bunch of inputs. Semi-Supervised Learning • We have inputs, and occasional feedback.

  3. Classification Examples Labeling the city an apartment is in. Labeling hand-written digits.

  4. Hypothesis Space for Classification • The hypothesis space is the types of functions we can learn. • This is partly defined by the problem, and partly by the learning algorithm. • In classification we have: • Continuous inputs • Discrete output labels • The algorithm will constrain the possible functions from input to output. • Perceptrons learn linear decision boundaries.

  5. K-nearest neighbors algorithm Training: • Store all of the training points and their labels. • Can use a data structure like a kd-tree to speed up localized lookup. Prediction: • Find the k training inputs closest to the test input. • Output the most common label among them.
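
A minimal sketch of the algorithm above, assuming the data live in NumPy arrays (the function name knn_predict and the k=3 default are ours, not the slides'):

```python
import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x, k=3):
    # "Training" is just storing X_train and y_train; all work happens at query time.
    dists = np.linalg.norm(X_train - x, axis=1)   # Euclidean distance to every training point
    nearest = np.argsort(dists)[:k]               # indices of the k closest training points
    votes = Counter(y_train[nearest])             # count labels among those neighbors
    return votes.most_common(1)[0][0]             # predict the most common label
```

This brute-force version scans every training point per query; a kd-tree (e.g. scipy.spatial.cKDTree) would replace the distance scan with a localized lookup.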

  6. KNN implementation decisions (and possible answers) • How should we measure distance? • (Euclidean distance between input vectors.) • What if there’s a tie for the nearest points? • (Include all points that are tied.) • What if there’s a tie for the most-common label? • (Remove the most-distant point until a plurality is achieved.) • What if there’s a tie for both? • (We need some arbitrary tie-breaking rule.)
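
One way to code the slide's tie-breaking rule for labels (drop the most distant neighbor and re-vote until a plurality emerges); the function name and list-based representation are illustrative:

```python
from collections import Counter

def vote_with_tiebreak(neighbor_labels):
    # neighbor_labels: labels of the k neighbors, sorted from closest to farthest.
    labels = list(neighbor_labels)
    while labels:
        counts = Counter(labels).most_common()
        if len(counts) == 1 or counts[0][1] > counts[1][1]:
            return counts[0][0]        # a clear plurality exists
        labels.pop()                   # remove the most distant remaining neighbor
    return None                        # empty neighbor list (shouldn't happen for k >= 1)
```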

  7. Weighted nearest neighbors • Idea: closer points should matter more. • Solution: weight each vote by the inverse of the distance, 1/d. • Instead of contributing one vote for its label, each neighbor contributes 1/d votes for its label.
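
A sketch of distance-weighted voting with 1/d weights (the small eps guarding against division by zero is our addition):

```python
import numpy as np
from collections import defaultdict

def weighted_knn_predict(X_train, y_train, x, k=3, eps=1e-12):
    dists = np.linalg.norm(X_train - x, axis=1)
    nearest = np.argsort(dists)[:k]
    votes = defaultdict(float)
    for i in nearest:
        votes[y_train[i]] += 1.0 / (dists[i] + eps)   # closer neighbors cast larger votes
    return max(votes, key=votes.get)
```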

  8. Why do we even need k neighbors? Idea: if we’re weighting by distance, we can give all training points a vote. • Points that are far away will just have really small weight. Why might this be a bad idea? • Slow: we have to sum over every point in the training set. • If we’re using a kd-tree, we can get the neighbors quickly and sum over a small set.
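
The all-points variant is the same vote without the top-k cut; as the slide notes, every query now touches the whole training set:

```python
import numpy as np
from collections import defaultdict

def all_points_predict(X_train, y_train, x, eps=1e-12):
    dists = np.linalg.norm(X_train - x, axis=1)
    votes = defaultdict(float)
    for label, d in zip(y_train, dists):
        votes[label] += 1.0 / (d + eps)   # far-away points still vote, but with tiny weight
    return max(votes, key=votes.get)
```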

  9. The same ideas can apply to regression. • K-nearest neighbors setting: • Supervised learning (we know the correct output for each training point). • Classification (small number of discrete labels). vs. • Locally-weighted regression setting: • Supervised learning (we know the correct output for each training point). • Regression (outputs are continuous).

  10. Locally-Weighted Average • Instead of taking a majority vote, average the y-values. • We could average over the k nearest neighbors. • We could weight the average by distance. • Better yet, do both.
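
A sketch that does both at once: average the k nearest neighbors' y-values, weighted by inverse distance (the function name and k=5 default are illustrative):

```python
import numpy as np

def local_weighted_average(X_train, y_train, x, k=5, eps=1e-12):
    dists = np.linalg.norm(X_train - x, axis=1)
    nearest = np.argsort(dists)[:k]
    w = 1.0 / (dists[nearest] + eps)                 # inverse-distance weights
    return np.dot(w, y_train[nearest]) / w.sum()     # weighted mean of neighbor outputs
```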

  11. Locally-weighted (linear) regression Least squares linear regression solves the following problem: • Select weights w_0, …, w_D (one per input dimension, plus an intercept) to minimize the squared error: Σ_i (y_i − (w_0 + w_1 x_i,1 + … + w_D x_i,D))². • Instead, we can minimize the distance-weighted squared error: Σ_i θ_i (y_i − (w_0 + w_1 x_i,1 + … + w_D x_i,D))², where the weight θ_i shrinks as training point i gets farther from the query point.
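
A sketch of solving the distance-weighted problem above in closed form; the Gaussian kernel exp(−d²/2τ²) for θ_i and the bandwidth τ are assumptions, not something the slide specifies:

```python
import numpy as np

def lwr_predict(X_train, y_train, x, tau=1.0):
    n, d = X_train.shape
    Xb = np.hstack([np.ones((n, 1)), X_train])       # prepend a column of 1s for w_0
    xb = np.concatenate([[1.0], x])
    dists = np.linalg.norm(X_train - x, axis=1)
    theta = np.exp(-dists**2 / (2 * tau**2))         # assumed kernel: weight decays with distance
    W = np.diag(theta)
    # Weighted normal equations: minimize sum_i theta_i * (y_i - w . x_i)^2
    w = np.linalg.solve(Xb.T @ W @ Xb, Xb.T @ W @ y_train)
    return xb @ w
```

Note that a new weight vector w is fit for every query point, which is what makes the regression "local".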

  12. Decision Trees • Solve classification problems by repeatedly splitting the space of possible inputs; store splits in a tree. • To classify a new input, compare it to successive splits until a leaf (with a label) is reached. Who plays tennis when it’s raining but not when it’s humid?
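
Classifying with an already-built tree is just a walk from the root to a leaf; the nested-dict representation below is an assumed encoding, not the slide's:

```python
def tree_predict(node, x):
    # Internal nodes hold a feature index and threshold; leaves hold a label.
    while "label" not in node:
        branch = "left" if x[node["feature"]] < node["threshold"] else "right"
        node = node[branch]
    return node["label"]
```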

  13. Building a Decision Tree Greedy algorithm: 1. Within a region, pick the best: • feature to split on • value at which to split it 2. Sort the training data into the sub-regions. 3. Recursively build decision trees for the sub-regions. (Figure: apartment data split by elevation and $ / sq. ft.) Does this give us an optimal decision tree?
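
A rough sketch of the greedy splitting loop, using the same nested-dict trees as above; scoring splits by misclassification count is our simplification (real implementations typically use entropy or Gini impurity):

```python
import numpy as np
from collections import Counter

def build_tree(X, y, depth=0, max_depth=3):
    if depth == max_depth or len(set(y)) == 1:
        return {"label": Counter(y).most_common(1)[0][0]}     # leaf: majority label
    best = None
    for f in range(X.shape[1]):                               # try every feature...
        for t in np.unique(X[:, f]):                          # ...and every candidate split value
            left, right = y[X[:, f] < t], y[X[:, f] >= t]
            if len(left) == 0 or len(right) == 0:
                continue
            # Count the points missed by the majority label on each side.
            err = (len(left) - Counter(left).most_common(1)[0][1]
                   + len(right) - Counter(right).most_common(1)[0][1])
            if best is None or err < best[0]:
                best = (err, f, t)
    if best is None:
        return {"label": Counter(y).most_common(1)[0][0]}
    _, f, t = best
    mask = X[:, f] < t
    return {"feature": f, "threshold": t,
            "left": build_tree(X[mask], y[mask], depth + 1, max_depth),
            "right": build_tree(X[~mask], y[~mask], depth + 1, max_depth)}
```

Because each split is chosen locally, the result is not guaranteed to be the globally optimal tree, which is the point of the slide's closing question.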

  14. Compare the Hypothesis Spaces • K-nearest neighbors • Decision trees • Locally-weighted regression Considerations: • Inputs • Outputs • Possible mappings
