Classifier Inspired Scaling for Training Set Selection



  1. Classifier Inspired Scaling for Training Set Selection Walter Bennette DISTRIBUTION A: Approved for public release; distribution unlimited. 16 May 2016. Case #88ABW-2016-2511

  2. Outline · Instance-based classification · Training set selection - ENN - DROP3 - CHC · Scaling approaches - Stratified - Classifier inspired · Experimental results 2/46

  3. Instance-based classification

  4–13. Instance-based classification (figure-only slides) 4–13/46

  14. Instance-based classification What are instance-based classifiers used for? · Classification of gene expression · Content-based image retrieval · Text categorization · Load forecasting assistant for a power company 14/46
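
To ground the idea, here is a minimal sketch of k-nearest-neighbor classification, the instance-based method used throughout this deck. NumPy, the Euclidean distance, and the majority-vote tie handling are my own choices, not taken from the slides.

```python
import numpy as np

def knn_predict(X_train, y_train, x, k=3):
    """Classify x by a majority vote of its k nearest training instances."""
    # Euclidean distance from x to every stored training instance
    dists = np.linalg.norm(X_train - x, axis=1)
    nearest = np.argsort(dists)[:k]                 # indices of the k closest instances
    labels, counts = np.unique(y_train[nearest], return_counts=True)
    return labels[np.argmax(counts)]                # majority class among the neighbors
```

Because every prediction scans the whole training set, the cost of storing and searching the data is what motivates the scaling question raised on the next slides.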

  15. Instance-based classification What if there is a large amount of data? 15/46

  16. Instance-based classification What if there is a huge amount of data? 16/46

  17. Instance-based classification What if there is a serious amount of data? 17/46

  18. Training set selection (TSS)

  19. Training set selection (TSS) · Instead of maintaining all of the training data · Keep only the necessary data points 19/46

  20. Edited Nearest Neighbors (ENN) Formulation: · An instance is removed from the training data if it does not agree with the majority of its k nearest neighbors Effect: · Makes decision boundaries smoother · Doesn't remove much data 20/46

  21. Edited Nearest Neighbors (ENN) 21/46
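
A compact sketch of ENN as formulated on slide 20, assuming NumPy feature arrays; the leave-one-out handling and the 3-NN default are implementation details the slides do not spell out.

```python
import numpy as np

def enn_filter(X, y, k=3):
    """Edited Nearest Neighbors: drop each instance that disagrees with the
    majority label of its k nearest neighbors (leave-one-out)."""
    keep = np.ones(len(X), dtype=bool)
    for i in range(len(X)):
        dists = np.linalg.norm(X - X[i], axis=1)
        dists[i] = np.inf                           # exclude the instance itself
        neighbors = np.argsort(dists)[:k]
        labels, counts = np.unique(y[neighbors], return_counts=True)
        if labels[np.argmax(counts)] != y[i]:       # majority vote disagrees
            keep[i] = False
    return X[keep], y[keep]
```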

  22. DROP3 Formulation:
      DROP3(Training set TR) returns selection set S:
          Let S = TR after applying ENN.
          For each instance Xi in S:
              Find the k+1 nearest neighbors of Xi in S.
              Add Xi to each of its neighbors' lists of associates.
          For each instance Xi in S:
              Let with = # of associates of Xi classified correctly with Xi as a neighbor.
              Let without = # of associates of Xi classified correctly without Xi.
              If without ≥ with:
                  Remove Xi from S.
                  For each associate a of Xi:
                      Remove Xi from a's list of neighbors.
                      Find a new nearest neighbor for a.
                      Add a to its new neighbor's list of associates.
              Endif
          Return S. 22/46

  23. DROP3 Formulation: · Iterative procedure that removes an instance when its associates are classified at least as accurately without it as with it Effect: · Removes much more data than ENN · Maintains acceptable accuracy 23/46

  24. DROP3 24/46
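
The heart of DROP3 is the with/without comparison in the pseudocode on slide 22. The sketch below isolates just that test, leaving out the full associate and neighbor-list bookkeeping; the index handling and the 3-NN vote are my assumptions.

```python
import numpy as np

def classified_correctly(X, y, idx, pool_idx, k=3):
    """Count how many instances in `idx` a k-NN vote over `pool_idx` gets right."""
    correct = 0
    for i in idx:
        pool = pool_idx[pool_idx != i]              # an instance never votes for itself
        dists = np.linalg.norm(X[pool] - X[i], axis=1)
        nbrs = pool[np.argsort(dists)[:k]]
        labels, counts = np.unique(y[nbrs], return_counts=True)
        correct += labels[np.argmax(counts)] == y[i]
    return correct

def should_remove(X, y, xi, associates, S, k=3):
    """DROP3 test: remove xi if its associates are classified at least as
    well without xi in the selection set S as with it."""
    with_xi = classified_correctly(X, y, associates, S, k)
    without_xi = classified_correctly(X, y, associates, S[S != xi], k)
    return without_xi >= with_xi
```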

  25. Genetic algorithm (CHC) Formulation: · A chromosome is a subset of the training data · A binary gene represents each instance · Fitness = α × Accuracy + (1 − α) × Reduction Effectiveness: · Removes a large amount of data · Achieves acceptable accuracy 25/46
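
A sketch of the CHC fitness evaluation from this slide. The choice of α = 0.5, the leave-one-out k-NN accuracy estimate, and the boolean-mask encoding of the chromosome are assumptions; the slide gives only the formula.

```python
import numpy as np

def fitness(chromosome, X, y, alpha=0.5, k=3):
    """Fitness = alpha * Accuracy + (1 - alpha) * Reduction.
    `chromosome` is a boolean mask over the training data: True keeps the instance."""
    selected = np.flatnonzero(chromosome)
    if selected.size == 0:
        return 0.0
    # Accuracy: classify every training instance with a k-NN vote over the
    # selected subset only (leave the query instance out of its own vote).
    correct = 0
    for i in range(len(X)):
        pool = selected[selected != i]
        if pool.size == 0:
            continue
        dists = np.linalg.norm(X[pool] - X[i], axis=1)
        nbrs = pool[np.argsort(dists)[:k]]
        labels, counts = np.unique(y[nbrs], return_counts=True)
        correct += labels[np.argmax(counts)] == y[i]
    accuracy = correct / len(X)
    reduction = 1.0 - selected.size / len(X)        # fraction of instances removed
    return alpha * accuracy + (1 - alpha) * reduction
```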

  26. Genetic algorithm (CHC) 26/46

  27. Scaling

  28. Scaling · As datasets grow, TSS becomes more and more expensive · May be prohibitive · Most scaling methods rely on a stratified approach 28/46

  29. No scaling 29/46

  30. Stratified scaling 30/46
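
The stratified baseline is shown only as a figure, so the sketch below follows the usual recipe from the instance-selection literature: split the data into class-stratified partitions, run the TSS method on each partition independently, and pool the selected instances. Treat it as context rather than the authors' exact procedure; the partition count and random seed are placeholders.

```python
import numpy as np

def stratified_tss(X, y, tss, n_strata=5, seed=0):
    """Run a training-set-selection method `tss` (e.g. enn_filter) on each
    class-stratified partition and pool the selected instances."""
    rng = np.random.default_rng(seed)
    strata = [[] for _ in range(n_strata)]
    for label in np.unique(y):                      # keep class proportions per stratum
        idx = rng.permutation(np.flatnonzero(y == label))
        for j, chunk in enumerate(np.array_split(idx, n_strata)):
            strata[j].extend(chunk)
    selected = []
    for part in strata:
        part = np.array(part)
        Xs, ys = tss(X[part], y[part])              # select within the stratum only
        selected.append((Xs, ys))
    X_sel = np.vstack([Xs for Xs, _ in selected])
    y_sel = np.concatenate([ys for _, ys in selected])
    return X_sel, y_sel
```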

  31. Representative Data Detection (ReDD) · Lin et al. 2015 · Developed for support vector machines; did not consider data reduction 31/46

  32. Our approach

  33. Classifier inspired approach · Based heavily on ReDD · Applied to kNN, and monitors data reduction 33/46

  34. The filter The "Balance" dataset · Determine the scale position - Balanced - Leaning right - Leaning left · Attributes - Left weight - Left distance - Right weight - Right distance 34/46

  35. The filter 35/46

  36. The filter 36/46

  37. The filter 37/46

  38. Experimentation Parameters: · Learn a Random Forest for the filter · Split data into 1/3 and 2/3 partitions Design: · Perform TSS with ENN, CHC, and DROP3 using 3-NN · Compare no scaling, stratified, and classifier inspired scaling · Calculate reduction, accuracy, and computation time with 10-fold CV 38/46
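
Putting the pieces on this slide together, here is one way the classifier-inspired (ReDD-style) pipeline could look: run the expensive TSS method on a 1/3 partition, train a Random Forest "filter" on the resulting keep/discard decisions, then apply the filter to the remaining 2/3 instead of running TSS on it. The keep/discard labeling step and the final composition of the selected set are my reading of the approach, not details given in the deck.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

def classifier_inspired_tss(X, y, tss, seed=0):
    """Classifier-inspired scaling: TSS on 1/3 of the data, a learned filter
    predicts keep/discard for the other 2/3."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(X))
    small, rest = idx[: len(X) // 3], idx[len(X) // 3 :]

    # 1) Expensive selection only on the small partition.
    X_small, y_small = X[small], y[small]
    X_kept, y_kept = tss(X_small, y_small)

    # 2) Label each small-partition instance as kept (1) or discarded (0).
    kept_mask = np.array([any(np.array_equal(x, xk) for xk in X_kept) for x in X_small])

    # 3) Train the filter on (features -> keep/discard) and apply it to the rest.
    rf = RandomForestClassifier(n_estimators=100, random_state=seed)
    rf.fit(X_small, kept_mask.astype(int))
    predicted_keep = rf.predict(X[rest]).astype(bool)

    X_sel = np.vstack([X_kept, X[rest][predicted_keep]])
    y_sel = np.concatenate([y_kept, y[rest][predicted_keep]])
    return X_sel, y_sel
```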

  39. Datasets · 10 experimental datasets from KEEL 39/46

  40. Reduction 40/46

  41. Accuracy 41/46

  42. Time 42/46

  43. Results · Maintains accuracy (mostly) · Maintains data reduction · Slower than the stratified approach, but may improve for larger datasets 43/46

  44. Future work · Repeat for many more datasets · Apply to very large datasets · Investigate whether damage can be spotted a priori 44/46

  45. Conclusion Promising candidate for scaling Training Set Selection to large datasets 45/46

  46. Questions Walter Bennette walter.bennette.1@us.af.mil 315-330-4957 46/46
