Dimensionality reduction
AI FUNDAMENTALS
Nemanja Radojkovic, Senior Data Scientist

Definition
"Dimensionality reduction is the process of reducing the number of variables under consideration by obtaining a set of principal variables."

Why?
Pros:
- Reduce overfitting
- Obtain independent features
- Lower computational intensity
- Enable visualization
Cons:
- Compression => loss of information => loss of performance

Types
Feature selection (B ⊂ A):
- Selecting a subset of existing features, based on predictive power.
- Non-trivial problem: looking for the best "team of features", not the individually best features!
Feature extraction (B ? A):
- Transforming and combining existing features into new ones.
- Linear or non-linear projections.
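The difference between the two types can be sketched in a few lines. This is a minimal illustration, assuming scikit-learn's bundled iris dataset; `SelectKBest` with `f_classif` is just one possible selector (it scores features one by one, so it does not search for the best "team of features"), and PCA stands in for feature extraction.

```python
import numpy as np
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA
from sklearn.feature_selection import SelectKBest, f_classif

X, y = load_iris(return_X_y=True)  # 150 samples, 4 original features

# Feature selection: keep 2 of the original columns, unchanged.
selector = SelectKBest(score_func=f_classif, k=2)
X_selected = selector.fit_transform(X, y)
kept_columns = selector.get_support(indices=True)

# Feature extraction: build 2 new columns as combinations of all 4.
X_extracted = PCA(n_components=2).fit_transform(X)

print("Kept original columns:", kept_columns)
print("Shapes:", X_selected.shape, X_extracted.shape)
```

The selected matrix is a literal subset of the original columns, while the extracted matrix contains new values that appear nowhere in the input.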

Common algorithms
Linear (faster, deterministic):
- Principal Component Analysis (PCA): from sklearn.decomposition import PCA
- Latent Dirichlet Allocation: from sklearn.decomposition import LatentDirichletAllocation
Non-linear (slower, non-deterministic):
- Isomap: from sklearn.manifold import Isomap
- t-distributed Stochastic Neighbor Embedding (t-SNE): from sklearn.manifold import TSNE

Principal Component Analysis (PCA)
- Family: Linear methods.
- Intuition: Principal components are directions of highest variability in data.
- Reduction = keeping only the top N principal components.
- Assumption: Normal distribution of data.
- Caveat: Very sensitive to outliers.
Code example:
    from sklearn.decomposition import PCA
    pca = PCA(n_components=3)
    X_reduced = pca.fit_transform(X)
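To see how much variability the kept components carry, one can inspect `explained_variance_ratio_` after fitting. The sketch below assumes synthetic data (the array `X` and its per-feature scales are made up for illustration) in which most of the variance sits in the first two features.

```python
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
# Hypothetical data: 200 samples, 5 features with very different spreads.
X = rng.normal(size=(200, 5)) * np.array([5.0, 3.0, 1.0, 0.5, 0.1])

pca = PCA(n_components=3)          # keep only the top 3 principal components
X_reduced = pca.fit_transform(X)

print(X_reduced.shape)                 # rows are kept, columns drop from 5 to 3
print(pca.explained_variance_ratio_)   # share of variance per kept component
```

A ratio that is already close to 1 after a few components suggests the dropped dimensions carried little information.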

Use it wisely!

Clustering

What is clustering?
- Cluster = group of entities or events sharing similar attributes.
- Clustering (AI) = the process of applying machine learning algorithms for automatic discovery of clusters.

Popular clustering algorithms
- KMeans clustering: from sklearn.cluster import KMeans
- Spectral clustering: from sklearn.cluster import SpectralClustering
- DBSCAN: from sklearn.cluster import DBSCAN

How many clusters do I have? -> Elbow method!
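One common way to apply the elbow method is to fit KMeans for a range of cluster counts and look for the point where the within-cluster sum of squares (`inertia_`) stops dropping sharply. The sketch below assumes synthetic data with three well-separated blobs; the centers and the range of k are illustrative choices.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

# Hypothetical data: three well-separated groups of points.
X, _ = make_blobs(
    n_samples=300,
    centers=[(-5, -5), (0, 5), (5, -5)],
    cluster_std=0.6,
    random_state=0,
)

# Fit one model per candidate k and record its inertia.
inertias = {}
for k in range(1, 8):
    model = KMeans(n_clusters=k, n_init=10, random_state=0).fit(X)
    inertias[k] = model.inertia_

for k, inertia in inertias.items():
    print(k, round(inertia, 1))
```

Plotting inertia against k for this data should show a sharp bend at k = 3, after which extra clusters bring only small improvements.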

Cluster analysis and tuning
Unsupervised (no "ground truth", no expectations):
- Variance Ratio Criterion: sklearn.metrics.calinski_harabasz_score
  "What is the average distance of each point to the center of its cluster, and what is the distance between the clusters?"
- Silhouette score: sklearn.metrics.silhouette_score
  "How close is each point to its own cluster vs. how close is it to the others?"
Supervised ("ground truth"/expectations provided):
- Mutual information (MI) criterion: sklearn.metrics.mutual_info_score
- Homogeneity score: sklearn.metrics.homogeneity_score
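A short sketch of how these scores might be computed for a single KMeans result. The data is synthetic (three assumed blob centers), so a ground truth happens to be available for the supervised score; on real unlabeled data only the first two would apply.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import (
    calinski_harabasz_score,
    homogeneity_score,
    silhouette_score,
)

# Hypothetical data with known group membership in y_true.
X, y_true = make_blobs(
    n_samples=300,
    centers=[(-5, -5), (0, 5), (5, -5)],
    cluster_std=0.6,
    random_state=0,
)
labels = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(X)

# Unsupervised: need only the data and the predicted cluster labels.
silhouette = silhouette_score(X, labels)               # between -1 and 1
variance_ratio = calinski_harabasz_score(X, labels)    # higher is better

# Supervised: compares predicted labels against the ground truth.
homogeneity = homogeneity_score(y_true, labels)        # between 0 and 1

print(silhouette, variance_ratio, homogeneity)
```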

Explore, experiment and tune!

Anomaly detection

Definition and use cases
- Detecting unusual entities or events.
- Hard to define what's odd, but possible to define what's normal.
Use cases:
- Credit card fraud detection
- Network security monitoring
- Heart-rate monitoring

Approaches
- Thresholding
- Rate of change
- Shape monitoring
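The first two approaches can be sketched with plain NumPy. Everything here is assumed for illustration: the heart-rate readings, the "normal" band, and the maximum allowed step between readings. Shape monitoring is not covered by this sketch.

```python
import numpy as np

# Hypothetical heart-rate readings in beats per minute.
signal = np.array([72, 73, 71, 74, 72, 110, 112, 111, 73, 72], dtype=float)

# Thresholding: flag readings outside an assumed normal band.
LOW, HIGH = 50.0, 100.0
threshold_flags = (signal < LOW) | (signal > HIGH)

# Rate of change: flag readings that jump too far from the previous one.
MAX_STEP = 20.0
steps = np.abs(np.diff(signal))
rate_flags = np.concatenate(([False], steps > MAX_STEP))  # first reading has no predecessor

print("Out of band at:", np.flatnonzero(threshold_flags))
print("Sudden jump at:", np.flatnonzero(rate_flags))
```

Thresholding flags the whole elevated stretch, while the rate-of-change check flags only the moments where the signal jumps up and back down.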

Algorithms
- Robust covariance (assumes normal distribution): from sklearn.covariance import EllipticEnvelope
- Isolation Forest (powerful, but more computationally demanding): from sklearn.ensemble import IsolationForest
- One-Class SVM (sensitive to outliers, many false negatives): from sklearn.svm import OneClassSVM

Training and testing
Example: Isolation Forest
    from sklearn.ensemble import IsolationForest
    algorithm = IsolationForest()
    # Fit the model
    algorithm.fit(X)
    # Apply the model and detect the outliers (1 = inlier, -1 = outlier)
    results = algorithm.predict(X)
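A self-contained version of this example, as a sketch under assumptions: the data is synthetic (200 points around the origin plus three hand-placed far-away points), and the `contamination` value is an illustrative guess at the expected share of outliers.

```python
import numpy as np
from sklearn.ensemble import IsolationForest

rng = np.random.default_rng(0)
X_normal = rng.normal(loc=0.0, scale=1.0, size=(200, 2))       # typical points
X_odd = np.array([[8.0, 8.0], [-9.0, 7.0], [10.0, -10.0]])     # planted anomalies
X = np.vstack([X_normal, X_odd])                               # anomalies are rows 200-202

algorithm = IsolationForest(contamination=0.02, random_state=0)
algorithm.fit(X)

results = algorithm.predict(X)               # 1 = inlier, -1 = outlier
outlier_rows = np.flatnonzero(results == -1)
print(outlier_rows)
```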

Evaluation
Example: Arrhythmia detection
    from sklearn.metrics import (confusion_matrix, precision_score, recall_score)
    confusion_matrix(y_true, y_predicted)
- Precision = How many of the anomalies I have detected are TRUE anomalies?
- Recall = How many of the TRUE anomalies have I managed to detect?
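A small worked example of these metrics, using made-up labels (1 = anomaly, 0 = normal). Note that detectors which return -1/1, such as scikit-learn's outlier estimators, would need their output mapped to this 0/1 convention first.

```python
from sklearn.metrics import confusion_matrix, precision_score, recall_score

# Hypothetical labels for ten heartbeats: 1 = arrhythmia, 0 = normal.
y_true      = [0, 0, 0, 0, 0, 0, 1, 1, 1, 1]
y_predicted = [0, 0, 0, 0, 1, 1, 1, 1, 1, 0]

# For binary labels, ravel() yields the counts in the order tn, fp, fn, tp.
tn, fp, fn, tp = confusion_matrix(y_true, y_predicted).ravel()

precision = precision_score(y_true, y_predicted)  # tp / (tp + fp) = 3 / 5
recall = recall_score(y_true, y_predicted)        # tp / (tp + fn) = 3 / 4

print(tn, fp, fn, tp, precision, recall)
```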

Want to learn more?

Selecting the right model

Model-to-problem fit
Type of learning:
- Target variable defined and known? => Supervised: Classification? Regression?
- No target variable, exploration? => Unsupervised: Dimensionality reduction? Clustering? Anomaly detection?

Defining the priorities
Interpretable models:
- Linear models (linear, logistic, Lasso and Ridge regression)
- Decision Trees
Well performing models:
- Tree ensembles (Random Forests, Gradient Boosted Trees)
- Support Vector Machines
- Artificial Neural Networks
Simplicity first!

Using multiple metrics
Satisfying metrics:
- Cut-off criteria that every candidate model needs to meet.
- Multiple satisfying metrics possible (e.g. minimum accuracy, maximum execution time, etc.)
Optimizing metrics:
- Illustrates the ultimate business priority (e.g. "minimize false positives", "maximize recall")
- "There can be only one"
Final model:
- Passes the bar on all satisfying metrics and has the best score on the optimizing metric.
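The selection rule can be sketched in plain Python. The candidate names, their scores, and the cut-off values below are all hypothetical; recall is assumed to be the single optimizing metric.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    name: str
    accuracy: float     # satisfying metric: must reach a minimum
    latency_ms: float   # satisfying metric: must stay under a maximum
    recall: float       # optimizing metric: the higher the better


# Hypothetical evaluation results for four candidate models.
candidates = [
    Candidate("logistic_regression", 0.86, 2.0, 0.71),
    Candidate("random_forest", 0.91, 15.0, 0.80),
    Candidate("neural_network", 0.93, 120.0, 0.88),
    Candidate("decision_tree", 0.78, 1.0, 0.75),
]

MIN_ACCURACY = 0.85
MAX_LATENCY_MS = 50.0


def passes_bar(c: Candidate) -> bool:
    """A candidate must meet every cut-off criterion."""
    return c.accuracy >= MIN_ACCURACY and c.latency_ms <= MAX_LATENCY_MS


eligible = [c for c in candidates if passes_bar(c)]
final_model = max(eligible, key=lambda c: c.recall)
print(final_model.name)
```

In this made-up example the neural network has the best recall but is excluded for being too slow, so the choice falls on the best of the remaining candidates.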

Interpretation
Global: "What are the general decision-making rules of this model?"
Common approaches:
- Decision tree visualization
- Feature importance plot
Local: "Why was this specific example classified in this way?"
- LIME algorithm (Local Interpretable Model-Agnostic Explanations)
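As one possible starting point for a feature importance plot, tree ensembles in scikit-learn expose `feature_importances_` after fitting. The sketch below assumes the bundled iris dataset and a random forest; it prints a ranking rather than drawing the plot, and it does not cover local explanations such as LIME.

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier

data = load_iris()
model = RandomForestClassifier(n_estimators=100, random_state=0)
model.fit(data.data, data.target)

# Pair each feature name with its importance and sort, most important first.
ranked = sorted(
    zip(data.feature_names, model.feature_importances_),
    key=lambda pair: pair[1],
    reverse=True,
)
for name, importance in ranked:
    print(f"{name}: {importance:.3f}")
```

The importances sum to 1, so each value can be read as a feature's share of the model's overall reliance on the inputs.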

Model selection and interpretation
