Manifold Learning to Detect Changes in Networks
Kenneth Heafield
Richard and Dena Krown SURF Fellow
Mentor: Steven Low
Problem
➲ Monitor systems and watch for changes
➲ Unsupervised
● Computer must be able to learn patterns
● Automatically determine if a deviation is significant
➲ Fast
● Test for anomalies as data comes in
● Incorporate new data into the model
➲ Non-linear
● Algorithm needs to work in many environments
Applications to Networking
➲ Monitor network packets and streams
● Collect header information, particularly port numbers
➲ Security
● Detect worms by large, structural changes
● Detect viruses by small numbers of deviations from the fit
➲ Optimization
● Automatically learn traffic patterns and react to them
● Anticipate traffic
Outline
➲ How to phrase the problem mathematically
➲ Linear regression in multiple dimensions with Principal Component Analysis (PCA)
➲ Extending PCA to estimate errors in principal components
● How to use the errors
➲ Kernel PCA adds non-linearity
➲ Future
● Implementation
Thinking Geometrically
➲ Each packet is a data point with coordinates equal to its information (see the sketch below)
➲ Fit a manifold to find patterns
● Compare with previous fits by storing manifold parameters
● Structure of the manifold can tell us about underlying processes
➲ Distance from the manifold indicates deviation
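An illustrative sketch of the first bullet above: one way a packet might become a data point whose coordinates are its header information. The specific fields (ports, length, protocol) are hypothetical placeholders, not taken from the original slides.

```python
# Hypothetical example of mapping a packet's header fields to a coordinate
# vector; the chosen fields are illustrative, not from the original slides.
import numpy as np

def packet_to_point(src_port, dst_port, length, protocol):
    """Turn selected header fields into a data point for manifold fitting."""
    return np.array([src_port, dst_port, length, protocol], dtype=float)
```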
Principal Component Analysis
➲ Choose directions of greatest variance (see the sketch below)
● These are the eigenvectors of the covariance matrix
● Called principal components
➲ Widespread use in science
➲ Linear
● Many non-linear extensions; we will focus on kernel PCA later
● Equivalent to least squares
➲ Jolliffe 2002
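A minimal sketch of PCA as described on this slide: the principal components are the eigenvectors of the covariance matrix, ordered by the variance they explain. Function and variable names are illustrative, not the project's implementation.

```python
# Minimal PCA sketch: principal components are the eigenvectors of the
# covariance matrix, sorted by decreasing eigenvalue (variance).
import numpy as np

def principal_components(X, m):
    """Return the top-m principal components of X (rows are data points)."""
    Xc = X - X.mean(axis=0)                  # center the data
    cov = np.cov(Xc, rowvar=False)           # covariance matrix
    eigvals, eigvecs = np.linalg.eigh(cov)   # symmetric eigendecomposition
    order = np.argsort(eigvals)[::-1]        # largest variance first
    return eigvecs[:, order[:m]].T           # components C_1..C_m as rows
```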
Error Finding
➲ Goal: find errors in the principal components
● Assume an uncorrelated, multivariate normal distribution
➲ Find out how much each component contributes to estimating each point
➲ Get the error of the estimate in terms of the (unknown) errors in the components
● Use the residual to approximate the error
➲ Out pops a regression problem, which we can solve
Finding the Nearest Point
➲ Principal Component Analysis defines a subspace
● Example: linear regression finds a one-dimensional subspace of the two-dimensional input
● Components are orthonormal
➲ Project the data point into the subspace (see the sketch below)
● Data point $X_i$
● Components $C_k$, $k = 1, \dots, m$
● Nearest point $N_i = \sum_{k=1}^{m} (X_i \cdot C_k)\, C_k$
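A short sketch of the projection above, assuming the components are stored as orthonormal rows of a matrix C; names are illustrative.

```python
# Project each data point onto the PCA subspace:
# N_i = sum_k (X_i . C_k) C_k, with orthonormal components C_k as rows of C.
import numpy as np

def nearest_point(X, C):
    """Return the nearest point N_i in the subspace for every data point X_i."""
    return (X @ C.T) @ C
```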
Error in Nearest Point
➲ $N_i$ is the closest point to the data point $X_i$
● Residual is $X_i - N_i$
➲ What is the error in this estimate?
● Predictor variance $\sigma_i^2$ for $N_i$
● Component variance $\sigma_k^2$ for $C_k$
● Symmetric about the component, spread evenly in the $p - 1$ possible dimensions
● Propagate the error: $\sigma_i^2 = \frac{1}{p - 1} \sum_{k=1}^{m} \sigma_k^2 \left( X_i \cdot X_i - 2\, X_i \cdot N_i + p\, (X_i \cdot C_k)^2 \right)$
Idea: Regression Problem
➲ Use the squared residual length $\|X_i - N_i\|^2$
● This should, on average, equal the predictor variance $\sigma_i^2$
➲ Goal: find each $\sigma_k^2$
● This is a linear regression problem (see the sketch below): $\|X_i - N_i\|^2 \approx \frac{1}{p - 1} \sum_{k=1}^{m} \sigma_k^2 \left( X_i \cdot X_i - 2\, X_i \cdot N_i + p\, (X_i \cdot C_k)^2 \right)$
● Subject to constraints: to be a variance, $0 \le \sigma_k^2 \le 1$
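A hedged sketch of the constrained regression described above. The design matrix follows the propagated-error expression on the previous slide as reconstructed here, and the helper names and the use of SciPy's bounded least-squares solver are assumptions, not the original implementation.

```python
# Regress squared residual lengths on per-component terms to estimate the
# component variances sigma_k^2, constrained to lie in [0, 1].
import numpy as np
from scipy.optimize import lsq_linear

def component_variances(X, C):
    """Estimate sigma_k^2 for each component C_k (rows of C) from data X."""
    p = X.shape[1]                                   # ambient dimension
    N = (X @ C.T) @ C                                # nearest points N_i
    resid2 = np.sum((X - N) ** 2, axis=1)            # squared residual lengths
    common = np.sum(X * X, axis=1) - 2 * np.sum(X * N, axis=1)
    A = (common[:, None] + p * (X @ C.T) ** 2) / (p - 1)
    fit = lsq_linear(A, resid2, bounds=(0.0, 1.0))   # 0 <= sigma_k^2 <= 1
    return fit.x                                     # estimated sigma_k^2
```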
What All That Math Just Meant
➲ We did linear regression in multiple dimensions
➲ Found the point closest to each data point
➲ The residuals estimate the error present
➲ The error is allocated to the contributing components
Using the Errors
➲ Recall the assumptions about the error
➲ Compare time slices to find structural changes
● Match up components, then test for similarity
➲ Measure distances to anomalous points
● We can find the standard deviation at any point on the manifold
● Compare the residual to the standard deviation and test (see the sketch below)
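A minimal sketch of the anomaly test described above: compare each residual to the standard deviation predicted by the estimated component variances. The 3-standard-deviation threshold and all names are illustrative assumptions.

```python
# Flag points whose residual is large relative to the predicted standard
# deviation at that point on the manifold (threshold is an assumption).
import numpy as np

def flag_anomalies(X, C, sigma_k2, threshold=3.0):
    """Return a boolean mask of data points that deviate significantly."""
    p = X.shape[1]
    N = (X @ C.T) @ C                                # nearest points
    resid = np.linalg.norm(X - N, axis=1)            # residual lengths
    common = np.sum(X * X, axis=1) - 2 * np.sum(X * N, axis=1)
    predicted_var = ((common[:, None] + p * (X @ C.T) ** 2) / (p - 1)) @ sigma_k2
    predicted_std = np.sqrt(np.maximum(predicted_var, 1e-12))  # guard tiny values
    return resid > threshold * predicted_std
```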
Kernel Principal Component Analysis
➲ Non-linear manifold fitting algorithm
➲ Conceptually uses Principal Component Analysis (PCA) as a subroutine
● Non-linearly maps data points (linearizes them) into an abstract feature space
● Performs PCA in the feature space
➲ Errors
● Error computation is conceptually the same
➲ Schölkopf et al. 1996
Kernels
➲ Feature space can be high- or even infinite-dimensional
● Avoid computing in feature space
➲ Map two points into feature space and compute their dot product simultaneously
● Kernel function takes two data points and computes their dot product in feature space
● Non-data points are expressed as linear combinations
● Example: polynomials of degree $d$: $k(x, y) = (x \cdot y + 1)^d$ (see the sketch below)
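A rough sketch of kernel PCA with the polynomial kernel from this slide, $k(x, y) = (x \cdot y + 1)^d$. It assumes the standard formulation (centered kernel matrix, eigendecomposition, normalized coefficients); names and the default degree are illustrative.

```python
# Kernel PCA sketch using the polynomial kernel k(x, y) = (x . y + 1)^d.
import numpy as np

def polynomial_kernel(X, Y, d=2):
    """Dot products in feature space, computed without ever mapping into it."""
    return (X @ Y.T + 1.0) ** d

def kernel_pca(X, m, d=2):
    """Project the data points onto the top-m kernel principal components."""
    n = X.shape[0]
    K = polynomial_kernel(X, X, d)
    one = np.full((n, n), 1.0 / n)
    Kc = K - one @ K - K @ one + one @ K @ one       # center in feature space
    eigvals, eigvecs = np.linalg.eigh(Kc)
    order = np.argsort(eigvals)[::-1][:m]            # largest eigenvalues first
    alphas = eigvecs[:, order] / np.sqrt(np.maximum(eigvals[order], 1e-12))
    return Kc @ alphas                               # n x m projections
```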
Future
➲ Implementation
● Working kernel PCA implementation
● Hungarian algorithm for matching components (see the sketch below)
● Use a constrained least-squares regression algorithm
➲ Use
● Time slice incoming network data
● Compare fits between slices
● Classify regions of the manifold as potential problems
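A hedged sketch of the planned component matching with the Hungarian algorithm: components from two time slices are paired by absolute cosine similarity. The similarity measure is an assumption (a component's sign is arbitrary), and names are illustrative.

```python
# Match components between time slices with the Hungarian algorithm
# (SciPy's linear_sum_assignment); cost is negative absolute similarity.
import numpy as np
from scipy.optimize import linear_sum_assignment

def match_components(C_old, C_new):
    """Pair each old component with the most similar new component."""
    cost = -np.abs(C_old @ C_new.T)          # more similar = lower cost
    rows, cols = linear_sum_assignment(cost)
    return list(zip(rows, cols))             # matched (old, new) index pairs
```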
Summary
➲ Problem arising from computer networks
➲ Application of Principal Component Analysis (PCA)
➲ Extensions to PCA
● Accounting for and using error
● Kernel PCA
➲ Future of the project
Acknowledgements
➲ Richard and Dena Krown SURF Fellow
➲ SURF Office