Non-Local Manifold Parzen Windows
Yoshua Bengio, Hugo Larochelle and Pascal Vincent
Département d'informatique et de recherche opérationnelle, Université de Montréal
July 15th, 2005
Plan
1. Introduction
2. Local vs Non-Local learning
3. Experiments and Results
4. Conclusion
1. Introduction
About this talk...
What: density estimation of high-dimensional continuous data lying on a lower-dimensional manifold
How: using the Manifold Parzen Windows model, and learning the model's parameters with a neural network
Why:
... because my supervisor wants me to work on that
... to publish papers
... but mostly to use non-local learning and make a point about it
Manifold Parzen Windows (Vincent and Bengio, 2003)
Extension of the Parzen Windows model (a mixture of spherical Gaussians centered on the training points)
The Gaussians are parametrized so that most of the density is situated on the underlying manifold
Fig.: Parzen Windows vs Manifold Parzen Windows
Manifold Parzen Windows (Vincent and Bengio, 2003)
Density estimator:
p(x) = \frac{1}{n} \sum_{t=1}^{n} N(x; \mu(x_t), \Sigma(x_t))
Parametrization:
\Sigma(x_t) = \sigma_{\text{noise}}^2(x_t) I + \sum_{j=1}^{d} s_j(x_t) v_j(x_t) v_j(x_t)'
Training:
\mu(x_t) = x_t is fixed for each x_t
s_j(x_t) and v_j(x_t) are the principal eigenvalues and eigenvectors of the covariance matrix of the k nearest neighbors of x_t
\sigma_{\text{noise}}(x_t) is a hyper-parameter
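To make this recipe concrete, here is a minimal NumPy sketch of Manifold Parzen Windows: for each training point it takes the covariance of its k nearest neighbors (centered at the point itself, one common choice), keeps the top d eigenvalue/eigenvector pairs, and evaluates the resulting mixture of Gaussians. The function names and the default values of k, d and sigma_noise are illustrative assumptions, not taken from the original implementation.

    import numpy as np

    def fit_manifold_parzen(X, k=10, d=1, sigma_noise=0.1):
        """For each training point x_t, build Sigma(x_t) from the covariance
        of its k nearest neighbors (illustrative sketch, not the paper's code)."""
        n, D = X.shape
        mus, covs = [], []
        for t in range(n):
            dists = np.sum((X - X[t]) ** 2, axis=1)
            nn = X[np.argsort(dists)[1:k + 1]]        # k nearest neighbors, excluding x_t
            diffs = nn - X[t]
            C = diffs.T @ diffs / k                   # local covariance centered at x_t
            eigval, eigvec = np.linalg.eigh(C)        # eigenvalues in ascending order
            s, V = eigval[-d:], eigvec[:, -d:]        # principal eigenvalues / eigenvectors
            Sigma = sigma_noise ** 2 * np.eye(D) + (V * s) @ V.T
            mus.append(X[t])
            covs.append(Sigma)
        return np.array(mus), np.array(covs)

    def log_density(x, mus, covs):
        """log p(x) = log (1/n) sum_t N(x; mu_t, Sigma_t)."""
        logps = []
        for mu, Sigma in zip(mus, covs):
            diff = x - mu
            _, logdet = np.linalg.slogdet(Sigma)
            quad = diff @ np.linalg.solve(Sigma, diff)
            logps.append(-0.5 * (len(mu) * np.log(2 * np.pi) + logdet + quad))
        return np.logaddexp.reduce(logps) - np.log(len(mus))

Note that everything the model knows about the manifold near x_t comes from the k neighbors of x_t; this is exactly the locality that the non-local variant below tries to remove.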
Non-Local Manifold Parzen Windows
In Manifold Parzen Windows, \mu(x_t), \sigma_{\text{noise}}(x_t), s_j(x_t) and v_j(x_t) are stored in memory for every training point x_t
In Non-Local Manifold Parzen Windows, \mu(x_t), \sigma_{\text{noise}}(x_t), s_j(x_t) and v_j(x_t) are functions of x_t, modeled by a neural network
The neural network can capture global information about the underlying manifold, and share it among all training points
The neural network is trained using stochastic gradient descent on the average negative log-likelihood of the training set
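A hedged sketch of how the non-local variant could be set up (in PyTorch here, purely as an illustration): a small network maps x_t to \mu(x_t), \sigma_{\text{noise}}^2(x_t), the s_j(x_t) and unit-norm directions v_j(x_t), and the weights are trained by gradient descent on the average negative log-likelihood. The architecture, the exponential parametrization of the variances and all hyper-parameters are assumptions, not the exact choices made in the paper.

    import torch
    import torch.nn as nn

    class NonLocalMP(nn.Module):
        """Neural network predicting the Gaussian parameters for any point x_t."""
        def __init__(self, D, d=1, hidden=64):
            super().__init__()
            self.D, self.d = D, d
            self.trunk = nn.Sequential(nn.Linear(D, hidden), nn.Tanh())
            self.mu_head = nn.Linear(hidden, D)        # mu(x_t), as an offset from x_t
            self.noise_head = nn.Linear(hidden, 1)     # log sigma_noise^2(x_t)
            self.s_head = nn.Linear(hidden, d)         # log s_j(x_t)
            self.v_head = nn.Linear(hidden, d * D)     # directions v_j(x_t)

        def forward(self, xt):
            h = self.trunk(xt)
            mu = xt + self.mu_head(h)
            sigma2 = torch.exp(self.noise_head(h))                 # > 0
            s = torch.exp(self.s_head(h))                          # > 0
            V = self.v_head(h).view(-1, self.d, self.D)
            V = V / V.norm(dim=-1, keepdim=True)                   # unit-norm v_j(x_t)
            return mu, sigma2, s, V

    def neg_log_likelihood(model, x, centers):
        """Average -log p(x) under the mixture of Gaussians centered on `centers`."""
        mu, sigma2, s, V = model(centers)
        n, D = centers.shape
        Sigma = sigma2[..., None] * torch.eye(D) \
                + torch.einsum('bj,bjm,bjn->bmn', s, V, V)         # sigma2*I + sum_j s_j v_j v_j'
        comp = torch.distributions.MultivariateNormal(mu, covariance_matrix=Sigma)
        logp = torch.logsumexp(comp.log_prob(x[:, None, :].expand(-1, n, -1)), dim=1) \
               - torch.log(torch.tensor(float(n)))
        return -logp.mean()

    def train(X, d=1, epochs=100, lr=0.01):
        """Gradient descent on the average negative log-likelihood of the training set.
        Full-batch updates shown for brevity; the slides describe stochastic
        (per-example or mini-batch) updates."""
        X = torch.as_tensor(X, dtype=torch.float32)
        model = NonLocalMP(X.shape[1], d=d)
        opt = torch.optim.SGD(model.parameters(), lr=lr)
        for _ in range(epochs):
            opt.zero_grad()
            loss = neg_log_likelihood(model, X, X)     # training points as mixture centers
            loss.backward()
            opt.step()
        return model

Because a single set of network weights produces the parameters for every x_t, what is learned in one region of the manifold transfers to other regions, which is the non-local sharing the slide refers to. For d > 1 the predicted directions would also need to be orthogonalized to keep the eigen-decomposition interpretation.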
2. Local vs Non-Local learning
Informal definitions
What is local learning: a learning algorithm is said to be local if it makes a prediction at x using mostly the training points near x
Examples: k nearest neighbors, SVMs, most popular dimensionality reduction algorithms, Manifold Parzen Windows
What is non-local learning: a learning algorithm is said to be non-local if it is able to use information from training points far from x to generalize at x
Toy example
We are trying to learn a density using this training set:
Fig.: Samples from a spiral distribution
Let's train a Manifold Parzen Windows model and look at the first principal direction of variance of the training-point Gaussians
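For anyone who wants to reproduce the toy setting, here is one way to generate 2D spiral samples; the spiral parametrization and noise level below are assumptions chosen only to produce data in roughly the plotted range, not the exact generator behind the figure.

    import numpy as np

    def sample_spiral(n=300, noise=0.01, seed=0):
        """Sample 2D points lying close to a 1D spiral manifold (illustrative generator)."""
        rng = np.random.default_rng(seed)
        t = rng.uniform(0.3, 1.0, size=n)              # position along the spiral
        angle = 3 * np.pi * t
        x = 0.5 * t * np.cos(angle)
        y = 0.5 * t * np.sin(angle)
        return np.stack([x, y], axis=1) + noise * rng.standard_normal((n, 2))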
Toy example
Because the training of Manifold Parzen Windows uses only local information, some of the principal directions of variance are badly estimated.
Fig.: On the left: training points. On the right: first principal direction of variance.
Real-life examples
Many large-scale, real-life problems are likely to benefit from non-local learning:
Vision: pixels at a given position in very different images share the same properties with respect to certain transformations (e.g. translation, rotation);
Natural language processing: words that are very different in some aspect usually still share many properties (e.g. two nouns, even with very different meanings, still obey the same grammatical rules)
3. Experiments and Results
Toy 2D data experiments
Fig.: The two toy 2D datasets: a sinusoidal distribution (left) and a spiral distribution (right)
Toy 2D data experiments
Results:

    Algorithm        | sinus | spiral
    Non-Local MP     | 1.144 | -1.346
    Manifold Parzen  | 1.345 | -0.914
    Gauss Mix Full   | 1.567 | -0.857
    Parzen Windows   | 1.841 | -0.487

Tab.: Average out-of-sample negative log-likelihood on two toy problems, for Non-Local Manifold Parzen, a Gaussian mixture with full covariance, Manifold Parzen and Parzen Windows. The non-local algorithm dominates all the others.
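For reference, the reported score is simply the negative log-likelihood averaged over a held-out test set (lower is better). A minimal sketch, assuming a log_density(x) function such as the one in the earlier Manifold Parzen snippet:

    import numpy as np

    def average_nll(log_density, X_test):
        """Average out-of-sample negative log-likelihood (lower is better)."""
        return -np.mean([log_density(x) for x in X_test])

    # e.g. average_nll(lambda x: log_density(x, mus, covs), X_test)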
Toy 2D data experiments
Fig.: From left to right, top to bottom: densities learned by Non-Local Manifold Parzen, a Gaussian mixture with full covariance, Manifold Parzen and Parzen Windows.