Clustering and Classification by Optimum-Path Forest Alexandre Falc - PowerPoint PPT Presentation

Clustering and Classification by Optimum-Path Forest Alexandre Falc˜ ao Institute of Computing - University of Campinas afalcao@ic.unicamp.br Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Introduction New technologies for data acquisition and storage have provided large datasets with millions (or more) of samples for statistical analysis. Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Introduction New technologies for data acquisition and storage have provided large datasets with millions (or more) of samples for statistical analysis. We need more efficient and effective pattern recognition methods for large datasets. Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Introduction New technologies for data acquisition and storage have provided large datasets with millions (or more) of samples for statistical analysis. We need more efficient and effective pattern recognition methods for large datasets. The applications are in many fields of the sciences and engineering. Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Introduction New technologies for data acquisition and storage have provided large datasets with millions (or more) of samples for statistical analysis. We need more efficient and effective pattern recognition methods for large datasets. The applications are in many fields of the sciences and engineering. Our main focus has been on image analysis. Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Introduction Each sample s (spel, image or object) of a dataset Z can be interpreted as a point of a distance space defined by a simple or composite descriptor. Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Introduction Each sample s (spel, image or object) of a dataset Z can be interpreted as a point of a distance space defined by a simple or composite descriptor. We wish to design a classifier which can assign the correct label for any sample s ∈ Z . Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Introduction Each sample s (spel, image or object) of a dataset Z can be interpreted as a point of a distance space defined by a simple or composite descriptor. We wish to design a classifier which can assign the correct label for any sample s ∈ Z . In supervised learning, a labeled set T ⊂ Z is available to train the classifier. Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Introduction Each sample s (spel, image or object) of a dataset Z can be interpreted as a point of a distance space defined by a simple or composite descriptor. We wish to design a classifier which can assign the correct label for any sample s ∈ Z . In supervised learning, a labeled set T ⊂ Z is available to train the classifier. In unsupervised learning, there is no knowledge about the labels in T . Clusters can be found and class labels may be assigned to them based on some prior knowledge. Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Introduction Some common mistakes are to assume that Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Introduction Some common mistakes are to assume that the classes/clusters form compact clouds of points in the distance space. Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Introduction Some common mistakes are to assume that the classes/clusters form compact clouds of points in the distance space. they do not overlap each other. Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Introduction Some common mistakes are to assume that the classes/clusters form compact clouds of points in the distance space. they do not overlap each other. one cluster corresponds to one class. Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Introduction Some common mistakes are to assume that the classes/clusters form compact clouds of points in the distance space. they do not overlap each other. one cluster corresponds to one class. the probability density function of the classes/clusters present known shapes for parametric modeling. Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Introduction We assume that two samples in a same cluster/class should be at least connected by a chain of nearby samples (transitive property). Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Introduction We assume that two samples in a same cluster/class should be at least connected by a chain of nearby samples (transitive property). A graph ( T , A ) is defined by an adjacency relation A between training samples using the distance space. Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Introduction We assume that two samples in a same cluster/class should be at least connected by a chain of nearby samples (transitive property). A graph ( T , A ) is defined by an adjacency relation A between training samples using the distance space. A connectivity function f ( π t ) assigns a value to any path π t from its root R ( π t ) to its terminal node t . Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Introduction We assume that two samples in a same cluster/class should be at least connected by a chain of nearby samples (transitive property). A graph ( T , A ) is defined by an adjacency relation A between training samples using the distance space. A connectivity function f ( π t ) assigns a value to any path π t from its root R ( π t ) to its terminal node t . The minimization (maximization) of the connectivity map V ( s ) = ∀ t ∈ Π( T , A , t ) { f ( π t ) } min produces an optimum-path forest rooted at nodes called prototypes. Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Introduction In supervised learning, each class is an optimum-path forest rooted at its prototypes, which propagate the class label to the remaining nodes of the forest. class A class A class B Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Introduction In unsupervised learning, each cluster is an optimum-path tree rooted at some prototype, which propagates a cluster label to the remaining nodes of the tree. cluster C cluster A cluster D cluster B Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Introduction This methodology does not assume known shapes, non-overlapping classes, or parametric models. Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Introduction This methodology does not assume known shapes, non-overlapping classes, or parametric models. Both learning approaches are fast and robust for training sets of reasonable sizes. Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Introduction This methodology does not assume known shapes, non-overlapping classes, or parametric models. Both learning approaches are fast and robust for training sets of reasonable sizes. Label propagation to new samples t ∈ Z\T is efficiently performed based on a local processing of the forest’s attributes and distances between nodes s ∈ T and t . Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Organization of this lecture Supervised classification by OPF [1]. Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Organization of this lecture Supervised classification by OPF [1]. Its application to image retrieval [2]. Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Organization of this lecture Supervised classification by OPF [1]. Its application to image retrieval [2]. Clustering by OPF [3]. Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Organization of this lecture Supervised classification by OPF [1]. CSF Its application to image retrieval [2]. Clustering by OPF [3]. Its application to 3D brain tissue segmentation [4]. WM GM Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Supervised classification Dataset Consider samples from two classes of a dataset. Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Supervised classification Training Consider samples from two classes of a dataset. A training set (filled bullets) may not represent data distribution. Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Supervised classification 1NN classification Consider samples from two classes of a dataset. A training set (filled bullets) may not represent data distribution. Classification by nearest neighbor fails, when training samples are close to test samples (empty bullets) from other classes. Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Supervised learning OPF training We can create an optimum-path forest, where V ( s ) is penalized when s is not closely connected to its class. s Alexandre Falc˜ ao MC920/MO443 - Indrodu¸ c˜ ao ao Proc. de Imagens

Clustering and Classification by Optimum-Path Forest Alexandre Falc - PowerPoint PPT Presentation

Clustering and Classification by Optimum-Path Forest Alexandre Falc ao Institute of Computing - University of Campinas afalcao@ic.unicamp.br Alexandre Falc ao MC920/MO443 - Indrodu c ao ao Proc. de Imagens Introduction New

Graph Clustering Graph Clustering What is clustering? What is clustering? Finding patterns

Subspace Clustering Ensemble Clustering Subspace Clustering, Ensemble Clustering, Alternative

Evolutionary Clustering Presenter: Lei Tang Evolutionary Clustering Evolutionary Clustering

Clustering A Categorization of Major Clustering Methods Partitioning Methods

U.S. Forest Service Forest Service U.S. Forest Inventory and Analysis Forest Service Research

Clustering kMeans, Expectation Maximization, Self-Organizing Maps Outline K-means

Chino Basin Optimum Chino Basin Optimum Basin Management Basin Management Program Program

OPTIMUM OPTIMUM ADAPTIVE ALGORITHMS ADAPTIVE ALGORITHMS for for SYSTEM IDENTIFICATION SYSTEM

Clustering Hierarchical clustering and k-mean clustering Genome 373 Genomic Informatics

Clustering Hierarchical clustering, k-mean clustering Genome 559: Introduction to Statistical and

Clustering and Dimensionality Reduction Preview Clustering K -means clustering

PAC-Bayesian Analysis of Co-clustering, Graph Clustering and Pairwise Clustering Yevgeny Seldin

Introduction to Machine Learning, Clustering and EM Barnab s P czos Contents Clustering

Trust based Clustering for Group Trust based Clustering for Group Trust based Clustering for

Finding Clusters Types of Clustering Approaches: Linkage Based, e.g. Hierarchical Clustering

Cl Clustering t i A Categorization of Major Clustering Methods Partitioning Methods

Data Mining Learning from Large Data Sets Lecture 8

Lecture 14: Inference in Dirichlet Processes (Blei & Jordan, Variational inference for

Background Poisson or Binomial data with the following properties GLM with clustered data A

Advanced Reconstruction Algorithms for the CMS High Granularity Calorimeter Kevin Pedro (FNAL)

Big Data Era 1 1 https://vimeo.com/102998774 The big problem: Scalability Visualization

New Developments In The Theory Of Clustering thats all very well in practice, but does it work

Towards a Statistical Theory of Clustering Ulrike von Luxburg, Shai Ben-David Page 1 Ulrike von

Percolation Theory Percolation Theory Jie Gao Computer Science Department Stony Brook

Clustering and Classification by Optimum-Path Forest Alexandre Falc - PowerPoint PPT Presentation

Clustering and Classification by Optimum-Path Forest Alexandre Falc ao Institute of Computing - University of Campinas afalcao@ic.unicamp.br Alexandre Falc ao MC920/MO443 - Indrodu c ao ao Proc. de Imagens Introduction New

Graph Clustering Graph Clustering What is clustering? What is clustering? Finding patterns

Subspace Clustering Ensemble Clustering Subspace Clustering, Ensemble Clustering, Alternative

Evolutionary Clustering Presenter: Lei Tang Evolutionary Clustering Evolutionary Clustering

Clustering A Categorization of Major Clustering Methods Partitioning Methods

U.S. Forest Service Forest Service U.S. Forest Inventory and Analysis Forest Service Research

Clustering kMeans, Expectation Maximization, Self-Organizing Maps Outline K-means

Chino Basin Optimum Chino Basin Optimum Basin Management Basin Management Program Program

OPTIMUM OPTIMUM ADAPTIVE ALGORITHMS ADAPTIVE ALGORITHMS for for SYSTEM IDENTIFICATION SYSTEM

Clustering Hierarchical clustering and k-mean clustering Genome 373 Genomic Informatics

Clustering Hierarchical clustering, k-mean clustering Genome 559: Introduction to Statistical and

Clustering and Dimensionality Reduction Preview Clustering K -means clustering

PAC-Bayesian Analysis of Co-clustering, Graph Clustering and Pairwise Clustering Yevgeny Seldin

Introduction to Machine Learning, Clustering and EM Barnab s P czos Contents Clustering

Trust based Clustering for Group Trust based Clustering for Group Trust based Clustering for

Finding Clusters Types of Clustering Approaches: Linkage Based, e.g. Hierarchical Clustering

Cl Clustering t i A Categorization of Major Clustering Methods Partitioning Methods

Data Mining Learning from Large Data Sets Lecture 8

Lecture 14: Inference in Dirichlet Processes (Blei &amp; Jordan, Variational inference for

Background Poisson or Binomial data with the following properties GLM with clustered data A

Advanced Reconstruction Algorithms for the CMS High Granularity Calorimeter Kevin Pedro (FNAL)

Big Data Era 1 1 https://vimeo.com/102998774 The big problem: Scalability Visualization

New Developments In The Theory Of Clustering thats all very well in practice, but does it work

Towards a Statistical Theory of Clustering Ulrike von Luxburg, Shai Ben-David Page 1 Ulrike von

Percolation Theory Percolation Theory Jie Gao Computer Science Department Stony Brook

Lecture 14: Inference in Dirichlet Processes (Blei & Jordan, Variational inference for