Unsupervised clustering with growing self-organizing neural network
A comparison with non-neural approach


  1. Unsupervised clustering with growing self-organizing neural network
     A comparison with non-neural approach
     Martin Hynar, Michal Burda, Jana Šarmanová
     Department of Computer Science, VŠB – Technical University of Ostrava,
     17. listopadu 15, 708 00 Ostrava – Poruba, Czech Republic
     Dateso 2005

  2. Outline
     ◮ K-means based methods
     ◮ CLASS method – flexible k-means
     ◮ Self-Organizing Map
     ◮ Growing Neural Gas
     ◮ Examples and comparison
     ◮ Conclusions

  3. Introduction
     ◮ The k-means method belongs among the most used methods in data mining.
     ◮ It must be given the number of expected clusters.
     ◮ What to do if this number cannot be determined in advance?
       1. Make multiple computations with varying settings.
       2. Adapt the algorithm so that it determines the number of clusters by itself.

  4. The goal
     ◮ Describe the "classical" approach of determining clusters using k-means based methods.
     ◮ Describe the solution using a self-organizing neural network.
     ◮ Compare both approaches.

  5. K-means based methods
     Phases (a minimal sketch follows this slide):
     1. Choose typical points.
     2. Clustering.
     3. Recompute typical points.
     4. Check termination condition.
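To make the four phases concrete, here is a minimal k-means sketch in Python. It is not from the paper; the NumPy-based implementation, the random initialization and the convergence test are illustrative assumptions.

```python
import numpy as np

def kmeans(X, K, max_iter=100, seed=0):
    """Minimal k-means: X is an (n, d) pattern array, K the number of clusters."""
    rng = np.random.default_rng(seed)
    # Phase 1: choose typical points (random patterns as initial centroids).
    centers = X[rng.choice(len(X), size=K, replace=False)]
    for _ in range(max_iter):
        # Phase 2: clustering - assign each pattern to its nearest typical point.
        labels = np.argmin(((X[:, None, :] - centers[None, :, :]) ** 2).sum(-1), axis=1)
        # Phase 3: recompute typical points as the means of their clusters.
        new_centers = np.array([X[labels == k].mean(axis=0) if np.any(labels == k)
                                else centers[k] for k in range(K)])
        # Phase 4: termination condition - stop once the typical points stop moving.
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return centers, labels
```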

  6. CLASS method – flexible k-means
     ◮ Tries to determine the number of clusters on-line.
     ◮ During the clustering process it splits large clusters.
     ◮ The very first step is one k-means clustering iteration, which divides the patterns into a base clustering.
     ◮ Each iteration starts with the exclusion of small clusters.
     ◮ Excessively variable clusters are dispersed.

  7. CLASS method – phases
     Phases:
     1. Excluding small clusters.
     2. Splitting clusters.
     3. Revoking clusters.

  8. CLASS method – splitting clusters
     ◮ The splitting threshold in the m-th iteration is determined by the equation
       S_m = S_{m-1} + (1 - S_0) / GAMA
     ◮ Then for each cluster two average deviations from the typical point are computed along each attribute j, one for the points on the left side of the typical point and one for the right side:
       D_jc = (1 / k_c) · Σ_{i=1..k_c} d_ij,   c ∈ {l, r}
     ◮ Using these deviations we compute the splitting control parameters a_1 and a_2 (relative ratios). If
       ◮ the number of clusters > 2K,
       ◮ a_1 > S_m or a_2 > S_m, and
       ◮ the number of processed patterns > 2(THETAN + 1),
       then we split the cluster along the j-th attribute (see the sketch below).
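The following Python sketch shows the shape of the splitting test. The slide does not define d_ij or the ratios a_1 and a_2 precisely, so the per-attribute deviations and the stand-in ratio below are assumptions; only the threshold formula and the parameter names S_0, GAMA and THETAN come from the slide.

```python
import numpy as np

def splitting_threshold(m, S0, GAMA):
    # S_m = S_{m-1} + (1 - S0)/GAMA, unrolled: the threshold grows linearly with m.
    return S0 + m * (1.0 - S0) / GAMA

def side_deviations(points, center, j):
    """Average deviations along attribute j for the points on the left and
    on the right side of the typical point (an assumed reading of d_ij)."""
    d = points[:, j] - center[j]
    left, right = np.abs(d[d < 0]), d[d >= 0]
    return (left.mean() if left.size else 0.0,
            right.mean() if right.size else 0.0)

def should_split(D_l, D_r, S_m, n_clusters, n_patterns, K, THETAN):
    # a1, a2: relative ratios built from the side deviations (stand-in definition).
    total = D_l + D_r
    a1, a2 = (D_l / total, D_r / total) if total > 0 else (0.0, 0.0)
    return ((a1 > S_m or a2 > S_m)
            and n_clusters > 2 * K               # conditions as listed on the slide
            and n_patterns > 2 * (THETAN + 1))
```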

  9. CLASS method – revoking clusters
     ◮ Determine the average minimum distance of the h current clusters:
       TAU = (1/h) · Σ_{i=1..h} D_i
     ◮ D_i is the minimum distance of the i-th typical point to the other typical points.
     ◮ If for some i it holds that D_i < TAU and h > K/2, we revoke the i-th cluster (sketch below).
     ◮ The clustering ends in the GAMA-th iteration.
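A sketch of the revoking step under these definitions. Computing D_i as the minimum Euclidean distance between typical points follows the slide; the array layout and the in-place filtering are illustrative assumptions.

```python
import numpy as np

def revoke_clusters(centers, K):
    """Drop typical points that lie too close to the others (D_i < TAU, h > K/2)."""
    h = len(centers)
    # D_i: minimum distance from the i-th typical point to any other one.
    dist = np.linalg.norm(centers[:, None, :] - centers[None, :, :], axis=-1)
    np.fill_diagonal(dist, np.inf)
    D = dist.min(axis=1)
    TAU = D.mean()                       # TAU = (1/h) * sum_i D_i
    keep = []
    for i in range(h):
        if D[i] < TAU and h > K / 2:     # revoke the i-th cluster
            h -= 1
        else:
            keep.append(i)
    return centers[keep]
```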

 10. Self-Organizing Map
     ◮ A set A of mutually interconnected neurons forming a topological grid.
     ◮ A pattern x is presented to the net to determine the winner:
       c = argmin_{a ∈ A} ||x − w_a||
     ◮ The weight vectors of the winner and its neighbours are adapted:
       w_ji(t+1) = w_ji(t) + h_cj(t) · (x_i(t) − w_ji(t))   if j ∈ N(c),
       w_ji(t+1) = w_ji(t)   otherwise.
     ◮ The SOM preserves topology, so the neurons are placed in the most dense regions (sketch below).
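A minimal SOM training step in Python. The slide leaves the neighbourhood function h_cj(t) unspecified; the Gaussian-on-grid-distance form and the learning-rate handling below are common choices, assumed here for illustration.

```python
import numpy as np

def som_step(W, grid_pos, x, lr=0.5, sigma=1.0):
    """One update: W is an (n_neurons, d) weight array, grid_pos the neurons'
    coordinates on the topological grid, x a single pattern."""
    # Competition: the winner c minimises ||x - w_a|| over all neurons a.
    c = np.argmin(np.linalg.norm(W - x, axis=1))
    # Neighbourhood h_cj: Gaussian in grid distance, so the winner moves most
    # and far-away grid neighbours barely move (an assumed concrete form).
    h = np.exp(-np.linalg.norm(grid_pos - grid_pos[c], axis=1) ** 2
               / (2 * sigma ** 2))
    # Adaptation: pull the winner and its grid neighbours towards x.
    W += lr * h[:, None] * (x - W)
    return c
```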

 11. Growing Neural Gas
     ◮ Introduced by Bernd Fritzke.
     ◮ Motivation:
       ◮ The net can have variable size.
       ◮ Neurons are added and/or removed according to proportions in the net.
       ◮ Connections between neurons are impermanent.
     ◮ The resulting net can in fact be a set of independent nets.

 12. GNG – phases
     Phases:
     1. Competition.
     2. Adaptation.
     3. Removing.
     4. Inserting new neurons.
     5. Check termination condition.

 13. GNG – competition
     ◮ Determine the two neurons s1 and s2 nearest to the pattern x.
     ◮ If no connection between these two neurons exists, add one.
     ◮ The age of the connection is set to 0.
     ◮ The local error variable of the winner is increased by the squared distance to the pattern:
       ΔE_{s1} = ||x − w_{s1}||²
     (A sketch of this step follows.)
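A sketch of the competition step. The data structures, a weight array W, an error array E and a dict `edges` mapping neuron pairs to connection ages, are illustrative assumptions, reused in the sketches for the next two phases.

```python
import numpy as np

def gng_competition(W, E, edges, x):
    """Find the two nearest neurons, refresh their connection, update the error."""
    d = np.linalg.norm(W - x, axis=1)
    s1, s2 = np.argsort(d)[:2]                 # the two neurons nearest to x
    edges[frozenset((int(s1), int(s2)))] = 0   # add the connection (or reset its age)
    E[s1] += d[s1] ** 2                        # Delta E_s1 = ||x - w_s1||^2
    return int(s1), int(s2)
```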

 14. GNG – adaptation & removing
     Adaptation
     ◮ The weight vectors of neuron s1 and its topological neighbours are adapted by the fractions ε_b and ε_n:
       Δw_{s1} = ε_b (x − w_{s1})
       Δw_i = ε_n (x − w_i)   for all i ∈ N_{s1}
     ◮ The age of all the winner's outgoing edges is increased by 1.
     Removing
     ◮ All connections with age greater than age_max are removed.
     ◮ All standalone neurons are removed.
     (Sketch below.)
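A sketch of adaptation and removing on the same structures; the default values of ε_b, ε_n and age_max are common GNG settings, not taken from the paper.

```python
def gng_adapt_and_remove(W, edges, x, s1, eps_b=0.2, eps_n=0.006, age_max=50):
    """Move s1 and its neighbours towards x, age s1's edges, prune old ones."""
    W[s1] += eps_b * (x - W[s1])                 # winner: fraction eps_b
    for e in list(edges):
        if s1 in e:
            (i,) = e - {s1}
            W[i] += eps_n * (x - W[i])           # topological neighbour: eps_n
            edges[e] += 1                        # age the winner's edges by 1
            if edges[e] > age_max:
                del edges[e]                     # remove too-old connections
    alive = {i for e in edges for i in e}
    # Standalone neurons (indices not in `alive`) would be removed here;
    # the index bookkeeping is omitted to keep the sketch short.
```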

 15. GNG – inserting new neurons
     ◮ A new neuron is added every λ-th step using this procedure (see the sketch below):
       1. Determine the neuron p with the largest accumulated local error and its neighbour r with the largest accumulated local error.
       2. Create a new neuron q and set its weight to the mean of the weights of p and r.
       3. Remove the connection between p and r and add new connections between p and q and between q and r.
       4. Decrease the local accumulated errors of p and r by the fraction α and set the local accumulated error of q to the mean of the errors of p and r.
       5. Decrease the local accumulated errors of all other neurons by the fraction β.
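A sketch of the insertion step. The slide says the errors are "decreased by the fraction" α or β; multiplying by α and by (1 − β) is one plausible reading, assumed here, and the default values are common GNG settings rather than the paper's.

```python
import numpy as np

def gng_insert(W, E, edges, alpha=0.5, beta=0.0005):
    """Insert a new neuron q between the worst neuron p and its worst neighbour r."""
    p = int(np.argmax(E))                          # largest accumulated local error
    neighbours = [next(iter(e - {p})) for e in edges if p in e]
    r = max(neighbours, key=lambda i: E[i])        # p's neighbour with largest error
    q = len(W)
    W = np.vstack([W, (W[p] + W[r]) / 2])          # w_q = mean of w_p and w_r
    del edges[frozenset((p, r))]                   # p-r is replaced by p-q and q-r
    edges[frozenset((p, q))] = 0
    edges[frozenset((q, r))] = 0
    E[p] *= alpha; E[r] *= alpha                   # decrease by the fraction alpha
    E = np.append(E, (E[p] + E[r]) / 2)            # E_q = mean of p's and r's errors
    E *= (1 - beta)                                # decrease all errors by beta
    return W, E
```

In a full run these three sketches would be looped: competition and adaptation on every presented pattern, insertion every λ-th step, until a size or error criterion is met.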

 16. Examples and comparison – the basis
     ◮ A set of 1000 patterns with a given distribution.
     ◮ The k-means and CLASS methods use the discrete points; SOM and GNG use continuously generated points from the same distribution.
     [Figures: (a) the distribution, (b) objects from the distribution]

 17. Examples and comparison – k-means and SOM
     ◮ Test whether both methods produce a similar partitioning when the number of units equals the number of clusters.
     [Figures: (c) k-means with K = 4, (d) SOM with 4 neurons]

 18. k-means and SOM
     ◮ A dangerous situation occurs when the number of representatives is slightly higher or lower than the number of clusters.
     ◮ The result is then hardly interpretable, i.e. the typical points do not represent the clusters.
     [Figures: (e) k-means with K = 5, (f) k-means with K = 3]

 19. Examples and comparison – CLASS and GNG
     ◮ Compare the results reached with both methods.
     ◮ Both methods modify the number of clusters using different approaches, so compare them when they have an identical cluster count:
       ◮ in the early iterations (few representatives) – 4
       ◮ a little more representatives – 9
       ◮ enough representatives – 25

 20. CLASS and GNG – 4 representatives
     ◮ Both results represent a rough partitioning.
     ◮ The representatives are near the centres, covering the clusters as a whole.
     [Figures: (g) CLASS, (h) GNG]

 21. CLASS and GNG – 9 representatives
     ◮ A more fine-grained partitioning: an effort to cover smaller parts of the clusters.
     ◮ GNG expresses the topology of the clusters using connections.
     ◮ GNG's result could be interpreted as "three clusters", but ...
     [Figures: (i) CLASS, (j) GNG]

 22. CLASS and GNG – 25 representatives
     ◮ The dislocation of the representatives looks similar.
     ◮ GNG's result is nicely interpretable: 4 clusters with some topology.
     [Figures: (k) CLASS, (l) GNG]

 23. Conclusions
     ◮ Both approaches produce similar results.
     ◮ A suitable interpretation of the connections could make the results clearer.
     ◮ A good feature of GNG: the net can form a set of independent sets of neurons.
       ◮ This gives additional useful information.
       ◮ It needs to be interpreted with care.
     ◮ The situation in n-dimensional space is future work.

 24. That's all, thank you for your attention. Questions welcome.
