How Many Dissimilarity/Kernel Self Organizing Map Variants Do We - PowerPoint PPT Presentation

How Many Dissimilarity/Kernel Self Organizing Map Variants Do We Need? Fabrice Rossi SAMM, Université Paris 1 WSOM 2014 Mittweida

How Many Dissimilarity/Kernel Self Organizing Map Variants Do We Need? Fabrice Rossi SAMM, Université Paris 1 WSOM 2014 Mittweida “a little bit small compared to Paris”

Data complexity is increasing Modern data are complex ◮ text everywhere (comments, messages, status, etc.) ◮ images everywhere ◮ relations (friends/contact, like/plus, ad hoc discussion, etc.) ◮ mixed data (buyers/items, listeners/songs, etc.)

Data complexity is increasing Modern data are complex ◮ text everywhere (comments, messages, status, etc.) ◮ images everywhere ◮ relations (friends/contact, like/plus, ad hoc discussion, etc.) ◮ mixed data (buyers/items, listeners/songs, etc.) The vector model... ◮ in which all objects ( x i ) 1 ≤ i ≤ N live in a fixed vector space R p ◮ ...is less and less relevant Solutions 1. specific solutions (e.g., probabilistic models for relational data) 2. generic solutions via a comparison measure

Dissimilarity/Kernel Data Data model ◮ a data space X (might be implicit) ◮ N observations ( x i ) 1 ≤ i ≤ N from X (possibly with no attached description) Dissimilarity ◮ a symmetric dissimilarity d function from X 2 to R + ◮ or a symmetric matrix D = ( d ( x i , x j )) 1 ≤ i ≤ N , 1 ≤ j ≤ N Kernel ◮ a kernel function k from X 2 to R , symmetric and positive definite ◮ or a symmetric positive definite matrix K = ( k ( x i , x j )) 1 ≤ i ≤ N , 1 ≤ j ≤ N

SOM Low dimensional prior structure ◮ a regular lattice of K units/neurons in R 2 : ( r k ) 1 ≤ k ≤ K ◮ a time dependent neighborhood function h kl ( t ) , e.g. � � − � r k − r l � 2 h kl ( t ) = exp 2 σ 2 ( t ) Mapping ◮ each neuron r k is associated to a prototype/model m k in the data space ◮ each m k / r k is responsible of a cluster of data points, the C k : quantization/clustering aspect ◮ if r k and r l are close according to h kl then m k and m l should be close: topology preservation aspect

Training Algorithms Stochastic/Online SOM 1. select a random data point x 2. find its best matching unit k ∈{ 1 ,..., K } � x − m k ( t ) � 2 c = arg min 3. update all prototypes m k ( t + 1 ) = m k ( t ) + ǫ ( t ) h kc ( t )( x − m k ( t )) 4. loop to 1 until convergence

Training Algorithms Batch SOM 1. compute the best matching unit for all data points k ∈{ 1 ,..., K } � x i − m k ( t ) � 2 c i ( t ) = arg min 2. update all prototypes � N i = 1 h kc i ( t ) ( t ) x i m k ( t + 1 ) = � N i = 1 h kc i ( t ) ( t ) 3. loop to 1 until convergence

Demo + + + ++ + + + + + + + + + + + + + + + + + + + + + + + ++ + + + + + + + ++ + + + + + + + + + + + + + + + + + + + ++ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + ++ + + + + + + ++ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +++ + + + + + + + + + + + + ++ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + ++ + + + + + + + + + ++ + + + + + + + + + + ++ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + ++ + + + + + + + + + + + + + + + + + + + + + + ++ + + + + + + + + + + + + + + + ++ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + A simple 2D dataset The original grid

Demo + + + + + ++ + + + ++ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + ++ + + + + + + + ++ + + + + + + ++ + + + + + + + + + + + + ++ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + ++ + + + + + + + + + ++ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + ++ + + + + + + + + + + ++ + + + + + + ++ + + + + + + + + ++ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +++ + + + +++ + + + + + + + + + + + + + + + + + + + + + + + + + ++ + + + + + + + + + + + + + ++ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + ++ + + + + + ++ + + + + + + + + + ++ + + + ++ + + + + + + + + ++ + + + + + + + + + + + ++ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + ++ + + + + + + + ++ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + ++ + ++ + + + + + + + + + + + + + + + + + + + + + ++ + + + + + + ++ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + A simple 2D dataset Prototype positions in the data space

How Many Dissimilarity/Kernel Self Organizing Map Variants Do We - PowerPoint PPT Presentation

How Many Dissimilarity/Kernel Self Organizing Map Variants Do We Need? Fabrice Rossi SAMM, Universit Paris 1 WSOM 2014 Mittweida How Many Dissimilarity/Kernel Self Organizing Map Variants Do We Need? Fabrice Rossi SAMM, Universit Paris

Self-Organizing Maps Cao Mai December 14, 2009 Outline n Define Self-Organizing Maps (SOM)

Self-Organizing Maps Kyle Thayer Organizing Marbles Self-Organizing Maps Algorithm

Some Clustering Methods on Some Clustering Methods on Some Clustering Methods on Dissimilarity

Self organizing robot Self organizing robot gathering Seminar in Distributed Computing Christof

??? Encode dissimilarity between locations as edge weights distance

Mathematical Modeling of Mathematical Modeling of Self-Organizing Systems Self Organizing

Parallelizing the Growing Self-Organizing Maps algorithm using Software Transactional Memory

Tight Kernel Query Complexity of Kernel Ridge Regression and Kernel -means Clustering Manuel

Reranking with Contextual dissimilarity measures from representational Bregman k -means VISAPP

Quantale-valued dissimilarity Lili Shen (joint with Hongliang Lai, Yuanye Tao and Dexue Zhang)

map-D map-D data refined map-D data refined map-D A GPU Database for Real-Time Big Data

Tight Kernel Query Complexity of Kernel Ridge Regression and Kernel -means Clustering Manuel

Debugging the Linux Kernel with GDB Kieran Bingham Debugging the Linux Kernel with GDB Many

MANAGEMENT FUNDAMENTALS ORGANIZING ORGANIZING Lesson 3 Part 2 After developing plans,

Black Kernel Rot Malady of Pecan B Wood, C Bock, l Wells, T Cottrell, M Hotchkiss Black Kernel

Kernel Properties - Convexity Leila Wehbe October 1st 2013 Leila Wehbe Kernel Properties -

work_mem

Mendel at NERSC: Multiple Workloads on a Single Linux Cluster Larry Pezzaglia NERSC

CEE 370 Environmental Engineering Principles Lecture #17 Ecosystems IV: Microbiology &

CloudBATCH: A Batch Job Queuing System on Clouds with Hadoop and HBase Chen Zhang Hans De Sterck

Learning about the process and organism: Batch Sef Heijnen, Department of Biotechnology, Faculty

APEL Accounting: Data Flow and Work Plan Adrian Coveney, Greg Corbett apel-admins@stfc.ac.uk

Distributed Training Across the World 183ms 23Mbps California 35ms Tokyo 17Mbps 63Mbps

Learning Deconvolution Network for Semantic Segmentation Hyeonwoo Noh, Seunghoon Hong, Bohyung

Sambuz

Useful Links

Newsletter

Mail Us

How Many Dissimilarity/Kernel Self Organizing Map Variants Do We - PowerPoint PPT Presentation

How Many Dissimilarity/Kernel Self Organizing Map Variants Do We Need? Fabrice Rossi SAMM, Universit Paris 1 WSOM 2014 Mittweida How Many Dissimilarity/Kernel Self Organizing Map Variants Do We Need? Fabrice Rossi SAMM, Universit Paris

Self-Organizing Maps Cao Mai December 14, 2009 Outline n Define Self-Organizing Maps (SOM)

Self-Organizing Maps Kyle Thayer Organizing Marbles Self-Organizing Maps Algorithm

Some Clustering Methods on Some Clustering Methods on Some Clustering Methods on Dissimilarity

Self organizing robot Self organizing robot gathering Seminar in Distributed Computing Christof

??? Encode dissimilarity between locations as edge weights distance

Mathematical Modeling of Mathematical Modeling of Self-Organizing Systems Self Organizing

Parallelizing the Growing Self-Organizing Maps algorithm using Software Transactional Memory

Tight Kernel Query Complexity of Kernel Ridge Regression and Kernel -means Clustering Manuel

Reranking with Contextual dissimilarity measures from representational Bregman k -means VISAPP

Quantale-valued dissimilarity Lili Shen (joint with Hongliang Lai, Yuanye Tao and Dexue Zhang)

map-D map-D data refined map-D data refined map-D A GPU Database for Real-Time Big Data

Tight Kernel Query Complexity of Kernel Ridge Regression and Kernel -means Clustering Manuel

Debugging the Linux Kernel with GDB Kieran Bingham Debugging the Linux Kernel with GDB Many

MANAGEMENT FUNDAMENTALS ORGANIZING ORGANIZING Lesson 3 Part 2 After developing plans,

Black Kernel Rot Malady of Pecan B Wood, C Bock, l Wells, T Cottrell, M Hotchkiss Black Kernel

Kernel Properties - Convexity Leila Wehbe October 1st 2013 Leila Wehbe Kernel Properties -

work_mem

Mendel at NERSC: Multiple Workloads on a Single Linux Cluster Larry Pezzaglia NERSC

CEE 370 Environmental Engineering Principles Lecture #17 Ecosystems IV: Microbiology &amp;

CloudBATCH: A Batch Job Queuing System on Clouds with Hadoop and HBase Chen Zhang Hans De Sterck

Learning about the process and organism: Batch Sef Heijnen, Department of Biotechnology, Faculty

APEL Accounting: Data Flow and Work Plan Adrian Coveney, Greg Corbett apel-admins@stfc.ac.uk

Distributed Training Across the World 183ms 23Mbps California 35ms Tokyo 17Mbps 63Mbps

Learning Deconvolution Network for Semantic Segmentation Hyeonwoo Noh, Seunghoon Hong, Bohyung

Sambuz

Useful Links

Newsletter

Mail Us

CEE 370 Environmental Engineering Principles Lecture #17 Ecosystems IV: Microbiology &