Axioms for graph clustering Twan van Laarhoven and Elena Marchiori - PowerPoint PPT Presentation

Axioms for graph clustering Twan van Laarhoven and Elena Marchiori Institute for Computing and Information Sciences Radboud University Nijmegen, The Netherlands 27th September 2013 1 / 49

Outline Introduction Axioms for data clustering Axioms for graph clustering Modularity Conclusion 2 / 49

Clustering • Image processing, medicine, • biology, economy, ... see, e.g., UCI ML repository. 4 / 49

Clustering • social sciences, • life sciences, brain research, ... see, e.g., UCI Network Data repository. 5 / 49

Clustering: what is it? • Informally: grouping objects in such a way that objects in each group are more similar to each other than to objects in other groups. • Formally: an optimization problem. Define an objective function whose optimization yields a division of objects into (disjoint) groups. k-means clustering objective: � � || � x − � µ c || 2 , where � µ c = � x ∈ c � x / | c | . � c ∈ C � x ∈ c 6 / 49

Clustering: how to do it? • Clustering as an optimization problem is in general NP-hard. • Efficient heuristic and approximation algorithms are developed to find sub optimal solutions. 7 / 49

Clustering: data versus graphs • Data clustering uses a distance function that quantifies the similarity be- tween each pair of patterns. • Graph clustering uses weighted edges describing a relation over patterns. 8 / 49

From data to graph clustering • Proximity graphs may be used to transform a data clustering problem into a graph clustering one. Distance matrix → k NN graph → Graph clustering   · · · · · · · · · · · ·     · · · · · ·     · · · · · · 9 / 49

Why axioms? • There is no unique definition of clustering. • Can we formalize our intuition of good objective functions? • Are existing objective functions good? • Can we design better objective functions? 11 / 49

Axioms for data clustering Kleinberg’ s axiomatic framework Kleinberg proved an impossibility result concerning the axiomatization of the notion of data clustering. He focused on clustering functions ˆ C : D → C , from distance functions over a dataset S to clusterings of S , d �→ C . Theorem (Kleinberg 2002) There is no clustering function that is scale invariant, consistent and rich. 12 / 49

Kleinberg’s axioms • Scale-Invariance . C ( d ) = ˆ ˆ ∀ d ∈ D , α > 0 . C ( α d ).   c   a c  a ˆ  = ˆ C C     d b d b 13 / 49

Kleinberg’s axioms • Richness . range( ˆ C ) is equal to the set of all partitions of S . ∃ d . ˆ C ( d ) = a c b d a e.g. d = c d b 13 / 49

Kleinberg’s axioms • Consistency . � ˆ ∀ d , d ′ ∈ D . C ( d ) = C and d ′ is a C -transformation of d ⇒ ˆ � C ( d ′ ) = C . d ′ is a C-transformation of d if ∀ i , j ∈ S • i ∼ C j ⇒ d ′ ( i , j ) ≤ d ( i , j ); • i �∼ C j ⇒ d ′ ( i , j ) ≥ d ( i , j ).   a ˆ  = a C c c b  b � � a ⇒ ˆ c = a c C b b 13 / 49

Kleinberg’s axioms • Scale-Invariance . C ( d ) = ˆ ˆ ∀ d ∈ D , α > 0 . C ( α d ). • Richness . range( ˆ C ) is equal to the set of all partitions of S . • Consistency . � ˆ ∀ d , d ′ ∈ D . C ( d ) = C and d ′ is a C -transformation of d ⇒ ˆ � C ( d ′ ) = C . d ′ is a C-transformation of d if ∀ i , j ∈ S • i ∼ C j ⇒ d ′ ( i , j ) ≤ d ( i , j ); • i �∼ C j ⇒ d ′ ( i , j ) ≥ d ( i , j ). 13 / 49

Kleinberg result C ′ is a refinement of C ( C ′ ⊑ C ) if ∀ c ′ ∈ C ′ ∃ c ∈ C s.t. c ′ ⊆ c . { C 1 , . . . , C n } ⊂ C is an antichain if ∀ i , j i � = j ⇒ C i �⊑ C j . Theorem If ˆ C is Scale Invariant and Consistent then range ( ˆ C ) is an antichain. Proof (sketch) Suppose ˆ C is Consistent and Scale Invariant. Let C 0 ⊑ C 1 in range( ˆ C ). Construct d such that ˆ C ( d ) = C 1 . Choose α such that d ′ = α d and ˆ C ( d ′ ) = C 0 . 14 / 49

Other results Quality functions Ackerman and Ben-David used quality functions Q instead of clustering functions. Q : D × C → R ≥ 0 , mapping a distance function and a clustering into a non-negative real number, ( d , C ) �→ r . Theorem (Ackerman, Ben-David 2008) There is a clustering quality function that is permutation invariant, scale invariant, monotonic and rich. C-index = ( s − s min ) / ( s max − s min ), where s = � i ∼ C j d ( i , j ), s min is the sum of the n minimal (over all pairs of patterns) distances, s max is the sum of the n maximal distances, n = |{ ( i , j ) | i ∼ C j }| . 15 / 49

To summarize • Previous work on axioms for clustering objective functions are framed in terms of distance functions. • Kleinberg’s impossibility result is for clustering functions. • Quality functions are more flexible and allow for axiomatization of data clustering. • What about graph clustering? This is a different - although related - story ... 16 / 49

Graphs Distance functions Graphs d ( i , j ) E ( i , j ) a b a b c c d d 18 / 49

Graphs Distance functions Graphs d ( i , j ) E ( i , j ) a b a b - c c d d 18 / 49

Graphs b d f a e k g c h i j A symmetric weighted graph (or network) is a pair ( V , E ) of • a finite set V of nodes , and • a function E : V × V → R ≥ 0 of edge weights , such that E ( i , j ) = E ( j , i ) for all i , j ∈ V . 19 / 49

Graph clustering b d f a e k g c h i j A clustering C of a graph G = ( V , E ) is a partition of its nodes. 19 / 49

Clustering: formalizations 1. Clustering function ˆ C : Graph → Clustering   b d b d  a  = a ˆ C   e e c c 2. Quality function Q : Graph × Clustering → R 3. Quality relation · � G · ⊆ Clustering × Clustering 20 / 49

Clustering: formalizations 1. Clustering function ˆ C : Graph → Clustering 2. Quality function Q : Graph × Clustering → R   b d  a Q  = 0 . 1234   e c 3. Quality relation · � G · ⊆ Clustering × Clustering 20 / 49

Clustering: formalizations 1. Clustering function ˆ C : Graph → Clustering 2. Quality function Q : Graph × Clustering → R 3. Quality relation · � G · ⊆ Clustering × Clustering b d b d a � a e e c c 20 / 49

Some quality functions • Connected components • Total weight of within cluster edges � Q ( G , C ) = w c c ∈ C • Modularity � w c / v V − ( v c / v V ) 2 � � Q ( G , C ) = c ∈ C • Many more � − w c log( v c / v V ) Q ( G , C ) = c ∈ C · · · 21 / 49

Families of quality functions • Connected components with threshold • Total weight of within cluster edges with penalty � Q ( G , C ) = w c − α | C | c ∈ C • Modularity � Q γ w c / v V − γ ( v c / v V ) 2 � � RB ( G , C ) = c ∈ C • Many more � − w c log( v c /α ) Q ( G , C ) = c ∈ C · · · 22 / 49

Axiom 1: Scale invariance Intuition: The magnitude of the edge weights shouldn’t matter.     b d b d  a  a ˆ  = ˆ C C      e e c c 23 / 49

Axiom 1: Scale invariance Intuition: The magnitude of the edge weights shouldn’t matter.     b d b d  a  a Q  = Q      e e c c 23 / 49

Axiom 1: Scale invariance Intuition: The magnitude of the edge weights shouldn’t matter.     b d b d  a  a Q  = α Q      e e c c 23 / 49

Axiom 1: Scale invariance Intuition: The magnitude of the edge weights shouldn’t matter.     Q ≥ Q         �     ≥ Q Q         23 / 49

Axiom 1: Scale invariance Intuition: The magnitude of the edge weights shouldn’t matter. A quality function Q is scale invariant if • for all graphs G = ( V , E ), • all constants α > 0, Q ( G , C 1 ) ≥ Q ( G , C 2 ) if and only if Q ( α G , C 1 ) ≥ Q ( α G , C 2 ). 23 / 49

Axiom 2: Permutation invariance Intuition: Only the edge weights should matter.     x y b d  a z Q  = Q       e c u v 24 / 49

Axiom 2: Permutation invariance Intuition: Only the edge weights should matter. A quality function Q is permutation invariant if Q ( G , C ) = Q ( f ( G ) , f ( C )) . for all • graphs G = ( V , E ) and • all isomorphisms f : V → V ′ , where f is extended to graphs and clusterings in the obvious way. 24 / 49

Axiom 3: Richness Intuition: • All clusterings must be possible. So, • no trivial quality functions. • no fixed number of clusters. A quality function Q is rich if • for all sets V and • all partitions C ∗ of V , there is • a graph G = ( V , E ) • such that C ∗ is the optimal clustering of G . 25 / 49

Axioms for graph clustering Twan van Laarhoven and Elena Marchiori - PowerPoint PPT Presentation

Axioms for graph clustering Twan van Laarhoven and Elena Marchiori Institute for Computing and Information Sciences Radboud University Nijmegen, The Netherlands 27th September 2013 1 / 49 Outline Introduction Axioms for data clustering

Graph Clustering Graph Clustering What is clustering? What is clustering? Finding patterns

Subspace Clustering Ensemble Clustering Subspace Clustering, Ensemble Clustering, Alternative

Graph Clustering Why graph clustering is useful? Distance matrices are graphs as useful as

PAC-Bayesian Analysis of Co-clustering, Graph Clustering and Pairwise Clustering Yevgeny Seldin

Axioms for graph clustering objective functions Twan van Laarhoven Institute for Computing and

Evolutionary Clustering Presenter: Lei Tang Evolutionary Clustering Evolutionary Clustering

Clustering A Categorization of Major Clustering Methods Partitioning Methods

Trust based Clustering for Group Trust based Clustering for Group Trust based Clustering for

Finding Clusters Types of Clustering Approaches: Linkage Based, e.g. Hierarchical Clustering

Clustering Hierarchical clustering and k-mean clustering Genome 373 Genomic Informatics

Cl Clustering t i A Categorization of Major Clustering Methods Partitioning Methods

Clustering Hierarchical clustering, k-mean clustering Genome 559: Introduction to Statistical and

CSCE 478/878 Lecture 8: Stephen Scott Clustering Introduction Outline Clustering Stephen

Clustering and Dimensionality Reduction Preview Clustering K -means clustering

Clustering kMeans, Expectation Maximization, Self-Organizing Maps Outline K-means

Lecture 23: Spectral clustering Hierarchical clustering What is a good clustering?

RESEARCH POSTER WORKSHOP URSI 2017 Vassar College Chris Gahn WHY MAKE A RESEARCH POSTER?

Work-up of Abdominal Pain " ,

Question Area assessed Experts

A differentiated strategy for cancer care David Epstein, MD August 27, 2012 We put people at the

Pesticide Safety: Labels, Toxicology & PPE What is a Pesticide? Pesticides are chemicals

Winter Driving Safety PPT-SM-WNTRDRVNG 1 V.A.0.0 Winter Driving The leading cause of death

CRITICAL DOCUMENTS AND VALUABLES When disaster strikes, your immediate concern will be your

November 21, 2012 Hydraulic Fracturing: The Good, The Bad, and the Ugly Stephan MacLellan, P.Eng

Axioms for graph clustering Twan van Laarhoven and Elena Marchiori - PowerPoint PPT Presentation

Axioms for graph clustering Twan van Laarhoven and Elena Marchiori Institute for Computing and Information Sciences Radboud University Nijmegen, The Netherlands 27th September 2013 1 / 49 Outline Introduction Axioms for data clustering

Graph Clustering Graph Clustering What is clustering? What is clustering? Finding patterns

Subspace Clustering Ensemble Clustering Subspace Clustering, Ensemble Clustering, Alternative

Graph Clustering Why graph clustering is useful? Distance matrices are graphs as useful as

PAC-Bayesian Analysis of Co-clustering, Graph Clustering and Pairwise Clustering Yevgeny Seldin

Axioms for graph clustering objective functions Twan van Laarhoven Institute for Computing and

Evolutionary Clustering Presenter: Lei Tang Evolutionary Clustering Evolutionary Clustering

Clustering A Categorization of Major Clustering Methods Partitioning Methods

Trust based Clustering for Group Trust based Clustering for Group Trust based Clustering for

Finding Clusters Types of Clustering Approaches: Linkage Based, e.g. Hierarchical Clustering

Clustering Hierarchical clustering and k-mean clustering Genome 373 Genomic Informatics

Cl Clustering t i A Categorization of Major Clustering Methods Partitioning Methods

Clustering Hierarchical clustering, k-mean clustering Genome 559: Introduction to Statistical and

CSCE 478/878 Lecture 8: Stephen Scott Clustering Introduction Outline Clustering Stephen

Clustering and Dimensionality Reduction Preview Clustering K -means clustering

Clustering kMeans, Expectation Maximization, Self-Organizing Maps Outline K-means

Lecture 23: Spectral clustering Hierarchical clustering What is a good clustering?

RESEARCH POSTER WORKSHOP URSI 2017 Vassar College Chris Gahn WHY MAKE A RESEARCH POSTER?

Work-up of Abdominal Pain &quot; ,

Question Area assessed Experts

A differentiated strategy for cancer care David Epstein, MD August 27, 2012 We put people at the

Pesticide Safety: Labels, Toxicology &amp; PPE What is a Pesticide? Pesticides are chemicals

Winter Driving Safety PPT-SM-WNTRDRVNG 1 V.A.0.0 Winter Driving The leading cause of death

CRITICAL DOCUMENTS AND VALUABLES When disaster strikes, your immediate concern will be your

November 21, 2012 Hydraulic Fracturing: The Good, The Bad, and the Ugly Stephan MacLellan, P.Eng

Work-up of Abdominal Pain " ,

Pesticide Safety: Labels, Toxicology & PPE What is a Pesticide? Pesticides are chemicals