axioms for graph clustering objective functions
play

Axioms for graph clustering objective functions Twan van Laarhoven - PowerPoint PPT Presentation

Introduction Axioms Modularity Adaptive Modularity Conclusion Axioms for graph clustering objective functions Twan van Laarhoven Institute for Computing and Information Sciences Radboud University Nijmegen, The Netherlands 28th June 2013 1


  1. Introduction Axioms Modularity Adaptive Modularity Conclusion Axioms for graph clustering objective functions Twan van Laarhoven Institute for Computing and Information Sciences Radboud University Nijmegen, The Netherlands 28th June 2013 1 / 32

  2. Introduction Axioms Modularity Adaptive Modularity Conclusion Outline Introduction Axioms Modularity Adaptive Modularity Conclusion 2 / 32

  3. Introduction Axioms Modularity Adaptive Modularity Conclusion The motivation • There is no strict definition of clustering. • Can we formalize our intuition? • Previous work is about distance based clustering (hierarchical clustering, K-means, etc.) • What about graphs? 3 / 32

  4. Introduction Axioms Modularity Adaptive Modularity Conclusion The setting Definition (Graph) A symmetric weighted graph is a pair ( V , E ) of • a finite set V of nodes , and • a function E : V × V → R ≥ 0 of edge weights , such that E ( i , j ) = E ( j , i ) for all i , j ∈ V . • Larger weight = stronger connection. • We allow self loops. 4 / 32

  5. Introduction Axioms Modularity Adaptive Modularity Conclusion The setting (cont.) Definition (Clustering) A clustering C of a graph G = ( V , E ) is a partition of its nodes. Definition (Clustering function) A graph clustering function f is a function from graphs G to clusterings of G . Definition (Objective function) A graph clustering objective function Q is a function from graphs G and clusterings of G to R . • Larger objective value = better. 5 / 32

  6. Introduction Axioms Modularity Adaptive Modularity Conclusion Outline Introduction Axioms Modularity Adaptive Modularity Conclusion 6 / 32

  7. Introduction Axioms Modularity Adaptive Modularity Conclusion The form of axioms Things that define clusterings Form Notation 1 Clustering function f ( G ) = argmax C Q ( G , C ) 2 Objective function Q ( G , C ) Q ( G , C ) ≥ Q ( G , D ) or C ≥ G D 3 Objective relation 7 / 32

  8. Introduction Axioms Modularity Adaptive Modularity Conclusion Basic axioms Axiom 1: Scale invariance (first form) A graph clustering objective function Q is scale invariant if • for all graphs G = ( V , E ), • all constants α > 0, f ( G ) = f ( α G ). (where α G = ( V , ( i , j ) �→ α E ( i , j )).) Example     b d b d b d  a  a  = a  = f f e e e c c c 8 / 32

  9. Introduction Axioms Modularity Adaptive Modularity Conclusion Basic axioms Axiom 1: Scale invariance (second form) A graph clustering objective function Q is scale invariant if • for all graphs G = ( V , E ), • all constants α > 0, • all clusterings C of G , Q ( G , C ) = Q ( α G , C ). (where α G = ( V , ( i , j ) �→ α E ( i , j )).) Example     b d b d  a  a  = Q Q  e e c c 8 / 32

  10. Introduction Axioms Modularity Adaptive Modularity Conclusion Basic axioms Axiom 1: Scale invariance (second form) A graph clustering objective function Q is scale invariant if • for all graphs G = ( V , E ), • all constants α > 0, • all clusterings C of G , Q ( G , C ) = α Q ( α G , C ) ??? (where α G = ( V , ( i , j ) �→ α E ( i , j )).) Example     b d b d  a  a  = α Q Q  e e c c 8 / 32

  11. Introduction Axioms Modularity Adaptive Modularity Conclusion Basic axioms Axiom 1: Scale invariance (third form) A graph clustering objective function Q is scale invariant if • for all graphs G = ( V , E ), • all constants α > 0, • all clusterings C 1 , C 2 of G , Q ( G , C 1 ) ≥ Q ( G , C 2 ) if and only if Q ( α G , C 1 ) ≥ Q ( α G , C 2 ). (where α G = ( V , ( i , j ) �→ α E ( i , j )).) Example � � � � � � � � Q ≥ Q ⇐ ⇒ Q ≥ Q 8 / 32

  12. Introduction Axioms Modularity Adaptive Modularity Conclusion Basic axioms Axiom 2: permutation invariance A graph clustering objective function Q is permutation invariant if • for all graphs G = ( V , E ) and • all isomorphisms f : V → V ′ , it is the case that Q ( G , C ) = Q ( f ( G ) , f ( C )). (where f is extended to graphs and clusterings in the obvious way.) Example    x  y b d  a z  = Q Q   e c u v 9 / 32

  13. Introduction Axioms Modularity Adaptive Modularity Conclusion Basic axioms Axiom 3: Richness A graph clustering objective function Q is rich if • for all sets V and • all partitions C ∗ of V , there is • a graph G = ( V , E ) • such that C ∗ is the optimal clustering of G . Intuition: • No trivial objective functions. • No fixed number of clusters. 10 / 32

  14. Introduction Axioms Modularity Adaptive Modularity Conclusion Basic axioms Definition (Consistent improvement) Let • G = ( V , E ) and G ′ = ( V , E ′ ) be graphs, and • C be a clustering of G and G ′ . Then G ′ is a C-consistent improvement of G if • E ′ ( i , j ) ≥ E ( i , j ) for all i ∼ C j and • E ′ ( i , j ) ≤ E ( i , j ) for all i �∼ C j . Intuition: • Consistent improvements make a clustering fit better. 11 / 32

  15. Introduction Axioms Modularity Adaptive Modularity Conclusion Basic axioms Axiom 4: Monotonicity A graph clustering objective function Q is monotonic if • for all graphs G , • all clusterings C of G and • all C -consistent improvements G ′ of G it is the case that Q ( G ′ , C ) ≥ Q ( G , C ). Example     b d b d  a  a  ≥ Q Q  e e c c 12 / 32

  16. Introduction Axioms Modularity Adaptive Modularity Conclusion Local changes Definition (agreement) Let • G 1 = ( V 1 , E 1 ) and G 2 = ( V 2 , E 2 ) be two graphs and • V a ⊆ V 1 ∩ V 2 . The graphs agree on V a if E 1 ( i , j ) = E 2 ( i , j ) for all i , j ∈ V a . Definition (agreement on neighborhood) The graphs also agree on the neighborhood of V a if E 1 ( i , j ) = E 2 ( i , j ) for all i ∈ V a , j ∈ V 1 ∩ V 2 , and E 1 ( i , j ) = 0 for all i ∈ V a , j ∈ V 1 \ V 2 , and E 2 ( i , j ) = 0 for all i ∈ V a , j ∈ V 2 \ V 1 . What this means: • For nodes/clusters in V a , all incident edges are the same. 13 / 32

  17. Introduction Axioms Modularity Adaptive Modularity Conclusion Local changes Definition (agreement) Let • G 1 = ( V 1 , E 1 ) and G 2 = ( V 2 , E 2 ) be two graphs and • V a ⊆ V 1 ∩ V 2 . The graphs agree on V a if E 1 ( i , j ) = E 2 ( i , j ) for all i , j ∈ V a . Definition (agreement on neighborhood) The graphs also agree on the neighborhood of V a if E 1 ( i , j ) = E 2 ( i , j ) for all i ∈ V a , j ∈ V 1 ∩ V 2 , and E 1 ( i , j ) = 0 for all i ∈ V a , j ∈ V 1 \ V 2 , and E 2 ( i , j ) = 0 for all i ∈ V a , j ∈ V 2 \ V 1 . What this means: • For nodes/clusters in V a , all incident edges are the same. 13 / 32

  18. Introduction Axioms Modularity Adaptive Modularity Conclusion Local changes Axiom 5: Locality A graph clustering objective function Q is local if • for all graphs G 1 = ( V 1 , E 1 ) and G 2 = ( V 2 , E 2 ) that agree on a set V a and its neighborhood, • for all clusterings C 1 of V 1 \ V a , C 2 of V 2 \ V a and C a , D a of V a . if Q ( G 1 , C a ∪ C 1 ) ≥ Q ( G 1 , D a ∪ C 1 ) then Q ( G 2 , C a ∪ C 2 ) ≥ Q ( G 2 , D a ∪ C 2 ). 14 / 32

  19. Introduction Axioms Modularity Adaptive Modularity Conclusion Local changes Example     · · b b a a Q ≥ Q     · · · · c c �     · · b b a a     Q ≥ Q  · ·   · ·  c c         15 / 32

  20. Introduction Axioms Modularity Adaptive Modularity Conclusion Local changes Special cases • G 1 = G 2 : change part of a clustering. In practice: optimize parts separately (divide and conquer). • V a = ∅ : union of two disjoint graphs. 16 / 32

  21. Introduction Axioms Modularity Adaptive Modularity Conclusion Interlude: Related work Theorem (Kleinberg 2002) There is no clustering function that is permutation invariant, scale invariant, monotonic and rich. Theorem (Ackerman, Ben-David 2008) There is a clustering quality function that is permutation invariant, scale invariant, monotonic and rich. 17 / 32

  22. Introduction Axioms Modularity Adaptive Modularity Conclusion Discontinuity is magic Theorem There is a graph clustering function that is scale invariant, permutation invariant, monotonic, rich and local. Connected components f coco ( G ) = the connected components of G Q coco ( G , C ) = 1 [ C are the connected components of G ] Huh!?!? • Doesn’t this contradict Kleinberg’s theorem? • No: edge weight 0 = distance ∞ . 18 / 32

  23. Introduction Axioms Modularity Adaptive Modularity Conclusion Discontinuity is magic Theorem There is a graph clustering function that is scale invariant, permutation invariant, monotonic, rich and local. Connected components f coco ( G ) = the connected components of G Q coco ( G , C ) = 1 [ C are the connected components of G ] Huh!?!? • Doesn’t this contradict Kleinberg’s theorem? • No: edge weight 0 = distance ∞ . 18 / 32

Recommend


More recommend