an impossibility theorem
play

An Impossibility Theorem Seminar Algorithms Kevin Chang Eindhoven - PowerPoint PPT Presentation

An Impossibility Theorem Seminar Algorithms Kevin Chang Eindhoven University of Technology June 19, 2018 Contents Motivation Definitions The Impossibility Theorem Centroid-Based Clustering and Consistency Relaxing the Properties Table of


  1. An Impossibility Theorem Seminar Algorithms Kevin Chang Eindhoven University of Technology June 19, 2018

  2. Contents Motivation Definitions The Impossibility Theorem Centroid-Based Clustering and Consistency Relaxing the Properties

  3. Table of Contents Motivation Definitions The Impossibility Theorem Centroid-Based Clustering and Consistency Relaxing the Properties

  4. Motivation

  5. Motivation ◮ The mapper algorithm needs a ’good’ cluster algorithm

  6. Motivation Clustering : ◮ ‘Clustering’ cannot be precisely defined ◮ Intuitively, group set of objects that are ‘similar’. ◮ Unsupervised

  7. Motivation Clustering : ◮ ‘Clustering’ cannot be precisely defined ◮ Intuitively, group set of objects that are ‘similar’. ◮ Unsupervised Example:

  8. Motivation Clustering : ◮ ‘Clustering’ cannot be precisely defined ◮ Intuitively, group set of objects that are ‘similar’. ◮ Unsupervised Example:

  9. Motivation ◮ There exist no universal good clustering algorithm. ◮ Every clustering algorithm assumes a certain model. ◮ e.g. k-means tends to generate hyperspherical clusters.

  10. Motivation Example:

  11. Motivation Example: k-means

  12. Motivation Example: Single-link

  13. Motivation Example:

  14. Motivation Example: k-means

  15. Motivation Example: Single-link

  16. Motivation The idea of no universal clustering algorithm is partially captured by the impossibility theorem: ◮ There is no single clustering algorithm simultaneously satisfies a set of basic intuitive axioms of data clustering.

  17. Table of Contents Motivation Definitions The Impossibility Theorem Centroid-Based Clustering and Consistency Relaxing the Properties

  18. Definitions ◮ S is a set of n points

  19. Definitions ◮ S is a set of n points ◮ A distance function is any function d : S × S → R such that: ◮ For distinct i , j ∈ S , we have d ( i , j ) ≥ 0. ◮ d ( i , j ) = 0 iff i = j . ◮ d ( i , j ) = d ( j , i ) .

  20. Definitions ◮ S is a set of n points ◮ A distance function is any function d : S × S → R such that: ◮ For distinct i , j ∈ S , we have d ( i , j ) ≥ 0. ◮ d ( i , j ) = 0 iff i = j . ◮ d ( i , j ) = d ( j , i ) . ◮ A clustering function is any function f ( d ) that takes a distance function d , and returns a partition of Γ of S . ◮ Points are not assumed to belong to any ambient space. ◮ The sets in Γ will be called its clusters.

  21. Definitions Example: Set of points

  22. Definitions Example: Distance function

  23. Definitions Example: A partition of S, existing out of 3 clusters

  24. Table of Contents Motivation Definitions The Impossibility Theorem Centroid-Based Clustering and Consistency Relaxing the Properties

  25. Scale-Invariance Axiom 1: Scale-Invariance For any distance function d and any α > 0 , we have f ( d ) = f ( α · d ) ◮ i.e. cluster functions should not have a built-in ’length-scale’.

  26. Scale-Invariance

  27. Richness Let Range( f ) denote the set of all partitions Γ such that f ( d ) = Γ for some distance function d Axiom 2: Richness Range( f ) is equal to the set of all partitions of S . ◮ i.e. every partition of S is a possible output.

  28. Richness

  29. Consistency ◮ Let Γ be a partition of S , and d and d ’ two distance functions on S.

  30. Consistency ◮ Let Γ be a partition of S , and d and d ’ two distance functions on S. ◮ d ’ is a Γ -transformation of d if 1. for all i , j ∈ S belonging to the same cluster of Γ , we have d ’ ( i , j ) ≤ d ( i , j ) ; 2. for all i , j ∈ S belonging to different clusters of Γ , we have d ’ ( i , j ) ≥ d ( i , j )

  31. Consistency ◮ Let Γ be a partition of S , and d and d ’ two distance functions on S. ◮ d ’ is a Γ -transformation of d if 1. for all i , j ∈ S belonging to the same cluster of Γ , we have d ’ ( i , j ) ≤ d ( i , j ) ; 2. for all i , j ∈ S belonging to different clusters of Γ , we have d ’ ( i , j ) ≥ d ( i , j ) Axiom 3: Consistency Let d and d ’ be two distance functions. If f ( d ) = Γ , and d ’ is a Γ -transformation of d , then f ( d ’ ) = Γ ◮ i.e. Cluster stays the same after reducing the distance within cluster and enlarging distance between cluster.

  32. Consistency

  33. The Impossibility Theorem Theorem 2.1 For each n ≥ 2, there is no clustering function f that satisfies Scale-Invariance, Richness, and Consistency.

  34. Single-linkage ◮ Single-linkage is a family of clustering function. ◮ Initialize each point as its own cluster. ◮ Repeatedly merge pair of clusters whose distance to one another is minimum until a stopping condition is reached.

  35. Single-linkage Example:

  36. Single-linkage Example:

  37. Single-linkage Example:

  38. Single-linkage Example:

  39. Single-linkage Example:

  40. Single-linkage Example:

  41. Examples of Impossibility ◮ k-cluster stopping condition : Stop adding edges when there are k connected components.

  42. Examples of Impossibility ◮ k-cluster stopping condition : Stop adding edges when there are k connected components. ◮ For any k ≥ 1, and n ≥ k , this stopping condition satisfies Scale-Invariance and Consistency.

  43. Examples of Impossibility ◮ k-cluster stopping condition : Stop adding edges when there are k connected components. ◮ For any k ≥ 1, and n ≥ k , this stopping condition satisfies Scale-Invariance and Consistency.

  44. Examples of Impossibility ◮ k-cluster stopping condition : Stop adding edges when there are k connected components. ◮ For any k ≥ 1, and n ≥ k , this stopping condition satisfies Scale-Invariance and Consistency.

  45. Examples of Impossibility ◮ distance-r stopping condition : Only add edges of weight at most r .

  46. Examples of Impossibility ◮ distance-r stopping condition : Only add edges of weight at most r . ◮ For any r > 0, and any n ≥ 2, this stopping condition satisfies Richness and Consistency.

  47. Examples of Impossibility ◮ distance-r stopping condition : Only add edges of weight at most r . ◮ For any r > 0, and any n ≥ 2, this stopping condition satisfies Richness and Consistency. r

  48. Examples of Impossibility ◮ distance-r stopping condition : Only add edges of weight at most r . ◮ For any r > 0, and any n ≥ 2, this stopping condition satisfies Richness and Consistency. r

  49. Examples of Impossibility ◮ scale- α stopping condition : Let p ∗ denote the maximum pairwise distance. Add only edges of weight at most α p ∗

  50. Examples of Impossibility ◮ scale- α stopping condition : Let p ∗ denote the maximum pairwise distance. Add only edges of weight at most α p ∗ ◮ For any positive α < 1, and n ≥ 3, this stopping condition satisfies Scale-Invariance and Richness

  51. Examples of Impossibility ◮ scale- α stopping condition : Let p ∗ denote the maximum pairwise distance. Add only edges of weight at most α p ∗ ◮ For any positive α < 1, and n ≥ 3, this stopping condition satisfies Scale-Invariance and Richness p ∗ α p ∗

  52. Examples of Impossibility ◮ scale- α stopping condition : Let p ∗ denote the maximum pairwise distance. Add only edges of weight at most α p ∗ ◮ For any positive α < 1, and n ≥ 3, this stopping condition satisfies Scale-Invariance and Richness p* α p ∗

  53. The Impossibility Theorem Proof Intuition

  54. The Impossibility Theorem Proof First some notions. ◮ A partition Γ ’ is a refinement of a partition Γ if for every set C ’ ∈ Γ ’, there is a set C ∈ Γ such that C ’ ⊆ C . Partition Γ ’ Partition Γ

  55. The Impossibility Theorem Proof First some notions. ◮ A partition Γ ’ is a refinement of a partition Γ if for every set C ’ ∈ Γ ’, there is a set C ∈ Γ such that C ’ ⊆ C . ◮ A collection of partitions is an antichain if it does not contain two distinct partitions such that one is a refinement of the other. Partition Γ ’ Partition Γ

  56. The Impossibility Theorem Proof The impossibility result follows from: Theorem 3.1 If a clustering function f satisfies Scale-Invariance and Consistency, then Range( f ) is an antichain.

  57. The Impossibility Theorem Proof Some more notions needed to prove theorem 3.1: ◮ For a partition Γ a distance function d ( a , b ) -conforms to Γ if, ◮ for all pairs of points i , j that belong to the same cluster of Γ , we have d ( i , j ) ≤ a ◮ while all pairs of points i , j that belong to the different cluster of Γ , we have d ( i , j ) ≥ b

  58. The Impossibility Theorem Proof Some more notions needed to prove theorem 3.1: ◮ For a partition Γ a distance function d ( a , b ) -conforms to Γ if, ◮ for all pairs of points i , j that belong to the same cluster of Γ , we have d ( i , j ) ≤ a ◮ while all pairs of points i , j that belong to the different cluster of Γ , we have d ( i , j ) ≥ b Example: Partition Γ 5 3 d (3 , 5) -conforms to Γ

Recommend


More recommend