why p in fuzzy clustering
play

Why p in Fuzzy Clustering? Our Explanation Proof Kehinde Akinola, - PowerPoint PPT Presentation

Formulation of the . . . Why p in Fuzzy Clustering? Our Explanation Proof Kehinde Akinola, Ahnaf Farhan, and Vladik Kreinovich Home Page Title Page University of Texas at El Paso El Paso, TX 79968, USA


  1. Formulation of the . . . Why µ p in Fuzzy Clustering? Our Explanation Proof Kehinde Akinola, Ahnaf Farhan, and Vladik Kreinovich Home Page Title Page University of Texas at El Paso El Paso, TX 79968, USA ◭◭ ◮◮ kaakinola@miners.utep.edu, ◭ ◮ afarhan@miners.utep.edu, Page 1 of 8 vladik@utep.edu Go Back Full Screen Close Quit

  2. 1. Formulation of the Problem Formulation of the . . . • One of the main algorithms for clustering n given Our Explanation d -dimensional points: Proof – selects K “typical” values c k and – selects assignments k ( i ) for each i from 1 to n Home Page – so as to minimize the sum Title Page � ( x i − c k ( i ) ) 2 . ◭◭ ◮◮ i ◭ ◮ • This minimization is usually done iteratively. Page 2 of 8 • First, we pick c k and assign each point x i to the cluster k whose representative c k is the closest to x i . Go Back Full Screen • Then, we freeze k ( i ) and select new typical represen- tatives c k by minimizing the objective function. Close • This leads to c k being an average of all the points x i Quit assigned to the k -th cluster.

  3. 2. Formulation of the Problem (cont-d) Formulation of the . . . • Then, the procedure repeats again and again – until Our Explanation the process converges. Proof • In practice, for some objects, we cannot definitely as- sign them to a single cluster. Home Page • In such cases, it is reasonable to assign, to each object i , Title Page – degrees µ ik of belongs to different clusters k , ◭◭ ◮◮ – so that � µ ik = 1. ◭ ◮ k • In this case, it seems reasonable to take each term Page 3 of 8 ( x i − c k ) 2 with the weight µ ik . Go Back • In other words, it seems reasonable to find the values Full Screen µ ik and c k by minimizing the expression Close � µ ik · ( x i − c k ) 2 . Quit i,k

  4. 3. Formulation of the Problem (cont-d) Formulation of the . . . • It seems reasonable to minimize Our Explanation � µ ik · ( x i − c k ) 2 . Proof i,k • However, this expression is linear in µ ik . Home Page • It is known that the minimum of a linear function un- Title Page der linear constraints is always at a vertex. ◭◭ ◮◮ • Thus, the minimum is attained when one value µ ik is ◭ ◮ 1 and the rest are 0s. Page 4 of 8 • We want to come up with truly fuzzy clustering, with Go Back 0 < µ ik < 1 for some i and k . Full Screen • Thus, we need to replace the factor µ ik with a non- linear expression f ( µ ik ). Close f ( µ ik ) · ( x i − c k ) 2 . • Then, we minimize the expression � Quit i,k

  5. 4. Formulation of the Problem (cont-d) Formulation of the . . . • We minimize the expression Our Explanation Proof � f ( µ ik ) · ( x i − c k ) 2 . i,k • In practice, the functions f ( µ ) = µ p works the best. Home Page Title Page • Why? ◭◭ ◮◮ ◭ ◮ Page 5 of 8 Go Back Full Screen Close Quit

  6. 5. Our Explanation Formulation of the . . . • The weights µ ik are normalized so that their sum is 1. Our Explanation • So, if we delete some clusters or add more clusters, we Proof need to re-normalize these values. • A usual way to do it is to multiply them by a normal- Home Page ization constant c . Title Page • It is therefore reasonable to require that: ◭◭ ◮◮ – the relative quality of different clustering ideas ◭ ◮ – not change is we simply re-scale. Page 6 of 8 • This implies, e.g., that: Go Back – if f ( µ 1 ) · v 1 = f ( µ 2 ) · v 2 , Full Screen – then after re-scaling µ i → c · µ i , we should have Close f ( c · µ 1 ) · v 1 = f ( c · µ 2 ) · v 2 . Quit • We show that this condition implies that f ( µ ) = µ p .

  7. 6. Our Explanation (cont-d) Formulation of the . . . • Indeed, f ( c · µ 2 ) = f ( µ 2 ) f ( c · µ 1 ) = v 1 Our Explanation f ( µ 1 ) . v 2 Proof = f ( c · µ 1 ) = f ( c · µ 2 ) def • Thus r for all µ 1 and µ 2 . f ( µ 1 ) f ( µ 2 ) Home Page • So, the ratio r does not depend on µ : r = r ( c ), and Title Page f ( c · µ ) = r ( c ) · f ( µ ) . ◭◭ ◮◮ ◭ ◮ • It is known that the only continuous solutions of this functional equations are f ( µ ) = C · µ p . Page 7 of 8 • Minimization is not affected if we divide the objective Go Back function by C and get f ( µ ) = µ p . Full Screen Close Quit

  8. 7. Proof Formulation of the . . . • We can solve the equation f ( c · µ ) = r ( c ) · f ( µ ) when Our Explanation f ( µ ) is differentiable. Proof • Indeed, since f ( µ ) is differentiable, the ratio r ( c ) = f ( c · µ ) is also differentiable. Home Page f ( µ ) Title Page • If we differentiate both sides of the equation with re- spect to c , we get µ · f ′ ( c · µ ) = r ′ ( c ) · f ( µ ). ◭◭ ◮◮ • For c = 1, we get µ · d f def ◭ ◮ dµ = p · f , where p = r ′ (1). Page 8 of 8 • If we move all the terms containing f to one side and Go Back all others to another, we get d f = p · dµ f µ . Full Screen • Integrating, we get ln( f ) = p · ln( µ ) + c 1 . Close • If we apply exp to both sides, we get f ( µ ) = C · µ p , Quit where C = exp( c 1 ) .

Recommend


More recommend