diffusion maps and coarse graining
play

Diffusion Maps and Coarse-Graining: A unified framework for - PowerPoint PPT Presentation

Diffusion Maps and Coarse-Graining: A unified framework for dimensionality reduction, graph partitioning and data set parameterization Authors: Stephane Lafon and Ann B. Lee (PAMI, Sept. 2006) Presented by Shihao Ji Duke University Machine


  1. Diffusion Maps and Coarse-Graining: A unified framework for dimensionality reduction, graph partitioning and data set parameterization Authors: Stephane Lafon and Ann B. Lee (PAMI, Sept. 2006) Presented by Shihao Ji Duke University Machine Learning Group Sept. 28, 2006

  2. Outline • Diffusion distances and Maps • Graph partitioning and subsampling • Numerical examples

  3. Diffusion distances • Let be a finite graph with n nodes, and weight matrix W satisfies the following conditions: – symmetry: – positivity: i.e. Gaussian kernel • Markov random walk • t step random walk

  4. Diffusion distances (cont’d) • Definition where is the unique stationary distribution of P.

  5. Diffusion Maps • The transition matrix P is adjoint to a symmetric matrix thus, P and P s share the same eigenvalues. • Since P s is a symmetric matrix eigenvalues: eigenvectors: form the orthonormal basis.

  6. Diffusion Maps (cont’d) • The left and right eigenvectors of P : • Biorthogonal spectral decomposition • Diffusion distances

  7. Diffusion Maps (cont’d) • Diffusion Maps • Diffusion distances

  8. Graph partitioning • Consider an arbitrary partition

  9. Graph partitioning (cont’d) • Definition ( geometric centroid ): • Theorem : for , we have where and

  10. Graph partitioning (Cont’d) • This theorem tells us 1. If then and are approximate left and right eigenvectors of with approximate eigenvalue . 2. In order to maximize the quality of approximation, we need to minimize the following distortion in diffusion space: • This provides a rigorous justification for k-means clustering in diffusion space.

  11. Numerical examples • Diffusion distance vs. Euclidean distance The Swiss roll, and its quantization by k-means ( k =4)

  12. Numerical examples (cont’d) • Robustness of the diffusion distance Diffusion distance Dijkstra’s algorithm

  13. Numerical examples (cont’d) Averaged on 1000 instances

  14. Messages • Diffusion maps provide a unified framework for dimensionality reduction, graph partitioning and data set parameterization. • Coarse-graining gives a rigorous justification of k- means clustering in diffusion space. • Diffusion distance is robust to noise and small perturbations of the data.

Recommend


More recommend