manifold alignment of high dimensional datasets
play

Manifold Alignment of High- Dimensional Datasets Sridhar Mahadevan - PowerPoint PPT Presentation

Manifold Alignment of High- Dimensional Datasets Sridhar Mahadevan (PI) & Rui Wang (co-PI) Thomas Boucher, Clifton Carey, Stefan Dernbach, Blake Foster, Hoa Vu, Chang Wang (IBM) School of Computer Science University of Massachusetts,


  1. Manifold Alignment of High- Dimensional Datasets Sridhar Mahadevan (PI) & Rui Wang (co-PI) Thomas Boucher, Clifton Carey, Stefan Dernbach, Blake Foster, Hoa Vu, Chang Wang (IBM) School of Computer Science University of Massachusetts, Amherst Thursday, December 13, 12

  2. Learning from Multiple Datasets • In many applications, multiple “views” or multiple datasets are constructed • Bioinformatics • Activity recognition • Computer graphics • Scientific exploration (MARS rover) • Cross-lingual information retrieval • Spectral methods for learning latent variable models Thursday, December 13, 12

  3. Canonical Correlation Analysis (Hotelling, 1936) Acceleration MPG Displacement Horsepower Weight Thursday, December 13, 12

  4. Canonical Correlation Analysis (Hotelling, 1936) Acceleration MPG Displacement Horsepower Weight Thursday, December 13, 12

  5. Canonical Correlation Analysis (Hotelling, 1936) Acceleration MPG Displacement Horsepower Weight Thursday, December 13, 12

  6. Canonical Correlation Analysis (Hotelling, 1936) Acceleration MPG Displacement Horsepower Weight Find u,v that maximizes Thursday, December 13, 12

  7. Canonical Correlation Analysis (Hotelling, 1936) Acceleration MPG Displacement Horsepower Weight Find u,v that maximizes Thursday, December 13, 12

  8. Canonical Correlation Analysis (Hotelling, 1936) Acceleration MPG Pioneer of the first two statistics departments in the US! UNC, Chapel Hill Displacement Horsepower Weight Columbia University Find u,v that maximizes Thursday, December 13, 12

  9. FODAVA project: main contribution • We developed a new class of methods, called manifold alignment, that outperforms CCA in many domains • Linear + Nonlinear • Local + Global • Supervised + Unsupervised • If you use multiple datasets, you should try manifold alignment! Thursday, December 13, 12

  10. Manifold Projections Thursday, December 13, 12

  11. Manifold Projections We want to find mapping functions α , β to minimize the cost function C ( α , β ) , where ∑∑ ∑ ∑ T T 2 i , j T T 2 i , j T T 2 i , j C ( α , β ) = µ ( α x − β y ) W + 0 . 5 ( α x − α x ) W + 0 . 5 ( β y − β y ) W i j i j x i j y i j i , j i , j Thursday, December 13, 12

  12. Manifold Projections We want to find mapping functions α , β to minimize the cost function C ( α , β ) , where ∑∑ ∑ ∑ T T 2 i , j T T 2 i , j T T 2 i , j C ( α , β ) = µ ( α x − β y ) W + 0 . 5 ( α x − α x ) W + 0 . 5 ( β y − β y ) W i j i j x i j y i j i , j i , j Thursday, December 13, 12

  13. Manifold Projections We want to find mapping functions α , β to minimize the cost function C ( α , β ) , where ∑∑ ∑ ∑ T T 2 i , j T T 2 i , j T T 2 i , j C ( α , β ) = µ ( α x − β y ) W + 0 . 5 ( α x − α x ) W + 0 . 5 ( β y − β y ) W i j i j x i j y i j i , j i , j Thursday, December 13, 12

  14. Manifold Projections We want to find mapping functions α , β to minimize the cost function C ( α , β ) , where ∑∑ ∑ ∑ T T 2 i , j T T 2 i , j T T 2 i , j C ( α , β ) = µ ( α x − β y ) W + 0 . 5 ( α x − α x ) W + 0 . 5 ( β y − β y ) W i j i j x i j y i j i , j i , j Preserve correspondences Thursday, December 13, 12

  15. Manifold Projections We want to find mapping functions α , β to minimize the cost function C ( α , β ) , where ∑∑ ∑ ∑ T T 2 i , j T T 2 i , j T T 2 i , j C ( α , β ) = µ ( α x − β y ) W + 0 . 5 ( α x − α x ) W + 0 . 5 ( β y − β y ) W i j i j x i j y i j i , j i , j Preserve correspondences Thursday, December 13, 12

  16. Manifold Projections We want to find mapping functions α , β to minimize the cost function C ( α , β ) , where ∑∑ ∑ ∑ T T 2 i , j T T 2 i , j T T 2 i , j C ( α , β ) = µ ( α x − β y ) W + 0 . 5 ( α x − α x ) W + 0 . 5 ( β y − β y ) W i j i j x i j y i j i , j i , j Preserve correspondences Preserve local geometry Thursday, December 13, 12

  17. Manifold Projections We want to find mapping functions α , β to minimize the cost function C ( α , β ) , where ∑∑ ∑ ∑ T T 2 i , j T T 2 i , j T T 2 i , j C ( α , β ) = µ ( α x − β y ) W + 0 . 5 ( α x − α x ) W + 0 . 5 ( β y − β y ) W i j i j x i j y i j i , j i , j Preserve correspondences Preserve local geometry (2) Theorem 1 : α , β to minimize C ( α , β ) are given by the eigenvecto rs correspond ing to the smallest eigenvalue s of T T ZLZ γ = λ ZDZ γ . Thursday, December 13, 12

  18. Thursday, December 13, 12

  19. A Summary of Manifold Alignment Approaches Given Given Unsupervised correspondences labels alignment Preserve Local geometry Preserve Global geometry One-step alignment Two-step alignment Feature-level Instance-level Procrustes alignment Manifold Projections (MP) Extensions of MP Thursday, December 13, 12

  20. Manifold Warping (Hoa, Carey, Mahadevan: AAAI, 2012) Dynamic Time Warping + Iterate: • Find projection to lower-dimensional space • Find new set of correspondences Manifold Alignment Thursday, December 13, 12

  21. Activity Recognition CCA+DTW (Zhou, NIPS 2009) The resulted alignment path of manifold warping is much closer to the ground truth alignment Vu, Carey, and Mahadevan, AAAI 2012 Thursday, December 13, 12 • • • • • ’ •

  22. Social Network Alignment Sparse Manifold Alignment Use Lasso to find a sparse solution. DBLP Social Network • • • Wang, Liu, Vu, and Mahadevan, 2012 Thursday, December 13, 12 • • ’ •

  23. Cross-Lingual Transfer in IR Thursday, December 13, 12

  24. Cross-Lingual Transfer in IR Madam President, on a point of order. You will be aware from the press and television that there have been a number of bomb English explosions and killings in Sri Lanka. documents Signora Presidente, intervengo per una mozione d'ordine.Come avrà letto sui giornali o sentito alla Italian televisione, in Sri Lanka si sono verificati numerosi documents assassinii ed esplosioni di ordigni. Frau Präsidentin, zur Geschäftsordnung. German Wie Sie sicher aus der Presse und dem Fernsehen documents wissen, gab es in Sri Lanka mehrere Bombenexplosionen mit zahlreichen Toten. Thursday, December 13, 12

  25. Cross-Lingual Transfer in IR Madam President, on a point of order. You will be aware from the press and television that there have been a number of bomb English explosions and killings in Sri Lanka. documents Signora Presidente, intervengo per una mozione d'ordine.Come avrà letto sui giornali o sentito alla Italian televisione, in Sri Lanka si sono verificati numerosi documents assassinii ed esplosioni di ordigni. Frau Präsidentin, zur Geschäftsordnung. German Wie Sie sicher aus der Presse und dem Fernsehen documents wissen, gab es in Sri Lanka mehrere Bombenexplosionen mit zahlreichen Toten. Proceedings of the EU Thursday, December 13, 12

  26. Cross-lingual IR Thursday, December 13, 12

  27. Cross-lingual IR Thursday, December 13, 12

  28. Impact of Work Thursday, December 13, 12

  29. Impact of Work • The most useful research I have done in 20 years! Thursday, December 13, 12

  30. Impact of Work • The most useful research I have done in 20 years! • Led to several new collaborations Thursday, December 13, 12

  31. Impact of Work • The most useful research I have done in 20 years! • Led to several new collaborations • Mars rover Curiosity (Darby Dyar, Mount Holyoke, NASA/ JPL scientific team) Thursday, December 13, 12

  32. Impact of Work • The most useful research I have done in 20 years! • Led to several new collaborations • Mars rover Curiosity (Darby Dyar, Mount Holyoke, NASA/ JPL scientific team) • Proposals submitted to CDS&E and BIGDATA Thursday, December 13, 12

  33. Impact of Work • The most useful research I have done in 20 years! • Led to several new collaborations • Mars rover Curiosity (Darby Dyar, Mount Holyoke, NASA/ JPL scientific team) • Proposals submitted to CDS&E and BIGDATA • Papers: 100+ citations on Google Scholar Thursday, December 13, 12

  34. Impact of Work • The most useful research I have done in 20 years! • Led to several new collaborations • Mars rover Curiosity (Darby Dyar, Mount Holyoke, NASA/ JPL scientific team) • Proposals submitted to CDS&E and BIGDATA • Papers: 100+ citations on Google Scholar • Many many applications (bioinformatics, graphics, robotics, science, IR) Thursday, December 13, 12

Recommend


More recommend