big data management analytics
play

Big Data Management & Analytics EXERCISE 9 SVD, CUR 11th of - PowerPoint PPT Presentation

Big Data Management & Analytics EXERCISE 9 SVD, CUR 11th of January, 2016 Sabrina Friedl LMU Munich 1 PCA REVISION 2 PCA Summary 3 Goals of PCA Detect hidden correla3ons Remove redundant and noisy features


  1. Big Data Management & Analytics EXERCISE 9 – SVD, CUR 11th of January, 2016 Sabrina Friedl LMU Munich 1

  2. PCA REVISION 2

  3. PCA – Summary 3

  4. Goals of PCA ◦ Detect hidden correla3ons ◦ Remove redundant and noisy features ◦ Interpreta3on and visualiza3on ◦ Easier storage and processing of dat -> Most helpful when there is a linear rela3onship between observed and hidden variables d=2 d=3 4

  5. Problems with PCA When applying PCA to a dataset of unknown structure 1. Unnormalized data can skew the result -> before PCA, norm the data! 2. Relevant structures might get lost 5

  6. Problems with PCA 3. Outliers can skew the PCA result 6

  7. Single Value Decomposition (SVD) REVISION AND EXERCISE 7

  8. SVD Any matrix X can be wriSen as (singular value decomposi3on) ◦ X Data matrix (n x d) ◦ V Right singular vectors: eigenvectors of ◦ U LeX-singular vectors of X: eigenvectors of ◦ Σ Singular Values: square roots of eigenvalues (elements on diagonal) hSps://de.wikipedia.org/wiki/Singul%C3%A4rwertzerlegung Usage example: Image compression 8

  9. SVD n x d n x n n x d d x d 9

  10. SVD- How to find matrices? Remember the Eigenwertproblem: v = eigenvector or λ = eigenvalue T = eigenvector matrix Λ diagonal eigenvalue matrix For ◦ Find V: ◦ Find U: or use: 10

  11. SVD - Example Given Matrix M Eigenvalues: Eigenpairs Eigenvectors : 11

  12. SVD - Example Eigenvalue decomposi3on Now we already know: 12

  13. SVD - Example Note: At this point we could write the SVD as follows: u 1 , u 2 and u 3 must build an How to find u 3 ? orthonormal basis! 13

  14. SVD - Example One-dimensional approxima3on of matrix M Recommended further reading: hSp://www.ams.org/samplings/feature-column/fcarc-svd 14

  15. CUR REVISION AND EXERCISE 15

  16. CUR Alterna3ve to SVD, which beSer respects the structure of the data 16

  17. Example Find CUR-decomposi3on of the given matrix with two rows and two columns! Sample size r = 2 Steps 1. Create sample matrices C and R 2. Construct U from C and R 17

  18. 1a. Create sample matrix C 18

  19. 1a. Create sample matrix C = 3 * 51 + 2*45 19

  20. 1a. Create sample matrix C 20

  21. 1b. Create sample matrix R 21

  22. 1b. Create sample matrix C Row 5 * Row 6 * 22

  23. 2. Construct U from C and R a) Create r x r matrix W as intersec3on of C and R b) Apply SVD on c) Compute as the pseudoinverse of d) Compute 23

  24. 2. Construct U from C and R a) Create matrix W: b) Apply SVD on W: c) Pseudo-Inverse of : d) Calculate 24

  25. Result of CUR decomposition 25

Recommend


More recommend