germain van bever joint work with b li h oja f critchley
play

Germain Van Bever (joint work with B. Li, H. Oja, F. Critchley and R. - PowerPoint PPT Presentation

Germain Van Bever (joint work with B. Li, H. Oja, F. Critchley and R. Sabolova) Universit de Namur IWAFDA (III), May 24th, 2019 The cocktail party effect 1 / 21 Departing from elliptical symmetry PCA fails to uncover hidden structure...


  1. Germain Van Bever (joint work with B. Li, H. Oja, F. Critchley and R. Sabolova) Université de Namur IWAFDA (III), May 24th, 2019

  2. The cocktail party effect 1 / 21

  3. Departing from elliptical symmetry PCA fails to uncover hidden structure... since emphasis is on another dispersion measure. 2 / 21

  4. Departing from elliptical symmetry PCA fails to uncover hidden structure... since emphasis is on another dispersion measure. 3 / 21

  5. Departing from elliptical symmetry PCA fails to uncover hidden structure... since emphasis is on another dispersion measure. 4 / 21

  6. Functional example: Australian weather data 150 Precipitations (mm) 100 50 0 0 100 200 300 day Figure: Australian weather data. Daily measurements in 190 weather stations from 1840 to 1990 (length of records vary from one station to the other). 5 / 21

  7. Australian weather data: principal components 50 0.10 40 0.05 PCA eigenfunctions Frequency 30 0.00 20 -0.05 10 -0.10 0 0 90 180 270 360 -1 0 1 2 3 4 5 PC1 scores Figure: Left: First four (from thickest to thinnest) principal eigenfunctions. Right: Histogram of the first principal score. 6 / 21

  8. Australian weather data: principal clustering 7 / 21

  9. The IC model ◮ X follows an Independent Component Model ( X ∼ IC ( Z ) ) if X = Ω Z, for Z = ( Z 1 , . . . , Z p ) T with independent marginals and Ω a nonsin- gular p × p mixing matrix. ◮ IC models ⊂ BSS models ◮ Independent component analysis (ICA): Find (one) unmixing matrix Γ such that Γ X has independent marginals. ◮ If Z has at most one gaussian marginal, then Γ = Ω − 1 up to permu- tation and multiplication by a diagonal matrix. 8 / 21

  10. Whitening Proposition . Write X st = Σ − 1 / 2 X, where Σ − 1 / 2 Let X ∼ IC(Z) and let Σ = Var ( X ) is the symmetric inverse square root of Σ . Then, Z = U T X st = U T Σ − 1 / 2 X for some orthogonal matrix U = ( u 1 · · · u p ) . ◮ ICA problems are thus often reduced to the estimation of an orthog- onal matrix after whitening the distribution. 9 / 21

  11. The FOBI procedure in a nutshell ( X T X ) XX T � � Let Σ 0 ( X ) = Var ( X ) and let Σ 1 ( X ) = E . If Z has inde- pendent components, then Σ 1 ( Z ) is diagonal. Theorem Let X ∼ IC ( Z ) . Assume the components of Z have finite and dis- tinct kurtoses. Let V Λ V T be the eigendecomposition of Σ 1 ( X st ) . Then, V T X st = V T Σ − 1 / 2 X has independent components. 0 10 / 21

  12. Functional data Observations are typically assumed to belong to H = L 2 ( I ) � allows general treatment of FOBI in a Hilbert space H . Extension to a generic H faces three hurdles: ◮ No notion of marginals in general. ◮ No standardization possible: Σ − 1 / 2 X does not exist. 0 ◮ Intrinsic limit to knowledge requires regularization. 11 / 21

  13. Functional IC model ◮ Assume X ( n ) resides in a finite-dimensional subspace H n of H . ◮ The dimension m n of H n is thought to increase to infinity with n . ◮ The subspaces H n are assumed to be nested in n . Definition Suppose E || X || 2 H n < ∞ and let { f i : i ∈ N 0 } be a Σ 0 -ONB. We say that X follows a functional independent component model with respect to a functional operator Γ ∈ B ( H n ) if the sequence of random variables {� Γ X, f i � : i ∈ N 0 } is independent. If this condition is satisfied then we write X ∼ FIC (Γ) . 12 / 21

  14. FOBI operator ( X ⊗ X ) 2 � � Define the FOBI operator Σ 1 ( X ) as E . Theorem If E � X � 4 H n < ∞ , then Σ 1 ( X ) is a trace-class operator and is unitarily equivariant, that is Σ 1 ( UX ) = U Σ 1 ( X ) U ∗ for any operator satisfying UU ∗ = U ∗ U = I . 13 / 21

  15. Fisher consistency Assume (a) X ∈ H n with E � X � 4 < ∞ and { f i : 1 , · · · , m n } is a Σ 0 -ONB; (b) X ∼ FIC (Γ) for some Γ ∈ B ( H n ) ; and (c) kurt ( � Γ X, f i � ) , i = 1 , . . . , m n are all distinct. i =1 τ i ( h i ⊗ h i ) be the spectral decomposition of Σ 1 (Σ − 1 / 2 Let � m n X ) . Let 0 m n � V = ( h i ⊗ f i ) . i =1 Then {� V ∗ Σ − 1 / 2 X, f i � H n : i = 1 , . . . , m n } is independent. 0 Remark: Σ − 1 / 2 denotes the Moore-Penrose inverse operator of Σ 0 . 0 14 / 21

  16. Karhunen-Loève revisited ◮ Classical KL expansion: X = � � X, f i � H f i . ◮ Alternative expansion: X = � � V ∗ Σ − 1 / 2 X, g i � H g i . 0 ◮ In the latter, the (FOBI) coefficients are independent rather than only uncorrelated. ◮ The resulting transformation X �→ V ∗ Σ − 1 / 2 X is similarly denoted 0 FOBI ( X ) . ◮ X �→ FOBI ( X ) is affine-invariant 15 / 21

  17. Consistency Let λ m n denote the smallest eigenvalue of Σ 0 . Let F n,X denote the oper- ator Σ 1 (Σ − 1 / 2 X ) and ˆ F n,X denote its empirical version. 0 Theorem H n < ∞ , X 1 , . . . , X n are i.i.d X , and n − 1 / 7 ≺ λ m n � 1 , If lim sup n E � X � 8 then � ˆ F n,X − F n,X � OP = O P ( n − 1 / 2 λ − 7 / 2 m n ) . 16 / 21

  18. Functional example: Australian weather data 150 Precipitations (mm) 100 50 0 0 100 200 300 day Figure: Australian weather data. Daily measurements in 190 weather stations from 1840 to 1990 (length of records vary from one station to the other). 17 / 21

  19. Australian weather data: principal clustering 18 / 21

  20. Australian weather data: FOBI eigenfunctions 0.10 20 FOBI eigenfunctions 0.05 15 Frequency 0.00 10 -0.05 5 -0.10 0 0 90 180 270 360 -3 -2 -1 0 1 2 FFOBI 4 scores Figure: Left: Four functional FOBI eigenfunctions. Right: Histogram of the fourth FOBI score. 19 / 21

  21. Australian weather data: FOBI clustering 20 / 21

  22. References ◮ Cardoso, J. F. (1989), Source separation using higher order moments, Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing , Glasgow, UK, 2109-2112. ◮ Li, B., Van Bever, G., Oja, H., Sabolova, R. and Critchley, F., Func- tional independent component analysis: an extension of fourth order blind identification. Submitted. ◮ Ramsay, J. and Silverman, B. (2005), Functional Data Analysis , 2nd Ed., Springer-Verlag. ◮ Tyler, D., Critchley, F., Dümbgen, L. and Oja, H. (2009), Invariant coordinate selection. Journal of Royal Statistical Society B, 71 , 549- 592. Thank you for your attention. 21 / 21

Recommend


More recommend