robust estimation of precision matrices under cellwise
play

Robust estimation of precision matrices under cellwise contamination - PowerPoint PPT Presentation

Scale Covariance Multivariate Conclusion Robust estimation of precision matrices under cellwise contamination Garth Tarr, Samuel M uller and Neville Weber COMPSTAT 2014 Scale Covariance Multivariate Conclusion Outline Robust Scale


  1. Scale Covariance Multivariate Conclusion Robust estimation of precision matrices under cellwise contamination Garth Tarr, Samuel M¨ uller and Neville Weber COMPSTAT 2014

  2. Scale Covariance Multivariate Conclusion Outline Robust Scale Estimator, P n Covariance Covariance Matrix Autocovariance Long Range Short Range Inverse Covariance Dependence Dependence Matrix Estimation

  3. Scale Covariance Multivariate Conclusion Outline Robust scale estimation with P n Robust pairwise covariance estimation Robust covariance and precision matrices Summary and key references

  4. Scale Covariance Multivariate Conclusion Pairwise mean scale estimator: P n • Consider the U -statistic, based on the pairwise mean kernel, ◆ − 1 X ✓ n X i + X j U n ( X ) := . 2 2 i<j • Let H ( t ) = P (( X i + X j ) / 2 ≤ t ) be the cdf of the kernels with corresponding empirical distribution function, ◆ − 1 X ✓ n ⇢ X i + X j � H n ( t ) := ≤ t , for t ∈ R . I 2 2 i<j Definition (Interquartile range of pairwise means) ⇥ H − 1 (0 . 75) − H − 1 ⇤ P n = c (0 . 25) , n n where c ≈ 1 . 048 is a correction factor to ensure P n is consistent for the standard deviation when the underlying observations are Gaussian.

  5. Scale Covariance Multivariate Conclusion Outline Robust scale estimation with P n Robust pairwise covariance estimation Robust covariance and precision matrices Summary and key references

  6. Scale Covariance Multivariate Conclusion From scale to covariance: the GK device • Gnanadesikan and Kettenring (1972) relate scale and covariance using the following identity, 1 cov( X, Y ) = 4 ↵� [var( ↵ X + � Y ) − var( ↵ X − � Y )] , where X and Y are random variables. • In general, X and Y can have di ff erent units, so we set p p ↵ = 1 / var( X ) and � = 1 / var( Y ) . • Replacing variance with P 2 n we can similarly construct, 1 ⇥ P 2 n ( ↵ X + � Y ) − P 2 ⇤ � P ( X, Y ) = n ( ↵ X − � Y ) , 4 ↵� where ↵ = 1 /P n ( X ) and � = 1 /P n ( Y ) .

  7. Scale Covariance Multivariate Conclusion Outline Robust scale estimation with P n Robust pairwise covariance estimation Robust covariance and precision matrices Summary and key references

  8. Scale Covariance Multivariate Conclusion Estimating dependence Problem: To estimate dependence in multivariate settings with cellwise contamination.

  9. Scale Covariance Multivariate Conclusion 100% 100 75% Rows contaminated Observations 50% 50 p=30 25% 1 0% 1 15 30 0% 5% 10% Variables Cells contaminated For details see Alqallaf et al. (2009).

  10. Scale Covariance Multivariate Conclusion Estimating dependence Problem: To estimate dependence in multivariate settings with cellwise contamination. Solution: 1. pairwise covariance matrices 2. 3.

  11. Scale Covariance Multivariate Conclusion Pairwise approach to the rescue? 100% 100 75% Rows contaminated Observations 50% 50 p=30 p=2 25% 1 0% 1 2 0% 5% 10% Variables Cells contaminated

  12. Scale Covariance Multivariate Conclusion Estimating dependence Problem: To estimate dependence in multivariate settings with cellwise contamination. Solution: 1. pairwise covariance matrices 2. correct for positive definiteness 3.

  13. Scale Covariance Multivariate Conclusion Positive definite? • Standard approach of Maronna and Zamar (2002) su ff ers from outlier propagation so fails for cellwise contamination. • Higham (2002) outlines the nearest positive definite (NPD) approach: 1. Perform a spectral decomposition of the symmetric matrix of pairwise covariances 2. Any negative eigenvalues are set to some small positive constant 3. Reconstruct the covariance matrix using the adjusted eigenstructure ! NPD approach produces poorly conditioned covariance matrices.

  14. Scale Covariance Multivariate Conclusion Estimating dependence Problem: To estimate dependence in multivariate settings with cellwise contamination. Solution: 1. pairwise covariance matrices 2. correct for positive definiteness 3. regularisation procedure

  15. Scale Covariance Multivariate Conclusion Precision matrices • In many practical applications, the covariance matrix is not what is really required. • PCA, Mahalanobis distance, LDA, etc. use the inverse covariance matrix: the precision matrix, Θ = Σ − 1 . • Precision matrices are also of interest in modelling Gaussian Markov random fields, where zeros in the correspond to conditional independence between variables. Sparsity! In many applications, it is often useful to impose a level of sparsity on the estimated precision matrix.

  16. Scale Covariance Multivariate Conclusion Regularisation techniques Graphical lasso (glasso) (Friedman, Hastie, Tibshirani, 2007) minimises the penalised negative Gaussian log-likelihood: f ( Θ ) = tr( ˆ ΣΘ ) − log | Θ | + � || Θ || 1 , where || Θ || 1 is the L 1 norm and � is a tuning parameter for the amount of shrinkage. Quadratic Inverse Covariance (QUIC) (Hsieh, et. al. 2011) solves the same minimisation problem as the glasso but uses a second order approach. Constrained ` 1 -minimisation for inverse matrix estimation (CLIME) (Cai, Liu and Luo, 2011) solves the following objective function: subject to: || ˆ min || Θ || 1 ΣΘ − I || ∞ ≤ � .

  17. Scale Covariance Multivariate Conclusion Estimating dependence Problem: To estimate dependence in multivariate settings with cellwise contamination. Solution: 1. pairwise covariance matrices 2. correct for positive definiteness 3. regularisation procedure Evaluation: • Is the estimate “close” to the truth? •

  18. Scale Covariance Multivariate Conclusion Simulation design sample size n = 100 replications N = 100 variables p = 30 , 60 , 90 scenarios banded, sparse and dense precision matrices, Θ true data N ( 0 , Σ ) where Σ = Θ − 1 contamination 0% to 25% randomly scattered component-wise 1. Banded 2. Sparse 3. Dense

  19. Scale Covariance Multivariate Conclusion 10% contamination Extreme Moderate 20 20 10 10 X 2 X 2 0 0 -10 -10 -20 -20 -20 -10 0 10 20 -20 -10 0 10 20 X 1 X 1

  20. Scale Covariance Multivariate Conclusion Evaluating performance Entropy Loss Measures how “close” ˆ Θ is to Θ , Θ ) = tr( Θ − 1 ˆ Θ ) − log | Θ − 1 ˆ L ( Θ , ˆ Θ | − p. Reported as percentage relative improvement in average loss: Θ ) = L ( Θ , ˆ Θ 0 ) − L ( Θ , ˆ Θ ) PRIAL ( ˆ × 100 , L ( Θ , ˆ Θ 0 ) where ˆ Θ 0 is the estimated precision matrix after a regularisation technique has been applied to the classical sample covariance matrix for uncontaminated data.

  21. Scale Covariance Multivariate Conclusion Changing dimension: p = 90 0 CLIME, banded, p = 90 Q n with NPD Entropy loss PRIAL -100 ⌧ with NPD P n with NPD MCD -200 P n with OGK Classical -300 0 5 10 15 20 25 Percent contamination in each variable

  22. Scale Covariance Multivariate Conclusion Changing dimension: p = 60 0 CLIME, banded, p = 60 Q n with NPD Entropy loss PRIAL -100 ⌧ with NPD P n with NPD MCD -200 P n with OGK Classical -300 0 5 10 15 20 25 Percent contamination in each variable

  23. Scale Covariance Multivariate Conclusion Changing dimension: p = 30 0 CLIME, banded, p = 30 Q n with NPD Entropy loss PRIAL -100 ⌧ with NPD P n with NPD MCD -200 P n with OGK Classical -300 0 5 10 15 20 25 Percent contamination in each variable

  24. Scale Covariance Multivariate Conclusion Changing regularisation routine: GLASSO 0 GLASSO, banded, p = 60 Q n with NPD Entropy loss PRIAL -100 ⌧ with NPD P n with NPD MCD -200 P n with OGK Classical -300 0 5 10 15 20 25 Percent contamination in each variable

  25. Scale Covariance Multivariate Conclusion Changing regularisation routine: QUIC 0 QUIC, banded, p = 60 Q n with NPD Entropy loss PRIAL -100 ⌧ with NPD P n with NPD MCD -200 P n with OGK Classical -300 0 5 10 15 20 25 Percent contamination in each variable

  26. Scale Covariance Multivariate Conclusion Changing experiment: banded 0 QUIC, banded, p = 90 Q n with NPD Entropy loss PRIAL -100 ⌧ with NPD P n with NPD MCD -200 P n with OGK Classical -300 0 5 10 15 20 25 Percent contamination in each variable

  27. Scale Covariance Multivariate Conclusion Changing experiment: sparse 0 QUIC, scattered, p = 90 Q n with NPD Entropy loss PRIAL -100 ⌧ with NPD P n with NPD MCD -200 P n with OGK Classical -300 0 5 10 15 20 25 Percent contamination in each variable

  28. Scale Covariance Multivariate Conclusion Changing experiment: dense 0 QUIC, dense, p = 30 Q n with NPD Entropy loss PRIAL -100 ⌧ with NPD P n with NPD MCD -200 P n with OGK Classical -300 0 5 10 15 20 25 Percent contamination in each variable

  29. Scale Covariance Multivariate Conclusion Estimating dependence Problem: To estimate dependence in multivariate settings with cellwise contamination. Solution: 1. pairwise covariance matrices 2. correct for positive definiteness 3. regularisation procedure Evaluation: • Is the estimate “close” to the truth? • Are we able to recover the support of a Gaussian graphical model?

  30. Scale Covariance Multivariate Conclusion QUIC, uncontaminated data, N = 100 replications True Θ Classic MCD 100 80 60 40 20 0 P n with NPD Q n with NPD ⌧ with NPD 100 80 60 40 20 0

Recommend


More recommend