empirical composite likelihoods
play

Empirical Composite Likelihoods Nicola Lunardon, Francesco Pauli, - PowerPoint PPT Presentation

Empirical Composite Likelihoods Nicola Lunardon, Francesco Pauli, Laura Ventura Dept. of Statistics, University of Padova, Italy email: ventura@stat.unipd.it N. Lunardon, F. Pauli, L. Ventura COMPSTAT2010 Paris, August 2227 2010 1/ 22


  1. Empirical Composite Likelihoods Nicola Lunardon, Francesco Pauli, Laura Ventura Dept. of Statistics, University of Padova, Italy email: ventura@stat.unipd.it N. Lunardon, F. Pauli, L. Ventura – COMPSTAT2010 – Paris, August 22–27 2010 1/ 22

  2. Outline • Composite likelihoods may be useful for approximating likelihood based inference when the full likelihood is too complex to deal with. • Stemming from a misspecified model, the asymptotic distribution of the composite likelihood ratio statistic departs from the familiar standard chi-square asymptotic distribution. • Several adjustments have been proposed in the literature, which all require the elements of the Godambe information. • This paper proposes and discusses a computationally and theoretically attractive approach based on the derivation of an empirical likelihood function from the composite score. • For the special case of the pairwise likelihood, our proposal can allow reference to the usual asymptotic chi-square distribution. N. Lunardon, F. Pauli, L. Ventura – COMPSTAT2010 – Paris, August 22–27 2010 2/ 22

  3. Composite likelihoods N. Lunardon, F. Pauli, L. Ventura – COMPSTAT2010 – Paris, August 22–27 2010 3/ 22

  4. Composite likelihood • Consider independent observations y i of a random vector Y i = ( Y i 1 , . . . , Y iq ) , i = 1 , . . . , n , with Y i ∼ f ( y i ; θ ) , R d , d ≥ 1 , y i ∈ Y . θ ∈ Θ ⊆ I • In some situations it may be difficult to evaluate f ( y ; θ ) and thus the full likelihood L ( θ ) . • However, suppose it may be possible to compute likelihood contributions L k ( θ ; y i ) = L ( θ ; A k ( y i )) , for the events A k ( y i ) , k = 1 , . . . , K , on Y . • The composite likelihood is then defined as (Lindsay 1988, Varin et al 2010) n K � � L k ( θ ; y i ) w k cL ( θ ; y ) = i =1 k =1 with w k positive weights. • Let cℓ ( θ ) = log cL ( θ ; y ) be the composite loglikelihood and let cU ( θ ) be the composite score function ( ∂/∂θ ) cℓ ( θ ) . N. Lunardon, F. Pauli, L. Ventura – COMPSTAT2010 – Paris, August 22–27 2010 4/ 22

  5. An example: The pairwise likelihood • When the events A k ( y i ) are defined in terms of pairs of observations ( y ir , y is ) from the bivariate marginal density f ( y ir , y is ; θ ) , the pairwise likelihood is obtained (Cox Reid 2004) q − 1 q n � � � pL ( θ ; y ) = f ( y ir , y is ; θ ) i =1 r =1 s = r +1 • The pairwise loglikelihood is q − 1 q n � � � pℓ ( θ ; y ) = log f ( y ir , y is ; θ ) i =1 r =1 s = r +1 • The pairwise score function is q − 1 q n ∂ � � � pU ( θ ; y ) = ∂θ log f ( y ir , y is ; θ ) i =1 r =1 s = r +1 N. Lunardon, F. Pauli, L. Ventura – COMPSTAT2010 – Paris, August 22–27 2010 5/ 22

  6. Composite likelihood: Properties • The validity of inference on θ using cL ( θ ; y ) can be justified invoking the theory of unbiased estimating functions. • Indeed, cU ( θ ; y ) is still an unbiased estimating function, since it is a linear combination of valid score functions. • The composite MLE ˆ θ c is consistent and approximately normal with mean θ and variance V ( θ ) = H ( θ ) − 1 J ( θ ) H ( θ ) − 1 T ) and J ( θ ) = E ( cU ( θ ) cU ( θ ) T ) . with H ( θ ) = E ( − ∂cU ( θ ) /∂θ • Matrix G ( θ ) = V ( θ ) − 1 is the Godambe information. N. Lunardon, F. Pauli, L. Ventura – COMPSTAT2010 – Paris, August 22–27 2010 6/ 22

  7. First order asymptotics • The asymptotic distribution of the Wald-type statistic cw w ( θ ) = (ˆ T G ( θ )(ˆ θ c − θ ) is χ 2 θ c − θ ) d . The same result holds T J ( θ ) − 1 cU ( θ ) . for the score-type statistic cw s ( θ ) = cU ( θ ) • Let cw ( θ ) = 2( cℓ (ˆ θ c ) − cℓ ( θ )) be the composite likelihood ratio statistic. • Its asymptotic null distribution is d � λ i Z 2 cw ( θ ) ˙ ∼ i i =1 with Z 2 i independent χ 2 1 random variables and λ i eigenvalues of H ( θ ) − 1 J ( θ ) . • All the above results extend to the case of partial interest about ψ , with θ = ( ψ, λ ) . N. Lunardon, F. Pauli, L. Ventura – COMPSTAT2010 – Paris, August 22–27 2010 7/ 22

  8. Adjustments of composite likelihood ratios: Why needed? • Wald-type statistics lack invariance under reparameterization and force confidence regions to have an elliptical shape. • Score-type statistics seem to suffer from numerical instability (Molenberghs Verbeke 2005, Ch. 9) . • Under this respect, a likelihood ratio type statistic would be more appealing. • However, its approximate � λ i Z 2 i distribution departs from the familiar pivot result. This calls for adjustments in order to obtain the standard χ 2 d distribution: ◮ For d = 1 , most proposed adjustments agree and lead to the exact asymptotic reference. ◮ For d > 1 , some adjustments are not parameterization invariant or only match some moments of the asymptotic reference. • All the adjustments require the evaluation of H ( θ ) and J ( θ ) . N. Lunardon, F. Pauli, L. Ventura – COMPSTAT2010 – Paris, August 22–27 2010 8/ 22

  9. Adjustments of composite likelihood ratios: Available solutions • Simple adjustments are based on moments conditions: 1. First order moment matching gives cw 1 ( θ ) = cw ( θ ) / ˜ λ , with λ = � λ i /d = tr ( H ( θ ) − 1 J ( θ )) /d , with a χ 2 ˜ d approximate null distribution. 2. First and second order moment matching gives the Satterthwaite (1946) adjustment cw 2 ( θ ) = cw ( θ ) /κ , with a χ 2 approximate null distribution, where κ = � λ 2 i / � λ i and ν ν = ( � λ i ) 2 / ( � λ 2 i ) . 3. Matching of moments up to higher order are available (see Lindsay et al 2000) . • Chandler and Bate (2007) propose a vertical scaling of cw ( θ ) giving cw cb ( θ ) = cw ( θ ) cw w ( θ ) / (ˆ T H (ˆ θ c )(ˆ θ c − θ ) θ c − θ ) having χ 2 d null distribution, but which is not parameterization invariant. • Pace et al (2010) propose the parameterization invariant scaling T H ( θ ) − 1 cU ( θ ) also having the cw inv ( θ ) = cw ( θ ) cw s ( θ ) /cU ( θ ) usual asymptotic null distribution. N. Lunardon, F. Pauli, L. Ventura – COMPSTAT2010 – Paris, August 22–27 2010 9/ 22

  10. Empirical likelihood from the composite score function N. Lunardon, F. Pauli, L. Ventura – COMPSTAT2010 – Paris, August 22–27 2010 10/ 22

  11. Empirical likelihood • We can define an empirical likelihood based on a general R d : unbiased estimating equation for θ ∈ I m η ( y ; θ ) = 1 � η j ( Y j ; θ ) = 0 , with Y j ⊂ Y m j =1 • The empirical likelihood is defined as (Owen 2001) m L e ( θ ) = 1 1 � (1 + λ T η j ( Y j ; θ )) m j =1 where the Lagrangian multiplier λ satisfies η j ( Y j ; θ ) (1 /m ) � m (1+ λ T η j ( Y j ; θ )) = 0 j =1 • The empirical likelihood ratio statistic for θ derived from η ( y ; θ ) is m � T η ( Y j ; θ )) w e ( θ ) = 2 log(1 + λ j =1 N. Lunardon, F. Pauli, L. Ventura – COMPSTAT2010 – Paris, August 22–27 2010 11/ 22

  12. Empirical composite likelihood ratio statistic • The empirical composite likelihood ratio statistic derived from η ( y ; θ ) = cU ( θ ) is K � T cU ( θ ; A k )) cw e ( θ ) = 2 log(1 + λ k =1 • Under suitable conditions (see Adimari and Guolo 2010) it can be shown that: 1. When d = 1 , cw e ( θ ) / ˜ λ ˙ ∼ χ 2 1 . 2. When d > 1 , the asymptotic null distribution of cw e 1 ( θ ) = cw e ( θ ) / ˜ λ can be approximated with a χ 2 d (as for cw 1 ( θ ) ). • These results hold also for the pairwise score function K � pU ( θ ) = pU ( θ ; A k ) k =1 with K = nq ( q − 1) / 2 , obtaining pw e 1 ( θ ) . N. Lunardon, F. Pauli, L. Ventura – COMPSTAT2010 – Paris, August 22–27 2010 12/ 22

  13. Empirical likelihood from the pairwise score • Let us focus on the pairwise likelihood function. • The pairwise score function with K = n can be written with q − 1 q ∂ � � pU ( θ ; y i ) = ∂θ log f ( y is , y ir ; θ ) s =1 r = s +1 • The pairwise empirical likelihood ratio is n � T pU ( θ ; y i )) pw e ( θ ) = 2 log(1 + λ i =1 ∼ χ 2 • In this situation, we have pw e ( θ ) ˙ d (the proof follows from Adimari and Guolo 2010 ). N. Lunardon, F. Pauli, L. Ventura – COMPSTAT2010 – Paris, August 22–27 2010 13/ 22

  14. Simulation results N. Lunardon, F. Pauli, L. Ventura – COMPSTAT2010 – Paris, August 22–27 2010 14/ 22

  15. Example 1: Equicorrelated multivariate normal data • One-way normal-theory random effects model: Y ir = µ + ξ i + ǫ ir , i = 1 , . . . , n , r = 1 , . . . , q , and ξ i and ǫ ir independently normally distributed with zero mean and variances σ 2 ξ and σ 2 ǫ . • The problem can be reformulated by writing Y i as a multivariate normal with components having mean µ and variance σ 2 = σ 2 ξ + σ 2 ǫ , and with correlation ρ = σ 2 ξ / ( σ 2 ξ + σ 2 ǫ ) between any two components of the same vector. • This example has been chosen so that we can easily do closed form calculations both of complete and pairwise likelihood quantities, and not for direct interest in the application of composite likelihood. • The special case with µ = 0 , σ 2 = 1 and θ = ρ has been treated in detail by Cox Reid (2004) . • Here interest on inference about θ = ( µ, σ 2 , ρ ) . N. Lunardon, F. Pauli, L. Ventura – COMPSTAT2010 – Paris, August 22–27 2010 15/ 22

Recommend


More recommend