fitting markovian binary trees using global and
play

Fitting Markovian binary trees using global and individual - PowerPoint PPT Presentation

Motivation in population biology The MBT model Global data Individual data Numerical results Fitting Markovian binary trees using global and individual population data Sophie Hautphenne * joint work with Melanie Massaro and Kate Turner *The


  1. Motivation in population biology The MBT model Global data Individual data Numerical results Fitting Markovian binary trees using global and individual population data Sophie Hautphenne * joint work with Melanie Massaro and Kate Turner *The University of Melbourne & EPFL MAM9 Budapest, 28–30 June 2016 1

  2. Motivation in population biology The MBT model Global data Individual data Numerical results Motivation in population biology 1 The MBT model 2 Global data 3 Individual data 4 Numerical results 5 2

  3. Motivation in population biology The MBT model Global data Individual data Numerical results Motivation : Modelling the black robin population The Chatham Islands Picture: Melanie Massaro 3

  4. Motivation in population biology The MBT model Global data Individual data Numerical results Saving the world’s most endangered bird By 1980, the population of black robins had declined to five birds, including only a single successful breeding pair. In 1980-1989, intensive conservation efforts helped the population recover to 93 birds by 1990 1 . In 1990-1998, the population was closely monitored, but without human intervention, and continued to grow rapidly to 197 adults by 1998. From 1999, the population growth slowed considerably and it only reached about 250 adults in 2014. 1. D. Butler and D. Merton. The Black Robin : Saving the World’s Most Endangered Bird . Oxford University Press, Auckland (1992) 4

  5. Motivation in population biology The MBT model Global data Individual data Numerical results Motivation : Modelling the black robin population We work in collaboration with field biologists who have been collecting raw data on the population for more than 30 years We want to fit branching processes to these data to make an age-specific demographic analysis of the population 5

  6. Motivation in population biology The MBT model Global data Individual data Numerical results Motivation in population biology 1 The MBT model 2 Global data 3 Individual data 4 Numerical results 5 6

  7. Motivation in population biology The MBT model Global data Individual data Numerical results Markovian binary trees : Trade-off between realism and tractability Linear birth-and-death process : Lifetimes follow an exponential distribution Reproduction occurs according to a Poisson process Not realistic enough ! Bellman-Harris branching processes : Lifetimes follow an arbitrary distribution Reproduction occurs according to a more general process Not tractable enough ! We consider an intermediate type of branching process, called the Markovian binary tree (MBT), which is at the same time very general and tractable. In an MBT, individuals’ lifetime is structured into phases. 7

  8. Motivation in population biology The MBT model Global data Individual data Numerical results Phase-structured lifetime 1 0 Parent 2 3 3 types of transitions: - ``Hidden’’ transitions - Birth - Death 1 Child 0 2 3 8

  9. Motivation in population biology The MBT model Global data Individual data Numerical results The individuals’ lifetime in an MBT Lifetime controlled by an underlying Markov process with n transient phases and one absorbing phase ; i → j i → k i → 0 i ✉ s α i ( D 0 ) ij d i j . . . B i , jk α : initial phase distribution (1 × n vector) ; D 0 : hidden phase transition rates ( n × n matrix) ; B : transition rates associated with a birth ( n × n 2 matrix) ; d : transition rates associated with the death ( n × 1 vector). n phases → n 3 + n 2 + n − 1 parameters ! 9

  10. Motivation in population biology The MBT model Global data Individual data Numerical results Simple structure 0 µ 1 µ 3 µ 2 Parent 1 2 3 ϒ 1 ϒ 2 λ 1 λ 2 λ 3 1 2 3 Child ϒ 1 ϒ 2 µ 2 µ 1 µ 3 0 Now, α 1 = 1 , ( D 0 ) i , i +1 = γ i , B i , 1 i = λ i , d i = µ i n phases → 3 n − 1 parameters γ 1 , . . . , γ n − 1 , λ 1 , . . . , λ n , µ 1 , . . . , µ n 10

  11. Motivation in population biology The MBT model Global data Individual data Numerical results Reproduction and lifetime in the simple MBT In this MBT model, reproduction events occur according to a transient Markov modulated Poisson process, the lifetime of an individual has a Coxian phase-type distribution. 11

  12. Motivation in population biology The MBT model Global data Individual data Numerical results Parameter fitting using population data Aim : to fit the parameters of the MBT to different types of population data sets, distinguishing between 1 global population data : averaged age-specific fertility and mortality rates over the whole population, or 2 individual population data : age-specific fertility and mortality counts per individual. The estimation method depends on the precision of the available data. We aim at choosing the optimal number of phases n using some validation methods. Once an estimator is known for the parameters, we derive confidence intervals for the model outcomes. 12

  13. Motivation in population biology The MBT model Global data Individual data Numerical results Motivation in population biology 1 The MBT model 2 Global data 3 Individual data 4 Numerical results 5 13

  14. Motivation in population biology The MBT model Global data Individual data Numerical results Global population data and non-linear regression Assume that we have estimates of the mean mortality and fertility rates at age x , denoted respectively by d x and b x , for ages x = 0 , 1 , . . . , M . These are our data points. Example : black robin data, close monitoring period 1990-2001 1 Mortality rate, d x 0.8 0.6 0.4 0.2 0 1 2 3 4 5 6 7 8 9 10 11 12 Age x 2 Fertility rate, b x 1.5 1 0.5 0 0 1 2 3 4 5 6 7 8 9 10 11 12 Age x 14

  15. Motivation in population biology The MBT model Global data Individual data Numerical results Mean age-specific fertility and mortality rates Let L be the lifetime of an individual, and D = D 0 +diag( λ ). We can show that the expressions for the mean mortality and fertility rates at age x in the MBT model are respectively given by P ( x < L ≤ x + 1 | L > x ) = α e Dx ( I − e D ) 1 ¯ d ( x ) = α e Dx 1 E ( N ([ x , x + 1)) | L > x ) = α e Dx ( I − e D )( − D ) − 1 λ ¯ β ( x ) = α e Dx 1 These functions are non-linear in both the input variable x and in the parameters. 15

  16. Motivation in population biology The MBT model Global data Individual data Numerical results Weighted non-linear least square estimates We want the same MBT model to fit both the age-specific fertility and mortality data. The 3 n − 1 parameters are estimated by minimizing the sum of weighted squared errors M d ( x )) 2 + ( β x − ¯ � ( d x − ¯ β ( x )) 2 � � F = S x , x =0 where S x = (1 − d 0 )(1 − d 1 ) · · · (1 − d x − 1 ) is the observed probability of survival until age x . 16

  17. Motivation in population biology The MBT model Global data Individual data Numerical results Motivation in population biology 1 The MBT model 2 Global data 3 Individual data 4 Numerical results 5 17

  18. Motivation in population biology The MBT model Global data Individual data Numerical results Individual population data and maximum likelihood Instead of using the mean age-specific mortality and fertility rates, we can do much better by directly exploiting the individual data counts We organise these data into samples of i.i.d. individual life vectors v ( i ) = [0 , − 2 , 0 , 2 , 6 , 2 , 2 , 5 , 0 , − 1 , − 1 , − 1 , − 1] , 1 ≤ i ≤ N , recording life and reproduction data for each age class (allowing for missing information) 18

  19. Motivation in population biology The MBT model Global data Individual data Numerical results Maximum likelihood estimation The log-likelihood function is N L ( θ ; v (1) , . . . , v ( N ) ) = � log p ( v ( j ) | θ ) , (1) j =1 where θ = { α , D 0 , λ , d } , and p ( v ( j ) | θ ) is the probability of observing v ( j ) , under the model parameter θ . The probabilities p ( v ( j ) | θ ) can be decomposed into products involving matrices P ( k ) for 1 ≤ k ≤ K , where P ij ( k ) = P [ N (1) = k , ϕ (1) = j | N (0) = 0 , ϕ (0) = i ] , and K = max i , j { v ( j ) : 1 ≤ i , 1 ≤ j ≤ N } . i 19

  20. Motivation in population biology The MBT model Global data Individual data Numerical results Computing P ( k ) (I) P ij ( k , t ) := P [ N ( t ) = k , ϕ ( t ) = j | N (0) = 0 , ϕ (0) = i ] P ( k , t ) = ( P ij ( k , t )) 1 ≤ i , j ≤ n P ( k , t ) = ∂ k P ∗ ( z , t ) � P ∗ ( z , t ) := � k ≥ 0 P ( k , t ) z k , � � k !( ∂ z ) k � z =0 Kolmogorov equations : ∂ P ∗ ( z , t ) /∂ t ( D 0 + zD 1 ) P ∗ ( z , t ) = → P ∗ ( z , t ) = exp[( D 0 + zD 1 ) t ] . 20

  21. Motivation in population biology The MBT model Global data Individual data Numerical results Computing P ( k ) (II) Successive differentiations w.r. to z : ∂ P ∗ ( z , t ) /∂ t = ( D 0 + zD 1 ) P ∗ ( z , t ) ∂ 2 P ∗ ( z , t ) / ( ∂ t )( ∂ z ) D 1 P ∗ ( z , t ) + ( D 0 + zD 1 ) ∂ P ∗ ( z , t ) /∂ z = . . . ∂ ( K +1) P ∗ ( z , t ) / ( ∂ t )( ∂ z ) K KD 1 ∂ ( K − 1) P ∗ ( z , t ) / ( ∂ z ) ( K − 1) = +( D 0 + zD 1 ) ∂ K P ∗ ( z , t ) / ( ∂ z ) K . Equivalent to ∂ t Y ( z , t ) = A ( z ) Y ( z , t ) with Y i ( z , t ) = ∂ i − 1 P ∗ ( z , t ) / ( ∂ z ) i − 1 1 ≤ i ≤ K + 1 ,   ( D 0 + zD 1 ) ( D 0 + zD 1 ) D 1 2 D 1 ( D 0 + zD 1 ) A ( z ) =   ...   KD 1 ( D 0 + zD 1 ) 21

Recommend


More recommend