inverse problems mathematical and statistical methodology
play

Inverse Problems: Mathematical and Statistical Methodology H. T. - PowerPoint PPT Presentation

Inverse Problems: Mathematical and Statistical Methodology H. T. Banks Center for Research in Scientific Computation (CRSC) Center for Quantitative Sciences in Biomedicine (CQSB) North Carolina State University Raleigh, NC 27695 C enter for Q


  1. Inverse Problems: Mathematical and Statistical Methodology H. T. Banks Center for Research in Scientific Computation (CRSC) Center for Quantitative Sciences in Biomedicine (CQSB) North Carolina State University Raleigh, NC 27695 C enter for Q uantitative S ciences in B iomedicine North Carolina State University 1

  2. Outline of Talk • Motivation: Non-invasive Detection and Characterization of Occlusion and Cardiac Stenosis • Inverse Problems and Parameter Estimation: MLE, OLS, GLS • Computation of Σ, Standard Errors and Confidence Intervals • Error Analysis and Residual Plots Techniques • Model Comparison Techniques 2

  3. References [1] H.T. Banks and N. S. Luke, Simulations of propagating shear waves in biotissue employing an internal variable approach to dissipation , Tech. Rep. CRSC-TR06-28, NCSU, December, 2006; Communications in Computational Physics , 3 (2008), 603–640. [2] H.T. Banks and J.R. Samuels, Jr.,Detection of cardiac occlusions using viscoelastic wave propagation, CRSC-TR08-23, December, 2008; Advances in Applied Mathematics and Mechanics, 1 (2009), 1–28. [3] H.T. Banks, M. Davidian, J.R. Samuels, Jr., and K.L. Sutton, An Inverse Problem Statistical Methodology Summary, CRSC-TR08-01 , January, 2008; Chapter 11 in Statistical Estimation Approaches in Epidemiology , (edited by Gerardo 3

  4. Chowell, Mac Hyman, Nick Hengartner, Luis M.A Bettencourt and Carlos Castillo-Chavez), Springer, Berlin Heidelberg New York, 2009, pp. 249–302. 4

  5. Motivation • Develop models and inverse problem techniques for detection and characterization of cardiac stenosis • Stenosis induced turbulence in arteries produces normal forces on artery walls–propagated as shear waves to chest surface Stenosis or Arterial Occlusion 5

  6. • Sensors developed at MedAcoustics measure shear waves–need to understand sensor capabilities, sensitivities, uncertainty in estimates produced by inverse algorithms–experiments to be designed with scientists, physicians at Brunel University (Shaw, Whiteman), Queen Mary Univ and Royal London Hospital (Greenwald), Barts/London NHS Trust (Birch) Piezoelectric Sensors 6

  7. Shear Waves 7

  8. Model Geometry gel phantoms, pigs, humans 8

  9. 2-D geometry with multiple chest sensors 9

  10. 2-D Model [BL] (internal variable based viscoelastic) ρ∂ 2 u 1 ∂ + 1 ∂ + 1 ǫ λ + ǫ 11 ǫ 21 ǫ 11 µ − ǫ 22 � � � � � � = µ µ µ ∂t 2 ∂r r ∂θ r ρ∂ 2 u 2 ∂ + 1 ∂ + 1 ǫ 12 ǫ λ + ǫ 22 ǫ 12 µ + ǫ 21 � � � � � � = µ µ µ ∂t 2 ∂r r ∂θ r ∂ǫ λ k ∂ � � S (e) 11 + S (e) = − ν λ k ǫ λ k + C λ k 22 ∂t ∂t ∂ǫ 11 ∂ � � µ k S (e) − ν µ k ǫ 11 = µ k + C µ k 11 ∂t ∂t ∂ǫ 12 ∂ � � µ k S (e) − ν µ k ǫ 12 = µ k + C µ k 12 ∂t ∂t ∂ǫ 22 ∂ � � µ k S (e) − ν µ k ǫ 22 = µ k + C µ k 22 ∂t ∂t 10

  11. Relevant model terms: • radial ( r ) and tangential ( θ ) displacement of the shear wave ( u 1 and u 2 respectively) • ”elastic” stress tensor S (e) ij and internal strain variables ǫ λ and ǫ kl µ • parameters C λ k and C µ k for k = 1 , 2 in approximating reduced relaxation parameter G in Fung’s quasi-linear viscoelastic model • occulsion parameters q 1 and q 2 which arise in the inner radius boundary conditions Possible parameters to be estimated: � θ = ( q 1 , q 2 , C λ k , C µ k , ν λ k , ν µ k ) T 11

  12. Inverse Problems and Parameter Estimation: MLE, OLS, and GLS The Underlying Mathematical and Statistical Models Inverse or parameter estimation problems in context of parameterized (with vector parameter � θ ) dynamical system or mathematical model d� x x ( t ) , � dt ( t ) = � g ( t, � θ ) (1) with observation process x ( t ; � y ( t ) = C � � θ ) . (2) 12

  13. As usual assume a discrete form of observations– n longitudinal observations corresponding to x ( t j ; � � y ( t j ) = C � θ ) , j = 1 , . . . , n. (3) In general corresponding observations or data { � y j } will not be exactly � y ( t j ) — must treat this uncertainty pertaining to the observations with a statistical model for the observation process 13

  14. Description of Statistical Model We consider a statistical model of the form Y j = � � f ( t j , � θ 0 ) + � E j , j = 1 , . . . , n, (4) where • � f ( t j , � x ( t j ; � θ ) = C � θ ), j = 1 , . . . , n , corresponds to solution of mathematical model at the j th covariate for a particular vector of parameters � x ∈ R N , � θ ∈ R p , � f ∈ R m , • C is an m × N observation matrix • � θ 0 represents the “truth” or the parameters that generate the observations { � Y j } n j =1 • The term � E j can represent measurement error, “system fluctuations” or other phenomena that cause observations to not fall exactly on the points � f ( t j , � θ ) from the smooth path � f ( t, � θ ) 14

  15. • Fluctuations are unknown to modeler—assume � E j generated from a probability distribution that reflects the assumptions regarding these phenomena, e.g., in a statistical model for pharmacokinetics of drug in human blood samples, a natural distribution for E = ( E 1 , . . . , E n ) T might be the multivariate normal distribution � • Needs: methodology related to the estimation of the true value of the parameters � θ 0 from a set Θ of admissible parameters, the variance of the error var( � E j ) and resulting uncertainties • Here: two inverse problem methodologies for calculation of estimates ˆ θ for � θ 0 : the ordinary least-squares (OLS) and generalized least-squares (GLS) formulations as well as the popular maximum likelihood estimate (MLE) formulation in the case one assumes distributions of the error process { � E j } are known 15

  16. MLE: Known error processes: Normally distributed error • In statistical model, made no mention of the probability distribution that generates the error � E j • In many situations one readily assumes that errors � E j , j = 1 , . . . , n, are independent and identically distributed (iid) • In some cases, one is able to make further assumptions on the error , namely that the distribution for � E j is known . In this case maximum likelihood techniques may be used. • Discuss first for a scalar observation system, i.e., m = 1. Most common assumption: sufficient evidence to suspect the error is generated by a normal distribution then willing to assume 0 ), and hence Y j ∼ N ( f ( t j , � E j ∼ N (0 , σ 2 θ 0 ) , σ 2 0 ). 16

  17. Can then obtain expression for determining � θ 0 and σ 0 by seeking maximum over ( � θ, σ 2 ) ∈ Θ × (0 , ∞ ) of likelihood function for E j = Y j − f ( t j , � θ ) defined by n 2 πσ 2 exp {− 1 1 L ( � Y | � � 2 σ 2 [ Y j − f ( t j , � θ, σ 2 ) = θ )] 2 } √ (5) j =1 • Resulting solutions θ MLE and σ 2 MLE are the maximum likelihood estimators (MLEs) for � θ 0 and σ 2 0 , respectively • Solutions θ MLE = θ MLE ( � MLE ( � Y ) and σ 2 MLE = σ 2 Y ) are random variables because � Y is a random variable • Corresponding maximum likelihood estimates are obtained by maximizing (5) with { Y j } replaced by a given realization y = { y j } —will be denoted by ˆ � θ MLE and ˆ σ MLE , respectively. 17

  18. Maximizing (5) equivalent to maximizing log likelihood n θ, σ 2 ) = − n 2 log(2 π ) − n 1 2 log σ 2 − θ )] 2 (6) log L ( � Y | � � [ Y j − f ( t j , � 2 σ 2 j =1 Can determine maximum of (6) by differentiating with respect to � θ (with σ 2 fixed) and with respect to σ 2 (with � θ fixed), setting the resulting equations equal to zero and solving for � θ and σ 2 . With σ 2 θ log L ( � Y | � ∂ θ, σ 2 ) = 0 which is equivalent to fixed we solve ∂� n � [ Y j − f ( t j , � θ )] ∇ f ( t j , � θ ) = 0 (7) j =1 Solving (7) is the same as the least squares optimization n θ MLE ( � J ( � Y , � � [ Y j − f ( t j , � θ )] 2 Y ) = arg min θ ) = arg min (8) � � θ ∈ Θ θ ∈ Θ j =1 18

  19. Next fix � ∂σ 2 log L ( � ∂ Y | θ MLE , σ 2 ) = 0, which θ to be θ MLE and solve yields Y ) = 1 MLE ( � nJ ( � σ 2 Y , θ MLE ) (9) Note that we can solve for θ MLE and σ 2 MLE separately – a desirable feature—one that won’t arise in more complicated formulations discussed later The 2 nd derivative test verifies that expressions above for θ MLE and σ 2 MLE do indeed maximize (6) 19

  20. Vector Observations For vector observations, for the j th covariate t j statistical model reformulated as Y j = � � f ( t j , � θ 0 ) + � E j (10) f ∈ R m and where � V 0 = var( � E j ) = diag( σ 2 0 , 1 , . . . , σ 2 0 ,m ) (11) for j = 1 , . . . , n • allows for possibility that observation coordinates Y i j may have different constant variances σ 2 0 ,i , i.e., σ 2 0 ,i does not necessarily have to equal σ 2 0 ,k • if (again) there is sufficient evidence to claim errors are i.i.d. and generated by a normal distribution then � E j ∼ N m (0 , V 0 ) 20

Recommend


More recommend