Comments on maximum likelihood

• likelihood-based methods preferable to variogram-based methods
• restricted maximum likelihood is widely recommended, but in our experience is sensitive to mis-specification of the mean model
• in spatial models, distinction between µ(x) and S(x) is not sharp
• composite likelihood treats contributions from pairs (Y_i, Y_j) as if independent
• approximate likelihoods useful for handling large data-sets
• examining profile likelihoods is advisable, to check for poorly identified parameters
Swiss rainfall data
Swiss rainfall: trans-Gaussian model

Box-Cox transformation:
    Y*_i = h_λ(Y_i) = (y_i^λ − 1)/λ   if λ ≠ 0
                    = log(y_i)        if λ = 0

Log-likelihood:
    ℓ(β, θ, λ) = −(1/2){ log|σ²V| + (h_λ(y) − Dβ)' (σ²V)⁻¹ (h_λ(y) − Dβ) }
                 + Σ_{i=1}^n log{ (y_i)^{λ−1} }
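To make the formula concrete, here is a minimal base-R sketch of the trans-Gaussian log-likelihood above; 'D' is the covariate matrix and 'V' the n × n correlation-plus-nugget matrix implied by (φ, τ², κ). All object names are illustrative, not part of the original material.

```r
## Box-Cox transform and the trans-Gaussian log-likelihood (illustrative sketch)
boxcox <- function(y, lambda) {
  if (abs(lambda) < 1e-8) log(y) else (y^lambda - 1) / lambda
}

loglik_tg <- function(y, D, V, beta, sigma2, lambda) {
  h <- boxcox(y, lambda)
  r <- h - D %*% beta
  Sig <- sigma2 * V
  ldet <- as.numeric(determinant(Sig, logarithm = TRUE)$modulus)
  quad <- as.numeric(t(r) %*% solve(Sig, r))
  ## -(1/2){log|sigma2 V| + quadratic form} plus the Jacobian of the transformation
  -0.5 * (ldet + quad) + (lambda - 1) * sum(log(y))
}
```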
Swiss rainfall: profile log-likelihoods for λ

[image removed] Left panel: κ = 0.5; centre panel: κ = 1; right panel: κ = 2
Swiss rainfall: MLE’s (λ = 0.5)

  κ      µ̂       σ̂²       φ̂       τ̂²     log L̂
  0.5    18.36   118.82   87.97   2.48   −2464.315
  1      20.13   105.06   35.79   6.92   −2462.438
  2      21.36    88.58   17.73   8.72   −2464.185

Likelihood criterion favours κ = 1
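A hedged sketch of how such fits might be obtained with geoR's likfit(); the dataset object (sic.100), the initial covariance parameters and the particular data subset are assumptions taken from the geoR documentation and may need adjustment.

```r
## Maximum likelihood fit of the Matern model with Box-Cox lambda fixed at 0.5
library(geoR)
data(SIC)                                        # Swiss rainfall data shipped with geoR
fit <- likfit(sic.100, ini.cov.pars = c(100, 30), cov.model = "matern",
              kappa = 1, fix.kappa = TRUE,       # repeat for kappa = 0.5 and 2
              lambda = 0.5, fix.lambda = TRUE)   # Box-Cox parameter held fixed
summary(fit)
```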
Swiss rainfall: profile log-likelihoods (λ = 0.5, κ = 1)

[image removed] Left panel: σ²; centre panel: φ; right panel: τ²
Swiss rainfall: plug-in predictions and prediction variances

[image removed: two maps over X and Y coordinates]
Swiss rainfall: non-linear prediction

[image removed]

Left panel: plug-in prediction for proportion of total area with rainfall exceeding 200 (= 20mm)
Right panel: plug-in predictive map of P(rainfall > 250 | Y)
Section 3 Bayesian inference
Basics

Model specification
    [Y, S, θ] = [θ][S | θ][Y | S, θ]

Parameter estimation
• integration gives [Y, θ] = ∫ [Y, S, θ] dS
• Bayes’ Theorem gives posterior distribution [θ | Y] = [Y | θ][θ] / [Y]
• where [Y] = ∫ [Y | θ][θ] dθ
Prediction: S → S*

• expand model specification to [Y, S, S*, θ] = [θ][S | θ][Y | S, θ][S* | S, θ]
• plug-in predictive distribution is [S* | Y, θ̂]
• Bayesian predictive distribution is [S* | Y] = ∫ [S* | Y, θ][θ | Y] dθ
• for any target T = t(S*), the required predictive distribution [T | Y] follows
Notes

• likelihood function is central to both classical and Bayesian inference
• Bayesian prediction is a weighted average of plug-in predictions, with different plug-in values of θ weighted according to their conditional probabilities given the observed data
• Bayesian prediction is usually more conservative than plug-in prediction
Bayesian computation

1. evaluating the integral which defines [S* | Y] is often difficult
2. Markov chain Monte Carlo methods are widely used
3. but for geostatistical problems, reliable implementation of MCMC is not straightforward (no natural Markovian structure)
4. for the Gaussian model, direct simulation is available
Gaussian models: known (σ², φ)

    Y ~ N(Dβ, σ²R(φ))

• choose conjugate prior β ~ N(m_β; σ²V_β)
• posterior for β is [β | Y, σ², φ] ~ N(β̂, σ²V_β̂), where
    β̂   = (V_β⁻¹ + D'R⁻¹D)⁻¹ (V_β⁻¹ m_β + D'R⁻¹ y)
    V_β̂ = (V_β⁻¹ + D'R⁻¹D)⁻¹
• predictive distribution for S* is
    p(S* | Y, σ², φ) = ∫ p(S* | Y, β, σ², φ) p(β | Y, σ², φ) dβ
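A minimal sketch of the conjugate update for β with (σ², φ) known, following the formulas above; 'R' is the correlation matrix R(φ), 'V_b' and 'm_b' the prior scale matrix and mean. All names are illustrative.

```r
## Conjugate posterior for beta given (sigma2, phi)
beta_posterior <- function(y, D, R, m_b, V_b, sigma2) {
  Ri   <- solve(R)
  prec <- solve(V_b) + t(D) %*% Ri %*% D        # (V_b^{-1} + D'R^{-1}D)
  Vhat <- solve(prec)
  bhat <- Vhat %*% (solve(V_b) %*% m_b + t(D) %*% Ri %*% y)
  list(mean = drop(bhat), var = sigma2 * Vhat)  # posterior N(bhat, sigma2 * Vhat)
}
```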
Notes

• mean and variance of the predictive distribution can be written explicitly (but are not given here)
• the predictive mean is a compromise between the prior mean and a weighted average of Y
• the predictive variance has three components:
  – a priori variance,
  – minus information in the data,
  – plus uncertainty in β
• limiting case V_β → ∞ corresponds to ordinary kriging
Gaussian models: unknown (σ², φ)

Convenient choice of prior is:
    [β | σ², φ] ~ N(m_b, σ²V_b)
    [σ² | φ]    ~ χ²_ScI(n_σ, S²_σ)   (scaled inverse chi-squared)
    [φ]         ~ arbitrary

• results in explicit expression for [β, σ² | Y, φ] and computable expression for [φ | Y], depending on choice of prior for φ
• in practice, use an arbitrary discrete prior for φ and combine posteriors conditional on φ by weighted averaging
Algorithm 1:

1. choose lower and upper bounds for φ according to the particular application, and assign a discrete uniform prior for φ on a set of values spanning the chosen range
2. compute the posterior [φ | Y] on this discrete support set
3. sample φ from the posterior [φ | Y]
4. attach the sampled value of φ to the conditional posterior [β, σ² | y, φ], and sample (β, σ²) from this distribution
5. repeat steps (3) and (4) as many times as required; the resulting sample of triplets (β, σ², φ) is a sample from the joint posterior distribution [β, σ², φ | Y]
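A schematic sketch of this sampling loop, assuming two hypothetical helper functions: post_phi(phi, y), returning the (unnormalised) marginal posterior [φ | Y] at a given φ, and sample_beta_sigma2(phi, y), drawing from [β, σ² | Y, φ]. Neither is defined in the lectures; they stand in for the explicit expressions mentioned on the previous slide.

```r
## Algorithm 1 as a loop over a discrete support for phi (illustrative)
phi_grid <- seq(5, 150, by = 5)                  # step 1: discrete support
w <- sapply(phi_grid, post_phi, y = y)           # step 2: posterior on the grid
w <- w / sum(w)

n_sim <- 1000
draws <- t(replicate(n_sim, {
  phi <- sample(phi_grid, 1, prob = w)           # step 3
  bs  <- sample_beta_sigma2(phi, y)              # step 4: draw from [beta, sigma2 | Y, phi]
  c(bs$beta, sigma2 = bs$sigma2, phi = phi)      # one triplet from the joint posterior
}))
```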
Predictive distribution for S* given φ is tractable, hence write

    p(S* | Y) = ∫ p(S* | Y, φ) p(φ | y) dφ

Algorithm 2:

1. discretise [φ | Y], as in Algorithm 1
2. compute the posterior [φ | Y]
3. sample φ from the posterior [φ | Y]
4. attach the sampled value of φ to [S* | y, φ] and sample from this to obtain realisations from [S* | Y]
5. repeat steps (3) and (4) as required

Note: extends immediately to multivariate φ (but may be computationally awkward)
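A hedged sketch: geoR's krige.bayes() implements this discrete-prior scheme directly. The dataset object (sic.100), the prediction grid limits and the discrete support for φ are assumptions for illustration; argument names follow the geoR documentation and should be checked against the installed version.

```r
## Bayesian prediction with a discrete uniform prior on phi (Algorithms 1-2)
library(geoR)
data(SIC)
pred_grid <- expand.grid(seq(0, 350, length = 50), seq(-50, 250, length = 50))
bayes_pred <- krige.bayes(sic.100, locations = pred_grid,
                          model  = model.control(cov.model = "matern", kappa = 1),
                          prior  = prior.control(phi.prior = "uniform",
                                                 phi.discrete = seq(0, 150, by = 7.5)),
                          output = output.control(n.posterior = 1000,
                                                  n.predictive = 1000))
```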
Swiss rainfall: priors and posteriors for φ (left) and ν² (right)

[image removed: prior and posterior densities for φ and ν²]
Swiss rainfall: mean (left panel) and variance (right panel) of the predictive distribution

[image removed]
Swiss rainfall: posterior means and 95% credible intervals

  parameter   estimate     95% interval
  β           144.35       [53.08, 224.28]
  σ²          13662.15     [8713.18, 27116.35]
  φ           49.97        [30, 82.5]
  ν²          0.03         [0, 0.05]
Swiss rainfall: non-linear prediction

[image removed]

Left panel: Bayesian (solid) and plug-in (dashed) prediction for proportion of total area with rainfall exceeding 200 (= 20mm)
Right panel: Bayesian predictive map of P(rainfall > 250 | Y)
Section 4 Generalized linear models
Generalized linear geostatistical model

• Latent spatial process
    S(x) ~ SGP{0, σ², ρ(u)},   ρ(u) = exp(−|u|/φ)
• Linear predictor
    η(x) = d(x)'β + S(x)
• Link function
    E[Y_i] = µ_i = h{η(x_i)}
• Conditional distribution for Y_i, i = 1, ..., n:
    Y_i | S(·) ~ f(y; η), mutually independent
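To make the specification concrete, a minimal base-R simulation of a GLGM with Poisson conditional distribution and log link on the unit square; all parameter values (φ = 0.2, σ² = 1, β = 0.5) are illustrative.

```r
## Simulate a Poisson log-linear GLGM at n random locations
set.seed(1)
n  <- 100
xy <- cbind(runif(n), runif(n))                  # sampling locations x_i
phi <- 0.2; sigma2 <- 1
R <- sigma2 * exp(-as.matrix(dist(xy)) / phi)    # exponential correlation rho(u)
S <- drop(t(chol(R)) %*% rnorm(n))               # latent Gaussian process S(x)
eta <- 0.5 + S                                   # linear predictor d(x)'beta + S(x)
y <- rpois(n, lambda = exp(eta))                 # conditionally independent counts
```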
GLGM

• usually just a single realisation is available, in contrast with GLMMs for longitudinal data analysis
• the GLM approach is most appealing when there is a natural sampling mechanism, for example the Poisson model for counts or logistic-linear models for proportions
• transformed Gaussian models may be more useful for non-Gaussian continuous responses
• theoretical variograms can be derived, but are less natural as summary statistics than in the Gaussian case
• but empirical variograms of GLM residuals can still be useful for exploratory analysis
A binomial logistic-linear model

• S(·) ~ zero-mean Gaussian process
• [Y(x_i) | S(x_i)] ~ Bin(n_i; p_i)
• h(p_i) = log{p_i / (1 − p_i)} = Σ_{j=1}^k d_ij β_j + S(x_i)
• model can be expanded by adding uncorrelated random effects Z_i,
    h(p_i) = Σ_{j=1}^k d_ij β_j + S(x_i) + Z_i
  to distinguish between two forms of the nugget effect:
  – binomial variation is the analogue of measurement error
  – Z_i is the analogue of short-range spatial variation
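A companion sketch to the Poisson simulation above: the binomial logistic-linear model, including the extra uncorrelated effects Z_i. All parameter values (φ = 0.2, σ² = 1, τ² = 0.1, n_i = 20) are illustrative.

```r
## Simulate binomial responses with a spatial term S and a non-spatial nugget Z
set.seed(2)
n  <- 100
xy <- cbind(runif(n), runif(n))
R  <- exp(-as.matrix(dist(xy)) / 0.2)            # correlation, phi = 0.2, sigma2 = 1
S  <- drop(t(chol(R)) %*% rnorm(n))              # latent spatial process
Z  <- rnorm(n, sd = sqrt(0.1))                   # uncorrelated random effects, tau2 = 0.1
p  <- plogis(-1 + S + Z)                         # logit^{-1}(d'beta + S + Z)
y  <- rbinom(n, size = 20, prob = p)             # binomial denominators n_i = 20
```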
Simulation of a binary logistic-linear model

[image removed: simulated binary data plotted against location]

• data contain little information about S(x)
• leads to wide prediction intervals for p(x)
• more informative for binomial responses with large n_i
Inference

• Likelihood function
    L(θ) = ∫_{R^n} ∏_{i=1}^n f(y_i; h⁻¹(s_i)) f(s | θ) ds_1 ... ds_n
• involves high-dimensional integration
• MCMC algorithms exploit conditional independence structure
Conditional independence graph

[diagram removed: undirected graph with vertices θ, β, S, S* and Y]

• only need vertex S* at prediction stage
• corresponding DAG would delete the edge between S and β
Prediction with known parameters

• simulate s(1), ..., s(m) from [S | y] (using MCMC)
• simulate s*(j) from [S* | s(j)], j = 1, ..., m (multivariate Gaussian)
• approximate E[T(S*) | y] by (1/m) Σ_{j=1}^m T(s*(j))
• if possible, reduce Monte Carlo error by
  – calculating E[T(S*) | s(j)] directly
  – estimating E[T(S*) | y] by (1/m) Σ_{j=1}^m E[T(S*) | s(j)]
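A schematic Monte Carlo approximation of E[T(S*) | y]; here s_draws is an m × n matrix of MCMC samples from [S | y], and sim_Sstar() and T_fun() are hypothetical placeholders for the conditional Gaussian simulator and the target functional.

```r
## Monte Carlo approximation of E[T(S*) | y] from conditional simulations
m <- nrow(s_draws)
T_vals <- numeric(m)
for (j in 1:m) {
  s_star    <- sim_Sstar(s_draws[j, ])   # draw from [S* | s(j)] (multivariate Gaussian)
  T_vals[j] <- T_fun(s_star)             # e.g. proportion of a grid exceeding a threshold
}
mean(T_vals)                             # approximates E[T(S*) | y]
```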
MCMC for conditional simulation

• Let S = D'β + Σ^{1/2}Γ, with Γ ~ N_n(0, I)
• Conditional density: f(γ | y) ∝ f(y | γ) f(γ)

Langevin-Hastings algorithm
• Proposal: γ′ from N_n(ξ(γ), hI), with ξ(γ) = γ + (h/2) ∇ log f(γ | y)
• Example, Poisson log-linear spatial model:
    ∇ log f(γ | y) = −γ + (Σ^{1/2})'(y − exp(s)),   s = Σ^{1/2}γ
• expression generalises to other generalised linear spatial models
• MCMC output γ(1), ..., γ(m); hence samples s(j) = Σ^{1/2}γ(j) from [S | y]
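A sketch of a single Langevin-Hastings (MALA) update for Γ in the Poisson log-linear case, following the gradient above; for simplicity the fixed-effects term is taken as zero, so s = Σ^{1/2}γ as on the slide. 'Sig_half' (a matrix square root of Σ) and the step size 'h' are assumed given; this is illustrative, not the lectures' implementation.

```r
## log posterior of gamma (up to a constant) and its gradient, Poisson log link
log_post <- function(gamma, y, Sig_half) {
  s <- drop(Sig_half %*% gamma)
  sum(y * s - exp(s)) - 0.5 * sum(gamma^2)
}
grad_log_post <- function(gamma, y, Sig_half) {
  s <- drop(Sig_half %*% gamma)
  -gamma + drop(t(Sig_half) %*% (y - exp(s)))
}

## one Langevin-Hastings step: Gaussian proposal centred at xi(gamma)
mala_step <- function(gamma, y, Sig_half, h) {
  xi <- function(g) g + (h / 2) * grad_log_post(g, y, Sig_half)
  prop <- rnorm(length(gamma), mean = xi(gamma), sd = sqrt(h))
  log_acc <- log_post(prop, y, Sig_half) - log_post(gamma, y, Sig_half) +
    sum(dnorm(gamma, xi(prop), sqrt(h), log = TRUE)) -
    sum(dnorm(prop,  xi(gamma), sqrt(h), log = TRUE))
  if (log(runif(1)) < log_acc) prop else gamma
}
```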
MCMC for Bayesian inference

Posterior:
• update Γ from [Γ | y, β, log(σ), log(φ)] (Langevin-Hastings)
• update β from [β | Γ, log(σ), log(φ)] (RW-Metropolis)
• update log(σ) from [log(σ) | Γ, β, log(φ)] (RW-Metropolis)
• update log(φ) from [log(φ) | Γ, β, log(σ)] (RW-Metropolis)

Predictive:
• simulate (s(j), β(j), σ²(j), φ(j)), j = 1, ..., m (MCMC)
• simulate s*(j) from [S* | s(j), β(j), σ²(j), φ(j)], j = 1, ..., m (multivariate Gaussian)
Comments

• the above is not necessarily the most efficient algorithm available
• a discrete prior for φ reduces computing time
• can thin the MCMC output if storage is a limiting factor
• similar algorithms can be developed for MCMC maximum likelihood estimation
Some computational resources

• geoR package: http://www.est.ufpr.br/geoR
• geoRglm package: http://www.est.ufpr.br/geoRglm
• R-project: http://www.R-project.org
• CRAN spatial task view: http://cran.r-project.org/src/contrib/Views/Spatial.html
• AI-Geostats web-site: http://www.ai-geostats.org
• and more ...
[image removed]
[image removed]
African Programme for Onchocerciasis Control

• “river blindness” – an endemic disease in wet tropical regions
• donation programme of mass treatment with ivermectin
• approximately 30 million treatments to date
• serious adverse reactions experienced by some patients highly co-infected with Loa loa parasites
• precautionary measures put in place before mass treatment in areas of high Loa loa prevalence

http://www.who.int/pbd/blindness/onchocerciasis/en/
The Loa loa prediction problem

Ground-truth survey data
• random sample of subjects in each of a number of villages
• blood samples test positive/negative for Loa loa

Environmental data (satellite images)
• measured on a regular grid to cover the region of interest
• elevation, green-ness of vegetation

Objectives
• predict local prevalence throughout the study region (Cameroon)
• compute local exceedance probabilities, P(prevalence > 0.2 | data)
Loa loa: a generalised linear model

• Latent spatial process
    S(x) ~ SGP{0, σ², ρ(u)},   ρ(u) = exp(−|u|/φ)
• Linear predictor, with d(x) = environmental variables at location x:
    η(x) = d(x)'β + S(x),   log[p(x)/{1 − p(x)}] = η(x)
• Error distribution
    Y_i | S(·) ~ Bin{n_i, p(x_i)}
Schematic representation of Loa loa model

[image removed: prevalence against location, decomposed into environmental gradient, local fluctuation and sampling error]
The modelling strategy

• use the relationship between environmental variables and ground-truth prevalence to construct preliminary predictions via logistic regression
• use local deviations from the regression model to estimate smooth residual spatial variation
• Bayesian paradigm for quantification of uncertainty in the resulting model-based predictions
logit prevalence vs elevation

[image removed]
logit prevalence vs MAX = max NDVI

[image removed]
Comparing non-spatial and spatial predictions in Cameroon

[image removed: non-spatial predictions and spatial predictions]
Probabilistic prediction in Cameroon
Next Steps

• analysis confirms value of local ground-truth prevalence data
• in some areas, need more ground-truth data to reduce predictive uncertainty
• but parasitological surveys are expensive
Field-work is difficult! [image removed]
RAPLOA

• a cheaper alternative to parasitological sampling:
  – have you ever experienced eye-worm?
  – did it look like this photograph?
  – did it go away within a week?
• RAPLOA data to be collected:
  – in a sample of villages previously surveyed parasitologically (to calibrate parasitology vs RAPLOA estimates)
  – in villages not surveyed parasitologically (to reduce local uncertainty)
• bivariate model needed for combined analysis of parasitological and RAPLOA prevalence estimates
[image removed]
RAPLOA calibration

[image removed: parasitology vs RAPLOA prevalence (left) and the same relationship on the logit scale (right)]

Empirical logit transformation linearises the relationship
Colour-coding corresponds to four surveys in different regions
RAPLOA calibration (ctd)

[image removed: parasitology prevalence vs RAPLOA prevalence]

Fit linear functional relationship on logit scale and back-transform
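An illustrative calibration sketch: empirical logits of the two prevalence estimates, a straight-line fit, and back-transformation. Ordinary least squares is used here for simplicity, whereas the lectures fit a linear functional (errors-in-variables) relationship; the village-level counts are simulated purely so the code runs.

```r
## Empirical-logit calibration of RAPLOA against parasitology prevalence
set.seed(3)
n_vill <- 80; m <- 60                                          # villages, subjects per village
true_p   <- plogis(rnorm(n_vill, -1, 1))
k_para   <- rbinom(n_vill, m, true_p)                          # parasitology positives
k_raploa <- rbinom(n_vill, m, plogis(qlogis(true_p) + 0.3))    # RAPLOA positives

emp_logit <- function(k, n) log((k + 0.5) / (n - k + 0.5))
fit <- lm(emp_logit(k_para, m) ~ emp_logit(k_raploa, m))       # straight line on logit scale
calibrate <- function(p_raploa)                                # RAPLOA -> predicted parasitology prevalence
  plogis(coef(fit)[1] + coef(fit)[2] * qlogis(p_raploa))
calibrate(0.4)
```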
Parasitology/RAPLOA bivariate model

• treat prevalence estimates as conditionally independent binomial responses
• with a bivariate latent Gaussian process {S_1(x), S_2(x)} in the linear predictor
• to ease computation, write the joint distribution as
    [S_1(x), S_2(x)] = [S_1(x)][S_2(x) | S_1(x)]
  with a low-rank spline representation of S_1(x)
Lecture 3 Geostatistical design; geostatistics and marked point processes
Section 5 Geostatistical design
Geostatistical design

• Retrospective: add to, or delete from, an existing set of measurement locations x_i ∈ A : i = 1, ..., n
• Prospective: choose optimal positions for a new set of measurement locations x_i ∈ A : i = 1, ..., n
Naive design folklore

• Spatial correlation decreases with increasing distance.
• Therefore, close pairs of points are wasteful.
• Therefore, spatially regular designs are a good thing.
Less naive design folklore

• Spatial correlation decreases with increasing distance.
• Therefore, close pairs of points are wasteful if you know the correct model.
• But in practice, at best, you need to estimate unknown model parameters.
• And to estimate model parameters, you need your design to include a wide range of inter-point distances.
• Therefore, spatially regular designs should be tempered by the inclusion of some close pairs of points.
Examples of compromise designs

[image removed: A) lattice plus close pairs design; B) lattice plus in-fill design]
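A minimal sketch of generating a "lattice plus close pairs" design of the kind shown in panel A: a regular k × k lattice on the unit square, plus a few extra points placed a small distance from randomly chosen lattice points. The lattice size, number of close pairs and offset distance are illustrative choices, not values from the lectures.

```r
## Lattice plus close pairs design on the unit square (illustrative)
set.seed(4)
k <- 8; n_close <- 10; delta <- 0.03
lattice <- as.matrix(expand.grid(x = seq(0, 1, length = k),
                                 y = seq(0, 1, length = k)))
idx   <- sample(nrow(lattice), n_close)          # lattice points receiving a close neighbour
angle <- runif(n_close, 0, 2 * pi)
close_pairs <- lattice[idx, ] + delta * cbind(cos(angle), sin(angle))
design <- rbind(lattice, close_pairs)
plot(design, pch = 19, asp = 1)                  # regular grid with a few close pairs
```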
A Bayesian design criterion

Assume the goal is prediction of S(x) for all x ∈ A, with

    [S | Y] = ∫ [S | Y, θ][θ | Y] dθ

For retrospective design, minimise

    v̄ = ∫_A Var{S(x) | Y} dx

For prospective design, minimise

    E(v̄) = ∫_y { ∫_A Var{S(x) | y} dx } f(y) dy

where f(y) corresponds to [Y] = ∫ [Y | θ][θ] dθ
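A simplified sketch of the retrospective criterion: average the prediction variance of S(x) over a fine grid in A, here under a simple-kriging, known-parameter exponential model (σ² = 1, φ = 0.2, no nugget) rather than the full Bayesian average over [θ | Y]. The greedy deletion loop is one simple search strategy assumed here for illustration; the lectures do not prescribe this particular search.

```r
## Averaged prediction variance over a grid, and greedy deletion of design points
cross_dist <- function(a, b)
  sqrt(pmax(outer(rowSums(a^2), rowSums(b^2), "+") - 2 * a %*% t(b), 0))

vbar <- function(design, grid, sigma2 = 1, phi = 0.2) {
  K  <- sigma2 * exp(-as.matrix(dist(design)) / phi)       # covariance among design points
  k0 <- sigma2 * exp(-cross_dist(grid, design) / phi)      # covariance grid vs design
  mean(sigma2 - rowSums((k0 %*% solve(K)) * k0))           # simple kriging variance, averaged
}

greedy_delete <- function(design, grid, n_final) {
  while (nrow(design) > n_final) {
    crit <- sapply(seq_len(nrow(design)),
                   function(i) vbar(design[-i, , drop = FALSE], grid))
    design <- design[-which.min(crit), , drop = FALSE]      # drop the least harmful point
  }
  design
}

grid    <- as.matrix(expand.grid(seq(0, 1, length = 15), seq(0, 1, length = 15)))
design0 <- matrix(runif(2 * 30), ncol = 2)                  # arbitrary starting design
design_final <- greedy_delete(design0, grid, n_final = 20)
```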
Results

Retrospective: deletion of points from a monitoring network

[image removed: start design]
Selected final designs

[image removed: nine panels A–I, rows corresponding to τ²/σ² = 0, 0.3 and 0.6]