Practical advice: Real survey data is messy

Practical advice

Real survey data is messy

Distance sampling in the Real World
- We've talked a lot about models
- We've also talked about assumptions
- Our example is relatively well-behaved
- What can we do about all the nasty real world stuff?

Some days...

Aims
- Here we want to cover common questions
- Not definitive answers
- Some guidance on where to look for answers

What should my sample size be?

What do we mean by "sample size"?
- Number of animals (or groups) recorded: matters for the detection function
- Number of segments: matters for the spatial model
- Number of segments with observations: matters for the spatial model
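The three meanings of "sample size" listed here can be counted directly from the observation and segment tables. This is a minimal sketch on toy data; the column names `Sample.Label` and `distance` and all values are assumptions made for illustration, not data from the survey discussed in these slides.

```r
# Hypothetical toy data: one row per detection, one row per segment.
obs <- data.frame(
  Sample.Label = c("seg1", "seg1", "seg3", "seg4", "seg4", "seg4"),
  distance     = c(12, 40, 5, 33, 18, 51)
)
segs <- data.frame(Sample.Label = paste0("seg", 1:6))

# Detections inform the detection function.
n_detections <- nrow(obs)

# Segments, and segments with at least one detection, inform the spatial model.
n_segments     <- nrow(segs)
n_segments_obs <- sum(segs$Sample.Label %in% obs$Sample.Label)

print(c(detections = n_detections,
        segments = n_segments,
        segments_with_obs = n_segments_obs))
```

Reporting all three numbers together makes it harder to confuse "lots of detections" with "lots of information for the spatial model".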

Re-frame

How would we know when we have enough samples?
- We don't
- Heavily context-dependent
- Go back to assumptions

"How many data?"

Pilot studies and "you get what you pay for"
- Designing surveys is hard
- Designing surveys is essential
- Better to fail one season than fail for 5, 10 years
- Get information early, get it cheap
- Inform design from a pilot study

Avoiding rules of thumb
- Think about assumptions
  - Detection function
  - Spatial model
- Think about design
  - Spatial coverage
  - Covariate coverage

Spatial coverage (IWC POWER)

Covariate coverage
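One simple way to look at covariate coverage is to compare the range of a covariate on the surveyed segments with its range over the prediction grid. The sketch below uses simulated values; the covariate name `depth`, the ranges, and the object names are all assumptions made for illustration.

```r
set.seed(1)

# Hypothetical covariate values on surveyed segments and on the prediction grid.
segs <- data.frame(depth = runif(200, 50, 1500))
pred <- data.frame(depth = runif(1000, 10, 4000))

# Range of depths that the survey effort actually covered.
surveyed_range <- range(segs$depth)

# Prediction cells whose depth lies outside the surveyed range are extrapolations.
outside <- pred$depth < surveyed_range[1] | pred$depth > surveyed_range[2]
prop_extrapolated <- mean(outside)

print(surveyed_range)
print(prop_extrapolated)
```

A large proportion of cells outside the surveyed range suggests that predictions there rest on the shape of the smooth rather than on data. This one-dimensional check does not detect gaps in combinations of covariates.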

Sometimes things are complicated
- Weather has a big effect on detectability
- Need to record it during the survey
- Disambiguate between distribution and detectability
- Potential confounding can be BAD

Visibility during POWER 2014
- Thanks to Hiroto Murase and co. for this data!

Covariates can make a big difference!

Disappointment
- Sometimes you don't have enough data
- Or, enough coverage
- Or, the right covariates
- Sometimes, you can't build a spatial model

@kitabet

"Which of options X, Y, Z is correct?"

Alternatives problem
- When faced with options, try them
- Where does the sensitivity lie?
- What's really going on?
- What is your objective?

"How big should our segments be?"

Segment size
- If you think it's an issue, test it
- Resolution of covariates also important
- Maybe species-/domain-dependent?
- (Solutions on the horizon to avoid this)

"Is our model right?"

Model validation
- Some variety of cross-validation
- Temporal replication
  - Leave out 1 year, fit to others, predict, assess
- Spatial "pseudo-jackknife"
  - Leave out every nth segment, refit, …
  - (Maybe leave out 2, 3 etc…)
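The temporal scheme (leave out one year, fit to the others, predict, assess) can be sketched with `mgcv::gam` on simulated counts. The data, the covariate `depth`, the Poisson response and the use of RMSE as the assessment score are assumptions made for illustration; a real analysis would use the fitted density surface model and a score suited to its objective.

```r
library(mgcv)
set.seed(42)

# Hypothetical segment data: four survey years, one covariate, Poisson counts.
n <- 400
dat <- data.frame(
  year  = sample(2010:2013, n, replace = TRUE),
  depth = runif(n, 0, 1)
)
dat$count <- rpois(n, exp(0.5 + sin(2 * pi * dat$depth)))

years <- sort(unique(dat$year))

# For each year: fit to the other years, predict the held-out year, score it.
cv_rmse <- sapply(years, function(yr) {
  train <- dat[dat$year != yr, ]
  test  <- dat[dat$year == yr, ]
  fit   <- gam(count ~ s(depth), family = poisson, data = train, method = "REML")
  pred  <- predict(fit, newdata = test, type = "response")
  sqrt(mean((test$count - pred)^2))
})
names(cv_rmse) <- years
print(cv_rmse)
```

A year with a much larger score than the others is a prompt to look at what was different about that year's survey, not proof that the model is wrong.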

Modelling philosophy

Which covariates should we include?
- Dynamic vs static variables
- Spatial terms?
- Habitat models?

Getting help

Resources
- Bibliography has pointers to these topics
- Distance sampling Google Group
  - Friendly, helpful, low traffic
  - see distancesampling.org/distancelist.html

Advanced topics

This is a whirlwind tour...

...and some of this is experimental

Smoother zoo

Cyclic smooths
- What if things "wrap around"? (Time, angles, …)
- Match value and derivative
- Use bs="cc"
- See ?smooth.construct.cs.smooth.spec
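A small sketch of a cyclic smooth with `bs="cc"` in `mgcv`, using a simulated covariate that wraps around after 12 months. The data and the variable names are assumptions made for illustration. Supplying the two end points through `knots` tells the basis where the cycle closes.

```r
library(mgcv)
set.seed(1)

# Hypothetical response with an annual cycle; month is continuous on [0, 12].
n <- 300
dat <- data.frame(month = runif(n, 0, 12))
dat$y <- sin(2 * pi * dat$month / 12) + rnorm(n, sd = 0.3)

# Cyclic cubic regression spline; the knots give the points where the ends join.
fit <- gam(y ~ s(month, bs = "cc", k = 8), data = dat,
           knots = list(month = c(0, 12)), method = "REML")

# Predictions at the two ends of the cycle should agree.
ends <- predict(fit, newdata = data.frame(month = c(0, 12)))
print(ends)
```

Without the `knots` argument the end points default to the range of the observed data, which may not be the true period.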

Smoothing in complex regions
- Edges are important
- Whales don't live on land
- Bad things happen when we don't account for this
- Include boundary info in smoother
- ?soap
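A sketch of a soap film smoother, adapted from the simulated horseshoe example shipped with `mgcv` (`fs.boundary()` and `fs.test()`). Nothing here comes from a real survey: the boundary, the knot locations and the noise level are the package's test setup, and a real analysis would need its own boundary polygon and interior knots.

```r
library(mgcv)
set.seed(0)

# Boundary of mgcv's horseshoe-shaped test region, as a list of polygons.
fsb <- list(fs.boundary())
names(fsb[[1]]) <- c("v", "w")

# Interior knots for the soap film; these must lie inside the boundary.
knots <- data.frame(v = rep(seq(-0.5, 3, by = 0.5), 4),
                    w = rep(c(-0.6, -0.3, 0.3, 0.6), rep(8, 4)))

# Simulate noisy data and keep only the points inside the boundary.
n <- 600
v <- runif(n) * 5 - 1
w <- runif(n) * 2 - 1
y <- fs.test(v, w, b = 1) + rnorm(n) * 0.3
ind <- inSide(fsb, x = v, y = w)
dat <- data.frame(y = y[ind], v = v[ind], w = w[ind])

# bs = "so" is the soap film basis; the boundary is passed through xt.
b <- gam(y ~ s(v, w, k = 30, bs = "so", xt = list(bnd = fsb, nmax = 100)),
         data = dat, knots = knots)
print(summary(b)$r.sq)
```

Here `k` sets the dimension of the boundary smooth, while the number of interior knots controls flexibility inside the region.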

Multivariate smooths
- Thin plate splines are isotropic
- 1 unit in any direction is equal
- Fine for space, not for other things

Tensor products
- $s_{x,z}(x, z) = \sum_{k_1} \sum_{k_2} \beta_k s_x(x) s_z(z)$
- As many covariates as you like! (But takes time)
- te() or ti() (instead of s())
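A short sketch of the two tensor product forms in `mgcv`, on simulated data where the covariates sit on very different scales. The covariate names `x` and `depth`, the basis sizes and the data-generating function are assumptions made for illustration.

```r
library(mgcv)
set.seed(2)

# Hypothetical covariates on different scales: x in [0, 1], depth in metres.
n <- 500
dat <- data.frame(x = runif(n), depth = runif(n, 0, 1000))
dat$y <- sin(2 * pi * dat$x) * dat$depth / 1000 + rnorm(n, sd = 0.2)

# te(): a single smooth containing main effects and interaction together.
fit_te <- gam(y ~ te(x, depth, k = c(5, 5)), data = dat, method = "REML")

# ti(): interaction only, so the main effects are added as separate terms.
fit_ti <- gam(y ~ s(x) + s(depth) + ti(x, depth), data = dat, method = "REML")

print(AIC(fit_te, fit_ti))
```

Each margin of a tensor product gets its own smoothing parameter, which is why it copes with covariates measured in different units where an isotropic thin plate spline would not.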

Black bears like to sunbathe

Random effects
- Normal random effects
- Exploits equivalence of random effects and splines (?gam.vcomp)
- Useful when you just have a "few" random effects
- ?random.effects
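A sketch of a simple normal random effect fitted as a smooth in `mgcv`, using the `"re"` basis described in `?random.effects`. The grouping factor `observer`, the covariate `x` and the simulated effect sizes are assumptions made for illustration.

```r
library(mgcv)
set.seed(3)

# Hypothetical data: six observers, each with their own random offset.
n <- 300
obs_levels <- paste0("obs", 1:6)
dat <- data.frame(
  observer = factor(sample(obs_levels, n, replace = TRUE), levels = obs_levels),
  x        = runif(n)
)
obs_eff <- rnorm(6, sd = 0.5)
dat$y <- 1 + sin(2 * pi * dat$x) + obs_eff[as.integer(dat$observer)] +
  rnorm(n, sd = 0.3)

# The random intercept enters as a penalised term with bs = "re".
fit <- gam(y ~ s(x) + s(observer, bs = "re"), data = dat, method = "REML")

# Express the smoothing parameters as variance components (standard deviations).
vc <- gam.vcomp(fit)
n_re_coefs <- sum(grepl("observer", names(coef(fit))))
print(n_re_coefs)
```

This route is convenient for a handful of random effects; models with many or structured random effects may be better served by dedicated mixed-model software.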

Making things faster

Parallel processing
- Some models are very big/slow
- Run on multiple cores
- Use engine="bam"!
- Some constraints in what you can do
- Wood, Goude and Shaw (2015)
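The `engine="bam"` option refers to the `engine` argument of `dsm()`; underneath it calls `mgcv::bam`. The sketch below calls `bam` directly on simulated data so that it is self-contained. The data, the covariate names and the choice of two threads are assumptions made for illustration.

```r
library(mgcv)
set.seed(4)

# Hypothetical large-ish data set of segment counts.
n <- 5000
dat <- data.frame(x = runif(n), depth = runif(n))
dat$count <- rpois(n, exp(sin(2 * pi * dat$x) + dat$depth))

# bam() with discretised covariates; nthreads sets the number of cores used.
fit <- bam(count ~ s(x) + s(depth), family = poisson, data = dat,
           method = "fREML", discrete = TRUE, nthreads = 2)

print(summary(fit)$dev.expl)
```

`bam` restricts the available smoothing parameter estimation methods (fast REML by default), which is one of the constraints the slide mentions.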

Summary
- Lots of complicated problems
- Lots of potential solutions (see also "other approaches" mini-lecture)
- Need to get simple things right first
- Trade assumptions for data
