Implementing Bootstrap Methods in R GETTING STARTED WITH - PowerPoint PPT Presentation

Central Limit Theorem A group of means of N samples drawn from any distribution (even a non-normal distribution) approaches normality as N approaches infinity.

Implication of the Central Limit Theorem Mean of non-normal population can be estimated easily by sampling Draw N samples, compute mean of each sample Compute mean of these means As N -> ∞ this mean of means approaches population mean

Establishing Confidence Intervals Conventional Bootstrap Approach Approach Sample multiple Sample once; make Sample once; times with or without strong assumptions resample that sample out replacement about population with replacement The Central Limit Theorem only applies to a group of means, so computing multiple samples is key

Establishing Confidence Intervals Conventional Bootstrap Approach Approach Sample multiple Sample once; make Sample once; times with or without strong assumptions resample that sample out replacement about population with replacement Not a very realistic approach in the real world

Establishing Confidence Intervals Conventional Bootstrap Approach Approach Sample multiple Sample once; make Sample once; times with or without strong assumptions resample that sample out replacement about population with replacement

Establishing Confidence Intervals Conventional Bootstrap Approach Approach Sample multiple Sample once; make Sample once; times with or without strong assumptions resample that sample out replacement about population with replacement Instead modelers choose only to work with data whose distributions are known

Establishing Confidence Intervals Conventional Bootstrap Approach Approach Sample multiple Sample once; make Sample once; times with or without strong assumptions resample that sample out replacement about population with replacement For normally distributed data we can often work with just one sample to estimate mean

Confidence Intervals from Normal Data Simple random sample . . . Population

Confidence Intervals from Normal Data Simple random sample . . . ↓ Calculate mean of sample ↓ Compute confidence intervals analytically Population

Demo The central limit theorem

Demo Observing the central limit theorem on a real dataset

Drawbacks of Conventional Methods

Drawbacks of Conventional Methods Make strong assumptions of the distribution of data Use analytical formulae to estimate statistics based on data distributions The analytical formula may not exist for certain combinations

Drawbacks of Conventional Methods Need to draw a large number of samples from the population Estimate statistics based on sampling distribution May not be practical or realistic

Estimating Population Statistic Conventional Bootstrap Approach Approach Sample population Sample once; once; calculate resample that sample sample statistic with replacement

Establishing Confidence Intervals Conventional Bootstrap Approach Approach Sample multiple Sample once; make Sample once; times with or without strong assumptions resample that sample out replacement about population with replacement

Establishing Confidence Intervals Conventional Bootstrap Approach Approach Sample multiple Sample once; make Sample once; times with or without strong assumptions resample that sample out replacement about population with replacement Parametric Method

Establishing Confidence Intervals Conventional Bootstrap Approach Approach Sample multiple Sample once; make Sample once; times with or without strong assumptions resample that sample out replacement about population with replacement Non-parametric Methods

The basic Bootstrap method is non- parametric, however parametric variants exist too

The Bootstrap Method

Conventional Methods Population Sample ∞ Sample 2 Sample 4 Sample 3 Sample 1

Confidence Intervals from Non-normal Data Sample values . . . ↓ Repeat Calculate mean of each multiple times sample ↓ 97.5% percentile 2.5% percentile Population Sample Means

Bootstrap Method Population Sample ∞ Sample 2 Sample 4 Sample 3 Sample 1 Draw just one sample from the population

Bootstrap Method Population Sample 1 Draw just one sample from the population

The Bootstrap Sample Population Bootstrap Sample Treat that one sample as if it were the population

Bootstrap Method Sample ∞ Sample 2 Sample 4 Sample 3 Sample 1 Draw multiple samples from the one sample with replacement

Bootstrap Method Sample ∞ Sample 2 Sample 4 Sample 3 Sample 1 Each of these samples is sometimes called a Bootstrap Replication

Estimate Statistics using the Bootstrap Method Sample ∞ Sample 2 Sample 4 Sample 3 Sample 1 With each bootstrap replication calculate the statistic e.g. mean

Estimate Statistics using the Bootstrap Method Sample ∞ Sample 2 Sample 4 Sample 3 Sample 1 Each estimate from a bootstrapped replication is called a bootstrap realization of the statistic

Confidence Intervals using the Bootstrap Method Sample ∞ Sample 2 Sample 4 Sample 3 Sample 1 Calculate confidence intervals using the bootstrap distribution of the statistic

Sampling with replacement is essential Else each Bootstrap Replication will merely reproduce the Bootstrap Sample

Sampling with Replacement Reusing the same data multiple times “Bootstrapping” comes from the phrase “pulling yourself up by your own bootstraps” Has empirically been shown to produce meaningful results

Sampling with Replacement Bootstrapping does not create new data Creates the samples that could have been drawn from the original population Assumes that the bootstrap sample accurately represents the population

The Bootstrap Method seems like cheating, but it is both theoretically sound and very robust

The Bootstrap Method and Confidence Intervals

Confidence Intervals with the Bootstrap Method Bootstrap Sample (treated as Population)

Confidence Intervals with the Bootstrap Method Sample values with replacement . . . Bootstrap Sample (treated as Population)

Confidence Intervals with the Bootstrap Method Sample values with replacement . . . ↓ Repeat Calculate mean of each multiple times sample ↓ Bootstrap Sample (treated as Population)

Confidence Intervals with the Bootstrap Method Sample values with replacement . . . ↓ Repeat Calculate mean of each multiple times sample ↓ Bootstrap Sample (treated as Population) Sample Means

Confidence Intervals with the Bootstrap Method Sample values with replacement . . . ↓ Repeat Calculate mean of each multiple times sample ↓ 97.5% percentile 2.5% percentile Bootstrap Sample (treated as Population) Sample Means

The Bootstrap Method Conventional Approach Bootstrap Method Sample population just once if no Sample population just once confidence intervals needed under all circumstances No need to re-sample for Re-sample bootstrap sample confidence intervals for common with replacement under all use-cases circumstances Re-sample population if No change in procedure, works confidence intervals needed for equally well for common and complex cases complex cases

The Bootstrap Method Great for - Arbitrary population (unknown distribution) - Arbitrary statistics (not commonly studied for arbitrary population) - Confidence interval around arbitrary statistics

Implementing Bootstrap Methods in R GETTING STARTED WITH - PowerPoint PPT Presentation

Implementing Bootstrap Methods in R GETTING STARTED WITH BOOTSTRAPPING IN R Janani Ravi CO-FOUNDER, LOONYCORN www.loonycorn.com Estimating statistics and calculating Overview confidence intervals The Central Limit Theorem Conventional

A better Bootstrap, Mack, and the ELRF and PTF modelling Frameworks Bootstrap technique- a

STAT 113 Bootstrap Confidence Intervals Colin Reimer Dawson Oberlin College 3 March 2017

1 Get Started 2 3 Web Application Development What is Bootstrap? Bootstrap is a free

Cross-validation and the Bootstrap In the section we discuss two resampling methods:

Bootstrap Shan-Hung Wu CS, NTHU Landing Page HTML/CSS taught so far Bootstrap 4 (alpha

AngularJS & Bootstrap Tabs, Forms, Models Tabs inside out Bootstrap has classes that

Bootstrap Percolation on Periodic Trees Milan Bradonji work with Iraj Saniee Bell Labs,

Lecture 21: Bootstrap and Permutation Tests The bootstrap Bootstrapping generally refers to

Bootstrap: A framework for CSS Jay Urbain, Ph.D. Credits: http://getbootstrap.com/

Unit 4: Inference for numerical variables Lecture 1: Bootstrap, paired, and two sample Statistics

Anima IETF 93 Charter Discussion Design Team Update bootstrap design team

Bootstrap method for misspecified stochastic differential equation models Yuma Uehara The

Parametric bootstrap August 30, 2017 Resampling from the data or from distribution Simple

Stochastic Simulation The Bootstrap method Bo Friis Nielsen Institute of Mathematical Modelling

Bootstrapping 18.05 Spring 2018 Agenda Leftover from 5/2 : binomial confidence intervals

Stochastic Simulation Non-parametric technique The Bootstrap method Bo Friis Nielsen

Compressed sensing, sparsity and p-values Sara van de Geer April 16, 2015 (Leiden) Dantzig

LINEAR REGRESSION LINEAR REGRESSION - FROM A MACHINE LEARNING POINT OF VIEW 25 SIMPLE LINEAR

STAT 213 Regression Inference II Colin Reimer Dawson Oberlin College 18 February 2016 Outline

Introduction to Data Science Winter Semester 2019/20 Oliver Ernst TU Chemnitz, Fakultt fr

Probability: Reasoning Under Uncertainty CS171, Winter Quarter, 2019 Introduction to Artificial

Probabilistic Reasoning Philipp Koehn 4 April 2017 Philipp Koehn Artificial Intelligence:

Thomas Bayes Needs a Volunteer So good to see you again! Two Envelopes I have two envelopes,

Machine Learning for Computational Linguistics Some probability distributions .

Sambuz

Useful Links

Newsletter

Mail Us

Implementing Bootstrap Methods in R GETTING STARTED WITH - PowerPoint PPT Presentation

Implementing Bootstrap Methods in R GETTING STARTED WITH BOOTSTRAPPING IN R Janani Ravi CO-FOUNDER, LOONYCORN www.loonycorn.com Estimating statistics and calculating Overview confidence intervals The Central Limit Theorem Conventional

A better Bootstrap, Mack, and the ELRF and PTF modelling Frameworks Bootstrap technique- a

STAT 113 Bootstrap Confidence Intervals Colin Reimer Dawson Oberlin College 3 March 2017

1 Get Started 2 3 Web Application Development What is Bootstrap? Bootstrap is a free

Cross-validation and the Bootstrap In the section we discuss two resampling methods:

Bootstrap Shan-Hung Wu CS, NTHU Landing Page HTML/CSS taught so far Bootstrap 4 (alpha

AngularJS &amp; Bootstrap Tabs, Forms, Models Tabs inside out Bootstrap has classes that

Bootstrap Percolation on Periodic Trees Milan Bradonji work with Iraj Saniee Bell Labs,

Lecture 21: Bootstrap and Permutation Tests The bootstrap Bootstrapping generally refers to

Bootstrap: A framework for CSS Jay Urbain, Ph.D. Credits: http://getbootstrap.com/

Unit 4: Inference for numerical variables Lecture 1: Bootstrap, paired, and two sample Statistics

Anima IETF 93 Charter Discussion Design Team Update bootstrap design team

Bootstrap method for misspecified stochastic differential equation models Yuma Uehara The

Parametric bootstrap August 30, 2017 Resampling from the data or from distribution Simple

Stochastic Simulation The Bootstrap method Bo Friis Nielsen Institute of Mathematical Modelling

Bootstrapping 18.05 Spring 2018 Agenda Leftover from 5/2 : binomial confidence intervals

Stochastic Simulation Non-parametric technique The Bootstrap method Bo Friis Nielsen

Compressed sensing, sparsity and p-values Sara van de Geer April 16, 2015 (Leiden) Dantzig

LINEAR REGRESSION LINEAR REGRESSION - FROM A MACHINE LEARNING POINT OF VIEW 25 SIMPLE LINEAR

STAT 213 Regression Inference II Colin Reimer Dawson Oberlin College 18 February 2016 Outline

Introduction to Data Science Winter Semester 2019/20 Oliver Ernst TU Chemnitz, Fakultt fr

Probability: Reasoning Under Uncertainty CS171, Winter Quarter, 2019 Introduction to Artificial

Probabilistic Reasoning Philipp Koehn 4 April 2017 Philipp Koehn Artificial Intelligence:

Thomas Bayes Needs a Volunteer So good to see you again! Two Envelopes I have two envelopes,

Machine Learning for Computational Linguistics Some probability distributions .

Sambuz

Useful Links

Newsletter

Mail Us

AngularJS & Bootstrap Tabs, Forms, Models Tabs inside out Bootstrap has classes that