02 Sampling algorithms
Shravan Vasishth
SMLP
MCMC sampling: the inversion method for sampling

This method works when we know the closed form of the pdf we want to simulate from and can derive the inverse of its CDF. Steps:

1. Sample one number u from Uniform(0, 1). Let u = F(z) = \int_L^z f(x)\, dx (here, L is the lower bound of the support of the pdf f).
2. Then z = F^{-1}(u) is a draw from f(x).
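The two steps above can be sketched in code. The slides use R; the following Python sketch (an illustration added here, not from the slides) applies the recipe to the Exponential(1) distribution, whose CDF F(z) = 1 - e^{-z} has the closed-form inverse F^{-1}(u) = -log(1 - u):

```python
import math
import random

def inverse_cdf_exponential(u):
    """Closed-form inverse of the Exponential(rate = 1) CDF, F(z) = 1 - exp(-z)."""
    return -math.log(1.0 - u)

random.seed(1)
# Step 1: draw u ~ Uniform(0, 1); Step 2: z = F^{-1}(u) is a draw from f.
samples = [inverse_cdf_exponential(random.random()) for _ in range(100_000)]

# The Exponential(1) distribution has mean 1, so the sample mean should be near 1.
mean = sum(samples) / len(samples)
```

Any distribution whose CDF can be inverted in closed form can be sampled in exactly this way.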
Example 1: Samples from the Standard Normal

Take a sample from the Uniform(0,1):

u <- runif(1, min = 0, max = 1)

Let f(x) be a Normal density; we want to sample from this density. The inverse of the CDF in R is qnorm. It takes a probability as input and returns a quantile.

qnorm(u)
## [1] 0.87794
Example 1: Samples from the Standard Normal

If we do this repeatedly, we will get samples from the Normal distribution (here, the standard normal).

nsim <- 10000
samples <- rep(NA, nsim)
for (i in 1:nsim) {
  u <- runif(1, min = 0, max = 1)
  samples[i] <- qnorm(u)
}
Example 1: Samples from the Standard Normal

hist(samples, freq = FALSE, main = "Standard Normal")

[Figure: density histogram of the samples, matching the standard Normal density.]
Example 2: Samples from the Exponential or Gamma

Now try this with the exponential with rate 1:

nsim <- 10000
samples <- rep(NA, nsim)
for (i in 1:nsim) {
  u <- runif(1, min = 0, max = 1)
  samples[i] <- qexp(u)
}
Example 2: Samples from the Exponential or Gamma

hist(samples, freq = FALSE, main = "Exponential")

[Figure: density histogram of the samples, matching the Exponential(1) density.]
Example 2: Samples from the Exponential or Gamma

Or the Gamma with rate and shape 1:

nsim <- 10000
samples <- rep(NA, nsim)
for (i in 1:nsim) {
  u <- runif(1, min = 0, max = 1)
  samples[i] <- qgamma(u, rate = 1, shape = 1)
}
Example 2: Samples from the Exponential or Gamma

hist(samples, freq = FALSE, main = "Gamma")

[Figure: density histogram of the samples, matching the Gamma(1, 1) density.]
Example 3

Let f(x) = \frac{1}{40}(2x + 3), with 0 < x < 5. Now we can't just use the family of q functions in R, because this density is not built into R. We have to draw a number u from the uniform distribution and then solve for z, which amounts to finding the inverse function:

u = \int_0^z \frac{1}{40}(2x + 3)\, dx   (1)

Solving the resulting quadratic for z gives z = \frac{1}{2}\left(-3 + \sqrt{160u + 9}\right):

u <- runif(1000, min = 0, max = 1)
z <- (1/2) * (-3 + sqrt(160 * u + 9))

This method can't be used if we can't find the inverse, and it can't be used with multivariate distributions.
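The derived inverse can be checked numerically. The following Python re-implementation of the R snippet (an illustration added here, not from the slides) verifies that all draws land in (0, 5) and that the sample mean approaches the analytic mean \int_0^5 x f(x)\, dx = 725/240 \approx 3.02:

```python
import random

random.seed(42)
u_draws = [random.random() for _ in range(50_000)]

# Inverse CDF: solve u = (z^2 + 3z)/40 for the root z >= 0 via the quadratic formula.
z = [0.5 * (-3.0 + (160.0 * u + 9.0) ** 0.5) for u in u_draws]

mean_z = sum(z) / len(z)  # analytic mean is 725/240, about 3.02
```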
Gibbs sampling

Gibbs sampling is a very commonly used method in Bayesian statistics. Here is how it works.

Let \Theta be a vector of parameter values, with length k. Let j index the j-th iteration.

1. Assign some starting values to \Theta: \Theta^{j=0} \leftarrow S.
2. Set j \leftarrow j + 1.
3. Sample each parameter in turn, conditioning on the most recent values of the others:
   1. Sample \theta_1^j \mid \theta_2^{j-1}, \ldots, \theta_k^{j-1}.
   2. Sample \theta_2^j \mid \theta_1^j, \theta_3^{j-1}, \ldots, \theta_k^{j-1}.
   ...
   k. Sample \theta_k^j \mid \theta_1^j, \ldots, \theta_{k-1}^j.
4. Return to step 2.
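To make the loop concrete, here is a hypothetical Python sketch (not from the slides) of a k = 2 Gibbs sampler for a bivariate normal with correlation rho, whose full conditionals are known in closed form: \theta_1 \mid \theta_2 \sim N(\rho\theta_2, 1 - \rho^2), and symmetrically for \theta_2:

```python
import random

random.seed(0)
rho = 0.8
n_iter = 20_000
theta1, theta2 = [0.0], [0.0]  # starting values (step 1)

for _ in range(n_iter):
    # Sample theta1 conditional on the current theta2.
    t1 = random.gauss(rho * theta2[-1], (1 - rho ** 2) ** 0.5)
    # Sample theta2 conditional on the *new* theta1, as the algorithm requires.
    t2 = random.gauss(rho * t1, (1 - rho ** 2) ** 0.5)
    theta1.append(t1)
    theta2.append(t2)

# Discard burn-in, then check the sampled correlation against rho.
t1s, t2s = theta1[1000:], theta2[1000:]
m1 = sum(t1s) / len(t1s)
m2 = sum(t2s) / len(t2s)
cov = sum((a - m1) * (b - m2) for a, b in zip(t1s, t2s)) / len(t1s)
v1 = sum((a - m1) ** 2 for a in t1s) / len(t1s)
v2 = sum((b - m2) ** 2 for b in t2s) / len(t2s)
corr = cov / (v1 * v2) ** 0.5
```

Each draw conditions on the most recently sampled values of the other parameters, exactly as in steps 1 through k.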
Example: A simple bivariate distribution

Assume that our bivariate (joint) density is:

f(x, y) = \frac{1}{28}(2x + 3y + 2), \quad 0 < x < 2, \; 0 < y < 2   (2)

Using the methods discussed in the Foundations chapter, it is possible to work out the conditional distributions analytically from the joint distribution:

f(x \mid y) = \frac{f(x, y)}{f(y)} = \frac{2x + 3y + 2}{6y + 8}   (3)

f(y \mid x) = \frac{f(x, y)}{f(x)} = \frac{2x + 3y + 2}{4x + 10}   (4)
Example: A simple bivariate distribution

The Gibbs sampler algorithm is:

1. Set starting values for the two parameters: x = -5, y = -5. Set j = 0.
2. Sample x^{j+1} from f(x \mid y) using inversion sampling, and likewise y^{j+1} from f(y \mid x). You need to work out the inverses of the conditional CDFs first. For f(x \mid y), we have to find z_1:

u = \int_0^{z_1} \frac{2x + 3y + 2}{6y + 8}\, dx   (5)

And for f(y \mid x), we have to find z_2:

u = \int_0^{z_2} \frac{2x + 3y + 2}{4x + 10}\, dy   (6)
Example: A simple bivariate distribution

x <- rep(NA, 2000)
y <- rep(NA, 2000)
x[1] <- -5 ## initial values
y[1] <- -5
for (i in 2:2000) {
  # sample from x | y
  u <- runif(1, min = 0, max = 1)
  x[i] <- sqrt(u * (6 * y[i-1] + 8) +
               (1.5 * y[i-1] + 1) * (1.5 * y[i-1] + 1)) - (1.5 * y[i-1] + 1)
  # sample from y | x
  u <- runif(1, min = 0, max = 1)
  y[i] <- sqrt((2 * u * (4 * x[i] + 10)) / 3 +
               ((2 * x[i] + 2) / 3) * ((2 * x[i] + 2) / 3)) - ((2 * x[i] + 2) / 3)
}
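The same sampler can be re-expressed in Python as a cross-check (an illustration added here; it assumes the support 0 < x < 2, 0 < y < 2 implied by the normalizing constant 28). Although the chain starts at the deliberately bad values (-5, -5), all subsequent draws should fall inside the support:

```python
import random

random.seed(7)
n = 2000
x = [-5.0]  # deliberately bad initial values, as in the R code
y = [-5.0]

for _ in range(1, n):
    # sample from x | y by inverting the conditional CDF
    u = random.random()
    yy = y[-1]
    xi = (u * (6 * yy + 8) + (1.5 * yy + 1) ** 2) ** 0.5 - (1.5 * yy + 1)
    # sample from y | x, conditioning on the freshly drawn x
    u = random.random()
    yi = ((2 * u * (4 * xi + 10)) / 3
          + ((2 * xi + 2) / 3) ** 2) ** 0.5 - ((2 * xi + 2) / 3)
    x.append(xi)
    y.append(yi)
```

The analytic marginal mean of x is \int_0^2 x (4x + 10)/28\, dx = 92/84 \approx 1.10, which the post-burn-in sample mean should approach.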
Example: A simple bivariate distribution

You can run this code to visualize the simulated posterior distribution. See Figure 1.

library(MASS)
bivar.kde <- kde2d(x, y)
persp(bivar.kde, phi = 10, theta = 90, shade = 0, border = NA,
      main = "Simulated bivariate density using Gibbs sampling")

Figure 1: Example of the posterior distribution of a bivariate distribution.
Example: A simple bivariate distribution

A central insight here is that knowledge of the conditional distributions is enough to simulate from the joint distribution, provided such a joint distribution exists.
Random walk Metropolis

Start at a random location \theta^0 \in \Theta.
For step i = 1, \ldots, I:
◮ Propose a new location \theta^{i+1} using a "symmetric jumping distribution".
◮ Calculate the ratio r = \frac{lik(\theta^{i+1}) \times prior(\theta^{i+1})}{lik(\theta^i) \times prior(\theta^i)}.
◮ Generate u \sim Uniform(0, 1).
◮ If r > u, move from \theta^i to \theta^{i+1}; else stay at \theta^i.
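A minimal Python sketch of this loop (an illustration added here, not from the slides), using an unnormalized N(0, 1) density to stand in for lik \times prior and a normal proposal as the symmetric jumping distribution:

```python
import math
import random

def unnorm_post(theta):
    """Stands in for lik(theta) * prior(theta); here an unnormalized N(0, 1)."""
    return math.exp(-0.5 * theta * theta)

random.seed(3)
chain = [4.0]  # arbitrary starting location theta^0
for _ in range(50_000):
    current = chain[-1]
    proposal = current + random.gauss(0.0, 1.0)  # symmetric jumping distribution
    r = unnorm_post(proposal) / unnorm_post(current)
    u = random.random()
    # If r > u, move to the proposal; else stay put.
    chain.append(proposal if r > u else current)

post = chain[1000:]  # drop burn-in
mean = sum(post) / len(post)
var = sum((t - mean) ** 2 for t in post) / len(post)
```

Because the target here is N(0, 1), the retained samples should have mean near 0 and variance near 1.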
Random Walk Metropolis

[Figure: the posterior (likelihood \times prior) plotted against \theta, with the chain's current position marked.]

Propose a location to jump to.

[Figure: the same posterior, with the proposed location marked.]

Calculate the ratio of the proposed to the current likelihood \times prior; here, ratio = 0.83.

Take a sample u \sim Uniform(0, 1). Suppose u = 0.90. Since ratio < u, remain at the current position (reject the proposal).

Make a new proposal and compute the proposed/current ratio; here, ratio = 1.33.

Move to the new location because ratio > 1.
Hamiltonian Monte Carlo

Instead of Gibbs sampling or Metropolis, Stan uses this more efficient sampling approach. HMC works well for the high-dimensional models we will fit (hierarchical models). Gibbs sampling faces difficulties with some of the complex hierarchical models we will be fitting later; HMC generally handles these models much better.
Hamiltonian Monte Carlo

One limitation of HMC (which Gibbs sampling does not have) is that HMC only works with continuous parameters, not discrete ones. For our purposes, it is enough to know what sampling using MCMC is, and that HMC gives us posterior samples efficiently. A good reference explaining HMC is Neal 2011, although this paper is technically very demanding. More intuitively accessible introductions are available via Michael Betancourt's home page: https://betanalpha.github.io/. In particular, this video is helpful: https://youtu.be/jUSZboSq1zg.
Background: Hamiltonian dynamics

Imagine an ice puck moving over a frictionless surface of varying height.

The puck moves at constant velocity (momentum) k on a flat surface.
When the puck moves up an incline, its kinetic energy goes down and its potential energy goes up.
When the puck slows down and comes to a halt, its kinetic energy becomes 0.
When the puck slides back, its kinetic energy goes up and its potential energy goes down.

See animation.
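The puck analogy maps onto HMC as follows (a minimal sketch following the standard formulation in Neal 2011, not code from the slides): the potential energy U(\theta) is the negative log of the unnormalized posterior, a randomly drawn momentum p is the "kick", the frictionless dynamics are simulated with the leapfrog integrator, and a Metropolis step corrects for integration error. Here for a standard normal target:

```python
import math
import random

def U(theta):          # potential energy: -log of an unnormalized N(0, 1) density
    return 0.5 * theta * theta

def grad_U(theta):     # its gradient
    return theta

def leapfrog(theta, p, eps, n_steps):
    """Simulate the frictionless dynamics: half-step the momentum,
    alternate full position/momentum steps, then a final half-step."""
    p -= 0.5 * eps * grad_U(theta)
    for _ in range(n_steps - 1):
        theta += eps * p
        p -= eps * grad_U(theta)
    theta += eps * p
    p -= 0.5 * eps * grad_U(theta)
    return theta, p

random.seed(5)
chain = [3.0]
for _ in range(5000):
    theta = chain[-1]
    p = random.gauss(0.0, 1.0)  # fresh momentum: the "kick" given to the puck
    new_theta, new_p = leapfrog(theta, p, eps=0.2, n_steps=10)
    # Accept/reject based on the change in total energy H = potential + kinetic.
    current_H = U(theta) + 0.5 * p * p
    new_H = U(new_theta) + 0.5 * new_p * new_p
    chain.append(new_theta if random.random() < math.exp(current_H - new_H)
                 else theta)

post = chain[500:]  # drop burn-in
mean = sum(post) / len(post)
var = sum((t - mean) ** 2 for t in post) / len(post)
```

Exact dynamics would conserve H and always accept; the Metropolis step merely corrects the small discretization error of the leapfrog integrator.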