STA 326 2.0 Programming and Data Analysis with R Generating Random - PDF document

STA 326 2.0 Programming and Data Analysis with R Generating Random Numbers Using the Inverse Transform Method Prepared by Dr Thiyanga Talagala 1. Probability distribution functions in R to generate random numbers rbeta beta distribution rlnorm log-normal distribution binomial distribution multinomial distribution rbinom rmultinom Cauchy distribution negative binomial distribution rcauchy rnbinom chi-squared distribution normal distribution rchisq rnorm exponential distribution Poisson distribution rexp rpois F distribution Student’s t distribution rf rt gamma distribution uniform distribution rgamma runif geometric distribution Weibull distribution rgeom rweibull hyper-geometric distribution rhyper There are other methods of generating random numbers from a particular distribution. In this lectorial we will discuss Inverse Transform Method . 2. Inverse transform method Theorem 1: Probability Integral Transformation Let X have continuous cdf F X ( x ) and define the random variable Y as Y = F X ( X ). Then Y is uniformly distributed on (0, 1), that is, P ( Y ≤ y ) = y, 0 < y < 1 . Let’s try to understand the theorem using an example. 1

Useful results to prove the theorem. Result 1: If F X is strictly increasing, then F − 1 is well defined by X F − 1 X ( y ) = x ⇔ F X ( x ) = y. If F X is constant on some interval, then F − 1 is not well defined by the above equation. To avoid this problem X we define F − 1 X ( y ) for 0 < y < 1 by F − 1 X ( y ) = inf { x : F X ( x ) ≥ y } . 3

Result 2: If F X is strictly increasing, then it is true that F − 1 X ( F X ( x )) = x. Proof of Theorem 1: For Y = F X ( X ) we have, for 0 < y < 1 , 4

We can use Theorem 1 to generate random numbers from a particular distribution. 5

3. Steps in deriving random numbers using integral transformation method 1. Derive the cumulative distribution function of f X ( x ) 2. Derive the inverse function F − 1 X ( u ). 3. Write a function to generate random numbers. • Generate u from Uniform (0 , 1). • compute x = F − 1 X ( u ). Example 1 Write a function to generate n random numbers from the distribution with density f X ( x ) = 3 x 2 , 0 < x < 1 . Step 1: Find the cumulative distribution function of f X ( x ), F X ( x ) = x 3 for 0 < x < 1 6

Step 2: Next we need to compute F − 1 X ( u ), 1 F − 1 3 . X ( u ) = u Step 3: R function generate_it <- function (n){ # Generate random numbers u <- runif (n) xgen <- u ^ (1 / 3) xgen } set.seed (2020) generate_it (10) 7

[1] 0.8648611 0.7332437 0.8520145 0.7812795 0.5143788 0.4069300 0.5054766 [8] 0.7325562 0.1372012 0.8527963 Visualisation of theoretical distribution library (tidyverse) # Theoretical distribution values theoretical.df <- tibble (x = seq (0, 1, 0.01), fx = 3 * x ^ 2) ggplot (theoretical.df, aes (x = x, y = fx)) + geom_line (col = "red") 3 2 fx 1 0 0.00 0.25 0.50 0.75 1.00 x Visualize empirical distribution - counts empirical.df <- data.frame (data.emp = generate_it (1000)) # Plot empirical distribution - counts ggplot (empirical.df, aes (x = data.emp)) + geom_histogram (col = "white", binwidth = 0.05) 100 count 50 0 0.25 0.50 0.75 1.00 data.emp 8

Visualize empirical distribution - density ggplot (empirical.df, aes (x = data.emp, y=..density..)) + geom_histogram (col = "white", binwidth = 0.05) 2 density 1 0 0.25 0.50 0.75 1.00 data.emp Visualize theoretical distribution and empirical distribution together ggplot (empirical.df, aes (x = data.emp, y=..density..)) + geom_histogram (col = "white", binwidth = 0.05) + geom_line (data = theoretical.df, aes (x = x, y = fx), color = 'red') 3 2 density 1 0 0.00 0.25 0.50 0.75 1.00 data.emp 9

Function to generate random numbers and visualize theoretical and empirical distributions generate_it_dist <- function (n){ # Generate random numbers u <- runif (n) xgen <- u ^ (1 / 3) xgen # values for empirical distribution empirical.df <- data.frame (xgen=xgen) # values for the theoretical distribution theoretical.df <- tibble (x = seq (0, 1, 0.01), fx = 3 * x ^ 2) # arrange values and plot into a list list ( xgen, ggplot2 ::ggplot (empirical.df, aes (x=xgen, y=..density..)) + geom_histogram (col="white", binwidth = 0.01) + geom_line (data = theoretical.df, aes (x = x, y = fx), color = 'red') ) } Run the following codes and check the outputs. # Sample size 10 generate_it_dist (10) # Sample size 100 n100 <- generate_it_dist (100) n100 n100[[1]] n100[[2]] # Sample size 10000 n10000 <- generate_it_dist (10000) n10000[[2]] 10

Example 2 i) Write a function to generate random numbers from the Exponential ( λ ) distribution using the inverse transformation method. ii) Generate 1000 random numbers from the Exponential (2) distribution. 11

iii) Graph the density histogram of the sample with the Exponential (2) density superimposed for compari- son. 12

STA 326 2.0 Programming and Data Analysis with R Generating Random - PDF document

STA 326 2.0 Programming and Data Analysis with R Generating Random Numbers Using the Inverse Transform Method Prepared by Dr Thiyanga Talagala 1. Probability distribution functions in R to generate random num- bers rbeta beta

Sta$s$cs Sta$s$cs Fourth Dimension of a Sta$s$cal Programmer

F orwa rd L ooking Sta te me nt Ce rta in o f the sta te me nts ma de in this Pre se nta tio

Overview of ASC 326-20 (CECL) FASB Accounting Standards Update (ASU) 2016-13, Financial

STA Graduation 2019/20 STA Graduation Application https://forms.gle/tZsKJXUmbAQgcSn57 This google

2011 11 12 12 th th at t Sta tate te (3:18.02) :18.02) 2012 12 10 10 th th at t

STA STA 2Q 2Q19 19 An Analyst lyst Pre Presentation entation 1 CO CONTENTS TENTS 1. .

STA STA 4Q 4Q19 19 & FY & FY19 19 An Analy lyst st Pre Presentat sentation ion

STA STA 1Q 1Q19 19 An Analyst lyst Pre Presentation entation 1 CO CONTENTS TENTS 1.

Open Water Swimming Speaker: Dave Candler, STA President Qualifications STA Level 1 Award for

STA STA 1Q 1Q20 20 Pr Prese esentation ntation Opportu ortunity nity Day 5 June e 2020

STA 214: Probability & Statistical Models STA 214: Analysis of Statistical Models

CSE 326: Data Structures distinguished vertex s , find the shortest weighted path from s to every

Lesson 2 Lexical Analysis CS 226/326 Spring 2003 Lexical Analysis Transform source program

COMPSCI 326 Web Programming React State and Interactivity Objectives Understand React State

COMPSCI 326 Web Programming Week 09: ER Diagram Sketches Agenda 4:00 4:35 ER Diagram

STA - Static Timing Analysis STA Lecturer: Gil Rahav Semester B , EE Dept. BGU. Freescale

A study of entropy transfers in the Linux Random Number Generator Th. Vuillemin, F . Goichon, G.

More Graphics and Objects Rose-Hulman Institute of Technology Computer Science and Software

Group Keys Mathy Vanhoef - imec-DistriNet, KU Leuven @vanhoefm Observation General Wi-Fi

Lecture 20 Random Samples 0/ 13 One of the most important concepts in statistics is that of a

Student Responsibilities Mat 2170 Week 9 Reading: Textbook, Sections 6.1 6.3 Objects and

Extending Ant Steve Loughran stevel@apache.org About the speaker Research on deployment at HP

Chapter 7: Sampling In this chapter we will cover: 1. Samples and Populations ( 7.1, 7.2 Rice)

Lecture: Sampling and Standard Error 6.0002 LECTURE 8 1 Annou An ouncem emen ents Relevant

STA 326 2.0 Programming and Data Analysis with R Generating Random - PDF document

STA 326 2.0 Programming and Data Analysis with R Generating Random Numbers Using the Inverse Transform Method Prepared by Dr Thiyanga Talagala 1. Probability distribution functions in R to generate random num- bers rbeta beta

Sta$s$cs Sta$s$cs Fourth Dimension of a Sta$s$cal Programmer

F orwa rd L ooking Sta te me nt Ce rta in o f the sta te me nts ma de in this Pre se nta tio

Overview of ASC 326-20 (CECL) FASB Accounting Standards Update (ASU) 2016-13, Financial

STA Graduation 2019/20 STA Graduation Application https://forms.gle/tZsKJXUmbAQgcSn57 This google

2011 11 12 12 th th at t Sta tate te (3:18.02) :18.02) 2012 12 10 10 th th at t

STA STA 2Q 2Q19 19 An Analyst lyst Pre Presentation entation 1 CO CONTENTS TENTS 1. .

STA STA 4Q 4Q19 19 &amp; FY &amp; FY19 19 An Analy lyst st Pre Presentat sentation ion

STA STA 1Q 1Q19 19 An Analyst lyst Pre Presentation entation 1 CO CONTENTS TENTS 1.

Open Water Swimming Speaker: Dave Candler, STA President Qualifications STA Level 1 Award for

STA STA 1Q 1Q20 20 Pr Prese esentation ntation Opportu ortunity nity Day 5 June e 2020

STA 214: Probability &amp; Statistical Models STA 214: Analysis of Statistical Models

CSE 326: Data Structures distinguished vertex s , find the shortest weighted path from s to every

Lesson 2 Lexical Analysis CS 226/326 Spring 2003 Lexical Analysis Transform source program

COMPSCI 326 Web Programming React State and Interactivity Objectives Understand React State

COMPSCI 326 Web Programming Week 09: ER Diagram Sketches Agenda 4:00 4:35 ER Diagram

STA - Static Timing Analysis STA Lecturer: Gil Rahav Semester B , EE Dept. BGU. Freescale

A study of entropy transfers in the Linux Random Number Generator Th. Vuillemin, F . Goichon, G.

More Graphics and Objects Rose-Hulman Institute of Technology Computer Science and Software

Group Keys Mathy Vanhoef - imec-DistriNet, KU Leuven @vanhoefm Observation General Wi-Fi

Lecture 20 Random Samples 0/ 13 One of the most important concepts in statistics is that of a

Student Responsibilities Mat 2170 Week 9 Reading: Textbook, Sections 6.1 6.3 Objects and

Extending Ant Steve Loughran stevel@apache.org About the speaker Research on deployment at HP

Chapter 7: Sampling In this chapter we will cover: 1. Samples and Populations ( 7.1, 7.2 Rice)

Lecture: Sampling and Standard Error 6.0002 LECTURE 8 1 Annou An ouncem emen ents Relevant

STA STA 4Q 4Q19 19 & FY & FY19 19 An Analy lyst st Pre Presentat sentation ion

STA 214: Probability & Statistical Models STA 214: Analysis of Statistical Models