Probability and Statistics for Computer Science Can we call - PowerPoint PPT Presentation

Probability and Statistics ì for Computer Science Can we call the e exci-ng ? e � n � 1 + 1 e = lim n n →∞ Credit: wikipedia Hongye Liu, Teaching Assistant Prof, CS361, UIUC, 10.1.2020

Last time

Objectives ✺ Normal (Gaussian) distribu-on ✺ Exponen-al distribu-on

Cumulative distribution of continuous uniform distribution ✺ Cumula-ve distribu-on func-on (CDF) � x P ( X ≤ x ) = p ( x ) dx −∞ of a uniform random variable X is: CDF p ( x ) 1 1 b − a 1 0 a 0 a b X b X

Normal (Gaussian) distribution ✺ The most famous con-nuous random variable distribu-on. The probability density is this: 2 π exp ( − ( x − µ ) 2 1 p ( x ) = ) √ 2 σ 2 σ Carl F. Gauss (1777-1855) Credit: wikipedia

Normal (Gaussian) distribution ✺ The most famous con-nuous random variable distribu-on. The probability density is this: 2 π exp ( − ( x − µ ) 2 1 p ( x ) = ) √ 2 σ 2 σ E [ X ] = µ & var [ X ] = σ 2 Carl F. Gauss (1777-1855) Credit: wikipedia

Normal (Gaussian) distribution ✺ The most famous con-nuous random variable distribu-on. 2 π exp ( − ( x − µ ) 2 1 p ( x ) = ) √ 2 σ 2 σ � + ∞ p ( x ) dx = 1 −∞ E [ X ] = µ & var [ X ] = σ 2 Carl F. Gauss (1777-1855) Credit: wikipedia

Normal (Gaussian) distribution ✺ A lot of data in nature are approximately normally distributed, ie. Adult height , etc. 2 π exp ( − ( x − µ ) 2 1 p ( x ) = ) √ 2 σ 2 σ E [ X ] = µ & var [ X ] = σ 2 Carl F. Gauss (1777-1855) Credit: wikipedia

PDF and CDF of normal distribution curves Credit: wikipedia

Spread of normal (Gaussian) distributed data 99.7% 95% 68% Credit: wikipedia

Standard normal distribution ✺ If we standardize the normal distribu-on (by subtrac-ng μ and dividing by σ), we get a random variable that has standard normal distribu-on. ✺ A con-nuous random variable X is standard normal if 2 π exp ( − x 2 1 p ( x ) = 2 ) √

Derivation of standard normal distribution � + ∞ x = x − µ ˆ p ( x ) dx σ −∞ � + ∞ 2 π exp ( − ( x − µ ) 2 1 = ) dx √ 2 σ 2 σ −∞ � + ∞ x 2 1 2 π exp ( − ˆ = 2 ) σ d ˆ x √ σ Call this standard and omit −∞ � + ∞ x 2 2 π exp ( − ˆ 1 using a hat = 2 ) d ˆ x √ −∞ � + ∞ 2 π exp ( − x 2 1 = p (ˆ x ) dx p ( x ) = 2 ) −∞ √

Q. What is the mean of standard normal? A. 0 B. 1

Q. What is the standard deviation of standard normal? A. 0 B. 1

Standard normal distribution ✺ If we standardize the normal distribu-on (by subtrac-ng μ and dividing by σ), we get a random variable that has standard normal distribu-on. ✺ A con-nuous random variable X is standard normal if 2 π exp ( − x 2 1 p ( x ) = 2 ) √ E [ X ] = 0 & var [ X ] = 1

Another way to check the spread of normal distributed data ✺ Frac-on of normal data within 1 standard devia-on from the mean. � 1 exp ( − x 2 1 2 ) dx ≃ 0 . 68 √ 2 π − 1 ✺ Frac-on of normal data within k standard devia-ons from the mean. � k exp ( − x 2 1 2 ) dx √ 2 π − k

Using the standard normal’s table to calculate for a normal distribution’s probability ✺ If X ~ N (μ=3, σ 2 =16) (normal distribu-on) P ( X ≤ 5) =?

Q. ✺ If X ~ N (μ=3, σ 2 =16) (normal distribu-on) A . 0.5199 B. 0.5987 C. 0.6915 P ( X ≤ 5) =?

Q. Is the table with only positive x values enough? A. Yes B. No.

Central limit theorem (CLT) ✺ The distribu-on of the sum of N independent iden-cal (IID) random variables tends toward a normal distribu-on as N ∞ ✺ Even when the component random variables are not exactly IID, the result is approximately true and very useful in prac-ce

Central limit theorem (CLT) ✺ CLT helps explain the prevalence of normal distribu-ons in nature ✺ A binomial random variable tends toward a normal distribu-on when N is large due to the fact it is the sum of IID Bernoulli random variables

The Binomial distributed beads of the Galton Board The Binomial distribu-on looks very similar to Normal when N is large

Binomial approximation with Normal Binomial distribu-on Approxima-on with Normal Binomial Normal 0.20 0.20 n=20,p=0.5 n=20,p=0.7 ● μ = 20, σ 2 = 10 n=40,p=0.5 ● ● ● ● ● 0.15 0.15 n= 40, p=0.5 ● ● ● ● ● ● ● ● ● ● probability probability 0.10 0.10 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 0.05 0.05 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 0.00 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 0.00 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 0 10 20 30 40 0 10 20 30 40 k k

Binomial approximation with Normal ✺ Let k be the number of heads appeared in 40 tosses of fair coin ✺ The goal is to es-mate the following with normal 25 � 40 � � 0 . 5 k 0 . 5 40 − k P (10 ≤ k ≤ 25) = k k =10 25 � 40 � 0 . 5 40 ≃ 0 . 96 � = k k =10 � E [ k ] = np = 40 · 0 . 5 = 20 std [ k ] = np (1 − p ) √ √ = 40 · 0 . 5 · 0 . 5 = 10

Binomial approximation with Normal ✺ Use the same mean and standard devia-on of the original binomial distribu-on. √ µ = 20 σ = 10 ≃ 3 . 16 ✺ Then standardize the normal to do the calcula-on � 25 exp ( − ( x − µ ) 2 1 P (10 ≤ k ≤ 25) ≃ ) dx √ 2 σ 2 2 π σ 10 25 − 20 exp ( − x 2 1 � 3 . 16 = 2 ) dx √ 2 π 10 − 20 3 . 16 ≃ 0 . 94

Exponential distribution ✺ Common p ( x ) = λ e − λ x for x ≥ 0 Model for wai-ng -me ✺ Associated with the Poisson distribu-on with the same λ Credit: wikipedia

Exponential distribution ✺ A con-nuous random variable X is exponen-al if it represent the “-me” un-l next incident in a Poisson distribu-on with intensity λ . Proof See Degroot et al Pg 324. p ( x ) = λ e − λ x for x ≥ 0 ✺ It’s similar to Geometric distribu>on – the discrete version of wai-ng in queue

Expectations of Exponential distribution ✺ A con-nuous random variable X is exponen-al if it represent the “-me” un-l next incident in a Poisson distribu-on with intensity λ . p ( x ) = λ e − λ x for x ≥ 0 E [ X ] = 1 & var [ X ] = 1 x λ 2 λ

Example of exponential distribution ✺ How long will it take un-l the next call to be received by a call center? Suppose it’s a random variable T . If the number of incoming call is a Poisson distribu-on with intensity λ = 20 in an hour . What is the expected -me for T?

Q: ✺ A store has a number of customers coming on Sat. that can be modeled as a Poisson distribu-on. In order to measure the average rate of customers in the day, the staff recorded the -me between the arrival of customers, can he reach the same goal? A. Yes B. No

Additional References ✺ Charles M. Grinstead and J. Laurie Snell "Introduc-on to Probability” ✺ Morris H. Degroot and Mark J. Schervish "Probability and Sta-s-cs”

See you next time See You!

Probability and Statistics for Computer Science Can we call - PowerPoint PPT Presentation

Probability and Statistics for Computer Science Can we call the e exci-ng ? e n 1 + 1 e = lim n n Credit: wikipedia Hongye Liu, Teaching Assistant Prof, CS361, UIUC, 10.1.2020 Last time Objectives Normal

Probability Basics Martin Emms October 1, 2020 Probability Basics Outline Probability

Continuing Probability. Wrap up: Total Probability and Conditional Probability. Continuing

Chapter 2 Probability 1. Definition of Probability 2. Probability of disjoint events 3.

Probability Basics Probability Background Martin Emms October 1, 2020 Probability Basics

Chapter 2 Probability 1. Definition of Probability 2. Probability of disjoint events 3.

Categorical Probability and Statistics Peter McCullagh Department of Statistics University of

Counting and Probability Whats to come? Counting and Probability Whats to come?

Unit 2: Probability and distributions Lecture 1: Probability and conditional probability

Which probability Which probability Which probability Which probability theory for cosmology?

Recap of Basic Probability Elements of basic probability theory probability theory The

1 2 3 4 Stopping Probability Visiting Probability 5 Stopping

ACMS 20340 Statistics for Life Sciences Chapter 9: Introducing Probability Why Consider

Statistics 1B Statistics 1B 1 (11) 0. Lecture 1. Introduction and probability review

Statistics 370 Probability and Statistics for Engineers Instructor: Peter Bloomfield Course

Chapter II.2: Basic Probability Theory and Statistics 1. What is a probability? 1.1. Probability

Official Statistics Matt Dray, Assistant Statistician Official Statistics 2 Official

Laboratorio de Ciberseguridad Probability, Random Processes and Inference Dr. Ponciano Jorge

A functional central limit theorem for branching random walks, with applications to Quicksort

6 7 9 2 8 9 7 4 6 8 1 2 1 8 7 2 1 2 8 2 5 7 1 9 2 4 7 8 8 9 3 4 1 4 2 9 6 6 2 4 7

Basic Statistics and Probability Theory Based on Foundations of Statistical NLP C. Manning

Combinatorics of Biomolecules C.M. Reidys Nankai University Center for Combinatorics, LMPC 1

Method of cumulants and mod-Gaussian convergence of the graphon models Pierre-Loc Mliot

Central Toronto Integrated Regional Resource Plan Meeting Torontos Electricity Needs for the

Borrego Valley Groundwater Basin Borrego Springs Subbasin Chapters 1-5 Draft Groundwater