Statistics 300: Elementary Statistics Sections 7-2, 7-3, 7-4, 7-5 - PDF document

Statistics 300: Elementary Statistics Sections 7-2, 7-3, 7-4, 7-5 Parameter Estimation • Point Estimate – Best single value to use • Question – What is the probability this estimate is the correct value? Parameter Estimation • Question – What is the probability this estimate is the correct value? • Answer – zero : assuming “ x” is a continuous random variable – Example for Uniform Distribution 1

If X ~ U[100,500] then • P(x = 300) = (300-300)/(500-100) • = 0 100 300 400 500 Parameter Estimation µ • Pop. mean – Sample mean x p • Pop. proportion ˆ – Sample proportion p σ • Pop. standard deviation – Sample standard deviation s Problem with Point Estimates • The unknown parameter ( µ , p, etc.) is not exactly equal to our sample-based point estimate. • So, how far away might it be? • An interval estimate answers this question. 2

Confidence Interval • A range of values that contains the true value of the population parameter with a ... • Specified “level of confidence”. • [L(ower limit),U(pper limit)] Terminology • Confidence Level (a.k.a. Degree of Confidence) – expressed as a percent (%) • Critical Values (a.k.a. Confidence Coefficients) Terminology • “alpha” “ α ” = 1-Confidence – more about α in Chapter 7 • Critical values – express the confidence level 3

Confidence Interval for µ lf σ is known (this is a rare situation) = ± x E  σ  = ⋅   E z α 2   n Confidence Interval for µ lf σ is known (this is a rare situation) if x ~N(?, σ ) σ   = ± ⋅   x z α   n 2 Why does the Confidence Interval for µ look like this ? σ   = ± ⋅   x z α   n 2 4

σ µ x ~ N ( , ) n make an x value into a z - score. The general z - score expression is − µ ) = x ( z σ for x , µ µ is : unchanged x and σ σ is x n 5

so a z - score based on x is − µ x = z σ n Using the Empirical Rule Make a probabilit y statement :     − µ ( x ) − < < =   P 2 2 95 % σ     n Normal Distribution 0 4 . 5 Relative likelihood 0 . 4 0 α 0 . 3 5 α         0 . 3 0     2 2 0 2 . 5 . 0 2 0 0 1 . 5 0 . 1 0 0 . 5 0 . 0 0 0 -3 -2 -1 0 1 2 3 Value of Observation 6

Check out the “Confidence z-scores” on the WEB page. (In pdf format.) Use basic rules of algebra to rearrange the parts of this z-score. Manipulate the probabilit y statement :       ( ) σ σ − < − µ < =       P 2 x 2 0 . 95       n n 7

Manipulate the probabilit y statement :       σ σ − − < − µ < − + =       P x 2 x 2 0 . 95       n n Confidence = 95% α = 1 - 95% = 5% α/2 = 2.5% = 0.025 Manipulate the probabilit y statement : multiply t hrough by (-1) and change the order of the terms       σ σ  − < µ < +  =     P x 2 x 2 0 . 95       n n Confidence = 95% α = 1 - 95% = 5% α/2 = 2.5% = 0.025 Confidence Interval for µ lf σ is not known (usual situation)   s = ± ⋅   x t α   n 2 8

Sample Size Needed to Estimate µ within E, with Confidence = 1- α 2   ⋅ σ ˆ Z α   = 2 n   E   Components of Sample Size Formula when Estimating µ • Z α /2 reflects confidence level – standard normal distribution σ σ • ˆ is an estimate of , the standard deviation of the pop. • E is the acceptable “margin of error” when estimating µ Confidence Interval for p • The Binomial Distribution gives us a starting point for determining the distribution of the sample proportion : p ˆ x successes = = ˆ p n trials 9

For Binomial “x” µ = np σ = npq For the Sample Proportion 1 ( ) x = = ˆ p x n n x is a random variable n is a constant Time Out for a Principle: µ If is the mean of X and “a” is a constant, what is the mean of aX? ⋅ µ . a Answer: 10

Apply that Principle! • Let “a” be equal to “1/n”   1 X • so = = =   p ˆ aX X   n n µ = µ = • and a a ( np ) ˆ p x   1 = ⋅ =   np p   n Time Out for another Principle: σ 2 If is the variance of X and “a” x is a constant, what is the variance of aX? σ = a σ . 2 2 2 Answer: aX x Apply that Principle! • Let x be the binomial “x” • Its variance is npq = np(1-p), which is the square of is standard deviation 11

Apply that Principle! • Let “a” be equal to “1/n”   1 X = = =   • so p ˆ aX X   n n ( ) σ = σ = 2 2 2 2 • and a 1 / n ( npq ) ˆ p X Apply that Principle! 2   1 pq ⋅ = = σ   2 npq p ˆ   n n and pq σ = ˆ p n When n is Large,   pq   µ = σ = ˆ p ~ N p ,    n  12

What is a Large “n” in this situation? • Large enough so np > 5 • Large enough so n(1-p) > 5 • Examples: – (100)(0.04) = 4 (too small) – (1000)(0.01) = 10 (big enough) Now make a z-score − = ˆ p p z pq n And rearrange for a CI(p) Using the Empirical Rule Make a probability statement:     − ˆ p p   − < < = P 1.96 1.96 95%   pq     n 13

Normal Distribution 0 4 . 5 Relative likelihood . 0 4 0 0 . 3 5  α   α      0 . 3 0     2 2 0 2 . 5 0 . 2 0 0 1 . 5 0 . 1 0 0 . 0 5 . 0 0 0 -3 -2 -1 0 1 2 3 Value of Observation Use basic rules of algebra to rearrange the parts of this z-score. Manipulate the probability statement: pq Step 1: Multiply through by : n   pq pq ( ) − < − < =   P 1.96 p ˆ p 1.96 0.95    n n  14

Manipulate the probability statement: Step 2: Subract from all parts of the expression: ˆ p   pq pq − − < − < − + =   P p ˆ 1.96 p p ˆ 1.96 0.95    n n  Manipulate the probability statement: Step 3: Multiply through by -1: (remember to switch the directions of < >)   pq pq + > > − =   ˆ ˆ P p 1.96 p p 1.96 0.95    n n  Manipulate the probability statement: Step 4: Swap the left and right sides to put in conventional < < form: p   pq pq − < < + =   ˆ ˆ P p 1.96 p p 1.96 0.95   n n   15

Confidence Interval for p (but the unknown p is in the formula. What can we do?) pq = ± ⋅ ˆ p z α n 2 Confidence Interval for p (substitute sample statistic for p) p ˆ q ˆ = ± 2 ⋅ ˆ p z α n Sample Size Needed to Estimate “p” within E, with Confid.=1- α   2 Z   α 2 ⋅ = ˆ ˆ n p q     2 E   16

Components of Sample Size Formula when Estimating “p” • Z α /2 is based on α using the standard normal distribution • p and q are estimates of the population proportions of “successes” and “failures” • E is the acceptable “margin of error” when estimating µ Components of Sample Size Formula when Estimating “p” • p and q are estimates of the population proportions of “successes” and “failures” • Use relevant information to estimate p and q if available • Otherwise, use p = q = 0.5, so the product pq = 0.25 Confidence Interval for σ starts with this fact µ σ if ~ ( , ) x N then ( ) n − 2 1 s χ 2 ~ (chi square) σ 2 17

What have we studied already that connects with Chi-square random values? ( ) n − 2 1 s χ 2 ~ (chi square) σ 2 ∑ ( ) − µ 2 x ( ) − n 1 ( ) ( ) − − 2 n 1 s n 1 = σ σ 2 2 ∑ ( ) − µ 2 x = σ 2 ( ) ( )   2 − µ 2  − µ  x x ∑ ∑   =   σ σ 2       ∑ = 2 a sum of squared z standard normal values 18

Confidence Interval for σ ( ) − 2 n 1 s = LB χ 2 R ( ) − 2 n 1 s = UB χ 2 L 19

Statistics 300: Elementary Statistics Sections 7-2, 7-3, 7-4, 7-5 - PDF document

Statistics 300: Elementary Statistics Sections 7-2, 7-3, 7-4, 7-5 Parameter Estimation Point Estimate Best single value to use Question What is the probability this estimate is the correct value? Parameter Estimation

Statistics 300: Elementary Statistics Section 8-2 1 Hypothesis Testing Principles

Statistics 300: Elementary Statistics Section 6-5 Central Limit Theorem Given: X has mean =

The Nature of Probability The Nature of Probability y

Statistics for Social Sciences I: Introduction to Statistics Introduction to Statistics

Statistics 3-1 Definitions Statistics is a branch of Mathematics

Who we are? OECD STATISTICS ESTONIA AUSTRALIAN BUREAU OF STATISTICS STATISTICS NEW ZEALAND

Statistics Liberia Office of National Statistics The compilation of External Trade

The Statistics Network The Statistics Network Statistics network Compute servers Desktop PCs

Statistics, Visualization and More Using R (298.916) Block IV: (Elementary) hypothesis

1 Practical Information 2 Introduction to Statistics Per Bruun Brockhoff 3 Descriptive Statistics:

Quality Assurance in Official Statistics Directorate of Economics & Statistics, Planning

Rolling the Dice: Flipping an elementary probability and statistics classroom Jerry Orloff and

Statistics Sierra Leone Statistics Sierra Leone PRESENTATION : Compilation process of Sierra

Statistics Ghana The Ghana Statistical Service (GSS) 1 PRACTICES IN INTERNATIONAL

Bayesian statistics DS GA 1002 Probability and Statistics for Data Science

Inferential Statistics Inferential statistics are used to test

STATISTICS OF OCCUPATIONAL INJURIES Resolution concerning statistics of occupational injuries

Blogging about R Audio recording of this talk is on: R-statistics.com R-statistics.com

Descriptive Statistics DS GA 1002 Probability and Statistics for Data Science

Paper 4 Statistics & Research Statistics & Research Methodology - BK SAVITRI, Pandav

Probability & Statistics Thomas Schwarz, SJ Overview Statistics is the lifeblood of data

Statistics Physics Chemistry Economics Biology

Descripti v e statistics P R AC TIC IN G STATISTIC S IN TE R VIE W QU E STION S IN P YTH ON

What can go wrong with statistics: Some typical errors & How to lie with statistics Content

Statistics 300: Elementary Statistics Sections 7-2, 7-3, 7-4, 7-5 - PDF document

Statistics 300: Elementary Statistics Sections 7-2, 7-3, 7-4, 7-5 Parameter Estimation Point Estimate Best single value to use Question What is the probability this estimate is the correct value? Parameter Estimation

Statistics 300: Elementary Statistics Section 8-2 1 Hypothesis Testing Principles

Statistics 300: Elementary Statistics Section 6-5 Central Limit Theorem Given: X has mean =

The Nature of Probability The Nature of Probability y

Statistics for Social Sciences I: Introduction to Statistics Introduction to Statistics

Statistics 3-1 Definitions Statistics is a branch of Mathematics

Who we are? OECD STATISTICS ESTONIA AUSTRALIAN BUREAU OF STATISTICS STATISTICS NEW ZEALAND

Statistics Liberia Office of National Statistics The compilation of External Trade

The Statistics Network The Statistics Network Statistics network Compute servers Desktop PCs

Statistics, Visualization and More Using R (298.916) Block IV: (Elementary) hypothesis

1 Practical Information 2 Introduction to Statistics Per Bruun Brockhoff 3 Descriptive Statistics:

Quality Assurance in Official Statistics Directorate of Economics &amp; Statistics, Planning

Rolling the Dice: Flipping an elementary probability and statistics classroom Jerry Orloff and

Statistics Sierra Leone Statistics Sierra Leone PRESENTATION : Compilation process of Sierra

Statistics Ghana The Ghana Statistical Service (GSS) 1 PRACTICES IN INTERNATIONAL

Bayesian statistics DS GA 1002 Probability and Statistics for Data Science

Inferential Statistics Inferential statistics are used to test

STATISTICS OF OCCUPATIONAL INJURIES Resolution concerning statistics of occupational injuries

Blogging about R Audio recording of this talk is on: R-statistics.com R-statistics.com

Descriptive Statistics DS GA 1002 Probability and Statistics for Data Science

Paper 4 Statistics &amp; Research Statistics &amp; Research Methodology - BK SAVITRI, Pandav

Probability &amp; Statistics Thomas Schwarz, SJ Overview Statistics is the lifeblood of data

Statistics Physics Chemistry Economics Biology

Descripti v e statistics P R AC TIC IN G STATISTIC S IN TE R VIE W QU E STION S IN P YTH ON

What can go wrong with statistics: Some typical errors &amp; How to lie with statistics Content

Quality Assurance in Official Statistics Directorate of Economics & Statistics, Planning

Paper 4 Statistics & Research Statistics & Research Methodology - BK SAVITRI, Pandav

Probability & Statistics Thomas Schwarz, SJ Overview Statistics is the lifeblood of data

What can go wrong with statistics: Some typical errors & How to lie with statistics Content