statistics 300 elementary statistics
play

Statistics 300: Elementary Statistics Sections 7-2, 7-3, 7-4, 7-5 - PDF document

Statistics 300: Elementary Statistics Sections 7-2, 7-3, 7-4, 7-5 Parameter Estimation Point Estimate Best single value to use Question What is the probability this estimate is the correct value? Parameter Estimation


  1. Statistics 300: Elementary Statistics Sections 7-2, 7-3, 7-4, 7-5 Parameter Estimation • Point Estimate – Best single value to use • Question – What is the probability this estimate is the correct value? Parameter Estimation • Question – What is the probability this estimate is the correct value? • Answer – zero : assuming “ x” is a continuous random variable – Example for Uniform Distribution 1

  2. If X ~ U[100,500] then • P(x = 300) = (300-300)/(500-100) • = 0 100 300 400 500 Parameter Estimation µ • Pop. mean – Sample mean x p • Pop. proportion ˆ – Sample proportion p σ • Pop. standard deviation – Sample standard deviation s Problem with Point Estimates • The unknown parameter ( µ , p, etc.) is not exactly equal to our sample-based point estimate. • So, how far away might it be? • An interval estimate answers this question. 2

  3. Confidence Interval • A range of values that contains the true value of the population parameter with a ... • Specified “level of confidence”. • [L(ower limit),U(pper limit)] Terminology • Confidence Level (a.k.a. Degree of Confidence) – expressed as a percent (%) • Critical Values (a.k.a. Confidence Coefficients) Terminology • “alpha” “ α ” = 1-Confidence – more about α in Chapter 7 • Critical values – express the confidence level 3

  4. Confidence Interval for µ lf σ is known (this is a rare situation) = ± x E  σ  = ⋅   E z α 2   n Confidence Interval for µ lf σ is known (this is a rare situation) if x ~N(?, σ ) σ   = ± ⋅   x z α   n 2 Why does the Confidence Interval for µ look like this ? σ   = ± ⋅   x z α   n 2 4

  5. σ µ x ~ N ( , ) n make an x value into a z - score. The general z - score expression is − µ ) = x ( z σ for x , µ µ is : unchanged x and σ σ is x n 5

  6. so a z - score based on x is − µ x = z σ n Using the Empirical Rule Make a probabilit y statement :     − µ ( x ) − < < =   P 2 2 95 % σ     n Normal Distribution 0 4 . 5 Relative likelihood 0 . 4 0 α 0 . 3 5 α         0 . 3 0     2 2 0 2 . 5 . 0 2 0 0 1 . 5 0 . 1 0 0 . 5 0 . 0 0 0 -3 -2 -1 0 1 2 3 Value of Observation 6

  7. Check out the “Confidence z-scores” on the WEB page. (In pdf format.) Use basic rules of algebra to rearrange the parts of this z-score. Manipulate the probabilit y statement :       ( ) σ σ − < − µ < =       P 2 x 2 0 . 95       n n 7

  8. Manipulate the probabilit y statement :       σ σ − − < − µ < − + =       P x 2 x 2 0 . 95       n n Confidence = 95% α = 1 - 95% = 5% α/2 = 2.5% = 0.025 Manipulate the probabilit y statement : multiply t hrough by (-1) and change the order of the terms       σ σ  − < µ < +  =     P x 2 x 2 0 . 95       n n Confidence = 95% α = 1 - 95% = 5% α/2 = 2.5% = 0.025 Confidence Interval for µ lf σ is not known (usual situation)   s = ± ⋅   x t α   n 2 8

  9. Sample Size Needed to Estimate µ within E, with Confidence = 1- α 2   ⋅ σ ˆ Z α   = 2 n   E   Components of Sample Size Formula when Estimating µ • Z α /2 reflects confidence level – standard normal distribution σ σ • ˆ is an estimate of , the standard deviation of the pop. • E is the acceptable “margin of error” when estimating µ Confidence Interval for p • The Binomial Distribution gives us a starting point for determining the distribution of the sample proportion : p ˆ x successes = = ˆ p n trials 9

  10. For Binomial “x” µ = np σ = npq For the Sample Proportion 1 ( ) x = = ˆ p x n n x is a random variable n is a constant Time Out for a Principle: µ If is the mean of X and “a” is a constant, what is the mean of aX? ⋅ µ . a Answer: 10

  11. Apply that Principle! • Let “a” be equal to “1/n”   1 X • so = = =   p ˆ aX X   n n µ = µ = • and a a ( np ) ˆ p x   1 = ⋅ =   np p   n Time Out for another Principle: σ 2 If is the variance of X and “a” x is a constant, what is the variance of aX? σ = a σ . 2 2 2 Answer: aX x Apply that Principle! • Let x be the binomial “x” • Its variance is npq = np(1-p), which is the square of is standard deviation 11

  12. Apply that Principle! • Let “a” be equal to “1/n”   1 X = = =   • so p ˆ aX X   n n ( ) σ = σ = 2 2 2 2 • and a 1 / n ( npq ) ˆ p X Apply that Principle! 2   1 pq ⋅ = = σ   2 npq p ˆ   n n and pq σ = ˆ p n When n is Large,   pq   µ = σ = ˆ p ~ N p ,    n  12

  13. What is a Large “n” in this situation? • Large enough so np > 5 • Large enough so n(1-p) > 5 • Examples: – (100)(0.04) = 4 (too small) – (1000)(0.01) = 10 (big enough) Now make a z-score − = ˆ p p z pq n And rearrange for a CI(p) Using the Empirical Rule Make a probability statement:     − ˆ p p   − < < = P 1.96 1.96 95%   pq     n 13

  14. Normal Distribution 0 4 . 5 Relative likelihood . 0 4 0 0 . 3 5  α   α      0 . 3 0     2 2 0 2 . 5 0 . 2 0 0 1 . 5 0 . 1 0 0 . 0 5 . 0 0 0 -3 -2 -1 0 1 2 3 Value of Observation Use basic rules of algebra to rearrange the parts of this z-score. Manipulate the probability statement: pq Step 1: Multiply through by : n   pq pq ( ) − < − < =   P 1.96 p ˆ p 1.96 0.95    n n  14

  15. Manipulate the probability statement: Step 2: Subract from all parts of the expression: ˆ p   pq pq − − < − < − + =   P p ˆ 1.96 p p ˆ 1.96 0.95    n n  Manipulate the probability statement: Step 3: Multiply through by -1: (remember to switch the directions of < >)   pq pq + > > − =   ˆ ˆ P p 1.96 p p 1.96 0.95    n n  Manipulate the probability statement: Step 4: Swap the left and right sides to put in conventional < < form: p   pq pq − < < + =   ˆ ˆ P p 1.96 p p 1.96 0.95   n n   15

  16. Confidence Interval for p (but the unknown p is in the formula. What can we do?) pq = ± ⋅ ˆ p z α n 2 Confidence Interval for p (substitute sample statistic for p) p ˆ q ˆ = ± 2 ⋅ ˆ p z α n Sample Size Needed to Estimate “p” within E, with Confid.=1- α   2 Z   α 2 ⋅ = ˆ ˆ n p q     2 E   16

  17. Components of Sample Size Formula when Estimating “p” • Z α /2 is based on α using the standard normal distribution • p and q are estimates of the population proportions of “successes” and “failures” • E is the acceptable “margin of error” when estimating µ Components of Sample Size Formula when Estimating “p” • p and q are estimates of the population proportions of “successes” and “failures” • Use relevant information to estimate p and q if available • Otherwise, use p = q = 0.5, so the product pq = 0.25 Confidence Interval for σ starts with this fact µ σ if ~ ( , ) x N then ( ) n − 2 1 s χ 2 ~ (chi square) σ 2 17

  18. What have we studied already that connects with Chi-square random values? ( ) n − 2 1 s χ 2 ~ (chi square) σ 2 ∑ ( ) − µ 2 x ( ) − n 1 ( ) ( ) − − 2 n 1 s n 1 = σ σ 2 2 ∑ ( ) − µ 2 x = σ 2 ( ) ( )   2 − µ 2  − µ  x x ∑ ∑   =   σ σ 2       ∑ = 2 a sum of squared z standard normal values 18

  19. Confidence Interval for σ ( ) − 2 n 1 s = LB χ 2 R ( ) − 2 n 1 s = UB χ 2 L 19

Recommend


More recommend