Outline Experiments with Two Factors (14.1) Experiments with Three - - PDF document

outline
SMART_READER_LITE
LIVE PREVIEW

Outline Experiments with Two Factors (14.1) Experiments with Three - - PDF document

2/15/2007 219323 Probability and Statistics for Software Statistics for Software and Knowledge Engineers Lecture 12: Multifactor Experimental Multifactor Experimental Design and Analysis Monchai Sopitkamon, Ph.D. Outline Experiments


slide-1
SLIDE 1

2/15/2007 1

219323 Probability and Statistics for Software Statistics for Software and Knowledge Engineers

Lecture 12: Multifactor Experimental Multifactor Experimental Design and Analysis

Monchai Sopitkamon, Ph.D.

Outline

Experiments with Two Factors (14.1) Experiments with Three or More Factors

(14.2)

slide-2
SLIDE 2

2/15/2007 2

Experiments with Two Factors (14.1)

Two-Factor Experimental Designs (14.1.1) Models for Two-Factor Experiments (14 1 2) Models for Two-Factor Experiments (14.1.2) Analysis of Variance Table (14.1.3) Pairwise Comparisons of the Factor Level

Means (14.1.4)

Two-Factor Experimental Designs I (14.1.1)

ANOVA in Chapter 11 concerns relationship

between a response variable of interest and between a response variable of interest and various levels of a single factor of interest

Objectives:

– To simultaneously investigate the relationship between a response variable and various levels

  • f two or more factors of interest

– To extend the ANOVA technique to analyze the To extend the ANOVA technique to analyze the structure of the relationship between the response variable and the two factors of interest – To assess any interaction between the two factors where the response variable in influenced

slide-3
SLIDE 3

2/15/2007 3

Two-Factor Experimental Designs II (14.1.1)

Investigate how a response variable

depends on two factors of interest depends on two factors of interest

Ex: washing quality (response var) of a

detergent may depend on two factors: water temperature and detergent type.

Ex: a student’s grade (response var) may

depend on two factors: in-class t ti d dili concentration and diligence.

Two-Factor Experimental Designs III (14.1.1)

A tw o factor experim ent w ith a × b experim ental configurations

slide-4
SLIDE 4

2/15/2007 4

Two-Factor Experimental Designs IV: Two-Factor Experiments

Consider an experiment w/ two factors, A

and B. If factor A has a levels and factor B and B. If factor A has a levels and factor B has b levels, then there are ab experimental configurations or cells.

In a complete balanced experiment, n

replicate measurements are taken at each configuration, resulting in a total sample size

  • f n = abn data observations
  • f nT = abn data observations.

The data observation xijk represents the

value of the kth measurement obtained at the ith level of factor A and the jth level of factor B.

Two-Factor Experimental Designs V: Two-Factor Experiments

Data observations and sam ple averages for a tw o factor experim ent

slide-5
SLIDE 5

2/15/2007 5

al Designs tor s Two-Factor Experimenta V: Two-Fac Experiments T E V E

Data set for the car body assem bly line exam ple

Models for Two-Factor Experiments I (14.1.2)

An observation xijk for the kth observation of

level i of factor A and level j of factor B can j be written as: xijk = µij + εijk where µij are unknown parameters of interest in a two-factor experiment or cell means, and εijk is the error term with N(0,σ2)

Consider three aspects of the relationship

p p between the expected value of the response variable and the two factors of interest.

– the interaction effect between factors A and B – the main effect of factor A – the main effect of factor B

slide-6
SLIDE 6

2/15/2007 6

Models for Two-Factor Experiments II (14.1.2)

Cell m eans for a tw o factor experim ent

Models for Two-Factor Experiments III (14.1.2)

I nterpretation of m odels for tw o factor experim ents

slide-7
SLIDE 7

2/15/2007 7

Models for Two-Factor Experiments IV (14.1.2)

I nterpretation of m odels for tw o factor experim ents

Models for Two-Factor Experiments V (14.1.2)

I nterpretation of m odels for tw o factor experim ents

slide-8
SLIDE 8

2/15/2007 8

Models for Two-Factor Experiments VI (14.1.2)

I nterpretation of m odels for tw o factor experim ents

Models for Two-Factor Experiments VII (14.1.2)

I nterpretation of m odels for tw o factor experim ents

slide-9
SLIDE 9

2/15/2007 9

nce Table I sis of Varian 3) Analys (14.1.3

The sum of squares decom position for a tw o factor experim ent

Analysis of Variance Table II (14.1.3) : SST

Total Sum of Squares (SST) – a measure of

the total variability in the data set the total variability in the data set

( )

∑∑∑ ∑∑∑

= = = = = =

− = − =

a i b j n k ijk a i b j n k ijk

x abn x x x SST

1 1 1 2 ... 2 2 1 1 1 ...

where

1 1 1 a i b j n k ijk

x

∑∑∑

:

1 1 1 ... T i j k

n x

= = =

=

  • verall or grand mean.

:

ijk

x

k-th observation in level i.of factor A and level j of factor B nT: total number of observations: abn

slide-10
SLIDE 10

2/15/2007 10

The sum of squares decom position for a tw o factor experim ent

Analysis of Variance Table II (14.1.3) : SSA

Sum of Squares for Factor A (SSA) – a

measure of the variability due to factor A. measure of the variability due to factor A.

( )

2 ... 1 2 1 2 ...

x abn x bn x x bn SSA

a i i a i i

− = − =

∑ ∑

= ⋅ ⋅ = ⋅ ⋅

where l h l l

1 ∑∑

b n

sample averages at each level

  • f factor A.

: b

number of levels of factor B

: n

number of observations at level i of factor A and level j of factor B

:

1 1

∑∑

= = ⋅ ⋅ = j k ijk i

x bn x

slide-11
SLIDE 11

2/15/2007 11

Partitioning the Total Sum of Squares VIII (11.1.2)

The sum of squares decom position for a tw o factor experim ent

Analysis of Variance Table III (14.1.3) : SSB

Sum of Squares for Factor B (SSB) – a

measure of the variability due to factor B. measure of the variability due to factor B. where

( )

2 ... 1 2 1 2 ...

x abn x an x x an SSB

b j j b j j

− = − =

∑ ∑

= ⋅ ⋅ = ⋅ ⋅

1 ∑∑

a n

l

: a

number of levels of factor A

: 1

1 1

∑∑

= = ⋅ ⋅ = i k ijk j

x an x

sample averages at each level of factor B.

: n

number of observations at level i of factor A and level j of factor B

slide-12
SLIDE 12

2/15/2007 12

The sum of squares decom position for a tw o factor experim ent

Analysis of Variance Table IV (14.1.3) : SSAB

Sum of Squares for Interaction (SSAB) – a

measure of the variability due to interaction measure of the variability due to interaction effect between factors A and B.

( )

∑∑

= = ⋅ ⋅ ⋅ ⋅ ⋅

− − − =

a i b j j i ij

x x x x n SSAB

1 1 2 ...

where

: 1

1

= ⋅ = n k ijk ij

x n x

Cell averages at level i of factor A and level j of factor B.

: n

number of observations at level i of factor A and level j of factor B

slide-13
SLIDE 13

2/15/2007 13

The sum of squares decom position for a tw o factor experim ent

Analysis of Variance Table V (14.1.3) : SSE

Sum of Squares for Error (SSE) – a

measure of the variability within each of the measure of the variability within each of the ab experimental configuration.

( )

∑∑ ∑∑∑ ∑∑∑

= = ⋅ = = = = = =

− = − =

a i b j ij a i b j n k ijk a i b j n k ij ijk

x n x x x SSE

1 1 2 1 1 1 2 1 1 1 2 .

where

: 1

1

= ⋅ = n k ijk ij

x n x

Cell averages at level i of factor A and level j of factor B.

: n

number of observations at level i of factor A and level j of factor B

slide-14
SLIDE 14

2/15/2007 14

The Analysis of Variance Table VI (14.1.3)

Analysis of variance table for a tw o factor experim ent Reject H0 if F-statistic > F-critical (from

Table IV or Excel’s FINV function)

  • r Reject H0 if p-value < α (as in previous

chapter)

The Analysis of Variance Table VII (14.1.3) : An Example

Ex.9 pg.654: Car Body Assembly Line

The analysis of variance table for the car body assem bly line exam ple

No machine main effect (accept H0) There is solder main effect (reject H0) No interaction effect between machine and

solder (accept H0)

the car body assem bly line exam ple

Excel sheet

slide-15
SLIDE 15

2/15/2007 15

The Analysis of Variance Table VIII (14.1.3) : Another Example

Ex.74 pg.655: Company Transportation Costs

The analysis of variance table for the com pany transportation costs exam ple

There is route main effect (reject H0) There is period main effect (reject H0) No interaction effect between the route taken

and the period of the day (accept H0)

the com pany transportation costs exam ple

Excel sheet

The Analysis of Variance Table IX (14.1.3) : Another Example

A plot of the cell sam ple averages for the com pany transportation transportation costs exam ple

slide-16
SLIDE 16

2/15/2007 16

Pairwise Comparisons of the Factor Level Means I (14.1.4)

If H0 is accepted for interaction effect but H0

is rejected for one or more factor, we’d like is rejected for one or more factor, we d like to be able to tell which samples are different and by how much.

Factor A: perform multiple comparisons to

compare all levels simultaneously

– By computing the differences μI – μj for 1 ≤ i < j ≤ a among all a(a 1)/2 pairs of factor level ≤ a among all a(a – 1)/2 pairs of factor level means

Compute confidence intervals

bn MSE q x x

n ab a j i j i ) 1 ( , , − ⋅ ⋅ ⋅ ⋅

± − ∈ −

α

μ μ

Pairwise Comparisons of the Factor Level Means II (14.1.4)

If the CI for the difference μI – μj contains 0,

then factor levels i and j of factor A are not then factor levels i and j of factor A are not significantly different.

If the CI for the difference μI – μj does not

contains 0, then factor levels i and j for factor A are significantly different.

The CI indicates by how much the factor

l l h t b diff t level means are shown to be different.

If H0 is accepted for factor A, then all CIs will

contain 0, which means no significant difference between different levels of factor A

slide-17
SLIDE 17

2/15/2007 17

Pairwise Comparisons of the Factor Level Means III (14.1.4)

If there is interaction effect, then these CIs

are not interpretable since the different are not interpretable since the different effects of factor A depend on the level of factor B.

If factor A has two levels (a = 2), constructed

CI for μ1 – μ2 is equivalent to the CI constructed from a two-sample t-procedure w/ pooled variance estimate as in Chapter w/ pooled variance estimate as in Chapter 9.3.2.

Pairwise Comparisons of the Factor Level Means IV (14.1.4)

Factor B: perform multiple comparisons to

compare all levels simultaneously compare all levels simultaneously

– By computing the differences μI – μj for 1 ≤ i < j ≤ b among all b(b – 1)/2 pairs of factor level means

Compute confidence intervals

MSE q x x ± ∈ μ μ an q x x

n ab b j i j i ) 1 ( , , − ⋅ ⋅ ⋅ ⋅

± − ∈ −

α

μ μ

slide-18
SLIDE 18

2/15/2007 18

Pairwise Comparisons of the Factor Level Means V (14.1.4)

  • Ex. 9 pg.657: Car Body Assembly Line

– No interaction effect between machines and No interaction effect between machines and solder formulations, but solder main effect existed. – Perform pairwise comparisons of b = 4 levels of solder formulations. q0.05,4, 24 = 3.90 (From Table V)

92 MSE 248 . 1 3 3 92 . 90 . 3

) 1 ( , ,

= × × =

an MSE q

n ab b α

Excel sheet

Pairwise Comparisons of the Factor Level Means VI (14.1.4)

CIs of Diff lower upper μ1 − μ2

  • 0.131

2.365 1 3 1 182 1 313

  • 3 solder types (1, 2, 3) are not

significantly different than one

μ1 − μ3

  • 1.182

1.313 μ1 − μ4

  • 3.623 -1.128

μ2 − μ3

  • 2.299

0.197 μ2 − μ4

  • 4.740 -2.244

μ3 − μ4

  • 3.689 -1.193

significantly different than one another

  • Solder 4 has significant effect on

welding strength than any other 3 solder types

Com parisons of the solders for the car body assem bly line exam ple

Excel sheet

slide-19
SLIDE 19

2/15/2007 19

Pairwise Comparisons of the Factor Level Means VII (14.1.4)

  • Ex. 74 pg.658: Company Transportation

Costs Costs

– No interaction effect between route and period, but route and period main effects existed. – Perform pairwise comparisons of both factors A (route, a = 2) and B (period, b = 3 levels). Consider Factor A: Route (a = 2) Consider Factor A: Route (a 2) q0.05,2, 24 = 2.92 (From Table V)

6 . 14 5 3 7 . 373 92 . 2

) 1 ( , ,

= × × =

bn MSE q

n ab a α

Excel sheet

Pairwise Comparisons of the Factor Level Means VIII (14.1.4)

CIs of Diff lower upper

  • Route 1 takes on average at least 8

mins longer than route 2, and takes as much as 37 mins longer on average. μ1 − μ2 8.0 37.2 Since no interaction effect between route and period of day, this result holds regardless of time of day the truck leaves the factory.

slide-20
SLIDE 20

2/15/2007 20

Pairwise Comparisons of the Factor Level Means IX (14.1.4)

  • Ex. 74 pg.658: Company Transportation

Costs Costs

Consider Factor B: Period of time (b = 3) Let level 1 be morning, level 2 be afternoon, level 3 be evening q = 3 53 (From Table V) q0.05,3, 24 = 3.53 (From Table V)

6 . 21 5 2 7 . 373 53 . 3

) 1 ( , ,

= × × =

an MSE q

n ab b α

Excel sheet

Pairwise Comparisons of the Factor Level Means X (14.1.4)

CIs of Diff lower upper μ1 μ2 5 4 37 8

  • No significant difference in

driving time between morning

μ1 − μ2

  • 5.4

37.8 μ1 − μ3 26.4 69.6 μ2 − μ3 10.2 53.4

  • Trucks leaving in the evening takes shorter expected driving times

than in the morning and afternoon.

  • The decrease is between 10 and 53 mins compared to afternoon,

and between 26 and 69 mins compared to the morning period driving time between morning and afternoon periods. and between 26 and 69 mins compared to the morning period. Excel sheet

slide-21
SLIDE 21

2/15/2007 21

Experiments with Three or More Factors (14.2)

Three-Factor Experiments (14.2.1) 2k Experiments (14 2 2) 2 Experiments (14.2.2)

Three-Factor Experiments I (14.2.1)

Investigate how a response variable

depends on three or more factors of interest. depends on three or more factors of interest.

This is similar analysis to that for the two-

factor one, except that there are more interaction terms to be considered.

slide-22
SLIDE 22

2/15/2007 22

Three-Factor Experiments II (14.2.1)

The sum of squares decom position for three factor experim ent

Three-Factor Experiments III (14.2.1)

Analysis of variance table for a three factor experim ent

slide-23
SLIDE 23

2/15/2007 23

2k Experiments I (14.2.2)

If there are k factors that effect a response

variable, each factor having the number of g levels as l1,…, lk, the total number of experiments required is

l1 x l2 x ⋅ ⋅ ⋅ x lk

If there are n replicates for each experiment,

the total sample size required is

nT = l1 x l2 x ⋅ ⋅ ⋅ x lk x n

f

This sample size can become large for large

number of k factors.

Screening experiments are therefore

necessary where each factor is considered at only 2 levels, or li = 2, 1 ≤ i ≤ k

2k Experiments II (14.2.2)

So the total number of experiments with k

factors, each factor having 2 levels, are now d d t reduced to 2 x 2 x 2 x ⋅ ⋅ ⋅ x 2

This is known as 2k experimental design, and

the total sample size is

2k

k times

nT = 2k x n Where n is the number of replicates at each experimental configuration.

Use the typical model where the data

  • bservations are composed of a cell mean

together with an error term.

slide-24
SLIDE 24

2/15/2007 24

2k Experiments III (14.2.2)

The cell means are

– the sum of the k individual factor main effects + – all the possible two-way interaction effects + – higher order interaction effects

There are r-way interaction effects (the

number of ways that a subset of r factors can b t k f th k f t ⎟ ⎟ ⎠ ⎞ ⎜ ⎜ ⎝ ⎛ r k be taken from the k factors.

Construct the ANOVA table as usual with all

rows having 1 degree of freedom, except for the error’s degrees of freedom, which are 2k(n – 1).

The total degrees of freedom are 2kn – 1

Data set for chem ical yields exam ple

slide-25
SLIDE 25

2/15/2007 25

2k Experiments IV (14.2.2)

  • The analysis of variance table for

the chem ical yields exam ple

2k Experiments V (14.2.2)

<

The sam ple averages for the chem ical yields exam ple

< > < <