Univariate 1-Way ANOVA as a Linear Model with Fixed Regressors

$$\begin{array}{ccc}
\text{Group 1} & \text{Group 2} & \text{Group 3} \\
\hline
x_{1,1} & x_{1,2} & x_{1,3} \\
x_{2,1} & x_{2,2} & x_{2,3} \\
x_{3,1} & x_{3,2} & x_{3,3}
\end{array}$$

Linear Model

$$x_{i,j} = \mu + \alpha_j + \varepsilon_{i,j} \qquad (1)$$
Reparameterized Linear Model

$$x_{i,j} = \mu_j + \varepsilon_{i,j} \qquad (2)$$

Matrix Form

$$\mathbf{x} = \mathbf{D}\boldsymbol{\mu} + \boldsymbol{\varepsilon} \qquad (3)$$

$\mathbf{D}$ is a "design matrix" with 1's and 0's, and

$$\boldsymbol{\mu} = \begin{bmatrix} \alpha_1 & \alpha_2 & \cdots & \alpha_J & \mu \end{bmatrix}' \qquad (4)$$
Example.

$$\begin{bmatrix}
x_{1,1} \\ x_{2,1} \\ x_{3,1} \\ x_{1,2} \\ x_{2,2} \\ x_{3,2} \\ x_{1,3} \\ x_{2,3} \\ x_{3,3}
\end{bmatrix}
=
\begin{bmatrix}
1 & 0 & 0 & 1 \\
1 & 0 & 0 & 1 \\
1 & 0 & 0 & 1 \\
0 & 1 & 0 & 1 \\
0 & 1 & 0 & 1 \\
0 & 1 & 0 & 1 \\
0 & 0 & 1 & 1 \\
0 & 0 & 1 & 1 \\
0 & 0 & 1 & 1
\end{bmatrix}
\begin{bmatrix}
\alpha_1 \\ \alpha_2 \\ \alpha_3 \\ \mu
\end{bmatrix}
+
\begin{bmatrix}
\varepsilon_{1,1} \\ \varepsilon_{2,1} \\ \varepsilon_{3,1} \\ \varepsilon_{1,2} \\ \varepsilon_{2,2} \\ \varepsilon_{3,2} \\ \varepsilon_{1,3} \\ \varepsilon_{2,3} \\ \varepsilon_{3,3}
\end{bmatrix} \qquad (5)$$
Notice that the $\mathbf{D}$ matrix is not of full rank. (Why not? C.P.) $\mathbf{D}$ is of rank 3. We can partition $\mathbf{D}$ into two sub-matrices, one of which is of full rank, and the other of which contains columns that are "superfluous" in the sense that they are linearly dependent.

$$\mathbf{D} = \begin{bmatrix} \mathbf{D}_1 & \mathbf{D}_2 \end{bmatrix} \qquad (6)$$

We can similarly partition
$$\boldsymbol{\mu} = \begin{bmatrix} \boldsymbol{\mu}_1 \\ \boldsymbol{\mu}_2 \end{bmatrix} \qquad (7)$$

Note that we could redefine $\boldsymbol{\mu}$ as

$$\boldsymbol{\mu}^* = \begin{bmatrix} \mu + \alpha_1 \\ \mu + \alpha_2 \\ \mu + \alpha_3 \end{bmatrix} = \begin{bmatrix} \mu_1 \\ \mu_2 \\ \mu_3 \end{bmatrix} \qquad (8)$$

in which case

$$\mathbf{x} = \mathbf{D}_1 \boldsymbol{\mu}^* + \boldsymbol{\varepsilon} \qquad (9)$$
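To make the rank structure concrete, here is a minimal numpy sketch of the design matrices in Equations (5)-(9) for the 3-group, 3-observations-per-group example; the variable names are illustrative, not from the notes.

```python
import numpy as np

J, n = 3, 3
D1 = np.repeat(np.eye(J), n, axis=0)        # one indicator column per group
D = np.hstack([D1, np.ones((J * n, 1))])    # append the column of 1's for mu

print(np.linalg.matrix_rank(D))    # 3: D is 9 x 4 but not of full column rank
print(np.linalg.matrix_rank(D1))   # 3: D1 is of full column rank

# Data generated from the reparameterized model x = D1 mu* + eps
rng = np.random.default_rng(0)
mu_star = np.array([10.0, 12.0, 11.0])      # mu_j = mu + alpha_j
x = D1 @ mu_star + rng.normal(0.0, 1.0, size=J * n)
```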
The null hypothesis in ANOVA can be expressed as a "general linear hypothesis" of the form

$$\mathbf{H}'\boldsymbol{\mu}^* = \mathbf{0} \qquad (10)$$

Let's try it. Suppose

$$\mathbf{H}' = \begin{bmatrix} 1 & 0 & -1 \\ 0 & 1 & -1 \end{bmatrix} \qquad (11)$$

Then
$$\mathbf{H}'\boldsymbol{\mu}^* = \begin{bmatrix} \mu + \alpha_1 - \mu - \alpha_3 \\ \mu + \alpha_2 - \mu - \alpha_3 \end{bmatrix} = \begin{bmatrix} \alpha_1 - \alpha_3 \\ \alpha_2 - \alpha_3 \end{bmatrix} \qquad (12)$$

Notice that if both elements of the vector on the right in Equation (12) are zero, then all 3 $\alpha_j$ must be equal, and the null hypothesis of equal means must be true. Notice also that there are infinitely many ways you can write a "hypothesis matrix" $\mathbf{H}$ that expresses
the same null hypothesis. All of them will be of rank 2, or, in the case of $J$ groups, $J - 1$. Let $\mathbf{H}'$ be of order $p \times q$, and let $\mathbf{x}$ have $N$ elements. (Note that this $N$ is the total $N$ in the case of ANOVA.) Under $H_0$, the following statistic has an $F$ distribution:
$$F_{p,\,N-q} = \frac{\mathbf{x}'\mathbf{D}_1(\mathbf{D}_1'\mathbf{D}_1)^{-1}\mathbf{H}\left[\mathbf{H}'(\mathbf{D}_1'\mathbf{D}_1)^{-1}\mathbf{H}\right]^{-1}\mathbf{H}'(\mathbf{D}_1'\mathbf{D}_1)^{-1}\mathbf{D}_1'\mathbf{x} \,/\, p}{\mathbf{x}'\left[\mathbf{I} - \mathbf{D}_1(\mathbf{D}_1'\mathbf{D}_1)^{-1}\mathbf{D}_1'\right]\mathbf{x} \,/\, (N-q)} \qquad (13)$$

This looks incomprehensible at first glance! Let's examine the formula with reference to our simple 3-group ANOVA example, and let's assume that sample sizes are equal in all groups. Look closely at the denominator. Note that the bracketed matrix is the complementary projection operator for the column space of $\mathbf{D}_1$. So the entire denominator quadratic form can be written as $\mathbf{x}'\mathbf{Q}_{D_1}\mathbf{Q}_{D_1}\mathbf{x} = \mathbf{x}^{*\prime}\mathbf{x}^*$, where $\mathbf{x}^* = \mathbf{Q}_{D_1}\mathbf{x}$. So it is a sum of squares of the projection of $\mathbf{x}$ into the space orthogonal simultaneously to all the columns of $\mathbf{D}_1$. If you study $\mathbf{D}_1$, you will see that in order for $\mathbf{x}$ to be simultaneously orthogonal to all the columns of $\mathbf{D}_1$, each group of 3 scores must have a sum of zero for its group. So the denominator is just a fancy way of computing the mean square within. Now consider the matrix expression in the numerator. Recognizing that
$$\bar{\mathbf{x}} = (\mathbf{D}_1'\mathbf{D}_1)^{-1}\mathbf{D}_1'\mathbf{x} \qquad (14)$$

is the vector of sample group means, we see that on the left and the right we have expressions of the form $\bar{\mathbf{x}}'$ and $\bar{\mathbf{x}}$. Now, with equal $n$ per group, it is easy to see that

$$\mathbf{D}_1'\mathbf{D}_1 = n\mathbf{I} \qquad (15)$$

and so we can write the entire numerator matrix expression as

$$\bar{\mathbf{x}}'\mathbf{H}\left(\mathbf{H}'\,n^{-1}\mathbf{I}\,\mathbf{H}\right)^{-1}\mathbf{H}'\bar{\mathbf{x}} = n\,\bar{\mathbf{x}}'\mathbf{P}_H\bar{\mathbf{x}} \qquad (16)$$

At this point, deciphering the meaning of Equation (16) requires us to take a step back and look at $\mathbf{P}_H$. It is a column space projector for $\mathbf{H}$, which has two columns in 3-dimensional space. But note that these two columns are both orthogonal to the unit vector $\mathbf{1}$, and are linearly independent. Consequently, they span the space orthogonal to $\mathbf{1}$, and so it must be that $\mathbf{P}_H = \mathbf{Q}_{\mathbf{1}} = \mathbf{I} - \mathbf{P}_{\mathbf{1}} = \mathbf{I} - \mathbf{1}(\mathbf{1}'\mathbf{1})^{-1}\mathbf{1}'$. It follows that the matrix quantity in Equation (16) is simply $n$ times the sum of squared deviations of the sample means around their overall mean. Since $p$ in Equation (13) is one less than the number of groups, we recognize that the entire expression is simply

$$F = \frac{n S_{\bar{x}}^2}{\hat{\sigma}^2} \qquad (17)$$
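As a check on this derivation, here is a minimal numpy sketch that computes Equation (13) directly for the 3-group, equal-n example and compares it with the classical one-way ANOVA F statistic; the data and variable names are illustrative.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
J, n = 3, 3
N, p, q = J * n, J - 1, J                 # total N, hypothesis rank, rank of D1

D1 = np.repeat(np.eye(J), n, axis=0)      # full-rank design matrix
H = np.array([[1, 0], [0, 1], [-1, -1]])  # H' of Equation (11), transposed
x = D1 @ np.array([10.0, 12.0, 11.0]) + rng.normal(0.0, 1.0, N)

G = np.linalg.inv(D1.T @ D1)              # (D1'D1)^{-1}, equal to I/n here
num = x @ D1 @ G @ H @ np.linalg.inv(H.T @ G @ H) @ H.T @ G @ D1.T @ x / p
P = D1 @ G @ D1.T                         # projector onto the column space of D1
den = x @ (np.eye(N) - P) @ x / (N - q)   # mean square within
F = num / den

# Agrees with the textbook one-way ANOVA F statistic
groups = x.reshape(J, n)                  # rows of this array are the groups
print(F, stats.f_oneway(*groups).statistic)
```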
Two-Way ANOVA

Notice that in the model $\mathbf{x} = \mathbf{D}_1\boldsymbol{\mu}^* + \boldsymbol{\varepsilon}$, all the scores in the experiment are in a single vector $\mathbf{x}$, and in the hypothesis statement of Equation (10), all the means are in a single vector. Moreover, the $F$ statistic depends on the data, $N$, and just two matrix quantities, i.e., the hypothesis matrix $\mathbf{H}$ and the design matrix $\mathbf{D}_1$. The design matrix is a simple function of the linear model in scalar form (i.e., Equation (2)). However, it is less obvious how to
construct the hypothesis matrices for row, column, and interaction effects. We need a heuristic!

Constructing the Hypothesis Matrix

For simplicity, suppose we have a 2 × 2 ANOVA, with 2 observations per cell. The cell means are

$$\mathbf{U} = \begin{bmatrix} \mu_{1,1} & \mu_{1,2} \\ \mu_{2,1} & \mu_{2,2} \end{bmatrix} \qquad (18)$$
Notice that it is easy to write the null hypothesis in the form $\mathbf{AUB} = \mathbf{0}$. We can write the null hypothesis for rows as

$$H_0: \begin{bmatrix} 1 & -1 \end{bmatrix} \begin{bmatrix} \mu_{1,1} & \mu_{1,2} \\ \mu_{2,1} & \mu_{2,2} \end{bmatrix} \begin{bmatrix} 1 \\ 1 \end{bmatrix} = \left(\mu_{1,1} + \mu_{1,2}\right) - \left(\mu_{2,1} + \mu_{2,2}\right) = 0 \qquad (19)$$

This can be written

$$H_0: \mathbf{OU1} = \mathbf{0} \qquad (20)$$
We call $\mathbf{O}$ an omnibus contrast matrix, and it is always of the same basic form, i.e.,

$$\mathbf{O} = \begin{bmatrix} \mathbf{I} & -\mathbf{1} \end{bmatrix} \qquad (21)$$

(Examples. C.P.) In a similar vein, we can write the column effect null hypothesis as

$$H_0: \begin{bmatrix} 1 & 1 \end{bmatrix} \begin{bmatrix} \mu_{1,1} & \mu_{1,2} \\ \mu_{2,1} & \mu_{2,2} \end{bmatrix} \begin{bmatrix} 1 \\ -1 \end{bmatrix} = \left(\mu_{1,1} + \mu_{2,1}\right) - \left(\mu_{1,2} + \mu_{2,2}\right) = 0 \qquad (22)$$
This is of the form

$$H_0: \mathbf{1}'\mathbf{UO}' = \mathbf{0} \qquad (23)$$

The interaction null hypothesis is that there are no differences of differences, and consequently is of the form

$$H_0: \mathbf{OUO}' = \mathbf{0} \qquad (24)$$

(C.P. Write the null hypotheses for a 3 × 2 ANOVA row effect.)
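As a concrete check of these three forms, here is a minimal numpy sketch using a made-up 2 × 2 matrix of cell means that has a row effect but no column or interaction effect.

```python
import numpy as np

U = np.array([[10.0, 10.0],
              [12.0, 12.0]])                   # made-up cell means
O = np.hstack([np.eye(1), -np.ones((1, 1))])   # O = [I  -1] for 2 levels
one = np.ones((2, 1))                          # summing vector

print(O @ U @ one)      # row effect contrast, Eq. (20): nonzero (-4)
print(one.T @ U @ O.T)  # column effect contrast, Eq. (23): 0
print(O @ U @ O.T)      # interaction contrast, Eq. (24): 0
```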
(C.P. Write the null hypothesis for a simple main effect for columns at level 1 of the row effect.)

As straightforward as this system is, unfortunately the general linear hypothesis of Equation (10) requires all the means in a single vector $\boldsymbol{\mu}$, not in a matrix, and there is only one hypothesis matrix, not two as in the examples above. So how do we proceed? First, we need to define the Kronecker product.
Kronecker (Direct) Product

The Kronecker product of two matrices $\mathbf{A}$ (of order $p \times q$) and $\mathbf{B}$ (of order $r \times s$) is denoted $\mathbf{A} \otimes \mathbf{B}$, and can be written in partitioned form as

$$\mathbf{A} \otimes \mathbf{B} = \begin{bmatrix}
a_{1,1}\mathbf{B} & a_{1,2}\mathbf{B} & \cdots & a_{1,q}\mathbf{B} \\
a_{2,1}\mathbf{B} & a_{2,2}\mathbf{B} & \cdots & a_{2,q}\mathbf{B} \\
\vdots & \vdots & \ddots & \vdots \\
a_{p,1}\mathbf{B} & a_{p,2}\mathbf{B} & \cdots & a_{p,q}\mathbf{B}
\end{bmatrix} \qquad (25)$$
The Kronecker product above is of order $pq \times rs$. Kronecker products have some interesting properties. Let $\mathrm{vec}_c(\mathbf{X})$ be a column vector consisting of the columns of $\mathbf{X}$ stacked on top of each other. Let $\mathrm{vec}_r(\mathbf{X})$ be a column vector consisting of the rows of $\mathbf{X}$ transposed into columns and stacked on top of each other. Then

$$\mathrm{vec}_c(\mathbf{BSA}') = (\mathbf{A} \otimes \mathbf{B})\,\mathrm{vec}_c(\mathbf{S}) \qquad (26)$$

and, since $\mathrm{vec}_r(\mathbf{S}) = \mathrm{vec}_c(\mathbf{S}')$, we also have
$$\mathrm{vec}_r(\mathbf{ASB}') = (\mathbf{A} \otimes \mathbf{B})\,\mathrm{vec}_r(\mathbf{S}) \qquad (27)$$

$$\mathrm{vec}_r(\mathbf{ASB}) = (\mathbf{A} \otimes \mathbf{B}')\,\mathrm{vec}_r(\mathbf{S}) \qquad (28)$$

Consider the null hypothesis of Equation (20). This can be written as $H_0: \mathbf{OU1} = \mathbf{0}$, or as

$$H_0: (\mathbf{O} \otimes \mathbf{1}')\,\mathrm{vec}_r(\mathbf{U}) = \mathbf{0} \qquad (29)$$
Example. Consider the null hypothesis of no row effect in the 2 × 2 ANOVA.

$$(\mathbf{O} \otimes \mathbf{1}')\,\mathrm{vec}_r(\mathbf{U}) = \left( \begin{bmatrix} 1 & -1 \end{bmatrix} \otimes \begin{bmatrix} 1 & 1 \end{bmatrix} \right) \begin{bmatrix} \mu_{1,1} \\ \mu_{1,2} \\ \mu_{2,1} \\ \mu_{2,2} \end{bmatrix} = \begin{bmatrix} 1 & 1 & -1 & -1 \end{bmatrix} \begin{bmatrix} \mu_{1,1} \\ \mu_{1,2} \\ \mu_{2,1} \\ \mu_{2,2} \end{bmatrix} \qquad (30)$$
Equation (30) gives $\left(\mu_{1,1} + \mu_{1,2}\right) - \left(\mu_{2,1} + \mu_{2,2}\right)$, which is the quantity that is zero when there is no row main effect.
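A minimal numpy check of these identities: np.kron implements the Kronecker product of Equation (25), row-major flattening gives $\mathrm{vec}_r$ as defined above, and the matrices are arbitrary examples of my own.

```python
import numpy as np

rng = np.random.default_rng(2)
A = rng.normal(size=(2, 3))
S = rng.normal(size=(3, 4))
B = rng.normal(size=(4, 2))

vec_r = lambda X: X.reshape(-1)   # row-major stacking = vec_r as defined above

# Equation (28): vec_r(ASB) = (A kron B') vec_r(S)
print(np.allclose(vec_r(A @ S @ B), np.kron(A, B.T) @ vec_r(S)))  # True

# Equation (30): the row-effect hypothesis matrix for the 2 x 2 design
O, one_t = np.array([[1, -1]]), np.array([[1, 1]])
print(np.kron(O, one_t))          # [[ 1  1 -1 -1]]
```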
General Specification of the Hypothesis Matrix in Factorial Anova

1. Call the effects A, B, C, etc.
2. A conformable Omnibus Matrix for effect A is denoted $\mathbf{O}_A$.
3. A conformable Summing Vector for effect A is denoted $\mathbf{1}_A'$.
4. A selection vector $\mathbf{s}_{A_j}'$ is a row vector, conformable with factor A, with a 1 in position $j$ and zeroes elsewhere.
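The notes break off here, but the building blocks above combine via Equations (27)-(28): applying them to the row, column, and interaction forms (20), (23), and (24) yields the hypothesis matrices directly. The following sketch of that construction for a 3 × 2 design is my extrapolation under that assumption, not verbatim from the notes.

```python
import numpy as np

def omnibus(k):
    """O = [I  -1], a (k-1) x k omnibus contrast matrix, as in Eq. (21)."""
    return np.hstack([np.eye(k - 1), -np.ones((k - 1, 1))])

def summing(k):
    """1', a 1 x k summing vector."""
    return np.ones((1, k))

O_A, O_B = omnibus(3), omnibus(2)       # factor A has 3 levels, factor B has 2
one_A, one_B = summing(3), summing(2)

H_rows = np.kron(O_A, one_B)            # from O_A U 1_B = 0, via Eq. (28)
H_cols = np.kron(one_A, O_B)            # from 1_A' U O_B' = 0, via Eq. (27)
H_inter = np.kron(O_A, O_B)             # from O_A U O_B' = 0, via Eq. (27)
print(H_rows)   # each row contrasts one row-mean sum with the last row's
```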