Introduction to Multilevel Analysis
Prof. Dr. Ulrike Cress
Knowledge Media Research Center Tübingen
Aim of this session
• What is the problem with multilevel data?
• Options for handling multilevel data in CSCL
Caution: After this presentation you will not be able to run or fully understand an HLM model, but you will be aware of the mistakes you can make, and you will get some take-home messages.
Intro to the topic: "Extraverted children perform better in school."
What may be the reason for that? What processes may lie behind it? What does this mean statistically?
What is the problem with multilevel data?
Example: Effect of Extraversion on Learning Outcome
IV: extraversion
DV: performance
First view on the data
Extraversion: 7, 8, 6, 5, 2, 4, 4, 5, 4, 5
Performance: 13, 14, 13, 9, 14, 7, 12, 12, 11, 10
[Three scatterplots of post (performance) against pre (extraversion):]
• Pooled (n = 10): r = .26
• Aggregated (means of the groups; n = 3): r = .99
• Within the groups: r = .86, r = -.82, r = -.30; mean correlation (n = 3): r = -.08
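A minimal sketch of these three views on the same data. The assignment of the ten individuals to the three groups is a hypothetical assumption (the slide does not state it), so the printed values will not exactly reproduce the correlations above:

```python
import pandas as pd

# Data from the slide; the group column is an assumed, illustrative assignment
df = pd.DataFrame({
    "extraversion": [7, 8, 6, 5, 2, 4, 4, 5, 4, 5],
    "performance":  [13, 14, 13, 9, 14, 7, 12, 12, 11, 10],
    "group":        [1, 1, 1, 2, 2, 2, 3, 3, 3, 3],  # hypothetical grouping
})

# Pooled correlation: all individuals thrown together (n = 10)
r_pooled = df["extraversion"].corr(df["performance"])

# Aggregated correlation: one mean per group (n = 3)
means = df.groupby("group")[["extraversion", "performance"]].mean()
r_aggregated = means["extraversion"].corr(means["performance"])

# Within-group correlations and their mean
r_within = df.groupby("group").apply(
    lambda g: g["extraversion"].corr(g["performance"])
)
print(r_pooled, r_aggregated, r_within.mean())
```

The point of the sketch: one and the same data set yields three very different correlations depending on the level of analysis.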
Hierarchical data
Individual observations are not independent.
Question
• What does it mean statistically if the variance within the groups is small?
• with regard to the standard deviation?
• with regard to F?
• with regard to alpha?
Impact on statistics
• Analysis of variance leans heavily on the assumption of independence of observations: $F = \mathrm{Var}_{between} / \mathrm{Var}_{within}$
• Underestimation of the standard error
• Large number of spuriously "significant" results
• Inflation of alpha
Alpha inflation
[Table: actual alpha level as a function of the number of groups and the group size]
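A minimal simulation sketch of this inflation, with hypothetical parameter values: the groups share a random group effect (non-independence), there is no true condition effect at all, yet a naive t-test on the individuals rejects far more often than the nominal 5%:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
n_sims, n_groups, group_size, icc = 2000, 6, 10, 0.3  # assumed values

false_positives = 0
for _ in range(n_sims):
    # Group random effects induce non-independence; true condition effect is zero
    group_effects = rng.normal(0, np.sqrt(icc), n_groups)
    y = (np.repeat(group_effects, group_size)
         + rng.normal(0, np.sqrt(1 - icc), n_groups * group_size))
    # Assign half of the groups to each condition, then test on individuals
    cond = np.repeat(np.arange(n_groups) % 2, group_size)
    _, p = stats.ttest_ind(y[cond == 0], y[cond == 1])
    false_positives += p < 0.05

print(f"Empirical alpha: {false_positives / n_sims:.3f}  (nominal: 0.05)")
```

With these settings the empirical alpha comes out far above .05, which is exactly the "spuriously significant results" named on the previous slide.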
1st take-home message
You are not allowed to use standard statistics with multilevel data.
Stochastic non-independence is caused by ...
1. Composition: people in the groups are already similar before the study even begins. This is a problem if you cannot randomize.
Stochastic non-independence is caused by ...
2. Common fate: caused by shared experiences during the experiment. This is always a problem in CL.
Stochastic non-independence is caused by ...
3. Interaction and reciprocal influence.
Hierarchical data
Intra-class correlation
2nd take-home message: Relevance for the Learning Sciences
• CL is explicitly based on the idea of creating non-independence.
• We want people to interact, to learn from each other, etc.
• CL should even aim at examining the effects of non-independence.
• If you work with CL data, you have to treat the multilevel structure of the data not just as noise but as an intended effect.
How to do this adequately? Possible solutions:
1. Working with fakes
2. Groups as the unit of analysis
3. Slopes as outcomes
4. Hierarchical linear modeling (HLM)
5. Fragmentary (but useful) solutions
Solution 1: Working with fakes
Classical example: the conformity studies by Asch (1951)
• fake group members (confederates)
• bogus feedback
Solution 1: Working with fakes
Pros:
• well-established method in social psychology
• high standardization
• the situation makes people behave as if they were in a group, but it yields statistically independent data
• allows causal conclusions
• sometimes easy to do in CSCL thanks to anonymity
Solution 1: Working with fakes
Cons:
• artificial situation
• no flexibility
• only simple action-reaction pairs can be faked; no real process of reciprocal interaction
• no dynamics
Solution 2: Unit of analysis
• Group level: aggregated data, i.e. each group is reduced to its means M(x) and M(y)
Pros:
• statistically independent measures
Cons:
• many groups are needed
• waste of data
• results are not valid for the individual level (Robinson effect)
Robinson effect (1950)
• illiteracy rate in nine geographic regions (1930)
• percentage of Black population (1930)
• correlation across regions: r = .95
• correlation across individuals: r = .20
Ecological fallacy: inferences about the nature of specific individuals are based solely upon aggregate statistics collected for the group to which those individuals belong.
Problem: the unit of analysis
3rd take-home message
You can use group-level data, but the results describe only the groups, not the individuals.
Solution 2: Unit of analysis
• Individual level: centering around the group mean (or standardizing within groups) eliminates group effects: each value is replaced by its deviation from the group mean, x - M(x) and y - M(y), with M taken per group (see the sketch below).
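A minimal pandas sketch of group-mean centering; the data and the column names are hypothetical stand-ins:

```python
import pandas as pd

df = pd.DataFrame({
    "group":        [1, 1, 2, 2, 3, 3],          # hypothetical data
    "extraversion": [7, 8, 5, 2, 4, 5],
    "performance":  [13, 14, 9, 14, 12, 10],
})

# Subtract each group's mean: removes all between-group differences
for col in ["extraversion", "performance"]:
    df[col + "_centered"] = df[col] - df.groupby("group")[col].transform("mean")

# Standardizing within groups additionally equalizes the group variances
df["extraversion_z"] = df.groupby("group")["extraversion"].transform(
    lambda s: (s - s.mean()) / s.std()
)
```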
Solution 2: Unit of analysis
Pros:
• easy to do
• makes use of all data at the individual level
Cons:
• centering works only if the variances are homogeneous
• standardization loses the information about heterogeneous variances
• differences between groups are treated as mere error variance
Solution 3: Slopes as outcomes (Burstein, 1982)
[Figure: separate regression lines y = ax + b for Team 1 and Team 2, plotting performance (y) against extraversion (x)]
Solution 3: Slopes as outcomes
Pros:
• uses all information
• focus is on interaction effects between a group-level (team) and an individual-level variable
Cons:
• purely descriptive
• compares only the groups that are given; no random effects are considered
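A minimal sketch of the slopes-as-outcomes idea: fit one regression per group, then treat the estimated slopes as group-level outcome variables (data and column names are hypothetical):

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "team":         [1, 1, 1, 2, 2, 2, 3, 3, 3],   # hypothetical data
    "extraversion": [7, 8, 6, 5, 2, 4, 4, 5, 4],
    "performance":  [13, 14, 13, 9, 14, 7, 12, 12, 11],
})

# Step 1: one regression per team; np.polyfit returns (slope, intercept)
slopes = df.groupby("team").apply(
    lambda g: np.polyfit(g["extraversion"], g["performance"], 1)[0]
)

# Step 2: the slopes are now group-level DVs, e.g. to relate to a
# group-level predictor such as teacher experience
print(slopes)
```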
4th take-home message
Consider the slopes of the different groups: they show group effects! For example, it is a feature of the group if its extraverted members are more effective. Slopes describe groups; slopes are DVs.
Solution 4: Hierarchical Linear Model (Bryk & Raudenbush, 1992)
Two main ideas:
1. The groups you have data from represent a randomly chosen sample from a population of groups (random-effects model).
2. The slopes and intercepts are systematically varying variables.
Solution 4: Hierarchical Linear Model (Bryk & Raudenbush, 1992)
[Figure: regression lines y = ax + b for Team 1 and Team 2 with varying slopes and varying intercepts, both predicted with 2nd-level variables; performance (y) against extraversion (x)]
Solution 4: Hierarchical Linear Model (Bryk & Raudenbush, 1992)
Equation system of systematically varying regressions
Level 1: $Y_{ij} = \beta_{0j} + \beta_{1j} X_{ij} + r_{ij}$
where $\beta_{0j}$ = intercept of group j, $\beta_{1j}$ = regression slope of group j, $r_{ij}$ = residual error
HLM: Equation system
Level 1: $Y_{ij} = \beta_{0j} + \beta_{1j} X_{ij} + r_{ij}$
Level 2: $\beta_{0j} = \gamma_{00} + \gamma_{01} W_j + u_{0j}$
         $\beta_{1j} = \gamma_{10} + \gamma_{11} W_j + u_{1j}$
where $W_j$ = explanatory variable on level 2, e.g. teacher experience
Total model
Level 1: $Y_{ij} = \beta_{0j} + \beta_{1j} X_{ij} + r_{ij}$  (1)
Level 2: $\beta_{0j} = \gamma_{00} + \gamma_{01} W_j + u_{0j}$  (2)
         $\beta_{1j} = \gamma_{10} + \gamma_{11} W_j + u_{1j}$  (3)
Substituting (2) and (3) into (1):
$Y_{ij} = (\gamma_{00} + \gamma_{01} W_j + u_{0j}) + (\gamma_{10} X_{ij} + \gamma_{11} W_j X_{ij} + u_{1j} X_{ij}) + r_{ij}$  (4)
$Y_{ij} = \underbrace{(\gamma_{00} + \gamma_{01} W_j + \gamma_{10} X_{ij} + \gamma_{11} W_j X_{ij})}_{\text{fixed part}} + \underbrace{(u_{1j} X_{ij} + u_{0j} + r_{ij})}_{\text{random (error) part}}$  (5)
[Figure: regression lines for W = +1, W = 0, W = -1, where W is the group predictor (e.g. teacher experience); y = performance, x = extraversion]
$Y_{ij} = (\gamma_{00} + \gamma_{01} W_j + \gamma_{10} X_{ij} + \gamma_{11} W_j X_{ij}) + (u_{1j} X_{ij} + u_{0j} + r_{ij})$
Interpretation of the terms: $\gamma_{00}$ = performance of an individual at x = 0; $\gamma_{01}$ = influence of teacher experience; $\gamma_{10}$ = influence of extraversion; $\gamma_{11}$ = cross-level interaction; $u_{0j}$ = random intercept; $u_{1j} X_{ij}$ = random slopes; $r_{ij}$ = residual.
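A minimal sketch of how this full model could be fitted with statsmodels. The data frame and all column names (team, teacher_exp, extraversion, performance) are hypothetical, and the synthetic data merely stands in for a real CSCL data set:

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

# Synthetic data with known structure (all names and values hypothetical)
rng = np.random.default_rng(1)
n_teams, team_size = 30, 8
team = np.repeat(np.arange(n_teams), team_size)
teacher_exp = np.repeat(rng.normal(10, 3, n_teams), team_size)   # level-2 W
extraversion = rng.normal(5, 2, n_teams * team_size)             # level-1 X
u0 = np.repeat(rng.normal(0, 1.0, n_teams), team_size)           # random intercepts
u1 = np.repeat(rng.normal(0, 0.3, n_teams), team_size)           # random slopes
performance = (10 + 0.2 * teacher_exp + 0.5 * extraversion
               + 0.05 * teacher_exp * extraversion
               + u0 + u1 * extraversion
               + rng.normal(0, 1, n_teams * team_size))
df = pd.DataFrame({"team": team, "teacher_exp": teacher_exp,
                   "extraversion": extraversion, "performance": performance})

# Fixed part g00 + g01*W + g10*X + g11*W*X ('*' expands to main effects plus
# the cross-level interaction); random intercept and random slope per team
model = smf.mixedlm("performance ~ extraversion * teacher_exp", data=df,
                    groups=df["team"], re_formula="~extraversion")
print(model.fit().summary())
```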
How to proceed? Iterative testing of different models.
Baseline model: null model (intercept-only model)
Full model: $Y_{ij} = (\gamma_{00} + \gamma_{01} W_j + \gamma_{10} X_{ij} + \gamma_{11} W_j X_{ij}) + (u_{1j} X_{ij} + u_{0j} + r_{ij})$
Null model: $Y_{ij} = \gamma_{00} + u_{0j} + r_{ij}$
with $\gamma_{00}$ = grand mean, $u_{0j}$ = variance between groups, $r_{ij}$ = residual
Baseline model: null model (intercept-only model)
Randomly varying intercepts:
$Y_{ij} = \gamma_{00} + u_{0j} + r_{ij}$
Baseline model: null model (intercept-only model)
$Y_{ij} = \gamma_{00} + u_{0j} + r_{ij}$
What proportion of the variance is explained by the groups?
Intraclass correlation: $ICC = \dfrac{\mathrm{Var}(u_{0j})}{\mathrm{Var}(u_{0j}) + \mathrm{Var}(r_{ij})}$
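A minimal statsmodels sketch of the null model and the ICC computed from its variance components, reusing the hypothetical df built in the full-model sketch above:

```python
import statsmodels.formula.api as smf

# Intercept-only model: Y_ij = g00 + u0j + r_ij
null_model = smf.mixedlm("performance ~ 1", data=df, groups=df["team"]).fit()

var_between = null_model.cov_re.iloc[0, 0]  # Var(u0j): between-group variance
var_within = null_model.scale               # Var(r_ij): residual variance
icc = var_between / (var_between + var_within)
print(f"ICC = {icc:.3f}")
```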
2nd model: random intercept model with a first-level predictor
We predict the individual measures with a first-level predictor $X_{ij}$:
$Y_{ij} = \gamma_{00} + \gamma_{10} X_{ij} + u_{0j} + r_{ij}$
2nd model: random intercept model with a first-level predictor
• randomly varying intercepts
• same slope for all groups
$Y_{ij} = \gamma_{00} + \gamma_{10} X_{ij} + u_{0j} + r_{ij}$
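A minimal sketch of this second model, again reusing the hypothetical df from the full-model sketch: random intercepts per team, but one common slope for extraversion.

```python
import statsmodels.formula.api as smf

# Random intercepts per team, fixed common slope for the level-1 predictor
m2 = smf.mixedlm("performance ~ extraversion", data=df, groups=df["team"]).fit()
print(m2.summary())
```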
3rd model: random intercept model with a second-level predictor
We predict the intercepts with a second-level predictor $W_j$:
$Y_{ij} = \gamma_{00} + \gamma_{01} W_j + \gamma_{10} X_{ij} + u_{0j} + r_{ij}$
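A minimal sketch of the third model with the hypothetical df from above: the level-2 predictor (here teacher_exp) enters the fixed part, while the intercepts still vary randomly across teams.

```python
import statsmodels.formula.api as smf

# Level-1 predictor plus level-2 predictor in the fixed part;
# random intercepts per team, no random slope yet
m3 = smf.mixedlm("performance ~ extraversion + teacher_exp",
                 data=df, groups=df["team"]).fit()
print(m3.summary())
```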
Recommendations
More recommendations