The Set of 3 × 4 × 4 Contingency Tables has 3-Neighborhood Property
Toshio Sumi and Toshio Sakata
Faculty of Design, Kyushu University
COMPSTAT 2010, 25 August 2010, Paris, France
Contents
1 Three way contingency tables
2 Motivation
3 r neighbourhood property and the main results
4 Markov basis
5 General Perspectives
6 3 × 3 × 4 contingency tables
7 Future work
Contingency tables

One way contingency tables: vectors (h_i)
Two way contingency tables: matrices (h_{ij})
Three way contingency tables: arrays (h_{ijk})

An I × J × K contingency table is an array (h_{ijk}), where 1 ≤ i ≤ I, 1 ≤ j ≤ J, and 1 ≤ k ≤ K. I × J × K contingency tables are in one-to-one correspondence with functions from [1..I] × [1..J] × [1..K] to Z_{≥0}, where [1..n] := {1, 2, ..., n}.
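To make the correspondence concrete, a 3 × 3 × 3 table can be stored in R as an array, so that the cell count h_{ijk} is simply h[i, j, k] (the numbers below are arbitrary illustrative data, not from the talk):

```r
# Illustrative only: a 3 x 3 x 3 contingency table as an R array of
# non-negative integers; h[i, j, k] plays the role of h_ijk.
h <- array(c(2, 0, 1,  1, 3, 0,  0, 1, 2,
             1, 1, 0,  0, 2, 1,  2, 0, 1,
             0, 2, 1,  1, 0, 2,  1, 1, 0),
           dim = c(3, 3, 3))
h[1, 2, 3]   # the count in cell (i, j, k) = (1, 2, 3)
```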
Motivation

In the analysis of three-way contingency tables we often use conditional inference, and the conditional test for three-way tables has recently attracted considerable attention (cf. Diaconis and Sturmfels, 1998; Aoki and Takemura, 2003, 2004).

For an I × J × K three-way table the probability function is
\[
P\{X = x \mid p\} \propto \prod_{(i,j,k) \in \mathcal{Z}} \frac{p_{ijk}^{x_{ijk}}}{x_{ijk}!},
\]
where \mathcal{Z} = \{(i,j,k) \mid 1 \le i \le I,\ 1 \le j \le J,\ 1 \le k \le K\}, x = (x_{ijk}; (i,j,k) \in \mathcal{Z}) is the family of cell counts, and p = (p_{ijk}; (i,j,k) \in \mathcal{Z}) is the family of cell probabilities.
Motivation

In the log-linear model the cell probability p_{ijk} is expressed as
\[
\log p_{ijk} = \mu + \tau^{X}_{i} + \tau^{Y}_{j} + \tau^{Z}_{k}
             + \tau^{XY}_{ij} + \tau^{YZ}_{jk} + \tau^{XZ}_{ik} + \tau^{XYZ}_{ijk},
\]
where \tau^{XYZ}_{ijk} is called the three-way interaction effect (Agresti, 1996). The hypothesis to be tested is
\[
H : \tau^{XYZ}_{ijk} = 0 \quad \text{for all } (i,j,k) \in \mathcal{Z},
\]
which means that there is no three-way interaction. Under the hypothesis H the sufficient statistic is the set of two-way x-y, y-z, and x-z marginals, and so the conditional probabilities become free of the parameters under H.
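As a side illustration (not part of the talk), the no-three-way-interaction model can be fitted in base R with loglin by supplying the three two-way margins; its likelihood-ratio statistic is the usual asymptotic test of H, to be contrasted with the exact conditional approach below. The toy data here are arbitrary:

```r
# Fit the log-linear model with all two-way interactions but no three-way
# interaction for a 3 x 3 x 3 table of toy counts.
set.seed(1)
tab <- array(rpois(27, lambda = 4), dim = c(3, 3, 3))
fit <- loglin(tab, margin = list(c(1, 2), c(1, 3), c(2, 3)), print = FALSE)
fit$lrt   # likelihood-ratio statistic for H: no three-way interaction
fit$df    # its degrees of freedom
```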
Motivation

Upon fixing all two-way marginals, the conditional distribution of X becomes
\[
P_{H}\{X = x \mid \alpha, \beta, \gamma\}
  = \frac{\prod_{(i,j,k)\in\mathcal{Z}} 1/x_{ijk}!}
         {\sum_{y \in \mathcal{F}} \prod_{(i,j,k)\in\mathcal{Z}} 1/y_{ijk}!},
\]
where \mathcal{F} = \mathcal{F}(\alpha, \beta, \gamma) is the set of three-way contingency tables with those two-way marginals, and \alpha, \beta and \gamma are the x-y, y-z and x-z marginals of the observed table, respectively. The important point is that this distribution is parameter free under H.

When X = x_0 is observed, our primary concern is to evaluate the probability
\[
\text{p-value} = P_{H}\{T(X) \ge T(x_0)\},
\]
where T is an appropriate test statistic.
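Whenever the fiber F is available as an explicit list of tables, this conditional probability and the exact p-value reduce to elementary arithmetic. A minimal R sketch, assuming fiber is such a list of arrays and stat is the chosen test statistic T (the names are ours, not the authors'):

```r
# Under H each table x in the fiber has probability proportional to
# prod(1 / x_ijk!); normalise on the log scale for numerical stability.
log_w <- function(x) -sum(lfactorial(x))   # log of prod 1/x_ijk!

exact_pvalue <- function(x0, fiber, stat) {
  lw    <- sapply(fiber, log_w)
  probs <- exp(lw - max(lw))
  probs <- probs / sum(probs)
  sum(probs[sapply(fiber, stat) >= stat(x0)])   # P_H{ T(X) >= T(x0) }
}
```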
Motivation

Let F(α, β, γ) be the set of contingency tables with marginals α, β, γ, and F(H) the set of contingency tables with the same marginals as H. To evaluate the p-value, we consider the Monte Carlo method, which estimates the p-value by sampling from F(α, β, γ) with a Markov chain. The Markov chain must be irreducible, and in order to generate an irreducible Markov chain we need a Markov basis B, by which all elements of F(α, β, γ) become mutually reachable through a sequence of moves in B without violating the non-negativity condition.
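For reference, here is a minimal sketch of one step of such a Markov chain in R (not the authors' code); basis is assumed to be a list of integer arrays whose two-way marginals are all zero, so that adding a move never changes the fixed marginals:

```r
# One Metropolis step over the fiber: pick a move, add it with a random sign,
# stay put if any cell would go negative, otherwise accept with the ratio for
# the target distribution proportional to prod(1 / x_ijk!).
mcmc_step <- function(tab, basis) {
  mv <- basis[[sample(length(basis), 1)]]
  if (runif(1) < 0.5) mv <- -mv
  cand <- tab + mv
  if (any(cand < 0)) return(tab)
  if (log(runif(1)) < sum(lfactorial(tab)) - sum(lfactorial(cand))) cand else tab
}
```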
Sequential conditional test

At stage t − 1, for a given dataset we have a contingency table H_{t−1} and consider the set F(H_{t−1}). If one new observation falls in cell (i_t, j_t, k_t), we obtain a new contingency table H_t by adding it to the dataset, and consider the set F(H_t). The two-way marginals of H_t are
\[
H_t(\cdot, j, k) =
\begin{cases}
H_{t-1}(\cdot, j, k) & (j,k) \ne (j_t, k_t) \\
H_{t-1}(\cdot, j_t, k_t) + 1 & (j,k) = (j_t, k_t)
\end{cases}
\]
\[
H_t(i, \cdot, k) =
\begin{cases}
H_{t-1}(i, \cdot, k) & (i,k) \ne (i_t, k_t) \\
H_{t-1}(i_t, \cdot, k_t) + 1 & (i,k) = (i_t, k_t)
\end{cases}
\]
\[
H_t(i, j, \cdot) =
\begin{cases}
H_{t-1}(i, j, \cdot) & (i,j) \ne (i_t, j_t) \\
H_{t-1}(i_t, j_t, \cdot) + 1 & (i,j) = (i_t, j_t).
\end{cases}
\]
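In other words, H_t differs from H_{t−1} only in the cell (i_t, j_t, k_t), and each two-way marginal gains 1 in exactly one entry. A small R sketch of this update (function and argument names are ours):

```r
# Adding one observation in cell (i_t, j_t, k_t): the count table gains 1 in
# that cell, which yields exactly the marginal update displayed above.
add_observation <- function(H, cell) {
  H[cell[1], cell[2], cell[3]] <- H[cell[1], cell[2], cell[3]] + 1
  H
}
```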
Sequential conditional test

In the sequential conditional test, consider
\[
\mathcal{F}(H_1) \to \mathcal{F}(H_2) \to \cdots \to \mathcal{F}(H_{t-1}) \to \mathcal{F}(H_t) \to \cdots .
\]
Although the MCMC test based on the Metropolis-Hastings algorithm is the standard tool for the sequential conditional test, our purpose is to obtain F(H_t) from the previous fiber F(H_{t−1}) and to compute completely exact probabilities for Fisher's exact test.
Probabilities – ex 1

Implementation in R for 3 × 3 × 3 contingency tables.

Step t   |F_t|   Ours         MCMC1    MCMC2
  21       12    0.2727273    0.2692   0.2715
  22       15    0.3628319    0.3622   0.3628
  23       19    0.4824798    0.4952   0.4747
  24       25    0.3872708    0.3766   0.3886
  25       32    0.1602634    0.1628   0.1618
  26       99    0.1176134    0.123    0.1203
  27      144    0.05369225   0.0534   0.0503
  28      152    0.03016754   0.0322   0.0291

MCMC1: (5·10^3, 5·10^2): select 5·10^3 tables, each obtained after a skip of 5·10^2 steps.
MCMC2: (10^4, 10^3)
Times – ex 1

Step t   |F_t|   Ours     MCMC1     MCMC2
  21       12    0.016     74.464    297.655
  22       15    0.033     73.699    302.268
  23       19    0.04      75.482    309.072
  24       25    0.046     77.406    314.035
  25       32    0.089     80.639    326.231
  26       99    0.232     88.107    354.706
  27      144    0.275     91.275    368.7
  28      152    0.138     91.899    372.771
Total            0.869    652.971   2645.438

MCMC1: (5·10^3, 5·10^2)   MCMC2: (10^4, 10^3)
Times – ex 2

Step t   |F_t|    Ours      MCMC1      MCMC2       Prob.
  21        4     0.026      66.655     282.103    1.0
  31       63     0.045      83.659     337.438    0.2169823
  42      253     0.565      96.445     390.396    0.2925166
  50     1168     0.132     114.436     464.128    0.4415128
  51     1493     1.961     117.895     479.225    0.4047599
  60     6663    15.482     141.59      572.312    0.1865068
  61    11599    42.942     151.726     617.728    0.1059830
  71    15784     0.556     154.862     626.537    0.06565988
  72    17285    15.624     154.978     628.453    0.05635573
  73    17285     0.727     154.922     626.177    0.04961687
Total           149.51     5921.512   23969.86

MCMC1: (5·10^3, 5·10^2)   MCMC2: (10^4, 10^3)
I × J × K contingency table

An I × J × K contingency table consists of K slices, each an I × J matrix of non-negative integers.

i\j (k = 1)                 i\j (k = 2)                 i\j (k = 3)
h_111  h_121  h_131         h_112  h_122  h_132         h_113  h_123  h_133
h_211  h_221  h_231         h_212  h_222  h_232         h_213  h_223  h_233
h_311  h_321  h_331         h_312  h_322  h_332         h_313  h_323  h_333

Table: 3 × 3 × 3 contingency table
Marginals for an I × J × K contingency table

i\j (x-y marginal)          j\k (y-z marginal)          i\k (x-z marginal)
h_{11·}  h_{12·}  h_{13·}   h_{·11}  h_{·12}  h_{·13}   h_{1·1}  h_{1·2}  h_{1·3}
h_{21·}  h_{22·}  h_{23·}   h_{·21}  h_{·22}  h_{·23}   h_{2·1}  h_{2·2}  h_{2·3}
h_{31·}  h_{32·}  h_{33·}   h_{·31}  h_{·32}  h_{·33}   h_{3·1}  h_{3·2}  h_{3·3}

Table: Marginals of a 3 × 3 × 3 contingency table

\[
h_{ij\cdot} = \sum_{s=1}^{K} h_{ijs}, \qquad
h_{\cdot jk} = \sum_{s=1}^{I} h_{sjk}, \qquad
h_{i\cdot k} = \sum_{s=1}^{J} h_{isk}
\]
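A brief R illustration of these definitions (toy data, our own names): each two-way marginal is obtained by summing the array over the remaining index.

```r
set.seed(1)
h <- array(sample(0:5, 27, replace = TRUE), dim = c(3, 3, 3))  # toy 3x3x3 table

xy <- apply(h, c(1, 2), sum)   # h_{ij.} : sum over k  (x-y marginal)
yz <- apply(h, c(2, 3), sum)   # h_{.jk} : sum over i  (y-z marginal)
xz <- apply(h, c(1, 3), sum)   # h_{i.k} : sum over j  (x-z marginal)
```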
Markov basis

Goal: find F(H_t) from F(H_{t−1}). Put F_t = F(H_t) for each t. Let φ_t be the map from F_{t−1} to F_t defined by adding 1 to the (i_t, j_t, k_t)-cell.

Remark. A table T ∈ F_t with T_{i_t j_t k_t} > 0 lies in the image of φ_t. Thus it remains to find all tables T ∈ F_t with T_{i_t j_t k_t} = 0.
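A sketch of φ_t in R, assuming each fiber is kept as a list of arrays (the names are illustrative, not the authors' code):

```r
# phi_t sends each table of F_{t-1} to F_t by adding 1 to cell (i_t, j_t, k_t).
phi_t <- function(fiber_prev, cell) {
  lapply(fiber_prev, function(tab) {
    tab[cell[1], cell[2], cell[3]] <- tab[cell[1], cell[2], cell[3]] + 1
    tab
  })
}
# By the remark, phi_t(F_{t-1}) contains every table of F_t whose
# (i_t, j_t, k_t) entry is positive; only the tables with a zero there remain.
```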
Markov basis

From now on we assume (i_t, j_t, k_t) = (1, 1, 1) for simplicity. By the above remark, we need to consider how to generate the tables H ∈ F_t with H_{111} = 0. An idea is to use a Markov basis.
Markov basis

For any T, H ∈ F_t there is a sequence of moves F_1, ..., F_s from the Markov basis such that
H_1 := H + F_1 ∈ F_t,
H_2 := H_1 + F_2 ∈ F_t,
...
H_s := H_{s−1} + F_s ∈ F_t,
and T = H_s.
Markov basis

We want to find a set of moves B such that for any T ∈ F_t with T_{111} = 0 there are H ∈ φ_t(F_{t−1}) and F ∈ B such that T + F = H.
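Given such a set of moves B, the missing tables can be generated directly from the image of φ_t. A hedged R sketch (our names: image_tabs stands for φ_t(F_{t−1}) and moves for B, both lists of arrays):

```r
# Every wanted T satisfies T = H - F for some H in phi_t(F_{t-1}) and F in B.
# Since each move has zero two-way marginals, T automatically has the right
# marginals; only non-negativity and T[1,1,1] == 0 need to be checked.
new_tables <- function(image_tabs, moves) {
  out <- list()
  for (H in image_tabs) {
    for (mv in moves) {
      cand <- H - mv
      if (all(cand >= 0) && cand[1, 1, 1] == 0) out[[length(out) + 1]] <- cand
    }
  }
  unique(out)
}
```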