exponential family random network models ernm
play

Exponential-family Random Network Models (ERNM) Ian Fellows UCLA - PowerPoint PPT Presentation

Exponential-family Random Network Models (ERNM) Ian Fellows UCLA January 9, 2012 Ian Fellows Exponential-family Random Network Models (ERNM) The landscape Random graphs == Random connections, Fixed nodal attributes Gibbs/Markov


  1. Exponential-family Random Network Models (ERNM) Ian Fellows UCLA January 9, 2012 Ian Fellows Exponential-family Random Network Models (ERNM)

  2. The landscape ◮ Random graphs == Random connections, Fixed nodal attributes ◮ Gibbs/Markov random fields == Fixed connections, Random nodal attributes ◮ ERNM == Random connections, Random nodal attributes Ian Fellows Exponential-family Random Network Models (ERNM)

  3. ERNM Formulation Let Y be an n by n matrix who’s entries Y i , j indicate whether subject i and j are connected, where n is the size of the population. Further let X be a n × q matrix of nodal variates. We define the network to be the random variable ( Y , X ). Then a joint exponential family model for the network may be written as: 1 c ( η ) e η h ( x , y ) , P ( X = x , Y = y | η ) = ( x , y ) ∈ N (1) Ian Fellows Exponential-family Random Network Models (ERNM)

  4. Uninteresting Example: Separable Models Suppose that h is composed such that the model can be expressed as 1 c ( η 1 , η 2 ) e η 1 h 1 ( x )+ η 2 h 2 ( y ) P ( X = x , Y = y | η 1 , η 2 ) = ( x , y ) ∈ N . (2) Then 1 c 1 ( η 1 ) e η 1 h 1 ( x ) P ( X = x | η 1 ) = 1 c 2 ( η 2 ) e η 2 h 2 ( y ) . P ( Y = y | η 2 ) = Ian Fellows Exponential-family Random Network Models (ERNM)

  5. Pathological Example: Ising as a Joint Model Joint model: � � j y i , j + η 2 � � P ( X = x , Y = y | η 1 , η 2 ) ∝ e η 1 j x i y i , j x j . i i With conditional distributions being: e η 1 y i , j + η 2 x i y i , j x j P ( Y i , j = y i , j | X = x , η 1 , η 2 ) ∝ e η 2 � � j x i y i , j x j P ( X = x | Y = y , η 2 ) ∝ i Ian Fellows Exponential-family Random Network Models (ERNM)

  6. Pathological Example: Ising as a Joint Model Degeneracy: Oh My!!! # of edges within x = 1 # of edges within x = -1 Frequency Frequency 20000 20000 0 0 0 50 100 150 200 0 50 100 150 200 Count Count # of edges between x = 1 and x = -1 # of nodes with x = 1 Frequency Frequency 6000 4000 0 0 0 20 40 60 80 100 120 0 5 10 15 20 Count Count Figure: 100,000 draws from an Ising Joint Model with η 1 = 0 and η 2 = 0 . 13. Mean values are marked in red. Ian Fellows Exponential-family Random Network Models (ERNM)

  7. Dispare Oh well, better give up. Ian Fellows Exponential-family Random Network Models (ERNM)

  8. Hope But wait, is there a better measure of homophily which doesn’t display degeneracy? Ian Fellows Exponential-family Random Network Models (ERNM)

  9. ... ... 6 months pass .... Ian Fellows Exponential-family Random Network Models (ERNM)

  10. Regularized Homophily � � � d i , l − E binom ( reg homophily ( k , l ) = d i , l ) , i : x i = k where d i , l is the number of edges connecting node i to nodes in group l , and E binom ( � d i , l ) is the expectation of the square root of a binomial variable, with probability equal to the proportion of nodes in group l and size equal to the out-degree of node i . Ian Fellows Exponential-family Random Network Models (ERNM)

  11. Logistic Regression in Network Data 1 c ( β, η, λ ) e zx β · + η h ( x , y )+ λ g ( z , y ) . P ( Z = z , X = x , Y = y | η, β, λ ) = (3) e x i β P ( z i = 1 | z − i , x i , Y = y , β, λ ) = (4) e λ [ g ( z − , y ) − g ( z + , y )] + e x i β . where z − i represents the set of z not including z i , z + represents z where z i = 1, z − is z where z i = 0,and x i represents the i th row of X . Ian Fellows Exponential-family Random Network Models (ERNM)

  12. A Super-population Model for an Add Health High School Std. Error Z p-value η Mean Degree -167.90 8.51 -19.73 < 0.001 Log Variance of Degree 22.18 10.01 2.22 0.027 Degree = 0 3.91 0.47 8.28 < 0.001 Degree = 1 2.20 0.38 5.86 < 0.001 Degree = 2 0.73 0.35 2.05 0.041 Grade = 9 0.88 0.78 1.13 0.258 Grade = 10 1.74 0.92 1.89 0.058 Grade = 11 2.53 0.79 3.20 0.001 Within Grade Homophily 3.97 0.47 8.44 < 0.001 +1 Grade Homophily 0.50 0.33 1.54 0.125 +2 Grade Homophily -1.07 0.27 -4.03 < 0.001 +3 Grade Homophily -0.59 0.40 -1.47 0.143 Table: ERNM Model with Standard Errors Based on the Fisher Information Ian Fellows Exponential-family Random Network Models (ERNM)

  13. A Super-population Model for an Add Health High School Figure: Model-Based Simulated High School Ian Fellows Exponential-family Random Network Models (ERNM)

  14. A Super-population Model for an Add Health High School In-Degree Out-Degree 25 25 20 20 15 15 10 10 5 5 0 0 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 # of Edges Between Grades Grade Counts 140 30 120 25 100 80 20 60 15 40 20 10 0 10-10 11-10 12-10 9-10 10-11 11-11 12-11 9-11 10-12 11-12 12-12 9-12 10-9 11-9 12-9 9-9 9 10 11 12 Figure: Model Diagnostics Ian Fellows Exponential-family Random Network Models (ERNM)

  15. Logistic Regression on Substance Use: Naive model Std. Error Z p-value β Intercept -1.70 0.44 -3.84 < 0.001 Male 1.18 0.57 2.09 0.037 Table: Simple Logistic Regression Model Ignoring Network Structure Ian Fellows Exponential-family Random Network Models (ERNM)

  16. Logistic Regression on Substance Use: ERNM model Bootstrap Asymptotic η Std. Error Std. Error Z p-value Mean Degree -164.18 7.86 8.07 -20.36 < 0.001 Log Variance of Degree 20.35 8.85 9.07 2.24 0.025 Degree 0 4.01 0.45 0.44 9.12 < 0.001 Degree 1 2.25 0.37 0.35 6.44 < 0.001 Degree 2 0.74 0.36 0.35 2.08 0.038 Grade Homophily 3.85 0.46 0.46 8.41 < 0.001 +1 Grade Homophily 0.45 0.33 0.33 1.39 0.166 +2 Grade Homophily -1.14 0.28 0.25 -4.50 < 0.001 +3 Grade Homophily -0.58 0.39 0.38 -1.52 0.129 Sex Homophily 0.98 0.28 0.27 3.56 < 0.001 Substance Homophily 0.88 0.25 0.26 3.44 < 0.001 Intercept -1.79 0.49 0.43 -4.11 < 0.001 Male 0.94 0.56 0.52 1.81 0.070 Table: ERNM Model Inference Ian Fellows Exponential-family Random Network Models (ERNM)

  17. Logistic Regression on Substance Use: homophily diagnostics # of edges within non-substance users # of edges within substance users 200 200 Frequency Frequency 100 100 0 0 100 150 200 250 300 350 0 20 40 60 80 Count Count # of edges between users and non-users # of non-substance users 200 200 Frequency Frequency 100 100 0 0 20 40 60 80 100 120 140 45 50 55 60 65 Count Count Figure: Substance Use Homophily Diagnostics. The values of the observed statistics are marked in red. Ian Fellows Exponential-family Random Network Models (ERNM)

  18. Conclusion ERNM is a framework for inference about networks, including both the graph and the nodal characteristics. Ian Fellows Exponential-family Random Network Models (ERNM)

Recommend


More recommend