a tale of two communities
play

A Tale of two communities Assessing Homophily in Node-Link Diagrams - PowerPoint PPT Presentation

A Tale of two communities Assessing Homophily in Node-Link Diagrams 23rd International Symposium on Graph-Drawing and Network Visualization Los Angeles, September 26, 2015 Wouter Meulemans City University London Andr e Schulz


  1. A Tale of two communities Assessing Homophily in Node-Link Diagrams 23rd International Symposium on Graph-Drawing and Network Visualization Los Angeles, September 26, 2015 Wouter Meulemans City University London Andr´ e Schulz FernUniversit¨ at in Hagen

  2. Homophily A Tale of two Communities Meulemans and Schulz, GD15

  3. Homophily � homophily is a concept in social network analysis A Tale of two Communities Meulemans and Schulz, GD15

  4. Homophily � homophily is a concept in social network analysis � more likely that two individuals with a common charactristic form a link → homophily A Tale of two Communities Meulemans and Schulz, GD15

  5. Homophily � homophily is a concept in social network analysis � more likely that two individuals with a common charactristic form a link → homophily (example: same-gender links are more likely in a friendship-networks) A Tale of two Communities Meulemans and Schulz, GD15

  6. Homophily � homophily is a concept in social network analysis � more likely that two individuals with a common charactristic form a link → homophily (example: same-gender links are more likely in a friendship-networks) � reason 1 for homophily: “Birds of feather flock together” (social selection) A Tale of two Communities Meulemans and Schulz, GD15

  7. Homophily � homophily is a concept in social network analysis � more likely that two individuals with a common charactristic form a link → homophily (example: same-gender links are more likely in a friendship-networks) � reason 1 for homophily: “Birds of feather flock together” (social selection) � reason 2 for homophily: we form characteristics similar to our friends (social influence) A Tale of two Communities Meulemans and Schulz, GD15

  8. Homophily � homophily is a concept in social network analysis � more likely that two individuals with a common charactristic form a link → homophily (example: same-gender links are more likely in a friendship-networks) � reason 1 for homophily: “Birds of feather flock together” (social selection) � reason 2 for homophily: we form characteristics similar to our friends (social influence) � also effects opposite to homophily can occur (heterophily) A Tale of two Communities Meulemans and Schulz, GD15

  9. Homophily � homophily is a concept in social network analysis � more likely that two individuals with a common charactristic form a link → homophily (example: same-gender links are more likely in a friendship-networks) � reason 1 for homophily: “Birds of feather flock together” (social selection) � reason 2 for homophily: we form characteristics similar to our friends (social influence) � also effects opposite to homophily can occur (heterophily) � homophily is not restricted to social networks (Question: groups = clusters?) A Tale of two Communities Meulemans and Schulz, GD15

  10. Formalizing Homophily A Tale of two Communities Meulemans and Schulz, GD15

  11. Formalizing Homophily Group B Group A fraction p of the individuals fraction q of the individuals A Tale of two Communities Meulemans and Schulz, GD15

  12. Formalizing Homophily q 2 2 pq Group B Group A p 2 fraction p of the individuals fraction q of the individuals A random link is - with probability p 2 : A ↔ A - with probability q 2 : B ↔ B - with probability 2 pq : A ↔ B A Tale of two Communities Meulemans and Schulz, GD15

  13. Formalizing Homophily q 2 2 pq Group B Group A p 2 fraction p of the individuals fraction q of the individuals A random link is - with probability p 2 : A ↔ A - with probability q 2 : B ↔ B - with probability 2 pq : A ↔ B Homophily Test If the fraction of the between-group links is significantly smaller than 2 pq we have homophily. A Tale of two Communities Meulemans and Schulz, GD15

  14. Degree of Homophily � we want to measure the degree of homophily in a network A Tale of two Communities Meulemans and Schulz, GD15

  15. Degree of Homophily � we want to measure the degree of homophily in a network � only cross-group links (heterophily) 1 Important Cases � 2 pq cross-group links (balanced) 2 � no cross-group links (homophily) 3 A Tale of two Communities Meulemans and Schulz, GD15

  16. Degree of Homophily � we want to measure the degree of homophily in a network � only cross-group links (heterophily) 1 Important Cases 0 � 2 pq cross-group links (balanced) 2 1/2 � no cross-group links (homophily) 3 1 Degree of Homophily A Tale of two Communities Meulemans and Schulz, GD15

  17. Degree of Homophily � we want to measure the degree of homophily in a network � only cross-group links (heterophily) 1 Important Cases 0 � 2 pq cross-group links (balanced) 2 1/2 � no cross-group links (homophily) 3 1 Degree of Homophily degree of homophily 1 1 / 2 interpolate all other values linearly 0 fraction of cross-group links 1 2 pq 0 A Tale of two Communities Meulemans and Schulz, GD15

  18. Research Questions A Tale of two Communities Meulemans and Schulz, GD15

  19. Research Questions Can an observer assess homophily in a node-link diagram? A Tale of two Communities Meulemans and Schulz, GD15

  20. Research Questions Can an observer assess homophily in a node-link diagram? Subquestions: � Which node-link diagram layout is best suitable for detecting homophily? A Tale of two Communities Meulemans and Schulz, GD15

  21. Research Questions Can an observer assess homophily in a node-link diagram? Subquestions: � Which node-link diagram layout is best suitable for detecting homophily? � Is there a tendency for overestimation or underestimation? A Tale of two Communities Meulemans and Schulz, GD15

  22. Research Questions Can an observer assess homophily in a node-link diagram? Subquestions: � Which node-link diagram layout is best suitable for detecting homophily? � Is there a tendency for overestimation or underestimation? � Are there general design principles to improve homophily detection? A Tale of two Communities Meulemans and Schulz, GD15

  23. Research Questions Can an observer assess homophily in a node-link diagram? Subquestions: � Which node-link diagram layout is best suitable for detecting homophily? � Is there a tendency for overestimation or underestimation? � Are there general design principles to improve homophily detection? � We only consider node-link diagrams and the ! “two-groups-scenario” A Tale of two Communities Meulemans and Schulz, GD15

  24. Layouts force-directed polarized bipartite A Tale of two Communities Meulemans and Schulz, GD15

  25. Layouts force-directed polarized bipartite � layout based on the Fruchtermann–Reingold Algorithm � implementation taken from the d3.js library A Tale of two Communities Meulemans and Schulz, GD15

  26. Layouts force-directed polarized bipartite � modification of the force-directed layout � additional forces pull blue vertices to the left and red vertices to the right A Tale of two Communities Meulemans and Schulz, GD15

  27. Layouts force-directed polarized bipartite � groups are placed on opposing vertical lines � barycentric layout + sifting to remove crossings � different shapes for cross-group/within-group edges A Tale of two Communities Meulemans and Schulz, GD15

  28. Layouts force-directed polarized bipartite group separation A Tale of two Communities Meulemans and Schulz, GD15

  29. Layouts force-directed polarized bipartite group separation homophily detection easier? other tasks more difficult? A Tale of two Communities Meulemans and Schulz, GD15

  30. Hypothesis A Tale of two Communities Meulemans and Schulz, GD15

  31. Hypothesis H1 For Homophily assessment we have force-directed < polarized < bipartite x < y means y is better than x A Tale of two Communities Meulemans and Schulz, GD15

  32. Hypothesis H1 For Homophily assessment we have force-directed < polarized < bipartite x < y means y is better than x H2 For Homophily assesment we have unbalanced < balanced A Tale of two Communities Meulemans and Schulz, GD15

  33. Hypothesis H1 For Homophily assessment we have force-directed < polarized < bipartite x < y means y is better than x H2 For Homophily assesment we have unbalanced < balanced H3 For shortest path queries we have force-directed > polarized > bipartite A Tale of two Communities Meulemans and Schulz, GD15

  34. User Study Design A Tale of two Communities Meulemans and Schulz, GD15

  35. User Study Design mixed design (too much trials otherwise) A Tale of two Communities Meulemans and Schulz, GD15

  36. User Study Design mixed design (too much trials otherwise) between subject - 3 graph sizes (20-28 nodes, 20-40 edges) A Tale of two Communities Meulemans and Schulz, GD15

  37. User Study Design mixed design (too much trials otherwise) between subject - 3 graph sizes (20-28 nodes, 20-40 edges) within subjects - 3 layouts - balanced (50:50) and unbalanced (25:75) - 5 degree of homophily levels (only 3 for unbalanced) - 2 tasks (homophily / length of shortest path) A Tale of two Communities Meulemans and Schulz, GD15

Recommend


More recommend