an introduction to visual analysis of social networks
play

An Introduction to Visual Analysis of Social Networks Nan Cao @ - PowerPoint PPT Presentation

An Introduction to Visual Analysis of Social Networks Nan Cao @ HKUST nancao@cse.ust.hk April 2011 Agenda Introductions to visual analysis Community representation Analysis on Rich Context Social Medias Introduction Equation Tag


  1. An Introduction to Visual Analysis of Social Networks Nan Cao @ HKUST nancao@cse.ust.hk April 2011

  2. Agenda • Introductions to visual analysis • Community representation • Analysis on Rich Context Social Medias

  3. Introduction Equation Tag Clouds extracted from “Mining Organizational Structure in Social Network” • How can we understand and interpreted the analysis results in an intuitive way ? • The data mining results are not 100% correct, how can we estimate the errors and refine them precisely ?

  4. Introduction • Traditional data mining techniques – An automatic analysis process bases on various models for different purposes – Maximize the power of machines • Traditional data visualization techniques – Leverage human’s capability on pattern recognition and represent the multidimensional data in an intuitive way using various visual encodings – Maximize the power for human beings • Visual Analysis – A semi-automatic analysis process that combines analysis model (DM) , visual representation (Visualization) as well as user interactions (HCI) together. – Seamlessly connect humans with machines for the analysis purpose

  5. Introduction

  6. Introduction Visual Form Abstract Data View Raw Data User rendering filtering interactions Data Layout / Coloring Display / Sizing Mining Reference Model For Information Visualization and Visual Analysis References [1] Readings in Information Visualization: Using Vision to Think, Stuart K. Card, Jock Mackinlay, Ben Shneiderma. 1999 [2] prefuse: A Toolkit for Interactive Information Visualization, Jeffery Heer, Stuart K. Card, James A. Landay, ACM sigCHI, 2005

  7. Visualization On Social Networks www.visualcomplexity.com

  8. Visualization On Social Networks www.visualcomplexity.com Visualization is not to generate beautiful figures. More importantly, it help users to understand the information insights

  9. Agenda Visual Form Abstract Data View Raw Data User rendering filtering interactions Data Layout / Coloring Display / Sizing Mining • Introductions to visual analysis • Community representation • Analysis on Rich Context Social Medias

  10. Community (Cluster) Representations • Graph Layout Problem – Graph layout, as a branch of graph theory, applies topology and geometry to derive two-dimensional representations of graphs – Wikipedia • Layouts for cluster representations – Group the nodes with strong connections together (same as community detection). – Reduce overlaps of the nodes – Minimize the average edge length (reduce line crossings) – Keep a good symmetry of the graph (It is easier for users to identify patterns in a symmetry structure)

  11. Graph Layout Structure Edge oriented oriented Orthogonal Radial Cluster Hierarchy Force-Directed Hierarchical oriented oriented

  12. Graph Layout Structure Edge oriented oriented Graph layout, as a branch of graph Orthogonal theory, applies topology and Radial geometry to derive 2D representations of graphs – Wikipeia Cluster Hierarchy Force-Directed Hierarchical oriented oriented

  13. Force-directed graph layout • Graph layout = Energy minimization Ene • rgy Hence, the drawing algorithm is an iterative optimization process • Convergence to global minimum is not guaranteed ! Layou t Fine Result Radom Layout 13

  14. Force-directed graph layout • Cluster Properties – Proximity preservation: similar nodes are drawn closely • Aesthetical properties – Symmetry preservation: isomorphic sub- graphs are drawn identically – Minimized Edge length: reduce edge intersections – No external influences: “Let the graph speak for itself”

  15. Spring Embed Model  Edges are springs  Vertices are repelling particles  Force on vertex: f uv is force on spring  g uv is repelling force      ( ) F v f g uv uv   { , } u v E u V References: [3]A heuristic for drawing graph, P.Eades, 84. [4]Graph Drawing by Force-Directed Graph, Fruchterman, 91. [5]Drawing Graph Nicely Using Simulated Annealing, Davidson, 96. [6]A Fast Adaptive Layout Algorithm for Undirected Graphs, Frick, 94. [7]Spring Algorithms and Symmetry, Eades and X Lin, 99 15

  16. Model Comparison Clustering Model Layout Model      1    2 MDS:   2 min || || X X d min || || X X d i j ij i j ij 2 d   i j i j     n 1     2 Spectrum: T min min ( ) T X LX X X min Tr X LX ij i j 2  , i j E Spectrum Model [9, 10] Spring Embed Model [3-7] MDS Layout Model [8] [8] Graph Drawing by Stress Majorization, 2002, Graph Drawing [9] An r-Dimensional Quadratic Placement Algorithm, Kenneth M. Hall, 1970 [10] ACE: A fast multiscale eigenvector computation for drawing huge graphs, Y.Koren, L. Carmel and D. Harel, InfoVis 2002

  17. Agenda Visual Form Abstract Data View Raw Data User rendering filtering interactions Data Layout / Coloring Display / Sizing Mining • Introductions to visual analysis • Community representation • Explorative Analysis on Rich Context Social Media

  18. Rich Context Social Network The vertexes are connected Each vertex has by multiple relations multiple attributes age / sex / jobs location : city /county /state friends contact : emails / phones colleagues classmate Degree / Closeness / family Betweenness / Spectrum • How to analysis the network topology by considering multiple relationships? • How to analysis the network beyond the graph topology by considering the vertex attributes?

  19. Visual Analysis on Complex Relational Patterns (1) [11] NodeTrix: A Hybrid Visualization of Social Networks, Nathalie Henry et al. IEEE TVCG 2007 Demo:http://www.youtube.com/watch?v=7G3MxyOcHKQ

  20. Visual Analysis on Complex Relational Patterns (1) [11] NodeTrix: A Hybrid Visualization of Social Networks, Nathalie Henry et al. IEEE TVCG 2007 Demo:http://www.youtube.com/watch?v=7G3MxyOcHKQ

  21. Visual Analysis on Complex Relational Patterns (1) [11] NodeTrix: A Hybrid Visualization of Social Networks, Nathalie Henry et al. IEEE TVCG 2007 Demo:http://www.youtube.com/watch?v=7G3MxyOcHKQ

  22. Visual Analysis on Complex Relational Patterns (2) [12] FacetAtlas: Multifaceted Visualization for Rich Text Corpora, Nan Cao, et al. IEEE TVCG 2010

  23. • Symptoms • Treatments multiple facets • Causes • Tests & Diagnosis • Prognosis • Prevention • Complications 23

  24. Type2 (Q1) How to model the document contents into Metabolic Diabetes multifaceted relation Syndrome data? (Q2) How to intuitively Type1 visualize multifaceted document contents and their relations? (Q3) How to find the Gestational insight patterns Diabetes visually driven by users’ interests? 24

  25. Type2 (Q1) How to model the document contents into Metabolic Diabetes multifaceted relation Syndrome data? (Q2) How to intuitively Type1 visualize multifaceted document contents and their relations? (Q3) How to find the Gestational insight patterns Diabetes visually driven by users’ interests? How to visualize the relations of multifaceted document contents? 25

  26. (Q1) How to model the document contents into multifaceted relational data ? facet document set segmentation entity extraction entity set multifaceted entity relational data model type 1 type 2 diabetes diabetes Internal disease thirst blurred relations symptom vision treatment take blood sugar medications control External relations 26

  27. Rich Context Social Network The vertexes are connected Each vertex has by multiple relations multiple attributes age / sex / jobs location : city /county /state friends contact : emails / phones colleagues Degree / Closeness / classmate family Betweenness / Spectrum • How to analysis the network topology by considering multiple relationships? • How to analysis the network beyond the graph topology by considering the vertex attributes?

  28. Visual Analysis on Multidimensional Patterns (1) • Centrality : – Degree – Closeness – Betweenness – Eigenvector • Cluster Coefficient • Node Index Scatter Plot Matrix [13] The FlowVizMenu and Parallel Scatterplot Matrix: Hybrid Multidimensional Visualizations for Network Exploration. IEEE TVCG 2010 Demo: http://www.youtube.com/watch?v=f9Z0mPOnG_M

  29. Parallel Coordinates max min Degree Index Closeness Cluster Coef Eigenvector [14] A. Inselberg and B. Dimsdale. Parallel coordinates: a tool for visualizing multi- dimensional geometry, InfoVis 2000

  30. P-SPLOMs • Combine the parallel coordinates with the scatter plot matrix – Provide flexible interactions and let users to explore the whole dataset from multiple aspects will help on the pattern detectoin

  31. Demo

Recommend


More recommend