on the diversity of graphs with high variable node degrees
play

On the Diversity of Graphs with High Variable Node Degrees Lun Li - PowerPoint PPT Presentation

On the Diversity of Graphs with High Variable Node Degrees Lun Li David Alderson John C. Doyle Walter Willinger 2006 WIT Main Ideas of This Talk In general, there exist multiple graphs having the same aggregate statistics (we know


  1. On the Diversity of Graphs with High Variable Node Degrees Lun Li David Alderson John C. Doyle Walter Willinger 2006 WIT

  2. Main Ideas of This Talk • In general, there exist multiple graphs having the same aggregate statistics (we know this!) • BUT, how to characterize the “diversity” among these graphs? – Use degree distribution as an example – Similar questions arise for other metrics • Some graph theoretic metrics implicitly measure against a “background set”. – the nature of this background set can have serious implications for its interpretation – OR… are all graph theoretic measures comparable?

  3. Some notation Let denote the degree of node i Call degree sequence of graph We will focus on diversity among graphs having the SAME degree sequence D… … particularly when D is scaling. Deterministic form of scaling Relationship:

  4. Scaling and high variability • For a sequence D, If D is Scaling (n → ∞ ), α <2 , CV(D) = ∞ – Star (n → ∞ ), CV(D) = ∞ – Chain (n → ∞ ), CV(D) = 0 – – If D has exponential form, CV(D) = Constant

  5. Variability in the space of graphs G(D) (b) Random (e) Graph Degree (a) HSFnet Node Rank 1 10 0 10 (d) HOTnet (c) Poor 1 2 10 10 Design Node Degree Link / Router Speed (Gbps) 5.0 – 10.0 0.05 – 0.1 50 – 100 0.5 – 0.1 0.01 – 0.05 1.0 – 5.0 10 – 50 0.1 – 0.5 0.5 – 1.0 0.005 – 0.01 5 – 10 0.05 – 0.1 0.1 – 0.5 0.001 – 0.05 1 – 5 0.01 – 0.5

  6. A Structural Approach • s-metric ∑ = s ( g ) d i d j ∈ ε i j ( , ) • Properties: – Differentiate graphs with the same degree sequence – Depends only on the connectivity of a given graph not on the generation mechanism – High s(g) is achieved by connecting high degree nodes to each other – Quantify the role of the highly connected hubs

  7. For any degree sequence D, one can construct an s max Graph • The s max graph is the graph having the largest s(g)-value • Its value depends on the “Background Set” of graphs

  8. Impact of Background Sets on the s max Graph • • Among graphs in G(D) (simple, connected) – Deterministic way to generate – Order all potential links (i, j) according to their weight – Among Acyclic graphs (trees) • From high degree node to low degree node ⇒ We will use the normalized metric: S(g) = s(g) / s max

  9. (e) Graph Degree (a) HSFnet Node Rank 1 10 0 10 (d) HOTnet 1 2 10 10 S(g)=0.98 Node Degree Link / Router Speed (Gbps) 5.0 – 10.0 0.05 – 0.1 50 – 100 0.5 – 0.1 0.01 – 0.05 1.0 – 5.0 10 – 50 0.1 – 0.5 0.5 – 1.0 0.005 – 0.01 5 – 10 0.05 – 0.1 0.1 – 0.5 0.001 – 0.05 1 – 5 0.01 – 0.5 S(g)=0.39

  10. Graph diversity and Perf(g) vs. s(g) HOTnet S(g) = 0.3952 P(g) = 2.93 x 10 11 Performance Perf(g) 10 11 Random "S=1“ S(g) = 0.8098 S(g)=1 P(g) = 9.23 x 10 9 P(g)=6.73 x 10 9 HSFnet S(g) = 0.9791 P(g) = 6.08 x 10 9 10 10 "Poor Design” S(g) = 0.4536 P(g) = 1.02 x 10 10 0.4 0.5 0.6 0.7 0.8 0.9 1.0 s(g) / s max

  11. s max and graph metrics • Node Centrality – In s max graph, high degree nodes have high centrality • Self similarity – s max graph remains s max by trimming, coarse graining, highest connect motif • Graph likelihood – s max has highest likelihood to generate by GRG • Conjecture: – s max graphs are largely unique in terms of their structure

  12. s-metric and Degree Correlations • Assume an underlying probabilistic graph model • Degree correlation between two adjacent vertices k, k’ is defined as where • The s- metric is related to the degree correlation:

  13. s-metric and Assortativity r(g) • A notion of degree correlation – Assortative mixing: a preference for high-degree vertices to attach to other high-degree vertices – Disassortative mixing: the converse • Definition [Newman]: [ ] ∈ − r 1 , 1 • r>0, assortatitive, social networks • r<0, disassortitive, internet, biology networks

  14. Graph diversity and r(g) vs. s(g) HOTnet Performance Perf(g) S(g) = 0.3952 P(g) = 2.93 x 10 11 11 10 HSFnet S(g) = 0.9791 Perf = 2.93X10e11 P(g) = 6.08 x 10 9 S(g)=0.39 10 10 r = -0.4815, 0.4 0.5 0.6 0.7 0.8 0.9 1.0 s(g) / s max Both graphs have same degree distribution and very similar assortativity!!! Perf = 6.06X10e9 S(g)=0.98 r = -0.4283,

  15. Assortativity r(g) • For a given graph, assortativity is: • Normalization term s max of unconstrained graph • Centering term Center of unconstrained graph • r=1: all the nodes connect to themselves • r=-1: depends on the degree sequence • Background set is the unconstrained graph!

  16. simple experiment • generate multiple trees by adding a node k to an existing node j, with probability Π (j) ∝ (d j ) p ⇔ linear preferential attachment – p=1 ⇔ uniform attachment – p=0 – p →∞ ⇔ attach to max degree node (result = a star) – p → - ∞ ⇔ attach to min degree node (result = a chain) • each trial results in a tree having – its own degree sequence D, s-value, CV(D) – its own s min and s max values (from D), – its own r min and r max values (from D)

  17. for n=100 CV(chain) = 0.0711 for n=100 CV(star) = 4.9495 10000 smax 9000 smin generated 50 p = 4 8000 trees of size p = 3 n=100 using 7000 p = 2 each of the p = 1.5 6000 following p = 1 5000 attachment p = 0.5 exponents: 4000 p = 0 p = -1 3000 p = -2 2000 p = -3 p = -4 1000 0 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 CV(D)

  18. for n=100 for n=100 CV(chain) = 0.0711 CV(star) = 4.9495 0.6 0.4 rmax generated 50 p = 4 trees of size rmin 0.2 p = 3 n=100 using p = 2 0 each of the p = 1.5 following p = 1 -0.2 attachment p = 0.5 exponents: -0.4 p = 0 p = -1 -0.6 p = -2 p = -3 -0.8 p = -4 -1 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 CV(D)

  19. p = 2 p = 1.5 S vs. CV Assortativity vs. CV p = 1 p = 0.5 p = 0 p = -1 p = -2 10000 0.6 smax p = -3 smin 9000 0.4 p = -4 8000 0.2 7000 0 6000 5000 -0.2 4000 -0.4 3000 -0.6 2000 -0.8 1000 0 -1 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 CV(degree) CV(degree)

  20. p = 2 p = 1.5 S vs. CV Assortativity vs. CV p = 1 p = 0.5 p = 0 5 p = -1 x 10 10 0.6 p = -2 9 0.4 p = -3 8 p = -4 0.2 7 0 6 5 -0.2 4 -0.4 3 -0.6 2 -0.8 1 0 -1 0 2 4 6 8 10 12 14 16 0 2 4 6 8 10 12 14 16

  21. p = 2 S vs. CV Assortativity vs. CV p = 1.5 p = 1 p = 0.5 p = 0 p = -1 1 p = -2 12000 0.8 p = -3 p = -4 0.6 10000 0.4 8000 0.2 0 6000 -0.2 4000 -0.4 -0.6 2000 -0.8 0 -1 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5

  22. p = 2 p = 1.5 p = 1 p = 0.5 1 4 x 10 p = 0 2.5 0.8 p = -1 p = -2 0.6 2 p = -3 0.4 p = -4 0.2 1.5 0 -0.2 1 -0.4 0.5 -0.6 -0.8 0 -1 0 0.5 1 1.5 2 2.5 3 3.5 4 0 0.5 1 1.5 2 2.5 3 3.5 4

  23. Conclusions • The set G(D) of graphs g with fixed scaling degree D can be extremely diverse • s-metric can highlight the difference of the graphs in G(D). • s -metric has a rich connection to self-similarity, likelihood, betweeness and assortativity • the nature of this background set can have serious implications for its interpretation • These issues apply to metrics other than simple degree sequence (e.g., two-point degree correlations) Towards a Theory of Scale-Free Graphs: Definition, Properties, and Implications Lun Li, David Alderson, John C. Doyle, Walter Willinger Internet Mathematics . In Press. (2006)

Recommend


More recommend