discrete mathematics and its applications
play

Discrete Mathematics and Its Applications Lecture 7: Graphs: - PowerPoint PPT Presentation

Discrete Mathematics and Its Applications Lecture 7: Graphs: Proximity MING GAO DaSE@ECNU (for course related communications) mgao@dase.ecnu.edu.cn Jan. 6, 2019 Outline Community Structures 1 Node Proximity 2 Simple Approaches


  1. Discrete Mathematics and Its Applications Lecture 7: Graphs: Proximity MING GAO DaSE@ECNU (for course related communications) mgao@dase.ecnu.edu.cn Jan. 6, 2019

  2. Outline Community Structures 1 Node Proximity 2 Simple Approaches Graph-theoretic Approaches SimRank Random Walk based Approaches MING GAO (DaSE@ECNU) Discrete Mathematics and Its Applications Jan. 6, 2019 2 / 24

  3. Community Structures Community structures Definition Community structure indicates that the network divides naturally in- to groups of nodes with dense connections internally and sparser connections between groups.

  4. Community Structures Community structures Definition Community structure indicates that the network divides naturally in- to groups of nodes with dense connections internally and sparser connections between groups. Global community structures: clustering-based approach, spectral clustering, modularity-based approach, etc.

  5. Community Structures Community structures Definition Community structure indicates that the network divides naturally in- to groups of nodes with dense connections internally and sparser connections between groups. Global community structures: clustering-based approach, spectral clustering, modularity-based approach, etc. Local community structures: node-centric community, group-centric community Traditional network: clique, quasi-clique, k -clique, k -core, etc.

  6. Community Structures Community structures Definition Community structure indicates that the network divides naturally in- to groups of nodes with dense connections internally and sparser connections between groups. Global community structures: clustering-based approach, spectral clustering, modularity-based approach, etc. Local community structures: node-centric community, group-centric community Traditional network: clique, quasi-clique, k -clique, k -core, etc. Bipartite network: bi-clique, quasi-bi-clique, etc.

  7. Community Structures Community structures Definition Community structure indicates that the network divides naturally in- to groups of nodes with dense connections internally and sparser connections between groups. Global community structures: clustering-based approach, spectral clustering, modularity-based approach, etc. Local community structures: node-centric community, group-centric community Traditional network: clique, quasi-clique, k -clique, k -core, etc. Bipartite network: bi-clique, quasi-bi-clique, etc. Signed network: antagonistic community, quasi-antagonistic community, etc. MING GAO (DaSE@ECNU) Discrete Mathematics and Its Applications Jan. 6, 2019 3 / 24

  8. Community Structures Global community structures Goal Partition nodes of a network into disjoint sets.

  9. Community Structures Global community structures Goal Partition nodes of a network into disjoint sets.

  10. Community Structures Global community structures Goal Partition nodes of a network into disjoint sets. Clustering based on vertex similarity

  11. Community Structures Global community structures Goal Partition nodes of a network into disjoint sets. Clustering based on vertex similarity Latent space models

  12. Community Structures Global community structures Goal Partition nodes of a network into disjoint sets. Clustering based on vertex similarity Latent space models Spectral clustering

  13. Community Structures Global community structures Goal Partition nodes of a network into disjoint sets. Clustering based on vertex similarity Latent space models Spectral clustering Modularity maximization

  14. Community Structures Global community structures Goal Partition nodes of a network into disjoint sets. Clustering based on vertex similarity Latent space models Spectral clustering Modularity maximization In the study of complex networks, a network is said to have commu- nity structure if the nodes of the network can be easily grouped into (potentially overlapping) sets of nodes such that each set of nodes is densely connected internally. MING GAO (DaSE@ECNU) Discrete Mathematics and Its Applications Jan. 6, 2019 4 / 24

  15. Community Structures Clustering based on vertex similarity K-means v 1 , · · · , v n are vertices of a graph;

  16. Community Structures Clustering based on vertex similarity K-means v 1 , · · · , v n are vertices of a graph; Each vertex v i will be assigned to one and only one cluster;

  17. Community Structures Clustering based on vertex similarity K-means v 1 , · · · , v n are vertices of a graph; Each vertex v i will be assigned to one and only one cluster; C ( i ) denotes cluster number for vertex v i ;

  18. Community Structures Clustering based on vertex similarity K-means v 1 , · · · , v n are vertices of a graph; Each vertex v i will be assigned to one and only one cluster; C ( i ) denotes cluster number for vertex v i ; Similarity measure or dissimilarity measure: Euclidean distance metric or Jaccard coefficient;

  19. Community Structures Clustering based on vertex similarity K-means v 1 , · · · , v n are vertices of a graph; Each vertex v i will be assigned to one and only one cluster; C ( i ) denotes cluster number for vertex v i ; Similarity measure or dissimilarity measure: Euclidean distance metric or Jaccard coefficient; K-means minimizes within-cluster point scatter: K W ( C ) = 1 � � � � x i − x j � 2 2 k =1 C ( i )= k C ( j )= k K � � � x i − m k � 2 , = N k k =1 C ( i )= k where N k is the number of vertices in k − th cluster MING GAO (DaSE@ECNU) Discrete Mathematics and Its Applications Jan. 6, 2019 5 / 24

  20. Community Structures K-means example MING GAO (DaSE@ECNU) Discrete Mathematics and Its Applications Jan. 6, 2019 6 / 24

  21. Community Structures Spectral clustering Let L be normalized Laplacian of graph G , the algorithm partitions nodes into two sets ( B 1 , B 2 ) based on the eigenvector v corresponding to the second-smallest eigenvalue of L . Partitioning may be done in various ways:

  22. Community Structures Spectral clustering Let L be normalized Laplacian of graph G , the algorithm partitions nodes into two sets ( B 1 , B 2 ) based on the eigenvector v corresponding to the second-smallest eigenvalue of L . Partitioning may be done in various ways: Assign all nodes whose component in v satisfies certain condition in B 1 , and B 2 otherwise, e.g., larger than median, the sign of each entry of v .

  23. Community Structures Spectral clustering Let L be normalized Laplacian of graph G , the algorithm partitions nodes into two sets ( B 1 , B 2 ) based on the eigenvector v corresponding to the second-smallest eigenvalue of L . Partitioning may be done in various ways: Assign all nodes whose component in v satisfies certain condition in B 1 , and B 2 otherwise, e.g., larger than median, the sign of each entry of v . The algorithm can be used for hierarchical clustering by repeatedly partitioning the subsets in this fashion. MING GAO (DaSE@ECNU) Discrete Mathematics and Its Applications Jan. 6, 2019 7 / 24

  24. Community Structures Modularity Idea Graph has community structure, if it is different from random graph (not expected to have community structure for random graph).

  25. Community Structures Modularity Idea Graph has community structure, if it is different from random graph (not expected to have community structure for random graph). Modularity [Newman 2006]: M = 1 ( A ij − d ( i ) d ( j ) � ) δ ( C i , C j ) . 2 m 2 m i , j where m and C i denote # edges and the i − th community in the graph.

  26. Community Structures Modularity Idea Graph has community structure, if it is different from random graph (not expected to have community structure for random graph). Modularity [Newman 2006]: M = 1 ( A ij − d ( i ) d ( j ) � ) δ ( C i , C j ) . 2 m 2 m i , j where m and C i denote # edges and the i − th community in the graph. Compares the number of edges within a community with the expected such number in a corresponding random graph.

  27. Community Structures Modularity Idea Graph has community structure, if it is different from random graph (not expected to have community structure for random graph). Modularity [Newman 2006]: M = 1 ( A ij − d ( i ) d ( j ) � ) δ ( C i , C j ) . 2 m 2 m i , j where m and C i denote # edges and the i − th community in the graph. Compares the number of edges within a community with the expected such number in a corresponding random graph. It can be used as a measure to evaluate the communities quality. MING GAO (DaSE@ECNU) Discrete Mathematics and Its Applications Jan. 6, 2019 8 / 24

  28. Community Structures Louvain method MING GAO (DaSE@ECNU) Discrete Mathematics and Its Applications Jan. 6, 2019 9 / 24

  29. Community Structures Clique Definition A clique is a subset of vertices of an undirected graph such that its induced subgraph is complete. A maximal clique is a clique that cannot be extended by including one more adjacent vertex. Normally use cliques as a core or a seed to find larger communities.

  30. Community Structures Clique Definition A clique is a subset of vertices of an undirected graph such that its induced subgraph is complete. A maximal clique is a clique that cannot be extended by including one more adjacent vertex. Normally use cliques as a core or a seed to find larger communities. Find out all cliques of size k in a given network (NP-complete)

Recommend


More recommend