CS 5220: Graph Partitioning David Bindel 2017-11-07 1 Reminder: - PowerPoint PPT Presentation

CS 5220: Graph Partitioning David Bindel 2017-11-07 1

Reminder: Sparsity and partitioning 1 2 3 4 5 Matrix Graph Want to partition sparse graphs so that • Subgraphs are same size (load balance) • Cut size is minimal (minimize communication) Uses: parallel sparse matvec, nested dissection solves, ... 2 A =

A common theme Common idea: partition static data (or networked things): • Physical network design (telephone layout, VLSI layout) • Sparse matvec • Preconditioners for PDE solvers • Sparse Gaussian elimination • Data clustering • Image segmentation Goal: Keep chunks big, minimize the “surface area” between 3

Graph partitioning We want to partition G into k pieces such that • Node weights are balanced across partitions. • Weight of cut edges is minimized. 4 Given: G = ( V , E ) , possibly with weights and coordinates. Important special case: k = 2.

Graph partitioning: Vertex separator 5

Graph partitioning: Edge separator 6

Node to edge and back again Can convert between node and edge separators • Node to edge: cut all edges from separator to one side • Edge to node: remove nodes on one side of cut edges Fine if graph is degree bounded (e.g. near-neighbor meshes). Optimal vertex/edge separators very different for social networks! 7

Cost How many partitionings are there? If n is even, n Finding the optimal one is NP-complete. We need heuristics! 8 ( ) n ! (( n / 2 )!) 2 ≈ 2 n √ = 2 / ( π n ) . n / 2

Partitioning with coordinates • Lots of partitioning problems from “nice” meshes • Planar meshes (maybe with regularity condition) (Tarjan, Lipton; Miller, Teng, Thurston, Vavasis) • Edges link nearby vertices • Get useful information from vertex density • Ignore edges (but can use them in later refinement) 9 • k -ply meshes (works for d > 2) ⇒ partition with O ( n 1 − 1 / d ) edge cuts • Nice enough =

Recursive coordinate bisection Idea: Cut with hyperplane parallel to a coordinate axis. • Pro: Fast and simple • Con: Not always great quality 10

Inertial bisection x i x with normal n . i n Idea: Optimize cutting hyperplane based on vertex density x 11 n n ∑ ¯ x = 1 i = 1 ¯ r i = x i − ¯ ∑ [ ] I = ∥ r i ∥ 2 I − r i r T i = 1 Let ( λ n , n ) be the minimal eigenpair for the inertia tensor I , and choose the hyperplane through ¯

Inertial bisection • Pro: Still simple, more flexible than coordinate planes • Con: Still restricted to hyperplanes 12

Random circles (Gilbert, Miller, Teng) • Stereographic projection • Find centerpoint (any plane is an even partition) In practice, use an approximation. • Conformally map sphere, moving centerpoint to origin • Choose great circle (at random) • Undo stereographic projection • Convert circle to separator May choose best of several random great circles. 13

Coordinate-free methods • Don’t always have natural coordinates • Example: the web graph • Can sometimes add coordinates (metric embedding) • So use edge information for geometry! 14

Breadth-first search • Pick a start vertex v 0 • Might start from several different vertices • Use BFS to label nodes by distance from v 0 • We’ve seen this before – remember RCM? • Could use a different order – minimize edge cuts locally (Karypis, Kumar) • Partition by distance from v 0 15

Spectral partitioning 4 subject to the even partition requirement i But this is NP hard, so we need a trick. 16 Label vertex i with x i = ± 1. We want to minimize ∑ edges cut = 1 ( x i − x j ) 2 ( i , j ) ∈ E ∑ x i = 0 .

Spectral partitioning 4 x T Lx Write Laplacian: 17 4 ( x i − x j ) 2 = 1 4 ∥ Cx ∥ 2 = 1 ∑ edges cut = 1 ( i , j ) ∈ E where C is the incidence matrix and L = C T C is the graph   1 , e j = ( i , k ) d ( i ) , i = j       C ij = L ij = − 1 , e j = ( k , i ) − 1 , i ̸ = j , ( i , j ) ∈ E ,     0 , otherwise , 0 , otherwise .   Note that Ce = 0 (so Le = 0), e = ( 1 , 1 , 1 , . . . , 1 ) T .

Spectral partitioning corresponding eigenvector x , also called the Fiedler vector . Partition according to sign of x i . How to approximate x ? Use a Krylov subspace method (Lanczos)! Expensive, but gives high-quality partitions. 18 Now consider the relaxed problem with x ∈ R n : minimize x T Lx s.t. x T e = 0 and x T x = 1 . Equivalent to finding the second-smallest eigenvalue λ 2 and

Spectral partitioning 19

Spectral coordinates Alternate view: define a coordinate system with the first d non-trivial Laplacian eigenvectors. • Spectral partitioning = bisection in spectral coordinates • Can cluster in other ways as well (e.g. k -means) 20

Refinement by swapping D is external - internal edge costs: Cut size: 5 21 Cut size: 4 Gain from swapping ( a , b ) is D ( a ) + D ( b ) − 2 w ( a , b ) , where ∑ w ( a , b ′ ) − ∑ w ( a , a ′ ) D ( a ) = a ′ ∈ A , a ′ ̸ = a b ′ ∈ B ∑ ∑ w ( b , a ′ ) − w ( b , b ′ ) D ( b ) = a ′ ∈ A b ′ ∈ B , b ′ ̸ = b

Greedy refinement Cut size: 5 Cut size: 4 • Purely greedy strategy: until no positive gain • Choose swap with most gain • Update D in neighborhood of swap; update gains • Local minima are a problem. 22 Start with a partition V = A ∪ B and refine. • gain ( a , b ) = D ( a ) + D ( b ) − 2 w ( a , b )

Kernighan-Lin In one sweep: While no vertices marked Mark a and b (but don’t swap) Further improvements (Karypis, Kumar): only consider vertices on boundary, don’t complete full sweep. 23 Choose ( a , b ) with greatest gain Update D ( v ) for all unmarked v as if ( a , b ) were swapped Find j such that swaps 1 , . . . , j yield maximal gain Apply swaps 1 , . . . , j Usually converges in a few (2-6) sweeps. Each sweep is O ( | V | 3 ) . Can be improved to O ( | E | ) (Fiduccia, Mattheyses).

Multilevel ideas Basic idea (same will work in other contexts): • Coarsen • Solve coarse problem • Interpolate (and possibly refine) May apply recursively. 24

Maximal matching One idea for coarsening: maximal matchings • Maximal : cannot add edges and remain matching. • Constructed by an obvious greedy algorithm. • Maximal matchings are non-unique; some may be preferable to others (e.g. choose heavy edges first). 25 • Matching of G = ( V , E ) is E m ⊂ E with no common vertices.

Coarsening via maximal matching • Collapse nodes connected in matching into coarse nodes • Add all edge weights between connected coarse nodes 26

Software All these use some flavor(s) of multilevel: • METIS/ParMETIS (Kapyris) • PARTY (U. Paderborn) • Chaco (Sandia) • Scotch (INRIA) • Jostle (now commercialized) • Zoltan (Sandia) 27

Graph partitioning: Is this it? Consider partitioning just for sparse matvec: • Should we minimize max communication volume? • Looked at communication volume – what about latencies? Some go beyond graph partitioning (e.g. hypergraph in Zoltan). 28 • Edge cuts ̸ = communication volume

Graph partitioning: Is this it? Additional work on: • Partitioning power law graphs • Covering sets with small overlaps Also: Classes of graphs with no small cuts (expanders) 29

Graph partitioning: Is this it? Recall: partitioning for matvec and preconditioner • Block Jacobi (or Schwarz) – relax on each partition • Want to consider edge cuts and physics • E.g. consider edges = beams • Cutting a stiff beam worse than a flexible beam? • Doesn’t show up from just the topology • Multiple ways to deal with this • Encode physics via edge weights? • Partition geometrically? • Tradeoffs are why we need to be informed users 30

Graph partitioning: Is this it? So far, considered problems with static interactions • What about particle simulations? • Or what about tree searches? • Or what about...? Next time: more general load balancing issues 31

CS 5220: Graph Partitioning David Bindel 2017-11-07 1 Reminder: - PowerPoint PPT Presentation

CS 5220: Graph Partitioning David Bindel 2017-11-07 1 Reminder: Sparsity and partitioning 1 2 3 4 5 Matrix Graph Want to partition sparse graphs so that Subgraphs are same size (load balance) Cut size is minimal (minimize

Lecture 1: Introduction to CS 5220 David Bindel 24 Aug 2011 CS 5220: Applications of Parallel

CS 5220: Introduction David Bindel 2017-08-22 1 CS 5220: Applications of Parallel Computers

Partitioning and Divide-and- Conquer Strategies Partitioning Strategies Partitioning simply

Partitioning Introduction to Partitioning Mahapatra-Texas A&M-Spring02 1 System

Partitioning Problem and Usage Lecture 8 CSCI 4974/6971 26 Sep 2016 1 / 14 Todays Biz 1.

Graph Partitioning for Scalable Distributed Graph Computations Aydn Bulu Kamesh

Partitioning under the hood in MySQL 5.5 Mattias Jonsson, Partitioning developer Mikael

1 1 Slide 5 Slide 6 Partitioning and Load Balancing Partitioning Goals Assignment of

Investigating hypergraph-partitioning-based sparse matrix partitioning methods Bora U car

Prioritized Restreaming Algorithms for Balanced Graph Partitioning Amel Awadelkarim

Lecture 19: Graph Partitioning David Bindel 3 Nov 2011 Logistics Please finish your project

GRAPH MINING AND GRAPH KERNELS Part I: Graph Mining Karsten Borgwardt^ and Xifeng Yan*

Co-clustering documents and words using Bipartite Spectral Graph Partitioning Inderjit S. Dhillon

GRAPH MINING AND GRAPH KERNELS Part II: Graph Kernels Karsten Borgwardt^ and Xifeng Yan*

Background MapReduce Model SCOPE Language and Cosmos system Advanced partitioning

Power grid partitioning Data-Driven Partitioning of Power Networks Via Koopman Mode

Minimum Spanning Tree Minimum spanning tree. Given a connected graph G = (V, E) with real -valued

Prologs Cut Operator Dr. Mattox Beckman University of Illinois at Urbana-Champaign Department

Node Multiway Cut and Subset Feedback Vertex Set on Graphs of Bounded Mim-width Benjamin

Cut per region Marc Verderi GEANT4 collaboration meeting 01/ 10/ 2002 Introduction Cut here

Today. Types of graphs. Complete Graphs. Trees. Hypercubes. Complete Graph. K n complete graph

Two domination parameters in graphs Guangjun Xu Department of Mathematics and Statistics The

Reasoning about connectivity without paths Alberto Casagrande and Eugenio G. Omodeo Dip.

Graph Theory: Introduction Graph Theory: Introduction Pallab Dasgupta Dept. of CSE, IIT

CS 5220: Graph Partitioning David Bindel 2017-11-07 1 Reminder: - PowerPoint PPT Presentation

CS 5220: Graph Partitioning David Bindel 2017-11-07 1 Reminder: Sparsity and partitioning 1 2 3 4 5 Matrix Graph Want to partition sparse graphs so that Subgraphs are same size (load balance) Cut size is minimal (minimize

Lecture 1: Introduction to CS 5220 David Bindel 24 Aug 2011 CS 5220: Applications of Parallel

CS 5220: Introduction David Bindel 2017-08-22 1 CS 5220: Applications of Parallel Computers

Partitioning and Divide-and- Conquer Strategies Partitioning Strategies Partitioning simply

Partitioning Introduction to Partitioning Mahapatra-Texas A&amp;M-Spring02 1 System

Partitioning Problem and Usage Lecture 8 CSCI 4974/6971 26 Sep 2016 1 / 14 Todays Biz 1.

Graph Partitioning for Scalable Distributed Graph Computations Aydn Bulu Kamesh

Partitioning under the hood in MySQL 5.5 Mattias Jonsson, Partitioning developer Mikael

1 1 Slide 5 Slide 6 Partitioning and Load Balancing Partitioning Goals Assignment of

Investigating hypergraph-partitioning-based sparse matrix partitioning methods Bora U car

Prioritized Restreaming Algorithms for Balanced Graph Partitioning Amel Awadelkarim

Lecture 19: Graph Partitioning David Bindel 3 Nov 2011 Logistics Please finish your project

GRAPH MINING AND GRAPH KERNELS Part I: Graph Mining Karsten Borgwardt^ and Xifeng Yan*

Co-clustering documents and words using Bipartite Spectral Graph Partitioning Inderjit S. Dhillon

GRAPH MINING AND GRAPH KERNELS Part II: Graph Kernels Karsten Borgwardt^ and Xifeng Yan*

Background MapReduce Model SCOPE Language and Cosmos system Advanced partitioning

Power grid partitioning Data-Driven Partitioning of Power Networks Via Koopman Mode

Minimum Spanning Tree Minimum spanning tree. Given a connected graph G = (V, E) with real -valued

Prologs Cut Operator Dr. Mattox Beckman University of Illinois at Urbana-Champaign Department

Node Multiway Cut and Subset Feedback Vertex Set on Graphs of Bounded Mim-width Benjamin

Cut per region Marc Verderi GEANT4 collaboration meeting 01/ 10/ 2002 Introduction Cut here

Today. Types of graphs. Complete Graphs. Trees. Hypercubes. Complete Graph. K n complete graph

Two domination parameters in graphs Guangjun Xu Department of Mathematics and Statistics The

Reasoning about connectivity without paths Alberto Casagrande and Eugenio G. Omodeo Dip.

Graph Theory: Introduction Graph Theory: Introduction Pallab Dasgupta Dept. of CSE, IIT

Partitioning Introduction to Partitioning Mahapatra-Texas A&M-Spring02 1 System