Graph Ordering Lecture 16 CSCI 4974/6971 27 Oct 2016 1 / 12
Today’s Biz 1. Reminders 2. Review 3. Graph vertex ordering 2 / 12
Reminders ◮ Project Update Presentation: In class November 3rd ◮ Assignment 4: due date TBD (early November, probably 10th) ◮ Setting up and running on CCI clusters ◮ Assignment 5: due date TBD (before Thanksgiving break, probably 22nd) ◮ Assignment 6: due date TBD (early December) ◮ Office hours: Tuesday & Wednesday 14:00-16:00 Lally 317 ◮ Or email me for other availability 3 / 12
Today’s Biz 1. Reminders 2. Review 3. Graph vertex ordering 4 / 12
Quick Review Distributed Graph Processing 1. Can’t store full graph on every node 2. Efficiently store local information - owned vertices / ghost vertices ◮ Arrays for days - hashing is slow, not memory optimal ◮ Relabel vertex identifiers 3. Vertex block, edge block, random, other partitioning strategies 4. Partitioning strategy important for performance!!! 5 / 12
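A minimal C++ sketch of the relabeling idea from the review, assuming vertex-block partitioning; all names here are illustrative, not from the course code. Owned vertices map to local IDs by arithmetic (no lookup), ghosts get dense IDs as they are encountered, and a hash table is used only at construction time, after which traversal touches only flat arrays.

```cpp
// Sketch (illustrative names): relabel global vertex IDs to dense local IDs
// so per-vertex data lives in flat arrays rather than hash tables.
// Assumes this rank owns global IDs in [offset, offset + n_owned).
#include <cstdint>
#include <unordered_map>
#include <vector>

struct LocalIds {
  int64_t offset, n_owned;                 // owned range of global IDs
  std::vector<int64_t> ghost_to_global;    // local ghost ID -> global ID
  std::unordered_map<int64_t, int64_t> ghost_lookup;  // build-time only

  // Owned vertices map with arithmetic; ghosts get the next dense slot
  // in [n_owned, n_owned + n_ghost).
  int64_t to_local(int64_t gid) {
    if (gid >= offset && gid < offset + n_owned)
      return gid - offset;                 // owned: no lookup needed
    auto [it, added] = ghost_lookup.try_emplace(
        gid, n_owned + (int64_t)ghost_to_global.size());
    if (added) ghost_to_global.push_back(gid);
    return it->second;
  }
};
// After construction, adjacency lists store local IDs, so traversal uses
// only contiguous arrays; the hash table can be discarded.
```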
Today’s Biz 1. Reminders 2. Review 3. Graph vertex ordering 6 / 12
Vertex Ordering ◮ Idea: improve cache utilization by reorganizing the adjacency list ◮ The idea comes from linear solvers ◮ There, matrices are reordered for fill reduction, etc. ◮ Efficient cache performance is a secondary concern ◮ Many, many methods, but what should we optimize for? 7 / 12
Sparse Matrices and Optimized Parallel Implementations Slides from Stan Tomov, University of Tennessee 8 / 12
Part III Reordering algorithms and Parallelization Slide 26 / 34
Reorder to preserve locality [figure: example graph with nodes labeled 100, 115, 332, 10, 201, 35] e.g. Cuthill-McKee Ordering: start from an arbitrary node, say '10', and reorder * '10' becomes 0 * its neighbors are ordered next, becoming 1, 2, 3, 4, 5; denote this as level 1 * neighbors of level-1 nodes are consecutively reordered next, and so on until the end Slide 27 / 34
Cuthill-McKee Ordering • Reversing the ordering (RCM) yields an ordering that is better for sparse LU • Reduces matrix bandwidth (see example) • Improves cache performance • Can be used as a partitioner (for parallelization) [figure: matrix split into row blocks p1, p2, p3, p4], but in general does not reduce edge cut; see the sketch below Slide 28 / 34
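A minimal C++ sketch of the algorithm as just described, assuming a connected graph stored in illustrative CSR arrays (`offsets`, `adj`). Classic Cuthill-McKee visits vertices in BFS order, taking each vertex's unvisited neighbors in ascending-degree order; RCM simply reverses the final permutation.

```cpp
// Sketch of (Reverse) Cuthill-McKee on a CSR graph (connected graph assumed).
#include <algorithm>
#include <queue>
#include <vector>

std::vector<int> rcm_order(const std::vector<int>& offsets,
                           const std::vector<int>& adj, int start) {
  int n = (int)offsets.size() - 1;
  std::vector<int> perm;                  // perm[i] = old ID placed at position i
  std::vector<bool> visited(n, false);
  std::queue<int> q;
  q.push(start); visited[start] = true;
  while (!q.empty()) {
    int v = q.front(); q.pop();
    perm.push_back(v);
    // Gather unvisited neighbors and order them by increasing degree.
    std::vector<int> nbrs;
    for (int e = offsets[v]; e < offsets[v + 1]; ++e)
      if (!visited[adj[e]]) { visited[adj[e]] = true; nbrs.push_back(adj[e]); }
    std::sort(nbrs.begin(), nbrs.end(), [&](int a, int b) {
      return offsets[a + 1] - offsets[a] < offsets[b + 1] - offsets[b];
    });
    for (int u : nbrs) q.push(u);
  }
  std::reverse(perm.begin(), perm.end()); // the "reverse" in RCM
  return perm;
}
```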
Self-Avoiding Walks (SAW) • Enumeration of mesh elements through 'consecutive elements' (sharing a face, edge, vertex, etc.) * similar to space-filling curves, but for unstructured meshes * improves cache reuse * can be used as a partitioner with good load balance, but in general does not reduce edge cut Slide 29 / 34
Graph partitioning • Refer back to Lecture #8, Part II: Mesh Generation and Load Balancing • Can be used for reordering • Metis/ParMetis: – multilevel partitioning – good load balance while minimizing edge cut; a calling sketch follows Slide 30 / 34
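For reference, a sketch of calling METIS 5's k-way partitioner on a CSR graph; the wrapper name is illustrative, weights and error checking are omitted, and the METIS manual should be consulted for the full interface. The resulting part array can also drive a reordering: group vertices by their assigned part.

```cpp
// Sketch: k-way partitioning of a CSR graph with METIS 5 (unit weights,
// default options; error handling omitted).
#include <metis.h>
#include <vector>

std::vector<idx_t> partition(std::vector<idx_t>& xadj,
                             std::vector<idx_t>& adjncy, idx_t nparts) {
  idx_t nvtxs = (idx_t)xadj.size() - 1;
  idx_t ncon = 1;                         // one balance constraint
  idx_t objval;                           // edge cut on return
  std::vector<idx_t> part(nvtxs);
  METIS_PartGraphKway(&nvtxs, &ncon, xadj.data(), adjncy.data(),
                      nullptr, nullptr, nullptr,   // unit vertex/edge weights
                      &nparts, nullptr, nullptr, nullptr,
                      &objval, part.data());
  return part;
}
```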
Parallel Mat-Vec Product • Easiest way: 1D partitioning [figure: row blocks p1, p2, p3, p4] – May lead to load imbalance (why?) – May need a lot of communication for x • Can use any of the just-mentioned techniques • Most promising seem to be spectral and multilevel methods (as in Metis/ParMetis); a local-phase sketch follows Slide 31 / 34
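A sketch of the local phase of a 1D row-partitioned sparse mat-vec, assuming the needed remote entries of x have already been gathered (e.g., via MPI) and placed after the owned entries; that ghost communication is exactly the cost the partitioning techniques above try to reduce. CSR names are illustrative.

```cpp
// Sketch: local phase of 1D row-partitioned y = A*x. Columns are local IDs:
// owned entries of x first, gathered ghost entries appended after them.
#include <vector>

void local_spmv(const std::vector<int>& offsets,   // CSR row pointers, local rows
                const std::vector<int>& cols,      // local column IDs (owned + ghost)
                const std::vector<double>& vals,
                const std::vector<double>& x,      // owned entries, then ghosts
                std::vector<double>& y) {
  for (size_t i = 0; i + 1 < offsets.size(); ++i) {
    double sum = 0.0;
    for (int e = offsets[i]; e < offsets[i + 1]; ++e)
      sum += vals[e] * x[cols[e]];                 // ghost reads hit the gathered tail
    y[i] = sum;
  }
}
```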
Possible optimizations • Block communication – and send only the minimum required from x – e.g., pre-compute blocks of interfaces • Load balance, minimize edge cut – e.g., a good partitioner would do it • Reordering • Take advantage of additional structure (symmetry, bands, etc.) Slide 32 / 34
Comparison Distributed memory implementation (by X. Li, L. Oliker, G. Heber, R. Biswas) – ORIG ordering has large edge cut (interprocessor comm) and poor locality (high number of cache misses) – MeTiS minimizes edge cut, while SAW minimizes cache misses Slide 33 / 34
Matrix Bandwidth ◮ Bandwidth: maximum band size ◮ Maximum distance of a nonzero from the diagonal in any row of the adjacency matrix ◮ In terms of the graph representation: maximum difference between a vertex’s identifier and the identifiers appearing in its neighborhood (sketch below) ◮ Is bandwidth a good measure for irregular sparse matrices? ◮ Does it represent cache utilization? 9 / 12
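A small sketch computing bandwidth from an illustrative CSR representation: under the current labeling, it is simply the maximum |u - v| over all edges (u, v), i.e., the widest distance of any nonzero from the diagonal.

```cpp
// Sketch: graph/matrix bandwidth under the current vertex labeling.
#include <algorithm>
#include <cstdlib>
#include <vector>

int bandwidth(const std::vector<int>& offsets, const std::vector<int>& adj) {
  int bw = 0;
  for (int v = 0; v + 1 < (int)offsets.size(); ++v)
    for (int e = offsets[v]; e < offsets[v + 1]; ++e)
      bw = std::max(bw, std::abs(v - adj[e]));  // distance from the diagonal
  return bw;
}
```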
Other measures ◮ Quantify the gaps in the adjacency list ◮ Bandwidth is difficult to reduce due to high-degree vertices ◮ High-degree vertices will incur multiple cache misses, low-degree vertices ideally only one: we want to account for both ◮ Minimum (linear/logarithmic) gap arrangement problem: ◮ Minimize the sum of (logarithms of) distances between consecutive vertex identifiers in each adjacency list (sketch below) ◮ More representative of cache utilization ◮ To be discussed later: impact on graph compressibility 10 / 12
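A sketch of the logarithmic-gap cost of an ordering, assuming illustrative CSR arrays: sort each adjacency list, then sum log2 of the gaps between consecutive neighbor IDs. This also approximates the bits needed to delta-encode each list, which is the tie-in to graph compressibility mentioned above; variants of the objective also count the gap from the vertex to its first neighbor.

```cpp
// Sketch: logarithmic-gap cost of the current labeling. `adj` is taken by
// value so the per-list sorts do not disturb the caller's copy.
#include <algorithm>
#include <cmath>
#include <vector>

double log_gap_cost(const std::vector<int>& offsets, std::vector<int> adj) {
  double cost = 0.0;
  for (int v = 0; v + 1 < (int)offsets.size(); ++v) {
    std::sort(adj.begin() + offsets[v], adj.begin() + offsets[v + 1]);
    for (int e = offsets[v] + 1; e < offsets[v + 1]; ++e) {
      int gap = std::max(1, adj[e] - adj[e - 1]); // guard duplicate neighbors
      cost += std::log2((double)gap);
    }
  }
  return cost;
}
```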
Today: vertex ordering ◮ Natural order ◮ Random order ◮ BFS order ◮ RCM order ◮ pseudo-RCM order ◮ Impact on execution time across various graphs/algorithms (a sketch for applying an ordering follows) 11 / 12
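Whatever ordering is chosen from the list above, applying it is the same operation. A sketch with illustrative CSR names: rebuild the graph under a permutation perm, where perm[i] is the old vertex placed at new position i (as returned by the RCM sketch earlier).

```cpp
// Sketch: rebuild a CSR graph under a vertex permutation.
#include <vector>

void permute_csr(const std::vector<int>& offsets, const std::vector<int>& adj,
                 const std::vector<int>& perm,
                 std::vector<int>& new_offsets, std::vector<int>& new_adj) {
  int n = (int)perm.size();
  std::vector<int> inv(n);                 // old ID -> new ID
  for (int i = 0; i < n; ++i) inv[perm[i]] = i;
  new_offsets.assign(n + 1, 0);
  new_adj.clear();
  for (int i = 0; i < n; ++i) {            // emit rows in the new order
    int old_v = perm[i];
    for (int e = offsets[old_v]; e < offsets[old_v + 1]; ++e)
      new_adj.push_back(inv[adj[e]]);      // relabel endpoints
    new_offsets[i + 1] = (int)new_adj.size();
  }
}
```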
Distributed Processing Blank code and data available on website (Lecture 15) www.cs.rpi.edu/~slotag/classes/FA16/index.html 12 / 12