Graphs and Linear Measurements Sudipto Guha University of - PowerPoint PPT Presentation

Graphs and Linear Measurements Sudipto Guha University of Pennsylvania (based on joint work with K. Ahn & A. McGregor)

Graphs • One of the fundamental representation models in all Computer Science. • A natural counterpoint to “Big - Vectors”. • Structure is often more easily represented using graphs. • And often defined using graphs.

Linear Measurements • Inner products. – Mostly with (pseudo) random vectors. – Fingerprints. Coding Theory. – Compress(ed)(or)(ive) sensing. – Machine Learning. • (Very) Easily parallelizable.

This Talk : Questions • Is it feasible to devise graph algorithms using linear projections? – Construct witnesses, not just answering yes/no – Approximating the structure of the answer or the value of the answer can be very different • Are there: – Fundamental problems? – Fundamental Algorithmic Techniques? – Fundamental Analysis avenues?

This Talk: Some answers • Ahn, Guha, McGregor – SODA 2012, PODS 2012, manuscripts • A Problem: Sampling from a cut in a graph. • A Technique: Parallel information gathering, sequential use • Analysis Themes: Adaptivity of actions. Linearity. • Many more exist. We need more. • We will not focus on specific models too much.

This Talk: Some answers • Ahn, Guha, McGregor – SODA 2012, PODS 2012, manuscripts • A Problem: Sampling from a cut in a graph. • A Technique: Parallel information gathering, sequential use • Analysis Themes: Adaptivity of actions. Linearity. • Semi-Streaming model m → n • Map-Reduce (with some central processing)

The importance of being linear • Order independent ⇒ Deletions come free ⇒ Obviously incremental ⇒ Obvious applications to dynamic graph algorithms • Suppose a ∃ one pass streaming algorithm then – Sort the data (order independence) – Remove duplicates (deletions/affine-ness) – One way access to hash functions! ⇒ We can assume perfect hash functions ⇒ Algorithm designer only needs to focus on space ⇒ Running times can be improved subsequently (possibly use historical data driven/derived features)

This Talk: Some answers • Ahn, Guha, McGregor – SODA 2012, PODS 2012, manuscripts • A Problem: Sampling from a cut in a graph. • A Technique: Parallel information gathering, sequential use • Analysis Themes: Adaptivity of actions. Linearity. • Semi-Streaming model • Map-Reduce (with some central processing)

A Technique (and problem to go along) • Parallel Information gathering • Yet sequential use • A graph presented one edge at a time • Can we maintain connectivity? • Can we maintain connectivity in O(n) space? • What if edges are now deleted?

Connectivity in O(log n) rounds • Every vertex chooses an edge UAR • Collapse the connected components • Number of surviving sub-components halves • Primitive: Given a vertex choose an edge UAR 1 1 • Primitive’: Given a set of nodes choose an edge UAR 2 2 – the edges have long sailed on by now – we are using the linear projections only 5 5 3 3 ⇒ Given a cut choose an edge UAR ⇒ Note that the cuts are chosen adaptively! 4 4 ⇒ But we can produce O(log n) data structures at once.

Connectivity in O(log n) rounds • Every vertex chooses an edge UAR • Collapse the connected components • Number of surviving sub-components halves • Primitive: Given a vertex choose an edge UAR • Primitive’: Given a set of nodes choose an edge UAR – the edges have long sailed on by now – we are using the linear projections only ⇒ Given a cut choose an edge UAR ⇒ Note that the cuts are chosen adaptively! ⇒ But we can produce O(log n) data structures at once.

Connectivity in O(log n) rounds • Every vertex chooses an edge UAR • Collapse the connected components • Choose O(log n) data structures – Given an arbitrary set, chooses an edge out of it. – Disjoint vertex sets “queried simultaneously” – This is the “sampling from a cut” problem. – We use Ộ(n) space.

Sampling from a Cut • Consider a graph + 1 - • Add orientations 2 - - – Arbitrary but consistent + + - 5 – Number the edges 3 - + - + – “Consider” the vertex-edge adjacancy 4 + 1 0 0 0 0 0 0 0 0 0 -1 0 0 0 1 -1 -1 0 0 0 0 0 0 0 -1 0 0 -1 1 0 0 0 0 0 0 1 0 1 0 0 0 0 0 0 0 0 1 0 -1 0

Sampling from a Cut • Give arbitrary weights + 1 - • Add up weights for a vertex r1 2 - - • Given set, compute the sum + + r5 r6 - 5 3 r9 - + - + 4 + r1 r2 r3 r4 r5 r6 r7 r8 r9 r10 1 0 0 0 0 0 0 0 0 0 - r1 + r5 - r6 - r7 -1 0 0 0 1 -1 -1 0 0 0 0 0 0 0 -1 0 0 -1 1 0 0 0 0 0 0 1 0 1 0 0 r7 - r9 0 0 0 0 0 0 1 0 -1 0

Sampling from a Cut • Reduces to a streaming problem! + 1 • “Stream”= Union of vertex streams - r1 2 • Solutions exist ( l 0 sampling) - - + + r5 r6 • Space is Ộ(1) per vertex - 5 3 r9 - + - + 4 + r1 r2 r3 r4 r5 r6 r7 r8 r9 r10 1 0 0 0 0 0 0 0 0 0 - r1 + r5 - r6 - r7 -1 0 0 0 1 -1 -1 0 0 0 0 0 0 0 -1 0 0 -1 1 0 0 0 0 0 0 1 0 1 0 0 r7 - r9 0 0 0 0 0 0 1 0 -1 0

Recap: l 0 Solutions • Consider known universe [a] of positive integers • Suppose we knew the number of distinct elements x up to powers of 2 • Hash [a] → [0,1] retain values [0,x/a] • Of all v ∈ [a] that hash to [0,x/a] – Maintain count Sufficient to test if all items are equal – Sum – Sum of squares • Return average

Sampling from a Cut • Why is this a fundamental problem? • Lets consider some applications …

Minimum Spanning Trees • Exact computation based on linear projections is provably hard. • Kruskal’s algorithm – add least weighted edge • If the edge weights are integers; it suffices to count the number of components! • (1+ ε ) approximation in 1 pass and Ộ(n) space

Min Cut • (In general) Connectivity answers – Is there an edge across this cut • Suppose we asked how many? Say, find the MinCut. • Karger’s algorithm via uniform sampling. • Easy in insertion model • With deletions, remove k spanning trees – Sequentially (but compute them in 1 pass in parallel) – If cuts were small then we have all the edges! – If cuts are large then? “Layered graphs”

Cut Sparsification • (In general) Connectivity answers – Is there an edge across this cut • Suppose we asked how many? • Goal: Store few edges and estimate each cut to 1 ± ε • Benczur & Karger 1996: sampling • Easy in insertion model • With deletions, remove k spanning trees – Sequentially (but compute them in 1 pass in parallel) – If cuts were small then we have all the edges! – If cuts are large then? “Layered graphs”

Chain of results • Sampling from a cut → Connectivity → MST → MinCut → Cut-Sparsification • Maximum Matching (Dual of Cut-Covering) • Multicut → Correlation Clustering • Spectral Sparsification • Counting number of subgraphs – Replace vertex-edge incidence by subgraph-edge incidences. Otherwise similar idea applies.

Spectral Sparsification Spielman & Srivastava • Conductance, mixing of random walks, clustering • A generalization of cuts. • A vector X with ± 1 entries represent a cut. • X T LX = size of a cut where L=D-A and A is the vertex-vertex • adjacency matrix; D=diagonal matrix of degrees Sparsification: preserve all X T LX where X is a vector with ± 1 entries • Spectral sparsification : X is an arbitrary vector. •

Spectral Sparsification • Each edge is a 1 Ohm resistor • Basic sub-problem: – Given s,t estimate the effective resistance. • (small space, 1 pass, using linear sketches) ? • Yes: sample e w.p. proportional to r e and give weight 1/r e • 1 ≤ r e c e ≤ n 2/3 for simple unweighted graphs. • And this is tight! • Subquadratic space algorithm

Conclusion • Examples of graph problems using linear projections • Need more problems & connections • Showed ∃ results; lots of places for improvements

Graphs and Linear Measurements Sudipto Guha University of - PowerPoint PPT Presentation

Graphs and Linear Measurements Sudipto Guha University of Pennsylvania (based on joint work with K. Ahn & A. McGregor) Graphs One of the fundamental representation models in all Computer Science. A natural counterpoint to Big -

Graphs () Graphs () Graphs Graphs Graphs are collections of nodes

Weighted graphs Weighted graphs Weighted graphs Weighted graphs Graphs with numbers, called

CS 7616 Pattern Recognition Linear, Linear, Linear Aaron Bobick School of Interactive

Week 4 Kullmann Graphs and directed graphs Elementary Graph Algorithms Representing graphs

Graphs Graphs Examples Definitions Implementation/Representation of graphs Graphs

On some classes of Deza graphs Deza graphs without 3-cocliques Line graphs V.V. Kabanov 1 Deza

Microsticky Microsticky Measurements by Measurements by Measurements by Microsticky

Reeb Graphs and Piecewise Linear Functions Koen Klaren Eindhoven University of Technology

Searching on Graphs November 16, 2016 CMPE 250 Graphs- Searching on Graphs November 16, 2016 1

CS200: Graphs Prichard Ch. 14 Rosen Ch. 10 CS200 - Graphs 1 Graphs A collection of What can

Graphs Graphs Simple graphs Algorithms Depth-first search Breadth-first search

Today. Types of graphs. Today. Types of graphs. Complete Graphs. Trees. Hypercubes. Today.

Measurements of BB Angular Correlations Measurements of BB Angular Correlations Measurements of

Graphics 2014 Linear Algebra II Linear Maps & Matrices Linear Maps & Matrices CORE

Eighth Grade Common Core Lesson 8.F.A.2 Study the graphs of two linear equations. Which line has

Examples of Obstructions to Apex Graphs, Edge-Apex Graphs, and Contraction-Apex Graphs

23. Cutting planes and branch & bound Algorithms for solving MIPs Cutting plane methods

Intersection Cuts from Bilinear Disjunctions Matteo Fischetti, University of Padova (joint work

Create PowerPoint Audio and Video V0B August 2020 V0B V0B Schield: 2020 PPTX Create Audio-Video

Multiparticle Cuts of Scattering Amplitudes Pierpaolo Mastrolia Institute of Theoretical Physics,

Kernel Normalized Cut: a Theoretical Revisit * Yoshikazu Terada 1,3 & Michio Yamamoto 2,3 1

Global State and Gossip CS 240: Computing Systems and Concurrency Lecture 6 Marco Canini

Bipolar Junction Transistor (BJT) Lecture notes: Sec. 3 Sedra & Smith (6 th Ed): Sec.

Measuring Hadronic Showers in a Totally Active Dual-Readout Crystal Calorimeter A simulation

Graphs and Linear Measurements Sudipto Guha University of - PowerPoint PPT Presentation

Graphs and Linear Measurements Sudipto Guha University of Pennsylvania (based on joint work with K. Ahn & A. McGregor) Graphs One of the fundamental representation models in all Computer Science. A natural counterpoint to Big -

Graphs () Graphs () Graphs Graphs Graphs are collections of nodes

Weighted graphs Weighted graphs Weighted graphs Weighted graphs Graphs with numbers, called

CS 7616 Pattern Recognition Linear, Linear, Linear Aaron Bobick School of Interactive

Week 4 Kullmann Graphs and directed graphs Elementary Graph Algorithms Representing graphs

Graphs Graphs Examples Definitions Implementation/Representation of graphs Graphs

On some classes of Deza graphs Deza graphs without 3-cocliques Line graphs V.V. Kabanov 1 Deza

Microsticky Microsticky Measurements by Measurements by Measurements by Microsticky

Reeb Graphs and Piecewise Linear Functions Koen Klaren Eindhoven University of Technology

Searching on Graphs November 16, 2016 CMPE 250 Graphs- Searching on Graphs November 16, 2016 1

CS200: Graphs Prichard Ch. 14 Rosen Ch. 10 CS200 - Graphs 1 Graphs A collection of What can

Graphs Graphs Simple graphs Algorithms Depth-first search Breadth-first search

Today. Types of graphs. Today. Types of graphs. Complete Graphs. Trees. Hypercubes. Today.

Measurements of BB Angular Correlations Measurements of BB Angular Correlations Measurements of

Graphics 2014 Linear Algebra II Linear Maps &amp; Matrices Linear Maps &amp; Matrices CORE

Eighth Grade Common Core Lesson 8.F.A.2 Study the graphs of two linear equations. Which line has

Examples of Obstructions to Apex Graphs, Edge-Apex Graphs, and Contraction-Apex Graphs

23. Cutting planes and branch &amp; bound Algorithms for solving MIPs Cutting plane methods

Intersection Cuts from Bilinear Disjunctions Matteo Fischetti, University of Padova (joint work

Create PowerPoint Audio and Video V0B August 2020 V0B V0B Schield: 2020 PPTX Create Audio-Video

Multiparticle Cuts of Scattering Amplitudes Pierpaolo Mastrolia Institute of Theoretical Physics,

Kernel Normalized Cut: a Theoretical Revisit * Yoshikazu Terada 1,3 &amp; Michio Yamamoto 2,3 1

Global State and Gossip CS 240: Computing Systems and Concurrency Lecture 6 Marco Canini

Bipolar Junction Transistor (BJT) Lecture notes: Sec. 3 Sedra &amp; Smith (6 th Ed): Sec.

Measuring Hadronic Showers in a Totally Active Dual-Readout Crystal Calorimeter A simulation

Graphics 2014 Linear Algebra II Linear Maps & Matrices Linear Maps & Matrices CORE

23. Cutting planes and branch & bound Algorithms for solving MIPs Cutting plane methods

Kernel Normalized Cut: a Theoretical Revisit * Yoshikazu Terada 1,3 & Michio Yamamoto 2,3 1

Bipolar Junction Transistor (BJT) Lecture notes: Sec. 3 Sedra & Smith (6 th Ed): Sec.