Finding Graph Matchings in Data Streams Andrew McGregor, UPenn The - PowerPoint PPT Presentation

Finding Graph Matchings in Data Streams Andrew McGregor, UPenn

The Streaming Model

The Streaming Model • Classic Problem: Median Finding [Munro & Paterson]

The Streaming Model • Classic Problem: Median Finding [Munro & Paterson] • Parameters of the Model: • How much memory? • How many passes? • How much computation time between data elements?

The Streaming Model • Classic Problem: Median Finding [Munro & Paterson] • Parameters of the Model: • How much memory? • How many passes? • How much computation time between data elements? • Statistics, Norms and Histograms…

The Streaming Model • Classic Problem: Median Finding [Munro & Paterson] • Parameters of the Model: • How much memory? • How many passes? • How much computation time between data elements? • Statistics, Norms and Histograms… • What about graph problems?

Graph Streaming • Instance of graph problem G = ( V, E ) • Edges arrive in arbitrary order: e 1 , e 2 , e 3 , …, e m • Memory limit O( n polylog n ) where n = |V| • Spanner Construction, Bipartite Matching, Lower Bounds [Feigenbaum, Kannan, M. , Suri, Zhang ’04 &’05] • “Annotation” Stream Model [Aggarwal, Datar, Rajagopalan, Ruhl ’04, Demetrescu, Finocchi, Ribichini ’05]

Matching • A matching - set of edges with no two edges sharing an end point. • Problems: Find the matching of maximum cardinality (MCM) Find the matching of maximum weight (MWM) • (Non-streamable) Algorithms: Exact polytime algorithm for both [Gabow ’90] Linear-time 1+ ε approx for MCM [Kalantari & Shokoufandeh ’95] Linear-time 3/2+ ε approx for MWM [Drake & Hougardy ’03]

Results • Unweighted Matchings: 1+ ε approximation in constant passes. • Weighted Matchings: 3+2 √ 2 approximation in single pass. 2+ ε approximation in constant passes.

Unweighted Matchings.

An Easy 2 Approximation • Greedy Algorithm: Store an edge if it is not adjacent to stored edge • Construct a maximal matching - 2 Approximation

Augmenting Paths

Augmenting Paths Matching M

Augmenting Paths • Augmenting Path: simple path starting and ending at unmatched nodes such that edges alternate between M and E\M .

Augmenting Paths • Consider augmenting paths defined by taking the symmetric difference between current (maximal) matching and optimum matching. • Let P i be the number of length i augmenting paths � | M | + P i ≥ OPT (1 − 1 /k ) 1 ≤ i ≤ k

Algorithm Outline 1. Find a maximal matching 2. For 1 ≤ i ≤ k : Find a set, S i , of length i augmenting paths 3. Augment current matching with S j where j = argmax S i 4. Repeat from 2 unless S j is small

Projecting to Layered Graphs G

Projecting to Layered Graphs G L(G)

Lemma: If there are P i length i augmenting paths in G then we expect P i / 2(2i) i node disjoint paths in L(G) .

Lemma: If there are P i length i augmenting paths in G then we expect P i / 2(2i) i node disjoint paths in L(G) . Lemma: A maximal set of node disjoint paths in L(G) , is an i+2 approximation to the maximum set of node disjoint paths in L(G) .

Lemma: If there are P i length i augmenting paths in G then we expect P i / 2(2i) i node disjoint paths in L(G) . Lemma: A maximal set of node disjoint paths in L(G) , is an i+2 approximation to the maximum set of node disjoint paths in L(G) . To find a constant fraction of length i augmenting paths P i , create layered graph and greedily find node disjoint paths.

Limiting Backtracking

Limiting Backtracking • Solution: If number of paths being grown falls below threshold δ n then delete and backtrack. Good: Only backtrack a constant number of times Bad: Don’t find a maximal set of node disjoint paths • In a constant number of passes, we find a constant fraction of length i node disjoint paths/augmenting paths.

Weighted Matching.

Single Pass 3+2 √ 2 Approximation

Single Pass 3+2 √ 2 Approximation • At all times we store some matching M

Single Pass 3+2 √ 2 Approximation • At all times we store some matching M • For each edge e : Compute total weight W of edges e 1 , e 2 in M incident to e If w ( e ) > ( 1+ γ ) W then M ← M ∪ {e} \ {e 1 ,e 2 }

Single Pass 3+2 √ 2 Approximation • At all times we store some matching M • For each edge e : Compute total weight W of edges e 1 , e 2 in M incident to e If w ( e ) > ( 1+ γ ) W then M ← M ∪ {e} \ {e 1 ,e 2 } • We say e is “ born ” and “ killed ” e 1 and e 2

Proof (Sketch)

Proof (Sketch) • We say an edge e is a survivor if it is born and was never killed.

Proof (Sketch) • We say an edge e is a survivor if it is born and was never killed. • Let S = all survivors.

Proof (Sketch) • We say an edge e is a survivor if it is born and was never killed. • Let S = all survivors. • For survivor e we define the trail of the dead T(e) to be the transitive closure of edges killed by e .

Proof (Sketch) • We say an edge e is a survivor if it is born and was never killed. • Let S = all survivors. • For survivor e we define the trail of the dead T(e) to be the transitive closure of edges killed by e . • Claim 1: w ( T ( e )) ≤ w ( e ) / γ

Proof (Sketch) • We say an edge e is a survivor if it is born and was never killed. • Let S = all survivors. • For survivor e we define the trail of the dead T(e) to be the transitive closure of edges killed by e . • Claim 1: w ( T ( e )) ≤ w ( e ) / γ • Claim 2: Can charge the weights of edges in OPT such that: • At most (1 + γ ) w ( T ( e )) is charged to T(e) • At most 2(1+ γ ) w ( e ) is charged to e

Proof (Sketch) • We say an edge e is a survivor if it is born and was never killed. • Let S = all survivors. • For survivor e we define the trail of the dead T(e) to be the transitive closure of edges killed by e . • Claim 1: w ( T ( e )) ≤ w ( e ) / γ • Claim 2: Can charge the weights of edges in OPT such that: • At most (1 + γ ) w ( T ( e )) is charged to T(e) • At most 2(1+ γ ) w ( e ) is charged to e • Hence w (OPT) ≤ (1 + γ ) w ( T ( S )) + 2(1+ γ ) w ( S )< (3+2 √ 2) w ( S )

Multi-pass 2+ ε Approximation

Multi-pass 2+ ε Approximation • First pass: find a constant approximate M 1

Multi-pass 2+ ε Approximation • First pass: find a constant approximate M 1 • Subsequent passes: create M i from M i-1 by running the previous algorithm with γ ( ε )

Multi-pass 2+ ε Approximation • First pass: find a constant approximate M 1 • Subsequent passes: create M i from M i-1 by running the previous algorithm with γ ( ε ) • Repeat if | M i |/ | M i-1 |> 1+ κ ( ε )

Multi-pass 2+ ε Approximation • First pass: find a constant approximate M 1 • Subsequent passes: create M i from M i-1 by running the previous algorithm with γ ( ε ) • Repeat if | M i |/ | M i-1 |> 1+ κ ( ε ) • Claim 1: A constant number of passes suffices

Multi-pass 2+ ε Approximation • First pass: find a constant approximate M 1 • Subsequent passes: create M i from M i-1 by running the previous algorithm with γ ( ε ) • Repeat if | M i |/ | M i-1 |> 1+ κ ( ε ) • Claim 1: A constant number of passes suffices • Claim 2: When | M i |/ | M i-1 | ≤ 1+ κ we have a 2+ ε approx.

Conclusions • Unweighted Matchings: 1+ ε approximation in constant passes. • Weighted Matchings: 3+2 √ 2 approximation in single pass. 2+ ε approximation in constant passes.

Thanks.

Finding Graph Matchings in Data Streams Andrew McGregor, UPenn The - PowerPoint PPT Presentation

Finding Graph Matchings in Data Streams Andrew McGregor, UPenn The Streaming Model The Streaming Model Classic Problem: Median Finding [Munro & Paterson] The Streaming Model Classic Problem: Median Finding [Munro & Paterson]

Finding your way in a graph Finding your way in a graph Finding your way in a graph Finding your

Approximate Matchings in Dynamic Graph Streams Sanjeev Khanna University of Pennsylvania Joint

Characterization for perfect matchings Often we are interested in perfect matchings. X Y 1 5

The cavity method for matchings Marc Lelarge INRIA & ENS Cargese 2014 1 MAXIMUM MATCHINGS

From Matchings to Independent Sets Vadim Lozin Mathematics Institute University of Warwick From

WITH C++ Prof. Amr Goneid AUC Part 9. Streams & Files Prof. amr Goneid, AUC 1 Streams

TREE-TRANSVERSE MATCHINGS AND LOCAL CONSISTENCY Ross Churchley Simon Fraser University 13 June

Fair Stable Matchings Christine T. Cheng University of Wisconsin-Milwaukee April 17, 2015

Rank Maximal Matchings Meghana Nasre talk based on paper by R. W. Irving, T. Kavitha, K.

MAP Estimation with Perfect Graphs Tony Jebara July 21, 2009 Background Matchings Perfect

Computing Large Matchings Fast Ignaz Rutter Alexander Wolff Karlsruhe University TU Eindhoven

Stream Algorithmics Albert Bifet March 2012 Data Streams Big Data & Real Time Data Streams

Environmental Health Science Data Streams Data Streams Health Data Health Data Brian S.

Graphs Lecture 15 Matchings A matching in a graph G=(V ,E) is a set of edges which do not share

Data Streams Many large sources of data are generated as streams of updates: IP Network

Data Streams Many large sources of data are generated as streams of updates: IP Network

5 Rounds of SHA-3 Using Generalized Internal Differentials Itai Dinur 1 , Orr Dunkelman 1,2 and

Teoria Erg odica Diferenci avel lecture 1: spaces with measure Instituto Nacional de

Graphs More Examples More Examples More Examples Path graph P n : V = {1,,n} and E = {

9/18/17 sets Set: An unordered collection of objects CS 220: Discrete Structures and their The

Estimating metrics suitable to an empirical manifold of shapes, using transport against the

Combinatorial aspects of fullerenes and quadrangulations of surfaces Mat ej Stehlk 5/5/2020

Borel reducibility and symmetric models Assaf Shani UCLA Boise Extravaganza in Set Theory

An Introduction to the Combinatorics of Symmetric Functions Peter McNamara Pomona College 4

Finding Graph Matchings in Data Streams Andrew McGregor, UPenn The - PowerPoint PPT Presentation

Finding Graph Matchings in Data Streams Andrew McGregor, UPenn The Streaming Model The Streaming Model Classic Problem: Median Finding [Munro & Paterson] The Streaming Model Classic Problem: Median Finding [Munro & Paterson]

Finding your way in a graph Finding your way in a graph Finding your way in a graph Finding your

Approximate Matchings in Dynamic Graph Streams Sanjeev Khanna University of Pennsylvania Joint

Characterization for perfect matchings Often we are interested in perfect matchings. X Y 1 5

The cavity method for matchings Marc Lelarge INRIA &amp; ENS Cargese 2014 1 MAXIMUM MATCHINGS

From Matchings to Independent Sets Vadim Lozin Mathematics Institute University of Warwick From

WITH C++ Prof. Amr Goneid AUC Part 9. Streams &amp; Files Prof. amr Goneid, AUC 1 Streams

TREE-TRANSVERSE MATCHINGS AND LOCAL CONSISTENCY Ross Churchley Simon Fraser University 13 June

Fair Stable Matchings Christine T. Cheng University of Wisconsin-Milwaukee April 17, 2015

Rank Maximal Matchings Meghana Nasre talk based on paper by R. W. Irving, T. Kavitha, K.

MAP Estimation with Perfect Graphs Tony Jebara July 21, 2009 Background Matchings Perfect

Computing Large Matchings Fast Ignaz Rutter Alexander Wolff Karlsruhe University TU Eindhoven

Stream Algorithmics Albert Bifet March 2012 Data Streams Big Data &amp; Real Time Data Streams

Environmental Health Science Data Streams Data Streams Health Data Health Data Brian S.

Graphs Lecture 15 Matchings A matching in a graph G=(V ,E) is a set of edges which do not share

Data Streams Many large sources of data are generated as streams of updates: IP Network

Data Streams Many large sources of data are generated as streams of updates: IP Network

5 Rounds of SHA-3 Using Generalized Internal Differentials Itai Dinur 1 , Orr Dunkelman 1,2 and

Teoria Erg odica Diferenci avel lecture 1: spaces with measure Instituto Nacional de

Graphs More Examples More Examples More Examples Path graph P n : V = {1,,n} and E = {

9/18/17 sets Set: An unordered collection of objects CS 220: Discrete Structures and their The

Estimating metrics suitable to an empirical manifold of shapes, using transport against the

Combinatorial aspects of fullerenes and quadrangulations of surfaces Mat ej Stehlk 5/5/2020

Borel reducibility and symmetric models Assaf Shani UCLA Boise Extravaganza in Set Theory

An Introduction to the Combinatorics of Symmetric Functions Peter McNamara Pomona College 4

The cavity method for matchings Marc Lelarge INRIA & ENS Cargese 2014 1 MAXIMUM MATCHINGS

WITH C++ Prof. Amr Goneid AUC Part 9. Streams & Files Prof. amr Goneid, AUC 1 Streams

Stream Algorithmics Albert Bifet March 2012 Data Streams Big Data & Real Time Data Streams