Public-Private Model in Graphs Brian Brubach Soheil Ehsani - PowerPoint PPT Presentation

Public-Private Model in Graphs Brian Brubach ● Soheil Ehsani ● Karthik Sankararaman ●

Overview Introduction of the model ● Simple Example to illustrate the model ● Comparison to other well-studied models ● Algorithm to illustrate the all-pairs shortest path ● Community Detection aka Densest sub-graph problem ● Extension to other sub-additive functions e.g. MaxCut ● Algorithm for Vertex Cover ● Experimental Results* ● Future Directions ●

The Public-Private Model ● Introduced by Chierichetti, Epasto, Kumar, Lattanzi, Mirrokni ○ KDD 2015 Best Paper Award ● The public graph G = ( V , E ) is known ● For each node u , there is an unknown private graph G u = ( V , E u ) ○ For all ( v , w ) in E u both v and w are at most distance 2 from u . Why? ○ WLOG E ∩ E u = ∅ u ● Together they form the public- private graph G ∪ G u

Motivation: Social Networks Facebook, Google+, Twitter ● Nodes represent people/users ● Edges represent connections (eg. friendship, group membership) ● Private graph edges represent private friend lists, private groups, etc ○ Among 1.4 million New York Facebook users, 52.6% hid their friends (Dey, Jelveh, Ross 2012) ○ Private friends Private circle (google+) Private group u u u v

Motivation: Social Networks Very large graphs (Big data!) ● YouTube: 1,000,000+ nodes ○ Problem: processing the public-private ● graph for each node/person is too slow Goal: preprocess the public graph to ● answer queries fast when the private graph is revealed How fast? ○ u

The Public-Private Model Known public graph G = ( V , E ) ● Unknown private graph G u = ( V , E u ) ● ○ For all ( v , w ) in E u both v and w are at most distance 2 from u WLOG E ∩ E u = ∅ ○ Goal: ● Preprocess the public graph using poly(| E |) ○ time and Õ(| V |) space When G u is revealed, answer queries using ○ u time/space Õ(| E u |) and poly(lg | V |)

Warm-up: Number of Connected Components Algorithm ● Label the components of the public graph ○ and store total number of components O( m ) time, O( n lg n ) space ■ Count the number of different ○ components that G u connects O(| E u |) time ■ u

All Pairs Shortest Path (APSP) Important problem in Social Networks ● In learning algorithms, distance between two people can be used as a feature ● E.g. Gives information of likelihood of a person following a celebrity ○ Can be solved exactly in O(n 3 ) time offline ● Too slow for large graphs ○ Will later describe a O(poly log n) approximation in near-linear time ●

APSP in public-private model Will use the poly-log approximation to get an algorithm in the public-private ● model Here, we look at the restricted model where distance from u is at most 2 in private graph ○ Compute a poly log (n) approximation on the public graph ● For a private graph query with u, we need to find dist(u, *) ● ○ We can have the following cases(described in the next few slides) for dist(u,v) Take the one with the minimum of all of them as dist(u,v) in the union graph ●

Case 1 dist(u,v) in union graph is same dist(u,v) in public graph ● In this case, no new computation needs to be done ●

Case 2 dist(u,v) in union graph is 1+ dist(w,v) where w is a neighbor of u in private graph ● and dist(w,v) is the distance in public graph v u w

Case 3 dist(u,v) in union graph is 2+ dist(z,v) where z is at distance 2 of u in private ● graph and dist(z,v) is the distance in public graph v u z

O(poly log n) approximation to APSP Due to Das Sharma, Gollapudi, Najork, Panigraphy[WSDM 2010] ● A sampling based approach ● Choose a random subset of vertices and find distance to this random subset ● Use this distance to estimate distance between any two pairs ●

Estimating dist(u, v) u3 v3 u2 v2 u1 v1 q S0 S1 S2 S3 v u SKETCH(u) = {q, u1, u2, u3} SKETCH(v) = {q, v1, v2, v3} n = 11 r = ⎣ log n ⎦ = 3 CommonSketch = SKETCH(u) ∩ SKETCH(v) dist(u,v) = min{dist(u, w) + dist(w, v): w ∈ CommonSketch}

Analysis Single run of the algorithm gives a O(polylog n) approximation in expectation ● Proof omitted here ○ Success probability can be amplified by running the algorithm O(log n) times and ● taking the sketches to be the union of the sketches in each iteration Finally computing the distances using the common sketch as before on this union ● of sketches gives a O(polylog n) with high probability Chernoff Bound type arguments on the generated subsets ○

Putting it together Preprocessing takes O(m polylog n) time ● The closest vertex computation can be performed by BFS from each set Si to all vertices ○ For each vertex a O(polylog n) sketch stored; Hence total space O(n polylog n) ● Query takes O(|E u | polylog n) time ●

Community Detection Central question in Social Network: Do node A and node B in a graph share a ● core similarity? ○ E.g.: Same geographical location in Yelp, Papers in similar topics in DBLP Many notions and various algorithms in the Social Networks literature ● Important problem outside CS community ● E.g.: Communities in protein interaction graphs studied by Biologists ○

Example of Community Detection Nodes: A topic-dedicated stack exchange Edges: If a user is part of both the sites Colors: Different communities

Densest Subgraph Concept of Community Detection often formalized as ● the densest subgraph problem ○ Formal definition in the following slide Often well-captures the intuitive definition of “well- ● connected” nodes

The Densest Subgraph Find a set S of vertices maximizing ●

Future Works Can we give a similar approach for other functions, such as ● sub-modular ○ matroid ○ Can we formulate this method as a general tool which includes all cases such as ● ○ union intersection ○ ○ maximum minimum ○ Can we modify the model to capture other real world problems? ● What if we allow the private graph to delete edges (eg. “unfollowing” on Facebook)? ○ What if two private graphs G u and G v are revealed together (eg. friend request)? ○

Public-Private Model in Graphs Brian Brubach Soheil Ehsani - PowerPoint PPT Presentation

Public-Private Model in Graphs Brian Brubach Soheil Ehsani Karthik Sankararaman Overview Introduction of the model Simple Example to illustrate the model Comparison to other well-studied models Algorithm to

Graphs () Graphs () Graphs Graphs Graphs are collections of nodes

Weighted graphs Weighted graphs Weighted graphs Weighted graphs Graphs with numbers, called

Week 4 Kullmann Graphs and directed graphs Elementary Graph Algorithms Representing graphs

On some classes of Deza graphs Deza graphs without 3-cocliques Line graphs V.V. Kabanov 1 Deza

Graphs Graphs Examples Definitions Implementation/Representation of graphs Graphs

Grid.java public public class class Grid { private private final final int int width;

Graphs Graphs Simple graphs Algorithms Depth-first search Breadth-first search

Searching on Graphs November 16, 2016 CMPE 250 Graphs- Searching on Graphs November 16, 2016 1

CS200: Graphs Prichard Ch. 14 Rosen Ch. 10 CS200 - Graphs 1 Graphs A collection of What can

Today. Types of graphs. Today. Types of graphs. Complete Graphs. Trees. Hypercubes. Today.

Examples of Obstructions to Apex Graphs, Edge-Apex Graphs, and Contraction-Apex Graphs

STACKED GRAPHS STACKED GRAPHS EVOLUTION OF STACKED GRAPHS Stacked Area Chart Themeriver

Algorithms for Lipschitz Learning on Graphs Sushant Sachdeva Yale Institute of Network Sciences

Graphs Graph definitions There are two kinds of graphs: directed graphs (sometimes called

Graphs Graphs Simple graphs Algorithms Depth-first search Breadth-first search

House of Graphs: Introduction what are interesting graphs? GraPHedron First Definition of

Units without degeneracy, from polycategories to sequent calculi Amar Hadzihasanovic (

Tota otal S Sens ensitivity B Bas ased ed DFM FM Optimiz mizat ation ion of of Standar

Typically-Correct Derandomization for Small Time and Space William M. Hoza 1 University of Texas

ASIC Physical Design CMOS Processes Smith Text: Chapters 2 & 3 Weste CMOS VLSI

Islands of tractability in ontology-based data access Michael Zakharyaschev Department of

Programmable Hash Functions in the Multilinear Setting Eduarda S. V. Freire, Dennis Hofheinz,

Exponential-Time Approximation of Hard Problems Lukasz Kowalik joint work with: Marek Cygan,

Polyteam Semantics Team Semantics Axiomatizations in team semantics Polyteams and Jonni

Public-Private Model in Graphs Brian Brubach Soheil Ehsani - PowerPoint PPT Presentation

Public-Private Model in Graphs Brian Brubach Soheil Ehsani Karthik Sankararaman Overview Introduction of the model Simple Example to illustrate the model Comparison to other well-studied models Algorithm to

Graphs () Graphs () Graphs Graphs Graphs are collections of nodes

Weighted graphs Weighted graphs Weighted graphs Weighted graphs Graphs with numbers, called

Week 4 Kullmann Graphs and directed graphs Elementary Graph Algorithms Representing graphs

On some classes of Deza graphs Deza graphs without 3-cocliques Line graphs V.V. Kabanov 1 Deza

Graphs Graphs Examples Definitions Implementation/Representation of graphs Graphs

Grid.java public public class class Grid { private private final final int int width;

Graphs Graphs Simple graphs Algorithms Depth-first search Breadth-first search

Searching on Graphs November 16, 2016 CMPE 250 Graphs- Searching on Graphs November 16, 2016 1

CS200: Graphs Prichard Ch. 14 Rosen Ch. 10 CS200 - Graphs 1 Graphs A collection of What can

Today. Types of graphs. Today. Types of graphs. Complete Graphs. Trees. Hypercubes. Today.

Examples of Obstructions to Apex Graphs, Edge-Apex Graphs, and Contraction-Apex Graphs

STACKED GRAPHS STACKED GRAPHS EVOLUTION OF STACKED GRAPHS Stacked Area Chart Themeriver

Algorithms for Lipschitz Learning on Graphs Sushant Sachdeva Yale Institute of Network Sciences

Graphs Graph definitions There are two kinds of graphs: directed graphs (sometimes called

Graphs Graphs Simple graphs Algorithms Depth-first search Breadth-first search

House of Graphs: Introduction what are interesting graphs? GraPHedron First Definition of

Units without degeneracy, from polycategories to sequent calculi Amar Hadzihasanovic (

Tota otal S Sens ensitivity B Bas ased ed DFM FM Optimiz mizat ation ion of of Standar

Typically-Correct Derandomization for Small Time and Space William M. Hoza 1 University of Texas

ASIC Physical Design CMOS Processes Smith Text: Chapters 2 &amp; 3 Weste CMOS VLSI

Islands of tractability in ontology-based data access Michael Zakharyaschev Department of

Programmable Hash Functions in the Multilinear Setting Eduarda S. V. Freire, Dennis Hofheinz,

Exponential-Time Approximation of Hard Problems Lukasz Kowalik joint work with: Marek Cygan,

Polyteam Semantics Team Semantics Axiomatizations in team semantics Polyteams and Jonni

ASIC Physical Design CMOS Processes Smith Text: Chapters 2 & 3 Weste CMOS VLSI