A Graph Modification Approach for Finding CorePeriphery Structures - PowerPoint PPT Presentation

A Graph Modification Approach for Finding Core–Periphery Structures in Protein Interaction Networks Sharon Bruckner 1 uffner 2 Christian Komusiewicz 2 Falk H¨ 1 Institut f¨ ur Mathematik, Freie Universit¨ at Berlin 2 Institut f¨ ur Softwaretechnik und Theoretische Informatik, TU Berlin 30 September 2014 S. Bruckner et al. (FU Berlin & TU Berlin) Core–Periphery Structures in Protein Interaction Networks 1

Protein Complex Identification Task: Given a protein interaction network, identify its protein complexes and functional modules Common assumptions: Complexes and functional modules are dense subnetworks Functional modules have no or only small overlap � Formulation as graph clustering problem Cluster Editing Input: An undirected graph G = ( V , E ). Task: Find a minimum-size set of edge deletions and edge insertions that converts the graph into a cluster graph , that is, a graph where each connected component is a clique. S. Bruckner et al. (FU Berlin & TU Berlin) Core–Periphery Structures in Protein Interaction Networks 2

Denseness of Complexes and Functional Units Problem: Functional units are not necessarily dense Nucleosome remodeling deacetylase (NuRD) complex of M. musculus and its interactions with transcription factors � Core–periphery model of protein complexes [Gavin et al., Nature ’06] S. Bruckner et al. (FU Berlin & TU Berlin) Core–Periphery Structures in Protein Interaction Networks 3

Core–Periphery Model Aim: Uncover global core–periphery structure of given PPI network with dense cores and sparse peripheries. Formalization: Split graph = can be partitioned into clique and independent set Split cluster graph = every connected component is a split graph � Split Cluster Editing Input: An undirected graph G = ( V , E ). Task: Find a minimum-size set of edge deletions and edge insertions that converts the graph into a split cluster graph. S. Bruckner et al. (FU Berlin & TU Berlin) Core–Periphery Structures in Protein Interaction Networks 4

Shared Peripheries So far: Complexes and functional modules are dense subnetworks have core–periphery structure Functional modules have no or only small overlap Now: allow overlap but only in peripheries � Monopolar graph = can be partitioned into cluster graph and independent set � Monopolar Editing Input: An undirected graph G = ( V , E ). Task: Find a minimum-size set of edge deletions and edge insertions that converts the graph into a monopolar graph. S. Bruckner et al. (FU Berlin & TU Berlin) Core–Periphery Structures in Protein Interaction Networks 5

Problem Complexity—Split Cluster Editing Theorem: (Foldes & Hammer ’71) A graph is a split graph iff it does not contain an induced subgraph that is a 2 K 2 , C 4 , or C 5 . 2 K 2 necktie bowtie C 4 C 5 P 5 Main Results: A graph is a split cluster graph iff it does not contain an induced subgraph that is a C 4 , C 5 , P 5 , necktie, or bowtie. Split Cluster Editing is APX-hard and NP-hard even on graphs with maximum degree 11. Split Cluster Editing can be solved in O (10 k · m ) time, where k is the number of necessary edge modifications. S. Bruckner et al. (FU Berlin & TU Berlin) Core–Periphery Structures in Protein Interaction Networks 6

Problem Complexity—Monopolar Editing Observation: Monopolar graphs have infinitely many forbidden subgraphs (smallest and only with 5 vertices is the wheel W 4 ( )). Known: Vertex-partitioning into fixed additive induced-hereditary properties is NP-hard [Farrugia, Electron. J. Combin. ’04]. � Deciding whether a graph is monopolar is NP-hard. S. Bruckner et al. (FU Berlin & TU Berlin) Core–Periphery Structures in Protein Interaction Networks 7

ILP formulations Forbidden subgraph-based Partition variables Column generation S. Bruckner et al. (FU Berlin & TU Berlin) Core–Periphery Structures in Protein Interaction Networks 8

Forbidden subgraph-based ILP formulation for SCE First try: use forbidden subgraph characterization � Binary variable e uv = 1 if { u , v } is in the solution graph Define ¯ e uv := 1 − e uv � � minimize e uv + ¯ e uv { u , v }∈ E { u , v } / ∈ E subject to � � ∀ forbidden subgraph F : e uv + ¯ e uv ≥ 1 { u , v }∈ F { u , v } / ∈ F O ( n 5 ) constraints � use row generation (lazy constraints) S. Bruckner et al. (FU Berlin & TU Berlin) Core–Periphery Structures in Protein Interaction Networks 9

Partition variable ILP formulation for SCE Idea : Fix the assignment to core and periphery before destroying the forbidden subgraphs Lemma: Let G = ( V , E ) be a graph and C ˙ ∪ I = V a partition of the vertices. Then G is a split cluster graph with core vertices C and independent set vertices I iff it does not contain an edge with both endpoints in I , nor an induced P 3 with both endpoints in C . S. Bruckner et al. (FU Berlin & TU Berlin) Core–Periphery Structures in Protein Interaction Networks 10

Partition variable ILP formulation for SCE Binary variable e uv = 1 if { u , v } is in the solution graph. Define ¯ e uv := 1 − e uv Binary variable c u = 1 if u is a core vertex. Define ¯ c u := 1 − c u . � � minimize e uv + ¯ e uv { u , v }∈ E { u , v } / ∈ E subject to ∀ u , v : c u + c v + ¯ e uv ≥ 1 ∀ u � = v , v � = w > u : ¯ e uv + ¯ e vw + e uw + ¯ c u + ¯ c w ≥ 1 O ( n 3 ) constraints � still use row generation (lazy constraints) S. Bruckner et al. (FU Berlin & TU Berlin) Core–Periphery Structures in Protein Interaction Networks 11

Partition variable ILP formulation for Monopolar Editing Idea (again) : Fix the assignment to core and periphery before destroying the forbidden subgraphs Lemma: Let G = ( V , E ) be a graph and C ˙ ∪ I = V a partition of the vertices. Then G is a split cluster graph with core vertices C and independent set vertices I iff it does not contain an edge with both endpoints in I , nor an induced P 3 consisting only of vertices in C . S. Bruckner et al. (FU Berlin & TU Berlin) Core–Periphery Structures in Protein Interaction Networks 12

Partition variable ILP formulation for Monopolar Editing Binary variable e uv = 1 if { u , v } is in the solution graph. Define ¯ e uv := 1 − e uv Binary variable c u = 1 if u is a core vertex. Define ¯ c u := 1 − c u . � � minimize e uv + ¯ e uv { u , v }∈ E { u , v } / ∈ E subject to ∀ u , v : c u + c v + ¯ e uv ≥ 1 ∀ u � = v , v � = w > u : ¯ e uv + ¯ e vw + e uw + ¯ c u + ¯ c v + ¯ c w ≥ 1 O ( n 3 ) constraints � still use row generation (lazy constraints) S. Bruckner et al. (FU Berlin & TU Berlin) Core–Periphery Structures in Protein Interaction Networks 13

Column generation for Split Cluster Editing Binary variables z C = 1 if cluster C ∈ 2 V is part of the solution. � maximize c C z C , C ∈ 2 V � s. t. z C = 1 ∀ u ∈ V , C ∈ 2 V | u ∈ C where c C is the “value” of the cluster (number of edges of G [ C ] minus the splittance of G [ C ], that is, the number of edge insertions and deletions to make it a split graph). Problem: Exponentially many variables. Idea: Successively add only those variables (“columns”) that are “needed”, that is, their introduction improves the objective. S. Bruckner et al. (FU Berlin & TU Berlin) Core–Periphery Structures in Protein Interaction Networks 14

Column Generation: Auxiliary problem Lemma: For the relaxation of the ILP, the objective function change from adding a cluster C is � c C − λ u , u ∈ C where λ u is the shadow price associated with the constraint of vertex u . � need to find a cluster that maximizes cluster value minus vertex weights. Idea: Use an ILP. S. Bruckner et al. (FU Berlin & TU Berlin) Core–Periphery Structures in Protein Interaction Networks 15

ILP tuning tricks Warm start with heuristic solution MIP emphasis: balance between proving optimality and finding better solutions Cutting planes for P 5 : for all distinct u , v , w , x , y ∈ V : e xy + 1 2 e uw + e vx + 1 2 e wy + 1 2 e xu + 1 e uv + ¯ ¯ e vw + ¯ e wx + ¯ 2 e yv ≥ 1 . (for monopolar, W 4 ) S. Bruckner et al. (FU Berlin & TU Berlin) Core–Periphery Structures in Protein Interaction Networks 16

Heuristics Forbidden subgraph-based Simulated annealing S. Bruckner et al. (FU Berlin & TU Berlin) Core–Periphery Structures in Protein Interaction Networks 17

Forbidden subgraph heuristic for Split Cluster Editing Idea Edit an edge that destroys many forbidden subgraphs. Problems Slow Can get caught in loops Not very good results S. Bruckner et al. (FU Berlin & TU Berlin) Core–Periphery Structures in Protein Interaction Networks 18

Simulated Annealing heuristic for Split Cluster Editing Simulated Annealing Start with a clustering where each vertex is a singleton. Randomly move a vertex to a cluster that contains one of its neighbors. Accept if this improves the objective k ; otherwise, accept with small probability that decreases over time. To evaluate the objective, we can use the following theorem: Theorem (Hammer & Simeone ’81) The minimum number of edits to make a graph a split graph can be found in linear time. S. Bruckner et al. (FU Berlin & TU Berlin) Core–Periphery Structures in Protein Interaction Networks 19

A Graph Modification Approach for Finding CorePeriphery Structures - PowerPoint PPT Presentation

A Graph Modification Approach for Finding CorePeriphery Structures in Protein Interaction Networks Sharon Bruckner 1 uffner 2 Christian Komusiewicz 2 Falk H 1 Institut f ur Mathematik, Freie Universit at Berlin 2 Institut f ur

Finding your way in a graph Finding your way in a graph Finding your way in a graph Finding your

Welcome Welcome Core: Core A Regional Destination Core: Core UL Core: Core Downtown

Secondment immersed in community life Transformation from fisheries to experience economy - and

The Grid with Intelligent Periphery Kameshwar Poolla UC Berkeley June 24, 2013 Kameshwar Poolla

Joint work with Noga Alon WOLA 2019 GRAPH MODIFICATION For an input graph find the minimum

Caching, Parallelism, Fault Tolerance Marco Serafini COMPSCI 532 Lectures 2-3 Memory Hierarchy

Phase Transition of the 2-Choices Dynamics on Core-Periphery Networks E. Cruciani, E. Natale, A.

GRAPH MINING AND GRAPH KERNELS Part I: Graph Mining Karsten Borgwardt^ and Xifeng Yan*

GRAPH TRAVERSAL PATH FINDING AND GRAPH TRAVERSAL Path finding refers to determining the shortest

GRAPH MINING AND GRAPH KERNELS Part II: Graph Kernels Karsten Borgwardt^ and Xifeng Yan*

Finding Hidden Supernovae with Finding Hidden Supernovae with Finding Hidden Supernovae with

Graph Indexing: Tree + Delta Delta >= Graph >= Graph Graph Indexing: Tree + Peixian Zhao,

Graph Mining Marco Serafini COMPSCI 532 Lecture 11 Classes of Graph Systems Graph

Fields modification Fields modification in high Al or In content III- -nitrides nitrides in

Fiber Modification Kraft-TMP Bonding Fiber Modification: Fiber Bond Enhancement Background:

Protection of metals from corrosion. 1.Modification of environment. 2.Modification of

Machine Learning and Association rules Petr Berka, Jan Rauch University of Economics, Prague

Genomic Ancestry Analysis in Wild Hybrid House Mice Megan Frayer Ph.D. Student, Laboratory of

2 3 4 5 6 7

Phylogene)c Trees COMPSCI 260 Spring 2016 Phylogene)cs

Eight Regional Liaisons Mission, Vision, Values, Bylaws New website Jen Spivey,

Life Enrichment Lenda Dincer -EHMG About Me Education Bachelors Degree in Family Studies

Person Centered Care for Person's With Dementia TRO Conference London 2017 Introduction Katelyn

1. Exergames in Rehabilitation WuppDi!, University Bremen University Ulster Assad et al. 2011

A Graph Modification Approach for Finding CorePeriphery Structures - PowerPoint PPT Presentation

A Graph Modification Approach for Finding CorePeriphery Structures in Protein Interaction Networks Sharon Bruckner 1 uffner 2 Christian Komusiewicz 2 Falk H 1 Institut f ur Mathematik, Freie Universit at Berlin 2 Institut f ur

Finding your way in a graph Finding your way in a graph Finding your way in a graph Finding your

Welcome Welcome Core: Core A Regional Destination Core: Core UL Core: Core Downtown

Secondment immersed in community life Transformation from fisheries to experience economy - and

The Grid with Intelligent Periphery Kameshwar Poolla UC Berkeley June 24, 2013 Kameshwar Poolla

Joint work with Noga Alon WOLA 2019 GRAPH MODIFICATION For an input graph find the minimum

Caching, Parallelism, Fault Tolerance Marco Serafini COMPSCI 532 Lectures 2-3 Memory Hierarchy

Phase Transition of the 2-Choices Dynamics on Core-Periphery Networks E. Cruciani, E. Natale, A.

GRAPH MINING AND GRAPH KERNELS Part I: Graph Mining Karsten Borgwardt^ and Xifeng Yan*

GRAPH TRAVERSAL PATH FINDING AND GRAPH TRAVERSAL Path finding refers to determining the shortest

GRAPH MINING AND GRAPH KERNELS Part II: Graph Kernels Karsten Borgwardt^ and Xifeng Yan*

Finding Hidden Supernovae with Finding Hidden Supernovae with Finding Hidden Supernovae with

Graph Indexing: Tree + Delta Delta &gt;= Graph &gt;= Graph Graph Indexing: Tree + Peixian Zhao,

Graph Mining Marco Serafini COMPSCI 532 Lecture 11 Classes of Graph Systems Graph

Fields modification Fields modification in high Al or In content III- -nitrides nitrides in

Fiber Modification Kraft-TMP Bonding Fiber Modification: Fiber Bond Enhancement Background:

Protection of metals from corrosion. 1.Modification of environment. 2.Modification of

Machine Learning and Association rules Petr Berka, Jan Rauch University of Economics, Prague

Genomic Ancestry Analysis in Wild Hybrid House Mice Megan Frayer Ph.D. Student, Laboratory of

2 3 4 5 6 7

Phylogene)c Trees COMPSCI 260 Spring 2016 Phylogene)cs

Eight Regional Liaisons Mission, Vision, Values, Bylaws New website Jen Spivey,

Life Enrichment Lenda Dincer -EHMG About Me Education Bachelors Degree in Family Studies

Person Centered Care for Person's With Dementia TRO Conference London 2017 Introduction Katelyn

1. Exergames in Rehabilitation WuppDi!, University Bremen University Ulster Assad et al. 2011

Graph Indexing: Tree + Delta Delta >= Graph >= Graph Graph Indexing: Tree + Peixian Zhao,