Creating In Silico Interactomes Creating In Silico Interactomes - PowerPoint PPT Presentation

Creating In Silico Interactomes Creating In Silico Interactomes � Tony Chiang � Denise Scholtens � Robert Gentleman

Objectives Objectives � Define interactomes – Biological and in silico � Describe the process of construction � Relate the data structure – How this structure is comprehensive to detailing the data – Why this structure is good for some statistical modeling � Simple examples in using the interactome � Future Work

Introduction and Background Introduction and Background � Basic Terminology – Protein Complex � Group of 2 or more associated proteins � Conduct some biological process – Protein Complex Interactome � Coordinated set of protein complexes � Specific to each cell or tissue type � Variable over environmental conditions

Graph Theoretic Representation Graph Theoretic Representation � Hyper-graph – Generalization of ordinary graph � Vertex set, V, is the collection of unique proteins – Let |V| = n � Hyper-edge, E, is the collection of unique protein complexes – Then |E| ≤ 2 n - (n+1) � Interactome ↔ Hyper-graph – Most protein complex identification experiments occur in some biological interactome

In Silico Interactome In Silico Interactome � Collection of estimated protein complexes representing an in silico model organism – The ISI is a simulated organism with which we can conduct computational experiments � ISI is modeled after biological interactomes � Storage of the ISI – Incidence Matrix Representation of the Hyper-Graph � Rows indexed by the vertices (expressed proteins) � Columns indexed by the hyper-edges (complexes) � Incidence is equivalent to membership

Interactome to Incidence Matrix to Incidence Matrix Interactome 3 Complex 1 Complex 2 1 2 Pr otein 1 1 0 Pr otein 2 1 1 Pr otein 3 1 0 2 4 Pr otein 4 0 1

Why hyper- -graph representation graph representation Why hyper The hyper-graph representation encapsulates more information than a graph representation. We look at the example of PP2A I, II, III By example, we show why protein-protein interaction graphs and co-membership graphs cannot incorporate protein membership information

Protein-Protein Protein- Protein Complex Direct Interaction Co-Membership Graph Graph RST1 RST1 TDP3 PHP21 TDP3 PHP21 Php22 CDC55 Php22 CDC55 Neither graph can determine Protein Complex Membership

A Hyper-Graph (Forgive me) details protein membership, co-membership, but not interaction data RTS1 TDP3 PHP22 PHP21 CDC55

Constructing the ISI Constructing the ISI � Presently, the simulated model organism is based on Saccharomyces cerevisiae � Constructing the in silico interactome – Collecting protein complex composition data � Gene Ontology � MIPS � High Through-Put Affinity Purification - Mass Spectrometric Experimentation – Protein Complex Estimation via apComplex

ISI - - Limitations Limitations ISI � Comprehensive – It does not contain an exhaustive list of all protein complexes since it reflects known biology � Definitive – It contains mostly estimated protein complexes via both low and high through-put technologies � Meant to replace experimental de novo research – It cannot give insight to unknown biological complexes and interactomes

ISI - - Benefits Benefits ISI � Dynamic – It can be updated and modified as new data is discovered and old data is revised � Simplified – Redundancies from different data sources can be eliminated as well as irrelevant protein complexes � Versatile – An ISI can be modeled after any organism from yeast to mice to men

Why build in in silico silico interactomes interactomes Why build � Reasons to build valid in silico interactomes: – Provides one single data structure with which to conduct in silico experiments – Provides tool with which simulated wet-lab experiments can be conducted – Use in the generation of multiple data sets – Develop tools and strategy for small scale experiments – Study of perturbation in networks – Effects of varying sampling paradigms on large, non- random networks

Integrating Data and Deriving Statistics GO MIPS Gavin Ho Krogan In Silico Interactome Computational Statistics

In Silico Interactome Silico Interactome for Yeast for Yeast - - In ScISI ScISI � Computational parsing data from GO and MIPS – Term mining � [Cc]omplex � Suffix “-ase” (e.g. RNA polymerase II) � Suffix “-some” (e.g. ribosome) � Manual parsing resultant protein complexes � Collecting estimates from apComplex – Experiments � Gavin et al. (2002, 2006*) � Ho et al. (2002) � Krogan et al. (2004)

ScISI - - a model example a model example ScISI � In silico S. cerevisiae – 1661 unique expressed proteins – 734 distinct protein complexes � Basic statistical profile – Complex � Cardinality range = [2,57] � Median cardinality = 4 � Mean cardinality = 5.98 – Protein � Membership range = [1,31] � Median membership = 1 � Mean membership = 2.64

In Silico Silico experiments on ScISI experiments on ScISI In � Determining protein complex structures – Let A be the incidence matrix of ScISI � Then [AA T ] ij counts the number of complexes to which protein i and protein j belong, that is how many complexes these two proteins share co- membership – Transformation gives a measure of protein affiliation but not direct binary interaction

Graphical representation of in silico silico Graphical representation of in experiments experiments � We make use of the equivalence of hyper-graphs to bi-partite graph – Equivalence is determined by letting the set of hyper- edges be the second set of nodes. � The operation AA T is a contraction on the protein complex nodes of the bi-partite graph – This process takes us from protein complex membership to protein-protein complex co- membership

Ordinary Graph: Bi-partite Graph: Protein-Protein Protein Complex Complex Co - Membership Membership 1 A 2 1 2 B 3 3 4 C 4

Where to from here? Where to from here? � Let’s re-iterate the 5 reasons to build valid in silico interactomes: – Provides tool with which simulated wet-lab experiments can be conducted – Use in the generation of multiple data sets – Develop tools and strategy for small scale experiments – Study of perturbation in networks – Effects of varying sampling paradigms on large, non- random networks � All 5 of which are still open ended…

Future Direction Future Direction � An interesting question… – Many of the protein complexes are estimates obtained from Affinity Purification - Mass Spectrometry experiments – Can we validate these estimates? � Each interactome built needs to be validated before conducting computational experiments – We present two different methods to validate the interactomes.

Validating ISI Validating ISI � Using direct binary interaction data to verify protein complex composition – Necessary and sufficient condition is that induced interaction graph be connected on the sub-set of proteins in each protein complex � Hard to verify – Binary interaction data is sparse – Error Rates are extremely high – There is a need to decipher between true negative interactions between two proteins and un-tested interactions between two proteins – Induced interaction graph is almost always dis- connected

Validating ISI Validating ISI � Simulation Models – Simulate the AP-MS technology and derive data-sets on which we can apply estimation algorithm. – Determine how effective estimation algorithm based on statistical significance – Compare with other estimation algorithms

Creating In Silico Interactomes Creating In Silico Interactomes - PowerPoint PPT Presentation

Creating In Silico Interactomes Creating In Silico Interactomes Tony Chiang Denise Scholtens Robert Gentleman Objectives Objectives Define interactomes Biological and in silico Describe the process of construction

In Silico Spectra Lab Slide 1 In Silico Spectra Lab Explore & investigate Explore &

Building and Validating Oncosimulators and Oncosimulator Based Hypermodels as Clinical Decision

Module 4: Creating Data Types and Tables Overview Creating Data Types Creating Tables

STRUCTURAL BIOLOGY AND RADIOBIOLOGY LAB I2BC - CEA Saclay PROTEIN INTERACTIONS AT THE HEART OF

Creating Dashboards of Direct and Creating Dashboards of Direct and Creating Dashboards of Direct

Creating a Community of Inquiry Creating a Community of Inquiry : Creating a Community of Inquiry

Module 3: Creating and Managing Databases Overview Creating Databases Creating

Fighting Infections and Antimicrobial Resistance Through GPU-Accelerated In Silico Models Radu

In silico blood genotyping from exome sequencing data Manuel Giollo 1,2 , Giovanni Minervini 1 ,

Donovan N. Chin & R. Aldrin Denny Traditional Drug Discovery (insert graph) In Silico

The WISDOM initiative Wide In Silico Docking On Malaria Yannick Legr, CNRS/IN2P3 on behalf oh

NCI efforts to build a foundation for in-silico cancer research Ken Buetow NCICB/NCI/NIH/DHHS

Diversity in vivo, Multicore in silico : How to link metagenomics and community ecology Alain

Potential applicability and challenges of using in pp y g g vitro and in silico methodologies

Prediction in silico of major clearance pathway of drugs in vivo Kota Toshimoto Sugiyama

A knowledge-based approach to the in silico assessment of toxicity Carol Marchant

The Academic programme and the module Maritime Sustainability and MoS 2015

INDUSTRY GLOBAL COMPANY WESCO MANUFACTURER & VETERANS WITH LOCAL PRESENCE BROADCAST &

Smart Cities and Infrastructure Introduction of the Secretary General's Report Ms. Dong Wu

Overview Introduction to HIV therapy Arevir geno2pheno Patient consent and patient identifiers

aquaDam Dammed Real Time Water Quality Monitoring INNOVATIVE SOLUTIONS FOR WATER &

The balance of excitation transfer and recombination processes in MoS 2 nanotubes and flakes Olga

Christopher M. Jones (Nasdaq: WWR) Chief Executive Officer 1 CAUTIONARY STATEMENT CAUTIONARY

I dentification of inflammatory gene modules based on variations of human endothelial cell

Sambuz

Useful Links

Newsletter

Mail Us

Creating In Silico Interactomes Creating In Silico Interactomes - PowerPoint PPT Presentation

Creating In Silico Interactomes Creating In Silico Interactomes Tony Chiang Denise Scholtens Robert Gentleman Objectives Objectives Define interactomes Biological and in silico Describe the process of construction

In Silico Spectra Lab Slide 1 In Silico Spectra Lab Explore &amp; investigate Explore &amp;

Building and Validating Oncosimulators and Oncosimulator Based Hypermodels as Clinical Decision

Module 4: Creating Data Types and Tables Overview Creating Data Types Creating Tables

STRUCTURAL BIOLOGY AND RADIOBIOLOGY LAB I2BC - CEA Saclay PROTEIN INTERACTIONS AT THE HEART OF

Creating Dashboards of Direct and Creating Dashboards of Direct and Creating Dashboards of Direct

Creating a Community of Inquiry Creating a Community of Inquiry : Creating a Community of Inquiry

Module 3: Creating and Managing Databases Overview Creating Databases Creating

Fighting Infections and Antimicrobial Resistance Through GPU-Accelerated In Silico Models Radu

In silico blood genotyping from exome sequencing data Manuel Giollo 1,2 , Giovanni Minervini 1 ,

Donovan N. Chin &amp; R. Aldrin Denny Traditional Drug Discovery (insert graph) In Silico

The WISDOM initiative Wide In Silico Docking On Malaria Yannick Legr, CNRS/IN2P3 on behalf oh

NCI efforts to build a foundation for in-silico cancer research Ken Buetow NCICB/NCI/NIH/DHHS

Diversity in vivo, Multicore in silico : How to link metagenomics and community ecology Alain

Potential applicability and challenges of using in pp y g g vitro and in silico methodologies

Prediction in silico of major clearance pathway of drugs in vivo Kota Toshimoto Sugiyama

A knowledge-based approach to the in silico assessment of toxicity Carol Marchant

The Academic programme and the module Maritime Sustainability and MoS 2015

INDUSTRY GLOBAL COMPANY WESCO MANUFACTURER &amp; VETERANS WITH LOCAL PRESENCE BROADCAST &amp;

Smart Cities and Infrastructure Introduction of the Secretary General's Report Ms. Dong Wu

Overview Introduction to HIV therapy Arevir geno2pheno Patient consent and patient identifiers

aquaDam Dammed Real Time Water Quality Monitoring INNOVATIVE SOLUTIONS FOR WATER &amp;

The balance of excitation transfer and recombination processes in MoS 2 nanotubes and flakes Olga

Christopher M. Jones (Nasdaq: WWR) Chief Executive Officer 1 CAUTIONARY STATEMENT CAUTIONARY

I dentification of inflammatory gene modules based on variations of human endothelial cell

Sambuz

Useful Links

Newsletter

Mail Us

In Silico Spectra Lab Slide 1 In Silico Spectra Lab Explore & investigate Explore &

Donovan N. Chin & R. Aldrin Denny Traditional Drug Discovery (insert graph) In Silico

INDUSTRY GLOBAL COMPANY WESCO MANUFACTURER & VETERANS WITH LOCAL PRESENCE BROADCAST &

aquaDam Dammed Real Time Water Quality Monitoring INNOVATIVE SOLUTIONS FOR WATER &