toward comprehensive whole cell models
play

Toward comprehensive whole-cell models Genomic and biochemical data - PowerPoint PPT Presentation

Toward comprehensive whole-cell models Genomic and biochemical data Pathway submodels Rule-based modeling Multi-algorithmic modeling KarrLab.org Jonathan Karr July 6, 2017 karr@mssm.edu Outline Introduction to whole-cell (WC) modeling


  1. Toward comprehensive whole-cell models Genomic and biochemical data Pathway submodels Rule-based modeling Multi-algorithmic modeling KarrLab.org Jonathan Karr July 6, 2017 karr@mssm.edu

  2. Outline Introduction to whole-cell (WC) modeling • What is a WC model? • Motivation • Challenges • Feasibility Methodology • Data aggregation • Data organization • Hybrid simulation New tools to accelerate WC modeling • Data aggregation • Model representation • Parallel simulation

  3. Outline Introduction to whole-cell (WC) modeling • What is a WC model? • Motivation • Challenges • Feasibility Methodology • Data aggregation • Data organization • Hybrid simulation New tools to accelerate WC modeling • Data aggregation • Model representation • Parallel simulation

  4. Features of whole-cell (WC) models Whole organism Dynamic Whole genome Stochastic including each gene Accurate Whole cell cycle Mechanistic Species-specific AGTC Karr et al., 2015

  5. Outline Introduction to whole-cell (WC) modeling • What is a WC model? • Motivation • Challenges • Feasibility Methodology • Data aggregation • Data organization • Hybrid simulation New tools to accelerate WC modeling • Data aggregation • Model representation • Parallel simulation

  6. Genome design requires WC models Biosensors Biofactories Tissue engineering

  7. Example: drug biosynthesis

  8. Example: drug biosynthesis

  9. Example: drug biosynthesis

  10. Example: drug biosynthesis

  11. Example: drug biosynthesis

  12. Example: drug biosynthesis

  13. Example: drug biosynthesis

  14. Personalized medicine requires WC models

  15. Outline Introduction to whole-cell (WC) modeling • What is a WC model? • Motivation • Challenges • Feasibility Methodology • Data aggregation • Data organization • Hybrid simulation New tools to accelerate WC modeling • Data aggregation • Model representation • Parallel simulation

  16. WC models are a grand challenge

  17. Challenge: multiple time and length scales Growth Length Replication Transcription Metabolism Time

  18. Challenge: heterogeneous data Protein expression Mass-spec, Western blot Transcription Single-cell variation RNA-seq Microscopy

  19. Challenge: sparse data

  20. Challenge: heterogeneous granularity Transcriptional regulatory Signaling Metabolic

  21. Outline Introduction to whole-cell (WC) modeling • What is a WC model? • Motivation • Challenges • Feasibility Methodology • Data aggregation • Data organization • Hybrid simulation New tools to accelerate WC modeling • Data aggregation • Model representation • Parallel simulation

  22. WC modeling is now feasible Genomic and biochemical data Pathway submodels Rule-based modeling Multi-algorithmic modeling

  23. Extensive molecular data is available

  24. Numerous predictors are available • miRNA targets: TargetScan • Operons: OperonPredictor • Protein half-lives: N-end rule • Protein localization: PSORT • Signal sequences: SignalP • Transcription start site: Promoter

  25. Numerous databases are available

  26. Model design tools are available MetaFlux

  27. Model languages are available

  28. Numerous pathway models are available

  29. Numerous simulators are available Uptake FBA Composition Metabolism FBA Composition Transcription Stochastic binding Gene expression Translation Stochastic binding Gene expression Replication Chemical kinetics DNA sequence

  30. Testing tools are available PRISM

  31. Numerous other tools • Automated model construction • Model refinement • Parallel simulation • Calibration • Analysis and visualization • …

  32. Outline Introduction to whole-cell (WC) modeling • What is a WC model? • Motivation • Challenges • Feasibility Methodology • Data aggregation • Data organization • Hybrid simulation New tools to accelerate WC modeling • Data aggregation • Model representation • Parallel simulation

  33. Pathway modeling workflow 1. Choose a system to model 2. Determine the scope and granularity of the model 3. Determine the mathematical representation of the model 4. Reconstruct the species, reactions, rate laws, and rate parameters from the literature 5. Debug and calibrate the model by comparison to data 6. Test the model by comparison to independent data

  34. Predictive modeling methodologies Boolean WC model Bolouri, 2000’s FBA Scope Palsson, 1990’s ODE Shuler, 1970’s PDE Gillespie Luthey-Schulten, 2011 Detail

  35. Scaling pathway modeling to whole-cells • Aggregate more data – Accelerate data aggregation through automation – Organize input data using pathway/genome databases • Build models collaboratively using web-based tools – Define the semantic meaning of every model component – Track every assumption and data source • Describe models clearly – Explicitly describe the data used to build models – Describe models in terms of rules • Describe and simulate hybrid models

  36. Scaling pathway modeling to whole-cells Genomics, bioinformatics ↔ Mechanistic modeling Pathway/genome databases ↔ Model design tools Polymers, sequences ↔ Rule-based modeling Stochastic modeling ↔ Steady-state modeling (FBA) Numerical simulation ↔ Big data analytics Model design tools ↔ Collaboration tools

  37. WC modeling workflow

  38. Aggregate data Genome Epigenome Transcriptome DNA-seq Meth-seq RNA-seq Proteome Metabolome Mass-spectrometry Mass-spectrometry Fraser et al., 1995; Kühner et al., 2009; Lluch-Senar et al., 2013; Maier et al., 2013; Yus et al. 2012

  39. Organize input data Karr et al., 2013

  40. Design submodels 1. Update RNA polymerase states 2. Calculate promoter affinities Fur HcrA GntR LuxR Spx Free Promoter Active Bound glpF dnaJ dnaK gntR trxB polC Bound 3. Bind RNA polymerase 4. Elongate and terminate transcripts Sequence AUGAUCCGUCUCUAAUGUCUAC Transcript UTCAACGUGAGGUAAUAAAGUC UCCACGAUGCUACUGUAUC GCCUCAUACUGCGGAU UUACGUAUCAGUGAUCAGUACU

  41. Design pathway submodels Uptake FBA Composition Metabolism FBA Composition Transcription Stochastic events Gene expression Translation Stochastic events Gene expression Replication Chemical kinetics DNA sequence

  42. Combine submodels Mass, shape Uptake FBA Composition Metabolite, RNA, protein counts Metabolism FBA Transcript, polypeptide Composition sequences Transcription Stochastic events Gene expression DNA polymerization, proteins, modifications Translation Stochastic events Gene expression FtsZ ring Replication Chemical kinetics DNA sequence Mammalian host States Submodels

  43. Concurrently integrate submodels Uptake Uptake Uptake Metabolism Metabolism Metabolism Cell states Cell states Cell states Transcription Transcription Transcription Translation Translation Translation Replication Replication Replication 1 s

  44. Calibrate model 1.Estimate individual parameters 2.Generate reduced models of individual pathways and to calibrate individual pathways 3.Refine joint parameter values using full models

  45. Verify model against known biology  Matches training data  Matches theory  Cell mass, volume  Mass conservation  Biomass composition  Central dogma  RNA, protein expression, half-lives  Cell theory  Superhelicity  Evolution  Matches published data  No obvious errors  Metabolite concentrations  Plot model predictions  DNA-bound protein density  Manually inspect data  Gene essentiality  Compare to known biology

  46. Verify model against known biology  Matches training data  Matches theory  Cell mass, volume  Mass conservation  Biomass composition  Central dogma  RNA, protein expression, half-lives  Cell theory  Superhelicity  Evolution  Matches published data  No obvious errors  Metabolite concentrations  Plot model predictions  DNA-bound protein density  Manually inspect data  Gene essentiality  Compare to known biology

  47. Verify model against known biology  Matches training data  Matches theory  Cell mass, volume  Mass conservation  Biomass composition  Central dogma  RNA, protein expression, half-lives  Cell theory  Superhelicity  Evolution  Matches published data  No obvious errors  Metabolite concentrations  Plot model predictions  DNA-bound protein density  Manually inspect data  Gene essentiality  Compare to known biology

  48. Verify model against known biology  Matches training data  Matches theory  Cell mass, volume  Mass conservation  Biomass composition  Central dogma  RNA, protein expression, half-lives  Cell theory  Superhelicity  Evolution  Matches published data  No obvious errors  Metabolite concentrations  Plot model predictions  DNA-bound protein density  Manually inspect data  Gene essentiality  Compare to known biology

Recommend


More recommend