comparative protein structure modeling of genes and
play

Comparative protein structure modeling of genes and genomes Marc A. - PowerPoint PPT Presentation

Comparative protein structure modeling of genes and genomes Marc A. Marti-Renom Department of Biopharmaceutical Sciences University of California, San Francisco Comparative Modeling Why protein structure prediction? Y 2003 Y 2005


  1. Comparative protein structure modeling of genes and genomes Marc A. Marti-Renom Department of Biopharmaceutical Sciences University of California, San Francisco

  2. Comparative Modeling…

  3. Why protein structure prediction? Y 2003 Y 2005 Sequences 1,000,000 millions Structures 18,000 50,000

  4. Why protein structure prediction? Theory Y 2003 Sequences 1,000,000 Structures 18,000 Experiment

  5. Why protein structure prediction? Theory Y 2003 Sequences 1,000,000 400,000 Structures 18,000 Experiment http://salilab.org/ modbase

  6. Principles of Protein Structure GFCHIKAYTRLIMVG… Desulfovibrio vulgaris Anacystis nidulans Condrus crispus Anabaena 7120 Folding Evolution Ab initio prediction Threading Comparative Modeling

  7. Comparative Modeling by Satisfaction of Spatial Restraints (M ODELLER) 3D GKITFYERGFQGHCYESDC-NLQP… SEQ GKITFYERG---RCYESDCPNLQP… A. Š ali & T. Blundell. J. Mol. Biol. 234 , 779, 1993. http://salilab.org/modeller J.P. Overington & A. Š ali. Prot. Sci . 3 , 1582, 1994. A. Fiser, R. Do & A. Š ali. Prot Sci . 9 , 1753, 2000.

  8. Comparative Modeling by Satisfaction of Spatial Restraints (M ODELLER) 3D GKITFYERGFQGHCYESDC-NLQP… SEQ GKITFYERG---RCYESDCPNLQP… 1. Extract spatial restraints A. Š ali & T. Blundell. J. Mol. Biol. 234 , 779, 1993. http://salilab.org/modeller J.P. Overington & A. Š ali. Prot. Sci . 3 , 1582, 1994. A. Fiser, R. Do & A. Š ali. Prot Sci . 9 , 1753, 2000.

  9. Comparative Modeling by Satisfaction of Spatial Restraints (M ODELLER) 3D GKITFYERGFQGHCYESDC-NLQP… SEQ GKITFYERG---RCYESDCPNLQP… 1. Extract spatial restraints 2. Satisfy spatial restraints F ( R ) = Π p i (f i / I) i A. Š ali & T. Blundell. J. Mol. Biol. 234 , 779, 1993. http://salilab.org/modeller J.P. Overington & A. Š ali. Prot. Sci . 3 , 1582, 1994. A. Fiser, R. Do & A. Š ali. Prot Sci . 9 , 1753, 2000.

  10. Steps in Comparative Protein Structure Modeling START TARGET ASILPKRLFGNCEQTSDEGLK IERTPLVPHISAQNVCLKIDD VPERLIPERASFQWMNDK A. Š ali, Curr. Opin. Biotech. 6, 437, 1995. R. Sánchez & A. Š ali, Curr. Opin. Str. Biol. 7, 206, 1997. M. A. Martí-Renom et al . Ann. Rev. Biophys. Biomolec. Struct ., 29, 291, 2000.

  11. Steps in Comparative Protein Structure Modeling START TARGET TEMPLATE ASILPKRLFGNCEQTSDEGLK Template Search IERTPLVPHISAQNVCLKIDD VPERLIPERASFQWMNDK A. Š ali, Curr. Opin. Biotech. 6, 437, 1995. R. Sánchez & A. Š ali, Curr. Opin. Str. Biol. 7, 206, 1997. M. A. Martí-Renom et al . Ann. Rev. Biophys. Biomolec. Struct ., 29, 291, 2000.

  12. Steps in Comparative Protein Structure Modeling START TARGET TEMPLATE ASILPKRLFGNCEQTSDEGLK Template Search IERTPLVPHISAQNVCLKIDD VPERLIPERASFQWMNDK Target – Template ASILPKRLFGNCEQTSDEGLKIERTPLVPHISAQNVCLKIDDVPERLIPE MSVIPKRLYGNCEQTSEEAIRIEDSPIV---TADLVCLKIDEIPERLVGE Alignment A. Š ali, Curr. Opin. Biotech. 6, 437, 1995. R. Sánchez & A. Š ali, Curr. Opin. Str. Biol. 7, 206, 1997. M. A. Martí-Renom et al . Ann. Rev. Biophys. Biomolec. Struct ., 29, 291, 2000.

  13. Steps in Comparative Protein Structure Modeling START TARGET TEMPLATE ASILPKRLFGNCEQTSDEGLK Template Search IERTPLVPHISAQNVCLKIDD VPERLIPERASFQWMNDK Target – Template ASILPKRLFGNCEQTSDEGLKIERTPLVPHISAQNVCLKIDDVPERLIPE MSVIPKRLYGNCEQTSEEAIRIEDSPIV---TADLVCLKIDEIPERLVGE Alignment Model Building A. Š ali, Curr. Opin. Biotech. 6, 437, 1995. R. Sánchez & A. Š ali, Curr. Opin. Str. Biol. 7, 206, 1997. M. A. Martí-Renom et al . Ann. Rev. Biophys. Biomolec. Struct ., 29, 291, 2000.

  14. Steps in Comparative Protein Structure Modeling START TARGET TEMPLATE ASILPKRLFGNCEQTSDEGLK Template Search IERTPLVPHISAQNVCLKIDD VPERLIPERASFQWMNDK Target – Template ASILPKRLFGNCEQTSDEGLKIERTPLVPHISAQNVCLKIDDVPERLIPE MSVIPKRLYGNCEQTSEEAIRIEDSPIV---TADLVCLKIDEIPERLVGE Alignment Model Building Model Evaluation OK? Yes END A. Š ali, Curr. Opin. Biotech. 6, 437, 1995. R. Sánchez & A. Š ali, Curr. Opin. Str. Biol. 7, 206, 1997. M. A. Martí-Renom et al . Ann. Rev. Biophys. Biomolec. Struct ., 29, 291, 2000.

  15. Steps in Comparative Protein Structure Modeling START TARGET TEMPLATE ASILPKRLFGNCEQTSDEGLK Template Search IERTPLVPHISAQNVCLKIDD VPERLIPERASFQWMNDK Target – Template ASILPKRLFGNCEQTSDEGLKIERTPLVPHISAQNVCLKIDDVPERLIPE MSVIPKRLYGNCEQTSEEAIRIEDSPIV---TADLVCLKIDEIPERLVGE Alignment Model Building Model Evaluation No OK? Yes END A. Š ali, Curr. Opin. Biotech. 6, 437, 1995. R. Sánchez & A. Š ali, Curr. Opin. Str. Biol. 7, 206, 1997. M. A. Martí-Renom et al . Ann. Rev. Biophys. Biomolec. Struct ., 29, 291, 2000.

  16. Model Accuracy as a Function of Target-Template Sequence Identity

  17. Typical Errors in Comparative Models Incorrect template Misalignment MODEL X-RAY TEMPLATE Region without a Distortion in correctly Sidechain packing template aligned regions

  18. Model Accuracy Marti-Renom et al. Annu.Rev.Biophys.Biomol.Struct. 29 , 291-325, 2000. HIGH ACCURACY MEDIUM ACCURACY LOW ACCURACY NM23 CRABP EDN Seq id 77% Seq id 41% Seq id 33% X-RAY

  19. Model Accuracy Marti-Renom et al. Annu.Rev.Biophys.Biomol.Struct. 29 , 291-325, 2000. HIGH ACCURACY MEDIUM ACCURACY LOW ACCURACY NM23 CRABP EDN Seq id 77% Seq id 41% Seq id 33% C α equiv 147/148 RMSD 0.41Å Sidechains Core backbone Loops X-RAY / MODEL

  20. Model Accuracy Marti-Renom et al. Annu.Rev.Biophys.Biomol.Struct. 29 , 291-325, 2000. HIGH ACCURACY MEDIUM ACCURACY LOW ACCURACY NM23 CRABP EDN Seq id 77% Seq id 41% Seq id 33% C α equiv 147/148 C α equiv 122/137 RMSD 0.41Å RMSD 1.34Å Sidechains Sidechains Core backbone Core backbone Loops Loops Alignment X-RAY / MODEL

  21. Model Accuracy Marti-Renom et al. Annu.Rev.Biophys.Biomol.Struct. 29 , 291-325, 2000. HIGH ACCURACY MEDIUM ACCURACY LOW ACCURACY NM23 CRABP EDN Seq id 77% Seq id 41% Seq id 33% C α equiv 147/148 C α equiv 90/134 C α equiv 122/137 RMSD 0.41Å RMSD 1.17Å RMSD 1.34Å Sidechains Sidechains Sidechains Core backbone Core backbone Core backbone Loops Loops Loops Alignment Alignment Fold assignment X-RAY / MODEL

  22. Applications of Comparative Models D. Baker & A. Sali. Science 294, 93, 2001 . A. Sali & J. Kuriyan. TIBS 22 , M20, 1999.

  23. genes…

  24. Do mast cell proteases bind proteoglycans? Where? When? Predicting features of a model that are not present in the template 1. mMCPs bind negatively charged proteoglycans through electrostatic interactions? 2. Comparative models used to find clusters of positively charged surface residues. 3. Tested by site-directed mutagenesis. . Huang et al . J. Clin. Immunol . 18 ,169,1998. Matsumoto et al . J.Biol.Chem . 270 ,19524,1995. Š ali et al . J. Biol. Chem . 268 , 9023, 1993. Native mMCP-7 at p H=5 (His + ) Native mMCP-7 at p H=7 (His 0 )

  25. What is the physiological ligand of Brain Lipid-Binding Protein? Predicting features of a model that are not present in the template 1. BLBP binds fatty BLBP/oleic acid BLBP/Docosahexaenoic acid acids. Cavity is not filled Cavity is filled Ligand binding 2. Build a 3D model. cavity 3. Find the fatty acid that fits most snuggly into the ligand binding cavity. L. Xu, R. Sánchez, A. Š ali, N. Heintz, J. Biol. Chem. 271 , 24711, 1996.

  26. Some Models Can Be Used in Docking to Density Maps (Yeast Ribosomal 40S subunit) Docking of comparative models into the cryo-EM map. Spahn et al. 2001 Cell 107 :373-386 Small 30S subunit from Thermus thermophilus Large 50S subunit from Haloarcula marismortui

  27. 60S Subunit  43 proteins could be modeled on 20-56% seq.id. to a known structure.  The coverage of the models ranges from 34-99%.  Models were manually docked into the 15Å cryo-electron density map.  The solid orange in the 60S subunit and the solid green in the 40S subunit correspond to proteins without known bacterial homologs. 40S Subunit

  28. Structural analysis of missense mutations in human BRCA1 BRCT domains Nebojsa Mirkovic, Marc A. Marti-Renom, Andrej Sali Alvaro N.A. Monteiro (Sprang Center, Cornell U.)

  29. Human BRCA1 and its two BRCT domains RING NLS BRCT Globular regions 200 aa Nonglobular regions BRCA1 BRCT repeats, 1jnx Williams, Green, Glover. Nat.Struct.Biol. 8, 838, 2001

  30. Missense Mutations in BRCT Domains by Function cancer not cancer ? associated associated F1761S L1705PS M1652K C1697R M1775E 1715NS L1657P R1699W M1775K 1722FF E1660G no transcription A1708E L1780P 1734LG H1686Q activation S1715R I1807S 1738EG R1699Q P1749R V1833E 1743RA K1702E M1775R A1843T 1752PF Y1703H 1761I F1704S V1665M transcription D1692N M1652I G1706A activation A1669S D1733G M1775V P1806A M1652T W1718S R1751P C1787S A1823T V1653M T1720A R1751Q G1788 V1833M L1664P W1730S R1758G D W1837R T1685A F1734S L1764P G1788V W1837G ? T1685I E1735K I1766S G1803A S1841N M1689R V1736A P1771L V1804D A1843P D1692Y G1738R T1773S V1808A T1852S F1695L D1739E P1776S V1809A P1856T V1696L D1739G D1778N V1809F P1859R R1699L D1739Y D1778G V1810G G1706E V1741G D1778H Q1811R W1718C H1746N M1783T P1812S N1819S 9/18/02

Recommend


More recommend