predicting protein folding paths
play

Predicting Protein Folding Paths S.Will, 18.417, Fall 2011 Protein - PowerPoint PPT Presentation

Predicting Protein Folding Paths S.Will, 18.417, Fall 2011 Protein Folding by Robotics S.Will, 18.417, Fall 2011 Probabilistic Roadmap Planning (PRM): Thomas, Song, Amato. Protein folding by motion planning . Phys. Biol., 2005 Aims Find good


  1. Predicting Protein Folding Paths S.Will, 18.417, Fall 2011

  2. Protein Folding by Robotics S.Will, 18.417, Fall 2011 Probabilistic Roadmap Planning (PRM): Thomas, Song, Amato. Protein folding by motion planning . Phys. Biol., 2005

  3. Aims Find good quality folding paths (into given native structure) no structure prediction! Predict formation orders (of secondary structure) S.Will, 18.417, Fall 2011

  4. Motion planning Motion planning Probabilistic roadmap planing Sampling of configuration space Q S.Will, 18.417, Fall 2011 Connect nearest configurations by (simple) local planner Apply graph algorithms to “roadmap”: Find shortest path

  5. Motion planning Motion planning Probabilistic roadmap planing Sampling of configuration space Q S.Will, 18.417, Fall 2011 Connect nearest configurations by (simple) local planner Apply graph algorithms to “roadmap”: Find shortest path

  6. Motion planning Motion planning Probabilistic roadmap planing Sampling of configuration space Q S.Will, 18.417, Fall 2011 Connect nearest configurations by (simple) local planner Apply graph algorithms to “roadmap”: Find shortest path

  7. Motion planning Motion planning Probabilistic roadmap planing Sampling of configuration space Q S.Will, 18.417, Fall 2011 Connect nearest configurations by (simple) local planner Apply graph algorithms to “roadmap”: Find shortest path

  8. Motion planning Motion planning Probabilistic roadmap planing Sampling of configuration space Q S.Will, 18.417, Fall 2011 Connect nearest configurations by (simple) local planner Apply graph algorithms to “roadmap”: Find shortest path

  9. More on PRM for motion planning tree-like robots ( articulated robots ) Articulated Joint configuration = vector of angles configuration space Q = { q | q ∈ S n } S — set of angles S.Will, 18.417, Fall 2011 n — number of angles = degrees of freedom (dof)

  10. More on PRM for motion planning tree-like robots ( articulated robots ) configuration = vector of angles configuration space Q = { q | q ∈ S n } S — set of angles S.Will, 18.417, Fall 2011 n — number of angles = degrees of freedom (dof)

  11. Proteins are Robots (aren’t they?) Obvious similarity ;-) == ? Our model S.Will, 18.417, Fall 2011 Protein == vector of phi and psi angles (treelike robot with 2n dof) possible models range from only backbone up to full atom

  12. Proteins are Robots (aren’t they?) Obvious similarity ;-) == ? Our model S.Will, 18.417, Fall 2011 Protein == vector of phi and psi angles (treelike robot with 2n dof) possible models range from only backbone up to full atom

  13. Proteins are Robots (aren’t they?) Obvious similarity ;-) == ? Our model O O N C N C C C N C C O S.Will, 18.417, Fall 2011 Protein == vector of phi and psi angles (treelike robot with 2n dof) possible models range from only backbone up to full atom

  14. Proteins are Robots (aren’t they?) Obvious similarity ;-) == ? Our model O O N C N C phi C psi C N C C O S.Will, 18.417, Fall 2011 Protein == vector of phi and psi angles (treelike robot with 2n dof) possible models range from only backbone up to full atom

  15. Differences to usual PRM no external obstacles, but self-avoidingness torsion angles quality of paths low energy intermediate states kinetically prefered paths highly probable paths S.Will, 18.417, Fall 2011

  16. Energy Function method can use any potential Our coarse potential [ Levitt. J.Mol.Biol., 1983. ] each sidechain by only one “atom” (zero dof) K d { [( d i − d 0 ) 2 + d 2 1 � 2 − d c } + E hp U tot = c ] restraints first term favors known secondary structure through main chain hydrogen bonds and disulphide bonds second term hydrophobic effect Van der Waals interaction modeled by step function S.Will, 18.417, Fall 2011 All-atom potential: EEF1 [Lazaridis, Karplus. Proteins, 1999. ]

  17. PRM method for Proteins Sampling Connecting Extracting S.Will, 18.417, Fall 2011

  18. Sampling — Node Generation Sampling Connecting Extracting S.Will, 18.417, Fall 2011

  19. Node Generation No uniform sampling configuration space too large ⇒ need biased sampling strategy Gaussian sampling centered around native conformation with different STDs 5 ◦ , 10 ◦ , . . . , 160 ◦ ensure representants for different numbers of native contacts Selection by energy  1 if E ( q ) < E min   E max − E ( q ) S.Will, 18.417, Fall 2011 P (accept q ) = if E min ≤ E ( q ) ≤ E max E max − E min  0 if E ( q ) > E max 

  20. More on Node Generation Visualization of Sampling Strategy Distribution S.Will, 18.417, Fall 2011 Psi and Phi angles RMSD vs. Energy

  21. Node Connection Sampling Connecting Extracting S.Will, 18.417, Fall 2011

  22. Connecting Nodes by Local Planner connect configurations in close distance generate N intermediary nodes by local planner assign weights to edges S.Will, 18.417, Fall 2011 � e − ∆ E N if ∆ E > 0 kT � P i = Weight = − log ( P i ) 1 if ∆ E ≤ 0 i =0

  23. Connecting Nodes by Local Planner connect configurations in close distance generate N intermediary nodes by local planner assign weights to edges S.Will, 18.417, Fall 2011 � e − ∆ E N if ∆ E > 0 kT � P i = Weight = − log ( P i ) 1 if ∆ E ≤ 0 i =0

  24. Connecting Nodes by Local Planner connect configurations in close distance generate N intermediary nodes by local planner assign weights to edges S.Will, 18.417, Fall 2011 � e − ∆ E N if ∆ E > 0 kT � P i = Weight = − log ( P i ) 1 if ∆ E ≤ 0 i =0

  25. Connecting Nodes by Local Planner connect configurations in close distance generate N intermediary nodes by local planner assign weights to edges S.Will, 18.417, Fall 2011 � e − ∆ E N if ∆ E > 0 kT � P i = Weight = − log ( P i ) 1 if ∆ E ≤ 0 i =0

  26. Connecting Nodes by Local Planner connect configurations in close distance generate N intermediary nodes by local planner assign weights to edges S.Will, 18.417, Fall 2011 � e − ∆ E N if ∆ E > 0 kT � P i = Weight = − log ( P i ) 1 if ∆ E ≤ 0 i =0

  27. Connecting Nodes by Local Planner connect configurations in close distance generate N intermediary nodes by local planner P5 P4 P2 P3 P1 assign weights to edges S.Will, 18.417, Fall 2011 � e − ∆ E N if ∆ E > 0 kT � P i = Weight = − log ( P i ) 1 if ∆ E ≤ 0 i =0

  28. Connecting Nodes by Local Planner connect configurations in close distance generate N intermediary nodes by local planner Weight assign weights to edges S.Will, 18.417, Fall 2011 � e − ∆ E N if ∆ E > 0 kT � P i = Weight = − log ( P i ) 1 if ∆ E ≤ 0 i =0

  29. Connecting Nodes by Local Planner connect configurations in close distance generate N intermediary nodes by local planner assign weights to edges S.Will, 18.417, Fall 2011 � e − ∆ E N if ∆ E > 0 kT � P i = Weight = − log ( P i ) 1 if ∆ E ≤ 0 i =0

  30. Extracting Paths Sampling Connecting Extracting S.Will, 18.417, Fall 2011

  31. Extracting Paths Shortest Path extract one shortest path from some starting conformation, one path at a time Single Source Shortest Paths (SSSP) extract shortest paths from all starting conformation compute paths simultaneously generate tree of shortest paths (SSSP tree) S.Will, 18.417, Fall 2011

  32. Big Picture Sampling Connecting Extracting S.Will, 18.417, Fall 2011

  33. Studied Proteins Overview of studied proteins, roadmap size, and construction times S.Will, 18.417, Fall 2011

  34. Formation orders formation order of secondary structure for verifying method formation orders can be determined experimentally [ Li, Woodward. Protein Science, 1999. ] Pulse labeling Out-exchange prediction of formation orders single paths averaging over multiple paths (SSSP-tree) S.Will, 18.417, Fall 2011

  35. Timed Contact Maps S.Will, 18.417, Fall 2011

  36. Formation Order no (reported) contradictions between prediction and validation S.Will, 18.417, Fall 2011 different kind of information from experiment and prediction

  37. The Proteins G and L Studied in more detail good test case structurally similar: 1 α + 4 β fold differently S.Will, 18.417, Fall 2011 Protein G: β -turn 2 forms first Protein L: β -turn 1 forms first

  38. Comparison of Analysis Techniques β -Turn Formation S.Will, 18.417, Fall 2011

  39. Conclusion PRM can be applied to “realistic” protein models Introduced method makes verifiable prediction Coarse potential is sufficient Predictions in good accordance to experimental data S.Will, 18.417, Fall 2011

Recommend


More recommend