e ffi cient use of semidefinite programming for the
play

E ffi cient use of semidefinite programming for the selection of - PowerPoint PPT Presentation

E ffi cient use of semidefinite programming for the selection of rotamers in protein conformation Forbes Burkowski, Yuen-Lam Cheung & Henry Wolkowicz Retrospective Workshop on Discrete Geometry, Optimization and Symmetry November 2013


  1. E ffi cient use of semidefinite programming for the selection of rotamers in protein conformation Forbes Burkowski, Yuen-Lam Cheung & Henry Wolkowicz Retrospective Workshop on Discrete Geometry, Optimization and Symmetry November 2013 Y.-L. Cheung Side chain positioning 2013 1 / 25

  2. Outline ◦ Primer on protein conformation ◦ Side chain positioning : IP formulation ◦ SDP relaxation and minimal face ◦ Implementation: a cutting plane technique ◦ Quality measurement for integral solutions, and numerics Y.-L. Cheung Side chain positioning 2013 2 / 25

  3. Motivation: a subproblem from protein conformation Protein conformation: a primer Basics about proteins An amino acid has five components: H ◦ alpha carbon H H O ◦ hydrogen atom ◦ carboxyl group N C C ◦ amino group H O R ◦ side chain A protein is a polymer formed from a chain of amino acids (with di ff erent side chains). Y.-L. Cheung Side chain positioning 2013 3 / 25

  4. Motivation: a subproblem from protein conformation Protein conformation: a primer Basics about proteins An amino acid has five components: H ◦ alpha carbon H H O ◦ hydrogen atom ◦ carboxyl group N C C ◦ amino group H O R ◦ side chain A protein is a polymer formed from a chain of amino acids (with di ff erent side chains). Y.-L. Cheung Side chain positioning 2013 3 / 25

  5. Motivation: a subproblem from protein conformation Protein conformation: a primer Basics about proteins An amino acid has five components: H ◦ alpha carbon H H O ◦ hydrogen atom ◦ carboxyl group N C C ◦ amino group H O R ◦ side chain A protein is a polymer formed from a chain of amino acids (with di ff erent side chains). Y.-L. Cheung Side chain positioning 2013 3 / 25

  6. Motivation: a subproblem from protein conformation Protein conformation: a primer Basics about proteins An amino acid has five components: H ◦ alpha carbon H H O ◦ hydrogen atom ◦ carboxyl group N C C ◦ amino group H O R ◦ side chain A protein is a polymer formed from a chain of amino acids (with di ff erent side chains). Y.-L. Cheung Side chain positioning 2013 3 / 25

  7. Motivation: a subproblem from protein conformation Protein conformation: a primer Forming a protein through condensation A protein is a polymer formed from a chain of amino acids, bonded via a condensation process: H H H H H O H O N C C + N C C H O H O R 1 R 2 Y.-L. Cheung Side chain positioning 2013 4 / 25

  8. Motivation: a subproblem from protein conformation Protein conformation: a primer Forming a protein through condensation A protein is a polymer formed from a chain of amino acids, bonded via a condensation process: H H H H H O H O N C C + N C C H O H O R 1 R 2 Y.-L. Cheung Side chain positioning 2013 4 / 25

  9. Motivation: a subproblem from protein conformation Protein conformation: a primer Forming a protein through condensation A protein is a polymer formed from a chain of amino acids, bonded via a condensation process: H H H H H O H O N C C + N C C H O H O R 1 R 2 H H H O H H N C C −→ N C C O + H 2 O R 2 H O R 1 Y.-L. Cheung Side chain positioning 2013 4 / 25

  10. Motivation: a subproblem from protein conformation Protein conformation: a primer Backbone and the side chain positioning H H ... H H N C C N C C O R 2 ... O R 1 Protein conformation problem : Given a 2D chain of residues of a protein, find the 3D positions of all the atoms so that ◦ the bond lengths and bond angles are respected, and ◦ the total energy of the resultant protein conformation is at global minimum. Y.-L. Cheung Side chain positioning 2013 5 / 25

  11. Motivation: a subproblem from protein conformation Protein conformation: a primer Backbone and the side chain positioning H H ... H H N C C N C C O R 2 ... O R 1 Side chain positioning problem , a subproblem for protein conformation: ◦ Suppose we know the positions of the backbone atoms. Find the 3D positions of the atoms in the side chains so that the total energy of the resultant conformation is at global min . ◦ Further assumption: each of the side chains can take one of finitely many possible positions, a.k.a. rotamers . Y.-L. Cheung Side chain positioning 2013 5 / 25

  12. Side chain positioning: IP formulation Side chain positioning problem Setup Given a weighted complete p -partite graph with vertex set p � k − 1 � k � � � V = V k , where V 1 = 1 : m 1 , V k = 1 + m l : m l , ∀ k = 2, . . . , p , k = 1 l = 1 l = 1 (and m ∈ Z p is a positive vector), with p � edge weight E ij = E ji , ∀ { i , j } ∈ ( 1 : n 0 ) × ( 1 : n 0 ) , where n 0 = m j . k = 1 Y.-L. Cheung Side chain positioning 2013 6 / 25

  13. Side chain positioning: IP formulation Side chain positioning Statement of the sidechain positioning problem Pick exactly one vertex from each partition V k ( ∀ k = 1, 2, . . . , p ) s.t. the total edge weight of the induced subgraph is minimized. Y.-L. Cheung Side chain positioning 2013 7 / 25

  14. Side chain positioning: IP formulation Side chain positioning Statement of the sidechain positioning problem Pick exactly one vertex from each partition V k ( ∀ k = 1, 2, . . . , p ) s.t. the total edge weight of the induced subgraph is minimized. Y.-L. Cheung Side chain positioning 2013 7 / 25

  15. Side chain positioning: IP formulation Side chain positioning Statement of the sidechain positioning problem Pick exactly one vertex from each partition V k ( ∀ k = 1, 2, . . . , p ) s.t. the total edge weight of the induced subgraph is minimized. Y.-L. Cheung Side chain positioning 2013 7 / 25

  16. Side chain positioning: IP formulation Side chain positioning problem Statement of the sidechain positioning problem Pick exactly one vertex from each partition V k ( ∀ k = 1, 2, . . . , p ) s.t. the total edge weight of the induced subgraph is minimized. Complexity of sidechain positioning problem ◦ NP-hard [Akutsu, 1997; Pierce and Winfree, 2002] ◦ Special cases of the sidechain positioning problem: • MAX 3-SAT [Chazelle et al. , 2004] = ⇒ side chain positioning problem is “inapproximable” • maximum k -cut problem Y.-L. Cheung Side chain positioning 2013 8 / 25

  17. Side chain positioning: IP formulation Side chain positioning problem: IP formulation Statement of the sidechain positioning problem Pick exactly one vertex from each partition V k ( ∀ k = 1, 2, . . . , p ) s.t. the total edge weight of the induced subgraph is minimized. Integer quadratic programming formulation x ⊤ Ex v SCP = min x v ( 1 ) ; v ( 2 ) ; · · · ; v ( p ) � ∈ { 0, 1 } n 0 , � s.t. x = e ⊤ v ( k ) = 1, ∀ k = 1, . . . , p . ¯ x ∈ R n 0 is an incident vector for the choices of vertices in each partition. Y.-L. Cheung Side chain positioning 2013 9 / 25

  18. Side chain positioning: IP formulation Side chain positioning problem: IP formulation Statement of the sidechain positioning problem Pick exactly one vertex from each partition V k ( ∀ k = 1, 2, . . . , p ) s.t. the total edge weight of the induced subgraph is minimized. Integer quadratic programming formulation x ⊤ Ex v SCP = min x e ∈ R p , s.t. Ax = ¯ x ∈ { 0, 1 } n 0 , where m 1 m 2 m p e ⊤  ¯ 0 · · · 0  1 e ⊤ 0 ¯ · · · 0 1    ∈ R p × n 0 . A =  . . .  ... . . .   . . .  e ⊤ 0 0 · · · ¯ 1 Y.-L. Cheung Side chain positioning 2013 9 / 25

  19. Side chain positioning: IP formulation Side chain positioning problem: IP formulation x ⊤ Ex v SCP = min x e ∈ R p , s.t. Ax = ¯ (SCP) x ∈ { 0, 1 } n 0 . Valid constraints on x and X := xx ⊤ ◦ nonegativity, i.e., X � 0; ◦ all the diagonal blocks of X are diagonal, i.e., ( A ⊤ A − I ) ◦ X = 0; ◦ the “arrow” constraint, i.e., diag ( X ) = x ; e � 2 ◦ � Ax − ¯ 2 = 0, i.e., � A ⊤ A , X � − 2¯ e ⊤ x + p = 0. Indeed any x ∈ R n 0 together with X = xx ⊤ satisfying the 3rd-4th constraints is feasible for (SCP). Y.-L. Cheung Side chain positioning 2013 10 / 25

  20. Side chain positioning: IP formulation Side chain positioning problem: IP formulation Valid constraints on x and X := xx ⊤ ◦ nonegativity, i.e., X � 0; ◦ all the diagonal blocks of X are diagonal, i.e., ( A ⊤ A − I ) ◦ X = 0; ◦ the “arrow” constraint, i.e., diag ( X ) = x ; e � 2 2 = 0, i.e., � A ⊤ A , X � − 2¯ e ⊤ x + p = 0. ◦ � Ax − ¯ Indeed any x ∈ R n 0 together with X = xx ⊤ satisfying the 3rd-4th constraints is feasible for (SCP). Equivalent formulation of (SCP) v SCP = � E , X � min x , X ( A ⊤ A − I ) ◦ X = 0, s.t. � A ⊤ A , X � − 2¯ e ⊤ x + p = 0, diag ( X ) = x , X = xx ⊤ . Y.-L. Cheung Side chain positioning 2013 11 / 25

  21. Side chain positioning: SDP relaxation and facial reduction Side chain positioning problem: SDP relaxation SDP relaxation of (SCP) v SCP � v SCP ( SDP ) := min � E , X � x , X s.t. diag ( X ) = x , � A ⊤ A , X � − 2¯ e ⊤ x + p = 0, ( A ⊤ A − I ) ◦ X = 0, X = xx ⊤ . Y.-L. Cheung Side chain positioning 2013 12 / 25

  22. Side chain positioning: SDP relaxation and facial reduction Side chain positioning problem: SDP relaxation SDP relaxation of (SCP) v SCP � v SCP ( SDP ) := min � E , X � x , X s.t. diag ( X ) = x , � A ⊤ A , X � − 2¯ e ⊤ x + p = 0, ( A ⊤ A − I ) ◦ X = 0, X � xx ⊤ ( i.e., X − xx ⊤ ∈ S n + ) . Y.-L. Cheung Side chain positioning 2013 12 / 25

Recommend


More recommend