the indispensibility of neutrality in molecular evolution
play

The Indispensibility of Neutrality in Molecular Evolution Peter - PowerPoint PPT Presentation

The Indispensibility of Neutrality in Molecular Evolution Peter Schuster Institut fr Theoretische Chemie und Molekulare Strukturbiologie der Universitt Wien Max-Planck-Institut fr Physik Komplexer Systeme Dresden, 05.07.2004 Web-Page for


  1. The Indispensibility of Neutrality in Molecular Evolution Peter Schuster Institut für Theoretische Chemie und Molekulare Strukturbiologie der Universität Wien Max-Planck-Institut für Physik Komplexer Systeme Dresden, 05.07.2004

  2. Web-Page for further information: http://www.tbi.univie.ac.at/~pks

  3. f 1 (A) + I 1 I 1 I 1 + f 2 (A) + I 2 I 2 I 2 Φ = ( Φ ) + dx / dt = x - x f x f i - i i i i i Φ = Σ ; Σ = 1 ; i,j =1,2,...,n f x x j j j j j i � i =1,2,...,n ; [I ] = x 0 ; i f i I i [A] = a = constant (A) + (A) + I i I i + + fm = max { ; j=1,2,...,n} fj � � � xm(t) 1 for t f m I m I m (A) + (A) + I m + f n I n (A) + (A) + I n I n + + Reproduction of organisms or replication of molecules as the basis of selection

  4. James D. Watson and Francis H.C. Crick Nobel prize 1962 1953 – 2003 fifty years double helix Base pairs: A = T and G � C Stacking of base pairs in nucleic acid double helices (B-DNA)

  5. 5' 3' Plus Strand G C C C G Minus Strand C G G G C 5' 5' 3' 3' + Plus Strand G C C C G Minus Strand C G G G C 5' 5' 3' 3' Plus Strand G C C C G Minus Strand C G G G C 5' 3' Direct replication of DNA is a higly complex copying mechanism involving more than ten different protein molecules. Complementarity is determined by Watson-Crick base pairs: A = T and G � C

  6. Selection equation : [I i ] = x i � 0 , f i > 0 ( ) dx ∑ ∑ = − φ = n = φ = n = , 1 , 2 , , ; 1 ; i L x f i n x f x f i i = i = j j 1 1 i j dt Mean fitness or dilution flux, φ (t), is a non-decreasing function of time, ( ) φ = ∑ n dx d { } 2 = − = ≥ 2 i var 0 f f f f i dt dt = 1 i Solutions are obtained by integrating factor transformation ( ) ( ) ⋅ 0 exp ( ) x f t = = i i ; 1 , 2 , L , x t i n ( ) ( ) ∑ = i n ⋅ 0 exp x f t j j 1 j

  7. s = ( f 2 - f 1 ) / f 1 ; f 2 > f 1 ; x 1 (0) = 1 - 1/N ; x 2 (0) = 1/N 1 Fraction of advantageous variant 0.8 0.6 s = 0.1 s = 0.02 0.4 0.2 s = 0.01 0 0 200 400 600 800 1000 Time [Generations] Selection of advantageous mutants in populations of N = 10 000 individuals

  8. „...Variations neither useful not injurious would not be affected by natural selection, and would be left either a fluctuating element, as perhaps we see in certain polymorphic species, or would ultimately become fixed, owing to the nature of the organism and the nature of the conditions. ...“ Charles Darwin, Origin of species (1859)

  9. Motoo Kimura’s population genetics of neutral evolution. Evolutionary rate at the molecular level. Nature 217 : 624-626, 1955. The Neutral Theory of Molecular Evolution . Cambridge University Press. Cambridge, UK, 1983.

  10. Motoo Kimura. The Neutral Theory of Molecular Evolution . Cambridge University Press. Cambridge, UK, 1983.

  11. CGTCGTTACAATTTA GTTATGTGCGAATTC CAAATT AAAA ACAAGAG..... G A G T CGTCGTTACAATTTA GTTATGTGCGAATTC CAAATT AAAA ACAAGAG..... A C A C Hamming distance d (I ,I ) = 4 H 1 2 (i) d (I ,I ) = 0 H 1 1 (ii) d (I ,I ) = d (I ,I ) H 1 2 H 2 1 � (iii) d (I ,I ) d (I ,I ) + d (I ,I ) H 1 3 H 1 2 H 2 3 The Hamming distance between genotypes induces a metric in sequence space

  12. Motoo Kimura. The Neutral Theory of Molecular Evolution . Cambridge University Press. Cambridge, UK, 1983.

  13. The molecular clock of evolution Motoo Kimura. The Neutral Theory of Molecular Evolution . Cambridge University Press. Cambridge, UK, 1983.

  14. Bacterial Evolution S. F. Elena, V. S. Cooper, R. E. Lenski. Punctuated evolution caused by selection of rare beneficial mutants . Science 272 (1996), 1802-1804 D. Papadopoulos, D. Schneider, J. Meier-Eiss, W. Arber, R. E. Lenski, M. Blot. Genomic evolution during a 10,000-generation experiment with bacteria . Proc.Natl.Acad.Sci.USA 96 (1999), 3807-3812

  15. lawn of E.coli 24 h 24 h nutrient agar Serial transfer of Escherichia coli cultures in Petri dishes � 1 day 6.67 generations � 1 month 200 generations � 1 year 2400 generations

  16. 1 year Epochal evolution of bacteria in serial transfer experiments under constant conditions S. F. Elena, V. S. Cooper, R. E. Lenski. Punctuated evolution caused by selection of rare beneficial mutants . Science 272 (1996), 1802-1804

  17. Hamming distance to ancestor 25 20 15 10 5 2000 4000 6000 8000 Generations Time Variation of genotypes in a bacterial serial transfer experiment D. Papadopoulos, D. Schneider, J. Meier-Eiss, W. Arber, R. E. Lenski, M. Blot. Genomic evolution during a 10,000-generation experiment with bacteria . Proc.Natl.Acad.Sci.USA 96 (1999), 3807-3812

  18. No new principle will declare itself from below a heap of facts. Sir Peter Medawar, 1985

  19. In evolution variation occurs on genotypes but selection operates on the phenotype . Mappings from genotypes into phenotypes are highly complex objects. The only computationally accessible case is in the evolution of RNA molecules. The mapping from RNA sequences into secondary structures and function, sequence � structure � fitness , is used as a model for the complex relations between genotypes and phenotypes. Fertile progeny measured in terms of fitness in population biology is determined quantitatively by replication rate constants of RNA molecules. Population biology Molecular genetics Evolution of RNA molecules Genotype Genome RNA sequence Phenotype Organism RNA structure and function Fitness Reproductive success Replication rate constant The RNA model

  20. 5' - end N 1 O CH 2 O GCGGAU UUA GCUC AGUUGGGA GAGC CCAGA G CUGAAGA UCUGG AGGUC CUGUG UUCGAUC CACAG A AUUCGC ACCA 5'-end 3’-end N A U G C k = , , , OH O N 2 O P O CH 2 O Na � O O OH N 3 O P O CH 2 O Na � 3'-end O O OH Definition of RNA structure 5’-end N 4 O P O CH 2 O Na � 70 O O OH 60 O P 3' - end O 10 Na � O 50 20 30 40

  21. 5'-End 3'-End Sequence GCGGAUUUAGCUCAGDDGGGAGAGCMCCAGACUGAAYAUCUGGAGMUCCUGUGTPCGAUCCACAGAAUUCGCACCA 3'-End 5'-End 70 60 Secondary structure 10 50 20 40 30 Base pairs: A = U , G � C , and G - U

  22. Definition and physical relevance of RNA secondary structures RNA secondary structures are listings of Watson-Crick and GU wobble base pairs, which are free of knots and pseudokots . D.Thirumalai, N.Lee, S.A.Woodson, and D.K.Klimov. Annu.Rev.Phys.Chem . 52 :751-762 (2001): „ Secondary structures are folding intermediates in the formation of full three-dimensional structures .“

  23. RNA sequence Biophysical chemistry: thermodynamics and kinetics Inverse folding of RNA : RNA folding : Biotechnology, Structural biology, design of biomolecules spectroscopy of with predefined biomolecules, structures and functions Empirical parameters understanding molecular function RNA structure of minimal free energy Sequence, structure, and design

  24. 5’-end 3’-end A C (h) C S 5 (h) S 3 U G C (h) S 4 U A A U (h) S 1 U G (h) S 2 C G (h) S 8 0 G (h) (h) S 9 G C S 7 � A U y g A A r e n A e (h) C C S 6 U e A e Suboptimal conformations U r G G C C F A G G U U U G G G A C C A U G A G G G C U G (h) S 0 Minimum of free energy The minimum free energy structures on a discrete space of conformations

  25. Criterion of Minimum Free Energy UUUAGCCAGCGCGAGUCGUGCGGACGGGGUUAUCUCUGUCGGGCUAGGGCGC GUGAGCGCGGGGCACAGUUUCUCAAGGAUGUAAGUUUUUGCCGUUUAUCUGG UUAGCGAGAGAGGAGGCUUCUAGACCCAGCUCUCUGGGUCGUUGCUGAUGCG CAUUGGUGCUAAUGAUAUUAGGGCUGUAUUCCUGUAUAGCGAUCAGUGUCCG GUAGGCCCUCUUGACAUAAGAUUUUUCCAAUGGUGGGAGAUGGCCAUUGCAG Sequence Space Shape Space

  26. Reference for postulation and in silico verification of neutral networks

  27. Evolution in silico W. Fontana, P. Schuster, Science 280 (1998), 1451-1455

  28. ψ = ( ) Sk I. = ( ) fk f Sk Sequence space Real numbers Structure space Mapping from sequence space into structure space and into function

  29. ψ = ( ) Sk I. = ( ) fk f Sk Sequence space Real numbers Structure space

  30. ψ = ( ) Sk I. Sequence space Structure space

  31. ψ = ( ) Sk I. Sequence space Structure space The pre-image of the structure S k in sequence space is the neutral network G k

  32. Neutral networks are sets of sequences forming the same object in a phenotype space. The neutral network G k is, for example, the pre- image of the structure S k in sequence space: G k = � -1 (S k ) π { � j | � (I j ) = S k } The set is converted into a graph by connecting all sequences of Hamming distance one. Neutral networks of small biomolecules can be computed by exhaustive folding of complete sequence spaces, i.e. all RNA sequences of a given chain length. This number, N=4 n , becomes very large with increasing length, and is prohibitive for numerical computations. Neutral networks can be modelled by random graphs in sequence space. In this approach, nodes are inserted randomly into sequence space until the size of the pre-image, i.e. the number of neutral sequences, matches the neutral network to be studied.

Recommend


More recommend