jsmc practical course inferring phylogeny based on
play

- JSMC Practical Course - Inferring Phylogeny Based on Sequence - PowerPoint PPT Presentation

- JSMC Practical Course - Inferring Phylogeny Based on Sequence Information Thursday Friday, March 21 22 Room 316, Philosophenweg 12 Wireless LAN: eduroam Username: tagung09@uni-jena.de Password: Gver58ges Phylogenetics -


  1. - JSMC Practical Course - Inferring Phylogeny Based on Sequence Information Thursday – Friday, March 21 – 22 Room 316, Philosophenweg 12 Wireless LAN: eduroam Username: tagung09@uni-jena.de Password: Gver58ges

  2. Phylogenetics - Phylogeny = evolutionary history of a specific group of organisms - Discipline of phylogenetics aims to find a classification for a specific group of organisms or genes that represents their true evolutionary relationship - Distinction between ancestral (plesiomorphic) and derived (apomorphic) features - Kinds of features: - morphological data - biochemical data - molecular data -> evolve relatively continuously -> homologies may be detected more easily -> very high quantity

  3. Molecular phylogenetics Types of molecular features: nucleotide sequences of ribosomal RNA- or tRNA-genes presence or absence of a certain gene within the genome Phylogenetic distance genomic rearrangements presence or absence of introns amino acid sequences of proteins nucleotide sequences of protein coding genes nucleotide sequences of introns nucleotide sequences of intergenic regions single nucleotide polymorphisms (SNPs)

  4. Molecular phylogenetics Types of molecular features: Distinction of distantly related species nucleotide sequences of ribosomal RNA- or tRNA-genes presence or absence of a certain gene within the genome Phylogenetic distance genomic rearrangements presence or absence of introns broad range amino acid sequences of proteins of applications nucleotide sequences of protein coding genes nucleotide sequences of introns nucleotide sequences of intergenic regions single nucleotide polymorphisms (SNPs) Distinction of single individuals e.g. paternity tests or criminal biology

  5. Interpretation of phylogenetic trees Root The order of the taxa (terminal branches) is not of importance. Node Each sub-tree can be arbitrarily rotated at each node (so that the order of the taxa changes). Branch Only the topology of the tree (i.e. the node structure) specifies the phylogenetic relationship!

  6. Types of phylogenetic trees Cladogram Dendrogram Phylogram No meaning Feature change Time

  7. Display formats

  8. Gene trees and species trees Horizontal Gene Speciation Gene loss gene transfer duplication

  9. Process of phylogeny inference - Data collection - Homologous sequences are searched based on sequence similarity -> BLAST - Multiple sequence alignment - Homologous sites are detected and aligned along each other -> MAFFT - Selection of an appropriate model to infer phylogeny - Based on the level of sequence identity/similarity among the alignment members their phylogenetic relation is reconstructed -> Neighbour joining -> Bayesian phylogeny inference Holder and Lewis, 2003

  10. Process of phylogeny inference - Data collection - Homologous sequences are searched based on sequence similarity -> BLAST - Multiple sequence alignment - Homologous sites are detected and aligned along each other -> MAFFT - Selection of an appropriate model to infer phylogeny - Based on the level of sequence identity/similarity among the alignment members their phylogenetic relation is reconstructed -> Neighbour joining -> Bayesian phylogeny inference Holder and Lewis, 2003

  11. Process of phylogeny inference Tree construction and searching methods Tree evaluation methods (optimality criteria) - Stepwise addition - Minimum evolution - Star decomposition - Parsimony - Maximum likelihood - Heuristic search - Exact search

  12. Tree construction and searching methods - Stepwise addition Attaches linage by linage according to their relative similarity - Star decomposition Joins linage by linage according to their relative similarity

  13. Tree construction and searching methods Global optimum - Heuristic search Performs branch swapping to generate Local alternative trees in attempt to find a better optima Tree quality tree - Exact search Searches the complete ‘space’ of possible trees Space of all possible trees

  14. Tree evaluation methods (optimality criteria) Position 1 Position 2 Position 3 Minimum Evolution Sequence 1 A A A Sequence 2 A T G - Uses a distance matrix to evaluate tree quality Sequence 3 A T C - For every tree the branch length are estimated that best explain the observed distances S 1 S 2 S 3 S 2 S 3 S 1 0.5 0.5 S 1 0 2 2 -> fast S 2 2 0 1 1 -> can correct for unseen changes 0.5 S 3 2 1 0 -> weaknesses for long branches (i.e. high evolutionary distances)

  15. Tree evaluation methods (optimality criteria) Position 1 Position 2 Position 3 Parsimony Sequence 1 A A A Sequence 2 A T G - Maps sequence history onto tree Sequence 3 A T C - Evaluates tree quality by finding the minimum number of mutations that could explain the data A T G A T C A A A G -> C -> fast enough for hundreds of sequences -> does not correct for multiple mutational A T G T -> A pathways of the same tree A -> G -> performs poorly if branch length differ A T A

  16. Tree evaluation methods (optimality criteria) Maximum likelihood - Maps sequence history onto tree - Finds the tree that is most likely to explain the data -> captures all possible mutational pathways -> corrects for multiple mutational events at the same site -> slow

  17. Process of phylogeny inference Tree construction and searching methods Tree evaluation methods (optimality criteria) - Stepwise addition - Minimum evolution Produce only - Star decomposition - Parsimony one tree - Maximum likelihood - Heuristic search No information about the reliability of single branches - Exact search -> Bootstrapping

  18. Bootstrapping - Creates pseudo-replicates of original data - Performs the same tree search for all pseudo- replicates and stores the trees - The reliability of a certain grouping is determined based on the number of trees that show this grouping -> very time consuming Holder and Lewis, 2003

  19. Bayesian phylogenetics - Performs tree search and measure of support simultaneously - Uses Markov chain Monte Carlo (MCMC) simulations to produce alternative trees - Not a strict ‘hill-climber’ (does not only accept better trees) - Higher probability to reach global optimum Holder and Lewis, 2003

  20. Bayesian phylogenetics

  21. Basic terms of phylogenetics - Monophyletic group - Analogy - Paraphyletic group - Homology - Polyphyletic group - Homoplasy - Apomorphic feature - Convergence - Plesiomorphic feature - MRCA - Autapomorphy - Extant species - Synapomomorphy - Extinct species - Symplesiomorphy - Dichotomy - Polytomy

  22. The Tree Thinking Challenge Is the frog more closely related to the human or to the fish?

  23. The Tree Thinking Challenge Is the frog more closely related to the human or to the fish?

  24. The Tree Thinking Challenge

  25. The Tree Thinking Challenge

  26. The Tree Thinking Challenge

  27. The Tree Thinking Challenge

  28. The Tree Thinking Challenge

  29. The Tree Thinking Challenge

  30. The Tree Thinking Challenge

  31. The Tree Thinking Challenge

  32. The Tree Thinking Challenge

  33. The Tree Thinking Challenge

  34. The flower development of angiosperms Petals Stamens Carpels Sepals Ovules

  35. The ABCDE model Petals Stamens Carpels Sepals Ovules

  36. The ABCDE model SHATTERPROOF1 SHATTERPROOF2 APETALA3 PISTILLATA SEEDSTICK APETALA1 AGAMOUS SEPALLATA1 SEPALLATA2 SEPALLATA3 SEPALLATA4

  37. The floral quartet model

  38. Phylogeny of seed plants Gymnosperms Basal angiosperms Magnoliids Monocots Basal eudicots Core eudicots Arabidopsis thaliana

  39. Phylogeny of seed plants APETALA1 SEPALLATA1 ? Gymnosperms SEPALLATA2 APETALA3 SEPALLATA3 Basal angiosperms PISTILLATA SEPALLATA4 Magnoliids AGAMOUS Monocots Basal eudicots Core eudicots Arabidopsis thaliana

  40. Phylogeny of seed plants - Search for orthologs of floral APETALA1 SEPALLATA1 homeotic genes in distantly Gymnosperms SEPALLATA2 related angiosperm and APETALA3 SEPALLATA3 gymnosperm species Basal angiosperms PISTILLATA SEPALLATA4 - Examine the phylogenetic Magnoliids AGAMOUS relationship of the gene families Monocots Basal eudicots Core eudicots Arabidopsis thaliana

Recommend


More recommend