structural bioinformatics
play

Structural Bioinformatics Davide Ba Staff Scientist Genome - PowerPoint PPT Presentation

Structural Bioinformatics Davide Ba Staff Scientist Genome Biology Group (CNAG) Structural Genomics Group (CRG) dbau@pcb.ub.cat Proteins Amino Acids The peptide bond Properties A peptide bond is a covalent bond formed between two


  1. Structural Bioinformatics Davide Baù Staff Scientist Genome Biology Group (CNAG) Structural Genomics Group (CRG) dbau@pcb.ub.cat

  2. Proteins

  3. Amino Acids

  4. The peptide bond Properties A peptide bond is a covalent bond formed between two molecules when the carboxyl group of one molecule reacts with the amino group of the other molecule, causing the release of a molecule of water (H 2 O). Polypeptides and proteins are chains of amino acids held together by peptide bonds.

  5. The peptide bond The peptide bond is planar Fixed Fixed Only 2 bonds can freely rotate: C α –N and C α - C(O) Adapted from http://oregonstate.edu

  6. Ramachandran plots Protein structures Φ and Ψ angles fall within allowed regions (displayed in green and red). Secondary structure elements are defined by specific pairs of Φ and Ψ angles: Image credits: http://www.imb-jena.de/ ~rake

  7. Take home message Proteins Chains of amino acids held together by the peptide bond Configuration Defined by limited pairs of Φ and Ψ angles Role Fundamental constituents of the cell

  8. Summary Protein structural levels Primary Secondary Tertiary Quaternary Image credits: http:// iitb.vlab.co.in/

  9. Protein structure relevance The biochemical function (activity) of a protein is defined by its interactions with other molecules. The biological function is in large part a consequence of these interactions . The 3D structure is more informative than sequence because interactions are determined by residues that are close in space but are frequently distant in sequence.

  10. Protein prediction vs protein determination X-Ray NMR Experimental inferred data data Comparative Modeling Threading Ab-initio

  11. Utility of protein structure models, despite errors D. Baker & A. Sali. Science 294, 93, 2001.

  12. NMR spectroscopy Nuclear magnetic resonance γ V21 7.5 7.5 γ I5 γ V2 γ V21 γ Nle8 13/14 12/13 27/28+ 1.5 1.5 28/29 γ Nle8 γ R25 γ L11 γ L24 31/32 β L7 25/26 γ R25 30/31 β L28 β R25 2.0 β L11 β Nle8 β I5 β L24 β R20 β Q6 β E4 8.0 β V2 8.0 24/25 16/17 β E22 β Q29 8/9 β V31 4/5 β V21 γ Q6 2.5 2.5 22/23 γ E22 γ E4 α− β Ala18 γ Q29 9/10 α− β Ala19 β N33+ β N16 21/22 3.0 β N16 β N10 β D30 β Y34 2/3 β H32 β H14 3/4 δ R25 β H32 δ R20 8.5 8.5 3.5 3.5 β H9 β H14 β− β Ala18 α G12 β− β Ala19 20/21 α Nle8 4.0 α V21 α E4 α Q29 α G12 α Q6 α L11 α R20 α V2 α H9 4.5 α Y34 ppm ppm α N10 ppm α D30 ppm ppm 8.5 8.0 8.5 8.0 ppm 8.5 8.5 8.0 8.0 ppm ppm TOCSY NOESY

  13. NMR spectroscopy Nuclear magnetic resonance Superimposition of the ensemble of lowest energy structures of a peptide.

  14. X-RAY crystallography

  15. X-RAY crystallography

  16. Take home message Biochemical function Activity depends on the 3D structure Evolution conserve Structure is more conserved than sequence Protein types Fibrous Membrane Globular

  17. Nucleic acids DNA and RNA

  18. Nucleic acids DNA and RNA DNA and RNA are polymers made up of repeating units called nucleotides . Each nucleotide is composed of a nitrogen-containing nucleobase , a monosaccharide sugar and a phosphate group. The nucleotides are joined to one another in a chain by sugar- nucleobase covalent bonds. DNA (Deoxyribonucleic acid) encodes the genetic information. RNA (Ribonucleic acid) is implicated in various biological roles including coding, decoding, regulation, and expression of genes.

  19. The nucleotides DNA Phosphate group Nitrogenous base Guanine (G), Adenine (A), Thymine (T), or Cytosine (C) Sugar

  20. The nucleotides DNA Phosphate group Nitrogenous base Guanine (G), Adenine (A), Thymine (T), or Cytosine (C) Uracil (U) OH Sugar RNA

  21. Nitrogens bases DNA Adenine ( A ) Thymine ( T ) Guanine ( G ) Cytosine ( C )

  22. Nitrogens bases DNA RNA Adenine ( A ) Thymine ( T ) Uracil ( U ) Guanine ( G ) Cytosine ( C )

  23. The phosphodiester bond P B S P

  24. The phosphodiester bond P B S P

  25. Helix stability Hydrogen bonds and base-stacking interactions The two types of base pairs form different numbers of hydrogen bonds ( 2 for AT, 3 for GC ). The DNA double helix is maintained largely by the intra-strand base stacking interactions (GC > AT). The stability of the dsDNA form depends also on sequence and length . DNA with high GC-content is more stable than DNA with low GC- content.

  26. Base pairing DNA

  27. Base pairing RNA

  28. Nucleic acids helical structures A-DNA B-DNA Z-DNA

  29. Nucleic acids helical structures A B Z R R L Helix sense 11 10 12 bp per turn Vertical rise per 2.56 3.4 3.7 bp (Å) Rotation per bp +33 +36 -30 (degrees) Helical 23 19 18 diameter (Å)

  30. Nucleic acids helical structures A-DNA B-DNA Z-DNA

  31. Major and minor groove Major groove Minor groove

  32. The helical structure and DNA Rosalind Franklin

  33. Take home message DNA and RNA Polymers of nucleotide units Nucleotides Nucleobase (G,C,A,T - U) + sugar +phosphate DNA Store the genetic information RNA Implicated in various biological processes

  34. Genomes Limited data types

  35. The role of chromatin structure Activity Organization hormone Processes

  36. Chromatin definition Chromatin is composed of DNA complexed with histone proteins and other bio-molecules . Chromatin formation enables the genome to be hierarchically packaged or condensed so that it can fit inside the nuclear space. The compaction allows to modulate gene transcription , DNA repair , recombination , and replication . Chromatin structure is considered highly dynamic .

  37. Chromatin structures

  38. The nuclear organization of DNA Chromosome Chromatin fibre Nucleosome Adapted from Richard E. Ballermann, 2012

  39. The resolution gap What do we “really” know? Knowledge IDM INM DNA length 10 10 10 10 nt Volume 10 10 10 10 10 μ m Time 10 10 10 10 10 10 10 10 s Resolution 10 10 10 μ

  40. The nucleosome DNA Methyl group Histone Gene Histone proteins Acetyl group Histone tail

  41. The nucleosome & chromatin marks DNA Methyl group Histone Gene Histone proteins Acetyl group Histone tail Modification H3K4 H3K9 H3K14 H3K27 H3K79 H4K20 H2BK5 mono- activation activation activation activation activation activation methylation repressio di-methylation activation repression activation n tri- repressio activation , activation repression repression methylation n repression acetylation activation activation

  42. Euchromatin and heterochromatin Electron microscopy Euchromatin: chromatin that is located away from the nuclear lamina, is generally less densely packed, and contains actively transcribed genes Heterochromatin: chromatin that is near the nuclear lamina, tightly condensed, and transcriptionally silent

  43. Complex genome organization Takizawa, T., Meaburn, K. J. & Misteli, Cell 135, 9–13 (2008) Chromosome size Gene density Expression  

  44. Lamina-genome interactions to neural/glial The poising’’ “Unlocking” Neuronal ), AC gene Stemcell gene genes in Cell-cycle promoters gene nuclear membrane nuclear lamina here internal chromatin (mostly active) lamina-associated domains and (repressed) architec- Genes over- mRNA large step Most genes in Lamina Associated Domains are transcriptionally silent, suggesting that lamina-genome interactions are widely involved in the control of gene expression Adapted from Molecular Cell 38, 603-613, 2010

  45. Complex genome organization Cavalli, G. & Misteli, Nat Struct Mol Biol 20, 290–299 (2013) Lamina Transcription hub Centromere cluster Chromosome territories Active Non- Nuclear coding pore Inactive Chromatin Superdomains DNA domains Marina Corral Nucleus

  46. Chromatin loops Gene Gene enhancers Gene activity Loops bring distal genomic regions in close proximity to one another. This in turn can have profound effects on gene transcription . Enhancers can be thousands of kilobases away from their target genes in any direction (or even on a separate chromosome).

  47. Main approaches

  48. 5C technology http://my5C.umassmed.edu Job Dekker Dostie et al. Genome Res (2006) vol. 16 (10) pp. 1299-309

  49. Structure determination using Hi-C data Biomolecular structure determination 2D-NOESY data Chromosome structure determination 3C-based data

  50. Interpreting chromatin interaction data Nuclear envelope or lamina Protein- complex- mediated interaction Subnuclear body or transcription factory Direct interaction Bystander interaction Baseline (polymer) Interaction with same subnuclear interaction structures Adapted from Dekker et all, (2013) Nat Rev Genetics

  51. ��������������������� Hi-C data and genomic tracks data ��������������������� Mouse chromosome 18 20 Mb ��������� Interaction depletion DNase I sensitivity Interaction enrichment RefSeq genes Adapted from Dekker et all, (2013) Nat Rev Genetics ��������������������������������������������������������

Recommend


More recommend