MolMed Course “ Genetics for Dummies ” Rotterdam, 7 November, 2018 Introduction to Complex Genetics: Concepts and Tools: part A André G Uitterlinden Genetic Laboratory Department of Internal Medicine Department of Epidemiology Department of Clinical Chemistry www.glimdna.org Our website… Professor Trifonius Zonnebloem Professor Cuthbert Calculus Professeur Tryphon Tournesol
ROTTERDAM – OLDEBARNEVELDSTRAAT - MULTATULI Viewed from the moon we are all equal Portret gemaakt door Mathieu Ficheroux, 1974
We differ from each other … DNA variation causes differences in: Development Appearance Behaviour Ageing Diseases
AGING RESEARCH
From DNA to RNA to Protein.... . . . . A A C C G C A T A A G G DNA base pair . . . . T T G G C G T A T T C C “Genetics” sequence exon Gene structure “Genomics” mRNA “Proteomics” Protein
Why do we study DNA variation ? *Biology: - Mechanism : understand cause of disease - Treatment : finding new potential drug targets *Prediction: - (Early) diagnostics with a stable marker : understand how DNA variation contributes to variation in: - Risk of disease (vulnarability): “personalized medicine” - “Response -to- treatment” (medication, diet): “pharmacogenetics”
“The Human Genome Project” What will DNA tell about this stain in a dress Bill Clinton Tony Blair Craig Venter Francis Collins * 26 Juni 2000: Press conference Bill Clinton & Tony Blair: "working draft“, 95% gesequenced * 14 april 2003: finished: 99% gesequenced. >>Cheaper and Faster!! Costs: $ 2.7 miljard (instead of $ 3 billion estimated costs) Timing: 1990 - 2003 (instead of 2005)
AGGAGTCTGACTGACCATTGGACTAGGGGATTGACCAGTAGGCTGCGATTCGGATGCGGATTGACGATTAAAAAGGATTACGATTAGCTGT COMPLEX GENETICS: HUMAN DNA IS HIGHLY VARIABLE GACGTGCAGGATGCTGCGATGCTGGACTGAACGCCCCCCGGGCTTCTTTATTAGCTGCTGACGTGCCAGATGCTGACGTGCAGTGCGG TGACGGTGCTTACCTGGATCGGATGCTACCAGTCGATCGATCGATCGTAGCGTAGCGTATGCTAGCTAGTGATCGATGCTAGTAGCTA GCTGATCGATCATCGATCGTAGCTAGCTAGCTAGCTAGCTGATCGATCGATGCTAGCTAGCTAGCTAGTCATCTGTGGTGGGGGGTTA CGATTGCCGCTAGCTAGAACAAAATAGCG G TATTTTGGGGAGTCTGACTGACCATTGGACTAGGGGATTGACCAGTAGGCTGCGATTCGGA DNA Variants are: TGCGGATTGACGATTAAAAAGGATTACGATTAGCTGTGACGTGCAGGATGCTGCGATGCTGGACTGAACGCCCCCCGGGCTTCTTTA “SNP= Single Nucleotide Polymorphism ” CTGCTGACGTGCCAGATGCTGACGTGCAGTGCGGCTGACGG A GTCTGACTGACCATTGGACTAGGGGATTGACCAGTAGGCTGCGA *Frequent in the Genome (based on 500k WGS/WES ): GGATGCGGATTGACGATTAAAAAGGATTACGATTAGCTGTGACGTGCAGGATGCTGCGATGCTGGACTGAACGCCCCCCGGGCTTCT TAGCTGCTGACGTGC C AGATGCTGACGTGCAGTGCGGCTGACGGTGCTTACCTGGATCGGATGCTACCAGTCGATCGATCGATCGT - Many : >150 million variable loci in human genome (~3%) TAGCGTATGCTAGCTAGTGATCGATGCTA G TAGCTAGCTA G CTGATCGATCATCGATCGTAGCTAGCTAGCTAGCTAGCTGATCGATCGA CTAGCTAGCTAGCTAGTCATCTGTGGTGGGGGGTTAAATGCGATTGCCGCTAGCTAGAACAAAATAGCGGTATTTTGGGGAGTCTGACTGA - Types: “ SNPs ” , in/del, CNV, VNTR CCATTGG A CTAGGGGATTGACCAGTA G GCTGCG A TTCGGATGCGGATTGACGATTAAAAAGGATTACGATTAGCTGTGACGTGCAGGA CTGCGATGCTGGACTG A ACGCCCCTCGGGCTTCTTTATTAGCTGCTGACGTGCCAGATGCTGACGTGCAGTGAGGAGTCTGACTGACCA -Databases: dbSNP, HapMap , 1KG, “ local ” NGS efforts,.. TGGACTAGGGGATTGACCAGTAGGCTGCGATTCG G ATGCGGATTGACGATTAAAAAGGATTACGATTAGCTGTGACGTGCAGGATGCTGC GATGCTGGACTGAACGCCCCCCGGGCTTCTTTATTAGCTGCTGACGTGCCAGATGCTGACGTGCAGTGCGGCTGACGGTGCTTACC ATCGGATGCTACCAGTCGATCGATCGATCGTAGCGTAGCGTATGCTAGCTAGTGATCGATGCTAGTAGCTAGCT A GCTGATCGATCAT AACCG TATA AGGGCTAGCTAGCTGATCGATCGATGCTAGC T AGCTAGCTAGTCATCTGTGGTGGGGGGTTAAATGCGATTGCCGCTA *Frequent in the Population: “IN/DEL= Insertion Deletion ” GAACAAAATAGCGGTATTTTGGCGGCTGACGGTGCTTACCTGGATCGGATGCTACCAGTCGATCGATCGATCGTAGCGTAGCGTATGCT > 5 % = common polymorphism CTAGTGAT C GATGCTAGTAAGGAGTCTGACTGACCATTG G ACTAGGGGATTGACCAGTAGGCTGCGATTCGGATGCGGATTGACGATT AAGGATTACGATTAGCTGTGACGTGCAGGATGCTGCGATGCTGGACTGAACGCCCCCCGGGCTTCTTTATTAGCTGCTGACGTGCCA 1 – 5 % = less common variant GCTGACGTGCAGTGCGGCTGACGGTGCTTACCTGGATCGGA T GCTACCAGTCGATCGATCGATCGTAGCGTAGCGTATGCTAGCTA TCGATGCTAGTAGCTAGCTAGCTGATCGATCATCGATCGTAGCTAGCTAGCTAGCTAGCTGATCGATCGATGCTAGCTAGCTAGCTAGTC “CNV=Copy Number Variation ” < 1 % = rare variant/mutation TGTGGTGGGGGGTTAAATGCGATTGCCGCTAGCTAGAACAAAATAGC GGTATTTTGGAGGAGTCTGACTGACCATTGGACTAGGGG ACCAGTAGGCTGCGATTCGGATGCGGATTGACGATTAAAAAGGAT TACGATTAGCTGTGACGTGCAGGATGCTGCGATGCTGGACT CGCCCCCCGGGCTTCTTTATTAGCTGCTGACGTGCCAGATGCTGACGTGCAGTGCGGCTGACGGTGCTTACCTGGATCGGATGCTA GTCGATCGATCGA T CGTAGCGTAGCGTATGCTAGCTAGTGATCGATGCTAGTAGCTAGCTAGCTGATCGATCATCGATCGTAG C TAGCT TAGCTAGCTGATCGATCGATGCTAGCTAGCTAGCTAGTCATCTGTGGTGGGGGGTTAAATGCGATTGCCGCTAGCTAGAACAAAATAGCGG “VNTR= Variable Nunber of Tandem Repeats ” TATTTTGGGCTAGCTAGCTGATCGATCAT C GATCGTAGCTAGCTAGCTAGCTAGCTGATCGATCGATGCTAGCTAGCTAGCTAGTCATCT GTGGGGGGTTAAATG CACACACACACACACACACACACACACACACACA GATTGCCGCTAGCTAGAACAAAATAGCGGTATTTTGG GCTTACCTGGATCGGATGCTACCAGTCGATCGATCGATCGTAGCGTAGCGTATGCTAGCTAGTGATCGATGCTAGTAGCTAGCTAGCTGA GATCATCGATCGTAGCTAGCTAGCTAGCTAGCTGATCGATCGATGCTAGCTAGCTAGCTAGTCATCTGT G GTGGGGGGTTAAATGCGA CGCTAGCTAGAACAAAATAGCGGTATTT T GGAGGAGTCTGACTGACCATTGGACTAGG G GATTGACCAGTAGGCTGCGATTCGGATGCGG ATTGACGATTAAAAAGGATTACGATTAGCTGTGACGTGCAGG A TGCTGCGATGCTGGACTGAACGCCCCCCGGGCTTCTTTATTAGCT GACGTGCCAGATGCTGACGTGCAGTGCGGCTGACGGTGCTTACCTGGATCGGATGCTACCAGTCGATCGATCGATCGTAGCGTAGCGT GCTAGCTAGTGATCGATGCTAGT A GCTAGCTAGCTGATCGA
AGGAGTCTGACTGACCATTGGACTAGGGGATTGACCAGTAGGCTGCGATTCGGATGCGGATTGACGATT AAAA AGGATTACGATT AGCTGTGACGTGCAGGATGCTGCGATGCTGGACTGAACGCCCCCCGGGCTTCTTTATTAGCTGCTGACGTGCCAGATGCTGAC GTGCAGTGCGGCTGACGGTGCTTACCTGGATCGGATGCTACCAGTCGATCGATCGATCGTAGCGTAGCGTATGCTAGCTAGTGAT CGATGCTAGTAG C TAGCTAGCTGATCGATCATCGATCGTAGCTAGCTAGCTAGCTAGCTGATCGATCGATGCTAGCTAGCTAGCTA GTCATCTGTGGTGGGGGGTTAAATGCGATTGCCGCTAGCTAGAACAAAATAGCGGTATTTTGGGGAGTCTGACTGACCATTGGAC TAGGGGATTGACCAGTAGGCTGCGATTCGGATGCGGATTGACGATTAAAAAGGATTACGATTAGCTGTGACGTGCAGGATGCTGC GATGCTGGACTGAACGCCCCCCGGGCTTCTTTATTAGCTGCTGACGTGCCAGATGCTGACGTGCAGTGCGGCTGACGGAGTCT GACTGACCATTGGACTAGGGGATTGACCAGTAGGCTGCGATTCGGATGCGGATTGAC G ATTAAAAAGGATTACGATTAGCTGTGA “SNP” CGTGCAGGATGCTGCGATGCTGGACTGAACGCCCCCCGGGCTTCTTTATTAGCTGCTGACGTGCCAGATGCTGACGTGCAGTG CGGCTGACGGTGCTTACCTGGATCGGATGCTACCAGTCGATCGATCGATCGTAGCGTAGCGTATGCTAGCTAGTGATCGATGCTA GTAGCTAGCTAGCTGATCGATCATCGATCGTAGCTAGCTAGCTAGCTAGCTGATCGATCGATGCTAGCTAGCTAGCTAGTCATCTGT GGTGGGGGGTTAAATGCGATTGCCGCTAGCTAGAACAAAATAGCGGTATTTTGGGGAGTCTGACTGACCATTGGACTAGGGGATT GACCAGT A GGCTGCGATTCGGATGCGGATTGACGATTAAAAAGGATTACGATTAGCTGTGACGTGCAGGATGCTGCGATGCTGG ACTGAACGCCCCCCGGGCTTCTTTATTA G CT G CTGACGTGCCAGATGCTGACGTGCAGTGAGGAGTCTGACTGACCATTGGACT AGGGGATTGACCAGTAGGCTGCGATTCGGATGCGGATTGACGATTAAAAAGGATTACGATTAGCTGTGACGTGCAGGATGCTGC GATGCTGGACTGAACGCCCCCCGGGCTTCTTTATTAGCTGCTGACGTGCCAGATGCTGACGTGCAGTGCGGCTGACGGTGCTTA CCTGGATCGGATGCTACCAGTCGATCGATCGATCGTAGCGTAGCGTATGCTAGCTAGTGATCGATGCTAGTAGCTAGCTAGCTGAT CGATCATCGATAACCG T ATAAGGGCTAGCTAGCTGATCGATCGATGCTAGCTAGCTAGCTAGTCATCTGTGGTGGGGGGTTAAATG CGATTGCCGCTAGCTAGAACAAAATAGCGGTATTTTGGCGGCTGACGGTGCTTACCTGGATCGGATGCTACCAGTCGATCGATCG ATCGTAGCGTAGCGTATGCTAGCTAGTGATCGATGCTAGTAAGGAGTCTGACTGACCATTGGACTAGGGGATTGACCAGTAGGCT GCGATTCGGATGCGGATTGACGATTAAAAAGGATTACGATTAGCTGTGAC G TGCAGGATGCTGCGATGCTGGACTGAACGCCCC T- C “HAPLOTYPE” CCGGGCTTCTTTATTAGCTGCTGACGTGCCAGATGCTGACGTGCAGTGCGGCTGACGGTGCTTACCTGGATCGGATGCTACCAG TCGAT C GATCGATCGTAGCGTAGCGTATGCTAGCTAGTGATCGATGCTAGTAGCTAGCTAGCTGATCGATCATCGATCGTAGCTAG CTAGCTAGCTAGCTGATCGATCGATGCTAGCTAGCTAGCTAGTCATCTGTGGTGGGGGGTTAAATGCGATTGCCGCTAGCTAGAA CAAAATAGCGGTATTTTGGAGGAGTCTGACTGACCATTGGACTAGGGGATTGACCAGTAGGCTGCGATTCGGATGCGGATTGAC GATTAAAAAGGATTACGATTAGCTGTGACGTGCAGGATGCTGCGATGCTGGACTGAACGCCCCCCGGGCTTCTTTATTA G CTGCT GACGTGCCAGATGCTGACGTGCAGTGCGGCTGACGGTGCTTACCTGGATCGGATGCTACCAGTCGATCGATCGATCGTAGCGTA GCGTATGCTAGCTAGTGATCGATGCTAGTAGCTAGCTAGCTGATCGATCATCGATCGTAGCTAGCTAGCTAGCTAGCTGATCGATC GATGCTAGCTAGCTAGCTAGTCATCTGTGGTGGGGGGTTAAATGCGATTGCCGCTAGCTAGAACAAAATAGCGGTATTTTGGGCTA GCTAGCTGA TCG ATCATCGATCGTAGCTAGCTAGCTAGCTAGCTGATCGATCGATGCTAGCTAGCTAGCTAGTCATCTGTGGTGGG GGGTTAAATGCGATTGCCGCTAGCTAGAACAAAATAGCGGTATTTTGGGGTGCTTACCTGGATCGGATGCTACCAGTCGATCGATC GATCGTAGCGTAGCGTATGCTAGCTAGTGATCGATGCTAGTAGCTAGCTAGCTGATCGATCATCGATCGT A GCTAGCTAGCTAGCT AGCTGATCGATCGATGCTAGCTAGCTAGCTAGTCATCTGTGGTGGGGGGTTAAATGCGATTGCCGCTAGCTAGAACAAAATAGCG GTATTTTGGAGGAGTCTGACTGACCATTGGACTAGGGGATTGACCAGTAGGCTGCGATTCGGATGCGGATTGACGATTAAAAAGG ATTACGATTAGCTGTGACGTGCAGGATGCTGCGATGCTGGACTGAACGCCCCCCGGGCTTCTTTATTAGCTGCTGACGTGCCAGA TGCT G ACGTGCAGTGCGGCTGACGGTGCTTACCTGGATCGGATGCTACCAGTCGATCGATCGATCGTAGCGTAGCGTATGCTAG CTAGTGATCGATGCTAGTAGCTAGCTAGCTGATCGA
SNPs, alleles, genotypes and haplotypes SNP= Single Nucleotide Polymorphism Chromosomes: strand Allele A A C G from Father + + from Mother C G A T Genotype Haplotype Allele A A C G C G A T
Single Nucleotide Polymorphisms (SNPs) are common and have subtle effects Codon 222 Codon 222 ..AACCG C ATAAGG.. ..AACCG T ATAAGG.. ..TTGGC G TATTCC.. ..TTGGC A TATTCC.. DNA: C677T Alanine Valine protein: Ala222Val c u enzyme activity Ala Val Hcy level Population frequency: 65% 35% Disease risk
“Simple” versus “Complex” Disease “Simple”/Monogenic Disease “Complex” Disease • severe phenotype • mild phenotype • early onset • late onset • rare • common • Mendelian inheritance • complex inheritance • e.g.: cystic fibrosis, • e.g.: diabetes, asthma, osteogenesis imperfecta osteoporosis, etc. Mutations Polymorphisms Cause: + polymorphisms + mutations
Recommend
More recommend