Genomes and Complex Diseases 02-223 Personalized Medicine: - PowerPoint PPT Presentation

Genomes ¡and ¡Complex ¡Diseases ¡ 02-‑223 ¡Personalized ¡Medicine: ¡ Understanding ¡Your ¡Own ¡Genome ¡ Fall ¡2014 ¡

Genome ¡Polymorphisms ¡

A Human TCGAGGTATTAAC Genealogy ¡ The ¡ancestral ¡chromosome ¡

From SNPS … ¡ TCGAGGTATTAAC TCTAGGTATTAAC TCGAGGCATTAAC TCTAGGTGTTAAC TCGAGGTATTAGC TCTAGGTATCAAC * ** * * A ¡disease ¡mutaCon ¡

Finding ¡Disease ¡Muta9ons ¡ • Case/control ¡data ¡are ¡collected ¡from ¡unrelated ¡individuals ¡ – All ¡individuals ¡are ¡related ¡if ¡we ¡go ¡back ¡far ¡enough ¡in ¡the ¡ancestry ¡ Balding, ¡Nature ¡Reviews ¡GeneCcs, ¡2006 ¡

Family-‑Based ¡and ¡Popula9on-‑Based ¡Studies ¡ • How ¡can ¡we ¡idenCfy ¡disease-‑related ¡geneCc ¡loci? ¡ • Linkage ¡analysis ¡ – Data ¡are ¡collected ¡for ¡family ¡members ¡ – Difficult ¡to ¡collect ¡data ¡on ¡a ¡large ¡number ¡of ¡families ¡ – EffecCve ¡for ¡rare ¡diseases ¡ • Genome-‑wide ¡associaCon ¡studies ¡(GWAS) ¡ – Data ¡are ¡collected ¡for ¡unrelated ¡individuals ¡ – Easier ¡to ¡find ¡a ¡large ¡number ¡of ¡affected ¡individuals ¡ – EffecCve ¡for ¡common ¡diseases, ¡compared ¡to ¡family-‑based ¡ method ¡

Family-‑Based ¡and ¡Popula9on-‑Based ¡Studies ¡ • How ¡do ¡the ¡LD ¡pa]erns ¡in ¡genomes ¡differ ¡between ¡family ¡and ¡ populaCon ¡data? ¡ • Large ¡or ¡small ¡linked ¡regions? ¡ • ResoluCon ¡for ¡pinpoinCng ¡the ¡disease ¡locus? ¡ ¡

Linkage ¡Analysis ¡vs. ¡Associa9on ¡Analysis ¡ (Number ¡of ¡recombinaCon ¡sites) ¡ (shared ¡genome ¡segments ¡among ¡different ¡individuals ¡in ¡the ¡study) ¡ Strachan ¡& ¡Read, ¡Human ¡Molecular ¡GeneCcs, ¡2001 ¡

SNP ¡Genotyping ¡vs. ¡Whole ¡Genome ¡Sequencing ¡ • How ¡would ¡you ¡idenCfy ¡disease ¡mutaCons ¡using ¡whole ¡ genome ¡sequencing ¡technology? ¡ • How ¡would ¡you ¡idenCfy ¡disease ¡mutaCons ¡using ¡SNP ¡ genotyping ¡technology? ¡

How ¡Can ¡We ¡Iden9fy ¡the ¡Unknown ¡Disease ¡Locus? ¡ • Idea: ¡Given ¡a ¡map ¡of ¡SNP ¡geneCc ¡markers, ¡let’s ¡look ¡for ¡the ¡ markers ¡that ¡are ¡linked ¡to ¡the ¡unknown ¡disease ¡locus ¡(i.e. ¡ linkage ¡between ¡the ¡disease ¡locus ¡and ¡the ¡marker ¡locus) ¡ Disease ¡ ¡ Locus ¡ SNP ¡near ¡the ¡ SNPs ¡far ¡from ¡the ¡ disease ¡locus ¡ disease ¡locus ¡ ¡ ( r 2 >>0.5) ¡ ( r 2 <<0.5) ¡

Genome-‑Wide ¡Associa9on ¡Study ¡(GWAS) ¡ • Data ¡are ¡collected ¡for ¡genotypes ¡and ¡phenotypes ¡for ¡a ¡large ¡ number ¡of ¡unrelated ¡individuals ¡ – Genotypes: ¡ ¡ • ofen ¡SNP ¡genotypes ¡are ¡used ¡because ¡of ¡the ¡ease ¡of ¡genotyping ¡ and ¡abundance ¡across ¡genomes. ¡ • For ¡SNPs, ¡minor ¡allele ¡homozygous, ¡heterozygous, ¡and ¡major ¡allele ¡ heterozygous ¡sites ¡are ¡coded ¡as ¡0, ¡1, ¡and ¡2. ¡ – Phenotypes: ¡ ¡ • Categorical ¡data ¡(e.g., ¡case/control ¡labels ¡for ¡individuals) ¡ • ConCnuous-‑valued ¡data ¡(e.g., ¡height, ¡cholesterol ¡level, ¡blood ¡IgE ¡ level) ¡

Overview ¡ • Case ¡control ¡studies ¡ – Discrete ¡phenotype ¡ – Are ¡you ¡a ¡healthy ¡normal ¡or ¡a ¡paCent? ¡ • QuanCtaCve ¡trait ¡studies ¡ – ConCnuous-‑valued ¡phenotypes ¡ – Height, ¡eye ¡color, ¡blood ¡pressure, ¡cholesterol ¡level, ¡body-‑mass ¡index ¡ etc. ¡

Genome-‑Wide ¡Associa9on ¡Study ¡(GWAS) ¡ • Data ¡collected ¡for ¡GWAS ¡can ¡be ¡represented ¡as ¡two ¡matrices ¡ J ¡SNPs ¡ ¡ K ¡phenotypes ¡ N ¡individuals ¡ N ¡individuals ¡ Genotype ¡Data ¡ ¡ Phenotype ¡Data ¡

Genome-‑Wide ¡Associa9on ¡Study ¡(GWAS) ¡ • For ¡each ¡SNP ¡and ¡each ¡phenotype, ¡perform ¡a ¡staCsCcal ¡test ¡ for ¡“associaCon” ¡ ¡ • Repeat ¡this ¡for ¡all ¡(SNP, ¡phenotype) ¡pairs ¡ • IdenCfy ¡the ¡(SNP, ¡phenotype) ¡pairs ¡with ¡“significant” ¡ associaCon. ¡ ¡ – The ¡genome ¡region ¡around ¡the ¡SNP ¡is ¡likely ¡to ¡influence ¡the ¡ phenotype ¡ J ¡SNPs ¡ ¡ K ¡phenotypes ¡ N ¡individuals ¡ N ¡individuals ¡ Genotype ¡Data ¡ Phenotype ¡Data ¡ Are ¡the ¡SNP ¡and ¡the ¡phenotype ¡ significantly ¡associated? ¡

GWAS: ¡Case/Control ¡Study ¡ One ¡phenotype ¡for ¡case/ J ¡SNPs ¡ ¡ control ¡labels ¡ N ¡individuals ¡ N ¡individuals ¡ Genotype ¡Data ¡ Are ¡the ¡SNP ¡and ¡the ¡phenotype ¡ significantly ¡associated? ¡

GWAS: ¡Case/Control ¡Study ¡ • For ¡each ¡marker ¡locus, ¡find ¡the ¡ 3x2 ¡conCngency ¡table ¡containing ¡ the ¡counts ¡of ¡three ¡genotypes ¡ Genotype Case Control Total AA N case,AA N control,AA N AA Aa N case,Aa N control,Aa N Aa aa N case,aa N control,aa N aa Total N case N control N 2 • ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡test ¡with ¡2 ¡df ¡(degree ¡of ¡ χ freedom) ¡under ¡the ¡null ¡ Genotype ¡score ¡= ¡the ¡number ¡of ¡minor ¡alleles ¡ ¡ hypothesis ¡of ¡no ¡associaCon ¡ ¡

GWAS: ¡Case/Control ¡Study ¡ • AlternaCvely, ¡assume ¡an ¡addiCve ¡model, ¡where ¡the ¡ heterozygote ¡risk ¡is ¡approximately ¡between ¡the ¡two ¡ homozygotes ¡ • Form ¡a ¡2x2 ¡conCngency ¡table. ¡Each ¡individual ¡contributes ¡ twice ¡from ¡each ¡of ¡the ¡two ¡chromosomes. ¡ Allele Case Control Total Type A G case,A G control,A G A a G case,a G control,a G a Total 2xN case 2xN control 2N 2 χ • ¡ ¡ ¡ ¡ ¡ ¡test ¡with ¡1df ¡

2 χ Test ¡(Chi-‑square ¡Test) ¡ • StaCsCcal ¡test ¡of ¡associaCon ¡ • In ¡case/control ¡associaCon ¡study, ¡the ¡null ¡hypothesis ¡is ¡ ¡ ¡ H 0 : ¡There ¡is ¡no ¡associaCon ¡between ¡the ¡given ¡marker ¡and ¡disease ¡labels. ¡ • P-‑value ¡= ¡probability ¡of ¡the ¡observed ¡data ¡under ¡the ¡null ¡hypothesis ¡ • Low ¡p-‑value ¡(p-‑value ¡< ¡α, ¡where ¡α ¡is ¡a ¡user-‑specified ¡value) ¡means ¡ the ¡observed ¡data ¡are ¡unlikely ¡under ¡the ¡null ¡hypothesis. ¡Thus, ¡we ¡ reject ¡the ¡null ¡hypothesis ¡( H 0 ) ¡and ¡declare ¡there ¡is ¡a ¡significant ¡ associaCon ¡between ¡the ¡SNP ¡and ¡disease ¡states. ¡ – Ofen ¡α=0.01 ¡or ¡0.05 ¡is ¡used. ¡

Chi-‑Square ¡Test: ¡Null ¡Hypothesis ¡from ¡ Con9ngency ¡Table ¡ We ¡have ¡two ¡random ¡variables: ¡ • Allele Case Control Total – Y: ¡disease ¡status ¡(Case/Control) ¡ Type – X: ¡allele ¡type ¡(A/a) ¡ A G case,A G control,A G A Null ¡hypothesis: ¡the ¡two ¡variables ¡are ¡ • a G case,a G control,a G a independent ¡of ¡each ¡other ¡(i.e., ¡the ¡two ¡ variables ¡are ¡unrelated) ¡ Total 2xN case 2xN control 2N

Chi-‑Square ¡Test ¡ ObservaCons ¡ Allele Case Control Total Type A G case,A G control,A G A a G case,a G control,a G a Total 2xN case 2xN control 2N Expected ¡values ¡ Allele Case Control Total Type A ? ? G A a ? ? G a Total 2xN case 2xN control 2N

Chi-‑square ¡sta9s9c ¡ n ( O i − E i ) 2 χ 2 = ∑ E i i = 1 O i ¡= ¡observed ¡frequency ¡for ¡ i th ¡outcome ¡ ¡ ¡ (the ¡value ¡can ¡be ¡read ¡off ¡of ¡the ¡conCngency ¡table) ¡ E i ¡= ¡expected ¡frequency ¡for ¡ i th ¡outcome ¡ ¡ ¡ (the ¡value ¡can ¡be ¡obtained ¡as ¡described ¡in ¡the ¡previous ¡slides) ¡ n ¡= ¡total ¡number ¡of ¡outcomes ¡ The ¡probability ¡distribuCon ¡of ¡this ¡staCsCc ¡is ¡given ¡by ¡the ¡ α= ¡ chi-‑square ¡distribuCon. ¡ Test ¡staCsCc= ¡ Using ¡chi-‑square ¡test, ¡we ¡can ¡test ¡how ¡well ¡observed ¡values ¡fit ¡expected ¡values ¡ computed ¡under ¡the ¡independence ¡hypothesis ¡

Genomes and Complex Diseases 02-223 Personalized Medicine: - PowerPoint PPT Presentation

Genomes and Complex Diseases 02-223 Personalized Medicine: Understanding Your Own Genome Fall 2014 Genome Polymorphisms A Human TCGAGGTATTAAC Genealogy The ancestral

Genomes for LIfe Cohort study of Genomes

The 1000 genomes project The 1000 genomes project Genetic variation > 1% 1000 2500

Rheumatological Rheumatological Diseases Diseases Rheumatological Rheumatological Diseases

Algorithms in Bioinformatics: A Practical Introduction Genome Alignment Complete genomes

Complex Numbers Complex Numbers 1 / 19 Complex Numbers Complex numbers ( C ) are an extension of

VACCINE PREVENTABLE DISEASES: VACCINE PREVENTABLE DISEASES: VACCINE PREVENTABLE DISEASES:

Intermembrane Space H + H + Cyt c Co Q Complex Complex III IV H + ATPase H + Complex

An introduction to complex numbers The complex numbers Are the real numbers not sufficient? A

Honey Bee Pests and Diseases Dale McMahan Honey Bee Pests and Diseases Pests Diseases

Therapeutic products Therapeutic products for respiratory diseases for respiratory diseases for

Therapeutic products Therapeutic products for respiratory diseases for respiratory diseases for

Comparative protein structure modeling of genes, genomes and complexes Marc A. Marti-Renom

Comparative protein structure modeling of genes and genomes Marc A. Marti-Renom Department of

Working with gene features and genomes Typical workflow when working with sequence data (e.g.,

Interuniversity Attraction Pole BioMAGNet (IAP P6/25) Bioinformatics and Modeling: from Genomes

Cancer Genomes 02-223 How to Analyze Your Own Genome Cancer vs.

A/E Selection Process A/E Selection Process from the Corps perspective from the Corps

IGE/EMI INTEROPERABILITY DEMONSTRATOR Ismael Carrion (IGE), Steve Crouch (IGE), Andrew Grimshaw

Radiation Induced Heart Disease Alexander (Sandy) Dick, MD Outline Pericardial Constriction

Scoliosis Awareness Program Recommended Presentation Script Have the Title Slide projected on the

Free and Open Source Geospatial Software at the University of Applied Sciences University of

Discussion of Intergenerational Mobility in China by Yi Fan, Junjian Yi, and Junsen Zhang

Identity management in the European Grid Infrastructure Established solutions, new needs, open

Drivers of Mobility Patrizio Piraino 5 September 2019 Introduction Evidence of positive

Sambuz

Useful Links

Newsletter

Mail Us

Genomes and Complex Diseases 02-223 Personalized Medicine: - PowerPoint PPT Presentation

Genomes and Complex Diseases 02-223 Personalized Medicine: Understanding Your Own Genome Fall 2014 Genome Polymorphisms A Human TCGAGGTATTAAC Genealogy The ancestral

Genomes for LIfe Cohort study of Genomes

The 1000 genomes project The 1000 genomes project Genetic variation &gt; 1% 1000 2500

Rheumatological Rheumatological Diseases Diseases Rheumatological Rheumatological Diseases

Algorithms in Bioinformatics: A Practical Introduction Genome Alignment Complete genomes

Complex Numbers Complex Numbers 1 / 19 Complex Numbers Complex numbers ( C ) are an extension of

VACCINE PREVENTABLE DISEASES: VACCINE PREVENTABLE DISEASES: VACCINE PREVENTABLE DISEASES:

Intermembrane Space H + H + Cyt c Co Q Complex Complex III IV H + ATPase H + Complex

An introduction to complex numbers The complex numbers Are the real numbers not sufficient? A

Honey Bee Pests and Diseases Dale McMahan Honey Bee Pests and Diseases Pests Diseases

Therapeutic products Therapeutic products for respiratory diseases for respiratory diseases for

Therapeutic products Therapeutic products for respiratory diseases for respiratory diseases for

Comparative protein structure modeling of genes, genomes and complexes Marc A. Marti-Renom

Comparative protein structure modeling of genes and genomes Marc A. Marti-Renom Department of

Working with gene features and genomes Typical workflow when working with sequence data (e.g.,

Interuniversity Attraction Pole BioMAGNet (IAP P6/25) Bioinformatics and Modeling: from Genomes

Cancer Genomes 02-223 How to Analyze Your Own Genome Cancer vs.

A/E Selection Process A/E Selection Process from the Corps perspective from the Corps

IGE/EMI INTEROPERABILITY DEMONSTRATOR Ismael Carrion (IGE), Steve Crouch (IGE), Andrew Grimshaw

Radiation Induced Heart Disease Alexander (Sandy) Dick, MD Outline Pericardial Constriction

Scoliosis Awareness Program Recommended Presentation Script Have the Title Slide projected on the

Free and Open Source Geospatial Software at the University of Applied Sciences University of

Discussion of Intergenerational Mobility in China by Yi Fan, Junjian Yi, and Junsen Zhang

Identity management in the European Grid Infrastructure Established solutions, new needs, open

Drivers of Mobility Patrizio Piraino 5 September 2019 Introduction Evidence of positive

Sambuz

Useful Links

Newsletter

Mail Us

The 1000 genomes project The 1000 genomes project Genetic variation > 1% 1000 2500