csci 8980 advanced topics in graphical models analysis of
play

CSci 8980: Advanced Topics in Graphical Models Analysis of Genetic - PowerPoint PPT Presentation

Basics HMDP Inference Results HDPM Results CSci 8980: Advanced Topics in Graphical Models Analysis of Genetic Variation Instructor: Arindam Banerjee November 26, 2007 Basics HMDP Inference Results HDPM Results Genetic Polymorphism


  1. Basics HMDP Inference Results HDPM Results CSci 8980: Advanced Topics in Graphical Models Analysis of Genetic Variation Instructor: Arindam Banerjee November 26, 2007

  2. Basics HMDP Inference Results HDPM Results Genetic Polymorphism Single nucleotide polymorphism (SNP)

  3. Basics HMDP Inference Results HDPM Results Genetic Polymorphism Single nucleotide polymorphism (SNP) Two possible kinds of nucleotides at a single locus

  4. Basics HMDP Inference Results HDPM Results Genetic Polymorphism Single nucleotide polymorphism (SNP) Two possible kinds of nucleotides at a single locus Nucleotide can be one of { A , C , T , G }

  5. Basics HMDP Inference Results HDPM Results Genetic Polymorphism Single nucleotide polymorphism (SNP) Two possible kinds of nucleotides at a single locus Nucleotide can be one of { A , C , T , G } Most genetic human variation are related to SNPs

  6. Basics HMDP Inference Results HDPM Results Genetic Polymorphism Single nucleotide polymorphism (SNP) Two possible kinds of nucleotides at a single locus Nucleotide can be one of { A , C , T , G } Most genetic human variation are related to SNPs Each variant is called an allele

  7. Basics HMDP Inference Results HDPM Results Genetic Polymorphism Single nucleotide polymorphism (SNP) Two possible kinds of nucleotides at a single locus Nucleotide can be one of { A , C , T , G } Most genetic human variation are related to SNPs Each variant is called an allele Haplotype

  8. Basics HMDP Inference Results HDPM Results Genetic Polymorphism Single nucleotide polymorphism (SNP) Two possible kinds of nucleotides at a single locus Nucleotide can be one of { A , C , T , G } Most genetic human variation are related to SNPs Each variant is called an allele Haplotype List of alleles in a local region of a chromosome

  9. Basics HMDP Inference Results HDPM Results Genetic Polymorphism Single nucleotide polymorphism (SNP) Two possible kinds of nucleotides at a single locus Nucleotide can be one of { A , C , T , G } Most genetic human variation are related to SNPs Each variant is called an allele Haplotype List of alleles in a local region of a chromosome Inherited as a unit, if there is no recombination

  10. Basics HMDP Inference Results HDPM Results Genetic Polymorphism Single nucleotide polymorphism (SNP) Two possible kinds of nucleotides at a single locus Nucleotide can be one of { A , C , T , G } Most genetic human variation are related to SNPs Each variant is called an allele Haplotype List of alleles in a local region of a chromosome Inherited as a unit, if there is no recombination Repeated recombinations between ancestral haplotypes

  11. Basics HMDP Inference Results HDPM Results Genetic Polymorphism (Contd.) Linkage disequilibrium (LD)

  12. Basics HMDP Inference Results HDPM Results Genetic Polymorphism (Contd.) Linkage disequilibrium (LD) Non-random association of alleles at different loci

  13. Basics HMDP Inference Results HDPM Results Genetic Polymorphism (Contd.) Linkage disequilibrium (LD) Non-random association of alleles at different loci Recombination decouples alleles, increase randomness, decrease LD

  14. Basics HMDP Inference Results HDPM Results Genetic Polymorphism (Contd.) Linkage disequilibrium (LD) Non-random association of alleles at different loci Recombination decouples alleles, increase randomness, decrease LD Infer chromosomal recombination hotspots

  15. Basics HMDP Inference Results HDPM Results Genetic Polymorphism (Contd.) Linkage disequilibrium (LD) Non-random association of alleles at different loci Recombination decouples alleles, increase randomness, decrease LD Infer chromosomal recombination hotspots Help understand origin and characteristics of genetic variation

  16. Basics HMDP Inference Results HDPM Results Genetic Polymorphism (Contd.) Linkage disequilibrium (LD) Non-random association of alleles at different loci Recombination decouples alleles, increase randomness, decrease LD Infer chromosomal recombination hotspots Help understand origin and characteristics of genetic variation Analyze genetic variation to reconstruct evolutionary history

  17. Basics HMDP Inference Results HDPM Results Haplotype Recombination and Inheritance

  18. Basics HMDP Inference Results HDPM Results Hidden Markov Process Generative model for choosing recombination sites

  19. Basics HMDP Inference Results HDPM Results Hidden Markov Process Generative model for choosing recombination sites Hidden Markov process

  20. Basics HMDP Inference Results HDPM Results Hidden Markov Process Generative model for choosing recombination sites Hidden Markov process Hidden states correspond to index over chromosomes

  21. Basics HMDP Inference Results HDPM Results Hidden Markov Process Generative model for choosing recombination sites Hidden Markov process Hidden states correspond to index over chromosomes Transition probabilities correspond to recombination rates

  22. Basics HMDP Inference Results HDPM Results Hidden Markov Process Generative model for choosing recombination sites Hidden Markov process Hidden states correspond to index over chromosomes Transition probabilities correspond to recombination rates Emission model corresponds to mutation process that give descendants

  23. Basics HMDP Inference Results HDPM Results Hidden Markov Process Generative model for choosing recombination sites Hidden Markov process Hidden states correspond to index over chromosomes Transition probabilities correspond to recombination rates Emission model corresponds to mutation process that give descendants Implemented using a Hidden Markov Dirichlet Process (HMDP)

  24. Basics HMDP Inference Results HDPM Results Dirichlet Process Mixtures We know the basics of DPMs

  25. Basics HMDP Inference Results HDPM Results Dirichlet Process Mixtures We know the basics of DPMs Haplotype modeling using an infinite mixture model

  26. Basics HMDP Inference Results HDPM Results Dirichlet Process Mixtures We know the basics of DPMs Haplotype modeling using an infinite mixture model A pool of ancestor haplotypes or founders

  27. Basics HMDP Inference Results HDPM Results Dirichlet Process Mixtures We know the basics of DPMs Haplotype modeling using an infinite mixture model A pool of ancestor haplotypes or founders The size of the pool is unknown

  28. Basics HMDP Inference Results HDPM Results Dirichlet Process Mixtures We know the basics of DPMs Haplotype modeling using an infinite mixture model A pool of ancestor haplotypes or founders The size of the pool is unknown Standard coalescence based models

  29. Basics HMDP Inference Results HDPM Results Dirichlet Process Mixtures We know the basics of DPMs Haplotype modeling using an infinite mixture model A pool of ancestor haplotypes or founders The size of the pool is unknown Standard coalescence based models Hidden variables is prohibitively large

  30. Basics HMDP Inference Results HDPM Results Dirichlet Process Mixtures We know the basics of DPMs Haplotype modeling using an infinite mixture model A pool of ancestor haplotypes or founders The size of the pool is unknown Standard coalescence based models Hidden variables is prohibitively large Hard to perform inference of ancestral features

  31. Basics HMDP Inference Results HDPM Results Dirichlet Process Mixtures (Contd.) H i = [ H i , 1 , . . . , H i , T ] haplotype over T SNPs, chromosome i

  32. Basics HMDP Inference Results HDPM Results Dirichlet Process Mixtures (Contd.) H i = [ H i , 1 , . . . , H i , T ] haplotype over T SNPs, chromosome i A k = [ A k , 1 , . . . , A k , T ] ancestral haplotype, mutation rate θ k

  33. Basics HMDP Inference Results HDPM Results Dirichlet Process Mixtures (Contd.) H i = [ H i , 1 , . . . , H i , T ] haplotype over T SNPs, chromosome i A k = [ A k , 1 , . . . , A k , T ] ancestral haplotype, mutation rate θ k C i , inheritance variable, latent ancestor of H i

  34. Basics HMDP Inference Results HDPM Results Dirichlet Process Mixtures (Contd.) H i = [ H i , 1 , . . . , H i , T ] haplotype over T SNPs, chromosome i A k = [ A k , 1 , . . . , A k , T ] ancestral haplotype, mutation rate θ k C i , inheritance variable, latent ancestor of H i Generative Model:

  35. Basics HMDP Inference Results HDPM Results Dirichlet Process Mixtures (Contd.) H i = [ H i , 1 , . . . , H i , T ] haplotype over T SNPs, chromosome i A k = [ A k , 1 , . . . , A k , T ] ancestral haplotype, mutation rate θ k C i , inheritance variable, latent ancestor of H i Generative Model: Draw a first haplotype a 1 | DP ( τ, Q 0 ) ∼ Q 0 ∼ P h ( ·| a 1 , θ 1 ) h 1

  36. Basics HMDP Inference Results HDPM Results Dirichlet Process Mixtures (Contd.) H i = [ H i , 1 , . . . , H i , T ] haplotype over T SNPs, chromosome i A k = [ A k , 1 , . . . , A k , T ] ancestral haplotype, mutation rate θ k C i , inheritance variable, latent ancestor of H i Generative Model: Draw a first haplotype a 1 | DP ( τ, Q 0 ) ∼ Q 0 ∼ P h ( ·| a 1 , θ 1 ) h 1 For subsequent haplotypes n cj � p ( c i = c j for some j < i | c 1 , . . . , c i − 1 ) = i − 1+ α 0 c i | DP ( τ, Q 0 ) ∼ α 0 p ( c i � = c j for all j < i | c 1 , . . . , c i − 1 ) = i − 1+ α 0

  37. Basics HMDP Inference Results HDPM Results Dirichlet Process Mixtures (Contd.) Generative Model (contd)

Recommend


More recommend