Bayesian Haplotype Inference via the Dirichlet Process Eric Xing, - PowerPoint PPT Presentation

Nov 14, 2022


Bayesian Haplotype Inference via the Dirichlet Process
Eric Xing, Michael Jordan, Roded Sharan
Presented by Amrudin Agovic
Motivation
• 99.9% of human DNA is shared
• The remaining 0.1% accounts for the differences between individuals
• Need to determine what that 0.1% is
• Find genes responsible for diseases
Background
• Humans have 23 pairs of chromosomes in their cells
• 23 come from the father, 23 from the mother
• Certain parts of the genome are inherited unchanged
• Other genetic information gets mixed up
Background
• Allele: the genetic variant that occupies a position (locus) on a chromosome
• Genotype: unordered pairs of alleles in a region (one allele from each chromosome)
• Phase: the allele-to-chromosome association (not given)
• SNP: single nucleotide polymorphism, a difference in one nucleotide (A, C, G, T)
• Haplotype: set of associated SNP alleles in a region of a chromosome. A haplotype is inherited as a unit.
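The definitions above imply that a genotype alone does not determine the phase. The following minimal sketch illustrates this by enumerating every haplotype pair consistent with a genotype. The function names and the tuple encoding of haplotypes are assumptions made for illustration and do not come from the slides.

```python
from itertools import product

def genotype(h1, h2):
    """Build the genotype: one unordered allele pair per locus."""
    return tuple(tuple(sorted(pair)) for pair in zip(h1, h2))

def compatible_haplotype_pairs(g):
    """Enumerate all unordered haplotype pairs that produce genotype g."""
    # Only heterozygous loci are ambiguous; homozygous loci are fixed.
    het = [j for j, (a, b) in enumerate(g) if a != b]
    pairs = set()
    for flips in product([False, True], repeat=len(het)):
        h1 = [a for a, _ in g]
        h2 = [b for _, b in g]
        for j, flip in zip(het, flips):
            if flip:
                h1[j], h2[j] = h2[j], h1[j]
        # frozenset makes the pair unordered, so mirrored pairs collapse.
        pairs.add(frozenset([tuple(h1), tuple(h2)]))
    return pairs

g = genotype("ACG", "ATT")
for pair in compatible_haplotype_pairs(g):
    print(sorted(pair))
```

With k heterozygous loci there are 2^(k-1) candidate pairs, which is why phase has to be inferred statistically for longer regions.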
Background
Dirichlet Process Representation
Let
• G_0(Φ) be a base measure for the Dirichlet process
• A^(k) := [A_1^(k), ..., A_J^(k)] be a founding haplotype configuration (ancestral template) at loci t = [1, ..., J]
• θ^(k) be the mutation rate of the ancestor
• Φ be the parameter associated with a mixture component, where Φ_k = {A^(k), θ^(k)}
Dirichlet Process Representation
• Use the Chinese Restaurant Process
• Associate each population haplotype with a table
• For each table, sample Φ_k = {A^(k), θ^(k)}
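The Chinese Restaurant Process view can be sketched in a few lines. This is a minimal illustration, not the presenters' code. The concentration parameter `alpha`, the binary allele alphabet, the Beta hyperparameter defaults, and all function names are assumptions.

```python
import random

def draw_phi(num_loci, rng, alpha_h=1.0, beta_h=1.0, alleles=(0, 1)):
    """Draw a table parameter {A, theta} from a factorized base measure.

    Assumption: A is uniform over haplotypes and theta is Beta distributed.
    """
    ancestor = tuple(rng.choice(alleles) for _ in range(num_loci))
    theta = rng.betavariate(alpha_h, beta_h)
    return ancestor, theta

def crp(n, alpha, num_loci, rng=None):
    """Seat n haplotypes at tables; each new table gets its own parameter."""
    rng = rng or random.Random(0)
    counts, params, assignments = [], [], []
    for _ in range(n):
        # Existing table k has weight n_k; a new table has weight alpha.
        weights = counts + [alpha]
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(counts):
            counts.append(1)
            params.append(draw_phi(num_loci, rng))
        else:
            counts[k] += 1
        assignments.append(k)
    return assignments, counts, params

assignments, counts, params = crp(n=20, alpha=1.0, num_loci=5)
print(counts)
```

The number of tables is not fixed in advance, which is what lets the model leave the number of ancestral haplotypes open.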
The Model
Assumptions
• G_0(A, θ) = p(A) p(θ)
• p(A) is a uniform distribution over all haplotypes
• p(θ) is Beta(α_h, β_h)
Distributions
• Considering mutations at all alleles:
• Integrating out θ:
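The formulas on the Distributions slide did not survive extraction, so the sketch below does not reproduce them. It only illustrates the general idea under a stated assumption: each locus of a haplotype differs from the ancestor independently with probability θ, and a mutated locus is uniform over the other alleles. The original paper's parameterization may differ. With θ ~ Beta(α_h, β_h), integrating θ out gives a ratio of Beta functions. All function names are hypothetical.

```python
from math import exp, lgamma, log

def log_beta(a, b):
    """Logarithm of the Beta function B(a, b)."""
    return lgamma(a) + lgamma(b) - lgamma(a + b)

def mismatches(h, a):
    """Number of loci where haplotype h differs from ancestor a."""
    return sum(x != y for x, y in zip(h, a))

def log_lik(h, a, theta, n_alleles=2):
    """log p(h | a, theta) under the assumed per-locus mutation model."""
    m = mismatches(h, a)
    return m * log(theta / (n_alleles - 1)) + (len(h) - m) * log(1 - theta)

def log_marginal(h, a, alpha_h, beta_h, n_alleles=2):
    """log p(h | a) with theta ~ Beta(alpha_h, beta_h) integrated out."""
    m = mismatches(h, a)
    J = len(h)
    return (log_beta(alpha_h + m, beta_h + J - m)
            - log_beta(alpha_h, beta_h)
            - m * log(n_alleles - 1))

print(exp(log_marginal((0, 1, 0), (0, 0, 0), alpha_h=1.0, beta_h=1.0)))
```

When several haplotypes share an ancestor, the same expression applies with the mismatch and match counts pooled over all of them.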
Noisy Observation Model
• The observed genotype at a locus is determined by the paternal and maternal alleles
• If the genotype disagrees with those alleles, penalize
• γ has a Beta prior
Pedigree-Haplotyper
Inference - Gibbs Sampling
• γ and θ are integrated out
• Sample C_it, A_j^(k), H_it,j^(k)
1) Given the current hidden values of the haplotypes, sample c_it and a_j
Gibbs Sampling
2) Given the ancestral assignment and the ancestral pool, sample the haplotype
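A rough sketch of the assignment part of step 1 follows, written as a generic collapsed Gibbs update for a Dirichlet process mixture rather than the authors' exact update. It assumes that θ is the per-locus probability of differing from the ancestor, that θ is integrated out against a Beta prior, and that a new ancestor is uniform over all haplotypes. The ancestor update and step 2 are not shown, and all names are hypothetical.

```python
import random
from math import exp, lgamma, log

def log_beta(a, b):
    return lgamma(a) + lgamma(b) - lgamma(a + b)

def mismatches(h, a):
    return sum(x != y for x, y in zip(h, a))

def log_predictive(h, ancestor, members, alpha_h, beta_h, n_alleles=2):
    """log p(h | ancestor, other members), with theta integrated out."""
    m_k = sum(mismatches(x, ancestor) for x in members)
    n_k = len(members) * len(ancestor)
    a_post = alpha_h + m_k            # Beta posterior after the other members
    b_post = beta_h + n_k - m_k
    m, J = mismatches(h, ancestor), len(h)
    return (log_beta(a_post + m, b_post + J - m)
            - log_beta(a_post, b_post) - m * log(n_alleles - 1))

def assignment_probs(h, clusters, alpha, alpha_h=1.0, beta_h=1.0, n_alleles=2):
    """clusters: list of (ancestor, members), members non-empty and excluding h."""
    log_w = [log(len(members))
             + log_predictive(h, anc, members, alpha_h, beta_h, n_alleles)
             for anc, members in clusters]
    # New ancestor: under a uniform prior the marginal of h is |B|^(-J).
    log_w.append(log(alpha) - len(h) * log(n_alleles))
    top = max(log_w)
    w = [exp(v - top) for v in log_w]
    total = sum(w)
    return [v / total for v in w]

def sample_assignment(h, clusters, alpha, rng, **kwargs):
    probs = assignment_probs(h, clusters, alpha, **kwargs)
    return rng.choices(range(len(probs)), weights=probs)[0]

clusters = [((0, 0, 0, 0), [(0, 0, 0, 0), (0, 0, 0, 0)]),
            ((1, 1, 1, 1), [(1, 1, 1, 1)])]
print(assignment_probs((0, 0, 0, 0), clusters, alpha=1.0))
```

The last entry of the returned list is the probability of founding a new ancestor. It shrinks geometrically with the number of loci.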
Metropolis Hastings
• With a long list of loci and a uniform prior p(a), the probability of sampling a new ancestor is very small
• This leads to slow mixing
• Sample the ancestor assignment using a proposal distribution
Metropolis Hastings
• In the acceptance probability, the proposal factor cancels out
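One standard way to get this cancellation in a Dirichlet process mixture is to propose the assignment from the Chinese Restaurant Process conditional prior. The target is prior times likelihood, so the prior terms cancel against the proposal and the acceptance probability reduces to a likelihood ratio. The sketch below assumes this construction. The slides do not specify the authors' proposal, and `draw_from_base` and `log_lik` are hypothetical callbacks.

```python
import random
from math import exp

def mh_assignment_step(current_phi, existing, alpha, draw_from_base, log_lik, rng):
    """One Metropolis-Hastings update of a single haplotype's component.

    existing: list of (phi, count) over the OTHER haplotypes' components.
    log_lik(phi): log-likelihood of this haplotype under component phi.
    Returns (phi, accepted).
    """
    # Propose from the conditional prior: existing phi w.p. proportional
    # to its count, a fresh draw from the base measure w.p. prop. to alpha.
    weights = [count for _, count in existing] + [alpha]
    idx = rng.choices(range(len(weights)), weights=weights)[0]
    proposal = existing[idx][0] if idx < len(existing) else draw_from_base(rng)

    # Prior and proposal cancel, leaving only the likelihood ratio.
    log_ratio = log_lik(proposal) - log_lik(current_phi)
    if log_ratio >= 0 or rng.random() < exp(log_ratio):
        return proposal, True
    return current_phi, False

rng = random.Random(0)
scores = {"close": -1.0, "far": -8.0, "fresh": -3.0}
phi, accepted = mh_assignment_step(
    "far", [("close", 4)], 0.5, lambda r: "fresh", scores.get, rng)
print(phi, accepted)
```

Because a fresh ancestor is proposed with prior probability proportional to alpha rather than weighted by its tiny likelihood, new ancestors get tried more often than in the plain Gibbs update.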
Experiments
• Simulated data: haplotypes randomly paired to form genotypes
• Performance compared to PHASE
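The simulation setup can be mimicked with a short sketch. It assumes haplotypes are drawn uniformly with replacement from a pool and that alleles are coded 0/1. The slides do not give these details, and the pool below is made up purely for illustration.

```python
import random

def simulate_genotypes(haplotype_pool, n_individuals, rng=None):
    """Pair random haplotypes and return (true_pair, genotype) per individual."""
    rng = rng or random.Random(0)
    data = []
    for _ in range(n_individuals):
        h1 = rng.choice(haplotype_pool)
        h2 = rng.choice(haplotype_pool)
        # The genotype keeps the alleles at each locus but drops the phase.
        g = tuple(tuple(sorted(pair)) for pair in zip(h1, h2))
        data.append(((h1, h2), g))
    return data

pool = [(0, 0, 1, 0), (1, 0, 1, 1), (0, 1, 0, 0)]  # hypothetical haplotypes
for true_pair, g in simulate_genotypes(pool, n_individuals=3):
    print(true_pair, g)
```

Keeping the true pair alongside each genotype makes it possible to score a phasing method against the known answer.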
Experiments
• Two real data sets: 129 individuals, 90 individuals from 4 populations
Dataset 1:
Experiments
Dataset 2:
• Small sample size, tougher data set
• Haplotyper outperforms PHASE
Conclusions
• The algorithm outperforms PHASE on two data sets, by a big margin on one of them
• The strength of the proposed approach lies in its flexibility
• It can be extended to incorporate aspects of evolutionary dynamics and other things
• Illustrated example: pedigree information

