CSE 527 Lecture 10: More on the Gibbs Sampler

Oct 02, 2022

Projects – see web
• Implementation or literature review
• Small (interdisciplinary) groups preferred
• Suggestion:
  • make a schedule
  • bite-size pieces
• Some ideas on web/by email, and I’m happy to talk/listen/give (bad?) advice: send email
AlignACE (Roth et al. 1998)
• Lawrence et al.: protein motifs
• Roth et al.: DNA regulatory motifs
• Differences:
  • Genomic background model, e.g. the yeast Saccharomyces cerevisiae genome is 62% A-T
  • both strands used
  • overlapping sites prohibited
  • Multiple motifs: find best & mask
  • “MAP” scoring; “specificity” scoring
Rocke & Tompa (RECOMB ’98)
• Gibbs, adapted for gapped motifs
• single “genomic” DNA sequence
Why Gaps
• Biology often tolerates diversity
• 2 similar TFs bind 2 similar sites
• Same TF binds 2 sites (perhaps one better than the other)
• Dimeric TFs often “don’t care” in middle & flexible
• TF and/or DNA may twist/bulge
A Gapped Motif
Why gaps are hard
• Alignment
  • Pairwise -- O(n^2) dynamic programming
  • Multiple -- O(n^k)
  • Gibbs/MEME/... require many alignments
• Scoring
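
To make the O(n^2) pairwise cost concrete, here is a minimal sketch of global alignment scoring by dynamic programming (Needleman-Wunsch with a linear gap cost). The function name and the match/mismatch/gap values are illustrative assumptions, not taken from the lecture.

```python
def global_alignment_score(a, b, match=1, mismatch=-1, gap=-2):
    """Best global alignment score of strings a and b.

    Fills a (len(a)+1) x (len(b)+1) table one row at a time, so the
    running time is O(len(a) * len(b)). Scores are assumed values.
    """
    # Row 0: aligning the empty prefix of a against b[:j] costs j gaps.
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap] + [0] * len(b)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # Either pair a[i-1] with b[j-1], or gap one of the two strings.
            curr[j] = max(diag, prev[j] + gap, curr[j - 1] + gap)
        prev = curr
    return prev[-1]
```

Extending the same table to k strings gives one dimension per string, which is where the O(n^k) figure for multiple alignment comes from.
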
R/T Approach - Scores
• WMM
• Relative entropy, aka expected LLR
• Score gaps like background, “minus a small penalty”
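
As an illustration of the relative entropy score, the sketch below computes it for an ungapped set of aligned sites against a background distribution. The function name, the input layout, and the optional pseudocount are assumptions for this example; the gap handling mentioned on the slide is not modeled here.

```python
import math

def relative_entropy(sites, background, pseudocount=0.0):
    """Relative entropy in bits of equal-length aligned sites vs. background.

    sites: list of strings, all the same width (one row per motif instance).
    background: dict mapping each base to its background frequency.
    Sums p * log2(p / q) over every column and every base.
    """
    width = len(sites[0])
    total = 0.0
    for col in range(width):
        counts = {base: pseudocount for base in background}
        for site in sites:
            counts[site[col]] += 1
        n = sum(counts.values())
        for base, count in counts.items():
            if count > 0:  # treat 0 * log(0) as 0
                p = count / n
                total += p * math.log2(p / background[base])
    return total

uniform = {"A": 0.25, "C": 0.25, "G": 0.25, "T": 0.25}
print(relative_entropy(["ACGT", "ACGT"], uniform))  # 8.0 (2 bits per column)
```
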
R/T Approach - Alignment
• Gibbs replaces 1 string per iteration
• Use pairwise alignment between new string and previously computed alignment of remaining k-1
• Actually align motif against whole genome - Time O(genome length × motif width)
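
The O(genome length × motif width) bound is what a standard semi-global dynamic program gives when a motif is aligned, with gaps, against every position of a long sequence. The sketch below is one such program, not necessarily the authors' exact recurrence; `best_gapped_hit`, the per-column score dictionaries, and the flat gap penalty are assumptions for illustration.

```python
def best_gapped_hit(genome, wmm, gap=-2.0):
    """Best-scoring gapped match of a motif anywhere in genome.

    wmm: list with one dict per motif column, mapping base -> score.
    Returns (score, end), where end is the 1-based genome position at
    which the best hit ends. Runs in O(len(genome) * len(wmm)) time.
    """
    width = len(wmm)
    # Before any genome base is read, covering j columns means skipping them.
    prev = [j * gap for j in range(width + 1)]
    best_score, best_end = prev[width], 0
    for i, base in enumerate(genome, start=1):
        curr = [0.0] * (width + 1)  # curr[0] = 0: a hit may start anywhere
        for j in range(1, width + 1):
            curr[j] = max(
                prev[j - 1] + wmm[j - 1][base],  # base aligned to column j
                prev[j] + gap,                   # extra genome base inside the hit
                curr[j - 1] + gap,               # motif column j skipped
            )
        if curr[width] > best_score:
            best_score, best_end = curr[width], i
        prev = curr
    return best_score, best_end
```

Only two rows of the table are kept at a time, so memory stays proportional to the motif width even for a whole genome.
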
R/T Approach - “Gibbs”
• discard 0-2 random strings at each iteration
• pick replacement greedily, not by sampling; avoid local max by random restarts (see Rocke’s thesis for more on this)
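
The greedy-replacement idea with random restarts can be sketched on a simplified problem: ungapped sites, one per input string, with exactly one string discarded per step (the slide says 0-2). Everything here, including the function names, the uniform background, and the restart and sweep counts, is an assumption for illustration rather than the Rocke & Tompa procedure itself.

```python
import math
import random

def column_score(sites):
    """Relative entropy (bits) of aligned sites vs. a uniform background."""
    total = 0.0
    for col in zip(*sites):
        for base in set(col):
            p = col.count(base) / len(col)
            total += p * math.log2(p / 0.25)
    return total

def greedy_motif_search(seqs, width, restarts=20, sweeps=50, seed=0):
    """Hill-climb site positions greedily; keep the best of several restarts."""
    rng = random.Random(seed)

    def score(positions):
        return column_score([s[p:p + width] for s, p in zip(seqs, positions)])

    best_score, best_pos = float("-inf"), None
    for _ in range(restarts):
        # Random restart: pick one random site per string.
        pos = [rng.randrange(len(s) - width + 1) for s in seqs]
        for _ in range(sweeps):
            i = rng.randrange(len(seqs))  # discard one string's site
            candidates = range(len(seqs[i]) - width + 1)
            # Greedy choice (no sampling): the position maximizing the score.
            pos[i] = max(candidates, key=lambda p: score(pos[:i] + [p] + pos[i + 1:]))
        current = score(pos)
        if current > best_score:
            best_score, best_pos = current, pos[:]
    return best_score, best_pos
```

Because the current position is always among the candidates, each greedy step can only keep or raise the score, so a run can stall at a local maximum; the restarts are what give it a chance to escape.
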
Test Data
• Haemophilus influenzae
• ~1.8 megabases
• Delete all protein-coding regions, leaving ~350 kb
• Concatenate, separated with markers
• Plus reverse complement, total ~700 kb
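
The construction of the search sequence can be sketched as follows. The marker character ("N") and the function names are assumptions; the slide does not say which marker was used.

```python
def reverse_complement(seq):
    """Reverse complement of a DNA string; 'N' markers map to themselves."""
    comp = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}
    return "".join(comp[base] for base in reversed(seq))

def build_search_sequence(regions, marker="N"):
    """Join non-coding regions with markers, then append the reverse complement.

    The markers keep a candidate motif from spanning two unrelated regions,
    and the appended strand roughly doubles the length (~350 kb -> ~700 kb).
    """
    forward = marker.join(regions)
    return forward + marker + reverse_complement(forward)

print(build_search_sequence(["AC", "GG"]))  # ACNGGNCCNGT
```
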
Motif width=10
A Motif + Context
Rewindowing
• After convergence, “rewindow” -- choose subset of rows and adjust left/right boundaries to maximize score.
• NP-hard? Use another greedy heuristic
Rewindowing
A closer look at 35
• 6 almost perfectly identical regions of 5.3 kb, each containing 3 rRNA genes plus some tRNA genes
• 9% of genome but 50% of high-scoring motifs
• removed 80 kb containing them & re-ran
After Removal
More rewindowing
• 0 & 1 identical for another 55 bases; 5 differences in next 44. Probably not a TFBS, but not “random”
Summary
• handles gaps
• greedy “sampling” / random restarts
• avoids full multiple alignment by exploiting good partial alignment
• validation - null model for comparison
• look at data:
  • rewindowing
  • rRNA cluster
