Cbio 16S analysis pipeline Katie Lennard Microbiome analysis - PowerPoint PPT Presentation

Jul 24, 2023 •125 likes •318 views

Cbio 16S analysis pipeline Katie Lennard Microbiome analysis workflow Data preprocessing (UCT High Performance Cluster) Microbiome analysis workflow unsupervised classification correlations analyses Import data into R Microbiome analysis

Cbio 16S analysis pipeline Katie Lennard
Microbiome analysis workflow Data preprocessing (UCT High Performance Cluster)
Microbiome analysis workflow unsupervised classification correlations analyses Import data into R
Microbiome analysis workflow unsupervised classification Summary barplots correlations analyses Exploratory
Microbiome analysis workflow unsupervised classification Beta diversity: NMDS/PCoA correlations analyses Exploratory
Microbiome analysis workflow unsupervised classification Annotated heatmaps correlations analyses Exploratory
Microbiome analysis workflow unsupervised classification correlations analyses Differential abundance testing Downstream analyses
Microbiome analysis workflow correlations analyses Downstream analyses
Microbiome analysis workflow unsupervised classification unsupervised classification correlations analyses Downstream analyses
Microbiome analysis workflow unsupervised classification Biomarker discovery: random forests correlations analyses Downstream analyses
Customized .R script to make your life easier • Convert from phyloseq object to metagenomeSeq object • Get the lowest available taxonomic annotation for each OTU and merge counts at this level • Heatmap (using NMF package) customized for phyloseq objects • Can easily specify a subset of taxa and/or samples to plot • Select annotation colours • Select distance function for clustering • Choose to merge taxa at a given level (e.g. Genus) or plot individual OTUs • Generic barplot function build on phyloseq plot_bar() • Specify subset of samples • Filter OTUs so very rare ones (that just clog up the legend) are excluded • Merge at any taxonomic level (Family, Genus etc..) • Differential abundance testing + heatmap of significant results • Built around MetagenomSeq’s fitzig() and mrfulltable() functions • NB: currently only setup for two-class categorical comparisons • Correlations testing + correlation plot of significant results
Customized .R script to make your life easier • For PICRUSt data: takes the output from PICRUSt's metagenome_contributions.py, together with taxonomic annotation for the OTUs included in this table and provides a summary of the contribution of each Family/Genus.. etc to ONE SPECIFIC KEGG gene e.g. K02030 • Random forests analysis on the otu table of a supplied phyloseq object • The data is randomly divided into a training (two thirds of the data) and test set (remaining one third of the data not used for training) • Results printed to screen and written to file including: • most important taxa, AUC, PPV, NPV, OOB errors, class errors • option to specify the top N taxa to see how they perform
Random Forests output example
Random Forests output example
The 16S accreditation dataset: first look • Number of OTUs: 181 (140 retained after filtering) • Number of samples: 15 • Sample data summary (columns=Treatment; rows=Dog): 0 1 2 3 4 B 1 1 1 1 1 G 1 1 1 1 1 K 1 1 1 1 1
The 16S accreditation dataset: first look
The 16S accreditation dataset: first look
The 16S accreditation dataset: first look

Recommend

SANBio BIOINFORMATICS TRAINING COURSE THE MICROBIOME: ANALYSIS OF NGS DATA CBIO-PIPELINE SAMSON,

SANBio BIOINFORMATICS TRAINING COURSE THE MICROBIOME: ANALYSIS OF NGS DATA CBIO-PIPELINE SAMSON, KM 10/23/2017 Microbiome : Analysis of NGS Data 1 Outline Background Wet Lab! Raw reads Quality Assessment Quality Control Merging and

775 views • 38 slides

Efficient Analysis of Pipeline Models for WCET Computation Stephan Wilhelm (sw@absint.com)

A b s I n t Efficient Analysis of Pipeline Models for WCET Computation Stephan Wilhelm (sw@absint.com) AbsInt GmbH and Saarland University Outline 1. aiTs pipeline analysis 3. Why improve its efficiency? 5. BDD based pipeline analysis 7.

300 views • 10 slides

ClusterPCAML November 13, 2018 1 Lecture 23: Clustering and machine learning CBIO (CSCI)

ClusterPCAML November 13, 2018 1 Lecture 23: Clustering and machine learning CBIO (CSCI) 4835/6835: Introduction to Computational Biology 1.1 Overview and Objectives Its nice when youre able to automate a lot of your data analysis.

548 views • 32 slides

Big Data: Pipeline Demo Day Analysis of white matter shapes Nic Novak NSIDP 2 nd Year,

Big Data: Pipeline Demo Day Analysis of white matter shapes Nic Novak NSIDP 2 nd Year, Laboratory of Neuroimaging Summary White matter morphology and Alzheimers LONI Pipeline / methodology Results A problem for Pipeline

163 views • 14 slides

How To: Run the ENCODE long-RNA-seq analysis pipeline on DNAnexus Overview: In this exercise, we

How To: Run the ENCODE long-RNA-seq analysis pipeline on DNAnexus Overview: In this exercise, we will run the ENCODE Uniform Processing Long RNA-seq Pipeline on a small test dataset containing reads from chromosome 21 sampled from an ENCODE

613 views • 14 slides

THE RNA-SEQ ANALYSIS PIPELINE Alicia Oshlack Murdoch Childrens

THE RNA-SEQ ANALYSIS PIPELINE Alicia Oshlack Murdoch Childrens Research Ins5tute Two ways to look at sequencing data Sequence of Posi5on of mapped

929 views • 50 slides

THE DATA MINING PIPELINE What is data? The data mining pipeline: collection, preprocessing,

DATA MINING THE DATA MINING PIPELINE What is data? The data mining pipeline: collection, preprocessing, mining, and post-processing Sampling, feature extraction and normalization Exploratory analysis of data basic statistics What is data

2.11k views • 139 slides

How To: Run the ENCODE histone ChIP-seq analysis pipeline

How To: Run the ENCODE histone ChIP-seq analysis pipeline on DNAnexus Overview: In this exercise, we will run the ENCODE Uniform Processing

328 views • 15 slides

ComputationalModeling September 30, 2018 1 Lecture 15: Computational Modeling CBIO (CSCI)

ComputationalModeling September 30, 2018 1 Lecture 15: Computational Modeling CBIO (CSCI) 4835/6835: Introduction to Computational Biology 1.1 Overview and Objectives So far, weve discussed Hidden Markov Models as way to encapsulate and

332 views • 14 slides

ComputationalModeling February 11, 2020 1 Lecture 14: Computational Modeling CBIO (CSCI)

ComputationalModeling February 11, 2020 1 Lecture 14: Computational Modeling CBIO (CSCI) 4835/6835: Introduction to Computational Biology 1.1 Overview and Objectives So far, weve discussed Hidden Markov Models as way to encapsulate and

614 views • 15 slides

Lecture6_ModulesNumPyIO August 30, 2018 1 Lecture 6: Modules, NumPy, and File I/O CBIO (CSCI)

Lecture6_ModulesNumPyIO August 30, 2018 1 Lecture 6: Modules, NumPy, and File I/O CBIO (CSCI) 4835/6835: Introduction to Computational Biology 1.1 Overview and Objectives So far, all the data weve worked with have been

228 views • 20 slides

Pipeline Construction Pipeline Construction Challenges Challenges NAPCA Workshop August 19,

U.S. Department of Transportation Pipeline and Hazardous Materials Safety Administration Pipeline Construction Pipeline Construction Challenges Challenges NAPCA Workshop August 19, 2010 Houston, Texas Kenneth Y. Lee Office of Pipeline

507 views • 50 slides

Building a digital business Kalman Tiboldi Founder & CEO ( Former CBIO of TVH) TVH Group: 2

A TVH Parts company Building a digital business Kalman Tiboldi Founder & CEO ( Former CBIO of TVH) TVH Group: 2 Business Units 6800 colleagues worldwide Digital innovation at the core of TVH In-house developed IT & Business Close

639 views • 25 slides

Bioimaging1 November 1, 2018 1 Lecture 21: Bioimaging I CBIO (CSCI) 4835/6835: Introduction to

Bioimaging1 November 1, 2018 1 Lecture 21: Bioimaging I CBIO (CSCI) 4835/6835: Introduction to Computational Biology 1.1 Overview and Objectives Now that weve covered the basics of computer vision, lets look at how this can be applied

577 views • 20 slides

DynamicProgramming February 3, 2020 1 Lecture 9: Dynamic Programming CBIO (CSCI) 4835/6835:

DynamicProgramming February 3, 2020 1 Lecture 9: Dynamic Programming CBIO (CSCI) 4835/6835: Introduction to Computational Biology 1.1 Overview and Objectives Weve so far discussed sequence alignment from the perspective of distance

685 views • 40 slides

Bioimaging2 November 7, 2018 1 Lecture 22: Bioimaging II CBIO (CSCI) 4835/6835: Introduction to

Bioimaging2 November 7, 2018 1 Lecture 22: Bioimaging II CBIO (CSCI) 4835/6835: Introduction to Computational Biology 1.1 Overview and Objectives Today, well wrap up our module on image processing with some more in-depth examples of how to

737 views • 19 slides

Towards a BES Light Source Wide Event-triggered Tomography Data Analysis Pipeline Using a

Towards a BES Light Source Wide Event-triggered Tomography Data Analysis Pipeline Using a Sustainable Software Stack Hari Krishnan, Lawrence Berkeley National Laboratory CAMERA Center for Advanced Mathematics for Energy Research Applications

585 views • 37 slides

NUMERICAL ANALYSIS OF EROSION OF GAS-PIPELINE ELEMENTS A.A. Ryabov, Kudryavtsev A. Yu., Voronkov

NUMERICAL ANALYSIS OF EROSION OF GAS-PIPELINE ELEMENTS A.A. Ryabov, Kudryavtsev A. Yu., Voronkov O.V., (Sarov Engineering Center, Russia) Haritonov A.N, Maltsev A.I., Melnikov I.V., Kiselev M.N. (Gazprom DobychaNadym) Matt Straw (Norton Straw, UK)

311 views • 17 slides

IntroProbability September 17, 2018 1 Lecture 12: Introduction to Probability CBIO (CSCI)

IntroProbability September 17, 2018 1 Lecture 12: Introduction to Probability CBIO (CSCI) 4835/6835: Introduction to Computational Biology 1.1 Overview and Objectives Before we can jump into computational modeling, we have to cover some

340 views • 10 slides

1,000 foot pipeline Connect Replacement (Saugus 3 and 4) Wells to Magic Mountain Pipeline

Magic Mountain Water Pipeline Installation Agreement Amendment Commerce Center Drive Pipeline Background 1,000 foot pipeline Connect Replacement (Saugus 3 and 4) Wells to Magic Mountain Pipeline Five Point will oversee construction

60 views • 3 slides

Pipeline A Presentation by Team Pipeline Ben Lai Brandon Bakhshai Jeffrey Serio Somya

Pipeline A Presentation by Team Pipeline Ben Lai Brandon Bakhshai Jeffrey Serio Somya Vasudevan What is pipeline Pipeline is an asynchronous programming language that uses an event-driven architecture. Pipelines event-loop is powered by

330 views • 20 slides

ComputerVision October 30, 2018 1 Lecture 20: Introduction to Computer Vision CBIO (CSCI)

ComputerVision October 30, 2018 1 Lecture 20: Introduction to Computer Vision CBIO (CSCI) 4835/6835: Introduction to Computational Biology 1.1 Overview and Objectives This week, were moving into image processing. In this lecture, well

448 views • 20 slides

A Pipeline for Scalable Text Reuse Analysis Milad Alshomary Bauhaus Universitt 05.07.2018

A Pipeline for Scalable Text Reuse Analysis Milad Alshomary Bauhaus Universitt 05.07.2018 Milad Alshomary Pipeline for TR extraction 05.07.2018 1 Overview Motivation A Pipeline for Scalable Text Reuse Extraction Application on

1.01k views • 88 slides

Highlights Highlights of of New New Pipeline Pipeline Medicines Medicines Based on Meds

Highlights Highlights of of New New Pipeline Pipeline Medicines Medicines Based on Meds Pipeline Monitor 2018 CADTH Symposium April 15, 2019 Jared Berger Policy Analyst Providing insight into potentially high-impact medicines in the

318 views • 9 slides