Examples of online analysis tools for gene expression data


  1. Examples of online analysis tools for gene expression data: tools integrated in data repositories; tools for raw data analysis (CEL files or other scanner output); processed data analysis tools; tools linking gene expression with gene function; tools linking gene expression with sequence analysis.

  2. Tools from the data repositories. Advantages: fast; already computed for a huge amount of public data; allow a quick-and-dirty overview of “what's already known”. Drawbacks: not usable for custom data; not flexible, with little room for tuning. Examples: GEO, ArrayExpress, SAGEmap.

  3. GEO tools. Raw data retrieval (SOFT or matrix-formatted objects). GEO DataSet Cluster Analysis: a visualization tool for displaying precomputed cluster heat maps. GEO Profiles: expression profiles for each gene/spot of one selected dataset.
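
The matrix-formatted objects mentioned in slide 3 are plain tab-delimited text, so they are easy to load once retrieved. Below is a minimal Python sketch, assuming the usual series-matrix layout ("!"-prefixed metadata lines, with the data table between `!series_matrix_table_begin` and `!series_matrix_table_end`). The sample IDs, probe names, and the set of tokens treated as missing values are made up for illustration.

```python
import csv
import io

# Toy stand-in for a downloaded series-matrix file (hypothetical IDs and values).
SAMPLE = (
    '!Series_title\t"hypothetical demo series"\n'
    "!series_matrix_table_begin\n"
    '"ID_REF"\t"GSM0001"\t"GSM0002"\n'
    '"probe_a"\t7.5\t8.1\n'
    '"probe_b"\t3.2\tnull\n'
    "!series_matrix_table_end\n"
)


def to_float(cell):
    """Convert one table cell to float; missing-value tokens (assumed) give None."""
    cell = cell.strip()
    if cell == "" or cell.lower() in {"null", "na", "nan"}:
        return None
    return float(cell)


def parse_series_matrix(handle):
    """Return (sample_ids, {probe_id: [values]}) from a series-matrix stream."""
    in_table = False
    table_lines = []
    for line in handle:
        tag = line.strip().lower()
        if tag == "!series_matrix_table_begin":
            in_table = True
        elif tag == "!series_matrix_table_end":
            break
        elif in_table:
            table_lines.append(line)

    rows = csv.reader(table_lines, delimiter="\t")
    header = next(rows, None)
    if header is None:
        return [], {}

    values = {}
    for row in rows:
        if row:  # skip blank lines inside the table
            values[row[0]] = [to_float(cell) for cell in row[1:]]
    return header[1:], values


samples, values = parse_series_matrix(io.StringIO(SAMPLE))
print(samples)
print(values)
```

For a real download, the `io.StringIO(SAMPLE)` argument would be replaced by an open file handle.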

  4. GEO DataSet cluster analysis: example

  5. GEO DataSet cluster analysis: example

  6. GEO DataSet cluster analysis: example

  7. GEO differential expression analysis: example

  8. ArrayExpress tools. Processed (matrix) or raw data retrieval. Expression profiles (per gene and per experiment).

  9. SAGE Anatomic Viewer (SAV). Displays gene expression results based on SAGE tag counts in normal and malignant human tissues.

  10. Tools for raw file transformation. Input: Affymetrix CEL files; GenePix or ScanAlyze output files. Functions: standard microarray corrections and normalization; background correction; spot filtering; intra- and inter-chip normalization; replicate scaling; data quality assessment and scoring.
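
To make the correction and normalization steps of slide 10 concrete, here is a minimal Python sketch of three of them: background subtraction, log2 transformation, and a simple inter-chip scaling by median centering. It is an illustration under stated assumptions, not the method of any tool named in the slides. The `floor` value and the intensities are hypothetical.

```python
import math
from statistics import median


def background_correct(foreground, background, floor=1.0):
    """Subtract local background; clamp at `floor` (an assumed value) so log2 is defined."""
    return [max(f - b, floor) for f, b in zip(foreground, background)]


def log2_transform(values):
    """Put intensities on a log2 scale, as is common for microarray data."""
    return [math.log2(v) for v in values]


def median_center(chips):
    """Shift each chip's log values so that every chip has a median of zero."""
    centered = []
    for chip in chips:
        chip_median = median(chip)
        centered.append([v - chip_median for v in chip])
    return centered


# Hypothetical foreground/background intensities for two chips, three spots each.
chip_a = log2_transform(background_correct([120, 540, 2060], [20, 28, 12]))
chip_b = log2_transform(background_correct([264, 1040, 4100], [8, 16, 4]))

normalized = median_center([chip_a, chip_b])
print(normalized)
```

Median centering is only one of several inter-chip normalization schemes. It is used here because it fits in a few lines.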

  11. Tools for raw file transformation: Express Yourself

  12. Processed data analysis tools. Drawbacks: can be quite slow; input data format is very important; you need to know your data well before using them. Advantages: usually offer many functions; usable for custom data; can be very flexible. Examples: CIMminer, GEDA, Expression Profiler, GEPAS.

  13. CIMminer. Generates color-coded Clustered Image Maps (CIMs), also called "heat maps". Easy to use, but with few tuning options. A good starting point among online clustering tools.
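
A clustered image map is a heat map whose rows (and usually columns) have been reordered by clustering, so that similar profiles sit next to each other. The Python sketch below shows only that reordering step, assuming average-linkage agglomerative clustering on the distance 1 - Pearson correlation. The slides do not say which algorithms or options CIMminer offers, and the gene names and values are hypothetical.

```python
import math
from itertools import combinations
from statistics import mean


def pearson(x, y):
    """Pearson correlation of two equal-length profiles (0.0 if one is flat)."""
    mx, my = mean(x), mean(y)
    num = sum((a - mx) * (b - my) for a, b in zip(x, y))
    den = math.sqrt(sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y))
    return num / den if den else 0.0


def cluster_order(profiles):
    """Order row names so that each merged cluster forms a contiguous block.

    Brute-force average-linkage clustering: fine for a handful of rows,
    far too slow for a full chip.
    """
    clusters = [[name] for name in profiles]

    def distance(a, b):
        # Average of 1 - correlation over all cross-cluster pairs of rows.
        return mean(1 - pearson(profiles[i], profiles[j]) for i in a for j in b)

    while len(clusters) > 1:
        a, b = min(combinations(clusters, 2), key=lambda pair: distance(*pair))
        clusters = [c for c in clusters if c is not a and c is not b]
        clusters.append(a + b)
    return clusters[0] if clusters else []


# Hypothetical expression profiles: two rising genes and two falling genes.
demo = {
    "gene_a": [1.0, 2.0, 3.0, 4.0],
    "gene_b": [4.0, 3.0, 2.0, 1.0],
    "gene_c": [2.0, 4.0, 6.0, 8.5],
    "gene_d": [8.0, 6.0, 4.0, 1.5],
}
print(cluster_order(demo))
```

Drawing the heat map itself then amounts to plotting the matrix rows in the returned order.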

  14. GEDA. Specifically designed for the integrated analysis of global gene expression patterns in cancer. Easy to use, BUT be careful when interpreting the results.

  15. GEDA: a few screenshots

  16. GEDA: a few screenshots

  17. GEDA: a few screenshots

  18. GEDA: a few screenshots

  19. GEDA: a few screenshots

  20. GEDA: a few screenshots

  21. Expression Profiler at EBI

  22. Expression Profiler at EBI

  23. Expression Profiler at EBI

  24. GEPAS

  25. GEPAS

  26. GEPAS

  27. GEPAS

  28. GEPAS

  29. GEPAS

  30. Tools to retrieve gene functions and annotations. Goals: link Gene Ontology information to co-expressed genes; find pathway specificities under certain biological conditions; find promoter elements common to co-expressed genes. Input files: expression data matrix with classes AND gene names; gene lists to compare; promoter sequences in FASTA format. Examples: CARRIE; Babelomics; DAVID (Database for Annotation, Visualization and Integrated Discovery); INCLUSive: MotifSampler; SSA.

  31. CARRIE: Computational Ascertainment of Regulatory Relationships Inferred from Expression. Input: expression data matrix with gene IDs and sample classes; associated promoter sequences. Output: known transcription factors associated with co-expressed genes; KEGG pathways associated with genes; Gene Ontology for selected genes.

  32. CARRIE

  33. CARRIE

  34. Babelomics: FatiGO. Linked to the GEPAS gene expression analysis tools. Web tools for functional annotation and analysis of groups of genes in high-throughput experiments.

  35. Babelomics: FatiGO. Input: two gene lists to compare (differentially expressed genes); different gene IDs supported (Entrez, HUGO, RefSeq, Affy...); uses the GO (Gene Ontology) database. Output: summary of the input parameters; summary of the input data (initial number of genes, number of genes with an Ensembl correspondence, and number of genes used for the analysis); links to the results for each selected repository, with the number of genes for which Gene Ontology annotation exists; graphical view of the GO terms represented in the gene lists.
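
Comparing how often a GO term appears in two gene lists comes down to a 2x2 contingency table. The Python sketch below assumes a two-sided Fisher's exact test, a common choice for this kind of comparison; the slides do not state which test FatiGO applies. Gene names and annotations are hypothetical, and no multiple-testing correction is included.

```python
from math import comb


def term_table(list_one, list_two, annotated):
    """Build the 2x2 table (a, b, c, d) for one GO term.

    a/b: genes of list one with/without the term; c/d: same for list two.
    """
    one, two = set(list_one), set(list_two)
    a = len(one & annotated)
    c = len(two & annotated)
    return a, len(one) - a, c, len(two) - c


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact p-value for the table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of seeing x annotated genes in list one.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_observed = prob(a)
    lowest, highest = max(0, col1 - row2), min(row1, col1)
    # Sum every table at least as unlikely as the observed one
    # (small tolerance guards against floating-point ties).
    return sum(
        p
        for p in (prob(x) for x in range(lowest, highest + 1))
        if p <= p_observed * (1 + 1e-9)
    )


# Hypothetical lists of up- and down-regulated genes and one term's annotations.
up_genes = ["g1", "g2", "g3", "g4"]
down_genes = ["g5", "g6", "g7", "g8"]
has_term = {"g1", "g2", "g3", "g5"}

table = term_table(up_genes, down_genes, has_term)
print(table, fisher_exact_two_sided(*table))
```

In practice the test is repeated for every GO term, which is why a correction for multiple testing would normally follow.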

  36. Babelomics: FatiGO

  37. Babelomics: FatiGO

  38. Babelomics: FatiGO

  39. MotifSampler. Description: part of the INCLUSive suite, which also contains gene expression data analysis tools; tries to find motifs in a given list of sequences. Input: sequences in FASTA format; an organism-specific background model (given); motif length; number of motifs to retrieve. Output: a list of motif instances for each input sequence.
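
To illustrate the input side of slide 39, the Python sketch below reads FASTA-formatted sequences and counts how many of them contain each k-mer of a chosen motif length. This is a deliberately naive stand-in, not MotifSampler's algorithm: it uses no background model and only finds exact matches. The sequences and names are hypothetical.

```python
from collections import Counter

# Hypothetical promoter sequences in FASTA format.
FASTA = """>promoter_1 demo
ACGTTGACGT
ATTA
>promoter_2
CCTTGACGAA
>promoter_3
GGGTTGACTC
"""


def read_fasta(text):
    """Return {sequence_name: sequence} from FASTA text (multi-line records allowed)."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:].split()
            # Use the first word of the header as the name; fall back to a counter.
            name = header[0] if header else f"seq{len(records) + 1}"
            records[name] = ""
        elif name is not None:
            records[name] += line.upper()
    return records


def shared_kmers(records, k):
    """Count, for each k-mer, the number of sequences that contain it at least once."""
    counts = Counter()
    for seq in records.values():
        counts.update({seq[i:i + k] for i in range(len(seq) - k + 1)})
    return counts


records = read_fasta(FASTA)
counts = shared_kmers(records, k=5)
print(counts.most_common(3))
```

A k-mer present in every sequence is only a candidate: without a background model there is no way to tell whether it is more frequent than expected by chance.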

  40. Other online tool: ArrayQuest. Applies to data from GEO or to custom data. Contains Bioconductor methods and BioPerl- and C++-based scripts. Accepts submissions of new analysis methods.
