from reference genes to global mean normalization
play

From reference genes to global mean normalization Jo Vandesompele - PowerPoint PPT Presentation

From reference genes to global mean normalization Jo Vandesompele professor, Ghent University co-founder and CEO, Biogazelle qPCR Symposium USA November 9, 2009 Millbrae, CA outline what is normalization gold standard for mRNA


  1. From reference genes to global mean normalization Jo Vandesompele professor, Ghent University co-founder and CEO, Biogazelle qPCR Symposium USA November 9, 2009 – Millbrae, CA

  2. outline  what is normalization  gold standard for mRNA normalization  global mean normalization and selection of stable small RNAs for microRNA normalization

  3. introduction to normalization  2 sources of variation in gene expression results  biological variation (true fold changes)  experimentally induced variation (noise and bias)  purpose of normalization is reduction of the experimental variation  input quantity: RNA quantity, cDNA synthesis efficiency, …  input quality: RNA integrity, RNA purity, …  gold standard is the use of multiple stably expressed reference genes  which genes?  how many?  how to do the calculations?

  4. normalization: geNorm solution  framework for qPCR gene expression normalisation using the reference gene concept:  quantified errors related to the use of a single reference gene (> 3 fold in 25% of the cases; > 6 fold in 10% of the cases)  developed a robust algorithm for assessment of expression stability of candidate reference genes  proposed the geometric mean of at least 3 reference genes for accurate and reliable normalisation  Vandesompele et al., Genome Biology, 2002

  5. geNorm software  automated analysis  ranking of candidate reference genes according to their stability  determination of how many genes are required for reliable normalization  http://medgen.ugent.be/genorm

  6. geNorm validation (I)  cancer patients survival curve statistically more significant results log rank statistics NF4 0.003 NF1 0.006 0.021 0.023 0.056 Hoebeeck et al., Int J Cancer, 2006

  7. geNorm validation (II)  mRNA haploinsufficiency measurements accurate assessment of small expression differences patient / control  3 independent experiments  95% confidence intervals  Hellemans et al., Nature Genetics, 2004

  8. normalization using multiple stable reference genes  geNorm is the de facto standard for reference gene validation and normalization  > 2,000 citations of our geNorm technology  > 10,000 geNorm software downloads in 100 countries

  9. global mean normalization  when a large set of genes are measured, the average expression level reflects the input amount and could be used for normalization  e.g. microarray based normalization o lowess, mean ratio, …  SAGE / NGS sequencing counts  the set of genes must be unbiased and sufficiently large  we make use of this principle to normalize microRNA data from experiments in which we quantify a substantial number of miRNAs (450 or 650) in a given sample

  10. global mean normalization  small-RNA controls  classic normalization strategy  small nuclear RNAs, small nucleolar RNAs  18 available from Applied Biosystems  global mean normalization  method applied for microarray data  universal: applicable for every miRNA dataset  many datapoints needed (megaplex vs. multiplex)  miRNAs/controls that resemble the mean  minimal standard deviation when comparing miRNA expression with mean ( geNorm V value, standard deviation of log transformed ratios)  compatible with multiplex assays  need to determine mean

  11. small RNA controls  How ‘stable’ is the global mean compared to controls?  geNorm analysis using controls and mean as input variables  exclusion of potentially co-regulated controls HY3 7q36 RNU19 5q31.2 RNU24 9q34 RNU38B 1p34.1-p32 RNU43 22q13 RNU44 1q25.1 RNU48 6p21.32 RNU49 17p11.2 RNU58A 18q21 RNU58B 18q21 RNU66 1p22.1 RNU6B 10p13 U18 15q22 U47 1q25.1 U54 8q12 U75 1q25.1 Z30 17q12 RPL21 13q12.2

  12. miRNA expression datasets  neuroblastoma tumour samples  T-ALL samples  EVI1 deregulated leukemias  retinoblastoma tumour samples  normal tissues  normal bone marrow

  13. T-ALL geNorm ranking 1,8 1,6 1,4 expression stability 1,2 1 0,8 0,6 0,4 0,2 0

  14. geNorm ranking neuroblastoma leukemia EVI1 overexpression bone marrow pool normal tissues

  15. neuroblastoma – removal of variation 120 100 80 not normalised stable controls 60 mean miRNAs 40 20 0 0 50 100 150 200 250 300

  16. removal of variation leukemia EVI1 overexpression T-ALL bone marrow pool normal tissues

  17. biological validation  MYCN binds to the mir-17-92 promoter CATGTG CACGTG CACGTG CATGTG CATGTG CATGTG CATGTG mir-17-92 cluster CpG island -5 kb +5 kb A B C 12 11 10 IMR5 9 Fold enrichment 8 WAC2 7 6 5 4 3 2 1 0 A B C Amplicon

  18. biological validation  choice of normalization strategy influences differential miRNA expression  Mir-17-92 expression in neuroblastoma tumours 3,5 3 2,5 2 1,5 stable controls mean 1 miRNAs 0,5 0

  19. biological validation  choice of normalization strategy influences differential miRNA expression  Mir-17-92 expression in neuroblastoma tumours 3,5 3 2,5 2 1,5 stable controls mean 1 miRNAs 0,5 0

  20. biological validation  choice of normalization strategy influences differential miRNA expression  Mir-17-92 expression in neuroblastoma tumours 3,5 3 2,5 2 1,5 stable controls mean 1 miRNAs 0,5 0

  21. balanced differential expression 3 controls 2 mean fold change (MYCN amplified vs. MYCN single copy) 1 0 -1 -2 average FC controls = -0.404 average Fc mean = 0.050 -3 average FC miRNAs = 0.124 -4 -5 -6 -7

  22. correlation MYCN downregulated genes – 2 normalization strategies stable miRNA control normalisation mean normalisation

  23. strategy also works for microarray data  each sample is measured by RT-qPCR and microarray  global mean normalization  standardization per method  hierarchical clustering  samples cluster by sample (and NOT by method)

  24. conclusions global mean normalization  novel and powerful miRNA normalization strategy  maximal reduction of technical noise  improved identification of differentially expressed genes  balancing of differential expression  universally applicable o global mean o multiple stable endogenous controls  Mestdagh et al., Genome Biology, 2009

  25. qbase PLUS normalization  most powerful, flexible and user-friendly real-time PCR data-analysis software  based on Ghent University’s geNorm and qBase technology  state of the art normalization procedures o one or more classic reference genes o global mean normalization o expressed repeat normalization  detection and correction of inter-run variation  dedicated error propagation  fully automated analysis; no manual interaction required  booth 19 http://www.qbaseplus.com

  26. conclusions  proper normalization has a major impact on your results  provides statistically more significant results  enables accurate assessment of small expression differences  gold standard for mRNA gene expression analysis  geNorm evaluation of candidate reference genes  geometric mean of multiple stably expressed reference genes  global mean normalization and subsequent geNorm based selection of reference genes that resemble the mean is a valid option when measuring a large and unbiased set of genes (e.g. all miRNAs)

  27. acknowledgments  miRNA  Pieter Mestdagh (UGent)  Frank Speleman (UGent)  Applied Biosystems  qbase PLUS  Jan Hellemans (Biogazelle – UGent)  Stefaan Derveaux (Biogazelle – UGent)

  28.  January 28-29, 2010 Ghent, Belgium  www.advances-in-genomics.org

Recommend


More recommend