how to do successful gene expression analysis
play

How to do successful gene expression analysis Jan Hellemans, PhD - PowerPoint PPT Presentation

How to do successful gene expression analysis Jan Hellemans, PhD Center for Medical Genetics Biogazelle qPCR meeting June 25 th 2010 Sienna, Italy Introduction qPCR: reference technology for nucleic acid quantification


  1. How to do successful gene expression analysis Jan Hellemans, PhD Center for Medical Genetics Biogazelle qPCR meeting – June 25 th 2010 – Sienna, Italy

  2. Introduction  qPCR: reference technology for nucleic acid quantification  sensitivity and specificity  wide dynamic range  speed  relative low cost  conceptual and practical simplicity  easy to perform ≠ easy to do it right  many steps involved  all need to be right

  3. Introduction RNA quality RT and PCR Choice of Choice of assessment primer design RT chemistry cDNA synthesis strategy Sample extraction Sample selection Assay validation and handling Data Data reporting analysis

  4. prepare – cycle – report

  5. Prepare experiment design • power analysis • sample vs gene maximization • run layout samples • preparation • quality control • pre amplification assays • design • in silico validation • empirical validation reference gene • selection • validation

  6. Prepare experiment design • power analysis • sample vs gene maximization • run layout samples • preparation • quality control • pre amplification assays • design • in silico validation • empirical validation reference gene • selection • validation

  7. Power analysis  determination of the number of data points needed to reach statistical significance for a given  difference  variability 14,00 12,00  technical constraints critical t-value 10,00  confidence interval (CI) 8,00  3 (~ critical t-value t*) 6,00 CI = SEM x t* 4,00 2,00 0,00 2 3 4 5 10 20 100 number of datapoints

  8. Power analysis  determination of the number of data points needed to reach statistical significance for a given  difference  variability  technical constraints  confidence interval (CI)  3  Mann-Whitney test: n A + n B  8  Wilcoxon test:  6 pairs  http://www.cs.uiowa.edu/~rlenth/Power/

  9. Sample vs gene maximization  how to set-up an experiment with  3 genes of interest (GOI) & 3 reference genes (REF)  11 samples (S) & 1 no template control (NTC) gene maximization sample maximization REF1 REF2 REF3 GOI1 GOI2 GOI3 S1 S2 S3 S4 S5 S6 S7 S8 S9 S10 S11NTC S1 REF1 S2 S3 REF2 S4 S5 REF3 S6 S7 GOI1 NTC REF1 REF2 REF3 GOI1 GOI2 GOI3 S1 S2 S3 S4 S5 S6 S7 S8 S9 S10 S11NTC S1 GOI2 S2 S3 GOI3 S8 S9 S10 S11 NTC

  10. Sample vs gene maximization  sample maximization – to be preferred  no increase in variation due to absence of inter-run variation  suitable for retrospective studies and controlled experiments  gene maximization  introduces (under-estimated) inter-run variation  applicable for prospective studies or large studies in which the number of samples do not fit in the run anymore  inter-run variation can be measured and corrected for using inter-run calibrators (IRC) through a procedure called inter-run calibration

  11. Prepare experiment design • power analysis • sample vs gene maximization • run layout samples • preparation • quality control • pre amplification assays • design • in silico validation • empirical validation reference gene • selection • validation

  12. Preparation  cDNA synthesis  most variable step in the workflow (> RT replicates)  different performance of the enzymes  linearity and yield are important  DNase treament  retropseudogenes (15%) and single exon genes (5%)  on column vs. in solution  verify absence of DNA • qPCR for genomic DNA target on RNA as input

  13. Quality control – RNA integrity value  Evaluate integrity of 18S and 28S rRNA  Agilent Bioanalyzer  Bio-Rad Experion  Caliper GX  Qiagen QIAxcel  Shimadzu MultiNA

  14. Quality control – 5’ - 3’ ratio  universally expressed low abundant reference  anchored oligo(dT) reverse transcription 5’ 3’ AAAAAA Cq 5’ Cq 3’ 9 8 7 6 5'-3' delta Ct 5 4 3 2 1 0 109 109* 109** 275 275* 275** 539 539* 539** samples  increasing delta-Cq values upon artificial RNA degradation

  15. Quality control – SPUD assay for inhibition  spiking of synthetic sequence lacking homology with any known human sequence into RNA SPUD SPUD SPUD SPUD SPUD + + + + + H2O heparin RNA1 RNA2 RNA3 ------------RT-qPCR--------- Cq 22 Cq 27 Cq 22 Cq 25 Cq 22 Δ Cq > 1: presence of inhibitors

  16. Pre amplification  methods  WT-Ovation (NuGEN)  limited cycle PCR (PreAmp - Applied Biosystems)  preservation of differential expression (fold changes) before (B) and after (A) sample pre- amplification  (G1S1) B /(G1S2) B = (G1S1) A /(G1S2) A  G1 B /G2 B < > G1 A /G2 A  gene G, sample S, before B, after A

  17. Prepare experiment design • power analysis • sample vs gene maximization • run layout samples • preparation • quality control • pre amplification assays • design • in silico validation • empirical validation reference gene • selection • validation

  18. http://www.rtprimerdb.org

  19. Assay design guidelines  location  sequence repeats, protein domains  splice variants  intron spanning vs intra exonic  short amplicons: 80-150bp  SNPs  primers  dTm < 2°C  identical Tm for all assays  maximum 2 GC in last 5 nucleotides  use software to design assays  Primer3(Plus), BeaconDesigner, RTprimerDB

  20. In silico assay validation  do thorough in silico assay evaluation  BLAST/BiSearch specificity analysis  mfold secondary structure  SNP analysis of primer annealing regions  splice variant specificity  streamline in silico analyses with RTprimerDB pipeline

  21. Empirical assay validation  specificity  size analysis (only once) • agarose or polyacrylamide gel • capillary electrophoresis  melting curves (SYBR, repeated)  [sequence / restriction digest]  amplification efficiency  standard curve • range & number dilution points • representative sample  [single curve efficiency algorithms]  for absolute quantification  linear range and limit of detection

  22. Prepare experiment design • power analysis • sample vs gene maximization • run layout samples • preparation • quality control • pre amplification assays • design • in silico validation • empirical validation reference gene • selection • validation

  23. Single reference gene  quantitative RT-PCR analysis of 10 reference genes (belonging to different functional and abundance classes) on 85 samples from 13 different human tissues 4 3 ACTB HMBS 2 HPRT1 TBP 1 UBC 0 A B C D E F G

  24. Single vs multiple reference genes  single reference gene  errors related to the use of a single reference gene: > 3 fold in 25% of the cases > 6 fold in 10% of the cases  multiple reference genes  developed a robust algorithm for assessment of expression stability of candidate reference genes  proposed the geometric mean of at least 3 reference genes for accurate and reliable normalisation  geNorm analysis in pilot study  Vandesompele et al . Genome Biol. 2002 Jun 18;3(7):RESEARCH0034.

  25. geNorm  validation  insensitive to outliers  reduce most of the variation  statistically more significant results  accurate assessment of small expression differences  de facto standard for reference gene validation  2 400 citations of the geNorm technology  ~12 000 geNorm software downloads in 112 countries

  26. genorm PLUS

  27. genorm PLUS

  28. genorm PLUS

  29. Cycle cycle • instrument • chemistry • controls

  30. Instrument  fast PCR  fast ramping ≠ fast qPCR experiment  96-well vs 384-well  384-well system is slightly more expensive  384-well plates harder to pipet (multichannel pipets or pipetting robot)  384-well run gives 4x more data in same time  384-well plates require smaller volumes  plate homogeneity test

  31. Chemistry  choose probes for  multiplexing  genotyping  absolute sensitivity (detection past cycle 40) (e.g. clinical-diagnostic setting, GMO detection)  choose SYBR Green I for  all other applications  low cost  seeing what you do

  32. Controls  melting curve  unique melt peak for all samples?  replicates  delta-Cq < 0.5 cycles?  controls  negative control really blank delta-Cq samples/NTC > 5?  positive controls with expected Cq?  amplification plot shape (kinetic outlier detection)

  33. Report relative quantification • efficiency correction • multiple reference gene normalization • inter-run calibration • error propagation bio statistical analysis • biological replicates • log transform data • selection of statistical test reporting guidelines • RDML • MIQE

  34. Report relative quantification • efficiency correction • multiple reference gene normalization • inter-run calibration • error propagation bio statistical analysis • biological replicates • log transform data • selection of statistical test reporting guidelines • RDML • MIQE

  35. Calculation methods Cq RQ NRQ CNRQ Normalization Inter-run calibration NRQ NRQ RQ RQ   CNRQ     2 NRQ   Cq Cq soi soi toi toi CNRQ RQ RQ E NRQ NRQ n RQ n   irc ref NRQ n RQ n irc ref i i i i Hellemans et al . Genome Biol. 2007;8(2):R19.

  36. Data processing - relative quantification

  37. qbase PLUS

Recommend


More recommend