next next generation sequencing an overview of generation
play

Next Next Generation Sequencing: an overview of Generation - PowerPoint PPT Presentation

Next Next Generation Sequencing: an overview of Generation Sequencing: an overview of technologies and applications technologies and applications Matthew Tinning Australian Genome Research Facility July 2012 History of Sequencing


  1. Next Next� �Generation Sequencing: an overview of Generation Sequencing: an overview of technologies and applications technologies and applications Matthew Tinning Australian Genome Research Facility July 2012

  2. History of Sequencing History of Sequencing Where have we been? Where have we been? 1869 – Discovery of DNA 1909 – Chemical characterisation 1953 – Structure of DNA solved 1977 – Sanger sequencing invented – First genome sequenced – Ф X174 (5 kb) – First genome sequenced – Ф X174 (5 kb) 1986 – First automated sequencing machine 1990 – Human Genome Project started 1992 – First “sequencing factory” at TIGR

  3. History of Sequencing History of Sequencing Where have we been? Where have we been? 1995 – First bacterial genome – H. influenzae (1.8 Mb) 1998 – First animal genome – C. elegans (97 Mb) 2003 – Completion of Human Genome Project (3 Gb) – 13 years, $2.7 bn 2005 – First “next-generation” sequencing instrument 2005 – First “next-generation” sequencing instrument 2008 – 2356 genome sequences in NCBI database

  4. Origin of DNA Sequencing Origin of DNA Sequencing • 1977 – First genome (ФX174) – Sequencing by synthesis (Sanger) – Sequencing by degradation (Maxam� – Sequencing by degradation (Maxam� Gilbert)

  5. Sanger sequencing Sanger sequencing • Uses DNA polymerase • All four nucleotides, plus one dideoxynucleotide • Random termination at specific bases • Random termination at specific bases • Separate by gel electrophoresis

  6. Chain extension and Chain extension and termination termination � � �� � � � � ������ ������ �������������������������

  7. Chain extension and Chain extension and termination termination � � � ����������� ���������������������� ���������������������� deoxynucleotide dideoxynucleotide

  8. Fragment identification Fragment identification

  9. Gel electrophoresis Gel electrophoresis

  10. 1986: 4 Reactions to 1 Lane 1986: 4 Reactions to 1 Lane Sequencing Reaction Products Progression of Sequencing Reaction

  11. ABI377 ABI377

  12. ABI3730xl ABI3730xl

  13. Electropherograms Electropherograms

  14. Sanger Sequencing Sanger Sequencing •Maximum read length ~900 base •Maximum yield/day < 2.1 million bases (rapid mode, 500 bp reads) < 0.1% of the human genome > 1000 days of sequencing for a 1 fold coverage ...

  15. Human Genome Project Human Genome Project

  16. Human Genome Project Human Genome Project • Launched in 1989 –expected to take 15 years – Competing Celera project launched in 1998 • Genome estimated to be 92% complete – 1 st Draft released in 2000 – “Complete” genome released in 2003 – “Complete” genome released in 2003 – Sequence of last chromosome published in 2006 • Cost: ~$3 billion – Celera ~$300 million

  17. Shotgun Sequencing Approach Shotgun Sequencing Approach

  18. Next Next� �Gen Sequencing Gen Sequencing Technologies Technologies

  19. Next Next� �Gen Sequencing Gen Sequencing Technologies Technologies • Four platforms, four technologies • All massively parallel sequencing – Sequencing by synthesis – Sequencing by Ligation – Sequencing by Ligation • Read lengths vary from ~36bp to >400bp • Read numbers vary from ~ 1 million to ~ 1 billion per run

  20. Next Next� �Gen Sequencing Gen Sequencing Technologies Technologies Roche GS-FLX Life Technologies SOLiD Life Technologies Ion Torrent Illumina HiSeq

  21. Next Gen Sequencing Library Next Gen Sequencing Library Preparation Preparation

  22. Roche GS Roche GS� �FLX FLX

  23. Workflow Workflow Sample Fragmentation Library Preparation emPCR Setup emPCR Amplification Pyrosequencing Data Analysis

  24. Pyrosequencing Pyrosequencing

  25. emPCR emPCR Emulsion PCR is a method of clonal amplification which allows for millions of unique PCRs to be performed at once through the generation of micro�reactors.

  26. emPCR The Water-in-Oil-Emulsion

  27. Massively Parallel Sequencing Massively Parallel Sequencing

  28. Data Analysis Data Analysis T Base A Base C Base G Base Flow Flow Flow Flow Raw Image Files Image Quality Base� Processing Filtering calling SFF File

  29. 454 Platform Updates 454 Platform Updates GS20 • 100bp reads, ~20Mbp / run GS�FLX • 250bp reads ~100 Mbp / run (7.5 hrs) GS�FLX Titanium GS�FLX Titanium • 400bp reads ~400 Mbp / run (10 hrs) • 400bp reads ~400 Mbp / run (10 hrs) GS�FLX Titanium Plus • 700 bp reads ~700 Mbp/run (18 hrs) GS Junior • 400 bp reads ~ 35Mbp/run (10 hrs)

  30. Illumina Illumina HiSeq HiSeq

  31. Illumina Sequencing Technology Illumina Sequencing Technology Robust Reversible Terminator Chemistry Foundation 3’ 5’ DNA (0.1-1.0 ug) A G T C G A C T T A C C G G A T A A C T T C C C G C G A T T C Sample G A preparation Cluster growth T 5’ Sequencing 1 2 3 4 5 6 7 8 9 T G C T A C G A T … Base calling Image acquisition

  32. Platform Updates Platform Updates Solexa 1G • 18bp reads, ~1Gbp / run Illumina GA • 36bp reads ~3Gbp / run Illumina GAII • 75bp paired reads ~10Gbp / run (8 days) Illumina GAIIx • 75bp paired reads ~40Gbp / run (8 days) Illumina HiSeq 2000 Illumina HiSeq 2000 • 100 bp paired reads ~200 Gbp/ run (10 days) Illumina HiSeq, v3 SBS • 100bp paired reads ~600Gbp / run (12 days) MiSeq • 150 paired reads ~1.5 Gb/run (27 hrs) Maximum yield / day 50,Gbp ~16x the human genome

  33. Applied Applied Biosystems Biosystems SOLiD SOLiD

  34. Sequencing by Ligation Sequencing by Ligation

  35. Base Interrogations Base Interrogations

  36. 2 Base encoding 2 Base encoding AT

  37. emPCR and Enrichment emPCR and Enrichment 3’ Modification allows covalent bonding to the slide surface

  38. Platform Updates Platform Updates • 50bp Paired reads ~50Gbp / run SOLiD 3 (12 days) • 50bp Paired reads ~100Gbp / run SOLiD 4 (12 days) • 75bp Paired reads ~300Gbp / run 5500xl (14 days) Maximum yield / day 21,000,000,000bp 7x the human genome 3.5 hours of sequencing for a 1 fold coverage.....

  39. Applied Applied Biosystems Biosystems: : Ion Torrent PGM Ion Torrent PGM

  40. Ion Torrent Ion Torrent • Ion Semiconductor Sequencing • Detection of hydrogen ions during the polymerization DNA • Sequencing occurs in microwells • Sequencing occurs in microwells with ion sensors • No modified nucleotides • No optics

  41. Ion Torrent Ion Torrent dNTP • DNA � Ions � Sequence – Nucleotides flow sequentially over Ion semiconductor chip – One sensor per well per sequencing H + reaction – Direct detection of natural DNA extension ∆ pH – Millions of sequencing reactions per chip – Fast cycle time, real time detection – Fast cycle time, real time detection ∆ Q ∆ Q Sensing Layer Sensor Plate ∆ V To column Bulk Drain Source receiver Silicon Substrate

  42. Ion Torrent: System Updates Ion Torrent: System Updates 314 Chip • 100bp reads ~10 Mb/run (1.5 hrs) • 100 bp reads ~100 Mbp / run (2 hrs) 316 Chip • 200 bp reads ~200 Mbp/run (3 hrs) 318 Chip • 200 bp reads ~1 Gbp / run (4.5 hrs)

  43. Summary of NGS Platforms Summary of NGS Platforms • Clonal amplification of sequencing template – emPCR (454, SOLiD and Ion Torrent) – Bridge amplification (Illumina) • Sequencing by Synthesis – 454 �������������� – Illumina ������������������������������� – Illumina ������������������������������� – Ion Torrent ���������������������������� • Sequencing by ligation – SOLiD – 2 base encoding • Dramatic reduction in cost of sequencing – GS�FLX provides > 100x decrease in costs compared to Sanger Sequencing – HiSeq and SOLiD > 100x decrease in costs over GS�FLX

  44. Rapid Innovation Driving Cost Rapid Innovation Driving Cost Down Down Evolution of NGS system output Cost per Human Genome Throughput (GB) 300 300GB 120 100 80 60 40 20GB 6GB 20 3GB 0 2007 2008 2009 2010

  45. NGS Library Preparation NGS Library Preparation • Library preparation • �������������������

  46. Library Preparation Library Preparation

  47. Sample preparation Sample preparation DNA mRNA chemical mechanical Fragmentation Fragmentation Fragmentation cDNA Synthesis cDNA Synthesis Ligation of Amplification/ Sequencing Adaptors Library Fragment Size Selection

  48. Applications Applications • DNA • Whole Genome • hybridization Capture • amplicon • amplicon • ChIP�seq • RNA • mRNA • small RNA

  49. Whole Genome Sequencing Whole Genome Sequencing • �������� assembly • Reference Mapping – SNVs, rearrangements • Comparative genomics E. coli assembly from MiSeq Data Illumina application notes

Recommend


More recommend