Genome sequencing in microfabricated high-density picolitre reactors.
Journal: 2005/September - Nature
ISSN: 1476-4687
Abstract:
The proliferation of large-scale DNA-sequencing projects in recent years has driven a search for alternative methods to reduce time and cost. Here we describe a scalable, highly parallel sequencing system with raw throughput significantly greater than that of state-of-the-art capillary electrophoresis instruments. The apparatus uses a novel fibre-optic slide of individual wells and is able to sequence 25 million bases, at 99% or better accuracy, in one four-hour run. To achieve an approximately 100-fold increase in throughput over current Sanger sequencing technology, we have developed an emulsion method for DNA amplification and an instrument for sequencing by synthesis using a pyrosequencing protocol optimized for solid support and picolitre-scale volumes. Here we show the utility, throughput, accuracy and robustness of this system by shotgun sequencing and de novo assembly of the Mycoplasma genitalium genome with 96% coverage at 99.96% accuracy in one run of the machine.
Relations:
Content
Citations
(2K+)
References
(20)
Diseases
(1)
Drugs
(1)
Chemicals
(1)
Processes
(2)
Similar articles
Articles by the same authors
Discussion board
Nature 437(7057): 376-380

Genome Sequencing in Open Microfabricated High Density Picoliter Reactors

+48 authors
454 Life Sciences Corp., 20 Commercial St., Branford, CT 06405, USA.
The Rothberg Institute For Childhood Diseases, 530 Whitfield St., Guilford, CT 06437, USA.
University of California, Berkeley, CA 94720, USA.
Laboratory of Microbiology, The Rockefeller University, New York, NY 10021, USA.
Correspondence and requests for materials should be addressed to J. M. Rothberg (moc.454@grebhtorj). Sequences for M. genitalium and S. pneumoniae were deposited at DDBJ/EMBL/GenBank under accession numbers AAGX01000000 and AAGY01000000 respectively.
These authors contributed equally to the work.

Abstract

We describe a scalable, highly parallel sequencing system with raw throughput significantly greater than that of state-of-the-art capillary electrophoresis instruments. The apparatus uses a novel 60×60 mm fibreoptic slide containing 1,600,000 individual wells and is able to sequence 25 million bases, at 99% or better accuracy (phred 20), in a 4 hour run. To provide sequencing templates, we clonally amplify DNA fragments on beads in the droplets of an emulsion. The template-carrying beads are loaded into the wells to convert each into a picoliter-scale sequencing reactor. We perform sequencing by synthesis using a pyrosequencing protocol optimized for solid support and the small dimension of the open reactors. Here we show the utility, throughput, accuracy and robustness of this system by shotgun sequencing and de novo assembling the Mycoplasma genitalium genome with 96% coverage at 99.96 % accuracy in one run of the machine.

Abstract

DNA sequencing has dramatically changed the nature of biomedical research and medicine. Reductions in the cost, complexity and time required to sequence large amount of DNA, including improvements in the ability to sequence bacterial and eukaryotic genomes will have significant scientific, economic and cultural impact. Large scale sequencing projects, including whole genome sequencing, have usually required the cloning of DNA fragments into bacterial vectors, amplification and purification of individual templates, followed by Sanger sequencing 1 using fluorescent chain-terminating nucleotide analogues 2 and either slab gel or capillary electrophoresis. Current estimates put the cost of sequencing a human genome between $10 and $25 million 3. Alternative sequencing methods have been described 45678 however, no technology has displaced the use of bacterial vectors and Sanger sequencing as the main generators of sequence information.

In this paper we describe an integrated system whose throughput routinely enables applications requiring millions of bases of sequence information, including whole genome sequencing Our focus has been on the co-development of an emulsion-based method 91011 to isolate and amplify DNA fragments in vitro, and of a fabricated substrate and instrument that performs pyrophosphate-based sequencing (“pyrosequencing” 512) in picoliter-sized wells.

In a typical run we generate over 25 million bases with a phred 20 or better quality score (predicted to have an accuracy of 99% or higher). While this phred 20 quality throughput is significantly higher than that of Sanger sequencing by capillary electrophoresis, it is currently at the cost of substantially shorter reads and lower average individual read accuracy 13. We further characterize the performance of the system, and demonstrate that it is possible to assemble bacterial genomes de novo from relatively short reads, by sequencing a known bacterial genome, Mycoplasma genitalium (580 kbp), and comparing our shotgun sequencing and de novo assembly with the results originally obtained for this genome 14. The results of shotgun sequencing and de novo assembly of a larger bacterial genome, Streptococcus pneumoniae15 (2.1 Mbp), are presented in Supplementary Table 4.

Footnotes

Supplementary Information accompanies the paper on www.nature.com/nature.

Footnotes

References

  • 1. Sanger F, Nicklen S, Coulson ARDNA sequencing with chain-terminating inhibitors. Proc Natl Acad Sci USA. 1977;74:5463.[Google Scholar]
  • 2. Prober JM, et al A system for rapid DNA sequencing with fluorescent chain-terminating dideoxynucleotides. Science. 1987;238:336.[PubMed][Google Scholar]
  • 3. NIH News Release, October 14, 2004, [PubMed]
  • 4. Nyren P, Pettersson B, Uhlen MSolid phase DNA minisequencing by an enzymatic luminometric inorganic pyrophosphate detection assay. Anal Biochem. 1993;208:171.[PubMed][Google Scholar]
  • 5. Ronaghi M, et al Real-time DNA sequencing using detection of pyrophosphate release. Anal Biochem. 1996;242:84.[PubMed][Google Scholar]
  • 6. Jacobson KB, et al Applications of mass spectrometry to DNA sequencing. GATA. 1991;8:223.[PubMed][Google Scholar]
  • 7. Bains W, Smith GCA novel method for nucleic acid sequence determination. J Theor Biol. 1988;135:303.[PubMed][Google Scholar]
  • 8. Jett JH, et al High-speed DNA sequencing: an approach based upon fluorescence detection of single molecules. Biomol Struct Dynamics. 1989:301.[PubMed][Google Scholar]
  • 9. Tawfik DS, Griffiths ADMan-made cell-like compartments for molecular evolution. Nat Biotechnology. 1998;16:652.[PubMed][Google Scholar]
  • 10. Ghadessy FJ, Ong JL, Holliger PDirected evolution of polymerase function by compartmentalized self-replication. Proc Nat Acad Sci USA. 2001;98:4552.[Google Scholar]
  • 11. Dressman D, Yan H, Traverso G, Kinzler KW, Vogelstein BTransforming single DNA molecules into fluorescent magnetic particles for detection and enumeration of genetic variations. Proc Nat Acad Sci USA. 2003;100:8817.[Google Scholar]
  • 12. Ronaghi M, Uhlen M, Nyren PA sequencing method based on real-time pyrophosphate. Science. 1998;281:363.[PubMed][Google Scholar]
  • 13. Current Sanger-based capillary electrophoresis sequencing systems produce up to 700 bp of sequence information from each of 96 DNA templates at an average read accuracy of 99.4% in one hour, or 67 thousand bases per hour, with substantially all of the bases having phred 20 or better quality(Applied Biosystems 3730xl DNA Analyzer Specification Sheet, 2004.)
  • 14. Fraser CM, et al The minimal gene complement of Mycoplasma genitalium. Science. 1995;270:397.[PubMed][Google Scholar]
  • 15. Tettelin H, et al Complete genome sequence of a virulent isolate of Streptococcus pneumoniae. Science. 2001;293:498.[PubMed][Google Scholar]
  • 16. Leamon JH, et al A massively parallel PicoTiterPlate based platform for discrete picoliter-scale polymerase chain reactions. Electrophoresis. 2003;24:3769.[PubMed][Google Scholar]
  • 17. Ronaghi MPyrosequencing sheds light on DNA sequencing. Genome Research. 2001;11:3.[PubMed][Google Scholar]
  • 18. Ewing B, Hillier L, Wendl MC, Green P. Base-calling of automated sequencer traces using phred. I. Accuracy assessment. Genome Research. 1998;8:175.[PubMed]
  • 19. Moore GECramming more components onto integrated circuits. Electronics. April 19 1965;38(Number 8)[PubMed][Google Scholar]
  • 20. Mehta K, Rajesh VA, Veeraswamy SFPGA implementation of VXIbus interface hardware. Biomed Sci Instrum. 1993;29:507.[PubMed][Google Scholar]
  • 21. Fagin B, Watt JG, Gross RA special-purpose processor for gene sequence analysis. Comput Appl Biosci. 1996;9:221.[PubMed][Google Scholar]
  • 22. Lander ES, Waterman MSGenomic mapping by fingerprinting random clones: a mathematical analysis. Genomics. 1988;2:231.[PubMed][Google Scholar]
  • 23. Myers EWToward simplifying and accurately formulating fragment assembly. J Comput Biol. 1995;2:2751.[PubMed][Google Scholar]
Collaboration tool especially designed for Life Science professionals.Drag-and-drop any entity to your messages.