Entry | DNA sequencing |
Definition |
Knowledge of DNA sequences has become indispensable for basic biological research, and in numerous applied fields such as medical diagnosis, biotechnology, forensic biology, virology and biological systematics. The rapid speed of sequencing attained with modern DNA sequencing technology has been instrumental in the sequencing of complete DNA sequences, or genomes, of numerous types and species of life, including the human genome and other complete DNA sequences of many animal, plant, and microbial species. The first DNA sequences were obtained in the early 1970s by academic researchers using laborious methods based on two-dimensional chromatography. Following the development of fluorescence-based sequencing methods with a DNA sequencer,[2] DNA sequencing has become easier and orders of magnitude faster.[3] ApplicationsDNA sequencing may be used to determine the sequence of individual genes, larger genetic regions (i.e. clusters of genes or operons), full chromosomes, or entire genomes of any organism. DNA sequencing is also the most efficient way to indirectly sequence RNA or proteins (via their open reading frames). In fact, DNA sequencing has become a key technology in many areas of biology and other sciences such as medicine, forensics, and anthropology. Molecular biologySequencing is used in molecular biology to study genomes and the proteins they encode. Information obtained using sequencing allows researchers to identify changes in genes, associations with diseases and phenotypes, and identify potential drug targets. Evolutionary biologySince DNA is an informative macromolecule in terms of transmission from one generation to another, DNA sequencing is used in evolutionary biology to study how different organisms are related and how they evolved. Metagenomics{{Main|Metagenomics}}The field of metagenomics involves identification of organisms present in a body of water, sewage, dirt, debris filtered from the air, or swab samples from organisms. 
Knowing which organisms are present in a particular environment is critical to research in ecology, epidemiology, microbiology, and other fields. Sequencing enables researchers to determine, for example, which types of microbes are present in a microbiome.

Medicine
Medical technicians may sequence genes (or, theoretically, full genomes) from patients to determine whether there is a risk of genetic disease. This is a form of genetic testing, though some genetic tests do not involve DNA sequencing.

Forensics
DNA sequencing may be used along with DNA profiling methods for forensic identification[4] and paternity testing. DNA testing has advanced considerably over the last few decades, to the point where a DNA profile can be linked to the sample under investigation. DNA recovered from fingerprints, saliva, hair follicles and similar material can distinguish one individual from another. DNA testing detects specific regions of the genome and produces a pattern that is, in practice, individual to the person tested. Because it is extremely rare for two people to have exactly the same DNA pattern, DNA testing is highly reliable for identification.

The four canonical bases
Main article: Nucleotide
The canonical structure of DNA has four bases: thymine (T), adenine (A), cytosine (C), and guanine (G). DNA sequencing is the determination of the physical order of these bases in a molecule of DNA. However, there are many other bases that may be present in a molecule. 
In some viruses (specifically, bacteriophage), cytosine may be replaced by hydroxy methyl or hydroxy methyl glucose cytosine.[5] In mammalian DNA, variant bases with methyl groups or phosphosulfate may be found.[6][7] Depending on the sequencing technique, a particular modification, e.g., the 5mC (5 methyl cytosine) common in humans, may or may not be detected.[8] HistoryDiscovery of DNA structure and functionDeoxyribonucleic acid (DNA) was first discovered and isolated by Friedrich Miescher in 1869, but it remained understudied for many decades because proteins, rather than DNA, were thought to hold the genetic blueprint to life. This situation changed after 1944 as a result of some experiments by Oswald Avery, Colin MacLeod, and Maclyn McCarty demonstrating that purified DNA could change one strain of bacteria into another. This was the first time that DNA was shown capable of transforming the properties of cells. In 1953, James Watson and Francis Crick put forward their double-helix model of DNA, based on crystallized X-ray structures being studied by Rosalind Franklin – and without crediting her. According to the model, DNA is composed of two strands of nucleotides coiled around each other, linked together by hydrogen bonds and running in opposite directions. Each strand is composed of four complementary nucleotides – adenine (A), cytosine (C), guanine (G) and thymine (T) – with an A on one strand always paired with T on the other, and C always paired with G. They proposed such a structure allowed each strand to be used to reconstruct the other, an idea central to the passing on of hereditary information between generations.[9] The foundation for sequencing proteins was first laid by the work of Frederick Sanger who by 1955 had completed the sequence of all the amino acids in insulin, a small protein secreted by the pancreas. 
This provided the first conclusive evidence that proteins were chemical entities with a specific molecular pattern rather than a random mixture of material suspended in fluid. Sanger's success in sequencing insulin greatly electrified x-ray crystallographers, including Watson and Crick who by now were trying to understand how DNA directed the formation of proteins within a cell. Soon after attending a series of lectures given by Frederick Sanger in October 1954, Crick began to develop a theory which argued that the arrangement of nucleotides in DNA determined the sequence of amino acids in proteins which in turn helped determine the function of a protein. He published this theory in 1958.[10] RNA sequencingRNA sequencing was one of the earliest forms of nucleotide sequencing. The major landmark of RNA sequencing is the sequence of the first complete gene and the complete genome of Bacteriophage MS2, identified and published by Walter Fiers and his coworkers at the University of Ghent (Ghent, Belgium), in 1972[11] and 1976.[12] Traditional RNA sequencing methods require the creation of a cDNA molecule which must be sequenced.[13]Early DNA sequencing methodsThe first method for determining DNA sequences involved a location-specific primer extension strategy established by Ray Wu at Cornell University in 1970.[14] DNA polymerase catalysis and specific nucleotide labeling, both of which figure prominently in current sequencing schemes, were used to sequence the cohesive ends of lambda phage DNA.[15][16][17] Between 1970 and 1973, Wu, R Padmanabhan and colleagues demonstrated that this method can be employed to determine any DNA sequence using synthetic location-specific primers.[18][19][20] Frederick Sanger then adopted this primer-extension strategy to develop more rapid DNA sequencing methods at the MRC Centre, Cambridge, UK and published a method for "DNA sequencing with chain-terminating inhibitors" in 1977.[21] Walter Gilbert and Allan Maxam at Harvard also 
developed sequencing methods, including one for "DNA sequencing by chemical degradation".[22][21] In 1973, Gilbert and Maxam reported the sequence of 24 base pairs using a method known as wandering-spot analysis.[22] Advancements in sequencing were aided by the concurrent development of recombinant DNA technology, allowing DNA samples to be isolated from sources other than viruses.

Sequencing of full genomes
The first full DNA genome to be sequenced was that of bacteriophage φX174 in 1977.[23] Medical Research Council scientists deciphered the complete DNA sequence of the Epstein-Barr virus in 1984, finding it contained 172,282 nucleotides. Completion of the sequence marked a significant turning point in DNA sequencing because it was achieved with no prior knowledge of the genetic profile of the virus.[24] A non-radioactive method for transferring the DNA molecules of sequencing reaction mixtures onto an immobilizing matrix during electrophoresis was developed by Pohl and co-workers in the early 1980s.[25][26] This was followed by the commercialization of the DNA sequencer "Direct-Blotting-Electrophoresis-System GATC 1500" by GATC Biotech, which was used intensively within the EU genome-sequencing programme to determine the complete DNA sequence of chromosome II of the yeast Saccharomyces cerevisiae.[29] Leroy E. Hood's laboratory at the California Institute of Technology announced the first semi-automated DNA sequencing machine in 1986.[27] This was followed by Applied Biosystems' marketing of the first fully automated sequencing machine, the ABI 370, in 1987 and by Dupont's Genesis 2000,[28] which used a novel fluorescent labeling technique enabling all four dideoxynucleotides to be identified in a single lane. By 1990, the U.S. National Institutes of Health (NIH) had begun large-scale sequencing trials on Mycoplasma capricolum, Escherichia coli, Caenorhabditis elegans, and Saccharomyces cerevisiae at a cost of US$0.75 per base. 
Meanwhile, sequencing of human cDNA sequences called expressed sequence tags began in Craig Venter's lab, an attempt to capture the coding fraction of the human genome.[29] In 1995, Venter, Hamilton Smith, and colleagues at The Institute for Genomic Research (TIGR) published the first complete genome of a free-living organism, the bacterium Haemophilus influenzae. The circular chromosome contains 1,830,137 bases and its publication in the journal Science[30] marked the first published use of whole-genome shotgun sequencing, eliminating the need for initial mapping efforts. By 2001, shotgun sequencing methods had been used to produce a draft sequence of the human genome.[34][35] High-throughput sequencing (HTS) methodsSeveral new methods for DNA sequencing were developed in the mid to late 1990s and were implemented in commercial DNA sequencers by the year 2000. Together these were called the "next-generation" or "second-generation" sequencing (NGS) methods, in order to distinguish them from the aforementioned earlier methods, like Sanger Sequencing. In contrast to the first generation of sequencing, NGS technology is typically characterized by being highly scalable, allowing the entire genome to be sequenced at once. Usually, this is accomplished by fragmenting the genome into small pieces, randomly sampling for a fragment, and sequencing it using one of a variety of technologies, such as those described below. An entire genome is possible because multiple fragments are sequenced at once (giving it the name "massively parallel" sequencing) in an automated process. NGS technology has tremendously empowered researchers to look for insights into health, anthropologists to investigate human origins, and is catalyzing the "Personalized Medicine" movement. However, it has also opened the door to more room for error. There are many software tools to carry out the computational analysis of NGS data, each with its own algorithm. 
Even the parameters within one software package can change the outcome of the analysis. In addition, the large quantities of data produced by DNA sequencing have also required development of new methods and programs for sequence analysis. Several efforts to develop standards in the NGS field have been attempted to address these challenges, most of which have been small-scale efforts arising from individual labs. Most recently, a large, organized, FDA-funded effort has culminated in the BioCompute standard. On 26 October 1990, Roger Tsien, Pepi Ross, Margaret Fahnestock and Allan J Johnston filed a patent describing stepwise ("base-by-base") sequencing with removable 3' blockers on DNA arrays (blots and single DNA molecules).[31] In 1996, Pål Nyrén and his student Mostafa Ronaghi at the Royal Institute of Technology in Stockholm published their method of pyrosequencing.[32] On 1 April 1997, Pascal Mayer and Laurent Farinelli submitted patents to the World Intellectual Property Organization describing DNA colony sequencing.[33] The DNA sample preparation and random surface-PCR arraying methods described in this patent, coupled to Roger Tsien et al.'s "base-by-base" sequencing method, is now implemented in Illumina's Hi-Seq genome sequencers. In 1998, Phil Green and Brent Ewing of the University of Washington described their phred quality score for sequencer data analysis,[34] a landmark analysis technique that gained widespread adoption, and which is still the most common metric for assessing the accuracy of a sequencing platform.[35] Lynx Therapeutics published and marketed Massively parallel signature sequencing (MPSS), in 2000. 
This method incorporated a parallelized, adapter/ligation-mediated, bead-based sequencing technology and served as the first commercially available "next-generation" sequencing method, though no DNA sequencers were sold to independent laboratories.[36] Basic methodsMaxam-Gilbert sequencing{{Main|Maxam-Gilbert sequencing}}Allan Maxam and Walter Gilbert published a DNA sequencing method in 1977 based on chemical modification of DNA and subsequent cleavage at specific bases.[37] Also known as chemical sequencing, this method allowed purified samples of double-stranded DNA to be used without further cloning. This method's use of radioactive labeling and its technical complexity discouraged extensive use after refinements in the Sanger methods had been made. Maxam-Gilbert sequencing requires radioactive labeling at one 5' end of the DNA and purification of the DNA fragment to be sequenced. Chemical treatment then generates breaks at a small proportion of one or two of the four nucleotide bases in each of four reactions (G, A+G, C, C+T). The concentration of the modifying chemicals is controlled to introduce on average one modification per DNA molecule. Thus a series of labeled fragments is generated, from the radiolabeled end to the first "cut" site in each molecule. The fragments in the four reactions are electrophoresed side by side in denaturing acrylamide gels for size separation. To visualize the fragments, the gel is exposed to X-ray film for autoradiography, yielding a series of dark bands each corresponding to a radiolabeled DNA fragment, from which the sequence may be inferred.[37] Chain-termination methods{{Main|Sanger sequencing}}The chain-termination method developed by Frederick Sanger and coworkers in 1977 soon became the method of choice, owing to its relative ease and reliability.[38][39] When invented, the chain-terminator method used fewer toxic chemicals and lower amounts of radioactivity than the Maxam and Gilbert method. 
Because of its comparative ease, the Sanger method was soon automated and was the method used in the first generation of DNA sequencers. Sanger sequencing is the method which prevailed from the 1980s until the mid-2000s. Over that period, great advances were made in the technique, such as fluorescent labelling, capillary electrophoresis, and general automation. These developments allowed much more efficient sequencing, leading to lower costs. The Sanger method, in mass production form, is the technology which produced the first human genome in 2001, ushering in the age of genomics. However, later in the decade, radically different approaches reached the market, bringing the cost per genome down from $100 million in 2001 to $10,000 in 2011.[40]

Advanced methods and de novo sequencing
The term "de novo sequencing" specifically refers to methods used to determine the sequence of DNA with no previously known sequence. De novo translates from Latin as "from the beginning". Gaps in the assembled sequence may be filled by primer walking. The different strategies have different tradeoffs in speed and accuracy; shotgun methods are often used for sequencing large genomes, but their assembly is complex and difficult, particularly because sequence repeats often cause gaps in the genome assembly. Most sequencing approaches use an in vitro cloning step to amplify individual DNA molecules, because their molecular detection methods are not sensitive enough for single-molecule sequencing. Emulsion PCR[44] isolates individual DNA molecules along with primer-coated beads in aqueous droplets within an oil phase. A polymerase chain reaction (PCR) then coats each bead with clonal copies of the DNA molecule, followed by immobilization for later sequencing. Emulsion PCR is used in the methods developed by Margulies et al. (commercialized by 454 Life Sciences), Shendure and Porreca et al. 
(also known as "Polony sequencing") and SOLiD sequencing (developed by Agencourt, later Applied Biosystems, now Life Technologies).[51][45][46] Emulsion PCR is also used in the GemCode and Chromium platforms developed by 10x Genomics.[47]

Shotgun sequencing
Main article: Shotgun sequencing
Shotgun sequencing is a sequencing method designed for analysis of DNA sequences longer than 1000 base pairs, up to and including entire chromosomes. This method requires the target DNA to be broken into random fragments. After sequencing individual fragments, the sequences can be reassembled on the basis of their overlapping regions.[48]

Bridge PCR
Another method for in vitro clonal amplification is bridge PCR, in which fragments are amplified upon primers attached to a solid surface[33][49][50] and form "DNA colonies" or "DNA clusters". This method is used in the Illumina Genome Analyzer sequencers. Single-molecule methods, such as that developed by Stephen Quake's laboratory (later commercialized by Helicos), are an exception: they use bright fluorophores and laser excitation to detect base addition events from individual DNA molecules fixed to a surface, eliminating the need for molecular amplification.[51]

High-throughput methods
High-throughput, or next-generation,[52] sequencing applies to genome sequencing, genome resequencing, transcriptome profiling (RNA-Seq), DNA-protein interactions (ChIP-sequencing), and epigenome characterization.[53] Resequencing is necessary, because the genome of a single individual of a species will not indicate all of the genome variations among other individuals of the same species. 
The high demand for low-cost sequencing has driven the development of high-throughput sequencing technologies that parallelize the sequencing process, producing thousands or millions of sequences concurrently.[54][55][56] High-throughput sequencing technologies are intended to lower the cost of DNA sequencing below what is possible with standard dye-terminator methods.[57] In ultra-high-throughput sequencing, as many as 500,000 sequencing-by-synthesis operations may be run in parallel.[58][59][60] Such technologies led to the ability to sequence an entire human genome in as little as one day.[61] As of 2019, corporate leaders in the development of high-throughput sequencing products included Illumina, Qiagen and Thermo Fisher Scientific.[61]
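The fragment-and-reassemble principle that shotgun and massively parallel sequencing rely on can be illustrated with a toy example. The following Python sketch merges reads greedily by their longest suffix-to-prefix overlap. The function names, the minimum-overlap threshold and the example reads are hypothetical choices made for illustration; real assemblers use more elaborate graph-based algorithms and must cope with sequencing errors and repeats, both of which this sketch ignores.

```python
from itertools import permutations


def overlap(a: str, b: str, min_len: int = 3) -> int:
    """Return the length of the longest suffix of `a` that is a prefix of `b`.

    Overlaps shorter than `min_len` are reported as 0.
    """
    for k in range(min(len(a), len(b)), min_len - 1, -1):
        if a.endswith(b[:k]):
            return k
    return 0


def greedy_assemble(reads, min_len: int = 3):
    """Repeatedly merge the pair of reads with the longest overlap.

    Returns the contigs left when no pair overlaps by `min_len` or more.
    """
    contigs = list(dict.fromkeys(reads))  # drop exact duplicates, keep order
    while True:
        # Discard reads wholly contained in another read; they add nothing.
        contigs = [r for r in contigs
                   if not any(r != s and r in s for s in contigs)]
        best_k, best_pair = 0, None
        for a, b in permutations(contigs, 2):
            k = overlap(a, b, min_len)
            if k > best_k:
                best_k, best_pair = k, (a, b)
        if best_pair is None:
            return contigs
        a, b = best_pair
        contigs.remove(a)
        contigs.remove(b)
        contigs.append(a + b[best_k:])  # add only the non-overlapping tail of b


# Three overlapping reads drawn from the made-up sequence ATGGCGTGCAATCCG.
reads = ["ATGGCGTG", "GCGTGCAAT", "GCAATCCG"]
print(greedy_assemble(reads))  # ['ATGGCGTGCAATCCG']
```

Greedy merging is used here only because it is short enough to read at a glance; it can produce wrong assemblies when the target contains repeats, which is the difficulty noted for shotgun assembly earlier in this article.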
Massively parallel signature sequencing (MPSS)The first of the high-throughput sequencing technologies, massively parallel signature sequencing (or MPSS), was developed in the 1990s at Lynx Therapeutics, a company founded in 1992 by Sydney Brenner and Sam Eletr. MPSS was a bead-based method that used a complex approach of adapter ligation followed by adapter decoding, reading the sequence in increments of four nucleotides. This method made it susceptible to sequence-specific bias or loss of specific sequences. Because the technology was so complex, MPSS was only performed 'in-house' by Lynx Therapeutics and no DNA sequencing machines were sold to independent laboratories. Lynx Therapeutics merged with Solexa (later acquired by Illumina) in 2004, leading to the development of sequencing-by-synthesis, a simpler approach acquired from Manteia Predictive Medicine, which rendered MPSS obsolete. However, the essential properties of the MPSS output were typical of later high-throughput data types, including hundreds of thousands of short DNA sequences. In the case of MPSS, these were typically used for sequencing cDNA for measurements of gene expression levels.[36] Polony sequencing{{Main|Polony sequencing}}The Polony sequencing method, developed in the laboratory of George M. Church at Harvard, was among the first high-throughput sequencing systems and was used to sequence a full E. coli genome in 2005.[77] It combined an in vitro paired-tag library with emulsion PCR, an automated microscope, and ligation-based sequencing chemistry to sequence an E. coli genome at an accuracy of >99.9999% and a cost approximately 1/9 that of Sanger sequencing.[77] The technology was licensed to Agencourt Biosciences, subsequently spun out into Agencourt Personal Genomics, and eventually incorporated into the Applied Biosystems SOLiD platform. Applied Biosystems was later acquired by Life Technologies, now part of Thermo Fisher Scientific. 
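The accuracy quoted above for Polony sequencing can be placed on the phred scale mentioned earlier in this article, on which a quality score Q corresponds to an error probability P through Q = -10 log10(P). The sketch below applies that standard definition. Treating the quoted consensus accuracy as a per-base error probability is a simplifying assumption made only for illustration, and the function names are hypothetical.

```python
import math


def phred_from_error(p_error: float) -> float:
    """Convert an error probability (0 < p <= 1) into a phred quality score."""
    if not 0.0 < p_error <= 1.0:
        raise ValueError("error probability must be in (0, 1]")
    return -10.0 * math.log10(p_error)


def error_from_phred(q: float) -> float:
    """Convert a phred quality score back into an error probability."""
    return 10.0 ** (-q / 10.0)


# An accuracy of 99.9999% means roughly one error per million bases.
print(round(phred_from_error(1e-6)))  # 60
# A score of 30 corresponds to about one error per thousand bases.
print(error_from_phred(30))
```

On this scale every 10 points represents a tenfold drop in the error rate, so the quoted Polony figure would sit around Q60 under the stated assumption.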
454 pyrosequencing{{Main|454 Life Sciences#Technology}}A parallelized version of pyrosequencing was developed by 454 Life Sciences, which has since been acquired by Roche Diagnostics. The method amplifies DNA inside water droplets in an oil solution (emulsion PCR), with each droplet containing a single DNA template attached to a single primer-coated bead that then forms a clonal colony. The sequencing machine contains many picoliter-volume wells each containing a single bead and sequencing enzymes. Pyrosequencing uses luciferase to generate light for detection of the individual nucleotides added to the nascent DNA, and the combined data are used to generate sequence reads.[51] This technology provides intermediate read length and price per base compared to Sanger sequencing on one end and Solexa and SOLiD on the other.[57] Illumina (Solexa) sequencing{{Main|Illumina dye sequencing}}Solexa, now part of Illumina, was founded by Shankar Balasubramanian and David Klenerman in 1998, and developed a sequencing method based on reversible dye-terminators technology, and engineered polymerases.[93] The reversible terminated chemistry concept was invented by Bruno Canard and Simon Sarfati at the Pasteur Institute in Paris.[78][79] It was developed internally at Solexa by those named on the relevant patents. In 2004, Solexa acquired the company Manteia Predictive Medicine in order to gain a massively parallel sequencing technology invented in 1997 by Pascal Mayer and Laurent Farinelli.[33] It is based on "DNA Clusters" or "DNA colonies", which involves the clonal amplification of DNA on a surface. The cluster technology was co-acquired with Lynx Therapeutics of California. Solexa Ltd. later merged with Lynx to form Solexa Inc. In this method, DNA molecules and primers are first attached on a slide or flow cell and amplified with polymerase so that local clonal DNA colonies, later coined "DNA clusters", are formed. 
To determine the sequence, four types of reversible terminator bases (RT-bases) are added and non-incorporated nucleotides are washed away. A camera takes images of the fluorescently labeled nucleotides. Then the dye, along with the terminal 3' blocker, is chemically removed from the DNA, allowing for the next cycle to begin. Unlike pyrosequencing, the DNA chains are extended one nucleotide at a time and image acquisition can be performed at a delayed moment, allowing for very large arrays of DNA colonies to be captured by sequential images taken from a single camera. Decoupling the enzymatic reaction and the image capture allows for optimal throughput and theoretically unlimited sequencing capacity. With an optimal configuration, the ultimately reachable instrument throughput is thus dictated solely by the analog-to-digital conversion rate of the camera, multiplied by the number of cameras and divided by the number of pixels per DNA colony required for visualizing them optimally (approximately 10 pixels/colony). In 2012, with cameras operating at more than 10 MHz A/D conversion rates and available optics, fluidics and enzymatics, throughput can be multiples of 1 million nucleotides/second, corresponding roughly to 1 human genome equivalent at 1x coverage per hour per instrument, and 1 human genome re-sequenced (at approx. 30x) per day per instrument (equipped with a single camera).[80] Combinatorial probe anchor synthesis (cPAS)This method is an upgraded modification to combinatorial probe anchor ligation technology (cPAL) described by Complete Genomics[81] which has since become part of Chinese genomics company BGI in 2013.[82] The two companies have refined the technology to allow for longer read lengths, reaction time reductions and faster time to results. 
In addition, data are now generated as contiguous full-length reads in the standard FASTQ file format and can be used as-is in most short-read-based bioinformatics analysis pipelines.[83][citation needed]

The two technologies that form the basis for this high-throughput sequencing technology are DNA nanoballs (DNB) and patterned arrays for nanoball attachment to a solid surface.[81] DNA nanoballs are formed by denaturing double-stranded, adapter-ligated libraries and ligating only the forward strand to a splint oligonucleotide to form a ssDNA circle. Faithful copies of the circles containing the DNA insert are produced by rolling circle amplification, which generates approximately 300–500 copies. The long strand of ssDNA folds upon itself to produce a three-dimensional nanoball structure approximately 220 nm in diameter. Making DNBs removes the need to generate PCR copies of the library on the flow cell and can therefore eliminate large proportions of duplicate reads, adapter-adapter ligations and PCR-induced errors.[83][citation needed]

The patterned array of positively charged spots is fabricated through photolithography and etching techniques followed by chemical modification to generate a sequencing flow cell. Each spot on the flow cell is approximately 250 nm in diameter, and the spots are separated by 700 nm (centre to centre). Each spot allows easy attachment of a single negatively charged DNB, which reduces under- or over-clustering on the flow cell.[81][citation needed]

Sequencing is then performed by addition of an oligonucleotide probe that attaches in combination to specific sites within the DNB. The probe acts as an anchor that then allows one of four single reversibly inactivated, labelled nucleotides to bind after flowing across the flow cell. 
Unbound nucleotides are washed away before laser excitation of the attached labels, which then emit fluorescence; the signal is captured by cameras and converted to a digital output for base calling. The attached base has its terminator and label chemically cleaved at completion of the cycle. The cycle is repeated with another flow of free, labelled nucleotides across the flow cell to allow the next nucleotide to bind and have its signal captured. This process is repeated a number of times (usually 50 to 300) to determine the sequence of the inserted piece of DNA, at a rate of approximately 40 million nucleotides per second as of 2018.[citation needed]

SOLiD sequencing
Main article: 2 base encoding
Applied Biosystems' (now a Life Technologies brand) SOLiD technology employs sequencing by ligation. Here, a pool of all possible oligonucleotides of a fixed length is labeled according to the sequenced position. Oligonucleotides are annealed and ligated; the preferential ligation by DNA ligase for matching sequences results in a signal informative of the nucleotide at that position. Before sequencing, the DNA is amplified by emulsion PCR. The resulting beads, each carrying clonal copies of a single DNA molecule, are deposited on a glass slide.[84] The result is sequences of quantities and lengths comparable to Illumina sequencing.[57] This sequencing-by-ligation method has been reported to have some difficulty sequencing palindromic sequences.[76]

Ion Torrent semiconductor sequencing
Main article: Ion semiconductor sequencing
Ion Torrent Systems Inc. (now owned by Life Technologies) developed a system based on standard sequencing chemistry, but with a novel, semiconductor-based detection system. This method of sequencing is based on the detection of hydrogen ions that are released during the polymerisation of DNA, as opposed to the optical methods used in other sequencing systems. 
A microwell containing a template DNA strand to be sequenced is flooded with a single type of nucleotide. If the introduced nucleotide is complementary to the leading template nucleotide it is incorporated into the growing complementary strand. This causes the release of a hydrogen ion that triggers a hypersensitive ion sensor, which indicates that a reaction has occurred. If homopolymer repeats are present in the template sequence, multiple nucleotides will be incorporated in a single cycle. This leads to a corresponding number of released hydrogens and a proportionally higher electronic signal.[85] DNA nanoball sequencing{{Main|DNA nanoball sequencing}}DNA nanoball sequencing is a type of high throughput sequencing technology used to determine the entire genomic sequence of an organism. The company Complete Genomics uses this technology to sequence samples submitted by independent researchers. The method uses rolling circle replication to amplify small fragments of genomic DNA into DNA nanoballs. Unchained sequencing by ligation is then used to determine the nucleotide sequence.[108] This method of DNA sequencing allows large numbers of DNA nanoballs to be sequenced per run and at low reagent costs compared to other high-throughput sequencing platforms.[86] However, only short sequences of DNA are determined from each DNA nanoball which makes mapping the short reads to a reference genome difficult.[108] This technology has been used for multiple genome sequencing projects and is scheduled to be used for more.[87]Heliscope single molecule sequencing{{Main|Helicos single molecule fluorescent sequencing}}Heliscope sequencing is a method of single-molecule sequencing developed by Helicos Biosciences. It uses DNA fragments with added poly-A tail adapters which are attached to the flow cell surface. 
The next steps involve extension-based sequencing with cyclic washes of the flow cell with fluorescently labeled nucleotides (one nucleotide type at a time, as with the Sanger method). The reads are performed by the Heliscope sequencer.[88][89] The reads are short, averaging 35 bp.[90] In 2009 a human genome was sequenced using the Heliscope, however in 2012 the company went bankrupt.[91] Single molecule real time (SMRT) sequencing{{Main|Single molecule real time sequencing}}SMRT sequencing is based on the sequencing by synthesis approach. The DNA is synthesized in zero-mode wave-guides (ZMWs) – small well-like containers with the capturing tools located at the bottom of the well. The sequencing is performed with use of unmodified polymerase (attached to the ZMW bottom) and fluorescently labelled nucleotides flowing freely in the solution. The wells are constructed in a way that only the fluorescence occurring by the bottom of the well is detected. The fluorescent label is detached from the nucleotide upon its incorporation into the DNA strand, leaving an unmodified DNA strand. According to Pacific Biosciences (PacBio), the SMRT technology developer, this methodology allows detection of nucleotide modifications (such as cytosine methylation). This happens through the observation of polymerase kinetics. This approach allows reads of 20,000 nucleotides or more, with average read lengths of 5 kilobases.[68][92] In 2015, Pacific Biosciences announced the launch of a new sequencing instrument called the Sequel System, with 1 million ZMWs compared to 150,000 ZMWs in the PacBio RS II instrument.[93][94] SMRT sequencing is referred to as "third-generation" or "long-read" sequencing. Nanopore DNA sequencing{{Main|Nanopore sequencing}}The DNA passing through the nanopore changes its ion current. This change is dependent on the shape, size and length of the DNA sequence. Each type of the nucleotide blocks the ion flow through the pore for a different period of time. 
The method does not require modified nucleotides and is performed in real time. Nanopore sequencing is referred to as "third-generation" or "long-read" sequencing, along with SMRT sequencing.

Early industrial research into this method was based on a technique called 'exonuclease sequencing', in which electrical signals were read out as nucleotides passed through alpha(α)-hemolysin pores covalently bound with cyclodextrin.[95] However, the method that was subsequently commercialized, 'strand sequencing', sequences DNA bases in an intact strand.

Two main areas of nanopore sequencing in development are solid-state nanopore sequencing and protein-based nanopore sequencing. Protein nanopore sequencing utilizes membrane protein complexes such as α-hemolysin, MspA (Mycobacterium smegmatis Porin A) or CsgG, which show great promise given their ability to distinguish between individual nucleotides and groups of nucleotides.[96] In contrast, solid-state nanopore sequencing utilizes synthetic materials such as silicon nitride and aluminum oxide, and it is preferred for its superior mechanical properties and its thermal and chemical stability.[97] The fabrication method is essential for this type of sequencing, given that the nanopore array can contain hundreds of pores with diameters smaller than eight nanometers.[96]

The concept originated from the idea that single-stranded DNA or RNA molecules can be electrophoretically driven in a strict linear sequence through a biological pore that can be less than eight nanometers wide, and can be detected because the molecules alter the ionic current while moving through the pore. The pore contains a detection region capable of recognizing different bases, with each base generating a distinct time-specific signal as it crosses the pore; these signals, which correspond to the sequence of bases, are then evaluated.[97] Precise control over the DNA transport through the pore is crucial for success.
Various enzymes such as exonucleases and polymerases have been used to moderate this process by positioning them near the pore’s entrance.[98]

Methods in development
DNA sequencing methods currently under development include reading the sequence as a DNA strand transits through nanopores (a method that is now commercial, although subsequent generations such as solid-state nanopores are still in development),[99][100] and microscopy-based techniques, such as atomic force microscopy or transmission electron microscopy, that are used to identify the positions of individual nucleotides within long DNA fragments (>5,000 bp) by nucleotide labeling with heavier elements (e.g., halogens) for visual detection and recording.[101][102] Third-generation technologies aim to increase throughput and decrease the time to result and cost by eliminating the need for excessive reagents and harnessing the processivity of DNA polymerase.[103]

Tunnelling currents DNA sequencing
Another approach uses measurements of the electrical tunnelling currents across single-strand DNA as it moves through a channel. Depending on its electronic structure, each base affects the tunnelling current differently,[104] allowing differentiation between different bases.[105] The use of tunnelling currents has the potential to sequence orders of magnitude faster than ionic current methods, and the sequencing of several DNA oligomers and micro-RNA has already been achieved.[106]

Sequencing by hybridization
Sequencing by hybridization is a non-enzymatic method that uses a DNA microarray. A single pool of DNA whose sequence is to be determined is fluorescently labeled and hybridized to an array containing known sequences.
Strong hybridization signals from a given spot on the array identify its sequence in the DNA being sequenced.[107]

This method of sequencing utilizes binding characteristics of a library of short single-stranded DNA molecules (oligonucleotides), also called DNA probes, to reconstruct a target DNA sequence. Non-specific hybrids are removed by washing and the target DNA is eluted.[108] Hybrids are re-arranged such that the DNA sequence can be reconstructed. The benefit of this sequencing type is its ability to capture a large number of targets with a homogeneous coverage.[109] Large amounts of chemicals and starting DNA are usually required. However, with the advent of solution-based hybridization, much less equipment and far fewer chemicals are necessary.[108]

Sequencing with mass spectrometry
Mass spectrometry may be used to determine DNA sequences. Matrix-assisted laser desorption ionization time-of-flight mass spectrometry, or MALDI-TOF MS, has specifically been investigated as an alternative method to gel electrophoresis for visualizing DNA fragments. With this method, DNA fragments generated by chain-termination sequencing reactions are compared by mass rather than by size. The mass of each nucleotide is different from the others and this difference is detectable by mass spectrometry. Single-nucleotide mutations in a fragment can be more easily detected with MS than by gel electrophoresis alone. MALDI-TOF MS can more easily detect differences between RNA fragments, so researchers may indirectly sequence DNA with MS-based methods by converting it to RNA first.[110]

The higher resolution of DNA fragments permitted by MS-based methods is of special interest to researchers in forensic science, as they may wish to find single-nucleotide polymorphisms in human DNA samples to identify individuals. These samples may be highly degraded, so forensic researchers often prefer mitochondrial DNA for its higher stability and applications for lineage studies.
MS-based sequencing methods have been used to compare the sequences of human mitochondrial DNA from samples in a Federal Bureau of Investigation database[111] and from bones found in mass graves of World War I soldiers.[112]

Early chain-termination and TOF MS methods demonstrated read lengths of up to 100 base pairs.[113] Researchers have been unable to exceed this average read size; like chain-termination sequencing alone, MS-based DNA sequencing may not be suitable for large de novo sequencing projects. Even so, a recent study did use the short sequence reads and mass spectrometry to compare single-nucleotide polymorphisms in pathogenic Streptococcus strains.[114]

Microfluidic Sanger sequencing
{{Main|Sanger sequencing}}
In microfluidic Sanger sequencing, the entire thermocycling amplification of DNA fragments as well as their separation by electrophoresis is done on a single glass wafer (approximately 10 cm in diameter), thus reducing reagent usage as well as cost.[115] In some instances researchers have shown that they can increase the throughput of conventional sequencing through the use of microchips.[116] Further research is still needed to make this use of the technology effective.

Microscopy-based techniques
{{Main|Transmission electron microscopy DNA sequencing}}
This approach directly visualizes the sequence of DNA molecules using electron microscopy. The first identification of DNA base pairs within intact DNA molecules was achieved by enzymatically incorporating modified bases, which contain atoms of increased atomic number; direct visualization and identification of individually labeled bases within a synthetic 3,272 base-pair DNA molecule and a 7,249 base-pair viral genome has been demonstrated.[117]

RNAP sequencing
This method is based on the use of RNA polymerase (RNAP), which is attached to a polystyrene bead. One end of the DNA to be sequenced is attached to another bead, with both beads being placed in optical traps.
RNAP motion during transcription brings the beads closer together, and the change in their relative distance can be recorded at single-nucleotide resolution. The sequence is deduced based on the four readouts with lowered concentrations of each of the four nucleotide types, similarly to the Sanger method.[118] A comparison is made between regions, and sequence information is deduced by comparing the known sequence regions to the unknown sequence regions.[119]

In vitro virus high-throughput sequencing
A method has been developed to analyze full sets of protein interactions using a combination of 454 pyrosequencing and an in vitro virus mRNA display method. Specifically, this method covalently links proteins of interest to the mRNAs encoding them, then detects the mRNA pieces using reverse transcription PCRs. The mRNA may then be amplified and sequenced. The combined method was titled IVV-HiTSeq and can be performed under cell-free conditions, though its results may not be representative of in vivo conditions.[120]

Sample preparation
The success of any DNA sequencing protocol relies upon the DNA or RNA sample extraction and preparation from the biological material of interest.
Depending on the sequencing technology to be used, the samples resulting from either the DNA or the RNA extraction require further preparation. For Sanger sequencing, either cloning procedures or PCR are required prior to sequencing. In the case of next-generation sequencing methods, library preparation is required before processing.[122] Assessing the quality and quantity of nucleic acids both after extraction and after library preparation identifies degraded, fragmented, and low-purity samples and helps yield high-quality sequencing data.[123]

Development initiatives
In October 2006, the X Prize Foundation established an initiative to promote the development of full genome sequencing technologies, called the Archon X Prize, intending to award $10 million to "the first Team that can build a device and use it to sequence 100 human genomes within 10 days or less, with an accuracy of no more than one error in every 100,000 bases sequenced, with sequences accurately covering at least 98% of the genome, and at a recurring cost of no more than $10,000 (US) per genome."[124]

Each year the National Human Genome Research Institute, or NHGRI, promotes grants for new research and developments in genomics. 2010 grants and 2011 candidates include continuing work in microfluidic, polony and base-heavy sequencing methodologies.[125]

Computational challenges
The sequencing technologies described here produce raw data that needs to be assembled into longer sequences such as complete genomes (sequence assembly). There are many computational challenges in achieving this, such as the evaluation of the raw sequence data, which is done by programs and algorithms such as Phred and Phrap. Other challenges involve repetitive sequences, which often prevent complete genome assemblies because they occur in many places of the genome. As a consequence, many sequences may not be assigned to particular chromosomes.
The production of raw sequence data is only the beginning of its detailed bioinformatical analysis.[126] New methods for sequencing and for correcting sequencing errors have nonetheless been developed.[127]

Read trimming
Sometimes, the raw reads produced by the sequencer are correct and precise only in a fraction of their length. Using the entire read may introduce artifacts in downstream analyses such as genome assembly, SNP calling, or gene expression estimation. Two classes of trimming programs have been introduced, based on window-based or running-sum algorithms.[128] This is a partial list of the trimming algorithms currently available, specifying the algorithm class they belong to:
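The two algorithm classes named in the read trimming section can be illustrated with a short sketch. The Python below is a minimal, hypothetical example and does not reproduce any particular trimming program. It assumes that per-base qualities are Phred scores (Q = -10 log10 P, where P is the estimated error probability) stored as a FASTQ-style string with the Phred+33 offset. The function names, the window size of 4 and the quality threshold of 20 are illustrative choices, not values taken from the text.

```python
from typing import List


def decode_phred33(quality_string: str) -> List[int]:
    """Decode a quality string, assuming the Phred+33 ASCII offset."""
    return [ord(ch) - 33 for ch in quality_string]


def error_probability(q: int) -> float:
    """Phred relation Q = -10 * log10(P), solved for the error probability P."""
    return 10 ** (-q / 10)


def window_trim(quals: List[int], window: int = 4, min_mean: float = 20.0) -> int:
    """Window-based trimming: return how many leading bases to keep.

    Slides a fixed-size window from the 5' end and cuts at the start of the
    first window whose mean quality falls below min_mean. Reads shorter than
    the window are returned untrimmed.
    """
    for start in range(len(quals) - window + 1):
        if sum(quals[start:start + window]) / window < min_mean:
            return start
    return len(quals)


def running_sum_trim(quals: List[int], threshold: int = 20) -> int:
    """Running-sum trimming: return how many leading bases to keep.

    Walks from the 3' end accumulating (threshold - quality) and cuts at the
    position where that running sum is largest. The walk stops as soon as the
    sum turns negative, i.e. once good bases outweigh the bad tail.
    """
    total = 0
    best = 0
    keep = len(quals)
    for i in range(len(quals) - 1, -1, -1):
        total += threshold - quals[i]
        if total < 0:
            break
        if total > best:
            best = total
            keep = i
    return keep


if __name__ == "__main__":
    read = "ACGTACGTAC"                      # made-up read
    quals = decode_phred33("IIIII++&##")     # made-up qualities: high, then decaying
    print(read[:window_trim(quals)])         # prefix kept by the window-based rule
    print(read[:running_sum_trim(quals)])    # prefix kept by the running-sum rule
```

The two rules can disagree about the cut point: the window-based rule cuts at the start of the first failing window and may discard a good base that falls inside it, whereas the running-sum rule places the cut where the cumulative quality deficit of the tail is greatest.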
Ethical issues
{{Further|Bioethics}}
Human genetics has been included within the field of bioethics since the early 1970s[135] and the growth in the use of DNA sequencing (particularly high-throughput sequencing) has introduced a number of ethical issues. One key issue is the ownership of an individual's DNA and the data produced when that DNA is sequenced.[136] Regarding the DNA molecule itself, the leading legal case on this topic, Moore v. Regents of the University of California (1990), ruled that individuals have no property rights to discarded cells or any profits made using these cells (for instance, as a patented cell line). However, individuals have a right to informed consent regarding removal and use of cells. Regarding the data produced through DNA sequencing, Moore gives the individual no rights to the information derived from their DNA.[136]

As DNA sequencing becomes more widespread, the storage, security and sharing of genomic data have also become more important.[136][137] For instance, one concern is that insurers may use an individual's genomic data to modify their quote, depending on the perceived future health of the individual based on their DNA.[137][138] In May 2008, the Genetic Information Nondiscrimination Act (GINA) was signed in the United States, prohibiting discrimination on the basis of genetic information with respect to health insurance and employment.[139][140] In 2012, the US Presidential Commission for the Study of Bioethical Issues reported that existing privacy legislation for DNA sequencing data such as GINA and the Health Insurance Portability and Accountability Act was insufficient, noting that whole-genome sequencing data was particularly sensitive, as it could be used to identify not only the individual from which the data was created, but also their relatives.[141][142]

Ethical issues have also been raised by the increasing use of genetic variation screening, both in newborns, and in adults by
companies such as 23andMe.[143][144] It has been asserted that screening for genetic variations can be harmful, increasing anxiety in individuals who have been found to have an increased risk of disease.[145] For example, in one case noted in Time, doctors screening an ill baby for genetic variants chose not to inform the parents of an unrelated variant linked to dementia because of the harm it would cause to the parents.[146] However, a 2011 study in The New England Journal of Medicine showed that individuals undergoing disease risk profiling did not show increased levels of anxiety.[145]
Notes1. ^{{cite web|url=https://theconversation.com/introducing-dark-dna-the-phenomenon-that-could-change-how-we-think-about-evolution-82867|title=Introducing 'dark DNA' – the phenomenon that could change how we think about evolution}} 2. ^{{cite journal | vauthors = Olsvik O, Wahlberg J, Petterson B, Uhlén M, Popovic T, Wachsmuth IK, Fields PI | title = Use of automated sequencing of polymerase chain reaction-generated amplicons to identify three types of cholera toxin subunit B in Vibrio cholerae O1 strains | journal = J. Clin. Microbiol. | volume = 31 | issue = 1 | pages = 22–25 | date = January 1993 | pmid = 7678018 | pmc = 262614 | url = http://jcm.asm.org/cgi/pmidlookup?view=long&pmid=7678018 }}{{open access}} 3. ^{{cite journal | vauthors = Pettersson E, Lundeberg J, Ahmadian A | title = Generations of sequencing technologies | journal = Genomics | volume = 93 | issue = 2 | pages = 105–11 | date = February 2009 | pmid = 18992322 | doi = 10.1016/j.ygeno.2008.10.003 }} 4. ^{{Cite news|url=https://theconversation.com/from-the-crime-scene-to-the-courtroom-the-journey-of-a-dna-sample-82250|title=From the crime scene to the courtroom: the journey of a DNA sample|last=Curtis|first=Caitlin|date=29 August 2017|work=The Conversation|access-date=|archive-url=|archive-date=|dead-url=|last2=Hereward|first2=James}} 5. ^{{cite journal|last1=Moréra|first1=Solange|last2=Larivière|first2=Laurent|last3=Kurzeck|first3=Jürgen|last4=Aschke-Sonnenborn|first4=Ursula|last5=Freemont|first5=Paul S|last6=Janin|first6=Joël|last7=Rüger|first7=Wolfgang|title=High resolution crystal structures of T4 phage β-glucosyltransferase: induced fit and effect of substrate and metal binding|journal=Journal of Molecular Biology|date=August 2001|volume=311|issue=3|pages=569–77|doi=10.1006/jmbi.2001.4905|pmid=11493010}} 6. 
^{{cite journal|last1=Ehrlich|first1=Melanie|last2=Gama-Sosa|first2=Miguel A.|last3=Huang|first3=Lan-Hsiang|last4=Midgett|first4=Rose Marie|last5=Kuo|first5=Kenneth C.|last6=McCune|first6=Roy A.|last7=Gehrke|first7=Charles|title=Amount and distribution of 5-methylcytosine in human DNA from different types of tissues or cells|journal=Nucleic Acids Research|date=1982|volume=10|issue=8|pages=2709–21|doi=10.1093/nar/10.8.2709|pmid=7079182|pmc=320645}} 7. ^{{cite journal|last1=Ehrlich|first1=M|last2=Wang|first2=R.|title=5-Methylcytosine in eukaryotic DNA|journal=Science|date=19 June 1981|volume=212|issue=4501|pages=1350–57|doi=10.1126/science.6262918|pmid=6262918|bibcode = 1981Sci...212.1350E }} 8. ^{{cite journal|last1=Song|first1=Chun-Xiao|last2=Clark|first2=Tyson A|last3=Lu|first3=Xing-Yu|last4=Kislyuk|first4=Andrey|last5=Dai|first5=Qing|last6=Turner|first6=Stephen W|last7=He|first7=Chuan|last8=Korlach|first8=Jonas|title=Sensitive and specific single-molecule sequencing of 5-hydroxymethylcytosine|journal=Nature Methods|date=20 November 2011|volume=9|issue=1|pages=75–77|doi=10.1038/nmeth.1779|pmid=22101853|pmc=3646335}} 9. ^{{cite journal | vauthors = Watson JD, Crick FH | title = The structure of DNA | journal = Cold Spring Harb. Symp. Quant. Biol. | volume = 18 | issue = | pages = 123–31 | year = 1953 | pmid = 13168976 | doi = 10.1101/SQB.1953.018.01.020 }} 10. ^Marks, L, The path to DNA sequencing: The life and work of Frederick Sanger. 11. ^{{cite journal | vauthors = Min Jou W, Haegeman G, Ysebaert M, Fiers W | title = Nucleotide sequence of the gene coding for the bacteriophage MS2 coat protein | journal = Nature | volume = 237 | issue = 5350 | pages = 82–8 | date = May 1972 | pmid = 4555447 | doi = 10.1038/237082a0 | bibcode = 1972Natur.237...82J }} 12. 
^{{cite journal | vauthors = Fiers W, Contreras R, Duerinck F, Haegeman G, Iserentant D, Merregaert J, Min Jou W, Molemans F, Raeymaekers A, Van den Berghe A, Volckaert G, Ysebaert M | title = Complete nucleotide sequence of bacteriophage MS2 RNA: primary and secondary structure of the replicase gene | journal = Nature | volume = 260 | issue = 5551 | pages = 500–7 | date = April 1976 | pmid = 1264203 | doi = 10.1038/260500a0 | bibcode = 1976Natur.260..500F }} 13. ^{{Cite journal|last=Ozsolak|first=Fatih|last2=Milos|first2=Patrice M.|date=2011-02-01|title=RNA sequencing: advances, challenges and opportunities|url=http://www.nature.com/nrg/journal/v12/n2/full/nrg2934.html|journal=Nature Reviews Genetics|language=en|volume=12|issue=2|pages=87–98|doi=10.1038/nrg2934|issn=1471-0056|pmc=3031867|pmid=21191423}} 14. ^{{cite web|url=http://www.mbg.cornell.edu/faculty-staff/faculty/wu.cfm|title=Ray Wu Faculty Profile|archiveurl=https://web.archive.org/web/20090304121126/http://www.mbg.cornell.edu/faculty-staff/faculty/wu.cfm|archivedate=2009-03-04|publisher=Cornell University}} 15. ^{{cite journal|last=Padmanabhan|first=R|author2=Ray Wu |author3=Ernest Jay |title=Chemical Synthesis of a Primer and Its Use in the Sequence Analysis of the Lysozyme Gene of Bacteriophage T4|journal=Proceedings of the National Academy of Sciences|date=June 1974|volume=71|issue=6|pages=2510–14|doi=10.1073/pnas.71.6.2510|pmid=4526223|bibcode=1974PNAS...71.2510P|pmc=388489}} 16. ^{{cite journal | vauthors = Onaga LA | title = Ray Wu as Fifth Business: Demonstrating Collective Memory in the History of DNA Sequencing | journal = Studies in the History and Philosophy of Science | volume = 46 | pages = 1–14 | date = June 2014 | pmid = 24565976 | doi = 10.1016/j.shpsc.2013.12.006 | series = Part C }} 17. 
^{{cite journal | vauthors = Wu R | title = Nucleotide sequence analysis of DNA | journal = Nature New Biology | volume = 236 | issue = 68 | pages = 198–200 | year = 1972 | pmid = 4553110 | doi = 10.1038/newbio236198a0 }} 18. ^{{cite journal | vauthors = Padmanabhan R, Wu R | title = Nucleotide sequence analysis of DNA. IX. Use of oligonucleotides of defined sequence as primers in DNA sequence analysis | journal = Biochem. Biophys. Res. Commun. | volume = 48 | issue = 5 | pages = 1295–302 | year = 1972 | pmid = 4560009 | doi = 10.1016/0006-291X(72)90852-2| url = }} 19. ^{{cite journal | vauthors = Wu R, Tu CD, Padmanabhan R | title = Nucleotide sequence analysis of DNA. XII. The chemical synthesis and sequence analysis of a dodecadeoxynucleotide which binds to the endolysin gene of bacteriophage lambda | journal = Biochem. Biophys. Res. Commun. | volume = 55 | issue = 4 | pages = 1092–99 | year = 1973 | pmid = 4358929 | doi = 10.1016/S0006-291X(73)80007-5}} 20. ^{{cite journal | vauthors = Jay E, Bambara R, Padmanabhan R, Wu R | title = DNA sequence analysis: a general, simple and rapid method for sequencing large oligodeoxyribonucleotide fragments by mapping | journal = Nucleic Acids Research | volume = 1 | issue = 3 | pages = 331–53 | date = March 1974 | pmid = 10793670 | pmc = 344020 | doi = 10.1093/nar/1.3.331 }} 21. ^Gilbert, W. DNA sequencing and gene structure. Nobel lecture, 8 December 1980. 22. ^{{cite journal | vauthors = Gilbert W, Maxam A | title = The Nucleotide Sequence of the lac Operator | journal = Proc. Natl. Acad. Sci. U.S.A. | volume = 70 | issue = 12 | pages = 3581–84 | date = December 1973 | pmid = 4587255 | pmc = 427284 | doi = 10.1073/pnas.70.12.3581 | bibcode = 1973PNAS...70.3581G }} 23. 
^{{cite journal | vauthors = Sanger F, Air GM, Barrell BG, Brown NL, Coulson AR, Fiddes CA, Hutchison CA, Slocombe PM, Smith M | title = Nucleotide sequence of bacteriophage phi X174 DNA | journal = Nature | volume = 265 | issue = 5596 | pages = 687–95 | date = February 1977 | pmid = 870828 | doi = 10.1038/265687a0 | bibcode = 1977Natur.265..687S }} 24. ^"The Next Frontier: Human Viruses" , whatisbiotechnology.org, Retrieved 3 May 2017 25. ^{{cite journal | vauthors = Beck S, Pohl FM | title = DNA sequencing with direct blotting electrophoresis | journal = EMBO J | volume = 3 | issue = 12 | pages = 2905–09 | year = 1984 | pmid = 6396083 | pmc = 557787 | doi = 10.1002/j.1460-2075.1984.tb02230.x }} 26. ^United States Patent 4,631,122 (1986) 27. ^{{cite journal | vauthors = Smith LM, Sanders JZ, Kaiser RJ, Hughes P, Dodd C, Connell CR, Heiner C, Kent SB, Hood LE | title = Fluorescence Detection in Automated DNA Sequence Analysis | journal = Nature | volume = 321 | issue = 6071 | pages = 674–79 | date = 12 June 1986 | pmid = 3713851 | doi = 10.1038/321674a0 | bibcode = 1986Natur.321..674S }} 28. ^{{cite journal | vauthors = Prober JM, Trainor GL, Dam RJ, Hobbs FW, Robertson CW, Zagursky RJ, Cocuzza AJ, Jensen MA, Baumeister K | title = A system for rapid DNA sequencing with fluorescent chain-terminating dideoxynucleotides | journal = Science | volume = 238 | issue = 4825 | pages = 336–41 | date = 16 Oct 1987 | pmid = 2443975 | doi = 10.1126/science.2443975 | bibcode = 1987Sci...238..336P }} 29. ^{{cite journal | vauthors = Adams MD, Kelley JM, Gocayne JD, Dubnick M, Polymeropoulos MH, Xiao H, Merril CR, Wu A, Olde B, Moreno RF | title = Complementary DNA sequencing: expressed sequence tags and human genome project | journal = Science | volume = 252 | issue = 5013 | pages = 1651–56 | date = June 1991 | pmid = 2047873 | doi = 10.1126/science.2047873 | bibcode = 1991Sci...252.1651A }} 30. 
^{{cite journal | vauthors = Fleischmann RD, Adams MD, White O, Clayton RA, Kirkness EF, Kerlavage AR, Bult CJ, Tomb JF, Dougherty BA, Merrick JM | title = Whole-genome random sequencing and assembly of Haemophilus influenzae Rd | journal = Science | volume = 269 | issue = 5223 | pages = 496–512 | date = July 1995 | pmid = 7542800 | doi = 10.1126/science.7542800 | bibcode = 1995Sci...269..496F }} 31. ^{{cite web|url=http://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=19910516&DB=EPODOC&locale=en_EP&CC=WO&NR=9106678A1&KC=A1&ND=4|title=Espacenet – Bibliographic data|website=worldwide.espacenet.com}} 32. ^{{cite journal | vauthors = Ronaghi M, Karamohamed S, Pettersson B, Uhlén M, Nyrén P | title = Real-time DNA sequencing using detection of pyrophosphate release | journal = Analytical Biochemistry | volume = 242 | issue = 1 | pages = 84–89 | year = 1996 | pmid = 8923969 | doi = 10.1006/abio.1996.0432 }} 33. ^1 2 {{cite web| last = Kawashima | first = Eric H. | author2 = Laurent Farinelli | author3 = Pascal Mayer | title = Patent: Method of nucleic acid amplification | accessdate = 2012-12-22 | date = 2005-05-12 | url = http://www.patentlens.net/patentlens/patent/WO_1998_044151_A1/en/}} 34. ^{{cite journal|vauthors=Ewing B, Green P|date=March 1998|title=Base-calling of automated sequencer traces using phred. II. Error probabilities|url=http://www.genome.org/cgi/pmidlookup?view=long&pmid=9521922|journal=Genome Res.|volume=8|issue=3|pages=186–94|doi=10.1101/gr.8.3.186|pmid=9521922}} 35. ^{{cite web|url=https://www.illumina.com/documents/products/technotes/technote_Q-Scores.pdf|title=Quality Scores for Next-Generation Sequencing|last=|first=|date=31 October 2011|website=Illumina|access-date=8 May 2018}} 36. 
^1 {{cite journal | vauthors = Brenner S, Johnson M, Bridgham J, Golda G, Lloyd DH, Johnson D, Luo S, McCurdy S, Foy M, Ewan M, Roth R, George D, Eletr S, Albrecht G, Vermaas E, Williams SR, Moon K, Burcham T, Pallas M, DuBridge RB, Kirchner J, Fearon K, Mao J, Corcoran K | title = Gene expression analysis by massively parallel signature sequencing (MPSS) on microbead arrays | journal = Nature Biotechnology | volume = 18 | issue = 6 | pages = 630–34 | year = 2000 | pmid = 10835600 | doi = 10.1038/76469 }} 37. ^1 2 {{cite journal | vauthors = Maxam AM, Gilbert W | title = A new method for sequencing DNA | journal = Proc. Natl. Acad. Sci. USA | volume = 74 | issue = 2 | pages = 560–64 | date = February 1977 | pmid = 265521 | pmc = 392330 | doi = 10.1073/pnas.74.2.560 | bibcode = 1977PNAS...74..560M }} 38. ^1 {{cite journal | vauthors = Sanger F, Nicklen S, Coulson AR | title = DNA sequencing with chain-terminating inhibitors | journal = Proc. Natl. Acad. Sci. USA | volume = 74 | issue = 12 | pages = 5463–77 | date = December 1977 | pmid = 271968 | pmc = 431765 | doi = 10.1073/pnas.74.12.5463 | bibcode = 1977PNAS...74.5463S }} 39. ^{{cite journal | vauthors = Sanger F, Coulson AR | title = A rapid method for determining sequences in DNA by primed synthesis with DNA polymerase | journal = J. Mol. Biol. | volume = 94 | issue = 3 | pages = 441–48 | date = May 1975 | pmid = 1100841 | doi = 10.1016/0022-2836(75)90213-2 }} 40. ^{{cite web |last=Wetterstrand |first=Kris |title=DNA Sequencing Costs: Data from the NHGRI Genome Sequencing Program (GSP) |publisher=National Human Genome Research Institute |accessdate=30 May 2013 |url=https://www.genome.gov/sequencingcosts }} 41. 
^{{cite journal | vauthors = Quail MA, Gu Y, Swerdlow H, Mayho M | title = Evaluation and optimisation of preparative semi-automated electrophoresis systems for Illumina library preparation | journal = Electrophoresis | volume = 33 | issue = 23 | pages = 3521–28 | year = 2012 | pmid = 23147856 | doi = 10.1002/elps.201200128 }} 42. ^{{cite journal | vauthors = Duhaime MB, Deng L, Poulos BT, Sullivan MB | title = Towards quantitative metagenomics of wild viruses and other ultra-low concentration DNA samples: a rigorous assessment and optimization of the linker amplification method | journal = Environ. Microbiol. | volume = 14 | issue = 9 | pages = 2526–37 | year = 2012 | pmid = 22713159 | pmc = 3466414 | doi = 10.1111/j.1462-2920.2012.02791.x }} 43. ^{{cite journal | vauthors = Peterson BK, Weber JN, Kay EH, Fisher HS, Hoekstra HE | title = Double digest RADseq: an inexpensive method for de novo SNP discovery and genotyping in model and non-model species | journal = PLoS ONE | volume = 7 | issue = 5 | pages = e37135 | year = 2012 | pmid = 22675423 | pmc = 3365034 | doi = 10.1371/journal.pone.0037135 |bibcode = 2012PLoSO...737135P }} 44. ^{{cite journal | vauthors = Williams R, Peisajovich SG, Miller OJ, Magdassi S, Tawfik DS, Griffiths AD | title = Amplification of complex gene libraries by emulsion PCR | journal = Nature Methods | volume = 3 | issue = 7 | pages = 545–50 | year = 2006 | pmid = 16791213 | doi = 10.1038/nmeth896 }} 45. ^{{cite journal | vauthors = Shendure J, Porreca GJ, Reppas NB, Lin X, McCutcheon JP, Rosenbaum AM, Wang MD, Zhang K, Mitra RD, Church GM | title = Accurate Multiplex Polony Sequencing of an Evolved Bacterial Genome | journal = Science | volume = 309 | issue = 5741 | pages = 1728–32 | year = 2005 | pmid = 16081699 | doi = 10.1126/science.1117389 | bibcode = 2005Sci...309.1728S }} 46. 
^{{cite web|url=http://solid.appliedbiosystems.com/|archiveurl=https://web.archive.org/web/20080516181322/http://solid.appliedbiosystems.com/|archivedate=2008-05-16|title=Applied Biosystems – File Not Found (404 Error)|date=16 May 2008}} 47. ^{{cite journal|last1=Goodwin|first1=Sara|last2=McPherson|first2=John D.|last3=McCombie|first3=W. Richard|title=Coming of age: ten years of next-generation sequencing technologies|journal=Nature Reviews Genetics|date=17 May 2016|volume=17|issue=6|pages=333–51|doi=10.1038/nrg.2016.49|pmid=27184599}} 48. ^{{cite journal | vauthors = Staden R | title = A strategy of DNA sequencing employing computer programs. | journal = Nucleic Acids Research | volume = 6 | issue = 7 | pages = 2601–10 | date = 11 Jun 1979 | pmid = 461197 | pmc = 327874 | doi = 10.1093/nar/6.7.2601 }} 49. ^P. Mayer,L. Farinelli, G. Matton, C. Adessi, G. Turcatti, J. J. Mermod, E. Kawashima.DNA colony massively parallel sequencing ams98 presentation 50. ^{{US patent|5641658}} 51. ^{{cite journal | vauthors = Braslavsky I, Hebert B, Kartalov E, Quake SR | title = Sequence information can be obtained from single DNA molecules | journal = Proc. Natl. Acad. Sci. USA | volume = 100 | issue = 7 | pages = 3960–64 | date = April 2003 | pmid = 12651960 | pmc = 153030 | doi = 10.1073/pnas.0230489100 | bibcode = 2003PNAS..100.3960B }} 52. ^"Next-generation" remains in broad use as of 2019. For instance, {{cite journal|vauthors=Straiton J, Free T, Sawyer A, Martin J|date=February 2019|title=From Sanger Sequencing to Genome Databases and Beyond|url=|journal=BioTechniques|volume=66|issue=2|pages=60–63|doi=10.2144/btn-2019-0011|pmid=30744413|quote=Next-generation sequencing (NGS) technologies have revolutionized genomic research. (opening sentence of the article)|via=}} 53. 
^{{cite journal | vauthors = de Magalhães JP, Finch CE, Janssens G | title = Next-generation sequencing in aging research: emerging applications, problems, pitfalls and possible solutions | journal = Ageing Research Reviews | volume = 9 | issue = 3 | pages = 315–23 | year = 2010 | pmid = 19900591 | pmc = 2878865 | doi = 10.1016/j.arr.2009.10.006 }} 54. ^{{cite journal | vauthors = Grada A | title = Next-generation sequencing: methodology and application | journal = J Invest Dermatol | volume = 133 | issue = 8 | pages = e11 | date = August 2013 | pmid = 23856935 | doi = 10.1038/jid.2013.248 }} 55. ^{{cite journal | vauthors = Hall N | title = Advanced sequencing technologies and their wider impact in microbiology | journal = J. Exp. Biol. | volume = 210| issue = Pt 9 | pages = 1518–25 | date = May 2007 | pmid = 17449817 | doi = 10.1242/jeb.001370 }}{{open access}} 56. ^{{cite journal | vauthors = Church GM | title = Genomes for all | journal = Sci. Am. | volume = 294 | issue = 1 | pages = 46–54 | date = January 2006 | pmid = 16468433 | doi = 10.1038/scientificamerican0106-46 | authorlink1 = George M. Church | bibcode = 2006SciAm.294a..46C }}{{subscription required}} 57. ^1 2 {{cite journal | vauthors = Schuster SC | title = Next-generation sequencing transforms today's biology | journal = Nat. Methods | volume = 5 | issue = 1 | pages = 16–18 | date = January 2008 | pmid = 18165802 | doi = 10.1038/nmeth1156 }} 58. ^{{cite book | title = Massively Parallel, Optical, and Neural Computing in the United States | first1 = Gilbert | last1 = Kalb | first2 = Robert | last2 = Moxley | publisher = IOS Press | year = 1992 | isbn = 978-90-5199-097-3 }}{{Page needed|date=June 2013}} 59. ^{{cite journal | vauthors = ten Bosch JR, Grody WW | title = Keeping Up with the Next Generation | journal = The Journal of Molecular Diagnostics | volume = 10 | issue = 6 | pages = 484–92 | year = 2008 | pmid = 18832462 | pmc = 2570630 | doi = 10.2353/jmoldx.2008.080027 }}{{open access}} 60. 
^{{cite journal | vauthors = Tucker T, Marra M, Friedman JM | title = Massively Parallel Sequencing: The Next Big Thing in Genetic Medicine | journal = The American Journal of Human Genetics | volume = 85 | issue = 2 | pages = 142–54 | year = 2009 | pmid = 19679224 | pmc = 2725244 | doi = 10.1016/j.ajhg.2009.06.022 }}{{open access}} 61. ^1 {{Cite journal|last=Straiton|first=Jenny|last2=Free|first2=Tristan|last3=Sawyer|first3=Abigail|last4=Martin|first4=Joseph|date=February 2019|title=From Sanger Sequencing to Genome Databases and Beyond|url=|journal=BioTechniques|publisher=Future Science|volume=66|issue=2|pages=60–63|doi=10.2144/btn-2019-0011|pmid=30744413|via=}} 62. ^{{cite journal | vauthors = Quail MA, Smith M, Coupland P, Otto TD, Harris SR, Connor TR, Bertoni A, Swerdlow HP, Gu Y | title = A tale of three next generation sequencing platforms: comparison of Ion Torrent, Pacific Biosciences and illumina MiSeq sequencers | journal = BMC Genomics | volume = 13 | issue = 1 | page = 341 | date = 1 January 2012 | pmid = 22827831 | pmc = 3431227 | doi = 10.1186/1471-2164-13-341 }}{{open access}} 63. ^{{cite journal | vauthors = Liu L, Li Y, Li S, Hu N, He Y, Pong R, Lin D, Lu L, Law M | title = Comparison of Next-Generation Sequencing Systems | journal = Journal of Biomedicine and Biotechnology | volume = 2012 | pages = 251364 | date = 1 January 2012 | pmid = 22829749 | doi = 10.1155/2012/251364 | pmc=3398667}}{{open access}} 64. ^1 2 {{cite web|url=https://www.pacb.com/blog/new-software-polymerase-sequel-system-boost-throughput-affordability/|title=New Software, Polymerase for Sequel System Boost Throughput and Affordability – PacBio|date=7 March 2018}} 65. 
^{{cite web |url=http://www.genomeweb.com/sequencing/after-year-testing-two-early-pacbio-customers-expect-more-routine-use-rs-sequenc |title=After a Year of Testing, Two Early PacBio Customers Expect More Routine Use of RS Sequencer in 2012 |author= |date=10 January 2012 |publisher=GenomeWeb }}{{registration required}} 66. ^{{cite web|url=http://globenewswire.com/news-release/2013/10/03/577891/10051072/en/Pacific-Biosciences-Introduces-New-Chemistry-With-Longer-Read-Lengths-to-Detect-Novel-Features-in-DNA-Sequence-and-Advance-Genome-Studies-of-Large-Organisms.html|title=Pacific Biosciences Introduces New Chemistry With Longer Read Lengths to Detect Novel Features in DNA Sequence and Advance Genome Studies of Large Organisms|first=Pacific Biosciences|last=Inc.|year=2013}} 67. ^{{cite journal | vauthors = Chin CS, Alexander DH, Marks P, Klammer AA, Drake J, Heiner C, Clum A, Copeland A, Huddleston J, Eichler EE, Turner SW, Korlach J | title = Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data | journal = Nat. Methods | volume = 10 | issue = 6 | pages = 563–69 | year = 2013 | pmid = 23644548 | doi = 10.1038/nmeth.2474 }} 68. ^1 {{cite web|url=http://flxlexblog.wordpress.com/2013/07/05/de-novo-bacterial-genome-assembly-a-solved-problem/|title=De novo bacterial genome assembly: a solved problem?|date=5 July 2013}} 69. ^{{cite journal | vauthors = Rasko DA, Webster DR, Sahl JW, Bashir A, Boisen N, Scheutz F, Paxinos EE, Sebra R, Chin CS, Iliopoulos D, Klammer A, Peluso P, Lee L, Kislyuk AO, Bullard J, Kasarskis A, Wang S, Eid J, Rank D, Redman JC, Steyert SR, Frimodt-Møller J, Struve C, Petersen AM, Krogfelt KA, Nataro JP, Schadt EE, Waldor MK | title = Origins of the Strain Causing an Outbreak of Hemolytic–Uremic Syndrome in Germany | journal = N Engl J Med | volume = 365 | issue = 8 | pages = 709–17 | date = 25 August 2011 | pmid = 21793740 | doi = 10.1056/NEJMoa1106920 | pmc=3168948}}{{open access}} 70. 
^{{cite journal | vauthors = Tran B, Brown AM, Bedard PL, Winquist E, Goss GD, Hotte SJ, Welch SA, Hirte HW, Zhang T, Stein LD, Ferretti V, Watt S, Jiao W, Ng K, Ghai S, Shaw P, Petrocelli T, Hudson TJ, Neel BG, Onetto N, Siu LL, McPherson JD, Kamel-Reid S, Dancey JE | title = Feasibility of real time next generation sequencing of cancer genes linked to drug response: Results from a clinical trial | journal = Int. J. Cancer | volume = 132 | issue = 7 | pages = 1547–55 | date = 1 January 2012 | pmid = 22948899 | doi = 10.1002/ijc.27817 | authorlink18 = Thomas J. Hudson | authorlink10 = Lincoln Stein }}{{subscription required}} 71. ^{{cite journal | vauthors = Murray IA, Clark TA, Morgan RD, Boitano M, Anton BP, Luong K, Fomenkov A, Turner SW, Korlach J, Roberts RJ | title = The methylomes of six bacteria | journal = Nucleic Acids Research | volume = 40 | issue = 22 | pages = 11450–62 | date = 2 October 2012 | pmid = 23034806 | pmc = 3526280 | doi = 10.1093/nar/gks891 }} 72. ^{{cite web|url=https://www.thermofisher.com/order/catalog/product/A30670|title=Ion 520 & Ion 530 ExT Kit-Chef – Thermo Fisher Scientific|website=www.thermofisher.com}} 73. ^ {{dead link|date=September 2018}} 74. ^{{cite journal | vauthors = van Vliet AH | title = Next generation sequencing of microbial transcriptomes: challenges and opportunities | journal = FEMS Microbiology Letters | volume = 302 | issue = 1 | pages = 1–7 | date = 1 January 2010 | pmid = 19735299 | doi = 10.1111/j.1574-6968.2009.01767.x }}{{open access}} 75. ^{{cite web|url=http://en.mgitech.cn/product/30.html|title=BGI and MGISEQ|last=|first=|date=|website=en.mgitech.cn|archive-url=|archive-date=|dead-url=|access-date=2018-07-05}} 76. 
^1 {{cite journal | vauthors = Huang YF, Chen SC, Chiang YS, Chen TH, Chiu KP | title = Palindromic sequence impedes sequencing-by-ligation mechanism | journal = BMC Systems Biology | volume = 6 Suppl 2 | pages = S10 | year = 2012 | pmid = 23281822 | doi = 10.1186/1752-0509-6-S2-S10 | pmc=3521181}} 77. ^1 {{cite journal | vauthors = Shendure J, Porreca GJ, Reppas NB, Lin X, McCutcheon JP, Rosenbaum AM, Wang MD, Zhang K, Mitra RD, Church GM | title = Accurate multiplex polony sequencing of an evolved bacterial genome. | journal = Science | volume = 309 | issue = 5741 | pages = 1728–32 | date = 9 Sep 2005 | pmid = 16081699 | doi = 10.1126/science.1117389 | bibcode = 2005Sci...309.1728S }} 78. ^{{Citation|last=Canard|first=Bruno|title=Novel derivatives usable for the sequencing of nucleic acids|date=13 Oct 1994|url=http://www.google.ge/patents/CA2158975A1|last2=Sarfati|first2=Simon|accessdate=2016-03-09}} 79. ^{{Cite journal|last=Canard|first=Bruno|last2=Sarfati|first2=Robert S.|date=1994-10-11|title=DNA polymerase fluorescent substrates with reversible 3′-tags|url=http://www.sciencedirect.com/science/article/pii/0378111994902267|journal=Gene|volume=148|issue=1|pages=1–6|doi=10.1016/0378-1119(94)90226-7|pmid=7523248}} 80. ^{{cite journal | vauthors = Mardis ER | title = Next-generation DNA sequencing methods | journal = Annu Rev Genom Hum Genet | volume = 9 | issue = | pages = 387–402 | year = 2008 | pmid = 18576944 | doi = 10.1146/annurev.genom.9.081307.164359 }} 81. 
^1 2 {{Cite journal|last=Drmanac|first=Radoje|last2=Sparks|first2=Andrew B.|last3=Callow|first3=Matthew J.|last4=Halpern|first4=Aaron L.|last5=Burns|first5=Norman L.|last6=Kermani|first6=Bahram G.|last7=Carnevali|first7=Paolo|last8=Nazarenko|first8=Igor|last9=Nilsen|first9=Geoffrey B.|date=2010-01-01|title=Human Genome Sequencing Using Unchained Base Reads on Self-Assembling DNA Nanoarrays|url=http://science.sciencemag.org/content/327/5961/78|journal=Science|language=en|volume=327|issue=5961|pages=78–81|doi=10.1126/science.1181498|issn=0036-8075|pmid=19892942|bibcode=2010Sci...327...78D}} 82. ^{{cite web|url=http://www.completegenomics.com/|title=About Us - Complete Genomics|last=brandonvd|website=Complete Genomics|language=en-US|access-date=2018-07-02}} 83. ^1 {{Cite journal|last=Huang|first=Jie|last2=Liang|first2=Xinming|last3=Xuan|first3=Yuankai|last4=Geng|first4=Chunyu|last5=Li|first5=Yuxiang|last6=Lu|first6=Haorong|last7=Qu|first7=Shoufang|last8=Mei|first8=Xianglin|last9=Chen|first9=Hongbo|date=2017-05-01|title=A reference human genome dataset of the BGISEQ-500 sequencer|url=https://academic.oup.com/gigascience/article/6/5/1/3098240|journal=GigaScience|language=en|volume=6|issue=5|pages=1–9|doi=10.1093/gigascience/gix024|pmc=5467036|pmid=28379488}} 84. ^{{cite journal | vauthors = Valouev A, Ichikawa J, Tonthat T, Stuart J, Ranade S, Peckham H, Zeng K, Malek JA, Costa G, McKernan K, Sidow A, Fire A, Johnson SM | title = A high-resolution, nucleosome position map of C. elegans reveals a lack of universal sequence-dictated positioning | journal = Genome Res. | volume = 18 | issue = 7 | pages = 1051–63 | date = July 2008 | pmid = 18477713 | pmc = 2493394 | doi = 10.1101/gr.076463.108 }} 85. ^{{cite journal | vauthors = Rusk N | year = 2011 | title = Torrents of sequence | url = | journal = Nat Methods | volume = 8 | issue = 1| page = 44 | doi=10.1038/nmeth.f.330}} 86. 
^{{cite journal | vauthors = Porreca GJ | title = Genome Sequencing on Nanoballs | journal = Nature Biotechnology | volume = 28 | issue = 1 | pages = 43–44 | year = 2010 | pmid = 20062041 | doi = 10.1038/nbt0110-43 }} 87. ^[https://web.archive.org/web/20100825003320/http://www.completegenomics.com/news-events/press-releases/ Complete Genomics] Press release, 2010 88. ^{{cite web|url=http://www.helicosbio.com/Products/HelicosregGeneticAnalysisSystem/HeliScopetradeSequencer/tabid/87/Default.aspx|archiveurl=https://web.archive.org/web/20091102041828/http://www.helicosbio.com/Products/HelicosregGeneticAnalysisSystem/HeliScopetradeSequencer/tabid/87/Default.aspx|archivedate=2009-11-02|title=HeliScope Gene Sequencing / Genetic Analyzer System : Helicos BioSciences|date=2 November 2009}} 89. ^{{cite book | vauthors = Thompson JF, Steinmann KE | title = Single molecule sequencing with a HeliScope genetic analysis system | journal = Current Protocols in Molecular Biology | volume = Chapter 7 | pages = Unit7.10 | date = October 2010 | pmid = 20890904 | pmc = 2954431 | doi = 10.1002/0471142727.mb0710s92 | isbn = 978-0471142720 }} 90. ^{{cite web |url=http://seqll.com/technical-description/ |archive-url=https://web.archive.org/web/20140808055229/http://seqll.com/technical-description/ |dead-url=yes |archive-date=8 August 2014 |publisher=SeqLL |accessdate=9 Aug 2015 |title=tSMS SeqLL Technical Explanation}} 91. ^{{cite book|author1=Sara El-Metwally |title=Next Generation Sequencing Technologies and Challenges in Sequence Assembly |volume=7 |author2=Osama M. Ouda |author3=Mohamed Helmy |work=Next Generation Sequencing Technologies and Challenges in Sequence Assembly, Springer Briefs in Systems Biology Volume 7 |year=2014 |pages=51–59|doi=10.1007/978-1-4939-0715-1_6 |chapter=New Horizons in Next-Generation Sequencing |series=SpringerBriefs in Systems Biology |isbn=978-1-4939-0714-4 }} 92. 
^{{cite web|url=http://www.genomeweb.com/sequencing/pacbio-sales-start-pick-company-delivers-product-enhancements|title=PacBio Sales Start to Pick Up as Company Delivers on Product Enhancements}} 93. ^{{cite web|url=http://www.bio-itworld.com/2015/9/30/pacbio-announces-sequel-sequencing-system.aspx|title=Bio-IT World|website=www.bio-itworld.com}} 94. ^{{cite web|url=https://www.genomeweb.com/business-news/pacbio-launches-higher-throughput-lower-cost-single-molecule-sequencing-system|title=PacBio Launches Higher-Throughput, Lower-Cost Single-Molecule Sequencing System}} 95. ^{{Cite journal|last=Clarke|first=James|last2=Wu|first2=Hai-Chen|last3=Jayasinghe|first3=Lakmal|last4=Patel|first4=Alpesh|last5=Reid|first5=Stuart|last6=Bayley|first6=Hagan|date=2009-04-01|title=Continuous base identification for single-molecule nanopore DNA sequencing|url=http://www.nature.com/nnano/journal/v4/n4/full/nnano.2009.12.html|journal=Nature Nanotechnology|language=en|volume=4|issue=4|pages=265–70|doi=10.1038/nnano.2009.12|issn=1748-3387|pmid=19350039|bibcode=2009NatNa...4..265C}} 96. ^1 {{cite journal|year=2012|title=Fabrication and characterization of solid-state nanopore arrays for high-throughput DNA sequencing|journal=Nanotechnology|volume=23|issue=38|page=385308|bibcode=2012Nanot..23L5308D|doi=10.1088/0957-4484/23/38/385308|pmc=3557807|pmid=22948520|vauthors=dela Torre R, Larkin J, Singer A, Meller A}} 97. ^1 {{cite journal|year=2012|title=Double-functionalized nanopore-embedded gold electrodes for rapid DNA sequencing|url=|journal=Applied Physics Letters|volume=100|issue=2|page=023701|doi=10.1063/1.3673335|vauthors=Pathak B, Lofas H, Prasongkit J, Grigoriev A, Ahuja R, Scheicher RH|bibcode=2012ApPhL.100b3701P}} 98. 
^{{cite journal|year=2008|title=Selective aluminum passivation for targeted immobilization of single DNA polymerase molecules in zero-mode waveguide nanostructures|journal=Proceedings of the National Academy of Sciences|volume=105|issue=4|pages=1176–81|bibcode=2008PNAS..105.1176K|doi=10.1073/pnas.0710982105|pmc=2234111|pmid=18216253|vauthors=Korlach J, Marks PJ, Cicero RL, Gray JJ, Murphy DL, Roitman DB, Pham TT, Otto GA, Foquet M, Turner SW}} 99. ^{{cite web |url=http://mcb.harvard.edu/branton/index.htm |archive-url=https://web.archive.org/web/20020221002907/http://mcb.harvard.edu/branton/index.htm |dead-url=yes |archive-date=21 February 2002 |title=The Harvard Nanopore Group |publisher=Mcb.harvard.edu |accessdate=2009-11-15 |df=dmy-all }} 100. ^{{cite web |url=http://www.physorg.com/news157378086.html |title=Nanopore Sequencing Could Slash DNA Analysis Costs |accessdate=}} 101. ^{{US patent reference|number=20060029957|y=2005|m=07|d=14 |inventor=ZS Genetics|title=Systems and methods of analyzing nucleic acid polymers and related components}} 102. ^{{cite journal | vauthors = Xu M, Fujita D, Hanagata N | title = Perspectives and challenges of emerging single-molecule DNA sequencing technologies | journal = Small | volume = 5 | issue = 23 | pages = 2638–49 | date = December 2009 | pmid = 19904762 | doi = 10.1002/smll.200900976 }} 103. ^{{cite journal | vauthors = Schadt EE, Turner S, Kasarskis A | title = A window into third-generation sequencing | journal = Human Molecular Genetics | volume = 19 | issue = R2 | pages = R227–40 | year = 2010 | pmid = 20858600 | doi = 10.1093/hmg/ddq416 }} 104. ^{{cite journal | vauthors = Xu M, Endres RG, Arakawa Y | title = The electronic properties of DNA bases | journal = Small | volume = 3 | issue = 9 | pages = 1539–43 | year = 2007 | pmid = 17786897 | doi = 10.1002/smll.200600732}} 105. 
^{{cite journal | vauthors = Di Ventra M | title = Fast DNA sequencing by electrical means inches closer | journal = Nanotechnology | volume = 24 | issue = 34 | page = 342501 | year = 2013 | pmid = 23899780 | doi = 10.1088/0957-4484/24/34/342501 | bibcode = 2013Nanot..24H2501D }} 106. ^{{cite journal | vauthors = Ohshiro T, Matsubara K, Tsutsui M, Furuhashi M, Taniguchi M, Kawai T | title = Single-molecule electrical random resequencing of DNA and RNA | journal = Sci Rep | volume = 2 | issue = | page = 501 | year = 2012 | pmid = 22787559 | pmc = 3392642 | doi = 10.1038/srep00501 |bibcode = 2012NatSR...2E.501O }} 107. ^{{cite journal | vauthors = Hanna GJ, Johnson VA, Kuritzkes DR, Richman DD, Martinez-Picado J, Sutton L, Hazelwood JD, D'Aquila RT | title = Comparison of Sequencing by Hybridization and Cycle Sequencing for Genotyping of Human Immunodeficiency Virus Type 1 Reverse Transcriptase | journal = J. Clin. Microbiol. | volume = 38 | issue = 7 | pages = 2715–21 | date = 1 July 2000 | pmid = 10878069 | pmc = 87006 | url = http://jcm.asm.org/cgi/pmidlookup?view=long&pmid=10878069 }} 108. ^1 {{cite journal | vauthors = Morey M, Fernández-Marmiesse A, Castiñeiras D, Fraga JM, Couce ML, Cocho JA | title = A glimpse into past, present, and future DNA sequencing | journal = Molecular Genetics and Metabolism | volume = 110 | issue = 1–2 | pages = 3–24 | year = 2013 | pmid = 23742747 | pmc = | doi = 10.1016/j.ymgme.2013.04.024 }} 109. ^{{cite journal | vauthors = Qin Y, Schneider TM, Brenner MP | title = Sequencing by Hybridization of Long Targets | journal = PLoS ONE | volume = 7 | issue = 5 | pages = e35819 | year = 2012 | pmid = 22574124 | pmc = 3344849 | doi = 10.1371/journal.pone.0035819 | editor1-last = Gibas | bibcode = 2012PLoSO...735819Q | editor1-first = Cynthia }} 110. 
^{{cite journal | vauthors = Edwards JR, Ruparel H, Ju J | title = Mass-spectrometry DNA sequencing | journal = Mutation Research | volume = 573 | issue = 1–2 | pages = 3–12 | year = 2005 | pmid = 15829234 | doi = 10.1016/j.mrfmmm.2004.07.021 }} 111. ^{{cite journal | vauthors = Hall TA, Budowle B, Jiang Y, Blyn L, Eshoo M, Sannes-Lowery KA, Sampath R, Drader JJ, Hannis JC, Harrell P, Samant V, White N, Ecker DJ, Hofstadler SA | title = Base composition analysis of human mitochondrial DNA using electrospray ionization mass spectrometry: A novel tool for the identification and differentiation of humans | journal = Analytical Biochemistry | volume = 344 | issue = 1 | pages = 53–69 | year = 2005 | pmid = 16054106 | doi = 10.1016/j.ab.2005.05.028 }} 112. ^{{cite journal | vauthors = Howard R, Encheva V, Thomson J, Bache K, Chan YT, Cowen S, Debenham P, Dixon A, Krause JU, Krishan E, Moore D, Moore V, Ojo M, Rodrigues S, Stokes P, Walker J, Zimmermann W, Barallon R | title = Comparative analysis of human mitochondrial DNA from World War I bone samples by DNA sequencing and ESI-TOF mass spectrometry | journal = Forensic Science International: Genetics | volume = 7 | issue = 1 | pages = 1–9 | date = 15 Jun 2011 | pmid = 21683667 | doi = 10.1016/j.fsigen.2011.05.009 }} 113. ^{{cite journal | vauthors = Monforte JA, Becker CH | title = High-throughput DNA analysis by time-of-flight mass spectrometry | journal = Nature Medicine | volume = 3 | issue = 3 | pages = 360–62 | date = 1 March 1997 | pmid = 9055869 | doi = 10.1038/nm0397-360 }} 114. 
^{{cite journal | vauthors = Beres SB, Carroll RK, Shea PR, Sitkiewicz I, Martinez-Gutierrez JC, Low DE, McGeer A, Willey BM, Green K, Tyrrell GJ, Goldman TD, Feldgarden M, Birren BW, Fofanov Y, Boos J, Wheaton WD, Honisch C, Musser JM | title = Molecular complexity of successive bacterial epidemics deconvoluted by comparative pathogenomics | journal = Proceedings of the National Academy of Sciences | volume = 107 | issue = 9 | pages = 4371–76 | date = 8 February 2010 | pmid = 20142485 | pmc = 2840111 | doi = 10.1073/pnas.0911295107 | bibcode = 2010PNAS..107.4371B }} 115. ^{{cite journal | vauthors = Kan CW, Fredlake CP, Doherty EA, Barron AE | title = DNA sequencing and genotyping in miniaturized electrophoresis systems | journal = Electrophoresis | volume = 25 | issue = 21–22 | pages = 3564–88 | date = 1 November 2004 | pmid = 15565709 | doi = 10.1002/elps.200406161 }} 116. ^{{cite journal | vauthors = Chen YJ, Roller EE, Huang X | title = DNA sequencing by denaturation: experimental proof of concept with an integrated fluidic device | journal = Lab on a Chip | volume = 10 | issue = 9 | pages = 1153–59 | year = 2010 | pmid = 20390134 | pmc = 2881221 | doi = 10.1039/b921417h }} 117. ^{{cite journal | vauthors = Bell DC, Thomas WK, Murtagh KM, Dionne CA, Graham AC, Anderson JE, Glover WR | title = DNA Base Identification by Electron Microscopy | journal = Microscopy and Microanalysis : The Official Journal of Microscopy Society of America, Microbeam Analysis Society, Microscopical Society of Canada | volume = 18 | issue = 5 | pages = 1049–53 | date = 9 Oct 2012 | pmid = 23046798 | doi = 10.1017/S1431927612012615 | bibcode = 2012MiMic..18.1049B }} 118. ^{{cite journal | vauthors = Pareek CS, Smoczynski R, Tretyn A | title = Sequencing technologies and genome sequencing | journal = Journal of Applied Genetics | volume = 52 | issue = 4 | pages = 413–35 | date = November 2011 | pmid = 21698376 | pmc = 3189340 | doi = 10.1007/s13353-011-0057-x }} 119. 
^{{cite journal | vauthors = Pareek CS, Smoczynski R, Tretyn A | title = Sequencing technologies and genome sequencing | journal = Journal of Applied Genetics | volume = 52 | issue = 4 | pages = 413–35 | year = 2011 | pmid = 21698376 | pmc = 3189340 | doi = 10.1007/s13353-011-0057-x }} 120. ^{{cite journal | vauthors = Fujimori S, Hirai N, Ohashi H, Masuoka K, Nishikimi A, Fukui Y, Washio T, Oshikubo T, Yamashita T, Miyamoto-Sato E | title = Next-generation sequencing coupled with a cell-free display technology for high-throughput production of reliable interactome data | journal = Scientific Reports | volume = 2 | page = 691 | year = 2012 | pmid = 23056904 | pmc = 3466446 | doi = 10.1038/srep00691 | bibcode = 2012NatSR...2E.691F }} 121. ^{{cite journal | vauthors = Harbers M | year = 2008 | title = The Current Status of cDNA Cloning | url = | journal = Genomics | volume = 91 | issue = 3| pages = 232–42 | doi = 10.1016/j.ygeno.2007.11.004 | pmid = 18222633 }} 122. ^{{cite journal |vauthors=Alberti A, Belser C, Engelen S, Bertrand L, Orvain C, Brinas L, Cruaud C, etal | year = 2014 | title = Comparison of Library Preparation Methods Reveals Their Impact on Interpretation of Metatranscriptomic Data | journal = BMC Genomics | volume = 15 | issue = | pages = 912–12 | doi = 10.1186/1471-2164-15-912 | pmid=25331572 | pmc=4213505}} 123. ^{{cite web| url= https://www.illumina.com/content/dam/illumina-marketing/documents/products/appnotes/library-qc-fragment-analyzer-application-note-770-2017-002.pdf|title=Scalable Nucleic Acid Quality Assessments for Illumina Next-Generation Sequencing Library Prep|accessdate=2017-12-27}} 124. ^{{cite web|url=http://genomics.xprize.org/|title=Archon Genomics XPRIZE|website=Archon Genomics XPRIZE}} 125. ^{{cite web|url=http://www.genome.gov/10000004|title=Grant Information|website=National Human Genome Research Institute (NHGRI)}} 126. 
^{{cite journal | vauthors = Severin J, Lizio M, Harshbarger J, Kawaji H, Daub CO, Hayashizaki Y, Bertin N, Forrest AR | title = Interactive visualization and analysis of large-scale sequencing datasets using ZENBU | journal = Nat. Biotechnol. | volume = 32 | issue = 3 | pages = 217–19 | year = 2014 | pmid = 24727769 | doi = 10.1038/nbt.2840 }} 127. ^{{cite journal |vauthors=Shmilovici A, Ben-Gal I | title = Using a VOM model for reconstructing potential coding regions in EST sequences|journal=Computational Statistics | year = 2007 | volume = 22 | issue = 1 | pages = 49–69 | doi = 10.1007/s00180-007-0021-8 | url = http://www.eng.tau.ac.il/~bengal/VOM_EST.pdf }} 128. ^{{cite journal | vauthors = Del Fabbro C, Scalabrin S, Morgante M, Giorgi FM | title = An Extensive Evaluation of Read Trimming Effects on Illumina NGS Data Analysis | journal = PLoS ONE | volume = 8 | issue = 12 | pages = e85024 | year = 2013 | pmid = 24376861 | pmc = 3871669 | doi = 10.1371/journal.pone.0085024 | bibcode = 2013PLoSO...885024D }} 129. ^{{cite journal|last1=Martin|first1=Marcel|title=Cutadapt removes adapter sequences from high-throughput sequencing reads|journal=EMBnet.journal|date=2 May 2011|volume=17|issue=1|page=10|doi=10.14806/ej.17.1.200}} 130. ^{{cite journal|last1=Smeds|first1=Linnéa|last2=Künstner|first2=Axel|last3=Donlin|first3=Maureen J.|title=ConDeTri - A Content Dependent Read Trimmer for Illumina Data|journal=PLoS ONE|date=19 October 2011|volume=6|issue=10|pages=e26314|doi=10.1371/journal.pone.0026314|bibcode = 2011PLoSO...626314S|pmid=22039460|pmc=3198461}} 131. 
^{{cite book|last1=Spandow|first1=O|last2=Hellström|first2=S|last3=Schmidt|first3=SH|last4=De Paoli|first4=Emanuale|last5=Policriti|first5=Alberto|title=ERNE-BS5: Aligning BS-treated Sequences by Multiple Hits on a 5-letters Alphabet|journal=Proceedings of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine|date=2012|volume=12|pages=12–19|doi=10.1145/2382936.2382938|isbn=9781450316705}} 132. ^{{cite journal|last1=Schmieder|first1=R.|last2=Edwards|first2=R.|title=Quality control and preprocessing of metagenomic datasets|journal=Bioinformatics|date=28 January 2011|volume=27|issue=6|pages=863–64|doi=10.1093/bioinformatics/btr026|pmid=21278185|pmc=3051327}} 133. ^{{cite journal|last1=Bolger|first1=A. M.|last2=Lohse|first2=M.|last3=Usadel|first3=B.|title=Trimmomatic: a flexible trimmer for Illumina sequence data|journal=Bioinformatics|date=1 April 2014|volume=30|issue=15|pages=2114–20|doi=10.1093/bioinformatics/btu170|pmid=24695404|pmc=4103590}} 134. ^{{cite journal|last1=Cox|first1=Murray P|last2=Peterson|first2=Daniel A|last3=Biggs|first3=Patrick J|title=SolexaQA: At-a-glance quality assessment of Illumina second-generation sequencing data|journal=BMC Bioinformatics|date=2010|volume=11|issue=1|page=485|doi=10.1186/1471-2105-11-485|pmid=20875133|pmc=2956736}} 135. ^{{cite journal|last1=Murray|first1=TH|title=Ethical issues in human genome research.|journal=FASEB Journal|date=January 1991|volume=5|issue=1|pages=55–60|pmid=1825074|doi=10.1096/fasebj.5.1.1825074}} 136. ^1 2 {{cite journal|last1=Robertson|first1=John A.|title=The $1000 Genome: Ethical and Legal Issues in Whole Genome Sequencing of Individuals|journal=The American Journal of Bioethics|date=August 2003|volume=3|issue=3|pages=35–42|doi=10.1162/152651603322874762|pmid=14735880}} 137. 
^1 {{cite news|last1=Henderson|first1=Mark|title=Human genome sequencing: the real ethical dilemmas|url=https://www.theguardian.com/science/2013/sep/09/genetics-ethics-human-gene-sequencing|newspaper=The Guardian|accessdate=20 May 2015|date=2013-09-09}} 138. ^{{cite news|last1=Harmon|first1=Amy|title=Insurance Fears Lead Many to Shun DNA Tests|url=https://www.nytimes.com/2008/02/24/health/24dna.html?pagewanted=all&_r=0|website=The New York Times|accessdate=20 May 2015|date=24 February 2008}} 139. ^Statement of Administration policy, Executive Office of the President, Office of Management and Budget, 27 April 2007 140. ^{{cite news |url=http://www.genome.gov/27026050|title=President Bush Signs the Genetic Information Nondiscrimination Act of 2008|author=National Human Genome Research Institute|date=21 May 2008| accessdate=17 Feb 2014}} 141. ^{{cite web|last1=Baker|first1=Monya|title=US ethics panel reports on DNA sequencing and privacy|url=http://blogs.nature.com/news/2012/10/us-ethics-panel-reports-on-dna-sequencing-and-privacy.html|website=Nature New Blog|accessdate=20 May 2015}} 142. ^{{cite web|title=Privacy and Progress in Whole Genome Sequencing|url=http://bioethics.gov/sites/default/files/PrivacyProgress508_1.pdf|publisher=Presidential Commission for the Study of Bioethical Issues|accessdate=20 May 2015}} 143. ^{{cite journal|last1=Goldenberg|first1=Aaron J.|last2=Sharp|first2=Richard R.|title=The Ethical Hazards and Programmatic Challenges of Genomic Newborn Screening|journal=JAMA|date=1 February 2012|volume=307|issue=5|pages=461–2|doi=10.1001/jama.2012.68|pmid=22298675|pmc=3868436}} 144. ^{{cite web|last1=Hughes|first1=Virginia|title=It's Time To Stop Obsessing About the Dangers of Genetic Information|url=http://www.slate.com/articles/health_and_science/medical_examiner/2013/01/ethics_of_genetic_information_whole_genome_sequencing_is_here_and_we_need.html|website=Slate Magazine|accessdate=22 May 2015|date=2013-01-07}} 145. 
^1 {{cite journal|last1=Bloss|first1=Cinnamon S.|last2=Schork|first2=Nicholas J.|last3=Topol|first3=Eric J.|title=Effect of Direct-to-Consumer Genomewide Profiling to Assess Disease Risk|journal=New England Journal of Medicine|date=10 February 2011|volume=364|issue=6|pages=524–34|doi=10.1056/NEJMoa1011893|pmid=21226570|pmc=3786730}} 146. ^{{cite news|last1=Rochman|first1=Bonnie|title=What Your Doctor Isn’t Telling You About Your DNA|url=http://healthland.time.com/2012/10/25/what-your-doctor-isnt-telling-you-about-your-dna/|website=Time.com|accessdate=22 May 2015|date=25 October 2012}} 147. ^1 {{cite journal | vauthors = Bentley DR, Balasubramanian S, et al. | title = Accurate whole human genome sequencing using reversible terminator chemistry | journal = Nature | volume = 456 | issue = 7218 | pages = 53–59 | year = 2008 | pmid = 18987734 | pmc = 2581791 | doi = 10.1038/nature07517 |bibcode = 2008Natur.456...53B }} 148. ^1 2 {{cite journal | vauthors = Drmanac R, Sparks AB, et al. | title = Human Genome Sequencing Using Unchained Base Reads in Self-Assembling DNA Nanoarrays | journal = Science | volume = 327 | issue = 5961 | pages = 78–81 | year = 2010 | pmid = 19892942 | doi = 10.1126/science.1181498 | bibcode = 2010Sci...327...78D }} 149. ^1 {{cite journal | vauthors = Feldmann H, et al. | title = Complete DNA sequence of yeast chromosome II | journal = EMBO J. | volume = 13 | issue = 24 | pages = 5795–809 | year = 1994 | pmid = 7813418 | pmc = 395553 | doi = 10.1002/j.1460-2075.1994.tb06923.x }} 150. ^1 {{cite journal | vauthors = Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, et al. | title = Initial sequencing and analysis of the human genome | journal = Nature | volume = 409 | issue = 6822 | pages = 860–921 | date = February 2001 | pmid = 11237011 | doi = 10.1038/35057062 | bibcode = 2001Natur.409..860L | url = https://deepblue.lib.umich.edu/bitstream/2027.42/62798/1/409860a0.pdf }} 151. ^1 2 {{cite journal | vauthors = Margulies M, Egholm M, et al. 
| title = Genome Sequencing in Open Microfabricated High Density Picoliter Reactors | journal = Nature | volume = 437 | issue = 7057 | pages = 376–80 | date = September 2005 | pmid = 16056220 | pmc = 1464427 | doi = 10.1038/nature03959 | bibcode = 2005Natur.437..376M }} 152. ^1 {{cite journal | vauthors = Venter JC, Adams MD, et al. | title = The sequence of the human genome | journal = Science | volume = 291 | issue = 5507 | pages = 1304–51 | date = February 2001 | pmid = 11181995 | doi = 10.1126/science.1058040 | bibcode = 2001Sci...291.1304V }}