WO2010125351A1 - Simultaneous sequencing of multiple nucleic acid sequences - Google Patents

Simultaneous sequencing of multiple nucleic acid sequences Download PDF

Info

Publication number
WO2010125351A1
WO2010125351A1 PCT/GB2010/000861 GB2010000861W WO2010125351A1 WO 2010125351 A1 WO2010125351 A1 WO 2010125351A1 GB 2010000861 W GB2010000861 W GB 2010000861W WO 2010125351 A1 WO2010125351 A1 WO 2010125351A1
Authority
WO
WIPO (PCT)
Prior art keywords
sequencing
nucleic acid
megaprimer
primers
primer
Prior art date
Application number
PCT/GB2010/000861
Other languages
French (fr)
Inventor
Jose Alejandro Madrigal
Steven George Edward Marsh
Neema Philippa Mayor
Christopher Henry Roberts
Original Assignee
The Anthony Nolan Trust
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by The Anthony Nolan Trust filed Critical The Anthony Nolan Trust
Publication of WO2010125351A1 publication Critical patent/WO2010125351A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing

Definitions

  • the invention relates to the simultaneous sequencing of multiple nucleic acid sequences.
  • Simultaneous sequencing is the process of performing nucleic acid sequencing reactions in such a way that the sequences of a plurality of different templates are detected in multiplex, requiring the use of only a single reaction vessel and sequencing capillary.
  • the term 'unidirectional sequencing' refers herein to the sequencing of a single strand of a polynucleotide template and may be performed on either the sense or anti-sense strand of a template that is either single or double-stranded.
  • the term 'bidirectional sequencing' refers meanwhile to the sequencing of complementary sense and anti-sense strands of a double-stranded nucleic acid template.
  • the 'Sanger' method of chain-terminator sequencing 1 is the industry standard for nucleic acid sequence analysis, and a well-optimised sequencing reaction is able to produce around 1500 bases of continuous sequence data. Many sequencing requirements cannot take full advantage of this potential because the sequence of interest may be much shorter than the maximum possible and because it is often necessary to sequence multiple regions of the genome that are not in linkage disequilibrium. When this is the case, it has traditionally been necessary to perform numerous separate sequencing reactions, leading to a hyperbolic increase in the per sample cost of sequencing and a proportional decrease in sample throughput.
  • SimulSeq could also be used to perform simultaneous bi-directional sequencing, where both strands of an individual double-stranded template were detected in multiplex; but this was similarly limited to templates of length shorter than 150 bp.
  • the limitations of SimulSeq are due to its reliance on commercially synthesised oligonucleotide primers and the fact that these cannot be prepared at lengths exceeding the 150 bp - 200 bp limit of current oligonucleic acid synthesis techniques.
  • no simultaneous sequencing or bidirectional sequencing method has allowed a truly flexible approach that can be applied to any group of templates where there would be a practical benefit to performing this type of sequencing assay and where the combined lengths may stretch into the hundreds or early thousands of bases.
  • Automated 'Sanger' sequencing reactions are enabled by the extension of oligonucleotide primers 7 and these reactions produce nested, chain-terminated 1 and fluorescently labelled 8 sequence fragments of every possible length greater than that of the oligonucleotide primer and lesser than or equal to the length of the template nucleic acid.
  • the earliest fluorescence signal detected by the DNA analysis system during electrophoresis is from those fragments where the first primer extension step incorporates a chain terminator base.
  • Increasing the length of the oligonucleotide primer can delay the onset of this first signal because the rate of the electrophoretic migration of the sequence fragments is proportional to their masses.
  • the invention provides a method and a kit for the simultaneous sequencing of multiple nucleic acid sequences.
  • the method is called "Short Template Amplicon and Multiplex Megaprimer Enabled Relay” (STAMMER) sequencing.
  • STAMMER Short Template Amplicon and Multiplex Megaprimer Enabled Relay
  • the method provides a platform that is superior to pre-existing methods for simultaneous sequencing, including the bidirectional sequencing of double- stranded nucleic acid templates.
  • the invention provides a method for simultaneously sequencing a plurality of nucleic acid target sites, comprising (a) annealing each of a plurality of different-sized primers to the 5 '-end of each of said plurality of nucleic acid target sites; (b) performing a sequencing reaction by simultaneously extending said primers in the presence of labelled terminators and thereby obtaining a pool of extended primers in a single reaction vessel; and (c) determining the sequences of the nucleic acid target sites by analysing said pool of extended primers without separating the pool prior to analysis; wherein the largest extension product of each primer is smaller in length than the next largest, non-extended primer, and wherein at least one of the primers is a megaprimer comprising double-stranded nucleic acid of at least 100 base pairs in length.
  • the invention also provides a kit for performing the method of the invention, wherein the kit comprises (a) at least one megaprimer comprising double-stranded nucleic acid of at least 100 base pairs in length and
  • Figure 1 shows how the electrophoretic migration of labelled nucleic acid sequencing fragments can be slowed in order to delay the onset of detectable signal until after a previous sequence has fully migrated past the optical detector.
  • (1) is the 130 bp Megaprimer Target Region of a Short Template Amplicon of 260 bp of which (2) is the region of interest.
  • (3) is the 276 bp Megaprimer Target Region of a second Short Template Amplicon of 410 bp of which (4) is the region of interest.
  • (5) is the forward strand of a 130 bp megaprimer specific to (1) and (5b) is a silent period during which no sequence detection occurs during electrophoresis. The duration of (5b) is proportional to the molecular weight of the megaprimer (5).
  • (6) shows how megaprimer (5) is extended from the 3' end during the sequencing reaction and (6b) is the detectable sequence relating to this.
  • (7) is the reverse strand of megaprimer (5). Unlike (5), this strand cannot be extended during the sequencing reaction, as there is no downstream template.
  • (8) shows the forward strand of a 276 bp megaprimer specific to (3) and (8b) is the silent period of electrophoresis, during which no sequencing products relating to the extension of the megaprimer (8) are detected.
  • (9) shows how megaprimer (8) is extended from the 3' end during the sequencing reaction and (9b) is the detectable sequence relating to this.
  • (10) is the reverse strand of megaprimer (8).
  • FIG. 1 is a period of silence wherein all products of (6) have passed the detector but where the products of (9) have yet to be detectable.
  • the lower part of Figure 1 shows how the electrophoresis chromatogram would be expected to look in the results of a STAMMER sequencing reaction based upon the model in the upper part of Figure 1.
  • Figure 2 A is the electrophoresis chromatogram of the short template amplicon (1- 2) sequence derived from the forward strand (5) of the dsDNA megaprimer (5 and 7) and corresponds to the sequence described in Figure 1 as (6b).
  • Figure 2B is the electrophoresis chromatogram of the short template amplicon (3-4) sequence derived from the forward strand (8) of the dsDNA megaprimer (8 and 10) and corresponds to the sequence described in Figure 1 as (9b).
  • Figure 2C is the electrophoresis chromatogram of a STAMMER multiplex reaction in which both of the sequencing reactions described in Figures 2 A and 2B were carried out in multiplex according to the model described in Figure 1.
  • Figure 2D shows a portion of the fully analysed and base-called nucleotide sequence derived from the electrophoresis chromatogram pictured in 2C.
  • Sequence 6b is the NOD2/CARD15 exon 11 short template amplicon and sequence 9b is the NOD2/CARD15 exon 4 short template amplicon. They were prepared according to the primers described in table 1 and amplification and sequencing parameters were identical to those used with regards to the experiments described in figures 3 and 4 (see Example 1). Exon 11 (4A) and Exon 4 (4B) were detectable in the periods equivalent to size ranges 131-260 bp and 277-410 bp respectively. The quality data for the sequence shown in 9c is labelled Q and is prepared using the Phred algorithm according to Ewing, B. & Green, P., Base-calling of automated sequencer traces using Phred. II. Error probabilities.
  • Genome Res 8, 186-94 (1998) and Ewing, B., Hillier, L., Wendl, M.C. & Green, P. Base-calling of automated sequencer traces using Phred. I. Accuracy assessment. Genome Res 8, 175-85 (1998).
  • Figure 3 shows the STAMMER sequencing principle as applied to the NOD2/CARD15 gene.
  • Figure 3A shows the assay design using three pairs of Short Template Amplicons (STA), each of which is composed of a region of interest (ROI) and a megaprimer target region (MTR) which is complementary to one of the three corresponding megaprimers (MGP). The regions of interest are spaced some 18 kbp apart in total.
  • Figure 3B shows how i) priming of the MTR and extension of the megaprimer forward strand produces ii) nested and chain terminated sequence fragments complementary to the short template amplicon and how iii) there is no potential for extension of the megaprimer reverse strand due to a lack of a suitable extensible template.
  • Figure 3C shows how the detection of fluorescence signals from the three sets of nested megaprimer extension fragments would be predicted to occur on-line, in the manner of a relay.
  • Figure 5 shows a schematic of STAMMER applied to the bidirectional sequencing of a single short template amplicon.
  • Figure 5A shows how the anti- sense strand of a megaprimer (MGP-AS) and a sense oligonucleotide primer (OP) respectively hybridise to the sense and anti-sense strands of a short template amplicon.
  • the two primers extend during the cycle sequencing reaction in the direction of the solid arrows.
  • the sense strand of the megaprimer (MGP-S) does not extend due to a lack of suitable template.
  • a 3 '-propyl-oligonucleotide 'roadblock' oligonucleotide (RB) hybridises upstream of the megaprimer sense strand.
  • Figure 5B shows how a multiplex bidirectional STAMMER sequencing reaction can be performed according to this model.
  • the megaprimer enabled sequence covers the region of interest and some irrelevant flanking sequences whilst the oligonucleotide primer enabled sequence covers the complementary strand over the region of interest.
  • Figure 5C is the base called electropherographic data output of a complete STAMMER bi-directional sequencing reaction. Sequence quality is demonstrated by the topographical detail where vertical bars above the dashed horizontal line labelled Q20, represent base calls assigned with more than 99% accuracy according to Ewing et al. Guides to the actual scale of fluorescence intensity have been digitally removed from these images for reasons of clarity.
  • Figure 6 shows the effects of a roadblock on the sequencing reaction.
  • the raw chromatographic data in 6A shows the effect of a roadblock oligonucleotide on the progression of the otherwise unimpeded sequencing reaction shown in 6B.
  • the arrow marks the position where the sequencing enzyme is halted by the obstacle of the 5' end of the roadblock molecule.
  • STAMMER differs from the SimulSeq method partly in that it is based upon the use of one or more double-stranded polynucleotide 'megaprimers', which are generally PCR products and are used in the enabling of the sequencing reaction in place of classical oligonucleotide primers. Megaprimers of any length amenable to PCR can be constructed, thus relieving the constraints on template length imposed on 'SimulSeq' by the limitations of commercial oligonucleic acid synthesis technology.
  • Megaprimers of the invention are at least 100 bp in length, optionally at least 120 bp, at least 140 bp, at least 160 bp, at least 180 bp, at least 200 bp, at least 220 bp, at least 240 bp, at least 260 bp, at least 280 bp, or at least 300 bp in length.
  • the maximum length of a megaprimer is about 1500 bp but the length of the largest megaprimer is typically less than this; for example the length of the largest megaprimer is typically up to 1000 bp, 700 bp or 500 bp.
  • the lengths of the megaprimers can be tailored so that the electrophoretic migration rate (and therefore the detection of primer extension products relating to the megaprimer enabled sequences) may be delayed by the equivalent of tens, hundreds or potentially thousands of bases (see Figures 1 and 2). This allows an opportunity for one or more complete oligonucleotide and/or megaprimer enabled sequence(s) of any given length shorter than the subsequent megaprimer to migrate past the point of signal detection before the onset of detectable signal from the subsequent megaprimer enabled sequence(s).
  • Megaprimers can also be prepared by a number of non-PCR methods, such as restriction digestion with blunt end cutting restriction endonucleases, followed by purification of the desired fragment. Similarly, restriction endonuclease digestion with enzymes producing sticky ends, followed by degradation of the sticky ends to blunt ends using appropriate enzymes (generally possessing exonuclease activity), could be used.
  • Megaprimers could also be prepared using non-PCR based methods of DNA amplification. These include, though are not exclusively limited to, isothermal DNA amplification methods including rolling circle amplification (RCA), real time strand displacement amplification (SDA) and multiple displacement amplification (MDA). These methods, and RCA in particular, might be used in combination with restriction endonuclease digestion.
  • RCA rolling circle amplification
  • SDA real time strand displacement amplification
  • MDA multiple displacement amplification
  • the STAMMER sequencing technique is dependent not only upon the ability to delay the onset of sequence detection but also upon limiting the lengths of template that cause the individual primer/megaprimer extension reactions to terminate once the region of interest has been passed; thereby providing a
  • megaprimer enabled sequences may initiate. This is achieved by using site-specific quenchers of the sequencing reaction (i.e. roadblock primers) or Short Template Amplicons as templates; the latter are PCR products comprising only the informative region of interest and a 'megaprimer target region', an expanse of sequence flanking the region of interest that is equal in length and complementary to the enabling megaprimer.
  • site-specific quenchers of the sequencing reaction i.e. roadblock primers
  • Short Template Amplicons as templates; the latter are PCR products comprising only the informative region of interest and a 'megaprimer target region', an expanse of sequence flanking the region of interest that is equal in length and complementary to the enabling megaprimer.
  • Each strand of the double-stranded megaprimer is capable of hybridising to its complementary strand in the megaprimer target region, but there is no downstream template for the extension of the second megaprimer strand and the detectable sequence therefore represents only megaprimer extension across the region of interest.
  • Megaprimers may
  • Short Template Amplicon and Multiplex Megaprimer Enabled Relay (STAMMER) sequencing allows simultaneous 'relay' sequencing of practically any combination of nucleic acid templates.
  • Previous systems such as SimulSeq were limited to very short combined template lengths and were only practically applicable in molecular typing assays or in the bidirectional sequencing of very short regions of interest 6 .
  • the STAMMER approach to simultaneous sequencing is applicable to a far wider range of templates than has been previously possible and we believe that it could impact massively upon the costs and throughput of many routine and diagnostic re-sequencing applications.
  • STAMMER sequencing can be used to perform not only full sequencing and consequently molecular typing of known polymorphic positions but also the bidirectional sequencing of individual double-stranded templates.
  • vector specific STAMMER sequencing systems utilising multiple 'roadblock' oligonucleotides will allow simultaneous bidirectional sequencing of circular templates; thus facilitating double-stranded plasmid screening at half the cost and twice the throughput of normal sequencing. It is also possible to design STAMMER sequencing systems whereby several Short Template Amplicons are bidirectionally sequenced in a single assay.
  • the 'roadblock' molecule of the invention is a single-stranded nucleic acid that provides a physical barrier to the progression of a nucleic acid polymerase.
  • the 3 '-end of a roadblock molecule may be modified to prevent the extension of this end. This modification may involve chemical alteration of the 3 '-end, such as the attachment of an alkyl group (e.g. Cl to Cl 8 alkyl, preferably C2 to C6 alkyl, e.g. propyl).
  • an alkyl group e.g. Cl to Cl 8 alkyl, preferably C2 to C6 alkyl, e.g. propyl.
  • Different functional groups such as amino, thiol, or similar, or any group to which one might conjugate a terminating molecule, could also be attached to the 3 '-end of the roadblock to prevent its extension.
  • the last base at the 3 '-end of the roadblock molecule might be a dideoxy base, or might otherwise lack potential to be extended as a result of similar modification of the nucleotide triphosphate structure itself or of the use of a non-classical or non-naturally occurring nucleotide.
  • the 3 '-end of the roadblock may be abasic.
  • primers are more typical; for example the number of primers may be up to 2, 3, 5, 7, 10, 15, 20, 25 or 40.
  • 20 primers could be used to provide 20 sequences of 75 bases in relay.
  • An advantage of the invention is that STAMMER is the only technique that could utilise a future increase in the maximum size of sequencing product resolved by a sequencing capillary (currently about 1500 bp).
  • the total number of bases sequenced and the total number of primers used in a STAMMER multiplex is unlimited, provided that the technology exists to resolve the full range of extension products derived from the multiplex.
  • the molar ratio of each megaprimer to target site is 1 :20 to 20:1, more preferably 1 :10 to 10:1, more preferably 1 :5 to 10:1, more preferably 1 :3 to 3:1, most preferably about 5:1.
  • a further embodiment of the invention involves the removal of the non-priming strand of at least one megaprimer prior to the sequencing reaction.
  • the benefit of doing this is that one might, by removing the non-priming strand, allow the sequencing reaction to behave more like a conventional sequencing reaction in which a very high molar ratio between the oligonucleotide primer and amplicon is used. This increases the signal strength (which may be coupled to an increase in the sequence quality).
  • a single strand of the megaprimer is removed prior to performing the STAMMER reaction.
  • the simplest way of removing a single strand of a megaprimer(s) would be the application of linear-after-the-exponential PCR (LATE-PCR) according to Sanchez, J.A., Pierce, K.E., Rice, J.E. & Wangh, L.J.
  • Linear-after-the-exponential (LATE)-PCR an advanced method of asymmetric PCR and its uses in quantitative real-time analysis. Proc Natl Acad Sci USA 101, 1933-8 (2004).
  • This process of asymmetric PCR uses an excess of forward strand primer to ensure that in the later stages of PCR, the reverse primer is entirely used up and a phase of linear amplification takes place, leading to an excess of single stranded products, which may be used as primers.
  • This may be carried out in a single combined LATE-PCR amplification in one vessel.
  • the short template amplicons, dideoxynucleotides and/or roadblock oligonucleotide(s) required to perform the STAMMER reaction could then be added after a defined number of cycles of such a linear amplification.
  • Another approach might be to selectively deplete a double stranded PCR product of the second strand. This might be carried out using a biotinylated reverse primer.
  • the PCR product that is biotinylated in this way would be bound to a streptavidin paramagnetic bead or solid substrate, then subjected to conditions sufficient to denature the double helix and release the forward strand into solution which may be separated by magnetism or otherwise by simple elution.
  • paramagnetic methods might allow one to perform the second strand depletion in the same reaction vessel used for the STAMMER sequencing reaction. This might require the application of chemical denaturants, followed by the magnetic separation of the bead-bound reverse strands. This would be followed immediately by the sequencing reaction.
  • Single stranded megaprimers might also be constructed using two or more synthetic oligonucleotides representing a continuous sequence, which had been specifically bound to a third, possibly immobilised or bead bound scaffolding oligonucleotide.
  • the action of DNA ligase enzyme could then be used to ligate the two synthetic oligonucleotides end to end, producing a much longer megaprimer, which could then be eluted from the scaffold.
  • the pool of extended primers produced by the sequencing reaction of the invention can be analysed by separating them according to their size. This separation can be achieved by electrophoresis, such as capillary electrophoresis, or by using a non-capillary based electrophoresis sequencing platform (such as a slab gel ABI 377 machine).
  • the separation of extended primer products from a plurality of sequencing reactions can be performed in a single capillary.
  • the chain terminators of the sequencing reactions, dideoxynucleotides (ddATP, ddGTP, ddCTP, or ddTTP), are labelled so that each extended primer product can be detected. For instance, each of the four dideoxynucleotide chain terminators can be labelled with a different fluorescent dye.
  • Non-fluorescent chemical dyes chromophores
  • substrates of enzymatic reactions proteins or radioisotopes, or any other group to which one can attach any of the same
  • the nucleic acids of the invention including sequencing reaction templates, oligonucleotide primers, and megaprimers, can comprise deoxyribonucleic acid, ribonucleic acid, or a copolymer of deoxyribonucleic acid and ribonucleic acid.
  • the nucleic acids of the invention can be natural (i.e. sourced from any organism) or synthetic, and can be made in vitro or in vivo, enzymatically or non- enzymatically.
  • the template nucleic acid can comprise non-natural nucleotide analogues such as deoxyinosine or 7-deaza-2-deoxyguanosine. These analogues destabilize DNA duplexes and could allow a primer annealing and extension reaction to occur in a double-stranded sample without completely separating the strands.
  • the sequencing reaction of the invention may be carried out with a number of different nucleic acid polymerase enzymes, such as DNA polymerase, RNA polymerase, reverse transcriptase or RNA replicase, depending on the composition of the template nucleic acid.
  • nucleic acid polymerase enzymes such as DNA polymerase, RNA polymerase, reverse transcriptase or RNA replicase, depending on the composition of the template nucleic acid.
  • the invention also provides a kit comprising components to carry out the methods of the invention.
  • the kit may comprise at least one megaprimer comprising double-stranded nucleic acid of at least 100 bp in length, optionally at least 120 bp, at least 140 bp, at least 160 bp, at least 180 bp, at least 200 bp, at least 220 bp, at least 240 bp, at least 260 bp, at least 280 bp, or at least 300 bp in length.
  • the maximum length of a megaprimer is about 1500 bp.
  • the kit may comprise labelled chain terminators, optionally labelled with different fluorescent dyes.
  • the kit may comprise non- terminating dNTPs or NTPs, an enzyme or enzymes for nucleic acid polymerisation, appropriate buffer solutions, purified water and/or instructions for carrying out any one of the methods of the invention.
  • DNA samples were prepared from whole blood using a standard salting out method 15 .
  • PCR was performed in 30 ⁇ l reactions consisting of 100 ng of the genomic DNA template; 1 X PhusionTM HF Buffer (Finnzymes, Espoo, Finland); 0.8 mM dNTPs (Bioline, London, UK); each oligonucleotide primer (Sigma- Genosys, Cambridge, UK) at 500 ⁇ M in combination according to table Ia (NOD2/CARD15 simultaneous sequencing) and Ib (bidirectional STAMMER sequencing); MgC12 (Bioline, London, UK) at concentrations according to table 1 and finally 0.4 U Phusion High-Fidelity DNA Polymerase (Finnzymes, Espoo, Finland).
  • PCR cycling was carried out in a G-STORM GSX thermal cycler (GRI, Braintree, UK) and conditions were 34 cycles of (95°C: O'lO", 63°C: O' lO", 72°C: 1 OO") for the NOD2/CARD15 experiments.
  • PCR conditions for the bidirectional experiments were 30 cycles of (95°C: O'lO", 67°C: O'lO", 72°C: TOO").
  • the products were visualized on a 2% agarose gel then purified using the Wizard SV gel and PCR cleanup system (Promega, WI, USA).
  • STAMMER sequencing reactions consisted of 2.5 ⁇ l Sequencing Buffer (ABI, Foster City, Ca, USA) and roughly 10 ng of each megaprimer and short template amplicon, made up to 28.25 ⁇ l with H2O. The final molar ratios of the
  • NOD2/CARD15 megaprimers to their respective short template amplicon were around 2.22 (Exon 4), 1.88 (Exon 8) and 2.98 (Exon 11); the bidirectional sequencing megaprimer was used at a 4:1 ratio against the short template amplicon.
  • Synthesis of the 'roadblock' oligonucleotide included the addition of a chain-terminating 3' propyl group (Sigma-Genosys, Cambridge, UK).
  • the 'roadblock' oligonucleotide was included in the bidirectional sequencing reaction at 500 nM whilst the oligonucleotide primer was included at just 16 nM.
  • Sequences were detected on a 3730XL 96 capillary DNA analyser (ABI, Foster City, Ca, USA). EDTA was added to a final concentration of 6 mM and samples were then precipitated with 70 ⁇ l absolute propan-2-ol before recovery by centrifugation. Pellets were washed in 100 ⁇ l 70% propan-2-ol and were finally resuspended in 10 ⁇ l Hi-Di Formamide (ABI, Foster City, Ca., USA). Sample injection was for fifteen seconds at 1.5 kV. Electrophoresis was carried out at 6O 0 C for 80 minutes. Sequences were analysed using Sequencing analysis v5.2 and SeqScape v2.5. Basecalling and assignment of Phred (Q20) quality values 1 ' 7 was performed using the 'KB' basecaller (ABI, Foster City, Ca, USA).
  • NOD2/CARD15 gene as polymorphisms within these regions had been shown to be informative with regards to disease associations 12 .
  • the start and end of the megaprimer enabled sequences were marked by strong peaks relating to the tendency of the DNA polymerase enzyme to add a single and non template dependent adenosine residue to the 3' termini of the megaprimer and Short Template Amplicon homoduplexes. These artifactual peaks are somewhat useful in providing a means to delimit and thereby distinguish the start and end of each sequence in the multiplex; but megaprimer strands so altered at their 3' ends are incapable of efficiently and specifically hybridising to the megaprimer target region and this impacts greatly upon the strength and quality of the derived sequences. Such addition also depletes the adenosine content of the reaction and may lead to truncation of sequences.
  • Another approach to reducing or eliminating the 'A' peaks caused by non- template dependent addition of adenosine to the 3 '-end of the megaprimer and amplicon homoduplexes during the STAMMER technique is to use a non- proofreading enzyme during PCR preparation of the megaprimer(s) or Short Template Amplicons. This ensures that most megaprimer/Short Template Amplicon products are already adenylated at the 3' end and so little or no ddATP is added during STAMMER sequencing and the 'A' peaks are thus reduced. With time the megaprimers may lose the terminal A and therefore the peaks may return, because the ssDNA end is less stable than dsDNA.
  • the A bases can be restored prior to STAMMER sequencing. It would be preferable to use megaprimers where the addition of the extra A has no effect on the binding footprint of the 3' end of the megaprimer (i.e. where the base immediately downstream of the megaprimer target region is a thymine residue); in this way specificity is not lost.
  • the addition of the 3' A residue should, however, interfere with the megaprimer forward strand's ability to anneal to its own reverse strand, thereby reducing self-annealing that would otherwise limit the productivity of the reaction.
  • exonuclease 1 and shrimp alkaline phosphatase EXO-SAP
  • EXO-SAP shrimp alkaline phosphatase
  • Exonuclease removes the single stranded portion of the molecule (along with excess amplification primers) and the alkaline phosphatase dephosphorylates it and so prevents any form of polymerase/endonuclease from acting upon it.
  • the megaprimer cannot be treated with Exo-SAP, as it is essential that the megaprimer can be subject to polymerase activity.
  • sequencing technology such as plasmid screening and sequence-based typing 14 require the sequencing of both the anti-sense and sense strands of a double-stranded nucleic acid template region of interest. This has historically meant that two sequencing reactions have been necessarily performed for the detection of bidirectional sequence in any one region of interest.
  • STAMMER sequencing principle we demonstrate how the basic method described previously can be adapted to perform the sequencing of both strands of a single short template amplicon in a single relay sequencing reaction.
  • the bidirectional STAMMER approach depends on the fact that the DNA polymerase enzyme used in the sequencing reaction lacks a 5 '-3' exonuclease activity and thus the primer extension reaction can be halted by the obstacle of the 5' end of a "roadblock", a hybridised nucleic acid strand in the path of the enzyme.
  • An oligonucleotide was used in the manner of a conventional reaction to prime the sequencing of the antisense strand of a 475 bp short template amplicon constructed according to the primers described in table Ib.
  • a 231 bp megaprimer simultaneously enabled the sequencing of a 224 bp region of the sense strand from the same short template amplicon.
  • the oligonucleotide-enabled component of the sequencing reaction would progress across the entire short template amplicon and would still be detectable when the megaprimer-enabled component later became detectable. To prevent this, it was necessary to halt the oligonucleotide-enabled reaction before the end of the template was reached.
  • a previous method of bidirectional sequencing introduced abasic regions to the template in order to halt the sequencing reaction in such a way 6 , but this was not tenable in the STAMMER system. Instead, we simply placed a second oligonucleotide, a "roadblock", on the sense strand 179 bp downstream of 3' end of the sequence enabling oligonucleotide primer and 2bp upstream of the 5' end of the megaprimer target region ( Figure 5A). The 3' end of this roadblock was modified with a chain-terminating propyl group (C3 spacer) and therefore could not itself act as a sequence enabling primer.
  • C3 spacer chain-terminating propyl group
  • the 5' end meanwhile presented an obstacle aimed at halting the progression of the 5' to 3' endonuclease activity of the polymerase enzyme and thus limited the size of the oligonucleotide primer enabled sequencing products.
  • the use of the roadblock oligonucleotide at a high molar ratio with respect to both the short template amplicon and the oligonucleotide primer ensured that essentially all free template sense strands were hybridised to a roadblock molecule capable of halting the oligonucleotide primer extension reaction.
  • Figure 5C is the base called electrophoretic data output of a complete STAMMER Bidirectional sequencing reaction. Analysis of the products from this reaction showed that we were able to detect two discrete sequences in relay. Further scrutiny of these sequences confirmed that we had sequenced in relay both strands of a double-stranded template region of the KIR2DL4 gene and that the two sequences were complementary to one another across a 167 bp expanse of the region of interest. The average Phred quality value across the bidirectional sequence was 39.4; most nucleotide positions were identified by high quality base-calls (Q>20) in both directions and all positions had such in at least one direction.
  • NOD2/CARD15 gene are associated with an increased risk of relapse and death for patients with acute leukemia after hematopoietic stem-cell transplantation with unrelated donors. J Clin Oncol 25, 4262-9 (2007).

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Microbiology (AREA)
  • Immunology (AREA)
  • Biotechnology (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Analytical Chemistry (AREA)
  • Physics & Mathematics (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The invention provides a method and a kit for the simultaneous sequencing of multiple nucleic acid sequences. The method is called "Short Template Amplicon and Multiplex Megaprimer Enabled Relay" (STAMMER) sequencing. The method provides a platform that is superior to pre-existing methods for simultaneous sequencing, including the bidirectional sequencing of double- stranded nucleic acid templates. In its broadest aspect, the invention provides a method for simultaneously sequencing a plurality of nucleic acid target sites, comprising (a) annealing each of a plurality of different-sized primers to the 5 '- end of each of said plurality of nucleic acid target sites; (b) performing a sequencing reaction by simultaneously extending said primers in the presence of labelled terminators and thereby obtaining a pool of extended primers in a single reaction vessel; and (c) determining the sequences of the nucleic acid target sites by analysing said pool of extended primers without separating the pool prior to analysis; wherein the largest extension product of each primer is smaller in length than the next largest, non-extended primer, and wherein at least one of the primers is a megaprimer comprising double-stranded nucleic acid of at least 100 base pairs in length.

Description

SIMULTANEOUS SEQUENCING OF MULTIPLE NUCLEIC ACID SEQUENCES
FIELD OF THE INVENTION
The invention relates to the simultaneous sequencing of multiple nucleic acid sequences.
BACKGROUND OF THE INVENTION
Simultaneous sequencing is the process of performing nucleic acid sequencing reactions in such a way that the sequences of a plurality of different templates are detected in multiplex, requiring the use of only a single reaction vessel and sequencing capillary. The term 'unidirectional sequencing' refers herein to the sequencing of a single strand of a polynucleotide template and may be performed on either the sense or anti-sense strand of a template that is either single or double-stranded. The term 'bidirectional sequencing' refers meanwhile to the sequencing of complementary sense and anti-sense strands of a double-stranded nucleic acid template.
The 'Sanger' method of chain-terminator sequencing1 is the industry standard for nucleic acid sequence analysis, and a well-optimised sequencing reaction is able to produce around 1500 bases of continuous sequence data. Many sequencing requirements cannot take full advantage of this potential because the sequence of interest may be much shorter than the maximum possible and because it is often necessary to sequence multiple regions of the genome that are not in linkage disequilibrium. When this is the case, it has traditionally been necessary to perform numerous separate sequencing reactions, leading to a hyperbolic increase in the per sample cost of sequencing and a proportional decrease in sample throughput. There have been several previous attempts to perform 'simultaneous sequencing' wherein the sequencing of several regions of interest is performed in a single sequencing reaction and then detected 'on-line' in a single capillary of the DNA analysis platform. These methods have been limited in their application by their various dependencies upon the use of non-classical fluorochromes2'3, upon the physical segregation of the various sets of sequencing products before electrophoresis and detection in separate capillaries4 or upon the use of modified sequence analysis platforms5.
Superior to these approaches was 'SimulSeq', a technique that allowed simultaneous sequence detection from several template Polymerase Chain Reaction (PCR) products but that required the use of only 'off-the-shelf sequencing reagents and analysis platforms6. This technique (referring to WO/2003/056030) was strictly limited to the simultaneous detection of only a very small (~ 150 bp) total number of bases; allowing for say three multiplexed templates of 50 bp or of five multiplexed templates of 30 bp, or ten multiplexed templates of 15 bp and so on. SimulSeq could also be used to perform simultaneous bi-directional sequencing, where both strands of an individual double-stranded template were detected in multiplex; but this was similarly limited to templates of length shorter than 150 bp. The limitations of SimulSeq are due to its reliance on commercially synthesised oligonucleotide primers and the fact that these cannot be prepared at lengths exceeding the 150 bp - 200 bp limit of current oligonucleic acid synthesis techniques. Until now, no simultaneous sequencing or bidirectional sequencing method has allowed a truly flexible approach that can be applied to any group of templates where there would be a practical benefit to performing this type of sequencing assay and where the combined lengths may stretch into the hundreds or early thousands of bases.
Automated 'Sanger' sequencing reactions are enabled by the extension of oligonucleotide primers7 and these reactions produce nested, chain-terminated1 and fluorescently labelled8 sequence fragments of every possible length greater than that of the oligonucleotide primer and lesser than or equal to the length of the template nucleic acid. The earliest fluorescence signal detected by the DNA analysis system during electrophoresis is from those fragments where the first primer extension step incorporates a chain terminator base. Increasing the length of the oligonucleotide primer can delay the onset of this first signal because the rate of the electrophoretic migration of the sequence fragments is proportional to their masses. Molecular typing assays based on single nucleotide extensions have taken advantage of this behaviour to great effect by using oligonucleotides of different sizes to differentially delay the electrophoresis and detection of sometimes several dozen single nucleotide primer extension products relating to known polymorphisms in a multiplex reaction9"1 ' .
SUMMARY OF THE INVENTION
The invention provides a method and a kit for the simultaneous sequencing of multiple nucleic acid sequences. The method is called "Short Template Amplicon and Multiplex Megaprimer Enabled Relay" (STAMMER) sequencing. The method provides a platform that is superior to pre-existing methods for simultaneous sequencing, including the bidirectional sequencing of double- stranded nucleic acid templates.
In its broadest aspect, the invention provides a method for simultaneously sequencing a plurality of nucleic acid target sites, comprising (a) annealing each of a plurality of different-sized primers to the 5 '-end of each of said plurality of nucleic acid target sites; (b) performing a sequencing reaction by simultaneously extending said primers in the presence of labelled terminators and thereby obtaining a pool of extended primers in a single reaction vessel; and (c) determining the sequences of the nucleic acid target sites by analysing said pool of extended primers without separating the pool prior to analysis; wherein the largest extension product of each primer is smaller in length than the next largest, non-extended primer, and wherein at least one of the primers is a megaprimer comprising double-stranded nucleic acid of at least 100 base pairs in length. The invention also provides a kit for performing the method of the invention, wherein the kit comprises (a) at least one megaprimer comprising double-stranded nucleic acid of at least 100 base pairs in length and (b) labelled chain terminators.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 shows how the electrophoretic migration of labelled nucleic acid sequencing fragments can be slowed in order to delay the onset of detectable signal until after a previous sequence has fully migrated past the optical detector. (1) is the 130 bp Megaprimer Target Region of a Short Template Amplicon of 260 bp of which (2) is the region of interest. (3) is the 276 bp Megaprimer Target Region of a second Short Template Amplicon of 410 bp of which (4) is the region of interest. (5) is the forward strand of a 130 bp megaprimer specific to (1) and (5b) is a silent period during which no sequence detection occurs during electrophoresis. The duration of (5b) is proportional to the molecular weight of the megaprimer (5). (6) shows how megaprimer (5) is extended from the 3' end during the sequencing reaction and (6b) is the detectable sequence relating to this. (7) is the reverse strand of megaprimer (5). Unlike (5), this strand cannot be extended during the sequencing reaction, as there is no downstream template. (8) shows the forward strand of a 276 bp megaprimer specific to (3) and (8b) is the silent period of electrophoresis, during which no sequencing products relating to the extension of the megaprimer (8) are detected. (9) shows how megaprimer (8) is extended from the 3' end during the sequencing reaction and (9b) is the detectable sequence relating to this. (10) is the reverse strand of megaprimer (8). Unlike (5) and (8), this strand cannot be extended during the sequencing reaction as there is no downstream template. (11) is a period of silence wherein all products of (6) have passed the detector but where the products of (9) have yet to be detectable. The lower part of Figure 1 shows how the electrophoresis chromatogram would be expected to look in the results of a STAMMER sequencing reaction based upon the model in the upper part of Figure 1. Figure 2 A is the electrophoresis chromatogram of the short template amplicon (1- 2) sequence derived from the forward strand (5) of the dsDNA megaprimer (5 and 7) and corresponds to the sequence described in Figure 1 as (6b). Figure 2B is the electrophoresis chromatogram of the short template amplicon (3-4) sequence derived from the forward strand (8) of the dsDNA megaprimer (8 and 10) and corresponds to the sequence described in Figure 1 as (9b). Figure 2C is the electrophoresis chromatogram of a STAMMER multiplex reaction in which both of the sequencing reactions described in Figures 2 A and 2B were carried out in multiplex according to the model described in Figure 1. Figure 2D shows a portion of the fully analysed and base-called nucleotide sequence derived from the electrophoresis chromatogram pictured in 2C.
Sequence 6b is the NOD2/CARD15 exon 11 short template amplicon and sequence 9b is the NOD2/CARD15 exon 4 short template amplicon. They were prepared according to the primers described in table 1 and amplification and sequencing parameters were identical to those used with regards to the experiments described in figures 3 and 4 (see Example 1). Exon 11 (4A) and Exon 4 (4B) were detectable in the periods equivalent to size ranges 131-260 bp and 277-410 bp respectively. The quality data for the sequence shown in 9c is labelled Q and is prepared using the Phred algorithm according to Ewing, B. & Green, P., Base-calling of automated sequencer traces using Phred. II. Error probabilities. Genome Res 8, 186-94 (1998) and Ewing, B., Hillier, L., Wendl, M.C. & Green, P. Base-calling of automated sequencer traces using Phred. I. Accuracy assessment. Genome Res 8, 175-85 (1998).
Figure 3 shows the STAMMER sequencing principle as applied to the NOD2/CARD15 gene. Figure 3A shows the assay design using three pairs of Short Template Amplicons (STA), each of which is composed of a region of interest (ROI) and a megaprimer target region (MTR) which is complementary to one of the three corresponding megaprimers (MGP). The regions of interest are spaced some 18 kbp apart in total. Figure 3B shows how i) priming of the MTR and extension of the megaprimer forward strand produces ii) nested and chain terminated sequence fragments complementary to the short template amplicon and how iii) there is no potential for extension of the megaprimer reverse strand due to a lack of a suitable extensible template. Figure 3C shows how the detection of fluorescence signals from the three sets of nested megaprimer extension fragments would be predicted to occur on-line, in the manner of a relay.
Figure 4. In individual sequencing reactions, megaprimers were used to delay the detection of fluorescence signals relating to the three Short Template Amplicons of the NOD2/CARD15 gene. Signals relating to Exon 11 (4A) were detectable in the scanning time period equivalent to fragments in the size range of 131-260 bp. Exon 4 (4B) and Exon 8 (4C) were detectable in the periods equivalent to size ranges 277-410 bp and 438-546 bp respectively. When the STAMMER multiplex relay sequencing reaction was performed using all three Short Template Amplicons (4D), sequences were detectable in relay and were interspersed with clear intervals of silence in which no fluorescently labelled fragments were detected. Each sequence was delimited by strong adenosine peaks that were artefacts of the non-template dependent addition of ddATP residues to the megaprimer and short template amplicon homopolymers. Guides to fluorescence intensity scale have been digitally removed from these images for reasons of clarity.
Figure 5 shows a schematic of STAMMER applied to the bidirectional sequencing of a single short template amplicon. Figure 5A shows how the anti- sense strand of a megaprimer (MGP-AS) and a sense oligonucleotide primer (OP) respectively hybridise to the sense and anti-sense strands of a short template amplicon. The two primers extend during the cycle sequencing reaction in the direction of the solid arrows. The sense strand of the megaprimer (MGP-S) does not extend due to a lack of suitable template. A 3 '-propyl-oligonucleotide 'roadblock' oligonucleotide (RB) hybridises upstream of the megaprimer sense strand. Figure 5B shows how a multiplex bidirectional STAMMER sequencing reaction can be performed according to this model. The megaprimer enabled sequence covers the region of interest and some irrelevant flanking sequences whilst the oligonucleotide primer enabled sequence covers the complementary strand over the region of interest. Figure 5C is the base called electropherographic data output of a complete STAMMER bi-directional sequencing reaction. Sequence quality is demonstrated by the topographical detail where vertical bars above the dashed horizontal line labelled Q20, represent base calls assigned with more than 99% accuracy according to Ewing et al. Guides to the actual scale of fluorescence intensity have been digitally removed from these images for reasons of clarity.
Figure 6 shows the effects of a roadblock on the sequencing reaction. The raw chromatographic data in 6A shows the effect of a roadblock oligonucleotide on the progression of the otherwise unimpeded sequencing reaction shown in 6B. The arrow marks the position where the sequencing enzyme is halted by the obstacle of the 5' end of the roadblock molecule.
DETAILED DESCRIPTION OF THE INVENTION
STAMMER differs from the SimulSeq method partly in that it is based upon the use of one or more double-stranded polynucleotide 'megaprimers', which are generally PCR products and are used in the enabling of the sequencing reaction in place of classical oligonucleotide primers. Megaprimers of any length amenable to PCR can be constructed, thus relieving the constraints on template length imposed on 'SimulSeq' by the limitations of commercial oligonucleic acid synthesis technology. Megaprimers of the invention are at least 100 bp in length, optionally at least 120 bp, at least 140 bp, at least 160 bp, at least 180 bp, at least 200 bp, at least 220 bp, at least 240 bp, at least 260 bp, at least 280 bp, or at least 300 bp in length. For use with conventional sequence analysis platforms, the maximum length of a megaprimer is about 1500 bp but the length of the largest megaprimer is typically less than this; for example the length of the largest megaprimer is typically up to 1000 bp, 700 bp or 500 bp. The lengths of the megaprimers can be tailored so that the electrophoretic migration rate (and therefore the detection of primer extension products relating to the megaprimer enabled sequences) may be delayed by the equivalent of tens, hundreds or potentially thousands of bases (see Figures 1 and 2). This allows an opportunity for one or more complete oligonucleotide and/or megaprimer enabled sequence(s) of any given length shorter than the subsequent megaprimer to migrate past the point of signal detection before the onset of detectable signal from the subsequent megaprimer enabled sequence(s).
Megaprimers can also be prepared by a number of non-PCR methods, such as restriction digestion with blunt end cutting restriction endonucleases, followed by purification of the desired fragment. Similarly, restriction endonuclease digestion with enzymes producing sticky ends, followed by degradation of the sticky ends to blunt ends using appropriate enzymes (generally possessing exonuclease activity), could be used.
Megaprimers could also be prepared using non-PCR based methods of DNA amplification. These include, though are not exclusively limited to, isothermal DNA amplification methods including rolling circle amplification (RCA), real time strand displacement amplification (SDA) and multiple displacement amplification (MDA). These methods, and RCA in particular, might be used in combination with restriction endonuclease digestion.
The STAMMER sequencing technique is dependent not only upon the ability to delay the onset of sequence detection but also upon limiting the lengths of template that cause the individual primer/megaprimer extension reactions to terminate once the region of interest has been passed; thereby providing a
'silence' into which subsequent megaprimer enabled sequences may initiate. This is achieved by using site-specific quenchers of the sequencing reaction (i.e. roadblock primers) or Short Template Amplicons as templates; the latter are PCR products comprising only the informative region of interest and a 'megaprimer target region', an expanse of sequence flanking the region of interest that is equal in length and complementary to the enabling megaprimer. Each strand of the double-stranded megaprimer is capable of hybridising to its complementary strand in the megaprimer target region, but there is no downstream template for the extension of the second megaprimer strand and the detectable sequence therefore represents only megaprimer extension across the region of interest. Megaprimers may enable the sequencing of template strands in either the sense or anti-sense context.
Short Template Amplicon and Multiplex Megaprimer Enabled Relay (STAMMER) sequencing allows simultaneous 'relay' sequencing of practically any combination of nucleic acid templates. Previous systems such as SimulSeq were limited to very short combined template lengths and were only practically applicable in molecular typing assays or in the bidirectional sequencing of very short regions of interest6. The STAMMER approach to simultaneous sequencing is applicable to a far wider range of templates than has been previously possible and we believe that it could impact massively upon the costs and throughput of many routine and diagnostic re-sequencing applications. The technique will be of particular interest to those laboratories (especially in geographically isolated facilities, in developing and restructuring nations) where access to funds, labour and reagents are limiting and to those where sample numbers are very high and significant amounts of nucleic acid re-sequencing are necessary. We believe that the potential to combine the STAMMER method with the use of 'roadblock' oligonucleotides provides a superior and flexible means to perform relay sequencing on nucleic acid templates of almost any type and, most crucially, of almost any length. STAMMER represents a true sequencing assay rather than a means to molecular typing and, as it requires only 'off-the-shelf sequencing reagents, analysis platforms and software, it is very easily implemented in laboratories already performing nucleic acid sequencing. We have shown that STAMMER sequencing can be used to perform not only full sequencing and consequently molecular typing of known polymorphic positions but also the bidirectional sequencing of individual double-stranded templates. The use of vector specific STAMMER sequencing systems utilising multiple 'roadblock' oligonucleotides will allow simultaneous bidirectional sequencing of circular templates; thus facilitating double-stranded plasmid screening at half the cost and twice the throughput of normal sequencing. It is also possible to design STAMMER sequencing systems whereby several Short Template Amplicons are bidirectionally sequenced in a single assay.
The 'roadblock' molecule of the invention is a single-stranded nucleic acid that provides a physical barrier to the progression of a nucleic acid polymerase. The 3 '-end of a roadblock molecule may be modified to prevent the extension of this end. This modification may involve chemical alteration of the 3 '-end, such as the attachment of an alkyl group (e.g. Cl to Cl 8 alkyl, preferably C2 to C6 alkyl, e.g. propyl). Different functional groups such as amino, thiol, or similar, or any group to which one might conjugate a terminating molecule, could also be attached to the 3 '-end of the roadblock to prevent its extension. In addition, the last base at the 3 '-end of the roadblock molecule might be a dideoxy base, or might otherwise lack potential to be extended as a result of similar modification of the nucleotide triphosphate structure itself or of the use of a non-classical or non-naturally occurring nucleotide. Further, the 3 '-end of the roadblock may be abasic.
We recommend that the maximum combined length of all sequences in the STAMMER sequencing multiplex does not exceed 1500 bp as this allows operation of the sequence analysis platform within the parameters of conventional sequencing. Assuming that only such conventional sequence analysis platforms are available, we estimate that the maximum number of primers (to include megaprimers) that could be used in a single STAMMER sequencing experiment (and hence the maximum number of individual targets that could be sequenced) to be about 75. This estimate of 75 is based on the assumptions that a) the smallest primer used is 20 bp in length, b) 10 bp of sequence are obtained for each target, and c) a 10 bp 'silence' is used between each primer and the fully extended product of the next smallest primer. However, smaller numbers of primers are more typical; for example the number of primers may be up to 2, 3, 5, 7, 10, 15, 20, 25 or 40. For example, 20 primers could be used to provide 20 sequences of 75 bases in relay. An advantage of the invention is that STAMMER is the only technique that could utilise a future increase in the maximum size of sequencing product resolved by a sequencing capillary (currently about 1500 bp). In principle, the total number of bases sequenced and the total number of primers used in a STAMMER multiplex is unlimited, provided that the technology exists to resolve the full range of extension products derived from the multiplex.
Preferably, the molar ratio of each megaprimer to target site is 1 :20 to 20:1, more preferably 1 :10 to 10:1, more preferably 1 :5 to 10:1, more preferably 1 :3 to 3:1, most preferably about 5:1.
A further embodiment of the invention involves the removal of the non-priming strand of at least one megaprimer prior to the sequencing reaction. The benefit of doing this is that one might, by removing the non-priming strand, allow the sequencing reaction to behave more like a conventional sequencing reaction in which a very high molar ratio between the oligonucleotide primer and amplicon is used. This increases the signal strength (which may be coupled to an increase in the sequence quality).
In a particular embodiment, a single strand of the megaprimer is removed prior to performing the STAMMER reaction. The simplest way of removing a single strand of a megaprimer(s) would be the application of linear-after-the-exponential PCR (LATE-PCR) according to Sanchez, J.A., Pierce, K.E., Rice, J.E. & Wangh, L.J. Linear-after-the-exponential (LATE)-PCR: an advanced method of asymmetric PCR and its uses in quantitative real-time analysis. Proc Natl Acad Sci USA 101, 1933-8 (2004). This process of asymmetric PCR uses an excess of forward strand primer to ensure that in the later stages of PCR, the reverse primer is entirely used up and a phase of linear amplification takes place, leading to an excess of single stranded products, which may be used as primers. This may be carried out in a single combined LATE-PCR amplification in one vessel. The short template amplicons, dideoxynucleotides and/or roadblock oligonucleotide(s) required to perform the STAMMER reaction could then be added after a defined number of cycles of such a linear amplification. Another approach might be to selectively deplete a double stranded PCR product of the second strand. This might be carried out using a biotinylated reverse primer. The PCR product that is biotinylated in this way would be bound to a streptavidin paramagnetic bead or solid substrate, then subjected to conditions sufficient to denature the double helix and release the forward strand into solution which may be separated by magnetism or otherwise by simple elution. The use of paramagnetic methods might allow one to perform the second strand depletion in the same reaction vessel used for the STAMMER sequencing reaction. This might require the application of chemical denaturants, followed by the magnetic separation of the bead-bound reverse strands. This would be followed immediately by the sequencing reaction.
Single stranded megaprimers might also be constructed using two or more synthetic oligonucleotides representing a continuous sequence, which had been specifically bound to a third, possibly immobilised or bead bound scaffolding oligonucleotide. The action of DNA ligase enzyme could then be used to ligate the two synthetic oligonucleotides end to end, producing a much longer megaprimer, which could then be eluted from the scaffold.
The pool of extended primers produced by the sequencing reaction of the invention can be analysed by separating them according to their size. This separation can be achieved by electrophoresis, such as capillary electrophoresis, or by using a non-capillary based electrophoresis sequencing platform (such as a slab gel ABI 377 machine). The separation of extended primer products from a plurality of sequencing reactions can be performed in a single capillary. The chain terminators of the sequencing reactions, dideoxynucleotides (ddATP, ddGTP, ddCTP, or ddTTP), are labelled so that each extended primer product can be detected. For instance, each of the four dideoxynucleotide chain terminators can be labelled with a different fluorescent dye. Non-fluorescent chemical dyes (chromophores), substrates of enzymatic reactions, proteins or radioisotopes, or any other group to which one can attach any of the same, may also be used as labels, provided that one different label is used for each base. The nucleic acids of the invention, including sequencing reaction templates, oligonucleotide primers, and megaprimers, can comprise deoxyribonucleic acid, ribonucleic acid, or a copolymer of deoxyribonucleic acid and ribonucleic acid. The nucleic acids of the invention can be natural (i.e. sourced from any organism) or synthetic, and can be made in vitro or in vivo, enzymatically or non- enzymatically. The template nucleic acid can comprise non-natural nucleotide analogues such as deoxyinosine or 7-deaza-2-deoxyguanosine. These analogues destabilize DNA duplexes and could allow a primer annealing and extension reaction to occur in a double-stranded sample without completely separating the strands.
The sequencing reaction of the invention may be carried out with a number of different nucleic acid polymerase enzymes, such as DNA polymerase, RNA polymerase, reverse transcriptase or RNA replicase, depending on the composition of the template nucleic acid.
The invention also provides a kit comprising components to carry out the methods of the invention. The kit may comprise at least one megaprimer comprising double-stranded nucleic acid of at least 100 bp in length, optionally at least 120 bp, at least 140 bp, at least 160 bp, at least 180 bp, at least 200 bp, at least 220 bp, at least 240 bp, at least 260 bp, at least 280 bp, or at least 300 bp in length. For use with conventional sequence analysis platforms, the maximum length of a megaprimer is about 1500 bp. The kit may comprise labelled chain terminators, optionally labelled with different fluorescent dyes. The kit may comprise non- terminating dNTPs or NTPs, an enzyme or enzymes for nucleic acid polymerisation, appropriate buffer solutions, purified water and/or instructions for carrying out any one of the methods of the invention.
Methods
DNA samples were prepared from whole blood using a standard salting out method15. PCR was performed in 30 μl reactions consisting of 100 ng of the genomic DNA template; 1 X Phusion™ HF Buffer (Finnzymes, Espoo, Finland); 0.8 mM dNTPs (Bioline, London, UK); each oligonucleotide primer (Sigma- Genosys, Cambridge, UK) at 500 μM in combination according to table Ia (NOD2/CARD15 simultaneous sequencing) and Ib (bidirectional STAMMER sequencing); MgC12 (Bioline, London, UK) at concentrations according to table 1 and finally 0.4 U Phusion High-Fidelity DNA Polymerase (Finnzymes, Espoo, Finland). PCR cycling was carried out in a G-STORM GSX thermal cycler (GRI, Braintree, UK) and conditions were 34 cycles of (95°C: O'lO", 63°C: O' lO", 72°C: 1 OO") for the NOD2/CARD15 experiments. PCR conditions for the bidirectional experiments were 30 cycles of (95°C: O'lO", 67°C: O'lO", 72°C: TOO"). The products were visualized on a 2% agarose gel then purified using the Wizard SV gel and PCR cleanup system (Promega, WI, USA).
Short Template Amplicons and Megaprimers were roughly quantified using a Nanodrop NDlOOO Spectrophotometer (Wilmington, DE, USA) according to the Beer-Lambert equation using measurements of absorbance at 260nm and 280nm with an assumed average extinction coefficient of 50 ng-cm μl-1.
STAMMER sequencing reactions consisted of 2.5 μl Sequencing Buffer (ABI, Foster City, Ca, USA) and roughly 10 ng of each megaprimer and short template amplicon, made up to 28.25 μl with H2O. The final molar ratios of the
NOD2/CARD15 megaprimers to their respective short template amplicon were around 2.22 (Exon 4), 1.88 (Exon 8) and 2.98 (Exon 11); the bidirectional sequencing megaprimer was used at a 4:1 ratio against the short template amplicon. Synthesis of the 'roadblock' oligonucleotide included the addition of a chain-terminating 3' propyl group (Sigma-Genosys, Cambridge, UK).
The 'roadblock' oligonucleotide was included in the bidirectional sequencing reaction at 500 nM whilst the oligonucleotide primer was included at just 16 nM.
In a G-STORM GSX thermal cycler (GRI, Braintree, UK), cycle sequencing reactions were subjected to 95°C for TOO" and were then incubated at 85°C. Hot- start was carried out by the manual addition to each reaction of 1.75 μl of a mixture consisting of 1 μl H2O and 1 μl BigDye Terminators v3.0 Dye Mix (ABI, Foster City, Ca, USA). Cycle sequencing was then performed with 25 cycles of 98°C: 0'10" and 60°C: 0'30".
Sequences were detected on a 3730XL 96 capillary DNA analyser (ABI, Foster City, Ca, USA). EDTA was added to a final concentration of 6 mM and samples were then precipitated with 70 μl absolute propan-2-ol before recovery by centrifugation. Pellets were washed in 100 μl 70% propan-2-ol and were finally resuspended in 10 μl Hi-Di Formamide (ABI, Foster City, Ca., USA). Sample injection was for fifteen seconds at 1.5 kV. Electrophoresis was carried out at 6O0C for 80 minutes. Sequences were analysed using Sequencing analysis v5.2 and SeqScape v2.5. Basecalling and assignment of Phred (Q20) quality values1 ' 7 was performed using the 'KB' basecaller (ABI, Foster City, Ca, USA).
Table 1
Figure imgf000017_0001
Figure imgf000017_0002
EXAMPLES
Example 1 - STAMMER sequencing of three unlinked Short Template Amplicons
We selected three regions of interest from Exons 11, 4 and 8 of the
NOD2/CARD15 gene as polymorphisms within these regions had been shown to be informative with regards to disease associations12.
Using PCR, we generated the three corresponding Short Template Amplicons whose lengths were 260 bp, 410 bp and 546 bp respectively. The megaprimers targeting each of the Short Template Amplicons for Exons 11, 4 and 8 were PCR products of respectively 130 bp, 276 bp and 437 bp (Figure 3). To demonstrate that double-stranded megaprimers could efficiently prime the sequencing reactions and suitably delay their detection, we primed each short template amplicon with its corresponding megaprimer in a separate reaction. We maintained relatively even molar ratios between the two moieties in order to maximise the number of productive megaprimer to template hybridisations during the cycle sequencing reaction. The data produced demonstrated that the detection of the fluorescence signals could be delayed to an extent that was proportional to the size of the megaprimer (Figure 4A-C). Signals relating to Exon 11 (Figure 4A) were detectable in the scanning time period equivalent to fragments in the size range of 131-260 bp. Exon 4 (Figure 4B) and Exon 8 (Figure 4C) were detectable in the periods equivalent to size ranges of 277-410 bp and 438-546 bp respectively.
Having demonstrated that megaprimers could prime sequencing reactions in the correct way, we went on to prepare a complete multiplex STAMMER sequencing reaction by mixing equal quantities of the three Short Template Amplicons with roughly equimolar amounts of the three megaprimers. All three sequencing reactions were then carried out simultaneously in a single reaction vessel. When the STAMMER multiplex relay sequencing reaction was performed using all three Short Template Amplicons, sequences were detectable in relay and were interspersed with clear intervals of silence in which no fluorescently labelled fragments were detected (Figure 4D). The products of this reaction, when electrophoresed in a single capillary of the sequence analysis platform, produced three discrete sequences 'on-line' in the ranges of 131-260 bp (Exon 11), 277-410 bp (Exon 4) and 438-546 bp (Exon 8). The combined length of 370 simultaneously sequenced bases far exceeds that possible with, and is therefore superior to, other simultaneous sequencing methods.
The start and end of the megaprimer enabled sequences were marked by strong peaks relating to the tendency of the DNA polymerase enzyme to add a single and non template dependent adenosine residue to the 3' termini of the megaprimer and Short Template Amplicon homoduplexes. These artifactual peaks are somewhat useful in providing a means to delimit and thereby distinguish the start and end of each sequence in the multiplex; but megaprimer strands so altered at their 3' ends are incapable of efficiently and specifically hybridising to the megaprimer target region and this impacts greatly upon the strength and quality of the derived sequences. Such addition also depletes the adenosine content of the reaction and may lead to truncation of sequences. To eliminate the addition of non-template dependent 3' adenosine residues during the STAMMER sequencing method, we made use of a proofreading enzyme. To reduce the extent to which such bases were incorporated during the initial ramp to denaturation temperature at the start of the cycle sequencing reaction, we applied a 'hot-start' wherein the sequencing enzyme was added to the reaction vessel only after the first round of denaturation and annealing had taken place . In an ideal implementation of the STAMMER sequencing method, we would prevent non-template dependent A addition through utilisation of a sequencing reaction mixture modified to include a proofreading enzymic activity or where this was not possible (i.e. during bidirectional sequencing) a compensating quantity of deoxy- and dideoxy-adenosine trinucleotides.
Another approach to reducing or eliminating the 'A' peaks caused by non- template dependent addition of adenosine to the 3 '-end of the megaprimer and amplicon homoduplexes during the STAMMER technique is to use a non- proofreading enzyme during PCR preparation of the megaprimer(s) or Short Template Amplicons. This ensures that most megaprimer/Short Template Amplicon products are already adenylated at the 3' end and so little or no ddATP is added during STAMMER sequencing and the 'A' peaks are thus reduced. With time the megaprimers may lose the terminal A and therefore the peaks may return, because the ssDNA end is less stable than dsDNA.
Through incubation of the megaprimer with DNA polymerase and dATP in a suitable buffer, the A bases can be restored prior to STAMMER sequencing. It would be preferable to use megaprimers where the addition of the extra A has no effect on the binding footprint of the 3' end of the megaprimer (i.e. where the base immediately downstream of the megaprimer target region is a thymine residue); in this way specificity is not lost. The addition of the 3' A residue should, however, interfere with the megaprimer forward strand's ability to anneal to its own reverse strand, thereby reducing self-annealing that would otherwise limit the productivity of the reaction.
The addition of 3' terminal adenosine residues to the short template amplicon can be eliminated by the application of exonuclease 1 and shrimp alkaline phosphatase (EXO-SAP) subsequent to the PCR amplification. Exonuclease removes the single stranded portion of the molecule (along with excess amplification primers) and the alkaline phosphatase dephosphorylates it and so prevents any form of polymerase/endonuclease from acting upon it. The megaprimer cannot be treated with Exo-SAP, as it is essential that the megaprimer can be subject to polymerase activity.
Example 2 - Bidirectional sequencing of a single short template amplicon using the STAMMER sequencing method
Many applications of sequencing technology such as plasmid screening and sequence-based typing14 require the sequencing of both the anti-sense and sense strands of a double-stranded nucleic acid template region of interest. This has historically meant that two sequencing reactions have been necessarily performed for the detection of bidirectional sequence in any one region of interest. As a further proof of the application of the STAMMER sequencing principle, we demonstrate how the basic method described previously can be adapted to perform the sequencing of both strands of a single short template amplicon in a single relay sequencing reaction. The bidirectional STAMMER approach (Figure 5A) depends on the fact that the DNA polymerase enzyme used in the sequencing reaction lacks a 5 '-3' exonuclease activity and thus the primer extension reaction can be halted by the obstacle of the 5' end of a "roadblock", a hybridised nucleic acid strand in the path of the enzyme.
We selected a single 170 bp region of interest in KIR2DL4, a highly complex and GC rich gene (Robinson, J., Waller, M.J., Stoehr, P. & Marsh, S.G.E. IPD-the Immuno Polymorphism Database. Nucleic Acids Res 33, D523-6 (2005)) that had previously been shown to be a troublesome sequencing template requiring bidirectional coverage to reconstruct continuous high quality sequence (Roberts, C.H., Madrigal, J.A. & Marsh, S. G. Cloning and sequencing alleles of the KIR2DL4 gene from genomic DNA samples. Tissue Antigens 69 Suppl 1, 88-91 (2007)). An oligonucleotide was used in the manner of a conventional reaction to prime the sequencing of the antisense strand of a 475 bp short template amplicon constructed according to the primers described in table Ib. A 231 bp megaprimer simultaneously enabled the sequencing of a 224 bp region of the sense strand from the same short template amplicon. Under normal circumstances, the oligonucleotide-enabled component of the sequencing reaction would progress across the entire short template amplicon and would still be detectable when the megaprimer-enabled component later became detectable. To prevent this, it was necessary to halt the oligonucleotide-enabled reaction before the end of the template was reached. A previous method of bidirectional sequencing introduced abasic regions to the template in order to halt the sequencing reaction in such a way6, but this was not tenable in the STAMMER system. Instead, we simply placed a second oligonucleotide, a "roadblock", on the sense strand 179 bp downstream of 3' end of the sequence enabling oligonucleotide primer and 2bp upstream of the 5' end of the megaprimer target region (Figure 5A). The 3' end of this roadblock was modified with a chain-terminating propyl group (C3 spacer) and therefore could not itself act as a sequence enabling primer. The 5' end meanwhile presented an obstacle aimed at halting the progression of the 5' to 3' endonuclease activity of the polymerase enzyme and thus limited the size of the oligonucleotide primer enabled sequencing products. The use of the roadblock oligonucleotide at a high molar ratio with respect to both the short template amplicon and the oligonucleotide primer ensured that essentially all free template sense strands were hybridised to a roadblock molecule capable of halting the oligonucleotide primer extension reaction.
We went on to perform cycle sequencing reactions incorporating the Short Template Amplicon, its corresponding megaprimer and both the oligonucleotide primer and the roadblock oligonucleotide. Figure 5C is the base called electrophoretic data output of a complete STAMMER Bidirectional sequencing reaction. Analysis of the products from this reaction showed that we were able to detect two discrete sequences in relay. Further scrutiny of these sequences confirmed that we had sequenced in relay both strands of a double-stranded template region of the KIR2DL4 gene and that the two sequences were complementary to one another across a 167 bp expanse of the region of interest. The average Phred quality value across the bidirectional sequence was 39.4; most nucleotide positions were identified by high quality base-calls (Q>20) in both directions and all positions had such in at least one direction.
REFERENCES
1. Sanger, F., Nicklen, S. & Coulson, A.R. DNA sequencing with chain- terminating inhibitors. Proc Natl Acad Sci U S A 74, 5463-7 (1977).
2. Wiemann, S. et al. Simultaneous on-line DNA sequencing on both strands with two fluorescent dyes. Anal Biochem 224, 117-21 (1995). 3. Wiemann, S. et al. "Doublex" fluorescent DNA sequencing: two independent sequences obtained simultaneously in one reaction with internal labeling and unlabeled primers. Anal Biochem 234, 166-74 (1996).
4. van den Boom, D., Jurinke, C, Ruppert, A. & Koster, H. Forward and reverse DNA sequencing in a single reaction. Anal Biochem 256, 127-9 (1998).
5. Yager, T.D. et al. High performance DNA sequencing, and the detection of mutations and polymorphisms, on the Clipper sequencer. Electrophoresis 20, 1280-300 (1999).
6. Murphy, K.M. & Eshleman, J.R. Simultaneous sequencing of multiple polymerase chain reaction products and combined polymerase chain reaction with cycle sequencing in single reactions. Am J Pathol 161, 27-33 (2002).
7. Padmanabhan, R. & Wu, R. Nucleotide sequence analysis of DNA. IX. Use of oligonucleotides of defined sequence as primers in DNA sequence analysis. Biochem Biophys Res Commun 48, 1295-302 (1972).
8. Smith, L. M. et al. Fluorescence detection in automated DNA sequence analysis. Nature 321, 674-9 (1986).
9. Kuppuswamy, M.N. et al. Single nucleotide primer extension to detect genetic diseases: experimental application to hemophilia B (factor IX) and cystic fibrosis genes. Proc Natl Acad Sci U S A 88, 1143-7 (1991).
10. Nyren, P., Pettersson, B. & Uhlen, M. Solid phase DNA minisequencing by an enzymatic luminometric inorganic pyrophosphate detection assay. Anal Biochem 208, 171-5 (1993). 11. Gilbert, M.T. et al. Multiplex PCR with minisequencing as an effective high-throughput SNP typing method for formalin-fixed tissue. Electrophoresis 28, 2361-7 (2007).
12. Mayor, N.P. et al. Single nucleotide polymorphisms in the
NOD2/CARD15 gene are associated with an increased risk of relapse and death for patients with acute leukemia after hematopoietic stem-cell transplantation with unrelated donors. J Clin Oncol 25, 4262-9 (2007).
13. Chou, Q., Russell, M., Birch, D.E., Raymond, J. & Bloch, W. Prevention of pre-PCR mis-priming and primer dimerization improves low-copy-number amplifications. Nucleic Acids Res 20, 1717-23 (1992).
14. Santamaria, P. et al. HLA class II "typing": direct sequencing of DRB, DQB, and DQA genes. Hum Immunol 33, 69-81 (1992).
15. Miller, S.A., Dykes, D. D. & Polesky, H. F. A simple salting out procedure for extracting DNA from human nucleated cells. Nucleic Acids Res 16, 1215 (1988).
16. Ewing, B. & Green, P. Base-calling of automated sequencer traces using phred. II. Error probabilities. Genome Res 8, 186-94 (1998).
17. Ewing, B., Hillier, L., Wendl, M. C. & Green, P. Base-calling of automated sequencer traces using phred. I. Accuracy assessment. Genome Res 8, 175-85
(1998).

Claims

1. A method for simultaneously sequencing a plurality of nucleic acid target sites, comprising: (a) annealing each of a plurality of different-sized primers to the 5'- end of each of said plurality of nucleic acid target sites; (b) performing a sequencing reaction by simultaneously extending said primers in the presence of labelled chain terminators and thereby obtaining a pool of extended primers in a single reaction vessel; and (c) determining the sequences of the nucleic acid target sites by analysing said pool of extended primers without separating the pool prior to analysis; wherein the largest extension product of each primer is smaller in length than the next largest, non-extended primer, and wherein at least one of the primers is a megaprimer comprising double- stranded nucleic acid of at least 100 base pairs in length.
2. The method of claim 1 wherein the length of extension product of at least one primer is limited by the length of template available for that primer.
3. The method of claim 1, wherein the length of extension product of at least one primer is limited by annealing a roadblock nucleic acid molecule to a site downstream of the primer.
4. The method of claim 3, wherein the roadblock nucleic acid molecule comprises a modification of its 3 '-end that prevents extension.
5. The method of claim 4, wherein said modification is attachment of an alkyl group.
6. The method of any one of the preceding claims, further comprising removing the non-priming strand of at least one megaprimer prior to the sequencing reaction.
7. The method of any one of the preceding claims, wherein the pool of extended primers is analysed by separating them according to their size.
8. The method of claim 7, wherein the separation is by electrophoresis.
9. The method of claim 8, wherein the electrophoresis is performed in a single capillary or in a single lane of a slab gel.
10. The method of any one of the preceding claims, wherein at least one megaprimer is produced by: (a) PCR;
(b) restriction digestion; or
(c) non-PCR DNA amplification, optionally rolling circle amplification (RCA), multiple displacement amplification (MDA), real time strand displacement amplification (SDA), or other isothermal amplification.
11. The method of any one of the preceding claims, wherein a plurality of non-complementary target sites are sequenced unidirectionally.
12. The method of claim 11, wherein a polymerase possessing 3' to 5' endonuclease proofreading activity is used to carry out the sequencing reaction.
13. The method of any one of claims 1 to 10, which comprises bidirectional sequencing of at least one duplex nucleic acid molecule.
14. The method of claim 13, wherein said duplex nucleic acid molecule is linear or circular.
15. A kit for performing a method of any one of claims 1 to 14, wherein the kit comprises (a) at least one megaprimer comprising double-stranded nucleic acid of at least 100 base pairs in length and (b) labelled chain terminators.
16. The kit of claim 15, wherein the kit comprises at least one roadblock nucleic acid molecule comprising a modification at the 3 '-end that prevents extension.
17. The kit of claim 16, wherein the modification is attachment of an alkyl group.
18. The kit of claim 15, 16 or 17, wherein the kit comprises a DNA polymerase.
19. The kit of any one of claims 15 to 18, wherein the kit comprises instructions for performing a method of any one of claims 1 to 14.
PCT/GB2010/000861 2009-04-29 2010-04-29 Simultaneous sequencing of multiple nucleic acid sequences WO2010125351A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
GB0907412A GB0907412D0 (en) 2009-04-29 2009-04-29 Simultaneous sequencing of multiple nucleic acid sequences
GB0907412.1 2009-04-29

Publications (1)

Publication Number Publication Date
WO2010125351A1 true WO2010125351A1 (en) 2010-11-04

Family

ID=40792026

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/GB2010/000861 WO2010125351A1 (en) 2009-04-29 2010-04-29 Simultaneous sequencing of multiple nucleic acid sequences

Country Status (2)

Country Link
GB (1) GB0907412D0 (en)
WO (1) WO2010125351A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE102013204712A1 (en) * 2013-03-18 2014-09-18 Siemens Aktiengesellschaft A method and kit for sequence specific arrest of a synthesis of a nucleic acid strand complementary to a single stranded template nucleic acid by a polymerase

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5871929A (en) * 1996-07-23 1999-02-16 Barnes; Wayne M. Suppression of pyrophosphorolysis in DNA sequencing and in other applications involving DNA replication
WO2000020628A1 (en) * 1998-10-01 2000-04-13 Bio-Id Diagnostic Inc. Multi-loci genomic analysis by a method of improved cycle sequencing
WO2002090505A2 (en) * 2001-05-09 2002-11-14 Virginia Commonwealth University Multiple sequencible and ligatible structures for genomic analysis
WO2003056030A2 (en) * 2001-11-08 2003-07-10 The Johns Hopkins University Methods and systems of nucleic acid sequencing

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5871929A (en) * 1996-07-23 1999-02-16 Barnes; Wayne M. Suppression of pyrophosphorolysis in DNA sequencing and in other applications involving DNA replication
WO2000020628A1 (en) * 1998-10-01 2000-04-13 Bio-Id Diagnostic Inc. Multi-loci genomic analysis by a method of improved cycle sequencing
WO2002090505A2 (en) * 2001-05-09 2002-11-14 Virginia Commonwealth University Multiple sequencible and ligatible structures for genomic analysis
WO2003056030A2 (en) * 2001-11-08 2003-07-10 The Johns Hopkins University Methods and systems of nucleic acid sequencing

Non-Patent Citations (21)

* Cited by examiner, † Cited by third party
Title
CHOU, Q.; RUSSELL, M.; BIRCH, D.E.; RAYMOND, J.; BLOCH, W.: "Prevention of pre-PCR mis-priming and primer dimerization improves low-copy-number amplifications", NUCLEIC ACIDS RES, vol. 20, 1992, pages 1717 - 23, XP002190821
EWING, B.; GREEN, P.: "Base-calling of automated sequencer traces using phred. II. Error probabilities", GENOME RES, vol. 8, 1998, pages 186 - 94, XP000915053
EWING, B.; HILLIER, L.; WENDL, M.C.; GREEN, P.: "Base-calling of automated sequencer traces using Phred. I. Accuracy assessment", GENOME RES, vol. 8, 1998, pages 175 - 85
GILBERT, M.T. ET AL.: "Multiplex PCR with minisequencing as an effective high-throughput SNP typing method for formalin-fixed tissue", ELECTROPHORESIS, vol. 28, 2007, pages 2361 - 7
KUPPUSWAMY, M.N. ET AL.: "Single nucleotide primer extension to detect genetic diseases: experimental application to hemophilia B (factor IX) and cystic fibrosis genes", PROC NATL ACAD SCI U S A, vol. 88, 1991, pages 1143 - 7, XP000371662, DOI: doi:10.1073/pnas.88.4.1143
MAYOR, N.P. ET AL.: "Single nucleotide polymorphisms in the NOD2/CARD 15 gene are associated with an increased risk of relapse and death for patients with acute leukemia after hematopoietic stem-cell transplantation with unrelated donors", J CLIN ONCOL, vol. 25, 2007, pages 4262 - 9, XP002692388, DOI: doi:10.1200/JCO.2007.12.1897
MILLER, S.A.; DYKES, D.D.; POLESKY, H.F.: "A simple salting out procedure for extracting DNA from human nucleated cells", NUCLEIC ACIDS RES, vol. 16, 1988, pages 1215
MURPHY, K.M.; ESHLEMAN, J.R.: "Simultaneous sequencing of multiple polymerase chain reaction products and combined polymerase chain reaction with cycle sequencing in single reactions", AM J PATHOL, vol. 161, 2002, pages 27 - 33, XP003005555
NYREN, P.; PETTERSSON, B.; UHLEN, M.: "Solid phase DNA minisequencing by an enzymatic luminometric inorganic pyrophosphate detection assay", ANAL BIOCHEM, vol. 208, 1993, pages 171 - 5, XP024763770, DOI: doi:10.1006/abio.1993.1024
PADMANABHAN, R.; WU, R.: "Nucleotide sequence analysis of DNA. IX. Use of oligonucleotides of defined sequence as primers in DNA sequence analysis", BIOCHEM BIOPHYS RES COMMUN, vol. 48, 1972, pages 1295 - 302, XP024834179, DOI: doi:10.1016/0006-291X(72)90852-2
ROBERTS CHRISSY H ET AL: "Short template amplicon and multiplex megaprimer-enabled relay (STAMMER) sequencing, a simultaneous approach to higher throughput sequence-based typing of polymorphic genes.", IMMUNOGENETICS APR 2010 LNKD- PUBMED:20204613, vol. 62, no. 4, April 2010 (2010-04-01), pages 253 - 260, XP002590288, ISSN: 1432-1211 *
ROBINSON, J.; WALLER, M.J.; STOEHR, P.; MARSH, S.G.E.: "IPD--the Immuno Polymorphism Database", NUCLEIC ACIDS RES, vol. 33, 2005, pages D523 - 6
SANCHEZ, J.A.; PIERCE, K.E.; RICE, J.E.; WANGH, L.J.: "Linear-after-the-exponential (LATE)-PCR: an advanced method of asymmetric PCR and its uses in quantitative real-time analysis", PROC NATL ACAD SCI U S A, vol. 101, 2004, pages 1933 - 8, XP002990854, DOI: doi:10.1073/pnas.0305476101
SANGER, F.; NICKLEN, S.; COULSON, A.R.: "DNA sequencing with chain-terminating inhibitors", PROC NATL ACAD SCI U S A, vol. 74, 1977, pages 5463 - 7, XP008154983, DOI: doi:10.1073/pnas.74.12.5463
SANTAMARIA, P. ET AL.: "HLA class II "typing": direct sequencing of DRB, DQB, and DQA genes", HUM IMMUNOL, vol. 33, 1992, pages 69 - 81, XP023787867, DOI: doi:10.1016/0198-8859(92)90056-S
SARKAR G ET AL: "THE MEGAPRIMER METHOD OF SITE-DIRECTED MUTAGENESIS", 1 January 1990, BIOTECHNIQUES, NATICK, MA, US, PAGE(S) 404 - 407, XP008009292 *
SMITH, L.M. ET AL.: "Fluorescence detection in automated DNA sequence analysis", NATURE, vol. 321, 1986, pages 674 - 9, XP000567803, DOI: doi:10.1038/321674a0
VAN DEN BOOM, D.; JURINKE, C.; RUPPERT, A.; KOSTER, H.: "Forward and reverse DNA sequencing in a single reaction", ANAL BIOCHEM, vol. 256, 1998, pages 127 - 9, XP000752497, DOI: doi:10.1006/abio.1997.2507
WIEMANN, S. ET AL.: ""Doublex" fluorescent DNA sequencing: two independent sequences obtained simultaneously in one reaction with internal labeling and unlabeled primers", ANAL BIOCHEM, vol. 234, 1996, pages 166 - 74, XP000552018, DOI: doi:10.1006/abio.1996.0068
WIEMANN, S. ET AL.: "Simultaneous on-line DNA sequencing on both strands with two fluorescent dyes", ANAL BIOCHEM, vol. 224, 1995, pages 117 - 21, XP000486746, DOI: doi:10.1006/abio.1995.1015
YAGER, T.D. ET AL.: "High performance DNA sequencing, and the detection of mutations and polymorphisms, on the Clipper sequencer", ELECTROPHORESIS, vol. 20, 1999, pages 1280 - 300, XP001053278, DOI: doi:10.1002/(SICI)1522-2683(19990101)20:6<1280::AID-ELPS1280>3.0.CO;2-#

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE102013204712A1 (en) * 2013-03-18 2014-09-18 Siemens Aktiengesellschaft A method and kit for sequence specific arrest of a synthesis of a nucleic acid strand complementary to a single stranded template nucleic acid by a polymerase

Also Published As

Publication number Publication date
GB0907412D0 (en) 2009-06-10

Similar Documents

Publication Publication Date Title
US10214771B2 (en) DNA amplification and sequencing using DNA molecules generated by random fragmentation
US9890408B2 (en) Multiple displacement amplification
US7262030B2 (en) Multiple sequencible and ligatible structures for genomic analysis
JP6110297B2 (en) Combination sequence barcodes for high-throughput screening
Makridakis et al. Multiplex automated primer extension analysis: simultaneous genotyping of several polymorphisms
Onofri et al. Development of multiplex PCRs for evolutionary and forensic applications of 37 human Y chromosome SNPs
EP4148142A1 (en) Methods of sample preparation
US7790391B2 (en) Methods of equalizing representation levels of nucleic acid targets
JPH09501312A (en) Parallel primer extension method for nucleic acid sequence analysis
WO2007149789A2 (en) Conversion of target specific amplification to universal sequencing
CN110088296B (en) Improved multiplex and multistep amplification reactions and reagents therefor
CN106834428B (en) High-throughput multi-site human short fragment tandem repeat sequence detection kit and preparation and application thereof
US20180100180A1 (en) Methods of single dna/rna molecule counting
WO2004067764A2 (en) Nucleic acid sequencing using nicking agents
Asari et al. Single nucleotide polymorphism genotyping by mini-primer allele-specific amplification with universal reporter primers for identification of degraded DNA
WO2010125351A1 (en) Simultaneous sequencing of multiple nucleic acid sequences
US20020018999A1 (en) Methods for characterizing polymorphisms
Best et al. Molecular pathology methods
AU4669100A (en) Nucleotide extension on a microarray of gel-immobilized primers
WO2021166989A1 (en) Method for producing dna molecules having an adaptor sequence added thereto, and use thereof
US20120053064A1 (en) Determining the identity of terminal nucleotides
EP4047100A1 (en) Ngs targeted dna panel using multiplex selective rca
Ableitner “Molbio” in Keywords
KR101762486B1 (en) Marker composition for analysis of maternal lineage in korean
WO2004059013A1 (en) Single nucleotide polymorphism detection using nucleotide depletion genotyping

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 10717204

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 10717204

Country of ref document: EP

Kind code of ref document: A1