WO2011147929A1 - Method of dna sequencing by polymerisation - Google Patents

Publication number
WO2011147929A1
Authority
WO
WIPO (PCT)
Prior art keywords
nucleic acid
double
stranded nucleic
acid molecule
polymerase
Prior art date
Application number
PCT/EP2011/058664
Other languages
French (fr)
Inventor
David Bensimon
Vincent Croquette
Jean-François Allemand
Maria Manosas
Fang-Yuan Ding
Original Assignee
Centre National De La Recherche Scientifique (Cnrs)
Ecole Normale Superieure
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority to ES11722796.7T priority Critical patent/ES2539254T3/en
Application filed by Centre National De La Recherche Scientifique (Cnrs), Ecole Normale Superieure filed Critical Centre National De La Recherche Scientifique (Cnrs)
Priority to CN201180034608.0A priority patent/CN103097551B/en
Priority to DK11722796.7T priority patent/DK2576822T3/en
Priority to JP2013511688A priority patent/JP2013528380A/en
Priority to AU2011257227A priority patent/AU2011257227B2/en
Priority to EP11722796.7A priority patent/EP2576822B1/en
Priority to CA2800637A priority patent/CA2800637C/en
Priority to US13/700,115 priority patent/US9493829B2/en
Priority to KR1020127034149A priority patent/KR101848377B1/en
Publication of WO2011147929A1 publication Critical patent/WO2011147929A1/en
Priority to IL223256A priority patent/IL223256A/en
Priority to HK13111280.1A priority patent/HK1183912A1/en
Priority to US15/334,593 priority patent/US9738928B2/en

Classifications

    • C: CHEMISTRY; METALLURGY
    • C12: BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12Q: MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00: Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68: Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869: Methods for sequencing
    • C12Q1/6874: Methods for sequencing involving nucleic acid arrays, e.g. sequencing by hybridisation
    • C12Q2521/00: Reaction characterised by the enzymatic activity
    • C12Q2521/30: Phosphoric diester hydrolysing, i.e. nuclease
    • C12Q2521/319: Exonuclease
    • C12Q2525/00: Reactions involving modified oligonucleotides, nucleic acids, or nucleotides
    • C12Q2525/30: Oligonucleotides characterised by their secondary structure
    • C12Q2525/301: Hairpin oligonucleotides

Definitions

  • the present invention relates to a fast method for the determination of a sequence of a nucleic acid, DNA or RNA, which is useful, in particular, for the sequencing of an unknown nucleic acid or alternatively for the detection of a specific nucleic acid sequence for diagnosis.
  • nucleic acid sequence is at the heart of molecular biology.
  • a broad range of biological phenomena can be assessed by high-throughput DNA sequencing, e.g., genetic variation, RNA expression, protein-DNA interactions and chromosome conformation (see, for a few examples, Mitreva & Mardis, Methods Mol Biol, 533: 153-87, 2009; Mardis, Genome Med., 1(4): 40, 2009; Cloonan et al, Nat Methods, 5(7): 613-619, 2008; Valouev et al, Genome Res., 18(7): 1051-63, 2008; Valouev et al, Nat Methods., 5(9): 829-34, 2008; Orscheln et al, Clin Infect Dis., 49(4): 536-42, 2009; Walter et al, Proc Natl Acad Sci U S A., 106(31): 12950-5, 2009; Mardis et al, N Engl J Med., 3
  • demonstration of the presence of a specific DNA sequence in a physiological sample constitutes, at the present time, the major line of development of diagnostic methods, e.g. for identifying the probability of bacteria of developing antibiotic resistance, genetic abnormalities, the risks of cancer associated with genetic modifications and viral infections, for example infections associated with HIV or with hepatitis viruses (see for example Zhang et al, Nature, 358: 591-593, 1992; Turner et al, J Bacteriol, 176(12): 3708-3722, 1994; Weston et al, Infection and Immunity, 77(7): 2840-2848, 2009).
  • Nucleic acid sequencing is nowadays carried out chiefly with capillary-based, semi-automated implementations of the Sanger biochemistry.
  • the classical method comprises a step of amplification of the DNA of interest, followed by a step of 'cycle sequencing', wherein each round of primer extension is stochastically terminated by the incorporation of fluorescently labeled dideoxynucleotides (ddNTPs).
  • Sequence is determined by high-resolution electrophoretic separation of the single-stranded, end-labelled extension products in a capillary-based polymer gel. Simultaneous electrophoresis in 96 or 384 independent capillaries provides a limited level of parallelization.
  • High-throughput sequencing technologies are intended to lower the cost of DNA sequencing beyond what is possible with standard dye-terminator methods. At present this very high throughput is achieved with substantial sacrifices in length and accuracy of the individual reads when compared to Sanger sequencing. Examples of such new methods include the 454 and the Solexa technologies. These technologies allow shotgun sequencing of whole genomes without cloning in E. coli or any host cell. Libraries of short, adaptor-flanked DNA fragments captured on the surface of beads are amplified by emulsion PCR.
  • Sequencing is carried out using primed synthesis by DNA polymerase.
  • the array is presented with each of the four dNTPs, sequentially, and the amount of incorporation is monitored by luminometric detection of the pyrophosphate released.
  • a key difference between this method and the Solexa is that the latter uses chain-terminating nucleotides.
  • the fluorescent label on the terminating base can be removed to leave an unblocked 3' terminus, making chain termination a reversible process.
  • the SOLiD technology relies on the ligation of fluorescently labeled di-base probes to a sequencing primer hybridized to an adaptor sequence within the clonally-amplified library template.
  • the Helicos platform allows the sequencing of single DNA molecules. This technology is based on the use of a highly sensitive system for detecting the incorporation of fluorescent nucleotides, so as to directly interrogate single DNA molecules via sequencing by synthesis.
  • the present invention relates to a method for the determination of a nucleic acid sequence by physical manipulation.
  • the said method comprises the steps of determining the physical location of the site where a pause of the replication occurs, and deducing therefrom information on the sequence of the nucleic acid.
  • the method according to the present invention differs from the current approaches, which are chemical or biochemical. Its advantages are numerous:
  • the measurement can be repeated periodically on a second time-scale, thus leading to elimination of false positives (spurious arrest of the polymerase), improved statistics and a significant reduction in instrumental drifts.
  • the experiment can be repeated many times on the same molecule, thus improving the statistics and the reliability of the measurement, since the newly synthesized single stranded nucleic acid can be ejected (by e.g. reducing the force or the ionic strength or by using a helicase or an exonuclease activity) after the replication step. It allows for the parallel sequencing of various double-stranded nucleic acid molecules, since each molecule can be manipulated independently of the others.
  • a non-modified enzyme can be used to synthesize the new strand, which lowers the associated costs and improves the error rate, as compared to third-generation single-molecule sequencing, which either introduces a site-specific mutation in the polymerase to cleave the dye-linker, or binds the enzyme to a side, or modifies it with a quantum dot.
  • the present invention relates to a method for the determination of a nucleic acid sequence based on the physical localization on the sequenced nucleic acid molecule of the sites where replication is paused or blocked.
  • By 'determination of a nucleic acid sequence', it is herein meant not only the deciphering of the actual succession of bases in a nucleic acid, but also all the activities leading directly or indirectly to obtaining some information on the nucleic acid sequence, such as the detection of a particular sequence in a nucleic acid molecule or the detection of a difference between the sequences of two different nucleic acid molecules. Most methods for determining a nucleic acid sequence rely on primed synthesis of a new strand by a processive polymerase.
  • a primer is hybridized to one of the strands of the double-stranded nucleic acid template; a new strand is synthesized from the primer by a polymerase; synthesis is paused or blocked at specific sites; and the detection of these pauses or blockages in polymerisation gives information on the sequence of the said nucleic acid.
  • the present invention stems from the observation that it is possible to measure the physical distance between the two ends of a partially denatured double-stranded nucleic acid molecule when the said molecule is under tension.
  • the progression of a replication fork is associated with the unwinding of the double-stranded nucleic acid molecule, leaving behind two free ends which are joined at the fork.
  • when the replication is blocked at a specific site, the double-stranded nucleic acid molecule is blocked in a conformation where the two strands in front of the replication fork are still annealed, while the two parental strands behind the fork are separated.
  • the inventors have now found that it is possible to measure the physical distance between the two separated ends of the said double-stranded nucleic acid molecule, when the said double-stranded nucleic acid molecule is under tension.
  • the physical position on the said double-stranded nucleic acid molecule of the site where the pause or blockage of replication occurs can then be deduced from the said distance, resulting in some information about the sequence of the said double-stranded nucleic acid molecule.
  • the method of the invention relates to a method for the determination of a nucleic acid sequence, said method comprising the steps of:
  • incubating the hybridized primer/double-stranded nucleic acid molecule obtained in step b) with a polymerase in conditions which will lead to at least one pause in replication;
  • By 'denaturation', it is herein meant the process of strand separation of a double-stranded nucleic acid molecule occurring when most of the hydrogen bonds between the said strands are broken.
  • the denaturation process yields a denatured nucleic acid molecule, by which it is herein meant the two separated complementary strands resulting from the denaturation of a double-stranded nucleic acid molecule.
  • By 'renaturation', it is herein referred to the process by which two separated complementary strands reform through hybridization into a double helix.
  • 'hybridization' is the process of establishing a non-covalent, sequence-specific interaction between two or more complementary strands of nucleic acids into a single hybrid.
  • the two strands are separated by submitting them to a physical force.
  • the free ends of the said double-stranded nucleic acid may be pulled apart, thus rupturing all the bonds between the paired bases, and opening the double-stranded nucleic acid.
  • the double-stranded nucleic acid molecule is a hairpin.
  • the 5' end of one strand is directly joined covalently to the 3' end of the other strand.
  • if the double-stranded nucleic acid is to be represented diagrammatically in the context of the present invention, it is possible to liken it to a "zip fastener", which is opened (or closed): the denaturation of the double-stranded nucleic acid is the unzipping, the renaturation the rezipping.
  • the single-stranded nucleic acid of the invention can be in particular a DNA or an RNA molecule, either natural or modified.
  • "deoxyribonucleic acid" and "DNA" as used herein mean a polymer composed of deoxyribonucleotides.
  • "ribonucleic acid" and "RNA" as used herein mean a polymer composed of ribonucleotides.
  • the said single-stranded nucleic acid may also be made of modified nucleotides, such as locked nucleic acid (LNA), which are nucleotides in which the ribose moiety is modified with an extra bridge connecting the 2' oxygen and 4' carbon, or peptide nucleic acid (PNA), wherein the backbone is composed of repeating N-(2-aminoethyl)-glycine units linked by peptide bonds.
  • the invention applies to any type of double-stranded nucleic acid.
  • the double-stranded nucleic acid will be DNA, but it is understood that the invention also applies to single-stranded DNA-single-stranded DNA duplexes, perfectly paired or not perfectly paired, or alternatively to single-stranded DNA-single-stranded RNA duplexes, perfectly paired or not perfectly paired, or alternatively to single-stranded RNA-single-stranded RNA duplexes, perfectly paired or not perfectly paired.
  • the duplex may consist of the at least partial re-pairing of two single strands obtained from samples of different origins.
  • the invention also applies to the secondary structures of a sole single-stranded DNA or of a sole single-stranded RNA.
  • the double-stranded nucleic acid molecules may be specifically anchored on two solid substrates (e.g. microscope slide, micropipette, microparticle). One of the ends may be attached directly or indirectly to a surface, while the other end is attached directly or indirectly to a movable surface.
  • a tension is applied on both ends of the double-stranded nucleic acid when the supports are moved away. When the tension is higher than a threshold value, the two strands are separated and the nucleic acid molecule is denatured.
  • the tension applied is preferentially above or equal to 15 pN; it is more preferentially above or equal to 16 pN; it is even more preferentially above or equal to 17 pN; in a very much preferred aspect, it is above or equal to 18 pN.
  • This force may vary with temperature, nucleotide type and buffer, but the skilled person will easily adapt the said force with regard to these parameters in order to obtain the separation of the two strands.
  • the double-stranded nucleic acid is denatured by applying a tension higher than a threshold value.
  • Incubating the denatured double-stranded nucleic acid with a single-stranded nucleic acid (the "primer") leads to hybridization of the said single-stranded primer.
  • the sequence of the single-stranded nucleic acid molecule is complementary to at least part of the sequence of the double-stranded nucleic acid molecule.
  • the two strands of the denatured double-stranded nucleic acid can rehybridize.
  • a tension of between 10 and 12 pN is applied; more preferentially it is 12 pN; even more preferentially, it is 11 pN; still more preferentially, it is 10 pN.
  • the polymerase activity is active under these conditions of tension, resulting in primer extension by nucleotide incorporation into a new strand.
  • the double-stranded nucleic acid is a hairpin.
  • 'hairpin' means a double helix wherein the 5' end of one strand is physically linked to the 3' end of the other strand through an unpaired loop.
  • the said physical link can be either covalent or non covalent.
  • the said physical link is a covalent bond.
  • a hairpin consists of a double-stranded stem and an unpaired single-stranded loop. In a hairpin, the ends of the two strands which are not engaged in the loop are free and can thus be pulled apart. This results in the unpairing of the double-stranded nucleic acid, thus yielding a denatured double-stranded nucleic acid molecule.
  • determining the position of the said blockage in replication with respect to one end of the double-stranded nucleic acid gives information on the sequence of the said double-stranded nucleic acid.
  • the loop can be of any length comprised between 0 and 60 nucleotides. It is believed that a loop region of at least about 4 or 5 nucleotides is needed to form a stable hairpin. However, it is also possible to perform the invention with loops of a much shorter length. Indeed, the inventors have found that in some embodiments of the invention it may be advantageous to use a hairpin whose loop consists of 0 nucleotides. In this case, the 3' end of one strand is directly and physically linked to the 5' end of the other strand. Techniques allowing the free ends of double-stranded nucleic acid to be joined together are known, and some will be described in greater detail in what follows.
  • By 'determination of the blockage', it is herein meant the determination of the physical parameters associated with the blockage.
  • the most useful of these parameters is the position of the blockage on the double-stranded nucleic acid molecule, said position corresponding to the position of the last incorporated nucleotide in the newly synthesized single-strand.
  • the position on the stretched double-stranded nucleic acid at which the pause in renaturation occurs can be precisely determined: the use of a hairpin allows the skilled person to determine the physical distance between the two free ends of the hairpin at any time during the denaturation/renaturation process.
  • By 'free end', it is herein meant the end of one strand which is not covalently linked to an extremity of the other strand; as explained above, these free ends may each be bound to a different surface.
  • one of these surfaces may be movable, whilst the other may be motionless. The skilled person will thus easily realize that, in order to measure the distance between the free ends of the hairpin double-stranded nucleic acid, it is possible to simply measure the distance between the two surfaces.
  • This distance is maximal (z_high at a force F_open, which is higher than the threshold value mentioned above) when the hairpin molecule is completely denatured, since the hairpin nucleic acid is then completely extended; it is minimal (z_low at a force F_test, which corresponds to the intermediate value discussed above) when the said hairpin molecule is completely renatured. It is advantageous to perform all length comparisons at the same force F_test, so that the single-stranded nucleic acid has the same elastic properties. Using the delay in loop closing, the skilled user can measure z_high(F_test).
  • the double-stranded nucleic-acid molecule is blocked in a conformation where the two strands in front of the replication fork are still annealed, while the two parental strands behind the fork are separated.
  • the distance z_pause will be minimal.
  • the distance z will be maximal (Fig. 1).
  • the pause can be observed during the polymerase-induced unwinding of a hairpin under a tension F_test. If replication proceeds at a force F_open (for which the hairpin is open) until it is blocked, the replication blockage can be observed upon reducing the force to F_test, which allows for the strands to rezip up to the blockage point.
  • a distance of 1 nm corresponds to the distance spanned by two nucleotides (1 bp) in a nucleic acid under a 10 pN force.
  • the exact calibration versus force is given by the elasticity of single-stranded nucleic acid. Therefore, by simply measuring the distance between the two free ends of the double-stranded nucleic acid molecule under tension, it is possible to determine precisely where the renaturation is blocked.
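  • As an illustration of the distance-to-position conversion described above, the following Python sketch turns a measured end-to-end extension into a number of opened base pairs. It assumes a purely linear scale of about 1 nm per base pair, the figure quoted above for a 10 pN force; the function name, the argument names and the example values are hypothetical, and an actual calibration would rely on the elasticity of single-stranded nucleic acid at the force really used.

```python
def blockage_position_bp(z_pause_nm, z_low_nm, nm_per_bp=1.0):
    """Estimate how many base pairs are open when replication pauses.

    z_pause_nm: extension measured at the test force while replication is blocked.
    z_low_nm:   extension of the fully renatured hairpin at the same test force.
    nm_per_bp:  assumed linear calibration (about 1 nm per opened base pair
                at 10 pN according to the text; an assumption elsewhere).
    """
    if nm_per_bp <= 0:
        raise ValueError("nm_per_bp must be positive")
    opened_nm = z_pause_nm - z_low_nm
    if opened_nm < 0:
        raise ValueError("z_pause_nm must not be smaller than z_low_nm")
    # Round to the nearest whole base pair.
    return round(opened_nm / nm_per_bp)


if __name__ == "__main__":
    # Hypothetical reading: 63 nm of extra extension relative to the closed hairpin.
    print(blockage_position_bp(z_pause_nm=83.0, z_low_nm=20.0))
```

Both extensions are taken at the same test force so that the single-stranded portions have the same elastic properties, in line with the comparison rule stated above.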
  • the method of the invention relates to a method for the determination of a nucleic acid sequence, said method comprising the steps of:
  • hybridizing a single-stranded nucleic acid molecule ("the primer") with the said denatured double-stranded nucleic acid molecule;
  • c) applying a tension to the hybridized primer/double-stranded nucleic acid molecule obtained in b); d) incubating the hybridized primer/double-stranded nucleic acid molecule obtained in step b) with a polymerase in conditions which will lead to at least one pause in replication; and
  • the distance between the two ends of the double-stranded molecule is determined when the replication process is blocked.
  • the distance between the two ends of the said molecule is determined when the molecule is completely denatured. Even more preferentially, the two distances are compared and the position of the blockage is determined.
  • polymerase refers to an enzyme that catalyzes the polymerization of nucleotides (i.e., the polymerase activity). Generally, the enzyme will initiate synthesis at the 3' end of the primer annealed to a polynucleotide template sequence, and will proceed toward the 5' end of the template strand.
  • DNA polymerase catalyzes the polymerization of deoxynucleotides.
  • RNA polymerase catalyzes the polymerization of ribonucleotides.
  • the polymerase according to the invention is either a processive polymerase or non-processive polymerase.
  • a processive enzyme catalyses multiple rounds of a reaction on a denatured double-stranded nucleic acid template, while the enzyme stays bound to the said template.
  • a polymerase will be processive i.e. will stay bound to the denatured double-stranded nucleic acid template for at least 25 nucleotides, at least 50 nucleotides, at least 100 nucleotides, usually at least 500 nucleotides, and may be processive for at least 1000 nucleotides or more.
  • Polymerases according to the invention include RNA-dependent RNA polymerases, DNA-dependent RNA polymerases, DNA-dependent DNA polymerases, RNA-dependent DNA polymerases (reverse transcriptase) and the like. Many such enzymes are known in the art.
  • the said polymerase is capable of synthesizing nucleic acids when the force applied to the double-stranded nucleic acid template is at an intermediary value, i.e. comprised between 10 and 12 pN, or at high forces for which the hairpin is completely unfolded.
  • a polymerase with a 3'-5' exonuclease activity is used in the method of the invention.
  • 3'-5' exonuclease activity refers to the capability of an enzyme to remove incorporated nucleotides from the 3' end of a DNA polymer. Examples of such enzymes include e.g. T4 DNA Polymerase, T7 DNA Polymerase, DEEP VENT DNA polymerase, E. coli polymerase III, Phi29 DNA Polymerase, E. coli DNA Polymerase I, E.
  • the polymerase of the invention can be switched from a polymerase-active mode to a 3'-5' exonuclease-active mode by decreasing the force applied to the double-stranded nucleic acid molecule under a minimal value.
  • the said minimal value is 7 pN; more preferably, the said minimal value is 6 pN; even more preferably, the said minimal value is 5 pN.
  • the said polymerase has, in addition, a strand displacement activity under an intermediate tension, e.g. when a force between 10 and 12 pN is applied to the double-stranded hairpin.
  • strand displacement it is herein meant the ability for the polymerase to displace the downstream nucleic acid during synthesis.
  • the inventors have found in particular that the T4 DNA Polymerase and the T7 DNA polymerase, which are not known to have any strand displacement activity in test tube conditions, i.e. in conditions where no tension is applied to the double-stranded template, are able to remove the downstream nucleic acid during polymerisation when the double-stranded hairpin is under a force > 10 pN.
  • the T4 DNA Polymerase and T7 DNA Polymerase are thus particularly suited for carrying out the method of the invention.
  • "T4 DNA Polymerase" and "T7 DNA Polymerase" herein refer to both the monomeric enzyme and the holoenzyme.
  • the method according to the invention comprises a replication step which is carried out under conditions which will lead to at least one pause in the replication process.
  • the double-stranded nucleic acid molecule is submitted to a tension during the replication step. More preferably, the said tension is around an intermediate value, i.e. the said tension is comprised between 10 and 12 pN.
  • the said pause can be caused by any of the means known to the person skilled in the art. Sequencing-by-synthesis methods wherein the synthesis of the new strand is blocked have been widely used in the art. Any such method can be adapted for the purpose of the present invention.
  • the polymerase may be a nucleotide-sensitive, processive enzyme, which is then contacted with the denatured double-stranded nucleic acid template in a reaction mix which is rate-altering for the processive movement of the enzyme for a specified nucleotide.
  • the said reaction mix comprises a pool of deoxy-nucleotides (dNTP) where one of the bases is present at a very low concentration. In that case, each time the polymerase encounters the complement of the said nucleotide, it pauses until the low concentration nucleotide diffuses into position.
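  • The pausing rule described above can be sketched in a few lines of Python. This is only a toy model under stated assumptions: the template is a DNA string written in the order in which the polymerase reads it, a pause occurs at every template base that is the Watson-Crick complement of the rare nucleotide, and no spurious pauses occur. The names `COMPLEMENT` and `expected_pause_positions` and the example template are hypothetical.

```python
# Watson-Crick pairing for DNA (template base -> incorporated base and back).
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def expected_pause_positions(template, rare_dntp):
    """Return the 1-based template positions where the polymerase would pause.

    template:  template strand, written in the direction the polymerase reads it.
    rare_dntp: base of the nucleotide supplied at very low concentration
               ("A", "C", "G" or "T").
    """
    rare = rare_dntp.upper()
    if rare not in COMPLEMENT:
        raise ValueError("rare_dntp must be one of A, C, G, T")
    # The polymerase waits wherever the template shows the complement
    # of the rare nucleotide.
    wanted = COMPLEMENT[rare]
    return [i for i, base in enumerate(template.upper(), start=1) if base == wanted]


if __name__ == "__main__":
    # Hypothetical 8-base template; dATP is the rare nucleotide,
    # so pauses are expected at every T of the template.
    print(expected_pause_positions("TACGTTAG", "A"))
```

In this model each pause position identifies one occurrence of a single base in the template, which is why the measurement has to be repeated with a different rare nucleotide to recover the other bases.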
  • dNTP: deoxy-nucleotides
  • ddNTPs: dideoxynucleotides
  • modified nucleotides when integrated into a sequence, prevent the addition of further nucleotides. This occurs because a phosphodiester bond cannot form between the dideoxynucleotide and the next incoming nucleotide, and thus the DNA chain is terminated. Therefore, incorporation of one ddNTP will cause the polymerase reaction to stop, since no nucleotide can be added after the said ddNTP.
  • the position of the blockage along the molecule thus reveals the position of incorporation of the ddNTP in the synthesized strand and allows the skilled person to identify the position of the corresponding base in the sequence. The position of each pause or blockage can then be determined by the method of the invention, i.e.
  • the method of the invention may be used for direct sequencing of an unknown nucleic acid.
  • a processive enzyme is used to synthesize from a known single-stranded nucleic acid (a primer) a sequence of increasing extension complementary to one of the hairpin strands, thereby effectively unwinding the double-stranded hairpin maintained under a moderate tension (e.g.
  • polymerisation is initiated by opening the double-stranded hairpin by transiently increasing the force to F_open in the presence of a single-stranded primer.
  • F_open is a tension higher than the threshold value required for completely opening the double-stranded hairpin. This results in the hybridization of the primer to the double-stranded hairpin.
  • the force is set to F_elongation (< F_open) in order to enable the polymerase to synthesize a new strand (in a strand displacement mode or not).
  • F_elongation is preferably set at an intermediate tension lower than F_open (the threshold value required for complete opening of the double-stranded hairpin), said intermediate tension allowing for replication by the polymerase.
  • the said intermediate value is comprised between 10 and 12 pN.
  • the polymerase synthesizes a new strand at a sustained rate until a pause or a blockage occurs. The enzyme activity thus leads to the production of an extended complementary single-stranded nucleic acid molecule.
  • the said polymerase can be switched between its two modes of operation, i.e. exonuclease activity and elongation, by adjusting the force applied on the hairpin.
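  • The force-controlled switching described above can be summarised as a simple lookup. The Python sketch below uses the preferred values quoted in the text as defaults (exonuclease mode below about 5 pN, elongation between 10 and 12 pN, hairpin opening at or above 15 pN); these thresholds may shift with temperature, nucleotide type and buffer, and the function name, parameter names and returned labels are hypothetical.

```python
def polymerase_mode(force_pn,
                    exo_below_pn=5.0,
                    elongation_range_pn=(10.0, 12.0),
                    open_at_pn=15.0):
    """Classify an applied force (in pN) into the regimes named in the text.

    All default thresholds are the preferred values quoted in the description
    and are assumptions for any particular enzyme, buffer or temperature.
    """
    if force_pn < 0:
        raise ValueError("force_pn must be non-negative")
    if force_pn < exo_below_pn:
        # Below the minimal value the enzyme removes the new strand.
        return "exonuclease"
    low, high = elongation_range_pn
    if low <= force_pn <= high:
        # Intermediate tension: primer extension by the polymerase.
        return "elongation"
    if force_pn >= open_at_pn:
        # Above the threshold the hairpin is fully open; the text states
        # that synthesis is also possible at such high forces.
        return "hairpin open"
    # Forces between the quoted windows are not characterised in the text.
    return "unspecified"


if __name__ == "__main__":
    for force in (3.0, 8.0, 11.0, 18.0):
        print(force, polymerase_mode(force))
```

Keeping the thresholds as parameters rather than constants reflects the statement that the skilled person adapts the forces to the experimental conditions.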
  • the elongation process is stopped before the polymerase reaches the loop.
  • This can be achieved by various means. For example, the insertion of non-conventional bases a few nucleotides ahead of the loop will stop the polymerase. The same effect can be achieved with a double-stranded binding domain of a protein located just before the loop.
  • the force (F_exo) applied to the hairpin molecule is decreased under a minimal value, e.g. 5 pN, after the elongation process is blocked.
  • This causes the polymerase to switch to its exonuclease activity and disassemble the strand which has just been synthesized. This process stops when the whole strand is completely disassembled.
  • the disassembly process is stopped when the enzyme is stalled, e.g. when encountering a roadblock, such as a modified nucleotide in the primer. In such a case, it may be necessary to use an enzyme to eject the newly-synthesized strand.
  • As suitable enzymes, one may cite e.g. helicases, including a UvrD helicase, a recBCD helicase, E. coli UvrD helicase, Tte-UvrD helicase, T7 Gp4 helicase, RecBCD helicase, DnaB helicase, MCM helicase, Rep helicase, RecQ helicase, PcrA helicase, T4 UvsW helicase, SV40 large T antigen helicase, Herpes virus helicase, yeast Sgs1 helicase, DEAH ATP-dependent helicases and Papillomavirus helicase E1 protein and homologs thereof, and exonucleases, including snake venom phosphodiesterase, spleen phosphodiesterase, Bal-31 nuclease, E. coli exonuclease I, E. coli exonuclease VII, Mung Bean Nuclease, S1 Nuclease, an exonuclease activity of E. coli DNA polymerase I, an exonuclease activity of a Klenow fragment of DNA polymerase I, an exonuclease activity of T4 DNA polymerase, an exonuclease activity of T7 DNA polymerase, an exonuclease activity of Taq DNA polymerase, an exonuclease activity of DEEP VENT DNA polymerase, E. coli exonuclease III, λ exonuclease and an exonuclease activity of VENTR DNA polymerase.
  • Disassembly of the strand which has just been synthesized provides the opportunity to repeat the whole process, i.e. the synthesis of a strand under a tension above a threshold, e.g. 10 pN, with the polymerase pausing e.g. every time it encounters the complement of the rare nucleotide or stopping if it incorporates a ddNTP or an NTP. If the primer has been expelled during the disassembly step, synthesis will be preceded by a step of opening the hairpin and closing it back so that a primer can hybridize. Increasing the force above 10 pN will switch the polymerase back to the elongation mode and a new pausing pattern may be recorded.
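  • Because the synthesis can be repeated on the same molecule, pausing patterns from several cycles can be combined to discard spurious arrests of the polymerase. The Python sketch below shows one possible way to do so, by keeping only the positions seen in a minimum fraction of the cycles. It assumes that pause positions have already been converted to integer base-pair coordinates; the function name, the voting rule and the example data are hypothetical and are not taken from the source.

```python
from collections import Counter


def consensus_pauses(cycles, min_fraction=0.5):
    """Keep pause positions observed in at least min_fraction of the cycles.

    cycles: list of lists, one list of integer pause positions (in bp)
            per synthesis/disassembly cycle on the same molecule.
    """
    if not 0 < min_fraction <= 1:
        raise ValueError("min_fraction must be in (0, 1]")
    if not cycles:
        return []
    # Count each position at most once per cycle.
    counts = Counter(pos for cycle in cycles for pos in set(cycle))
    needed = min_fraction * len(cycles)
    return sorted(pos for pos, n in counts.items() if n >= needed)


if __name__ == "__main__":
    # Hypothetical data: positions 41 and 29 each appear in one cycle only
    # and are treated as spurious arrests.
    runs = [[12, 30, 47], [12, 30, 41, 47], [12, 29, 30, 47]]
    print(consensus_pauses(runs))
```

A real analysis would also have to tolerate small position errors between cycles, for example by binning positions before voting.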
  • a threshold e.g. 10 pN
  • the replication step can be conducted in the presence of ddNTP at a high force where the hairpin is completely open. In that case blockages resulting from ddNTP incorporation can be detected upon lowering the force to F_test.
  • the procedure is repeated with a shortage of another nucleotide or another ddNTP. After the said procedure has been repeated with each nucleotide, the positions of all the nucleotides in the strand are compiled together, thus yielding the complete sequence of the original double-stranded nucleic acid molecule.
  • Implementation of the method of the invention has been made possible, in particular, by the existence of devices designed for probing real-time nucleic acid interaction at the single-molecule level.
  • a device is described for example in U.S. Patents Nos. 7,052,650 and 7,244,391.
  • the apparatus described therein uses magnetic traps to apply a picoNewton scale force on a micron-sized super-paramagnetic bead.
  • the said apparatus comprises an optical microscope, magnets and a PC.
  • the double-stranded nucleic acid molecules are anchored at multiple points at one end to a motionless element, e.g. a surface, and at the other end to a movable surface, in this case a magnetic bead.
  • Magnets are provided for acting on the bead.
  • the magnets may be used for pulling the bead away from the surface.
  • the implementation of the method of the invention is not restricted to the above apparatus. Any device which allows one to fully extend and then refold a molecule of double stranded nucleic acid, whilst monitoring at the same time the extension of the said molecule can be used to implement the method of the invention.
  • optical tweezers may be used; they require however prior force calibration and are not easily parallelized for high throughput measurements.
  • Further drawbacks are the lack of total torsional control of the nucleic acid and the possible local heating of the solution by the focussed laser which may alter the hybridization conditions.
  • the double stranded nucleic acid is incubated for a few minutes in a solution of adequate beads (for example streptavidin coated ones) to which it binds by one of its labeled (for example biotin) ends.
  • the beads can be transparent if optical tweezers are later used for manipulation or magnetic if one uses magnetic traps or tweezers for manipulation.
  • the bead-nucleic acid assembly is injected in a fluidic chamber the surface of which has been treated such as to bind the other labeled end of the molecule (for example a surface coated with anti-Dig to bind the Dig-labeled end of the nucleic acid).
  • the beads are thus anchored to the surface via a nucleic acid hairpin, see Fig. 1a.
  • the distance of the bead to the surface is then monitored by various means known to the man of the art: for example the diffraction rings of their image on a camera can be used to deduce their distance, or the light intensity they scatter (or emit by fluorescence) when illuminated in an evanescent mode can be used to measure their distance.
  • the magnetic field they generate can be measured (using a magnetic sensor such as GMR or Hall sensors) to deduce their distance to a sensor on the anchoring surface.
  • the preferred embodiment uses a magnetic trap to pull on superparamagnetic beads anchored to a surface by a nucleic acid hairpin as described above.
  • small magnets placed above the sample are used to apply a constant force on the anchored bead, whose position can be determined with ⁇ 1 nm accuracy (depending on the pulling force and the dissipation due to hydrodynamic drag)
  • the tethering hairpin can be mechanically fully unzipped by pulling on the beads with a force larger than about 16 pN.
  • the blocking position is related to the sequence by a linear relation between full extension and the blocked one.
  • the full extension is preferably measured at the test force F_test. This is achieved by designing the hairpin loop such that it requires a fraction of a second to refold once the force is reduced from F_open to F_test.
  • nucleic acid becomes anchored directly to the support, for example the micro-bead, which involves a functionalization of this surface, for example by coating it with streptavidin, a COOH group, and the like, capable of reacting with the functionalized end of the nucleic acid.
  • Such methods necessitate, in general, functionalizing the nucleic acid, especially the 3' and 5' ends, that is to say grafting appropriate chemical groups onto them. It is, moreover, preferable to join the other two free ends of the molecule by a loop in order to prevent the strands from dissociating at the end of the operation, so that the latter can be repeated if appropriate. For this purpose, different procedures may be adopted.
  • the simplest is to functionalize, using synthetic oligonucleotides, one of the ends of a double-stranded nucleic acid with two different functions (biotin and amine, for example), which permit anchoring to two different pre-treated surfaces.
  • the two strands at the other end may be joined using a partially paired synthetic nucleotide in the form of a loop.
  • a paired, single-stranded nucleic acid i.e. a hairpin
  • the advantage of this method lies in its capacity to functionalize a heterogeneous population of large nucleic acid fragments (as are obtained by fractionation of a gene or chromosome), which can then be analyzed simultaneously.
  • the nucleic acid sample is fractionated using two (or more) restriction enzymes, which enables a subpopulation to be obtained with two different restriction sites at its ends which are similar over all the fragments.
  • This enables the two ends to be treated differently (for example by joining one end to an oligonucleotide in the form of a loop possessing the appropriate restriction site at its end).
  • the drawback of this method lies in the steric interference between the two adjacent functional groups, which can make coupling to the surfaces difficult.
  • each spacer sequence of bases is designed in order to use single-stranded sequencing primers of known sequence in the sequencing method of the invention.
  • the addition of a loop and/or spacers to the double-stranded nucleic acid molecules can be performed with any of the methods commonly used in molecular biology. These methods are well known to the person skilled in the art and there is thus no need to detail them here.
  • anchoring techniques: there are many of these and they derive from the techniques for anchoring macromolecules (proteins, DNA, and the like) to commercially available pretreated surfaces. Most of these techniques have been developed for immunology tests, and link proteins (immunoglobulins) to surfaces carrying groups (-COOH, -NH2, -OH, and the like) capable of reacting with the carboxyl (-COOH) or amine (-NH2) ends of proteins.
  • the covalent anchoring of nucleic acid may be accomplished directly, via the free phosphate of the 5' end of the molecule, which reacts with a secondary amine (Covalink — NH surface marketed by Polylabo at France) to form a covalent bond. It is also possible to functionalize DNA with an amine group and then to proceed as with a protein.
  • Patent EP 146 815 also describes various methods of attachment of DNA to a support.
  • patent application WO 92/16659 proposes a method using a polymer to attach DNA.
  • nucleic acid may be attached directly to the support but, where necessary, especially with a view to limiting the influence of the surfaces, the nucleic acid may be attached at the end of an inert arm of peptide or other nature, as is, for example, described in Patent EP 329 198.
  • Figure 1 Principle of detection of the hybridization of oligo-nucleotides to their complementary sequence on a hairpin DNA.
  • the hairpin DNA anchoring the bead to the surface (a) is momentarily unzipped by increasing the force pulling on the bead to a value above 16 pN.
  • the complementary fragment in solution hybridizes to its target on the opened DNA hairpin, thus preventing the rezipping of the hairpin (b) when the force is reduced back to its initial value.
  • the hairpin refolding presents four plateaus occurring at well-defined extensions but with variable duration.
  • the top plateau at 73.71 nm is associated with the 83 bp fully opened hairpin at F_test, while the bottom one corresponds to the hairpin completely refolded.
  • the two intermediate plateaus at 25.47 nm and 35.17 nm occur because two oligos have been placed in the solution. From these changes in extension (z_high - z) it is possible to deduce where along the hairpin the complementary sequence has paired. Here, according to their positions, the blocks coincide with locations 28.66 bp and 39.60 bp, in very good agreement with their expected positions at 29 bp and 40 bp.
  • the plateau positions are better estimated by fitting Gaussians to the histogram obtained from several opening/closing cycles (here ~20 cycles).
  • dATP 5 ⁇
  • ddATP 400 ⁇ ,
  • the peak of the curve shows the existence of hairpin sample.
  • the rising curve shows the processive synthesis of a new complementary strand by the T4 DNA polymerase, while the plateau represents the pause occurring when one ddNTP is incorporated.
  • the falling edge of the curve shows the exonuclease activity removing the strand just synthesized. These synthesis and exonuclease phases may be repeated in cycles.
  • FIG. 3 Illustration of sequencing based on unbalanced dNTP concentration.
  • commercial buffer (5X Reaction Buffer: 335 mM Tris-HCl (pH 8.8 at 25°C), 33 mM MgCl2, 5 mM DTT, 84 mM (NH4)2SO4, Fermentas; http://www.fermentas.com/en/products/all/modifying-enzymes/mesophilic-polymerases/ep006-t4-dna-polymerase).
  • the peak of the curve shows the existence of hairpin sample and could be used to hybridize a primer if needed.
  • middle force -11.7 pN
  • the rising edge of the curve shows the processive synthesis of a new complementary strand by the T4 DNA polymerase, while the transient pauses correspond to the positions where a dATP is needed.
  • low force -1.6 pN
  • the falling edge shows the cleavage of the newly-synthesized strand, due to the activation of the exonuclease activity of the T4 DNA polymerase. This synthesis and cleavage cycle is repeatable.
  • FIG. 4 Detection of the sequencing based on unbalanced dNTP concentration to their complementary sequence on a hairpin.
  • the peak of the curve shows the existence of hairpin sample and could be used to hybridize a primer if needed.
  • middle force -11.7 pN
  • the rising edge of the curve shows the processive synthesis of a new complementary strand by the T4 DNA polymerase, while the transient pauses correspond to the positions where a dATP is needed.
  • low force -1.6 pN
  • the falling edge shows the cleavage of the newly-synthesized strand, due to the activation of the exonuclease activity of the T4 DNA polymerase. This synthesis and cleavage cycle is repeatable.
  • the position histogram of these transient pauses is deduced in the same way as previously described for Fig. 1, and base calls are indicated (arrows).
  • Fig. 4B There are two ways to measure the extension change. First, we can use the Zzip as reference; alternatively, we can also use the position of the hairpin loop as reference.
  • Figure 5. Illustration of hairpin design.
  • the DNA fragment of interest is ligated with a DNA loop (with abasic and LNA bases) and two partially complementary DNA fragments (one ssDNA with biotin at the end, one dsDNA with Dig at the extremity), which eventually forms a hairpin that can be bound at one extremity to a super-paramagnetic bead covered with streptavidin (DYNAL) and at the other extremity to a glass coverslip treated with anti-Dig.
  • DYNAL super-paramagnetic bead covered with streptavidin
  • the free 3' end of one strand can be labelled with biotin allowing binding to streptavidin coated beads, whereas the 5' end on the opposite strand can be labelled with digoxigenin allowing its binding to surfaces coated with an anti-Dig antibody.
  • This end-labelling can be done by various ways known to the man of the art, such as the use of terminal transferase to add biotin (or dig) modified nucleotides or hybridization with suitably labelled oligonucleotides.
  • This DNA construct is incubated for a few minutes in a solution of adequate beads (for example streptavidin coated ones) to which it binds by one of its labelled (for example biotin) ends.
  • the beads can be transparent if optical tweezers are later used for manipulation or magnetic if one uses magnetic traps or tweezers for manipulation.
  • the bead-DNA assembly is injected in a fluidic chamber the surface of which has been treated such as to bind the other labelled end of the molecule (for example a surface coated with anti-Dig to bind the Dig-labelled end of the DNA).
  • the beads are thus anchored to the surface via a DNA hairpin, see Fig. 1a.
  • the distance of the bead to the surface is then monitored by various means known to the man of the art: for example the diffraction rings of their image on a camera can be used to deduce their distance, or the light intensity they scatter (or emit by fluorescence) when illuminated in an evanescent mode can be used to measure their distance.
  • the magnetic field they generate can be measured (using a magnetic sensor such as GMR or Hall sensors) to deduce their distance to a sensor on the anchoring surface.
  • the preferred embodiment uses a magnetic trap to pull on super- paramagnetic beads anchored to a surface by a DNA hairpin as described above.
  • small magnets placed above the sample are used to apply a constant force on the anchored bead, whose position can be determined with ⁇ 1 nm accuracy (depending on the pulling force and the dissipation due to hydrodynamic drag).
  • the apparatus described in U.S. Patents No. 7,052,650 and 7,244,391 was used.
  • the experiments reported here were performed in 25 mM Tris pH 7.5, 150 mM KAc, 10 mM MgCl 2 , 0.2 % BSA.
  • the tethering hairpin can be mechanically fully unzipped by pulling on the beads with a force larger than about 16 pN. Reducing the tension on the molecule to below about 11 pN allows the hairpin to re-zip spontaneously (the unzipping transition is reversible though hysteretic).
  • a molecule in solution such as a protein or complementary oligo-nucleotides of DNA, RNA, LNA or PNA
  • ss stretched single stranded
  • the principle of the assay is to switch between two forces: a large one F_open to open the hairpin and a smaller one F_test used to allow re-zipping and to measure the extension of the molecule at transient blockages.
  • the blocking position is related to the sequence by a linear relation between full extension and the blocked one.
  • the full extension is preferably measured at the test force F_test. This is achieved by designing the hairpin loop such that it requires a fraction of a second to refold once the force is reduced from F_open to F_test.
  • the hybridization position of an oligo-nucleotide can be measured with a basepair resolution
  • $\langle x^2 \rangle = 4 k_B T \, \Delta f \, (6 \pi \eta r) / k_{ssDNA}^2(F)$
  • $k_{ssDNA}(F)$ is the stiffness of a ssDNA molecule
  • $k_B$ is Boltzmann's constant
  • $T$ the absolute temperature
  • $\eta$ the viscosity of water
  • $r$ the bead's radius
  • $\Delta f$ is the frequency range of the measurement.
  • the T4 DNA polymerase can replicate a DNA hairpin when the force is high enough to sufficiently destabilize the fork (F_test).
  • F tes t the fork
  • the incorporation of a specific ddNTP will prevent further elongation of the nascent strand by the T4 DNA polymerase.
  • this blockage can be easily identified, as shown in Fig. 6.
  • the exonuclease activity of the T4 DNA polymerase is activated and the enzyme excises the newly-synthesized strand.
  • the molecule hairpin
  • the molecule can be sequenced by identifying the blockage positions first in the presence of ddATP, and then of each of the other ddNTPs, i.e. ddTTP, ddCTP, and ddGTP.
  • the double-stranded hairpin molecule can be sequenced in a buffer comprising a deficit of one of the four dNTPs compared to the others, i.e. this dNTP is present at a very low concentration as compared to the others.
  • the molecule hairpin
  • dATP for example
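The Brownian-noise expression listed among the items above, <x^2> = 4 k_B T Δf (6πηr) / k_ssDNA^2(F), can be evaluated numerically to get a feel for the attainable resolution. The following is a minimal sketch only: the function name, the default temperature and viscosity, and every numerical input (stiffness, bandwidth, bead radius) are illustrative assumptions and are not values taken from this document.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant in J/K


def rms_position_noise(k_ssdna, bandwidth_hz, bead_radius_m,
                       temperature_k=298.0, viscosity_pa_s=1.0e-3):
    """Return sqrt(<x^2>) in metres, with <x^2> = 4 k_B T Δf (6πηr) / k^2.

    k_ssdna is the (assumed) ssDNA stiffness in N/m at the applied force.
    """
    drag = 6.0 * math.pi * viscosity_pa_s * bead_radius_m  # Stokes drag, N·s/m
    variance = 4.0 * K_B * temperature_k * bandwidth_hz * drag / k_ssdna ** 2
    return math.sqrt(variance)


# Hypothetical inputs chosen only to illustrate the call.
noise_m = rms_position_noise(k_ssdna=1.0e-4, bandwidth_hz=1.0,
                             bead_radius_m=0.5e-6)
print(f"rms position noise: {noise_m * 1e9:.3f} nm")
```

Under this expression the noise grows as the square root of the measurement bandwidth and falls as the inverse of the tether stiffness, which is why averaging longer or pulling harder (a stiffer ssDNA) would be expected to sharpen the position estimate.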

Abstract

The present invention relates to a method for the determination of a nucleic acid sequence by physical manipulation. The method is based on the precise determination of the localization of the replicating fork on the template by measuring the physical distance between one end of the molecule and the fork. This allows the determination of the physical location of the site where a pause or a blockage of the replication occurs, and deducing therefrom information on the sequence of the nucleic acid.

Description

METHOD OF DNA SEQUENCING BY POLYMERISATION
The present invention relates to a fast method for the determination of a sequence of a nucleic acid, DNA or RNA, which is useful, in particular, for the sequencing of an unknown nucleic acid or alternatively for the detection of a specific nucleic acid sequence for diagnosis.
Nowadays, the determination of nucleic acid sequence is at the heart of molecular biology. For example, a broad range of biological phenomena can be assessed by high- throughput DNA sequencing, e.g., genetic variation, RNA expression, protein-DNA interactions and chromosome conformation (see, for a few examples, Mitreva & Mardis, Methods Mol Biol, 533: 153-87, 2009; Mardis, Genome Med., 1(4): 40, 2009; Cloonan et al, Nat Methods, 5(7): 613-619, 2008; Valouev et al, Genome Res., 18(7): 10 1-63, 2008, Valouev et al, Nat Methods., 5(9):829-34, 2008; Orscheln et al, Clin Infect Dis., 49(4): 536-42, 2009 ; Walter et al, Proc Natl Acad Sci US A., 106(31): 12950-5, 2009; Mardis et al, N Engl J Med., 361(11): 1058-66, 2009, Hutchinson, Nucl. Acids Res., 35(18): 6227-6237, 2007).
In addition, demonstration of the presence of a specific DNA sequence in a physiological sample constitutes, at the present time, the major line of development of diagnostic methods, e.g. for identifying the probability of bacteria developing antibiotic resistance, genetic abnormalities, the risks of cancer associated with genetic modifications and viral infections, for example infections associated with HIV or with hepatitis viruses (see for example Zhang et al, Nature, 358: 591-593, 1992; Turner et al, J Bacteriol, 176(12): 3708-3722, 1994; Weston et al, Infection and Immunity, 77(7): 2840-2848, 2009).
Nucleic acid sequencing is nowadays carried out chiefly with capillary-based, semi-automated implementations of the Sanger biochemistry. The classical method comprises a step of amplification of the DNA of interest, followed by a step of 'cycle sequencing', wherein each round of primer extension is stochastically terminated by the incorporation of fluorescently labelled dideoxynucleotides (ddNTPs). Sequence is determined by high-resolution electrophoretic separation of the single-stranded, end-labelled extension products in a capillary-based polymer gel. Simultaneous electrophoresis in 96 or 384 independent capillaries provides a limited level of parallelization.
The high demand for low-cost sequencing has driven the development of high- throughput sequencing technologies that parallelize the sequencing process, producing thousands or millions of sequences at once (Shendure & Ji, Nat Biotechnol, 26(10): 1135-45. 2008). High-throughput sequencing technologies are intended to lower the cost of DNA sequencing beyond what is possible with standard dye-terminator methods. At present this very high throughput is achieved with substantial sacrifices in length and accuracy of the individual reads when compared to Sanger sequencing. Examples of such new methods include the 454 and the Solexa technologies. These technologies allow shotgun sequencing of whole genomes without cloning in E. coli or any host cell. Libraries of short, adaptor-flanked DNA fragments captured on the surface of beads are amplified by emulsion PCR. Sequencing is carried out using primed synthesis by DNA polymerase. In the 454 method (also known as 'pyrosequencing'), the array is presented with each of the four dNTPs, sequentially, and the amount of incorporation is monitored by luminometric detection of the pyrophosphate released. A key difference between this method and the Solexa is that the latter uses chain-terminating nucleotides. The fluorescent label on the terminating base can be removed to leave an unblocked 3' terminus, making chain termination a reversible process. The SOLiD technology relies on the ligation of fluorescently labeled di-base probes to a sequencing primer hybridized to an adaptor sequence within the clonally-amplified library template. Specificity of the di-base probe is achieved by interrogating every 1st and 2nd base in each ligation reaction. Multiple cycles of ligation, detection and cleavage are performed with the number of cycles determining the eventual read length. In contrast to the three previous technologies, which all require a first step of amplification, the Helicos platform allows the sequencing of single DNA molecules. 
This technology is based on the use of a highly sensitive system for detecting the incorporation of fluorescent nucleotides to directly interrogate single DNA molecules via sequencing by synthesis.
Such methods are described in e.g. U.S. Patent No. 4,882,127; U.S. Patent No. 4,849,077; U.S. Patent No. 7,556,922; U.S. Patent No. 6,723,513; PCT Patent Application No. WO 03/066896; PCT Patent Application No. WO2007111924; U.S. Patent Application No. US 2008/0020392; PCT Patent Application No. WO 2006/084132; U.S. Patent Application No. US 2009/0186349; U.S. Patent Application No. US 2009/0181860; U.S. Patent Application No. US 2009/0181385; U.S. Patent Application No. US 2006/0275782; European Patent EP-B1-1141399; Shendure & Ji, Nat Biotechnol., 26(10): 1135-45, 2008; Pihlak et al, Nat Biotechnol., 26(6): 676-684, 2008; Fuller et al., Nature Biotechnol., 27(11): 1013-1023, 2009; Mardis, Genome Med., 1(4): 40, 2009; Metzker, Nature Rev. Genet., 11(1): 31-46, 2010.
However, all the methods developed so far suffer from serious drawbacks. In particular, they all make use of labelled nucleotides (e.g. fluorescent), thus contributing to seriously increasing the overall costs. Moreover, all these new methods bar one (the Helicos platform) require amplification of the target sequence prior to sequencing, which is time consuming on the one hand, increases the probability of errors on the other hand, and is highly prone to contamination. In addition, the methods involving mechanical rather than biochemical techniques lack sensitivity (Maier et al., Proc. Natl. Acad. Sci. U.S.A., 97(22): 12002-12007, 2000; Wuite et al., Nature, 404(6773): 103-106, 2000; US 2010/0035252). There is thus still a need for a new, highly sensitive method allowing the sequencing of single molecules.
Detailed description of the invention
The present invention relates to a method for the determination of a nucleic acid sequence by physical manipulation. In particular, the said method comprises the steps of determining the physical location of the site where a pause of the replication occurs, and deducing there-from information on the sequence of the nucleic acid.
The method according to the present invention, based on physical techniques and electronic treatments, differs from the current approaches, which are chemical or biochemical. Its advantages are numerous:
1) It allows the sequencing of a single molecule, and thus does not require a previous amplification step (e.g. by PCR).
2) It is far cheaper than the methods of the art since standard nucleic acid molecules are used, which are far less expensive than labelled nucleotides (either with fluorophores or some other groups).
3) It makes it possible to determine the localization (in bp) of a newly synthesized complementary strand along a double-stranded nucleic acid by measuring the distance between the two ends of the said double-stranded nucleic acid molecule.
4) In one embodiment, it allows the position of a given nucleotide along the strand to be determined in one polymerization run.
5) The measurement can be repeated periodically on a second time-scale, thus leading to elimination of false positives (spurious arrest of the polymerase), improved statistics and a significant reduction in instrumental drifts.
6) The experiment can be repeated many times on the same molecule, thus improving the statistics and the reliability of the measurement, since the newly synthesized single-stranded nucleic acid can be ejected (by e.g. reducing the force or the ionic strength or by using a helicase or an exonuclease activity) after the replication step.
7) It allows for the parallel sequencing of various double-stranded nucleic acid molecules, since each molecule can be manipulated independently of the others.
8) A non-modified enzyme can be used to synthesize the new strand, which lowers the associated costs and improves the error rate, as compared to the third-generation single molecule sequencing, which either introduces site-specific mutation in the polymerase to cleave the dye-linker, or binds the enzyme to a side, or modifies it with a quantum dot.
The present invention relates to a method for the determination of a nucleic acid sequence based on the physical localization on the sequenced nucleic acid molecule of the sites where replication is paused or blocked.
By 'determination of a nucleic acid sequence', it is herein meant not only the deciphering of the actual succession of bases in a nucleic acid, but also all the activities leading directly or indirectly to obtaining some information on nucleic acid sequence, such as the detection of a particular sequence in a nucleic acid molecule or the detection of a difference between the sequences of two different nucleic acid molecules.
Most methods for determining a nucleic acid sequence rely on primed synthesis of a new strand by a processive polymerase. In these methods, a primer is hybridized to one of the strands of the double-stranded nucleic acid template; a new strand is synthesized from the primer by a polymerase; synthesis is paused or blocked at specific sites; and the detection of these pauses or blockages in polymerisation gives information on the sequence of the said nucleic acid.
It has now been discovered according to the invention that it is possible to exploit the physical parameters associated with this blockage to obtain information on the sequence of the double-stranded nucleic acid. More precisely, the inventors have found that it is possible to physically locate on the said double-stranded nucleic acid molecule the site where the replication pause or blockage occurs; the specific physical position of the pause or blockage then provides information on the sequence of the said double-stranded nucleic acid.
The present invention stems from the observation that it is possible to measure the physical distance between the two ends of a partially denatured double-stranded nucleic acid molecule when the said molecule is under tension. In a sequencing-by-synthesis process, the progression of a replication fork is associated with the unwinding of the double-stranded nucleic acid molecule, leaving behind two free ends which are joined at the fork. When the replication is blocked at a specific site, the double-stranded nucleic acid molecule is blocked in a conformation where the two strands in front of the replication fork are still annealed, while the two parental strands behind the fork are separated. The inventors have now found that it is possible to measure the physical distance between the two separated ends of the said double-stranded nucleic acid molecule, when the said double-stranded nucleic acid molecule is under tension. The physical position on the said double-stranded nucleic acid molecule of the site where the pause or blockage of replication occurs can then be deduced from the said distance, resulting in some information about the sequence of the said double-stranded nucleic acid molecule.
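As a rough illustration of how a position can be deduced from a measured distance, the sketch below applies a simple linear scaling between the extension at a blockage and the extension of the fully opened molecule. It uses the Figure 1 values quoted in this document (an 83 bp hairpin, 73.71 nm when fully open at the test force, plateaus at 25.47 nm and 35.17 nm). The function name is hypothetical, and the sketch assumes that the plateau extensions are measured relative to the fully refolded state; it is not presented as the inventors' own data treatment.

```python
def blockage_position_bp(z_block_nm, z_open_nm, hairpin_length_bp):
    """Map a plateau extension to a base-pair position by linear scaling.

    Assumes both extensions are measured from the fully refolded hairpin
    and at the same test force.
    """
    if z_open_nm <= 0:
        raise ValueError("z_open_nm must be positive")
    return hairpin_length_bp * z_block_nm / z_open_nm


# Figure 1 values quoted in this document.
Z_OPEN_NM = 73.71
HAIRPIN_BP = 83

for z_block in (25.47, 35.17):
    position = blockage_position_bp(z_block, Z_OPEN_NM, HAIRPIN_BP)
    print(f"{z_block} nm -> {position:.1f} bp")
```

With these inputs the scaling gives roughly 28.7 bp and 39.6 bp, close to the 28.66 bp and 39.60 bp reported for Figure 1 and to the expected positions of 29 bp and 40 bp.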
Thus, the method of the invention relates to a method for the determination of a nucleic acid sequence, said method comprising the steps of:
a) denaturing a double-stranded nucleic acid molecule corresponding to the said nucleic acid sequence;
b) hybridizing a single-stranded nucleic acid molecule ("the primer") with the said denatured double-stranded nucleic acid molecule;
c) applying a tension to the hybridized primer/double-stranded nucleic acid molecule obtained in b);
d) incubating the hybridized primer/double-stranded nucleic acid molecule obtained in step b) with a polymerase in conditions which will lead to at least one pause in replication; and
e) determining the position of the said pause in replication with respect to one end of the double-stranded nucleic acid.
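When the procedure is repeated with each of the four nucleotides in turn, as described elsewhere in this document, the pause or blockage positions found in step e) for each run can be compiled into a single read. The following is only a sketch of that bookkeeping step: the function name, the dictionary layout and the example positions are hypothetical, and positions are assumed to have already been rounded to whole bases of the newly synthesized strand.

```python
def compile_sequence(blockages_by_base, length):
    """Merge per-base blockage positions (1-based) into one sequence string.

    Positions that were not observed in any run are reported as 'N'.
    """
    calls = ["N"] * length
    for base, positions in blockages_by_base.items():
        for pos in positions:
            if not 1 <= pos <= length:
                raise ValueError(f"position {pos} outside 1..{length}")
            if calls[pos - 1] != "N":
                raise ValueError(f"conflicting calls at position {pos}")
            calls[pos - 1] = base
    return "".join(calls)


# Hypothetical positions from four runs, one per terminating nucleotide.
runs = {"A": [1, 5], "C": [2], "G": [3, 6], "T": [4]}
print(compile_sequence(runs, 6))  # ACGTAG
```

Raising an error on conflicting calls, rather than silently overwriting, is a design choice made here so that a spurious polymerase arrest shows up as an inconsistency to be resolved by repeating the measurement.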
By 'denaturation', it is herein meant the process of strand separation of a double-stranded nucleic acid molecule occurring when most of the hydrogen bonds between the said strands are broken. The denaturation process yields a denatured nucleic acid molecule, by which it is herein meant the two separated complementary strands resulting from the denaturation of a double-stranded nucleic acid molecule. By 'renaturation', it is herein referred to the process by which two separated complementary strands reform through hybridization into a double helix. As used herein, 'hybridization' is the process of establishing a non-covalent, sequence-specific interaction between two or more complementary strands of nucleic acids into a single hybrid.
There are several possibilities known to the skilled person to denature the nucleic acid. In a most preferred manner, the two strands are separated by submitting them to a physical force. For example, the free ends of the said double-stranded nucleic acid may be pulled apart, thus rupturing all the bonds between the paired bases, and opening the double-stranded nucleic acid.
In this type of sequence determination method, it can be advantageous, in order to facilitate re-pairing, to arrange for the free ends of the double-stranded DNA (i.e. the ends which are not attached to supports) to be joined to one another covalently or quasi-covalently before pulling apart. In a preferred embodiment, the double-stranded nucleic acid molecule is a hairpin. In another preferred embodiment, the 5' end of one strand is directly joined covalently to the 3' end of the other strand. If it is desired that the double-stranded nucleic acid be represented diagrammatically in the context of the present invention, it is possible to liken it to a "zip fastener", which is opened (or closed): the denaturation of the double-stranded nucleic acid is the unzipping, the renaturation the rezipping.
The single-stranded nucleic acid of the invention can be in particular a DNA or an RNA molecule, either natural or modified. The terms "deoxyribonucleic acid" and "DNA" as used herein mean a polymer composed of deoxyribonucleotides. The terms "ribonucleic acid" and "RNA" as used herein mean a polymer composed of ribonucleotides. The said single-stranded nucleic acid may also be made of modified nucleotides, such as locked nucleic acid (LNA), which are nucleotides in which the ribose moiety is modified with an extra bridge connecting the 2' oxygen and 4' carbon, or peptide nucleic acid (PNA), wherein the backbone is composed of repeating N-(2-aminoethyl)-glycine units linked by peptide bonds.
The invention applies to any type of double-stranded nucleic acid. Most often, the double-stranded nucleic acid will be DNA, but it is understood that the invention also applies to single-stranded DNA-single-stranded DNA duplexes, perfectly paired or not perfectly paired, or alternatively to single-stranded DNA-single-stranded RNA duplexes, perfectly paired or not perfectly paired, or alternatively to single-stranded RNA-single-stranded RNA duplexes, perfectly paired or not perfectly paired. Furthermore, the duplex may consist of the at least partial re-pairing of two single strands obtained from samples of different origins. Finally, the invention also applies to the secondary structures of a sole single-stranded DNA or of a sole single-stranded RNA.
In a typical configuration, the double-stranded nucleic acid molecules may be specifically anchored on two solid substrates (e.g. microscope slide, micropipette, microparticle). One of the ends may be attached directly or indirectly to a surface, while the other end is attached directly or indirectly to a movable surface. In this embodiment, a tension is applied on both ends of the double-stranded nucleic acid when the supports are moved away. When the tension is higher than a threshold value, the two strands are separated and the nucleic acid molecule is denatured. The tension applied is preferentially above or equal to 15 pN; it is more preferentially above or equal to 16 pN; it is even more preferentially above or equal to 17 pN; in a very much preferred aspect, it is above or equal to 18 pN. This force may vary with temperature, nucleotide type and buffer, but the skilled person will easily adapt the said force with regard to these parameters in order to obtain the separation of the two strands.
In a preferred embodiment of the invention, the double-stranded nucleic acid is denatured by applying a tension higher than a threshold value. Incubating the denatured double-stranded nucleic acid with a single-stranded nucleic acid (the "primer") leads to hybridization of the said single-stranded primer. Preferentially, the sequence of the single-stranded nucleic acid molecule is complementary to at least part of the sequence of the double-stranded nucleic acid molecule.
When the tension is decreased to around an intermediate value, the two strands of the denatured double-stranded nucleic acid can rehybridize. To obtain rehybridization of the said two strands, a tension of between 10 and 12 pN is applied; more preferentially it is 12 pN; even more preferentially, it is 11 pN; still more preferentially, it is 10 pN. According to the invention, the polymerase activity is active under these conditions of tension, resulting in primer extension by nucleotide incorporation into a new strand. Most preferably, the double-stranded nucleic acid is a hairpin. As used herein, 'hairpin' means a double helix wherein the 5' end of one strand is physically linked to the 3' end of the other strand through an unpaired loop. The said physical link can be either covalent or non-covalent. Preferentially, the said physical link is a covalent bond. Thus, a hairpin consists of a double-stranded stem and an unpaired single-stranded loop. In a hairpin, the ends of the two strands which are not engaged in the loop are free and can thus be pulled apart. This results in the unpairing of the double-stranded nucleic acid, thus yielding a denatured double-stranded nucleic acid molecule. It is possible to open completely a hairpin double-stranded nucleic acid molecule by pulling on each end of the said nucleic acid molecule with a force higher than a threshold value. When the tension applied to the molecule is decreased to an intermediate value, the nucleic acid molecule self-rehybridizes to reform a hairpin. Under this intermediate tension, a new strand is produced by the polymerase activity of the enzyme, until a blockage is encountered. According to the invention, determining the position of the said blockage in replication with respect to one end of the double-stranded nucleic acid gives information on the sequence of the said double-stranded nucleic acid.
Using a hairpin makes it possible, in particular, to perform cycles of pairing and unpairing and thus to improve the signal/noise ratio. For the purpose of the invention, the loop can be of any length comprised between 0 and 60 nucleotides. It is believed that a loop region of at least about 4 or 5 nucleotides is needed to form a stable hairpin. However, it is also possible to perform the invention with loops of a much shorter length. Indeed, the inventors have found that in some embodiments of the invention it may be advantageous to use a hairpin whose loop consists of 0 nucleotides. In this case, the 3' end of one strand is directly and physically linked to the 5' end of the other strand. Techniques allowing the free ends of double-stranded nucleic acid to be joined together are known, and some will be described in greater detail in what follows.
By determination of the blockage, it is herein meant the determination of the physical parameters associated with the blockage. The most useful of these parameters is the position of the blockage on the double-stranded nucleic acid molecule, said position corresponding to the position of the last incorporated nucleotide in the newly synthesized single strand. Indeed, the inventors have found that the position on the stretched double-stranded nucleic acid at which the pause in renaturation occurs can be precisely determined: the use of a hairpin allows the skilled person to determine the physical distance between the two free ends of the hairpin at any time during the denaturation/renaturation process.
By 'free end' it is herein meant the end of one strand which is not covalently linked to an extremity of the other strand; as explained above, these free ends may each be bound to a different surface. For example, one of these surfaces may be movable, whilst the other may be motionless. The skilled person will thus easily realize that, in order to measure the distance between the free ends of the hairpin double-stranded nucleic acid, it is possible to simply measure the distance between the two surfaces.
This distance is maximal (zhigh, at a force Fopen which is higher than the threshold value mentioned above) when the hairpin molecule is completely denatured, since the hairpin nucleic acid is then completely extended; it is minimal (zlow, at a force Ftest which corresponds to the intermediate value discussed above) when the said hairpin molecule is completely renatured. It is advantageous to perform all length comparisons at the same force Ftest, so that the single-stranded nucleic acid has the same elastic properties. Using the delay in loop closing, the skilled user can measure zhigh(Ftest). When the replication is blocked at a specific site, the double-stranded nucleic acid molecule is blocked in a conformation where the two strands in front of the replication fork are still annealed, while the two parental strands behind the fork are separated. The distance between the two free ends when the replication process is temporarily or permanently paused can be measured: as expected, this distance z is comprised between zhigh and zlow (all z being measured with F = Ftest). It is immediately clear that the distance z varies with the localization on the hairpin molecule of the point where the replication fork is paused or blocked. If the said replication fork is paused at a sequence which is located close to the free ends of the hairpin, the distance zpause will be minimal. On the other hand, if the said replication fork is blocked at a sequence corresponding to a part of the hairpin which is close to the unpaired loop, the distance z will be maximal (Fig. 1). The pause can be observed during the polymerase-induced unwinding of a hairpin under a tension Ftest. If replication proceeds at a force Fopen (for which the hairpin is open) until it is blocked, the replication blockage can be observed upon reducing the force to Ftest, which allows the strands to rezip up to the blockage point.
It is possible to correlate precisely a physical distance on a double-stranded nucleic acid molecule with a number of bases. For example, a distance of 1 nm corresponds to the distance spanned by two nucleotides (1 bp) in a nucleic acid under a 10 pN force. The exact calibration versus force is given by the elasticity of single-stranded nucleic acid. Therefore, by simply measuring the distance between the two free ends of the double-stranded nucleic acid molecule under tension, it is possible to determine precisely where the renaturation is blocked.
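By way of illustration only, the correlation between a measured extension and a position in base pairs may be sketched as a linear interpolation between the closed and the fully open extensions, both taken at Ftest. The function name, its parameters and the assumption that the closed hairpin defines the zero of extension are hypothetical and are not part of the described method; the numerical values are those quoted in the legend of Figure 1.

```python
def blockage_position_bp(z_blocked_nm, z_open_nm, hairpin_length_bp, z_closed_nm=0.0):
    """Estimate how many base pairs are open at a blockage.

    Assumes (hypothetically) that extension grows linearly with the number
    of opened base pairs, and that all extensions are measured at F_test.
    """
    span_nm = z_open_nm - z_closed_nm
    if span_nm <= 0:
        raise ValueError("open extension must exceed closed extension")
    fraction_open = (z_blocked_nm - z_closed_nm) / span_nm
    return fraction_open * hairpin_length_bp


# Values from the legend of Figure 1: 83 bp hairpin, 73.71 nm when fully open.
for z_nm in (25.47, 35.17):
    print(f"{z_nm} nm -> {blockage_position_bp(z_nm, 73.71, 83):.2f} bp")
```

With these inputs the sketch gives positions close to the 29 bp and 40 bp expected in Figure 1; a real calibration would rely on the force-dependent elasticity of single-stranded nucleic acid rather than on a fixed ratio.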
Thus, in one embodiment, the method of the invention relates to a method for the determination of a nucleic acid sequence, said method comprising the steps of:
a) denaturing a double-stranded nucleic acid molecule corresponding to the said nucleic acid sequence;
b) hybridizing a single-stranded nucleic acid molecule ("the primer") with the said denatured double-stranded nucleic acid molecule;
c) applying a tension to the hybridized primer/double-stranded nucleic acid molecule obtained in b);
d) incubating the hybridized primer/double-stranded nucleic acid molecule obtained in step b) with a polymerase in conditions which will lead to at least one pause in replication; and
e) determining the position of the said pause in replication with respect to one end of the double-stranded nucleic acid,
wherein the distance between the two ends of the double-stranded molecule is determined when the replication process is blocked. Preferentially, the distance between the two ends of the said molecule is determined when the molecule is completely denatured. Even more preferentially, the two distances are compared and the position of the blockage is determined.
As used herein, "polymerase" refers to an enzyme that catalyzes the polymerization of nucleotides (i.e., the polymerase activity). Generally, the enzyme will initiate synthesis at the 3' end of the primer annealed to a polynucleotide template sequence, and will proceed toward the 5' end of the template strand. "DNA polymerase" catalyzes the polymerization of deoxynucleotides, while "RNA polymerase" catalyzes the polymerization of ribonucleotides. The polymerase according to the invention is either a processive polymerase or a non-processive polymerase. A processive enzyme catalyzes multiple rounds of a reaction on a denatured double-stranded nucleic acid template, while the enzyme stays bound to the said template. As understood herein, a polymerase will be processive, i.e. will stay bound to the denatured double-stranded nucleic acid template, for at least 25 nucleotides, at least 50 nucleotides, at least 100 nucleotides, usually at least 500 nucleotides, and may be processive for at least 1000 nucleotides or more. Polymerases according to the invention include RNA-dependent RNA polymerases, DNA-dependent RNA polymerases, DNA-dependent DNA polymerases, RNA-dependent DNA polymerases (reverse transcriptases) and the like. Many such enzymes are known in the art. According to the invention, the said polymerase is capable of synthesizing nucleic acids when the force applied to the double-stranded nucleic acid template is at an intermediate value, i.e. comprised between 10 and 12 pN, or at high forces for which the hairpin is completely unfolded.
Preferably, a polymerase with a 3'-5' exonuclease activity is used in the method of the invention. As used herein, "3'-5' exonuclease activity" refers to the capability of an enzyme to remove incorporated nucleotides from the 3' end of a DNA polymer. Examples of such enzymes include e.g. T4 DNA Polymerase, T7 DNA Polymerase, DEEP VENT DNA polymerase, E. coli polymerase III, Phi29 DNA Polymerase, E. coli DNA Polymerase I, the Klenow Fragment of E. coli DNA Polymerase I, Phusion® High Fidelity DNA Polymerase, Phusion® Hot Start High Fidelity DNA Polymerase, Phire® Hot Start DNA Polymerase, 9°Nm DNA Polymerase, Herpes Simplex Virus Type 1 DNA Polymerase. In a preferred embodiment, the polymerase of the invention can be switched from a polymerase-active mode to a 3'-5' exonuclease-active mode by decreasing the force applied to the double-stranded nucleic acid molecule under a minimal value. Preferably, the said minimal value is 7 pN; more preferably, the said minimal value is 6 pN; even more preferably, the said minimal value is 5 pN.
Even more preferably, the said polymerase has, in addition, a strand displacement activity under an intermediate tension, e.g. when a force between 10 and 12 pN is applied to the double-stranded hairpin. By "strand displacement", it is herein meant the ability for the polymerase to displace the downstream nucleic acid during synthesis. The inventors have found in particular that the T4 DNA Polymerase and the T7 DNA polymerase, which are not known to have any strand displacement activity in test tube conditions, i.e. in conditions where no tension is applied to the double-stranded template, are able to remove the downstream nucleic acid during polymerisation when the double-stranded hairpin is under a force > 10 pN. The T4 DNA Polymerase and T7 DNA Polymerase are thus particularly suited for carrying out the method of the invention. "T4 DNA Polymerase" and "T7 DNA Polymerase" herein refer to both the monomeric enzyme and the holoenzyme.
The method according to the invention comprises a replication step which is carried out under conditions which will lead to at least one pause in the replication process. Preferably, the double-stranded nucleic acid molecule is submitted to a tension during the replication step. More preferably, the said tension is around an intermediate value, i.e. the said tension is comprised between 10 and 12 pN. The said pause can be caused by any of the means known to the person skilled in the art. Sequencing-by-synthesis methods wherein the synthesis of the new strand is blocked have been widely used in the art. Any such method can be adapted for the purpose of the present invention. For example, the polymerase may be a nucleotide-sensitive, processive enzyme, which is then contacted with the denatured double-stranded nucleic acid template in a reaction mix which is rate-altering for the processive movement of the enzyme for a specified nucleotide. For example, the said reaction mix comprises a pool of deoxynucleotides (dNTPs) where one of the bases is present at a very low concentration. In that case, each time the polymerase encounters the complement of the said nucleotide, it pauses until the low concentration nucleotide diffuses into position. The positions of the pauses along the molecule thus reveal the positions of the scarce nucleotide in the synthesized strand and allow the skilled person to identify the position of the corresponding base in the sequence (Greenleaf and Block, Science, 313 : 801, 200; U.S. Patent No. 7,556,922).
Alternatively, it is possible to use dideoxynucleotides (ddNTPs) in addition to the normal deoxynucleotides (dNTPs) found in DNA. Dideoxynucleotides are essentially the same as nucleotides except that they contain a hydrogen on the 3' carbon instead of a hydroxyl group (OH). These modified nucleotides, when integrated into a sequence, prevent the addition of further nucleotides. This occurs because a phosphodiester bond cannot form between the dideoxynucleotide and the next incoming nucleotide, and thus the DNA chain is terminated. Therefore, incorporation of one ddNTP will cause the polymerase reaction to stop, since no nucleotide can be added after the said ddNTP. The position of the blockage along the molecule thus reveals the position of incorporation of the ddNTP in the synthesized strand and allows the skilled person to identify the position of the corresponding base in the sequence. The position of each pause or blockage can then be determined by the method of the invention, i.e. by measuring the physical distance between the two free ends of the molecule. It is also possible to use nucleoside triphosphates (NTPs) instead of ddNTPs. The difference is that incorporation of one NTP will only transiently pause the polymerase, generating a pattern similar to that of the first example mentioned above.
The method of the invention may be used for direct sequencing of an unknown nucleic acid. In a preferred embodiment, a processive enzyme is used to synthesize from a known single-stranded nucleic acid (a primer) a sequence of increasing extension complementary to one of the hairpin strands, thereby effectively unwinding the double-stranded hairpin maintained under a moderate tension (e.g. in the range of 5 to 13 pN). In this embodiment, polymerisation is initiated by opening the double-stranded hairpin by transiently increasing the force to Fopen in the presence of a single-stranded primer. Fopen is a tension higher than the threshold value required for completely opening the double-stranded hairpin. This results in the hybridization of the primer to the double-stranded hairpin. In a next step, the force is set to Felongation (≤ Fopen) in order to enable the polymerase to synthesize a new strand (in a strand displacement mode or not). Felongation is preferably set at an intermediate tension lower than Fopen (the threshold value required for complete opening of the double-stranded hairpin), said intermediate tension allowing for replication by the polymerase. Preferably, the said intermediate value is comprised between 10 and 12 pN. Under a tension equal to Felongation, the polymerase synthesizes a new strand at a sustained rate until a pause or a blockage occurs. The enzyme activity thus leads to the production of an extended complementary single-stranded nucleic acid molecule.
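As a purely illustrative sketch of the rare-nucleotide approach, the positions at which a polymerase would be expected to pause can be predicted for a known template by locating every template base whose complement is the scarce dNTP. The function name and the convention of writing the template in the 3' to 5' direction (the order in which the polymerase reads it) are assumptions made for this example only.

```python
# Watson-Crick pairing for DNA; the new strand carries the complement
# of each template base.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def expected_pause_positions(template_3to5, scarce_base):
    """Return 1-based positions in the new strand where the scarce dNTP
    has to be incorporated, i.e. where a pause is expected.

    `template_3to5` is the template written in the order the polymerase
    reads it (hypothetical convention for this sketch).
    """
    scarce_base = scarce_base.upper()
    if scarce_base not in COMPLEMENT:
        raise ValueError("scarce_base must be one of A, C, G, T")
    positions = []
    for index, template_base in enumerate(template_3to5.upper(), start=1):
        if COMPLEMENT[template_base] == scarce_base:
            positions.append(index)
    return positions


# With dATP scarce, pauses are expected wherever the template carries a T.
print(expected_pause_positions("TACGGTA", "A"))
```

In an actual experiment the relation is used in the opposite direction: the pause positions are measured and the positions of the scarce base are inferred from them.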
In the strand displacement configuration, the said polymerase can be switched between its two modes of operation, i.e. exonuclease activity and elongation, by adjusting the force applied on the hairpin.
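The force-dependent switching described above may be summarised, as a rough sketch only, by a function mapping an applied force to the activity expected of the polymerase. The default thresholds are taken from the preferred values given in the description (exonuclease below 5 pN, elongation between 10 and 12 pN, hairpin fully open above about 16 pN); the function, its mode labels and the treatment of forces falling between these ranges as unspecified are hypothetical.

```python
def polymerase_mode(force_pn, exo_below_pn=5.0, elongation_range_pn=(10.0, 12.0),
                    open_above_pn=16.0):
    """Map an applied force (pN) to the expected polymerase activity.

    Thresholds are illustrative defaults; the description notes that they
    depend on temperature, nucleotide type and buffer.
    """
    if force_pn < 0:
        raise ValueError("force must be non-negative")
    low, high = elongation_range_pn
    if force_pn < exo_below_pn:
        return "exonuclease"
    if low <= force_pn <= high:
        return "elongation"  # strand displacement on the closed hairpin
    if force_pn > open_above_pn:
        return "elongation_open_hairpin"  # synthesis on the unzipped template
    return "unspecified"  # no behaviour stated for this force in the text


for force in (3.0, 8.0, 11.0, 18.0):
    print(force, polymerase_mode(force))
```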
In a preferred embodiment, the elongation process is stopped before the polymerase reaches the loop. This can be achieved by various means. For example, the insertion of non-conventional bases a few nucleotides ahead of the loop will stop the polymerase. The same effect can be achieved with a double-stranded binding domain of a protein located just before the loop.
In a further preferred embodiment, the force (Fexo) applied to the hairpin molecule is decreased under a minimal value, e.g. 5 pN, after the elongation process is blocked. This allows the polymerase to switch to its exonuclease activity and disassemble the strand which has just been synthesized. This process stops when the whole strand is completely disassembled. Alternatively, the disassembly process is stopped when the enzyme is stalled, e.g. when encountering a roadblock, such as a modified nucleotide in the primer. In such a case, it may be necessary to use an enzyme to eject the newly-synthesized strand. As examples of suitable enzymes, one may cite e.g. helicases, including a UvrD helicase, a RecBCD helicase, E. coli UvrD helicase, Tte-UvrD helicase, T7 Gp4 helicase, RecBCD helicase, DnaB helicase, MCM helicase, Rep helicase, RecQ helicase, PcrA helicase, T4 UvsW helicase, SV40 large T antigen helicase, Herpes virus helicase, yeast Sgs1 helicase, DEAH ATP-dependent helicases and Papillomavirus helicase E1 protein and homologs thereof, and exonucleases, including snake venom phosphodiesterase, spleen phosphodiesterase, Bal-31 nuclease, E. coli exonuclease I, E. coli exonuclease VII, Mung Bean Nuclease, S1 Nuclease, an exonuclease activity of E. coli DNA polymerase I, an exonuclease activity of a Klenow fragment of DNA polymerase I, an exonuclease activity of T4 DNA polymerase, an exonuclease activity of T7 DNA polymerase, an exonuclease activity of Taq DNA polymerase, an exonuclease activity of DEEP VENT DNA polymerase, E. coli exonuclease III, λ exonuclease and an exonuclease activity of VentR DNA polymerase.
Disassembly of the strand which has just been synthesized provides the opportunity to repeat the whole process, i.e. the synthesis of a strand under a tension above a threshold, e.g. 10 pN, with the polymerase pausing e.g. every time it encounters the complement of the rare nucleotide or stopping if it incorporates a ddNTP or an NTP. If the primer has been expelled during the disassembly step, synthesis will be preceded by a step of opening the hairpin and closing it back so that a primer can hybridize. Increasing the force above 10 pN will switch the polymerase back to the elongation mode and a new pausing pattern may be recorded. Repeating the synthesis/disassembly cycle thus makes it possible to record several pausing patterns with the same rare nucleotide. This leads to an improved signal/noise ratio, making it possible to obtain a sequence of higher quality with fewer errors. Alternatively, the replication step can be conducted in the presence of ddNTP at a high force where the hairpin is completely open. In that case, blockages resulting from ddNTP incorporation can be detected upon lowering the force to Ftest.
Once sufficient statistics have been recorded the procedure is repeated with a shortage of another nucleotide or another ddNTP. After the said procedure has been repeated with each nucleotide, the positions of all the nucleotides in the strand are compiled together, thus yielding the complete sequence of the original double-stranded nucleic acid molecule.
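The compilation step may be pictured, as a minimal sketch under stated assumptions, by merging the pause positions recorded in each of the four runs into a single string. The function name, the dictionary layout and the use of "N" for positions at which no pause was recorded are hypothetical; the result is the sequence of the synthesized strand, from which the template sequence follows by complementarity.

```python
def compile_sequence(pause_positions, length):
    """Merge per-nucleotide pause positions (1-based) into one sequence.

    `pause_positions` maps each base to the positions where a pause or
    blockage was observed when that base was scarce (or a ddNTP).
    Positions never observed are reported as "N"; conflicting calls raise.
    """
    calls = [None] * length
    for base, positions in pause_positions.items():
        for position in positions:
            if not 1 <= position <= length:
                raise ValueError(f"position {position} outside 1..{length}")
            previous = calls[position - 1]
            if previous is not None and previous != base:
                raise ValueError(f"conflicting calls at position {position}")
            calls[position - 1] = base
    return "".join(base if base is not None else "N" for base in calls)


runs = {"A": [1, 6], "T": [2, 7], "G": [3], "C": [4, 5]}
print(compile_sequence(runs, 7))
```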
Implementation of the method of the invention has been made possible, in particular, by the existence of devices designed for probing real-time nucleic acid interaction at the single-molecule level. Such a device is described for example in U.S. Patents Nos. 7,052,650 and 7,244,391. The apparatus described therein uses magnetic traps to apply a picoNewton scale force on a micron-sized super-paramagnetic bead. Briefly, the said apparatus comprises an optical microscope, magnets and a PC. The double-stranded nucleic acid molecules are anchored at multiple points at one end to a motionless element, e.g. a surface, and at the other end to a movable surface, in this case a magnetic bead. Magnets are provided for acting on the bead. In particular, the magnets may be used for pulling the bead away from the surface. However, the implementation of the method of the invention is not restricted to the above apparatus. Any device which allows one to fully extend and then refold a molecule of double stranded nucleic acid, whilst monitoring at the same time the extension of the said molecule can be used to implement the method of the invention. For example, optical tweezers may be used; they require however prior force calibration and are not easily parallelized for high throughput measurements. Further drawbacks are the lack of total torsional control of the nucleic acid and the possible local heating of the solution by the focussed laser which may alter the hybridization conditions.
The double stranded nucleic acid is incubated for a few minutes in a solution of adequate beads (for example streptavidin coated ones) to which it binds by one of its labeled (for example biotin) ends. The beads can be transparent if optical tweezers are later used for manipulation or magnetic if one uses magnetic traps or tweezers for manipulation.
The bead-nucleic acid assembly is injected in a fluidic chamber the surface of which has been treated such as to bind the other labeled end of the molecule (for example a surface coated with anti-Dig to bind the Dig-labeled end of the nucleic acid). The beads are thus anchored to the surface via a nucleic acid hairpin, see Fig. 1a. The distance of the bead to the surface is then monitored by various means known to the person skilled in the art: for example the diffraction rings of their image on a camera can be used to deduce their distance, or the light intensity they scatter (or emit by fluorescence) when illuminated in an evanescent mode can be used to measure their distance. Alternatively, the magnetic field they generate can be measured (using a magnetic sensor such as GMR or Hall sensors) to deduce their distance to a sensor on the anchoring surface.
To pull on the nucleic acid molecule anchoring the beads to the surface various techniques have been described. One can use the light of a focused laser beam to trap a transparent bead near the focal point. By the relative translation of the beam with respect to the anchoring surface one can apply a force on the tethering molecule (a typical optical tweezers assay). The exerted force being proportional to the displacement of the bead from its equilibrium position, to exert a constant force on the tethering molecule requires a feedback loop on the trapping beam.
To exert a constant force on a bead, the use of the hydrodynamic drag generated by a flow around the bead has been described, but it usually yields a low spatial accuracy (> 100 nm). The preferred embodiment uses a magnetic trap to pull on superparamagnetic beads anchored to a surface by a nucleic acid hairpin as described above. In this configuration, small magnets placed above the sample are used to apply a constant force on the anchored bead, whose position can be determined with < 1 nm accuracy (depending on the pulling force and the dissipation due to hydrodynamic drag). In every case one notices that the tethering hairpin can be mechanically fully unzipped by pulling on the beads with a force larger than about 16 pN. Reducing the tension on the molecule to below about 11 pN allows the hairpin to re-zip spontaneously (the unzipping transition is reversible though hysteretic). If, during the unzipped phase, some molecules in solution (such as proteins or complementary oligonucleotides of DNA, RNA, LNA or PNA) have bound to the stretched single-stranded nucleic acid, these molecules will block the rezipping of the hairpin when the force is lowered to below 11 pN. The principle of the assay is thus to switch between two forces: a large one, Fopen, to open the hairpin and a smaller one, Ftest, used to allow re-zipping and to measure the extension of the molecule at transient blockages. The blocking position is related to the sequence by a linear relation between full extension and the blocked one. For best accuracy, the full extension is preferably measured at the test force Ftest. This is achieved by designing the hairpin loop such that it requires a fraction of a second to refold once the force is reduced from Fopen to Ftest.
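As an illustration of how extension readings taken at Ftest might be interpreted in such an assay, the following sketch labels each reading as closed, blocked or open by comparison with the closed and fully open extensions. The function name, the labels and the 2 nm tolerance are assumptions chosen for the example and are not values given in the description.

```python
def classify_extension(z_nm, z_low_nm, z_high_nm, tolerance_nm=2.0):
    """Label one extension reading taken at F_test.

    "closed"  : hairpin fully re-zipped (z near z_low)
    "open"    : hairpin still fully unzipped (z near z_high)
    "blocked" : re-zipping halted at an intermediate extension
    The tolerance is a hypothetical noise margin.
    """
    if z_high_nm <= z_low_nm:
        raise ValueError("z_high_nm must exceed z_low_nm")
    if z_nm <= z_low_nm + tolerance_nm:
        return "closed"
    if z_nm >= z_high_nm - tolerance_nm:
        return "open"
    return "blocked"


# Readings loosely modelled on the plateaus described for Figure 1.
readings_nm = [73.0, 35.2, 25.5, 0.4]
print([classify_extension(z, 0.0, 73.71) for z in readings_nm])
```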
In order to attach nucleic acids to surfaces or supports, use may be made of any one of the techniques known in the field. Essentially, the nucleic acid becomes anchored directly to the support, for example the micro-bead, which involves a functionalization of this surface, for example by coating it with streptavidin, a COOH group, and the like, capable of reacting with the functionalized end of the nucleic acid.
Such methods necessitate, in general, functionalizing the nucleic acid, especially the 3' and 5' ends, that is to say grafting appropriate chemical groups onto them. It is, moreover, preferable to join the other two free ends of the molecule by a loop in order to prevent the strands from dissociating at the end of the operation, so that the latter can be repeated if appropriate. For this purpose, different procedures may be adopted.
The simplest is to functionalize, using synthetic oligonucleotides, one of the ends of a double-stranded nucleic acid with two different functions (biotin and amine, for example), which permit anchoring to two different pre-treated surfaces. The two strands at the other end may be joined using a partially paired synthetic nucleotide in the form of a loop. In this way, a paired, single-stranded nucleic acid, i.e. a hairpin, is produced from a double-stranded nucleic acid. The advantage of this method lies in its capacity to functionalize a heterogeneous population of large nucleic acid fragments (as are obtained by fractionation of a gene or chromosome), which can then be analyzed simultaneously. In this case, the nucleic acid sample is fractionated using two (or more) restriction enzymes, which enables a subpopulation to be obtained with two different restriction sites at its ends which are similar over all the fragments. This enables the two ends to be treated differently (for example by joining one end to an oligonucleotide in the form of a loop possessing the appropriate restriction site at its end). The drawback of this method lies in the steric interference between the two adjacent functional groups, which can make coupling to the surfaces difficult. To solve this problem, it can be advantageous to add at each free end of the hairpin molecule a "spacer" sequence of bases, to the end of which a functional group is then added; the two spacer sequences are non-complementary, affording each functional group enough space to bind to its dedicated surface. More advantageously, the sequence of each spacer sequence is designed in order to use single-stranded sequencing primers of known sequence in the sequencing method of the invention. The addition of a loop and/or spacers to the double-stranded nucleic acid molecules can be performed with any of the methods commonly used in molecular biology. 
These methods are well known to the person skilled in the art and there is thus no need to detail them here.
As regards the actual anchoring techniques, there are many of these and they derive from the techniques for anchoring macromolecules (proteins, DNA, and the like) to commercially available pretreated surfaces. Most of these techniques have been developed for immunology tests, and link proteins (immunoglobulins) to surfaces carrying groups (-COOH, -NH2, -OH, and the like) capable of reacting with the carboxyl (-COOH) or amine (-NH2) ends of proteins. The covalent anchoring of nucleic acid may be accomplished directly, via the free phosphate of the 5' end of the molecule, which reacts with a secondary amine (Covalink-NH surface marketed by Polylabo at Strasbourg) to form a covalent bond. It is also possible to functionalize DNA with an amine group and then to proceed as with a protein.
There are also surfaces coated with streptavidin (Dynal beads, and the like), which permit quasi-covalent anchoring between the streptavidin and a biotinylated DNA molecule. Lastly, by grafting an antibody directed against digoxigenin onto a surface (by the methods mentioned above), a nucleic acid functionalized with digoxigenin may be anchored thereto. This represents merely a sample of the many possible anchoring techniques.
Among the attachment and anchoring techniques, there should also be mentioned, for example, the techniques described in Patent EP 152 886 using an enzymatic coupling for the attachment of DNA to a solid support such as cellulose.
Patent EP 146 815 also describes various methods of attachment of DNA to a support. Similarly, patent application WO 92/16659 proposes a method using a polymer to attach DNA.
Naturally, the nucleic acid may be attached directly to the support but, where necessary, especially with a view to limiting the influence of the surfaces, the nucleic acid may be attached at the end of an inert arm of peptide or other nature, as is, for example, described in Patent EP 329 198.
The practice of the invention employs, unless other otherwise indicated, conventional techniques or protein chemistry, molecular virology, microbiology, recombinant DNA technology, and pharmacology, which are within the skill of the art. Such techniques are explained fully in the literature. (See Ausubel et al., Current Protocols in Molecular Biology, Eds., John Wiley & Sons, Inc. New York, 1995; Remington's Pharmaceutical Sciences, 17th ed., Mack Publishing Co., Easton, Pa., 1985; and Sambrook et al, Molecular cloning: A laboratory manual 2nd edition, Cold Spring Harbor Laboratory Press - Cold Spring Harbor, NY, USA, 1989).
The examples below will enable other features and advantages of the present invention to be brought out.
Legends of the Figures
Figure 1: Principle of detection of the hybridization of oligonucleotides to their complementary sequence on a hairpin DNA. The hairpin DNA anchoring the bead to the surface (a) is momentarily unzipped by increasing the force pulling on the bead to a value above 16 pN. In that phase the complementary fragment in solution hybridizes to its target on the opened DNA hairpin, thus preventing the rezipping of the hairpin (b) when the force is reduced back to its initial value. The hairpin refolding presents four plateaus occurring at well-defined extensions but with variable duration. The top plateau at 73.71 nm is associated with the 83 bp fully opened hairpin at Ftest, while the bottom one corresponds to the hairpin completely refolded. The two intermediate plateaus at 25.47 nm and 35.17 nm occur because two oligos have been placed in the solution. From these changes in extension (z_high - z) it is possible to deduce where along the hairpin the complementary sequence has paired. Here, according to their positions, the blocks coincide with locations 28.66 bp and 39.60 bp, in very good agreement with their expected positions at 29 bp and 40 bp. The plateau positions are better estimated by fitting Gaussians to the histogram obtained from several opening/closing cycles (here ~20 cycles).
Figure 2: Illustration of single molecule Sanger sequencing. Real-time record of synthesis on a 1.2 kbp single molecule hairpin with T4 DNA polymerase (dTTP = dGTP = dCTP = 500 μM, dATP = 5 μM, ddATP = 400 μM, clamp and clamp loader) in a commercial buffer (5X Reaction Buffer: 335 mM Tris-HCl (pH 8.8 at 25°C), 33 mM MgCl2, 5 mM DTT, 84 mM (NH4)2SO4, Fermentas; http://www.fermentas.com/en/products/all/modifying-enzymes/mesophilic-polymerases/ep006-t4-dna-polymerase). At high force, the peak of the curve shows the presence of the hairpin sample. At middle force, the rising curve shows the processive synthesis of a new complementary strand by the T4 DNA polymerase, while the plateau represents the pause occurring when one ddNTP is incorporated. At low force, the falling edge of the curve shows the exonuclease activity removing the strand just synthesized. These synthesis and exonuclease phases may be repeated in cycles.
Figure 3: Illustration of sequencing based on unbalanced dNTP concentration. Real-time record of synthesis on a 1.2 kbp single molecule hairpin with T4 DNA polymerase (dTTP = dGTP = dCTP = 33 μM, dATP = 20 nM, clamp and clamp loader) in commercial buffer (5X Reaction Buffer: 335 mM Tris-HCl (pH 8.8 at 25°C), 33 mM MgCl2, 5 mM DTT, 84 mM (NH4)2SO4, Fermentas; http://www.fermentas.com/en/products/all/modifying-enzymes/mesophilic-polymerases/ep006-t4-dna-polymerase). At high force (~21 pN), the peak of the curve shows the presence of the hairpin sample and could be used to hybridize a primer if needed. At middle force (~11.7 pN), the rising edge of the curve shows the processive synthesis of a new complementary strand by the T4 DNA polymerase, while the transient pauses correspond to the positions where a dATP is needed. At low force (~1.6 pN), the falling edge shows the cleavage of the newly synthesized strand, due to the activation of the exonuclease activity of the T4 DNA polymerase. This synthesis and cleavage cycle is repeatable.
Figure 4: Sequencing based on unbalanced dNTP concentration, detected on a hairpin. Fig. 4A: Real-time record of synthesis on an 83 bp single molecule hairpin with T4 DNA polymerase (dTTP = dGTP = dCTP = 33 μM, dATP = 20 nM, clamp and clamp loader) in commercial buffer (5X Reaction Buffer: 335 mM Tris-HCl (pH 8.8 at 25°C), 33 mM MgCl2, 5 mM DTT, 84 mM (NH4)2SO4, Fermentas; http://www.fermentas.com/en/products/all/modifying-enzymes/mesophilic-polymerases/ep006-t4-dna-polymerase). At high force (~21 pN), the peak of the curve shows the presence of the hairpin sample and could be used to hybridize a primer if needed. At middle force (~11.7 pN), the rising edge of the curve shows the processive synthesis of a new complementary strand by the T4 DNA polymerase, while the transient pauses correspond to the positions where a dATP is needed. At low force (~1.6 pN), the falling edge shows the cleavage of the newly synthesized strand, due to the activation of the exonuclease activity of the T4 DNA polymerase. This synthesis and cleavage cycle is repeatable. The position histogram of these transient pauses is deduced in the same way as described for Fig. 1, and base calls are indicated (arrows). The true sequence of the template is shown on the right side of the figure. Fig. 4B: There are two ways to measure the change in extension. First, Zzip can be used as the reference; alternatively, the position of the hairpin loop can be used as the reference.
Figure 5: Illustration of hairpin design. The DNA fragment of interest is ligated with a DNA loop (with abasic and LNA bases) and two partially complementary DNA fragments (one ssDNA with biotin at its end, one dsDNA with Dig at its extremity), which eventually forms a hairpin that can be bound at one extremity to a super-paramagnetic bead covered with streptavidin (DYNAL) and at the other extremity to a glass coverslip treated with anti-Dig.
Experimental Examples
DNA preparation
A double-stranded (ds)DNA fragment of unknown sequence, with a size of between a few tens and a few thousand base pairs, is ligated at one of its extremities to a DNA loop. Its other extremity is ligated to a dsDNA fragment allowing for the binding of its two strands to differently coated surfaces. For example, the free 3' end of one strand can be labelled with biotin, allowing binding to streptavidin-coated beads, whereas the 5' end on the opposite strand can be labelled with digoxigenin, allowing its binding to surfaces coated with an anti-Dig antibody. This end-labelling can be done in various ways known to the person skilled in the art, such as the use of terminal transferase to add biotin (or Dig) modified nucleotides, or hybridization with suitably labelled oligonucleotides.
Force stretching apparatus
This DNA construct is incubated for a few minutes in a solution of adequate beads (for example streptavidin coated ones) to which it binds by one of its labelled (for example biotin) ends. The beads can be transparent if optical tweezers are later used for manipulation or magnetic if one uses magnetic traps or tweezers for manipulation.
The bead-DNA assembly is injected into a fluidic chamber, the surface of which has been treated so as to bind the other labelled end of the molecule (for example a surface coated with anti-Dig to bind the Dig-labelled end of the DNA). The beads are thus anchored to the surface via a DNA hairpin, see Fig. 1a. The distance of the bead to the surface is then monitored by various means known to the person skilled in the art: for example, the diffraction rings of the bead's image on a camera can be used to deduce its distance, or the light intensity the beads scatter (or emit by fluorescence) when illuminated in an evanescent mode can be used to measure their distance. Alternatively, the magnetic field they generate can be measured (using a magnetic sensor such as GMR or Hall sensors) to deduce their distance to a sensor on the anchoring surface.
To pull on the DNA molecule anchoring the beads to the surface, various techniques have been described. The preferred embodiment uses a magnetic trap to pull on super-paramagnetic beads anchored to a surface by a DNA hairpin as described above. In this configuration, small magnets placed above the sample are used to apply a constant force on the anchored bead, whose position can be determined with < 1 nm accuracy (depending on the pulling force and the dissipation due to hydrodynamic drag). In this series of experiments, the apparatus described in U.S. Patents No. 7,052,650 and 7,244,391 was used. In addition, unless otherwise indicated, the experiments reported here were performed in 25 mM Tris pH 7.5, 150 mM KAc, 10 mM MgCl2, 0.2 % BSA. In every case, the tethering hairpin can be mechanically fully unzipped by pulling on the beads with a force larger than about 16 pN. Reducing the tension on the molecule to below about 11 pN allows the hairpin to re-zip spontaneously (the unzipping transition is reversible though hysteretic). If, during the unzipped phase, a molecule in solution (such as a protein or complementary oligonucleotides of DNA, RNA, LNA or PNA) has bound to the stretched single-stranded (ss)DNA, this molecule will transiently block the rezipping of the hairpin when the force is lowered to below 11 pN. The principle of the assay is to switch between two forces: a large one, Fopen, to open the hairpin and a smaller one, Ftest, used to allow re-zipping and to measure the extension of the molecule at transient blockages. The blocking position is related to the sequence by a linear relation between the full extension and the blocked one. For best accuracy, the full extension is preferably measured at the test force Ftest. This is achieved by designing the hairpin loop such that it requires a fraction of a second to refold once the force is reduced from Fopen to Ftest.
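The linear relation between the blocked extension and the full extension can be illustrated with the values quoted in the legend of Figure 1. The Python sketch below is only an illustration under stated assumptions: the function name `blockage_position_bp` is hypothetical, and it assumes that plateau extensions are measured from the fully refolded hairpin at the same force Ftest, so that the extension is proportional to the number of opened base pairs.

```python
def blockage_position_bp(z_block_nm, z_full_nm, hairpin_bp):
    """Estimate the blockage position (in bp) by linear interpolation.

    Assumption: the extension measured from the refolded state at Ftest
    is proportional to the number of unzipped base pairs.
    """
    if z_full_nm <= 0:
        raise ValueError("full extension must be positive")
    return hairpin_bp * z_block_nm / z_full_nm


# Values taken from the legend of Figure 1: 83 bp hairpin, 73.71 nm when fully open.
Z_FULL_NM = 73.71
HAIRPIN_BP = 83

# Intermediate plateaus reported in Figure 1.
positions = [blockage_position_bp(z, Z_FULL_NM, HAIRPIN_BP) for z in (25.47, 35.17)]

# Round to the nearest base pair to obtain the called position.
nearest_bp = [round(p) for p in positions]
```

With these inputs the two plateaus map to about 28.7 bp and 39.6 bp, which round to the expected positions of 29 bp and 40 bp. A different reference (for example the position of the hairpin loop) would require a different offset.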
The hybridization position of an oligo-nucleotide can be measured with a basepair resolution
By measuring the extension of the DNA molecule (the distance of the bead to the surface) during one of these rezipping pauses, it is possible to determine the position of the blockage with a nanometer precision (1 nm corresponds to the distance spanned by two nucleotides (1 bp) in a ssDNA under a 10 pN force). The unzipping configuration displays the largest ratio of extension to basepair (in dsDNA the ratio is only 0.34 nm per bp).
The accuracy of this measurement is limited by two noise contributions:
• The accuracy of the measuring method,
• The Brownian motion of the bead.
Different techniques can be used to measure the vertical position of the bead. One of the simplest relies on video microscopy (U.S. Patents Nos. 7,052,650 and 7,244,391). The results in Fig. 1 were obtained with this method; the typical resolution reaches 1 nm for a 1 second averaging. Other methods with better resolution have been demonstrated, such as laser illumination with PSD sensors, which reaches 0.1 nm in resolution (Greenleaf and Block, Science, 313: 801, 2006), and evanescent wave illumination (Singh-Zocchi et al., Proc Natl Acad Sci U S A., 100(13): 7605-7610, 2003; Liu et al., Biophys J, 96(9): 3810-3821, 2009).
The intrinsic limitation in resolution is given by the Brownian fluctuations of the bead pulling on a ssDNA molecule: $\langle x^2 \rangle = 4 k_B T \, \Delta f \, (6\pi\eta r) / k_{ssDNA}^2(F)$, where $k_{ssDNA}(F)$ is the stiffness of a ssDNA molecule, $k_B$ is the Boltzmann constant, $T$ the absolute temperature, $\eta$ the viscosity of water, $r$ the bead's radius and $\Delta f$ the frequency range of the measurement. $k_{ssDNA}(F = 10\ \mathrm{pN}) = 0.05/N_b$ (N/m), where $N_b$ is the number of bases of the ssDNA. For the 84 bp hairpin this leads to 0.04 nm of noise over 1 second ($\Delta f = 1$ Hz) averaging. The larger noise in Fig. 1 (σ ~ 1 nm) is essentially due to the measuring device, not the intrinsic fluctuations. The intrinsic Brownian noise increases with the size of the hairpin: a 1200 bp hairpin leads to a noise of 0.6 nm when averaging over 1 second.
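The noise estimate above can be checked numerically. The following Python sketch evaluates the Brownian noise formula; it is not taken from the original text, and several inputs are assumptions chosen for illustration: a bead radius of 0.5 μm, a water viscosity of 1 mPa·s, a temperature of 298.15 K, and a number of bases Nb equal to twice the number of base pairs of the hairpin (both strands exposed when the hairpin is open). The function name `brownian_noise_nm` is hypothetical.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant, J/K


def brownian_noise_nm(hairpin_bp, bandwidth_hz=1.0, temperature_k=298.15,
                      viscosity_pa_s=1.0e-3, bead_radius_m=0.5e-6):
    """RMS Brownian noise (nm) of a bead held by an unzipped hairpin at ~10 pN.

    Implements <x^2> = 4 kB T df (6 pi eta r) / k_ssDNA^2 with
    k_ssDNA = 0.05 / Nb (N/m). Default parameters are assumptions.
    """
    n_bases = 2 * hairpin_bp              # assumption: both strands are stretched
    stiffness = 0.05 / n_bases            # ssDNA stiffness at F = 10 pN, in N/m
    drag = 6 * math.pi * viscosity_pa_s * bead_radius_m  # Stokes drag, N*s/m
    variance_m2 = 4 * K_B * temperature_k * bandwidth_hz * drag / stiffness ** 2
    return math.sqrt(variance_m2) * 1e9   # convert m to nm


noise_84 = brownian_noise_nm(84)      # short hairpin
noise_1200 = brownian_noise_nm(1200)  # long hairpin
```

Under these assumptions the sketch gives roughly 0.04 nm for the 84 bp hairpin and roughly 0.6 nm for the 1200 bp hairpin, consistent with the values quoted above. Because the stiffness scales as 1/Nb, the noise grows linearly with hairpin length.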
Diagnostics and sequencing by mechanical detection of polymerization.
We have shown that the T4 DNA polymerase can replicate a DNA hairpin when the force is high enough to sufficiently destabilize the fork (Ftest). As in classical Sanger sequencing, the incorporation of a specific ddNTP will prevent further elongation of the nascent strand by the T4 DNA polymerase. In our method, this blockage can be easily identified, as shown in Fig. 6. Upon lowering the force, the exonuclease activity of the T4 DNA polymerase is activated and the enzyme excises the newly synthesized strand. Thus, by repeated cycles of synthesis and cleavage, the molecule (hairpin) can be sequenced by identifying the blockage positions first in the presence of ddATP, and then of each of the other ddNTPs, i.e. ddTTP, ddCTP, and ddGTP.
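The way blockage positions from the four ddNTP runs combine into a read can be sketched as follows. This Python example is purely illustrative: the function names, the input format (a mapping from incorporated base to integer positions along the nascent strand, already converted from extension to nucleotides) and the example data are all hypothetical and are not measurements from the experiments described here.

```python
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C", "N": "N"}


def assemble_nascent_strand(blockages_by_ddntp):
    """Merge blockage positions from the four ddNTP runs into one read.

    Keys are the terminating base (A, C, G or T); values are 1-based
    positions of the blockages along the newly synthesized strand.
    Positions with no blockage in any run are reported as 'N'.
    """
    calls = {}
    for base, positions in blockages_by_ddntp.items():
        for pos in positions:
            if pos in calls and calls[pos] != base:
                raise ValueError(f"conflicting calls at position {pos}")
            calls[pos] = base
    if not calls:
        return ""
    return "".join(calls.get(i, "N") for i in range(1, max(calls) + 1))


def template_from_nascent(nascent):
    """Complement each base, keeping the same positional order (no reversal)."""
    return "".join(COMPLEMENT[b] for b in nascent)


# Hypothetical blockage positions, one list per ddNTP run.
example = {"A": [1, 4], "C": [2], "G": [3, 6], "T": [5]}
nascent = assemble_nascent_strand(example)
template = template_from_nascent(nascent)
```

Raising an error on conflicting calls is a design choice of this sketch; in practice, ambiguous positions could instead be resolved by accumulating more synthesis and cleavage cycles.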
Similarly, the double-stranded hairpin molecule can be sequenced in a buffer comprising a deficit of one of the four dNTPs compared to the others, i.e. this dNTP is present at a very low concentration as compared to the others. Thus, whenever the T4 DNA polymerase, during polymerization, reaches a position requiring the addition of the limiting nucleotide, a transient pause occurs, as exemplified in Fig. 7. As described above, the newly synthesized strand can be excised by lowering the force, which results in the activation of the exonuclease activity of the enzyme. Thus, by repeated cycles of synthesis and cleavage, the molecule (hairpin) can be sequenced by identifying the pause positions first in the presence of low concentrations of dATP (for example), and then in turn of each of the other dNTPs, i.e. dTTP, dCTP, and dGTP.
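For a template of known sequence, the positions at which pauses are expected under a limiting dNTP can be predicted, which is useful when comparing a measured pause histogram with the true sequence. The Python sketch below is an illustration only: the function name `expected_pause_positions` and the example template are hypothetical, and the template is assumed to be written in the order in which the polymerase reads it.

```python
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def expected_pause_positions(template, limiting_dntp):
    """Return 1-based template positions where the limiting dNTP is required.

    The polymerase needs the limiting nucleotide wherever the template
    carries its complementary base (e.g. limiting dATP pauses at every T).
    """
    limiting = limiting_dntp.upper()
    if limiting not in COMPLEMENT:
        raise ValueError("limiting_dntp must be one of A, C, G, T")
    needed_opposite = COMPLEMENT[limiting]
    return [i for i, base in enumerate(template.upper(), start=1)
            if base == needed_opposite]


# Hypothetical template, written in the direction of polymerase travel.
template = "TGCTAC"
pauses_low_datp = expected_pause_positions(template, "A")
pauses_low_dgtp = expected_pause_positions(template, "G")
```

Running the function once per limiting nucleotide gives four sets of positions that together cover every base of the template, mirroring the four successive experiments described above.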

Claims

1) A method for the determination of a nucleic acid sequence, said method comprising the steps of:
a) denaturing a double-stranded nucleic acid molecule corresponding to the said nucleic acid sequence;
b) hybridizing a single-stranded nucleic acid molecule ("the primer") with the said denatured double-stranded nucleic acid molecule;
c) applying a tension to the hybridized primer/double-stranded nucleic acid molecule obtained in b);
d) incubating the hybridized primer/double-stranded nucleic acid molecule obtained in step b) with a polymerase in conditions which will lead to at least one pause in replication; and
e) determining the position of the said pause in replication with respect to one end of the double-stranded nucleic acid.
2) The method of claim 1, wherein the said double-stranded nucleic acid molecule is a hairpin.
3) The method of any of claims 1 or 2, wherein at least one of the bases of one of the strands of the double-stranded nucleic acid is attached directly or indirectly to a surface, and wherein at least one of the bases of the other strand of the double-stranded nucleic acid is attached to a movable surface.
4) The method of any of the previous claims, wherein the double-stranded nucleic acid is denatured in step a) by moving away the supports.
5) The method of claim 4, wherein a physical force above or equal to 15 pN, preferably above or equal to 17 pN, more preferably above or equal to 18 pN, is applied to the double-stranded molecule by moving away the supports.
6) The method of any of the previous claims, wherein the said tension of step c) is comprised between 12 pN and 10 pN.
7) The method of any of the previous claims, wherein the ends of the double-stranded nucleic acid which are not attached to a support are physically linked to one another covalently or not covalently.
8) The method of any of the previous claims, wherein the steps a) to e) are repeated several times (so as to accumulate measurements and increase the signal/noise ratio).
9) The method of any of the previous claims, wherein step e) comprises measuring the distance (z) between the two ends of the double-stranded nucleic acid molecule which are attached to the support.
10) The method of claim 9, comprising a further step of measuring the distance (z_high) between the two ends of the double-stranded nucleic acid molecule which are attached to the support, when the said double-stranded nucleic acid molecule is denatured.
11) The method of claim 10, further comprising the steps of:
• comparing z and z_high, and
• determining the position of the pause.
12) The method of any of the previous claims wherein the primer is extended by the activity of the said polymerase, and the extension process is stopped before the said polymerase reaches the loop of the double-stranded nucleic acid molecule.
13) The method of any of the previous claims, comprising a further step of reducing the physical force applied to the double-stranded nucleic acid molecule to less than or equal to 5 pN.
14) The method of claim 13, wherein the exonuclease activity of the said polymerase is activated.
15) The method of any of the previous claims, comprising a further step of disassembling the newly synthesized strand.
16) The method of any of the previous claims, wherein steps a) to e) are repeated.
17) The method of any of the previous claims wherein the polymerase is a nucleotide-selective enzyme and the conditions of step d) include using a reaction mix which is rate-altering for the processive movement of the enzyme for a specified nucleotide.
18) The method of any of claims 1-16, wherein the conditions of step d) include using a reaction mix comprising dideoxynucleotides in addition to deoxynucleotides.
PCT/EP2011/058664 2010-05-27 2011-05-26 Method of dna sequencing by polymerisation WO2011147929A1 (en)

Priority Applications (12)

Application Number Priority Date Filing Date Title
EP11722796.7A EP2576822B1 (en) 2010-05-27 2011-05-26 Method of dna sequencing by polymerisation
CN201180034608.0A CN103097551B (en) 2010-05-27 2011-05-26 By the DNA sequencing method of polymerization
DK11722796.7T DK2576822T3 (en) 2010-05-27 2011-05-26 Method for DNA sequencing by polymerizing
JP2013511688A JP2013528380A (en) 2010-05-27 2011-05-26 DNA sequencing by polymerization
AU2011257227A AU2011257227B2 (en) 2010-05-27 2011-05-26 Method of DNA sequencing by polymerisation
ES11722796.7T ES2539254T3 (en) 2010-05-27 2011-05-26 DNA sequencing method by polymerization
CA2800637A CA2800637C (en) 2010-05-27 2011-05-26 Method of dna sequencing by polymerisation
US13/700,115 US9493829B2 (en) 2010-05-27 2011-05-26 Method of DNA sequencing by polymerisation
KR1020127034149A KR101848377B1 (en) 2010-05-27 2011-05-26 Method of dna sequencing by polymerisation
IL223256A IL223256A (en) 2010-05-27 2012-11-26 Method of dna sequencing by polymerisation
HK13111280.1A HK1183912A1 (en) 2010-05-27 2013-10-03 Method of dna sequencing by polymerisation dna
US15/334,593 US9738928B2 (en) 2010-05-27 2016-10-26 Method of DNA sequencing by polymerisation

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
EP10305563A EP2390350A1 (en) 2010-05-27 2010-05-27 Method of DNA sequencing by polymerisation
EP10305563.8 2010-05-27
US37762110P 2010-08-27 2010-08-27
US61/377,621 2010-08-27

Related Child Applications (2)

Application Number Title Priority Date Filing Date
US13/700,115 A-371-Of-International US9493829B2 (en) 2010-05-27 2011-05-26 Method of DNA sequencing by polymerisation
US15/334,593 Continuation US9738928B2 (en) 2010-05-27 2016-10-26 Method of DNA sequencing by polymerisation

Publications (1)

Publication Number Publication Date
WO2011147929A1 true WO2011147929A1 (en) 2011-12-01

Family

ID=42415374

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2011/058664 WO2011147929A1 (en) 2010-05-27 2011-05-26 Method of dna sequencing by polymerisation

Country Status (12)

Country Link
US (2) US9493829B2 (en)
EP (2) EP2390350A1 (en)
JP (2) JP2013528380A (en)
KR (1) KR101848377B1 (en)
CN (1) CN103097551B (en)
AU (1) AU2011257227B2 (en)
CA (1) CA2800637C (en)
DK (1) DK2576822T3 (en)
ES (1) ES2539254T3 (en)
HK (1) HK1183912A1 (en)
IL (1) IL223256A (en)
WO (1) WO2011147929A1 (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014114687A1 (en) 2013-01-22 2014-07-31 Centre National De La Recherche Scientifique (Cnrs) Process for detection of dna modifications and protein binding by single molecule manipulation
EP3090803A1 (en) 2015-05-07 2016-11-09 Paris Sciences et Lettres - Quartier Latin Improved device for the analysis of nucleic acid molecules
WO2016177808A1 (en) 2015-05-07 2016-11-10 Paris Sciences Et Lettres - Quartier Latin Formation of hairpins in situ using force-induced strand invasion
WO2019030306A1 (en) 2017-08-08 2019-02-14 Depixus In vitro isolation and enrichment of nucleic acids using site-specific nucleases
WO2020099675A1 (en) 2018-11-16 2020-05-22 Depixus Optimization of in vitro isolation of nucleic acids using site-specific nucleases
WO2020120711A1 (en) 2018-12-12 2020-06-18 Depixus Method of nucleic acid enrichment using site-specific nucleases followed by capture
EP4160199A1 (en) 2021-10-04 2023-04-05 Depixus Apparatus for biomolecule analysis with a well and a cavity below the well
US12139746B2 (en) 2018-12-12 2024-11-12 Depixus Method of nucleic acid enrichment using site-specific nucleases followed by capture

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2390351A1 (en) 2010-05-27 2011-11-30 Centre National de la Recherche Scientifique (CNRS) Method of DNA sequencing by hybridisation
EP2390350A1 (en) 2010-05-27 2011-11-30 Centre National de la Recherche Scientifique (CNRS) Method of DNA sequencing by polymerisation
WO2014113598A2 (en) 2013-01-16 2014-07-24 The Regents Of The University Of California Microfluidic devices to extract, concentrate and isolate molecules
US9862987B2 (en) * 2013-01-16 2018-01-09 The Regents Of The University Of California Label free molecular detection methods, systems and devices
US10344326B2 (en) 2016-05-25 2019-07-09 International Business Machines Corporation Magnetic flux density based DNA sequencing
AU2018272302A1 (en) * 2017-05-26 2019-12-19 Nuclera Nucleics Ltd Use of terminal transferase enzyme in nucleic acid synthesis

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0146815A2 (en) 1983-12-12 1985-07-03 Miles Inc. Nucleic acid hybridization assay employing antibodies to intercalation complexes
EP0152886A2 (en) 1984-02-22 1985-08-28 Molecular Diagnostics, Inc. Immobilized nucleic acid-containing probes
US4849077A (en) 1984-08-06 1989-07-18 Akademie Der Wissenschaften Der Ddr Process for solid phase-sequencing of nucleic acid fragments
EP0329198A2 (en) 1981-04-17 1989-08-23 Yale University Ribose- and 2-deoxyribose compounds
WO1992016659A1 (en) 1991-03-21 1992-10-01 Eastman Kodak Company Element and method for nucleic acid amplification and detection using adhered probes
US20030027187A1 (en) * 1997-03-28 2003-02-06 Center National De La Recherche Scientifique (Cnrs) Apparatus and method for the manipulation and testing of molecules, and in particular of DNA
WO2003066896A2 (en) 2002-02-09 2003-08-14 Nanotype Gmbh Method for the detection of mutations
US20030166262A1 (en) * 1997-03-28 2003-09-04 Center National De La Recherche Scientifique (Cnrs) Apparatus and method for the manipulation and testing of molecules, and in particular of DNA
US6723513B2 (en) 1998-12-23 2004-04-20 Lingvitae As Sequencing method using magnifying tags
WO2006084132A2 (en) 2005-02-01 2006-08-10 Agencourt Bioscience Corp. Reagents, methods, and libraries for bead-based squencing
US20060275782A1 (en) 1999-04-20 2006-12-07 Illumina, Inc. Detection of nucleic acid reactions on bead arrays
WO2007111924A2 (en) 2006-03-23 2007-10-04 The Board Of Trustees Of The Leland Stanford Junior University Motion resolved molecular sequencing
WO2010016937A2 (en) * 2008-08-08 2010-02-11 Ion Torrent Systems Incorporated Methods for sequencing individual nucleic acids under tension

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2703693B1 (en) * 1993-04-06 1995-07-13 Pasteur Institut Rapid method of determining a DNA sequence and application to sequencing and diagnosis.
FR2760024B1 (en) * 1997-02-21 1999-05-14 Centre Nat Rech Scient PROCESS FOR CHARACTERIZING DUPLEX OF NUCLEIC ACID
CA2281674A1 (en) 1997-02-25 1998-08-27 Ludwig Institute For Cancer Research Parg, a gtpase activating protein which interacts with ptpl1
CA2580070A1 (en) * 2004-09-10 2006-03-23 Sequenom, Inc. Methods for long-range sequence analysis of nucleic acids
WO2007091077A1 (en) * 2006-02-08 2007-08-16 Solexa Limited Method for sequencing a polynucleotide template
US7754429B2 (en) * 2006-10-06 2010-07-13 Illumina Cambridge Limited Method for pair-wise sequencing a plurity of target polynucleotides
CN100552041C (en) * 2007-01-22 2009-10-21 东南大学 Method for extensional sequencing DNA by circular crossbreed
EP2390351A1 (en) 2010-05-27 2011-11-30 Centre National de la Recherche Scientifique (CNRS) Method of DNA sequencing by hybridisation
EP2390350A1 (en) * 2010-05-27 2011-11-30 Centre National de la Recherche Scientifique (CNRS) Method of DNA sequencing by polymerisation
KR101917272B1 (en) 2011-12-22 2018-11-09 쌩뜨레 나티오날 데 라 르세르쉬 생띠끄 (씨. 엔. 알. 에스) Method of dna detection and quantification by single-molecule hybridization and manipulation

Patent Citations (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0329198A2 (en) 1981-04-17 1989-08-23 Yale University Ribose- and 2-deoxyribose compounds
EP0146815A2 (en) 1983-12-12 1985-07-03 Miles Inc. Nucleic acid hybridization assay employing antibodies to intercalation complexes
EP0152886A2 (en) 1984-02-22 1985-08-28 Molecular Diagnostics, Inc. Immobilized nucleic acid-containing probes
US4849077A (en) 1984-08-06 1989-07-18 Akademie Der Wissenschaften Der Ddr Process for solid phase-sequencing of nucleic acid fragments
US4882127A (en) 1984-08-06 1989-11-21 Akademie Der Wissenschaften Der Ddr Device for solid phase sequencing of nucleic acid fragments
WO1992016659A1 (en) 1991-03-21 1992-10-01 Eastman Kodak Company Element and method for nucleic acid amplification and detection using adhered probes
US20030027187A1 (en) * 1997-03-28 2003-02-06 Center National De La Recherche Scientifique (Cnrs) Apparatus and method for the manipulation and testing of molecules, and in particular of DNA
US7244391B2 (en) 1997-03-28 2007-07-17 Centre National De La Recherche Scientifique (Cnrs) Apparatus and method for the manipulation and testing of molecules, and in particular of DNA
US20030166262A1 (en) * 1997-03-28 2003-09-04 Center National De La Recherche Scientifique (Cnrs) Apparatus and method for the manipulation and testing of molecules, and in particular of DNA
US7052650B2 (en) 1997-03-28 2006-05-30 Center National De La Recherche Scientifique (Cnrs) Apparatus and method for the manipulation and testing of molecules, and in particular of DNA
EP1141399B1 (en) 1998-12-23 2005-08-17 Preben Lexow Sequencing method using magnifying tags
US6723513B2 (en) 1998-12-23 2004-04-20 Lingvitae As Sequencing method using magnifying tags
US20060275782A1 (en) 1999-04-20 2006-12-07 Illumina, Inc. Detection of nucleic acid reactions on bead arrays
US20090186349A1 (en) 1999-04-20 2009-07-23 Illumina, Inc. Detection of nucleic acid reactions on bead arrays
WO2003066896A2 (en) 2002-02-09 2003-08-14 Nanotype Gmbh Method for the detection of mutations
WO2006084132A2 (en) 2005-02-01 2006-08-10 Agencourt Bioscience Corp. Reagents, methods, and libraries for bead-based squencing
US20090181385A1 (en) 2005-02-01 2009-07-16 Applied Biosystems Inc. Reagents, methods, and libraries for bead-based sequencing
US20090181860A1 (en) 2005-02-01 2009-07-16 Applied Biosystems Inc. Reagents, methods, and libraries for bead-based sequencing
WO2007111924A2 (en) 2006-03-23 2007-10-04 The Board Of Trustees Of The Leland Stanford Junior University Motion resolved molecular sequencing
US20080020392A1 (en) 2006-03-23 2008-01-24 Block Steven M Motion resolved molecular sequencing
US7556922B2 (en) 2006-03-23 2009-07-07 The Board Of Trustees Of The Leland Stanford Junior University Motion resolved molecular sequencing
WO2010016937A2 (en) * 2008-08-08 2010-02-11 Ion Torrent Systems Incorporated Methods for sequencing individual nucleic acids under tension
US20100035252A1 (en) 2008-08-08 2010-02-11 Ion Torrent Systems Incorporated Methods for sequencing individual nucleic acids under tension

Non-Patent Citations (24)

* Cited by examiner, † Cited by third party
Title
"Remington's Pharmaceutical Sciences", 1985, MACK PUBLISHING CO.
AUSUBEL ET AL.: "Current Protocols in Molecular Biology", 1995, JOHN WILEY & SONS, INC.
CLOONAN ET AL., NAT METHODS, vol. 5, no. 7, 2008, pages 613 - 619
FULLER ET AL., NATURE BIOTECHNOL., vol. 27, no. 11, 2009, pages 1013 - 1023
GREENLEAF, BLOCK, SCIENCE, vol. 313, 2006, pages 801
HUTCHINSON, NUCL. ACIDS RES., vol. 35, no. 18, 2007, pages 6227 - 6237
LIU ET AL., BIOPHYS J, vol. 96, no. 9, 2009, pages 3810 - 3821
MAIER ET AL., PROC. NATL., ACAD. SCI. U.S.A., vol. 97, no. 22, 2000, pages 12002 - 12007
MARDIS ET AL., N ENGL J MED., vol. 361, no. 11, 2009, pages 1058 - 66
MARDIS, GENOME MED., vol. 1, no. 4, 2009, pages 40
METZKER, NATURE REV. GENET., vol. 11, no. 1, 2010, pages 31 - 46
MITREVA, MARDIS, METHODS MOL BIOL., vol. 533, 2009, pages 153 - 87
ORSCHELN ET AL., CLIN INFECT DIS., vol. 49, no. 4, 2009, pages 536 - 42
PIHLAK ET AL., NAT BIOTECHNOL., vol. 26, no. 6, 2008, pages 676 - 684
SAMBROOK ET AL.: "Molecular cloning: A laboratory manual", 1989, COLD SPRING HARBOR LABORATORY PRESS
SHENDURE, JI, NAT BIOTECHNOL., vol. 26, no. 10, 2008, pages 1135 - 45
SINGH-ZOCCHI ET AL., PROC NATL ACAD SCI USA., vol. 100, no. 13, 2003, pages 7605 - 7610
TURNER ET AL., J BACTERIOL, vol. 176, no. 12, 1994, pages 3708 - 3722
VALOUEV ET AL., GENOME RES., vol. 18, no. 7, 2008, pages 1051 - 63
VALOUEV ET AL., NAT METHODS., vol. 5, no. 9, 2008, pages 829 - 34
WALTER ET AL., PROC NATL ACAD SCI US A., vol. 106, no. 31, 2009, pages 12950 - 5
WESTON ET AL., INFECTION AND IMMUNITY, vol. 77, no. 7, 2009, pages 2840 - 2848
WUITE ET AL., NATURE, vol. 404, no. 6773, 2000, pages 103 - 106
ZHANG ET AL., NATURE, vol. 358, 1992, pages 591 - 593

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9915655B2 (en) 2013-01-22 2018-03-13 Centre National De La Recherche Scientifique (Cnrs) Process for detection of DNA modifications and protein binding by a single molecule manipulation
WO2014114687A1 (en) 2013-01-22 2014-07-31 Centre National De La Recherche Scientifique (Cnrs) Process for detection of dna modifications and protein binding by single molecule manipulation
EP3090803A1 (en) 2015-05-07 2016-11-09 Paris Sciences et Lettres - Quartier Latin Improved device for the analysis of nucleic acid molecules
WO2016177808A1 (en) 2015-05-07 2016-11-10 Paris Sciences Et Lettres - Quartier Latin Formation of hairpins in situ using force-induced strand invasion
WO2016177869A1 (en) 2015-05-07 2016-11-10 Paris Sciences Et Lettres - Quartier Latin Improved device for the analysis of nucleic acid molecules
US10196683B2 (en) 2015-05-07 2019-02-05 Paris Sciences Et Lettres—Quartier Latin Formation of hairpins in situ using force-induced strand invasion
US10933416B2 (en) 2015-05-07 2021-03-02 Paris Sciences Et Lettres—Quartier Latin Device for the analysis of nucleic acid molecules
US11384383B2 (en) 2017-08-08 2022-07-12 Depixus In vitro isolation and enrichment of nucleic acids using site-specific nucleases
WO2019030306A1 (en) 2017-08-08 2019-02-14 Depixus In vitro isolation and enrichment of nucleic acids using site-specific nucleases
EP3950957A1 (en) 2017-08-08 2022-02-09 Depixus In vitro isolation and enrichment of nucleic acids using site-specific nucleases
WO2020099675A1 (en) 2018-11-16 2020-05-22 Depixus Optimization of in vitro isolation of nucleic acids using site-specific nucleases
WO2020120711A1 (en) 2018-12-12 2020-06-18 Depixus Method of nucleic acid enrichment using site-specific nucleases followed by capture
US11473124B2 (en) 2018-12-12 2022-10-18 Depixus Method of nucleic acid enrichment using site-specific nucleases followed by capture
EP4095259A1 (en) 2018-12-12 2022-11-30 Depixus Method of nucleic acid enrichment using site-specific nucleases followed by capture
US12139746B2 (en) 2018-12-12 2024-11-12 Depixus Method of nucleic acid enrichment using site-specific nucleases followed by capture
EP4160199A1 (en) 2021-10-04 2023-04-05 Depixus Apparatus for biomolecule analysis with a well and a cavity below the well
WO2023057345A1 (en) 2021-10-04 2023-04-13 Depixus Apparatus for biomolecule analysis with a well and a cavity below the well

Also Published As

Publication number Publication date
CN103097551A (en) 2013-05-08
AU2011257227A1 (en) 2013-01-24
US9738928B2 (en) 2017-08-22
EP2576822A1 (en) 2013-04-10
IL223256A0 (en) 2013-02-03
CA2800637A1 (en) 2011-12-01
AU2011257227B2 (en) 2015-07-09
DK2576822T3 (en) 2015-06-29
US9493829B2 (en) 2016-11-15
EP2390350A1 (en) 2011-11-30
US20170073749A1 (en) 2017-03-16
IL223256A (en) 2017-10-31
JP2016214250A (en) 2016-12-22
CA2800637C (en) 2018-08-28
US20130171636A1 (en) 2013-07-04
CN103097551B (en) 2016-01-20
KR101848377B1 (en) 2018-04-13
EP2576822B1 (en) 2015-03-18
KR20130123302A (en) 2013-11-12
JP2013528380A (en) 2013-07-11
HK1183912A1 (en) 2014-01-10
JP6325028B2 (en) 2018-05-16
ES2539254T3 (en) 2015-06-29

Similar Documents

Publication Publication Date Title
US9738928B2 (en) Method of DNA sequencing by polymerisation
US9765394B2 (en) Method of DNA sequencing by hybridisation
KR102592367B1 (en) Systems and methods for clonal replication and amplification of nucleic acid molecules for genomic and therapeutic applications
EP3036359B1 (en) Next-generation sequencing libraries
JP2016537990A (en) Nucleic acid probe and genomic fragment detection method
US20200002759A1 (en) Methods for studying nucleotide accessibility in DNA and RNA based on low-yield bisulfite conversion and next-generation sequencing

Legal Events

Date Code Title Description
WWE WIPO information: entry into national phase. Ref document number: 201180034608.0. Country of ref document: CN.
121 EP: the EPO has been informed by WIPO that EP was designated in this application. Ref document number: 11722796. Country of ref document: EP. Kind code of ref document: A1.
ENP Entry into the national phase. Ref document number: 2800637. Country of ref document: CA.
WWE WIPO information: entry into national phase. Ref document number: 223256. Country of ref document: IL.
ENP Entry into the national phase. Ref document number: 2013511688. Country of ref document: JP. Kind code of ref document: A.
NENP Non-entry into the national phase. Ref country code: DE.
WWE WIPO information: entry into national phase. Ref document number: 2011722796. Country of ref document: EP.
ENP Entry into the national phase. Ref document number: 20127034149. Country of ref document: KR. Kind code of ref document: A.
ENP Entry into the national phase. Ref document number: 2011257227. Country of ref document: AU. Date of ref document: 20110526. Kind code of ref document: A.
WWE WIPO information: entry into national phase. Ref document number: 13700115. Country of ref document: US.