WO2024216159A1 - Methods and compositions for nucleic acid sequencing using labeled nucleotides - Google Patents
Methods and compositions for nucleic acid sequencing using labeled nucleotides Download PDFInfo
- Publication number
- WO2024216159A1 WO2024216159A1 PCT/US2024/024437 US2024024437W WO2024216159A1 WO 2024216159 A1 WO2024216159 A1 WO 2024216159A1 US 2024024437 W US2024024437 W US 2024024437W WO 2024216159 A1 WO2024216159 A1 WO 2024216159A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- label
- nucleotides
- nucleotide
- signal
- wavelength
- Prior art date
Links
- 125000003729 nucleotide group Chemical group 0.000 title claims abstract description 385
- 239000002773 nucleotide Substances 0.000 title claims abstract description 365
- 238000000034 method Methods 0.000 title claims abstract description 150
- 150000007523 nucleic acids Chemical class 0.000 title abstract description 118
- 102000039446 nucleic acids Human genes 0.000 title abstract description 103
- 108020004707 nucleic acids Proteins 0.000 title abstract description 103
- 238000012163 sequencing technique Methods 0.000 title abstract description 49
- 239000000203 mixture Substances 0.000 title abstract description 26
- 230000003321 amplification Effects 0.000 claims abstract description 18
- 238000003199 nucleic acid amplification method Methods 0.000 claims abstract description 18
- 238000005096 rolling process Methods 0.000 claims abstract description 8
- 102000040430 polynucleotide Human genes 0.000 claims description 84
- 108091033319 polynucleotide Proteins 0.000 claims description 84
- 239000002157 polynucleotide Substances 0.000 claims description 84
- 239000000758 substrate Substances 0.000 claims description 77
- 239000013615 primer Substances 0.000 claims description 65
- 238000010348 incorporation Methods 0.000 claims description 54
- 238000003384 imaging method Methods 0.000 claims description 26
- 230000000295 complement effect Effects 0.000 claims description 23
- 230000002441 reversible effect Effects 0.000 claims description 20
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 claims description 18
- 238000005286 illumination Methods 0.000 claims description 13
- 108091034117 Oligonucleotide Proteins 0.000 claims description 12
- 108010090804 Streptavidin Proteins 0.000 claims description 11
- 229960002685 biotin Drugs 0.000 claims description 9
- 235000020958 biotin Nutrition 0.000 claims description 9
- 239000011616 biotin Substances 0.000 claims description 9
- AUTOLBMXDDTRRT-JGVFFNPUSA-N (4R,5S)-dethiobiotin Chemical compound C[C@@H]1NC(=O)N[C@@H]1CCCCCC(O)=O AUTOLBMXDDTRRT-JGVFFNPUSA-N 0.000 claims description 8
- 239000000178 monomer Substances 0.000 claims description 8
- 239000003155 DNA primer Substances 0.000 claims description 7
- 229910052799 carbon Inorganic materials 0.000 claims description 6
- 229910052770 Uranium Inorganic materials 0.000 claims description 4
- 108091028732 Concatemer Proteins 0.000 claims description 2
- 238000001712 DNA sequencing Methods 0.000 abstract description 4
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 82
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 76
- 102000053602 DNA Human genes 0.000 description 49
- 108020004414 DNA Proteins 0.000 description 48
- 239000000975 dye Substances 0.000 description 42
- 239000011541 reaction mixture Substances 0.000 description 28
- 229920002477 rna polymer Polymers 0.000 description 28
- 238000006243 chemical reaction Methods 0.000 description 26
- 239000003153 chemical reaction reagent Substances 0.000 description 24
- 102100034343 Integrase Human genes 0.000 description 18
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 18
- 238000001514 detection method Methods 0.000 description 18
- 102000004190 Enzymes Human genes 0.000 description 17
- 108090000790 Enzymes Proteins 0.000 description 17
- 239000000872 buffer Substances 0.000 description 15
- -1 ATTO-TAGTM FQ Chemical compound 0.000 description 14
- 239000012099 Alexa Fluor family Substances 0.000 description 14
- 210000004027 cell Anatomy 0.000 description 14
- 230000009977 dual effect Effects 0.000 description 14
- 239000000523 sample Substances 0.000 description 13
- 108090000623 proteins and genes Proteins 0.000 description 12
- 150000001875 compounds Chemical class 0.000 description 11
- 108020004999 messenger RNA Proteins 0.000 description 11
- 102000004169 proteins and genes Human genes 0.000 description 11
- 239000003112 inhibitor Substances 0.000 description 10
- 108091028043 Nucleic acid sequence Proteins 0.000 description 9
- 150000002500 ions Chemical class 0.000 description 9
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 8
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 8
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 8
- 229940123066 Polymerase inhibitor Drugs 0.000 description 8
- 230000015572 biosynthetic process Effects 0.000 description 8
- 229910052791 calcium Inorganic materials 0.000 description 8
- 239000011575 calcium Substances 0.000 description 8
- 230000003197 catalytic effect Effects 0.000 description 8
- 230000002860 competitive effect Effects 0.000 description 8
- 230000000694 effects Effects 0.000 description 8
- 239000007850 fluorescent dye Substances 0.000 description 8
- 238000003786 synthesis reaction Methods 0.000 description 8
- 241000713838 Avian myeloblastosis virus Species 0.000 description 7
- 108020004682 Single-Stranded DNA Proteins 0.000 description 7
- 229910021645 metal ion Inorganic materials 0.000 description 7
- 238000006116 polymerization reaction Methods 0.000 description 7
- 238000012408 PCR amplification Methods 0.000 description 6
- HAAZLUGHYHWQIW-KVQBGUIXSA-N dGTP Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 HAAZLUGHYHWQIW-KVQBGUIXSA-N 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 6
- 230000003993 interaction Effects 0.000 description 6
- 229910052751 metal Inorganic materials 0.000 description 6
- 239000002184 metal Substances 0.000 description 6
- 150000003839 salts Chemical class 0.000 description 6
- 229910019142 PO4 Inorganic materials 0.000 description 5
- 108091005804 Peptidases Proteins 0.000 description 5
- 108010006785 Taq Polymerase Proteins 0.000 description 5
- SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 description 5
- SUYVUBYJARFZHO-UHFFFAOYSA-N dATP Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-UHFFFAOYSA-N 0.000 description 5
- RGWHQCVHVJXOKC-SHYZEUOFSA-J dCTP(4-) Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)C1 RGWHQCVHVJXOKC-SHYZEUOFSA-J 0.000 description 5
- NHVNXKFIZYSCEB-XLPZGREQSA-N dTTP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C1 NHVNXKFIZYSCEB-XLPZGREQSA-N 0.000 description 5
- 238000002866 fluorescence resonance energy transfer Methods 0.000 description 5
- 238000012986 modification Methods 0.000 description 5
- 239000010452 phosphate Substances 0.000 description 5
- 238000003752 polymerase chain reaction Methods 0.000 description 5
- 235000018102 proteins Nutrition 0.000 description 5
- HSHNITRMYYLLCV-UHFFFAOYSA-N 4-methylumbelliferone Chemical compound C1=C(O)C=CC2=C1OC(=O)C=C2C HSHNITRMYYLLCV-UHFFFAOYSA-N 0.000 description 4
- BZTDTCNHAFUJOG-UHFFFAOYSA-N 6-carboxyfluorescein Chemical compound C12=CC=C(O)C=C2OC2=CC(O)=CC=C2C11OC(=O)C2=CC=C(C(=O)O)C=C21 BZTDTCNHAFUJOG-UHFFFAOYSA-N 0.000 description 4
- YXHLJMWYDTXDHS-IRFLANFNSA-N 7-aminoactinomycin D Chemical compound C[C@H]1OC(=O)[C@H](C(C)C)N(C)C(=O)CN(C)C(=O)[C@@H]2CCCN2C(=O)[C@@H](C(C)C)NC(=O)[C@H]1NC(=O)C1=C(N)C(=O)C(C)=C2OC(C(C)=C(N)C=C3C(=O)N[C@@H]4C(=O)N[C@@H](C(N5CCC[C@H]5C(=O)N(C)CC(=O)N(C)[C@@H](C(C)C)C(=O)O[C@@H]4C)=O)C(C)C)=C3N=C21 YXHLJMWYDTXDHS-IRFLANFNSA-N 0.000 description 4
- 108700012813 7-aminoactinomycin D Proteins 0.000 description 4
- 108020000946 Bacterial DNA Proteins 0.000 description 4
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 4
- 241000713869 Moloney murine leukemia virus Species 0.000 description 4
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 4
- PZBFGYYEXUXCOF-UHFFFAOYSA-N TCEP Chemical compound OC(=O)CCP(CCC(O)=O)CCC(O)=O PZBFGYYEXUXCOF-UHFFFAOYSA-N 0.000 description 4
- 108020005202 Viral DNA Proteins 0.000 description 4
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 4
- 235000001014 amino acid Nutrition 0.000 description 4
- 150000001413 amino acids Chemical class 0.000 description 4
- 108091005948 blue fluorescent proteins Proteins 0.000 description 4
- 150000001768 cations Chemical class 0.000 description 4
- 239000003795 chemical substances by application Substances 0.000 description 4
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 4
- 125000003963 dichloro group Chemical group Cl* 0.000 description 4
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 4
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 4
- 238000009396 hybridization Methods 0.000 description 4
- 239000007788 liquid Substances 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 230000035772 mutation Effects 0.000 description 4
- 238000007481 next generation sequencing Methods 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- XJMOSONTPMZWPB-UHFFFAOYSA-M propidium iodide Chemical compound [I-].[I-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CCC[N+](C)(CC)CC)=C1C1=CC=CC=C1 XJMOSONTPMZWPB-UHFFFAOYSA-M 0.000 description 4
- 239000003419 rna directed dna polymerase inhibitor Substances 0.000 description 4
- 241000894007 species Species 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- ABZLKHKQJHEPAX-UHFFFAOYSA-N tetramethylrhodamine Chemical compound C=12C=CC(N(C)C)=CC2=[O+]C2=CC(N(C)C)=CC=C2C=1C1=CC=CC=C1C([O-])=O ABZLKHKQJHEPAX-UHFFFAOYSA-N 0.000 description 4
- MPLHNVLQVRSVEE-UHFFFAOYSA-N texas red Chemical compound [O-]S(=O)(=O)C1=CC(S(Cl)(=O)=O)=CC=C1C(C1=CC=2CCCN3CCCC(C=23)=C1O1)=C2C1=C(CCC1)C3=[N+]1CCCC3=C2 MPLHNVLQVRSVEE-UHFFFAOYSA-N 0.000 description 4
- 210000001519 tissue Anatomy 0.000 description 4
- 238000005406 washing Methods 0.000 description 4
- VGIRNWJSIRVFRT-UHFFFAOYSA-N 2',7'-difluorofluorescein Chemical compound OC(=O)C1=CC=CC=C1C1=C2C=C(F)C(=O)C=C2OC2=CC(O)=C(F)C=C21 VGIRNWJSIRVFRT-UHFFFAOYSA-N 0.000 description 3
- 125000001917 2,4-dinitrophenyl group Chemical group [H]C1=C([H])C(=C([H])C(=C1*)[N+]([O-])=O)[N+]([O-])=O 0.000 description 3
- NJYVEMPWNAYQQN-UHFFFAOYSA-N 5-carboxyfluorescein Chemical compound C12=CC=C(O)C=C2OC2=CC(O)=CC=C2C21OC(=O)C1=CC(C(=O)O)=CC=C21 NJYVEMPWNAYQQN-UHFFFAOYSA-N 0.000 description 3
- YMZMTOFQCVHHFB-UHFFFAOYSA-N 5-carboxytetramethylrhodamine Chemical compound C=12C=CC(N(C)C)=CC2=[O+]C2=CC(N(C)C)=CC=C2C=1C1=CC=C(C(O)=O)C=C1C([O-])=O YMZMTOFQCVHHFB-UHFFFAOYSA-N 0.000 description 3
- 108010017826 DNA Polymerase I Proteins 0.000 description 3
- 102000004594 DNA Polymerase I Human genes 0.000 description 3
- 230000006820 DNA synthesis Effects 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- 101900297506 Human immunodeficiency virus type 1 group M subtype B Reverse transcriptase/ribonuclease H Proteins 0.000 description 3
- 102000035195 Peptidases Human genes 0.000 description 3
- 239000004365 Protease Substances 0.000 description 3
- 108091028664 Ribonucleotide Proteins 0.000 description 3
- 241000205188 Thermococcus Species 0.000 description 3
- 102000002933 Thioredoxin Human genes 0.000 description 3
- 241000700605 Viruses Species 0.000 description 3
- DZBUGLKDJFMEHC-UHFFFAOYSA-N acridine Chemical class C1=CC=CC2=CC3=CC=CC=C3N=C21 DZBUGLKDJFMEHC-UHFFFAOYSA-N 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 239000003638 chemical reducing agent Substances 0.000 description 3
- 108010082025 cyan fluorescent protein Proteins 0.000 description 3
- 230000009849 deactivation Effects 0.000 description 3
- 238000012217 deletion Methods 0.000 description 3
- 230000037430 deletion Effects 0.000 description 3
- 239000005547 deoxyribonucleotide Substances 0.000 description 3
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 3
- 230000002255 enzymatic effect Effects 0.000 description 3
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 3
- 108010021843 fluorescent protein 583 Proteins 0.000 description 3
- 238000003780 insertion Methods 0.000 description 3
- 230000037431 insertion Effects 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 230000003287 optical effect Effects 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- PYWVYCXTNDRMGF-UHFFFAOYSA-N rhodamine B Chemical compound [Cl-].C=12C=CC(=[N+](CC)CC)C=C2OC2=CC(N(CC)CC)=CC=C2C=1C1=CC=CC=C1C(O)=O PYWVYCXTNDRMGF-UHFFFAOYSA-N 0.000 description 3
- 239000002336 ribonucleotide Substances 0.000 description 3
- 125000002652 ribonucleotide group Chemical group 0.000 description 3
- 239000003381 stabilizer Substances 0.000 description 3
- 108060008226 thioredoxin Proteins 0.000 description 3
- 229940094937 thioredoxin Drugs 0.000 description 3
- 108091005957 yellow fluorescent proteins Proteins 0.000 description 3
- BGWLYQZDNFIFRX-UHFFFAOYSA-N 5-[3-[2-[3-(3,8-diamino-6-phenylphenanthridin-5-ium-5-yl)propylamino]ethylamino]propyl]-6-phenylphenanthridin-5-ium-3,8-diamine;dichloride Chemical compound [Cl-].[Cl-].C=1C(N)=CC=C(C2=CC=C(N)C=C2[N+]=2CCCNCCNCCC[N+]=3C4=CC(N)=CC=C4C4=CC=C(N)C=C4C=3C=3C=CC=CC=3)C=1C=2C1=CC=CC=C1 BGWLYQZDNFIFRX-UHFFFAOYSA-N 0.000 description 2
- BUJRUSRXHJKUQE-UHFFFAOYSA-N 5-carboxy-X-rhodamine triethylammonium salt Chemical compound CC[NH+](CC)CC.[O-]C(=O)C1=CC(C(=O)[O-])=CC=C1C1=C(C=C2C3=C4CCCN3CCC2)C4=[O+]C2=C1C=C1CCCN3CCCC2=C13 BUJRUSRXHJKUQE-UHFFFAOYSA-N 0.000 description 2
- VWOLRKMFAJUZGM-UHFFFAOYSA-N 6-carboxyrhodamine 6G Chemical compound [Cl-].C=12C=C(C)C(NCC)=CC2=[O+]C=2C=C(NCC)C(C)=CC=2C=1C1=CC(C(O)=O)=CC=C1C(=O)OCC VWOLRKMFAJUZGM-UHFFFAOYSA-N 0.000 description 2
- IXQPRUQVJIJUEB-UHFFFAOYSA-N 7-(diethylamino)-n-[2-(2,5-dioxopyrrol-1-yl)ethyl]-2-oxochromene-3-carboxamide Chemical compound O=C1OC2=CC(N(CC)CC)=CC=C2C=C1C(=O)NCCN1C(=O)C=CC1=O IXQPRUQVJIJUEB-UHFFFAOYSA-N 0.000 description 2
- 229930024421 Adenine Natural products 0.000 description 2
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 108020001019 DNA Primers Proteins 0.000 description 2
- 108010008286 DNA nucleotidylexotransferase Proteins 0.000 description 2
- 102100033215 DNA nucleotidylexotransferase Human genes 0.000 description 2
- SHIBSTMRCDJXLN-UHFFFAOYSA-N Digoxigenin Natural products C1CC(C2C(C3(C)CCC(O)CC3CC2)CC2O)(O)C2(C)C1C1=CC(=O)OC1 SHIBSTMRCDJXLN-UHFFFAOYSA-N 0.000 description 2
- 108090000371 Esterases Proteins 0.000 description 2
- 108060002716 Exonuclease Proteins 0.000 description 2
- OZLGRUXZXMRXGP-UHFFFAOYSA-N Fluo-3 Chemical compound CC1=CC=C(N(CC(O)=O)CC(O)=O)C(OCCOC=2C(=CC=C(C=2)C2=C3C=C(Cl)C(=O)C=C3OC3=CC(O)=C(Cl)C=C32)N(CC(O)=O)CC(O)=O)=C1 OZLGRUXZXMRXGP-UHFFFAOYSA-N 0.000 description 2
- 108010001336 Horseradish Peroxidase Proteins 0.000 description 2
- 108010001584 Human immunodeficiency virus 2 reverse transcriptase Proteins 0.000 description 2
- MHAJPDPJQMAIIY-UHFFFAOYSA-N Hydrogen peroxide Chemical compound OO MHAJPDPJQMAIIY-UHFFFAOYSA-N 0.000 description 2
- XEEYBQQBJWHFJM-UHFFFAOYSA-N Iron Chemical compound [Fe] XEEYBQQBJWHFJM-UHFFFAOYSA-N 0.000 description 2
- FGBAVQUHSKYMTC-UHFFFAOYSA-M LDS 751 dye Chemical compound [O-]Cl(=O)(=O)=O.C1=CC2=CC(N(C)C)=CC=C2[N+](CC)=C1C=CC=CC1=CC=C(N(C)C)C=C1 FGBAVQUHSKYMTC-UHFFFAOYSA-M 0.000 description 2
- 108090001060 Lipase Proteins 0.000 description 2
- 102000004882 Lipase Human genes 0.000 description 2
- 239000004367 Lipase Substances 0.000 description 2
- SEQKRHFRPICQDD-UHFFFAOYSA-N N-tris(hydroxymethyl)methylglycine Chemical compound OCC(CO)(CO)[NH2+]CC([O-])=O SEQKRHFRPICQDD-UHFFFAOYSA-N 0.000 description 2
- 102000003992 Peroxidases Human genes 0.000 description 2
- 108010002747 Pfu DNA polymerase Proteins 0.000 description 2
- 241000205160 Pyrococcus Species 0.000 description 2
- 239000013616 RNA primer Substances 0.000 description 2
- 230000006819 RNA synthesis Effects 0.000 description 2
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 241000205180 Thermococcus litoralis Species 0.000 description 2
- 241001495444 Thermococcus sp. Species 0.000 description 2
- ATJFFYVFTNAWJD-UHFFFAOYSA-N Tin Chemical compound [Sn] ATJFFYVFTNAWJD-UHFFFAOYSA-N 0.000 description 2
- MZZINWWGSYUHGU-UHFFFAOYSA-J ToTo-1 Chemical compound [I-].[I-].[I-].[I-].C12=CC=CC=C2C(C=C2N(C3=CC=CC=C3S2)C)=CC=[N+]1CCC[N+](C)(C)CCC[N+](C)(C)CCC[N+](C1=CC=CC=C11)=CC=C1C=C1N(C)C2=CC=CC=C2S1 MZZINWWGSYUHGU-UHFFFAOYSA-J 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- 108020000999 Viral RNA Proteins 0.000 description 2
- GRRMZXFOOGQMFA-UHFFFAOYSA-J YoYo-1 Chemical compound [I-].[I-].[I-].[I-].C12=CC=CC=C2C(C=C2N(C3=CC=CC=C3O2)C)=CC=[N+]1CCC[N+](C)(C)CCC[N+](C)(C)CCC[N+](C1=CC=CC=C11)=CC=C1C=C1N(C)C2=CC=CC=C2O1 GRRMZXFOOGQMFA-UHFFFAOYSA-J 0.000 description 2
- DPKHZNPWBDQZCN-UHFFFAOYSA-N acridine orange free base Chemical compound C1=CC(N(C)C)=CC2=NC3=CC(N(C)C)=CC=C3C=C21 DPKHZNPWBDQZCN-UHFFFAOYSA-N 0.000 description 2
- RJURFGZVJUQBHK-UHFFFAOYSA-N actinomycin D Natural products CC1OC(=O)C(C(C)C)N(C)C(=O)CN(C)C(=O)C2CCCN2C(=O)C(C(C)C)NC(=O)C1NC(=O)C1=C(N)C(=O)C(C)=C2OC(C(C)=CC=C3C(=O)NC4C(=O)NC(C(N5CCCC5C(=O)N(C)CC(=O)N(C)C(C(C)C)C(=O)OC4C)=O)C(C)C)=C3N=C21 RJURFGZVJUQBHK-UHFFFAOYSA-N 0.000 description 2
- 229960000643 adenine Drugs 0.000 description 2
- 108010004469 allophycocyanin Proteins 0.000 description 2
- 230000004075 alteration Effects 0.000 description 2
- 150000001412 amines Chemical class 0.000 description 2
- 239000012491 analyte Substances 0.000 description 2
- 238000000137 annealing Methods 0.000 description 2
- 238000004630 atomic force microscopy Methods 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 230000000903 blocking effect Effects 0.000 description 2
- 238000010804 cDNA synthesis Methods 0.000 description 2
- 238000006555 catalytic reaction Methods 0.000 description 2
- 239000003086 colorant Substances 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- 230000008878 coupling Effects 0.000 description 2
- 238000010168 coupling process Methods 0.000 description 2
- 238000005859 coupling reaction Methods 0.000 description 2
- 229940104302 cytosine Drugs 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 230000000368 destabilizing effect Effects 0.000 description 2
- 239000003599 detergent Substances 0.000 description 2
- GFZPJHFJZGRWMQ-UHFFFAOYSA-M diOC18(3) dye Chemical compound [O-]Cl(=O)(=O)=O.O1C2=CC=CC=C2[N+](CCCCCCCCCCCCCCCCCC)=C1C=CC=C1N(CCCCCCCCCCCCCCCCCC)C2=CC=CC=C2O1 GFZPJHFJZGRWMQ-UHFFFAOYSA-M 0.000 description 2
- 239000005546 dideoxynucleotide Substances 0.000 description 2
- QONQRTHLHBTMGP-UHFFFAOYSA-N digitoxigenin Natural products CC12CCC(C3(CCC(O)CC3CC3)C)C3C11OC1CC2C1=CC(=O)OC1 QONQRTHLHBTMGP-UHFFFAOYSA-N 0.000 description 2
- SHIBSTMRCDJXLN-KCZCNTNESA-N digoxigenin Chemical compound C1([C@@H]2[C@@]3([C@@](CC2)(O)[C@H]2[C@@H]([C@@]4(C)CC[C@H](O)C[C@H]4CC2)C[C@H]3O)C)=CC(=O)OC1 SHIBSTMRCDJXLN-KCZCNTNESA-N 0.000 description 2
- XPPKVPWEQAFLFU-UHFFFAOYSA-N diphosphoric acid Chemical class OP(O)(=O)OP(O)(O)=O XPPKVPWEQAFLFU-UHFFFAOYSA-N 0.000 description 2
- WDRWZVWLVBXVOI-QTNFYWBSSA-L dipotassium;(2s)-2-aminopentanedioate Chemical compound [K+].[K+].[O-]C(=O)[C@@H](N)CCC([O-])=O WDRWZVWLVBXVOI-QTNFYWBSSA-L 0.000 description 2
- 201000010099 disease Diseases 0.000 description 2
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 2
- 150000002148 esters Chemical class 0.000 description 2
- 230000005284 excitation Effects 0.000 description 2
- 102000013165 exonuclease Human genes 0.000 description 2
- 210000003811 finger Anatomy 0.000 description 2
- YFHXZQPUBCBNIP-UHFFFAOYSA-N fura-2 Chemical compound CC1=CC=C(N(CC(O)=O)CC(O)=O)C(OCCOC=2C(=CC=3OC(=CC=3C=2)C=2OC(=CN=2)C(O)=O)N(CC(O)=O)CC(O)=O)=C1 YFHXZQPUBCBNIP-UHFFFAOYSA-N 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 2
- PNDZEEPOYCVIIY-UHFFFAOYSA-N indo-1 Chemical compound CC1=CC=C(N(CC(O)=O)CC(O)=O)C(OCCOC=2C(=CC=C(C=2)C=2N=C3[CH]C(=CC=C3C=2)C(O)=O)N(CC(O)=O)CC(O)=O)=C1 PNDZEEPOYCVIIY-UHFFFAOYSA-N 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 235000019421 lipase Nutrition 0.000 description 2
- 150000002739 metals Chemical class 0.000 description 2
- 238000012544 monitoring process Methods 0.000 description 2
- 235000013919 monopotassium glutamate Nutrition 0.000 description 2
- 229910052759 nickel Inorganic materials 0.000 description 2
- QJGQUHMNIGDVPM-UHFFFAOYSA-N nitrogen group Chemical group [N] QJGQUHMNIGDVPM-UHFFFAOYSA-N 0.000 description 2
- 238000001668 nucleic acid synthesis Methods 0.000 description 2
- 230000003204 osmotic effect Effects 0.000 description 2
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 2
- BASFCYQUMIYNBI-UHFFFAOYSA-N platinum Chemical compound [Pt] BASFCYQUMIYNBI-UHFFFAOYSA-N 0.000 description 2
- SCVFZCLFOSHCOH-UHFFFAOYSA-M potassium acetate Chemical compound [K+].CC([O-])=O SCVFZCLFOSHCOH-UHFFFAOYSA-M 0.000 description 2
- 235000019833 protease Nutrition 0.000 description 2
- 235000019419 proteases Nutrition 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- BBEAQIROQSPTKN-UHFFFAOYSA-N pyrene Chemical compound C1=CC=C2C=CC3=CC=CC4=CC=C1C2=C43 BBEAQIROQSPTKN-UHFFFAOYSA-N 0.000 description 2
- 108010054624 red fluorescent protein Proteins 0.000 description 2
- 230000002829 reductive effect Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000010839 reverse transcription Methods 0.000 description 2
- 238000003757 reverse transcription PCR Methods 0.000 description 2
- 238000007480 sanger sequencing Methods 0.000 description 2
- 239000004065 semiconductor Substances 0.000 description 2
- DYPYMMHZGRPOCK-UHFFFAOYSA-N seminaphtharhodafluor Chemical compound O1C(=O)C2=CC=CC=C2C21C(C=CC=1C3=CC=C(O)C=1)=C3OC1=CC(N)=CC=C21 DYPYMMHZGRPOCK-UHFFFAOYSA-N 0.000 description 2
- 150000003384 small molecules Chemical class 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 230000006641 stabilisation Effects 0.000 description 2
- 238000011105 stabilization Methods 0.000 description 2
- 229910052712 strontium Inorganic materials 0.000 description 2
- CIOAGBVUUVVLOB-UHFFFAOYSA-N strontium atom Chemical compound [Sr] CIOAGBVUUVVLOB-UHFFFAOYSA-N 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- JGVWCANSWKRBCS-UHFFFAOYSA-N tetramethylrhodamine thiocyanate Chemical compound [Cl-].C=12C=CC(N(C)C)=CC2=[O+]C2=CC(N(C)C)=CC=C2C=1C1=CC=C(SC#N)C=C1C(O)=O JGVWCANSWKRBCS-UHFFFAOYSA-N 0.000 description 2
- 210000003813 thumb Anatomy 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- 229910052718 tin Inorganic materials 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 125000002264 triphosphate group Chemical group [H]OP(=O)(O[H])OP(=O)(O[H])OP(=O)(O[H])O* 0.000 description 2
- QGKMIGUHVLGJBR-UHFFFAOYSA-M (4z)-1-(3-methylbutyl)-4-[[1-(3-methylbutyl)quinolin-1-ium-4-yl]methylidene]quinoline;iodide Chemical compound [I-].C12=CC=CC=C2N(CCC(C)C)C=CC1=CC1=CC=[N+](CCC(C)C)C2=CC=CC=C12 QGKMIGUHVLGJBR-UHFFFAOYSA-M 0.000 description 1
- NOLHRFLIXVQPSZ-UHFFFAOYSA-N 1,3-thiazolidin-4-one Chemical compound O=C1CSCN1 NOLHRFLIXVQPSZ-UHFFFAOYSA-N 0.000 description 1
- OAKPWEUQDVLTCN-NKWVEPMBSA-N 2',3'-Dideoxyadenosine-5-triphosphate Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1CC[C@@H](CO[P@@](O)(=O)O[P@](O)(=O)OP(O)(O)=O)O1 OAKPWEUQDVLTCN-NKWVEPMBSA-N 0.000 description 1
- PRDFBSVERLRRMY-UHFFFAOYSA-N 2'-(4-ethoxyphenyl)-5-(4-methylpiperazin-1-yl)-2,5'-bibenzimidazole Chemical compound C1=CC(OCC)=CC=C1C1=NC2=CC=C(C=3NC4=CC(=CC=C4N=3)N3CCN(C)CC3)C=C2N1 PRDFBSVERLRRMY-UHFFFAOYSA-N 0.000 description 1
- RNIPJYFZGXJSDD-UHFFFAOYSA-N 2,4,5-triphenyl-1h-imidazole Chemical class C1=CC=CC=C1C1=NC(C=2C=CC=CC=2)=C(C=2C=CC=CC=2)N1 RNIPJYFZGXJSDD-UHFFFAOYSA-N 0.000 description 1
- UFBJCMHMOXMLKC-UHFFFAOYSA-N 2,4-dinitrophenol Chemical compound OC1=CC=C([N+]([O-])=O)C=C1[N+]([O-])=O UFBJCMHMOXMLKC-UHFFFAOYSA-N 0.000 description 1
- XDFNWJDGWJVGGN-UHFFFAOYSA-N 2-(2,7-dichloro-3,6-dihydroxy-9h-xanthen-9-yl)benzoic acid Chemical compound OC(=O)C1=CC=CC=C1C1C2=CC(Cl)=C(O)C=C2OC2=CC(O)=C(Cl)C=C21 XDFNWJDGWJVGGN-UHFFFAOYSA-N 0.000 description 1
- JNGRENQDBKMCCR-UHFFFAOYSA-N 2-(3-amino-6-iminoxanthen-9-yl)benzoic acid;hydrochloride Chemical compound [Cl-].C=12C=CC(=[NH2+])C=C2OC2=CC(N)=CC=C2C=1C1=CC=CC=C1C(O)=O JNGRENQDBKMCCR-UHFFFAOYSA-N 0.000 description 1
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 1
- PDURUKZNVHEHGO-UHFFFAOYSA-N 2-[6-[bis(carboxymethyl)amino]-5-(carboxymethoxy)-1-benzofuran-2-yl]-1,3-oxazole-5-carboxylic acid Chemical compound O1C=2C=C(N(CC(O)=O)CC(O)=O)C(OCC(=O)O)=CC=2C=C1C1=NC=C(C(O)=O)O1 PDURUKZNVHEHGO-UHFFFAOYSA-N 0.000 description 1
- RJPSHDMGSVVHFA-UHFFFAOYSA-N 2-[carboxymethyl-[(7-hydroxy-4-methyl-2-oxochromen-8-yl)methyl]amino]acetic acid Chemical compound OC(=O)CN(CC(O)=O)CC1=C(O)C=CC2=C1OC(=O)C=C2C RJPSHDMGSVVHFA-UHFFFAOYSA-N 0.000 description 1
- OGHAROSJZRTIOK-KQYNXXCUSA-N 2-amino-9-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-7-methylpurin-9-ium-6-olate Chemical compound C12=NC(N)=NC([O-])=C2N(C)C=[N+]1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OGHAROSJZRTIOK-KQYNXXCUSA-N 0.000 description 1
- ASJSAQIRZKANQN-CRCLSJGQSA-N 2-deoxy-D-ribose Chemical compound OC[C@@H](O)[C@@H](O)CC=O ASJSAQIRZKANQN-CRCLSJGQSA-N 0.000 description 1
- DVLFYONBTKHTER-UHFFFAOYSA-N 3-(N-morpholino)propanesulfonic acid Chemical compound OS(=O)(=O)CCCN1CCOCC1 DVLFYONBTKHTER-UHFFFAOYSA-N 0.000 description 1
- HAPJROQJVSPKCJ-UHFFFAOYSA-N 3-[4-[2-[6-(dibutylamino)naphthalen-2-yl]ethenyl]pyridin-1-ium-1-yl]propane-1-sulfonate Chemical compound C1=CC2=CC(N(CCCC)CCCC)=CC=C2C=C1C=CC1=CC=[N+](CCCS([O-])(=O)=O)C=C1 HAPJROQJVSPKCJ-UHFFFAOYSA-N 0.000 description 1
- CPDHBIMOKOHWDD-UHFFFAOYSA-L 3-[4-[4-[4-(diethylamino)phenyl]buta-1,3-dienyl]pyridin-1-ium-1-yl]propyl-triethylazanium;dibromide Chemical compound [Br-].[Br-].C1=CC(N(CC)CC)=CC=C1C=CC=CC1=CC=[N+](CCC[N+](CC)(CC)CC)C=C1 CPDHBIMOKOHWDD-UHFFFAOYSA-L 0.000 description 1
- FWBHETKCLVMNFS-UHFFFAOYSA-N 4',6-Diamino-2-phenylindol Chemical compound C1=CC(C(=N)N)=CC=C1C1=CC2=CC=C(C(N)=N)C=C2N1 FWBHETKCLVMNFS-UHFFFAOYSA-N 0.000 description 1
- YSCNMFDFYJUPEF-OWOJBTEDSA-N 4,4'-diisothiocyano-trans-stilbene-2,2'-disulfonic acid Chemical compound OS(=O)(=O)C1=CC(N=C=S)=CC=C1\C=C\C1=CC=C(N=C=S)C=C1S(O)(=O)=O YSCNMFDFYJUPEF-OWOJBTEDSA-N 0.000 description 1
- LHYQAEFVHIZFLR-UHFFFAOYSA-L 4-(4-diazonio-3-methoxyphenyl)-2-methoxybenzenediazonium;dichloride Chemical compound [Cl-].[Cl-].C1=C([N+]#N)C(OC)=CC(C=2C=C(OC)C([N+]#N)=CC=2)=C1 LHYQAEFVHIZFLR-UHFFFAOYSA-L 0.000 description 1
- ZMERMCRYYFRELX-UHFFFAOYSA-N 5-{[2-(iodoacetamido)ethyl]amino}naphthalene-1-sulfonic acid Chemical compound C1=CC=C2C(S(=O)(=O)O)=CC=CC2=C1NCCNC(=O)CI ZMERMCRYYFRELX-UHFFFAOYSA-N 0.000 description 1
- SGAOZXGJGQEBHA-UHFFFAOYSA-N 82344-98-7 Chemical compound C1CCN2CCCC(C=C3C4(OC(C5=CC(=CC=C54)N=C=S)=O)C4=C5)=C2C1=C3OC4=C1CCCN2CCCC5=C12 SGAOZXGJGQEBHA-UHFFFAOYSA-N 0.000 description 1
- 239000007991 ACES buffer Substances 0.000 description 1
- YIXZUOWWYKISPQ-UHFFFAOYSA-N ATTO 565 para-isomer Chemical compound [O-]Cl(=O)(=O)=O.C=12C=C3CCC[N+](CC)=C3C=C2OC=2C=C3N(CC)CCCC3=CC=2C=1C1=CC(C(O)=O)=CC=C1C(O)=O YIXZUOWWYKISPQ-UHFFFAOYSA-N 0.000 description 1
- PWZJEXGKUHVUFP-UHFFFAOYSA-N ATTO 590 meta-isomer Chemical compound [O-]Cl(=O)(=O)=O.C1=2C=C3C(C)=CC(C)(C)N(CC)C3=CC=2OC2=CC3=[N+](CC)C(C)(C)C=C(C)C3=CC2=C1C1=CC=C(C(O)=O)C=C1C(O)=O PWZJEXGKUHVUFP-UHFFFAOYSA-N 0.000 description 1
- QTBSBXVTEAMEQO-UHFFFAOYSA-M Acetate Chemical compound CC([O-])=O QTBSBXVTEAMEQO-UHFFFAOYSA-M 0.000 description 1
- 241000567139 Aeropyrum pernix Species 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108700028369 Alleles Proteins 0.000 description 1
- USFZMSVCRYTOJT-UHFFFAOYSA-N Ammonium acetate Chemical compound N.CC(O)=O USFZMSVCRYTOJT-UHFFFAOYSA-N 0.000 description 1
- 239000005695 Ammonium acetate Substances 0.000 description 1
- 241000203069 Archaea Species 0.000 description 1
- 108020004634 Archaeal DNA Proteins 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- MWNLTKCQHFZFHN-UHFFFAOYSA-N CBQCA reagent Chemical compound C1=CC(C(=O)O)=CC=C1C(=O)C1=CC2=CC=CC=C2N=C1C=O MWNLTKCQHFZFHN-UHFFFAOYSA-N 0.000 description 1
- ZKQDCIXGCQPQNV-UHFFFAOYSA-N Calcium hypochlorite Chemical compound [Ca+2].Cl[O-].Cl[O-] ZKQDCIXGCQPQNV-UHFFFAOYSA-N 0.000 description 1
- VYZAMTAEIAYCRO-UHFFFAOYSA-N Chromium Chemical compound [Cr] VYZAMTAEIAYCRO-UHFFFAOYSA-N 0.000 description 1
- RURLVUZRUFHCJO-UHFFFAOYSA-N Chromomycin A3 Natural products COC(C1Cc2cc3cc(OC4CC(OC(=O)C)C(OC5CC(O)C(OC)C(C)O5)C(C)O4)c(C)c(O)c3c(O)c2C(=O)C1OC6CC(OC7CC(C)(O)C(OC(=O)C)C(C)O7)C(O)C(C)O6)C(=O)C(O)C(C)O RURLVUZRUFHCJO-UHFFFAOYSA-N 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 1
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 1
- 108010092160 Dactinomycin Proteins 0.000 description 1
- XPDXVDYUQZHFPV-UHFFFAOYSA-N Dansyl Chloride Chemical compound C1=CC=C2C(N(C)C)=CC=CC2=C1S(Cl)(=O)=O XPDXVDYUQZHFPV-UHFFFAOYSA-N 0.000 description 1
- AHCYMLUZIRLXAA-SHYZEUOFSA-N Deoxyuridine 5'-triphosphate Chemical compound O1[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C[C@@H]1N1C(=O)NC(=O)C=C1 AHCYMLUZIRLXAA-SHYZEUOFSA-N 0.000 description 1
- 241000205236 Desulfurococcus Species 0.000 description 1
- 101100284769 Drosophila melanogaster hemo gene Proteins 0.000 description 1
- 108091005941 EBFP Proteins 0.000 description 1
- 108091005942 ECFP Proteins 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 241000588722 Escherichia Species 0.000 description 1
- 101000813126 Escherichia coli O157:H7 Laminin-binding fimbrial subunit ElfA Proteins 0.000 description 1
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 1
- 229910052693 Europium Inorganic materials 0.000 description 1
- OUVXYXNWSVIOSJ-UHFFFAOYSA-N Fluo-4 Chemical compound CC1=CC=C(N(CC(O)=O)CC(O)=O)C(OCCOC=2C(=CC=C(C=2)C2=C3C=C(F)C(=O)C=C3OC3=CC(O)=C(F)C=C32)N(CC(O)=O)CC(O)=O)=C1 OUVXYXNWSVIOSJ-UHFFFAOYSA-N 0.000 description 1
- 102220566467 GDNF family receptor alpha-1_S65A_mutation Human genes 0.000 description 1
- 102220566469 GDNF family receptor alpha-1_S65T_mutation Human genes 0.000 description 1
- 102220566453 GDNF family receptor alpha-1_Y66F_mutation Human genes 0.000 description 1
- 102220566451 GDNF family receptor alpha-1_Y66H_mutation Human genes 0.000 description 1
- 102220566455 GDNF family receptor alpha-1_Y66W_mutation Human genes 0.000 description 1
- GYHNNYVSQQEPJS-UHFFFAOYSA-N Gallium Chemical compound [Ga] GYHNNYVSQQEPJS-UHFFFAOYSA-N 0.000 description 1
- 241000193385 Geobacillus stearothermophilus Species 0.000 description 1
- 244000068988 Glycine max Species 0.000 description 1
- 235000010469 Glycine max Nutrition 0.000 description 1
- 239000007995 HEPES buffer Substances 0.000 description 1
- 108010078851 HIV Reverse Transcriptase Proteins 0.000 description 1
- 241000725303 Human immunodeficiency virus Species 0.000 description 1
- 241000713772 Human immunodeficiency virus 1 Species 0.000 description 1
- 241000713340 Human immunodeficiency virus 2 Species 0.000 description 1
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- 239000007987 MES buffer Substances 0.000 description 1
- 239000007993 MOPS buffer Substances 0.000 description 1
- FYYHWMGAXLPEAU-UHFFFAOYSA-N Magnesium Chemical compound [Mg] FYYHWMGAXLPEAU-UHFFFAOYSA-N 0.000 description 1
- 241000203407 Methanocaldococcus jannaschii Species 0.000 description 1
- 241000203353 Methanococcus Species 0.000 description 1
- 101000930835 Methanococcus voltae DNA polymerase Proteins 0.000 description 1
- 108020005196 Mitochondrial DNA Proteins 0.000 description 1
- DBXNUXBLKRLWFA-UHFFFAOYSA-N N-(2-acetamido)-2-aminoethanesulfonic acid Chemical compound NC(=O)CNCCS(O)(=O)=O DBXNUXBLKRLWFA-UHFFFAOYSA-N 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 229910021586 Nickel(II) chloride Inorganic materials 0.000 description 1
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 1
- 108700020962 Peroxidase Proteins 0.000 description 1
- 108010053210 Phycocyanin Proteins 0.000 description 1
- NPYPAHLBTDXSSS-UHFFFAOYSA-N Potassium ion Chemical compound [K+] NPYPAHLBTDXSSS-UHFFFAOYSA-N 0.000 description 1
- 108010019653 Pwo polymerase Proteins 0.000 description 1
- 241001148023 Pyrococcus abyssi Species 0.000 description 1
- 241000205156 Pyrococcus furiosus Species 0.000 description 1
- 101900050251 Pyrococcus horikoshii DNA polymerase Proteins 0.000 description 1
- 108010021713 Pyrococcus sp GB-D DNA polymerase Proteins 0.000 description 1
- 241001467519 Pyrococcus sp. Species 0.000 description 1
- 241000205192 Pyrococcus woesei Species 0.000 description 1
- 241000204670 Pyrodictium occultum Species 0.000 description 1
- 108091034057 RNA (poly(A)) Proteins 0.000 description 1
- 102000017143 RNA Polymerase I Human genes 0.000 description 1
- 108010013845 RNA Polymerase I Proteins 0.000 description 1
- 102000009572 RNA Polymerase II Human genes 0.000 description 1
- 108010009460 RNA Polymerase II Proteins 0.000 description 1
- 102000014450 RNA Polymerase III Human genes 0.000 description 1
- 108010078067 RNA Polymerase III Proteins 0.000 description 1
- 108091008103 RNA aptamers Proteins 0.000 description 1
- 238000002123 RNA extraction Methods 0.000 description 1
- 238000003559 RNA-seq method Methods 0.000 description 1
- 238000001237 Raman spectrum Methods 0.000 description 1
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 1
- 241000193448 Ruminiclostridium thermocellum Species 0.000 description 1
- 101100260935 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) TOD6 gene Proteins 0.000 description 1
- BUGBHKTXTAQXES-UHFFFAOYSA-N Selenium Chemical compound [Se] BUGBHKTXTAQXES-UHFFFAOYSA-N 0.000 description 1
- FKNQFGJONOIPTF-UHFFFAOYSA-N Sodium cation Chemical compound [Na+] FKNQFGJONOIPTF-UHFFFAOYSA-N 0.000 description 1
- 241000205098 Sulfolobus acidocaldarius Species 0.000 description 1
- 241000205091 Sulfolobus solfataricus Species 0.000 description 1
- UZMAPBJVXOGOFT-UHFFFAOYSA-N Syringetin Natural products COC1=C(O)C(OC)=CC(C2=C(C(=O)C3=C(O)C=C(O)C=C3O2)O)=C1 UZMAPBJVXOGOFT-UHFFFAOYSA-N 0.000 description 1
- 101710137500 T7 RNA polymerase Proteins 0.000 description 1
- 108010017842 Telomerase Proteins 0.000 description 1
- 102100032938 Telomerase reverse transcriptase Human genes 0.000 description 1
- 229910052771 Terbium Inorganic materials 0.000 description 1
- 101000865050 Thermococcus fumicolans DNA polymerase Proteins 0.000 description 1
- 241001237851 Thermococcus gorgonarius Species 0.000 description 1
- 241001235254 Thermococcus kodakarensis Species 0.000 description 1
- 241000204666 Thermotoga maritima Species 0.000 description 1
- 241000589596 Thermus Species 0.000 description 1
- 241000589500 Thermus aquaticus Species 0.000 description 1
- 241000589498 Thermus filiformis Species 0.000 description 1
- 108010085671 Thermus thermophilus DNA polymerase Proteins 0.000 description 1
- RTAQQCXQSZGOHL-UHFFFAOYSA-N Titanium Chemical compound [Ti] RTAQQCXQSZGOHL-UHFFFAOYSA-N 0.000 description 1
- 241000283907 Tragelaphus oryx Species 0.000 description 1
- GYDJEQRTZSCIOI-UHFFFAOYSA-N Tranexamic acid Chemical compound NCC1CCC(C(O)=O)CC1 GYDJEQRTZSCIOI-UHFFFAOYSA-N 0.000 description 1
- 102220615016 Transcription elongation regulator 1_S65C_mutation Human genes 0.000 description 1
- 102000004357 Transferases Human genes 0.000 description 1
- 108090000992 Transferases Proteins 0.000 description 1
- 239000007997 Tricine buffer Substances 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 108700002693 Viral Replicase Complex Proteins Proteins 0.000 description 1
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 1
- UYRDHEJRPVSJFM-VSWVFQEASA-N [(1s,3r)-3-hydroxy-4-[(3e,5e,7e,9e,11z)-11-[4-[(e)-2-[(1r,3s,6s)-3-hydroxy-1,5,5-trimethyl-7-oxabicyclo[4.1.0]heptan-6-yl]ethenyl]-5-oxofuran-2-ylidene]-3,10-dimethylundeca-1,3,5,7,9-pentaenylidene]-3,5,5-trimethylcyclohexyl] acetate Chemical compound C[C@@]1(O)C[C@@H](OC(=O)C)CC(C)(C)C1=C=C\C(C)=C\C=C\C=C\C=C(/C)\C=C/1C=C(\C=C\[C@]23[C@@](O2)(C)C[C@@H](O)CC3(C)C)C(=O)O\1 UYRDHEJRPVSJFM-VSWVFQEASA-N 0.000 description 1
- APERIXFHHNDFQV-UHFFFAOYSA-N [2-[2-[2-[bis(carboxymethyl)amino]-5-methylphenoxy]ethoxy]-4-[3,6-bis(dimethylamino)xanthen-9-ylidene]cyclohexa-2,5-dien-1-ylidene]-bis(carboxymethyl)azanium;chloride Chemical compound [Cl-].C12=CC=C(N(C)C)C=C2OC2=CC(N(C)C)=CC=C2C1=C(C=1)C=CC(=[N+](CC(O)=O)CC(O)=O)C=1OCCOC1=CC(C)=CC=C1N(CC(O)=O)CC(O)=O APERIXFHHNDFQV-UHFFFAOYSA-N 0.000 description 1
- 241000193445 [Clostridium] stercorarium Species 0.000 description 1
- HDRRAMINWIWTNU-NTSWFWBYSA-N [[(2s,5r)-5-(2-amino-6-oxo-3h-purin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl] phosphono hydrogen phosphate Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@H]1CC[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 HDRRAMINWIWTNU-NTSWFWBYSA-N 0.000 description 1
- ARLKCWCREKRROD-POYBYMJQSA-N [[(2s,5r)-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl] phosphono hydrogen phosphate Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)CC1 ARLKCWCREKRROD-POYBYMJQSA-N 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- RJURFGZVJUQBHK-IIXSONLDSA-N actinomycin D Chemical compound C[C@H]1OC(=O)[C@H](C(C)C)N(C)C(=O)CN(C)C(=O)[C@@H]2CCCN2C(=O)[C@@H](C(C)C)NC(=O)[C@H]1NC(=O)C1=C(N)C(=O)C(C)=C2OC(C(C)=CC=C3C(=O)N[C@@H]4C(=O)N[C@@H](C(N5CCC[C@H]5C(=O)N(C)CC(=O)N(C)[C@@H](C(C)C)C(=O)O[C@@H]4C)=O)C(C)C)=C3N=C21 RJURFGZVJUQBHK-IIXSONLDSA-N 0.000 description 1
- 229940125528 allosteric inhibitor Drugs 0.000 description 1
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 229940126575 aminoglycoside Drugs 0.000 description 1
- 229940043376 ammonium acetate Drugs 0.000 description 1
- 235000019257 ammonium acetate Nutrition 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- 238000000149 argon plasma sintering Methods 0.000 description 1
- 229910052785 arsenic Inorganic materials 0.000 description 1
- RQNWIZPPADIBDY-UHFFFAOYSA-N arsenic atom Chemical compound [As] RQNWIZPPADIBDY-UHFFFAOYSA-N 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- JPIYZTWMUGTEHX-UHFFFAOYSA-N auramine O free base Chemical compound C1=CC(N(C)C)=CC=C1C(=N)C1=CC=C(N(C)C)C=C1 JPIYZTWMUGTEHX-UHFFFAOYSA-N 0.000 description 1
- 150000001540 azides Chemical class 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- DEGAKNSWVGKMLS-UHFFFAOYSA-N calcein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC(CN(CC(O)=O)CC(O)=O)=C(O)C=C1OC1=C2C=C(CN(CC(O)=O)CC(=O)O)C(O)=C1 DEGAKNSWVGKMLS-UHFFFAOYSA-N 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000002738 chelating agent Substances 0.000 description 1
- 238000007385 chemical modification Methods 0.000 description 1
- 229910052804 chromium Inorganic materials 0.000 description 1
- 239000011651 chromium Substances 0.000 description 1
- ZYVSOIYQKUDENJ-WKSBCEQHSA-N chromomycin A3 Chemical compound O([C@@H]1C[C@@H](O[C@H](C)[C@@H]1OC(C)=O)OC=1C=C2C=C3C[C@H]([C@@H](C(=O)C3=C(O)C2=C(O)C=1C)O[C@@H]1O[C@H](C)[C@@H](O)[C@H](O[C@@H]2O[C@H](C)[C@@H](O)[C@H](O[C@@H]3O[C@@H](C)[C@H](OC(C)=O)[C@@](C)(O)C3)C2)C1)[C@H](OC)C(=O)[C@@H](O)[C@@H](C)O)[C@@H]1C[C@@H](O)[C@@H](OC)[C@@H](C)O1 ZYVSOIYQKUDENJ-WKSBCEQHSA-N 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 229910017052 cobalt Inorganic materials 0.000 description 1
- 239000010941 cobalt Substances 0.000 description 1
- GUTLYIVDDKVIGB-UHFFFAOYSA-N cobalt atom Chemical compound [Co] GUTLYIVDDKVIGB-UHFFFAOYSA-N 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 229910052802 copper Inorganic materials 0.000 description 1
- 239000010949 copper Substances 0.000 description 1
- GLNDAGDHSLMOKX-UHFFFAOYSA-N coumarin 120 Chemical compound C1=C(N)C=CC2=C1OC(=O)C=C2C GLNDAGDHSLMOKX-UHFFFAOYSA-N 0.000 description 1
- 238000004132 cross linking Methods 0.000 description 1
- 239000003431 cross linking reagent Substances 0.000 description 1
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 1
- 229960000640 dactinomycin Drugs 0.000 description 1
- URGJWIFLBWJRMF-JGVFFNPUSA-N ddTTP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)CC1 URGJWIFLBWJRMF-JGVFFNPUSA-N 0.000 description 1
- 239000003398 denaturant Substances 0.000 description 1
- CFCUWKMKBJTWLW-UHFFFAOYSA-N deoliosyl-3C-alpha-L-digitoxosyl-MTM Natural products CC=1C(O)=C2C(O)=C3C(=O)C(OC4OC(C)C(O)C(OC5OC(C)C(O)C(OC6OC(C)C(O)C(C)(O)C6)C5)C4)C(C(OC)C(=O)C(O)C(C)O)CC3=CC2=CC=1OC(OC(C)C1O)CC1OC1CC(O)C(O)C(C)O1 CFCUWKMKBJTWLW-UHFFFAOYSA-N 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- KCFYHBSOLOXZIF-UHFFFAOYSA-N dihydrochrysin Natural products COC1=C(O)C(OC)=CC(C2OC3=CC(O)=CC(O)=C3C(=O)C2)=C1 KCFYHBSOLOXZIF-UHFFFAOYSA-N 0.000 description 1
- JVXZRNYCRFIEGV-UHFFFAOYSA-M dilC18(3) dye Chemical compound [O-]Cl(=O)(=O)=O.CC1(C)C2=CC=CC=C2N(CCCCCCCCCCCCCCCCCC)C1=CC=CC1=[N+](CCCCCCCCCCCCCCCCCC)C2=CC=CC=C2C1(C)C JVXZRNYCRFIEGV-UHFFFAOYSA-M 0.000 description 1
- ZQSBJPAQPRVNHU-UHFFFAOYSA-M dilC18(5) dye Chemical compound [O-]Cl(=O)(=O)=O.CC1(C)C2=CC=CC=C2N(CCCCCCCCCCCCCCCCCC)C1=CC=CC=CC1=[N+](CCCCCCCCCCCCCCCCCC)C2=CC=CC=C2C1(C)C ZQSBJPAQPRVNHU-UHFFFAOYSA-M 0.000 description 1
- 125000002147 dimethylamino group Chemical group [H]C([H])([H])N(*)C([H])([H])[H] 0.000 description 1
- 238000012172 direct RNA sequencing Methods 0.000 description 1
- 238000006073 displacement reaction Methods 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000007877 drug screening Methods 0.000 description 1
- 108010048367 enhanced green fluorescent protein Proteins 0.000 description 1
- YQGOJNYOYNNSMM-UHFFFAOYSA-N eosin Chemical compound [Na+].OC(=O)C1=CC=CC=C1C1=C2C=C(Br)C(=O)C(Br)=C2OC2=C(Br)C(O)=C(Br)C=C21 YQGOJNYOYNNSMM-UHFFFAOYSA-N 0.000 description 1
- IINNWAYUJNWZRM-UHFFFAOYSA-L erythrosin B Chemical compound [Na+].[Na+].[O-]C(=O)C1=CC=CC=C1C1=C2C=C(I)C(=O)C(I)=C2OC2=C(I)C([O-])=C(I)C=C21 IINNWAYUJNWZRM-UHFFFAOYSA-L 0.000 description 1
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 1
- 229960005542 ethidium bromide Drugs 0.000 description 1
- DEFVIWRASFVYLL-UHFFFAOYSA-N ethylene glycol bis(2-aminoethyl)tetraacetic acid Chemical compound OC(=O)CN(CC(O)=O)CCOCCOCCN(CC(O)=O)CC(O)=O DEFVIWRASFVYLL-UHFFFAOYSA-N 0.000 description 1
- OGPBJKLSAFTDLK-UHFFFAOYSA-N europium atom Chemical compound [Eu] OGPBJKLSAFTDLK-UHFFFAOYSA-N 0.000 description 1
- NNMXSTWQJRPBJZ-UHFFFAOYSA-K europium(iii) chloride Chemical compound Cl[Eu](Cl)Cl NNMXSTWQJRPBJZ-UHFFFAOYSA-K 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000002550 fecal effect Effects 0.000 description 1
- 230000001605 fetal effect Effects 0.000 description 1
- GVEPBJHOBDJJJI-UHFFFAOYSA-N fluoranthrene Natural products C1=CC(C2=CC=CC=C22)=C3C2=CC=CC3=C1 GVEPBJHOBDJJJI-UHFFFAOYSA-N 0.000 description 1
- MHMNJMPURVTYEJ-UHFFFAOYSA-N fluorescein-5-isothiocyanate Chemical compound O1C(=O)C2=CC(N=C=S)=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 MHMNJMPURVTYEJ-UHFFFAOYSA-N 0.000 description 1
- 238000002073 fluorescence micrograph Methods 0.000 description 1
- 238000012632 fluorescent imaging Methods 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 108020001507 fusion proteins Proteins 0.000 description 1
- 102000037865 fusion proteins Human genes 0.000 description 1
- 229910052733 gallium Inorganic materials 0.000 description 1
- 238000012239 gene modification Methods 0.000 description 1
- 238000012252 genetic analysis Methods 0.000 description 1
- 230000005017 genetic modification Effects 0.000 description 1
- 235000013617 genetically modified food Nutrition 0.000 description 1
- 238000011331 genomic analysis Methods 0.000 description 1
- 229910052732 germanium Inorganic materials 0.000 description 1
- GNPVGFCGXDBREM-UHFFFAOYSA-N germanium atom Chemical compound [Ge] GNPVGFCGXDBREM-UHFFFAOYSA-N 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 229930195712 glutamate Natural products 0.000 description 1
- 230000036449 good health Effects 0.000 description 1
- 229910052736 halogen Inorganic materials 0.000 description 1
- 238000013537 high throughput screening Methods 0.000 description 1
- 239000012145 high-salt buffer Substances 0.000 description 1
- 230000028993 immune response Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000011065 in-situ storage Methods 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- 229910052742 iron Inorganic materials 0.000 description 1
- 230000002427 irreversible effect Effects 0.000 description 1
- 238000011901 isothermal amplification Methods 0.000 description 1
- 238000007834 ligase chain reaction Methods 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- DLBFLQKQABVKGT-UHFFFAOYSA-L lucifer yellow dye Chemical compound [Li+].[Li+].[O-]S(=O)(=O)C1=CC(C(N(C(=O)NN)C2=O)=O)=C3C2=CC(S([O-])(=O)=O)=CC3=C1N DLBFLQKQABVKGT-UHFFFAOYSA-L 0.000 description 1
- 238000004020 luminiscence type Methods 0.000 description 1
- HWYHZTIRURJOHG-UHFFFAOYSA-N luminol Chemical compound O=C1NNC(=O)C2=C1C(N)=CC=C2 HWYHZTIRURJOHG-UHFFFAOYSA-N 0.000 description 1
- 229910052749 magnesium Inorganic materials 0.000 description 1
- 239000011777 magnesium Substances 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- CFCUWKMKBJTWLW-BKHRDMLASA-N mithramycin Chemical compound O([C@@H]1C[C@@H](O[C@H](C)[C@H]1O)OC=1C=C2C=C3C[C@H]([C@@H](C(=O)C3=C(O)C2=C(O)C=1C)O[C@@H]1O[C@H](C)[C@@H](O)[C@H](O[C@@H]2O[C@H](C)[C@H](O)[C@H](O[C@@H]3O[C@H](C)[C@@H](O)[C@@](C)(O)C3)C2)C1)[C@H](OC)C(=O)[C@@H](O)[C@@H](C)O)[C@H]1C[C@@H](O)[C@H](O)[C@@H](C)O1 CFCUWKMKBJTWLW-BKHRDMLASA-N 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- MLEBFEHOJICQQS-UHFFFAOYSA-N monodansylcadaverine Chemical compound C1=CC=C2C(N(C)C)=CC=CC2=C1S(=O)(=O)NCCCCCN MLEBFEHOJICQQS-UHFFFAOYSA-N 0.000 description 1
- VMGAPWLDMVPYIA-HIDZBRGKSA-N n'-amino-n-iminomethanimidamide Chemical compound N\N=C\N=N VMGAPWLDMVPYIA-HIDZBRGKSA-N 0.000 description 1
- CSJXLKVNKAXFSI-UHFFFAOYSA-N n-(2-aminoethyl)-5-(dimethylamino)naphthalene-1-sulfonamide Chemical compound C1=CC=C2C(N(C)C)=CC=CC2=C1S(=O)(=O)NCCN CSJXLKVNKAXFSI-UHFFFAOYSA-N 0.000 description 1
- 239000002159 nanocrystal Substances 0.000 description 1
- QMMRZOWCJAIUJA-UHFFFAOYSA-L nickel dichloride Chemical compound Cl[Ni]Cl QMMRZOWCJAIUJA-UHFFFAOYSA-L 0.000 description 1
- VOFUROIFQGPCGE-UHFFFAOYSA-N nile red Chemical compound C1=CC=C2C3=NC4=CC=C(N(CC)CC)C=C4OC3=CC(=O)C2=C1 VOFUROIFQGPCGE-UHFFFAOYSA-N 0.000 description 1
- 108091027963 non-coding RNA Proteins 0.000 description 1
- 102000042567 non-coding RNA Human genes 0.000 description 1
- 239000002777 nucleoside Substances 0.000 description 1
- 229960002378 oftasceine Drugs 0.000 description 1
- 229940124276 oligodeoxyribonucleotide Drugs 0.000 description 1
- 239000002751 oligonucleotide probe Substances 0.000 description 1
- 150000003891 oxalate salts Chemical class 0.000 description 1
- 125000003431 oxalo group Chemical group 0.000 description 1
- 239000007800 oxidant agent Substances 0.000 description 1
- 230000001590 oxidative effect Effects 0.000 description 1
- 125000004430 oxygen atom Chemical group O* 0.000 description 1
- 125000000636 p-nitrophenyl group Chemical group [H]C1=C([H])C(=C([H])C([H])=C1*)[N+]([O-])=O 0.000 description 1
- VYNDHICBIRRPFP-UHFFFAOYSA-N pacific blue Chemical compound FC1=C(O)C(F)=C2OC(=O)C(C(=O)O)=CC2=C1 VYNDHICBIRRPFP-UHFFFAOYSA-N 0.000 description 1
- 239000013610 patient sample Substances 0.000 description 1
- 101150092317 pbf1 gene Proteins 0.000 description 1
- UTIQDNPUHSAVDN-UHFFFAOYSA-N peridinin Natural products CC(=O)OC1CC(C)(C)C(=C=CC(=CC=CC=CC=C2/OC(=O)C(=C2)C=CC34OC3(C)CC(O)CC4(C)C)C)C(C)(O)C1 UTIQDNPUHSAVDN-UHFFFAOYSA-N 0.000 description 1
- 108040007629 peroxidase activity proteins Proteins 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 150000008300 phosphoramidites Chemical class 0.000 description 1
- 229910052698 phosphorus Inorganic materials 0.000 description 1
- 229910052697 platinum Inorganic materials 0.000 description 1
- 229960003171 plicamycin Drugs 0.000 description 1
- 230000010287 polarization Effects 0.000 description 1
- 230000000379 polymerizing effect Effects 0.000 description 1
- 229910052700 potassium Inorganic materials 0.000 description 1
- 235000011056 potassium acetate Nutrition 0.000 description 1
- 229910001414 potassium ion Inorganic materials 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 239000002987 primer (paints) Substances 0.000 description 1
- 108090000765 processed proteins & peptides Proteins 0.000 description 1
- 125000006239 protecting group Chemical group 0.000 description 1
- 238000000164 protein isolation Methods 0.000 description 1
- INCIMLINXXICKS-UHFFFAOYSA-M pyronin Y Chemical compound [Cl-].C1=CC(=[N+](C)C)C=C2OC3=CC(N(C)C)=CC=C3C=C21 INCIMLINXXICKS-UHFFFAOYSA-M 0.000 description 1
- 238000012175 pyrosequencing Methods 0.000 description 1
- 239000002096 quantum dot Substances 0.000 description 1
- 238000006862 quantum yield reaction Methods 0.000 description 1
- UKOBAUFLOGFCMV-UHFFFAOYSA-N quinacrine mustard Chemical compound C1=C(Cl)C=CC2=C(NC(C)CCCN(CCCl)CCCl)C3=CC(OC)=CC=C3N=C21 UKOBAUFLOGFCMV-UHFFFAOYSA-N 0.000 description 1
- RJSRSRITMWVIQT-UHFFFAOYSA-N quinolin-6-amine Chemical compound N1=CC=CC2=CC(N)=CC=C21 RJSRSRITMWVIQT-UHFFFAOYSA-N 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- BOLDJAUMGUJJKM-LSDHHAIUSA-N renifolin D Natural products CC(=C)[C@@H]1Cc2c(O)c(O)ccc2[C@H]1CC(=O)c3ccc(O)cc3O BOLDJAUMGUJJKM-LSDHHAIUSA-N 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- HSSLDCABUXLXKM-UHFFFAOYSA-N resorufin Chemical compound C1=CC(=O)C=C2OC3=CC(O)=CC=C3N=C21 HSSLDCABUXLXKM-UHFFFAOYSA-N 0.000 description 1
- 230000004043 responsiveness Effects 0.000 description 1
- MYFATKRONKHHQL-UHFFFAOYSA-N rhodamine 123 Chemical compound [Cl-].COC(=O)C1=CC=CC=C1C1=C2C=CC(=[NH2+])C=C2OC2=CC(N)=CC=C21 MYFATKRONKHHQL-UHFFFAOYSA-N 0.000 description 1
- 229940043267 rhodamine b Drugs 0.000 description 1
- 108700038288 rhodamine-phalloidin Proteins 0.000 description 1
- 229910052703 rhodium Inorganic materials 0.000 description 1
- 239000010948 rhodium Substances 0.000 description 1
- MHOVAHRLVXNVSD-UHFFFAOYSA-N rhodium atom Chemical compound [Rh] MHOVAHRLVXNVSD-UHFFFAOYSA-N 0.000 description 1
- 239000003161 ribonuclease inhibitor Substances 0.000 description 1
- 102200089551 rs5030826 Human genes 0.000 description 1
- 210000003296 saliva Anatomy 0.000 description 1
- 229910052706 scandium Inorganic materials 0.000 description 1
- SIXSYDAISGFNSX-UHFFFAOYSA-N scandium atom Chemical compound [Sc] SIXSYDAISGFNSX-UHFFFAOYSA-N 0.000 description 1
- 229910052711 selenium Inorganic materials 0.000 description 1
- 239000011669 selenium Substances 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 229910001415 sodium ion Inorganic materials 0.000 description 1
- UGJCNRLBGKEGEH-UHFFFAOYSA-N sodium-binding benzofuran isophthalate Chemical compound COC1=CC=2C=C(C=3C(=CC(=CC=3)C(O)=O)C(O)=O)OC=2C=C1N(CCOCC1)CCOCCOCCN1C(C(=CC=1C=2)OC)=CC=1OC=2C1=CC=C(C(O)=O)C=C1C(O)=O UGJCNRLBGKEGEH-UHFFFAOYSA-N 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 230000000087 stabilizing effect Effects 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 229910001631 strontium chloride Inorganic materials 0.000 description 1
- AHBGXTDRMVNFER-UHFFFAOYSA-L strontium dichloride Chemical compound [Cl-].[Cl-].[Sr+2] AHBGXTDRMVNFER-UHFFFAOYSA-L 0.000 description 1
- 108091035539 telomere Proteins 0.000 description 1
- 102000055501 telomere Human genes 0.000 description 1
- 210000003411 telomere Anatomy 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- GZCRRIHWUXGPOV-UHFFFAOYSA-N terbium atom Chemical compound [Tb] GZCRRIHWUXGPOV-UHFFFAOYSA-N 0.000 description 1
- LQSATJAZEBYDQQ-UHFFFAOYSA-J tetrapotassium;2-[4-[bis(carboxylatomethyl)amino]-3-(carboxylatomethoxy)phenyl]-1h-indole-6-carboxylate Chemical compound [K+].[K+].[K+].[K+].C1=C(N(CC([O-])=O)CC([O-])=O)C(OCC(=O)[O-])=CC(C=2NC3=CC(=CC=C3C=2)C([O-])=O)=C1 LQSATJAZEBYDQQ-UHFFFAOYSA-J 0.000 description 1
- QOFZZTBWWJNFCA-UHFFFAOYSA-N texas red-X Chemical compound [O-]S(=O)(=O)C1=CC(S(=O)(=O)NCCCCCC(=O)O)=CC=C1C(C1=CC=2CCCN3CCCC(C=23)=C1O1)=C2C1=C(CCC1)C3=[N+]1CCCC3=C2 QOFZZTBWWJNFCA-UHFFFAOYSA-N 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- ACOJCCLIDPZYJC-UHFFFAOYSA-M thiazole orange Chemical compound CC1=CC=C(S([O-])(=O)=O)C=C1.C1=CC=C2C(C=C3N(C4=CC=CC=C4S3)C)=CC=[N+](C)C2=C1 ACOJCCLIDPZYJC-UHFFFAOYSA-M 0.000 description 1
- 125000003396 thiol group Chemical group [H]S* 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 239000010936 titanium Substances 0.000 description 1
- 229910052719 titanium Inorganic materials 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 230000005945 translocation Effects 0.000 description 1
- 238000004627 transmission electron microscopy Methods 0.000 description 1
- 239000001226 triphosphate Substances 0.000 description 1
- 235000011178 triphosphate Nutrition 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- 210000002700 urine Anatomy 0.000 description 1
- 229910052720 vanadium Inorganic materials 0.000 description 1
- GPPXJZIENCGNKB-UHFFFAOYSA-N vanadium Chemical compound [V]#[V] GPPXJZIENCGNKB-UHFFFAOYSA-N 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 229910052725 zinc Inorganic materials 0.000 description 1
- 239000011701 zinc Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2535/00—Reactions characterised by the assay type for determining the identity of a nucleotide base or a sequence of oligonucleotides
- C12Q2535/113—Cycle sequencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2537/00—Reactions characterised by the reaction format or use of a specific feature
- C12Q2537/10—Reactions characterised by the reaction format or use of a specific feature the purpose or use of
- C12Q2537/143—Multiplexing, i.e. use of multiple primers or probes in a single reaction, usually for simultaneously analyse of multiple analysis
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2565/00—Nucleic acid analysis characterised by mode or means of detection
- C12Q2565/50—Detection characterised by immobilisation to a surface
- C12Q2565/518—Detection characterised by immobilisation to a surface characterised by the immobilisation of the nucleic acid sample or target
Definitions
- the present disclosure relates in some aspects to methods and compositions for analyzing nucleic acid sequences, such as for determining sequences of nucleic acid molecules in a clonal cluster, including DNA sequencing methods and nucleotide compositions.
- fluorescent labels can be used to present different signals at different wavelengths.
- fewer than four different fluorescence emission channels are preferred, and improved methods for nucleic acid sequencing are needed. Provided herein are methods, compositions, and kits that meet such and other needs.
- provided herein are methods and systems to code the four nucleotides (A, T/U, G, and C) with two excitation wavelengths and less than four emission bandwidths.
- methods to code the four bases in DNA sequencing with different detectable labels and two illumination light sources.
- optimized label configurations where three or two fluorescence images can be used to determine the base extended at individual clusters with improve accuracy and simplified optical design.
- a method for determining a sequence of a polynucleotide template comprising: a) contacting the polynucleotide template with a first pool of nucleotides comprising: i) a nucleotide of a first base labeled with a first label configured to be detected at a first wavelength, ii) a nucleotide of a second base which is different from the first base and which nucleotide is labeled with a second label configured to be detected at a second wavelength which is different from the first wavelength, and iii) one or more nucleotides of a third base which is different from the first and second bases, wherein the one or more nucleotides comprise a nucleotide labeled with a third label configured to be detected at the first wavelength and a nucleotide labeled with a fourth label configured to be detected at the second wavelength.
- the method comprises b) allowing binding and optional incorporation of a nucleotide of the first pool templated on the polynucleotide template, wherein the bound and optionally incorporated nucleotide is complementary to a nucleotide residue at a first nucleotide position in the polynucleotide template.
- the method can comprise c) imaging the polynucleotide template to detect a signal (or record an absence of the signal) associated with the bound and optionally incorporated nucleotide of the first pool.
- the method can comprise: i) using a first power to image the polynucleotide template at the first wavelength and not at the second wavelength to acquire a first image, wherein a signal associated with the first label and/or a signal associated with the third label is detected or an absence of the signal is recorded, ii) using a second power to image the polynucleotide template at the second wavelength and not at the first wavelength to acquire a second image, wherein a signal associated with the second label and/or a signal associated with the fourth label is detected or an absence of the signal is recorded, and iii) using a third power to image the polynucleotide template at the first and second wavelengths to acquire a third image, wherein the third power is less than the first power and/or the second power, wherein a signal associated with the first label and/or a signal associated with the third label is detected at a signal intensity lower than that detected in c)i) or an absence of the signal is recorded, and wherein
- the method can comprise generating a signal codeword comprising signal codes each corresponding to the intensity of the signal detected in c)i) through c)iii) (or recorded absence of the signal), wherein the signal codeword corresponds to the identity of the base in the bound and optionally incorporated nucleotide.
- the method can comprise identifying the nucleotide residue at the first nucleotide position in the polynucleotide template.
- the first label and the third label can be different labels or the same label.
- the second label and the fourth label can be different labels or the same label.
- the one or more nucleotides of the third base can comprise a nucleotide labeled with the third label and the fourth label.
- the one or more nucleotides of the third base can comprise a first nucleotide species labeled with the third label and a second nucleotide species labeled with the fourth label.
- the third label and the fourth label can be different labels or the same label.
- the first and second wavelengths can correspond to different channels of a fluorescence microscope.
- the signal associated with the first label or the signal associated with the third label can be detected at a first signal intensity.
- the signal associated with the second label or the signal associated with the fourth label can be detected at a second signal intensity.
- the first and second signal intensities can be different by no more than 25%, no more than 20%, no more than 15%, no more than 10%, no more than 5%, or no more than 1%. In any of the preceding embodiments, the first and second signal intensities can be the same.
- the signal associated with the first label or the signal associated with the third label can be detected at a signal intensity lower than the first signal intensity.
- the signal associated with the second label or the signal associated with the fourth label can be detected at a signal intensity lower than the second signal intensity.
- the signal associated with the first label or the signal associated with the third label can be detected at a signal intensity between about 25% and about 75% of the first signal intensity.
- the signal associated with the second label or the signal associated with the fourth label can be detected at a signal intensity between about 25% and about 75% of the second signal intensity.
- the signal associated with the first label or the signal associated with the third label can be detected at a signal intensity about 50% of the first signal intensity.
- the signal associated with the second label or the signal associated with the fourth label can be detected at a signal intensity about 50% of the second signal intensity.
- the first pool of nucleotides can comprise a nucleotide of a fourth base which is different from the first, second, and third bases.
- the nucleotide of the fourth base can be configured to be nondetectable at the first wavelength and the second wavelength.
- the nucleotide of the fourth base can be free of a detectable label.
- the nucleotide of the fourth base can be configured to be detected at a third wavelength different from the first and second wavelengths.
- the polynucleotide template can be one of a plurality of polynucleotide templates having different template sequences. In any of the preceding embodiments, the sequences of the plurality of polynucleotide templates are determined. In any of the preceding embodiments, the polynucleotide template can be immobilized on a substrate. In any of the preceding embodiments, the polynucleotide template can be in one or more polynucleotides immobilized on the substrate.
- the polynucleotide template can be in a rolling circle amplification product immobilized on the substrate, wherein the rolling circle amplification product comprises multiple copies of the template sequence in the polynucleotide template.
- the polynucleotide template can be in a cluster of multiple polynucleotides immobilized on the substrate, wherein each polynucleotide in the cluster comprises a copy of the template sequence in the polynucleotide template.
- the plurality of polynucleotide templates can be imaged, and the first image can comprise the signal associated with the first label at the location of a first polynucleotide template, a recorded absence of a signal at the location of a second polynucleotide template, and the signal associated with the third label at the location of a third polynucleotide template.
- the plurality of polynucleotide templates can be imaged and the second image can comprise a recorded absence of a signal at the location of the first polynucleotide template, the signal associated with the second label at the location of the second polynucleotide template, and the signal associated with the fourth label at the location of a third polynucleotide template.
- the plurality of polynucleotide templates can be imaged and the third image can comprise: the signal associated with the first label at the location of the first polynucleotide template; the signal associated with the second label at the location of the second polynucleotide template, and the signal associated with the third label and the signal associated with the fourth label at the location of a third polynucleotide template.
- the first and second images can be acquired using the same power or powers that differ by no more than 25%, no more than 20%, no more than 15%, no more than 10%, no more than 5%, or no more than 1%, and the c)iii), the third image can be acquired using a power that is between about 25% and about 75% (optionally 50%) of the power(s) used in c)i) and/or c)ii).
- a method for determining sequences of a plurality of polynucleotide templates having different template sequences comprising: a) contacting a substrate having clusters of polynucleotides immobilized thereon with a first pool of nucleotides and with oligonucleotide primers, in any order, whereby each cluster comprises: i) multiple copies of one of the different template sequences, and ii) an oligonucleotide primer of the oligonucleotide primers annealed to a primer binding sequence for extension of the oligonucleotide primer templated on a copy of the template sequence, wherein the first pool of nucleotides comprises: i) nucleotides of a first base which are labeled with a label configured to be detected at a first wavelength, ii) nucleotides of a second base which are labeled with a label configured to be detected at a second wavelength which
- the method further comprises: b) allowing binding and optional incorporation of the nucleotides of the first pool templated on the multiple copies of the template sequence at each cluster, wherein the bound and optionally incorporated nucleotides are complementary to nucleotides at a first nucleotide position in the template sequences.
- the method can further comprise: c) imaging the substrate to detect signals or absence thereof at the clusters, wherein the signals are associated with the bound and optionally incorporated nucleotides of the first pool, and wherein the imaging comprises: i) imaging the substrate at the first wavelength and not at the second wavelength to acquire a first image, ii) imaging the substrate at the second wavelength and not at the first wavelength to acquire a second image, and iii) imaging the substrate at the first and second wavelengths simultaneously to acquire a third image, wherein in the third image, the signal intensity associated with the third base is no less than the signal intensity associated with the first base or the signal intensity associated with the second base.
- the method can further comprise generating, for each cluster, a signal codeword comprising signal codes corresponding to the signals or absence thereof detected in c)i) through c)iii), wherein different signal codewords correspond to different bases, thereby determining the bases at the first nucleotide position in the plurality of polynucleotide templates.
- the first pool of nucleotides can comprise nucleotides of a fourth base which are unlabeled and/or are undetected in the imaging.
- the first pool of nucleotides may not comprise nucleotides of a fourth base.
- the nucleotides of the third base can comprise a nucleotide labeled both with the label configured to be detected at the first wavelength and with the label configured to be detected at the second wavelength.
- the nucleotides of the third base can comprise a first nucleotide labeled with only the label configured to be detected at the first wavelength and a second nucleotide labeled with only the label configured to be detected at the second wavelength.
- the ratio of the labels configured to be detected at the first or second wavelength and the nucleotide can be independently 1:1, 2:1, 3:1, 4:1, 5:1, or greater.
- the labels can be conjugated to the nucleotide via a binding pair, optionally wherein the binding pair comprises biotin, DNP, DIG, or desthiobiotin, and a corresponding binding partner thereof.
- the substrate in any of one or more of the preceding embodiments, in c)i), can be illuminated at the first wavelength at a first full power to acquire the first image. In any of one or more of the preceding embodiments, in c)ii), the substrate can be illuminated at the second wavelength at a second full power to acquire the second image. In any of one or more of the preceding embodiments, in c)iii), the substrate can be illuminated at the first wavelength at less than the first full power, and illuminated at the second wavelength at less than the second full power, thereby acquiring the third image.
- the substrate in c)iii), can be illuminated at the first wavelength at about 50% of the first full power, and at the second wavelength at about 50% of the second full power to acquire the third image.
- the first and second full powers can be of the same constant current in LED illumination.
- the first and second full powers can be of different constant currents in LED illumination.
- the signal intensity associated with the third base in the third image, can be about twice the signal intensity associated with the first base or the signal intensity associated with the second base.
- the signal codeword can comprise signal codes corresponding to signal intensities associated with the nucleotides of the first base, the nucleotides of the second base, or the nucleotides of the third base.
- the signal codeword may not comprise signal codes corresponding to signal intensities associated with the nucleotides of the first base, the nucleotides of the second base, or the nucleotides of the third base.
- the method can comprise: d) contacting the substrate with a second pool of nucleotides comprising unlabeled nucleotides of the first base, unlabeled nucleotides of the second base, unlabeled nucleotides of the third base, and/or unlabeled nucleotides of a fourth base.
- the method can comprise: e) allowing binding and optional incorporation of the nucleotides of the second pool templated on the multiple copies of the template sequence at each cluster, wherein the bound and optionally incorporated nucleotides are complementary to nucleotides at the first nucleotide position in the template sequences.
- the method can comprise: d) contacting the substrate with a second pool of nucleotides comprising labeled nucleotides of a fourth base.
- the method can comprise: e) allowing binding and optional incorporation of the nucleotides of the second pool templated on the multiple copies of the template sequence at each cluster, wherein the bound and optionally incorporated nucleotides are complementary to nucleotides at the first nucleotide position in the template sequences.
- the method can comprise: f) imaging the substrate to detect signals or absence thereof at the clusters, wherein the signals are associated with the bound and optionally incorporated nucleotides of the second pool.
- the clusters can be disposed at spatially discrete sites on the substrate.
- the average distance between adjacent clusters on the substrate can be between about 0.3 pm and about 10 pm.
- the density of clusters on the substrate can be between about 100 k/mm 2 and about 5000 k/mm 2 .
- signals from any two adjacent clusters on the substrate can be optically resolvable.
- the clusters can comprise a random array of clusters on the substrate.
- the clusters can comprise an ordered array of clusters on the substrate.
- any one or more of the clusters can be formed via bridge amplification.
- any one or more of the clusters can each comprise multiple molecules each comprising: i) one or more adapter sequences and/or one or more primer binding sequences and ii) the same template sequence or complement thereof.
- any one or more of the clusters can be formed via rolling circle amplification (RCA).
- any one or more of the clusters can each comprise one or more RCA products (RCPs) each comprising: i) one or more adapter sequences and/or one or more primer binding sequences and ii) the same template sequence.
- RCPs RCA products
- any one or more of the clusters can each comprise: i) one or more polynucleotides that are 5’ immobilized on the substrate and 3’ blocked; ii) one or more polynucleotides that are 3’ immobilized on the substrate; and/or iii) one or more nucleic acid concatemers immobilized on the substrate.
- the first pool of nucleotides can comprise nucleotides of any three of A, T/U, C, and G. In any of one or more of the preceding embodiments, the first pool of nucleotides can comprise nucleotides of A, T/U, and C, but no nucleotides of G.
- the first and/or second pool of nucleotides can comprise one or more dNTP monomers and/or one or more dNTP multimers each comprising multiple dNTP molecules conjugated to a scaffold.
- the scaffold is conjugated to one or more detectable labels such as fluorescent moieties.
- the scaffold comprises a streptavidin or a variant or mutein thereof.
- the scaffold comprises an oligomeric streptavidin or a variant or mutein thereof.
- the first and/or second pool of nucleotides can each comprise a reversible terminator.
- the first and/or second pool of nucleotides can each comprise a 3’- O-blocked reversible terminator or a 3 ’-unblocked reversible terminator.
- the method can comprise allowing nucleotide incorporation after each imaging of the substrate.
- the method can comprise removing unbound or unincorporated nucleotides from the substrate prior to imaging the substrate.
- the method can comprise determining the bases at a second nucleotide position in the plurality of polynucleotide templates, wherein the second nucleotide position is 5’ to the first nucleotide position in the plurality of polynucleotide templates.
- kits comprising: i) a first plurality of nucleotide molecules of a first base, comprising nucleotides labeled or configured to be labeled with a first label detectable at a first wavelength, ii) a second plurality of nucleotide molecules of a second base which is different from the first base, comprising nucleotides labeled or configured to be labeled with a second label detectable at a second wavelength which is different from the first wavelength, and iii) a third plurality of nucleotide molecules of a third base which is different from the first and second bases, comprising a nucleotide labeled or configured to be labeled with a third label detectable at the first wavelength and a nucleotide labeled or configured to be labeled with a fourth label detectable at the second wavelength.
- the kit can comprise instructions for using the first plurality, the second plurality, and/or the third plurality of nucleotide molecules according to a method in the present disclosure.
- the first plurality, the second plurality, and/or the third plurality can each independently comprise between about 5% and about 50% nucleotide molecules that are not detectably labeled. In any of the preceding embodiments, the first plurality, the second plurality, and/or the third plurality can each independently comprise about 10% nucleotide molecules that are not detectably labeled.
- the kit can comprise a fourth plurality of nucleotide molecules of a fourth base which is different from the first, second, and third bases, optionally wherein the fourth plurality of nucleotide molecules are not detectably labeled.
- the first, second, third, and/or fourth plurality can each independently comprise one or more reversibly terminated nucleotide molecules, and/or one or more complexes comprising nucleotide molecules conjugated to a scaffold such that the nucleotide molecules are not incorporable.
- FIG. 1 provides an exemplary flowchart for nucleic acid sequencing, according to some embodiments described herein.
- FIG. 2 provides an exemplary process for nucleic acid sequencing.
- sequencing-by- synthesis is performed with labeled nucleotides for three out of the four bases.
- A) shows a representative status of DNA strands with primer during sequencing with ATGC as next base;
- B) shows the extended base after contact with labeled dATP, labeled dCTP, and labeled dTTP, where G is omitted since dGTP is not added;
- C) shows an image truth table: Image 1 is green illumination at a constant current drive (e.g., 12 Amp), Image 2 is red illumination at the same current (e.g., 12 Amp), Image 3 is green and red illumination with a total current that is same as previous (e.g. 12 Amp).
- D) shows the 3’ block can be cleaved after further contact with dye-labeled dGTP and/or unlabeled dGTP. All of the nucleotides in this illustration bear 3’ blocker.
- FIG. 3 provides an exemplary ideal plot of intensities of DNA clusters in Image 1 and Image 2.
- A is red only
- T is green only
- C is dual color with more efficient fluorescent dyes which intensities are equal to red only dye or green only dye at half or less surface dye concentration.
- FIG. 4 provides an exemplary plot of signal intensities of DNA clusters in Image 1 and Image 2 in reality.
- A is red only
- T is green only
- C is dual color with which intensity at red or green channel is half of the red only dye or the green only dye, respectively.
- FIG. 5 provides an exemplary plot of signal intensities of DNA clusters in Image 1, Image 2 and Image 3.
- A is red only, T is green only, and C is dual color with which intensity at red or green channel is equal to the red only dye or the green only dye, respectively.
- the distance between bright clusters and dark clusters is improved due to the use of Image 3.
- FIG. 6 provides an exemplary plot of signal intensities of DNA clusters in Image 1, Image 2 and Image 3.
- A is red only, T is green only, and C is dual color with which intensity at red or green channel is half of the red only dye or the green only dye, respectively.
- the distance between bright clusters and dark clusters is improved due to the use of Image 3.
- FIG. 8 provides an exemplary scheme showing a streptavidin mixture with different dye labels which can be used to stain biotin or desthiobiotin tagged nucleotide during sequencing to improve the brightness of dual coded nucleotides such as C in FIGS. 3-6.
- nucleic acid and “nucleotide” are intended to be consistent with their use in the art and to include naturally-occurring species or functional analogs thereof. Particularly useful functional analogs of nucleic acids are capable of hybridizing to a nucleic acid in a sequence-specific fashion (e.g., capable of hybridizing to two nucleic acids such that ligation can occur between the two hybridized nucleic acids) or are capable of being used as a template for replication of a particular nucleotide sequence.
- Naturally-occurring nucleic acids generally have a backbone containing phosphodiester bonds.
- An analog structure can have an alternate backbone linkage including any of a variety of those known in the art.
- Naturally- occurring nucleic acids generally have a deoxyribose sugar (e.g., found in deoxyribonucleic acid (DNA)) or a ribose sugar (e.g. found in ribonucleic acid (RNA)).
- a deoxyribose sugar e.g., found in deoxyribonucleic acid (DNA)
- RNA ribonucleic acid
- a nucleic acid can contain nucleotides having any of a variety of analogs of these sugar moieties that are known in the art.
- a nucleic acid can include native or non-native nucleotides.
- a native deoxyribonucleic acid can have one or more bases selected from the group consisting of adenine (A), thymine (T), cytosine (C), or guanine (G)
- a ribonucleic acid can have one or more bases selected from the group consisting of uracil (U), adenine (A), cytosine (C), or guanine (G).
- Useful non-native bases that can be included in a nucleic acid or nucleotide are known in the art.
- a “probe” or a “target,” when used in reference to a nucleic acid or sequence of a nucleic acids, is intended as a semantic identifier for the nucleic acid or sequence in the context of a method or composition, and does not limit the structure or function of the nucleic acid or sequence beyond what is expressly indicated.
- oligonucleotide and “polynucleotide” are used interchangeably to refer to a single-stranded multimer of nucleotides from about 2 to about 500 nucleotides in length. Oligonucleotides can be synthetic, made enzymatically (e.g., via polymerization), or using a “split-pool” method. Oligonucleotides can include ribonucleotide monomers (e.g., can be oligoribonucleotides) and/or deoxyribonucleotide monomers (e.g., oligodeoxyribonucleotides).
- oligonucleotides can include a combination of both deoxyribonucleotide monomers and ribonucleotide monomers in the oligonucleotide (e.g., random or ordered combination of deoxyribonucleotide monomers and ribonucleotide monomers).
- An oligonucleotide can be 4 to 10, 10 to 20, 21 to 30, 31 to 40, 41 to 50, 51 to 60, 61 to 70, 71 to 80, 80 to 100, 100 to 150, 150 to 200, 200 to 250, 250 to 300, 300 to 350, 350 to 400, or 400 to 500 nucleotides in length, for example.
- Oligonucleotides can include one or more functional moieties that are attached (e.g., covalently or non-covalently) to the multimer structure.
- an oligonucleotide can include one or more detectable labels (e.g., a radioisotope or fluorophore).
- detectable label refers to a directly or indirectly detectable moiety that is coupled to or may be coupled to another moiety, for example, a nucleotide or nucleotide analog.
- the detectable label can be directly detectable by itself (e.g., radioisotope labels or fluorescent labels) or, in the case of an enzymatic label, can be indirectly detectable, e.g., by catalyzing chemical alterations of a substrate compound or composition, which substrate compound or composition is directly detectable.
- the label can emit a signal or alter a signal delivered to the label so that the presence or absence of the label can be detected.
- coupling may be via a linker, which may be cleavable, such as photo-cleavable (e.g., cleavable under ultra-violet light), chemically-cleavable (e.g., via a reducing agent, such as dithiothreitol (DTT), tris(2-carboxyethyl)phosphine (TCEP)) or enzymatically cleavable (e.g., via an esterase, lipase, peptidase, or protease).
- a linker which may be cleavable, such as photo-cleavable (e.g., cleavable under ultra-violet light), chemically-cleavable (e.g., via a reducing agent, such as dithiothreitol (DTT), tris(2-carboxyethyl)phosphine (TCEP)) or enzymatically cleavable (e.g.,
- a detectable label is or includes a fluorophore.
- fluorophores include, but are not limited to, fluorescent nanocrystals; quantum dots; d- Rhodamine acceptor dyes including dichloro [R 110], dichloro [R6G], dichloro [TAMRA], dichloro[ROX] or the like; fluorescein donor dye including fluorescein, 6-FAM, or the like; Cyanine dyes such as Cy3B; Alexa dyes, SETA dyes, Atto dyes such as atto 647N which forms a FRET pair with Cy3B and the like.
- Fluorophores include, but are not limited to, MDCC (7-diethylamino-3-[([(2-maleimidyl)ethyl]amino)carbonyl]coumarin), TET, HEX, Cy3, TMR, ROX, Texas Red, Cy5, LC red 705 and LC red 640.
- a detectable label is or includes a luminescent or chemiluminescent moiety.
- luminescent/chemiluminescent moieties include, but are not limited to, peroxidases such as horseradish peroxidase (HRP), soybean peroxidase (SP), alkaline phosphatase, and luciferase. These protein moieties can catalyze chemiluminescent reactions given the appropriate substrates (e.g., an oxidizing reagent plus a chemiluminescent compound. A number of compound families are known to provide chemiluminescence under a variety of conditions.
- Non-limiting examples of chemiluminescent compound families include 2,3-dihydro-l,4-phthalazinedione luminol, 5-amino-6,7,8-trimethoxy- and the dimethylamino[ca]benz analog. These compounds can luminesce in the presence of alkaline hydrogen peroxide or calcium hypochlorite and base.
- chemiluminescent compound families include, e.g., 2,4,5-triphenylimidazoles, para-dimethylamino and - methoxy substituents, oxalates such as oxalyl active esters, p-nitrophenyl, N-alkyl acridinum esters, luciferins, lucigenins, or acridinium esters.
- a detectable label is or includes a metal-based or mass-based label.
- hybridizing refers to the pairing of substantially complementary or complementary nucleic acid sequences within two different molecules. Pairing can be achieved by any process in which a nucleic acid sequence joins with a substantially or fully complementary sequence through base pairing to form a hybridization complex.
- two nucleic acid sequences are “substantially complementary” if at least 60% (e.g., at least 70%, at least 80%, or at least 90%) of their individual bases are complementary to one another.
- a “primer” is a single- stranded nucleic acid sequence having a 3’ end that can be used as a substrate for a nucleic acid polymerase in a nucleic acid extension reaction.
- RNA primers are formed of RNA nucleotides, and are used in RNA synthesis, while DNA primers are formed of DNA nucleotides and used in DNA synthesis.
- Primers can also include both RNA nucleotides and DNA nucleotides (e.g., in a random or designed pattern). Primers can also include other natural or synthetic nucleotides described herein that can have additional functionality.
- DNA primers can be used to prime RNA synthesis and vice versa (e.g., RNA primers can be used to prime DNA synthesis).
- Primers can vary in length. For example, primers can be about 6 bases to about 120 bases. For example, primers can include up to about 25 bases.
- a primer may in some cases, refer to a primer binding sequence.
- a “nucleic acid extension” generally involves incorporation of one or more nucleic acids (e.g., A, G, C, T, U, nucleotide analogs, or derivatives thereof) into a molecule (such as, but not limited to, a nucleic acid sequence) in a template-dependent manner, such that consecutive nucleic acids are incorporated by an enzyme (such as a polymerase or reverse transcriptase), thereby generating a newly synthesized nucleic acid molecule.
- Enzymatic extension can be performed by an enzyme including, but not limited to, a polymerase and/or a reverse transcriptase.
- a primer that hybridizes to a complementary nucleic acid sequence can be used to synthesize a new nucleic acid molecule by using the complementary nucleic acid sequence as a template for nucleic acid synthesis.
- a 3’ polyadenylated tail of an mRNA transcript that hybridizes to a poly (dT) sequence can be used as a template for single-strand synthesis of a corresponding cDNA molecule.
- a poly (dT) sequence may be used as a sequencing primer for sequencing RNA molecules comprising poly (A) tails.
- a “non-terminating nucleotide” or “incorporating nucleotide” can include a nucleic acid moiety that can be attached to a 3' end of a polynucleotide using a polymerase or transcriptase, and that can have another non-terminating nucleic acid attached to it using a polymerase or transcriptase without the need to remove a protecting group or reversible terminator from the nucleotide.
- Naturally occurring nucleic acids are a type of nonterminating nucleic acid.
- Non-terminating nucleic acids may be labeled or unlabeled.
- a “PCR amplification” refers to the use of a polymerase chain reaction (PCR) to generate copies of genetic material, including DNA and RNA sequences. Suitable reagents and conditions for implementing PCR are described, for example, in U.S. Patent Nos. 4,683,202, 4,683,195, 4,800,159, 4,965,188, and 5,512,462, the entire contents of each of which are incorporated herein by reference.
- the reaction mixture includes the genetic material to be amplified, an enzyme, one or more primers that are employed in a primer extension reaction, and reagents for the reaction.
- the oligonucleotide primers are of sufficient length to provide for hybridization to complementary genetic material under annealing conditions.
- the length of the primers generally depends on the length of the amplification domains, but will typically be at least 4 bases, at least 5 bases, at least 6 bases, at least 8 bases, at least 9 bases, at least 10 base pairs (bp), at least 11 bp, at least 12 bp, at least 13 bp, at least 14 bp, at least 15 bp, at least 16 bp, at least 17 bp, at least 18 bp, at least 19 bp, at least 20 bp, at least 25 bp, at least 30 bp, at least 35 bp, and can be as long as 40 bp or longer, where the length of the primers will generally range from 18 to 50 bp.
- the genetic material can be contacted with a single primer or a set of two primers (forward and reverse primers), depending upon whether primer extension, linear or exponential amplification of the genetic material is desired.
- the PCR amplification process uses a DNA polymerase enzyme.
- the DNA polymerase activity can be provided by one or more distinct DNA polymerase enzymes.
- the DNA polymerase enzyme is from a bacterium, e.g., the DNA polymerase enzyme is a bacterial DNA polymerase enzyme.
- the DNA polymerase can be from a bacterium of the genus Escherichia, Bacillus, Thermophilus, or Pyrococcus.
- PCR amplification can include reactions such as, but not limited to, a strand-displacement amplification reaction, a rolling circle amplification reaction, a ligase chain reaction, a transcription-mediated amplification reaction, an isothermal amplification reaction, and/or a loop-mediated amplification reaction.
- reactions such as, but not limited to, a strand-displacement amplification reaction, a rolling circle amplification reaction, a ligase chain reaction, a transcription-mediated amplification reaction, an isothermal amplification reaction, and/or a loop-mediated amplification reaction.
- PCR amplification uses a single primer that is complementary to the 3’ tag of target DNA fragments.
- PCR amplification uses a first and a second primer, where at least a 3’ end portion of the first primer is complementary to at least a portion of the 3’ tag of the target nucleic acid fragments, and where at least a 3’ end portion of the second primer exhibits the sequence of at least a portion of the 5’ tag of the target nucleic acid fragments.
- a 5’ end portion of the first primer is non-complementary to the 3’ tag of the target nucleic acid fragments, and a 5’ end portion of the second primer does not exhibit the sequence of at least a portion of the 5’ tag of the target nucleic acid fragments.
- the first primer includes a first universal sequence and/or the second primer includes a second universal sequence.
- DNA polymerase includes not only naturally-occurring enzymes but also all modified derivatives thereof, including also derivatives of naturally-occurring DNA polymerase enzymes.
- the DNA polymerase can have been modified to remove 5’-3’ exonuclease activity.
- Sequence-modified derivatives or mutants of DNA polymerase enzymes that can be used include, but are not limited to, mutants that retain at least some of the functional, e.g., DNA polymerase activity of the wildtype sequence. Mutations can affect the activity profile of the enzymes, e.g., enhance or reduce the rate of polymerization, under different reaction conditions, e.g., temperature, template concentration, primer concentration, etc.
- DNA polymerases that can be used include, but are not limited to: E.coli DNA polymerase I, Bsu DNA polymerase, Bst DNA polymerase, Taq DNA polymerase, VENTTM DNA polymerase, DEEPVENTTM DNA polymerase, LongAmp® Taq DNA polymerase, LongAmp® Hot Start Taq DNA polymerase, Crimson LongAmp® Taq DNA polymerase, Crimson Taq DNA polymerase, OneTaq® DNA polymerase, OneTaq® Quick-Load® DNA polymerase, Hemo KlenTaq® DNA polymerase, REDTaq® DNA polymerase, Phusion® DNA polymerase, Phusion® High-Fidelity DNA polymerase, Platinum Pfx DNA polymerase, AccuPrime Pfx DNA polymerase, Phi29 DNA polymerase
- genetic material is amplified by reverse transcription polymerase chain reaction (RT-PCR).
- the desired reverse transcriptase activity can be provided by one or more distinct reverse transcriptase enzymes, suitable examples of which include, but are not limited to: M-MLV, MuLV, AMV, HIV, Array ScriptTM, MultiScribeTM, ThermoScriptTM, and SuperScript® I, II, III, and IV enzymes.
- Reverse transcriptase includes not only naturally occurring enzymes, but all such modified derivatives thereof, including also derivatives of naturally-occurring reverse transcriptase enzymes.
- reverse transcription can be performed using sequence-modified derivatives or mutants of M-MLV, MuLV, AMV, and HIV reverse transcriptase enzymes, including mutants that retain at least some of the functional, e.g., reverse transcriptase, activity of the wild-type sequence.
- the reverse transcriptase enzyme can be provided as part of a composition that includes other components, e.g., stabilizing components that enhance or improve the activity of the reverse transcriptase enzyme, such as RNase inhibitor(s), inhibitors of DNA-dependent DNA synthesis, e.g., actinomycin D.
- sequence-modified derivative or mutants of reverse transcriptase enzymes e.g., M-MLV
- compositions including unmodified and modified enzymes are commercially available, e.g., ArrayScriptTM, MultiScribeTM, ThermoScriptTM, and SuperScript® I, II, III, and IV enzymes.
- Certain reverse transcriptase enzymes can synthesize a complementary DNA strand using both RNA (cDNA synthesis) and single- stranded DNA (ssDNA) as a template.
- the reverse transcription reaction can use an enzyme (reverse transcriptase) that is capable of using both RNA and ssDNA as the template for an extension reaction, e.g., an AMV or MMLV reverse transcriptase.
- nucleic acids e.g., ssDNA in clonal clusters
- surface e.g., a surface in a flow cell for sequencing.
- the general base decoding processing are illustrated, by way of example, in FIG. 2, using DNA clusters as examples.
- the DNA clusters which are amplified by surface PCR are immobilized on surface.
- Each DNA clusters contains hundreds to thousands of copies of the same DNA sequence which is to be sequenced. Any two different clusters can contain the same DNA sequence or different DNA sequences to be sequenced.
- the mixture of three dNTPs, e.g., dATP, dCTP and dTTP with different labels, and a polymerase can be flowed through (e.g., through a flow cell in which the clusters are immobilized on a substrate) and contacted with the ssDNA strands hybridized with sequencing primers.
- the matched dNTPs are incorporated onto the strands and the missing dNTP (e.g., dGTP) is not incorporated.
- the excess dNTPs can be washed off and fluorescent imaging can be performed under two different illumination sources, e.g., 530 nm and 630 nm.
- dATP labelled with fluorescent dye (L2) can be detected under 630 nm excitation wavelength (red).
- dTTP with LI fluorescent label can be detected under green light (530 nm).
- dCTP which is labelled with two dyes (dCTP-L3+L4) or a mixture of nucleotide with two different dyes (e.g., dCTP-L3 and dCTP-L4) can be detected with both illumination conditions.
- Red and green colors are used as examples and fluorescent labels of other colors can be used.
- Wavelengths are often referred to using their associated color and include blue (400-470 nm), green (470-550 nm), red (630-700 nm) and NIR (700-1200) lights.
- the image 1 is acquired with 530 nm LED illumination at full power (e.g., constant current at 12 Amps).
- the image 2 is acquired with 630 nm LED illumination at full power as well (e.g., constant current at 12 Amps).
- the base types can be called based on these two images.
- the base is called as “A” at a cluster if the cluster only lights up in Image 2.
- the base at a cluster is “T” if the cluster only lights up in Image 1.
- the base is “C” if the cluster is detectable in both Image 1 and Image 2.
- the base is “G” if the cluster is not detected in either image.
- the fluorescent dyes selections can be very limit for dual labelled or mixed dyes to have the same brightness as single labelled nucleotide in red or green channel, since the concentration is split into half by two dyes.
- the dual labelled dyes will have energy transfer between dyes or different quantum yield, which can make it difficult to balance the brightness between these dyes.
- the ideal distribution of the intensity of DNA clusters in Image 1 and Image 2 is shown in FIG. 3.
- the average signal intensities of T and C in Image 1 is similar or equal and same for A and C in Image 2.
- the average signal intensity of C can be far weaker than T and A as demonstrated in FIG. 4, e.g., due to energy transfer between dyes.
- a third image can be acquired with both wavelengths, e.g., using 530 nm and 630 nm light sources (LED1 and LED2, respectively), where the clusters are illuminated but with 50% of power for LED1 and 50% of power of LED2 (12 Amps current in total).
- the dual labelled nucleotide can show 100% of the emission intensity since it can be excited by both light.
- the other two nucleotide with single labelled dye can only reach 50% of intensity in Image 1 and Image 2. These intensities can be used as third axis to improve the separation between the cluster intensity clouds. As shown in FIG.
- the distance between dark cloud and the dual labeled “C” cloud can be increased for 1.414 to 1.732 which is about 20% improvement compared to an ideal case using a conventional method (e.g., using only Image 1 and Image 2).
- the distance between dark cloud and the dual labeled “C” cloud can be improved by using Image 3, even when the dual color nucleotide (“C”) has intensity at red or green channel that is half of the red only dye or the green only dye.
- the distance between dark (“G”) and “C” clouds is increased from 0.707 to 0.866.
- the pass filter rate and quality score can be improved with increased distance between the cluster clouds.
- the nucleotides can be labelled with non-fluorescent labels such as biotin, desthiobiotin, 2,4-Dinitrophenyl (DNP) and Digoxigenin etc., which can be stained with dye labelled streptavidin or antibody.
- non-fluorescent labels such as biotin, desthiobiotin, 2,4-Dinitrophenyl (DNP) and Digoxigenin etc.
- DNP 2,4-Dinitrophenyl
- Digoxigenin etc. can be stained with dye labelled streptavidin or antibody.
- the intensities of clusters can be easily tuned by varying streptavidin/dye ratio or antibody /ratio. In addition, the intensities can be also tune by mixing of streptavidin without any tag.
- the dual channel brightness can be simply tuned by fine tuning the ratio between streptavidin- dyel, streptavidin-dye2, and optionally unlabeled streptavidin, as show in FIG. 7 and FIG. 8.
- the missing nucleotide such as G in FIG. 2 can be incorporated by contacting the dGTP with or without labels and polymerase in a separate step or can be introduced together with other unlabeled or labelled nucleotides such as dATP, dCTP and dTTP to improve the incorporation yield and prevent grow lag.
- the advantage of introducing another dNTP mix after imaging is the following.
- the dark nucleotide can be mixed with other dye labelled nucleotides, e.g., 10% of total nucleotides can be unlabeled, which can improve the phasing too since dark dNTP is incorporated more efficiency than labelled nucleotides.
- the missing nucleotide can be labeled with biotin or other hapten
- a third or fourth image can be acquired to light up the fourth type of clusters such as G in FIG. 4 to improve the accuracy of base calling after stain with streptavidin or antibody with dyes.
- the internal fluorescence dye can be replaced with other biomolecules such as biotin, desthiobiotin, 2,4-Dinitrophenol and digoxigenin etc.
- the fluorescent dye can be introduced later with streptavidin for biotin and desthiobiotin or antibody for small hapten.
- the dye tag can also be released with biotin for desthiobiotin labeled oligo.
- the nucleic acid molecules used in the methods described herein may be obtained from any suitable biological source, for example a tissue sample, a blood sample, a plasma sample, a saliva sample, a fecal sample, or a urine sample.
- the polynucleotides may be DNA or RNA molecules.
- RNA molecules are reverse transcribed into DNA molecules prior to hybridizing the polynucleotide to a sequencing primer.
- RNA molecules are not reverse transcribed and are hybridized to a sequencing primer for direct RNA sequencing.
- the nucleic acid molecule is a cell-free DNA (cfDNA), such as a circulating tumor DNA (ctDNA) or a fetal cell-free DNA.
- nucleic acid molecules include DNA molecules such as single- stranded DNA (ssDNA), double-stranded DNA (dsDNA), genomic DNA, methylated DNA, specific methylated DNA sequences, fragmented DNA, mitochondrial DNA, in situ synthesized PCR products, and RNA/DNA hybrids.
- the DNA analyte can be a transcript of another nucleic acid molecule (e.g., DNA or RNA such as mRNA) present in a tissue sample.
- RNA molecules such as various types of coding and non-coding RNA, including viral RNAs.
- RNA molecules include messenger RNA (mRNA), including a nascent RNA, a pre- mRNA, a primary-transcript RNA, and a processed RNA, such as a capped mRNA (e.g., with a 5’ 7-methyl guanosine cap), a polyadenylated mRNA (poly-A tail at the 3’ end), and a spliced mRNA in which one or more introns have been removed.
- mRNA messenger RNA
- a nascent RNA e.g., a pre- mRNA, a primary-transcript RNA
- a processed RNA such as a capped mRNA (e.g., with a 5’ 7-methyl guanosine cap), a polyadenylated mRNA (poly-A tail at the 3’ end), and a splic
- RNA analyte can be a transcript of another nucleic acid molecule (e.g., DNA or RNA such as viral RNA).
- a nucleic acid molecule may be a denatured nucleic acid, wherein the resulting denatured nucleic acid is single- stranded.
- the nucleic acid may be denatured, for example, optionally using formamide, heat, or both formamide and heat. In some embodiments, the nucleic acid is not denatured for use in a method disclosed herein.
- a nucleic acid molecule can be extracted from a cell, a virus, or a tissue sample comprising the cell or virus. Processing conditions can be adjusted to extract or release nucleic acid molecules (e.g., RNA) from a cell, a virus, or a tissue sample.
- nucleic acid molecules e.g., RNA
- a method disclosed herein comprises using one or more nucleotides or analogs thereof, including a native nucleotide or a nucleotide analog or modified nucleotide (e.g., labeled with one or more detectable labels).
- a nucleotide analog comprises a nitrogenous base, five-carbon sugar, and phosphate group, wherein any component of the nucleotide may be modified and/or replaced.
- a method disclosed herein may comprise but does not require using one or more non-incorporable nucleotides. Non-incorporable nucleotides may be modified to become incorporable at any point during the sequencing method.
- Nucleotide analogs include, but are not limited to, alpha-phosphate modified nucleotides, alpha-beta nucleotide analogs, beta-phosphate modified nucleotides, beta-gamma nucleotide analogs, gamma-phosphate modified nucleotides, caged nucleotides, or ddNTPs. Examples of nucleotide analogs are described in U.S. Patent No. 8,071,755, which is incorporated by reference herein in its entirety.
- a method disclosed herein may comprise but does not require using terminators that reversibly prevent nucleotide incorporation at the 3 '-end of the primer.
- One type of reversible terminator is a 3'-O-blocked reversible terminator.
- the terminator moiety is linked to the oxygen atom of the 3'-OH end of the 5-carbon sugar of a nucleotide.
- U.S. Patent Nos. 7,544,794 and 8,034,923 (the disclosures of these patents are incorporated by reference) describe reversible terminator dNTPs having the 3'-OH group replaced by a 3'-ONH2 group.
- reversible terminator is a 3 '-unblocked reversible terminator, wherein the terminator moiety is linked to the nitrogenous base of a nucleotide.
- U.S. Patent No. 8,808,989 discloses particular examples of base-modified reversible terminator nucleotides that may be used in connection with the methods described herein.
- Other reversible terminators that similarly can be used in connection with the methods described herein include those described in U.S. Patent Nos. 7,956,171, 8,071,755, and 9,399,798, herein incorporated by reference.
- a method disclosed herein may comprise but does not require using nucleotide analogs having terminator moieties that irreversibly prevent nucleotide incorporation at the 3 '-end of the primer.
- Irreversible nucleotide analogs include 2', 3'- dideoxynucleotides, ddNTPs (ddGTP, ddATP, ddTTP, ddCTP). Dideoxynucleotides lack the 3'-OH group of dNTPs that is essential for polymerase-mediated synthesis.
- a method disclosed herein may comprise but does not require using non-incorporable nucleotides comprising a blocking moiety that inhibits or prevents the nucleotide from forming a covalent linkage to a second nucleotide (3'-OH of a primer) during the incorporation step of a nucleic acid polymerization reaction.
- the blocking moiety can be removed from the nucleotide, allowing for nucleotide incorporation.
- a method disclosed herein may comprise but does not require using 1, 2, 3, 4 or more nucleotide analogs present in the SBS reaction.
- a nucleotide analog is replaced, diluted, or sequestered during an incorporation step.
- a nucleotide analog is replaced with a native nucleotide.
- a nucleotide analog is modified during an incorporation step. The modified nucleotide analog can be similar to or the same as a native nucleotide.
- a method disclosed herein may comprise but does not require using a nucleotide analog having a different binding affinity for a polymerase than a native nucleotide.
- a nucleotide analog has a different interaction with a next base than a native nucleotide.
- Nucleotide analogs and/or non-incorporable nucleotides may base-pair with a complementary base of a template nucleic acid.
- one or more nucleotides can be labeled with distinguishing and/or detectable tags or labels.
- the tags may be distinguishable by means of their differences in fluorescence, Raman spectrum, charge, mass, refractive index, luminescence, length, or any other measurable property.
- the tag may be attached to one or more different positions on the nucleotide, so long as the fidelity of binding to the polymerase-nucleic acid complex is sufficiently maintained to enable identification of the complementary base on the template nucleic acid correctly.
- the tag is attached to the nucleobase of the nucleotide.
- a tag is attached to the gamma phosphate position of the nucleotide.
- Detectable labels can be suitable for small scale detection and/or suitable for high- throughput screening.
- suitable detectable labels include, but are not limited to, radioisotopes, fluorophores, chemiluminescent compounds, bioluminescent compounds, and dyes.
- the detectable label can be qualitatively detected (e.g., optically or spectrally), or it can be quantified.
- Qualitative detection generally includes a detection method in which the existence or presence of the detectable label is confirmed, whereas quantifiable detection generally includes a detection method having a quantifiable (e.g., numerically reportable) value such as an intensity, duration, polarization, and/or other properties.
- the detectable label is bound to another moiety, for example, a nucleotide or nucleotide analog, and can include a fluorescent, a colorimetric, or a chemiluminescent label.
- a detectable label can be attached to another moiety, for example, a nucleotide or nucleotide analog.
- the detectable label is a fluorophore.
- the fluorophore can be from a group that includes: 7-AAD (7- Aminoactinomycin D), Acridine Orange (+DNA), Acridine Orange (+RNA), Alexa Fluor® 350, Alexa Fluor® 430, Alexa Fluor® 488, Alexa Fluor® 532, Alexa Fluor® 546, Alexa Fluor® 555, Alexa Fluor® 568, Alexa Fluor® 594, Alexa Fluor® 633, Alexa Fluor® 647, Alexa Fluor® 660, Alexa Fluor® 680, Alexa Fluor® 700, Alexa Fluor® 750, Allophycocyanin (APC), AMCA / AMCA-X, 7-Aminoactinomycin D (7-AAD), 7- Amino-4- methylcoumarin, 6-Aminoquinoline, Aniline Blue, ANS, APC-Cy7, ATTO-TAGTM CBQCA, ATTO-TAGTM FQ, Auramine O-Feulgen,
- the detectable label can be directly detectable by itself (e.g., radioisotope labels or fluorescent labels) or, in the case of an enzymatic label, can be indirectly detectable, e.g., by catalyzing chemical alterations of a substrate compound or composition, which substrate compound or composition is directly detectable.
- the label can emit a signal or alter a signal delivered to the label so that the presence or absence of the label can be detected.
- coupling may be via a linker, which may be cleavable, such as photo-cleavable (e.g., cleavable under ultra-violet light), chemically-cleavable (e.g., via a reducing agent, such as dithiothreitol (DTT), tris(2-carboxyethyl)phosphine (TCEP)) or enzymatically cleavable (e.g., via an esterase, lipase, peptidase, or protease).
- a linker which may be cleavable, such as photo-cleavable (e.g., cleavable under ultra-violet light), chemically-cleavable (e.g., via a reducing agent, such as dithiothreitol (DTT), tris(2-carboxyethyl)phosphine (TCEP)) or enzymatically cleavable (e.g.,
- Polymerases that may be used to carry out the disclosed techniques include naturally- occurring polymerases and any modified variations thereof, including, but not limited to, mutants, recombinants, fusions, genetic modifications, chemical modifications, synthetics, and analogs.
- Naturally occurring polymerases and modified variations thereof are not limited to polymerases that retain the ability to catalyze a polymerization reaction.
- the naturally occurring and/or modified variations thereof retain the ability to catalyze a polymerization reaction.
- the naturally-occurring and/or modified variations have special properties that enhance their ability to sequence DNA, including enhanced binding affinity to nucleic acids, reduced binding affinity to nucleic acids, enhanced catalysis rates, reduced catalysis rates, etc.
- Mutant polymerases include polymerases wherein one or more amino acids are replaced with other amino acids (naturally or non-naturally occurring), and insertions or deletions of one or more amino acids.
- a method disclosed herein may comprise but does not require using modified polymerases containing an external tag (e.g., an exogenous detectable label), which can be used to monitor the presence and interactions of the polymerase.
- an external tag e.g., an exogenous detectable label
- intrinsic signals from the polymerase can be used to monitor their presence and interactions.
- the provided methods can include monitoring the interaction of the polymerase, nucleotide and template nucleic acid through detection of an intrinsic signal from the polymerase.
- the intrinsic signal is a light scattering signal.
- intrinsic signals include native fluorescence of certain amino acids such as tryptophan.
- a method disclosed herein may comprise using an unlabeled polymerase, and monitoring is performed in the absence of an exogenous detectable label associated with the polymerase.
- Some modified polymerases or naturally occurring polymerases, under specific reaction conditions, may incorporate only single nucleotides and may remain bound to the primer-template after the incorporation of the single nucleotide.
- a method disclosed herein may comprise using a polymerase unlabeled with an exogenous detectable label (e.g., a fluorescent label). The label can be chemically linked to the structure of the polymerase by a covalent bond after the polymerase has been at least partially purified using protein isolation techniques.
- the exogenous detectable label can be chemically linked to the polymerase using a free sulfhydryl or a free amine moiety of the polymerase. This can involve chemical linkage to the polymerase through the side chain of a cysteine residue, or through the free amino group of the N-terminus.
- a fluorescent label attached to the polymerase is useful for locating the polymerase, as may be important for determining whether or not the polymerase has localized to a spot on an array corresponding to immobilized primed template nucleic acid.
- the fluorescent signal need not, and in some embodiments does not change absorption or emission characteristics as the result of binding any nucleotide.
- the signal emitted by the labeled polymerase is maintained uniformly in the presence and absence of any nucleotide being investigated as a possible next correct nucleotide.
- polymerase and its variants also refers to fusion proteins comprising at least two portions linked to each other, for example, where one portion comprises a peptide that can catalyze the polymerization of nucleotides into a nucleic acid strand is linked to another portion that comprises a second moiety, such as, a reporter enzyme or a processivity-modifying domain.
- T7 DNA polymerase comprises a nucleic acid polymerizing domain and a thioredoxin binding domain, wherein thioredoxin binding enhances the processivity of the polymerase. Absent the thioredoxin binding, T7 DNA polymerase is a distributive polymerase with processivity of only one to a few bases.
- DNA polymerases differ in detail, they have a similar overall shape of a hand with specific regions referred to as the fingers, the palm, and the thumb; and a similar overall structural transition, comprising the movement of the thumb and/or finger domains, during the synthesis of nucleic acids.
- DNA polymerases include, but are not limited to, bacterial DNA polymerases, eukaryotic DNA polymerases, archaeal DNA polymerases, viral DNA polymerases and phage DNA polymerases.
- Bacterial DNA polymerases include E. coli DNA polymerases I, II and III, IV and V, the Klenow fragment of E.
- Eukaryotic DNA polymerases include DNA polymerases a, P, y, 5, e, q, , c, p, and K, as well as the Revl polymerase (terminal deoxycytidyl transferase) and terminal deoxynucleotidyl transferase (TdT).
- Viral DNA polymerases include T4 DNA polymerase, phi-29 DNA polymerase, GA-1, phi-29-like DNA polymerases, PZA DNA polymerase, phi- 15 DNA polymerase, Cpl DNA polymerase, Cp7 DNA polymerase, T7 DNA polymerase, and T4 polymerase.
- DNA polymerases include thermostable and/or thermophilic DNA polymerases such as DNA polymerases isolated from Thermus aquaticus (Taq) DNA polymerase, Thermus filiformis (Tfi) DNA polymerase, Thermococcus zilligi (Tzi) DNA polymerase, Thermus thermophilus (Tth) DNA polymerase, Thermus flavusu (TjT) DNA polymerase, Pyrococcus woesei (Pwo) DNA polymerase, Pyrococcus furiosus (Pf ) DNA polymerase and Turbo Pfu DNA polymerase, Thermococcus litoralis (TH) DNA polymerase, Pyrococcus sp.
- Taq Thermus aquaticus
- Tfi Thermus filiformis
- Tzi Thermococcus zilligi
- Tzi Thermus thermophilus
- Tth Thermus thermophilus
- TjT Thermus flavusu DNA polyme
- GB-D polymerase Thermotoga maritima (Tma) DNA polymerase, Bacillus stearo thermophilus (Bst) DNA polymerase, Pyrococcus Kodakaraensis (KOD) DNA polymerase, Pfx DNA polymerase, Thermococcus sp. JDF-3 (JDF-3) DNA polymerase, Thermococcus gorgonarius (Tgo) DNA polymerase, Thermococcus acidophilium DNA polymerase; Sulfolobus acidocaldarius DNA polymerase; Thermococcus sp.
- modified versions of the extremely thermophilic marine archaea Thermococcus species 9° N can be used.
- Still other useful DNA polymerases, including the 3PDX polymerase are disclosed in U.S. Patent No. 8,703,461, the disclosure of which is incorporated by reference in its entirety.
- RNA polymerases include, but are not limited to, viral RNA polymerases such as T7 RNA polymerase, T3 polymerase, SP6 polymerase, and Kl l polymerase; Eukaryotic RNA polymerases such as RNA polymerase I, RNA polymerase II, RNA polymerase III, RNA polymerase IV, and RNA polymerase V; and Archaea RNA polymerase.
- viral RNA polymerases such as T7 RNA polymerase, T3 polymerase, SP6 polymerase, and Kl l polymerase
- Eukaryotic RNA polymerases such as RNA polymerase I, RNA polymerase II, RNA polymerase III, RNA polymerase IV, and RNA polymerase V
- Archaea RNA polymerase Archaea RNA polymerase.
- Reverse transcriptases include, but are not limited to, HIV-1 reverse transcriptase from human immunodeficiency virus type 1 (PDB 1HMV), HIV-2 reverse transcriptase from human immunodeficiency virus type 2, M-MLV reverse transcriptase from the Moloney murine leukemia virus, AMV reverse transcriptase from the avian myeloblastosis virus, and Telomerase reverse transcriptase that maintains the telomeres of eukaryotic chromosomes.
- PDB 1HMV human immunodeficiency virus type 1
- HIV-2 reverse transcriptase from human immunodeficiency virus type 2
- M-MLV reverse transcriptase from the Moloney murine leukemia virus
- AMV reverse transcriptase from the avian myeloblastosis virus
- Telomerase reverse transcriptase that maintains the telomeres of eukaryotic chromosomes.
- a first labeled nucleotide that has been incorporated is not deactivated (e.g., by removal and/or photobleaching of the label) prior to the introduction and/or incorporation of the next, second labeled nucleotide.
- the first and second labeled nucleotides can comprise the same base or different bases.
- the first and second labeled nucleotides can be introduced into a sequencing reaction mix simultaneously or at different time points in any order.
- first and second labeled nucleotides can be introduced by itself (e.g., in a suitable solvent such as water) or in a mixture with another sequencing reagent, such as one or more other labeled nucleotides and/or one or more unlabeled nucleotides.
- the first and second labeled nucleotides can also comprise the same base or different bases.
- nucleotides that have not been incorporated at a residue corresponding to a base in the template nucleic acid are not removed from the sequencing reaction mix prior to the introduction and/or incorporation of the second labeled nucleotide.
- the first and second labeled nucleotides are provided in the same sequencing reaction mix, and the first, second, and optionally any subsequent labeled nucleotide(s) are incorporated sequentially in a continuous manner.
- nucleotides e.g., fluorescently labeled A, T, C, and/or G nucleotides
- some embodiments of the method disclosed herein use continuous introduction and/or incorporation of nucleotides (e.g., fluorescently labeled A, T, C, and/or G nucleotides) without the need of label deactivation and/or wash steps in between sequential incorporation events for a given template nucleic acid molecule to be sequenced.
- nucleotides e.g., fluorescently labeled A, T, C, and/or G nucleotides
- label deactivation e.g., by cleaving and/or photobleaching the label
- label deactivation of a first incorporated nucleotide may occur stochastically throughout the continuous nucleotide incorporation process, for instance, prior to, during, or after the incorporation of a second, third, fourth, or a subsequent labeled nucleotide.
- Nucleic acid sequencing reaction mixtures typically include reagents that are commonly present in polymerase based nucleic acid synthesis reactions.
- the reaction mixture can include other molecules including, but not limited to, enzymes.
- the reaction mixture comprises any reagents or biomolecules generally present in a nucleic acid polymerization reaction.
- Reaction components may include, but are not limited to, salts, buffers, small molecules, detergents, crowding agents, metals, and ions.
- properties of the reaction mixture may be manipulated, for example, electrically, magnetically, and/or with vibration.
- the provided methods herein may further comprise but do not require one or more wash steps; a temperature change; a mechanical vibration; a pH change; or an optical stimulation that is not dye illumination or photobleaching.
- the wash step comprises contacting the substrate and the nucleic acid molecule, the primer, and/or the polymerase with one of more buffers, detergents, protein denaturants, proteases, oxidizing agents, reducing agents, or other agents capable of crosslinking or releasing crosslinks, e.g., crosslinks within a polymerase or crosslinks between a polymerase and nucleic acid.
- Methods and compositions for nucleic acid sequencing are known, for example, as described in U.S. Patent Nos. 10,246,744 and 10,844,428, incorporated herein by reference in their entireties for all purposes.
- Reaction mixture reagents can include, but are not limited to, enzymes (e.g., polymerase), dNTPs, template nucleic acids, primer nucleic acids, salts, buffers, small molecules, co-factors, metals, and ions.
- the ions may be catalytic ions, divalent catalytic ions, non-catalytic ions, non-covalent metal ions, or a combination thereof.
- the reaction mixture can include salts, such as NaCl, KC1, potassium acetate, ammonium acetate, potassium glutamate, or NH4CI or the like, that ionize in aqueous solution to yield monovalent cations.
- the reaction mixture can include a source of ions, such as Mg2+, Mn2+, Co2+, Cd2+, and/or Ba2+ ions.
- the reaction mixture can include tin, Ca2+, Zn2+, Cu2+, Co2+, Fe2+, and/or Ni2+, or other divalent non-catalytic metal cations.
- the reaction mixture can include metal cations that may inhibit formation of phosphodiester bonds between the primed template nucleic acid molecule and the cognate nucleotide.
- the metal cations can be used (e.g., at a suitable concentration) to slow down but not completely inhibit or prevent nucleotide incorporation, thereby reducing multiple nucleotide incorporation events in a single detection window.
- the sequencing reaction conditions comprise contacting the nucleic acid molecule and the primer with a buffer that regulates osmotic pressure.
- the reaction mixture comprises a buffer that regulates osmotic pressure.
- the buffer is a high salt buffer that includes a monovalent ion, such as a monovalent metal ion (e.g., potassium ion or sodium ion) at a concentration of from about 50 to about 1,500 mM.
- the buffer further comprises a source of glutamate ions (e.g., potassium glutamate).
- the buffer comprises a stabilizing agent.
- the stabilizing agent is a non- catalytic metal ion (e.g., a divalent non-catalytic metal ion).
- Non-catalytic metal ions useful in this context include, but are not limited to, calcium, strontium, scandium, titanium, vanadium, chromium, iron, cobalt, nickel, copper, zinc, gallium, germanium, arsenic, selenium, rhodium, europium, and/or terbium.
- the non-catalytic metal ion is strontium, tin, or nickel.
- the sequencing reaction mixture comprises strontium chloride or nickel chloride.
- the stabilizing agent can be used (e.g., at a suitable concentration) to slow down but not completely inhibit or prevent nucleotide incorporation, thereby reducing multiple nucleotide incorporation events in a single detection window.
- the buffer can include Tris, Tricine, HEPES, MOPS, ACES, MES, phosphate-based buffers, and acetate-based buffers.
- the reaction mixture can include chelating agents such as EDTA, EGTA, and the like.
- the reaction mixture includes crosslinking reagents.
- the interaction between the polymerase and template nucleic acid may be manipulated by modulating sequencing reaction parameters such as ionic strength, pH, temperature, or any combination thereof, or by the addition of a destabilizing agent to the reaction.
- the destabilizing agent can be used (e.g., at a suitable concentration) to slow down but not completely inhibit or prevent nucleotide incorporation, thereby reducing multiple nucleotide incorporation events in a single detection window.
- high salt e.g., 50 to 1,500 mM
- pH changes are utilized to destabilize a complex between the polymerase and template nucleic acid.
- the reaction conditions favor the stabilization of a complex among the polymerase, the template nucleic acid, and a labeled nucleotide.
- the pH of the reaction mixture can be adjusted from 4.0 to 10.0 to favor the stabilization of a complex among the polymerase, the template nucleic acid, and a labeled nucleotide.
- the pH of the reaction mixture is from 4.0 to 6.0.
- the pH of the reaction mixture is 6.0 to 10.0.
- a suitable salt concentration and/or a suitable pH can be selected to slow down but not completely inhibit or prevent nucleotide incorporation, thereby reducing multiple nucleotide incorporation events in a single detection window.
- the reaction mixture comprises a competitive inhibitor, where the competitive inhibitor may reduce the occurrence of multiple incorporations events in a detection window.
- the competitive inhibitor is a non-incorporable nucleotide.
- the competitive inhibitor is an aminoglycoside. The competitive inhibitor is capable of replacing either the nucleotide or the catalytic metal ion in the active site, such that the competitive inhibitor occupies the active site preventing or slowing down a nucleotide incorporation.
- both an incorporable nucleotide and a competitive inhibitor are introduced, such that the ratio of the incorporable nucleotide and the inhibitor can be adjusted to modulate the rate of incorporation of a single nucleotide at the 3 '-end of the primer.
- the competitive inhibitor can be used (e.g., at a low concentration) to slow down but not completely inhibit or prevent nucleotide incorporation, thereby reducing multiple nucleotide incorporation events in a single detection window.
- the reaction mixture comprises at least one nucleotide molecule that is a non-incorporable nucleotide.
- the reaction mixture comprises one or more nucleotide molecules incapable of incorporation into the primer of the primed template nucleic acid molecule.
- nucleotides incapable of incorporation include, for example, monophosphate nucleotides.
- the nucleotide may contain modifications to the triphosphate group that make the nucleotide non-incorporable. Examples of non-incorporable nucleotides may be found in U.S. Pat. No. 7,482,120, which is incorporated by reference herein in its entirety.
- the primer may not contain a free hydroxyl group at its 3 '-end, thereby rendering the primer incapable of incorporating any nucleotide, and, thus, making any nucleotide non-incorporable.
- the primer may be processed such that it contains a free hydroxyl group at its 3'-end to allow nucleotide incorporation.
- the non-incorporable nucleotide can be used (e.g., at a low concentration) to slow down but not completely inhibit or prevent nucleotide incorporation, thereby reducing multiple nucleotide incorporation events in a single detection window.
- the reaction mixture comprises at least one nucleotide molecule that is incorporable but is incorporated at a slower rate compared to a corresponding naturally-occurring nucleoside triphosphate (e.g., NTP or dNTP).
- nucleoside triphosphate e.g., NTP or dNTP
- Such nucleotides incorporable at a slower rate may include, for example, diphosphate nucleotides.
- the nucleotide may contain modifications to the triphosphate group that make the nucleotide incorporable at a slower rate.
- the nucleotide incorporable at a slower rate can be used to slow down but not completely inhibit or prevent nucleotide incorporation, thereby reducing multiple nucleotide incorporation events in a single detection window.
- the reaction mixture comprises a polymerase inhibitor.
- the polymerase inhibitor is a pyrophosphate analog.
- the polymerase inhibitor is an allosteric inhibitor.
- the polymerase inhibitor is a DNA or an RNA aptamer.
- the polymerase inhibitor competes with a catalytic-ion binding site in the polymerase.
- the polymerase inhibitor is a reverse transcriptase inhibitor.
- the polymerase inhibitor may be an HIV-1 reverse transcriptase inhibitor or an HIV-2 reverse transcriptase inhibitor.
- the HIV-1 reverse transcriptase inhibitor may be a (4/6-halogen/MeO/EtO-substituted benzo[d]thiazol-2-yl)thiazolidin-4-one.
- the polymerase inhibitor can be used (e.g., at a low concentration) to slow down but not completely inhibit or prevent nucleotide incorporation, thereby reducing multiple nucleotide incorporation events in a single detection window.
- the contacting step is facilitated by the use of a chamber such as a flow cell.
- a chamber such as a flow cell.
- the methods and apparatus described herein may employ next generation sequencing technology (NGS), which allows massively parallel sequencing.
- NGS next generation sequencing technology
- single DNA molecules are sequenced in a massively parallel fashion within a reaction chamber.
- a flow cell may be used but is not necessary.
- Flowing liquid reagents through the flow cell which contains an interior solid support surface (e.g., a planar surface), conveniently permits reagent exchange.
- Immobilized to the interior surface of the flow cell is one or more primed template nucleic acids to be sequenced or interrogated using the procedures described herein.
- Typical flow cells will include microfluidic valving that permits delivery of liquid reagents (e.g., components of the “reaction mixtures” discussed herein) to an entry port. Liquid reagents can be removed from the flow cell by exiting through an exit port.
- liquid reagents e.g., components of the “reaction mixtures” discussed herein
- a reaction chamber disclosed herein can comprise a reagent wall, an imaging area, and optionally an outlet configured to remove molecules of one or more of the polymerase, the first detectably labeled nucleotide, the second detectably labeled nucleotide, and/or one or more other reagents from the imaging area.
- the device may comprise one or more vents but no outlet or exit port for the reaction mixture.
- a method disclosed herein does not comprise a step of removing liquid reagents through an outlet or exit port, e.g., from a reaction chamber such as a flow cell.
- the methods disclosed herein may but do not need to be used in combination with any NGS sequencing methods.
- the sequencing technologies of NGS include but are not limited to pyrosequencing, sequencing-by- synthesis with reversible dye terminators, sequencing by oligonucleotide probe ligation, and ion semiconductor sequencing.
- Nucleic acids such as DNA or RNA from individual samples can be sequenced individually (singleplex sequencing) or nucleic acids such as DNA or RNA from multiple samples can be pooled and sequenced as indexed genomic molecules (multiplex sequencing) on a single sequencing run, to generate up to several hundred million reads of sequences. Examples of sequencing technologies that can be used to obtain the sequence information according to the present method are further described here.
- sequencing- by-synthesis platforms from 454 Life Sciences (Bradford, Conn.), Illumina/Solexa (Hayward, Calif.) and Helicos Biosciences (Cambridge, Mass.).
- Sanger sequencing including the automated Sanger sequencing, can also be employed in the methods described herein. Additional suitable sequencing methods include, but are not limited to nucleic acid imaging technologies, e.g., atomic force microscopy (AFM) or transmission electron microscopy (TEM).
- AFM atomic force microscopy
- TEM transmission electron microscopy
- the disclosed methods may be used in combination with massively parallel sequencing of nucleic acid molecules using Illumina's sequencing-by- synthesis and reversible terminator-based sequencing chemistry.
- a method disclosed herein can use a flow cell having a glass slide with lanes.
- sequence reads of predetermined length are localized by mapping (alignment) to a known reference sequence or genome (e.g., viral sequences or genomes).
- mapping e.g., mapping to a known reference sequence or genome (e.g., viral sequences or genomes).
- a number of computer algorithms are available for aligning sequences, including without limitation BLAST, BLITZ, FASTA, BOWTIE, or ELAND (Illumina, Inc., San Diego, Calif., USA).
- the provided sequencing methods disclosed herein may regulate polymerase interaction with the nucleotides and template nucleic acid (as well as rate of nucleotide incorporation) in a manner that reveals the identity of the next base while controlling the chemical addition of a nucleotide.
- the SBS reaction condition comprises a plurality of primed template nucleic acids, polymerases, nucleotides, or any combination thereof.
- the plurality of nucleotides comprises 1, 2, 3, 4, or more types of different nucleotides, for example dATP, dTTP (or dUTP), dGTP, and dCTP.
- the method can further comprise contacting the nucleic acid molecule with the substrate to immobilize the nucleic acid molecule.
- the nucleic acid molecule can be immobilized at a density of one molecule per at least about 250 nm 2 , at least about 200 nm 2 , at least about 150 nm 2 , at least about 100 nm 2 , at least about 90 nm 2 , at least about 80 nm 2 , at least about 70 nm 2 , at least about 60 nm 2 , at least about 50 nm 2 , at least about 40 nm 2 , at least about 30 nm 2 , at least about 20 nm 2 , at least about 10 nm 2 , at least about 5 nm 2 , or in between any two of the aforementioned values.
- a subset of nucleic acid molecules (e.g., nucleic acid strands to be sequenced) on the substrate may be active at one or more time points.
- a first subset of nucleic acid molecules on the substrate is active (e.g., allowing nucleotide incorporation into a sequencing primer using a singlestranded sequence as template) while a second subset of nucleic acid molecules on the substrate is inactive (e.g., not allowing nucleotide incorporation into a sequencing primer using a single-stranded sequence as template).
- a first subset of nucleic acid molecules on the substrate is activated (e.g., by a first set of polymerase and/or primer molecules) for nucleotide incorporation, while a second subset of nucleic acid molecules on the substrate is not activated (e.g., by the first set of polymerase and/or primer molecules), thus only signals associated with the first subset of nucleic acid molecules are detected.
- the second subset of nucleic acid molecules on the substrate is activated (e.g., by a second set of polymerase and/or primer molecules) for nucleotide incorporation, while the first subset of nucleic acid molecules on the substrate is not activated (e.g., by the second set of polymerase and/or primer molecules), thus only signals associated with the second subset of nucleic acid molecules are detected.
- the first and second sets of polymerase and/or primer molecules can be introduced at different time points, e.g., in sequential cycles with optional washing steps between cycles (e.g., to remove a set of polymerase and/or primer molecules for SBS of a first subset of strands before introducing the next set of polymerase and/or primer molecules for SBS of a second subset of strands).
- the substrate can comprise a bead, a planar substrate, a solid surface, a flow cell, a semiconductor chip, a well, a pillar, a chamber, a channel, a through hole, a nanopore, or any combination thereof.
- the substrate can comprise a microwell, a micropillar, a microchamber, a microchannel, or any combination thereof.
- compositions and kits comprising one or more of the primers, nucleic acid molecules, substrates, nucleotides including detectably labeled nucleotides, polymerases, and reagents for performing the methods provided herein, for example reagents required for one or more steps comprising hybridization, ligation, amplification, detection, sequencing, and/or sample preparation as described herein, for example, in Section IV.
- kits may be present in separate containers or certain compatible components may be pre-combined into a single container.
- the kits further contain instructions for using the components of the kit to practice the provided methods.
- kits can contain reagents and/or consumables required for performing one or more steps of the provided methods.
- the kits contain reagents for sample processing, such as nucleic acid extraction, isolation, and/or purification, e.g., RNA extraction, isolation, and/or purification.
- the kits contain reagents, such as enzymes and buffers for ligation and/or amplification, such as ligases and/or polymerases.
- the kits contain reagents, such as enzymes and buffers for primer extension and/or nucleic acid sequencing, such as polymerases and/or transcriptases.
- the kit can also comprise any of the reagents described herein, e.g., buffer components for tuning the rate of nucleotide incorporation and/or for tuning the rate of signal deactivation (e.g., by photobleaching).
- the kits contain reagents for signal detection during sequencing, such as detectable labels and detectably labeled molecules.
- the kits optionally contain other components, for example nucleic acid primers, enzymes and reagents, buffers, nucleotides, modified nucleotides, and reagents for additional assays.
- the provided embodiments can be applied in analyzing nucleic acid sequences, such as DNA and/or RNA sequencing. In some aspects, the embodiments can be applied in an imaging or detection method for multiplexed nucleic acid analysis. In some aspects, the provided embodiments can be used to identify or detect regions of interest in target nucleic acids, such as viral DNA or RNA.
- the region of interest comprises one or more nucleotide residues, such as a single-nucleotide polymorphism (SNP), a single-nucleotide variant (SNV), substitutions such as a single-nucleotide substitution, mutations such as a point mutation, insertions such as a single-nucleotide insertion, deletions such as a single-nucleotide deletion, translocations, inversions, duplications, and/or other sequences of interest.
- SNP single-nucleotide polymorphism
- SNV single-nucleotide variant
- substitutions such as a single-nucleotide substitution
- mutations such as a point mutation
- insertions such as a single-nucleotide insertion
- deletions such as a single-nucleotide deletion
- translocations inversions, duplications, and/or other sequences of interest.
- the embodiments can be applied in investigative and/or diagnostic applications, for example, for characterization or assessment of a sample from a subject.
- Applications of the provided method can comprise biomedical research and clinical diagnostics.
- biomedical research applications comprise, but are not limited to, genetic and genomic analysis for biological investigation or drug screening.
- clinical diagnostics applications comprise, but are not limited to, detecting gene markers such as disease, immune responses, bacterial or viral DNA/RNA for patient samples, loss of genetic heterozygosity, the presence of gene alleles indicative of a predisposition towards disease or good health, likelihood of responsiveness to therapy, or in personalized medicine or ancestry.
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Immunology (AREA)
- Biotechnology (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Analytical Chemistry (AREA)
- Physics & Mathematics (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The present disclosure relates in some aspects to methods and compositions for analyzing nucleic acid sequences, such as for determining sequences of nucleic acid molecules in a clonal cluster, including DNA sequencing methods and nucleotide compositions. In some embodiments, the nucleic acid sequences are determined by contacting the nucleic acid molecules with nucleotides of a first base labeled with a first label detectable at a first wavelength, nucleotides of a second base labeled with a second label detectable at a second wavelength, and nucleotides of a third base comprising: (i) a nucleotide labeled with a third label detectable at the first wavelength, and (ii) a nucleotide labeled with a fourth label detectable at the second wavelength, where the third label and the four label can be comprises in the same nucleotide molecule or separate nucleotide molecules of the third base. In some embodiments, three images are acquired for each clonal cluster (e.g., generated using bridge amplification or rolling circle amplification) at a) the first wavelength alone, b) the second wavelength alone, and c) both the first wavelength and the second wavelength. In some embodiments, signal codewords corresponding to the signals or absence thereof in a) through c) is generated to identify the bases bound or incorporated during sequencing.
Description
METHODS AND COMPOSITIONS FOR NUCLEIC ACID SEQUENCING USING LABELED NUCLEOTIDES
CROSS-REFERENCE TO RELATED APPLICATION
[0001] This application claims priority to U.S. Provisional Application No. 63/459,214, filed on April 13, 2023, entitled “METHODS AND COMPOSITIONS FOR NUCLEIC ACID SEQUENCING USING LABELED NUCLEOTIDES,” which is herein incorporated by reference in its entirety for all purposes.
FIELD
[0002] The present disclosure relates in some aspects to methods and compositions for analyzing nucleic acid sequences, such as for determining sequences of nucleic acid molecules in a clonal cluster, including DNA sequencing methods and nucleotide compositions.
BACKGROUND
[0003] To detect the nucleotides which extended onto single strand DNA with primers, fluorescent labels can be used to present different signals at different wavelengths. To minimize the crosstalk and computation power to decode the four different bases, fewer than four different fluorescence emission channels are preferred, and improved methods for nucleic acid sequencing are needed. Provided herein are methods, compositions, and kits that meet such and other needs.
BRIEF SUMMARY
[0004] In some embodiments, provided herein are methods and systems to code the four nucleotides (A, T/U, G, and C) with two excitation wavelengths and less than four emission bandwidths. In some embodiments, provided herein are methods to code the four bases in DNA sequencing with different detectable labels and two illumination light sources. Also provided herein are optimized label configurations, where three or two fluorescence images can be used to determine the base extended at individual clusters with improve accuracy and simplified optical design.
[0005] In some embodiments, provided herein is a method for determining a sequence of a polynucleotide template, comprising: a) contacting the polynucleotide template with a first pool of nucleotides comprising: i) a nucleotide of a first base labeled with a first label configured to be detected at a first wavelength, ii) a nucleotide of a second base which is different from the first base and which nucleotide is labeled with a second label configured to
be detected at a second wavelength which is different from the first wavelength, and iii) one or more nucleotides of a third base which is different from the first and second bases, wherein the one or more nucleotides comprise a nucleotide labeled with a third label configured to be detected at the first wavelength and a nucleotide labeled with a fourth label configured to be detected at the second wavelength. In some embodiments, the method comprises b) allowing binding and optional incorporation of a nucleotide of the first pool templated on the polynucleotide template, wherein the bound and optionally incorporated nucleotide is complementary to a nucleotide residue at a first nucleotide position in the polynucleotide template. In any of the preceding embodiments, the method can comprise c) imaging the polynucleotide template to detect a signal (or record an absence of the signal) associated with the bound and optionally incorporated nucleotide of the first pool. In any of the preceding embodiments, the method can comprise: i) using a first power to image the polynucleotide template at the first wavelength and not at the second wavelength to acquire a first image, wherein a signal associated with the first label and/or a signal associated with the third label is detected or an absence of the signal is recorded, ii) using a second power to image the polynucleotide template at the second wavelength and not at the first wavelength to acquire a second image, wherein a signal associated with the second label and/or a signal associated with the fourth label is detected or an absence of the signal is recorded, and iii) using a third power to image the polynucleotide template at the first and second wavelengths to acquire a third image, wherein the third power is less than the first power and/or the second power, wherein a signal associated with the first label and/or a signal associated with the third label is detected at a signal intensity lower than that detected in c)i) or an absence of the signal is recorded, and wherein a signal associated with the second label and/or a signal associated with the fourth label is detected at a signal intensity lower than that detected in c)ii) or an absence of the signal is recorded. In any of the preceding embodiments, the method can comprise generating a signal codeword comprising signal codes each corresponding to the intensity of the signal detected in c)i) through c)iii) (or recorded absence of the signal), wherein the signal codeword corresponds to the identity of the base in the bound and optionally incorporated nucleotide. In any of the preceding embodiments, the method can comprise identifying the nucleotide residue at the first nucleotide position in the polynucleotide template.
[0006] In any of the preceding embodiments, the first label and the third label can be different labels or the same label. In any of the preceding embodiments, the second label and the fourth label can be different labels or the same label. In any of the preceding
embodiments, the one or more nucleotides of the third base can comprise a nucleotide labeled with the third label and the fourth label. In any of the preceding embodiments, the one or more nucleotides of the third base can comprise a first nucleotide species labeled with the third label and a second nucleotide species labeled with the fourth label. In any of the preceding embodiments, the third label and the fourth label can be different labels or the same label. In any of the preceding embodiments, the first and second wavelengths can correspond to different channels of a fluorescence microscope.
[0007] In any of the preceding embodiments, in c)i), the signal associated with the first label or the signal associated with the third label can be detected at a first signal intensity. In any of the preceding embodiments, in c)ii), the signal associated with the second label or the signal associated with the fourth label can be detected at a second signal intensity. In any of the preceding embodiments, the first and second signal intensities can be different by no more than 25%, no more than 20%, no more than 15%, no more than 10%, no more than 5%, or no more than 1%. In any of the preceding embodiments, the first and second signal intensities can be the same. In any of the preceding embodiments, in c)iii), the signal associated with the first label or the signal associated with the third label can be detected at a signal intensity lower than the first signal intensity. In any of the preceding embodiments, in c)iii), the signal associated with the second label or the signal associated with the fourth label can be detected at a signal intensity lower than the second signal intensity.
[0008] In any of the preceding embodiments, the signal associated with the first label or the signal associated with the third label can be detected at a signal intensity between about 25% and about 75% of the first signal intensity. In any of the preceding embodiments, the signal associated with the second label or the signal associated with the fourth label can be detected at a signal intensity between about 25% and about 75% of the second signal intensity. In any of the preceding embodiments, the signal associated with the first label or the signal associated with the third label can be detected at a signal intensity about 50% of the first signal intensity. In any of the preceding embodiments, the signal associated with the second label or the signal associated with the fourth label can be detected at a signal intensity about 50% of the second signal intensity.
[0009] In any of the preceding embodiments, the first pool of nucleotides can comprise a nucleotide of a fourth base which is different from the first, second, and third bases. In any of the preceding embodiments, the nucleotide of the fourth base can be configured to be nondetectable at the first wavelength and the second wavelength. In any of the preceding embodiments, the nucleotide of the fourth base can be free of a detectable label. In any of the
preceding embodiments, the nucleotide of the fourth base can be configured to be detected at a third wavelength different from the first and second wavelengths.
[0010] In any of the preceding embodiments, the polynucleotide template can be one of a plurality of polynucleotide templates having different template sequences. In any of the preceding embodiments, the sequences of the plurality of polynucleotide templates are determined. In any of the preceding embodiments, the polynucleotide template can be immobilized on a substrate. In any of the preceding embodiments, the polynucleotide template can be in one or more polynucleotides immobilized on the substrate. In any of the preceding embodiments, the polynucleotide template can be in a rolling circle amplification product immobilized on the substrate, wherein the rolling circle amplification product comprises multiple copies of the template sequence in the polynucleotide template. In any of the preceding embodiments, the polynucleotide template can be in a cluster of multiple polynucleotides immobilized on the substrate, wherein each polynucleotide in the cluster comprises a copy of the template sequence in the polynucleotide template.
[0011] In any of the preceding embodiments, in c)i), the plurality of polynucleotide templates can be imaged, and the first image can comprise the signal associated with the first label at the location of a first polynucleotide template, a recorded absence of a signal at the location of a second polynucleotide template, and the signal associated with the third label at the location of a third polynucleotide template. In any of the preceding embodiments, in c)ii), the plurality of polynucleotide templates can be imaged and the second image can comprise a recorded absence of a signal at the location of the first polynucleotide template, the signal associated with the second label at the location of the second polynucleotide template, and the signal associated with the fourth label at the location of a third polynucleotide template. In any of the preceding embodiments, in c)iii), the plurality of polynucleotide templates can be imaged and the third image can comprise: the signal associated with the first label at the location of the first polynucleotide template; the signal associated with the second label at the location of the second polynucleotide template, and the signal associated with the third label and the signal associated with the fourth label at the location of a third polynucleotide template.
[0012] In any of the preceding embodiments, in c)i) and c)ii), the first and second images can be acquired using the same power or powers that differ by no more than 25%, no more than 20%, no more than 15%, no more than 10%, no more than 5%, or no more than 1%, and the c)iii), the third image can be acquired using a power that is between about 25% and about 75% (optionally 50%) of the power(s) used in c)i) and/or c)ii).
[0013] In some embodiments, provided herein is a method for determining sequences of a plurality of polynucleotide templates having different template sequences, comprising: a) contacting a substrate having clusters of polynucleotides immobilized thereon with a first pool of nucleotides and with oligonucleotide primers, in any order, whereby each cluster comprises: i) multiple copies of one of the different template sequences, and ii) an oligonucleotide primer of the oligonucleotide primers annealed to a primer binding sequence for extension of the oligonucleotide primer templated on a copy of the template sequence, wherein the first pool of nucleotides comprises: i) nucleotides of a first base which are labeled with a label configured to be detected at a first wavelength, ii) nucleotides of a second base which are labeled with a label configured to be detected at a second wavelength which is different from the first wavelength, and iii) nucleotides of a third base comprising nucleotides labeled with a label configured to be detected at the first wavelength and nucleotides labeled with a label configured to be detected at the second wavelength.
[0014] In some embodiments, the method further comprises: b) allowing binding and optional incorporation of the nucleotides of the first pool templated on the multiple copies of the template sequence at each cluster, wherein the bound and optionally incorporated nucleotides are complementary to nucleotides at a first nucleotide position in the template sequences.
[0015] In any of one or more of the preceding embodiments, the method can further comprise: c) imaging the substrate to detect signals or absence thereof at the clusters, wherein the signals are associated with the bound and optionally incorporated nucleotides of the first pool, and wherein the imaging comprises: i) imaging the substrate at the first wavelength and not at the second wavelength to acquire a first image, ii) imaging the substrate at the second wavelength and not at the first wavelength to acquire a second image, and iii) imaging the substrate at the first and second wavelengths simultaneously to acquire a third image, wherein in the third image, the signal intensity associated with the third base is no less than the signal intensity associated with the first base or the signal intensity associated with the second base. [0016] In any of one or more of the preceding embodiments, the method can further comprise generating, for each cluster, a signal codeword comprising signal codes corresponding to the signals or absence thereof detected in c)i) through c)iii), wherein different signal codewords correspond to different bases, thereby determining the bases at the first nucleotide position in the plurality of polynucleotide templates.
[0017] In any of one or more of the preceding embodiments, the first pool of nucleotides can comprise nucleotides of a fourth base which are unlabeled and/or are undetected in the
imaging. Alternatively, in any of one or more of the preceding embodiments, the first pool of nucleotides may not comprise nucleotides of a fourth base.
[0018] In any of one or more of the preceding embodiments, the nucleotides of the third base can comprise a nucleotide labeled both with the label configured to be detected at the first wavelength and with the label configured to be detected at the second wavelength.
[0019] In any of one or more of the preceding embodiments, the nucleotides of the third base can comprise a first nucleotide labeled with only the label configured to be detected at the first wavelength and a second nucleotide labeled with only the label configured to be detected at the second wavelength.
[0020] In any of one or more of the preceding embodiments, in each of the nucleotides of the third base, the ratio of the labels configured to be detected at the first or second wavelength and the nucleotide can be independently 1:1, 2:1, 3:1, 4:1, 5:1, or greater.
[0021] In any of one or more of the preceding embodiments, the labels can be conjugated to the nucleotide via a binding pair, optionally wherein the binding pair comprises biotin, DNP, DIG, or desthiobiotin, and a corresponding binding partner thereof.
[0022] In any of one or more of the preceding embodiments, in c)i), the substrate can be illuminated at the first wavelength at a first full power to acquire the first image. In any of one or more of the preceding embodiments, in c)ii), the substrate can be illuminated at the second wavelength at a second full power to acquire the second image. In any of one or more of the preceding embodiments, in c)iii), the substrate can be illuminated at the first wavelength at less than the first full power, and illuminated at the second wavelength at less than the second full power, thereby acquiring the third image. In any of one or more of the preceding embodiments, in c)iii), the substrate can be illuminated at the first wavelength at about 50% of the first full power, and at the second wavelength at about 50% of the second full power to acquire the third image. In any of one or more of the preceding embodiments, the first and second full powers can be of the same constant current in LED illumination. In any of one or more of the preceding embodiments, the first and second full powers can be of different constant currents in LED illumination. In any of one or more of the preceding embodiments, in the third image, the signal intensity associated with the third base can be about twice the signal intensity associated with the first base or the signal intensity associated with the second base.
[0023] In any of one or more of the preceding embodiments, the signal codeword can comprise signal codes corresponding to signal intensities associated with the nucleotides of the first base, the nucleotides of the second base, or the nucleotides of the third base.
Alternatively, in any of one or more of the preceding embodiments, the signal codeword may not comprise signal codes corresponding to signal intensities associated with the nucleotides of the first base, the nucleotides of the second base, or the nucleotides of the third base. [0024] In any of one or more of the preceding embodiments, the method can comprise: d) contacting the substrate with a second pool of nucleotides comprising unlabeled nucleotides of the first base, unlabeled nucleotides of the second base, unlabeled nucleotides of the third base, and/or unlabeled nucleotides of a fourth base. In any of one or more of the preceding embodiments, the method can comprise: e) allowing binding and optional incorporation of the nucleotides of the second pool templated on the multiple copies of the template sequence at each cluster, wherein the bound and optionally incorporated nucleotides are complementary to nucleotides at the first nucleotide position in the template sequences. [0025] In any of one or more of the preceding embodiments, the method can comprise: d) contacting the substrate with a second pool of nucleotides comprising labeled nucleotides of a fourth base. In any of one or more of the preceding embodiments, the method can comprise: e) allowing binding and optional incorporation of the nucleotides of the second pool templated on the multiple copies of the template sequence at each cluster, wherein the bound and optionally incorporated nucleotides are complementary to nucleotides at the first nucleotide position in the template sequences. In any of one or more of the preceding embodiments, the method can comprise: f) imaging the substrate to detect signals or absence thereof at the clusters, wherein the signals are associated with the bound and optionally incorporated nucleotides of the second pool.
[0026] In any of one or more of the preceding embodiments, the clusters can be disposed at spatially discrete sites on the substrate. In any of one or more of the preceding embodiments, the average distance between adjacent clusters on the substrate can be between about 0.3 pm and about 10 pm. In any of one or more of the preceding embodiments, the density of clusters on the substrate can be between about 100 k/mm2 and about 5000 k/mm2.
[0027] In any of one or more of the preceding embodiments, signals from any two adjacent clusters on the substrate can be optically resolvable. In any of one or more of the preceding embodiments, the clusters can comprise a random array of clusters on the substrate. In any of one or more of the preceding embodiments, the clusters can comprise an ordered array of clusters on the substrate.
[0028] In any of one or more of the preceding embodiments, any one or more of the clusters can be formed via bridge amplification. In any of one or more of the preceding embodiments, any one or more of the clusters can each comprise multiple molecules each
comprising: i) one or more adapter sequences and/or one or more primer binding sequences and ii) the same template sequence or complement thereof. In any of one or more of the preceding embodiments, any one or more of the clusters can be formed via rolling circle amplification (RCA). In any of one or more of the preceding embodiments, any one or more of the clusters can each comprise one or more RCA products (RCPs) each comprising: i) one or more adapter sequences and/or one or more primer binding sequences and ii) the same template sequence.
[0029] In any of one or more of the preceding embodiments, any one or more of the clusters can each comprise: i) one or more polynucleotides that are 5’ immobilized on the substrate and 3’ blocked; ii) one or more polynucleotides that are 3’ immobilized on the substrate; and/or iii) one or more nucleic acid concatemers immobilized on the substrate.
[0030] In any of one or more of the preceding embodiments, the first pool of nucleotides can comprise nucleotides of any three of A, T/U, C, and G. In any of one or more of the preceding embodiments, the first pool of nucleotides can comprise nucleotides of A, T/U, and C, but no nucleotides of G.
[0031] In any of one or more of the preceding embodiments, the first and/or second pool of nucleotides can comprise one or more dNTP monomers and/or one or more dNTP multimers each comprising multiple dNTP molecules conjugated to a scaffold. In some embodiments, the scaffold is conjugated to one or more detectable labels such as fluorescent moieties. In some embodiments, the scaffold comprises a streptavidin or a variant or mutein thereof. In some embodiments, the scaffold comprises an oligomeric streptavidin or a variant or mutein thereof. In any of one or more of the preceding embodiments, the first and/or second pool of nucleotides can each comprise a reversible terminator. In any of one or more of the preceding embodiments, the first and/or second pool of nucleotides can each comprise a 3’- O-blocked reversible terminator or a 3 ’-unblocked reversible terminator. In any of one or more of the preceding embodiments, the method can comprise allowing nucleotide incorporation after each imaging of the substrate. In any of one or more of the preceding embodiments, the method can comprise removing unbound or unincorporated nucleotides from the substrate prior to imaging the substrate.
[0032] In any of one or more of the preceding embodiments, the method can comprise determining the bases at a second nucleotide position in the plurality of polynucleotide templates, wherein the second nucleotide position is 5’ to the first nucleotide position in the plurality of polynucleotide templates.
[0033] Also provided herein in some embodiments include a kit, comprising: i) a first plurality of nucleotide molecules of a first base, comprising nucleotides labeled or configured to be labeled with a first label detectable at a first wavelength, ii) a second plurality of nucleotide molecules of a second base which is different from the first base, comprising nucleotides labeled or configured to be labeled with a second label detectable at a second wavelength which is different from the first wavelength, and iii) a third plurality of nucleotide molecules of a third base which is different from the first and second bases, comprising a nucleotide labeled or configured to be labeled with a third label detectable at the first wavelength and a nucleotide labeled or configured to be labeled with a fourth label detectable at the second wavelength.
[0034] In any of the preceding embodiments, the kit can comprise instructions for using the first plurality, the second plurality, and/or the third plurality of nucleotide molecules according to a method in the present disclosure.
[0035] In any of the preceding embodiments, the first plurality, the second plurality, and/or the third plurality can each independently comprise between about 5% and about 50% nucleotide molecules that are not detectably labeled. In any of the preceding embodiments, the first plurality, the second plurality, and/or the third plurality can each independently comprise about 10% nucleotide molecules that are not detectably labeled.
[0036] In any of the preceding embodiments, the kit can comprise a fourth plurality of nucleotide molecules of a fourth base which is different from the first, second, and third bases, optionally wherein the fourth plurality of nucleotide molecules are not detectably labeled.
[0037] In any of the preceding embodiments, the first, second, third, and/or fourth plurality can each independently comprise one or more reversibly terminated nucleotide molecules, and/or one or more complexes comprising nucleotide molecules conjugated to a scaffold such that the nucleotide molecules are not incorporable.
INCORPORATION BY REFERENCE
[0038] All publications, patents, and patent applications mentioned in this specification are herein incorporated by reference in their entirety to the same extent as if each individual publication, patent, or patent application was specifically and individually indicated to be incorporated by reference in its entirety. In the event of a conflict between a term herein and a term in an incorporated reference, the term herein controls.
BRIEF DESCRIPTION OF THE DRAWINGS
[0039] Various aspects of the disclosed methods, devices, and systems are set forth with particularity in the appended claims. A better understanding of the features and advantages of the disclosed methods, devices, and systems will be obtained by reference to the following detailed description of illustrative embodiments and the accompanying drawings, of which: [0040] FIG. 1 provides an exemplary flowchart for nucleic acid sequencing, according to some embodiments described herein.
[0041] FIG. 2 provides an exemplary process for nucleic acid sequencing. In this example, sequencing-by- synthesis is performed with labeled nucleotides for three out of the four bases. A) shows a representative status of DNA strands with primer during sequencing with ATGC as next base; B) shows the extended base after contact with labeled dATP, labeled dCTP, and labeled dTTP, where G is omitted since dGTP is not added; C) shows an image truth table: Image 1 is green illumination at a constant current drive (e.g., 12 Amp), Image 2 is red illumination at the same current (e.g., 12 Amp), Image 3 is green and red illumination with a total current that is same as previous (e.g. 12 Amp). D) shows the 3’ block can be cleaved after further contact with dye-labeled dGTP and/or unlabeled dGTP. All of the nucleotides in this illustration bear 3’ blocker.
[0042] FIG. 3 provides an exemplary ideal plot of intensities of DNA clusters in Image 1 and Image 2. A is red only, T is green only, and C is dual color with more efficient fluorescent dyes which intensities are equal to red only dye or green only dye at half or less surface dye concentration.
[0043] FIG. 4 provides an exemplary plot of signal intensities of DNA clusters in Image 1 and Image 2 in reality. A is red only, T is green only, and C is dual color with which intensity at red or green channel is half of the red only dye or the green only dye, respectively.
[0044] FIG. 5 provides an exemplary plot of signal intensities of DNA clusters in Image 1, Image 2 and Image 3. A is red only, T is green only, and C is dual color with which intensity at red or green channel is equal to the red only dye or the green only dye, respectively. The distance between bright clusters and dark clusters is improved due to the use of Image 3. [0045] FIG. 6 provides an exemplary plot of signal intensities of DNA clusters in Image 1, Image 2 and Image 3. A is red only, T is green only, and C is dual color with which intensity at red or green channel is half of the red only dye or the green only dye, respectively. The distance between bright clusters and dark clusters is improved due to the use of Image 3.
[0046] FIG. 7 provides an exemplary scheme showing streptavidin with dual dye labels which can be used to stain a biotin or desthiobiotin tagged nucleotide during sequencing to improve the brightness of dual coded nucleotide such as C in FIGS. 3-6.
[0047] FIG. 8 provides an exemplary scheme showing a streptavidin mixture with different dye labels which can be used to stain biotin or desthiobiotin tagged nucleotide during sequencing to improve the brightness of dual coded nucleotides such as C in FIGS. 3-6.
DETAILED DESCRIPTION
I. Definitions
[0048] Specific terminology is used throughout this disclosure to explain various aspects of the apparatus, systems, methods, and compositions that are described.
[0049] Having described some illustrative embodiments of the present disclosure, it should be apparent to those skilled in the art that the foregoing is merely illustrative and not limiting, having been presented by way of example only. Numerous modifications and other illustrative embodiments are within the scope of one of ordinary skill in the art and are contemplated as falling within the scope of the present disclosure. In particular, although many of the examples presented herein involve specific combinations of method acts or system elements, it should be understood that those acts and those elements may be combined in other ways to accomplish the same objectives.
[0050] As used herein, the singular forms “a,” “an,” and “the” include plural referents unless the context clearly dictates otherwise. For example, “a” or “an” means “at least one” or “one or more.”
[0051] The term “about” as used herein refers to the usual error range for the respective value readily known to the skilled person in this technical field. Reference to “about” a value or parameter herein includes (and describes) embodiments that are directed to that value or parameter per se.
[0052] Throughout this disclosure, various aspects of the claimed subject matter are presented in a range format. It should be understood that the description in range format is merely for convenience and brevity and should not be construed as an inflexible limitation on the scope of the claimed subject matter. Accordingly, the description of a range should be considered to have specifically disclosed all the possible sub-ranges as well as individual numerical values within that range. For example, where a range of values is provided, it is understood that each intervening value, between the upper and lower limit of that range and
any other stated or intervening value in that stated range is encompassed within the claimed subject matter. The upper and lower limits of these smaller ranges may independently be included in the smaller ranges, and are also encompassed within the claimed subject matter, subject to any specifically excluded limit in the stated range. Where the stated range includes one or both of the limits, ranges excluding either or both of those included limits are also included in the claimed subject matter. This applies regardless of the breadth of the range. [0053] Use of ordinal terms such as “first”, “second”, “third”, etc., in the claims to modify a claim element does not by itself connote any priority, precedence, or order of one claim element over another or the temporal order in which acts of a method are performed, but are used merely as labels to distinguish one claim element having a certain name from another element having a same name (but for use of the ordinal term) to distinguish the claim elements. Similarly, use of a), b), etc., or i), ii), etc. does not by itself connote any priority, precedence, or order of steps in the claims. Similarly, the use of these terms in the specification does not by itself connote any required priority, precedence, or order.
[0054] The terms “nucleic acid” and “nucleotide” are intended to be consistent with their use in the art and to include naturally-occurring species or functional analogs thereof. Particularly useful functional analogs of nucleic acids are capable of hybridizing to a nucleic acid in a sequence-specific fashion (e.g., capable of hybridizing to two nucleic acids such that ligation can occur between the two hybridized nucleic acids) or are capable of being used as a template for replication of a particular nucleotide sequence. Naturally-occurring nucleic acids generally have a backbone containing phosphodiester bonds. An analog structure can have an alternate backbone linkage including any of a variety of those known in the art. Naturally- occurring nucleic acids generally have a deoxyribose sugar (e.g., found in deoxyribonucleic acid (DNA)) or a ribose sugar (e.g. found in ribonucleic acid (RNA)).
[0055] A nucleic acid can contain nucleotides having any of a variety of analogs of these sugar moieties that are known in the art. A nucleic acid can include native or non-native nucleotides. In this regard, a native deoxyribonucleic acid can have one or more bases selected from the group consisting of adenine (A), thymine (T), cytosine (C), or guanine (G), and a ribonucleic acid can have one or more bases selected from the group consisting of uracil (U), adenine (A), cytosine (C), or guanine (G). Useful non-native bases that can be included in a nucleic acid or nucleotide are known in the art.
[0056] A “probe” or a “target,” when used in reference to a nucleic acid or sequence of a nucleic acids, is intended as a semantic identifier for the nucleic acid or sequence in the
context of a method or composition, and does not limit the structure or function of the nucleic acid or sequence beyond what is expressly indicated.
[0057] The terms “oligonucleotide” and “polynucleotide” are used interchangeably to refer to a single-stranded multimer of nucleotides from about 2 to about 500 nucleotides in length. Oligonucleotides can be synthetic, made enzymatically (e.g., via polymerization), or using a “split-pool” method. Oligonucleotides can include ribonucleotide monomers (e.g., can be oligoribonucleotides) and/or deoxyribonucleotide monomers (e.g., oligodeoxyribonucleotides). In some examples, oligonucleotides can include a combination of both deoxyribonucleotide monomers and ribonucleotide monomers in the oligonucleotide (e.g., random or ordered combination of deoxyribonucleotide monomers and ribonucleotide monomers). An oligonucleotide can be 4 to 10, 10 to 20, 21 to 30, 31 to 40, 41 to 50, 51 to 60, 61 to 70, 71 to 80, 80 to 100, 100 to 150, 150 to 200, 200 to 250, 250 to 300, 300 to 350, 350 to 400, or 400 to 500 nucleotides in length, for example. Oligonucleotides can include one or more functional moieties that are attached (e.g., covalently or non-covalently) to the multimer structure. For example, an oligonucleotide can include one or more detectable labels (e.g., a radioisotope or fluorophore).
[0058] The terms “detectable label,” “optical label,” and “label” are used interchangeably herein to refer to a directly or indirectly detectable moiety that is coupled to or may be coupled to another moiety, for example, a nucleotide or nucleotide analog. The detectable label can be directly detectable by itself (e.g., radioisotope labels or fluorescent labels) or, in the case of an enzymatic label, can be indirectly detectable, e.g., by catalyzing chemical alterations of a substrate compound or composition, which substrate compound or composition is directly detectable. The label can emit a signal or alter a signal delivered to the label so that the presence or absence of the label can be detected. In some cases, coupling may be via a linker, which may be cleavable, such as photo-cleavable (e.g., cleavable under ultra-violet light), chemically-cleavable (e.g., via a reducing agent, such as dithiothreitol (DTT), tris(2-carboxyethyl)phosphine (TCEP)) or enzymatically cleavable (e.g., via an esterase, lipase, peptidase, or protease).
[0059] In some embodiments, a detectable label is or includes a fluorophore. Exemplary fluorophores include, but are not limited to, fluorescent nanocrystals; quantum dots; d- Rhodamine acceptor dyes including dichloro [R 110], dichloro [R6G], dichloro [TAMRA], dichloro[ROX] or the like; fluorescein donor dye including fluorescein, 6-FAM, or the like; Cyanine dyes such as Cy3B; Alexa dyes, SETA dyes, Atto dyes such as atto 647N which forms a FRET pair with Cy3B and the like. Fluorophores include, but are not limited to,
MDCC (7-diethylamino-3-[([(2-maleimidyl)ethyl]amino)carbonyl]coumarin), TET, HEX, Cy3, TMR, ROX, Texas Red, Cy5, LC red 705 and LC red 640.
[0060] In some embodiments, a detectable label is or includes a luminescent or chemiluminescent moiety. Common luminescent/chemiluminescent moieties include, but are not limited to, peroxidases such as horseradish peroxidase (HRP), soybean peroxidase (SP), alkaline phosphatase, and luciferase. These protein moieties can catalyze chemiluminescent reactions given the appropriate substrates (e.g., an oxidizing reagent plus a chemiluminescent compound. A number of compound families are known to provide chemiluminescence under a variety of conditions. Non-limiting examples of chemiluminescent compound families include 2,3-dihydro-l,4-phthalazinedione luminol, 5-amino-6,7,8-trimethoxy- and the dimethylamino[ca]benz analog. These compounds can luminesce in the presence of alkaline hydrogen peroxide or calcium hypochlorite and base. Other examples of chemiluminescent compound families include, e.g., 2,4,5-triphenylimidazoles, para-dimethylamino and - methoxy substituents, oxalates such as oxalyl active esters, p-nitrophenyl, N-alkyl acridinum esters, luciferins, lucigenins, or acridinium esters. In some embodiments, a detectable label is or includes a metal-based or mass-based label.
[0061] The terms “hybridizing,” “hybridize,” “annealing,” and “anneal” are used interchangeably in this disclosure, and refer to the pairing of substantially complementary or complementary nucleic acid sequences within two different molecules. Pairing can be achieved by any process in which a nucleic acid sequence joins with a substantially or fully complementary sequence through base pairing to form a hybridization complex. For purposes of hybridization, two nucleic acid sequences are “substantially complementary” if at least 60% (e.g., at least 70%, at least 80%, or at least 90%) of their individual bases are complementary to one another.
[0062] A “primer” is a single- stranded nucleic acid sequence having a 3’ end that can be used as a substrate for a nucleic acid polymerase in a nucleic acid extension reaction. RNA primers are formed of RNA nucleotides, and are used in RNA synthesis, while DNA primers are formed of DNA nucleotides and used in DNA synthesis. Primers can also include both RNA nucleotides and DNA nucleotides (e.g., in a random or designed pattern). Primers can also include other natural or synthetic nucleotides described herein that can have additional functionality. In some examples, DNA primers can be used to prime RNA synthesis and vice versa (e.g., RNA primers can be used to prime DNA synthesis). Primers can vary in length. For example, primers can be about 6 bases to about 120 bases. For example, primers can
include up to about 25 bases. A primer, may in some cases, refer to a primer binding sequence.
[0063] A “nucleic acid extension” generally involves incorporation of one or more nucleic acids (e.g., A, G, C, T, U, nucleotide analogs, or derivatives thereof) into a molecule (such as, but not limited to, a nucleic acid sequence) in a template-dependent manner, such that consecutive nucleic acids are incorporated by an enzyme (such as a polymerase or reverse transcriptase), thereby generating a newly synthesized nucleic acid molecule. Enzymatic extension can be performed by an enzyme including, but not limited to, a polymerase and/or a reverse transcriptase. For example, a primer that hybridizes to a complementary nucleic acid sequence can be used to synthesize a new nucleic acid molecule by using the complementary nucleic acid sequence as a template for nucleic acid synthesis. Similarly, a 3’ polyadenylated tail of an mRNA transcript that hybridizes to a poly (dT) sequence can be used as a template for single-strand synthesis of a corresponding cDNA molecule. Furthermore, a poly (dT) sequence may be used as a sequencing primer for sequencing RNA molecules comprising poly (A) tails.
[0064] A “non-terminating nucleotide” or “incorporating nucleotide” can include a nucleic acid moiety that can be attached to a 3' end of a polynucleotide using a polymerase or transcriptase, and that can have another non-terminating nucleic acid attached to it using a polymerase or transcriptase without the need to remove a protecting group or reversible terminator from the nucleotide. Naturally occurring nucleic acids are a type of nonterminating nucleic acid. Non-terminating nucleic acids may be labeled or unlabeled.
[0065] A “PCR amplification” refers to the use of a polymerase chain reaction (PCR) to generate copies of genetic material, including DNA and RNA sequences. Suitable reagents and conditions for implementing PCR are described, for example, in U.S. Patent Nos. 4,683,202, 4,683,195, 4,800,159, 4,965,188, and 5,512,462, the entire contents of each of which are incorporated herein by reference. In a typical PCR amplification, the reaction mixture includes the genetic material to be amplified, an enzyme, one or more primers that are employed in a primer extension reaction, and reagents for the reaction. The oligonucleotide primers are of sufficient length to provide for hybridization to complementary genetic material under annealing conditions. The length of the primers generally depends on the length of the amplification domains, but will typically be at least 4 bases, at least 5 bases, at least 6 bases, at least 8 bases, at least 9 bases, at least 10 base pairs (bp), at least 11 bp, at least 12 bp, at least 13 bp, at least 14 bp, at least 15 bp, at least 16 bp, at least 17 bp, at least 18 bp, at least 19 bp, at least 20 bp, at least 25 bp, at least 30 bp, at
least 35 bp, and can be as long as 40 bp or longer, where the length of the primers will generally range from 18 to 50 bp. The genetic material can be contacted with a single primer or a set of two primers (forward and reverse primers), depending upon whether primer extension, linear or exponential amplification of the genetic material is desired.
[0066] In some embodiments, the PCR amplification process uses a DNA polymerase enzyme. The DNA polymerase activity can be provided by one or more distinct DNA polymerase enzymes. In some embodiments, the DNA polymerase enzyme is from a bacterium, e.g., the DNA polymerase enzyme is a bacterial DNA polymerase enzyme. For instance, the DNA polymerase can be from a bacterium of the genus Escherichia, Bacillus, Thermophilus, or Pyrococcus.
[0067] In some embodiments, PCR amplification can include reactions such as, but not limited to, a strand-displacement amplification reaction, a rolling circle amplification reaction, a ligase chain reaction, a transcription-mediated amplification reaction, an isothermal amplification reaction, and/or a loop-mediated amplification reaction.
[0068] In some embodiments, PCR amplification uses a single primer that is complementary to the 3’ tag of target DNA fragments. In some embodiments, PCR amplification uses a first and a second primer, where at least a 3’ end portion of the first primer is complementary to at least a portion of the 3’ tag of the target nucleic acid fragments, and where at least a 3’ end portion of the second primer exhibits the sequence of at least a portion of the 5’ tag of the target nucleic acid fragments. In some embodiments, a 5’ end portion of the first primer is non-complementary to the 3’ tag of the target nucleic acid fragments, and a 5’ end portion of the second primer does not exhibit the sequence of at least a portion of the 5’ tag of the target nucleic acid fragments. In some embodiments, the first primer includes a first universal sequence and/or the second primer includes a second universal sequence.
[0069] The term “DNA polymerase” includes not only naturally-occurring enzymes but also all modified derivatives thereof, including also derivatives of naturally-occurring DNA polymerase enzymes. For instance, in some embodiments, the DNA polymerase can have been modified to remove 5’-3’ exonuclease activity. Sequence-modified derivatives or mutants of DNA polymerase enzymes that can be used include, but are not limited to, mutants that retain at least some of the functional, e.g., DNA polymerase activity of the wildtype sequence. Mutations can affect the activity profile of the enzymes, e.g., enhance or reduce the rate of polymerization, under different reaction conditions, e.g., temperature, template concentration, primer concentration, etc. Mutations or sequence-modifications can also affect the exonuclease activity and/or thermostability of the enzyme.
[0070] Suitable examples of DNA polymerases that can be used include, but are not limited to: E.coli DNA polymerase I, Bsu DNA polymerase, Bst DNA polymerase, Taq DNA polymerase, VENT™ DNA polymerase, DEEPVENT™ DNA polymerase, LongAmp® Taq DNA polymerase, LongAmp® Hot Start Taq DNA polymerase, Crimson LongAmp® Taq DNA polymerase, Crimson Taq DNA polymerase, OneTaq® DNA polymerase, OneTaq® Quick-Load® DNA polymerase, Hemo KlenTaq® DNA polymerase, REDTaq® DNA polymerase, Phusion® DNA polymerase, Phusion® High-Fidelity DNA polymerase, Platinum Pfx DNA polymerase, AccuPrime Pfx DNA polymerase, Phi29 DNA polymerase, Klenow fragment, Pwo DNA polymerase, Pfu DNA polymerase, T4 DNA polymerase and T7 DNA polymerase enzymes.
[0071] In some embodiments, genetic material is amplified by reverse transcription polymerase chain reaction (RT-PCR). The desired reverse transcriptase activity can be provided by one or more distinct reverse transcriptase enzymes, suitable examples of which include, but are not limited to: M-MLV, MuLV, AMV, HIV, Array Script™, MultiScribe™, ThermoScript™, and SuperScript® I, II, III, and IV enzymes. “Reverse transcriptase” includes not only naturally occurring enzymes, but all such modified derivatives thereof, including also derivatives of naturally-occurring reverse transcriptase enzymes.
[0072] In addition, reverse transcription can be performed using sequence-modified derivatives or mutants of M-MLV, MuLV, AMV, and HIV reverse transcriptase enzymes, including mutants that retain at least some of the functional, e.g., reverse transcriptase, activity of the wild-type sequence. The reverse transcriptase enzyme can be provided as part of a composition that includes other components, e.g., stabilizing components that enhance or improve the activity of the reverse transcriptase enzyme, such as RNase inhibitor(s), inhibitors of DNA-dependent DNA synthesis, e.g., actinomycin D. Many sequence-modified derivative or mutants of reverse transcriptase enzymes, e.g., M-MLV, and compositions including unmodified and modified enzymes are commercially available, e.g., ArrayScript™, MultiScribe™, ThermoScript™, and SuperScript® I, II, III, and IV enzymes.
[0073] Certain reverse transcriptase enzymes (e.g., Avian Myeloblastosis Virus (AMV) Reverse Transcriptase and Moloney Murine Leukemia Virus (M-MuLV, MMLV) Reverse Transcriptase) can synthesize a complementary DNA strand using both RNA (cDNA synthesis) and single- stranded DNA (ssDNA) as a template. Thus, in some embodiments, the reverse transcription reaction can use an enzyme (reverse transcriptase) that is capable of using both RNA and ssDNA as the template for an extension reaction, e.g., an AMV or MMLV reverse transcriptase.
II. Overview
[0074] In some embodiments, provided herein are methods to detect different base combination of nucleic acids (e.g., ssDNA in clonal clusters) which are immobilized on surface (e.g., a surface in a flow cell for sequencing). The general base decoding processing are illustrated, by way of example, in FIG. 2, using DNA clusters as examples.
[0075] In the particular example, the DNA clusters which are amplified by surface PCR are immobilized on surface. Each DNA clusters contains hundreds to thousands of copies of the same DNA sequence which is to be sequenced. Any two different clusters can contain the same DNA sequence or different DNA sequences to be sequenced. There are four different bases shown in FIG. 2 which represent the four possible initial configurations at each cluster at the beginning of DNA sequencing. The mixture of three dNTPs, e.g., dATP, dCTP and dTTP with different labels, and a polymerase can be flowed through (e.g., through a flow cell in which the clusters are immobilized on a substrate) and contacted with the ssDNA strands hybridized with sequencing primers. The matched dNTPs are incorporated onto the strands and the missing dNTP (e.g., dGTP) is not incorporated. The excess dNTPs can be washed off and fluorescent imaging can be performed under two different illumination sources, e.g., 530 nm and 630 nm. In FIG. 2, dATP labelled with fluorescent dye (L2) can be detected under 630 nm excitation wavelength (red). dTTP with LI fluorescent label can be detected under green light (530 nm). dCTP which is labelled with two dyes (dCTP-L3+L4) or a mixture of nucleotide with two different dyes (e.g., dCTP-L3 and dCTP-L4) can be detected with both illumination conditions. Red and green colors are used as examples and fluorescent labels of other colors can be used. Wavelengths are often referred to using their associated color and include blue (400-470 nm), green (470-550 nm), red (630-700 nm) and NIR (700-1200) lights.
[0076] In the particular example, the image 1 is acquired with 530 nm LED illumination at full power (e.g., constant current at 12 Amps). The image 2 is acquired with 630 nm LED illumination at full power as well (e.g., constant current at 12 Amps). The base types can be called based on these two images. The base is called as “A” at a cluster if the cluster only lights up in Image 2. The base at a cluster is “T” if the cluster only lights up in Image 1. The base is “C” if the cluster is detectable in both Image 1 and Image 2. The base is “G” if the cluster is not detected in either image. However, the fluorescent dyes selections can be very limit for dual labelled or mixed dyes to have the same brightness as single labelled nucleotide in red or green channel, since the concentration is split into half by two dyes. The dual labelled dyes will have energy transfer between dyes or different quantum yield, which can
make it difficult to balance the brightness between these dyes. The ideal distribution of the intensity of DNA clusters in Image 1 and Image 2 is shown in FIG. 3. The average signal intensities of T and C in Image 1 is similar or equal and same for A and C in Image 2. However, in reality, in sequencing images the average signal intensity of C can be far weaker than T and A as demonstrated in FIG. 4, e.g., due to energy transfer between dyes.
[0077] To improve the distance between A, T, G, and C clouds of clusters, in some embodiments a third image can be acquired with both wavelengths, e.g., using 530 nm and 630 nm light sources (LED1 and LED2, respectively), where the clusters are illuminated but with 50% of power for LED1 and 50% of power of LED2 (12 Amps current in total). The dual labelled nucleotide can show 100% of the emission intensity since it can be excited by both light. The other two nucleotide with single labelled dye can only reach 50% of intensity in Image 1 and Image 2. These intensities can be used as third axis to improve the separation between the cluster intensity clouds. As shown in FIG. 5, the distance between dark cloud and the dual labeled “C” cloud can be increased for 1.414 to 1.732 which is about 20% improvement compared to an ideal case using a conventional method (e.g., using only Image 1 and Image 2). As shown in FIG. 6, the distance between dark cloud and the dual labeled “C” cloud can be improved by using Image 3, even when the dual color nucleotide (“C”) has intensity at red or green channel that is half of the red only dye or the green only dye. The distance between dark (“G”) and “C” clouds is increased from 0.707 to 0.866. The pass filter rate and quality score can be improved with increased distance between the cluster clouds. [0078] To improve the brightness balance between nucleotides, the nucleotides can be labelled with non-fluorescent labels such as biotin, desthiobiotin, 2,4-Dinitrophenyl (DNP) and Digoxigenin etc., which can be stained with dye labelled streptavidin or antibody. The intensities of clusters can be easily tuned by varying streptavidin/dye ratio or antibody /ratio. In addition, the intensities can be also tune by mixing of streptavidin without any tag. The dual channel brightness can be simply tuned by fine tuning the ratio between streptavidin- dyel, streptavidin-dye2, and optionally unlabeled streptavidin, as show in FIG. 7 and FIG. 8. In some embodiments, the protein/dye ratio can be achieved to reach more than 5 dye molecules per protein molecule.
[0079] The missing nucleotide such as G in FIG. 2 can be incorporated by contacting the dGTP with or without labels and polymerase in a separate step or can be introduced together with other unlabeled or labelled nucleotides such as dATP, dCTP and dTTP to improve the incorporation yield and prevent grow lag. The advantage of introducing another dNTP mix after imaging is the following. In some embodiments, to improve the efficiency the dark
nucleotide can be mixed with other dye labelled nucleotides, e.g., 10% of total nucleotides can be unlabeled, which can improve the phasing too since dark dNTP is incorporated more efficiency than labelled nucleotides. In some embodiments, the missing nucleotide can be labeled with biotin or other hapten, a third or fourth image can be acquired to light up the fourth type of clusters such as G in FIG. 4 to improve the accuracy of base calling after stain with streptavidin or antibody with dyes.
[0080] The internal fluorescence dye can be replaced with other biomolecules such as biotin, desthiobiotin, 2,4-Dinitrophenol and digoxigenin etc. The fluorescent dye can be introduced later with streptavidin for biotin and desthiobiotin or antibody for small hapten. The dye tag can also be released with biotin for desthiobiotin labeled oligo.
III. Samples and Nucleic Acid Molecules
[0081] The nucleic acid molecules used in the methods described herein may be obtained from any suitable biological source, for example a tissue sample, a blood sample, a plasma sample, a saliva sample, a fecal sample, or a urine sample. The polynucleotides may be DNA or RNA molecules. In some embodiments, RNA molecules are reverse transcribed into DNA molecules prior to hybridizing the polynucleotide to a sequencing primer. In some embodiments, RNA molecules are not reverse transcribed and are hybridized to a sequencing primer for direct RNA sequencing. In some embodiments, the nucleic acid molecule is a cell-free DNA (cfDNA), such as a circulating tumor DNA (ctDNA) or a fetal cell-free DNA. [0082] Examples of nucleic acid molecules include DNA molecules such as single- stranded DNA (ssDNA), double-stranded DNA (dsDNA), genomic DNA, methylated DNA, specific methylated DNA sequences, fragmented DNA, mitochondrial DNA, in situ synthesized PCR products, and RNA/DNA hybrids. The DNA analyte can be a transcript of another nucleic acid molecule (e.g., DNA or RNA such as mRNA) present in a tissue sample.
[0083] Examples of nucleic acid molecules also include RNA molecules such as various types of coding and non-coding RNA, including viral RNAs. Examples of the different types of RNA molecules include messenger RNA (mRNA), including a nascent RNA, a pre- mRNA, a primary-transcript RNA, and a processed RNA, such as a capped mRNA (e.g., with a 5’ 7-methyl guanosine cap), a polyadenylated mRNA (poly-A tail at the 3’ end), and a spliced mRNA in which one or more introns have been removed. Also included in the nucleic acid molecules disclosed herein are non-capped mRNA, a non-poly adenylated
mRNA, and a non-spliced mRNA. The RNA analyte can be a transcript of another nucleic acid molecule (e.g., DNA or RNA such as viral RNA).
[0084] In some embodiments, a nucleic acid molecule may be a denatured nucleic acid, wherein the resulting denatured nucleic acid is single- stranded. The nucleic acid may be denatured, for example, optionally using formamide, heat, or both formamide and heat. In some embodiments, the nucleic acid is not denatured for use in a method disclosed herein. [0085] In some embodiments, a nucleic acid molecule can be extracted from a cell, a virus, or a tissue sample comprising the cell or virus. Processing conditions can be adjusted to extract or release nucleic acid molecules (e.g., RNA) from a cell, a virus, or a tissue sample.
IV. Sequencing Methods
A. Nucleotides and Nucleotide Analogs
[0086] In some embodiments, a method disclosed herein comprises using one or more nucleotides or analogs thereof, including a native nucleotide or a nucleotide analog or modified nucleotide (e.g., labeled with one or more detectable labels). In some embodiments, a nucleotide analog comprises a nitrogenous base, five-carbon sugar, and phosphate group, wherein any component of the nucleotide may be modified and/or replaced. In some embodiments, a method disclosed herein may comprise but does not require using one or more non-incorporable nucleotides. Non-incorporable nucleotides may be modified to become incorporable at any point during the sequencing method.
[0087] Nucleotide analogs include, but are not limited to, alpha-phosphate modified nucleotides, alpha-beta nucleotide analogs, beta-phosphate modified nucleotides, beta-gamma nucleotide analogs, gamma-phosphate modified nucleotides, caged nucleotides, or ddNTPs. Examples of nucleotide analogs are described in U.S. Patent No. 8,071,755, which is incorporated by reference herein in its entirety.
[0088] In some embodiments, a method disclosed herein may comprise but does not require using terminators that reversibly prevent nucleotide incorporation at the 3 '-end of the primer. One type of reversible terminator is a 3'-O-blocked reversible terminator. Here the terminator moiety is linked to the oxygen atom of the 3'-OH end of the 5-carbon sugar of a nucleotide. For example, U.S. Patent Nos. 7,544,794 and 8,034,923 (the disclosures of these patents are incorporated by reference) describe reversible terminator dNTPs having the 3'-OH group replaced by a 3'-ONH2 group. Another type of reversible terminator is a 3 '-unblocked reversible terminator, wherein the terminator moiety is linked to the nitrogenous base of a
nucleotide. For example, U.S. Patent No. 8,808,989 (the disclosure of which is incorporated by reference) discloses particular examples of base-modified reversible terminator nucleotides that may be used in connection with the methods described herein. Other reversible terminators that similarly can be used in connection with the methods described herein include those described in U.S. Patent Nos. 7,956,171, 8,071,755, and 9,399,798, herein incorporated by reference.
[0089] In some embodiments, a method disclosed herein may comprise but does not require using nucleotide analogs having terminator moieties that irreversibly prevent nucleotide incorporation at the 3 '-end of the primer. Irreversible nucleotide analogs include 2', 3'- dideoxynucleotides, ddNTPs (ddGTP, ddATP, ddTTP, ddCTP). Dideoxynucleotides lack the 3'-OH group of dNTPs that is essential for polymerase-mediated synthesis.
[0090] In some embodiments, a method disclosed herein may comprise but does not require using non-incorporable nucleotides comprising a blocking moiety that inhibits or prevents the nucleotide from forming a covalent linkage to a second nucleotide (3'-OH of a primer) during the incorporation step of a nucleic acid polymerization reaction. The blocking moiety can be removed from the nucleotide, allowing for nucleotide incorporation.
[0091] In some embodiments, a method disclosed herein may comprise but does not require using 1, 2, 3, 4 or more nucleotide analogs present in the SBS reaction. In some embodiments, a nucleotide analog is replaced, diluted, or sequestered during an incorporation step. In some embodiments, a nucleotide analog is replaced with a native nucleotide. In some embodiments, a nucleotide analog is modified during an incorporation step. The modified nucleotide analog can be similar to or the same as a native nucleotide.
[0092] In some embodiments, a method disclosed herein may comprise but does not require using a nucleotide analog having a different binding affinity for a polymerase than a native nucleotide. In some embodiments, a nucleotide analog has a different interaction with a next base than a native nucleotide. Nucleotide analogs and/or non-incorporable nucleotides may base-pair with a complementary base of a template nucleic acid.
[0093] In some embodiments, one or more nucleotides can be labeled with distinguishing and/or detectable tags or labels. The tags may be distinguishable by means of their differences in fluorescence, Raman spectrum, charge, mass, refractive index, luminescence, length, or any other measurable property. The tag may be attached to one or more different positions on the nucleotide, so long as the fidelity of binding to the polymerase-nucleic acid complex is sufficiently maintained to enable identification of the complementary base on the template nucleic acid correctly. In some embodiments, the tag is attached to the nucleobase of
the nucleotide. Alternatively, a tag is attached to the gamma phosphate position of the nucleotide.
[0094] Detectable labels can be suitable for small scale detection and/or suitable for high- throughput screening. As such, suitable detectable labels include, but are not limited to, radioisotopes, fluorophores, chemiluminescent compounds, bioluminescent compounds, and dyes. The detectable label can be qualitatively detected (e.g., optically or spectrally), or it can be quantified. Qualitative detection generally includes a detection method in which the existence or presence of the detectable label is confirmed, whereas quantifiable detection generally includes a detection method having a quantifiable (e.g., numerically reportable) value such as an intensity, duration, polarization, and/or other properties. In some embodiments, the detectable label is bound to another moiety, for example, a nucleotide or nucleotide analog, and can include a fluorescent, a colorimetric, or a chemiluminescent label. [0095] In some embodiments, a detectable label can be attached to another moiety, for example, a nucleotide or nucleotide analog. In some embodiments, the detectable label is a fluorophore. For example, the fluorophore can be from a group that includes: 7-AAD (7- Aminoactinomycin D), Acridine Orange (+DNA), Acridine Orange (+RNA), Alexa Fluor® 350, Alexa Fluor® 430, Alexa Fluor® 488, Alexa Fluor® 532, Alexa Fluor® 546, Alexa Fluor® 555, Alexa Fluor® 568, Alexa Fluor® 594, Alexa Fluor® 633, Alexa Fluor® 647, Alexa Fluor® 660, Alexa Fluor® 680, Alexa Fluor® 700, Alexa Fluor® 750, Allophycocyanin (APC), AMCA / AMCA-X, 7-Aminoactinomycin D (7-AAD), 7- Amino-4- methylcoumarin, 6-Aminoquinoline, Aniline Blue, ANS, APC-Cy7, ATTO-TAG™ CBQCA, ATTO-TAG™ FQ, Auramine O-Feulgen, BCECF (high pH), BFP (Blue Fluorescent Protein), BFP / GFP FRET, BOBO™-1 / BO-PRO™- 1, BOBO™-3 / BO-PRO™-3, BODIPY® FL, BODIPY® TMR, BODIPY® TR-X, BODIPY® 530/550, BODIPY® 558/568, BODIPY® 564/570, BODIPY® 581/591, BODIPY® 630/650-X, BODIPY® 650- 665-X, BTC, Calcein, Calcein Blue, Calcium Crimson™, Calcium Green- 1™, Calcium Orange™, Calcofluor® White, 5-Carboxyfluoroscein (5-FAM), 5- Carboxynaphthofluoroscein, 6-Carboxyrhodamine 6G, 5-Carboxytetramethylrhodamine (5- TAMRA), Carboxy-X -rhodamine (5-ROX), Cascade Blue®, Cascade Yellow™, CCF2 (GeneBLAzer™), CFP (Cyan Fluorescent Protein), CFP / YFP FRET, Chromomycin A3, Cl- NERF (low pH), CPM, 6-CR 6G, CTC Formazan, Cy2®, Cy3®, Cy3.5®, Cy5®, Cy5.5®, Cy7®, Cychrome (PE-Cy5), Dansylamine, Dansyl cadaverine, Dansylchloride, DAPI, Dapoxyl, DCFH, DHR, DiA (4-DM6-ASP), DiD (DilC18(5)), DIDS, Dil (DilC18(3)), DiO (DiOC18(3)), DiR (DilC18(7)), Di-4 ANEPPS, Di-8 ANEPPS, DM-NERF (4.5-6.5 pH),
DsRed (Red Fluorescent Protein), EBFP, ECFP, EGFP, ELF® -97 alcohol, Eosin, Erythrosin, Ethidium bromide, Ethidium homodimer- 1 (EthD-1), Europium (III) Chloride, 5-FAM (5- Carboxyfluorescein), Fast Blue, Fluorescein-dT phosphoramidite, FITC, Fluo-3, Fluo-4, FluorX®, Fluoro-Gold™ (high pH), Fluoro-Gold™ (low pH), Fluoro-Jade, FM® 1-43, Fura- 2 (high calcium), Fura-2 / BCECF, Fura Red™ (high calcium), Fura Red™ / Fluo-3, GeneBLAzer™ (CCF2), GFP Red Shifted (rsGFP), GFP Wild Type, GFP / BFP FRET, GFP / DsRed FRET, Hoechst 33342 & 33258, 7-Hydroxy-4-methylcoumarin (pH 9), 1,5 IAEDANS, Indo-1 (high calcium), Indo-1 (low calcium), Indodicarbocyanine, Indotricarbocyanine, JC-1, 6- JOE, JOJO™-1 / JO-PRO™- 1, LDS 751 (+DNA), LDS 751 (+RNA), LOLO™-1 / LO-PRO™-1, Lucifer Yellow, Ly soSensor™ Blue (pH 5), LysoSensor™ Green (pH 5), LysoSensor™ Yellow/Blue (pH 4.2), LysoTracker® Green, LysoTracker® Red, LysoTracker® Yellow, Mag-Fura-2, Mag-Indo-1, Magnesium Green™, Marina Blue®, 4-Methylumbelliferone, Mithramycin, MitoTracker® Green, MitoTracker® Orange, MitoTracker® Red, NBD (amine), Nile Red, Oregon Green® 488, Oregon Green® 500, Oregon Green® 514, Pacific Blue, PBF1, PE (R-phycoerythrin), PE-Cy5, PE-Cy7, PE- Texas Red, PerCP (Peridinin chlorphyll protein), PerCP-Cy5.5 (TruRed), PharRed (APC- Cy7), C-phycocyanin, R-phycocyanin, R-phycoerythrin (PE), PI (Propidium Iodide), PKH26, PKH67, POPO™-1 / PO-PRO™-1, POPO™-3 / PO-PRO™-3, Propidium Iodide (PI), PyMPO, Pyrene, Pyronin Y, Quantam Red (PE-Cy5), Quinacrine Mustard, R670 (PE-Cy5), Red 613 (PE-Texas Red) , Red Fluorescent Protein (DsRed), Resorufin, RH 414, Rhod-2, Rhodamine B, Rhodamine Green™, Rhodamine Red™, Rhodamine Phalloidin, Rhodamine 110, Rhodamine 123, 5-ROX (carboxy -X-rhodamine), S65A, S65C, S65L, S65T, SBFI, SITS, SNAFL®-1 (high pH), SNAFL®-2, SNARF®-1 (high pH), SNARF®-1 (low pH), Sodium Green™, SpectrumAqua®, SpectrumGreen® #1, SpectrumGreen® #2, SpectrumOrange®, SpectrumRed®, SYTO® 11, SYTO® 13, SYTO® 17, SYTO® 45, SYTOX® Blue, SYTOX® Green, SYTOX® Orange, 5-TAMRA (5- Carboxytetramethylrhodamine), Tetramethylrhodamine (TRITC), Texas Red® / Texas Red®-X, Texas Red®-X (NHS Ester), Thiadicarbocyanine, Thiazole Orange, TOTO®-1 / TO-PRO®-1, TOTO®-3 / TO-PRO®-3, TO-PRO®-5, Tri-color (PE-Cy5), TRITC (Tetramethylrhodamine), TruRed (PerCP-Cy5.5), WW 781, X-Rhodamine (XRITC) , Y66F, Y66H, Y66W, YFP (Yellow Fluorescent Protein), YOYO®-1 / YO-PRO®-1, YOYO®-3 / YO-PRO®-3, 6-FAM (Fluorescein), 6-FAM (NHS Ester), 6-FAM (Azide), HEX, TAMRA (NHS Ester), Yakima Yellow, MAX, TET, TEX615, ATTO 488, ATTO 532, ATTO 542, ATTO 550, ATTO 565, ATTO RholOl, ATTO 590, ATTO 633, ATTO 647N, TYE 563,
TYE 665, TYE 705, 5’ IRDye® 700, 5’ IRDye® 800, 5’ IRDye® 800CW (NHS Ester), WellRED D4 Dye, WellRED D3 Dye, WellRED D2 Dye, Lightcycler® 640 (NHS Ester), and Dy 750 (NHS Ester).
[0096] The detectable label can be directly detectable by itself (e.g., radioisotope labels or fluorescent labels) or, in the case of an enzymatic label, can be indirectly detectable, e.g., by catalyzing chemical alterations of a substrate compound or composition, which substrate compound or composition is directly detectable. The label can emit a signal or alter a signal delivered to the label so that the presence or absence of the label can be detected. In some cases, coupling may be via a linker, which may be cleavable, such as photo-cleavable (e.g., cleavable under ultra-violet light), chemically-cleavable (e.g., via a reducing agent, such as dithiothreitol (DTT), tris(2-carboxyethyl)phosphine (TCEP)) or enzymatically cleavable (e.g., via an esterase, lipase, peptidase, or protease).
B. Polymerases
[0097] Polymerases that may be used to carry out the disclosed techniques include naturally- occurring polymerases and any modified variations thereof, including, but not limited to, mutants, recombinants, fusions, genetic modifications, chemical modifications, synthetics, and analogs. Naturally occurring polymerases and modified variations thereof are not limited to polymerases that retain the ability to catalyze a polymerization reaction. In some embodiments, the naturally occurring and/or modified variations thereof retain the ability to catalyze a polymerization reaction. In some embodiments, the naturally-occurring and/or modified variations have special properties that enhance their ability to sequence DNA, including enhanced binding affinity to nucleic acids, reduced binding affinity to nucleic acids, enhanced catalysis rates, reduced catalysis rates, etc. Mutant polymerases include polymerases wherein one or more amino acids are replaced with other amino acids (naturally or non-naturally occurring), and insertions or deletions of one or more amino acids.
[0098] In some embodiments, a method disclosed herein may comprise but does not require using modified polymerases containing an external tag (e.g., an exogenous detectable label), which can be used to monitor the presence and interactions of the polymerase. In some embodiments, intrinsic signals from the polymerase can be used to monitor their presence and interactions. Thus, the provided methods can include monitoring the interaction of the polymerase, nucleotide and template nucleic acid through detection of an intrinsic signal from the polymerase. In some embodiments, the intrinsic signal is a light scattering signal.
For example, intrinsic signals include native fluorescence of certain amino acids such as tryptophan.
[0099] In some embodiments, a method disclosed herein may comprise using an unlabeled polymerase, and monitoring is performed in the absence of an exogenous detectable label associated with the polymerase. Some modified polymerases or naturally occurring polymerases, under specific reaction conditions, may incorporate only single nucleotides and may remain bound to the primer-template after the incorporation of the single nucleotide. [0100] In some embodiments, a method disclosed herein may comprise using a polymerase unlabeled with an exogenous detectable label (e.g., a fluorescent label). The label can be chemically linked to the structure of the polymerase by a covalent bond after the polymerase has been at least partially purified using protein isolation techniques. For example, the exogenous detectable label can be chemically linked to the polymerase using a free sulfhydryl or a free amine moiety of the polymerase. This can involve chemical linkage to the polymerase through the side chain of a cysteine residue, or through the free amino group of the N-terminus. In certain preferred embodiments, a fluorescent label attached to the polymerase is useful for locating the polymerase, as may be important for determining whether or not the polymerase has localized to a spot on an array corresponding to immobilized primed template nucleic acid. The fluorescent signal need not, and in some embodiments does not change absorption or emission characteristics as the result of binding any nucleotide. In some embodiments, the signal emitted by the labeled polymerase is maintained uniformly in the presence and absence of any nucleotide being investigated as a possible next correct nucleotide.
[0101] The term polymerase and its variants, as used herein, also refers to fusion proteins comprising at least two portions linked to each other, for example, where one portion comprises a peptide that can catalyze the polymerization of nucleotides into a nucleic acid strand is linked to another portion that comprises a second moiety, such as, a reporter enzyme or a processivity-modifying domain. For example, T7 DNA polymerase comprises a nucleic acid polymerizing domain and a thioredoxin binding domain, wherein thioredoxin binding enhances the processivity of the polymerase. Absent the thioredoxin binding, T7 DNA polymerase is a distributive polymerase with processivity of only one to a few bases. Although DNA polymerases differ in detail, they have a similar overall shape of a hand with specific regions referred to as the fingers, the palm, and the thumb; and a similar overall structural transition, comprising the movement of the thumb and/or finger domains, during the synthesis of nucleic acids.
[0102] DNA polymerases include, but are not limited to, bacterial DNA polymerases, eukaryotic DNA polymerases, archaeal DNA polymerases, viral DNA polymerases and phage DNA polymerases. Bacterial DNA polymerases include E. coli DNA polymerases I, II and III, IV and V, the Klenow fragment of E. coli DNA polymerase, Clostridium stercorarium (Csf) DNA polymerase, Clostridium thermocellum (Cth) DNA polymerase and Sulfolobus solfataricus (Sso) DNA polymerase. Eukaryotic DNA polymerases include DNA polymerases a, P, y, 5, e, q, , c, p, and K, as well as the Revl polymerase (terminal deoxycytidyl transferase) and terminal deoxynucleotidyl transferase (TdT). Viral DNA polymerases include T4 DNA polymerase, phi-29 DNA polymerase, GA-1, phi-29-like DNA polymerases, PZA DNA polymerase, phi- 15 DNA polymerase, Cpl DNA polymerase, Cp7 DNA polymerase, T7 DNA polymerase, and T4 polymerase. Other DNA polymerases include thermostable and/or thermophilic DNA polymerases such as DNA polymerases isolated from Thermus aquaticus (Taq) DNA polymerase, Thermus filiformis (Tfi) DNA polymerase, Thermococcus zilligi (Tzi) DNA polymerase, Thermus thermophilus (Tth) DNA polymerase, Thermus flavusu (TjT) DNA polymerase, Pyrococcus woesei (Pwo) DNA polymerase, Pyrococcus furiosus (Pf ) DNA polymerase and Turbo Pfu DNA polymerase, Thermococcus litoralis (TH) DNA polymerase, Pyrococcus sp. GB-D polymerase, Thermotoga maritima (Tma) DNA polymerase, Bacillus stearo thermophilus (Bst) DNA polymerase, Pyrococcus Kodakaraensis (KOD) DNA polymerase, Pfx DNA polymerase, Thermococcus sp. JDF-3 (JDF-3) DNA polymerase, Thermococcus gorgonarius (Tgo) DNA polymerase, Thermococcus acidophilium DNA polymerase; Sulfolobus acidocaldarius DNA polymerase; Thermococcus sp. go N-7 DNA polymerase; Pyrodictium occultum DNA polymerase; Methanococcus voltae DNA polymerase; Methanococcus thermoautotrophicum DNA polymerase; Methanococcus jannaschii DNA polymerase; Desulfurococcus strain TOK DNA polymerase (D. Tok Pol); Pyrococcus abyssi DNA polymerase; Pyrococcus horikoshii DNA polymerase; Pyrococcus islandicum DNA polymerase; Thermococcus fumicolans DNA polymerase; Aeropyrum pernix DNA polymerase; and the heterodimeric DNA polymerase DP1/DP2. Engineered and modified polymerases also are useful in connection with the disclosed techniques. For example, modified versions of the extremely thermophilic marine archaea Thermococcus species 9° N (e.g., Therminator DNA polymerase from New England BioEabs Inc.; Ipswich, Mass.) can be used. Still other useful DNA polymerases, including the 3PDX polymerase are disclosed in U.S. Patent No. 8,703,461, the disclosure of which is incorporated by reference in its entirety.
[0103] RNA polymerases include, but are not limited to, viral RNA polymerases such as T7 RNA polymerase, T3 polymerase, SP6 polymerase, and Kl l polymerase; Eukaryotic RNA polymerases such as RNA polymerase I, RNA polymerase II, RNA polymerase III, RNA polymerase IV, and RNA polymerase V; and Archaea RNA polymerase.
[0104] Reverse transcriptases include, but are not limited to, HIV-1 reverse transcriptase from human immunodeficiency virus type 1 (PDB 1HMV), HIV-2 reverse transcriptase from human immunodeficiency virus type 2, M-MLV reverse transcriptase from the Moloney murine leukemia virus, AMV reverse transcriptase from the avian myeloblastosis virus, and Telomerase reverse transcriptase that maintains the telomeres of eukaryotic chromosomes.
C. Sequencing Reactions
[0105] In some embodiments of a sequencing -by- synthesis (SBS) method disclosed herein, a first labeled nucleotide that has been incorporated is not deactivated (e.g., by removal and/or photobleaching of the label) prior to the introduction and/or incorporation of the next, second labeled nucleotide. The first and second labeled nucleotides can comprise the same base or different bases. The first and second labeled nucleotides can be introduced into a sequencing reaction mix simultaneously or at different time points in any order. Further, the first and second labeled nucleotides can be introduced by itself (e.g., in a suitable solvent such as water) or in a mixture with another sequencing reagent, such as one or more other labeled nucleotides and/or one or more unlabeled nucleotides. The first and second labeled nucleotides can also comprise the same base or different bases. In some embodiments, nucleotides that have not been incorporated at a residue corresponding to a base in the template nucleic acid (e.g., because the first labeled nucleotide has been incorporated at that residue) are not removed from the sequencing reaction mix prior to the introduction and/or incorporation of the second labeled nucleotide. In some embodiments, the first and second labeled nucleotides (and optionally labeled nucleotides for interrogating subsequent bases in the template) are provided in the same sequencing reaction mix, and the first, second, and optionally any subsequent labeled nucleotide(s) are incorporated sequentially in a continuous manner.
[0106] Thus, unlike existing SBS methods, some embodiments of the method disclosed herein use continuous introduction and/or incorporation of nucleotides (e.g., fluorescently labeled A, T, C, and/or G nucleotides) without the need of label deactivation and/or wash steps in between sequential incorporation events for a given template nucleic acid molecule
to be sequenced. Rather, in some embodiments, label deactivation (e.g., by cleaving and/or photobleaching the label) of a first incorporated nucleotide may occur stochastically throughout the continuous nucleotide incorporation process, for instance, prior to, during, or after the incorporation of a second, third, fourth, or a subsequent labeled nucleotide.
[0107] Nucleic acid sequencing reaction mixtures, or simply “reaction mixtures,” typically include reagents that are commonly present in polymerase based nucleic acid synthesis reactions. The reaction mixture can include other molecules including, but not limited to, enzymes. In some embodiments, the reaction mixture comprises any reagents or biomolecules generally present in a nucleic acid polymerization reaction. Reaction components may include, but are not limited to, salts, buffers, small molecules, detergents, crowding agents, metals, and ions. In some embodiments, properties of the reaction mixture may be manipulated, for example, electrically, magnetically, and/or with vibration.
[0108] The provided methods herein may further comprise but do not require one or more wash steps; a temperature change; a mechanical vibration; a pH change; or an optical stimulation that is not dye illumination or photobleaching. In some embodiments, the wash step comprises contacting the substrate and the nucleic acid molecule, the primer, and/or the polymerase with one of more buffers, detergents, protein denaturants, proteases, oxidizing agents, reducing agents, or other agents capable of crosslinking or releasing crosslinks, e.g., crosslinks within a polymerase or crosslinks between a polymerase and nucleic acid. Methods and compositions for nucleic acid sequencing are known, for example, as described in U.S. Patent Nos. 10,246,744 and 10,844,428, incorporated herein by reference in their entireties for all purposes.
[0109] Reaction mixture reagents can include, but are not limited to, enzymes (e.g., polymerase), dNTPs, template nucleic acids, primer nucleic acids, salts, buffers, small molecules, co-factors, metals, and ions. The ions may be catalytic ions, divalent catalytic ions, non-catalytic ions, non-covalent metal ions, or a combination thereof. The reaction mixture can include salts, such as NaCl, KC1, potassium acetate, ammonium acetate, potassium glutamate, or NH4CI or the like, that ionize in aqueous solution to yield monovalent cations. The reaction mixture can include a source of ions, such as Mg2+, Mn2+, Co2+, Cd2+, and/or Ba2+ ions. The reaction mixture can include tin, Ca2+, Zn2+, Cu2+, Co2+, Fe2+, and/or Ni2+, or other divalent non-catalytic metal cations. In some embodiments, the reaction mixture can include metal cations that may inhibit formation of phosphodiester bonds between the primed template nucleic acid molecule and the cognate nucleotide. In some embodiments, the metal cations can be used (e.g., at a suitable
concentration) to slow down but not completely inhibit or prevent nucleotide incorporation, thereby reducing multiple nucleotide incorporation events in a single detection window. [0110] In some embodiments, the sequencing reaction conditions comprise contacting the nucleic acid molecule and the primer with a buffer that regulates osmotic pressure. In some embodiments, the reaction mixture comprises a buffer that regulates osmotic pressure. In some embodiments, the buffer is a high salt buffer that includes a monovalent ion, such as a monovalent metal ion (e.g., potassium ion or sodium ion) at a concentration of from about 50 to about 1,500 mM. Salt concentrations in the range of from about 100 to about 1,500 mM, or from about 200 to 1,000 mM may also be used. In some embodiments, the buffer further comprises a source of glutamate ions (e.g., potassium glutamate). In some embodiments, the buffer comprises a stabilizing agent. In some embodiments, the stabilizing agent is a non- catalytic metal ion (e.g., a divalent non-catalytic metal ion). Non-catalytic metal ions useful in this context include, but are not limited to, calcium, strontium, scandium, titanium, vanadium, chromium, iron, cobalt, nickel, copper, zinc, gallium, germanium, arsenic, selenium, rhodium, europium, and/or terbium. In some embodiments, the non-catalytic metal ion is strontium, tin, or nickel. In some embodiments, the sequencing reaction mixture comprises strontium chloride or nickel chloride. In some embodiments, the stabilizing agent can be used (e.g., at a suitable concentration) to slow down but not completely inhibit or prevent nucleotide incorporation, thereby reducing multiple nucleotide incorporation events in a single detection window.
[0111] The buffer can include Tris, Tricine, HEPES, MOPS, ACES, MES, phosphate-based buffers, and acetate-based buffers. The reaction mixture can include chelating agents such as EDTA, EGTA, and the like. In some embodiments, the reaction mixture includes crosslinking reagents.
[0112] In some embodiments, the interaction between the polymerase and template nucleic acid may be manipulated by modulating sequencing reaction parameters such as ionic strength, pH, temperature, or any combination thereof, or by the addition of a destabilizing agent to the reaction. In some embodiments, the destabilizing agent can be used (e.g., at a suitable concentration) to slow down but not completely inhibit or prevent nucleotide incorporation, thereby reducing multiple nucleotide incorporation events in a single detection window.
[0113] In some embodiments, high salt (e.g., 50 to 1,500 mM) and/or pH changes are utilized to destabilize a complex between the polymerase and template nucleic acid. In some embodiments, the reaction conditions favor the stabilization of a complex among the
polymerase, the template nucleic acid, and a labeled nucleotide. By way of example, the pH of the reaction mixture can be adjusted from 4.0 to 10.0 to favor the stabilization of a complex among the polymerase, the template nucleic acid, and a labeled nucleotide. In some embodiments, the pH of the reaction mixture is from 4.0 to 6.0. In some embodiments, the pH of the reaction mixture is 6.0 to 10.0. In some embodiments, a suitable salt concentration and/or a suitable pH can be selected to slow down but not completely inhibit or prevent nucleotide incorporation, thereby reducing multiple nucleotide incorporation events in a single detection window.
[0114] In some embodiments, the reaction mixture comprises a competitive inhibitor, where the competitive inhibitor may reduce the occurrence of multiple incorporations events in a detection window. In one embodiment, the competitive inhibitor is a non-incorporable nucleotide. In an embodiment, the competitive inhibitor is an aminoglycoside. The competitive inhibitor is capable of replacing either the nucleotide or the catalytic metal ion in the active site, such that the competitive inhibitor occupies the active site preventing or slowing down a nucleotide incorporation. In some embodiments, both an incorporable nucleotide and a competitive inhibitor are introduced, such that the ratio of the incorporable nucleotide and the inhibitor can be adjusted to modulate the rate of incorporation of a single nucleotide at the 3 '-end of the primer. In some embodiments, the competitive inhibitor can be used (e.g., at a low concentration) to slow down but not completely inhibit or prevent nucleotide incorporation, thereby reducing multiple nucleotide incorporation events in a single detection window.
[0115] In some embodiments, the reaction mixture comprises at least one nucleotide molecule that is a non-incorporable nucleotide. In some embodiments, the reaction mixture comprises one or more nucleotide molecules incapable of incorporation into the primer of the primed template nucleic acid molecule. Such nucleotides incapable of incorporation include, for example, monophosphate nucleotides. For example, the nucleotide may contain modifications to the triphosphate group that make the nucleotide non-incorporable. Examples of non-incorporable nucleotides may be found in U.S. Pat. No. 7,482,120, which is incorporated by reference herein in its entirety. In some embodiments, the primer may not contain a free hydroxyl group at its 3 '-end, thereby rendering the primer incapable of incorporating any nucleotide, and, thus, making any nucleotide non-incorporable. In some embodiments, the primer may be processed such that it contains a free hydroxyl group at its 3'-end to allow nucleotide incorporation. In some embodiments, the non-incorporable nucleotide can be used (e.g., at a low concentration) to slow down but not completely inhibit
or prevent nucleotide incorporation, thereby reducing multiple nucleotide incorporation events in a single detection window.
[0116] In some embodiments, the reaction mixture comprises at least one nucleotide molecule that is incorporable but is incorporated at a slower rate compared to a corresponding naturally-occurring nucleoside triphosphate (e.g., NTP or dNTP). Such nucleotides incorporable at a slower rate may include, for example, diphosphate nucleotides. For example, the nucleotide may contain modifications to the triphosphate group that make the nucleotide incorporable at a slower rate. In some embodiments, the nucleotide incorporable at a slower rate can be used to slow down but not completely inhibit or prevent nucleotide incorporation, thereby reducing multiple nucleotide incorporation events in a single detection window.
[0117] In some embodiments, the reaction mixture comprises a polymerase inhibitor. In some embodiments, the polymerase inhibitor is a pyrophosphate analog. In some embodiments, the polymerase inhibitor is an allosteric inhibitor. In some embodiments, the polymerase inhibitor is a DNA or an RNA aptamer. In some embodiments, the polymerase inhibitor competes with a catalytic-ion binding site in the polymerase. In some embodiments, the polymerase inhibitor is a reverse transcriptase inhibitor. The polymerase inhibitor may be an HIV-1 reverse transcriptase inhibitor or an HIV-2 reverse transcriptase inhibitor. The HIV-1 reverse transcriptase inhibitor may be a (4/6-halogen/MeO/EtO-substituted benzo[d]thiazol-2-yl)thiazolidin-4-one. In some embodiments, the polymerase inhibitor can be used (e.g., at a low concentration) to slow down but not completely inhibit or prevent nucleotide incorporation, thereby reducing multiple nucleotide incorporation events in a single detection window.
[0118] In some embodiments, the contacting step is facilitated by the use of a chamber such as a flow cell. The methods and apparatus described herein may employ next generation sequencing technology (NGS), which allows massively parallel sequencing. In some embodiments, single DNA molecules are sequenced in a massively parallel fashion within a reaction chamber. A flow cell may be used but is not necessary. Flowing liquid reagents through the flow cell, which contains an interior solid support surface (e.g., a planar surface), conveniently permits reagent exchange. Immobilized to the interior surface of the flow cell is one or more primed template nucleic acids to be sequenced or interrogated using the procedures described herein. Typical flow cells will include microfluidic valving that permits delivery of liquid reagents (e.g., components of the “reaction mixtures” discussed herein) to
an entry port. Liquid reagents can be removed from the flow cell by exiting through an exit port.
[0119] In some embodiments, a reaction chamber disclosed herein can comprise a reagent wall, an imaging area, and optionally an outlet configured to remove molecules of one or more of the polymerase, the first detectably labeled nucleotide, the second detectably labeled nucleotide, and/or one or more other reagents from the imaging area. In some embodiments, the device may comprise one or more vents but no outlet or exit port for the reaction mixture. In some embodiments, a method disclosed herein does not comprise a step of removing liquid reagents through an outlet or exit port, e.g., from a reaction chamber such as a flow cell. [0120] The methods disclosed herein may but do not need to be used in combination with any NGS sequencing methods. The sequencing technologies of NGS include but are not limited to pyrosequencing, sequencing-by- synthesis with reversible dye terminators, sequencing by oligonucleotide probe ligation, and ion semiconductor sequencing. Nucleic acids such as DNA or RNA from individual samples can be sequenced individually (singleplex sequencing) or nucleic acids such as DNA or RNA from multiple samples can be pooled and sequenced as indexed genomic molecules (multiplex sequencing) on a single sequencing run, to generate up to several hundred million reads of sequences. Examples of sequencing technologies that can be used to obtain the sequence information according to the present method are further described here.
[0121] Some sequencing technologies are available commercially, such as the sequencing- by-synthesis platforms from 454 Life Sciences (Bradford, Conn.), Illumina/Solexa (Hayward, Calif.) and Helicos Biosciences (Cambridge, Mass.).
[0122] While the automated Sanger method is considered as a ‘first generation’ technology, Sanger sequencing including the automated Sanger sequencing, can also be employed in the methods described herein. Additional suitable sequencing methods include, but are not limited to nucleic acid imaging technologies, e.g., atomic force microscopy (AFM) or transmission electron microscopy (TEM).
[0123] In some embodiments, the disclosed methods may be used in combination with massively parallel sequencing of nucleic acid molecules using Illumina's sequencing-by- synthesis and reversible terminator-based sequencing chemistry. In some implementation, a method disclosed herein can use a flow cell having a glass slide with lanes.
[0124] After sequencing of nucleic acid molecules, sequence reads of predetermined length, e.g., at least about 15 bp, are localized by mapping (alignment) to a known reference sequence or genome (e.g., viral sequences or genomes). A number of computer algorithms
are available for aligning sequences, including without limitation BLAST, BLITZ, FASTA, BOWTIE, or ELAND (Illumina, Inc., San Diego, Calif., USA).
[0125] In some embodiments, the provided sequencing methods disclosed herein may regulate polymerase interaction with the nucleotides and template nucleic acid (as well as rate of nucleotide incorporation) in a manner that reveals the identity of the next base while controlling the chemical addition of a nucleotide. In some embodiments, the SBS reaction condition comprises a plurality of primed template nucleic acids, polymerases, nucleotides, or any combination thereof. In some embodiments, the plurality of nucleotides comprises 1, 2, 3, 4, or more types of different nucleotides, for example dATP, dTTP (or dUTP), dGTP, and dCTP.
[0126] In some embodiments, the method can further comprise contacting the nucleic acid molecule with the substrate to immobilize the nucleic acid molecule. In some embodiments, the nucleic acid molecule can be immobilized at a density of one molecule per at least about 250 nm2, at least about 200 nm2, at least about 150 nm2, at least about 100 nm2, at least about 90 nm2, at least about 80 nm2, at least about 70 nm2, at least about 60 nm2, at least about 50 nm2, at least about 40 nm2, at least about 30 nm2, at least about 20 nm2, at least about 10 nm2, at least about 5 nm2, or in between any two of the aforementioned values. Methods and compositions for arraying biomolecules on a substrate, e.g., as described in US 2005/0042649 (incorporated herein by reference in its entirety for all purposes), may be used in methods disclosed herein.
[0127] In some embodiments, a subset of nucleic acid molecules (e.g., nucleic acid strands to be sequenced) on the substrate may be active at one or more time points. In some embodiments, at any one time, a first subset of nucleic acid molecules on the substrate is active (e.g., allowing nucleotide incorporation into a sequencing primer using a singlestranded sequence as template) while a second subset of nucleic acid molecules on the substrate is inactive (e.g., not allowing nucleotide incorporation into a sequencing primer using a single-stranded sequence as template). In some embodiments, at one or more time points, a first subset of nucleic acid molecules on the substrate is activated (e.g., by a first set of polymerase and/or primer molecules) for nucleotide incorporation, while a second subset of nucleic acid molecules on the substrate is not activated (e.g., by the first set of polymerase and/or primer molecules), thus only signals associated with the first subset of nucleic acid molecules are detected. At one or more other time points, the second subset of nucleic acid molecules on the substrate is activated (e.g., by a second set of polymerase and/or primer molecules) for nucleotide incorporation, while the first subset of nucleic acid molecules on
the substrate is not activated (e.g., by the second set of polymerase and/or primer molecules), thus only signals associated with the second subset of nucleic acid molecules are detected. In some embodiments, the first and second sets of polymerase and/or primer molecules can be introduced at different time points, e.g., in sequential cycles with optional washing steps between cycles (e.g., to remove a set of polymerase and/or primer molecules for SBS of a first subset of strands before introducing the next set of polymerase and/or primer molecules for SBS of a second subset of strands).
[0128] In some embodiments, the substrate can comprise a bead, a planar substrate, a solid surface, a flow cell, a semiconductor chip, a well, a pillar, a chamber, a channel, a through hole, a nanopore, or any combination thereof. In some embodiments, the substrate can comprise a microwell, a micropillar, a microchamber, a microchannel, or any combination thereof.
V. Compositions, Kits, and Applications
[0129] Also provided herein are compositions and kits comprising one or more of the primers, nucleic acid molecules, substrates, nucleotides including detectably labeled nucleotides, polymerases, and reagents for performing the methods provided herein, for example reagents required for one or more steps comprising hybridization, ligation, amplification, detection, sequencing, and/or sample preparation as described herein, for example, in Section IV.
[0130] The various components of the kit may be present in separate containers or certain compatible components may be pre-combined into a single container. In some embodiments, the kits further contain instructions for using the components of the kit to practice the provided methods.
[0131] In some embodiments, the kits can contain reagents and/or consumables required for performing one or more steps of the provided methods. In some embodiments, the kits contain reagents for sample processing, such as nucleic acid extraction, isolation, and/or purification, e.g., RNA extraction, isolation, and/or purification. In some embodiments, the kits contain reagents, such as enzymes and buffers for ligation and/or amplification, such as ligases and/or polymerases. In some embodiments, the kits contain reagents, such as enzymes and buffers for primer extension and/or nucleic acid sequencing, such as polymerases and/or transcriptases. In some aspects, the kit can also comprise any of the reagents described herein, e.g., buffer components for tuning the rate of nucleotide
incorporation and/or for tuning the rate of signal deactivation (e.g., by photobleaching). In some embodiments, the kits contain reagents for signal detection during sequencing, such as detectable labels and detectably labeled molecules. In some embodiments, the kits optionally contain other components, for example nucleic acid primers, enzymes and reagents, buffers, nucleotides, modified nucleotides, and reagents for additional assays.
[0132] In some aspects, the provided embodiments can be applied in analyzing nucleic acid sequences, such as DNA and/or RNA sequencing. In some aspects, the embodiments can be applied in an imaging or detection method for multiplexed nucleic acid analysis. In some aspects, the provided embodiments can be used to identify or detect regions of interest in target nucleic acids, such as viral DNA or RNA. In some embodiments, the region of interest comprises one or more nucleotide residues, such as a single-nucleotide polymorphism (SNP), a single-nucleotide variant (SNV), substitutions such as a single-nucleotide substitution, mutations such as a point mutation, insertions such as a single-nucleotide insertion, deletions such as a single-nucleotide deletion, translocations, inversions, duplications, and/or other sequences of interest.
[0133] In some aspects, the embodiments can be applied in investigative and/or diagnostic applications, for example, for characterization or assessment of a sample from a subject. Applications of the provided method can comprise biomedical research and clinical diagnostics. For example, in biomedical research, applications comprise, but are not limited to, genetic and genomic analysis for biological investigation or drug screening. In clinical diagnostics, applications comprise, but are not limited to, detecting gene markers such as disease, immune responses, bacterial or viral DNA/RNA for patient samples, loss of genetic heterozygosity, the presence of gene alleles indicative of a predisposition towards disease or good health, likelihood of responsiveness to therapy, or in personalized medicine or ancestry. [0134] The present disclosure is not intended to be limited in scope to the particular disclosed embodiments, which are provided, for example, to illustrate various aspects of the present disclosure. Various modifications to the compositions and methods described will become apparent from the description and teachings herein. Such variations may be practiced without departing from the true scope and spirit of the disclosure and are intended to fall within the scope of the present disclosure.
Claims
1. A method for determining a sequence of a polynucleotide template, comprising: a) contacting the polynucleotide template with a first pool of nucleotides comprising: i) a nucleotide of a first base labeled with a first label configured to be detected at a first wavelength, ii) a nucleotide of a second base different from the first base, which nucleotide is labeled with a second label configured to be detected at a second wavelength which is different from the first wavelength, and iii) one or more nucleotides of a third base which is different from the first and second bases, wherein the one or more nucleotides comprise a nucleotide labeled with a third label configured to be detected at the first wavelength and a nucleotide labeled with a fourth label configured to be detected at the second wavelength; b) allowing binding and optional incorporation of a nucleotide of the first pool templated on the polynucleotide template, wherein the bound and optionally incorporated nucleotide is complementary to a nucleotide residue at a first nucleotide position in the polynucleotide template; c) imaging the polynucleotide template to detect a signal (or record an absence of the signal) associated with the bound and optionally incorporated nucleotide of the first pool, and wherein the imaging comprises: i) using a first power to image the polynucleotide template at the first wavelength and not at the second wavelength to acquire a first image, wherein a signal associated with the first label and/or a signal associated with the third label is detected or an absence of the signal is recorded, ii) using a second power to image the polynucleotide template at the second wavelength and not at the first wavelength to acquire a second image, wherein a signal associated with the second label and/or a signal associated with the fourth label is detected or an absence of the signal is recorded, and iii) using a third power to image the polynucleotide template at the first and second wavelengths to acquire a third image, wherein the third power is less than the first power and/or the second power, wherein a signal associated with the first label and/or a signal associated with the third label is detected at a signal intensity lower than that detected in c)i) or an absence of the signal is recorded, and wherein a signal associated with the second label and/or a signal associated with the fourth label is
detected at a signal intensity lower than that detected in c)ii) or an absence of the signal is recorded, wherein a signal codeword comprising signal codes each corresponding to the intensity of the signal detected in c)i) through c)iii) (or recorded absence of the signal) is generated, wherein the signal codeword corresponds to the identity of the base in the bound and optionally incorporated nucleotide, thereby identifying the nucleotide residue at the first nucleotide position in the polynucleotide template.
2. The method of claim 1, wherein the first label and the third label are different labels.
3. The method of claim 1, wherein the first label and the third label are the same label.
4. The method of any one of claims 1-3, wherein the second label and the fourth label are different labels.
5. The method of any one of claims 1-3, wherein the second label and the fourth label are the same label.
6. The method of any one of claims 1-5, wherein the one or more nucleotides of the third base comprise a nucleotide labeled with the third label and the fourth label.
7. The method of any one of claims 1-6, wherein the one or more nucleotides of the third base comprise a first nucleotide species labeled with the third label and a second nucleotide species labeled with the fourth label.
8. The method of any one of claims 1-7, wherein the third label and the fourth label are different labels.
9. The method of any one of claims 1-7, wherein the third label and the fourth label are the same label.
10. The method of any one of claims 1-9, wherein the first and second wavelengths correspond to different channels of a fluorescence microscope.
11. The method of any one of claims 1-10, wherein in c)i), the signal associated with the first label or the signal associated with the third label is detected at a first signal intensity, and in c)ii), the signal associated with the second label or the signal associated with the fourth label is detected at a second signal intensity.
12. The method of claim 11, wherein the first and second signal intensities differ by no more than 25%, no more than 20%, no more than 15%, no more than 10%, no more than 5%, or no more than 1%.
13. The method of claim 11, wherein the first and second signal intensities are the same.
14. The method of any one of claims 11-13, wherein in c)iii): the signal associated with the first label or the signal associated with the third label is detected at a signal intensity lower than the first signal intensity; and the signal associated with the second label or the signal associated with the fourth label is detected at a signal intensity lower than the second signal intensity.
15. The method of claim 14, wherein: the signal associated with the first label or the signal associated with the third label is detected at a signal intensity between about 25% and about 75% of the first signal intensity; and the signal associated with the second label or the signal associated with the fourth label is detected at a signal intensity between about 25% and about 75% of the second signal intensity.
16. The method of claim 14, wherein: the signal associated with the first label or the signal associated with the third label is detected at a signal intensity about 50% of the first signal intensity; and the signal associated with the second label or the signal associated with the fourth label is detected at a signal intensity about 50% of the second signal intensity.
17. The method of any one of claims 1-16, wherein the first pool of nucleotides comprises a nucleotide of a fourth base which is different from the first, second, and third bases.
18. The method of claim 17, wherein the nucleotide of the fourth base is not configured to be detected at the first wavelength and/or the second wavelength.
19. The method of claim 17 or claim 18, wherein the nucleotide of the fourth base is not detectably labeled.
20. The method of claim 17 or claim 18, wherein the nucleotide of the fourth base is configured to be detected at a third wavelength different from the first and second wavelengths.
21. The method of any one of claims 1-20, wherein the polynucleotide template is one of a plurality of polynucleotide templates having different template sequences.
22. The method of claim 21, wherein the sequences of the plurality of polynucleotide templates are determined.
23. The method of claim 21 or claim 22, wherein the polynucleotide template is immobilized on a substrate.
24. The method of claim 23, wherein the polynucleotide template is in one or more polynucleotides immobilized on the substrate.
25. The method of claim 24, wherein the polynucleotide template is in a rolling circle amplification product immobilized on the substrate, wherein the rolling circle amplification product comprises multiple copies of the template sequence in the polynucleotide template.
26. The method of claim 24, wherein the polynucleotide template is in a cluster of multiple polynucleotides immobilized on the substrate, wherein each polynucleotide in the cluster comprises a copy of the template sequence in the polynucleotide template.
27. The method of any one of claims 21-26, wherein in c)i), the plurality of polynucleotide templates are imaged, and the first image comprises the signal associated with the first label at the location of a first polynucleotide template, a recorded absence of a signal at the location of a second polynucleotide template, and the signal associated with the third label at the location of a third polynucleotide template.
28. The method of any one of claims 21-27, wherein in c)ii), the plurality of polynucleotide templates are imaged and the second image comprises a recorded absence of a signal at the location of the first polynucleotide template, the signal associated with the second label at the location of the second polynucleotide template, and the signal associated with the fourth label at the location of a third polynucleotide template.
29. The method of any one of claims 21-28, wherein in c)iii), the plurality of polynucleotide templates are imaged and the third image comprises: the signal associated with the first label at the location of the first polynucleotide template, the signal associated with the second label at the location of the second polynucleotide template, and
the signal associated with the third label and the signal associated with the fourth label at the location of a third polynucleotide template.
30. The method of any one of claims 1-29, wherein in c)i) and c)ii), the first and second images are acquired using the same power or powers that differ by no more than 25%, no more than 20%, no more than 15%, no more than 10%, no more than 5%, or no more than 1%, and the c)iii), the third image is acquired using a power that is between about 25% and about 75% of the power(s) used in c)i) and/or c)ii).
31. A method for determining sequences of a plurality of polynucleotide templates having different template sequences, comprising: a) contacting a substrate having clusters of polynucleotides immobilized thereon with a first pool of nucleotides and with oligonucleotide primers, in any order, whereby each cluster comprises: i) multiple copies of one of the different template sequences, and ii) an oligonucleotide primer of the oligonucleotide primers annealed to a primer binding sequence for extension of the oligonucleotide primer templated on a copy of the template sequence, wherein the first pool of nucleotides comprises: i) nucleotides of a first base which are labeled with a label configured to be detected at a first wavelength, ii) nucleotides of a second base which are labeled with a label configured to be detected at a second wavelength which is different from the first wavelength, and iii) nucleotides of a third base comprising nucleotides labeled with a label configured to be detected at the first wavelength and nucleotides labeled with a label configured to be detected at the second wavelength; b) allowing binding and optional incorporation of the nucleotides of the first pool templated on the multiple copies of the template sequence at each cluster, wherein the bound and optionally incorporated nucleotides are complementary to nucleotides at a first nucleotide position in the template sequences; c) imaging the substrate to detect signals or absence thereof at the clusters, wherein the signals are associated with the bound and optionally incorporated nucleotides of the first pool, and wherein the imaging comprises: i) imaging the substrate at the first wavelength and not at the second wavelength to acquire a first image,
ii) imaging the substrate at the second wavelength and not at the first wavelength to acquire a second image, and iii) imaging the substrate at the first and second wavelengths simultaneously to acquire a third image, wherein in the third image, the signal intensity associated with the third base is no less than the signal intensity associated with the first base or the signal intensity associated with the second base; wherein for each cluster, a signal codeword comprising signal codes corresponding to the signals or absence thereof detected in c)i) through c)iii) is generated, wherein different signal codewords correspond to different bases, thereby determining the bases at the first nucleotide position in the plurality of polynucleotide templates.
32. The method of claim 31, wherein the first pool of nucleotides comprises nucleotides of a fourth base which are unlabeled and/or are undetected in the imaging.
33. The method of claim 31, wherein the first pool of nucleotides does not comprise nucleotides of a fourth base.
34. The method of any one of claims 31-33, wherein the nucleotides of the third base comprise a nucleotide labeled with both the label configured to be detected at the first wavelength and the label configured to be detected at the second wavelength.
35. The method of any one of claims 31-34, wherein the nucleotides of the third base comprise a first nucleotide labeled with only the label configured to be detected at the first wavelength and a second nucleotide labeled with only the label configured to be detected at the second wavelength.
36. The method of any one of claims 31-35, wherein in each of the nucleotides of the third base, the ratio of the labels configured to be detected at the first or second wavelength and the nucleotide is independently 1:1, 2:1, 3:1, 4:1, 5:1, or greater.
37. The method of claim 36, wherein the labels are conjugated to the nucleotide via a binding pair, optionally wherein the binding pair comprises biotin, DNP, DIG, or desthiobiotin, and a corresponding binding partner thereof.
38. The method of any one of claims 31-37, wherein in c)i), the substrate is illuminated at the first wavelength at a first full power to acquire the first image.
39. The method of any one of claims 31-38, wherein in c)ii), the substrate is illuminated at the second wavelength at a second full power to acquire the second image.
40. The method of any one of claims 31-39, wherein in c)iii), the substrate is illuminated: at the first wavelength at less than the first full power, and at the second wavelength at less than the second full power, thereby acquiring the third image.
41. The method of claim 40, wherein in c)iii), the substrate is illuminated at the first wavelength at about 50% of the first full power, and at the second wavelength at about 50% of the second full power to acquire the third image.
42. The method of any one of claims 39-41, wherein the first and second full powers are of the same constant current in LED illumination.
43. The method of any one of claims 39-41, wherein the first and second full powers are of different constant currents in LED illumination.
44. The method of any one of claims 31-43, wherein in the third image, the signal intensity associated with the third base is about twice the signal intensity associated with the first base or the signal intensity associated with the second base.
45. The method of any one of claims 31-44, wherein the signal codeword comprises signal codes corresponding to signal intensities associated with the nucleotides of the first base, the nucleotides of the second base, or the nucleotides of the third base.
46. The method of any one of claims 31-44, wherein the signal codeword does not comprise signal codes corresponding to signal intensities associated with the nucleotides of the first base, the nucleotides of the second base, or the nucleotides of the third base.
47. The method of any one of claims 31-46, comprising: d) contacting the substrate with a second pool of nucleotides comprising unlabeled nucleotides of the first base, unlabeled nucleotides of the second base, unlabeled nucleotides of the third base, and/or unlabeled nucleotides of a fourth base; and e) allowing binding and optional incorporation of the nucleotides of the second pool templated on the multiple copies of the template sequence at each cluster, wherein the bound and optionally incorporated nucleotides are complementary to nucleotides at the first nucleotide position in the template sequences.
48. The method of any one of claims 31-46, comprising: d) contacting the substrate with a second pool of nucleotides comprising labeled nucleotides of a fourth base; e) allowing binding and optional incorporation of the nucleotides of the second pool templated on the multiple copies of the template sequence at each cluster, wherein the bound and optionally incorporated nucleotides are complementary to nucleotides at the first nucleotide position in the template sequences; and f) imaging the substrate to detect signals or absence thereof at the clusters, wherein the signals are associated with the bound and optionally incorporated nucleotides of the second pool.
49. The method of any one of claims 31-48, wherein the clusters are disposed at spatially discrete sites on the substrate.
50. The method of any one of claims 31-49, wherein the average distance between adjacent clusters on the substrate is between about 0.3 pm and about 10 pm.
51. The method of any one of claims 31-50, wherein the density of clusters on the substrate is between about 100 k/mm2 and about 5000 k/mm2.
52. The method of any one of claims 31-51, wherein signals from adjacent clusters on the substrate are optically resolvable.
53. The method of any one of claims 31-52, wherein the clusters comprise a random array of clusters on the substrate.
54. The method of any one of claims 31-53, wherein the clusters comprise an ordered array of clusters on the substrate.
55. The method of any one of claims 31-54, wherein one or more of the clusters are formed via bridge amplification.
56. The method of claim 55, wherein the one or more clusters each comprises multiple molecules each comprising: i) one or more adapter sequences and/or one or more primer binding sequences and ii) the same template sequence or complement thereof.
57. The method of any one of claims 31-54, wherein one or more of the clusters are formed via rolling circle amplification (RCA).
58. The method of claim 57, wherein the one or more clusters each comprises one or more RCA products (RCPs) each comprising: i) one or more adapter sequences and/or one or more primer binding sequences and ii) the same template sequence.
59. The method of any one of claims 31-58, wherein each cluster comprises: i) one or more polynucleotides that are 5’ immobilized on the substrate and 3’ blocked; ii) one or more polynucleotides that are 3’ immobilized on the substrate; and/or iii) one or more nucleic acid concatemers immobilized on the substrate.
60. The method of any one of claims 31-59, wherein the first pool of nucleotides comprises nucleotides of any three of A, T/U, C, and G.
61. The method of any one of claims 31-60, wherein the first pool of nucleotides comprises nucleotides of A, T/U, and C but no nucleotides of G.
62. The method of any one of claims 31-61, wherein the first and/or second pool of nucleotides comprise one or more dNTP monomers and/or one or more dNTP multimers each comprising multiple dNTP molecules conjugated to a scaffold, optionally wherein the scaffold is conjugated to one or more fluorescent moieties and optionally wherein the scaffold comprises a streptavidin.
63. The method of any one of claims 31-62, wherein the first and/or second pool of nucleotides each comprises a reversible terminator, optionally wherein the reversible terminator is a 3’-O-blocked reversible terminator or a 3 ’-unblocked reversible terminator, and the method comprises allowing nucleotide incorporation after imaging the substrate.
64. The method of any one of claims 31-63, comprising removing unbound or unincorporated nucleotides from the substrate prior to imaging the substrate in c).
65. The method of any one of claims 31-64, comprising determining the bases at a second nucleotide position in the plurality of polynucleotide templates, wherein the second nucleotide position is 5’ to the first nucleotide position in the plurality of polynucleotide templates.
66. A kit, comprising:
i) a first plurality of nucleotide molecules of a first base, comprising nucleotides labeled or configured to be labeled with a first label detectable at a first wavelength, ii) a second plurality of nucleotide molecules of a second base which is different from the first base, comprising nucleotides labeled or configured to be labeled with a second label detectable at a second wavelength which is different from the first wavelength, and iii) a third plurality of nucleotide molecules of a third base which is different from the first and second bases, comprising a nucleotide labeled or configured to be labeled with a third label detectable at the first wavelength and a nucleotide labeled or configured to be labeled with a fourth label detectable at the second wavelength, wherein the first plurality, the second plurality, and/or the third plurality each independently comprises between about 5% and about 50% nucleotide molecules that are not detectably labeled; and iv) optionally instructions for using the first plurality, the second plurality, and/or the third plurality of nucleotide molecules according to the method in any one of claims 1-65.
67. The kit of claim 66, wherein the first plurality, the second plurality, and/or the third plurality each independently comprises about 10% nucleotide molecules that are not detectably labeled.
68. The kit of claim 66 or claim 67, comprising: iv) a fourth plurality of nucleotide molecules of a fourth base which is different from the first, second, and third bases, optionally wherein the fourth plurality of nucleotide molecules are not detectably labeled.
69. The kit of any one of claims 66-68, wherein the first, second, third, and/or fourth plurality each independently comprises one or more reversibly terminated nucleotide molecules, and/or one or more complexes comprising nucleotide molecules conjugated to a scaffold such that the nucleotide molecules are not incorporable and the one or more complexes are labeled with detectable labels.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202363459214P | 2023-04-13 | 2023-04-13 | |
US63/459,214 | 2023-04-13 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2024216159A1 true WO2024216159A1 (en) | 2024-10-17 |
Family
ID=91029985
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2024/024437 WO2024216159A1 (en) | 2023-04-13 | 2024-04-12 | Methods and compositions for nucleic acid sequencing using labeled nucleotides |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2024216159A1 (en) |
Citations (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4683195A (en) | 1986-01-30 | 1987-07-28 | Cetus Corporation | Process for amplifying, detecting, and/or-cloning nucleic acid sequences |
US4683202A (en) | 1985-03-28 | 1987-07-28 | Cetus Corporation | Process for amplifying nucleic acid sequences |
US4800159A (en) | 1986-02-07 | 1989-01-24 | Cetus Corporation | Process for amplifying, detecting, and/or cloning nucleic acid sequences |
US4965188A (en) | 1986-08-22 | 1990-10-23 | Cetus Corporation | Process for amplifying, detecting, and/or cloning nucleic acid sequences using a thermostable enzyme |
US5512462A (en) | 1994-02-25 | 1996-04-30 | Hoffmann-La Roche Inc. | Methods and reagents for the polymerase chain reaction amplification of long DNA sequences |
US20050042649A1 (en) | 1998-07-30 | 2005-02-24 | Shankar Balasubramanian | Arrayed biomolecules and their use in sequencing |
US7482120B2 (en) | 2005-01-28 | 2009-01-27 | Helicos Biosciences Corporation | Methods and compositions for improving fidelity in a nucleic acid synthesis reaction |
US7544794B1 (en) | 2005-03-11 | 2009-06-09 | Steven Albert Benner | Method for sequencing DNA and RNA by synthesis |
US7956171B2 (en) | 2007-05-18 | 2011-06-07 | Helicos Biosciences Corp. | Nucleotide analogs |
US8034923B1 (en) | 2009-03-27 | 2011-10-11 | Steven Albert Benner | Reagents for reversibly terminating primer extension |
US8071755B2 (en) | 2004-05-25 | 2011-12-06 | Helicos Biosciences Corporation | Nucleotide analogs |
WO2013044018A1 (en) * | 2011-09-23 | 2013-03-28 | Illumina, Inc. | Methods and compositions for nucleic acid sequencing |
US8703461B2 (en) | 2009-06-05 | 2014-04-22 | Life Technologies Corporation | Mutant RB69 DNA polymerase |
US8808989B1 (en) | 2013-04-02 | 2014-08-19 | Molecular Assemblies, Inc. | Methods and apparatus for synthesizing nucleic acids |
US9399798B2 (en) | 2011-09-13 | 2016-07-26 | Lasergen, Inc. | 3′-OH unblocked, fast photocleavable terminating nucleotides and methods for nucleic acid sequencing |
WO2017087823A1 (en) * | 2015-11-18 | 2017-05-26 | Mir Kalim U | Super-resolution sequencing |
WO2018165099A1 (en) * | 2017-03-07 | 2018-09-13 | Illumina, Inc. | Single light source, two-optical channel sequencing |
US10246744B2 (en) | 2016-08-15 | 2019-04-02 | Omniome, Inc. | Method and system for sequencing nucleic acids |
WO2020193765A1 (en) * | 2019-03-28 | 2020-10-01 | Illumina Cambridge Limited | Methods and compositions for nucleic acid sequencing using photoswitchable labels |
US10844428B2 (en) | 2015-04-28 | 2020-11-24 | Illumina, Inc. | Error suppression in sequenced DNA fragments using redundant reads with unique molecular indices (UMIS) |
WO2022129439A1 (en) * | 2020-12-17 | 2022-06-23 | Illumina Cambridge Limited | Methods, systems and compositions for nucleic acid sequencing |
WO2022136402A1 (en) * | 2020-12-22 | 2022-06-30 | Illumina Cambridge Limited | Methods and compositions for nucleic acid sequencing |
-
2024
- 2024-04-12 WO PCT/US2024/024437 patent/WO2024216159A1/en unknown
Patent Citations (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4683202A (en) | 1985-03-28 | 1987-07-28 | Cetus Corporation | Process for amplifying nucleic acid sequences |
US4683202B1 (en) | 1985-03-28 | 1990-11-27 | Cetus Corp | |
US4683195A (en) | 1986-01-30 | 1987-07-28 | Cetus Corporation | Process for amplifying, detecting, and/or-cloning nucleic acid sequences |
US4683195B1 (en) | 1986-01-30 | 1990-11-27 | Cetus Corp | |
US4800159A (en) | 1986-02-07 | 1989-01-24 | Cetus Corporation | Process for amplifying, detecting, and/or cloning nucleic acid sequences |
US4965188A (en) | 1986-08-22 | 1990-10-23 | Cetus Corporation | Process for amplifying, detecting, and/or cloning nucleic acid sequences using a thermostable enzyme |
US5512462A (en) | 1994-02-25 | 1996-04-30 | Hoffmann-La Roche Inc. | Methods and reagents for the polymerase chain reaction amplification of long DNA sequences |
US20050042649A1 (en) | 1998-07-30 | 2005-02-24 | Shankar Balasubramanian | Arrayed biomolecules and their use in sequencing |
US8071755B2 (en) | 2004-05-25 | 2011-12-06 | Helicos Biosciences Corporation | Nucleotide analogs |
US7482120B2 (en) | 2005-01-28 | 2009-01-27 | Helicos Biosciences Corporation | Methods and compositions for improving fidelity in a nucleic acid synthesis reaction |
US7544794B1 (en) | 2005-03-11 | 2009-06-09 | Steven Albert Benner | Method for sequencing DNA and RNA by synthesis |
US7956171B2 (en) | 2007-05-18 | 2011-06-07 | Helicos Biosciences Corp. | Nucleotide analogs |
US8034923B1 (en) | 2009-03-27 | 2011-10-11 | Steven Albert Benner | Reagents for reversibly terminating primer extension |
US8703461B2 (en) | 2009-06-05 | 2014-04-22 | Life Technologies Corporation | Mutant RB69 DNA polymerase |
US9399798B2 (en) | 2011-09-13 | 2016-07-26 | Lasergen, Inc. | 3′-OH unblocked, fast photocleavable terminating nucleotides and methods for nucleic acid sequencing |
WO2013044018A1 (en) * | 2011-09-23 | 2013-03-28 | Illumina, Inc. | Methods and compositions for nucleic acid sequencing |
US8808989B1 (en) | 2013-04-02 | 2014-08-19 | Molecular Assemblies, Inc. | Methods and apparatus for synthesizing nucleic acids |
US10844428B2 (en) | 2015-04-28 | 2020-11-24 | Illumina, Inc. | Error suppression in sequenced DNA fragments using redundant reads with unique molecular indices (UMIS) |
WO2017087823A1 (en) * | 2015-11-18 | 2017-05-26 | Mir Kalim U | Super-resolution sequencing |
US10246744B2 (en) | 2016-08-15 | 2019-04-02 | Omniome, Inc. | Method and system for sequencing nucleic acids |
WO2018165099A1 (en) * | 2017-03-07 | 2018-09-13 | Illumina, Inc. | Single light source, two-optical channel sequencing |
WO2020193765A1 (en) * | 2019-03-28 | 2020-10-01 | Illumina Cambridge Limited | Methods and compositions for nucleic acid sequencing using photoswitchable labels |
WO2022129439A1 (en) * | 2020-12-17 | 2022-06-23 | Illumina Cambridge Limited | Methods, systems and compositions for nucleic acid sequencing |
WO2022136402A1 (en) * | 2020-12-22 | 2022-06-30 | Illumina Cambridge Limited | Methods and compositions for nucleic acid sequencing |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP3458597B1 (en) | Quantitative real time pcr amplification using an electrowetting-based device | |
US6818420B2 (en) | Methods of using FET labeled oligonucleotides that include a 3′-5′ exonuclease resistant quencher domain and compositions for practicing the same | |
US10428363B2 (en) | Amplification methods to minimise sequence specific bias | |
EP1358350B1 (en) | Polynucleotide sequence assay | |
US20220056533A1 (en) | Detection of target nucleic acid and variants | |
US20080009420A1 (en) | Isothermal methods for creating clonal single molecule arrays | |
JP2018514230A (en) | Amplification using primers with limited nucleotide composition | |
Wittwer et al. | Nucleic acid techniques | |
US10358673B2 (en) | Method of amplifying nucleic acid sequences | |
JP2020501565A (en) | Modified multiplex and multi-step amplification reactions and reagents therefor | |
JP3949378B2 (en) | Method for determining polynucleotide sequence variation | |
EP1615942B1 (en) | Polymerase inhibitor and method of using same | |
WO2023154712A1 (en) | Methods, compositions, and systems for long read single molecule sequencing | |
WO2022271701A2 (en) | Methods and compositions for nucleic acid sequencing | |
WO2024216159A1 (en) | Methods and compositions for nucleic acid sequencing using labeled nucleotides | |
EP3601611B1 (en) | Polynucleotide adapters and methods of use thereof | |
WO2024216163A1 (en) | Methods and compositions for nucleic acid sequencing using predominantly unlabeled nucleotides | |
WO2023245203A1 (en) | Methods for single cell sequencing and error rate reduction | |
WO2023035110A1 (en) | Method for analyzing sequence of target polynucleotide | |
US20230250470A1 (en) | Amplicon comprehensive enrichment | |
US20240229130A9 (en) | Methods and compositions for tracking barcodes in partitions | |
US20240376525A1 (en) | Use of ethylene carbonate in nucleic acid sequencing methods | |
US20210017589A1 (en) | Methods for amplification of nucleic acids with endonuclease-mediated shifting equilibrium (em-seq) | |
AU2023354390A1 (en) | Methods of modulating clustering kinetics |