WO2024134502A1 - Engineered double-strand rna ligases and uses thereof - Google Patents
Engineered double-strand rna ligases and uses thereof Download PDFInfo
- Publication number
- WO2024134502A1 WO2024134502A1 PCT/IB2023/062949 IB2023062949W WO2024134502A1 WO 2024134502 A1 WO2024134502 A1 WO 2024134502A1 IB 2023062949 W IB2023062949 W IB 2023062949W WO 2024134502 A1 WO2024134502 A1 WO 2024134502A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- amino acid
- seq
- acid sequence
- engineered
- oligonucleotide
- Prior art date
Links
- 108091032973 (ribonucleotides)n+m Proteins 0.000 title claims abstract description 513
- 108090000364 Ligases Proteins 0.000 title claims abstract description 498
- 102000003960 Ligases Human genes 0.000 title claims abstract description 496
- 102000040650 (ribonucleotides)n+m Human genes 0.000 claims abstract description 512
- 108091034117 Oligonucleotide Proteins 0.000 claims abstract description 461
- 239000012634 fragment Substances 0.000 claims abstract description 191
- 238000000034 method Methods 0.000 claims abstract description 130
- 238000004519 manufacturing process Methods 0.000 claims abstract description 20
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 548
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 515
- 229920001184 polypeptide Polymers 0.000 claims description 506
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 297
- 125000003729 nucleotide group Chemical group 0.000 claims description 128
- 125000000539 amino acid group Chemical group 0.000 claims description 118
- 238000006243 chemical reaction Methods 0.000 claims description 104
- 230000000694 effects Effects 0.000 claims description 104
- 239000002773 nucleotide Substances 0.000 claims description 89
- 239000003446 ligand Substances 0.000 claims description 82
- 102000040430 polynucleotide Human genes 0.000 claims description 74
- 108091033319 polynucleotide Proteins 0.000 claims description 74
- 239000002157 polynucleotide Substances 0.000 claims description 74
- 150000007523 nucleic acids Chemical group 0.000 claims description 47
- 108020000161 polyphosphate kinase Proteins 0.000 claims description 42
- 102000039446 nucleic acids Human genes 0.000 claims description 41
- 108020004707 nucleic acids Proteins 0.000 claims description 41
- 229920000388 Polyphosphate Polymers 0.000 claims description 33
- 229910052799 carbon Inorganic materials 0.000 claims description 33
- 239000001205 polyphosphate Substances 0.000 claims description 33
- 235000011176 polyphosphates Nutrition 0.000 claims description 33
- 238000000746 purification Methods 0.000 claims description 31
- 150000001768 cations Chemical class 0.000 claims description 28
- 229910052717 sulfur Inorganic materials 0.000 claims description 26
- 239000013604 expression vector Substances 0.000 claims description 22
- 108091093094 Glycol nucleic acid Proteins 0.000 claims description 21
- 239000000203 mixture Substances 0.000 claims description 20
- 239000000284 extract Substances 0.000 claims description 19
- OVRNDRQMDRJTHS-KEWYIRBNSA-N N-acetyl-D-galactosamine Chemical class CC(=O)N[C@H]1C(O)O[C@H](CO)[C@H](O)[C@@H]1O OVRNDRQMDRJTHS-KEWYIRBNSA-N 0.000 claims description 16
- MBLBDJOUHNCFQT-UHFFFAOYSA-N N-acetyl-D-galactosamine Natural products CC(=O)NC(C=O)C(O)C(O)C(O)CO MBLBDJOUHNCFQT-UHFFFAOYSA-N 0.000 claims description 16
- 150000003839 salts Chemical class 0.000 claims description 16
- 229910052720 vanadium Inorganic materials 0.000 claims description 16
- 229910052727 yttrium Inorganic materials 0.000 claims description 16
- 238000007385 chemical modification Methods 0.000 claims description 14
- ZTWTYVWXUKTLCP-UHFFFAOYSA-L ethenyl-dioxido-oxo-$l^{5}-phosphane Chemical compound [O-]P([O-])(=O)C=C ZTWTYVWXUKTLCP-UHFFFAOYSA-L 0.000 claims description 14
- 125000004573 morpholin-4-yl group Chemical group N1(CCOCC1)* 0.000 claims description 14
- 229910052698 phosphorus Inorganic materials 0.000 claims description 14
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical compound [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 claims description 13
- GCLGEJMYGQKIIW-UHFFFAOYSA-H sodium hexametaphosphate Chemical compound [Na]OP1(=O)OP(=O)(O[Na])OP(=O)(O[Na])OP(=O)(O[Na])OP(=O)(O[Na])OP(=O)(O[Na])O1 GCLGEJMYGQKIIW-UHFFFAOYSA-H 0.000 claims description 12
- 235000019982 sodium hexametaphosphate Nutrition 0.000 claims description 12
- 230000021615 conjugation Effects 0.000 claims description 11
- 239000013612 plasmid Substances 0.000 claims description 11
- 239000000126 substance Substances 0.000 claims description 11
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 10
- 239000005547 deoxyribonucleotide Substances 0.000 claims description 10
- 229910052757 nitrogen Inorganic materials 0.000 claims description 10
- 208000035657 Abasia Diseases 0.000 claims description 9
- 239000003054 catalyst Substances 0.000 claims description 9
- 125000000956 methoxy group Chemical group [H]C([H])([H])O* 0.000 claims description 9
- 241000588724 Escherichia coli Species 0.000 claims description 8
- OVRNDRQMDRJTHS-CBQIKETKSA-N N-Acetyl-D-Galactosamine Chemical compound CC(=O)N[C@H]1[C@@H](O)O[C@H](CO)[C@H](O)[C@@H]1O OVRNDRQMDRJTHS-CBQIKETKSA-N 0.000 claims description 8
- 238000012258 culturing Methods 0.000 claims description 8
- 229910052700 potassium Inorganic materials 0.000 claims description 8
- 229910052721 tungsten Inorganic materials 0.000 claims description 8
- POGLDEPLJHAHDF-UHFFFAOYSA-N methylsulfonyloxyphosphonamidic acid Chemical compound CS(=O)(=O)OP(=O)(N)O POGLDEPLJHAHDF-UHFFFAOYSA-N 0.000 claims description 7
- 230000003278 mimic effect Effects 0.000 claims description 7
- 238000001179 sorption measurement Methods 0.000 claims description 7
- 229960000549 4-dimethylaminophenol Drugs 0.000 claims description 6
- VHYFNPMBLIVWCW-UHFFFAOYSA-N 4-dimethylaminopyridine Substances CN(C)C1=CC=NC=C1 VHYFNPMBLIVWCW-UHFFFAOYSA-N 0.000 claims description 6
- 239000002214 arabinonucleotide Substances 0.000 claims description 6
- PTMHPRAIXMAOOB-UHFFFAOYSA-L phosphoramidate Chemical compound NP([O-])([O-])=O PTMHPRAIXMAOOB-UHFFFAOYSA-L 0.000 claims description 6
- AQMNWCRSESPIJM-UHFFFAOYSA-M sodium metaphosphate Chemical compound [Na+].[O-]P(=O)=O AQMNWCRSESPIJM-UHFFFAOYSA-M 0.000 claims description 6
- 235000019983 sodium metaphosphate Nutrition 0.000 claims description 6
- 235000019830 sodium polyphosphate Nutrition 0.000 claims description 6
- 239000001577 tetrasodium phosphonato phosphate Substances 0.000 claims description 6
- 230000003100 immobilizing effect Effects 0.000 claims description 5
- YACKEPLHDIMKIO-UHFFFAOYSA-N methylphosphonic acid Chemical compound CP(O)(O)=O YACKEPLHDIMKIO-UHFFFAOYSA-N 0.000 claims description 5
- 239000012531 culture fluid Substances 0.000 claims description 4
- 239000011343 solid material Substances 0.000 claims description 4
- 241001515965 unidentified phage Species 0.000 claims description 4
- 239000013603 viral vector Substances 0.000 claims description 4
- 230000008569 process Effects 0.000 abstract description 11
- 239000000047 product Substances 0.000 description 115
- 210000004027 cell Anatomy 0.000 description 81
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 71
- 238000012986 modification Methods 0.000 description 58
- 235000001014 amino acid Nutrition 0.000 description 56
- 229940024606 amino acid Drugs 0.000 description 55
- 150000001413 amino acids Chemical class 0.000 description 54
- 230000004048 modification Effects 0.000 description 53
- 230000000692 anti-sense effect Effects 0.000 description 41
- 102000004190 Enzymes Human genes 0.000 description 39
- 108090000790 Enzymes Proteins 0.000 description 39
- 229940088598 enzyme Drugs 0.000 description 39
- 108091081021 Sense strand Proteins 0.000 description 35
- -1 polyethylene Polymers 0.000 description 35
- 229920002477 rna polymer Polymers 0.000 description 33
- 239000000758 substrate Substances 0.000 description 31
- 108020004459 Small interfering RNA Proteins 0.000 description 28
- 230000015572 biosynthetic process Effects 0.000 description 27
- 238000004458 analytical method Methods 0.000 description 24
- 239000004055 small Interfering RNA Substances 0.000 description 24
- 108020004414 DNA Proteins 0.000 description 23
- 102000053602 DNA Human genes 0.000 description 23
- 108090000623 proteins and genes Proteins 0.000 description 23
- XPPKVPWEQAFLFU-UHFFFAOYSA-J diphosphate(4-) Chemical compound [O-]P([O-])(=O)OP([O-])([O-])=O XPPKVPWEQAFLFU-UHFFFAOYSA-J 0.000 description 22
- 239000006166 lysate Substances 0.000 description 22
- 238000012228 RNA interference-mediated gene silencing Methods 0.000 description 21
- 230000009368 gene silencing by RNA Effects 0.000 description 21
- 239000000243 solution Substances 0.000 description 21
- 235000000346 sugar Nutrition 0.000 description 21
- 239000013598 vector Substances 0.000 description 21
- 239000003795 chemical substances by application Substances 0.000 description 20
- 230000014509 gene expression Effects 0.000 description 17
- 239000000463 material Substances 0.000 description 16
- 238000003776 cleavage reaction Methods 0.000 description 15
- 230000000295 complement effect Effects 0.000 description 15
- 150000002632 lipids Chemical class 0.000 description 15
- 230000007017 scission Effects 0.000 description 15
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 14
- 238000004128 high performance liquid chromatography Methods 0.000 description 14
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 13
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 13
- 239000000562 conjugate Substances 0.000 description 13
- 238000012217 deletion Methods 0.000 description 13
- 230000037430 deletion Effects 0.000 description 13
- 235000018102 proteins Nutrition 0.000 description 13
- 102000004169 proteins and genes Human genes 0.000 description 13
- 239000007787 solid Substances 0.000 description 13
- 229910019142 PO4 Inorganic materials 0.000 description 12
- 239000000843 powder Substances 0.000 description 12
- 239000007983 Tris buffer Substances 0.000 description 11
- 238000007792 addition Methods 0.000 description 11
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 11
- 239000000543 intermediate Substances 0.000 description 11
- 239000000178 monomer Substances 0.000 description 11
- 239000000816 peptidomimetic Substances 0.000 description 11
- 239000010452 phosphate Substances 0.000 description 11
- 230000001225 therapeutic effect Effects 0.000 description 11
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 11
- IYMAXBFPHPZYIK-BQBZGAKWSA-N Arg-Gly-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IYMAXBFPHPZYIK-BQBZGAKWSA-N 0.000 description 10
- 101710163270 Nuclease Proteins 0.000 description 10
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 10
- 230000008685 targeting Effects 0.000 description 10
- 108020004705 Codon Proteins 0.000 description 9
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 9
- 108091046915 Threose nucleic acid Proteins 0.000 description 9
- 125000000217 alkyl group Chemical group 0.000 description 9
- 238000003556 assay Methods 0.000 description 9
- 239000007853 buffer solution Substances 0.000 description 9
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 9
- 229960005091 chloramphenicol Drugs 0.000 description 9
- 230000006872 improvement Effects 0.000 description 9
- 108020004999 messenger RNA Proteins 0.000 description 9
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 9
- 238000006467 substitution reaction Methods 0.000 description 9
- 239000006228 supernatant Substances 0.000 description 9
- 239000000074 antisense oligonucleotide Substances 0.000 description 8
- 238000012230 antisense oligonucleotides Methods 0.000 description 8
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 8
- 125000004122 cyclic group Chemical group 0.000 description 8
- 230000000021 endosomolytic effect Effects 0.000 description 8
- 238000003780 insertion Methods 0.000 description 8
- 230000037431 insertion Effects 0.000 description 8
- 229920000768 polyamine Polymers 0.000 description 8
- 238000012216 screening Methods 0.000 description 8
- 108020000948 Antisense Oligonucleotides Proteins 0.000 description 7
- 108091028664 Ribonucleotide Proteins 0.000 description 7
- 229960000643 adenine Drugs 0.000 description 7
- 239000000872 buffer Substances 0.000 description 7
- 229920001429 chelating resin Polymers 0.000 description 7
- 239000012470 diluted sample Substances 0.000 description 7
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 7
- 230000002255 enzymatic effect Effects 0.000 description 7
- 238000011534 incubation Methods 0.000 description 7
- 238000002703 mutagenesis Methods 0.000 description 7
- 231100000350 mutagenesis Toxicity 0.000 description 7
- 239000002243 precursor Substances 0.000 description 7
- 238000002360 preparation method Methods 0.000 description 7
- 239000011541 reaction mixture Substances 0.000 description 7
- 239000002336 ribonucleotide Substances 0.000 description 7
- 125000002652 ribonucleotide group Chemical group 0.000 description 7
- 229940035893 uracil Drugs 0.000 description 7
- 229930024421 Adenine Natural products 0.000 description 6
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 6
- 102000008100 Human Serum Albumin Human genes 0.000 description 6
- 108091006905 Human Serum Albumin Proteins 0.000 description 6
- 241001139947 Mida Species 0.000 description 6
- 108091093037 Peptide nucleic acid Proteins 0.000 description 6
- RWRDLPDLKQPQOW-UHFFFAOYSA-N Pyrrolidine Chemical compound C1CCNC1 RWRDLPDLKQPQOW-UHFFFAOYSA-N 0.000 description 6
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 6
- GSEJCLTVZPLZKY-UHFFFAOYSA-N Triethanolamine Chemical compound OCCN(CCO)CCO GSEJCLTVZPLZKY-UHFFFAOYSA-N 0.000 description 6
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 6
- 239000002253 acid Substances 0.000 description 6
- 230000009286 beneficial effect Effects 0.000 description 6
- 150000001720 carbohydrates Chemical class 0.000 description 6
- 235000014633 carbohydrates Nutrition 0.000 description 6
- 230000015556 catabolic process Effects 0.000 description 6
- 239000013592 cell lysate Substances 0.000 description 6
- 229920001577 copolymer Polymers 0.000 description 6
- 238000006731 degradation reaction Methods 0.000 description 6
- 238000001952 enzyme assay Methods 0.000 description 6
- 238000011065 in-situ storage Methods 0.000 description 6
- 230000002779 inactivation Effects 0.000 description 6
- 230000000670 limiting effect Effects 0.000 description 6
- 125000000325 methylidene group Chemical group [H]C([H])=* 0.000 description 6
- 125000004437 phosphorous atom Chemical group 0.000 description 6
- 229920000193 polymethacrylate Polymers 0.000 description 6
- 239000002244 precipitate Substances 0.000 description 6
- 238000010791 quenching Methods 0.000 description 6
- 230000006798 recombination Effects 0.000 description 6
- 238000005215 recombination Methods 0.000 description 6
- 210000001519 tissue Anatomy 0.000 description 6
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 5
- QNAYBMKLOCPYGJ-UWTATZPHSA-N D-alanine Chemical compound C[C@@H](N)C(O)=O QNAYBMKLOCPYGJ-UWTATZPHSA-N 0.000 description 5
- 239000007993 MOPS buffer Substances 0.000 description 5
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 5
- 230000001186 cumulative effect Effects 0.000 description 5
- 229940104302 cytosine Drugs 0.000 description 5
- 238000004108 freeze drying Methods 0.000 description 5
- 125000000524 functional group Chemical group 0.000 description 5
- 238000011068 loading method Methods 0.000 description 5
- 239000008188 pellet Substances 0.000 description 5
- 239000000376 reactant Substances 0.000 description 5
- 229940045145 uridine Drugs 0.000 description 5
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 4
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 4
- 108091023037 Aptamer Proteins 0.000 description 4
- 108091026890 Coding region Proteins 0.000 description 4
- QNAYBMKLOCPYGJ-UHFFFAOYSA-N D-alpha-Ala Natural products CC([NH3+])C([O-])=O QNAYBMKLOCPYGJ-UHFFFAOYSA-N 0.000 description 4
- 239000007995 HEPES buffer Substances 0.000 description 4
- 108090001090 Lectins Proteins 0.000 description 4
- 102000004856 Lectins Human genes 0.000 description 4
- 206010028980 Neoplasm Diseases 0.000 description 4
- 108010039918 Polylysine Proteins 0.000 description 4
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 4
- 125000002015 acyclic group Chemical group 0.000 description 4
- 150000001408 amides Chemical group 0.000 description 4
- 239000007806 chemical reaction intermediate Substances 0.000 description 4
- 235000012000 cholesterol Nutrition 0.000 description 4
- 230000009089 cytolysis Effects 0.000 description 4
- NNBZCPXTIHJBJL-UHFFFAOYSA-N decalin Chemical compound C1CCCC2CCCCC21 NNBZCPXTIHJBJL-UHFFFAOYSA-N 0.000 description 4
- 210000001163 endosome Anatomy 0.000 description 4
- 238000006911 enzymatic reaction Methods 0.000 description 4
- 229930182830 galactose Natural products 0.000 description 4
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 4
- 238000010348 incorporation Methods 0.000 description 4
- 239000002523 lectin Substances 0.000 description 4
- 239000012528 membrane Substances 0.000 description 4
- 229920000656 polylysine Polymers 0.000 description 4
- 229920000642 polymer Polymers 0.000 description 4
- 238000011533 pre-incubation Methods 0.000 description 4
- 230000008929 regeneration Effects 0.000 description 4
- 238000011069 regeneration method Methods 0.000 description 4
- 230000010076 replication Effects 0.000 description 4
- 229920005989 resin Polymers 0.000 description 4
- 239000011347 resin Substances 0.000 description 4
- 241000894007 species Species 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 4
- 239000011782 vitamin Substances 0.000 description 4
- 235000013343 vitamin Nutrition 0.000 description 4
- 229930003231 vitamin Natural products 0.000 description 4
- 229940088594 vitamin Drugs 0.000 description 4
- ZLAQATDNGLKIEV-UHFFFAOYSA-N 5-methyl-2-sulfanylidene-1h-pyrimidin-4-one Chemical compound CC1=CNC(=S)NC1=O ZLAQATDNGLKIEV-UHFFFAOYSA-N 0.000 description 3
- 101000605621 Acinetobacter johnsonii Polyphosphate:AMP phosphotransferase Proteins 0.000 description 3
- 229920001661 Chitosan Polymers 0.000 description 3
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 3
- 108090000288 Glycoproteins Proteins 0.000 description 3
- 102000003886 Glycoproteins Human genes 0.000 description 3
- 229930010555 Inosine Natural products 0.000 description 3
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 3
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 3
- SBKRTALNRRAOJP-BWSIXKJUSA-N N-[(2S)-4-amino-1-[[(2S,3R)-1-[[(2S)-4-amino-1-oxo-1-[[(3S,6S,9S,12S,15R,18R,21S)-6,9,18-tris(2-aminoethyl)-15-benzyl-3-[(1R)-1-hydroxyethyl]-12-(2-methylpropyl)-2,5,8,11,14,17,20-heptaoxo-1,4,7,10,13,16,19-heptazacyclotricos-21-yl]amino]butan-2-yl]amino]-3-hydroxy-1-oxobutan-2-yl]amino]-1-oxobutan-2-yl]-6-methylheptanamide (6S)-N-[(2S)-4-amino-1-[[(2S,3R)-1-[[(2S)-4-amino-1-oxo-1-[[(3S,6S,9S,12S,15R,18R,21S)-6,9,18-tris(2-aminoethyl)-15-benzyl-3-[(1R)-1-hydroxyethyl]-12-(2-methylpropyl)-2,5,8,11,14,17,20-heptaoxo-1,4,7,10,13,16,19-heptazacyclotricos-21-yl]amino]butan-2-yl]amino]-3-hydroxy-1-oxobutan-2-yl]amino]-1-oxobutan-2-yl]-6-methyloctanamide sulfuric acid Chemical compound OS(O)(=O)=O.CC(C)CCCCC(=O)N[C@@H](CCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCN)C(=O)N[C@H]1CCNC(=O)[C@@H](NC(=O)[C@H](CCN)NC(=O)[C@H](CCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](Cc2ccccc2)NC(=O)[C@@H](CCN)NC1=O)[C@@H](C)O.CC[C@H](C)CCCCC(=O)N[C@@H](CCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCN)C(=O)N[C@H]1CCNC(=O)[C@@H](NC(=O)[C@H](CCN)NC(=O)[C@H](CCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](Cc2ccccc2)NC(=O)[C@@H](CCN)NC1=O)[C@@H](C)O SBKRTALNRRAOJP-BWSIXKJUSA-N 0.000 description 3
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 3
- 108091005804 Peptidases Proteins 0.000 description 3
- 239000002202 Polyethylene glycol Substances 0.000 description 3
- 108010093965 Polymyxin B Proteins 0.000 description 3
- 239000004365 Protease Substances 0.000 description 3
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 3
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 3
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 3
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 3
- 229960003767 alanine Drugs 0.000 description 3
- 150000001345 alkine derivatives Chemical class 0.000 description 3
- 125000000304 alkynyl group Chemical group 0.000 description 3
- 150000001540 azides Chemical class 0.000 description 3
- 239000011324 bead Substances 0.000 description 3
- IQFYYKKMVGJFEH-UHFFFAOYSA-N beta-L-thymidine Natural products O=C1NC(=O)C(C)=CN1C1OC(CO)C(O)C1 IQFYYKKMVGJFEH-UHFFFAOYSA-N 0.000 description 3
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 3
- 230000008033 biological extinction Effects 0.000 description 3
- 238000004364 calculation method Methods 0.000 description 3
- 201000011510 cancer Diseases 0.000 description 3
- 150000001721 carbon Chemical group 0.000 description 3
- 230000003197 catalytic effect Effects 0.000 description 3
- 125000003636 chemical group Chemical group 0.000 description 3
- 230000000052 comparative effect Effects 0.000 description 3
- 230000001276 controlling effect Effects 0.000 description 3
- 230000008878 coupling Effects 0.000 description 3
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- 210000000805 cytoplasm Anatomy 0.000 description 3
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 3
- 230000001419 dependent effect Effects 0.000 description 3
- 238000009826 distribution Methods 0.000 description 3
- 238000000855 fermentation Methods 0.000 description 3
- 230000004151 fermentation Effects 0.000 description 3
- OVBPIULPVIDEAO-LBPRGKRZSA-N folic acid Chemical compound C=1N=C2NC(N)=NC(=O)C2=NC=1CNC1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 OVBPIULPVIDEAO-LBPRGKRZSA-N 0.000 description 3
- 239000000499 gel Substances 0.000 description 3
- 239000008103 glucose Substances 0.000 description 3
- 230000002209 hydrophobic effect Effects 0.000 description 3
- 229960003786 inosine Drugs 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 229910052751 metal Inorganic materials 0.000 description 3
- 239000002184 metal Substances 0.000 description 3
- 239000002777 nucleoside Substances 0.000 description 3
- 229920001223 polyethylene glycol Polymers 0.000 description 3
- 229960003548 polymyxin b sulfate Drugs 0.000 description 3
- 238000003259 recombinant expression Methods 0.000 description 3
- 230000001105 regulatory effect Effects 0.000 description 3
- 125000006853 reporter group Chemical group 0.000 description 3
- 210000002966 serum Anatomy 0.000 description 3
- 239000007790 solid phase Substances 0.000 description 3
- 238000010561 standard procedure Methods 0.000 description 3
- 150000003431 steroids Chemical class 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- 210000004881 tumor cell Anatomy 0.000 description 3
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 3
- KIUKXJAPPMFGSW-DNGZLQJQSA-N (2S,3S,4S,5R,6R)-6-[(2S,3R,4R,5S,6R)-3-Acetamido-2-[(2S,3S,4R,5R,6R)-6-[(2R,3R,4R,5S,6R)-3-acetamido-2,5-dihydroxy-6-(hydroxymethyl)oxan-4-yl]oxy-2-carboxy-4,5-dihydroxyoxan-3-yl]oxy-5-hydroxy-6-(hydroxymethyl)oxan-4-yl]oxy-3,4,5-trihydroxyoxane-2-carboxylic acid Chemical compound CC(=O)N[C@H]1[C@H](O)O[C@H](CO)[C@@H](O)[C@@H]1O[C@H]1[C@H](O)[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](O[C@H]3[C@@H]([C@@H](O)[C@H](O)[C@H](O3)C(O)=O)O)[C@H](O)[C@@H](CO)O2)NC(C)=O)[C@@H](C(O)=O)O1 KIUKXJAPPMFGSW-DNGZLQJQSA-N 0.000 description 2
- GVJHHUAWPYXKBD-UHFFFAOYSA-N (±)-α-Tocopherol Chemical compound OC1=C(C)C(C)=C2OC(CCCC(C)CCCC(C)CCCC(C)C)(C)CCC2=C1C GVJHHUAWPYXKBD-UHFFFAOYSA-N 0.000 description 2
- WNXJIVFYUVYPPR-UHFFFAOYSA-N 1,3-dioxolane Chemical compound C1COCO1 WNXJIVFYUVYPPR-UHFFFAOYSA-N 0.000 description 2
- UHDGCWIWMRVCDJ-UHFFFAOYSA-N 1-beta-D-Xylofuranosyl-NH-Cytosine Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 UHDGCWIWMRVCDJ-UHFFFAOYSA-N 0.000 description 2
- RFLVMTUMFYRZCB-UHFFFAOYSA-N 1-methylguanine Chemical compound O=C1N(C)C(N)=NC2=C1N=CN2 RFLVMTUMFYRZCB-UHFFFAOYSA-N 0.000 description 2
- IQFYYKKMVGJFEH-BIIVOSGPSA-N 2'-deoxythymidine Natural products O=C1NC(=O)C(C)=CN1[C@@H]1O[C@@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-BIIVOSGPSA-N 0.000 description 2
- YSAJFXWTVFGPAX-UHFFFAOYSA-N 2-[(2,4-dioxo-1h-pyrimidin-5-yl)oxy]acetic acid Chemical compound OC(=O)COC1=CNC(=O)NC1=O YSAJFXWTVFGPAX-UHFFFAOYSA-N 0.000 description 2
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 2
- FZWGECJQACGGTI-UHFFFAOYSA-N 2-amino-7-methyl-1,7-dihydro-6H-purin-6-one Chemical compound NC1=NC(O)=C2N(C)C=NC2=N1 FZWGECJQACGGTI-UHFFFAOYSA-N 0.000 description 2
- KJJPLEZQSCZCKE-UHFFFAOYSA-N 2-aminopropane-1,3-diol Chemical group OCC(N)CO KJJPLEZQSCZCKE-UHFFFAOYSA-N 0.000 description 2
- OVONXEQGWXGFJD-UHFFFAOYSA-N 4-sulfanylidene-1h-pyrimidin-2-one Chemical compound SC=1C=CNC(=O)N=1 OVONXEQGWXGFJD-UHFFFAOYSA-N 0.000 description 2
- OIVLITBTBDPEFK-UHFFFAOYSA-N 5,6-dihydrouracil Chemical compound O=C1CCNC(=O)N1 OIVLITBTBDPEFK-UHFFFAOYSA-N 0.000 description 2
- RYVNIFSIEDRLSJ-UHFFFAOYSA-N 5-(hydroxymethyl)cytosine Chemical compound NC=1NC(=O)N=CC=1CO RYVNIFSIEDRLSJ-UHFFFAOYSA-N 0.000 description 2
- PEHVGBZKEYRQSX-UHFFFAOYSA-N 7-deaza-adenine Chemical compound NC1=NC=NC2=C1C=CN2 PEHVGBZKEYRQSX-UHFFFAOYSA-N 0.000 description 2
- HCGHYQLFMPXSDU-UHFFFAOYSA-N 7-methyladenine Chemical compound C1=NC(N)=C2N(C)C=NC2=N1 HCGHYQLFMPXSDU-UHFFFAOYSA-N 0.000 description 2
- MSSXOMSJDRHRMC-UHFFFAOYSA-N 9H-purine-2,6-diamine Chemical compound NC1=NC(N)=C2NC=NC2=N1 MSSXOMSJDRHRMC-UHFFFAOYSA-N 0.000 description 2
- QTBSBXVTEAMEQO-UHFFFAOYSA-M Acetate Chemical compound CC([O-])=O QTBSBXVTEAMEQO-UHFFFAOYSA-M 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 102000004506 Blood Proteins Human genes 0.000 description 2
- 108010017384 Blood Proteins Proteins 0.000 description 2
- BVKZGUZCCUSVTD-UHFFFAOYSA-L Carbonate Chemical compound [O-]C([O-])=O BVKZGUZCCUSVTD-UHFFFAOYSA-L 0.000 description 2
- 229920002101 Chitin Polymers 0.000 description 2
- KRKNYBCHXYNGOX-UHFFFAOYSA-K Citrate Chemical compound [O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O KRKNYBCHXYNGOX-UHFFFAOYSA-K 0.000 description 2
- 108010069514 Cyclic Peptides Proteins 0.000 description 2
- 102000001189 Cyclic Peptides Human genes 0.000 description 2
- 229920000858 Cyclodextrin Polymers 0.000 description 2
- UHDGCWIWMRVCDJ-PSQAKQOGSA-N Cytidine Natural products O=C1N=C(N)C=CN1[C@@H]1[C@@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-PSQAKQOGSA-N 0.000 description 2
- CKLJMWTZIZZHCS-UWTATZPHSA-N D-aspartic acid Chemical compound OC(=O)[C@H](N)CC(O)=O CKLJMWTZIZZHCS-UWTATZPHSA-N 0.000 description 2
- SHZGCJCMOBCMKK-UHFFFAOYSA-N D-mannomethylose Natural products CC1OC(O)C(O)C(O)C1O SHZGCJCMOBCMKK-UHFFFAOYSA-N 0.000 description 2
- WQZGKKKJIJFFOK-QTVWNMPRSA-N D-mannopyranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-QTVWNMPRSA-N 0.000 description 2
- 229920002307 Dextran Polymers 0.000 description 2
- 239000004593 Epoxy Substances 0.000 description 2
- PNNNRSAQSRJVSB-SLPGGIOYSA-N Fucose Natural products C[C@H](O)[C@@H](O)[C@H](O)[C@H](O)C=O PNNNRSAQSRJVSB-SLPGGIOYSA-N 0.000 description 2
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 2
- 102000015779 HDL Lipoproteins Human genes 0.000 description 2
- 108010010234 HDL Lipoproteins Proteins 0.000 description 2
- NTYJJOPFIAHURM-UHFFFAOYSA-N Histamine Chemical compound NCCC1=CN=CN1 NTYJJOPFIAHURM-UHFFFAOYSA-N 0.000 description 2
- 102100034343 Integrase Human genes 0.000 description 2
- 101710203526 Integrase Proteins 0.000 description 2
- 229920001202 Inulin Polymers 0.000 description 2
- SHZGCJCMOBCMKK-DHVFOXMCSA-N L-fucopyranose Chemical compound C[C@@H]1OC(O)[C@@H](O)[C@H](O)[C@@H]1O SHZGCJCMOBCMKK-DHVFOXMCSA-N 0.000 description 2
- 102000007330 LDL Lipoproteins Human genes 0.000 description 2
- 108010007622 LDL Lipoproteins Proteins 0.000 description 2
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 2
- SMEROWZSTRWXGI-UHFFFAOYSA-N Lithocholsaeure Natural products C1CC2CC(O)CCC2(C)C2C1C1CCC(C(CCC(O)=O)C)C1(C)CC2 SMEROWZSTRWXGI-UHFFFAOYSA-N 0.000 description 2
- 102000016943 Muramidase Human genes 0.000 description 2
- 108010014251 Muramidase Proteins 0.000 description 2
- NWIBSHFKIJFRCO-WUDYKRTCSA-N Mytomycin Chemical compound C1N2C(C(C(C)=C(N)C3=O)=O)=C3[C@@H](COC(N)=O)[C@@]2(OC)[C@@H]2[C@H]1N2 NWIBSHFKIJFRCO-WUDYKRTCSA-N 0.000 description 2
- HYVABZIGRDEKCD-UHFFFAOYSA-N N(6)-dimethylallyladenine Chemical compound CC(C)=CCNC1=NC=NC2=C1N=CN2 HYVABZIGRDEKCD-UHFFFAOYSA-N 0.000 description 2
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 2
- OHLUUHNLEMFGTQ-UHFFFAOYSA-N N-methylacetamide Chemical compound CNC(C)=O OHLUUHNLEMFGTQ-UHFFFAOYSA-N 0.000 description 2
- ABLZXFCXXLZCGV-UHFFFAOYSA-N Phosphorous acid Chemical class OP(O)=O ABLZXFCXXLZCGV-UHFFFAOYSA-N 0.000 description 2
- 229920002873 Polyethylenimine Polymers 0.000 description 2
- 239000004372 Polyvinyl alcohol Substances 0.000 description 2
- WCUXLLCKKVVCTQ-UHFFFAOYSA-M Potassium chloride Chemical compound [Cl-].[K+] WCUXLLCKKVVCTQ-UHFFFAOYSA-M 0.000 description 2
- 239000004373 Pullulan Substances 0.000 description 2
- 229920001218 Pullulan Polymers 0.000 description 2
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- GMBQZIIUCVWOCD-WWASVFFGSA-N Sarsapogenine Chemical compound O([C@@H]1[C@@H]([C@]2(CC[C@@H]3[C@@]4(C)CC[C@H](O)C[C@H]4CC[C@H]3[C@@H]2C1)C)[C@@H]1C)[C@]11CC[C@H](C)CO1 GMBQZIIUCVWOCD-WWASVFFGSA-N 0.000 description 2
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 2
- 108091027967 Small hairpin RNA Proteins 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- PPBRXRYQALVLMV-UHFFFAOYSA-N Styrene Chemical compound C=CC1=CC=CC=C1 PPBRXRYQALVLMV-UHFFFAOYSA-N 0.000 description 2
- RYYWUUFWQRZTIU-UHFFFAOYSA-N Thiophosphoric acid Chemical class OP(O)(S)=O RYYWUUFWQRZTIU-UHFFFAOYSA-N 0.000 description 2
- 241000723792 Tobacco etch virus Species 0.000 description 2
- 102000004338 Transferrin Human genes 0.000 description 2
- 108090000901 Transferrin Proteins 0.000 description 2
- 229910052770 Uranium Inorganic materials 0.000 description 2
- 238000010521 absorption reaction Methods 0.000 description 2
- 239000012190 activator Substances 0.000 description 2
- 239000008186 active pharmaceutical agent Substances 0.000 description 2
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 2
- 125000003342 alkenyl group Chemical group 0.000 description 2
- WQZGKKKJIJFFOK-PHYPRBDBSA-N alpha-D-galactose Chemical compound OC[C@H]1O[C@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-PHYPRBDBSA-N 0.000 description 2
- 125000004429 atom Chemical group 0.000 description 2
- 230000002210 biocatalytic effect Effects 0.000 description 2
- 235000020958 biotin Nutrition 0.000 description 2
- 239000011616 biotin Substances 0.000 description 2
- 229960002685 biotin Drugs 0.000 description 2
- 239000006227 byproduct Substances 0.000 description 2
- 125000004432 carbon atom Chemical group C* 0.000 description 2
- 210000000170 cell membrane Anatomy 0.000 description 2
- 239000001913 cellulose Substances 0.000 description 2
- 229920002678 cellulose Polymers 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 239000007795 chemical reaction product Substances 0.000 description 2
- 238000004587 chromatography analysis Methods 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 239000000470 constituent Substances 0.000 description 2
- 239000005289 controlled pore glass Substances 0.000 description 2
- 239000010949 copper Substances 0.000 description 2
- 239000003431 cross linking reagent Substances 0.000 description 2
- 125000000753 cycloalkyl group Chemical group 0.000 description 2
- UHDGCWIWMRVCDJ-ZAKLUEHWSA-N cytidine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-ZAKLUEHWSA-N 0.000 description 2
- 239000000412 dendrimer Substances 0.000 description 2
- 229920000736 dendritic polymer Polymers 0.000 description 2
- ZBCBWPMODOFKDW-UHFFFAOYSA-N diethanolamine Chemical group OCCNCCO ZBCBWPMODOFKDW-UHFFFAOYSA-N 0.000 description 2
- 229940079919 digestives enzyme preparation Drugs 0.000 description 2
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 235000019152 folic acid Nutrition 0.000 description 2
- 239000011724 folic acid Substances 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 125000001475 halogen functional group Chemical group 0.000 description 2
- 125000005842 heteroatom Chemical group 0.000 description 2
- IPCSVZSSVZVIGE-UHFFFAOYSA-N hexadecanoic acid Chemical compound CCCCCCCCCCCCCCCC(O)=O IPCSVZSSVZVIGE-UHFFFAOYSA-N 0.000 description 2
- 229920002674 hyaluronan Polymers 0.000 description 2
- 229960003160 hyaluronic acid Drugs 0.000 description 2
- 238000009396 hybridization Methods 0.000 description 2
- FDGQSTZJBFJUBT-UHFFFAOYSA-N hypoxanthine Chemical compound O=C1NC=NC2=C1NC=N2 FDGQSTZJBFJUBT-UHFFFAOYSA-N 0.000 description 2
- 125000002632 imidazolidinyl group Chemical group 0.000 description 2
- 125000002636 imidazolinyl group Chemical group 0.000 description 2
- 239000000138 intercalating agent Substances 0.000 description 2
- 230000000968 intestinal effect Effects 0.000 description 2
- JYJIGFIDKWBXDU-MNNPPOADSA-N inulin Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)OC[C@]1(OC[C@]2(OC[C@]3(OC[C@]4(OC[C@]5(OC[C@]6(OC[C@]7(OC[C@]8(OC[C@]9(OC[C@]%10(OC[C@]%11(OC[C@]%12(OC[C@]%13(OC[C@]%14(OC[C@]%15(OC[C@]%16(OC[C@]%17(OC[C@]%18(OC[C@]%19(OC[C@]%20(OC[C@]%21(OC[C@]%22(OC[C@]%23(OC[C@]%24(OC[C@]%25(OC[C@]%26(OC[C@]%27(OC[C@]%28(OC[C@]%29(OC[C@]%30(OC[C@]%31(OC[C@]%32(OC[C@]%33(OC[C@]%34(OC[C@]%35(OC[C@]%36(O[C@@H]%37[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O%37)O)[C@H]([C@H](O)[C@@H](CO)O%36)O)[C@H]([C@H](O)[C@@H](CO)O%35)O)[C@H]([C@H](O)[C@@H](CO)O%34)O)[C@H]([C@H](O)[C@@H](CO)O%33)O)[C@H]([C@H](O)[C@@H](CO)O%32)O)[C@H]([C@H](O)[C@@H](CO)O%31)O)[C@H]([C@H](O)[C@@H](CO)O%30)O)[C@H]([C@H](O)[C@@H](CO)O%29)O)[C@H]([C@H](O)[C@@H](CO)O%28)O)[C@H]([C@H](O)[C@@H](CO)O%27)O)[C@H]([C@H](O)[C@@H](CO)O%26)O)[C@H]([C@H](O)[C@@H](CO)O%25)O)[C@H]([C@H](O)[C@@H](CO)O%24)O)[C@H]([C@H](O)[C@@H](CO)O%23)O)[C@H]([C@H](O)[C@@H](CO)O%22)O)[C@H]([C@H](O)[C@@H](CO)O%21)O)[C@H]([C@H](O)[C@@H](CO)O%20)O)[C@H]([C@H](O)[C@@H](CO)O%19)O)[C@H]([C@H](O)[C@@H](CO)O%18)O)[C@H]([C@H](O)[C@@H](CO)O%17)O)[C@H]([C@H](O)[C@@H](CO)O%16)O)[C@H]([C@H](O)[C@@H](CO)O%15)O)[C@H]([C@H](O)[C@@H](CO)O%14)O)[C@H]([C@H](O)[C@@H](CO)O%13)O)[C@H]([C@H](O)[C@@H](CO)O%12)O)[C@H]([C@H](O)[C@@H](CO)O%11)O)[C@H]([C@H](O)[C@@H](CO)O%10)O)[C@H]([C@H](O)[C@@H](CO)O9)O)[C@H]([C@H](O)[C@@H](CO)O8)O)[C@H]([C@H](O)[C@@H](CO)O7)O)[C@H]([C@H](O)[C@@H](CO)O6)O)[C@H]([C@H](O)[C@@H](CO)O5)O)[C@H]([C@H](O)[C@@H](CO)O4)O)[C@H]([C@H](O)[C@@H](CO)O3)O)[C@H]([C@H](O)[C@@H](CO)O2)O)[C@@H](O)[C@H](O)[C@@H](CO)O1 JYJIGFIDKWBXDU-MNNPPOADSA-N 0.000 description 2
- 229940029339 inulin Drugs 0.000 description 2
- 125000001449 isopropyl group Chemical group [H]C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 2
- 125000004628 isothiazolidinyl group Chemical group S1N(CCC1)* 0.000 description 2
- 125000003965 isoxazolidinyl group Chemical group 0.000 description 2
- 239000008101 lactose Substances 0.000 description 2
- 150000002634 lipophilic molecules Chemical class 0.000 description 2
- SMEROWZSTRWXGI-HVATVPOCSA-N lithocholic acid Chemical compound C([C@H]1CC2)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CCC(O)=O)C)[C@@]2(C)CC1 SMEROWZSTRWXGI-HVATVPOCSA-N 0.000 description 2
- 210000004185 liver Anatomy 0.000 description 2
- 210000004072 lung Anatomy 0.000 description 2
- 230000028744 lysogeny Effects 0.000 description 2
- 239000004325 lysozyme Substances 0.000 description 2
- 229960000274 lysozyme Drugs 0.000 description 2
- 235000010335 lysozyme Nutrition 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 108091070501 miRNA Proteins 0.000 description 2
- 125000002757 morpholinyl group Chemical group 0.000 description 2
- 230000035772 mutation Effects 0.000 description 2
- 125000004433 nitrogen atom Chemical group N* 0.000 description 2
- 150000003833 nucleoside derivatives Chemical class 0.000 description 2
- 125000003835 nucleoside group Chemical group 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 125000000160 oxazolidinyl group Chemical group 0.000 description 2
- 229910052760 oxygen Inorganic materials 0.000 description 2
- 239000002245 particle Substances 0.000 description 2
- 239000008363 phosphate buffer Substances 0.000 description 2
- 239000002953 phosphate buffered saline Substances 0.000 description 2
- 150000004713 phosphodiesters Chemical class 0.000 description 2
- UEZVMMHDMIWARA-UHFFFAOYSA-M phosphonate Chemical compound [O-]P(=O)=O UEZVMMHDMIWARA-UHFFFAOYSA-M 0.000 description 2
- 150000008300 phosphoramidites Chemical class 0.000 description 2
- 230000026731 phosphorylation Effects 0.000 description 2
- 238000006366 phosphorylation reaction Methods 0.000 description 2
- 125000004193 piperazinyl group Chemical group 0.000 description 2
- 125000003386 piperidinyl group Chemical group 0.000 description 2
- BASFCYQUMIYNBI-UHFFFAOYSA-N platinum Chemical compound [Pt] BASFCYQUMIYNBI-UHFFFAOYSA-N 0.000 description 2
- 229920001308 poly(aminoacid) Polymers 0.000 description 2
- 229920002451 polyvinyl alcohol Polymers 0.000 description 2
- 101150001140 ppk gene Proteins 0.000 description 2
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 2
- 238000001742 protein purification Methods 0.000 description 2
- ZCCUUQDIBDJBTK-UHFFFAOYSA-N psoralen Chemical compound C1=C2OC(=O)C=CC2=CC2=C1OC=C2 ZCCUUQDIBDJBTK-UHFFFAOYSA-N 0.000 description 2
- 235000019423 pullulan Nutrition 0.000 description 2
- 125000004219 purine nucleobase group Chemical group 0.000 description 2
- 125000003072 pyrazolidinyl group Chemical group 0.000 description 2
- 125000002755 pyrazolinyl group Chemical group 0.000 description 2
- 150000003230 pyrimidines Chemical class 0.000 description 2
- 125000000719 pyrrolidinyl group Chemical group 0.000 description 2
- 125000001567 quinoxalinyl group Chemical group N1=C(C=NC2=CC=CC=C12)* 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- HFHDHCJBZVLPGP-UHFFFAOYSA-N schardinger α-dextrin Chemical compound O1C(C(C2O)O)C(CO)OC2OC(C(C2O)O)C(CO)OC2OC(C(C2O)O)C(CO)OC2OC(C(O)C2O)C(CO)OC2OC(C(C2O)O)C(CO)OC2OC2C(O)C(O)C1OC2CO HFHDHCJBZVLPGP-UHFFFAOYSA-N 0.000 description 2
- 239000000377 silicon dioxide Substances 0.000 description 2
- 239000002904 solvent Substances 0.000 description 2
- 125000006850 spacer group Chemical group 0.000 description 2
- ATHGHQPFGPMSJY-UHFFFAOYSA-N spermidine Chemical compound NCCCCNCCCN ATHGHQPFGPMSJY-UHFFFAOYSA-N 0.000 description 2
- PFNFFQXMRSDOHW-UHFFFAOYSA-N spermine Chemical compound NCCCNCCCCNCCCN PFNFFQXMRSDOHW-UHFFFAOYSA-N 0.000 description 2
- 239000007921 spray Substances 0.000 description 2
- 239000007858 starting material Substances 0.000 description 2
- 150000008163 sugars Chemical class 0.000 description 2
- 230000008961 swelling Effects 0.000 description 2
- 125000003718 tetrahydrofuranyl group Chemical group 0.000 description 2
- 125000001984 thiazolidinyl group Chemical group 0.000 description 2
- UMGDCJDMYOKAJW-UHFFFAOYSA-N thiourea Chemical compound NC(N)=S UMGDCJDMYOKAJW-UHFFFAOYSA-N 0.000 description 2
- 238000001890 transfection Methods 0.000 description 2
- 239000012581 transferrin Substances 0.000 description 2
- LWIHDJKSTIGBAC-UHFFFAOYSA-K tripotassium phosphate Chemical compound [K+].[K+].[K+].[O-]P([O-])([O-])=O LWIHDJKSTIGBAC-UHFFFAOYSA-K 0.000 description 2
- 238000000108 ultra-filtration Methods 0.000 description 2
- XUARCIYIVXVTAE-ZAPOICBTSA-N uvaol Chemical compound C1C[C@H](O)C(C)(C)[C@@H]2CC[C@@]3(C)[C@]4(C)CC[C@@]5(CO)CC[C@@H](C)[C@H](C)[C@H]5C4=CC[C@@H]3[C@]21C XUARCIYIVXVTAE-ZAPOICBTSA-N 0.000 description 2
- PXXNTAGJWPJAGM-UHFFFAOYSA-N vertaline Natural products C1C2C=3C=C(OC)C(OC)=CC=3OC(C=C3)=CC=C3CCC(=O)OC1CC1N2CCCC1 PXXNTAGJWPJAGM-UHFFFAOYSA-N 0.000 description 2
- 239000011701 zinc Substances 0.000 description 2
- NOOLISFMXDJSKH-UTLUCORTSA-N (+)-Neomenthol Chemical group CC(C)[C@@H]1CC[C@@H](C)C[C@@H]1O NOOLISFMXDJSKH-UTLUCORTSA-N 0.000 description 1
- DTGKSKDOIYIVQL-WEDXCCLWSA-N (+)-borneol Chemical group C1C[C@@]2(C)[C@@H](O)C[C@@H]1C2(C)C DTGKSKDOIYIVQL-WEDXCCLWSA-N 0.000 description 1
- DNIAPMSPPWPWGF-VKHMYHEASA-N (+)-propylene glycol Chemical group C[C@H](O)CO DNIAPMSPPWPWGF-VKHMYHEASA-N 0.000 description 1
- REPVLJRCJUVQFA-UHFFFAOYSA-N (-)-isopinocampheol Chemical group C1C(O)C(C)C2C(C)(C)C1C2 REPVLJRCJUVQFA-UHFFFAOYSA-N 0.000 description 1
- HSINOMROUCMIEA-FGVHQWLLSA-N (2s,4r)-4-[(3r,5s,6r,7r,8s,9s,10s,13r,14s,17r)-6-ethyl-3,7-dihydroxy-10,13-dimethyl-2,3,4,5,6,7,8,9,11,12,14,15,16,17-tetradecahydro-1h-cyclopenta[a]phenanthren-17-yl]-2-methylpentanoic acid Chemical compound C([C@@]12C)C[C@@H](O)C[C@H]1[C@@H](CC)[C@@H](O)[C@@H]1[C@@H]2CC[C@]2(C)[C@@H]([C@H](C)C[C@H](C)C(O)=O)CC[C@H]21 HSINOMROUCMIEA-FGVHQWLLSA-N 0.000 description 1
- BHQCQFFYRZLCQQ-UHFFFAOYSA-N (3alpha,5alpha,7alpha,12alpha)-3,7,12-trihydroxy-cholan-24-oic acid Natural products OC1CC2CC(O)CCC2(C)C2C1C1CCC(C(CCC(O)=O)C)C1(C)C(O)C2 BHQCQFFYRZLCQQ-UHFFFAOYSA-N 0.000 description 1
- ZIZMDHZLHJBNSQ-UHFFFAOYSA-N 1,2-dihydrophenazine Chemical compound C1=CC=C2N=C(C=CCC3)C3=NC2=C1 ZIZMDHZLHJBNSQ-UHFFFAOYSA-N 0.000 description 1
- YPFDHNVEDLHUCE-UHFFFAOYSA-N 1,3-propanediol Chemical group OCCCO YPFDHNVEDLHUCE-UHFFFAOYSA-N 0.000 description 1
- 229940035437 1,3-propanediol Drugs 0.000 description 1
- WJNGQIYEQLPJMN-IOSLPCCCSA-N 1-methylinosine Chemical compound C1=NC=2C(=O)N(C)C=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O WJNGQIYEQLPJMN-IOSLPCCCSA-N 0.000 description 1
- MZMNEDXVUJLQAF-UHFFFAOYSA-N 1-o-tert-butyl 2-o-methyl 4-hydroxypyrrolidine-1,2-dicarboxylate Chemical compound COC(=O)C1CC(O)CN1C(=O)OC(C)(C)C MZMNEDXVUJLQAF-UHFFFAOYSA-N 0.000 description 1
- PNXSUXRNBHUCSF-HSYVXBRLSA-N 107658-43-5 Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)OC(=O)CC[C@@H](C(=O)N[C@@H](C)C(=O)OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O)NC(=O)[C@H](CCC(=O)OC(=O)[C@H](CCC(=O)OC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)N)C1C=NC=N1 PNXSUXRNBHUCSF-HSYVXBRLSA-N 0.000 description 1
- TZMSYXZUNZXBOL-UHFFFAOYSA-N 10H-phenoxazine Chemical compound C1=CC=C2NC3=CC=CC=C3OC2=C1 TZMSYXZUNZXBOL-UHFFFAOYSA-N 0.000 description 1
- NVKAWKQGWWIWPM-ABEVXSGRSA-N 17-β-hydroxy-5-α-Androstan-3-one Chemical compound C1C(=O)CC[C@]2(C)[C@H]3CC[C@](C)([C@H](CC4)O)[C@@H]4[C@@H]3CC[C@H]21 NVKAWKQGWWIWPM-ABEVXSGRSA-N 0.000 description 1
- UHUHBFMZVCOEOV-UHFFFAOYSA-N 1h-imidazo[4,5-c]pyridin-4-amine Chemical compound NC1=NC=CC2=C1N=CN2 UHUHBFMZVCOEOV-UHFFFAOYSA-N 0.000 description 1
- VEPOHXYIFQMVHW-XOZOLZJESA-N 2,3-dihydroxybutanedioic acid (2S,3S)-3,4-dimethyl-2-phenylmorpholine Chemical compound OC(C(O)C(O)=O)C(O)=O.C[C@H]1[C@@H](OCCN1C)c1ccccc1 VEPOHXYIFQMVHW-XOZOLZJESA-N 0.000 description 1
- AZUHIVLOSAPWDM-UHFFFAOYSA-N 2-(1h-imidazol-2-yl)-1h-imidazole Chemical compound C1=CNC(C=2NC=CN=2)=N1 AZUHIVLOSAPWDM-UHFFFAOYSA-N 0.000 description 1
- HLYBTPMYFWWNJN-UHFFFAOYSA-N 2-(2,4-dioxo-1h-pyrimidin-5-yl)-2-hydroxyacetic acid Chemical compound OC(=O)C(O)C1=CNC(=O)NC1=O HLYBTPMYFWWNJN-UHFFFAOYSA-N 0.000 description 1
- JECYNCQXXKQDJN-UHFFFAOYSA-N 2-(2-methylhexan-2-yloxymethyl)oxirane Chemical compound CCCCC(C)(C)OCC1CO1 JECYNCQXXKQDJN-UHFFFAOYSA-N 0.000 description 1
- SGAKLDIYNFXTCK-UHFFFAOYSA-N 2-[(2,4-dioxo-1h-pyrimidin-5-yl)methylamino]acetic acid Chemical compound OC(=O)CNCC1=CNC(=O)NC1=O SGAKLDIYNFXTCK-UHFFFAOYSA-N 0.000 description 1
- XMSMHKMPBNTBOD-UHFFFAOYSA-N 2-dimethylamino-6-hydroxypurine Chemical compound N1C(N(C)C)=NC(=O)C2=C1N=CN2 XMSMHKMPBNTBOD-UHFFFAOYSA-N 0.000 description 1
- WKMPTBDYDNUJLF-UHFFFAOYSA-N 2-fluoroadenine Chemical compound NC1=NC(F)=NC2=C1N=CN2 WKMPTBDYDNUJLF-UHFFFAOYSA-N 0.000 description 1
- SMADWRYCYBUIKH-UHFFFAOYSA-N 2-methyl-7h-purin-6-amine Chemical compound CC1=NC(N)=C2NC=NC2=N1 SMADWRYCYBUIKH-UHFFFAOYSA-N 0.000 description 1
- KOLPWZCZXAMXKS-UHFFFAOYSA-N 3-methylcytosine Chemical compound CN1C(N)=CC=NC1=O KOLPWZCZXAMXKS-UHFFFAOYSA-N 0.000 description 1
- GJAKJCICANKRFD-UHFFFAOYSA-N 4-acetyl-4-amino-1,3-dihydropyrimidin-2-one Chemical compound CC(=O)C1(N)NC(=O)NC=C1 GJAKJCICANKRFD-UHFFFAOYSA-N 0.000 description 1
- MQJSSLBGAQJNER-UHFFFAOYSA-N 5-(methylaminomethyl)-1h-pyrimidine-2,4-dione Chemical compound CNCC1=CNC(=O)NC1=O MQJSSLBGAQJNER-UHFFFAOYSA-N 0.000 description 1
- WPYRHVXCOQLYLY-UHFFFAOYSA-N 5-[(methoxyamino)methyl]-2-sulfanylidene-1h-pyrimidin-4-one Chemical compound CONCC1=CNC(=S)NC1=O WPYRHVXCOQLYLY-UHFFFAOYSA-N 0.000 description 1
- LQLQRFGHAALLLE-UHFFFAOYSA-N 5-bromouracil Chemical compound BrC1=CNC(=O)NC1=O LQLQRFGHAALLLE-UHFFFAOYSA-N 0.000 description 1
- VKLFQTYNHLDMDP-PNHWDRBUSA-N 5-carboxymethylaminomethyl-2-thiouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=S)NC(=O)C(CNCC(O)=O)=C1 VKLFQTYNHLDMDP-PNHWDRBUSA-N 0.000 description 1
- ZFTBZKVVGZNMJR-UHFFFAOYSA-N 5-chlorouracil Chemical compound ClC1=CNC(=O)NC1=O ZFTBZKVVGZNMJR-UHFFFAOYSA-N 0.000 description 1
- KSNXJLQDQOIRIP-UHFFFAOYSA-N 5-iodouracil Chemical compound IC1=CNC(=O)NC1=O KSNXJLQDQOIRIP-UHFFFAOYSA-N 0.000 description 1
- KELXHQACBIUYSE-UHFFFAOYSA-N 5-methoxy-1h-pyrimidine-2,4-dione Chemical compound COC1=CNC(=O)NC1=O KELXHQACBIUYSE-UHFFFAOYSA-N 0.000 description 1
- LRSASMSXMSNRBT-UHFFFAOYSA-N 5-methylcytosine Chemical compound CC1=CNC(=O)N=C1N LRSASMSXMSNRBT-UHFFFAOYSA-N 0.000 description 1
- KXBCLNRMQPRVTP-UHFFFAOYSA-N 6-amino-1,5-dihydroimidazo[4,5-c]pyridin-4-one Chemical compound O=C1NC(N)=CC2=C1N=CN2 KXBCLNRMQPRVTP-UHFFFAOYSA-N 0.000 description 1
- DCPSTSVLRXOYGS-UHFFFAOYSA-N 6-amino-1h-pyrimidine-2-thione Chemical compound NC1=CC=NC(S)=N1 DCPSTSVLRXOYGS-UHFFFAOYSA-N 0.000 description 1
- LOSIULRWFAEMFL-UHFFFAOYSA-N 7-deazaguanine Chemical compound O=C1NC(N)=NC2=C1CC=N2 LOSIULRWFAEMFL-UHFFFAOYSA-N 0.000 description 1
- HRYKDUPGBWLLHO-UHFFFAOYSA-N 8-azaadenine Chemical compound NC1=NC=NC2=NNN=C12 HRYKDUPGBWLLHO-UHFFFAOYSA-N 0.000 description 1
- LPXQRXLUHJKZIE-UHFFFAOYSA-N 8-azaguanine Chemical compound NC1=NC(O)=C2NN=NC2=N1 LPXQRXLUHJKZIE-UHFFFAOYSA-N 0.000 description 1
- 229960005508 8-azaguanine Drugs 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- 101800002011 Amphipathic peptide Proteins 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- 102000005427 Asialoglycoprotein Receptor Human genes 0.000 description 1
- BSYNRYMUTXBXSQ-UHFFFAOYSA-N Aspirin Chemical compound CC(=O)OC1=CC=CC=C1C(O)=O BSYNRYMUTXBXSQ-UHFFFAOYSA-N 0.000 description 1
- DWRXFEITVBNRMK-UHFFFAOYSA-N Beta-D-1-Arabinofuranosylthymine Natural products O=C1NC(=O)C(C)=CN1C1C(O)C(O)C(CO)O1 DWRXFEITVBNRMK-UHFFFAOYSA-N 0.000 description 1
- 229940122361 Bisphosphonate Drugs 0.000 description 1
- 206010006187 Breast cancer Diseases 0.000 description 1
- 208000026310 Breast neoplasm Diseases 0.000 description 1
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 1
- QGOOVYDNNMBCPD-UHFFFAOYSA-N C1(CC1)OP(O)=O Chemical compound C1(CC1)OP(O)=O QGOOVYDNNMBCPD-UHFFFAOYSA-N 0.000 description 1
- JMDJVWXCQJZDSH-UHFFFAOYSA-N COCCCP(=O)(O)O Chemical compound COCCCP(=O)(O)O JMDJVWXCQJZDSH-UHFFFAOYSA-N 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- KXDHJXZQYSOELW-UHFFFAOYSA-M Carbamate Chemical compound NC([O-])=O KXDHJXZQYSOELW-UHFFFAOYSA-M 0.000 description 1
- 239000004380 Cholic acid Substances 0.000 description 1
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 1
- 239000004971 Cross linker Substances 0.000 description 1
- MIKUYHXYGGJMLM-GIMIYPNGSA-N Crotonoside Natural products C1=NC2=C(N)NC(=O)N=C2N1[C@H]1O[C@@H](CO)[C@H](O)[C@@H]1O MIKUYHXYGGJMLM-GIMIYPNGSA-N 0.000 description 1
- CKLJMWTZIZZHCS-UHFFFAOYSA-N D-OH-Asp Natural products OC(=O)C(N)CC(O)=O CKLJMWTZIZZHCS-UHFFFAOYSA-N 0.000 description 1
- 150000008574 D-amino acids Chemical class 0.000 description 1
- NYHBQMYGNKIUIF-UHFFFAOYSA-N D-guanosine Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(CO)C(O)C1O NYHBQMYGNKIUIF-UHFFFAOYSA-N 0.000 description 1
- KVSNMTUIMXZPLU-UHFFFAOYSA-N D:A-friedo-oleanane Natural products CC12CCC3(C)C4CC(C)(C)CCC4(C)CCC3(C)C2CCC2(C)C1CCCC2C KVSNMTUIMXZPLU-UHFFFAOYSA-N 0.000 description 1
- NOOLISFMXDJSKH-UHFFFAOYSA-N DL-menthol Chemical group CC(C)C1CCC(C)CC1O NOOLISFMXDJSKH-UHFFFAOYSA-N 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 241000724228 Enterobacteria phage RB69 Species 0.000 description 1
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 1
- 241001302584 Escherichia coli str. K-12 substr. W3110 Species 0.000 description 1
- GHASVSINZRGABV-UHFFFAOYSA-N Fluorouracil Chemical compound FC1=CNC(=O)NC1=O GHASVSINZRGABV-UHFFFAOYSA-N 0.000 description 1
- JUUHNUPNMCGYDT-UHFFFAOYSA-N Friedelin Natural products CC1CC2C(C)(CCC3(C)C4CC(C)(C)CCC4(C)CCC23C)C5CCC(=O)C(C)C15 JUUHNUPNMCGYDT-UHFFFAOYSA-N 0.000 description 1
- 108010087294 GALA peptide Proteins 0.000 description 1
- 102000006395 Globulins Human genes 0.000 description 1
- 108010044091 Globulins Proteins 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 101001005668 Homo sapiens Mastermind-like protein 3 Proteins 0.000 description 1
- UGQMRVRMYYASKQ-UHFFFAOYSA-N Hypoxanthine nucleoside Natural products OC1C(O)C(CO)OC1N1C(NC=NC2=O)=C2N=C1 UGQMRVRMYYASKQ-UHFFFAOYSA-N 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- JVTAAEKCZFNVCJ-UHFFFAOYSA-M Lactate Chemical compound CC(O)C([O-])=O JVTAAEKCZFNVCJ-UHFFFAOYSA-M 0.000 description 1
- 239000006137 Luria-Bertani broth Substances 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 102100025134 Mastermind-like protein 3 Human genes 0.000 description 1
- 108010007013 Melanocyte-Stimulating Hormones Proteins 0.000 description 1
- 102000015728 Mucins Human genes 0.000 description 1
- 108010063954 Mucins Proteins 0.000 description 1
- SGSSKEDGVONRGC-UHFFFAOYSA-N N(2)-methylguanine Chemical compound O=C1NC(NC)=NC2=C1N=CN2 SGSSKEDGVONRGC-UHFFFAOYSA-N 0.000 description 1
- OVBPIULPVIDEAO-UHFFFAOYSA-N N-Pteroyl-L-glutaminsaeure Natural products C=1N=C2NC(N)=NC(=O)C2=NC=1CNC1=CC=C(C(=O)NC(CCC(O)=O)C(O)=O)C=C1 OVBPIULPVIDEAO-UHFFFAOYSA-N 0.000 description 1
- BAWFJGJZGIEFAR-NNYOXOHSSA-O NAD(+) Chemical compound NC(=O)C1=CC=C[N+]([C@H]2[C@@H]([C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 BAWFJGJZGIEFAR-NNYOXOHSSA-O 0.000 description 1
- 108700019961 Neoplasm Genes Proteins 0.000 description 1
- 102000048850 Neoplasm Genes Human genes 0.000 description 1
- 229910004679 ONO2 Inorganic materials 0.000 description 1
- 235000021314 Palmitic acid Nutrition 0.000 description 1
- 108020002230 Pancreatic Ribonuclease Proteins 0.000 description 1
- 102000005891 Pancreatic ribonuclease Human genes 0.000 description 1
- PCNDJXKNXGMECE-UHFFFAOYSA-N Phenazine Natural products C1=CC=CC2=NC3=CC=CC=C3N=C21 PCNDJXKNXGMECE-UHFFFAOYSA-N 0.000 description 1
- 101150005409 Pmch gene Proteins 0.000 description 1
- 239000004698 Polyethylene Substances 0.000 description 1
- 108010020346 Polyglutamic Acid Proteins 0.000 description 1
- 239000004743 Polypropylene Substances 0.000 description 1
- 239000004793 Polystyrene Substances 0.000 description 1
- 102000007327 Protamines Human genes 0.000 description 1
- 108010007568 Protamines Proteins 0.000 description 1
- 101710149951 Protein Tat Proteins 0.000 description 1
- 101000781681 Protobothrops flavoviridis Disintegrin triflavin Proteins 0.000 description 1
- 102000007615 Pulmonary Surfactant-Associated Protein A Human genes 0.000 description 1
- 108010007100 Pulmonary Surfactant-Associated Protein A Proteins 0.000 description 1
- 102000000574 RNA-Induced Silencing Complex Human genes 0.000 description 1
- 108010016790 RNA-Induced Silencing Complex Proteins 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 1
- 241000256248 Spodoptera Species 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 101100054666 Streptomyces halstedii sch3 gene Proteins 0.000 description 1
- 229940100389 Sulfonylurea Drugs 0.000 description 1
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical compound [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 description 1
- UCKMPCXJQFINFW-UHFFFAOYSA-N Sulphide Chemical compound [S-2] UCKMPCXJQFINFW-UHFFFAOYSA-N 0.000 description 1
- 108010061174 Thyrotropin Proteins 0.000 description 1
- 102000011923 Thyrotropin Human genes 0.000 description 1
- RTMWIZOXNKJHRE-UHFFFAOYSA-N Tigogenin Natural products CC1COC2CC(C)(OC12)C3CCC4C5CCC6CC(O)CCC6(C)C5CCC34C RTMWIZOXNKJHRE-UHFFFAOYSA-N 0.000 description 1
- DWCSNWXARWMZTG-UHFFFAOYSA-N Trigonegenin A Natural products CC1C(C2(CCC3C4(C)CCC(O)C=C4CCC3C2C2)C)C2OC11CCC(C)CO1 DWCSNWXARWMZTG-UHFFFAOYSA-N 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 229930003779 Vitamin B12 Natural products 0.000 description 1
- 229930003427 Vitamin E Natural products 0.000 description 1
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- XVIYCJDWYLJQBG-UHFFFAOYSA-N acetic acid;adamantane Chemical compound CC(O)=O.C1C(C2)CC3CC1CC2C3 XVIYCJDWYLJQBG-UHFFFAOYSA-N 0.000 description 1
- 229960001138 acetylsalicylic acid Drugs 0.000 description 1
- 125000000641 acridinyl group Chemical class C1(=CC=CC2=NC3=CC=CC=C3C=C12)* 0.000 description 1
- 229960005305 adenosine Drugs 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 125000003172 aldehyde group Chemical group 0.000 description 1
- 150000001336 alkenes Chemical class 0.000 description 1
- 125000002877 alkyl aryl group Chemical group 0.000 description 1
- 125000005600 alkyl phosphonate group Chemical group 0.000 description 1
- 239000002168 alkylating agent Substances 0.000 description 1
- 229940100198 alkylating agent Drugs 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 150000001409 amidines Chemical class 0.000 description 1
- 150000001412 amines Chemical class 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 125000005122 aminoalkylamino group Chemical group 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 1
- 235000011130 ammonium sulphate Nutrition 0.000 description 1
- 229960003473 androstanolone Drugs 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 239000012736 aqueous medium Substances 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 210000004507 artificial chromosome Anatomy 0.000 description 1
- 125000003710 aryl alkyl group Chemical group 0.000 description 1
- 108010006523 asialoglycoprotein receptor Proteins 0.000 description 1
- 229960005261 aspartic acid Drugs 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 239000003613 bile acid Substances 0.000 description 1
- 239000011230 binding agent Substances 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 239000012472 biological sample Substances 0.000 description 1
- 150000004663 bisphosphonates Chemical class 0.000 description 1
- 210000002449 bone cell Anatomy 0.000 description 1
- CKDOCTFBFTVPSN-UHFFFAOYSA-N borneol Chemical group C1CC2(C)C(C)CC1C2(C)C CKDOCTFBFTVPSN-UHFFFAOYSA-N 0.000 description 1
- 229940116229 borneol Drugs 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 125000002091 cationic group Chemical group 0.000 description 1
- 239000006143 cell culture medium Substances 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 239000002738 chelating agent Substances 0.000 description 1
- 150000005829 chemical entities Chemical class 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 239000003638 chemical reducing agent Substances 0.000 description 1
- BHQCQFFYRZLCQQ-OELDTZBJSA-N cholic acid Chemical compound C([C@H]1C[C@H]2O)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CCC(O)=O)C)[C@@]2(C)[C@@H](O)C1 BHQCQFFYRZLCQQ-OELDTZBJSA-N 0.000 description 1
- 235000019416 cholic acid Nutrition 0.000 description 1
- 229960002471 cholic acid Drugs 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 229910017052 cobalt Inorganic materials 0.000 description 1
- 239000010941 cobalt Substances 0.000 description 1
- GUTLYIVDDKVIGB-UHFFFAOYSA-N cobalt atom Chemical compound [Co] GUTLYIVDDKVIGB-UHFFFAOYSA-N 0.000 description 1
- AGVAZMGAQJOSFJ-WZHZPDAFSA-M cobalt(2+);[(2r,3s,4r,5s)-5-(5,6-dimethylbenzimidazol-1-yl)-4-hydroxy-2-(hydroxymethyl)oxolan-3-yl] [(2r)-1-[3-[(1r,2r,3r,4z,7s,9z,12s,13s,14z,17s,18s,19r)-2,13,18-tris(2-amino-2-oxoethyl)-7,12,17-tris(3-amino-3-oxopropyl)-3,5,8,8,13,15,18,19-octamethyl-2 Chemical compound [Co+2].N#[C-].[N-]([C@@H]1[C@H](CC(N)=O)[C@@]2(C)CCC(=O)NC[C@@H](C)OP(O)(=O)O[C@H]3[C@H]([C@H](O[C@@H]3CO)N3C4=CC(C)=C(C)C=C4N=C3)O)\C2=C(C)/C([C@H](C\2(C)C)CCC(N)=O)=N/C/2=C\C([C@H]([C@@]/2(CC(N)=O)C)CCC(N)=O)=N\C\2=C(C)/C2=N[C@]1(C)[C@@](C)(CC(N)=O)[C@@H]2CCC(N)=O AGVAZMGAQJOSFJ-WZHZPDAFSA-M 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 239000000306 component Substances 0.000 description 1
- 229910052802 copper Inorganic materials 0.000 description 1
- 239000000287 crude extract Substances 0.000 description 1
- 230000001351 cycling effect Effects 0.000 description 1
- KXGVEGMKQFWNSR-UHFFFAOYSA-N deoxycholic acid Natural products C1CC2CC(O)CCC2(C)C2C1C1CCC(C(CCC(O)=O)C)C1(C)C(O)C2 KXGVEGMKQFWNSR-UHFFFAOYSA-N 0.000 description 1
- 238000011033 desalting Methods 0.000 description 1
- 230000000368 destabilizing effect Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000000502 dialysis Methods 0.000 description 1
- WQLVFSAGQJTQCK-VKROHFNGSA-N diosgenin Chemical compound O([C@@H]1[C@@H]([C@]2(CC[C@@H]3[C@@]4(C)CC[C@H](O)CC4=CC[C@H]3[C@@H]2C1)C)[C@@H]1C)[C@]11CC[C@@H](C)CO1 WQLVFSAGQJTQCK-VKROHFNGSA-N 0.000 description 1
- WQLVFSAGQJTQCK-UHFFFAOYSA-N diosgenin Natural products CC1C(C2(CCC3C4(C)CCC(O)CC4=CCC3C2C2)C)C2OC11CCC(C)CO1 WQLVFSAGQJTQCK-UHFFFAOYSA-N 0.000 description 1
- 235000011180 diphosphates Nutrition 0.000 description 1
- 150000002016 disaccharides Chemical class 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- NAGJZTKCGNOGPW-UHFFFAOYSA-N dithiophosphoric acid Chemical class OP(O)(S)=S NAGJZTKCGNOGPW-UHFFFAOYSA-N 0.000 description 1
- DTGKSKDOIYIVQL-UHFFFAOYSA-N dl-isoborneol Chemical group C1CC2(C)C(O)CC1C2(C)C DTGKSKDOIYIVQL-UHFFFAOYSA-N 0.000 description 1
- 238000011143 downstream manufacturing Methods 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000007876 drug discovery Methods 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- 210000002889 endothelial cell Anatomy 0.000 description 1
- 230000003511 endothelial effect Effects 0.000 description 1
- 239000002158 endotoxin Substances 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 125000001495 ethyl group Chemical group [H]C([H])([H])C([H])([H])* 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 230000001747 exhibiting effect Effects 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 210000003722 extracellular fluid Anatomy 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- 229960002949 fluorouracil Drugs 0.000 description 1
- 229940014144 folate Drugs 0.000 description 1
- 229960000304 folic acid Drugs 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- OFMXGFHWLZPCFL-SVRPQWSVSA-N friedelin Chemical compound C([C@H]1[C@]2(C)CC[C@@]34C)C(C)(C)CC[C@]1(C)CC[C@]2(C)[C@H]4CC[C@@]1(C)[C@H]3CCC(=O)[C@@H]1C OFMXGFHWLZPCFL-SVRPQWSVSA-N 0.000 description 1
- MFVJCHSUSSRHRH-UHFFFAOYSA-N friedeline Natural products CC1(C)CCC2(C)CCC3C4(C)CCC5C(C)(C)C(=O)CCC5(C)C4CCC3(C)C2C1 MFVJCHSUSSRHRH-UHFFFAOYSA-N 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 229920000370 gamma-poly(glutamate) polymer Polymers 0.000 description 1
- WIGCFUFOHFEKBI-UHFFFAOYSA-N gamma-tocopherol Natural products CC(C)CCCC(C)CCCC(C)CCCC1CCC2C(C)C(O)C(C)C(C)C2O1 WIGCFUFOHFEKBI-UHFFFAOYSA-N 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 230000030279 gene silencing Effects 0.000 description 1
- 238000012226 gene silencing method Methods 0.000 description 1
- 238000001415 gene therapy Methods 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 150000004676 glycans Chemical class 0.000 description 1
- 150000002336 glycosamine derivatives Chemical class 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 239000008187 granular material Substances 0.000 description 1
- 229940029575 guanosine Drugs 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 210000003494 hepatocyte Anatomy 0.000 description 1
- 125000000623 heterocyclic group Chemical group 0.000 description 1
- 125000000592 heterocycloalkyl group Chemical group 0.000 description 1
- GNOIPBMMFNIUFM-UHFFFAOYSA-N hexamethylphosphoric triamide Chemical compound CN(C)P(=O)(N(C)C)N(C)C GNOIPBMMFNIUFM-UHFFFAOYSA-N 0.000 description 1
- 229960001340 histamine Drugs 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 108091008039 hormone receptors Proteins 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- 150000002460 imidazoles Chemical class 0.000 description 1
- 238000001597 immobilized metal affinity chromatography Methods 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 102000006495 integrins Human genes 0.000 description 1
- 108010044426 integrins Proteins 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 230000002427 irreversible effect Effects 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- FZWBNHMXJMCXLU-BLAUPYHCSA-N isomaltotriose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1OC[C@@H]1[C@@H](O)[C@H](O)[C@@H](O)[C@@H](OC[C@@H](O)[C@@H](O)[C@H](O)[C@@H](O)C=O)O1 FZWBNHMXJMCXLU-BLAUPYHCSA-N 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 238000007169 ligase reaction Methods 0.000 description 1
- 125000005647 linker group Chemical group 0.000 description 1
- 230000029226 lipidation Effects 0.000 description 1
- 229920006008 lipopolysaccharide Polymers 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 239000012139 lysis buffer Substances 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 125000002960 margaryl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])[H] 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 101150024647 mch gene Proteins 0.000 description 1
- 201000001441 melanoma Diseases 0.000 description 1
- 229940041616 menthol Drugs 0.000 description 1
- 239000002207 metabolite Substances 0.000 description 1
- 229910021645 metal ion Inorganic materials 0.000 description 1
- IZAGSTRIDUNNOY-UHFFFAOYSA-N methyl 2-[(2,4-dioxo-1h-pyrimidin-5-yl)oxy]acetate Chemical compound COC(=O)COC1=CNC(=O)NC1=O IZAGSTRIDUNNOY-UHFFFAOYSA-N 0.000 description 1
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 1
- 239000002679 microRNA Substances 0.000 description 1
- 229960004857 mitomycin Drugs 0.000 description 1
- 108091005601 modified peptides Proteins 0.000 description 1
- 239000003607 modifier Substances 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 150000002772 monosaccharides Chemical class 0.000 description 1
- 230000007498 myristoylation Effects 0.000 description 1
- XJVXMWNLQRTRGH-UHFFFAOYSA-N n-(3-methylbut-3-enyl)-2-methylsulfanyl-7h-purin-6-amine Chemical compound CSC1=NC(NCCC(C)=C)=C2NC=NC2=N1 XJVXMWNLQRTRGH-UHFFFAOYSA-N 0.000 description 1
- WQEPLUUGTLDZJY-UHFFFAOYSA-N n-Pentadecanoic acid Natural products CCCCCCCCCCCCCCC(O)=O WQEPLUUGTLDZJY-UHFFFAOYSA-N 0.000 description 1
- QNILTEGFHQSKFF-UHFFFAOYSA-N n-propan-2-ylprop-2-enamide Chemical compound CC(C)NC(=O)C=C QNILTEGFHQSKFF-UHFFFAOYSA-N 0.000 description 1
- 229920005615 natural polymer Polymers 0.000 description 1
- 229910052759 nickel Inorganic materials 0.000 description 1
- 125000001893 nitrooxy group Chemical group [O-][N+](=O)O* 0.000 description 1
- QTNLALDFXILRQO-UHFFFAOYSA-N nonadecane-1,2,3-triol Chemical group CCCCCCCCCCCCCCCCC(O)C(O)CO QTNLALDFXILRQO-UHFFFAOYSA-N 0.000 description 1
- 230000000269 nucleophilic effect Effects 0.000 description 1
- 238000002515 oligonucleotide synthesis Methods 0.000 description 1
- 229920001542 oligosaccharide Polymers 0.000 description 1
- 150000002482 oligosaccharides Chemical class 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 150000007524 organic acids Chemical class 0.000 description 1
- 235000005985 organic acids Nutrition 0.000 description 1
- 229920000620 organic polymer Polymers 0.000 description 1
- 125000001181 organosilyl group Chemical group [SiH3]* 0.000 description 1
- 125000004430 oxygen atom Chemical group O* 0.000 description 1
- 102000002574 p38 Mitogen-Activated Protein Kinases Human genes 0.000 description 1
- 108010068338 p38 Mitogen-Activated Protein Kinases Proteins 0.000 description 1
- MCYTYTUNNNZWOK-LCLOTLQISA-N penetratin Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(N)=O)C1=CC=CC=C1 MCYTYTUNNNZWOK-LCLOTLQISA-N 0.000 description 1
- 239000000863 peptide conjugate Substances 0.000 description 1
- 238000002823 phage display Methods 0.000 description 1
- 230000003285 pharmacodynamic effect Effects 0.000 description 1
- 239000012071 phase Substances 0.000 description 1
- 150000008298 phosphoramidates Chemical class 0.000 description 1
- PTMHPRAIXMAOOB-UHFFFAOYSA-N phosphoramidic acid Chemical compound NP(O)(O)=O PTMHPRAIXMAOOB-UHFFFAOYSA-N 0.000 description 1
- 150000003013 phosphoric acid derivatives Chemical class 0.000 description 1
- 229910052697 platinum Inorganic materials 0.000 description 1
- 229920002627 poly(phosphazenes) Polymers 0.000 description 1
- 229920001467 poly(styrenesulfonates) Polymers 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 108010064470 polyaspartate Proteins 0.000 description 1
- 125000005575 polycyclic aromatic hydrocarbon group Chemical group 0.000 description 1
- 229920000573 polyethylene Polymers 0.000 description 1
- 229920002643 polyglutamic acid Polymers 0.000 description 1
- 229920001155 polypropylene Polymers 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 229920002223 polystyrene Polymers 0.000 description 1
- 229920000166 polytrimethylene carbonate Chemical group 0.000 description 1
- 229920002635 polyurethane Polymers 0.000 description 1
- 239000004814 polyurethane Substances 0.000 description 1
- 150000004032 porphyrins Chemical class 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 230000001124 posttranscriptional effect Effects 0.000 description 1
- 239000001103 potassium chloride Substances 0.000 description 1
- 235000011164 potassium chloride Nutrition 0.000 description 1
- 229910000160 potassium phosphate Inorganic materials 0.000 description 1
- 235000011009 potassium phosphates Nutrition 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 230000002062 proliferating effect Effects 0.000 description 1
- KCXFHTAICRTXLI-UHFFFAOYSA-N propane-1-sulfonic acid Chemical compound CCCS(O)(=O)=O KCXFHTAICRTXLI-UHFFFAOYSA-N 0.000 description 1
- 229940048914 protamine Drugs 0.000 description 1
- 239000012460 protein solution Substances 0.000 description 1
- 230000005588 protonation Effects 0.000 description 1
- 210000001938 protoplast Anatomy 0.000 description 1
- 239000002719 pyrimidine nucleotide Substances 0.000 description 1
- 230000035484 reaction time Effects 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 125000006413 ring segment Chemical group 0.000 description 1
- 238000005185 salting out Methods 0.000 description 1
- 239000000523 sample Substances 0.000 description 1
- COFLCBMDHTVQRA-UHFFFAOYSA-N sapphyrin Chemical compound N1C(C=2NC(C=C3N=C(C=C4NC(=C5)C=C4)C=C3)=CC=2)=CC=C1C=C1C=CC5=N1 COFLCBMDHTVQRA-UHFFFAOYSA-N 0.000 description 1
- 238000007423 screening assay Methods 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 125000003607 serino group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 description 1
- 238000001877 single-ion monitoring Methods 0.000 description 1
- 238000002791 soaking Methods 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 238000010532 solid phase synthesis reaction Methods 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 229940063673 spermidine Drugs 0.000 description 1
- 229940063675 spermine Drugs 0.000 description 1
- 210000000952 spleen Anatomy 0.000 description 1
- 230000000087 stabilizing effect Effects 0.000 description 1
- 125000004079 stearyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])[H] 0.000 description 1
- 125000000547 substituted alkyl group Chemical group 0.000 description 1
- IIACRCGMVDHOTQ-UHFFFAOYSA-N sulfamic acid Chemical group NS(O)(=O)=O IIACRCGMVDHOTQ-UHFFFAOYSA-N 0.000 description 1
- 150000003456 sulfonamides Chemical group 0.000 description 1
- BDHFUVZGWQCTTF-UHFFFAOYSA-M sulfonate Chemical compound [O-]S(=O)=O BDHFUVZGWQCTTF-UHFFFAOYSA-M 0.000 description 1
- 150000003457 sulfones Chemical group 0.000 description 1
- YROXIXLRRCOBKF-UHFFFAOYSA-N sulfonylurea Chemical compound OC(=N)N=S(=O)=O YROXIXLRRCOBKF-UHFFFAOYSA-N 0.000 description 1
- 150000003462 sulfoxides Chemical class 0.000 description 1
- 239000011593 sulfur Substances 0.000 description 1
- 125000004434 sulfur atom Chemical group 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 238000001308 synthesis method Methods 0.000 description 1
- 229920002994 synthetic fiber Polymers 0.000 description 1
- 238000010189 synthetic method Methods 0.000 description 1
- 229920001059 synthetic polymer Polymers 0.000 description 1
- 150000003505 terpenes Chemical class 0.000 description 1
- 235000007586 terpenes Nutrition 0.000 description 1
- TUNFSRHWOTWDNC-HKGQFRNVSA-N tetradecanoic acid Chemical compound CCCCCCCCCCCCC[14C](O)=O TUNFSRHWOTWDNC-HKGQFRNVSA-N 0.000 description 1
- 150000004044 tetrasaccharides Chemical class 0.000 description 1
- 229940104230 thymidine Drugs 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 229960000874 thyrotropin Drugs 0.000 description 1
- 230000001748 thyrotropin Effects 0.000 description 1
- 230000005945 translocation Effects 0.000 description 1
- 238000011282 treatment Methods 0.000 description 1
- 125000000876 trifluoromethoxy group Chemical group FC(F)(F)O* 0.000 description 1
- 125000002023 trifluoromethyl group Chemical group FC(F)(F)* 0.000 description 1
- 150000004043 trisaccharides Chemical class 0.000 description 1
- 150000003648 triterpenes Chemical class 0.000 description 1
- 239000012137 tryptone Substances 0.000 description 1
- 230000034512 ubiquitination Effects 0.000 description 1
- 238000010798 ubiquitination Methods 0.000 description 1
- 238000000825 ultraviolet detection Methods 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- SYFNOXYZEIYOSE-UHFFFAOYSA-N uvaol Natural products CC1CCC2(O)CCC3(C)C(=CCC4(C)C5(C)CCC(O)C(C)(C)C5CCC34C)C2C1C SYFNOXYZEIYOSE-UHFFFAOYSA-N 0.000 description 1
- 238000011179 visual inspection Methods 0.000 description 1
- 235000019163 vitamin B12 Nutrition 0.000 description 1
- 239000011715 vitamin B12 Substances 0.000 description 1
- 235000019165 vitamin E Nutrition 0.000 description 1
- 229940046009 vitamin E Drugs 0.000 description 1
- 239000011709 vitamin E Substances 0.000 description 1
- 150000003722 vitamin derivatives Chemical class 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
- 239000002023 wood Substances 0.000 description 1
- WCNMEQDMUYVWMJ-JPZHCBQBSA-N wybutoxosine Chemical compound C1=NC=2C(=O)N3C(CC([C@H](NC(=O)OC)C(=O)OC)OO)=C(C)N=C3N(C)C=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O WCNMEQDMUYVWMJ-JPZHCBQBSA-N 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
- 229910052725 zinc Inorganic materials 0.000 description 1
- SFVVQRJOGUKCEG-OPQSFPLASA-N β-MSH Chemical compound C1C[C@@H](O)[C@H]2C(COC(=O)[C@@](O)([C@@H](C)O)C(C)C)=CCN21 SFVVQRJOGUKCEG-OPQSFPLASA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/93—Ligases (6)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/52—Genes encoding for enzymes or proenzymes
Definitions
- the present disclosure relates to the field of biotechnology, in particular to engineered double-stranded RNA (dsRNA) ligases and their application in industrial biocatalysis.
- the present disclosure also relates to a process of producing an engineered dsRNA ligase, and to a method for producing an oligonucleotide by contacting oligonucleotide fragments with an engineered dsRNA ligase.
- dsRNA double-stranded RNA
- Therapeutic oligonucleotides including small interfering RNA (siRNA) and inhibitory antisense oligonucleotides (ASOs) have the potential to treat a diverse range of life-threatening diseases.
- small interfering RNA siRNA
- ASOs inhibitory antisense oligonucleotides
- biocatalysis is being more frequently applied in the manufacture of active pharmaceutical ingredients (APIs) since enzymes are capable of highly selective transformation under mild reaction conditions and in aqueous media (Mann, G. & Stanger, F. V. Chimia (Aarau) 74, 407-417 (2020)).
- APIs active pharmaceutical ingredients
- the biocatalysis of short oligonucleotide fragments offers a sustainable and economical alternative to the solid phase chemical synthesis of full- length therapeutic oligonucleotides currently used.
- Shorter oligonucleotides can be synthesized more easily and with higher purities than longer oligonucleotides, simplifying downstream processing and reducing solvent waste. These short oligonucleotide fragments can then be combined using nucleic acid ligases to produce oligonucleotide products. Nucleic acid ligases have shown remarkable tolerance towards unnatural DNA/RNA containing pharmaceutically relevant chemical modifications (Kestemont, D., Herdewijn, P. & Renders, M. Curr Protoc Chem Biol 11, e62 (2019); Kestemont, D. et al. Chemical Communications 54, 6408-6411 (2016); and Nandakumar, J. & Shuman, S.
- dsRNA ligase to synthesize an siRNA product, starting from short fragments ( ⁇ 9 nts), containing extensive chemical modification, including 2’-OMe, 2’-F modified nucleotides, phosphorothioate backbone modified nucleotides and a terminal fragment that is functionalized with a bulky N- acetyl galactosamine (GalNAc) moiety has previously been described (Mann, G. et al. Tetrahedron Letters 93, 153696 (2022)).
- GalNAc bulky N- acetyl galactosamine
- the sequences of oligonucleotides 2, 3 and 5-12 are provided in Table 1.
- FIG. 3 Comparative data showing the relative peak area % of siRNA (1) present in the reaction samples comprising different concentrations of wild-type enzyme (SEQ ID NO: 2) and engineered enzymes (SEQ ID NOs: 288 and 632) following pre-incubation of the enzyme at 4 °C or 37 °C for 4 h.
- Enzyme concentration is provided as g/L of shake-flask powder (SFP) produced by lyophilization of frozen clarified lysate as described in the Examples.
- SFP shake-flask powder
- the present disclosure provides engineered double-stranded RNA (dsRNA) ligase polypeptides.
- dsRNA double-stranded RNA
- the present disclosure also provides gene sequences of engineered polypeptides, recombinant expression vectors comprising the genes, engineered host strains and efficient methods for the production thereof, as well as reaction processes for the biocatalysis of oligonucleotides using engineered polypeptides.
- the engineered double -stranded RNA (dsRNA) ligase polypeptides described herein have improved catalytic activity as compared to the wild-type dsRNA ligase from which they are derived.
- the engineered polypeptides provided herein were derived from a wildtype dsRNA ligase from Bacteriophage RB69.
- the wild-type dsRNA ligase consists of 332 amino acids and has the amino acid sequence shown in SEQ ID NO: 302 (also accessible under accession number Q7Y4V8 in UniProt).
- the disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NOs: 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346,
- dsRNA double-stranded RNA
- the disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NOs: 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, and 668; wherein the engineered dsRNA ligase polypeptide: (a) has dsRNA ligase activity; and (b) does not the comprise the amino acid sequence of SEQ ID NO: 302.
- engineered dsRNA ligase polypeptide (a) has dsRNA ligase activity; and (b) does not the comprise the amino acid sequence of SEQ ID NO: 302.
- the disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 85% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NOs: 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346,
- dsRNA double-stranded RNA
- engineered dsRNA ligase polypeptide (a) has dsRNA ligase activity; and (b) does not the comprise the amino acid sequence of SEQ ID NO: 302.
- dsRNA double-stranded RNA
- ligase polypeptide which is a polypeptide of: (a) a polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354,
- a polypeptide having dsRNA ligase activity which comprises an amino acid sequence having (i) at least 80% sequence identity to one of the polypeptides recited in (a), and (ii) a substitution, deletion, addition or insertion of one or more amino acid residues relative to said one amino acid sequence recited in (a); wherein the engineered dsRNA ligase polypeptide does not the comprise the amino acid sequence of SEQ ID NO: 302.
- the disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide, which is a polypeptide of: (a) a polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, and 668; or (b) a polypeptide having dsRNA ligase activity, which comprises an amino acid sequence having (i) at least 80% sequence identity to one of the polypeptides recited in (a), and (ii) a substitution, deletion, addition or insertion of one or more amino acid residues relative to said one amino acid sequence recited in (a); wherein the engineered dsRNA ligase polypeptide does not the comprise the amino acid sequence of SEQ ID NO: 302.
- dsRNA double-stranded RNA
- ligase polypeptide which is a polypeptide of: (a) a polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354,
- a polypeptide having dsRNA ligase activity which comprises an amino acid sequence having
- the disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NOs: 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346,
- dsRNA double-stranded RNA
- the engineered dsRNA ligase polypeptide has dsRNA ligase activity; and (b) the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X6 is G or E ; X7 is Q; X15 is R, D or E; X19 is Q or D; X29 is N or L; X36 is V; X39 is A; X44 is V; X45 is V; X46 is Y; X47 is E; X49 is G; X51 is L; X53 is Y; X56 is R or A; X57 is S; X60 is T, G or P; X63 is S, Q or G; X64 is R, T, Q,
- the polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 370, 488, 526, 578, 588, 590, and 592. In some embodiments, the polypeptide comprises an amino acid sequence of SEQ ID NO: 666. In some embodiments, the polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 370, 488, 526, 578, 588, 590, 592, and 666.
- the disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NOs: 370, 488, 526, 578, 588, 590, 592, and 666; wherein: (a) the engineered dsRNA ligase polypeptide has dsRNA ligase activity; and (b) the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X15 is D or E; X19 is D; X36 is V; X39 is A; X53 is Y; X185 is K; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A.
- dsRNA double-stranded RNA
- the disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to the amino acid sequence of SEQ ID NO: 666; wherein: (a) the engineered dsRNA ligase polypeptide has dsRNA ligase activity; and (b) the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more (e.g.
- X15 is D; X39 is A; X53 is Y; X185 is K; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A.
- the disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to the amino acid sequence of SEQ ID NO: 370; wherein: (a) the engineered dsRNA ligase polypeptide has dsRNA ligase activity; and (b) the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more (e.g. 2 or more, 3 or more, or all 4) of the following amino acid residues: X36 is V; X39 is A; X218 is N; and X221 is I.
- dsRNA double-stranded RNA
- the disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to the amino acid sequence of SEQ ID NO: 488; wherein: (a) the engineered dsRNA ligase polypeptide has dsRNA ligase activity; and (b) the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more (e.g. 2 or more, or all 3) of the following amino acid residues: X39 is A; X218 is N; and X221 is I.
- dsRNA double-stranded RNA
- the disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to the amino acid sequence of SEQ ID NO: 526; wherein: (a) the engineered dsRNA ligase polypeptide has dsRNA ligase activity; and (b) the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more (e.g. 2 or more, 3 or more, or all 4) of the following amino acid residues: X39 is A; X218 is N; X221 is I; and X255 is C.
- dsRNA double-stranded RNA
- the disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to the amino acid sequence of SEQ ID NO: 578; wherein: (a) the engineered dsRNA ligase polypeptide has dsRNA ligase activity; and (b) the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more (e.g.
- the disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to the amino acid sequence of SEQ ID NO: 588 or 590; wherein: (a) the engineered dsRNA ligase polypeptide has dsRNA ligase activity; and (b) the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more (e.g.
- X15 is D or E; X39 is A; X53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A.
- the disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to the amino acid sequence of SEQ ID NO: 592; wherein: (a) the engineered dsRNA ligase polypeptide has dsRNA ligase activity; and (b) the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more (e.g.
- the polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666 and 668. In some embodiments, the polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 636, 638, 642, 646, 664, and 666.
- the disclosure also provides an engineered dsRNA ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to SEQ ID NO: 302, which produces at least 5% more oligonucleotide product than a dsRNA ligase polypeptide comprising the amino acid sequence of SEQ ID NO: 302 under the same ligation reaction conditions, wherein the engineered dsRNA ligase polypeptide does not the comprise the amino acid sequence of SEQ ID NO: 302.
- the ligation reaction conditions include about 1 pM to about 10 mM oligonucleotide fragment, a source of ATP, about 5 mM to about 100 mM divalent cation, and about 0.5 g/L to about 10 g/L engineered dsRNA ligase polypeptide, pH of about 4.0 to about 8.0, and temperature of about 10 °C to about 50°C.
- the source of ATP comprises ATP, optionally a stoichiometric excess of ATP.
- the source of ATP comprises: (a) polyphosphate kinase (PPK); (b) polyphosphate; and (c) AMP and/or ATP.
- the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X6, X7, X15, X19, X29, X36, X39, X46, X47, X49, X51, X53, X56, X57, X60, X63, X64, X66, X67, X87, X88, X91, X93, X103, X105, X107, X114, X122, X126, X129, X130, X131, X137, X144, X146, X158, X163, X173, X178, X190, X196, X216, X218, X221, X228, X230, X232, X235, X236, X237,
- the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X6, X7, X15, X19, X29, X36, X39, X44, X45, X46, X47, X49, X51, X53, X56, X57, X60, X63, X64, X66, X67, X87, X88, X89, X91, X92, X93, X103, X105, X107, X114, X122, X126, X129, X130, X131, X137, X144, X146, X158, X163, X173, X178, X185, X190, X196, X216, X218, X221, X228, X
- the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X6 is G; X7 is Q; X15 is R, D or E; X19 is Q or D; X29 is N or L; X36 is V; X39 is A; X46 is Y; X47 is E; X49 is G; X51 is L; X53 is Y; X56 is R or A; X57 is S; X60 is T, G or P; X63 is S, Q or G; X64 is R, T, Q, F, G, or M; X66 is F or W; X67 is N; X87 is T, P, K or absent; X88 is C; X91 is S; X93 is G, C, or A; X103 is V, C, Y, or T; X105 is V; X107 is R or T; XI 14 is N; X; X
- the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X6 is G or E; X7 is Q; X15 is R, D or E; X19 is Q or D; X29 is N or L; X36 is V; X39 is A; X44 is V; X45 is V; X46 is Y; X47 is E; X49 is G; X51 is L; X53 is Y; X56 is R or A; X57 is S; X60 is T, G or P; X63 is S, Q or G; X64 is R, T, Q, F, G, or M; X66 is F or W; X67 is N; X87 is T, P, K or absent; X88 is C; X89 is T; X91 is S; X92 is D; X93 is G, C, or A; X103 is V, C, Y,
- the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X15, X19, X36, X39, X53, X218, X221, X237, X251, X255, and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity.
- the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X15 is D or E; X19 is D; X36 is V; X39 is A; X53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A; wherein the numbering refers to SEQ ID NO: 302.
- the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X15, X19, X36, X39, X53, X185, X218, X221, X237, X251, X255, and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity.
- the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X15 is D or E; X19 is D; X36 is V; X39 is A; X53 is Y; X185 is K; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A; wherein the numbering refers to SEQ ID NO: 302.
- the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. 2 or more, 3 or more, or all 4) amino acid residues selected from: X36, X39, X218 and X221, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more (e.g. 2 or more, 3 or more, or all 4) of the following amino acid residues: X36 is V; X39 is A; X218 is N; and X221 is I.
- the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. 2 or more, or all 3) amino acid residues selected from: X39, X218 and X221, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more (e.g. 2 or more, or all 3) of the following amino acid residues: X39 is A; X218 is N; and X221 is I.
- the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. 2 or more, 3 or more, or all 4) amino acid residues selected from: X39, X218, X221 and X255, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more (e.g. 2 or more, 3 or more, or all 4) of the following amino acid residues: X39 is A; X218 is N; X221 is I; and X255 is C.
- the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, or all 8) amino acid residues selected from: X39, X53, X218, X221, X237, X251, X255 and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more (e.g.
- the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, or all 9) amino acid residues selected from: X15, X39, X53, X218, X221, X237, X251, X255 and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more (e.g.
- X15 is E; X39 is A; X 53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A.
- the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, or all 9) amino acid residues selected from: X19, X39, X53, X218, X221, X237, X251, X255 and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more (e.g.
- the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, 9 or more, or all 10) amino acid residues selected from: X15, X39, X53, X185, X218, X221, X237, X251, X255 and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more (e.g.
- X15 is D; X39 is A; X 53 is Y; X185 is K; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A.
- the engineered dsRNA ligase polypeptide comprises a purification tag.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144,
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 602, 604, 606, 608, 610, 612, 614, 616, 618, 620, 622, 624, 626, 628, 630, 632, and 634.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148,
- 294, 296, 298, 300 602, 604, 606, 608, 610, 612, 614, 616, 618, 620, 622, 624, 626, 628,
- the disclosure also provides a polypeptide immobilized on a solid material by chemical bond or a physical adsorption method, wherein the polypeptide comprises an engineered dsRNA ligase polypeptide described herein.
- the disclosure also provides a polynucleotide encoding the engineered dsRNA ligase polypeptide described herein.
- the polynucleotide comprises a nucleic acid sequence selected from SEQ ID NOs: 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167,
- the polynucleotide comprises a nucleic acid sequence selected from SEQ ID NOs: 601, 603, 605, 607, 609, 611, 613, 615, 617, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 649, 651, 653, 655, 657, 659, 661, 663, 665, and 667.
- the polynucleotide comprises a nucleic acid sequence selected from: (a) SEQ ID NOs: 303, 305, 307, 309, 311, 313, 315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 351, 353, 355, 357, 359, 361, 363, 365,
- the disclosure also provides an expression vector comprising the polynucleotide described herein.
- the vector comprises a plasmid, a cosmid, a bacteriophage or a viral vector.
- the disclosure also provides a host cell comprising the polynucleotide described herein or the expression vector described herein.
- the host cell is E. coli.
- the disclosure also provides a method of preparing an engineered dsRNA ligase polypeptide, which comprises the steps of culturing the host cell described herein and obtaining an engineered dsRNA ligase polypeptide from the culture.
- the disclosure also provides an engineered dsRNA ligase catalyst obtainable by culturing the host cells described herein, or according to the method described herein, wherein said engineered dsRNA ligase catalyst comprises cells or culture fluid containing the engineered dsRNA ligase polypeptides, or an article processed therewith, wherein the article refers to an extract obtained from the culture of host cell, an isolated product obtained by isolating or purifying an engineered dsRNA ligase from the extract, or an immobilized product obtained by immobilizing host cell, an extract thereof, or isolated product of the extract.
- the source of ATP comprises ATP.
- the source of ATP comprises: (a) polyphosphate kinase (PPK); (b) polyphosphate; and (c) AMP and/or ATP.
- the PPK is selected from PPK 12 or ajPAP.
- the method is performed using a sub-stoichiometric concentration of AMP and/or ATP.
- the polyphosphate is a polyphosphate salt.
- the polyphosphate salt is sodium polyphosphate (Maddrell’s salt) or sodium hexametaphosphate (Graham’s salt).
- the divalent cation cofactor is Mg 2+ or Mn 2+ .
- the method is performed with a divalent cation concentration of 5-100 mM, optionally 30-50 mM.
- the method further comprises a step of purifying the oligonucleotide.
- the disclosure also provides use of the engineered dsRNA ligase polypeptide described herein in the production of an oligonucleotide from two or more oligonucleotide fragments.
- each of the oligonucleotide fragments are 4-16 nucleotides in length, optionally 6-9 nucleotides in length.
- one or more of the oligonucleotide fragment(s) comprises one or two overhangs.
- one or more of the oligonucleotide fragments comprises a chemical modification.
- the chemical modification is selected from: (a) a modified backbone, optionally selected from a phosphorothioate (e.g.
- a modified nucleotide optionally selected from 2'-O-methyl (2’-OMe), 2'-flouro (2’-F), 2'-deoxy, 2'-deoxy-2’- fluoro, 2'-O-methoxyethyl (2'-O-MOE), 2'-O-aminopropyl (2'-O-AP), 2'-O- dimethylaminoethyl (2'-O-DMAOE), 2'-O-dimethylaminopropyl (2'-O-DMAP), 2'-O- dimethylaminoethyloxyethyl (2'-O-DMAEOE), 2'-O-N-methylacetamido (2'-0-NMA), locked nucleic acid (LNA), glycol nucleic acid (GNA), phosphoramidate (e.g.
- LNA locked nucleic acid
- GAA glycol nucleic acid
- phosphoramidate e.g.
- ligand comprises one or more N-Acetylgalactosamine (GalNAc) derivatives.
- GalNAc N-Acetylgalactosamine
- the disclosure also provides a composition
- a composition comprising: i. the engineered dsRNA ligase polypeptide described herein; ii. a source of ATP; and iii. a divalent cation.
- the composition further comprises two or more oligonucleotide fragments.
- the disclosure also provides a kit comprising: i. the engineered dsRNA ligase polypeptide described herein; ii. a source of ATP; iii. a divalent cation; and iv. instructions for use in a method of producing an oligonucleotide from two or more oligonucleotide fragments.
- the source of ATP comprises ATP.
- the source of ATP comprises: (a) polyphosphate kinase (PPK); (b) polyphosphate; and (c) AMP and/or ATP.
- the PPK is selected from PPK 12 or ajPAP.
- the polyphosphate is a polyphosphate salt.
- the polyphosphate salt is sodium polyphosphate (Maddrell’s salt) or sodium hexametaphosphate (Graham’s salt).
- the divalent cation cofactor is Mg 2+ or Mn 2+ .
- articles such as “a” and “an” refer to one or more than one (at least one) of the grammatical object of the article.
- the term “about” typically refers to the value which immediately follows the term ‘about’.
- “about 15 or more nucleotides” typically refers to 15 or more nucleotides.
- the term “about” embraces values which are +/- 1, 2 or 3 of the stated value.
- “about 15 or more nucleotides” may refer to 15+/-3 nucleotides, e.g. 12, 13, 14, 15, 16, 17 or 18 nucleotides.
- double-stranded RNA ligase” and “dsRNA ligase” are used interchangeably herein to refer to an enzyme having dsRNA ligase activity.
- a dsRNA ligase polypeptide may also be referred to herein as a “dsRNA ligase catalyst”.
- a dsRNA ligase of the invention is an ATP-dependent nucleic acid ligase.
- dsRNA ligase activity typically involves the ATP-dependent formation of a covalent bond between the 3 ’-OH of a ribonucleotide and the 5’-PO4 of a ribonucleotide or deoxyribonucleotide via the following steps: (1) dsRNA ligase reacts with ATP to form a covalent dsRNA ligase-AMP intermediate and release pyrophosphate; (2) AMP is transferred from the dsRNA ligase-AMP intermediate to the 5 ’-phosphate of a 3’ oligonucleotide fragment forming an adenylated oligonucleotide intermediate; and (3) the 3 ’-OH of a 5’ oligonucleotide fragment attacks the 5 ’ phosphate of the adenylated intermediate resulting in the formation of a phosphodiester bond
- the stoichiometric concentration of cofactor is the theoretical concentration required to achieve complete ligation in a given ligation reaction.
- the skilled person can readily derive the stoichiometric concentration of ATP required to achieve complete ligation based on the concentration of oligonucleotide fragments and the number of ligation reactions required to produce the oligonucleotide product. For example, a ligation reaction using 1 mM substrate which requires four ligation reactions has a stoichiometric ATP concentration of 4 mM. A stoichiometric excess of ATP can help ensure that complete ligation is achieved.
- the stoichiometric excess is at least 105% of the theoretical stoichiometric concentration of ATP required to achieve complete ligation, e.g. at least 110%, at least 115%, at least 120%, at least 125%, at least 130%, at least 135%, at least 140%, at least 145%, at least 150%, at least 160%, at least 170%, at least 180%, at least 190%, or at least 200%.
- engineered dsRNA ligase engineered dsRNA ligase polypeptide
- improved dsRNA ligase polypeptide engineered polypeptide
- oligonucleotide refers to a nucleic acid, typically comprising up to 100 nucleotides.
- oligonucleotide product refers to an oligonucleotide formed by the ligation of two or more oligonucleotide fragments by a dsDNA ligase described herein. Oligonucleotide products are also referred to herein simply as oligonucleotides. It will be understood that oligonucleotide products described herein comprise RNA. It will also be understood that oligonucleotide products described herein comprise a double-stranded region.
- oligonucleotide products described herein comprise RNA and DNA.
- a portion of the oligonucleotide product may be double-stranded DNA, while another portion is double-stranded RNA, forming a DNA-RNA chimera.
- RNAi RNA interference
- ASO antisense oligonucleotides
- RNAi is a post-transcriptional, targeted gene-silencing technique that uses RNAi agents to degrade messenger RNA (mRNA) containing the same sequence as the RNAi agent.
- ASOs are single-stranded nucleic acids that can be used to target mRNA derived from a gene of interest. ASOs can alter gene expression via a number of mechanisms including direct steric blockage of mRNA and ribonuclease H (RNase H) mediated degradation of mRNA.
- RNAi agents include, as non-limiting examples, siRNAs (small interfering RNAs), dsRNAs (double-stranded RNAs), shRNAs (short hairpin RNAs) and miRNAs (micro RNAs).
- RNAi agents also include, as additional non-limiting examples, locked nucleic acid (LNA), Morpholino, UNA, threose nucleic acid (TNA), glycol nucleic acid (GNA), peptide nucleic acid (PNA) and fluoro-arabinonucleic acid (FANA).
- RNAi agents also include molecules in which one or more strands are a mixture of RNA, DNA, LNA, Morpholino, UNA (unlocked nucleic acid), TNA, GNA, and/or FANA.
- one or both strands of an RNAi agent could be, for example, RNA, except that one or more RNA nucleotides is replaced by DNA, LNA, Morpholino, UNA, TNA, GNA, and/or FANA, etc.
- one or both strands of the RNAi agent can be nicked, and both strands can be the same length, or one strand can be shorter than the other.
- the oligonucleotide of the invention may be any of the RNAi agents described herein.
- oligonucleotide fragment refers to a nucleic acid that can be ligated to one or more additional oligonucleotide fragments to provide an oligonucleotide (or oligonucleotide product). Each oligonucleotide fragment corresponds to a portion of the oligonucleotide product. Oligonucleotide fragments may be referred to herein as “substrates” of the ligation reaction.
- dsRNA ligase activity involves the ligation of a 5’ oligonucleotide fragment to a 3 ’ oligonucleotide fragment.
- the prefixes 5’ and 3’ refer to the relative position of each oligonucleotide fragment in the oligonucleotide product after ligation, wherein the 5’ oligonucleotide fragment is located upstream of the 3 ’ oligonucleotide fragment (when the oligonucleotide product is presented in the 5’ to 3’ direction).
- a “5’ oligonucleotide fragment” typically comprises a 3’ terminal ribonucleotide having a 3 ’-hydroxyl group.
- a “3’ oligonucleotide fragment” comprises a 5’-phosphate, wherein the 5’ terminal nucleotide is a deoxyribonucleotide or a ribonucleotide.
- an oligonucleotide fragment may be a 3’ oligonucleotide fragment and a 5’ oligonucleotide fragment (e.g. wherein ligation reactions occur at the 5’ and 3’ ends of the oligonucleotide fragment).
- said oligonucleotide fragment may provide: (i) the 3’ oligonucleotide fragment in a ligation reaction with a 5’ oligonucleotide fragment; and (ii) the 5 ’ oligonucleotide fragment in a ligation reaction with a 3’ oligonucleotide fragment.
- oligonucleotide fragment 7 in Figure 1A provides: (i) the 3’ oligonucleotide fragment in a ligation reaction with 5’ oligonucleotide fragment 6 and; (ii) the 5 ’ oligonucleotide fragment in a ligation reaction with 3 ’ oligonucleotide fragment 12 to provide oligonucleotide product 2.
- a “terminal oligonucleotide fragment” herein refers to a nucleic acid that corresponds to an end (e.g. 5’ or 3’ end) portion of the oligonucleotide product.
- the 5’ terminal oligonucleotide fragment typically provides a 5’ oligonucleotide fragment for ligation to a 3’ oligonucleotide fragment.
- the 3’ terminal oligonucleotide fragment typically provides a 3’ oligonucleotide fragment for ligation to a 5 ’ oligonucleotide fragment.
- the 5’ terminal oligonucleotide is ligated directly to the 3’ terminal oligonucleotide.
- the 5 ’ terminal oligonucleotide and the 3 ’ terminal oligonucleotide are separated by one or more oligonucleotide fragments.
- oligonucleotide fragments described herein comprise RNA and DNA.
- a portion of an oligonucleotide fragment may be double-stranded DNA, while another portion is double-stranded RNA, forming a DNA-RNA chimera.
- overhang refers to at least one unpaired nucleotide that protrudes from the end of at least one of the two strands of a double -stranded oligonucleotide.
- this forms a nucleotide overhang, e.g., the unpaired nucleotide(s) form the overhang.
- An overhang that is complementary to the overhang of a second oligonucleotide fragment may be referred to as a “sticky end”.
- the oligonucleotide fragments described herein may have one or two sticky ends.
- Double -stranded nucleic acids comprise two anti-parallel and substantially complementary nucleic acid strands which are referred to as “sense” and “antisense” strands.
- the “antisense strand” refers to the strand of an RNAi which includes a region that is substantially complementary to a target sequence, e.g. an mRNA sequence.
- the “sense strand” refers to the strand of an RNAi that includes a region that is substantially complementary to a region of the antisense strand.
- the sense and antisense strands of an RNAi agent may be referred to as the passenger and guide strands, respectively.
- Sequences that are “substantially complementary” may be fully complementary or may contain one or more mismatches upon hybridization, while retaining the ability to hybridize under the conditions most relevant to their ultimate application.
- Conversion refers to the enzymatic transformation of a substrate to the corresponding product.
- Percent conversion or “conversion” refers to the percentage of oligonucleotide fragments that is converted to oligonucleotide product within a defined period of time under specified conditions.
- enzymatic activity or “activity” of a ligase can be expressed as the “percent conversion” of oligonucleotide fragments to oligonucleotide product.
- £p, £s and £i the extinction coefficient of the product, substrate, and intermediate oligonucleotides respectively.
- it is possible to resolve at least one substrate, reaction intermediate and product such as well- defined GalNAc-containing oligonucleotides, including GalNAc containing substrate fragments (e.g. oligonucleotide (12) as used in the examples described herein), reaction intermediates (e.g.
- AU arbitrary units
- Improved enzyme properties refers to an enzyme property that is better or more desirable for a specific purpose as compared to a reference dsRNA ligase such as a wild-type dsRNA ligase or another engineered dsRNA ligase under the same reaction conditions. Improved enzyme properties are exhibited by engineered dsRNA ligase polypeptides in this disclosure. The engineered dsRNA ligase polypeptides described herein exhibit increased enzyme activity (which can be expressed as a percentage of substrate conversion). Additional enzyme properties that may be improved include, but are not limited to, thermal stability, pH activity characteristics, cofactor requirements, and tolerance to inhibitors (e.g., reaction component, substrate or product inhibition).
- isolated polypeptide refers to a polypeptide that is substantially separated from other substances with which it is naturally associated, such as proteins, lipids, and polynucleotides.
- the term comprises polypeptides that have been removed or purified from their naturally occurring environment or expression system (e.g., in host cells or in vitro synthesis).
- Engineered dsRNA ligase polypeptides may be present in the cell, in the cell culture medium, or prepared in various forms, such as lysates or isolated preparations.
- the engineered dsRNA ligase polypeptide may be an isolated polypeptide.
- Wild-type refers to the form found in nature.
- a wild-type polypeptide or polynucleotide sequence is a sequence that is present in an organism that can be isolated from sources in nature, and which has not been intentionally modified by manual procedures.
- the polypeptide sequence of the wild-type dsRNA ligase described herein is provided by SEQ ID NO: 302.
- the wild-type sequence may also comprise a purification tag and may be provided by SEQ ID NO: 2.
- polynucleotide and “nucleic acid” are used interchangeably herein.
- protein protein
- polypeptide and “peptide” are used interchangeably herein to denote a polymer of at least two amino acids covalently linked by an amide bond, regardless of length or post-translational modification (e.g., glycosylation, phosphorylation, lipidation, myristoylation, ubiquitination, etc.).
- post-translational modification e.g., glycosylation, phosphorylation, lipidation, myristoylation, ubiquitination, etc.
- Recombinant or “engineered” when used with reference to, for example, a cell, nucleic acid or polypeptide refers to a material or material corresponding to the native or native form of the material, that has been modified in a manner that would not otherwise exist in nature, or is identical thereto but produced or derived from synthetic material and/or by manipulation using recombinant techniques.
- the amino acid may be in either the L- or D-configuration about a-carbon (Ca).
- “Ala” designates alanine without specifying the configuration about the a-carbon
- “D-Ala” and “L- Ala” designate D-alanine and L-alanine, respectively.
- upper case letters designate amino acids in the L-configuration about the a-carbon and lower-case letters designate amino acids in the D-configuration about the a-carbon.
- A designates L-alanine
- a designates D-alanine.
- nucleotides used for the genetically encoding nucleotides are conventional and are as follows: adenosine (A); guanosine (G); cytidine (C); thymidine (T); and uridine (U).
- the abbreviated nucleotides may be either ribonucleotides or 2 ’-deoxyribonucleotides.
- the nucleotides may be specified as being either ribonucleotides or 2 ’-deoxyribonucleotides on an individual basis or on an aggregate basis.
- guanine, cytosine, adenine, and uracil may be replaced by other moieties without substantially altering the base pairing properties of an oligonucleotide comprising a nucleotide bearing such replacement moiety.
- a nucleotide comprising inosine as its base may base pair with nucleotides containing adenine, cytosine, or uracil.
- nucleotides containing uracil, guanine, or adenine may be replaced in the nucleotide sequences of oligonucleotides featured in the present disclosure by a nucleotide containing, for example, inosine.
- adenine and cytosine anywhere in the oligonucleotide can be replaced with guanine and uracil, respectively to form Wobble base pairing with the target mRNA.
- amino acid difference or “residue difference” refers to the difference in amino acid residues at a position of a polypeptide sequence relative to the amino acid residue at a corresponding position in the reference sequence.
- the positions of amino acid differences are generally referred to herein as "Xn", where n refers to the corresponding position in the reference sequence on which the residue differences are based.
- a residue difference at position X6 as compared to SEQ ID NO: 302 refers to a difference in amino acid residue at the polypeptide position corresponding to position 6 of SEQ ID NO: 302.
- a residue difference at position X2 as compared to SEQ ID NO: 302 refers to an amino acid substitution to any residue other than serine at the position of the polypeptide corresponding to position 6 of SEQ ID NO: 302.
- the specific amino acid residue difference at the position may be indicated as “XnY” or “Xn is Y”, wherein “Xn” specifies the corresponding position in the reference sequence as described above, and “Y” is the single letter identifier of the residue present at that position in the engineered polypeptide.
- Specific amino acid differences may also be denoted by the conventional notation "AnY”, where A is a single letter identifier of the residue in the reference sequence, "n” is the number of residue position in the reference sequence, and “Y” is the single letter identifier of the residue present at that position in the engineered polypeptide.
- an engineered polypeptide of this disclosure may comprise one or more amino acid residue differences relative to a reference sequence, which is indicated by a list of specific positions at which residue differences are present relative to a reference sequence.
- more than one amino acid residue can be used in a specific residue position of an engineered polypeptide, the various amino acid residues can be listed as alternatives, e.g. “X19 is Q or D”.
- Deletion of an amino acid may be represented by e.g. “an amino acid sequence comprising Xn-” indicates that the amino acid sequence contains a deletion at the position corresponding to “Xn” in the reference sequence. “Deletion” refers to the modification of a polypeptide by removing one or more amino acids from a reference polypeptide.
- Deletions can include the removal of one or more amino acids, two or more amino acids, five or more amino acids, ten or more amino acids, fifteen or more amino acids, or twenty or more amino acids, up to 10% of the total number of amino acids of the enzyme, or up to 20% of the total number of amino acids making up the reference enzyme while retaining the enzymatic activity of the engineered dsRNA ligase and/or retaining the improved properties of the engineered dsRNA ligase. Deletion may involve the internal portion and/or the terminal portion of the polypeptide. In various embodiments, deletions may include a contiguous segment or may be discontinuous.
- corresponding to refers to the numbering of the residues of a specified reference when the given amino acid or polynucleotide sequence is compared to the reference sequence.
- residue number or residue position of a given sequence is designated with respect to the reference sequence, rather than by the actual numerical position of the residue within the given amino acid or polynucleotide sequence.
- a given amino acid sequence such as an engineered dsRNA ligase can be aligned to a reference sequence by introducing gaps to optimize residue matches between the two sequences. In these cases, although there are gaps, the numbering of the residue in the given amino acid or polynucleotide sequence is made with respect to the reference sequence to which it has been aligned.
- Reference sequence refers to a defined sequence that is used as a basis for sequence comparison.
- the reference sequence may be a subset of a larger sequence, for example, a full-length gene or a fragment of a polypeptide sequence.
- a “reference sequence” is a wild-type sequence.
- a “reference sequence” is an engineered or altered sequences.
- a sequence having a defined number of contiguous nucleotides or amino acids may be aligned with a nucleic acid or peptide sequence (having the same number of contiguous nucleotides or amino acids) from the corresponding portion of a nucleic acid or peptide sequence disclosed herein.
- the percentage sequence identity can be calculated by determining the number of positions at which either the identical nucleic acid base or amino acid residue occurs in both sequences, or a nucleic acid base or amino acid residue is aligned with a gap to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the sequence and multiplying the result by 100 to yield the percentage of sequence identity.
- Those skilled in the art will appreciate that there are many established algorithms available to align two sequences. The optimal alignment of sequences for comparison can be conducted, for example, by the local homology algorithm of Smith and Waterman, 1981, Adv. Appl. Math. 2: 482, by the Homology alignment algorithm of Needleman and Wunsch, 1970, J. Mol. Biol.
- HSPs high scoring sequence pairs
- T some positive-valued threshold scores
- the cumulative scores are calculated using the parameters M (reward score for matched pair of residues; always> 0) and N (penalty score for mismatched residues; always ⁇ 0).
- M forward score for matched pair of residues; always> 0
- N penalty score for mismatched residues; always ⁇ 0.
- a scoring matrix is used to calculate the cumulative score. The extension of the word hits in each direction are halted when: the cumulative alignment score falls off by the quality X from its maximum achieved value; the cumulative score goes 0 or below, due to the accumulation of one or more negative -scoring residue alignments; or the end of either sequence is reached.
- the BLAST algorithm parameters W, T and X determine the sensitivity and speed of the alignment.
- the BLASTP program uses as defaults the word length (W) of 3, the expected value (E) of 10 and the BLOSUM62 scoring matrix (see Henikoff and Henikoff, 1989, Proc Natl Acad Sci USA 89: 10915).
- Exemplary determination of sequence alignments and %sequence identity can employ the BESTFIT or GAP programs in the GCG Wisconsin Software package (Accelrys, Madison WI), using the default parameters provided.
- an engineered dsRNA ligase possesses dsRNA ligase activity.
- Suitable reaction conditions refer to those conditions (e.g., enzyme loading, substrate loading, temperature, pH, etc.) in the reaction system, under which the substrate is converted to the desired product. Suitable reaction conditions can be readily identified by the person skilled in the art. Exemplary “suitable reaction conditions” are provided in the present disclosure and illustrated by examples. Engineered dsRNA ligase polypeptides
- the disclosure provides an engineered dsRNA ligase polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346,
- the disclosure provides an engineered dsRNA ligase polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, and 668.
- the disclosure provides an engineered dsRNA ligase polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346,
- the disclosure also provides an engineered dsRNA ligase polypeptide having dsRNA ligase activity and comprising an amino acid sequence having at least 80% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NOs: 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344,
- engineered dsRNA ligase polypeptide does not the comprise the amino acid sequence of SEQ ID NO: 302.
- the disclosure also provides an engineered dsRNA ligase polypeptide having dsRNA ligase activity and comprising an amino acid sequence having at least 80% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NOs: 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, and 668, wherein the engineered dsRNA ligase polypeptide does not the comprise the amino acid sequence of SEQ ID NO: 302.
- the disclosure also provides an engineered dsRNA ligase polypeptide having dsRNA ligase activity and comprising an amino acid sequence having at least 80% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NOs: 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344,
- engineered dsRNA ligase polypeptide does not the comprise the amino acid sequence of SEQ ID NO: 302.
- dsRNA double-stranded RNA
- ligase polypeptide which is a polypeptide of: (a) a polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354,
- a polypeptide having dsRNA ligase activity which comprises an amino acid sequence having (i) at least 80% sequence identity to one of the polypeptides recited in (a), and (ii) a substitution, deletion, addition or insertion of one or more amino acid residues relative to said one amino acid sequence recited in (a); wherein the engineered dsRNA ligase polypeptide does not the comprise the amino acid
- the disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide, which is a polypeptide of: (a) a polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, and 668; or (b) a polypeptide having dsRNA ligase activity, which comprises an amino acid sequence having (i) at least 80% sequence identity to one of the polypeptides recited in (a), and (ii) a substitution, deletion, addition or insertion of one or more amino acid residues relative to said one amino acid sequence recited in (a); wherein the engineered dsRNA ligase polypeptide does not the comprise the amino acid sequence of SEQ ID NO: 302.
- dsRNA double-stranded RNA
- ligase polypeptide which is a polypeptide of: (a) a polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354,
- a polypeptide having dsRNA ligase activity which comprises an amino acid sequence having (i) at least 80% sequence identity to one of the polypeptides recited in (a), and (ii) a substitution, deletion, addition or insertion of one or more amino acid residues relative to said one amino acid sequence recited in (a); wherein the engineered dsRNA ligase polypeptide does not the comprise the amino acid sequence of SEQ ID NO: 302.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence having at least 85% sequence identity to an even numbered sequence identifier of SEQ ID NOs: 304-600, optionally at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or at least 99.5% sequence identity to an even numbered sequence identifier of SEQ ID NOs: 304-600.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence having at least 80% sequence identity to an even numbered sequence identifier of SEQ ID NOs: 304-600, optionally at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or at least 99.5% sequence identity to an even numbered sequence identifier of SEQ ID NOs: 304-600.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence having at least 80% sequence identity to an even numbered sequence identifier of SEQ ID NOs: 636-668, optionally at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or at least 99.5% sequence identity to an even numbered sequence identifier of SEQ ID NOs: 636-668.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence having at least 80% sequence identity to an even numbered sequence identifier of SEQ ID NOs: 304-600 or 636-668, optionally at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or at least 99.5% sequence identity to an even numbered sequence identifier of SEQ ID NOs: 304-600 or 636-668.
- the engineered dsRNA ligase polypeptides represented by the even numbered sequence identifiers of SEQ ID NOs: 304 to 600 and 636 to 668 exhibit higher activity than that of SEQ ID NO: 302, as shown in the Examples.
- the dsRNA ligase polypeptides used in the Examples (represented by even numbered sequence identifiers of SEQ ID NOs: 4 to 300 and 602 to 634, respectively) comprise an even numbered sequence identifier of SEQ ID NOs: 304 to 600 and 636 to 668 and an N-terminal purification tag (MHHHHHHENLYFQS (SEQ ID NO: 669)).
- SEQ ID NO: 4 comprises: (i) the N-terminal purification tag MHHHHHHENLYFQS (SEQ ID NO: 669); and (ii) SEQ ID NO: 304.
- dsRNA ligase polypeptides represented by even numbered sequence identifiers of SEQ ID NOs: 304 to 600 and 636 to 668 do not comprise the N-terminal purification tag represented by SEQ ID NO: 669.
- the wild-type dsRNA ligase polypeptide comprises the amino acid sequence of SEQ ID NO: 302 (also accessible under UniProt accession number Q7Y4V8).
- SEQ ID NO: 2 comprises: (i) the N-terminal purification tag MHHHHHHENLYFQS (SEQ ID NO: 669); and (ii) SEQ ID NO: 302. It will be readily understood that SEQ ID NOs: 2 and 302 both comprise the wild-type dsRNA ligase polypeptide sequence and so both sequences may be referred to herein as the wild-type sequence.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 370, 488, 526, 578, 588, 590 and 592. In some embodiments, the engineered dsRNA ligase polypeptide comprises the amino acid sequence of SEQ ID NO: 666. In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 370, 488, 526, 578, 588, 590, 592 and 666.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 70, 188, 226, 278, 288, 290 and 292. In some embodiments, the engineered dsRNA ligase polypeptide comprises the amino acid sequence of SEQ ID NO: 632. In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 70, 188, 226, 278, 288, 290, 292 and 632.
- SEQ ID NOs: 70, 188, 226, 278, 288, 290, 292 and 632 comprise: (i) an N-terminal purification tag MHHHHHHENLYFQS (SEQ ID NO: 669); and (ii) an amino acid sequence provided by SEQ ID NOs: 370, 488, 526, 578, 588, 590, 592 and 666, respectively.
- the engineered dsRNA ligase polypeptide comprises the amino acid sequence of SEQ ID NO: 370. In some embodiments, the engineered dsRNA ligase polypeptide comprises the amino acid sequence of SEQ ID NO: 488. In some embodiments, the engineered dsRNA ligase polypeptide comprises the amino acid sequence of SEQ ID NO: 526. In some embodiments, the engineered dsRNA ligase polypeptide polypeptide comprises the amino acid sequence of SEQ ID NO: 578. In some embodiments, the engineered dsRNA ligase polypeptide comprises the amino acid sequence of SEQ ID NO: 588.
- the engineered dsRNA ligase polypeptide comprises the amino acid sequence of SEQ ID NO: 590. In some embodiments, the engineered dsRNA ligase polypeptide comprises the amino acid sequence of SEQ ID NO: 592. In some embodiments, the engineered dsRNA ligase polypeptide comprises the amino acid sequence of SEQ ID NO: 666.
- the disclosure also provides an engineered dsRNA ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to SEQ ID NO: 302, which produces at least 5% more oligonucleotide product than a dsRNA ligase polypeptide comprising the amino acid sequence of SEQ ID NO: 302 under the same ligation reaction conditions, wherein the engineered dsRNA ligase polypeptide does not the comprise the amino acid sequence of SEQ ID NO: 302.
- the engineered dsRNA ligase polypeptide produces at least 10%, at least 15%, at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, or at least 100% more oligonucleotide product than a dsRNA ligase polypeptide comprising the amino acid sequence of SEQ ID NO: 302 under the same ligation reaction conditions.
- the ligation reaction conditions are as described herein.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence having at least 85% sequence identity to SEQ ID NO: 302, optionally at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or at least 99.5% sequence identity to SEQ ID NO: 302.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence having at least 80% sequence identity to SEQ ID NO: 302, optionally at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or at least 99.5% sequence identity to SEQ ID NO: 302.
- the ligation reaction conditions include about 1 pM to about 10 mM oligonucleotide fragment, a source of ATP, about 5 mM to about 100 mM divalent cation, and about 0.5 g/L to about 10 g/L engineered dsRNA ligase polypeptide, pH of about 4.0 to about 8.0, and temperature of about 10 °C to about 50 °C.
- the source of ATP is a stoichiometric concentration of ATP or a stoichiometric excess of ATP.
- the source of ATP comprises: (a) polyphosphate kinase (PPK); (b) polyphosphate; and (c) AMP and/or ATP.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. two or more, three or more, four or more, five or more, six or more, seven or more, eight or more, nine or more, or ten or more) amino acid residues selected from: X6, X7, X15, X19, X29, X36, X39, X46, X47, X49, X51, X53, X56, X57, X60, X63, X64, X66, X67, X87, X88, X91, X93, X103, X105, X107, X114, X122, X126, X129, X130, X131, X137, X144, X146, X158, X163, X173, X178, X190, X196,
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. two or more, three or more, four or more, five or more, six or more, seven or more, eight or more, nine or more, or ten or more) amino acid residues selected from: X6, X7, X15, X19, X29, X36, X39, X44, X45, X46, X47, X49, X51, X53, X56, X57, X60, X63, X64, X66, X67, X87, X88, X89, X91, X92, X93, X103, X105, X107, XI 14, X122, X126, X129, X130, X131, X137, X144, X146, X158, X163,
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence comprising one or more (e.g. two or more, three or more, four or more, five or more, six or more, seven or more, eight or more, nine or more, or ten or more) of the following amino acid residues:
- X6 is G or E;
- X7 is Q;
- X15 is R, D or E;
- X19 is Q or D;
- X29 is N or L;
- X36 is V;
- X39 is A;
- X44 is V;
- X45 is V;
- X46 is Y;
- X47 is E;
- X49 is G;
- X51 is L;
- X53 is Y;
- X56 is R or A;
- X57 is S;
- X60 is T, G or P;
- X63 is S, Q or G;
- X64 is R, T, Q, F
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, 9 or more, 10 or more, or 11) amino acid residues selected from: X15, X19, X36, X39, X53, X218, X221, X237, X251, X255, and X285; wherein the numbering refers to SEQ ID NO: 302.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence comprising one or more (e.g.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, 9 or more, 10 or more, 11 or more, or 12) amino acid residues selected from: X15, X19, X36, X39, X53, X185, X218, X221, X237, X251, X255, and X285; wherein the numbering refers to SEQ ID NO: 302.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence comprising one or more (e.g. 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, 9 or more, 10 or more, 11 or more, or 12) of the following amino acid residues: X15 is D or E; X19 is D; X36 is V; X39 is A; X53 is Y; X185 is K; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A; wherein the numbering refers to SEQ ID NO: 302.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. two or more or three or more) amino acid residues selected from: X36, X39, X218 and X221; wherein the numbering refers to SEQ ID NO: 302.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence comprising one or more (e.g. two or more or three or more) of the following amino acid residues: X36 is V; X39 is A; X218 is N; and X221 is I; wherein the numbering refers to SEQ ID NO: 302.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 at amino acid residues: X36, X39, X218 and X221; wherein the numbering refers to SEQ ID NO: 302.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence comprising the following amino acid residues: X36 is V; X39 is A; X218 is N; and X221 is I; wherein the numbering refers to SEQ ID NO: 302.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. two or more) amino acid residues selected from: X39, X218 and X221; wherein the numbering refers to SEQ ID NO: 302.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence comprising one or more (e.g. two or more) of the following amino acid residues: X39 is A; X218 is N; and X221 is I; wherein the numbering refers to SEQ ID NO: 302.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 at amino acid residues: X39, X218 and X221; wherein the numbering refers to SEQ ID NO: 302.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence comprising the following amino acid residues: X39 is A; X218 is N; and X221 is I; wherein the numbering refers to SEQ ID NO: 302.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. two or more or three or more) amino acid residues selected from: X39, X218, X221, and X255; wherein the numbering refers to SEQ ID NO: 302.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence comprising one or more (e.g. two or more orthree or more) of the following amino acid residues: X39 is A; X218 is N; X221 is I; and X255 is C; wherein the numbering refers to SEQ ID NO: 302.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 at amino acid residues: X39, X218, X221, and X255; wherein the numbering refers to SEQ ID NO: 302.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence comprising the following amino acid residues: X39 is A; X218 is N; X221 is I; and X255 is C; wherein the numbering refers to SEQ ID NO: 302.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. two or more, three or more, four or more, five or more, six or more, or seven or more) amino acid residues selected from: X39, X53, X218, X221, X237, X251, X255 and X285; wherein the numbering refers to SEQ ID NO: 302.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence comprising one or more (e.g.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 at amino acid residues: X39, X53, X218, X221, X237, X251, X255 and X285; wherein the numbering refers to SEQ ID NO: 302.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence comprising the following amino acid residues: X39 is A; X 53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A; wherein the numbering refers to SEQ ID NO: 302.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. two or more, three or more, four or more, five or more, six or more, seven or more, or eight or more) amino acid residues selected from: X15, X39, X53, X218, X221, X237, X251, X255 and X285; wherein the numbering refers to SEQ ID NO: 302.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence comprising one or more (e.g.
- X15 is D or E; X39 is A; X 53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A; wherein the numbering refers to SEQ ID NO: 302.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 at amino acid residues: X15, X39, X53, X218, X221, X237, X251, X255 and X285; wherein the numbering refers to SEQ ID NO: 302.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence comprising the following amino acid residues: X15 is D; X39 is A; X 53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A; wherein the numbering refers to SEQ ID NO: 302.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence comprising the following amino acid residues: X15 is E; X39 is A; X 53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A; wherein the numbering refers to SEQ ID NO: 302.
- the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X15, X39, X53, X185, X218, X221, X237, X251, X255 and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X15 is D; X39 is A; X 53 is Y; XI 85 is K; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A.
- the engineered dsRNA ligase polypeptide comprises a purification tag.
- Purification tags are typically appended to polypeptides so that they can be purified from their crude biological source using an affinity technique.
- the purification tag comprises a poly-histidine tag. Poly-histidine tags bind to matrices bearing immobilized metal ions and can be used to purify polypeptides by affinity chromatography.
- the purification tag further comprises a protease recognition site for removal of the purification tag.
- the protease recognition site comprises a Tobacco Etch Virus (TEV) protease recognition sequence.
- the purification tag comprises the amino acid sequence MHHHHHHENLYFQS (SEQ ID NO: 669).
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186,
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 602, 604, 606, 608, 610, 612, 614, 616, 618, 620, 622, 624, 626, 628, 630, 632, and 634.
- the engineered dsRNA ligase polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148,
- 294, 296, 298, 300 602, 604, 606, 608, 610, 612, 614, 616, 618, 620, 622, 624, 626, 628,
- the disclosure also provides a polypeptide immobilized on a solid support material by chemical bond or a physical adsorption method, wherein the polypeptide comprises an engineered dsRNA ligase polypeptide disclosed herein.
- Immobilization of a polypeptide by physical absorption typically involves the polypeptide being physically adsorbed or attached onto a solid support material. Adsorption can occur through weak non-specific forces such as van der Waals, hydrophobic interactions and hydrogen bonds. Physical adsorption may be achieved by soaking the support material in a solution of the polypeptide and incubating to allow time for physical adsorption to occur. Immobilization of a polypeptide by chemical bonding typically involves the attachment of the polypeptide to the support material via a covalent bond.
- the dsRNA ligase polypeptide is immobilized via a spacer positioned between the dsRNA ligase polypeptide and the solid material.
- the spacer is a peptide (e.g. a peptide comprising 2 or more, 3 or more, 4 or more, 5 or more, 10 or more, 15 or more, 20 or more, 25 or more, 30 or more, 35 or more, 40 or more, 45 or more, 50 or more, 75 or more, or 100 or more amino acids).
- the engineered dsRNA ligase polypeptide is immobilized using affinity immobilization. In some embodiments, the engineered dsRNA ligase polypeptide is immobilized using metal affinity immobilization, e.g. by contacting His- tagged engineered dsRNA ligase polypeptide with immobilized metal such as nickel, zinc, cobalt, or copper.
- the solid support material comprises a membrane, resin, solid carrier, or other solid phase material.
- a solid support material can be composed of organic polymers such as polystyrene, polyethylene, polypropylene, polyfluoroethylene, polyethyleneoxy, polymethacrylate, and polyacrylamide, as well as co-polymers and grafts thereof.
- a solid support material can also be inorganic, such as glass, silica, controlled pore glass (CPG), reverse phase silica or metal, such as gold or platinum.
- CPG controlled pore glass
- the configuration of a solid support material can be in the form of beads, spheres, particles, granules, a gel, a membrane or a surface. Surfaces can be planar, substantially planar, or non -planar.
- Solid support materials can be porous or non-porous and can have swelling or non-swelling characteristics.
- a solid support material can be configured in the form of a well, depression, or other container, vessel, feature, or location.
- Solid support materials useful for immobilizing the dsRNA ligase polypeptide for carrying out a ligase reaction include but are not limited to beads or resins such as polymethacrylate, e.g., polymethacrylates with epoxy functional groups, polymethacrylates with amino epoxy functional groups, polymethacrylates, styrene/DVB copolymer or polymethacrylates with octadecyl functional groups.
- Exemplary solid supports include, but are not limited to, chitosan beads, Eupergit C, IB-150, IB-350, IB-C435, IB-A369, IB-A161, IB-A171, IBS500, IB-S861, SEPABEADS (Mitsubishi), e.g., Sepabeads EC-EP, Sepabeads EC-HFA, Sepabeads EC-HG, Sepabeads EC-BU, Sepabeads EC-OD, Sepabeads EC-CM, Sepabeads EC-IDA, Sepabeads EC-EA, Sepabeads EC-HA, Sepabeads EC-QA, Sepabeads EXE, Sepabeads EXA, Dilbeads-TA, Amberzyme Oxirane, Amberlite XAD-7HP, Amberlite FPA98C1, Amberlite IRA
- this disclosure provides polynucleotides encoding engineered polypeptides having dsRNA ligase activity described herein.
- the polynucleotides can be linked to one or more heterologous regulatory sequences that control gene expression to produce recombinant polynucleotides that are capable of expressing the engineered polypeptides.
- Expression constructs comprising a heterologous polynucleotide encoding an engineered dsRNA ligase may be introduced into a suitable host cell to express the corresponding engineered dsRNA ligase polypeptide.
- this disclosure specifically contemplates each and every possible alteration of a polynucleotide that can be made by selecting a combination based on possible codon selections, for any of the polypeptides disclosed herein, comprising those amino acid sequences of exemplary engineered polypeptides listed in Examples 7 to 12, any of the polypeptides disclosed as even sequence identifiers of SEQ ID NOs: 304 to 600 and 636 to 668, and any of the polypeptides disclosed as even sequence identifiers of SEQ ID NOs: 4 to 300 and 602 to 634.
- the codons are preferably selected to accommodate the host cell in which the recombinant protein is produced.
- codons preferred for bacteria are used to express genes in bacteria; codons preferred for yeast are used to express genes in yeast; and codons preferred for mammals are used for gene expression in mammalian cells.
- the disclosure provides a polynucleotide encoding an engineered dsRNA ligase polypeptide described above.
- the polynucleotide encodes a polypeptide comprising an amino acid sequence having at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% to a reference sequence that is an even numbered sequence identifier of SEQ ID NOs: 304 to 600 or 636 to 668, wherein the polypeptide has dsRNA ligase activity and exhibits higher enzyme activity than a polypeptide comprising the amino acid of SEQ ID NO: 2 and/or 302.
- the polynucleotide encodes an engineered dsRNA ligase polypeptide described herein and comprises a nucleic acid sequence having at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity to a reference polynucleotide selected from the sequences having an odd numbered sequence identifier of SEQ ID NOs: 303 to 599 or 635 to 667, wherein: (i) the polynucleotide does not comprise SEQ ID NO: 301; and (ii) the polynucleotide does not encode a dsRNA ligase polypeptide having the amino acid sequence of
- the polynucleotide encodes an engineered dsRNA ligase polypeptide described herein and comprises a nucleic acid sequence having at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity to a reference polynucleotide selected from the sequences having an odd numbered sequence identifier of SEQ ID NOs: 3 to 299 or 601 to 633, wherein: (i) the polynucleotide does not comprise SEQ ID NO: 1; and (ii) the polynucleotide does not encode a dsRNA ligase polypeptide having the amino acid sequence of SEQ
- polynucleotides having an odd numbered sequence identifier of SEQ ID NOs: 3 to 299 or 601 to 633 encode an engineered dsRNA ligase polypeptide comprising an N-terminal purification tag (SEQ ID NO: 669).
- the isolated polynucleotides encoding engineered dsRNA ligase polypeptides can be manipulated to enable the expression of the engineered polypeptides in a variety of ways, which may comprise further modification of the sequences by codon optimization to improve expression, insertion into suitable expression elements with or without additional control sequences, and transformation into a host cell suitable for expression and production of the engineered polypeptides.
- manipulation of the isolated polynucleotide prior to insertion of the isolated polynucleotide into the vector may be desirable or necessary.
- Techniques for modifying polynucleotides and nucleic acid sequences using recombinant DNA methods are well known in the art. Guidance is provided below: Sambrook et al., 2001, Molecular Cloning: A Laboratory Manual, Third Edition, Cold Spring Harbor Laboratory Press; and Current Protocols in Molecular Biology, Ausubel. F. Eds., Greene Pub. Associates, 1998, 2010 Year update.
- the disclosure also provides an expression vector comprising the polynucleotide described herein.
- the vector is selected from a plasmid, a cosmid, a bacteriophage, or a viral vector.
- Recombinant expression vectors typically comprise one or more expression regulatory regions, such as promoters and terminators, origin of replication and the like.
- Polynucleotides encoding an engineered dsRNA ligase polypeptide described herein can be expressed by inserting the polynucleotide or the nucleic acid construct comprising the polynucleotide sequence into an appropriate expression vector.
- the coding sequence is located in the vector such that the coding sequence is linked to a suitable control sequence for expression.
- the recombinant expression vector can be any vector (e.g. , a plasmid or virus) that can be conveniently used in recombinant DNA procedures and can result in the expression of a polynucleotide sequence.
- the choice of vector will generally depend on the compatibility of the vector with the host cell to be introduced into.
- the vector can be linear or closed circular plasmid.
- the expression vector may be an autonomously replicating vector, i. e. , a vector that exists as an extrachromosomal entity whose replication is independent of chromosomal replication, such as a plasmid, extrachromosomal element, minichromosome, or artificial chromosome.
- the vector may contain any tools for ensuring self-copying.
- the vector may be a vector that, when introduced into a host cell, integrates into the genome and replicates with the chromosome into which it is integrated.
- a single vector or plasmid or two or more vectors or plasmids that together comprise the total DNA to be introduced into the genome of the host cell may be used.
- An exemplary expression vector can be prepared by inserting a polynucleotide encoding an engineered dsRNA ligase polypeptide to plasmid pACYC-Duet-1 (Novagen), pBR322 Vector (New England Biolabs), pUC19 Vector (New England Biolabs) or pET T7 Expression Vectors (Novagen).
- the disclosure also provides a host cell capable of expressing an engineered dsRNA ligase polypeptide described herein.
- the host cell comprises the nucleic acid molecule described herein, or the vector described herein.
- the host cell is Escherichia coli.
- the polynucleotide encoding the polypeptide is linked to one or more control sequences for expression of polypeptides in the host cell.
- Host cells for expression of polypeptides encoded by the expression vectors of the present disclosure are well known in the art, including, but not limited to, bacterial cells such as E. coli, Streptomyces, and Salmonella typhimurium,' fungal cells (e.g., Saccharomyces cerevisiae or Pichia pastoris).' insect cells such as Drosophila S2 and Spodoptera Sf9; animal cells such as CHO, COS, BHK, 293 and Bowes melanoma cells; and plant cells.
- An exemplary host cell is E. coli BL21 (DE3).
- the host cell may be wild-type or may be engineered through genomic editing. Suitable media and growth conditions for the above host cells are well known in the art.
- Polynucleotides or vectors used to express polypeptides can be introduced into cells by a variety of methods known in the art. Techniques comprise, among others, electroporation, bio-particle bombardment, liposome-mediated transfection, calcium chloride transfection, and protoplast fusion. Different methods of introducing polynucleotides into cells are known to those skilled in the art.
- the host cell may be used to express and isolate the polypeptide described herein.
- Engineered dsRNA ligase can be obtained by subjecting a polynucleotide encoding an dsRNA ligase to mutagenesis and/or directed evolution.
- An exemplary directional evolution technique can be found in "Biocatalysis for the Pharmaceutical Industry: Discovery, Development, and Manufacturing” (2009 John Wiley &Sons Asia (Pte) Ltd. ISBN: 978-0- 470-82314-9).
- the encoding polynucleotide may be prepared by standard solid-phase methods according to known synthetic methods.
- fragments of up to about 100 bases can be synthesized separately and then ligated (e.g., by enzymatic or chemical ligation methods or polymerase-mediated methods) to form any desired contiguous sequence.
- the polynucleotides and oligonucleotides of the present disclosure can be prepared by chemical synthesis using, for example, the classic phosphoramidite methods described by Beaucage et al., 1981, Tet Lett 22: 1859-69, or Matthes et al. People, 1984, EMBO J.
- oligonucleotides are synthesized, purified, annealed, ligated, and cloned into a suitable vector, for example, in an automated DNA synthesizer.
- a suitable vector for example, in an automated DNA synthesizer.
- essentially any nucleic acid is available from any of a variety of commercial sources.
- the disclosure provides a method of preparing an engineered dsRNA ligase polypeptide, which comprises the steps of culturing a host cell described herein and obtaining an engineered dsRNA ligase polypeptide from the culture.
- the process of preparing a polypeptide further comprises isolating the polypeptide.
- Engineered polypeptides may be expressed in suitable cells and isolated (or recovered) from the host cell and/or culture medium using any one or more of the well-known techniques for protein purification, the techniques for protein purification include, among others, lysozyme treatment, sonication, filtration, salting out, ultracentrifiigation and chromatography.
- the invention also provides an engineered dsRNA ligase catalyst obtainable by culturing a host cell described herein, or from the method of preparing an engineered dsRNA ligase polypeptide described herein, wherein said engineered dsRNA ligase catalyst comprises cells or culture fluid containing the engineered dsRNA ligase polypeptides, or an article processed therewith, wherein the article refers to an extract obtained from the culture of host cells, an isolated product obtained by isolating or purifying an engineered dsRNA ligase from the extract, or an immobilized product obtained by immobilizing host cells, an extract thereof, or isolated product of the extract.
- the disclosure provides a method of producing an oligonucleotide from two or more oligonucleotide fragments, wherein the method comprises contacting: (i) two or more oligonucleotide fragments; (ii) an engineered dsRNA ligase polypeptide disclosed herein; (iii) a source of ATP; and (iv) a divalent cation; to obtain an oligonucleotide.
- oligonucleotides by ligating two or more oligonucleotide fragments.
- the produced oligonucleotides are nucleic acids which typically comprise up to 100 nucleotides. It will be understood that oligonucleotides described herein comprise RNA. It will also be understood that oligonucleotides described herein comprise a double-stranded region.
- oligonucleotide fragment refers to a nucleic acid that can be ligated to one or more additional oligonucleotide fragments to provide an oligonucleotide product. Each oligonucleotide fragment corresponds to a portion of the oligonucleotide product.
- the oligonucleotide is a therapeutic oligonucleotide.
- the therapeutic oligonucleotide is a small interfering RNA (siRNA) or an antisense oligonucleotide (ASO).
- the oligonucleotide is an aptamer.
- the oligonucleotide comprises an overhang. In some embodiments, the oligonucleotide comprises a 3’ overhang. In some embodiments, the oligonucleotide comprises a 5’ overhang. In some embodiments, the overhang comprises 1, 2, 3, 4, 5, 6, 7, or 8 nucleotides. In some embodiments, the oligonucleotide comprises a blunt end. In some embodiments, the oligonucleotide comprises two blunt ends.
- the oligonucleotide is up to 20 nucleotides in length. In some embodiments, the oligonucleotide is up to 25, up to 30, up to 35, up to 40, up to 45, up to 50, up to 55, up to 60, up to 65, up to 70, up to 75, up to 80, up to 85, up to 90, up to 95, or up to 100 nucleotides in length. In some embodiments, the oligonucleotide is up to 60 nucleotides in length.
- the oligonucleotide is at least 20 nucleotides in length. In some embodiments, the oligonucleotide is at least 25, at least 30, at least 35, at least 40, at least 45, at least 50, at least 55, at least 60, at least 65, at least 70, at least 75, at least 80, at least 85, at least 90, at least 95, or 100 nucleotides in length.
- the oligonucleotide is 10-100 nucleotides in length. In some embodiments, the oligonucleotide is 10-80, 10-70, 10-60, 10-50, 10-40, 10-30, 10-25, 15-80, 15-70, 15-60, 15-50, 15-40, 15-30, or 15-25 nucleotides in length. In some embodiments, the oligonucleotide is 15-25 nucleotides in length.
- the two or more oligonucleotide fragments comprise one or more 3’ oligonucleotide fragments and one or more 5 ’ oligonucleotide fragments, wherein each of the one or more 3’ oligonucleotide fragments comprise a 5 ’-phosphate group and each of the one or more 5’ oligonucleotide fragments typically comprise a 3’ terminal ribonucleotide having a 3 ’-hydroxyl group.
- one or more of the oligonucleotide fragments comprises one or more mismatches. In some embodiments, one or more of the oligonucleotide fragments comprise an overhang. In some embodiments, one or more of the oligonucleotide fragments comprise a 3’ overhang. In some embodiments, one or more of the oligonucleotide fragments comprise a 5’ overhang. In some embodiments, one or more of the oligonucleotide fragments comprise a 3’ overhang and a 5’ overhang. In some embodiments, the overhang comprises 1, 2, 3, 4, 5, 6, 7, or 8 nucleotides.
- the two or more oligonucleotide fragments comprise a first oligonucleotide fragment having an overhang that is complementary to the overhang of a second oligonucleotide fragment. In some embodiments, the two or more oligonucleotide fragments comprise a first oligonucleotide fragment having a 3 ’ overhang and a 5 ’ overhang, wherein the 3’ overhang is complementary to the 5’ overhang of a second oligonucleotide fragment and the 5’ overhang is complementary to the 3’ overhang of a third oligonucleotide.
- one or more of the oligonucleotide fragments comprise a blunt end. In some embodiments, one or more of the oligonucleotide fragments comprise a 3’ overhang and a 5’ blunt end. In some embodiments, one or more of the oligonucleotide fragments comprise a 5’ overhang and a 3’ blunt end. In some embodiments, the 5’ terminal oligonucleotide fragment comprises a 3’ overhang and a 5’ blunt end. In some embodiments, the 3’ terminal oligonucleotide fragment comprise a 5’ overhang and a 3’ blunt end.
- two or more of the oligonucleotide fragments comprise two or more RNA oligonucleotide fragments. In some embodiments, the two or more RNA oligonucleotide fragments comprise double-stranded RNA (dsRNA) oligonucleotide fragments.
- dsRNA double-stranded RNA
- one or more of the oligonucleotide fragments comprise DNA and RNA.
- a portion of the oligonucleotide fragment may be double-stranded DNA, while another portion is double-stranded RNA, forming a DNA-RNA chimera.
- one or more of the oligonucleotide fragments comprise one or two strands which are RNA, or a mixture of RNA, DNA, LNA, Morpholino, UNA (unlocked nucleic acid), TNA (threose nucleic acid), GNA (glycol nucleic acid), and/or FANA (Fluoroarabino nucleic acid), modified RNA, etc.
- one or both strand(s) could be, for example, RNA except that one or more nucleotide(s) is replaced by DNA, LNA, Morpholino, UNA, TNA, GNA, and/or FANA, and/or modified RNA (e.g., any modified RNA disclosed herein or known in the art, such as 2’ modified RNA, including but not limited to 2’-F, 2’-0Me, 2’-0-M0E RNA, etc.).
- modified RNA e.g., any modified RNA disclosed herein or known in the art, such as 2’ modified RNA, including but not limited to 2’-F, 2’-0Me, 2’-0-M0E RNA, etc.
- the two or more oligonucleotide fragments are the same length. In some embodiments, the two or more oligonucleotide fragments are different lengths. In some embodiments, each of the two or more oligonucleotide fragments are 3-20 nucleotides in length. In some embodiments, each of the two or more oligonucleotide fragments are 4-16 nucleotides in length. In some embodiments, each of the two or more oligonucleotide fragments are 4-16, 4-15, 5-15, 6-15, 4-14, 4-13, 4-12, 4-11, 4-10, 4-9, 5-9, or 6-9 nucleotides in length.
- each of the two or more oligonucleotide fragments are at least 3 nucleotides in length. In some embodiments, each of the two or more oligonucleotide fragments are at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, or at least 15 nucleotides in length.
- the two or more oligonucleotide fragments comprise 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, 9 or more, or 10 or more oligonucleotide fragments.
- one or more ligation reactions are required to generate the oligonucleotide product. In some embodiments, 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, 9 or more, or 10 or more ligation reactions are required to generate the oligonucleotide product.
- one or more of the oligonucleotide fragments and/or the oligonucleotide comprises a chemical modification. In some embodiments, one or more of the oligonucleotide fragments and/or the oligonucleotide comprises at least one modified backbone modification. In some embodiments, one or more of the oligonucleotide fragments and/or the oligonucleotide comprises at least one modified nucleotide modification. In some embodiments, one or more of the oligonucleotide fragments and/or the oligonucleotide comprises at least one sugar modification (e.g. at the 2’-position or 4’-position).
- one or more of the oligonucleotide fragments and/or the oligonucleotide comprises: (i) at least one modified backbone modification; (ii) and at least one modified nucleotide modification; and/or (iii) at least one sugar modification.
- one or more of the oligonucleotide fragments and/or the oligonucleotide comprise a modification selected from the group consisting of: 2'-O-methyl (2’-OMe), 2'-flouro (2’-F), 2'-deoxy, 2'-deoxy-2’-fluoro, 2'-O-methoxyethyl (2'-O-MOE), 2'- O-aminopropyl (2'-O-AP), 2'-O-dimethylaminoethyl (2'-O-DMAOE), 2'-O- dimethylaminopropyl (2'-O-DMAP), 2'-O-dimethylaminoethyloxyethyl (2'-O-DMAEOE), 2'- O-N-methylacetamido (2'-0-NMA), locked nucleic acid (LNA), glycol nucleic acid (GNA), phosphoramidate (e.g.
- mesyl phosphoramidate 2',3'-seco nucleotide mimic
- 2'-F-arabino nucleotide abasic nucleotide
- 2'-amino modified nucleotide 2'-alkyl-modified nucleotide
- morpholino nucleotide vinylpho sphonate (e.g. 5’ vinylphosphonate)
- cyclopropyl phosphonate deoxyribonucleotide 2',3'-seco nucleotide mimic
- 2'-F-arabino nucleotide abasic nucleotide
- 2'-amino modified nucleotide 2'-alkyl-modified nucleotide
- morpholino nucleotide e.g. 5’ vinylphosphonate
- cyclopropyl phosphonate deoxyribonucleotide cyclopropyl phosphonate deoxyribonucleotide
- one or more of the oligonucleotide fragments and/or the oligonucleotide comprises a 2'-modification selected from the group consisting of: 2’-0Me, 2’-F, and 2'-deoxy.
- one or more of the oligonucleotide fragments and/or the oligonucleotide comprises at least one phosphorothioate or methylphosphonate intemucleotide linkage. In some embodiments, the oligonucleotide comprises at least one chiral phosphorothioate linkage.
- one or more of the oligonucleotide fragments and/or the oligonucleotide is conjugated to at least one ligand.
- the ligand may be conjugated to the sense strand, antisense strand or both strands, in any configuration e.g. at the 3 ’-end, 5 ’-end, non-end or a combination.
- the ligand comprises one or more N-Acetylgalactosamine (GalNAc) derivatives.
- GalNAc is an amino sugar derivative of galactose which may be used as a targeting ligand in oligonucleotides intended for targeting to the liver, where it binds to the asialoglycoprotein receptors on hepatocytes.
- the ligand comprises one or more GalNAc derivatives conjugated through a bivalent or trivalent branched carrier.
- the ligand is a peptide or a peptidomimetic.
- the ligand is conjugated to the sense strand. In some embodiments, the ligand is conjugated to the 3’ end of the sense strand. In some embodiments, the ligand is conjugated to the 5’ end of the sense strand. In some embodiments, the ligand is conjugated to a non-end of the sense strand.
- the ligand is conjugated to the antisense strand. In some embodiments, the ligand is conjugated to the 3’ end of the antisense strand. In some embodiments, the ligand is conjugated to a non-end of the antisense strand.
- the oligonucleotide is an RNAi agent comprising at least one 2’-modified nucleotide selected from a group consisting of 2’-0Me, 2’-F, 2'-deoxy, 2'-deoxy- 2’-fluoro, and 2'-0-M0E.
- the oligonucleotide is an RNAi agent wherein the sense strand is conjugated to one or more GalNAc ligand(s).
- one or more of the oligonucleotide fragments comprises at least one 2’- modified nucleotide selected from a group consisting of 2’-0Me, 2’-F, 2'-deoxy, 2'-deoxy-2’- fluoro, and 2'-0-M0E.
- one or more of the oligonucleotide fragments is a dsRNA wherein the sense strand is conjugated to one or more GalNAc ligand(s).
- the method is performed with an oligonucleotide fragment concentration of at least 1 mM, at least 2 mM, at least 3 mM, at least 4 mM, at least 5 mM, at least 6 mM, at least 7 mM, at least 8 mM, at least 9 mM, or at least 10 mM. In some embodiments, the method is performed with at least 1 mM, at least 2 mM, at least 3 mM, at least 4 mM, at least 5 mM, at least 6 mM, at least 7 mM, at least 8 mM, at least 9 mM, or at least 10 mM of each oligonucleotide fragment. In some embodiments, the method is performed with equimolar amounts of each of the two or more oligonucleotide fragments.
- the method produces at least 15 g of oligonucleotide product per litre of reaction mixture. In some embodiments, the method produces at least 16 g, at least 17 g, at least 18 g, at least 19 g, at least 20 g, at least 30 g, at least 40 g, at least 50 g, at least 60 g, at least 70 g, at least 80 g, at least 90, or at least 100 g of oligonucleotide product per litre of reaction mixture.
- the method is performed using an engineered dsRNA ligase as described herein.
- the method is performed using about 1 g/L engineered dsRNA ligase polypeptide, optionally 1.1 g/L, 1.15 g/L, 1.2 g/L, 1.25 g/L, 1.3 g/L, 1.35 g/L, 1.4 g/L, 1.45 g/L, 1.5 g/L, 1.55 g/L, 1.6 g/L, 1.65 g/L, 1.7 g/L, 1.75 g/L, 1.8 g/L, 1.85 g/L, 1.9 g/L, 1.95 g/L, 2 g/L, 2.1 g/L, 2.2 g/L, 2.3 g/L, 2.4 g/L, 2.5 g/L, 2.6 g/L, 2.7 g/L, 2.8 g/L, 2.9 g/L, 3 g/L, 3.25 g/L, 3.5 g/L, 3.75 g/L, 4 g
- dsRNA ligase The enzymatic activity of dsRNA ligase requires ATP as a cofactor. One molecule of ATP is converted to AMP per ligation reaction. The catalytic mechanism of dsRNA ligase and the role of ATP in nucleic acid ligation reactions are described above.
- the source of ATP is ATP.
- the method is performed using a stoichiometric concentration of ATP.
- the method is performed using a stoichiometric excess of ATP. The skilled person can readily determine the stoichiometric concentration of ATP required for a given ligation based on the concentration of the oligonucleotide fragments and the number of ligation reactions required to produce the oligonucleotide product.
- the method is performed using an ATP and/or AMP concentration of about 0.5 mM, about 1 mM, about 2 mM, about 3 mM, about 4 mM, about 5 mM, about 6 mM, about 7 mM, about 8 mM, about 9 mM, about 10 mM, about 12 mM, about 14 mM, about 16 mM, about 18 mM, about 20 mM, about 22 mM, about 24 mM, about 26 mM, about 28mM or about 30mM.
- an ATP and/or AMP concentration of about 0.5 mM, about 1 mM, about 2 mM, about 3 mM, about 4 mM, about 5 mM, about 6 mM, about 7 mM, about 8 mM, about 9 mM, about 10 mM, about 12 mM, about 14 mM, about 16 mM, about 18 mM, about 20 mM, about 22 mM
- the source of ATP is an ATP regeneration system.
- the ATP regeneration system comprises: (a) polyphosphate kinase (PPK); (b) polyphosphate; and (c) AMP and/or ATP.
- PPK polyphosphate kinase
- AMP AMP and/or ATP.
- the ATP regeneration system described herein comprises PPK and polyphosphate. PPK generates ATP from AMP using polyphosphate as a phosphate donor. ATP that is converted to AMP during the ligation reaction can be regenerated to ATP by PPK and used as a cofactor in a subsequent ligation reaction. This cycling of ATP obviates the need for high ATP concentration in the starting reaction. Instead, the reaction can be performed using sub- stoichiometric concentrations of ATP, and/or using the cheaper alternative, AMP.
- Polyphosphate kinases or “PPKs” are a family of enzymes which catalyze the formation of ATP from AMP and polyphosphate.
- the PPK is PPK12.
- the PPK comprises an amino acid sequence having at least 70% sequence identity to SEQ ID NO: 670, which is the amino acid sequence of PPK12.
- the PPK comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 670.
- the PPK comprises an amino acid sequence having at least 70% sequence identity to SEQ ID NO: 671, which is the amino acid sequence of an optimized PPK12. In some embodiments, the PPK comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 671.
- the PPK is Acinetobacter johnsonii polyphosphate: AMP phosphotransferase (AjPAP) (UniProt ID: Q83XD3).
- the PPK comprises an amino acid sequence having at least 70% sequence identity to SEQ ID NO: 672, which is the amino acid sequence of AjPAP.
- the PPK comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 672.
- the PPK is used in the form of whole cell, crude extract (e.g. cell free lyophilized extract or cell lysates), isolated polypeptide, or purified polypeptide.
- the PPK polypeptide is used in an immobilized form as described herein, such as immobilized on a resin.
- the method is performed using about 1 g/L PPK, optionally 1.1 g/L, 1.15 g/L, 1.2 g/L, 1.25 g/L, 1.3 g/L, 1.35 g/L, 1.4 g/L, 1.45 g/L, 1.5 g/L, 1.55 g/L, 1.6 g/L, 1.65 g/L, 1.7 g/L, 1.75 g/L, 1.8 g/L, 1.85 g/L, 1.9 g/L, 1.95 g/L, 2 g/L, 2.1 g/L, 2.2 g/L, 2.3 g/L, 2.4 g/L, 2.5 g/L, 2.6 g/L, 2.7 g/L, 2.8 g/L, 2.9 g/L, 3 g/L, 3.25 g/L, 3.5 g/L, 3.75 g/L, 4 g/L, 4.5 g/L or 5
- the polyphosphate is a polyphosphate salt.
- the polyphosphate salt is sodium polyphosphate (Maddrell’s salt) or sodium hexametaphosphate (Graham’s salt).
- the method is performed using a stoichiometric excess of polyphosphate. In some embodiments, the method is performed using a polyphosphate concentration of at least 5 mM, at least 10 mM, at least 15 mM, at least 20 mM, at least 25 mM, at least 30 mM, at least 35 mM, at least 40 mM, at least 45 mM, at least 50 mM, 55 mM, at least 60 mM, at least 65 mM, at least 70 mM, at least 75 mM, at least 80 mM, at least 85 mM, at least 90 mM, at least 95 mM, or at least 100 mM.
- the method is performed in the presence of PPK and polyphosphate
- the method is performed in the presence of AMP.
- the method is performed using a sub-stoichiometric concentration of ATP and/or AMP.
- the enzymatic activity of dsRNA ligases requires the presence of a divalent cation.
- the enzymatic activity of PPKs requires the presence of a divalent cation.
- the divalent cation comprises Mg 2+ and/or Mn 2+ .
- the method is performed with a divalent cation concentration of 5-100 mM, 10-100 mM, 15-100 mM, 20-100 mM, 30-100 mM, 5-90 mM, 5-80 mM, 5-70 mM, 5-60 mM, 5-50 mM, or 30-50 mM.
- the method is performed with a divalent cation concentration of at least 5 mM, at least 10 mM, at least 15 mM, at least 20 mM, at least 25 mM, at least 30 mM, at least 35 mM, at least 40 mM, at least 45 mM, at least 50 mM, 55 mM, at least 60 mM, at least 65 mM, at least 70 mM, at least 75 mM, at least 80 mM, at least 85 mM, at least 90 mM, at least 95 mM, or at least 100 mM.
- the method further comprises purifying the oligonucleotide product from the reaction mixture.
- the oligonucleotide product is at least 80% pure, optionally wherein the oligonucleotide product is at least 85%, at least 90%, at least 95% pure, optionally wherein the oligonucleotide product is at least 98% pure, optionally wherein the oligonucleotide product is at least 99% pure, optionally wherein the oligonucleotide product is at least 99.5% pure, optionally wherein the oligonucleotide product is at least 99.9% pure.
- oligonucleotide product that is pure does not contain oligonucleotide fragments, intermediate ligation products, or side products arising from nonspecific ligation.
- the oligonucleotide product may be purified or isolated using any method known in the art, for example using gel extractions or using cellulose-based matrices.
- the disclosure also provides an oligonucleotide produced by a method described herein.
- the oligonucleotide may be in any suitable buffer solution.
- the buffer solution is selected from Tris buffer (e.g. Tris-HCl), phosphate buffer, HEPES, MOPS (3-(A-morpholino)propancsulfonic acid), and triethanolamine (TEOA) buffer.
- the buffer solution comprises acetate, citrate, prolamine, carbonate, or phosphate, or any combination thereof.
- the buffer solution further comprises an agent for controlling the osmolarity of the solution, such that the osmolarity is kept at a desired value, e.g., at the physiologic values of the human plasma.
- Solutes which can be added to the buffer solution to control the osmolarity include, but are not limited to, proteins, peptides, amino acids, non-metabolized polymers, vitamins, ions, sugars, metabolites, organic acids, lipids, or salts.
- the agent for controlling the osmolarity of the solution is a salt.
- the agent for controlling the osmolarity of the solution is sodium chloride or potassium chloride.
- the present disclosure contemplates a range of suitable reaction conditions that may be used in the methods described herein, including but not limited to pH, temperature, buffers, substrate loadings, enzyme loading, cofactor loading, pressure, and reaction time. Additional suitable reaction conditions for ligation reactions described herein can be readily optimized by routine experimentation, e.g. performing the method described herein under experimental reaction conditions of varying reagent concentration, pH, temperature, and detecting the rate of oligonucleotide product formation.
- the reaction conditions may include a suitable pH.
- the desired pH or desired pH range can be maintained by using an acid or base, a suitable buffer, or a combination of buffer and added acid or base.
- the pH of the reaction mixture can be controlled before and/or during the reaction.
- suitable reaction conditions include a solution pH of about 4 to about 8, a pH of about 5 to about 8, a pH of about 6 to about 8, or a pH of about 7 to about 8.
- the reaction conditions include a solution pH of about 4, 4.5, 5, 5.5, 6, 6.5, 7, 7.5 or 8.
- suitable temperatures can be used for the reaction conditions, taking into consideration of, for example, the increase in reaction rate at higher temperatures, the activity of the enzyme for sufficient duration of the reaction.
- suitable reaction conditions include a temperature of about 10°C to about 60°C, about 10°C to about 50°C, about 25 °C to about 50°C, about 25°C to about 40°C, about 25°C to about 30°C, or about 10°C to about 30°C.
- suitable reaction temperatures include a temperature of about 10°C, 15°C, 20°C, 25°C, 30°C, 35°C, 40°C, 45°C, 50°C, 55°C, or 60°C.
- the temperature during the enzymatic reaction can be maintained at a certain temperature throughout the reaction. In some embodiments, the temperature during the enzymatic reaction may be adjusted over a temperature profile during the course of the reaction.
- the reaction may be performed in any suitable buffer solution.
- the buffer solution is selected from Tris buffer (e.g. Tris-HCl), phosphate buffer, HEPES, MOPS (3-(A-morpholino)propanesulfonic acid), and triethanolamine (TEOA) buffer.
- the buffer solution comprises acetate, citrate, prolamine, carbonate, or phosphate, or any combination thereof.
- the buffer solution is phosphate buffered saline (PBS).
- the reaction mixture further comprises a reducing agent, optionally DTT (Dithiothreitol).
- a reducing agent optionally DTT (Dithiothreitol).
- the engineered dsRNA ligase polypeptide may be added to the reaction mixture in different formulation forms, as frozen or lyophilized whole cells (FWC or LWC) transformed with the gene encoding the engineered dsRNA ligase polypeptide and/or as cell lysate or lyophilized cell lysate of such cells, so called shake flask powder (SFP), where the cell debris was removed and/or further purified as fermentation powder (FP).
- FWC or LWC frozen or lyophilized whole cells transformed with the gene encoding the engineered dsRNA ligase polypeptide and/or as cell lysate or lyophilized cell lysate of such cells, so called shake flask powder (SFP), where the cell debris was removed and/or further purified as fermentation powder (FP).
- SFP shake flask powder
- Whole cells transformed with gene(s) encoding the engineered dsRNA ligase polypeptide or cell extracts, lysates thereof, and isolated enzymes can be used in a wide variety of different forms, including solids (e.g., lyophilized, spray dried, or the like) or semisolid (e.g., a crude paste).
- the cell extract or cell lysate may be partially purified by precipitation (e.g. , ammonium sulfate, polyethyleneimine, heat treatment or the like), followed by desalting procedures (e.g., ultrafiltration, dialysis, and the like) prior to lyophilization.
- Any of the enzyme preparations can be immobilized to a solid phase material (such as a resin).
- a culture medium containing the secreted polypeptide can be used in the process herein.
- the solid reactants e.g. , enzymes, salts, etc.
- the reaction can be provided to the reaction in a variety of different forms, including powders (e.g., lyophilized, spray dried, etc.), solutions, emulsions, suspensions, and the like.
- the reactants can be readily lyophilized or spray-dried using methods and instrumentation known to one skilled in the art.
- the protein solution can be frozen at -80 °C in small aliquots, and then added to the pre-chilled lyophilization chamber, followed by the application of a vacuum.
- the order of addition of reactants is not critical.
- the reactants may be added together to the solvent at the same time or alternatively, some reactants may be added separately, and some may be added together at different time points.
- the methods of performing a ligation reaction may comprise the further step of isolating the oligonucleotide product of the enzymatic reaction. In particular, this step is typically performed after completion of the enzymatic reaction.
- the oligonucleotide is in particular typically separated from one or more, in particular essentially all of the other components of the reaction mixture. For example, the oligonucleotide is typically separated from the remaining substrate, side products, and/or enzymes. Isolation of the oligonucleotide may be achieved by means and techniques known in the art, e.g. by separating oligonucleotides based on their size such as by gel electrophoresis and gel extractions or using cellulose-based matrices.
- the method further comprises purifying the oligonucleotide by ultrafiltration and chromatography. Modifications
- the oligonucleotide fragment(s) and/or the oligonucleotide comprises a modification, e.g. a chemical modification.
- oligonucleotide fragment(s) means one or more oligonucleotide fragments. It will be appreciated that modifications which are present in the oligonucleotide fragment(s) are typically present in the oligonucleotide produced from said oligonucleotide fragment(s). In some embodiments, modification(s) are introduced to and/or removed from the oligonucleotide product.
- the oligonucleotide fragment(s) and/or oligonucleotide comprises a chemical modification. In some embodiments, the oligonucleotide fragment(s) and/or oligonucleotide comprises at least one backbone modification. In some embodiments, the oligonucleotide fragment(s) and/or oligonucleotide comprises at least one nucleotide modification. In some embodiments, the oligonucleotide fragment(s) and/or oligonucleotide comprises at least one sugar modification (e.g. at the 2’-position or 4’-position).
- the oligonucleotide fragment(s) and/or oligonucleotide comprises: (i) at least one backbone modification; (ii) at least one nucleotide modification; and/or (iii) at least one sugar modification.
- Modifications include, but are not limited to, end modifications of the terminal oligonucleotide fragments, e.g., 5 ’-end modifications (phosphorylation, conjugation, inverted linkages) or 3’-end modifications (conjugation, inverted linkages, etc.); base modifications, e.g. , replacement with stabilizing bases, destabilizing bases, or bases that base pair with an expanded repertoire of partners, removal of bases (abasic nucleotides), or conjugated bases; sugar modifications (e.g., at the 2’-position or 4’-position) or replacement of the sugar; or backbone modifications, including modification or replacement of the phosphodiester linkages.
- end modifications of the terminal oligonucleotide fragments e.g., 5 ’-end modifications (phosphorylation, conjugation, inverted linkages) or 3’-end modifications (conjugation, inverted linkages, etc.
- base modifications e.g. , replacement with stabilizing bases, destabilizing bases, or bases that base
- a terminal oligonucleotide fragment and/or oligonucleotide comprises a cap.
- the term "cap” and the like include a chemical moiety attached to the end of a double-stranded nucleotide duplex, but is used herein to exclude a chemical moiety that is a nucleotide or nucleoside.
- a “3’ cap” is attached at the 3’ end of a nucleotide or oligonucleotide and protects the molecule from degradation, e.g., from nucleases, such as those in blood serum or intestinal fluid.
- the oligonucleotide fragment(s) and/or oligonucleotide comprises one or more mismatches.
- a mismatch is defined herein as a difference between the base sequence or length when two sequences are maximally aligned and compared.
- a mismatch is defined as a position wherein the base of one sequence is not complementary to the base of the other sequence.
- a mismatch is counted, for example, if a position in the first sequence has a particular base (e.g., A), and the corresponding position in the second sequence has a base which is not complementary to said base in the first sequence (e.g., G), when the first and second sequences are aligned antiparallel to each other.
- a U can be replaced by T (either as RNA or, preferably, DNA, e.g., 2 ’-deoxy-thymidine); the replacement of a U with a T is not a mismatch as used herein, as either U or T can pair with A on the opposite strand.
- RNA oligonucleotide can thus comprise one or more DNA bases, e.g., T. No mismatch is counted between a DNA portion(s) of an RNAi agent and the corresponding target mRNA if basepairing occurs (e.g., between A, G, C, or T in the DNA portion, and the corresponding U, C, G, or A, respectively in the mRNA).
- a nucleotide modification in the sugar or phosphate is also not considered a mismatch.
- one sequence comprises a G
- the complementary sequence comprises a modified C (e.g., 2 ’-modification) at the same position, no mismatch would be counted.
- RNAi double-stranded RNAi
- a strand having a given sequence as an RNA would have zero mismatches from its complement sequence as a PNA; or morpholino; or LNA; or TNA; or GNA; or FANA; or a mix or chimera of RNA and DNA, TNA, GNA, FANA, Morpholino, UNA, LNA, and/or PNA, etc.
- mismatch No mismatch would occur between a nucleotide which is T, and a nucleotide which is A with a 5’ modification and/or a 2 ’-modification.
- base replacement base replacement
- terminal overhangs such as “UU” or “dTdT” are not counted when counting the number of mismatches.
- a mismatch is defined as a position wherein the base of one sequence does not match the base of the other sequence.
- dTdT (2'-deoxy-thymidine-5 ’-phosphate and 2'-deoxy-thymidine-5’- phosphate), or in some cases, TT or UU
- TT or UU can be added as a terminal dinucleotide cap or extension to one or both 3 ’-ends of the oligonucleotide, but this cap or extension is not included in the calculation of the total number of mismatches and is not considered part of the target sequence.
- the terminal dinucleotide protects the ends from nuclease degradation but does not contribute to target specificity (Elbashir et al. 2001 Nature 411: 494- 498; Elbashir et al. 2001 EMBO J. 20: 6877-6888; and Kraynack et al. 2006 RNA 12: 163- 176).
- nucleic acid molecules There are several examples in the art describing sugar, base, phosphate and backbone modifications that can be introduced into nucleic acid molecules with significant enhancement in their nuclease stability and efficacy.
- oligonucleotides are modified to enhance stability and/or enhance biological activity by modification with nuclease resistant groups, for example, 2'-amino, 2'-C-allyl, 2'-flouro, 2'-O-methyl, 2'-O-allyl, 2'-H, nucleotide base modifications.
- nuclease resistant groups for example, 2'-amino, 2'-C-allyl, 2'-flouro, 2'-O-methyl, 2'-O-allyl, 2'-H, nucleotide base modifications.
- Sugar modification of nucleic acid molecules are extensively described in the art.
- oligonucleotides Additional modifications and conjugations have been described. Soutschek et al. 2004 Nature 432: 173-178 presented conjugation of cholesterol to the 3’-end of the sense strand of an siRNA molecule by means of a pyrrolidine linker, thereby generating a covalent and irreversible conjugate. Chemical modifications (including conjugation with other molecules) of oligonucleotides may also be made to improve the in vivo pharmacokinetic retention time and efficiency.
- the oligonucleotide fragment(s) and/or oligonucleotide comprises a modified base.
- the disclosure encompasses an oligonucleotide and oligonucleotide fragments with a substitution of a single nucleotide at a given position with a modified version of the same nucleotide.
- a nucleotide (A, G, C or U) can be replaced by a modified base selected from 5-fluorouracil, 5 -bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xantine, 4-acetylcytosine, 5 -(carboxyhydroxylmethyl) uracil, 5- carboxymethylaminomethyl-2 -thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1-methylguanine, 1 -methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3 -methylcytosine, 5 -methylcytosine, N6-adenine, 7-methylguanine, 5 -methylaminomethyluracil, 5- methoxyaminomethyl-2 -thiouracil, beta-D-
- Additional modified variants include the addition of any other moiety (e.g., a radiolabel or other tag or conjugate) to the oligonucleotide or oligonucleotide fragment; provided that the base sequence is identical, the addition of other moieties produces a “modified variant” (with no mismatches).
- any other moiety e.g., a radiolabel or other tag or conjugate
- the oligonucleotide and/or oligonucleotide fragment(s) comprises a modification that causes the oligonucleotide to have increased stability in a biological sample or environment (e.g., cytoplasm, interstitial fluid, blood serum, lung or intestinal lavage).
- a biological sample or environment e.g., cytoplasm, interstitial fluid, blood serum, lung or intestinal lavage.
- the oligonucleotide and/or oligonucleotide fragment(s) comprises a modification that promotes cleavage by the RNA-induced silencing complex (z. e. a “RISC cleavage site”).
- the RISC cleavage site is the site on the target at which cleavage occurs.
- the antisense strand comprises a RISC cleavage site.
- the cleavage site of the antisense strand is typically around the 10, 11 and 12 positions from the 5’-end.
- the term “cleavage region” refers to a region that is located immediately adjacent to the cleavage site. In some embodiments, the cleavage region comprises three bases on either end of, and immediately adjacent to, the cleavage site. In some embodiments, the cleavage region comprises two bases on either end of, and immediately adjacent to, the cleavage site. In some embodiments, the cleavage site specifically occurs at the site bound by nucleotides 10 and 11 of the antisense strand, and the cleavage region comprises nucleotides 11, 12 and 13 of the antisense strand.
- the oligonucleotide fragment(s) and/or oligonucleotide comprises a modified backbone.
- an unmodified backbone consists of 3’ to 5’ phosphodiester bonds.
- a modified backbone may comprise non-natural intemucleoside linkages.
- Oligonucleotides having a modified backbone include those that retain a phosphorus atom in the backbone and those that do not have a phosphorus atom in the backbone.
- Oligonucleotide fragments comprising a modified backbone include, but are not limited to, those that do not have a phosphorus atom in the backbone.
- Modified backbones include, but are not limited to, phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters, aminoalkylphosphotriesters, methyl and other alkyl phosphonates (e.g. 3'-alkylene phosphonates and chiral phosphonates), phosphinates, phosphoramidates (e.g.
- Oligonucleotide fragments comprising a modified backbone that does not include a phosphorus atom therein may have backbones that are formed by short chain alkyl or cycloalkyl intemucleoside linkages, mixed heteroatoms and alkyl or cycloalkyl intemucleoside linkages, or one or more short chain heteroatomic or heterocyclic intemucleoside linkages.
- morpholino linkages formed in part from the sugar portion of a nucleoside
- siloxane backbones sulfide, sulfoxide and sulfone backbones
- formacetyl and thioformacetyl backbones methylene formacetyl and thioformacetyl backbones
- alkene containing backbones sulfamate backbones
- sulfonate and sulfonamide backbones amide backbones; and others having mixed N, O, S and CH2 component parts.
- the oligonucleotide and/or oligonucleotide fragment(s) comprises at least one phosphonate linkage, wherein the phosphonate is a modified phosphonate selected from the group consisting of: phosphorothioate (which may be an Rp isomer or an .S'p isomer):
- methylphosphonate methoxypropylphosphonate : 5 ’ -methylphosphonate : phonate: 5’-phosphorothioate; and peptide nucleic acid:
- the oligonucleotide and/or oligonucleotide fragment(s) comprises: at least one 5’-uridine-adenine-3’ (5’-ua-3’) dinucleotide, wherein the uridine is a 2’-modified nucleotide; at least one 5’-uridine-guanine-3’ (5’-ug-3’) dinucleotide, wherein the 5 ’-uridine is a 2’-modified nucleotide; at least one 5’-cytidine-adenine-3’ (5’-ca-3’) dinucleotide, wherein the 5’-cytidine is a 2’-modified nucleotide; or at least one 5’-uridine- uridine-3’ (5’-uu-3’) dinucleotide, wherein the 5’-uridine is a 2’-modified nucleotide.
- dinucleotide motifs are particularly prone to serum nuclease degradation (e.g. RNase A).
- Chemical modification at the 2'-position of the first pyrimidine nucleotide in the motif prevents or slows down such cleavage.
- This modification recipe is also known under the term 'endo light'.
- the oligonucleotide and/or oligonucleotide fragment(s) comprise a modified nucleobase, wherein the modified nucleobase is difluorotolyl, nitroindolyl, nitropyrrolyl, or nitroimidazolyl. In a particular embodiment, the modified nucleobase is difluorotolyl. In some embodiments, wherein the oligonucleotide and/or oligonucleotide fragment(s) is double-stranded, only one of the two strands contains a modified nucleobase. In some embodiments, wherein the oligonucleotide and/or oligonucleotide fragment(s) is double-stranded, both of the strands contain a modified nucleobase.
- the oligonucleotide fragment(s) and/or oligonucleotide comprises a modified sugar.
- Sugar modifications typically involve chemical modification of the sugar moiety of RNA or DNA.
- Sugar modifications include, but are not limited to, one of the following at the 2’-position: OH; F; O-, S-, orN-alkyl; O-, S-, orN-alkenyl; O-, S- or N- alkynyl; or O-alkyl-O-alkyl, wherein the alkyl, alkenyl and alkynyl can be substituted or unsubstituted Ci to Cio alkyl or C2 to Cio alkenyl and alkynyl.
- Exemplary modifications include O[(CH 2 ) n O] mCHs, O(CH 2 ). n OCH 3 , O(CH 2 ) n NH 2 , O(CH 2 ) n CH 3 , O(CH 2 ) n ONH 2 , and O(CH 2 ) n ON[(CH 2 ) n CH 3 )] 2 , where n and m are from 1 to about 10.
- Oligonucleotide fragments for use in the methods described herein may include one of the following at the 2’ position: Ci to Cio lower alkyl, substituted lower alkyl, alkaryl, aralkyl, O-alkaryl or O-aralkyl, SH, SCH3, OCN, Cl, Br, CN, CF 3 , OCF 3 , SOCH 3 , SO 2 CH 3 , ONO 2 , NO 2 , N 3 , NH 2 , heterocycloalkyl, heterocycloalkaryl, aminoalkylamino, polyalkylamino, substituted silyl, an RNA cleaving group, a reporter group, an intercalator, a group for improving the pharmacokinetic properties of a therapeutic RNA, or a group for improving the pharmacodynamic properties of a therapeutic RNA.
- the modification comprises a 2 ’-methoxy ethoxy (also known as 2 ’-O-(2 -methoxyethyl) or 2’-O-MOE), 2’- dimethylaminooxyethoxy (also known as 2’-DMAOE), and 2 ’-dimethylaminoethoxy ethoxy (also known in the art as 2’-O-dimethylaminoethoxyethyl or 2’-DMAEOE).
- 2 ’-methoxy ethoxy also known as 2 ’-O-(2 -methoxyethyl) or 2’-O-MOE
- 2’- dimethylaminooxyethoxy also known as 2’-DMAOE
- 2 ’-dimethylaminoethoxy ethoxy also known in the art as 2’-O-dimethylaminoethoxyethyl or 2’-DMAEOE
- modifications include: 5’-Me-2’-F nucleotides, 5’-Me-2’-Ome nucleotides, 5’- Me -2 ’-deoxynucleotides, 2 ’-alkoxyalkyl; and 2’-NMA (N-methylacetamide).
- Other modifications include 2’-methoxy (2’-OCH3), 2 ’-aminopropoxy (2’- OCH2CH2CH2NH2) and 2 ’-fluoro (2’-F). Similar modifications can also be made at other positions on an RNA, particularly the 3 ’ position of the sugar on the 3 ’ terminal nucleotide or in 2’-5’ linked dsRNA and the 5’ position of 5’ terminal nucleotide.
- the oligonucleotide fragment(s) and/or oligonucleotide comprises at least one modified nucleotide.
- the modification is selected from the group consisting of: 2’-O-methyl (2’-0me), 2’-flouro (2’-F), 2’-deoxy, 2’- deoxy-2’ -fluoro, 2’-O-methoxyethyl (2’-0-M0E), 2’-O-aminopropyl (2’-O-AP), 2’-O- dimethylaminoethyl (2’-0-DMA0E), 2’-O-dimethylaminopropyl (2’-0-DMAP), 2 -0- dimethylaminoethyloxyethyl (2’-0-DMAE0E), 2’-O-N-methylacetamido (2’-0-NMA), locked nucleic acid (LNA), glycol nucleic acid (GNA), phosphoramidate (e.
- one or more of the oligonucleotide fragments comprises a 2 ’-modification selected from the group consisting of: 2’-0me, 2’-F, and 2’- deoxy.
- the oligonucleotide and/or oligonucleotide fragment(s) comprises one or more 3'-O-methyl nucleotide.
- the oligonucleotide and/or oligonucleotide fragment(s) comprises a 2'-modification selected from the group consisting of: 2'-O-methyl (2’-0Me), 2'- flouro (2’-F), 2'-deoxy, 2'-deoxy-2’-fluoro, 2'-O-methoxyethyl (2'-0-M0E), 2'-O- aminopropyl (2'-O-AP), 2'-O-dimethylaminoethyl (2'-0-DMA0E), 2'-O- dimethylaminopropyl (2'-0-DMAP), 2'-O-dimethylaminoethyloxyethyl (2'-0-DMAE0E), 2'- O-N-methylacetamido (2'-0-NMA), locked nucleic acid (LNA), phosphoramidate (e.g.
- oligonucleotide and/or oligonucleotide fragment(s) comprises one or more 3'-O-methyl nucleotide.
- the oligonucleotide and/or oligonucleotide fragment(s) comprises a bridged nucleic acid.
- the bridged nucleic acid is locked nucleic acid.
- the bridged nucleic acid is constrained ethyl bridged nucleic acid:
- all pyrimidines are 2’ O-methyl- modified nucleosides.
- the sense and/or antisense strand is conjugated to one or more diagnostic compound, reporter group, cross-linking agent, nuclease-resistance conferring moiety, modified or unmodified nucleobase, lipophilic molecule, cholesterol, lipid, lectin, steroid, uvaol, hecigenin, diosgenin, terpene, triterpene, sarsasapogenin, Friedelin, epifriedelanol-derivatized lithocholic acid, vitamin, carbohydrate, dextran, pullulan, chitin, chitosan, synthetic carbohydrate, oligo lactate 15-mer, natural polymer, low- or medium- molecular weight polymer, inulin, cyclodextrin, hyaluronic acid, protein, protein-binding agent, integrin-targeting molecule, polycationic, peptide, polyamine, peptide mimic, and/or transferrin.
- the antisense strand comprises at least one 2’-0Me modified nucleotide. In some embodiments, the antisense strand comprises at least one 2’-F modified nucleotide. In some embodiments, the antisense strand comprises at least one 2’-deoxy modified nucleotide. In some embodiments, the antisense strand comprises at least one 2’- OMe modified nucleotide, at least one 2’-F modified nucleotide, or at least one 2’-deoxy modified nucleotide, or any combination thereof. In some embodiments, the antisense strand comprises alternating 2’-0Me and 2’-F modified nucleotides.
- the antisense strand comprises at least one 5’ vinylphosphonate. In some embodiments, the antisense strand comprises at least one chiral phosphorothioate linkage. In some embodiments, the antisense strand comprises at least one GNA. In some embodiments, the sense strand comprises at least one 2’-0Me modified nucleotide. In some embodiments, the sense strand comprises at least one 2’-F modified nucleotide. In some embodiments, the sense strand comprises at least one 2’-deoxy modified nucleotide.
- the sense strand comprises at least one 2’-0Me modified nucleotide, at least one 2’-F modified nucleotide, or at least one 2’ -deoxy modified nucleotide, or any combination thereof. In some embodiments, the sense strand comprises alternating 2’-0Me and 2’-F modified nucleotides. In some embodiments, the antisense strand and the sense strand each comprise at least one 2’-0Me modified nucleotide. In some embodiments, the antisense strand and the sense strand each comprise at least one 2’-F modified nucleotide.
- the antisense strand and the sense strand each comprise alternating 2’-0Me and 2’-F modified nucleotides.
- the sense strand comprises at least one 5’ vinylphosphonate.
- the sense strand comprises at least one chiral phosphorothioate linkage.
- the sense strand comprises at least one GNA.
- the sense strand comprises alternating 2’-0Me and 2’-F modified nucleotides over the full length of the sense strand. In some embodiments, the sense strand comprises alternating 2’-0Me and 2’-F modified nucleotides over part of the length of the sense strand e.g. over at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or more nucleotides of the sense strand.
- the antisense strand comprises alternating 2’-OMe and 2’-F modified nucleotides over the full length of the antisense strand. In some embodiments, the antisense strand comprises alternating 2’-OMe and 2’-F modified nucleotides over part of the length of the antisense strand e.g. over at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16,
- nucleotides of the antisense strand 17, 18, 19, 20 or more nucleotides of the antisense strand.
- the sense strand and antisense strand each comprise alternating 2’-OMe and 2’-F modified nucleotides over the full length of the sense strand and the antisense strand. In some embodiments, the sense strand and the antisense strand comprise alternating 2’-OMe and 2’-F modified nucleotides over part of the length of the sense strand and the antisense strand e.g. over at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17,
- nucleotides of the sense strand and the antisense strand 18, 19, 20 or more nucleotides of the sense strand and the antisense strand.
- one or more of the oligonucleotide fragments is conjugated to at least one ligand.
- the oligonucleotide product is conjugated to at least one ligand.
- the ligand may be conjugated to the sense strand, antisense strand or both strands, in any configuration e.g. at the 3 ’-end, 5 ’-end, non-end or a combination.
- the ligand comprises one or more N-Acetylgalactosamine (GalNAc) derivatives. In some embodiments, the ligand comprises one or more GalNAc derivatives conjugated through a bivalent or trivalent branched carrier. In some embodiments, the ligand is:
- the ligand is:
- the ligand is: In some embodiments, the ligand is:
- the ligand is:
- the ligand is:
- a ligand alters the distribution, targeting or lifetime of the molecule into which it is incorporated.
- a ligand provides an enhanced affinity for a selected target, e.g., molecule, cell or cell type, compartment, receptor e.g., a cellular or organ compartment, tissue, organ or region of the body, as, e.g., compared to a species absent such a ligand.
- Ligands providing enhanced affinity for a selected target are also termed targeting ligands.
- Some ligands can have endosomolytic properties.
- the endosomolytic ligands promote the lysis of the endosome and/or transport of the oligonucleotide, or a composition comprising the oligonucleotide, from the endosome to the cytoplasm of the cell.
- the endosomolytic ligand may be a polyanionic peptide or peptidomimetic which shows pH- dependent membrane activity and fusogenicity. In some embodiments, the endosomolytic ligand assumes its active conformation at endosomal pH.
- the “active” conformation is that conformation in which the endosomolytic ligand promotes lysis of the endosome and/or transport of the oligonucleotide, or a composition comprising the oligonucleotide, from the endosome to the cytoplasm of the cell.
- exemplary endosomolytic ligands include the GALA peptide (Subbarao et al., Biochemistry, 1987, 26: 2964-2972), the EALA peptide (Vogel et al., J. Am. Chem. Soc., 1996, 118: 1581-1586), and their derivatives (Turk et al., Biochem. Biophys. Acta, 2002, 1559: 56-68).
- the endosomolytic component may contain a chemical group (e.g., an amino acid) which will undergo a change in charge or protonation in response to a change in pH.
- the endosomolytic component may be linear or branched.
- Ligands can improve transport, hybridization, and specificity properties and may also improve nuclease resistance of the resultant natural or modified oligonucleotide.
- Ligands in general can include therapeutic modifiers, e.g., for enhancing uptake; diagnostic compounds or reporter groups e.g., for monitoring distribution; cross-linking agents; and nuclease-resistance conferring moieties.
- therapeutic modifiers e.g., for enhancing uptake
- diagnostic compounds or reporter groups e.g., for monitoring distribution
- cross-linking agents e.g., for monitoring distribution
- nuclease-resistance conferring moieties lipids, steroids, vitamins, sugars, proteins, peptides, polyamines, and peptide mimics.
- Ligands can include a naturally occurring substance, such as a protein (e.g., human serum albumin (HSA), low-density lipoprotein (LDL), high-density lipoprotein (HDL), or globulin); a carbohydrate (e.g., a dextran, pullulan, chitin, chitosan, inulin, cyclodextrin or hyaluronic acid); or a lipid.
- the ligand may also be a recombinant or synthetic molecule, such as a synthetic polymer, e.g., a synthetic polyamino acid, an oligonucleotide (e.g., an aptamer).
- polyamino acids examples include polylysine (PLL), poly L aspartic acid, poly L-glutamic acid, styrene-maleic acid anhydride copolymer, poly(L-lactide-co-glycolied) copolymer, divinyl ether-maleic anhydride copolymer, N-(2-hydroxypropyl)methacrylamide copolymer (HMPA), polyethylene glycol (PEG), polyvinyl alcohol (PVA), polyurethane, poly(2-ethylacryllic acid), N-isopropylacrylamide polymers, or polyphosphazine.
- PLL polylysine
- poly L aspartic acid poly L-glutamic acid
- styrene-maleic acid anhydride copolymer examples include poly(L-lactide-co-glycolied) copolymer, divinyl ether-maleic anhydride copolymer, N-(2-hydroxypropyl)methacrylamide copoly
- polyamines examples include: polyethylenimine, polylysine (PLL), spermine, spermidine, polyamine, pseudopeptide-polyamine, peptidomimetic polyamine, dendrimer polyamine, arginine, amidine, protamine, cationic lipid, cationic porphyrin, quaternary salt of a polyamine, or an alpha helical peptide .
- Ligands can also include targeting groups, e.g., a cell or tissue targeting agent, e.g., a lectin, glycoprotein, lipid or protein, e.g., an antibody, that binds to a specified cell type.
- a targeting group can be a thyrotropin, melanotropin, lectin, glycoprotein, surfactant protein A, Mucin carbohydrate, multivalent lactose, multivalent galactose, N-acetyl-galactosamine, N- acetyl-gulucosamine multivalent mannose, multivalent fucose, glycosylated polyaminoacids, multivalent galactose, transferrin, bisphosphonate, polyglutamate, polyaspartate, a lipid, cholesterol, a steroid, bile acid, folate, vitamin B12, biotin, an RGD peptide, an RGD peptidomimetic or an aptamer.
- ligands include dyes, intercalating agents (e.g., acridines), crosslinkers (e.g., psoralene, mitomycin C), porphyrins (TPPC4, texaphyrin, Sapphyrin), polycyclic aromatic hydrocarbons (e.g., phenazine, dihydrophenazine), artificial endonucleases or a chelator (e.g., EDTA), lipophilic molecules, e.g., cholesterol, cholic acid, adamantane acetic acid, 1 -pyrene butyric acid, dihydrotestosterone, 1,3-Bis- O(hexadecyl)glycerol, geranyloxyhexyl group, hexadecylglycerol, borneol, menthol, 1,3- propanediol, heptadecyl group, palmitic acid, myristic acid,O3-(ol),
- Ligands can be proteins, e.g., glycoproteins, or peptides, e.g., molecules having a specific affinity for a co-ligand, or antibodies e.g., an antibody, that binds to a specified cell type such as a cancer cell, endothelial cell, or bone cell.
- Ligands may also include hormones and hormone receptors. They can also include non-peptidic species, such as lipids, lectins, carbohydrates, vitamins, cofactors, multivalent lactose, multivalent galactose, N-acetyl- galactosamine, N-acetyl-gulucosamine multivalent mannose, multivalent fucose, or aptamers.
- the ligand can be, for example, a lipopolysaccharide, an activator of p38 MAP kinase, or an activator of NF -KB .
- the ligand is a lipid or lipid-based molecule.
- a lipid or lipid-based molecule preferably binds a serum protein, e.g., human serum albumin (HSA).
- HSA binding ligand allows for distribution of the conjugate to a target tissue.
- a lipid or lipid-based ligand can (a) increase resistance to degradation of the conjugate, (b) increase targeting or transport into a target cell or cell membrane, and/or (c) can be used to adjust binding to a serum protein, e.g., HSA.
- a lipid based ligand can be used to modulate, e.g., control the binding of the conjugate to a target tissue.
- the ligand is a peptide or a peptidomimetic.
- a peptidomimetic is a molecule capable of folding into a defined three-dimensional structure similar to a natural peptide.
- the peptide or peptidomimetic moiety can be about 5-50 amino acids long, e.g., about 5, 10, 15, 20, 25, 30, 35, 40, 45, or 50 amino acids long.
- a peptide or peptidomimetic can be, for example, a cell permeation peptide, cationic peptide, amphipathic peptide, or hydrophobic peptide (e.g., consisting primarily of Tyr, Trp or Phe).
- the peptide moiety can be a dendrimer peptide, constrained peptide or crosslinked peptide.
- the peptide moiety can include a hydrophobic membrane translocation sequence (MTS).
- MTS membrane translocation sequence
- the peptide moiety can be a “delivery” peptide, which can carry large polar molecules including peptides, oligonucleotides, and protein across cell membranes.
- a peptide or peptidomimetic can be encoded by a random sequence of DNA, such as a peptide identified from a phagedisplay library, or one-bead-one-compound (OBOC) combinatorial library (Lam et al., Nature, 354:82-84, 1991).
- a “peptide moiety” can range in length from about 5 amino acids to about 50 amino acids.
- the peptide moieties can have a structural modification, such as to increase stability or direct conformational properties. Any of the structural modifications described below can be utilized.
- An arginine-glycine-aspartic acid (RGD)-peptide moiety can be used to target a tumor cell, such as an endothelial tumor cell or a breast cancer tumor cell (Zitzmann et al., Cancer Res., 62:5139-43, 2002).
- RGD peptide can facilitate targeting of an oligonucleotide to tumors of a variety of other tissues, including the lung, kidney, spleen, or liver (Aoki et al., Cancer Gene Therapy 8:783-787, 2001).
- the RGD peptide can be linear or cyclic, and can be modified, e.g., glycosylated or methylated to facilitate targeting to specific tissues.
- Peptides that target markers enriched in proliferating cells can be used.
- RGD containing peptides and peptidomimetics can target cancer cells, in particular cells that exhibit an integrin.
- the ligand may comprise RGD peptides, cyclic peptides containing RGD, RGD peptides that include D-amino acids, or synthetic RGD mimics.
- Peptide and peptidomimetic ligands include those having naturally occurring or modified peptides, e.g., D or L peptides; a, , or y peptides; N-methyl peptides; azapeptides; peptides having one or more amide, i.e., peptide, linkages replaced with one or more urea, thiourea, carbamate, or sulfonyl urea linkages; or cyclic peptides .
- Ligands can be coupled to the oligonucleotide fragment(s) and/or oligonucleotide at various places, for example, 3 ’-end, 5 ’-end, and/or at an internal (“non-end”) position.
- the ligand is attached via an intervening tether, e.g., a carrier described herein.
- the ligand or tethered ligand may be present on a monomer when the monomer is incorporated into the oligonucleotide fragment(s) and/or oligonucleotide.
- the ligand may be incorporated via coupling to a “precursor” monomer after the “precursor” monomer has been incorporated into the oligonucleotide fragment and/or the oligonucleotide.
- a monomer having, e.g., an amino-terminated tether (i.e., having no associated ligand), e.g., TAP-(CH2)nNH2 may be incorporated into a growing oligonucleotide fragment.
- a ligand having an electrophilic group e.g., a pentafluorophenyl ester or aldehyde group
- a ligand having an electrophilic group can subsequently be attached to the precursor monomer by coupling the electrophilic group of the ligand with the terminal nucleophilic group of the precursor monomer’s tether.
- a monomer having a chemical group suitable for taking part in Click Chemistry reaction may be incorporated, e.g., an azide or alkyne terminated tether/linker.
- a ligand having complementary chemical group e.g. an alkyne or azide can be attached to the precursor monomer by coupling the alkyne and the azide together.
- the ligand is conjugated to nucleobases, sugar moieties, or intemucleosidic linkages of the oligonucleotide fragment(s) and/or oligonucleotide.
- Conjugation to purine nucleobases or derivatives thereof can occur at any position including, endocyclic and exocyclic atoms.
- the 2-, 6-, 7-, or 8-positions of a purine nucleobase are attached to a conjugate moiety.
- Conjugation to pyrimidine nucleobases or derivatives thereof can also occur at any position.
- the 2-, 5-, and 6- positions of a pyrimidine nucleobase can be substituted with a conjugate moiety.
- Conjugation to sugar moieties of nucleosides can occur at any carbon atom.
- Example carbon atoms of a sugar moiety that can be attached to a conjugate moiety include the 2', 3', and 5' carbon atoms.
- the T position can also be attached to a conjugate moiety, such as in an abasic residue.
- Intemucleosidic linkages can also bear conjugate moieties.
- phosphorus- containing linkages e.g., phosphodiester, phosphorothioate (e.g.
- the conjugate moiety can be attached directly to the phosphorus atom or to an O, N, or S atom bound to the phosphorus atom.
- the conjugate moiety can be attached to the nitrogen atom of the amine or amide or to an adjacent carbon atom.
- the ligand is conjugated to the sense strand. In some embodiments, the ligand is conjugated to the 3’ end of the sense strand. In some embodiments, the ligand is conjugated to the 5’ end of the sense strand. In some embodiments, the ligand is conjugated to a non-end of the sense strand.
- the ligand is conjugated to the antisense strand. In some embodiments, the ligand is conjugated to the 3’ end of the antisense strand. In some embodiments, the ligand is conjugated to a non-end of the antisense strand.
- the ligand may be attached via a carrier.
- the carriers include (i) at least one “backbone attachment point,” preferably two “backbone attachment points” and (ii) at least one “tethering attachment point.”
- a “backbone attachment point” as used herein refers to a functional group, e.g. a hydroxyl group, or generally, a bond available for, and that is suitable for incorporation of the carrier into the backbone, e.g., the phosphate, or modified phosphate, e.g., sulfur containing, backbone, of a nucleic acid.
- a “tethering attachment point” in some embodiments refers to a constituent ring atom of the cyclic carrier, e.g., a carbon atom or a heteroatom (distinct from an atom which provides a backbone attachment point), that connects a selected moiety.
- the moiety can be, e.g., a carbohydrate, e.g. monosaccharide, disaccharide, trisaccharide, tetrasaccharide, oligosaccharide and polysaccharide.
- the selected moiety is connected by an intervening tether to the cyclic carrier.
- the cyclic carrier will often include a functional group, e.g., an amino group, or generally, provide a bond, that is suitable for incorporation or tethering of another chemical entity, e.g., a ligand to the constituent ring.
- a functional group e.g., an amino group
- another chemical entity e.g., a ligand to the constituent ring.
- the sense and/or antisense strand may be conjugated to a ligand via a carrier, wherein the carrier can be cyclic group or acyclic group; preferably, the cyclic group is selected from pyrrolidinyl, pyrazolinyl, pyrazolidinyl, imidazolinyl, imidazolidinyl, piperidinyl, piperazinyl, [l,3]dioxolane, oxazolidinyl, isoxazolidinyl, morpholinyl, thiazolidinyl, isothiazolidinyl, quinoxalinyl, pyridazinonyl, tetrahydrofuryl and and decalin; preferably, the acyclic group is selected from serinol backbone or diethanolamine backbone.
- one or more oligonucleotide fragments comprise the sequence “TT”, “dTdT”, “dTsdT” or “UU” as a single-stranded overhang at the 3’ end, also termed herein a terminal dinucleotide or 3’ terminal dinucleotide.
- dT is 2'-deoxy-thymidine-5’- phosphate
- sdT is 2'-deoxy Thymidine 5'-phosphorothioate.
- Terminal dinucleotide “UU” is UU or 2’-0Me-U 2’-0Me-U, and the terminal TT and the terminal UU can be in the inverted/reverse orientation.
- the terminal dinucleotide (e.g., UU) is a modified variant of the dithymidine dinucleotide commonly placed as an overhang to protect the ends of siRNAs from nucleases (see, for example, Elbashir et al. 2001 Nature 411: 494-498; Elbashir et al. 2001 EMBO J. 20: 6877-6888; and Kraynack et al. 2006 RNA 12: 163-176).
- a terminal dinucleotide is known from these references to enhance nuclease resistance but not contribute to target recognition.
- one or both terminal oligonucleotide fragments comprise a 3 ’ end cap instead of or in addition to a terminal dinucleotide to stabilize the end from nuclease degradation provided that the 3’ end cap is able to both stabilize the oligonucleotide (e.g., against nucleases) and not interfere excessively with its desired activity.
- the sense and/or antisense strand may be conjugated to a ligand via a carrier, wherein the carrier can be cyclic group or acyclic group; preferably, the cyclic group is selected from pyrrolidinyl, pyrazolinyl, pyrazolidinyl, imidazolinyl, imidazolidinyl, piperidinyl, piperazinyl, [l,3]dioxolane, oxazolidinyl, isoxazolidinyl, morpholinyl, thiazolidinyl, isothiazolidinyl, quinoxalinyl, pyridazinonyl, tetrahydrofuryl and and decalin; preferably, the acyclic group is selected from serinol backbone or diethanolamine backbone.
- Embodiment 1 An engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NOs: 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350,
- dsRNA double-stranded RNA
- engineered dsRNA ligase polypeptide (a) has dsRNA ligase activity;
- Embodiment 2 An engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NOs: 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350,
- dsRNA double-stranded RNA
- Embodiment 3 The engineered dsRNA ligase polypeptide of Embodiment 1, wherein the polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 370, 488, 526, 578, 588, 590, and 592.
- Embodiment 4 The engineered dsRNA ligase polypeptide of Embodiment 2, wherein the polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 370, 488, 526, 578, 588, 590, 592 and 666.
- Embodiment 5 An engineered dsRNA ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to SEQ ID NO: 302, which produces at least 5% more oligonucleotide product than a dsRNA ligase polypeptide comprising the amino acid sequence of SEQ ID NO: 302 under the same ligation reaction conditions, wherein the engineered dsRNA ligase polypeptide does not the comprise the amino acid sequence of SEQ ID NO: 302.
- Embodiment 7 The engineered dsRNA ligase polypeptide of Embodiment 5 or 6, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X6, X7, X15, X19, X29, X36, X39, X46, X47, X49, X51, X53, X56, X57, X60, X63, X64, X66, X67, X87, X88, X91, X93, X103, X105, X107, XI 14, X122, X126, X129, X130, X131, X137, X144, X146, X158, X163, X173, X178, X190, X196, X216, X218, X221,
- Embodiment 8 The engineered dsRNA ligase polypeptide of Embodiment 7, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X6 is G; X7 is Q; X15 is R, D or E; X19 is Q or D; X29 is N or L; X36 is V; X39 is A; X46 is Y; X47 is E; X49 is G; X51 is L; X53 is Y; X56 is R or A; X57 is S; X60 is T, G or P; X63 is S, Q or G; X64 is R, T, Q, F, G, or M; X66 is F or W; X67 is N; X87 is T, P, K or absent; X88 is C; X91 is S; X93 is G, C, or A; X103 is V, C, Y, or T;
- Embodiment 9 The engineered dsRNA ligase polypeptide of any one of Embodiments 5-8, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X6, X7, X15, X19, X29, X36, X39, X44, X45, X46, X47, X49, X51, X53, X56, X57, X60, X63, X64, X66, X67, X87, X88, X89, X91, X92, X93, X103, X105, X107, X114, X122, X126, X129, X130, X131, X137, X144, X146, X158,
- Embodiment 10 The engineered dsRNA ligase polypeptide of Embodiment 9, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X6 is G or E; X7 is Q; X15 is R, D or E; X19 is Q or D; X29 is N or L; X36 is V; X39 is A; X44 is V; X45 is V; X46 is Y; X47 is E; X49 is G; X51 is L; X53 is Y; X56 is R or A; X57 is S; X60 is T, G or P; X63 is S, Q or G; X64 is R, T, Q, F, G, or M; X66 is F or W; X67 is N; X87 is T, P, K or absent; X88 is C; X89 is T; X91 is S; X92 is D
- Embodiment 11 The engineered dsRNA ligase polypeptide of any one of Embodiments 5-10, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X15, X19, X36, X39, X53, X218, X221, X237, X251, X255, and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity.
- Embodiment 12 The engineered dsRNA ligase polypeptide of Embodiment 11, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X15 is D or E; X19 is D; X36 is V; X39 is A; X53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A; wherein the numbering refers to SEQ ID NO: 302.
- Embodiment 13 The engineered dsRNA ligase polypeptide of any one of Embodiments 5-12, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X15, X19, X36, X39, X53, X185, X218, X221, X237, X251, X255, and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity.
- Embodiment 14 The engineered dsRNA ligase polypeptide of Embodiment 13, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X15 is D or E; X19 is D; X36 is V; X39 is A; X53 is Y; X185 is K; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A; wherein the numbering refers to SEQ ID NO: 302.
- Embodiment 15 The engineered dsRNA ligase polypeptide of any one of Embodiments 5-14, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X36, X39, X218 and X221, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X36 is V; X39 is A; X218 is N; and X221 is I.
- Embodiment 16 The engineered dsRNA ligase polypeptide of any one of Embodiments 5-15, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X39, X218 and X221, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X39 is A; X218 is N; and X221 is l.
- Embodiment 17 The engineered dsRNA ligase polypeptide of any one of Embodiments 5-16, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X39, X218, X221 and X255, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X39 is A; X218 is N; X221 is I; and X255 is C.
- Embodiment 18 The engineered dsRNA ligase polypeptide of any one of Embodiments 5-17, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X39, X53, X218, X221, X237, X251, X255 and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X39 is A; X 53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A.
- Embodiment 19 The engineered dsRNA ligase polypeptide of any one of Embodiments 5-18, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X15, X39, X53, X218, X221, X237, X251, X255 and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X15 is E; X39 is A; X 53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285
- Embodiment 20 The engineered dsRNA ligase polypeptide of any one of Embodiments 5-19, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X19, X39, X53, X218, X221, X237, X251, X255 and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X19 is D; X39 is A; X 53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285
- Embodiment 21 The engineered dsRNA ligase polypeptide of any one of Embodiments 5-20, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X15, X39, X53, X185, X218, X221, X237, X251, X255 and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X15 is D; X39 is A; X 53 is Y; X185 is K; X218 is N; X221 is I; X237 is R; X251 is L;
- Embodiment 22 The engineered dsRNA ligase polypeptide of any of Embodiments 1- 21, wherein the engineered dsRNA ligase polypeptide comprises a purification tag.
- Embodiment 23 The engineered dsRNA ligase polypeptide of Embodiment 22, comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140,
- Embodiment 24 The engineered dsRNA ligase polypeptide of Embodiment 22, comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140,
- Embodiment 25 A polypeptide immobilized on a solid material by chemical bond or a physical adsorption method, wherein the polypeptide comprises an engineered dsRNA ligase polypeptide according to any one of Embodiments 1-24.
- Embodiment 26 A polynucleotide encoding the engineered dsRNA ligase polypeptide of any one of Embodiments 1-24.
- Embodiment 27 The polynucleotide of Embodiment 26, wherein the polynucleotide sequence is SEQ ID NO: 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165,
- Embodiment 28 The polynucleotide of Embodiment 26, wherein the polynucleotide sequence is SEQ ID NO: 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143
- Embodiment 29 An expression vector comprising the polynucleotide according to any one of Embodiments 26-28.
- Embodiment 30 The expression vector of Embodiment 29, which comprises a plasmid, a cosmid, a bacteriophage or a viral vector.
- Embodiment 31 A host cell comprising the polynucleotide of any one of Embodiments 26-28 or the expression vector of Embodiment 29 or 30, optionally wherein the host cell is E. coli.
- Embodiment 32 A method of preparing an engineered dsRNA ligase polypeptide, which comprises the steps of culturing the host cell according to Embodiment 31 and obtaining an engineered dsRNA ligase polypeptide from the culture.
- Embodiment 33 A method of preparing an engineered dsRNA ligase polypeptide, which comprises the steps of culturing the host cell according to Embodiment 31 and obtaining an engineered dsRNA ligase polypeptide from the culture.
- An engineered dsRNA ligase catalyst obtainable by culturing the host cells according to Embodiment 31, or according to the method of Embodiment 32, wherein said engineered dsRNA ligase catalyst comprises cells or culture fluid containing the engineered dsRNA ligase polypeptides, or an article processed therewith, wherein the article refers to an extract obtained from the culture of host cell, an isolated product obtained by isolating or purifying an engineered dsRNA ligase from the extract, or an immobilized product obtained by immobilizing host cell, an extract thereof, or isolated product of the extract.
- Embodiment 34 A method of producing an oligonucleotide from two or more oligonucleotide fragments, wherein the method comprises contacting:
- Embodiment 35 The method of Embodiment 34, wherein the source of ATP comprises ATP.
- Embodiment 36 The method of Embodiment 34 or 35, wherein the source of ATP comprises:
- Embodiment 37 The method of Embodiment 36, wherein the PPK is selected from PPK12 or ajPAP.
- Embodiment 38 The method of any one of Embodiments 36 or 37, wherein the method is performed using a sub-stoichiometric concentration of AMP and/or ATP.
- Embodiment 39 The method of any one of Embodiments 36-38, wherein the polyphosphate is a polyphosphate salt.
- Embodiment 40 The method of Embodiment 39, wherein the polyphosphate salt is sodium polyphosphate (Maddrell’s salt) or sodium hexametaphosphate (Graham’s salt).
- the polyphosphate salt is sodium polyphosphate (Maddrell’s salt) or sodium hexametaphosphate (Graham’s salt).
- Embodiment 41 The method of any one of Embodiments 34-40, wherein the divalent cation cofactor is Mg 2+ or Mn 2+ .
- Embodiment 42 The method of any one of Embodiments 34-41, wherein the method is performed with a divalent cation concentration of 5-100 mM, optionally 30-50 mM.
- Embodiment 43 The method of any one of Embodiments 34-42, further comprising a step of purifying the oligonucleotide.
- Embodiment 44 Use of the engineered dsRNA ligase polypeptide according to any one of Embodiments 1-24 in the production of an oligonucleotide from two or more oligonucleotide fragments.
- Embodiment 45 The method of any one of Embodiments 34-43 or the use of
- Embodiment 44 wherein the oligonucleotide is up to 60 nucleotides in length.
- Embodiment 46 The method of any one of Embodiments 34-43 or 45 or the use of Embodiment 44 or 45, wherein each of the oligonucleotide fragments are 4-16 nucleotides in length, optionally 6-9 nucleotides in length.
- Embodiment 47 The method of Embodiment 34-43, 45 or 46 or the use of any one of Embodiments 44-46, wherein one or more of the oligonucleotide fragment(s) comprises one or two overhangs.
- Embodiment 48 The method of any one of Embodiments 34-43 or 45-47 or the use of any one of Embodiments 44-47, wherein one or more of the oligonucleotide fragments comprises a chemical modification.
- Embodiment 49 The method or use of Embodiment 48, wherein the chemical modification is selected from:
- a modified backbone optionally selected from a phosphorothioate (e.g. chiral phosphorothioate) or methylphosphonate intemucleotide linkage;
- a phosphorothioate e.g. chiral phosphorothioate
- methylphosphonate intemucleotide linkage optionally selected from a phosphorothioate (e.g. chiral phosphorothioate) or methylphosphonate intemucleotide linkage
- a modified nucleotide optionally selected from 2'-O-methyl (2’-0Me), 2'-flouro (2’-F), 2'-deoxy, 2'-deoxy-2’-fluoro, 2'-O-methoxyethyl (2'-0-M0E), 2'-O- aminopropyl (2'-O-AP), 2'-O-dimethylaminoethyl (2'-0-DMA0E), 2'-O- dimethylaminopropyl (2'-0-DMAP), 2'-O-dimethylaminoethyloxyethyl (2'-O- DMAEOE), 2'-O-N-methylacetamido (2'-0-NMA), locked nucleic acid (LNA), glycol nucleic acid (GNA), phosphoramidate (e.g.
- GalNAc N- Acetylgalactosamine
- Embodiment 50 A composition comprising: i. the engineered dsRNA ligase polypeptide according to any one of Embodiments 1- 24; ii. a source of ATP; and iii. a divalent cation.
- Embodiment 51 The composition of Embodiment 50, further comprising two or more oligonucleotide fragments.
- Embodiment 52 A kit comprising: i. the engineered dsRNA ligase polypeptide according to any one of Embodiments 1- 24; ii. a source of ATP; iii. a divalent cation; and iv. instructions for use in a method of producing an oligonucleotide from two or more oligonucleotide fragments.
- Embodiment 53 The composition of Embodiment 50 or 51 or the kit of Embodiment 52, wherein the source of ATP comprises ATP.
- Embodiment 54 The composition of any one of Embodiments 50, 51 or 53 or the kit of Embodiment 52 or 53, wherein the source of ATP comprises:
- Embodiment 55 The composition or kit of Embodiment 54, wherein the PPK is selected from PPK 12 or ajPAP.
- Embodiment 56 The composition of any one of Embodiments 50, 51 or 53-55 or the kit of any one of Embodiments 52-55, wherein the polyphosphate is a polyphosphate salt.
- Embodiment 57 The composition or kit of Embodiment 56, wherein the polyphosphate salt is sodium polyphosphate (Maddrell’s salt) or sodium hexametaphosphate (Graham’s salt).
- Embodiment 58 The composition of any one of Embodiments 50, 51 or 53-57 or the kit of any one of Embodiments 52-57, wherein the divalent cation cofactor is Mg 2+ or Mn 2+ .
- ppm parts per million
- M molar
- mM millimolar
- uM and pM micromolar
- nM nanomolar
- mol molecular weight
- gm and g gram
- mg milligrams
- ug and pg micrograms
- L and 1 liter
- ml and mb milliliter
- cm centimeters
- mm millimeters
- um and pm micrometers
- coli W3110 (commonly used laboratory E. coli strain, available from the Coli Genetic Stock Center [CGSC], New Haven, CT); HTP (high throughput); HPLC (high pressure liquid chromatography); FIOP (fold improvements over positive control); Microfluidics (Microfluidics, Corp., Westwood, MA); Sigma-Aldrich (Sigma-Aldrich, St. Louis, MO; Difco (Difco Laboratories, BD Diagnostic Systems, Detroit, MI); Agilent (Agilent Technologies, Inc., Santa Clara, CA); Coming (Coming, Inc., Palo Alto, CA); Dow Coming (Dow Coming, Corp., Midland, MI); and Gene Oracle (Gene Oracle, Inc., Mountain View, CA).
- CGSC Coli Genetic Stock Center
- HTP high throughput
- HPLC high pressure liquid chromatography
- FIOP fold improvements over positive control
- Microfluidics Microfluidics (Microfluidics, Corp., Westwood, MA);
- RNA (1) sequences of the oligonucleotides referred to in parentheses (e.g. “siRNA (1)” and “oligonucleotide (2)") throughout the Examples are provided in Table 1.
- Polynucleotides encoding the polypeptides having ligase activity were cloned into the pCKl 10900 vector system (See e.g., US Pat. App. No. 2006/0195947A1 FIG. 3 which is hereby incorporated by reference in its entirety) and subsequently expressed in E. coli W311O 7n/A under the control of the lac promoter.
- the expression vector also contained the Pl 5a origin of replication and the chloramphenicol (CAM) resistance gene.
- E. coli W311O 7IMA cells were transformed with the pCKl 10900 plasmid containing the ligase-encoding genes.
- Transformed cells were plated out on Lysogeny broth (LB) agar plates containing 1% glucose and 30 pg/mL CAM, and grown overnight at 37° C. Subsequently single colonies were inoculated into 25 mL of LB supplemented with 30 pg/mL CAM and 1% glucose in a 250 ml baffled shake flask. The culture was grown overnight (16-20 hours and optical density (ODeoo) >3.8) in an incubator at 37°C, with shaking at 250 rpm.
- IPTG isopropyl-P-D-thiogalactoside
- the cell pellet was resuspended in 30 mb of 50 mM Tris-buffer at pH 7.5 and lysed using a LM20 MICROFLUIDIZER® processor system (Microfluidics). Cell debris was removed by centrifugation at 14,000 rpm for 30 minutes at 4°C. Ligase enzymes were then isolated from the clarified lysate using standard techniques known in the art, including immobilized metal affinity chromatography.
- siRNA (1) comprising of oligonucleotides (2) and (3)
- siRNA (4) comprising of oligonucleotides (3) and (5).
- Oligonucleotide (5) has the same sequence as oligonucleotide (2) but does not contain a 3’- GalNAc moiety.
- siRNAs (1) and (4) and oligonucleotides (2), (3) and (5) are depicted in Figure 1; and the sequences of oligonucleotides (2), (3) and (5) are provided in Table 1.
- SEQ ID NO: 2 for the production of siRNA (1) was subsequently confirmed using multiple enzyme preparations including isolated enzyme (example 1), clarified lysate (example 4) and shake flask powder (SFP; example 5).
- isolated enzyme example 1
- clarified lysate example 4
- shake flask powder SFP; example 5
- Single colonies were picked in a 96-well format and grown in 190 pL LB media containing 1% glucose and 30 pg/mL CAM, at 30°C, 200 rpm, and 85% humidity. Following overnight growth, 20 pL of the grown cultures were transferred into a deep well plate containing 380 pL of TB media with 30 pg/mL CAM. The cultures were grown at 30°C, 250 rpm, with 85% humidity for approximately 2.5 hours. When the ODeoo of the cultures reached 0.4-0.8, expression of the ligase gene was induced by the addition of IPTG to a final concentration of 1 mM. Following induction, growth continued for 18-20 hours at 30°C, 250 rpm with 85% humidity. Cells were harvested by centrifugation at 4,000 rpm and 4°C for 10 minutes; the supernatant was then discarded. The cell pellets were stored at -80°C until ready for use.
- the cell pellets Prior to performing the assay, the cell pellets were thawed and resuspended in 300 pL of lysis buffer (containing 1 g/L lysozyme, 0.5 g/L PMBS and 0.1 pL/mL or 0.2U/ml of commercial DNAse (New England BioLabs, M0303L) in 50 mM Tris-buffer at pH 7.5.
- the plates were agitated with medium-speed shaking for 2.5 hours on a microtiter plate shaker at room temperature. The plates were then centrifuged at 4,000 rpm for 10 minutes at 4°C, and the clarified supernatants were used in the HTP assay reaction for activity determination as described in the following examples.
- Shake-flask procedures can be used to generate engineered dsRNA ligase polypeptide shake-flask powders (SFP), which are useful for secondary screening assays and/or use in the biocatalytic processes described herein.
- Shake flask powder preparation of enzymes provides a more concentrated preparation of the engineered enzyme, as compared to the cell lysate used in HTP assays.
- Clarified lysate produced according to example 1 was collected, frozen at -80°C, and then lyophilized, using standard methods known in the art. Lyophilization of frozen clarified lysate provides a dry SFP comprising crude wild-type or engineered dsRNA ligase polypeptide.
- Table 6-1 HPUC method 1 used for activity determination.
- HPLC method 2 (Table 6-2) was developed from HPLC method 1 (Table 6-1) to improve the separation between the product oligonucleotide (3) and the substrate oligonucleotide (12).
- Table 6-2 HPLC method 1 used for activity determination.
- RF-MS RapidFire® Mass Spectrometry
- Table 6-3 The activity improvements of the engineered dsRNA ligases of Example 12 were also analyzed with RapidFire® Mass Spectrometry (RF-MS) using the method described in Table 6-3.
- RF-MS aims to reduce the analytical time compared to HPLC analysis.
- the selective detection of product oligonucleotides (2) and (3) is obtained by the specific masses of each product oligonucleotide analyzed under the multi-single ion monitoring (SIM) mode.
- SIM multi-single ion monitoring
- Relative dsRNA ligase activity is determined by comparing the sum of the MS signal of the five specific masses (given in Table 6-3) corresponding with each product oligonucleotide (2) and (3).
- Table 6-3 RF-MS method used for activity determination.
- the engineered polynucleotide (SEQ ID NO: 1) encoding the polypeptide with dsRNA ligase activity of SEQ ID NO: 2 was used to generate the engineered polypeptides of Table 7-
- polypeptides displayed improved dsRNA ligase activity under the desired conditions e.g., the improvement in the formation of either oligonucleotide products (2) or (3), or preferably both oligonucleotide products (2) and (3) that was produced in situ from the substrate oligonucleotides (6-7, 9-12) as compared to the starting polypeptide.
- Some polypeptides displayed improved product formation of either oligonucleotide product (2) or (3), or both oligonucleotide products (2) and (3) compared to the starting polypeptide as noted in Table 7-1.
- the sequences of oligonucleotides (2), (3), (6), (7) and (9-12) are provided in Table 1.
- the engineered polypeptides having the amino acid sequences of even-numbered sequence identifiers were generated from the “backbone” amino acid sequence of SEQ ID NO:
- the enzyme assays were carried out in 96-well PCR plates, in 50 pL total reaction volume per well.
- the reactions contained 2.5 % (v/v) of undiluted dsRNA ligase lysate, prepared as described in Example 4, 100 pM (each) substrate oligonucleotides (6-7, 9-12), 50 mM Tris-buffer at pH 7.5, 1 mM ATP, 10 mM MgCh and 5 mM DTT.
- the reaction plates were heat-sealed and incubated in a thermocycler at 30 °C for 2 h.
- the engineered dsRNA ligase polypeptides represented by the even numbered sequence identifiers of SEQ ID NOs: 4 to 106 comprise an even numbered sequence identifier of SEQ ID NOs: 304 to 406, respectively, and a 14 amino acid N-terminal purification tag (MHHHHHHENLYFQS (SEQ ID NO: 669)).
- SEQ ID NO: 4 comprises: (i) the 14 amino acid N-terminal purification tag of SEQ ID NO: 669; and (ii) the dsRNA ligase polypeptide of SEQ ID NO: 304.
- the position of a given mutation is provided relative to SEQ ID NO: 2 which includes (i) the 14 amino acid N-terminal purification tag of SEQ ID NO: 669 and (ii) the wild-type dsRNA ligase polypeptide of SEQ ID NO: 302.
- the position of a given mutation relative to SEQ ID NO: 302 i.e. the wild-type dsRNA ligase polypeptide without the purification tag
- position X251 of SEQ ID NO: 2 corresponds to position X237 of SEQ ID NO: 302.
- polypeptide from example 7 SEQ ID NO: 69 encoding the most active polypeptide with dsRNA ligase activity of SEQ ID NO: 70 was used to generate the engineered polypeptides of Table 8-1.
- These polypeptides displayed improved dsRNA ligase activity under the desired conditions e.g., the improvement in the formation of either oligonucleotide products (2) or (3), or preferably both oligonucleotide products (2) and (3) that was produced in situ from the substrates oligonucleotides (6-7, 9-12) as compared to the starting polypeptide.
- polypeptides displayed improved product formation of both oligonucleotide products (2) and (3) compared to the starting polypeptide are noted in Table 8-1.
- the engineered polypeptides, having the amino acid sequences of even-numbered sequence identifiers were generated from the “backbone” amino acid sequence of SEQ ID NO: 70, as described below together with the analytical method described in Table 6-1.
- Directed evolution began with the polynucleotide set forth in SEQ ID NO: 69.
- Libraries of engineered polypeptides were generated using various well-known techniques (e.g., saturation mutagenesis, recombination of previously identified beneficial amino acid differences) and screened using HTP assay and analysis methods, described below, that measured the polypeptides’ ability to produce oligonucleotides (2) and (3).
- the enzyme assays were carried out in 96-well PCR plates, in 50 pL total reaction volume per well.
- the reactions contained either 1.25 or 2.5 % (v/v) of undiluted dsRNA ligase lysate, prepared as described in Example 4, 100 pM (each) substrate oligonucleotides (6-7, 9- 12), 50 mM Tris-buffer at pH 7.5, 1 mM ATP, 10 mM MgCh and 5 mM DTT.
- the reaction plates were heat-sealed and incubated in a thermocycler at 30 °C for 2 h.
- the engineered dsRNA ligase polypeptides represented by the even numbered sequence identifiers of SEQ ID NOs: 108 to 216 comprise an even numbered sequence identifier of SEQ ID NOs: 408 to 516, respectively, and a 14 amino acid N-terminal purification tag (MHHHHHHENLYFQS (SEQ ID NO: 669)).
- SEQ ID NO: 108 comprises: (i) the 14 amino acid N-terminal purification tag of SEQ ID NO: 669; and (ii) the dsRNA ligase polypeptide of SEQ ID NO: 408.
- polypeptide from example 8 SEQ ID NO: 187 encoding the most active polypeptide with dsRNA ligase activity of SEQ ID NO: 188 was used to generate the engineered polypeptides of Table 9-1. These polypeptides displayed improved dsRNA ligase activity under the desired conditions e.g., the improvement in the formation of either oligonucleotide products (2) or (3), or preferably both oligonucleotide products (2) and (3) that was produced in situ from the substrates oligonucleotides (6-7, 9-12) as compared to the starting polypeptide.
- polypeptides displayed improved product formation of both oligonucleotide products (2) and (3) compared to the starting polypeptide are noted in Table 9- 1.
- the engineered polypeptides, having the amino acid sequences of even-numbered sequence identifiers were generated from the “backbone” amino acid sequence of SEQ ID NO: 188, as described below together with the analytical method described in Table 6-2.
- the enzyme assays were carried out in 96-well PCR plates, in 100 pL total reaction volume per well.
- the reactions contained 20 % (v/v) of undiluted dsRNA ligase lysate, prepared as described in Example 4, 1 mM (each) substrate oligonucleotides (6-7, 9-12), 50 mM Tris-buffer at pH 7.0, 10 mM ATP, 20 mM MgCh, 5 mM DTT and 10 % (v/v) DMSO.
- the reaction plates were heat-sealed and incubated in a thermocycler at 30 °C for 2 h.
- the plates were subjected to a heat inactivation step (95 °C, 20 min) to quench the reaction and precipitate proteinaceous content of the added lysate.
- the plates were then centrifuged at 4,000 rpm for 5 min. Subsequently a 50 pL aliquot of the supernatant was removed from each well and added to a deep well 96-well plate containing 450 pL of 5 mM EDTA solution (pH 7.0).
- the samples were further diluted by transferring 50 pL of the diluted sample into a deep well 96-well plate containing 950 pL of 5 mM EDTA solution (pH 7.0).
- the engineered dsRNA ligase polypeptides represented by the even numbered sequence identifiers of SEQ ID NOs: 218 to 246 comprise an even numbered sequence identifier of SEQ ID NOs: 518 to 546, respectively, and a 14 amino acid N-terminal purification tag (MHHHHHHENLYFQS (SEQ ID NO: 669)).
- SEQ ID NO: 218 comprises: (i) the 14 amino acid N-terminal purification tag of SEQ ID NO: 669; and (ii) the dsRNA ligase polypeptide of SEQ ID NO: 518.
- polypeptide from example 9 SEQ ID NO: 225 encoding the most active polypeptide with dsRNA ligase activity of SEQ ID NO: 226 was used to generate the engineered polypeptides of Table 10-1. These polypeptides displayed improved dsRNA ligase activity under the desired conditions e.g., the improvement in the formation of either oligonucleotide products (2) or (3), or preferably both oligonucleotide products (2) and (3) that was produced in situ from the substrates oligonucleotides (6-7, 9-12) as compared to the starting polypeptide.
- polypeptides displayed improved product formation of both oligonucleotide products (2) and (3) compared to the starting polypeptide are noted in Table 10-1.
- the engineered polypeptides, having the amino acid sequences of even-numbered sequence identifiers were generated from the “backbone” amino acid sequence of SEQ ID NO: 226, as described below together with the analytical method described in Table 6-2.
- Directed evolution began with the polynucleotide set forth in SEQ ID NO: 225.
- Libraries of engineered polypeptides were generated using various well-known techniques (e.g., saturation mutagenesis, recombination of previously identified beneficial amino acid differences) and screened using HTP assay and analysis methods, described below, that measured the polypeptides’ ability to produce oligonucleotides (2) and (3).
- the enzyme assays were carried out in 96-well PCR plates, in 100 pL total reaction volume per well.
- the reactions contained 2.5 % (v/v) of undiluted dsRNA ligase lysate, prepared as described in Example 4, 1 mM (each) substrate oligonucleotides (6-7, 9-12), 50 mM Tris-buffer at pH 7.0, 10 mM ATP, 20 mM MgCh, 5 mM DTT and 10 % (v/v) DMSO.
- the reaction plates were heat-sealed and incubated in a thermocycler at 30 °C for 24 h.
- the plates were subjected to a heat inactivation step (95 °C, 20 min) to quench the reaction and precipitate proteinaceous content of the added lysate.
- the plates were then centrifuged at 4,000 rpm for 5 min. Subsequently a 50 pL aliquot of the supernatant was removed from each well and added to a deep well 96-well plate containing 450 pL of 5 mM EDTA solution (pH 7.0).
- the samples were further diluted by transferring 50 pL of the diluted sample into a deep well 96-well plate containing 950 pL of 5 mM EDTA solution (pH 7.0).
- the engineered dsRNA ligase polypeptides represented by the even numbered sequence identifiers of SEQ ID NOs: 248 to 282 comprise an even numbered sequence identifier of SEQ ID NOs: 548 to 582, respectively, and a 14 amino acid N-terminal purification tag (MHHHHHHENLYFQS (SEQ ID NO: 669)).
- SEQ ID NO: 248 comprises: (i) the 14 amino acid N-terminal purification tag of SEQ ID NO: 669; and (ii) the dsRNA ligase polypeptide of SEQ ID NO: 548.
- polypeptide from example 10 SEQ ID NO: 277 encoding the most active polypeptide with dsRNA ligase activity of SEQ ID NO: 278 was used to generate the engineered polypeptides of Table 11-1.
- These polypeptides displayed improved dsRNA ligase activity under the desired conditions e.g., the improvement in the formation of either oligonucleotide products (2) or (3), or preferably both oligonucleotide products (2) and (3) that was produced in situ from the substrates oligonucleotides (6-7, 9-12) as compared to the starting polypeptide.
- polypeptides displayed improved product formation of both oligonucleotide products (2) and (3) compared to the starting polypeptide are noted in Table 11-1.
- the engineered polypeptides, having the amino acid sequences of even-numbered sequence identifiers were generated from the “backbone” amino acid sequence of SEQ ID NO: 278, as described below together with the analytical method described in Table 6-2. Directed evolution began with the polynucleotide set forth in SEQ ID NO: 277.
- the enzyme assays were carried out in 96-well PCR plates, in 100 pL total reaction volume per well.
- the reactions contained 10 % (v/v) of undiluted dsRNA ligase lysate, prepared as described in Example 4, 5 mM (each) substrate oligonucleotides (6-7, 9-12), 50 mM Tris-buffer at pH 7.0, 30 mM ATP, 60 mM MgCh and 10 % (v/v) DMSO.
- the reaction plates were heat-sealed and incubated in a thermocycler at 30 °C for 24 h.
- the plates were subjected to a heat inactivation step (95 °C, 20 min) to quench the reaction and precipitate proteinaceous content of the added lysate.
- the plates were then centrifuged at 4,000 rpm for 5 min. Subsequently a 50 pL aliquot of the supernatant was removed from each well and added to a deep well 96-well plate containing 950 pL of 5 mM EDTA solution (pH 7.0).
- the samples were further diluted by transferring 50 pL of the diluted sample into a deep well 96-well plate containing 450 pL of 5 mM EDTA solution (pH 7.0).
- the samples were diluted a third time by transferring 160 pL of the diluted sample into a deep well 96-well plate containing 640 pL of 5 mM EDTA solution (pH 7.0).
- the samples were diluted a final time by transferring 75 pL of the diluted sample into a shallow well 96-well plate containing 75 pL of 5 mM EDTA solution (pH 7.0).
- the samples were analyzed via HPLC to determine the activity of the enzyme variants using the analytical method described in Table 6-2. Selected ligase variants showing a faster product formation of oligonucleotides (2) and (3) relative to SEQ ID NO: 278 are shown in Table 11-1.
- the engineered dsRNA ligase polypeptides represented by the even numbered sequence identifiers of SEQ ID NOs: 284 to 300 comprise an even numbered sequence identifier of SEQ ID NOs: 584 to 600, respectively, and a 14 amino acid N-terminal purification tag (MHHHHHHENLYFQS (SEQ ID NO: 669)).
- SEQ ID NO: 284 comprises: (i) the 14 amino acid N-terminal purification tag of SEQ ID NO: 669; and (ii) the dsRNA ligase polypeptide of SEQ ID NO: 584.
- polypeptide from example 11 SEQ ID NO: 287 encoding the most active polypeptide with dsRNA ligase activity of SEQ ID NO: 288 was used to generate the engineered polypeptides of Table 12-1.
- These polypeptides displayed improved dsRNA ligase activity under the desired conditions e.g., the improvement in the formation of either oligonucleotide products (2) or (3), or preferably both oligonucleotide products (2) and (3) that was produced in situ from the substrate oligonucleotides (6-7, 9-12) as compared to the starting polypeptide.
- polypeptides displayed improved product formation of both oligonucleotide products (2) and (3) compared to the starting polypeptide, as noted in Table 12- 1. Furthermore, some polypeptides displayed improved thermal stability, resulting in higher residual activity following incubation of the dsRNA ligase solution at 30 °C for 1 h prior to setting up the reaction (Table 12-2).
- the engineered polypeptides having the amino acid sequences of even- numbered sequence identifiers were generated from the “backbone” amino acid sequence of SEQ ID NO: 288, as described below together with the analytical method described in Table 6-3. Directed evolution began with the polynucleotide set forth in SEQ ID NO: 287.
- the enzyme assays were carried out in 96-well PCR plates, in 100 pL total reaction volume per well.
- the cells were lysed according to Example 4, however 100 mM MOPS-buffer at pH 7.2 was used in place of 50 mM Tris at pH 7.5.
- the cell lysates were either undiluted or diluted 1 : 1 in 100 mM MOPS buffer, pH 7.2 and incubated at 30 °C and 4 °C respectively for 1 h.
- the reactions contained a final dsRNA ligase concentration of 40 % (v/v) for lysate incubated at 30 °C and 20% (v/v) for lysate incubated at 4 °C.
- the ligation reactions contained 3 mM (each) substrate oligonucleotides (6-7, 9-12), 100 mM MOPS-buffer at pH 7.2, 30 mM ATP, 60 mM MgCh and 10 % (v/v) DMSO.
- the reaction plates were heat-sealed and incubated in a thermocycler at 30 °C for 24 h.
- the plates were subjected to a heat inactivation step (95 °C, 20 min) to quench the reaction and precipitate proteinaceous content of the added lysate.
- the plates were then centrifuged at 4,000 rpm for 5 min. Subsequently a 50 pL aliquot of the supernatant was removed from each well and added to a deep well 96-well plate containing 950 pL of 5 mM EDTA solution (pH 7.0).
- the samples were further diluted by transferring 20 pL of the diluted sample into a deep well 96-well plate containing 180 pL of 5 mM EDTA solution (pH 7.0).
- the samples were diluted a third time by transferring 30 pL of the diluted sample into a deep well 96-well plate containing 150 pL of 5 mM EDTA solution (pH 7.0).
- the samples were analyzed via RF-MS to determine the activity of the enzyme variants using the analytical method described in Table 6-3.
- Selected ligase variants showing a faster product formation of oligonucleotides (2) and (3) relative to SEQ ID NO: 288 following pre-incubation at 4 °C are shown in Table 12-1.
- Selected ligase variants showing a faster product formation of oligonucleotides (2) and (3) relative to SEQ ID NO: 288 following pre-incubation at 30 °C are shown in Table 12-2.
- the engineered dsRNA ligase polypeptides represented by the even numbered sequence identifiers of SEQ ID NOs: 602 to 634 comprise an even numbered sequence identifier of SEQ ID NOs: 636 to 668, respectively, and a 14 amino acid N-terminal purification tag (MHHHHHHENLYFQS (SEQ ID NO: 669)).
- SEQ ID NO: 602 comprises: (i) the 14 amino acid N-terminal purification tag of SEQ ID NO: 669; and (ii) the dsRNA ligase polypeptide of SEQ ID NO: 636.
- polynucleotides SEQ ID NO: 1 encoding for the wild-type dsRNA ligase from Bacteriophage RB69, Uniprot ID: Q7Y4V8 with the SEQ ID NO: 2 and the engineered polynucleotides SEQ ID NO: 287, SEQ ID NO: 289 and SEQ ID NO: 291 encoding for the most improved variants from example 11 with polypeptide sequences SEQ ID NO: 288, SEQ ID NO: 290 and SEQ ID NO: 292, have been used for SFP production as described in example 5.
- the catalytic activity to convert the substrates oligonucleotides (6-7, 9-12) to the desired siRNA product (1) was evaluated under two reaction conditions: Condition 1 (50 mM Tris-buffer at pH 7.5, 1 mM ATP, 5 mM MgCh and 5 mM DTT, containing either 0 g/L, 0.0020 g/L, 0.0039 g/L, 0.0078 g/L, 0.0156 g/L, 0.0313 g/L, 0.0625 g/L, 0.125 g/L, 0.25 g/L, 0.5 g/L, 1 g/L, or 2 g/L of SFP), and Condition 2 (50 mM Tris-buffer at pH 7.0, 30 mM ATP, 60 mM MgCh and 10 % (v/v) DMSO containing either 0 g/L, 0.0049 g/L, 0.0098 g/L, 0.0195 g/L, 0.0391
- reaction plate 1 was incubated for 2 h and reaction plate 2 was incubated for 24 h.
- reaction plate 1 was diluted 40 x and reaction plate 2 was diluted 2 400 x.
- the samples were analyzed via HPLC to determine the activity of the enzyme variants using the analytical method described in Table 6-2.
- Comparative data in Figures 2A and 2B show the relative peak area % of siRNA (1) present in the reaction samples assayed under conditions 1 and 2 respectively. Under both conditions 1 and 2, the polypeptides SEQ ID NO: 288, SEQ ID NO: 290 and SEQ ID NO: 292 exhibit improved dsRNA ligase activity over the wild-type polypeptide SEQ ID NO: 2.
- polynucleotides SEQ ID NO: 1 encoding for the wild-type dsRNA ligase from Bacteriophage RB69, Uniprot ID: Q7Y4V8 with the SEQ ID NO: 2 and the engineered polynucleotide SEQ ID NO: 287 and SEQ ID NO: 631 encoding for the most improved variant from example 11 and example 12 with polypeptide sequences SEQ ID NO: 288 and SEQ ID NO: 632 respectively, have been used for SFP production as described in example 5.
- the catalytic activity to convert the substrate oligonucleotides (6-7, 9-12) to the desired siRNA product (1) and the thermostability of the two enzymes was evaluated by incubating a stock solution of the SFP for 4 h at either 4 or 37 °C prior to setting up the following ligation reaction: 6 mM (each) substrate oligonucleotides (6-7, 9-12), 100 mM MOPS-buffer at pH 7.2, 30 mM ATP, 60 mM MgCh and 10 % (v/v) DMSO.
- the ligation reactions contained, either 0 g/L, 0.156 g/L, 0.313 g/L, 0.625 g/L, 1.25 g/L, 2.5 g/L, 5 g/L, or 10 g/L of SFP.
- the enzyme assays were carried out in 96-well PCR plates, in 100 pL total reaction volume per well. The reaction plate was heat-sealed and incubated in a thermocycler at 30 °C for 24 h.
- the plate was subjected to a heat inactivation step (95 °C, 20 min) to quench the reaction and precipitate proteinaceous content of the added SFP.
- the plate was then centrifuged at 4,000 rpm for 5 min.
- a 50 pL aliquot of the supernatant from each well of each plate was removed and subsequently diluted 400 x in 50 mM EDTA solution (pH 7.0).
- the samples were analyzed via HPLC to determine the activity of the enzyme variants using the analytical method described in Table 6-2.
- Comparative data in Figure 3A shows the relative peak area % of siRNA (1) present in the reaction samples.
- Figure 3B shows the residual enzyme activity following pre -incubation of the SFP at 37 °C for 4 h, expressed relative to the ligation activity of the SFP pre-incubated at 4 °C for 4 h.
- the polypeptides SEQ ID NO: 632 exhibits improved dsRNA ligase activity and thermostability over the wild-type polypeptide SEQ ID NO: 2 and engineered polypeptide SEQ ID NO: 288.
- Table 13 provides a summary of the nucleic acid and amino acid sequences of the wild-type and engineered dsRNA ligase sequences described herein.
- the purification tag used in the Examples and reference in table 13 is the N-terminal purification tag MHHHHHHENLYFQS (SEQ ID NO: 669).
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biomedical Technology (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Molecular Biology (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Medicinal Chemistry (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
The present disclosure relates to the field of biotechnology, in particular to engineered double-stranded RNA (dsRNA) ligases and their application in industrial biocatalysis. The present disclosure also relates to a process of producing an engineered dsRNA ligase, and to a method for producing an oligonucleotide by contacting oligonucleotide fragments with an engineered dsRNA ligase.
Description
ENGINEERED DOUBLE-STRAND RNA LIGASES AND USES THEREOF
CROSS-REFERENCE TO RELATED APPLICATIONS
This application claims priority to, and the benefit of, EP Application No. 22215201.9, filed on December 20, 2022, the content of which is incorporated herein by reference in its entirety.
SEQUENCE LISTING
The instant application contains a Sequence Listing which has been submitted electronically in .XML format and is hereby incorporated by reference in its entirety. Said .XML copy, created on 2 December 2023, is named PAT059445-WO-PCT_SL.xml and is 1.24MB in size.
Technical field
The present disclosure relates to the field of biotechnology, in particular to engineered double-stranded RNA (dsRNA) ligases and their application in industrial biocatalysis. The present disclosure also relates to a process of producing an engineered dsRNA ligase, and to a method for producing an oligonucleotide by contacting oligonucleotide fragments with an engineered dsRNA ligase.
Background art
Therapeutic oligonucleotides, including small interfering RNA (siRNA) and inhibitory antisense oligonucleotides (ASOs) have the potential to treat a diverse range of life-threatening diseases. In recent years there has been a significant increase in the number of approved oligonucleotide-based drugs, and a large rise in the number of therapeutic oligonucleotides under clinical investigation (Roberts, T. C., Langer, R. & Wood, M. J. A. Nature Reviews Drug Discovery 2020 19:10 19, 673-694 (2020)).
In support of green synthesis initiatives throughout the pharmaceutical industry, there is a significant need for next-generation oligonucleotide synthesis methods that are both sustainable and economical at the scale required to reach wider patient populations (Mishra, M. et al. Current Research in Green and Sustainable Chemistry 4, (2021)).
To this end, biocatalysis is being more frequently applied in the manufacture of active pharmaceutical ingredients (APIs) since enzymes are capable of highly selective transformation under mild reaction conditions and in aqueous media (Mann, G. & Stanger, F. V. Chimia (Aarau) 74, 407-417 (2020)). The biocatalysis of short oligonucleotide fragments
offers a sustainable and economical alternative to the solid phase chemical synthesis of full- length therapeutic oligonucleotides currently used.
Shorter oligonucleotides can be synthesized more easily and with higher purities than longer oligonucleotides, simplifying downstream processing and reducing solvent waste. These short oligonucleotide fragments can then be combined using nucleic acid ligases to produce oligonucleotide products. Nucleic acid ligases have shown remarkable tolerance towards unnatural DNA/RNA containing pharmaceutically relevant chemical modifications (Kestemont, D., Herdewijn, P. & Renders, M. Curr Protoc Chem Biol 11, e62 (2019); Kestemont, D. et al. Chemical Communications 54, 6408-6411 (2018); and Nandakumar, J. & Shuman, S. Molecular Cell 16, 211-221 (2004)), and the use of a dsRNA ligase to synthesize an siRNA product, starting from short fragments (< 9 nts), containing extensive chemical modification, including 2’-OMe, 2’-F modified nucleotides, phosphorothioate backbone modified nucleotides and a terminal fragment that is functionalized with a bulky N- acetyl galactosamine (GalNAc) moiety has previously been described (Mann, G. et al. Tetrahedron Letters 93, 153696 (2022)).
To achieve cost-effective and sustainable industrial scale biocatalysis of oligonucleotides, enzymes exhibiting high ligase activity are required. There exists an urgent and unmet need for engineered ligase enzymes, which exhibit improved ligase activity relative to wild-type enzymes. There is also an unmet need for biocatalytic methods of producing oligonucleotides from oligonucleotide fragments.
Brief description of the drawings
Figure 1. dsRNA ligase catalyzed ligation of: (A) oligonucleotide fragments 6:9; 7: 10 and 11: 12 to generate oligonucleotide 2:3 (= siRNA 1); and (B) oligonucleotide fragments 6:9, 7: 10 and 8: 11 to generate oligonucleotide 5:3 (= siRNA 4). The sequences of oligonucleotides 2, 3 and 5-12 are provided in Table 1.
Figure 2. Comparative data showing the relative peak area % of siRNA (1) present in the reaction samples comprising different concentrations of the wild-type enzyme (SEQ ID NO: 2) and engineered enzymes (SEQ ID NOs: 288, 290 and 292) assayed under: (A) condition 1; and (B) condition 2, as described in Example 13. Enzyme concentration is provided as g/L of shake-flask powder (SFP) produced by lyophilization of frozen clarified lysate as described in the Examples.
Figure 3. (A) Comparative data showing the relative peak area % of siRNA (1) present in the reaction samples comprising different concentrations of wild-type enzyme (SEQ ID NO: 2)
and engineered enzymes (SEQ ID NOs: 288 and 632) following pre-incubation of the enzyme at 4 °C or 37 °C for 4 h. (B) Comparative data showing residual enzyme activity following pre-incubation of the SFP at 37 °C for 4 h, expressed relative to the ligation activity of the SFP pre-incubated at 4 °C for 4 h. Enzyme concentration is provided as g/L of shake-flask powder (SFP) produced by lyophilization of frozen clarified lysate as described in the Examples.
Summary of the disclosure
The present disclosure provides engineered double-stranded RNA (dsRNA) ligase polypeptides. The present disclosure also provides gene sequences of engineered polypeptides, recombinant expression vectors comprising the genes, engineered host strains and efficient methods for the production thereof, as well as reaction processes for the biocatalysis of oligonucleotides using engineered polypeptides.
The engineered double -stranded RNA (dsRNA) ligase polypeptides described herein have improved catalytic activity as compared to the wild-type dsRNA ligase from which they are derived. Through substitutions and/or deletions of amino acid residues in directed evolution processes, the engineered polypeptides provided herein were derived from a wildtype dsRNA ligase from Bacteriophage RB69. The wild-type dsRNA ligase consists of 332 amino acids and has the amino acid sequence shown in SEQ ID NO: 302 (also accessible under accession number Q7Y4V8 in UniProt).
The disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NOs: 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346,
348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382,
384, 386, 388, 390, 392, 394, 396, 398, 400, 402, 404, 406, 408, 410, 412, 414, 416, 418,
420, 422, 424, 426, 428, 430, 432, 434, 436, 438, 440, 442, 444, 446, 448, 450, 452, 454,
456, 458, 460, 462, 464, 466, 468, 470, 472, 474, 476, 478, 480, 482, 484, 486, 488, 490,
492, 494, 496, 498, 500, 502, 504, 506, 508, 510, 512, 514, 516, 518, 520, 522, 524, 526,
528, 530, 532, 534, 536, 538, 540, 542, 544, 546, 548, 550, 552, 554, 556, 558, 560, 562,
564, 566, 568, 570, 572, 574, 576, 578, 580, 582, 584, 586, 588, 590, 592, 594, 596, 598, and
600; wherein the engineered dsRNA ligase polypeptide: (a) has dsRNA ligase activity; and (b) does not the comprise the amino acid sequence of SEQ ID NO: 302.
The disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NOs: 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, and 668; wherein the engineered dsRNA ligase polypeptide: (a) has dsRNA ligase activity; and (b) does not the comprise the amino acid sequence of SEQ ID NO: 302.
The disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NOs: 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346,
348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382,
384, 386, 388, 390, 392, 394, 396, 398, 400, 402, 404, 406, 408, 410, 412, 414, 416, 418,
420, 422, 424, 426, 428, 430, 432, 434, 436, 438, 440, 442, 444, 446, 448, 450, 452, 454,
456, 458, 460, 462, 464, 466, 468, 470, 472, 474, 476, 478, 480, 482, 484, 486, 488, 490,
492, 494, 496, 498, 500, 502, 504, 506, 508, 510, 512, 514, 516, 518, 520, 522, 524, 526,
528, 530, 532, 534, 536, 538, 540, 542, 544, 546, 548, 550, 552, 554, 556, 558, 560, 562,
564, 566, 568, 570, 572, 574, 576, 578, 580, 582, 584, 586, 588, 590, 592, 594, 596, 598,
600, 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, and 668; wherein the engineered dsRNA ligase polypeptide: (a) has dsRNA ligase activity; and (b) does not the comprise the amino acid sequence of SEQ ID NO: 302.
The disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 85% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NOs: 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346,
348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382,
384, 386, 388, 390, 392, 394, 396, 398, 400, 402, 404, 406, 408, 410, 412, 414, 416, 418,
420, 422, 424, 426, 428, 430, 432, 434, 436, 438, 440, 442, 444, 446, 448, 450, 452, 454,
456, 458, 460, 462, 464, 466, 468, 470, 472, 474, 476, 478, 480, 482, 484, 486, 488, 490,
492, 494, 496, 498, 500, 502, 504, 506, 508, 510, 512, 514, 516, 518, 520, 522, 524, 526,
528, 530, 532, 534, 536, 538, 540, 542, 544, 546, 548, 550, 552, 554, 556, 558, 560, 562,
564, 566, 568, 570, 572, 574, 576, 578, 580, 582, 584, 586, 588, 590, 592, 594, 596, 598,
600, 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, and 668; wherein the engineered dsRNA ligase polypeptide: (a) has dsRNA ligase activity; and (b) does not the comprise the amino acid sequence of SEQ ID NO: 302.
The disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide, which is a polypeptide of: (a) a polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354,
356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390,
392, 394, 396, 398, 400, 402, 404, 406, 408, 410, 412, 414, 416, 418, 420, 422, 424, 426,
428, 430, 432, 434, 436, 438, 440, 442, 444, 446, 448, 450, 452, 454, 456, 458, 460, 462,
464, 466, 468, 470, 472, 474, 476, 478, 480, 482, 484, 486, 488, 490, 492, 494, 496, 498,
500, 502, 504, 506, 508, 510, 512, 514, 516, 518, 520, 522, 524, 526, 528, 530, 532, 534,
536, 538, 540, 542, 544, 546, 548, 550, 552, 554, 556, 558, 560, 562, 564, 566, 568, 570,
572, 574, 576, 578, 580, 582, 584, 586, 588, 590, 592, 594, 596, 598, and 600; or (b) a polypeptide having dsRNA ligase activity, which comprises an amino acid sequence having (i) at least 80% sequence identity to one of the polypeptides recited in (a), and (ii) a substitution, deletion, addition or insertion of one or more amino acid residues relative to said one amino acid sequence recited in (a); wherein the engineered dsRNA ligase polypeptide does not the comprise the amino acid sequence of SEQ ID NO: 302.
The disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide, which is a polypeptide of: (a) a polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, and 668; or (b) a polypeptide having dsRNA ligase activity, which comprises an amino acid sequence having (i) at least 80% sequence identity to one of the polypeptides recited in (a), and (ii) a substitution, deletion, addition or insertion of one or more amino acid residues relative to said one amino acid sequence recited in (a); wherein the engineered dsRNA ligase polypeptide does not the comprise the amino acid sequence of SEQ ID NO: 302.
The disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide, which is a polypeptide of: (a) a polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354,
356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390,
392, 394, 396, 398, 400, 402, 404, 406, 408, 410, 412, 414, 416, 418, 420, 422, 424, 426,
428, 430, 432, 434, 436, 438, 440, 442, 444, 446, 448, 450, 452, 454, 456, 458, 460, 462,
464, 466, 468, 470, 472, 474, 476, 478, 480, 482, 484, 486, 488, 490, 492, 494, 496, 498,
500, 502, 504, 506, 508, 510, 512, 514, 516, 518, 520, 522, 524, 526, 528, 530, 532, 534,
536, 538, 540, 542, 544, 546, 548, 550, 552, 554, 556, 558, 560, 562, 564, 566, 568, 570,
572, 574, 576, 578, 580, 582, 584, 586, 588, 590, 592, 594, 596, 598, 600, 636, 638, 640,
642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, and 668; or (b) a polypeptide having dsRNA ligase activity, which comprises an amino acid sequence having
(i) at least 80% sequence identity to one of the polypeptides recited in (a), and (ii) a substitution, deletion, addition or insertion of one or more amino acid residues relative to said one amino acid sequence recited in (a); wherein the engineered dsRNA ligase polypeptide does not the comprise the amino acid sequence of SEQ ID NO: 302.
The disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NOs: 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346,
348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382,
384, 386, 388, 390, 392, 394, 396, 398, 400, 402, 404, 406, 408, 410, 412, 414, 416, 418,
420, 422, 424, 426, 428, 430, 432, 434, 436, 438, 440, 442, 444, 446, 448, 450, 452, 454,
456, 458, 460, 462, 464, 466, 468, 470, 472, 474, 476, 478, 480, 482, 484, 486, 488, 490,
492, 494, 496, 498, 500, 502, 504, 506, 508, 510, 512, 514, 516, 518, 520, 522, 524, 526,
528, 530, 532, 534, 536, 538, 540, 542, 544, 546, 548, 550, 552, 554, 556, 558, 560, 562,
564, 566, 568, 570, 572, 574, 576, 578, 580, 582, 584, 586, 588, 590, 592, 594, 596, 598,
600, 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, and 668; wherein: (a) the engineered dsRNA ligase polypeptide has dsRNA ligase activity; and (b) the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X6 is G or E ; X7 is Q; X15 is R, D or E; X19 is Q or D; X29 is N or L; X36 is V; X39 is A; X44 is V; X45 is V; X46 is Y; X47 is E; X49 is G; X51 is L; X53 is Y; X56 is R or A; X57 is S; X60 is T, G or P; X63 is S, Q or G; X64 is R, T, Q, F, G, or M; X66 is F or W; X67 is N; X87 is T, P, K or absent; X88 is C; X89 is T; X91 is S; X92 is D; X93 is G, C, or A; X103 is V, C, Y, or T; X105 is V; X107 is R or T; XI 14 is N; X122 is W; X126 is G; X129 is N; X130 is R, S or Y; X131 is R; X137 is V or C; X144 is N; X146 is R; X158 is W; X163 is G; X173 is L; X178 is R; X185 is K; X190 is Q; X196 is S or C; X216 is L or R; X221 is I; X228 is R; X230 is T; X232 is R; X235 is A, T, or G; X236 is S, L, or F; X237 is S, Q, R, L or G; X238 is F; X239 is G or R; X242 is R or M; X243 is N, S, G, or M; X244 is G or K; X251 is D or L; X252 is V; X254 is K; X255 is C; X258 is V; X269 is L; X280 is W; X284 is A; X285 is A; X293 is R; X296 is R; X301 is G, L, E, or F;
X303 is Q; X305 is G; X313 is A; X314 is A or V; X325 is R; and X328 is R; wherein the numbering refers to SEQ ID NO: 302.
In some embodiments, the polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 370, 488, 526, 578, 588, 590, and 592. In some embodiments, the polypeptide comprises an amino acid sequence of SEQ ID NO: 666. In some embodiments, the polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 370, 488, 526, 578, 588, 590, 592, and 666.
The disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NOs: 370, 488, 526, 578, 588, 590, 592, and 666; wherein: (a) the engineered dsRNA ligase polypeptide has dsRNA ligase activity; and (b) the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X15 is D or E; X19 is D; X36 is V; X39 is A; X53 is Y; X185 is K; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A.
The disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to the amino acid sequence of SEQ ID NO: 666; wherein: (a) the engineered dsRNA ligase polypeptide has dsRNA ligase activity; and (b) the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more (e.g. 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, 9 or more, or all 10) of the following amino acid residues: X15 is D; X39 is A; X53 is Y; X185 is K; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A.
The disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to the amino acid sequence of SEQ ID NO: 370; wherein: (a) the engineered dsRNA ligase polypeptide has dsRNA ligase activity; and (b) the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more (e.g. 2 or more, 3 or more, or all 4) of the following amino acid residues: X36 is V; X39 is A; X218 is N; and X221 is I.
The disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to the amino acid sequence of SEQ ID NO: 488; wherein: (a) the engineered dsRNA ligase polypeptide has dsRNA ligase activity; and (b) the amino acid sequence of the engineered
dsRNA ligase polypeptide comprises one or more (e.g. 2 or more, or all 3) of the following amino acid residues: X39 is A; X218 is N; and X221 is I.
The disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to the amino acid sequence of SEQ ID NO: 526; wherein: (a) the engineered dsRNA ligase polypeptide has dsRNA ligase activity; and (b) the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more (e.g. 2 or more, 3 or more, or all 4) of the following amino acid residues: X39 is A; X218 is N; X221 is I; and X255 is C.
The disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to the amino acid sequence of SEQ ID NO: 578; wherein: (a) the engineered dsRNA ligase polypeptide has dsRNA ligase activity; and (b) the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more (e.g. 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, or all 8) of the following amino acid residues: X39 is A; X 53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A.
The disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to the amino acid sequence of SEQ ID NO: 588 or 590; wherein: (a) the engineered dsRNA ligase polypeptide has dsRNA ligase activity; and (b) the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more (e.g. 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, or all 9) of the following amino acid residues: X15 is D or E; X39 is A; X53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A.
The disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to the amino acid sequence of SEQ ID NO: 592; wherein: (a) the engineered dsRNA ligase polypeptide has dsRNA ligase activity; and (b) the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more (e.g. 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, or all 9) of the following amino acid residues: X19 is D; X39 is A; X53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A.
In some embodiments, the polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666 and 668. In some embodiments, the polypeptide comprises an
amino acid sequence selected from the group consisting of SEQ ID NOs: 636, 638, 642, 646, 664, and 666.
The disclosure also provides an engineered dsRNA ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to SEQ ID NO: 302, which produces at least 5% more oligonucleotide product than a dsRNA ligase polypeptide comprising the amino acid sequence of SEQ ID NO: 302 under the same ligation reaction conditions, wherein the engineered dsRNA ligase polypeptide does not the comprise the amino acid sequence of SEQ ID NO: 302.
In some embodiments, the ligation reaction conditions include about 1 pM to about 10 mM oligonucleotide fragment, a source of ATP, about 5 mM to about 100 mM divalent cation, and about 0.5 g/L to about 10 g/L engineered dsRNA ligase polypeptide, pH of about 4.0 to about 8.0, and temperature of about 10 °C to about 50°C. In some embodiments, the source of ATP comprises ATP, optionally a stoichiometric excess of ATP. In some embodiments, the source of ATP comprises: (a) polyphosphate kinase (PPK); (b) polyphosphate; and (c) AMP and/or ATP.
In some embodiments, the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X6, X7, X15, X19, X29, X36, X39, X46, X47, X49, X51, X53, X56, X57, X60, X63, X64, X66, X67, X87, X88, X91, X93, X103, X105, X107, X114, X122, X126, X129, X130, X131, X137, X144, X146, X158, X163, X173, X178, X190, X196, X216, X218, X221, X228, X230, X232, X235, X236, X237, X238, X239, X242, X243, X244, X251, X252, X254, X255, X258, X269, X280, X284, X285, X293, X296, X301, X303, X305, X314, X325, and X328, wherein the numbering refers to SEQ ID NO: 302.
In some embodiments, the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X6, X7, X15, X19, X29, X36, X39, X44, X45, X46, X47, X49, X51, X53, X56, X57, X60, X63, X64, X66, X67, X87, X88, X89, X91, X92, X93, X103, X105, X107, X114, X122, X126, X129, X130, X131, X137, X144, X146, X158, X163, X173, X178, X185, X190, X196, X216, X218, X221, X228, X230, X232, X235, X236, X237, X238, X239, X242, X243, X244, X251, X252, X254, X255, X258, X269, X280, X284, X285, X293, X296, X301, X303, X305, X313, X314, X325, and X328, wherein the numbering refers to SEQ ID NO: 302.
In some embodiments, the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X6 is G; X7 is Q; X15 is R, D or E; X19 is Q or D; X29 is N or L; X36 is V; X39 is A; X46 is Y; X47 is E; X49 is G; X51 is L; X53 is Y; X56 is R or A; X57 is S; X60 is T, G or P; X63 is S, Q or G; X64 is R, T, Q, F, G, or M; X66 is F or W; X67 is N; X87 is T, P, K or absent; X88 is C; X91 is S; X93 is G, C, or A; X103 is V, C, Y, or T; X105 is V; X107 is R or T; XI 14 is N; X122 is W; X126 is G; X129 is N; X130 is R, S or Y; X131 is R; X137 is V or C; X144 is N; X146 is R; X158 is W; X163 is G; X173 is L; X178 is R; X190 is Q; X196 is S or C; X216 is L or R; X218 is N; X221 is I; X228 is R; X230 is T; X232 is R; X235 is A, T, or G; X236 is S, L, or F; X237 is S, Q, or R; X238 is F; X239 is G or R; X242 is R or M; X243 is N, S, G, or M; X244 is G or K; X251 is D or L; X252 is V; X254 is K; X255 is C; X258 is V; X269 is L; X280 is W; X284 is A; X285 is A; X293 is R; X296 is R; X301 is G, L, E, or F; X303 is Q; X305 is G; X314 is A or V; X325 is R; and X328 is R; wherein the numbering refers to SEQ ID NO: 302.
In some embodiments, the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X6 is G or E; X7 is Q; X15 is R, D or E; X19 is Q or D; X29 is N or L; X36 is V; X39 is A; X44 is V; X45 is V; X46 is Y; X47 is E; X49 is G; X51 is L; X53 is Y; X56 is R or A; X57 is S; X60 is T, G or P; X63 is S, Q or G; X64 is R, T, Q, F, G, or M; X66 is F or W; X67 is N; X87 is T, P, K or absent; X88 is C; X89 is T; X91 is S; X92 is D; X93 is G, C, or A; X103 is V, C, Y, or T; X105 is V; X107 is R or T; XI 14 is N; X122 is W; X126 is G; X129 is N; X130 is R, S or Y; X131 is R; X137 is V or C; X144 is N; X146 is R; X158 is W; X163 is G; X173 is L; X178 is R; X185 is K; X190 is Q; X196 is S or C; X216 is L or R; X218 is N; X221 is I; X228 is R; X230 is T; X232 is R; X235 is A, T, or G; X236 is S, L, or F; X237 is S, Q, R, L or G; X238 is F; X239 is G or R; X242 is R or M; X243 is N, S, G, or M; X244 is G or K; X251 is D or L; X252 is V; X254 is K; X255 is C; X258 is V; X269 is L; X280 is W; X284 is A; X285 is A; X293 is R; X296 is R; X301 is G, L, E, or F; X303 is Q; X305 is G; X313 is A; X314 is A or V; X325 is R; and X328 is R; wherein the numbering refers to SEQ ID NO: 302.
In some embodiments, the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X15, X19, X36, X39, X53, X218, X221, X237, X251, X255, and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity.
In some embodiments, the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X15 is D or E; X19 is D; X36 is V; X39 is A; X53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A; wherein the numbering refers to SEQ ID NO: 302.
In some embodiments, the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X15, X19, X36, X39, X53, X185, X218, X221, X237, X251, X255, and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity.
In some embodiments, the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X15 is D or E; X19 is D; X36 is V; X39 is A; X53 is Y; X185 is K; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A; wherein the numbering refers to SEQ ID NO: 302.
In some embodiments, the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. 2 or more, 3 or more, or all 4) amino acid residues selected from: X36, X39, X218 and X221, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more (e.g. 2 or more, 3 or more, or all 4) of the following amino acid residues: X36 is V; X39 is A; X218 is N; and X221 is I.
In some embodiments, the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. 2 or more, or all 3) amino acid residues selected from: X39, X218 and X221, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more (e.g. 2 or more, or all 3) of the following amino acid residues: X39 is A; X218 is N; and X221 is I.
In some embodiments, the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. 2 or more, 3 or more, or all 4) amino acid residues selected from: X39, X218, X221 and X255, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more
(e.g. 2 or more, 3 or more, or all 4) of the following amino acid residues: X39 is A; X218 is N; X221 is I; and X255 is C.
In some embodiments, the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, or all 8) amino acid residues selected from: X39, X53, X218, X221, X237, X251, X255 and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more (e.g. 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, or all 8) of the following amino acid residues: X39 is A; X 53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A.
In some embodiments, the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, or all 9) amino acid residues selected from: X15, X39, X53, X218, X221, X237, X251, X255 and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more (e.g. 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, or all 9) of the following amino acid residues: X15 is E; X39 is A; X 53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A.
In some embodiments, the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, or all 9) amino acid residues selected from: X19, X39, X53, X218, X221, X237, X251, X255 and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more (e.g. 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, or all 9) of the following amino acid residues: X19 is D; X39 is A; X 53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A.
In some embodiments, the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8
or more, 9 or more, or all 10) amino acid residues selected from: X15, X39, X53, X185, X218, X221, X237, X251, X255 and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more (e.g. 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, 9 or more, or all 10) of the following amino acid residues: X15 is D; X39 is A; X 53 is Y; X185 is K; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A.
In some embodiments, the engineered dsRNA ligase polypeptide comprises a purification tag. In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144,
146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180,
182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216,
218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252,
254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288,
290, 292, 294, 296, 298 and 300. In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 602, 604, 606, 608, 610, 612, 614, 616, 618, 620, 622, 624, 626, 628, 630, 632, and 634. In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148,
150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184,
186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220,
222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256,
258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292,
294, 296, 298, 300, 602, 604, 606, 608, 610, 612, 614, 616, 618, 620, 622, 624, 626, 628,
630, 632, and 634.
The disclosure also provides a polypeptide immobilized on a solid material by chemical bond or a physical adsorption method, wherein the polypeptide comprises an engineered dsRNA ligase polypeptide described herein.
The disclosure also provides a polynucleotide encoding the engineered dsRNA ligase polypeptide described herein.
In some embodiments, the polynucleotide comprises a nucleic acid sequence selected from SEQ ID NOs: 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167,
169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203,
205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239,
241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275,
277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 303, 305, 307, 309, 311, 313,
315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347, 349,
351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 383, 385,
387, 389, 391, 393, 395, 397, 399, 401, 403, 405, 407, 409, 411, 413, 415, 417, 419, 421,
423, 425, 427, 429, 431, 433, 435, 437, 439, 441, 443, 445, 447, 449, 451, 453, 455, 457,
459, 461, 463, 465, 467, 469, 471, 473, 475, 477, 479, 481, 483, 485, 487, 489, 491, 493,
495, 497, 499, 501, 503, 505, 507, 509, 511, 513, 515, 517, 519, 521, 523, 525, 527, 529,
531, 533, 535, 537, 539, 541, 543, 545, 547, 549, 551, 553, 555, 557, 559, 561, 563, 565,
567, 569, 571, 573, 575, 577, 579, 581, 583, 585, 587, 589, 591, 593, 595, 597, and 599.
In some embodiments, the polynucleotide comprises a nucleic acid sequence selected from SEQ ID NOs: 601, 603, 605, 607, 609, 611, 613, 615, 617, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 649, 651, 653, 655, 657, 659, 661, 663, 665, and 667.
In some embodiments, the polynucleotide comprises a nucleic acid sequence selected from: (a) SEQ ID NOs: 303, 305, 307, 309, 311, 313, 315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 351, 353, 355, 357, 359, 361, 363, 365,
367, 369, 371, 373, 375, 377, 379, 381, 383, 385, 387, 389, 391, 393, 395, 397, 399, 401,
403, 405, 407, 409, 411, 413, 415, 417, 419, 421, 423, 425, 427, 429, 431, 433, 435, 437,
439, 441, 443, 445, 447, 449, 451, 453, 455, 457, 459, 461, 463, 465, 467, 469, 471, 473,
475, 477, 479, 481, 483, 485, 487, 489, 491, 493, 495, 497, 499, 501, 503, 505, 507, 509,
511, 513, 515, 517, 519, 521, 523, 525, 527, 529, 531, 533, 535, 537, 539, 541, 543, 545,
547, 549, 551, 553, 555, 557, 559, 561, 563, 565, 567, 569, 571, 573, 575, 577, 579, 581,
583, 585, 587, 589, 591, 593, 595, 597, 599, 635, 637, 639, 641, 643, 645, 647, 649, 651,
653, 655, 657, 659, 661, 663, 665, and 667; and/or (b) SEQ ID NOs: 3, 5, 7, 9, 11, 13, 15, 17,
19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147,
149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183,
185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219,
221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255,
257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 291,
293, 295, 297, 299, 601, 603, 605, 607, 609, 611, 613, 615, 617, 619, 621, 623, 625, 627,
629, 631, and 633.
The disclosure also provides an expression vector comprising the polynucleotide described herein. In some embodiments, the vector comprises a plasmid, a cosmid, a bacteriophage or a viral vector.
The disclosure also provides a host cell comprising the polynucleotide described herein or the expression vector described herein. In some embodiments, the host cell is E. coli.
The disclosure also provides a method of preparing an engineered dsRNA ligase polypeptide, which comprises the steps of culturing the host cell described herein and obtaining an engineered dsRNA ligase polypeptide from the culture.
The disclosure also provides an engineered dsRNA ligase catalyst obtainable by culturing the host cells described herein, or according to the method described herein, wherein said engineered dsRNA ligase catalyst comprises cells or culture fluid containing the engineered dsRNA ligase polypeptides, or an article processed therewith, wherein the article refers to an extract obtained from the culture of host cell, an isolated product obtained by isolating or purifying an engineered dsRNA ligase from the extract, or an immobilized product obtained by immobilizing host cell, an extract thereof, or isolated product of the extract.
The disclosure further provides a method of producing an oligonucleotide from two or more oligonucleotide fragments, wherein the method comprises contacting: (i) two or more oligonucleotide fragments; (ii) an engineered dsRNA ligase polypeptide described herein;
(iii) a source of ATP; and (iv) a divalent cation; to obtain an oligonucleotide.
In some embodiments, the source of ATP comprises ATP.
In some embodiments, the source of ATP comprises: (a) polyphosphate kinase (PPK); (b) polyphosphate; and (c) AMP and/or ATP. In some embodiments, the PPK is selected from PPK 12 or ajPAP.
In some embodiments, the method is performed using a sub-stoichiometric concentration of AMP and/or ATP.
In some embodiments, the polyphosphate is a polyphosphate salt. In some embodiments, the polyphosphate salt is sodium polyphosphate (Maddrell’s salt) or sodium hexametaphosphate (Graham’s salt).
In some embodiments, the divalent cation cofactor is Mg2+ or Mn2+.
In some embodiments, the method is performed with a divalent cation concentration of 5-100 mM, optionally 30-50 mM.
In some embodiments, the method further comprises a step of purifying the oligonucleotide.
The disclosure also provides use of the engineered dsRNA ligase polypeptide described herein in the production of an oligonucleotide from two or more oligonucleotide fragments.
In some embodiments, the oligonucleotide is up to 60 nucleotides in length.
In some embodiments, each of the oligonucleotide fragments are 4-16 nucleotides in length, optionally 6-9 nucleotides in length.
In some embodiments, one or more of the oligonucleotide fragment(s) comprises one or two overhangs.
In some embodiments, one or more of the oligonucleotide fragments comprises a chemical modification. In some embodiments, the chemical modification is selected from: (a) a modified backbone, optionally selected from a phosphorothioate (e.g. chiral phosphorothioate) or methylphosphonate intemucleotide linkage; (b) a modified nucleotide, optionally selected from 2'-O-methyl (2’-OMe), 2'-flouro (2’-F), 2'-deoxy, 2'-deoxy-2’- fluoro, 2'-O-methoxyethyl (2'-O-MOE), 2'-O-aminopropyl (2'-O-AP), 2'-O- dimethylaminoethyl (2'-O-DMAOE), 2'-O-dimethylaminopropyl (2'-O-DMAP), 2'-O- dimethylaminoethyloxyethyl (2'-O-DMAEOE), 2'-O-N-methylacetamido (2'-0-NMA), locked nucleic acid (LNA), glycol nucleic acid (GNA), phosphoramidate (e.g. mesyl phosphoramidate), 2',3'-seco nucleotide mimic, 2'-F-arabino nucleotide, abasic nucleotide, 2'- amino modified nucleotide, 2'-alkyl-modified nucleotide, morpholino nucleotide, vinylphosphonate (e.g. 5’ vinylphosphonate), and cyclopropyl phosphonate deoxyribonucleotide; and/or (c) conjugation to a ligand, optionally wherein the ligand comprises one or more N-Acetylgalactosamine (GalNAc) derivatives.
The disclosure also provides a composition comprising: i. the engineered dsRNA ligase polypeptide described herein; ii. a source of ATP; and iii. a divalent cation.
In some embodiments, the composition further comprises two or more oligonucleotide fragments.
The disclosure also provides a kit comprising: i. the engineered dsRNA ligase polypeptide described herein; ii. a source of ATP; iii. a divalent cation; and iv. instructions for use in a method of producing an oligonucleotide from two or more oligonucleotide fragments.
In some embodiments, the source of ATP comprises ATP.
In some embodiments, the source of ATP comprises: (a) polyphosphate kinase (PPK); (b) polyphosphate; and (c) AMP and/or ATP.
In some embodiments, the PPK is selected from PPK 12 or ajPAP.
In some embodiments, the polyphosphate is a polyphosphate salt.
In some embodiments, the polyphosphate salt is sodium polyphosphate (Maddrell’s salt) or sodium hexametaphosphate (Graham’s salt).
In some embodiments, the divalent cation cofactor is Mg2+ or Mn2+.
Definitions
Unless expressly defined otherwise, technical and scientific terms used in this disclosure have the meanings that are commonly understood by people skilled in the art to which this invention belongs. The following references provide one of skill with a general definition of many of the terms used in this invention: Singleton et al., Dictionary of Microbiology and Molecular Biology (2nd ed. 1994); The Cambridge Dictionary of Science and Technology (Walker ed., 1988); The Glossary of Genetics, 5th Ed., R. Rieger et al. (eds.), Springer Verlag (1991); and Hale & Marham, The Harper Collins Dictionary of Biology (1991). As used herein, the following terms have the meanings ascribed to them below, unless specified otherwise.
As used throughout this disclosure, articles such as “a” and “an” refer to one or more than one (at least one) of the grammatical object of the article.
The term “and/or” means either “and” or “or” unless indicated otherwise.
As used herein, the term “about” typically refers to the value which immediately follows the term ‘about’. For example, “about 15 or more nucleotides” typically refers to 15 or more nucleotides. In some embodiments, the term “about” embraces values which are +/- 1, 2 or 3 of the stated value. For example, “about 15 or more nucleotides” may refer to 15+/-3 nucleotides, e.g. 12, 13, 14, 15, 16, 17 or 18 nucleotides.
The terms “double-stranded RNA ligase” and “dsRNA ligase” are used interchangeably herein to refer to an enzyme having dsRNA ligase activity. A dsRNA ligase polypeptide may also be referred to herein as a “dsRNA ligase catalyst”.
A dsRNA ligase of the invention is an ATP-dependent nucleic acid ligase. dsRNA ligase activity as used herein typically involves the ATP-dependent formation of a covalent bond between the 3 ’-OH of a ribonucleotide and the 5’-PO4 of a ribonucleotide or deoxyribonucleotide via the following steps: (1) dsRNA ligase reacts with ATP to form a covalent dsRNA ligase-AMP intermediate and release pyrophosphate; (2) AMP is transferred from the dsRNA ligase-AMP intermediate to the 5 ’-phosphate of a 3’ oligonucleotide fragment forming an adenylated oligonucleotide intermediate; and (3) the 3 ’-OH of a 5’ oligonucleotide fragment attacks the 5 ’ phosphate of the adenylated intermediate resulting in the formation of a phosphodiester bond and the release of AMP.
The stoichiometric concentration of cofactor is the theoretical concentration required to achieve complete ligation in a given ligation reaction. The skilled person can readily derive the stoichiometric concentration of ATP required to achieve complete ligation based on the concentration of oligonucleotide fragments and the number of ligation reactions required to produce the oligonucleotide product. For example, a ligation reaction using 1 mM substrate which requires four ligation reactions has a stoichiometric ATP concentration of 4 mM. A stoichiometric excess of ATP can help ensure that complete ligation is achieved. In some embodiments, the stoichiometric excess is at least 105% of the theoretical stoichiometric concentration of ATP required to achieve complete ligation, e.g. at least 110%, at least 115%, at least 120%, at least 125%, at least 130%, at least 135%, at least 140%, at least 145%, at least 150%, at least 160%, at least 170%, at least 180%, at least 190%, or at least 200%.
The terms "engineered dsRNA ligase", "engineered dsRNA ligase polypeptide", "improved dsRNA ligase polypeptide", and "engineered polypeptide" are used interchangeably herein.
As used herein, the term “oligonucleotide” refers to a nucleic acid, typically comprising up to 100 nucleotides. As used herein, the term “oligonucleotide product” refers to an oligonucleotide formed by the ligation of two or more oligonucleotide fragments by a dsDNA ligase described herein. Oligonucleotide products are also referred to herein simply as oligonucleotides. It will be understood that oligonucleotide products described herein comprise RNA. It will also be understood that oligonucleotide products described herein comprise a double-stranded region. In some embodiments, oligonucleotide products described herein comprise RNA and DNA. For example, a portion of the oligonucleotide
product may be double-stranded DNA, while another portion is double-stranded RNA, forming a DNA-RNA chimera.
The term “therapeutic oligonucleotide” refers to an oligonucleotide that can provide a therapeutic effect, e.g. by interacting with a biomolecule and/or by regulating gene expression. Therapeutic oligonucleotides include, but are not limited to, RNA interference (RNAi) agents and antisense oligonucleotides (ASO). RNAi is a post-transcriptional, targeted gene-silencing technique that uses RNAi agents to degrade messenger RNA (mRNA) containing the same sequence as the RNAi agent. ASOs are single-stranded nucleic acids that can be used to target mRNA derived from a gene of interest. ASOs can alter gene expression via a number of mechanisms including direct steric blockage of mRNA and ribonuclease H (RNase H) mediated degradation of mRNA.
RNAi agents include, as non-limiting examples, siRNAs (small interfering RNAs), dsRNAs (double-stranded RNAs), shRNAs (short hairpin RNAs) and miRNAs (micro RNAs). RNAi agents also include, as additional non-limiting examples, locked nucleic acid (LNA), Morpholino, UNA, threose nucleic acid (TNA), glycol nucleic acid (GNA), peptide nucleic acid (PNA) and fluoro-arabinonucleic acid (FANA). RNAi agents also include molecules in which one or more strands are a mixture of RNA, DNA, LNA, Morpholino, UNA (unlocked nucleic acid), TNA, GNA, and/or FANA. As a non-limiting example, one or both strands of an RNAi agent could be, for example, RNA, except that one or more RNA nucleotides is replaced by DNA, LNA, Morpholino, UNA, TNA, GNA, and/or FANA, etc. In some embodiments, one or both strands of the RNAi agent can be nicked, and both strands can be the same length, or one strand can be shorter than the other. The oligonucleotide of the invention may be any of the RNAi agents described herein.
The term “oligonucleotide fragment” herein refers to a nucleic acid that can be ligated to one or more additional oligonucleotide fragments to provide an oligonucleotide (or oligonucleotide product). Each oligonucleotide fragment corresponds to a portion of the oligonucleotide product. Oligonucleotide fragments may be referred to herein as “substrates” of the ligation reaction.
As described above, dsRNA ligase activity involves the ligation of a 5’ oligonucleotide fragment to a 3 ’ oligonucleotide fragment. In the context of oligonucleotide fragments, the prefixes 5’ and 3’ refer to the relative position of each oligonucleotide fragment in the oligonucleotide product after ligation, wherein the 5’ oligonucleotide fragment is located upstream of the 3 ’ oligonucleotide fragment (when the oligonucleotide product is presented in the 5’ to 3’ direction). As used herein, a “5’ oligonucleotide fragment” typically comprises a
3’ terminal ribonucleotide having a 3 ’-hydroxyl group. As used herein, a “3’ oligonucleotide fragment” comprises a 5’-phosphate, wherein the 5’ terminal nucleotide is a deoxyribonucleotide or a ribonucleotide.
It will be understood that, in some embodiments, an oligonucleotide fragment may be a 3’ oligonucleotide fragment and a 5’ oligonucleotide fragment (e.g. wherein ligation reactions occur at the 5’ and 3’ ends of the oligonucleotide fragment). For example, said oligonucleotide fragment may provide: (i) the 3’ oligonucleotide fragment in a ligation reaction with a 5’ oligonucleotide fragment; and (ii) the 5 ’ oligonucleotide fragment in a ligation reaction with a 3’ oligonucleotide fragment. For example, oligonucleotide fragment 7 in Figure 1A provides: (i) the 3’ oligonucleotide fragment in a ligation reaction with 5’ oligonucleotide fragment 6 and; (ii) the 5 ’ oligonucleotide fragment in a ligation reaction with 3 ’ oligonucleotide fragment 12 to provide oligonucleotide product 2.
A “terminal oligonucleotide fragment” herein refers to a nucleic acid that corresponds to an end (e.g. 5’ or 3’ end) portion of the oligonucleotide product. The 5’ terminal oligonucleotide fragment typically provides a 5’ oligonucleotide fragment for ligation to a 3’ oligonucleotide fragment. The 3’ terminal oligonucleotide fragment typically provides a 3’ oligonucleotide fragment for ligation to a 5 ’ oligonucleotide fragment. In some embodiments, the 5’ terminal oligonucleotide is ligated directly to the 3’ terminal oligonucleotide. In some embodiments, the 5 ’ terminal oligonucleotide and the 3 ’ terminal oligonucleotide are separated by one or more oligonucleotide fragments.
In some embodiments, oligonucleotide fragments described herein comprise RNA and DNA. For example, a portion of an oligonucleotide fragment may be double-stranded DNA, while another portion is double-stranded RNA, forming a DNA-RNA chimera.
The term “overhang” or “nucleotide overhang” herein refers to at least one unpaired nucleotide that protrudes from the end of at least one of the two strands of a double -stranded oligonucleotide. In some embodiments, when a 3 '-end of one strand extends beyond the 5 '-end of the other strand, or vice versa, this forms a nucleotide overhang, e.g., the unpaired nucleotide(s) form the overhang. An overhang that is complementary to the overhang of a second oligonucleotide fragment may be referred to as a “sticky end". The oligonucleotide fragments described herein may have one or two sticky ends.
“Blunt” or “blunt end” means that there are no unpaired nucleotides at that end of a double-stranded oligonucleotide, i.e. , no nucleotide overhang. A “blunt ended” oligonucleotide or oligonucleotide fragment is an oligonucleotide that is double-stranded over its entire length, i. e. , no nucleotide overhang at either end of the molecule.
Double -stranded nucleic acids comprise two anti-parallel and substantially complementary nucleic acid strands which are referred to as “sense” and “antisense” strands. In the context of double-stranded RNAi agents, the “antisense strand” refers to the strand of an RNAi which includes a region that is substantially complementary to a target sequence, e.g. an mRNA sequence. The “sense strand” refers to the strand of an RNAi that includes a region that is substantially complementary to a region of the antisense strand. The sense and antisense strands of an RNAi agent may be referred to as the passenger and guide strands, respectively.
Sequences that are “substantially complementary” may be fully complementary or may contain one or more mismatches upon hybridization, while retaining the ability to hybridize under the conditions most relevant to their ultimate application.
"Conversion" refers to the enzymatic transformation of a substrate to the corresponding product. "Percent conversion" or "conversion" refers to the percentage of oligonucleotide fragments that is converted to oligonucleotide product within a defined period of time under specified conditions. Thus, "enzymatic activity" or "activity" of a ligase can be expressed as the "percent conversion" of oligonucleotide fragments to oligonucleotide product.
Ideally to compare the activity between ligation reactions and account for natural variation in peak intensity between injections, the % conversion to product would be calculated for each sample analyzed using the following equation: 100
Whereby £p, £s and £i = the extinction coefficient of the product, substrate, and intermediate oligonucleotides respectively. In some instances, such as using the analytical method described herein, it is not possible to resolve all substrates, reaction intermediates and products. Therefore, the % conversion according to the above equation cannot be determined. However, in some instances, such as using the analytical method described herein, it is possible to resolve at least one substrate, reaction intermediate and product, such as well- defined GalNAc-containing oligonucleotides, including GalNAc containing substrate fragments (e.g. oligonucleotide (12) as used in the examples described herein), reaction intermediates (e.g. oligonucleotide (14) as demonstrated herein) and product strands (e.g. product oligonucleotide (2) as demonstrated herein). Therefore, a pseudo-% conversion can be calculated, denoted with arbitrary units (AU), which considers only these well resolved species according to the following equation:
whereby 8(2), 8(12) and 8<i4) are the extinction coefficient of oligonucleotides (2), (12) and (14) respectively. Using such a calculation an AU = 1.0 would imply that no more GalNAc- containing substrate or intermediate oligonucleotides are present in the reaction and that they have all be converted to GalNAc -containing product (2). In reality, for samples where AU = 1.0 the only other peak present in the chromatogram corresponds with the product oligonucleotide (2), and no other intermediates or starting materials can be identified. Furthermore, the ratio of the product oligonucleotides (2) and (3) are consistent with that of the authentic standard of siRNA product (1). Taken together, it can be concluded that AU = 1.0 is an approximation that is essentially equivalent to 100 % conversion.
"Improved enzyme properties" refers to an enzyme property that is better or more desirable for a specific purpose as compared to a reference dsRNA ligase such as a wild-type dsRNA ligase or another engineered dsRNA ligase under the same reaction conditions. Improved enzyme properties are exhibited by engineered dsRNA ligase polypeptides in this disclosure. The engineered dsRNA ligase polypeptides described herein exhibit increased enzyme activity (which can be expressed as a percentage of substrate conversion). Additional enzyme properties that may be improved include, but are not limited to, thermal stability, pH activity characteristics, cofactor requirements, and tolerance to inhibitors (e.g., reaction component, substrate or product inhibition).
An "isolated polypeptide" refers to a polypeptide that is substantially separated from other substances with which it is naturally associated, such as proteins, lipids, and polynucleotides. The term comprises polypeptides that have been removed or purified from their naturally occurring environment or expression system (e.g., in host cells or in vitro synthesis). Engineered dsRNA ligase polypeptides may be present in the cell, in the cell culture medium, or prepared in various forms, such as lysates or isolated preparations. As such, in some embodiments, the engineered dsRNA ligase polypeptide may be an isolated polypeptide.
"Wild-type" refers to the form found in nature. For example, a wild-type polypeptide or polynucleotide sequence is a sequence that is present in an organism that can be isolated from sources in nature, and which has not been intentionally modified by manual procedures. The polypeptide sequence of the wild-type dsRNA ligase described herein is provided by
SEQ ID NO: 302. As used herein, the wild-type sequence may also comprise a purification tag and may be provided by SEQ ID NO: 2.
The terms "polynucleotide" and "nucleic acid" are used interchangeably herein.
The terms "protein", "polypeptide" and "peptide" are used interchangeably herein to denote a polymer of at least two amino acids covalently linked by an amide bond, regardless of length or post-translational modification (e.g., glycosylation, phosphorylation, lipidation, myristoylation, ubiquitination, etc.).
"Recombinant" or "engineered" when used with reference to, for example, a cell, nucleic acid or polypeptide, refers to a material or material corresponding to the native or native form of the material, that has been modified in a manner that would not otherwise exist in nature, or is identical thereto but produced or derived from synthetic material and/or by manipulation using recombinant techniques.
The abbreviations used for the genetically encoded amino acids are conventional and are as follows:
When the three-letter abbreviations are used, unless specifically preceded by an “L” or a “D” or clear from the context in which the abbreviation is used, the amino acid may be in either the L- or D-configuration about a-carbon (Ca). For example, whereas “Ala” designates alanine without specifying the configuration about the a-carbon, “D-Ala” and “L- Ala” designate D-alanine and L-alanine, respectively.
When the one-letter abbreviations are used, upper case letters designate amino acids in the L-configuration about the a-carbon and lower-case letters designate amino acids in the D-configuration about the a-carbon. For example, “A” designates L-alanine and “a” designates D-alanine. When polypeptide sequences are presented as a string of one-letter or three-letter abbreviations (or mixtures thereof), the sequences are presented in the amino (N) to carboxy (C) direction in accordance with common convention.
The abbreviations used for the genetically encoding nucleotides are conventional and are as follows: adenosine (A); guanosine (G); cytidine (C); thymidine (T); and uridine (U). Unless specifically delineated, the abbreviated nucleotides may be either ribonucleotides or 2 ’-deoxyribonucleotides. The nucleotides may be specified as being either ribonucleotides or 2 ’-deoxyribonucleotides on an individual basis or on an aggregate basis. When nucleic acid sequences are presented as a string of one-letter abbreviations, the sequences are presented in the 5’ to 3’ direction in accordance with common convention, and the phosphodiester bonds are not indicated.
The skilled person is well aware that guanine, cytosine, adenine, and uracil may be replaced by other moieties without substantially altering the base pairing properties of an oligonucleotide comprising a nucleotide bearing such replacement moiety. For example, without limitation, a nucleotide comprising inosine as its base may base pair with nucleotides containing adenine, cytosine, or uracil. Hence, nucleotides containing uracil, guanine, or adenine may be replaced in the nucleotide sequences of oligonucleotides featured in the present disclosure by a nucleotide containing, for example, inosine. In another example, adenine and cytosine anywhere in the oligonucleotide can be replaced with guanine and uracil, respectively to form Wobble base pairing with the target mRNA.
"Amino acid difference" or "residue difference" refers to the difference in amino acid residues at a position of a polypeptide sequence relative to the amino acid residue at a corresponding position in the reference sequence. The positions of amino acid differences are generally referred to herein as "Xn", where n refers to the corresponding position in the reference sequence on which the residue differences are based. For example, "a residue
difference at position X6 as compared to SEQ ID NO: 302" refers to a difference in amino acid residue at the polypeptide position corresponding to position 6 of SEQ ID NO: 302. Thus, if the reference polypeptide of SEQ ID NO: 302 has a serine at position 6, then "a residue difference at position X2 as compared to SEQ ID NO: 302" refers to an amino acid substitution to any residue other than serine at the position of the polypeptide corresponding to position 6 of SEQ ID NO: 302.
The specific amino acid residue difference at the position may be indicated as "XnY" or “Xn is Y”, wherein "Xn" specifies the corresponding position in the reference sequence as described above, and "Y" is the single letter identifier of the residue present at that position in the engineered polypeptide. Specific amino acid differences may also be denoted by the conventional notation "AnY", where A is a single letter identifier of the residue in the reference sequence, "n" is the number of residue position in the reference sequence, and “Y” is the single letter identifier of the residue present at that position in the engineered polypeptide.
In some examples, an engineered polypeptide of this disclosure may comprise one or more amino acid residue differences relative to a reference sequence, which is indicated by a list of specific positions at which residue differences are present relative to a reference sequence. In some embodiments, more than one amino acid residue can be used in a specific residue position of an engineered polypeptide, the various amino acid residues can be listed as alternatives, e.g. “X19 is Q or D”.
Deletion of an amino acid may be represented by e.g. “an amino acid sequence comprising Xn-” indicates that the amino acid sequence contains a deletion at the position corresponding to “Xn” in the reference sequence. "Deletion" refers to the modification of a polypeptide by removing one or more amino acids from a reference polypeptide. Deletions can include the removal of one or more amino acids, two or more amino acids, five or more amino acids, ten or more amino acids, fifteen or more amino acids, or twenty or more amino acids, up to 10% of the total number of amino acids of the enzyme, or up to 20% of the total number of amino acids making up the reference enzyme while retaining the enzymatic activity of the engineered dsRNA ligase and/or retaining the improved properties of the engineered dsRNA ligase. Deletion may involve the internal portion and/or the terminal portion of the polypeptide. In various embodiments, deletions may include a contiguous segment or may be discontinuous.
In the context of the numbering for a given amino acid or polynucleotide sequence, "corresponding to," "reference to" or "relative to" refers to the numbering of the residues of a
specified reference when the given amino acid or polynucleotide sequence is compared to the reference sequence. In other words, the residue number or residue position of a given sequence is designated with respect to the reference sequence, rather than by the actual numerical position of the residue within the given amino acid or polynucleotide sequence. For example, a given amino acid sequence such as an engineered dsRNA ligase can be aligned to a reference sequence by introducing gaps to optimize residue matches between the two sequences. In these cases, although there are gaps, the numbering of the residue in the given amino acid or polynucleotide sequence is made with respect to the reference sequence to which it has been aligned.
"Reference sequence" refers to a defined sequence that is used as a basis for sequence comparison. The reference sequence may be a subset of a larger sequence, for example, a full-length gene or a fragment of a polypeptide sequence. In some embodiments, a "reference sequence" is a wild-type sequence. In some embodiments, a "reference sequence" is an engineered or altered sequences.
Methods of determining percentage sequence identity are known in the art. By way of example, when assessing sequence identity, a sequence having a defined number of contiguous nucleotides or amino acids may be aligned with a nucleic acid or peptide sequence (having the same number of contiguous nucleotides or amino acids) from the corresponding portion of a nucleic acid or peptide sequence disclosed herein. The percentage sequence identity can be calculated by determining the number of positions at which either the identical nucleic acid base or amino acid residue occurs in both sequences, or a nucleic acid base or amino acid residue is aligned with a gap to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the sequence and multiplying the result by 100 to yield the percentage of sequence identity. Those skilled in the art will appreciate that there are many established algorithms available to align two sequences. The optimal alignment of sequences for comparison can be conducted, for example, by the local homology algorithm of Smith and Waterman, 1981, Adv. Appl. Math. 2: 482, by the Homology alignment algorithm of Needleman and Wunsch, 1970, J. Mol. Biol. 48: 443, by the search for similarity method of Pearson and Lipman, 1988, Proc. Natl. Acad. Sci. USA 85: 2444, by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the GCG Wisconsin Package) or by visual inspection (see generally, Current Protocols in Molecular Biology, FM Ausubel et al. eds., Current Protocols, a Joint Venture between Greene Publishing Associates, Inc. and John Wiley & Sons, Inc., (1995 Supplement) (Ausubel)). Examples of algorithms that are suitable for
determining the percent sequence identity and percent sequence similarity are the BLAST and BLAST 2.0 algorithms, which are described in Altschul et al., 1990, J. Mol. Biol. 215: 403-410 and Altschul et al., 1977, Nucleic Acids Res. 3389-3402, respectively. Software for performing BLAST analysis is publicly available through the National Center for Biotechnology Information website. The algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short words of length W in the query sequence, which either match or satisfy some positive-valued threshold scores T when aligned with a word of the same length in the database sequence. T is referred to as, the neighborhood word score threshold (Altschul et al., Supra). These initial neighborhood word hits serve as seeds for initiating searches to find longer HSPs that contain them. The word hits are then extended in both directions along each sequence for as far as the cumulative alignment score can be increased. For nucleotide sequences, the cumulative scores are calculated using the parameters M (reward score for matched pair of residues; always> 0) and N (penalty score for mismatched residues; always <0). For amino acid sequences, a scoring matrix is used to calculate the cumulative score. The extension of the word hits in each direction are halted when: the cumulative alignment score falls off by the quality X from its maximum achieved value; the cumulative score goes 0 or below, due to the accumulation of one or more negative -scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm parameters W, T and X determine the sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) uses as defaults a word length (W) of 11, the expected value (E) of 10, M = 5, N = -4, and a comparison of both strands as a default value. For amino acid sequences, the BLASTP program uses as defaults the word length (W) of 3, the expected value (E) of 10 and the BLOSUM62 scoring matrix (see Henikoff and Henikoff, 1989, Proc Natl Acad Sci USA 89: 10915). Exemplary determination of sequence alignments and %sequence identity can employ the BESTFIT or GAP programs in the GCG Wisconsin Software package (Accelrys, Madison WI), using the default parameters provided.
It will be appreciated that, regardless of the percent sequence identity to a reference sequence, an engineered dsRNA ligase possesses dsRNA ligase activity.
"Suitable reaction conditions" refer to those conditions (e.g., enzyme loading, substrate loading, temperature, pH, etc.) in the reaction system, under which the substrate is converted to the desired product. Suitable reaction conditions can be readily identified by the person skilled in the art. Exemplary "suitable reaction conditions" are provided in the present disclosure and illustrated by examples.
Engineered dsRNA ligase polypeptides
The disclosure provides an engineered dsRNA ligase polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346,
348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382,
384, 386, 388, 390, 392, 394, 396, 398, 400, 402, 404, 406, 408, 410, 412, 414, 416, 418,
420, 422, 424, 426, 428, 430, 432, 434, 436, 438, 440, 442, 444, 446, 448, 450, 452, 454,
456, 458, 460, 462, 464, 466, 468, 470, 472, 474, 476, 478, 480, 482, 484, 486, 488, 490,
492, 494, 496, 498, 500, 502, 504, 506, 508, 510, 512, 514, 516, 518, 520, 522, 524, 526,
528, 530, 532, 534, 536, 538, 540, 542, 544, 546, 548, 550, 552, 554, 556, 558, 560, 562,
564, 566, 568, 570, 572, 574, 576, 578, 580, 582, 584, 586, 588, 590, 592, 594, 596, 598, and
600.
The disclosure provides an engineered dsRNA ligase polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, and 668.
The disclosure provides an engineered dsRNA ligase polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346,
348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382,
384, 386, 388, 390, 392, 394, 396, 398, 400, 402, 404, 406, 408, 410, 412, 414, 416, 418,
420, 422, 424, 426, 428, 430, 432, 434, 436, 438, 440, 442, 444, 446, 448, 450, 452, 454,
456, 458, 460, 462, 464, 466, 468, 470, 472, 474, 476, 478, 480, 482, 484, 486, 488, 490,
492, 494, 496, 498, 500, 502, 504, 506, 508, 510, 512, 514, 516, 518, 520, 522, 524, 526,
528, 530, 532, 534, 536, 538, 540, 542, 544, 546, 548, 550, 552, 554, 556, 558, 560, 562,
564, 566, 568, 570, 572, 574, 576, 578, 580, 582, 584, 586, 588, 590, 592, 594, 596, 598,
600, 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, and 668.
The disclosure also provides an engineered dsRNA ligase polypeptide having dsRNA ligase activity and comprising an amino acid sequence having at least 80% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NOs: 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344,
346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380,
382, 384, 386, 388, 390, 392, 394, 396, 398, 400, 402, 404, 406, 408, 410, 412, 414, 416,
418, 420, 422, 424, 426, 428, 430, 432, 434, 436, 438, 440, 442, 444, 446, 448, 450, 452,
454, 456, 458, 460, 462, 464, 466, 468, 470, 472, 474, 476, 478, 480, 482, 484, 486, 488,
490, 492, 494, 496, 498, 500, 502, 504, 506, 508, 510, 512, 514, 516, 518, 520, 522, 524,
526, 528, 530, 532, 534, 536, 538, 540, 542, 544, 546, 548, 550, 552, 554, 556, 558, 560,
562, 564, 566, 568, 570, 572, 574, 576, 578, 580, 582, 584, 586, 588, 590, 592, 594, 596,
598, and 600, wherein the engineered dsRNA ligase polypeptide does not the comprise the amino acid sequence of SEQ ID NO: 302.
The disclosure also provides an engineered dsRNA ligase polypeptide having dsRNA ligase activity and comprising an amino acid sequence having at least 80% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NOs: 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, and 668, wherein the engineered dsRNA ligase polypeptide does not the comprise the amino acid sequence of SEQ ID NO: 302.
The disclosure also provides an engineered dsRNA ligase polypeptide having dsRNA ligase activity and comprising an amino acid sequence having at least 80% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NOs: 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344,
346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380,
382, 384, 386, 388, 390, 392, 394, 396, 398, 400, 402, 404, 406, 408, 410, 412, 414, 416,
418, 420, 422, 424, 426, 428, 430, 432, 434, 436, 438, 440, 442, 444, 446, 448, 450, 452,
454, 456, 458, 460, 462, 464, 466, 468, 470, 472, 474, 476, 478, 480, 482, 484, 486, 488,
490, 492, 494, 496, 498, 500, 502, 504, 506, 508, 510, 512, 514, 516, 518, 520, 522, 524,
526, 528, 530, 532, 534, 536, 538, 540, 542, 544, 546, 548, 550, 552, 554, 556, 558, 560,
562, 564, 566, 568, 570, 572, 574, 576, 578, 580, 582, 584, 586, 588, 590, 592, 594, 596,
598, 600, 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, and
668, wherein the engineered dsRNA ligase polypeptide does not the comprise the amino acid sequence of SEQ ID NO: 302.
The disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide, which is a polypeptide of: (a) a polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354,
356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390,
392, 394, 396, 398, 400, 402, 404, 406, 408, 410, 412, 414, 416, 418, 420, 422, 424, 426,
428, 430, 432, 434, 436, 438, 440, 442, 444, 446, 448, 450, 452, 454, 456, 458, 460, 462,
464, 466, 468, 470, 472, 474, 476, 478, 480, 482, 484, 486, 488, 490, 492, 494, 496, 498,
500, 502, 504, 506, 508, 510, 512, 514, 516, 518, 520, 522, 524, 526, 528, 530, 532, 534,
536, 538, 540, 542, 544, 546, 548, 550, 552, 554, 556, 558, 560, 562, 564, 566, 568, 570, 572, 574, 576, 578, 580, 582, 584, 586, 588, 590, 592, 594, 596, 598, and 600; or (b) a polypeptide having dsRNA ligase activity, which comprises an amino acid sequence having (i) at least 80% sequence identity to one of the polypeptides recited in (a), and (ii) a substitution, deletion, addition or insertion of one or more amino acid residues relative to said one amino acid sequence recited in (a); wherein the engineered dsRNA ligase polypeptide does not the comprise the amino acid sequence of SEQ ID NO: 302.
The disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide, which is a polypeptide of: (a) a polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, and 668; or (b) a polypeptide having dsRNA ligase activity, which comprises an amino acid sequence having (i) at least 80% sequence identity to one of the polypeptides recited in (a), and (ii) a substitution, deletion, addition or insertion of one or more amino acid residues relative to said one amino acid sequence recited in (a); wherein the engineered dsRNA ligase polypeptide does not the comprise the amino acid sequence of SEQ ID NO: 302.
The disclosure provides an engineered double-stranded RNA (dsRNA) ligase polypeptide, which is a polypeptide of: (a) a polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354,
356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390,
392, 394, 396, 398, 400, 402, 404, 406, 408, 410, 412, 414, 416, 418, 420, 422, 424, 426,
428, 430, 432, 434, 436, 438, 440, 442, 444, 446, 448, 450, 452, 454, 456, 458, 460, 462,
464, 466, 468, 470, 472, 474, 476, 478, 480, 482, 484, 486, 488, 490, 492, 494, 496, 498,
500, 502, 504, 506, 508, 510, 512, 514, 516, 518, 520, 522, 524, 526, 528, 530, 532, 534,
536, 538, 540, 542, 544, 546, 548, 550, 552, 554, 556, 558, 560, 562, 564, 566, 568, 570,
572, 574, 576, 578, 580, 582, 584, 586, 588, 590, 592, 594, 596, 598, 600, 636, 638, 640,
642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, and 668; or (b) a polypeptide having dsRNA ligase activity, which comprises an amino acid sequence having (i) at least 80% sequence identity to one of the polypeptides recited in (a), and (ii) a substitution, deletion, addition or insertion of one or more amino acid residues relative to said one amino acid sequence recited in (a); wherein the engineered dsRNA ligase polypeptide does not the comprise the amino acid sequence of SEQ ID NO: 302.
In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence having at least 85% sequence identity to an even numbered sequence identifier of SEQ ID NOs: 304-600, optionally at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or at least 99.5% sequence identity to an even numbered sequence identifier of SEQ ID NOs: 304-600. In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence having at least 80% sequence identity to an even numbered sequence identifier of SEQ ID NOs: 304-600, optionally at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or at least 99.5% sequence identity to an even numbered sequence identifier of SEQ ID NOs: 304-600.
In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence having at least 80% sequence identity to an even numbered sequence identifier of SEQ ID NOs: 636-668, optionally at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or at least 99.5% sequence identity to an even numbered sequence identifier of SEQ ID NOs: 636-668.
In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence having at least 80% sequence identity to an even numbered sequence identifier of SEQ ID NOs: 304-600 or 636-668, optionally at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or at least 99.5% sequence identity to an even numbered sequence identifier of SEQ ID NOs: 304-600 or 636-668.
The engineered dsRNA ligase polypeptides represented by the even numbered sequence identifiers of SEQ ID NOs: 304 to 600 and 636 to 668 exhibit higher activity than that of SEQ ID NO: 302, as shown in the Examples. The dsRNA ligase polypeptides used in the Examples (represented by even numbered sequence identifiers of SEQ ID NOs: 4 to 300 and 602 to 634, respectively) comprise an even numbered sequence identifier of SEQ ID NOs: 304 to 600 and 636 to 668 and an N-terminal purification tag (MHHHHHHENLYFQS (SEQ ID NO: 669)). For example, SEQ ID NO: 4 comprises: (i) the N-terminal purification tag MHHHHHHENLYFQS (SEQ ID NO: 669); and (ii) SEQ ID NO: 304. dsRNA ligase
polypeptides represented by even numbered sequence identifiers of SEQ ID NOs: 304 to 600 and 636 to 668 do not comprise the N-terminal purification tag represented by SEQ ID NO: 669.
The wild-type dsRNA ligase polypeptide comprises the amino acid sequence of SEQ ID NO: 302 (also accessible under UniProt accession number Q7Y4V8). SEQ ID NO: 2 comprises: (i) the N-terminal purification tag MHHHHHHENLYFQS (SEQ ID NO: 669); and (ii) SEQ ID NO: 302. It will be readily understood that SEQ ID NOs: 2 and 302 both comprise the wild-type dsRNA ligase polypeptide sequence and so both sequences may be referred to herein as the wild-type sequence.
In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 370, 488, 526, 578, 588, 590 and 592. In some embodiments, the engineered dsRNA ligase polypeptide comprises the amino acid sequence of SEQ ID NO: 666. In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 370, 488, 526, 578, 588, 590, 592 and 666. In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 70, 188, 226, 278, 288, 290 and 292. In some embodiments, the engineered dsRNA ligase polypeptide comprises the amino acid sequence of SEQ ID NO: 632. In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 70, 188, 226, 278, 288, 290, 292 and 632. SEQ ID NOs: 70, 188, 226, 278, 288, 290, 292 and 632 comprise: (i) an N-terminal purification tag MHHHHHHENLYFQS (SEQ ID NO: 669); and (ii) an amino acid sequence provided by SEQ ID NOs: 370, 488, 526, 578, 588, 590, 592 and 666, respectively.
In some embodiments, the engineered dsRNA ligase polypeptide comprises the amino acid sequence of SEQ ID NO: 370. In some embodiments, the engineered dsRNA ligase polypeptide comprises the amino acid sequence of SEQ ID NO: 488. In some embodiments, the engineered dsRNA ligase polypeptide comprises the amino acid sequence of SEQ ID NO: 526. In some embodiments, the engineered dsRNA ligase polypeptide comprises the amino acid sequence of SEQ ID NO: 578. In some embodiments, the engineered dsRNA ligase polypeptide comprises the amino acid sequence of SEQ ID NO: 588. In some embodiments, the engineered dsRNA ligase polypeptide comprises the amino acid sequence of SEQ ID NO: 590. In some embodiments, the engineered dsRNA ligase polypeptide comprises the amino
acid sequence of SEQ ID NO: 592. In some embodiments, the engineered dsRNA ligase polypeptide comprises the amino acid sequence of SEQ ID NO: 666.
The disclosure also provides an engineered dsRNA ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to SEQ ID NO: 302, which produces at least 5% more oligonucleotide product than a dsRNA ligase polypeptide comprising the amino acid sequence of SEQ ID NO: 302 under the same ligation reaction conditions, wherein the engineered dsRNA ligase polypeptide does not the comprise the amino acid sequence of SEQ ID NO: 302. In some embodiments, the engineered dsRNA ligase polypeptide produces at least 10%, at least 15%, at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, or at least 100% more oligonucleotide product than a dsRNA ligase polypeptide comprising the amino acid sequence of SEQ ID NO: 302 under the same ligation reaction conditions. In some embodiments, the ligation reaction conditions are as described herein.
In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence having at least 85% sequence identity to SEQ ID NO: 302, optionally at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or at least 99.5% sequence identity to SEQ ID NO: 302.
In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence having at least 80% sequence identity to SEQ ID NO: 302, optionally at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or at least 99.5% sequence identity to SEQ ID NO: 302.
In some embodiments, the ligation reaction conditions include about 1 pM to about 10 mM oligonucleotide fragment, a source of ATP, about 5 mM to about 100 mM divalent cation, and about 0.5 g/L to about 10 g/L engineered dsRNA ligase polypeptide, pH of about 4.0 to about 8.0, and temperature of about 10 °C to about 50 °C. In some embodiments, the source of ATP is a stoichiometric concentration of ATP or a stoichiometric excess of ATP. In some embodiments, the source of ATP comprises: (a) polyphosphate kinase (PPK); (b) polyphosphate; and (c) AMP and/or ATP.
In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. two or
more, three or more, four or more, five or more, six or more, seven or more, eight or more, nine or more, or ten or more) amino acid residues selected from: X6, X7, X15, X19, X29, X36, X39, X46, X47, X49, X51, X53, X56, X57, X60, X63, X64, X66, X67, X87, X88, X91, X93, X103, X105, X107, X114, X122, X126, X129, X130, X131, X137, X144, X146, X158, X163, X173, X178, X190, X196, X216, X218, X221, X228, X230, X232, X235, X236, X237, X238, X239, X242, X243, X244, X251, X252, X254, X255, X258, X269, X280, X284, X285, X293, X296, X301, X303, X305, X314, X325, and X328, wherein the numbering refers to SEQ ID NO: 302. In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. two or more, three or more, four or more, five or more, six or more, seven or more, eight or more, nine or more, or ten or more) amino acid residues selected from: X6, X7, X15, X19, X29, X36, X39, X44, X45, X46, X47, X49, X51, X53, X56, X57, X60, X63, X64, X66, X67, X87, X88, X89, X91, X92, X93, X103, X105, X107, XI 14, X122, X126, X129, X130, X131, X137, X144, X146, X158, X163, X173, X178, X185, X190, X196, X216, X218, X221, X228, X230, X232, X235, X236, X237, X238, X239, X242, X243, X244, X251, X252, X254, X255, X258, X269, X280, X284, X285, X293, X296, X301, X303, X305, X313, X314, X325, and X328, wherein the numbering refers to SEQ ID NO: 302.
In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence comprising one or more (e.g. two or more, three or more, four or more, five or more, six or more, seven or more, eight or more, nine or more, or ten or more) of the following amino acid residues: X6 is G ; X7 is Q; X15 is R, D or E; X19 is Q or D; X29 is N or L; X36 is V; X39 is A; X46 is Y; X47 is E; X49 is G; X51 is L; X53 is Y; X56 is R or A; X57 is S; X60 is T, G or P; X63 is S, Q or G; X64 is R, T, Q, F, G, or M; X66 is F or W; X67 is N; X87 is T, P, K or absent; X88 is C; X91 is S; X93 is G, C, or A; X103 is V, C, Y, or T; X105 is V; X107 is R or T; XI 14 is N; X122 is W; X126 is G; X129 is N; X130 is R, S or Y; X131 is R; X137 is V or C; X144 is N; X146 is R; X158 is W; X163 is G; X173 is L; X178 is R; X190 is Q; X196 is S or C; X216 is L or R; X218 is N; X221 is I; X228 is R; X230 is T; X232 is R; X235 is A, T, or G; X236 is S, L, or F; X237 is S, Q, or R; X238 is F; X239 is G or R; X242 is R or M; X243 is N, S, G, or M; X244 is G or K; X251 is D or L; X252 is V; X254 is K; X255 is C; X258 is V; X269 is L; X280 is W; X284 is A; X285 is A; X293 is R; X296 is R; X301 is G, L, E, or F; X303 is Q; X305 is G; X314 is A or V; X325 is R; and X328 is R; wherein the numbering refers to SEQ ID NO: 302. In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence comprising one or
more (e.g. two or more, three or more, four or more, five or more, six or more, seven or more, eight or more, nine or more, or ten or more) of the following amino acid residues: X6 is G or E; X7 is Q; X15 is R, D or E; X19 is Q or D; X29 is N or L; X36 is V; X39 is A; X44 is V; X45 is V; X46 is Y; X47 is E; X49 is G; X51 is L; X53 is Y; X56 is R or A; X57 is S; X60 is T, G or P; X63 is S, Q or G; X64 is R, T, Q, F, G, or M; X66 is F or W; X67 is N; X87 is T, P, K or absent; X88 is C; X89 is T; X91 is S; X92 is D; X93 is G, C, or A; X103 is V, C, Y, or T; X105 is V; X107 is R or T; XI 14 is N; X122 is W; X126 is G; X129 is N; X130 is R, S or Y; X131 is R; X137 is V or C; X144 is N; X146 is R; X158 is W; X163 is G; X173 is L; X178 is R; X185 is K; X190 is Q; X196 is S or C; X216 is L or R; X218 is N; X221 is I; X228 is R; X230 is T; X232 is R; X235 is A, T, or G; X236 is S, L, or F; X237 is S, Q, R, L or G; X238 is F; X239 is G or R; X242 is R or M; X243 is N, S, G, or M; X244 is G or K; X251 is D or L; X252 is V; X254 is K; X255 is C; X258 is V; X269 is L; X280 is W; X284 is A; X285 is A; X293 is R; X296 is R; X301 is G, L, E, or F; X303 is Q; X305 is G; X313 is A; X314 is A or V; X325 is R; and X328 is R; wherein the numbering refers to SEQ ID NO: 302.
In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, 9 or more, 10 or more, or 11) amino acid residues selected from: X15, X19, X36, X39, X53, X218, X221, X237, X251, X255, and X285; wherein the numbering refers to SEQ ID NO: 302. In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence comprising one or more (e.g. 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, 9 or more, 10 or more, or 11) of the following amino acid residues: X15 is D or E; X19 is D; X36 is V; X39 is A; X53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A; wherein the numbering refers to SEQ ID NO: 302.
In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, 9 or more, 10 or more, 11 or more, or 12) amino acid residues selected from: X15, X19, X36, X39, X53, X185, X218, X221, X237, X251, X255, and X285; wherein the numbering refers to SEQ ID NO: 302. In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence comprising one or more (e.g. 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, 9 or more, 10 or more, 11 or more, or 12) of the following amino acid residues: X15 is D or E; X19 is D; X36 is V; X39 is A; X53 is Y; X185
is K; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A; wherein the numbering refers to SEQ ID NO: 302.
In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. two or more or three or more) amino acid residues selected from: X36, X39, X218 and X221; wherein the numbering refers to SEQ ID NO: 302. In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence comprising one or more (e.g. two or more or three or more) of the following amino acid residues: X36 is V; X39 is A; X218 is N; and X221 is I; wherein the numbering refers to SEQ ID NO: 302.
In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 at amino acid residues: X36, X39, X218 and X221; wherein the numbering refers to SEQ ID NO: 302. In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence comprising the following amino acid residues: X36 is V; X39 is A; X218 is N; and X221 is I; wherein the numbering refers to SEQ ID NO: 302.
In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. two or more) amino acid residues selected from: X39, X218 and X221; wherein the numbering refers to SEQ ID NO: 302. In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence comprising one or more (e.g. two or more) of the following amino acid residues: X39 is A; X218 is N; and X221 is I; wherein the numbering refers to SEQ ID NO: 302.
In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 at amino acid residues: X39, X218 and X221; wherein the numbering refers to SEQ ID NO: 302. In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence comprising the following amino acid residues: X39 is A; X218 is N; and X221 is I; wherein the numbering refers to SEQ ID NO: 302.
In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. two or more or three or more) amino acid residues selected from: X39, X218, X221, and X255; wherein the numbering refers to SEQ ID NO: 302. In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence comprising one or more (e.g.
two or more orthree or more) of the following amino acid residues: X39 is A; X218 is N; X221 is I; and X255 is C; wherein the numbering refers to SEQ ID NO: 302.
In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 at amino acid residues: X39, X218, X221, and X255; wherein the numbering refers to SEQ ID NO: 302. In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence comprising the following amino acid residues: X39 is A; X218 is N; X221 is I; and X255 is C; wherein the numbering refers to SEQ ID NO: 302.
In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. two or more, three or more, four or more, five or more, six or more, or seven or more) amino acid residues selected from: X39, X53, X218, X221, X237, X251, X255 and X285; wherein the numbering refers to SEQ ID NO: 302. In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence comprising one or more (e.g. two or more, three or more, four or more, five or more, six or more, or seven or more) of the following amino acid residues: X39 is A; X 53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A; wherein the numbering refers to SEQ ID NO: 302.
In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 at amino acid residues: X39, X53, X218, X221, X237, X251, X255 and X285; wherein the numbering refers to SEQ ID NO: 302. In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence comprising the following amino acid residues: X39 is A; X 53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A; wherein the numbering refers to SEQ ID NO: 302.
In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more (e.g. two or more, three or more, four or more, five or more, six or more, seven or more, or eight or more) amino acid residues selected from: X15, X39, X53, X218, X221, X237, X251, X255 and X285; wherein the numbering refers to SEQ ID NO: 302. In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence comprising one or more (e.g. two or more, three or more, four or more, five or more, six or more, seven or more, or eight or more) of the following amino acid residues: X15 is D or E; X39 is A; X 53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A; wherein the numbering refers to SEQ ID NO: 302.
In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 at amino acid residues: X15, X39, X53, X218, X221, X237, X251, X255 and X285; wherein the numbering refers to SEQ ID NO: 302. In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence comprising the following amino acid residues: X15 is D; X39 is A; X 53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A; wherein the numbering refers to SEQ ID NO: 302. In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence comprising the following amino acid residues: X15 is E; X39 is A; X 53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A; wherein the numbering refers to SEQ ID NO: 302.
In some embodiments, the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X15, X39, X53, X185, X218, X221, X237, X251, X255 and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X15 is D; X39 is A; X 53 is Y; XI 85 is K; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A.
In some embodiments, the engineered dsRNA ligase polypeptide comprises a purification tag. Purification tags are typically appended to polypeptides so that they can be purified from their crude biological source using an affinity technique. In some embodiments, the purification tag comprises a poly-histidine tag. Poly-histidine tags bind to matrices bearing immobilized metal ions and can be used to purify polypeptides by affinity chromatography. In some embodiments, the purification tag further comprises a protease recognition site for removal of the purification tag. In some embodiments, the protease recognition site comprises a Tobacco Etch Virus (TEV) protease recognition sequence. In some embodiments, the purification tag comprises the amino acid sequence MHHHHHHENLYFQS (SEQ ID NO: 669).
In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184,
186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220,
222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256,
258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292,
294, 296, 298 and 300.
In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 602, 604, 606, 608, 610, 612, 614, 616, 618, 620, 622, 624, 626, 628, 630, 632, and 634.
In some embodiments, the engineered dsRNA ligase polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148,
150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184,
186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220,
222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256,
258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292,
294, 296, 298, 300, 602, 604, 606, 608, 610, 612, 614, 616, 618, 620, 622, 624, 626, 628,
630, 632, and 634.
Immobilization
The disclosure also provides a polypeptide immobilized on a solid support material by chemical bond or a physical adsorption method, wherein the polypeptide comprises an engineered dsRNA ligase polypeptide disclosed herein.
Immobilization of a polypeptide by physical absorption typically involves the polypeptide being physically adsorbed or attached onto a solid support material. Adsorption can occur through weak non-specific forces such as van der Waals, hydrophobic interactions and hydrogen bonds. Physical adsorption may be achieved by soaking the support material in a solution of the polypeptide and incubating to allow time for physical adsorption to occur. Immobilization of a polypeptide by chemical bonding typically involves the attachment of the polypeptide to the support material via a covalent bond.
In some embodiments, the dsRNA ligase polypeptide is immobilized via a spacer positioned between the dsRNA ligase polypeptide and the solid material. In some embodiments, the spacer is a peptide (e.g. a peptide comprising 2 or more, 3 or more, 4 or
more, 5 or more, 10 or more, 15 or more, 20 or more, 25 or more, 30 or more, 35 or more, 40 or more, 45 or more, 50 or more, 75 or more, or 100 or more amino acids).
In some embodiments, the engineered dsRNA ligase polypeptide is immobilized using affinity immobilization. In some embodiments, the engineered dsRNA ligase polypeptide is immobilized using metal affinity immobilization, e.g. by contacting His- tagged engineered dsRNA ligase polypeptide with immobilized metal such as nickel, zinc, cobalt, or copper.
In some embodiments, the solid support material comprises a membrane, resin, solid carrier, or other solid phase material. A solid support material can be composed of organic polymers such as polystyrene, polyethylene, polypropylene, polyfluoroethylene, polyethyleneoxy, polymethacrylate, and polyacrylamide, as well as co-polymers and grafts thereof. A solid support material can also be inorganic, such as glass, silica, controlled pore glass (CPG), reverse phase silica or metal, such as gold or platinum. The configuration of a solid support material can be in the form of beads, spheres, particles, granules, a gel, a membrane or a surface. Surfaces can be planar, substantially planar, or non -planar. Solid support materials can be porous or non-porous and can have swelling or non-swelling characteristics. A solid support material can be configured in the form of a well, depression, or other container, vessel, feature, or location. Solid support materials useful for immobilizing the dsRNA ligase polypeptide for carrying out a ligase reaction include but are not limited to beads or resins such as polymethacrylate, e.g., polymethacrylates with epoxy functional groups, polymethacrylates with amino epoxy functional groups, polymethacrylates, styrene/DVB copolymer or polymethacrylates with octadecyl functional groups.
Exemplary solid supports include, but are not limited to, chitosan beads, Eupergit C, IB-150, IB-350, IB-C435, IB-A369, IB-A161, IB-A171, IBS500, IB-S861, SEPABEADS (Mitsubishi), e.g., Sepabeads EC-EP, Sepabeads EC-HFA, Sepabeads EC-HG, Sepabeads EC-BU, Sepabeads EC-OD, Sepabeads EC-CM, Sepabeads EC-IDA, Sepabeads EC-EA, Sepabeads EC-HA, Sepabeads EC-QA, Sepabeads EXE, Sepabeads EXA, Dilbeads-TA, Amberzyme Oxirane, Amberlite XAD-7HP, Amberlite FPA98C1, Amberlite IRA958C1, Amberlite IRA67, Amberlite FPA90C1, Amberlite FPA40C1, Amberlite XAD18, Accurel EP100, ECR8206F/5730, ECR8206/5803, ECR8206M/5749, ReliZyme EP403, ReliZymeEPl 13, Lewatit VP OC 1600, Diaion WA20, Diaion WA21J, Diaion WA30, Dowex 66, Diaion HPA-25L, Lewatit VP OC 1064 MD PH, Lewatit VP OC 1163, Lifetech ECR8304F. Lifetech ECR8309F, Lifetech ECR8315F, Lifetech ECR8204F, Lifetech
ECR8285, Lifetech ECR1090M, Lifetech ECR1030M, Lifetech ECR8806M, Chromalite (MAM2/F) D6591, Chromalite MIDA/M, Chromalite MIDA/M/Le, Chromalite MIDA/M/Co, Chromalite MIDA/M/Ni, Chromalite MIDA/M/Cu and Chromalite MIDA/M/Zn.
Polynucleotides, control sequences, expression vectors and host cells that can be used to produce engineered dsRNA ligase polypeptides
In another aspect, this disclosure provides polynucleotides encoding engineered polypeptides having dsRNA ligase activity described herein. The polynucleotides can be linked to one or more heterologous regulatory sequences that control gene expression to produce recombinant polynucleotides that are capable of expressing the engineered polypeptides. Expression constructs comprising a heterologous polynucleotide encoding an engineered dsRNA ligase may be introduced into a suitable host cell to express the corresponding engineered dsRNA ligase polypeptide.
As apparent to one skilled in the art, the availability of protein sequences and knowledge of codons corresponding to a variety of amino acids provide an illustration of all possible polynucleotides that encode the protein sequence of interest. The degeneracy of the genetic code, in which the same amino acid is encoded by selectable or synonymous codons, allows for the production of an extremely large number of polynucleotides, all of which encode the engineered dsRNA ligase polypeptides disclosed herein. Thus, upon determination of a particular amino acid sequence, one skilled in the art can generate any number of different polynucleotides by modifying one or more codons in a manner that does not alter the amino acid sequence of the protein. In this regard, this disclosure specifically contemplates each and every possible alteration of a polynucleotide that can be made by selecting a combination based on possible codon selections, for any of the polypeptides disclosed herein, comprising those amino acid sequences of exemplary engineered polypeptides listed in Examples 7 to 12, any of the polypeptides disclosed as even sequence identifiers of SEQ ID NOs: 304 to 600 and 636 to 668, and any of the polypeptides disclosed as even sequence identifiers of SEQ ID NOs: 4 to 300 and 602 to 634.
In various embodiments, the codons are preferably selected to accommodate the host cell in which the recombinant protein is produced. For example, codons preferred for bacteria are used to express genes in bacteria; codons preferred for yeast are used to express genes in yeast; and codons preferred for mammals are used for gene expression in mammalian cells.
In some embodiments, the disclosure provides a polynucleotide encoding an engineered dsRNA ligase polypeptide described above.
In some embodiments, the polynucleotide encodes a polypeptide comprising an amino acid sequence having at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or
at least 99% to a reference sequence that is an even numbered sequence identifier of SEQ ID NOs: 304 to 600 or 636 to 668, wherein the polypeptide has dsRNA ligase activity and exhibits higher enzyme activity than a polypeptide comprising the amino acid of SEQ ID NO: 2 and/or 302.
In some embodiments, the polynucleotide encodes an engineered dsRNA ligase polypeptide described herein and comprises a nucleic acid sequence having at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity to a reference polynucleotide selected from the sequences having an odd numbered sequence identifier of SEQ ID NOs: 303 to 599 or 635 to 667, wherein: (i) the polynucleotide does not comprise SEQ ID NO: 301; and (ii) the polynucleotide does not encode a dsRNA ligase polypeptide having the amino acid sequence of SEQ ID NO: 302.
In some embodiments, the polynucleotide encodes an engineered dsRNA ligase polypeptide described herein and comprises a nucleic acid sequence having at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity to a reference polynucleotide selected from the sequences having an odd numbered sequence identifier of SEQ ID NOs: 3 to 299 or 601 to 633, wherein: (i) the polynucleotide does not comprise SEQ ID NO: 1; and (ii) the polynucleotide does not encode a dsRNA ligase polypeptide having the amino acid sequence of SEQ ID NO: 2. It will be readily understood that polynucleotides having an odd numbered sequence identifier of SEQ ID NOs: 3 to 299 or 601 to 633 encode an engineered dsRNA ligase polypeptide comprising an N-terminal purification tag (SEQ ID NO: 669).
The isolated polynucleotides encoding engineered dsRNA ligase polypeptides can be manipulated to enable the expression of the engineered polypeptides in a variety of ways, which may comprise further modification of the sequences by codon optimization to improve expression, insertion into suitable expression elements with or without additional control sequences, and transformation into a host cell suitable for expression and production of the engineered polypeptides.
Depending on the expression vector, manipulation of the isolated polynucleotide prior to insertion of the isolated polynucleotide into the vector may be desirable or necessary. Techniques for modifying polynucleotides and nucleic acid sequences using recombinant
DNA methods are well known in the art. Guidance is provided below: Sambrook et al., 2001, Molecular Cloning: A Laboratory Manual, Third Edition, Cold Spring Harbor Laboratory Press; and Current Protocols in Molecular Biology, Ausubel. F. Eds., Greene Pub. Associates, 1998, 2010 Year update.
The disclosure also provides an expression vector comprising the polynucleotide described herein. In some embodiments, the vector is selected from a plasmid, a cosmid, a bacteriophage, or a viral vector. Recombinant expression vectors typically comprise one or more expression regulatory regions, such as promoters and terminators, origin of replication and the like.
Polynucleotides encoding an engineered dsRNA ligase polypeptide described herein can be expressed by inserting the polynucleotide or the nucleic acid construct comprising the polynucleotide sequence into an appropriate expression vector. In generating the expression vector, the coding sequence is located in the vector such that the coding sequence is linked to a suitable control sequence for expression. The recombinant expression vector can be any vector (e.g. , a plasmid or virus) that can be conveniently used in recombinant DNA procedures and can result in the expression of a polynucleotide sequence. The choice of vector will generally depend on the compatibility of the vector with the host cell to be introduced into. The vector can be linear or closed circular plasmid. The expression vector may be an autonomously replicating vector, i. e. , a vector that exists as an extrachromosomal entity whose replication is independent of chromosomal replication, such as a plasmid, extrachromosomal element, minichromosome, or artificial chromosome. The vector may contain any tools for ensuring self-copying. Alternatively, the vector may be a vector that, when introduced into a host cell, integrates into the genome and replicates with the chromosome into which it is integrated. Moreover, a single vector or plasmid or two or more vectors or plasmids that together comprise the total DNA to be introduced into the genome of the host cell may be used.
Many expression vectors useful to the embodiments of the present disclosure are commercially available. An exemplary expression vector can be prepared by inserting a polynucleotide encoding an engineered dsRNA ligase polypeptide to plasmid pACYC-Duet-1 (Novagen), pBR322 Vector (New England Biolabs), pUC19 Vector (New England Biolabs) or pET T7 Expression Vectors (Novagen).
The disclosure also provides a host cell capable of expressing an engineered dsRNA ligase polypeptide described herein. In some embodiments, the host cell comprises the
nucleic acid molecule described herein, or the vector described herein. In some embodiments, the host cell is Escherichia coli.
In some embodiments, the polynucleotide encoding the polypeptide is linked to one or more control sequences for expression of polypeptides in the host cell. Host cells for expression of polypeptides encoded by the expression vectors of the present disclosure are well known in the art, including, but not limited to, bacterial cells such as E. coli, Streptomyces, and Salmonella typhimurium,' fungal cells (e.g., Saccharomyces cerevisiae or Pichia pastoris).' insect cells such as Drosophila S2 and Spodoptera Sf9; animal cells such as CHO, COS, BHK, 293 and Bowes melanoma cells; and plant cells. An exemplary host cell is E. coli BL21 (DE3). The host cell may be wild-type or may be engineered through genomic editing. Suitable media and growth conditions for the above host cells are well known in the art.
Polynucleotides or vectors used to express polypeptides can be introduced into cells by a variety of methods known in the art. Techniques comprise, among others, electroporation, bio-particle bombardment, liposome-mediated transfection, calcium chloride transfection, and protoplast fusion. Different methods of introducing polynucleotides into cells are known to those skilled in the art.
The host cell may be used to express and isolate the polypeptide described herein.
Process of producing an engineered dsRNA ligase polypeptide
Engineered dsRNA ligase can be obtained by subjecting a polynucleotide encoding an dsRNA ligase to mutagenesis and/or directed evolution. An exemplary directional evolution technique can be found in "Biocatalysis for the Pharmaceutical Industry: Discovery, Development, and Manufacturing" (2009 John Wiley &Sons Asia (Pte) Ltd. ISBN: 978-0- 470-82314-9).
When the sequence of an engineered polypeptide is known, the encoding polynucleotide may be prepared by standard solid-phase methods according to known synthetic methods. In some embodiments, fragments of up to about 100 bases can be synthesized separately and then ligated (e.g., by enzymatic or chemical ligation methods or polymerase-mediated methods) to form any desired contiguous sequence. For example, the polynucleotides and oligonucleotides of the present disclosure can be prepared by chemical synthesis using, for example, the classic phosphoramidite methods described by Beaucage et al., 1981, Tet Lett 22: 1859-69, or Matthes et al. People, 1984, EMBO J. 3: 801-05, as typically practiced in automated synthesis methods. According to the phosphoramidite
method, oligonucleotides are synthesized, purified, annealed, ligated, and cloned into a suitable vector, for example, in an automated DNA synthesizer. In addition, essentially any nucleic acid is available from any of a variety of commercial sources.
The disclosure provides a method of preparing an engineered dsRNA ligase polypeptide, which comprises the steps of culturing a host cell described herein and obtaining an engineered dsRNA ligase polypeptide from the culture. In some embodiments, the process of preparing a polypeptide further comprises isolating the polypeptide. Engineered polypeptides may be expressed in suitable cells and isolated (or recovered) from the host cell and/or culture medium using any one or more of the well-known techniques for protein purification, the techniques for protein purification include, among others, lysozyme treatment, sonication, filtration, salting out, ultracentrifiigation and chromatography.
The invention also provides an engineered dsRNA ligase catalyst obtainable by culturing a host cell described herein, or from the method of preparing an engineered dsRNA ligase polypeptide described herein, wherein said engineered dsRNA ligase catalyst comprises cells or culture fluid containing the engineered dsRNA ligase polypeptides, or an article processed therewith, wherein the article refers to an extract obtained from the culture of host cells, an isolated product obtained by isolating or purifying an engineered dsRNA ligase from the extract, or an immobilized product obtained by immobilizing host cells, an extract thereof, or isolated product of the extract.
Ligation reactions
The disclosure provides a method of producing an oligonucleotide from two or more oligonucleotide fragments, wherein the method comprises contacting: (i) two or more oligonucleotide fragments; (ii) an engineered dsRNA ligase polypeptide disclosed herein; (iii) a source of ATP; and (iv) a divalent cation; to obtain an oligonucleotide.
Oligonucleotide products and fragments
Methods of the invention produce oligonucleotides by ligating two or more oligonucleotide fragments. The produced oligonucleotides (also referred to herein as “oligonucleotide products”) are nucleic acids which typically comprise up to 100 nucleotides. It will be understood that oligonucleotides described herein comprise RNA. It will also be understood that oligonucleotides described herein comprise a double-stranded region.
As used herein, “oligonucleotide fragment” refers to a nucleic acid that can be ligated to one or more additional oligonucleotide fragments to provide an oligonucleotide product. Each oligonucleotide fragment corresponds to a portion of the oligonucleotide product.
In some embodiments, the oligonucleotide is a therapeutic oligonucleotide. In some embodiments, the therapeutic oligonucleotide is a small interfering RNA (siRNA) or an antisense oligonucleotide (ASO). In some embodiments, the oligonucleotide is an aptamer.
In some embodiments, the oligonucleotide comprises an overhang. In some embodiments, the oligonucleotide comprises a 3’ overhang. In some embodiments, the oligonucleotide comprises a 5’ overhang. In some embodiments, the overhang comprises 1, 2, 3, 4, 5, 6, 7, or 8 nucleotides. In some embodiments, the oligonucleotide comprises a blunt end. In some embodiments, the oligonucleotide comprises two blunt ends.
In some embodiments, the oligonucleotide is up to 20 nucleotides in length. In some embodiments, the oligonucleotide is up to 25, up to 30, up to 35, up to 40, up to 45, up to 50, up to 55, up to 60, up to 65, up to 70, up to 75, up to 80, up to 85, up to 90, up to 95, or up to 100 nucleotides in length. In some embodiments, the oligonucleotide is up to 60 nucleotides in length.
In some embodiments, the oligonucleotide is at least 20 nucleotides in length. In some embodiments, the oligonucleotide is at least 25, at least 30, at least 35, at least 40, at least 45, at least 50, at least 55, at least 60, at least 65, at least 70, at least 75, at least 80, at least 85, at least 90, at least 95, or 100 nucleotides in length.
In some embodiments, the oligonucleotide is 10-100 nucleotides in length. In some embodiments, the oligonucleotide is 10-80, 10-70, 10-60, 10-50, 10-40, 10-30, 10-25, 15-80, 15-70, 15-60, 15-50, 15-40, 15-30, or 15-25 nucleotides in length. In some embodiments, the oligonucleotide is 15-25 nucleotides in length.
As used herein, the two or more oligonucleotide fragments comprise one or more 3’ oligonucleotide fragments and one or more 5 ’ oligonucleotide fragments, wherein each of the one or more 3’ oligonucleotide fragments comprise a 5 ’-phosphate group and each of the one or more 5’ oligonucleotide fragments typically comprise a 3’ terminal ribonucleotide having a 3 ’-hydroxyl group.
In some embodiments, one or more of the oligonucleotide fragments comprises one or more mismatches. In some embodiments, one or more of the oligonucleotide fragments comprise an overhang. In some embodiments, one or more of the oligonucleotide fragments comprise a 3’ overhang. In some embodiments, one or more of the oligonucleotide fragments comprise a 5’ overhang. In some embodiments, one or more of the oligonucleotide fragments
comprise a 3’ overhang and a 5’ overhang. In some embodiments, the overhang comprises 1, 2, 3, 4, 5, 6, 7, or 8 nucleotides.
In some embodiments, the two or more oligonucleotide fragments comprise a first oligonucleotide fragment having an overhang that is complementary to the overhang of a second oligonucleotide fragment. In some embodiments, the two or more oligonucleotide fragments comprise a first oligonucleotide fragment having a 3 ’ overhang and a 5 ’ overhang, wherein the 3’ overhang is complementary to the 5’ overhang of a second oligonucleotide fragment and the 5’ overhang is complementary to the 3’ overhang of a third oligonucleotide.
In some embodiments, one or more of the oligonucleotide fragments comprise a blunt end. In some embodiments, one or more of the oligonucleotide fragments comprise a 3’ overhang and a 5’ blunt end. In some embodiments, one or more of the oligonucleotide fragments comprise a 5’ overhang and a 3’ blunt end. In some embodiments, the 5’ terminal oligonucleotide fragment comprises a 3’ overhang and a 5’ blunt end. In some embodiments, the 3’ terminal oligonucleotide fragment comprise a 5’ overhang and a 3’ blunt end.
In some embodiments, two or more of the oligonucleotide fragments comprise two or more RNA oligonucleotide fragments. In some embodiments, the two or more RNA oligonucleotide fragments comprise double-stranded RNA (dsRNA) oligonucleotide fragments.
In some embodiments, one or more of the oligonucleotide fragments comprise DNA and RNA. For example, a portion of the oligonucleotide fragment may be double-stranded DNA, while another portion is double-stranded RNA, forming a DNA-RNA chimera.
In some embodiments, one or more of the oligonucleotide fragments comprise one or two strands which are RNA, or a mixture of RNA, DNA, LNA, Morpholino, UNA (unlocked nucleic acid), TNA (threose nucleic acid), GNA (glycol nucleic acid), and/or FANA (Fluoroarabino nucleic acid), modified RNA, etc. As a non-limiting example, one or both strand(s) could be, for example, RNA except that one or more nucleotide(s) is replaced by DNA, LNA, Morpholino, UNA, TNA, GNA, and/or FANA, and/or modified RNA (e.g., any modified RNA disclosed herein or known in the art, such as 2’ modified RNA, including but not limited to 2’-F, 2’-0Me, 2’-0-M0E RNA, etc.).
In some embodiments, the two or more oligonucleotide fragments are the same length. In some embodiments, the two or more oligonucleotide fragments are different lengths. In some embodiments, each of the two or more oligonucleotide fragments are 3-20 nucleotides in length. In some embodiments, each of the two or more oligonucleotide fragments are 4-16 nucleotides in length. In some embodiments, each of the two or more
oligonucleotide fragments are 4-16, 4-15, 5-15, 6-15, 4-14, 4-13, 4-12, 4-11, 4-10, 4-9, 5-9, or 6-9 nucleotides in length.
In some embodiments, each of the two or more oligonucleotide fragments are at least 3 nucleotides in length. In some embodiments, each of the two or more oligonucleotide fragments are at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, or at least 15 nucleotides in length.
In some embodiments, the two or more oligonucleotide fragments comprise 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, 9 or more, or 10 or more oligonucleotide fragments.
In some embodiments, one or more ligation reactions are required to generate the oligonucleotide product. In some embodiments, 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, 9 or more, or 10 or more ligation reactions are required to generate the oligonucleotide product.
In some embodiments, one or more of the oligonucleotide fragments and/or the oligonucleotide comprises a chemical modification. In some embodiments, one or more of the oligonucleotide fragments and/or the oligonucleotide comprises at least one modified backbone modification. In some embodiments, one or more of the oligonucleotide fragments and/or the oligonucleotide comprises at least one modified nucleotide modification. In some embodiments, one or more of the oligonucleotide fragments and/or the oligonucleotide comprises at least one sugar modification (e.g. at the 2’-position or 4’-position). In some embodiments, one or more of the oligonucleotide fragments and/or the oligonucleotide comprises: (i) at least one modified backbone modification; (ii) and at least one modified nucleotide modification; and/or (iii) at least one sugar modification.
In some embodiments, one or more of the oligonucleotide fragments and/or the oligonucleotide comprise a modification selected from the group consisting of: 2'-O-methyl (2’-OMe), 2'-flouro (2’-F), 2'-deoxy, 2'-deoxy-2’-fluoro, 2'-O-methoxyethyl (2'-O-MOE), 2'- O-aminopropyl (2'-O-AP), 2'-O-dimethylaminoethyl (2'-O-DMAOE), 2'-O- dimethylaminopropyl (2'-O-DMAP), 2'-O-dimethylaminoethyloxyethyl (2'-O-DMAEOE), 2'- O-N-methylacetamido (2'-0-NMA), locked nucleic acid (LNA), glycol nucleic acid (GNA), phosphoramidate (e.g. mesyl phosphoramidate), 2',3'-seco nucleotide mimic, 2'-F-arabino nucleotide, abasic nucleotide, 2'-amino modified nucleotide, 2'-alkyl-modified nucleotide, morpholino nucleotide, vinylpho sphonate (e.g. 5’ vinylphosphonate), and cyclopropyl phosphonate deoxyribonucleotide. In some embodiments, one or more of the oligonucleotide
fragments and/or the oligonucleotide comprises a 2'-modification selected from the group consisting of: 2’-0Me, 2’-F, and 2'-deoxy.
In some embodiments, one or more of the oligonucleotide fragments and/or the oligonucleotide comprises at least one phosphorothioate or methylphosphonate intemucleotide linkage. In some embodiments, the oligonucleotide comprises at least one chiral phosphorothioate linkage.
In some embodiments, one or more of the oligonucleotide fragments and/or the oligonucleotide is conjugated to at least one ligand. The ligand may be conjugated to the sense strand, antisense strand or both strands, in any configuration e.g. at the 3 ’-end, 5 ’-end, non-end or a combination.
In some embodiments, the ligand comprises one or more N-Acetylgalactosamine (GalNAc) derivatives. GalNAc is an amino sugar derivative of galactose which may be used as a targeting ligand in oligonucleotides intended for targeting to the liver, where it binds to the asialoglycoprotein receptors on hepatocytes. In some embodiments, the ligand comprises one or more GalNAc derivatives conjugated through a bivalent or trivalent branched carrier. In some embodiments, the ligand is a peptide or a peptidomimetic.
In some embodiments, the ligand is conjugated to the sense strand. In some embodiments, the ligand is conjugated to the 3’ end of the sense strand. In some embodiments, the ligand is conjugated to the 5’ end of the sense strand. In some embodiments, the ligand is conjugated to a non-end of the sense strand.
In some embodiments, the ligand is conjugated to the antisense strand. In some embodiments, the ligand is conjugated to the 3’ end of the antisense strand. In some embodiments, the ligand is conjugated to a non-end of the antisense strand.
In some embodiments, the oligonucleotide is an RNAi agent comprising at least one 2’-modified nucleotide selected from a group consisting of 2’-0Me, 2’-F, 2'-deoxy, 2'-deoxy- 2’-fluoro, and 2'-0-M0E. In some embodiments, the oligonucleotide is an RNAi agent wherein the sense strand is conjugated to one or more GalNAc ligand(s). In some embodiments, one or more of the oligonucleotide fragments comprises at least one 2’- modified nucleotide selected from a group consisting of 2’-0Me, 2’-F, 2'-deoxy, 2'-deoxy-2’- fluoro, and 2'-0-M0E. In some embodiments, one or more of the oligonucleotide fragments is a dsRNA wherein the sense strand is conjugated to one or more GalNAc ligand(s).
In some embodiments, the method is performed with an oligonucleotide fragment concentration of at least 1 mM, at least 2 mM, at least 3 mM, at least 4 mM, at least 5 mM, at least 6 mM, at least 7 mM, at least 8 mM, at least 9 mM, or at least 10 mM. In some
embodiments, the method is performed with at least 1 mM, at least 2 mM, at least 3 mM, at least 4 mM, at least 5 mM, at least 6 mM, at least 7 mM, at least 8 mM, at least 9 mM, or at least 10 mM of each oligonucleotide fragment. In some embodiments, the method is performed with equimolar amounts of each of the two or more oligonucleotide fragments.
In some embodiments, the method produces at least 15 g of oligonucleotide product per litre of reaction mixture. In some embodiments, the method produces at least 16 g, at least 17 g, at least 18 g, at least 19 g, at least 20 g, at least 30 g, at least 40 g, at least 50 g, at least 60 g, at least 70 g, at least 80 g, at least 90, or at least 100 g of oligonucleotide product per litre of reaction mixture.
Engineered dsRNA ligase polypeptide
The method is performed using an engineered dsRNA ligase as described herein.
In some embodiments, the method is performed using about 1 g/L engineered dsRNA ligase polypeptide, optionally 1.1 g/L, 1.15 g/L, 1.2 g/L, 1.25 g/L, 1.3 g/L, 1.35 g/L, 1.4 g/L, 1.45 g/L, 1.5 g/L, 1.55 g/L, 1.6 g/L, 1.65 g/L, 1.7 g/L, 1.75 g/L, 1.8 g/L, 1.85 g/L, 1.9 g/L, 1.95 g/L, 2 g/L, 2.1 g/L, 2.2 g/L, 2.3 g/L, 2.4 g/L, 2.5 g/L, 2.6 g/L, 2.7 g/L, 2.8 g/L, 2.9 g/L, 3 g/L, 3.25 g/L, 3.5 g/L, 3.75 g/L, 4 g/L, 4.5 g/L or 5 g/L engineered dsRNA ligase polypeptide.
Source of ATP
The enzymatic activity of dsRNA ligase requires ATP as a cofactor. One molecule of ATP is converted to AMP per ligation reaction. The catalytic mechanism of dsRNA ligase and the role of ATP in nucleic acid ligation reactions are described above.
In some embodiments, the source of ATP is ATP. In some embodiments, the method is performed using a stoichiometric concentration of ATP. In some embodiments, the method is performed using a stoichiometric excess of ATP. The skilled person can readily determine the stoichiometric concentration of ATP required for a given ligation based on the concentration of the oligonucleotide fragments and the number of ligation reactions required to produce the oligonucleotide product.
In some embodiments, the method is performed using an ATP and/or AMP concentration of about 0.5 mM, about 1 mM, about 2 mM, about 3 mM, about 4 mM, about 5 mM, about 6 mM, about 7 mM, about 8 mM, about 9 mM, about 10 mM, about 12 mM,
about 14 mM, about 16 mM, about 18 mM, about 20 mM, about 22 mM, about 24 mM, about 26 mM, about 28mM or about 30mM.
In some embodiments, the source of ATP is an ATP regeneration system. In some embodiments, the ATP regeneration system comprises: (a) polyphosphate kinase (PPK); (b) polyphosphate; and (c) AMP and/or ATP. Advantageously, the use of an ATP regeneration system overcomes the requirement for high concentrations of ATP to achieve complete ligation. The ATP regeneration system described herein comprises PPK and polyphosphate. PPK generates ATP from AMP using polyphosphate as a phosphate donor. ATP that is converted to AMP during the ligation reaction can be regenerated to ATP by PPK and used as a cofactor in a subsequent ligation reaction. This cycling of ATP obviates the need for high ATP concentration in the starting reaction. Instead, the reaction can be performed using sub- stoichiometric concentrations of ATP, and/or using the cheaper alternative, AMP.
“Polyphosphate kinases” or “PPKs” are a family of enzymes which catalyze the formation of ATP from AMP and polyphosphate.
In some embodiments, the PPK is PPK12. In some embodiments, the PPK comprises an amino acid sequence having at least 70% sequence identity to SEQ ID NO: 670, which is the amino acid sequence of PPK12. In some embodiments, the PPK comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 670.
In some embodiments, the PPK comprises an amino acid sequence having at least 70% sequence identity to SEQ ID NO: 671, which is the amino acid sequence of an optimized PPK12. In some embodiments, the PPK comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 671.
In some embodiments, the PPK is Acinetobacter johnsonii polyphosphate: AMP phosphotransferase (AjPAP) (UniProt ID: Q83XD3). In some embodiments, the PPK comprises an amino acid sequence having at least 70% sequence identity to SEQ ID NO: 672, which is the amino acid sequence of AjPAP. In some embodiments, the PPK comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least
94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 672.
In some embodiments, the PPK is used in the form of whole cell, crude extract (e.g. cell free lyophilized extract or cell lysates), isolated polypeptide, or purified polypeptide. In some embodiments, the PPK polypeptide is used in an immobilized form as described herein, such as immobilized on a resin.
In some embodiments, the method is performed using about 1 g/L PPK, optionally 1.1 g/L, 1.15 g/L, 1.2 g/L, 1.25 g/L, 1.3 g/L, 1.35 g/L, 1.4 g/L, 1.45 g/L, 1.5 g/L, 1.55 g/L, 1.6 g/L, 1.65 g/L, 1.7 g/L, 1.75 g/L, 1.8 g/L, 1.85 g/L, 1.9 g/L, 1.95 g/L, 2 g/L, 2.1 g/L, 2.2 g/L, 2.3 g/L, 2.4 g/L, 2.5 g/L, 2.6 g/L, 2.7 g/L, 2.8 g/L, 2.9 g/L, 3 g/L, 3.25 g/L, 3.5 g/L, 3.75 g/L, 4 g/L, 4.5 g/L or 5 g/L PPK.
In some embodiments, the polyphosphate is a polyphosphate salt. In some embodiments, the polyphosphate salt is sodium polyphosphate (Maddrell’s salt) or sodium hexametaphosphate (Graham’s salt).
In some embodiments, the method is performed using a stoichiometric excess of polyphosphate. In some embodiments, the method is performed using a polyphosphate concentration of at least 5 mM, at least 10 mM, at least 15 mM, at least 20 mM, at least 25 mM, at least 30 mM, at least 35 mM, at least 40 mM, at least 45 mM, at least 50 mM, 55 mM, at least 60 mM, at least 65 mM, at least 70 mM, at least 75 mM, at least 80 mM, at least 85 mM, at least 90 mM, at least 95 mM, or at least 100 mM.
In some embodiments, wherein the method is performed in the presence of PPK and polyphosphate, the method is performed in the presence of AMP. In some embodiments, wherein the method is performed in the presence of PPK and polyphosphate, the method is performed using a sub-stoichiometric concentration of ATP and/or AMP.
Divalent cation
The enzymatic activity of dsRNA ligases requires the presence of a divalent cation. The enzymatic activity of PPKs requires the presence of a divalent cation. In some embodiments, the divalent cation comprises Mg2+ and/or Mn2+.
In some embodiments, the method is performed with a divalent cation concentration of 5-100 mM, 10-100 mM, 15-100 mM, 20-100 mM, 30-100 mM, 5-90 mM, 5-80 mM, 5-70 mM, 5-60 mM, 5-50 mM, or 30-50 mM. In some embodiments, the method is performed with a divalent cation concentration of at least 5 mM, at least 10 mM, at least 15 mM, at least 20 mM, at least 25 mM, at least 30 mM, at least 35 mM, at least 40 mM, at least 45 mM, at
least 50 mM, 55 mM, at least 60 mM, at least 65 mM, at least 70 mM, at least 75 mM, at least 80 mM, at least 85 mM, at least 90 mM, at least 95 mM, or at least 100 mM.
In some embodiments, the method further comprises purifying the oligonucleotide product from the reaction mixture. In some embodiments, the oligonucleotide product is at least 80% pure, optionally wherein the oligonucleotide product is at least 85%, at least 90%, at least 95% pure, optionally wherein the oligonucleotide product is at least 98% pure, optionally wherein the oligonucleotide product is at least 99% pure, optionally wherein the oligonucleotide product is at least 99.5% pure, optionally wherein the oligonucleotide product is at least 99.9% pure. An oligonucleotide product that is pure does not contain oligonucleotide fragments, intermediate ligation products, or side products arising from nonspecific ligation. The oligonucleotide product may be purified or isolated using any method known in the art, for example using gel extractions or using cellulose-based matrices.
The disclosure also provides an oligonucleotide produced by a method described herein. The oligonucleotide may be in any suitable buffer solution. In some embodiments, the buffer solution is selected from Tris buffer (e.g. Tris-HCl), phosphate buffer, HEPES, MOPS (3-(A-morpholino)propancsulfonic acid), and triethanolamine (TEOA) buffer. In some embodiments, the buffer solution comprises acetate, citrate, prolamine, carbonate, or phosphate, or any combination thereof. In some embodiments, the buffer solution further comprises an agent for controlling the osmolarity of the solution, such that the osmolarity is kept at a desired value, e.g., at the physiologic values of the human plasma. Solutes which can be added to the buffer solution to control the osmolarity include, but are not limited to, proteins, peptides, amino acids, non-metabolized polymers, vitamins, ions, sugars, metabolites, organic acids, lipids, or salts. In some embodiments, the agent for controlling the osmolarity of the solution is a salt. In some embodiments, the agent for controlling the osmolarity of the solution is sodium chloride or potassium chloride.
Reaction conditions
As disclosed herein and exemplified in the examples, the present disclosure contemplates a range of suitable reaction conditions that may be used in the methods described herein, including but not limited to pH, temperature, buffers, substrate loadings, enzyme loading, cofactor loading, pressure, and reaction time. Additional suitable reaction conditions for ligation reactions described herein can be readily optimized by routine experimentation, e.g. performing the method described herein under experimental reaction
conditions of varying reagent concentration, pH, temperature, and detecting the rate of oligonucleotide product formation.
In any of the embodiments of the process disclosed herein, the reaction conditions may include a suitable pH. As noted above, the desired pH or desired pH range can be maintained by using an acid or base, a suitable buffer, or a combination of buffer and added acid or base. The pH of the reaction mixture can be controlled before and/or during the reaction. In some embodiments, suitable reaction conditions include a solution pH of about 4 to about 8, a pH of about 5 to about 8, a pH of about 6 to about 8, or a pH of about 7 to about 8. In some embodiments, the reaction conditions include a solution pH of about 4, 4.5, 5, 5.5, 6, 6.5, 7, 7.5 or 8.
In any of the embodiments of the method disclosed herein, suitable temperatures can be used for the reaction conditions, taking into consideration of, for example, the increase in reaction rate at higher temperatures, the activity of the enzyme for sufficient duration of the reaction. Accordingly, in some embodiments, suitable reaction conditions include a temperature of about 10°C to about 60°C, about 10°C to about 50°C, about 25 °C to about 50°C, about 25°C to about 40°C, about 25°C to about 30°C, or about 10°C to about 30°C. In some embodiments, suitable reaction temperatures include a temperature of about 10°C, 15°C, 20°C, 25°C, 30°C, 35°C, 40°C, 45°C, 50°C, 55°C, or 60°C. In some embodiments, the temperature during the enzymatic reaction can be maintained at a certain temperature throughout the reaction. In some embodiments, the temperature during the enzymatic reaction may be adjusted over a temperature profile during the course of the reaction.
The reaction may be performed in any suitable buffer solution. In some embodiments, the buffer solution is selected from Tris buffer (e.g. Tris-HCl), phosphate buffer, HEPES, MOPS (3-(A-morpholino)propanesulfonic acid), and triethanolamine (TEOA) buffer. In some embodiments, the buffer solution comprises acetate, citrate, prolamine, carbonate, or phosphate, or any combination thereof. In some embodiments, the buffer solution is phosphate buffered saline (PBS).
In some embodiments, the reaction mixture further comprises a reducing agent, optionally DTT (Dithiothreitol).
In carrying out the ligation reactions described herein, the engineered dsRNA ligase polypeptide may be added to the reaction mixture in different formulation forms, as frozen or lyophilized whole cells (FWC or LWC) transformed with the gene encoding the engineered dsRNA ligase polypeptide and/or as cell lysate or lyophilized cell lysate of such cells, so called shake flask powder (SFP), where the cell debris was removed and/or further purified as
fermentation powder (FP). Whole cells transformed with gene(s) encoding the engineered dsRNA ligase polypeptide or cell extracts, lysates thereof, and isolated enzymes can be used in a wide variety of different forms, including solids (e.g., lyophilized, spray dried, or the like) or semisolid (e.g., a crude paste). The cell extract or cell lysate may be partially purified by precipitation (e.g. , ammonium sulfate, polyethyleneimine, heat treatment or the like), followed by desalting procedures (e.g., ultrafiltration, dialysis, and the like) prior to lyophilization. Any of the enzyme preparations can be immobilized to a solid phase material (such as a resin).
In any of the embodiments of the process disclosed herein, wherein an engineered polypeptide is expressed in the form of a secreted polypeptide, a culture medium containing the secreted polypeptide can be used in the process herein.
In any of the embodiments of the process disclosed herein, the solid reactants (e.g. , enzymes, salts, etc.) can be provided to the reaction in a variety of different forms, including powders (e.g., lyophilized, spray dried, etc.), solutions, emulsions, suspensions, and the like. The reactants can be readily lyophilized or spray-dried using methods and instrumentation known to one skilled in the art. For example, the protein solution can be frozen at -80 °C in small aliquots, and then added to the pre-chilled lyophilization chamber, followed by the application of a vacuum.
In any of the embodiments of the process disclosed herein, the order of addition of reactants is not critical. The reactants may be added together to the solvent at the same time or alternatively, some reactants may be added separately, and some may be added together at different time points.
The methods of performing a ligation reaction may comprise the further step of isolating the oligonucleotide product of the enzymatic reaction. In particular, this step is typically performed after completion of the enzymatic reaction. The oligonucleotide is in particular typically separated from one or more, in particular essentially all of the other components of the reaction mixture. For example, the oligonucleotide is typically separated from the remaining substrate, side products, and/or enzymes. Isolation of the oligonucleotide may be achieved by means and techniques known in the art, e.g. by separating oligonucleotides based on their size such as by gel electrophoresis and gel extractions or using cellulose-based matrices. In some embodiments, the method further comprises purifying the oligonucleotide by ultrafiltration and chromatography.
Modifications
In some embodiments, the oligonucleotide fragment(s) and/or the oligonucleotide comprises a modification, e.g. a chemical modification. As used herein, the term “oligonucleotide fragment(s)” means one or more oligonucleotide fragments. It will be appreciated that modifications which are present in the oligonucleotide fragment(s) are typically present in the oligonucleotide produced from said oligonucleotide fragment(s). In some embodiments, modification(s) are introduced to and/or removed from the oligonucleotide product.
In some embodiments, the oligonucleotide fragment(s) and/or oligonucleotide comprises a chemical modification. In some embodiments, the oligonucleotide fragment(s) and/or oligonucleotide comprises at least one backbone modification. In some embodiments, the oligonucleotide fragment(s) and/or oligonucleotide comprises at least one nucleotide modification. In some embodiments, the oligonucleotide fragment(s) and/or oligonucleotide comprises at least one sugar modification (e.g. at the 2’-position or 4’-position). In some embodiments, the oligonucleotide fragment(s) and/or oligonucleotide comprises: (i) at least one backbone modification; (ii) at least one nucleotide modification; and/or (iii) at least one sugar modification.
Modifications include, but are not limited to, end modifications of the terminal oligonucleotide fragments, e.g., 5 ’-end modifications (phosphorylation, conjugation, inverted linkages) or 3’-end modifications (conjugation, inverted linkages, etc.); base modifications, e.g. , replacement with stabilizing bases, destabilizing bases, or bases that base pair with an expanded repertoire of partners, removal of bases (abasic nucleotides), or conjugated bases; sugar modifications (e.g., at the 2’-position or 4’-position) or replacement of the sugar; or backbone modifications, including modification or replacement of the phosphodiester linkages.
In some embodiments, a terminal oligonucleotide fragment and/or oligonucleotide comprises a cap. The term "cap” and the like include a chemical moiety attached to the end of a double-stranded nucleotide duplex, but is used herein to exclude a chemical moiety that is a nucleotide or nucleoside. A “3’ cap” is attached at the 3’ end of a nucleotide or oligonucleotide and protects the molecule from degradation, e.g., from nucleases, such as those in blood serum or intestinal fluid. A non-nucleotidic 3’ cap is not a nucleotide and can replace a TT or UU dinucleotide at the end of a blunt-ended oligonucleotide. In some embodiments, non-nucleotidic 3’ end caps are as disclosed in, for example, WO 2005/021749 and WO 2007/128477; and U.S. Pat. No. 8,097,716; U.S. Pat. No. 8,084,600; and U.S. Pat.
No. 8,344,128. A “5’ cap” is atached at the 5’ end of a nucleotide or oligonucleotide. A cap should not interfere (or unduly interfere) with oligonucleotide activity.
In some embodiments, the oligonucleotide fragment(s) and/or oligonucleotide comprises one or more mismatches. A mismatch is defined herein as a difference between the base sequence or length when two sequences are maximally aligned and compared. In the context of double -stranded oligonucleotides (in which two sequences are aligned antiparallel to each other) a mismatch is defined as a position wherein the base of one sequence is not complementary to the base of the other sequence. Thus, a mismatch is counted, for example, if a position in the first sequence has a particular base (e.g., A), and the corresponding position in the second sequence has a base which is not complementary to said base in the first sequence (e.g., G), when the first and second sequences are aligned antiparallel to each other. Note, however, that on a given RNA strand, a U can be replaced by T (either as RNA or, preferably, DNA, e.g., 2 ’-deoxy-thymidine); the replacement of a U with a T is not a mismatch as used herein, as either U or T can pair with A on the opposite strand. An RNA oligonucleotide can thus comprise one or more DNA bases, e.g., T. No mismatch is counted between a DNA portion(s) of an RNAi agent and the corresponding target mRNA if basepairing occurs (e.g., between A, G, C, or T in the DNA portion, and the corresponding U, C, G, or A, respectively in the mRNA).
A mismatch is also counted, e.g., if a position in one sequence has a base (e.g., A), and the corresponding position on the other sequence has no base (e.g., that position is an abasic nucleotide, which comprises a phosphate-sugar backbone but no base). A singlestranded nick in either sequence (or in the sense or anti-sense strand) is not counted as mismatch. Thus, as a non-limiting example, no mismatch would be counted if one sequence (in the 5 ’->3’ orientation) comprises the sequence AG, but the complementary sequence (in the 3 ’->5’ orientation) comprises the sequence TC with a single-stranded nick between the T and the C. A nucleotide modification in the sugar or phosphate is also not considered a mismatch. Thus, if one sequence comprises a G, and the complementary sequence comprises a modified C (e.g., 2 ’-modification) at the same position, no mismatch would be counted.
Thus, no mismatches are counted if modifications are made to the sugar, phosphate, or backbone of the oligonucleotide without modifying the base. Thus, in the context of double-stranded RNAi, a strand having a given sequence as an RNA would have zero mismatches from its complement sequence as a PNA; or morpholino; or LNA; or TNA; or GNA; or FANA; or a mix or chimera of RNA and DNA, TNA, GNA, FANA, Morpholino, UNA, LNA, and/or PNA, etc. No mismatch would occur between a nucleotide which is T,
and a nucleotide which is A with a 5’ modification and/or a 2 ’-modification. The key feature of a mismatch (base replacement) is that it would not be able to base-pair with the corresponding base on the opposite strand. In addition, terminal overhangs such as “UU” or “dTdT” are not counted when counting the number of mismatches. In such cases, a mismatch is defined as a position wherein the base of one sequence does not match the base of the other sequence.
It is noted that dTdT (2'-deoxy-thymidine-5 ’-phosphate and 2'-deoxy-thymidine-5’- phosphate), or in some cases, TT or UU, can be added as a terminal dinucleotide cap or extension to one or both 3 ’-ends of the oligonucleotide, but this cap or extension is not included in the calculation of the total number of mismatches and is not considered part of the target sequence. This is because the terminal dinucleotide protects the ends from nuclease degradation but does not contribute to target specificity (Elbashir et al. 2001 Nature 411: 494- 498; Elbashir et al. 2001 EMBO J. 20: 6877-6888; and Kraynack et al. 2006 RNA 12: 163- 176).
There are several examples in the art describing sugar, base, phosphate and backbone modifications that can be introduced into nucleic acid molecules with significant enhancement in their nuclease stability and efficacy. For example, oligonucleotides are modified to enhance stability and/or enhance biological activity by modification with nuclease resistant groups, for example, 2'-amino, 2'-C-allyl, 2'-flouro, 2'-O-methyl, 2'-O-allyl, 2'-H, nucleotide base modifications. Sugar modification of nucleic acid molecules are extensively described in the art.
Additional modifications and conjugations of oligonucleotides have been described. Soutschek et al. 2004 Nature 432: 173-178 presented conjugation of cholesterol to the 3’-end of the sense strand of an siRNA molecule by means of a pyrrolidine linker, thereby generating a covalent and irreversible conjugate. Chemical modifications (including conjugation with other molecules) of oligonucleotides may also be made to improve the in vivo pharmacokinetic retention time and efficiency.
In some embodiments, the oligonucleotide fragment(s) and/or oligonucleotide comprises a modified base. The disclosure encompasses an oligonucleotide and oligonucleotide fragments with a substitution of a single nucleotide at a given position with a modified version of the same nucleotide. Thus a nucleotide (A, G, C or U) can be replaced by a modified base selected from 5-fluorouracil, 5 -bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xantine, 4-acetylcytosine, 5 -(carboxyhydroxylmethyl) uracil, 5- carboxymethylaminomethyl-2 -thiouridine, 5-carboxymethylaminomethyluracil,
dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1-methylguanine, 1 -methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3 -methylcytosine, 5 -methylcytosine, N6-adenine, 7-methylguanine, 5 -methylaminomethyluracil, 5- methoxyaminomethyl-2 -thiouracil, beta-D-mannosylqueosine, 5'- methoxy carboxymethyluracil, 5 -methoxyuracil, 2-methylthio-N6-isopentenyladenine, uracil - 5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5-methyl-2- thiouracil, 2-thiouracil, 4-thiouracil, 5 -methyluracil, uracil-5- oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2 -thiouracil, 3-(3-amino-3-N-2 -carboxypropyl) uracil, 2,6-diaminopurine, 5 -hydroxymethyl cytosine, 2-aminoadenine, 6-methyl and other alkyl derivatives of adenine and guanine, 2-propyl and other alkyl derivatives of adenine and guanine, 2-thiothymine, 5-propynyl ( — C=C — CHs) uracil and cytosine and other alkynyl derivatives of pyrimidine bases, 6-azo uracil, cytosine and thymine, 8-halo, 8-amino, 8-thiol, 8-thioalkyl, 8-hydroxyl and other 8-substituted adenines and guanines, 5-halo particularly 5- bromo, 5 -trifluoromethyl and other 5-substituted uracils and cytosines, 7-methyladenine, 2-F- adenine, 8-azaguanine and 8-azaadenine, 7-deazaguanine and 7-deazaadenine and 3- deazaguanine and 3 -deazaadenine..
Additional modified variants include the addition of any other moiety (e.g., a radiolabel or other tag or conjugate) to the oligonucleotide or oligonucleotide fragment; provided that the base sequence is identical, the addition of other moieties produces a “modified variant” (with no mismatches).
In addition to these modifications and patterns (e.g., formats) for modifications, other modifications or sets of modifications of the sequences provided can be generated using common knowledge of nucleic acid modification. These various embodiments and embodiments of the oligonucleotides of the present disclosure can be used in RNA interference.
In some embodiments, the oligonucleotide and/or oligonucleotide fragment(s) comprises a modification that causes the oligonucleotide to have increased stability in a biological sample or environment (e.g., cytoplasm, interstitial fluid, blood serum, lung or intestinal lavage).
In some embodiments, the oligonucleotide and/or oligonucleotide fragment(s) comprises a modification that promotes cleavage by the RNA-induced silencing complex (z. e. a “RISC cleavage site”). The RISC cleavage site is the site on the target at which cleavage occurs. In some embodiments, the antisense strand comprises a RISC cleavage site. For an RNAi agent having a duplex region of 17-23 nucleotide in length, the cleavage site of the
antisense strand is typically around the 10, 11 and 12 positions from the 5’-end. As used herein, the term “cleavage region” refers to a region that is located immediately adjacent to the cleavage site. In some embodiments, the cleavage region comprises three bases on either end of, and immediately adjacent to, the cleavage site. In some embodiments, the cleavage region comprises two bases on either end of, and immediately adjacent to, the cleavage site. In some embodiments, the cleavage site specifically occurs at the site bound by nucleotides 10 and 11 of the antisense strand, and the cleavage region comprises nucleotides 11, 12 and 13 of the antisense strand.
In some embodiments, the oligonucleotide fragment(s) and/or oligonucleotide comprises a modified backbone. As used herein, an unmodified backbone consists of 3’ to 5’ phosphodiester bonds. A modified backbone may comprise non-natural intemucleoside linkages. Oligonucleotides having a modified backbone include those that retain a phosphorus atom in the backbone and those that do not have a phosphorus atom in the backbone.
Oligonucleotide fragments comprising a modified backbone include, but are not limited to, those that do not have a phosphorus atom in the backbone. Modified backbones include, but are not limited to, phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters, aminoalkylphosphotriesters, methyl and other alkyl phosphonates (e.g. 3'-alkylene phosphonates and chiral phosphonates), phosphinates, phosphoramidates (e.g. mesyl phosphoramidate, 3'-amino phosphoramidate and aminoalkylphosphoramidates), thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, and boranophosphates having normal 3 '-5' linkages, 2'-5 '-linked analogs of these, and those having inverted polarity wherein the adjacent pairs of nucleoside units are linked 3’-5* to 5’-3* or 2'-5' to 5’-2*.
Oligonucleotide fragments comprising a modified backbone that does not include a phosphorus atom therein may have backbones that are formed by short chain alkyl or cycloalkyl intemucleoside linkages, mixed heteroatoms and alkyl or cycloalkyl intemucleoside linkages, or one or more short chain heteroatomic or heterocyclic intemucleoside linkages. These include those having morpholino linkages (formed in part from the sugar portion of a nucleoside); siloxane backbones; sulfide, sulfoxide and sulfone backbones; formacetyl and thioformacetyl backbones; methylene formacetyl and thioformacetyl backbones; alkene containing backbones; sulfamate backbones; methyleneimino and methylenehydrazino backbones; sulfonate and sulfonamide backbones; amide backbones; and others having mixed N, O, S and CH2 component parts.
In some embodiments, the oligonucleotide and/or oligonucleotide fragment(s) comprises at least one phosphonate linkage, wherein the phosphonate is a modified phosphonate selected from the group consisting of: phosphorothioate (which may be an Rp isomer or an .S'p isomer):
methylphosphonate :
methoxypropylphosphonate :
5 ’ -methylphosphonate : phonate:
5’-phosphorothioate;
and peptide nucleic acid:
In some embodiments, the oligonucleotide and/or oligonucleotide fragment(s) comprises: at least one 5’-uridine-adenine-3’ (5’-ua-3’) dinucleotide, wherein the uridine is a 2’-modified nucleotide; at least one 5’-uridine-guanine-3’ (5’-ug-3’) dinucleotide, wherein the 5 ’-uridine is a 2’-modified nucleotide; at least one 5’-cytidine-adenine-3’ (5’-ca-3’)
dinucleotide, wherein the 5’-cytidine is a 2’-modified nucleotide; or at least one 5’-uridine- uridine-3’ (5’-uu-3’) dinucleotide, wherein the 5’-uridine is a 2’-modified nucleotide. These dinucleotide motifs are particularly prone to serum nuclease degradation (e.g. RNase A). Chemical modification at the 2'-position of the first pyrimidine nucleotide in the motif prevents or slows down such cleavage. This modification recipe is also known under the term 'endo light'.
In some embodiments, the oligonucleotide and/or oligonucleotide fragment(s) comprise a modified nucleobase, wherein the modified nucleobase is difluorotolyl, nitroindolyl, nitropyrrolyl, or nitroimidazolyl. In a particular embodiment, the modified nucleobase is difluorotolyl. In some embodiments, wherein the oligonucleotide and/or oligonucleotide fragment(s) is double-stranded, only one of the two strands contains a modified nucleobase. In some embodiments, wherein the oligonucleotide and/or oligonucleotide fragment(s) is double-stranded, both of the strands contain a modified nucleobase.
In some embodiments, the oligonucleotide fragment(s) and/or oligonucleotide comprises a modified sugar. Sugar modifications typically involve chemical modification of the sugar moiety of RNA or DNA. Sugar modifications include, but are not limited to, one of the following at the 2’-position: OH; F; O-, S-, orN-alkyl; O-, S-, orN-alkenyl; O-, S- or N- alkynyl; or O-alkyl-O-alkyl, wherein the alkyl, alkenyl and alkynyl can be substituted or unsubstituted Ci to Cio alkyl or C2 to Cio alkenyl and alkynyl. Exemplary modifications include O[(CH2)nO] mCHs, O(CH2).nOCH3, O(CH2)nNH2, O(CH2) nCH3, O(CH2)nONH2, and O(CH2)nON[(CH2)nCH3)]2, where n and m are from 1 to about 10. Oligonucleotide fragments for use in the methods described herein may include one of the following at the 2’ position: Ci to Cio lower alkyl, substituted lower alkyl, alkaryl, aralkyl, O-alkaryl or O-aralkyl, SH, SCH3, OCN, Cl, Br, CN, CF3, OCF3, SOCH3, SO2CH3, ONO2, NO2, N3, NH2, heterocycloalkyl, heterocycloalkaryl, aminoalkylamino, polyalkylamino, substituted silyl, an RNA cleaving group, a reporter group, an intercalator, a group for improving the pharmacokinetic properties of a therapeutic RNA, or a group for improving the pharmacodynamic properties of a therapeutic RNA. In some embodiments, the modification comprises a 2 ’-methoxy ethoxy (also known as 2 ’-O-(2 -methoxyethyl) or 2’-O-MOE), 2’- dimethylaminooxyethoxy (also known as 2’-DMAOE), and 2 ’-dimethylaminoethoxy ethoxy (also known in the art as 2’-O-dimethylaminoethoxyethyl or 2’-DMAEOE). Further exemplary modifications include: 5’-Me-2’-F nucleotides, 5’-Me-2’-Ome nucleotides, 5’- Me -2 ’-deoxynucleotides, 2 ’-alkoxyalkyl; and 2’-NMA (N-methylacetamide).
Other modifications include 2’-methoxy (2’-OCH3), 2 ’-aminopropoxy (2’- OCH2CH2CH2NH2) and 2 ’-fluoro (2’-F). Similar modifications can also be made at other positions on an RNA, particularly the 3 ’ position of the sugar on the 3 ’ terminal nucleotide or in 2’-5’ linked dsRNA and the 5’ position of 5’ terminal nucleotide.
In some embodiments, the oligonucleotide fragment(s) and/or oligonucleotide comprises at least one modified nucleotide. In some embodiments, the modification is selected from the group consisting of: 2’-O-methyl (2’-0me), 2’-flouro (2’-F), 2’-deoxy, 2’- deoxy-2’ -fluoro, 2’-O-methoxyethyl (2’-0-M0E), 2’-O-aminopropyl (2’-O-AP), 2’-O- dimethylaminoethyl (2’-0-DMA0E), 2’-O-dimethylaminopropyl (2’-0-DMAP), 2 -0- dimethylaminoethyloxyethyl (2’-0-DMAE0E), 2’-O-N-methylacetamido (2’-0-NMA), locked nucleic acid (LNA), glycol nucleic acid (GNA), phosphoramidate (e.g. mesyl phosphoramidate), 2’, 3 ’-seco nucleotide mimic, 2’-F -arabino nucleotide, abasic nucleotide, 2’-amino modified nucleotide, 2 ’-alkyl -modified nucleotide, morpholino nucleotide, vinylphosphonate (e.g. 5’ vinylphosphonate), and cyclopropyl phosphonate deoxyribonucleotide. In some embodiments, one or more of the oligonucleotide fragments comprises a 2 ’-modification selected from the group consisting of: 2’-0me, 2’-F, and 2’- deoxy. In some embodiments, the oligonucleotide and/or oligonucleotide fragment(s) comprises one or more 3'-O-methyl nucleotide.
In some embodiments, the oligonucleotide and/or oligonucleotide fragment(s) comprises a 2'-modification selected from the group consisting of: 2'-O-methyl (2’-0Me), 2'- flouro (2’-F), 2'-deoxy, 2'-deoxy-2’-fluoro, 2'-O-methoxyethyl (2'-0-M0E), 2'-O- aminopropyl (2'-O-AP), 2'-O-dimethylaminoethyl (2'-0-DMA0E), 2'-O- dimethylaminopropyl (2'-0-DMAP), 2'-O-dimethylaminoethyloxyethyl (2'-0-DMAE0E), 2'- O-N-methylacetamido (2'-0-NMA), locked nucleic acid (LNA), phosphoramidate (e.g. mesyl phosphoramidate), 2',3'-seco nucleotide mimic, 2'-F-arabino nucleotide, abasic nucleotide, 2'- amino modified nucleotide, 2'-alkyl-modified nucleotide, morpholino nucleotide, vinylphosphonate (e.g. 5’ vinylphosphonate), deoxyribonucleotide, and cyclopropyl phosphonate. In some embodiments, the oligonucleotide and/or oligonucleotide fragment(s) comprises one or more 3'-O-methyl nucleotide.
In some embodiments, the oligonucleotide and/or oligonucleotide fragment(s) comprises a bridged nucleic acid. In some embodiments, the bridged nucleic acid is locked nucleic acid. In some embodiments, the bridged nucleic acid is constrained ethyl bridged nucleic acid:
In some embodiments, all pyrimidines (uridine and cytidine) are 2’ O-methyl- modified nucleosides.
In some embodiments, the sense and/or antisense strand is conjugated to one or more diagnostic compound, reporter group, cross-linking agent, nuclease-resistance conferring moiety, modified or unmodified nucleobase, lipophilic molecule, cholesterol, lipid, lectin, steroid, uvaol, hecigenin, diosgenin, terpene, triterpene, sarsasapogenin, Friedelin, epifriedelanol-derivatized lithocholic acid, vitamin, carbohydrate, dextran, pullulan, chitin, chitosan, synthetic carbohydrate, oligo lactate 15-mer, natural polymer, low- or medium- molecular weight polymer, inulin, cyclodextrin, hyaluronic acid, protein, protein-binding agent, integrin-targeting molecule, polycationic, peptide, polyamine, peptide mimic, and/or transferrin.
In some embodiments, the antisense strand comprises at least one 2’-0Me modified nucleotide. In some embodiments, the antisense strand comprises at least one 2’-F modified nucleotide. In some embodiments, the antisense strand comprises at least one 2’-deoxy modified nucleotide. In some embodiments, the antisense strand comprises at least one 2’- OMe modified nucleotide, at least one 2’-F modified nucleotide, or at least one 2’-deoxy modified nucleotide, or any combination thereof. In some embodiments, the antisense strand comprises alternating 2’-0Me and 2’-F modified nucleotides. In some embodiments, the antisense strand comprises at least one 5’ vinylphosphonate. In some embodiments, the antisense strand comprises at least one chiral phosphorothioate linkage. In some embodiments, the antisense strand comprises at least one GNA. In some embodiments, the sense strand comprises at least one 2’-0Me modified nucleotide. In some embodiments, the sense strand comprises at least one 2’-F modified nucleotide. In some embodiments, the sense strand comprises at least one 2’-deoxy modified nucleotide. In some embodiments, the sense strand comprises at least one 2’-0Me modified nucleotide, at least one 2’-F modified nucleotide, or at least one 2’ -deoxy modified nucleotide, or any combination thereof. In some embodiments, the sense strand comprises alternating 2’-0Me and 2’-F modified nucleotides. In some embodiments, the antisense strand and the sense strand each comprise at
least one 2’-0Me modified nucleotide. In some embodiments, the antisense strand and the sense strand each comprise at least one 2’-F modified nucleotide. In some embodiments, the antisense strand and the sense strand each comprise alternating 2’-0Me and 2’-F modified nucleotides. In some embodiments, the sense strand comprises at least one 5’ vinylphosphonate. In some embodiments, the sense strand comprises at least one chiral phosphorothioate linkage. In some embodiments, the sense strand comprises at least one GNA.
In some embodiments, the sense strand comprises alternating 2’-0Me and 2’-F modified nucleotides over the full length of the sense strand. In some embodiments, the sense strand comprises alternating 2’-0Me and 2’-F modified nucleotides over part of the length of the sense strand e.g. over at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or more nucleotides of the sense strand.
In some embodiments, the antisense strand comprises alternating 2’-OMe and 2’-F modified nucleotides over the full length of the antisense strand. In some embodiments, the antisense strand comprises alternating 2’-OMe and 2’-F modified nucleotides over part of the length of the antisense strand e.g. over at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16,
17, 18, 19, 20 or more nucleotides of the antisense strand.
In some embodiments, the sense strand and antisense strand each comprise alternating 2’-OMe and 2’-F modified nucleotides over the full length of the sense strand and the antisense strand. In some embodiments, the sense strand and the antisense strand comprise alternating 2’-OMe and 2’-F modified nucleotides over part of the length of the sense strand and the antisense strand e.g. over at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17,
18, 19, 20 or more nucleotides of the sense strand and the antisense strand.
Ligands
In some embodiments, one or more of the oligonucleotide fragments is conjugated to at least one ligand. In some embodiments, the oligonucleotide product is conjugated to at least one ligand. The ligand may be conjugated to the sense strand, antisense strand or both strands, in any configuration e.g. at the 3 ’-end, 5 ’-end, non-end or a combination.
In some embodiments, the ligand comprises one or more N-Acetylgalactosamine (GalNAc) derivatives. In some embodiments, the ligand comprises one or more GalNAc derivatives conjugated through a bivalent or trivalent branched carrier.
In some embodiments, the ligand is:
In some embodiments, a ligand alters the distribution, targeting or lifetime of the molecule into which it is incorporated. In some embodiments, a ligand provides an enhanced affinity for a selected target, e.g., molecule, cell or cell type, compartment, receptor e.g., a cellular or organ compartment, tissue, organ or region of the body, as, e.g., compared to a
species absent such a ligand. Ligands providing enhanced affinity for a selected target are also termed targeting ligands.
Some ligands can have endosomolytic properties. The endosomolytic ligands promote the lysis of the endosome and/or transport of the oligonucleotide, or a composition comprising the oligonucleotide, from the endosome to the cytoplasm of the cell. The endosomolytic ligand may be a polyanionic peptide or peptidomimetic which shows pH- dependent membrane activity and fusogenicity. In some embodiments, the endosomolytic ligand assumes its active conformation at endosomal pH. The “active” conformation is that conformation in which the endosomolytic ligand promotes lysis of the endosome and/or transport of the oligonucleotide, or a composition comprising the oligonucleotide, from the endosome to the cytoplasm of the cell. Exemplary endosomolytic ligands include the GALA peptide (Subbarao et al., Biochemistry, 1987, 26: 2964-2972), the EALA peptide (Vogel et al., J. Am. Chem. Soc., 1996, 118: 1581-1586), and their derivatives (Turk et al., Biochem. Biophys. Acta, 2002, 1559: 56-68). The endosomolytic component may contain a chemical group (e.g., an amino acid) which will undergo a change in charge or protonation in response to a change in pH. The endosomolytic component may be linear or branched.
Ligands can improve transport, hybridization, and specificity properties and may also improve nuclease resistance of the resultant natural or modified oligonucleotide.
Ligands in general can include therapeutic modifiers, e.g., for enhancing uptake; diagnostic compounds or reporter groups e.g., for monitoring distribution; cross-linking agents; and nuclease-resistance conferring moieties. General examples include lipids, steroids, vitamins, sugars, proteins, peptides, polyamines, and peptide mimics.
Ligands can include a naturally occurring substance, such as a protein (e.g., human serum albumin (HSA), low-density lipoprotein (LDL), high-density lipoprotein (HDL), or globulin); a carbohydrate (e.g., a dextran, pullulan, chitin, chitosan, inulin, cyclodextrin or hyaluronic acid); or a lipid. The ligand may also be a recombinant or synthetic molecule, such as a synthetic polymer, e.g., a synthetic polyamino acid, an oligonucleotide (e.g., an aptamer). Examples of polyamino acids include polylysine (PLL), poly L aspartic acid, poly L-glutamic acid, styrene-maleic acid anhydride copolymer, poly(L-lactide-co-glycolied) copolymer, divinyl ether-maleic anhydride copolymer, N-(2-hydroxypropyl)methacrylamide copolymer (HMPA), polyethylene glycol (PEG), polyvinyl alcohol (PVA), polyurethane, poly(2-ethylacryllic acid), N-isopropylacrylamide polymers, or polyphosphazine. Examples of polyamines include: polyethylenimine, polylysine (PLL), spermine, spermidine, polyamine, pseudopeptide-polyamine, peptidomimetic polyamine, dendrimer polyamine,
arginine, amidine, protamine, cationic lipid, cationic porphyrin, quaternary salt of a polyamine, or an alpha helical peptide .
Ligands can also include targeting groups, e.g., a cell or tissue targeting agent, e.g., a lectin, glycoprotein, lipid or protein, e.g., an antibody, that binds to a specified cell type. A targeting group can be a thyrotropin, melanotropin, lectin, glycoprotein, surfactant protein A, Mucin carbohydrate, multivalent lactose, multivalent galactose, N-acetyl-galactosamine, N- acetyl-gulucosamine multivalent mannose, multivalent fucose, glycosylated polyaminoacids, multivalent galactose, transferrin, bisphosphonate, polyglutamate, polyaspartate, a lipid, cholesterol, a steroid, bile acid, folate, vitamin B12, biotin, an RGD peptide, an RGD peptidomimetic or an aptamer.
Other examples of ligands include dyes, intercalating agents (e.g., acridines), crosslinkers (e.g., psoralene, mitomycin C), porphyrins (TPPC4, texaphyrin, Sapphyrin), polycyclic aromatic hydrocarbons (e.g., phenazine, dihydrophenazine), artificial endonucleases or a chelator (e.g., EDTA), lipophilic molecules, e.g., cholesterol, cholic acid, adamantane acetic acid, 1 -pyrene butyric acid, dihydrotestosterone, 1,3-Bis- O(hexadecyl)glycerol, geranyloxyhexyl group, hexadecylglycerol, borneol, menthol, 1,3- propanediol, heptadecyl group, palmitic acid, myristic acid,O3-(oleoyl)lithocholic acid, 03- (oleoyl)cholenic acid, dimethoxytrityl, or phenoxazine)and peptide conjugates (e.g., antennapedia peptide, Tat peptide), alkylating agents, phosphate, amino, mercapto, PEG (e.g., PEG-40K), MPEG, [MPEG] 2, polyamino, alkyl, substituted alkyl, radiolabeled markers, enzymes, haptens (e.g., biotin), transport/absorption facilitators (e.g., aspirin, vitamin E, folic acid), synthetic ribonucleases (e.g., imidazole, bisimidazole, histamine, imidazole clusters, acridine-imidazole conjugates, Eu3+ complexes of tetraazamacrocycles), dinitrophenyl, HRP, or AP.
Ligands can be proteins, e.g., glycoproteins, or peptides, e.g., molecules having a specific affinity for a co-ligand, or antibodies e.g., an antibody, that binds to a specified cell type such as a cancer cell, endothelial cell, or bone cell. Ligands may also include hormones and hormone receptors. They can also include non-peptidic species, such as lipids, lectins, carbohydrates, vitamins, cofactors, multivalent lactose, multivalent galactose, N-acetyl- galactosamine, N-acetyl-gulucosamine multivalent mannose, multivalent fucose, or aptamers. The ligand can be, for example, a lipopolysaccharide, an activator of p38 MAP kinase, or an activator of NF -KB .
In some embodiments, the ligand is a lipid or lipid-based molecule. Such a lipid or lipid-based molecule preferably binds a serum protein, e.g., human serum albumin (HSA).
An HSA binding ligand allows for distribution of the conjugate to a target tissue. A lipid or lipid-based ligand can (a) increase resistance to degradation of the conjugate, (b) increase targeting or transport into a target cell or cell membrane, and/or (c) can be used to adjust binding to a serum protein, e.g., HSA. A lipid based ligand can be used to modulate, e.g., control the binding of the conjugate to a target tissue.
In some embodiments, the ligand is a peptide or a peptidomimetic. A peptidomimetic is a molecule capable of folding into a defined three-dimensional structure similar to a natural peptide. The peptide or peptidomimetic moiety can be about 5-50 amino acids long, e.g., about 5, 10, 15, 20, 25, 30, 35, 40, 45, or 50 amino acids long. A peptide or peptidomimetic can be, for example, a cell permeation peptide, cationic peptide, amphipathic peptide, or hydrophobic peptide (e.g., consisting primarily of Tyr, Trp or Phe). The peptide moiety can be a dendrimer peptide, constrained peptide or crosslinked peptide. In another alternative, the peptide moiety can include a hydrophobic membrane translocation sequence (MTS). The peptide moiety can be a “delivery” peptide, which can carry large polar molecules including peptides, oligonucleotides, and protein across cell membranes. A peptide or peptidomimetic can be encoded by a random sequence of DNA, such as a peptide identified from a phagedisplay library, or one-bead-one-compound (OBOC) combinatorial library (Lam et al., Nature, 354:82-84, 1991).
As used herein, a “peptide moiety” can range in length from about 5 amino acids to about 50 amino acids. The peptide moieties can have a structural modification, such as to increase stability or direct conformational properties. Any of the structural modifications described below can be utilized. An arginine-glycine-aspartic acid (RGD)-peptide moiety can be used to target a tumor cell, such as an endothelial tumor cell or a breast cancer tumor cell (Zitzmann et al., Cancer Res., 62:5139-43, 2002). An RGD peptide can facilitate targeting of an oligonucleotide to tumors of a variety of other tissues, including the lung, kidney, spleen, or liver (Aoki et al., Cancer Gene Therapy 8:783-787, 2001). The RGD peptide can be linear or cyclic, and can be modified, e.g., glycosylated or methylated to facilitate targeting to specific tissues. Peptides that target markers enriched in proliferating cells can be used. For example, RGD containing peptides and peptidomimetics can target cancer cells, in particular cells that exhibit an integrin. Thus, the ligand may comprise RGD peptides, cyclic peptides containing RGD, RGD peptides that include D-amino acids, or synthetic RGD mimics.
Peptide and peptidomimetic ligands include those having naturally occurring or modified peptides, e.g., D or L peptides; a, , or y peptides; N-methyl peptides; azapeptides;
peptides having one or more amide, i.e., peptide, linkages replaced with one or more urea, thiourea, carbamate, or sulfonyl urea linkages; or cyclic peptides .
Ligands can be coupled to the oligonucleotide fragment(s) and/or oligonucleotide at various places, for example, 3 ’-end, 5 ’-end, and/or at an internal (“non-end”) position. In some embodiments, the ligand is attached via an intervening tether, e.g., a carrier described herein. The ligand or tethered ligand may be present on a monomer when the monomer is incorporated into the oligonucleotide fragment(s) and/or oligonucleotide. In some embodiments, the ligand may be incorporated via coupling to a “precursor” monomer after the “precursor” monomer has been incorporated into the oligonucleotide fragment and/or the oligonucleotide. For example, a monomer having, e.g., an amino-terminated tether (i.e., having no associated ligand), e.g., TAP-(CH2)nNH2 may be incorporated into a growing oligonucleotide fragment. In a subsequent operation, i.e., after incorporation of the precursor monomer into the oligonucleotide fragment, a ligand having an electrophilic group, e.g., a pentafluorophenyl ester or aldehyde group, can subsequently be attached to the precursor monomer by coupling the electrophilic group of the ligand with the terminal nucleophilic group of the precursor monomer’s tether.
In another example, a monomer having a chemical group suitable for taking part in Click Chemistry reaction may be incorporated, e.g., an azide or alkyne terminated tether/linker. In a subsequent operation, i.e., after incorporation of the precursor monomer into the oligonucleotide fragment(s) and/or the oligonucleotide, a ligand having complementary chemical group, e.g. an alkyne or azide can be attached to the precursor monomer by coupling the alkyne and the azide together.
In some embodiments, the ligand is conjugated to nucleobases, sugar moieties, or intemucleosidic linkages of the oligonucleotide fragment(s) and/or oligonucleotide. Conjugation to purine nucleobases or derivatives thereof can occur at any position including, endocyclic and exocyclic atoms. In some embodiments, the 2-, 6-, 7-, or 8-positions of a purine nucleobase are attached to a conjugate moiety. Conjugation to pyrimidine nucleobases or derivatives thereof can also occur at any position. In some embodiments, the 2-, 5-, and 6- positions of a pyrimidine nucleobase can be substituted with a conjugate moiety. Conjugation to sugar moieties of nucleosides can occur at any carbon atom. Example carbon atoms of a sugar moiety that can be attached to a conjugate moiety include the 2', 3', and 5' carbon atoms. The T position can also be attached to a conjugate moiety, such as in an abasic residue. Intemucleosidic linkages can also bear conjugate moieties. For phosphorus- containing linkages (e.g., phosphodiester, phosphorothioate (e.g. chrial phosphorothioate),
phosphorodithiotate, phosphoroamidate, and the like), the conjugate moiety can be attached directly to the phosphorus atom or to an O, N, or S atom bound to the phosphorus atom. For amine- or amide-containing intemucleosidic linkages (e.g., PNA), the conjugate moiety can be attached to the nitrogen atom of the amine or amide or to an adjacent carbon atom.
In some embodiments, the ligand is conjugated to the sense strand. In some embodiments, the ligand is conjugated to the 3’ end of the sense strand. In some embodiments, the ligand is conjugated to the 5’ end of the sense strand. In some embodiments, the ligand is conjugated to a non-end of the sense strand.
In some embodiments, the ligand is conjugated to the antisense strand. In some embodiments, the ligand is conjugated to the 3’ end of the antisense strand. In some embodiments, the ligand is conjugated to a non-end of the antisense strand.
The ligand may be attached via a carrier. The carriers include (i) at least one “backbone attachment point,” preferably two “backbone attachment points” and (ii) at least one “tethering attachment point.” A “backbone attachment point” as used herein refers to a functional group, e.g. a hydroxyl group, or generally, a bond available for, and that is suitable for incorporation of the carrier into the backbone, e.g., the phosphate, or modified phosphate, e.g., sulfur containing, backbone, of a nucleic acid. A “tethering attachment point” (TAP) in some embodiments refers to a constituent ring atom of the cyclic carrier, e.g., a carbon atom or a heteroatom (distinct from an atom which provides a backbone attachment point), that connects a selected moiety. The moiety can be, e.g., a carbohydrate, e.g. monosaccharide, disaccharide, trisaccharide, tetrasaccharide, oligosaccharide and polysaccharide. Optionally, the selected moiety is connected by an intervening tether to the cyclic carrier. Thus, the cyclic carrier will often include a functional group, e.g., an amino group, or generally, provide a bond, that is suitable for incorporation or tethering of another chemical entity, e.g., a ligand to the constituent ring.
Wherein the oligonucleotide fragment is a dsRNA, the sense and/or antisense strand may be conjugated to a ligand via a carrier, wherein the carrier can be cyclic group or acyclic group; preferably, the cyclic group is selected from pyrrolidinyl, pyrazolinyl, pyrazolidinyl, imidazolinyl, imidazolidinyl, piperidinyl, piperazinyl, [l,3]dioxolane, oxazolidinyl, isoxazolidinyl, morpholinyl, thiazolidinyl, isothiazolidinyl, quinoxalinyl, pyridazinonyl, tetrahydrofuryl and and decalin; preferably, the acyclic group is selected from serinol backbone or diethanolamine backbone.
In some embodiments, one or more oligonucleotide fragments comprise the sequence “TT”, “dTdT”, “dTsdT” or “UU” as a single-stranded overhang at the 3’ end, also termed
herein a terminal dinucleotide or 3’ terminal dinucleotide. dT is 2'-deoxy-thymidine-5’- phosphate and sdT is 2'-deoxy Thymidine 5'-phosphorothioate. Terminal dinucleotide “UU” is UU or 2’-0Me-U 2’-0Me-U, and the terminal TT and the terminal UU can be in the inverted/reverse orientation. The terminal dinucleotide (e.g., UU) is a modified variant of the dithymidine dinucleotide commonly placed as an overhang to protect the ends of siRNAs from nucleases (see, for example, Elbashir et al. 2001 Nature 411: 494-498; Elbashir et al. 2001 EMBO J. 20: 6877-6888; and Kraynack et al. 2006 RNA 12: 163-176). A terminal dinucleotide is known from these references to enhance nuclease resistance but not contribute to target recognition.
In some embodiments, one or both terminal oligonucleotide fragments comprise a 3 ’ end cap instead of or in addition to a terminal dinucleotide to stabilize the end from nuclease degradation provided that the 3’ end cap is able to both stabilize the oligonucleotide (e.g., against nucleases) and not interfere excessively with its desired activity.
Wherein the oligonucleotide fragment is a dsRNA, the sense and/or antisense strand may be conjugated to a ligand via a carrier, wherein the carrier can be cyclic group or acyclic group; preferably, the cyclic group is selected from pyrrolidinyl, pyrazolinyl, pyrazolidinyl, imidazolinyl, imidazolidinyl, piperidinyl, piperazinyl, [l,3]dioxolane, oxazolidinyl, isoxazolidinyl, morpholinyl, thiazolidinyl, isothiazolidinyl, quinoxalinyl, pyridazinonyl, tetrahydrofuryl and and decalin; preferably, the acyclic group is selected from serinol backbone or diethanolamine backbone.
Additional embodiments
Embodiment 1. An engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NOs: 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350,
352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386,
388, 390, 392, 394, 396, 398, 400, 402, 404, 406, 408, 410, 412, 414, 416, 418, 420, 422,
424, 426, 428, 430, 432, 434, 436, 438, 440, 442, 444, 446, 448, 450, 452, 454, 456, 458,
460, 462, 464, 466, 468, 470, 472, 474, 476, 478, 480, 482, 484, 486, 488, 490, 492, 494,
496, 498, 500, 502, 504, 506, 508, 510, 512, 514, 516, 518, 520, 522, 524, 526, 528, 530,
532, 534, 536, 538, 540, 542, 544, 546, 548, 550, 552, 554, 556, 558, 560, 562, 564, 566,
568, 570, 572, 574, 576, 578, 580, 582, 584, 586, 588, 590, 592, 594, 596, 598, and 600; wherein the engineered dsRNA ligase polypeptide:
(a) has dsRNA ligase activity; and
(b) does not the comprise the amino acid sequence of SEQ ID NO: 302.
Embodiment 2. An engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NOs: 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350,
352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386,
388, 390, 392, 394, 396, 398, 400, 402, 404, 406, 408, 410, 412, 414, 416, 418, 420, 422,
424, 426, 428, 430, 432, 434, 436, 438, 440, 442, 444, 446, 448, 450, 452, 454, 456, 458,
460, 462, 464, 466, 468, 470, 472, 474, 476, 478, 480, 482, 484, 486, 488, 490, 492, 494,
496, 498, 500, 502, 504, 506, 508, 510, 512, 514, 516, 518, 520, 522, 524, 526, 528, 530,
532, 534, 536, 538, 540, 542, 544, 546, 548, 550, 552, 554, 556, 558, 560, 562, 564, 566,
568, 570, 572, 574, 576, 578, 580, 582, 584, 586, 588, 590, 592, 594, 596, 598, 600, 636,
638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, and 668; wherein the engineered dsRNA ligase polypeptide:
(a) has dsRNA ligase activity; and
(b) does not the comprise the amino acid sequence of SEQ ID NO: 302.
Embodiment 3. The engineered dsRNA ligase polypeptide of Embodiment 1, wherein the polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 370, 488, 526, 578, 588, 590, and 592.
Embodiment 4. The engineered dsRNA ligase polypeptide of Embodiment 2, wherein the polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 370, 488, 526, 578, 588, 590, 592 and 666.
Embodiment 5. An engineered dsRNA ligase polypeptide comprising an amino acid sequence having at least 80% sequence identity to SEQ ID NO: 302, which produces at least 5% more oligonucleotide product than a dsRNA ligase polypeptide comprising the amino acid sequence of SEQ ID NO: 302 under the same ligation reaction conditions, wherein the engineered dsRNA ligase polypeptide does not the comprise the amino acid sequence of SEQ ID NO: 302.
Embodiment 6. The engineered dsRNA ligase polypeptide of Embodiment 5, wherein the ligation reaction conditions include about 1 pM to about 10 mM oligonucleotide fragment, a source of ATP, about 5 mM to about 100 mM divalent cation, and about 0.5 g/L to about 10 g/L engineered dsRNA ligase polypeptide, pH of about 4.0 to about 8.0, and temperature of about 10 °C to about 50 °C.
Embodiment 7. The engineered dsRNA ligase polypeptide of Embodiment 5 or 6, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X6, X7, X15, X19, X29, X36, X39, X46, X47, X49, X51, X53, X56, X57, X60, X63, X64, X66, X67, X87, X88, X91, X93, X103, X105, X107, XI 14, X122, X126, X129, X130, X131, X137, X144, X146, X158, X163, X173, X178, X190, X196, X216, X218, X221, X228, X230, X232, X235, X236, X237, X238, X239, X242, X243, X244, X251, X252, X254, X255, X258, X269, X280, X284, X285, X293, X296, X301, X303, X305, X314, X325, and X328, wherein the numbering refers to SEQ ID NO: 302.
Embodiment 8. The engineered dsRNA ligase polypeptide of Embodiment 7, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X6 is G; X7 is Q; X15 is R, D or E; X19 is Q or D; X29 is N or L; X36 is V; X39 is A; X46 is Y; X47 is E; X49 is G; X51 is L; X53 is Y; X56 is R or A; X57 is S; X60 is T, G or P; X63 is S, Q or G; X64 is R, T, Q, F, G, or M; X66 is F or W; X67 is N; X87 is T, P, K or absent; X88 is C; X91 is S; X93 is G, C, or A; X103 is V, C, Y, or T; X105 is V; X107 is R or T; XI 14 is N; X122 is W; X126 is G; X129 is N; X130 is R, S or Y; X131 is R; X137 is V or C; X144 is N; X146 is R; X158 is W; X163 is G; X173 is L; X178 is R; X190 is Q; X196 is S or C; X216 is L or R; X218 is N; X221 is I; X228 is R; X230 is T; X232 is R; X235 is A, T, or G; X236 is S, L, or F; X237 is S, Q, or R; X238 is F; X239 is G or R; X242 is R or M; X243 is N, S, G, or M; X244 is G or K; X251 is D or L; X252 is V; X254 is K; X255 is C; X258 is V; X269 is L; X280 is W; X284 is A; X285 is A; X293 is R; X296 is R; X301 is G, L, E, or F; X303 is Q; X305 is G; X314 is A or V; X325 is R; and X328 is R; wherein the numbering refers to SEQ ID NO: 302.
Embodiment 9. The engineered dsRNA ligase polypeptide of any one of Embodiments 5-8, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises
an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X6, X7, X15, X19, X29, X36, X39, X44, X45, X46, X47, X49, X51, X53, X56, X57, X60, X63, X64, X66, X67, X87, X88, X89, X91, X92, X93, X103, X105, X107, X114, X122, X126, X129, X130, X131, X137, X144, X146, X158,
X163, X173, X178, X185, X190, X196, X216, X218, X221, X228, X230, X232, X235,
X236, X237, X238, X239, X242, X243, X244, X251, X252, X254, X255, X258, X269,
X280, X284, X285, X293, X296, X301, X303, X305, X313, X314, X325, and X328, wherein the numbering refers to SEQ ID NO: 302.
Embodiment 10. The engineered dsRNA ligase polypeptide of Embodiment 9, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X6 is G or E; X7 is Q; X15 is R, D or E; X19 is Q or D; X29 is N or L; X36 is V; X39 is A; X44 is V; X45 is V; X46 is Y; X47 is E; X49 is G; X51 is L; X53 is Y; X56 is R or A; X57 is S; X60 is T, G or P; X63 is S, Q or G; X64 is R, T, Q, F, G, or M; X66 is F or W; X67 is N; X87 is T, P, K or absent; X88 is C; X89 is T; X91 is S; X92 is D; X93 is G, C, or A; X103 is V, C, Y, or T; X105 is V; X107 is R or T; XI 14 is N; X122 is W; X126 is G; X129 is N; X130 is R, S or Y; X131 is R; X137 is V or C; X144 is N; X146 is R; X158 is W; X163 is G; X173 is L; X178 is R; X185 is K; X190 is Q; X196 is S or C; X216 is L or R; X218 is N; X221 is I; X228 is R; X230 is T; X232 is R; X235 is A, T, or G; X236 is S, L, or F; X237 is S, Q, R, L or G; X238 is F; X239 is G or R; X242 is R or M; X243 is N, S, G, or M; X244 is G or K; X251 is D or L; X252 is V; X254 is K; X255 is C; X258 is V; X269 is L; X280 is W; X284 is A; X285 is A; X293 is R; X296 is R; X301 is G, L, E, or F; X303 is Q; X305 is G; X313 is A; X314 is A or V; X325 is R; and X328 is R; wherein the numbering refers to SEQ ID NO: 302.
Embodiment 11. The engineered dsRNA ligase polypeptide of any one of Embodiments 5-10, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X15, X19, X36, X39, X53, X218, X221, X237, X251, X255, and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity.
Embodiment 12. The engineered dsRNA ligase polypeptide of Embodiment 11, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more
of the following amino acid residues: X15 is D or E; X19 is D; X36 is V; X39 is A; X53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A; wherein the numbering refers to SEQ ID NO: 302.
Embodiment 13. The engineered dsRNA ligase polypeptide of any one of Embodiments 5-12, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X15, X19, X36, X39, X53, X185, X218, X221, X237, X251, X255, and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity.
Embodiment 14. The engineered dsRNA ligase polypeptide of Embodiment 13, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X15 is D or E; X19 is D; X36 is V; X39 is A; X53 is Y; X185 is K; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A; wherein the numbering refers to SEQ ID NO: 302.
Embodiment 15. The engineered dsRNA ligase polypeptide of any one of Embodiments 5-14, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X36, X39, X218 and X221, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X36 is V; X39 is A; X218 is N; and X221 is I.
Embodiment 16. The engineered dsRNA ligase polypeptide of any one of Embodiments 5-15, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X39, X218 and X221, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X39 is A; X218 is N; and X221 is l.
Embodiment 17. The engineered dsRNA ligase polypeptide of any one of Embodiments 5-16, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X39, X218, X221 and X255, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X39 is A; X218 is N; X221 is I; and X255 is C.
Embodiment 18. The engineered dsRNA ligase polypeptide of any one of Embodiments 5-17, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X39, X53, X218, X221, X237, X251, X255 and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X39 is A; X 53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A.
Embodiment 19. The engineered dsRNA ligase polypeptide of any one of Embodiments 5-18, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X15, X39, X53, X218, X221, X237, X251, X255 and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X15 is E; X39 is A; X 53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A.
Embodiment 20. The engineered dsRNA ligase polypeptide of any one of Embodiments 5-19, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X19, X39, X53, X218, X221, X237, X251, X255
and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X19 is D; X39 is A; X 53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A.
Embodiment 21. The engineered dsRNA ligase polypeptide of any one of Embodiments 5-20, wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X15, X39, X53, X185, X218, X221, X237, X251, X255 and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X15 is D; X39 is A; X 53 is Y; X185 is K; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A.
Embodiment 22. The engineered dsRNA ligase polypeptide of any of Embodiments 1- 21, wherein the engineered dsRNA ligase polypeptide comprises a purification tag.
Embodiment 23. The engineered dsRNA ligase polypeptide of Embodiment 22, comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140,
142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176,
178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212,
214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248,
250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284,
286, 288, 290, 292, 294, 296, 298 and 300.
Embodiment 24. The engineered dsRNA ligase polypeptide of Embodiment 22, comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104,
106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140,
142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176,
178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212,
214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248,
250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284,
286, 288, 290, 292, 294, 296, 298, 300, 602, 604, 606, 608, 610, 612, 614, 616, 618, 620,
622, 624, 626, 628, 630, 632, and 634.
Embodiment 25. A polypeptide immobilized on a solid material by chemical bond or a physical adsorption method, wherein the polypeptide comprises an engineered dsRNA ligase polypeptide according to any one of Embodiments 1-24.
Embodiment 26. A polynucleotide encoding the engineered dsRNA ligase polypeptide of any one of Embodiments 1-24.
Embodiment 27. The polynucleotide of Embodiment 26, wherein the polynucleotide sequence is SEQ ID NO: 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165,
167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201,
203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237,
239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 273,
275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 303, 305, 307, 309, 311,
313, 315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347,
349, 351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 383,
385, 387, 389, 391, 393, 395, 397, 399, 401, 403, 405, 407, 409, 411, 413, 415, 417, 419,
421, 423, 425, 427, 429, 431, 433, 435, 437, 439, 441, 443, 445, 447, 449, 451, 453, 455,
457, 459, 461, 463, 465, 467, 469, 471, 473, 475, 477, 479, 481, 483, 485, 487, 489, 491,
493, 495, 497, 499, 501, 503, 505, 507, 509, 511, 513, 515, 517, 519, 521, 523, 525, 527,
529, 531, 533, 535, 537, 539, 541, 543, 545, 547, 549, 551, 553, 555, 557, 559, 561, 563,
565, 567, 569, 571, 573, 575, 577, 579, 581, 583, 585, 587, 589, 591, 593, 595, 597, or 599.
Embodiment 28. The polynucleotide of Embodiment 26, wherein the polynucleotide sequence is SEQ ID NO: 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165,
167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201,
203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237,
239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 273,
275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 303, 305, 307, 309, 311,
313, 315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347,
349, 351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 383,
385, 387, 389, 391, 393, 395, 397, 399, 401, 403, 405, 407, 409, 411, 413, 415, 417, 419,
421, 423, 425, 427, 429, 431, 433, 435, 437, 439, 441, 443, 445, 447, 449, 451, 453, 455,
457, 459, 461, 463, 465, 467, 469, 471, 473, 475, 477, 479, 481, 483, 485, 487, 489, 491,
493, 495, 497, 499, 501, 503, 505, 507, 509, 511, 513, 515, 517, 519, 521, 523, 525, 527,
529, 531, 533, 535, 537, 539, 541, 543, 545, 547, 549, 551, 553, 555, 557, 559, 561, 563,
565, 567, 569, 571, 573, 575, 577, 579, 581, 583, 585, 587, 589, 591, 593, 595, 597, 599,
601, 603, 605, 607, 609, 611, 613, 615, 617, 619, 621, 623, 625, 627, 629, 631, 633, 635,
637, 639, 641, 643, 645, 647, 649, 651, 653, 655, 657, 659, 661, 663, 665, and 667.
Embodiment 29. An expression vector comprising the polynucleotide according to any one of Embodiments 26-28.
Embodiment 30. The expression vector of Embodiment 29, which comprises a plasmid, a cosmid, a bacteriophage or a viral vector.
Embodiment 31. A host cell comprising the polynucleotide of any one of Embodiments 26-28 or the expression vector of Embodiment 29 or 30, optionally wherein the host cell is E. coli.
Embodiment 32. A method of preparing an engineered dsRNA ligase polypeptide, which comprises the steps of culturing the host cell according to Embodiment 31 and obtaining an engineered dsRNA ligase polypeptide from the culture.
Embodiment 33. An engineered dsRNA ligase catalyst obtainable by culturing the host cells according to Embodiment 31, or according to the method of Embodiment 32, wherein said engineered dsRNA ligase catalyst comprises cells or culture fluid containing the engineered dsRNA ligase polypeptides, or an article processed therewith, wherein the article refers to an extract obtained from the culture of host cell, an isolated product obtained by isolating or purifying an engineered dsRNA ligase from the extract, or an immobilized product obtained by immobilizing host cell, an extract thereof, or isolated product of the extract.
Embodiment 34. A method of producing an oligonucleotide from two or more oligonucleotide fragments, wherein the method comprises contacting:
(i) two or more oligonucleotide fragments;
(ii) an engineered dsRNA ligase polypeptide according to any one of Embodiments 1- 24;
(iii) a source of ATP; and
(iv) a divalent cation; to obtain an oligonucleotide.
Embodiment 35. The method of Embodiment 34, wherein the source of ATP comprises ATP.
Embodiment 36. The method of Embodiment 34 or 35, wherein the source of ATP comprises:
(a) polyphosphate kinase (PPK);
(b) polyphosphate; and
(c) AMP and/or ATP.
Embodiment 37. The method of Embodiment 36, wherein the PPK is selected from PPK12 or ajPAP.
Embodiment 38. The method of any one of Embodiments 36 or 37, wherein the method is performed using a sub-stoichiometric concentration of AMP and/or ATP.
Embodiment 39. The method of any one of Embodiments 36-38, wherein the polyphosphate is a polyphosphate salt.
Embodiment 40. The method of Embodiment 39, wherein the polyphosphate salt is sodium polyphosphate (Maddrell’s salt) or sodium hexametaphosphate (Graham’s salt).
Embodiment 41. The method of any one of Embodiments 34-40, wherein the divalent cation cofactor is Mg2+ or Mn2+.
Embodiment 42. The method of any one of Embodiments 34-41, wherein the method is performed with a divalent cation concentration of 5-100 mM, optionally 30-50 mM.
Embodiment 43. The method of any one of Embodiments 34-42, further comprising a step of purifying the oligonucleotide.
Embodiment 44. Use of the engineered dsRNA ligase polypeptide according to any one of Embodiments 1-24 in the production of an oligonucleotide from two or more oligonucleotide fragments.
Embodiment 45. The method of any one of Embodiments 34-43 or the use of
Embodiment 44, wherein the oligonucleotide is up to 60 nucleotides in length.
Embodiment 46. The method of any one of Embodiments 34-43 or 45 or the use of Embodiment 44 or 45, wherein each of the oligonucleotide fragments are 4-16 nucleotides in length, optionally 6-9 nucleotides in length.
Embodiment 47. The method of Embodiment 34-43, 45 or 46 or the use of any one of Embodiments 44-46, wherein one or more of the oligonucleotide fragment(s) comprises one or two overhangs.
Embodiment 48. The method of any one of Embodiments 34-43 or 45-47 or the use of any one of Embodiments 44-47, wherein one or more of the oligonucleotide fragments comprises a chemical modification.
Embodiment 49. The method or use of Embodiment 48, wherein the chemical modification is selected from:
(a) a modified backbone, optionally selected from a phosphorothioate (e.g. chiral phosphorothioate) or methylphosphonate intemucleotide linkage;
(b) a modified nucleotide, optionally selected from 2'-O-methyl (2’-0Me), 2'-flouro (2’-F), 2'-deoxy, 2'-deoxy-2’-fluoro, 2'-O-methoxyethyl (2'-0-M0E), 2'-O- aminopropyl (2'-O-AP), 2'-O-dimethylaminoethyl (2'-0-DMA0E), 2'-O- dimethylaminopropyl (2'-0-DMAP), 2'-O-dimethylaminoethyloxyethyl (2'-O- DMAEOE), 2'-O-N-methylacetamido (2'-0-NMA), locked nucleic acid (LNA), glycol nucleic acid (GNA), phosphoramidate (e.g. mesyl phosphoramidate), 2',3'-seco nucleotide mimic, 2'-F-arabino nucleotide, abasic nucleotide, 2'-amino modified nucleotide, 2'-alkyl-modified nucleotide, morpholino nucleotide, vinylphosphonate (e.g. 5’ vinylphosphonate), and cyclopropyl phosphonate deoxyribonucleotide; and/or
(c) conjugation to a ligand, optionally wherein the ligand comprises one or more N- Acetylgalactosamine (GalNAc) derivatives.
Embodiment 50. A composition comprising: i. the engineered dsRNA ligase polypeptide according to any one of Embodiments 1- 24; ii. a source of ATP; and iii. a divalent cation.
Embodiment 51. The composition of Embodiment 50, further comprising two or more oligonucleotide fragments.
Embodiment 52. A kit comprising: i. the engineered dsRNA ligase polypeptide according to any one of Embodiments 1- 24; ii. a source of ATP; iii. a divalent cation; and iv. instructions for use in a method of producing an oligonucleotide from two or more oligonucleotide fragments.
Embodiment 53. The composition of Embodiment 50 or 51 or the kit of Embodiment 52, wherein the source of ATP comprises ATP.
Embodiment 54. The composition of any one of Embodiments 50, 51 or 53 or the kit of Embodiment 52 or 53, wherein the source of ATP comprises:
(a) polyphosphate kinase (PPK);
(b) polyphosphate; and
(c) AMP and/or ATP.
Embodiment 55. The composition or kit of Embodiment 54, wherein the PPK is selected from PPK 12 or ajPAP.
Embodiment 56. The composition of any one of Embodiments 50, 51 or 53-55 or the kit of any one of Embodiments 52-55, wherein the polyphosphate is a polyphosphate salt.
Embodiment 57. The composition or kit of Embodiment 56, wherein the polyphosphate salt is sodium polyphosphate (Maddrell’s salt) or sodium hexametaphosphate (Graham’s salt). Embodiment 58. The composition of any one of Embodiments 50, 51 or 53-57 or the kit of any one of Embodiments 52-57, wherein the divalent cation cofactor is Mg2+ or Mn2+.
Different features and embodiments of the present disclosure are exemplified in the following representative examples, which are intended to be illustrative and not restrictive.
EXAMPLES
The following Examples, including experiments and results achieved, are provided for illustrative purposes only and are not to be construed as limiting the present invention.
In the Examples below, the following abbreviations apply: ppm (parts per million); M (molar); mM (millimolar), uM and pM (micromolar); nM (nanomolar); mol (moles); gm and g (gram); mg (milligrams); ug and pg (micrograms); L and 1 (liter); ml and mb (milliliter); cm (centimeters); mm (millimeters); um and pm (micrometers); sec. (seconds); min(s) (minute(s)); h(s) and hr(s) (hour(s)); U (units); MW (molecular weight); rpm (rotations per minute); psi and PSI (pounds per square inch); °C (degrees Centigrade); RT and rt (room temperature); OD600
(Optical density at 600 nm), CAM and cam (chloramphenicol); DMSO (dimethylsulfoxide); FP (Fermentation powder); FWC (Frozen whole cells), LWC (Lyophilized whole cells), PMBS (polymyxin B sulfate); IPTG (isopropyl P-D-l -thiogalactopyranoside); LB (Lysogeny broth); TB (Terrific Broth; 12 g/L bacto-tryptone, 24 g/L yeast extract, 4 mL/L glycerol, 65 mM potassium phosphate, pH 7.0, 1 mM MgSOi): TEoA (triethanolamine buffer), HEPES (HEPES zwitterionic buffer; 4-(2-hydroxyethyl)-piperazineethanesulfonic acid); SFP (shake flask powder); CDS (coding sequence); DNA (deoxyribonucleic acid); RNA (ribonucleic acid); E. coli W3110 (commonly used laboratory E. coli strain, available from the Coli Genetic Stock Center [CGSC], New Haven, CT); HTP (high throughput); HPLC (high pressure liquid chromatography); FIOP (fold improvements over positive control); Microfluidics (Microfluidics, Corp., Westwood, MA); Sigma-Aldrich (Sigma-Aldrich, St. Louis, MO; Difco (Difco Laboratories, BD Diagnostic Systems, Detroit, MI); Agilent (Agilent Technologies, Inc., Santa Clara, CA); Coming (Coming, Inc., Palo Alto, CA); Dow Coming (Dow Coming, Corp., Midland, MI); and Gene Oracle (Gene Oracle, Inc., Mountain View, CA).
The sequences of the oligonucleotides referred to in parentheses (e.g. “siRNA (1)” and “oligonucleotide (2)") throughout the Examples are provided in Table 1.
EXAMPLE 1
Preparation of isolated enzymes
Polynucleotides encoding the polypeptides having ligase activity, were cloned into the pCKl 10900 vector system (See e.g., US Pat. App. No. 2006/0195947A1 FIG. 3 which is hereby incorporated by reference in its entirety) and subsequently expressed in E. coli W311O 7n/A under the control of the lac promoter. The expression vector also contained the Pl 5a origin of replication and the chloramphenicol (CAM) resistance gene.
E. coli W311O 7IMA cells were transformed with the pCKl 10900 plasmid containing the ligase-encoding genes. Transformed cells were plated out on Lysogeny broth (LB) agar plates containing 1% glucose and 30 pg/mL CAM, and grown overnight at 37° C. Subsequently single colonies were inoculated into 25 mL of LB supplemented with 30 pg/mL CAM and 1% glucose in a 250 ml baffled shake flask. The culture was grown overnight (16-20 hours and optical density (ODeoo) >3.8) in an incubator at 37°C, with shaking at 250 rpm. A I L shake flask containing 250 mL of Terrific Broth (TB) media with 30 pg/mL CAM, was inoculated with 5 mL of the grown overnight culture. The 250 mL culture was incubated at 30°C, 250 rpm, for 3 - 3.5 hours until ODeoo reached 0.6-0.8. Expression of the ligase gene was induced
by the addition of isopropyl-P-D-thiogalactoside (IPTG) to a final concentration of 1 mM, and growth was continued for an additional 18-20 hours. Cells were harvested by transferring the culture into a centrifuge bottle, which was then centrifuged at 7,000 rpm for 5 minutes at 4°C. The supernatant was discarded, and the remaining cell pellet lysed. For lysis, the cell pellet was resuspended in 30 mb of 50 mM Tris-buffer at pH 7.5 and lysed using a LM20 MICROFLUIDIZER® processor system (Microfluidics). Cell debris was removed by centrifugation at 14,000 rpm for 30 minutes at 4°C. Ligase enzymes were then isolated from the clarified lysate using standard techniques known in the art, including immobilized metal affinity chromatography.
EXAMPLE 2
Identification of dsRNA ligase activity for the production of siRNA (1)
To identify an enzyme with dsRNA ligase activity for the production of siRNA (1) comprising of oligonucleotides (2) and (3) a collection of ligases were first screened for the production of a surrogate product, siRNA (4) comprising of oligonucleotides (3) and (5). Oligonucleotide (5) has the same sequence as oligonucleotide (2) but does not contain a 3’- GalNAc moiety. siRNAs (1) and (4) and oligonucleotides (2), (3) and (5) are depicted in Figure 1; and the sequences of oligonucleotides (2), (3) and (5) are provided in Table 1.
Screening of the isolated ligases was performed in 20 pL reaction volumes in PCR tubes, each tube containing 50 mM Tris-buffer pH 7.5, either 1 mM ATP or 1 mM NAD+, 10 mM MgC12, 5 mM DTT and 10 pM (each) of substrate oligonucleotides (6 - 11) with 50 % (v/v) isolated ligase enzyme. Reactions were incubated in a thermocycler at 16 °C for 2 h and analyzed using standard techniques known in the art, including electrophoresis. A ligase with SEQ ID NO: 2 exhibited the highest dsRNA ligase activity towards the formation of siRNA (4). The activity of SEQ ID NO: 2 for the production of siRNA (1) was subsequently confirmed using multiple enzyme preparations including isolated enzyme (example 1), clarified lysate (example 4) and shake flask powder (SFP; example 5).
Table 1 Oligonucleotide sequences
EXAMPLE 3
Preparation of cell pellets for high throughput (HTP) screening
Single colonies were picked in a 96-well format and grown in 190 pL LB media containing 1% glucose and 30 pg/mL CAM, at 30°C, 200 rpm, and 85% humidity. Following overnight growth, 20 pL of the grown cultures were transferred into a deep well plate containing 380 pL of TB media with 30 pg/mL CAM. The cultures were grown at 30°C, 250 rpm, with 85% humidity for approximately 2.5 hours. When the ODeoo of the cultures reached 0.4-0.8, expression of the ligase gene was induced by the addition of IPTG to a final concentration of 1 mM. Following induction, growth continued for 18-20 hours at 30°C, 250 rpm with 85% humidity. Cells were harvested by centrifugation at 4,000 rpm and 4°C for 10 minutes; the supernatant was then discarded. The cell pellets were stored at -80°C until ready for use.
EXAMPLE 4
Lysis and preparation of clarified lysate
Prior to performing the assay, the cell pellets were thawed and resuspended in 300 pL of lysis buffer (containing 1 g/L lysozyme, 0.5 g/L PMBS and 0.1 pL/mL or 0.2U/ml of commercial DNAse (New England BioLabs, M0303L) in 50 mM Tris-buffer at pH 7.5. The
plates were agitated with medium-speed shaking for 2.5 hours on a microtiter plate shaker at room temperature. The plates were then centrifuged at 4,000 rpm for 10 minutes at 4°C, and the clarified supernatants were used in the HTP assay reaction for activity determination as described in the following examples.
EXAMPLE 5
Preparation of shake flask powder (SFP) and fermentation powder (FP)
Shake-flask procedures can be used to generate engineered dsRNA ligase polypeptide shake-flask powders (SFP), which are useful for secondary screening assays and/or use in the biocatalytic processes described herein. Shake flask powder preparation of enzymes provides a more concentrated preparation of the engineered enzyme, as compared to the cell lysate used in HTP assays.
Clarified lysate produced according to example 1 was collected, frozen at -80°C, and then lyophilized, using standard methods known in the art. Lyophilization of frozen clarified lysate provides a dry SFP comprising crude wild-type or engineered dsRNA ligase polypeptide.
EXAMPLE 6
Analytical method for activity and selectivity evaluation
Activity improvements of the engineered dsRNA ligases were analyzed by High Pressure Liquid Chromatography (HPLC) using the methods described in Table 6-1 and 6-2. HPLC methods with UV-detection were developed to analyze the formation of product oligonucleotides (2) and (3). The analytical methods aim for the shortest run time enabling good resolution of the product oligonucleotides (2) and (3). Consequently, the six substrate (6- 7, 9-12) and the four intermediate (13-16) oligonucleotides could not all be well resolved from each other. However, it is possible to resolve the well-defined GalNAc -containing oligonucleotides, including the substrate oligonucleotide (12), a reaction intermediate oligonucleotide (14) and the product oligonucleotide (2). Therefore, a pseudo-% conversion can be calculated, denoted with arbitrary units (AU), which considers only these well resolved species according to the following equation:
Whereby 8(2), 8(12) and 8<i4) are the extinction coefficient of oligonucleotides (2), (12) and (14) respectively. Using such a calculation an AU = 1.0 would imply that no more GalNAc- containing substrate or intermediate oligonucleotides are present in the reaction and that they have all be converted to GalNAc -containing product (2). In reality, for samples where AU = 1.0 the only other peak present in the chromatogram corresponds with the product oligonucleotide (2), and no other intermediates or starting materials can be identified. Furthermore, the ratio of the product oligonucleotides (2) and (3) are consistent with that of the authentic standard of siRNA product (1). Taken together, it can be concluded that AU = 1.0 is an approximation that is essentially equivalent to 100 % conversion.
Table 6-1: HPUC method 1 used for activity determination.
HPLC method 2 (Table 6-2) was developed from HPLC method 1 (Table 6-1) to improve the separation between the product oligonucleotide (3) and the substrate oligonucleotide (12).
The activity improvements of the engineered dsRNA ligases of Example 12 were also analyzed with RapidFire® Mass Spectrometry (RF-MS) using the method described in Table 6-3. RF-MS aims to reduce the analytical time compared to HPLC analysis. The selective detection of product oligonucleotides (2) and (3) is obtained by the specific masses of each product oligonucleotide analyzed under the multi-single ion monitoring (SIM) mode. Relative
dsRNA ligase activity is determined by comparing the sum of the MS signal of the five specific masses (given in Table 6-3) corresponding with each product oligonucleotide (2) and (3).
The methods provided herein find use in analyzing the variants produced using the present invention. However, it is not intended that present invention be limited to the methods described herein, as there are other suitable methods known in the art that are applicable to the analysis of the variants provided herein and/or produced using the methods provided herein.
EXAMPLE 7
Round 1 Evolution and Screening of Engineered Polypeptides Derived from SEQ ID NO: 2 for Improved Production of siRNA product (1)
The engineered polynucleotide (SEQ ID NO: 1) encoding the polypeptide with dsRNA ligase activity of SEQ ID NO: 2 was used to generate the engineered polypeptides of Table 7-
1. These polypeptides displayed improved dsRNA ligase activity under the desired conditions e.g., the improvement in the formation of either oligonucleotide products (2) or (3), or preferably both oligonucleotide products (2) and (3) that was produced in situ from the substrate oligonucleotides (6-7, 9-12) as compared to the starting polypeptide. Some polypeptides displayed improved product formation of either oligonucleotide product (2) or (3), or both oligonucleotide products (2) and (3) compared to the starting polypeptide as noted in Table 7-1. The sequences of oligonucleotides (2), (3), (6), (7) and (9-12) are provided in Table 1.
The engineered polypeptides, having the amino acid sequences of even-numbered sequence identifiers were generated from the “backbone” amino acid sequence of SEQ ID NO:
2, as described below together with the analytical method described in Table 6- 1. Directed evolution began with the polynucleotide set forth in SEQ ID NO: 1. Libraries of engineered polypeptides were generated using various well-known techniques (e.g., saturation mutagenesis, recombination of previously identified beneficial amino acid differences) and screened using HTP assay and analysis methods, described below, that measured the polypeptides’ ability to produce oligonucleotide products (2) and (3).
The enzyme assays were carried out in 96-well PCR plates, in 50 pL total reaction volume per well. The reactions contained 2.5 % (v/v) of undiluted dsRNA ligase lysate, prepared as described in Example 4, 100 pM (each) substrate oligonucleotides (6-7, 9-12), 50 mM Tris-buffer at pH 7.5, 1 mM ATP, 10 mM MgCh and 5 mM DTT. The reaction plates were heat-sealed and incubated in a thermocycler at 30 °C for 2 h.
After incubation the plates were subjected to a heat inactivation step (95 °C, 20 min) to quench the reaction and precipitate proteinaceous content of the added lysate. The plates were then centrifuged at 4,000 rpm for 5 min. Subsequently a 2 pL aliquot of the supernatant was removed from each well and added to a shallow well 96-well plate containing 98 pL of 5 mM EDTA solution (pH 7.0). The samples were analyzed via HPLC to determine the activity of the enzyme variants using the analytical method described in Table 6-1. Selected ligase variants showing greater product formation of oligonucleotides (2) and (3) relative to SEQ ID NO:2 are shown in Table 7-1.
The engineered dsRNA ligase polypeptides represented by the even numbered sequence identifiers of SEQ ID NOs: 4 to 106 comprise an even numbered sequence identifier of SEQ ID NOs: 304 to 406, respectively, and a 14 amino acid N-terminal purification tag (MHHHHHHENLYFQS (SEQ ID NO: 669)). For example, SEQ ID NO: 4 comprises: (i) the 14 amino acid N-terminal purification tag of SEQ ID NO: 669; and (ii) the dsRNA ligase polypeptide of SEQ ID NO: 304.
Throughout the Examples, the position of a given mutation is provided relative to SEQ ID NO: 2 which includes (i) the 14 amino acid N-terminal purification tag of SEQ ID NO: 669 and (ii) the wild-type dsRNA ligase polypeptide of SEQ ID NO: 302. The position of a given mutation relative to SEQ ID NO: 302 (i.e. the wild-type dsRNA ligase polypeptide without the purification tag) can be obtained by subtracting the 14 amino acid N-terminal purification tag from the SEQ ID NOs described in the Examples. For example, position X251 of SEQ ID NO: 2 corresponds to position X237 of SEQ ID NO: 302.
EXAMPLE 8
Round 2 Evolution and Screening of Engineered Polypeptides Derived from SEQ ID NO: 70 for Improved Production of siRNA product (1)
The polynucleotide from example 7 SEQ ID NO: 69 encoding the most active polypeptide with dsRNA ligase activity of SEQ ID NO: 70 was used to generate the engineered polypeptides of Table 8-1. These polypeptides displayed improved dsRNA ligase activity under the desired conditions e.g., the improvement in the formation of either oligonucleotide products (2) or (3), or preferably both oligonucleotide products (2) and (3) that was produced in situ from the substrates oligonucleotides (6-7, 9-12) as compared to the starting polypeptide. Some polypeptides displayed improved product formation of both oligonucleotide products (2) and (3) compared to the starting polypeptide are noted in Table 8-1. The engineered polypeptides, having the amino acid sequences of even-numbered sequence identifiers were generated from the “backbone” amino acid sequence of SEQ ID NO: 70, as described below together with the analytical method described in Table 6-1.
Directed evolution began with the polynucleotide set forth in SEQ ID NO: 69. Libraries of engineered polypeptides were generated using various well-known techniques (e.g., saturation mutagenesis, recombination of previously identified beneficial amino acid differences) and screened using HTP assay and analysis methods, described below, that measured the polypeptides’ ability to produce oligonucleotides (2) and (3).
The enzyme assays were carried out in 96-well PCR plates, in 50 pL total reaction volume per well. The reactions contained either 1.25 or 2.5 % (v/v) of undiluted dsRNA ligase lysate, prepared as described in Example 4, 100 pM (each) substrate oligonucleotides (6-7, 9- 12), 50 mM Tris-buffer at pH 7.5, 1 mM ATP, 10 mM MgCh and 5 mM DTT. The reaction plates were heat-sealed and incubated in a thermocycler at 30 °C for 2 h.
After incubation the plates were subjected to a heat inactivation step (95 °C, 20 min) to quench the reaction and precipitate proteinaceous content of the added lysate. The plates were then centrifuged at 4,000 rpm for 5 min. Subsequently a 2 pL aliquot of the supernatant was removed from each well and added to a shallow well 96-well plate containing 98 pL of 5 mM EDTA solution (pH 7.0). The samples were analyzed via HPLC to determine the activity of the enzyme variants using the analytical method described in Table 6-1. Selected ligase variants showing a faster product formation of oligonucleotides (2) and (3) relative to SEQ ID NO:70 are shown in Table 8- 1.
The engineered dsRNA ligase polypeptides represented by the even numbered sequence identifiers of SEQ ID NOs: 108 to 216 comprise an even numbered sequence identifier of SEQ ID NOs: 408 to 516, respectively, and a 14 amino acid N-terminal purification tag (MHHHHHHENLYFQS (SEQ ID NO: 669)). For example, SEQ ID NO: 108 comprises: (i) the 14 amino acid N-terminal purification tag of SEQ ID NO: 669; and (ii) the dsRNA ligase polypeptide of SEQ ID NO: 408.
EXAMPLE 9 Round 3 Evolution and Screening of Engineered Polypeptides Derived from SEQ ID NO: 188 for Improved Production of siRNA product (1)
The polynucleotide from example 8 SEQ ID NO: 187 encoding the most active polypeptide with dsRNA ligase activity of SEQ ID NO: 188 was used to generate the engineered polypeptides of Table 9-1. These polypeptides displayed improved dsRNA ligase activity under the desired conditions e.g., the improvement in the formation of either
oligonucleotide products (2) or (3), or preferably both oligonucleotide products (2) and (3) that was produced in situ from the substrates oligonucleotides (6-7, 9-12) as compared to the starting polypeptide. Some polypeptides displayed improved product formation of both oligonucleotide products (2) and (3) compared to the starting polypeptide are noted in Table 9- 1. The engineered polypeptides, having the amino acid sequences of even-numbered sequence identifiers were generated from the “backbone” amino acid sequence of SEQ ID NO: 188, as described below together with the analytical method described in Table 6-2.
Directed evolution began with the polynucleotide set forth in SEQ ID NO: 187. Libraries of engineered polypeptides were generated using various well-known techniques (e.g., saturation mutagenesis, recombination of previously identified beneficial amino acid differences) and screened using HTP assay and analysis methods, described below, that measured the polypeptides’ ability to produce oligonucleotides (2) and (3).
The enzyme assays were carried out in 96-well PCR plates, in 100 pL total reaction volume per well. The reactions contained 20 % (v/v) of undiluted dsRNA ligase lysate, prepared as described in Example 4, 1 mM (each) substrate oligonucleotides (6-7, 9-12), 50 mM Tris-buffer at pH 7.0, 10 mM ATP, 20 mM MgCh, 5 mM DTT and 10 % (v/v) DMSO. The reaction plates were heat-sealed and incubated in a thermocycler at 30 °C for 2 h.
After incubation the plates were subjected to a heat inactivation step (95 °C, 20 min) to quench the reaction and precipitate proteinaceous content of the added lysate. The plates were then centrifuged at 4,000 rpm for 5 min. Subsequently a 50 pL aliquot of the supernatant was removed from each well and added to a deep well 96-well plate containing 450 pL of 5 mM EDTA solution (pH 7.0). The samples were further diluted by transferring 50 pL of the diluted sample into a deep well 96-well plate containing 950 pL of 5 mM EDTA solution (pH 7.0). The samples were analyzed via HPLC to determine the activity of the enzyme variants using the analytical method described in Table 6-2. Selected ligase variants showing a faster product formation of oligonucleotides (2) and (3) relative to SEQ ID NO: 188 are shown in Table 9-1.
The engineered dsRNA ligase polypeptides represented by the even numbered sequence identifiers of SEQ ID NOs: 218 to 246 comprise an even numbered sequence identifier of SEQ ID NOs: 518 to 546, respectively, and a 14 amino acid N-terminal purification tag (MHHHHHHENLYFQS (SEQ ID NO: 669)). For example, SEQ ID NO: 218 comprises: (i) the 14 amino acid N-terminal purification tag of SEQ ID NO: 669; and (ii) the dsRNA ligase polypeptide of SEQ ID NO: 518.
EXAMPLE 10
Round 4 Evolution and Screening of Engineered Polypeptides Derived from SEQ ID NO: 226 for Improved Production of siRNA product (1)
The polynucleotide from example 9 SEQ ID NO: 225 encoding the most active polypeptide with dsRNA ligase activity of SEQ ID NO: 226 was used to generate the engineered polypeptides of Table 10-1. These polypeptides displayed improved dsRNA ligase activity under the desired conditions e.g., the improvement in the formation of either oligonucleotide products (2) or (3), or preferably both oligonucleotide products (2) and (3) that was produced in situ from the substrates oligonucleotides (6-7, 9-12) as compared to the starting polypeptide. Some polypeptides displayed improved product formation of both oligonucleotide products (2) and (3) compared to the starting polypeptide are noted in Table 10-1. The engineered polypeptides, having the amino acid sequences of even-numbered
sequence identifiers were generated from the “backbone” amino acid sequence of SEQ ID NO: 226, as described below together with the analytical method described in Table 6-2.
Directed evolution began with the polynucleotide set forth in SEQ ID NO: 225. Libraries of engineered polypeptides were generated using various well-known techniques (e.g., saturation mutagenesis, recombination of previously identified beneficial amino acid differences) and screened using HTP assay and analysis methods, described below, that measured the polypeptides’ ability to produce oligonucleotides (2) and (3).
The enzyme assays were carried out in 96-well PCR plates, in 100 pL total reaction volume per well. The reactions contained 2.5 % (v/v) of undiluted dsRNA ligase lysate, prepared as described in Example 4, 1 mM (each) substrate oligonucleotides (6-7, 9-12), 50 mM Tris-buffer at pH 7.0, 10 mM ATP, 20 mM MgCh, 5 mM DTT and 10 % (v/v) DMSO. The reaction plates were heat-sealed and incubated in a thermocycler at 30 °C for 24 h.
After incubation the plates were subjected to a heat inactivation step (95 °C, 20 min) to quench the reaction and precipitate proteinaceous content of the added lysate. The plates were then centrifuged at 4,000 rpm for 5 min. Subsequently a 50 pL aliquot of the supernatant was removed from each well and added to a deep well 96-well plate containing 450 pL of 5 mM EDTA solution (pH 7.0). The samples were further diluted by transferring 50 pL of the diluted sample into a deep well 96-well plate containing 950 pL of 5 mM EDTA solution (pH 7.0). The samples were analyzed via HPLC to determine the activity of the enzyme variants using the analytical method described in Table 6-2. Selected ligase variants showing a faster product formation of oligonucleotides (2) and (3) relative to SEQ ID NO: 226 are shown in Table 10- 1.
The engineered dsRNA ligase polypeptides represented by the even numbered sequence identifiers of SEQ ID NOs: 248 to 282 comprise an even numbered sequence identifier of SEQ ID NOs: 548 to 582, respectively, and a 14 amino acid N-terminal purification tag (MHHHHHHENLYFQS (SEQ ID NO: 669)). For example, SEQ ID NO: 248 comprises: (i) the 14 amino acid N-terminal purification tag of SEQ ID NO: 669; and (ii) the dsRNA ligase polypeptide of SEQ ID NO: 548.
EXAMPLE 11
Round 5 Evolution and Screening of Engineered Polypeptides Derived from SEQ ID NO: 278 for Improved Production of siRNA product (1)
The polynucleotide from example 10 SEQ ID NO: 277 encoding the most active polypeptide with dsRNA ligase activity of SEQ ID NO: 278 was used to generate the engineered polypeptides of Table 11-1. These polypeptides displayed improved dsRNA ligase activity under the desired conditions e.g., the improvement in the formation of either oligonucleotide products (2) or (3), or preferably both oligonucleotide products (2) and (3) that was produced in situ from the substrates oligonucleotides (6-7, 9-12) as compared to the starting polypeptide. Some polypeptides displayed improved product formation of both oligonucleotide products (2) and (3) compared to the starting polypeptide are noted in Table 11-1. The engineered polypeptides, having the amino acid sequences of even-numbered sequence identifiers were generated from the “backbone” amino acid sequence of SEQ ID NO: 278, as described below together with the analytical method described in Table 6-2.
Directed evolution began with the polynucleotide set forth in SEQ ID NO: 277. Libraries of engineered polypeptides were generated using various well-known techniques (e.g., saturation mutagenesis, recombination of previously identified beneficial amino acid differences) and screened using HTP assay and analysis methods, described below, that measured the polypeptides’ ability to produce oligonucleotides (2) and (3).
The enzyme assays were carried out in 96-well PCR plates, in 100 pL total reaction volume per well. The reactions contained 10 % (v/v) of undiluted dsRNA ligase lysate, prepared as described in Example 4, 5 mM (each) substrate oligonucleotides (6-7, 9-12), 50 mM Tris-buffer at pH 7.0, 30 mM ATP, 60 mM MgCh and 10 % (v/v) DMSO. The reaction plates were heat-sealed and incubated in a thermocycler at 30 °C for 24 h.
After incubation the plates were subjected to a heat inactivation step (95 °C, 20 min) to quench the reaction and precipitate proteinaceous content of the added lysate. The plates were then centrifuged at 4,000 rpm for 5 min. Subsequently a 50 pL aliquot of the supernatant was removed from each well and added to a deep well 96-well plate containing 950 pL of 5 mM EDTA solution (pH 7.0). The samples were further diluted by transferring 50 pL of the diluted sample into a deep well 96-well plate containing 450 pL of 5 mM EDTA solution (pH 7.0). The samples were diluted a third time by transferring 160 pL of the diluted sample into a deep well 96-well plate containing 640 pL of 5 mM EDTA solution (pH 7.0). The samples were diluted a final time by transferring 75 pL of the diluted sample into a shallow well 96-well plate containing 75 pL of 5 mM EDTA solution (pH 7.0). The samples were analyzed via HPLC to determine the activity of the enzyme variants using the analytical method described in Table 6-2. Selected ligase variants showing a faster product formation of oligonucleotides (2) and (3) relative to SEQ ID NO: 278 are shown in Table 11-1.
The engineered dsRNA ligase polypeptides represented by the even numbered sequence identifiers of SEQ ID NOs: 284 to 300 comprise an even numbered sequence identifier of SEQ ID NOs: 584 to 600, respectively, and a 14 amino acid N-terminal purification tag (MHHHHHHENLYFQS (SEQ ID NO: 669)). For example, SEQ ID NO: 284 comprises: (i) the 14 amino acid N-terminal purification tag of SEQ ID NO: 669; and (ii) the dsRNA ligase polypeptide of SEQ ID NO: 584.
EXAMPLE 12
Round 6 Evolution and Screening of Engineered Polypeptides Derived from SEQ ID NO: 288 for Improved Production of siRNA product (1) and improved thermal stability
The polynucleotide from example 11 SEQ ID NO: 287 encoding the most active polypeptide with dsRNA ligase activity of SEQ ID NO: 288 was used to generate the engineered polypeptides of Table 12-1. These polypeptides displayed improved dsRNA ligase activity under the desired conditions e.g., the improvement in the formation of either oligonucleotide products (2) or (3), or preferably both oligonucleotide products (2) and (3) that was produced in situ from the substrate oligonucleotides (6-7, 9-12) as compared to the starting polypeptide. Some polypeptides displayed improved product formation of both oligonucleotide products (2) and (3) compared to the starting polypeptide, as noted in Table 12- 1. Furthermore, some polypeptides displayed improved thermal stability, resulting in higher residual activity following incubation of the dsRNA ligase solution at 30 °C for 1 h prior to setting up the reaction (Table 12-2). The engineered polypeptides, having the amino acid sequences of even- numbered sequence identifiers were generated from the “backbone” amino acid sequence of SEQ ID NO: 288, as described below together with the analytical method described in Table 6-3.
Directed evolution began with the polynucleotide set forth in SEQ ID NO: 287. Libraries of engineered polypeptides were generated using various well-known techniques (e.g., saturation mutagenesis, recombination of previously identified beneficial amino acid differences) and screened using HTP assay and analysis methods, described below, that measured the polypeptides’ ability to produce oligonucleotides (2) and (3).
The enzyme assays were carried out in 96-well PCR plates, in 100 pL total reaction volume per well. The cells were lysed according to Example 4, however 100 mM MOPS-buffer at pH 7.2 was used in place of 50 mM Tris at pH 7.5. To provide a thermal challenge and to identify hits that were more thermostable, the cell lysates were either undiluted or diluted 1 : 1 in 100 mM MOPS buffer, pH 7.2 and incubated at 30 °C and 4 °C respectively for 1 h. The reactions contained a final dsRNA ligase concentration of 40 % (v/v) for lysate incubated at 30 °C and 20% (v/v) for lysate incubated at 4 °C. In addition, the ligation reactions contained 3 mM (each) substrate oligonucleotides (6-7, 9-12), 100 mM MOPS-buffer at pH 7.2, 30 mM ATP, 60 mM MgCh and 10 % (v/v) DMSO. The reaction plates were heat-sealed and incubated in a thermocycler at 30 °C for 24 h.
After incubation the plates were subjected to a heat inactivation step (95 °C, 20 min) to quench the reaction and precipitate proteinaceous content of the added lysate. The plates were then centrifuged at 4,000 rpm for 5 min. Subsequently a 50 pL aliquot of the supernatant was removed from each well and added to a deep well 96-well plate containing 950 pL of 5 mM EDTA solution (pH 7.0). The samples were further diluted by transferring 20 pL of the diluted sample into a deep well 96-well plate containing 180 pL of 5 mM EDTA solution (pH 7.0). The samples were diluted a third time by transferring 30 pL of the diluted sample into a deep well 96-well plate containing 150 pL of 5 mM EDTA solution (pH 7.0). The samples were analyzed via RF-MS to determine the activity of the enzyme variants using the analytical method described in Table 6-3. Selected ligase variants showing a faster product formation of oligonucleotides (2) and (3) relative to SEQ ID NO: 288 following pre-incubation at 4 °C are shown in Table 12-1. Selected ligase variants showing a faster product formation of oligonucleotides (2) and (3) relative to SEQ ID NO: 288 following pre-incubation at 30 °C are shown in Table 12-2.
The engineered dsRNA ligase polypeptides represented by the even numbered sequence identifiers of SEQ ID NOs: 602 to 634 comprise an even numbered sequence identifier of SEQ ID NOs: 636 to 668, respectively, and a 14 amino acid N-terminal purification tag (MHHHHHHENLYFQS (SEQ ID NO: 669)). For example, SEQ ID NO: 602 comprises: (i) the 14 amino acid N-terminal purification tag of SEQ ID NO: 669; and (ii) the dsRNA ligase polypeptide of SEQ ID NO: 636.
EXAMPLE 13
Comparison of the catalytic activity of the wildtype polypeptide SEQ ID NO: 2 and the engineered polypeptides SEQ ID NO: 288, SEQ ID NO: 290 and SEQ ID NO: 292
The polynucleotides SEQ ID NO: 1 encoding for the wild-type dsRNA ligase from Bacteriophage RB69, Uniprot ID: Q7Y4V8 with the SEQ ID NO: 2 and the engineered polynucleotides SEQ ID NO: 287, SEQ ID NO: 289 and SEQ ID NO: 291 encoding for the most improved variants from example 11 with polypeptide sequences SEQ ID NO: 288, SEQ ID NO: 290 and SEQ ID NO: 292, have been used for SFP production as described in example 5.
The catalytic activity to convert the substrates oligonucleotides (6-7, 9-12) to the desired siRNA product (1) was evaluated under two reaction conditions: Condition 1 (50 mM Tris-buffer at pH 7.5, 1 mM ATP, 5 mM MgCh and 5 mM DTT, containing either 0 g/L, 0.0020 g/L, 0.0039 g/L, 0.0078 g/L, 0.0156 g/L, 0.0313 g/L, 0.0625 g/L, 0.125 g/L, 0.25 g/L, 0.5 g/L, 1 g/L, or 2 g/L of SFP), and Condition 2 (50 mM Tris-buffer at pH 7.0, 30 mM ATP, 60 mM MgCh and 10 % (v/v) DMSO containing either 0 g/L, 0.0049 g/L, 0.0098 g/L, 0.0195 g/L, 0.0391 g/L, 0.0781 g/L, 0.1563 g/L, 0.3125 g/L, 0.625 g/L, 1.25 g/L, 2.5 g/L, or 5 g/L of SFP). The enzyme assays were carried out in 96-well PCR plates, in 100 pL total reaction volume per well; condition 1 in reaction plate 1 and condition 2 in reaction plate 2. Both reaction plates were heat-sealed and incubated in a thermocycler at 30 °C. Reaction plate 1 was incubated for 2 h and reaction plate 2 was incubated for 24 h.
Following incubation, the plates were subjected to a heat inactivation step (95 °C, 20 min) to quench the reaction and precipitate proteinaceous content of the added SFP. The plates were then centrifuged at 4,000 rpm for 5 min. A 50 pL aliquot of the supernatant from each well of each plate was removed and subsequently diluted in 50 mM EDTA solution (pH 7.0). Reaction plate 1 was diluted 40 x and reaction plate 2 was diluted 2 400 x. The samples were analyzed via HPLC to determine the activity of the enzyme variants using the analytical method described in Table 6-2.
Comparative data in Figures 2A and 2B show the relative peak area % of siRNA (1) present in the reaction samples assayed under conditions 1 and 2 respectively. Under both conditions 1 and 2, the polypeptides SEQ ID NO: 288, SEQ ID NO: 290 and SEQ ID NO: 292 exhibit improved dsRNA ligase activity over the wild-type polypeptide SEQ ID NO: 2.
EXAMPLE 14
Comparison of the catalytic activity of the wildtype polypeptide SEQ ID NO: 2 and the engineered polypeptides SEQ ID NOs: 288 and 632
The polynucleotides SEQ ID NO: 1 encoding for the wild-type dsRNA ligase from Bacteriophage RB69, Uniprot ID: Q7Y4V8 with the SEQ ID NO: 2 and the engineered polynucleotide SEQ ID NO: 287 and SEQ ID NO: 631 encoding for the most improved variant from example 11 and example 12 with polypeptide sequences SEQ ID NO: 288 and SEQ ID NO: 632 respectively, have been used for SFP production as described in example 5.
The catalytic activity to convert the substrate oligonucleotides (6-7, 9-12) to the desired siRNA product (1) and the thermostability of the two enzymes was evaluated by incubating a stock solution of the SFP for 4 h at either 4 or 37 °C prior to setting up the following ligation reaction: 6 mM (each) substrate oligonucleotides (6-7, 9-12), 100 mM MOPS-buffer at pH 7.2, 30 mM ATP, 60 mM MgCh and 10 % (v/v) DMSO. In addition, the ligation reactions contained, either 0 g/L, 0.156 g/L, 0.313 g/L, 0.625 g/L, 1.25 g/L, 2.5 g/L, 5 g/L, or 10 g/L of SFP. The enzyme assays were carried out in 96-well PCR plates, in 100 pL total reaction volume per well. The reaction plate was heat-sealed and incubated in a thermocycler at 30 °C for 24 h.
Following incubation, the plate was subjected to a heat inactivation step (95 °C, 20 min) to quench the reaction and precipitate proteinaceous content of the added SFP. The plate was then centrifuged at 4,000 rpm for 5 min. A 50 pL aliquot of the supernatant from each well of each plate was removed and subsequently diluted 400 x in 50 mM EDTA solution (pH 7.0). The samples were analyzed via HPLC to determine the activity of the enzyme variants using the analytical method described in Table 6-2.
Comparative data in Figure 3A shows the relative peak area % of siRNA (1) present in the reaction samples. Figure 3B shows the residual enzyme activity following pre -incubation of the SFP at 37 °C for 4 h, expressed relative to the ligation activity of the SFP pre-incubated at 4 °C for 4 h. Under all conditions, the polypeptides SEQ ID NO: 632, exhibits improved dsRNA ligase activity and thermostability over the wild-type polypeptide SEQ ID NO: 2 and engineered polypeptide SEQ ID NO: 288.
SUMMARY OF ENGINEERED dsRNA LIGASE POLYPEPTIDE SEQUENCES
Table 13 provides a summary of the nucleic acid and amino acid sequences of the wild-type and engineered dsRNA ligase sequences described herein. The purification tag used in the Examples and reference in table 13 is the N-terminal purification tag MHHHHHHENLYFQS (SEQ ID NO: 669).
Claims
1. An engineered double-stranded RNA (dsRNA) ligase polypeptide comprising an amino acid sequence having at least 85% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NOs: 666, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352,
354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388,
390, 392, 394, 396, 398, 400, 402, 404, 406, 408, 410, 412, 414, 416, 418, 420, 422, 424,
426, 428, 430, 432, 434, 436, 438, 440, 442, 444, 446, 448, 450, 452, 454, 456, 458, 460,
462, 464, 466, 468, 470, 472, 474, 476, 478, 480, 482, 484, 486, 488, 490, 492, 494, 496,
498, 500, 502, 504, 506, 508, 510, 512, 514, 516, 518, 520, 522, 524, 526, 528, 530, 532,
534, 536, 538, 540, 542, 544, 546, 548, 550, 552, 554, 556, 558, 560, 562, 564, 566, 568,
570, 572, 574, 576, 578, 580, 582, 584, 586, 588, 590, 592, 594, 596, 598, 600, 636, 638,
640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, and 668; wherein the engineered dsRNA ligase polypeptide:
(a) has dsRNA ligase activity; and
(b) does not the comprise the amino acid sequence of SEQ ID NO: 302.
2. The engineered dsRNA ligase polypeptide of claim 1, wherein the polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 666, 370, 488, 526, 578, 588, 590, and 592.
3. An engineered dsRNA ligase polypeptide comprising an amino acid sequence having at least 85% sequence identity to SEQ ID NO: 302, which produces at least 5% more oligonucleotide product than a dsRNA ligase polypeptide comprising the amino acid sequence of SEQ ID NO: 302 under the same ligation reaction conditions, wherein the engineered dsRNA ligase polypeptide does not the comprise the amino acid sequence of SEQ ID NO: 302.
4. The engineered dsRNA ligase polypeptide of claim 3, wherein the ligation reaction conditions include about 1 pM to about 10 mM oligonucleotide fragment, a source of ATP, about 5 mM to about 100 mM divalent cation, and about 0.5 g/L to about 10 g/L engineered dsRNA ligase polypeptide, pH of about 4.0 to about 8.0, and temperature of about 10 °C to about 50 °C.
The engineered dsRNA ligase of any preceding claim, wherein:
(a) the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X6, X7, X15, X19, X29, X36, X39, X44, X45, X46, X47, X49, X51, X53, X56, X57, X60, X63, X64, X66, X67, X87, X88, X89, X91, X92, X93, X103, X105, X107, X114, X122, X126, X129, X130, X131, X137, X144, X146, X158, X163, X173, X178, X185, X190, X196, X216, X218,
X221, X228, X230, X232, X235, X236, X237, X238, X239, X242, X243, X244,
X251, X252, X254, X255, X258, X269, X280, X284, X285, X293, X296, X301,
X303, X305, X313, X314, X325, and X328, wherein the numbering refers to SEQ ID
NO: 302; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X6 is G or E; X7 is Q; X15 is R, D or E; X19 is Q or D; X29 is N or L; X36 is V; X39 is A; X44 is V; X45 is V; X46 is Y; X47 is E; X49 is G; X51 is L; X53 is Y; X56 is R or A;
X57 is S; X60 is T, G or P; X63 is S, Q or G; X64 is R, T, Q, F, G, or M; X66 is F or W; X67 is N; X87 is T, P, K or absent; X88 is C; X89 is T; X91 is S; X92 is D; X93 is G, C, or A; X103 is V, C, Y, or T; X105 is V; X107 is R or T; XI 14 is N; X122 is W; X126 is G; X129 is N; X130 is R, S or Y; X131 is R; X137 is V or C; X144 is N; X146 is R; X158 is W; X163 is G; X173 is L; X178 is R; X185 is K; X190 is Q; X196 is S or C; X216 is L or R; X218 is N; X221 is I; X228 is R; X230 is T; X232 is R; X235 is A, T, or G; X236 is S, L, or F; X237 is S, Q, R, L or G; X238 is F; X239 is G or R; X242 is R or M; X243 is N, S, G, or M; X244 is G or K; X251 is D or L; X252 is V; X254 is K; X255 is C; X258 is V; X269 is L; X280 is W; X284 is A;
X285 is A; X293 is R; X296 is R; X301 is G, L, E, or F; X303 is Q; X305 is G; X313 is A; X314 is A or V; X325 is R; and X328 is R; wherein the numbering refers to SEQ ID NO: 302; and/or
(b) the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X15, X19, X36, X39, X53, X185, X218, X221, X237, X251, X255, and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase
polypeptide comprises one or more of the following amino acid residues: X15 is D or E; X19 is D; X36 is V; X39 is A; X53 is Y; X185 is K; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A; wherein the numbering refers to SEQ ID NO: 302; and/or
(c) the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X36, X39, X218 and X221, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X36 is V; X39 is A; X218 is N; and X221 is I; and/or
(d) the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X39, X218 and X221, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X39 is A; X218 is N; and X221 is I; and/or
(e) the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X39, X218, X221 and X255, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X39 is A; X218 is N; X221 is I; and X255 is C; and/or
(f) the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X39, X53, X218, X221, X237, X251, X255 and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein
the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X39 is A; X 53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A; and/or
(g) the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X15, X39, X53, X218, X221, X237, X251, X255 and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X15 E; X39 is A; X 53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A; and/or
(h) the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X19, X39, X53, X218, X221, X237, X251, X255 and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X19 is D; X39 is A; X 53 is Y; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A; and/or
(i) the amino acid sequence of the engineered dsRNA ligase polypeptide comprises an amino acid sequence that differs from the sequence of SEQ ID NO: 302 in one or more amino acid residues selected from: X15, X39, X53, X185, X218, X221, X237, X251, X255 and X285, wherein the numbering refers to SEQ ID NO: 302, and wherein the engineered dsRNA ligase polypeptide has dsRNA ligase activity; optionally wherein the amino acid sequence of the engineered dsRNA ligase polypeptide comprises one or more of the following amino acid residues: X15 is D; X39 is A; X 53 is Y; X185 is K; X218 is N; X221 is I; X237 is R; X251 is L; X255 is C; and X285 is A.
6. The engineered dsRNA ligase polypeptide of any of claims 1-5, wherein the engineered dsRNA ligase polypeptide comprises a purification tag; optionally wherein the engineered dsRNA ligase polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 632, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158,
160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194,
196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230,
232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266,
268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 602,
604, 606, 608, 610, 612, 614, 616, 618, 620, 622, 624, 626, 628, 630, and 634.
7. A polypeptide immobilized on a solid material by chemical bond or a physical adsorption method, wherein the polypeptide comprises an engineered dsRNA ligase polypeptide according to any one of claims 1-6.
8. A polynucleotide encoding the engineered dsRNA ligase polypeptide of any one of claims 1-6; optionally wherein:
(a) the polynucleotide comprises a nucleic acid sequence selected from SEQ ID NOs:
303, 305, 307, 309, 311, 313, 315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335,
337, 339, 341, 343, 345, 347, 349, 351, 353, 355, 357, 359, 361, 363, 365, 367, 369,
371, 373, 375, 377, 379, 381, 383, 385, 387, 389, 391, 393, 395, 397, 399, 401, 403,
405, 407, 409, 411, 413, 415, 417, 419, 421, 423, 425, 427, 429, 431, 433, 435, 437,
439, 441, 443, 445, 447, 449, 451, 453, 455, 457, 459, 461, 463, 465, 467, 469, 471,
473, 475, 477, 479, 481, 483, 485, 487, 489, 491, 493, 495, 497, 499, 501, 503, 505,
507, 509, 511, 513, 515, 517, 519, 521, 523, 525, 527, 529, 531, 533, 535, 537, 539,
541, 543, 545, 547, 549, 551, 553, 555, 557, 559, 561, 563, 565, 567, 569, 571, 573,
575, 577, 579, 581, 583, 585, 587, 589, 591, 593, 595, 597, 599, 635, 637, 639, 641,
643, 645, 647, 649, 651, 653, 655, 657, 659, 661, 663, 665, and 667; and/or
(b) the polynucleotide comprises a nucleic acid sequence selected from SEQ ID NOs: 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129,
131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163,
165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197,
199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231,
233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265,
267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299,
601, 603, 605, 607, 609, 611, 613, 615, 617, 619, 621, 623, 625, 627, 629, 631, and
633.
9. An expression vector comprising the polynucleotide according to claim 8; optionally wherein the expression vector comprises a plasmid, a cosmid, a bacteriophage or a viral vector.
10. A host cell comprising the polynucleotide of claim 8 or the expression vector of claim 9, optionally wherein the host cell is E. coli.
11. A method of preparing an engineered dsRNA ligase polypeptide, which comprises the steps of culturing the host cell according to claim 10 and obtaining an engineered dsRNA ligase polypeptide from the culture.
12. An engineered dsRNA ligase catalyst obtainable by culturing the host cells according to claim 10, or according to the method of claim 11, wherein said engineered dsRNA ligase catalyst comprises cells or culture fluid containing the engineered dsRNA ligase polypeptides, or an article processed therewith, wherein the article refers to an extract obtained from the culture of host cell, an isolated product obtained by isolating or purifying an engineered dsRNA ligase from the extract, or an immobilized product obtained by immobilizing host cell, an extract thereof, or isolated product of the extract.
13. A method of producing an oligonucleotide from two or more oligonucleotide fragments, wherein the method comprises contacting:
(i) two or more oligonucleotide fragments;
(ii) an engineered dsRNA ligase polypeptide according to any one of claims 1-6;
(iii) a source of ATP; and
(iv) a divalent cation; to obtain an oligonucleotide;
optionally wherein:
(a) the method further comprises a step of purifying the oligonucleotide; and/or
(b) the method is performed using a sub-stoichiometric concentration of AMP and/or ATP; and/or
(c) the method is performed with a divalent cation concentration of 5-100 mM, optionally 30-50 mM.
14. Use of the engineered dsRNA ligase polypeptide according to any one of claims 1-6 in the production of an oligonucleotide from two or more oligonucleotide fragments.
15. The method of claim 13 or the use of claim 14, wherein:
(a) the oligonucleotide is up to 60 nucleotides in length; and/or
(b) each of the oligonucleotide fragments are 4-16 nucleotides in length, optionally 6- 9 nucleotides in length; and/or
(c) one or more of the oligonucleotide fragment(s) comprises one or two overhangs; and/or
(d) one or more of the oligonucleotide fragments comprises a chemical modification; optionally wherein the chemical modification is selected from:
(i) a modified backbone, optionally selected from a phosphorothioate (e.g. chiral phosphorothioate) or methylphosphonate intemucleotide linkage;
(ii) a modified nucleotide, optionally selected from 2'-O-methyl (2’-OMe), 2'- flouro (2’-F), 2'-deoxy, 2'-deoxy-2’-fluoro, 2'-O-methoxyethyl (2'-0-M0E), 2'- O-aminopropyl (2'-O-AP), 2'-O-dimethylaminoethyl (2'-O-DMAOE), 2'-O- dimethylaminopropyl (2'-O-DMAP), 2'-O-dimethylaminoethyloxyethyl (2'-O- DMAEOE), 2'-O-N-methylacetamido (2'-0-NMA), locked nucleic acid (LNA), glycol nucleic acid (GNA), phosphoramidate (e.g. mesyl phosphoramidate), 2',3'-seco nucleotide mimic, 2'-F-arabino nucleotide, abasic nucleotide, 2'- amino modified nucleotide, 2'-alkyl-modified nucleotide, morpholino nucleotide, vinylphosphonate (e.g. 5’ vinylphosphonate), and cyclopropyl phosphonate deoxyribonucleotide; and/or
(iii) conjugation to a ligand, optionally wherein the ligand comprises one or more N-Acetylgalactosamine (GalNAc) derivatives.
16. A composition comprising: i. the engineered dsRNA ligase polypeptide according to any one of claims 1-6; ii. a source of ATP; and iii. a divalent cation; optionally wherein the composition further comprises two or more oligonucleotide fragments.
17. A kit comprising: i. the engineered dsRNA ligase polypeptide according to any one of claims 1-6; ii. a source of ATP; iii. a divalent cation; and iv. instructions for use in a method of producing an oligonucleotide from two or more oligonucleotide fragments.
18. The method of claim 13 or claim 15, the composition of claim 16 or the kit of claim
17, wherein:
(A) the source of ATP comprises ATP; and/or
(B) the source of ATP comprises:
(a) polyphosphate kinase (PPK);
(b) polyphosphate; and
(c) AMP and/or ATP; optionally wherein:
(i) the PPK is selected from PPK12 or ajPAP; and/or
(ii) the polyphosphate is a polyphosphate salt, optionally wherein the polyphosphate salt is sodium polyphosphate (Maddrell’s salt) or sodium hexametaphosphate (Graham’s salt); and/or
(C) the divalent cation cofactor is Mg2+ or Mn2+.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP22215201 | 2022-12-20 | ||
EP22215201.9 | 2022-12-20 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2024134502A1 true WO2024134502A1 (en) | 2024-06-27 |
Family
ID=84547245
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/IB2023/062949 WO2024134502A1 (en) | 2022-12-20 | 2023-12-19 | Engineered double-strand rna ligases and uses thereof |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2024134502A1 (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2005021749A1 (en) | 2003-08-28 | 2005-03-10 | Novartis Ag | Interfering rna duplex having blunt-ends and 3’-modifications |
US20060195947A1 (en) | 2003-08-11 | 2006-08-31 | Codexis, Inc. | Ketoreductase polypeptides and related polynucleotides |
WO2007128477A2 (en) | 2006-05-04 | 2007-11-15 | Novartis Ag | SHORT INTERFERING RIBONUCLEIC ACID (siRNA) FOR ORAL ADMINISTRATION |
EP3885434A1 (en) * | 2020-03-25 | 2021-09-29 | Ajinomoto Co., Inc. | Ligase mutant |
-
2023
- 2023-12-19 WO PCT/IB2023/062949 patent/WO2024134502A1/en unknown
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20060195947A1 (en) | 2003-08-11 | 2006-08-31 | Codexis, Inc. | Ketoreductase polypeptides and related polynucleotides |
WO2005021749A1 (en) | 2003-08-28 | 2005-03-10 | Novartis Ag | Interfering rna duplex having blunt-ends and 3’-modifications |
US8097716B2 (en) | 2003-08-28 | 2012-01-17 | Novartis Ag | Interfering RNA duplex having blunt-ends and 3′-modifications |
WO2007128477A2 (en) | 2006-05-04 | 2007-11-15 | Novartis Ag | SHORT INTERFERING RIBONUCLEIC ACID (siRNA) FOR ORAL ADMINISTRATION |
US8084600B2 (en) | 2006-05-04 | 2011-12-27 | Novartis Ag | Short interfering ribonucleic acid (siRNA) with improved pharmacological properties |
US8344128B2 (en) | 2006-05-04 | 2013-01-01 | Novartis Ag | Short interfering ribonucleic acid (siRNA) for oral administration |
EP3885434A1 (en) * | 2020-03-25 | 2021-09-29 | Ajinomoto Co., Inc. | Ligase mutant |
Non-Patent Citations (32)
Title |
---|
"Biocatalysis for the Pharmaceutical Industry: Discovery, Development, and Manufacturing", 2009, JOHN WILEY &SONS |
"Current Protocols in Molecular Biology", 1998, GREENE PUB. ASSOCIATES |
"Uniprot", Database accession no. Q7Y4V8 |
ALTSCHUL ET AL., J. MOL. BIOL., vol. 215, pages 403 - 410 |
ALTSCHUL ET AL., NUCLEIC ACIDS RES., 1977, pages 3389 - 3402 |
AOKI ET AL., CANCER GENE THERAPY, vol. 8, 2001, pages 783 - 787 |
BEAUCAGE ET AL., TET LETT, vol. 22, 1981, pages 1859 - 69 |
DESAI KEVIN K. ET AL: "A tRNA splicing operon: Archease endows RtcB with dual GTP/ATP cofactor specificity and accelerates RNA ligation", vol. 42, no. 6, 16 January 2014 (2014-01-16), GB, pages 3931 - 3942, XP093045932, ISSN: 0305-1048, Retrieved from the Internet <URL:https://academic.oup.com/nar/article-pdf/42/6/3931/45200473/nar_42_6_3931.pdf> DOI: 10.1093/nar/gkt1375 * |
ELBASHIR ET AL., EMBO J., vol. 20, 2001, pages 6877 - 6888 |
ELBASHIR ET AL., NATURE, vol. 411, 2001, pages 494 - 498 |
HALEMARHAM, THE HARPER COLLINS DICTIONARY OF BIOLOGY, 1991 |
HENIKOFFHENIKOFF, PROC NATL ACAD SCI USA, vol. 89, 1989, pages 10915 |
KESTEMONT, D. ET AL., CHEMICAL COMMUNICATIONS, vol. 54, 2018, pages 6408 - 6411 |
KESTEMONT, D.HERDEWIJN, P.RENDERS, M., CURR PROTOC CHEM BIOL, vol. 11, 2019, pages 62 |
KRAYNACK ET AL., RNA, vol. 12, 2006, pages 163 - 176 |
LAM ET AL., NATURE, vol. 354, 1991, pages 82 - 84 |
MANN, G. ET AL., TETRAHEDRON LETTERS, vol. 93, 2022, pages 153696 |
MANN, G.STANGER, F. V, CHIMIA (AARAU), vol. 74, 2020, pages 407 - 417 |
MATTHES ET AL., PEOPLE, 1984 |
MISHRA, M ET AL., CURRENT RESEARCH IN GREEN AND SUSTAINABLE CHEMISTRY, vol. 4, 2021 |
NANDAKUMAR, J.SHUMAN, S., MOLECULAR CELL, vol. 16, 2004, pages 211 - 221 |
NEEDLEMANWUNSCH, J. MOL. BIOL., vol. 48, 1970, pages 443 |
PEARSONLIPMAN, PROC. NATL. ACAD. SCI. USA, vol. 85, 1988, pages 2444 |
ROBERTS, T. C.LANGER, R.WOOD, M. J. A., NATURE REVIEWS DRUG DISCOVERY, vol. 19:10, no. 19, 2020, pages 673 - 694 |
SINGLETON ET AL.: "Dictionary of Microbiology and Molecular Biology", 1994 |
SMITHWATERMAN, ADV. APPL. MATH., vol. 2, 1981, pages 482 |
SOUTSCHEK ET AL., NATURE, vol. 432, 2004, pages 173 - 178 |
SUBBARAO ET AL., BIOCHEMISTRY, vol. 26, 1987, pages 2964 - 2972 |
TURK ET AL., BIOCHEM. BIOPHYS. ACTA, vol. 1559, 2002, pages 56 - 68 |
VOGEL ET AL., J. AM. CHEM. SOC., vol. 118, 1996, pages 1581 - 1586 |
YIN SHENMIN ET AL: "Structure-Function Analysis of T4 RNA Ligase 2", JOURNAL OF BIOLOGICAL CHEMISTRY, vol. 278, no. 20, 1 May 2003 (2003-05-01), US, pages 17601 - 17608, XP093045936, ISSN: 0021-9258, DOI: 10.1074/jbc.M300817200 * |
ZITZMANN ET AL., CANCER RES., vol. 62, 2002, pages 5139 - 43 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20230256001A1 (en) | Modified rna agents with reduced off-target effect | |
JP7062623B2 (en) | Ketohexokinase (KHK) iRNA composition and its usage | |
JP6947880B2 (en) | Hepatitis B virus (HBV) iRNA composition and its usage | |
JP7057390B2 (en) | Complement component iRNA composition and its usage | |
CN108368507B (en) | iRNA compositions of programmed cell death 1 ligand 1(PD-L1) and methods of use thereof | |
JP4981681B2 (en) | Composition and method for inducing immune response in mammals and method for avoiding immune response to oligonucleotide agents such as short interfering RNA | |
TWI727948B (en) | FACTOR XII (HAGEMAN FACTOR) (F12), KALLIKREIN B, PLASMA (FLETCHER FACTOR) 1 (KLKB1), AND KININOGEN 1 (KNG1) iRNA COMPOSITIONS AND METHODS OF USE THEREOF | |
KR20240010762A (en) | Modified double-stranded rna agents | |
KR20150021489A (en) | MODIFIED RNAi AGENTS | |
JP2014527401A (en) | Compositions and methods for inhibiting gene expression of hepatitis B virus | |
US20210388356A1 (en) | Modified double stranded oligonucleotide | |
CA3174068A1 (en) | Conjugated oligonucleotides for tissue specific delivery | |
US20080214489A1 (en) | Aptamer-mediated intracellular delivery of oligonucleotides | |
WO2024134502A1 (en) | Engineered double-strand rna ligases and uses thereof | |
WO2024134505A1 (en) | Nucleic acid ligation method | |
US20240191230A1 (en) | Conjugates of sirna and antisense oligonucleotides (sirnaso) and methods of use in gene silencing | |
JP5349323B2 (en) | Materials and methods for generating transcripts containing modified nucleotides | |
WO2024182578A1 (en) | Oligonucleotides for rna editing | |
Nainytė | Synthesis of modified oligonucleotides for prebiotic studies and as novel CoV-2 therapeutics | |
EP4363575A1 (en) | Methods and compositions for adar-mediated editing | |
EP4363574A1 (en) | Methods and compositions for adar-mediated editing | |
WO2022246023A1 (en) | Methods and compositions for adar-mediated editing | |
WO2024129710A1 (en) | Carbohydrate conjugates for the delivery of therapeutic oligonucleotides | |
CN117015601A (en) | Methods and compositions for ADAR-mediated SERPINA1 editing | |
CN116583602A (en) | G protein-coupled receptor 75 (GPR 75) iRNA compositions and methods of use thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 23836597 Country of ref document: EP Kind code of ref document: A1 |