WO2024218234A1 - Generation of multi-copy host cells - Google Patents
Generation of multi-copy host cells Download PDFInfo
- Publication number
- WO2024218234A1 WO2024218234A1 PCT/EP2024/060591 EP2024060591W WO2024218234A1 WO 2024218234 A1 WO2024218234 A1 WO 2024218234A1 EP 2024060591 W EP2024060591 W EP 2024060591W WO 2024218234 A1 WO2024218234 A1 WO 2024218234A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- lpp
- host cell
- protospacer
- sequences
- sequence
- Prior art date
Links
- 102000040430 polynucleotide Human genes 0.000 claims abstract description 515
- 108091033319 polynucleotide Proteins 0.000 claims abstract description 515
- 239000002157 polynucleotide Substances 0.000 claims abstract description 515
- 238000000034 method Methods 0.000 claims abstract description 196
- 229920001184 polypeptide Polymers 0.000 claims abstract description 157
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 157
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 157
- 210000004027 cell Anatomy 0.000 claims description 865
- 108020005004 Guide RNA Proteins 0.000 claims description 571
- 230000005782 double-strand break Effects 0.000 claims description 204
- 230000034431 double-strand break repair via homologous recombination Effects 0.000 claims description 175
- 239000012634 fragment Substances 0.000 claims description 132
- 108020004414 DNA Proteins 0.000 claims description 92
- 238000003780 insertion Methods 0.000 claims description 89
- 230000037431 insertion Effects 0.000 claims description 89
- 101710163270 Nuclease Proteins 0.000 claims description 48
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 43
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 claims description 43
- 230000002538 fungal effect Effects 0.000 claims description 29
- 239000002773 nucleotide Substances 0.000 claims description 24
- 125000003729 nucleotide group Chemical group 0.000 claims description 24
- 240000006439 Aspergillus oryzae Species 0.000 claims description 23
- 235000002247 Aspergillus oryzae Nutrition 0.000 claims description 23
- 241000499912 Trichoderma reesei Species 0.000 claims description 20
- 241000193830 Bacillus <bacterium> Species 0.000 claims description 19
- 238000004519 manufacturing process Methods 0.000 claims description 16
- 241000223218 Fusarium Species 0.000 claims description 14
- 241000223259 Trichoderma Species 0.000 claims description 14
- UHPMCKVQTMMPCG-UHFFFAOYSA-N 5,8-dihydroxy-2-methoxy-6-methyl-7-(2-oxopropyl)naphthalene-1,4-dione Chemical compound CC1=C(CC(C)=O)C(O)=C2C(=O)C(OC)=CC(=O)C2=C1O UHPMCKVQTMMPCG-UHFFFAOYSA-N 0.000 claims description 13
- 241000228212 Aspergillus Species 0.000 claims description 13
- 241000146399 Ceriporiopsis Species 0.000 claims description 12
- 241000221779 Fusarium sambucinum Species 0.000 claims description 12
- 244000063299 Bacillus subtilis Species 0.000 claims description 11
- 235000014469 Bacillus subtilis Nutrition 0.000 claims description 11
- 230000006780 non-homologous end joining Effects 0.000 claims description 11
- 241000228245 Aspergillus niger Species 0.000 claims description 9
- 241000235648 Pichia Species 0.000 claims description 9
- 241000123346 Chrysosporium Species 0.000 claims description 8
- 241000567163 Fusarium cerealis Species 0.000 claims description 8
- 241000146406 Fusarium heterosporum Species 0.000 claims description 8
- 230000007246 mechanism Effects 0.000 claims description 8
- 108091033380 Coding strand Proteins 0.000 claims description 7
- 241000351920 Aspergillus nidulans Species 0.000 claims description 6
- 241000194108 Bacillus licheniformis Species 0.000 claims description 6
- 241000894006 Bacteria Species 0.000 claims description 6
- 241000235403 Rhizomucor miehei Species 0.000 claims description 6
- 241000193744 Bacillus amyloliquefaciens Species 0.000 claims description 5
- 241001328122 Bacillus clausii Species 0.000 claims description 5
- 241000193388 Bacillus thuringiensis Species 0.000 claims description 5
- 241000223221 Fusarium oxysporum Species 0.000 claims description 5
- 241000567178 Fusarium venenatum Species 0.000 claims description 5
- 241000193385 Geobacillus stearothermophilus Species 0.000 claims description 5
- 241001480714 Humicola insolens Species 0.000 claims description 5
- 241000194017 Streptococcus Species 0.000 claims description 5
- 241000187747 Streptomyces Species 0.000 claims description 5
- 241000223258 Thermomyces lanuginosus Species 0.000 claims description 5
- 241001313536 Thermothelomyces thermophila Species 0.000 claims description 5
- 229940097012 bacillus thuringiensis Drugs 0.000 claims description 5
- 210000004962 mammalian cell Anatomy 0.000 claims description 5
- 241001019659 Acremonium <Plectosphaerellaceae> Species 0.000 claims description 4
- 241001513093 Aspergillus awamori Species 0.000 claims description 4
- 241000892910 Aspergillus foetidus Species 0.000 claims description 4
- 241001225321 Aspergillus fumigatus Species 0.000 claims description 4
- 241001480052 Aspergillus japonicus Species 0.000 claims description 4
- 241000223651 Aureobasidium Species 0.000 claims description 4
- 241000193752 Bacillus circulans Species 0.000 claims description 4
- 241000193749 Bacillus coagulans Species 0.000 claims description 4
- 241000193747 Bacillus firmus Species 0.000 claims description 4
- 241000193422 Bacillus lentus Species 0.000 claims description 4
- 241000194107 Bacillus megaterium Species 0.000 claims description 4
- 241000194103 Bacillus pumilus Species 0.000 claims description 4
- 241000222490 Bjerkandera Species 0.000 claims description 4
- 241000222478 Bjerkandera adusta Species 0.000 claims description 4
- 241000193764 Brevibacillus brevis Species 0.000 claims description 4
- 241000589876 Campylobacter Species 0.000 claims description 4
- 241000222120 Candida <Saccharomycetales> Species 0.000 claims description 4
- 241001466517 Ceriporiopsis aneirina Species 0.000 claims description 4
- 241001646018 Ceriporiopsis gilvescens Species 0.000 claims description 4
- 241001277875 Ceriporiopsis rivulosa Species 0.000 claims description 4
- 241000524302 Ceriporiopsis subrufa Species 0.000 claims description 4
- 241000985909 Chrysosporium keratinophilum Species 0.000 claims description 4
- 241001674013 Chrysosporium lucknowense Species 0.000 claims description 4
- 241001556045 Chrysosporium merdarium Species 0.000 claims description 4
- 241000080524 Chrysosporium queenslandicum Species 0.000 claims description 4
- 241001674001 Chrysosporium tropicum Species 0.000 claims description 4
- 241000355696 Chrysosporium zonatum Species 0.000 claims description 4
- 241000193403 Clostridium Species 0.000 claims description 4
- 241000222511 Coprinus Species 0.000 claims description 4
- 244000251987 Coprinus macrorhizus Species 0.000 claims description 4
- 235000001673 Coprinus macrorhizus Nutrition 0.000 claims description 4
- 241000222356 Coriolus Species 0.000 claims description 4
- 241001337994 Cryptococcus <scale insect> Species 0.000 claims description 4
- 241000194033 Enterococcus Species 0.000 claims description 4
- 241000589565 Flavobacterium Species 0.000 claims description 4
- 241000145614 Fusarium bactridioides Species 0.000 claims description 4
- 241000223194 Fusarium culmorum Species 0.000 claims description 4
- 241000223195 Fusarium graminearum Species 0.000 claims description 4
- 241001112697 Fusarium reticulatum Species 0.000 claims description 4
- 241001014439 Fusarium sarcochroum Species 0.000 claims description 4
- 241000223192 Fusarium sporotrichioides Species 0.000 claims description 4
- 241001465753 Fusarium torulosum Species 0.000 claims description 4
- 241000605909 Fusobacterium Species 0.000 claims description 4
- 241000146398 Gelatoporia subvermispora Species 0.000 claims description 4
- 241000626621 Geobacillus Species 0.000 claims description 4
- 241000589989 Helicobacter Species 0.000 claims description 4
- 241000223198 Humicola Species 0.000 claims description 4
- 241000235649 Kluyveromyces Species 0.000 claims description 4
- 241001138401 Kluyveromyces lactis Species 0.000 claims description 4
- 241000235087 Lachancea kluyveri Species 0.000 claims description 4
- 241000186660 Lactobacillus Species 0.000 claims description 4
- 241000194036 Lactococcus Species 0.000 claims description 4
- 241001344133 Magnaporthe Species 0.000 claims description 4
- 241000235395 Mucor Species 0.000 claims description 4
- 241000226677 Myceliophthora Species 0.000 claims description 4
- 241000588653 Neisseria Species 0.000 claims description 4
- 241000233892 Neocallimastix Species 0.000 claims description 4
- 241000221960 Neurospora Species 0.000 claims description 4
- 241000221961 Neurospora crassa Species 0.000 claims description 4
- 241001072230 Oceanobacillus Species 0.000 claims description 4
- 241001236817 Paecilomyces <Clavicipitaceae> Species 0.000 claims description 4
- 241000194109 Paenibacillus lautus Species 0.000 claims description 4
- 241000228143 Penicillium Species 0.000 claims description 4
- 241000222385 Phanerochaete Species 0.000 claims description 4
- 241000222393 Phanerochaete chrysosporium Species 0.000 claims description 4
- 241000222395 Phlebia Species 0.000 claims description 4
- 241000222397 Phlebia radiata Species 0.000 claims description 4
- 241000235379 Piromyces Species 0.000 claims description 4
- 241000222350 Pleurotus Species 0.000 claims description 4
- 244000252132 Pleurotus eryngii Species 0.000 claims description 4
- 235000001681 Pleurotus eryngii Nutrition 0.000 claims description 4
- 241000589516 Pseudomonas Species 0.000 claims description 4
- 241000959173 Rasamsonia emersonii Species 0.000 claims description 4
- 241000235070 Saccharomyces Species 0.000 claims description 4
- 235000003534 Saccharomyces carlsbergensis Nutrition 0.000 claims description 4
- 235000001006 Saccharomyces cerevisiae var diastaticus Nutrition 0.000 claims description 4
- 244000206963 Saccharomyces cerevisiae var. diastaticus Species 0.000 claims description 4
- 241000204893 Saccharomyces douglasii Species 0.000 claims description 4
- 241001407717 Saccharomyces norbensis Species 0.000 claims description 4
- 241001123227 Saccharomyces pastorianus Species 0.000 claims description 4
- 241000607142 Salmonella Species 0.000 claims description 4
- 241000222480 Schizophyllum Species 0.000 claims description 4
- 241000235346 Schizosaccharomyces Species 0.000 claims description 4
- 241000191940 Staphylococcus Species 0.000 claims description 4
- 241000264435 Streptococcus dysgalactiae subsp. equisimilis Species 0.000 claims description 4
- 241000194048 Streptococcus equi Species 0.000 claims description 4
- 241000193996 Streptococcus pyogenes Species 0.000 claims description 4
- 241000194054 Streptococcus uberis Species 0.000 claims description 4
- 241000958303 Streptomyces achromogenes Species 0.000 claims description 4
- 241001468227 Streptomyces avermitilis Species 0.000 claims description 4
- 241000187432 Streptomyces coelicolor Species 0.000 claims description 4
- 241000187392 Streptomyces griseus Species 0.000 claims description 4
- 241000187398 Streptomyces lividans Species 0.000 claims description 4
- 241000228341 Talaromyces Species 0.000 claims description 4
- 241001540751 Talaromyces ruber Species 0.000 claims description 4
- 241000228178 Thermoascus Species 0.000 claims description 4
- 241001494489 Thielavia Species 0.000 claims description 4
- 241001495429 Thielavia terrestris Species 0.000 claims description 4
- 241001149964 Tolypocladium Species 0.000 claims description 4
- 241000222354 Trametes Species 0.000 claims description 4
- 241000222357 Trametes hirsuta Species 0.000 claims description 4
- 241000222355 Trametes versicolor Species 0.000 claims description 4
- 241000217816 Trametes villosa Species 0.000 claims description 4
- 241000223260 Trichoderma harzianum Species 0.000 claims description 4
- 241000378866 Trichoderma koningii Species 0.000 claims description 4
- 241000223262 Trichoderma longibrachiatum Species 0.000 claims description 4
- 241000223261 Trichoderma viride Species 0.000 claims description 4
- 241000202898 Ureaplasma Species 0.000 claims description 4
- 241000409279 Xerochrysium dermatitidis Species 0.000 claims description 4
- 241000235013 Yarrowia Species 0.000 claims description 4
- 241000235015 Yarrowia lipolytica Species 0.000 claims description 4
- 229940091771 aspergillus fumigatus Drugs 0.000 claims description 4
- 229940054340 bacillus coagulans Drugs 0.000 claims description 4
- 229940005348 bacillus firmus Drugs 0.000 claims description 4
- 230000002950 deficient Effects 0.000 claims description 4
- 238000010362 genome editing Methods 0.000 claims description 4
- 229940039696 lactobacillus Drugs 0.000 claims description 4
- 229940115922 streptococcus uberis Drugs 0.000 claims description 4
- 210000005255 gram-positive cell Anatomy 0.000 claims description 3
- 210000000349 chromosome Anatomy 0.000 description 184
- 238000003752 polymerase chain reaction Methods 0.000 description 123
- 108090000623 proteins and genes Proteins 0.000 description 108
- 230000000295 complement effect Effects 0.000 description 105
- 239000003550 marker Substances 0.000 description 56
- 230000010354 integration Effects 0.000 description 52
- 239000013612 plasmid Substances 0.000 description 51
- 230000008439 repair process Effects 0.000 description 43
- 230000009466 transformation Effects 0.000 description 38
- 150000007523 nucleic acids Chemical group 0.000 description 34
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 28
- 108091026890 Coding region Proteins 0.000 description 28
- 238000005520 cutting process Methods 0.000 description 28
- 108010076504 Protein Sorting Signals Proteins 0.000 description 27
- 230000014509 gene expression Effects 0.000 description 27
- 102000039446 nucleic acids Human genes 0.000 description 27
- 108020004707 nucleic acids Proteins 0.000 description 27
- 102000004169 proteins and genes Human genes 0.000 description 26
- 239000013598 vector Substances 0.000 description 25
- 210000001938 protoplast Anatomy 0.000 description 24
- 238000003491 array Methods 0.000 description 23
- 235000018102 proteins Nutrition 0.000 description 23
- 102000004190 Enzymes Human genes 0.000 description 20
- 108090000790 Enzymes Proteins 0.000 description 20
- 229940088598 enzyme Drugs 0.000 description 20
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 18
- 108091033409 CRISPR Proteins 0.000 description 16
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 16
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 16
- 108091028043 Nucleic acid sequence Proteins 0.000 description 15
- 239000000243 solution Substances 0.000 description 14
- 229920001817 Agar Polymers 0.000 description 13
- 238000010354 CRISPR gene editing Methods 0.000 description 13
- 102100039087 Peptidyl-alpha-hydroxyglycine alpha-amidating lyase Human genes 0.000 description 13
- 102000040945 Transcription factor Human genes 0.000 description 13
- 108091023040 Transcription factor Proteins 0.000 description 13
- 239000008272 agar Substances 0.000 description 13
- 239000008367 deionised water Substances 0.000 description 13
- 229910021641 deionized water Inorganic materials 0.000 description 13
- 238000013518 transcription Methods 0.000 description 13
- 230000035897 transcription Effects 0.000 description 13
- 229930006000 Sucrose Natural products 0.000 description 12
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 12
- 230000002759 chromosomal effect Effects 0.000 description 12
- 238000001914 filtration Methods 0.000 description 12
- 239000000463 material Substances 0.000 description 12
- 239000005720 sucrose Substances 0.000 description 12
- 241000233866 Fungi Species 0.000 description 11
- 108020004999 messenger RNA Proteins 0.000 description 11
- 230000001105 regulatory effect Effects 0.000 description 11
- 108090000942 Lactalbumin Proteins 0.000 description 10
- 108090001060 Lipase Proteins 0.000 description 10
- 150000001413 amino acids Chemical group 0.000 description 10
- 238000005516 engineering process Methods 0.000 description 10
- 239000002609 medium Substances 0.000 description 10
- 239000004202 carbamide Substances 0.000 description 9
- 238000002474 experimental method Methods 0.000 description 9
- 239000013604 expression vector Substances 0.000 description 9
- 238000000855 fermentation Methods 0.000 description 9
- 230000004151 fermentation Effects 0.000 description 9
- 108700026220 vif Genes Proteins 0.000 description 9
- 108091079001 CRISPR RNA Proteins 0.000 description 8
- 102000004407 Lactalbumin Human genes 0.000 description 8
- 102000004882 Lipase Human genes 0.000 description 8
- 239000004367 Lipase Substances 0.000 description 8
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 8
- 230000004927 fusion Effects 0.000 description 8
- 230000006801 homologous recombination Effects 0.000 description 8
- 238000002744 homologous recombination Methods 0.000 description 8
- 235000019421 lipase Nutrition 0.000 description 8
- 239000000203 mixture Substances 0.000 description 8
- 238000006467 substitution reaction Methods 0.000 description 8
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 8
- 229940045145 uridine Drugs 0.000 description 8
- 235000021241 α-lactalbumin Nutrition 0.000 description 8
- 239000004382 Amylase Substances 0.000 description 7
- 108010065511 Amylases Proteins 0.000 description 7
- 102000013142 Amylases Human genes 0.000 description 7
- 235000019418 amylase Nutrition 0.000 description 7
- 238000013459 approach Methods 0.000 description 7
- 230000001580 bacterial effect Effects 0.000 description 7
- 230000006870 function Effects 0.000 description 7
- 101150054232 pyrG gene Proteins 0.000 description 7
- 230000010076 replication Effects 0.000 description 7
- 125000006850 spacer group Chemical group 0.000 description 7
- 230000008685 targeting Effects 0.000 description 7
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 6
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 6
- 108010064983 Ovomucin Proteins 0.000 description 6
- 239000002202 Polyethylene glycol Substances 0.000 description 6
- 230000002068 genetic effect Effects 0.000 description 6
- 230000037361 pathway Effects 0.000 description 6
- 101150065883 pdiA gene Proteins 0.000 description 6
- 230000008488 polyadenylation Effects 0.000 description 6
- 229920001223 polyethylene glycol Polymers 0.000 description 6
- 239000001965 potato dextrose agar Substances 0.000 description 6
- 238000002360 preparation method Methods 0.000 description 6
- 238000000746 purification Methods 0.000 description 6
- 239000000600 sorbitol Substances 0.000 description 6
- 230000014616 translation Effects 0.000 description 6
- 238000011144 upstream manufacturing Methods 0.000 description 6
- 108010059892 Cellulase Proteins 0.000 description 5
- 108010008885 Cellulose 1,4-beta-Cellobiosidase Proteins 0.000 description 5
- 241000588724 Escherichia coli Species 0.000 description 5
- 108010084455 Zeocin Proteins 0.000 description 5
- 238000003556 assay Methods 0.000 description 5
- 239000002299 complementary DNA Substances 0.000 description 5
- 238000013461 design Methods 0.000 description 5
- 108010038658 exo-1,4-beta-D-xylosidase Proteins 0.000 description 5
- 239000000499 gel Substances 0.000 description 5
- 238000012423 maintenance Methods 0.000 description 5
- CWCMIVBLVUHDHK-ZSNHEYEWSA-N phleomycin D1 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC[C@@H](N=1)C=1SC=C(N=1)C(=O)NCCCCNC(N)=N)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C CWCMIVBLVUHDHK-ZSNHEYEWSA-N 0.000 description 5
- 230000028327 secretion Effects 0.000 description 5
- 239000000725 suspension Substances 0.000 description 5
- 238000001890 transfection Methods 0.000 description 5
- 238000013519 translation Methods 0.000 description 5
- 101000757144 Aspergillus niger Glucoamylase Proteins 0.000 description 4
- 101150118523 LYS4 gene Proteins 0.000 description 4
- 108010063045 Lactoferrin Proteins 0.000 description 4
- 102000010445 Lactoferrin Human genes 0.000 description 4
- 102000004316 Oxidoreductases Human genes 0.000 description 4
- 108090000854 Oxidoreductases Proteins 0.000 description 4
- 102000004357 Transferases Human genes 0.000 description 4
- 108090000992 Transferases Proteins 0.000 description 4
- 101800005109 Triakontatetraneuropeptide Proteins 0.000 description 4
- 239000003146 anticoagulant agent Substances 0.000 description 4
- 101150008194 argB gene Proteins 0.000 description 4
- 230000027455 binding Effects 0.000 description 4
- 239000000872 buffer Substances 0.000 description 4
- -1 cell Chemical class 0.000 description 4
- 238000010276 construction Methods 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 108010021843 fluorescent protein 583 Proteins 0.000 description 4
- 108091006047 fluorescent proteins Proteins 0.000 description 4
- 102000034287 fluorescent proteins Human genes 0.000 description 4
- 210000004408 hybridoma Anatomy 0.000 description 4
- CSSYQJWUGATIHM-IKGCZBKSSA-N l-phenylalanyl-l-lysyl-l-cysteinyl-l-arginyl-l-arginyl-l-tryptophyl-l-glutaminyl-l-tryptophyl-l-arginyl-l-methionyl-l-lysyl-l-lysyl-l-leucylglycyl-l-alanyl-l-prolyl-l-seryl-l-isoleucyl-l-threonyl-l-cysteinyl-l-valyl-l-arginyl-l-arginyl-l-alanyl-l-phenylal Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CSSYQJWUGATIHM-IKGCZBKSSA-N 0.000 description 4
- 235000021242 lactoferrin Nutrition 0.000 description 4
- 229940078795 lactoferrin Drugs 0.000 description 4
- 229960000274 lysozyme Drugs 0.000 description 4
- 239000004325 lysozyme Substances 0.000 description 4
- 101150051471 metF gene Proteins 0.000 description 4
- 101150106833 metG gene Proteins 0.000 description 4
- 101150042623 metH gene Proteins 0.000 description 4
- 230000000813 microbial effect Effects 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 108010000416 ovomacroglobulin Proteins 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 150000003839 salts Chemical class 0.000 description 4
- 101150111766 sc gene Proteins 0.000 description 4
- 231100000331 toxic Toxicity 0.000 description 4
- 230000002588 toxic effect Effects 0.000 description 4
- 101150016309 trpC gene Proteins 0.000 description 4
- NMEHNETUFHBYEG-IHKSMFQHSA-N tttn Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N1[C@@H](CCC1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)O)[C@@H](C)O)C1=CC=CC=C1 NMEHNETUFHBYEG-IHKSMFQHSA-N 0.000 description 4
- 101100049989 Aspergillus niger xlnB gene Proteins 0.000 description 3
- 101000946377 Bos taurus Alpha-lactalbumin Proteins 0.000 description 3
- 108010022172 Chitinases Proteins 0.000 description 3
- 102000012286 Chitinases Human genes 0.000 description 3
- 108010053770 Deoxyribonucleases Proteins 0.000 description 3
- 102000016911 Deoxyribonucleases Human genes 0.000 description 3
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 3
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 3
- 101000952182 Homo sapiens Max-like protein X Proteins 0.000 description 3
- 108010029541 Laccase Proteins 0.000 description 3
- 102000003960 Ligases Human genes 0.000 description 3
- 108090000364 Ligases Proteins 0.000 description 3
- 102100037423 Max-like protein X Human genes 0.000 description 3
- 108010014251 Muramidase Proteins 0.000 description 3
- 102000016943 Muramidase Human genes 0.000 description 3
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 3
- 102000012288 Phosphopyruvate Hydratase Human genes 0.000 description 3
- 108010022181 Phosphopyruvate Hydratase Proteins 0.000 description 3
- 108091028113 Trans-activating crRNA Proteins 0.000 description 3
- 239000007983 Tris buffer Substances 0.000 description 3
- IXKSXJFAGXLQOQ-XISFHERQSA-N WHWLQLKPGQPMY Chemical compound C([C@@H](C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CNC=N1 IXKSXJFAGXLQOQ-XISFHERQSA-N 0.000 description 3
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 3
- 229960000723 ampicillin Drugs 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 108010051210 beta-Fructofuranosidase Proteins 0.000 description 3
- 108010047754 beta-Glucosidase Proteins 0.000 description 3
- 102000006995 beta-Glucosidase Human genes 0.000 description 3
- 229940041514 candida albicans extract Drugs 0.000 description 3
- 229940106157 cellulase Drugs 0.000 description 3
- 238000005119 centrifugation Methods 0.000 description 3
- 238000003776 cleavage reaction Methods 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- 239000008103 glucose Substances 0.000 description 3
- 239000001573 invertase Substances 0.000 description 3
- 235000011073 invertase Nutrition 0.000 description 3
- 235000010335 lysozyme Nutrition 0.000 description 3
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 3
- 235000019341 magnesium sulphate Nutrition 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 210000001236 prokaryotic cell Anatomy 0.000 description 3
- 238000003259 recombinant expression Methods 0.000 description 3
- 238000011084 recovery Methods 0.000 description 3
- 230000007017 scission Effects 0.000 description 3
- 239000003381 stabilizer Substances 0.000 description 3
- 230000002103 transcriptional effect Effects 0.000 description 3
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 3
- 101150041186 xyn2 gene Proteins 0.000 description 3
- 239000012138 yeast extract Substances 0.000 description 3
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- ZBMRKNMTMPPMMK-UHFFFAOYSA-N 2-amino-4-[hydroxy(methyl)phosphoryl]butanoic acid;azane Chemical compound [NH4+].CP(O)(=O)CCC(N)C([O-])=O ZBMRKNMTMPPMMK-UHFFFAOYSA-N 0.000 description 2
- GNFTZDOKVXKIBK-UHFFFAOYSA-N 3-(2-methoxyethoxy)benzohydrazide Chemical compound COCCOC1=CC=CC(C(=O)NN)=C1 GNFTZDOKVXKIBK-UHFFFAOYSA-N 0.000 description 2
- 101150033839 4 gene Proteins 0.000 description 2
- 108010011619 6-Phytase Proteins 0.000 description 2
- OJSXICLEROKMBP-FFUDWAICSA-N 869705-22-6 Chemical compound C([C@@H](C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(N)=O)C(C)C)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 OJSXICLEROKMBP-FFUDWAICSA-N 0.000 description 2
- 102100034044 All-trans-retinol dehydrogenase [NAD(+)] ADH1B Human genes 0.000 description 2
- 101710193111 All-trans-retinol dehydrogenase [NAD(+)] ADH4 Proteins 0.000 description 2
- 108090000915 Aminopeptidases Proteins 0.000 description 2
- 102000004400 Aminopeptidases Human genes 0.000 description 2
- 102000004580 Aspartic Acid Proteases Human genes 0.000 description 2
- 108010017640 Aspartic Acid Proteases Proteins 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 101100438857 Bacillus subtilis (strain 168) ccpA gene Proteins 0.000 description 2
- 108091005658 Basic proteases Proteins 0.000 description 2
- 102000007350 Bone Morphogenetic Proteins Human genes 0.000 description 2
- 108010007726 Bone Morphogenetic Proteins Proteins 0.000 description 2
- FGUUSXIOTUKUDN-IBGZPJMESA-N C1(=CC=CC=C1)N1C2=C(NC([C@H](C1)NC=1OC(=NN=1)C1=CC=CC=C1)=O)C=CC=C2 Chemical compound C1(=CC=CC=C1)N1C2=C(NC([C@H](C1)NC=1OC(=NN=1)C1=CC=CC=C1)=O)C=CC=C2 FGUUSXIOTUKUDN-IBGZPJMESA-N 0.000 description 2
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 2
- 108010006303 Carboxypeptidases Proteins 0.000 description 2
- 102000005367 Carboxypeptidases Human genes 0.000 description 2
- 108010076119 Caseins Proteins 0.000 description 2
- 102100035882 Catalase Human genes 0.000 description 2
- 108010053835 Catalase Proteins 0.000 description 2
- 108010031396 Catechol oxidase Proteins 0.000 description 2
- 102000030523 Catechol oxidase Human genes 0.000 description 2
- 208000037088 Chromosome Breakage Diseases 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 102000008186 Collagen Human genes 0.000 description 2
- 108010035532 Collagen Proteins 0.000 description 2
- 108010026206 Conalbumin Proteins 0.000 description 2
- 108010025880 Cyclomaltodextrin glucanotransferase Proteins 0.000 description 2
- 102000004127 Cytokines Human genes 0.000 description 2
- 108090000695 Cytokines Proteins 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- 238000007702 DNA assembly Methods 0.000 description 2
- 230000004568 DNA-binding Effects 0.000 description 2
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 2
- 101100070376 Dictyostelium discoideum alad gene Proteins 0.000 description 2
- 108010089072 Dolichyl-diphosphooligosaccharide-protein glycotransferase Proteins 0.000 description 2
- 101710121765 Endo-1,4-beta-xylanase Proteins 0.000 description 2
- 102000010911 Enzyme Precursors Human genes 0.000 description 2
- 108010062466 Enzyme Precursors Proteins 0.000 description 2
- 108090000371 Esterases Proteins 0.000 description 2
- 108091006020 Fc-tagged proteins Proteins 0.000 description 2
- 102000002464 Galactosidases Human genes 0.000 description 2
- 108010093031 Galactosidases Proteins 0.000 description 2
- 108060003306 Galactosyltransferase Proteins 0.000 description 2
- 102000030902 Galactosyltransferase Human genes 0.000 description 2
- 101100460773 Geobacillus stearothermophilus nprA gene Proteins 0.000 description 2
- 101100369308 Geobacillus stearothermophilus nprS gene Proteins 0.000 description 2
- 101100080316 Geobacillus stearothermophilus nprT gene Proteins 0.000 description 2
- 101800001586 Ghrelin Proteins 0.000 description 2
- 108010073178 Glucan 1,4-alpha-Glucosidase Proteins 0.000 description 2
- 102100022624 Glucoamylase Human genes 0.000 description 2
- 102000004366 Glucosidases Human genes 0.000 description 2
- 108010056771 Glucosidases Proteins 0.000 description 2
- 101000718525 Homo sapiens Alpha-galactosidase A Proteins 0.000 description 2
- 101000829367 Homo sapiens Src substrate cortactin Proteins 0.000 description 2
- 102000004157 Hydrolases Human genes 0.000 description 2
- 108090000604 Hydrolases Proteins 0.000 description 2
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 2
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 2
- 102100040018 Interferon alpha-2 Human genes 0.000 description 2
- 108010079944 Interferon-alpha2b Proteins 0.000 description 2
- 108010050904 Interferons Proteins 0.000 description 2
- 102000014150 Interferons Human genes 0.000 description 2
- 102000015696 Interleukins Human genes 0.000 description 2
- 108010063738 Interleukins Proteins 0.000 description 2
- 108090000769 Isomerases Proteins 0.000 description 2
- 102000004195 Isomerases Human genes 0.000 description 2
- 102000004317 Lyases Human genes 0.000 description 2
- 108090000856 Lyases Proteins 0.000 description 2
- 102100024295 Maltase-glucoamylase Human genes 0.000 description 2
- 108010054377 Mannosidases Proteins 0.000 description 2
- 102000001696 Mannosidases Human genes 0.000 description 2
- 108010006519 Molecular Chaperones Proteins 0.000 description 2
- 101800000590 Obestatin Proteins 0.000 description 2
- 102400000441 Obestatin Human genes 0.000 description 2
- 241000233654 Oomycetes Species 0.000 description 2
- 102000004264 Osteopontin Human genes 0.000 description 2
- 108010081689 Osteopontin Proteins 0.000 description 2
- 108010058846 Ovalbumin Proteins 0.000 description 2
- 101710091688 Patatin Proteins 0.000 description 2
- 101000865553 Pentadiplandra brazzeana Defensin-like protein Proteins 0.000 description 2
- 102000035195 Peptidases Human genes 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- 102000003992 Peroxidases Human genes 0.000 description 2
- 229920001030 Polyethylene Glycol 4000 Polymers 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- 108010083644 Ribonucleases Proteins 0.000 description 2
- 102000006382 Ribonucleases Human genes 0.000 description 2
- 102000003838 Sialyltransferases Human genes 0.000 description 2
- 108090000141 Sialyltransferases Proteins 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 102100023719 Src substrate cortactin Human genes 0.000 description 2
- 108091081024 Start codon Proteins 0.000 description 2
- 108060008539 Transglutaminase Proteins 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 230000003213 activating effect Effects 0.000 description 2
- 108090000637 alpha-Amylases Proteins 0.000 description 2
- 102000004139 alpha-Amylases Human genes 0.000 description 2
- 108010030291 alpha-Galactosidase Proteins 0.000 description 2
- 102000005840 alpha-Galactosidase Human genes 0.000 description 2
- 108010028144 alpha-Glucosidases Proteins 0.000 description 2
- 229940024171 alpha-amylase Drugs 0.000 description 2
- 229940127219 anticoagulant drug Drugs 0.000 description 2
- 108010005774 beta-Galactosidase Proteins 0.000 description 2
- 102000005936 beta-Galactosidase Human genes 0.000 description 2
- 239000003114 blood coagulation factor Substances 0.000 description 2
- 229960000182 blood factors Drugs 0.000 description 2
- 229940112869 bone morphogenetic protein Drugs 0.000 description 2
- 238000004422 calculation algorithm Methods 0.000 description 2
- 108010089934 carbohydrase Proteins 0.000 description 2
- 229910052799 carbon Inorganic materials 0.000 description 2
- 239000005018 casein Substances 0.000 description 2
- BECPQYXYKAMYBN-UHFFFAOYSA-N casein, tech. Chemical compound NCCCCC(C(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(CC(C)C)N=C(O)C(CCC(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(C(C)O)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(COP(O)(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(N)CC1=CC=CC=C1 BECPQYXYKAMYBN-UHFFFAOYSA-N 0.000 description 2
- 108010067454 caseinomacropeptide Proteins 0.000 description 2
- 235000021240 caseins Nutrition 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 2
- 210000002230 centromere Anatomy 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 229920001436 collagen Polymers 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 108010005400 cutinase Proteins 0.000 description 2
- 229940079593 drug Drugs 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 108010000165 exo-1,3-alpha-glucanase Proteins 0.000 description 2
- 238000007429 general method Methods 0.000 description 2
- 238000012268 genome sequencing Methods 0.000 description 2
- 102000006602 glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 2
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 2
- 230000013595 glycosylation Effects 0.000 description 2
- 238000006206 glycosylation reaction Methods 0.000 description 2
- 239000003102 growth factor Substances 0.000 description 2
- 150000003278 haem Chemical class 0.000 description 2
- 238000003306 harvesting Methods 0.000 description 2
- 229910001385 heavy metal Inorganic materials 0.000 description 2
- 101150112623 hemA gene Proteins 0.000 description 2
- 101150055960 hemB gene Proteins 0.000 description 2
- 239000005556 hormone Substances 0.000 description 2
- 229940088597 hormone Drugs 0.000 description 2
- 102000043404 human GLA Human genes 0.000 description 2
- 210000005260 human cell Anatomy 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 229940079322 interferon Drugs 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 108010009689 mannosyl-oligosaccharide 1,2-alpha-mannosidase Proteins 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 229910052751 metal Inorganic materials 0.000 description 2
- 239000002184 metal Substances 0.000 description 2
- 150000002739 metals Chemical class 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 235000015097 nutrients Nutrition 0.000 description 2
- 229940092253 ovalbumin Drugs 0.000 description 2
- 230000002351 pectolytic effect Effects 0.000 description 2
- 229940072417 peroxidase Drugs 0.000 description 2
- 108040007629 peroxidase activity proteins Proteins 0.000 description 2
- JTJMJGYZQZDUJJ-UHFFFAOYSA-N phencyclidine Chemical class C1CCCCN1C1(C=2C=CC=CC=2)CCCCC1 JTJMJGYZQZDUJJ-UHFFFAOYSA-N 0.000 description 2
- 229940085127 phytase Drugs 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 229940023143 protein vaccine Drugs 0.000 description 2
- 230000006798 recombination Effects 0.000 description 2
- 238000005215 recombination Methods 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 230000003248 secreting effect Effects 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 230000007480 spreading Effects 0.000 description 2
- 238000003892 spreading Methods 0.000 description 2
- 230000009469 supplementation Effects 0.000 description 2
- 230000001225 therapeutic effect Effects 0.000 description 2
- 230000002537 thrombolytic effect Effects 0.000 description 2
- 238000010361 transduction Methods 0.000 description 2
- 230000026683 transduction Effects 0.000 description 2
- 102000003601 transglutaminase Human genes 0.000 description 2
- 239000012137 tryptone Substances 0.000 description 2
- 229960005486 vaccine Drugs 0.000 description 2
- 230000009105 vegetative growth Effects 0.000 description 2
- 229910001868 water Inorganic materials 0.000 description 2
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 1
- 101150090724 3 gene Proteins 0.000 description 1
- OSJPPGNTCRNQQC-UWTATZPHSA-N 3-phospho-D-glyceric acid Chemical compound OC(=O)[C@H](O)COP(O)(O)=O OSJPPGNTCRNQQC-UWTATZPHSA-N 0.000 description 1
- 241000589158 Agrobacterium Species 0.000 description 1
- 102000007698 Alcohol dehydrogenase Human genes 0.000 description 1
- 108010021809 Alcohol dehydrogenase Proteins 0.000 description 1
- 108010037870 Anthranilate Synthase Proteins 0.000 description 1
- 241000235349 Ascomycota Species 0.000 description 1
- 101000690713 Aspergillus niger Alpha-glucosidase Proteins 0.000 description 1
- 101900127796 Aspergillus oryzae Glucoamylase Proteins 0.000 description 1
- 108090000145 Bacillolysin Proteins 0.000 description 1
- 101000695691 Bacillus licheniformis Beta-lactamase Proteins 0.000 description 1
- 108010029675 Bacillus licheniformis alpha-amylase Proteins 0.000 description 1
- 108010045681 Bacillus stearothermophilus neutral protease Proteins 0.000 description 1
- 241000221198 Basidiomycota Species 0.000 description 1
- 101710130006 Beta-glucanase Proteins 0.000 description 1
- 108010084185 Cellulases Proteins 0.000 description 1
- 102000005575 Cellulases Human genes 0.000 description 1
- 229920002101 Chitin Polymers 0.000 description 1
- 229920001661 Chitosan Polymers 0.000 description 1
- 241000233652 Chytridiomycota Species 0.000 description 1
- 108020004638 Circular DNA Proteins 0.000 description 1
- 241000699800 Cricetinae Species 0.000 description 1
- 102000018832 Cytochromes Human genes 0.000 description 1
- 108010052832 Cytochromes Proteins 0.000 description 1
- 230000004544 DNA amplification Effects 0.000 description 1
- 230000033616 DNA repair Effects 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 101100108088 Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) acvA gene Proteins 0.000 description 1
- 101100287044 Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) ipnA gene Proteins 0.000 description 1
- 241000588722 Escherichia Species 0.000 description 1
- 108050001049 Extracellular proteins Proteins 0.000 description 1
- 229910005390 FeSO4-7H2O Inorganic materials 0.000 description 1
- 229910005444 FeSO4—7H2O Inorganic materials 0.000 description 1
- 241000192125 Firmicutes Species 0.000 description 1
- 101150094690 GAL1 gene Proteins 0.000 description 1
- 108091092584 GDNA Proteins 0.000 description 1
- 102100028501 Galanin peptides Human genes 0.000 description 1
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 1
- 208000031448 Genomic Instability Diseases 0.000 description 1
- 229920001503 Glucan Polymers 0.000 description 1
- 101000882901 Homo sapiens Claudin-2 Proteins 0.000 description 1
- 101100121078 Homo sapiens GAL gene Proteins 0.000 description 1
- 101000821096 Homo sapiens Synapsin-2 Proteins 0.000 description 1
- 101001035458 Humicola insolens Endoglucanase-5 Proteins 0.000 description 1
- 239000007836 KH2PO4 Substances 0.000 description 1
- 102100027612 Kallikrein-11 Human genes 0.000 description 1
- 241001099157 Komagataella Species 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- 241001099156 Komagataella phaffii Species 0.000 description 1
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 1
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 1
- 229920000057 Mannan Polymers 0.000 description 1
- 108090000157 Metallothionein Proteins 0.000 description 1
- 229910017234 MnSO4 H2O Inorganic materials 0.000 description 1
- 229910017237 MnSO4-H2O Inorganic materials 0.000 description 1
- 229910017228 MnSO4—H2O Inorganic materials 0.000 description 1
- 229910002651 NO3 Inorganic materials 0.000 description 1
- NHNBFGGVMKEFGY-UHFFFAOYSA-N Nitrate Chemical compound [O-][N+]([O-])=O NHNBFGGVMKEFGY-UHFFFAOYSA-N 0.000 description 1
- 108091005461 Nucleic proteins Chemical group 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 206010034133 Pathogen resistance Diseases 0.000 description 1
- 229930182555 Penicillin Natural products 0.000 description 1
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 1
- 239000001888 Peptone Substances 0.000 description 1
- 108010080698 Peptones Proteins 0.000 description 1
- 241000425347 Phyla <beetle> Species 0.000 description 1
- 102000001253 Protein Kinase Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 102000018120 Recombinases Human genes 0.000 description 1
- 108010091086 Recombinases Proteins 0.000 description 1
- 241000235402 Rhizomucor Species 0.000 description 1
- 241000235343 Saccharomycetales Species 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- 101001010097 Shigella phage SfV Bactoprenol-linked glucose translocase Proteins 0.000 description 1
- 108090000787 Subtilisin Proteins 0.000 description 1
- 102100021994 Synapsin-2 Human genes 0.000 description 1
- 108010022394 Threonine synthase Proteins 0.000 description 1
- 102000005924 Triose-Phosphate Isomerase Human genes 0.000 description 1
- 108700015934 Triose-phosphate isomerases Proteins 0.000 description 1
- 101710152431 Trypsin-like protease Proteins 0.000 description 1
- 241000758405 Zoopagomycotina Species 0.000 description 1
- 101150116772 aatA gene Proteins 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 238000013019 agitation Methods 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 101150009206 aprE gene Proteins 0.000 description 1
- 210000004507 artificial chromosome Anatomy 0.000 description 1
- 101150005925 aspC gene Proteins 0.000 description 1
- 101150103518 bar gene Proteins 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 239000003139 biocide Substances 0.000 description 1
- 238000010170 biological method Methods 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 230000034303 cell budding Effects 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 230000010307 cell transformation Effects 0.000 description 1
- 230000003833 cell viability Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 238000007385 chemical modification Methods 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- ARUVKPQLZAKDPS-UHFFFAOYSA-L copper(II) sulfate Chemical compound [Cu+2].[O-][S+2]([O-])([O-])[O-] ARUVKPQLZAKDPS-UHFFFAOYSA-L 0.000 description 1
- 229910000366 copper(II) sulfate Inorganic materials 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000000432 density-gradient centrifugation Methods 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 238000011118 depth filtration Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 238000007865 diluting Methods 0.000 description 1
- 230000008034 disappearance Effects 0.000 description 1
- 238000011143 downstream manufacturing Methods 0.000 description 1
- 238000011304 droplet digital PCR Methods 0.000 description 1
- 238000007876 drug discovery Methods 0.000 description 1
- 108010091371 endoglucanase 1 Proteins 0.000 description 1
- 238000001952 enzyme assay Methods 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 238000001704 evaporation Methods 0.000 description 1
- 230000008020 evaporation Effects 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 239000004744 fabric Substances 0.000 description 1
- 239000007850 fluorescent dye Substances 0.000 description 1
- 230000008014 freezing Effects 0.000 description 1
- 238000007710 freezing Methods 0.000 description 1
- 108020001507 fusion proteins Proteins 0.000 description 1
- 102000037865 fusion proteins Human genes 0.000 description 1
- 238000012224 gene deletion Methods 0.000 description 1
- 230000030279 gene silencing Effects 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 108010061330 glucan 1,4-alpha-maltohydrolase Proteins 0.000 description 1
- 150000004676 glycans Chemical class 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 239000003999 initiator Substances 0.000 description 1
- 230000017730 intein-mediated protein splicing Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- ISPYRSDWRDQNSW-UHFFFAOYSA-L manganese(II) sulfate monohydrate Chemical compound O.[Mn+2].[O-]S([O-])(=O)=O ISPYRSDWRDQNSW-UHFFFAOYSA-L 0.000 description 1
- 238000005374 membrane filtration Methods 0.000 description 1
- 229960000485 methotrexate Drugs 0.000 description 1
- 238000001471 micro-filtration Methods 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 1
- 235000019796 monopotassium phosphate Nutrition 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 101150105920 npr gene Proteins 0.000 description 1
- 101150017837 nprM gene Proteins 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 238000012261 overproduction Methods 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 229940049954 penicillin Drugs 0.000 description 1
- 235000019319 peptone Nutrition 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 230000001124 posttranscriptional effect Effects 0.000 description 1
- GNSKLFRGEWLPPA-UHFFFAOYSA-M potassium dihydrogen phosphate Chemical compound [K+].OP(O)([O-])=O GNSKLFRGEWLPPA-UHFFFAOYSA-M 0.000 description 1
- 230000001376 precipitating effect Effects 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 108020003519 protein disulfide isomerase Proteins 0.000 description 1
- 108060006633 protein kinase Proteins 0.000 description 1
- 101150023446 prpA gene Proteins 0.000 description 1
- 101150108007 prs gene Proteins 0.000 description 1
- 101150086435 prs1 gene Proteins 0.000 description 1
- 101150070305 prsA gene Proteins 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 230000007115 recruitment Effects 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 230000008263 repair mechanism Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 108020004418 ribosomal RNA Proteins 0.000 description 1
- 239000000523 sample Substances 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 238000001694 spray drying Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 238000004659 sterilization and disinfection Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 238000000844 transformation Methods 0.000 description 1
- 230000009261 transgenic effect Effects 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 238000010977 unit operation Methods 0.000 description 1
- 230000035899 viability Effects 0.000 description 1
- 101150084346 wA gene Proteins 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
- 239000007222 ypd medium Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/87—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
- C12N15/90—Stable introduction of foreign DNA into chromosome
- C12N15/902—Stable introduction of foreign DNA into chromosome using homologous recombination
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases RNAses, DNAses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/20—Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
Definitions
- the present invention relates to host cells comprising in its genome at least two landing pad polynucleotides (LPP), methods of generating multi-copy host cells, and methods of producing a polypeptide of interest with said host cells.
- LPP landing pad polynucleotides
- Recombinant host cells are widely used for recombinant protein production.
- productivity of the applied cell systems i.e., the production of total protein per fermentation unit
- yield increases have been achieved through mutagenesis and screening for increased production of proteins of interest.
- this approach is mainly only useful for the overproduction of endogenous proteins in isolates containing the enzymes of interest. Therefore, for each new protein or enzyme product, a lengthy strain and process development program is required to achieve improved productivities.
- Multi-copy host cells can for example be generated using the flippase-based FLP-FRT system disclosed in WO16145084 (Novozymes A/S).
- FLP-FRT system disclosed in WO16145084 (Novozymes A/S).
- known disadvantages of the FLP-FRT system are for example (i) the occurrence of unwanted recombination between FRT sites, (ii) the requirement of high DNA loads during transformation, and (iii) the requirement for marker genes in all of the landing pads.
- the object of the present invention is to provide a modified host cell and a method for generating multi-copy host cells.
- the present invention is based on the surprising and inventive finding that host cells comprising multiple landing pads, each landing pad comprising homology arms and multiple protospacer sequences, can be efficiently used for targeted integration of multiple GOI copies.
- Said host cells can be used to generate multi-copy host cells via the method of the invention: synthetic protospacer array defined entry (SPADE).
- SPADE synthetic protospacer array defined entry
- a further advantage of the SPADE system over prior art systems, e.g., the FLP-FRT system, is that the load of DNA can be reduced during transformation - which is an important advantage when toxic DNA/proteins are transformed.
- GOI sequences previously integrated into landing pads can be “re-used” and integrated into “empty” landing pads to further increase the copy number of the host cell.
- the amount of DNA per transfection can be reduced to reduce DNA and/or protein toxicity for the host cell (and thereby maintaining desired host cell viabilities), whilst not compromising integration efficiency or GOI copy number.
- the SPADE system does not require a recombinase sequence present close (e.g., placed in cis) to the landing pads as is the case for the FLP/FRT system.
- the SPADE system allows to maintain some landing pads as “empty” landing pads, while other landing pads harbor the gene of interest. This further contributes to a versatile host cell generation platform and e.g., allows later integration of further (same or different) genes of interest into the genome if desired.
- the different landing pads can be used for integration of different GOIs, to obtain strains having varying ratios of two or more GOIs (see Example 8 and Example 12, respectively).
- the SPADE system allows to introduce linear PCR products comprising the different features of the system (e.g., GOI, 3’ homology arm, and 5’ homology arm), see e.g., Example 11 . By doing so, no full plasmid constructs need to be cloned which results in a highly time-efficient strain construction.
- the present invention provides host cells comprising in their genome at least two landing pad polynucleotide (LPP) sequences, each LPP sequence comprising in 5’ to 3’ direction: a 5’ homology arm, two or more protospacer sequences, and a 3’ homology arm, wherein the polynucleotide sequence of the 5’ homology arm is identical for at least two LPP sequences, and wherein the polynucleotide sequence of the 3’ homology arm is identical for at least two LPP sequences.
- LPP landing pad polynucleotide
- the host cells of the first aspect have the advantage of being versatile starting points for generating multi-copy host cells, i.e. the cells can be used to integrated genes of interest into one, two or more, or into all LPPs of the host cell.
- the location and number of the LPPs in the host cell genome can be designed depending on the host cell type and the gene of interest which shall be integrated therein.
- the present invention provides a method for generating a multi-copy host cell, comprising the steps of a) providing a host cell according to the first aspect, b) delivering to the host cell a gene editing system comprising:
- a first donor polynucleotide comprising in 5’ to 3’ direction: a 5’ homology arm, a first polynucleotide encoding a first polypeptide of interest, and a 3’ homology arm, OR a set of first PCR fragments comprising a first 5’ homology arm, a first 3’ homology arm, and a first polynucleotide encoding a first polypetide of interest, wherein the 3’ homology arm and/or first 3’ homology arm is identical to the 3’ homology arm of one or more LPP, and wherein the 5’ homology arm and/or first 5’ homology arm is identical to the 5’ homology arm of one or more LPP, and c) cultivating the host cell at a first temperature allowing one or more first guide RNA of the at least one guide RNA to form a complex with the nuclease at one or more protospacer of one or more LPP, and wherein the complex promotes the homology directed repair by creating a
- the present invention provides a method of producing a polypeptide of interest, comprising cultivating a multi-copy host cell under conditions conducive for production of the polypeptide, which multi-copy host cell is generated by the method according to the second aspect.
- Figure 1 shows a host cell of the invention comprising 4 landing pad polynucleotides (LPPs).
- LPPs landing pad polynucleotides
- Figure 2a shows the RNA-guided cutting of three LPPs in a host cell of the invention.
- Figure 2b shows homology-directed repair of the generated double strand breaks in a host cell of the invention.
- Figure 2c shows a host cell harboring three copies of a GOI in three LPPs.
- FIG. 3 shows a detailed overview of the 4 LPPs including their PAM sequences and protospacers (homology arms are not fully represented), and the LPP either being present on the coding strand or on the template strand.
- Figure 4 shows an example of a set of PCR fragments used for transformation into a host cell with a SPADE LPP. Definitions
- LPP Landing pad polynucleotide
- the term ”LPP or landing pad polynucleotide, or landing pad means a DNA sequence which comprises a 5’ homology arm, two or more protospacer sequences, and a 3’ homology arm.
- the LPP is inserted into the host cell genome on the desired chromosome and/or genome location.
- the LPP is then targeted with a nuclease and a gRNA at a protospacer of the LPP which results in a double-strand break at or near the targeted protospacer.
- Homology-directed repair takes place with a donor polynucleotide and/or PCR fragments.
- the donor polynucleotide or PCR fragments carry a gene of interest, and comprise homology arms being identical to the 5’ and 3’ homology arms of the targeted LPP.
- the repair results in integration of the donor polynucleotide and/or PCR fragments in the targeted LPP.
- the homology-directed repair is not limited to the presence of the donor polynucleotide and/or PCR fragments, but may instead/additionally take place with an already integrated gene of interest in a LPP having identical homology arms to the homology arms of the targeted LPP.
- the homology arms may have a length of at least 10 nt, at least 20 nt, at least 30 nt, at least 50 nt, at least 100 nt, 200 nucleotides (nt), at least 300 nt, at least 400 nt, at least 500 nt, at least 600 nt, at least 700 nt, at least 800 nt, at least 900 nt, or at least 1000 nt.
- the 5’ arm is located upstream of the two or more protospacers, and the 3’ arm is located downstream of the two or more protospacers.
- the 5’ arm is located upstream of the first polynucleotide encoding the first polypeptide of interest, and the 3’ arm is located downstream of the first polynucleotide encoding the first polypeptide of interest.
- Protospacer means a DNA sequence located within the 5’ arm and the 3’ arm of an LPP.
- the protospacer is integrated into the host cell genome and is complementary to the spacer sequence of a guide RNA, which guide RNA can be used to create a doublestrand cut at or near the protospacer sequence.
- the protospacer may have a length of bewteen 14 - 36 nucleotides.
- DNA-guided nucleases may also require a PAM sequence present near the protospacer sequence.
- the PAM sequence (protospacer adjacent motif) is located directly downstream of the target sequence in the genomic DNA, on the non-target strand, i.e. on the strand opposite to the strand which comprises the protospacer sequence.
- first protospacer means a protospacer which is present in all LPP of the host cell.
- second protospacer means a protospacer which is unique for only one LPP of the at least two LPPs in the host cell genome.
- Third protospacer means a protospacer which is present in a third subset of the at least two LPPs.
- Third subset of the at least two LPPs means a subset of LPPs comprising a third protospacer which is present in two or more LPPs, and/or is present in all LPPs of the host cell.
- fourth protospacer means a protospacer which is present in a fourth subset of the at least two LPPs.
- the fourth subset of LPPs means a subset of LPPs comprising a fourth protospacer which is present in two or more LPPs, and/or is present in all LPPs of the host cell.
- the fourth subset consists of a number of LPPs which number of LPPs is different from the number of LPPs present in the third subset.
- fifth protospacer means a protospacer which is present in a fifth subset of the at least two LPPs.
- the fifth subset of the at least two LPPs means a subset of LPPs comprising a fifth protospacer which is present in two or more LPPs, and/or is present in all LPPs of the host cell.
- the fifth subset consists of a number of LPPs which number of LPPs is different from the number of LPPs present in the third subset, and also different from the number of LPPs present in the fourth subset.
- RNA-guided DNA nucleases provide sequence-specific DNA cutting through base-pairing interactions bewteen a guide RNA and target DNA (protospacer sequence) of the two or more LPPs. These nucleases include, but are not limited to Cas9, MAD7, and Cpf1. RNA-gudied nucleases use base pairing to recognize and cleave target DNA with complementarity to the guide RNA. guide RNA and nuclease may be delivered to the host cell by transformation I transfection, either on the same plasmid or linearized DNA vector, or on different separate plasmids or DNA vectors.
- the nuclease may be delivered to the cell as protein, e.g. by protein transfection.
- guide RNA functions as a guide for the RNA-guided DNA nuclease.
- the guide RNA forms a complex with the nuclease and the target strand, to which the guide RNA is complementary.
- the guide RNA may comprise a spacer, which is complementary to the protospacer sequence of the DNA target, a short CRISPR RNA (crRNA), and a tracrRNA.
- the first guide RNA is a guide RNA that forms an editing complex most efficiently at the first temperature, and forms an editing complex less efficiently at other temperatures. Also, the first guide RNA comprises a spacer sequence which may be complementary to any of the first, second, third, fourth or fifth protospacers.
- the first guide RNA may be a 1st first guide RNA, a 2nd first guide RNA, 3rd first guide RNA, 4rd first guide RNA, and /or a 5th first guide RNA.
- the first donor polynucleotide comprises a first polynucleotide which encodes a first polypeptide of interest.
- the first donor polynucleotide furthermore comprises a 5’ homology arm upstream of the first polynucleotide, and a 3’ homology arm downstream of the first polynucleotide.
- the homology arms of the first donor polynucleotide are typically similar and/or identical to the homology arms of the LPPs into which the first polynucleotide is integrated into.
- the second donor polynucleotide comprises a second polynucleotide which encodes a second polypeptide of interest.
- the second donor polynucleotide furthermore comprises a 5’ homology arm upstream of the second polynucleotide, and a 3’ homology arm downstream of the second polynucleotide.
- the homology arms of the second donor polynucleotide are typically similar and/or identical to the homology arms of the LPPs into which the second polynucleotide is integrated into.
- the first temperature allows efficient nuclease-based cutting of one or more protospacers which are complemtary to one or more first guide RNA.
- the first temperature typically also allows cultivation of the host cell, but may differ from the host cell's optimal cultivation temperature.
- the second guide RNA is a guide RNA that forms an editing complex most efficiently at the second temperature, and forms an editing complex less efficiently at other temperatures. Also, the second guide RNA comprises a spacer sequence which may be complementary to any of the first, second, third, fourth or fifth protospacers.
- the second guide RNA may be a 1st second guide RNA, a 2nd second guide RNA, 3rd second guide RNA, 4rd second guide RNA, and /or a 5th second guide RNA.
- the second temperature allows efficient nuclease-based cutting of one or more protospacers which are complemtary to one or more second guide RNA.
- the second temperature typically also allows cultivation of the host cell, but may differ from the host cell's optimal cultivation temperature.
- the second temperature differs from the first temperature by at least 0.5 degrees Celsius.
- the third guide RNA is a guide RNA that forms an editing complex most efficiently at the third temperature, and forms an editing complex less efficiently at other temperatures. Also, the third guide RNA comprises a spacer sequence which may be complementary to any of the first, second, third, fourth or fifth protospacers.
- the third guide RNA may be a 1st third guide RNA, a 2nd third guide RNA, 3rd third guide RNA, 4rd third guide RNA, and /or a 5th third guide RNA.
- the third temperature allows efficient nuclease-based cutting of one or more protospacers which are complemtary to one or more third guide RNA.
- the third temperature typically also allows cultivation of the host cell, but may differ from the host cell's optimal cultivation temperature.
- the third temperature differs from the first temperature and the second temperature by at least 0.5 degrees Celsius.
- the fourth guide RNA is a guide RNA that forms an editing complex most efficiently at the fourth temperature, and forms an editing complex less efficiently at other temperatures. Also, the fourth guide RNA comprises a spacer sequence which may be complementary to any of the first, second, third, fourth or fifth protospacers.
- the fourth guide RNA may be a 1st fourth guide RNA, a 2nd fourth guide RNA, 3rd fourth guide RNA, 4rd fourth guide RNA, and /or a 5th fourth guide RNA.
- the fourth temperature allows efficient nuclease-based cutting of one or more protospacers which are complemtary to one or more fourth guide RNA.
- the fourth temperature typically also allows cultivation of the host cell, but may differ from the host cell's optimal cultivation temperature.
- the fourth temperature differs from the first temperature, the second temperature, and the third temperature by at least 0.5 degrees Celsius.
- the fifth guide RNA is a guide RNA that forms an editing complex most efficiently at the fifth temperature, and forms an editing complex less efficiently at other temperatures. Also, the fifth guide RNA comprises a spacer sequence which may be complementary to any of the first, second, third, fourth or fifth protospacers.
- the t fifth hird guide RNA may be a 1 st fifth guide RNA, a 2nd fifth guide RNA, 3rd fifth guide RNA, 4rd fifth guide RNA, and /or a 5th fifth guide RNA.
- the fifth temperature allows efficient nuclease-based cutting of one or more protospacers which are complemtary to one or more fifth guide RNA.
- the fifth temperature typically also allows cultivation of the host cell, but may differ from the host cell's optimal cultivation temperature.
- the fifth temperature differs from the first temperature, the second temperature, the third temperature, and the fourth temperature by at least 0.5 degrees Celsius.
- Homology-directed repair is a process where DNA double-strand break is repaired by homologous recombination using a DNA template.
- the DNA template may be a donor polynucleotide, or a DNA sequence previously integrated into an LPP.
- the repair mechanism uses the homology arms to integrate the DNA template into the broken target site.
- NHEJ non-homologous end-joining
- cDNA lacks intron sequences that may be present in the corresponding genomic DNA.
- the initial, primary RNA transcript is a precursor to mRNA that is processed through a series of steps, including splicing, before appearing as mature spliced mRNA.
- Coding sequence means a polynucleotide, which directly specifies the amino acid sequence of a polypeptide.
- the boundaries of the coding sequence are generally determined by an open reading frame, which begins with a start codon, such as ATG, GTG, or TTG, and ends with a stop codon, such as TAA, TAG, or TGA.
- the coding sequence may be a genomic DNA, cDNA, synthetic DNA, or a combination thereof.
- control sequences means nucleic acid sequences involved in regulation of expression of a polynucleotide in a specific organism or in vitro. Each control sequence may be native (/.e., from the same gene) or heterologous (/.e., from a different gene) to the polynucleotide encoding the polypeptide, and native or heterologous to each other. Such control sequences include, but are not limited to leader, polyadenylation, prepropeptide, propeptide, signal peptide, promoter, terminator, enhancer, and transcription or translation initiator and terminator sequences. At a minimum, the control sequences include a promoter, and transcriptional and translational stop signals. The control sequences may be provided with linkers for the purpose of introducing specific restriction sites facilitating ligation of the control sequences with the coding region of the polynucleotide encoding a polypeptide.
- expression means any step involved in the production of a polypeptide including, but not limited to, transcription, post-transcriptional modification, translation, post-translational modification, and secretion.
- Expression vector refers to a linear or circular DNA construct comprising a DNA sequence encoding a polypeptide, which coding sequence is operably linked to a suitable control sequence capable of effecting expression of the DNA in a suitable host.
- control sequences may include a promoter to effect transcription, an optional operator sequence to control transcription, a sequence encoding suitable ribosome binding sites on the mRNA, enhancers and sequences which control termination of transcription and translation.
- Fusion polypeptide is a polypeptide in which one polypeptide is fused at the N-terminus and/or the C-terminus of a polypeptide of the present invention.
- a fusion polypeptide is produced by fusing a polynucleotide encoding another polypeptide to a polynucleotide of the present invention, or by fusing two or more polynucleotides of the present invention together.
- Techniques for producing fusion polypeptides are known in the art, and include ligating the coding sequences encoding the polypeptides so that they are in frame and that expression of the fusion polypeptide is under control of the same promoter(s) and terminator.
- Fusion polypeptides may also be constructed using intein technology in which fusion polypeptides are created post-translationally (Cooper et al., 1993, EMBO J. 12: 2575-2583; Dawson et al., 1994, Science 266: 776-779).
- a fusion polypeptide can further comprise a cleavage site between the two polypeptides. Upon secretion of the fusion protein, the site is cleaved releasing the two polypeptides. Examples of cleavage sites include, but are not limited to, the sites disclosed in Martin et al., 2003, J. Ind. Microbiol. Biotechnol. 3: 568-576; Svetina et al., 2000, J.
- heterologous means, with respect to a host cell, that a polypeptide or nucleic acid does not naturally occur in the host cell.
- heterologous means, with respect to a polypeptide or nucleic acid, that a control sequence, e.g., promoter, of a polypeptide or nucleic acid is not naturally associated with the polypeptide or nucleic acid, i.e., the control sequence is from a gene other than the gene encoding the mature polypeptide.
- Host Strain or Host Cell is an organism into which an expression vector, phage, virus, or other DNA construct, including a polynucleotide encoding a polypeptide of interest (e.g., an amylase) has been introduced.
- exemplary host strains are microorganism cells (e.g., bacteria, filamentous fungi, and yeast) capable of expressing the polypeptide of interest and/or fermenting saccharides.
- the term "host cell” includes protoplasts created from cells.
- Isolated means a polypeptide, nucleic acid, cell, or other specified material or component that has been separated from at least one other material or component, including but not limited to, other proteins, nucleic acids, cells, etc.
- An isolated polypeptide, nucleic acid, cell or other material is thus in a form that does not occur in nature.
- An isolated polypeptide includes, but is not limited to, a culture broth containing the secreted polypeptide expressed in a host cell.
- Native means a nucleic acid or polypeptide naturally occurring in a host cell.
- Nucleic acid encompasses DNA, RNA, heteroduplexes, and synthetic molecules capable of encoding a polypeptide. Nucleic acids may be single stranded or double stranded, and may be chemical modifications. The terms “nucleic acid” and “polynucleotide” are used interchangeably. Because the genetic code is degenerate, more than one codon may be used to encode a particular amino acid, and the present compositions and methods encompass nucleotide sequences that encode a particular amino acid sequence. Unless otherwise indicated, nucleic acid sequences are presented in 5'-to-3' orientation.
- nucleic acid construct means a nucleic acid molecule, either single- or double-stranded, which is isolated from a naturally occurring gene or is modified to contain segments of nucleic acids in a manner that would not otherwise exist in nature or which is synthetic, and which comprises one or more control sequences operably linked to the nucleic acid sequence.
- operably linked means that specified components are in a relationship (including but not limited to juxtaposition) permitting them to function in an intended manner.
- a regulatory sequence is operably linked to a coding sequence such that expression of the coding sequence is under control of the regulatory sequence.
- PCR fragments means double stranded DNA generated by PCR, comprising overlaps at the 3’ and/or 5’ ends suitable for homologous recombination with one or more other PCR fragments.
- the homologous recombination takes usually place in the host cell after transformation of the PCR fragments, wherein the PCR fragments are integrated into one or more LPP. Examples for PCR fragments are disclosed in Example 11 and Figure 4.
- a set of PCR fragments comprises a 5’ homology arm PCR fragment, a 3’ homology arm PCR fragment, and a PCR fragment comprising the polynucleotide of interest, preferably the polynucleotide of interest is linked to a promoter and a terminator.
- purified means a nucleic acid, polypeptide or cell that is substantially free from other components as determined by analytical techniques well known in the art (e.g., a purified polypeptide or nucleic acid may form a discrete band in an electrophoretic gel, chromatographic eluate, and/or a media subjected to density gradient centrifugation).
- a purified nucleic acid or polypeptide is at least about 50% pure, usually at least about 60%, about 65%, about 70%, about 75%, about 80%, about 85%, about 90%, about 91 %, about 92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 98%, about 99%, about 99.5%, about 99.6%, about 99.7%, about 99.8% or more pure (e.g., percent by weight or on a molar basis).
- a composition is enriched for a molecule when there is a substantial increase in the concentration of the molecule after application of a purification or enrichment technique.
- the term "enriched" refers to a compound, polypeptide, cell, nucleic acid, amino acid, or other specified material or component that is present in a composition at a relative or absolute concentration that is higher than a starting composition.
- the term “purified” as used herein refers to the polypeptide or cell being essentially free from components (especially insoluble components) from the production organism. In other aspects, the term “purified” refers to the polypeptide being essentially free of insoluble components (especially insoluble components) from the native organism from which it is obtained. In one aspect, the polypeptide is separated from some of the soluble components of the organism and culture medium from which it is recovered. The polypeptide may be purified (/.e., separated) by one or more of the unit operations filtration, precipitation, or chromatography.
- the polypeptide may be purified such that only minor amounts of other proteins, in particular, other polypeptides, are present.
- purified as used herein may refer to removal of other components, particularly other proteins and most particularly other enzymes present in the cell of origin of the polypeptide.
- the polypeptide may be "substantially pure", i.e., free from other components from the organism in which it is produced, e.g., a host organism for recombinantly produced polypeptide.
- the polypeptide is at least 40% pure by weight of the total polypeptide material present in the preparation.
- the polypeptide is at least 50%, 60%, 70%, 80% or 90% pure by weight of the total polypeptide material present in the preparation.
- a "substantially pure polypeptide” may denote a polypeptide preparation that contains at most 10%, preferably at most 8%, more preferably at most 6%, more preferably at most 5%, more preferably at most 4%, more preferably at most 3%, even more preferably at most 2%, most preferably at most 1 %, and even most preferably at most 0.5% by weight of other polypeptide material with which the polypeptide is natively or recombinantly associated.
- the substantially pure polypeptide is at least 92% pure, preferably at least 94% pure, more preferably at least 95% pure, more preferably at least 96% pure, more preferably at least 97% pure, more preferably at least 98% pure, even more preferably at least 99% pure, most preferably at least 99.5% pure by weight of the total polypeptide material present in the preparation.
- the polypeptide of the present invention is preferably in a substantially pure form (/.e., the preparation is essentially free of other polypeptide material with which it is natively or recombinantly associated). This can be accomplished, for example by preparing the polypeptide by well-known recombinant methods or by classical purification methods.
- Recombinant is used in its conventional meaning to refer to the manipulation, e.g., cutting and rejoining, of nucleic acid sequences to form constellations different from those found in nature.
- the term recombinant refers to a cell, nucleic acid, polypeptide or vector that has been modified from its native state.
- recombinant cells express genes that are not found within the native (non-recombinant) form of the cell, or express native genes at different levels or under different conditions than found in nature.
- the term “recombinant” is synonymous with “genetically modified” and “transgenic”.
- Recover means the removal of a polypeptide from at least one fermentation broth component selected from the list of a cell, a nucleic acid, or other specified material, e.g., recovery ofthe polypeptide from the whole fermentation broth, or from the cell-free fermentation broth, by polypeptide crystal harvest, by filtration, e.g.
- depth filtration by use of filter aids or packed filter medias, cloth filtration in chamber filters, rotary-drum filtration, drum filtration, rotary vacuum-drum filters, candle filters, horizontal leaf filters or similar, using sheed or pad filtration in framed or modular setups
- membrane filtration using sheet filtration, module filtration, candle filtration, microfiltration, ultrafiltration in either cross flow, dynamic cross flow or dead end operation
- centrifugation using decanter centrifuges, disc stack centrifuges, hyrdo cyclones or similar
- Recovery encompasses isolation and/or purification of the polypeptide.
- Sequence identity The relatedness between two amino acid sequences or between two nucleotide sequences is described by the parameter “sequence identity”.
- the sequence identity between two amino acid sequences is determined as the output of “longest identity” using the Needleman-Wunsch algorithm (Needleman and Wunsch, 1970, J. Mol. Biol. 48: 443-453) as implemented in the Needle program of the EMBOSS package (EMBOSS: The European Molecular Biology Open Software Suite, Rice et al., 2000, Trends Genet. 16: 276-277), preferably version 6.6.0 or later.
- the parameters used are a gap open penalty of 10, a gap extension penalty of 0.5, and the EBLOSUM62 (EMBOSS version of BLOSUM62) substitution matrix.
- the Needle program In order for the Needle program to report the longest identity, the -nobrief option must be specified in the command line.
- the output of Needle labeled “longest identity” is calculated as follows:
- the sequence identity between two polynucleotide sequences is determined as the output of “longest identity” using the Needleman-Wunsch algorithm (Needleman and Wunsch, 1970, supra) as implemented in the Needle program of the EMBOSS package (EMBOSS: The European Molecular Biology Open Software Suite, Rice et al., 2000, supra), preferably version 6.6.0 or later.
- the parameters used are a gap open penalty of 10, a gap extension penalty of 0.5, and the EDNAFULL (EMBOSS version of NCBI NUC4.4) substitution matrix.
- the nobrief option must be specified in the command line.
- the output of Needle labeled “longest identity” is calculated as follows:
- Signal Peptide A "signal peptide” is a sequence of amino acids attached to the N-terminal portion of a protein, which facilitates the secretion of the protein outside the cell.
- the mature form of an extracellular protein lacks the signal peptide, which is cleaved off during the secretion process.
- Wild-type in reference to an amino acid sequence or nucleic acid sequence means that the amino acid sequence or nucleic acid sequence is a native or naturally-occurring sequence.
- naturally-occurring refers to anything (e.g., proteins, amino acids, or nucleic acid sequences) that is found in nature.
- non-naturally occurring refers to anything that is not found in nature (e.g., recombinant nucleic acids and protein sequences produced in the laboratory or modification of the wild-type sequence).
- the present invention relates to a host cell comprising in its genome at least two landing pad polynucleotide (LPP) sequences, each LPP sequence comprising in 5’ to 3’ direction: a 5’ homology arm, two or more protospacer sequences, and a 3’ homology arm, wherein the polynucleotide sequence of the 5’ homology arm is identical for at least two LPP sequences, and wherein the polynucleotide sequence of the 3’ homology arm is identical for at least two LPP sequences.
- LPP landing pad polynucleotide
- At least one protospacer of the at least two protospacers is present in at least two LPP.
- the cell comprises in its genome at least three LPP, wherein at least one LPP comprises second 3’ and second 5’ homology arms having less than 100 % sequence identity to the 3’ and 5’ homology arms of the at least two LPP, respectively, e.g., less than 99%, less than 98%, less than 97%, less than 96%, less than 95%, less than 90%, less thank 85%, less than 80%, less than 75%, less than 70%, or less than 65%.
- the polynucleotide sequence of the 5’ homology arm is identical for all of the at least two LPP sequences.
- the polynucleotide sequence of the 3’ homology arm is identical for all of the at least two LPP sequences.
- the 3’ homology arms and the 5’ homology arms are synthetic polynucleotide sequences.
- At least one LPP is located on the coding strand (3’ to 5’ direction), and at least one LPP is located on the template strand (5’ to 3’ direction).
- At least two LPPs are located on the coding strand (3’ to 5’ direction), and at least two LPPs are located on the template strand (5’ to 3’ direction).
- each LPP comprises two protospacer sequences, such as a first protospacer, and a second protospacer.
- each LPP comprises three different protospacer sequences, such as a first protospacer, a second protospacer, and a third protospacer.
- each LPP comprises four different protospacer sequences, such as a first protospacer, a second protospacer, a third protospacer, and a fourth protospacer.
- each LPP comprises five different protospacer sequences, such as a first protospacer, a second protospacer, a third protospacer, a fourth protospacer, and a fifth protospacer.
- the first protospacer has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the second protospacer, to the polynucleotide sequence of the third protospacer, to the polynucleotide sequence of the fourth protospacer, and to the polynucleotide sequence of the fifth protospacer.
- the second protospacer has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the first protospacer, to the polynucleotide sequence of the third protospacer, to the polynucleotide sequence of the fourth protospacer, and to the polynucleotide sequence of the fifth protospacer.
- a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%
- the third protospacer has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the first protospacer, to the polynucleotide sequence of the second protospacer, to the polynucleotide sequence of the fourth protospacer, and to the polynucleotide sequence of the fifth protospacer.
- a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below
- the fourth protospacer has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the first protospacer, to the polynucleotide sequence of the second protospacer, to the polynucleotide sequence of the third protospacer, and to the polynucleotide sequence of the fifth protospacer.
- a sequence identity of below 100% such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%,
- the fifth protospacer has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the first protospacer, to the polynucleotide sequence of the second protospacer, to the polynucleotide sequence of the third protospacer, and to the polynucleotide sequence of the fourth protospacer.
- a sequence identity of below 100% such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%,
- each LPP comprises more than five different protospacer sequences.
- one or more of the at least two protospacer sequences is heterologous to the host cell.
- all of the at least two protospacer sequences are heterologous to the host cell.
- one or more of the at least two protospacer sequences is synthetic.
- all of the at least two protospacer sequences is synthetic.
- each protospacer sequence has a length of at least 15 nt, at least 20 nt, or at least 25 nt, preferably each protospacer has a length of 20 - 30 nt.
- the host cell comprises two LPP sequences, three LPP sequences, four LPP sequences, 5 LPP sequences, 6 LPP sequences, 7 LPP sequences, 8 LPP sequences, 9 LPP sequences, 10 LPP sequences, 11 LPP sequences, 12 LPP sequences, 13 LPP sequences, 14 LPP sequences, 15 LPP sequences, 16 LPP sequences, or more than 16 LPP sequences.
- the host cell comprises 2 - 8 LPP sequences.
- the host cell comprises at least two chromosomes, at least four chromosomes, at least six chromosomes or at least 8 chromosomes.
- one or more of the host cell chromosomes comprises a total of 1 LPP per chromosome. In one embodiment, two or more chromosomes comprise a total of 1 LPP per chromosome.
- three chromosomes comprise a total of 1 LPP per chromosome.
- four chromosomes comprise a total of 1 LPP per chromosome.
- five chromosomes comprise a total of 1 LPP per chromosome.
- six chromosomes comprise a total of 1 LPP per chromosome.
- 7 chromosomes comprise a total of 1 LPP per chromosome.
- 8 chromosomes comprise a total of 1 LPP per chromosome.
- each chromosome comprises a total of 1 LPP per chromosome.
- LPP is located on the short arm of a chromosome.
- the LPP is located on the long arm of a chromosome.
- one or more of the chromosomes comprises a total of 2 LPP per chromosome.
- two or more chromosomes comprise a total of 2 LPP per chromosome.
- three chromosomes comprise a total of 2 LPP per chromosome.
- four chromosomes comprise a total of 2 LPP per chromosome.
- five chromosomes comprise a total of 2 LPP per chromosome.
- six chromosomes comprise a total of 2 LPP per chromosome.
- chromosomes comprise a total of 2 LPP per chromosome.
- 8 chromosomes comprise a total of 2 LPP per chromosome.
- each chromosome comprises a total of 2 LPP per chromosome.
- the 2 LPP are located on opposite locations of the centromere of the chromosome.
- one LPP is located on the short arm, and one LPP is located on the long arm.
- one or more of the chromosomes comprises a total of 3 LPP per chromosome.
- chromosomes comprise a total of 3 LPP per chromosome.
- three chromosomes comprise a total of 3 LPP per chromosome.
- four chromosomes comprise a total of 3 LPP per chromosome.
- five chromosomes comprise a total of 3 LPP per chromosome.
- six chromosomes comprise a total of 3 LPP per chromosome.
- 7 chromosomes comprise a total of 3 LPP per chromosome. In one embodiment, 8 chromosomes comprise a total of 3 LPP per chromosome.
- each chromosome comprises a total of 3 LPP per chromosome.
- the ratio of numbers of LPP between short arm and long arm is selected from the list of 3:0, 0:3, 2:1 , and 1 :2.
- one or more of the chromosomes comprises a total of 4 LPP per chromosome.
- two or more chromosomes comprise a total of 4 LPP per chromosome.
- three chromosomes comprise a total of 4 LPP per chromosome.
- four chromosomes comprise a total of 4 LPP per chromosome.
- five chromosomes comprise a total of 4 LPP per chromosome.
- six chromosomes comprise a total of 4 LPP per chromosome.
- 7 chromosomes comprise a total of 4 LPP per chromosome.
- 8 chromosomes comprise a total of 4 LPP per chromosome.
- each chromosome comprises a total of 4 LPP per chromosome.
- the ratio of numbers of LPP between short arm and long arm is selected from the list of 4:0, 0:4, 3:1 , 1 :3, 2:2, 2:1 , and 1 :2.
- the host cell is deficient of a non-homologous end joining (NHEJ) mechanism.
- NHEJ non-homologous end joining
- the deficiency in the NHEJ mechanism increases the efficiency of homology directed repair and thus increases the efficiency of DNA integration using the method of the second aspect.
- NHEJ-deficiency in host cells increases the stability of the genome, also when there are empty LPPs present in the host cell genome.
- each 3’ homology arm and each 5’ homology arm has a length of at least 10 nt, at least 20 nt, at least 30 nt, at least 50 nt, at least 100 nt, 200 nucleotides (nt), at least 300 nt, at least 400 nt, at least 500 nt, at least 600 nt, at least 700 nt, at least 800 nt, at least 900 nt, or at least 1000 nt.
- each 3’ homology arm and each 5’ homology arm has a length of at least 100 nt.
- one or more LPP of the at least two LPP comprise a selectable marker, and/or a counter-selectable marker.
- 2 or more LPP of the at least two LPP comprise a selectable marker, and/or a counter- selectable marker.
- 3 or more LPP of the at least two LPP comprise a selectable marker, and/or a counter- selectable marker. In one embodiment, 4 or more LPP of the at least two LPP comprise a selectable marker, and/or a counter- selectable marker.
- 5 or more LPP of the at least two LPP comprise a selectable marker, and/or a counter- selectable marker.
- 6 or more LPP of the at least two LPP comprise a selectable marker, and/or a counter- selectable marker.
- 7 or more LPP of the at least two LPP comprise a selectable marker, and/or a counter- selectable marker.
- 8 or more LPP of the at least two LPP comprise a selectable marker, and/or a counter- selectable marker.
- the selectable marker comprises a marker selected from the list of a pyrG gene, a fcy1 gene, an adeA gene, an adeB gene, a metF gene, a metG gene, a metH gene, a lysF gene, a trpC gene, a sC gene, an argB gene, and a gene encoding a fluorescent protein, such as a GFP, YFP, CFP, DsRed, or a eqPF611 .
- the counter-selectable marker comprises a marker selected from the list of a pyrG gene, a fcy1 gene, an adeA gene, an adeB gene, a metF gene, a metG gene, a metH gene, a lysF gene, a trpC gene, a sC gene, an argB gene, and a gene encoding a fluorescent protein, such as a GFP, YFP, CFP, DsRed, or a eqPF611 .
- none of the at least two LPP comprises a selectable marker, and/or a counter- selectable marker.
- each DNA strain complementary to the two or more protospacers sequences comprises a PAM sequence (protospacer adjacent motif).
- the first, second, third, fourth or fifth protospacer sequence is selected from a sequence having a sequence identity of at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEW ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11 , SEQ ID NO: 12, or SEQ ID NO: 13.
- the 5’ homology arm sequences comprises or consists of a polynucleotide sequence having a sequence identity of at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to SEQ ID NO: 1 or SEQ ID NO: 2.
- the 5’ homology arm sequence comprises or consists of SEQ ID NO: 1 or SEQ ID NO: 2. In one embodiment, the 3’ homology arm sequence comprises or consists of SEQ ID NO: 1 or SEQ ID NO: 2.
- the 3’ homology arm sequences comprises or consists of a polynucleotide sequence having a sequence identity of at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to SEQ ID NO: 1 or SEQ ID NO: 2.
- each of the at least two LPP comprises a first protospacer sequence with a polynucleotide sequence identical for each first protospacer of each at least two LPP.
- each LPP is subject for a double-strand break, and thus the gene of interest can be integrated into each of the LPPs of the host cell.
- each of the at least two LPP comprises a second protospacer sequence with a polynucleotide sequence which is unique for each second protospacer of each at least two LPP.
- each of the at least two LPP comprises a third protospacer sequence with a polynucleotide sequence which is identical for each third polynucleotide within a third subset of the at least two LPPs.
- the third subset of LPPs is subject for a double-strand break, and thus the gene of interest can be integrated into each of the LPPs of the third subset.
- the third subset consists of 2 LPP sequences, 3 LPP sequences, 4 LPP sequences, 5 LPP sequences, 6 LPP sequences, 7 LPP sequences, 8 LPP sequences, 9 LPP sequences, 10 LPP, sequences, 11 LPP sequences, 12 LPP sequences, 13 LPP sequences, 14 LPP sequences, or 15 LPP sequences.
- each of the at least two LPP comprises a fourth protospacer sequence with a polynucleotide sequence which is identical for each fourth polynucleotide within a fourth subset of the at least two LPPs.
- the fourth subset of LPPs is subject for a double-strand break, and thus the gene of interest can be integrated into each of the LPPs of the fourth subset.
- the fourth subset consists of 2 LPP sequences, 3 LPP sequences, 4 LPP sequences, 5 LPP sequences, 6 LPP sequences, 7 LPP sequences, 8 LPP sequences, 9 LPP sequences, 10 LPP, sequences, 11 LPP sequences, 12 LPP sequences, 13 LPP sequences, 14 LPP sequences, or 15 LPP sequences.
- each of the at least two LPP comprises a fifth protospacer sequence with a polynucleotide sequence which is identical for each fifth protospacer within a fifth subset of the at least two LPPs.
- the fifth subset of LPPs is subject for a double-strand break, and thus the gene of interest can be integrated into each of the LPPs of the fifth subset.
- the fifth subset consists of 2 LPP sequences, 3 LPP sequences, 4 LPP sequences, 5 LPP sequences, 6 LPP sequences, 7 LPP sequences, 8 LPP sequences, 9 LPP sequences, 10 LPP, sequences, 11 LPP sequences, 12 LPP sequences, 13 LPP sequences, 14 LPP sequences, or 15 LPP sequences.
- the host cell comprises three LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the three LPPs.
- the host cell comprises four LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the four LPPs.
- the host cell comprises four LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the four LPPs.
- the host cell comprises five LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the five LPPs.
- the host cell comprises five LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the five LPPs.
- the host cell comprises five LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the five LPPs.
- the host cell comprises six LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the six LPPs.
- the host cell comprises six LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the six LPPs.
- the host cell comprises six LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the six LPPs.
- the host cell comprises six LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the six LPPs.
- the host cell comprises 7 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 7 LPPs.
- the host cell comprises 7 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 7 LPPs.
- the host cell comprises 7 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 7 LPPs.
- the host cell comprises 7 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 7 LPPs.
- the host cell comprises 7 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 7 LPPs. In one embodiment, the host cell comprises 8 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 8 LPPs.
- the host cell comprises 8 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 8 LPPs.
- the host cell comprises 8 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 8 LPPs.
- the host cell comprises 8 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 8 LPPs.
- the host cell comprises 8 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 8 LPPs.
- the host cell comprises 8 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 7 of the 8 LPPs.
- the host cell comprises 9 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 9 LPPs.
- the host cell comprises 9 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 9 LPPs.
- the host cell comprises 9 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 9 LPPs.
- the host cell comprises 9 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 9 LPPs.
- the host cell comprises 9 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 9 LPPs.
- the host cell comprises 9 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 7 of the 9 LPPs.
- the host cell comprises 9 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 8 of the 9 LPPs.
- the host cell comprises 10 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 10 LPPs.
- the host cell comprises 10 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 10 LPPs.
- the host cell comprises 10 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 10 LPPs.
- the host cell comprises 10 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 10 LPPs. In one embodiment, the host cell comprises 10 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 10 LPPs.
- the host cell comprises 10 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 7 of the 10 LPPs.
- the host cell comprises 10 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 8 of the 10 LPPs.
- the host cell comprises 10 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 9 of the 10 LPPs.
- the host cell comprises 11 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 11 LPPs.
- the host cell comprises 11 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 11 LPPs.
- the host cell comprises 11 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 11 LPPs.
- the host cell comprises 11 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 11 LPPs.
- the host cell comprises 11 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 11 LPPs.
- the host cell comprises 11 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 7 of the 11 LPPs.
- the host cell comprises 11 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 8 of the 11 LPPs.
- the host cell comprises 11 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 9 of the 11 LPPs.
- the host cell comprises 11 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 10 of the 11 LPPs.
- the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 12 LPPs.
- the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 12 LPPs.
- the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 12 LPPs.
- the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 12 LPPs. In one embodiment, the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 12 LPPs.
- the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 7 of the 12 LPPs.
- the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 8 of the 12 LPPs.
- the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 9 of the 12 LPPs.
- the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 10 of the 12LPPs.
- the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 11 of the 12 LPPs.
- the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 13 LPPs.
- the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 13 LPPs.
- the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 13 LPPs.
- the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 13 LPPs.
- the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 13 LPPs.
- the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 7 of the 13 LPPs.
- the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 8 of the 13 LPPs.
- the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 9 of the 13 LPPs.
- the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 10 of the 13 LPPs.
- the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 11 of the 13 LPPs.
- the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 12 of the 13 LPPs. In one embodiment, the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 14 LPPs.
- the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 14 LPPs.
- the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 14 LPPs.
- the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 14 LPPs.
- the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 14 LPPs.
- the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 7 of the 14 LPPs.
- the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 8 of the 14 LPPs.
- the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 9 of the 14 LPPs.
- the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 10 of the 14 LPPs.
- the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 11 of the 14 LPPs.
- the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 12 of the 14 LPPs.
- the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 13 of the 14 LPPs.
- the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 15 LPPs.
- the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 15 LPPs.
- the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 15 LPPs.
- the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 15 LPPs.
- the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 15 LPPs. In one embodiment, the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 7 of the 15 LPPs.
- the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 8 of the 15 LPPs.
- the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 9 of the 15 LPPs.
- the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 10 of the 15 LPPs.
- the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 11 of the 15 LPPs.
- the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 12 of the 15 LPPs.
- the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 13 of the 15 LPPs.
- the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 14 of the 15 LPPs.
- the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 16 LPPs.
- the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 16 LPPs.
- the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 16 LPPs.
- the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 16 LPPs.
- the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 16 LPPs.
- the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 7 of the 16 LPPs.
- the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 8 of the 16 LPPs.
- the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 9 of the 16 LPPs.
- the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 10 of the 16 LPPs. In one embodiment, the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 11 of the 16 LPPs.
- the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 12 of the 16 LPPs.
- the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 13 of the 16 LPPs.
- the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 14 of the 16 LPPs.
- the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 15 of the 16 LPPs.
- the host cell is a fungal cell.
- the host cell is a yeast recombinant host cell, e.g., a Candida, Hansenula, Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia cell, such as a Kluyveromyces lactis, Saccharomyces carlsbergensis, Saccharomyces cerevisiae, Saccharomyces diastaticus, Saccharomyces douglasii, Saccharomyces kluyveri, Saccharomyces norbensis, Saccharomyces oviformis, or Yarrowia lipolytica cell.
- a yeast recombinant host cell e.g., a Candida, Hansenula, Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia cell, such as a Kluyveromyces lactis, Saccharomyces carlsbergensis,
- the host cell is a filamentous fungal recombinant host cell, e.g., an Acremonium, Aspergillus, Aureobasidium, Bjerkandera, Ceriporiopsis, Chrysosporium, Coprinus, Coriolus, Cryptococcus, Filibasidium, Fusarium, Humicola, Magnaporthe, Mucor, Myceliophthora, Neocallimastix, Neurospora, Paecilomyces, Penicillium, Phanerochaete, Phlebia, Piromyces, Pleurotus, Schizophyllum, Talaromyces, Thermoascus, Thielavia, Tolypocladium, Trametes, or Trichoderma cell, in particular, an Aspergillus awamori, Aspergillus foetidus, Aspergillus fumigatus, Aspergillus japonicus, Aspergillus nidulans,
- the host cell is a Trichoderma cell, preferably a Trichderma reesei cell.
- the host cell is isolated.
- the host cell is purified.
- the host cell is a prokaryotic recombinant host cell, e.g., a Gram-positive cell selected from the group consisting of Bacillus, Clostridium, Enterococcus, Geobacillus, Lactobacillus, Lactococcus, Oceanobacillus, Staphylococcus, Streptococcus, or Streptomyces cells, or a Gram-negative bacteria selected from the group consisting of Campylobacter, E.
- a prokaryotic recombinant host cell e.g., a Gram-positive cell selected from the group consisting of Bacillus, Clostridium, Enterococcus, Geobacillus, Lactobacillus, Lactococcus, Oceanobacillus, Staphylococcus, Streptococcus, or Streptomyces cells, or a Gram-negative bacteria selected from the group consisting of Campylobacter, E.
- Flavobacterium Fusobacterium, Helicobacter, llyobacter, Neisseria, Pseudomonas, Salmonella, and Ureaplasma cells, such as Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, Bacillus firmus, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megaterium, Bacillus pumilus, Bacillus stearothermophilus, Bacillus subtilis, Bacillus thuringiensis, Streptococcus equisimilis, Streptococcus pyogenes, Streptococcus uberis, and Streptococcus equi subsp.
- the host cell is a Bacillus cell.
- the host cell is a Bacillus licheniformis cell.
- the host cell is a Bacillus subtilis cell.
- the host cell is a prokaryotic host cell and wherein at least one LPP is located between two selectable and/or counter-selectable marker genes.
- the host cell is a mammalian cell.
- the mammalian cell is selected from the list of a human cell, a mouse cell, a rat cell, a mouse hybridoma cell, a hamser cell (e.g. Chinese hamster ovary cell), and a rat hybridoma cell.
- a human cell e.g. human hamster ovary cell
- a hamser cell e.g. Chinese hamster ovary cell
- a rat hybridoma cell e.g. Chinese hamster ovary cell
- a construct or vector comprising a polynucleotide is introduced into a host cell so that the construct or vector is maintained as a chromosomal integrant or as a self-replicating extra-chromosomal vector as described earlier.
- the choice of a host cell will to a large extent depend upon the gene encoding the polypeptide and its source.
- the polypeptide can be native or heterologous to the recombinant host cell.
- at least one of the one or more control sequences can be heterologous to the polynucleotide encoding the polypeptide of interest.
- the recombinant host cell may comprise at least two copies, e.g., three, four, five, or more copies of the polynucleotide of the present invention.
- the host cell may be any microbial cell useful in the recombinant production of a polypeptide of interest, e.g., a prokaryotic cell or a fungal cell.
- the prokaryotic host cell may be any Gram-positive or Gram-negative bacterium.
- Gram-positive bacteria include, but are not limited to, Bacillus, Clostridium, Enterococcus, Geobacillus, Lactobacillus, Lactococcus, Oceanobacillus, Staphylococcus, Streptococcus, and Streptomyces.
- Gram-negative bacteria include, but are not limited to, Campylobacter, E. coli, Flavobacterium, Fusobacterium, Helicobacter, llyobacter, Neisseria, Pseudomonas, Salmonella, and Ureaplasma.
- the bacterial host cell may be any Bacillus cell including, but not limited to, Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, Bacillus firmus, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megaterium, Bacillus pumilus, Bacillus stearothermophilus, Bacillus subtilis, and Bacillus thuringiensis cells.
- the Bacillus cell is a Bacillus amyloliquefaciens, Bacillus licheniformis and Bacillus subtilis cell.
- Bacillus classes/genera/species shall be defined as described in Patel and Gupta, 2020, Int. J. Syst. Evol. Microbiol. 70: 406-438.
- the bacterial host cell may also be any Streptococcus cell including, but not limited to, Streptococcus equisimilis, Streptococcus pyogenes, Streptococcus uberis, and Streptococcus equi subsp. Zooepidemicus cells.
- the bacterial host cell may also be any Streptomyces cell including, but not limited to, Streptomyces achromogenes, Streptomyces avermitilis, Streptomyces coelicolor, Streptomyces griseus, and Streptomyces lividans cells.
- Methods for introducing DNA into prokaryotic host cells are well-known in the art, and any suitable method can be used including but not limited to protoplast transformation, competent cell transformation, electroporation, conjugation, transduction, with DNA introduced as linearized or as circular polynucleotide. Persons skilled in the art will be readily able to identify a suitable method for introducing DNA into a given prokaryotic cell depending, e.g., on the genus. Methods for introducing DNA into prokaryotic host cells are for example described in Heinze et al., 2018, BMC Microbiology 18:56, Burke et al., 2001 , Proc. Natl. Acad. Sci. USA 98: 6289-6294, Choi et al., 2006, J. Microbiol. Methods 64: 391-397, and Donald et al., 2013, J. Bacteriol. 195(1 1): 2612-2620.
- the host cell may be a fungal cell.
- “Fungi” as used herein includes the phyla Ascomycota, Basidiomycota, Chytridiomycota, and Zygomycota as well as the Oomycota and all mitosporic fungi (as defined by Hawksworth et al., In, Ainsworth and Bisby’s Dictionary of The Fungi, 8th edition, 1995, CAB International, University Press, Cambridge, UK).
- Fungal cells may be transformed by a process involving protoplast-mediated transformation, Agrobacterium-mediated transformation, electroporation, biolistic method and shock-wave-mediated transformation as reviewed by Li et al., 2017, Microbial Cell Factories 16: 168 and procedures described in EP 238023, Yelton et al., 1984, Proc. Natl. Acad. Sci. USA 81 : 1470-1474, Christensen et al., 1988, Bio/Technology 6: 1419-1422, and Lubertozzi and Keasling, 2009, Biotechn. Advances 27: 53-75.
- any method known in the art for introducing DNA into a fungal host cell can be used, and the DNA can be introduced as linearized or as circular polynucleotide.
- the fungal host cell may be a yeast cell.
- yeast as used herein includes ascosporogenous yeast (Endomycetales), basidiosporogenous yeast, and yeast belonging to the Fungi Imperfecti (Blastomycetes). For purposes of this invention, yeast shall be defined as described in Biology and Activities of Yeast (Skinner, Passmore, and Davenport, editors, Soc. App. Bacteriol. Symposium Series No. 9, 1980).
- the yeast host cell may be a Candida, Hansenula, Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia cell, such as a Kluyveromyces lactis, Saccharomyces carlsbergensis, Saccharomyces cerevisiae, Saccharomyces diastaticus, Saccharomyces douglasii, Saccharomyces kluyveri, Saccharomyces norbensis, Saccharomyces oviformis, or Yarrowia lipolytica cell.
- the yeast host cell is a Pichia or Komagataella cell, e.g., a Pichia pastoris cell (Komagataella phaffii).
- the fungal host cell may be a filamentous fungal cell.
- “Filamentous fungi” include all filamentous forms of the subdivision Eumycota and Oomycota (as defined by Hawksworth et al., 1995, supra).
- the filamentous fungi are generally characterized by a mycelial wall composed of chitin, cellulose, glucan, chitosan, mannan, and other complex polysaccharides. Vegetative growth is by hyphal elongation and carbon catabolism is obligately aerobic. In contrast, vegetative growth by yeasts such as Saccharomyces cerevisiae is by budding of a unicellular thallus and carbon catabolism may be fermentative.
- the filamentous fungal host cell may be an Aspergillus awamori, Aspergillus foetidus, Aspergillus fumigatus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus niger, Aspergillus oryzae, Bjerkandera adusta, Ceriporiopsis aneirina, Ceriporiopsis caregiea, Ceriporiopsis gilvescens, Ceriporiopsis pannocinta, Ceriporiopsis rivulosa, Ceriporiopsis subrufa, Ceriporiopsis subvermispora, Chrysosporium inops, Chrysosporium keratinophilum, Chrysosporium lucknowense, Chrysosporium merdarium, Chrysosporium pannicola, Chrysosporium queenslandicum, Chrysosporium tropicum, Chrysosporium zona
- LBPs Landing Pad Polynucleotides
- the present invention also relates to at least two LPPs present in the host cell genome, as described herein.
- Each LPP comprising a 5’ homology arm, two or more protospacers, and a 3’ homology arm.
- the LPP may be a genomic DNA, a cDNA, a synthetic DNA, or a combination thereof.
- the LPP comprises an protospacer array of at least 2 protospacers comprising at least 50 nucleotides (e.g., nucleotides 1 to 50 of SEQ ID NOs: 14 -17).
- the LPP comprises an protospacer array of at least 3 protospacers comprising at least 75 nucleotides (e.g., nucleotides 1 to 75 of SEQ ID NOs: 14 -17).
- the LPP comprises an protospacer array of at least 4 protospacers comprising at least 100 nucleotides (e.g., nucleotides 1 to 100 of SEQ ID NOs: 14 -17). In an aspect, the LPP comprises an protospacer array of at least 5 protospacers comprising at least 125 nucleotides (e.g., nucleotides 1 to 125 of SEQ ID NOs: 14 -17).
- the protospacers of the LPP may also be mutated by introduction of nucleotide substitutions, or by introduction of nucleotide substitutions that may give rise to a different target sequence.
- the homology arms of the LPP may also be mutated by introduction of nucleotide substitutions, or by introduction of nucleotide substitutions that may give rise to a different homology arm sequence.
- nucleotide substitution see, e.g., Ford et al., 1991 , Protein Expression and Purification 2: 95-107.
- the LPP is isolated.
- the LPP is purified.
- the present invention also relates to nucleic acid constructs comprising an LPP, and to nucleic acid constructs comprising a donor polynucleotide of the present invention.
- the polynucleotide encoding a polypeptide of interest is operably linked to one or more control sequences that direct the expression of the coding sequence in a suitable host cell under conditions compatible with the control sequences.
- the donor polynucleotide may be manipulated in a variety of ways to provide for expression of the polypeptide. Manipulation of the polynucleotide prior to its insertion into a vector may be desirable or necessary depending on the expression vector. Techniques for modifying polynucleotides utilizing recombinant DNA methods are well known in the art.
- the control sequence may be a promoter, a polynucleotide that is recognized by a host cell for expression of a polynucleotide encoding a polypeptide of the present invention.
- the promoter contains transcriptional control sequences that mediate the expression of the polypeptide.
- the promoter may be any polynucleotide that shows transcriptional activity in the host cell including mutant, truncated, and hybrid promoters, and may be obtained from genes encoding extracellular or intracellular polypeptides either homologous or heterologous to the host cell.
- Suitable promoters for directing transcription of the polynucleotide of the present invention in a bacterial host cell are described in Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Lab., NY, Davis et al., 2012, supra, and Song et al., 2016, PLOS One 11 (7): e0158447.
- promoters for directing transcription of the polynucleotide of the present invention in a filamentous fungal host cell are promoters obtained from Aspergillus, Fusarium, Rhizomucor and Trichoderma cells, such as the promoters described in Mukherjee et al., 2013, “Trichoderma-. Biology and Applications”, and by Schmoll and Dattenbbck, 2016, “Gene Expression Systems in Fungi: Advancements and Applications”, Fungal Biology.
- the control sequence may also be a transcription terminator, which is recognized by a host cell to terminate transcription.
- the terminator is operably linked to the 3’-terminus of the polynucleotide encoding the polypeptide. Any terminator that is functional in the host cell may be used in the present invention.
- Preferred terminators for bacterial host cells may be obtained from the genes for Bacillus clausii alkaline protease (aprH), Bacillus licheniformis alpha-amylase (amyL), and Escherichia coll ribosomal RNA (rrnS).
- aprH Bacillus clausii alkaline protease
- AmyL Bacillus licheniformis alpha-amylase
- rrnS Escherichia coll ribosomal RNA
- Preferred terminators for filamentous fungal host cells may be obtained from Aspergillus or Trichoderma species, such as obtained from the genes for Aspergillus niger glucoamylase, Trichoderma reesei beta-glucosidase, Trichoderma reesei cellobiohydrolase I, and Trichoderma reesei endoglucanase I, such as the terminators described in Mukherjee et al., 2013, “Trichoderma-. Biology and Applications”, and by Schmoll and Dattenbbck, 2016, “Gene Expression Systems in Fungi: Advancements and Applications”, Fungal Biology.
- Preferred terminators for yeast host cells may be obtained from the genes for Saccharomyces cerevisiae enolase, Saccharomyces cerevisiae cytochrome C (CYC1), and Saccharomyces cerevisiae glyceraldehyde-3-phosphate dehydrogenase.
- Other useful terminators for yeast host cells are described by Romanos et al., 1992, Yeast 8: 423-488.
- control sequence may also be an mRNA stabilizer region downstream of a promoter and upstream of the coding sequence of a gene which increases expression of the gene.
- mRNA stabilizer regions are obtained from a Bacillus thuringiensis crylllA gene (WO 94/25612) and a Bacillus subtilis SP82 gene (Hue et al., 1995, J. Bacteriol. 177: 3465-3471).
- mRNA stabilizer regions for fungal cells are described in Geisberg et al., 2014, Cell 156(4): 812-824, and in Morozov et al., 2006, Eukaryotic Ce// 5(11): 1838-1846.
- the control sequence may also be a leader, a non-translated region of an mRNA that is important for translation by the host cell.
- the leader is operably linked to the 5’-terminus of the polynucleotide encoding the polypeptide. Any leader that is functional in the host cell may be used.
- Suitable leaders for bacterial host cells are described by Hambraeus et al., 2000, Microbiology 146(12): 3051-3059, and by Kaberdin and Blasi, 2006, FEMS Microbiol. Rev. 30(6): 967-979.
- Preferred leaders for filamentous fungal host cells may be obtained from the genes for Aspergillus oryzae TAKA amylase and Aspergillus nidulans triose phosphate isomerase.
- Suitable leaders for yeast host cells may be obtained from the genes for Saccharomyces cerevisiae enolase (ENO-1), Saccharomyces cerevisiae 3-phosphoglycerate kinase, Saccharomyces cerevisiae alpha-factor, and Saccharomyces cerevisiae alcohol dehydrogenase/glyceraldehyde- 3-phosphate dehydrogenase (ADH2/GAP).
- ENO-1 Saccharomyces cerevisiae enolase
- Saccharomyces cerevisiae 3-phosphoglycerate kinase Saccharomyces cerevisiae alpha-factor
- Saccharomyces cerevisiae alcohol dehydrogenase/glyceraldehyde- 3-phosphate dehydrogenase ADH2/GAP
- the control sequence may also be a polyadenylation sequence, a sequence operably linked to the 3’-terminus of the polynucleotide which, when transcribed, is recognized by the host cell as a signal to add polyadenosine residues to transcribed mRNA. Any polyadenylation sequence that is functional in the host cell may be used.
- Preferred polyadenylation sequences for filamentous fungal host cells are obtained from the genes for Aspergillus nidulans anthranilate synthase, Aspergillus niger glucoamylase, Aspergillus niger alphaglucosidase, Aspergillus oryzae TAKA amylase, and Fusarium oxysporum trypsin-like protease.
- the control sequence may also be a signal peptide coding region that encodes a signal peptide linked to the N-terminus of a polypeptide and directs the polypeptide into the cell’s secretory pathway.
- the 5’-end of the coding sequence of the polynucleotide may inherently contain a signal peptide coding sequence naturally linked in translation reading frame with the segment of the coding sequence that encodes the polypeptide.
- the 5’-end of the coding sequence may contain a signal peptide coding sequence that is heterologous to the coding sequence.
- a heterologous signal peptide coding sequence may be required where the coding sequence does not naturally contain a signal peptide coding sequence.
- a heterologous signal peptide coding sequence may simply replace the natural signal peptide coding sequence to enhance secretion of the polypeptide. Any signal peptide coding sequence that directs the expressed polypeptide into the secretory pathway of a host cell may be used.
- Effective signal peptide coding sequences for bacterial host cells are the signal peptide coding sequences obtained from the genes for Bacillus NCIB 11837 maltogenic amylase, Bacillus licheniformis subtilisin, Bacillus licheniformis beta-lactamase, Bacillus stearothermophilus alpha-amylase, Bacillus stearothermophilus neutral proteases (nprT, nprS, nprM), and Bacillus subtilis prsA. Further signal peptides are described by Freudl, 2018, Microbial Cell Factories 17: 52.
- Effective signal peptide coding sequences for filamentous fungal host cells are the signal peptide coding sequences obtained from the genes for Aspergillus niger neutral amylase, Aspergillus niger glucoamylase, Aspergillus oryzae TAKA amylase, Humicola insolens cellulase, Humicola insolens endoglucanase V, Humicola lanuginosa lipase, and Rhizomucor miehei aspartic proteinase, such as the signal peptide described by Xu et al., 2018, Biotechnology Letters 40: 949-955
- Useful signal peptides for yeast host cells are obtained from the genes for Saccharomyces cerevisiae alpha-factor and Saccharomyces cerevisiae invertase. Other useful signal peptide coding sequences are described by Romanos et al., 1992, supra.
- the control sequence may also be a propeptide coding sequence that encodes a propeptide positioned at the N-terminus of a polypeptide.
- the resultant polypeptide is known as a proenzyme or propolypeptide (or a zymogen in some cases).
- a propolypeptide is generally inactive and can be converted to an active polypeptide by catalytic or autocatalytic cleavage of the propeptide from the propolypeptide.
- the propeptide coding sequence may be obtained from the genes for Bacillus subtilis alkaline protease (aprE), Bacillus subtilis neutral protease (nprT), Myceliophthora thermophila laccase (WO 95/33836), Rhizomucor miehei aspartic proteinase, and Saccharomyces cerevisiae alpha-factor. Where both signal peptide and propeptide sequences are present, the propeptide sequence is positioned next to the N-terminus of a polypeptide and the signal peptide sequence is positioned next to the N-terminus of the propeptide sequence.
- the polypeptide when both signal peptide and propeptide sequences are present, the polypeptide may comprise only a part of the signal peptide sequence and/or only a part of the propeptide sequence.
- the final or isolated polypeptide may comprise a mixture of mature polypeptides and polypeptides which comprise, either partly or in full length, a propeptide sequence and/or a signal peptide sequence.
- regulatory sequences that regulate expression of the polypeptide relative to the growth of the host cell.
- regulatory sequences are those that cause expression of the gene to be turned on or off in response to a chemical or physical stimulus, including the presence of a regulatory compound.
- Regulatory sequences in prokaryotic systems include the lac, tac, and trp operator systems.
- yeast the ADH2 system or GAL1 system may be used.
- the Aspergillus niger glucoamylase promoter In filamentous fungi, the Aspergillus niger glucoamylase promoter, Aspergillus oryzae TAKA alpha-amylase promoter, and Aspergillus oryzae glucoamylase promoter, Trichoderma reesei cellobiohydrolase I promoter, and Trichoderma reesei cellobiohydrolase II promoter may be used.
- Other examples of regulatory sequences are those that allow for gene amplification. In fungal systems, these regulatory sequences include the dihydrofolate reductase gene that is amplified in the presence of methotrexate, and the metallothionein genes that are amplified with heavy metals.
- the control sequence may also be a transcription factor, a polynucleotide encoding a polynucleotide-specific DNA-binding polypeptide that controls the rate of the transcription of genetic information from DNA to mRNA by binding to a specific polynucleotide sequence.
- the transcription factor may function alone and/or together with one or more other polypeptides or transcription factors in a complex by promoting or blocking the recruitment of RNA polymerase.
- Transcription factors are characterized by comprising at least one DNA-binding domain which often attaches to a specific DNA sequence adjacent to the genetic elements which are regulated by the transcription factor.
- the transcription factor may regulate the expression of a protein of interest either directly, i.e., by activating the transcription of the gene encoding the protein of interest by binding to its promoter, or indirectly, i.e., by activating the transcription of a further transcription factor which regulates the transcription of the gene encoding the protein of interest, such as by binding to the promoter of the further transcription factor.
- Suitable transcription factors for fungal host cells are described in WO 2017/144177.
- Suitable transcription factors for prokaryotic host cells are described in Seshasayee et al., 2011 , Subcellular Biochemistry 52: 7-23, as well in Balleza et al., 2009, FEMS Microbiol. Rev. 33(1): 133-151.
- the present invention also relates to recombinant expression vectors comprising a donor polynucleotide of the present invention, a promoter, and transcriptional and translational stop signals.
- the various nucleotide, homology arms and control sequences may be joined together to produce a recombinant expression vector that may include one or more convenient restriction sites to allow for insertion or substitution of the donor polynucleotide encoding the polypeptide at the LPPs.
- the coding sequence is located in the vector so that the coding sequence is operably linked with the appropriate control sequences for expression.
- the recombinant expression vector may be any vector (e.g., a plasmid or virus) that can be conveniently subjected to recombinant DNA procedures and can bring about expression of the polynucleotide of interest.
- the choice of the vector will typically depend on the compatibility of the vector with the host cell into which the vector is to be introduced.
- the vector may be a linear or closed circular plasmid.
- the vector may be an autonomously replicating vector, i.e., a vector that exists as an extrachromosomal entity, the replication of which is independent of chromosomal replication, e.g., a plasmid, an extrachromosomal element, a minichromosome, or an artificial chromosome.
- the vector may contain any means for assuring self-replication.
- the vector may be one that, when introduced into the host cell, is integrated into the genome and replicated together with the chromosome(s) into which it has been integrated.
- a single vector or plasmid or two or more vectors or plasmids that together contain the total DNA to be introduced into the genome of the host cell, or a transposon may be used.
- the vector may contain one or more selectable markers that permit easy selection of transformed, transfected, transduced, or the like cells.
- a selectable marker is a gene the product of which provides for biocide or viral resistance, resistance to heavy metals, prototrophy to auxotrophs, and the like.
- the first polypeptide is heterologous to the host cell.
- At least one of the one or more control sequences is heterologous to the first polynucleotide.
- the first guide RNA is complementary to one or more third protospacers, wherein during step c) the complex is formed at one or more third protospacer of the one or more LPP, creating a double-strand break at or near the one or more third protospacer.
- the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of two LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the two LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with two copies of the first donor polynucleotide in its genome.
- the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of six LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the six LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with six copies of the first donor polynucleotide in its genome.
- the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 9 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 9 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 9 copies of the first donor polynucleotide in its genome.
- the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 10 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 10 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 10 copies of the first donor polynucleotide in its genome.
- the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 11 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 11 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 11 copies of the first donor polynucleotide in its genome.
- the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 12 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 12 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 12 copies of the first donor polynucleotide in its genome.
- the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 13 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 13 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 13 copies of the first donor polynucleotide in its genome.
- the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 14 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 14 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 14 copies of the first donor polynucleotide in its genome.
- the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 15 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 15 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 15 copies of the first donor polynucleotide in its genome.
- the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 16 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 16 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 16 copies of the first donor polynucleotide in its genome.
- step b) there are delivered at least two first guide RNAs, each first guide RNA forming a complex with the nuclease at a first, second, third, fourth, or fifth protospacer.
- the at least two first guide RNAs comprise at least two of: a 1 st first guide RNA, a 2 nd first guide RNA, a 3 rd first guide RNA, a 4 th first guide RNA, and a 5 th first guide RNA.
- the 1 st first guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 2 nd first guide RNA, to the polynucleotide sequence of 3 rd first guide RNA, to the polynucleotide sequence of the 4 th first guide RNA, and to the polynucleotide sequence of the 5 th first guide RNA.
- the 2 nd first guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1 st first guide RNA, to the polynucleotide sequence of 3 rd first guide RNA, to the polynucleotide sequence of the 4 th first guide RNA, and to the polynucleotide sequence of the 5 th first guide RNA.
- a sequence identity of below 100% such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below
- the 3 rd first guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1 st first guide RNA, to the polynucleotide sequence of 2 nd first guide RNA, to the polynucleotide sequence of the 4 th first guide RNA, and to the polynucleotide sequence of the 5 th first guide RNA.
- the 4 th first guide RNA has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1 st first guide RNA, to the polynucleotide sequence of 2 nd first guide RNA, to the polynucleotide sequence of the 3 rd first guide RNA, and to the polynucleotide sequence of the 5 th first guide RNA.
- a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below
- the 5 th first guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1 st first guide RNA, to the polynucleotide sequence of 2 nd first guide RNA, to the polynucleotide sequence of the 3 rd first guide RNA, and to the polynucleotide sequence of the 4 th first guide RNA.
- a sequence identity of below 100% such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below
- the one or more first guide RNA comprises a 1 st first guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 1 st first guide RNA is complementary to a first protospacer sequence.
- the one or more first guide RNA comprises a 2 nd first guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 2 nd first guide RNA is complementary to a second protospacer sequence.
- the one or more first guide RNA comprises a 3 rd first guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 3 rd first guide RNA is complementary to a third protospacer sequence.
- the one or more first guide RNA comprises a 4 th first guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 4 th first guide RNA is complementary to a fourth protospacer sequence.
- the one or more first guide RNA comprises a 5 th first guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 5 th first guide RNA is complementary to a fifth protospacer sequence.
- the at least one guide RNA comprises a first guide RNA and a second guide RNA
- the method additionally comprises step d): d) cultivating the host cell at a second temperature allowing one or more second guide RNA of the at least one guide RNA to form a complex with the nuclease at one or more protospacer of the one or more LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one or more LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break.
- step b) there is additionally delivered one or more second polynucleotide comprising in 5’ to 3’ direction: a 5’ homology arm, a second polynucleotide encoding a second polypeptide of interest, and a 3’ homology arm, OR a set of one or more second PCR fragments comprising a second 3’ homology arm, a second 5’ homology arm, and a second polynucleotide encoding a second polypeptide of interest, wherein the 3’ homology arm is identical to the 3’ homology arm of one or more LPP and wherein the 5’ homology arm is identical to the 5’ homology arm of one or more LPP, and wherein during step c) the homology directed repair comprises insertion of the second donor polynucleotide or set of second PCR fragments at the double-strand break.
- step b) there is additionally delivered one or more first polynucleotide comprising in 5’ to 3’ direction: a second 5’ homology arm, a first polynucleotide encoding the first polypeptide of interest, and a second 3’ homology arm, OR a set of one or more first PCR fragments comprising a second 3’ homology arm, a second 5’ homology arm, and the first polynucleotide encoding the first polypeptide of interest, wherein the second 3’ homology arms and the second 5’ homology arms have less than 100% sequence identity to the 3’ and 5’ homology arms of the at least two LPP, respectively, e.g., less than 99%, less than 98%, less than 97%, less than 96%, less than 95%, less than 90%, less thank 85%, less than 80%, less than 75%, less than 70%, or less than 65%, and wherein during step c) the homology directed repair comprises insertion of the first donor polynucle
- the second guide RNA is complementary to one or more first protospacers, wherein during step d) the complex is formed at one or more first protospacer of the one or more LPP, creating a double-strand break at or near the one or more first protospacer.
- the second guide RNA is complementary to one or more second protospacers, wherein during step d) the complex is formed at one or more second protospacer of the one or more LPP, creating a double-strand break at or near the one or more second protospacer.
- the second guide RNA is complementary to one or more third protospacers, wherein during step d) the complex is formed at one or more third protospacer of the one or more LPP, creating a double-strand break at or near the one or more third protospacer.
- the second guide RNA is complementary to one or more fourth protospacers, wherein during step d) the complex is formed at one or more fourth protospacer of the one or more LPP, creating a double-strand break at or near the one or more fourth protospacer.
- the second guide RNA is complementary to one or more fifth protospacers, wherein during step d) the complex is formed at one or more fifth protospacer of the one or more LPP, creating a double-strand break at or near the one or more fifth protospacer.
- the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of one LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least one copy of the first donor polynucleotide in its genome.
- the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of two LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the two LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least two copies of the first donor polynucleotide in its genome.
- the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of three LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the three LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least three copies of the first donor polynucleotide in its genome.
- the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of four LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the four LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least four copies of the first donor polynucleotide in its genome.
- the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of five LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the five LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least five copies of the first donor polynucleotide in its genome.
- the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of six LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the six LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least six copies of the first donor polynucleotide in its genome.
- the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of 8 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 8 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least 8 copies of the first donor polynucleotide in its genome.
- the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of 16 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 16 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least 16 copies of the first donor polynucleotide in its genome.
- step b) there are delivered at least two second guide RNAs, each second guide RNA forming a complex with the nuclease at a first, second, third, fourth, or fifth protospacer.
- the at least two second guide RNAs comprise at least two of: a 1 st second guide RNA, a 2 nd second guide RNA, a 3 rd second guide RNA, a 4 th second guide RNA, and a 5 th second guide RNA.
- the 1 st second guide RNA has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 2 nd second guide RNA, to the polynucleotide sequence of 3 rd second guide RNA, to the polynucleotide sequence of the 4 th second guide RNA, and to the polynucleotide sequence of the 5 th second guide RNA.
- a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below
- the 2 nd second guide RNA has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1 st second guide RNA, to the polynucleotide sequence of 3 rd second guide RNA, to the polynucleotide sequence of the 4 th second guide RNA, and to the polynucleotide sequence of the 5 th second guide RNA.
- a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below
- the 3 rd second guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1 st second guide RNA, to the polynucleotide sequence of 2 nd second guide RNA, to the polynucleotide sequence of the 4 th second guide RNA, and to the polynucleotide sequence of the 5 th second guide RNA.
- the 4 th second guide RNA has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1 st second guide RNA, to the polynucleotide sequence of 2 nd second guide RNA, to the polynucleotide sequence of the 3 rd second guide RNA, and to the polynucleotide sequence of the 5 th second guide RNA.
- a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below
- the 5 th second guide RNA has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1 st second guide RNA, to the polynucleotide sequence of 2 nd second guide RNA, to the polynucleotide sequence of the 3 rd second guide RNA, and to the polynucleotide sequence of the 4 th second guide RNA.
- a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below
- the one or more second guide RNA comprises a 1 st second guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 1 st second guide RNA is complementary to a first protospacer sequence.
- the one or more second guide RNA comprises a 2 nd second guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 2 nd second guide RNA is complementary to a second protospacer sequence.
- the one or more second guide RNA comprises a 3 rd second guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 3 rd second guide RNA is complementary to a third protospacer sequence.
- the one or more second guide RNA comprises a 4 th second guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 4 th second guide RNA is complementary to a fourth protospacer sequence.
- the one or more second guide RNA comprises a 5 th second guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 5 th second guide RNA is complementary to a fifth protospacer sequence.
- the at least one guide RNA comprises a first guide RNA, a second guide RNA, and a third guide RNA
- the method additionally comprises the step e) cultivating the host cell at a third temperature allowing one or more third guide RNA of the at least one guide RNA to form a complex with the nuclease at one or more protospacer of the one or more LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one or more LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break.
- the third guide RNA is complementary to one or more first protospacers, wherein during step e) the complex is formed at one or more first protospacer of the one or more LPP, creating a double-strand break at or near the one or more first protospacer.
- the third guide RNA is complementary to one or more second protospacers, wherein during step e) the complex is formed at one or more second protospacer of the one or more LPP, creating a double-strand break at or near the one or more second protospacer.
- the third guide RNA is complementary to one or more third protospacers, wherein during step e) the complex is formed at one or more third protospacer of the one or more LPP, creating a double-strand break at or near the one or more third protospacer.
- the third guide RNA is complementary to one or more fourth protospacers, wherein during step e) the complex is formed at one or more fourth protospacer of the one or more LPP, creating a double-strand break at or near the one or more fourth protospacer.
- the third guide RNA is complementary to one or more fifth protospacers, wherein during step e) the complex is formed at one or more fifth protospacer of the one or more LPP, creating a double-strand break at or near the one or more fifth protospacer.
- the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of one LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least one copy of the first donor polynucleotide in its genome.
- the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of two LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the two LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least two copies of the first donor polynucleotide in its genome.
- the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of three LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the three LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least three copies of the first donor polynucleotide in its genome.
- the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of four LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the four LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least four copies of the first donor polynucleotide in its genome.
- the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of five LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the five LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least five copies of the first donor polynucleotide in its genome.
- the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of six LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the six LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least six copies of the first donor polynucleotide in its genome.
- the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of 8 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 8 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least 8 copies of the first donor polynucleotide in its genome.
- the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of 16 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 16 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least 16 copies of the first donor polynucleotide in its genome.
- step b) there are delivered at least two third guide RNAs, each third guide RNA forming a complex with the nuclease at a first, second, third, fourth, or fifth protospacer.
- the at least two third guide RNAs comprise at least two of: a 1 st third guide RNA, a 2 nd third guide RNA, a 3 rd third guide RNA, a 4 th third guide RNA, and a 5 th third guide RNA.
- the 1 st third guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 2 nd third guide RNA, to the polynucleotide sequence of 3 rd third guide RNA, to the polynucleotide sequence of the 4 th third guide RNA, and to the polynucleotide sequence of the 5 th third guide RNA.
- a sequence identity of below 100% such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below
- the 2 nd third guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1 st third guide RNA, to the polynucleotide sequence of 3 rd third guide RNA, to the polynucleotide sequence of the 4 th third guide RNA, and to the polynucleotide sequence of the 5 th third guide RNA.
- a sequence identity of below 100% such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below
- the 3 rd third guide RNA has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1 st third guide RNA, to the polynucleotide sequence of 2 nd third guide RNA, to the polynucleotide sequence of the 4 th third guide RNA, and to the polynucleotide sequence of the 5 th third guide RNA.
- a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below
- the 4 th third guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1 st third guide RNA, to the polynucleotide sequence of 2 nd third guide RNA, to the polynucleotide sequence of the 3 rd third guide RNA, and to the polynucleotide sequence of the 5 th third guide RNA.
- a sequence identity of below 100% such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below
- the 5 th third guide RNA has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1 st third guide RNA, to the polynucleotide sequence of 2 nd third guide RNA, to the polynucleotide sequence of the 3 rd third guide RNA, and to the polynucleotide sequence of the 4 th third guide RNA.
- a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below
- the one or more third guide RNA comprises a 1 st third guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 1 st third guide RNA is complementary to a first protospacer sequence.
- the one or more third guide RNA comprises a 2 nd third guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 2 nd third guide RNA is complementary to a second protospacer sequence.
- the one or more third guide RNA comprises a 3 rd third guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 3 rd third guide RNA is complementary to a third protospacer sequence.
- the one or more third guide RNA comprises a 4 th third guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 4 th third guide RNA is complementary to a fourth protospacer sequence.
- the one or more third guide RNA comprises a 5 th third guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 5 th third guide RNA is complementary to a fifth protospacer sequence.
- the at least one guide RNA comprises a first guide RNA, a second guide RNA, a third guide RNA, and a fourth guide RNA
- the method additionally comprises the step f) cultivating the host cell at a fourth temperature allowing one or more fourth guide RNA of the at least one guide RNA to form a complex with the nuclease at one or more protospacer of the one or more LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one or more LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break.
- the at least one guide RNA comprises a first guide RNA, a second guide RNA, a third guide RNA, a fourth guide RNA, and a fifth guide RNA
- the method additionally comprises the step g) cultivating the host cell at a fifth temperature allowing one or more fifth guide RNA of the at least one guide RNA to form a complex with the nuclease at one or more protospacer of the one or more LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one or more LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break.
- the first temperature is in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
- second temperature is in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
- the third temperature is in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
- the fourth temperature is in the range of 15 - 50 C°, such as in the range of 22 - 42
- the fifth temperature is in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
- the fifth temperature is in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
- the first temperature and the second temperature are in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
- the first temperature, the second temperature, and the third temperature are in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
- the first temperature, the second temperature, the third temperature, and the fourth temperature are in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
- the first temperature, the second temperature, the third temperature, the fourth temperature, and the fifth temperature are in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
- the first temperature differs from the second temperature by at least 0.5 C°, at least 1 C°, at least 1 .5 C°, at least 2 C°, at least 2.5 C°, at least 3 C°, at least 3.5 C°, at least 4 C°, at least 4.5 C°, at least 5 C°, at least 5.5 C°, or at least 6 C°.
- the second temperature differs from the third temperature by at least 0.5 C°, at least 1 C°, at least 1 .5 C°, at least 2 C°, at least 2.5 C°, at least 3 C°, at least 3.5 C°, at least 4 C°, at least 4.5 C°, at least 5 C°, at least 5.5 C°, or at least 6 C°
- the third temperature differs from the fourth temperature by at least 0.5 C°, at least 1 C°, at least 1 .5 C°, at least 2 C°, at least 2.5 C°, at least 3 C°, at least 3.5 C°, at least 4 C°, at least 4.5 C°, at least 5 C°, at least 5.5 C°, or at least 6 C°
- the fourth temperature differs from the fifth temperature by at least 0.5 C°, at least 1 C°, at least 1 .5 C°, at least 2 C°, at least 2.5 C°, at least 3 C°, at least 3.5 C°, at least 4 C°, at least 4.5 C°, at least 5 C°, at least 5.5 C°, or at least 6 C°
- the first temperature is selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
- the second temperature is selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
- the third temperature is selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
- the fourth temperature is selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
- the fifth temperature is selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
- the first temperature and the second temperature are selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°..
- the first temperature, the second temperature, and the third temperature are selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
- the first temperature, the second temperature, the third temperature, and the fourth temperature are selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
- the first temperature, the second temperature, the third temperature, the fourth temperature, and the fifth temperature are selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
- the first temperature is in the range of 22 - 40 C°
- the second temperature is in the range of 35 - 50 C°.
- the first donor polynucleotide does not comprise a selectable marker or counter- selectable marker.
- the first donor polynucleotide comprises a selectable or counter-selectable marker.
- step b) there is furthermore delivered a second donor polynucleotide comprising in 5’ to 3’ direction: a 5’ homology arm, a second polynucleotide encoding a second polypeptide of interest, and a 3’ homology arm, wherein the 3’ homology arm is identical to the 3’ homology arm of the host cell, and wherein the 5’ homology arm is identical to the 5’ homology arm of the host cell.
- the first polypeptide of interest comprises a brazzein, a casein, a patatin, an ovalbumin, an osteopontin, an ovotransferrin, an ovomucin, an ovomucoid, an ovostatin, a glycomacropeptide, a lactoferrin, an alpha-lactalbumin, e.g., bovine alpha-lactalbumin, a beta-lactalbumin and/or a collagen.
- an alpha-lactalbumin e.g., bovine alpha-lactalbumin, a beta-lactalbumin and/or a collagen.
- the first polypeptide of interest comprises a therapeutic polypeptide selected from the group consisting of an antibody, an antibody fragment, an antibody-based drug, a Fc fusion protein, an anticoagulant, a blood factor, a bone morphogenetic protein, an engineered protein scaffold, an enzyme, a growth factor, a blood clotting factor, a hormone, an interferon (such as an interferon alpha-2b), an interleukin, a lactoferrin, an alpha-lactalbumin, a beta-lactalbumin, an ovomucoid, an ovostatin, a cytokine, an obestatin, a human galactosidase (such as an human alpha-galactosidase A), a vaccine, a protein vaccine, and a thrombolytic.
- a therapeutic polypeptide selected from the group consisting of an antibody, an antibody fragment, an antibody-based drug, a Fc fusion protein, an anticoagulant, a blood
- first polypeptide of interest comprises an enzyme
- the enzyme is selected from the list of a hydrolase, isomerase, ligase, lyase, oxidoreductase, or transferase, e.g., an aminopeptidase, amylase, carbohydrase, carboxypeptidase, catalase, cellobiohydrolase, cellulase, chitinase, cutinase, cyclodextrin glycosyltransferase, deoxyribonuclease, endoglucanase, esterase, alpha-galactosidase, beta-galactosidase, glucoamylase, alpha-glucosidase, beta-glucosidase, invertase, laccase, lipase, mannosidase, mutanase, oxidase, pectinolytic enzyme, peroxidase, phytase, polyphenoloxidase,
- the second polypeptide of interest comprises a polypeptide selected from the list of a transcription factor (e.g. amyR), a chaperone (e.g. BipA, PdiA, or PdiB), a heme pathway enzyme (e.g. encoded by hemA, or hemB), or a glycosylation pathway enzyme (e.g. a Mannosidase I, a GIcNAc transferase, a galactosyltransferase, a sialyltransferase, an oligosaccharyltransferase, or an glucosidase).
- a transcription factor e.g. amyR
- a chaperone e.g. BipA, PdiA, or PdiB
- heme pathway enzyme e.g. encoded by hemA, or hemB
- a glycosylation pathway enzyme e.g. a Mannosidase I, a
- the method additionally comprises step h), subsequent of any of steps c), d), e), f) or g):
- RNA-guided DNA nuclease delivering to the host cell a RNA-guided DNA nuclease, and at least one guide RNA directed to one or more protospacer of one or more empty LPP, and
- the host cell at a temperature allowing the at least one guide RNA to form a complex with the nuclease at the one or more empty LPP, wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one or more empty LPP, wherein the homology directed repair comprises insertion of the first polynucleotide at the doublestrand break of the empty LPP.
- the present invention relates to a method of producing a polypeptide of interest, comprising cultivating a multi-copy host cell under conditions conducive for production of the polypeptide, said multi-copy host cell being generated by the method of the second aspect.
- the method further comprises recovering the polypeptide of interest.
- the host cell is cultivated in a nutrient medium suitable for production of the polypeptide using methods known in the art.
- the cell may be cultivated by shake flask cultivation, or small-scale or large-scale fermentation (including continuous, batch, fed-batch, or solid-state, and/or microcarrierbased fermentations) in laboratory or industrial fermentors in a suitable medium and under conditions allowing the polypeptide to be expressed and/or isolated.
- suitable media are available from commercial suppliers or may be prepared according to published compositions (e.g., in catalogues of the American Type Culture Collection). If the polypeptide is secreted into the nutrient medium, the polypeptide can be recovered directly from the medium. If the polypeptide is not secreted, it can be recovered from cell lysates.
- the polypeptide may be detected using methods known in the art that are specific for the polypeptide, including, but not limited to, the use of specific antibodies, formation of an enzyme product, disappearance of an enzyme substrate, or an assay determining the relative or specific activity of the polypeptide.
- the polypeptide may be recovered from the medium using methods known in the art, including, but not limited to, collection, centrifugation, filtration, extraction, spray-drying, evaporation, or precipitation.
- a whole fermentation broth comprising the polypeptide is recovered.
- a cell-free fermentation broth comprising the polypeptide is recovered.
- polypeptide may be purified by a variety of procedures known in the art to obtain substantially pure polypeptides and/or polypeptide fragments (see, e.g., Wingfield, 2015, Current Protocols in Protein Science; 80(1): 6.1.1-6.1.35; Labrou, 2014, Protein Downstream Processing, 1129: 3-10).
- polypeptide is not recovered.
- E.coli HST08 (Clontech) is used for plasmid construction and amplification. Amplified plasmids are recovered with Qiagen Plasmid Midi or Mini Kit (Qiagen). Ligation is done with NEBuilder HiFi DNA Assembly Cloning Kit (NEB) according to the manufactory instructions. GFXTM PCR DNA and Gel Band Purification Kit (Merck) is used for the purification of PCR fragments and extraction of DNA fragment from agarose gel.
- Enzymes for DNA manipulations e.g. restriction endonucleases, ligases etc.
- Enzymes for DNA manipulations e.g. restriction endonucleases, ligases etc.
- E. coli chemically competent host strain was incubated with DNA for 20 minutes on ice. The solution was then transferred to 42 degrees for 2 minutes and 20x Volume of SOC medium was added. The solution was then incubated at 37 degrees for 1 hour at 700 rpm before spreading on LB + Ampicillin plates.
- Transformation of Aspergillus species can be achieved using the general methods for yeast transformation.
- Aspergillus oryzae host strain was inoculated in 100 ml of Sucrose medium (supplemented with 10 mM uridine if strain is pyrG-) and incubated for 16 h at 30°C at 200 rpm. Pellets were collected and washed with 0.6 M MgSO4, and resuspended in 10 ml 1.2 M MgSO4 containing a commercial beta- glucanase product (GLUCANEXTM, Novozymes A/S, Bagsvaerd, Denmark) at a final concentration of 40 mg per ml. The suspension was incubated at 30°C at 50 rpm until protoplasts were formed, and then washed twice with STC buffer.
- Sucrose medium supplied with 10 mM uridine if strain is pyrG-
- Pellets were collected and washed with 0.6 M MgSO4, and resuspended in 10 ml 1.2 M MgSO4 containing a commercial beta- glucan
- the protoplasts were resuspended to a final concentration of 1x10 7 protoplasts/ml. Approximately 1 pg of guide plasmid and 1 .5 ug of repair DNA was added to 100 pl of the protoplast suspension, mixed gently, and incubated 5 minutes at Room Temperature. 400ul of PEG solution was added and the protoplast suspension was incubated for 20 minutes at Room Temperature. After the addition of 8 ml of 50°C Sucrose Agar, the reaction was poured onto Sucrose Agar plates and the plates were incubated at 30°C for 4 days.
- COVE trace metals solution was composed of 0.04 g of NaB4O7 «10H2O, 0.4 g of CuSO4 «5H2O, 1.2 g of FeSO4-7H2O, 0.7 g of MnSO4-H2O, 0.8 g of Na2MoO2-2H20, 10 g of ZnSO4-7H2O, and deionized water to 1 liter.
- 50X COVE salts solution was composed of 26 g of KCI, 26 g of MgSO4 «7H2O, 76 g of KH2PO4, 50 ml of COVE trace metals solution, and deionized water to 1 liter.
- Sucrose Medium was composed of 342.3 g of sucrose, 20 ml of 50X COVE salts solution, 10 ml of 1 M Urea, and deionized water to 1 liter.
- Sucrose Agar were composed of 342.3 g of sucrose, 20 ml of 50X COVE salts solution, 10 ml of 1 M Urea, 20g of granulated agar, and deionized water to 1 liter. If bar selection is used, Basta (Bayer A/S) is added to a final concentration of 0.11 % glufosinat (w/V).
- STC buffer was composed of 0.8 M sorbitol, 25 mM Tris pH 8, and 25 mM CaCI2.
- PEG solution was composed of 60% PEG 4000 in STC buffer.
- LB plus Ampicillin plates were composed of 10 g of tryptone, 5 g of yeast extract, 5 g of sodium chloride, 15 g of Bacto agar, ampicillin at 150 pg per ml, and deionized water to 1 liter.
- SOC medium was composed of 20 g of tryptone, 5 g of yeast extract, 0.5 g of NaCI, 10 ml of 250 mM KCI, and deionized water to 1 liter.
- COVE Urea Zeocin plates were composed of 342.3 g of sucrose, 20 ml of COVE salts solution, 10 ml 1 M urea, 25 g of Noble agar, and deionized water to 1 liter. The pH was adjusted to pH 6.5. After sterilisation, 1 mL 100 mg/mL Zeocin (GibcoTM, Thermo Fisher Scientific).
- PDA plates were composed of 39 g of potato dextrose agar (Difco) and deionized water to 1 liter.
- PEG+G buffer was composed of 60% polyethylene glycol (PEG) 4000, 20% w/v glucose, 10 mM Tris-HCI pH 7.5, and 10 mM CaC12 in deionized water. The solution is filter sterilized.
- STC+G was composed of 1 M sorbitol, w/v glucose, 10 mM Tris pH 7.5, and 10 mM CaC12 in deionized water.
- STC was composed of 1 M sorbitol, 10 mM Tris pH 7.5, and 10 mM CaC12 in deionized water.
- YPD medium was composed of 1 % yeast extract, 2% peptone, and 2% glucose in deionized water.
- Trichoderma reesei strain SAMF128-2A 11-1 is described in WO 20112911 .
- ddPCR Digital Droplet PCR
- the ddPCR assays was run according to protocols available for the QX200 Bio-Rad ddPCR system. DNA was prepared by inoculating fungal spores in water added glass beads shaking vigorously for 2 minutes. An aliquot of DNA extract was transferred to the in-digest ddPCR solution. For digestion of DNA, one of the four restriction enzymes Msel, Hindlll, Alul or Haelll was used according to the suggestions by Bio-Rad for use in ddPCR assays. Probe assays (mix of fluorescent probe and PCR primers) were acquired from Integrated DNA Technologies (IDT). The assays were designed using the PrimerQuestTM Tool supplied by IDT.
- T. reesei was cultivated in two shake flasks, each containing 25 mL of Y PD medium, at 27°C for 17 hours with gentle agitation at 90 rpm.
- Mycelia were collected by filtration using a Vacuum Driven Disposable Filtration System (Millipore) and washed twice with deionized water and twice with 1 .2 M sorbitol.
- Protoplasts were generated by suspending the washed mycelia in 30 mL of 1.2 M sorbitol containing 5 mg of YATALASE TM (Takara Bio USA, Inc.) per ml and 0.5 mg of chitinase (Sigma Chemical Co.) per ml for 60-75 minutes at 34°C with gentle shaking at 90 rpm. Protoplasts were collected by centrifugation at 834 x g for 7 minutes and washed twice with cold 1 .2 M sorbitol. The protoplasts were counted using a hemocytometer and re-suspended to a final concentration of 1 x 10 8 protoplasts per ml of STC. Aliquots (1 .1 ml) of the protoplast solution were placed in a Mr. FROSTY TM freezing container (Thermo Fisher Scientific) at -800 C for later use (as described in W020123845).
- Transformation of Trichoderma species can be achieved using the general methods for yeast transformation.
- the preferred procedure for this invention is described below. Approximately 1 pg of D27XZX or D278ZE plasmid DNA and 1 ug of plasmid D26V2Q DNA were combined (and added to 100 pl of the protoplast suspension of strain TT540 and then mixed gently. Then 250 pl PEG+G was added to the DNA-protoplast mixture, mixed gently and incubated at 34°C for 30 minutes. One ml of STC+G was added, the protoplast suspension was mixed gently and poured onto Cove Urea Zeocin agar plates or PDA agar plates (with overlay with Hygromycin). The plates were incubated at 30°C for 8-10 days.
- Transformants were picked to PDA agar and incubated at 30°C for 5-7 days.
- the strains were spore purified by diluting spores from the PDA agar plates in water and spreading onto Cove agar for a second round of selection (as described in W020123845).
- D27XZX or D278ZE plasmid DNA was integrated into the genome by homologous recombination using the 5' and 3' flanks of the 70883 locus contained in the plasmids.
- the plasmid sequence between these 70883 homology flanks was integrated into the genome replacing the 70883 coding sequence.
- Transformants were selected for using the amdS selection marker contained between the 70883 flanking sequence within the plasmid.
- the resulting strains with the 70883 replacement and integrated intervening plasmid DNA sequence were named 0184PQ 5 (D27XZX in 016VA2), 01792Q (D278ZE in 016VA2), 0253QJ (D27XZX in 0154NN) and 016E5W (D278ZE in 0154NN).
- a landing pad poynucleotide is a DNA sequence consisting of three components, a protospacer array sequence consisting of one or more CRISPR target sites flanked by a left sequence and a right sequence. The two flanking sequences are used for integration of a repair construct by homologous recombination resulting in repair of the CRISPR guide facilitated chromosomal breaks in the landing pad.
- the left flank named SYN2L (SEQ ID NO: 1)
- SYN2L consists of 1000 base pairs (bp), i.e., having a length of 1000 nucleotides (nt).
- the SYN2L sequence was randomly generated by a python script setting the AT content to 52%.
- the right flank was named SYN2R (SEQ ID NO: 2), which is a 1000 bp sequence.
- the protospacer array consists of five protospacer sequences.
- the protospacer sequences for the arrays were randomly generated by a python script setting the AT content to 52% on average.
- the protospacers had all been tested to function at 30 degrees Celsius using MAD7 as the CRISPR enzyme.
- the sequence TTTC was chosen for all protospacers in the protospacer arrays.
- the location of the PAM sequence for each protospacer is shown in Figure 3.
- the four protospacer arrays of the four LPPs all contain two first protospacer sequences #1 A and #1 B.
- One first protospacer sequence #1 A is PS-59 (SEQ ID NO: 3) which is present in all four protospacer arrays
- another first protospacer sequence #1 B is PS-73 (SEQ ID NO: 4) which is also present in all four protospacer arrays.
- each of the protospacer arrays contained three additional protospacers: a second protospacer (#2), a third protospacer (#3), and a fourth protospacer (#4).
- These protospacer sequences are either present in the protospacer arrays of three landing pads in total, two landing pads in total or are unique to the respective protospacer arrays.
- Table 2 For an overview of the presence of protospacer sequences for each of the four landing pads LPP#1-LPP#4 see Figure 1 , and below Table 2.
- chromosomal loci For building the Aspergillus oryzae recipient strain, four chromosomal loci were chosen, which are present on four different chromosome arms to ensure genetic stability in the experiment.
- pepAb locus on chromosome 1 the pepE locus on chromosome 2
- amdS locus on chromosome 6 the pepAa locus on chromosome 8.
- the four LPPs were inserted at the four chromosomal loci using four CRISPR guide RNAs targeting each of the four loci as well as four repair fragments, which contained the four LPPs flanked by loci specific repair sequences. While integrating the LPPs, the loci specific genes were deleted thereby removing the guide targeting sequence.
- the resulting Aspergillus oryzae recipient strain AT5885 containing the four LPPs at the four different chromosomal loci was confirmed by genome sequencing.
- the recipient strain AT5885 is ApyrG and therefore requires supplementation of uridine to be able to grow.
- LBP#1 the five protospacers PS-59, PS-73 PS-74, PS-75 and PS-76 for integration in the pepAb locus;
- LBP#2 the five protospacers PS-59, PS-73, PS-74, PS-75 and PS-11 for integration in the pepE locus;
- LEP#4 the five protospacers PS-59, PS-73, PS-48, PS-89 and PS-94 for integration in the pepAa locus.
- Example 2 One-step integration of 4 gene copies into 4 landing pad polynucleotides (LPP) using one gRNA sequence
- the recipient strain AT5885 comprising 4 LPPs was made transformable by protoplasting mycelia.
- gRNA CRISPR guide RNA
- pAT4859 comprises a first gRNA (SEQ ID NO:18) enabling cutting at the protospacer sequence PS-59
- pAT4873 comprises a first gRNA (SEQ ID NO: 19) enabling cutting at the protospacer sequence PS-73.
- Both guide RNA plasmids carry the pyrG gene for maintenance in the recipient strain upon transformation.
- An overview of designed gRNAs is shown in Table 3.
- Protoplasts of AT5885 were transformed with pAT4859 and repair DNA and transformants were selected on minimal plates without addition of uridine.
- the transformation efficiency was 100 transformants I ug of repair DNA.
- 20 transformants were spore reisolated twice and the copy number was determined by ddPCR.
- 75% of the strains had the expected four copies of the gene of interest, i.e. one copy per LPP.
- ddPCR was used to verify that the protospacer arrays in the four LPPs were no longer present in those 75% correct transformants, which indicates that the repair and insertion was successful.
- Example 3 One-step integration of 3 gene copies into 3 LPPs, using 2 different gRNA sequences
- AT5885 comprising 4 LPPs
- AT6412 was made pyrG+ resulting in strain AT6412.
- pAT6716 One CRISPR guide RNA plasmid was built (pAT6716), which enables cutting of three of the four LPPs in the recipient strain.
- pAT6716 comprises gRNA sequences (SEQ ID NO: 20) that allow DNA cutting at the protospacer sequence PS-75 (present in LPP#1 and LPP#2), and guide RNA (SEQ ID NO: 21) enabling cutting at the protospacer sequence PS-94 (present in LPP#4).
- the cutting of the three LPPs is indicated in Figure 2a.
- the guide RNA plasmid carries the bar gene for maintenance in the recipient strain upon transformation on BASTA plates.
- REPAIR1 contained a sequence of interest/gene of interest (GOI) flanked by the repair flanks (homology arms) SYN2L (SEQ ID NO: 1) and SYN2R (SEQ ID NO: 2).
- the GOI was subject to integration via homology directed repair at the sites were double strand breaks were introduced, see Figure 2b.
- Protoplasts of AT6412 were transformed with pAT6716 and repair DNA construct REPAIR1 .Transformants were selected on minimal plates without addition of uridine. The transformation efficiency was 100+ transformants I ug of repair DNA.
- Example 4 is an example for ’’step h)” of the method according to the second aspect of the invention.
- the AT6412 host strain comprising 4 LPPs (generated in Example 1) was used for this experiment.
- the method described in example 2 is not preferred for making multi-copy strains for such proteins of interest.
- a guide plasmid, pAT6654, comprising a gRNA sequence (SEQ ID NO: 20) enabling cutting of PS- 75 present in two (LPP#1 and LPP#2) of the four LPPs in AT6412 was made.
- a guide plasmid, pAT6657, comprising a gRNA sequence (SEQ ID NO: 21) enabling cutting of PS- 94 present in one (LPP#4) of the four LPPs in AT6412 was made.
- a guide plasmid, pAT6327, comprising a gRNA sequence (SEQ ID NO: 18) enabling cutting of PS- 59 present in all four of the landing pads in AT6412 was made.
- step 1 the guide plasmid pAT6654 was transformed in AT6412 together with small amounts of REPAIR2 DNA, so the cells could survive the exposure.
- Transformants were spore isolated and strains having two copies of REPAIR2 integrated, one copy into LPP#1 and one copy into LPP#2, were characterized using ddPCR to determine the copy number and the integrity of the two other landing pads were determined as well using ddPCR.
- the final 2-copy strain was named AT6712.
- step 2a the guide plasmid pAT6557, cutting one of two remaining “empty” landing pads (LPP#4), was transformed in AT6712. Transformants were spore isolated and strains having three copies of REPAIR2 integrated were characterized by ddPCR. Correct transformants tested had the REPAIR2 fragment copied from one of the already occupied loci into the third locus LPP#4, resulting in a 3-copy strain.
- step 2b the guide plasmid pAT6327, cutting both of the two remaining “empty” landing pads (LPP#3 and LPP#4), was transformed in AT6712. Transformants were spore isolated and strains having four copies of REPAIR2 integrated were characterized by ddPCR. Correct transformants tested had the REPAIR2 fragment copied from the already occupied loci to both loci LPP#3 and LPP#4, resulting in a 4- copy strain.
- the host cells are transformed with high amount of DNA to increase integration efficiency and to secure integration into all desired landing pads.
- This high DNA load can be toxic for some genes of interest and/or for some host cells.
- multi-copy strains can also be generated for genes of interest which are toxic for the cells, as the presented two-step approach can keep the exposure to the toxic DNA at a minimum.
- Example 5 Possible arrangements for protospacer presence in up to 16 landing pads.
- Table 4 shows exemplary arrangements for protospacers in up to 16 landing pads.
- Example 1 of Table 4 shows a host cell with 16 landing pads (LPP#1 - LPP#16). Each LPP comprises between 1-5 protospacer sequences. Protospacer #1 is present in only one LPP. Protospacer #2 is present in two LPPs. Protospacer #3 is present in 4 LPPs. Protospacer #4 is present in 8 LPPs, and protospacer #5 is present in all 16 LPPs. Depending on how many copies of the gene of interest shall be incorporated into the host cell, a suitable protospacer can be targeted.
- Example 2 of Table 4 shows a host cell with 16 landing pads (LPP#1 - LPP#16). Each LPP comprises between 1-5 protospacer sequences. Protospacer #1 is present in only one LPP. Protospacer #2 is present in 4 LPPs. Protospacer #3 is present in 6 LPPs. Protospacer #4 is present in 10 LPPs, and protospacer #5 is present in all 16 LPPs. Depending on how many copies of the gene of interest shall be incorporated into the host cell, a suitable protospacer can be targeted.
- a landing pad polynucleotide is a DNA sequence consisting of three components, a protospacer array sequence consisting of one or more CRISPR target sites flanked by a left sequence and a right sequence. The two flanking sequences are used for integration of a repair construct by homologous recombination resulting in repair of the CRISPR guide facilitated chromosomal breaks in the landing pad.
- the left flank named SYN2L (SEQ ID NO: 1)
- SYN2L consists of 1000 base pairs (bp), i.e., having a length of 1000 nucleotides (nt).
- the SYN2L sequence was randomly generated by a python script setting the AT content to 52%.
- the right flank was named SYN2R (SEQ ID NO: 2), which is a 1000 bp sequence.
- the protospacer array consists of five protospacer sequences.
- the protospacer sequences for the arrays were randomly generated by a python script setting the AT content to 52% on average.
- the protospacers had all been tested to function at 30 degrees Celsius using MAD7 as the CRISPR enzyme.
- the sequence TTTC was chosen for all protospacers in the protospacer arrays.
- the location of the PAM sequence for each protospacer is shown in Figure 3.
- the four protospacer arrays of the four LPPs in Trichoderma cells all contain two first protospacer sequences #1A and #1 B.
- One first protospacer sequence #1A is PS-59 (SEQ ID NO: 3) which is present in all four protospacer arrays
- another first protospacer sequence #1 B is PS-73 (SEQ ID NO: 4) which is also present in all four protospacer arrays.
- each of the protospacer arrays contained three additional protospacers: a second protospacer (#2), a third protospacer (#3), and a fourth protospacer (#4).
- These protospacer sequences are either present in the protospacer arrays of three landing pads in total, two landing pads in total or are unique to the respective protospacer arrays.
- Table 5 For an overview of the presence of protospacer sequences for each of the four landing pads LPP#1-LPP#4 see Table 5.
- Trichoderma reesei recipient strains four chromosomal loci were chosen, which are present on four different chromosome arms to ensure genetic stability in the experiment.
- the four LPPs were inserted at the four chromosomal loci using four CRISPR guide RNAs targeting each of the four loci as well as four repair fragments, which contained the four LPPs flanked by loci specific repair sequences. While integrating the LPPs, the loci specific genes were deleted thereby removing the guide targeting sequence.
- Trichoderma reesei recipient strain TT540 containing the four LPPs at the four different chromosomal loci was confirmed by digital droplet PCR.
- the four Trichoderma protospacer arrays have been listed as:
- - LPP#T2 the five protospacers PS-59, PS-73, PS-89, PS-94 and PS-48 for integration in the tf37062 locus;
- - LPP#T4 the five protospacers PS-59, PS-73, PS-74, PS-79 and PS-80 for integration in the xyn2 locus.
- Example 7 One-step integration of 4 gene copies into 4 landing pad polynucleotides (LPP) using one gRNA sequence
- the recipient Trichoderma reesei strain TT540 comprising 4 LPPs was made transformable by protoplasting mycelia.
- One CRISPR guide RNA (gRNA) plasmids was built (pTT397) containing a gRNA which enables cutting of all four LPPs in the recipient strain.
- the plasmid comprised the first gRNA (SEQ ID NO:18) enabling cutting at the protospacer sequence PS-59.
- Plasmid pTT397 carries the hygromycin marker-gene for maintenance in the recipient strain upon transformation.
- Table 6 An overview of designed gRNAs is shown in Table 6.
- Protoplasts of TT540 were transformed with pTT397 and repair DNA (Ascl-digested pTT391 ; EcoL1028-lysozyme; WO2021202479-A1) and transformants were selected on PDA plates with hygromycin addition. Ten transformants were spore reisolated twice and the copy number was determined by ddPCR. 75% of the transformants had the expected four copies of the gene of interest, i.e. one copy per LPP.
- Example 8 One-step multiplex integration of two different genes into 4 LPPs, using one gRNA sequence
- REPAIR4 contained a first sequence of interest/gene of interest (GOI), i.e. pTT391 : EcoL1028 lysozyme (WO2021202479-A1), flanked by the homology arms SYN2L (SEQ ID NO: 1) and SYN2R (SEQ ID NO: 2).
- REPAIRS contained a second sequence of interest/gene of interest (GOI), i.e.
- pTT508 AfBX beta- xylosidase (WO2022127892-A1), flanked by the homology arms SYN2L (SEQ ID NO: 1) and SYN2R (SEQ ID NO: 2. 5-10 transformants were spore reisolated twice and the copy number for each gene of interest was determined by ddPCR. In all transformants the four LLPs were occupied with the genes of interest. We obtained varying copy number ratios of lysozyme versus beta-xylosidase, containing the combinations: zero to four (0:4), one to three (1 :3), and three to one (3:1).
- the SPADE technology proves to be efficient for targeted integration into multiple landing pads using different GOI sequences.
- the shown one-step multiplexing approach allows easy combination of GOIs in strain construction.
- Example 9 Efficiency of applied flank length for one-step integration of one gene copy into 1 LPPs using one gRNA sequence
- the Trichoderma reesei recipient strain TT499 comprising 1 LPP was made transformable by protoplasting mycelia.
- One CRISPR guide RNA (gRNA) plasmids was built (pTT505) containing a gRNA which enables cutting the LPP in the recipient strain.
- the plasmid comprised the first gRNA (SEQ ID NO:18) enabling cutting at the protospacer sequence PS-59.
- Plasmid pTT505 carries the zeocin marker-gene for maintenance in the recipient strain upon transformation.
- Table 6 An overview of designed gRNAs is shown in Table 6.
- the repair flanks are varying in length from 1000, 750, 500, 250, 200, 150, to 100-bp, being shortened at the outside.
- the gene of interest is EcoL1028 lysozyme (WO2021202479-A1).
- Protoplasts of TT499 were transformed with pTT505 and the seven different repair DNAs and transformants were selected on Cove plates supplemented with urea and zeocin.
- transformants For each transformation applying the various repair constructs, we obtained transformants.
- the transformants obtained from each transformation were spore reisolated twice and the copy number was determined by ddPCR. All transformants had the expected single copy of the gene of interest. This example shows that SPADE integration can be performed using minimal 100-bp flanks.
- Example 10 One step generation of a 4-copy lipase strain using a single gRNA sequence
- chromosomal loci For building the Aspergillus oryzae recipient strain, four chromosomal loci were chosen, which are present on four different chromosome arms to ensure genetic stability in the experiment.
- pepAb locus on chromosome 1 the pepE locus on chromosome 2
- amdS locus on chromosome 6 the pepAa locus on chromosome 8.
- the four LPPs were inserted at the four chromosomal loci using four CRISPR guide RNAs targeting each of the four loci as well as four repair fragments, which contained the four LPPs flanked by loci specific repair sequences. While integrating the LPPs, the loci specific genes were deleted thereby removing the guide targeting sequence.
- LBP#1 the five protospacers PS-59, PS-73, PS-74, PS-75 and PS-76 for integration in the pepAb locus;
- LBP#2 the five protospacers PS-59, PS-73, PS-74, PS-75 and PS-11 for integration in the pepE locus;
- Protoplast of the strain A2F0012 (pyrG-) containing LPP #1 , #2, #3 and #4 was transformed by using CRISPR guide RNA (gRNA) pAT4859 containing a gRNA which enables cutting of all four LPPs in the recipient strain.
- gRNA CRISPR guide RNA
- pAT4859 comprises a gRNA (SEQ ID NO:18) enabling cutting at the protospacer sequence PS-59 present on all four LPPs, the guide RNA plasmids carry the pyrG gene for maintenance in the recipient strain upon transformation.
- the Protoplasts of A2F0012 were transformed with pAT4859 and the 3 repair DNA fragments, the transformation was done on an automated platform using a Hamilton robots, the transformants were plated on sucrose plates without addition of uridine, and grown 4 days at 30°C, the transformation efficiency was 100 transformants; 5 transformants were selected, picked and transferred to sucrose agar in with urea and uridine MTP, the plate was incubated 3 days at 30°C, the copy number was determined by ddPCR giving 80% of transformants with 4 copy integrated.
- ddPCR was used to verify that the protospacer arrays in the four LPPs (PS-59) were no longer present additionally the LPP were PCR amplified for nanopore sequenced to confirm copy number integrated on each of the LPP and verify the integrated expression cassette in those 80% correct transformants, which indicates that the repair and insertion was successful.
- a fermentation on microscale was done by making a spore solution of the successful transformants, those were inoculated on MTP containing 180pl of liquid sucrose media, urea and uridine and incubated for seven days at 30°C, after 24 hours of incubation 10pl 10,75 concentration maltose was added, on day 7 SDS page gels were made and enzyme activity assay was run to verify the enzyme production and validating the activity on the produced enzyme.
- the AT5885 strain described in Example 1 comprising 4 LPPs was used as starting point.
- Two additional chromosomal loci were chosen for inserting LPP#5 and LPP#6 into AT5885.
- LPP#5 was inserted on chromosome 3 between gene AQ090026000695 and AQ090026000696 at the position 786 nucleotides upstream from the translation initiation codon of gene AQ090026000695.
- the final strain was named AT6818, see Table 7 for an overview.
- LBP#1 the five protospacers PS-59, PS-73, PS-74, PS-75 and PS-76 for integration in the pepAb locus;
- - SEQ ID NO: 15 (LPP#2): the five protospacers PS-59, PS-73, PS-74, PS-75 and PS-11 for integration in the pepE locus;
- - SEQ ID NO: 16 (LPP#3): the five protospacers PS-59, PS-73, PS-48, PS-49 and PS-93 for integration in the amdS locus
- - SEQ ID NO: 17 (LPP#4): the five protospacers PS-59, PS-73, PS-48, PS-89 and PS-94 for integration in the pepAa locus.
- - SEQ ID NO: 33 (LPP#5): the five protospacers PS-59, PS-73, PS-74, PS-79 and PS-80 for integration at the A0090026000695 locus.
- LBP#6 the five protospacers PS-59, PS-73, PS-48, PS-89 and PS-90 for integration in the nprA locus.
- Table 7. Overview of strains, plasmids and PCR fragments (DNA fragments) used in Example 11 .
- the primers were designed to include overlapping regions to allow for combination and integration of the 3 PCR-fragments in SYN2 landing pads in the A oryzae genome. The design is based on the method disclosed in W02020025357A1 (Novozymes A/S).
- the 750_4 primer contains a 11 bp overlap with the 750-3 fragment and the 750_5 primer contains a 26bp extension that overlaps with the 750-2 PCR-fragment. Together the overlapping primers generate a 37 bp overlap between the PCR fragments 750-2 and 750-3.
- Example 11 shows that the SPADE system is not limited to be used with (linearized) plasmids, but can also be used with PCR fragments comprising the different features of the SPADE system, to further streamline strain generation.
- the AT5885 strain was deleted in the three genes acvA, ipnA and aatA to remove the strains’ ability to produce Penicillin.
- the resulting Aspergillus oryzae recipient strain was named AT6230.
- a guide plasmid pAT4848, comprising gRNA sequence (SEQ ID NO: 13) enabling cutting of PS- 48 present in two LPP (LPP#3 and LPP#4) of the four LPPs in AT6230 was made.
- stepl the guide plasmid pAT4848 was transformed together with REPAIR-14 into AT6230 obtaining a strain A2F0285 having four copies of the Alpha-Lactalbumin gene with two copies inserted in LPP#3 and two copies inserted in LPP#4.
- step2 the guide plasmid pAT4873 was transformed together with REPAIR-15 into A2F0285. 20 Transformants were tested by ddPCR counting the number of Alpha-Lactalbumin genes and pdiA genes. Using this procedure, we obtained three different outcome strains: a. 8 copies of Alpha-Lactalbumin, 0 copies of pdiA. b. 6 copies of Alpha-Lactalbumin, 1 copy of pdiA. c. 4 copies of Alpha-Lactalbumin, 2 copies of pdiA.
- a host cell comprising in its genome at least two landing pad polynucleotide (LPP) sequences, each LPP sequence comprising in 5’ to 3’ direction: a 5’ homology arm, two or more protospacer sequences, and a 3’ homology arm, wherein the polynucleotide sequence of the 5’ homology arm is identical for at least two LPP sequences, and wherein the polynucleotide sequence of the 3’ homology arm is identical for at least two LPP sequences.
- LPP landing pad polynucleotide
- the host cell according to any one of the previous paragraphs, wherein the cell comprises in its genome at least three LPP, and wherein at least one LPP comprises second 3’ and second 5’ homology arms having less than 100 % sequence identity to the 3’ and 5’ homology arms of the at least two LPP, respectively, e.g., less than 99%, less than 98%, less than 97%, less than 96%, less than 95%, less than 90%, less thank 85%, less than 80%, less than 75%, less than 70%, or less than 65%.
- each LPP comprises two protospacer sequences, such as a first protospacer, and a second protospacer.
- each LPP comprises three different protospacer sequences, such as a first protospacer, a second protospacer, and a third protospacer.
- each LPP comprises four different protospacer sequences, such as a first protospacer, a second protospacer, a third protospacer, and a fourth protospacer.
- each LPP comprises five different protospacer sequences, such as a first protospacer, a second protospacer, a third protospacer, a fourth protospacer, and a fifth protospacer.
- the first protospacer has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the second protospacer, to the polynucleotide sequence of the third protospacer, to the polynucleotide sequence of the fourth protospacer, and to the polynucleotide sequence of the fifth protospacer.
- a sequence identity of below 100% such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%,
- the second protospacer has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the first protospacer, to the polynucleotide sequence of the third protospacer, to the polynucleotide sequence of the fourth protospacer, and to the polynucleotide sequence of the fifth protospacer.
- the third protospacer has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the first protospacer, to the polynucleotide sequence of the second protospacer, to the polynucleotide sequence of the fourth protospacer, and to the polynucleotide sequence of the fifth protospacer.
- a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below
- the fourth protospacer has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the first protospacer, to the polynucleotide sequence of the second protospacer, to the polynucleotide sequence of the third protospacer, and to the polynucleotide sequence of the fifth protospacer.
- the fifth protospacer has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the first protospacer, to the polynucleotide sequence of the second protospacer, to the polynucleotide sequence of the third protospacer, and to the polynucleotide sequence of the fourth protospacer.
- each LPP comprises more than five different protospacer sequences.
- each protospacer sequence has a length of at least 15 nt, at least 20 nt, or at least 25 nt, preferably each protospacer has a length of 20 - 30 nt.
- the host cell comprises two LPP sequences, three LPP sequences, four LPP sequences, 5 LPP sequences, 6 LPP sequences, 7 LPP sequences, 8 LPP sequences, 9 LPP sequences, 10 LPP sequences, 11 LPP sequences, 12 LPP sequences, 13 LPP sequences, 14 LPP sequences, 15 LPP sequences, 16 LPP sequences, or more than 16 LPP sequences.
- the host cell comprises at least two chromosomes, at least four chromosomes, at least six chromosomes or at least 8 chromosomes.
- each chromosome comprises a total of 1 LPP per chromosome.
- each chromosome comprises a total of 2 LPP per chromosome.
- each chromosome comprises a total of 3 LPP per chromosome.
- each chromosome comprises a total of 4 LPP per chromosome.
- each 3’ homology arm and each 5’ homology arm has a length of at least 10 nt, at least 20 nt, at least 30 nt, at least 50 nt, at least 100 nt, 200 nucleotides (nt), at least 300 nt, at least 400 nt, at least 500 nt, at least 600 nt, at least 700 nt, at least 800 nt, at least 900 nt, or at least 1000 nt.
- each 3’ homology arm and each 5’ homology arm has a length of at least 100 nt.
- each 3’ homology arm and each 5’ homology arm has a length of at least 800 nt, at least 900 nt, or at least 1000 nt.
- one or more LPP of the at least two LPP comprise a selectable marker, and/or a counter-selectable marker.
- the selectable marker comprises a marker selected from the list of a pyrG gene, a fcy1 gene, an adeA gene, an adeB gene, a metF gene, a metG gene, a metH gene, a lysF gene, a trpC gene, a sC gene, an argB gene, and a gene encoding a fluorescent protein, such as a GFP, YFP, CFP, DsRed, or a eqPF611 .
- the counter-selectable marker comprises a marker selected from the list of a pyrG gene, a fcy1 gene, an adeA gene, an adeB gene, a metF gene, a metG gene, a metH gene, a lysF gene, a trpC gene, a sC gene, an argB gene, and a gene encoding a fluorescent protein, such as a GFP, YFP, CFP, DsRed, or a eqPF611 .
- each DNA strain complementary to the two or more protospacers sequences comprises a PAM sequence (protospacer adjacent motif).
- the first, second, third, fourth or fifth protospacer sequence is selected from a sequence having a sequence identity of at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEW ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 1 1 , SEQ ID NO: 12, or SEQ ID NO: 13.
- the host cell according to any one of the previous paragraphs, wherein the 5’ homology arm sequence comprises or consists of SEQ ID NO: 1 or SEQ ID NO: 2.
- the host cell according to any one of the previous pharagraphs, wherein the 5’ homology arm sequences comprises or consists of a polynucleotide sequence having a sequence identity of at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to SEQ ID NO: 1 or SEQ ID NO: 2.
- each of the at least two LPP comprises a first protospacer sequence with a polynucleotide sequence identical for each first protospacer of each at least two LPP.
- each of the at least two LPP comprises a second protospacer sequence with a polynucleotide sequence which is unique for each second protospacer of each at least two LPP.
- each of the at least two LPP comprises a third protospacer sequence with a polynucleotide sequence which is identical for each third polynucleotide within a third subset of the at least two LPPs.
- the third subset consists of 2 LPP sequences, 3 LPP sequences, 4 LPP sequences, 5 LPP sequences, 6 LPP sequences, 7 LPP sequences, 8 LPP sequences, 9 LPP sequences, 10 LPP, sequences, 11 LPP sequences, 12 LPP sequences, 13 LPP sequences, 14 LPP sequences, or 15 LPP sequences.
- each of the at least two LPP comprises a fourth protospacer sequence with a polynucleotide sequence which is identical for each fourth polynucleotide within a fourth subset of the at least two LPPs.
- the fourth subset consists of 2 LPP sequences, 3 LPP sequences, 4 LPP sequences, 5 LPP sequences, 6 LPP sequences, 7 LPP sequences, 8 LPP sequences, 9 LPP sequences, 10 LPP, sequences, 11 LPP sequences, 12 LPP sequences, 13 LPP sequences, 14 LPP sequences, or 15 LPP sequences.
- each of the at least two LPP comprises a fifth protospacer sequence with a polynucleotide sequence which is identical for each fifth protospacer within a fifth subset of the at least two LPPs.
- the host cell according to any one of the previous paragraphs, wherein the fifth subset consists of 2 LPP sequences, 3 LPP sequences, 4 LPP sequences, 5 LPP sequences, 6 LPP sequences, 7 LPP sequences, 8 LPP sequences, 9 LPP sequences, 10 LPP, sequences, 11 LPP sequences, 12 LPP sequences, 13 LPP sequences, 14 LPP sequences, or 15 LPP sequences.
- the host cell comprises four LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the four LPPs.
- the host cell comprises five LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the five LPPs.
- the host cell comprises five LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the five LPPs.
- the host cell comprises six LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the six LPPs.
- the host cell comprises six LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the six LPPs.
- the host cell comprises six LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the six LPPs.
- the host cell according to any one of the previous paragraphs, wherein the host cell comprises six LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the six LPPs.
- the host cell according to any one of the previous paragraphs, wherein the host cell comprises 7 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 7 LPPs.
- the host cell comprises 8 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 8 LPPs.
- the host cell according to any one of the previous paragraphs, wherein the host cell comprises 8 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 8 LPPs.
- the host cell according to any one of the previous paragraphs, wherein the host cell comprises 9 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 7 of the 9 LPPs.
- the host cell according to any one of the previous paragraphs, wherein the host cell comprises 10 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 7 of the 10 LPPs.
- the host cell according to any one of the previous paragraphs, wherein the host cell comprises 10 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 9 of the 10 LPPs.
- the host cell comprises 11 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 11 LPPs.
- the host cell according to any one of the previous paragraphs, wherein the host cell comprises 11 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 11 LPPs.
- the host cell according to any one of the previous paragraphs, wherein the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 12 LPPs.
- the host cell according to any one of the previous paragraphs, wherein the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 12 LPPs.
- the host cell according to any one of the previous paragraphs, wherein the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 12 LPPs.
- the host cell according to any one of the previous paragraphs, wherein the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 11 of the 12 LPPs.
- the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 13 LPPs.
- the host cell according to any one of the previous paragraphs, wherein the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 13 LPPs.
- the host cell according to any one of the previous paragraphs, wherein the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 11 of the 13 LPPs.
- the host cell according to any one of the previous paragraphs, wherein the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 12 of the 13 LPPs.
- the host cell according to any one of the previous paragraphs, wherein the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 14 LPPs.
- the host cell according to any one of the previous paragraphs, wherein the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 10 of the 14 LPPs.
- the host cell according to any one of the previous paragraphs, wherein the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 10 of the 15 LPPs.
- the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 16 LPPs.
- the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 16 LPPs.
- the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 16 LPPs.
- the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 14 of the 16 LPPs.
- the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 15 of the 16 LPPs.
- the host cell according to any one of the previous paragraphs, wherein the host cell is a fungal cell.
- the host cell is a yeast recombinant host cell, e.g., a Candida, Hansenula, Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia cell, such as a Kluyveromyces lactis, Saccharomyces carlsbergensis, Saccharomyces cerevisiae, Saccharomyces diastaticus, Saccharomyces douglasii, Saccharomyces kluyveri, Saccharomyces norbensis, Saccharomyces oviformis, or Yarrowia lipolytica cell.
- yeast recombinant host cell e.g., a Candida, Hansenula, Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia cell, such as a Kluyveromyces lactis, Saccharomyces carlsbergensis, Sacchar
- the host cell is a filamentous fungal recombinant host cell, e.g., an Acremonium, Aspergillus, Aureobasidium, Bjerkandera, Ceriporiopsis, Chrysosporium, Coprinus, Coriolus, Cryptococcus, Filibasidium, Fusarium, Humicola, Magnaporthe, Mucor, Myceliophthora, Neocallimastix, Neurospora, Paecilomyces, Penicillium, Phanerochaete, Phlebia, Piromyces, Pleurotus, Schizophyllum, Talaromyces, Thermoascus, Thielavia, Tolypocladium, Trametes, or Trichoderma cell, in particular, an Aspergillus awamori, Aspergillus foetidus, Aspergillus fumigatus, Aspergillus japonic
- the host cell according to any one of the previous paragraphs, wherein the host cell is an Aspergillus cell, preferably an Aspergillus niger cell, or an Aspergillus oryzae cell.
- the host cell according to any one of the previous paragraphs, wherein the host cell is a Trichoderma cell, preferably a Trichderma reesei cell.
- the host cell is a prokaryotic recombinant host cell, e.g., a Gram-positive cell selected from the group consisting of Bacillus, Clostridium, Enterococcus, Geobacillus, Lactobacillus, Lactococcus, Oceanobacillus, Staphylococcus, Streptococcus, or Streptomyces cells, or a Gram-negative bacteria selected from the group consisting of Campylobacter, E.
- a prokaryotic recombinant host cell e.g., a Gram-positive cell selected from the group consisting of Bacillus, Clostridium, Enterococcus, Geobacillus, Lactobacillus, Lactococcus, Oceanobacillus, Staphylococcus, Streptococcus, or Streptomyces cells, or a Gram-negative bacteria selected from the group consisting of Campylobacter, E.
- Flavobacterium Fusobacterium, Helicobacter, llyobacter, Neisseria, Pseudomonas, Salmonella, and Ureaplasma cells, such as Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, Bacillus firmus, Bacillus lautus, Bacillus lentus, Bacillus lichen!
- the host cell according to any one of the previous paragraphs, wherein the host cell is a Bacillus lichen! formis cell.
- the host cell according to any one of the previous paragraphs, wherein the host cell is a Bacillus subtilis cell.
- the host cell according to any one of the previous paragraphs, wherein the mammalian cell is selected from the list of a human cell, a mouse cell, a rat cell, a mouse hybridoma cell, a hamster cell (e.g. Chinese hamster ovary cell), and a rat hybridoma cell.
- the mammalian cell is selected from the list of a human cell, a mouse cell, a rat cell, a mouse hybridoma cell, a hamster cell (e.g. Chinese hamster ovary cell), and a rat hybridoma cell.
- a method for generating a multi-copy host cell comprising the steps of a) providing a host cell according to any one of the previous paragraphs, b) delivering to the host cell a gene editing system comprising:
- a first donor polynucleotide comprising in 5’ to 3’ direction: a 5’ homology arm, a first polynucleotide encoding a first polypeptide of interest, and a 3’ homology arm, OR a set of first PCR fragments comprising a first 5’ homology arm, a first 3’ homology arm, and a first polynucleotide encoding a first polypeptide of interest, wherein the 3’ homology arm is identical to the 3’ homology arm of one or more LPP, and wherein the 5’ homology arm is identical to the 5’ homology arm of one or more LPP, and c) cultivating the host cell at a first temperature allowing one or more first guide RNA of the at least one guide RNA to form a complex with the nuclease at one or more protospacer of one or more LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one
- step c) the complex is formed at one or more first protospacer of the one or more LPP, creating a double-strand break at or near the one or more first protospacer.
- step c) the complex is formed at one or more second protospacer of the one or more LPP, creating a double-strand break at or near the one or more second protospacer.
- step c) the complex is formed at one or more third protospacer of the one or more LPP, creating a double-strand break at or near the one or more third protospacer.
- step c) the complex is formed at one or more fourth protospacer of the one or more LPP, creating a double-strand break at or near the one or more fourth protospacer.
- step c) the complex is formed at one or more fifth protospacer of the one or more LPP, creating a double-strand break at or near the one or more fifth protospacer.
- step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of one LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with one copy of the first donor polynucleotide in its genome.
- the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with one copy of the first donor polynucleotide in its genome.
- step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of two LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the two LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with two copies of the first donor polynucleotide in its genome.
- step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of three LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the three LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the doublestrand break to provide a host cell with three copies of the first donor polynucleotide in its genome.
- step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of four LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the four LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with four copies of the first donor polynucleotide in its genome.
- step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of five LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the five LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with five copies of the first donor polynucleotide in its genome.
- step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of six LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the six LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with six copies of the first donor polynucleotide in its genome.
- step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 7 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 7 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 7 copies of the first donor polynucleotide in its genome.
- step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 8 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 8 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 8 copies of the first donor polynucleotide in its genome.
- step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 9 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 9 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 9 copies of the first donor polynucleotide in its genome.
- step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 10 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 10 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 10 copies of the first donor polynucleotide in its genome.
- step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 11 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 11 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 11 copies of the first donor polynucleotide in its genome.
- step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 12 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 12 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 12 copies of the first donor polynucleotide in its genome.
- step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 13 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 13 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 13 copies of the first donor polynucleotide in its genome.
- step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 14 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 14 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 14 copies of the first donor polynucleotide in its genome.
- step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 15 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 15 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 15 copies of the first donor polynucleotide in its genome.
- step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 16 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 16 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 16 copies of the first donor polynucleotide in its genome.
- step b) there are delivered at least two first guide RNAs, each first guide RNA forming a complex with the nuclease at a first, second, third, fourth, or fifth protospacer.
- the at least two first guide RNAs comprise at least two of: a 1 st first guide RNA, a 2 nd first guide RNA, a 3 rd first guide RNA, a 4 th first guide RNA, and a 5 th first guide RNA.
- the 1 st first guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 2 nd first guide RNA, to the polynucleotide sequence of 3 rd first guide RNA, to the polynucleotide sequence of the 4 th first guide RNA, and to the polynucleotide sequence of the 5 th first guide RNA.
- the 2 nd first guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1 st first guide RNA, to the polynucleotide sequence of 3 rd first guide RNA, to the polynucleotide sequence of the 4 th first guide RNA, and to the polynucleotide sequence of the 5 th first guide RNA.
- a sequence identity of below 100% such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below
- the 3 rd first guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1 st first guide RNA, to the polynucleotide sequence of 2 nd first guide RNA, to the polynucleotide sequence of the 4 th first guide RNA, and to the polynucleotide sequence of the 5 th first guide RNA.
- the 4 th first guide RNA has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1 st first guide RNA, to the polynucleotide sequence of 2 nd first guide RNA, to the polynucleotide sequence of the 3 rd first guide RNA, and to the polynucleotide sequence of the 5 th first guide RNA.
- a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below
- the 5 th first guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1 st first guide RNA, to the polynucleotide sequence of 2 nd first guide RNA, to the polynucleotide sequence of the 3 rd first guide RNA, and to the polynucleotide sequence of the 4 th first guide RNA.
- a sequence identity of below 100% such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below
- the one or more first guide RNA comprises a 1 st first guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 1 st first guide RNA is complementary to a first protospacer sequence.
- the one or more first guide RNA comprises a 2 nd first guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 2 nd first guide RNA is complementary to a second protospacer sequence.
- the one or more first guide RNA comprises a 3 rd first guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 3 rd first guide RNA is complementary to a third protospacer sequence.
- the one or more first guide RNA comprises a 4 th first guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 4 th first guide RNA is complementary to a fourth protospacer sequence.
- the one or more first guide RNA comprises a 5 th first guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 5 th first guide RNA is complementary to a fifth protospacer sequence. 251 .
- the at least one guide RNA comprises a first guide RNA and a second guide RNA
- the method additionally comprises step d): d) cultivating the host cell at a second temperature allowing one or more second guide RNA of the at least one guide RNA to form a complex with the nuclease at one or more protospacer of the one or more LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one or more LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break.
- step b) there is additionally delivered one or more second polynucleotide comprising in 5’ to 3’ direction: a 5’ homology arm, a second polynucleotide encoding a second polypeptide of interest, and a 3’ homology arm, OR a set of one or more second PCR fragments comprising a second 3’ homology arm, a second 5’ homology arm, and a second polynucleotide encoding a second polypeptide of interest, wherein the 3’ homology arm is identical to the 3’ homology arm of one or more LPP and wherein the 5’ homology arm is identical to the 5’ homology arm of one or more LPP, and wherein during step c) the homology directed repair comprises insertion of the second donor polynucleotide or set of second PCR fragments at the double-strand break.
- step b) there is additionally delivered one or more first polynucleotide comprising in 5’ to 3’ direction: a second 5’ homology arm, a first polynucleotide encoding the first polypeptide of interest, and a second 3’ homology arm, OR a set of one or more first PCR fragments comprising a second 3’ homology arm, a second 5’ homology arm, and the first polynucleotide encoding the first polypeptide of interest, wherein the second 3’ homology arms and the second 5’ homology arms have less than 100% sequence identity to the 3’ and 5’ homology arms of the at least two LPP, respectively, e.g., less than 99%, less than 98%, less than 97%, less than 96%, less than 95%, less than 90%, less thank 85%, less than 80%, less than 75%, less than 70%, or less than 65%, and wherein during step c) the
- step d) the complex is formed at one or more first protospacer of the one or more LPP, creating a double-strand break at or near the one or more first protospacer.
- the second guide RNA is complementary to one or more second protospacers, wherein during step d) the complex is formed at one or more second protospacer of the one or more LPP, creating a double-strand break at or near the one or more second protospacer.
- the second guide RNA is complementary to one or more third protospacers, wherein during step d) the complex is formed at one or more third protospacer of the one or more LPP, creating a double-strand break at or near the one or more third protospacer.
- step d) the complex is formed at one or more fourth protospacer of the one or more LPP, creating a double-strand break at or near the one or more fourth protospacer.
- step d) the complex is formed at one or more fifth protospacer of the one or more LPP, creating a double-strand break at or near the one or more fifth protospacer.
- step d) the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of one LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the doublestrand break to provide a host cell with at least one copy of the first donor polynucleotide in its genome.
- step d) the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of two LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the two LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the doublestrand break to provide a host cell with at least two copies of the first donor polynucleotide in its genome.
- step d) the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of three LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the three LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the doublestrand break to provide a host cell with at least three copies of the first donor polynucleotide in its genome.
- step d) the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of four LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the four LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the doublestrand break to provide a host cell with at least four copies of the first donor polynucleotide in its genome.
- step d) the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of five LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the five LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the doublestrand break to provide a host cell with at least five copies of the first donor polynucleotide in its genome.
- step d) the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of six LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the six LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the doublestrand break to provide a host cell with at least six copies of the first donor polynucleotide in its genome.
- step d) the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of 8 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 8 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with at least 8 copies of the first donor polynucleotide in its genome.
- step d) the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of 16 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 16 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the doublestrand break to provide a host cell with at least 16 copies of the first donor polynucleotide in its genome.
- step b) there are delivered at least two second guide RNAs, each second guide RNA forming a complex with the nuclease at a first, second, third, fourth, or fifth protospacer.
- the at least two second guide RNAs comprise at least two of: a 1 st second guide RNA, a 2 nd second guide RNA, a 3 rd second guide RNA, a 4 th second guide RNA, and a 5 th second guide RNA.
- the 1 st second guide RNA has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 2 nd second guide RNA, to the polynucleotide sequence of 3 rd second guide RNA, to the polynucleotide sequence of the 4 th second guide RNA, and to the polynucleotide sequence of the 5 th second guide RNA.
- a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below
- the 2 nd second guide RNA has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1 st second guide RNA, to the polynucleotide sequence of 3 rd second guide RNA, to the polynucleotide sequence of the 4 th second guide RNA, and to the polynucleotide sequence of the 5 th second guide RNA.
- a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below
- the 3 rd second guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1 st second guide RNA, to the polynucleotide sequence of 2 nd second guide RNA, to the polynucleotide sequence of the 4 th second guide RNA, and to the polynucleotide sequence of the 5 th second guide RNA.
- the 4 th second guide RNA has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1 st second guide RNA, to the polynucleotide sequence of 2 nd second guide RNA, to the polynucleotide sequence of the 3 rd second guide RNA, and to the polynucleotide sequence of the 5 th second guide RNA.
- a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below
- the 5 th second guide RNA has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1 st second guide RNA, to the polynucleotide sequence of 2 nd second guide RNA, to the polynucleotide sequence of the 3 rd second guide RNA, and to the polynucleotide sequence of the 4 th second guide RNA.
- a sequence identity below 100% such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%
- the one or more second guide RNA comprises a 1 st second guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 1 st second guide RNA is complementary to a first protospacer sequence.
- the one or more second guide RNA comprises a 2 nd second guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 2 nd second guide RNA is complementary to a second protospacer sequence.
- the one or more second guide RNA comprises a 3 rd second guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 3 rd second guide RNA is complementary to a third protospacer sequence.
- the one or more second guide RNA comprises a 4 th second guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 4 th second guide RNA is complementary to a fourth protospacer sequence.
- the one or more second guide RNA comprises a 5 th second guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 5 th second guide RNA is complementary to a fifth protospacer sequence.
- the at least one guide RNA comprises a first guide RNA, a second guide RNA, and a third guide RNA
- the method additionally comprises the step e) cultivating the host cell at a third temperature allowing one or more third guide RNA of the at least one guide RNA to form a complex with the nuclease at one or more protospacer of the one or more LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one or more LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break.
- step e the complex is formed at one or more first protospacer of the one or more LPP, creating a double-strand break at or near the one or more first protospacer.
- step e the complex is formed at one or more second protospacer of the one or more LPP, creating a double-strand break at or near the one or more second protospacer.
- step e the complex is formed at one or more third protospacer of the one or more LPP, creating a double-strand break at or near the one or more third protospacer.
- step e the complex is formed at one or more fourth protospacer of the one or more LPP, creating a double-strand break at or near the one or more fourth protospacer.
- step e the complex is formed at one or more fifth protospacer of the one or more LPP, creating a double-strand break at or near the one or more fifth protospacer.
- step e) allows the one or more third guide RNA to form a complex at one or more protospacers of one LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with at least one copy of the first donor polynucleotide in its genome.
- step e) allows the one or more third guide RNA to form a complex at one or more protospacers of two LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the two LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with at least two copies of the first donor polynucleotide in its genome.
- step e) the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of three LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the three LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the doublestrand break to provide a host cell with at least three copies of the first donor polynucleotide in its genome.
- step e) the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of four LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the four LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the doublestrand break to provide a host cell with at least four copies of the first donor polynucleotide in its genome.
- step e) the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of five LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the five LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with at least five copies of the first donor polynucleotide in its genome.
- step e) the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of six LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the six LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with at least six copies of the first donor polynucleotide in its genome.
- step e) the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of 8 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 8 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with at least 8 copies of the first donor polynucleotide in its genome.
- step e) the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of 16 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 16 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with at least 16 copies of the first donor polynucleotide in its genome.
- step b) there are delivered at least two third guide RNAs, each third guide RNA forming a complex with the nuclease at a first, second, third, fourth, or fifth protospacer.
- the at least two third guide RNAs comprise at least two of: a 1 st third guide RNA, a 2 nd third guide RNA, a 3 rd third guide RNA, a 4 th third guide RNA, and a 5 th third guide RNA.
- the 1 st third guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 2 nd third guide RNA, to the polynucleotide sequence of 3 rd third guide RNA, to the polynucleotide sequence of the 4 th third guide RNA, and to the polynucleotide sequence of the 5 th third guide RNA.
- a sequence identity of below 100% such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below
- the 2 nd third guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1 st third guide RNA, to the polynucleotide sequence of 3 rd third guide RNA, to the polynucleotide sequence of the 4 th third guide RNA, and to the polynucleotide sequence of the 5 th third guide RNA.
- a sequence identity of below 100% such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below
- the 3 rd third guide RNA has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1 st third guide RNA, to the polynucleotide sequence of 2 nd third guide RNA, to the polynucleotide sequence of the 4 th third guide RNA, and to the polynucleotide sequence of the 5 th third guide RNA.
- a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below
- the 4 th third guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1 st third guide RNA, to the polynucleotide sequence of 2 nd third guide RNA, to the polynucleotide sequence of the 3 rd third guide RNA, and to the polynucleotide sequence of the 5 th third guide RNA.
- a sequence identity of below 100% such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below
- the 5 th third guide RNA has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1 st third guide RNA, to the polynucleotide sequence of 2 nd third guide RNA, to the polynucleotide sequence of the 3 rd third guide RNA, and to the polynucleotide sequence of the 4 th third guide RNA.
- a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below
- the one or more third guide RNA comprises a 1 st third guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 1 st third guide RNA is complementary to a first protospacer sequence.
- the one or more third guide RNA comprises a 2 nd third guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 2 nd third guide RNA is complementary to a second protospacer sequence.
- the one or more third guide RNA comprises a 3 rd third guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 3 rd third guide RNA is complementary to a third protospacer sequence.
- the one or more third guide RNA comprises a 4 th third guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 4 th third guide RNA is complementary to a fourth protospacer sequence.
- the one or more third guide RNA comprises a 5 th third guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 5 th third guide RNA is complementary to a fifth protospacer sequence.
- the at least one guide RNA comprises a first guide RNA, a second guide RNA, a third guide RNA, and a fourth guide RNA
- the method additionally comprises the step f) cultivating the host cell at a fourth temperature allowing one or more fourth guide RNA of the at least one guide RNA to form a complex with the nuclease at one or more protospacer of the one or more LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one or more LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break.
- the at least one guide RNA comprises a first guide RNA, a second guide RNA, a third guide RNA, a fourth guide RNA, and a fifth guide RNA
- the method additionally comprises the step g) cultivating the host cell at a fifth temperature allowing one or more fifth guide RNA of the at least one guide RNA to form a complex with the nuclease at one or more protospacer of the one or more LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one or more LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break.
- the first temperature is in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
- the second temperature is in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
- the third temperature is in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
- the fourth temperature is in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
- the fifth temperature is in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
- first temperature and the second temperature are in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°. 311 .
- first temperature, the second temperature, and the third temperature are in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
- first temperature, the second temperature, the third temperature, and the fourth temperature are in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
- first temperature, the second temperature, the third temperature, the fourth temperature, and the fifth temperature are in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
- the first temperature differs from the second temperature by at least 0.5 C°, at least 1 C°, at least 1 .5 C°, at least 2 C°, at least 2.5 C°, at least 3 C°, at least 3.5 C°, at least 4 C°, at least 4.5 C°, at least 5 C°, at least 5.5 C°, or at least 6 C°.
- the third temperature differs from the fourth temperature by at least 0.5 C°, at least 1 C°, at least 1 .5 C°, at least 2 C°, at least 2.5 C°, at least 3 C°, at least 3.5 C°, at least 4 C°, at least 4.5 C°, at least 5 C°, at least 5.5 C°, or at least 6 C°
- the fourth temperature differs from the fifth temperature by at least 0.5 C°, at least 1 C°, at least 1 .5 C°, at least 2 C°, at least 2.5 C°, at least 3 C°, at least 3.5 C°, at least 4 C°, at least 4.5 C°, at least 5 C°, at least 5.5 C°, or at least 6 C°
- the first temperature is selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
- the third temperature is selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
- the fourth temperature is selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
- first temperature and the second temperature are selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°..
- first temperature, the second temperature, and the third temperature are selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
- first temperature, the second temperature, the third temperature, and the fourth temperature are selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
- first temperature, the second temperature, the third temperature, the fourth temperature, and the fifth temperature are selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
- step b) there is furthermore delivered a second donor polynucleotide comprising in 5’ to 3’ direction: a 5’ homology arm, a second polynucleotide encoding a second polypeptide of interest, and a 3’ homology arm, wherein the 3’ homology arm is identical to the 3’ homology arm of the host cell, and wherein the 5’ homology arm is identical to the 5’ homology arm of the host cell.
- the first polypeptide of interest comprises a brazzein, a casein, a patatin, an ovalbumin, an osteopontin, an ovotransferrin, an ovomucin, an ovomucoid, an ovostatin, a glycomacropeptide, a lactoferrin, an alpha-lactalbumin, e.g., bovine alphalactalbumin, a beta-lactalbumin and/or a collagen.
- the first polypeptide of interest comprises a brazzein, a casein, a patatin, an ovalbumin, an osteopontin, an ovotransferrin, an ovomucin, an ovomucoid, an ovostatin, a glycomacropeptide, a lactoferrin, an alpha-lactalbumin, e.g., bovine alphalactalbumin, a beta-lactalbumin and/or a collagen.
- the first polypeptide of interest comprises a therapeutic polypeptide selected from the group consisting of an antibody, an antibody fragment, an antibody-based drug, a Fc fusion protein, an anticoagulant, a blood factor, a bone morphogenetic protein, an engineered protein scaffold, an enzyme, a growth factor, a blood clotting factor, a hormone, an interferon (such as an interferon alpha-2b), an interleukin, a lactoferrin, an alpha-lactalbumin, a beta-lactalbumin, an ovomucoid, an ovostatin, a cytokine, an obestatin, a human galactosidase (such as an human alpha-galactosidase A), a vaccine, a protein vaccine, and a thrombolytic.
- a therapeutic polypeptide selected from the group consisting of an antibody, an antibody fragment, an antibody-based drug, a Fc fusion protein, an anticoagulant, a blood
- the enzyme is selected from the list of a hydrolase, isomerase, ligase, lyase, oxidoreductase, or transferase, e.g., an aminopeptidase, amylase, carbohydrase, carboxypeptidase, catalase, cellobiohydrolase, cellulase, chitinase, cutinase, cyclodextrin glycosyltransferase, deoxyribonuclease, endoglucanase, esterase, alpha-galactosidase, beta-galactosidase, glucoamylase, alpha-glucosidase, beta-glucosidase, invertase, laccase, lipase, mannosidase, mutanase, oxidase, pectinolytic enzyme, peroxidase, a hydrolase, isomerase, ligase, lya
- the second polypeptide of interest comprises a polypeptide selected from the list of a transcription factor (e.g. amyR), a chaperone (e.g. BipA, PdiA, or PdiB), a heme pathway enzyme (e.g. encoded by hemA, or hemB), or a glycosylation pathway enzyme (e.g. a Mannosidase I, a GIcNAc transferase, a galactosyltransferase, a sialyltransferase, an oligosaccharyltransferase, or an glucosidase).
- a transcription factor e.g. amyR
- a chaperone e.g. BipA, PdiA, or PdiB
- heme pathway enzyme e.g. encoded by hemA, or hemB
- a glycosylation pathway enzyme e.g. a Mannosidase I, a
- RNA-guided DNA nuclease delivering to the host cell a RNA-guided DNA nuclease, and at least one guide RNA directed to one or more protospacer of one or more empty LPP, and
- the host cell at a temperature allowing the at least one guide RNA to form a complex with the nuclease at the one or more empty LPP, wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one or more empty LPP, wherein the homology directed repair comprises insertion of the first polynucleotide at the doublestrand break of the empty LPP.
- a method of producing a polypeptide of interest comprising cultivating a multi-copy host cell, said cell being generated by the method of any preceding paragraphs, under conditions conducive for production of the polypeptide. 338. The method of paragraph 337, further comprising recovering the polypeptide of interest.
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Organic Chemistry (AREA)
- Wood Science & Technology (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Mycology (AREA)
- Biophysics (AREA)
- Medicinal Chemistry (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
The present invention relates to host cells comprising in its genome at least two landing pad polynucleotides (LPP), methods of generating multi-copy host cells, and methods of producing a polypeptide of interest with said multi-copy host cells.
Description
GENERATION OF MULTI-COPY HOST CELLS
Reference to a Sequence Listing
This application contains a Sequence Listing in computer readable form, which is incorporated herein by reference.
Background of the Invention
Field of the Invention
The present invention relates to host cells comprising in its genome at least two landing pad polynucleotides (LPP), methods of generating multi-copy host cells, and methods of producing a polypeptide of interest with said host cells.
Description of the Related Art
Recombinant host cells are widely used for recombinant protein production. For industrial and commercial purposes, the productivity of the applied cell systems, i.e., the production of total protein per fermentation unit, is an important factor of production costs. Traditionally, yield increases have been achieved through mutagenesis and screening for increased production of proteins of interest. However, this approach is mainly only useful for the overproduction of endogenous proteins in isolates containing the enzymes of interest. Therefore, for each new protein or enzyme product, a lengthy strain and process development program is required to achieve improved productivities.
To obtain high-level expression of a particular gene, a well-established procedure is targeting multiple copies of the recombinant gene constructs to the locus of a highly expressed endogenous gene. However, generation of such multi-copy host cells is difficult to control (i.e., number and location of GOI copies, and genome stability) and often a lengthy process requiring numerous iterations of transfection/transformation and subsequent sequencing. Also, the efficiency of targeted integration is often low, making multi-copy strain generation laborious and expensive.
Multi-copy host cells can for example be generated using the flippase-based FLP-FRT system disclosed in WO16145084 (Novozymes A/S). However, known disadvantages of the FLP-FRT system are for example (i) the occurrence of unwanted recombination between FRT sites, (ii) the requirement of high DNA loads during transformation, and (iii) the requirement for marker genes in all of the landing pads.
Thus, it is of continuous interest to improve the generation of multi-copy host cells. The object of the present invention is to provide a modified host cell and a method for generating multi-copy host cells.
Summary of the Invention
The present invention is based on the surprising and inventive finding that host cells comprising multiple landing pads, each landing pad comprising homology arms and multiple protospacer sequences, can be efficiently used for targeted integration of multiple GOI copies. Said host cells can be used to generate multi-copy host cells via the method of the invention: synthetic protospacer array defined entry (SPADE). With the presence of multiple landing pads and multiple protospacer sequences, the SPADE
system provides a versatile platform to generate host cells with desired copy numbers for gene(s) of interest to be integrated into the host cell genome.
Surprisingly, although not using any marker genes in the SPADE cells, high multi-copy integration efficiencies of 75-80% were observed (Examples 2-3). Most surprisingly, high multi-copy integration efficiency was achieved in only one round of transformation, i.e., in a one step fashion (Example 2, Examples 7-8, and Examples 10-11), resulting in shortened timelines for multi-copy host cell generation with up to 6 copies of the GOI. Importantly, the inventors have shown that homology arms as short as 100 bp are sufficient for the SPADE system (Example 9).
Furthermore, the inventors observed that with synthetic homology arms in the landing pads, potential silencing issues were eliminated, resulting in increased homologous recombination events.
A further surprising finding was that, by placing the landing pads in different orientation in the genome, i.e., some landing pads are present on the coding strand while other landing pads are present on the template strand (Figure 3), unwanted recombination events between integration sites, which as described above is a major disadvantage in using the prior art FLP-FRT systems, can be significantly reduced. This design furthermore prevents unwanted landing pad I GOI deletions and/or chromosome arm swaps.
A further advantage of the SPADE system over prior art systems, e.g., the FLP-FRT system, is that the load of DNA can be reduced during transformation - which is an important advantage when toxic DNA/proteins are transformed. As shown in Example 4, using the SPADE technology GOI sequences previously integrated into landing pads can be “re-used” and integrated into “empty” landing pads to further increase the copy number of the host cell. The amount of DNA per transfection can be reduced to reduce DNA and/or protein toxicity for the host cell (and thereby maintaining desired host cell viabilities), whilst not compromising integration efficiency or GOI copy number.
Additionally, the SPADE system does not require a recombinase sequence present close (e.g., placed in cis) to the landing pads as is the case for the FLP/FRT system.
Also, the SPADE system allows to maintain some landing pads as “empty” landing pads, while other landing pads harbor the gene of interest. This further contributes to a versatile host cell generation platform and e.g., allows later integration of further (same or different) genes of interest into the genome if desired.
Advantageously, within a one or two integration steps the different landing pads can be used for integration of different GOIs, to obtain strains having varying ratios of two or more GOIs (see Example 8 and Example 12, respectively).
Lastly, the SPADE system allows to introduce linear PCR products comprising the different features of the system (e.g., GOI, 3’ homology arm, and 5’ homology arm), see e.g., Example 11 . By doing so, no full plasmid constructs need to be cloned which results in a highly time-efficient strain construction.
In a first aspect, the present invention provides host cells comprising in their genome at least two landing pad polynucleotide (LPP) sequences, each LPP sequence comprising in 5’ to 3’ direction: a 5’ homology arm, two or more protospacer sequences, and a 3’ homology arm, wherein the polynucleotide sequence of the 5’ homology arm is identical for at least two LPP sequences, and wherein the polynucleotide sequence of the 3’ homology arm is identical for at least two LPP sequences.
The host cells of the first aspect have the advantage of being versatile starting points for generating multi-copy host cells, i.e. the cells can be used to integrated genes of interest into one, two or more, or into
all LPPs of the host cell. The location and number of the LPPs in the host cell genome can be designed depending on the host cell type and the gene of interest which shall be integrated therein.
In a second aspect, the present invention provides a method for generating a multi-copy host cell, comprising the steps of a) providing a host cell according to the first aspect, b) delivering to the host cell a gene editing system comprising:
- a RNA-guided DNA nuclease,
- at least one guide RNA directed to one or more protospacer of the one or more LPP sequences,
- a first donor polynucleotide comprising in 5’ to 3’ direction: a 5’ homology arm, a first polynucleotide encoding a first polypeptide of interest, and a 3’ homology arm, OR a set of first PCR fragments comprising a first 5’ homology arm, a first 3’ homology arm, and a first polynucleotide encoding a first polypetide of interest, wherein the 3’ homology arm and/or first 3’ homology arm is identical to the 3’ homology arm of one or more LPP, and wherein the 5’ homology arm and/or first 5’ homology arm is identical to the 5’ homology arm of one or more LPP, and c) cultivating the host cell at a first temperature allowing one or more first guide RNA of the at least one guide RNA to form a complex with the nuclease at one or more protospacer of one or more LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one or more LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break.
Using the method of the second aspect, efficient and targeted integration of multiple gene copies into the host cell is enabled, either in a single step or in a multi-step approach.
In a third aspect, the present invention provides a method of producing a polypeptide of interest, comprising cultivating a multi-copy host cell under conditions conducive for production of the polypeptide, which multi-copy host cell is generated by the method according to the second aspect.
Brief Description of the Drawings
Figure 1 shows a host cell of the invention comprising 4 landing pad polynucleotides (LPPs).
Figure 2a shows the RNA-guided cutting of three LPPs in a host cell of the invention.
Figure 2b shows homology-directed repair of the generated double strand breaks in a host cell of the invention.
Figure 2c shows a host cell harboring three copies of a GOI in three LPPs.
Figure 3 shows a detailed overview of the 4 LPPs including their PAM sequences and protospacers (homology arms are not fully represented), and the LPP either being present on the coding strand or on the template strand.
Figure 4 shows an example of a set of PCR fragments used for transformation into a host cell with a SPADE LPP.
Definitions
In accordance with this detailed description, the following definitions apply. Note that the singular forms "a," "an," and "the" include plural references unless the context clearly dictates otherwise.
Unless defined otherwise or clearly indicated by context, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs.
Landing pad polynucleotide (LPP): The term ”LPP” or landing pad polynucleotide, or landing pad, means a DNA sequence which comprises a 5’ homology arm, two or more protospacer sequences, and a 3’ homology arm. The LPP is inserted into the host cell genome on the desired chromosome and/or genome location. The LPP is then targeted with a nuclease and a gRNA at a protospacer of the LPP which results in a double-strand break at or near the targeted protospacer. Homology-directed repair takes place with a donor polynucleotide and/or PCR fragments. The donor polynucleotide or PCR fragments carry a gene of interest, and comprise homology arms being identical to the 5’ and 3’ homology arms of the targeted LPP. The repair results in integration of the donor polynucleotide and/or PCR fragments in the targeted LPP. The homology-directed repair is not limited to the presence of the donor polynucleotide and/or PCR fragments, but may instead/additionally take place with an already integrated gene of interest in a LPP having identical homology arms to the homology arms of the targeted LPP.
Homology arm: The term homology arm means a DNA sequence which is present at the 5’ arm (= 5’ homology arm) and the 3’ arm (= 3’ homology arm) of the LPP and the donor polynucleotide. Typically, the 3’ arm sequence is not identical to the 5’ arm sequence. The homology arms may have a length of at least 10 nt, at least 20 nt, at least 30 nt, at least 50 nt, at least 100 nt, 200 nucleotides (nt), at least 300 nt, at least 400 nt, at least 500 nt, at least 600 nt, at least 700 nt, at least 800 nt, at least 900 nt, or at least 1000 nt.
For the LPP, the 5’ arm is located upstream of the two or more protospacers, and the 3’ arm is located downstream of the two or more protospacers.
For the donor polynucleotide, the 5’ arm is located upstream of the first polynucleotide encoding the first polypeptide of interest, and the 3’ arm is located downstream of the first polynucleotide encoding the first polypeptide of interest.
Protospacer: The term protospacer or protospacer sequence means a DNA sequence located within the 5’ arm and the 3’ arm of an LPP. The protospacer is integrated into the host cell genome and is complementary to the spacer sequence of a guide RNA, which guide RNA can be used to create a doublestrand cut at or near the protospacer sequence. The protospacer may have a length of bewteen 14 - 36 nucleotides. DNA-guided nucleases may also require a PAM sequence present near the protospacer sequence. The PAM sequence (protospacer adjacent motif) is located directly downstream of the target sequence in the genomic DNA, on the non-target strand, i.e. on the strand opposite to the strand which comprises the protospacer sequence.
First protospacer: The term first protospacer means a protospacer which is present in all LPP of the host cell.
Second protospacer: The term second protospacer means a protospacer which is unique for only one LPP of the at least two LPPs in the host cell genome.
Third protospacer: The term third protospacer means a protospacer which is present in a third subset of the at least two LPPs.
Third subset of the at least two LPPs: The third subset of LPPs, means a subset of LPPs comprising a third protospacer which is present in two or more LPPs, and/or is present in all LPPs of the host cell.
Fourth protospacer: The term fourth protospacer means a protospacer which is present in a fourth subset of the at least two LPPs.
Fourth subset of the at least two LPPs: The fourth subset of LPPs, means a subset of LPPs comprising a fourth protospacer which is present in two or more LPPs, and/or is present in all LPPs of the host cell. Typically, the fourth subset consists of a number of LPPs which number of LPPs is different from the number of LPPs present in the third subset.
Fifth protospacer: The term fifth protospacer means a protospacer which is present in a fifth subset of the at least two LPPs.
Fifth subset of the at least two LPPs: The fifth subset of LPPs, means a subset of LPPs comprising a fifth protospacer which is present in two or more LPPs, and/or is present in all LPPs of the host cell. Typically, the fifth subset consists of a number of LPPs which number of LPPs is different from the number of LPPs present in the third subset, and also different from the number of LPPs present in the fourth subset.
RNA-guided DNA nuclease: RNA-guided DNA nucleases provide sequence-specific DNA cutting through base-pairing interactions bewteen a guide RNA and target DNA (protospacer sequence) of the two or more LPPs. These nucleases include, but are not limited to Cas9, MAD7, and Cpf1. RNA-gudied nucleases use base pairing to recognize and cleave target DNA with complementarity to the guide RNA. guide RNA and nuclease may be delivered to the host cell by transformation I transfection, either on the same plasmid or linearized DNA vector, or on different separate plasmids or DNA vectors. Additionally or alternatively, the nuclease may be delivered to the cell as protein, e.g. by protein transfection. guide RNA: The guide RNA functions as a guide for the RNA-guided DNA nuclease. The guide RNA forms a complex with the nuclease and the target strand, to which the guide RNA is complementary. The guide RNA may comprise a spacer, which is complementary to the protospacer sequence of the DNA target, a short CRISPR RNA (crRNA), and a tracrRNA.
First guide RNA: The first guide RNA is a guide RNA that forms an editing complex most efficiently at the first temperature, and forms an editing complex less efficiently at other temperatures. Also, the first guide RNA comprises a spacer sequence which may be complementary to any of the first, second, third, fourth or fifth protospacers. The first guide RNA may be a 1st first guide RNA, a 2nd first guide RNA, 3rd first guide RNA, 4rd first guide RNA, and /or a 5th first guide RNA.
First donor polynuclotide: The first donor polynucleotide comprises a first polynucleotide which encodes a first polypeptide of interest. The first donor polynucleotide furthermore comprises a 5’ homology arm upstream of the first polynucleotide, and a 3’ homology arm downstream of the first polynucleotide. The homology arms of the first donor polynucleotide are typically similar and/or identical to the homology arms of the LPPs into which the first polynucleotide is integrated into.
Second donor polynucleotide: The second donor polynucleotide comprises a second polynucleotide which encodes a second polypeptide of interest. The second donor polynucleotide furthermore comprises a 5’ homology arm upstream of the second polynucleotide, and a 3’ homology arm downstream of the second polynucleotide. The homology arms of the second donor polynucleotide are
typically similar and/or identical to the homology arms of the LPPs into which the second polynucleotide is integrated into.
First temperature: The first temperature allows efficient nuclease-based cutting of one or more protospacers which are complemtary to one or more first guide RNA. The first temperature typically also allows cultivation of the host cell, but may differ from the host cell's optimal cultivation temperature.
Second guide RNA: The second guide RNA is a guide RNA that forms an editing complex most efficiently at the second temperature, and forms an editing complex less efficiently at other temperatures. Also, the second guide RNA comprises a spacer sequence which may be complementary to any of the first, second, third, fourth or fifth protospacers. The second guide RNA may be a 1st second guide RNA, a 2nd second guide RNA, 3rd second guide RNA, 4rd second guide RNA, and /or a 5th second guide RNA.
Second temperature: The second temperature allows efficient nuclease-based cutting of one or more protospacers which are complemtary to one or more second guide RNA. The second temperature typically also allows cultivation of the host cell, but may differ from the host cell's optimal cultivation temperature. Typically, the second temperature differs from the first temperature by at least 0.5 degrees Celsius.
Third guide RNA: The third guide RNA is a guide RNA that forms an editing complex most efficiently at the third temperature, and forms an editing complex less efficiently at other temperatures. Also, the third guide RNA comprises a spacer sequence which may be complementary to any of the first, second, third, fourth or fifth protospacers. The third guide RNA may be a 1st third guide RNA, a 2nd third guide RNA, 3rd third guide RNA, 4rd third guide RNA, and /or a 5th third guide RNA.
Third temperature: The third temperature allows efficient nuclease-based cutting of one or more protospacers which are complemtary to one or more third guide RNA. The third temperature typically also allows cultivation of the host cell, but may differ from the host cell's optimal cultivation temperature. Typically, the third temperature differs from the first temperature and the second temperature by at least 0.5 degrees Celsius.
Fourth guide RNA: The fourth guide RNA is a guide RNA that forms an editing complex most efficiently at the fourth temperature, and forms an editing complex less efficiently at other temperatures. Also, the fourth guide RNA comprises a spacer sequence which may be complementary to any of the first, second, third, fourth or fifth protospacers. The fourth guide RNA may be a 1st fourth guide RNA, a 2nd fourth guide RNA, 3rd fourth guide RNA, 4rd fourth guide RNA, and /or a 5th fourth guide RNA.
Fourth temperature: The fourth temperature allows efficient nuclease-based cutting of one or more protospacers which are complemtary to one or more fourth guide RNA. The fourth temperature typically also allows cultivation of the host cell, but may differ from the host cell's optimal cultivation temperature. Typically, the fourth temperature differs from the first temperature, the second temperature, and the third temperature by at least 0.5 degrees Celsius.
Fifth guide RNA: The fifth guide RNA is a guide RNA that forms an editing complex most efficiently at the fifth temperature, and forms an editing complex less efficiently at other temperatures. Also, the fifth guide RNA comprises a spacer sequence which may be complementary to any of the first, second, third, fourth or fifth protospacers. The t fifth hird guide RNA may be a 1 st fifth guide RNA, a 2nd fifth guide RNA, 3rd fifth guide RNA, 4rd fifth guide RNA, and /or a 5th fifth guide RNA.
Fifth temperature: The fifth temperature allows efficient nuclease-based cutting of one or more protospacers which are complemtary to one or more fifth guide RNA. The fifth temperature typically also
allows cultivation of the host cell, but may differ from the host cell's optimal cultivation temperature. Typically, the fifth temperature differs from the first temperature, the second temperature, the third temperature, and the fourth temperature by at least 0.5 degrees Celsius.
Homology directed repair: Homology-directed repair is a process where DNA double-strand break is repaired by homologous recombination using a DNA template. The DNA template may be a donor polynucleotide, or a DNA sequence previously integrated into an LPP. The repair mechanism uses the homology arms to integrate the DNA template into the broken target site.
Deficient NHEJ mechanism: Besides homology directed repair, double-strand breaks may also be repaired by a non-homologous end-joining (NHEJ) mechanism. For cells where the NHEJ mechanism has been inactivated, e.g. by mutation or gene deletions, the rate of homology directed repair can be increased.
Empty LPP: The term empty LPP or empty landing pad means an LPP sequence which does not comprise an integrated polynucleotide of interest. Preferably, the empty LPP has not been cut or targeted with a guide RNA previously. Such empty LPP may be subject for homology directed gene integration either with the donor polynucleotide (= during step c) ), or with a polynucleotide of interest previously integrated into another LPP (= during step h) ). cDNA: The term "cDNA" means a DNA molecule that can be prepared by reverse transcription from a mature, spliced, mRNA molecule obtained from a eukaryotic or prokaryotic cell. cDNA lacks intron sequences that may be present in the corresponding genomic DNA. The initial, primary RNA transcript is a precursor to mRNA that is processed through a series of steps, including splicing, before appearing as mature spliced mRNA.
Coding sequence: The term “coding sequence” means a polynucleotide, which directly specifies the amino acid sequence of a polypeptide. The boundaries of the coding sequence are generally determined by an open reading frame, which begins with a start codon, such as ATG, GTG, or TTG, and ends with a stop codon, such as TAA, TAG, or TGA. The coding sequence may be a genomic DNA, cDNA, synthetic DNA, or a combination thereof.
Control sequences: The term “control sequences” means nucleic acid sequences involved in regulation of expression of a polynucleotide in a specific organism or in vitro. Each control sequence may be native (/.e., from the same gene) or heterologous (/.e., from a different gene) to the polynucleotide encoding the polypeptide, and native or heterologous to each other. Such control sequences include, but are not limited to leader, polyadenylation, prepropeptide, propeptide, signal peptide, promoter, terminator, enhancer, and transcription or translation initiator and terminator sequences. At a minimum, the control sequences include a promoter, and transcriptional and translational stop signals. The control sequences may be provided with linkers for the purpose of introducing specific restriction sites facilitating ligation of the control sequences with the coding region of the polynucleotide encoding a polypeptide.
Expression: The term “expression” means any step involved in the production of a polypeptide including, but not limited to, transcription, post-transcriptional modification, translation, post-translational modification, and secretion.
Expression vector: An "expression vector" refers to a linear or circular DNA construct comprising a DNA sequence encoding a polypeptide, which coding sequence is operably linked to a suitable control sequence capable of effecting expression of the DNA in a suitable host. Such control sequences may include a promoter to effect transcription, an optional operator sequence to control transcription, a
sequence encoding suitable ribosome binding sites on the mRNA, enhancers and sequences which control termination of transcription and translation.
Fusion polypeptide: The term “fusion polypeptide” is a polypeptide in which one polypeptide is fused at the N-terminus and/or the C-terminus of a polypeptide of the present invention. A fusion polypeptide is produced by fusing a polynucleotide encoding another polypeptide to a polynucleotide of the present invention, or by fusing two or more polynucleotides of the present invention together. Techniques for producing fusion polypeptides are known in the art, and include ligating the coding sequences encoding the polypeptides so that they are in frame and that expression of the fusion polypeptide is under control of the same promoter(s) and terminator. Fusion polypeptides may also be constructed using intein technology in which fusion polypeptides are created post-translationally (Cooper et al., 1993, EMBO J. 12: 2575-2583; Dawson et al., 1994, Science 266: 776-779). A fusion polypeptide can further comprise a cleavage site between the two polypeptides. Upon secretion of the fusion protein, the site is cleaved releasing the two polypeptides. Examples of cleavage sites include, but are not limited to, the sites disclosed in Martin et al., 2003, J. Ind. Microbiol. Biotechnol. 3: 568-576; Svetina et al., 2000, J. Biotechnol. 76: 245-251 ; Rasmussen-Wilson et al., 1997, Appl. Environ. Microbiol. 63: 3488-3493; Ward et al., 1995, Biotechnology 13: 498-503; and Contreras et al., 1991 , Biotechnology 9: 378-381 ; Eaton et al., 1986, Biochemistry 25: 505-512; Collins-Racie et al., 1995, Biotechnology 13: 982-987; Carter et al., 1989, Proteins: Structure, Function, and Genetics 6: 240-248; and Stevens, 2003, Drug Discovery World 4: 35-48.
Heterologous: The term "heterologous" means, with respect to a host cell, that a polypeptide or nucleic acid does not naturally occur in the host cell. The term "heterologous" means, with respect to a polypeptide or nucleic acid, that a control sequence, e.g., promoter, of a polypeptide or nucleic acid is not naturally associated with the polypeptide or nucleic acid, i.e., the control sequence is from a gene other than the gene encoding the mature polypeptide.
Host Strain or Host Cell: A "host strain" or "host cell" is an organism into which an expression vector, phage, virus, or other DNA construct, including a polynucleotide encoding a polypeptide of interest (e.g., an amylase) has been introduced. Exemplary host strains are microorganism cells (e.g., bacteria, filamentous fungi, and yeast) capable of expressing the polypeptide of interest and/or fermenting saccharides. The term "host cell" includes protoplasts created from cells.
Introduced: The term "introduced" in the context of inserting a nucleic acid sequence into a cell, means "transfection", "transformation" or "transduction," as known in the art.
Isolated: The term “isolated” means a polypeptide, nucleic acid, cell, or other specified material or component that has been separated from at least one other material or component, including but not limited to, other proteins, nucleic acids, cells, etc. An isolated polypeptide, nucleic acid, cell or other material is thus in a form that does not occur in nature. An isolated polypeptide includes, but is not limited to, a culture broth containing the secreted polypeptide expressed in a host cell.
Native: The term "native" means a nucleic acid or polypeptide naturally occurring in a host cell.
Nucleic acid: The term "nucleic acid" encompasses DNA, RNA, heteroduplexes, and synthetic molecules capable of encoding a polypeptide. Nucleic acids may be single stranded or double stranded, and may be chemical modifications. The terms "nucleic acid" and "polynucleotide" are used interchangeably. Because the genetic code is degenerate, more than one codon may be used to encode a particular amino acid, and the present compositions and methods encompass nucleotide sequences that
encode a particular amino acid sequence. Unless otherwise indicated, nucleic acid sequences are presented in 5'-to-3' orientation.
Nucleic acid construct: The term "nucleic acid construct" means a nucleic acid molecule, either single- or double-stranded, which is isolated from a naturally occurring gene or is modified to contain segments of nucleic acids in a manner that would not otherwise exist in nature or which is synthetic, and which comprises one or more control sequences operably linked to the nucleic acid sequence.
Operably linked: The term "operably linked" means that specified components are in a relationship (including but not limited to juxtaposition) permitting them to function in an intended manner. For example, a regulatory sequence is operably linked to a coding sequence such that expression of the coding sequence is under control of the regulatory sequence.
PCR fragments: The term “PCR fragments” means double stranded DNA generated by PCR, comprising overlaps at the 3’ and/or 5’ ends suitable for homologous recombination with one or more other PCR fragments. The homologous recombination takes usually place in the host cell after transformation of the PCR fragments, wherein the PCR fragments are integrated into one or more LPP. Examples for PCR fragments are disclosed in Example 11 and Figure 4. In a preferred embodiment, a set of PCR fragments comprises a 5’ homology arm PCR fragment, a 3’ homology arm PCR fragment, and a PCR fragment comprising the polynucleotide of interest, preferably the polynucleotide of interest is linked to a promoter and a terminator.
Purified: The term “purified” means a nucleic acid, polypeptide or cell that is substantially free from other components as determined by analytical techniques well known in the art (e.g., a purified polypeptide or nucleic acid may form a discrete band in an electrophoretic gel, chromatographic eluate, and/or a media subjected to density gradient centrifugation). A purified nucleic acid or polypeptide is at least about 50% pure, usually at least about 60%, about 65%, about 70%, about 75%, about 80%, about 85%, about 90%, about 91 %, about 92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 98%, about 99%, about 99.5%, about 99.6%, about 99.7%, about 99.8% or more pure (e.g., percent by weight or on a molar basis). In a related sense, a composition is enriched for a molecule when there is a substantial increase in the concentration of the molecule after application of a purification or enrichment technique. The term "enriched" refers to a compound, polypeptide, cell, nucleic acid, amino acid, or other specified material or component that is present in a composition at a relative or absolute concentration that is higher than a starting composition.
In one aspect, the term "purified" as used herein refers to the polypeptide or cell being essentially free from components (especially insoluble components) from the production organism. In other aspects, the term "purified" refers to the polypeptide being essentially free of insoluble components (especially insoluble components) from the native organism from which it is obtained. In one aspect, the polypeptide is separated from some of the soluble components of the organism and culture medium from which it is recovered. The polypeptide may be purified (/.e., separated) by one or more of the unit operations filtration, precipitation, or chromatography.
Accordingly, the polypeptide may be purified such that only minor amounts of other proteins, in particular, other polypeptides, are present. The term "purified" as used herein may refer to removal of other components, particularly other proteins and most particularly other enzymes present in the cell of origin of the polypeptide. The polypeptide may be "substantially pure", i.e., free from other components from the organism in which it is produced, e.g., a host organism for recombinantly produced polypeptide. In one
aspect, the polypeptide is at least 40% pure by weight of the total polypeptide material present in the preparation. In one aspect, the polypeptide is at least 50%, 60%, 70%, 80% or 90% pure by weight of the total polypeptide material present in the preparation. As used herein, a "substantially pure polypeptide" may denote a polypeptide preparation that contains at most 10%, preferably at most 8%, more preferably at most 6%, more preferably at most 5%, more preferably at most 4%, more preferably at most 3%, even more preferably at most 2%, most preferably at most 1 %, and even most preferably at most 0.5% by weight of other polypeptide material with which the polypeptide is natively or recombinantly associated.
It is, therefore, preferred that the substantially pure polypeptide is at least 92% pure, preferably at least 94% pure, more preferably at least 95% pure, more preferably at least 96% pure, more preferably at least 97% pure, more preferably at least 98% pure, even more preferably at least 99% pure, most preferably at least 99.5% pure by weight of the total polypeptide material present in the preparation. The polypeptide of the present invention is preferably in a substantially pure form (/.e., the preparation is essentially free of other polypeptide material with which it is natively or recombinantly associated). This can be accomplished, for example by preparing the polypeptide by well-known recombinant methods or by classical purification methods.
Recombinant: The term "recombinant" is used in its conventional meaning to refer to the manipulation, e.g., cutting and rejoining, of nucleic acid sequences to form constellations different from those found in nature. The term recombinant refers to a cell, nucleic acid, polypeptide or vector that has been modified from its native state. Thus, for example, recombinant cells express genes that are not found within the native (non-recombinant) form of the cell, or express native genes at different levels or under different conditions than found in nature. The term “recombinant” is synonymous with “genetically modified” and “transgenic”.
Recover: The terms "recover" or “recovery” means the removal of a polypeptide from at least one fermentation broth component selected from the list of a cell, a nucleic acid, or other specified material, e.g., recovery ofthe polypeptide from the whole fermentation broth, or from the cell-free fermentation broth, by polypeptide crystal harvest, by filtration, e.g. depth filtration (by use of filter aids or packed filter medias, cloth filtration in chamber filters, rotary-drum filtration, drum filtration, rotary vacuum-drum filters, candle filters, horizontal leaf filters or similar, using sheed or pad filtration in framed or modular setups) or membrane filtration (using sheet filtration, module filtration, candle filtration, microfiltration, ultrafiltration in either cross flow, dynamic cross flow or dead end operation), or by centrifugation (using decanter centrifuges, disc stack centrifuges, hyrdo cyclones or similar), or by precipitating the polypeptide and using relevant solid-liquid separation methods to harvest the polypeptide from the broth media by use of classification separation by particle sizes. Recovery encompasses isolation and/or purification of the polypeptide.
Sequence identity: The relatedness between two amino acid sequences or between two nucleotide sequences is described by the parameter “sequence identity”.
For purposes of the present invention, the sequence identity between two amino acid sequences is determined as the output of “longest identity” using the Needleman-Wunsch algorithm (Needleman and Wunsch, 1970, J. Mol. Biol. 48: 443-453) as implemented in the Needle program of the EMBOSS package (EMBOSS: The European Molecular Biology Open Software Suite, Rice et al., 2000, Trends Genet. 16: 276-277), preferably version 6.6.0 or later. The parameters used are a gap open penalty of 10, a gap extension penalty of 0.5, and the EBLOSUM62 (EMBOSS version of BLOSUM62) substitution matrix. In
order for the Needle program to report the longest identity, the -nobrief option must be specified in the command line. The output of Needle labeled “longest identity” is calculated as follows:
(Identical Residues x 100)/(Length of Alignment - Total Number of Gaps in Alignment)
For purposes of the present invention, the sequence identity between two polynucleotide sequences is determined as the output of “longest identity” using the Needleman-Wunsch algorithm (Needleman and Wunsch, 1970, supra) as implemented in the Needle program of the EMBOSS package (EMBOSS: The European Molecular Biology Open Software Suite, Rice et al., 2000, supra), preferably version 6.6.0 or later. The parameters used are a gap open penalty of 10, a gap extension penalty of 0.5, and the EDNAFULL (EMBOSS version of NCBI NUC4.4) substitution matrix. In order for the Needle program to report the longest identity, the nobrief option must be specified in the command line. The output of Needle labeled “longest identity” is calculated as follows:
(Identical Deoxyribonucleotides x 100)/(Length of Alignment - Total Number of Gaps in Alignment)
Signal Peptide: A "signal peptide" is a sequence of amino acids attached to the N-terminal portion of a protein, which facilitates the secretion of the protein outside the cell. The mature form of an extracellular protein lacks the signal peptide, which is cleaved off during the secretion process.
Wild-type: The term "wild-type" in reference to an amino acid sequence or nucleic acid sequence means that the amino acid sequence or nucleic acid sequence is a native or naturally-occurring sequence. As used herein, the term "naturally-occurring" refers to anything (e.g., proteins, amino acids, or nucleic acid sequences) that is found in nature. Conversely, the term "non-naturally occurring" refers to anything that is not found in nature (e.g., recombinant nucleic acids and protein sequences produced in the laboratory or modification of the wild-type sequence).
Detailed Description of the Invention
Host Cells
In a first aspect, the present invention relates to a host cell comprising in its genome at least two landing pad polynucleotide (LPP) sequences, each LPP sequence comprising in 5’ to 3’ direction: a 5’ homology arm, two or more protospacer sequences, and a 3’ homology arm, wherein the polynucleotide sequence of the 5’ homology arm is identical for at least two LPP sequences, and wherein the polynucleotide sequence of the 3’ homology arm is identical for at least two LPP sequences.
In one embodiment, at least one protospacer of the at least two protospacers is present in at least two LPP.
In one embodiment, the cell comprises in its genome at least three LPP, wherein at least one LPP comprises second 3’ and second 5’ homology arms having less than 100 % sequence identity to the 3’ and 5’ homology arms of the at least two LPP, respectively, e.g., less than 99%, less than 98%, less than 97%,
less than 96%, less than 95%, less than 90%, less thank 85%, less than 80%, less than 75%, less than 70%, or less than 65%.
In one embodiment, the polynucleotide sequence of the 5’ homology arm is identical for all of the at least two LPP sequences.
In one embodiment, the polynucleotide sequence of the 3’ homology arm is identical for all of the at least two LPP sequences.
In one embodiment, wherein the 3’ homology arms and the 5’ homology arms are heterologous to the host cell.
In one embodiment, the 3’ homology arms and the 5’ homology arms are synthetic polynucleotide sequences.
In one embodiment, at least one LPP is located on the coding strand (3’ to 5’ direction), and at least one LPP is located on the template strand (5’ to 3’ direction).
In one embodiment, at least two LPPs are located on the coding strand (3’ to 5’ direction), and at least two LPPs are located on the template strand (5’ to 3’ direction).
In one embodiment, each LPP comprises two protospacer sequences, such as a first protospacer, and a second protospacer.
In one embodiment, each LPP comprises three different protospacer sequences, such as a first protospacer, a second protospacer, and a third protospacer.
In one embodiment, each LPP comprises four different protospacer sequences, such as a first protospacer, a second protospacer, a third protospacer, and a fourth protospacer.
In one embodiment, each LPP comprises five different protospacer sequences, such as a first protospacer, a second protospacer, a third protospacer, a fourth protospacer, and a fifth protospacer.
In one embodiment, the first protospacer has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the second protospacer, to the polynucleotide sequence of the third protospacer, to the polynucleotide sequence of the fourth protospacer, and to the polynucleotide sequence of the fifth protospacer.
In one embodiment, the second protospacer has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the first protospacer, to the polynucleotide sequence of the third protospacer, to the polynucleotide sequence of the fourth protospacer, and to the polynucleotide sequence of the fifth protospacer.
In one embodiment, the third protospacer has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the first protospacer, to the polynucleotide sequence of the second protospacer, to the polynucleotide sequence of the fourth protospacer, and to the polynucleotide sequence of the fifth protospacer.
In one embodiment, the fourth protospacer has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the first protospacer, to the polynucleotide sequence of the second protospacer, to the polynucleotide sequence of the third protospacer, and to the polynucleotide sequence of the fifth protospacer.
In one embodiment, wherein the fifth protospacer has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the first protospacer, to the polynucleotide sequence of the second protospacer, to the polynucleotide sequence of the third protospacer, and to the polynucleotide sequence of the fourth protospacer.
In one embodiment, each LPP comprises more than five different protospacer sequences.
In one embodiment, one or more of the at least two protospacer sequences is heterologous to the host cell.
In one embodiment, all of the at least two protospacer sequences are heterologous to the host cell.
In one embodiment, one or more of the at least two protospacer sequences is synthetic.
In one embodiment, all of the at least two protospacer sequences is synthetic.
In one embodiment, each protospacer sequence has a length of at least 15 nt, at least 20 nt, or at least 25 nt, preferably each protospacer has a length of 20 - 30 nt.
In one embodiment, the host cell comprises two LPP sequences, three LPP sequences, four LPP sequences, 5 LPP sequences, 6 LPP sequences, 7 LPP sequences, 8 LPP sequences, 9 LPP sequences, 10 LPP sequences, 11 LPP sequences, 12 LPP sequences, 13 LPP sequences, 14 LPP sequences, 15 LPP sequences, 16 LPP sequences, or more than 16 LPP sequences.
In one embodiment, the host cell comprises 2 - 8 LPP sequences.
In one embodiment, the host cell comprises at least two chromosomes, at least four chromosomes, at least six chromosomes or at least 8 chromosomes.
In one embodiment, one or more of the host cell chromosomes comprises a total of 1 LPP per chromosome.
In one embodiment, two or more chromosomes comprise a total of 1 LPP per chromosome.
In one embodiment, three chromosomes comprise a total of 1 LPP per chromosome.
In one embodiment, four chromosomes comprise a total of 1 LPP per chromosome.
In one embodiment, five chromosomes comprise a total of 1 LPP per chromosome.
In one embodiment, six chromosomes comprise a total of 1 LPP per chromosome.
In one embodiment, 7 chromosomes comprise a total of 1 LPP per chromosome.
In one embodiment, 8 chromosomes comprise a total of 1 LPP per chromosome.
In one embodiment, each chromosome comprises a total of 1 LPP per chromosome.
In one embodiment, LPP is located on the short arm of a chromosome.
In one embodiment, the LPP is located on the long arm of a chromosome.
In one embodiment, one or more of the chromosomes comprises a total of 2 LPP per chromosome.
In one embodiment, two or more chromosomes comprise a total of 2 LPP per chromosome.
In one embodiment, three chromosomes comprise a total of 2 LPP per chromosome.
In one embodiment, four chromosomes comprise a total of 2 LPP per chromosome.
In one embodiment, five chromosomes comprise a total of 2 LPP per chromosome.
In one embodiment, six chromosomes comprise a total of 2 LPP per chromosome.
In one embodiment, wherein 7 chromosomes comprise a total of 2 LPP per chromosome.
In one embodiment, 8 chromosomes comprise a total of 2 LPP per chromosome.
In one embodiment, each chromosome comprises a total of 2 LPP per chromosome.
In one embodiment, for each of the one or more chromosomes, the 2 LPP are located on opposite locations of the centromere of the chromosome.
In one embodiment, one LPP is located on the short arm, and one LPP is located on the long arm.
In one embodiment, one or more of the chromosomes comprises a total of 3 LPP per chromosome.
In one embodiment, wherein two or more chromosomes comprise a total of 3 LPP per chromosome.
In one embodiment, three chromosomes comprise a total of 3 LPP per chromosome.
In one embodiment, four chromosomes comprise a total of 3 LPP per chromosome.
In one embodiment, five chromosomes comprise a total of 3 LPP per chromosome.
In one embodiment, six chromosomes comprise a total of 3 LPP per chromosome.
In one embodiment, 7 chromosomes comprise a total of 3 LPP per chromosome.
In one embodiment, 8 chromosomes comprise a total of 3 LPP per chromosome.
In one embodiment, each chromosome comprises a total of 3 LPP per chromosome.
In one embodiment, comprising 3 LPP per chromosome, the ratio of numbers of LPP between short arm and long arm is selected from the list of 3:0, 0:3, 2:1 , and 1 :2.
In one embodiment, one or more of the chromosomes comprises a total of 4 LPP per chromosome.
In one embodiment, two or more chromosomes comprise a total of 4 LPP per chromosome.
In one embodiment, three chromosomes comprise a total of 4 LPP per chromosome.
In one embodiment, four chromosomes comprise a total of 4 LPP per chromosome.
In one embodiment, five chromosomes comprise a total of 4 LPP per chromosome.
In one embodiment, six chromosomes comprise a total of 4 LPP per chromosome.
In one embodiment, 7 chromosomes comprise a total of 4 LPP per chromosome.
In one embodiment, 8 chromosomes comprise a total of 4 LPP per chromosome.
In one embodiment, each chromosome comprises a total of 4 LPP per chromosome.
In one embodiment, for 4 LPP per chromosome, the ratio of numbers of LPP between short arm and long arm is selected from the list of 4:0, 0:4, 3:1 , 1 :3, 2:2, 2:1 , and 1 :2.
In one embodiment, the host cell is deficient of a non-homologous end joining (NHEJ) mechanism. The deficiency in the NHEJ mechanism increases the efficiency of homology directed repair and thus increases the efficiency of DNA integration using the method of the second aspect. Furthermore, NHEJ-deficiency in host cells increases the stability of the genome, also when there are empty LPPs present in the host cell genome.
In one embodiment, each 3’ homology arm and each 5’ homology arm has a length of at least 10 nt, at least 20 nt, at least 30 nt, at least 50 nt, at least 100 nt, 200 nucleotides (nt), at least 300 nt, at least 400 nt, at least 500 nt, at least 600 nt, at least 700 nt, at least 800 nt, at least 900 nt, or at least 1000 nt.
In one embodiment, each 3’ homology arm and each 5’ homology arm has a length of at least 800 nt, at least 900 nt, or at least 1000 nt.
In one embodiment, each 3’ homology arm and each 5’ homology arm has a length of at least 100 nt.
In one embodiment, one or more LPP of the at least two LPP comprise a selectable marker, and/or a counter-selectable marker.
In one embodiment, 2 or more LPP of the at least two LPP comprise a selectable marker, and/or a counter- selectable marker.
In one embodiment, 3 or more LPP of the at least two LPP comprise a selectable marker, and/or a counter- selectable marker.
In one embodiment, 4 or more LPP of the at least two LPP comprise a selectable marker, and/or a counter- selectable marker.
In one embodiment, 5 or more LPP of the at least two LPP comprise a selectable marker, and/or a counter- selectable marker.
In one embodiment, 6 or more LPP of the at least two LPP comprise a selectable marker, and/or a counter- selectable marker.
In one embodiment, 7 or more LPP of the at least two LPP comprise a selectable marker, and/or a counter- selectable marker.
In one embodiment, 8 or more LPP of the at least two LPP comprise a selectable marker, and/or a counter- selectable marker.
In one embodiment, the selectable marker comprises a marker selected from the list of a pyrG gene, a fcy1 gene, an adeA gene, an adeB gene, a metF gene, a metG gene, a metH gene, a lysF gene, a trpC gene, a sC gene, an argB gene, and a gene encoding a fluorescent protein, such as a GFP, YFP, CFP, DsRed, or a eqPF611 .
In one embodiment, the counter-selectable marker comprises a marker selected from the list of a pyrG gene, a fcy1 gene, an adeA gene, an adeB gene, a metF gene, a metG gene, a metH gene, a lysF gene, a trpC gene, a sC gene, an argB gene, and a gene encoding a fluorescent protein, such as a GFP, YFP, CFP, DsRed, or a eqPF611 .
In one embodiment, none of the at least two LPP comprises a selectable marker, and/or a counter- selectable marker.
In one embodiment, each DNA strain complementary to the two or more protospacers sequences comprises a PAM sequence (protospacer adjacent motif).
In one embodiment, the PAM sequence is selected from the list of “NGG”, “NGRRT”, “NGRRN”, “NNNNGATT”, “NNNNRYAC”, “NNAGAAW”, “TTTV”, “TTTC”, “TTTN”, “TTN”, “ATTN”, “GTTN”, and “CTTN” (“N” being any nucleotide, “Y” = C or T, “V”= A or C or G, “W”= A or T, “R”= A or G,), preferably the PAM sequence is TTTN, more preferably the PAM sequence is TTTC.
In one embodiment, the first, second, third, fourth or fifth protospacer sequence is selected from a sequence having a sequence identity of at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEW ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11 , SEQ ID NO: 12, or SEQ ID NO: 13.
In one embodiment, the 5’ homology arm sequences comprises or consists of a polynucleotide sequence having a sequence identity of at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to SEQ ID NO: 1 or SEQ ID NO: 2.
In one embodiment, the 5’ homology arm sequence comprises or consists of SEQ ID NO: 1 or SEQ ID NO: 2.
In one embodiment, the 3’ homology arm sequence comprises or consists of SEQ ID NO: 1 or SEQ ID NO: 2.
In one embodiment, the 3’ homology arm sequences comprises or consists of a polynucleotide sequence having a sequence identity of at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to SEQ ID NO: 1 or SEQ ID NO: 2.
In one embodiment, wherein each of the at least two LPP comprises a first protospacer sequence with a polynucleotide sequence identical for each first protospacer of each at least two LPP. When the host cell is targeted with a nuclease and gRNA complementary to the first protospacer, each LPP is subject for a double-strand break, and thus the gene of interest can be integrated into each of the LPPs of the host cell.
In one embodiment, each of the at least two LPP comprises a second protospacer sequence with a polynucleotide sequence which is unique for each second protospacer of each at least two LPP. When the host cell is targeted with a nuclease and gRNA complementary to the second protospacer, only one LPP is subject for a double-strand break, and thus the gene of interest can be integrated into only one LPP of the LPPs of the host cell.
In one embodiment, each of the at least two LPP comprises a third protospacer sequence with a polynucleotide sequence which is identical for each third polynucleotide within a third subset of the at least two LPPs. When the host cell is targeted with a nuclease and gRNA complementary to the third protospacer, the third subset of LPPs is subject for a double-strand break, and thus the gene of interest can be integrated into each of the LPPs of the third subset.
In one embodiment, the third subset consists of 2 LPP sequences, 3 LPP sequences, 4 LPP sequences, 5 LPP sequences, 6 LPP sequences, 7 LPP sequences, 8 LPP sequences, 9 LPP sequences, 10 LPP, sequences, 11 LPP sequences, 12 LPP sequences, 13 LPP sequences, 14 LPP sequences, or 15 LPP sequences.
In one embodiment, each of the at least two LPP comprises a fourth protospacer sequence with a polynucleotide sequence which is identical for each fourth polynucleotide within a fourth subset of the at least two LPPs. When the host cell is targeted with a nuclease and gRNA complementary to the fourth protospacer, the fourth subset of LPPs is subject for a double-strand break, and thus the gene of interest can be integrated into each of the LPPs of the fourth subset.
In one embodiment, the fourth subset consists of 2 LPP sequences, 3 LPP sequences, 4 LPP sequences, 5 LPP sequences, 6 LPP sequences, 7 LPP sequences, 8 LPP sequences, 9 LPP sequences, 10 LPP, sequences, 11 LPP sequences, 12 LPP sequences, 13 LPP sequences, 14 LPP sequences, or 15 LPP sequences.
In one embodiment, each of the at least two LPP comprises a fifth protospacer sequence with a polynucleotide sequence which is identical for each fifth protospacer within a fifth subset of the at least two LPPs. When the host cell is targeted with a nuclease and gRNA complementary to the fifth protospacer, the fifth subset of LPPs is subject for a double-strand break, and thus the gene of interest can be integrated into each of the LPPs of the fifth subset.
In one embodiment, the fifth subset consists of 2 LPP sequences, 3 LPP sequences, 4 LPP sequences, 5 LPP sequences, 6 LPP sequences, 7 LPP sequences, 8 LPP sequences, 9 LPP sequences, 10 LPP, sequences, 11 LPP sequences, 12 LPP sequences, 13 LPP sequences, 14 LPP sequences, or 15 LPP sequences.
In one embodiment, the host cell comprises three LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the three LPPs.
In one embodiment, the host cell comprises four LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the four LPPs.
In one embodiment, the host cell comprises four LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the four LPPs.
In one embodiment, the host cell comprises five LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the five LPPs.
In one embodiment, the host cell comprises five LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the five LPPs.
In one embodiment, the host cell comprises five LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the five LPPs.
In one embodiment, the host cell comprises six LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the six LPPs.
In one embodiment, the host cell comprises six LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the six LPPs.
In one embodiment, the host cell comprises six LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the six LPPs.
In one embodiment, the host cell comprises six LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the six LPPs.
In one embodiment, the host cell comprises 7 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 7 LPPs.
In one embodiment, the host cell comprises 7 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 7 LPPs.
In one embodiment, the host cell comprises 7 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 7 LPPs.
In one embodiment, the host cell comprises 7 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 7 LPPs.
In one embodiment, the host cell comprises 7 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 7 LPPs.
In one embodiment, the host cell comprises 8 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 8 LPPs.
In one embodiment, the host cell comprises 8 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 8 LPPs.
In one embodiment, the host cell comprises 8 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 8 LPPs.
In one embodiment, the host cell comprises 8 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 8 LPPs.
In one embodiment, the host cell comprises 8 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 8 LPPs.
In one embodiment, the host cell comprises 8 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 7 of the 8 LPPs.
In one embodiment, the host cell comprises 9 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 9 LPPs.
In one embodiment, the host cell comprises 9 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 9 LPPs.
In one embodiment, the host cell comprises 9 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 9 LPPs.
In one embodiment, the host cell comprises 9 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 9 LPPs.
In one embodiment, the host cell comprises 9 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 9 LPPs.
In one embodiment, the host cell comprises 9 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 7 of the 9 LPPs.
In one embodiment, the host cell comprises 9 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 8 of the 9 LPPs.
In one embodiment, the host cell comprises 10 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 10 LPPs.
In one embodiment, the host cell comprises 10 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 10 LPPs.
In one embodiment, the host cell comprises 10 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 10 LPPs.
In one embodiment, the host cell comprises 10 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 10 LPPs.
In one embodiment, the host cell comprises 10 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 10 LPPs.
In one embodiment, the host cell comprises 10 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 7 of the 10 LPPs.
In one embodiment, the host cell comprises 10 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 8 of the 10 LPPs.
In one embodiment, the host cell comprises 10 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 9 of the 10 LPPs.
In one embodiment, the host cell comprises 11 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 11 LPPs.
In one embodiment, the host cell comprises 11 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 11 LPPs.
In one embodiment, the host cell comprises 11 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 11 LPPs.
In one embodiment, the host cell comprises 11 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 11 LPPs.
In one embodiment, the host cell comprises 11 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 11 LPPs.
In one embodiment, the host cell comprises 11 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 7 of the 11 LPPs.
In one embodiment, the host cell comprises 11 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 8 of the 11 LPPs.
In one embodiment, the host cell comprises 11 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 9 of the 11 LPPs.
In one embodiment, the host cell comprises 11 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 10 of the 11 LPPs.
In one embodiment, the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 12 LPPs.
In one embodiment, the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 12 LPPs.
In one embodiment, the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 12 LPPs.
In one embodiment, the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 12 LPPs.
In one embodiment, the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 12 LPPs.
In one embodiment, the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 7 of the 12 LPPs.
In one embodiment, the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 8 of the 12 LPPs.
In one embodiment, the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 9 of the 12 LPPs.
In one embodiment, the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 10 of the 12LPPs.
In one embodiment, the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 11 of the 12 LPPs.
In one embodiment, the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 13 LPPs.
In one embodiment, the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 13 LPPs.
In one embodiment, the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 13 LPPs.
In one embodiment, the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 13 LPPs.
In one embodiment, the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 13 LPPs.
In one embodiment, the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 7 of the 13 LPPs.
In one embodiment, the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 8 of the 13 LPPs.
In one embodiment, the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 9 of the 13 LPPs.
In one embodiment, the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 10 of the 13 LPPs.
In one embodiment, the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 11 of the 13 LPPs.
In one embodiment, the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 12 of the 13 LPPs.
In one embodiment, the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 14 LPPs.
In one embodiment, the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 14 LPPs.
In one embodiment, the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 14 LPPs.
In one embodiment, the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 14 LPPs.
In one embodiment, the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 14 LPPs.
In one embodiment, the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 7 of the 14 LPPs.
In one embodiment, the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 8 of the 14 LPPs.
In one embodiment, the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 9 of the 14 LPPs.
In one embodiment, the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 10 of the 14 LPPs.
In one embodiment, the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 11 of the 14 LPPs.
In one embodiment, the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 12 of the 14 LPPs.
In one embodiment, the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 13 of the 14 LPPs.
In one embodiment, the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 15 LPPs.
In one embodiment, the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 15 LPPs.
In one embodiment, the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 15 LPPs.
In one embodiment, the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 15 LPPs.
In one embodiment, the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 15 LPPs.
In one embodiment, the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 7 of the 15 LPPs.
In one embodiment, the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 8 of the 15 LPPs.
In one embodiment, the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 9 of the 15 LPPs.
In one embodiment, the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 10 of the 15 LPPs.
In one embodiment, the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 11 of the 15 LPPs.
In one embodiment, the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 12 of the 15 LPPs.
In one embodiment, the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 13 of the 15 LPPs.
In one embodiment, the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 14 of the 15 LPPs.
In one embodiment, the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 16 LPPs.
In one embodiment, the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 16 LPPs.
In one embodiment, the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 16 LPPs.
In one embodiment, the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 16 LPPs.
In one embodiment, the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 16 LPPs.
In one embodiment, the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 7 of the 16 LPPs.
In one embodiment, the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 8 of the 16 LPPs.
In one embodiment, the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 9 of the 16 LPPs.
In one embodiment, the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 10 of the 16 LPPs.
In one embodiment, the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 11 of the 16 LPPs.
In one embodiment, the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 12 of the 16 LPPs.
In one embodiment, the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 13 of the 16 LPPs.
In one embodiment, the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 14 of the 16 LPPs.
In one embodiment, the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 15 of the 16 LPPs.
In one embodiment, the host cell is a fungal cell.
In one embodiment, the host cell is a yeast recombinant host cell, e.g., a Candida, Hansenula, Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia cell, such as a Kluyveromyces lactis, Saccharomyces carlsbergensis, Saccharomyces cerevisiae, Saccharomyces diastaticus, Saccharomyces douglasii, Saccharomyces kluyveri, Saccharomyces norbensis, Saccharomyces oviformis, or Yarrowia lipolytica cell.
In one embodiment, the host cell is a filamentous fungal recombinant host cell, e.g., an Acremonium, Aspergillus, Aureobasidium, Bjerkandera, Ceriporiopsis, Chrysosporium, Coprinus, Coriolus, Cryptococcus, Filibasidium, Fusarium, Humicola, Magnaporthe, Mucor, Myceliophthora, Neocallimastix, Neurospora, Paecilomyces, Penicillium, Phanerochaete, Phlebia, Piromyces, Pleurotus, Schizophyllum, Talaromyces, Thermoascus, Thielavia, Tolypocladium, Trametes, or Trichoderma cell, in particular, an Aspergillus awamori, Aspergillus foetidus, Aspergillus fumigatus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus niger, Aspergillus oryzae, Bjerkandera adusta, Ceriporiopsis aneirina, Ceriporiopsis caregiea, Ceriporiopsis gilvescens, Ceriporiopsis pannocinta, Ceriporiopsis rivulosa, Ceriporiopsis subrufa, Ceriporiopsis subvermispora, Chrysosporium inops, Chrysosporium keratinophilum, Chrysosporium lucknowense, Chrysosporium merdarium, Chrysosporium pannicola, Chrysosporium queenslandicum, Chrysosporium tropicum, Chrysosporium zonatum, Coprinus cinereus, Coriolus hirsutus, Fusarium bactridioides, Fusarium cerealis, Fusarium crookwellense, Fusarium culmorum, Fusarium graminearum, Fusarium graminum, Fusarium heterosporum, Fusarium negundi, Fusarium oxysporum, Fusarium reticulatum, Fusarium roseum, Fusarium sambucinum, Fusarium sarcochroum, Fusarium sporotrichioides, Fusarium sulphureum, Fusarium torulosum, Fusarium trichothecioides, Fusarium venenatum, Humicola insolens, Humicola lanuginosa, Mucor miehei, Myceliophthora thermophila, Neurospora crassa, Penicillium purpurogenum, Phanerochaete chrysosporium, Phlebia radiata, Pleurotus eryngii, Talaromyces emersonii, Thielavia terrestris, Trametes villosa, Trametes versicolor, Trichoderma harzianum, Trichoderma koningii, Trichoderma longibrachiatum, Trichoderma reesei, or Trichoderma viride cell.
In one embodiment, the host cell is an Aspergillus cell, preferably an Aspergillus niger cell, or an Aspergillus oryzae cell.
In one embodiment, the host cell is a Trichoderma cell, preferably a Trichderma reesei cell.
In one embodiment, the host cell is isolated.
In one embodiment, the host cell is purified.
In one embodiment, the host cell is a prokaryotic recombinant host cell, e.g., a Gram-positive cell selected from the group consisting of Bacillus, Clostridium, Enterococcus, Geobacillus, Lactobacillus, Lactococcus, Oceanobacillus, Staphylococcus, Streptococcus, or Streptomyces cells, or a Gram-negative bacteria selected from the group consisting of Campylobacter, E. coll, Flavobacterium, Fusobacterium, Helicobacter, llyobacter, Neisseria, Pseudomonas, Salmonella, and Ureaplasma cells, such as Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, Bacillus firmus, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megaterium, Bacillus pumilus, Bacillus stearothermophilus, Bacillus subtilis, Bacillus thuringiensis, Streptococcus equisimilis, Streptococcus pyogenes, Streptococcus uberis, and Streptococcus equi subsp. Zooepidemicus, Streptomyces achromogenes, Streptomyces avermitilis, Streptomyces coelicolor, Streptomyces griseus, and Streptomyces lividans cells.
In one embodiment, the host cell is a Bacillus cell.
In one embodiment, the host cell is a Bacillus licheniformis cell.
In one embodiment, the host cell is a Bacillus subtilis cell.
In one embodiment, the host cell is a prokaryotic host cell and wherein at least one LPP is located between two selectable and/or counter-selectable marker genes.
In one embodiment, the host cell is a mammalian cell.
In one embodiment, the mammalian cell is selected from the list of a human cell, a mouse cell, a rat cell, a mouse hybridoma cell, a hamser cell (e.g. Chinese hamster ovary cell), and a rat hybridoma cell.
A construct or vector comprising a polynucleotide is introduced into a host cell so that the construct or vector is maintained as a chromosomal integrant or as a self-replicating extra-chromosomal vector as described earlier. The choice of a host cell will to a large extent depend upon the gene encoding the polypeptide and its source. The polypeptide can be native or heterologous to the recombinant host cell. Also, at least one of the one or more control sequences can be heterologous to the polynucleotide encoding the polypeptide of interest. The recombinant host cell may comprise at least two copies, e.g., three, four, five, or more copies of the polynucleotide of the present invention.
The host cell may be any microbial cell useful in the recombinant production of a polypeptide of interest, e.g., a prokaryotic cell or a fungal cell.
The prokaryotic host cell may be any Gram-positive or Gram-negative bacterium. Gram-positive bacteria include, but are not limited to, Bacillus, Clostridium, Enterococcus, Geobacillus, Lactobacillus, Lactococcus, Oceanobacillus, Staphylococcus, Streptococcus, and Streptomyces. Gram-negative bacteria
include, but are not limited to, Campylobacter, E. coli, Flavobacterium, Fusobacterium, Helicobacter, llyobacter, Neisseria, Pseudomonas, Salmonella, and Ureaplasma.
The bacterial host cell may be any Bacillus cell including, but not limited to, Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, Bacillus firmus, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megaterium, Bacillus pumilus, Bacillus stearothermophilus, Bacillus subtilis, and Bacillus thuringiensis cells. In an embodiment, the Bacillus cell is a Bacillus amyloliquefaciens, Bacillus licheniformis and Bacillus subtilis cell.
For purposes of this invention, Bacillus classes/genera/species shall be defined as described in Patel and Gupta, 2020, Int. J. Syst. Evol. Microbiol. 70: 406-438.
The bacterial host cell may also be any Streptococcus cell including, but not limited to, Streptococcus equisimilis, Streptococcus pyogenes, Streptococcus uberis, and Streptococcus equi subsp. Zooepidemicus cells.
The bacterial host cell may also be any Streptomyces cell including, but not limited to, Streptomyces achromogenes, Streptomyces avermitilis, Streptomyces coelicolor, Streptomyces griseus, and Streptomyces lividans cells.
Methods for introducing DNA into prokaryotic host cells are well-known in the art, and any suitable method can be used including but not limited to protoplast transformation, competent cell transformation, electroporation, conjugation, transduction, with DNA introduced as linearized or as circular polynucleotide. Persons skilled in the art will be readily able to identify a suitable method for introducing DNA into a given prokaryotic cell depending, e.g., on the genus. Methods for introducing DNA into prokaryotic host cells are for example described in Heinze et al., 2018, BMC Microbiology 18:56, Burke et al., 2001 , Proc. Natl. Acad. Sci. USA 98: 6289-6294, Choi et al., 2006, J. Microbiol. Methods 64: 391-397, and Donald et al., 2013, J. Bacteriol. 195(1 1): 2612-2620.
The host cell may be a fungal cell. “Fungi” as used herein includes the phyla Ascomycota, Basidiomycota, Chytridiomycota, and Zygomycota as well as the Oomycota and all mitosporic fungi (as defined by Hawksworth et al., In, Ainsworth and Bisby’s Dictionary of The Fungi, 8th edition, 1995, CAB International, University Press, Cambridge, UK).
Fungal cells may be transformed by a process involving protoplast-mediated transformation, Agrobacterium-mediated transformation, electroporation, biolistic method and shock-wave-mediated transformation as reviewed by Li et al., 2017, Microbial Cell Factories 16: 168 and procedures described in EP 238023, Yelton et al., 1984, Proc. Natl. Acad. Sci. USA 81 : 1470-1474, Christensen et al., 1988, Bio/Technology 6: 1419-1422, and Lubertozzi and Keasling, 2009, Biotechn. Advances 27: 53-75. However, any method known in the art for introducing DNA into a fungal host cell can be used, and the DNA can be introduced as linearized or as circular polynucleotide.
The fungal host cell may be a yeast cell. “Yeast” as used herein includes ascosporogenous yeast (Endomycetales), basidiosporogenous yeast, and yeast belonging to the Fungi Imperfecti (Blastomycetes). For purposes of this invention, yeast shall be defined as described in Biology and Activities of Yeast (Skinner, Passmore, and Davenport, editors, Soc. App. Bacteriol. Symposium Series No. 9, 1980).
The yeast host cell may be a Candida, Hansenula, Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia cell, such as a Kluyveromyces lactis, Saccharomyces carlsbergensis, Saccharomyces cerevisiae, Saccharomyces diastaticus, Saccharomyces douglasii, Saccharomyces kluyveri, Saccharomyces norbensis, Saccharomyces oviformis, or Yarrowia lipolytica cell. In a preferred
embodiment, the yeast host cell is a Pichia or Komagataella cell, e.g., a Pichia pastoris cell (Komagataella phaffii).
The fungal host cell may be a filamentous fungal cell. “Filamentous fungi” include all filamentous forms of the subdivision Eumycota and Oomycota (as defined by Hawksworth et al., 1995, supra). The filamentous fungi are generally characterized by a mycelial wall composed of chitin, cellulose, glucan, chitosan, mannan, and other complex polysaccharides. Vegetative growth is by hyphal elongation and carbon catabolism is obligately aerobic. In contrast, vegetative growth by yeasts such as Saccharomyces cerevisiae is by budding of a unicellular thallus and carbon catabolism may be fermentative.
The filamentous fungal host cell may be an Acremonium, Aspergillus, Aureobasidium, Bjerkandera, Ceriporiopsis, Chrysosporium, Coprinus, Coriolus, Cryptococcus, Filibasidium, Fusarium, Humicola, Magnaporthe, Mucor, Myceliophthora, Neocallimastix, Neurospora, Paecilomyces, Penicillium, Phanerochaete, Phlebia, Piromyces, Pleurotus, Schizophyllum, Talaromyces, Thermoascus, Thielavia, Tolypocladium, Trametes, or Trichoderma cell. In a preferred embodiment, the filamentous fungal host cell is an Aspergillus, Trichoderma or Fusarium cell. In a further preferred embodiment, the filamentous fungal host cell is an Aspergillus niger, Aspergillus oryzae, Trichoderma reesei, or Fusarium venenatum cell.
For example, the filamentous fungal host cell may be an Aspergillus awamori, Aspergillus foetidus, Aspergillus fumigatus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus niger, Aspergillus oryzae, Bjerkandera adusta, Ceriporiopsis aneirina, Ceriporiopsis caregiea, Ceriporiopsis gilvescens, Ceriporiopsis pannocinta, Ceriporiopsis rivulosa, Ceriporiopsis subrufa, Ceriporiopsis subvermispora, Chrysosporium inops, Chrysosporium keratinophilum, Chrysosporium lucknowense, Chrysosporium merdarium, Chrysosporium pannicola, Chrysosporium queenslandicum, Chrysosporium tropicum, Chrysosporium zonatum, Coprinus cinereus, Coriolus hirsutus, Fusarium bactridioides, Fusarium cerealis, Fusarium crookwellense, Fusarium culmorum, Fusarium graminearum, Fusarium graminum, Fusarium heterosporum, Fusarium negundi, Fusarium oxysporum, Fusarium reticulatum, Fusarium roseum, Fusarium sambucinum, Fusarium sarcochroum, Fusarium sporotrichioides, Fusarium sulphureum, Fusarium torulosum, Fusarium trichothecioides, Fusarium venenatum, Humicola insolens, Humicola lanuginosa, Mucor miehei, Myceliophthora thermophila, Neurospora crassa, Penicillium purpurogenum, Phanerochaete chrysosporium, Phlebia radiata, Pleurotus eryngii, Talaromyces emersonii, Thielavia terrestris, Trametes villosa, Trametes versicolor, Trichoderma harzianum, Trichoderma koningii, Trichoderma longibrachiatum, Trichoderma reesei, or Trichoderma viride cell.
Landing Pad Polynucleotides (LPPs)
The present invention also relates to at least two LPPs present in the host cell genome, as described herein. Each LPP comprising a 5’ homology arm, two or more protospacers, and a 3’ homology arm.
The LPP may be a genomic DNA, a cDNA, a synthetic DNA, or a combination thereof.
In an aspect, the LPP comprises an protospacer array of at least 2 protospacers comprising at least 50 nucleotides (e.g., nucleotides 1 to 50 of SEQ ID NOs: 14 -17).
In an aspect, the LPP comprises an protospacer array of at least 3 protospacers comprising at least 75 nucleotides (e.g., nucleotides 1 to 75 of SEQ ID NOs: 14 -17).
In an aspect, the LPP comprises an protospacer array of at least 4 protospacers comprising at least 100 nucleotides (e.g., nucleotides 1 to 100 of SEQ ID NOs: 14 -17).
In an aspect, the LPP comprises an protospacer array of at least 5 protospacers comprising at least 125 nucleotides (e.g., nucleotides 1 to 125 of SEQ ID NOs: 14 -17).
The protospacers of the LPP may also be mutated by introduction of nucleotide substitutions, or by introduction of nucleotide substitutions that may give rise to a different target sequence. The homology arms of the LPP may also be mutated by introduction of nucleotide substitutions, or by introduction of nucleotide substitutions that may give rise to a different homology arm sequence. For a general description of nucleotide substitution, see, e.g., Ford et al., 1991 , Protein Expression and Purification 2: 95-107.
In an aspect, the LPP is isolated.
In another aspect, the LPP is purified.
Nucleic Acid Constructs
The present invention also relates to nucleic acid constructs comprising an LPP, and to nucleic acid constructs comprising a donor polynucleotide of the present invention.
In the donor polynucleotide, the polynucleotide encoding a polypeptide of interest is operably linked to one or more control sequences that direct the expression of the coding sequence in a suitable host cell under conditions compatible with the control sequences.
The donor polynucleotide may be manipulated in a variety of ways to provide for expression of the polypeptide. Manipulation of the polynucleotide prior to its insertion into a vector may be desirable or necessary depending on the expression vector. Techniques for modifying polynucleotides utilizing recombinant DNA methods are well known in the art.
Promoters
The control sequence may be a promoter, a polynucleotide that is recognized by a host cell for expression of a polynucleotide encoding a polypeptide of the present invention. The promoter contains transcriptional control sequences that mediate the expression of the polypeptide. The promoter may be any polynucleotide that shows transcriptional activity in the host cell including mutant, truncated, and hybrid promoters, and may be obtained from genes encoding extracellular or intracellular polypeptides either homologous or heterologous to the host cell.
Examples of suitable promoters for directing transcription of the polynucleotide of the present invention in a bacterial host cell are described in Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Lab., NY, Davis et al., 2012, supra, and Song et al., 2016, PLOS One 11 (7): e0158447.
Examples of suitable promoters for directing transcription of the polynucleotide of the present invention in a filamentous fungal host cell are promoters obtained from Aspergillus, Fusarium, Rhizomucor and Trichoderma cells, such as the promoters described in Mukherjee et al., 2013, “Trichoderma-. Biology and Applications”, and by Schmoll and Dattenbbck, 2016, “Gene Expression Systems in Fungi: Advancements and Applications”, Fungal Biology.
For expression in a yeast host, examples of useful promoters are described by Smolke et al., 2018, “Synthetic Biology: Parts, Devices and Applications” (Chapter 6: Constitutive and Regulated Promoters in Yeast: How to Design and Make Use of Promoters in S. cerevisiae), and by Schmoll and Dattenbbck, 2016, “Gene Expression Systems in Fungi: Advancements and Applications”, Fungal Biology.
Terminators
The control sequence may also be a transcription terminator, which is recognized by a host cell to terminate transcription. The terminator is operably linked to the 3’-terminus of the polynucleotide encoding the polypeptide. Any terminator that is functional in the host cell may be used in the present invention.
Preferred terminators for bacterial host cells may be obtained from the genes for Bacillus clausii alkaline protease (aprH), Bacillus licheniformis alpha-amylase (amyL), and Escherichia coll ribosomal RNA (rrnS).
Preferred terminators for filamentous fungal host cells may be obtained from Aspergillus or Trichoderma species, such as obtained from the genes for Aspergillus niger glucoamylase, Trichoderma reesei beta-glucosidase, Trichoderma reesei cellobiohydrolase I, and Trichoderma reesei endoglucanase I, such as the terminators described in Mukherjee et al., 2013, “Trichoderma-. Biology and Applications”, and by Schmoll and Dattenbbck, 2016, “Gene Expression Systems in Fungi: Advancements and Applications”, Fungal Biology.
Preferred terminators for yeast host cells may be obtained from the genes for Saccharomyces cerevisiae enolase, Saccharomyces cerevisiae cytochrome C (CYC1), and Saccharomyces cerevisiae glyceraldehyde-3-phosphate dehydrogenase. Other useful terminators for yeast host cells are described by Romanos et al., 1992, Yeast 8: 423-488. mRNA Stabilizers
The control sequence may also be an mRNA stabilizer region downstream of a promoter and upstream of the coding sequence of a gene which increases expression of the gene.
Examples of suitable mRNA stabilizer regions are obtained from a Bacillus thuringiensis crylllA gene (WO 94/25612) and a Bacillus subtilis SP82 gene (Hue et al., 1995, J. Bacteriol. 177: 3465-3471).
Examples of mRNA stabilizer regions for fungal cells are described in Geisberg et al., 2014, Cell 156(4): 812-824, and in Morozov et al., 2006, Eukaryotic Ce// 5(11): 1838-1846.
Leader Sequences
The control sequence may also be a leader, a non-translated region of an mRNA that is important for translation by the host cell. The leader is operably linked to the 5’-terminus of the polynucleotide encoding the polypeptide. Any leader that is functional in the host cell may be used.
Suitable leaders for bacterial host cells are described by Hambraeus et al., 2000, Microbiology 146(12): 3051-3059, and by Kaberdin and Blasi, 2006, FEMS Microbiol. Rev. 30(6): 967-979.
Preferred leaders for filamentous fungal host cells may be obtained from the genes for Aspergillus oryzae TAKA amylase and Aspergillus nidulans triose phosphate isomerase.
Suitable leaders for yeast host cells may be obtained from the genes for Saccharomyces cerevisiae enolase (ENO-1), Saccharomyces cerevisiae 3-phosphoglycerate kinase, Saccharomyces cerevisiae alpha-factor, and Saccharomyces cerevisiae alcohol dehydrogenase/glyceraldehyde- 3-phosphate dehydrogenase (ADH2/GAP).
Polyadenylation Sequences
The control sequence may also be a polyadenylation sequence, a sequence operably linked to the 3’-terminus of the polynucleotide which, when transcribed, is recognized by the host cell as a signal to add
polyadenosine residues to transcribed mRNA. Any polyadenylation sequence that is functional in the host cell may be used.
Preferred polyadenylation sequences for filamentous fungal host cells are obtained from the genes for Aspergillus nidulans anthranilate synthase, Aspergillus niger glucoamylase, Aspergillus niger alphaglucosidase, Aspergillus oryzae TAKA amylase, and Fusarium oxysporum trypsin-like protease.
Useful polyadenylation sequences for yeast host cells are described by Guo and Sherman, 1995, Mol. Cellular Biol. 15: 5983-5990.
Signal Peptides
The control sequence may also be a signal peptide coding region that encodes a signal peptide linked to the N-terminus of a polypeptide and directs the polypeptide into the cell’s secretory pathway. The 5’-end of the coding sequence of the polynucleotide may inherently contain a signal peptide coding sequence naturally linked in translation reading frame with the segment of the coding sequence that encodes the polypeptide. Alternatively, the 5’-end of the coding sequence may contain a signal peptide coding sequence that is heterologous to the coding sequence. A heterologous signal peptide coding sequence may be required where the coding sequence does not naturally contain a signal peptide coding sequence. Alternatively, a heterologous signal peptide coding sequence may simply replace the natural signal peptide coding sequence to enhance secretion of the polypeptide. Any signal peptide coding sequence that directs the expressed polypeptide into the secretory pathway of a host cell may be used.
Effective signal peptide coding sequences for bacterial host cells are the signal peptide coding sequences obtained from the genes for Bacillus NCIB 11837 maltogenic amylase, Bacillus licheniformis subtilisin, Bacillus licheniformis beta-lactamase, Bacillus stearothermophilus alpha-amylase, Bacillus stearothermophilus neutral proteases (nprT, nprS, nprM), and Bacillus subtilis prsA. Further signal peptides are described by Freudl, 2018, Microbial Cell Factories 17: 52.
Effective signal peptide coding sequences for filamentous fungal host cells are the signal peptide coding sequences obtained from the genes for Aspergillus niger neutral amylase, Aspergillus niger glucoamylase, Aspergillus oryzae TAKA amylase, Humicola insolens cellulase, Humicola insolens endoglucanase V, Humicola lanuginosa lipase, and Rhizomucor miehei aspartic proteinase, such as the signal peptide described by Xu et al., 2018, Biotechnology Letters 40: 949-955
Useful signal peptides for yeast host cells are obtained from the genes for Saccharomyces cerevisiae alpha-factor and Saccharomyces cerevisiae invertase. Other useful signal peptide coding sequences are described by Romanos et al., 1992, supra.
Propeptides
The control sequence may also be a propeptide coding sequence that encodes a propeptide positioned at the N-terminus of a polypeptide. The resultant polypeptide is known as a proenzyme or propolypeptide (or a zymogen in some cases). A propolypeptide is generally inactive and can be converted to an active polypeptide by catalytic or autocatalytic cleavage of the propeptide from the propolypeptide. The propeptide coding sequence may be obtained from the genes for Bacillus subtilis alkaline protease (aprE), Bacillus subtilis neutral protease (nprT), Myceliophthora thermophila laccase (WO 95/33836), Rhizomucor miehei aspartic proteinase, and Saccharomyces cerevisiae alpha-factor.
Where both signal peptide and propeptide sequences are present, the propeptide sequence is positioned next to the N-terminus of a polypeptide and the signal peptide sequence is positioned next to the N-terminus of the propeptide sequence. Additionally or alternatively, when both signal peptide and propeptide sequences are present, the polypeptide may comprise only a part of the signal peptide sequence and/or only a part of the propeptide sequence. Alternatively, the final or isolated polypeptide may comprise a mixture of mature polypeptides and polypeptides which comprise, either partly or in full length, a propeptide sequence and/or a signal peptide sequence.
Regulatory Sequences
It may also be desirable to add regulatory sequences that regulate expression of the polypeptide relative to the growth of the host cell. Examples of regulatory sequences are those that cause expression of the gene to be turned on or off in response to a chemical or physical stimulus, including the presence of a regulatory compound. Regulatory sequences in prokaryotic systems include the lac, tac, and trp operator systems. In yeast, the ADH2 system or GAL1 system may be used. In filamentous fungi, the Aspergillus niger glucoamylase promoter, Aspergillus oryzae TAKA alpha-amylase promoter, and Aspergillus oryzae glucoamylase promoter, Trichoderma reesei cellobiohydrolase I promoter, and Trichoderma reesei cellobiohydrolase II promoter may be used. Other examples of regulatory sequences are those that allow for gene amplification. In fungal systems, these regulatory sequences include the dihydrofolate reductase gene that is amplified in the presence of methotrexate, and the metallothionein genes that are amplified with heavy metals.
Transcription Factors
The control sequence may also be a transcription factor, a polynucleotide encoding a polynucleotide-specific DNA-binding polypeptide that controls the rate of the transcription of genetic information from DNA to mRNA by binding to a specific polynucleotide sequence. The transcription factor may function alone and/or together with one or more other polypeptides or transcription factors in a complex by promoting or blocking the recruitment of RNA polymerase. Transcription factors are characterized by comprising at least one DNA-binding domain which often attaches to a specific DNA sequence adjacent to the genetic elements which are regulated by the transcription factor. The transcription factor may regulate the expression of a protein of interest either directly, i.e., by activating the transcription of the gene encoding the protein of interest by binding to its promoter, or indirectly, i.e., by activating the transcription of a further transcription factor which regulates the transcription of the gene encoding the protein of interest, such as by binding to the promoter of the further transcription factor. Suitable transcription factors for fungal host cells are described in WO 2017/144177. Suitable transcription factors for prokaryotic host cells are described in Seshasayee et al., 2011 , Subcellular Biochemistry 52: 7-23, as well in Balleza et al., 2009, FEMS Microbiol. Rev. 33(1): 133-151.
Expression Vectors
The present invention also relates to recombinant expression vectors comprising a donor polynucleotide of the present invention, a promoter, and transcriptional and translational stop signals. The various nucleotide, homology arms and control sequences may be joined together to produce a recombinant expression vector that may include one or more convenient restriction sites to allow for
insertion or substitution of the donor polynucleotide encoding the polypeptide at the LPPs. In creating the expression vector, the coding sequence is located in the vector so that the coding sequence is operably linked with the appropriate control sequences for expression.
The recombinant expression vector may be any vector (e.g., a plasmid or virus) that can be conveniently subjected to recombinant DNA procedures and can bring about expression of the polynucleotide of interest. The choice of the vector will typically depend on the compatibility of the vector with the host cell into which the vector is to be introduced. The vector may be a linear or closed circular plasmid.
The vector may be an autonomously replicating vector, i.e., a vector that exists as an extrachromosomal entity, the replication of which is independent of chromosomal replication, e.g., a plasmid, an extrachromosomal element, a minichromosome, or an artificial chromosome. The vector may contain any means for assuring self-replication. Alternatively, the vector may be one that, when introduced into the host cell, is integrated into the genome and replicated together with the chromosome(s) into which it has been integrated. Furthermore, a single vector or plasmid or two or more vectors or plasmids that together contain the total DNA to be introduced into the genome of the host cell, or a transposon, may be used.
The vector may contain one or more selectable markers that permit easy selection of transformed, transfected, transduced, or the like cells. A selectable marker is a gene the product of which provides for biocide or viral resistance, resistance to heavy metals, prototrophy to auxotrophs, and the like.
For integration into the host cell genome, the vector may rely on homology arms which facilitate integration via homology-directed repair (HDR).
For autonomous replication, the vector may further comprise an origin of replication enabling the vector to replicate autonomously in the host cell in question. The origin of replication may be any plasmid replicator mediating autonomous replication that functions in a cell. The term “origin of replication” or “plasmid replicator” means a polynucleotide that enables a plasmid or vector to replicate in vivo.
More than one copy of a donor polynucleotide of the present invention may be inserted into a host cell to increase production of a polypeptide of interest. For example, 2 or 3 or 4 or 5 or more copies are inserted into a host cell.
Method for generating multi-copy host cells
In a second aspect, the present invention relates to a method for generating a multi-copy host cell, comprising the steps of: a) providing a host cell according to the first aspect, b) delivering to the host cell a gene editing system comprising:
- a RNA-guided DNA nuclease,
- at least one guide RNA directed to one or more protospacer of the one or more LPP sequences,
- a first donor polynucleotide comprising in 5’ to 3’ direction: a 5’ homology arm, a first polynucleotide encoding a first polypeptide of interest, and a 3’ homology arm, OR a set of first PCR fragments comprising a first 5’ homology arm, a first 3’ homology arm, and a first
polynucleotide encoding a first polypetide of interest wherein the 3’ homology arm is identical to the 3’ homology arm of one or more LPP, and wherein the 5’ homology arm is identical to the 5’ homology arm of one or more LPP, and c) cultivating the host cell at a first temperature allowing one or more first guide RNA of the at least one guide RNA to form a complex with the nuclease at one or more protospacer of one or more LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one or more LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break.
In one embodiment, the first polypeptide is heterologous to the host cell.
In one embodiment, the first polynucleotide is operably linked to one or more control sequences that direct the production of the first polypeptide.
In one embodiment, at least one of the one or more control sequences is heterologous to the first polynucleotide.
In one embodiment, the first guide RNA is complementary to one or more first protospacers, wherein during step c) the complex is formed at one or more first protospacer of the one or more LPP, creating a doublestrand break at or near the one or more first protospacer.
In one embodiment, the first guide RNA is complementary to one or more second protospacers, wherein during step c) the complex is formed at one or more second protospacer of the one or more LPP, creating a double-strand break at or near the one or more second protospacer.
In one embodiment, the first guide RNA is complementary to one or more third protospacers, wherein during step c) the complex is formed at one or more third protospacer of the one or more LPP, creating a double-strand break at or near the one or more third protospacer.
In one embodiment, the first guide RNA is complementary to one or more fourth protospacers, wherein during step c) the complex is formed at one or more fourth protospacer of the one or more LPP, creating a double-strand break at or near the one or more fourth protospacer.
In one embodiment, the first guide RNA is complementary to one or more fifth protospacers, wherein during step c) the complex is formed at one or more fifth protospacer of the one or more LPP, creating a doublestrand break at or near the one or more fifth protospacer.
In one embodiment, during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of one LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with one copy of the first donor polynucleotide in its genome.
In one embodiment, during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of two LPP, and wherein the complex promotes
the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the two LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with two copies of the first donor polynucleotide in its genome.
In one embodiment, during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of three LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the three LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with three copies of the first donor polynucleotide in its genome.
In one embodiment, during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of four LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the four LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with four copies of the first donor polynucleotide in its genome.
In one embodiment, during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of five LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the five LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with five copies of the first donor polynucleotide in its genome.
In one embodiment, during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of six LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the six LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with six copies of the first donor polynucleotide in its genome.
In one embodiment, during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 7 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 7 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 7 copies of the first donor polynucleotide in its genome.
In one embodiment, during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 8 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 8 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first
PCR fragments at the double-strand break to provide a host cell with 8 copies of the first donor polynucleotide in its genome.
In one embodiment, during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 9 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 9 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 9 copies of the first donor polynucleotide in its genome.
In one embodiment, during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 10 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 10 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 10 copies of the first donor polynucleotide in its genome.
In one embodiment, during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 11 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 11 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 11 copies of the first donor polynucleotide in its genome.
In one embodiment, during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 12 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 12 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 12 copies of the first donor polynucleotide in its genome.
In one embodiment, during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 13 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 13 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 13 copies of the first donor polynucleotide in its genome.
In one embodiment, during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 14 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 14 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 14 copies of the first donor polynucleotide in its genome.
In one embodiment, during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 15 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 15 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 15 copies of the first donor polynucleotide in its genome.
In one embodiment, during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 16 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 16 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 16 copies of the first donor polynucleotide in its genome.
In one embodiment, during step b) there are delivered at least two first guide RNAs, each first guide RNA forming a complex with the nuclease at a first, second, third, fourth, or fifth protospacer.
In one embodiment, the at least two first guide RNAs comprise at least two of: a 1st first guide RNA, a 2nd first guide RNA, a 3rd first guide RNA, a 4th first guide RNA, and a 5th first guide RNA.
In one embodiment, the 1st first guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 2nd first guide RNA, to the polynucleotide sequence of 3rd first guide RNA, to the polynucleotide sequence of the 4th first guide RNA, and to the polynucleotide sequence of the 5th first guide RNA.
In one embodiment, the 2nd first guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1st first guide RNA, to the polynucleotide sequence of 3rd first guide RNA, to the polynucleotide sequence of the 4th first guide RNA, and to the polynucleotide sequence of the 5th first guide RNA.
In one embodiment, the 3rd first guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1st first guide RNA, to the polynucleotide sequence of 2nd first guide RNA, to the polynucleotide sequence of the 4th first guide RNA, and to the polynucleotide sequence of the 5th first guide RNA.
In one embodiment, the 4th first guide RNA has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%,
below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1st first guide RNA, to the polynucleotide sequence of 2nd first guide RNA, to the polynucleotide sequence of the 3rd first guide RNA, and to the polynucleotide sequence of the 5th first guide RNA.
In one embodiment, the 5th first guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1st first guide RNA, to the polynucleotide sequence of 2nd first guide RNA, to the polynucleotide sequence of the 3rd first guide RNA, and to the polynucleotide sequence of the 4th first guide RNA.
In one embodiment, the one or more first guide RNA comprises a 1st first guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 1st first guide RNA is complementary to a first protospacer sequence.
In one embodiment, the one or more first guide RNA comprises a 2nd first guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 2nd first guide RNA is complementary to a second protospacer sequence.
In one embodiment, the one or more first guide RNA comprises a 3rd first guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 3rd first guide RNA is complementary to a third protospacer sequence.
In one embodiment, the one or more first guide RNA comprises a 4th first guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 4th first guide RNA is complementary to a fourth protospacer sequence.
In one embodiment, the one or more first guide RNA comprises a 5th first guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 5th first guide RNA is complementary to a fifth protospacer sequence.
In one embodiment, the at least one guide RNA comprises a first guide RNA and a second guide RNA, and wherein the method additionally comprises step d): d) cultivating the host cell at a second temperature allowing one or more second guide RNA of the at least one guide RNA to form a complex with the nuclease at one or more protospacer of the one or more LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one or more LPP, wherein the homology
directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break.
In one embodiment, during step b) there is additionally delivered one or more second polynucleotide comprising in 5’ to 3’ direction: a 5’ homology arm, a second polynucleotide encoding a second polypeptide of interest, and a 3’ homology arm, OR a set of one or more second PCR fragments comprising a second 3’ homology arm, a second 5’ homology arm, and a second polynucleotide encoding a second polypeptide of interest, wherein the 3’ homology arm is identical to the 3’ homology arm of one or more LPP and wherein the 5’ homology arm is identical to the 5’ homology arm of one or more LPP, and wherein during step c) the homology directed repair comprises insertion of the second donor polynucleotide or set of second PCR fragments at the double-strand break.
In another embodiment, during step b) there is additionally delivered one or more first polynucleotide comprising in 5’ to 3’ direction: a second 5’ homology arm, a first polynucleotide encoding the first polypeptide of interest, and a second 3’ homology arm, OR a set of one or more first PCR fragments comprising a second 3’ homology arm, a second 5’ homology arm, and the first polynucleotide encoding the first polypeptide of interest, wherein the second 3’ homology arms and the second 5’ homology arms have less than 100% sequence identity to the 3’ and 5’ homology arms of the at least two LPP, respectively, e.g., less than 99%, less than 98%, less than 97%, less than 96%, less than 95%, less than 90%, less thank 85%, less than 80%, less than 75%, less than 70%, or less than 65%, and wherein during step c) the homology directed repair comprises insertion of the first donor polynucleotide or set of PCR fragments at the double-strand break between a second 3’ homology arm and a second 5’ homology arm.
In one embodiment, the second guide RNA is complementary to one or more first protospacers, wherein during step d) the complex is formed at one or more first protospacer of the one or more LPP, creating a double-strand break at or near the one or more first protospacer.
In one embodiment, the second guide RNA is complementary to one or more second protospacers, wherein during step d) the complex is formed at one or more second protospacer of the one or more LPP, creating a double-strand break at or near the one or more second protospacer.
In one embodiment, the second guide RNA is complementary to one or more third protospacers, wherein during step d) the complex is formed at one or more third protospacer of the one or more LPP, creating a double-strand break at or near the one or more third protospacer.
In one embodiment, wherein the second guide RNA is complementary to one or more fourth protospacers, wherein during step d) the complex is formed at one or more fourth protospacer of the one or more LPP, creating a double-strand break at or near the one or more fourth protospacer.
In one embodiment, the second guide RNA is complementary to one or more fifth protospacers, wherein during step d) the complex is formed at one or more fifth protospacer of the one or more LPP, creating a double-strand break at or near the one or more fifth protospacer.
In one embodiment, during step d) the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of one LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least one copy of the first donor polynucleotide in its genome.
In one embodiment, during step d) the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of two LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the two LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least two copies of the first donor polynucleotide in its genome.
In one embodiment, during step d) the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of three LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the three LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least three copies of the first donor polynucleotide in its genome.
In one embodiment, during step d) the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of four LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the four LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least four copies of the first donor polynucleotide in its genome.
In one embodiment, during step d) the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of five LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the five LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least five copies of the first donor polynucleotide in its genome.
In one embodiment, during step d) the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of six LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the six LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least six copies of the first donor polynucleotide in its genome.
In one embodiment, during step d) the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of 8 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 8 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least 8 copies of the first donor polynucleotide in its genome.
In one embodiment, during step d) the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of 16 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 16 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least 16 copies of the first donor polynucleotide in its genome.
In one embodiment, during step b) there are delivered at least two second guide RNAs, each second guide RNA forming a complex with the nuclease at a first, second, third, fourth, or fifth protospacer.
In one embodiment, wherein the at least two second guide RNAs comprise at least two of: a 1st second guide RNA, a 2nd second guide RNA, a 3rd second guide RNA, a 4th second guide RNA, and a 5th second guide RNA.
In one embodiment, the 1st second guide RNA has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 2nd second guide RNA, to the polynucleotide sequence of 3rd second guide RNA, to the polynucleotide sequence of the 4th second guide RNA, and to the polynucleotide sequence of the 5th second guide RNA.
In one embodiment, the 2nd second guide RNA has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1st second guide RNA, to the polynucleotide sequence of 3rd second guide RNA, to the polynucleotide sequence of the 4th second guide RNA, and to the polynucleotide sequence of the 5th second guide RNA.
In one embodiment, the 3rd second guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1st second guide RNA, to the polynucleotide sequence of 2nd second guide RNA, to the polynucleotide sequence of the 4th second guide RNA, and to the polynucleotide sequence of the 5th second guide RNA.
In one embodiment, the 4th second guide RNA has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1st second guide RNA, to the polynucleotide sequence of 2nd second guide RNA, to the polynucleotide sequence of the 3rd second guide RNA, and to the polynucleotide sequence of the 5th second guide RNA.
In one embodiment, the 5th second guide RNA has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1st second guide RNA, to the polynucleotide sequence of 2nd second guide RNA, to the polynucleotide sequence of the 3rd second guide RNA, and to the polynucleotide sequence of the 4th second guide RNA.
In one embodiment, the one or more second guide RNA comprises a 1st second guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 1st second guide RNA is complementary to a first protospacer sequence.
In one embodiment, the one or more second guide RNA comprises a 2nd second guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 2nd second guide RNA is complementary to a second protospacer sequence.
In one embodiment, the one or more second guide RNA comprises a 3rd second guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 3rd second guide RNA is complementary to a third protospacer sequence.
In one embodiment, the one or more second guide RNA comprises a 4th second guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 4th second guide RNA is complementary to a fourth protospacer sequence.
In one embodiment, the one or more second guide RNA comprises a 5th second guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 5th second guide RNA is complementary to a fifth protospacer sequence.
In one embodiment, the at least one guide RNA comprises a first guide RNA, a second guide RNA, and a third guide RNA, and wherein the method additionally comprises the step e) cultivating the host cell at a third temperature allowing one or more third guide RNA of the at least one guide RNA to form a complex with the nuclease at one or more protospacer of the one or more LPP, and
wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one or more LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break.
In one embodiment, the third guide RNA is complementary to one or more first protospacers, wherein during step e) the complex is formed at one or more first protospacer of the one or more LPP, creating a double-strand break at or near the one or more first protospacer.
In one embodiment, the third guide RNA is complementary to one or more second protospacers, wherein during step e) the complex is formed at one or more second protospacer of the one or more LPP, creating a double-strand break at or near the one or more second protospacer.
In one embodiment, the third guide RNA is complementary to one or more third protospacers, wherein during step e) the complex is formed at one or more third protospacer of the one or more LPP, creating a double-strand break at or near the one or more third protospacer.
In one embodiment, the third guide RNA is complementary to one or more fourth protospacers, wherein during step e) the complex is formed at one or more fourth protospacer of the one or more LPP, creating a double-strand break at or near the one or more fourth protospacer.
In one embodiment, the third guide RNA is complementary to one or more fifth protospacers, wherein during step e) the complex is formed at one or more fifth protospacer of the one or more LPP, creating a double-strand break at or near the one or more fifth protospacer.
In one embodiment, during step e) the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of one LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least one copy of the first donor polynucleotide in its genome.
In one embodiment, during step e) the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of two LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the two LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least two copies of the first donor polynucleotide in its genome.
In one embodiment, during step e) the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of three LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the three LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least three copies of the first donor polynucleotide in its genome.
In one embodiment, during step e) the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of four LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the four LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least four copies of the first donor polynucleotide in its genome.
In one embodiment, during step e) the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of five LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the five LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least five copies of the first donor polynucleotide in its genome.
In one embodiment, during step e) the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of six LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the six LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least six copies of the first donor polynucleotide in its genome.
In one embodiment, during step e) the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of 8 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 8 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least 8 copies of the first donor polynucleotide in its genome.
In one embodiment, during step e) the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of 16 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 16 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with at least 16 copies of the first donor polynucleotide in its genome.
In one embodiment, during step b) there are delivered at least two third guide RNAs, each third guide RNA forming a complex with the nuclease at a first, second, third, fourth, or fifth protospacer.
In one embodiment, the at least two third guide RNAs comprise at least two of: a 1st third guide RNA, a 2nd third guide RNA, a 3rd third guide RNA, a 4th third guide RNA, and a 5th third guide RNA.
In one embodiment, the 1st third guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 2nd third guide RNA, to the polynucleotide sequence of 3rd third guide RNA, to the
polynucleotide sequence of the 4th third guide RNA, and to the polynucleotide sequence of the 5th third guide RNA.
In one embodiment, the 2nd third guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1st third guide RNA, to the polynucleotide sequence of 3rd third guide RNA, to the polynucleotide sequence of the 4th third guide RNA, and to the polynucleotide sequence of the 5th third guide RNA.
In one embodiment, the 3rd third guide RNA has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1st third guide RNA, to the polynucleotide sequence of 2nd third guide RNA, to the polynucleotide sequence of the 4th third guide RNA, and to the polynucleotide sequence of the 5th third guide RNA.
In one embodiment, the 4th third guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1st third guide RNA, to the polynucleotide sequence of 2nd third guide RNA, to the polynucleotide sequence of the 3rd third guide RNA, and to the polynucleotide sequence of the 5th third guide RNA.
In one embodiment, the 5th third guide RNA has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1st third guide RNA, to the polynucleotide sequence of 2nd third guide RNA, to the polynucleotide sequence of the 3rd third guide RNA, and to the polynucleotide sequence of the 4th third guide RNA.
In one embodiment, the one or more third guide RNA comprises a 1st third guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 1st third guide RNA is complementary to a first protospacer sequence.
In one embodiment, the one or more third guide RNA comprises a 2nd third guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 2nd third guide RNA is complementary to a second protospacer sequence.
In one embodiment, the one or more third guide RNA comprises a 3rd third guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 3rd third guide RNA is complementary to a third protospacer sequence.
In one embodiment, the one or more third guide RNA comprises a 4th third guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 4th third guide RNA is complementary to a fourth protospacer sequence.
In one embodiment, the one or more third guide RNA comprises a 5th third guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 5th third guide RNA is complementary to a fifth protospacer sequence.
In one embodiment, the at least one guide RNA comprises a first guide RNA, a second guide RNA, a third guide RNA, and a fourth guide RNA, and wherein the method additionally comprises the step f) cultivating the host cell at a fourth temperature allowing one or more fourth guide RNA of the at least one guide RNA to form a complex with the nuclease at one or more protospacer of the one or more LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one or more LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break.
In one embodiment, the at least one guide RNA comprises a first guide RNA, a second guide RNA, a third guide RNA, a fourth guide RNA, and a fifth guide RNA, and wherein the method additionally comprises the step g) cultivating the host cell at a fifth temperature allowing one or more fifth guide RNA of the at least one guide RNA to form a complex with the nuclease at one or more protospacer of the one or more LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one or more LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break.
In one embodiment, the first temperature is in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
In one embodiment, second temperature is in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
In one embodiment, the third temperature is in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
In one embodiment, the fourth temperature is in the range of 15 - 50 C°, such as in the range of 22 - 42
C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
In one embodiment, the fifth temperature is in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
In one embodiment, the first temperature and the second temperature are in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
In one embodiment, the first temperature, the second temperature, and the third temperature are in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
In one embodiment, the first temperature, the second temperature, the third temperature, and the fourth temperature are in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
In one embodiment, the first temperature, the second temperature, the third temperature, the fourth temperature, and the fifth temperature are in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
In one embodiment, the first temperature differs from the second temperature by at least 0.5 C°, at least 1 C°, at least 1 .5 C°, at least 2 C°, at least 2.5 C°, at least 3 C°, at least 3.5 C°, at least 4 C°, at least 4.5 C°, at least 5 C°, at least 5.5 C°, or at least 6 C°.
In one embodiment, the second temperature differs from the third temperature by at least 0.5 C°, at least 1 C°, at least 1 .5 C°, at least 2 C°, at least 2.5 C°, at least 3 C°, at least 3.5 C°, at least 4 C°, at least 4.5 C°, at least 5 C°, at least 5.5 C°, or at least 6 C°
In one embodiment, the third temperature differs from the fourth temperature by at least 0.5 C°, at least 1 C°, at least 1 .5 C°, at least 2 C°, at least 2.5 C°, at least 3 C°, at least 3.5 C°, at least 4 C°, at least 4.5 C°, at least 5 C°, at least 5.5 C°, or at least 6 C°
In one embodiment, the fourth temperature differs from the fifth temperature by at least 0.5 C°, at least 1 C°, at least 1 .5 C°, at least 2 C°, at least 2.5 C°, at least 3 C°, at least 3.5 C°, at least 4 C°, at least 4.5 C°, at least 5 C°, at least 5.5 C°, or at least 6 C°
In one embodiment, the first temperature is selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
In one embodiment, the second temperature is selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
In one embodiment, the third temperature is selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
In one embodiment, the fourth temperature is selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
In one embodiment, the fifth temperature is selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
In one embodiment, the first temperature and the second temperature are selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°..
In one embodiment, the first temperature, the second temperature, and the third temperature are selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
In one embodiment, the first temperature, the second temperature, the third temperature, and the fourth temperature are selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
In one embodiment, the first temperature, the second temperature, the third temperature, the fourth temperature, and the fifth temperature are selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
In one embodiment, the first temperature is in the range of 22 - 40 C°, and the second temperature is in the range of 35 - 50 C°.
In one embodiment, the first donor polynucleotide does not comprise a selectable marker or counter- selectable marker.
In one embodiment, the first donor polynucleotide comprises a selectable or counter-selectable marker.
In one embodiment, during step b) there is furthermore delivered a second donor polynucleotide comprising in 5’ to 3’ direction: a 5’ homology arm, a second polynucleotide encoding a second polypeptide of interest, and a 3’ homology arm, wherein the 3’ homology arm is identical to the 3’ homology arm of the host cell, and wherein the 5’ homology arm is identical to the 5’ homology arm of the host cell.
In one embodiment, the first polypeptide of interest comprises a brazzein, a casein, a patatin, an ovalbumin, an osteopontin, an ovotransferrin, an ovomucin, an ovomucoid, an ovostatin, a glycomacropeptide, a lactoferrin, an alpha-lactalbumin, e.g., bovine alpha-lactalbumin, a beta-lactalbumin and/or a collagen.
In one embodiment, the first polypeptide of interest comprises a therapeutic polypeptide selected from the group consisting of an antibody, an antibody fragment, an antibody-based drug, a Fc fusion protein, an anticoagulant, a blood factor, a bone morphogenetic protein, an engineered protein scaffold, an enzyme,
a growth factor, a blood clotting factor, a hormone, an interferon (such as an interferon alpha-2b), an interleukin, a lactoferrin, an alpha-lactalbumin, a beta-lactalbumin, an ovomucoid, an ovostatin, a cytokine, an obestatin, a human galactosidase (such as an human alpha-galactosidase A), a vaccine, a protein vaccine, and a thrombolytic.
In one embodiment, first polypeptide of interest comprises an enzyme.
In one embodiment, the enzyme is selected from the list of a hydrolase, isomerase, ligase, lyase, oxidoreductase, or transferase, e.g., an aminopeptidase, amylase, carbohydrase, carboxypeptidase, catalase, cellobiohydrolase, cellulase, chitinase, cutinase, cyclodextrin glycosyltransferase, deoxyribonuclease, endoglucanase, esterase, alpha-galactosidase, beta-galactosidase, glucoamylase, alpha-glucosidase, beta-glucosidase, invertase, laccase, lipase, mannosidase, mutanase, oxidase, pectinolytic enzyme, peroxidase, phytase, polyphenoloxidase, proteolytic enzyme, ribonuclease, transglutaminase, xylanase, or beta-xylosidase.
In one embodiment, the second polypeptide of interest comprises a polypeptide selected from the list of a transcription factor (e.g. amyR), a chaperone (e.g. BipA, PdiA, or PdiB), a heme pathway enzyme (e.g. encoded by hemA, or hemB), or a glycosylation pathway enzyme (e.g. a Mannosidase I, a GIcNAc transferase, a galactosyltransferase, a sialyltransferase, an oligosaccharyltransferase, or an glucosidase).
In one embodiment, the method additionally comprises step h), subsequent of any of steps c), d), e), f) or g):
- providing a host cell comprising a first polynucleotide in one or more LPP, said host cell being generated with the method of any of the previous paragraphs, and said host cell comprising at least one LPP not comprising the first polynucleotide (= empty LPP),
- delivering to the host cell a RNA-guided DNA nuclease, and at least one guide RNA directed to one or more protospacer of one or more empty LPP, and
- cultivating the host cell at a temperature allowing the at least one guide RNA to form a complex with the nuclease at the one or more empty LPP, wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one or more empty LPP, wherein the homology directed repair comprises insertion of the first polynucleotide at the doublestrand break of the empty LPP.
Methods of Production
In a third aspect, the present invention relates to a method of producing a polypeptide of interest, comprising cultivating a multi-copy host cell under conditions conducive for production of the polypeptide, said multi-copy host cell being generated by the method of the second aspect.
In one embodiment, the method further comprises recovering the polypeptide of interest.
The host cell is cultivated in a nutrient medium suitable for production of the polypeptide using methods known in the art. For example, the cell may be cultivated by shake flask cultivation, or small-scale
or large-scale fermentation (including continuous, batch, fed-batch, or solid-state, and/or microcarrierbased fermentations) in laboratory or industrial fermentors in a suitable medium and under conditions allowing the polypeptide to be expressed and/or isolated. Suitable media are available from commercial suppliers or may be prepared according to published compositions (e.g., in catalogues of the American Type Culture Collection). If the polypeptide is secreted into the nutrient medium, the polypeptide can be recovered directly from the medium. If the polypeptide is not secreted, it can be recovered from cell lysates.
The polypeptide may be detected using methods known in the art that are specific for the polypeptide, including, but not limited to, the use of specific antibodies, formation of an enzyme product, disappearance of an enzyme substrate, or an assay determining the relative or specific activity of the polypeptide.
The polypeptide may be recovered from the medium using methods known in the art, including, but not limited to, collection, centrifugation, filtration, extraction, spray-drying, evaporation, or precipitation. In one aspect, a whole fermentation broth comprising the polypeptide is recovered. In another aspect, a cell-free fermentation broth comprising the polypeptide is recovered.
The polypeptide may be purified by a variety of procedures known in the art to obtain substantially pure polypeptides and/or polypeptide fragments (see, e.g., Wingfield, 2015, Current Protocols in Protein Science; 80(1): 6.1.1-6.1.35; Labrou, 2014, Protein Downstream Processing, 1129: 3-10).
In an alternative aspect, the polypeptide is not recovered.
The present invention is further described by the following examples that should not be construed as limiting the scope of the invention.
Examples
Materials and Methods
Unless otherwise stated, DNA manipulation and transformation were performed using standard methods of molecular biology as described in Sambrook et al. (1989) Molecular cloning: A laboratory 20 manual, Cold Spring Harbor lab., Cold Spring Harbor, NY; Ausubel, F. M. et al. (eds.) "Current protocols
in Molecular Biology", John Wiley and Sons, 1995; Harwood, C. R., and Cut-ting, S. M. (eds.) "Molecular Biological Methods for Bacillus". John Wiley and Sons, 1990.
Purchased material (E.coli and kits)
E.coli HST08 (Clontech) is used for plasmid construction and amplification. Amplified plasmids are recovered with Qiagen Plasmid Midi or Mini Kit (Qiagen). Ligation is done with NEBuilder HiFi DNA Assembly Cloning Kit (NEB) according to the manufactory instructions. GFX™ PCR DNA and Gel Band Purification Kit (Merck) is used for the purification of PCR fragments and extraction of DNA fragment from agarose gel.
Amplified plasmids were recovered with Qiagen Plasmid Kit (Qiagen). DNA fragments were gel purified using the Qiagen MinElute Gel Extraction kit (Qiagen). Ligation reactions were carried out using the NEBUILDER@ HiFi DNA Assembly Cloning Kit (New England Biolabs Inc.) according to the manufacturer's instructions. Polymerase Chain Reaction (PCR) was carried out with Phusion@ DNA Polymerase (Thermo Fisher Scientific). Sanger DNA sequencing of plasmid DNA was performed by GeneWiz from Azenta Life Science (GENEWIZ Germany GmbH, Germany). Sequence analysis was performed with the SnapGene version 6.1 .0 (GSL Biotech LLC, United States). PCR from genomic DNA was carried out using Thermo Scientific Phusion@ DNA Polymerase (Thermo Scientific).
Enzymes for DNA manipulations (e.g. restriction endonucleases, ligases etc.) were obtained from New England Biolabs, Inc. and were used according to the manufacturer's instructions.
DNA fragments
All DNA fragments used in this study was made synthetically by Integrated DNA Technologies (IDT).
Transformation of E. coli
E. coli chemically competent host strain was incubated with DNA for 20 minutes on ice. The solution was then transferred to 42 degrees for 2 minutes and 20x Volume of SOC medium was added. The solution was then incubated at 37 degrees for 1 hour at 700 rpm before spreading on LB + Ampicillin plates.
Transformation of Aspergillus oryzae
Transformation of Aspergillus species can be achieved using the general methods for yeast transformation.
Aspergillus oryzae host strain was inoculated in 100 ml of Sucrose medium (supplemented with 10 mM uridine if strain is pyrG-) and incubated for 16 h at 30°C at 200 rpm. Pellets were collected and washed with 0.6 M MgSO4, and resuspended in 10 ml 1.2 M MgSO4 containing a commercial beta- glucanase product (GLUCANEX™, Novozymes A/S, Bagsvaerd, Denmark) at a final concentration of 40 mg per ml. The suspension was incubated at 30°C at 50 rpm until protoplasts were formed, and then washed twice with STC buffer. The protoplasts were resuspended to a final concentration of 1x107 protoplasts/ml. Approximately 1 pg of guide plasmid and 1 .5 ug of repair DNA was added to 100 pl of the protoplast suspension, mixed gently, and incubated 5 minutes at Room Temperature. 400ul of PEG
solution was added and the protoplast suspension was incubated for 20 minutes at Room Temperature. After the addition of 8 ml of 50°C Sucrose Agar, the reaction was poured onto Sucrose Agar plates and the plates were incubated at 30°C for 4 days.
Media
COVE trace metals solution was composed of 0.04 g of NaB4O7«10H2O, 0.4 g of CuSO4«5H2O, 1.2 g of FeSO4-7H2O, 0.7 g of MnSO4-H2O, 0.8 g of Na2MoO2-2H20, 10 g of ZnSO4-7H2O, and deionized water to 1 liter.
50X COVE salts solution was composed of 26 g of KCI, 26 g of MgSO4«7H2O, 76 g of KH2PO4, 50 ml of COVE trace metals solution, and deionized water to 1 liter.
Sucrose Medium was composed of 342.3 g of sucrose, 20 ml of 50X COVE salts solution, 10 ml of 1 M Urea, and deionized water to 1 liter.
Sucrose Agar were composed of 342.3 g of sucrose, 20 ml of 50X COVE salts solution, 10 ml of 1 M Urea, 20g of granulated agar, and deionized water to 1 liter. If bar selection is used, Basta (Bayer A/S) is added to a final concentration of 0.11 % glufosinat (w/V).
STC buffer was composed of 0.8 M sorbitol, 25 mM Tris pH 8, and 25 mM CaCI2.
PEG solution was composed of 60% PEG 4000 in STC buffer.
LB plus Ampicillin plates were composed of 10 g of tryptone, 5 g of yeast extract, 5 g of sodium chloride, 15 g of Bacto agar, ampicillin at 150 pg per ml, and deionized water to 1 liter.
SOC medium was composed of 20 g of tryptone, 5 g of yeast extract, 0.5 g of NaCI, 10 ml of 250 mM KCI, and deionized water to 1 liter.
COVE Urea Zeocin plates were composed of 342.3 g of sucrose, 20 ml of COVE salts solution, 10 ml 1 M urea, 25 g of Noble agar, and deionized water to 1 liter. The pH was adjusted to pH 6.5. After sterilisation, 1 mL 100 mg/mL Zeocin (GibcoTM, Thermo Fisher Scientific).
PDA plates were composed of 39 g of potato dextrose agar (Difco) and deionized water to 1 liter.
PEG+G buffer was composed of 60% polyethylene glycol (PEG) 4000, 20% w/v glucose, 10 mM Tris-HCI pH 7.5, and 10 mM CaC12 in deionized water. The solution is filter sterilized.
STC+G was composed of 1 M sorbitol, w/v glucose, 10 mM Tris pH 7.5, and 10 mM CaC12 in deionized water.
STC was composed of 1 M sorbitol, 10 mM Tris pH 7.5, and 10 mM CaC12 in deionized water.
YPD medium was composed of 1 % yeast extract, 2% peptone, and 2% glucose in deionized water.
Microbial strains
Trichoderma reesei strain SAMF128-2A 11-1 is described in WO 20112911 .
ddPCR (Digital Droplet PCR) for copy number determination
The ddPCR assays was run according to protocols available for the QX200 Bio-Rad ddPCR system. DNA was prepared by inoculating fungal spores in water added glass beads shaking vigorously for 2 minutes. An aliquot of DNA extract was transferred to the in-digest ddPCR solution. For digestion of DNA, one of the four restriction enzymes Msel, Hindlll, Alul or Haelll was used according to the suggestions by Bio-Rad for use in ddPCR assays. Probe assays (mix of fluorescent probe and PCR primers) were acquired from Integrated DNA Technologies (IDT). The assays were designed using the PrimerQuest™ Tool supplied by IDT. Droplets were made using the Bio-Rad QX200 AutoDG Droplet Digital PCR System. PCR was run using a Bio-Rad C1000 Touch Thermal Cycler with a 96-well module. Results were obtained using the QX200 droplet reader and results were analyzed using the QuataSoft™ version 1 .7 software. All copy number results were obtained using an internal standard, which was either the wA gene or the prpA gene.
Generation of Trichoderma reesei protoplasts
Protoplast preparation and transformation of Trichoderma reesei were performed using a protocol similar to Penttila et al., 1987, Gene 61 : 155-164. Briefly, T. reesei was cultivated in two shake flasks, each containing 25 mL of Y PD medium, at 27°C for 17 hours with gentle agitation at 90 rpm. Mycelia were collected by filtration using a Vacuum Driven Disposable Filtration System (Millipore) and washed twice with deionized water and twice with 1 .2 M sorbitol. Protoplasts were generated by suspending the washed mycelia in 30 mL of 1.2 M sorbitol containing 5 mg of YATALASE TM (Takara Bio USA, Inc.) per ml and 0.5 mg of chitinase (Sigma Chemical Co.) per ml for 60-75 minutes at 34°C with gentle shaking at 90 rpm. Protoplasts were collected by centrifugation at 834 x g for 7 minutes and washed twice with cold 1 .2 M sorbitol. The protoplasts were counted using a hemocytometer and re-suspended to a final concentration of 1 x 10 8 protoplasts per ml of STC. Aliquots (1 .1 ml) of the protoplast solution were placed in a Mr. FROSTY TM freezing container (Thermo Fisher Scientific) at -800 C for later use (as described in W020123845).
Trichoderma Transformation
Transformation of Trichoderma species can be achieved using the general methods for yeast transformation. The preferred procedure for this invention is described below. Approximately 1 pg of D27XZX or D278ZE plasmid DNA and 1 ug of plasmid D26V2Q DNA were combined (and added to 100 pl of the protoplast suspension of strain TT540 and then mixed gently. Then 250 pl PEG+G was added to the DNA-protoplast mixture, mixed gently and incubated at 34°C for 30 minutes. One ml of STC+G was added, the protoplast suspension was mixed gently and poured onto Cove Urea Zeocin agar plates or PDA agar plates (with overlay with Hygromycin). The plates were incubated at 30°C for 8-10 days.
Transformants were picked to PDA agar and incubated at 30°C for 5-7 days. The strains were spore purified by diluting spores from the PDA agar plates in water and spreading onto Cove agar for a second round of selection (as described in W020123845).
A portion of the D27XZX or D278ZE plasmid DNA was integrated into the genome by homologous recombination using the 5' and 3' flanks of the 70883 locus contained in the plasmids. The plasmid sequence between these 70883 homology flanks was integrated into the genome replacing the 70883
coding sequence. Transformants were selected for using the amdS selection marker contained between the 70883 flanking sequence within the plasmid. The resulting strains with the 70883 replacement and integrated intervening plasmid DNA sequence were named 0184PQ 5 (D27XZX in 016VA2), 01792Q (D278ZE in 016VA2), 0253QJ (D27XZX in 0154NN) and 016E5W (D278ZE in 0154NN).
Example 1. Generation of recipient SPADE Aspergillus host cell (synthetic jsrotospacer array defined entry)
A landing pad poynucleotide (LPP) is a DNA sequence consisting of three components, a protospacer array sequence consisting of one or more CRISPR target sites flanked by a left sequence and a right sequence. The two flanking sequences are used for integration of a repair construct by homologous recombination resulting in repair of the CRISPR guide facilitated chromosomal breaks in the landing pad.
For the design of the landing pads in this experiment, the left flank, named SYN2L (SEQ ID NO: 1), consists of 1000 base pairs (bp), i.e., having a length of 1000 nucleotides (nt). The SYN2L sequence was randomly generated by a python script setting the AT content to 52%. The right flank was named SYN2R (SEQ ID NO: 2), which is a 1000 bp sequence.
For this experiment, the protospacer array consists of five protospacer sequences. The protospacer sequences for the arrays were randomly generated by a python script setting the AT content to 52% on average. Moreover, the protospacers had all been tested to function at 30 degrees Celsius using MAD7 as the CRISPR enzyme. As the 5’-PAM sequence, the sequence TTTC was chosen for all protospacers in the protospacer arrays. The location of the PAM sequence for each protospacer is shown in Figure 3.
As shown in Table 2 and in Figure 1 , the four protospacer arrays of the four LPPs, all contain two first protospacer sequences #1 A and #1 B. One first protospacer sequence #1 A is PS-59 (SEQ ID NO: 3) which is present in all four protospacer arrays, and another first protospacer sequence #1 B is PS-73 (SEQ ID NO: 4) which is also present in all four protospacer arrays.
Moreover, each of the protospacer arrays contained three additional protospacers: a second protospacer (#2), a third protospacer (#3), and a fourth protospacer (#4). These protospacer sequences are either present in the protospacer arrays of three landing pads in total, two landing pads in total or are unique to the respective protospacer arrays. For an overview of the presence of protospacer sequences for each of the four landing pads LPP#1-LPP#4 see Figure 1 , and below Table 2.
For building the Aspergillus oryzae recipient strain, four chromosomal loci were chosen, which are present on four different chromosome arms to ensure genetic stability in the experiment. We chose the pepAb locus on chromosome 1 , the pepE locus on chromosome 2, the amdS locus on chromosome 6 and the pepAa locus on chromosome 8.
The four LPPs were inserted at the four chromosomal loci using four CRISPR guide RNAs targeting each of the four loci as well as four repair fragments, which contained the four LPPs flanked by loci specific repair sequences. While integrating the LPPs, the loci specific genes were deleted thereby removing the guide targeting sequence.
The resulting Aspergillus oryzae recipient strain AT5885 containing the four LPPs at the four different chromosomal loci was confirmed by genome sequencing. The recipient strain AT5885 is ApyrG and therefore requires supplementation of uridine to be able to grow.
The full sequences of the four protospacer arrays have been listed as:
- SEQ ID NO: 14 (LPP#1): the five protospacers PS-59, PS-73 PS-74, PS-75 and PS-76 for integration in the pepAb locus;
- SEQ ID NO: 15 (LPP#2): the five protospacers PS-59, PS-73, PS-74, PS-75 and PS-11 for integration in the pepE locus;
- SEQ ID NO: 16 (LPP#3): the five protospacers PS-59, PS-73, PS-48, PS-49 and PS-93 for integration in the amdS locus
- SEQ ID NO: 17 (LPP#4): the five protospacers PS-59, PS-73, PS-48, PS-89 and PS-94 for integration in the pepAa locus.
Example 2. One-step integration of 4 gene copies into 4 landing pad polynucleotides (LPP) using one gRNA sequence
The recipient strain AT5885 comprising 4 LPPs was made transformable by protoplasting mycelia.
Two CRISPR guide RNA (gRNA) plasmids were built (pAT4859 and pAT4873), each containing a gRNA which enables cutting of all four LPPs in the recipient strain. pAT4859 comprises a first gRNA (SEQ ID NO:18) enabling cutting at the protospacer sequence PS-59, and pAT4873 comprises a first gRNA (SEQ ID NO: 19) enabling cutting at the protospacer sequence PS-73. Both guide RNA plasmids carry the pyrG gene for maintenance in the recipient strain upon transformation. An overview of designed gRNAs is shown in Table 3.
A repair DNA construct (= first donor polynucleotide), REPAIR"! , was built, which contained a sequence of interest flanked by the repair flanks SYN2L (SEQ ID NO: 1) and SYN2R (SEQ ID NO: 2).
In Table 3 (right column), upper case letters in the guide RNA represent the tracrRNA (transactivating CRISPR RNA), whereas lower case letters represent the crRNA (CRISPR RNA), which recognizes the protospacer in the target DNA.
Protoplasts of AT5885 were transformed with pAT4859 and repair DNA and transformants were selected on minimal plates without addition of uridine. The transformation efficiency was 100 transformants I ug of repair DNA. 20 transformants were spore reisolated twice and the copy number was determined by ddPCR. 75% of the strains had the expected four copies of the gene of interest, i.e. one copy per LPP. Moreover, ddPCR was used to verify that the protospacer arrays in the four LPPs were no longer present in those 75% correct transformants, which indicates that the repair and insertion was successful.
Similar results were obtained using the guide plasmid pAT4873 and the same repair DNA construct. Thus, we conclude that the SPADE technology can be used for efficient and controlled 1 -step integration of multiple copies into multiple landing pads.
Example 3. One-step integration of 3 gene copies into 3 LPPs, using 2 different gRNA sequences
AT5885, comprising 4 LPPs, was made pyrG+ resulting in strain AT6412.
One CRISPR guide RNA plasmid was built (pAT6716), which enables cutting of three of the four LPPs in the recipient strain. pAT6716 comprises gRNA sequences (SEQ ID NO: 20) that allow DNA cutting at the protospacer sequence PS-75 (present in LPP#1 and LPP#2), and guide RNA (SEQ ID NO: 21) enabling cutting at the protospacer sequence PS-94 (present in LPP#4). The cutting of the three LPPs is indicated in Figure 2a. The guide RNA plasmid carries the bar gene for maintenance in the recipient strain upon transformation on BASTA plates.
A repair DNA construct (=first donor polynucleotide), REPAIR1 , was built. REPAIR1 contained a sequence of interest/gene of interest (GOI) flanked by the repair flanks (homology arms) SYN2L (SEQ ID NO: 1) and SYN2R (SEQ ID NO: 2). The GOI was subject to integration via homology directed repair at the sites were double strand breaks were introduced, see Figure 2b.
Protoplasts of AT6412 were transformed with pAT6716 and repair DNA construct REPAIR1 .Transformants were selected on minimal plates without addition of uridine. The transformation efficiency was 100+ transformants I ug of repair DNA. 20 transformants were spore reisolated twice and the copy number was determined by ddPCR. 80% of the strains had the expected three copies of the gene of interest, i.e., the sequence of interest was present in LPP#1 , LPP#2, and LPP#4 (see Figure 2c). Moreover, ddPCR was used to verify that the protospacer arrays in the three LPPs were no longer present in those 80% correct transformants, which indicates that the repair and insertion was successful. Moreover, it was verified that the fourth landing pad (LPP#3) was intact. Thus, the SPADE technology proves to be efficient for targeted integration into multiple landing pads using different guide RNA sequences. The shown one-step multiplexing approach allows controlled dosing of copy numbers during strain construction.
Example 4. Multi-step integration of gene copies into multiple LPPs
Example 4 is an example for ’’step h)” of the method according to the second aspect of the invention.
The AT6412 host strain comprising 4 LPPs (generated in Example 1) was used for this experiment.
A DNA repair fragment, REPAIR2 (= first donor polynucleotide), was designed containing an expression cassette encoding a protein of interest to which Aspergillus oryzae reacts negative/sensible, e.g. the host cells show reduced viabilities after transformation. Thus, the method described in example 2 is not preferred for making multi-copy strains for such proteins of interest.
A guide plasmid, pAT6654, comprising a gRNA sequence (SEQ ID NO: 20) enabling cutting of PS- 75 present in two (LPP#1 and LPP#2) of the four LPPs in AT6412 was made.
A guide plasmid, pAT6657, comprising a gRNA sequence (SEQ ID NO: 21) enabling cutting of PS- 94 present in one (LPP#4) of the four LPPs in AT6412 was made.
A guide plasmid, pAT6327, comprising a gRNA sequence (SEQ ID NO: 18) enabling cutting of PS- 59 present in all four of the landing pads in AT6412 was made.
In step 1 , the guide plasmid pAT6654 was transformed in AT6412 together with small amounts of REPAIR2 DNA, so the cells could survive the exposure. Transformants were spore isolated and strains having two copies of REPAIR2 integrated, one copy into LPP#1 and one copy into LPP#2, were characterized using ddPCR to determine the copy number and the integrity of the two other landing pads were determined as well using ddPCR. The final 2-copy strain was named AT6712.
The subsequent steps 2a and 2b were carried out in separate experiments, resulting in a 3-copy strain and in a 4-copy strain, respectively.
In step 2a, the guide plasmid pAT6557, cutting one of two remaining “empty” landing pads (LPP#4), was transformed in AT6712. Transformants were spore isolated and strains having three copies of REPAIR2 integrated were characterized by ddPCR. Correct transformants tested had the REPAIR2 fragment copied from one of the already occupied loci into the third locus LPP#4, resulting in a 3-copy strain.
In step 2b, the guide plasmid pAT6327, cutting both of the two remaining “empty” landing pads (LPP#3 and LPP#4), was transformed in AT6712. Transformants were spore isolated and strains having four copies of REPAIR2 integrated were characterized by ddPCR. Correct transformants tested had the REPAIR2 fragment copied from the already occupied loci to both loci LPP#3 and LPP#4, resulting in a 4- copy strain.
Normally, for generation of multicopy strains in one step, the host cells are transformed with high amount of DNA to increase integration efficiency and to secure integration into all desired landing pads. This high DNA load can be toxic for some genes of interest and/or for some host cells. Using the SPADE technology of example 4, multi-copy strains can also be generated for genes of interest which are toxic for the cells, as the presented two-step approach can keep the exposure to the toxic DNA at a minimum.
Example 5. Possible arrangements for protospacer presence in up to 16 landing pads.
Table 4 shows exemplary arrangements for protospacers in up to 16 landing pads.
Example 1 of Table 4 shows a host cell with 16 landing pads (LPP#1 - LPP#16). Each LPP comprises between 1-5 protospacer sequences. Protospacer #1 is present in only one LPP. Protospacer #2 is present in two LPPs. Protospacer #3 is present in 4 LPPs. Protospacer #4 is present in 8 LPPs, and protospacer #5 is present in all 16 LPPs. Depending on how many copies of the gene of interest shall be incorporated into the host cell, a suitable protospacer can be targeted.
Example 2 of Table 4 shows a host cell with 16 landing pads (LPP#1 - LPP#16). Each LPP comprises between 1-5 protospacer sequences. Protospacer #1 is present in only one LPP. Protospacer #2 is present in 4 LPPs. Protospacer #3 is present in 6 LPPs. Protospacer #4 is present in 10 LPPs, and protospacer #5 is present in all 16 LPPs. Depending on how many copies of the gene of interest shall be incorporated into the host cell, a suitable protospacer can be targeted.
These examples clearly show how versatile the LPPs can be designed to provide a flexible SPADE host cell which can be used for multi-copy integration, either in a one step approach, or in a multi-step approach.
Example 6. Generation of recipient SPADE Trichoderma host cell
A landing pad polynucleotide (LPP) is a DNA sequence consisting of three components, a protospacer array sequence consisting of one or more CRISPR target sites flanked by a left sequence and a right sequence. The two flanking sequences are used for integration of a repair construct by homologous recombination resulting in repair of the CRISPR guide facilitated chromosomal breaks in the landing pad.
For the design of the landing pads in this experiment, the left flank, named SYN2L (SEQ ID NO: 1), consists of 1000 base pairs (bp), i.e., having a length of 1000 nucleotides (nt). The SYN2L sequence was randomly generated by a python script setting the AT content to 52%. The right flank was named SYN2R (SEQ ID NO: 2), which is a 1000 bp sequence.
For this experiment, the protospacer array consists of five protospacer sequences. The protospacer sequences for the arrays were randomly generated by a python script setting the AT content to 52% on average. Moreover, the protospacers had all been tested to function at 30 degrees Celsius using MAD7 as the CRISPR enzyme. As the 5’-PAM sequence, the sequence TTTC was chosen for all protospacers in the protospacer arrays. The location of the PAM sequence for each protospacer is shown in Figure 3.
As shown in Table 2 and in Figure 1 for the Aspergillus cells, the four protospacer arrays of the four LPPs in Trichoderma cells all contain two first protospacer sequences #1A and #1 B. One first protospacer sequence #1A is PS-59 (SEQ ID NO: 3) which is present in all four protospacer arrays, and another first protospacer sequence #1 B is PS-73 (SEQ ID NO: 4) which is also present in all four protospacer arrays.
Moreover, each of the protospacer arrays contained three additional protospacers: a second protospacer (#2), a third protospacer (#3), and a fourth protospacer (#4). These protospacer sequences are either present in the protospacer arrays of three landing pads in total, two landing pads in total or are unique to the respective protospacer arrays. For an overview of the presence of protospacer sequences for each of the four landing pads LPP#1-LPP#4 see Table 5.
For building the Trichoderma reesei recipient strains, four chromosomal loci were chosen, which are present on four different chromosome arms to ensure genetic stability in the experiment. We chose the eg1 locus on chromosome 2, the tf37062 locus on chromosome 4, the tf92949 locus on chromosome 3 and the xyn2 locus on chromosome 2.
The four LPPs were inserted at the four chromosomal loci using four CRISPR guide RNAs targeting each of the four loci as well as four repair fragments, which contained the four LPPs flanked by loci specific repair sequences. While integrating the LPPs, the loci specific genes were deleted thereby removing the guide targeting sequence.
The resulting Trichoderma reesei recipient strain TT540 containing the four LPPs at the four different chromosomal loci was confirmed by digital droplet PCR.
The four Trichoderma protospacer arrays have been listed as:
- LPP#T1 : the five protospacers PS-59, PS-73, PS-89, PS-90 and PS-48 for integration in the eg1 locus;
- LPP#T2: the five protospacers PS-59, PS-73, PS-89, PS-94 and PS-48 for integration in the tf37062 locus;
- LPP#T3: the five protospacers PS-59, PS-73, PS-74, PS-75 and PS-76 for integration in the tf92949 locus
- LPP#T4: the five protospacers PS-59, PS-73, PS-74, PS-79 and PS-80 for integration in the xyn2 locus.
Next to host TT540 with four LPP’s, we also constructed a host with only one LLP, i.e. TT499 having the single LLP in the xyn2 locus.
Example 7. One-step integration of 4 gene copies into 4 landing pad polynucleotides (LPP) using one gRNA sequence
The recipient Trichoderma reesei strain TT540 comprising 4 LPPs was made transformable by protoplasting mycelia. One CRISPR guide RNA (gRNA) plasmids was built (pTT397) containing a gRNA which enables cutting of all four LPPs in the recipient strain. The plasmid comprised the first gRNA (SEQ ID NO:18) enabling cutting at the protospacer sequence PS-59. Plasmid pTT397 carries the hygromycin marker-gene for maintenance in the recipient strain upon transformation. An overview of designed gRNAs is shown in Table 6.
A repair DNA construct (= first donor polynucleotide), REPAIRS, was built, which contained a sequence of interest flanked by the repair flanks SYN2L (SEQ ID NO: 1) and SYN2R (SEQ ID NO: 2).
Table 6.
In Table 6 (right column), upper case letters in the guide RNA represent the tracrRNA (transactivating CRISPR RNA), whereas lower case letters represent the crRNA (CRISPR RNA), which recognizes the protospacer in the target DNA.
Protoplasts of TT540 were transformed with pTT397 and repair DNA (Ascl-digested pTT391 ; EcoL1028-lysozyme; WO2021202479-A1) and transformants were selected on PDA plates with hygromycin addition. Ten transformants were spore reisolated twice and the copy number was determined by ddPCR. 75% of the transformants had the expected four copies of the gene of interest, i.e. one copy per LPP.
Similar results were obtained using the guide plasmid pTT397 and another repair DNA construct (Ascl-digested pTT508; Af BX beta-xylosidase; WO2022127892-A1). Thus, we conclude that the SPADE technology can be used for efficient and controlled 1-step integration of multiple copies into multiple landing pads in T. reesei.
Example 8. One-step multiplex integration of two different genes into 4 LPPs, using one gRNA sequence
In this example, we transformed Trichoderma reesei recipient strain TT540 with guide plasmid pTT397 and applied two repair DNA constructs (=first donor polynucleotide), REPAIR4 and REPAIRS. REPAIR4 contained a first sequence of interest/gene of interest (GOI), i.e. pTT391 : EcoL1028 lysozyme (WO2021202479-A1), flanked by the homology arms SYN2L (SEQ ID NO: 1) and SYN2R (SEQ ID NO: 2). REPAIRS contained a second sequence of interest/gene of interest (GOI), i.e. pTT508: AfBX beta- xylosidase (WO2022127892-A1), flanked by the homology arms SYN2L (SEQ ID NO: 1) and SYN2R (SEQ ID NO: 2. 5-10 transformants were spore reisolated twice and the copy number for each gene of interest was determined by ddPCR. In all transformants the four LLPs were occupied with the genes of interest. We obtained varying copy number ratios of lysozyme versus beta-xylosidase, containing the combinations: zero to four (0:4), one to three (1 :3), and three to one (3:1).
Thus, the SPADE technology proves to be efficient for targeted integration into multiple landing pads using different GOI sequences. The shown one-step multiplexing approach allows easy combination of GOIs in strain construction.
Example 9. Efficiency of applied flank length for one-step integration of one gene copy into 1 LPPs using one gRNA sequence
The Trichoderma reesei recipient strain TT499 comprising 1 LPP was made transformable by protoplasting mycelia. One CRISPR guide RNA (gRNA) plasmids was built (pTT505) containing a gRNA which enables cutting the LPP in the recipient strain. The plasmid comprised the first gRNA (SEQ ID NO:18) enabling cutting at the protospacer sequence PS-59. Plasmid pTT505 carries the zeocin marker-gene for maintenance in the recipient strain upon transformation. An overview of designed gRNAs is shown in Table 6.
In this example, 7 repair DNA constructs (= first donor polynucleotide), REPAIR6-12, were built, which each contained a sequence of interest flanked by the repair flanks SYN2L (SEQ ID NO: 1) and SYN2R (SEQ ID NO: 2). The repair flanks (homology arms) are varying in length from 1000, 750, 500, 250, 200, 150, to 100-bp, being shortened at the outside.
The gene of interest is EcoL1028 lysozyme (WO2021202479-A1). Protoplasts of TT499 were transformed with pTT505 and the seven different repair DNAs and transformants were selected on Cove plates supplemented with urea and zeocin. For each transformation applying the various repair constructs, we obtained transformants. The transformants obtained from each transformation were spore reisolated twice and the copy number was determined by ddPCR. All transformants had the expected single copy of the gene of interest. This example shows that SPADE integration can be performed using minimal 100-bp flanks.
Example 10. One step generation of a 4-copy lipase strain using a single gRNA sequence
For building the Aspergillus oryzae recipient strain, four chromosomal loci were chosen, which are present on four different chromosome arms to ensure genetic stability in the experiment. We chose the pepAb locus on chromosome 1 , the pepE locus on chromosome 2, the amdS locus on chromosome 6 and the pepAa locus on chromosome 8.
The four LPPs were inserted at the four chromosomal loci using four CRISPR guide RNAs targeting each of the four loci as well as four repair fragments, which contained the four LPPs flanked by loci specific repair sequences. While integrating the LPPs, the loci specific genes were deleted thereby removing the guide targeting sequence.
The resulting Aspergillus oryzae recipient strain A2F0012 containing the four LPPs at the four different chromosomal loci was confirmed by genome sequencing. The recipient strain A2F0012 is ApyrG and therefore requires supplementation of uridine to be able to grow.
The full sequences of the four protospacer arrays have been listed as:
- SEQ ID NO: 14 (LPP#1): the five protospacers PS-59, PS-73, PS-74, PS-75 and PS-76 for integration in the pepAb locus;
- SEQ ID NO: 15 (LPP#2): the five protospacers PS-59, PS-73, PS-74, PS-75 and PS-11 for integration in the pepE locus;
- SEQ ID NO: 16 (LPP#3): the five protospacers PS-59, PS-73, PS-48, PS-49 and PS-93 for integration in the amdS locus
- SEQ ID NO: 17 (LPP#4): the five protospacers PS-59, PS-73, PS-48, PS-89 and PS-94 for integration in the pepAa locus. DNA fragments for transformation
All DNA fragments used in this study were PCR amplified from gDNA and confirmed by sanger sequenced. The repair DNA construct (= first donor polynucleotide), REPAIR13, was built containing a sequence of interest flanked by the repair flanks SYN2L (SEQ ID NO: 1) and SYN2R (SEQ ID NO: 2).
Transformation of Aspergillus oryzae
Protoplast of the strain A2F0012 (pyrG-) containing LPP #1 , #2, #3 and #4 was transformed by using CRISPR guide RNA (gRNA) pAT4859 containing a gRNA which enables cutting of all four LPPs in the recipient strain. pAT4859 comprises a gRNA (SEQ ID NO:18) enabling cutting at the protospacer sequence PS-59 present on all four LPPs, the guide RNA plasmids carry the pyrG gene for maintenance in the recipient strain upon transformation.
The Protoplasts of A2F0012 were transformed with pAT4859 and the 3 repair DNA fragments, the transformation was done on an automated platform using a Hamilton robots, the transformants were plated on sucrose plates without addition of uridine, and grown 4 days at 30°C, the transformation efficiency was 100 transformants; 5 transformants were selected, picked and transferred to sucrose agar in with urea and uridine MTP, the plate was incubated 3 days at 30°C, the copy number was determined by ddPCR giving 80% of transformants with 4 copy integrated.
Moreover, ddPCR was used to verify that the protospacer arrays in the four LPPs (PS-59) were no longer present additionally the LPP were PCR amplified for nanopore sequenced to confirm copy number integrated on each of the LPP and verify the integrated expression cassette in those 80% correct transformants, which indicates that the repair and insertion was successful.
A fermentation on microscale was done by making a spore solution of the successful transformants, those were inoculated on MTP containing 180pl of liquid sucrose media, urea and uridine and incubated for seven days at 30°C, after 24 hours of incubation 10pl 10,75 concentration maltose was added, on day 7 SDS page gels were made and enzyme activity assay was run to verify the enzyme production and validating the activity on the produced enzyme.
Example 11. Generation of a multi-copy lipase strain
For building the Aspergillus oryzae recipient strain AT6818 containing six LPPs, the AT5885 strain described in Example 1 comprising 4 LPPs was used as starting point. Two additional chromosomal loci were chosen for inserting LPP#5 and LPP#6 into AT5885. First, LPP#5 was inserted on chromosome 3 between gene AQ090026000695 and AQ090026000696 at the position 786 nucleotides upstream from the translation initiation codon of gene AQ090026000695. Second, LPP#6 was inserted on chromosome 7 at the nprA locus. The final strain was named AT6818, see Table 7 for an overview.
The full sequences of the six LPP protospacer arrays have been listed as:
- SEQ ID NO: 14 (LPP#1): the five protospacers PS-59, PS-73, PS-74, PS-75 and PS-76 for integration in the pepAb locus;
- SEQ ID NO: 15 (LPP#2): the five protospacers PS-59, PS-73, PS-74, PS-75 and PS-11 for integration in the pepE locus;
- SEQ ID NO: 16 (LPP#3): the five protospacers PS-59, PS-73, PS-48, PS-49 and PS-93 for integration in the amdS locus
- SEQ ID NO: 17 (LPP#4): the five protospacers PS-59, PS-73, PS-48, PS-89 and PS-94 for integration in the pepAa locus. - SEQ ID NO: 33 (LPP#5): the five protospacers PS-59, PS-73, PS-74, PS-79 and PS-80 for integration at the A0090026000695 locus.
- SEQ ID NO: 34 (LPP#6): the five protospacers PS-59, PS-73, PS-48, PS-89 and PS-90 for integration in the nprA locus. Table 7. Overview of strains, plasmids and PCR fragments (DNA fragments) used in Example 11 .
The primers were designed to include overlapping regions to allow for combination and integration of the 3 PCR-fragments in SYN2 landing pads in the A oryzae genome. The design is based on the method disclosed in W02020025357A1 (Novozymes A/S).
The 750_2rev primer contain a 14 bp overlap to the lipase expression cassette in the 750-2 PCR fragment. The 750_3FW primer contain a 27bp extension that overlaps with the 750-1 PCR fragment. Together the overlapping primers generate a 41 bp overlap between the PCR fragments, 750-1 and 750- 2.
The 750_4 primer contains a 11 bp overlap with the 750-3 fragment and the 750_5 primer contains a 26bp extension that overlaps with the 750-2 PCR-fragment. Together the overlapping primers generate a 37 bp overlap between the PCR fragments 750-2 and 750-3.
Protoplasts of the host strains listed in Table 7 were transformed with the CRISPR plasmid pAT4859 and three repair DNA fragments encoding SYN2-Left and SYN2-Right flanking regions homologous to the LPPs and a lipase TLL12225A expression cassette in between (Figure 4). In Fig. 4 the dashed lines represent in-vivo homologous recombination between the different PCR fragments. The transformations were performed manually using PEG. The transformants were plated onto sucrose plates containing the appropriate nitrogen source (nitrate for AT5885 and AT6818, and urea for AT5883) and top agar was added to the transformants. After 4 days of incubation at 30°C selected transformants were streak-purified twice and the integration of the expression cassette into the targeted LPPs was confirmed by ddPCR. In this way, the lipase strains shown in Table 8 were generated.
As can be seen from Table 8, using the SPADE system of the invention, with a single round of transformation of the PCR fragments we generated strains with 2, 4, and 6 copies of the lipase gene to occupy all LPPs of the host strains with the lipase genes. This example further demonstrates the efficient and versatile application of the SPADE system. Also, Example 11 shows that the SPADE system is not limited to be used with (linearized) plasmids, but can also be used with PCR fragments comprising the different features of the SPADE system, to further streamline strain generation.
Example 12. Two step generation of multiplex strains using internal gene transfer
For building the Aspergillus oryzae recipient strain, the AT5885 strain was deleted in the three genes acvA, ipnA and aatA to remove the strains’ ability to produce Penicillin. The resulting Aspergillus oryzae recipient strain was named AT6230.
A guide plasmid pAT4848, comprising gRNA sequence (SEQ ID NO: 13) enabling cutting of PS- 48 present in two LPP (LPP#3 and LPP#4) of the four LPPs in AT6230 was made.
A repair DNA construct, REPAIR-14 (= first donor polynucleotide), encoding two copies of Bovine Alpha-Lactalbumin flanked by the repair flanks SYN2L and SYN2R was made.
A repair DNA construct, REPAIR-15 (= second donor polynucleotide), encoding one copy of the protein disulfide isomerase gene pdiA flanked by the repair flanks SYN2L and SYN2R was made.
In stepl , the guide plasmid pAT4848 was transformed together with REPAIR-14 into AT6230 obtaining a strain A2F0285 having four copies of the Alpha-Lactalbumin gene with two copies inserted in LPP#3 and two copies inserted in LPP#4.
In step2, the guide plasmid pAT4873 was transformed together with REPAIR-15 into A2F0285. 20 Transformants were tested by ddPCR counting the number of Alpha-Lactalbumin genes and pdiA genes. Using this procedure, we obtained three different outcome strains: a. 8 copies of Alpha-Lactalbumin, 0 copies of pdiA. b. 6 copies of Alpha-Lactalbumin, 1 copy of pdiA. c. 4 copies of Alpha-Lactalbumin, 2 copies of pdiA.
Using internal gene copying mechanisms from one set of homology flanks to another, it was possible to obtain strains with varying ratios of alpha-lactalbumin and pdiA.
The invention described and claimed herein is not to be limited in scope by the specific aspects herein disclosed, since these aspects are intended as illustrations of several aspects of the invention. Any equivalent aspects are intended to be within the scope of this invention. Indeed, various modifications of the invention in addition to those shown and described herein will become apparent to those skilled in the art from the foregoing description. Such modifications are also intended to fall within the scope of the appended claims. In the case of conflict, the present disclosure including definitions will control.
The invention is further defined by the following numbered paragraphs:
1 . A host cell comprising in its genome at least two landing pad polynucleotide (LPP) sequences, each LPP sequence comprising in 5’ to 3’ direction: a 5’ homology arm, two or more protospacer sequences, and a 3’ homology arm, wherein the polynucleotide sequence of the 5’ homology arm is identical for at least two LPP sequences, and wherein the polynucleotide sequence of the 3’ homology arm is identical for at least two LPP sequences.
2. The host cell according to any one of the previous paragraphs, wherein at least one protospacer of the at least two protospacers is present in at least two LPP.
2a. The host cell according to any one of the previous paragraphs, wherein the cell comprises in its genome at least three LPP, and wherein at least one LPP comprises second 3’ and second 5’ homology arms having less than 100 % sequence identity to the 3’ and 5’ homology arms of the at least two LPP, respectively, e.g., less than 99%, less than 98%, less than 97%, less than 96%, less than 95%, less than 90%, less thank 85%, less than 80%, less than 75%, less than 70%, or less than 65%.
3. The host cell according to any one of the previous paragraphs, wherein the polynucleotide sequence of the 5’ homology arm is identical for all of the at least two LPP sequences.
4. The host cell according to any one of the previous paragraphs, wherein the polynucleotide sequence of the 3’ homology arm is identical for all of the at least two LPP sequences.
5. The host cell according to any one of the previous paragraphs, wherein the 3’ homology arms and the 5’ homology arms are heterologous to the host cell.
6a. The host cell according to any one of the previous paragraphs, wherein the 3’ homology arms and the 5’ homology arms are synthetic polynucleotide sequences.
6b. The host cell according to any one of the previous paragraphs, wherein at least one LPP is located on the coding strand (3’ to 5’ direction), and wherein at least one LPP is located on the template strand (5’ to 3’ direction).
6c. The host cell according to any one of the previous paragraphs, wherein at least two LPPs are located on the coding strand (3’ to 5’ direction), and wherein at least two LPPs are located on the template strand (5’ to 3’ direction).
7. The host cell according to any one of the previous paragraphs, wherein each LPP comprises two protospacer sequences, such as a first protospacer, and a second protospacer.
8. The host cell according to any one of the previous paragraphs, wherein each LPP comprises three different protospacer sequences, such as a first protospacer, a second protospacer, and a third protospacer.
9. The host cell according to any one of the previous paragraphs, wherein each LPP comprises four different protospacer sequences, such as a first protospacer, a second protospacer, a third protospacer, and a fourth protospacer.
10. The host cell according to any one of the previous paragraphs, wherein each LPP comprises five different protospacer sequences, such as a first protospacer, a second protospacer, a third protospacer, a fourth protospacer, and a fifth protospacer.
11 . The host cell according to any one of the previous paragraphs, wherein the first protospacer has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the second protospacer, to the polynucleotide sequence of the third protospacer, to the polynucleotide sequence of the fourth protospacer, and to the polynucleotide sequence of the fifth protospacer.
12. The host cell according to any one of the previous paragraphs, wherein the second protospacer has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the first protospacer, to the polynucleotide sequence of the third protospacer, to the polynucleotide sequence of the fourth protospacer, and to the polynucleotide sequence of the fifth protospacer.
13. The host cell according to any one of the previous paragraphs, wherein the third protospacer has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the first protospacer, to the polynucleotide sequence of the second protospacer, to the polynucleotide sequence of the fourth protospacer, and to the polynucleotide sequence of the fifth protospacer.
14. The host cell according to any one of the previous paragraphs, wherein the fourth protospacer has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the first protospacer, to the polynucleotide sequence of the second protospacer, to the polynucleotide sequence of the third protospacer, and to the polynucleotide sequence of the fifth protospacer.
15. The host cell according to any one of the previous paragraphs, wherein the fifth protospacer has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below
97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the first protospacer, to the polynucleotide sequence of the second protospacer, to the polynucleotide sequence of the third protospacer, and to the polynucleotide sequence of the fourth protospacer.
16. The host cell according to any one of the previous paragraphs, wherein each LPP comprises more than five different protospacer sequences.
17. The host cell according to any one of the previous paragraphs, wherein one or more of the at least two protospacer sequences is heterologous to the host cell.
18. The host cell according to any one of the previous paragraphs, wherein all of the at least two protospacer sequences are heterologous to the host cell.
19. The host cell according to any one of the previous paragraphs, wherein one or more of the at least two protospacer sequences is synthetic.
20. The host cell according to any one of the previous paragraphs, wherein all of the at least two protospacer sequences is synthetic.
21. The host cell according to any one of the previous paragraphs, wherein each protospacer sequence has a length of at least 15 nt, at least 20 nt, or at least 25 nt, preferably each protospacer has a length of 20 - 30 nt.
22. The host cell according to any one of the previous paragraphs, wherein the host cell comprises two LPP sequences, three LPP sequences, four LPP sequences, 5 LPP sequences, 6 LPP sequences, 7 LPP sequences, 8 LPP sequences, 9 LPP sequences, 10 LPP sequences, 11 LPP sequences, 12 LPP sequences, 13 LPP sequences, 14 LPP sequences, 15 LPP sequences, 16 LPP sequences, or more than 16 LPP sequences.
23. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 2 - 8 LPP sequences.
24. The host cell according to any one of the previous paragraphs, wherein the host cell comprises at least two chromosomes, at least four chromosomes, at least six chromosomes or at least 8 chromosomes.
25. The host cell according to any one of the previous paragraphs, wherein one or more of the host cell chromosomes comprises a total of 1 LPP per chromosome.
26. The host cell according to any one of the previous paragraphs, wherein two or more chromosomes comprise a total of 1 LPP per chromosome.
27. The host cell according to any one of the previous paragraphs, wherein three chromosomes comprise a total of 1 LPP per chromosome.
28. The host cell according to any one of the previous paragraphs, wherein four chromosomes comprise a total of 1 LPP per chromosome.
29. The host cell according to any one of the previous paragraphs, wherein five chromosomes comprise a total of 1 LPP per chromosome.
30. The host cell according to any one of the previous paragraphs, wherein six chromosomes comprise a total of 1 LPP per chromosome.
31 . The host cell according to any one of the previous paragraphs, wherein 7 chromosomes comprise a total of 1 LPP per chromosome.
32. The host cell according to any one of the previous paragraphs, wherein 8 chromosomes comprise a total of 1 LPP per chromosome.
33. The host cell according to any one of the previous paragraphs, wherein each chromosome comprises a total of 1 LPP per chromosome.
34. The host cell according to any one of the previous paragraphs, wherein the LPP is located on the short arm of a chromosome.
35. The host cell according to any one of the previous paragraphs, wherein the LPP is located on the long arm of a chromosome.
36. The host cell according to any one of the previous paragraphs, wherein one or more of the chromosomes comprises a total of 2 LPP per chromosome.
37. The host cell according to any one of the previous paragraphs, wherein two or more chromosomes comprise a total of 2 LPP per chromosome.
38. The host cell according to any one of the previous paragraphs, wherein three chromosomes comprise a total of 2 LPP per chromosome.
39. The host cell according to any one of the previous paragraphs, wherein four chromosomes comprise a total of 2 LPP per chromosome.
40. The host cell according to any one of the previous paragraphs, wherein five chromosomes comprise a total of 2 LPP per chromosome.
41 . The host cell according to any one of the previous paragraphs, wherein six chromosomes comprise a total of 2 LPP per chromosome.
42. The host cell according to any one of the previous paragraphs, wherein 7 chromosomes comprise a total of 2 LPP per chromosome.
43. The host cell according to any one of the previous paragraphs, wherein 8 chromosomes comprise a total of 2 LPP per chromosome.
44. The host cell according to any one of the previous paragraphs, wherein each chromosome comprises a total of 2 LPP per chromosome.
45. The host cell according to any one of the previous paragraphs, wherein for each of the one or more chromosomes, the 2 LPP are located on opposite locations of the centromere of the chromosome.
46. The host cell according to any one of the previous paragraphs, wherein one LPP is located on the short arm, and one LPP is located on the long arm.
47. The host cell according to any one of the previous paragraphs, wherein one or more of the chromosomes comprises a total of 3 LPP per chromosome.
48. The host cell according to any one of the previous paragraphs, wherein two or more chromosomes comprise a total of 3 LPP per chromosome.
49. The host cell according to any one of the previous paragraphs, wherein three chromosomes comprise a total of 3 LPP per chromosome.
50. The host cell according to any one of the previous paragraphs, wherein four chromosomes comprise a total of 3 LPP per chromosome.
51 . The host cell according to any one of the previous paragraphs, wherein five chromosomes comprise a total of 3 LPP per chromosome.
52. The host cell according to any one of the previous paragraphs, wherein six chromosomes comprise a total of 3 LPP per chromosome.
53. The host cell according to any one of the previous paragraphs, wherein 7 chromosomes comprise a total of 3 LPP per chromosome.
54. The host cell according to any one of the previous paragraphs, wherein 8 chromosomes comprise a total of 3 LPP per chromosome.
55. The host cell according to any one of the previous paragraphs, wherein each chromosome comprises a total of 3 LPP per chromosome.
56. The host cell according to any one of the previous paragraphs, comprising 3 LPP per chromosome, the ratio of numbers of LPP between short arm and long arm is selected from the list of 3:0, 0:3, 2:1 , and 1 :2.
57. The host cell according to any one of the previous paragraphs, wherein one or more of the chromosomes comprises a total of 4 LPP per chromosome.
58. The host cell according to any one of the previous paragraphs, wherein two or more chromosomes comprise a total of 4 LPP per chromosome.
59. The host cell according to any one of the previous paragraphs, wherein three chromosomes comprise a total of 4 LPP per chromosome.
60. The host cell according to any one of the previous paragraphs, wherein four chromosomes comprise a total of 4 LPP per chromosome.
61 . The host cell according to any one of the previous paragraphs, wherein five chromosomes comprise a total of 4 LPP per chromosome.
62. The host cell according to any one of the previous paragraphs, wherein six chromosomes comprise a total of 4 LPP per chromosome.
63. The host cell according to any one of the previous paragraphs, wherein 7 chromosomes comprise a total of 4 LPP per chromosome.
64. The host cell according to any one of the previous paragraphs, wherein 8 chromosomes comprise a total of 4 LPP per chromosome.
65. The host cell according to any one of the previous paragraphs, wherein each chromosome comprises a total of 4 LPP per chromosome.
66. The host cell according to any one of the previous paragraphs, wherein for 4 LPP per chromosome, the ratio of numbers of LPP between short arm and long arm is selected from the list of 4:0, 0:4, 3:1 , 1 :3, 2:2, 2:1 , and 1 :2.
67. The host cell according to any one of the previous paragraphs, wherein the host cell is deficient of a non-homologous end joining (NHEJ) mechanism.
68. The host cell according to any one of the previous paragraphs, wherein each 3’ homology arm and each 5’ homology arm has a length of at least 10 nt, at least 20 nt, at least 30 nt, at least 50 nt, at least 100 nt, 200 nucleotides (nt), at least 300 nt, at least 400 nt, at least 500 nt, at least 600 nt, at least 700 nt, at least 800 nt, at least 900 nt, or at least 1000 nt.
69. The host cell according to any one of the previous paragraphs, wherein each 3’ homology arm and each 5’ homology arm has a length of at least 100 nt.
70. The host cell according to any one of the previous paragraphs, wherein each 3’ homology arm and each 5’ homology arm has a length of at least 800 nt, at least 900 nt, or at least 1000 nt.
71 . The host cell according to any one of the previous paragraphs, wherein one or more LPP of the at least two LPP comprise a selectable marker, and/or a counter-selectable marker.
72. The host cell according to any one of the previous paragraphs, wherein 2 or more LPP of the at least two LPP comprise a selectable marker, and/or a counter-selectable marker.
73. The host cell according to any one of the previous paragraphs, wherein 3 or more LPP of the at least two LPP comprise a selectable marker, and/or a counter-selectable marker.
74. The host cell according to any one of the previous paragraphs, wherein 4 or more LPP of the at least two LPP comprise a selectable marker, and/or a counter-selectable marker.
75. The host cell according to any one of the previous paragraphs, wherein 5 or more LPP of the at least two LPP comprise a selectable marker, and/or a counter-selectable marker.
76. The host cell according to any one of the previous paragraphs, wherein 6 or more LPP of the at least two LPP comprise a selectable marker, and/or a counter-selectable marker.
77. The host cell according to any one of the previous paragraphs, wherein 7 or more LPP of the at least two LPP comprise a selectable marker, and/or a counter-selectable marker.
78. The host cell according to any one of the previous paragraphs, wherein 8 or more LPP of the at least two LPP comprise a selectable marker, and/or a counter-selectable marker.
79. The host cell according to any one of the previous paragraphs, wherein the selectable marker comprises a marker selected from the list of a pyrG gene, a fcy1 gene, an adeA gene, an adeB gene, a metF gene, a metG gene, a metH gene, a lysF gene, a trpC gene, a sC gene, an argB gene, and a gene encoding a fluorescent protein, such as a GFP, YFP, CFP, DsRed, or a eqPF611 .
80. The host cell according to any one of the previous paragraphs, wherein the counter-selectable marker comprises a marker selected from the list of a pyrG gene, a fcy1 gene, an adeA gene, an adeB gene, a metF gene, a metG gene, a metH gene, a lysF gene, a trpC gene, a sC gene, an argB gene, and a gene encoding a fluorescent protein, such as a GFP, YFP, CFP, DsRed, or a eqPF611 .
81 . The host cell according to any one of the previous paragraphs, wherein none of the at least two LPP comprises a selectable marker, and/or a counter-selectable marker.
82. The host cell according to any one of the previous paragraphs, wherein each DNA strain complementary to the two or more protospacers sequences comprises a PAM sequence (protospacer adjacent motif).
83. The host cell according to any one of the previous paragraphs, wherein the PAM sequence is selected from the list of “NGG”, “NGRRT”, “NGRRN”, “NNNNGATT”, “NNNNRYAC”, “NNAGAAW”, “TTTV”, “TTTC”, “TTTN”, “TTN”, “ATTN”, “GTTN”, and “CTTN” (“N” being any nucleotide, “Y” = C or T, “V”= A or C or G, “W - A or T, “R”= A or G,), preferably the PAM sequence is TTTN, more preferably the PAM sequence is TTTC.
84. The host cell according to any one of the previous paragraphs, wherein the first, second, third, fourth or fifth protospacer sequence is selected from a sequence having a sequence identity of at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEW ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 1 1 , SEQ ID NO: 12, or SEQ ID NO: 13.
85. The host cell according to any one of the previous paragraphs, wherein the 5’ homology arm sequence comprises or consists of SEQ ID NO: 1 or SEQ ID NO: 2.
85a. The host cell according to any one of the previous pharagraphs, wherein the 5’ homology arm sequences comprises or consists of a polynucleotide sequence having a sequence identity of at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to SEQ ID NO: 1 or SEQ ID NO: 2.
86. The host cell according to any one of the previous paragraphs, wherein the 3’ homology arm sequence comprises or consists of SEQ ID NO: 1 or SEQ ID NO: 2.
86a. The host cell according to any one of the previous pharagraphs, wherein the 3’ homology arm sequences comprises or consists of a polynucleotide sequence having a sequence identity of at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to SEQ ID NO: 1 or SEQ ID NO: 2.
87. The host cell according to any one of the previous paragraphs, wherein each of the at least two LPP comprises a first protospacer sequence with a polynucleotide sequence identical for each first protospacer of each at least two LPP.
88. The host cell according to any one of the previous paragraphs, wherein each of the at least two LPP comprises a second protospacer sequence with a polynucleotide sequence which is unique for each second protospacer of each at least two LPP.
89. The host cell according to any one of the previous paragraphs, wherein each of the at least two LPP comprises a third protospacer sequence with a polynucleotide sequence which is identical for each third polynucleotide within a third subset of the at least two LPPs.
90. The host cell according to any one of the previous paragraphs, wherein the third subset consists of 2 LPP sequences, 3 LPP sequences, 4 LPP sequences, 5 LPP sequences, 6 LPP sequences, 7 LPP sequences, 8 LPP sequences, 9 LPP sequences, 10 LPP, sequences, 11 LPP sequences, 12 LPP sequences, 13 LPP sequences, 14 LPP sequences, or 15 LPP sequences.
91 . The host cell according to any one of the previous paragraphs, wherein each of the at least two LPP comprises a fourth protospacer sequence with a polynucleotide sequence which is identical for each fourth polynucleotide within a fourth subset of the at least two LPPs.
92. The host cell according to any one of the previous paragraphs, wherein the fourth subset consists of 2 LPP sequences, 3 LPP sequences, 4 LPP sequences, 5 LPP sequences, 6 LPP sequences, 7 LPP sequences, 8 LPP sequences, 9 LPP sequences, 10 LPP, sequences, 11 LPP sequences, 12 LPP sequences, 13 LPP sequences, 14 LPP sequences, or 15 LPP sequences.
93. The host cell according to any one of the previous paragraphs, wherein each of the at least two LPP comprises a fifth protospacer sequence with a polynucleotide sequence which is identical for each fifth protospacer within a fifth subset of the at least two LPPs.
94. The host cell according to any one of the previous paragraphs, wherein the fifth subset consists of 2 LPP sequences, 3 LPP sequences, 4 LPP sequences, 5 LPP sequences, 6 LPP sequences, 7 LPP sequences, 8 LPP sequences, 9 LPP sequences, 10 LPP, sequences, 11 LPP sequences, 12 LPP sequences, 13 LPP sequences, 14 LPP sequences, or 15 LPP sequences.
95. The host cell according to any one of the previous paragraphs, wherein the host cell comprises three LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the three LPPs.
96. The host cell according to any one of the previous paragraphs, wherein the host cell comprises four LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the four LPPs.
97. The host cell according to any one of the previous paragraphs, wherein the host cell comprises four LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the four LPPs.
98. The host cell according to any one of the previous paragraphs, wherein the host cell comprises five LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the five LPPs.
99. The host cell according to any one of the previous paragraphs, wherein the host cell comprises five LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the five LPPs.
100. The host cell according to any one of the previous paragraphs, wherein the host cell comprises five LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the five LPPs.
101 . The host cell according to any one of the previous paragraphs, wherein the host cell comprises six LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the six LPPs.
102. The host cell according to any one of the previous paragraphs, wherein the host cell comprises six LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the six LPPs.
103. The host cell according to any one of the previous paragraphs, wherein the host cell comprises six LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the six LPPs.
104. The host cell according to any one of the previous paragraphs, wherein the host cell comprises six LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the six LPPs.
105. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 7 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 7 LPPs.
106. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 7 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 7 LPPs.
107. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 7 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 7 LPPs.
108. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 7 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 7 LPPs.
109. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 7 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 7 LPPs.
110. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 8 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 8 LPPs.
111. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 8 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 8 LPPs.
112. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 8 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 8 LPPs.
113. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 8 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 8 LPPs.
114. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 8 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 8 LPPs.
115. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 8 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 7 of the 8 LPPs.
116. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 9 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 9 LPPs.
117. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 9 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 9 LPPs.
118. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 9 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 9 LPPs.
119. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 9 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 9 LPPs.
120. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 9 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 9 LPPs.
121 . The host cell according to any one of the previous paragraphs, wherein the host cell comprises 9 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 7 of the 9 LPPs.
122. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 9 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 8 of the 9 LPPs.
123. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 10 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 10 LPPs.
124. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 10 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 10 LPPs.
125. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 10 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 10 LPPs.
126. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 10 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 10 LPPs.
127. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 10 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 10 LPPs.
128. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 10 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 7 of the 10 LPPs.
129. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 10 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 8 of the 10 LPPs.
130. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 10 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 9 of the 10 LPPs.
131. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 11 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 11 LPPs.
132. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 11 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 11 LPPs.
134. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 11 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 11 LPPs.
135. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 11 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 11 LPPs.
136. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 11 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 11 LPPs.
137. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 11 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 7 of the 11 LPPs.
138. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 11 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 8 of the 11 LPPs.
139. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 11 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 9 of the 11 LPPs.
140. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 11 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 10 of the 11 LPPs.
141 . The host cell according to any one of the previous paragraphs, wherein the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 12 LPPs.
142. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 12 LPPs.
143. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 12 LPPs.
144. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 12 LPPs.
145. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 12 LPPs.
146. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 7 of the 12 LPPs.
147. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 8 of the 12 LPPs.
148. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 9 of the 12 LPPs.
149. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 10 of the 12LPPs.
150. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 12 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 11 of the 12 LPPs.
151 . The host cell according to any one of the previous paragraphs, wherein the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 13 LPPs.
152. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 13 LPPs.
153. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 13 LPPs.
154. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 13 LPPs.
155. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 13 LPPs.
156. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 7 of the 13 LPPs.
157. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 8 of the 13 LPPs.
158. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 9 of the 13 LPPs.
159. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 10 of the 13 LPPs.
160. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 11 of the 13 LPPs.
161 . The host cell according to any one of the previous paragraphs, wherein the host cell comprises 13 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 12 of the 13 LPPs.
162. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 14 LPPs.
163. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 14 LPPs.
164. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 14 LPPs.
165. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 14 LPPs.
166. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 14 LPPs.
167. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 7 of the 14 LPPs.
168. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 8 of the 14 LPPs.
169. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 9 of the 14 LPPs.
170. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 10 of the 14 LPPs.
171 . The host cell according to any one of the previous paragraphs, wherein the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 11 of the 14 LPPs.
172. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 12 of the 14 LPPs.
173. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 14 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 13 of the 14 LPPs.
174. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 15 LPPs.
175. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 15 LPPs.
176. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 15 LPPs.
177. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 15 LPPs.
178. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 15 LPPs.
179. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 7 of the 15 LPPs.
180. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 8 of the 15 LPPs.
181 . The host cell according to any one of the previous paragraphs, wherein the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 9 of the 15 LPPs.
182. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 10 of the 15 LPPs.
183. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 11 of the 15 LPPs.
184. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 12 of the 15 LPPs.
185. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 13 of the 15 LPPs.
186. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 15 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 14 of the 15 LPPs.
187. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for two of the 16 LPPs.
188. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for three of the 16 LPPs.
189. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for four of the 16 LPPs.
190. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for five of the 16 LPPs.
191 . The host cell according to any one of the previous paragraphs, wherein the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for six of the 16 LPPs.
192. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 7 of the 16 LPPs.
193. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 8 of the 16 LPPs.
194. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 9 of the 16 LPPs.
195. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 10 of the 16 LPPs.
196. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 11 of the 16 LPPs.
197. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 12 of the 16 LPPs.
198. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 13 of the 16 LPPs.
199. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 14 of the 16 LPPs.
200. The host cell according to any one of the previous paragraphs, wherein the host cell comprises 16 LPP sequences, and wherein the third, fourth, or fifth protospacer sequence is identical for 15 of the 16 LPPs.
201 . The host cell according to any one of the previous paragraphs, wherein the host cell is a fungal cell.
202. The host cell according to any one of the previous paragraphs, wherein the host cell is a yeast recombinant host cell, e.g., a Candida, Hansenula, Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia cell, such as a Kluyveromyces lactis, Saccharomyces carlsbergensis, Saccharomyces cerevisiae, Saccharomyces diastaticus, Saccharomyces douglasii, Saccharomyces kluyveri, Saccharomyces norbensis, Saccharomyces oviformis, or Yarrowia lipolytica cell.
203. The host cell according to any one of the previous paragraphs, wherein the host cell is a filamentous fungal recombinant host cell, e.g., an Acremonium, Aspergillus, Aureobasidium, Bjerkandera, Ceriporiopsis, Chrysosporium, Coprinus, Coriolus, Cryptococcus, Filibasidium, Fusarium, Humicola, Magnaporthe, Mucor, Myceliophthora, Neocallimastix, Neurospora, Paecilomyces, Penicillium, Phanerochaete, Phlebia, Piromyces, Pleurotus, Schizophyllum, Talaromyces, Thermoascus, Thielavia, Tolypocladium, Trametes, or Trichoderma cell, in particular, an Aspergillus awamori, Aspergillus foetidus, Aspergillus fumigatus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus niger, Aspergillus oryzae, Bjerkandera adusta, Ceriporiopsis aneirina, Ceriporiopsis caregiea, Ceriporiopsis gilvescens, Ceriporiopsis pannocinta, Ceriporiopsis rivulosa, Ceriporiopsis subrufa, Ceriporiopsis subvermispora, Chrysosporium inops, Chrysosporium keratinophilum, Chrysosporium lucknowense, Chrysosporium merdarium, Chrysosporium pannicola, Chrysosporium queenslandicum, Chrysosporium tropicum, Chrysosporium zonatum, Coprinus cinereus, Coriolus hirsutus, Fusarium bactridioides, Fusarium cerealis, Fusarium crookwellense, Fusarium culmorum, Fusarium graminearum, Fusarium graminum, Fusarium heterosporum, Fusarium negundi, Fusarium oxysporum, Fusarium reticulatum, Fusarium roseum, Fusarium sambucinum, Fusarium sarcochroum, Fusarium sporotrichioides, Fusarium sulphureum, Fusarium torulosum, Fusarium trichothecioides, Fusarium venenatum, Humicola insolens, Humicola lanuginosa, Mucor miehei, Myceliophthora thermophila, Neurospora crassa, Penicillium purpurogenum, Phanerochaete chrysosporium, Phlebia radiata, Pleurotus eryngii, Talaromyces emersonii, Thielavia terrestris, Trametes villosa, Trametes versicolor, Trichoderma harzianum, Trichoderma koningii, Trichoderma longibrachiatum, Trichoderma reesei, or Trichoderma viride cell.
204. The host cell according to any one of the previous paragraphs, wherein the host cell is an Aspergillus cell, preferably an Aspergillus niger cell, or an Aspergillus oryzae cell.
204a. The host cell according to any one of the previous paragraphs, wherein the host cell is a Trichoderma cell, preferably a Trichderma reesei cell.
205. The host cell according to any one of the previous paragraphs, which is isolated.
206. The host cell according to any one of the previous paragraphs, which is purified.
207. The host cell according to any one of the previous paragraphs, wherein the host cell is a prokaryotic recombinant host cell, e.g., a Gram-positive cell selected from the group consisting of Bacillus, Clostridium, Enterococcus, Geobacillus, Lactobacillus, Lactococcus, Oceanobacillus, Staphylococcus, Streptococcus, or Streptomyces cells, or a Gram-negative bacteria selected from the group consisting of Campylobacter, E. coll, Flavobacterium, Fusobacterium, Helicobacter, llyobacter, Neisseria, Pseudomonas, Salmonella, and Ureaplasma cells, such as Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, Bacillus firmus, Bacillus lautus, Bacillus lentus, Bacillus lichen! formis, Bacillus megaterium, Bacillus pumilus, Bacillus stearothermophilus, Bacillus subtilis, Bacillus thuringiensis, Streptococcus equisimilis, Streptococcus pyogenes, Streptococcus uberis, and Streptococcus equi subsp. Zooepidemicus, Streptomyces achromogenes, Streptomyces avermitilis, Streptomyces coelicolor, Streptomyces griseus, and Streptomyces lividans cells.
208. The host cell according to any one of the previous paragraphs, wherein the host cell is a Bacillus cell.
209. The host cell according to any one of the previous paragraphs, wherein the host cell is a Bacillus lichen! formis cell.
210. The host cell according to any one of the previous paragraphs, wherein the host cell is a Bacillus subtilis cell.
21 1 . The host cell according to any one of the previous paragraphs, wherein the host cell is a prokaryotic host cell and wherein at least one LPP is located between two selectable and/or counter-selectable marker genes.
212. The host cell according to any one of the previous paragraphs, wherein the host cell is a mammalian cell.
213. The host cell according to any one of the previous paragraphs, wherein the mammalian cell is selected from the list of a human cell, a mouse cell, a rat cell, a mouse hybridoma cell, a hamster cell (e.g. Chinese hamster ovary cell), and a rat hybridoma cell.
214. A method for generating a multi-copy host cell, comprising the steps of a) providing a host cell according to any one of the previous paragraphs, b) delivering to the host cell a gene editing system comprising:
- a RNA-guided DNA nuclease,
- at least one guide RNA directed to one or more protospacer of the one or more LPP sequences,
- a first donor polynucleotide comprising in 5’ to 3’ direction: a 5’ homology arm, a first polynucleotide encoding a first polypeptide of interest, and a 3’ homology arm, OR a set of first PCR fragments comprising a first 5’ homology arm, a first 3’ homology arm, and a first polynucleotide encoding a first polypeptide of interest, wherein the 3’ homology arm is identical to the 3’ homology arm of one or more LPP, and wherein the 5’ homology arm is identical to the 5’ homology arm of one or more LPP, and
c) cultivating the host cell at a first temperature allowing one or more first guide RNA of the at least one guide RNA to form a complex with the nuclease at one or more protospacer of one or more LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one or more LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break.
215. The method according to any one of the previous paragraphs, wherein the first polypeptide is heterologous to the host cell.
216. The method according to any one of the previous paragraphs, wherein the first polynucleotide is operably linked to one or more control sequences that direct the production of the first polypeptide.
217. The method according to any one of the previous paragraphs, wherein at least one of the one or more control sequences is heterologous to the first polynucleotide.
218. The method according to any one of the previous paragraphs, wherein the first guide RNA is complementary to one or more first protospacers, wherein during step c) the complex is formed at one or more first protospacer of the one or more LPP, creating a double-strand break at or near the one or more first protospacer.
219. The method according to any one of the previous paragraphs, wherein the first guide RNA is complementary to one or more second protospacers, wherein during step c) the complex is formed at one or more second protospacer of the one or more LPP, creating a double-strand break at or near the one or more second protospacer.
220. The method according to any one of the previous paragraphs, wherein the first guide RNA is complementary to one or more third protospacers, wherein during step c) the complex is formed at one or more third protospacer of the one or more LPP, creating a double-strand break at or near the one or more third protospacer.
221 . The method according to any one of the previous paragraphs, wherein the first guide RNA is complementary to one or more fourth protospacers, wherein during step c) the complex is formed at one or more fourth protospacer of the one or more LPP, creating a double-strand break at or near the one or more fourth protospacer.
222. The method according to any one of the previous paragraphs, wherein the first guide RNA is complementary to one or more fifth protospacers, wherein during step c) the complex is formed at one or more fifth protospacer of the one or more LPP, creating a double-strand break at or near the one or more fifth protospacer.
223. The method according to any one of the previous paragraphs, wherein during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of one LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break to provide a host cell with one copy of the first donor polynucleotide in its genome.
224. The method according to any one of the previous paragraphs, wherein during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of two LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the two LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with two copies of the first donor polynucleotide in its genome.
225. The method according to any one of the previous paragraphs, wherein during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of three LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the three LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the doublestrand break to provide a host cell with three copies of the first donor polynucleotide in its genome.
226. The method according to any one of the previous paragraphs, wherein during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of four LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the four LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with four copies of the first donor polynucleotide in its genome.
227. The method according to any one of the previous paragraphs, wherein during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of five LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the five LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with five copies of the first donor polynucleotide in its genome.
228. The method according to any one of the previous paragraphs, wherein during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of six LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the six LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with six copies of the first donor polynucleotide in its genome.
229. The method according to any one of the previous paragraphs, wherein during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 7 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 7 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 7 copies of the first donor polynucleotide in its genome.
230. The method according to any one of the previous paragraphs, wherein during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 8 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break
at or near the one or more protospacer sequence of the 8 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 8 copies of the first donor polynucleotide in its genome.
231 . The method according to any one of the previous paragraphs, wherein during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 9 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 9 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 9 copies of the first donor polynucleotide in its genome.
232. The method according to any one of the previous paragraphs, wherein during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 10 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 10 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 10 copies of the first donor polynucleotide in its genome.
233. The method according to any one of the previous paragraphs, wherein during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 11 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 11 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 11 copies of the first donor polynucleotide in its genome.
234. The method according to any one of the previous paragraphs, wherein during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 12 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 12 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 12 copies of the first donor polynucleotide in its genome.
235. The method according to any one of the previous paragraphs, wherein during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 13 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 13 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 13 copies of the first donor polynucleotide in its genome.
236. The method according to any one of the previous paragraphs, wherein during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 14 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 14 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 14 copies of the first donor polynucleotide in its genome.
237. The method according to any one of the previous paragraphs, wherein during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 15 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 15 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 15 copies of the first donor polynucleotide in its genome.
238. The method according to any one of the previous paragraphs, wherein during step c) the first temperature allows the one or more first guide RNA to form a complex at one or more protospacers of 16 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 16 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with 16 copies of the first donor polynucleotide in its genome.
239. The method according to any one of the previous paragraphs, wherein during step b) there are delivered at least two first guide RNAs, each first guide RNA forming a complex with the nuclease at a first, second, third, fourth, or fifth protospacer.
240. The method according to any one of the previous paragraphs, wherein the at least two first guide RNAs comprise at least two of: a 1st first guide RNA, a 2nd first guide RNA, a 3rd first guide RNA, a 4th first guide RNA, and a 5th first guide RNA.
241. The method according to any one of the previous paragraphs, wherein the 1st first guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 2nd first guide RNA, to the polynucleotide sequence of 3rd first guide RNA, to the polynucleotide sequence of the 4th first guide RNA, and to the polynucleotide sequence of the 5th first guide RNA.
242. The method according to any one of the previous paragraphs, wherein the 2nd first guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1st first guide RNA, to the polynucleotide sequence of 3rd first guide RNA, to the polynucleotide sequence of the 4th first guide RNA, and to the polynucleotide sequence of the 5th first guide RNA.
243. The method according to any one of the previous paragraphs, wherein the 3rd first guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1st first guide RNA, to the
polynucleotide sequence of 2nd first guide RNA, to the polynucleotide sequence of the 4th first guide RNA, and to the polynucleotide sequence of the 5th first guide RNA.
244. The method according to any one of the previous paragraphs, wherein the 4th first guide RNA has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1st first guide RNA, to the polynucleotide sequence of 2nd first guide RNA, to the polynucleotide sequence of the 3rd first guide RNA, and to the polynucleotide sequence of the 5th first guide RNA.
245. The method according to any one of the previous paragraphs, wherein the 5th first guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1st first guide RNA, to the polynucleotide sequence of 2nd first guide RNA, to the polynucleotide sequence of the 3rd first guide RNA, and to the polynucleotide sequence of the 4th first guide RNA.
246. The method according to any one of the previous paragraphs, wherein the one or more first guide RNA comprises a 1st first guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 1st first guide RNA is complementary to a first protospacer sequence.
247. The method according to any one of the previous paragraphs, wherein the one or more first guide RNA comprises a 2nd first guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 2nd first guide RNA is complementary to a second protospacer sequence.
248. The method according to any one of the previous paragraphs, wherein the one or more first guide RNA comprises a 3rd first guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 3rd first guide RNA is complementary to a third protospacer sequence.
249. The method according to any one of the previous paragraphs, wherein the one or more first guide RNA comprises a 4th first guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 4th first guide RNA is complementary to a fourth protospacer sequence.
250. The method according to any one of the previous paragraphs, wherein the one or more first guide RNA comprises a 5th first guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 5th first guide RNA is complementary to a fifth protospacer sequence.
251 . The method according to any one of the previous paragraphs, wherein the at least one guide RNA comprises a first guide RNA and a second guide RNA, and wherein the method additionally comprises step d): d) cultivating the host cell at a second temperature allowing one or more second guide RNA of the at least one guide RNA to form a complex with the nuclease at one or more protospacer of the one or more LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one or more LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break.
251 a. The method according to any one of the previous paragraphs, wherein during step b) there is additionally delivered one or more second polynucleotide comprising in 5’ to 3’ direction: a 5’ homology arm, a second polynucleotide encoding a second polypeptide of interest, and a 3’ homology arm, OR a set of one or more second PCR fragments comprising a second 3’ homology arm, a second 5’ homology arm, and a second polynucleotide encoding a second polypeptide of interest, wherein the 3’ homology arm is identical to the 3’ homology arm of one or more LPP and wherein the 5’ homology arm is identical to the 5’ homology arm of one or more LPP, and wherein during step c) the homology directed repair comprises insertion of the second donor polynucleotide or set of second PCR fragments at the double-strand break.
251 b. The method according to any one of the previous paragraphs, wherein during step b) there is additionally delivered one or more first polynucleotide comprising in 5’ to 3’ direction: a second 5’ homology arm, a first polynucleotide encoding the first polypeptide of interest, and a second 3’ homology arm, OR a set of one or more first PCR fragments comprising a second 3’ homology arm, a second 5’ homology arm, and the first polynucleotide encoding the first polypeptide of interest, wherein the second 3’ homology arms and the second 5’ homology arms have less than 100% sequence identity to the 3’ and 5’ homology arms of the at least two LPP, respectively, e.g., less than 99%, less than 98%, less than 97%, less than 96%, less than 95%, less than 90%, less thank 85%, less than 80%, less than 75%, less than 70%, or less than 65%, and wherein during step c) the homology directed repair comprises insertion of the first donor polynucleotide or set of PCR fragments at the double-strand break between a second 3’ homology arm and a second 5’ homology arm.
252. The method according to any one of the previous paragraphs, wherein the second guide RNA is complementary to one or more first protospacers, wherein during step d) the complex is formed at one or more first protospacer of the one or more LPP, creating a double-strand break at or near the one or more first protospacer.
253. The method according to any one of the previous paragraphs, wherein the second guide RNA is complementary to one or more second protospacers, wherein during step d) the complex is formed at one or more second protospacer of the one or more LPP, creating a double-strand break at or near the one or more second protospacer.
254. The method according to any one of the previous paragraphs, wherein the second guide RNA is complementary to one or more third protospacers, wherein during step d) the complex is formed at one or more third protospacer of the one or more LPP, creating a double-strand break at or near the one or more third protospacer.
255. The method according to any one of the previous paragraphs, wherein the second guide RNA is complementary to one or more fourth protospacers, wherein during step d) the complex is formed at one or more fourth protospacer of the one or more LPP, creating a double-strand break at or near the one or more fourth protospacer.
256. The method according to any one of the previous paragraphs, wherein the second guide RNA is complementary to one or more fifth protospacers, wherein during step d) the complex is formed at one or more fifth protospacer of the one or more LPP, creating a double-strand break at or near the one or more fifth protospacer.
257. The method according to any one of the previous paragraphs, wherein during step d) the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of one LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the doublestrand break to provide a host cell with at least one copy of the first donor polynucleotide in its genome.
258. The method according to any one of the previous paragraphs, wherein during step d) the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of two LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the two LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the doublestrand break to provide a host cell with at least two copies of the first donor polynucleotide in its genome.
259. The method according to any one of the previous paragraphs, wherein during step d) the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of three LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the three LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the doublestrand break to provide a host cell with at least three copies of the first donor polynucleotide in its genome.
260. The method according to any one of the previous paragraphs, wherein during step d) the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of four LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the four LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the doublestrand break to provide a host cell with at least four copies of the first donor polynucleotide in its genome.
261 . The method according to any one of the previous paragraphs, wherein during step d) the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of five LPP, and wherein the complex promotes the homology directed repair by creating a double-strand
break at or near the one or more protospacer sequence of the five LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the doublestrand break to provide a host cell with at least five copies of the first donor polynucleotide in its genome.
262. The method according to any one of the previous paragraphs, wherein during step d) the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of six LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the six LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the doublestrand break to provide a host cell with at least six copies of the first donor polynucleotide in its genome.
263. The method according to any one of the previous paragraphs, wherein during step d) the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of 8 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 8 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with at least 8 copies of the first donor polynucleotide in its genome.
264. The method according to any one of the previous paragraphs, wherein during step d) the second temperature allows the one or more second guide RNA to form a complex at one or more protospacers of 16 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 16 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the doublestrand break to provide a host cell with at least 16 copies of the first donor polynucleotide in its genome.
265. The method according to any one of the previous paragraphs, wherein during step b) there are delivered at least two second guide RNAs, each second guide RNA forming a complex with the nuclease at a first, second, third, fourth, or fifth protospacer.
266. The method according to any one of the previous paragraphs, wherein the at least two second guide RNAs comprise at least two of: a 1st second guide RNA, a 2nd second guide RNA, a 3rd second guide RNA, a 4th second guide RNA, and a 5th second guide RNA.
267. The method according to any one of the previous paragraphs, wherein the 1st second guide RNA has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 2nd second guide RNA, to the polynucleotide sequence of 3rd second guide RNA, to the polynucleotide sequence of the 4th second guide RNA, and to the polynucleotide sequence of the 5th second guide RNA.
268. The method according to any one of the previous paragraphs, wherein the 2nd second guide RNA has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1st second guide RNA, to the
polynucleotide sequence of 3rd second guide RNA, to the polynucleotide sequence of the 4th second guide RNA, and to the polynucleotide sequence of the 5th second guide RNA.
269. The method according to any one of the previous paragraphs, wherein the 3rd second guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1st second guide RNA, to the polynucleotide sequence of 2nd second guide RNA, to the polynucleotide sequence of the 4th second guide RNA, and to the polynucleotide sequence of the 5th second guide RNA.
270. The method according to any one of the previous paragraphs, wherein the 4th second guide RNA has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1st second guide RNA, to the polynucleotide sequence of 2nd second guide RNA, to the polynucleotide sequence of the 3rd second guide RNA, and to the polynucleotide sequence of the 5th second guide RNA.
271 . The method according to any one of the previous paragraphs, wherein the 5th second guide RNA has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1st second guide RNA, to the polynucleotide sequence of 2nd second guide RNA, to the polynucleotide sequence of the 3rd second guide RNA, and to the polynucleotide sequence of the 4th second guide RNA.
272. The method according to any one of the previous paragraphs, wherein the one or more second guide RNA comprises a 1st second guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 1st second guide RNA is complementary to a first protospacer sequence.
273. The method according to any one of the previous paragraphs, wherein the one or more second guide RNA comprises a 2nd second guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 2nd second guide RNA is complementary to a second protospacer sequence.
274. The method according to any one of the previous paragraphs, wherein the one or more second guide RNA comprises a 3rd second guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 3rd second guide RNA is complementary to a third protospacer sequence.
275. The method according to any one of the previous paragraphs, wherein the one or more second guide RNA comprises a 4th second guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 4th second guide RNA is complementary to a fourth protospacer sequence.
276. The method according to any one of the previous paragraphs, wherein the one or more second guide RNA comprises a 5th second guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 5th second guide RNA is complementary to a fifth protospacer sequence.
277. The method according to any one of the previous paragraphs, wherein the at least one guide RNA comprises a first guide RNA, a second guide RNA, and a third guide RNA, and wherein the method additionally comprises the step e) cultivating the host cell at a third temperature allowing one or more third guide RNA of the at least one guide RNA to form a complex with the nuclease at one or more protospacer of the one or more LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one or more LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break.
278. The method according to any one of the previous paragraphs, wherein the third guide RNA is complementary to one or more first protospacers, wherein during step e) the complex is formed at one or more first protospacer of the one or more LPP, creating a double-strand break at or near the one or more first protospacer.
279. The method according to any one of the previous paragraphs, wherein the third guide RNA is complementary to one or more second protospacers, wherein during step e) the complex is formed at one or more second protospacer of the one or more LPP, creating a double-strand break at or near the one or more second protospacer.
280. The method according to any one of the previous paragraphs, wherein the third guide RNA is complementary to one or more third protospacers, wherein during step e) the complex is formed at one or more third protospacer of the one or more LPP, creating a double-strand break at or near the one or more third protospacer.
281 . The method according to any one of the previous paragraphs, wherein the third guide RNA is complementary to one or more fourth protospacers, wherein during step e) the complex is formed at one or more fourth protospacer of the one or more LPP, creating a double-strand break at or near the one or more fourth protospacer.
282. The method according to any one of the previous paragraphs, wherein the third guide RNA is complementary to one or more fifth protospacers, wherein during step e) the complex is formed at one or
more fifth protospacer of the one or more LPP, creating a double-strand break at or near the one or more fifth protospacer.
283. The method according to any one of the previous paragraphs, wherein during step e) the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of one LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with at least one copy of the first donor polynucleotide in its genome.
284. The method according to any one of the previous paragraphs, wherein during step e) the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of two LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the two LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with at least two copies of the first donor polynucleotide in its genome.
285. The method according to any one of the previous paragraphs, wherein during step e) the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of three LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the three LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the doublestrand break to provide a host cell with at least three copies of the first donor polynucleotide in its genome.
286. The method according to any one of the previous paragraphs, wherein during step e) the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of four LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the four LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the doublestrand break to provide a host cell with at least four copies of the first donor polynucleotide in its genome.
287. The method according to any one of the previous paragraphs, wherein during step e) the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of five LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the five LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with at least five copies of the first donor polynucleotide in its genome.
288. The method according to any one of the previous paragraphs, wherein during step e) the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of six LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the six LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with at least six copies of the first donor polynucleotide in its genome.
289. The method according to any one of the previous paragraphs, wherein during step e) the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of 8 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 8 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with at least 8 copies of the first donor polynucleotide in its genome.
290. The method according to any one of the previous paragraphs, wherein during step e) the third temperature allows the one or more third guide RNA to form a complex at one or more protospacers of 16 LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the 16 LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break to provide a host cell with at least 16 copies of the first donor polynucleotide in its genome.
291 . The method according to any one of the previous paragraphs, wherein during step b) there are delivered at least two third guide RNAs, each third guide RNA forming a complex with the nuclease at a first, second, third, fourth, or fifth protospacer.
292. The method according to any one of the previous paragraphs, wherein the at least two third guide RNAs comprise at least two of: a 1st third guide RNA, a 2nd third guide RNA, a 3rd third guide RNA, a 4th third guide RNA, and a 5th third guide RNA.
293. The method according to any one of the previous paragraphs, wherein the 1st third guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 2nd third guide RNA, to the polynucleotide sequence of 3rd third guide RNA, to the polynucleotide sequence of the 4th third guide RNA, and to the polynucleotide sequence of the 5th third guide RNA.
294. The method according to any one of the previous paragraphs, wherein the 2nd third guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1st third guide RNA, to the polynucleotide sequence of 3rd third guide RNA, to the polynucleotide sequence of the 4th third guide RNA, and to the polynucleotide sequence of the 5th third guide RNA.
295. The method according to any one of the previous paragraphs, wherein the 3rd third guide RNA has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1st third guide RNA, to the polynucleotide sequence of 2nd third guide RNA, to the polynucleotide sequence of the 4th third guide RNA, and to the polynucleotide sequence of the 5th third guide RNA.
296. The method according to any one of the previous paragraphs, wherein the 4th third guide RNA has a polynucleotide sequence with a sequence identity of below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1st third guide RNA, to the polynucleotide sequence of 2nd third guide RNA, to the polynucleotide sequence of the 3rd third guide RNA, and to the polynucleotide sequence of the 5th third guide RNA.
297. The method according to any one of the previous paragraphs, wherein the 5th third guide RNA has a polynucleotide sequence with a sequence identity below 100%, such as below 99%, below 98%, below 97%, below 96%, below 95%, below 90%, below 85%, below 80%, below 75%, below 70%, below 65%, below 60%, below 55%, below 50%, below 45%, below 40%, below 35%, below 30%, below 25%, below 20%, below 15%, or below 10% to the polynucleotide sequence of the 1st third guide RNA, to the polynucleotide sequence of 2nd third guide RNA, to the polynucleotide sequence of the 3rd third guide RNA, and to the polynucleotide sequence of the 4th third guide RNA.
298. The method according to any one of the previous paragraphs, wherein the one or more third guide RNA comprises a 1st third guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 1st third guide RNA is complementary to a first protospacer sequence.
299. The method according to any one of the previous paragraphs, wherein the one or more third guide RNA comprises a 2nd third guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 2nd third guide RNA is complementary to a second protospacer sequence.
300. The method according to any one of the previous paragraphs, wherein the one or more third guide RNA comprises a 3rd third guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 3rd third guide RNA is complementary to a third protospacer sequence.
301 . The method according to any one of the previous paragraphs, wherein the one or more third guide RNA comprises a 4th third guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 4th third guide RNA is complementary to a fourth protospacer sequence.
302. The method according to any one of the previous paragraphs, wherein the one or more third guide RNA comprises a 5th third guide RNA, having a sequence complementary to a first protospacer sequence, a second protospacer sequence, a third protospacer sequence, a fourth protospacer sequence, or a fifth protospacer sequence, preferably the 5th third guide RNA is complementary to a fifth protospacer sequence.
303. The method according to any one of the previous paragraphs, wherein the at least one guide RNA comprises a first guide RNA, a second guide RNA, a third guide RNA, and a fourth guide RNA, and wherein the method additionally comprises the step f) cultivating the host cell at a fourth temperature allowing one or more fourth guide RNA of the at least one guide RNA to form a complex with the nuclease at one or more protospacer of the one or more LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one or more LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break.
304. The method according to any one of the previous paragraphs, wherein the at least one guide RNA comprises a first guide RNA, a second guide RNA, a third guide RNA, a fourth guide RNA, and a fifth guide RNA, and wherein the method additionally comprises the step g) cultivating the host cell at a fifth temperature allowing one or more fifth guide RNA of the at least one guide RNA to form a complex with the nuclease at one or more protospacer of the one or more LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one or more LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide or set of first PCR fragments at the double-strand break.
305. The method according to any one of the previous paragraphs, wherein the first temperature is in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
306. The method according to any one of the previous paragraphs, wherein the second temperature is in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
307. The method according to any one of the previous paragraphs, wherein the third temperature is in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
308. The method according to any one of the previous paragraphs, wherein the fourth temperature is in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
309. The method according to any one of the previous paragraphs, wherein the fifth temperature is in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
310. The method according to any one of the previous paragraphs, wherein the first temperature and the second temperature are in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
311 . The method according to any one of the previous paragraphs, wherein the first temperature, the second temperature, and the third temperature are in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
312. The method according to any one of the previous paragraphs, wherein the first temperature, the second temperature, the third temperature, and the fourth temperature are in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
313. The method according to any one of the previous paragraphs, wherein the first temperature, the second temperature, the third temperature, the fourth temperature, and the fifth temperature are in the range of 15 - 50 C°, such as in the range of 22 - 42 C°, such as in the range of 23 - 39 C°, such as in the range of 25 - 37 C°.
314. The method according to any one of the previous paragraphs, wherein the first temperature differs from the second temperature by at least 0.5 C°, at least 1 C°, at least 1 .5 C°, at least 2 C°, at least 2.5 C°, at least 3 C°, at least 3.5 C°, at least 4 C°, at least 4.5 C°, at least 5 C°, at least 5.5 C°, or at least 6 C°.
315. The method according to any one of the previous paragraphs, wherein the second temperature differs from the third temperature by at least 0.5 C°, at least 1 C°, at least 1 .5 C°, at least 2 C°, at least 2.5 C°, at least 3 C°, at least 3.5 C°, at least 4 C°, at least 4.5 C°, at least 5 C°, at least 5.5 C°, or at least 6 C°
316. The method according to any one of the previous paragraphs, wherein the third temperature differs from the fourth temperature by at least 0.5 C°, at least 1 C°, at least 1 .5 C°, at least 2 C°, at least 2.5 C°, at least 3 C°, at least 3.5 C°, at least 4 C°, at least 4.5 C°, at least 5 C°, at least 5.5 C°, or at least 6 C°
317. The method according to any one of the previous paragraphs, wherein the fourth temperature differs from the fifth temperature by at least 0.5 C°, at least 1 C°, at least 1 .5 C°, at least 2 C°, at least 2.5 C°, at least 3 C°, at least 3.5 C°, at least 4 C°, at least 4.5 C°, at least 5 C°, at least 5.5 C°, or at least 6 C°
318. The method according to any one of the previous paragraphs, wherein the first temperature is selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
319. The method according to any one of the previous paragraphs, wherein the second temperature is selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
320. The method according to any one of the previous paragraphs, wherein the third temperature is selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
321 . The method according to any one of the previous paragraphs, wherein the fourth temperature is selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°,
34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
322. The method according to any one of the previous paragraphs, wherein the fifth temperature is selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°,
35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
323. The method according to any one of the previous paragraphs, wherein the first temperature and the second temperature are selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°..
324. The method according to any one of the previous paragraphs, wherein the first temperature, the second temperature, and the third temperature are selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
325. The method according to any one of the previous paragraphs, wherein the first temperature, the second temperature, the third temperature, and the fourth temperature are selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
326. The method according to any one of the previous paragraphs, wherein the first temperature, the second temperature, the third temperature, the fourth temperature, and the fifth temperature are selected from the list of 22 C°, 23 C°, 24 C°, 25 C°, 26 C°, 27 C°, 28 C°, 29 C°, 30 C°, 31 C°, 32 C°, 33 C°, 34 C°, 35 C°, 36 C°, 37 C°, 38 C°, 39 C°, 40 C°, 41 C°, 42 C°, 43 C°, 44 C°, 45 C°, 46 C°, 47 C°, 48 C°, 49 C°, and 50 C°.
327. The method according to any one of the previous paragraphs, wherein the first temperature is in the range of 22 - 40 C°, and wherein the second temperature is in the range of 35 - 50 C°.
328. The method according to any one of the previous paragraphs, wherein the first donor polynucleotide does not comprise a selectable marker or counter-selectable marker.
329. The method according to any one of the previous paragraphs, wherein the first donor polynucleotide comprises a selectable or counter-selectable marker.
330. The method according to any one of the previous paragraphs, wherein during step b) there is furthermore delivered a second donor polynucleotide comprising in 5’ to 3’ direction: a 5’ homology arm, a second polynucleotide encoding a second polypeptide of interest, and a 3’ homology arm, wherein the 3’ homology arm is identical to the 3’ homology arm of the host cell, and wherein the 5’ homology arm is identical to the 5’ homology arm of the host cell.
331 . The method according to any one of the previous paragraphs, wherein the first polypeptide of interest comprises a brazzein, a casein, a patatin, an ovalbumin, an osteopontin, an ovotransferrin, an ovomucin,
an ovomucoid, an ovostatin, a glycomacropeptide, a lactoferrin, an alpha-lactalbumin, e.g., bovine alphalactalbumin, a beta-lactalbumin and/or a collagen.
332. The method according to any one of the previous paragraphs, wherein the first polypeptide of interest comprises a therapeutic polypeptide selected from the group consisting of an antibody, an antibody fragment, an antibody-based drug, a Fc fusion protein, an anticoagulant, a blood factor, a bone morphogenetic protein, an engineered protein scaffold, an enzyme, a growth factor, a blood clotting factor, a hormone, an interferon (such as an interferon alpha-2b), an interleukin, a lactoferrin, an alpha-lactalbumin, a beta-lactalbumin, an ovomucoid, an ovostatin, a cytokine, an obestatin, a human galactosidase (such as an human alpha-galactosidase A), a vaccine, a protein vaccine, and a thrombolytic.
333. The method according to any one of the previous paragraphs, wherein the first polypeptide of interest comprises an enzyme.
334. The method according to any one of the previous paragraphs, wherein the enzyme is selected from the list of a hydrolase, isomerase, ligase, lyase, oxidoreductase, or transferase, e.g., an aminopeptidase, amylase, carbohydrase, carboxypeptidase, catalase, cellobiohydrolase, cellulase, chitinase, cutinase, cyclodextrin glycosyltransferase, deoxyribonuclease, endoglucanase, esterase, alpha-galactosidase, beta-galactosidase, glucoamylase, alpha-glucosidase, beta-glucosidase, invertase, laccase, lipase, mannosidase, mutanase, oxidase, pectinolytic enzyme, peroxidase, phytase, polyphenoloxidase, proteolytic enzyme, ribonuclease, transglutaminase, xylanase, or beta-xylosidase.
335. The method according to any one of the previous paragraphs, wherein the second polypeptide of interest comprises a polypeptide selected from the list of a transcription factor (e.g. amyR), a chaperone (e.g. BipA, PdiA, or PdiB), a heme pathway enzyme (e.g. encoded by hemA, or hemB), or a glycosylation pathway enzyme (e.g. a Mannosidase I, a GIcNAc transferase, a galactosyltransferase, a sialyltransferase, an oligosaccharyltransferase, or an glucosidase).
336. The method according to any one of the previous paragraphs, wherein the method additionally comprises step h), subsequent of any of steps c), d), e), f) or g):
- providing a host cell comprising a first polynucleotide in one or more LPP, said host cell being generated with the method of any of the previous paragraphs, and said host cell comprising at least one LPP not comprising the first polynucleotide (= empty LPP),
- delivering to the host cell a RNA-guided DNA nuclease, and at least one guide RNA directed to one or more protospacer of one or more empty LPP, and
- cultivating the host cell at a temperature allowing the at least one guide RNA to form a complex with the nuclease at the one or more empty LPP, wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one or more empty LPP, wherein the homology directed repair comprises insertion of the first polynucleotide at the doublestrand break of the empty LPP.
337. A method of producing a polypeptide of interest, comprising cultivating a multi-copy host cell, said cell being generated by the method of any preceding paragraphs, under conditions conducive for production of the polypeptide.
338. The method of paragraph 337, further comprising recovering the polypeptide of interest.
Claims
1 . A host cell comprising in its genome at least two landing pad polynucleotide (LPP) sequences, each LPP sequence comprising in 5’ to 3’ direction: a 5’ homology arm, two or more protospacer sequences, and a 3’ homology arm, wherein the polynucleotide sequence of the 5’ homology arm is identical for at least two LPP sequences, and wherein the polynucleotide sequence of the 3’ homology arm is identical for at least two LPP sequences.
2. The host cell according to claim 1 , wherein at least one protospacer of the at least two protospacers is present in at least two LPP.
3. The host cell according to any one of the previous claims, wherein the polynucleotide sequence of the 5’ homology arm is identical for all of the at least two LPP sequences.
4. The host cell according to any one of the previous claims, wherein the polynucleotide sequence of the 3’ homology arm is identical for all of the at least two LPP sequences.
5. The host cell according to any one of the previous claims, wherein the host cell comprises three LPP sequences, four LPP sequences, 5 LPP sequences, 6 LPP sequences, 7 LPP sequences, 8 LPP sequences, 9 LPP sequences, 10 LPP sequences, 11 LPP sequences, 12 LPP sequences, 13 LPP sequences, 14 LPP sequences, 15 LPP sequences, 16 LPP sequences, or more than 16 LPP sequences.
6. The host cell according to any one of the previous claims, wherein at least one LPP is located on the coding strand (3’ to 5’ direction), and wherein at least one LPP is located on the template strand (5’ to 3’ direction).
7. The host cell according to any one of the previous claims, wherein the host cell is deficient of a non-homologous end joining (NHEJ) mechanism.
8. The host cell according to any one of the previous claims, wherein each 3’ homology arm and each 5’ homology arm has a length of at least 10 nt, at least 20 nt, at least 30 nt, at least 50 nt, at least 100 nt, 200 nucleotides (nt), at least 300 nt, at least 400 nt, at least 500 nt, at least 600 nt, at least 700 nt, at least 800 nt, at least 900 nt, or at least 1000 nt.
9. The host cell according to any one of the previous claims, wherein each of the at least two LPP comprises a first protospacer sequence with a polynucleotide sequence identical for each first protospacer of each at least two LPP.
10. The host cell according to any one of the previous claims, wherein each of the at least two LPP comprises a second protospacer sequence with a polynucleotide sequence which is unique for each second protospacer of each at least two LPP.
11 . The host cell according to any one of the previous claims, wherein the host cell is a fungal cell.
12. The host cell according to any one of the previous claims, wherein the host cell is a yeast recombinant host cell, e.g., a Candida, Hansenula, Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia cell, such as a Kluyveromyces lactis, Saccharomyces carlsbergensis, Saccharomyces cerevisiae, Saccharomyces diastaticus, Saccharomyces douglasii, Saccharomyces kluyveri, Saccharomyces norbensis, Saccharomyces oviformis, or Yarrowia lipolytica cell.
13. The host cell according to any one of the previous claims, wherein the host cell is a filamentous fungal recombinant host cell, e.g., an Acremonium, Aspergillus, Aureobasidium, Bjerkandera, Ceriporiopsis, Chrysosporium, Coprinus, Coriolus, Cryptococcus, Filibasidium, Fusarium, Humicola, Magnaporthe, Mucor, Myceliophthora, Neocallimastix, Neurospora, Paecilomyces, Penicillium, Phanerochaete, Phlebia, Piromyces, Pleurotus, Schizophyllum, Talaromyces, Thermoascus, Thielavia, Tolypocladium, Trametes, or Trichoderma cell, in particular, an Aspergillus awamori, Aspergillus foetidus, Aspergillus fumigatus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus niger, Aspergillus oryzae, Bjerkandera adusta, Ceriporiopsis aneirina, Ceriporiopsis caregiea, Ceriporiopsis gilvescens, Ceriporiopsis pannocinta, Ceriporiopsis rivulosa, Ceriporiopsis subrufa, Ceriporiopsis subvermispora, Chrysosporium inops, Chrysosporium keratinophilum, Chrysosporium lucknowense, Chrysosporium merdarium, Chrysosporium pannicola, Chrysosporium queenslandicum, Chrysosporium tropicum, Chrysosporium zonatum, Coprinus cinereus, Coriolus hirsutus, Fusarium bactridioides, Fusarium cerealis, Fusarium crookwellense, Fusarium culmorum, Fusarium graminearum, Fusarium graminum, Fusarium heterosporum, Fusarium negundi, Fusarium oxysporum, Fusarium reticulatum, Fusarium roseum, Fusarium sambucinum, Fusarium sarcochroum, Fusarium sporotrichioides, Fusarium sulphureum, Fusarium torulosum, Fusarium trichothecioides, Fusarium venenatum, Humicola insolens, Humicola lanuginosa, Mucor miehei, Myceliophthora thermophila, Neurospora crassa, Penicillium purpurogenum, Phanerochaete chrysosporium, Phlebia radiata, Pleurotus eryngii, Talaromyces emersonii, Thielavia terrestris, Trametes villosa, Trametes versicolor, Trichoderma harzianum, Trichoderma koningii, Trichoderma longibrachiatum, Trichoderma reesei, or Trichoderma viride cell, preferably the host cell is an Aspergillus cell, more preferably the host cell is an Aspergillus niger cell, or an Aspergillus oryzae cell.
14. The host cell according to any one of claims 1 to 10, wherein the host cell is a prokaryotic recombinant host cell, e.g., a Gram-positive cell selected from the group consisting of Bacillus, Clostridium, Enterococcus, Geobacillus, Lactobacillus, Lactococcus, Oceanobacillus, Staphylococcus, Streptococcus, or Streptomyces cells, or a Gram-negative bacteria selected from the group consisting of Campylobacter, E. coll, Flavobacterium, Fusobacterium, Helicobacter, llyobacter, Neisseria, Pseudomonas, Salmonella, and Ureaplasma cells, such as Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, Bacillus firmus, Bacillus lautus, Bacillus lentus, Bacillus lichen! formis, Bacillus megaterium, Bacillus pumilus, Bacillus stearothermophilus, Bacillus subtilis, Bacillus thuringiensis, Streptococcus equisimilis, Streptococcus pyogenes, Streptococcus uberis, and Streptococcus equi subsp. Zooepidemicus, Streptomyces achromogenes, Streptomyces avermitilis, Streptomyces coelicolor, Streptomyces griseus, and Streptomyces lividans cells, preferably the host cell is a Bacillus cell, more preferably a Bacillus licheniformis cell or a Bacillus subtilis cell.
15. The host cell according to any one of claims 1 to 10, wherein the host cell is a mammalian cell.
16. A method for generating a multi-copy host cell, comprising the steps of a) providing a host cell according to any one of claim 1 to 15, b) delivering to the host cell a gene editing system comprising:
- a RNA-guided DNA nuclease,
- at least one guide RNA directed to one or more protospacer of the one or more LPP sequences,
- a first donor polynucleotide comprising in 5’ to 3’ direction: a 5’ homology arm, a first polynucleotide encoding a first polypeptide of interest, and a 3’ homology arm, OR a set of first PCR fragments comprising a first 5’ homology arm, a first 3’ homology arm, and a first polynucleotide encoding a first polypetide of interest, wherein the 3’ homology arm and/or first 3’ homology arm is identical to the 3’ homology arm of one or more LPP and wherein the 5’ homology arm and/or first 5’ homology arm is identical to the 5’ homology arm of one or more LPP, and c) cultivating the host cell at a first temperature allowing one or more first guide RNA of the at least one guide RNA to form a complex with the nuclease at one or more protospacer of one or more LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one or more LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide and/or set of first PCR fragments at the double-strand break.
17. The method according to claim 16, wherein the at least one guide RNA comprises a first guide RNA and a second guide RNA, and wherein the method additionally comprises step d): d) cultivating the host cell at a second temperature allowing one or more second guide RNA of the at least one guide RNA to form a complex with the nuclease at one or more protospacer of the one or more LPP, and wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one or more LPP, wherein the homology directed repair comprises insertion of the first donor polynucleotide at the double-strand break.
18. The method according to any one of claims 16 to 17, wherein the first temperature differs from the second temperature by at least 0.5 C°, at least 1 C°, at least 1.5 C°, at least 2 C°, at least 2.5 C°, at least 3 C°, at least 3.5 C°, at least 4 C°, at least 4.5 C°, at least 5 C°, at least 5.5 C°, or at least 6 C°.
19. The method according to any one of claims 16 to 18, wherein the method, subsequent of any of steps c) or d), additionally comprises step h):
- providing a host cell comprising a first polynucleotide in one or more LPP, said host cell being generated with the method of any of claims 16 to 18, and said host cell comprising at least one LPP not comprising the first polynucleotide (= empty LPP),
- delivering to the host cell a RNA-guided DNA nuclease, and at least one guide RNA directed to one or more protospacer of one or more empty LPP, and
- cultivating the host cell at a temperature allowing the at least one guide RNA to form a complex with the nuclease at the one or more empty LPP, wherein the complex promotes the homology directed repair by creating a double-strand break at or near the one or more protospacer sequence of the one or more empty LPP, wherein the homology directed repair comprises insertion of the first polynucleotide at the doublestrand break of the empty LPP.
20. A method of producing a polypeptide of interest, comprising cultivating a multi-copy host cell under conditions conducive for production of the polypeptide, which multi-copy host cell is generated by the method according to any one of claims 16 to 19.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP23169124.7 | 2023-04-21 | ||
EP23169124 | 2023-04-21 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2024218234A1 true WO2024218234A1 (en) | 2024-10-24 |
Family
ID=86282297
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/EP2024/060591 WO2024218234A1 (en) | 2023-04-21 | 2024-04-18 | Generation of multi-copy host cells |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2024218234A1 (en) |
Citations (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0238023A2 (en) | 1986-03-17 | 1987-09-23 | Novo Nordisk A/S | Process for the production of protein products in Aspergillus oryzae and a promoter for use in Aspergillus |
WO1994025612A2 (en) | 1993-05-05 | 1994-11-10 | Institut Pasteur | Nucleotide sequences for the control of the expression of dna sequences in a cellular host |
WO1995033836A1 (en) | 1994-06-03 | 1995-12-14 | Novo Nordisk Biotech, Inc. | Phosphonyldipeptides useful in the treatment of cardiovascular diseases |
EP2527448A1 (en) * | 2011-05-23 | 2012-11-28 | Novozymes A/S | Simultaneous site-specific integrations of multiple gene-copies in filamentous fungi |
WO2016145084A1 (en) | 2015-03-09 | 2016-09-15 | Novozymes A/S | Methods of introducing multiple expression constructs into a eukaryotic cell |
WO2017144177A1 (en) | 2016-02-26 | 2017-08-31 | Keskin Hüseyin | Driving and/or flight simulator |
US10041092B2 (en) * | 2013-12-19 | 2018-08-07 | Amyris, Inc. | Methods for genomic integration |
WO2020025357A1 (en) | 2018-08-02 | 2020-02-06 | Novozymes A/S | Preparation of combinatorial libraries of dna constructs using introns |
WO2020112911A1 (en) | 2018-11-28 | 2020-06-04 | Novozymes A/S | Improved filamentous fungal host cells |
WO2020123845A1 (en) | 2018-12-12 | 2020-06-18 | Novozymes A/S | Methods for increasing the productivity of a filamentous fungal cell in the production of a polypeptide |
EP3242950B1 (en) * | 2015-01-06 | 2021-10-06 | DSM IP Assets B.V. | A crispr-cas system for a filamentous fungal host cell |
WO2021202479A1 (en) | 2020-04-03 | 2021-10-07 | Novozymes A/S | Submerged fermentation process |
WO2022127892A1 (en) | 2020-12-17 | 2022-06-23 | Novozymes A/S | Improved fiber-wash in corn wet-milling |
US20220251606A1 (en) * | 2016-05-18 | 2022-08-11 | Amyris, Inc. | Compositions and methods for genomic integration of nucleic acids into exogenous landing pads |
-
2024
- 2024-04-18 WO PCT/EP2024/060591 patent/WO2024218234A1/en unknown
Patent Citations (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0238023A2 (en) | 1986-03-17 | 1987-09-23 | Novo Nordisk A/S | Process for the production of protein products in Aspergillus oryzae and a promoter for use in Aspergillus |
WO1994025612A2 (en) | 1993-05-05 | 1994-11-10 | Institut Pasteur | Nucleotide sequences for the control of the expression of dna sequences in a cellular host |
WO1995033836A1 (en) | 1994-06-03 | 1995-12-14 | Novo Nordisk Biotech, Inc. | Phosphonyldipeptides useful in the treatment of cardiovascular diseases |
EP2527448A1 (en) * | 2011-05-23 | 2012-11-28 | Novozymes A/S | Simultaneous site-specific integrations of multiple gene-copies in filamentous fungi |
US20140120625A1 (en) * | 2011-05-23 | 2014-05-01 | Novozymes A/S | Simultaneous Site-Specific Integrations of Multiple Gene-Copies in Filamentous Fungi |
US10041092B2 (en) * | 2013-12-19 | 2018-08-07 | Amyris, Inc. | Methods for genomic integration |
EP3242950B1 (en) * | 2015-01-06 | 2021-10-06 | DSM IP Assets B.V. | A crispr-cas system for a filamentous fungal host cell |
WO2016145084A1 (en) | 2015-03-09 | 2016-09-15 | Novozymes A/S | Methods of introducing multiple expression constructs into a eukaryotic cell |
WO2017144177A1 (en) | 2016-02-26 | 2017-08-31 | Keskin Hüseyin | Driving and/or flight simulator |
US20220251606A1 (en) * | 2016-05-18 | 2022-08-11 | Amyris, Inc. | Compositions and methods for genomic integration of nucleic acids into exogenous landing pads |
WO2020025357A1 (en) | 2018-08-02 | 2020-02-06 | Novozymes A/S | Preparation of combinatorial libraries of dna constructs using introns |
WO2020112911A1 (en) | 2018-11-28 | 2020-06-04 | Novozymes A/S | Improved filamentous fungal host cells |
WO2020123845A1 (en) | 2018-12-12 | 2020-06-18 | Novozymes A/S | Methods for increasing the productivity of a filamentous fungal cell in the production of a polypeptide |
WO2021202479A1 (en) | 2020-04-03 | 2021-10-07 | Novozymes A/S | Submerged fermentation process |
WO2022127892A1 (en) | 2020-12-17 | 2022-06-23 | Novozymes A/S | Improved fiber-wash in corn wet-milling |
Non-Patent Citations (48)
Title |
---|
"Biology and Activities of Yeast", 1980, article "Soc. App. Bacteriol. Symposium Series" |
"Current protocols in Molecular Biology", 1995, JOHN WILEY AND SONS |
"Molecular Biological Methods for Bacillus", 1990, JOHN WILEY AND SONS |
BALLEZA ET AL., FEMS MICROBIOL. REV, vol. 33, no. 1, 2009, pages 133 - 151 |
BURKE ET AL., PROC. NATL. ACAD. SCI. USA, vol. 98, 2001, pages 6289 - 6294 |
CARTER ET AL., PROTEINS: STRUCTURE, FUNCTION, AND GENETICS, vol. 6, 1989, pages 240 - 248 |
CHOI ET AL., J. MICROBIOL. METHODS, vol. 64, 2006, pages 391 - 397 |
CHRISTENSEN ET AL., BIOLTECHNOLOGY, vol. 6, 1988, pages 1419 - 1422 |
COLLINS-RACIE ET AL., BIOTECHNOLOGY, vol. 13, 1995, pages 982 - 987 |
CONTRERAS ET AL., BIOTECHNOLOGY, vol. 9, 1991, pages 378 - 381 |
COOPER ET AL., EMBO J., vol. 12, 1993, pages 2575 - 2583 |
DAWSON ET AL., SCIENCE, vol. 266, 1994, pages 776 - 779 |
DONALD ET AL., J. BACTERIOL, vol. 195, no. 11, 2013, pages 2612 - 2620 |
EATON ET AL., BIOCHEMISTRY, vol. 25, 1986, pages 505 - 512 |
FORD ET AL., PROTEIN EXPRESSION AND PURIFICATION, vol. 2, 1991, pages 95 - 107 |
FREUDL, MICROBIAL CELL FACTORIES, vol. 17, 2018, pages 52 |
GEISBERG ET AL., CELL, vol. 156, no. 4, 2014, pages 812 - 824 |
GUOSHERMAN, MOL. CELLULAR BIOL, vol. 15, 1995, pages 5983 - 5990 |
HAMBRAEUS ET AL., MICROBIOLOGY, vol. 146, no. 12, 2000, pages 3051 - 3059 |
HEINZE ET AL., BMC MICROBIOLOGY, vol. 18, 2018, pages 56 |
HUE ET AL., J. BACTERIOL, vol. 177, 1995, pages 3465 - 3471 |
JUNG GIYOUNG: "Engineering mammalian cell line for N-linked glycosylation control", PHD THESIS, 1 February 2021 (2021-02-01), pages 1 - 109, XP055964032, Retrieved from the Internet <URL:https://dspace.mit.edu/bitstream/handle/1721.1/130720/1251803194-MIT.pdf?sequence=1&isAllowed=y> [retrieved on 20220922] * |
KABERDINBLASI, FEMS MICROBIOL. REV, vol. 30, no. 6, 2006, pages 967 - 979 |
LABROU, PROTEIN DOWNSTREAM PROCESSING, vol. 1129, 2014, pages 3 - 10 |
LI ET AL., MICROBIAL CELL FACTORIES, vol. 16, 2017, pages 168 |
LUBERTOZZIKEASLING, BIOTECHN. ADVANCES, vol. 27, 2009, pages 53 - 75 |
MARTIN ET AL., J. IND. MICROBIOL. BIOTECHNOL, vol. 3, 2003, pages 568 - 576 |
MOROZOV ET AL., EUKARYOTIC CELL, vol. 5, no. 11, 2006, pages 1838 - 1846 |
MUKHERJEE ET AL., TRICHODERMA: BIOLOGY AND APPLICATIONS, 2013 |
NEEDLEMANWUNSCH, J. MOL. BIOL, vol. 48, 1970, pages 443 - 453 |
PATELGUPTA, INT. J. SYST. EVOL. MICROBIOL, vol. 70, 2020, pages 406 - 438 |
PENTTILA ET AL., GENE, vol. 61, 1987, pages 155 - 164 |
RASMUSSEN-WILSON ET AL., APPL. ENVIRON. MICROBIOL, vol. 63, 1997, pages 3488 - 3493 |
RICE ET AL.: "Trends Genet", vol. 16, 2000, article "EMBOSS: The European Molecular Biology Open Software Suite", pages: 276 - 277 |
ROCHELLE AW ET AL: "Can too many copies spoil the broth?", MICROBIAL CELL FACTORIES, SPRINGER, vol. 12, no. 1, 20 December 2013 (2013-12-20), pages 128, XP021171256, ISSN: 1475-2859, DOI: 10.1186/1475-2859-12-128 * |
ROMANOS ET AL., YEAST, vol. 8, 1992, pages 423 - 488 |
SAMBROOK ET AL.: "Molecular cloning: A laboratory 20 manual", 1989, COLD SPRING HARBOR LAB. |
SCHMOLLDATTENBOCK: "Gene Expression Systems in Fungi: Advancements and Applications", FUNGAL BIOLOGY, 2016 |
SESHASAYEE ET AL., SUBCELLULAR BIOCHEMISTRY, vol. 52, 2011, pages 7 - 23 |
SHIN SEUNGHYEON ET AL: "Streamlined Human Cell-Based Recombinase-Mediated Cassette Exchange Platform Enables Multigene Expression for the Production of Therapeutic Proteins", ACS SYNTHETIC BIOLOGY, vol. 10, no. 7, 16 June 2021 (2021-06-16), Washington DC ,USA, pages 1715 - 1727, XP093019139, ISSN: 2161-5063, DOI: 10.1021/acssynbio.1c00113 * |
SMOLKE ET AL.: "Synthetic Biology: Parts, Devices and Applications", 2018, article "Constitutive and Regulated Promoters in Yeast: How to Design and Make Use of Promoters in S. cerevisiae" |
SONG ET AL., PLOS ONE, vol. 11, no. 7, 2016, pages e0158447 |
STEVENS, DRUG DISCOVERY WORLD, vol. 4, 2003, pages 35 - 48 |
SVETINA ET AL., J. BIOTECHNOL, vol. 76, 2000, pages 245 - 251 |
THOMAS E KUHLMAN AND EDWARD C COX: "Site-specific chromosomal integration of large synthetic constructs", NUCLEIC ACIDS RESEARCH, OXFORD UNIVERSITY PRESS, GB, vol. 38, no. 6, 1 April 2010 (2010-04-01), pages 1 - 10, XP002666135, ISSN: 1362-4962, [retrieved on 20100104], DOI: 10.1093/NAR/GKP1193 * |
WINGFIELD, CURRENT PROTOCOLS IN PROTEIN SCIENCE, vol. 80, no. 1, 2015, pages 1 - 35 |
XU ET AL., BIOTECHNOLOGY LETTERS, vol. 40, 2018, pages 949 - 955 |
YELTON ET AL., PROC. NATL. ACAD. SCI. USA, vol. 81, pages 1470 - 1474 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20190185847A1 (en) | Improving a Microorganism by CRISPR-Inhibition | |
WO2017191210A1 (en) | Genome editing by crispr-cas9 in filamentous fungal host cells | |
US20160304905A1 (en) | Fungal Gene Library By Double Split-Marker Integration | |
WO2018050666A1 (en) | Genomic integration of dna fragments in fungal host cells | |
US20170114091A1 (en) | Resolubilization of protein crystals at low ph | |
US11046736B2 (en) | Filamentous fungal host | |
US20170313997A1 (en) | Filamentous Fungal Double-Mutant Host Cells | |
WO2015025055A1 (en) | Regulated pepc expression | |
CA3121271A1 (en) | Modified filamentous fungal host cells | |
US20220025422A1 (en) | Improved Filamentous Fungal Host Cells | |
WO2020092608A1 (en) | Genome editing by guided endonuclease and single-stranded oligonucleotide | |
US20150307871A1 (en) | Method for generating site-specific mutations in filamentous fungi | |
US20220267783A1 (en) | Filamentous fungal expression system | |
WO2018167153A1 (en) | Improved filamentous fungal host cell | |
WO2024218234A1 (en) | Generation of multi-copy host cells | |
EP3728583B1 (en) | Counter-selection by inhibition of conditionally essential genes | |
EP3990629A1 (en) | Counter-selection by inhibition of conditionally essential genes | |
WO2020173817A1 (en) | Calcite binding proteins | |
WO2024120767A1 (en) | Modified rna polymerase activities | |
US20230407273A1 (en) | Glycosyltransferase variants for improved protein production | |
WO2022090555A1 (en) | Leader peptides and polynucleotides encoding the same | |
WO2024056643A1 (en) | Fungal signal peptides |