CA2270289C - Proteins with enhanced levels of essential amino acids - Google Patents
Proteins with enhanced levels of essential amino acids Download PDFInfo
- Publication number
- CA2270289C CA2270289C CA002270289A CA2270289A CA2270289C CA 2270289 C CA2270289 C CA 2270289C CA 002270289 A CA002270289 A CA 002270289A CA 2270289 A CA2270289 A CA 2270289A CA 2270289 C CA2270289 C CA 2270289C
- Authority
- CA
- Canada
- Prior art keywords
- seq
- sequence
- modified variant
- val
- amino acid
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 239000003797 essential amino acid Substances 0.000 title claims abstract description 70
- 235000020776 essential amino acid Nutrition 0.000 title claims abstract description 70
- 108090000623 proteins and genes Proteins 0.000 title claims description 233
- 102000004169 proteins and genes Human genes 0.000 title claims description 181
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 188
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 173
- 229920001184 polypeptide Polymers 0.000 claims abstract description 169
- 230000014509 gene expression Effects 0.000 claims abstract description 63
- 235000016709 nutrition Nutrition 0.000 claims abstract description 17
- 230000001965 increasing effect Effects 0.000 claims abstract description 14
- 241000196324 Embryophyta Species 0.000 claims description 210
- 150000007523 nucleic acids Chemical class 0.000 claims description 197
- 102000039446 nucleic acids Human genes 0.000 claims description 177
- 108020004707 nucleic acids Proteins 0.000 claims description 177
- 235000018102 proteins Nutrition 0.000 claims description 163
- 108091033319 polynucleotide Proteins 0.000 claims description 117
- 102000040430 polynucleotide Human genes 0.000 claims description 117
- 239000002157 polynucleotide Substances 0.000 claims description 117
- 235000001014 amino acid Nutrition 0.000 claims description 102
- 229940024606 amino acid Drugs 0.000 claims description 95
- 150000001413 amino acids Chemical class 0.000 claims description 93
- 238000000034 method Methods 0.000 claims description 87
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 claims description 60
- 239000004472 Lysine Substances 0.000 claims description 56
- 125000000539 amino acid group Chemical group 0.000 claims description 42
- 240000008042 Zea mays Species 0.000 claims description 41
- 235000002017 Zea mays subsp mays Nutrition 0.000 claims description 30
- 238000003259 recombinant expression Methods 0.000 claims description 30
- 238000009396 hybridization Methods 0.000 claims description 29
- 239000000203 mixture Substances 0.000 claims description 24
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 claims description 23
- 235000009973 maize Nutrition 0.000 claims description 23
- 229930182817 methionine Natural products 0.000 claims description 21
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 claims description 18
- 241001465754 Metazoa Species 0.000 claims description 18
- 235000007340 Hordeum vulgare Nutrition 0.000 claims description 16
- 230000000295 complement effect Effects 0.000 claims description 16
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 claims description 15
- 239000004473 Threonine Substances 0.000 claims description 11
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 claims description 11
- 125000000151 cysteine group Chemical class N[C@@H](CS)C(=O)* 0.000 claims description 11
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 claims description 10
- 244000068988 Glycine max Species 0.000 claims description 9
- 235000010469 Glycine max Nutrition 0.000 claims description 9
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 claims description 9
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 claims description 9
- 235000018417 cysteine Nutrition 0.000 claims description 9
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 claims description 8
- 235000011684 Sorghum saccharatum Nutrition 0.000 claims description 7
- 238000002360 preparation method Methods 0.000 claims description 7
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 claims description 6
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 claims description 6
- 235000004977 Brassica sinapistrum Nutrition 0.000 claims description 5
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 claims description 5
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 claims description 5
- 235000021307 Triticum Nutrition 0.000 claims description 5
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 claims description 5
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 claims description 5
- 229960000310 isoleucine Drugs 0.000 claims description 5
- 150000003839 salts Chemical class 0.000 claims description 5
- 235000014698 Brassica juncea var multisecta Nutrition 0.000 claims description 4
- 235000006008 Brassica napus var napus Nutrition 0.000 claims description 4
- 235000006618 Brassica rapa subsp oleifera Nutrition 0.000 claims description 4
- 244000020551 Helianthus annuus Species 0.000 claims description 4
- 235000003222 Helianthus annuus Nutrition 0.000 claims description 4
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 claims description 4
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 claims description 4
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 claims description 4
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 claims description 4
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 claims description 4
- 235000017587 Medicago sativa ssp. sativa Nutrition 0.000 claims description 4
- 235000007164 Oryza sativa Nutrition 0.000 claims description 4
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 claims description 4
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 claims description 4
- 235000009566 rice Nutrition 0.000 claims description 4
- 239000004474 valine Substances 0.000 claims description 4
- 239000004475 Arginine Substances 0.000 claims description 3
- 239000004471 Glycine Substances 0.000 claims description 3
- 235000007688 Lycopersicon esculentum Nutrition 0.000 claims description 3
- 244000061176 Nicotiana tabacum Species 0.000 claims description 3
- 235000002637 Nicotiana tabacum Nutrition 0.000 claims description 3
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 claims description 3
- 238000003776 cleavage reaction Methods 0.000 claims description 3
- 235000013922 glutamic acid Nutrition 0.000 claims description 3
- 239000004220 glutamic acid Substances 0.000 claims description 3
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 claims description 3
- 230000001172 regenerating effect Effects 0.000 claims description 3
- 230000007017 scission Effects 0.000 claims description 3
- 238000005406 washing Methods 0.000 claims description 2
- 240000000385 Brassica napus var. napus Species 0.000 claims 2
- 240000005979 Hordeum vulgare Species 0.000 claims 2
- 240000004658 Medicago sativa Species 0.000 claims 2
- 240000007594 Oryza sativa Species 0.000 claims 2
- 240000003768 Solanum lycopersicum Species 0.000 claims 2
- 240000006394 Sorghum bicolor Species 0.000 claims 2
- 244000098338 Triticum aestivum Species 0.000 claims 2
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 claims 1
- 108010076504 Protein Sorting Signals Proteins 0.000 claims 1
- 108091005804 Peptidases Proteins 0.000 abstract description 34
- 239000004365 Protease Substances 0.000 abstract description 33
- 125000003729 nucleotide group Chemical group 0.000 abstract description 23
- 239000002773 nucleotide Substances 0.000 abstract description 22
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 abstract 1
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 158
- 229940124158 Protease/peptidase inhibitor Drugs 0.000 description 148
- 210000004027 cell Anatomy 0.000 description 80
- 108020004414 DNA Proteins 0.000 description 45
- 239000002299 complementary DNA Substances 0.000 description 40
- 108091006086 inhibitor proteins Proteins 0.000 description 39
- 238000006467 substitution reaction Methods 0.000 description 37
- 230000009261 transgenic effect Effects 0.000 description 35
- 102000035195 Peptidases Human genes 0.000 description 33
- 235000019419 proteases Nutrition 0.000 description 32
- 239000013615 primer Substances 0.000 description 31
- 210000001519 tissue Anatomy 0.000 description 27
- 230000000694 effects Effects 0.000 description 26
- 239000012634 fragment Substances 0.000 description 23
- 230000005764 inhibitory process Effects 0.000 description 23
- 239000000523 sample Substances 0.000 description 23
- 238000003752 polymerase chain reaction Methods 0.000 description 21
- 108020004705 Codon Proteins 0.000 description 20
- 239000013598 vector Substances 0.000 description 19
- 125000003275 alpha amino acid group Chemical group 0.000 description 18
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 17
- 108091028043 Nucleic acid sequence Proteins 0.000 description 17
- 230000029087 digestion Effects 0.000 description 17
- 239000012530 fluid Substances 0.000 description 17
- 239000000047 product Substances 0.000 description 17
- 239000001488 sodium phosphate Substances 0.000 description 17
- 229910000162 sodium phosphate Inorganic materials 0.000 description 17
- RYFMWSXOAZQYPI-UHFFFAOYSA-K trisodium phosphate Chemical compound [Na+].[Na+].[Na+].[O-]P([O-])([O-])=O RYFMWSXOAZQYPI-UHFFFAOYSA-K 0.000 description 17
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 16
- 108090000317 Chymotrypsin Proteins 0.000 description 16
- 241000209219 Hordeum Species 0.000 description 15
- 229960002376 chymotrypsin Drugs 0.000 description 15
- 230000009466 transformation Effects 0.000 description 15
- 238000002474 experimental method Methods 0.000 description 14
- 238000013518 transcription Methods 0.000 description 14
- 230000035897 transcription Effects 0.000 description 14
- 108091026890 Coding region Proteins 0.000 description 13
- 108010092854 aspartyllysine Proteins 0.000 description 13
- 239000000758 substrate Substances 0.000 description 13
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 12
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 12
- 230000003321 amplification Effects 0.000 description 12
- 238000003556 assay Methods 0.000 description 12
- 230000015572 biosynthetic process Effects 0.000 description 12
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 12
- 230000004048 modification Effects 0.000 description 12
- 238000012986 modification Methods 0.000 description 12
- 238000003199 nucleic acid amplification method Methods 0.000 description 12
- CFBILACNYSPRPM-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;2-[[1,3-dihydroxy-2-(hydroxymethyl)propan-2-yl]amino]acetic acid Chemical compound OCC(N)(CO)CO.OCC(CO)(CO)NCC(O)=O CFBILACNYSPRPM-UHFFFAOYSA-N 0.000 description 11
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 11
- 230000000875 corresponding effect Effects 0.000 description 11
- 230000002401 inhibitory effect Effects 0.000 description 11
- 239000003550 marker Substances 0.000 description 11
- 239000000463 material Substances 0.000 description 11
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 10
- 235000007244 Zea mays Nutrition 0.000 description 10
- 238000007792 addition Methods 0.000 description 10
- 210000000349 chromosome Anatomy 0.000 description 10
- 239000000499 gel Substances 0.000 description 10
- 241000894007 species Species 0.000 description 10
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 9
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 9
- 108090000631 Trypsin Proteins 0.000 description 9
- 102000004142 Trypsin Human genes 0.000 description 9
- 238000006243 chemical reaction Methods 0.000 description 9
- 239000003112 inhibitor Substances 0.000 description 9
- 108010017391 lysylvaline Proteins 0.000 description 9
- 239000012588 trypsin Substances 0.000 description 9
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 8
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 8
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 8
- 239000002253 acid Substances 0.000 description 8
- 238000004458 analytical method Methods 0.000 description 8
- 230000027455 binding Effects 0.000 description 8
- 150000001875 compounds Chemical class 0.000 description 8
- 230000002496 gastric effect Effects 0.000 description 8
- ZNJFBWYDHIGLCU-HWKXXFMVSA-N jasmonic acid Chemical compound CC\C=C/C[C@@H]1[C@@H](CC(O)=O)CCC1=O ZNJFBWYDHIGLCU-HWKXXFMVSA-N 0.000 description 8
- 108020004999 messenger RNA Proteins 0.000 description 8
- 230000002829 reductive effect Effects 0.000 description 8
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 8
- HRPVXLWXLXDGHG-UHFFFAOYSA-N Acrylamide Chemical compound NC(=O)C=C HRPVXLWXLXDGHG-UHFFFAOYSA-N 0.000 description 7
- 229940122644 Chymotrypsin inhibitor Drugs 0.000 description 7
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 7
- 241000588724 Escherichia coli Species 0.000 description 7
- 241000880493 Leptailurus serval Species 0.000 description 7
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 7
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 7
- 230000000692 anti-sense effect Effects 0.000 description 7
- 239000003541 chymotrypsin inhibitor Substances 0.000 description 7
- 235000005822 corn Nutrition 0.000 description 7
- 238000012217 deletion Methods 0.000 description 7
- 230000037430 deletion Effects 0.000 description 7
- 230000007613 environmental effect Effects 0.000 description 7
- 239000013604 expression vector Substances 0.000 description 7
- 238000005755 formation reaction Methods 0.000 description 7
- 230000002068 genetic effect Effects 0.000 description 7
- 239000004009 herbicide Substances 0.000 description 7
- 230000001939 inductive effect Effects 0.000 description 7
- 108010070643 prolylglutamic acid Proteins 0.000 description 7
- 230000008929 regeneration Effects 0.000 description 7
- 238000011069 regeneration method Methods 0.000 description 7
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 7
- 238000001890 transfection Methods 0.000 description 7
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 6
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 6
- 102000004190 Enzymes Human genes 0.000 description 6
- 108090000790 Enzymes Proteins 0.000 description 6
- VYOILACOFPPNQH-UMNHJUIQSA-N Gln-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N VYOILACOFPPNQH-UMNHJUIQSA-N 0.000 description 6
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 6
- OLTHVCNYJAALPL-BHYGNILZSA-N Glu-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CCC(=O)O)N)C(=O)O OLTHVCNYJAALPL-BHYGNILZSA-N 0.000 description 6
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 6
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 6
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 6
- 244000062793 Sorghum vulgare Species 0.000 description 6
- VULNJDORNLBPNG-SWRJLBSHSA-N Thr-Glu-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VULNJDORNLBPNG-SWRJLBSHSA-N 0.000 description 6
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 6
- 230000004075 alteration Effects 0.000 description 6
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 6
- 229940042399 direct acting antivirals protease inhibitors Drugs 0.000 description 6
- 229940088598 enzyme Drugs 0.000 description 6
- 230000006870 function Effects 0.000 description 6
- 230000000968 intestinal effect Effects 0.000 description 6
- 238000002955 isolation Methods 0.000 description 6
- 108010034529 leucyl-lysine Proteins 0.000 description 6
- 230000035772 mutation Effects 0.000 description 6
- 210000000056 organ Anatomy 0.000 description 6
- 238000012163 sequencing technique Methods 0.000 description 6
- 230000014616 translation Effects 0.000 description 6
- 241000589158 Agrobacterium Species 0.000 description 5
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 5
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 5
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 5
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 5
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 5
- 241000894006 Bacteria Species 0.000 description 5
- 101710137926 Chymotrypsin inhibitor Proteins 0.000 description 5
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 5
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 5
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 5
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 5
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 5
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 5
- FGMHXLULNHTPID-KKUMJFAQSA-N Lys-His-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CN=CN1 FGMHXLULNHTPID-KKUMJFAQSA-N 0.000 description 5
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 5
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 5
- 108091034117 Oligonucleotide Proteins 0.000 description 5
- 102000057297 Pepsin A Human genes 0.000 description 5
- 108090000284 Pepsin A Proteins 0.000 description 5
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 5
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 5
- 241000700605 Viruses Species 0.000 description 5
- 108010044940 alanylglutamine Proteins 0.000 description 5
- 239000000872 buffer Substances 0.000 description 5
- 230000002759 chromosomal effect Effects 0.000 description 5
- 238000010367 cloning Methods 0.000 description 5
- 238000010276 construction Methods 0.000 description 5
- 238000001514 detection method Methods 0.000 description 5
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 5
- 108010089804 glycyl-threonine Proteins 0.000 description 5
- 238000003018 immunoassay Methods 0.000 description 5
- 239000012139 lysis buffer Substances 0.000 description 5
- 239000002609 medium Substances 0.000 description 5
- 229940111202 pepsin Drugs 0.000 description 5
- 239000013612 plasmid Substances 0.000 description 5
- 238000012545 processing Methods 0.000 description 5
- 230000002797 proteolythic effect Effects 0.000 description 5
- 108010061238 threonyl-glycine Proteins 0.000 description 5
- 238000013519 translation Methods 0.000 description 5
- 108010000700 Acetolactate synthase Proteins 0.000 description 4
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 4
- NZWDWXSWUQCNMG-GARJFASQSA-N Asp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)C(=O)O NZWDWXSWUQCNMG-GARJFASQSA-N 0.000 description 4
- 241000283690 Bos taurus Species 0.000 description 4
- 241000701489 Cauliflower mosaic virus Species 0.000 description 4
- NDKSHNQINMRKHT-PEXQALLHSA-N His-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N NDKSHNQINMRKHT-PEXQALLHSA-N 0.000 description 4
- 241000282412 Homo Species 0.000 description 4
- 206010020649 Hyperkeratosis Diseases 0.000 description 4
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 4
- QZZIBQZLWBOOJH-PEDHHIEDSA-N Ile-Ile-Val Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)O QZZIBQZLWBOOJH-PEDHHIEDSA-N 0.000 description 4
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 4
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 4
- 241000209510 Liliopsida Species 0.000 description 4
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 4
- 239000012722 SDS sample buffer Substances 0.000 description 4
- 229920002684 Sepharose Polymers 0.000 description 4
- YXEYTHXDRDAIOJ-CWRNSKLLSA-N Ser-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CO)N)C(=O)O YXEYTHXDRDAIOJ-CWRNSKLLSA-N 0.000 description 4
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 4
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 4
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 4
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 4
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 4
- 150000007513 acids Chemical class 0.000 description 4
- 238000013459 approach Methods 0.000 description 4
- 230000003115 biocidal effect Effects 0.000 description 4
- 239000003795 chemical substances by application Substances 0.000 description 4
- 230000003247 decreasing effect Effects 0.000 description 4
- 241001233957 eudicotyledons Species 0.000 description 4
- 210000003527 eukaryotic cell Anatomy 0.000 description 4
- BRZYSWJRSDMWLG-CAXSIQPQSA-N geneticin Chemical compound O1C[C@@](O)(C)[C@H](NC)[C@@H](O)[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](C(C)O)O2)N)[C@@H](N)C[C@H]1N BRZYSWJRSDMWLG-CAXSIQPQSA-N 0.000 description 4
- 108010049041 glutamylalanine Proteins 0.000 description 4
- 108010050848 glycylleucine Proteins 0.000 description 4
- PJJJBBJSCAKJQF-UHFFFAOYSA-N guanidinium chloride Chemical compound [Cl-].NC(N)=[NH2+] PJJJBBJSCAKJQF-UHFFFAOYSA-N 0.000 description 4
- 230000002363 herbicidal effect Effects 0.000 description 4
- 229910052739 hydrogen Inorganic materials 0.000 description 4
- 239000001257 hydrogen Substances 0.000 description 4
- 238000003780 insertion Methods 0.000 description 4
- 230000037431 insertion Effects 0.000 description 4
- ZNJFBWYDHIGLCU-UHFFFAOYSA-N jasmonic acid Natural products CCC=CCC1C(CC(O)=O)CCC1=O ZNJFBWYDHIGLCU-UHFFFAOYSA-N 0.000 description 4
- 108010057821 leucylproline Proteins 0.000 description 4
- MYWUZJCMWCOHBA-VIFPVBQESA-N methamphetamine Chemical compound CN[C@@H](C)CC1=CC=CC=C1 MYWUZJCMWCOHBA-VIFPVBQESA-N 0.000 description 4
- 238000010369 molecular cloning Methods 0.000 description 4
- 210000000496 pancreas Anatomy 0.000 description 4
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 4
- 229920000642 polymer Polymers 0.000 description 4
- 108010004914 prolylarginine Proteins 0.000 description 4
- 230000000644 propagated effect Effects 0.000 description 4
- 210000001938 protoplast Anatomy 0.000 description 4
- 238000000746 purification Methods 0.000 description 4
- 230000001105 regulatory effect Effects 0.000 description 4
- 239000000243 solution Substances 0.000 description 4
- 238000010561 standard procedure Methods 0.000 description 4
- 210000002784 stomach Anatomy 0.000 description 4
- 239000012089 stop solution Substances 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 230000009469 supplementation Effects 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- 230000002103 transcriptional effect Effects 0.000 description 4
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 3
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 3
- HNJNAMGZQZPSRE-GUBZILKMSA-N Arg-Pro-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O HNJNAMGZQZPSRE-GUBZILKMSA-N 0.000 description 3
- BUVNWKQBMZLCDW-UGYAYLCHSA-N Asp-Asn-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BUVNWKQBMZLCDW-UGYAYLCHSA-N 0.000 description 3
- 108091035707 Consensus sequence Proteins 0.000 description 3
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 3
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 3
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 3
- 241000238631 Hexapoda Species 0.000 description 3
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 3
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 3
- OIYWBDBHEGAVST-BZSNNMDCSA-N Lys-His-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OIYWBDBHEGAVST-BZSNNMDCSA-N 0.000 description 3
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 3
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 3
- 241000219823 Medicago Species 0.000 description 3
- 206010028980 Neoplasm Diseases 0.000 description 3
- 241000209094 Oryza Species 0.000 description 3
- 108010067372 Pancreatic elastase Proteins 0.000 description 3
- 102000016387 Pancreatic elastase Human genes 0.000 description 3
- ORPZXBQTEHINPB-SRVKXCTJSA-N Pro-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H]1CCCN1)C(O)=O ORPZXBQTEHINPB-SRVKXCTJSA-N 0.000 description 3
- VPVHXWGPALPDGP-GUBZILKMSA-N Pro-Asn-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPVHXWGPALPDGP-GUBZILKMSA-N 0.000 description 3
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 3
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 3
- 108010025216 RVF peptide Proteins 0.000 description 3
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 3
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 3
- 108090000787 Subtilisin Proteins 0.000 description 3
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 3
- 241000209140 Triticum Species 0.000 description 3
- 101710162629 Trypsin inhibitor Proteins 0.000 description 3
- 229940122618 Trypsin inhibitor Drugs 0.000 description 3
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 3
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 3
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 3
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 3
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 3
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 3
- 235000004279 alanine Nutrition 0.000 description 3
- 108010041407 alanylaspartic acid Proteins 0.000 description 3
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 3
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 3
- 235000011130 ammonium sulphate Nutrition 0.000 description 3
- 230000000890 antigenic effect Effects 0.000 description 3
- 230000004071 biological effect Effects 0.000 description 3
- 238000009395 breeding Methods 0.000 description 3
- 230000001488 breeding effect Effects 0.000 description 3
- 244000038559 crop plants Species 0.000 description 3
- 230000002708 enhancing effect Effects 0.000 description 3
- 235000013305 food Nutrition 0.000 description 3
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 3
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 3
- 108010015792 glycyllysine Proteins 0.000 description 3
- 229960000789 guanidine hydrochloride Drugs 0.000 description 3
- 238000003119 immunoblot Methods 0.000 description 3
- 230000006872 improvement Effects 0.000 description 3
- 108010078274 isoleucylvaline Proteins 0.000 description 3
- 229930027917 kanamycin Natural products 0.000 description 3
- 229960000318 kanamycin Drugs 0.000 description 3
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 3
- 229930182823 kanamycin A Natural products 0.000 description 3
- 150000002632 lipids Chemical class 0.000 description 3
- 108010003700 lysyl aspartic acid Proteins 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 210000003470 mitochondria Anatomy 0.000 description 3
- 230000036961 partial effect Effects 0.000 description 3
- 238000010647 peptide synthesis reaction Methods 0.000 description 3
- 239000012071 phase Substances 0.000 description 3
- 230000008488 polyadenylation Effects 0.000 description 3
- 230000004481 post-translational protein modification Effects 0.000 description 3
- 238000001742 protein purification Methods 0.000 description 3
- 238000003127 radioimmunoassay Methods 0.000 description 3
- 230000001568 sexual effect Effects 0.000 description 3
- 238000002741 site-directed mutagenesis Methods 0.000 description 3
- 239000007790 solid phase Substances 0.000 description 3
- -1 sulfur amino acids Chemical class 0.000 description 3
- 238000010189 synthetic method Methods 0.000 description 3
- 230000005026 transcription initiation Effects 0.000 description 3
- 239000002753 trypsin inhibitor Substances 0.000 description 3
- 108010084932 tryptophyl-proline Proteins 0.000 description 3
- 108010051110 tyrosyl-lysine Proteins 0.000 description 3
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 3
- 238000001262 western blot Methods 0.000 description 3
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 2
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 2
- ZBMRKNMTMPPMMK-UHFFFAOYSA-N 2-amino-4-[hydroxy(methyl)phosphoryl]butanoic acid;azane Chemical compound [NH4+].CP(O)(=O)CCC(N)C([O-])=O ZBMRKNMTMPPMMK-UHFFFAOYSA-N 0.000 description 2
- 230000005730 ADP ribosylation Effects 0.000 description 2
- 229930024421 Adenine Natural products 0.000 description 2
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 2
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 2
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 2
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 2
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 2
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 2
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 2
- XAXMJQUMRJAFCH-CQDKDKBSSA-N Ala-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 XAXMJQUMRJAFCH-CQDKDKBSSA-N 0.000 description 2
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 2
- 108700028369 Alleles Proteins 0.000 description 2
- 108700042778 Antimicrobial Peptides Proteins 0.000 description 2
- 102000044503 Antimicrobial Peptides Human genes 0.000 description 2
- 101100062433 Arabidopsis thaliana DHDPS1 gene Proteins 0.000 description 2
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 2
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 2
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 2
- AWMAZIIEFPFHCP-RCWTZXSCSA-N Arg-Pro-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWMAZIIEFPFHCP-RCWTZXSCSA-N 0.000 description 2
- FMYQECOAIFGQGU-CYDGBPFRSA-N Arg-Val-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMYQECOAIFGQGU-CYDGBPFRSA-N 0.000 description 2
- FXGMURPOWCKNAZ-JYJNAYRXSA-N Arg-Val-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FXGMURPOWCKNAZ-JYJNAYRXSA-N 0.000 description 2
- 108010055400 Aspartate kinase Proteins 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 244000205479 Bertholletia excelsa Species 0.000 description 2
- 235000012284 Bertholletia excelsa Nutrition 0.000 description 2
- 244000188595 Brassica sinapistrum Species 0.000 description 2
- 102100032202 Cornulin Human genes 0.000 description 2
- SWJYSDXMTPMBHO-FXQIFTODSA-N Cys-Pro-Ser Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SWJYSDXMTPMBHO-FXQIFTODSA-N 0.000 description 2
- 108010090461 DFG peptide Proteins 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- 229940124143 Endopeptidase inhibitor Drugs 0.000 description 2
- 241000206602 Eukaryota Species 0.000 description 2
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 2
- VLOLPWWCNKWRNB-LOKLDPHHSA-N Gln-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VLOLPWWCNKWRNB-LOKLDPHHSA-N 0.000 description 2
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 2
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 2
- 101000920981 Homo sapiens Cornulin Proteins 0.000 description 2
- UCGDDTHMMVWVMV-FSPLSTOPSA-N Ile-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(O)=O UCGDDTHMMVWVMV-FSPLSTOPSA-N 0.000 description 2
- XQLGNKLSPYCRMZ-HJWJTTGWSA-N Ile-Phe-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)O)N XQLGNKLSPYCRMZ-HJWJTTGWSA-N 0.000 description 2
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 2
- 108091092195 Intron Proteins 0.000 description 2
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 2
- RRSLQOLASISYTB-CIUDSAMLSA-N Leu-Cys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O RRSLQOLASISYTB-CIUDSAMLSA-N 0.000 description 2
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 2
- OTXBNHIUIHNGAO-UWVGGRQHSA-N Leu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN OTXBNHIUIHNGAO-UWVGGRQHSA-N 0.000 description 2
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 2
- 241000227653 Lycopersicon Species 0.000 description 2
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 2
- UIJVKVHLCQSPOJ-XIRDDKMYSA-N Lys-Ser-Trp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O UIJVKVHLCQSPOJ-XIRDDKMYSA-N 0.000 description 2
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 2
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 2
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 2
- 239000004677 Nylon Substances 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- 108010019160 Pancreatin Proteins 0.000 description 2
- 241001494479 Pecora Species 0.000 description 2
- YDUGVDGFKNXFPL-IXOXFDKPSA-N Phe-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YDUGVDGFKNXFPL-IXOXFDKPSA-N 0.000 description 2
- 108700001094 Plant Genes Proteins 0.000 description 2
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 2
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 2
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 2
- 241000209056 Secale Species 0.000 description 2
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 2
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 2
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 2
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 2
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 2
- 229940122055 Serine protease inhibitor Drugs 0.000 description 2
- 101710102218 Serine protease inhibitor Proteins 0.000 description 2
- 101001044900 Solanum tuberosum Proteinase inhibitor 1 Proteins 0.000 description 2
- 101710151905 Subtilisin inhibitor Proteins 0.000 description 2
- GZYNMZQXFRWDFH-YTWAJWBKSA-N Thr-Arg-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O GZYNMZQXFRWDFH-YTWAJWBKSA-N 0.000 description 2
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 2
- 229920004890 Triton X-100 Polymers 0.000 description 2
- 239000013504 Triton X-100 Substances 0.000 description 2
- IVBJBFSWJDNQFW-XIRDDKMYSA-N Trp-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IVBJBFSWJDNQFW-XIRDDKMYSA-N 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- MJOUSKQHAIARKI-JYJNAYRXSA-N Val-Phe-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 MJOUSKQHAIARKI-JYJNAYRXSA-N 0.000 description 2
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 2
- 238000002835 absorbance Methods 0.000 description 2
- 230000009471 action Effects 0.000 description 2
- 229960000643 adenine Drugs 0.000 description 2
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 2
- 108010060035 arginylproline Proteins 0.000 description 2
- 238000002820 assay format Methods 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 101150103518 bar gene Proteins 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 2
- 238000005341 cation exchange Methods 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 238000004587 chromatography analysis Methods 0.000 description 2
- 108010060199 cysteinylproline Proteins 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 235000013399 edible fruits Nutrition 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 210000002257 embryonic structure Anatomy 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000005284 excitation Effects 0.000 description 2
- 239000000835 fiber Substances 0.000 description 2
- 230000037433 frameshift Effects 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- 238000001641 gel filtration chromatography Methods 0.000 description 2
- 238000003205 genotyping method Methods 0.000 description 2
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 2
- 230000013595 glycosylation Effects 0.000 description 2
- 238000006206 glycosylation reaction Methods 0.000 description 2
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 2
- 230000012010 growth Effects 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 230000033444 hydroxylation Effects 0.000 description 2
- 238000005805 hydroxylation reaction Methods 0.000 description 2
- 108010002685 hygromycin-B kinase Proteins 0.000 description 2
- 230000001900 immune effect Effects 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 230000000977 initiatory effect Effects 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 210000000936 intestine Anatomy 0.000 description 2
- 238000007834 ligase chain reaction Methods 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 210000004962 mammalian cell Anatomy 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 238000002844 melting Methods 0.000 description 2
- 230000008018 melting Effects 0.000 description 2
- 239000003147 molecular marker Substances 0.000 description 2
- 229940111688 monobasic potassium phosphate Drugs 0.000 description 2
- 235000019796 monopotassium phosphate Nutrition 0.000 description 2
- 238000002703 mutagenesis Methods 0.000 description 2
- 231100000350 mutagenesis Toxicity 0.000 description 2
- 229910052757 nitrogen Inorganic materials 0.000 description 2
- 239000002853 nucleic acid probe Substances 0.000 description 2
- 229920001778 nylon Polymers 0.000 description 2
- 229940055695 pancreatin Drugs 0.000 description 2
- 108010091212 pepstatin Proteins 0.000 description 2
- FAXGPCHRFPCXOO-LXTPJMTPSA-N pepstatin A Chemical compound OC(=O)C[C@H](O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)C[C@H](O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)NC(=O)CC(C)C FAXGPCHRFPCXOO-LXTPJMTPSA-N 0.000 description 2
- 238000003359 percent control normalization Methods 0.000 description 2
- 238000003976 plant breeding Methods 0.000 description 2
- 210000002706 plastid Anatomy 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- GNSKLFRGEWLPPA-UHFFFAOYSA-M potassium dihydrogen phosphate Chemical compound [K+].OP(O)([O-])=O GNSKLFRGEWLPPA-UHFFFAOYSA-M 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 238000010188 recombinant method Methods 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 230000000717 retained effect Effects 0.000 description 2
- 238000004366 reverse phase liquid chromatography Methods 0.000 description 2
- 238000012552 review Methods 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 239000003001 serine protease inhibitor Substances 0.000 description 2
- 229910001415 sodium ion Inorganic materials 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 238000010532 solid phase synthesis reaction Methods 0.000 description 2
- 230000002269 spontaneous effect Effects 0.000 description 2
- 238000003860 storage Methods 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- 108010008664 streptomycin 3''-kinase Proteins 0.000 description 2
- 230000019635 sulfation Effects 0.000 description 2
- 238000005670 sulfation reaction Methods 0.000 description 2
- 238000004809 thin layer chromatography Methods 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 230000001131 transforming effect Effects 0.000 description 2
- 230000017105 transposition Effects 0.000 description 2
- 125000000430 tryptophan group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C2=C([H])C([H])=C([H])C([H])=C12 0.000 description 2
- 238000010798 ubiquitination Methods 0.000 description 2
- 230000034512 ubiquitination Effects 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- ZDSRFXVZVHSYMA-CMOCDZPBSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-4-carboxybutanoyl]amino]pentanedioic acid Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=C(O)C=C1 ZDSRFXVZVHSYMA-CMOCDZPBSA-N 0.000 description 1
- QMOQBVOBWVNSNO-UHFFFAOYSA-N 2-[[2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(O)=O QMOQBVOBWVNSNO-UHFFFAOYSA-N 0.000 description 1
- XJFPXLWGZWAWRQ-UHFFFAOYSA-N 2-[[2-[[2-[[2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]acetyl]amino]acetyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(O)=O XJFPXLWGZWAWRQ-UHFFFAOYSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- ODHCTXKNWHHXJC-VKHMYHEASA-N 5-oxo-L-proline Chemical compound OC(=O)[C@@H]1CCC(=O)N1 ODHCTXKNWHHXJC-VKHMYHEASA-N 0.000 description 1
- 101150011812 AADAC gene Proteins 0.000 description 1
- 101150001232 ALS gene Proteins 0.000 description 1
- 241000589156 Agrobacterium rhizogenes Species 0.000 description 1
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 1
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 1
- VWEWCZSUWOEEFM-WDSKDSINSA-N Ala-Gly-Ala-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(=O)NCC(O)=O VWEWCZSUWOEEFM-WDSKDSINSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- AWNAEZICPNGAJK-FXQIFTODSA-N Ala-Met-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O AWNAEZICPNGAJK-FXQIFTODSA-N 0.000 description 1
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 1
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 1
- 102000007698 Alcohol dehydrogenase Human genes 0.000 description 1
- 108010021809 Alcohol dehydrogenase Proteins 0.000 description 1
- 241000207875 Antirrhinum Species 0.000 description 1
- 241000219194 Arabidopsis Species 0.000 description 1
- BHSYMWWMVRPCPA-CYDGBPFRSA-N Arg-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N BHSYMWWMVRPCPA-CYDGBPFRSA-N 0.000 description 1
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 1
- YSUVMPICYVWRBX-VEVYYDQMSA-N Arg-Asp-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YSUVMPICYVWRBX-VEVYYDQMSA-N 0.000 description 1
- ZEAYJGRKRUBDOB-GARJFASQSA-N Arg-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZEAYJGRKRUBDOB-GARJFASQSA-N 0.000 description 1
- QEHMMRSQJMOYNO-DCAQKATOSA-N Arg-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QEHMMRSQJMOYNO-DCAQKATOSA-N 0.000 description 1
- YKBHOXLMMPZPHQ-GMOBBJLQSA-N Arg-Ile-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O YKBHOXLMMPZPHQ-GMOBBJLQSA-N 0.000 description 1
- IIAXFBUTKIDDIP-ULQDDVLXSA-N Arg-Leu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IIAXFBUTKIDDIP-ULQDDVLXSA-N 0.000 description 1
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 1
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 1
- QTAIIXQCOPUNBQ-QXEWZRGKSA-N Arg-Val-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QTAIIXQCOPUNBQ-QXEWZRGKSA-N 0.000 description 1
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 1
- CGXQUULXFWRJOI-SRVKXCTJSA-N Arg-Val-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O CGXQUULXFWRJOI-SRVKXCTJSA-N 0.000 description 1
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 1
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 1
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 1
- MFMJRYHVLLEMQM-DCAQKATOSA-N Asp-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N MFMJRYHVLLEMQM-DCAQKATOSA-N 0.000 description 1
- DBWYWXNMZZYIRY-LPEHRKFASA-N Asp-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O DBWYWXNMZZYIRY-LPEHRKFASA-N 0.000 description 1
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 1
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 description 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 1
- QOVWVLLHMMCFFY-ZLUOBGJFSA-N Asp-Asp-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QOVWVLLHMMCFFY-ZLUOBGJFSA-N 0.000 description 1
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 1
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 1
- UZFHNLYQWMGUHU-DCAQKATOSA-N Asp-Lys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZFHNLYQWMGUHU-DCAQKATOSA-N 0.000 description 1
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 1
- AKKUDRZKFZWPBH-SRVKXCTJSA-N Asp-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N AKKUDRZKFZWPBH-SRVKXCTJSA-N 0.000 description 1
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 1
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 1
- IDDMGSKZQDEDGA-SRVKXCTJSA-N Asp-Phe-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 IDDMGSKZQDEDGA-SRVKXCTJSA-N 0.000 description 1
- UEFODXNXUAVPTC-VEVYYDQMSA-N Asp-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UEFODXNXUAVPTC-VEVYYDQMSA-N 0.000 description 1
- HCOQNGIHSXICCB-IHRRRGAJSA-N Asp-Tyr-Arg Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)O HCOQNGIHSXICCB-IHRRRGAJSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 235000005340 Asparagus officinalis Nutrition 0.000 description 1
- 241001106067 Atropa Species 0.000 description 1
- 235000005781 Avena Nutrition 0.000 description 1
- 244000075850 Avena orientalis Species 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 241000219198 Brassica Species 0.000 description 1
- 235000011331 Brassica Nutrition 0.000 description 1
- 240000002791 Brassica napus Species 0.000 description 1
- 241000724256 Brome mosaic virus Species 0.000 description 1
- 125000001433 C-terminal amino-acid group Chemical group 0.000 description 1
- 101100264195 Caenorhabditis elegans app-1 gene Proteins 0.000 description 1
- 101100505161 Caenorhabditis elegans mel-32 gene Proteins 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 241000282461 Canis lupus Species 0.000 description 1
- 235000002566 Capsicum Nutrition 0.000 description 1
- 240000008574 Capsicum frutescens Species 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 239000005496 Chlorsulfuron Substances 0.000 description 1
- 241000223782 Ciliophora Species 0.000 description 1
- 241000207199 Citrus Species 0.000 description 1
- 108700010070 Codon Usage Proteins 0.000 description 1
- 108091028732 Concatemer Proteins 0.000 description 1
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 1
- WQZGKKKJIJFFOK-QTVWNMPRSA-N D-mannopyranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-QTVWNMPRSA-N 0.000 description 1
- 108020003215 DNA Probes Proteins 0.000 description 1
- 239000003155 DNA primer Substances 0.000 description 1
- 239000003298 DNA probe Substances 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 241000208296 Datura Species 0.000 description 1
- 241000208175 Daucus Species 0.000 description 1
- 240000001879 Digitalis lutea Species 0.000 description 1
- BWGNESOTFCXPMA-UHFFFAOYSA-N Dihydrogen disulfide Chemical compound SS BWGNESOTFCXPMA-UHFFFAOYSA-N 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- 241000702463 Geminiviridae Species 0.000 description 1
- 241000208152 Geranium Species 0.000 description 1
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 1
- JXBZEDIQFFCHPZ-PEFMBERDSA-N Gln-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JXBZEDIQFFCHPZ-PEFMBERDSA-N 0.000 description 1
- RGAOLBZBLOJUTP-GRLWGSQLSA-N Gln-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N RGAOLBZBLOJUTP-GRLWGSQLSA-N 0.000 description 1
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 1
- JZDHUJAFXGNDSB-WHFBIAKZSA-N Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O JZDHUJAFXGNDSB-WHFBIAKZSA-N 0.000 description 1
- ZZIFPJZQHRJERU-WDSKDSINSA-N Glu-Cys-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ZZIFPJZQHRJERU-WDSKDSINSA-N 0.000 description 1
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 1
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 1
- HJTSRYLPAYGEEC-SIUGBPQLSA-N Glu-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N HJTSRYLPAYGEEC-SIUGBPQLSA-N 0.000 description 1
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 1
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 1
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 1
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 1
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 1
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 1
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 1
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- 241000208818 Helianthus Species 0.000 description 1
- HDXNWVLQSQFJOX-SRVKXCTJSA-N His-Arg-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HDXNWVLQSQFJOX-SRVKXCTJSA-N 0.000 description 1
- RNVUQLOKVIPNEM-BZSNNMDCSA-N His-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O RNVUQLOKVIPNEM-BZSNNMDCSA-N 0.000 description 1
- 101000582320 Homo sapiens Neurogenic differentiation factor 6 Proteins 0.000 description 1
- 101001042146 Hordeum vulgare Subtilisin-chymotrypsin inhibitor-2A Proteins 0.000 description 1
- 241000208278 Hyoscyamus Species 0.000 description 1
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 1
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 1
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 1
- BCVIOZZGJNOEQS-XKNYDFJKSA-N Ile-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)[C@@H](C)CC BCVIOZZGJNOEQS-XKNYDFJKSA-N 0.000 description 1
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 1
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 1
- BCXBIONYYJCSDF-CIUDSAMLSA-N Ile-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(O)=O BCXBIONYYJCSDF-CIUDSAMLSA-N 0.000 description 1
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 1
- IPFKIGNDTUOFAF-CYDGBPFRSA-N Ile-Val-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IPFKIGNDTUOFAF-CYDGBPFRSA-N 0.000 description 1
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 1
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 1
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 1
- 229930010555 Inosine Natural products 0.000 description 1
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- 108010002386 Interleukin-3 Proteins 0.000 description 1
- 235000013757 Juglans Nutrition 0.000 description 1
- 241000758789 Juglans Species 0.000 description 1
- 108010025815 Kanamycin Kinase Proteins 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- LEVWYRKDKASIDU-IMJSIDKUSA-N L-cystine Chemical compound [O-]C(=O)[C@@H]([NH3+])CSSC[C@H]([NH3+])C([O-])=O LEVWYRKDKASIDU-IMJSIDKUSA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- 241000208822 Lactuca Species 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- 108090001090 Lectins Proteins 0.000 description 1
- 102000004856 Lectins Human genes 0.000 description 1
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 1
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 1
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- 241000209082 Lolium Species 0.000 description 1
- 235000002262 Lycopersicon Nutrition 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 1
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 1
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 1
- QQUJSUFWEDZQQY-AVGNSLFASA-N Lys-Gln-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN QQUJSUFWEDZQQY-AVGNSLFASA-N 0.000 description 1
- UGTZHPSKYRIGRJ-YUMQZZPRSA-N Lys-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O UGTZHPSKYRIGRJ-YUMQZZPRSA-N 0.000 description 1
- IVFUVMSKSFSFBT-NHCYSSNCSA-N Lys-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN IVFUVMSKSFSFBT-NHCYSSNCSA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 1
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 1
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 1
- NVGBPTNZLWRQSY-UWVGGRQHSA-N Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN NVGBPTNZLWRQSY-UWVGGRQHSA-N 0.000 description 1
- AIXUQKMMBQJZCU-IUCAKERBSA-N Lys-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O AIXUQKMMBQJZCU-IUCAKERBSA-N 0.000 description 1
- OBZHNHBAAVEWKI-DCAQKATOSA-N Lys-Pro-Asn Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O OBZHNHBAAVEWKI-DCAQKATOSA-N 0.000 description 1
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 1
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 1
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 1
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 1
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 1
- 241000121629 Majorana Species 0.000 description 1
- 240000003183 Manihot esculenta Species 0.000 description 1
- QTZXSYBVOSXBEJ-WDSKDSINSA-N Met-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O QTZXSYBVOSXBEJ-WDSKDSINSA-N 0.000 description 1
- STTRPDDKDVKIDF-KKUMJFAQSA-N Met-Glu-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 STTRPDDKDVKIDF-KKUMJFAQSA-N 0.000 description 1
- WEDDFMCSUNNZJR-WDSKDSINSA-N Met-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(O)=O WEDDFMCSUNNZJR-WDSKDSINSA-N 0.000 description 1
- LXCSZPUQKMTXNW-BQBZGAKWSA-N Met-Ser-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O LXCSZPUQKMTXNW-BQBZGAKWSA-N 0.000 description 1
- LPNWWHBFXPNHJG-AVGNSLFASA-N Met-Val-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN LPNWWHBFXPNHJG-AVGNSLFASA-N 0.000 description 1
- 241001024304 Mino Species 0.000 description 1
- 102000016943 Muramidase Human genes 0.000 description 1
- 108010014251 Muramidase Proteins 0.000 description 1
- 241000204031 Mycoplasma Species 0.000 description 1
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 1
- PYUSHNKNPOHWEZ-YFKPBYRVSA-N N-formyl-L-methionine Chemical compound CSCC[C@@H](C(O)=O)NC=O PYUSHNKNPOHWEZ-YFKPBYRVSA-N 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 125000000729 N-terminal amino-acid group Chemical group 0.000 description 1
- 101800000135 N-terminal protein Proteins 0.000 description 1
- 238000005481 NMR spectroscopy Methods 0.000 description 1
- 241001282315 Nemesis Species 0.000 description 1
- 102100030589 Neurogenic differentiation factor 6 Human genes 0.000 description 1
- 241000208125 Nicotiana Species 0.000 description 1
- 239000000020 Nitrocellulose Substances 0.000 description 1
- 238000000636 Northern blotting Methods 0.000 description 1
- 108091005461 Nucleic proteins Proteins 0.000 description 1
- 108010003571 Nut Proteins Proteins 0.000 description 1
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 1
- 241000231814 Omus Species 0.000 description 1
- 241000219830 Onobrychis Species 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 101800001452 P1 proteinase Proteins 0.000 description 1
- 101150107301 Pecr gene Proteins 0.000 description 1
- 241000209046 Pennisetum Species 0.000 description 1
- 108091093037 Peptide nucleic acid Proteins 0.000 description 1
- 240000007377 Petunia x hybrida Species 0.000 description 1
- 241000219833 Phaseolus Species 0.000 description 1
- HCTXJGRYAACKOB-SRVKXCTJSA-N Phe-Asn-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HCTXJGRYAACKOB-SRVKXCTJSA-N 0.000 description 1
- NHCKESBLOMHIIE-IRXDYDNUSA-N Phe-Gly-Phe Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NHCKESBLOMHIIE-IRXDYDNUSA-N 0.000 description 1
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 1
- IAJOBQBIJHVGMQ-UHFFFAOYSA-N Phosphinothricin Natural products CP(O)(=O)CCC(N)C(O)=O IAJOBQBIJHVGMQ-UHFFFAOYSA-N 0.000 description 1
- 241000218657 Picea Species 0.000 description 1
- 101100271190 Plasmodium falciparum (isolate 3D7) ATAT gene Proteins 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- 241000219000 Populus Species 0.000 description 1
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 1
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 1
- VDGTVWFMRXVQCT-GUBZILKMSA-N Pro-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 VDGTVWFMRXVQCT-GUBZILKMSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- BFXZQMWKTYWGCF-PYJNHQTQSA-N Pro-His-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BFXZQMWKTYWGCF-PYJNHQTQSA-N 0.000 description 1
- LPGSNRSLPHRNBW-AVGNSLFASA-N Pro-His-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 LPGSNRSLPHRNBW-AVGNSLFASA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- AWJGUZSYVIVZGP-YUMQZZPRSA-N Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 AWJGUZSYVIVZGP-YUMQZZPRSA-N 0.000 description 1
- 108010019653 Pwo polymerase Proteins 0.000 description 1
- 108010066717 Q beta Replicase Proteins 0.000 description 1
- 230000004570 RNA-binding Effects 0.000 description 1
- 241000218206 Ranunculus Species 0.000 description 1
- 241000220259 Raphanus Species 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 108700005075 Regulator Genes Proteins 0.000 description 1
- AUNGANRZJHBGPY-SCRDCRAPSA-N Riboflavin Chemical compound OC[C@@H](O)[C@@H](O)[C@@H](O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O AUNGANRZJHBGPY-SCRDCRAPSA-N 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- 108010003581 Ribulose-bisphosphate carboxylase Proteins 0.000 description 1
- 235000011449 Rosa Nutrition 0.000 description 1
- 108091006629 SLC13A2 Proteins 0.000 description 1
- 241001106018 Salpiglossis Species 0.000 description 1
- 235000007238 Secale cereale Nutrition 0.000 description 1
- 108010016634 Seed Storage Proteins Proteins 0.000 description 1
- 241000780602 Senecio Species 0.000 description 1
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 1
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 1
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 1
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 1
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 1
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 1
- UYLKOSODXYSWMQ-XGEHTFHBSA-N Ser-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CO)N)O UYLKOSODXYSWMQ-XGEHTFHBSA-N 0.000 description 1
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 1
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- XUIMIQQOPSSXEZ-UHFFFAOYSA-N Silicon Chemical compound [Si] XUIMIQQOPSSXEZ-UHFFFAOYSA-N 0.000 description 1
- 241000220261 Sinapis Species 0.000 description 1
- 241000207763 Solanum Species 0.000 description 1
- 235000002634 Solanum Nutrition 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- VOHWDZNIESHTFW-XKBZYTNZSA-N Thr-Glu-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O VOHWDZNIESHTFW-XKBZYTNZSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 1
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 1
- WRUWXBBEFUTJOU-XGEHTFHBSA-N Thr-Met-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N)O WRUWXBBEFUTJOU-XGEHTFHBSA-N 0.000 description 1
- JAJOFWABAUKAEJ-QTKMDUPCSA-N Thr-Pro-His Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O JAJOFWABAUKAEJ-QTKMDUPCSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- GQPQJNMVELPZNQ-GBALPHGKSA-N Thr-Ser-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O GQPQJNMVELPZNQ-GBALPHGKSA-N 0.000 description 1
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 1
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 1
- PJCYRZVSACOYSN-ZJDVBMNYSA-N Thr-Thr-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O PJCYRZVSACOYSN-ZJDVBMNYSA-N 0.000 description 1
- 241000723873 Tobacco mosaic virus Species 0.000 description 1
- 108020004566 Transfer RNA Proteins 0.000 description 1
- 241001312519 Trigonella Species 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- 235000019714 Triticale Nutrition 0.000 description 1
- DXYQIGZZWYBXSD-JSGCOSHPSA-N Trp-Pro Chemical compound O=C([C@H](CC=1C2=CC=CC=C2NC=1)N)N1CCC[C@H]1C(O)=O DXYQIGZZWYBXSD-JSGCOSHPSA-N 0.000 description 1
- XDQGKIMTRSVSBC-WDSOQIARSA-N Trp-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CNC2=CC=CC=C12 XDQGKIMTRSVSBC-WDSOQIARSA-N 0.000 description 1
- GYKDRHDMGQUZPU-MGHWNKPDSA-N Tyr-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GYKDRHDMGQUZPU-MGHWNKPDSA-N 0.000 description 1
- RWOKVQUCENPXGE-IHRRRGAJSA-N Tyr-Ser-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RWOKVQUCENPXGE-IHRRRGAJSA-N 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- 102000044159 Ubiquitin Human genes 0.000 description 1
- 108091023045 Untranslated Region Proteins 0.000 description 1
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 1
- NMANTMWGQZASQN-QXEWZRGKSA-N Val-Arg-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N NMANTMWGQZASQN-QXEWZRGKSA-N 0.000 description 1
- UBTBGUDNDFZLGP-SRVKXCTJSA-N Val-Arg-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UBTBGUDNDFZLGP-SRVKXCTJSA-N 0.000 description 1
- GXAZTLJYINLMJL-LAEOZQHASA-N Val-Asn-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GXAZTLJYINLMJL-LAEOZQHASA-N 0.000 description 1
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 1
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 1
- UZDHNIJRRTUKKC-DLOVCJGASA-N Val-Gln-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UZDHNIJRRTUKKC-DLOVCJGASA-N 0.000 description 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 1
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 1
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 1
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- BGXVHVMJZCSOCA-AVGNSLFASA-N Val-Pro-Lys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N BGXVHVMJZCSOCA-AVGNSLFASA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 1
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 1
- KRNYOVHEKOBTEF-YUMQZZPRSA-N Val-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(O)=O KRNYOVHEKOBTEF-YUMQZZPRSA-N 0.000 description 1
- 241000219977 Vigna Species 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- 235000009392 Vitis Nutrition 0.000 description 1
- 241000219095 Vitis Species 0.000 description 1
- 241000209149 Zea Species 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 230000010933 acylation Effects 0.000 description 1
- 238000005917 acylation reaction Methods 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 244000193174 agave Species 0.000 description 1
- 230000032683 aging Effects 0.000 description 1
- 239000000556 agonist Substances 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 1
- 230000009435 amidation Effects 0.000 description 1
- 238000007112 amidation reaction Methods 0.000 description 1
- 150000003931 anilides Chemical class 0.000 description 1
- 235000019728 animal nutrition Nutrition 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 239000005557 antagonist Substances 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- 230000010516 arginylation Effects 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 230000010310 bacterial transformation Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 238000002306 biochemical method Methods 0.000 description 1
- 239000003139 biocide Substances 0.000 description 1
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 1
- 238000006664 bond formation reaction Methods 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 235000011148 calcium chloride Nutrition 0.000 description 1
- 101150039352 can gene Proteins 0.000 description 1
- 238000005251 capillar electrophoresis Methods 0.000 description 1
- 239000001390 capsicum minimum Substances 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 230000024245 cell differentiation Effects 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 230000006037 cell lysis Effects 0.000 description 1
- 235000013339 cereals Nutrition 0.000 description 1
- 238000001311 chemical methods and process Methods 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- VJYIFXVZLXQVHO-UHFFFAOYSA-N chlorsulfuron Chemical compound COC1=NC(C)=NC(NC(=O)NS(=O)(=O)C=2C(=CC=CC=2)Cl)=N1 VJYIFXVZLXQVHO-UHFFFAOYSA-N 0.000 description 1
- 238000002983 circular dichroism Methods 0.000 description 1
- 238000001142 circular dichroism spectrum Methods 0.000 description 1
- 235000020971 citrus fruits Nutrition 0.000 description 1
- 238000012411 cloning technique Methods 0.000 description 1
- 238000004440 column chromatography Methods 0.000 description 1
- 238000002967 competitive immunoassay Methods 0.000 description 1
- 238000004883 computer application Methods 0.000 description 1
- 238000009833 condensation Methods 0.000 description 1
- 230000005494 condensation Effects 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 238000004132 cross linking Methods 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 229960003067 cystine Drugs 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 210000000172 cytosol Anatomy 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000017858 demethylation Effects 0.000 description 1
- 238000010520 demethylation reaction Methods 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 230000001687 destabilization Effects 0.000 description 1
- 230000000368 destabilizing effect Effects 0.000 description 1
- 230000001627 detrimental effect Effects 0.000 description 1
- 235000005911 diet Nutrition 0.000 description 1
- 230000037213 diet Effects 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 238000006073 displacement reaction Methods 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 230000000408 embryogenic effect Effects 0.000 description 1
- 230000009088 enzymatic function Effects 0.000 description 1
- 235000020774 essential nutrients Nutrition 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 230000022244 formylation Effects 0.000 description 1
- 238000006170 formylation reaction Methods 0.000 description 1
- 238000013467 fragmentation Methods 0.000 description 1
- 238000006062 fragmentation reaction Methods 0.000 description 1
- 230000008014 freezing Effects 0.000 description 1
- 238000007710 freezing Methods 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 230000006251 gamma-carboxylation Effects 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- IAJOBQBIJHVGMQ-BYPYZUCNSA-N glufosinate-P Chemical compound CP(O)(=O)CC[C@H](N)C(O)=O IAJOBQBIJHVGMQ-BYPYZUCNSA-N 0.000 description 1
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 1
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 229960004198 guanidine Drugs 0.000 description 1
- 150000003278 haem Chemical group 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 238000003365 immunocytochemistry Methods 0.000 description 1
- 230000000951 immunodiffusion Effects 0.000 description 1
- 238000000760 immunoelectrophoresis Methods 0.000 description 1
- 238000001114 immunoprecipitation Methods 0.000 description 1
- 238000007901 in situ hybridization Methods 0.000 description 1
- 238000012296 in situ hybridization assay Methods 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 229960003786 inosine Drugs 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 230000026045 iodination Effects 0.000 description 1
- 238000006192 iodination reaction Methods 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 238000011005 laboratory method Methods 0.000 description 1
- 239000002523 lectin Substances 0.000 description 1
- 230000031700 light absorption Effects 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 229960000274 lysozyme Drugs 0.000 description 1
- 239000004325 lysozyme Substances 0.000 description 1
- 235000010335 lysozyme Nutrition 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 210000002231 macronucleus Anatomy 0.000 description 1
- 235000005739 manihot Nutrition 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 230000013011 mating Effects 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000011987 methylation Effects 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 235000019713 millet Nutrition 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 230000000869 mutational effect Effects 0.000 description 1
- 230000007498 myristoylation Effects 0.000 description 1
- CJWXCNXHAIFFMH-AVZHFPDBSA-N n-[(2s,3r,4s,5s,6r)-2-[(2r,3r,4s,5r)-2-acetamido-4,5,6-trihydroxy-1-oxohexan-3-yl]oxy-3,5-dihydroxy-6-methyloxan-4-yl]acetamide Chemical compound C[C@H]1O[C@@H](O[C@@H]([C@@H](O)[C@H](O)CO)[C@@H](NC(C)=O)C=O)[C@H](O)[C@@H](NC(C)=O)[C@@H]1O CJWXCNXHAIFFMH-AVZHFPDBSA-N 0.000 description 1
- 229920001220 nitrocellulos Polymers 0.000 description 1
- 238000007899 nucleic acid hybridization Methods 0.000 description 1
- 230000035764 nutrition Effects 0.000 description 1
- 239000002751 oligonucleotide probe Substances 0.000 description 1
- 238000009401 outcrossing Methods 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 108091000699 pea lectin Proteins 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 230000001175 peptic effect Effects 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 150000004713 phosphodiesters Chemical class 0.000 description 1
- 229930195732 phytohormone Natural products 0.000 description 1
- 238000004161 plant tissue culture Methods 0.000 description 1
- 230000010152 pollination Effects 0.000 description 1
- 230000001323 posttranslational effect Effects 0.000 description 1
- 238000011533 pre-incubation Methods 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 230000013823 prenylation Effects 0.000 description 1
- 239000002987 primer (paints) Substances 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 238000000159 protein binding assay Methods 0.000 description 1
- 229940121649 protein inhibitor Drugs 0.000 description 1
- 239000012268 protein inhibitor Substances 0.000 description 1
- 230000009145 protein modification Effects 0.000 description 1
- 238000000734 protein sequencing Methods 0.000 description 1
- 238000001243 protein synthesis Methods 0.000 description 1
- NHDHVHZZCFYRSB-UHFFFAOYSA-N pyriproxyfen Chemical compound C=1C=CC=NC=1OC(C)COC(C=C1)=CC=C1OC1=CC=CC=C1 NHDHVHZZCFYRSB-UHFFFAOYSA-N 0.000 description 1
- 229940043131 pyroglutamate Drugs 0.000 description 1
- 238000010791 quenching Methods 0.000 description 1
- 230000000171 quenching effect Effects 0.000 description 1
- 230000006340 racemization Effects 0.000 description 1
- 239000000376 reactant Substances 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 230000001850 reproductive effect Effects 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 238000007363 ring formation reaction Methods 0.000 description 1
- 238000007423 screening assay Methods 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 229910052710 silicon Inorganic materials 0.000 description 1
- 239000010703 silicon Substances 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 230000000392 somatic effect Effects 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 229960000268 spectinomycin Drugs 0.000 description 1
- UNFWWIHTNXNPBV-WXKVUWSESA-N spectinomycin Chemical compound O([C@@H]1[C@@H](NC)[C@@H](O)[C@H]([C@@H]([C@H]1O1)O)NC)[C@]2(O)[C@H]1O[C@H](C)CC2=O UNFWWIHTNXNPBV-WXKVUWSESA-N 0.000 description 1
- 230000006641 stabilisation Effects 0.000 description 1
- 238000011105 stabilization Methods 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 229910052717 sulfur Inorganic materials 0.000 description 1
- 239000011593 sulfur Substances 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
- 229920002994 synthetic fiber Polymers 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 238000012090 tissue culture technique Methods 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 108010068794 tyrosyl-tyrosyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 230000003827 upregulation Effects 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- 108010036320 valylleucine Proteins 0.000 description 1
- 108010021889 valylvaline Proteins 0.000 description 1
- 210000005167 vascular cell Anatomy 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 230000001018 virulence Effects 0.000 description 1
- 238000003260 vortexing Methods 0.000 description 1
- 241000228158 x Triticosecale Species 0.000 description 1
- 238000002424 x-ray crystallography Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8242—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
- C12N15/8243—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine
- C12N15/8251—Amino acid content, e.g. synthetic storage proteins, altering amino acid biosynthesis
- C12N15/8253—Methionine or cysteine
-
- A—HUMAN NECESSITIES
- A23—FOODS OR FOODSTUFFS; TREATMENT THEREOF, NOT COVERED BY OTHER CLASSES
- A23K—FODDER
- A23K20/00—Accessory food factors for animal feeding-stuffs
- A23K20/10—Organic substances
- A23K20/142—Amino acids; Derivatives thereof
- A23K20/147—Polymeric derivatives, e.g. peptides or proteins
-
- A—HUMAN NECESSITIES
- A23—FOODS OR FOODSTUFFS; TREATMENT THEREOF, NOT COVERED BY OTHER CLASSES
- A23L—FOODS, FOODSTUFFS, OR NON-ALCOHOLIC BEVERAGES, NOT COVERED BY SUBCLASSES A21D OR A23B-A23J; THEIR PREPARATION OR TREATMENT, e.g. COOKING, MODIFICATION OF NUTRITIVE QUALITIES, PHYSICAL TREATMENT; PRESERVATION OF FOODS OR FOODSTUFFS, IN GENERAL
- A23L33/00—Modifying nutritive qualities of foods; Dietetic products; Preparation or treatment thereof
- A23L33/10—Modifying nutritive qualities of foods; Dietetic products; Preparation or treatment thereof using additives
- A23L33/17—Amino acids, peptides or proteins
- A23L33/185—Vegetable proteins
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/81—Protease inhibitors
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/81—Protease inhibitors
- C07K14/8107—Endopeptidase (E.C. 3.4.21-99) inhibitors
- C07K14/811—Serine protease (E.C. 3.4.21) inhibitors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8242—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8242—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
- C12N15/8243—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine
- C12N15/8251—Amino acid content, e.g. synthetic storage proteins, altering amino acid biosynthesis
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8242—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
- C12N15/8243—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine
- C12N15/8251—Amino acid content, e.g. synthetic storage proteins, altering amino acid biosynthesis
- C12N15/8254—Tryptophan or lysine
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Biotechnology (AREA)
- Molecular Biology (AREA)
- Zoology (AREA)
- Biomedical Technology (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Wood Science & Technology (AREA)
- General Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Biochemistry (AREA)
- Cell Biology (AREA)
- Polymers & Plastics (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Nutrition Science (AREA)
- Microbiology (AREA)
- Medicinal Chemistry (AREA)
- Gastroenterology & Hepatology (AREA)
- Food Science & Technology (AREA)
- Animal Husbandry (AREA)
- Mycology (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Enzymes And Modification Thereof (AREA)
- Organic Low-Molecular-Weight Compounds And Preparation Thereof (AREA)
Abstract
The present invention provides for polypeptides comprising protease inhibito rs with increased amounts of essential amino acids and nucleotides encoding for these peptides. Also provided are transformed plants and seeds with enhanced nutritional value due to the expression of modified polypeptides.
Description
.o~ '~icant Ref. No.: 05718-PC'f.app . , . " . . . . ~. . . ~ , .
. . . . , . . . .
. . . , , , .. ~. ... ~~.. .. ,.
PROTEINS WITH ENHANCED LEVELS
OF ESSENTIAL ArvIINO ACIDS
Field of the Invention $ The present invention relates to the field of protein engineering wherein changing amino acid compositions effects improvements in the nutrition content of feed.
Specifically, the present invention relates to methods of enhancing the nutritional content of animal feed by expressing derivatives of a protease inhibitor to provide higher percentages of essential amino acids in plants.
Background of the Invention Feed formula ns are required to provide animals essential nutrients critical to growth. However, crop plants are generally rendered food sources of poor nutritional quality because they contain low proportions of several amino acids which are essential I 5 for, but cannot be synthesized by, monogastric ;animals.
For many years researchers have attempted to improve the balance of essential amino acids in the seed proteins of important crops through breeding programs.
As more becomes known about seed storage proteins and the expression of the genes which encode these proteins, and as transformation systems are developed for a greater variety of plants, molecular approaches for improving the nutritional quality of seed proteins can provide alternatives to the more conventional approaches. Thus, specific amino acid levels can be enhanced in a given crop via biotechnology.
One_alternative method is to express .a heterologous protein of favorable amino acid composition at levels sufficient to obviate feed supplementation. For example, a number of seed proteins rich in sulfur amino acids have been identified. A key to good expression of such proteins involves efficient: expression cassettes with tissue-preferred promoters. Not only must the gene-controlling regions direct the synthesis of high levels of mRNA, the mRNA must be translated into a stable protein and over expression of this protein must not be detrimental to plant or animal health.
Among the essential amino acids needed for animal nutrition, often limiting in crop plants, are methionine, threonine, lysine, isoleucine, leucine, valine, tryptophan, phenylalanine, and histidine. Attempts to increase the levels of these free amino acids by 75529-49(S) breeding, mutant selection and/or changing the composition of the storage prc~a~;ins accumulated in crop plants has met with limited success.
A transgenic example is the phaseolin-promoted Brazil nut 2S expression cassette.
However, even though Brazil nut protein increases the amount of total methionine and bound methionine, thereby improving nutritional value, there appears to be a threshold limitation as to the total amount of methionine that is accumulated in the seeds. The seeds remain insufficient as sources of methionine and methionine supplementation is required in diets utilizing the above soybeans.
An alternative to the enhancement of specific amino acid levels by altering the levels of proteins containing the desired amino acid is modification of amino acid biosynthesis. Recombinant DNA and gene transfer technologies have been applied to alter enzyme activity catalyzing key steps in the amino acid biosynthetic pathway. See Glassman, U.S. Patent No. 5,258,300; Galili, et al., European Patent Application No.
485970; (1992). However, modification of the amino acid levels in seeds is not always correlated with changes in the level of proteins that incorporate those amino acids. See Burrow, et al., Mol. Gen. Genet.; Vol. 241;
pp. 431-439; (1993). Increases in free lysine levels in leaves and seeds have been obtained by selection for DHDPS mutants or by expressing the E. coli DHDPS in plants. However, since the level of free amino acids in seeds, in general, is only a minor fraction of the total amino acid content, these increases have been insufficient to significantly increase the total amino acid content of seed.
The IysC gene is a mutant bacterial aspartate kinase which is desensitized to feedback inhibition by lysine and threonine. Expression of this gene results in an increase in the level of lysine and threonine biosynthesis. However, expression of this gene with seed-specific expression cassettes has resulted in only a 6-7% increase in the level of Total threonine - -or lysine in the seed. See Karchi, et al., The Plant J.; Vol. 3;
pp. 721-7; (1993.
Thus, there is minimal impact on the nutritional value of seeds, and supplementation with essential amino acids is still required.
In another study (Falco et al., Biotechnology 13:577-582, 1995), manipulateon of bacterial DHDPs and aspartate kinase did result in useful increases in free lysine and total AE-plicant Ref. No.: 05718-PCT.app r t seed lysine. However, abnormal accumulation of lysine catabolites was also observed suggesting that the free lysine ool was subject to catabolism.
Based on the foregoing, there exists a need for methods of increasing the levels of essential amino acids in seeds of plants. As can be seen from the prior art, previous S approaches have led to insufficient increases in the levels of both free and bound amino acids and insignificant enhancement of the nutriitional content of the feed.
Summary of thla Invention It is one object of the present invention 1:o provide nucleic acids encoding protease inhibitors with modified levels of essential amino acids. It is an object to reduce the protease inhibitory activity in addition to modifying levels of essential amino acids and antigenic polypeptide fragments thereof. It is a further object of the present invention to provide transgenic plants comprising protease inhibitors with modified levels of essential amino acids. Additionally, it is an object of the present invention to provide methods for increasing the nutritional value of a plant and for providing an animal feed composition comprising the transgenic plants comprising protease inhibitors with modified levels of essential amino acids and reduced protease inhibitory activity. The protease inhibitor CI-2 has been modified to produce on 83 amino acid polypeptide and an amino-terminal truncated version of 65 amino acids residues.
Therefore, in one aspect, the present invention relates to a polypeptide comprising at least 10 contiguous amino acid residues frorrl a protein having Seq. ID No.
2, 4, 6, 8, 10 or 12,16,18,20,22,24; and wherein the polypeptide exhibits reduced protease inhibitor activity compared to a wild-type protein. In ont; embodiment, the present invention relates to the ahove-~entioned polypeptide comprisini; Seq. ID No. 2, 4, 6, 8, 10 or 12, 16,18,20,22,24 and the polypeptide wherein more than about 55%, but less than about 95%, more than about 55%, but less than about. 90%, or more than about 55% but less than about 85%, of the amino acid residues are essential amino acids. In some embodiments, the essential amino acid is lysine, tryptophan, methionine, threonine or mixtures thereof. In some embodiments, the present invention relates to the nucleic acid encoding the polypeptide referred to supra and in one embodiment, relates to the nucleic acid as DNA and in another embodiment to a second nucleic acid which is complementary to the DNA. Another embodirrlent relates to the polypeptide wherein more than about 10% but less than about 40% .of the amino acid residues are essential 3 ,", =. ; j_, r .. .. 'e 75529-49(S) amino acids. Another embodiment relates to the transformed plant containing the polypeptide supra. In some embodiments an animal feed composition is provided.
In another embodiment, the polypeptide referred to supra, comprises at least 20 contiguous amino acid residues.
In one aspect, the present invention relates to this polypeptide which contains or is modified to contain essential amino acids at positions 1, 8, 11, 17, 19, 34, 41, 56, 59, 62, 65, 67 or 73. In another aspect, the present invention relates to polypeptide which contains or is modified to contain essential amino acids at positions 1, 16, 23, 41, 44, 49 and 55. In other embodiments, the polypeptide comprises at least 30 contiguous amino acid residues.
In a further aspect, the present invention relates to the modification of amino acid residues in the active site of protease inhibitors. The above mentioned polypeptide contains, or is modified to contain, non-wild type amino acid residues at positions from about 53 to about 70. In some embodiments, the non-wild type amino acid residues are located at positions 58-60, 62, 65 or 67. In another embodiment, the polypeptide the non-wild type amino acid residue is located at position 59. In some embodiments, the present invention relates to the nucleic acid encoding the polypeptide referred to supra.
In another aspect the polypeptide is about 7.3 Kda or about 9.2 Kda and further comprises one or more additional amino terminal amino acid residues, and in some embodiments, the amino-terminal amino acid residue is methionine. The number of additional amino terminal amino acid residues is preferably less than 50. In another embodiment, the polypeptide is a cleavage product and in yet another, the polypeptide is recombinantly produced.
. . . . , . . . .
. . . , , , .. ~. ... ~~.. .. ,.
PROTEINS WITH ENHANCED LEVELS
OF ESSENTIAL ArvIINO ACIDS
Field of the Invention $ The present invention relates to the field of protein engineering wherein changing amino acid compositions effects improvements in the nutrition content of feed.
Specifically, the present invention relates to methods of enhancing the nutritional content of animal feed by expressing derivatives of a protease inhibitor to provide higher percentages of essential amino acids in plants.
Background of the Invention Feed formula ns are required to provide animals essential nutrients critical to growth. However, crop plants are generally rendered food sources of poor nutritional quality because they contain low proportions of several amino acids which are essential I 5 for, but cannot be synthesized by, monogastric ;animals.
For many years researchers have attempted to improve the balance of essential amino acids in the seed proteins of important crops through breeding programs.
As more becomes known about seed storage proteins and the expression of the genes which encode these proteins, and as transformation systems are developed for a greater variety of plants, molecular approaches for improving the nutritional quality of seed proteins can provide alternatives to the more conventional approaches. Thus, specific amino acid levels can be enhanced in a given crop via biotechnology.
One_alternative method is to express .a heterologous protein of favorable amino acid composition at levels sufficient to obviate feed supplementation. For example, a number of seed proteins rich in sulfur amino acids have been identified. A key to good expression of such proteins involves efficient: expression cassettes with tissue-preferred promoters. Not only must the gene-controlling regions direct the synthesis of high levels of mRNA, the mRNA must be translated into a stable protein and over expression of this protein must not be detrimental to plant or animal health.
Among the essential amino acids needed for animal nutrition, often limiting in crop plants, are methionine, threonine, lysine, isoleucine, leucine, valine, tryptophan, phenylalanine, and histidine. Attempts to increase the levels of these free amino acids by 75529-49(S) breeding, mutant selection and/or changing the composition of the storage prc~a~;ins accumulated in crop plants has met with limited success.
A transgenic example is the phaseolin-promoted Brazil nut 2S expression cassette.
However, even though Brazil nut protein increases the amount of total methionine and bound methionine, thereby improving nutritional value, there appears to be a threshold limitation as to the total amount of methionine that is accumulated in the seeds. The seeds remain insufficient as sources of methionine and methionine supplementation is required in diets utilizing the above soybeans.
An alternative to the enhancement of specific amino acid levels by altering the levels of proteins containing the desired amino acid is modification of amino acid biosynthesis. Recombinant DNA and gene transfer technologies have been applied to alter enzyme activity catalyzing key steps in the amino acid biosynthetic pathway. See Glassman, U.S. Patent No. 5,258,300; Galili, et al., European Patent Application No.
485970; (1992). However, modification of the amino acid levels in seeds is not always correlated with changes in the level of proteins that incorporate those amino acids. See Burrow, et al., Mol. Gen. Genet.; Vol. 241;
pp. 431-439; (1993). Increases in free lysine levels in leaves and seeds have been obtained by selection for DHDPS mutants or by expressing the E. coli DHDPS in plants. However, since the level of free amino acids in seeds, in general, is only a minor fraction of the total amino acid content, these increases have been insufficient to significantly increase the total amino acid content of seed.
The IysC gene is a mutant bacterial aspartate kinase which is desensitized to feedback inhibition by lysine and threonine. Expression of this gene results in an increase in the level of lysine and threonine biosynthesis. However, expression of this gene with seed-specific expression cassettes has resulted in only a 6-7% increase in the level of Total threonine - -or lysine in the seed. See Karchi, et al., The Plant J.; Vol. 3;
pp. 721-7; (1993.
Thus, there is minimal impact on the nutritional value of seeds, and supplementation with essential amino acids is still required.
In another study (Falco et al., Biotechnology 13:577-582, 1995), manipulateon of bacterial DHDPs and aspartate kinase did result in useful increases in free lysine and total AE-plicant Ref. No.: 05718-PCT.app r t seed lysine. However, abnormal accumulation of lysine catabolites was also observed suggesting that the free lysine ool was subject to catabolism.
Based on the foregoing, there exists a need for methods of increasing the levels of essential amino acids in seeds of plants. As can be seen from the prior art, previous S approaches have led to insufficient increases in the levels of both free and bound amino acids and insignificant enhancement of the nutriitional content of the feed.
Summary of thla Invention It is one object of the present invention 1:o provide nucleic acids encoding protease inhibitors with modified levels of essential amino acids. It is an object to reduce the protease inhibitory activity in addition to modifying levels of essential amino acids and antigenic polypeptide fragments thereof. It is a further object of the present invention to provide transgenic plants comprising protease inhibitors with modified levels of essential amino acids. Additionally, it is an object of the present invention to provide methods for increasing the nutritional value of a plant and for providing an animal feed composition comprising the transgenic plants comprising protease inhibitors with modified levels of essential amino acids and reduced protease inhibitory activity. The protease inhibitor CI-2 has been modified to produce on 83 amino acid polypeptide and an amino-terminal truncated version of 65 amino acids residues.
Therefore, in one aspect, the present invention relates to a polypeptide comprising at least 10 contiguous amino acid residues frorrl a protein having Seq. ID No.
2, 4, 6, 8, 10 or 12,16,18,20,22,24; and wherein the polypeptide exhibits reduced protease inhibitor activity compared to a wild-type protein. In ont; embodiment, the present invention relates to the ahove-~entioned polypeptide comprisini; Seq. ID No. 2, 4, 6, 8, 10 or 12, 16,18,20,22,24 and the polypeptide wherein more than about 55%, but less than about 95%, more than about 55%, but less than about. 90%, or more than about 55% but less than about 85%, of the amino acid residues are essential amino acids. In some embodiments, the essential amino acid is lysine, tryptophan, methionine, threonine or mixtures thereof. In some embodiments, the present invention relates to the nucleic acid encoding the polypeptide referred to supra and in one embodiment, relates to the nucleic acid as DNA and in another embodiment to a second nucleic acid which is complementary to the DNA. Another embodirrlent relates to the polypeptide wherein more than about 10% but less than about 40% .of the amino acid residues are essential 3 ,", =. ; j_, r .. .. 'e 75529-49(S) amino acids. Another embodiment relates to the transformed plant containing the polypeptide supra. In some embodiments an animal feed composition is provided.
In another embodiment, the polypeptide referred to supra, comprises at least 20 contiguous amino acid residues.
In one aspect, the present invention relates to this polypeptide which contains or is modified to contain essential amino acids at positions 1, 8, 11, 17, 19, 34, 41, 56, 59, 62, 65, 67 or 73. In another aspect, the present invention relates to polypeptide which contains or is modified to contain essential amino acids at positions 1, 16, 23, 41, 44, 49 and 55. In other embodiments, the polypeptide comprises at least 30 contiguous amino acid residues.
In a further aspect, the present invention relates to the modification of amino acid residues in the active site of protease inhibitors. The above mentioned polypeptide contains, or is modified to contain, non-wild type amino acid residues at positions from about 53 to about 70. In some embodiments, the non-wild type amino acid residues are located at positions 58-60, 62, 65 or 67. In another embodiment, the polypeptide the non-wild type amino acid residue is located at position 59. In some embodiments, the present invention relates to the nucleic acid encoding the polypeptide referred to supra.
In another aspect the polypeptide is about 7.3 Kda or about 9.2 Kda and further comprises one or more additional amino terminal amino acid residues, and in some embodiments, the amino-terminal amino acid residue is methionine. The number of additional amino terminal amino acid residues is preferably less than 50. In another embodiment, the polypeptide is a cleavage product and in yet another, the polypeptide is recombinantly produced.
75529-49(S) In a further aspect, the present invention relates to an expression cassette comprising the nucleic acids as described supra, operably linked to a promoter providing for protein expression. In some embodiments, the promoter provides for protein expression in plants and in others the promoter provides for protein expression in bacteria, yeast or virus.
In yet another aspect, the present invention is directed to transformed plant cells containing the expression cassette described supra.
4a Ap,.~.licant Ref. No.: 057IR-PCT.app In another aspect, the present invention is directed to transformed plants containing at least one copy of the expression caasette described supra. In some embodiments, there is a seed of this transformed plant.
Another aspect of this invention provides a polypeptide produced by substituting an essential amino acid for at least one but less than 50 amino acid residues in a protease inhibitor for enhancing nutritional value of feed.
In another aspect, the present invention relates to polypeptides supra wherein hydrogen bonding is disrupted in the active site loop of the inhibitor.
In yet another aspect, the present invention relates to the polypeptide sa~pra which exhibits decreased protease inhibitor activity as compared to the wild-type protein which does not have substituted amino acid residues. In some embodiments nucleic acid encodes a protease inhibitor protein with decreased inhibitory activity.
In another aspect, the present invention relates to the polypeptide supra which exhibits less than about 30% of the inhibitor activity compared to corresponding wild-type protein which does not have substituted amino acid residues.
In another aspect, the present invention relates to a nucleic acid comprising the sequence of SEQ ID No. 1,3,5,7,9,11,15,17,19,21, or 23 or a nucleic acid having at least 70% identity thereto, wherein the nucleic acid encodes for a polypeptide which exhibits reduced protease inhibitor activity compared to a wild type protein. In one embodiment, the polypep~id_s exhibits 80% identity and in another embodiment, 90%.
In yet another aspect, the present invention relates to a nucleic acid encoding a protease inhibitor protein wherein nucleotides have been substituted to increase the number of essential amino acids in the encoded protein. In one embodiment, the inhibitor protein is derived from a plant. In another emb~~diment, the inhibitor protein is a chymotrypsin inhibitor- like protein.
In another aspect, the present invention relates to an expression cassette comprising the nucleic acid encoding the polypeptide supra, operably linked to a promoter providing for protein expression. In some embodiments, the promoter provides e_v. ~ ..
AFplicant Ref. No.: 0571 R-PCT.app r .. .. . .
. . . . . . , . , . . . < . < . .
< . ~ . .
for protein expression in plants. In some embodiments, the promoter provides for protein expression in bacteria, yeast or virus.
In yet another aspect, the transformed plant containing at least one copy of the expression casette supra. In some embodiments, the transformed plant is a monocotyledonous plant and could be selected from the group consisting of maize, sorghum, wheat, rice and barley. In some embodiments, the transformed plant is a dicotyledonous plant and could be selected from the group consisting of soybean, alfalfa, canola, sunflower, tobacco, tomato and canola. :Preferably, the transformed plant is maize or soybeans. In some embodiments seed is produced by the transformed plant. In some embodiments an animal feed composition is provided, and in some, the animal feed composition is the seed.
In another aspect, the present invention relates to transformed plant cells containing the expression cassette supra.
In another aspect, the present invention relates to a method for increasing the 1 ~ nutritional value of a plant comprising introducing into the cells of the plant the expression cassette supra to yield transformed plant cells and regenerating a transformed plant from the transformed plant cells.
The present invention provides a method for genetically modifying protease inhibitors to increase the level of at least, but not limited to one, essential amino acid in a plant so as to enhance the nutritional value of the plant. The methods comprise the introducW'on f an expression cassette into re;~enerable plant cells to yield transformed plant cells. The expression cassette comprises a nucleotide encoding a protease inhibitor operably linked to a promoter functional in plant cells.
A fertile transgenic plant is regenerated from the transformed cells, and seeds are isolated from the plant. The seeds comprise the polypeptide which is encoded by the DNA segment and which is produced in an amount sufficient to increase the amount of the essential amino acid in the seeds of the transformed plants, relative to the amount of the essential amino acid in the seeds of a corresponding untransformed plant, e.g., the seeds of a regenerated control plant that is not transformed or corresponding untransformed seeds isolated from the transformed plant.
In yet another aspect, the present invention is directed to transformed plant cells containing the expression cassette described supra.
4a Ap,.~.licant Ref. No.: 057IR-PCT.app In another aspect, the present invention is directed to transformed plants containing at least one copy of the expression caasette described supra. In some embodiments, there is a seed of this transformed plant.
Another aspect of this invention provides a polypeptide produced by substituting an essential amino acid for at least one but less than 50 amino acid residues in a protease inhibitor for enhancing nutritional value of feed.
In another aspect, the present invention relates to polypeptides supra wherein hydrogen bonding is disrupted in the active site loop of the inhibitor.
In yet another aspect, the present invention relates to the polypeptide sa~pra which exhibits decreased protease inhibitor activity as compared to the wild-type protein which does not have substituted amino acid residues. In some embodiments nucleic acid encodes a protease inhibitor protein with decreased inhibitory activity.
In another aspect, the present invention relates to the polypeptide supra which exhibits less than about 30% of the inhibitor activity compared to corresponding wild-type protein which does not have substituted amino acid residues.
In another aspect, the present invention relates to a nucleic acid comprising the sequence of SEQ ID No. 1,3,5,7,9,11,15,17,19,21, or 23 or a nucleic acid having at least 70% identity thereto, wherein the nucleic acid encodes for a polypeptide which exhibits reduced protease inhibitor activity compared to a wild type protein. In one embodiment, the polypep~id_s exhibits 80% identity and in another embodiment, 90%.
In yet another aspect, the present invention relates to a nucleic acid encoding a protease inhibitor protein wherein nucleotides have been substituted to increase the number of essential amino acids in the encoded protein. In one embodiment, the inhibitor protein is derived from a plant. In another emb~~diment, the inhibitor protein is a chymotrypsin inhibitor- like protein.
In another aspect, the present invention relates to an expression cassette comprising the nucleic acid encoding the polypeptide supra, operably linked to a promoter providing for protein expression. In some embodiments, the promoter provides e_v. ~ ..
AFplicant Ref. No.: 0571 R-PCT.app r .. .. . .
. . . . . . , . , . . . < . < . .
< . ~ . .
for protein expression in plants. In some embodiments, the promoter provides for protein expression in bacteria, yeast or virus.
In yet another aspect, the transformed plant containing at least one copy of the expression casette supra. In some embodiments, the transformed plant is a monocotyledonous plant and could be selected from the group consisting of maize, sorghum, wheat, rice and barley. In some embodiments, the transformed plant is a dicotyledonous plant and could be selected from the group consisting of soybean, alfalfa, canola, sunflower, tobacco, tomato and canola. :Preferably, the transformed plant is maize or soybeans. In some embodiments seed is produced by the transformed plant. In some embodiments an animal feed composition is provided, and in some, the animal feed composition is the seed.
In another aspect, the present invention relates to transformed plant cells containing the expression cassette supra.
In another aspect, the present invention relates to a method for increasing the 1 ~ nutritional value of a plant comprising introducing into the cells of the plant the expression cassette supra to yield transformed plant cells and regenerating a transformed plant from the transformed plant cells.
The present invention provides a method for genetically modifying protease inhibitors to increase the level of at least, but not limited to one, essential amino acid in a plant so as to enhance the nutritional value of the plant. The methods comprise the introducW'on f an expression cassette into re;~enerable plant cells to yield transformed plant cells. The expression cassette comprises a nucleotide encoding a protease inhibitor operably linked to a promoter functional in plant cells.
A fertile transgenic plant is regenerated from the transformed cells, and seeds are isolated from the plant. The seeds comprise the polypeptide which is encoded by the DNA segment and which is produced in an amount sufficient to increase the amount of the essential amino acid in the seeds of the transformed plants, relative to the amount of the essential amino acid in the seeds of a corresponding untransformed plant, e.g., the seeds of a regenerated control plant that is not transformed or corresponding untransformed seeds isolated from the transformed plant.
AMENDED S~tEET
75529-49(S) Preferably, the substantiated amino acid is an essential amino acid. More preferably, tryptophan threonine, methionine and lysine arE: the substituted essential amino acid.
Even more preferably, the additional essential amino acid is lysine.
A preferred embodiment of the present invention is the introduction of an Expression cassette into regenerable plant cells. Also preferred is the introduction of an expression cassette comprising a DNA segment encoding an endogenous or modified polypeptide sequence.
The present invention also encompasses variations in the sequences described above, wherein such variations are due to site-directed mutagenesis, or other mechanisms known in the art, to increase or decrease levels of selected amino acids of interest. For example, :>ite-directed mutagenesis to increase levels of essential amino acids is a preferred embodiment.
The present invention also provides a fertile transgenic plant. The fertile transgenic plant contains an isolated DNA segment comprising a promoter and encoding a protein comprising a protease inhibitor, modified by increasing the number of essential amino acids, under the control of the promoter. The protease inhibitor is expressed as so that the level of essential amino acids in the seeds of the transgenic plant is increased above the level in the seeds of a plant which only differ from t:he seeds of the transgenic plant in that the DNA segment or t:he encoded seed protein is under the control of a different promoter. The DNA segment is transmitted through a cc>mplete normal sexual cycle of the transgenic plant to the next generation. The present invention provides nucleotide sequences encoding proteins containing higher levels of essential amino acids by the substitution of 75529-49(S) one or more of the amino acid residues in the protease inhibitor. Substitutions at one or more of, but not limited to, positions 1, 8, 11, 17, 19, 34, 41, 56, 59, 62, 67 and 73 of the wild type protein are substituted with essential amino acids. The present invention also involves the expression of the present chymotrypsin inhibitor derivatives or any derived protease inhibitor in plants to provide higher percentages of essential amino acids in plants than wild type plants.
In a preferred embodiment of the present invention, the present derivatives also exhibit reduced protease inhibitor activity. This is achieved by substituting the amino acid residues from about amino acid residue 53 to about amino acid residue 70 with residues other than the wild type residues.
In one aspect, there is described an isolated polypeptide comprising a modified variant of SEQ ID N0: 14, or a modified variant of the sequence from position 19 to position 83 of SEQ ID N0: 14, wherein the modified variant:
(a) contains a higher percentage of essential amino acids than either SEQ ID N0: 14 or the sequence from position 19 to position 83 of SEQ ID NO: 14; (b) has greater than 600 amino acid similarity to SEQ ID N0: 14 or the sequence from position 19 to position 83 of SEQ ID N0: 14, wherein the percent sequence similarity is based on the entire sequence and is determined by BLAST 2.0 using default parameters; and (c) contains an essential amino acid at a position corresponding to a position of SEQ ID N0: 14 selected from the group consisting of 1, 8, 17, 19, 34, 41, and 67, or contains a lysine at a position corresponding to a position of SEQ ID N0: 14 selected from the group consisting of 56, 59, 62 and 73.
7a 75529-49(S) In another aspect, there is described an isolated polypeptide comprising a modified variant of SEQ ID N0: 14, or a modified variant of the sequence from position 19 to position 83 of SEQ ID N0: 14, wherein the modified variant:
(a) contains a higher percentage of essential amino acids than either SEQ ID N0: 14 or the sequence from position 19 to position 83 of SEQ ID N0: 14; (b) has greater than 600 amino acid similarity to SEQ ID NO: 14 or the sequence from position 19 to position 83 of SEQ ID N0: 14, wherein the percent sequence similarity is based on the entire sequence and is determined by BLAST 2.0 using default parameters; and (c) is modified at at least 11 positions of SEQ ID N0: 14 to contain essential amino acids at said at least 11 positions.
In another aspect, there is described an isolated polypeptide comprising a modified variant of SEQ ID N0: 14, or a modified variant of the sequence from position 19 to position 83 of SEQ ID N0: 14, wherein the modified variant:
(a) contains a higher percentage of essential amino acids than either SEQ ID N0: 14 or the sequence from position 19 to position 83 of SEQ ID NO: 14; (b) has greater than 600 amino acid similarity to SEQ ID N0: 14 or the sequence from position 19 to position 83 of SEQ ID N0: 14, wherein the percent sequence similarity is based on the entire sequence and is determined by BLAST 2.0 using default parameters; and (c) contains a pair of cysteines at at least one pair of positions corresponding to SEQ ID N0: 14 positions Glu-23 and Arg-81, Thr-22 and Val-82, or Val-53 and Val-70.
In another aspect, there is described an isolated polypeptide comprising a modified variant of SEQ ID NO: 14, or a modified variant of the sequence from position 19 to position 83 of SEQ ID NO: 14, wherein the modified variant:
(a) contains at least 55~ essential amino acids; (b) has greater than 60o amino acid similarity to SEQ ID N0: 14 or 7b 75529-49(S) the sequence from position 19 to position 83 of SEQ ID
N0: 14, wherein the percent sequence similarity is based on the entire sequence and is determined by BLAST 2.0 using default parameters; and (c) contains a pair of cysteines at at least one pair of positions corresponding to SEQ ID
NO: 14 positions Glu-23 and Arg-81, Thr-22 and Val-82, or Val-53 and Val-70.
In another aspect, there is described an isolated nucleic acid encoding the polypeptide of the invention.
In another aspect, there is described a recombinant expression cassette comprising the nucleic acid of the invention operably linked to a promoter.
In another aspect, there is described a transformed plant cell comprising the recombinant expression cassette of the invention.
In another aspect, there is described an animal feed composition comprising plant tissue, wherein the plant tissue comprises the polypeptide of the invention.
In another aspect, there is described a method for increasing the nutritional value of a plant comprising:
(a) introducing into cells of the plant a recombinant expression cassette of the invention, wherein the promoter provides for protein expression in plants, to yield transformed plant cells, and (b) regenerating a transformed plant from the transformed plant cells.
In another aspect, there is described use of at least one recombinant expression cassette of the invention, wherein the promoter provides for protein expression in plants, in the preparation of a transformed plant.
7c 75529-49(S) In another aspect, there is described use of at least one recombinant expression cassette of the invention, wherein the promoter provides for protein expression in plants, for the preparation of a seed of a transformed plant.
In another aspect, there is described use of the plant cell of the invention in the preparation of an animal feed composition.
7d AFplicant Ref. No.: 0571 R-PCT.app Methods for expressing the modified protease inhibitors and for using plants are also provided to enhance the nutritional value ol~animal feed.
It is therefore an object of the present invention to provide methods for increasing the levels of the essential amino acids in the seeds of plants used for animal feed.
It is a further object of the present invention to provide seeds for food and/or feed with higher levels of the essential amino acid, lysine, than wild type species of the same seeds.
It is a further object of the present invention to provide seeds for food and/or feed such that the level of the essential amino acids is increased such that the need for feed supplementation is greatly reduced or obviated.
It is one object of the present invention ro provide nucleic acids encoding enzymes involved in protease inhibition and antigenic polypeptide fragments thereof.
It is also an object of the present invention to provide protease inhibitor polypeptides and antigenic fragments thereof. It is a further object of the present invention to provide transgenic 1 S plants comprising protease inhibitor nucleic acids. Additionally, it is an object of the present invention to provide methods for modulating, in a transgenic plant, the expression of protease inhibitor polynucleotides of the present invention.
Therefore, in one aspect, the present invention relates to an isolated nucleic acid comprising a member selected from the group consisting of (a)a polynucleotide having at least 70% identity to a polynucleotide encoding a polypeptide selected from the group consisting of SEQ ID NOS: 2,4,6,8,10 and l2,116,18,20,22,24;and (b) a polynucleotide which is complementary to the polynucleotide of (a); and (c) a polynucleotide comprising at least 3~-~tiguous nucleotides from a polyrmcleotide of (a) or (b). In some embodiments, the polynucleotide has a sequence selected from the group consisting of SEQ ID NOS: 1,3,5,7,9 and 11, 15,17,19,21, or 23 . The isolated nucleic acid can be DNA.
In another aspect, the present invention relates to recombinant expression cassettes, comprising a nucleic acid as describf:d, supra, operably linked to a promoter.
In some embodiments, the nucleic acid is operably linked in antisense orientation to the promoter.
In another aspect, the present invention is directed to a host cell transfected with the recombinant expression cassette as described, supra. In some embodiments, the host ~1,~L'~J"r~': ~'i ~C.' J _ .:, i:_~
Applicant Ref. No.: 05718-PCT.app cell is a maize, rye, barley, wheat, sorghum, oa~a, millet, rice, triticale, sunflower, alfalfa, rapeseed or soybean cell.
In a further aspect, the present invention relates to an isolated protein comprising a polypeptide of at least 10 contiguous amino acids encoded by the isolated nucleic acid referred to, supra. In some embodiments, the polypeptide has a sequence selected from the group consisting of SEQ ID NOS: 2,4,6,8,10 and 12,16,18,20,22,24.
In another aspect, the present invention relates to an isolated nucleic acid comprising a polynucleotide of at least 30 nuclE;otides in length which selectively hybridizes under stringent conditions to a nucleic acid selected from the group consisting of SEQ ID NOS: 1,3,5,7,9 and 11, 15,17,19,21, 23 or a complement thereof. In some embodiments, the isolated nucleic acid is operably linked to a promoter.
In yet another aspect, the present invention relates to an isolated nucleic acid comprising a polynucleotide, the polynucleotide having at least 60% sequence identity to an identical length of a nucleic acid selected from the group consisting of SEQ ID NOS:
1,3,5,7,9 and 1 l, 15,17,19,21, 23 or a complement thereof.
In another aspect, the present invention :relates to an isolated nucleic acid comprising a polynucleotide having a sequence of a nucleic acid amplified from a Zea mays nucleic acid library using the primers selected from the group consisting of: SEQ ID
NOS: 25 and 26 or complements thereof. In some embodiments, the nucleic acid library is a cDNA library.
In another aspect, the present invention :relates to a recombinant expression cassette comprising a nucleic acid amplified from a library as referred to szrpra, wherein the nucleicd is operably linked to a promoter. In some embodiments, the present invention relates to a host cell transfected with l:his recombinant expression cassette In some embodiments, the present invention relates to a protease inhibitor protein produced from this host cell.
In a further aspect, the present invention relates to a heterologous promoter operably linked to a non-isolated protease inhibitor polynucleotide encoding a polypeptide, wherein the polypeptide is encoded by a nucleic acid amplified from a nucleic acid library as referred to, supra.
In yet another aspect, the present invention relates to a transgenic plant comprising a recombinant expression cassette comprising a plant promoter operably linked to any of : ' r,-75529-49(S) the isolated nucleic acids referred to supra. In some embodiments, the transgenic plant is Zea mays. The present invention also provides transgenic seed from the transgenic plant.
In a further aspect, the present invention relates to a method of providing a modified protease inhibitor in a plant, comprising the steps of (a) transforming a plant cell with a recombinant expression casette comprising a protease inhibitor polynucleotide operably linked to a promoter; (b) growing the plant cell under plant growing conditions; and (c) inducing expression of the polynucleotide.
Applicant Ref. No.: 05718-PCT.app F~,ure IistinE
Figure 1 Protease Inhibition Sequence identification DETAILED DESCRIPTION
Barley High Lysine 1(BHL-1) is coded for by the polypeptides of SEQ ID
No. 2 which is encoded for by the nucleic acid of SEQ ID No. 1.
Barley High Lysine 2 (BHL-2) its coded for by the polypeptides of SEQ
ID No. 4 which is encoded for by the nucleic acid of SEQ ID No. 3.
Barley High Lysine 3 (BHL-3) i;s coded for by the polypeptides of SEQ ID
No. 6 which is encoded for by the nucleic acid of SEQ ID No. 5.
Barley High Lysine 3N (BHL-3N) is coded for by the polypeptides of SEQ
ID No. 8 which is encoded for by the nucleic acid of SEQ ID No. 7.
Barley High Lysine 1N (BHL-1N) is coded for by the polypeptides of SEQ
1 ~ ID No. 10 which is encoded for by the nucleic acid of SEQ ID No. 9.
Barley High Lysine 2N (BHL-2N) is coded for by the polypeptides of SEQ
ID No. 12 which is encoded for by the nucleic acid of SEQ ID No. 11.
Wild-type chymotrypsin inhibitor (WI-CI-2) is coded for by the polypeptides of SEQ ID No. 14 which is encoded for by the nucleic acid of SEQ
ID No. 13.
Maize EST PI-1 is coded for by the polypeptides of SEQ ID No.l6 which is encoded for by the nucleic acid of SEQ ID No. 15.
Maize EST PI-2 is coded for by the polypeptides of SEQ ID No.18 which is encoded for by the nucleic acid of SEQ ID No. 17.
Maize EST PI-3 is coded for by the polypeptides of SEQ ID No.20 which is encoded for by the nucleic acid of SEQ ID No. 19.
Maize EST PI-4 is coded for by the polypeptides of SEQ ID No.22 which is encoded for by the nucleic acid of SEQ ID No. 21.
Maize EST PI-Sis coded for by t:he polypeptides of SEQ ID No. 24 which is encoded for by the nucleic acid of SEQ ID No. 23.
The 5' and 3' PCR primer pairs A & B, are identified as SEQ ID Nos. 25 and 26, respectively.
Applicant Ref. No.: 05718-PCT.app Definitions Units, prefixes, and symbols may be denoted in their SI accepted form. Unless otherwise indicated, nucleic acids are written left to right in 5' to 3' orientation; amino acid sequences are written left to right in amino to carboxy orientation, respectively.
Numeric ranges are inclusive of the numbers defining the range. Amino acids may be referred to herein by either their commonly known three letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly accepted single-letter codes. The terms defined below are more fully defined by reference to the specification as a whole.
"Chymotrypsin inhibitor-like" protein is a protein with a sequence identity of 40%
or more to the CI-2 from barley.
"%" refers to molar % unless otherwise specified or implied.
"Essential amino acids" are amino acids that must be obtained from an external source because they are not synthesized by the individual. They are comprised of:
methionine, threonine, lysine, isoleucine, leucine, valine, tryptophan, phenylalanine, and histidine.
By "amplified" is meant the construction of multiple copies of a nucleic acid sequence or multiple copies complementar)~ to the nucleic acid sequence using at least one of the nucleic acid sequences as a template. Amplification systems include the polymerase~ain reaction (PCR) system, ligase chain reaction (LCR) system, nucleic acid sequence based amplification (NASBA, Cangene, Mississauga, Ontario), Q-Beta Replicase systems, transcription-based amplification system (TAS), and strand displacement amplification (SDA). See, e.g., Dicxgnostic Molecular Microbiology:
Principles and Applications, D. H. Persing et al., Ed., American Society for Microbiology, Washington, D.C. (1993).
As used herein, "antisense orientation" includes reference to a duplex polynucleotide sequence which is operably linked to a promoter in an orientation where 12 l'e~~EPJCeB S;-~EE
Applicant Ref. No.: 05718-PCT.app the antisense strand is transcribed. The antisense strand is sufficiently complementary to an endogenous transcription product such that translation of the endogenous transcription product is often inhibited.
As used herein, "chromosomal region" includes reference to a length of chromosome which may be measured by reference to the linear segment of DNA
which it comprises. The chromosomal region can be defined by reference to two unique DNA
sequences, i.e., markers.
The term "conservatively modified variants" applies to both amino acid and nucleic acid sequences. With respect to particular nucleic acid sequences, conservatively modified variants refers to those nucleic acids which encode identical or essentially identical amino acid sequences, or where the nucleic acid does not encode an amino acid sequence, to essentially identical sequences. Because of the degeneracy of the genetic code, a large number of functionally identical nucleic acids encode any given protein. For instance, the codons GCA, GCC, GCG and GCU all encode the amino acid alanine.
Thus, at every position where an alanine is specified by a codon, the codon can be altered to any of the corresponding codons described without altering the encoded polypeptide.
Such nucleic acid variations are "silent variations" and represent one species of conservatively modified variation. Every nucleic acid sequence herein which encodes a polypeptide also describes every possible silent variation of the nucleic acid. One of ordinary skill will recognize that each codon in a nucleic acid (except AUG, which is ordinarily the only codon for methionine, and TGG, which is ordinarily the only codon for tryptophan) can be modified to yield a functionally identical molecule.
Accordingly, each sileiation of a nucleic acid which encodes a polypeptide of the present invention is implicit in each described polypeptide sequence and incorporated herein by reference.
As to amino acid sequences, one of skill will recognize that individual substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein sequence which alters, adds or deletes a single amino acid or a small percentage of amino acids in the encoded sequence is a "conservatimely modified variant" where the alteration results in the substitution of an amino acid with a chemically similar amino acid. Thus, any number of amino acid residues selected from the group of integers consisting of from 1 to 15 can be so altered. Thus, for example, 1, 2, 3, 4, 5, 7, or 10 alterations can be :.-: ,~ ~ ~.y ~;
f. _ ~ . a .:,U:..., ~.W :~ t:.
Applicant Ref. No.: 0571 R-PCT.app made. Conservatively modified variants typically provide similar biological activity as the unmodified polypeptide sequence from which they are derived. For example, substrate specificity, enzyme activity, or ligancUreceptor binding is generally at least 30%, 40%, 50%, 60%, 70%, 80%, or 90% of the native protein for it's native substrate.
Conservative substitution tables providing functionally similar amino acids are well known in the art.
The following six groups each contain s~mino acids that are conservative substitutions for one another:
1 ) Alanine (A), Serine (S), Threonine (T);
2) Aspartic acid (D), Glutamic acid (E);
3) Asparagine (N), Glutamine (Q);
4) Arginine (R), Lysine (K);
5) Isoleucine (I), Leucine (L), Methionine (M), Valine (V); and 6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W).
See also, Creighton ( 1984) Proteins W.H. Freeman and Company.
By "encoding" or "encoded", with respect to a specified nucleic acid, is meant comprising the information for translation into the specified protein. A
nucleic acid encoding a protein may comprise non-translated sequence (e.g., introns) within translated regions of the nucleic acid, or may lack such intervening non-translated sequences (e.g., as in cDNA). The information by which a protein is encoded is specified by the use of codons. Typically, the amino acid sequence is encoded by the nucleic acid using the "universal" genetic code. However, variants of the universal code, such as is present in some plant=aiiimal, and fungal mitochondria, the bacterium Mycoplasma capricolz~m (Proc. Natl. Acad. Sci. (USA), 82: 2306-2309 (1985)), or the ciliate Macronucleus, may be used when the nucleic acid is expressed using these organisms.
When the nucleic acid is prepared or altered synthetically, advantage can be taken of known codon preferences of the intended host where the nucleic acid is to be expressed. For example, although nucleic acid sequences of the present invention may be expressed in both monocotyledonous and dicot;yledonous plant species, sequences can be modified to account for the specific codon pref°rences and GC content preferences of monocotyledons or dicotyledons as these preferences have been shown to differ (Murray et al. Nucl. Acids Res. 17: 477-498 (1989)). Tllus, the maize preferred codon for a 14 ' . _.' '-'; z~
~.. r Applicant Ref. No.: 05718-PCT.app particular amino acid may be derived from known gene sequences from maize.
Maize codon usage for 28 genes from maize plants are listed in Table 4 of Murray et al., szrpra.
As used herein "full-length sequence" includes reference to a protease inhibitor polynucleotide or the encoded protein having t:he entire amino acid sequence of, a native (non-synthetic), endogenous, catalytically active form of a protein involved in protease inhibition. A full-length sequence can be determined by size comparison relative to a control which is a native (non-synthetic) endogenous cellular protease inhibitor nucleic acid or protein. Methods to determine whether a sequence is full-length are well known in the art including such exemplary techniques as northern or western blots.
See, e.g., Plant Moleczrlar Biology: A Laboratory Manzrczl, Clark, Ed., Springer-Verlag, Berlin (1997). Comparison to known full-length homologous sequences can also be used to identify full-length sequences of the present invention. Additionally, consensus sequences typically present at the 5' and 3' unt:ranslated regions of mRNA aid in the identification of a polynucleotide as full-length. For example, the consensus sequence ANNNNAUGG, where the underlined codon represents the N-terminal methionine, aids in determining whether the polynucleotide has a complete 5' end. Consensus sequences at the 3' end, such as polyadenylation sequences, aid in determining whether the polynucleotide has a complete 3' end.
As used herein, "heterologous" in reference to a nucleic acid is a nucleic acid that originates from a foreign species, or, if from thc: same species, is substantially modified from its native form in composition and/or genomic locus. For example, a promoter operably linked to a heterologous structural gene is from a species different from that from whiE structural gene was derived, or, if from the same species, one or both are substantially modified from their original form. A heterologous protein may originate from a foreign species or, if from the same species, is substantially modified from its original form.
By "host cell" is meant a cell which contains a vector and supports the replication and/or expression of the expression vector. Host cells may be prokaryotic cells such as E.
coli, or eukaryotic cells such as yeast, insect, amphibian, or mammalian cells. Preferably, host cells are monocotyledonous or dicotyledenous plant cells. A particularly preferred monocotyledonous host cell is a maize host cell.
Applicant Ref. No.: 0>71R-PCT.app The term "hybridization complex" includes reference to a duplex nucleic acid sequence formed by two single-stranded nucleic acid sequences which selectively hybridize with each other.
The terms "isolated" or "biologically pure" refer to material which is: (1) substantially or essentially free from components which normally accompany or interact with it as found in its naturally occurring environment. The isolated material optionally comprises material not found with the material in its natural environment. (2) If the material is in its natural environment, the material has been synthetically (non-naturally) altered to a composition and/or placed at a loccls in the cell (e.g., genome) not native to a material found in that environment. The alteration to yield the synthetic material can be performed on the material within or removed from its natural state. For example, a naturally occurring nucleic acid becomes an isolated nucleic acid if it is altered, or if it is transcribed from DNA which is altered, by non-natural, synthetic (i.e., "man-made") methods performed within the cell from which it originates. See, e.g., Compounds and Methods for Site Directed Mutagenesis in Eukaryotic Cells, Kmiec, U.S. Patent No.
5,565,350; In Vivo Homologous Sequence Targeting in Eukaryotic Cells; Zarling et al., PCT/LJS93/03868. Likewise, a naturally occurring nucleic acid (e.g., a promoter) become isolated if it is introduced by non-naturally occurring means to a locus of the genome not native to that nucleic acid.
The term "protease inhibitor nucleic acids" means an isolated nucleic acid comprising a polynucleotide (a "protease inhibitor polynucleotide") encoding a polypeptide involved in protease inhibition.
As-t»d herein, "localized within the chromosomal region defined by and including" with respect to particular markers includes reference to a contiguous length of a chromosome delimited by and including the stated markers.
As used herein, "marker" includes referE:nce to a locus on a chromosome that serves to identify a unique position on the chromosome. A "polymorphic marker"
includes reference to a marker which appears in multiple forms (alleles) such that different forms of the marker, when they are preaent in a homologous pair, allow transmission of each of the chromosomes in that pair to be followed. A
genotype may be defined by use of a single or a plurality of markers.
16 ,~MFNDED ~:r'=1 75529-49(S) As used herein, "nucleic acid" includes reference to a deoxyribonucleotide or ribonucleotide polymer in either single- or double-stranded form, and unless otherwise limited, encompasses known analogues of natural nucleotides that hybridize to single-stranded nucleic acids in a manner similar to naturally occurring nucleotides {e.g., peptide nucleic acids).
By "nucleic acid library" is meant a collection of isolated DNA or RNA
molecules which comprise and substantially represent the entire transcribed fraction of a genome of a specified organism. Construction of exemplary nucleic acid libraries, such as genomic and cDNA libraries, is taught in standard molecular biology references such as Berger and Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology, Vol.
152, Academic Press, Inc., San Diego, CA (Berger); Sambrook et al., Molecular Cloning - A
Laboratory Manual, 2nd ed., Vol. 1-3 (1989); and Current Protocols in Molecular Biology, F.M. Ausubel et al., Eds., Current Protocols, a joint venture between Greene Publishing Associates, Inc, and John Wiley & Sons, Inc. (1994 Supplement).
I S As used herein "operably linked" includes reference to a functional linkage between a promoter and a second sequence, wherein the promoter sequence initiates'and mediates transcription of the DNA sequence corresponding to the second sequence.
Generally, operably linked means that the nucleic acid sequences being linked are contiguous and, where necessary to join two protein coding regions, contiguous and in the same reading frame.
As used herein, the term "plant" includes reference to whole plants, plant organs (e.g., leaves, stems, roots, etc.), seeds and progeny of same. The class of plants which can be used in the methods of the invention is generally as broad as the class of higher plants amenable to transformation techniques, including both monocotyledonous and dicotyledonous plants. Particularly preferred is Zea mays.
As used herein, "polynucleotide" includes reference to a deoxyribopolynucleotide, ribopolynucleotide, or analogs thereof, that hybridize to nucleic acids in a manner similar to naturally occurring nucleotides. A polynucleotide can be full-length or a sub-sequence of a native or heterologous structural or regulatory gene. Unless otherwise indicated, the Applicant Ref. No.: 0571 R-PCT.app term includes reference to the specified sequence as well as the complementary sequence thereof. Thus, DNAs or RNAs with backbones modified for stability or for other reasons are "polynucleotides" as that term is intended herein. Moreover, DNAs or RNAs comprising unusual bases, such as inosine, or modified bases, such as tritylated bases, to name just two examples, are polynucleotidesas the term is used herein. It will be appreciated that a great variety of modifications have been made to DNE~ and RNA that serve many useful purposes known to those of skill in the art. The term polynucleotide as it is employed herein embraces such chemically, enzymaticallyor metabolicallymodified forms of polynucleotides, as well as the chemical forms of DNA and RNA characteristic of viruses and cells, including inter alia, simple and complex cells.
The terms "polypeptide", "peptide" and "protein" are used interchangeably herein to refer to a polymer of amino acid residues. Tlhe terms apply to amino acid polymers in which one or more amino acid residue is an artificial chemical analogue of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers. Among the known modifications which may be present in polypeptides of the present are, to name an illustrative few, acetylation, acylation, ADP-ribosylation, amidation, covalent attachment of flavin, covalent attachment of a heme moiety, covalent attachment of a nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative, covalent attachment of phosphotidylinositol,cross-linking, cyclization, disulfide bond formation, demethylation, formation of covalent cross-links, formation of cystine, formation of pyroglutamate, formylation, gamma-carboxylation,glycosylation, GPI anchor formation, hydroxylation, iodination, methylation, myristoylation, oxidation, proteolytic processing,osphorylation,prenylation, racemization, selenoylation, sulfation, transfer-RNA mediated addition of amino acids to proteins such as arginylation, and ubiquitination.
Such modifications are well known to those of skill and have been described in great detail in the scientific literature. Several particularly common modifications, glycosylation, lipid attachment, sulfation, gamma-carboxylationof g lutamic acid residues, hydroxylation and ADP-ribosylation, for instance, are described in most basic texts, such as, for instance Proteins - Structure and Molecular Properties, 2nd ed., T. E. Creighton, W. H.
Freeman and Company, New York ( 1993). Many detailed reviews are available on this subject, such as, for example, those provided by Wold, F., PosttranslationalProtein Modifications:
Perspectives and Prospects, pp. 1-12 in Posttranslational Covalent Modification of Proteins, 18 ..,.'~~,;i;; :~-::~~;
Applicant Ref. No.: 05718-PCT.app :.
.. . . . . . ~. . . , , , , . , . . ,' , . , ,. ,. ' . ":. , ,.
B. C. Johnson, Ed., Academic Press, New York ( 1983); Seifter et al., Meth.
Enz-ymol. 182:
626-646 ( I 990) and Rattan et al., Protein Synthesis: Posttranslational Modifications and Aging, Ann. N.Y. Acad. Sci. 663: 48-62 ( 1992). It will be appreciated, as is well known and as noted above, that polypeptides are not always. entirely linear. For instance, polypeptides may be branched as a result of ubiquitination, arid they may be circular, with or without branching, generally as a result of posttranslation events, including natural processing event and events brought about by human manipulation which do not occur naturally.
Circular, branched and branched circular polypeptides ma.y be synthesized by non-translation natural process and by entirely synthetic methods, as well. Modifications can occur anywhere in a polypeptide, including the peptide backbone, the: amino acid side-chains and the amino or carboxyl termini. In fact, blockage of the amino or carboxyl group in a polypeptide, or both, by a covalent modification, is common in naturally occurring and synthetic polypeptides and such modifications may be present in polypeptides of the present invention, as well.
For instance, the amino terminal residue of polypeptides made in E. coli or other cells, prior to proteolytic processing, almost invariably will be N-formylmethionine.
During post-translational modification of the peptide, a metlzionine residue at the NHZ-terminus may be deleted. Accordingly, this invention contemplates the use of both the methionine-containing and the methionineless amino terminal variants of the protein of the invention.
In general, as used herein, the term polypeptide Encompasses all such modifications, particularly those that are present in polypeptides synthesized by expressing a polynucleotide in a host cell.
As used herein "promoter" includes reference to a region of DNA upstream from the start o~a~scription and involved in recognition and binding of RNA
polymerase and other proteins to initiate transcription. A "plans: promoter" is a promoter capable of initiating transcription in plant cells. Examples. of promoters under developmental control include promoters that preferentially initiate transcription in certain tissues, such as leaves, roots, seeds, fibers, xylem vessels, trach.eids, or sclerenchyma. Such promoters are referred to as "tissue preferred". Promoters which initiate transcription only in certain tissue are referred to as "tissue specific". A "cell type" specific promoter is primarily drives expression in certain cell types in one or more organs, for example, vascular cells in roots or leaves. An "inducible" promoter is a promoter which is under environmental control. Examples of environmental conditions. that may effect transcription by inducible Applicant Ref. No.: 0~71R-PCT. app ~: ..
. . . . . . r ~ :.
.
r .
:.
promoters include anaerobic conditions or the presence of light. Tissue specific, cell type specific, and inducible promoters constitute th~~ class of "non-constitutive"
promoters. A
"constitutive" promoter is a promoter which is active under most environmental conditions.
The terms "polypeptide involved in protease inhibition" or "protease inhibitor polypeptide" refer to one or more proteins, in ~;lycosylated or non-glycosylated form, acting as a protease inhibitor. Examples are included as, but not limited to:
chymotrypsin inhibitor, trypsin inhibitor, protease inhibitor, pre-pro-proteinase inhibitor I, subtilisin-chymotrypsin inhibitor, tumor-related protein, genetic tumor-related proteinase inhibitor, subtilisin inhibitor, endopeptidase inhibitor, serine protease inhibitor, wound-inducible proteinase inhibitor, and eglin c. The term is also inclusive of fragments, variants, homologs, alleles or precursors (e.g., preproproteins or proproteins) thereof.
A "protease inhibitor protein" comprises a protease inhibitor polypeptide.
As used herein "recombinant" includes reference to a cell, or nucleic acid, or 1 S vector, that has been modified by the introduction of a heterologous nucleic acid or the alteration or placement of a native nucleic acid to a form or to a locus not native to that cell, or that the cell is derived from a cell so modified. Thus, for example, recombinant cells express genes that are not found in identical form within the native (non-recombinant) form of the cell or express native genes that are otherwise abnormally expressed, under expressed or not expressed at all. The term "recombinant" as used herein does not encompass the alteration of the cell, nucleic acid or vector by naturally occurring events (e.g., spontaneous mutation, n~~tural transformati~ftransduction/transposition) such as those occurring without direct human intervention.
As used herein, a "recombinant expression cassette" is a nucleic acid construct, generated recombinantly or synthetically, with a series of specified nucleic acid elements which permit transcription of a particular nucleic acid in a target cell. The recombinant expression cassette can be incorporated into a pllasmid, chromosome, mitochondria) DNA, plastid DNA, virus, or nucleic acid fragment. Typically, the recombinant expression cassette portion of the expression vector includes, among other sequences, a nucleic acid to be transcribed, and a promoter.
APJIENDED SHEET
Applicant Ref. No.: 05718-PC'r.app ' . , _ . . . ~ . ~ . ' . . .
. . ' ~ .
., .. ." ."' ..
The term "residue" or "amino acid residue" or "amino acid" are used interchangeably herein to refer to an amino acid that is incorporated into a protein, polypeptide, or peptide (collectively "protein"). The amino acid may be a naturally occurring amino acid and, unless otherwise limited, may encompass known analogs of natural amino acids that can function in a simil'.ar manner as naturally occurring amino acids.
The term "selectively hybridizes" includes reference to hybridization, under stringent hybridization conditions, of a nucleic acid sequence to a specified nucleic acid target sequence to a detectably greater degree (e.g., at least 2-fold over background) than its hybridization to non-target nucleic acid sequences and to the substantial exclusion of non-target nucleic acids. Selectively hybridizing sequences typically have about at least 80% sequence identity, preferably 90% sequence identity, and most preferably 100%
sequence identity (i.e., complementary) with each other.
The terms "stringent conditions" or "stringent hybridization conditions" includes reference to conditions under which a probe will hybridize to its target sequence, to a detectably greater degree than other sequences (e.g., at least 2-fold over background). Stringent conditions are sequence-dependent and will be different in different circumstances. Longer sequences hybridize specifically at higher temperatures.
Generally, stringent conditions are selected to be about 5 °C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50% of a complementary target seque~e hybridizes to a perfectly matched probe. Typically, stringent conditions will be those in which the salt concentration is less than about 1.0 M Na ion, typically about 0.01 to 1.0 M Na ion concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes (e.g., 10 to 50 nucleotides) and at least about 60°C for long probes (e.g., greater than 50 nucleotides).
Stringent conditions may also be achieved with the addition of destabilizing agents such as formamide.
Exemplary low stringency conditions include hybridization with a buffer solution of 30%
formamide, 1 M NaCI, 1% SDS at 37°C, and a wash in 2X SSC at 50°C.
Exemplary high stringency conditions include hybridization in 50% formamide, 1 M NaCI, 1 % SDS at 37°C, and a wash in O.1X SSC at 60°C.
ja A~ ~
r f ~Ff fJCLI Ci ~"~.~
Applicant Ref. No.: 05718-PCT.app ., . " , . .. . . . . .~ . . . , .
. . ..
. . . . . . .
.. ~. ... .... ,.
Stringent hybridization conditions in the context of nucleic acid hybridization assay formats are sequence dependent, and are different under different environmental parameters. Longer sequences hybridize selectively at higher temperatures. An extensive guide to the hybridization of nucleic acids is found in Tijssen, Laboratory Techniques in Biochemistry and Molecular Biology--Hybridization with Nucleic Acid Probes, Part I, Chapter 2 "Overview of principles of hybridization and the strategy of nucleic acid probe assays", Elsevier, New York (1993).
The terms "transfection" or "transformation" include reference to the introduction of a nucleic acid into a eukaryotic or prokaryotic cell where the nucleic acid may be incorporated into the genome of the cell (e.g., chromosome, plasmid, plastid or mitochondria) DNA), converted into an autonomous replicon, or transiently expressed (e.g., transfected mRNA).
As used herein, "transgenic plant" includes reference to a plant which comprises within its genome a heterologous polynucleotide. Generally, the heterologous polynucleotide is stably integrated within the genome such that the polynucleotide is passed on to successive generations. The heterologous polynucleotide may be integrated into the genome alone or as part of a recombinant expression cassette.
"Transgenic" is used herein to include any cell, cell line, callus, tissue, plant part or plant, the genotype of which has been altered by the presence of heterologous nucleic acid including those transgenics initially so altered as well as those created by sexual crosses or asexual propagation from the initial transgenic. The term "transgenic" as used herein does not encompass the alteration~e genome (chromosomal or extra-chromosomal) by conventional plant breeding methods or by naturally occurring events such as random cross-fertilization, non-recombinant viral infection, non-recombinant bacterial transformation, non recombinant transposition, or spontaneous mutation.
As used herein, "vector" includes reference to a nucleic acid used in transfection of a host cell and into which can be inserted a polynucleotide. Vectors are often replicons. Expression vectors permit transcription of a nucleic acid inserted therein.
The following terms are used to describe the sequence relationships between two or more nucleic acids or polynucleotides: (a) "reference sequence", (b) "comparison "'~ ~rG=_ , ,...., 75529-49(S) window", (c) "sequence identity", (d) "percentage of sequence identity", and (e) "substantial identity".
(a) As used herein, "reference sequence" is a defined sequence used as a basis for sequence comparison. A reference sequence may be a subset or the entirety of a specified sequence; for example, as a segment of a full-length cDNA or gene sequence, or the complete ~cDNA or gene sequence.
(b) As used herein, "comparison window" means includes reference to a contiguous and specified segment of a polynucleotide sequence, wherein the polynucleotide sequence may be compared to a reference sequence and wherein the portion of the polynucleotide sequence in the comparison window may comprise additions or deletions (i.e., gaps) compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences.
Generally, the comparison window is at least 20 contiguous nucleotides in length, and optionally can be 30, 40, 50, 100, or longer. Those of skill in the art understand that to avoid a high similarity to a reference sequence due to inclusion of gaps in the polynucleotide sequence a gap penalty is typically introduced and is subtracted from the number of matches.
Methods of alignment of sequences for comparison are well-known in the art.
Optimal alignment of sequences for comparison may be conducted by the local homology algorithm of Smith and Waterman, Adv. Appl. Math 2: 482 (1981); by the homology alignment algorithm ofNeedleman and Wunsch, J. Mol. Biol. 48: 443 (1970); by the search for similarity method of Pearson and Lipman, Proc. Natl. Acad. Sci. 85:
(1988); by computerized implementations of these algorithms, including, but not limited to: CLUSTAL in the PC/Gene program by Intelligenetics, Mountain View, California, GAP, BESTFIT, BLAST" FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, Wisconsin, USA;
the CLUSTAL program is well described by Higgins and Sharp, Gene 73: 237-244 (1988); Higgins and Sharp, CABIOS 5: 151-153 (1989); Corpet, et al., Nucleic Acids Research 16: 10881-90 (1988); Huang, et al., Computer Applications in the Biosciences 8: 15~-65 (1992), and Pearson, et al., Methods in Molecular Biolosy 24: 307-331 (1994);
preferred computer alignment methods also include the BLASTP, BLASTN, and BLASTX algorithms. Altschul, et al., J. Mol. Biol. 215: 403-410 (1990).
Alignment is also often performed by inspection and manual alignment.
Trade-mark 23 Applicant Ref. No.: 05718-PCT.app ' . ,. . , . . ,~ , ~, , . .
. . , ". . . , , , , ,. .. .., . "' (c) As used herein, "sequence identity" or "identity" in the context of two nucleic acid or polypeptide sequences includes reference to the residues in the two sequences which are the same when aligned fo:r maximum correspondence over a specified comparison window. When percentage of sequence identity is used in reference to proteins it is recognized that residue positions which are not identical often differ by conservative amino acid substitutions, where amino acid residues are substituted for other amino acid residues with similar chemical properties (e.g. charge or hydrophobicity) and therefore do not change the functional propertifa of the molecule. Where sequences differ in conservative substitutions, the percent sequence identity may be adjusted upwards to correct for the conservative nature of the substitution. Sequences which differ by such conservative substitutions are said to have "sequence similarity" or "similarity". Means for making this adjustment are well-known to those of skill in the art.
Typically this involves scoring a conservative substitution as a partial rather than a full mismatch, thereby increasing the percentage sequence identity. Thus, for example, where an identical amino acid is given a score of 1 and a non-conservative substitution is given a score of zero, a conservative substitution is given a score between zero and 1. The scoring of conservative substitutions is calculated, e.g., according to the algorithm of Meyers and Miller, Computer Applic. Biol. Sci., 4: 11-17 (1988) e.g., as implemented in the program PC/GENE (Intelligenetics, Mountain View, California, USA).
(d) As used herein, "percentage of sequence identity" means the value determined by comparing two optimally aligned sequences over a comparison window, wherein the portion of the polynucleotide sequence in the comparison window may corrrpr~'s~ additions or deletions (i.e., gaps) as compared to the reference sequence (which does not comprise additions or deletion~~) for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical nucleic acid base or amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison and multiplying the result by 100 to yield the percentage of sequence identity.
(e) (i) The term "substantial identity" of polynucleotide sequences means that a polynucleotide comprises a sequence that has at least 70%
sequence identity, preferably at least 80%, more preferably at lease: 90% and most preferably at least 95%, A~,~ENDED SlIEET
Applicant Ref. No.: 0~71R-PCT.app compared to a reference sequence using one of the alignment programs described using standard parameters. One of skill will recognize that these values can be appropriately adjusted to determine corresponding identity of proteins encoded by two nucleotide sequences by taking into account codon degeneracy, amino acid similarity, reading frame positioning and the like. Substantial identity of amino acid sequences for these purposes normally means sequence identity of at least 60'%, more preferably at least 70%, 80%, 90%, and most preferably at least 95%. Polypeptides which are "substantially similar"
share sequences as noted above except that residue positions which are not identical may differ by conservative amino acid changes.
Another indication that nucleotide sequences are substantially identical is if two molecules hybridize to each other under stringent conditions. Generally, stringent conditions are selected to be about 5°C to about 20°C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The T,n is the temperature (under defined ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched probe. Typically, stringent wash conditions are those in which the salt concentration is about 0.02 molar at pH 7 and the temperature is at least about 50, 55, or 60°C. However, nucleic acids which do not hybridize to each other under stringent conditions are still substantially identical if the polypeptides which they encode are substantially identical. This may occur, e.g., when a copy of a nucleic acid is created using the maximum codon degeneracy permitted by the genetic code. One indication that two nucleic acid sequences are substantially identical is that the polypeptide which the first nucleic acid encodes is immunologically cross reactive with the polypeptide encoded by the sec~ucleic acid.
(e) (ii) The terms "substantial identity" in the context of a peptide indicates that a peptide comprises a sequence with at least 70% sequence identity to a reference sequence, preferably 80%, more preferably 85'%, most preferably at least 90%
or 95%
sequence identity to the reference sequence over a specified comparison window.
Preferably, optimal alignment is conducted using the homology alignment algorithm of Needleman and Wunsch, J. Mol. Biol. 48: 443 (1970). An indication that two peptide sequences are substantially identical is that one peptide is immunologically reactive with antibodies raised against the second peptide. Thus, a peptide is substantially identical to a r;'J!Er;r~~D SHEET
Applicant Ref. No.: 05718-PCT.app second peptide, for example, where the two peptides differ only by a conservative substitution.
S It has been unexpectedly discovered that a protease inhibitor can be modified to enhance its content of essential amino acids coupled with reduction in protese inhibitor activity. In a preferred embodiment of the present invention, derivatives of the protease inhibitor, CI-2, simultaneously exhibit both enhanced essential amino acid content as well as decreased protease inhibitor activity. The present compounds are thus excellent candidates for enhancing the nutritional value of feed.
The present invention provides, inter aha, compositions and methods for modulating (i.e., increasing or decreasing) the total levels of essential amino acids and/or altering the ratios of essential amino acids in plants. Thus, the present invention provides utility in such exemplary applications as improving the nutritional properties of fodder crops, increasing the value of plant material for pulp and paper production, altering the protease inhibitory activity, as well as for improving the utility of plant material where the amount of essential amino acids or composition is important, such as the use of plant as a feed. In particular, protease inhibitor polypeptides may be expressed at times or in quantities which are not characteristic of natural plants.
The present invention also provides isolated nucleic acid comprising polynucleotides of sufficient length and complementarity to a protease inhibitor gene, to use as probes or amplification primers in the detection, quantitation, or isolation of gene transcripts~r example, isolated nucleic acids of the present invention can be used as probes in detecting deficiencies in the level of mRNA in screenings for desired transgenic plants, for detecting mutations in the gene (e.g., substitutions, deletions, or additions), for monitoring upregulation of protease inhibition in screening assays for compounds affecting protease inhibition, or for use as molecular markers in plant breeding programs.
The isolated nucleic acids of the present invention can also be used for recombinant expression of protease inhibitor polypeptides for use as immunogens in the preparation and/or screening of antibodies. The isolated nucleic acids of the present invention can also be employed for use in sense or antisense suppression of one or more protease inhibitor genes in a host cell, tissue, or plant. Further, using a primer specific to an . ,.,.- , ~t.~ -r .. ~~ r Applicant Ref. No.: 0571 R-PC1'.app insertion sequence (e.g., transposon) and a primer which specifically hybridizes to an isolated nucleic acid of the present invention, one can use nucleic acid amplification to identity insertion sequence inactivated protease: inhibitor genes from a cDNA
library prepared from insertion sequence mutagenized plants. Progeny seed from the plants S comprising the desired inactivated gene can be grown to a plant to study the phenotypic changes characteristic of that inactivation. See, Tools to Determine the Function of Genes, 1995 Proceedings of the Fiftieth Annual Corn and Sorghum Industry Research Conference, American Seed Trade Association., Washington, D.C., 1995.
The present invention also provides isolated proteins comprising polypeptides having a minimal amino acid sequence from the polypeptides involved in protease inhibition as disclosed herein. The present invention also provides proteins comprising at least one epitope from a polypeptide involved in protease inhibition. The proteins of the present invention can be employed in assays fo:r enzyme agonists or antagonists of enzyme function, or for use as immunogens or antigens to obtain antibodies specifically immunoreactive with a protein of the present invention. Such antibodies can be used in assays for expression levels, for identifying andJor isolating nucleic acids of the present invention from expression libraries, or for purification of polypeptides involved in protease inhibition. In a preferred embodiment ~of the present invention, the present protein has both elevated essential amino acid content and reduced protease inhibitor activity.
The isolated nucleic acids of the present invention can be used over a broad range of plant types, including species from the genera Ctrcttrbita, Rosa, Vitis, Juglans, Fragariar.ls, Medicago, Onobrychis, Trifoli'um, Trigonella, Vigna, Citrus, Linttm, Geranium, Manihot, Daucus, Arabidopsis, Brassica, Raphanus, Sinapis, Atropa, Capsicum, Datura, Hyoscyamus, Lycopersicon, Nicotiana, Solanum, Petunia, Digitalis, Majorana, Ciahorium, Helianthus, Lactuca, Br~omus, Asparagus, Antirrhinum, Heterocallis, Nemesis. Pelargonitrm, Panieum, Pennisetum, Ranunculus, Senecio, Salpiglossis, Ctrcumis, Browaalia, Glycine, Pisium, Phaseolus, Lolium, Oryza, Zea, Avena, Hordeum, Secale, Triticum, Sorghum, Picea, and Populus.
. _ _- ~ --CL=y 75529-49(S) The isolated nucleic acids of the present invention can be used over a broad range of po--ypeptide types, including anti-microbial peptides such as those described in Rao, G., Antimicrobial Peptides; Molecular Plant-Microbe Interactions 8:6-13 (1995).
Protease Inhibitor Nucleic Acids The present invention provides, inter alia, isolated and/or heterologous nucleic acids of RNA, DNA, and analogs and/or chimeras thereof, comprising a protease inhibitor polynucleotide encoding such proteins as: chymotrypsin inhibitor, trypsin inhibitor, protease inhibitor, pre-pro-proteinase inhibitor I, subtilisin-chymotrypsin inhibitor, tumor-related protein, genetic tumor-related proteinase inhibitor, subtilisin inhibitor, endopeptidase inhibitor, serine protease inhibitor, wound-inducible proteinase inhibitor, and eglin c. The protease inhibitor nucleic acids of the present invention comprise protease inhibitor polynucleotides which, are inclusive of:
(a) a polynucleotide encoding a protease inhibitor polypeptide of SEQ ID NOS: 2,4,6,8,10, or 12,16,18,20,22,24 and conservatively modified and polymorphic variants thereof, including exemplary polynucleotides of SEQ ID NOS: 1,3,5,7,9 and 11,15,17,19,21,23 and. conservative changes (b) a polynucleotide which is the product of amplification from Zea mays nucleic acid library using primer pairs from amongst. the consecutive pairs from SEQ ID NOS: 25 and 26, which amplify polynuc:leotides having substantial identity to polynucleotides from amongst those having SEQ ID
NOS: 1,3,5,7,9 or 11,15,17,19,21,23 75529-49(S) (c) a polynucl_eotide which selectively hybridizes under stringent hybridization conditions consisting of washing in a salt concentration of about 0.02 molar at pH 7 at 50°C, to a polynucleotide of (a) or (b);
(d) a polynucl.eotide having at least 60o sequence identity with Sequence I:D NOS: 1,3,5,7,9,11,15,17,19,21 or 23;
(e) a polynucl.eotide encoding a protein having a specified number of contiguous amino acids from a prototype polypeptide, wherein the protein is specifically recognized by antisera elicited by prE:sentation of the protein and wherein the protein does 28a Applicant Rcf. No.: 057! R-PCT.app ' ~ - . .
. .~ . , , . . . . . . . , , ", .. .. .,. . . ., not detectably immunoreact to antisera which has been fully immunosorbed with the protein;
(f) complementary sequences oi'polynucleotides of (a), (b), (c), (d), or (e);
and (g) a polynucleotide comprising; at least 20 contiguous nucleotides from a polynucleotide of Sequence ID Nos. 1, 3, 5, 7, 9, 11, 15, 17, 19, 21 or 23.
A. Polynucleotides Encoding A Protease inhibitor Protein of SEQ ID NOS: 2, 4, 6, 8,10 and 12,16,18, 20, 22, 24 or Conservatively Mod ified or Polymorphic Variants Thereof As indicated in (a), sarpra, the present invention provides isolated and/or heterologous nucleic acids comprising protease: inhibitor polynucleotides, wherein the polynucleotides encode the protease inhibitor polypeptides disclosed herein as SEQ ID
NOS: 2,4,6,8,10 and 12,16,18,20,22,24 or conservatively modified or polymorphic variants thereof. Those of skill in the art will recognize that the degeneracy of the genetic 1 ~ code allows for a plurality of polynucleotides to encode for the identical amino acid sequence. Thus, the present invention includes protease inhibitor polynucleotides of SEQ
ID NOS: 1,3,5,7,9 and 11, 15,17,19,21, 23 and silent variations of polynucleotides encoding a protease inhibitor polypeptide of SF:Q ID NOS: 2,4,6,8,10 and 12,16,18,20,22,24. The present invention further provides isolated and/or heterologous nucleic acids comprising protease inhibitor pol;ynucleotides encoding conservatively modified variants of a protease inhibitor polypeptide of SEQ ID NOS:
2,4,6,8,10 and 12, 16,18,20,22,24. Additionally, the present invention further provides isolated and/or heterolog8a~iucleic acids comprising protease inhibitor polynucleotides encoding one or more polymorphic (allelic) variants of protease inhibitor polypeptides/polynucleotides.
B. Polynucleotides Amplified from a Zea mays .Nucleic Acid Library As indicated in (b), supra, the present invention provides isolated and/or heterologous nucleic acids comprising protease inhibitor polynucleotides, wherein the polynucleotides are amplified from a Zea mays nucleic acid library. The nucleic acid library may be a cDNA library, a genomic library, or a library generally constructed from nuclear transcripts at any stage of intron processing. Nucleic acid libraries from other plants, both monocots and dicots could also be used in a similar fashion. The ~1~~PJDEt? c~.,~'T
Applicant Ref. No.: 05718-PCT.app polynucleotides of the present invention include those amplified using the following primer pairs:
SEQ ID NOS: 25 and 26 which yield an amplic,on comprising a sequence having substantial identity to SEQ ID NOS: 7,9, and 11.
S Thus, the present invention provides protease inhibitor synthetic polynucleotides having the sequence of the gene, a nuclear transcript, a cDNA, or complementary sequences thereof. In preferred embodiments, l:he nucleic acid library is constructed from Zea mays, such as lines B73, PHRE1, A632, B1VIS-P2#10, and W23, each of which are known and publicly available. In particularly preferred embodiments, the library is constructed from tissue such as root, leaf, or tassel, or embryonic tissue.
The amplification products can be translated using expression systems well known to those of skill in the art and as discussed, infr~2. The resulting translation products can be confirmed as protease inhibitor polypeptides. of the present invention by, for example, assaying for the appropriate inhibition activity or verifying the presence of a linear epitope which is specific to a protease inhibitor polypeptide using standard immunoassay methods.
Those of ordinary skill will appreciate that primers which selectively amplify, under stringent conditions, the polynucleotides of the present invention (and their complements) can be constructed by reference 1:o the sequences provided herein at SEQ
ID NOS: 1,3,5,7,9 and 11. In preferred embodiments, the primers will be constructed to anneal with the first three contiguous nucleotides at their 5' terminal end's to the first codon encoding the carboxy or amino terminal amino acid residue (or the complements thereof) opolynucleotides of the present invention. Typically, such primers are at least 15 nucleotides in length. The primer length in nucleotides is selected from the group of integers consisting of from at least 15 to 90. Thus, the primers can be at least 15, 18, 20, 25, 30, 40, 50, 60, 70, 80, or 90 nucleotides in length.
The amplification primers may optionally be elongated in the 3' direction with contiguous nucleotide sequences from polynucleotide sequences of SEQ ID NOS:
1,3,5,7,9 and 11, 15,17,19,21, from which they are derived. The number of nucleotides by which the primers can be elongated is selected from the group of integers consisting of from at least 1 to 25. Thus, for example, the primers can be elongated with an additional l, 5, 10, or 1 S nucleotides. Those of skill will recognize that a lengthened primer 30 '-'~w'~... -~_-Elppli,:ant Ref. No.: 05718-PCT.app sequence can be employed to increase specificity of binding (i.e., annealing) to a target sequence.
C. Polynucleotides Which Selectively Hybridize to a Polynucleotide of (A) or (B) As indicated in (c), supra, the present invention provides isolated and/or heterologous nucleic acids comprising protease inhibitor polynucleotides, wherein the polynucleotides selectively hybridize, under selective hybridization conditions, to a protease inhibitor polynucleotide of paragraphs (A) or (B) as discussed, supra. Thus, the polynucleotides of this embodiment can be used for isolating, detecting, and/or quantifying nucleic acids comprising the polynucleotides of (A) or (B). Low stringency hybridization conditions are typically, but not exclusively, employed with sequences having relatively small sequence identity. Moderate and high stringency conditions can optionally be employed for sequences of greater identity. Low stringency conditions allow selective hybridization of sequences having about 70% sequence identity.
D. Polynucleotides Having at Least 60% Seqzrence Identity with the Polynzrcleotides of (A), (B) or (C) As indicated in (d), supra, the present invention provides isolated and/or heterologous nucleic acids comprising protease inhibitor polynucleotides, wherein the polynucleotides have a specified identity at the nucleotide level to a polynucleotide as disclosed above in paragraphs (A), (B), (C), or (D). The percentage of identity to a reference sequence is at least 60% and, rounded upwards to the nearest integer, can be expressed.as:~ integer selected from the group of integers consisting of from 60 to 99.
Thus, for example, the percentage of identity to a reference sequence can be at least 70%, 75%, 80%, 85%, 90%, or 95%.
The protease inhibitor polynucleotide optionally encodes a protein having a molecular weight as the unglycosylated protein within 20% of the molecular weight of the truncated or full-length protease inhibitor polype:ptides as disclosed herein (e.g., SEQ ID
NOS: 2,4,6,8,10 and 12). Preferably, the molecular weight is within 1 S% of a full length protease inhibitor polypeptide, more preferably within 10% or 5%, and most preferably 31 . . .,.; '~:' '_'.' ._ ;~~ 7~
Applicant Ref. No.: 05718-PCT. app within 3%, 2%, or 1% of a full length protease inhibitor polypeptide of the present invention.
Optionally, the protease inhibitor polynucleotides of this embodiment will encode a protein having an inhibitory activity less than or equal to 20%, 30%, 40%, or 50% of the native, endogenous (i.e., non-isolated), full-length protease inhibitor polypeptide.
Determination of protein inhibition can be detemined by any number of means well known to those of skill in the art.
F. Polynucleotides Complementary to the Polyr~ucleotides of (A)-(E) As indicated in (f), supra, the present invention provides isolated and/or heterologous nucleic acids comprising protease inhibitor polynucleotides, wherein the polynucleotides are complementary to the polynucleotides of paragraphs A-E, above. As those of skill in the art will recognize, complementary sequences base-pair throughout the entirety of their length with the polynucleotides of (A)-(E) (i.e., have 100%
sequence identity). Complementary bases associate through hydrogen bonding in double stranded nucleic acids. For example, the following base pairs are complementary:
guanine and cytosine; adenine and thymine; and adenine and uracil.
G. Polynz~cleotides Which are Subseqzrences of ,the Polynucleotides of (A)-(F) As indicated in (h), supra, the present invention provides isolated and/or heterologous nucleic acids comprising protease inhibitor polynucleotides, wherein the polynucleotide comprises at least 15 contiguous bases from the polynucleotides of (A) through (F~.-discussed above. The length of tile polynucleotide is given as an integer selected from the group consisting of from at least 1 S to the length of the nucleic acid sequence from which the protease inhibitor polynucleotide is a subsequence of.
Thus, for example, polynucleotides of the present invention are inclusive of polynucleotides comprising at least 15, 20, 25, 30, 40, 50, 60, 75, or 100 contiguous nucleotides in length from the polynucleotides of (A)-(F). Optionally, the number of such subsequences encoded by a polynucleotide of the instant embodiment can be any integer selected from the group consisting of from 1 to 20, such as 2, :3, 4, or 5.
Construction of Protease inhibitor Nucleic Acids t-r /cppli~ant Ref. No.: 0571 R-PCT.app The isolated and/or heterologous protease inhibitor nucleic acids of the present invention can be made using (a) standard recombinant methods, (b) synthetic techniques, or combinations thereof. In some embodiments, the protease inhibitor polynucleotides of the present invention will be cloned, amplified, or otherwise constructed from a plant.
The preferred plants are barley and Zea mays, such as inbred line B73 which is publicly known and available. Particularly preferred is the use of Zea mays tissue such as roots, leaves, tassels, seeds or embryonic tissue.
A. Recombinant Methods for Constructing Protease inhibitor Nucleic Acids The isolated and/or heterologous nucleic; acid compositions of this invention, such as RNA, cDNA, genomic DNA, or a hybrid thereof, can be obtained from plant biological sources using any number of cloning methodologies known to those of skill in the art.
The isolation of protease inhibitor polynucleotides may be accomplished by a number of techniques. For instance, oligonucleotide probes based on the sequences disclosed here can be used to identify the desired gene in a cDNA or genomic DNA
library. To construct genomic libraries, large segments of genomic DNA are generated by random fragmentation, e.g. using restriction ene!onucleases, and are ligated with vector DNA to form concatemers that can be packaged into the appropriate vector. To prepare a cDNA library, mRNA is isolated from the desired organ, such as sclerenchyma and a cDNA library which contains the gene encoding; for a protease inhibitor protein (i.e., the protease inhibitor gene) is prepared from the mF:NA. Alternatively, cDNA may be prepared from mRNA extracted from other tissues in which protease inhibitor genes or homologs.xpressed.
The DNA or genomic library can then bf: screened using a probe based upon the sequence of a cloned protease inhibitor polynucl.eotide such as those disclosed herein.
Probes may be used to hybridize with genomic I~NA or cDNA sequences to isolate homologous genes in the same or different plant species. Those of skill in the art will appreciate that various degrees of stringency of hybridization can be employed in the assay; and either the hybridization or the wash medium can be stringent. As the conditions for hybridization become more stringent, there must be a greater degree of complementarity between the probe and the target for duplex formation to occur. The degree of stringency can be controlled by temperature, ionic strength, pH and the presence AMENDED SHEET
Applicant Ref. No.: 0~7tR-PCT.app .. ..
. . , . . . .
. ~ . . . . . . ~..
.. .. .~. .... ..
of a partially denaturing solvent such as formamide. For example, the stringency of hybridization is conveniently varied by changing the polarity of the reactant solution through manipulation of the concentration of formamide within the range of 0%
to 50%.
Cloning methodologies to accomplish these ends, and sequencing methods to S verify the sequence of nucleic acids are well known in the art. Examples of appropriate cloning and sequencing techniques, and instructions sufficient to direct persons of skill through many cloning exercises are found in S~~mbrook, et al., Molecular Cloning. A
Laboratory Manual, 2nd Ed., Cold Spring Harlbor Laboratory Vols. 1-3 (1989), Methods in Enzymology, Vol. 152: Garide to Molecular Cloning Technigues, Berger and Kimmel, Eds., San Diego: Academic Press, Inc. (1987), Current Protocols in Molecular Biology, Ausubel, et al., Eds., Greene Publishing and Wiley-Interscience, New York ( 1987); Plant Molecular Biology: A Laboratory ManZral, Clark, Ed., Springer-Verlag, Berlin (1997).
The nucleic acids of interest can also be; amplified from nucleic acid samples using amplification techniques. For instance, polymerase chain reaction (PCR) technology can be used to amplify the sequences of protease inhibitor polynucleotides of the present invention and related genes directly from genomic DNA or cDNA
libraries.
PCR and other in vitro amplification methods may also be useful, for example, to clone nucleic acid sequences that code for proteins to be expressed, to make nucleic acids to use as probes for detecting the presence of the desii:-ed mRNA in samples, for nucleic acid sequencing, or for other purposes.
The degree of complementarity (sequence identity) required for detectable binding will vary in accordance with the stringency of the hybridization medium and/or wash medium. -~ewdegree of complementarity will optimally be 100 percent; however, it should be understood that minor sequence variations in the probes and primers may be compensated for by reducing the stringency of the hybridization and/or wash medium.
Examples of techniques sufficient to direct persons of skill through in vitro amplification methods are found in Berger, Sarnbrook, and Ausubel, as well as Mullis et al., U.S. Patent No. 4,683,202 (1987); PCR Protocols A Guide to Methods and Applications, Innis et al., Eds., Academic Press; Inc., San Diego, CA (1990);
Arnheim &
Levinson, C&EN pp. 36-47 (October 1, 1990).
B. Synthetic Methods for Constructing Protease inhibitor Nucleic Acids ~r:~~~~~r:~ s~L~r Hppl~cant Ref. No.: 05718-PCT.app ' ~ ~ ~ ~ , .. ~. . . .. . ..
The isolated nucleic acids of the present invention can also be prepared by direct chemical synthesis by methods such as the phosphotr~iester method of Narang et al., Meth.
Enzvmol. 68: 90-99 ( 1979)and the phosphodiester method of Brown et al., Meth.
Enzymol. 68: 109-151 (1979). The isolated nucleic acids of the present invention can also be modified through methods such as site directed mutogenesis, error prone PCR
and known to one of skill.
Recombinant Expression Cassettes The present invention further provides recombinant expression cassettes comprising a protease inhibitor nucleic acid of the present invention. A
nucleic acid sequence coding for the desired protease inhibitor polynucleotide, for example a cDNA or a genomic sequence encoding a full length protease inhibitor protein, can be used to construct a recombinant expression cassette wr~ich can be introduced into the desired host cell. A recombinant expression cassette will typically comprise a protease inhibitor polynucleotide operably linked to transcriptional initiation regulatory sequences which will direct the transcription of the protease inhibitor polynucleotide in the intended host cell, such as tissues of a transformed plant.
For example, plant expression vectors may include (1) a cloned plant gene under the transcriptional control of 5' and 3' regulatory sequences and (2) a dominant selectable marker. Such plant expression vectors may also contain, if desired, a promoter regulatory region (e.g., one conferring inducible or constitutivvironmentally- or developmentally-regulated, or cell- or tissue-specific/selective expression), a transcription initiation start site, a ribosome binding site, an RNA processing signal, a transcription termination site, and/or a polyadenylation signal. Highly preferred plant expression cassettes will be designed to include one or more selectable marker genes, .such as kanamycin resistance or herbicide tolerance genes.
A plant promoter fragment may be employed which will direct expression of the protease inhibitor polynucleotide in all tissues of a regenerated plant. Such promoters are referred to herein as "constitutive" promoters arid are active under most environmental ~,t,p~r,~~cr c~..c~-.
~.~ ....,:... i Applicant Ref. No.: 05718-PCT.app . , . "' , , ~~, , , , ,. .. , . ,.
conditions and states of development or cell differentiation. Examples of constitutive promoters include the cauliflower mosaic virus (CaMV) 35S transcription initiation region, the 1'- or 2'- promoter derived from T-L>NA ofAgrobacterium tumefat;iens, the ubiquitin 1 promoter, the Smas promoter, the ci:nnamyl alcohol dehydrogenase promoter (U.S. Patent No. 5,683,439), the Nos promoter, the pEmu promoter, the rubisco promoter, the GRP1-8 promoter, and other transcription initiation regions from various plant genes known to those of skill. In a preferred embodirrlent, the gamma zero promoter of maize would be used.
Alternatively, the plant promoter may direct expression of the protease inhibitor polynucleotide in a specific tissue or may be otherwise under more precise environmental or developmental control Examples of promoters under developmental control include promoters that initiate transcription only, or preferentially, in certain tissues, such as leaves, roots, fruit, seeds, or flowers. The operation of a promoter may also vary depending on its location in the genome. Thus, an inducible promoter may become fully or partially constitutive in certain locations.
Both heterologous and non-heterologous (i.e., endogenous) promoters can be employed to direct expression of the protease inhibitor nucleic acids of the present invention. These promoters can also be used, fir example, in recombinant expression cassettes to drive expression of antisense nucleic acids to reduce, increase, or alter protease inhibitor content and/or composition in a desired tissue Methods for identifying promoters with a particular expression pattern, in terms of, e.g., tissue type, cell type, stage of development, andJor environmental conditiorrs~=a~e well known in the art. See, e.g., The Maize Handbook, Chapters 114-115, Freeling and Walbot, Eds., Springer, New York: (1994); Corn and Corn Improvement, 3'a edition, Chapter 6, Sprague and Dudley, Eds., American Society of Agronomy, Madison, Wisconsin (1988). A typical step in promoter isolation methods is identification of gene products that are expressed with some degree of specificity in the target tissue. Amongst the range of methodologies are: differential hybridization to cDNA libraries;
subtractive hybridization; differential display; differential 2 -D gel electrophoresis;
DNA probe arrays;
and isolation of proteins known to be expressed. with some specificity in the target tissue.
Such methods are well known to those of skill in the art. Commercially available _ _., c_'_:
75529-49(S) products for identifying promoters are known in the art such as CloneTech's (Palo Alto, CA) PROMOTERFINDER DNA Walking Kit Once promoter and/or gene sequences are known, a region of suitable size is selected from the genomic DNA that is 5' to the transcriptional start, or the translational start site, and such sequences are then linked to a coding sequence. If the transcriptional start site is used as the point of fusion, any of a number of possible 5' untranslated regions can be used in between the transcriptional start site and the partial coding sequence. If the translational start site at the 3' end of the specific promoter is used, then it is linked directly to the methionine start codon of a coding sequence.
If polypeptide expression is desired, it is generally desirable to include a polyadenylation region at the 3'-end of the protease inhibitor polynucleotide coding region. An intron sequence can be added to the ~' untranslated region or the coding sequence of the partial coding sequence to increase the amount of the mature message that accumulates in the cytosol Use of maize introns Adhl-S intron 1, 2, and 6, the Bronze-I
intron are known in the art. See generally, The Maize Handbook, Chapter 116, Freeling and Walbot, Eds., Springer, New York ( 1994).
The vector comprising the sequences from a protease inhibitor nucleic acid will typically comprise a marker gene which confers a selectable phenotype on plant cells.
Usually, the selectable marker gene will encode antibiotic resistance, with suitable genes including genes coding for resistance to the antibiotic spectinomycin (e.g., the aada gene), the streptomycin phosphotransferase (SPT) gene coding for streptomycin resistance, the neomycin phosphotransferase (NPTII) gene encoding kanamycin or geneticin resistance, the hygromycin phosphotransferase (HPT) gene coding for hygromycin resistance, genes coding for resistance to herbicides which act to inhibit the action of acetolactate synthase (ALS), in particular the sulfonylurea-type herbicides (e.g., the acetolactate synthase (ALS) gene containing mutations leading to such resistance in particular the S4 and/or Hra mutations), genes coding for resistance to herbicides which act to inhibit action of glutamine synthase, such as phosphinothricin or basta (e.g., the bar gene), or other such genes known in the art. The bar gene encodes resistance to the herbicide basta, the nptll gene encodes resistance to the antibiotics kanamycin and geneticin, and the ALS gene encodes resistance to the herbicide chlorsulfuron.
Trade-mark Applicant Ref. No.: 071 R-PCT.app ' . . .. ,.
. . , " . . , , . , . , . . . ,.
,. ,. ". .,.. , Typical vectors useful for expression of genes in higher plants are well known in the art and include vectors derived from the tumor-inducing (Ti) plasmid of Agrobacterium tumefaciens described by Rogers et al., Meth. In Enzymol., 153:253-277 (1987). These vectors are plant integrating vect~~rs in that on transformation, the vectors integrate a portion of vector DNA into the genome of the host plant. Exemplary A.
tumefaciens vectors useful herein are plasmids pKYLX6 and pKYLX7 of Schardl et al., Gene, 61:1-11 (1987) and Berger et al., Proc. Natl. Acad. Sci. U.S.A., 86:8402-(1989). Another useful vector herein is plasmid pBI101.2 that is available from Clontech Laboratories, Inc. (Palo Alto, CA).
The protease inhibitor polynucleotide of the present invention can be expressed in either sense or anti-sense orientation as desired.
Protease inhibitor Proteins The isolated protease inhibitor proteins of the present invention comprise a protease inhibitor polypeptide having at least 10 amino acids encoded by any one of the protease inhibitor polynucleotides as discussed more fully, sarpra, or polypeptides which are conservatively modified variants thereof. Exemplary protease inhibitor polypeptide sequences are provided in SEQ ID NOS: 2,4,6,8,10 and 12. The protease inhibitor proteins of the present invention or variants thereof can comprise any number of contiguous amino acid residues from a protease; inhibitor protein, wherein that number is selected from the group of integers consisting of from 10 to the number of residues in a full-length protease inhibitor polypeptide. Optionally, this subsequence of contiguous amino acids at least 15, 20, 25, 30, 35, or 40 amino acids in length, often at least 50, 60, 70, 80, or 90 amino acids in length. Further, the number of such subsequences can be any integer selected from the group consisting of from 1 to 20, such as 2, 3, 4, or 5.
As those of skill will appreciate, the preaent invention includes protease inhibitor polypeptides with less inhibitory activity. Less inhibitory protease inhibitor polypeptides have an inhibitory activity at least 20%, 30%, or 40%, and preferably at least 50% or 60%, below that of the native (non-synthetic), endogenous protease inhibitor polypeptide.
A preferred immunoassay is a competitive immunoassay as discussed, infra.
Thus, the protease inhibitor proteins can be employed as immunogens for constructing ftpplicant Ref. No.: 05718-PCT.app 1 1 1 ! a 1 1 1 . 1 1 n I ~ I . 1 1 a 1 . 1 n n 1 v 1 n ! 1 1 ~ n . 1 a 1 1 1 o i , antibodies immunoreactive to a protease inhibitor protein for such exemplary utilities as immunoassays or protein purification techniquca.
Expression of Proteins in Host Cells Using the nucleic acids of the present invention, one may express a protease inhibitor protein in a recombinantly engineered cell such as bacteria, yeast, insect, mammalian, or preferably plant cells. The cells produce the protein in a non-natural condition (e.g., in quantity, composition, location, and/or time), because they have been genetically altered through human intervention to do so.
It is expected that those of skill in the act are knowledgeable in the numerous expression systems available for expression of nucleic acids encoding protease inhibitor proteins. No attempt to describe in detail the various methods known for the expression of proteins in prokaryotes or eukaryotes will be made.
IS
B. Expression in Eukaryotes A variety of eukaryotic expression systems such as yeast, insect cell lines, plant and mammalian cells, are known to those of skill in the art. As explained briefly below, protease inhibitor proteins of the present invention may be expressed in these eukaryotic systems. In some embodiments, transformed/transfected plant cells, as discussed infra, are employed as expression systems for production of the proteins of the instant invention.
Transfection/Transformation of Cells The method of transformation/transfection is not critical to the instant invention;
various methods of transformation or transfection are currently available. As newer methods are available to transform crops or othE:r host cells they may be directly applied.
Accordingly, a wide variety of methods have been developed to insert a DNA
sequence into the genome of a host cell to obtain the transcription and/or translation of the sequence to effect phenotypic changes in the organism. Thus, any method which provides for efficient transformation/transfection may be employed.
AMENDED SN~ET
Applicant Ref. No.: 0571 R-PCT.app A. Plant Transformation A DNA sequence coding for the desired protease inhibitor polynucleotide, for example a cDNA or a genomic sequence encoding a full length protein, will be used to construct a recombinant expression cassette which can be introduced into the desired plant.
Isolated nucleic acids of the present invention can be introduced into plants according to techniques known in the art. Generally, recombinant expression cassettes as described above and suitable for transformation of plant cells are prepared.
Techniques for transforming a wide variety of higher plant species are well known and described in the technical, scientific, and patent literature. Se:e, for example, Weising et al., Ann. Rev.
Genet. 22: 421-477 (1988). For example, the DIVA construct may be introduced directly into the genomic DNA of the plant cell using techniques such as electroporation, PEG
poration, particle bombardment, silicon fiber dellivery, or microinjection of plant cell protoplasts or embryogenic callus. Alternatively, the DNA constructs may be combined with suitable T-DNA flanking regions and introduced into a conventional Agrobacterium tumefaciens host vector. The virulence functions of the Agrobacterium tumefaciens host will direct the insertion of the construct and adjacent marker into the plant cell DNA
when the cell is infected by the bacteria.
The introduction of DNA constructs using polyethylene glycol precipitation is described in Paszkowski et al., Embo J. 3: 2717-2722 (1984). Electroporation techniques are described in Fromm et al., Proc. Natl. Acad. Sci. 82: 5824 (1985).
Ballistic transforms r ii techniques are described in Klein et al., Nature 327: 70-73 (1987).
Agrobacterium tumefaciens-meditated transformation techniques are well described in the scientific literature. See, for example Horsch et al., Science 233: 496-498 (1984), and Fraley et al., Proc. Natl. Acad. Sci. 80: 4803 (1583). Although Agrobacterium is useful primarily in dicots, certain monocots can be transformed by Agrobacterium. For instance, Agrobacterium transformation of maize is described in U.S. Patent No.
5,550,318.
Other methods of transfection or transformation include (1) Agrobacteria~m rhizogenes-mediated transformation (see, e.g., L,ichtenstein and Fuller In:
Genetic Engineering, vol. 6, PWJ Rigby, Ed., London, Academic Press, 1987; and Lichtenstein, C. P., and Draper, J,. In: DNA Cloning, Vol. II, D. M. Glover, Ed., Oxford, IRI Press, ~;"J~~;DEI~ ~~-tEEi 75529-49(S) 1985),Application PCT/US87/02512 (WO 88/02405 published Apr. 7, 1988) describes the use of A.rhizogenes strain A4 and its Ri plasmid along with A. tumefaciens vectors pARC8 or pARC 16 (2) liposome-mediated DNA uptake (see, e.g., Freeman et al., Plant Cell Physiol. 25: 1353, 1984), (3) the vortexing method (see, e.g., Kindle, Proc. Natl.
Acad. Sci., USA 87: 1228, (1990).
DNA can also be introduced into plants by direct DNA transfer into pollen as described by Zhou et al., Methods in Enzymology, 101:433 (1983); D. Hess, Intern Rev. Cytol., 107:367 (I987); Luo et al., Plane Mol. Biol. Reporter, 6:165 (1988). Expression ofpolypeptide coding genes can be obtained by injection of the DNA into reproductive organs of a plant as described by Pena et al., Nature, 325.:274 (1987). DNA can also be injected directly into the cells of immature embryos and the rehydration of desiccated embryos as described by Neuhaus et al., Theor. Appl. Genet., 75:30 (1987); and Benbrook et al., in Proceedings Bio Expo 1986, Butterworth, Stoneham, Mass., pp. 27-54 ( 1986). A variety of plant viruses that can be employed as vectors are known in the art and include cauliflower mosaic virus (CaMV), geminivirus, brome mosaic virus, and tobacco mosaic virus.
Synthesis of Proteins Protease inhibitor proteins of the present invention can be constructed using non-20. cellular synthetic methods. Solid phase synthesis of protease inhibitor proteins of less than about 50 amino acids in length may be accomplished by attaching the C-terminal amino acid of the sequence to an insoluble support followed by sequential addition of the remaining amino acids in the sequence. Techniques for solid phase synthesis are described by Barany and Merrifield, Solid-Phase Peptide Synthesis, pp. 3-284 in The Peptides: Analysis, Synthesis, Biology. Yol. 2: Special Methods in Peptide Synthesis, Part A.; Merrifield, et al., J. Am. Chem. Soc. 85: 2149-2156 (1963), and Stewart et al., Solid Phase Peptide Synthesis, 2nd ed., Pierce Chem. Co., Rockford, Ill.
(1984). Also, the compounds can be synthesized on an applied Biosystems model 431 a peptide synthesizer using fastmocTM chemistry involving hbtu [2-(lh-benzotriazol-1-yl)-1,1,3,3-tetramethyluronium hexafluorophosphate, as published by Rao, et al., Int. J.
Pecr. Prot.
Res.; Vol. 40; pp. 508-515; (1992), Peptides can be cleaved following standard protocols and purified by reverse phase Applicant Ref. No.: 0571 R-PCT.app chromatography using standard methods. The amino acid sequence of each peptide can be confirmed by automated edman degradation on an applied biosystems 477a protein sequencer/120a pth analyzer. Protease inhibitor proteins of greater length may be synthesized by condensation of the amino and carboxy termini of shorter fragments.
Methods of forming peptide bonds by activation. of a carboxy terminal end (e.g., by the use of the coupling reagent N,N'-dicycylohexylcarbodiimide)) is known to those of skill.
Purification of Proteins The protease inhibitor proteins of the present invention may be purified by standard techniques well known to those of skill in the art. Recombinantly produced protease inhibitor proteins can be directly expressed or expressed as a fusion protein. The recombinant protease inhibitor protein is purified by a combination of cell lysis (e.g., sonication, French press) and affinity chromatography. For fusion products, subsequent digestion of the fusion protein with an appropriate proteolytic enzyme releases the desired 1 S recombinant protease inhibitor protein.
The protease inhibitor proteins of this invention, recombinant or synthetic, may be purified to substantial purity by standard techniques well known in the art, including selective precipitation with such substances as ammonium sulfate, column chromatography, immunopurification methods, and others. See, for instance, R.
Scopes, Protein Purification: Principles and Practice, Springer-Verlag: New York (1982);
Deutscher, Guide to Protein Purification, Acadf:mic Press (1990). For example, antibodies may be raised to the protease inhibitor proteins as described herein.
Purificatie~om E. coli can be achieved following procedures described in U.S.
Patent No. 4,511,503. The protein may then be isolated from cells expressing the protease inhibitor protein and further purified by standard protein chemistry techniques as described herein. Detection of the expressed protein is achieved by methods known in the art and include, for example, radioimmunoassays, Western blotting techniques, protease inhibition assays, or immunoprecipitation.
Trans>=enic Plant Re ~neration Transformed plant cells which are derivE:d by any of the above transformation techniques can be cultured to regenerate a whole plant which possesses the transformed . ~.,~:r~'. ~-..,-,__.
. . ..' ~. s . ~ .. .....
Applicant Ref. No.: Oi7lR-PCT.app genotype and thus the desired protease inhibitor content and/or composition phenotype.
Such regeneration techniques often rely on manipulation of certain phytohormones in a tissue culture growth medium, typically relying on a biocide and/or herbicide marker which has been introduced together with the protease inhibitor polynucleotide.
Plants cells transformed with a plant expression vector can be regenerated, e.g., from single cells, callus tissue or leaf discs according to standard plant tissue culture techniques. It is well known in th.e art that various cells, tissues, and organs from almost any plant can b~e successfully cultured to regenerate an entire plant. Plant regeneration from cultured protoplasts is described in Evans et al., Protoplasts Isolation and Culture, Handbook ofPlant Cell Carlture, Macmillilan Publishing Company, New York, pp. 124-176 (1983); and Binding, Regeneration ofPlants, Plant Protoplasts, CRC', Press, Boca Raton, pp. 21-73 (1985).
The regeneration of plants containing the foreign gene introduced by Agrobacterium from leaf explants can be achieved as described by Horsch et al., Science, 227:1229-1231 (1985 Regeneration can also be obtained from plant callus, explants, organs, or parts thereof. Such regeneration techniques are described generally in Klee et al., Ann. Rev. of Plant Phys. 38: 467-486 (1987For maize cell culture and regeneration see generally, The Maize Handbook, Freeling and Walbot, Eds., Springer, New York (1994); Corn and Corn Improvement, 3'd edition, Sprague and Dudley E,ds., American Society of Agronomy, Madison, Wisconsin (1988).
One of skill will recognize that after the recombinant expression cassette is stably incorporate~.n transgenic plants and confirmed to be operable, it can be introduced into other plants by sexual crossing. Any of a number of standard breeding techniques can be used, depending upon the species to be crossed.
In vegetatively propagated crops, mature transgenic plants can be propagated by the taking of cuttings or by tissue culture techniques to produce multiple identical plants.
Selection of desirable transgenics is made and new varieties are obtained and propagated vegetatively for commercial use. In seed propagated crops, mature transgenic plants can be self crossed to produce a homozygous inbred plant. The inbred plant produces seed containing the newly introduced heterologous nucleic acid. These seeds can be grown to 43 ~ - _ ;, : , Applicant Ref. No.: 057IR-PCT.app ~ ~ . , ,.
.. . . . . .. . , . .
. ~ . ... . . . .:. .
produce plants that would produce the selected phenotype, (e.g., altered protease inhibitor content or composition).
Parts obtained from the regenerated plant, such as flowers, seeds, leaves, branches, fruit, and the like are included in the invention, provided that these parts comprise cells comprising the isolated nucleic acid of the present invention. Progeny and variants, and mutants of the regenerated plants are also included within the scope of the invention, provided that these parts comprise the introduced nucleic acid sequences.
Transgenic plants expressing the selectable marker can be screened for transmission of the protease inhibitor nucleic acrid of the present invention by, for example, standard immunoblot and DNA detection techniques. Transgenic lines are also typically evaluated on levels of expression of the heterologous nucleic acid.
Expression at the RNA level can be determined initially to identify and quantitate expression-positive plants. Standard techniques for RNA analysis can be employed and include PCR
amplification assays using oligonucleotide primers designed to amplify only the heterologous RNA templates and solution hybridization assays using heterologous nucleic acid-specific probes. The RNA-positive plants can then analyzed for protein expression by Western immunoblot analysis using the protease inhibitor specific antibodies of the present invention. In addition, in situ hybridization and immunocytochemistry according to standard protocols can be done using heterologous nucleic acid specific polynucleotide probes and antibodies, respectively, to localize sites of expression within transgenic tissue. Generally, a number of transgenic lines are usually screened for the incorporated nucleic aci~"o identify and select plants with the most appropriate expression profiles.
A preferred embodiment is a transgenic plant that is homozygous for the added heterologous nucleic acid; i.e., a transgenic plant that contains two added nucleic acid sequences, one gene at the same locus on each chromosome of a chromosome pair.
A
homozygous transgenic plant can be obtained by sexually mating (selfing) a heterozygous transgenic plant that contains a single added heterologous nucleic acid, germinating some of the seed produced and analyzing the resulting plants produced for altered activity relative to a control plant (i.e., native, non-transgenic). Back-crossing to a parental plant and out-crossing with a non- transgenic plant are also contemplated.
~,rn'r~aF~ s~;~~r 75529-49(S) Protein structure and amino acid substitution It can be difficult to predict the ultimate effect of substitution on the tertiary structure and folding of the protein. Both tertiary structure and folding are critical to the stability and adequate expression of the protein in vivo. It is critical to undertake analysis and functional modeling of the wild type compound to determine whether substitutions can be made without disrupting biological activity.
The biological activity of a protein is dictated by its three dimensional structure which is intrinsically related to the folding of the protein. The folding of a protein into its functional domains is a direct consequence of the primary amino acid sequence.
While it is true that many proteins tolerate amino acid changes without affecting the folding or function of the protein, there is no a rp iori method of predicting which amino acid may be substituted or deleted without affecting the folding pathway. Each protein is unique and the folding process is necessarily an experimental determination. As has been concluded by Zabin et al., ("Approaches to Predicting Effects of Single Amino Acid Substitutions on the Function of a Protein"; Biochemistry; Vol. 30; pp. 6230-6240; 1991), neither the frequency of exchange of amino acids between homologous proteins nor any other measure of the properties of the amino acids are particularly useful by themselves in predicting whether a protein with an amino acid substitution will be functional. The scientific literature is replete with examples where seemingly conservative substitutions have resulted in major perturbations of structure and activity and vice versa, see e.g.;
Summers, et al., "A Conservative Amino Acid Substitution, Arginine for Lysine, Abolishes Export of a Hybrid Protein in E. Coli." J. Biol. Chem., Vol. 264, pp. 20082-20088, (1989); Ringe, D., "The Sheep in Wolfs Clothing" Nature, Vol. 339, pp.
6S8-659, (1989); Hirabayashi et al., "Effect of Amino Acid Substitution by Site-directed Mutagenesis on the Carbohydrate Recognition and Stability of Human 14-kDa (3-galactoside-binding Lectin," J. Biol. Chem., Vol. 266, pp. 23648-23653, (1991); and van Eijsden, et al., "Mutational Analysis of Pea Lectin: Substitution of Asn125 for Asp in the Monosachharide-binding Site Eliminates Mannose/Glucose -binding Activity,"
Plant Mol. Biol., Vol. 20, pp. 1049-1058 (1992).
The 3D structure of many proteins, including enzymes and protein inhibitors such as the barley chymotrypsin inhibitor has been solved. The three dimensional structure of a 75529-49(S) truncated fragment of CI-2 (with 65 residues) that is missing the N-terminal I8 residues has been determined by x-ray crystallography as well as by NMR spectroscopy (McPhalen, et al., Biochemistry; Vol. 26; pp. 261-269; (1987); and Clore, et al., Protein Ene.; Vol. 1, pp. 3I3-318; (1987)). In the wild type CI-2 the first 18 residues do not assume any ordered conformation and also do not contribute to the structural integrity of the molecule (see e.g. Kjaer, et al., Carlsberg Res. Commun.; Vol. 53; pp. 327-354;
(1987?. This polypeptide is found in the endosperm of grain and is isolated as an 83 residue protein with no disulfide bridges. See e.g. Jonassen, L, Carlsbere Res. Commun.; Vol. 45; pp. 47-48; (1980); and Svendsen, L, et al., Carlsberg Res. Commun.; Vol. 45; pp. 79-85; (1980). The 3D structure of CI-2 has been determined. See McPhalen, et al., 1987.
CI-2 is predominantly a (3-sheet protein, devoid of disulfide bonds and containing a wide loop of approximately 18 residues (residue 53-70 in the CI-2 molecule) in the extended conformation. This is the reactive site loop that contains a methionine I5 residue at position 59 which confers the property of chymotrypsin inhibition. A
constrained peptide containing these residues has been synthesized and shown to retain full chymotrypsin inhibitory activity. See Leatherbarrow, et al., Biochem., Vol. 30, pp.
10717-10721 (1991). In the absence of any disulfide bonds, the integrity of the reactive site loop is maintained by strong hydrogen bond interactions between G1u60 -~
Arg65 and Thr58 -~ Arg67. Mutants of CI-2 in which Thr58 and G1u60 have been replaced with Ala are not only less stable proteins but also have little or no protease inhibitory activity.
See Jackson, et al., Biochem., Vol. 33, pp. 13880-13887 (1994); and Jandu, et al., Biochem., Vol. 33, pp. 6264-6269 (1990). These studies have demonstrated that the reactive site loop is a key structural feature essential for the function of protease inhibition.
Molecular Markers The present invention provides a method of genotyping a plant comprising a protease inhibitor polynucleotide. Preferably, the plant is a monocot, such as maize or sorghum. Genotyping provides a means of distinguishing homologs of a chromosome pair and can be used to differentiate segregants in a plant population.
r Appli.;ant Ref. No.: 05718-PCT.app . . . , " , .. ., . . . . , . , .
. . . ,.. . , . , . . . . , . ,. ., , , , Molecular marker methods can be used for phylogenetic studies, characterizing genetic relationships among crop varieties, identifying crosses or somatic hybrids, localizing chromosomal segments affecting monogenic traits, map based cloning, and the study of quantitative inheritance. See, e.g., Plant Molecular Biology: A Laboratory Manual, Chapter 7, Clark, Ed., Springer-Verlag, Berlin (1.997). For molecular marker methods, see generally, The DNA Revolution by Andrew H. Paterson 1996 (Chapter 2) in:
Genome Mapping in Plants (ed. Andrew H. Paterson) by Academic Press/R. G. Landis Company, Austin, Texas, pp.7-21.
Detection of Protease Inhibitor Nucleic Acids The present invention further provides methods for detecting protease inhibitor polynucleotides of the present invention in a nucleic acid sample suspected of comprising a protease inhibitor polynucleotide, such as a plant cell lysate, particularly a lysate of corn. In some embodiments, a proteasE; inhibitor gene or portion thereof can be amplified prior to the step of contacting the nucleic acid sample with a protease inhibitor polynucleotide. The nucleic acid sample is contacted with the protease inhibitor polynucleotide to form a hybridization complex. The protease inhibitor polynucleotide hybridizes under stringent conditions to a gene encoding a protease inhibitor polypeptide.
Formation of the hybridization complex is used to detect a gene encoding a protease inhibitor polypeptide in the nucleic acid sample. Those of skill will appreciate that an isolated nucleic acid comprising a protease inhil:>itor polynucleotide should lack cross-hybridizing sequences with non-protease inhibitor genes that would yield a false positive result.
Detection of the hybridization complex can be achieved using any number of well known methods. For example, the nucleic acid sample, or a portion thereof, may be assayed by hybridization formats including but not limited to, solution phase, solid phase, mixed phase, or in situ hybridization assays.
Protease Inhibitor Protein Immunoassays 47 . , .~,~-., t~pplicant Ref. No.: 0571 R-PCT.app .. , , .. . , . . . ... . . . ..
. . . . . .
.. .. ... .... ,.
Means of detecting the protease inhibitor proteins of the present invention are not critical aspects of the present invention. In a preferred embodiment, the protease inhibitor proteins are detected and/or quantified using any of a number of well recognized immunological binding assays (see, e.g., U.S. Patents 4,366,241; 4,376,110;
4,517,288;
and 4,837,168). For a review of the general immunoassays, see also Methods in Cell Biology, Vol. 37: Antibodies in Cell Biology, Asai, Ed., Academic Press, Inc.
New York (1993); Basic and Clinical Immunology 7th Edition, Stites & Ten, Eds. (1991).
D. Other Assay Formats In a particularly preferred embodiment, Western blot (immunoblot) analysis is used to detect and quantify the presence of protease inhibitor protein in the sample. The technique generally comprises separating sample proteins by gel electrophoresis on the basis of molecular weight, transferring the separated proteins to a suitable solid support, (such as a nitrocellulose i:llter, a nylon filter, or derivatized nylon I 5 filter), and incubating the sample with the antibodies that specifically bind protease inhibitor protein. The anti-protease inhibitor protein antibodies specifically bind to protease inhibitor protein on the solid support. 'These antibodies may be directly labeled or alternatively may be subsequently detected using labeled antibodies (e.g., labeled sheep anti-mouse antibodies) that specifically bind to ~,he anti-protease inhibitor protein.
E. Quantifrcation of Protease inhibitor Proteins.
Protease inhibitor proteins may b~e detected and quantified by any of a number of~ans well known to those of skill in the art. These include analytic biochemical methods such as electrophoresis, capillary electrophoresis, high performance liquid chromatography (HPLC), thin layer chromatography (TLC), hyperdiffusion chromatography, and the like, and various immunological methods such as fluid or gel precipitin reactions, immunodiffusion (single or double), immunoelectrophoresis, radioimmunoassays (RIAs), enzyme-linked imnlunosorbent assays (ELISAs), immunofluorescent assays, and the like.
48 , 7559-49 (S) Example 1: Isolation of DNA Coding for Protease inhibitor Protein from Zea mays or other plant library The polynucleotides having DNA sequences given in SEQ ID Nos: 15, 17, 19, 21, and 23 were obtained from the sequencing of cDNA clones prepared from maize.
SEQ ID NO 15 is a contig comprised of 28 cDNA clones. 20 of the cDNA clones were from libraries prepared from leaves treated with jasmonic acid. One was from a root library. Four were from libraries prepared from corn rootworm-infested roots.
One was from a tassel library. One was from a library prepared from seedlings recovering from heat shock. One was from a shoot culture library.
SEQ ID NO 17 is a contig comprised of two cDNA clones. One was from a jasmonic acid treated leaf library. The other was from an induced resistance leaf library.
SEQ ID NO 19 is a contig comprised of two cDNA clones. One was from a germinating maize seedling library. The other was from jasmonic acid treated leaf library.
SEQ ID NO 21 is a contig comprised of 4 cDNA clones. All four were from libraries prepared from jasmonic acid treated leaves.
SEQ ID NO 23 is a contig comprised of two cDNA clones. One was from a library prepared from silks, 24 hours post pollination. The other was from a library prepared from root tips less than 5 mm in length.
One skilled in the art could apply these same methods to other plant nucleotide containing libraries.
Example 2: Engineering BHL for nutritional enhancement Wild type CI-2 (from barley) contains 49.4% essential amino acids (41/83) and 9.6% lysine (8/83). Using the strategies outlined below, six different BHL
variants with increasing amounts of lysine have been proposed. The lysine percentages are 21.5%, 24.1%, 23.1%,and 25.3%, for BHL-1, BHL-1N, BHL-2, BHL-2N, BHL-3, and BHL-3N, respectively. Construct BHL-1N contains the same eight substitutions as BHL-1, plus lysine substitutions in the 18 additional amino acid residues in the amino terminal region. BHL-2 is the same as BHL-1 but with changes of amino acid residues 40 and 42 Applicant Ref. No.: 057IR-PCT.app . . ~~ .. . ., . . .. . . . . .
. . . .~.
. . . .
.. .. .., .~,. .. .
to Ala and amino acid residue 47 to lysine. Construct BHL-2N contains the same substitutions as BHL-2, plus four lysine substitutions in the 18 additional amino acid residues in the amino terminal region. BHL-3 is the same as BHL-2 except that residues 40 and 42 are changed to Gly and His, respectively. Construct BHL-3N contains the same 11 substitutions as BHL-3, plus the four lysine substitutions in 18 additional amino acid residues in the amino terminal region. Onc: skilled in the art will realize that essential and non-wild-type amino acid residue substitutions will be tolerated at both the same positions substituted with lysine, and at other positions.
The active site loop region encompasses an extended loop region from about amino acid residue 53 to about amino acid residue 70. Destabilization of the reactive loop was achieved by substituting the non-wild type amino acids residues at about positions 53 to about 70. Amino acid residues were changed by primer mutagenesis.
Preferably, the following mutations are made: Arg62 ~ Lys62, Arg65 -~ Lys65, Arg67 -~ Lys67, Thr58 -~ A1a58 or G1y58, Met59 --~ Lys59, and GIuE~O ~ A1a60 or His60. However, it will be readily apparent to one skilled in the art that functionally equivalent substitutions to those described above will also be effective in the present invention.
In a preferred embodiment of the prese;nt invention, the present protein has both elevated essential amino acid content and reduced protease inhibitor activity.
Modification in the area by amino acid substitution or other means, destroys the hydrogen bonding and changes or reduces the protease inhibitor activity of BHL.
Substitution of amino acid residues threonine, at position 58, and glutamic acid, at position 60, with glycine and histidine, respecaively, resulted in a protein with lowered protease iitor activity. Residue 59 is a critical residue in modifying protease inhibitor activity and changing specificity. When this residue was changed to a lysine, the protease inhibition specificity was changed from a chymotrypin inhibitor to a trypsin inhibitor.
The present invention provides for the creation of a nutritionally enhanced feed from WT CI-2 through at least one lysine substitution of residues 1,18,11,17,19,34,41,56,59,62,67 and 73 (long versions BHL-1N, 2N, 3N) plus residue 67 in BH2-2N and BH2-3N. Lysine substitutions in BHL-1,2 and 3 are at amino acid residues 1,16,23,41,44,49 and 55, plus residue 47 in BHL-2 and BHL-3.
Example 3- Construction of Expression Cassettes 50 t~,%~ :rJV!~ SNP
h Applicant Rcf. No.: 0571 R-PCT.app ., ..
,.
. . , , . , . .
..
. ~ , . , . . . ,.
. . . .
..
. . ., .. ... .. . ..
Vector construction was based upon the; published WT CI-2A sequence information Williamson et al, Eur. J. Biochem 165: 99-106 (1987) and SEQ ID NO
13.
Methods for obtaining full length or truncated wild-type CI-2 DNA include, but are not limited to PCR amplification, from a barley (or other plant ) endosperm cDNA
library using oligonucleotides derived from Seq. ID no 13 or from the published sequence supra, using probes derived from the same on a barley (or other plant ) endosperm cDNA
library, or using a set of overlapping oligonuclc:otides that encompass the gene.
The BHL-1 insert corresponds to SEQ ID NO '.l, plus start and stop codons.
Oligonucleotide pairs, N4394/N4395, and N4396/N4397, were annealed and ligated together to make a 202 base pair double strandf:d DNA molecule with overhangs compatible with Rca I and Nhe I restriction sites. PCR was performed on the annealed molecule using primers N5045 and N5046 to add a 5' Spe I site and 3' Hind III
site. The PCR product was then restriction digested at those sites and ligated into pBluescript II
KS+ at Spe I and Hind III sites. The insert was then removed by restriction digestion with Rca I and Hind III and was ligated into the Nco I and Hind III sites of pET28a (Novagen) to form the BHL-1 construct.
Oligonucleotide and primer sequences (5' to 3'):
TCGGTGGAGA
AATCATAGTT
CAGGATCACC
S 1 TTCTTGGCTT TCTCCACCGA TTTC:CCCACC AACTCCGGCC
ACTCTGTCTT
51 _ r~ ~ .err.
..;';L':; ~:i_cj Applicant Ref. No.: 05718-PCT.app . ,. . ~ .. , .. ., . , , . ~ , . , ..
. . . . . .., . ~ . . .
. . . , . .
. .. ., . >.
CGTCAAGCTC
51 TTTGTGGATA AAAAGGACAA CA7'CGCGCAG GTCCCCAGGG TCGG
TCCACAAAGA
51 GCTTGACGCG GTCGATCTTA TAT"TCCTTCG TCACCTTTGT AC
b. BHL-2: The BHL-2 construct insert corresponds to SEQ ID NO 3, plus start and stop codons. An overlap PCR strategy was used to make the BHL-2 construct.
PWO
polymerase from Boehringer-Mannheim was used for all PCR reactions.The primers were chosen to change 3 amino acids in the BHL-1 active site loop region, and to create unique AgeI and Hind III restriction sites flanking the active site loop, to facilitate loop replacement iri future constructs. A unique Rca I site (compatible with Nco I) was included at the S' end, and a unique Xho I site was included at the 3' end.
The overlap PCR was done as follows: PCR was done with primers N13561 and N13564, using the BHL-1 construct as template. A separate PCR was done with primers N13563 and N13562, again using the BHL-1 construct as template. The products from both reactions were gel purified and combined. Primer N13565, which overlapped regions on both of the PCR products, was then added and another l?CR was done to generate the full-length insert. The resulting product was amplified by ~~nother PCR with primers N13561 and N13562. It was subsequently suspected that a deletion was present in N13562 that caused a frameshift near the 3' end of the PCR product. To avoid this frameshift problem, a final .. _. .~- , , -j~ ~ .
Applicant Ref. No.: 05718-PCT.app ., ,. . ., ,. .
. . . ,. .
. . , ~ . . , . .., .
. . . . . . . . .
.. ,. m. ",. ~. .
PCR reaction was done with primers N13562 and N13905. The final PCR product was digested with Rca I and Xho I, and then ligated into the Nco I and Xho I sites of pET 28b.
Note: Some primers had 6-oligonucleotide extensions to improve restriction digestion efficiency.
Primer sequences (5' TO 3'):
N13562 (as ordered) 1 ATCGACAAGGTCAAGCTTTTTC~TGGATAAAAAGGA
1 GTTGGTACAAAGGTGGCGAAG~GCCTATAAGATCGACAAGGTCAAG
c. BHL-3: The BHL-3 construct insert corresponds to SEQ ID NO 5, plus start and stop codons. The BHL-2 construct was digested with Age I and Hind III, and the region between these sites was removed by gel purification. Oligonucleotide pairs, N14471 and N 14472, were annealed to make a double stranded DNA molecule with overhangs compatible with Age I and Hind III restriction sites. The annealed product was ligated into the Age I and Hind III sites of the digested BHL-2 construct to yield the construct.
Oligonucleotide Primer sequences (5' to 3'):
d. BHL-I N, BHL-2N, and BHL-3N
The BHL-1N, BHL-2N, and BHL-3N construct inserts correspond to SEQ ID No 9, SEQ
ID NO 11, and SEQ ID NO 7, respectively, plus start and stop codons. Three separate PCR reactions were done with either the BHL-1, BHL-2, or BHL-3 constructs as template. The primers for these reactions were N13771 and N13905. The resulting PCR
products were digested with Rca I and Xho I and ligated into the Nco I and Xho I sites of pET 28b to yield the BHL-1N, BHL-2N, and BI~IL-3N constructs.
Primer sequences (S' to 3'):
A~,~ENDED SHEET
75529-49(S) TTTTTTTCATGAAGTCGGTGGAGAAGAAACCGAAGGGTGTGAAGACAGG
N13905 (already provided in BHL-2 description) BHL-1N is an 83 residue polypeptide in which residues 1,8,11, and 17 were also replaced with lysine. The resulting compound has the protein sequence indicated in Sequence LD. No.lO.
BHL-2N is an 83 residue polypeptide in which residues 1,8,11, and 17 were also replaced with lysine. The resulting compound has the protein sequence indicated in Sequence LD. No.l2.
BHL-3N is an 83 residue polypeptide in which residues 1,8,11, and 17 were also replaced with lysine. The resulting compound has the protein sequence indicated in Sequence LD. No.B.
Example 3 - Expression of BHL-1 in E. colt E.rpression in E. colt BHL-I, BHL-2, BHL-3, BHL-3N, and the truncated wild-type CI-2 (residues 19 through 6~ of SEQ ID NO. 14) were expressed in E colt using materials and methods from Novagen, Inc. The Novagen expression vector pET-28 was used (pET-28a for WT CI-and BHL-1, and pET-28b for the other proteins). Ecoli strains BL21 (DE-3) or BL21 (DE-3)pLysS were used. Cultures were typically grown until an OD at 600 nm of 0.8 to 1.0, and then induced with 1 mM IPTG and grown another 2.5 to 5 hours before harvesting.
Induction at an OD as low as 0.4 was also done successfully. Growth temperatures of 37 degrees centigrade and 30 degrees centigrade were both used successfully. The media used was 2xYT plus the appropriate antibiotic at the concentration recommended in the Novagen manual.
Purification a. WT CI-2 (truncated)-- Lysis buffer was 50 mM Tris-HCI, pH 8.0, 1 mM EDTA, 150 mM NaCI. The protein was precipitated with 70% ammonium sulfate. The pallet was dissolved and dialyzed against 50 mM Tris-HCI, pH 8.6. The protein was ~.oa.d~d onto a Hi-Trap Q column, and the unbound fraction was collected and precipitated i~~ 70%
ammonium sulfate. The pellet was dissolved in 50 mM sodium phosphate, pH 7.0, Trade-mark 54 75529-49(S) mM NaCI, and fractionated on a Superdex-75 26/60 gel filtration column.
Fractions wire pooled and concentrated.
b. BHL-I--Lysis buffer was 50 mM sodium phosphate, pH 7.0, 1 mM EDTA.
The protein was loaded onto an SP Sepharose FF 16/10 column, washed with 150 mM
NaCI in 50 mIVI sodium phosphate, pH 7.0, and then eluted with an NaCI
gradient in SO
mM sodium phosphate. BHL-1 eluted at approximately 200 mM NaCI. Fractions were pooled and concentrated.
c. BHL-2, BHL-3, and BHL-3N--Lysis buffer was 50 mM Hepes, pH 8.0, 2mM
EDTA, 0.1% Triton X-100, and 0.5 mg/m1 Iysozyme. The protein was loaded onto an SP-Sepharose cation exchange column (typically a 5 to 10 ml size), washed with 150 mM
NaCI in 50 mM sodium phosphate, pH 7.0, and eluted with 500 mM NaCI in 50 mM
sodium phosphate, pH 7Ø The protein was concentrated and then subjected to Superdex-75 gel filtration chromatography twice.
d. BHL-1--Lysis buffer was SO mM sodium phosphate, pH 7.0, 1 mNl EDTA.
The protein was loaded onto an SP Sepharose FF 16/10 column, washed with 150 mM
NaC1 in 50 mM sodium phosphate, pH 7.0, and then eluted with an NaCI gradient in 50 mM sodium phosphate. BHL-I eluted at approximately 200 mNI NaCI. Fractions were pooled and concentrated.
e. BHL-2, BHL-3, and BHL-3N--Lysis buffer was 50 mM Hepes, pH 8.0, 2miVI
EDTA, 0.1% Triton X-100, and 0.5 mg/ml lysozyme. The protein was loaded onto an SP-Sepharose cation exchange column (typically a 5 to 10 ml size), washed with 150 mM
NaCI in 50 mM sodium phosphate, pH 7.0, and eluted with 500 mM NaCI in 50 mM
sodium phosphate, pH 7Ø The protein was concentrated and then subjected to Superdex-75 gel filtration chromatography twice.
4. Storage The purified proteins were stored long term by freezing in liquid nitrogen and keeping frozen at -70 degrees centigrade.
5. Verification of recombinant protein identity.
a. DNA sequencing--The insert region of these pET 28 constructs was co~rmed by DNA sequencing.
b. N-terminal protein sequencing --Trade-mark 55 Appl'~.cant Ref. No.: 0571 R-PCT.app ..
.. . . .
~ ,j~
. . . . ..~ . ~ .
~ . ,. ..
. . .. ..~ ... ,~.. ,. ..
100 pg of purified BHL-3 were digested with 1 pg of chymotrypsin (Sigma catalog # C-4129) for 30 min at 37 degrees centigrade in 50 mM sodium phosphate, pH 7Ø
The resulting chymotryptic fragments were purified by reversed phase chromatography, using an acetonitrile gradient for elution. Three pure peaks were observed and were sent to the University of Michigan Medical School Protein. Structure Facility for N-terminal sequencing (6 cycles). Peak 1 had an N-terminal sequence of val-asp-lys-lys-asp-asn.
Peak 2 had an N-terminal sequence of lys-ile-as.p-lys-val-lys. Peak 3 had an N-terminal sequence of met-lys-leu-lys-thr-glu. These results demonstrate that chymotrypsin cleaved BHL-3 after tyr-61 and phe-69. The N-terminal sequences all match exactly the expected sequence, assuming that the start methionine was largely retained in the recombinant protein. This experiment verifies that the protein we expressed in and purified from E. coli was BHL-3. Furthermore, SDS-PAGE analysis with 16.5%
Tris-Tricine precast gels from Biorad showed a similar mobility of BHL-1 and BHL-2 with the confirmed BHL-3 protein, as would be expected because BHL-1 and BHL-2 have molecular masses very similar to that of BHL-3.
160 ~g of BHL-3N were digested with 1.6 ~g pepsin overnight, and the resulting peptic fragments were purified by reversed phase chromatography. Five of the resulting peaks were sent to the Iowa State University Protein Facility for N-terminal sequencing through four cycles. The N-terminal sequences of the 5 peaks were: val-gly-lys-ser, phe-val-asp-lys, pro-val-gly-thr, met-lys-ser-val, and ile-ile-val-leu, all of which exactly match the expected BHL-3N sequence, assuming that 'the start methionine was largely retained in this recombinant protein. This experiment verifies that the protein we expressed in and purified fre~E. coli was BHL-3N.
c. Protease inhibition--The obvious protease inhibitory activity observf:d for BHL-1 and for the wild-type protein are further evidence that we have purified the a};pected proteins from E coli.
The details of these protease inhibition experiments are described next.
r ~.,._~ r;; .Tn c~~c-;~
75529-49 (S) The following experiments utilized truncated wild type CI-2 as represented as nt. SS-249 in Seq. ID NO. 13 with addition of start and stop codons.
E~camgle S - Protease Inhibition assays and Proteolitic Digests S a. Chymotrypsin Protease activity was measured by an increase in absorbance at 40S nm.
Sigma Chymotrypsin type iI (Bovine pancreas) Cat. # C-4129.
Substrate - Sigma cat. # .~-7388. N-Succinyl-Ala-Ala-Pro-phe-p vitro anilide or BHL protein used, l nM chymotrypsin, 1mM substrate, 200 ul vol~ne luM BSA included in control (no CI-2, no BHL).
Preincubated 30 min 37 C., then added substrate to start and kept at 37 C.
Buffer 0.2M tris - HCl pH 8.0 Read Abs 40S nm - 30 min Protease Activity - % of Control ABS. 40S nm Abs. At 405 nm Rep. 1 Rep. 2 Mean (S.D.) Using % control data Controll-value 0.350 0.299 control 100.0 100.0 100.0 WT CI-2-value .042 .018 control 12.0 6.0 9.0 (4.2) BHL-1-value .289 .274 control 82.6 91.6 87.1 (6.4) BHL-2-value .309 .318 control 88.3 106.4 97.4 (12.8) BHL-3-value .346 .31 S
control 98.9 lOS.4 102.2 (4.6) BHL-3N-value .318 .31 S
control 90.9 lOS.4 98.2 (10.3) 75529-49 (S) b. Subtilisin Subtilisin Carlsberg lichenif'ormis (Sigma cat.
from Bacillus # P-5380) Substrate and buffer same as for chymotrypsin exper. 200 ul reaction volume 1 uM CI2 or BHL
1nM subtilisin 1mM Substrate room temp (25 C) 30 min. preincubatedthen added substrate and read absorbance at 40~nm 30 min. data used luM BSA used or BHL) in control (no Abs. At 405 nm Rep. 1 Rep. 2 Mean (S.D.) Using % ontrol c data Controll-value 2.171 1.834 control 100.0 100.0 100.0 WT CI-2-value .014 .002 control 0.6 0 0.3 (0.4) BHL-1-value .286 .295 control 13.2 16.1 14.7 {2.1) BHL-2-value 1.692 1.569 control 77.9 85.6 81.8 (5.4) BHL-3-value 7.056 1.960 control 94.7 106.9 100.8 (8.6) BHL-3N-value 2.103 1.729 control 96.9 94.3 95.6 (1.8) 75529-49(S) c. Trypsin Bovine pancreas trypsin (Sigma cat #T-8919) Substrate S-2222 (chromogenix): N-benzoyl-2-isolenuel-Lglutamyl-glycyl-L-arginine-p-nitroaniline buffer: SOmMTris pH 7.5, 2mM NaCI, 2mM CaCl2, 0.005 % TritonX-100.
30 min. preincubation 25°, then added substrate and kept at 25°;
these are 30 minute values.
1 mM substrate, SuM CI-2 or BHL, O.SnM trypsin, no BSA in control. 200 ul reaction volume .
Abs. At 405nm Rep. 1 Rep. 2 Rep. 3 Rep. 4 Mean (S.D.) Using Control Data Control l- .505 .533 .473 .391 value control 100.0 100.0 100.0 100.0 100.0 WT CI-2- .561 .533 .474 .420 value control 111.1 100.0 100.2 107.4 104.7 (5.5) BHL-1-value .072 .096 .041 .057 control 14.3 18.0 8.7 14.6 13.9 (3.9) BHL-2-value .436 .481 .404 .405 control 86.3 90.2 85.4 103.5 91.4 (8.4) BHL-3-value .536 .557 .456 .430 control 106.1 104.5 96.4 110.0 104.3 (5.7) BHL-3N- .542 .583 .490 .437 value control 107.3 109.4 103.6 111.8 108.0 (3.5) Trade-mark Applicant Ref. No.: 05718-PCT.app ,. , . . .
.. .. . . . .. . . , , , . . , . . . , . , .
. , . . , , ,. ., .,. .". ,. ..
d. Elastase Porcine elastase IV (Sigma) Type Cat#
Substrate: Sigma S-4760 N-succinyl-ala-ala-ala-p-nitroanile buffer: 0.2M Tris HC1 pH 8.0 200 ul reactive volume 50nM elastase, 2 uM CI-2 or BHL;
1mM substrate luM BSA in control min. preincub, , then ubstrate. Kept at 25; 30 added min. data s Abs. At 405 nm Rep. 1 Rep. 2 Mean (sp) Using % control data Control 1-value 1.416 1.461 control 100.0 100.0 100.0 WT CI-2-value .030 .049 control 2.1 3.4 2.8 (0.9) BHL-1-value 1.519 1.459 control 107.3 99.9 103.6 (5.2) BHL-2-value 1.558 1.509 control 110.0 103.3 106.7 (4.7) BHL-3-value 1.587 1.493 control 112.1 102.2 107.2 (7.0) BHL-3N-value 1.527 1.481 control 107.8 101.4 104.6 (4.5) r !
75529-49(S) protease inhibition summary - % of control Protein Chymotrypsin Trypsin Elastase Subtilisin WT CI-2 9.0 104.7 2.8 0.3 BHL-1 87.1 13.9 103.6 14.7 BHL-2 97.4 91.4 106.7 81.8 BHL-3 102.2 104.3 107.2 100.8 BHL-3N 9$.2 108.0 104.6 95.6 These experiments show that BHL-2, BHL-3 and BHL-3N have reduced protease inhibition activity compared to WT CI-2 .
Digestion by trypsin The purified proteins were incubated at 37 degrees centigrade with a 100:1 (wt:wt) ratio of BHL protein or wild-type CI-2 : trypsin for l5min, 30 min, 1 hr, 2 hr, or 4 hr.
Incubation buffer was 50 mM sodium phosphate, pH 7Ø Bovine pancreas trypsin was used (Sigma catalog # T-8918). Digestion was assessed by SDS-PAGE with 16.5%
Tris-Tricine precast gels from Biorad. The BHL-2, BHL-3, and BHL-3N proteins were digested by trypsin in 15 minutes. In contrast, the BHL-1 and wild-type truncated CI-2 proteins were resistant to trypsin. This experiment confirmed that the BHL-2, BHL-3, and BHL-3N proteins are not effective inhibitors of trypsin.
Digestion by chymotrypsin.
The purified proteins were incubated at 37 degrees centigrade with a 100:1 (wt:wt) ratio of BHL protein or wild-type CI-2 : chymotrypsin for 15min, 30 min, 1 hr, 2 hr, ox 4 hr. Incubation buffer was 50 mM sodium phosphate, pH 7Ø Bovine pancreas chymotrypsin type II (Sigma catalog # S-7388 was used. Digestion was assessed by SDS-PAGE with 16.5% precast Tris-Tricine gels from Biorad. BHL-2, BHL-3, and BHL-3N proteins were digested by chymotrypsin in 15 minutes. In contrast, BHL-1 and wild-type CI-2 proteins were resistant to chymotrypsin. This experiment confirrraed that BHL-2, BHL-3, and BHL-3N are not effective inhibitors of chymotrypsin.
Digestion in simulated gastric fluid.
Applicant Ref. No.: 05718-PC'T.app .. , . .. . ,.
.. , . . .. . , , .
. . . ~ . ... . . . . ,., , . . . , .
. ,. ., ... ..,. .. ., Simulated gastric fluid was prepared by dissolving 20 mg NaCI and 32 mg of pepsin in 70 pl of HCl plus enough water to make 10 ml. Porcine stomach pepsin (Sigma cat # P-6887) was used. SO ~l of 1 mg/ml BHL-3N or wild-type CI-2 protein were incubated with 250 pl simulated gastric fluid at :37 degrees centigrade. At 1 S sec, 30 sec, 1 min, 5 min, and 30 min, 40 ~1 aliquots were removed to a stop solution consisting of 40 ~l 2X Tris-Tricine SDS sample buffer (Biorad) that also contained 3 pl of 1 M
Tris-HC1, pH 8.0 and 0.1 mg/ml pepstatin A (Boehringer-lvlannheim cat # 60010).
Digestion was assessed by 16.5% Tris-Tricine SDS-PAGE (pre:cast gels from Biorad).
Both BHL-3N and wild-type CI-2 were digested in simulated gastric fluid in 15 seconds. This experiment suggests that our engineered proteins and even the wild-type protein would likely be digested into proteolytic; fragments in the stomach of humans or monogastric animals.
Digestion in simulated intestinal fluid.
Simulated intestinal fluid was prepared by dissolving 68 mg of monobasic potassium phosphate in 2.5 ml of water, adding 1.9 ml of 0.2 N sodium hydroxide and 4 ml of water. Then 2.0 g porcine pancreatin (Sigma catalog # P-7545) was added and the resulting solution was adjusted with 0.2N sodium hydroxide to a pH of 7.5.
Water was added to make a final volume of 10 ml.
50 ~g of BHL-3N or wild-type CI-2 protein in 50 pl were incubated with 250 ~1 simulated-~estinal fluid at 37 degrees centigrade . At 15 sec, 30 sec, 1 min, 5 min, and min, 40 pl aliquots were removed and added to 40 pl of a stop solution consisting of 25 2X Tris-Tricine SDS sample buffer (Biorad) containing 2 mM EDTA and 2mM
phenylmethylsulfonyl fluoride (Sigma catalog 3# P-7626). Digestion was assessed by 16.5 Tris-Tricine SDS-PAGE (precast gels form l3iorad).
BHL-3N was digested by simulated, intcatinal fluid in 15 seconds. In contrast, 30 wild-type CI-2 was resistant to digestion for 30' minutes. This experiment shows that in the intestine of humans or monogastric animal~~, our engineered protein would likely be more digestible than the wild-type protein would be. These results are consistent with the Appl~,cant Ref. No.: 0~71R-PCT.app . ~. . . . ,.
. . . , . . . , ..~ . . . ", ,.
. . , . .
. . " , .., .,.. ,.
protease inhibition assays showing that BHL-3N was not an effective protease inhibitor.
The inventive protein was digested in less than i:me minutes, less than one and less than 30 seconds.
Digestion in simulated gastric fluid Simulated gastric fluid was prepared by dissolving 20 mg NaCI and 32 mg of pepsin in 70 ~l of HCl plus enough water to make 10 ml. Porcine stomach pepsin (Sigma cat # P-6887) was used. 50 ~l of 1 mg/ml BHL-3N or wild-type CI-2 were incubated with 250 ~l simulated gastric fluid at 37 degrees centigrade. At 15 sec, 30 sec, 1 min, 5 min, and 30 min, 40 ~1 aliquots were removed to a stop solution consisting of 40 ul 2X
Tris-Tricine SDS sample buffer (Biorad) that al.>o contained 3 pl of 1 M Tris-HCI, pH 8.0 and 0.1 mg/ml pepstatin A (Boehringer-Mannheim cat # 60010). Digestion was assessed by 16.5% Tris-Tricine SDS-PAGE (precast gels. from BioradTM).
Both BHL-3N and wild-type CI-2 were digested in simulated gastric fluid in 15 seconds. This experiment suggests that our engineered proteins and even the wild-type protein would likely be digested into proteolytic: fragments in the stomach of humans or monogastric animals.
Digestion in simulated intestinal fluid.
Simulated intestinal fluid was prepared by dissolving 68 mg of monobasic potassium phosphate in 2.5 ml of water, adding 1.9 ml of 0.2 N sodium hydroxide and 4 ml of water. Then 2.0 g porcine pancreatin (Sig:ma catalog # P-7545) was added and the resulting sel.~ion was adjusted with 0.2N sodium hydroxide to a pH of 7.5.
Water was added to make a final volume of 10 ml.
50 ~1 of lmg/ml BHL-3N or wild-type CI-2 were incubated with 250 ~1 simulated intestinal fluid at 37 degrees centigrade . At 15 sec, 30 sec, 1 min, 5 min, and 30 min, 40 ~l aliquots were removed and added to 40 ~1 of a stop solution consisting of 2X Tris-Tricine SDS sample buffer (Biorad) containing 2 mM EDTA and 2mM
phenylmethylsulfonyl fluoride (Sigma catalog #~ P-7626). Digestion was assessed by 16.5 Tris-Tricine SDS-PAGE (precast gels form Biorad).
A?~~ENDED SHEET
Applicant Ref. No.: 05718-PCT.app ,.
. . . . ,. . , .
. . . ~ .., . . ~, .
. . , .
.. .. ... .... " ..
BHL-3N was digested by simulated intestinal fluid in 15 seconds. In contrast, wild-type CI-2 was resistant to digestion for 30 minutes. ~Chis experiment shows that in the intestine of humans or monogastric animals, our engineered protein would likely be more digestible than the wild-type protein would be. 'These results are consistent with the protease inhibition assays showing that BHL-3I~1 was not an effective protease inhibitor.
The inventive proteins were digested in less than five minutes, less than one minute and less than 30 seconds.
Example 6 - Protein Conformation Wild type CI-2, BHL-I, BHL-2, BHL-3 and BHL-3N at proteins concentrations of approximately 0.16mg/ml in IOmM sodium phosphate, pH = 7.0 were prepared and sent to the University of Michigan Medical Scho~~l Protein Structure Facility for circular dichroism analysis. Data indicates that the substituted proteins BHL-1, BHL-2 and BHL-3 have very similar CD spectra confirming that the BHL proteins fold into a structure similar to the wild type CI-2.
Example 7 - Thermodynamic stability Equilibrium denaturation experiments were done to assess the thermodynamic stability of the engineered and wild-type proteins, following the method of Pace et al.
(Meth. Enzym. 131:266-280). The engineered or wild-type proteins at a concentration of 2 pM were incubated 18 hours at 25 degrees centigrade in 10 mM sodium phosphate, pH
7.0, with various concentrations of guanidine-hydrochloride. Unfolding of the proteins was monite~l by measuring intrinsic fluorescence at 25 degrees centigrade, using an excitation wavelength of 280 nm and an emission wavelength of 356 nm. The guanidine-hydrochloride concentration sufficient for 50% unfolding was found to be 3.9M
for wild-type, 2.4M for BHL-1, and 0.9M for BHL-2, BI:IL-3, and BHL-3N. These experiments showed that BHL-1 has a higher thermodynamic stability than do the other engineered proteins, but that all of the engineered proteins lhave a lower thermodynamic stability than does the wild-type protein.
Example 8 - Accessibility of the Tryptophan of BHL Proteins to Acrylamide ~It~Elw~~~~ Si'I~C
Applicant Ref. No.: 05718-PCT.app .. , , .. . . . . . .~ . . . . . .
. . . . .,. , . . . ". ,.
~ . . , . ~ , .
.. .. ... .." .. ..
Acrylamide effectively quenches the fluorescence of accessible tryptophan residues in proteins. We examined fluorescence quenching of the tryptophan residue of the BHL
proteins and of the truncated WT CI-2, in the prcaence or absence of 6M
guanidine-hydrochloride. An excitation wavelength of 295 nm was used. Emission wavelengths of 337 nm and 356 nm were used for the samples v~ithout guanidine-HCl and with guanidine-HCI, respectively. Protein concentrations of 20 pM or 2 pM were used for the samples without, and with guanidine-HCI, respectively. Samples were in 10 mM
sodium phosphate, pH 7.0, and contained acrylamide at the following concentrations:
0, 0.0196M, 0.0385M, 0.0566M, 0.0741M, 0.0909M, 0.1071M, 0.01228M, or 0.1379M. The equation of Mclure and Edelman (Biochem 6: 559-566) vvas used to correct for self absorption of light by acrylamide. Fo/F was plotted against the molar acrylamide concentration, where Fo = fluorescence intensity without acrylamide, and F = fluorescence intensity with acrylamide. The slope of each line (known as the Stern-Volmer constant) was determined. The mean of 2 experiments is presented below. Values in parentheses are standard deviations.
Protein 6M guanidine-HC1 Slope BHL-1 - 3.5 (0.3) BHL-1 + 16.9 (1.3) BHL-2 - 4.6 (0.4) BHL-2 + 19.0 (0.1) BHL-3 - 2.4 (0.2) BHL-3 + 17.5 (0.04) BHL-3N - 5.8 (0.1) BHL-3N + 16.6 (0.6) WT CI-2 - 1.7 (0.1) (truncated.
WT CI-2 + 15.7(2.1) (truncated) Example 9 - Stabilization ~ Disulfide Bonds, An examination of the WI-CI 2 three dimensional structure has identified three pairs of residues (Glu-23 and Arg-81, Thr-22 and Val-82, and Val-53 and Val-70) with an alpha carbon distance appropriate for disulfide formation. Constructs designed to substitute these residues with cysteines will be prepared.
,,~. r.,.. ~.', i SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT: PIONEER HI-BRED INTERNATIONAL, INC.
(ii) TITLE OF INVENTION: PROTEINS WITH ENHANCED LEVELS
OF ESSENTIAL AMINO ACIDS
(iii) NUMBER OF SEQUENCES: 26 (iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: SMART & BIGGAR
(B) STREET: P.O. BOX 2999, STATION D
lO (C) CITY: OTTAWA
(D) STATE: ONT
(E) COUNTRY: CANADA
(F) ZIP: K1P 5Y6 (v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk (B) COMPUTER: IBM PC compatible (C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: ASCII (text) (vi) CURRENT APPLICATION DATA:
2 O (A) APPLICATION NUMBER: CA 2,270,289 (B) FILING DATE: 31-OCT-1997 (C) CLASSIFICATION:
(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: US 08/740,682 (B) FILING DATE: O1-NOV-1996 (viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: SMART & BIGGAR
(B) REGISTRATION NUMBER:
(C) REFERENCE/DOCKET NUMBER: 75529-49 , CA 02270289 1999-08-OS
(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (613)-232-2486 (B) TELEFAX: (613)-232-8440 (2) INFORMATION FOR SEQ ID N0:1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 195 base pairs 66a Applicant Ref. " , , >. . , , No.: ~, > , >
05718-PCT.app ". .
' , ~ ,. .,. >".
(B) TYPE: nucleic >
acid (C) STRANDEDNESS:
single (D) TOPOLOGY: linear S (ii) MOLECULE
TYPE:
cDNA
(ix) FEATURE:
(A) NAME/KEY: Coding quence:
Se (B) LOCATION: 1...195 IO (D) OTHER INFORMATION:
(xi) SEQ7:DNO:1:
SEQUENCE
DESCRIPTION:
AAG CTG ACA GAG TGG CCG TTGG7.>GGGG AAATCGGTG GAGAAA 48 AAG GAG
1S Lys Leu Thr Glu Trp Pro LeuVa1Gly LysSerVal GluLys Lys Glu 1 5 1C> 15 GCC AAG GTG ATC CTG AAG AAGCC:AGAG GCGCAAATC ATAGTT 96 AAG GAC
Ala Lys Val Ile Leu Lys LysProGlu AlaGlnIle IleVal Lys Asp CTG CCG GGT ACA AAG GTG AAGG~,ATAT AAGATCGAC CGCGTC 144 GTT ACG
Leu Pro Gly Thr Lys Val LysG7.uTyr LysIleAsp ArgVal Val Thr AAG CTC GTG GAT AAA AAG AACA7.'CGCG CAGGTCCCC AGGGTC 192 TTT GAC
Lys Leu Val Asp Lys Lys AsnI7.eAla GlnValPro ArgVal Phe Asp Gly (2) INFORMATION FOR SEQ ID N0:2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 65 amino acids 40 (B) TYPE: amino acid ---~: STRANDEDNESS : single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein 4S (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: :CDN0:2:
SEQ
Lys Leu Lys Thr Glu Trp Pro ValGlyLys Ser Glu Glu Leu Val Lys SO i s l 15 Ala Lys Lys Val Ile Leu Lys ProGluAla Gln Ile Asp Lys Ile Val Leu Pro Val Gly Thr Lys Val G:LuTyrLys Ile Arg Thr Lys Asp Val SS Lys Leu Phe Val Asp Lys Lys I:LeAlaGln Val Arg Asp Asn Pro Val Gly AMENDED ShiEET
Applicant Ref. No.: 05718-PC?.app ..
..
. . . . ., .~~. . . .
. . ~ . .., . . . ." , ~ ~ . . . . . .
,. ., ... .". .. ,.
(2) INFORMATION
FOR
SEQ
ID
N0:3:
S
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 195 base rs pai (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
IS (A) NAME/KEY: Coding quence Se (B) LOCATION: 1...195 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION:SEQID N0:3:
GAG
Lys Leu Lys Thr Glu Trp Pro LeuValGlyLys SerValG1u Lys Glu 2S GCC AAG AAG GTG ATC CTG AAG AAGCC'AGAGGCG CAAATCATA GTT 96 GAC
Ala Lys Lys Val Ile Leu Lys LysProGluAla GlnIleIle Val Asp CTA CCG GTT GGT ACA AAG GTG AAGGC'CTATAAG ATCGACAAG GTC 144 GCG
Leu Pro Val Gly Thr Lys Val LysAlaTyrLys IleAspLys Val Ala GAC
Lys Leu Phe Val Asp Lys Lys AsnIleAlaGln ValProArg Val Asp ~S 50 55 60 Gly (2) INFORMATION FOR SEQ ID NC1:4:
4S (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 65 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ 7:D N0:4:
SS
Lys Leu Lys Thr Glu Trp Pro Glu Leu Val Gly Lys Ser Val Glu Lys Ala Lys Lys Val Ile Leu Lys Asp Lys Pro Glu Ala Gln Ile Ile Val AMENDED SHEET
Applicant Ref. No.: 05718-PCT.app w ' .. , ~. . . , . . .~ , , .
.., . . . . ", . . ~ . ~ ~ .
.. .. ... " .. .. ..
Leu Pro Val Gly Thr Lys Val Ala Lys Ala Tyr Lys Ile Asp Lys Val Lys Leu Phe Val Asp Lys Lys Asp Asn Il.e Ala Gln Val Pro Arg Val Gly (2) INFORMATION FOR SEQ ID N0:5:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 195 base rs pai IS (B) TYPE:
nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
2O (ix) FEATURE:
(A) NAME/KEY: Coding quence.
Se (B) LOCATION: 1...195 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION:SEQ N0:5:
:CD
AAG CTG AAG ACA GAG TGG CCG TTGG'CGGGG AAATCGGTG GAGAAA 48 GAG
Lys Leu Lys Thr Glu Trp Pro LeuValGly LysSerVal GluLys Glu GAC
Ala Lys Lys Val Ile Leu Lys LysP:roGlu AlaGlnIle IleVal Asp CTA CCG GTT GGT ACA AAG GTG AAGC..~TTAT AAGATCGAC AAGGTC 144 GGT
Leu Pro Val Gly Thr Lys Val LysHisTyr LysIleAsp LysVal Gly GAC
Lys Leu Phe Val Asp Lys Lys AsnIleAla GlnValPro ArgVal Asp 4S G1y SO (2) INFORMATION FOR SEQ ID N0:6:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 65 amino acids (B) TYPE: amino acid SS (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein AMENDEt~ C'irrT
Applicant Ref. No.: 0571 R-PCT.app (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID N0:6:
~ ..
.. , ..
. , . . . . .. . . . . .
~ . . . ... . . . . ~.~ ..
. . . . .
.. ., .,. ..., .. ., $ Lys Leu Lys Thr Glu Trp Pro Glu Leu V,~1 Gly Lys Ser Val Glu Lys Ala Lys Lys Val Ile Leu Lys Asp Lys P:ro Glu Ala Gln Ile Ile Val Leu Pro Val Gly Thr Lys Val Gly Lys His Tyr Lys Ile Asp Lys Val Lys Leu Phe Val Asp Lys Lys Asp Asn ILe Ala Gln Val Pro Arg Val Gly 1$
°,C~D H~Ei ,,,, r,<~
Applicant Ref. No.: 05718-PCT.app ~ . .. , ..
,. .. . . .~ . .. . . . . . .
. . . . . .,. . . . . .., ..
~ ~ . , .. .. . . ..,. .. ..
(2) INFORMATION FOR SEQ ID N0:7:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 249 base pairs S (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
lO (ix) FEATURE:
(A) NAME/KEY: Coding Sequence:
(B) LOCATION: 1...249 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ 7:D N0:7:
AAG TCG GTG GAG AAG AAA CCG AAG GGT G7.'G AAG ACA GGT GCG GGT GAC 48 Lys Ser Val Glu Lys Lys Pro Lys Gly Val Lys Thr Gly Ala Gly Asp Lys His Lys Leu Lys Thr Glu Trp Pro G:_u Leu Val Gly Lys Ser Val GAG AAA GCC AAG AAG GTG ATC CTG AAG G~~.C AAG CCA GAG GCG CAA ATC 144 Glu Lys Ala Lys Lys Val Ile Leu Lys Asp Lys Pro Glu Ala Gln Ile Ile Val Leu Pro Val Gly Thr Lys Val G:Ly Lys His Tyr Lys Ile Asp AAG GTC AAG CTT TTT GTG GAT AAA AAG G~3C AAC ATC GCG CAG GTC CCC 240 3$ Lys Val Lys Leu Phe Val Asp Lys Lys Asp Asn Ile Ala Gln Val Pro Arg Val Gly (2) INFORMATION FOR SEQ ID Nc~:B:
4S (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 83 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear SO
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID N0:8:
SS
Lys Ser Val Glu Lys Lys Pro Lys Gly V,~1 Lys Thr Gly Ala Gly Asp Lys His Lys Leu Lys Thr Glu Trp Pro Glu Leu Val Gly Lys Ser Val ,4,';,-.n;r1'~; ~;:~~y_~:
Applicant Ref. No.: 0571 R-PCT.app ~. ., . .~ ,. "
. . . . .. . . , , , , . . . . . ." . . . . ,., , . . " , . . .. .. .~. .,., Glu Lys Ala Lys Lys Val Ile Leu Lys A;sp Lys Pro Glu Ala Gln Ile Ile Val Leu Pro Val Gly Thr Lys Val G:Ly Lys His Tyr Lys Ile Asp Lys Val Lys Leu Phe Val Asp Lys Lys As~~ Asn Ile Ala Gln Val Pro Arg Val Gly (2) INFORMATION FOR SEQ ID N0:9:
IS (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 249 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence 2S (B) LOCATION: 1...249 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9:
3O AAG TCG GTG GAG AAG AAA CCG AAG GGT G'TG AAG ACA GGT GCG GGT GAC 48 Lys Ser Val Glu Lys Lys Pro Lys Gly Val Lys Thr Gly Ala Gly Asp AAG CAT AAG CTG AAG ACA GAG TGG CCG G.AG TTG GTG GGG AAA TCG GTG 96 3S Lys His Lys Leu Lys Thr Glu Trp Pro Glu Leu Val Gly Lys Ser Val GAG AAA GCC AAG AAG GTG ATC CTG AAG G.AC AAG CCA GAG GCG CAA ATC 144 Glu Lys Ala Lys Lys Val Ile Leu Lys Asp Lys Pro Glu Ala Gln Ile Ile Val Leu Pro Val Gly Thr Lys Val Thr Lys Glu Tyr Lys Ile Asp Arg Val Lys Leu Phe Val Asp Lys Lys Asp Asn Ile Ala Gln Val Pro Arg Val Gly SS (2) INFORMATION FOR SEQ ID NO:10:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 83 amino acids ;,i;: ~ ~~.: ~'-%
Applicant Rcf. No.: 05718-PCT.app (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:
;; ~.;~ . ~:,=l-Appi:cant Ref. No.: 05718-PCT.app ,. .. . .. ,.
.. , . . . . .. . . . , .
. . . , , .,. . . . . ."
. . . , " .. ... .... ,. ., Lys Ser Val Glu Lys Lys Pro Lys Gly Val Lys Thr Gly Ala Gly Asp Lys HisLys LeuLys ThrGluTrpPro G:LuLeuVa1 GlyLysSer Val S Glu LysAla LysLys ValIleLeuLys AapLysPro GluAlaGln Ile Ile ValLeu ProVal GlyThrLysVal TlzrLysGlu TyrLysIle Asp Arg ValLys Phe LysLys Asp Ile Ala Val Pro Leu Val Asn Gln Asp Arg ValGly (2) INFORMATION FOR SEQ ID NO:11:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 249 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
ZS (ix) FEATURE:
(A) NAME/KEY: Coding Sequenc~_ (B) LOCATION: 1...249 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:
AAG TCG GTG GAG AAG AAA CCG AAG GGT G'TG AAG ACA GGT GCG GGT GAC 48 Lys Ser Val Glu Lys Lys Pro Lys Gly Val Lys Thr Gly Ala Gly Asp AAG CAT AAG CTG AAG ACA GAG TGG CCG G.AG TTG GTG GGG AAA TCG GTG 96 Lys His Lys Leu Lys Thr Glu Trp Pro Glu Leu Val Gly Lys Ser Val GAG P.AA~C AAG AAG GTG ATC CTG AAG GAC AAG CCA GAG GCG CAA ATC 144 Glu Lys Ala Lys Lys Val Ile Leu Lys Asp Lys Pro Glu Ala Gln Ile Ile Val Leu Pro Val Gly Thr Lys Val Ala Lys Ala Tyr Lys Ile Asp SO Lys Val Lys Leu Phe Val Asp Lys Lys Asp Asn Ile Ala Gln Val Pro SS
Arg Val Gly (2) INFORMATION FOR SEQ TD N0:12:
AppIScant Ref. No.: 05718-PCT.app .. . ., . . . . ,. . . , , . , . . . . , ... . . . ., , ~ , . , ~ . " " ." ,... " ., (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 83 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID N0:12:
Lys Ser Val Glu Lys Lys Pro Lys Gly V;al Lys Thr Gly Ala Gly Asp Lys His Lys Leu Lys Thr Glu Trp Pro G.Lu Leu Val Gly Lys Ser Val Glu Lys Ala Lys Lys Val Ile Leu Lys A;sp Lys Pro Glu Ala Gln Ile Ile Val Leu Pro Val Gly Thr Lys Val A:La Lys Ala Tyr Lys Ile Asp Lys Val Lys Leu Phe Val Asp Lys Lys A;sp Asn Ile Ala Gln Val Pro Arg Val Gly (2) INFORMATION FOR SEQ ID N0:13:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 249 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear ~S (ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence (B) LOCATION: 1...249 4O (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ :ID N0:13:
AGT TCA GTG GAG AAG AAG CCG GAG GGA G'TG AAC ACC GGT GCT GGT GAC 48 4$ Ser Ser Val Glu Lys Lys Pro Glu Gly Val Asn Thr Gly Ala Gly Asp CGT CAC AAC CTG AAG ACA GAG TGG CCA G:4G TTG GTG GGG AAA TCG GTG 96 Arg His Asn Leu Lys Thr Glu Trp Pro G:Lu Leu Val Gly Lys Ser Val Glu Glu Ala Lys Lys Val Ile Leu Gln A;sp Lys Pro Glu Ala Gln Ile Ile Val Leu Pro Val Gly Thr Ile Val Tlar Met Glu Tyr Arg Ile Asp Appijcant Rcf. No.: 05718-PCT.app . ~~ .. .
..
. . . ~. . . ~ ~ , .
. .. . . . . "
. , ~. .~ . , .,» ,.
Arg Val Arg Leu Phe Val Asp Lys Leu Asp Asn Ile Ala Gln Val Pro S
Arg Val Gly (2) INFORMATION FOR SEQ N0:14:
ID
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 83 amino acids IS (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein ZO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: N0:14:
SEQ :LD
Ser Ser Val Glu Lys Lys Pro Va~lAsn ThrGly Gly Glu Gly Ala Asp Arg His Asn Leu Lys Thr Glu G:LuLeu ValGly Ser Trp Pro Lys Val Glu Glu Ala Lys Lys Val Ile AspLys ProGlu Gln Leu Gln Ala Ile 30 Ile Val Leu Pro Val Gly Thr ThrMet GluTyr Ile Ile Val Arg Asp Arg Jal Arg Leu Phe Val Asp AspAsn IleAla Val Lys Leu Gln Pro Arg Val Gly (2) INFORMATION FOR SEQ ID N0:15:
4O (i) SEQUENCE CHARACTERISTICS:
-~ LENGTH: 459 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: CDNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence:
SO (B) LOCATION: 1....288 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ).D N0:15:
SS GCA GTG CAA CAA GCA AGA TTT ACC TGC CC:A TCG ATC ATA TCG TCA ACT 48 Ala Val Gln Gln Ala Arg Phe Thr Cys Pro Ser Ile Ile Ser Ser Thr Applicant Ref. No.: 0571 R-PCT.app v ~ .r ,. , m ~ , ~ ~ ~ ~ ~ , . , . . . , ~ , , . . . , , ,., . . , , ' ~ ~ . , , ~ r m , , , : , , , . , , ~
Gly Pro Ala Val Arg Asp Thr Met Ser Ser Thr Glu Cys Gly Gly Gly S GGC GGC GGC GCC AAG ACG TCG TGG CCT Gi~G GTG GTC GGG CTG AGC GTG 144 Gly Gly Gly Ala Lys Thr Ser Trp Pro G:Lu Val Val Gly Leu Ser Val GAG GAC GCC AAG AAG GTG ATG GTC AAG Gl~C AAG CCG GAC GCC GAC ATC 192 Glu Asp Ala Lys Lys' Val Met Val Lys Asp Lys Pro Asp Ala Asp Ile Val Val Leu Pro Val Gly Ser Val Val Thr Ala Asp Tyr Arg Pro Asn CGT GTC CGC ATC TTC GTC GAC ATC GTC G(:C CAG ACG CCC CAC ATC GGC T 289 Arg Val Arg Ile Phe Val Asp Ile Val A7_a Gln Thr Pro His Ile Gly GATAATATAT AAGCTAGCCG CTATTTCCTT TCCT7.'GCCCC AGAACTTGAA ATAAATATAT 349 ATACGATGAA ATAACGCGGG CATGCCGAAT ANATCdGANTG TGNNTGAATT CTCACTAATT 409 AAGTAATGNC ATAAATAAAC GTATTCAAAA AAAA7~.AAAAA P~~AAAAAA.AA 4 5 9 (2) INFORMATION FOR SEQ ID N0:16:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 96 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear ~S (ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ 7:D N0:16:
Ala Val Gln Gln Ala Arg Phe Thr Cys Pro Ser Ile Ile Ser Ser Thr Gly Pro Ala Val Arg Asp Thr Met Ser Se:r Thr Glu Cys Gly Gly Gly Gly Gly Gly Ala Lys Thr Ser Trp Pro Gl.u Val Val Gly Leu Ser Val Glu Asp Ala Lys Lys Val Met Val Lys A~,p Lys Pro Asp Ala Asp Ile Val Val Leu Pro Val Gly Ser Val Val Thr Ala Asp Tyr Arg Pro Asn SO Arg Val Arg Ile Phe Val Asp Ile Val Ala Gln Thr Pro His Ile Gly SS !2) INFORMATION FOR SEQ ID N0:17:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 428 base pairs Applicant Ref. No.: 05718-PCT.app .. ., , .. , ,. . . . . . ., . . . . . , ", . . . , .. .. .n .... .. ..
(B) TYPE: nucleic acid (C) STRANDEDNESS: single (D} TOPOLOGY: linear S (ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence (B) LOCATION: 1...303 IO (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ :CD N0:17:
CGA CCC ACG CGT CCG CCC ACG CGT CCG G(:A AGA TTT ACC TGC CCA TCG 48 IS Arg Pro Thr Arg Pro Pro Thr Arg Pro Al.a Arg Phe Thr Cys Pro Ser Ile Ile Ser Ser Thr Gly Pro Ala Val Arg Asp Thr Met Ser Ser Thr GAG TGC GGC GGC GGC GGC GGC GGC GCC AF~G ACG TCG TGG CCT GAG GTG 144 Glu Cys Gly Gly Gly Gly Gly Gly Ala L~~s Thr Ser Trp Pro Glu Val GTC GGG CTG AGC GTG GAG GAC GCC AAG AF~G GTG ATC CTC AAG GAC AAG 192 Val Gly Leu Ser Val Glu Asp Ala Lys Lys Val Ile Leu Lys Asp Lys 3O CCG GAC GCC GAC ATC GTG GTG CTG CCC GT'C GGC TCC GTG GTG ACC GCG 240 Pro Asp Ala Asp Ile Val Val Leu Pro Va.l Gly Ser Val Val Thr Ala GAT TAT CGC CCT AAC CGT GTC CGC ATC TT'C GTC GAC ATC GTC GCC CAG 288 ~S Asp Tyr Arg Pro Asn Arg Val Arg Ile Phe Val Asp Ile Val Ala Gln Thr Pro His Ile Gly TGAAAAA.AAA F,~~P.AAP.AP.AA AAAA 4 2 8 (2) INFORMATION FOR SEQ ID N0:18:
(i) SEQUENCE CHARACTERISTICS:
SO (A) LENGTH: 101 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear SS (ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ I;,~ N0:18:
Applicant Ref. No.: 05718-PCT.app r. .. . . ,.
.. . . . . , .: . . . . .
, . . . . .,. .
.
.. .. . . . .. .. ., Arg Pro Thr Arg Pro Pro Thr Arg Pro A:La Arg Phe Thr Cys Pro Ser Ile Ile SerSerThr Gly AlaVal A:rgAspThr MetSer SerThr Pro Glu Cys GlyGlyGly Gly GlyAla L!~sThrSer TrpPro GluVal Gly Val Gly LeuSerVal Glu AlaLys Ll~sValIle LeuLys AspLys Asp Pro Asp AlaAspIle Val LeuPro ValGlySer ValVal ThrAla Val Asp Tyr ArgProAsn Arg ArgIle PheValAsp IleVal AlaGln Val Thr Pro HisIleGly loo (2) INFORMATION FOR SEQ ID N0:19:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 441 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence' (B) LOCATION: 1...255 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ :CD N0:19:
Leu Ile Ile Ala Leu Ser Val Xaa His Arg Gln Pro Ser Thr Met Ser Ser Thr-'G1~ Gly Gly Asp Asp Gly ATa L;rs Lys Ser Trp Pro Glu Val 45 Val Gly Leu Ser Leu Glu Glu Ala Lys Arg Val Ile Leu Cys Asp Lys CCC GAC GCC GAC ATC GTC GTG CTG CCC G'.CC GGC ACG CCG GTG ACC ATG 192 Pro Asp Ala Asp Ile Val Val Leu Pro Val Gly Thr Pro Val Thr Met $5 GAT TTC CGC CCC AAC CGC GTC CGC ATC T'.CC GTC GAC ACC GTC GCG GAG 240 Asp Phe Arg Pro Asn Arg Val Arg Ile Phe Val Asp Thr Val Ala Glu GCA MCC CAC ATC GGC TGAGGTTAAA TCTACA)~AAT GAATGAYTCG GACATGCCAT G 296 Ala Xaa His Ile Gly Applicant Rcf. No.: 05718-PCT.app .. .. .
,. . . . . .~ . . . , , .
. . . . .., . . , . . ..
. ~ . , . .
.. .. ... .,.. .. ., S
(2) INFORMATION FOR SEQ ID NC~:20:
lO (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 85 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ 7:D N0:20:
Leu IleIle AlaLeuSer ValXaaHis ArgGlnPro SerThrMet Ser CI
Ser ThrGly GlyGlyAsp AspGlyAla L~~sLysSer TrpProGlu Val 2S Val GlyLeu SerLeuGlu GluAlaLys ArgValIle LeuCysAsp Lys Pro AspAla AspIleVal ValLeuPro ValGlyThr ProValThr Met Asp PheArg ProAsnArg ValArgIle PheValAsp ThrValAla Glu Ala XaaHis IleGly (2) INFORMATION FOR SEQ ID N0:21:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 382 base pairs 40 (B) TYPE: nucleic acid ~ STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
4S (ix) FEATURE:
(A) NAME/KEY: Coding Sequencf:
(B) LOCATION: 1...213 (D) OTHER INFORMATION:
SO
(xi) SEQUENCE DESCRIPTION: SEQ :CD N0:21:
Val Arg Arg Arg Arg Thr Ala Thr Gly G:Ly Lys Thr Ser Trp Pro Glu GTG GTC GGG CTG AGC GTC GAG GAA GCC Ai4G AAG GTG ATT CTG GCG GAC 96 Val Val Gly Leu Ser Val Glu Glu Ala Lys Lys Val Ile Leu Ala Asp Applicant Ref. No.: 0571 R-PCT.app .. ., . .. ..
.. . . . . . .. . . . . .
. . . .. , . ... .
. . .
.. .. ... .... .. ..
Lys Pro Asn Ala Asp Ile Val Val Leu Pro Thr Thr Thr Gln Ala Val Thr Ser Asp Phe Gly Phe Asp Arg Val Arg Val Phe Val Gly Thr Val Ala Gln Thr Pro His Val Gly P~~;4AAAAAAA AAAAA 3 8 2 (2) INFORMATION FOR SEQ ID N0:22:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 71 amino acids 2S (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID N0:22:
Val Arg Arg Arg Arg Thr Ala Thr Gly Gly Lys Thr Ser Trp Pro Glu ~S 1 5 10 15 Val Val Gly Leu Ser Val Glu Glu Ala Lys Lys Val Ile Leu Ala Asp Lys Pro Asn Ala Asp Ile Val Val Leu Pro Thr Thr Thr Gln Ala Val Thr Ser Asp Phe Gly Phe Asp Arg Val Arg Val Phe Val Gly Thr Val Ala Gln Thr Pro His Val Gly (2) INFORMATION FOR SEQ ID N0:23:
(i) SEQUENCE CHARACTERISTICS:
SO (A) LENGTH: 448 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear SS (ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence gl Applicant Ref. No.: 05718-PCT.app ., . .. , .. .. .~ . . , . . . . .
.., . . .
, , .. .. .., . .. ..
(B) LOCATION: 1...240 (D) OTHER INFORMATION:
S
(xi) SEQUENCE DESCRIPTION: SEQ 7:D N0:23:
CGA TTT AGC TAT AGC AGG TCT CGA TCG GC:G GCC ATG AGC GGT AGC CGC 48 Arg Phe Ser Tyr Ser Arg Ser Arg Ser Al.a Ala Met Ser Gly Ser Arg Ser Lys Lys Ser Trp Pro Glu Val Glu Gl.y Leu Pro Ser Glu Val Ala 1$ Lys Gln Lys Ile Leu Ala Asp Arg Pro A:>p Val Gln Val Val Val Leu CCC GAC GGC TCC TTC GTC ACC ACT GAT TTC: AAC GAC AAG CGC GTC CGG 192 Pro Asp Gly Ser Phe Val Thr Thr Asp Phe Asn Asp Lys Arg Val Arg GTC TTC GTC GAC AAC GCC GAC AAC GTC GC:C AAA GTC CCC AAG ATC GGC T 241 Val Phe Val Asp Asn Ala Asp Asn Val Al.a Lys Val Pro Lys Ile Gly AGCTAGCTAG CTAGGCCCAA TCGTTCTAAT CAGC7.'AGTTT CTTTCTTTCA TAAATAAAAG 301 CTTAATGGAT GCCATGGCGC CCGCGCGCGC CTYC.t~TCATG AAAAGCTACA TTTGAAACGA 421 (2) INFORMATION FOR SEQ ID N0:24:
3S (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 80 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (=~ MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ :CD N0:24:
Arg Phe SerTyrSer ArgSerArg SerA:LaAla MetSerGly SerArg Ser Lys LysSerTrp ProGluVal GluG:lyLeu ProSerGlu ValAla Lys Gln LysIleLeu AlaAspArg ProAspVal GlnValVal ValLeu Pro Asp GlySerPhe ValThrThr AspPheAsn AspLysArg ValArg Val Phe ValAspAsn AlaAspAsn ValA:laLys ValProLys IleGly 6s 70 75 so -r..h ~.:
Applicant Ref. No.: 0571 R-PCT.app .. , , ,.
.., . . . . , " , , . , , ". . . . . ", . , , .. " ." .,., .. ,.
(2) INFORMATION FOR SEQ ID N0:25:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs S (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ I:D N0:25:
(2) INFORMATION FOR SEQ ID NC>:26:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear ZS (ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ I:D N0:26:
75529-49(S) All publications and patent applications mentioned in this specification are indicati~r~ of the level of skill of those skilled in the art to which this invention pertains.
Variations on the above embodiments are within the ability of one of ordzz~ar-y skill in the art, and such variations do not depart from the scope of the present ira-v~;nti~~~ as described in the following claims.
75529-49(S) Preferably, the substantiated amino acid is an essential amino acid. More preferably, tryptophan threonine, methionine and lysine arE: the substituted essential amino acid.
Even more preferably, the additional essential amino acid is lysine.
A preferred embodiment of the present invention is the introduction of an Expression cassette into regenerable plant cells. Also preferred is the introduction of an expression cassette comprising a DNA segment encoding an endogenous or modified polypeptide sequence.
The present invention also encompasses variations in the sequences described above, wherein such variations are due to site-directed mutagenesis, or other mechanisms known in the art, to increase or decrease levels of selected amino acids of interest. For example, :>ite-directed mutagenesis to increase levels of essential amino acids is a preferred embodiment.
The present invention also provides a fertile transgenic plant. The fertile transgenic plant contains an isolated DNA segment comprising a promoter and encoding a protein comprising a protease inhibitor, modified by increasing the number of essential amino acids, under the control of the promoter. The protease inhibitor is expressed as so that the level of essential amino acids in the seeds of the transgenic plant is increased above the level in the seeds of a plant which only differ from t:he seeds of the transgenic plant in that the DNA segment or t:he encoded seed protein is under the control of a different promoter. The DNA segment is transmitted through a cc>mplete normal sexual cycle of the transgenic plant to the next generation. The present invention provides nucleotide sequences encoding proteins containing higher levels of essential amino acids by the substitution of 75529-49(S) one or more of the amino acid residues in the protease inhibitor. Substitutions at one or more of, but not limited to, positions 1, 8, 11, 17, 19, 34, 41, 56, 59, 62, 67 and 73 of the wild type protein are substituted with essential amino acids. The present invention also involves the expression of the present chymotrypsin inhibitor derivatives or any derived protease inhibitor in plants to provide higher percentages of essential amino acids in plants than wild type plants.
In a preferred embodiment of the present invention, the present derivatives also exhibit reduced protease inhibitor activity. This is achieved by substituting the amino acid residues from about amino acid residue 53 to about amino acid residue 70 with residues other than the wild type residues.
In one aspect, there is described an isolated polypeptide comprising a modified variant of SEQ ID N0: 14, or a modified variant of the sequence from position 19 to position 83 of SEQ ID N0: 14, wherein the modified variant:
(a) contains a higher percentage of essential amino acids than either SEQ ID N0: 14 or the sequence from position 19 to position 83 of SEQ ID NO: 14; (b) has greater than 600 amino acid similarity to SEQ ID N0: 14 or the sequence from position 19 to position 83 of SEQ ID N0: 14, wherein the percent sequence similarity is based on the entire sequence and is determined by BLAST 2.0 using default parameters; and (c) contains an essential amino acid at a position corresponding to a position of SEQ ID N0: 14 selected from the group consisting of 1, 8, 17, 19, 34, 41, and 67, or contains a lysine at a position corresponding to a position of SEQ ID N0: 14 selected from the group consisting of 56, 59, 62 and 73.
7a 75529-49(S) In another aspect, there is described an isolated polypeptide comprising a modified variant of SEQ ID N0: 14, or a modified variant of the sequence from position 19 to position 83 of SEQ ID N0: 14, wherein the modified variant:
(a) contains a higher percentage of essential amino acids than either SEQ ID N0: 14 or the sequence from position 19 to position 83 of SEQ ID N0: 14; (b) has greater than 600 amino acid similarity to SEQ ID NO: 14 or the sequence from position 19 to position 83 of SEQ ID N0: 14, wherein the percent sequence similarity is based on the entire sequence and is determined by BLAST 2.0 using default parameters; and (c) is modified at at least 11 positions of SEQ ID N0: 14 to contain essential amino acids at said at least 11 positions.
In another aspect, there is described an isolated polypeptide comprising a modified variant of SEQ ID N0: 14, or a modified variant of the sequence from position 19 to position 83 of SEQ ID N0: 14, wherein the modified variant:
(a) contains a higher percentage of essential amino acids than either SEQ ID N0: 14 or the sequence from position 19 to position 83 of SEQ ID NO: 14; (b) has greater than 600 amino acid similarity to SEQ ID N0: 14 or the sequence from position 19 to position 83 of SEQ ID N0: 14, wherein the percent sequence similarity is based on the entire sequence and is determined by BLAST 2.0 using default parameters; and (c) contains a pair of cysteines at at least one pair of positions corresponding to SEQ ID N0: 14 positions Glu-23 and Arg-81, Thr-22 and Val-82, or Val-53 and Val-70.
In another aspect, there is described an isolated polypeptide comprising a modified variant of SEQ ID NO: 14, or a modified variant of the sequence from position 19 to position 83 of SEQ ID NO: 14, wherein the modified variant:
(a) contains at least 55~ essential amino acids; (b) has greater than 60o amino acid similarity to SEQ ID N0: 14 or 7b 75529-49(S) the sequence from position 19 to position 83 of SEQ ID
N0: 14, wherein the percent sequence similarity is based on the entire sequence and is determined by BLAST 2.0 using default parameters; and (c) contains a pair of cysteines at at least one pair of positions corresponding to SEQ ID
NO: 14 positions Glu-23 and Arg-81, Thr-22 and Val-82, or Val-53 and Val-70.
In another aspect, there is described an isolated nucleic acid encoding the polypeptide of the invention.
In another aspect, there is described a recombinant expression cassette comprising the nucleic acid of the invention operably linked to a promoter.
In another aspect, there is described a transformed plant cell comprising the recombinant expression cassette of the invention.
In another aspect, there is described an animal feed composition comprising plant tissue, wherein the plant tissue comprises the polypeptide of the invention.
In another aspect, there is described a method for increasing the nutritional value of a plant comprising:
(a) introducing into cells of the plant a recombinant expression cassette of the invention, wherein the promoter provides for protein expression in plants, to yield transformed plant cells, and (b) regenerating a transformed plant from the transformed plant cells.
In another aspect, there is described use of at least one recombinant expression cassette of the invention, wherein the promoter provides for protein expression in plants, in the preparation of a transformed plant.
7c 75529-49(S) In another aspect, there is described use of at least one recombinant expression cassette of the invention, wherein the promoter provides for protein expression in plants, for the preparation of a seed of a transformed plant.
In another aspect, there is described use of the plant cell of the invention in the preparation of an animal feed composition.
7d AFplicant Ref. No.: 0571 R-PCT.app Methods for expressing the modified protease inhibitors and for using plants are also provided to enhance the nutritional value ol~animal feed.
It is therefore an object of the present invention to provide methods for increasing the levels of the essential amino acids in the seeds of plants used for animal feed.
It is a further object of the present invention to provide seeds for food and/or feed with higher levels of the essential amino acid, lysine, than wild type species of the same seeds.
It is a further object of the present invention to provide seeds for food and/or feed such that the level of the essential amino acids is increased such that the need for feed supplementation is greatly reduced or obviated.
It is one object of the present invention ro provide nucleic acids encoding enzymes involved in protease inhibition and antigenic polypeptide fragments thereof.
It is also an object of the present invention to provide protease inhibitor polypeptides and antigenic fragments thereof. It is a further object of the present invention to provide transgenic 1 S plants comprising protease inhibitor nucleic acids. Additionally, it is an object of the present invention to provide methods for modulating, in a transgenic plant, the expression of protease inhibitor polynucleotides of the present invention.
Therefore, in one aspect, the present invention relates to an isolated nucleic acid comprising a member selected from the group consisting of (a)a polynucleotide having at least 70% identity to a polynucleotide encoding a polypeptide selected from the group consisting of SEQ ID NOS: 2,4,6,8,10 and l2,116,18,20,22,24;and (b) a polynucleotide which is complementary to the polynucleotide of (a); and (c) a polynucleotide comprising at least 3~-~tiguous nucleotides from a polyrmcleotide of (a) or (b). In some embodiments, the polynucleotide has a sequence selected from the group consisting of SEQ ID NOS: 1,3,5,7,9 and 11, 15,17,19,21, or 23 . The isolated nucleic acid can be DNA.
In another aspect, the present invention relates to recombinant expression cassettes, comprising a nucleic acid as describf:d, supra, operably linked to a promoter.
In some embodiments, the nucleic acid is operably linked in antisense orientation to the promoter.
In another aspect, the present invention is directed to a host cell transfected with the recombinant expression cassette as described, supra. In some embodiments, the host ~1,~L'~J"r~': ~'i ~C.' J _ .:, i:_~
Applicant Ref. No.: 05718-PCT.app cell is a maize, rye, barley, wheat, sorghum, oa~a, millet, rice, triticale, sunflower, alfalfa, rapeseed or soybean cell.
In a further aspect, the present invention relates to an isolated protein comprising a polypeptide of at least 10 contiguous amino acids encoded by the isolated nucleic acid referred to, supra. In some embodiments, the polypeptide has a sequence selected from the group consisting of SEQ ID NOS: 2,4,6,8,10 and 12,16,18,20,22,24.
In another aspect, the present invention relates to an isolated nucleic acid comprising a polynucleotide of at least 30 nuclE;otides in length which selectively hybridizes under stringent conditions to a nucleic acid selected from the group consisting of SEQ ID NOS: 1,3,5,7,9 and 11, 15,17,19,21, 23 or a complement thereof. In some embodiments, the isolated nucleic acid is operably linked to a promoter.
In yet another aspect, the present invention relates to an isolated nucleic acid comprising a polynucleotide, the polynucleotide having at least 60% sequence identity to an identical length of a nucleic acid selected from the group consisting of SEQ ID NOS:
1,3,5,7,9 and 1 l, 15,17,19,21, 23 or a complement thereof.
In another aspect, the present invention :relates to an isolated nucleic acid comprising a polynucleotide having a sequence of a nucleic acid amplified from a Zea mays nucleic acid library using the primers selected from the group consisting of: SEQ ID
NOS: 25 and 26 or complements thereof. In some embodiments, the nucleic acid library is a cDNA library.
In another aspect, the present invention :relates to a recombinant expression cassette comprising a nucleic acid amplified from a library as referred to szrpra, wherein the nucleicd is operably linked to a promoter. In some embodiments, the present invention relates to a host cell transfected with l:his recombinant expression cassette In some embodiments, the present invention relates to a protease inhibitor protein produced from this host cell.
In a further aspect, the present invention relates to a heterologous promoter operably linked to a non-isolated protease inhibitor polynucleotide encoding a polypeptide, wherein the polypeptide is encoded by a nucleic acid amplified from a nucleic acid library as referred to, supra.
In yet another aspect, the present invention relates to a transgenic plant comprising a recombinant expression cassette comprising a plant promoter operably linked to any of : ' r,-75529-49(S) the isolated nucleic acids referred to supra. In some embodiments, the transgenic plant is Zea mays. The present invention also provides transgenic seed from the transgenic plant.
In a further aspect, the present invention relates to a method of providing a modified protease inhibitor in a plant, comprising the steps of (a) transforming a plant cell with a recombinant expression casette comprising a protease inhibitor polynucleotide operably linked to a promoter; (b) growing the plant cell under plant growing conditions; and (c) inducing expression of the polynucleotide.
Applicant Ref. No.: 05718-PCT.app F~,ure IistinE
Figure 1 Protease Inhibition Sequence identification DETAILED DESCRIPTION
Barley High Lysine 1(BHL-1) is coded for by the polypeptides of SEQ ID
No. 2 which is encoded for by the nucleic acid of SEQ ID No. 1.
Barley High Lysine 2 (BHL-2) its coded for by the polypeptides of SEQ
ID No. 4 which is encoded for by the nucleic acid of SEQ ID No. 3.
Barley High Lysine 3 (BHL-3) i;s coded for by the polypeptides of SEQ ID
No. 6 which is encoded for by the nucleic acid of SEQ ID No. 5.
Barley High Lysine 3N (BHL-3N) is coded for by the polypeptides of SEQ
ID No. 8 which is encoded for by the nucleic acid of SEQ ID No. 7.
Barley High Lysine 1N (BHL-1N) is coded for by the polypeptides of SEQ
1 ~ ID No. 10 which is encoded for by the nucleic acid of SEQ ID No. 9.
Barley High Lysine 2N (BHL-2N) is coded for by the polypeptides of SEQ
ID No. 12 which is encoded for by the nucleic acid of SEQ ID No. 11.
Wild-type chymotrypsin inhibitor (WI-CI-2) is coded for by the polypeptides of SEQ ID No. 14 which is encoded for by the nucleic acid of SEQ
ID No. 13.
Maize EST PI-1 is coded for by the polypeptides of SEQ ID No.l6 which is encoded for by the nucleic acid of SEQ ID No. 15.
Maize EST PI-2 is coded for by the polypeptides of SEQ ID No.18 which is encoded for by the nucleic acid of SEQ ID No. 17.
Maize EST PI-3 is coded for by the polypeptides of SEQ ID No.20 which is encoded for by the nucleic acid of SEQ ID No. 19.
Maize EST PI-4 is coded for by the polypeptides of SEQ ID No.22 which is encoded for by the nucleic acid of SEQ ID No. 21.
Maize EST PI-Sis coded for by t:he polypeptides of SEQ ID No. 24 which is encoded for by the nucleic acid of SEQ ID No. 23.
The 5' and 3' PCR primer pairs A & B, are identified as SEQ ID Nos. 25 and 26, respectively.
Applicant Ref. No.: 05718-PCT.app Definitions Units, prefixes, and symbols may be denoted in their SI accepted form. Unless otherwise indicated, nucleic acids are written left to right in 5' to 3' orientation; amino acid sequences are written left to right in amino to carboxy orientation, respectively.
Numeric ranges are inclusive of the numbers defining the range. Amino acids may be referred to herein by either their commonly known three letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly accepted single-letter codes. The terms defined below are more fully defined by reference to the specification as a whole.
"Chymotrypsin inhibitor-like" protein is a protein with a sequence identity of 40%
or more to the CI-2 from barley.
"%" refers to molar % unless otherwise specified or implied.
"Essential amino acids" are amino acids that must be obtained from an external source because they are not synthesized by the individual. They are comprised of:
methionine, threonine, lysine, isoleucine, leucine, valine, tryptophan, phenylalanine, and histidine.
By "amplified" is meant the construction of multiple copies of a nucleic acid sequence or multiple copies complementar)~ to the nucleic acid sequence using at least one of the nucleic acid sequences as a template. Amplification systems include the polymerase~ain reaction (PCR) system, ligase chain reaction (LCR) system, nucleic acid sequence based amplification (NASBA, Cangene, Mississauga, Ontario), Q-Beta Replicase systems, transcription-based amplification system (TAS), and strand displacement amplification (SDA). See, e.g., Dicxgnostic Molecular Microbiology:
Principles and Applications, D. H. Persing et al., Ed., American Society for Microbiology, Washington, D.C. (1993).
As used herein, "antisense orientation" includes reference to a duplex polynucleotide sequence which is operably linked to a promoter in an orientation where 12 l'e~~EPJCeB S;-~EE
Applicant Ref. No.: 05718-PCT.app the antisense strand is transcribed. The antisense strand is sufficiently complementary to an endogenous transcription product such that translation of the endogenous transcription product is often inhibited.
As used herein, "chromosomal region" includes reference to a length of chromosome which may be measured by reference to the linear segment of DNA
which it comprises. The chromosomal region can be defined by reference to two unique DNA
sequences, i.e., markers.
The term "conservatively modified variants" applies to both amino acid and nucleic acid sequences. With respect to particular nucleic acid sequences, conservatively modified variants refers to those nucleic acids which encode identical or essentially identical amino acid sequences, or where the nucleic acid does not encode an amino acid sequence, to essentially identical sequences. Because of the degeneracy of the genetic code, a large number of functionally identical nucleic acids encode any given protein. For instance, the codons GCA, GCC, GCG and GCU all encode the amino acid alanine.
Thus, at every position where an alanine is specified by a codon, the codon can be altered to any of the corresponding codons described without altering the encoded polypeptide.
Such nucleic acid variations are "silent variations" and represent one species of conservatively modified variation. Every nucleic acid sequence herein which encodes a polypeptide also describes every possible silent variation of the nucleic acid. One of ordinary skill will recognize that each codon in a nucleic acid (except AUG, which is ordinarily the only codon for methionine, and TGG, which is ordinarily the only codon for tryptophan) can be modified to yield a functionally identical molecule.
Accordingly, each sileiation of a nucleic acid which encodes a polypeptide of the present invention is implicit in each described polypeptide sequence and incorporated herein by reference.
As to amino acid sequences, one of skill will recognize that individual substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein sequence which alters, adds or deletes a single amino acid or a small percentage of amino acids in the encoded sequence is a "conservatimely modified variant" where the alteration results in the substitution of an amino acid with a chemically similar amino acid. Thus, any number of amino acid residues selected from the group of integers consisting of from 1 to 15 can be so altered. Thus, for example, 1, 2, 3, 4, 5, 7, or 10 alterations can be :.-: ,~ ~ ~.y ~;
f. _ ~ . a .:,U:..., ~.W :~ t:.
Applicant Ref. No.: 0571 R-PCT.app made. Conservatively modified variants typically provide similar biological activity as the unmodified polypeptide sequence from which they are derived. For example, substrate specificity, enzyme activity, or ligancUreceptor binding is generally at least 30%, 40%, 50%, 60%, 70%, 80%, or 90% of the native protein for it's native substrate.
Conservative substitution tables providing functionally similar amino acids are well known in the art.
The following six groups each contain s~mino acids that are conservative substitutions for one another:
1 ) Alanine (A), Serine (S), Threonine (T);
2) Aspartic acid (D), Glutamic acid (E);
3) Asparagine (N), Glutamine (Q);
4) Arginine (R), Lysine (K);
5) Isoleucine (I), Leucine (L), Methionine (M), Valine (V); and 6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W).
See also, Creighton ( 1984) Proteins W.H. Freeman and Company.
By "encoding" or "encoded", with respect to a specified nucleic acid, is meant comprising the information for translation into the specified protein. A
nucleic acid encoding a protein may comprise non-translated sequence (e.g., introns) within translated regions of the nucleic acid, or may lack such intervening non-translated sequences (e.g., as in cDNA). The information by which a protein is encoded is specified by the use of codons. Typically, the amino acid sequence is encoded by the nucleic acid using the "universal" genetic code. However, variants of the universal code, such as is present in some plant=aiiimal, and fungal mitochondria, the bacterium Mycoplasma capricolz~m (Proc. Natl. Acad. Sci. (USA), 82: 2306-2309 (1985)), or the ciliate Macronucleus, may be used when the nucleic acid is expressed using these organisms.
When the nucleic acid is prepared or altered synthetically, advantage can be taken of known codon preferences of the intended host where the nucleic acid is to be expressed. For example, although nucleic acid sequences of the present invention may be expressed in both monocotyledonous and dicot;yledonous plant species, sequences can be modified to account for the specific codon pref°rences and GC content preferences of monocotyledons or dicotyledons as these preferences have been shown to differ (Murray et al. Nucl. Acids Res. 17: 477-498 (1989)). Tllus, the maize preferred codon for a 14 ' . _.' '-'; z~
~.. r Applicant Ref. No.: 05718-PCT.app particular amino acid may be derived from known gene sequences from maize.
Maize codon usage for 28 genes from maize plants are listed in Table 4 of Murray et al., szrpra.
As used herein "full-length sequence" includes reference to a protease inhibitor polynucleotide or the encoded protein having t:he entire amino acid sequence of, a native (non-synthetic), endogenous, catalytically active form of a protein involved in protease inhibition. A full-length sequence can be determined by size comparison relative to a control which is a native (non-synthetic) endogenous cellular protease inhibitor nucleic acid or protein. Methods to determine whether a sequence is full-length are well known in the art including such exemplary techniques as northern or western blots.
See, e.g., Plant Moleczrlar Biology: A Laboratory Manzrczl, Clark, Ed., Springer-Verlag, Berlin (1997). Comparison to known full-length homologous sequences can also be used to identify full-length sequences of the present invention. Additionally, consensus sequences typically present at the 5' and 3' unt:ranslated regions of mRNA aid in the identification of a polynucleotide as full-length. For example, the consensus sequence ANNNNAUGG, where the underlined codon represents the N-terminal methionine, aids in determining whether the polynucleotide has a complete 5' end. Consensus sequences at the 3' end, such as polyadenylation sequences, aid in determining whether the polynucleotide has a complete 3' end.
As used herein, "heterologous" in reference to a nucleic acid is a nucleic acid that originates from a foreign species, or, if from thc: same species, is substantially modified from its native form in composition and/or genomic locus. For example, a promoter operably linked to a heterologous structural gene is from a species different from that from whiE structural gene was derived, or, if from the same species, one or both are substantially modified from their original form. A heterologous protein may originate from a foreign species or, if from the same species, is substantially modified from its original form.
By "host cell" is meant a cell which contains a vector and supports the replication and/or expression of the expression vector. Host cells may be prokaryotic cells such as E.
coli, or eukaryotic cells such as yeast, insect, amphibian, or mammalian cells. Preferably, host cells are monocotyledonous or dicotyledenous plant cells. A particularly preferred monocotyledonous host cell is a maize host cell.
Applicant Ref. No.: 0>71R-PCT.app The term "hybridization complex" includes reference to a duplex nucleic acid sequence formed by two single-stranded nucleic acid sequences which selectively hybridize with each other.
The terms "isolated" or "biologically pure" refer to material which is: (1) substantially or essentially free from components which normally accompany or interact with it as found in its naturally occurring environment. The isolated material optionally comprises material not found with the material in its natural environment. (2) If the material is in its natural environment, the material has been synthetically (non-naturally) altered to a composition and/or placed at a loccls in the cell (e.g., genome) not native to a material found in that environment. The alteration to yield the synthetic material can be performed on the material within or removed from its natural state. For example, a naturally occurring nucleic acid becomes an isolated nucleic acid if it is altered, or if it is transcribed from DNA which is altered, by non-natural, synthetic (i.e., "man-made") methods performed within the cell from which it originates. See, e.g., Compounds and Methods for Site Directed Mutagenesis in Eukaryotic Cells, Kmiec, U.S. Patent No.
5,565,350; In Vivo Homologous Sequence Targeting in Eukaryotic Cells; Zarling et al., PCT/LJS93/03868. Likewise, a naturally occurring nucleic acid (e.g., a promoter) become isolated if it is introduced by non-naturally occurring means to a locus of the genome not native to that nucleic acid.
The term "protease inhibitor nucleic acids" means an isolated nucleic acid comprising a polynucleotide (a "protease inhibitor polynucleotide") encoding a polypeptide involved in protease inhibition.
As-t»d herein, "localized within the chromosomal region defined by and including" with respect to particular markers includes reference to a contiguous length of a chromosome delimited by and including the stated markers.
As used herein, "marker" includes referE:nce to a locus on a chromosome that serves to identify a unique position on the chromosome. A "polymorphic marker"
includes reference to a marker which appears in multiple forms (alleles) such that different forms of the marker, when they are preaent in a homologous pair, allow transmission of each of the chromosomes in that pair to be followed. A
genotype may be defined by use of a single or a plurality of markers.
16 ,~MFNDED ~:r'=1 75529-49(S) As used herein, "nucleic acid" includes reference to a deoxyribonucleotide or ribonucleotide polymer in either single- or double-stranded form, and unless otherwise limited, encompasses known analogues of natural nucleotides that hybridize to single-stranded nucleic acids in a manner similar to naturally occurring nucleotides {e.g., peptide nucleic acids).
By "nucleic acid library" is meant a collection of isolated DNA or RNA
molecules which comprise and substantially represent the entire transcribed fraction of a genome of a specified organism. Construction of exemplary nucleic acid libraries, such as genomic and cDNA libraries, is taught in standard molecular biology references such as Berger and Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology, Vol.
152, Academic Press, Inc., San Diego, CA (Berger); Sambrook et al., Molecular Cloning - A
Laboratory Manual, 2nd ed., Vol. 1-3 (1989); and Current Protocols in Molecular Biology, F.M. Ausubel et al., Eds., Current Protocols, a joint venture between Greene Publishing Associates, Inc, and John Wiley & Sons, Inc. (1994 Supplement).
I S As used herein "operably linked" includes reference to a functional linkage between a promoter and a second sequence, wherein the promoter sequence initiates'and mediates transcription of the DNA sequence corresponding to the second sequence.
Generally, operably linked means that the nucleic acid sequences being linked are contiguous and, where necessary to join two protein coding regions, contiguous and in the same reading frame.
As used herein, the term "plant" includes reference to whole plants, plant organs (e.g., leaves, stems, roots, etc.), seeds and progeny of same. The class of plants which can be used in the methods of the invention is generally as broad as the class of higher plants amenable to transformation techniques, including both monocotyledonous and dicotyledonous plants. Particularly preferred is Zea mays.
As used herein, "polynucleotide" includes reference to a deoxyribopolynucleotide, ribopolynucleotide, or analogs thereof, that hybridize to nucleic acids in a manner similar to naturally occurring nucleotides. A polynucleotide can be full-length or a sub-sequence of a native or heterologous structural or regulatory gene. Unless otherwise indicated, the Applicant Ref. No.: 0571 R-PCT.app term includes reference to the specified sequence as well as the complementary sequence thereof. Thus, DNAs or RNAs with backbones modified for stability or for other reasons are "polynucleotides" as that term is intended herein. Moreover, DNAs or RNAs comprising unusual bases, such as inosine, or modified bases, such as tritylated bases, to name just two examples, are polynucleotidesas the term is used herein. It will be appreciated that a great variety of modifications have been made to DNE~ and RNA that serve many useful purposes known to those of skill in the art. The term polynucleotide as it is employed herein embraces such chemically, enzymaticallyor metabolicallymodified forms of polynucleotides, as well as the chemical forms of DNA and RNA characteristic of viruses and cells, including inter alia, simple and complex cells.
The terms "polypeptide", "peptide" and "protein" are used interchangeably herein to refer to a polymer of amino acid residues. Tlhe terms apply to amino acid polymers in which one or more amino acid residue is an artificial chemical analogue of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers. Among the known modifications which may be present in polypeptides of the present are, to name an illustrative few, acetylation, acylation, ADP-ribosylation, amidation, covalent attachment of flavin, covalent attachment of a heme moiety, covalent attachment of a nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative, covalent attachment of phosphotidylinositol,cross-linking, cyclization, disulfide bond formation, demethylation, formation of covalent cross-links, formation of cystine, formation of pyroglutamate, formylation, gamma-carboxylation,glycosylation, GPI anchor formation, hydroxylation, iodination, methylation, myristoylation, oxidation, proteolytic processing,osphorylation,prenylation, racemization, selenoylation, sulfation, transfer-RNA mediated addition of amino acids to proteins such as arginylation, and ubiquitination.
Such modifications are well known to those of skill and have been described in great detail in the scientific literature. Several particularly common modifications, glycosylation, lipid attachment, sulfation, gamma-carboxylationof g lutamic acid residues, hydroxylation and ADP-ribosylation, for instance, are described in most basic texts, such as, for instance Proteins - Structure and Molecular Properties, 2nd ed., T. E. Creighton, W. H.
Freeman and Company, New York ( 1993). Many detailed reviews are available on this subject, such as, for example, those provided by Wold, F., PosttranslationalProtein Modifications:
Perspectives and Prospects, pp. 1-12 in Posttranslational Covalent Modification of Proteins, 18 ..,.'~~,;i;; :~-::~~;
Applicant Ref. No.: 05718-PCT.app :.
.. . . . . . ~. . . , , , , . , . . ,' , . , ,. ,. ' . ":. , ,.
B. C. Johnson, Ed., Academic Press, New York ( 1983); Seifter et al., Meth.
Enz-ymol. 182:
626-646 ( I 990) and Rattan et al., Protein Synthesis: Posttranslational Modifications and Aging, Ann. N.Y. Acad. Sci. 663: 48-62 ( 1992). It will be appreciated, as is well known and as noted above, that polypeptides are not always. entirely linear. For instance, polypeptides may be branched as a result of ubiquitination, arid they may be circular, with or without branching, generally as a result of posttranslation events, including natural processing event and events brought about by human manipulation which do not occur naturally.
Circular, branched and branched circular polypeptides ma.y be synthesized by non-translation natural process and by entirely synthetic methods, as well. Modifications can occur anywhere in a polypeptide, including the peptide backbone, the: amino acid side-chains and the amino or carboxyl termini. In fact, blockage of the amino or carboxyl group in a polypeptide, or both, by a covalent modification, is common in naturally occurring and synthetic polypeptides and such modifications may be present in polypeptides of the present invention, as well.
For instance, the amino terminal residue of polypeptides made in E. coli or other cells, prior to proteolytic processing, almost invariably will be N-formylmethionine.
During post-translational modification of the peptide, a metlzionine residue at the NHZ-terminus may be deleted. Accordingly, this invention contemplates the use of both the methionine-containing and the methionineless amino terminal variants of the protein of the invention.
In general, as used herein, the term polypeptide Encompasses all such modifications, particularly those that are present in polypeptides synthesized by expressing a polynucleotide in a host cell.
As used herein "promoter" includes reference to a region of DNA upstream from the start o~a~scription and involved in recognition and binding of RNA
polymerase and other proteins to initiate transcription. A "plans: promoter" is a promoter capable of initiating transcription in plant cells. Examples. of promoters under developmental control include promoters that preferentially initiate transcription in certain tissues, such as leaves, roots, seeds, fibers, xylem vessels, trach.eids, or sclerenchyma. Such promoters are referred to as "tissue preferred". Promoters which initiate transcription only in certain tissue are referred to as "tissue specific". A "cell type" specific promoter is primarily drives expression in certain cell types in one or more organs, for example, vascular cells in roots or leaves. An "inducible" promoter is a promoter which is under environmental control. Examples of environmental conditions. that may effect transcription by inducible Applicant Ref. No.: 0~71R-PCT. app ~: ..
. . . . . . r ~ :.
.
r .
:.
promoters include anaerobic conditions or the presence of light. Tissue specific, cell type specific, and inducible promoters constitute th~~ class of "non-constitutive"
promoters. A
"constitutive" promoter is a promoter which is active under most environmental conditions.
The terms "polypeptide involved in protease inhibition" or "protease inhibitor polypeptide" refer to one or more proteins, in ~;lycosylated or non-glycosylated form, acting as a protease inhibitor. Examples are included as, but not limited to:
chymotrypsin inhibitor, trypsin inhibitor, protease inhibitor, pre-pro-proteinase inhibitor I, subtilisin-chymotrypsin inhibitor, tumor-related protein, genetic tumor-related proteinase inhibitor, subtilisin inhibitor, endopeptidase inhibitor, serine protease inhibitor, wound-inducible proteinase inhibitor, and eglin c. The term is also inclusive of fragments, variants, homologs, alleles or precursors (e.g., preproproteins or proproteins) thereof.
A "protease inhibitor protein" comprises a protease inhibitor polypeptide.
As used herein "recombinant" includes reference to a cell, or nucleic acid, or 1 S vector, that has been modified by the introduction of a heterologous nucleic acid or the alteration or placement of a native nucleic acid to a form or to a locus not native to that cell, or that the cell is derived from a cell so modified. Thus, for example, recombinant cells express genes that are not found in identical form within the native (non-recombinant) form of the cell or express native genes that are otherwise abnormally expressed, under expressed or not expressed at all. The term "recombinant" as used herein does not encompass the alteration of the cell, nucleic acid or vector by naturally occurring events (e.g., spontaneous mutation, n~~tural transformati~ftransduction/transposition) such as those occurring without direct human intervention.
As used herein, a "recombinant expression cassette" is a nucleic acid construct, generated recombinantly or synthetically, with a series of specified nucleic acid elements which permit transcription of a particular nucleic acid in a target cell. The recombinant expression cassette can be incorporated into a pllasmid, chromosome, mitochondria) DNA, plastid DNA, virus, or nucleic acid fragment. Typically, the recombinant expression cassette portion of the expression vector includes, among other sequences, a nucleic acid to be transcribed, and a promoter.
APJIENDED SHEET
Applicant Ref. No.: 05718-PC'r.app ' . , _ . . . ~ . ~ . ' . . .
. . ' ~ .
., .. ." ."' ..
The term "residue" or "amino acid residue" or "amino acid" are used interchangeably herein to refer to an amino acid that is incorporated into a protein, polypeptide, or peptide (collectively "protein"). The amino acid may be a naturally occurring amino acid and, unless otherwise limited, may encompass known analogs of natural amino acids that can function in a simil'.ar manner as naturally occurring amino acids.
The term "selectively hybridizes" includes reference to hybridization, under stringent hybridization conditions, of a nucleic acid sequence to a specified nucleic acid target sequence to a detectably greater degree (e.g., at least 2-fold over background) than its hybridization to non-target nucleic acid sequences and to the substantial exclusion of non-target nucleic acids. Selectively hybridizing sequences typically have about at least 80% sequence identity, preferably 90% sequence identity, and most preferably 100%
sequence identity (i.e., complementary) with each other.
The terms "stringent conditions" or "stringent hybridization conditions" includes reference to conditions under which a probe will hybridize to its target sequence, to a detectably greater degree than other sequences (e.g., at least 2-fold over background). Stringent conditions are sequence-dependent and will be different in different circumstances. Longer sequences hybridize specifically at higher temperatures.
Generally, stringent conditions are selected to be about 5 °C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50% of a complementary target seque~e hybridizes to a perfectly matched probe. Typically, stringent conditions will be those in which the salt concentration is less than about 1.0 M Na ion, typically about 0.01 to 1.0 M Na ion concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes (e.g., 10 to 50 nucleotides) and at least about 60°C for long probes (e.g., greater than 50 nucleotides).
Stringent conditions may also be achieved with the addition of destabilizing agents such as formamide.
Exemplary low stringency conditions include hybridization with a buffer solution of 30%
formamide, 1 M NaCI, 1% SDS at 37°C, and a wash in 2X SSC at 50°C.
Exemplary high stringency conditions include hybridization in 50% formamide, 1 M NaCI, 1 % SDS at 37°C, and a wash in O.1X SSC at 60°C.
ja A~ ~
r f ~Ff fJCLI Ci ~"~.~
Applicant Ref. No.: 05718-PCT.app ., . " , . .. . . . . .~ . . . , .
. . ..
. . . . . . .
.. ~. ... .... ,.
Stringent hybridization conditions in the context of nucleic acid hybridization assay formats are sequence dependent, and are different under different environmental parameters. Longer sequences hybridize selectively at higher temperatures. An extensive guide to the hybridization of nucleic acids is found in Tijssen, Laboratory Techniques in Biochemistry and Molecular Biology--Hybridization with Nucleic Acid Probes, Part I, Chapter 2 "Overview of principles of hybridization and the strategy of nucleic acid probe assays", Elsevier, New York (1993).
The terms "transfection" or "transformation" include reference to the introduction of a nucleic acid into a eukaryotic or prokaryotic cell where the nucleic acid may be incorporated into the genome of the cell (e.g., chromosome, plasmid, plastid or mitochondria) DNA), converted into an autonomous replicon, or transiently expressed (e.g., transfected mRNA).
As used herein, "transgenic plant" includes reference to a plant which comprises within its genome a heterologous polynucleotide. Generally, the heterologous polynucleotide is stably integrated within the genome such that the polynucleotide is passed on to successive generations. The heterologous polynucleotide may be integrated into the genome alone or as part of a recombinant expression cassette.
"Transgenic" is used herein to include any cell, cell line, callus, tissue, plant part or plant, the genotype of which has been altered by the presence of heterologous nucleic acid including those transgenics initially so altered as well as those created by sexual crosses or asexual propagation from the initial transgenic. The term "transgenic" as used herein does not encompass the alteration~e genome (chromosomal or extra-chromosomal) by conventional plant breeding methods or by naturally occurring events such as random cross-fertilization, non-recombinant viral infection, non-recombinant bacterial transformation, non recombinant transposition, or spontaneous mutation.
As used herein, "vector" includes reference to a nucleic acid used in transfection of a host cell and into which can be inserted a polynucleotide. Vectors are often replicons. Expression vectors permit transcription of a nucleic acid inserted therein.
The following terms are used to describe the sequence relationships between two or more nucleic acids or polynucleotides: (a) "reference sequence", (b) "comparison "'~ ~rG=_ , ,...., 75529-49(S) window", (c) "sequence identity", (d) "percentage of sequence identity", and (e) "substantial identity".
(a) As used herein, "reference sequence" is a defined sequence used as a basis for sequence comparison. A reference sequence may be a subset or the entirety of a specified sequence; for example, as a segment of a full-length cDNA or gene sequence, or the complete ~cDNA or gene sequence.
(b) As used herein, "comparison window" means includes reference to a contiguous and specified segment of a polynucleotide sequence, wherein the polynucleotide sequence may be compared to a reference sequence and wherein the portion of the polynucleotide sequence in the comparison window may comprise additions or deletions (i.e., gaps) compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences.
Generally, the comparison window is at least 20 contiguous nucleotides in length, and optionally can be 30, 40, 50, 100, or longer. Those of skill in the art understand that to avoid a high similarity to a reference sequence due to inclusion of gaps in the polynucleotide sequence a gap penalty is typically introduced and is subtracted from the number of matches.
Methods of alignment of sequences for comparison are well-known in the art.
Optimal alignment of sequences for comparison may be conducted by the local homology algorithm of Smith and Waterman, Adv. Appl. Math 2: 482 (1981); by the homology alignment algorithm ofNeedleman and Wunsch, J. Mol. Biol. 48: 443 (1970); by the search for similarity method of Pearson and Lipman, Proc. Natl. Acad. Sci. 85:
(1988); by computerized implementations of these algorithms, including, but not limited to: CLUSTAL in the PC/Gene program by Intelligenetics, Mountain View, California, GAP, BESTFIT, BLAST" FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, Wisconsin, USA;
the CLUSTAL program is well described by Higgins and Sharp, Gene 73: 237-244 (1988); Higgins and Sharp, CABIOS 5: 151-153 (1989); Corpet, et al., Nucleic Acids Research 16: 10881-90 (1988); Huang, et al., Computer Applications in the Biosciences 8: 15~-65 (1992), and Pearson, et al., Methods in Molecular Biolosy 24: 307-331 (1994);
preferred computer alignment methods also include the BLASTP, BLASTN, and BLASTX algorithms. Altschul, et al., J. Mol. Biol. 215: 403-410 (1990).
Alignment is also often performed by inspection and manual alignment.
Trade-mark 23 Applicant Ref. No.: 05718-PCT.app ' . ,. . , . . ,~ , ~, , . .
. . , ". . . , , , , ,. .. .., . "' (c) As used herein, "sequence identity" or "identity" in the context of two nucleic acid or polypeptide sequences includes reference to the residues in the two sequences which are the same when aligned fo:r maximum correspondence over a specified comparison window. When percentage of sequence identity is used in reference to proteins it is recognized that residue positions which are not identical often differ by conservative amino acid substitutions, where amino acid residues are substituted for other amino acid residues with similar chemical properties (e.g. charge or hydrophobicity) and therefore do not change the functional propertifa of the molecule. Where sequences differ in conservative substitutions, the percent sequence identity may be adjusted upwards to correct for the conservative nature of the substitution. Sequences which differ by such conservative substitutions are said to have "sequence similarity" or "similarity". Means for making this adjustment are well-known to those of skill in the art.
Typically this involves scoring a conservative substitution as a partial rather than a full mismatch, thereby increasing the percentage sequence identity. Thus, for example, where an identical amino acid is given a score of 1 and a non-conservative substitution is given a score of zero, a conservative substitution is given a score between zero and 1. The scoring of conservative substitutions is calculated, e.g., according to the algorithm of Meyers and Miller, Computer Applic. Biol. Sci., 4: 11-17 (1988) e.g., as implemented in the program PC/GENE (Intelligenetics, Mountain View, California, USA).
(d) As used herein, "percentage of sequence identity" means the value determined by comparing two optimally aligned sequences over a comparison window, wherein the portion of the polynucleotide sequence in the comparison window may corrrpr~'s~ additions or deletions (i.e., gaps) as compared to the reference sequence (which does not comprise additions or deletion~~) for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical nucleic acid base or amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison and multiplying the result by 100 to yield the percentage of sequence identity.
(e) (i) The term "substantial identity" of polynucleotide sequences means that a polynucleotide comprises a sequence that has at least 70%
sequence identity, preferably at least 80%, more preferably at lease: 90% and most preferably at least 95%, A~,~ENDED SlIEET
Applicant Ref. No.: 0~71R-PCT.app compared to a reference sequence using one of the alignment programs described using standard parameters. One of skill will recognize that these values can be appropriately adjusted to determine corresponding identity of proteins encoded by two nucleotide sequences by taking into account codon degeneracy, amino acid similarity, reading frame positioning and the like. Substantial identity of amino acid sequences for these purposes normally means sequence identity of at least 60'%, more preferably at least 70%, 80%, 90%, and most preferably at least 95%. Polypeptides which are "substantially similar"
share sequences as noted above except that residue positions which are not identical may differ by conservative amino acid changes.
Another indication that nucleotide sequences are substantially identical is if two molecules hybridize to each other under stringent conditions. Generally, stringent conditions are selected to be about 5°C to about 20°C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The T,n is the temperature (under defined ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched probe. Typically, stringent wash conditions are those in which the salt concentration is about 0.02 molar at pH 7 and the temperature is at least about 50, 55, or 60°C. However, nucleic acids which do not hybridize to each other under stringent conditions are still substantially identical if the polypeptides which they encode are substantially identical. This may occur, e.g., when a copy of a nucleic acid is created using the maximum codon degeneracy permitted by the genetic code. One indication that two nucleic acid sequences are substantially identical is that the polypeptide which the first nucleic acid encodes is immunologically cross reactive with the polypeptide encoded by the sec~ucleic acid.
(e) (ii) The terms "substantial identity" in the context of a peptide indicates that a peptide comprises a sequence with at least 70% sequence identity to a reference sequence, preferably 80%, more preferably 85'%, most preferably at least 90%
or 95%
sequence identity to the reference sequence over a specified comparison window.
Preferably, optimal alignment is conducted using the homology alignment algorithm of Needleman and Wunsch, J. Mol. Biol. 48: 443 (1970). An indication that two peptide sequences are substantially identical is that one peptide is immunologically reactive with antibodies raised against the second peptide. Thus, a peptide is substantially identical to a r;'J!Er;r~~D SHEET
Applicant Ref. No.: 05718-PCT.app second peptide, for example, where the two peptides differ only by a conservative substitution.
S It has been unexpectedly discovered that a protease inhibitor can be modified to enhance its content of essential amino acids coupled with reduction in protese inhibitor activity. In a preferred embodiment of the present invention, derivatives of the protease inhibitor, CI-2, simultaneously exhibit both enhanced essential amino acid content as well as decreased protease inhibitor activity. The present compounds are thus excellent candidates for enhancing the nutritional value of feed.
The present invention provides, inter aha, compositions and methods for modulating (i.e., increasing or decreasing) the total levels of essential amino acids and/or altering the ratios of essential amino acids in plants. Thus, the present invention provides utility in such exemplary applications as improving the nutritional properties of fodder crops, increasing the value of plant material for pulp and paper production, altering the protease inhibitory activity, as well as for improving the utility of plant material where the amount of essential amino acids or composition is important, such as the use of plant as a feed. In particular, protease inhibitor polypeptides may be expressed at times or in quantities which are not characteristic of natural plants.
The present invention also provides isolated nucleic acid comprising polynucleotides of sufficient length and complementarity to a protease inhibitor gene, to use as probes or amplification primers in the detection, quantitation, or isolation of gene transcripts~r example, isolated nucleic acids of the present invention can be used as probes in detecting deficiencies in the level of mRNA in screenings for desired transgenic plants, for detecting mutations in the gene (e.g., substitutions, deletions, or additions), for monitoring upregulation of protease inhibition in screening assays for compounds affecting protease inhibition, or for use as molecular markers in plant breeding programs.
The isolated nucleic acids of the present invention can also be used for recombinant expression of protease inhibitor polypeptides for use as immunogens in the preparation and/or screening of antibodies. The isolated nucleic acids of the present invention can also be employed for use in sense or antisense suppression of one or more protease inhibitor genes in a host cell, tissue, or plant. Further, using a primer specific to an . ,.,.- , ~t.~ -r .. ~~ r Applicant Ref. No.: 0571 R-PC1'.app insertion sequence (e.g., transposon) and a primer which specifically hybridizes to an isolated nucleic acid of the present invention, one can use nucleic acid amplification to identity insertion sequence inactivated protease: inhibitor genes from a cDNA
library prepared from insertion sequence mutagenized plants. Progeny seed from the plants S comprising the desired inactivated gene can be grown to a plant to study the phenotypic changes characteristic of that inactivation. See, Tools to Determine the Function of Genes, 1995 Proceedings of the Fiftieth Annual Corn and Sorghum Industry Research Conference, American Seed Trade Association., Washington, D.C., 1995.
The present invention also provides isolated proteins comprising polypeptides having a minimal amino acid sequence from the polypeptides involved in protease inhibition as disclosed herein. The present invention also provides proteins comprising at least one epitope from a polypeptide involved in protease inhibition. The proteins of the present invention can be employed in assays fo:r enzyme agonists or antagonists of enzyme function, or for use as immunogens or antigens to obtain antibodies specifically immunoreactive with a protein of the present invention. Such antibodies can be used in assays for expression levels, for identifying andJor isolating nucleic acids of the present invention from expression libraries, or for purification of polypeptides involved in protease inhibition. In a preferred embodiment ~of the present invention, the present protein has both elevated essential amino acid content and reduced protease inhibitor activity.
The isolated nucleic acids of the present invention can be used over a broad range of plant types, including species from the genera Ctrcttrbita, Rosa, Vitis, Juglans, Fragariar.ls, Medicago, Onobrychis, Trifoli'um, Trigonella, Vigna, Citrus, Linttm, Geranium, Manihot, Daucus, Arabidopsis, Brassica, Raphanus, Sinapis, Atropa, Capsicum, Datura, Hyoscyamus, Lycopersicon, Nicotiana, Solanum, Petunia, Digitalis, Majorana, Ciahorium, Helianthus, Lactuca, Br~omus, Asparagus, Antirrhinum, Heterocallis, Nemesis. Pelargonitrm, Panieum, Pennisetum, Ranunculus, Senecio, Salpiglossis, Ctrcumis, Browaalia, Glycine, Pisium, Phaseolus, Lolium, Oryza, Zea, Avena, Hordeum, Secale, Triticum, Sorghum, Picea, and Populus.
. _ _- ~ --CL=y 75529-49(S) The isolated nucleic acids of the present invention can be used over a broad range of po--ypeptide types, including anti-microbial peptides such as those described in Rao, G., Antimicrobial Peptides; Molecular Plant-Microbe Interactions 8:6-13 (1995).
Protease Inhibitor Nucleic Acids The present invention provides, inter alia, isolated and/or heterologous nucleic acids of RNA, DNA, and analogs and/or chimeras thereof, comprising a protease inhibitor polynucleotide encoding such proteins as: chymotrypsin inhibitor, trypsin inhibitor, protease inhibitor, pre-pro-proteinase inhibitor I, subtilisin-chymotrypsin inhibitor, tumor-related protein, genetic tumor-related proteinase inhibitor, subtilisin inhibitor, endopeptidase inhibitor, serine protease inhibitor, wound-inducible proteinase inhibitor, and eglin c. The protease inhibitor nucleic acids of the present invention comprise protease inhibitor polynucleotides which, are inclusive of:
(a) a polynucleotide encoding a protease inhibitor polypeptide of SEQ ID NOS: 2,4,6,8,10, or 12,16,18,20,22,24 and conservatively modified and polymorphic variants thereof, including exemplary polynucleotides of SEQ ID NOS: 1,3,5,7,9 and 11,15,17,19,21,23 and. conservative changes (b) a polynucleotide which is the product of amplification from Zea mays nucleic acid library using primer pairs from amongst. the consecutive pairs from SEQ ID NOS: 25 and 26, which amplify polynuc:leotides having substantial identity to polynucleotides from amongst those having SEQ ID
NOS: 1,3,5,7,9 or 11,15,17,19,21,23 75529-49(S) (c) a polynucl_eotide which selectively hybridizes under stringent hybridization conditions consisting of washing in a salt concentration of about 0.02 molar at pH 7 at 50°C, to a polynucleotide of (a) or (b);
(d) a polynucl.eotide having at least 60o sequence identity with Sequence I:D NOS: 1,3,5,7,9,11,15,17,19,21 or 23;
(e) a polynucl.eotide encoding a protein having a specified number of contiguous amino acids from a prototype polypeptide, wherein the protein is specifically recognized by antisera elicited by prE:sentation of the protein and wherein the protein does 28a Applicant Rcf. No.: 057! R-PCT.app ' ~ - . .
. .~ . , , . . . . . . . , , ", .. .. .,. . . ., not detectably immunoreact to antisera which has been fully immunosorbed with the protein;
(f) complementary sequences oi'polynucleotides of (a), (b), (c), (d), or (e);
and (g) a polynucleotide comprising; at least 20 contiguous nucleotides from a polynucleotide of Sequence ID Nos. 1, 3, 5, 7, 9, 11, 15, 17, 19, 21 or 23.
A. Polynucleotides Encoding A Protease inhibitor Protein of SEQ ID NOS: 2, 4, 6, 8,10 and 12,16,18, 20, 22, 24 or Conservatively Mod ified or Polymorphic Variants Thereof As indicated in (a), sarpra, the present invention provides isolated and/or heterologous nucleic acids comprising protease: inhibitor polynucleotides, wherein the polynucleotides encode the protease inhibitor polypeptides disclosed herein as SEQ ID
NOS: 2,4,6,8,10 and 12,16,18,20,22,24 or conservatively modified or polymorphic variants thereof. Those of skill in the art will recognize that the degeneracy of the genetic 1 ~ code allows for a plurality of polynucleotides to encode for the identical amino acid sequence. Thus, the present invention includes protease inhibitor polynucleotides of SEQ
ID NOS: 1,3,5,7,9 and 11, 15,17,19,21, 23 and silent variations of polynucleotides encoding a protease inhibitor polypeptide of SF:Q ID NOS: 2,4,6,8,10 and 12,16,18,20,22,24. The present invention further provides isolated and/or heterologous nucleic acids comprising protease inhibitor pol;ynucleotides encoding conservatively modified variants of a protease inhibitor polypeptide of SEQ ID NOS:
2,4,6,8,10 and 12, 16,18,20,22,24. Additionally, the present invention further provides isolated and/or heterolog8a~iucleic acids comprising protease inhibitor polynucleotides encoding one or more polymorphic (allelic) variants of protease inhibitor polypeptides/polynucleotides.
B. Polynucleotides Amplified from a Zea mays .Nucleic Acid Library As indicated in (b), supra, the present invention provides isolated and/or heterologous nucleic acids comprising protease inhibitor polynucleotides, wherein the polynucleotides are amplified from a Zea mays nucleic acid library. The nucleic acid library may be a cDNA library, a genomic library, or a library generally constructed from nuclear transcripts at any stage of intron processing. Nucleic acid libraries from other plants, both monocots and dicots could also be used in a similar fashion. The ~1~~PJDEt? c~.,~'T
Applicant Ref. No.: 05718-PCT.app polynucleotides of the present invention include those amplified using the following primer pairs:
SEQ ID NOS: 25 and 26 which yield an amplic,on comprising a sequence having substantial identity to SEQ ID NOS: 7,9, and 11.
S Thus, the present invention provides protease inhibitor synthetic polynucleotides having the sequence of the gene, a nuclear transcript, a cDNA, or complementary sequences thereof. In preferred embodiments, l:he nucleic acid library is constructed from Zea mays, such as lines B73, PHRE1, A632, B1VIS-P2#10, and W23, each of which are known and publicly available. In particularly preferred embodiments, the library is constructed from tissue such as root, leaf, or tassel, or embryonic tissue.
The amplification products can be translated using expression systems well known to those of skill in the art and as discussed, infr~2. The resulting translation products can be confirmed as protease inhibitor polypeptides. of the present invention by, for example, assaying for the appropriate inhibition activity or verifying the presence of a linear epitope which is specific to a protease inhibitor polypeptide using standard immunoassay methods.
Those of ordinary skill will appreciate that primers which selectively amplify, under stringent conditions, the polynucleotides of the present invention (and their complements) can be constructed by reference 1:o the sequences provided herein at SEQ
ID NOS: 1,3,5,7,9 and 11. In preferred embodiments, the primers will be constructed to anneal with the first three contiguous nucleotides at their 5' terminal end's to the first codon encoding the carboxy or amino terminal amino acid residue (or the complements thereof) opolynucleotides of the present invention. Typically, such primers are at least 15 nucleotides in length. The primer length in nucleotides is selected from the group of integers consisting of from at least 15 to 90. Thus, the primers can be at least 15, 18, 20, 25, 30, 40, 50, 60, 70, 80, or 90 nucleotides in length.
The amplification primers may optionally be elongated in the 3' direction with contiguous nucleotide sequences from polynucleotide sequences of SEQ ID NOS:
1,3,5,7,9 and 11, 15,17,19,21, from which they are derived. The number of nucleotides by which the primers can be elongated is selected from the group of integers consisting of from at least 1 to 25. Thus, for example, the primers can be elongated with an additional l, 5, 10, or 1 S nucleotides. Those of skill will recognize that a lengthened primer 30 '-'~w'~... -~_-Elppli,:ant Ref. No.: 05718-PCT.app sequence can be employed to increase specificity of binding (i.e., annealing) to a target sequence.
C. Polynucleotides Which Selectively Hybridize to a Polynucleotide of (A) or (B) As indicated in (c), supra, the present invention provides isolated and/or heterologous nucleic acids comprising protease inhibitor polynucleotides, wherein the polynucleotides selectively hybridize, under selective hybridization conditions, to a protease inhibitor polynucleotide of paragraphs (A) or (B) as discussed, supra. Thus, the polynucleotides of this embodiment can be used for isolating, detecting, and/or quantifying nucleic acids comprising the polynucleotides of (A) or (B). Low stringency hybridization conditions are typically, but not exclusively, employed with sequences having relatively small sequence identity. Moderate and high stringency conditions can optionally be employed for sequences of greater identity. Low stringency conditions allow selective hybridization of sequences having about 70% sequence identity.
D. Polynucleotides Having at Least 60% Seqzrence Identity with the Polynzrcleotides of (A), (B) or (C) As indicated in (d), supra, the present invention provides isolated and/or heterologous nucleic acids comprising protease inhibitor polynucleotides, wherein the polynucleotides have a specified identity at the nucleotide level to a polynucleotide as disclosed above in paragraphs (A), (B), (C), or (D). The percentage of identity to a reference sequence is at least 60% and, rounded upwards to the nearest integer, can be expressed.as:~ integer selected from the group of integers consisting of from 60 to 99.
Thus, for example, the percentage of identity to a reference sequence can be at least 70%, 75%, 80%, 85%, 90%, or 95%.
The protease inhibitor polynucleotide optionally encodes a protein having a molecular weight as the unglycosylated protein within 20% of the molecular weight of the truncated or full-length protease inhibitor polype:ptides as disclosed herein (e.g., SEQ ID
NOS: 2,4,6,8,10 and 12). Preferably, the molecular weight is within 1 S% of a full length protease inhibitor polypeptide, more preferably within 10% or 5%, and most preferably 31 . . .,.; '~:' '_'.' ._ ;~~ 7~
Applicant Ref. No.: 05718-PCT. app within 3%, 2%, or 1% of a full length protease inhibitor polypeptide of the present invention.
Optionally, the protease inhibitor polynucleotides of this embodiment will encode a protein having an inhibitory activity less than or equal to 20%, 30%, 40%, or 50% of the native, endogenous (i.e., non-isolated), full-length protease inhibitor polypeptide.
Determination of protein inhibition can be detemined by any number of means well known to those of skill in the art.
F. Polynucleotides Complementary to the Polyr~ucleotides of (A)-(E) As indicated in (f), supra, the present invention provides isolated and/or heterologous nucleic acids comprising protease inhibitor polynucleotides, wherein the polynucleotides are complementary to the polynucleotides of paragraphs A-E, above. As those of skill in the art will recognize, complementary sequences base-pair throughout the entirety of their length with the polynucleotides of (A)-(E) (i.e., have 100%
sequence identity). Complementary bases associate through hydrogen bonding in double stranded nucleic acids. For example, the following base pairs are complementary:
guanine and cytosine; adenine and thymine; and adenine and uracil.
G. Polynz~cleotides Which are Subseqzrences of ,the Polynucleotides of (A)-(F) As indicated in (h), supra, the present invention provides isolated and/or heterologous nucleic acids comprising protease inhibitor polynucleotides, wherein the polynucleotide comprises at least 15 contiguous bases from the polynucleotides of (A) through (F~.-discussed above. The length of tile polynucleotide is given as an integer selected from the group consisting of from at least 1 S to the length of the nucleic acid sequence from which the protease inhibitor polynucleotide is a subsequence of.
Thus, for example, polynucleotides of the present invention are inclusive of polynucleotides comprising at least 15, 20, 25, 30, 40, 50, 60, 75, or 100 contiguous nucleotides in length from the polynucleotides of (A)-(F). Optionally, the number of such subsequences encoded by a polynucleotide of the instant embodiment can be any integer selected from the group consisting of from 1 to 20, such as 2, :3, 4, or 5.
Construction of Protease inhibitor Nucleic Acids t-r /cppli~ant Ref. No.: 0571 R-PCT.app The isolated and/or heterologous protease inhibitor nucleic acids of the present invention can be made using (a) standard recombinant methods, (b) synthetic techniques, or combinations thereof. In some embodiments, the protease inhibitor polynucleotides of the present invention will be cloned, amplified, or otherwise constructed from a plant.
The preferred plants are barley and Zea mays, such as inbred line B73 which is publicly known and available. Particularly preferred is the use of Zea mays tissue such as roots, leaves, tassels, seeds or embryonic tissue.
A. Recombinant Methods for Constructing Protease inhibitor Nucleic Acids The isolated and/or heterologous nucleic; acid compositions of this invention, such as RNA, cDNA, genomic DNA, or a hybrid thereof, can be obtained from plant biological sources using any number of cloning methodologies known to those of skill in the art.
The isolation of protease inhibitor polynucleotides may be accomplished by a number of techniques. For instance, oligonucleotide probes based on the sequences disclosed here can be used to identify the desired gene in a cDNA or genomic DNA
library. To construct genomic libraries, large segments of genomic DNA are generated by random fragmentation, e.g. using restriction ene!onucleases, and are ligated with vector DNA to form concatemers that can be packaged into the appropriate vector. To prepare a cDNA library, mRNA is isolated from the desired organ, such as sclerenchyma and a cDNA library which contains the gene encoding; for a protease inhibitor protein (i.e., the protease inhibitor gene) is prepared from the mF:NA. Alternatively, cDNA may be prepared from mRNA extracted from other tissues in which protease inhibitor genes or homologs.xpressed.
The DNA or genomic library can then bf: screened using a probe based upon the sequence of a cloned protease inhibitor polynucl.eotide such as those disclosed herein.
Probes may be used to hybridize with genomic I~NA or cDNA sequences to isolate homologous genes in the same or different plant species. Those of skill in the art will appreciate that various degrees of stringency of hybridization can be employed in the assay; and either the hybridization or the wash medium can be stringent. As the conditions for hybridization become more stringent, there must be a greater degree of complementarity between the probe and the target for duplex formation to occur. The degree of stringency can be controlled by temperature, ionic strength, pH and the presence AMENDED SHEET
Applicant Ref. No.: 0~7tR-PCT.app .. ..
. . , . . . .
. ~ . . . . . . ~..
.. .. .~. .... ..
of a partially denaturing solvent such as formamide. For example, the stringency of hybridization is conveniently varied by changing the polarity of the reactant solution through manipulation of the concentration of formamide within the range of 0%
to 50%.
Cloning methodologies to accomplish these ends, and sequencing methods to S verify the sequence of nucleic acids are well known in the art. Examples of appropriate cloning and sequencing techniques, and instructions sufficient to direct persons of skill through many cloning exercises are found in S~~mbrook, et al., Molecular Cloning. A
Laboratory Manual, 2nd Ed., Cold Spring Harlbor Laboratory Vols. 1-3 (1989), Methods in Enzymology, Vol. 152: Garide to Molecular Cloning Technigues, Berger and Kimmel, Eds., San Diego: Academic Press, Inc. (1987), Current Protocols in Molecular Biology, Ausubel, et al., Eds., Greene Publishing and Wiley-Interscience, New York ( 1987); Plant Molecular Biology: A Laboratory ManZral, Clark, Ed., Springer-Verlag, Berlin (1997).
The nucleic acids of interest can also be; amplified from nucleic acid samples using amplification techniques. For instance, polymerase chain reaction (PCR) technology can be used to amplify the sequences of protease inhibitor polynucleotides of the present invention and related genes directly from genomic DNA or cDNA
libraries.
PCR and other in vitro amplification methods may also be useful, for example, to clone nucleic acid sequences that code for proteins to be expressed, to make nucleic acids to use as probes for detecting the presence of the desii:-ed mRNA in samples, for nucleic acid sequencing, or for other purposes.
The degree of complementarity (sequence identity) required for detectable binding will vary in accordance with the stringency of the hybridization medium and/or wash medium. -~ewdegree of complementarity will optimally be 100 percent; however, it should be understood that minor sequence variations in the probes and primers may be compensated for by reducing the stringency of the hybridization and/or wash medium.
Examples of techniques sufficient to direct persons of skill through in vitro amplification methods are found in Berger, Sarnbrook, and Ausubel, as well as Mullis et al., U.S. Patent No. 4,683,202 (1987); PCR Protocols A Guide to Methods and Applications, Innis et al., Eds., Academic Press; Inc., San Diego, CA (1990);
Arnheim &
Levinson, C&EN pp. 36-47 (October 1, 1990).
B. Synthetic Methods for Constructing Protease inhibitor Nucleic Acids ~r:~~~~~r:~ s~L~r Hppl~cant Ref. No.: 05718-PCT.app ' ~ ~ ~ ~ , .. ~. . . .. . ..
The isolated nucleic acids of the present invention can also be prepared by direct chemical synthesis by methods such as the phosphotr~iester method of Narang et al., Meth.
Enzvmol. 68: 90-99 ( 1979)and the phosphodiester method of Brown et al., Meth.
Enzymol. 68: 109-151 (1979). The isolated nucleic acids of the present invention can also be modified through methods such as site directed mutogenesis, error prone PCR
and known to one of skill.
Recombinant Expression Cassettes The present invention further provides recombinant expression cassettes comprising a protease inhibitor nucleic acid of the present invention. A
nucleic acid sequence coding for the desired protease inhibitor polynucleotide, for example a cDNA or a genomic sequence encoding a full length protease inhibitor protein, can be used to construct a recombinant expression cassette wr~ich can be introduced into the desired host cell. A recombinant expression cassette will typically comprise a protease inhibitor polynucleotide operably linked to transcriptional initiation regulatory sequences which will direct the transcription of the protease inhibitor polynucleotide in the intended host cell, such as tissues of a transformed plant.
For example, plant expression vectors may include (1) a cloned plant gene under the transcriptional control of 5' and 3' regulatory sequences and (2) a dominant selectable marker. Such plant expression vectors may also contain, if desired, a promoter regulatory region (e.g., one conferring inducible or constitutivvironmentally- or developmentally-regulated, or cell- or tissue-specific/selective expression), a transcription initiation start site, a ribosome binding site, an RNA processing signal, a transcription termination site, and/or a polyadenylation signal. Highly preferred plant expression cassettes will be designed to include one or more selectable marker genes, .such as kanamycin resistance or herbicide tolerance genes.
A plant promoter fragment may be employed which will direct expression of the protease inhibitor polynucleotide in all tissues of a regenerated plant. Such promoters are referred to herein as "constitutive" promoters arid are active under most environmental ~,t,p~r,~~cr c~..c~-.
~.~ ....,:... i Applicant Ref. No.: 05718-PCT.app . , . "' , , ~~, , , , ,. .. , . ,.
conditions and states of development or cell differentiation. Examples of constitutive promoters include the cauliflower mosaic virus (CaMV) 35S transcription initiation region, the 1'- or 2'- promoter derived from T-L>NA ofAgrobacterium tumefat;iens, the ubiquitin 1 promoter, the Smas promoter, the ci:nnamyl alcohol dehydrogenase promoter (U.S. Patent No. 5,683,439), the Nos promoter, the pEmu promoter, the rubisco promoter, the GRP1-8 promoter, and other transcription initiation regions from various plant genes known to those of skill. In a preferred embodirrlent, the gamma zero promoter of maize would be used.
Alternatively, the plant promoter may direct expression of the protease inhibitor polynucleotide in a specific tissue or may be otherwise under more precise environmental or developmental control Examples of promoters under developmental control include promoters that initiate transcription only, or preferentially, in certain tissues, such as leaves, roots, fruit, seeds, or flowers. The operation of a promoter may also vary depending on its location in the genome. Thus, an inducible promoter may become fully or partially constitutive in certain locations.
Both heterologous and non-heterologous (i.e., endogenous) promoters can be employed to direct expression of the protease inhibitor nucleic acids of the present invention. These promoters can also be used, fir example, in recombinant expression cassettes to drive expression of antisense nucleic acids to reduce, increase, or alter protease inhibitor content and/or composition in a desired tissue Methods for identifying promoters with a particular expression pattern, in terms of, e.g., tissue type, cell type, stage of development, andJor environmental conditiorrs~=a~e well known in the art. See, e.g., The Maize Handbook, Chapters 114-115, Freeling and Walbot, Eds., Springer, New York: (1994); Corn and Corn Improvement, 3'a edition, Chapter 6, Sprague and Dudley, Eds., American Society of Agronomy, Madison, Wisconsin (1988). A typical step in promoter isolation methods is identification of gene products that are expressed with some degree of specificity in the target tissue. Amongst the range of methodologies are: differential hybridization to cDNA libraries;
subtractive hybridization; differential display; differential 2 -D gel electrophoresis;
DNA probe arrays;
and isolation of proteins known to be expressed. with some specificity in the target tissue.
Such methods are well known to those of skill in the art. Commercially available _ _., c_'_:
75529-49(S) products for identifying promoters are known in the art such as CloneTech's (Palo Alto, CA) PROMOTERFINDER DNA Walking Kit Once promoter and/or gene sequences are known, a region of suitable size is selected from the genomic DNA that is 5' to the transcriptional start, or the translational start site, and such sequences are then linked to a coding sequence. If the transcriptional start site is used as the point of fusion, any of a number of possible 5' untranslated regions can be used in between the transcriptional start site and the partial coding sequence. If the translational start site at the 3' end of the specific promoter is used, then it is linked directly to the methionine start codon of a coding sequence.
If polypeptide expression is desired, it is generally desirable to include a polyadenylation region at the 3'-end of the protease inhibitor polynucleotide coding region. An intron sequence can be added to the ~' untranslated region or the coding sequence of the partial coding sequence to increase the amount of the mature message that accumulates in the cytosol Use of maize introns Adhl-S intron 1, 2, and 6, the Bronze-I
intron are known in the art. See generally, The Maize Handbook, Chapter 116, Freeling and Walbot, Eds., Springer, New York ( 1994).
The vector comprising the sequences from a protease inhibitor nucleic acid will typically comprise a marker gene which confers a selectable phenotype on plant cells.
Usually, the selectable marker gene will encode antibiotic resistance, with suitable genes including genes coding for resistance to the antibiotic spectinomycin (e.g., the aada gene), the streptomycin phosphotransferase (SPT) gene coding for streptomycin resistance, the neomycin phosphotransferase (NPTII) gene encoding kanamycin or geneticin resistance, the hygromycin phosphotransferase (HPT) gene coding for hygromycin resistance, genes coding for resistance to herbicides which act to inhibit the action of acetolactate synthase (ALS), in particular the sulfonylurea-type herbicides (e.g., the acetolactate synthase (ALS) gene containing mutations leading to such resistance in particular the S4 and/or Hra mutations), genes coding for resistance to herbicides which act to inhibit action of glutamine synthase, such as phosphinothricin or basta (e.g., the bar gene), or other such genes known in the art. The bar gene encodes resistance to the herbicide basta, the nptll gene encodes resistance to the antibiotics kanamycin and geneticin, and the ALS gene encodes resistance to the herbicide chlorsulfuron.
Trade-mark Applicant Ref. No.: 071 R-PCT.app ' . . .. ,.
. . , " . . , , . , . , . . . ,.
,. ,. ". .,.. , Typical vectors useful for expression of genes in higher plants are well known in the art and include vectors derived from the tumor-inducing (Ti) plasmid of Agrobacterium tumefaciens described by Rogers et al., Meth. In Enzymol., 153:253-277 (1987). These vectors are plant integrating vect~~rs in that on transformation, the vectors integrate a portion of vector DNA into the genome of the host plant. Exemplary A.
tumefaciens vectors useful herein are plasmids pKYLX6 and pKYLX7 of Schardl et al., Gene, 61:1-11 (1987) and Berger et al., Proc. Natl. Acad. Sci. U.S.A., 86:8402-(1989). Another useful vector herein is plasmid pBI101.2 that is available from Clontech Laboratories, Inc. (Palo Alto, CA).
The protease inhibitor polynucleotide of the present invention can be expressed in either sense or anti-sense orientation as desired.
Protease inhibitor Proteins The isolated protease inhibitor proteins of the present invention comprise a protease inhibitor polypeptide having at least 10 amino acids encoded by any one of the protease inhibitor polynucleotides as discussed more fully, sarpra, or polypeptides which are conservatively modified variants thereof. Exemplary protease inhibitor polypeptide sequences are provided in SEQ ID NOS: 2,4,6,8,10 and 12. The protease inhibitor proteins of the present invention or variants thereof can comprise any number of contiguous amino acid residues from a protease; inhibitor protein, wherein that number is selected from the group of integers consisting of from 10 to the number of residues in a full-length protease inhibitor polypeptide. Optionally, this subsequence of contiguous amino acids at least 15, 20, 25, 30, 35, or 40 amino acids in length, often at least 50, 60, 70, 80, or 90 amino acids in length. Further, the number of such subsequences can be any integer selected from the group consisting of from 1 to 20, such as 2, 3, 4, or 5.
As those of skill will appreciate, the preaent invention includes protease inhibitor polypeptides with less inhibitory activity. Less inhibitory protease inhibitor polypeptides have an inhibitory activity at least 20%, 30%, or 40%, and preferably at least 50% or 60%, below that of the native (non-synthetic), endogenous protease inhibitor polypeptide.
A preferred immunoassay is a competitive immunoassay as discussed, infra.
Thus, the protease inhibitor proteins can be employed as immunogens for constructing ftpplicant Ref. No.: 05718-PCT.app 1 1 1 ! a 1 1 1 . 1 1 n I ~ I . 1 1 a 1 . 1 n n 1 v 1 n ! 1 1 ~ n . 1 a 1 1 1 o i , antibodies immunoreactive to a protease inhibitor protein for such exemplary utilities as immunoassays or protein purification techniquca.
Expression of Proteins in Host Cells Using the nucleic acids of the present invention, one may express a protease inhibitor protein in a recombinantly engineered cell such as bacteria, yeast, insect, mammalian, or preferably plant cells. The cells produce the protein in a non-natural condition (e.g., in quantity, composition, location, and/or time), because they have been genetically altered through human intervention to do so.
It is expected that those of skill in the act are knowledgeable in the numerous expression systems available for expression of nucleic acids encoding protease inhibitor proteins. No attempt to describe in detail the various methods known for the expression of proteins in prokaryotes or eukaryotes will be made.
IS
B. Expression in Eukaryotes A variety of eukaryotic expression systems such as yeast, insect cell lines, plant and mammalian cells, are known to those of skill in the art. As explained briefly below, protease inhibitor proteins of the present invention may be expressed in these eukaryotic systems. In some embodiments, transformed/transfected plant cells, as discussed infra, are employed as expression systems for production of the proteins of the instant invention.
Transfection/Transformation of Cells The method of transformation/transfection is not critical to the instant invention;
various methods of transformation or transfection are currently available. As newer methods are available to transform crops or othE:r host cells they may be directly applied.
Accordingly, a wide variety of methods have been developed to insert a DNA
sequence into the genome of a host cell to obtain the transcription and/or translation of the sequence to effect phenotypic changes in the organism. Thus, any method which provides for efficient transformation/transfection may be employed.
AMENDED SN~ET
Applicant Ref. No.: 0571 R-PCT.app A. Plant Transformation A DNA sequence coding for the desired protease inhibitor polynucleotide, for example a cDNA or a genomic sequence encoding a full length protein, will be used to construct a recombinant expression cassette which can be introduced into the desired plant.
Isolated nucleic acids of the present invention can be introduced into plants according to techniques known in the art. Generally, recombinant expression cassettes as described above and suitable for transformation of plant cells are prepared.
Techniques for transforming a wide variety of higher plant species are well known and described in the technical, scientific, and patent literature. Se:e, for example, Weising et al., Ann. Rev.
Genet. 22: 421-477 (1988). For example, the DIVA construct may be introduced directly into the genomic DNA of the plant cell using techniques such as electroporation, PEG
poration, particle bombardment, silicon fiber dellivery, or microinjection of plant cell protoplasts or embryogenic callus. Alternatively, the DNA constructs may be combined with suitable T-DNA flanking regions and introduced into a conventional Agrobacterium tumefaciens host vector. The virulence functions of the Agrobacterium tumefaciens host will direct the insertion of the construct and adjacent marker into the plant cell DNA
when the cell is infected by the bacteria.
The introduction of DNA constructs using polyethylene glycol precipitation is described in Paszkowski et al., Embo J. 3: 2717-2722 (1984). Electroporation techniques are described in Fromm et al., Proc. Natl. Acad. Sci. 82: 5824 (1985).
Ballistic transforms r ii techniques are described in Klein et al., Nature 327: 70-73 (1987).
Agrobacterium tumefaciens-meditated transformation techniques are well described in the scientific literature. See, for example Horsch et al., Science 233: 496-498 (1984), and Fraley et al., Proc. Natl. Acad. Sci. 80: 4803 (1583). Although Agrobacterium is useful primarily in dicots, certain monocots can be transformed by Agrobacterium. For instance, Agrobacterium transformation of maize is described in U.S. Patent No.
5,550,318.
Other methods of transfection or transformation include (1) Agrobacteria~m rhizogenes-mediated transformation (see, e.g., L,ichtenstein and Fuller In:
Genetic Engineering, vol. 6, PWJ Rigby, Ed., London, Academic Press, 1987; and Lichtenstein, C. P., and Draper, J,. In: DNA Cloning, Vol. II, D. M. Glover, Ed., Oxford, IRI Press, ~;"J~~;DEI~ ~~-tEEi 75529-49(S) 1985),Application PCT/US87/02512 (WO 88/02405 published Apr. 7, 1988) describes the use of A.rhizogenes strain A4 and its Ri plasmid along with A. tumefaciens vectors pARC8 or pARC 16 (2) liposome-mediated DNA uptake (see, e.g., Freeman et al., Plant Cell Physiol. 25: 1353, 1984), (3) the vortexing method (see, e.g., Kindle, Proc. Natl.
Acad. Sci., USA 87: 1228, (1990).
DNA can also be introduced into plants by direct DNA transfer into pollen as described by Zhou et al., Methods in Enzymology, 101:433 (1983); D. Hess, Intern Rev. Cytol., 107:367 (I987); Luo et al., Plane Mol. Biol. Reporter, 6:165 (1988). Expression ofpolypeptide coding genes can be obtained by injection of the DNA into reproductive organs of a plant as described by Pena et al., Nature, 325.:274 (1987). DNA can also be injected directly into the cells of immature embryos and the rehydration of desiccated embryos as described by Neuhaus et al., Theor. Appl. Genet., 75:30 (1987); and Benbrook et al., in Proceedings Bio Expo 1986, Butterworth, Stoneham, Mass., pp. 27-54 ( 1986). A variety of plant viruses that can be employed as vectors are known in the art and include cauliflower mosaic virus (CaMV), geminivirus, brome mosaic virus, and tobacco mosaic virus.
Synthesis of Proteins Protease inhibitor proteins of the present invention can be constructed using non-20. cellular synthetic methods. Solid phase synthesis of protease inhibitor proteins of less than about 50 amino acids in length may be accomplished by attaching the C-terminal amino acid of the sequence to an insoluble support followed by sequential addition of the remaining amino acids in the sequence. Techniques for solid phase synthesis are described by Barany and Merrifield, Solid-Phase Peptide Synthesis, pp. 3-284 in The Peptides: Analysis, Synthesis, Biology. Yol. 2: Special Methods in Peptide Synthesis, Part A.; Merrifield, et al., J. Am. Chem. Soc. 85: 2149-2156 (1963), and Stewart et al., Solid Phase Peptide Synthesis, 2nd ed., Pierce Chem. Co., Rockford, Ill.
(1984). Also, the compounds can be synthesized on an applied Biosystems model 431 a peptide synthesizer using fastmocTM chemistry involving hbtu [2-(lh-benzotriazol-1-yl)-1,1,3,3-tetramethyluronium hexafluorophosphate, as published by Rao, et al., Int. J.
Pecr. Prot.
Res.; Vol. 40; pp. 508-515; (1992), Peptides can be cleaved following standard protocols and purified by reverse phase Applicant Ref. No.: 0571 R-PCT.app chromatography using standard methods. The amino acid sequence of each peptide can be confirmed by automated edman degradation on an applied biosystems 477a protein sequencer/120a pth analyzer. Protease inhibitor proteins of greater length may be synthesized by condensation of the amino and carboxy termini of shorter fragments.
Methods of forming peptide bonds by activation. of a carboxy terminal end (e.g., by the use of the coupling reagent N,N'-dicycylohexylcarbodiimide)) is known to those of skill.
Purification of Proteins The protease inhibitor proteins of the present invention may be purified by standard techniques well known to those of skill in the art. Recombinantly produced protease inhibitor proteins can be directly expressed or expressed as a fusion protein. The recombinant protease inhibitor protein is purified by a combination of cell lysis (e.g., sonication, French press) and affinity chromatography. For fusion products, subsequent digestion of the fusion protein with an appropriate proteolytic enzyme releases the desired 1 S recombinant protease inhibitor protein.
The protease inhibitor proteins of this invention, recombinant or synthetic, may be purified to substantial purity by standard techniques well known in the art, including selective precipitation with such substances as ammonium sulfate, column chromatography, immunopurification methods, and others. See, for instance, R.
Scopes, Protein Purification: Principles and Practice, Springer-Verlag: New York (1982);
Deutscher, Guide to Protein Purification, Acadf:mic Press (1990). For example, antibodies may be raised to the protease inhibitor proteins as described herein.
Purificatie~om E. coli can be achieved following procedures described in U.S.
Patent No. 4,511,503. The protein may then be isolated from cells expressing the protease inhibitor protein and further purified by standard protein chemistry techniques as described herein. Detection of the expressed protein is achieved by methods known in the art and include, for example, radioimmunoassays, Western blotting techniques, protease inhibition assays, or immunoprecipitation.
Trans>=enic Plant Re ~neration Transformed plant cells which are derivE:d by any of the above transformation techniques can be cultured to regenerate a whole plant which possesses the transformed . ~.,~:r~'. ~-..,-,__.
. . ..' ~. s . ~ .. .....
Applicant Ref. No.: Oi7lR-PCT.app genotype and thus the desired protease inhibitor content and/or composition phenotype.
Such regeneration techniques often rely on manipulation of certain phytohormones in a tissue culture growth medium, typically relying on a biocide and/or herbicide marker which has been introduced together with the protease inhibitor polynucleotide.
Plants cells transformed with a plant expression vector can be regenerated, e.g., from single cells, callus tissue or leaf discs according to standard plant tissue culture techniques. It is well known in th.e art that various cells, tissues, and organs from almost any plant can b~e successfully cultured to regenerate an entire plant. Plant regeneration from cultured protoplasts is described in Evans et al., Protoplasts Isolation and Culture, Handbook ofPlant Cell Carlture, Macmillilan Publishing Company, New York, pp. 124-176 (1983); and Binding, Regeneration ofPlants, Plant Protoplasts, CRC', Press, Boca Raton, pp. 21-73 (1985).
The regeneration of plants containing the foreign gene introduced by Agrobacterium from leaf explants can be achieved as described by Horsch et al., Science, 227:1229-1231 (1985 Regeneration can also be obtained from plant callus, explants, organs, or parts thereof. Such regeneration techniques are described generally in Klee et al., Ann. Rev. of Plant Phys. 38: 467-486 (1987For maize cell culture and regeneration see generally, The Maize Handbook, Freeling and Walbot, Eds., Springer, New York (1994); Corn and Corn Improvement, 3'd edition, Sprague and Dudley E,ds., American Society of Agronomy, Madison, Wisconsin (1988).
One of skill will recognize that after the recombinant expression cassette is stably incorporate~.n transgenic plants and confirmed to be operable, it can be introduced into other plants by sexual crossing. Any of a number of standard breeding techniques can be used, depending upon the species to be crossed.
In vegetatively propagated crops, mature transgenic plants can be propagated by the taking of cuttings or by tissue culture techniques to produce multiple identical plants.
Selection of desirable transgenics is made and new varieties are obtained and propagated vegetatively for commercial use. In seed propagated crops, mature transgenic plants can be self crossed to produce a homozygous inbred plant. The inbred plant produces seed containing the newly introduced heterologous nucleic acid. These seeds can be grown to 43 ~ - _ ;, : , Applicant Ref. No.: 057IR-PCT.app ~ ~ . , ,.
.. . . . . .. . , . .
. ~ . ... . . . .:. .
produce plants that would produce the selected phenotype, (e.g., altered protease inhibitor content or composition).
Parts obtained from the regenerated plant, such as flowers, seeds, leaves, branches, fruit, and the like are included in the invention, provided that these parts comprise cells comprising the isolated nucleic acid of the present invention. Progeny and variants, and mutants of the regenerated plants are also included within the scope of the invention, provided that these parts comprise the introduced nucleic acid sequences.
Transgenic plants expressing the selectable marker can be screened for transmission of the protease inhibitor nucleic acrid of the present invention by, for example, standard immunoblot and DNA detection techniques. Transgenic lines are also typically evaluated on levels of expression of the heterologous nucleic acid.
Expression at the RNA level can be determined initially to identify and quantitate expression-positive plants. Standard techniques for RNA analysis can be employed and include PCR
amplification assays using oligonucleotide primers designed to amplify only the heterologous RNA templates and solution hybridization assays using heterologous nucleic acid-specific probes. The RNA-positive plants can then analyzed for protein expression by Western immunoblot analysis using the protease inhibitor specific antibodies of the present invention. In addition, in situ hybridization and immunocytochemistry according to standard protocols can be done using heterologous nucleic acid specific polynucleotide probes and antibodies, respectively, to localize sites of expression within transgenic tissue. Generally, a number of transgenic lines are usually screened for the incorporated nucleic aci~"o identify and select plants with the most appropriate expression profiles.
A preferred embodiment is a transgenic plant that is homozygous for the added heterologous nucleic acid; i.e., a transgenic plant that contains two added nucleic acid sequences, one gene at the same locus on each chromosome of a chromosome pair.
A
homozygous transgenic plant can be obtained by sexually mating (selfing) a heterozygous transgenic plant that contains a single added heterologous nucleic acid, germinating some of the seed produced and analyzing the resulting plants produced for altered activity relative to a control plant (i.e., native, non-transgenic). Back-crossing to a parental plant and out-crossing with a non- transgenic plant are also contemplated.
~,rn'r~aF~ s~;~~r 75529-49(S) Protein structure and amino acid substitution It can be difficult to predict the ultimate effect of substitution on the tertiary structure and folding of the protein. Both tertiary structure and folding are critical to the stability and adequate expression of the protein in vivo. It is critical to undertake analysis and functional modeling of the wild type compound to determine whether substitutions can be made without disrupting biological activity.
The biological activity of a protein is dictated by its three dimensional structure which is intrinsically related to the folding of the protein. The folding of a protein into its functional domains is a direct consequence of the primary amino acid sequence.
While it is true that many proteins tolerate amino acid changes without affecting the folding or function of the protein, there is no a rp iori method of predicting which amino acid may be substituted or deleted without affecting the folding pathway. Each protein is unique and the folding process is necessarily an experimental determination. As has been concluded by Zabin et al., ("Approaches to Predicting Effects of Single Amino Acid Substitutions on the Function of a Protein"; Biochemistry; Vol. 30; pp. 6230-6240; 1991), neither the frequency of exchange of amino acids between homologous proteins nor any other measure of the properties of the amino acids are particularly useful by themselves in predicting whether a protein with an amino acid substitution will be functional. The scientific literature is replete with examples where seemingly conservative substitutions have resulted in major perturbations of structure and activity and vice versa, see e.g.;
Summers, et al., "A Conservative Amino Acid Substitution, Arginine for Lysine, Abolishes Export of a Hybrid Protein in E. Coli." J. Biol. Chem., Vol. 264, pp. 20082-20088, (1989); Ringe, D., "The Sheep in Wolfs Clothing" Nature, Vol. 339, pp.
6S8-659, (1989); Hirabayashi et al., "Effect of Amino Acid Substitution by Site-directed Mutagenesis on the Carbohydrate Recognition and Stability of Human 14-kDa (3-galactoside-binding Lectin," J. Biol. Chem., Vol. 266, pp. 23648-23653, (1991); and van Eijsden, et al., "Mutational Analysis of Pea Lectin: Substitution of Asn125 for Asp in the Monosachharide-binding Site Eliminates Mannose/Glucose -binding Activity,"
Plant Mol. Biol., Vol. 20, pp. 1049-1058 (1992).
The 3D structure of many proteins, including enzymes and protein inhibitors such as the barley chymotrypsin inhibitor has been solved. The three dimensional structure of a 75529-49(S) truncated fragment of CI-2 (with 65 residues) that is missing the N-terminal I8 residues has been determined by x-ray crystallography as well as by NMR spectroscopy (McPhalen, et al., Biochemistry; Vol. 26; pp. 261-269; (1987); and Clore, et al., Protein Ene.; Vol. 1, pp. 3I3-318; (1987)). In the wild type CI-2 the first 18 residues do not assume any ordered conformation and also do not contribute to the structural integrity of the molecule (see e.g. Kjaer, et al., Carlsberg Res. Commun.; Vol. 53; pp. 327-354;
(1987?. This polypeptide is found in the endosperm of grain and is isolated as an 83 residue protein with no disulfide bridges. See e.g. Jonassen, L, Carlsbere Res. Commun.; Vol. 45; pp. 47-48; (1980); and Svendsen, L, et al., Carlsberg Res. Commun.; Vol. 45; pp. 79-85; (1980). The 3D structure of CI-2 has been determined. See McPhalen, et al., 1987.
CI-2 is predominantly a (3-sheet protein, devoid of disulfide bonds and containing a wide loop of approximately 18 residues (residue 53-70 in the CI-2 molecule) in the extended conformation. This is the reactive site loop that contains a methionine I5 residue at position 59 which confers the property of chymotrypsin inhibition. A
constrained peptide containing these residues has been synthesized and shown to retain full chymotrypsin inhibitory activity. See Leatherbarrow, et al., Biochem., Vol. 30, pp.
10717-10721 (1991). In the absence of any disulfide bonds, the integrity of the reactive site loop is maintained by strong hydrogen bond interactions between G1u60 -~
Arg65 and Thr58 -~ Arg67. Mutants of CI-2 in which Thr58 and G1u60 have been replaced with Ala are not only less stable proteins but also have little or no protease inhibitory activity.
See Jackson, et al., Biochem., Vol. 33, pp. 13880-13887 (1994); and Jandu, et al., Biochem., Vol. 33, pp. 6264-6269 (1990). These studies have demonstrated that the reactive site loop is a key structural feature essential for the function of protease inhibition.
Molecular Markers The present invention provides a method of genotyping a plant comprising a protease inhibitor polynucleotide. Preferably, the plant is a monocot, such as maize or sorghum. Genotyping provides a means of distinguishing homologs of a chromosome pair and can be used to differentiate segregants in a plant population.
r Appli.;ant Ref. No.: 05718-PCT.app . . . , " , .. ., . . . . , . , .
. . . ,.. . , . , . . . . , . ,. ., , , , Molecular marker methods can be used for phylogenetic studies, characterizing genetic relationships among crop varieties, identifying crosses or somatic hybrids, localizing chromosomal segments affecting monogenic traits, map based cloning, and the study of quantitative inheritance. See, e.g., Plant Molecular Biology: A Laboratory Manual, Chapter 7, Clark, Ed., Springer-Verlag, Berlin (1.997). For molecular marker methods, see generally, The DNA Revolution by Andrew H. Paterson 1996 (Chapter 2) in:
Genome Mapping in Plants (ed. Andrew H. Paterson) by Academic Press/R. G. Landis Company, Austin, Texas, pp.7-21.
Detection of Protease Inhibitor Nucleic Acids The present invention further provides methods for detecting protease inhibitor polynucleotides of the present invention in a nucleic acid sample suspected of comprising a protease inhibitor polynucleotide, such as a plant cell lysate, particularly a lysate of corn. In some embodiments, a proteasE; inhibitor gene or portion thereof can be amplified prior to the step of contacting the nucleic acid sample with a protease inhibitor polynucleotide. The nucleic acid sample is contacted with the protease inhibitor polynucleotide to form a hybridization complex. The protease inhibitor polynucleotide hybridizes under stringent conditions to a gene encoding a protease inhibitor polypeptide.
Formation of the hybridization complex is used to detect a gene encoding a protease inhibitor polypeptide in the nucleic acid sample. Those of skill will appreciate that an isolated nucleic acid comprising a protease inhil:>itor polynucleotide should lack cross-hybridizing sequences with non-protease inhibitor genes that would yield a false positive result.
Detection of the hybridization complex can be achieved using any number of well known methods. For example, the nucleic acid sample, or a portion thereof, may be assayed by hybridization formats including but not limited to, solution phase, solid phase, mixed phase, or in situ hybridization assays.
Protease Inhibitor Protein Immunoassays 47 . , .~,~-., t~pplicant Ref. No.: 0571 R-PCT.app .. , , .. . , . . . ... . . . ..
. . . . . .
.. .. ... .... ,.
Means of detecting the protease inhibitor proteins of the present invention are not critical aspects of the present invention. In a preferred embodiment, the protease inhibitor proteins are detected and/or quantified using any of a number of well recognized immunological binding assays (see, e.g., U.S. Patents 4,366,241; 4,376,110;
4,517,288;
and 4,837,168). For a review of the general immunoassays, see also Methods in Cell Biology, Vol. 37: Antibodies in Cell Biology, Asai, Ed., Academic Press, Inc.
New York (1993); Basic and Clinical Immunology 7th Edition, Stites & Ten, Eds. (1991).
D. Other Assay Formats In a particularly preferred embodiment, Western blot (immunoblot) analysis is used to detect and quantify the presence of protease inhibitor protein in the sample. The technique generally comprises separating sample proteins by gel electrophoresis on the basis of molecular weight, transferring the separated proteins to a suitable solid support, (such as a nitrocellulose i:llter, a nylon filter, or derivatized nylon I 5 filter), and incubating the sample with the antibodies that specifically bind protease inhibitor protein. The anti-protease inhibitor protein antibodies specifically bind to protease inhibitor protein on the solid support. 'These antibodies may be directly labeled or alternatively may be subsequently detected using labeled antibodies (e.g., labeled sheep anti-mouse antibodies) that specifically bind to ~,he anti-protease inhibitor protein.
E. Quantifrcation of Protease inhibitor Proteins.
Protease inhibitor proteins may b~e detected and quantified by any of a number of~ans well known to those of skill in the art. These include analytic biochemical methods such as electrophoresis, capillary electrophoresis, high performance liquid chromatography (HPLC), thin layer chromatography (TLC), hyperdiffusion chromatography, and the like, and various immunological methods such as fluid or gel precipitin reactions, immunodiffusion (single or double), immunoelectrophoresis, radioimmunoassays (RIAs), enzyme-linked imnlunosorbent assays (ELISAs), immunofluorescent assays, and the like.
48 , 7559-49 (S) Example 1: Isolation of DNA Coding for Protease inhibitor Protein from Zea mays or other plant library The polynucleotides having DNA sequences given in SEQ ID Nos: 15, 17, 19, 21, and 23 were obtained from the sequencing of cDNA clones prepared from maize.
SEQ ID NO 15 is a contig comprised of 28 cDNA clones. 20 of the cDNA clones were from libraries prepared from leaves treated with jasmonic acid. One was from a root library. Four were from libraries prepared from corn rootworm-infested roots.
One was from a tassel library. One was from a library prepared from seedlings recovering from heat shock. One was from a shoot culture library.
SEQ ID NO 17 is a contig comprised of two cDNA clones. One was from a jasmonic acid treated leaf library. The other was from an induced resistance leaf library.
SEQ ID NO 19 is a contig comprised of two cDNA clones. One was from a germinating maize seedling library. The other was from jasmonic acid treated leaf library.
SEQ ID NO 21 is a contig comprised of 4 cDNA clones. All four were from libraries prepared from jasmonic acid treated leaves.
SEQ ID NO 23 is a contig comprised of two cDNA clones. One was from a library prepared from silks, 24 hours post pollination. The other was from a library prepared from root tips less than 5 mm in length.
One skilled in the art could apply these same methods to other plant nucleotide containing libraries.
Example 2: Engineering BHL for nutritional enhancement Wild type CI-2 (from barley) contains 49.4% essential amino acids (41/83) and 9.6% lysine (8/83). Using the strategies outlined below, six different BHL
variants with increasing amounts of lysine have been proposed. The lysine percentages are 21.5%, 24.1%, 23.1%,and 25.3%, for BHL-1, BHL-1N, BHL-2, BHL-2N, BHL-3, and BHL-3N, respectively. Construct BHL-1N contains the same eight substitutions as BHL-1, plus lysine substitutions in the 18 additional amino acid residues in the amino terminal region. BHL-2 is the same as BHL-1 but with changes of amino acid residues 40 and 42 Applicant Ref. No.: 057IR-PCT.app . . ~~ .. . ., . . .. . . . . .
. . . .~.
. . . .
.. .. .., .~,. .. .
to Ala and amino acid residue 47 to lysine. Construct BHL-2N contains the same substitutions as BHL-2, plus four lysine substitutions in the 18 additional amino acid residues in the amino terminal region. BHL-3 is the same as BHL-2 except that residues 40 and 42 are changed to Gly and His, respectively. Construct BHL-3N contains the same 11 substitutions as BHL-3, plus the four lysine substitutions in 18 additional amino acid residues in the amino terminal region. Onc: skilled in the art will realize that essential and non-wild-type amino acid residue substitutions will be tolerated at both the same positions substituted with lysine, and at other positions.
The active site loop region encompasses an extended loop region from about amino acid residue 53 to about amino acid residue 70. Destabilization of the reactive loop was achieved by substituting the non-wild type amino acids residues at about positions 53 to about 70. Amino acid residues were changed by primer mutagenesis.
Preferably, the following mutations are made: Arg62 ~ Lys62, Arg65 -~ Lys65, Arg67 -~ Lys67, Thr58 -~ A1a58 or G1y58, Met59 --~ Lys59, and GIuE~O ~ A1a60 or His60. However, it will be readily apparent to one skilled in the art that functionally equivalent substitutions to those described above will also be effective in the present invention.
In a preferred embodiment of the prese;nt invention, the present protein has both elevated essential amino acid content and reduced protease inhibitor activity.
Modification in the area by amino acid substitution or other means, destroys the hydrogen bonding and changes or reduces the protease inhibitor activity of BHL.
Substitution of amino acid residues threonine, at position 58, and glutamic acid, at position 60, with glycine and histidine, respecaively, resulted in a protein with lowered protease iitor activity. Residue 59 is a critical residue in modifying protease inhibitor activity and changing specificity. When this residue was changed to a lysine, the protease inhibition specificity was changed from a chymotrypin inhibitor to a trypsin inhibitor.
The present invention provides for the creation of a nutritionally enhanced feed from WT CI-2 through at least one lysine substitution of residues 1,18,11,17,19,34,41,56,59,62,67 and 73 (long versions BHL-1N, 2N, 3N) plus residue 67 in BH2-2N and BH2-3N. Lysine substitutions in BHL-1,2 and 3 are at amino acid residues 1,16,23,41,44,49 and 55, plus residue 47 in BHL-2 and BHL-3.
Example 3- Construction of Expression Cassettes 50 t~,%~ :rJV!~ SNP
h Applicant Rcf. No.: 0571 R-PCT.app ., ..
,.
. . , , . , . .
..
. ~ , . , . . . ,.
. . . .
..
. . ., .. ... .. . ..
Vector construction was based upon the; published WT CI-2A sequence information Williamson et al, Eur. J. Biochem 165: 99-106 (1987) and SEQ ID NO
13.
Methods for obtaining full length or truncated wild-type CI-2 DNA include, but are not limited to PCR amplification, from a barley (or other plant ) endosperm cDNA
library using oligonucleotides derived from Seq. ID no 13 or from the published sequence supra, using probes derived from the same on a barley (or other plant ) endosperm cDNA
library, or using a set of overlapping oligonuclc:otides that encompass the gene.
The BHL-1 insert corresponds to SEQ ID NO '.l, plus start and stop codons.
Oligonucleotide pairs, N4394/N4395, and N4396/N4397, were annealed and ligated together to make a 202 base pair double strandf:d DNA molecule with overhangs compatible with Rca I and Nhe I restriction sites. PCR was performed on the annealed molecule using primers N5045 and N5046 to add a 5' Spe I site and 3' Hind III
site. The PCR product was then restriction digested at those sites and ligated into pBluescript II
KS+ at Spe I and Hind III sites. The insert was then removed by restriction digestion with Rca I and Hind III and was ligated into the Nco I and Hind III sites of pET28a (Novagen) to form the BHL-1 construct.
Oligonucleotide and primer sequences (5' to 3'):
TCGGTGGAGA
AATCATAGTT
CAGGATCACC
S 1 TTCTTGGCTT TCTCCACCGA TTTC:CCCACC AACTCCGGCC
ACTCTGTCTT
51 _ r~ ~ .err.
..;';L':; ~:i_cj Applicant Ref. No.: 05718-PCT.app . ,. . ~ .. , .. ., . , , . ~ , . , ..
. . . . . .., . ~ . . .
. . . , . .
. .. ., . >.
CGTCAAGCTC
51 TTTGTGGATA AAAAGGACAA CA7'CGCGCAG GTCCCCAGGG TCGG
TCCACAAAGA
51 GCTTGACGCG GTCGATCTTA TAT"TCCTTCG TCACCTTTGT AC
b. BHL-2: The BHL-2 construct insert corresponds to SEQ ID NO 3, plus start and stop codons. An overlap PCR strategy was used to make the BHL-2 construct.
PWO
polymerase from Boehringer-Mannheim was used for all PCR reactions.The primers were chosen to change 3 amino acids in the BHL-1 active site loop region, and to create unique AgeI and Hind III restriction sites flanking the active site loop, to facilitate loop replacement iri future constructs. A unique Rca I site (compatible with Nco I) was included at the S' end, and a unique Xho I site was included at the 3' end.
The overlap PCR was done as follows: PCR was done with primers N13561 and N13564, using the BHL-1 construct as template. A separate PCR was done with primers N13563 and N13562, again using the BHL-1 construct as template. The products from both reactions were gel purified and combined. Primer N13565, which overlapped regions on both of the PCR products, was then added and another l?CR was done to generate the full-length insert. The resulting product was amplified by ~~nother PCR with primers N13561 and N13562. It was subsequently suspected that a deletion was present in N13562 that caused a frameshift near the 3' end of the PCR product. To avoid this frameshift problem, a final .. _. .~- , , -j~ ~ .
Applicant Ref. No.: 05718-PCT.app ., ,. . ., ,. .
. . . ,. .
. . , ~ . . , . .., .
. . . . . . . . .
.. ,. m. ",. ~. .
PCR reaction was done with primers N13562 and N13905. The final PCR product was digested with Rca I and Xho I, and then ligated into the Nco I and Xho I sites of pET 28b.
Note: Some primers had 6-oligonucleotide extensions to improve restriction digestion efficiency.
Primer sequences (5' TO 3'):
N13562 (as ordered) 1 ATCGACAAGGTCAAGCTTTTTC~TGGATAAAAAGGA
1 GTTGGTACAAAGGTGGCGAAG~GCCTATAAGATCGACAAGGTCAAG
c. BHL-3: The BHL-3 construct insert corresponds to SEQ ID NO 5, plus start and stop codons. The BHL-2 construct was digested with Age I and Hind III, and the region between these sites was removed by gel purification. Oligonucleotide pairs, N14471 and N 14472, were annealed to make a double stranded DNA molecule with overhangs compatible with Age I and Hind III restriction sites. The annealed product was ligated into the Age I and Hind III sites of the digested BHL-2 construct to yield the construct.
Oligonucleotide Primer sequences (5' to 3'):
d. BHL-I N, BHL-2N, and BHL-3N
The BHL-1N, BHL-2N, and BHL-3N construct inserts correspond to SEQ ID No 9, SEQ
ID NO 11, and SEQ ID NO 7, respectively, plus start and stop codons. Three separate PCR reactions were done with either the BHL-1, BHL-2, or BHL-3 constructs as template. The primers for these reactions were N13771 and N13905. The resulting PCR
products were digested with Rca I and Xho I and ligated into the Nco I and Xho I sites of pET 28b to yield the BHL-1N, BHL-2N, and BI~IL-3N constructs.
Primer sequences (S' to 3'):
A~,~ENDED SHEET
75529-49(S) TTTTTTTCATGAAGTCGGTGGAGAAGAAACCGAAGGGTGTGAAGACAGG
N13905 (already provided in BHL-2 description) BHL-1N is an 83 residue polypeptide in which residues 1,8,11, and 17 were also replaced with lysine. The resulting compound has the protein sequence indicated in Sequence LD. No.lO.
BHL-2N is an 83 residue polypeptide in which residues 1,8,11, and 17 were also replaced with lysine. The resulting compound has the protein sequence indicated in Sequence LD. No.l2.
BHL-3N is an 83 residue polypeptide in which residues 1,8,11, and 17 were also replaced with lysine. The resulting compound has the protein sequence indicated in Sequence LD. No.B.
Example 3 - Expression of BHL-1 in E. colt E.rpression in E. colt BHL-I, BHL-2, BHL-3, BHL-3N, and the truncated wild-type CI-2 (residues 19 through 6~ of SEQ ID NO. 14) were expressed in E colt using materials and methods from Novagen, Inc. The Novagen expression vector pET-28 was used (pET-28a for WT CI-and BHL-1, and pET-28b for the other proteins). Ecoli strains BL21 (DE-3) or BL21 (DE-3)pLysS were used. Cultures were typically grown until an OD at 600 nm of 0.8 to 1.0, and then induced with 1 mM IPTG and grown another 2.5 to 5 hours before harvesting.
Induction at an OD as low as 0.4 was also done successfully. Growth temperatures of 37 degrees centigrade and 30 degrees centigrade were both used successfully. The media used was 2xYT plus the appropriate antibiotic at the concentration recommended in the Novagen manual.
Purification a. WT CI-2 (truncated)-- Lysis buffer was 50 mM Tris-HCI, pH 8.0, 1 mM EDTA, 150 mM NaCI. The protein was precipitated with 70% ammonium sulfate. The pallet was dissolved and dialyzed against 50 mM Tris-HCI, pH 8.6. The protein was ~.oa.d~d onto a Hi-Trap Q column, and the unbound fraction was collected and precipitated i~~ 70%
ammonium sulfate. The pellet was dissolved in 50 mM sodium phosphate, pH 7.0, Trade-mark 54 75529-49(S) mM NaCI, and fractionated on a Superdex-75 26/60 gel filtration column.
Fractions wire pooled and concentrated.
b. BHL-I--Lysis buffer was 50 mM sodium phosphate, pH 7.0, 1 mM EDTA.
The protein was loaded onto an SP Sepharose FF 16/10 column, washed with 150 mM
NaCI in 50 mIVI sodium phosphate, pH 7.0, and then eluted with an NaCI
gradient in SO
mM sodium phosphate. BHL-1 eluted at approximately 200 mM NaCI. Fractions were pooled and concentrated.
c. BHL-2, BHL-3, and BHL-3N--Lysis buffer was 50 mM Hepes, pH 8.0, 2mM
EDTA, 0.1% Triton X-100, and 0.5 mg/m1 Iysozyme. The protein was loaded onto an SP-Sepharose cation exchange column (typically a 5 to 10 ml size), washed with 150 mM
NaCI in 50 mM sodium phosphate, pH 7.0, and eluted with 500 mM NaCI in 50 mM
sodium phosphate, pH 7Ø The protein was concentrated and then subjected to Superdex-75 gel filtration chromatography twice.
d. BHL-1--Lysis buffer was SO mM sodium phosphate, pH 7.0, 1 mNl EDTA.
The protein was loaded onto an SP Sepharose FF 16/10 column, washed with 150 mM
NaC1 in 50 mM sodium phosphate, pH 7.0, and then eluted with an NaCI gradient in 50 mM sodium phosphate. BHL-I eluted at approximately 200 mNI NaCI. Fractions were pooled and concentrated.
e. BHL-2, BHL-3, and BHL-3N--Lysis buffer was 50 mM Hepes, pH 8.0, 2miVI
EDTA, 0.1% Triton X-100, and 0.5 mg/ml lysozyme. The protein was loaded onto an SP-Sepharose cation exchange column (typically a 5 to 10 ml size), washed with 150 mM
NaCI in 50 mM sodium phosphate, pH 7.0, and eluted with 500 mM NaCI in 50 mM
sodium phosphate, pH 7Ø The protein was concentrated and then subjected to Superdex-75 gel filtration chromatography twice.
4. Storage The purified proteins were stored long term by freezing in liquid nitrogen and keeping frozen at -70 degrees centigrade.
5. Verification of recombinant protein identity.
a. DNA sequencing--The insert region of these pET 28 constructs was co~rmed by DNA sequencing.
b. N-terminal protein sequencing --Trade-mark 55 Appl'~.cant Ref. No.: 0571 R-PCT.app ..
.. . . .
~ ,j~
. . . . ..~ . ~ .
~ . ,. ..
. . .. ..~ ... ,~.. ,. ..
100 pg of purified BHL-3 were digested with 1 pg of chymotrypsin (Sigma catalog # C-4129) for 30 min at 37 degrees centigrade in 50 mM sodium phosphate, pH 7Ø
The resulting chymotryptic fragments were purified by reversed phase chromatography, using an acetonitrile gradient for elution. Three pure peaks were observed and were sent to the University of Michigan Medical School Protein. Structure Facility for N-terminal sequencing (6 cycles). Peak 1 had an N-terminal sequence of val-asp-lys-lys-asp-asn.
Peak 2 had an N-terminal sequence of lys-ile-as.p-lys-val-lys. Peak 3 had an N-terminal sequence of met-lys-leu-lys-thr-glu. These results demonstrate that chymotrypsin cleaved BHL-3 after tyr-61 and phe-69. The N-terminal sequences all match exactly the expected sequence, assuming that the start methionine was largely retained in the recombinant protein. This experiment verifies that the protein we expressed in and purified from E. coli was BHL-3. Furthermore, SDS-PAGE analysis with 16.5%
Tris-Tricine precast gels from Biorad showed a similar mobility of BHL-1 and BHL-2 with the confirmed BHL-3 protein, as would be expected because BHL-1 and BHL-2 have molecular masses very similar to that of BHL-3.
160 ~g of BHL-3N were digested with 1.6 ~g pepsin overnight, and the resulting peptic fragments were purified by reversed phase chromatography. Five of the resulting peaks were sent to the Iowa State University Protein Facility for N-terminal sequencing through four cycles. The N-terminal sequences of the 5 peaks were: val-gly-lys-ser, phe-val-asp-lys, pro-val-gly-thr, met-lys-ser-val, and ile-ile-val-leu, all of which exactly match the expected BHL-3N sequence, assuming that 'the start methionine was largely retained in this recombinant protein. This experiment verifies that the protein we expressed in and purified fre~E. coli was BHL-3N.
c. Protease inhibition--The obvious protease inhibitory activity observf:d for BHL-1 and for the wild-type protein are further evidence that we have purified the a};pected proteins from E coli.
The details of these protease inhibition experiments are described next.
r ~.,._~ r;; .Tn c~~c-;~
75529-49 (S) The following experiments utilized truncated wild type CI-2 as represented as nt. SS-249 in Seq. ID NO. 13 with addition of start and stop codons.
E~camgle S - Protease Inhibition assays and Proteolitic Digests S a. Chymotrypsin Protease activity was measured by an increase in absorbance at 40S nm.
Sigma Chymotrypsin type iI (Bovine pancreas) Cat. # C-4129.
Substrate - Sigma cat. # .~-7388. N-Succinyl-Ala-Ala-Pro-phe-p vitro anilide or BHL protein used, l nM chymotrypsin, 1mM substrate, 200 ul vol~ne luM BSA included in control (no CI-2, no BHL).
Preincubated 30 min 37 C., then added substrate to start and kept at 37 C.
Buffer 0.2M tris - HCl pH 8.0 Read Abs 40S nm - 30 min Protease Activity - % of Control ABS. 40S nm Abs. At 405 nm Rep. 1 Rep. 2 Mean (S.D.) Using % control data Controll-value 0.350 0.299 control 100.0 100.0 100.0 WT CI-2-value .042 .018 control 12.0 6.0 9.0 (4.2) BHL-1-value .289 .274 control 82.6 91.6 87.1 (6.4) BHL-2-value .309 .318 control 88.3 106.4 97.4 (12.8) BHL-3-value .346 .31 S
control 98.9 lOS.4 102.2 (4.6) BHL-3N-value .318 .31 S
control 90.9 lOS.4 98.2 (10.3) 75529-49 (S) b. Subtilisin Subtilisin Carlsberg lichenif'ormis (Sigma cat.
from Bacillus # P-5380) Substrate and buffer same as for chymotrypsin exper. 200 ul reaction volume 1 uM CI2 or BHL
1nM subtilisin 1mM Substrate room temp (25 C) 30 min. preincubatedthen added substrate and read absorbance at 40~nm 30 min. data used luM BSA used or BHL) in control (no Abs. At 405 nm Rep. 1 Rep. 2 Mean (S.D.) Using % ontrol c data Controll-value 2.171 1.834 control 100.0 100.0 100.0 WT CI-2-value .014 .002 control 0.6 0 0.3 (0.4) BHL-1-value .286 .295 control 13.2 16.1 14.7 {2.1) BHL-2-value 1.692 1.569 control 77.9 85.6 81.8 (5.4) BHL-3-value 7.056 1.960 control 94.7 106.9 100.8 (8.6) BHL-3N-value 2.103 1.729 control 96.9 94.3 95.6 (1.8) 75529-49(S) c. Trypsin Bovine pancreas trypsin (Sigma cat #T-8919) Substrate S-2222 (chromogenix): N-benzoyl-2-isolenuel-Lglutamyl-glycyl-L-arginine-p-nitroaniline buffer: SOmMTris pH 7.5, 2mM NaCI, 2mM CaCl2, 0.005 % TritonX-100.
30 min. preincubation 25°, then added substrate and kept at 25°;
these are 30 minute values.
1 mM substrate, SuM CI-2 or BHL, O.SnM trypsin, no BSA in control. 200 ul reaction volume .
Abs. At 405nm Rep. 1 Rep. 2 Rep. 3 Rep. 4 Mean (S.D.) Using Control Data Control l- .505 .533 .473 .391 value control 100.0 100.0 100.0 100.0 100.0 WT CI-2- .561 .533 .474 .420 value control 111.1 100.0 100.2 107.4 104.7 (5.5) BHL-1-value .072 .096 .041 .057 control 14.3 18.0 8.7 14.6 13.9 (3.9) BHL-2-value .436 .481 .404 .405 control 86.3 90.2 85.4 103.5 91.4 (8.4) BHL-3-value .536 .557 .456 .430 control 106.1 104.5 96.4 110.0 104.3 (5.7) BHL-3N- .542 .583 .490 .437 value control 107.3 109.4 103.6 111.8 108.0 (3.5) Trade-mark Applicant Ref. No.: 05718-PCT.app ,. , . . .
.. .. . . . .. . . , , , . . , . . . , . , .
. , . . , , ,. ., .,. .". ,. ..
d. Elastase Porcine elastase IV (Sigma) Type Cat#
Substrate: Sigma S-4760 N-succinyl-ala-ala-ala-p-nitroanile buffer: 0.2M Tris HC1 pH 8.0 200 ul reactive volume 50nM elastase, 2 uM CI-2 or BHL;
1mM substrate luM BSA in control min. preincub, , then ubstrate. Kept at 25; 30 added min. data s Abs. At 405 nm Rep. 1 Rep. 2 Mean (sp) Using % control data Control 1-value 1.416 1.461 control 100.0 100.0 100.0 WT CI-2-value .030 .049 control 2.1 3.4 2.8 (0.9) BHL-1-value 1.519 1.459 control 107.3 99.9 103.6 (5.2) BHL-2-value 1.558 1.509 control 110.0 103.3 106.7 (4.7) BHL-3-value 1.587 1.493 control 112.1 102.2 107.2 (7.0) BHL-3N-value 1.527 1.481 control 107.8 101.4 104.6 (4.5) r !
75529-49(S) protease inhibition summary - % of control Protein Chymotrypsin Trypsin Elastase Subtilisin WT CI-2 9.0 104.7 2.8 0.3 BHL-1 87.1 13.9 103.6 14.7 BHL-2 97.4 91.4 106.7 81.8 BHL-3 102.2 104.3 107.2 100.8 BHL-3N 9$.2 108.0 104.6 95.6 These experiments show that BHL-2, BHL-3 and BHL-3N have reduced protease inhibition activity compared to WT CI-2 .
Digestion by trypsin The purified proteins were incubated at 37 degrees centigrade with a 100:1 (wt:wt) ratio of BHL protein or wild-type CI-2 : trypsin for l5min, 30 min, 1 hr, 2 hr, or 4 hr.
Incubation buffer was 50 mM sodium phosphate, pH 7Ø Bovine pancreas trypsin was used (Sigma catalog # T-8918). Digestion was assessed by SDS-PAGE with 16.5%
Tris-Tricine precast gels from Biorad. The BHL-2, BHL-3, and BHL-3N proteins were digested by trypsin in 15 minutes. In contrast, the BHL-1 and wild-type truncated CI-2 proteins were resistant to trypsin. This experiment confirmed that the BHL-2, BHL-3, and BHL-3N proteins are not effective inhibitors of trypsin.
Digestion by chymotrypsin.
The purified proteins were incubated at 37 degrees centigrade with a 100:1 (wt:wt) ratio of BHL protein or wild-type CI-2 : chymotrypsin for 15min, 30 min, 1 hr, 2 hr, ox 4 hr. Incubation buffer was 50 mM sodium phosphate, pH 7Ø Bovine pancreas chymotrypsin type II (Sigma catalog # S-7388 was used. Digestion was assessed by SDS-PAGE with 16.5% precast Tris-Tricine gels from Biorad. BHL-2, BHL-3, and BHL-3N proteins were digested by chymotrypsin in 15 minutes. In contrast, BHL-1 and wild-type CI-2 proteins were resistant to chymotrypsin. This experiment confirrraed that BHL-2, BHL-3, and BHL-3N are not effective inhibitors of chymotrypsin.
Digestion in simulated gastric fluid.
Applicant Ref. No.: 05718-PC'T.app .. , . .. . ,.
.. , . . .. . , , .
. . . ~ . ... . . . . ,., , . . . , .
. ,. ., ... ..,. .. ., Simulated gastric fluid was prepared by dissolving 20 mg NaCI and 32 mg of pepsin in 70 pl of HCl plus enough water to make 10 ml. Porcine stomach pepsin (Sigma cat # P-6887) was used. SO ~l of 1 mg/ml BHL-3N or wild-type CI-2 protein were incubated with 250 pl simulated gastric fluid at :37 degrees centigrade. At 1 S sec, 30 sec, 1 min, 5 min, and 30 min, 40 ~1 aliquots were removed to a stop solution consisting of 40 ~l 2X Tris-Tricine SDS sample buffer (Biorad) that also contained 3 pl of 1 M
Tris-HC1, pH 8.0 and 0.1 mg/ml pepstatin A (Boehringer-lvlannheim cat # 60010).
Digestion was assessed by 16.5% Tris-Tricine SDS-PAGE (pre:cast gels from Biorad).
Both BHL-3N and wild-type CI-2 were digested in simulated gastric fluid in 15 seconds. This experiment suggests that our engineered proteins and even the wild-type protein would likely be digested into proteolytic; fragments in the stomach of humans or monogastric animals.
Digestion in simulated intestinal fluid.
Simulated intestinal fluid was prepared by dissolving 68 mg of monobasic potassium phosphate in 2.5 ml of water, adding 1.9 ml of 0.2 N sodium hydroxide and 4 ml of water. Then 2.0 g porcine pancreatin (Sigma catalog # P-7545) was added and the resulting solution was adjusted with 0.2N sodium hydroxide to a pH of 7.5.
Water was added to make a final volume of 10 ml.
50 ~g of BHL-3N or wild-type CI-2 protein in 50 pl were incubated with 250 ~1 simulated-~estinal fluid at 37 degrees centigrade . At 15 sec, 30 sec, 1 min, 5 min, and min, 40 pl aliquots were removed and added to 40 pl of a stop solution consisting of 25 2X Tris-Tricine SDS sample buffer (Biorad) containing 2 mM EDTA and 2mM
phenylmethylsulfonyl fluoride (Sigma catalog 3# P-7626). Digestion was assessed by 16.5 Tris-Tricine SDS-PAGE (precast gels form l3iorad).
BHL-3N was digested by simulated, intcatinal fluid in 15 seconds. In contrast, 30 wild-type CI-2 was resistant to digestion for 30' minutes. This experiment shows that in the intestine of humans or monogastric animal~~, our engineered protein would likely be more digestible than the wild-type protein would be. These results are consistent with the Appl~,cant Ref. No.: 0~71R-PCT.app . ~. . . . ,.
. . . , . . . , ..~ . . . ", ,.
. . , . .
. . " , .., .,.. ,.
protease inhibition assays showing that BHL-3N was not an effective protease inhibitor.
The inventive protein was digested in less than i:me minutes, less than one and less than 30 seconds.
Digestion in simulated gastric fluid Simulated gastric fluid was prepared by dissolving 20 mg NaCI and 32 mg of pepsin in 70 ~l of HCl plus enough water to make 10 ml. Porcine stomach pepsin (Sigma cat # P-6887) was used. 50 ~l of 1 mg/ml BHL-3N or wild-type CI-2 were incubated with 250 ~l simulated gastric fluid at 37 degrees centigrade. At 15 sec, 30 sec, 1 min, 5 min, and 30 min, 40 ~1 aliquots were removed to a stop solution consisting of 40 ul 2X
Tris-Tricine SDS sample buffer (Biorad) that al.>o contained 3 pl of 1 M Tris-HCI, pH 8.0 and 0.1 mg/ml pepstatin A (Boehringer-Mannheim cat # 60010). Digestion was assessed by 16.5% Tris-Tricine SDS-PAGE (precast gels. from BioradTM).
Both BHL-3N and wild-type CI-2 were digested in simulated gastric fluid in 15 seconds. This experiment suggests that our engineered proteins and even the wild-type protein would likely be digested into proteolytic: fragments in the stomach of humans or monogastric animals.
Digestion in simulated intestinal fluid.
Simulated intestinal fluid was prepared by dissolving 68 mg of monobasic potassium phosphate in 2.5 ml of water, adding 1.9 ml of 0.2 N sodium hydroxide and 4 ml of water. Then 2.0 g porcine pancreatin (Sig:ma catalog # P-7545) was added and the resulting sel.~ion was adjusted with 0.2N sodium hydroxide to a pH of 7.5.
Water was added to make a final volume of 10 ml.
50 ~1 of lmg/ml BHL-3N or wild-type CI-2 were incubated with 250 ~1 simulated intestinal fluid at 37 degrees centigrade . At 15 sec, 30 sec, 1 min, 5 min, and 30 min, 40 ~l aliquots were removed and added to 40 ~1 of a stop solution consisting of 2X Tris-Tricine SDS sample buffer (Biorad) containing 2 mM EDTA and 2mM
phenylmethylsulfonyl fluoride (Sigma catalog #~ P-7626). Digestion was assessed by 16.5 Tris-Tricine SDS-PAGE (precast gels form Biorad).
A?~~ENDED SHEET
Applicant Ref. No.: 05718-PCT.app ,.
. . . . ,. . , .
. . . ~ .., . . ~, .
. . , .
.. .. ... .... " ..
BHL-3N was digested by simulated intestinal fluid in 15 seconds. In contrast, wild-type CI-2 was resistant to digestion for 30 minutes. ~Chis experiment shows that in the intestine of humans or monogastric animals, our engineered protein would likely be more digestible than the wild-type protein would be. 'These results are consistent with the protease inhibition assays showing that BHL-3I~1 was not an effective protease inhibitor.
The inventive proteins were digested in less than five minutes, less than one minute and less than 30 seconds.
Example 6 - Protein Conformation Wild type CI-2, BHL-I, BHL-2, BHL-3 and BHL-3N at proteins concentrations of approximately 0.16mg/ml in IOmM sodium phosphate, pH = 7.0 were prepared and sent to the University of Michigan Medical Scho~~l Protein Structure Facility for circular dichroism analysis. Data indicates that the substituted proteins BHL-1, BHL-2 and BHL-3 have very similar CD spectra confirming that the BHL proteins fold into a structure similar to the wild type CI-2.
Example 7 - Thermodynamic stability Equilibrium denaturation experiments were done to assess the thermodynamic stability of the engineered and wild-type proteins, following the method of Pace et al.
(Meth. Enzym. 131:266-280). The engineered or wild-type proteins at a concentration of 2 pM were incubated 18 hours at 25 degrees centigrade in 10 mM sodium phosphate, pH
7.0, with various concentrations of guanidine-hydrochloride. Unfolding of the proteins was monite~l by measuring intrinsic fluorescence at 25 degrees centigrade, using an excitation wavelength of 280 nm and an emission wavelength of 356 nm. The guanidine-hydrochloride concentration sufficient for 50% unfolding was found to be 3.9M
for wild-type, 2.4M for BHL-1, and 0.9M for BHL-2, BI:IL-3, and BHL-3N. These experiments showed that BHL-1 has a higher thermodynamic stability than do the other engineered proteins, but that all of the engineered proteins lhave a lower thermodynamic stability than does the wild-type protein.
Example 8 - Accessibility of the Tryptophan of BHL Proteins to Acrylamide ~It~Elw~~~~ Si'I~C
Applicant Ref. No.: 05718-PCT.app .. , , .. . . . . . .~ . . . . . .
. . . . .,. , . . . ". ,.
~ . . , . ~ , .
.. .. ... .." .. ..
Acrylamide effectively quenches the fluorescence of accessible tryptophan residues in proteins. We examined fluorescence quenching of the tryptophan residue of the BHL
proteins and of the truncated WT CI-2, in the prcaence or absence of 6M
guanidine-hydrochloride. An excitation wavelength of 295 nm was used. Emission wavelengths of 337 nm and 356 nm were used for the samples v~ithout guanidine-HCl and with guanidine-HCI, respectively. Protein concentrations of 20 pM or 2 pM were used for the samples without, and with guanidine-HCI, respectively. Samples were in 10 mM
sodium phosphate, pH 7.0, and contained acrylamide at the following concentrations:
0, 0.0196M, 0.0385M, 0.0566M, 0.0741M, 0.0909M, 0.1071M, 0.01228M, or 0.1379M. The equation of Mclure and Edelman (Biochem 6: 559-566) vvas used to correct for self absorption of light by acrylamide. Fo/F was plotted against the molar acrylamide concentration, where Fo = fluorescence intensity without acrylamide, and F = fluorescence intensity with acrylamide. The slope of each line (known as the Stern-Volmer constant) was determined. The mean of 2 experiments is presented below. Values in parentheses are standard deviations.
Protein 6M guanidine-HC1 Slope BHL-1 - 3.5 (0.3) BHL-1 + 16.9 (1.3) BHL-2 - 4.6 (0.4) BHL-2 + 19.0 (0.1) BHL-3 - 2.4 (0.2) BHL-3 + 17.5 (0.04) BHL-3N - 5.8 (0.1) BHL-3N + 16.6 (0.6) WT CI-2 - 1.7 (0.1) (truncated.
WT CI-2 + 15.7(2.1) (truncated) Example 9 - Stabilization ~ Disulfide Bonds, An examination of the WI-CI 2 three dimensional structure has identified three pairs of residues (Glu-23 and Arg-81, Thr-22 and Val-82, and Val-53 and Val-70) with an alpha carbon distance appropriate for disulfide formation. Constructs designed to substitute these residues with cysteines will be prepared.
,,~. r.,.. ~.', i SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT: PIONEER HI-BRED INTERNATIONAL, INC.
(ii) TITLE OF INVENTION: PROTEINS WITH ENHANCED LEVELS
OF ESSENTIAL AMINO ACIDS
(iii) NUMBER OF SEQUENCES: 26 (iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: SMART & BIGGAR
(B) STREET: P.O. BOX 2999, STATION D
lO (C) CITY: OTTAWA
(D) STATE: ONT
(E) COUNTRY: CANADA
(F) ZIP: K1P 5Y6 (v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk (B) COMPUTER: IBM PC compatible (C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: ASCII (text) (vi) CURRENT APPLICATION DATA:
2 O (A) APPLICATION NUMBER: CA 2,270,289 (B) FILING DATE: 31-OCT-1997 (C) CLASSIFICATION:
(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: US 08/740,682 (B) FILING DATE: O1-NOV-1996 (viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: SMART & BIGGAR
(B) REGISTRATION NUMBER:
(C) REFERENCE/DOCKET NUMBER: 75529-49 , CA 02270289 1999-08-OS
(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (613)-232-2486 (B) TELEFAX: (613)-232-8440 (2) INFORMATION FOR SEQ ID N0:1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 195 base pairs 66a Applicant Ref. " , , >. . , , No.: ~, > , >
05718-PCT.app ". .
' , ~ ,. .,. >".
(B) TYPE: nucleic >
acid (C) STRANDEDNESS:
single (D) TOPOLOGY: linear S (ii) MOLECULE
TYPE:
cDNA
(ix) FEATURE:
(A) NAME/KEY: Coding quence:
Se (B) LOCATION: 1...195 IO (D) OTHER INFORMATION:
(xi) SEQ7:DNO:1:
SEQUENCE
DESCRIPTION:
AAG CTG ACA GAG TGG CCG TTGG7.>GGGG AAATCGGTG GAGAAA 48 AAG GAG
1S Lys Leu Thr Glu Trp Pro LeuVa1Gly LysSerVal GluLys Lys Glu 1 5 1C> 15 GCC AAG GTG ATC CTG AAG AAGCC:AGAG GCGCAAATC ATAGTT 96 AAG GAC
Ala Lys Val Ile Leu Lys LysProGlu AlaGlnIle IleVal Lys Asp CTG CCG GGT ACA AAG GTG AAGG~,ATAT AAGATCGAC CGCGTC 144 GTT ACG
Leu Pro Gly Thr Lys Val LysG7.uTyr LysIleAsp ArgVal Val Thr AAG CTC GTG GAT AAA AAG AACA7.'CGCG CAGGTCCCC AGGGTC 192 TTT GAC
Lys Leu Val Asp Lys Lys AsnI7.eAla GlnValPro ArgVal Phe Asp Gly (2) INFORMATION FOR SEQ ID N0:2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 65 amino acids 40 (B) TYPE: amino acid ---~: STRANDEDNESS : single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein 4S (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: :CDN0:2:
SEQ
Lys Leu Lys Thr Glu Trp Pro ValGlyLys Ser Glu Glu Leu Val Lys SO i s l 15 Ala Lys Lys Val Ile Leu Lys ProGluAla Gln Ile Asp Lys Ile Val Leu Pro Val Gly Thr Lys Val G:LuTyrLys Ile Arg Thr Lys Asp Val SS Lys Leu Phe Val Asp Lys Lys I:LeAlaGln Val Arg Asp Asn Pro Val Gly AMENDED ShiEET
Applicant Ref. No.: 05718-PC?.app ..
..
. . . . ., .~~. . . .
. . ~ . .., . . . ." , ~ ~ . . . . . .
,. ., ... .". .. ,.
(2) INFORMATION
FOR
SEQ
ID
N0:3:
S
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 195 base rs pai (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
IS (A) NAME/KEY: Coding quence Se (B) LOCATION: 1...195 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION:SEQID N0:3:
GAG
Lys Leu Lys Thr Glu Trp Pro LeuValGlyLys SerValG1u Lys Glu 2S GCC AAG AAG GTG ATC CTG AAG AAGCC'AGAGGCG CAAATCATA GTT 96 GAC
Ala Lys Lys Val Ile Leu Lys LysProGluAla GlnIleIle Val Asp CTA CCG GTT GGT ACA AAG GTG AAGGC'CTATAAG ATCGACAAG GTC 144 GCG
Leu Pro Val Gly Thr Lys Val LysAlaTyrLys IleAspLys Val Ala GAC
Lys Leu Phe Val Asp Lys Lys AsnIleAlaGln ValProArg Val Asp ~S 50 55 60 Gly (2) INFORMATION FOR SEQ ID NC1:4:
4S (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 65 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ 7:D N0:4:
SS
Lys Leu Lys Thr Glu Trp Pro Glu Leu Val Gly Lys Ser Val Glu Lys Ala Lys Lys Val Ile Leu Lys Asp Lys Pro Glu Ala Gln Ile Ile Val AMENDED SHEET
Applicant Ref. No.: 05718-PCT.app w ' .. , ~. . . , . . .~ , , .
.., . . . . ", . . ~ . ~ ~ .
.. .. ... " .. .. ..
Leu Pro Val Gly Thr Lys Val Ala Lys Ala Tyr Lys Ile Asp Lys Val Lys Leu Phe Val Asp Lys Lys Asp Asn Il.e Ala Gln Val Pro Arg Val Gly (2) INFORMATION FOR SEQ ID N0:5:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 195 base rs pai IS (B) TYPE:
nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
2O (ix) FEATURE:
(A) NAME/KEY: Coding quence.
Se (B) LOCATION: 1...195 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION:SEQ N0:5:
:CD
AAG CTG AAG ACA GAG TGG CCG TTGG'CGGGG AAATCGGTG GAGAAA 48 GAG
Lys Leu Lys Thr Glu Trp Pro LeuValGly LysSerVal GluLys Glu GAC
Ala Lys Lys Val Ile Leu Lys LysP:roGlu AlaGlnIle IleVal Asp CTA CCG GTT GGT ACA AAG GTG AAGC..~TTAT AAGATCGAC AAGGTC 144 GGT
Leu Pro Val Gly Thr Lys Val LysHisTyr LysIleAsp LysVal Gly GAC
Lys Leu Phe Val Asp Lys Lys AsnIleAla GlnValPro ArgVal Asp 4S G1y SO (2) INFORMATION FOR SEQ ID N0:6:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 65 amino acids (B) TYPE: amino acid SS (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein AMENDEt~ C'irrT
Applicant Ref. No.: 0571 R-PCT.app (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID N0:6:
~ ..
.. , ..
. , . . . . .. . . . . .
~ . . . ... . . . . ~.~ ..
. . . . .
.. ., .,. ..., .. ., $ Lys Leu Lys Thr Glu Trp Pro Glu Leu V,~1 Gly Lys Ser Val Glu Lys Ala Lys Lys Val Ile Leu Lys Asp Lys P:ro Glu Ala Gln Ile Ile Val Leu Pro Val Gly Thr Lys Val Gly Lys His Tyr Lys Ile Asp Lys Val Lys Leu Phe Val Asp Lys Lys Asp Asn ILe Ala Gln Val Pro Arg Val Gly 1$
°,C~D H~Ei ,,,, r,<~
Applicant Ref. No.: 05718-PCT.app ~ . .. , ..
,. .. . . .~ . .. . . . . . .
. . . . . .,. . . . . .., ..
~ ~ . , .. .. . . ..,. .. ..
(2) INFORMATION FOR SEQ ID N0:7:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 249 base pairs S (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
lO (ix) FEATURE:
(A) NAME/KEY: Coding Sequence:
(B) LOCATION: 1...249 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ 7:D N0:7:
AAG TCG GTG GAG AAG AAA CCG AAG GGT G7.'G AAG ACA GGT GCG GGT GAC 48 Lys Ser Val Glu Lys Lys Pro Lys Gly Val Lys Thr Gly Ala Gly Asp Lys His Lys Leu Lys Thr Glu Trp Pro G:_u Leu Val Gly Lys Ser Val GAG AAA GCC AAG AAG GTG ATC CTG AAG G~~.C AAG CCA GAG GCG CAA ATC 144 Glu Lys Ala Lys Lys Val Ile Leu Lys Asp Lys Pro Glu Ala Gln Ile Ile Val Leu Pro Val Gly Thr Lys Val G:Ly Lys His Tyr Lys Ile Asp AAG GTC AAG CTT TTT GTG GAT AAA AAG G~3C AAC ATC GCG CAG GTC CCC 240 3$ Lys Val Lys Leu Phe Val Asp Lys Lys Asp Asn Ile Ala Gln Val Pro Arg Val Gly (2) INFORMATION FOR SEQ ID Nc~:B:
4S (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 83 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear SO
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID N0:8:
SS
Lys Ser Val Glu Lys Lys Pro Lys Gly V,~1 Lys Thr Gly Ala Gly Asp Lys His Lys Leu Lys Thr Glu Trp Pro Glu Leu Val Gly Lys Ser Val ,4,';,-.n;r1'~; ~;:~~y_~:
Applicant Ref. No.: 0571 R-PCT.app ~. ., . .~ ,. "
. . . . .. . . , , , , . . . . . ." . . . . ,., , . . " , . . .. .. .~. .,., Glu Lys Ala Lys Lys Val Ile Leu Lys A;sp Lys Pro Glu Ala Gln Ile Ile Val Leu Pro Val Gly Thr Lys Val G:Ly Lys His Tyr Lys Ile Asp Lys Val Lys Leu Phe Val Asp Lys Lys As~~ Asn Ile Ala Gln Val Pro Arg Val Gly (2) INFORMATION FOR SEQ ID N0:9:
IS (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 249 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence 2S (B) LOCATION: 1...249 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9:
3O AAG TCG GTG GAG AAG AAA CCG AAG GGT G'TG AAG ACA GGT GCG GGT GAC 48 Lys Ser Val Glu Lys Lys Pro Lys Gly Val Lys Thr Gly Ala Gly Asp AAG CAT AAG CTG AAG ACA GAG TGG CCG G.AG TTG GTG GGG AAA TCG GTG 96 3S Lys His Lys Leu Lys Thr Glu Trp Pro Glu Leu Val Gly Lys Ser Val GAG AAA GCC AAG AAG GTG ATC CTG AAG G.AC AAG CCA GAG GCG CAA ATC 144 Glu Lys Ala Lys Lys Val Ile Leu Lys Asp Lys Pro Glu Ala Gln Ile Ile Val Leu Pro Val Gly Thr Lys Val Thr Lys Glu Tyr Lys Ile Asp Arg Val Lys Leu Phe Val Asp Lys Lys Asp Asn Ile Ala Gln Val Pro Arg Val Gly SS (2) INFORMATION FOR SEQ ID NO:10:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 83 amino acids ;,i;: ~ ~~.: ~'-%
Applicant Rcf. No.: 05718-PCT.app (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:
;; ~.;~ . ~:,=l-Appi:cant Ref. No.: 05718-PCT.app ,. .. . .. ,.
.. , . . . . .. . . . , .
. . . , , .,. . . . . ."
. . . , " .. ... .... ,. ., Lys Ser Val Glu Lys Lys Pro Lys Gly Val Lys Thr Gly Ala Gly Asp Lys HisLys LeuLys ThrGluTrpPro G:LuLeuVa1 GlyLysSer Val S Glu LysAla LysLys ValIleLeuLys AapLysPro GluAlaGln Ile Ile ValLeu ProVal GlyThrLysVal TlzrLysGlu TyrLysIle Asp Arg ValLys Phe LysLys Asp Ile Ala Val Pro Leu Val Asn Gln Asp Arg ValGly (2) INFORMATION FOR SEQ ID NO:11:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 249 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
ZS (ix) FEATURE:
(A) NAME/KEY: Coding Sequenc~_ (B) LOCATION: 1...249 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:
AAG TCG GTG GAG AAG AAA CCG AAG GGT G'TG AAG ACA GGT GCG GGT GAC 48 Lys Ser Val Glu Lys Lys Pro Lys Gly Val Lys Thr Gly Ala Gly Asp AAG CAT AAG CTG AAG ACA GAG TGG CCG G.AG TTG GTG GGG AAA TCG GTG 96 Lys His Lys Leu Lys Thr Glu Trp Pro Glu Leu Val Gly Lys Ser Val GAG P.AA~C AAG AAG GTG ATC CTG AAG GAC AAG CCA GAG GCG CAA ATC 144 Glu Lys Ala Lys Lys Val Ile Leu Lys Asp Lys Pro Glu Ala Gln Ile Ile Val Leu Pro Val Gly Thr Lys Val Ala Lys Ala Tyr Lys Ile Asp SO Lys Val Lys Leu Phe Val Asp Lys Lys Asp Asn Ile Ala Gln Val Pro SS
Arg Val Gly (2) INFORMATION FOR SEQ TD N0:12:
AppIScant Ref. No.: 05718-PCT.app .. . ., . . . . ,. . . , , . , . . . . , ... . . . ., , ~ , . , ~ . " " ." ,... " ., (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 83 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID N0:12:
Lys Ser Val Glu Lys Lys Pro Lys Gly V;al Lys Thr Gly Ala Gly Asp Lys His Lys Leu Lys Thr Glu Trp Pro G.Lu Leu Val Gly Lys Ser Val Glu Lys Ala Lys Lys Val Ile Leu Lys A;sp Lys Pro Glu Ala Gln Ile Ile Val Leu Pro Val Gly Thr Lys Val A:La Lys Ala Tyr Lys Ile Asp Lys Val Lys Leu Phe Val Asp Lys Lys A;sp Asn Ile Ala Gln Val Pro Arg Val Gly (2) INFORMATION FOR SEQ ID N0:13:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 249 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear ~S (ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence (B) LOCATION: 1...249 4O (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ :ID N0:13:
AGT TCA GTG GAG AAG AAG CCG GAG GGA G'TG AAC ACC GGT GCT GGT GAC 48 4$ Ser Ser Val Glu Lys Lys Pro Glu Gly Val Asn Thr Gly Ala Gly Asp CGT CAC AAC CTG AAG ACA GAG TGG CCA G:4G TTG GTG GGG AAA TCG GTG 96 Arg His Asn Leu Lys Thr Glu Trp Pro G:Lu Leu Val Gly Lys Ser Val Glu Glu Ala Lys Lys Val Ile Leu Gln A;sp Lys Pro Glu Ala Gln Ile Ile Val Leu Pro Val Gly Thr Ile Val Tlar Met Glu Tyr Arg Ile Asp Appijcant Rcf. No.: 05718-PCT.app . ~~ .. .
..
. . . ~. . . ~ ~ , .
. .. . . . . "
. , ~. .~ . , .,» ,.
Arg Val Arg Leu Phe Val Asp Lys Leu Asp Asn Ile Ala Gln Val Pro S
Arg Val Gly (2) INFORMATION FOR SEQ N0:14:
ID
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 83 amino acids IS (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein ZO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: N0:14:
SEQ :LD
Ser Ser Val Glu Lys Lys Pro Va~lAsn ThrGly Gly Glu Gly Ala Asp Arg His Asn Leu Lys Thr Glu G:LuLeu ValGly Ser Trp Pro Lys Val Glu Glu Ala Lys Lys Val Ile AspLys ProGlu Gln Leu Gln Ala Ile 30 Ile Val Leu Pro Val Gly Thr ThrMet GluTyr Ile Ile Val Arg Asp Arg Jal Arg Leu Phe Val Asp AspAsn IleAla Val Lys Leu Gln Pro Arg Val Gly (2) INFORMATION FOR SEQ ID N0:15:
4O (i) SEQUENCE CHARACTERISTICS:
-~ LENGTH: 459 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: CDNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence:
SO (B) LOCATION: 1....288 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ).D N0:15:
SS GCA GTG CAA CAA GCA AGA TTT ACC TGC CC:A TCG ATC ATA TCG TCA ACT 48 Ala Val Gln Gln Ala Arg Phe Thr Cys Pro Ser Ile Ile Ser Ser Thr Applicant Ref. No.: 0571 R-PCT.app v ~ .r ,. , m ~ , ~ ~ ~ ~ ~ , . , . . . , ~ , , . . . , , ,., . . , , ' ~ ~ . , , ~ r m , , , : , , , . , , ~
Gly Pro Ala Val Arg Asp Thr Met Ser Ser Thr Glu Cys Gly Gly Gly S GGC GGC GGC GCC AAG ACG TCG TGG CCT Gi~G GTG GTC GGG CTG AGC GTG 144 Gly Gly Gly Ala Lys Thr Ser Trp Pro G:Lu Val Val Gly Leu Ser Val GAG GAC GCC AAG AAG GTG ATG GTC AAG Gl~C AAG CCG GAC GCC GAC ATC 192 Glu Asp Ala Lys Lys' Val Met Val Lys Asp Lys Pro Asp Ala Asp Ile Val Val Leu Pro Val Gly Ser Val Val Thr Ala Asp Tyr Arg Pro Asn CGT GTC CGC ATC TTC GTC GAC ATC GTC G(:C CAG ACG CCC CAC ATC GGC T 289 Arg Val Arg Ile Phe Val Asp Ile Val A7_a Gln Thr Pro His Ile Gly GATAATATAT AAGCTAGCCG CTATTTCCTT TCCT7.'GCCCC AGAACTTGAA ATAAATATAT 349 ATACGATGAA ATAACGCGGG CATGCCGAAT ANATCdGANTG TGNNTGAATT CTCACTAATT 409 AAGTAATGNC ATAAATAAAC GTATTCAAAA AAAA7~.AAAAA P~~AAAAAA.AA 4 5 9 (2) INFORMATION FOR SEQ ID N0:16:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 96 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear ~S (ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ 7:D N0:16:
Ala Val Gln Gln Ala Arg Phe Thr Cys Pro Ser Ile Ile Ser Ser Thr Gly Pro Ala Val Arg Asp Thr Met Ser Se:r Thr Glu Cys Gly Gly Gly Gly Gly Gly Ala Lys Thr Ser Trp Pro Gl.u Val Val Gly Leu Ser Val Glu Asp Ala Lys Lys Val Met Val Lys A~,p Lys Pro Asp Ala Asp Ile Val Val Leu Pro Val Gly Ser Val Val Thr Ala Asp Tyr Arg Pro Asn SO Arg Val Arg Ile Phe Val Asp Ile Val Ala Gln Thr Pro His Ile Gly SS !2) INFORMATION FOR SEQ ID N0:17:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 428 base pairs Applicant Ref. No.: 05718-PCT.app .. ., , .. , ,. . . . . . ., . . . . . , ", . . . , .. .. .n .... .. ..
(B) TYPE: nucleic acid (C) STRANDEDNESS: single (D} TOPOLOGY: linear S (ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence (B) LOCATION: 1...303 IO (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ :CD N0:17:
CGA CCC ACG CGT CCG CCC ACG CGT CCG G(:A AGA TTT ACC TGC CCA TCG 48 IS Arg Pro Thr Arg Pro Pro Thr Arg Pro Al.a Arg Phe Thr Cys Pro Ser Ile Ile Ser Ser Thr Gly Pro Ala Val Arg Asp Thr Met Ser Ser Thr GAG TGC GGC GGC GGC GGC GGC GGC GCC AF~G ACG TCG TGG CCT GAG GTG 144 Glu Cys Gly Gly Gly Gly Gly Gly Ala L~~s Thr Ser Trp Pro Glu Val GTC GGG CTG AGC GTG GAG GAC GCC AAG AF~G GTG ATC CTC AAG GAC AAG 192 Val Gly Leu Ser Val Glu Asp Ala Lys Lys Val Ile Leu Lys Asp Lys 3O CCG GAC GCC GAC ATC GTG GTG CTG CCC GT'C GGC TCC GTG GTG ACC GCG 240 Pro Asp Ala Asp Ile Val Val Leu Pro Va.l Gly Ser Val Val Thr Ala GAT TAT CGC CCT AAC CGT GTC CGC ATC TT'C GTC GAC ATC GTC GCC CAG 288 ~S Asp Tyr Arg Pro Asn Arg Val Arg Ile Phe Val Asp Ile Val Ala Gln Thr Pro His Ile Gly TGAAAAA.AAA F,~~P.AAP.AP.AA AAAA 4 2 8 (2) INFORMATION FOR SEQ ID N0:18:
(i) SEQUENCE CHARACTERISTICS:
SO (A) LENGTH: 101 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear SS (ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ I;,~ N0:18:
Applicant Ref. No.: 05718-PCT.app r. .. . . ,.
.. . . . . , .: . . . . .
, . . . . .,. .
.
.. .. . . . .. .. ., Arg Pro Thr Arg Pro Pro Thr Arg Pro A:La Arg Phe Thr Cys Pro Ser Ile Ile SerSerThr Gly AlaVal A:rgAspThr MetSer SerThr Pro Glu Cys GlyGlyGly Gly GlyAla L!~sThrSer TrpPro GluVal Gly Val Gly LeuSerVal Glu AlaLys Ll~sValIle LeuLys AspLys Asp Pro Asp AlaAspIle Val LeuPro ValGlySer ValVal ThrAla Val Asp Tyr ArgProAsn Arg ArgIle PheValAsp IleVal AlaGln Val Thr Pro HisIleGly loo (2) INFORMATION FOR SEQ ID N0:19:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 441 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence' (B) LOCATION: 1...255 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ :CD N0:19:
Leu Ile Ile Ala Leu Ser Val Xaa His Arg Gln Pro Ser Thr Met Ser Ser Thr-'G1~ Gly Gly Asp Asp Gly ATa L;rs Lys Ser Trp Pro Glu Val 45 Val Gly Leu Ser Leu Glu Glu Ala Lys Arg Val Ile Leu Cys Asp Lys CCC GAC GCC GAC ATC GTC GTG CTG CCC G'.CC GGC ACG CCG GTG ACC ATG 192 Pro Asp Ala Asp Ile Val Val Leu Pro Val Gly Thr Pro Val Thr Met $5 GAT TTC CGC CCC AAC CGC GTC CGC ATC T'.CC GTC GAC ACC GTC GCG GAG 240 Asp Phe Arg Pro Asn Arg Val Arg Ile Phe Val Asp Thr Val Ala Glu GCA MCC CAC ATC GGC TGAGGTTAAA TCTACA)~AAT GAATGAYTCG GACATGCCAT G 296 Ala Xaa His Ile Gly Applicant Rcf. No.: 05718-PCT.app .. .. .
,. . . . . .~ . . . , , .
. . . . .., . . , . . ..
. ~ . , . .
.. .. ... .,.. .. ., S
(2) INFORMATION FOR SEQ ID NC~:20:
lO (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 85 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ 7:D N0:20:
Leu IleIle AlaLeuSer ValXaaHis ArgGlnPro SerThrMet Ser CI
Ser ThrGly GlyGlyAsp AspGlyAla L~~sLysSer TrpProGlu Val 2S Val GlyLeu SerLeuGlu GluAlaLys ArgValIle LeuCysAsp Lys Pro AspAla AspIleVal ValLeuPro ValGlyThr ProValThr Met Asp PheArg ProAsnArg ValArgIle PheValAsp ThrValAla Glu Ala XaaHis IleGly (2) INFORMATION FOR SEQ ID N0:21:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 382 base pairs 40 (B) TYPE: nucleic acid ~ STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
4S (ix) FEATURE:
(A) NAME/KEY: Coding Sequencf:
(B) LOCATION: 1...213 (D) OTHER INFORMATION:
SO
(xi) SEQUENCE DESCRIPTION: SEQ :CD N0:21:
Val Arg Arg Arg Arg Thr Ala Thr Gly G:Ly Lys Thr Ser Trp Pro Glu GTG GTC GGG CTG AGC GTC GAG GAA GCC Ai4G AAG GTG ATT CTG GCG GAC 96 Val Val Gly Leu Ser Val Glu Glu Ala Lys Lys Val Ile Leu Ala Asp Applicant Ref. No.: 0571 R-PCT.app .. ., . .. ..
.. . . . . . .. . . . . .
. . . .. , . ... .
. . .
.. .. ... .... .. ..
Lys Pro Asn Ala Asp Ile Val Val Leu Pro Thr Thr Thr Gln Ala Val Thr Ser Asp Phe Gly Phe Asp Arg Val Arg Val Phe Val Gly Thr Val Ala Gln Thr Pro His Val Gly P~~;4AAAAAAA AAAAA 3 8 2 (2) INFORMATION FOR SEQ ID N0:22:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 71 amino acids 2S (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID N0:22:
Val Arg Arg Arg Arg Thr Ala Thr Gly Gly Lys Thr Ser Trp Pro Glu ~S 1 5 10 15 Val Val Gly Leu Ser Val Glu Glu Ala Lys Lys Val Ile Leu Ala Asp Lys Pro Asn Ala Asp Ile Val Val Leu Pro Thr Thr Thr Gln Ala Val Thr Ser Asp Phe Gly Phe Asp Arg Val Arg Val Phe Val Gly Thr Val Ala Gln Thr Pro His Val Gly (2) INFORMATION FOR SEQ ID N0:23:
(i) SEQUENCE CHARACTERISTICS:
SO (A) LENGTH: 448 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear SS (ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence gl Applicant Ref. No.: 05718-PCT.app ., . .. , .. .. .~ . . , . . . . .
.., . . .
, , .. .. .., . .. ..
(B) LOCATION: 1...240 (D) OTHER INFORMATION:
S
(xi) SEQUENCE DESCRIPTION: SEQ 7:D N0:23:
CGA TTT AGC TAT AGC AGG TCT CGA TCG GC:G GCC ATG AGC GGT AGC CGC 48 Arg Phe Ser Tyr Ser Arg Ser Arg Ser Al.a Ala Met Ser Gly Ser Arg Ser Lys Lys Ser Trp Pro Glu Val Glu Gl.y Leu Pro Ser Glu Val Ala 1$ Lys Gln Lys Ile Leu Ala Asp Arg Pro A:>p Val Gln Val Val Val Leu CCC GAC GGC TCC TTC GTC ACC ACT GAT TTC: AAC GAC AAG CGC GTC CGG 192 Pro Asp Gly Ser Phe Val Thr Thr Asp Phe Asn Asp Lys Arg Val Arg GTC TTC GTC GAC AAC GCC GAC AAC GTC GC:C AAA GTC CCC AAG ATC GGC T 241 Val Phe Val Asp Asn Ala Asp Asn Val Al.a Lys Val Pro Lys Ile Gly AGCTAGCTAG CTAGGCCCAA TCGTTCTAAT CAGC7.'AGTTT CTTTCTTTCA TAAATAAAAG 301 CTTAATGGAT GCCATGGCGC CCGCGCGCGC CTYC.t~TCATG AAAAGCTACA TTTGAAACGA 421 (2) INFORMATION FOR SEQ ID N0:24:
3S (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 80 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (=~ MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ :CD N0:24:
Arg Phe SerTyrSer ArgSerArg SerA:LaAla MetSerGly SerArg Ser Lys LysSerTrp ProGluVal GluG:lyLeu ProSerGlu ValAla Lys Gln LysIleLeu AlaAspArg ProAspVal GlnValVal ValLeu Pro Asp GlySerPhe ValThrThr AspPheAsn AspLysArg ValArg Val Phe ValAspAsn AlaAspAsn ValA:laLys ValProLys IleGly 6s 70 75 so -r..h ~.:
Applicant Ref. No.: 0571 R-PCT.app .. , , ,.
.., . . . . , " , , . , , ". . . . . ", . , , .. " ." .,., .. ,.
(2) INFORMATION FOR SEQ ID N0:25:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs S (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ I:D N0:25:
(2) INFORMATION FOR SEQ ID NC>:26:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear ZS (ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ I:D N0:26:
75529-49(S) All publications and patent applications mentioned in this specification are indicati~r~ of the level of skill of those skilled in the art to which this invention pertains.
Variations on the above embodiments are within the ability of one of ordzz~ar-y skill in the art, and such variations do not depart from the scope of the present ira-v~;nti~~~ as described in the following claims.
Claims (77)
1. An isolated polypeptide comprising a modified variant of SEQ ID NO: 14, or a modified variant of the sequence from position 19 to position 83 of SEQ ID NO: 14, wherein the modified variant:
(a) contains a higher percentage of essential amino acids than either SEQ ID N0: 14 or the sequence from position 19 to position 83 of SEQ ID N0: 14;
(b) has greater than 60% amino acid similarity to SEQ ID NO: 14 or the sequence from position 19 to position 83 of SEQ ID NO: 14, wherein the percent sequence similarity is based on the entire sequence and is determined by BLAST
2.0 using default parameters; and (c) contains an essential amino acid at a position corresponding to a position of SEQ ID NO: 14 selected from the group consisting of 1, 8, 17, 19, 34, 41, and 67, or contains a lysine at a position corresponding to a position of SEQ ID NO: 14 selected from the group consisting of 56, 59, 62 and 73.
(a) contains a higher percentage of essential amino acids than either SEQ ID N0: 14 or the sequence from position 19 to position 83 of SEQ ID N0: 14;
(b) has greater than 60% amino acid similarity to SEQ ID NO: 14 or the sequence from position 19 to position 83 of SEQ ID NO: 14, wherein the percent sequence similarity is based on the entire sequence and is determined by BLAST
2.0 using default parameters; and (c) contains an essential amino acid at a position corresponding to a position of SEQ ID NO: 14 selected from the group consisting of 1, 8, 17, 19, 34, 41, and 67, or contains a lysine at a position corresponding to a position of SEQ ID NO: 14 selected from the group consisting of 56, 59, 62 and 73.
2. An isolated polypeptide comprising a modified variant of SEQ ID NO: 14, or a modified variant of the sequence from position 19 to position 83 of SEQ ID N0: 14, wherein the modified variant:
(a) contains a higher percentage of essential amino acids than either SEQ ID NO: 14 or the sequence from position 19 to position 83 of SEQ ID NO: 14;
(b) has greater than 60% amino acid similarity to SEQ ID N0: 14 or the sequence from position 19 to position 83 of SEQ ID N0: 14, wherein the percent sequence similarity is based on the entire sequence and is determined by BLAST
2.0 using default parameters; and (c) is modified at at least 11 positions of SEQ ID
NO: 14 to contain essential amino acids at said at least 11 positions.
(a) contains a higher percentage of essential amino acids than either SEQ ID NO: 14 or the sequence from position 19 to position 83 of SEQ ID NO: 14;
(b) has greater than 60% amino acid similarity to SEQ ID N0: 14 or the sequence from position 19 to position 83 of SEQ ID N0: 14, wherein the percent sequence similarity is based on the entire sequence and is determined by BLAST
2.0 using default parameters; and (c) is modified at at least 11 positions of SEQ ID
NO: 14 to contain essential amino acids at said at least 11 positions.
3. An isolated polypeptide comprising a modified variant of SEQ ID NO: 14, or a modified variant of the sequence from position 19 to position 83 of SEQ ID NO: 14, wherein the modified variant:
(a) contains a higher percentage of essential amino acids than either SEQ ID NO: 14 or the sequence from position 19 to position 83 of SEQ ID NO: 14;
(b) has greater than 60% amino acid similarity to SEQ ID NO: 14 or the sequence from position 19 to position 83 of SEQ ID NO: 14, wherein the percent sequence similarity is based on the entire sequence and is determined by BLAST
2.0 using default parameters; and (c) contains a pair of cysteines at at least one pair of positions corresponding to SEQ ID NO:14 positions Glu-23 and Arg-81, Thr-22 and Val-82, or Val-53 and Val-70.
(a) contains a higher percentage of essential amino acids than either SEQ ID NO: 14 or the sequence from position 19 to position 83 of SEQ ID NO: 14;
(b) has greater than 60% amino acid similarity to SEQ ID NO: 14 or the sequence from position 19 to position 83 of SEQ ID NO: 14, wherein the percent sequence similarity is based on the entire sequence and is determined by BLAST
2.0 using default parameters; and (c) contains a pair of cysteines at at least one pair of positions corresponding to SEQ ID NO:14 positions Glu-23 and Arg-81, Thr-22 and Val-82, or Val-53 and Val-70.
4. An isolated polypeptide comprising a modified variant of SEQ ID NO: 14, or a modified variant of the sequence from position 19 to position 83 of SEQ ID NO: 14, wherein the modified variant:
(a) contains at least 55% essential amino acids;
(b) has greater than 60% amino acid similarity to SEQ ID NO: 14 or the sequence from position 19 to position 83 of SEQ ID NO: 14, wherein the percent sequence similarity is based on the entire sequence and is determined by BLAST
2.0 using default parameters; and (c) contains a pair of cysteines at at least one pair of positions corresponding to SEQ ID N0:14 positions Glu-23 and Arg-81, Thr-22 and Val-82, or Val-53 and Val-70.
(a) contains at least 55% essential amino acids;
(b) has greater than 60% amino acid similarity to SEQ ID NO: 14 or the sequence from position 19 to position 83 of SEQ ID NO: 14, wherein the percent sequence similarity is based on the entire sequence and is determined by BLAST
2.0 using default parameters; and (c) contains a pair of cysteines at at least one pair of positions corresponding to SEQ ID N0:14 positions Glu-23 and Arg-81, Thr-22 and Val-82, or Val-53 and Val-70.
5. The polypeptide of any one of claims 2 to 4 wherein the modified variant contains an essential amino acid at a position corresponding to a position of SEQ ID N0:
14 selected from the group consisting of 1, 8, 17, 19, 34, 41, and 67, or contains a lysine at a position corresponding to a position of SEQ ID N0: 14 selected from the group consisting of 56, 59, 62 and 73.
14 selected from the group consisting of 1, 8, 17, 19, 34, 41, and 67, or contains a lysine at a position corresponding to a position of SEQ ID N0: 14 selected from the group consisting of 56, 59, 62 and 73.
6. The polypeptide of any one of claims 1, 3 and 4 wherein the modified variant is modified at at least 11 positions of SEQ ID N0:.14 to contain essential amino acids at said at least 11 positions.
7. The polypeptide of claim 1 or 2 wherein the modified variant contains a pair of cysteines at at least one pair of positions corresponding to SEQ ID N0:14 positions Glu-23 and Arg-81, Thr-22 and Val-82, or Val-53 and Val-70.
8. The polypeptide of claim 1 or 3 wherein the modified variant contains at least 55% essential amino acids.
9. The polypeptide of claim 4 or 7 wherein the modified variant contains cysteines at positions Glu-23 and Arg-81 of SEQ ID N0:14.
10. The polypeptide of claim 4 or 7 wherein the modified variant contains cysteines at positions Thr-22 and Val-82 of SEQ ID NO:14.
11. The polypeptide of claim 4 or 7 wherein the modified variant contains cysteines at positions Val-53 and Val-70 of SEQ ID NO:14.
12. The isolated polypeptide of any one of claims 1 to 11 wherein at least five of the modified positions are modified from a non-essential amino acid to an essential amino acid.
13. The isolated polypeptide of any one of claims 1 to 12 wherein the modified variant has at least 70% amino acid similarity to SEQ ID NO: 14 or the sequence from position 18 19 to position 83 of SEQ ID NO: 14, wherein the percent sequence similarity is based on the entire sequence and is determined by BLAST 2.0 using default parameters.
14. The isolated polypeptide of any one of claims 1 to 12 wherein the modified variant has at least 80% amino acid similarity to SEQ ID NO: 14 or the sequence from position 19 to position 83 of SEQ ID NO: 14, wherein the percent sequence similarity is based on the entire sequence and is determined by BLAST 2.0 using default parameters.
15. The isolated polypeptide of any one of claims 1, 3, 4, 6, 8 and 9 to 11 wherein the modified variant has at least 90% amino acid similarity to SEQ ID NO: 14 or the sequence from position 19 to position 83 of SEQ ID NO: 14, wherein the percent sequence similarity is based on the entire sequence and is determined by BLAST 2.0 using default parameters.
16. The isolated polypeptide of any one of claims 1, 3, 4, 6, 8 and 9 to 11 wherein the modified variant has at least 95% amino acid similarity to SEQ ID NO: 14 or the sequence from position 19 to position 83 of SEQ ID NO: 14, wherein the percent sequence similarity is based on the entire sequence and is determined by BLAST 2.0 using default parameters.
17. The isolated polypeptide of any one of claims 1 to 16 wherein the modified variant contains an essential amino acid at a position corresponding to position 1 of SEQ ID NO:
14.
14.
18. The isolated polypeptide of any one of claims 1 to 16 wherein the modified variant contains an essential amino acid at a position corresponding to position 8 of SEQ ID NO:
14.
14.
19. The isolated polypeptide of any one of claims 1 to 16 wherein the modified variant contains an essential amino acid at a position corresponding to position 17 of SEQ ID
NO: 14.
NO: 14.
20. The isolated polypeptide of any one of claims 1 to 16 wherein the modified variant contains an essential amino acid at a position corresponding to position 19 of SEQ ID
NO: 14.
NO: 14.
21. The isolated polypeptide of any one of claims 1 to 16 wherein the modified variant contains an essential amino acid at a position corresponding to position 34 of SEQ ID
NO: 14.
NO: 14.
22. The isolated polypeptide of any one of claims 1 to 16 wherein the modified variant contains an essential amino acid at a position corresponding to position 41 of SEQ ID
NO: 14.
NO: 14.
23. The isolated polypeptide of any one of claims 1 to 16 wherein the modified variant contains lysine at a position corresponding to position 56 of SEQ ID NO: 14.
24. The isolated polypeptide of any one of claims 1 to 16 wherein the modified variant contains lysine at a position corresponding to position 59 of SEQ ID NO: 14.
25. The isolated polypeptide of any one of claims 1 to 16 wherein the modified variant contains lysine at a position corresponding to position 62 of SEQ ID NO: 14.
26. The isolated polypeptide of any one of claims 1 to 16 wherein the modified variant contains an essential amino acid at a position corresponding to position 67 of SEQ ID
NO: 14.
NO: 14.
27. The isolated polypeptide of any one of claims 1 to 16 wherein the modified variant contains lysine at a position corresponding to position 73 of SEQ ID NO: 14.
28. The isolated polypeptide of any one of claims 18 to 22 and 26 wherein the essential amino acid is lysine.
29. The isolated polypeptide of any one of claims 1 to 28 wherein the modified variant contains lysine at at least 2 positions corresponding to SEQ ID NO: 14 selected from the group consisting of 1, 8, 17, 19, 34, 41, 56, 59, 62, 67 and 73.
30. The isolated polypeptide of any one of claims 1 to 28 wherein the modified variant contains lysine at at least 3 positions corresponding to SEQ ID NO: 14 selected from the group consisting of 1, 8, 17, 19, 34, 41, 56, 59, 62, 67 and 73.
31. The isolated polypeptide of any one of claims 1 to 15 wherein the modified variant contains lysine at at least 4 positions corresponding to SEQ ID NO: 14 selected from the group consisting of 1, 8, 17, 19, 34, 41, 56, 59, 62, 67 and 73.
32. The isolated polypeptide of any one of claims 1 to 15 wherein the modified variant contains lysine at at least 5 positions corresponding to SEQ ID NO: 14 selected from the group consisting of 1, 8, 17, 19, 34, 41, 56, 59, 62, 67 and 73.
33. The isolated polypeptide of any one of claims 1 to 15 wherein the modified variant contains lysine at at least 6 positions corresponding to SEQ ID NO: 14 selected from the group consisting of 1, 8, 17, 19, 34, 41, 56, 59, 62, 67 and 73.
34. The isolated polypeptide of any one of claims 1 to 15 wherein the modified variant contains lysine at at least 7 positions corresponding to SEQ ID NO: 14 selected from the group consisting of 1, 8, 17, 19, 34, 41, 56, 59, 62, 67 and 73.
35. The isolated polypeptide of any one of claims 1 to 15 wherein the modified variant contains lysine at at least 8 positions corresponding to SEQ ID NO: 14 selected from the group consisting of 1, 8, 17, 19, 34, 41, 56, 59, 62, 67 and 73.
36. The isolated polypeptide of any one of claims 1 to 15 wherein the modified variant contains lysine at at least 9 positions corresponding to SEQ ID NO: 14 selected from the group consisting of 1, 8, 17, 19, 34, 41, 56, 59, 62, 67 and 73.
37. The isolated polypeptide of any one of claims 1 to 15 wherein the modified variant contains lysine at at least positions corresponding to SEQ ID NO: 14 selected from the group consisting of 1, 8, 17, 19, 34, 41, 56, 59, 62, 67 and 73.
38. The isolated polypeptide of any one of claims 1 to wherein the modified variant contains lysine at positions 1, 8, 17, 19, 34, 41, 56, 59, 62, 67 and 73 of SEQ ID NO:
14.
14.
39. The isolated polypeptide of any one of claims 1 to 27 and 32 to 38 wherein the essential amino acid is methionine, threonine, lysine, isoleucine, leucine, valine, tryptophan, phenylalanine, histidine, or mixtures thereof.
40. The isolated polypeptide of claim 39 wherein the essential amino acid is lysine, tryptophan, methionine, threonine, or mixtures thereof.
41. The isolated polypeptide of claim 39 or 40, further comprising arginine, at least two cysteine residues, isoleucine, glycine, glutamic acid or mixtures thereof, wherein the at least two cysteine residues occupy at least one pair of positions corresponding to SEQ ID NO:14 positions Glu-23 and Arg-81, Thr-22 and Val-82, or Val-53 and Val-70.
42. The isolated polypeptide of claim 1 wherein the modified variant has the sequence set forth in SEQ ID NO: 2.
43. The isolated polypeptide of claim 1 wherein the modified variant has the sequence set forth in SEQ ID NO: 4.
44. The isolated polypeptide of claim 1 wherein the modified variant has the sequence set forth in SEQ ID NO: 6.
45. The isolated polypeptide of claim 1 wherein the modified variant has the sequence set forth in SEQ ID NO: 8.
46. The isolated polypeptide of claim 1 wherein the modified variant has the sequence set forth in SEQ ID NO:
10.
10.
47. The isolated polypeptide of claim 1 wherein the modified variant has the sequence set forth in SEQ ID NO:
12.
12.
48. The polypeptide of any one of claims 1 to 41, having a molecular weight of about 7.3 Kda or about 9.2 Kda.
49. The polypeptide of any one of claims 1 to 41 and 48, which is a cleavage product.
50. The polypeptide of claim 49, wherein a signal peptide-containing protein is cleaved to produce the cleavage product.
51. The polypeptide of any one of claims 1 to 41 and 48, which is recombinantly produced.
52. The polypeptide of any one of claims 1 to 41 and 48, wherein the modified variant is the modified variant of the sequence from position 19 to 83 of SEQ ID NO:14, further comprising more than one and less than 50 additional amino terminal amino acid residues.
53. The polypeptide of claim 52, wherein one additional amino terminal amino acid residue is methionine.
54. The polypeptide of claim 52, wherein the additional amino terminal amino acid residues are essential amino acids.
55. An isolated nucleic acid encoding the polypeptide of any one of claims 1 to 47.
56. The isolated nucleic acid of claim 55 comprising a polynucleotide amplified from a plant nucleic acid library using at least one of the primers of SEQ ID NO: 25 or SEQ ID
NO: 26 wherein the isolated nucleic acid selectively hybridizes under stringent hybridization conditions, comprising washing with a salt concentration of about 0.02 molar. at pH 7 at 50°C, to the complement of a nucleic acid selected from the group consisting of SEQ ID NOs: 1, 3, 5, 7, 9, and 11.
NO: 26 wherein the isolated nucleic acid selectively hybridizes under stringent hybridization conditions, comprising washing with a salt concentration of about 0.02 molar. at pH 7 at 50°C, to the complement of a nucleic acid selected from the group consisting of SEQ ID NOs: 1, 3, 5, 7, 9, and 11.
57. A recombinant expression cassette comprising the nucleic acid of claim 55 or 56 operably linked to a promoter.
58. The recombinant expression cassette of claim 57, wherein the promoter provides for protein expression in plants.
59. A transformed plant cell comprising the recombinant expression cassette of claim 57 or 58.
60. The plant cell of claim 59, wherein the plant cell is monocotyledonous.
61. The plant cell of claim 60, wherein the plant cell is selected from the group consisting of maize, sorghum, wheat, rice and barley.
62. The plant cell of claim 59, wherein the plant cell is dicotyledonous.
63. The plant cell of claim 62, wherein the plant cell is selected from the group consisting of soybean, alfalfa, canola, sunflower, tobacco and tomato.
64. The plant cell of claim 59, wherein the plant cell is maize or soybean.
65. The plant cell of any one of claims 59 to 64 which is a seed cell.
66. An animal feed composition comprising plant tissue, wherein the plant tissue comprises the polypeptide of any one of claims 1 to 54.
67. A method for increasing the nutritional value of a plant comprising:
(a) introducing into cells of the plant the recombinant expression cassette as defined in claim 58 to yield transformed plant cells, and (b) regenerating a transformed plant from the transformed plant cells.
(a) introducing into cells of the plant the recombinant expression cassette as defined in claim 58 to yield transformed plant cells, and (b) regenerating a transformed plant from the transformed plant cells.
68. The method of claim 67, wherein the transformed plant is maize.
69. Use of at least one recombinant expression cassette as defined in claim 58 in the preparation of a transformed plant.
70. Use of at least one recombinant expression cassette as defined in claim 58 for the preparation of a seed of a transformed plant.
71. The use of claim 69 or 70, wherein the plant is a monocotyledonous plant.
72. The use of claim 71, wherein the monocotyledonous plant is selected from the group consisting of maize, sorghum, wheat, rice and barley.
73. The use of claim 69 or 70, wherein the plant is a dicotyledonous plant.
74. The use of claim 73, wherein the dicotyledonous plant is selected from the group consisting of soybean, alfalfa, canola, sunflower, tobacco and tomato.
75. The use of claim 69, wherein the plant is maize or soybean.
76. The use of claim 70, wherein the plant is maize or soybean.
77. Use of the plant cell of any one of claims 59 to 65 in the preparation of an animal feed composition.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US74068296A | 1996-11-01 | 1996-11-01 | |
US08/740,682 | 1996-11-01 | ||
PCT/US1997/020441 WO1998020133A2 (en) | 1996-11-01 | 1997-10-31 | Proteins with enhanced levels of essential amino acids |
Publications (2)
Publication Number | Publication Date |
---|---|
CA2270289A1 CA2270289A1 (en) | 1998-05-14 |
CA2270289C true CA2270289C (en) | 2005-09-27 |
Family
ID=24977593
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002270289A Expired - Fee Related CA2270289C (en) | 1996-11-01 | 1997-10-31 | Proteins with enhanced levels of essential amino acids |
Country Status (5)
Country | Link |
---|---|
EP (1) | EP0946729A2 (en) |
AU (1) | AU728086B2 (en) |
CA (1) | CA2270289C (en) |
HU (1) | HUP0000810A3 (en) |
WO (1) | WO1998020133A2 (en) |
Families Citing this family (89)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6171640B1 (en) * | 1997-04-04 | 2001-01-09 | Monsanto Company | High beta-conglycinin products and their use |
FR2803592A1 (en) | 2000-01-06 | 2001-07-13 | Aventis Cropscience Sa | NOVEL DERIVATIVES OF 3-HYDROXYPICOLINIC ACID, PROCESS FOR THEIR PREPARATION AND FUNGICIDAL COMPOSITIONS CONTAINING SAME |
FR2815969B1 (en) | 2000-10-30 | 2004-12-10 | Aventis Cropscience Sa | TOLERANT PLANTS WITH HERBICIDES BY METABOLIC BYPASS |
US20020188965A1 (en) | 2001-04-20 | 2002-12-12 | Zou-Yu Zhao | Methods of transforming plants |
JP2005534301A (en) * | 2002-07-29 | 2005-11-17 | ウニベルシテ ラバル | Methods for enhancing nutritional value in plant extracts |
NZ564016A (en) | 2005-05-25 | 2009-08-28 | Pioneer Hi Bred Int | Methods for improving crop plant architecture and yield by introducing recombinant expression cassette |
BRPI0613713A2 (en) * | 2005-07-08 | 2011-02-01 | Renessen Llc | high tryptophan soy meal |
AU2007212168A1 (en) | 2006-02-09 | 2007-08-16 | Pioneer Hi-Bred International, Inc. | Genes for enhancing nitrogen utilization efficiency in crop plants |
AR060523A1 (en) | 2006-04-19 | 2008-06-25 | Pioneer Hi Bred Int | ISOLATED POLINUCLEOTID MOLECULES THAT CORRESPOND TO MUTANT ALELOS AND WILD TYPE OF CORN D9 GEN AND METHODS FOR USE |
US7964774B2 (en) | 2008-05-14 | 2011-06-21 | Monsanto Technology Llc | Plants and seeds of spring canola variety SCV384196 |
EP2344640A1 (en) | 2008-10-30 | 2011-07-20 | Pioneer Hi-Bred International Inc. | Manipulation of glutamine synthetases (gs) to improve nitrogen use efficiency and grain yield in higher plants |
EA201171245A1 (en) | 2009-04-14 | 2012-05-30 | Пайонир Хай-Бред Интернешнл, Инк. | ACC SYNTHASE MODULATION, IMPROVING PLANT YIELD UNDER CONDITIONS OF LOW NITROGEN CONTAINMENT |
US8466342B2 (en) | 2009-06-09 | 2013-06-18 | Pioneer Hi Bred International Inc | Early endosperm promoter and methods of use |
US9175301B2 (en) | 2009-07-24 | 2015-11-03 | Pioneer Hi Bred International Inc | Use of dimerization domain component stacks to modulate plant architecture |
BR112012003761A2 (en) | 2009-08-20 | 2015-09-08 | Pionner Hi Bred International Inc | nitrate transporter polynucleotide, vector, expression cassette, host cell, transgenic plant, seed, method for increasing productivity in a plant, nt protein level modulation method in a plant cell and in a plant, nitrate uptake in plants, decoupling method in a plant nitrate that signals nitrate uptake, recombinant or isolated nitrate reductase (nr) polynucleotide. |
MX2012002113A (en) | 2009-08-20 | 2012-08-08 | Pioneer Hi Bred Int | Functional expression of shuffled yeast nitrate transporter (ynti) in maize to improve nitrate uptake under low nitrate environment. |
EP2494056A1 (en) | 2009-10-26 | 2012-09-05 | Pioneer Hi-Bred International Inc. | Somatic ovule specific promoter and methods of use |
MX2012007855A (en) | 2010-01-06 | 2013-06-05 | Du Pont | Identification of diurnal rhythms in photosynthetic and non-photosynthetic tissues from zea mays and use in improving crop plants. |
AU2011212538B2 (en) | 2010-02-02 | 2014-12-04 | BASF Agricultural Solutions Seed US LLC | Soybean transformation using HPPD inhibitors as selection agents |
AU2011328963B2 (en) | 2010-11-17 | 2016-12-08 | Pioneer Hi-Bred International, Inc. | Prediction of phenotypes and traits based on the metabolome |
EP2689019A2 (en) | 2011-03-23 | 2014-01-29 | Pioneer Hi-Bred International Inc. | Methods for producing a complex transgenic trait locus |
AU2012333207A1 (en) | 2011-06-21 | 2014-01-23 | E.I. Du Pont De Nemours And Company | Methods and compositions for producing male sterile plants |
CA2844101A1 (en) | 2011-08-03 | 2013-02-07 | E. I. Du Pont De Nemours And Company | Methods and compositions for targeted integration in a plant |
US20140298544A1 (en) | 2011-10-28 | 2014-10-02 | Pioneer Hi Bred International Inc | Engineered PEP carboxylase variants for improved plant productivity |
EP2773762A1 (en) | 2011-10-31 | 2014-09-10 | Pioneer Hi-Bred International Inc. | Improving plant drought tolerance, nitrogen use efficiency and yield |
WO2013096818A1 (en) | 2011-12-21 | 2013-06-27 | The Curators Of The University Of Missouri | Soybean variety s05-11268 |
WO2013096810A1 (en) | 2011-12-21 | 2013-06-27 | The Curators Of The University Of Missouri | Soybean variety s05-11482 |
CN104039966A (en) | 2012-01-06 | 2014-09-10 | 先锋国际良种公司 | Ovule specific promoter and methods of use |
WO2013103365A1 (en) | 2012-01-06 | 2013-07-11 | Pioneer Hi-Bred International, Inc. | Pollen preferred promoters and methods of use |
AR089793A1 (en) | 2012-01-27 | 2014-09-17 | Du Pont | METHODS AND COMPOSITIONS TO GENERATE COMPOSITE TRANSGENIC RISK LOCUS |
WO2013138309A1 (en) | 2012-03-13 | 2013-09-19 | Pioneer Hi-Bred International, Inc. | Genetic reduction of male fertility in plants |
WO2013138358A1 (en) | 2012-03-13 | 2013-09-19 | Pioneer Hi-Bred International, Inc. | Genetic reduction of male fertility in plants |
US8859857B2 (en) | 2012-04-26 | 2014-10-14 | Monsanto Technology Llc | Plants and seeds of spring canola variety SCV259778 |
US8835720B2 (en) | 2012-04-26 | 2014-09-16 | Monsanto Technology Llc | Plants and seeds of spring canola variety SCV967592 |
US8878009B2 (en) | 2012-04-26 | 2014-11-04 | Monsanto Technology, LLP | Plants and seeds of spring canola variety SCV318181 |
WO2014059155A1 (en) | 2012-10-11 | 2014-04-17 | Pioneer Hi-Bred International, Inc. | Guard cell promoters and uses thereof |
EP2906703A2 (en) | 2012-10-15 | 2015-08-19 | Pioneer Hi-Bred International, Inc. | Methods and compositions to enhance activity of cry endotoxins |
WO2014081673A2 (en) | 2012-11-20 | 2014-05-30 | Pioneer Hi-Bred International, Inc. | Engineering plants for efficient uptake and utilization of urea to improve crop production |
WO2014164014A1 (en) | 2013-03-11 | 2014-10-09 | Pioneer Hi-Bred International, Inc. | Genes for improving nutrient uptake and abiotic stress tolerance in plants |
BR112015022737A2 (en) | 2013-03-11 | 2018-04-24 | Pioneer Hi-Bred International, Inc | recombinant polynucleotide construct, plant cell, plant, seed, method for regulating expression |
WO2014164828A2 (en) | 2013-03-11 | 2014-10-09 | Pioneer Hi-Bred International, Inc. | Methods and compositions employing a sulfonylurea-dependent stabilization domain |
WO2014164116A1 (en) | 2013-03-13 | 2014-10-09 | Pioneer Hi-Bred International, Inc. | Functional expression of bacterial major facilitator superfamily (sfm) gene in maize to improve agronomic traits and grain yield |
AU2014241045B2 (en) | 2013-03-13 | 2017-08-31 | Pioneer Hi-Bred International, Inc. | Glyphosate application for weed control in brassica |
US20160010101A1 (en) | 2013-03-13 | 2016-01-14 | Pioneer Hi-Bred International, Inc. | Enhanced nitrate uptake and nitrate translocation by over- expressing maize functional low-affinity nitrate transporters in transgenic maize |
CN105339380A (en) | 2013-03-14 | 2016-02-17 | 先锋国际良种公司 | Compositions and methods to control insect pests |
BR112015023304A2 (en) | 2013-03-14 | 2017-07-18 | Pioneer Hi Bred Int | method to improve abiotic stress tolerance of a plant, plant and seed |
WO2014143996A2 (en) | 2013-03-15 | 2014-09-18 | Pioneer Hi-Bred International, Inc. | Compositions and methods of use of acc oxidase polynucleotides and polypeptides |
EP2971000A4 (en) | 2013-03-15 | 2016-11-23 | Pioneer Hi Bred Int | Phi-4 polypeptides and methods for their use |
BR112015026499B1 (en) | 2013-04-17 | 2022-06-21 | Pioneer Hi-Bred International, Inc | Method for characterizing a target sequence |
CN106232820A (en) | 2013-08-16 | 2016-12-14 | 先锋国际良种公司 | Insecticidal protein and using method thereof |
US11773400B2 (en) | 2013-08-22 | 2023-10-03 | E.I. Du Pont De Nemours And Company | Methods for producing genetic modifications in a plant genome without incorporating a selectable transgene marker, and compositions thereof |
CA3223359A1 (en) | 2013-09-13 | 2015-03-19 | Pioneer Hi-Bred International, Inc. | Insecticidal proteins and methods for their use |
BR112016018287A2 (en) | 2014-02-07 | 2017-10-10 | Du Pont | insecticide proteins and methods for their use |
CN106687594A (en) | 2014-07-11 | 2017-05-17 | 纳幕尔杜邦公司 | Compositions and methods for producing plants resistant to glyphosate herbicide |
US20170218384A1 (en) | 2014-08-08 | 2017-08-03 | Pioneer Hi-Bred International, Inc. | Ubiquitin promoters and introns and methods of use |
RU2017112324A (en) | 2014-09-12 | 2018-10-15 | Пайонир Хай-Бред Интернэшнл, Инк. | CREATION OF WEBSITES OF SITE-SPECIFIC INTEGRATION FOR COMPLEX SIGNS LOCUSES IN CORN AND SOY, AND ALSO WAYS OF APPLICATION |
WO2016044092A1 (en) | 2014-09-17 | 2016-03-24 | Pioneer Hi Bred International Inc | Compositions and methods to control insect pests |
RU2733898C2 (en) | 2014-10-16 | 2020-10-09 | Пайонир Хай-Бред Интернэшнл, Инк. | Insecticidal proteins and methods of using them |
WO2016099916A1 (en) | 2014-12-19 | 2016-06-23 | E. I. Du Pont De Nemours And Company | Polylactic acid compositions with accelerated degradation rate and increased heat stability |
US11041158B2 (en) | 2014-12-22 | 2021-06-22 | AgBiome, Inc. | Optimization methods for making a synthetic gene |
US11492632B2 (en) | 2015-03-19 | 2022-11-08 | Pioneer Hi-Bred International, Inc. | Methods and compositions for accelerated trait introgression |
CA2985198A1 (en) | 2015-05-19 | 2016-11-24 | Pioneer Hi-Bred International, Inc. | Insecticidal proteins and methods for their use |
US10647995B2 (en) | 2015-06-16 | 2020-05-12 | Pioneer Hi-Bred International, Inc. | Compositions and methods to control insect pests |
CN116003550A (en) | 2015-08-06 | 2023-04-25 | 先锋国际良种公司 | Insecticidal proteins of plant origin and methods of use thereof |
JP2018527931A (en) | 2015-08-28 | 2018-09-27 | パイオニア ハイ−ブレッド インターナショナル, インコーポレイテッド | OCRROBACTRUM-mediated transformation of plants |
BR112018007796A2 (en) | 2015-11-06 | 2018-10-30 | Du Pont | soybean plants, soybean plant parts or soybean seeds, method of selecting a soybean cell, methods of selecting a soybean cell and producing a locus and nucleic acid molecule |
WO2017105987A1 (en) | 2015-12-18 | 2017-06-22 | Pioneer Hi-Bred International, Inc. | Insecticidal proteins and methods for their use |
CA3004056C (en) | 2015-12-22 | 2024-01-23 | Pioneer Hi-Bred International, Inc. | Tissue-preferred promoters and methods of use |
US9896696B2 (en) | 2016-02-15 | 2018-02-20 | Benson Hill Biosystems, Inc. | Compositions and methods for modifying genomes |
MX2018013249A (en) | 2016-05-04 | 2019-02-13 | Pioneer Hi Bred Int | Insecticidal proteins and methods for their use. |
BR112018076047A2 (en) | 2016-06-16 | 2019-03-26 | Pioneer Hi Bred Int | silencing element, dna construct, expression cassette, host cell, composition, plant cell, plant or plant part, transgenic seed, method for controlling a plant insect pest and kit |
ES2924552T3 (en) | 2016-06-24 | 2022-10-07 | Pioneer Hi Bred Int | Regulatory elements of plants and methods of using them |
CA3026113A1 (en) | 2016-07-01 | 2018-01-04 | Pioneer Hi-Bred International, Inc. | Insecticidal proteins from plants and methods for their use |
US20210292778A1 (en) | 2016-07-12 | 2021-09-23 | Pioneer Hi-Bred International, Inc. | Compositions and methods to control insect pests |
US11140902B2 (en) | 2016-09-27 | 2021-10-12 | University Of Florida Research Foundation, Inc. | Insect toxin delivery mediated by a densovirus coat protein |
EP3535285B1 (en) | 2016-11-01 | 2022-04-06 | Pioneer Hi-Bred International, Inc. | Insecticidal proteins and methods for their use |
US20200407737A1 (en) | 2017-05-03 | 2020-12-31 | KWS SAAT SE & Co. KGaA | Use of crispr-cas endonucleases for plant genome engineering |
EP4317443A3 (en) | 2017-08-09 | 2024-02-28 | RiceTec, Inc. | Compositions and methods for modifying genomes |
AU2018337756A1 (en) | 2017-09-25 | 2020-04-02 | Pioneer Hi-Bred, International, Inc. | Tissue-preferred promoters and methods of use |
US11820791B2 (en) | 2018-03-14 | 2023-11-21 | Pioneer Hi-Bred International, Inc. | Insecticidal proteins from plants and methods for their use |
EP3764798A4 (en) | 2018-03-14 | 2021-12-22 | Pioneer Hi-Bred International, Inc. | Insecticidal proteins from plants and methods for their use |
CA3096516A1 (en) | 2018-05-22 | 2019-11-28 | Pioneer Hi-Bred International, Inc. | Plant regulatory elements and methods of use thereof |
CA3097915A1 (en) | 2018-06-28 | 2020-01-02 | Pioneer Hi-Bred International, Inc. | Methods for selecting transformed plants |
CA3111090A1 (en) | 2018-10-31 | 2020-05-07 | Pioneer Hi-Bred International, Inc. | Compositions and methods for ochrobactrum-mediated plant transformation |
BR112021026220A2 (en) | 2019-06-25 | 2022-02-15 | Inari Agriculture Tech Inc | Enhanced genome editing by homology-dependent repair |
WO2021046526A1 (en) | 2019-09-05 | 2021-03-11 | Benson Hill, Inc. | Compositions and methods for modifying genomes |
CN115867564A (en) | 2020-07-14 | 2023-03-28 | 先锋国际良种公司 | Insecticidal proteins and methods of use thereof |
WO2023119135A1 (en) | 2021-12-21 | 2023-06-29 | Benson Hill, Inc. | Compositions and methods for modifying genomes |
WO2023141464A1 (en) | 2022-01-18 | 2023-07-27 | AgBiome, Inc. | Method for designing synthetic nucleotide sequences |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5096815A (en) * | 1989-01-06 | 1992-03-17 | Protein Engineering Corporation | Generation and selection of novel dna-binding proteins and polypeptides |
DK223790D0 (en) * | 1990-09-18 | 1990-09-18 | Novo Nordisk As | PROTEASE-CONTAINING DETERGENT COMPOSITION |
DK43192D0 (en) * | 1992-03-31 | 1992-03-31 | Novo Nordisk As | detergent composition |
WO1994010315A2 (en) * | 1992-10-23 | 1994-05-11 | Pioneer Hi-Bred International, Inc. | Process for enhancing the content of a selected amino acid in a seed storage protein |
EP0745126B1 (en) * | 1993-01-13 | 2001-09-12 | Pioneer Hi-Bred International, Inc. | High lysine derivatives of alpha-hordothionin |
WO1995027068A1 (en) * | 1994-04-04 | 1995-10-12 | Pioneer Hi-Bred International, Inc. | Reduction of endogenous seed protein levels in plants |
-
1997
- 1997-10-31 AU AU51749/98A patent/AU728086B2/en not_active Ceased
- 1997-10-31 CA CA002270289A patent/CA2270289C/en not_active Expired - Fee Related
- 1997-10-31 HU HU0000810A patent/HUP0000810A3/en unknown
- 1997-10-31 WO PCT/US1997/020441 patent/WO1998020133A2/en not_active Application Discontinuation
- 1997-10-31 EP EP97946614A patent/EP0946729A2/en not_active Withdrawn
Also Published As
Publication number | Publication date |
---|---|
HUP0000810A2 (en) | 2000-06-28 |
AU728086B2 (en) | 2001-01-04 |
WO1998020133A3 (en) | 1998-07-23 |
AU5174998A (en) | 1998-05-29 |
EP0946729A2 (en) | 1999-10-06 |
WO1998020133A2 (en) | 1998-05-14 |
CA2270289A1 (en) | 1998-05-14 |
HUP0000810A3 (en) | 2002-02-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CA2270289C (en) | Proteins with enhanced levels of essential amino acids | |
US11130958B2 (en) | Plants having increased tolerance to heat stress | |
Drakakaki et al. | Constitutive expression of soybean ferritin cDNA intransgenic wheat and rice results in increased iron levels in vegetative tissues but not in seeds | |
US9290773B2 (en) | Transgenic plants with enhanced agronomic traits | |
US7189889B2 (en) | Methods for improving seed characteristics | |
US7465852B2 (en) | Compositions and methods for altering the disulfide status of proteins | |
US20050108791A1 (en) | Transgenic plants with improved phenotypes | |
US20130276161A1 (en) | Transgenic plants with enhanced agronomic traits | |
US6800726B1 (en) | Proteins with increased levels of essential amino acids | |
HU214630B (en) | Method for increasing the level of free lysine in plant cells, and for producing transformed plant cells and expression cassettes | |
WO1999067404A1 (en) | Alteration of hemicellulose concentration in plants by rgp | |
US20100162433A1 (en) | Plants with improved nitrogen utilization and stress tolerance | |
EP2496071B1 (en) | Plants with improved nitrogen utilization and stress tolerance | |
US6184437B1 (en) | Lysine rich protein from winged bean | |
MXPA02004482A (en) | Genes encoding enzymes for lignin biosynthesis and uses thereof. | |
MX2014007711A (en) | Methods for improving crop yield. | |
AU2011315102B2 (en) | Production of plants having improved water-deficit tolerance | |
WO2000047614A1 (en) | Transgenic plants with modified expression of the dp protein | |
US6265636B1 (en) | Pyruvate dehydrogenase kinase polynucleotides, polypeptides and uses thereof | |
WO2013030812A1 (en) | High-methionine transgenic soybean seeds expressing the arabidopsis cystathionine gamma-synthase gene | |
MXPA99004124A (en) | Proteins with enhanced levels of essential amino acids | |
US7179957B1 (en) | Method for altering nitrogen or oil content of seeds by down regulating AGL11 expression or activity | |
EP2003205A2 (en) | Improved grain quality through altered expression of seed proteins | |
Altenbach et al. | Transgenic plants with improved protein quality | |
MXPA01006841A (en) | Maize alternative oxidase genes and uses thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
EEER | Examination request | ||
MKLA | Lapsed |
Effective date: 20151102 |