CN101883783A - Plants having enhanced yield-related traits and a method for making the same
- Publication number
- CN101883783A (application number CN2008801187499)
- Authority
- CN
- China
- Prior art keywords
- plant
- polypeptide
- seq
- nucleic acid
- sequence
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/415—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A40/00—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
- Y02A40/10—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in agriculture
- Y02A40/146—Genetically Modified [GMO] plants, e.g. transgenic plants
Abstract
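The present invention concerns a method for enhancing yield-related traits in plants by modulating expression in a plant of a nucleic acid encoding a PATL (PATELLIN) polypeptide, a PRP38 (Precursor RNA Processing factor 38) polypeptide or an ADA2 (Transcriptional Adaptor 2) polypeptide. The invention also concerns a method for increasing Thousand Kernel Weight, total seed weight and/or number of filled seeds by modulating expression in a plant of a nucleic acid encoding a GATA-like polypeptide. The invention also concerns a method for enhancing various plant yield-related traits by increasing expression in a plant of a nucleic acid encoding a WD40 repeat (WDR) 23-like polypeptide. The invention also provides hitherto unknown PATL nucleic acids and constructs useful in the methods of the invention.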
Description
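The present invention relates generally to the field of molecular biology and concerns a method for enhancing yield-related traits in plants by modulating expression in a plant of a nucleic acid encoding a PATL (PATELLIN) polypeptide, a PRP38 (Precursor RNA Processing factor 38) polypeptide, a GATA-like polypeptide, an ADA2 (Transcriptional Adaptor 2) polypeptide or a WDR23-like polypeptide. The invention also concerns plants having modulated expression of a nucleic acid encoding a PATL, PRP38, GATA-like, ADA2 or WDR23-like polypeptide, which plants have enhanced yield-related traits relative to corresponding wild-type plants or other control plants. The invention also provides hitherto unknown PATL, PRP38, GATA-like, ADA2 or WDR23-like nucleic acids and constructs useful in the methods of the invention.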
在一个实施方案中,本发明还涉及用于通过调节编码GATA样多肽的核酸在植物中的表达来改善植物生长特征的方法。本发明还涉及具有编码GATA样多肽的核酸的受调节的表达的植物,所述植物相对于对应的野生型植物或其他对照植物具有改善的生长特征。本发明也提供了在本发明方法中有用的构建体。
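In another embodiment, the invention also concerns a method for enhancing various plant yield-related traits by increasing expression in a plant of a nucleic acid sequence encoding a WD40 repeat (WDR) 23-like polypeptide. The invention also concerns plants having increased expression of a nucleic acid encoding a WDR23-like polypeptide, which plants have enhanced yield-related traits relative to control plants. The invention further relates to specific nucleic acid sequences encoding the aforementioned polypeptides having the aforementioned plant yield-increasing activity, and to nucleic acid constructs, vectors and plants containing said nucleic acid sequences.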
持续增长的世界人口和农业用可耕地供应萎缩刺激了有关提高农业效率的研究。常规的作物及园艺学改良手段利用选择育种技术以鉴定具有受欢迎特征的植物。然而,此类选择育种技术具有几个缺陷,即这些技术一般耗费很多劳动并且产生这样的植物,其经常含有异源的遗传组分,其可能不总导致受欢迎性状从亲代植物传递下去。分子生物学进展已经允许人类改良动物及植物的种质。植物的遗传工程使得可以分离和操作遗传物质(一般处于DNA或RNA形式)并且随后导入该遗传物质至植物中。此类技术具有产生具备多种经济学、农学或园艺学改良性状的作物或植物的能力。
具有特殊经济意义的性状是提高的产量。产量通常定义为来自作物的经济价值的可测量结果。该结果可以就数量和/或品质方面进行定义。产量直接取决于几个因素,例如器官的数目和大小、植物构造(例如枝的数目)、种子产生、叶衰老等。根发育、养分摄入、胁迫耐受性和早期生长势(early vigor)也可以是决定产量的重要因素。优化前述因素因而可以有助于提高作物产量。
种子产量是特别重要的性状,因为许多植物的种子对人和动物营养是重要的。作物如谷物、稻、小麦、卡诺拉油菜和大豆占超过一半的人类总热量摄入,无论通过直接消费种子本身或通过消费基于加工的种子而产生的肉产品。作物也是糖、油及工业加工中所用许多类型代谢物的来源。种子含有胚(新枝条(shoot)和新根的起源)和胚乳(萌发期间和籽苗早期生长期间用于胚生长的养分来源)。种子发育涉及多种基因并且需要代谢物从根、叶和茎转移至正在生长的种子中。胚乳尤其同化糖类、油和蛋白质的代谢前体并且将它们合成为贮藏大分子以灌满籽粒。
植物生物量是饲料作物如苜蓿、饲用谷物和干草的产量。产量的许多代用物已经用于谷物作物。它们当中主要是对植物尺寸的估计。植物尺寸可以根据物种和发育阶段以多种方式测量,不过包括总植物干重、地上部分干重、地上部分鲜重、叶面积、茎体积、植物高度、莲座丛直径、叶长度、根长度、根质量、分蘖数和叶数。许多物种维持在给定发育阶段上植物不同部分的尺寸之间的保守比率。这些异速增长关系用来从这些尺寸量值之一外推至另一种尺寸量值(例如Tittonell等,2005Agric Ecosys & Environ 105:213)。在发育早期的植物尺寸一般与发育中稍后的植物尺寸相关。具有较大叶面积的较大植物通常可以比较小的植物吸收更多光线和二氧化碳并且因而可能会在相同的时段期间获得更大重量(Fasoula和Tollenaar 2005 Maydica 50:39)。此外,这也是植物应当初始实现较大尺寸的微环境优势或遗传优势的潜在延续。存在针对植物尺寸和生长速率的强遗传组分(例如ter Steege等,2005 PlantPhysiology 139:1078),并且因而对于一系列各异遗传表型,在一种环境条件下的植物尺寸很可能关联于另一种环境条件下的尺寸(Hittalmani等,2003 Theoretical AppliedGenetics 107:679)。以这种方式,使用标准环境作为作物在田间于不同位置及时间所遭遇的多样且动态环境的代用物。
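Another important trait for many crops is early vigour. Improving early vigour is an important objective of modern rice breeding programs for both temperate and tropical rice cultivars. Long roots are important for proper soil anchorage in water-seeded rice. Where rice is sown directly into flooded fields, and where plants must emerge rapidly through water, longer shoots are associated with vigour. Where drill-seeding is practised, longer mesocotyls and coleoptiles are important for good seedling emergence. The ability to engineer early vigour into plants would be of great importance in agriculture. For example, poor early vigour has limited the introduction of maize (Zea mays L.) hybrids based on Corn Belt germplasm in the European Atlantic region.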
收获指数即种子产量对地上部分干重的比率在许多环境条件下是相对稳定的并且因而可以经常获得植物尺寸与谷物产量之间的强相关(例如Rebetzke等,2002 Crop Science 42:739)。这些过程是内在联系的,因为谷物生物量的主要部分取决于植物叶和茎的现有和储备光合生产率(Gardener等,1985 Physiology of Crop Plants.Iowa State UniversityPress,第68-73页)。因此,选择植物尺寸(甚至在发育的早期)已经用作未来潜在产能的指示物(例如Tittonell等,2005 Agric Ecosys & Environ105:213)。当检验遗传差异对胁迫耐受性的影响时,使土壤属性、温度、水和养分有效性和光强度标准化的能力是温室或植物生长室环境与田间相比的固有优势。然而,产量因不良授粉所致的人为限制可能限制这些受控环境检验产量差异的用途,其中所述的不良授粉因缺少风和昆虫或成熟根或根冠生长的足够空间引起。因此,在标准化条件下在生长室或温室测量发育早期的植物尺寸是提供潜在遗传产量优势指示的标准操作。
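A further important trait is improved abiotic stress tolerance. Abiotic stress is a primary cause of crop loss worldwide, reducing average yields for most major crop plants by more than 50% (Wang et al., (2003) Planta 218: 1-14). Abiotic stresses may be caused by drought, salinity, extremes of temperature, chemical toxicity, excess or deficiency of nutrients (macroelements and/or microelements), radiation and oxidative stress. The ability to improve plant tolerance to abiotic stress would be of great economic advantage to farmers worldwide and would allow crops to be cultivated during adverse conditions and on land where cultivation might not otherwise be possible.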
因而可以通过优化前述因素之一提高作物产量。
取决于最终用途,对某些产量性状的改良可能优先于其它产量性状。例如对于应用如饲料或木材生产或生物燃料资源而言,增加植物营养体部分可能是希望的,而对于应用如面粉、淀粉或油生产而言,种子参数提高可能是特别希望的。即便在种子参数当中,某些参数可以更优先于其它参数,这取决于用途。多种机制可以有助于提高种子产量,无论其形式为提高的种子尺寸或提高的种子数目。
增强植物中产量相关性状(种子产量和/或生物量)的一种方法可以是通过调节植物的内在生长机制如细胞周期或参与植物生长或参与防御机制的多种信号传导途径。
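It has now been found that various yield-related traits may be enhanced in plants by modulating expression in a plant of a nucleic acid encoding a PATL (Patellin) polypeptide, a PRP38 (Precursor RNA Processing factor 38) polypeptide, an ADA2 (Transcriptional Adaptor 2) polypeptide or a WD40 repeat (WDR) 23-like polypeptide.
It has further been found that various growth characteristics may be improved by modulating expression in a plant of a nucleic acid encoding a GATA-like polypeptide (the protein of interest).
The enhanced yield-related traits comprise one or more of: increased total seed yield per plant, increased seed fill rate, increased number of filled seeds, increased harvest index or increased Thousand Kernel Weight.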
脂类(lipids)是在非极性溶剂(如氯仿或醚)中可溶的物质。脂类是活生物体中的必需成分。如糖脂、磷脂和胆固醇的脂类是细胞膜的关键结构成分,而甘油三酯是生物能源分子。磷脂酰肌醇(PtdIns)和磷脂酰胆碱(PtdCho)是磷脂的实例。脂类和蛋白质间的相互作用在将涉及多种过程(如细胞信号发放和细胞增殖)的蛋白质和糖脂靶向至特定的膜和细胞内部位中起作用。多种蛋白质与脂类的生物合成、转运和吸收相关。此外,涉及信号转导和蛋白质导向的关键蛋白质具有翻译后附加至其的脂类衍生基团(Stryer,L.(1995)Biochemistry,W.H.Freeman和Co.,New York N.Y.,第264-267页,934)。
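Phosphatidylinositol/phosphatidylcholine transfer proteins (PIPTs) are ubiquitous proteins involved in coordinating PtdIns and PtdCho metabolism by facilitating transfer between the different membrane compartments of eukaryotic cells. In yeast, the major PIPT is the SEC14 protein, which is essential for secretion from the trans-Golgi network (Bankaitis et al., 1989, J. Cell Biol. 108: 1271-81). Similar proteins have been found in plants and other higher organisms. In plants, several SEC14 homologues have been described that share about 25% amino acid sequence similarity with the Saccharomyces cerevisiae SEC14 protein; in some cases the putative PtdIns/PtdCho transfer function has been verified experimentally by complementation of the sec14-1 temperature-sensitive yeast mutant (Jouannic et al., 1998). Plant SEC14 proteins reportedly play a role in a variety of biological processes, such as cytokinesis, hyperosmotic stress-induced signalling pathways, vesicle trafficking between various membranes, or membrane biogenesis during nodulation (Allen-Baume et al., 2002, FEBS Letters 531: 74-80; Peterman et al., 2004, Plant Phys. 136: 3080-3094; Monks et al., 2001, Plant Cell 13: 1205-19; Kapranov et al., 2001, Plant Cell 13: 1369-1382).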
在拟南芥(Arabidopsis thaliana)中,已报道Patellin1(PATL1)蛋白质(SEC14同源物蛋白质家族成员)在胞质分裂期间定位至细胞板和显示具有PtdIns结合活性。PATL1是拟南芥中六种蛋白质的小家族的成员,其特征在于存在SEC14和GOLD(高尔基体动力学)结构域这两种保守结构域。
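The SEC14 domain is found in the yeast SEC14 protein and in RhoGAPs, RhoGEFs and the RasGAP neurofibromin (NF1). The domain is also found at the C-terminus of various retinaldehyde/retinal-binding proteins that may be functional components of the visual cycle, and in the Trio protein, a multifunctional factor that integrates and amplifies signals involved in actin remodelling. The SEC14 domain is sometimes referred to as the CRAL_TRIO domain. The SEC14 domain is involved in lipid binding (Sha and Luo, 1999, Biochim Biophys Acta 1441: 268-77).
The GOLD domain is a protein module found in several eukaryotic Golgi and lipid-trafficking proteins. It is typically between 90 and 150 amino acids long. Most of the size difference observed in the GOLD domain superfamily is traceable to a single large low-complexity insert found in some versions of the domain. The GOLD domain occurs in proteins that may interact with membranes or that play a role in the interaction of various proteins with cytoskeletal filaments, such as animal SEC14 proteins or yeast oxysterol-binding protein homologue 3 (OSH3). The GOLD domain is predicted to mediate diverse protein-protein interactions (Anantharaman et al., 2002, Genome Biol. 3).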
专利申请WO 2004/09014描述了涉及胁迫耐性的部分PATL蛋白质。
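The synthesis and function of messenger RNA (mRNA) in the cell require a series of events comprising transcription, processing, transport, translation and degradation. RNA processing refers to events that modify RNA post-transcriptionally. In eukaryotes, most nascent pre-mRNAs contain introns, whose excision results in the precise joining of exons to produce mature mRNA, the form of RNA used by ribosomes for translation into protein. Post-transcriptional modification of RNA also comprises 5' capping and 3' polyadenylation, which affect stability and translation efficiency. The relationship between mRNA translation and turnover is critical for the regulation of gene expression and for the correct functioning of the cell. In eukaryotes, dozens of proteins are involved in RNA metabolism. An example of such a protein is the pre-mRNA splicing factor PRP38.
Yeast PRP38 was identified in a genetic screen for temperature-sensitive Saccharomyces cerevisiae mutants defective in pre-mRNA splicing (Blanton et al., 1992, Mol. Cell. Biol. 12, 3939-3947). The excision of intervening sequences from eukaryotic pre-mRNA transcripts takes place in the nucleus on a large, complex structure called the spliceosome. The spliceosome carries out intron excision through a two-step mechanism comprising (i) cleavage at the 5' splice site and ligation of the 5' terminal nucleotide of the intron to an adenosine near the 3' end of the intron, and (ii) cleavage at the 3' splice site and exon ligation. Spliceosome assembly proceeds by the sequential addition of the U1, U2, U4/U6 and U5 snRNPs (small nuclear ribonucleoproteins U1-U6). Late in the assembly process, U4 dissociates from the spliceosome. The catalytic events of splicing that lead to excision of introns from the pre-mRNA molecule are believed to occur after U4 dissociation. In yeast, PRP38 is reportedly dispensable for spliceosome assembly but required for the conformational changes that lead to catalytic activation of the spliceosome (Xie et al., (1998) EMBO Journal vol. 17, pp. 2938-2946).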
在拟南芥中,依照其与SR蛋白质的序列相似性,PRP38蛋白质(AtPRP38)被称为SRL1(SR样1)(Forment等,Plant J.2002年6月;30(5):511-9)。但是,PRP38在结构上不同于其他SR蛋白质。SR蛋白质一般包含RRM结构域(RNA结合结构域)和RS结构域(Kalyna和Barta Biochem Soc Trans.2004 32:561-4.),而AtPRP38包含RS结构域但缺少RRM结构域。
WO 01/81599描述了用编码几种SR蛋白质的核酸在酵母和植物中改良胁迫耐性的方法。已发现SRL1及其他在前mRNA剪接中起作用的蛋白质在植物对非生物胁迫的响应中发挥重要作用(Lee等,2006Plant Cell.Jul;18(7):1736-49)。
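The GATA family forms one of the major families of Cys2/Cys2-type zinc finger transcription factors in eukaryotes (found in, for example, cellular slime moulds, plants, fungi, nematodes, insects, echinoderms and vertebrates) and comprises one or two highly conserved zinc finger DNA-binding domains. The consensus sequence for DNA binding is CX2CX17-20CX2C and includes a zinc atom coordinated by four conserved cysteines (Omichinski et al., 1993; Reyes et al., 2004). This domain is usually followed by a basic region. It characteristically binds DNA at the GATA recognition sequence (A/T)GATA(A/G) (Martin and Orkin, 1990; Omichinski et al., 2003). Animal GATA factors typically contain 17 residues between the second and third cysteines, whereas in fungi this number may be 17 or 18 (rarely 19 or 20). Plants have 18 or 20 residues between the second and third cysteines.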
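The CX2CX17-20CX2C consensus described above can be written as a regular expression. The following is only a minimal sketch for scanning a protein sequence for candidate zinc fingers; the function names and the toy sequence in the usage note are hypothetical, and a regex hit is not evidence of a functional GATA domain.

```python
import re

# Consensus from the text: C-X2-C-X17-20-C-X2-C, where X is any residue.
GATA_ZINC_FINGER = re.compile(r"C.{2}C.{17,20}C.{2}C")


def find_gata_fingers(protein_seq):
    """Return (start, spacer_length, motif) for each candidate zinc finger.

    spacer_length is the number of residues between the second and
    third cysteine of the matched motif.
    """
    hits = []
    for match in GATA_ZINC_FINGER.finditer(protein_seq.upper()):
        motif = match.group(0)
        # The motif has 4 cysteines plus 2 x 2 flanking residues = 8 fixed positions.
        spacer = len(motif) - 8
        hits.append((match.start(), spacer, motif))
    return hits


def is_plant_like_spacer(spacer):
    """The text states that plant GATA factors have 18 or 20 spacer residues."""
    return spacer in (18, 20)
```

For example, `find_gata_fingers("MK" + "CAAC" + "G" * 18 + "CAAC" + "KR")` reports one candidate starting at index 2 with an 18-residue spacer. Because the regex is greedy and non-overlapping, a real analysis would normally rely on a curated domain database rather than on this pattern alone.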
在人类中,最初的GATA1被鉴定为促进球蛋白基因表达所需的转录因子(Pevny等,1991)。第一种植物GATA(NTL1)作为粗糙链孢霉(Neurospora crassa)转录因子NIT2(在氮限制条件中激活氮代谢酶基因表达的蛋白质)的植物同源物分离自烟草。NTL1在第二和第三个半胱氨酸之间包含18个间隔(CX2CX18CX2C)(Daniel-Vedele和Caboche,1997)。拟南芥和稻(Oryza sativa)基因组中分别存在29和28个编码假定的GATA转录因子(CX2CX18或20CX2C)的基因座(Reyes等,2004):拟南芥仅具有带一个GATA结构域的蛋白质,而稻具有两种包含两个GATA结构域的蛋白质和一种包含三个GATA结构域的蛋白质。稻转录因子数据库(DRTF,Gao等,Bioinformatics 2006,22,1286-1287)提出籼稻(Indica)中有28种GATA,粳稻(Japonica)中有23种GATA。稻和拟南芥中的转录因子GATA家族可以划分为7个亚家族,这些亚家族中的一些在两个物种中均存在,而其他亚家族是两个物种之一独有的。
GATA-样蛋白质(HAN)在强组成型35S启动子控制下的异位表达严重地影响植物的生存力和发育(Zhao等,Plant Cell 16,2586-2600,2004)。转化的植物矮小,具有异常形成的叶和较小且畸形的花序。US2007/0250956公开OsGATA11及其对提高种子产量的用途。
植物中的转录调节是许多生物学过程的基础,如对环境的适应和代谢和生理平衡的调节。因此,改变植物中的基因转录可导致植物生长和发育的显著改变。这种特性可用来改良作物植物的性能。存在数百种控制基因表达的基因,其中只有少数已显示在调节其在植物中的表达时对农业的目的性状具有有益作用(Vinocur和Altman,2005,CurrentOpinion in Biotechnology 16,123-132;Gutterson和Reuber 2004.Current Opinion in Plant Biology,7,465-471)。为增强产量性状调节基因表达的可用性将有助于改良作物性能,并最终有利于基于农业的产业。
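In eukaryotes, gene expression is coupled to chromatin modifications that make promoters more accessible to RNA polymerase II and other components of the transcription machinery. Two major classes of protein complexes affect chromatin dynamics. One class comprises the ATP-dependent chromatin remodellers (SWI/SNF-related complexes), and the second comprises complexes that modify histone proteins. One such histone modification is mediated by histone acetyltransferases (HATs), which acetylate specific lysine residues present in the N-termini of the core histones (Narlikar et al., 2002, Cell 108, 475-487). One of the prototypical histone acetyltransferases acting as a transcriptional coactivator is called Gcn5. In yeast, Gcn5 forms distinct protein complexes known to include adaptor proteins, such as ADA, SAGA, SALSA and SLIK. Homologous complexes have been identified in higher eukaryotes, including plants. In Arabidopsis thaliana, two paralogous genes encoding ADA2 proteins have been described (Stockinger et al., 2001, Nucleic Acids Research 29, 1524-1533).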
通用转录因子募集RNA聚合酶II至启动子的TATA框,而转录激活因子结合至称为上游激活位点(UAS)的特定DNA序列。通用转录因子可直接或通过衔接头蛋白质与转录激活因子相互作用。虽然也已经描述了通过DNA结合结构域的相互作用,但衔接头蛋白质一般通过激活结构域与转录激活因子相互作用(Mao等,2006.Biochimica etBiophysica Acta 1759 69-79)。
在拟南芥中,据报道ADA2蛋白质增强GCN5在体外乙酰化组蛋白的能力和调节GCN5的底物特异性。此外,GCN5可在植物同源物独有而真菌和动物同源物缺乏的基序处乙酰化ADA2蛋白质(Mao等,2006)。已发现ADA2b和GCN5中的T-DNA插入突变对植物的生长和发育具有多效性,包含矮小的尺寸、异常的根发育和花内短的花瓣和雄蕊(Vlachonasios等,2003The Plant Cell,15卷,626-638)。还已经公开了来自其他植物物种的编码转录衔接头蛋白质的基因(WO0003026)。
许多不同细胞过程的调节需要蛋白质相互作用结构域进行多肽彼此间结合,和与磷脂、小分子或核酸结合的用途。一种这类蛋白质相互作用结构域称为WD重复(参见综述Smith等,(1999)TIBS 24:181-185)。包含WD重复的蛋白质大量存在于大部分生物中,例如,人类(H.sapiens)中超过300种、秀丽隐杆线虫(C.elegans)中超过140种、拟南芥中超过390种和酿酒酵母中超过90种。虽然通过WD重复(一条多肽中4-16个拷贝之间)的存在而在结构上相关,但这些蛋白质具有极多样的功能。
通过Gly-His(GH)二肽N端距Trp-Asp(WD)二肽(位于基序的C端)10-20个残基,长度一般为大约40(但至多60)个残基(因此称为“WD40”),在一级结构上粗略地定义WD重复。但是,GH二肽和WD二肽都不是绝对保守。GH和WD之间是保守的核心序列,其可以用可从例如在英国的European Bioinformatics Institute(EBI)所拥有的InterPro获得的算法鉴定。InterPro是蛋白质家族、结构域和功能位点的数据库,其中可将见于已知蛋白质的可鉴定特征应用于未知蛋白质序列。
据预测,WD40蛋白质形成β螺旋桨-样结构(环状重复结构),其包含三个潜在的相互作用表面:顶部、底部和环状面。这种扩展的表面积允许WD40蛋白质可逆地与数个蛋白质形成复合物,从而协调涉及数组蛋白质的相继和/或同时相互作用。
涉及WD40蛋白质的非常重要的蛋白质复合物中有一类多蛋白质泛素E3连接酶,Cullin4(CUL4)和损伤DNA结合蛋白质1(DDB1)是其核心蛋白质(Higa等,(2007)Cell Division 2:5;Angers等,(2006)Nature 443:590-593;Higa等,(2006)Nature Cell Biol 8(11):1277-1283;He等,(2006)Genes & Development 20:2949-2954)。这种复合物为底物募集机制锚定WD40蛋白质作为分子衔接头,底物随后被泛素化和破坏。这些WD蛋白质形成称为DCAF(DDB1-CUL4A结合因子)的亚类,它们在两个连续的WD40重复末端包含两个保守的DxR基序(He等(2006)见上)。
一种此类WD40蛋白质WDR23(WD重复23;也称为DCAF11)与见于植物的WD40蛋白质具有显著的一级序列同一性。最先在紫草(Lithospermum erythrorhizon)中将这种WD40蛋白质鉴定为克隆14B(LEC14B;NCBI检索号D83074)。
在专利申请WO2002/016655中,SEQ ID NO:2577涉及编码WDR23-样多肽的拟南芥核酸序列,描述了用SEQ ID NO:1-SEQ IDNO:5379中的任意一个或多个来鉴定植物细胞已暴露过的胁迫条件的方法。在美国专利申请US2004/034888中,SEQ ID NO:13,294涉及编码WDR23-样多肽的拟南芥核酸序列,描述了用SEQ ID NO:1-SEQ IDNO:36,564中的任意一个或多个来产生具有改良特性的植物的方法。
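Surprisingly, it has now been found that modulating expression of a nucleic acid encoding a PRP38 polypeptide gives plants having enhanced (or improved) yield-related traits, in particular increased yield relative to control plants, including under conditions other than the osmotic stress caused by salt, drought, cold or freezing.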
还令人惊奇地,现已发现调节(优选提高)编码GATA-样多肽的核酸的表达使植物相对于对照植物具有显著提高的千粒重(TKW),而如种子数(饱满种子或总种子数)的其他种子产量参数未显著提高。
此外,令人惊奇地,现已发现调节编码ADA2多肽的核酸的表达使植物相对于对照植物具有增强的产量相关性状,尤其是提高的产量。
还令人惊奇地,现已发现提高编码如本文定义的WDR23-样多肽的核酸序列的表达使植物相对于对照植物具有提高的产量相关性状。
此外,令人惊奇地,现已发现调节编码PATL多肽的核酸的表达使植物相对于对照植物具有增强的产量相关性状,尤其是提高的产量。
根据本发明的一个实施方案,提供了相对于对照植物在植物中改良(或增强)产量相关性状的方法,其包括调节编码PATL多肽、或PRP38、或ADA2多肽的核酸在植物中的表达。
根据另一个实施方案,提供了相对于对照植物提高千粒重(TKW)的方法,其包括调节(优选提高)编码GATA-样多肽的核酸在植物中的表达。
根据另一个实施方案,提供了相对于对照植物在植物中增强产量相关性状的方法,其包括提高编码如本文定义的WDR23-样多肽的核酸序列在植物中的表达。提高的产量相关性状包含以下一种或多种:提高的每株植物的种子总产量、提高的种子饱满率、提高的饱满种子数、提高的收获指数或提高的千粒重。
定义
多肽/蛋白质
术语“多肽”和“蛋白质”在本文中可相互交换地使用并且指由肽键连接起来的任意长度聚合物形式的氨基酸。
多核苷酸/核酸/核酸序列/核苷酸序列
术语“多核苷酸”、“核酸序列”、“核苷酸序列”、“核酸”、“核酸分子”在本文中可相互交换地使用并且指任意长度的聚合非分支形式的核苷酸,即核糖核苷酸或脱氧核糖核苷酸或这二者的组合。
对照植物
选择合适的对照植物是实验设计的例行部分并且可以包括相应的野生型植物或无目的基因的相应植物。对照植物一般是相同的植物物种或甚至是与待评估植物相同的品种。对照植物也可以是待评估植物的失效合子。失效合子是因分离而丢失转基因的个体。如本文中所用的“对照植物”不仅指完整植物,也指植物部分,包括种子和种子部分。
同源物
蛋白质的“同源物”包括这样的肽、寡肽、多肽、蛋白质和酶,它们相对于非修饰的所讨论蛋白质具有氨基酸取代、缺失和/或插入并且与衍生它们的非修饰蛋白质具有相似的生物学活性和功能活性。
缺失指从蛋白质中移除一个或多个氨基酸。
插入指一个或多个氨基酸残基被导入蛋白质中的预定位点。插入可以包含氨基端融合和/或羧基端融合以及序列内插入单个或多个氨基酸。通常,在氨基酸序列内部的插入物比氨基端融合物或羧基端融合物小约1至10个残基级别。氨基端或羧基端融合蛋白或融合肽的例子包括如酵母双杂交系统中所用的转录激活物的结合结构域或激活结构域、噬菌体外壳蛋白、(组氨酸)-6-标签、谷胱甘肽S-转移酶-标签、蛋白A、麦芽糖结合蛋白、二氢叶酸还原酶、Tag·100表位、c-myc表位、FLAG-表位、lacZ、CMP(钙调蛋白结合肽)、HA表位、蛋白C表位和VSV表位。
取代指以具有相似特性(如相似的疏水性、亲水性、抗原性、形成或破坏α-螺旋结构或β-折叠结构的倾向性)的其他氨基酸替代蛋白质的氨基酸。氨基酸取代一般是单个残基的,但是根据给予多肽的功能性约束条件,可以是簇集的;插入通常是约1至10个氨基酸残基级别。氨基酸取代优选地是保守性氨基酸取代。保守性取代表是本领域熟知的(见例如Creighton(1984)Proteins.W.H.Freeman和Company(编)和下表1)。
表1:保守性氨基酸取代的例子
残基 | 保守性取代 | 残基 | 保守性取代 |
Ala | Ser | Leu | Ile;Val |
Arg | Lys | Lys | Arg;Gln |
Asn | Gln;His | Met | Leu;Ile |
Asp | Glu | Phe | Met;Leu;Tyr |
Gln | Asn | Ser | Thr;Gly |
Cys | Ser | Thr | Ser;Val |
Glu | Asp | Trp | Tyr |
Gly | Pro | Tyr | Trp;Phe |
His | Asn;Gln | Val | Ile;Leu |
Ile | Leu,Val |
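Table 1 can be encoded as a lookup table. The sketch below is illustrative only: the dictionary reproduces the rows of Table 1 exactly as listed (so the relation is directional and residues without a row, such as Pro, have no listed substitutions), while the function names are hypothetical.

```python
# Rows of Table 1: residue -> residues listed as conservative substitutions.
CONSERVATIVE_SUBSTITUTIONS = {
    "Ala": {"Ser"},
    "Arg": {"Lys"},
    "Asn": {"Gln", "His"},
    "Asp": {"Glu"},
    "Gln": {"Asn"},
    "Cys": {"Ser"},
    "Glu": {"Asp"},
    "Gly": {"Pro"},
    "His": {"Asn", "Gln"},
    "Ile": {"Leu", "Val"},
    "Leu": {"Ile", "Val"},
    "Lys": {"Arg", "Gln"},
    "Met": {"Leu", "Ile"},
    "Phe": {"Met", "Leu", "Tyr"},
    "Ser": {"Thr", "Gly"},
    "Thr": {"Ser", "Val"},
    "Trp": {"Tyr"},
    "Tyr": {"Trp", "Phe"},
    "Val": {"Ile", "Leu"},
}


def is_conservative(original, replacement):
    """True if Table 1 lists `replacement` as a conservative substitution for `original`."""
    return replacement in CONSERVATIVE_SUBSTITUTIONS.get(original, set())


def classify_substitutions(original_residues, variant_residues):
    """Compare two equal-length residue lists position by position.

    Returns (position, original, replacement, is_conservative) for every
    position that differs; positions are 1-based.
    """
    if len(original_residues) != len(variant_residues):
        raise ValueError("sequences must have the same length")
    changes = []
    pairs = zip(original_residues, variant_residues)
    for position, (orig, new) in enumerate(pairs, start=1):
        if orig != new:
            changes.append((position, orig, new, is_conservative(orig, new)))
    return changes
```

Note that the lookup follows the table literally: Ala to Ser is listed, but Ser to Ala is not, so `is_conservative("Ser", "Ala")` returns `False`.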
氨基酸取代、缺失和/或插入可以使用本领域熟知的肽合成技术如固相肽合成法等或通过重组DNA操作轻易地进行。用于操作DNA序列以产生蛋白质的取代、插入或缺失变体的方法是本领域熟知的。例如,用于在DNA的预定位点处产生取代突变的技术是本领域技术人员熟知的并且包括M13诱变法、T7-Gen体外诱变法(USB,Cleveland,OH)、QuickChange位点定向诱变法(Stratagene,San Diego,CA)、PCR介导的位点定向诱变或其他位点定向诱变法。
衍生物
“衍生物”包括这样的肽、寡肽、多肽,其中与天然存在形式蛋白质(如Patellin或RNA加工因子前体38或GATA样多肽或衔接头2或WDR23样多肽)的氨基酸序列相比较,它们包含非天然存在氨基酸残基对氨基酸的取代或非天然存在氨基酸残基的添加。蛋白质的“衍生物”也包括这样的肽、寡肽、多肽,其中与所述多肽的天然存在形式的氨基酸序列相比,它们包含天然存在的改变(糖基化、酰化、异戊二烯化、磷酸化、肉豆蔻酰化、硫酸化等)的氨基酸残基或非天然存在的改变的氨基酸残基。与衍生出衍生物的氨基酸序列相比较,该衍生物可以也包含与所述氨基酸序列共价或非共价结合的一个或多个非氨基酸取代基或添加物(例如报道分子或其他配体),如所结合旨在促进检测该衍生物的报道分子,和相对于天然存在蛋白质的氨基酸序列而言,包含非天然存在的氨基酸残基。此外,“衍生物”也包括天然存在形式的蛋白质与标签肽如FLAG、HIS6或硫氧还蛋白(对标签肽的综述,参见Terpe,Appl.Microbiol.Biotechnol.60,523-533,2003)的融合物。
直向同源物/旁系同源物
直向同源物和旁系同源物包括用来描述基因祖先关系的进化概念。旁系同源物是相同物种内因祖先基因复制而起源的基因;直向同源物是来自不同生物的因物种形成而起源的基因,并且也衍生于共同的祖先基因。
结构域
术语“结构域”指在进化相关性蛋白质的序列比对结果上的特定位置处保守的一组氨基酸。尽管在其他位置处的氨基酸可以在同源物之间不同,然而在特定位置处高度保守的氨基酸指示在蛋白质结构、稳定性或功能方面很可能是必需的氨基酸。结构域因其在蛋白质同源物家族的比对序列中高程度保守而鉴定,故它们可以用作鉴定物来确定所讨论的任意多肽是否属于先前已鉴定的多肽家族。
基序/共有序列/标签
术语“基序”或“共有序列”或“标签”指在进化相关蛋白质的序列中的短保守区域。基序往往是结构域的高度保守部分,但是也可以仅包括该结构域的部分,或可以位于保守结构域之外(若该基序的全部氨基酸位于定义的结构域之外)。
杂交
如本文中所定义的术语“杂交”是其中基本上同源的互补核苷酸序列相互复性的过程。杂交过程可以完全在溶液中进行,即两种互补性核酸均处在溶液中。杂交过程也可以用固定至基质如磁珠、琼脂糖凝胶(Sepharose)珠或任何其他树脂的互补性核酸之一进行。杂交过程也可以用固定至固体支持物如硝酸纤维素膜或尼龙膜上或通过例如照相平版印刷术固定至例如硅玻璃支持物(后者称作核酸阵列或微阵列或称作核酸芯片)的互补性核酸之一进行。为使杂交发生,核酸分子通常被热变性或化学变性,以将双链解链成两条单链和/或去除来自单链核酸的发夹或其他二级结构。
术语“严格性”指杂交发生的条件。杂交的严格性受诸条件如温度、盐浓度、离子强度和杂交缓冲液组成的影响。通常,将低严格条件选择成在定义的离子强度和pH处,低于特定序列的热解链温度(Tm)约30℃。中等严格条件是当所述温度在Tm以下20℃时,并且高严格条件是当所述温度在Tm以下10℃时。高严格杂交条件一般用于分离与靶核酸序列具有高序列相似性的杂交序列。然而,核酸可以在序列上偏离且依旧编码基本上相同的多肽,原因是遗传密码的简并性。因而,有时候可能需要中等严格杂交条件以鉴定此类核酸分子。
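The stringency levels in the preceding paragraph are defined as offsets below the thermal melting point (about 30 °C below Tm for low, 20 °C for medium and 10 °C for high stringency). A minimal sketch of that arithmetic follows; the function and dictionary names are hypothetical, and the offsets are the approximate values given in the text rather than exact limits.

```python
# Approximate offsets below Tm (in degrees Celsius) as stated in the text.
STRINGENCY_OFFSETS_C = {"low": 30.0, "medium": 20.0, "high": 10.0}


def hybridisation_temperature(tm_celsius, stringency):
    """Return the hybridisation temperature for a given Tm and stringency level."""
    try:
        offset = STRINGENCY_OFFSETS_C[stringency]
    except KeyError:
        raise ValueError("stringency must be 'low', 'medium' or 'high'") from None
    return tm_celsius - offset
```

For a probe with a Tm of 70 °C, high-stringency hybridisation would therefore be carried out at about 60 °C and low-stringency hybridisation at about 40 °C.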
Tm是在定义的离子强度和pH处的下述温度,其中50%的靶序列在所述温度与完全匹配的探针杂交。Tm取决于溶液条件和探针的碱基组成及长度。例如,较长的序列在更高温度上特异性杂交。最大杂交速率从低于Tm约16℃直至32℃获得。杂交溶液中一价阳离子的存在降低了两条核酸链之间的静电排斥作用,因而促进杂交体形成;这种作用对于直到0.4M的钠浓度是显而易见的(对于更高的浓度而言,可以忽略这种作用)。甲酰胺降低DNA-DNA和DNA-RNA双链体的解链温度,每百分数的甲酰胺降低0.6至0.7℃,且添加50%甲酰胺允许在30至45℃杂交,尽管杂交速率会降低。碱基对错配降低杂交速率和双链体的热稳定性。平均且对于大的探针而言,Tm下降约1℃/每%碱基错配。根据杂交体的类型,Tm可以使用以下等式计算:
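1) DNA-DNA hybrids (Meinkoth and Wahl, Anal. Biochem., 138: 267-284, 1984):
Tm = 81.5 °C + 16.6 × log10([Na+](a)) + 0.41 × %[G/C](b) - 500 × (L(c))^-1 - 0.61 × % formamide
2) DNA-RNA or RNA-RNA hybrids:
Tm = 79.8 + 18.5 × log10([Na+](a)) + 0.58 × (%G/C(b)) + 11.8 × (%G/C(b))^2 - 820/L(c)
3) Oligo-DNA or oligo-RNA(d) hybrids:
For fewer than 20 nucleotides: Tm = 2 × ln
For 20-35 nucleotides: Tm = 22 + 1.46 × ln
(a) Or for another monovalent cation, but only accurate in the 0.01-0.4 M range.
(b) Only accurate for %GC in the 30% to 75% range.
(c) L = length of the duplex in base pairs.
(d) Oligo, oligonucleotide; ln = effective length of primer = 2 × (number of G/C) + (number of A/T).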
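As a worked illustration, the DNA-DNA equation and the two oligonucleotide equations above can be evaluated as follows. This is a sketch under stated assumptions: the function names are hypothetical, U is counted together with A/T so that oligo-RNA can be handled, and the DNA-RNA equation is left out because the text does not say whether %G/C in its squared term is a percentage or a fraction.

```python
import math


def tm_dna_dna(na_molar, gc_percent, length_bp, formamide_percent=0.0):
    """Tm (deg C) of a DNA-DNA hybrid, equation 1 (Meinkoth and Wahl, 1984).

    Per the footnotes, the result is only accurate for 0.01-0.4 M
    monovalent cation and 30-75 %GC.
    """
    if na_molar <= 0 or length_bp <= 0:
        raise ValueError("na_molar and length_bp must be positive")
    return (81.5
            + 16.6 * math.log10(na_molar)
            + 0.41 * gc_percent
            - 500.0 / length_bp
            - 0.61 * formamide_percent)


def effective_length(oligo):
    """ln = 2 x (number of G/C) + (number of A/T); U is counted as A/T (assumption)."""
    seq = oligo.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T") + seq.count("U")
    if gc + at != len(seq):
        raise ValueError("oligo contains characters other than A, C, G, T, U")
    return 2 * gc + at


def tm_oligo(oligo):
    """Tm (deg C) of an oligonucleotide hybrid, equation 3."""
    n = len(oligo)
    ln = effective_length(oligo)
    if n < 20:
        return 2.0 * ln
    if n <= 35:
        return 22.0 + 1.46 * ln
    raise ValueError("equation 3 only covers oligonucleotides of up to 35 nucleotides")
```

With 1 M Na+, 50 %GC and a 500 bp duplex, equation 1 gives 81.5 + 0 + 20.5 - 1.0 = 101.0 °C; adding 50% formamide lowers this by 30.5 °C to 70.5 °C, which is consistent with the statement that 50% formamide allows hybridisation at much lower temperatures.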
可以使用许多已知技术中任意一种技术控制非特异性结合,例如将膜以含有蛋白质的溶液封闭、添加异源RNA、异源DNA和SDS至杂交缓冲液,并且用RNA酶处理。对于非同源性探针,可以通过变换以下条件之一:(i)渐进地降低复性温度(例如从68℃至42℃)或(ii)渐进地降低甲酰胺浓度(例如从50%至0%)进行一系列杂交。技术人员了解可以在杂交期间变更并且会维持或改变所述严格条件的多个参数。
除了杂交条件之外,杂交特异性一般还取决于杂交后洗液的功能。为除去因非特异性杂交引起的背景,用稀释的盐溶液洗涤样品。此类洗液的关键因素包括最终洗涤溶液的离子强度和温度:盐浓度越低且洗涤温度越高,则洗涤的严格性越高。洗涤条件一般在杂交严格性上或低于所述杂交严格性进行。阳性杂交产生至少两倍于背景信号的信号。通常,用于核酸杂交测定法或基因扩增检测方法的适宜严格条件如上所述。也可以选择严格性更高或更低的条件。技术人员了解可以在洗涤期间变更并且会维持或改变所述严格条件的多个参数。
例如,用于长度大于50个核苷酸的DNA杂交体的典型高严格杂交条件包括在65℃于1×SSC中或在42℃于1×SSC和50%甲酰胺中杂交,随后在65℃于0.3×SSC中洗涤。用于长度大于50个核苷酸的DNA杂交体的中等严格杂交条件的例子包括在50℃于4×SSC或在40℃于6×SSC和50%甲酰胺中杂交,随后在50℃于2×SSC中洗涤。杂交体的长度是杂交核酸的预期长度。当序列已知的核酸杂交时,可以通过比对序列并鉴定本文中所述的保守区而确定杂交体长度。1×SSC是0.15MNaCl和15mM柠檬酸钠;杂交溶液和洗涤溶液可以额外地包括5×Denhardt试剂、0.5-1.0%SDS、100μg/ml变性的片段化鲑精DNA、0.5%焦磷酸钠。
出于定义严格性水平的目的,可以参考Sambrook等(2001)Molecular Cloning:a laboratory manual,第三版,Cold Spring HarborLaboratory Press,CSH,New York或参考Current Protocols inMolecular Biology,John Wiley & Sons,N.Y.(1989和年度更新版)。
剪接变体
如本文中所用的术语“剪接变体”包括其中已经切除、取代、置换或添加所选内含子和/或外显子或其中已经缩短或加长内含子的核酸序列的变体。此类变体将是基本上保留蛋白质的生物学活性的一类变体;这可以通过选择性保留蛋白质的功能片段实现。此类剪接变体可以在自然界中找到或可以人工制备。用于预测和分离此类剪接变体的方法是本领域熟知的(见例如Foissac和Schiex(2005)BMC Bioinformatics.6:25)。
等位变体
等位基因或等位变体是给定基因位于相同染色体位置处的备选形式。等位变体包含单核苷酸多态性(SNP)和小的插入/缺失多态性(INDEL)。INDEL的大小通常小于100bp。SNP和INDEL形成大部分生物的天然存在多态性株系中的序列变体的最大集合。
基因改组/定向进化
基因改组或定向进化由反复DNA改组,随后适当筛选和/或选择以产生编码具有改良生物学活性的蛋白质的核酸或其部分的变体而组成(Castle等,(2004)Science 304(5674):1151-4;美国专利5,811,238和6,395,547)。
调节元件/调控序列/启动子
术语“调节元件”、“调控序列”和“启动子”均在本文中可相互交换地使用并且在广泛含义上意指能够实现与它们相连接的序列表达的调节性核酸序列。术语“启动子”一般指位于基因转录起点上游并参与识别和结合RNA聚合酶和其他蛋白质,因而指导有效链接的核酸转录的核酸调控序列。前述术语包括从经典真核基因组基因(包括对于精确转录启动所需的TATA框,具有或没有CCAAT框序列)衍生的转录调节序列和应答发育性刺激和/或外部刺激或以组织特异性方式而改变基因表达的其它调节元件(即上游激活序列、增强子和沉默子)。本术语还包括经典原核基因的转录调节序列,在此情况下它可以包括一个-35框序列和/或一个-10框转录调节序列。术语“调节元件”也包含赋予、激活或增加核酸分子在细胞、组织或器官中表达的人工融合分子或衍生物。
“植物启动子”包含介导植物细胞中编码序列节段表达的调节元件。因此,植物启动子不必须是植物来源的,但可以源自病毒或微生物,例如来自侵袭植物细胞的病毒。“植物启动子”也可以源自植物细胞,例如来自用在本发明方法中待表达并在本文中描述的核酸序列转化的植物。这也适用于其他“植物”调节信号,如“植物”终止子。在本发明方法中有用的核苷酸序列上游的启动子可以通过一个或多个核苷酸取代、插入和/或缺失进行修饰,但不影响启动子、可读框(ORF)或3’调节区如终止子或远离ORF存在的其他3’调节区的功能性或活性。还有可能所述启动子的活性因修饰其序列或它们被更活跃的启动子、甚至来自异源生物的启动子彻底取代而提高。为了在植物中表达,如上所述,核酸分子必须有效地连接至或包含在正确的时间点并以所需空间表达模式表达基因的合适启动子。
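To identify functionally equivalent promoters, the promoter strength and/or expression pattern of a candidate promoter may be analysed, for example, by operably linking the promoter to a reporter gene and assaying the expression level and pattern of the reporter gene in various tissues of the plant. Suitable well-known reporter genes include, for example, β-glucuronidase or β-galactosidase. Promoter activity is assayed by measuring the enzymatic activity of the β-glucuronidase or β-galactosidase. The promoter strength and/or expression pattern may then be compared with that of a reference promoter (such as the one used in the methods of the present invention). Alternatively, promoter strength may be assayed by quantifying mRNA levels, or by comparing mRNA levels of the nucleic acid used in the methods of the present invention with mRNA levels of housekeeping genes such as 18S rRNA, using methods known in the art such as Northern blotting with densitometric analysis of autoradiograms, quantitative real-time PCR or RT-PCR (Heid et al., 1996 Genome Methods 6: 986-994). Generally, a "weak promoter" means a promoter that drives expression of a coding sequence at a low level. "Low level" means levels of about 1/10,000 transcripts to about 1/100,000 transcripts, to about 1/500,000 transcripts per cell. Conversely, a "strong promoter" drives expression of a coding sequence at a high level, or at about 1/10 transcripts to about 1/100 transcripts, to about 1/1000 transcripts per cell. Generally, a "medium strength promoter" means a promoter that drives expression of a coding sequence at a lower level than a strong promoter, in particular at a level that is in all instances below that obtained under the control of a 35S CaMV promoter. Typically, a medium strength promoter drives expression of a nucleic acid at a level at least 3, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90 or 100-fold lower than the 35S CaMV promoter.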
有效连接
如本文中所用的术语“有效连接”指启动子序列与目的基因之间的功能连接,从而该启动子序列能够启动该目的基因转录。
组成型启动子
“组成型启动子”指在生长和发育的大部分期间但不是必需在全部期间,以及在大多数环境条件下,在至少一种细胞、组织或器官中有转录活性的启动子。下表2给出组成型启动子的例子。
表2:组成型启动子的例子
基因来源 | 参考文献 |
肌动蛋白 | McElroy等,Plant Cell,2:163-171,1990 |
HMGP | WO 2004/070039 |
CAMV35S | Odell等,Nature,313:810-812,1985 |
CaMV 19S | Nilsson等,Physiol.Plant.100:456-462,1997 |
GOS2 | de Pater等,Plant J Nov;2(6):837-44,1992,WO 2004/065596 |
遍在蛋白 | Christensen等,Plant Mol.Biol.18:675-689,1992 |
稻亲环蛋白 | Buchholz等,Plant Mol Biol.25(5):837-43,1994 |
玉米H3组蛋白 | Lepetit等,Mol.Gen.Genet.231:276-285,1992 |
苜蓿H3组蛋白 | Wu等,Plant Mol.Biol.11:641-649,1988 |
肌动蛋白2 | An等,Plant J.10(1);107-121,1996 |
34S FMV | Sanger等,Plant.Mol.Biol.,14,1990:433-443 |
核酮糖二磷酸羧化酶-加氧酶小亚基 | US 4,962,028 |
OCS | Leisner(1988)Proc Natl Acad Sci USA 85(5):2553 |
SAD1 | Jain等,Crop Science,39(6),1999:1696 |
SAD2 | Jain等,Crop Science,39(6),1999:1696 |
nos | Shaw等,(1984)Nucleic Acids Res.12(20):7831-7846 |
V-ATP酶 | WO 01/14572 |
超级启动子(Super promoter) | WO 95/14098 |
G框蛋白质 | WO 94/12015 |
遍在启动子
遍在启动子是在生物的全部组织或细胞中基本上有活性的。
发育调节型启动子
发育调节型启动子在某些发育阶段期间或在经历发育变化的植物的部分中有活性。
诱导型启动子
诱导型启动子在应答化学刺激(综述见Gatz 1997,Annu.Rev.PlantPhysiol.Plant Mol.Biol.,48:89-108)、环境刺激或物理刺激时具有诱导或提高的转录启动作用,或可以是“胁迫诱导的”,即当植物暴露于多种胁迫条件时其激活,或是“病原体诱导的”,即当植物暴露于多种病原体时其激活。
器官特异性/组织特异性启动子
器官特异性或组织特异性启动子是能够偏好地启动某些器官或组织如叶、根、种子组织等中转录的启动子。例如,“根特异性启动子”是这样的启动子,该启动子优势地在植物根中具有转录活性,基本上在植物的任何其他部分中无活性,尽管在该植物的这些其他部分中仍允许任意泄露表达。能够仅在某些细胞中启动转录的启动子在本文中称作“细胞特异性的”。
根特异性启动子的例子列于下表i中。
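Table i: Examples of root-specific promoters
Gene source | Reference |
Rice RCc3 | Xu et al., (1995) Plant Mol Biol 27(2): 237-48 |
Arabidopsis phosphate transporter PHT1 | Koyama et al., 2005 |
Medicago phosphate transporter | Xiao et al., 2006 |
Arabidopsis Pyk10 | Nitz et al., (2001) Plant Sci 161(2): 337-346 |
Tobacco root-specific genes RB7, RD2, RD5, RH12 | Conkling et al., (1990) Plant Phys 93(3): 1203-1211 |
Barley root-specific lectin | Lerner and Raikhel (1989) Plant Phys 91: 124-129 |
Root-specific hydroxyproline-rich protein | Keller and Lamb (1989) Genes & Dev 3: 1639-1646 |
Arabidopsis CDC27B/hobbit | Blilou et al., (2002) Genes & Dev 16: 2566-2575 |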
种子特异性启动子是能够在种子组织中优势地具有转录活性的启动子,但无需排他性地在种子组织中有转录活性(在泄露表达的情况下)。种子特异性启动子可以在种子发育期间和/或萌发期间有活性。种子特异性启动子的例子在Qing Qu和Takaiwa(Plant Biotechnol.J.2,113-125,2004)中给出,所述文献的公开内容如完整所述那样通过引用方式并入本文且示于下表ii。
Table ii: Examples of seed-specific promoters
Gene source | Reference |
Seed-specific genes | Simon et al., Plant Mol. Biol. 5: 191, 1985; Scofield et al., J. Biol. Chem. 262: 12202, 1987; Baszczynski et al., Plant Mol. Biol. 14: 633, 1990 |
Brazil Nut albumin | Pearson et al., Plant Mol. Biol. 18: 235-245, 1992 |
Legumin | Ellis et al., Plant Mol. Biol. 10: 203-214, 1988 |
Glutelin (rice) | Takaiwa et al., Mol. Gen. Genet. 208: 15-22, 1986; Takaiwa et al., FEBS Letts. 221: 43-47, 1987 |
Zein | Matzke et al., Plant Mol Biol, 14(3): 323-32, 1990 |
NapA | Stalberg et al., Planta 199: 515-519, 1996 |
Wheat LMW and HMW glutenin-1 | Mol Gen Genet 216: 81-90, 1989; NAR 17: 461-2, 1989 |
Wheat SPA | Albani et al., Plant Cell, 9: 171-184, 1997 |
Wheat α-, β-, γ-gliadins | EMBO J. 3: 1409-15, 1984 |
Barley Itr1 promoter | Diaz et al., (1995) Mol Gen Genet 248(5): 592-8 |
Barley B1, C, D hordein | Theor Appl Gen 98: 1253-62, 1999; Plant J 4: 343-55, 1993; Mol Gen Genet 250: 750-60, 1996 |
Barley DOF | Mena et al., The Plant Journal, 16(1): 53-62, 1998 |
blz2 | EP99106056.7 |
Synthetic promoter | Vicente-Carbajosa et al., Plant J. 13: 629-640, 1998 |
Rice prolamin NRP33 | Wu et al., Plant Cell Physiology 39(8) 885-889, 1998 |
Rice α-globulin Glb-1 | Wu et al., Plant Cell Physiology 39(8) 885-889, 1998 |
Rice OSH1 | Sato et al., Proc. Natl. Acad. Sci. USA, 93: 8117-8122, 1996 |
Rice α-globulin REB/OHP-1 | Nakase et al., Plant Mol. Biol. 33: 513-522, 1997 |
Rice ADP-glucose pyrophosphorylase | Trans Res 6: 157-68, 1997 |
Maize ESR gene family | Plant J 12: 235-46, 1997 |
Sorghum α-kafirin | DeRose et al., Plant Mol. Biol 32: 1029-35, 1996 |
KNOX | Postma-Haarsma et al., Plant Mol. Biol. 39: 257-71, 1999 |
Rice oleosin | Wu et al., J. Biochem. 123: 386, 1998 |
Sunflower oleosin | Cummins et al., Plant Mol. Biol. 19: 873-876, 1992 |
PRO0117, putative rice 40S ribosomal protein | WO 2004/070039 |
PRO0136, rice alanine aminotransferase | Unpublished |
PRO0147, trypsin inhibitor ITR1 (barley) | Unpublished |
PRO0151, rice WSI18 | WO 2004/070039 |
PRO0175, rice RAB21 | WO 2004/070039 |
PRO005 | WO 2004/070039 |
PRO0095 | WO 2004/070039 |
α-amylase (Amy32b) | Lanahan et al., Plant Cell 4: 203-211, 1992; Skriver et al., Proc Natl Acad Sci USA 88: 7266-7270, 1991 |
Cathepsin β-like gene | Cejudo et al., Plant Mol Biol 20: 849-856, 1992 |
Barley Ltp2 | Kalla et al., Plant J. 6: 849-60, 1994 |
Chi26 | Leah et al., Plant J. 4: 579-89, 1994 |
Maize B-Peru | Selinger et al., Genetics 149: 1125-38, 1998 |
如本文中所定义的绿色组织特异性启动子是优势地在绿色组织中具有转录活性的启动子,在植物的任何其它部分内基本上无活性,尽管在该植物的这些其他部分中仍允许任意泄露表达。
可以用来实施本发明方法的绿色组织特异性启动子的例子示于下表iii中。
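Table iii: Examples of green tissue-specific promoters
Gene | Expression | Reference |
Maize orthophosphate dikinase | Leaf specific | Fukayama et al., 2001 |
Maize phosphoenolpyruvate carboxylase | Leaf specific | Kausch et al., 2001 |
Rice phosphoenolpyruvate carboxylase | Leaf specific | Liu et al., 2003 |
Rice small subunit Rubisco | Leaf specific | Nomura et al., 2000 |
Rice β-expansin EXBP9 | Shoot specific | WO 2004/070039 |
Pigeonpea small subunit Rubisco | Leaf specific | Panguluri et al., 2005 |
Pea RBCS3A | Leaf specific |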
组织特异性启动子的另一个例子是分生组织特异性启动子,其优势地在分生组织中具有转录活性,在植物的任何其它部分内基本上无活性,尽管在该植物的这些其他部分中仍允许任意泄露表达。可以用来实施本发明方法的分生组织特异性启动子的例子示于下表iv中。
表iv:分生组织特异性启动子的例子
基因来源 | 表达模式 | 参考文献 |
稻OSH1 | 从胚球期至籽苗期的枝条顶端分生组织 | Sato等,(1996)Proc.Natl.Acad.Sci.USA,93:8117-8122 |
稻金属硫蛋白 | 分生组织特异性 | BAD87835.1 |
WAK1和WAK2 | 枝条和根顶端分生组织,和在扩展的叶和萼片中 | Wagner和Kohorn(2001)Plant Cell13(2):303-318 |
终止子
术语“终止子”包括作为转录单元末端处DNA序列的调控序列,所述的DNA序列产生初级转录物的3’加工和多腺苷酸化及转录终止的信号。终止子可以从天然基因、从多种其他植物基因或从T-DNA衍生。待添加的终止子可以从例如胭脂碱合酶或章鱼碱合酶基因或备选地从另一种植物基因或较次优选地从任何其他真核基因衍生。
调节
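In relation to expression or gene expression, the term "modulation" means a process in which the expression level of the gene is changed relative to the control plant; the expression level may be increased or decreased. The original, unmodulated expression may be any kind of expression of a structural RNA (rRNA, tRNA) or of an mRNA with subsequent translation. The term "modulating the activity" means any change in the expression of the nucleic acid sequences of the invention or of the encoded proteins that leads to increased yield and/or increased growth of the plants.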
表达
术语“表达”或“基因表达”意指某个特定基因或多个特定基因或特定基因构建体的转录。术语“表达”或“基因表达”尤其意指某个基因或某些基因或基因构建体转录成结构性RNA(rRNA、tRNA)或mRNA,所述RNA随后翻译成或不翻译成蛋白质。该过程包括DNA的转录和所得mRNA产物的加工。
增加的表达/过量表达
如本文中所用的术语“增加的表达”或“过量表达”意指相对于原有野生型表达水平为额外的任何形式的表达。
在本领域内充分报道了用于提高基因或基因产物表达的方法并且这些方法包括例如由适宜启动子驱动的过量表达、使用转录增强子或翻译增强子。充当启动子或增强子元件的分离核酸可以导入多核苷酸的非异源形式的适宜位置(一般在上游)中,从而上调编码目的多肽的核酸表达。例如,内源性启动子可以在体内通过突变、缺失和/或取代加以改变(见Kmiec,US 5,565,350;Zarling等,WO9322443),或可以将分离的启动子以相对于本发明基因的恰当方向及距离导入植物细胞,从而控制该基因的表达。
若需要多肽表达,通常希望的是在多核苷酸编码区的3’末端处包括多腺苷化区域。该多腺苷酸化区域可以从天然基因、从多种其他植物基因或从T-DNA衍生。待添加的3’末端序列可以从例如胭脂碱合酶或章鱼碱合酶基因或备选地从另一种植物基因或较不优选地从任何其他真核基因衍生。
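An intron sequence may also be added to the 5' untranslated region (UTR) or to the coding sequence of a partial coding sequence to increase the amount of mature message that accumulates in the cytosol. Inclusion of a spliceable intron in the transcription unit of both plant and animal expression constructs has been shown to increase gene expression at both the mRNA and protein levels by up to 1000-fold (Buchman and Berg (1988) Mol. Cell Biol. 8: 4395-4405; Callis et al. (1987) Genes Dev 1: 1183-1200). Such intron enhancement of gene expression is typically greatest when the intron is placed near the 5' end of the transcription unit. The use of the maize introns Adh1-S intron 1, 2 and 6 and of the Bronze-1 intron is known in the art. For general information see: The Maize Handbook, Chapter 116, Freeling and Walbot, Eds., Springer, N.Y. (1994).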
内源基因
本文中对“内源”基因的称谓不仅仅指如植物中以其天然形式(即没有人类任何干预)存在的所讨论基因,还指处于分离形式下的随后(再)导入植物(转基因)的相同基因(或基本上同源的核酸/基因)。例如,含有这种转基因的转基因植物可以遭遇转基因表达的相当大程度地降低和/或内源基因表达的实质降低。分离的基因可以从生物分离或可以是人造的,例如通过化学合成法。
降低的表达
本文中提及的“降低的表达”或“降低或基本消除表达”意指内源基因表达和/或多肽水平和/或多肽活性相对于对照植物的下降。与对照植物相比较,所述降低或基本上消除以增加的优选顺序是至少10%、20%、30%、40%或50%、60%、70%、80%、85%、90%或95%、96%、97%、98%、99%或更多降低。
为了降低或基本消除植物中内源基因的表达,需要核酸序列的基本上连续的核苷酸的足够长度。为进行基因沉默,该长度可以短至20、19、18、17、16、15、14、13、12、11、10个或更少核苷酸,或者该长度可以长至整个基因(包括部分或完整的5’和/或3’UTR)。基本上连续的核苷酸片段可以从编码Patellin或RNA加工因子前体38或Adaptor2或WDR23样多肽的核酸(靶基因)的任何核酸衍生,或从能够编码Patellin或RNA加工因子前体38或衔接头2或WDR23样多肽的直向同源物、旁系同源物或同源物的任何核酸衍生。优选地,基本上连续的核苷酸的片段能够与靶基因(有义链或反义链)形成氢键,更优选地,基本上连续的核苷酸片段以增加的优选顺序与靶基因(有义链或反义链)具有50%、60%、70%、80%、85%、90%、95%、96%、97%、98%、99%、100%的序列同一性。编码(功能性)多肽的核酸序列不是本文中所讨论用于降低或基本消除内源基因表达的多种方法的前提。
用于降低或基本上消除植物中内源基因表达的多种方法的例子,或降低蛋白质的水平和/或活性的例子是本领域技术人员已知的。例如,本领域技术人员将能够轻易调整用于沉默的公知方法,从而通过利用合适的启动子在完整植物中或其部分中实现内源基因表达的降低。
可以使用常规工具和技术完成表达的这种降低或基本消除。用于降低或基本消除内源基因表达的优选方法是在植物中引入并表达基因构建体,其中将核酸(在此情况下,从目的基因衍生的,或从能够编码Patellin或RNA加工因子前体38或衔接头2或WDR23样多肽的直向同源物、旁系同源物或同源物的任何核酸中衍生的一段基本上连续的核苷酸)克隆至所述基因构建体,(部分或完全地)作为被间隔序列(非编码性DNA)隔开的反向重复序列。
另一用于降低或基本消除内源基因表达的方法是使用核酸序列或其部分(在此情况下,所述部分是从目的基因衍生的,或从能够编码Patellin或RNA加工因子前体38或衔接头2或WDR23样多肽的直向同源物、旁系同源物或同源物的任何核酸序列中衍生的一段基本上连续的核苷酸)的反向重复序列(其优选能够形成发夹结构)进行RNA介导的沉默。RNA沉默方法的另一个例子包括将核酸序列或其部分(在此情况下是从目的基因衍生的,或从能够编码Patellin或RNA加工因子前体38或衔接头2或WDR23样多肽的直向同源物、旁系同源物或同源物的任何核酸序列中衍生的一段基本上连续的核苷酸)以有义方向导入植物。RNA沉默方法的另一个例子包括使用反义核酸序列。基因沉默也可以通过插入诱变(例如T-DNA插入或转座子插入)或通过如Angell和Baulcombe((1999)Plant J.20(3):357-62)、(Amplicon VIGS WO98/36083)或Baulcombe(WO 99/15682)及其他人描述的策略实现。技术人员熟知其他方法,如使用针对内源性多肽的抗体以抑制该多肽在植物中的功能,或干扰某多肽参与其中的信号传导途径。人工和/或天然的微RNA(miRNA)可以用来敲除基因表达和/或mRNA翻译。内源性miRNA是通常19-24个核苷酸长的单链小RNA。可以专门地遗传工程化一般长度21个核苷酸的人工微RNA(amiRNA)以负向调节单个或多个目的基因的基因表达。选择植物的微RNA靶的决定因素是本领域熟知的。已经定义了用于靶识别的经验参数并且可以使用它们辅助特定amiRNA的设计(Schwab等,(2005)Dev Cell 8(4):517-27)。用于设计并产生amiRNA及其前体的便利工具也是公众可获得的(Schwab等,(2006)Plant Cel 18(5):1121-33)。
在这种优选的方法中,使用核酸或其部分(在此情况下,所述部分是从目的基因衍生的,或从能够编码Patellin或RNA加工因子前体38或衔接头2或WDR23样多肽的直向同源物、旁系同源物或同源物的任何核酸中衍生的一段基本上连续的核苷酸)的反向重复序列(其优选能够形成发夹结构),通过RNA介导的沉默作用降低或基本上消除内源基因的表达。在包含调控序列的表达载体中克隆该反向重复序列。非编码性DNA核酸序列(间隔序列,例如基质附着区片段(MAR)、内含子、多接头等)位于形成所述反向重复序列的两个反向核酸之间。在反向重复序列转录后,形成具有(部分或完全)自我互补性结构的嵌合RNA。这种双链RNA结构称作发夹RNA(hpRNA)。hpRNA由植物加工成siRNA,该siRNA被掺入RNA诱导的沉默复合体(RISC)。该RISC进一步切开所述mRNA转录物,从而相当大程度地降低待翻译成多肽的mRNA转录物的数目。对于其他一般细节,参见例如Grierson等(1998)WO98/53083;Waterhouse等(1999)WO 99/53050。
本发明方法的实施不取决于在植物中引入并表达将所述核酸作为反向重复序列克隆到其中的基因构建体,不过可以使用几种熟知“基因沉默”方法中任何一种或多种方法来实现相同效果。
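One such method for reducing endogenous gene expression is RNA-mediated silencing of gene expression (downregulation). Silencing in this case is triggered in a plant by a double-stranded RNA sequence (dsRNA) that is substantially similar to the target endogenous gene. This dsRNA is further processed by the plant into so-called short interfering RNAs (siRNAs) of about 20 to about 26 nucleotides. The siRNAs are incorporated into an RNA-induced silencing complex (RISC), which cleaves the mRNA transcript of the endogenous target gene, thereby substantially reducing the number of mRNA transcripts to be translated into a polypeptide. Preferably, the double-stranded RNA sequence corresponds to the target gene.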
RNA沉默方法的另一个例子涉及将核酸序列或其部分(在此情况下是从目的基因衍生的,或从能够编码Patellin或RNA加工因子前体38或衔接头2的直向同源物、旁系同源物或同源物的任何核酸中衍生的一段基本上连续的核苷酸)以有义方向导入植物。“有义方向”是指与自身mRNA转录物同源的DNA序列。因而将所述核酸序列的至少一个拷贝导入植物。这个额外核酸序列会降低内源基因表达,从而产生已知为共抑制作用的现象。将一个核酸序列的几个额外拷贝导入植物时,基因表达的降低将更明显,因为高转录物水平与触发共抑制作用之间存在正相关。
RNA沉默方法的另一个例子涉及使用反义核酸序列。“反义”核酸序列包含与编码蛋白质的“有义”核酸序列互补,即与双链cDNA分子的编码链互补,或与mRNA转录物序列互补的核苷酸序列。反义核酸序列优选地互补于待沉默的内源基因。这种互补性可以存在于基因的“编码区”中和/或其“非编码区”中。术语“编码区”指包含被翻译成氨基酸残基的密码子的核苷酸序列的区域。术语“非编码区”指分布在编码区侧翼的被转录但不翻译成氨基酸的5’和3’序列(也称作5’和3’非翻译区)。
反义核酸序列可以根据Watson和Crick碱基配对规则设计。反义核酸序列可以互补于整个核酸序列(在此情况下是从目的基因衍生的,或从能够编码Patellin或RNA加工因子前体38或衔接头2的直向同源物、旁系同源物或同源物的任何核酸中衍生的一段基本上连续的核苷酸),不过也可以是仅对所述核酸序列的一部分(包括mRNA 5’和3’UTR)反义的寡核苷酸。例如,反义寡核苷酸序列可以互补于编码多肽的mRNA转录物的翻译起点周围的区域。合适反义寡核苷酸序列的长度是本领域已知的并且可以从约50、45、40、35、30、25、20、15或10个核苷酸或更小的核苷酸长度开始。本发明的反义核酸序列可以使用化学合成反应和酶连接反应,利用本领域已知的方法构建。例如,反义核酸序列(例如反义寡核苷酸序列)可以使用天然存在核苷酸或以多种方式修饰的核苷酸化学地合成,其中所述的修饰核苷酸设计旨在增加分子的生物学稳定性或增加反义与有义核酸序列之间所形成的双链体的物理稳定性,例如,可以使用硫代磷酸酯衍生物和吖啶取代的核苷酸。可以用来产生反义核酸序列的修饰核苷酸的例子是本领域熟知的。已知的核苷酸修饰包括甲基化、环化和‘加帽’及用类似物(如肌苷)取代一个或多个天然存在核苷酸。对核苷酸的其他修饰作用是本领域熟知的。
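The Watson-Crick design rule described above amounts to taking the reverse complement of the sense transcript. The sketch below illustrates this for an antisense RNA and for an oligonucleotide covering the region around the translation start. All function names are hypothetical, and treating the first AUG in the sequence as the translation start is a simplifying assumption, not something stated in the text.

```python
_RNA_COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def antisense_rna(sense_rna):
    """Return the antisense RNA (5'->3') of a sense sequence given 5'->3'.

    DNA input is accepted; T is read as U.
    """
    seq = sense_rna.upper().replace("T", "U")
    try:
        return "".join(_RNA_COMPLEMENT[base] for base in reversed(seq))
    except KeyError as exc:
        raise ValueError(f"unexpected nucleotide: {exc.args[0]}") from exc


def antisense_oligo_for_start_codon(mrna, flank=10):
    """Antisense oligo covering the first AUG plus `flank` nucleotides on each side.

    Assumption: the first AUG found is the translation start codon.
    """
    seq = mrna.upper().replace("T", "U")
    start = seq.find("AUG")
    if start < 0:
        raise ValueError("no AUG found in the sequence")
    lo = max(0, start - flank)
    hi = min(len(seq), start + 3 + flank)
    return antisense_rna(seq[lo:hi])
```

With `flank=10` the oligonucleotide is at most 23 nucleotides long, which falls within the 10 to 50 nucleotide range mentioned in the text. Chemical modifications such as phosphorothioate linkages are outside the scope of this sketch.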
反义核酸序列可以使用表达载体以生物学方式产生,其中一种核酸序列已经以反义方向亚克隆(即从插入的核酸转录出的RNA会对目的靶核酸为反义方向)到所述表达载体中。优选地,植物中反义核酸序列的产生借助稳定整合的核酸构建体进行,其中所述的核酸构建体包含启动子、有效链接的反义寡核苷酸和终止子。
用于本发明方法中沉默作用的核酸分子(无论被导入植物中或原位(in situ)地产生)与mRNA转录物和/或编码多肽的基因组DNA杂交或结合以因而抑制蛋白质的表达,例如通过抑制转录和/或翻译做到这一点。杂交可以因形成稳定双链体的常规核苷酸互补性引起,或例如,在与DNA双链体结合的反义核酸序列的情况下,因双螺旋大沟内的特异性相互作用引起。反义核酸序列可以通过在特定组织部位转化或直接注射导入植物。备选地,反义核酸序列可以被修饰以靶向所选的细胞并且随后全身性施用。例如,对于全身性施用,可以修饰反义核酸序列,从而它们与表达在所选细胞表面上的受体或抗原特异性地结合,例如通过将所述反义核酸序列连接至与细胞表面受体或抗原结合的肽或抗体连接而做到这一点。反义核酸序列也可以使用本文中所述的载体递送至细胞。
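According to a further aspect, the antisense nucleic acid sequence is an α-anomeric nucleic acid sequence. An α-anomeric nucleic acid sequence forms specific double-stranded hybrids with complementary RNA in which, contrary to the usual β-units, the strands run parallel to each other (Gaultier et al. (1987) Nucl Ac Res 15: 6625-6641). The antisense nucleic acid sequence may also comprise a 2'-O-methylribonucleotide (Inoue et al. (1987) Nucl Ac Res 15, 6131-6148) or a chimeric RNA-DNA analogue (Inoue et al. (1987) FEBS Lett. 215, 327-330).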
内源基因表达的降低或基本上消除也可以使用核酶进行。核酶是具有核糖核酸酶活性的催化性RNA分子,能够切割与之具有互补区域的单链核酸序列,如mRNA。因此,核酶(例如锤头状核酶(在Haselhoff和Gerlach(1988)Nature 334,585-591中描述)可以用来催化地切割编码多肽的mRNA转录物,因而相当大程度地降低待翻译成多肽的mRNA转录物的数目。可以设计对核酸序列具有专一性的核酶(参见例如:Cech等美国专利号4,987,071;和Cech等美国专利号5,116,742)。备选地,与核酸序列相对应的mRNA转录物可以用来从RNA分子的汇集物中选出具有特定核糖核酸酶活性的催化性RNA(Bartel和Szostak(1993)Science 261,1411-1418)。核酶在植物中用于基因沉默的用途是本领域已知的(例如Atkins等(1994)WO 94/00012;Lenne等(1995)WO95/03404;Lutziger等(2000)WO 00/00619;Prinsen等(1997)WO97/13865和Scott等(1997)WO 97/38116)。
基因沉默也可以通过插入诱变(例如T-DNA插入或转座子插入)或通过如Angell和Baulcombe((1999)Plant J.20(3):357-62)、(AmpliconVIGS WO 98/36083)或Baulcombe(WO 99/15682)及其他人描述的策略实现。
如果内源基因中存在突变和/或在随后导入植物的分离基因/核酸中存在突变,基因沉默也可能发生。所述降低或基本上消除可以由无功能的多肽引起。例如,该多肽可以与多种相互作用的蛋白质结合;一种或多种突变和/或截短作用因而可以产生仍能够结合相互作用的蛋白质(如受体蛋白)但不能展示正常功能的多肽(如信号传导配体)。
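A further approach to gene silencing is to target nucleic acid sequences complementary to the regulatory region of the gene (e.g. the promoter and/or enhancers) so as to form triple-helical structures that prevent transcription of the gene in target cells. See Helene, C., Anticancer Drug Res. 6, 569-84, 1991; Helene et al., Ann. N.Y. Acad. Sci. 660, 27-36, 1992; and Maher, L.J., Bioassays 14, 807-15, 1992.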
技术人员会熟知其他方法,如使用针对内源多肽的抗体以抑制该多肽在植物中(in planta)的功能,或干扰涉及某多肽的信号传导途径。特别地,可以考虑人造分子可能用于抑制靶多肽的生物学功能,或用于干扰涉及所述靶多肽的信号传导途径。
备选地,可以建立筛选程序以鉴定植物群体中基因的天然变体,其中所述的变体编码具有降低的活性的多肽。也可以使用此类天然变体,例如来进行同源重组。
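Artificial and/or natural microRNAs (miRNAs) may be used to knock out gene expression and/or mRNA translation. Endogenous miRNAs are single-stranded small RNAs, typically 19-24 nucleotides long. They function primarily to regulate gene expression and/or mRNA translation. Most plant miRNAs have perfect or near-perfect complementarity with their target sequences. However, there are natural targets with up to five mismatches. miRNAs are processed from longer non-coding RNAs with characteristic fold-back structures by double-strand specific RNases of the Dicer family. Upon processing, they are incorporated into the RNA-induced silencing complex (RISC) by binding to its main component, an Argonaute protein. miRNAs serve as the specificity components of RISC, since they base-pair to target nucleic acids, mostly mRNAs, in the cytoplasm. Subsequent regulatory events include target mRNA cleavage and destruction and/or translational inhibition. Effects of miRNA overexpression are thus often reflected in decreased mRNA levels of target genes.
Artificial microRNAs (amiRNAs), which are typically 21 nucleotides in length, can be genetically engineered specifically to negatively regulate gene expression of single or multiple genes of interest. Determinants of plant microRNA target selection are well known in the art. Empirical parameters for target recognition have been defined and can be used to aid in the design of specific amiRNAs (Schwab et al., Dev. Cell 8, 517-527, 2005). Convenient tools for the design and generation of amiRNAs and their precursors are also available to the public (Schwab et al., 2006 Plant Cell 18(5): 1121-33).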
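For optimal performance, the gene silencing techniques used for reducing expression in a plant of an endogenous gene require the use of nucleic acid sequences from monocotyledonous plants for transformation of monocotyledonous plants, and from dicotyledonous plants for transformation of dicotyledonous plants. Preferably, a nucleic acid sequence from any given plant species is introduced into that same species. For example, a nucleic acid sequence from rice is transformed into a rice plant. However, it is not an absolute requirement that the nucleic acid sequence to be introduced originates from the same plant species as the plant into which it will be introduced. It is sufficient that there is substantial homology between the endogenous target gene and the nucleic acid to be introduced.
Described above are examples of various methods for the reduction or substantial elimination of expression in a plant of an endogenous gene. A person skilled in the art would readily be able to adapt the aforementioned silencing methods so as to reduce expression of an endogenous gene in a whole plant or in parts thereof, for example through the use of an appropriate promoter.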
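Selectable marker (gene)/Reporter gene
"Selectable marker", "selectable marker gene" or "reporter gene" includes any gene that confers a phenotype on a cell in which it is expressed, to facilitate the identification and/or selection of cells that are transfected or transformed with a nucleic acid construct of the invention. These marker genes enable the identification of a successful transfer of the nucleic acid molecules via a series of different principles. Suitable markers may be selected from markers that confer antibiotic or herbicide resistance, that introduce a new metabolic trait or that allow visual selection. Examples of selectable marker genes include genes conferring resistance to antibiotics (such as nptII, which phosphorylates neomycin and kanamycin, or hpt, which phosphorylates hygromycin, or genes conferring resistance to, for example, bleomycin, streptomycin, tetracycline, chloramphenicol, ampicillin, gentamycin, geneticin (G418), spectinomycin or blasticidin), genes conferring resistance to herbicides (for example bar, which provides resistance to Basta; aroA or gox, which provide resistance to glyphosate; or genes conferring resistance to, for example, imidazolinone, phosphinothricin or sulfonylurea), or genes that provide a metabolic trait (such as manA, which allows plants to use mannose as sole carbon source, or xylose isomerase for the utilisation of xylose, or antinutritive markers such as resistance to 2-deoxyglucose). Expression of visual marker genes results in the formation of colour (for example β-glucuronidase, GUS, or β-galactosidase with its coloured substrates, for example X-Gal), luminescence (such as the luciferin/luciferase system) or fluorescence (Green Fluorescent Protein, GFP, and derivatives thereof). This list represents only a small number of possible markers. The skilled worker is familiar with such markers. Different markers are preferred, depending on the organism and the selection method.
It is known that upon stable or transient integration of nucleic acids into plant cells, only a minority of the cells takes up the foreign DNA and, if desired, integrates it into its genome, depending on the expression vector used and the transfection technique used. To identify and select these integrants, a gene coding for a selectable marker (such as the ones described above) is usually introduced into the host cells together with the gene of interest. These markers can be used, for example, in mutants in which these genes are not functional because of, for example, deletion by conventional methods. Furthermore, nucleic acid molecules encoding a selectable marker can be introduced into a host cell on the same vector that comprises the sequence encoding the polypeptides of the invention or used in the methods of the invention, or else on a separate vector. Cells which have been stably transfected with the introduced nucleic acid can be identified, for example, by selection (for example, cells which have integrated the selectable marker survive whereas the other cells die). The marker genes may be removed or excised from the transgenic cell once they are no longer needed. Techniques for marker gene removal are known in the art; useful techniques are described above in the definitions section.
Since the marker genes, particularly genes for resistance to antibiotics and herbicides, are no longer required or are undesired in the transgenic host cell once the nucleic acids have been introduced successfully, the process according to the invention for introducing the nucleic acids advantageously employs techniques which enable the removal or excision of these marker genes. One such method is what is known as co-transformation. The co-transformation method employs two vectors simultaneously for the transformation, one vector bearing the nucleic acid according to the invention and a second bearing the marker gene(s). A large proportion of transformants receives or, in the case of plants, comprises (up to 40% or more of the transformants) both vectors. In the case of transformation with Agrobacteria, the transformants usually receive only a part of the vector, i.e. the sequence flanked by the T-DNA, which usually represents the expression cassette. The marker genes can subsequently be removed from the transformed plant by performing crosses. In another method, marker genes integrated into a transposon are used for the transformation together with the desired nucleic acid (known as the Ac/Ds technology). The transformants can be crossed with a transposase source, or the transformants are transformed, transiently or stably, with a nucleic acid construct conferring expression of a transposase. In some cases (approximately 10%), the transposon jumps out of the genome of the host cell once transformation has taken place successfully and is lost. In a further number of cases, the transposon jumps to a different location. In these cases the marker gene must be eliminated by performing crosses. In microbiology, techniques have been developed which make possible, or facilitate, the detection of such events. A further advantageous method relies on what are known as recombination systems, whose advantage is that elimination by crossing can be dispensed with. The best-known system of this type is what is known as the Cre/lox system. Cre is a recombinase that removes the sequences located between the loxP sequences. If the marker gene is integrated between the loxP sequences, it is removed by expression of the recombinase once transformation has taken place successfully. Further recombination systems are the HIN/HIX, FLP/FRT and REP/STB systems (Tribble et al., J. Biol. Chem., 275, 2000: 22255-22267; Velmurugan et al., J. Cell Biol., 149, 2000: 553-566). A site-specific integration of the nucleic acid sequences according to the invention into the plant genome is possible. Naturally, these methods can also be applied to microorganisms such as yeast, fungi or bacteria.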
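Transgenic/Transgene/Recombinant
For the purposes of the invention, "transgenic", "transgene" or "recombinant" means, with regard to for example a nucleic acid sequence, an expression cassette, gene construct or vector comprising the nucleic acid sequence, or an organism transformed with the nucleic acid sequences, expression cassettes or vectors according to the invention, all those constructions brought about by recombinant methods in which either
(a) the nucleic acid sequences encoding proteins useful in the methods of the invention, or
(b) genetic control sequence(s) which is/are operably linked with the nucleic acid sequence according to the invention, for example a promoter, or
(c) a) and b)
are not located in their natural genetic environment or have been modified by recombinant methods, it being possible for the modification to take the form of, for example, a substitution, addition, inversion or insertion of one or more nucleotide residues. The natural genetic environment is understood as meaning the natural genomic or chromosomal locus in the original plant or the presence in a genomic library. In the case of a genomic library, the natural genetic environment of the nucleic acid sequence is preferably retained, at least in part. The environment flanks the nucleic acid sequence at least on one side and has a sequence length of at least 50 bp, preferably at least 500 bp, especially preferably at least 1000 bp, most preferably at least 5000 bp. A naturally occurring expression cassette, for example the naturally occurring combination of the natural promoter of the nucleic acid sequences with the corresponding nucleic acid sequence encoding a polypeptide useful in the methods of the present invention, as defined above, becomes a transgenic expression cassette when this expression cassette is modified by non-natural, synthetic ("artificial") methods such as, for example, mutagenic treatment. Suitable methods are described, for example, in US 5,565,350 or WO 00/15815.
A transgenic plant for the purposes of the invention is thus understood as meaning, as above, that the nucleic acids used in the method of the invention are not at their natural locus in the genome of said plant, it being possible for the nucleic acids to be expressed homologously or heterologously. However, as mentioned, transgenic also means that, while the nucleic acids according to the invention or used in the inventive method are at their natural position in the genome of a plant, the sequence has been modified with regard to the natural sequence, and/or the regulatory sequences of the natural sequences have been modified. Transgenic is preferably understood as meaning the expression of the nucleic acids according to the invention at an unnatural locus in the genome, i.e. homologous or, preferably, heterologous expression of the nucleic acids takes place. Preferred transgenic plants are mentioned herein.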
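Transformation
The term "introduction" or "transformation" as referred to herein encompasses the transfer of an exogenous polynucleotide into a host cell, irrespective of the method used for transfer. Plant tissue capable of subsequent clonal propagation, whether by organogenesis or embryogenesis, may be transformed with a genetic construct of the present invention and a whole plant regenerated therefrom. The particular tissue chosen will vary depending on the clonal propagation systems available for, and best suited to, the particular species being transformed. Exemplary tissue targets include leaf disks, pollen, embryos, cotyledons, hypocotyls, megagametophytes, callus tissue, existing meristematic tissue (e.g., apical meristem, axillary buds, and root meristems), and induced meristem tissue (e.g., cotyledon meristem and hypocotyl meristem). The polynucleotide may be transiently or stably introduced into a host cell and may be maintained non-integrated, for example as a plasmid. Alternatively, it may be integrated into the host genome. The resulting transformed plant cell may then be used to regenerate a transformed plant in a manner known to persons skilled in the art.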
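The transfer of foreign genes into the genome of a plant is called transformation. Transformation of plant species is now a fairly routine technique. Advantageously, any of several transformation methods may be used to introduce the gene of interest into a suitable ancestor cell. The methods described for the transformation and regeneration of plants from plant tissues or plant cells may be utilized for transient or for stable transformation. Transformation methods include the use of liposomes, electroporation, chemicals that increase free DNA uptake, injection of the DNA directly into the plant, particle gun bombardment, transformation using viruses or pollen, and microprojection. Methods may be selected from the calcium/polyethylene glycol method for protoplasts (Krens, E.A. et al., (1982) Nature 296, 72-74; Negrutiu I et al. (1987) Plant Mol Biol 8: 363-373); electroporation of protoplasts (Shillito R.D. et al. (1985) Bio/Technol 3, 1099-1102); microinjection into plant material (Crossway A et al., (1986) Mol. Gen Genet 202: 179-185); DNA- or RNA-coated particle bombardment (Klein TM et al., (1987) Nature 327: 70); infection with (non-integrative) viruses, and the like. Transgenic plants, including transgenic crop plants, are preferably produced via Agrobacterium-mediated transformation. An advantageous transformation method is transformation in planta. To this end, it is possible, for example, to allow the agrobacteria to act on plant seeds or to inoculate the plant meristem with agrobacteria. It has proved particularly expedient in accordance with the invention to allow a suspension of transformed agrobacteria to act on the intact plant or at least on the flower primordia. The plant is subsequently grown on until the seeds of the treated plant are obtained (Clough and Bent, Plant J. (1998) 16, 735-743). Methods for Agrobacterium-mediated transformation of rice include well-known methods for rice transformation, such as those described in any of the following: European patent application EP 1198985 A1, Aldemita and Hodges (Planta 199: 612-617, 1996); Chan et al. (Plant Mol Biol 22(3): 491-506, 1993), Hiei et al. (Plant J 6(2): 271-282, 1994), which disclosures are incorporated by reference herein as if fully set forth. In the case of corn transformation, the preferred method is as described in either Ishida et al. (Nat. Biotechnol 14(6): 745-50, 1996) or Frame et al. (Plant Physiol 129(1): 13-22, 2002), which disclosures are incorporated by reference herein as if fully set forth. Said methods are further described, by way of example, in B. Jenes et al., Techniques for Gene Transfer, in: Transgenic Plants, Vol. 1, Engineering and Utilization, eds. S.D. Kung and R. Wu, Academic Press (1993) 128-143 and in Potrykus, Annu. Rev. Plant Physiol. Plant Molec. Biol. 42 (1991) 205-225. The nucleic acids or the construct to be expressed is preferably cloned into a vector which is suitable for transforming Agrobacterium tumefaciens, for example pBin19 (Bevan et al., Nucl. Acids Res. 12 (1984) 8711). Agrobacteria transformed by such a vector can then be used in known manner for the transformation of plants, such as plants used as a model, like Arabidopsis (Arabidopsis thaliana is within the scope of the present invention not considered a crop plant), or crop plants such as, by way of example, tobacco plants, for example by immersing bruised leaves or chopped leaves in an agrobacterial solution and then culturing them in suitable media. The transformation of plants by means of Agrobacterium tumefaciens is described, for example, by Höfgen and Willmitzer in Nucl. Acid Res. (1988) 16, 9877, or is known inter alia from F.F. White, Vectors for Gene Transfer in Higher Plants; in Transgenic Plants, Vol. 1, Engineering and Utilization, eds. S.D. Kung and R. Wu, Academic Press, 1993, pp. 15-38.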
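In addition to the transformation of somatic cells, which then have to be regenerated into intact plants, it is also possible to transform the cells of plant meristems and in particular those cells which develop into gametes. In this case, the transformed gametes follow the natural plant development, giving rise to transgenic plants. Thus, for example, seeds of Arabidopsis are treated with agrobacteria and seeds are obtained from the developing plants, of which a certain proportion is transformed and thus transgenic [Feldman, KA and Marks MD (1987) Mol Gen Genet 208: 274-289; Feldmann K (1992), in: C Koncz, N-H Chua and J Shell, eds, Methods in Arabidopsis Research. World Scientific, Singapore, pp. 274-289]. Alternative methods are based on the repeated removal of the inflorescences and incubation of the excision site in the centre of the rosette with transformed agrobacteria, whereby transformed seeds can likewise be obtained at a later point in time (Chang (1994) Plant J. 5: 551-558; Katavic (1994) Mol Gen Genet, 245: 363-370). However, an especially effective method is the vacuum infiltration method with its modifications, such as the "floral dip" method. In the case of vacuum infiltration of Arabidopsis, intact plants under reduced pressure are treated with an agrobacterial suspension [Bechthold, N (1993). C R Acad Sci Paris Life Sci, 316: 1194-1199], while in the case of the "floral dip" method the developing floral tissue is incubated briefly with a surfactant-treated agrobacterial suspension [Clough, SJ and Bent, AF (1998) The Plant J. 16, 735-743]. A certain proportion of transgenic seeds is harvested in both cases, and these seeds can be distinguished from non-transgenic seeds by growing them under the selective conditions described above. In addition, the stable transformation of plastids is advantageous because plastids are inherited maternally in most crops, which reduces or eliminates the risk of transgene flow through pollen. The transformation of the chloroplast genome is generally achieved by a process which has been schematically displayed in Klaus et al., 2004 [Nature Biotechnology 22(2), 225-229]. Briefly, the sequences to be transformed are cloned together with a selectable marker gene between flanking sequences homologous to the chloroplast genome. These homologous flanking sequences direct site-specific integration into the plastome. Plastid transformation has been described for many different plant species, and an overview is given in Bock (2001) Transgenic plastids in basic research and plant biotechnology. J Mol Biol. 2001; 312(3): 425-38, or Maliga, P (2003) Progress towards commercialization of plastid transformation technology. Trends Biotechnol. 21, 20-28. Further biotechnological progress has recently been reported in the form of marker-free plastid transformants, which can be produced by a transient co-integrated marker gene (Klaus et al., 2004, Nature Biotechnology 22(2), 225-229).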
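T-DNA activation tagging
T-DNA activation tagging (Hayashi et al., Science (1992) 1350-1353) involves insertion of T-DNA, usually containing a promoter (which may also be a translation enhancer or an intron), in the genomic region of the gene of interest or 10 kb upstream or downstream of the coding region of a gene, in a configuration such that the promoter directs expression of the targeted gene. Typically, regulation of expression of the targeted gene by its natural promoter is disrupted and the gene falls under the control of the newly introduced promoter. The promoter is typically embedded in a T-DNA. This T-DNA is randomly inserted into the plant genome, for example through Agrobacterium infection, and leads to modified expression of genes near the inserted T-DNA. The resulting transgenic plants show dominant phenotypes due to modified expression of genes close to the introduced promoter.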
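TILLING
The term "TILLING" is an abbreviation of "Targeted Induced Local Lesions In Genomes" and refers to a mutagenesis technology useful to generate and/or identify nucleic acids encoding proteins with modified expression and/or activity. TILLING also allows selection of plants carrying such mutant variants. These mutant variants may exhibit modified expression, either in strength or in location or in timing (if the mutations affect the promoter, for example). These mutant variants may exhibit higher activity than that exhibited by the gene in its natural form. TILLING combines high-density mutagenesis with high-throughput screening methods. The steps typically followed in TILLING are: (a) EMS mutagenesis (Redei GP and Koncz C (1992) In Methods in Arabidopsis Research, Koncz C, Chua NH, Schell J, eds. Singapore, World Scientific Publishing Co, pp. 16-82; Feldmann et al., (1994) In Meyerowitz EM, Somerville CR, eds, Arabidopsis. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, pp. 137-172; Lightner and Caspar T (1998) In J Martinez-Zapater, J Salinas, eds, Methods on Molecular Biology, Vol. 82. Humana Press, Totowa, NJ, pp. 91-104); (b) DNA preparation and pooling of individuals; (c) PCR amplification of a region of interest; (d) denaturation and annealing to allow formation of heteroduplexes; (e) DHPLC, where the presence of a heteroduplex in a pool is detected as an extra peak in the chromatogram; (f) identification of the mutant individual; and (g) sequencing of the mutant PCR product. Methods for TILLING are well known in the art (McCallum et al., (2000) Nat Biotechnol 18: 455-457; reviewed by Stemple (2004) Nat Rev Genet 5(2): 145-50).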
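Homologous recombination
Homologous recombination allows introduction in a genome of a selected nucleic acid at a defined selected position. Homologous recombination is a standard technology used routinely in biological sciences for lower organisms such as yeast or the moss Physcomitrella. Methods for performing homologous recombination in plants have been described not only for model plants (Offringa et al. (1990) EMBO J 9(10): 3077-84) but also for crop plants, for example rice (Terada et al. (2002) Nat Biotech 20(10): 1030-4; Iida and Terada (2004) Curr Opin Biotech 15(2): 132-8), and approaches exist that are generally applicable regardless of the target organism (Miller et al., Nature Biotechnol. 25, 778-785, 2007).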
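Yield
The term "yield" in general means a measurable produce of economic value, typically related to a specified crop, to an area, and to a period of time. Individual plant parts directly contribute to yield based on their number, size and/or weight, or the actual yield is the yield per square metre for a crop and year, which is determined by dividing total production (including both harvested and appraised production) by planted square metres. The term "yield" of a plant may relate to vegetative biomass (root and/or shoot biomass), to reproductive organs, and/or to propagules (such as seeds) of that plant.
Early vigour
"Early vigour" refers to active, healthy, well-balanced growth, especially during early stages of plant growth, and may result from increased plant fitness due to, for example, the plants being better adapted to their environment (i.e. optimizing the use of energy resources and partitioning between shoot and root). Plants having early vigour also show increased seedling survival and a better establishment of the crop, which often results in highly uniform fields (with the crop growing in uniform manner, i.e. with the majority of plants reaching the various stages of development at substantially the same time), and often better and higher yield. Therefore, early vigour may be determined by measuring various factors, such as thousand kernel weight, percentage germination, percentage emergence, seedling growth, seedling height, root length, root and shoot biomass and many more.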
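Increase/Improve/Enhance
The terms "increase", "improve" or "enhance" are interchangeable and shall mean in the sense of the application at least 3%, 4%, 5%, 6%, 7%, 8%, 9% or 10%, preferably at least 15% or 20%, more preferably 25%, 30%, 35% or 40% more yield and/or growth in comparison to control plants as defined herein.
Seed yield
Increased seed yield may manifest itself as one or more of the following: (a) an increase in seed biomass (total seed weight), which may be on an individual seed basis and/or per plant and/or per square metre; (b) increased number of flowers per plant; (c) increased number of (filled) seeds; (d) increased seed filling rate (which is expressed as the ratio between the number of filled seeds and the total number of seeds); (e) increased harvest index, which is expressed as the ratio of the yield of harvestable parts, such as seeds, to the total biomass; (f) increased thousand kernel weight (TKW); (g) increased number of primary panicles, which is extrapolated from the number of filled seeds counted and their total weight. An increased TKW may result from an increased seed size and/or seed weight, and may also result from an increase in embryo and/or endosperm size.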
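The ratios named above can be sketched as follows. This is a minimal illustration, not the applicant's measurement protocol: the function and parameter names are hypothetical, and the TKW formula (weight of the counted filled seeds scaled to 1000 seeds) is an assumed common definition rather than one given in the text.

```python
def seed_yield_metrics(filled_seeds, total_seeds, filled_seed_weight_g, total_biomass_g):
    """Return seed fill rate, harvest index and thousand kernel weight (TKW).

    All parameter names are illustrative. Weights are in grams.
    """
    # (d) seed filling rate: filled seeds / total seeds
    fill_rate = filled_seeds / total_seeds
    # (e) harvest index: yield of harvestable parts (here seeds) / total biomass
    harvest_index = filled_seed_weight_g / total_biomass_g
    # (f) TKW, assumed here to be extrapolated from seed count and seed weight
    tkw_g = filled_seed_weight_g / filled_seeds * 1000
    return {"fill_rate": fill_rate, "harvest_index": harvest_index, "tkw_g": tkw_g}


def percent_increase(transgenic_value, control_value):
    """Relative increase of a trait versus the control plant, in percent."""
    return (transgenic_value - control_value) / control_value * 100.0
```

For example, `percent_increase(11.0, 10.0)` gives an increase of about 10%, which would meet the lowest "at least 3% to 10%" tier of the definition of "increase" given above.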
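An increase in seed yield may also be manifested as an increase in seed size and/or seed volume. Furthermore, an increase in seed yield may also manifest itself as an increase in seed area and/or seed length and/or seed width and/or seed perimeter. Increased seed yield may also result in modified architecture, or may occur because of modified architecture.
Greenness index
The "greenness index" as used herein is calculated from digital images of plants. For each pixel belonging to the plant object on the image, the ratio of the green value versus the red value (in the RGB model for encoding colour) is calculated. The greenness index is expressed as the percentage of pixels for which the green-to-red ratio exceeds a given threshold. Under normal growth conditions, under salt stress growth conditions, and under reduced nutrient availability growth conditions, the greenness index of plants is measured in the last imaging before flowering. In contrast, under drought stress growth conditions, the greenness index of plants is measured in the first imaging after drought.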
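The greenness index calculation described above can be sketched in a few lines. This is an illustrative sketch under stated assumptions: the pixels are assumed to be already segmented as belonging to the plant, the threshold value is left to the caller because the text does not give one, and the handling of a zero red value is a choice made here, not something the text specifies.

```python
def greenness_index(plant_pixels, threshold):
    """Percentage of plant pixels whose green/red ratio exceeds `threshold`.

    plant_pixels: iterable of (R, G, B) tuples that belong to the plant object.
    threshold: green-to-red ratio cutoff (hypothetical; not given in the text).
    """
    total = 0
    above = 0
    for r, g, _b in plant_pixels:
        total += 1
        if r > 0:
            ratio = g / r
        else:
            # Assumption: with no red signal, any green counts as above threshold.
            ratio = float("inf") if g > 0 else 0.0
        if ratio > threshold:
            above += 1
    if total == 0:
        raise ValueError("no plant pixels supplied")
    return 100.0 * above / total
```

With four pixels whose green/red ratios are 1.5, 0.9, 2.0 and 0.5 and a threshold of 1.2, two of the four pixels pass, so the index is 50%.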
Plant
The term "plant" as used herein encompasses whole plants, ancestors and progeny of the plants, and plant parts, including seeds, shoots, stems, leaves, roots (including tubers), flowers, and tissues and organs, wherein each of the aforementioned comprises the gene/nucleic acid of interest. The term "plant" also encompasses plant cells, suspension cultures, callus tissue, embryos, meristematic regions, gametophytes, sporophytes, pollen and microspores, again wherein each of the aforementioned comprises the gene/nucleic acid of interest.
Plants that are particularly useful in the methods of the invention include all plants which belong to the superfamily Viridiplantae, in particular monocotyledonous and dicotyledonous plants, including fodder or forage legumes, ornamental plants, food crops, trees or shrubs, selected from the list comprising: Acer spp., Actinidia spp., Abelmoschus spp., Agave sisalana, Agropyron spp., Agrostis stolonifera, Allium spp., Amaranthus spp., Ammophila arenaria, Ananas comosus, Annona spp., Apium graveolens, Arachis spp., Artocarpus spp., Asparagus officinalis, Avena spp. (e.g. Avena sativa, Avena fatua, Avena byzantina, Avena fatua var. sativa, Avena hybrida), Averrhoa carambola, Bambusa sp., Benincasa hispida, Bertholletia excelsea, Beta vulgaris, Brassica spp. (e.g. Brassica napus, Brassica rapa ssp. [canola, oilseed rape, turnip rape]), Cadaba farinosa, Camellia sinensis, Canna indica, Cannabis sativa, Capsicum spp., Carex elata, Carica papaya, Carissa macrocarpa, Carya spp., Carthamus tinctorius, Castanea spp., Ceiba pentandra, Cichorium endivia, Cinnamomum spp., Citrullus lanatus, Citrus spp., Cocos spp., Coffea spp., Colocasia esculenta, Cola spp., Corchorus sp., Coriandrum sativum, Corylus spp., Crataegus spp., Crocus sativus, Cucurbita spp., Cucumis spp., Cynara spp., Daucus carota, Desmodium spp., Dimocarpus longan, Dioscorea spp., Diospyros spp., Echinochloa spp., Elaeis (e.g. Elaeis guineensis, Elaeis oleifera), Eleusine coracana, Erianthus sp., Eriobotrya japonica, Eucalyptus sp., Eugenia uniflora, Fagopyrum spp., Fagus spp., Festuca arundinacea, Ficus carica, Fortunella spp., Fragaria spp., Ginkgo biloba, Glycine spp. (e.g. Glycine max, Soja hispida or Soja max), Gossypium hirsutum, Helianthus spp. (e.g. Helianthus annuus), Hemerocallis fulva, Hibiscus spp., Hordeum spp. (e.g. Hordeum vulgare), Ipomoea batatas, Juglans spp., Lactuca sativa, Lathyrus spp., Lens culinaris, Linum usitatissimum, Litchi chinensis, Lotus spp., Luffa acutangula, Lupinus spp., Luzula sylvatica, Lycopersicon spp. (e.g. Lycopersicon esculentum, Lycopersicon lycopersicum, Lycopersicon pyriforme), Macrotyloma spp., Malus spp., Malpighia emarginata, Mammea americana, Mangifera indica, Manihot spp., Manilkara zapota, Medicago sativa, Melilotus spp., Mentha spp., Miscanthus sinensis, Momordica spp., Morus nigra, Musa spp., Nicotiana spp., Olea spp., Opuntia spp., Ornithopus spp., Oryza spp. (e.g. Oryza sativa, Oryza latifolia), Panicum miliaceum, Panicum virgatum, Passiflora edulis, Pastinaca sativa, Pennisetum sp., Persea spp., Petroselinum crispum, Phalaris arundinacea, Phaseolus spp., Phleum pratense, Phoenix spp., Phragmites australis, Physalis spp., Pinus spp., Pistacia vera, Pisum spp., Poa spp., Populus spp., Prosopis spp., Prunus spp., Psidium spp., Punica granatum, Pyrus communis, Quercus spp., Raphanus sativus, Rheum rhabarbarum, Ribes spp., Ricinus communis, Rubus spp., Saccharum spp., Salix sp., Sambucus spp., Secale cereale, Sesamum spp., Sinapis sp., Solanum spp. (e.g. Solanum tuberosum, Solanum integrifolium or Solanum lycopersicum), Sorghum bicolor, Spinacia spp., Syzygium spp., Tagetes spp., Tamarindus indica, Theobroma cacao, Trifolium spp., Triticale sp., Triticosecale rimpaui, Triticum spp. (e.g. Triticum aestivum, Triticum durum, Triticum turgidum, Triticum hybernum, Triticum macha, Triticum sativum or Triticum vulgare), Tropaeolum minus, Tropaeolum majus, Vaccinium spp., Vicia spp., Vigna spp., Viola odorata, Vitis spp., Zea mays, Zizania palustris, Ziziphus spp., amongst others.
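Detailed description of the invention
Surprisingly, it has now been found that modulating expression in a plant of a nucleic acid encoding a PATL polypeptide, or a PRP38 polypeptide, or a GATA-like polypeptide, or an ADA2 polypeptide, or a WDR23-like polypeptide gives plants having enhanced yield-related traits relative to control plants. According to a first embodiment, the present invention provides a method for enhancing yield-related traits in plants relative to control plants, comprising modulating expression in a plant of a nucleic acid encoding a PATL polypeptide, or a PRP38 polypeptide, or a GATA-like polypeptide, or an ADA2 polypeptide, or a WDR23-like polypeptide.
In one embodiment, it has now surprisingly been found that modulating (preferably increasing) expression in a plant of a nucleic acid encoding a GATA-like polypeptide gives plants having increased thousand kernel weight (TKW) relative to control plants. According to one embodiment, the present invention provides a method for increasing TKW in plants relative to control plants, comprising modulating (preferably increasing) expression in a plant of a nucleic acid encoding a GATA-like polypeptide.
In another embodiment, it has now been found that increasing expression in a plant of a nucleic acid sequence encoding a WDR23-like polypeptide as defined herein gives plants having enhanced yield-related traits relative to control plants. The present invention provides nucleic acid sequences encoding WDR23-like polypeptides, and WDR23-like polypeptides, whereby increased expression of the isolated nucleic acid sequence in a plant enhances yield-related traits relative to control plants.
A preferred method for modulating (preferably increasing) expression of a nucleic acid encoding a PATL polypeptide, or a PRP38 polypeptide, or a GATA-like polypeptide, or an ADA2 polypeptide, or a WDR23-like polypeptide is by introducing and expressing in a plant a nucleic acid encoding such a polypeptide and, optionally, selecting for plants having enhanced yield-related traits.
Any reference hereinafter to a "protein useful in the methods of the invention" is taken to mean a PATL polypeptide, or a PRP38 polypeptide, or a GATA-like polypeptide, or an ADA2 polypeptide, or a WDR23-like polypeptide as defined herein. Any reference hereinafter to a "nucleic acid useful in the methods of the invention" is taken to mean a nucleic acid capable of encoding such a PATL, PRP38, GATA-like, ADA2 or WDR23-like polypeptide. The nucleic acid to be introduced into a plant (and therefore useful in performing the methods of the invention) is any nucleic acid sequence encoding the type of polypeptide described below, also named "PATL nucleic acid" or "PATL gene", or "PRP38 nucleic acid" or "PRP38 gene", or "GATA-like nucleic acid" or "GATA-like gene", or "ADA2 nucleic acid" or "ADA2 gene", or "WDR23-like nucleic acid sequence" or "WDR23-like gene".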
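The term "PATL polypeptide" as defined herein refers to any polypeptide comprising:
(i) a SEC14 domain and/or
(ii) a GOLD domain.
The SEC14 and GOLD domains are typically located in the C-terminal region of a PATL polypeptide. The N-terminus of PATL polypeptides is variable in length and has a more divergent amino acid sequence than the C-terminus. The N-terminus is characterized by being rich in acidic amino acids (pI, the isoelectric point, of approximately 4), for example glutamic acid (E), and by containing multiple EEK (glutamic acid-glutamic acid-lysine) repeats. In addition, the N-terminal sequence is predicted to comprise one or more coiled coils, a common protein oligomerization-folding motif. Coiled coil domains typically contain hydrophobic and hydrophilic amino acid residues that form alpha-helices which, in the cytoplasmic environment, can pack their hydrophobic strands against each other, sandwiched between the hydrophilic amino acids, thereby providing the thermodynamic driving force for dimerization. Coiled coils can readily be identified using methods and software well described in the art, for example COILS or PAIRCOIL2 (Lupas et al.; 1991. Science, 252, 1162-1164; MacDonnell et al. 2006. Bioinformatics; 22(3): 356-8). Figure 1 shows the characteristic EEK repeats and coiled coil domains of SEQ ID NO: 2.
Preferred PATL polypeptides useful in the methods of the invention comprise an acidic N-terminus having one or more of the following features:
(i) an isoelectric point which is, in increasing order of preference, lower than 5, 4.5, 4, 3.5 or 3;
(ii) one or more coiled coils, preferably, in increasing order of preference, one, two, three, four or five;
(iii) one or more EEK repeats, preferably, in increasing order of preference, one, two, three, four, five, six or seven.
The N-terminus of a PATL polypeptide is herein the portion of the protein extending N-terminally of the SEC14 domain.
The isoelectric point of a polypeptide, i.e. the pH at which a particular molecule or surface carries no net electrical charge, may be calculated by methods known in the art (Stryer, 1995), for example using the Henderson-Hasselbalch equation (Henderson (1908), Am. J. Physiol., 21, 173-179; Hasselbalch (1917), Biochemische Zeitschrift, 78, 112-144; de Levie, (2003) J. Chem. Educ., 80, 146).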
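An isoelectric point estimate based on the Henderson-Hasselbalch equation can be sketched as below. This is a rough illustration, not the calculation used by the applicant: the pKa values are approximate textbook-style numbers chosen here as an assumption (published pKa sets differ), and the function names are hypothetical. The net charge is summed over ionisable groups and the pH at which it crosses zero is found by bisection. A simple count of EEK repeats is included for the N-terminal feature named above.

```python
# Approximate pKa values (assumption; different published sets give different pI).
PKA_POSITIVE = {"K": 10.5, "R": 12.5, "H": 6.0, "Nterm": 9.0}
PKA_NEGATIVE = {"D": 3.9, "E": 4.1, "C": 8.3, "Y": 10.1, "Cterm": 2.0}


def net_charge(seq, ph):
    """Net charge of a peptide at a given pH via Henderson-Hasselbalch."""
    seq = seq.upper()
    # Free amino terminus (positive) and free carboxy terminus (negative).
    charge = 1.0 / (1.0 + 10 ** (ph - PKA_POSITIVE["Nterm"]))
    charge -= 1.0 / (1.0 + 10 ** (PKA_NEGATIVE["Cterm"] - ph))
    for aa in seq:
        if aa in PKA_POSITIVE:
            charge += 1.0 / (1.0 + 10 ** (ph - PKA_POSITIVE[aa]))
        elif aa in PKA_NEGATIVE:
            charge -= 1.0 / (1.0 + 10 ** (PKA_NEGATIVE[aa] - ph))
    return charge


def isoelectric_point(seq, lo=0.0, hi=14.0, tol=1e-4):
    """pH at which the net charge is zero, located by bisection.

    Net charge decreases monotonically with pH, so bisection is sufficient.
    """
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if net_charge(seq, mid) > 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


def count_eek_repeats(seq):
    """Number of non-overlapping EEK tripeptides in the sequence."""
    return seq.upper().count("EEK")
```

A peptide rich in glutamic acid, such as the toy sequence `EEKEEKEEK`, comes out clearly acidic with these pKa values, which is the behaviour the text attributes to the PATL N-terminus.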
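Typically, a PATL polypeptide comprises a SEC14 domain following the N-terminus described above. The SEC14 domain in PATL polypeptides comprises a region homologous to the yeast Sec14p protein, which contains a phospholipid-binding pocket. The E, K and G amino acid residues involved in lipid binding and/or transfer in yeast Sec14p are conserved in PATL polypeptides and correspond to E434, K465 and G493 in SEQ ID NO: 2 (Figure 1). Glutamic acid E434 and lysine K465 of SEQ ID NO: 2 are the homologous residues of E207 and K239 of the yeast Sec14p protein, which form a salt bridge involved in selecting and binding PtdIns. The hydrophobic pocket of yeast Sec14p is also conserved in PATL polypeptides.
PATL polypeptides may comprise a GOLD domain, typically located at the C-terminus. The GOLD domain in PATL polypeptides is rich in lysine residues and comprises the conserved sequence KX(10-11)(K/R/T)KKKX(0-1)(L/V/A)(L/V/A)YR (see Figure 2), which resembles the PtdIns(4,5)P2 binding motif found in clathrin-coated vesicle proteins.
Preferred PATL polypeptides useful in the methods of the invention comprise at least one of the following domains:
(i) a SEC14 domain as represented by SEQ ID NO: 71: lpeldsvvfyrgadreghpvcynvygefqdkdlyekafgdeekrerflkwriqllergilsqldfspsgicsmvqvtdlknsppmlgkhravtrqavallqdnypefiakkvfinvpwwylaankmmspfltqrtkskfifaspaksaetlfryiapeqvpvqfgglfk, or a domain having, in increasing order of preference, at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 82%, 85%, 90%, 92%, 95%, 97% or more sequence identity to the domain represented by SEQ ID NO: 71 or to any SEC14 domain present in any of the polypeptides of Table A;
(ii) a GOLD domain as represented by SEQ ID NO: 72: sdavteltikpssketveipvtenstigwelrvlgwevsygaeftpdaeggytvivqktrkvpaneepimkgsfkvgepgkivltinnpaskkkkllyrskv, or a domain having, in increasing order of preference, at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 82%, 85%, 90%, 92%, 95%, 97% or more sequence identity to the domain represented by SEQ ID NO: 72 or to any GOLD domain present in any of the polypeptides of Table A.
Further preferred PATL polypeptides useful in the methods of the invention comprise at least one conserved motif: Motif Ia and Motif IIa as represented by SEQ ID NO: 69/L(L/T)KFLRAR and SEQ ID NO: 70/(L/F)(Q/E)DNYPEF, respectively.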
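The consensus patterns quoted above (the GOLD domain consensus and Motifs Ia and IIa) translate directly into regular expressions, where `X(10-11)` becomes `.{10,11}` and an alternative such as `(L/T)` becomes `[LT]`. The following is a minimal sketch using Python's standard `re` module; the function name and the returned structure are illustrative choices, and exact-match scanning of this kind does not replace the identity-based or profile-based comparisons described in the text.

```python
import re

# KX(10-11)(K/R/T)KKKX(0-1)(L/V/A)(L/V/A)YR
GOLD_CONSENSUS = re.compile(r"K.{10,11}[KRT]KKK.?[LVA][LVA]YR")
# SEQ ID NO: 69, Motif Ia: L(L/T)KFLRAR
MOTIF_IA = re.compile(r"L[LT]KFLRAR")
# SEQ ID NO: 70, Motif IIa: (L/F)(Q/E)DNYPEF
MOTIF_IIA = re.compile(r"[LF][QE]DNYPEF")


def find_patl_motifs(protein):
    """Return the first match (start index, matched text) for each pattern, or None."""
    seq = protein.upper()
    found = {}
    for name, pattern in (("GOLD", GOLD_CONSENSUS), ("Ia", MOTIF_IA), ("IIa", MOTIF_IIA)):
        m = pattern.search(seq)
        found[name] = (m.start(), m.group(0)) if m else None
    return found
```

Applied to the C-terminal end of the GOLD domain of SEQ ID NO: 72 (`...kvgepgkivltinnpaskkkkllyrskv`), the GOLD pattern matches `KIVLTINNPASKKKKLLYR`; applied to the stretch `avallqdnypefiakk` of SEQ ID NO: 71, the Motif IIa pattern matches `LQDNYPEF`.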
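PATL polypeptides useful in the methods of the invention are preferably those having, in increasing order of preference, at least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 92%, 93%, 95%, 96%, 98% or more sequence identity to any of the polypeptides given in Table A.
The SEC14 and GOLD domains comprised in a PATL polypeptide may be identified by searching specialized databases containing collections of multiple sequence alignments and hidden Markov models covering conserved protein domains and families, such as Pfam, available from the Sanger Institute, United Kingdom. Alternatively, the above domains may be found by scanning The Integrated Resource of Protein Families, Domains and Sites (InterPro) database to detect a significant sequence alignment with a known SEC14 or GOLD domain (see Example 4). The InterPro database is an integrated interface to the commonly used signature databases for text- and sequence-based searches. The InterPro database combines these databases, which use different methodologies and a varying degree of biological information about well-characterized proteins to derive protein signatures. Collaborating databases include SWISS-PROT, PROSITE, TrEMBL, PRINTS, ProDom and Pfam, Smart and TIGRFAM. InterPro is maintained by the European Bioinformatics Institute, United Kingdom. A significant alignment between two polypeptides or two domains, as defined herein, is an alignment having, in increasing order of preference, an e-value lower than e-5 (that is, 10 to the power of minus 5), 1.e-10, 1.e-15, 1.e-20, 1.e-25, 1.e-50, 1.e-75, 1.e-100, 1.e-200, 1.e-300, 1.e-400, 1.e-500, 1.e-600, 1.e-700 and 1.e-800. Polypeptide sequences may be aligned using any method known in the art, including global and local alignment methods such as the BLAST algorithm, for example the algorithm described in Altschul, SF, et al. (1990) J. Mol. Biol. 215: 403-10. The probability of an alignment occurring with a given sequence is taken as the basis for identifying similar polypeptides. The parameter typically used to represent such a probability is called the e-value (E-value). The e-value is a measure of the reliability of the S score. The S score is a measure of the similarity of the two sequences being aligned. The e-value describes how often a given S score is expected to occur at random. The e-value cutoff may be as high as 1.0.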
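Preferably, PATL polypeptides useful in the methods of the invention comprise at least one of the following domains:
(i) a SEC14 domain which, when aligned with a known SEC14 domain, more preferably when aligned with any of the SEC14 domains of Table C, has an e-value which is, in increasing order of preference, lower than e-5 (10 to the power of minus 5), 1.e-10, 1.e-15, 1.e-20, 1.e-25, 1.e-50, 1.e-75, 1.e-100, 1.e-200, 1.e-300, 1.e-400, 1.e-500, 1.e-600, 1.e-700 and 1.e-800;
(ii) a GOLD domain which, when aligned with a known GOLD domain, more preferably when aligned with any of the GOLD domains of Table C, has an e-value which is, in increasing order of preference, lower than e-5 (10 to the power of minus 5), 1.e-10, 1.e-15, 1.e-20, 1.e-25, 1.e-50, 1.e-75, 1.e-100, 1.e-200, 1.e-300, 1.e-400, 1.e-500, 1.e-600, 1.e-700 and 1.e-800.
Preferably, the PATL polypeptide sequence useful in the methods of the invention is one which, when used in the construction of a phylogenetic tree such as the one depicted in Figure 3, clusters with any of the sequences in that tree, particularly preferably with the sequences of group Ia comprising SEQ ID NO: 2.
The terms "domain" and "motif" are defined in the "Definitions" section herein. Specialist databases exist for the identification of domains, for example SMART (Schultz et al. (1998) Proc. Natl. Acad. Sci. USA 95, 5857-5864; Letunic et al. (2002) Nucleic Acids Res 30, 242-244), InterPro (Mulder et al., (2003) Nucl. Acids. Res. 31, 315-318), Prosite (Bucher and Bairoch (1994), A generalized profile syntax for biomolecular sequences motifs and its function in automatic sequence interpretation. (In) ISMB-94; Proceedings 2nd International Conference on Intelligent Systems for Molecular Biology. Altman R., Brutlag D., Karp P., Lathrop R., Searls D., Eds., pp. 53-61, AAAI Press, Menlo Park; Hulo et al., Nucl. Acids. Res. 32: D134-D137, (2004)), or Pfam (Bateman et al., Nucleic Acids Research 30(1): 276-280 (2002)). A set of tools for in silico analysis of protein sequences is available on the ExPASy proteomics server (Swiss Institute of Bioinformatics (Gasteiger et al., ExPASy: the proteomics server for in-depth protein knowledge and analysis, Nucleic Acids Res. 31: 3784-3788 (2003))). Domains or motifs may also be identified using routine techniques, such as by sequence alignment.
Methods for the alignment of sequences for comparison are well known in the art; such methods include GAP, BESTFIT, BLAST, FASTA and TFASTA. GAP uses the algorithm of Needleman and Wunsch ((1970) J Mol Biol 48: 443-453) to find the global (i.e. spanning the complete sequences) alignment of two complete sequences that maximizes the number of matches and minimizes the number of gaps. The BLAST algorithm (Altschul et al. (1990) J Mol Biol 215: 403-10) calculates percent sequence identity and performs a statistical analysis of the similarity between the two sequences. The software for performing BLAST analysis is publicly available through the National Centre for Biotechnology Information (NCBI). Homologues may readily be identified using, for example, the ClustalW multiple sequence alignment algorithm (version 1.83), with the default pairwise alignment parameters and a scoring method in percentage. Global percentages of similarity and identity may also be determined using one of the methods available in the MatGAT software package (Campanella et al., BMC Bioinformatics. 2003 Jul 10; 4: 29. MatGAT: an application that generates similarity/identity matrices using protein or DNA sequences). Minor manual editing may be performed to optimize alignment between conserved motifs, as would be apparent to a person skilled in the art. Furthermore, instead of using full-length sequences for the identification of homologues, specific domains may also be used. The sequence identity values may be determined over the entire nucleic acid or amino acid sequence, or over selected domains or conserved motifs, using the programs mentioned above with the default parameters. For local alignments, the Smith-Waterman algorithm is particularly useful (Smith TF, Waterman MS (1981) J. Mol. Biol 147(1); 195-7).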
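As an illustration of the global alignment principle attributed above to Needleman and Wunsch, the following sketch fills a dynamic programming matrix, traces back one optimal alignment and reports percent identity. It is a teaching sketch only: the scoring scheme (match +1, mismatch -1, linear gap -1) is an assumption made here and is not the substitution matrix or gap model used by GAP, BLAST or ClustalW, and the function names are hypothetical.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-1):
    """Global alignment of strings a and b; returns (aligned_a, aligned_b, score)."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,  # align a[i-1] with b[j-1]
                              score[i - 1][j] + gap,      # gap in b
                              score[i][j - 1] + gap)      # gap in a
    # Trace back from the bottom-right corner to recover one optimal alignment.
    top, bottom = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            sub = match if a[i - 1] == b[j - 1] else mismatch
            if score[i][j] == score[i - 1][j - 1] + sub:
                top.append(a[i - 1]); bottom.append(b[j - 1])
                i -= 1; j -= 1
                continue
        if i > 0 and score[i][j] == score[i - 1][j] + gap:
            top.append(a[i - 1]); bottom.append("-")
            i -= 1
        else:
            top.append("-"); bottom.append(b[j - 1])
            j -= 1
    return "".join(reversed(top)), "".join(reversed(bottom)), score[n][m]


def percent_identity(aligned_a, aligned_b):
    """Identical aligned positions as a percentage of the alignment length."""
    same = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    return 100.0 * same / len(aligned_a)
```

Note that percent identity depends on the denominator chosen (alignment length here); other tools may divide by the length of the shorter sequence or exclude gapped columns, so values are only comparable within one convention.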
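Furthermore, PATL polypeptides typically have phosphatidylinositide (PtdIns) and/or phosphatidylcholine (PtdCho) binding activity. Preferably they bind phosphatidylinositides. Phosphatidylinositide refers to any glycophospholipid with its sn-glycerol 3-phosphate residue esterified to the 1-hydroxyl group of 1D-myo-inositol. Alternatively, PATL polypeptides typically catalyze in vitro the transfer of phosphatidylinositol and phosphatidylcholine between membranes. Methods to determine PtdIns/PtdCho binding and/or transfer across membranes are well known in the art (Bankaitis VA et al. (1990) Nature 347: 561-2; Peterman et al. (2004)).
Alternatively, a PATL nucleic acid may be identified in a functional complementation assay by its ability to complement the defective growth phenotype of a Saccharomyces cerevisiae sec14 temperature-sensitive mutant. In this in vivo assay, a PATL nucleic acid capable of expressing a PATL polypeptide in yeast is tested. Several S. cerevisiae sec14 temperature-sensitive mutant strains and methods for their complementation are well known in the art (Kearns et al., (1998) EMBO J. 17, 4004-17; Kapranov et al., (2001); Lee et al., (2000) Biochim Biophys Acta 1486: 55-71).
In addition, PATL polypeptides, PRP38 polypeptides, GATA-like polypeptides or ADA2 polypeptides, when expressed at increased levels in rice according to the methods of the present invention as outlined in Examples 7 and 8, give plants having enhanced yield-related traits, in particular any one or more of total seed weight, seed filling rate, number of filled seeds and total number of flowers per panicle.
The present invention is illustrated by transforming plants with the nucleic acid sequence represented by SEQ ID NO: 1, encoding the polypeptide sequence of SEQ ID NO: 2, or with the nucleic acid sequence represented by SEQ ID NO: 76, encoding the polypeptide sequence of SEQ ID NO: 77, or with the nucleic acid sequence represented by SEQ ID NO: 128, encoding the polypeptide sequence of SEQ ID NO: 129, or with the nucleic acid sequence represented by SEQ ID NO: 181, encoding the polypeptide sequence of SEQ ID NO: 182, or with the nucleic acid sequence represented by SEQ ID NO: 215, encoding the WDR23-like polypeptide sequence of SEQ ID NO: 216. However, performance of the invention is not restricted to these sequences. The methods of the invention may advantageously be performed using any PATL-encoding nucleic acid or PATL polypeptide, PRP38-encoding nucleic acid or PRP38 polypeptide, GATA-like-encoding nucleic acid or GATA-like polypeptide, ADA2-encoding nucleic acid or ADA2 polypeptide, or WDR23-like polypeptide as defined herein.
Examples of nucleic acids encoding PATL, PRP38, GATA-like, ADA2 or WDR23-like polypeptides are given in Table A of Example 1 herein. Such nucleic acids are useful in performing the methods of the invention. The amino acid sequences given in Table A of Example 1 are example sequences of orthologues and paralogues of the PATL polypeptide represented by SEQ ID NO: 2, or of the PRP38 polypeptide represented by SEQ ID NO: 77, or of the GATA-like polypeptide represented by SEQ ID NO: 129, or of the ADA2 polypeptide represented by SEQ ID NO: 182, or of the WDR23-like polypeptide represented by SEQ ID NO: 216, the terms "orthologues" and "paralogues" being as defined herein. Further orthologues and paralogues may readily be identified by performing a so-called reciprocal BLAST search. Typically, this involves a first BLAST in which a query sequence (for example any of the sequences listed in Table A of Example 1) is submitted for a BLAST search against any sequence database, such as the publicly available NCBI database. BLASTN or TBLASTX (using standard default values) are generally used when starting from a nucleotide sequence, and BLASTP or TBLASTN (using standard default values) when starting from a protein sequence. The BLAST results may optionally be filtered. The full-length sequences of either the filtered or the non-filtered results are then BLASTed back (second BLAST) against sequences from the organism from which the query sequence is derived (where the query sequence is SEQ ID NO: 1 or SEQ ID NO: 2, the second BLAST would therefore be against rice sequences; where the query sequence is SEQ ID NO: 76 or SEQ ID NO: 77, SEQ ID NO: 128 or SEQ ID NO: 129, SEQ ID NO: 181 or SEQ ID NO: 182, or SEQ ID NO: 215 or SEQ ID NO: 216, the second BLAST would therefore be against Arabidopsis sequences). The results of the first and second BLAST searches are then compared. A paralogue is identified if a high-ranking hit from the first BLAST is from the same species as that from which the query sequence is derived, a BLAST back then ideally resulting in the query sequence amongst the highest hits; an orthologue is identified if a high-ranking hit in the first BLAST is not from the same species as that from which the query sequence is derived, and preferably results, upon BLAST back, in the query sequence being among the highest hits.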
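The decision rule of the reciprocal BLAST procedure described above can be sketched independently of any BLAST software, operating on hit lists that are assumed to have been computed already and ranked by E-value. All identifiers, the data layout and the `top_n` parameter are hypothetical choices made for this illustration; the text itself only speaks of "high-ranking" and "highest" hits without fixing a number.

```python
def classify_reciprocal_hits(query_id, query_species, forward_hits,
                             reverse_hits, species_of, top_n=3):
    """Label high-ranking first-BLAST hits as paralogue, orthologue or None.

    forward_hits: subject ids from the first BLAST, best hit first.
    reverse_hits: dict mapping each subject id to the ranked hit list obtained
                  when that subject is BLASTed back against the query species.
    species_of:   dict mapping sequence id to species name.
    top_n:        how many top-ranked hits count as "high-ranking" (assumption).
    """
    calls = {}
    for subject in forward_hits[:top_n]:
        back = reverse_hits.get(subject, [])[:top_n]
        if query_id not in back:
            calls[subject] = None            # BLAST back does not recover the query
        elif species_of[subject] == query_species:
            calls[subject] = "paralogue"     # same species as the query
        else:
            calls[subject] = "orthologue"    # different species
    return calls
```

In practice the hit lists would come from parsing the output of the first and second BLAST runs; that parsing is deliberately left out because it depends on the output format chosen.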
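A PRP38 domain refers to a conserved amino acid sequence found in polypeptides that can function as pre-mRNA processing factors. It is typically about 170 amino acids long, but may be shorter, for example in Ostta_PRP38_1, or longer, as in the Vitvi_PRP38_1 polypeptide (see Table C). A PRP38 domain may be represented by any of the PRP38 domain sequences of Table C, and by any polypeptide domain having, in increasing order of preference, at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 92%, 95%, 98% or more sequence identity to any of the PRP38 domains of Table C. A PRP38 domain may comprise one or more of the more highly conserved sequence motifs represented by SEQ ID NO: 123 and SEQ ID NO: 124.
Alternatively, a PRP38 domain may be defined as any polypeptide domain significantly aligning with a known PRP38 polypeptide, preferably with any of the polypeptides of Table A. A significant alignment between two polypeptides or two domains, as defined herein, is an alignment having, in increasing order of preference, an e-value lower than e-5 (10 to the power of minus 5), 1.e-10, 1.e-15, 1.e-20, 1.e-25, 1.e-50, 1.e-75, 1.e-100, 1.e-200, 1.e-300, 1.e-400, 1.e-500, 1.e-600, 1.e-700 and 1.e-800. Polypeptide sequences may be aligned using any method known in the art, including global and local alignment methods such as the BLAST algorithm, for example the algorithm described in Altschul, SF, et al. (1990) J. Mol. Biol. 215: 403-10. The probability of an alignment occurring with a given sequence is taken as the basis for identifying similar polypeptides. The parameter typically used to represent such a probability is called the e-value (E-value). The e-value is a measure of the reliability of the S score. The S score is a measure of the similarity between the query sequence and the aligned sequence comprising a PRP38 domain. The e-value describes how often a given S score is expected to occur at random. The e-value cutoff may be as high as 1.0.
The PRP38 polypeptides useful in the methods of the invention preferably comprise a PRP38 domain which, when aligned with a known PRP38 domain, more preferably when aligned with any of the PRP38 domains of Table C, has an e-value which is, in increasing order of preference, lower than e-5, 1.e-10, 1.e-15, 1.e-20, 1.e-25, 1.e-50, 1.e-75, 1.e-100, 1.e-200, 1.e-300, 1.e-400, 1.e-500, 1.e-600, 1.e-700 and 1.e-800.
The PRP38 domain present in a polypeptide may be identified by searching specialized databases containing collections of multiple sequence alignments and hidden Markov models covering conserved protein domains and families, such as Pfam, available from the Sanger Institute, United Kingdom. Example 2 herein shows the results of a Pfam search using PRP38 polypeptides as query sequences.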
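A DUF1777 domain is a conserved amino acid sequence found in a small subset of the proteins described so far. It is typically 140 to 150 amino acids long, although shorter versions are also found; see for example the DUF1777 domain of the Chlre_PRP38_1 polypeptide (Table C). DUF1777 domains have been identified in different living organisms on the basis of sequence homology. A compilation of DUF domains can be found in the Pfam database, available from the Sanger Institute (United Kingdom). The presence of a DUF1777 domain in a polypeptide can readily be identified by screening specialized databases containing collections of multiple sequence alignments and hidden Markov models covering conserved protein domains and families, such as Pfam. A polypeptide comprising a DUF1777 domain will register as a hit with a previously known DUF1777 domain in a search of the Pfam database. The amino acid sequence of DUF1777 domains in proteins of eukaryotic origin is often highly variable, with only a few amino acid residues strictly conserved. The e-value of an alignment between two DUF1777 domains is typically high, generally higher than 0.001, more generally higher than 0.01.
The PRP38 polypeptides useful in the methods of the invention preferably comprise a DUF1777 domain which, when aligned with a known DUF1777 domain, more preferably when aligned with any of the DUF1777 domains of Table C, has an e-value which is, in increasing order of preference, lower than 1.e-5, 1.e-10, 1.e-15, 1.e-20, 1.e-25, 1.e-50, 1.e-75, 1.e-100, 1.e-200, 1.e-300, 1.e-400, 1.e-500, 1.e-600, 1.e-700 and 1.e-800.
More preferably, the PRP38 polypeptides useful in the methods of the invention comprise a DUF1777 domain having, in increasing order of preference, at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 92%, 95%, 98% or more sequence identity to any of the DUF1777 domains of Table C.
The DUF1777 domain present in a polypeptide may be identified by searching specialized databases containing collections of multiple sequence alignments and hidden Markov models covering conserved protein domains and families, such as Pfam, available from the Sanger Institute, United Kingdom. Example 2 herein shows the results of a Pfam search using PRP38 polypeptides as query sequences.
A PRP38 polypeptide may comprise an RS domain. An RS domain is a polypeptide region rich in arginine (R) and serine (S) amino acid residues. An RS domain comprises multiple dipeptides, such as the dipeptides represented by the sequences RS (arginine-serine), RE (arginine-glutamic acid) and RD (arginine-aspartic acid). Typically the dipeptides span the region of the PRP38 polypeptide from amino acid position 4 to position 150. The dipeptides may occur in clusters; such clusters consist entirely of any one or more of the RS, RE and RD dipeptides and are generally 4 to 40 amino acids long.
The PRP38 polypeptides useful in the methods of the invention preferably comprise one or more of the RS, RE and RD dipeptides within a stretch of 4, 6, 8, 10, 12, 14, 20, 24, 30, 40, 50, 100 or at most 150 amino acids.
More preferably, the PRP38 polypeptides useful in the methods of the invention comprise two or more dipeptide clusters consisting entirely of RS, RE and/or RD amino acid residues. More preferably still, the PRP38 polypeptide comprises a cluster of four dipeptides, the most preferred dipeptide cluster being represented by RSRSRSRS.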
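The dipeptide clusters described above (runs made up entirely of RS, RE and/or RD dipeptides) can be located with a short regular expression. This is an illustrative sketch: the function name and the default minimum of two dipeptides (4 amino acids, the lower end of the stated 4 to 40 range) are choices made here.

```python
import re


def rs_dipeptide_clusters(protein, min_dipeptides=2):
    """Return (start index, cluster) for runs of consecutive RS/RE/RD dipeptides.

    min_dipeptides=2 corresponds to the shortest cluster length of 4 amino acids.
    """
    pattern = re.compile(r"(?:R[SED]){%d,}" % min_dipeptides)
    return [(m.start(), m.group(0)) for m in pattern.finditer(protein.upper())]
```

For the toy sequence `MARSRSRSRSGGRERDAA` this reports the four-dipeptide cluster `RSRSRSRS` and the two-dipeptide cluster `RERD`; raising `min_dipeptides` to 4 keeps only the former.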
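A PRP38 polypeptide may further comprise any one or more of the following conserved sequence motifs:
(i) SEQ ID NO: 120/Motif Ib: RRPPSVKASLSVSFGQRAPHRASTRDSSPVRRT, or a motif having, in increasing order of preference, at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95% or more sequence identity to Motif Ib; and/or
(ii) SEQ ID NO: 121/Motif IIb: SPYIRA(I/V)GFLYLRY, or a motif having, in increasing order of preference, at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95% or more sequence identity to Motif IIb; and/or
(iii) SEQ ID NO: 122/Motif IIIb: KLKDLYGD, or a motif having, in increasing order of preference, at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95% or more sequence identity to Motif IIIb.
Preferably, the PRP38 polypeptides useful in the methods of the invention comprise any one or more of the following motifs:
(i) Motif Ib (SEQ ID NO: 120), wherein any amino acid residue may be substituted by a conservative amino acid and/or up to 50% of the amino acid residues may be substituted by a non-conservative amino acid;
(ii) Motif IIb (SEQ ID NO: 121), wherein any amino acid residue may be substituted by a conservative amino acid and/or up to 50% of the amino acid residues may be substituted by a non-conservative amino acid;
(iii) Motif IIIb (SEQ ID NO: 122), wherein any amino acid residue may be substituted by a conservative amino acid and/or up to 50% of the amino acid residues may be substituted by a non-conservative amino acid.
The PRP38 polypeptides useful in the methods of the invention are preferably those having, in increasing order of preference, at least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 92%, 93%, 95%, 96%, 98% or more sequence identity to any of the polypeptides given in Table A.
Preferably, the PRP38 polypeptide useful in the methods of the invention is one which, when used in the construction of a phylogenetic tree such as the one depicted in Figure 8, clusters with any of the sequences of group Ib comprising SEQ ID NO: 77.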
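Typically, in a PRP38 polypeptide, the PRP38 domain is found at the N-terminus and the DUF1777 domain at the C-terminus. PRP38 polypeptides generally have an acidic C-terminus and localize to the nucleus.
In addition, PRP38 polypeptides generally have pre-mRNA splicing activity. Tools and techniques for measuring pre-mRNA splicing activity are well known in the art (Blanton S, et al., (1992) Mol Cell Biol 12(9): 3939-47; Stevens SW and Abelson J (1999) Proc Natl Acad Sci USA 96(13): 7226-31; Gottschalk A, et al., (1999) EMBO J 18(16): 4535-48; Pandit S, et al., (2006) Proc Natl Acad Sci USA 103(37): 13700-5).
In addition, PRP38 polypeptides, when expressed in rice according to the methods of the present invention as outlined in Examples 7 and 8, give plants having enhanced yield-related traits, in particular any one or more of aboveground leaf biomass, emergence vigour, total seed weight, seed filling rate, harvest index and total number of seeds.
The present invention is illustrated by transforming plants with the nucleic acid sequence represented by SEQ ID NO: 76, encoding the polypeptide sequence of SEQ ID NO: 77. However, performance of the invention is not restricted to these sequences; the methods of the invention may advantageously be performed using any PRP38-encoding nucleic acid or PRP38 polypeptide as defined herein.
A "GATA-like polypeptide" as defined herein refers to any Zn finger transcription factor comprising a GATA domain (for example as defined in the SMART database under accession number SM00401). Preferably, the "GATA-like polypeptide" useful in the methods of the invention comprises a single GATA domain with 18 or 20 amino acids between the second and third Cys residues of the Zn finger, more preferably with 18 amino acids between the second and third Cys residues of the Zn finger (CX2CX18CX2C). The term "zinc finger" or "Zn finger" is known in the art and refers to a sequence motif in which cysteines and/or histidines coordinate a zinc atom to form a local peptide structure required for a specific function.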
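The Cys spacing pattern given above, CX2CX18CX2C (or the variant with 20 residues in the loop), can be scanned for with a regular expression. This is a minimal sketch with hypothetical names; it only checks cysteine spacing and does not establish that a match is a functional GATA domain. A lookahead is used so that overlapping candidates are all reported.

```python
import re


def gata_zinc_fingers(protein, loop_lengths=(18, 20)):
    """Return sorted (start index, loop length, matched text) for C-X2-C-Xn-C-X2-C."""
    seq = protein.upper()
    hits = []
    for loop in loop_lengths:
        # (?=(...)) finds overlapping occurrences; group 1 holds the finger itself.
        pattern = re.compile(r"(?=(C.{2}C.{%d}C.{2}C))" % loop)
        for m in pattern.finditer(seq):
            hits.append((m.start(), loop, m.group(1)))
    return sorted(hits)
```

A single reported hit with a loop of 18 residues would be consistent with the "more preferred" form of the definition; the count of hits can be used to check the "single GATA domain" condition.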
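More preferably, the GATA-like polypeptide useful in the methods of the invention belongs to subfamily II, as defined by Reyes et al. (Plant Physiol. 134, 1718-1732, 2004). Subfamily II GATA transcription factors are generally encoded by genes with 2 or 3 exons, in which the zinc finger is split between the last two exons. Subfamily II GATA transcription factors also have 18 residues in the Zn finger loop.
The GATA domain of the GATA-like polypeptide useful in the methods of the invention preferably comprises a class B Zn finger, as defined by Reyes et al. (2004). Class B Zn finger domains have 18 residues in the Zn finger loop and are further characterized by the presence of a conserved Ser residue (shown at position 27 of the GATA domain in Figure 11) and a conserved IRX(R/K)K motif.
Preferably, the GATA domain comprises Motif 1c and/or Motif 2c:
Motif 1c (SEQ ID NO: 130):
C(S/A/T)(D/E/N)CXT(T/S/A)(K/S)TP(L/M)WR(S/G/N)GP
wherein X can be any amino acid, preferably one of N, K, G, H, D.
Motif 2c (SEQ ID NO: 131): GPKSLCNACGIRX(R/K)K
wherein X can be any amino acid, preferably one of Q, H, N, S, Y, F.
Preferably, the GATA-like polypeptide useful in the methods of the invention also comprises Motif 3c (SEQ ID NO: 132):
(A/S)(A/W)X(L/C)(L/N)(M/L/V)(T/L/A)(L/D)(S/R)
wherein X can be any amino acid, preferably one of M, L, V, I, R.
Alternatively, the GATA-like protein has, in increasing order of preference, at least 14%, 15%, 20%, 25%, 26%, 27%, 28%, 29%, 30%, 31%, 32%, 33%, 34%, 35%, 36%, 37%, 38%, 39%, 40%, 41%, 42%, 43%, 44%, 45%, 46%, 47%, 48%, 49%, 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 58%, 59%, 60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% overall sequence identity to the amino acid sequence represented by SEQ ID NO: 129, provided that the homologous protein comprises the GATA domain as defined above and one or more of the conserved motifs described above. The overall sequence identity is determined using a global alignment algorithm, such as the Needleman-Wunsch algorithm in the program GAP (GCG Wisconsin Package, Accelrys), preferably with default parameters. Compared to the overall sequence identity, the sequence identity will generally be higher when only conserved domains (such as the GATA domain) or motifs are considered.
Preferably, the polypeptide sequence, when used in the construction of a phylogenetic tree such as the one depicted in Reyes et al. (2004), Figure 11 or Figure 14, clusters with the group of GATA-like polypeptides indicated as "subfamily II" as defined by Reyes et al. (2004), which group comprises the amino acid sequence represented by SEQ ID NO: 129, rather than with any other group.
In addition, GATA-like polypeptides (at least in their native form) generally have DNA-binding activity. The consensus DNA sequence recognized by the GATA Zn finger is [AT]GATA[AG]. Tools and techniques for measuring DNA-binding activity are well known in the art; see for example Teakle et al. (Plant Mol. Biol. 50, 43-57, 2002), or Ghirlando and Trainor (J. Biol. Chem. 278, 45620-45628, 2003).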
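The consensus binding site [AT]GATA[AG] quoted above is already written in character-class notation, so a scan for candidate sites on both strands of a DNA sequence is straightforward. The sketch below is illustrative only: the function names and the reporting convention (start index on the forward strand, strand sign, site read 5' to 3' on its own strand) are choices made here, and a sequence match does not by itself demonstrate binding.

```python
import re

GATA_SITE = re.compile(r"(?=([AT]GATA[AG]))")   # lookahead reports overlapping sites
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Reverse complement of an upper-case A/C/G/T string."""
    return dna.translate(_COMPLEMENT)[::-1]


def find_gata_sites(dna):
    """Return sorted (forward-strand start, strand, site) for both strands."""
    seq = dna.upper()
    n = len(seq)
    sites = [(m.start(), "+", m.group(1)) for m in GATA_SITE.finditer(seq)]
    for m in GATA_SITE.finditer(reverse_complement(seq)):
        # Convert the start on the reverse strand to forward-strand coordinates.
        start_on_forward = n - (m.start() + len(m.group(1)))
        sites.append((start_on_forward, "-", m.group(1)))
    return sorted(sites)
```

For example, `CCAGATAGCC` contains the forward-strand site `AGATAG` at index 2, while `TTTATCTAA` contains a site only on the reverse strand (`AGATAA`, covering forward positions 1 to 6).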
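In addition, GATA-like polypeptides, when expressed in rice according to the methods of the present invention as outlined in Examples 7 and 8, give plants having increased thousand kernel weight.
The present invention is illustrated by transforming plants with the nucleic acid sequence represented by SEQ ID NO: 128, encoding the polypeptide sequence of SEQ ID NO: 129. However, performance of the invention is not restricted to these sequences; the methods of the invention may advantageously be performed using any GATA-like-encoding nucleic acid or GATA-like polypeptide as defined herein.
An "ADA2 polypeptide" as defined herein refers to any transcriptional adaptor polypeptide comprising two or more of the following domains:
(i) a ZZ-type zinc (Zn) finger domain
(ii) a SANT DNA binding domain
(iii) a calcium EF hand domain
(iv) a SWIRM domain
The prototype of the ZZ-type zinc (Zn) finger domain is found in dystrophin and in CBP/p300 proteins. The Cys-x2-Cys motifs in the ZZ domain are reminiscent of the Cys-x2-Cys knuckles found in zinc fingers. Four to six cysteine residues in the domain sequence are responsible for coordinating zinc atoms to reinforce the structure (Ponting et al., Trends Biochem Sci 1996; 21: 11-13). The ZZ-type zinc finger domain in ADA2 polypeptides may be involved in facilitating protein-protein interactions. Fragments of the Arabidopsis ADA2a and ADA2b proteins comprising the ZZ zinc finger domain have been shown to bind the GCN5 protein (Mao et al., 2006).
The SANT DNA binding domain is a subfamily of the Myb DNA binding domain (Aasland et al., 1996 Trends Biochem Sci 1996; 21: 87-88). Polypeptides comprising a SANT DNA binding domain specifically recognize the sequence YAAC(G/T)G present in gene promoters. The ADA2 polypeptides useful in the methods of the invention bind gene promoters comprising the sequence YAAC(G/T)G, where Y can be C or T.
The SWIRM domain (Pfam accession number PF04433) is a small alpha-helical domain of about 85 amino acids found in eukaryotic chromosomal proteins. It is named after the proteins SWI3, RSC8 and MOIRA, in which it was first recognized. The domain is predicted to mediate protein-protein interactions in the assembly of chromatin-protein complexes (Lenkart et al., Proc Natl Acad Sci USA. 2006; 103: 2057-2062).
The calcium (Ca) binding EF hand domain consists of a twelve-residue loop flanked on both sides by a twelve-residue α-helical structure. In the EF-hand loop the calcium ion is coordinated in a pentagonal bipyramidal configuration. The six residues involved in binding are at positions 1, 3, 5, 7, 9 and 12; these residues are denoted X, Y, Z, -Y, -X and -Z. The invariant Glu or Asp at position 12 provides two oxygens for liganding Ca (bidentate ligand) (Finn and Forsen 1995 Structure, 3: 7-11).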
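Preferred ADA2 polypeptides useful in the methods of the invention comprise two or more of the following domains:
(i) a ZZ-type zinc (Zn) finger domain as represented by SEQ ID NO: 207: kpglyccnycdkdlsglvrfkcavcmdfdlcvecfsvgvelnrhkn, or a domain having, in increasing order of preference, at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 82%, 85%, 90%, 92%, 95%, 97% or more sequence identity to the domain represented by SEQ ID NO: 207 or to any ZZ-type zinc finger domain present in any of the polypeptides of Table A;
(ii) a SANT DNA binding domain as represented by SEQ ID NO: 208: vtsdwnadeeillleaiatygfgnwkevadhvgsktttecikhfnsaym, or a domain having, in increasing order of preference, at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 82%, 85%, 90%, 92%, 95%, 97% or more sequence identity to the domain represented by SEQ ID NO: 208 or to any SANT domain present in any of the polypeptides of Table A;
(iii) a Ca binding EF hand domain as represented by SEQ ID NO: 209: dndaeqlladmef, or a domain having, in increasing order of preference, at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 82%, 85%, 90%, 92%, 95%, 97% or more sequence identity to the domain represented by SEQ ID NO: 209 or to any Ca binding EF hand domain present in any of the polypeptides of Table A;
(iv) a SWIRM domain as represented by SEQ ID NO: 210: priysgldtwdvdgllgadllsetekkmcnetrilpvhylkmldiltreikkgqikkksdaysffkvepskvdrvydmlvhkgigdst, or a domain having, in increasing order of preference, at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 82%, 85%, 90%, 92%, 95%, 97% or more sequence identity to the domain represented by SEQ ID NO: 210 or to any SWIRM domain present in any of the polypeptides of Table A.
The ADA2 polypeptides useful in the methods of the invention are preferably those having, in increasing order of preference, at least 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 92%, 93%, 95%, 96%, 98% or more sequence identity to any of the polypeptides of Table A.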
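The ZZ-type zinc (Zn) finger domain, SANT DNA binding domain, calcium EF hand domain and SWIRM domain comprised in a polypeptide may be identified by searching specialized databases containing collections of multiple sequence alignments and hidden Markov models covering conserved protein domains and families, such as Pfam, available from the Sanger Institute, United Kingdom. Alternatively, the above domains may be found by scanning The Integrated Resource of Protein Families, Domains and Sites (InterPro) database to detect a significant sequence alignment with a known ZZ-type zinc (Zn) finger domain, SANT DNA binding domain, calcium EF hand domain or SWIRM domain. The InterPro database is an integrated interface to the commonly used signature databases for text- and sequence-based searches. The InterPro database combines these databases, which use different methodologies and a varying degree of biological information about well-characterized proteins to derive protein signatures. Collaborating databases include SWISS-PROT, PROSITE, TrEMBL, PRINTS, ProDom and Pfam, Smart and TIGRFAM. InterPro is maintained by the European Bioinformatics Institute, United Kingdom. A significant alignment between two polypeptides or two domains, as defined herein, is an alignment having, in increasing order of preference, an e-value lower than e-5 (10 to the power of minus 5), 1.e-10, 1.e-15, 1.e-20, 1.e-25, 1.e-50, 1.e-75, 1.e-100, 1.e-200, 1.e-300, 1.e-400, 1.e-500, 1.e-600, 1.e-700 and 1.e-800. Polypeptide sequences may be aligned using any method known in the art, including global and local alignment methods such as the BLAST algorithm, for example the algorithm described in Altschul, SF, et al. (1990) J. Mol. Biol. 215: 403-10. The probability of an alignment occurring with a given sequence is taken as the basis for identifying similar polypeptides. The parameter typically used to represent such a probability is called the e-value (E-value). The e-value is a measure of the reliability of the S score. The S score is a measure of the similarity of the two sequences being aligned. The e-value describes how often a given S score is expected to occur at random. The e-value cutoff may be as high as 1.0.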
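ADA2 polypeptides useful in the methods of the invention preferably comprise two or more of the following domains:
(i) a ZZ-type zinc (Zn) finger domain which, when aligned with a known ZZ-type Zn finger domain, more preferably when aligned with any of the ZZ-type Zn finger domains of Table C, has an e-value which is, in increasing order of preference, lower than e-5, 1.e-10, 1.e-15, 1.e-20, 1.e-25, 1.e-50, 1.e-75, 1.e-100, 1.e-200, 1.e-300, 1.e-400, 1.e-500, 1.e-600, 1.e-700 and 1.e-800;
(ii) a SANT DNA binding domain which, when aligned with a known SANT DNA binding domain, more preferably when aligned with any of the SANT DNA binding domains of Table C, has an e-value which is, in increasing order of preference, lower than e-5, 1.e-10, 1.e-15, 1.e-20, 1.e-25, 1.e-50, 1.e-75, 1.e-100, 1.e-200, 1.e-300, 1.e-400, 1.e-500, 1.e-600, 1.e-700 and 1.e-800;
(iii) a calcium EF hand domain which, when aligned with a known calcium EF hand domain, more preferably when aligned with any of the calcium EF hand domains of Table C, has an e-value which is, in increasing order of preference, lower than e-5, 1.e-10, 1.e-15, 1.e-20, 1.e-25, 1.e-50, 1.e-75, 1.e-100, 1.e-200, 1.e-300, 1.e-400, 1.e-500, 1.e-600, 1.e-700 and 1.e-800;
(iv) a SWIRM domain which, when aligned with a known SWIRM domain, more preferably when aligned with any of the SWIRM domains of Table C, has an e-value which is, in increasing order of preference, lower than e-5, 1.e-10, 1.e-15, 1.e-20, 1.e-25, 1.e-50, 1.e-75, 1.e-100, 1.e-200, 1.e-300, 1.e-400, 1.e-500, 1.e-600, 1.e-700 and 1.e-800.
Alternatively, an ADA2 polypeptide may be defined as any polypeptide significantly aligning with a known ADA2 polypeptide, preferably with any of the polypeptides of Table A.
Preferably, the ADA2 polypeptide sequence useful in the methods of the invention is one which, when used in the construction of a phylogenetic tree such as the one depicted in Figure 17, clusters with any of the sequences in that tree, particularly preferably with the sequences of group Id comprising SEQ ID NO: 182.
In an ADA2 polypeptide, the ZZ Zn finger domain is typically found at the N-terminus, the SANT DNA binding domain in the middle part and the SWIRM domain at the C-terminus. ADA2 polypeptides generally localize to the nucleus. Plant ADA2 polypeptides may comprise a nuclear localization signal, for example as shown for SEQ ID NO: 182 in Figure 15.
In addition, ADA2 polypeptides typically enhance in vitro the ability of GCN5 to acetylate histones, enabling GCN5 to acetylate nucleosomal histones. Tools and techniques for measuring histone acetylation have been described previously (Stockinger et al., 2001; Mao et al., 2006). The activity of ADA2 polypeptides of plant origin in plant cells may be regulated by acetylation. Lysine residues K257 in ADA2a and K215 in ADA2b of Arabidopsis have been proposed to be acetylated.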
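The present invention provides nucleic acid sequences encoding WDR23-like polypeptides, and WDR23-like polypeptides, whereby increased expression of the isolated nucleic acid sequence in a plant enhances yield-related traits relative to control plants.
Accordingly, one embodiment of the invention provides an isolated nucleic acid sequence encoding a WDR23-like polypeptide, comprising:
(i) an isolated nucleic acid sequence as represented by SEQ ID NO: 219, SEQ ID NO: 225 or SEQ ID NO: 229;
(ii) the complement of an isolated nucleic acid sequence as represented by SEQ ID NO: 219, SEQ ID NO: 225 or SEQ ID NO: 229;
(iii) an isolated nucleic acid sequence encoding a polypeptide sequence as represented by SEQ ID NO: 220, SEQ ID NO: 226 or SEQ ID NO: 230;
(iv) an isolated nucleic acid sequence that can be deduced, as a result of the degeneracy of the genetic code, from a polypeptide sequence as represented by SEQ ID NO: 220, SEQ ID NO: 226 or SEQ ID NO: 230;
(v) an isolated nucleic acid sequence capable of hybridizing under stringent hybridization conditions to a nucleic acid sequence as represented by SEQ ID NO: 219, SEQ ID NO: 225 or SEQ ID NO: 229, or to its complement;
(vi) an isolated nucleic acid sequence encoding a polypeptide having, in increasing order of preference, at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or more amino acid sequence identity to the polypeptide sequence represented by SEQ ID NO: 220, SEQ ID NO: 226 or SEQ ID NO: 230;
(vii) an isolated nucleic acid sequence encoding a polypeptide comprising a domain having, in increasing order of preference, at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or more amino acid sequence identity to the conserved domain represented by SEQ ID NO: 271.
Another embodiment of the invention further provides an isolated WDR23-like polypeptide comprising:
(i) a polypeptide sequence as represented by SEQ ID NO: 220, SEQ ID NO: 226 or SEQ ID NO: 230;
(ii) a polypeptide sequence having, in increasing order of preference, at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% amino acid sequence identity to the polypeptide sequence represented by SEQ ID NO: 220, SEQ ID NO: 226 or SEQ ID NO: 230;
(iii) a polypeptide comprising a domain having, in increasing order of preference, at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or more amino acid sequence identity to the conserved domain represented by SEQ ID NO: 271;
(iv) derivatives of any of the polypeptide sequences given in (i) to (iii) above.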
根据另一实施方案,本发明提供相对于对照植物,在植物中增强产量-相关性状的方法,其包含在植物中增加编码WDR23样多肽的核酸序列的表达。
增加编码WDR23样多肽的核酸序列的表达的优选方法是通过在植物中引入和表达编码WDR23样多肽的核酸序列。
下文中对“用于本发明方法中的多肽”的任何提及意指如本文中所定义的WDR23样多肽。下文中对“用于本发明方法中的核酸序列”的任何提及意指能够编码此类WDR23样多肽的核酸序列。待引入植物的核酸序列(并因此用于实施本发明的方法)是任意编码下述类型的多肽的核酸序列,所述多肽现已描述,在下文中也称为“WDR23样核酸序列”或“WDR23样基因”。
本文定义的“WDR23样多肽”是指包含下述结构域的多肽,所述结构域与SEQ ID NO:271所示的保守结构域(CD)具有至少65%、70%、75%、80%、85%、90%、95%、98%、99%或更高氨基酸序列同一性。
或者或另外,本文定义的“WDR23样多肽”是指任意下述多肽,其包含:(i)至少四个具有PFAM登录号PF00400的WD40重复;和(ii)两个连续WD40重复末端的至少两个保守的DxR基序。
或者或另外,本文定义的“WDR23样多肽”是指任意下述多肽,其与SEQ ID NO:216所示的WDR23样多肽或本文表A中给出的任意多肽序列按照递增的优选顺序具有至少50%、55%、60%、65%、70%、75%、80%、85%、90%、95%、98%、99%或更高氨基酸同一性。
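An analysis of the polypeptide sequence SEQ ID NO:216 is presented in Example 4 below. For example, the WDR23-like polypeptide represented by SEQ ID NO:216 comprises at least four WD40 repeats with PFAM accession number PF00400. Domains may also be identified using routine techniques such as sequence alignment. An alignment of the full-length polypeptides of Table A herein is shown in Figure 23. Such alignments are useful for identifying the most conserved domains among WDR23-like polypeptides, for example the conserved domain (CD) represented by SEQ ID NO:271 (comprised in SEQ ID NO:216).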
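Domains may also be identified using routine techniques such as sequence alignment. Methods for aligning sequences for comparison are well known in the art and include GAP, BESTFIT, BLAST, FASTA and TFASTA. GAP uses the algorithm of Needleman and Wunsch ((1970) J Mol Biol 48:443-453) to find the global alignment (i.e. spanning the complete sequences) of two sequences that maximizes the number of matches and minimizes the number of gaps. The BLAST algorithm (Altschul et al. (1990) J Mol Biol 215:403-10) calculates percent sequence identity and performs a statistical analysis of the similarity between the two sequences. Software for performing BLAST analysis is publicly available through the National Centre for Biotechnology Information (NCBI). Homologues may readily be identified using, for example, the ClustalW multiple sequence alignment algorithm (version 1.83) with the default pairwise alignment parameters and a percentage scoring method. Global percentages of similarity and identity may also be determined using one of the methods available in the MatGAT software package (Campanella et al., BMC Bioinformatics. 10:29. MatGAT: an application that generates similarity/identity matrices using protein or DNA sequences). As the person skilled in the art will appreciate, minor manual editing may be performed to optimise the alignment between conserved motifs. Furthermore, specific domains may be used instead of full-length sequences to identify homologues. Sequence identity values may be determined over the entire nucleic acid or amino acid sequence, or over selected domains or conserved motifs, using the programs mentioned above with default parameters. Example 3 herein describes, in Table B, the percentage identity between the WDR23-like polypeptide represented by SEQ ID NO:216 and the full-length WDR23-like polypeptides listed in Table A, which can be as low as 54% amino acid sequence identity. If the identity calculation is performed between the conserved domain (CD) represented by SEQ ID NO:271 (comprised in SEQ ID NO:216) and the conserved domains of the WDR23-like polypeptides of Table A, the percentage identity can rise to 69%, as shown in Figure 23. The results of such calculations are given in Table B of the present application.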
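To make the idea of a global alignment and a percent identity concrete, here is a minimal sketch of the Needleman-Wunsch algorithm in Python. It is not the GAP program: the unit match/mismatch scores and the linear gap penalty are simplifying assumptions (GAP uses substitution matrices and separate gap opening and extension penalties), and identity is computed over the full alignment length, which is only one of several conventions.

```python
def needleman_wunsch(a: str, b: str, match: int = 1,
                     mismatch: int = -1, gap: int = -1) -> tuple[str, str]:
    """Return one optimal global alignment of a and b as two gapped strings."""
    n, m = len(a), len(b)
    # score[i][j] = best score for aligning a[:i] with b[:j]
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover the alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Identical, non-gap columns as a percentage of alignment length."""
    if not aligned_a:
        return 0.0
    same = sum(x == y and x != "-" for x, y in zip(aligned_a, aligned_b))
    return 100.0 * same / len(aligned_a)
```

Restricting the two input strings to a conserved domain rather than the full-length sequences is what raises the reported identity in the comparison described above.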
蛋白质亚细胞定位预测的工作是很重要的,并已充分研究。了解蛋白质的定位有助于阐明其功能。蛋白质定位实验方法的范围从免疫定位到使用绿色荧光蛋白(GFP)或β-葡糖醛酸酶(GUS)标记蛋白质。这些方法很准确,但与计算机方法相比需要大量劳动。从序列数据对蛋白质定位进行计算机预测近来已取得很大进展。本领域技术人员熟知的算法在Swiss Institute for Bioinformatics提供的ExPASy蛋白组学工具可获得,例如PSort、TargetP、ChloroP、LocTree、Predotar、LipoP、MITOPROT、PATS、PTS1、SignalP、TMHMM等。
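In addition, the WDR23-like polypeptides used in the methods of the invention (at least in their native form) are generally able to interact with other polypeptides through their WD40 repeat motifs. Many assays exist for protein-protein interactions, for example yeast two-hybrid assays, tandem affinity purification (TAP) followed by mass spectrometry, co-affinity purification, and the like.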
高阶位命中是具有低E-值的那些命中。E-值越低,评分越显著(或换句话说,偶然发现该命中的几率越低)。E-值的计算是本领域熟知的。除了E-值外,比较结果也由同一性百分数评分。同一性百分数指两个所比较的核酸(或多肽)序列之间特定长度范围内相同核苷酸(或氨基酸)的数目。在大型家族的情况下,可以使用ClustalW,随后使用邻接树法,以帮助观察相关基因的聚类并鉴定直向同源物和旁系同源物。
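The ranking described here can be sketched in a few lines of Python. The `Hit` record, its field names and the E-value cut-off of 1e-5 are hypothetical choices for illustration, not values taken from the source or from any particular BLAST output format.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hit:
    subject_id: str
    evalue: float          # lower means more significant
    identical: int         # identical positions in the aligned region
    aligned_length: int    # length of the aligned region

    @property
    def percent_identity(self) -> float:
        return 100.0 * self.identical / self.aligned_length


def rank_hits(hits: list[Hit], max_evalue: float = 1e-5) -> list[Hit]:
    """Keep significant hits; sort by E-value, then by higher identity."""
    kept = [h for h in hits if h.evalue <= max_evalue]
    return sorted(kept, key=lambda h: (h.evalue, -h.percent_identity))
```

Sorting on the E-value first and on percent identity second mirrors the text: the E-value decides which hits rank high, and identity is reported as an additional score.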
核酸变体也可以用于实施本发明的方法。此类核酸变体的实例包括编码实施例1的表A中给出的任一氨基酸序列的同源物和衍生物的核酸,术语“同源物”和“衍生物”如文中所定义。也可用于本发明方法中的是编码实施例1的表A中给出的任一氨基酸序列的直向同源物或旁系同源物的同源物和衍生物的核酸。用于本发明方法中的同源物和衍生物与它们衍生自的未修饰的蛋白质具有基本相同的生物和功能活性。
用于实施本发明方法的其他核酸变体包括编码PATL或PRP38或GATA样或ADA2或WDR23样多肽的核酸的部分;与编码PATL或PRP38或GATA样或ADA2或WDR23样多肽的核酸杂交的核酸;编码PATL或PRP38或GATA样或ADA2或WDR23样多肽的核酸的剪接变体;编码PATL或PRP38或GATA样或ADA2或WDR23样多肽的核酸的等位变体;以及通过基因改组获得的编码PATL或PRP38或GATA样或ADA2或WDR23样多肽的核酸的变体。术语杂交序列、剪接变体、等位变体和基因改组如文中所述。
编码PATL或PRP38或GATA样或ADA2或WDR23样多肽的核酸不需要是全长核酸,因为本发明方法的实施不依赖全长核酸序列的使用。本发明提供在植物中增强产量相关性状的方法,其包括在植物中引入和表达实施例1的表A中给出的核酸序列中任一个的部分,或实施例1的表A中给出的氨基酸序列中任一个的直向同源物、旁系同源物或同源物的编码核酸的部分。
核酸的部分可以例如通过对该核酸产生一个或多个缺失而制备。所述的部分可以以分离的形式加以使用或它们可以与其他编码性(或非编码性)序列融合,以便例如产生组合几种活性的蛋白质。当与其他编码序列融合时,翻译时产生的所得多肽可以比对该蛋白质部分所预测的多肽更大。
用于本发明方法中的部分编码文中定义的PATL或PRP38或ADA2多肽,其与实施例1的表A中给出的氨基酸序列具有基本相同的生物学活性。用于本发明方法中的部分包含下述蛋白质结构域,所述结构域与表C中定义的保守结构域中的任一个,按照递增的优选顺序具有50%、55%、60%、70%、75%、80%、85%、90%、92%、93%、95%、96%、98%或更高序列同一性。优选地,此部分是在实施例1的表A中给出的任何一种核酸的部分,或实施例1的表A中给出的氨基酸序列中任一个的直向同源物或旁系同源物的编码核酸的部分。此部分的长度优选为至少50、100、150、200、300、400、500、550、600、650、700、750、800、850、900、950、1000、1200或更多个连续核苷酸,其中所述的连续核苷酸是实施例1的表A中给出的任何一种核酸序列,或是实施例1的表A中给出的氨基酸序列中任一个的直向同源物或旁系同源物的编码核酸。优选地,此部分编码下述氨基酸序列的片段,当该氨基酸序列用于构建诸如图3所示的系统树时,与该树中的任意序列聚簇,尤其优选与包含SEQ ID NO:2的组Ia中的序列聚簇。进一步优选地,此部分是核酸SEQ ID NO:76的部分,最优选地,此部分是SEQ ID NO:82所示的部分。优选地,此部分编码下述氨基酸序列的片段,当其用于构建诸如图8所示的系统树时,与包含SEQ ID NO:77的组Ib中的任意序列聚簇。
对于GATA样多肽,用于本发明方法中的部分编码文中定义的GATA样多肽,其与实施例1的表A中给出的氨基酸序列具有基本相同的生物学活性。优选地,此部分是在实施例1的表A中给出的任何一种核酸的部分,或实施例1的表A中给出的氨基酸序列中任一个的直向同源物或旁系同源物的编码核酸的部分。此部分的长度优选为至少500、550、600、650、700、750、800、850、900、950、1000、1050个连续核苷酸,其中所述的连续核苷酸是实施例1的表A中给出的任何一种核酸序列,或是实施例1的表A中给出的氨基酸序列中任一个的直向同源物或旁系同源物的编码核酸。最优选地,此部分是核酸SEQID NO:128的部分。优选地,此部分编码下述氨基酸序列的片段,当该片段用于构建例如Reyes等,(2004)中图11或图14描述的系统树时,与Reyes等,(2004)定义的GATA样多肽“亚家族II”组聚簇,而不与任何其他组聚簇,该“亚家族II”组包含SEQ ID NO:129所示的氨基酸序列。
用于本发明方法中的部分编码文中定义的WDR23样多肽,其与实施例1的表A中给出的多肽序列具有基本相同的生物学活性。优选地,此部分是在实施例1的表A中给出的任何一种核酸序列的部分,或实施例1的表A中给出的多肽序列中任一个的直向同源物或旁系同源物的编码核酸序列的部分。此部分的长度优选为至少400、450、500、550、600、650、700、750、800、850、900、950、1000、1050、1100、1150、1200、1250、1300、1350、1400、1450、1480个或更多个连续核苷酸,其中所述的连续核苷酸是实施例1的表A中给出的任何一种核酸序列,或是实施例1的表A中给出的多肽序列中任一个的直向同源物或旁系同源物的编码核酸序列。优选地,此部分是编码下述多肽序列的核酸序列的部分,该多肽包含与SEQ ID NO:271所示的保守结构域(CD)具有至少65%、70%、75%、80%、85%、90%、95%、98%、99%或更高氨基酸序列同一性的结构域。最优选地,此部分是核酸序列SEQ IDNO:215的部分。
用于本发明方法中的另一核酸变体是能够在降低的严格条件下、优选在严格条件下与编码如文中所定义的PATL或PRP38或GATA样或ADA2或WDR23样多肽的核酸杂交,或与如文中所定义的部分杂交的核酸。
本发明提供用于在植物中增强产量相关性状的方法,其包括在植物中引入并表达能够与实施例1的表A中给出的任何一种核酸杂交的核酸,或包括在植物中引入并表达如此一种核酸,其能够与实施例1的表A中给出的任意核酸序列的直向同源物、旁系同源物或同源物的编码核酸杂交的核酸。
用于本发明方法中的杂交序列编码文中定义的PATL或PRP38或GATA样或ADA2或WDR23样多肽,其与实施例1的表A中给出的氨基酸序列具有基本相同的生物学活性。优选地,杂交序列能够与实施例1的表A中给出的任一核酸杂交或与任意这些序列的部分杂交,其中所述的部分如上文所定义,或者其中杂交序列能够与实施例1的表A中给出的任一氨基酸序列的直向同源物或旁系同源物的编码核酸杂交。最优选地,杂交序列能够与SEQ ID NO:1所示的核酸或其部分杂交,或与SEQ ID NO:76所示的核酸或其部分杂交,或与SEQ ID NO:128所示的核酸或其部分杂交,或与SEQ ID NO:181所示的核酸或其部分杂交。关于WDR23样序列,杂交序列优选能够与编码下述多肽序列的核酸序列杂交,该多肽包含与SEQ ID NO:271所示的保守结构域(CD)具有至少65%、70%、75%、80%、85%、90%、95%、98%、99%或更高氨基酸序列同一性的结构域。最优选地,杂交序列能够与SEQ IDNO:215所示的核酸序列或其部分杂交。
优选地,此杂交序列编码具有下述氨基酸序列的多肽,当该氨基酸序列为全长并用于构建诸如图3所示的系统树时,与该树中的任意序列聚簇,尤其优选与包含SEQ ID NO:2的组Ia中的序列聚簇,或当其用于构建诸如图8所示的系统树时,与包含SEQ ID NO:77的组Ib中的任意序列聚簇。同样优选地,此杂交序列编码具有下述氨基酸序列的多肽,当该氨基酸序列为全长并用于构建例如Reyes等,(2004)中图11或图14描述的系统树时,与Reyes等,(2004)定义的GATA样多肽“亚家族II”组聚簇,而不与任何其他组聚簇,该“亚家族II”组包含SEQ IDNO:129所示的氨基酸序列。
用于本发明方法中的另一种核酸变体是编码如上文所定义的PATL或PRP38或GATA样或ADA2或WDR23样多肽的剪接变体,剪接变体如文中所定义。
本发明提供用于在植物中增强产量相关性状的方法,其包括在植物中引入并表达实施例1的表A中给出的任何一种核酸序列的剪接变体,或实施例1的表A中给出的任意氨基酸序列的直向同源物、旁系同源物或同源物的编码核酸的剪接变体。
优选的剪接变体是SEQ ID NO:1所示的核酸的剪接变体,或编码SEQ ID NO:2的直向同源物或旁系同源物的核酸的剪接变体。更优选地,剪接变体是SEQ ID NO:1的变体。优选地,当此剪接变体编码的氨基酸序列用于构建诸如图3所示的系统树时,与该树中的任意序列聚簇,尤其优选与包含SEQ ID NO:2的组Ia中的序列聚簇。
其他优选的剪接变体是SEQ ID NO:76所示的核酸的剪接变体,或编码SEQ ID NO:77的直向同源物或旁系同源物的核酸的剪接变体。更优选地,剪接变体是SEQ ID NO:76的变体,剪接变体最优选是SEQ IDNO:76所示的剪接变体。优选地,当此剪接变体编码的氨基酸序列用于构建诸如图8所示的系统树时,与包含SEQ ID NO:77的组Ib中的任意序列聚簇。
更优选的剪接变体是SEQ ID NO:128所示的核酸的剪接变体,或编码SEQ ID NO:129的直向同源物或旁系同源物的核酸的剪接变体。优选地,当此剪接变体编码的氨基酸序列用于构建例如Reyes等,(2004)中图11或图14描述的系统树时,与Reyes等,(2004)定义的GATA样多肽“亚家族II”组聚簇,而不与任何其他组聚簇,该“亚家族II”组包含SEQ ID NO:129所示的氨基酸序列。
进一步优选的剪接变体是SEQ ID NO:181所示的核酸的剪接变体,或编码SEQ ID NO:182的直向同源物或旁系同源物的核酸的剪接变体。更优选地,剪接变体是SEQ ID NO:181的变体。优选地,当此剪接变体编码的氨基酸序列用于构建诸如图17所示的系统树时,与该树中的任意序列聚簇,尤其优选与包含SEQ ID NO:182的组Id中的序列聚簇。
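The human orthologue WDR23 exists as multiple isoforms (NCBI accession number AK057636.1); splice variants of plant nucleic acid sequences encoding WDR23-like polypeptides can therefore also be expected.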
对于WDR23样序列,优选的剪接变体是SEQ ID NO:215所示的核酸序列的剪接变体,或编码SEQ ID NO:216的直向同源物或旁系同源物的核酸序列的剪接变体。优选地,剪接变体是编码下述多肽序列的核酸序列的剪接变体,该多肽包含与SEQ ID NO:271所示的保守结构域(CD)具有至少65%、70%、75%、80%、85%、90%、95%、98%、99%或更高氨基酸同一性的结构域。
用于实施本发明方法的另一种核酸变体是编码如上文所定义的PATL或PRP38或GATA样或ADA2或WDR23样多肽的核酸的等位变体,等位变体如文中所定义。
本发明提供用于在植物中增强产量相关性状的方法,其包括在植物中引入并表达实施例1的表A中给出的任何一种核酸的等位变体,或包括在植物中引入并表达实施例1的表A中给出的任意氨基酸序列的直向同源物、旁系同源物或同源物的编码核酸的等位变体。
用于本发明方法中的等位变体具有与PATL多肽SEQ ID NO:2和实施例1的表A中所示的任意氨基酸基本相同的生物学活性。等位变体天然存在,且包含在本发明方法中的是这些天然等位基因的用途。优选地,等位变体是SEQ ID NO:1的等位变体,或编码SEQ ID NO:2的直向同源物或旁系同源物的核酸的等位变体。优选地,当此等位变体编码的氨基酸序列用于构建诸如图3所示的系统树时,与该树中的任意序列聚簇,尤其优选与包含SEQ ID NO:2的组Ia中的序列聚簇。
用于本发明方法中的其他等位变体具有与PRP38多肽SEQ IDNO:77和实施例1的表A中所示的任意氨基酸基本相同的生物学活性。等位变体天然存在,且包含在本发明方法中的是这些天然等位基因的用途。优选地,等位变体是SEQ ID NO:76的等位变体,或编码SEQ IDNO:77的直向同源物或旁系同源物的核酸的等位变体。优选地,当此等位变体编码的氨基酸序列用于构建诸如图8所示的系统树时,与包含SEQ ID NO:77的组Ib中的任意序列聚簇。
对于GATA样多肽,用于本发明方法中的等位变体具有与GATA样多肽SEQ ID NO:129和实施例1的表A中所示的任意氨基酸基本相同的生物学活性。等位变体天然存在,且包含在本发明方法中的是这些天然等位基因的用途。
优选地,等位变体是SEQ ID NO:128的等位变体,或编码SEQ IDNO:129的直向同源物或旁系同源物的核酸的等位变体。优选地,当此等位变体编码的氨基酸序列用于构建例如Reyes等,(2004)中图11或图14描述的系统树时,与Reyes等,(2004)定义的GATA样多肽“亚家族II”组聚簇,而不与任何其他组聚簇,该“亚家族II”组包含SEQ IDNO:129所示的氨基酸序列。
对于ADA2多肽,用于本发明方法中的等位变体具有与ADA2多肽SEQ ID NO:182和实施例1的表A中所示的任意氨基酸基本相同的生物学活性。等位变体天然存在,且包含在本发明方法中的是这些天然等位基因的用途。优选地,等位变体是SEQ ID NO:181的等位变体,或编码SEQ ID NO:182的直向同源物或旁系同源物的核酸的等位变体。优选地,当此等位变体编码的氨基酸序列用于构建诸如图17所示的系统树时,与该树中的任意序列聚簇,尤其优选与包含SEQ ID NO:182的组Id中的序列聚簇。
对于WDR23样多肽,用于本发明方法中的等位变体具有与WDR23样多肽SEQ ID NO:216和实施例1的表A中所示的任意多肽基本相同的生物学活性。等位变体天然存在,且包含在本发明方法中的是这些天然等位基因的用途。优选地,等位变体是SEQ ID NO:215的等位变体,或编码SEQ ID NO:216直向同源物或旁系同源物的核酸序列的等位变体。优选地,此等位变体是下述多肽序列的等位变体,该多肽包含与SEQ ID NO:271所示的保守结构域(CD)具有至少65%、70%、75%、80%、85%、90%、95%、98%、99%或更高氨基酸序列同一性的结构域。
基因改组或定向进化也可以用来产生编码如上文定义的PATL或PRP38或GATA样或ADA2或WDR23样多肽的核酸的变体;术语“基因改组”如文中所定义。
本发明提供用于在植物中增强产量相关性状的方法,其包括在植物中引入并表达实施例1的表A中给出的任何一种核酸序列的变体,或包括在植物中引入并表达如此核酸的变体,其中所述的核酸编码实施例1的表A中给出的任意氨基酸序列的直向同源物、旁系同源物或同源物,所述变体核酸通过基因改组获得。
优选地,通过基因改组获得的变体核酸编码的下述氨基酸序列,当其用于构建诸如图3所示的系统树时,与该树中的任意序列聚簇,尤其优选与包含SEQ ID NO:2或SEQ ID NO:77的组Ia中的序列聚簇。同样优选地,通过基因改组获得的变体核酸编码的下述氨基酸序列,当其用于构建例如Reyes等,(2004)中图11或图14描述的系统树时,与Reyes等,(2004)定义的GATA样多肽“亚家族II”组聚簇,而不与任何其他组聚簇,该“亚家族II”组包含SEQ ID NO:129所示的氨基酸序列。对于ADA2,优选通过基因改组获得的变体核酸编码的下述氨基酸序列,当其用于构建诸如图17所示的系统树时,与该树中的任意序列聚簇,尤其优选与包含SEQ ID NO:182的组Id中的序列聚簇。对于WDR23,通过基因改组获得的变体核酸序列编码的下述多肽序列,其包含与SEQ ID NO:271所示的保守结构域(CD)具有至少65%、70%、75%、80%、85%、90%、95%、98%、99%或更高氨基酸序列同一性的结构域。
另外,核酸变体也可以通过位点定向诱变获得。几种方法可用于实现位点定向诱变,最常见的是基于PCR的方法(Current Protocols inMolecular Biology.Wiley编)。
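Nucleic acids encoding PATL, PRP38, GATA-like or ADA2 polypeptides may be derived from any natural or artificial source. The nucleic acid may be modified from its native form in composition and/or genomic environment through deliberate human manipulation. The nucleic acid encoding a PATL or GATA-like polypeptide is preferably from a plant, further preferably from a monocotyledonous plant, more preferably from the family Poaceae, and most preferably from Oryza sativa. The nucleic acid encoding a PRP38 polypeptide is preferably from a plant, further preferably from a dicotyledonous plant, more preferably from the family Brassicaceae, and most preferably from Arabidopsis thaliana. The nucleic acid encoding an ADA2 polypeptide is preferably from a plant, further preferably from a dicotyledonous plant, more preferably from the family Brassicaceae, and most preferably from Arabidopsis thaliana. Nucleic acid sequences encoding WDR23-like polypeptides may be derived from any natural or artificial source. The nucleic acid sequence may be modified from its native form in composition and/or genomic environment through deliberate human manipulation. The nucleic acid sequence encoding a WDR23-like polypeptide is from the domain Eukaryota, preferably from the plant kingdom, further preferably from a dicotyledonous plant. More preferably, the nucleic acid sequence encoding a WDR23-like polypeptide is from the family Brassicaceae, and most preferably from Arabidopsis thaliana.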
Advantageously, the present invention provides hitherto unknown PATL nucleic acid and polypeptide sequences.
本发明的另一实施方案提供分离的核酸分子,其包含:
(i)SEQ ID NO:9;SEQ ID NO:11;SEQ ID NO:13;SEQ IDNO:15;SEQ ID NO:17;SEQ ID NO:19;SEQ ID NO:21;SEQID NO:23;SEQ ID NO:25和SEQ ID NO:27所示的核酸;
(ii)与(i)中给出的任一SEQ ID NO互补的核酸或其片段;
(iii)编码下述PATL多肽的核酸,所述多肽与SEQ ID NO:10;SEQ ID NO:12;SEQ ID NO:14;SEQ ID NO:16;SEQ ID NO:18;SEQ ID NO:20;SEQ ID NO:22;SEQ ID NO:24;SEQ ID NO:26和SEQ ID NO:28给出的任一氨基酸序列按照递增的优选顺序具有至少70%、75%、80%、85%、90%、95%、96%、97%、98%、99%或更高序列同一性;
(iv)能够在严格杂交条件下与上述(i)、(ii)或(iii)给出的任一核酸杂交的核酸。
本发明的另一实施方案提供分离的多肽,其包含:
(i)与SEQ ID NO:10;SEQ ID NO:12;SEQ ID NO:14;SEQID NO:16;SEQ ID NO:18;SEQ ID NO:20;SEQ ID NO:22;SEQID NO:24;SEQ ID NO:26和SEQ ID NO:28给出的任一氨基酸序列按照递增的优选顺序具有至少80%、85%、90%、95%、96%、97%、98%、99%或100%序列同一性的氨基酸序列;
(ii)(i)中给出的任意氨基酸序列的衍生物。
本发明方法的实施产生了具有增强的产量相关性状的植物。特别地,本发明方法的实施产生这样的植物,其相对于对照植物具有提高的产量、尤其提高的种子产量。术语“产量”和“种子产量”在本文的“定义”部分中更详细地描述。
本文中对增强的产量相关性状的谈及意指植物的一个或多个部分的生物量(重量)增加,所述的部分可以包括地上(可收获)部分和/或地下(可收获)部分。特别地,此类可收获部分是种子,并且本发明方法的实施产生了相对于对照植物的种子产量而言,具有提高的种子产量的植物。
以谷物为例,产量提高可以表现为以下一个或多个指标:每平方米已建立的植物数目增加、每株植物花序数的提高、行数、每行核粒数、核粒重、千粒重、花序长度/直径的提高、种子饱满率(即饱满种子数除以种子总数并乘以100)提高,及其他。以稻为例,产量提高可以自身表现为下列一种或多种指标的提高:每平方米植物数、每株植物的花序数、每个花序的小穗数、每个花序的花(小花)数目(其表述为饱满种子数对原发花序数的比)、种子饱满率(即饱满种子数除以种子总数并乘以100)提高、千粒重提高及其他。
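The seed fill rate defined above (filled seeds divided by total seeds, multiplied by 100) is easy to state in code. The sketch below also includes a thousand kernel weight helper; its formula, extrapolating from the weight and count of filled seeds, is a common convention and an assumption here, since this passage does not define it. The function names are hypothetical.

```python
def seed_fill_rate(filled_seeds: int, total_seeds: int) -> float:
    """Percentage of seeds that are filled: filled / total * 100."""
    if total_seeds <= 0:
        raise ValueError("total_seeds must be positive")
    return 100.0 * filled_seeds / total_seeds


def thousand_kernel_weight(filled_seed_weight_g: float, filled_seeds: int) -> float:
    """Weight of 1000 seeds in grams, extrapolated from the counted seeds."""
    if filled_seeds <= 0:
        raise ValueError("filled_seeds must be positive")
    return 1000.0 * filled_seed_weight_g / filled_seeds
```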
本发明提供相对于对照植物增加产量尤其是植物种子产量的方法,所述方法包括调节编码如本文定义的PATL或PRP38或GATA样或ADA2或WDR23样多肽的核酸在植物中的表达。
由于本发明的转基因植物具有提高的产量,故这些植物有可能在其生活周期的相应阶段(在其生活周期的至少部分期间)相对于对照植物的生长速率表现出提高的生长速率。
提高的生长速率可以是植物的一个或多个部分(包括种子)特有的,或可以基本上遍及整株植物。具有提高的生长速率的植物可以具备较短的生活周期。植物的生活周期可以意指从干燥成熟种子生长直至植物已经产生与起始材料相似的干燥成熟种子的阶段所需要的时间。这个生活周期可以受诸因素如早期生长势、生长速率、绿度指数、开花时间和种子成熟速度影响。生长速率的提高可以在植物生活周期的一个或多个阶段上或基本上在植物整个生活周期期间发生。在植物生活周期中早期期间提高的生长速率可以反映增强的生长势。生长速率的提高可以改变植物的收获周期,从而允许植物更晚地播种和/或更早地收获,而这本来是不可能的(可以随更早的开花时间获得相似效果)。如果大幅提高生长速率,则可以允许进一步播种相同植物物种的种子(例如播种并收获稻植物,随后播种并收获其他稻植物,全部稻植物均处于一个常规生长时期中)。类似地,如果大幅提高生长速率,则可以允许进一步播种不同植物物种的种子(例如播种并收获谷物植物,随后例如播种并任选收获大豆、马铃薯或任何其他合适的植物)。在一些作物植物的例子中,来自相同根状茎的额外收获次数也可以是可能的。改变植物的收获周期可以导致每平方米一年生物量生产提高(原因在于可以培育并收获任何具体植物的次数(即在一年中)提高)。生长速率的提高也可以允许将转基因植物在比其野生型对应物更广泛的地理区域内培育,因为培育作物的地域限制往往由栽种时期(早季)或在收获时期(晚季)的不利环境条件决定。如果缩短收获周期,则可以避开这类不利条件。生长速率可以通过从生长曲线衍生多个参数而确定,此类参数可以是:T-Mid(植物达到50%最大植物大小所花费的时间)和T-90(植物达到90%最大植物大小所花费的时间),以及其他。
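The growth-curve parameters T-Mid and T-90 can be derived from a series of size measurements as sketched below. This is a minimal illustration assuming time-ordered measurements and linear interpolation between time points; the source does not specify how the curve is fitted, and the function name is hypothetical.

```python
def time_to_fraction(times: list[float], sizes: list[float], fraction: float) -> float:
    """Time at which size first reaches `fraction` of its maximum.

    Interpolates linearly between the two measurements that bracket the target.
    """
    target = fraction * max(sizes)
    for k, (t, s) in enumerate(zip(times, sizes)):
        if s >= target:
            if k == 0 or s == target:
                return float(t)
            t0, s0 = times[k - 1], sizes[k - 1]
            # s0 < target <= s here, so the denominator is positive.
            return t0 + (target - s0) * (t - t0) / (s - s0)
    raise ValueError("target size is never reached")


# Hypothetical measurements: days after sowing and plant size (arbitrary units).
days = [0, 10, 20, 30]
size = [0, 40, 80, 100]
t_mid = time_to_fraction(days, size, 0.5)   # time to 50% of maximum size
t_90 = time_to_fraction(days, size, 0.9)    # time to 90% of maximum size
```

A plant line with a higher growth rate would show smaller T-Mid and T-90 values than its control under the same conditions.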
根据本发明的一个优选特征,本发明方法的实施产生了相对于对照植物具有提高的生长速率的植物。因此,根据本发明,提供了用于提高植物生长速率的方法,所述方法包括在植物中调节编码如本文中定义的PATL或PRP38或GATA样或ADA2或WDR23样多肽的核酸表达。
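An increase in yield and/or growth rate occurs under typical agricultural growth conditions, which include the everyday stresses to which plants are exposed. Plants typically respond to exposure to stress by growing more slowly. Under severe stress conditions, the plant may even stop growing altogether. Mild or everyday stress, on the other hand, is any stress to which a plant is exposed that does not cause the plant to cease growing altogether without the capacity to resume growth. Mild stress in the sense of the invention leads to a reduction in the growth of the stressed plants of less than 40%, 35% or 30%, preferably less than 25%, 20% or 15%, more preferably less than 14%, 13%, 12%, 11% or 10% or less, in comparison with control plants under non-stress conditions. Owing to advances in agricultural practice (irrigation, fertilization, pesticide treatment), severe stresses are not often encountered in cultivated crop plants. Plants grown under typical agricultural conditions may frequently encounter mild stress. Consequently, the compromised growth induced by mild stress is often an undesirable feature for agriculture. Mild stresses are the common biotic and/or abiotic (environmental) stresses to which a plant is exposed. Abiotic stresses may be due to drought or excess water, anaerobic stress, salt stress, chemical toxicity, oxidative stress and hot, cold or freezing temperatures. The abiotic stress may be an osmotic stress caused by water stress (particularly due to drought), salt stress, oxidative stress or ionic stress.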
对于GATA样,与对照植物相比,无论植物处于非胁迫条件下还是植物暴露于多种胁迫下,都发生产量和/或生长速率的增加。对于WDR23,与相当条件下生长的对照植物相比,无论植物处于非胁迫条件下还是植物暴露于多种胁迫下,都发生产量相关性状的增强。植物一般通过生长得更慢而对暴露于胁迫作出应答。在严重胁迫条件下,植物甚至可以完全停止生长。另一方面,轻微胁迫在本文中定义为植物对其暴露的任何胁迫,其中所述的胁迫未导致植物完全停止生长而没有恢复生长的能力。与非胁迫条件下的对照植物相比,轻微胁迫在本发明意义中导致受胁迫植物生长降低小于40%、35%或30%,优选小于25%、20%或15%,更优选小于14%、13%、12%、11%或10%或更低。由于农业实践(灌溉、施肥、杀虫剂处理)上的进步,在栽培作物植物中并不经常遇到严重胁迫。因此,由轻微胁迫诱导的受损生长往往是农业上不希望的特征。轻微胁迫是植物暴露的常见生物性和/或非生物性(环境)胁迫。非生物胁迫可以因干旱或水涝、厌氧胁迫、盐胁迫、化学毒性、氧化胁迫和热、寒冷或冰冻温度所致。生物胁迫一般是由病原体如细菌、病毒、真菌、线虫和昆虫引起的那些胁迫。如本文中所用的术语”非胁迫”条件是允许植物最佳生长的环境条件。本领域技术人员清楚对于给定地点的正常土壤条件和气候条件。
特别地,本发明的方法可以在非胁迫条件下或在轻度干旱条件开展以产生相对于对照植物具有提高的产量的植物。如Wang等(Planta(2003)218:1-14)中报道,非生物性胁迫导致一系列不利地影响植物生长及生产力的形态学、生理学、生物化学和分子变化。干旱、盐度、极端温度和氧化胁迫已知相互联系并可以通过相似的机制诱导生长损害及细胞损害。Rabbani等(Plant Physiol(2003)133:1755-1767)描述了干旱胁迫与高盐度胁迫之间极高程度的“交互作用”。例如,干旱和/或盐化作用主要表现为渗透胁迫,从而导致细胞内稳态和离子分布的破坏。经常伴随高温或低温、盐度或干旱胁迫的氧化胁迫可以引起功能蛋白和结构蛋白变性。因此,这些多样的环境胁迫常常激活相似的细胞信号传导途径和细胞应答,如产生胁迫蛋白、上调抗氧化物质、积累兼容性溶质和生长停滞。术语“非胁迫”条件是允许植物最佳生长的那些环境条件。本领域技术人员清楚对于给定地点的正常土壤条件和气候条件。如本文中所用的术语非胁迫条件包括植物所暴露的偶尔的或平常的轻度胁迫,如本文定义的,但不包括重度胁迫。
相对于可比较条件下培育的对照植物,本发明方法的实施赋予在非胁迫条件下或在轻度干旱条件下培育的植物提高的产量。因此,根据本发明,提供了用于在非胁迫条件下或在轻度干旱条件下培育的植物中提高产量的方法,所述方法包括增加植物中编码PATL或PRP38或GATA样或ADA2或WDR23样多肽的核酸表达。
相对于可比较条件下培育的对照植物,本发明方法的实施赋予在养分缺乏条件下、尤其在缺氮条件下培育的植物提高的产量。因此,根据本发明,提供了用于在养分缺乏条件下培育的植物中提高产量的方法,所述方法包括调节植物中编码PATL或PRP38或GATA样或ADA2或WDR23样多肽的核酸表达。养分缺乏可以因养分如氮、磷酸盐和其他含磷化合物、钾、钙、镉、镁、锰、铁和硼及其他元素缺少所致。就WDR23而言,优选降低的养分可获得性是降低的氮可获得性。
本发明包括通过本发明方法可获得的植物或其部分(包括种子)。所述植物或其部分包含编码如上文定义的PATL或PRP38或GATA样或ADA2或WDR23样多肽的核酸转基因。
本发明也提供了基因构建体和载体以促进在植物中导入和/或表达编码PATL或PRP38或GATA样或ADA2或WDR23样多肽的核酸。所述基因构建体可以插入适于转化至植物并适于在转化细胞中表达目的基因的载体,所述载体可以是市售的。本发明也提供了如本文中定义的基因构建体在本发明方法中的用途。
更具体地,本发明提供了构建体,其包含:
(a)编码如上定义的PATL或PRP38或GATA样或ADA2或WDR23样多肽的核酸;
(b)能够驱动(a)的核酸序列表达的一个或多个调控序列;和任选地
(c)转录终止序列。
优选地,编码PATL或PRP38或GATA样或ADA2或WDR23样多肽的核酸如上文定义。术语“调控序列”和“终止序列”如本文中定义。
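For WDR23, preferably one of the control sequences of the construct is a constitutive promoter isolated from a plant genome. An example of a plant constitutive promoter is a GOS2 promoter, more preferably a rice GOS2 promoter, most preferably the GOS2 promoter represented by SEQ ID NO:272. Alternatively, one of the control sequences of the construct is a meristem-specific promoter isolated from a plant genome. An example of a plant meristem-specific promoter is a metallothionein (MT) promoter, more preferably a rice MT promoter, most preferably the MT promoter represented by SEQ ID NO:273.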
植物用包含上述任意核酸的载体转化。技术人员非常了解必须存在于所述载体上以便成功转化、选择和增殖含有目的序列的宿主细胞的遗传元件。此目的序列有效地与一个或多个调控序列(至少与启动子)连接。
有利地,任意类型的启动子,无论是天然的或合成的,可以用来提高所述核酸序列的表达。组成型启动子是在所述方法中特别有用的。优选地,所述组成型启动子也是遍在启动子。对于GATA样多肽,启动子具有中等强度。对于多种启动子类型的定义,见本文中的“定义”部分。
应当明白本发明的应用不限于SEQ ID NO:1所示的编码PATL多肽的核酸,本发明的应用也不限于编码PATL多肽的核酸在受组成型启动子驱动时或在受绿色-组织启动子驱动时的表达。
更应当明白本发明的应用不限于SEQ ID NO:76所示的编码PRP38多肽的核酸,本发明的应用也不限于编码PRP38多肽的核酸在受组成型启动子驱动时或在根特异性启动子驱动时的表达。
还应当明白本发明的应用不限于SEQ ID NO:128所示的编码GATA样多肽的核酸,本发明的应用也不限于编码GATA样多肽的核酸在受组成型启动子驱动时的表达。
还应当明白本发明的应用不限于SEQ ID NO:181所示的编码ADA2多肽的核酸,本发明的应用也不限于编码ADA2多肽的核酸在受组成型启动子驱动时或在受绿色-组织启动子驱动时的表达。
还应当明白本发明的应用不限于SEQ ID NO:215所示的编码WDR23样多肽的核酸序列,本发明的应用也不限于编码WDR23样多肽的核酸序列在受组成型或分生组织特异性启动子驱动时的表达。
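The constitutive promoter is preferably an HMG (high mobility group) promoter, also called the HMGP promoter, preferably an HMG promoter from rice. Further preferably, the constitutive promoter is represented by a nucleic acid sequence substantially similar to SEQ ID NO:33; most preferably the constitutive promoter is as represented by SEQ ID NO:33. The green tissue-specific promoter is preferably EXP9 (expansin), also called the EXP promoter, preferably an EXP promoter from rice. Further preferably, the green tissue-specific promoter is represented by a nucleic acid sequence substantially similar to SEQ ID NO:34; most preferably the green tissue-specific promoter is as represented by SEQ ID NO:34. For a description of the HMG and EXP promoters and further examples of constitutive and green tissue-specific promoters, see Table 2 in the "Definitions" section herein.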
对于PRP38,组成型启动子优选地是GOS2启动子,优选来自稻的GOS2启动子。进一步优选地,该组成型启动子由基本上与SEQ IDNO:127相似的核酸序列代表,最优选该组成型启动子SEQ ID NO:127所代表。对于组成型启动子的其他例子,见本文“定义”部分中的表ii。
对于GATA样多肽,组成型启动子优选地是GOS2启动子,优选地是来自稻的GOS2启动子。进一步优选地,该组成型启动子由基本上与SEQ ID NO:135相似的核酸序列代表,最优选地该组成型启动子SEQ ID NO:135所代表。任选地,一个或多个终止子序列可用于被引入植物中的构建体。优选地,该构建体包括与SEQ ID NO 136基本相似或相同的表达盒,包括GOS2启动子和编码GATA样多肽SEQ ID NO129的核酸。
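For ADA2 polypeptides, the constitutive promoter is preferably an HMG (high mobility group) promoter, also called the HMGP promoter, preferably an HMG promoter from rice. Further preferably, the constitutive promoter is represented by a nucleic acid sequence substantially similar to SEQ ID NO:213; most preferably the constitutive promoter is as represented by SEQ ID NO:213. The green tissue-specific promoter is preferably EXP9 (expansin), also called the EXP promoter, preferably an EXP promoter from rice. Further preferably, the green tissue-specific promoter is represented by a nucleic acid sequence substantially similar to SEQ ID NO:214; most preferably the green tissue-specific promoter is as represented by SEQ ID NO:214. For a description of the HMG and EXP promoters and further examples of constitutive and green tissue-specific promoters, see Table iii in the "Definitions" section herein.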
其他调节元件可以包括转录增强子以及翻译增强子。本领域技术人员将知道可能适用于实施本发明的终止子和增强子序列。如定义部分中描述,内含子序列也可以添加至5’非翻译区(UTR)或编码序列中,以提高细胞质中积累的成熟信使的量。(除启动子、增强子、沉默子、内含子序列、3’UTR和/或5’UTR区域之外的)其他调控序列可以是蛋白质/或RNA稳定化元件。此类序列将是已知的或可以由本领域技术人员轻易地获得。
本发明的基因构建体可以还包括对于特定细胞类型中维持和/或复制所需要的复制起点序列。一个例子是当需要基因构建体作为游离型遗传元件(例如质粒或粘粒分子)在细菌细胞中维持时的复制起点。优选的复制起点包括但不限于f1-ori和colE1。
为检测如用于本发明方法中的核酸序列的成功转移和/或选择包含这些核酸的转基因植物,使用标记基因(或报道基因)是有利的。因此,所述基因构建体可以任选地包含选择性标记基因。选择性标记在本文的“定义”部分中更详细地描述。一旦不再需要所述标记基因时,可以从转基因细胞中移除或切除它们。用于标记移除的技术是本领域已知的,有用的技术在上文定义部分中描述。
本发明也提供了用于产生转基因植物的方法,所述转基因植物相对于对照植物具有增强的产量相关性状,其中所述方法包括在植物中引入并表达编码如上文所定义的PATL或PRP38或GATA样或ADA2或WDR23样多肽的任意核酸。
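More specifically, the invention provides a method for producing transgenic plants having enhanced yield-related traits, in particular increased seed yield, which method comprises:
(i) introducing and expressing in a plant or plant cell a nucleic acid encoding a PATL, PRP38 or ADA2 polypeptide; and
(ii) cultivating the plant cell under conditions promoting plant growth and development; and optionally
(iii) selecting for plants having enhanced yield-related traits.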
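For GATA-like polypeptides, the invention provides a method for producing transgenic plants having enhanced yield-related traits, in particular increased TKW, which method comprises:
(i) introducing and expressing in a plant or plant cell a nucleic acid encoding a GATA-like or WDR23-like polypeptide; and
(ii) cultivating the plant cell under conditions promoting plant growth and development.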
The nucleic acid of (i) may be any nucleic acid capable of encoding a PATL, PRP38, GATA-like, ADA2 or WDR23-like polypeptide as defined herein.
该核酸可以直接地导入植物细胞或导入植物自身(包括导入组织、器官或植物的任何其他部分)。根据本发明的优选特征,该核酸优选地通过转化作用导入植物。术语“转化”在本文的“定义”部分中更详细地描述。
通常在转化后,对植物细胞或细胞群体选择一个或多个标记的存在性,其中所述标记由随同目的基因一起共转移的植物可表达基因编码,随后将转化材料再生成完整植物。为了选择转化的植物,一般使转化中获得的植物材料经历选择条件,从而转化植物可以与非转化植物区分开。例如,以上述方式获得的种子可以播种,并且在初始培育时间后,通过喷洒接受合适的选择。另一种可能性在于种子根据需要消毒后,在使用合适选择剂的琼脂板上培育,从而仅转化的种子可以长成植物。备选地,筛选所述转化植物的选择性标记(如上文所述的选择性标记)的存在性。
在DNA转移和再生后,推定转化的植物也可以例如使用DNA印迹分析对目的基因的存在性、拷贝数和/或基因组构造进行评价。备选地或额外地,新导入的DNA的表达水平可以使用RNA印迹分析和/或蛋白质印迹分析监测,这两项技术均是本领域普通技术人员熟知的。
产生的转化植物可以通过多种方法繁殖,如通过克隆繁殖法或经典育种技术。例如,第一世代(或T1)转化植物可以进行自交并且选择纯合的第二世代(或T2)转化体,并且T2植物随后可以通过经典育种技术进一步繁殖。产生的转化生物可以采取多种形式。例如,它们可以是转化细胞和非转化细胞的嵌合体;克隆性转化体(例如,被转化以含有表达盒的全部细胞);转化组织和非转化组织的移植体(例如在植物中,与未转化接穗嫁接的转化根状茎)。
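The selection of homozygous T2 transformants after selfing can be illustrated with the expected genotype fractions. The sketch assumes a T1 plant hemizygous for a single transgene insertion that segregates in a simple Mendelian fashion; real transformants may carry several insertions, in which case these ratios do not apply. The function name and the genotype labels are hypothetical.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def selfing_offspring(parent: tuple[str, str] = ("T", "-")) -> dict[str, Fraction]:
    """Expected genotype fractions among the progeny of a selfed parent.

    "T" marks the transgene allele and "-" the empty (wild-type) locus.
    """
    combos = Counter("".join(sorted(pair)) for pair in product(parent, repeat=2))
    total = sum(combos.values())
    return {genotype: Fraction(n, total) for genotype, n in combos.items()}


t2 = selfing_offspring()
homozygous_fraction = t2["TT"]   # plants carrying the transgene on both homologues
```

Under these assumptions about a quarter of the T2 progeny are homozygous for the transgene, half are hemizygous and a quarter are nullizygous, which is why a selection step is needed.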
本发明明确地扩展至由本文中所述的任意方法产生的任意植物细胞或植物,并扩展至全部植物部分及其繁殖体。本发明进一步扩展以包括已经由前述任意方法产生的原代转化或转染细胞、组织、器官或完整植物的子代,唯一要求是子代表现与如本发明方法中的亲本相同的基因型和/或表型特征。
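The invention also encompasses host cells comprising an isolated nucleic acid encoding a PATL, PRP38, GATA-like, ADA2 or WDR23-like polypeptide as defined above. Preferred host cells according to the invention are plant cells. Host plants for the nucleic acids, vectors, expression cassettes or constructs used in the methods of the invention are, in principle, advantageously all plants capable of synthesizing the polypeptides used in the methods of the invention.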
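The methods of the invention are advantageously applicable to any plant. Plants that are particularly useful in the methods of the invention include all plants belonging to the superfamily Viridiplantae, in particular monocotyledonous and dicotyledonous plants, including fodder or forage legumes, ornamental plants, food crops, trees or shrubs. According to a preferred embodiment of the invention, the plant is a crop plant. Examples of crop plants include soybean, sunflower, canola, alfalfa, rapeseed, cotton, tomato, potato and tobacco. Further preferably, the plant is a monocotyledonous plant. Examples of monocotyledonous plants include sugarcane. More preferably, the plant is a cereal. Examples of cereals include rice, maize, wheat, barley, millet, rye, triticale, sorghum and oats.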
本发明也扩展至植物的可收获部分,例如但不限于种子、叶、果实、花、茎、根、根状茎、块茎和球茎,或也扩展至编码与植物组成型启动子有效链接的WDR23样(如上文定义)的分离核酸序列。本发明进一步涉及衍生自、优选直接衍生自此种植物的可收获部分的产品,如干燥颗粒或粉末、油、脂肪和脂肪酸、淀粉或蛋白质。
根据本发明的优选特征,受调节的表达是提高的表达。用于增加核酸或基因或基因产物表达的方法是本领域中充分报道的并且在定义部分中提供了例子。
如上所述,用于调节编码PATL或PRP38或GATA样或ADA2或WDR23样多肽的核酸的优选方法是在植物中引入并表达编码PATL或PRP38或GATA样或ADA2或WDR23样多肽的核酸;然而,也可以使用包括但不限于T-DNA活化标签法、TILLING、同源重组在内的其他熟知技术实现实施本方法的效果,即增强产量相关性状。在定义部分中提供了这些技术的描述。
本发明也包括编码如文中所述PATL或PRP38或GATA样或ADA2或WDR23样多肽的核酸的用途,和这些PATL或PRP38或GATA样或ADA2或WDR23样多肽在正常生长条件下、在非生物胁迫生长条件(优选地在渗透胁迫生长条件)下和在养分可利用性降低的条件下、优选在氮可利用性降低的生长条件下增强植物中任一前述产量相关性状的用途。
编码本文中所述PATL或PRP38或GATA样或ADA2或WDR23样多肽的核酸,或PATL或PRP38或GATA样或ADA2或WDR23样多肽自身可以用于其中鉴定到DNA标记的育种程序中,其中所述的DNA标记可以遗传地与PATL或PRP38或GATA样或ADA2或WDR23样编码多肽的基因连锁。所述核酸/基因或PATL或PRP38或GATA样或ADA2或WDR23样多肽自身可以用来定义分子标记。这种DNA或蛋白质标记随后可以在育种程序中用来在本发明的方法中选择具有增强的如上文所定义产量相关性状的植物。
编码PATL或PRP38或GATA样或ADA2或WDR23样多肽的核酸/基因的等位变体也可以用于标记辅助的育种程序中。此类育种程序有时需要使用例如EMS诱变法通过对植物进行诱变处理而导入等位变异;备选地,所述程序可以从收集并非故意造成的所谓“自然”源性等位变体开始。随后进行等位变体的鉴定,例如通过PCR法。此后是步骤:选择所讨论的和导致产量提高的序列的优异等位变体。一般通过监测含有所讨论序列的不同等位变体的植物的生长性能实施选择。可以在温室中或在田间监测生长性能。其他任选步骤包括将其中鉴定到优异等位变体的植物与另一种植物杂交。这可能用来例如产生感兴趣的表型特征的组合。
编码PATL或PRP38或GATA样或ADA2或WDR23样多肽的核酸也可以作为探针用于遗传地或物理地绘制所述探针构成其一部分的基因并且用作与这些基因连锁的性状的标记。此类信息可以用于植物育种中,以开发具有所希望表型的品系。编码PATL或PRP38或GATA样或ADA2或WDR23样多肽的核酸的这种用途仅需要具有至少15个核苷酸长度的核酸序列。编码PATL或PRP38或GATA样或ADA2或WDR23样多肽的核酸可以用作限制性片段长度多态性(RFLP)标记。限制性消化的植物基因组DNA的DNA印迹物(Sambrook J,Fritsch EF和Maniatis T(1989)Molecular Cloning,A Laboratory Manual)可以用编码PATL或PRP38或GATA样或ADA2或WDR23样多肽的核酸探测。所得的结合图式随后可以使用计算机程序如MapMaker(Lander等,(1987)Genomics 1:174-181)开展遗传分析以构建遗传图。此外,所述核酸可以用来探测含有一组个体的限制性核酸内切酶处理的基因组DNA的DNA印迹物,其中所述的一组个体代表具有定义的遗传杂交的亲本和子代。DNA多态性的分离是明显的并用来计算编码PATL或PRP38或GATA样或ADA2或WDR23样多肽的核酸在先前使用这个群体获得的遗传图中的位置(Botstein等,(1980)Am.J.Hum.Genet.32:314-331)。
在Bernatzky和Tanksley(1986)Plant Mol.Biol.Reporter 4:37-41中描述了植物基因来源的探针的产生及其在遗传作图中的用途。许多出版物描述了使用上文概述的方法学或其变例对特定cDNA克隆的遗传作图。例如,F2互交群、回交群、随机交配群、邻近纯合系和其他个体群体可以用于作图。此类方法学是本领域技术人员熟知的。
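The nucleic acid probes may also be used for physical mapping (i.e. placement of sequences on physical maps; see Hoheisel et al. in: Non-mammalian Genomic Analysis: A Practical Guide, Academic Press 1996, pp. 319-346, and references cited therein).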
在另一个实施方案中,所述核酸探针可以在直接荧光原位杂交(FISH)作图法(Trask(1991)Trends Genet.7:149-154)中使用。尽管当前的FISH作图法支持使用大的克隆(几个kb至几百个kb;见Laan等,(1995)Genome Res.5:13-20),然而灵敏度的改善可以允许使用更短探针进行FISH作图。
用于遗传作图及物理作图的多种基于核酸扩增的方法可以使用所述核酸而实施。方法例子包括等位基因特异性扩增法(Kazazian(1989)J.Lab.Clin.Med 11:95-96)、PCR扩增片段的多态性(CAPS;Sheffield等,(1993)Genomics 16:325-332)、等位基因特异性连接(Landegren等,(1988)Science 241:1077-1080)、核苷酸延伸反应(Sokolov(1990)NucleicAcid Res.18:3671)、放射杂交作图(Walter等,(1997)Nat.Genet.7:22-28)和Happy作图法(Dear和Cook(1989)Nucleic Acid Res.17:6795-6807)。对于这些方法,使用一种核酸序列的序列来设计并产生在扩增反应或在引物延伸反应中使用的引物对。此类引物的设计是本领域技术人员熟知的。在使用基于PCR遗传作图的方法中,可能需要在对应于当前核酸序列的区域中鉴定作图交叉的亲本之间的DNA序列差异。然而,这对作图法而言通常不是必需的。
如前文所述,本发明方法产生了具有增强的产量相关性状的植物。这些性状也可以与经济上有利的其他性状组合,如其他的产量增强性状、其他非生物胁迫和生物胁迫耐受性、调节多种构造性特征和/或生物化学特征和/或生理学特征的性状。
在一个实施方案中,本发明涉及如下概括的主题:
项1A 用于相对于对照植物增强植物中产量相关性状的方法,其包括调节植物中编码PATELLIN多肽的核酸的表达和可选地选择具有增强的产量相关性状的植物。
项2A 根据项1A的方法,其中所述PATELLIN多肽包含至少一个如下结构域:
(i)SEQ ID NO:71所示的SEC14结构域:lpeldsvvfyrgadreghpvcynvygefqdkdlyekafgdeekrerflkwriqllergilsqldfspsgicsmvqvtdlknsppmlgkhravtrqavallqdnypefiakkvfinvpwwylaankmmspfltqrtkskfifaspaksaetlfryiapeqvpvqfgglfk,或与SEQ ID NO:71所示的结构域或出现在表A中的任意多肽中的任意SEC14结构域以增加的优选顺序具有至少50%、55%、60%、65%、70%、75%、80%、82%、85%、90%、92%、95%、97%或更多的序列同一性的结构域;
(ii)SEQ ID NO:72所示的GOLD结构域:sdavteltikpssketveipvtenstigwelrvlgwevsygaeftpdaeggytvivqktrkvpaneepimkgsfkvgepgkivltinnpaskkkkllyrskv,或与SEQ ID NO:72所示的结构域或出现在表A中的任意多肽中的任意GOLD结构域以增加的优选顺序具有至少50%、55%、60%、65%、70%、75%、80%、82%、85%、90%、92%、95%、97%或更多的序列同一性的结构域。
项3A 根据项1A或2A的方法,其中所述受调节的表达通过在植物中引入和表达编码PATELLIN多肽的核酸实现。
项4A 根据任意前述项的方法,其中所述编码PATELLIN多肽的核酸编码表A中所列任一种蛋白质或是这种核酸的一部分或能与这种核酸杂交的核酸。
项5A 根据任意前述项的方法,其中所述核酸序列编码表A中所列任意蛋白质的直向同源物或旁系同源物。
项6A 根据任意前述项的方法,其中所述增强的产量相关性状包括相对于对照植物提高的产量,优选种子产量。
项7A 根据项1A至6A中任一项的方法,其中所述增强的产量相关性状在缺氮条件下获得。
项8A 根据项3A至7A中任一项的方法,其中所述核酸有效链接至组成型启动子,优选至GOS2启动子,最优选至来自稻的GOS2启动子。
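Item 9A Method according to any preceding item, wherein said nucleic acid encoding a PATELLIN polypeptide is of plant origin, preferably from a monocotyledonous plant, further preferably from the family Poaceae, more preferably from the genus Oryza, most preferably from Oryza sativa.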
项10A 由根据任一前述项的方法可获得的植物或其部分,包括种子,其中所述的植物或其部分包含编码PATELLIN多肽的重组核酸。
项11A 分离的核酸分子,其包含以下特征:
(i)SEQ ID NO:9;SEQ ID NO:11;SEQ ID NO:13;SEQ ID NO:15;SEQ ID NO:17;SEQ ID NO:19;SEQ ID NO:21;SEQ ID NO:23;SEQ ID NO:25和SEQ ID NO:27所示的核酸;
(ii)与(i)中给出的任一SEQ ID NO互补的核酸片段;
(iii)编码PATELLIN多肽的核酸,其与SEQ ID NO:10;SEQ IDNO:12;SEQ ID NO:14;SEQ ID NO:16;SEQ ID NO:18;SEQ ID NO:20;SEQ ID NO:22;SEQ ID NO:24;SEQ ID NO:26和SEQ ID NO:28中给出的任一氨基酸序列以增加的优选顺序具有至少70%、75%、80%、85%、90%、95%、96%、97%、98%、99%或100%的序列同一性;
(iv)在严格条件下能与上面(i)、(ii)或(iii)给出的任一核酸杂交的核酸。
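Item 12A Isolated polypeptide comprising:
(i) an amino acid sequence having, in increasing order of preference, at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity to any one of the amino acid sequences given in SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26 and SEQ ID NO:28;
(ii) a derivative of any of the amino acid sequences given in (i).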
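Item 13A Construct comprising:
(i) a nucleic acid encoding a PATELLIN polypeptide as defined in item 1A, 2A or 12A, or a nucleic acid according to item 11A;
(ii) one or more control sequences capable of driving expression of the nucleic acid sequence of (i); and optionally
(iii) a transcription termination sequence.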
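Item 14A Construct according to item 13A, wherein one of said control sequences is a constitutive promoter, preferably a GOS2 promoter, most preferably a GOS2 promoter from rice.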
项15A 根据项13A或14A的构建体在用于产生植物的方法中的用途,所述植物相对于对照植物具有提高的产量,特别是提高的生物量和/或提高的种子产量。
项16A 用根据项13A或14A的构建体转化的植物、植物部分或植物细胞。
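Item 17A Method for the production of a transgenic plant having increased yield, preferably increased seed yield, relative to control plants, comprising:
(i) introducing and expressing in a plant a nucleic acid encoding a PATELLIN polypeptide as defined in item 1A, 2A or 12A, or a nucleic acid according to item 11A; and
(ii) cultivating the plant cell under conditions promoting plant growth and development; and optionally
(iii) selecting for plants having enhanced yield-related traits.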
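Item 18A Transgenic plant having increased yield, particularly increased seed yield, relative to control plants, resulting from modulated expression of a nucleic acid encoding a PATELLIN polypeptide as defined in item 1A or 2A, or a transgenic plant cell derived from said transgenic plant.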
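Item 19A Transgenic plant according to item 10A, 16A or 18A, or a transgenic plant cell derived therefrom, wherein said plant is a crop plant or a monocot or a cereal, such as rice, maize, wheat, barley, millet, rye, triticale, sorghum and oats.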
Item 20A Harvestable parts of a plant according to item 19A, wherein said harvestable parts are preferably shoot biomass and/or seeds.
项21A 产物,其衍生自根据项19A的植物和/或根据项20A的植物的可收获部分。
项22A 编码PATELLIN多肽的核酸在相对于对照植物提高产量、优选提高种子产量和/或枝条生物量中的用途。
项1B 用于相对于对照植物增强植物中产量相关性状的方法,其包括调节植物中编码PRP38多肽的核酸表达。
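Item 2B Method according to item 1B, wherein said PRP38 polypeptide comprises one or more of the following motifs:
(i) a DUF1777 domain;
(ii) an RS domain;
(iii) motif 1a (SEQ ID NO:120), wherein any amino acid residue may be substituted by a conservative amino acid and/or up to 50% of the amino acid residues may be substituted by a non-conservative amino acid;
(iv) motif 1a (SEQ ID NO:121), wherein any amino acid residue may be substituted by a conservative amino acid and/or up to 50% of the amino acid residues may be substituted by a non-conservative amino acid;
(v) motif 1a (SEQ ID NO:122), wherein any amino acid residue may be substituted by a conservative amino acid and/or up to 50% of the amino acid residues may be substituted by a non-conservative amino acid.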
项3B 根据项1B或2B的方法,其中所述受调节的表达通过在植物中引入并表达编码PRP38多肽的核酸实现。
项4B 根据任意前述项的方法,其中所述的编码PRP38多肽的核酸编码表A中所列的任一种蛋白质或是这种核酸的一部分或是能够与这种核酸杂交的核酸。
项5B 根据任意前述项的方法,其中所述的核酸序列编码表A中给出的任意蛋白质的直向同源物或旁系同源物。
项6B 根据任意前述项的方法,其中所述的增强的产量相关性状包括相对于对照植物提高的产量、优选提高的生物量和/或提高的种子产量。
项7B 根据项1B至6B中任一项的方法,其中所述增强的产量相关性状在非胁迫条件下获得。
项8B 根据项1B至6B中任一项的方法,其中所述的增强的产量相关性状在缺氮条件下获得。
项9B 根据项3B至8B中任一项的方法,其中所述的核酸有效链接至组成型启动子,优选至GOS2启动子,最优选至来自稻的GOS2启动子。
项10B 根据任意前述项的方法,其中所述的编码PRP38多肽的核酸序列是植物来源的,优选地来自双子叶植物,进一步优选地来自十字花科,更优选地来自拟南芥属,最优选地来自拟南芥。
项11B 由根据任一前述项的方法可获得的植物或其部分,包括种子,其中所述的植物或其部分包含编码PRP38多肽的重组核酸。
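Item 12B Construct comprising:
(i) a nucleic acid encoding a PRP38 polypeptide as defined in item 1B or 2B;
(ii) one or more control sequences capable of driving expression of the nucleic acid sequence of (i); and optionally
(iii) a transcription termination sequence.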
项13B 根据项12B的构建体,其中所述调控序列是组成型启动子,优选是GOS2启动子,最优选地是来自稻的GOS2启动子。
项14B 根据项12B或13B的构建体在用于产生植物的方法中的用途,所述植物相对于对照植物具有提高的产量,特别是提高的生物量和/或提高的种子产量。
项15B 用根据项12B或13B的构建体转化的植物、植物部分或植物细胞。
项16B 用于产生转基因植物的方法,所述的转基因植物相对于对照植物具有提高的产量、特别是提高的生物量和/或提高的种子产量,该方法包括:
(i)在植物中引入并表达编码如项1B或2B中所定义的PRP38多肽的核酸;和
(ii)在促进植物生长和发育的条件下培育植物细胞。
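Item 17B Transgenic plant having increased yield, particularly increased biomass and/or increased seed yield, relative to control plants, resulting from modulated expression of a nucleic acid encoding a PRP38 polypeptide as defined in item 1B or 2B, or a transgenic plant cell derived from said transgenic plant.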
项18B 根据项11B、15B或17B的转基因植物,或从其中衍生的转基因植物细胞,其中所述的植物是作物植物或单子叶植物或禾谷植物,如稻、玉米、小麦、大麦、稷、黑麦、小黑麦、高粱和燕麦。
项19B 根据项18B的植物的可收获部分,其中所述的可收获部分优选地是枝条生物量和/或种子。
项20B 产物,其衍生自根据项18B的植物和/或根据项19B的植物的可收获部分。
项21B 编码PRP38多肽的核酸在植物中相对于对照植物提高产量、特别是提高种子产量和/或枝条生物量中的用途。
项1C 用于相对于对照植物提高植物的千粒重、总种子重和饱满种子数中一项或多项的方法,其包括调节植物中编码GATA样多肽的核酸表达,其中所述的GATA样多肽属于GATA转录因子的亚家族II且包含GATA结构域。
Item 2C Method according to item 1C, wherein said GATA-like polypeptide comprises one or more of the following motifs:
(i) Motif 1c: C(S/A/T)(D/E/N)CXT(T/S/A)(K/S)TP(L/M)WR(S/G/N)GP (SEQ ID NO:130),
(ii) Motif 2c: GPKSLCNACGIRX(R/K)K (SEQ ID NO:131),
(iii) Motif 3c: (A/S)(A/W)X(L/C)(L/N)(M/L/V)(T/L/A)(L/D)(S/R) (SEQ ID NO:132).
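The motif notation used for motifs 1c to 3c maps directly onto regular expressions, which makes it straightforward to screen a polypeptide for their presence. The sketch below assumes the usual reading of the notation: residues in parentheses separated by slashes are alternatives at one position, and X stands for any amino acid. The function name and the toy peptide are hypothetical.

```python
import re

MOTIF_1C = "C(S/A/T)(D/E/N)CXT(T/S/A)(K/S)TP(L/M)WR(S/G/N)GP"
MOTIF_2C = "GPKSLCNACGIRX(R/K)K"
MOTIF_3C = "(A/S)(A/W)X(L/C)(L/N)(M/L/V)(T/L/A)(L/D)(S/R)"


def motif_to_regex(motif: str) -> re.Pattern:
    """Convert '(A/B)' alternatives to '[AB]' and 'X' to '.' (any residue)."""
    def to_class(m: re.Match) -> str:
        return "[" + m.group(1).replace("/", "") + "]"

    pattern = re.sub(r"\(([A-Z](?:/[A-Z])*)\)", to_class, motif)
    return re.compile(pattern.replace("X", "."))


# Toy peptide, invented for illustration; it is not a sequence from Table A.
toy = "MMCSDCGTTKTPLWRSGPAA"
match = motif_to_regex(MOTIF_1C).search(toy)
```

A polypeptide would then satisfy the motif criterion if a search with at least one of the three compiled patterns returns a match.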
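Item 3C Method according to item 1C or 2C, wherein said modulated expression is effected by introducing and expressing in a plant a nucleic acid encoding a GATA-like polypeptide.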
项4C 根据任一前述项的方法,其中所述的编码GATA样多肽的核酸编码表A中所列的任一种蛋白质或是这种核酸的一部分或是能够与这种核酸杂交的核酸。
项5C 根据任一前述项的方法,其中所述的核酸序列编码表A中给出的任意蛋白质的直向同源物或旁系同源物。
项6C 根据项1C至5C任一项的方法,其中所述增强的产量相关性状在非胁迫条件下获得。
项7C 根据项3C至6C任一项的方法,其中所述的核酸有效链接至组成型启动子,优选至GOS2启动子,最优选地至来自稻的GOS2启动子。
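Item 8C Method according to any preceding item, wherein said nucleic acid encoding a GATA-like polypeptide is of plant origin, preferably from a monocotyledonous plant, further preferably from the family Poaceae, more preferably from the genus Oryza, most preferably from Oryza sativa.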
项9C 由根据任一前述项的方法可获得的植物或其部分,包括种子,其中所述植物或其部分包含编码GATA样多肽的重组核酸,所述核酸有效连接至植物来源的组成型启动子,优选至GOS2启动子,最优选地至来自稻的GOS2启动子。
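Item 10C Construct comprising:
(i) a nucleic acid encoding a GATA-like polypeptide as defined in item 1C or 2C;
(ii) one or more control sequences of plant origin capable of driving expression of the nucleic acid sequence of (i); and optionally
(iii) a transcription termination sequence.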
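Item 11C Construct according to item 10C, wherein one of said control sequences is a constitutive promoter, preferably a GOS2 promoter, most preferably a GOS2 promoter from rice.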
项12C 根据项10C或11C的构建体在用于产生植物的方法中的用途,所述植物相对于对照植物具有提高的产量,包括千粒重、总种子重和饱满种子数中一项或多项。
项13C 用根据项10C或11C的构建体转化的植物、植物部分或植物细胞。
项14C 用于产生转基因植物的方法,所述的转基因植物相对于对照植物具有提高的产量、特别是提高的生物量和/或提高的种子产量,该方法包括:
(i)在植物中引入并表达编码如项1C或2C中定义的GATA样多肽且有效连接至植物来源的组成型启动子的核酸;和
(ii)在促进植物生长和发育的条件下培育植物细胞。
项15C 转基因植物,其相对于对照植物具有因编码如项1C或2C中定义的GATA样多肽的核酸的受调节表达引起的提高的千粒重,或所述转基因植物衍生的转基因植物细胞。
项16C 根据项9C、13C或15C的转基因植物,或从其中衍生的转基因植物细胞,其中所述的植物是作物植物或单子叶植物或禾谷植物,如稻、玉米、小麦、大麦、稷、黑麦、小黑麦、高粱和燕麦。
Item 17C Harvestable parts of a plant according to item 16C, wherein said harvestable parts are preferably seeds.
项18C 产物,其衍生自根据项16C的植物和/或根据项17C的植物的可收获部分。
项19C 编码GATA样多肽的核酸在植物中相对于对照植物提高千粒重、总种子重和饱满种子数中一项或多项的用途。
项1D 用于相对于对照植物增强植物中产量相关性状的方法,其包括调节植物中编码ADA2多肽的核酸的表达和可选地选择具有增强的产量相关性状的植物。
项2D 根据项1D的方法,其中所述ADA2多肽包含两个或多个如下基序:
(i)SEQ ID NO:207:kpglyccnycdkdlsglvrfkcavcmdfdlcvecfsvgvelnrhkn所示的ZZ型锌指结构域,或与SEQ ID NO:207所示的结构域或出现在表A中任意多肽中的任意ZZ型锌指结构域以增加的优选顺序具有至少50%、55%、60%、65%、70%、75%、80%、82%、85%、90%、92%、95%、97%或更多的序列同一性的结构域;
(ii)SEQ ID NO:208所示的SANT DNA结合结构域:vtsdwnadeeillleaiatygfgnwkevadhvgsktttecikhfnsaym,或SEQ ID NO:208所示的结构域或出现在表A中任意多肽中的任意SANT DNA结合结构域以增加的优选顺序具有至少50%、55%、60%、65%、70%、75%、80%、82%、85%、90%、92%、95%、97%或更多的序列同一性的结构域;
(iii)SEQ ID NO:209所示的Ca结合EF手结构域:dndaeqlladmef,或SEQ ID NO:209所示的结构域或出现在表A中任意多肽中的任意Ca结合EF手结构域以增加的优选顺序具有至少50%、55%、60%、65%、70%、75%、80%、82%、85%、90%、92%、95%、97%或更多的序列同一性的结构域;
(iv)SEQ ID NO:210所示的SWIRM结构域:priysgldtwdvdgllgadllsetekkmcnetrilpvhylkmldiltreikkgqikkksdaysffkvepskvdrvydmlvhkgigdst,或SEQ ID NO:210所示的结构域或出现在表A中任意多肽中的任意SWIRM结构域以增加的优选顺序具有至少50%、55%、60%、65%、70%、75%、80%、82%、85%、90%、92%、95%、97%或更多的序列同一性的结构域。
项3D 根据项1D或2D的方法,其中所述受调节的表达通过在植物中引入和表达编码ADA2多肽的核酸实现。
项4D 根据任意前述项的方法,其中所述编码ADA2多肽的核酸编码表A中所列任一种蛋白质,或是这种核酸的一部分或能与这种核酸杂交的核酸。
项5D 根据任意前述项的方法,其中所述核酸序列编码表A中所列任意蛋白质的直向同源物或旁系同源物。
项6D 根据任意前述项的方法,其中所述增强的产量相关性状包括相对于对照植物提高的产量,优选种子产量。
项7D 根据项1D至6D中任一项的方法,其中所述增强的产量相关性状在轻度胁迫条件下获得。
项8D 根据项1D至6D中任一项的方法,其中所述增强的产量相关性状在缺氮条件下获得。
项9D 根据项3D至8D中任一项的方法,其中所述核酸有效链接至组成型启动子,优选至HMGP启动子,最优选至来自稻的HMGP启动子。
项10D 根据任意前述项的方法,其中所述编码ADA2多肽的核酸具有植物起源,优选来自双子叶植物,进一步优选地来自十字花科,更优选地来自拟南芥属,最优选地来自拟南芥。
项11D 由根据任一前述项的方法可获得的植物或其部分,包括种子,其中所述的植物或其部分包含编码ADA2多肽的重组核酸。
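Item 12D Construct comprising:
(i) a nucleic acid encoding an ADA2 polypeptide as defined in item 1D or 2D;
(ii) one or more control sequences capable of driving expression of the nucleic acid sequence of (i); and optionally
(iii) a transcription termination sequence.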
项13D 根据项12D的构建体,其中所述调控序列之一是组成型启动子,优选是HMGP启动子,最优选地是来自稻的HMGP启动子。
项14D 根据项12D或13D的构建体在用于产生植物的方法中的用途,所述植物相对于对照植物具有提高的产量,特别是提高的生物量和/或提高的种子产量。
项15D 用根据项12D或13D的构建体转化的植物、植物部分或植物细胞。
项16D 用于产生转基因植物的方法,所述的转基因植物相对于对照植物具有提高的产量、特别是提高的种子产量,该方法包括:
(i)在植物中引入并表达编码如项1D或2D中所定义的ADA2多肽的核酸;和
(ii)在促进植物生长和发育的条件下培育植物细胞;和任选地
(iii)选择具有增强的产量相关性状的植物。
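Item 17D Transgenic plant having increased yield, particularly increased seed yield, relative to control plants, resulting from modulated expression of a nucleic acid encoding an ADA2 polypeptide as defined in item 1D or 2D, or a transgenic plant cell derived from said transgenic plant.
Item 18D Transgenic plant according to item 11D, 15D or 17D, or a transgenic plant cell derived therefrom, wherein said plant is a crop plant or a monocot or a cereal, such as rice, maize, wheat, barley, millet, rye, triticale, sorghum and oats.
Item 19D Harvestable parts of a plant according to item 18D, wherein said harvestable parts are preferably shoot biomass and/or seeds.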
项20D 产物,其衍生自根据项18D的植物和/或根据项19D的植物的可收获部分。
项21D 编码ADA2多肽的核酸在相对于对照植物提高植物产量、优选提高种子产量和/或枝条生物量中的用途。
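Item 1E Isolated nucleic acid sequence encoding a WD40 repeat (WDR) 23-like polypeptide, comprising:
(i) an isolated nucleic acid sequence as represented by SEQ ID NO:219, SEQ ID NO:225 or SEQ ID NO:229;
(ii) the complement of an isolated nucleic acid sequence as represented by SEQ ID NO:219, SEQ ID NO:225 or SEQ ID NO:229;
(iii) an isolated nucleic acid sequence encoding a polypeptide sequence as represented by SEQ ID NO:220, SEQ ID NO:226 or SEQ ID NO:230;
(iv) an isolated nucleic acid sequence that can be deduced, as a result of the degeneracy of the genetic code, from a polypeptide sequence as represented by SEQ ID NO:220, SEQ ID NO:226 or SEQ ID NO:230;
(v) an isolated nucleic acid sequence capable of hybridizing under stringent hybridization conditions to a nucleic acid sequence as represented by SEQ ID NO:219, SEQ ID NO:225 or SEQ ID NO:229, or to its complement;
(vi) an isolated nucleic acid sequence encoding a polypeptide having, in increasing order of preference, at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or more amino acid sequence identity to a polypeptide sequence as represented by SEQ ID NO:220, SEQ ID NO:226 or SEQ ID NO:230;
(vii) an isolated nucleic acid sequence encoding a polypeptide comprising a domain having, in increasing order of preference, at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or more amino acid sequence identity to the conserved domain as represented by SEQ ID NO:271.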
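Item 2E Isolated WDR23-like polypeptide comprising:
(i) a polypeptide sequence as represented by SEQ ID NO:220, SEQ ID NO:226 or SEQ ID NO:230;
(ii) a polypeptide sequence having, in increasing order of preference, at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% amino acid sequence identity to a polypeptide sequence as represented by SEQ ID NO:220, SEQ ID NO:226 or SEQ ID NO:230;
(iii) a polypeptide comprising a domain having, in increasing order of preference, at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or more amino acid sequence identity to the conserved domain as represented by SEQ ID NO:271;
(iv) a derivative of any of the polypeptide sequences given in (i) to (iii) above.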
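Item 3E Method for enhancing yield-related traits in plants relative to control plants, comprising increasing expression in a plant of a nucleic acid sequence encoding a WDR23-like polypeptide, which WDR23-like polypeptide comprises a domain having at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, 99% or more amino acid sequence identity to the conserved domain (CD) represented by SEQ ID NO:271, and optionally selecting for plants having enhanced yield-related traits.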
项4E 根据项3E的方法,其中所述WDR23样多肽包含:(i)至少四个具有PFAM登录号PF00400的WD40重复;和(ii)两个连续WD40重复末端的至少两个保守的DxR基序。
项5E 根据项3E或4E的方法,其中所述WDR23样多肽与SEQID NO:216所示的WDR23样多肽或本文表A给出的任意多肽序列或项2E中定义的WDR23样多肽以增加的优选顺序具有至少50%、55%、60%、65%、70%、75%、80%、85%、90%、95%、98%、99%或更高氨基酸序列同一性。
项6E 根据项3E至5E中任一项的方法,其中所述编码WDR23样多肽的核酸序列由表A中给出的任一核酸序列SEQ ID NO或其部分或能与表A中给出的任一核酸序列SEQ ID NO杂交的序列代表。
项7E 根据项3E至6E中任一项的方法,其中所述核酸序列编码表A中所列任意多肽序列SEQ ID NO的直向同源物或旁系同源物。
项8E 根据项3E至7E中任一项的方法,其中所述增加的表达通过T-DNA活化标签技术、TILLING或同源重组中任一种或多种实现。
项9E 根据项3E至8E中任一项的方法,其中所述增加的表达通过在植物中引入并表达编码WDR23样多肽的核酸序列实现。
项10E 根据项3E至9E中任一项的方法,其中所述增加的产量相关性状是提高的每株植物种子总产量、提高的种子饱满率、提高的饱满种子数、提高的收获指数、和提高的千粒重中一项或多项。
项11E 根据项3E至10E中任一项的方法,其中所述核酸序列有效连接至组成型启动子,优选至植物组成型启动子,更优选至GOS2启动子,最优选至SEQ ID NO:272所示的来自稻的GOS2启动子。
项12E 根据项3E至10E中任一项的方法,其中所述核酸序列有效连接至分生组织特异性启动子,优选至植物金属硫蛋白启动子,更优选至SEQ ID NO:273所示的来自稻的金属硫蛋白启动子。
项13E 根据项3E至12E中任一项的方法,其中所述编码WDR23样多肽的核酸序列来自植物界,优选来自双子叶植物,进一步优选地来自十字花科,最优选地来自拟南芥。
项14E 由根据项3E至13E中任一项的方法可获得的植物、其部分(包括种子)或植物细胞,其中所述的植物、其部分或细胞包含有效连接至植物组成型启动子的编码WDR23样多肽的分离的核酸转基因。
项15E 植物、其部分(包括种子)或植物细胞,其包含根据项1E的分离的核酸转基因或包含根据项2E的编码WDR23样多肽的分离核酸序列。
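Item 16E Construct comprising:
(i) a nucleic acid sequence encoding a WDR23-like polypeptide as defined in any one of items 1E, 2E or 3E to 7E;
(ii) one or more control sequences capable of driving expression of the nucleic acid sequence of (i); and optionally
(iii) a transcription termination sequence.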
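Item 17E Construct according to item 16E, wherein said control sequence is a plant constitutive promoter, preferably a GOS2 promoter, more preferably a rice GOS2 promoter, most preferably the GOS2 promoter represented by SEQ ID NO:272.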
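Item 18E Construct according to item 16E, wherein said control sequence is a meristem-specific promoter, preferably a metallothionein (MT) promoter, more preferably a rice MT promoter, most preferably the MT promoter represented by SEQ ID NO:273.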
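Item 19E Use of a construct according to item 16E or 18E in a method for producing plants having enhanced yield-related traits relative to control plants, which enhanced yield-related traits are one or more of: increased total seed yield per plant, increased seed fill rate, increased number of filled seeds, increased harvest index, and increased thousand kernel weight.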
项20E 用根据项16E至18E中任一项的构建体转化的植物、植物部分或植物细胞。
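Item 21E Method for the production of a transgenic plant having enhanced yield-related traits relative to control plants, comprising:
(i) introducing and expressing in a plant, plant part or plant cell a nucleic acid sequence encoding a WDR23-like polypeptide as defined in any one of items 1E or 3E to 7E; and
(ii) cultivating the plant cell, plant part or plant under conditions promoting plant growth and development.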
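Item 22E Transgenic plant having enhanced yield-related traits relative to control plants, resulting from increased expression of a nucleic acid sequence encoding a WDR23-like polypeptide as defined in any one of items 1E or 3E to 7E, operably linked to a plant-expressible promoter, or a transgenic plant cell or transgenic plant part derived from said transgenic plant.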
项23E 根据项14E、15E、20E或22E的转基因植物,其中所述的植物是作物植物或单子叶植物或禾谷植物,如稻、玉米、小麦、大麦、稷、黑麦、小黑麦、高粱和燕麦,或从所述转基因植物衍生的转基因植物细胞。
项24E 根据项23E的植物的可收获部分,其包含编码WDR23样多肽的分离核酸序列,其中所述的可收获部分优选是种子。
项25E 产物,其衍生自根据项23E的植物和/或根据项24E的植物的可收获部分。
项26E 编码如项1E或3E至7E中任一项定义的WDR23样多肽的核酸序列在增强产量相关性状中的用途,所述增强产量相关性状包括提高的每株植物种子总产量、提高的种子饱满率、提高的饱满种子数、提高的收获指数、和提高的千粒重中一项或多项。
附图简述
本发明现在将参考以下图进行描述,其中:
图1代表序列SEQ ID NO:2。标出结构域结构和功能相关氨基酸。SEC14和GOLD结构域以粗体和双下划线分别标出。框出涉及PtdIns/PtdChs结合/转运活性的氨基酸残基,而沿着脂类-结合口袋的疏水残基有下划线。盐桥结构域以小写字母显示。卷曲螺旋下标曲线。
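Figure 2 represents a multiple alignment of selected PATL polypeptides of Table A1.
Figure 3 shows a phylogenetic tree of selected PATL polypeptides of Table A1.
Figure 4 represents the binary vector used for increased expression in Oryza sativa of a PATL-encoding nucleic acid under the control of a rice GOS2 promoter.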
图5详述了用于实施本发明方法的序列的实例。
图6代表序列SEQ ID NO:77。示出保守结构域和基序:PRP38结构域是下划线的、DUF1777结构域是粗体字的、基序Ib,IIb,IIIb和IVb是框出的。
图7代表表A2的PRP38多肽的多重比对。
图8显示表A2的PRP38多肽的系统树。
图9代表双元载体,其用于在稻中增加稻GOS2启动子(pGOS2)控制下的PRP38编码核酸的表达。
图10详述了用于实施本发明方法的序列的实例。
图11代表SEQ ID NO:129的结构域结构,GATA结构域为粗体且保守基序1c至3c是下划线的。
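Figure 12 represents a multiple alignment of subgroup II GATA-like polypeptides. Dots indicate conserved residues, colons indicate highly conserved residues, and asterisks indicate identical amino acids.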
图13代表双元载体,其用于在稻中增加稻GOS2启动子(pGOS2::GATA样)控制下的GATA样-编码核酸的表达。
图14详述了用于实施本发明方法的序列的实例。
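Figure 15 represents the sequence SEQ ID NO:182. Conserved domains and motifs are indicated. The putative nuclear localization signal "KRKK" is boxed, the ZZ-type zinc finger domain is shown in bold, the SANT DNA-binding domain is underlined and in bold, the Ca-binding EF-hand domain is in italics and underlined, and the SWIRM domain is double underlined. The relevant Lys (K) residues in the central part of the protein that putatively mediate acetylation of ADA2 are boxed.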
图16代表表A4的ADA2多肽的多重比对。
图17显示表A4的ADA2多肽的系统树。
图18和图19代表双元载体,其用于在稻中增加稻HMGP启动子(图18)和EXP9启动子(图19)控制下的ADA2编码核酸的表达。
图20详述了用于实施本发明方法的序列的实例。
图21代表SEQ ID NO:216所示的WRD23样的结构的卡通图。对应PFAM登录号PF00400的D40重复是图表标出的。
图22显示人中WRD23的功能。WRD23是多蛋白质泛素E3连接酶复合物的部分,其中Cullin4(CUL4)和损伤DNA结合蛋白质1(DDB1)是核心蛋白质(Higa等,(2007)Cell Division 2:5;Angers等,(2006)Nature 443:590-593;Higa等,(2006)Nature Cell Biol 8(11):1277-1283;He等,(2006)Genes&Development 20:2949-2954)。这种复合物,例如WRD23,为底物募集机制锚定WD40蛋白质作为分子衔接头,底物随后被泛素化和破坏。
图23显示序列比对,其代表在WD40的两个连续的浆(blade)中保守的DxR基序、在来自人(Homo sapiens)的WRD23(NCBI登录号AK057636)、黑曲霉(Aspergillus niger)(NCBI登录号CAK40817)和表A5的植物WRD23样多肽。
图24显示表A5的WDR23样多肽的AlignX(来自Vector NTI10.3,Invitrogen Corporation)序列多重比对。例SEQ ID NO:271所示的保守结构域(CD)的起点和终点以括号显示。共有序列中对应PF00400的WD40重复以X标出。DxR基序也在共有序列中鉴定。
图25显示双元载体,其用于在稻中增加植物表达性启动子,例如均来自稻的GOS2启动子或分生组织启动子控制下的WDR23样多肽编码核酸的表达。
图26详述了用于实施本发明方法的序列的实例。
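Examples
The present invention will now be described with reference to the following examples, which are by way of illustration alone. The following examples are not intended to completely define or otherwise limit the scope of the invention.
DNA manipulation: unless otherwise stated, recombinant DNA techniques are performed according to standard protocols described in Sambrook (2001) Molecular Cloning: a laboratory manual, 3rd Edition, Cold Spring Harbor Laboratory Press, CSH, New York, or in Volumes 1 and 2 of Ausubel et al. (1994), Current Protocols in Molecular Biology, Current Protocols. Standard materials and methods for plant molecular work are described in Plant Molecular Biology Labfax (1993) by R.D.D. Croy, published by BIOS Scientific Publications Ltd (UK) and Blackwell Scientific Publications (UK).
Example 1: Identification of sequences related to the nucleic acid sequence used in the methods of the invention
Sequences (full-length cDNA, ESTs or genomic) related to the nucleic acid sequence used in the methods of the present invention were identified amongst those maintained in the Entrez Nucleotides database at the National Center for Biotechnology Information (NCBI) using database sequence search tools, such as the Basic Local Alignment Search Tool (BLAST) (Altschul et al. (1990) J. Mol. Biol. 215:403-410; and Altschul et al. (1997) Nucleic Acids Res. 25:3389-3402). The program is used to find regions of local similarity between sequences by comparing nucleic acid or polypeptide sequences to sequence databases and by calculating the statistical significance of matches. For example, the polypeptide encoded by the nucleic acid used in the present invention was used as the query for the TBLASTN algorithm, with default settings and with the filter to ignore low-complexity sequences set off. The output of the analysis was viewed by pairwise comparison and ranked according to the probability score (E-value), where the score reflects the probability that a particular alignment occurs by chance (the lower the E-value, the more significant the hit). In addition to E-values, comparisons were also scored by percentage identity. Percentage identity refers to the number of identical nucleotides (or amino acids) between the two compared nucleic acid (or polypeptide) sequences over a particular length.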
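The percentage identity defined above can be illustrated with a minimal Python sketch. It assumes the two sequences have already been aligned (gaps written as "-") and, as a further assumption not stated in the specification, counts only the columns in which neither sequence has a gap as the "particular length". The function name is hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percentage of identical residues over the gap-free aligned columns."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0   # columns where both sequences carry a residue
    identical = 0  # compared columns where the residues match
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # skip gap columns (assumption: gaps are not counted)
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        raise ValueError("no gap-free columns to compare")
    return 100.0 * identical / compared
```

For instance, two six-residue peptides that differ at one position give 5 identical columns out of 6. Tools such as BLAST report identity over the length of each local alignment, so values from this sketch are only comparable when the same alignment and the same denominator convention are used.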
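Table A provides a list of nucleic acid sequences related to the nucleic acid sequence used in the methods of the present invention.
The term "Table A" used in this specification is to be taken to specify the content of Table A1, and/or A2, and/or A3, and/or A4, and/or A5. The term "Table A1" used in this specification is to be taken to specify the content of Table A1. The term "Table A2" used in this specification is to be taken to specify the content of Table A2. The term "Table A3" used in this specification is to be taken to specify the content of Table A3. The term "Table A4" used in this specification is to be taken to specify the content of Table A4. The term "Table A5" used in this specification is to be taken to specify the content of Table A5. In one preferred embodiment, the term "Table A" means Table A1. In one preferred embodiment, the term "Table A" means Table A2. In one preferred embodiment, the term "Table A" means Table A3. In one preferred embodiment, the term "Table A" means Table A4. In one preferred embodiment, the term "Table A" means Table A5.
The term "Table B" used in this specification is to be taken to specify the content of Table B1, and/or B2, and/or B3, and/or B4, and/or B5. The term "Table B1" used in this specification is to be taken to specify the content of Table B1. The term "Table B2" used in this specification is to be taken to specify the content of Table B2. The term "Table B3" used in this specification is to be taken to specify the content of Table B3. The term "Table B4" used in this specification is to be taken to specify the content of Table B4. The term "Table B5" used in this specification is to be taken to specify the content of Table B5. In one preferred embodiment, the term "Table B" means Table B1. In one preferred embodiment, the term "Table B" means Table B2. In one preferred embodiment, the term "Table B" means Table B3. In one preferred embodiment, the term "Table B" means Table B4. In one preferred embodiment, the term "Table B" means Table B5.
The term "Table C" used in this specification is to be taken to specify the content of Table C1, and/or C2, and/or C3, and/or C4, and/or C5. The term "Table C1" used in this specification is to be taken to specify the content of Table C1. The term "Table C2" used in this specification is to be taken to specify the content of Table C2. The term "Table C3" used in this specification is to be taken to specify the content of Table C3. The term "Table C4" used in this specification is to be taken to specify the content of Table C4. The term "Table C5" used in this specification is to be taken to specify the content of Table C5. In one preferred embodiment, the term "Table C" means Table C1. In one preferred embodiment, the term "Table C" means Table C2. In one preferred embodiment, the term "Table C" means Table C3. In one preferred embodiment, the term "Table C" means Table C4. In one preferred embodiment, the term "Table C" means Table C5.
The term "Table D" used in this specification is to be taken to specify the content of Table D1, and/or D2, and/or D3, and/or D4, and/or D5. The term "Table D1" used in this specification is to be taken to specify the content of Table D1. The term "Table D2" used in this specification is to be taken to specify the content of Table D2. The term "Table D3" used in this specification is to be taken to specify the content of Table D3. The term "Table D4" used in this specification is to be taken to specify the content of Table D4. The term "Table D5" used in this specification is to be taken to specify the content of Table D5. In one preferred embodiment, the term "Table D" means Table D1. In one preferred embodiment, the term "Table D" means Table D2. In one preferred embodiment, the term "Table D" means Table D3. In one preferred embodiment, the term "Table D" means Table D4. In one preferred embodiment, the term "Table D" means Table D5.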
Table A1: Examples of PATL nucleic acids and polypeptides:
Name | Source species | Nucleic acid SEQ ID NO: | Polypeptide SEQ ID NO: |
Orysa_PATL_1 | Oryza sativa | 1 | 2 |
Orysa_PATL_2 | Oryza sativa | 3 | 4 |
Orysa_PATL_3 | Oryza sativa | 5 | 6 |
Orysa_PATL_4 | Oryza sativa | 7 | 8 |
Brana_PATL_1 | Brassica napus | 9 | 10 |
Helan_PATL_1 | Helianthus annuus | 11 | 12 |
Zeama_PATL_1 | Zea mays | 13 | 14 |
Glyma_PATL_1 | Glycine max | 15 | 16 |
Triaes_PATL_2 | Triticum aestivum | 17 | 18 |
Triaes_PATL_3 | Triticum aestivum | 19 | 20 |
Glyma_PATL_2 | Glycine max | 21 | 22 |
Glyma_PATL_3 | Glycine max | 23 | 24 |
Zeama_PATL_2 | Zea mays | 25 | 26 |
Zeama_PATL_4 | Zea mays | 27 | 28 |
Sacof_PATL_1 | Saccharum officinarum | 29 | 30 |
Sacof_PATL_2 | Saccharum officinarum | 31 | 32 |
Sacof_PATL_3 | Saccharum officinarum | 33 | 34 |
Sacof_PATL_4 | Saccharum officinarum | 35 | 36 |
Triae_PATL_1 | Triticum aestivum | 37 | 38 |
Arath_PATL_1 | Arabidopsis thaliana | 39 | 40 |
Arath_PATL_2 | Arabidopsis thaliana | 41 | 42 |
Arath_PATL_3 | Arabidopsis thaliana | 43 | 44 |
Arath_PATL_4 | Arabidopsis thaliana | 45 | 46 |
Arath_PATL_5 | Arabidopsis thaliana | 47 | 48 |
Arath_PATL_6 | Arabidopsis thaliana | 49 | 50 |
Poptr_PATL_1 | Populus trichocarpa | 51 | 52 |
Poptr_PATL_2 | Populus trichocarpa | 53 | 54 |
Poptr_PATL_3 | Populus trichocarpa | 55 | 56 |
Poptr_PATL_4 | Populus trichocarpa | 57 | 58 |
Lyces_PATL_1 | Lycopersicon esculentum | 59 | 60 |
Medtr_PATL_1 | Medicago truncatula | 61 | 62 |
Betvu_PATL_1 | Beta vulgaris | 63 | 64 |
Chlre_PATL_1 | Chlamydomonas reinhardtii | 65 | 66 |
Dicdi_PATL_1 | Dictyostelium discoideum | 67 | 68 |
Table A2: Examples of PRP38 nucleic acids and polypeptides:
Name | Source species | Nucleic acid SEQ ID NO: | Polypeptide SEQ ID NO: |
Arath_PRP38_1 | Arabidopsis thaliana | SEQ ID NO: 76 | SEQ ID NO: 77 |
Arath_PRP38_2 | Arabidopsis thaliana | SEQ ID NO: 78 | SEQ ID NO: 79 |
Arath_PRP38_3 | Arabidopsis thaliana | SEQ ID NO: 80 | SEQ ID NO: 81 |
Arath_PRP38_4 | Arabidopsis thaliana | SEQ ID NO: 82 | SEQ ID NO: 83 |
Arath_PRP38_5 | Arabidopsis thaliana | SEQ ID NO: 84 | SEQ ID NO: 85 |
brasy_PRP38_1 | Brachypodium sylvaticum | SEQ ID NO: 86 | SEQ ID NO: 87 |
brasy_PRP38_2 | Brachypodium sylvaticum | SEQ ID NO: 88 | SEQ ID NO: 89 |
Chlre_PRP38_1 | Chlamydomonas reinhardtii | SEQ ID NO: 90 | SEQ ID NO: 91 |
Horvu_PRP38_1 | Hordeum vulgare | SEQ ID NO: 92 | SEQ ID NO: 93 |
Lyces_PRP38_1 | Lycopersicon esculentum | SEQ ID NO: 94 | SEQ ID NO: 95 |
Medtr_PRP38_1 | Medicago truncatula | SEQ ID NO: 96 | SEQ ID NO: 97 |
Orysa_PRP38_1 | Oryza sativa | SEQ ID NO: 98 | SEQ ID NO: 99 |
Orysa_PRP38_2 | Oryza sativa | SEQ ID NO: 100 | SEQ ID NO: 101 |
Ostta_PRP38_1 | Ostreococcus tauri | SEQ ID NO: 102 | SEQ ID NO: 103 |
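Table A3: Examples of GATA-like polypeptides:
Plant source | Nucleic acid SEQ ID NO: | Protein SEQ ID NO: |
Oryza sativa | 128 | 129 |
Oryza sativa | 137 | 138 |
Oryza sativa | 139 | 140 |
Oryza sativa | 141 | 142 |
Oryza sativa | 143 | 144 |
Oryza sativa | 145 | 146 |
Arabidopsis thaliana | 147 | 148 |
Arabidopsis thaliana | 149 | 150 |
Arabidopsis thaliana | 151 | 152 |
Arabidopsis thaliana | 153 | 154 |
Arabidopsis thaliana | 155 | 156 |
Arabidopsis thaliana | 157 | 158 |
Arabidopsis thaliana | 137 | 138 |
Vitis vinifera | 139 | 140 |
Vitis vinifera | 141 | 142 |
Arabidopsis thaliana | 143 | 144 |
Vitis vinifera | 145 | 146 |
Vitis vinifera | 147 | 148 |
Oryza sativa | 149 | 150 |
Oryza sativa | 151 | 152 |
Arabidopsis thaliana | 153 | 154 |
Oryza sativa | 155 | 156 |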
Table A4: Examples of ADA2 nucleic acids and polypeptides:
Name | Source species | Nucleic acid SEQ ID NO: | Polypeptide SEQ ID NO: |
Arath_ADA2_1 | Arabidopsis thaliana | 181 | 182 |
Arath_ADA2_2 | Arabidopsis thaliana | 183 | 184 |
Dicdi_ADA2_1 | Dictyostelium discoideum | 185 | 186 |
Lyces_ADA2_1 | Lycopersicon esculentum | 187 | 188 |
Lyces_ADA2_2 | Lycopersicon esculentum | 189 | 190 |
Ostlu_ADA2_1 | Ostreococcus lucimarinus | 191 | 192 |
Orysa_ADA2_1 | Oryza sativa | 193 | 194 |
Poptr_ADA2_1 | Populus trichocarpa | 195 | 196 |
Poptr_ADA2_2 | Populus trichocarpa | 197 | 198 |
Poptr_ADA2_3 | Populus trichocarpa | 199 | 200 |
Vitvi_ADA2_1 | Vitis vinifera | 201 | 202 |
Zeama_ADA_1 | Zea mays | 203 | 204 |
Zeama_ADA_2 | Zea mays | 205 | 206 |
Table A5: Examples of WDR23-like polypeptide sequences and encoding nucleic acid sequences:
Name | Source organism | Public database accession number | Nucleic acid sequence SEQ ID NO: | Polypeptide sequence SEQ ID NO: | Status |
Arath_WDR23 | Arabidopsis thaliana | AT4G03020 | 215 | 216 | Full length |
Aqufo_WDR23 | Aquilegia formosa x Aquilegia pubescens | DR934238, DT758266 | 217 | 218 | Full length |
Brana_WDR23 | Brassica napus | Proprietary | 219 | 220 | Full length |
Glyma_WDR23 | Glycine max | EH262769.1, BE807607.1, BM178842.1 | 221 | 222 | Full length |
Goshi_WDR23 | Gossypium hirsutum | DT571300, DW224686.1 | 223 | 224 | Full length |
Helan_WDR23 | Helianthus annuus | Proprietary | 225 | 226 | Full length |
Horvu_WDR23II | Hordeum vulgare | TA42103_4513 | 227 | 228 | Full length |
Linus_WDR23 | Linum usitatissimum | Proprietary | 229 | 230 | Full length |
Liter_WDR23 (or LEC14B) | Lithospermum erythrorhizon | D83074.1 | 231 | 232 | Full length |
Lyces_WDR23 | Lycopersicon esculentum | BT013732.1 | 233 | 234 | Full length |
Medtr_WDR23 | Medicago truncatula | TC107985 | 235 | 236 | Full length |
Orysa_WDR23 | Oryza sativa | NM_001062054 (Os05g0407200) | 237 | 238 | Full length |
Pinra_WDR23 | Pinus radiata | AEB27202 | 239 | 240 | Full length |
Poptr_WDR23 | Populus tremuloides | scaff_XIV.822[1577]f[31-1497] | 241 | 242 | Full length |
Pruar_WDR23 (LEC14B) | Prunus armeniaca | U82760.1 | 243 | 244 | Full length |
Sacof_WDR23II | Saccharum officinarum | CA119761, CA209562.1, CA198970.1 | 245 | 246 | Full length |
Triae_WRD23 | Triticum aestivum | TA81375_4565 | 247 | 248 | Full length |
Triae_WRD23II | Triticum aestivum | EA148218 | 249 | 250 | Full length |
Vitvi_WDR23 | Vitis vinifera | EV236978.1, CB002670, EV235943 | 251 | 252 | Full length |
Zeama_WDR23 | Zea mays | DT943774.1, DV536181.1, EE042623.2 | 253 | 254 | Full length |
Zeama_WDR23II | Zea mays | CO527332.1, CF004625.1, EB408231.1, DV163655.1 | 255 | 256 | Full length |
Citsi_WDR23 | Citrus sinensis | DN620350, CN186594.1 | 257 | 258 | Partial |
Glyma_WRD23II | Glycine max | BQ741328.1, BI471220.1, CX708493.1 | 259 | 260 | Partial |
Horvu_WDR23I | Hordeum vulgare | BQ753299 | 261 | 262 | Partial |
Horvu_WDR23III | Hordeum vulgare | BQ471803.1, BM370019.2 | 263 | 264 | Partial |
Pinta_WDR23 | Pinus taeda | CV034652, CX652385 | 265 | 266 | Partial |
Sacof_WDR23 | Saccharum officinarum | CA146950, CF570656 | 267 | 268 | Partial |
Sorbi_WDR23 | Sorghum bicolor | CB928406.1, BE599991, CF770659.1 | 269 | 270 | Partial |
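For WDR23-like proteins, in some instances, related sequences have been tentatively assembled and publicly disclosed by research institutions, such as The Institute for Genomic Research (TIGR). The Eukaryotic Gene Orthologs (EGO) database may be used to identify such related sequences, either by keyword search or by using the BLAST algorithm with the nucleic acid or polypeptide sequence of interest. In other instances, special nucleic acid sequence databases have been created for particular organisms, such as by the Joint Genome Institute. In other instances, sequences were obtained from proprietary databases built through in-house sequencing efforts, for example for Brassica napus, Helianthus annuus and Linum usitatissimum.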
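Example 2: Alignment of PATL, PRP38, GATA-like, ADA2 and WDR23-like polypeptide sequences
Alignment of polypeptide sequences was performed using the AlignX programme from Vector NTI (Invitrogen), which is based on the popular Clustal W algorithm of progressive alignment (Thompson et al. (1997) Nucleic Acids Res 25:4876-4882; Chenna et al. (2003) Nucleic Acids Res 31:3497-3500). The default values are: gap opening penalty 10, gap extension penalty 0.1, and the selected weight matrix is Blosum 62 (if polypeptides are aligned). Sequence conservation among PATL polypeptides is higher at the C-terminus than at the N-terminus. A consensus sequence is given. Conserved amino acids are indicated. The PATL polypeptides are aligned in Fig. 2. Comparison of Fig. 1 and Fig. 2 reveals the presence of the conserved domains, and that the key amino acid residues shown in Fig. 1 occur in the PATL polypeptides of Fig. 2. A phylogenetic tree of PATL polypeptides (Fig. 3) was constructed using the neighbour-joining clustering algorithm as provided in the AlignX programme from Vector NTI (Invitrogen).
Sequence conservation among PRP38 polypeptides is essentially in the N-terminal PRP38 domain of the polypeptides, the C-terminal domain usually being more variable in sequence length and composition. The PRP38 polypeptides are aligned in Fig. 7. A phylogenetic tree of PRP38 polypeptides (Fig. 8) was constructed using the neighbour-joining clustering algorithm as provided in the AlignX programme from Vector NTI (Invitrogen).
Sequence conservation among GATA-like polypeptides is essentially in the GATA domain of the polypeptides, the C-terminal and N-terminal domains usually being more variable in sequence length and composition. The GATA-like polypeptides are aligned in Fig. 12.
Sequence conservation among ADA2 polypeptides is essentially in the conserved ZZ-type Zn finger, the SANT DNA-binding domain and the SWIRM domain of the polypeptides. A consensus sequence is given. The ZZ-type Zn finger domain is rich in cysteine residues. The ADA2 polypeptides are aligned in Fig. 16. A phylogenetic tree of ADA2 polypeptides (Fig. 17) was constructed using the neighbour-joining clustering algorithm as provided in the AlignX programme from Vector NTI (Invitrogen).
Multiple sequence alignment of the full-length WDR23-like polypeptide sequences of Table A5 was performed using the AlignX algorithm (from Vector NTI 10.3, Invitrogen Corporation). The results of the alignment are shown in Fig. 23 and Fig. 24 of the present application.
Fig. 23 shows a sequence alignment representing the DxR motifs conserved in two consecutive blades of the WD40 repeats, in WDR23 from Homo sapiens (NCBI accession number AK057636), in the Aspergillus niger sequence (NCBI accession number CAK40817), and in the plant WDR23-like polypeptides of Table A5.
In Fig. 24, the beginning and the end of the conserved domain (CD), for example as represented by SEQ ID NO: 271, are shown using brackets. The WD40 repeats corresponding to PF00400 are marked with X in the consensus sequence. The DxR motifs are also identified in the consensus sequence.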
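Example 3: Calculation of global percentage identity between polypeptide sequences useful in performing the methods of the invention
Global percentages of similarity and identity between full-length polypeptide sequences useful in performing the methods of the invention were determined using one of the methods available in the art, the MatGAT (Matrix Global Alignment Tool) software (BMC Bioinformatics. 2003 4:29. MatGAT: an application that generates similarity/identity matrices using protein or DNA sequences. Campanella JJ, Bitincka L, Smalley J; software hosted by Ledion Bitincka). MatGAT software generates similarity/identity matrices for DNA or protein sequences without needing pre-alignment of the data. The program performs a series of pairwise alignments using the Myers and Miller global alignment algorithm (with a gap opening penalty of 12 and a gap extension penalty of 2), calculates similarity and identity using, for example, Blosum 62 (for polypeptides), and then places the results in a distance matrix. Sequence similarity is shown in the bottom half of the dividing line and sequence identity is shown in the top half of the diagonal dividing line.
Parameters used in the comparison were:
Scoring matrix: Blosum62
First gap: 12
Extending gap: 2
Results of the software analysis are shown in Table B1 for the global similarity and identity over the full length of the polypeptide sequences. Percentage identity is given above the diagonal (in bold) and percentage similarity is given below the diagonal (normal face).
The percentage identity between the PATL polypeptide sequences useful in performing the methods of the invention can be as low as 23.7% amino acid identity compared to the Orysa_PATL_1 polypeptide (SEQ ID NO: 2).
Table B1: MatGAT results for global similarity and identity over the full length of the polypeptide sequences.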
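For the PRP38 polypeptides, the global similarity over the full length of polypeptide sequences selected from Table A2, compared to the Arath_PRP38_1 polypeptide (Table A2), is shown in Table B2.
Table B2: Sequence similarity among PRP38 polypeptides
PRP38 polypeptide | % sequence similarity to SEQ ID NO: 77 |
Arath_PRP38_5 | 23.6 |
brasy_PRP38_1 | 63.5 |
brasy_PRP38_2 | 32.1 |
Chlre_PRP38_1 | 40.6 |
Horvu_PRP38_1 | 56.1 |
Lyces_PRP38_1 | 60.7 |
Medtr_PRP38_1 | 25.4 |
Orysa_PRP38_1 | 60.9 |
Orysa_PRP38_2 | 25.9 |
The percentage identity between the PRP38 polypeptide sequences of Table B2 useful in performing the methods of the invention can be as low as 23.6% amino acid identity compared to SEQ ID NO: 77.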
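For the GATA-like proteins, parameters used in the comparison were:
Scoring matrix: Blosum62
First gap: 11
Extending gap: 1
Results of the software analysis are shown in Table B3 for the global similarity and identity over the full length of the polypeptide sequences. Percentage identity is given above the diagonal (in bold) and percentage similarity is given below the diagonal (normal face).
The percentage identity between the GATA-like protein sequences useful in performing the methods of the invention can be as low as 14% amino acid identity compared to SEQ ID NO: 129.
For the ADA2 polypeptides, parameters used in the comparison were:
Scoring matrix: Blosum62
First gap: 12
Extending gap: 2
Results of the software analysis are shown in Table B4 for the global similarity and identity over the full length of the polypeptide sequences. Percentage identity is given above the diagonal (in bold) and percentage similarity is given below the diagonal (normal face).
The percentage identity between the ADA2 polypeptide sequences useful in performing the methods of the invention can be as low as 21.2% amino acid identity compared to the Arath_ADA2_1 polypeptide (SEQ ID NO: 182).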
Table B4: MatGAT results for global similarity and identity among ADA2 polypeptides over the full length of the polypeptide sequences (identity above the diagonal, similarity below; "-" marks the diagonal).
Name | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 |
1.Lyces_ADA2_2 | - | 55.3 | 21.2 | 52.7 | 49.2 | 57.3 | 57.1 | 61.6 | 56.7 | 57.3 | 58.6 | 57.9 | 33.5 |
2.Lyces_ADA2_1 | 73.2 | - | 19.7 | 57.0 | 46.1 | 54.5 | 53.8 | 54.0 | 65.0 | 65.4 | 52.4 | 54.6 | 33.1 |
3.Dicdi_ADA2_1 | 33.2 | 32.5 | - | 19.9 | 21.2 | 22.6 | 22.4 | 22.7 | 19.6 | 19.9 | 23.3 | 22.5 | 20.4 |
4.Arath_ADA2_2 | 69.1 | 71.6 | 31.3 | - | 45.2 | 50.8 | 51.1 | 50.4 | 59.4 | 61.0 | 51.2 | 51.3 | 35.2 |
5.Arath_ADA2_1 | 68.9 | 65.3 | 32.7 | 63.1 | - | 49.2 | 48.9 | 52.0 | 47.8 | 48.2 | 54.9 | 49.3 | 33.7 |
6.Zeama_ADA_2 | 72.0 | 69.6 | 35.4 | 66.2 | 65.8 | - | 94.7 | 58.4 | 58.0 | 58.1 | 54.0 | 87.7 | 34.3 |
7.Zeama_ADA_1 | 72.7 | 69.4 | 34.6 | 66.5 | 66.7 | 97.0 | - | 58.4 | 58.3 | 57.9 | 54.0 | 88.9 | 35.0 |
8.Vitvi_ADA2_1 | 78.4 | 68.4 | 34.9 | 66.3 | 67.2 | 74.7 | 74.3 | - | 59.1 | 59.6 | 62.3 | 59.2 | 33.1 |
9.Poptr_ADA2_1 | 73.9 | 77.5 | 32.5 | 72.0 | 68.6 | 72.9 | 73.3 | 74.3 | - | 87.9 | 55.1 | 58.0 | 33.3 |
10.Poptr_ADA2_3 | 74.5 | 78.0 | 32.6 | 73.5 | 66.5 | 73.6 | 73.8 | 75.0 | 93.8 | - | 54.5 | 56.9 | 32.8 |
11.Poptr_ADA2_2 | 76.3 | 69.8 | 34.2 | 69.3 | 71.0 | 70.3 | 70.1 | 75.0 | 71.3 | 71.9 | - | 55.2 | 33.2 |
12.Orysa_ADA2_1 | 72.5 | 68.6 | 34.5 | 66.8 | 66.5 | 93.7 | 94.4 | 75.4 | 73.4 | 73.4 | 70.4 | - | 35.8 |
13.Ostlu_ADA2_1 | 53.0 | 54.7 | 34.4 | 52.8 | 53.5 | 54.2 | 54.2 | 52.2 | 52.3 | 51.5 | 55.6 | 53.8 | - |
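How a MatGAT-style matrix is read (identity above the diagonal, similarity below) can be sketched in Python. The snippet below is only an illustration: it copies the first five rows and columns of Table B4, and the helper names are hypothetical and not part of MatGAT.

```python
# First five sequences of Table B4; None marks the diagonal.
NAMES = ["Lyces_ADA2_2", "Lyces_ADA2_1", "Dicdi_ADA2_1",
         "Arath_ADA2_2", "Arath_ADA2_1"]
MATRIX = [
    [None, 55.3, 21.2, 52.7, 49.2],
    [73.2, None, 19.7, 57.0, 46.1],
    [33.2, 32.5, None, 19.9, 21.2],
    [69.1, 71.6, 31.3, None, 45.2],
    [68.9, 65.3, 32.7, 63.1, None],
]


def identity(a: str, b: str) -> float:
    """Percentage identity: the cell above the diagonal."""
    i, j = sorted((NAMES.index(a), NAMES.index(b)))
    return MATRIX[i][j]


def similarity(a: str, b: str) -> float:
    """Percentage similarity: the cell below the diagonal."""
    i, j = sorted((NAMES.index(a), NAMES.index(b)))
    return MATRIX[j][i]


def lowest_identity_to(reference: str):
    """Return (name, identity) of the sequence least identical to reference."""
    others = [n for n in NAMES if n != reference]
    worst = min(others, key=lambda n: identity(reference, n))
    return worst, identity(reference, worst)
```

With this subset, `lowest_identity_to("Arath_ADA2_1")` picks out Dicdi_ADA2_1, which is the pair behind the 21.2% figure quoted for the ADA2 polypeptides.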
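For the WDR23-like proteins, parameters used in the comparison were:
Scoring matrix: Blosum62
First gap: 12
Extending gap: 2
Results of the software analysis are shown in Table B5 for the global similarity and identity over the full length of the polypeptide sequences (excluding the partial polypeptide sequences).
The same analysis was performed between the conserved domain (CD) as represented by SEQ ID NO: 271 (comprised in SEQ ID NO: 216) and the conserved domains of the full-length polypeptide sequences of Table A5 (as indicated in Fig. 24); the results are shown in Table B5.1.
The percentage identity between the full-length polypeptide sequences useful in performing the methods of the invention can be as low as 54% amino acid identity compared to SEQ ID NO: 216.
The percentage identity between the conserved domain (CD) as represented by SEQ ID NO: 271 (comprised in SEQ ID NO: 216) and the conserved domains of the polypeptide sequences of Table A5 (as indicated in Fig. 24) increases to 69% amino acid identity, as shown in Table B5.1.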
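Example 4: Identification of domains comprised in polypeptide sequences useful in performing the methods of the invention
Conserved protein domains were identified by searching the InterPro database. The Integrated Resource of Protein Families, Domains and Sites (InterPro) database is an integrated interface for the commonly used signature databases for text- and sequence-based searches. The InterPro database combines these databases, which use different methodologies and varying degrees of biological information about well-characterized proteins to derive protein signatures. Collaborating databases include SWISS-PROT, PROSITE, TrEMBL, Panther, PRINTS, ProDom and Pfam, Smart and TIGRFAM. Pfam is a large collection of multiple sequence alignments and hidden Markov models (HMMs) covering many common protein domains and families. Pfam is hosted at the Sanger Institute server in the United Kingdom. InterPro is hosted at the European Bioinformatics Institute in the United Kingdom.
The results of the InterPro scan of the polypeptide sequence as represented by SEQ ID NO: 2 are presented in Table C1.
Table C1: InterPro scan results (major accession numbers) of the polypeptide sequence as represented by SEQ ID NO: 2. Abbreviations of the databases in the database accession numbers: PF: Pfam; PS: Prosite; SM: Smart; SSF: Superfamily.
The results of the Pfam scan of the polypeptide sequence as represented by SEQ ID NO: 77 are presented in Table C2.
Table C2: Pfam scan results (major accession numbers) of the polypeptide sequences of Table A2. The domains (domain names) are given as delimited by their amino acid coordinates in the scanned (query) polypeptide. The e-value of the alignment of the query polypeptide to the Pfam entry hit is given.
The results of the InterPro scan of the polypeptide sequence as represented by SEQ ID NO: 129 are presented in Table C3.
Table C3: InterPro scan results (major accession numbers) of the polypeptide sequence as represented by SEQ ID NO: 129.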
Database | Accession number | Accession name | Amino acid coordinates on SEQ ID NO: 129 |
InterPro | IPR000679 | Zn-finger, GATA type | |
HMMPfam | PF00320 | GATA | T[178-213] 1.9E-14 |
HMMSmart | SM00401 | ZnF_GATA | T[172-223] 1.1E-16 |
ProfileScan | PS50114 | GATA_ZN_FINGER_2 | T[176-208] 12.268 |
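The results of the Pfam scan of the polypeptide sequence as represented by SEQ ID NO: 182 are presented in Table C4.
Table C4: Pfam scan results (major accession numbers) of the polypeptide sequence as represented by SEQ ID NO: 182, a polypeptide of Table A4. The domains (domain names) are given as delimited by their amino acid coordinates in the scanned (query) polypeptide. The e-value of the alignment of the query polypeptide to the Pfam entry hit is given.
The results of the InterPro scan of the polypeptide sequence as represented by SEQ ID NO: 216 are presented in Table C5.
Table C5: InterPro scan results of the polypeptide sequence as represented by SEQ ID NO: 216.
InterPro accession number and name | Integrated database name | Integrated database accession number | Integrated database accession name |
IPR001680 WD40 repeat | PFAM | PF00400 | WD40 |
IPR001680 WD40 repeat | Smart | SM00320 | WD40 |
IPR001680 WD40 repeat | ProfileScan | PS50082 | WD_repeats_2 |
IPR001680 WD40 repeat | ProfileScan | PS50294 | WD_REPEATS_REGION |
IPR001680 WD40 repeat | FPrintScan | PR00320 | GPROTEINBRPT |
IPR011046 WD40 repeat-like | SuperFamily | SSF50978 | WD40_like |
IPR015943 WD40/YVTN repeat-like | Gene3D | G3DSA:2.130.10.10 | WD40/YVTN repeat-like |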
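Example 5: Cloning of the nucleic acid sequence used in the methods of the invention
The nucleic acid sequence used in the methods of the invention was amplified by PCR using as template a custom-made Arabidopsis thaliana seedlings cDNA library (in pCMV Sport 6.0; Invitrogen, Paisley, UK). PCR was performed using Hifi Taq DNA polymerase in standard conditions, using 200 ng of template in a 50 µl PCR mix.
For PATL, the primers used were SEQ ID NO: 73: 5'-ggggacaagtttgtacaaaaaagcaggcttaaacaatggcggaggagccac-3' and SEQ ID NO: 74: 5'-ggggaccactttgtacaagaaagctgggtgtggtgaatctggtgatcagg-3', which include the AttB sites for Gateway recombination. The amplified PCR fragment was purified, also using standard methods. The first step of the Gateway procedure, the BP reaction, was then performed, during which the PCR fragment recombines in vivo with the pDONR201 plasmid to produce, according to the Gateway terminology, an "entry clone", pOrysa_PATL_1. Plasmid pDONR201 was purchased from Invitrogen, as part of the Gateway technology.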
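The structure of these Gateway primers (an attB adapter followed by a gene-specific tail) can be checked with a short Python sketch. The adapter strings below are simply the 29-nucleotide prefixes shared by the forward and reverse primers listed in this example; the function names and the 12-nucleotide minimum match are assumptions made for illustration.

```python
# Shared 5' prefixes of the forward (attB1) and reverse (attB2) primers.
# attB1 is written in pieces so the run of six a's is easy to check.
ATTB_ADAPTERS = {
    "attB1": "ggggacaagtttgtac" + "a" * 6 + "gcaggct",
    "attB2": "ggggaccactttgtacaagaaagctgggt",
}


def split_attb_primer(primer: str):
    """Return (adapter name, gene-specific tail) for a Gateway primer."""
    seq = primer.lower().replace(" ", "").replace("-", "")
    for name, adapter in ATTB_ADAPTERS.items():
        if seq.startswith(adapter):
            return name, seq[len(adapter):]
    raise ValueError("primer does not start with a known attB adapter")


def anneals_at_start(tail: str, template: str, min_len: int = 12) -> bool:
    """True if a suffix of the tail (>= min_len nt) matches the template 5' end."""
    template = template.lower()
    for start in range(len(tail) - min_len + 1):
        if template.startswith(tail[start:]):
            return True
    return False
```

Applied to the PATL forward primer (SEQ ID NO: 73), the tail after the attB1 adapter is `taaacaatggcggaggagccac`, whose last 16 nucleotides coincide with the first 16 nucleotides of SEQ ID NO: 1 (`atggcggaggagccac...`). The sketch does not check reverse primers against the template, which would require the reverse complement and the downstream sequence.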
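The entry clone comprising SEQ ID NO: 1 was then used in an LR reaction with two destination vectors used for Oryza sativa transformation. The first vector contained as functional elements within the T-DNA borders: a plant selectable marker; a screenable marker expression cassette; and a Gateway cassette intended for LR in vivo recombination with the nucleic acid sequence of interest already cloned in the entry clone. A rice GOS2 promoter (SEQ ID NO: 75) for constitutive expression was located upstream of this Gateway cassette.
After the LR recombination step, the resulting expression vector pGOS2::Orysa_PATL_1 (Fig. 4) was transformed into Agrobacterium strain LBA4404 according to methods well known in the art.
For PRP38, the primers used were SEQ ID NO: 125: 5'-ggggacaagtttgtacaaaaaagcaggcttaaacaatggcggagatacagtcaaa-3' and SEQ ID NO: 126: 5'-ggggaccactttgtacaagaaagctgggttcacctccaagaggaacca-3', which include the AttB sites for Gateway recombination. The amplified PCR fragment was purified, also using standard methods. The first step of the Gateway procedure, the BP reaction, was then performed, during which the PCR fragment recombines in vivo with the pDONR201 plasmid to produce, according to the Gateway terminology, an "entry clone", pPRP38. Plasmid pDONR201 was purchased from Invitrogen, as part of the Gateway technology.
The entry clone comprising SEQ ID NO: 76 was then used in an LR reaction with a destination vector used for Oryza sativa transformation. This vector contained as functional elements within the T-DNA borders: a plant selectable marker; a screenable marker expression cassette; and a Gateway cassette intended for LR in vivo recombination with the nucleic acid sequence of interest already cloned in the entry clone. A rice GOS2 promoter (SEQ ID NO: 127) for constitutive expression was located upstream of this Gateway cassette.
After the LR recombination step, the resulting expression vector pGOS2::PRP38 (Fig. 9) was transformed into Agrobacterium strain LBA4404 according to methods well known in the art.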
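For GATA, the primers used were prm10133 (SEQ ID NO: 133; sense, start codon in bold):
5'-GGGGACCACTTTGTACAAGAAAGCTGGGTAGCTAGCTAGTTTTGATCAGC-3', which include the AttB sites for Gateway recombination. The amplified PCR fragment was purified, also using standard methods. The first step of the Gateway procedure, the BP reaction, was then performed, during which the PCR fragment recombines in vivo with the pDONR201 plasmid to produce, according to the Gateway terminology, an "entry clone", pGATA-like. Plasmid pDONR201 was purchased from Invitrogen, as part of the Gateway technology.
The entry clone comprising SEQ ID NO: 128 was then used in an LR reaction with a destination vector used for Oryza sativa transformation. This vector contained as functional elements within the T-DNA borders: a plant selectable marker; a screenable marker expression cassette; and a Gateway cassette intended for LR in vivo recombination with the nucleic acid sequence of interest already cloned in the entry clone. A rice GOS2 promoter (SEQ ID NO: 135) for constitutive expression was located upstream of this Gateway cassette.
After the LR recombination step, the resulting expression vector pGOS2::GATA-like (Fig. 13) was transformed into Agrobacterium strain LBA4404 according to methods well known in the art.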
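The nucleic acid sequence SEQ ID NO: 177 used in the methods of the invention was amplified by PCR using as template a custom-made Oryza sativa seedlings cDNA library (in pCMV Sport 6.0; Invitrogen, Paisley, UK). PCR was performed using Hifi Taq DNA polymerase in standard conditions, using 200 ng of template in a 50 µl PCR mix. The primers used were prm10106 (SEQ ID NO: 179; sense, start codon in bold): 5'-ggggacaagtttgtacaaaaaagcaggcttaaacaatgcttcaccattactacagc-3' and prm10107 (SEQ ID NO: 180; reverse, complementary): 5'-ggggaccactttgtacaagaaagctgggtccaacgctaatgctacact-3', which include the AttB sites for Gateway recombination. The amplified PCR fragment was purified, also using standard methods. The further cloning steps were as described above.
For ADA2, the primers used were SEQ ID NO: 211: 5'-GGGGACAAGTTTGTACAAAAAAGCAGGCTTAAACAATGGGTCGTTCGAAACTAGC-3' and SEQ ID NO: 212: 5'-GGGGACCACTTTGTACAAGAAAGCTGGGTCATGTTAGGACCATGAAGCTATG-3', which include the AttB sites for Gateway recombination. The amplified PCR fragment was purified, also using standard methods. The first step of the Gateway procedure, the BP reaction, was then performed, during which the PCR fragment recombines in vivo with the pDONR201 plasmid to produce, according to the Gateway terminology, an "entry clone", pAtADA2_1. Plasmid pDONR201 was purchased from Invitrogen, as part of the Gateway technology.
The entry clone comprising SEQ ID NO: 181 was then used in an LR reaction with two destination vectors used for Oryza sativa transformation. The first vector contained as functional elements within the T-DNA borders: a plant selectable marker; a screenable marker expression cassette; and a Gateway cassette intended for LR in vivo recombination with the nucleic acid sequence of interest already cloned in the entry clone. A rice HMGP promoter (SEQ ID NO: 213) for constitutive expression was located upstream of this Gateway cassette. The second vector contained the same functional elements within the T-DNA borders as described above, with a rice EXP9 promoter (SEQ ID NO: 214) located upstream of the Gateway cassette.
After the LR recombination step, the resulting expression vector pHMG::Arath_ADA2_1 (Fig. 18) was transformed into Agrobacterium strain LBA4404 according to methods well known in the art.
Unless otherwise stated, recombinant DNA techniques are performed according to standard protocols described in Sambrook (2001) Molecular Cloning: a laboratory manual, 3rd Edition, Cold Spring Harbor Laboratory Press, CSH, New York, or in Volumes 1 and 2 of Ausubel et al. (1994), Current Protocols in Molecular Biology, Current Protocols. Standard materials and methods for plant molecular work are described in Plant Molecular Biology Labfax (1993) by R.D.D. Croy, published by BIOS Scientific Publications Ltd (UK) and Blackwell Scientific Publications (UK).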
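For the WDR23-like polypeptide, the Arabidopsis thaliana cDNA encoding the WDR23-like polypeptide sequence as represented by SEQ ID NO: 216 was amplified by PCR using as template cDNA synthesized from mRNA extracted from different tissues of Arabidopsis thaliana grown under different conditions. The following primers, which include the AttB sites for Gateway recombination, were used for PCR amplification:
(v) Prm 09100 (SEQ ID NO: 274, sense):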
5’-GGGGACAAGTTTGTACAAAAAAGCAGGCTTAAACAATGTTTTTTGGACCAAGTGAG-3’
(vi) Prm 09101 (SEQ ID NO: 275, reverse, complementary):
5’-GGGGACCACTTTGTACAAGAAAGCTGGGTTGTGTAGAGAGACGCATCAGT-3’
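PCR was performed using Hifi Taq DNA polymerase in standard conditions. A PCR fragment of the expected length (including the attB sites) was amplified and purified using standard methods. The first step of the Gateway procedure, the BP reaction, was then performed, during which the PCR fragment recombines in vivo with the pDONR201 plasmid to produce, according to the Gateway terminology, an "entry clone". Plasmid pDONR201 was purchased from Invitrogen, as part of the Gateway technology.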
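Example 5A: Topology prediction of the polypeptide sequences useful in performing the methods of the invention
TargetP 1.1 predicts the subcellular location of eukaryotic proteins. The location assignment is based on the predicted presence of any of the N-terminal pre-sequences: chloroplast transit peptide (cTP), mitochondrial targeting peptide (mTP) or secretory pathway signal peptide (SP). Scores on which the final prediction is based are not really probabilities, and they do not necessarily add to one. However, the location with the highest score is the most likely according to TargetP, and the relationship between the scores (the reliability class) may be an indication of how certain the prediction is. The reliability class (RC) ranges from 1 to 5, where 1 indicates the strongest prediction. TargetP is maintained at the server of the Technical University of Denmark.
For the sequences predicted to contain an N-terminal pre-sequence, a potential cleavage site can also be predicted.
A number of parameters can be selected, such as organism group (non-plant or plant), cutoff sets (none, predefined set of cutoffs, or user-specified set of cutoffs), and the calculation of prediction of cleavage sites (yes or no).
The results of the TargetP 1.1 analysis of the polypeptide sequence as represented by SEQ ID NO: 129 are presented in Table D3. The "plant" organism group was selected, no cutoffs were defined, and the predicted length of the transit peptide was requested. The subcellular localization of the polypeptide sequence as represented by SEQ ID NO: 129 may be the cytoplasm or the nucleus; no transit peptide is predicted.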
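Table D3: TargetP 1.1 analysis of the polypeptide sequence as represented by SEQ ID NO: 129
Length (AA) | 353 |
Chloroplast transit peptide | 0.067 |
Mitochondrial transit peptide | 0.169 |
Secretory pathway signal peptide | 0.186 |
Other subcellular targeting | 0.804 |
Predicted location | / |
Reliability class | 2 |
Predicted transit peptide length | / |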
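How the highest score and the reliability class relate to each other can be sketched in Python using the scores of Table D3. The class boundaries below (difference between the two highest scores above 0.8, 0.6, 0.4 or 0.2) are an assumption taken from the commonly cited TargetP 1.1 output description, not from this specification, and the function name is hypothetical.

```python
def targetp_call(scores: dict):
    """Return (location with the highest score, reliability class 1-5)."""
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    (best_name, best_score), (_, second_score) = ranked[0], ranked[1]
    diff = best_score - second_score  # gap between winner and runner-up
    # Assumed class boundaries: a larger gap means a more reliable call.
    if diff > 0.8:
        reliability = 1
    elif diff > 0.6:
        reliability = 2
    elif diff > 0.4:
        reliability = 3
    elif diff > 0.2:
        reliability = 4
    else:
        reliability = 5
    return best_name, reliability


# Scores reported in Table D3 for SEQ ID NO: 129.
TABLE_D3 = {"cTP": 0.067, "mTP": 0.169, "SP": 0.186, "other": 0.804}
```

Under these assumed boundaries, the Table D3 scores give "other" as the winning location with a gap of about 0.62 to the signal peptide score, which corresponds to reliability class 2 as listed in the table.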
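Many other algorithms can be used to perform such analyses, including:
· ChloroP 1.1, hosted on the server of the Technical University of Denmark;
· Protein Prowler Subcellular Localisation Predictor version 1.2, hosted on the server of the Institute for Molecular Bioscience, University of Queensland, Brisbane, Australia;
· PENCE Proteome Analyst PA-GOSUB 2.5, hosted on the server of the University of Alberta, Edmonton, Alberta, Canada;
· TMHMM, hosted on the server of the Technical University of Denmark.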
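Example 6: Plant transformation
Rice transformation
The Agrobacterium containing the expression vector was used to transform Oryza sativa plants. Mature dry seeds of the rice japonica cultivar Nipponbare were dehusked. Sterilization was carried out by incubating for one minute in 70% ethanol, followed by 30 minutes in 0.2% HgCl2, followed by six 15-minute washes with sterile distilled water. The sterile seeds were then germinated on a medium containing 2,4-D (callus induction medium). After incubation in the dark for four weeks, embryogenic, scutellum-derived calli were excised and propagated on the same medium. After two weeks, the calli were multiplied or propagated by subculture on the same medium for another 2 weeks. Embryogenic callus pieces were sub-cultured on fresh medium 3 days before co-cultivation (to boost cell division activity).
Agrobacterium strain LBA4404 containing the expression vector was used for co-cultivation. Agrobacterium was inoculated on AB medium with the appropriate antibiotics and cultured for 3 days at 28°C. The bacteria were then collected and suspended in liquid co-cultivation medium to a density (OD600) of about 1. The suspension was then transferred to a Petri dish and the calli were immersed in the suspension for 15 minutes. The callus tissues were then blotted dry on a filter paper, transferred to solidified co-cultivation medium and incubated for 3 days in the dark at 25°C. Co-cultivated calli were grown on 2,4-D-containing medium for 4 weeks in the dark at 28°C in the presence of a selection agent. During this period, rapidly growing resistant callus islands developed. After transfer of this material to a regeneration medium and incubation in the light, the embryogenic potential was released and shoots developed in the next four to five weeks. Shoots were excised from the calli and incubated for 2 to 3 weeks on an auxin-containing medium, from which they were transferred to soil. Hardened shoots were grown under high humidity and short days in a greenhouse.
Approximately 35 independent T0 rice transformants were generated for one construct. The primary transformants were transferred from a tissue culture chamber to a greenhouse. After a quantitative PCR analysis to verify the copy number of the T-DNA insert, only single-copy transgenic plants that exhibited tolerance to the selection agent were kept for harvest of T1 seed. Seeds were then harvested three to five months after transplanting. The method yielded single-locus transformants at a rate of over 50% (Aldemita and Hodges 1996, Chan et al. 1993, Hiei et al. 1994).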
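Corn transformation
Transformation of maize (Zea mays) is performed with a modification of the method described by Ishida et al. (1996) Nature Biotech 14(6):745-50. Transformation is genotype-dependent in corn and only specific genotypes are amenable to transformation and regeneration. The inbred line A188 (University of Minnesota) or hybrids with A188 as a parent are good sources of donor material for transformation, but other genotypes can be used successfully as well. Ears are harvested from corn plants approximately 11 days after pollination (DAP), when the length of the immature embryo is about 1 to 1.2 mm. Immature embryos are co-cultivated with Agrobacterium tumefaciens containing the expression vector, and transgenic plants are recovered through organogenesis. Excised embryos are grown on callus induction medium, then on maize regeneration medium, containing the selection agent (for example imidazolinone, but various selection markers can be used). The Petri plates are incubated in the light at 25°C for 2-3 weeks, or until shoots develop. The green shoots from each embryo are transferred to maize rooting medium and incubated at 25°C for 2-3 weeks, until roots develop. The rooted shoots are transplanted to soil in the greenhouse. T1 seeds are produced from plants that exhibit tolerance to the selection agent and that contain a single copy of the T-DNA insert.
Wheat transformation
Transformation of wheat is performed with the method described by Ishida et al. (1996) Nature Biotech 14(6):745-50. The cultivar Bobwhite (available from CIMMYT, Mexico) is commonly used in transformation. Immature embryos are co-cultivated with Agrobacterium tumefaciens containing the expression vector, and transgenic plants are recovered through organogenesis. After incubation with Agrobacterium, the embryos are grown in vitro on callus induction medium, then on regeneration medium, containing the selection agent (for example imidazolinone, but various selection markers can be used). The Petri plates are incubated in the light at 25°C for 2-3 weeks, or until shoots develop. The green shoots from each embryo are transferred to rooting medium and incubated at 25°C for 2-3 weeks, until roots develop. The rooted shoots are transplanted to soil in the greenhouse. T1 seeds are produced from plants that exhibit tolerance to the selection agent and that contain a single copy of the T-DNA insert.
Soybean transformation
Soybean is transformed according to a modification of the method described in the Texas A&M patent US 5,164,310. Several commercial soybean varieties are amenable to transformation by this method. The cultivar Jack (available from the Illinois Seed Foundation) is commonly used for transformation. Soybean seeds are sterilised for in vitro sowing. The hypocotyl, the radicle and one cotyledon are excised from seven-day-old young seedlings. The epicotyl and the remaining cotyledon are further grown to develop axillary nodes. These axillary nodes are excised and incubated with Agrobacterium tumefaciens containing the expression vector. After the co-cultivation treatment, the explants are washed and transferred to selection media. Regenerated shoots are excised and placed on a shoot elongation medium. Shoots no longer than 1 cm are placed on rooting medium until roots develop. The rooted shoots are transplanted to soil in the greenhouse. T1 seeds are produced from plants that exhibit tolerance to the selection agent and that contain a single copy of the T-DNA insert.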
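Rapeseed/canola transformation
Cotyledonary petioles and hypocotyls of 5-6 day old young seedlings are used as explants for tissue culture and transformed according to Babic et al. (1998, Plant Cell Rep 17:183-188). The commercial cultivar Westar (Agriculture Canada) is the standard variety used for transformation, but other varieties can also be used. Canola seeds are surface-sterilized for in vitro sowing. The cotyledon petiole explants with the cotyledon attached are excised from the in vitro seedlings and inoculated with Agrobacterium (containing the expression vector) by dipping the cut end of the petiole explant into the bacterial suspension. The explants are then cultured for 2 days on MSBAP-3 medium containing 3 mg/l BAP, 3% sucrose and 0.7% Phytagar at 23°C, 16 hours light. After two days of co-cultivation with Agrobacterium, the petiole explants are transferred to MSBAP-3 medium containing 3 mg/l BAP, cefotaxime, carbenicillin or timentin (300 mg/l) for 7 days, and then cultured on MSBAP-3 medium with cefotaxime, carbenicillin or timentin and the selection agent until shoot regeneration. When the shoots are 5-10 mm in length, they are cut and transferred to shoot elongation medium (MSBAP-0.5, containing 0.5 mg/l BAP). Shoots of about 2 cm in length are transferred to the rooting medium (MS0) for root induction. The rooted shoots are transplanted to soil in the greenhouse. T1 seeds are produced from plants that exhibit tolerance to the selection agent and that contain a single copy of the T-DNA insert.
Alfalfa transformation
A regenerating clone of alfalfa is transformed using the method of McKersie et al. (1999, Plant Physiol 119:839-847). Regeneration and transformation of alfalfa is genotype-dependent and therefore a regenerating plant is required. Methods to obtain regenerating plants have been described. For example, these can be selected from the cultivar Rangelander (Agriculture Canada) or any other commercial alfalfa variety as described by Brown DCW and A Atanassov (1985. Plant Cell Tissue Culture 4:111-112). Alternatively, the RA3 variety (University of Wisconsin) has been selected for use in tissue culture (Walker et al., 1978 Am J Bot 65:654-659). Petiole explants are co-cultivated with an overnight culture of Agrobacterium tumefaciens C58C1 pMP90 (McKersie et al., 1999 Plant Physiol 119:839-847) or LBA4404 containing the expression vector. The explants are co-cultivated for 3 days in the dark on SH induction medium containing 288 mg/L Pro, 53 mg/L thioproline, 4.35 g/L K2SO4 and 100 µM acetosyringone. The explants are washed in half-strength Murashige-Skoog medium (Murashige and Skoog, 1962) and plated on the same SH induction medium without acetosyringone but with a suitable selection agent and a suitable antibiotic to inhibit Agrobacterium growth. After several weeks, somatic embryos are transferred to BOi2Y development medium containing no growth regulators, no antibiotics, and 50 g/L sucrose. Somatic embryos are subsequently germinated on half-strength Murashige-Skoog medium. Rooted seedlings are transplanted into pots and grown in a greenhouse. T1 seeds are produced from plants that exhibit tolerance to the selection agent and that contain a single copy of the T-DNA insert.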
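Cotton transformation
Cotton is transformed using Agrobacterium tumefaciens according to the method described in US 5,159,135. Cotton seeds are surface-sterilised in 3% sodium hypochlorite solution for 20 minutes and washed in distilled water with 500 µg/ml cefotaxime. The seeds are then transferred to SH medium with 50 µg/ml benomyl for germination. Hypocotyls of 4 to 6 day old seedlings are removed, cut into 0.5 cm pieces and placed on 0.8% agar. An Agrobacterium suspension (approximately 10^8 cells per ml, diluted from an overnight culture transformed with the gene of interest and suitable selection markers) is used for inoculation of the hypocotyl explants. After 3 days at room temperature and lighting, the tissues are transferred to a solid medium (1.6 g/l Gelrite) with Murashige and Skoog salts with B5 vitamins (Gamborg et al., Exp. Cell Res. 50:151-158 (1968)), 0.1 mg/l 2,4-D, 0.1 mg/l 6-furfurylaminopurine and 750 µg/ml MgCl2, and with 50 to 100 µg/ml cefotaxime and 400-500 µg/ml carbenicillin to kill residual bacteria. Individual cell lines are isolated after two to three months (with subcultures every four to six weeks) and are further cultivated on selective medium for tissue amplification (30°C, 16 hour photoperiod). Transformed tissues are subsequently further cultivated on non-selective medium for 2 to 3 months to give rise to somatic embryos. Healthy-looking embryos of at least 4 mm length are transferred to tubes with SH medium in fine vermiculite, supplemented with 0.1 mg/l indole acetic acid, 6-furfurylaminopurine and gibberellic acid. The embryos are cultivated at 30°C with a photoperiod of 16 hours, and plantlets at the 2 to 3 leaf stage are transferred to pots with vermiculite and nutrients. The plants are hardened and subsequently moved to the greenhouse for further cultivation.
For the WDR23-like gene, transformation of cotton (Gossypium hirsutum) is performed using Agrobacterium tumefaciens on hypocotyl explants. Commercial cultivars such as Coker 130 or Coker 312 (SeedCo, Lubbock, TX) are standard varieties used for transformation, but other varieties can also be used. The seeds are surface-sterilized and germinated in the dark. Hypocotyl explants are cut from the germinated seedlings to lengths of about 1-1.5 cm. The hypocotyl explants are submerged in the Agrobacterium tumefaciens inoculum containing the expression vector for 5 minutes, then co-cultivated for about 48 hours on MS + 1.8 mg/l KNO3 + 2% glucose at 24°C in the dark. The explants are transferred to the same medium containing appropriate bacterial and plant selectable markers (renewed several times) until embryogenic calli are seen. The calli are separated and subcultured until somatic embryos appear. Plantlets derived from the somatic embryos are matured on rooting medium until roots develop. The rooted shoots are transplanted to potting soil in the greenhouse. T1 seeds are produced from plants that exhibit tolerance to the selection agent and that contain a single copy of the T-DNA insert.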
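Example 6B: Expression vector construction using the nucleic acid sequence as represented by SEQ ID NO: 215
The entry clone comprising SEQ ID NO: 215 was subsequently used in an LR reaction with a destination vector used for Oryza sativa transformation. This vector contained as functional elements within the T-DNA borders: a plant selectable marker; a screenable marker expression cassette; and a Gateway cassette intended for LR in vivo recombination with the nucleic acid sequence of interest already cloned in the entry clone. A rice GOS2 promoter (SEQ ID NO: 272) for constitutive expression was located upstream of this Gateway cassette. A second destination vector for Oryza sativa transformation was also produced, with a rice metallothionein promoter (MT; SEQ ID NO: 273) for meristem-specific expression.
After the LR recombination step, the resulting expression vectors pGOS2::WDR23-like and pMT::WDR23-like (Fig. 25) were independently transformed into Agrobacterium strain LBA4404 according to methods well known in the art.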
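Example 7: Phenotypic evaluation procedure
7.1 Evaluation setup
Approximately 35 independent T0 rice transformants were generated. The primary transformants were transferred from a tissue culture chamber to a greenhouse for growing and harvest of T1 seed. Six events, of which the T1 progeny segregated 3:1 for presence/absence of the transgene, were retained. For each of these events, approximately 10 T1 seedlings containing the transgene (hetero- and homozygotes) and approximately 10 T1 seedlings lacking the transgene (nullizygotes) were selected by monitoring visual marker expression. The transgenic plants and the corresponding nullizygotes were grown side-by-side at random positions. Greenhouse conditions (non-stress conditions) were short days (12 hours light), 28°C in the light and 22°C in the dark, and a relative humidity of 70%. Plants were watered frequently to satisfy the water and nutrient needs for growth and a healthy appearance.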
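One common way to check whether T1 progeny counts are compatible with a 3:1 segregation is a chi-square goodness-of-fit test. The specification does not say which test was applied, so the Python sketch below is only an assumed illustration, with hypothetical function names and the standard 5% critical value for one degree of freedom (3.841).

```python
CRITICAL_5PCT_DF1 = 3.841  # chi-square critical value, 1 degree of freedom


def chi_square_3_to_1(n_transgenic: int, n_null: int) -> float:
    """Chi-square statistic for observed counts against a 3:1 expectation."""
    total = n_transgenic + n_null
    if total == 0:
        raise ValueError("no seedlings scored")
    expected_transgenic = 0.75 * total
    expected_null = 0.25 * total
    return ((n_transgenic - expected_transgenic) ** 2 / expected_transgenic
            + (n_null - expected_null) ** 2 / expected_null)


def fits_3_to_1(n_transgenic: int, n_null: int) -> bool:
    """True if the counts do not deviate significantly from 3:1 at the 5% level."""
    return chi_square_3_to_1(n_transgenic, n_null) < CRITICAL_5PCT_DF1
```

With hypothetical counts of 31 marker-positive and 9 marker-negative seedlings, the statistic is about 0.13 and the event would be retained as consistent with a single-locus insertion; 20 and 20 gives about 13.3 and would be rejected.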
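From the stage of sowing until the stage of maturity, the plants were passed several times through a digital imaging cabinet. At each time point, digital images (2048 x 1536 pixels, 16 million colours) of each plant were taken from at least 6 different angles.
Drought screen
Plants from T2 seeds were grown in potting soil under normal conditions until they approached the heading stage. They were then transferred to a "dry" section where irrigation was withheld. Humidity probes were inserted in randomly chosen pots to monitor the soil water content (SWC). When the SWC fell below a certain threshold, the plants were automatically re-watered continuously until a normal level was reached again. The plants were then re-transferred to normal conditions. The rest of the cultivation (plant maturation, seed harvest) was the same as for plants not grown under abiotic stress conditions. Growth and yield parameters were recorded as detailed for growth under normal conditions.
Nitrogen use efficiency screen
Rice plants from T2 seeds were grown in potting soil under normal conditions except for the nutrient solution. The pots were watered from transplantation to maturation with a specific nutrient solution containing reduced nitrogen (N) content, usually between 7 and 8 times less. The rest of the cultivation (plant maturation, seed harvest) was the same as for plants not grown under abiotic stress. Growth and yield parameters were recorded as detailed for growth under normal conditions.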
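7.2 Statistical analysis: F-test
A two-factor ANOVA (analysis of variance) was used as a statistical model for the overall evaluation of plant phenotypic characteristics. An F-test was carried out on all the parameters measured of all the plants of all the events transformed with the gene of the present invention. The F-test was carried out to check for an effect of the gene over all the transformation events and to verify an overall effect of the gene, also known as a global gene effect. The threshold for significance for a true global gene effect was set at a 5% probability level for the F-test. A significant F-test value points to a gene effect, meaning that it is not only the mere presence or position of the gene that is causing the differences in phenotype.
Because two experiments with overlapping events were carried out, a combined analysis was performed. This is useful to check consistency of the effects over the two experiments and, if this is the case, to accumulate evidence from both experiments in order to increase confidence in the conclusion. The method used was a mixed-model approach that takes into account the multilevel structure of the data (i.e. experiment - event - segregants). P-values were obtained by comparing the likelihood ratio test to chi square distributions.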
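The last step, turning a likelihood ratio statistic into a P-value by comparison with a chi-square distribution, can be sketched in Python. The sketch assumes that the two nested models differ by a single parameter (one degree of freedom), which the specification does not state, and the function name is hypothetical. For one degree of freedom the chi-square survival function reduces to erfc(sqrt(x/2)), so only the standard library is needed.

```python
import math


def lrt_p_value(loglik_null: float, loglik_full: float) -> float:
    """P-value of a likelihood ratio test with 1 degree of freedom (assumed).

    loglik_null: maximised log-likelihood of the model without the gene effect.
    loglik_full: maximised log-likelihood of the model with the gene effect.
    """
    statistic = 2.0 * (loglik_full - loglik_null)
    if statistic < 0:
        raise ValueError("the full model cannot fit worse than the nested model")
    # Survival function of the chi-square distribution with 1 degree of freedom.
    return math.erfc(math.sqrt(statistic / 2.0))
```

A statistic of about 3.84 (a log-likelihood gain of about 1.92) corresponds to P = 0.05, the significance threshold used for the F-test above.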
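7.3 Parameters measured
Biomass-related parameter measurement
From the stage of sowing until the stage of maturity, the plants were passed several times through a digital imaging cabinet. At each time point, digital images (2048 x 1536 pixels, 16 million colours) of each plant were taken from at least 6 different angles.
The plant aboveground area (or leafy biomass) was determined by counting the total number of pixels on the digital images from aboveground plant parts discriminated from the background. This value was averaged over the pictures taken at the same time point from the different angles and was converted by calibration to a physical surface value expressed in square mm. Experiments show that the aboveground plant area measured in this way correlates with the biomass of the aboveground plant parts. The aboveground area is the area measured at the time point at which the plant had reached its maximal leafy biomass. Early vigour is the plant (seedling) aboveground area three weeks after germination. An increase in root biomass is expressed as an increase in total root biomass (measured as the maximum biomass of roots observed during the lifespan of a plant), or as an increase in the root/shoot index (measured as the ratio between root mass and shoot mass in the period of active growth of root and shoot).
Early vigour was determined by counting the total number of pixels from aboveground plant parts discriminated from the background. This value was averaged over the pictures taken at the same time point from the different angles and was converted by calibration to a physical surface value expressed in square mm. The results described below are for plants three weeks after germination.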
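The conversion from pixel counts to a physical area described above amounts to averaging over the viewing angles and applying a calibration factor. A minimal Python sketch follows; the function name and the calibration constant are hypothetical, since the actual calibration of the imaging cabinet is not given in the specification.

```python
def aboveground_area_mm2(pixel_counts, mm2_per_pixel: float) -> float:
    """Average plant pixel counts over all angles and convert to square mm.

    pixel_counts: plant (non-background) pixel totals, one per viewing angle.
    mm2_per_pixel: calibration factor of the imaging setup (hypothetical value).
    """
    if not pixel_counts:
        raise ValueError("at least one image is required")
    mean_pixels = sum(pixel_counts) / len(pixel_counts)
    return mean_pixels * mm2_per_pixel
```

For example, six images with 1000, 1200, 1100, 900, 1000 and 800 plant pixels and an assumed calibration of 0.25 mm2 per pixel give an aboveground area of 250 mm2. The same calculation applied to images taken three weeks after germination gives the early vigour value.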
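Seed-related parameter measurements
The mature primary panicles were harvested, counted, bagged, barcode-labelled and then dried for three days in an oven at 37°C. The panicles were then threshed, and all the seeds were collected and counted. The filled husks were separated from the empty ones using an air-blowing device. The empty husks were discarded and the remaining fraction was counted again. The filled husks were weighed on an analytical balance. The number of filled seeds was determined by counting the number of filled husks that remained after the separation step. The total seed yield was measured by weighing all filled husks harvested from a plant. The total seed number per plant was measured by counting the number of husks harvested from a plant. Thousand kernel weight (TKW) is extrapolated from the number of filled seeds counted and their total weight. The harvest index (HI) in the present invention is defined as the ratio between the total seed yield and the aboveground area (mm2), multiplied by a factor of 10^6. The total number of flowers per panicle as defined in the present invention is the ratio between the total number of seeds and the number of mature primary panicles. The seed fill rate as defined in the present invention is the proportion (expressed as a %) of the number of filled seeds over the total number of seeds (or florets).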
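The derived seed parameters follow directly from the counts and weights described above. The Python sketch below restates those definitions; the class and field names are hypothetical, and grams are assumed as the weight unit because the specification does not name one.

```python
from dataclasses import dataclass


@dataclass
class PlantHarvest:
    filled_seed_count: int        # filled husks left after air separation
    total_seed_count: int         # all husks harvested (filled + empty)
    filled_seed_weight_g: float   # weight of all filled husks (assumed grams)
    panicle_count: int            # mature primary panicles
    aboveground_area_mm2: float   # maximum aboveground area from imaging


def seed_parameters(h: PlantHarvest) -> dict:
    """Compute the yield-related parameters defined in section 7.3."""
    return {
        # Total seed yield: weight of all filled husks from one plant.
        "total_seed_yield_g": h.filled_seed_weight_g,
        # TKW: extrapolated weight of 1000 filled seeds.
        "tkw_g": 1000.0 * h.filled_seed_weight_g / h.filled_seed_count,
        # Harvest index: seed yield / aboveground area (mm2) x 10^6.
        "harvest_index": h.filled_seed_weight_g / h.aboveground_area_mm2 * 1e6,
        # Flowers per panicle: total seeds / mature primary panicles.
        "flowers_per_panicle": h.total_seed_count / h.panicle_count,
        # Seed fill rate: filled seeds as a percentage of all seeds.
        "fill_rate_pct": 100.0 * h.filled_seed_count / h.total_seed_count,
    }
```

As an illustration with made-up numbers, a plant with 400 filled seeds out of 500, weighing 10 g in total, carried on 10 panicles and with an aboveground area of 50,000 mm2 has a TKW of 25 g, a harvest index of 200, 50 flowers per panicle and a fill rate of 80%.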
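Salt stress screen
Plants were grown on a substrate made of coco fibres and argex (3:1 ratio). A normal nutrient solution was used during the first two weeks after transplanting the plantlets in the greenhouse. After the first two weeks, 25 mM of salt (NaCl) was added to the nutrient solution, until the plants were harvested. Growth and yield parameters were recorded as detailed for growth under normal conditions.
Reduced nutrient (nitrogen) availability screen
Plants from six events (T2 seeds) were grown in potting soil under normal conditions except for the nutrient solution. The pots were watered from transplantation to maturation with a specific nutrient solution containing reduced nitrogen (N) content, usually between 7 and 8 times less. The rest of the cultivation (plant maturation, seed harvest) was the same as for plants not grown under abiotic stress. Growth and yield parameters were recorded as detailed for growth under normal conditions.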
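Example 8: Results of the phenotypic evaluation of the transgenic plants
The results of the evaluation of transgenic rice plants expressing the Orysa_PATL_1 nucleic acid (SEQ ID NO: 1) under the GOS2 promoter and grown in the greenhouse under non-stress conditions (Example 7) are presented below. An increase of at least 3% was observed for seed weight (total seed weight), number of filled seeds, and plant height (Table D1).
Table D1: Results of the evaluation of traits of transgenic plants (T2 plants) transformed with pGOS2::SEQ ID NO: 1.
Yield-related trait | % increase in the transgenic plants (plants transformed with pGOS2::Orysa_PATL_1) relative to the control nullizygous plants |
Total seed weight | 8 |
Harvest index | 5 |
Plant height | 3 |
Number of filled seeds | 7 |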
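The results of the evaluation of transgenic rice plants expressing the AtPRP38_1 nucleic acid (SEQ ID NO: 76) and grown in the greenhouse under non-stress conditions (Example 7) are presented below. An increase of at least 5% was observed for aboveground biomass (AreaMax), emergence vigour (early vigour), total seed yield, number of filled seeds, fill rate, harvest index, and number of seeds per plant (Table D2).
Table D2: Results of the evaluation of traits of transgenic plants transformed with pGOS2::PRP38.
Yield-related trait | % increase in the transgenic plants relative to the control nullizygous plants |
Aboveground area | 10 |
Emergence vigour | 38 |
Total seed yield | 18 |
Number of filled seeds | 18 |
Seed fill rate | 7 |
Harvest index | 8 |
Total number of seeds | 11 |
Evaluation of transgenic rice plants expressing the GATA-like nucleic acid as represented by SEQ ID NO: 128 under non-stress conditions revealed an increase in thousand kernel weight (6 events in total, overall increase of 9.1%, p-value: 0.00001). No significant changes were observed for the other yield parameters.
Evaluation of transgenic rice plants expressing the GATA-like nucleic acid as represented by SEQ ID NO: 177 under non-stress conditions revealed an increase in total seed weight (4 lines out of 6, overall increase of 16.9%, p-value: 0.00001), as well as an increase in the number of filled seeds (4 lines out of 6, overall increase of 16.0%, p-value: 0.00001). No significant changes were observed for the other yield parameters.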
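The results of the evaluation of transgenic rice plants expressing the Arath_ADA2_1 nucleic acid (SEQ ID NO: 181) under the pHMG or the pEXP promoter and grown in the greenhouse under non-stress conditions (Example 7) are presented below. An increase of at least 3% was observed for seed weight (total seed weight), filled seeds (fill rate), number of flowers per panicle, and harvest index (Table D4).
Table D4: Results of the evaluation of traits of transgenic plants (T1 plants) transformed with pHMG::SEQ ID NO: 181 and with pEXP::SEQ ID NO: 181.
Yield-related trait | % increase in the transgenic plants (plants transformed with pHMG::Arath_ADA2_1) relative to the control nullizygous plants | % increase in the transgenic plants (plants transformed with pEXP::Arath_ADA2_1) relative to the control nullizygous plants |
Total seed weight | 21 | 9 |
Number of filled seeds | 21 | 8 |
Seed fill rate | 12 | 10 |
Flowers per panicle | 4 | 3 |
Harvest index | 23 | 8 |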
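Example 9: Results of the phenotypic evaluation of transgenic rice plants expressing a nucleic acid sequence encoding the WDR23-like polypeptide as represented by SEQ ID NO: 216, under the control of a constitutive promoter
The results of the evaluation of the T1 and T2 generations of transgenic rice plants expressing a nucleic acid sequence encoding the WDR23-like polypeptide as represented by SEQ ID NO: 216, under the control of the GOS2 promoter for constitutive expression, are presented below.
As shown in Table D5, in the phenotypic analysis of the T1 and T2 generations, the total seed yield per plant, the seed fill rate, the number of filled seeds, the harvest index and the thousand kernel weight (TKW) of the transgenic plants were all significantly increased compared to the corresponding nullizygotes (controls).
Table D5: Results of the evaluation of the T1 and T2 generations of transgenic rice plants expressing a nucleic acid sequence encoding the WDR23-like polypeptide as represented by SEQ ID NO: 216, under the control of the GOS2 promoter for constitutive expression.
Trait | Overall average % in the T1 generation | Overall average % in the T2 generation |
Total seed yield per plant | 84 | 16 |
Seed fill rate | 73 | 11 |
Number of filled seeds | 95 | 7 |
Harvest index | 93 | 13 |
TKW | 8 | 4 |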
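Example 10: Results of the phenotypic evaluation of transgenic rice plants expressing a nucleic acid sequence encoding the WDR23-like polypeptide as represented by SEQ ID NO: 216, under the control of a meristem-specific promoter
The results of the evaluation of the T1 and T2 generations of transgenic rice plants expressing a nucleic acid sequence encoding the WDR23-like polypeptide as represented by SEQ ID NO: 216, under the control of the metallothionein (MT) promoter for meristem-specific expression, are presented below.
As shown in Table E, in the phenotypic analysis of the T1 and T2 generations, the total seed yield per plant, the seed fill rate, the number of filled seeds, the harvest index and the thousand kernel weight (TKW) of the transgenic plants were all significantly increased compared to the corresponding nullizygotes (controls).
Table E: Results of the evaluation of the T1 and T2 generations of transgenic rice plants expressing a nucleic acid sequence encoding the WDR23-like polypeptide as represented by SEQ ID NO: 216, under the control of the MT promoter for meristem-specific expression.
Trait | Overall average % in the T1 generation | Overall average % in the T2 generation |
Total seed yield per plant | 18 | 7 |
Seed fill rate | 17 | 6 |
Number of filled seeds | 10 | 5 |
Harvest index | 15 | 8 |
TKW | 2 | 0 |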
Sequence Listing
<110>BASF Plant Science GmbH
<120>Plants having enhanced yield-related traits and a method for making the same
<130>PF60259
<160>275
<170>PatentIn version 3.5
<210>1
<211>1842
<212>DNA
<213>Oryza sativa
<400>1
atggcggagg agccacagcc agaggccgcg cccgccgcgg tggcggcgac gaccgaggtg 60
gcggtggcgg agaaggcgcc cgtggaggcg gagaaggaga agaaggtgga ggaggagacg 120
ccggcggtgg aggccgaggc gaaggaggag aagaaggatg aggcggcggc ggcggcggcg 180
gcgggaggtg atgaggccgg ggcgatagag gggaccggat cgttcaagga ggagagcaac 240
ctggtggcgg acttgcctga cccggagaag aaggcgctcg atgagttcaa gcagctgatc 300
gccgccgccc tcgccgcctg tgagttcaat ctgcctcccc ctccgccgcc tcccaaggcg 360
aaggttgaag ccgccgttga ggagaccaag gcggaggaga ccaaggccga ggaggaaccc 420
aaggctgagg agccggccaa ggaggaggag cccaaggccg aggtggcggc ggcggcggcg 480
gcgccgccgg aggcaggaac cgaggagccg aaggcggagg cgtcgtccga agaggccaag 540
accgaggagc cgaaggccga ggcggcggcc gacgagccgg ccaaggagga gtccaaagct 600
gaggcggcgc cggctgagga agccaagccg gccgagccgg agccggagga gaagaccgtc 660
gtggtcaccg aggaagaggc ggccaccaag acggtggaag cgatcgagga aaccgtcgtg 720
cccgctgctg ctgcgcctgc tgccgccgcc acggaggaag ccgcggcgcc ggaaccggag 780
gtgcaggcgg cggcggcgcc tgagcccgtg ttgatctggg gcgtgcccct ggtaggcgac 840
gacgagcgca ccgacacggt gctcctcaag ttcctgcgcg cgcgcgagtt caaggtgaag 900
gaggccatgg cgatgctcag gtcggccgtg ctgtggcgca agcgcttcgg catcgagtcc 960
ctcctcgacg ccgacctcgc cctgccggag ctcgacagcg tggtgttcta ccgcggcgcc 1020
gaccgcgagg gccaccccgt gtgctacaac gtctacggcg aattccagga caaggacctg 1080
tacgagaagg cattcggcga cgaggagaag cgggagcgct tcctcaagtg gcgcatccag 1140
ctgctggagc gcggcatcct gtcgcagctc gacttctcgc ccagtggcat ctgctccatg 1200
gttcaggtca cagacctcaa gaactcgcca cctatgctcg gcaagcaccg cgccgtcacc 1260
cgccaggccg ttgctctgct ccaggacaac taccccgagt tcatcgccaa gaaggtgttc 1320
atcaatgtgc catggtggta tctcgctgcc aacaaaatga tgagcccgtt cctcacgcag 1380
cgtaccaaga gcaagttcat ttttgccagc ccagccaaat cagctgagac cctcttcaga 1440
tatatcgcac cagagcaagt ccctgtccaa ttcggaggtc tcttcaagga agatgatcct 1500
gagttcacca cctcagacgc cgttaccgag ctcactatca aaccttcatc gaaagaaacc 1560
gttgagattc ctgtcactga gaattccacg attggatggg agctccgggt gcttggatgg 1620
gaggtgagct acggagcaga gttcactcct gatgccgagg gtggatacac agtcatcgtg 1680
cagaaaacga ggaaggtgcc tgcaaatgag gaaccaatca tgaaaggcag cttcaaggtt 1740
ggcgagccag gaaagattgt gctaacgatc aacaaccctg catcaaagaa gaagaagctc 1800
ctctacagat ccaaggtcaa gagcaccagt gagtccgttt ga 1842
<210>2
<211>613
<212>PRT
<213>稻
<400>2
Met Ala Glu Glu Pro Gln Pro Glu Ala Ala Pro Ala Ala Val Ala Ala
1 5 10 15
Thr Thr Glu Val Ala Val Ala Glu Lys Ala Pro Val Glu Ala Glu Lys
20 25 30
Glu Lys Lys Val Glu Glu Glu Thr Pro Ala Val Glu Ala Glu Ala Lys
35 40 45
Glu Glu Lys Lys Asp Glu Ala Ala Ala Ala Ala Ala Ala Gly Gly Asp
50 55 60
Glu Ala Gly Ala Ile Glu Gly Thr Gly Ser Phe Lys Glu Glu Ser Asn
65 70 75 80
Leu Val Ala Asp Leu Pro Asp Pro Glu Lys Lys Ala Leu Asp Glu Phe
85 90 95
Lys Gln Leu Ile Ala Ala Ala Leu Ala Ala Cys Glu Phe Asn Leu Pro
100 105 110
Pro Pro Pro Pro Pro Pro Lys Ala Lys Val Glu Ala Ala Val Glu Glu
115 120 125
Thr Lys Ala Glu Glu Thr Lys Ala Glu Glu Glu Pro Lys Ala Glu Glu
130 135 140
Pro Ala Lys Glu Glu Glu Pro Lys Ala Glu Val Ala Ala Ala Ala Ala
145 150 155 160
Ala Pro Pro Glu Ala Gly Thr Glu Glu Pro Lys Ala Glu Ala Ser Ser
165 170 175
Glu Glu Ala Lys Thr Glu Glu Pro Lys Ala Glu Ala Ala Ala Asp Glu
180 185 190
Pro Ala Lys Glu Glu Ser Lys Ala Glu Ala Ala Pro Ala Glu Glu Ala
195 200 205
Lys Pro Ala Glu Pro Glu Pro Glu Glu Lys Thr Val Val Val Thr Glu
210 215 220
Glu Glu Ala Ala Thr Lys Thr Val Glu Ala Ile Glu Glu Thr Val Val
225 230 235 240
Pro Ala Ala Ala Ala Pro Ala Ala Ala Ala Thr Glu Glu Ala Ala Ala
245 250 255
Pro Glu Pro Glu Val Gln Ala Ala Ala Ala Pro Glu Pro Val Leu Ile
260 265 270
Trp Gly Val Pro Leu Val Gly Asp Asp Glu Arg Thr Asp Thr Val Leu
275 280 285
Leu Lys Phe Leu Arg Ala Arg Glu Phe Lys Val Lys Glu Ala Met Ala
290 295 300
Met Leu Arg Ser Ala Val Leu Trp Arg Lys Arg Phe Gly Ile Glu Ser
305 310 315 320
Leu Leu Asp Ala Asp Leu Ala Leu Pro Glu Leu Asp Ser Val Val Phe
325 330 335
Tyr Arg Gly Ala Asp Arg Glu Gly His Pro Val Cys Tyr Asn Val Tyr
340 345 350
Gly Glu Phe Gln Asp Lys Asp Leu Tyr Glu Lys Ala Phe Gly Asp Glu
355 360 365
Glu Lys Arg Glu Arg Phe Leu Lys Trp Arg Ile Gln Leu Leu Glu Arg
370 375 380
Gly Ile Leu Ser Gln Leu Asp Phe Ser Pro Ser Gly Ile Cys Ser Met
385 390 395 400
Val Gln Val Thr Asp Leu Lys Asn Ser Pro Pro Met Leu Gly Lys His
405 410 415
Arg Ala Val Thr Arg Gln Ala Val Ala Leu Leu Gln Asp Asn Tyr Pro
420 425 430
Glu Phe Ile Ala Lys Lys Val Phe Ile Asn Val Pro Trp Trp Tyr Leu
435 440 445
Ala Ala Asn Lys Met Met Ser Pro Phe Leu Thr Gln Arg Thr Lys Ser
450 455 460
Lys Phe Ile Phe Ala Ser Pro Ala Lys Ser Ala Glu Thr Leu Phe Arg
465 470 475 480
Tyr Ile Ala Pro Glu Gln Val Pro Val Gln Phe Gly Gly Leu Phe Lys
485 490 495
Glu Asp Asp Pro Glu Phe Thr Thr Ser Asp Ala Val Thr Glu Leu Thr
500 505 510
Ile Lys Pro Ser Ser Lys Glu Thr Val Glu Ile Pro Val Thr Glu Asn
515 520 525
Ser Thr Ile Gly Trp Glu Leu Arg Val Leu Gly Trp Glu Val Ser Tyr
530 535 540
Gly Ala Glu Phe Thr Pro Asp Ala Glu Gly Gly Tyr Thr Val Ile Val
545 550 555 560
Gln Lys Thr Arg Lys Val Pro Ala Asn Glu Glu Pro Ile Met Lys Gly
565 570 575
Ser Phe Lys Val Gly Glu Pro Gly Lys Ile Val Leu Thr Ile Asn Asn
580 585 590
Pro Ala Ser Lys Lys Lys Lys Leu Leu Tyr Arg Ser Lys Val Lys Ser
595 600 605
Thr Ser Glu Ser Val
610
<210>3
<211>1308
<212>DNA
<213>Oryza sativa
<400>3
atgtctccga ccgccacccc atcccccgcg cccgccgcgg tggcggcagc gcccaagccg 60
ccgccgtcgg ggacgggggc gaagcgcagc ctgatgtcgt cgctgatgga ggccaccgcg 120
ctgctcaggt cgtcgtcgtt caaggaggac tcgtacgtgg cgtccgcgct cccggcgtcc 180
gacctccgcg cgctcgccga cctcagggcg ctgctgtcca cgcacccgga cccaatctcc 240
atctggggcg tgccgctcaa cccggcgccg ccccagggcg gggagggggc cccggccccg 300
gccgccgccg ccgacgagcg cgccgacgtc gtgctgctca agttcctccg cgcgcgggac 360
ttccgcgtcc gcgacgcgca cgccatgctg ctccgctgcg ccgcgtggcg ggccgagttc 420
cgcgccgacg ccgtgctgga cgaggacctc gggttcaagg acctggaggg ggtcgtcgcc 480
tacatgcacg gctgggaccg ggagggccac ccggtctgct acaacgccta cggcgtcttc 540
aaggacaggg acatgtacga ccgcgtcttc ggcgacggcg agcgcctcgc ccgcttcctc 600
cgctggcgcg tccaggtcat ggagcgcggc gtccgcgcgc tccacctccg ccccggcggc 660
gtcaacgcca tcatccaggt caccgacctc aaggacatgc ccaagcgcga gctccgcgcc 720
gcgtccaacc agatcctctc cctcttccag gacaactacc ctgagatggt cgcgcgcaag 780
gtgttcatca acgtgccctg gtacttctcg gtgctgttct ccatgatctc gccgttcctc 840
acggagcgca ccaagagcaa gttcgtcatc gcgcgcgagg gcaacgtcgc cgagacgctc 900
ttcaagttca tccggccgga gctggtgccg gtgcagtacg gcgggctgag ccgcgccggc 960
gacctggaga acggcccgcc gaagccggcg tccgagttca ccatcaaggg cggcgagaag 1020
gtcttcctag agatcgacgg catcgaggcc ggcgcgacga taacgtggga cctggtggtg 1080
ggcgggtggg agctggagta cggcgcggag tacgtgccgg cggcggagga cagctacacg 1140
ctgtgcgtgg agcggacgag gaaggtgccg gccgccgccg acgagccggt gcacaacgcc 1200
ttcacggcga gggaggccgg caagatggtg ctctccatcg acaactccgg ctcccggaag 1260
cggaaggtcg ccgcctaccg gtacttcgtg cgcaagccgt cggcgtag 1308
<210>4
<211>435
<212>PRT
<213>Oryza sativa
<400>4
Met Ser Pro Thr Ala Thr Pro Ser Pro Ala Pro Ala Ala Val Ala Ala
1 5 10 15
Ala Pro Lys Pro Pro Pro Ser Gly Thr Gly Ala Lys Arg Ser Leu Met
20 25 30
Ser Ser Leu Met Glu Ala Thr Ala Leu Leu Arg Ser Ser Ser Phe Lys
35 40 45
Glu Asp Ser Tyr Val Ala Ser Ala Leu Pro Ala Ser Asp Leu Arg Ala
50 55 60
Leu Ala Asp Leu Arg Ala Leu Leu Ser Thr His Pro Asp Pro Ile Ser
65 70 75 80
Ile Trp Gly Val Pro Leu Asn Pro Ala Pro Pro Gln Gly Gly Glu Gly
85 90 95
Ala Pro Ala Pro Ala Ala Ala Ala Asp Glu Arg Ala Asp Val Val Leu
100 105 110
Leu Lys Phe Leu Arg Ala Arg Asp Phe Arg Val Arg Asp Ala His Ala
115 120 125
Met Leu Leu Arg Cys Ala Ala Trp Arg Ala Glu Phe Arg Ala Asp Ala
130 135 140
Val Leu Asp Glu Asp Leu Gly Phe Lys Asp Leu Glu Gly Val Val Ala
145 150 155 160
Tyr Met His Gly Trp Asp Arg Glu Gly His Pro Val Cys Tyr Asn Ala
165 170 175
Tyr Gly Val Phe Lys Asp Arg Asp Met Tyr Asp Arg Val Phe Gly Asp
180 185 190
Gly Glu Arg Leu Ala Arg Phe Leu Arg Trp Arg Val Gln Val Met Glu
195 200 205
Arg Gly Val Arg Ala Leu His Leu Arg Pro Gly Gly Val Asn Ala Ile
210 215 220
Ile Gln Val Thr Asp Leu Lys Asp Met Pro Lys Arg Glu Leu Arg Ala
225 230 235 240
Ala Ser Asn Gln Ile Leu Ser Leu Phe Gln Asp Asn Tyr Pro Glu Met
245 250 255
Val Ala Arg Lys Val Phe Ile Asn Val Pro Trp Tyr Phe Ser Val Leu
260 265 270
Phe Ser Met Ile Ser Pro Phe Leu Thr Glu Arg Thr Lys Ser Lys Phe
275 280 285
Val Ile Ala Arg Glu Gly Asn Val Ala Glu Thr Leu Phe Lys Phe Ile
290 295 300
Arg Pro Glu Leu Val Pro Val Gln Tyr Gly Gly Leu Ser Arg Ala Gly
305 310 315 320
Asp Leu Glu Asn Gly Pro Pro Lys Pro Ala Ser Glu Phe Thr Ile Lys
325 330 335
Gly Gly Glu Lys Val Phe Leu Glu Ile Asp Gly Ile Glu Ala Gly Ala
340 345 350
Thr Ile Thr Trp Asp Leu Val Val Gly Gly Trp Glu Leu Glu Tyr Gly
355 360 365
Ala Glu Tyr Val Pro Ala Ala Glu Asp Ser Tyr Thr Leu Cys Val Glu
370 375 380
Arg Thr Arg Lys Val Pro Ala Ala Ala Asp Glu Pro Val His Asn Ala
385 390 395 400
Phe Thr Ala Arg Glu Ala Gly Lys Met Val Leu Ser Ile Asp Asn Ser
405 410 415
Gly Ser Arg Lys Arg Lys Val Ala Ala Tyr Arg Tyr Phe Val Arg Lys
420 425 430
Pro Ser Ala
435
<210>5
<211>1758
<212>DNA
<213>Oryza sativa
<400>5
atggctgagg aggcgaagca agagaccccc gccgccgccg aggtggtggt ggtggagaag 60
gccgacgagg tggtggcggt ggagaaggcc gtggaggtgg aggcggagga gaagaagttg 120
gcggagcagg aggaggagga ggagaagaag gcggaggagg cggaggaggc cgctgggggc 180
gatgaggcgg cggtgatcga ggggacgggg tcgttcaagg aggagagcaa cctcgtctcc 240
gagctccccg acccggagcg cacggcgctc gcccagctca aggagctcgt cgccgccgcg 300
cttgccgccg gcgagtttga ccttcccccg ccgccgccgc ctccaccggc caaggcggag 360
gagccggcca aggaggagga acccaaggcc gccgaggccc ccgccgccga ggagcccaag 420
gccgaggccg aggctgaggc tgaggccgcc gccacggagg agcccaagac cgaggagccc 480
aagaccgagg agcccgccaa ggaggaggag cccaaggcgg cggcggcggc tgcggccgag 540
gagcccaagg cggaggcggc ggcggaggag gccaagccgg ccgagccgga aacggaggag 600
aagaccgtcg tggtcaccga ggatgaaggc accagcaaga cggtggaggc catcgaggag 660
accgtcgtcg tcgccgcccc ggccgcggcc gcggaagccg aggctgcggc gccgaaggag 720
gagctgatat ggggcgtgcc gctgaccggc gacgacgagc gcacggacac cgtgctgctc 780
aagttcctcc gcgccaggga gttcaaggtg aaggaggcca tggcgatgct caaggcggcg 840
gtgctgtggc gcaagcgctt cggcatcgac gccgtcctgg ccgccgacct cggcctgccg 900
gagctcgaga acgtggtgtt ctaccgcggc gccgaccgcg agggccaccc cgtctgctac 960
aacgtctacg gcgagttcca ggacaaggac ctctacgaga aggccttcgg cgacgaggag 1020
aagcgggagc gattcctcaa gtggcgcatc cagctcctcg agcgcggcat cctcgaccag 1080
ctcgacttct cccccagcgg catctgctcc atggtgcagg tcaccgacct caagaactcg 1140
ccgcccatgc tcggcaagca ccgcaccgtc acccgccagg cgctcgccct gctccaggac 1200
aactaccccg agttcatcgc caagaagatt ttcatcaacg tgccatggtg gtacattgcg 1260
gctaacaaaa tggtgagtcc gttcctcaca caacgcacca agagcaagat catcttttgc 1320
accgcagcca agtctgcaga gacccttttc agatacattg ctcctgagca agtccctgtc 1380
caattcggag gcctctacaa agaggatgac accgagttct ccacctcaga tgctgtcaca 1440
gagcttccga tcaaaccttc atccaaagaa actgttgaga ttcctgctac tgagaactcc 1500
acggttgtgt gggagcttcg ggtgcttggg tgggaggtga gctacggtgc agagttcacc 1560
cctgatgcag agggtggcta cactgtcatt gtgcagaaga caaggaaggt gcccgccaat 1620
gaagagccga tcatgaaggg cagcttcaag gttggcgagc ctggcaagat cgtcctcaca 1680
gtcgacaacg cagcatccaa gaagaagaag cagcttcttt accgattcaa ggtcaagagc 1740
tcctctgaat ctgcctaa 1758
<210>6
<211>585
<212>PRT
<213>Oryza sativa
<400>6
Met Ala Glu Glu Ala Lys Gln Glu Thr Pro Ala Ala Ala Glu Val Val
1 5 10 15
Val Val Glu Lys Ala Asp Glu Val Val Ala Val Glu Lys Ala Val Glu
20 25 30
Val Glu Ala Glu Glu Lys Lys Leu Ala Glu Gln Glu Glu Glu Glu Glu
35 40 45
Lys Lys Ala Glu Glu Ala Glu Glu Ala Ala Gly Gly Asp Glu Ala Ala
50 55 60
Val Ile Glu Gly Thr Gly Ser Phe Lys Glu Glu Ser Asn Leu Val Ser
65 70 75 80
Glu Leu Pro Asp Pro Glu Arg Thr Ala Leu Ala Gln Leu Lys Glu Leu
85 90 95
Val Ala Ala Ala Leu Ala Ala Gly Glu Phe Asp Leu Pro Pro Pro Pro
100 105 110
Pro Pro Pro Pro Ala Lys Ala Glu Glu Pro Ala Lys Glu Glu Glu Pro
115 120 125
Lys Ala Ala Glu Ala Pro Ala Ala Glu Glu Pro Lys Ala Glu Ala Glu
130 135 140
Ala Glu Ala Glu Ala Ala Ala Thr Glu Glu Pro Lys Thr Glu Glu Pro
145 150 155 160
Lys Thr Glu Glu Pro Ala Lys Glu Glu Glu Pro Lys Ala Ala Ala Ala
165 170 175
Ala Ala Ala Glu Glu Pro Lys Ala Glu Ala Ala Ala Glu Glu Ala Lys
180 185 190
Pro Ala Glu Pro Glu Thr Glu Glu Lys Thr Val Val Val Thr Glu Asp
195 200 205
Glu Gly Thr Ser Lys Thr Val Glu Ala Ile Glu Glu Thr Val Val Val
210 215 220
Ala Ala Pro Ala Ala Ala Ala Glu Ala Glu Ala Ala Ala Pro Lys Glu
225 230 235 240
Glu Leu Ile Trp Gly Val Pro Leu Thr Gly Asp Asp Glu Arg Thr Asp
245 250 255
Thr Val Leu Leu Lys Phe Leu Arg Ala Arg Glu Phe Lys Val Lys Glu
260 265 270
Ala Met Ala Met Leu Lys Ala Ala Val Leu Trp Arg Lys Arg Phe Gly
275 280 285
Ile Asp Ala Val Leu Ala Ala Asp Leu Gly Leu Pro Glu Leu Glu Asn
290 295 300
Val Val Phe Tyr Arg Gly Ala Asp Arg Glu Gly His Pro Val Cys Tyr
305 310 315 320
Asn Val Tyr Gly Glu Phe Gln Asp Lys Asp Leu Tyr Glu Lys Ala Phe
325 330 335
Gly Asp Glu Glu Lys Arg Glu Arg Phe Leu Lys Trp Arg Ile Gln Leu
340 345 350
Leu Glu Arg Gly Ile Leu Asp Gln Leu Asp Phe Ser Pro Ser Gly Ile
355 360 365
Cys Ser Met Val Gln Val Thr Asp Leu Lys Asn Ser Pro Pro Met Leu
370 375 380
Gly Lys His Arg Thr Val Thr Arg Gln Ala Leu Ala Leu Leu Gln Asp
385 390 395 400
Asn Tyr Pro Glu Phe Ile Ala Lys Lys Ile Phe Ile Asn Val Pro Trp
405 410 415
Trp Tyr Ile Ala Ala Asn Lys Met Val Ser Pro Phe Leu Thr Gln Arg
420 425 430
Thr Lys Ser Lys Ile Ile Phe Cys Thr Ala Ala Lys Ser Ala Glu Thr
435 440 445
Leu Phe Arg Tyr Ile Ala Pro Glu Gln Val Pro Val Gln Phe Gly Gly
450 455 460
Leu Tyr Lys Glu Asp Asp Thr Glu Phe Ser Thr Ser Asp Ala Val Thr
465 470 475 480
Glu Leu Pro Ile Lys Pro Ser Ser Lys Glu Thr Val Glu Ile Pro Ala
485 490 495
Thr Glu Asn Ser Thr Val Val Trp Glu Leu Arg Val Leu Gly Trp Glu
500 505 510
Val Ser Tyr Gly Ala Glu Phe Thr Pro Asp Ala Glu Gly Gly Tyr Thr
515 520 525
Val Ile Val Gln Lys Thr Arg Lys Val Pro Ala Asn Glu Glu Pro Ile
530 535 540
Met Lys Gly Ser Phe Lys Val Gly Glu Pro Gly Lys Ile Val Leu Thr
545 550 555 560
Val Asp Asn Ala Ala Ser Lys Lys Lys Lys Gln Leu Leu Tyr Arg Phe
565 570 575
Lys Val Lys Ser Ser Ser Glu Ser Ala
580 585
<210>7
<211>1554
<212>DNA
<213>Oryza sativa
<400>7
atggcggtgg aggtcgtgtc tgagggtgcg gcggccgccg cggcggaggt ggcggcgccg 60
gagacgaagg aggtgaccgc gaaggcggcg gcggatgagg cggtgacgct ggctgccgtc 120
gtgtccaaga acgcgtcgtt cagggaggag agcaacttcc tggatgatct caaggacggc 180
gagaggaagg cgttggctga gctccgtgcc aaggttgagg aggccatcgt cgacggcaag 240
ctgttcgatg acggcaaggt ggaggccaag aagaaggccg cggcggcgga ggaggagaag 300
gcggtggagg aggccgccgg tgagaagaaa gatggcgagg agaagaaaga ggaggaggag 360
ccggtgacgg aggagaagaa ggaagaggag caaggcgagg aggaggagga gcccaagaaa 420
gaggaagccg acgagggcga gaaggaggag aagccggcgg aggaggaggc ggcggcggtt 480
gtggacaagg acatcgctct gtggggcgtg ccgctgctgc cgagcaaggg cgacgacgcc 540
accgacgtcg tcctcctcaa gttcctccgc gcgcgcgact tcaaggccgg cgccgccttc 600
gacatgctcc gcaagacgct ccactggcgc agggagtgga agggcttcgc cgccggcacc 660
gacgacgacg acgacggcga ggcgctcccg gcggagctcg ccgacgcgtg ctacctcgac 720
ggcgcggaca gggagggcca cccggtgtgc tacaacgcgc tcggcgtgtt cgccgacgac 780
gccgtgtaca agaaggcgct cggcacggag gaaggcaagg cgaggttcct ccggtggagg 840
gtgcgcgcca tggagagcca cgtggccaag ctcgacctca ggcccggcgg cgtcgcgtcg 900
ctgctgcagg tgacggacct caagaactcg ccggggccgg ccaagaagga cctccgcgtc 960
gccatgaagc aggtgctcga cctcttccag gacaactacc ctgagctcgt cgcaagaaac 1020
attctgatca atgtgccgtt ctggtactac gcgttcagca ccctcttcta cccgttcatg 1080
acgcagagga ccaagagcaa gttcgtcatt gctcggccct ccaaggtcac cgagaccctc 1140
ctcaagtaca ttcccattga agccattcca gtgaagtacg gtggtctgaa gcgcgacgac 1200
gacaccgagt tctcggcaga ggacagtgaa gtcacagagc tcgttgtcaa ggcaagctcc 1260
accgaaacca tcgagatcga agccacagag ggtgacacta cgctgacatg ggacctgacc 1320
gtgctgggat gggaggtgaa ctacaaggag gagttcgtgc cgagcgagga gggctcgtac 1380
accgtcatcg tcaagaaggg gaagaagatg gggtcgtcgg aggcggcggt ccgcaactcg 1440
ttccgcgccg gcgagccggg gaaggtggtc ctcaccgtcg agaacctgac gcacaggaag 1500
aagaaggtgc tgttcaggca caaggccaag agcgcctgtg ccaaggagtg ctga 1554
<210>8
<211>517
<212>PRT
<213>Oryza sativa
<400>8
Met Ala Val Glu Val Val Ser Glu Gly Ala Ala Ala Ala Ala Ala Glu
1 5 10 15
Val Ala Ala Pro Glu Thr Lys Glu Val Thr Ala Lys Ala Ala Ala Asp
20 25 30
Glu Ala Val Thr Leu Ala Ala Val Val Ser Lys Asn Ala Ser Phe Arg
35 40 45
Glu Glu Ser Asn Phe Leu Asp Asp Leu Lys Asp Gly Glu Arg Lys Ala
50 55 60
Leu Ala Glu Leu Arg Ala Lys Val Glu Glu Ala Ile Val Asp Gly Lys
65 70 75 80
Leu Phe Asp Asp Gly Lys Val Glu Ala Lys Lys Lys Ala Ala Ala Ala
85 90 95
Glu Glu Glu Lys Ala Val Glu Glu Ala Ala Gly Glu Lys Lys Asp Gly
100 105 110
Glu Glu Lys Lys Glu Glu Glu Glu Pro Val Thr Glu Glu Lys Lys Glu
115 120 125
Glu Glu Gln Gly Glu Glu Glu Glu Glu Pro Lys Lys Glu Glu Ala Asp
130 135 140
Glu Gly Glu Lys Glu Glu Lys Pro Ala Glu Glu Glu Ala Ala Ala Val
145 150 155 160
Val Asp Lys Asp Ile Ala Leu Trp Gly Val Pro Leu Leu Pro Ser Lys
165 170 175
Gly Asp Asp Ala Thr Asp Val Val Leu Leu Lys Phe Leu Arg Ala Arg
180 185 190
Asp Phe Lys Ala Gly Ala Ala Phe Asp Met Leu Arg Lys Thr Leu His
195 200 205
Trp Arg Arg Glu Trp Lys Gly Phe Ala Ala Gly Thr Asp Asp Asp Asp
210 215 220
Asp Gly Glu Ala Leu Pro Ala Glu Leu Ala Asp Ala Cys Tyr Leu Asp
225 230 235 240
Gly Ala Asp Arg Glu Gly His Pro Val Cys Tyr Asn Ala Leu Gly Val
245 250 255
Phe Ala Asp Asp Ala Val Tyr Lys Lys Ala Leu Gly Thr Glu Glu Gly
260 265 270
Lys Ala Arg Phe Leu Arg Trp Arg Val Arg Ala Met Glu Ser His Val
275 280 285
Ala Lys Leu Asp Leu Arg Pro Gly Gly Val Ala Ser Leu Leu Gln Val
290 295 300
Thr Asp Leu Lys Asn Ser Pro Gly Pro Ala Lys Lys Asp Leu Arg Val
305 310 315 320
Ala Met Lys Gln Val Leu Asp Leu Phe Gln Asp Asn Tyr Pro Glu Leu
325 330 335
Val Ala Arg Asn Ile Leu Ile Asn Val Pro Phe Trp Tyr Tyr Ala Phe
340 345 350
Ser Thr Leu Phe Tyr Pro Phe Met Thr Gln Arg Thr Lys Ser Lys Phe
355 360 365
Val Ile Ala Arg Pro Ser Lys Val Thr Glu Thr Leu Leu Lys Tyr Ile
370 375 380
Pro Ile Glu Ala Ile Pro Val Lys Tyr Gly Gly Leu Lys Arg Asp Asp
385 390 395 400
Asp Thr Glu Phe Ser Ala Glu Asp Ser Glu Val Thr Glu Leu Val Val
405 410 415
Lys Ala Ser Ser Thr Glu Thr Ile Glu Ile Glu Ala Thr Glu Gly Asp
420 425 430
Thr Thr Leu Thr Trp Asp Leu Thr Val Leu Gly Trp Glu Val Asn Tyr
435 440 445
Lys Glu Glu Phe Val Pro Ser Glu Glu Gly Ser Tyr Thr Val Ile Val
450 455 460
Lys Lys Gly Lys Lys Met Gly Ser Ser Glu Ala Ala Val Arg Asn Ser
465 470 475 480
Phe Arg Ala Gly Glu Pro Gly Lys Val Val Leu Thr Val Glu Asn Leu
485 490 495
Thr His Arg Lys Lys Lys Val Leu Phe Arg His Lys Ala Lys Ser Ala
500 505 510
Cys Ala Lys Glu Cys
515
<210>9
<211>1058
<212>DNA
<213>Brassica napus
<400>9
gcgacgtcgt tttactgaag ttcctacgcg ctagggactt caaggtgaaa gattcgctgg 60
caatgctcaa gaacacagtc aagtggagaa gggagttcaa gatcgacgag ttggtcgatg 120
aagactttgt ggatgatctt gacaaggtcg tgttcatgca cggacacgac cgtgaaggcc 180
acccggtgtg ttacaatgtc tacggcgagt tccagaacaa ggagctttac aacaagacgt 240
tctccgatga ggagaagagg aagcatttct tgaggacgag gattcagttc ttggagagga 300
gtataaggaa gctagacttc agctccggtg gtgtttccac catttttcag attaatgata 360
tgaagaactc tccggggttg gggaagaaag agcttagatc ggcgaccaag caagccgtgc 420
agttgcttca ggacaattac cctgagtttg tcttcaaaca ggccttcatc aatgttccct 480
ggtggtacct tgtgttttac actgtgattg gtccgttcat gacaccaaga tcaaagagca 540
agttggtgtt tgctggtccg tcaagatcag ccgaaactct tttcaaatac atatcgcccg 600
aacaagtccc ggtacaatac ggaggattga gtgtagatcc ttgtgactgc aatccagact 660
tctctttgga tgatccagcc tcagaggtca ttgttaagcc tggaacaaag caaactgttg 720
agatcataat ctatgagaaa tgtgagattg tgtgggagat aagggtaatt ggatgggaag 780
tgagctacaa ggcagagttt gtgcctgaag agaaagatgc ttatacggtg gttgtacaga 840
aaccgaggaa gatgaaacca ttcgatgaac cggtgttaac ccagagcttc aaagtgaatg 900
agcttggcaa ggttttactc actgtagaca acccaacctc taagaagaag aagctcgttt 960
acaggttcaa tgtcaaacca ctctgaagtg agatgtcttc tttgtgtttt tgtatatgtg 1020
agtgtttggt atatcatatc atatcatttt gtatttgt 1058
<210>10
<211>307
<212>PRT
<213>Brassica napus
<400>10
Met Leu Lys Asn Thr Val Lys Trp Arg Arg Glu Phe Lys Ile Asp Glu
1 5 10 15
Leu Val Asp Glu Asp Phe Val Asp Asp Leu Asp Lys Val Val Phe Met
20 25 30
His Gly His Asp Arg Glu Gly His Pro Val Cys Tyr Asn Val Tyr Gly
35 40 45
Glu Phe Gln Asn Lys Glu Leu Tyr Asn Lys Thr Phe Ser Asp Glu Glu
50 55 60
Lys Arg Lys His Phe Leu Arg Thr Arg Ile Gln Phe Leu Glu Arg Ser
65 70 75 80
Ile Arg Lys Leu Asp Phe Ser Ser Gly Gly Val Ser Thr Ile Phe Gln
85 90 95
Ile Asn Asp Met Lys Asn Ser Pro Gly Leu Gly Lys Lys Glu Leu Arg
100 105 110
Ser Ala Thr Lys Gln Ala Val Gln Leu Leu Gln Asp Asn Tyr Pro Glu
115 120 125
Phe Val Phe Lys Gln Ala Phe Ile Asn Val Pro Trp Trp Tyr Leu Val
130 135 140
Phe Tyr Thr Val Ile Gly Pro Phe Met Thr Pro Arg Ser Lys Ser Lys
145 150 155 160
Leu Val Phe Ala Gly Pro Ser Arg Ser Ala Glu Thr Leu Phe Lys Tyr
165 170 175
Ile Ser Pro Glu Gln Val Pro Val Gln Tyr Gly Gly Leu Ser Val Asp
180 185 190
Pro Cys Asp Cys Asn Pro Asp Phe Ser Leu Asp Asp Pro Ala Ser Glu
195 200 205
Val Ile Val Lys Pro Gly Thr Lys Gln Thr Val Glu Ile Ile Ile Tyr
210 215 220
Glu Lys Cys Glu Ile Val Trp Glu Ile Arg Val Ile Gly Trp Glu Val
225 230 235 240
Ser Tyr Lys Ala Glu Phe Val Pro Glu Glu Lys Asp Ala Tyr Thr Val
245 250 255
Val Val Gln Lys Pro Arg Lys Met Lys Pro Phe Asp Glu Pro Val Leu
260 265 270
Thr Gln Ser Phe Lys Val Asn Glu Leu Gly Lys Val Leu Leu Thr Val
275 280 285
Asp Asn Pro Thr Ser Lys Lys Lys Lys Leu Val Tyr Arg Phe Asn Val
290 295 300
Lys Pro Leu
305
<210>11
<211>1092
<212>DNA
<213>Helianthus annuus
<400>11
tcccgcgtcg acgatttcgt atgatcttga gaaagttgtg attaatcatg gctttgataa 60
agaaggacac ccagattaat ataatgtgta tggtgagttt cagaacaaag agttgtataa 120
taaaatgttt agtgataatg aaggaagatt gaggttttta aggtggagaa ttcagtatct 180
tgaaaggagt ataaggaagt tggattttag gcctggtggg gtgaatacta tttttcagat 240
tagtgatttg aagaactcgc ctggaccggc gaaacgggag cttcggttag ccaccaggca 300
agctctgcag attctgcagg ataactaccc tgaatttgtg gcaaaacagg ttttcatcaa 360
tgccccttgg tggtatttag ctttctatac aatgattagt ccattcatga ctcaaaggac 420
taaaagcaag tttgtattcg ctagcacggc caagacaccc gaaacccttt tcaaatacgt 480
gaatccggag cacgtaccaa ttcagtacgg cgggttaagc gtagattact gcgattgcaa 540
tcccgaattc acaattgatg atccggcttc agtggtcacc gttaaaccag ctaccaagca 600
gaccgtggag attatagtaa acgagaaatg cttatttgta tgggagctac gtgtagtcgg 660
ttgggaggtg agctatagtg ctgaatatgt gccgaacaac gaaagccatt acacgataat 720
catacaaaag gctagaaaga tgactccaac cgatgaacca gtgatcagtc acagtttcaa 780
gatcagtgag cttggtaaga tacttctaac cgtcgacaac ccgacctcca agaagaaaac 840
actgctctat aggttcaagg tgaacccgct ttccgaatag aataacgtac atcaaggatc 900
gacagacaga cagtcatatg atcgcgcgag tttattattt atttatgttt ttatttgttt 960
ctttttggta gttgttatga gtttgggtcc atggtgttgt gtttgggttc aaaaggcttg 1020
tggtttttgt attaatgtct tttggctttc acaaaatcgg ttgctgaaat ttgactgatc 1080
gagttctatt tt 1092
<210>12
<211>251
<212>PRT
<213>Helianthus annuus
<400>12
Met Phe Ser Asp Asn Glu Gly Arg Leu Arg Phe Leu Arg Trp Arg Ile
1 5 10 15
Gln Tyr Leu Glu Arg Ser Ile Arg Lys Leu Asp Phe Arg Pro Gly Gly
20 25 30
Val Asn Thr Ile Phe Gln Ile Ser Asp Leu Lys Asn Ser Pro Gly Pro
35 40 45
Ala Lys Arg Glu Leu Arg Leu Ala Thr Arg Gln Ala Leu Gln Ile Leu
50 55 60
Gln Asp Asn Tyr Pro Glu Phe Val Ala Lys Gln Val Phe Ile Asn Ala
65 70 75 80
Pro Trp Trp Tyr Leu Ala Phe Tyr Thr Met Ile Ser Pro Phe Met Thr
85 90 95
Gln Arg Thr Lys Ser Lys Phe Val Phe Ala Ser Thr Ala Lys Thr Pro
100 105 110
Glu Thr Leu Phe Lys Tyr Val Asn Pro Glu His Val Pro Ile Gln Tyr
115 120 125
Gly Gly Leu Ser Val Asp Tyr Cys Asp Cys Asn Pro Glu Phe Thr Ile
130 135 140
Asp Asp Pro Ala Ser Val Val Thr Val Lys Pro Ala Thr Lys Gln Thr
145 150 155 160
Val Glu Ile Ile Val Asn Glu Lys Cys Leu Phe Val Trp Glu Leu Arg
165 170 175
Val Val Gly Trp Glu Val Ser Tyr Ser Ala Glu Tyr Val Pro Asn Asn
180 185 190
Glu Ser His Tyr Thr Ile Ile Ile Gln Lys Ala Arg Lys Met Thr Pro
195 200 205
Thr Asp Glu Pro Val Ile Ser His Ser Phe Lys Ile Ser Glu Leu Gly
210 215 220
Lys Ile Leu Leu Thr Val Asp Asn Pro Thr Ser Lys Lys Lys Thr Leu
225 230 235 240
Leu Tyr Arg Phe Lys Val Asn Pro Leu Ser Glu
245 250
<210>13
<211>1409
<212>DNA
<213>Zea mays
<400>13
cggccgcgct cccggcgtcc gagctccgcg cgctcgccga cctcaaggcg ctgctcgcca 60
cccacccgga ccccatctcc atctggggcg tcccgctgaa ccctcgctcc cctcccgccg 120
cggcggacga cgccgccccg gtcgacgagc gcgccgacgt ggtgctcctc aagttcctcc 180
gcgcgcggga cttccgcgtc cgcgacgccc acgccatggt gctccgctgc gccgcctggc 240
gcgccgagtt cggcgccgac gccgtgctgg acgaggagct gggcttcaag gacctggagg 300
gcatcgtcgc ctacatgcac ggctgggacc gcgacggcca ccccgtctgc tacaacgcct 360
acggcgtctt caaagacagg gacatgtacg agcgcgtctt cggcgacggc gaccgcctcg 420
cgcgcttcct ccgctggcgc gtccaggtca tggagcgtgg cgtgcgcgcg ctcaccctga 480
ggcccggggg cgtcaacgcc atcatacagg tcaccgacct caaggacatg ccaaagcggg 540
agctcagagc cgccagcaac cagatcctct ccctcttcca ggacaactac ccggagatgg 600
tagcgcggaa ggtgttcatt aacgtgccgt ggtacttctc cgtgctcttc tccatgatct 660
cgcccttcct cacggagcgc accaagagca agttcgtcat cgcgcgcgag ggcaacgtcg 720
ccgagacact ctacaagttc atccggccgg agctggtgcc tgtgcagtac ggcgggctga 780
gccgcaccgg tgacctcgag aacggaccac cgaagccggc gtccgagttc accatcaagg 840
gtggcgagaa ggtcttcctg gagatcgacg gtatcgaggc cggtgcaacg ataacgtggg 900
atctggtcgt cgggggctgg gacctcgagt acggagccga gtacgtgccg gcggccgagg 960
acagctacac gctctgcgtg gagaagacga ggatggtctc ggccaccgcc gaggagcccg 1020
tgcacaatgc cttcacggct agggaagccg gcaagatggt gctgtccata gacaattctg 1080
gttcccggaa gcgcaaggtt gccgcctaca ggtacttcgt gcgcaagtcg tcggcgtagg 1140
catgctgatg tcttgtggtg gtcgtcgcgg ccggcacctt gccggcaggt agccgcgagg 1200
gggaggttaa cgtggaacaa ctggtgaagt gtttacttgg gcttgtggga tttggtgggg 1260
agaggtcaag ttaggctgtt taattattag tgctataacg acatgcatgt attatgccat 1320
atgtagaata taatgtactc cgtaagagct gttgcacttt tgtcatcacc actgttataa 1380
agctaatgat gagttaagtt ttttgggct 1409
<210>14
<211>307
<212>PRT
<213>Zea mays
<400>14
Met Val Leu Arg Cys Ala Ala Trp Arg Ala Glu Phe Gly Ala Asp Ala
1 5 10 15
Val Leu Asp Glu Glu Leu Gly Phe Lys Asp Leu Glu Gly Ile Val Ala
20 25 30
Tyr Met His Gly Trp Asp Arg Asp Gly His Pro Val Cys Tyr Asn Ala
35 40 45
Tyr Gly Val Phe Lys Asp Arg Asp Met Tyr Glu Arg Val Phe Gly Asp
50 55 60
Gly Asp Arg Leu Ala Arg Phe Leu Arg Trp Arg Val Gln Val Met Glu
65 70 75 80
Arg Gly Val Arg Ala Leu Thr Leu Arg Pro Gly Gly Val Asn Ala Ile
85 90 95
Ile Gln Val Thr Asp Leu Lys Asp Met Pro Lys Arg Glu Leu Arg Ala
100 105 110
Ala Ser Asn Gln Ile Leu Ser Leu Phe Gln Asp Asn Tyr Pro Glu Met
115 120 125
Val Ala Arg Lys Val Phe Ile Asn Val Pro Trp Tyr Phe Ser Val Leu
130 135 140
Phe Ser Met Ile Ser Pro Phe Leu Thr Glu Arg Thr Lys Ser Lys Phe
145 150 155 160
Val Ile Ala Arg Glu Gly Asn Val Ala Glu Thr Leu Tyr Lys Phe Ile
165 170 175
Arg Pro Glu Leu Val Pro Val Gln Tyr Gly Gly Leu Ser Arg Thr Gly
180 185 190
Asp Leu Glu Asn Gly Pro Pro Lys Pro Ala Ser Glu Phe Thr Ile Lys
195 200 205
Gly Gly Glu Lys Val Phe Leu Glu Ile Asp Gly Ile Glu Ala Gly Ala
210 215 220
Thr Ile Thr Trp Asp Leu Val Val Gly Gly Trp Asp Leu Glu Tyr Gly
225 230 235 240
Ala Glu Tyr Val Pro Ala Ala Glu Asp Ser Tyr Thr Leu Cys Val Glu
245 250 255
Lys Thr Arg Met Val Ser Ala Thr Ala Glu Glu Pro Val His Asn Ala
260 265 270
Phe Thr Ala Arg Glu Ala Gly Lys Met Val Leu Ser Ile Asp Asn Ser
275 280 285
Gly Ser Arg Lys Arg Lys Val Ala Ala Tyr Arg Tyr Phe Val Arg Lys
290 295 300
Ser Ser Ala
305
<210>15
<211>1160
<212>DNA
<213>Glycine max
<400>15
accatgctga agaacacgat ccagtggagg aaggagtttg ggatggagga gctgatggaa 60
gagaagctcg gggatgagtt ggagaaggtg gtgttcatgc acggctttga caaggagggt 120
caccctgtgt gttacaacat atatgaggag ttccagaaca aggagttgta caagaagact 180
ttttctgatg aggagaagag ggagaagttc ctgaggtgga gaattcagtt cctggagaag 240
agtataagga agcttgattt caaccctggt ggcatatgca ccattgttca tgttaatgac 300
ctcaagaact ctcctggact tgccaagtgg gaacttagac aagccaccaa acatgcccta 360
caattgcttc aagacaatta ccctgaattt gttgccaaac aggtttttat taatgtgcct 420
tggtggtacc tggcagtgaa taggatgata agcccttttc ttactcagag gactaaaagc 480
aagtttgtct ttgctgggcc ttccaaatca acggaaaccc ttttgagata catagctccg 540
gagcagcttc ccgtgaagta cggtggacta agcaaagatg gggagttcgg aaatatcgat 600
gctgtcacag aaatcacagt gaggccggca gcaaaacata cagtggaatt ttcagttact 660
gagaactgct tactctcttg ggagcttaga gtaataggat gggaagtaag ttatggcgca 720
gaatttgtgc caagctcaga aggaagctac acagtaattg tccagaaggc taggaaggtt 780
gcttcatcag aagaaccagt tctttgcaac agttttaagg ttggtgaacc tgggaaagtt 840
gttctcacca ttgacaacac aagctctaag aagaagaagc tcttgtatcg cttgaagacc 900
aagccttccc cttctgacta aaatcatcat tgttatgtgt atgtgcaagt gggaagggga 960
agaacagtgc cagaattgtt tgcttacatt cctgcatcaa cttctaccaa gacaccagtg 1020
cttcaaagat taactttcca gctgttcata tatgaatata aagtttttgt ttttaatttt 1080
ttggcctttt tgttttgatg attattatgt gttcaaccat attatataat tatatatacc 1140
tctttctgtg tttttcttgc 1160
<210>16
<211>305
<212>PRT
<213>Glycine max
<400>16
Met Leu Lys Asn Thr Ile Gln Trp Arg Lys Glu Phe Gly Met Glu Glu
1 5 10 15
Leu Met Glu Glu Lys Leu Gly Asp Glu Leu Glu Lys Val Val Phe Met
20 25 30
His Gly Phe Asp Lys Glu Gly His Pro Val Cys Tyr Asn Ile Tyr Glu
35 40 45
Glu Phe Gln Asn Lys Glu Leu Tyr Lys Lys Thr Phe Ser Asp Glu Glu
50 55 60
Lys Arg Glu Lys Phe Leu Arg Trp Arg Ile Gln Phe Leu Glu Lys Ser
65 70 75 80
Ile Arg Lys Leu Asp Phe Asn Pro Gly Gly Ile Cys Thr Ile Val His
85 90 95
Val Asn Asp Leu Lys Asn Ser Pro Gly Leu Ala Lys Trp Glu Leu Arg
100 105 110
Gln Ala Thr Lys His Ala Leu Gln Leu Leu Gln Asp Asn Tyr Pro Glu
115 120 125
Phe Val Ala Lys Gln Val Phe Ile Asn Val Pro Trp Trp Tyr Leu Ala
130 135 140
Val Asn Arg Met Ile Ser Pro Phe Leu Thr Gln Arg Thr Lys Ser Lys
145 150 155 160
Phe Val Phe Ala Gly Pro Ser Lys Ser Thr Glu Thr Leu Leu Arg Tyr
165 170 175
Ile Ala Pro Glu Gln Leu Pro Val Lys Tyr Gly Gly Leu Ser Lys Asp
180 185 190
Gly Glu Phe Gly Asn Ile Asp Ala Val Thr Glu Ile Thr Val Arg Pro
195 200 205
Ala Ala Lys His Thr Val Glu Phe Ser Val Thr Glu Asn Cys Leu Leu
210 215 220
Ser Trp Glu Leu Arg Val Ile Gly Trp Glu Val Ser Tyr Gly Ala Glu
225 230 235 240
Phe Val Pro Ser Ser Glu Gly Ser Tyr Thr Val Ile Val Gln Lys Ala
245 250 255
Arg Lys Val Ala Ser Ser Glu Glu Pro Val Leu Cys Asn Ser Phe Lys
260 265 270
Val Gly Glu Pro Gly Lys Val Val Leu Thr Ile Asp Asn Thr Ser Ser
275 280 285
Lys Lys Lys Lys Leu Leu Tyr Arg Leu Lys Thr Lys Pro Ser Pro Ser
290 295 300
Asp
305
<210>17
<211>2212
<212>DNA
<213>Triticum aestivum
<400>17
ccacgcgtcc gctctccccc tctccctcct gctctatctt tctcaccgca aaagcttgaa 60
acaccccgcg gagatggcag aggagccgca gccacaggcc gccgccgccc ccgccgccgc 120
ggccacggag gtggtcgtcg ccgagaaggc gccggcggag gtggagaaga aggccgagga 180
gcccgcggcg gaggcggagg ccgaggagac ggccgccgtt gccgacgacg ggggcgccgt 240
cgaggccacc ggctctttca aggaggagag caacctcgtc gccgacctgc ctgacccgga 300
gaagaaggcg ctcgacgagt tcaaggagct gatcgtcgcc gcgctcgccg ccggtgagtt 360
caatctgccc cctcccccgc cgccgccgaa ggccaagact gaggccgccg cagaggagac 420
caagacggag gcgccggcca aggaggaggc caagaccgag gagccggcca aggcggaaga 480
accagccaag gaggagccca aggctgaaga gccggccaag gccgaggcgg cagcggcgga 540
gccagcagcc gaggagccca aggccgtggt cgctgccgag gcagcagccg aggagccggc 600
caaggaggaa cccaaggccg aggaggccaa gccggccgag ccaaagaagg aggaggaagc 660
agtcgtggtc gccgaggagg gcaccaagac ggcggaaccg gtcgaggagg ccgccgccgc 720
cgccaccacc acagagcagg cagcggcgcc ggaaccggag gcggaggcag ccgcgcccga 780
gccggtgttc atctggggcg tgccgctggt gggcgacgac gagcgcacgg acgcggtgct 840
gctcaagttc ctgcgcgcgc gggagttcaa ggtgaaggag gcgatggcga tgctccggtc 900
cgccgtgctc tggcggaagc gcttcggcat cgagtcgctc ctggaggcgg acctggcctt 960
cccggagctg gagaaggtgg tgttctaccg cggcgccgac cgggagggcc accccgtctg 1020
ctacaacgtg tacggcgagt tccaggacaa ggaggtgtac gagaaggcgt tcggcgacga 1080
ggagaagcgg gagcggttcc tcaagtggcg catccagctg ctggagcgcg gcatcctgtc 1140
gcagctggac ttcgcgccca gcggcatctg ctccatggtg caggtcaccg acctcaagaa 1200
ctcgccgccc atgctcggca agcaccgcgc cgtcacccgc caggccgtcg ccctgctcca 1260
ggacaactac cccgagttca tcgccaagaa ggtgttcatc aacgtgccat ggtggtatct 1320
cgctgccaac aaaatgatga gccctttcct cacccagcgc accaagagca agttcgtgtt 1380
cgccagccag gccaagtcac ccgagaccct cttcagatac atcgcgccgg agcaagtccc 1440
cgtccaattc ggaggcctct tcaaggaaga tgaccctgat ttcaccacct ccgactctgt 1500
caccgagctc accatcaaag cttcatccaa agaaaccatt gagatccctg tcaccgagaa 1560
ctcaacgatt gtatgggagc tccgggtgct cggctgggag gtcagccacg gcgcggagtt 1620
cacccccgac gctgagggcg cgtacaccgt catcgtgcag aagacaagga aggtccccgc 1680
gaatgaggag cccatcatga agggcagctt caaggccggc gaggccggca agatcgtgct 1740
cacggtcagc aacgccgcgt cgaagaagaa gaagctcctc tacagatcca aggtgaagtg 1800
cagcaccggc gagtccgttg aggccgacat tccatgacca ccattggagt cagtccctga 1860
tgatgataga agaagaagaa gaagaagaag ataaaccgcc tttttggttt ttgttctttg 1920
attccattgg ttttgtggtt tttggttcgc attcccgcat ttgtttaatt attaaaatta 1980
aaaacccaaa gtgagcttga ttttgtgacg gtacagtagt tgggagagga aggttggtat 2040
ggatgggatg atataatggc atcgtgatgg ttgttgaggg tagggcaaga ggagaaaaat 2100
ggatgataca atctgctgct gctctgtaaa tttgtctgta cattgttgca atcgctggct 2160
ggatcctcat ggacatgtta tatttagaag tacctgctgc catcaaaaaa ac 2212
<210>18
<211>587
<212>PRT
<213>Triticum aestivum
<400>18
Met Ala Glu Glu Pro Gln Pro Gln Ala Ala Ala Ala Pro Ala Ala Ala
1 5 10 15
Ala Thr Glu Val Val Val Ala Glu Lys Ala Pro Ala Glu Val Glu Lys
20 25 30
Lys Ala Glu Glu Pro Ala Ala Glu Ala Glu Ala Glu Glu Thr Ala Ala
35 40 45
Val Ala Asp Asp Gly Gly Ala Val Glu Ala Thr Gly Ser Phe Lys Glu
50 55 60
Glu Ser Asn Leu Val Ala Asp Leu Pro Asp Pro Glu Lys Lys Ala Leu
65 70 75 80
Asp Glu Phe Lys Glu Leu Ile Val Ala Ala Leu Ala Ala Gly Glu Phe
85 90 95
Asn Leu Pro Pro Pro Pro Pro Pro Pro Lys Ala Lys Thr Glu Ala Ala
100 105 110
Ala Glu Glu Thr Lys Thr Glu Ala Pro Ala Lys Glu Glu Ala Lys Thr
115 120 125
Glu Glu Pro Ala Lys Ala Glu Glu Pro Ala Lys Glu Glu Pro Lys Ala
130 135 140
Glu Glu Pro Ala Lys Ala Glu Ala Ala Ala Ala Glu Pro Ala Ala Glu
145 150 155 160
Glu Pro Lys Ala Val Val Ala Ala Glu Ala Ala Ala Glu Glu Pro Ala
165 170 175
Lys Glu Glu Pro Lys Ala Glu Glu Ala Lys Pro Ala Glu Pro Lys Lys
180 185 190
Glu Glu Glu Ala Val Val Val Ala Glu Glu Gly Thr Lys Thr Ala Glu
195 200 205
Pro Val Glu Glu Ala Ala Ala Ala Ala Thr Thr Thr Glu Gln Ala Ala
210 215 220
Ala Pro Glu Pro Glu Ala Glu Ala Ala Ala Pro Glu Pro Val Phe Ile
225 230 235 240
Trp Gly Val Pro Leu Val Gly Asp Asp Glu Arg Thr Asp Ala Val Leu
245 250 255
Leu Lys Phe Leu Arg Ala Arg Glu Phe Lys Val Lys Glu Ala Met Ala
260 265 270
Met Leu Arg Ser Ala Val Leu Trp Arg Lys Arg Phe Gly Ile Glu Ser
275 280 285
Leu Leu Glu Ala Asp Leu Ala Phe Pro Glu Leu Glu Lys Val Val Phe
290 295 300
Tyr Arg Gly Ala Asp Arg Glu Gly His Pro Val Cys Tyr Asn Val Tyr
305 310 315 320
Gly Glu Phe Gln Asp Lys Glu Val Tyr Glu Lys Ala Phe Gly Asp Glu
325 330 335
Glu Lys Arg Glu Arg Phe Leu Lys Trp Arg Ile Gln Leu Leu Glu Arg
340 345 350
Gly Ile Leu Ser Gln Leu Asp Phe Ala Pro Ser Gly Ile Cys Ser Met
355 360 365
Val Gln Val Thr Asp Leu Lys Asn Ser Pro Pro Met Leu Gly Lys His
370 375 380
Arg Ala Val Thr Arg Gln Ala Val Ala Leu Leu Gln Asp Asn Tyr Pro
385 390 395 400
Glu Phe Ile Ala Lys Lys Val Phe Ile Asn Val Pro Trp Trp Tyr Leu
405 410 415
Ala Ala Asn Lys Met Met Ser Pro Phe Leu Thr Gln Arg Thr Lys Ser
420 425 430
Lys Phe Val Phe Ala Ser Gln Ala Lys Ser Pro Glu Thr Leu Phe Arg
435 440 445
Tyr Ile Ala Pro Glu Gln Val Pro Val Gln Phe Gly Gly Leu Phe Lys
450 455 460
Glu Asp Asp Pro Asp Phe Thr Thr Ser Asp Ser Val Thr Glu Leu Thr
465 470 475 480
Ile Lys Ala Ser Ser Lys Glu Thr Ile Glu Ile Pro Val Thr Glu Asn
485 490 495
Ser Thr Ile Val Trp Glu Leu Arg Val Leu Gly Trp Glu Val Ser His
500 505 510
Gly Ala Glu Phe Thr Pro Asp Ala Glu Gly Ala Tyr Thr Val Ile Val
515 520 525
Gln Lys Thr Arg Lys Val Pro Ala Asn Glu Glu Pro Ile Met Lys Gly
530 535 540
Ser Phe Lys Ala Gly Glu Ala Gly Lys Ile Val Leu Thr Val Ser Asn
545 550 555 560
Ala Ala Ser Lys Lys Lys Lys Leu Leu Tyr Arg Ser Lys Val Lys Cys
565 570 575
Ser Thr Gly Glu Ser Val Glu Ala Asp Ile Pro
580 585
<210>19
<211>1393
<212>DNA
<213>Triticum aestivum
<400>19
ggtccggaat ttccgggtcg acccacccgt ccgggaggag cccaaggccg aggaggccat 60
cgaggagacc gccgtccccg ccgcggccga gccggaggcg gcgcccgccg ccgagcccaa 120
ggaggagctg atctggggcg tgccgctggt gggcggcgac gagcgcacgg acacggtgct 180
cctcaagttc ctccgcgcgc gcgagttcaa ggtgaaggag gccatggcga tgctcaaggc 240
ggcggtgctg tggcgcaaga gcttcggcat cgacgcgctc ctgggcgccg acctcggcgt 300
gccggagctg gagaacgtcg tcttctaccg cggcgccgac cgcgagggcc accccgtctg 360
ctacaacgtc tacagcgagt tccaggacaa ggacctctac gagaaggcct tcggcgacga 420
cgagaagcgg gagcgcttcc tcaggtggcg catccagctc ctcgagcgcg gcatccggga 480
gcagctcgac ttctcgccca gcggcatctg ctccatggtg caggtcaccg acctcaagaa 540
ctcgccgccc atgctcggca agcaccgcgc cgtcacccgc caggcgctcg cgctgctcca 600
ggacaactac cctgaattca tcgccaagaa ggtgttcatc aatgtgccat ggtggtatct 660
tgcggcaaac aagatgatga gcccattcct cacacagcgc accaagagca aattcacgtt 720
ttgcagccca gccaagaccg cagagaccct attcagatac atcgcgccgg agcaggtccc 780
tgtccaattc ggcggcctct acaaagagga tgatactgaa ttctccactt ctgatggcgt 840
gaccgagctc actgtcaaac cttcttccaa agaaactgtt gagattcctg ctactgagaa 900
ctccacggtc gtgtgggagc tccgtgtgct tggatgggag gtgagctacg gcgtggagtt 960
caccccggac gccgagggcg gctacacggt catcgtgcag aagactcgga aggtgcccgc 1020
caacgaggag ccaatcatga agggtagctt caaagcgagc gagcctggca aggtggtgct 1080
catcgtcaac aacccgacgt cgaagaagaa gaagctgctg tgccgattca aggtgaagag 1140
ctccaccgaa tcctccgcct gatgaggttc cagctgctga tacaaccgcc aaccaggtcc 1200
ataccaccgc caccatttga acatgtcgca tgatagggga gagcaaataa gattttagta 1260
gatggccgtt ttcgtgtcgg gttctttgat ttgttggctt gctgtttttt gggggtcgga 1320
tttgtatgtg tgtttactcg gaaccaaagt gggcttgttc tatgaatgag ataggtactc 1380
ctggctggga tcc 1393
<210>20
<211>312
<212>PRT
<213>Triticum aestivum
<400>20
Met Ala Met Leu Lys Ala Ala Val Leu Trp Arg Lys Ser Phe Gly Ile
1 5 10 15
Asp Ala Leu Leu Gly Ala Asp Leu Gly Val Pro Glu Leu Glu Asn Val
20 25 30
Val Phe Tyr Arg Gly Ala Asp Arg Glu Gly His Pro Val Cys Tyr Asn
35 40 45
Val Tyr Ser Glu Phe Gln Asp Lys Asp Leu Tyr Glu Lys Ala Phe Gly
50 55 60
Asp Asp Glu Lys Arg Glu Arg Phe Leu Arg Trp Arg Ile Gln Leu Leu
65 70 75 80
Glu Arg Gly Ile Arg Glu Gln Leu Asp Phe Ser Pro Ser Gly Ile Cys
85 90 95
Ser Met Val Gln Val Thr Asp Leu Lys Asn Ser Pro Pro Met Leu Gly
100 105 110
Lys His Arg Ala Val Thr Arg Gln Ala Leu Ala Leu Leu Gln Asp Asn
115 120 125
Tyr Pro Glu Phe Ile Ala Lys Lys Val Phe Ile Asn Val Pro Trp Trp
130 135 140
Tyr Leu Ala Ala Asn Lys Met Met Ser Pro Phe Leu Thr Gln Arg Thr
145 150 155 160
Lys Ser Lys Phe Thr Phe Cys Ser Pro Ala Lys Thr Ala Glu Thr Leu
165 170 175
Phe Arg Tyr Ile Ala Pro Glu Gln Val Pro Val Gln Phe Gly Gly Leu
180 185 190
Tyr Lys Glu Asp Asp Thr Glu Phe Ser Thr Ser Asp Gly Val Thr Glu
195 200 205
Leu Thr Val Lys Pro Ser Ser Lys Glu Thr Val Glu Ile Pro Ala Thr
210 215 220
Glu Asn Ser Thr Val Val Trp Glu Leu Arg Val Leu Gly Trp Glu Val
225 230 235 240
Ser Tyr Gly Val Glu Phe Thr Pro Asp Ala Glu Gly Gly Tyr Thr Val
245 250 255
Ile Val Gln Lys Thr Arg Lys Val Pro Ala Asn Glu Glu Pro Ile Met
260 265 270
Lys Gly Ser Phe Lys Ala Ser Glu Pro Gly Lys Val Val Leu Ile Val
275 280 285
Asn Asn Pro Thr Ser Lys Lys Lys Lys Leu Leu Cys Arg Phe Lys Val
290 295 300
Lys Ser Ser Thr Glu Ser Ser Ala
305 310
<210>21
<211>2018
<212>DNA
<213>Glycine max
<220>
<221>misc_feature
<222>(1862)..(1862)
<223>n is a, c, g, or t
<220>
<221>misc_feature
<222>(1878)..(1878)
<223>n is a, c, g, or t
<220>
<221>misc_feature
<222>(1889)..(1889)
<223>n is a, c, g, or t
<220>
<221>misc_feature
<222>(1922)..(1922)
<223>n is a, c, g, or t
<220>
<221>misc_feature
<222>(1929)..(1930)
<223>n is a, c, g, or t
<220>
<221>misc_feature
<222>(1936)..(1936)
<223>n is a, c, g, or t
<220>
<221>misc_feature
<222>(1962)..(1962)
<223>n is a, c, g, or t
<220>
<221>misc_feature
<222>(1971)..(1972)
<223>n is a, c, g, or t
<220>
<221>misc_feature
<222>(2006)..(2006)
<223>n is a, c, g, or t
<400>21
gaagcaactc ttgtcttccc tttcaacttc aactcaaccc tcaatcaaac tcattctgtc 60
ttttcttcta tttttatttc tatccaatca tccatggccg aggaacccca aaaaccagcc 120
tccgctgaag aagtggtcgc tgttcctgcg gagaacccac catctgaagc cgaagctgaa 180
aacatcgaag cagagaaggc ccagagtggt gtagaagaca agatttccca gtcggtttcg 240
ttcaaggagg agaccaacgt ggttggcgac ctccccgagg cgcagaagaa agcccttgat 300
gagctcaaga agcttgttca agaagcgctc aacaaccatg agctaactgc tcccaagcca 360
gaaccggaga agaagaaacc agcagcagag aagaaggagg aagttgaagt gacagaaggg 420
aagaaggaag ctgaagtcat agaagagaag aaggaagtgg aagtgacgga agagaagaag 480
gaaattgaag taacagaaga gaagaaggaa gctgaagtca tagaagagaa gaaggaagtg 540
gaagtgacgg aagagaagaa ggaaattgaa gtaacagaag agaagaagga agcagaagtg 600
aaagaagaga agaaggaagg ggaagtgacg gaagagaaga aggaagttga agtgacggaa 660
gagaagaagg aagcggaagt gatagtagaa gagaagaagg aagttgaagt gacggaagag 720
aagaaggaag tggaagtgac cgaaggaaag aaagaagtgg aagtgatcga agagaagaag 780
gaaacagaag tgacagaaga aaagaaagaa gtggaagtag aagtgaggga agagaagaag 840
gaaagtgaag tgaaagaaga agaaaaaggt cgggaggttg ttccagagga agttgagata 900
tggggaattc ccctgctggg ggacgagagg agcgatgtga ttctgctaaa gtttcttagg 960
gcaagggatt tcaaggtgaa ggaggccttg aacatgataa gaaacacggt gcgatggaga 1020
aaggaatttg gaatagaggg tctagtggag gaagatcttg gaagtgattg ggagaaggtg 1080
gtgttcaagg atggatacga caaagaaggg cacccggtgt actacaacgt ctttggggag 1140
tttgaggaca aggagttgta cagcaagacg tttttggacg aggaaaagag gaacaagttc 1200
ataaggtgga ggattcagtc gttggagaag agcgttagaa gccttgactt ctctccgaat 1260
gggatatcga caatagttca ggtgaacgac cttaagaact ctcccggact gggcaagagg 1320
gaactgaggc aggccaccaa tcaggccctt caactgcttc aggacaacta ccctgagttc 1380
gttgccaagc agatattcat caatgtcccc tggtggtacc ttgccttttc taggatgatc 1440
agtcccttct tcacacagag gaccaagagt aaatttgttt ttgctggccc ttccaaatct 1500
gctgataccc ttttcagata tatagctccg gagctggtcc cggttcaata cggtggtctt 1560
agcagagagg ctgaacagga attcacctct gcttaccctg ttacggagtt tactattaaa 1620
cccgctacca aacattctgt tgagttccct gtttctgaga aaagccatct tgtttgggaa 1680
atccgagtgg tgggttggga tgtcagctat ggagctgaat ttgtgcccag cgctgaggat 1740
ggatacactg tcatagtaca caagagcagg aaaattgctc ccgctgatga gaccgttctt 1800
accaacggtt tcagaattgg tgaacctggc aagattgtac tcaccataga caaccaaaca 1860
tncaagaaga agaaactnct ctacaggtnc cagaccaaac ccattgcaga gtaagcttga 1920
tnaggatgnn tactgntact gtatattcat cattacaacc ancacaacat nngtggttgg 1980
gcgagggact tattctttca tacggngtct tcttttct 2018
<210>22
<211>606
<212>PRT
<213>Glycine max
<220>
<221>UNSURE
<222>(590)..(590)
<223>Xaa can be any naturally occurring amino acid
<220>
<221>UNSURE
<222>(599)..(599)
<223>Xaa can be any naturally occurring amino acid
<400>22
Met Ala Glu Glu Pro Gln Lys Pro Ala Ser Ala Glu Glu Val Val Ala
1 5 10 15
Val Pro Ala Glu Asn Pro Pro Ser Glu Ala Glu Ala Glu Asn Ile Glu
20 25 30
Ala Glu Lys Ala Gln Ser Gly Val Glu Asp Lys Ile Ser Gln Ser Val
35 40 45
Ser Phe Lys Glu Glu Thr Asn Val Val Gly Asp Leu Pro Glu Ala Gln
50 55 60
Lys Lys Ala Leu Asp Glu Leu Lys Lys Leu Val Gln Glu Ala Leu Asn
65 70 75 80
Asn His Glu Leu Thr Ala Pro Lys Pro Glu Pro Glu Lys Lys Lys Pro
85 90 95
Ala Ala Glu Lys Lys Glu Glu Val Glu Val Thr Glu Gly Lys Lys Glu
100 105 110
Ala Glu Val Ile Glu Glu Lys Lys Glu Val Glu Val Thr Glu Glu Lys
115 120 125
Lys Glu Ile Glu Val Thr Glu Glu Lys Lys Glu Ala Glu Val Ile Glu
130 135 140
Glu Lys Lys Glu Val Glu Val Thr Glu Glu Lys Lys Glu Ile Glu Val
145 150 155 160
Thr Glu Glu Lys Lys Glu Ala Glu Val Lys Glu Glu Lys Lys Glu Gly
165 170 175
Glu Val Thr Glu Glu Lys Lys Glu Val Glu Val Thr Glu Glu Lys Lys
180 185 190
Glu Ala Glu Val Ile Val Glu Glu Lys Lys Glu Val Glu Val Thr Glu
195 200 205
Glu Lys Lys Glu Val Glu Val Thr Glu Gly Lys Lys Glu Val Glu Val
210 215 220
Ile Glu Glu Lys Lys Glu Thr Glu Val Thr Glu Glu Lys Lys Glu Val
225 230 235 240
Glu Val Glu Val Arg Glu Glu Lys Lys Glu Ser Glu Val Lys Glu Glu
245 250 255
Glu Lys Gly Arg Glu Val Val Pro Glu Glu Val Glu Ile Trp Gly Ile
260 265 270
Pro Leu Leu Gly Asp Glu Arg Ser Asp Val Ile Leu Leu Lys Phe Leu
275 280 285
Arg Ala Arg Asp Phe Lys Val Lys Glu Ala Leu Asn Met Ile Arg Asn
290 295 300
Thr Val Arg Trp Arg Lys Glu Phe Gly Ile Glu Gly Leu Val Glu Glu
305 310 315 320
Asp Leu Gly Ser Asp Trp Glu Lys Val Val Phe Lys Asp Gly Tyr Asp
325 330 335
Lys Glu Gly His Pro Val Tyr Tyr Asn Val Phe Gly Glu Phe Glu Asp
340 345 350
Lys Glu Leu Tyr Ser Lys Thr Phe Leu Asp Glu Glu Lys Arg Asn Lys
355 360 365
Phe Ile Arg Trp Arg Ile Gln Ser Leu Glu Lys Ser Val Arg Ser Leu
370 375 380
Asp Phe Ser Pro Asn Gly Ile Ser Thr Ile Val Gln Val Asn Asp Leu
385 390 395 400
Lys Asn Ser Pro Gly Leu Gly Lys Arg Glu Leu Arg Gln Ala Thr Asn
405 410 415
Gln Ala Leu Gln Leu Leu Gln Asp Asn Tyr Pro Glu Phe Val Ala Lys
420 425 430
Gln Ile Phe Ile Asn Val Pro Trp Trp Tyr Leu Ala Phe Ser Arg Met
435 440 445
Ile Ser Pro Phe Phe Thr Gln Arg Thr Lys Ser Lys Phe Val Phe Ala
450 455 460
Gly Pro Ser Lys Ser Ala Asp Thr Leu Phe Arg Tyr Ile Ala Pro Glu
465 470 475 480
Leu Val Pro Val Gln Tyr Gly Gly Leu Ser Arg Glu Ala Glu Gln Glu
485 490 495
Phe Thr Ser Ala Tyr Pro Val Thr Glu Phe Thr Ile Lys Pro Ala Thr
500 505 510
Lys His Ser Val Glu Phe Pro Val Ser Glu Lys Ser His Leu Val Trp
515 520 525
Glu Ile Arg Val Val Gly Trp Asp Val Ser Tyr Gly Ala Glu Phe Val
530 535 540
Pro Ser Ala Glu Asp Gly Tyr Thr Val Ile Val His Lys Ser Arg Lys
545 550 555 560
Ile Ala Pro Ala Asp Glu Thr Val Leu Thr Asn Gly Phe Arg Ile Gly
565 570 575
Glu Pro Gly Lys Ile Val Leu Thr Ile Asp Asn Gln Thr Xaa Lys Lys
580 585 590
Lys Lys Leu Leu Tyr Arg Xaa Gln Thr Lys Pro Ile Ala Glu
595 600 605
<210>23
<211>1642
<212>DNA
<213>Glycine max
<400>23
gaaatgaaag ctaaggaaag gaagggtaaa ggagcgtaaa actggaggag tttgtgtctt 60
gttttcttgt ggcggggcat aggattccga ataaatgcat gtttcaactt tcctttttct 120
tctctcagtc tctctagcta gctttcttct ccactctctc aaatggccca aaatgattcc 180
aaccctactc cgcctccgga accccatgta gcggcggaac ccattactga ggatttggtc 240
caagacaaag aagaagagga tgatagtagt aagattgtta ttccagtccc agagagcgag 300
tccttgtcat tgaaggagga tagcaatagg gtttctgatt ccgagaaaaa tgccattgat 360
gagctgaaga agctcctgaa agaggaatta gaggacgagg aggtttccat ctggggcgtc 420
cctctcttta aggatgacag gactgacgtc attctcctca agtttctcag agctcgtgag 480
ctcaaagtga aggacgccct tgtcatgttt caaaacactc tccgatggag gaaggacttc 540
aacatcgacg cccttctgga tgaagatctg ggcgaccact tggagaaggt tgtcttcatg 600
cacggacacg gcagagaggg ccatcccgtc tgttacaacg tctacggcga gttccagaac 660
aaggacctct accacaaggc cttctcctct caggataatc gaaacaagtt tctccgatgg 720
cgtattcagt tgttggagcg cagtattcgg cacctcgact tcactccttc ctccggcatc 780
aacaccattt tccaagtcaa tgacctcaaa aactcccctg gccctgctaa acgtgagctt 840
cgccttgcca ccaaacaagc tttgcagttg cttcaggaca actatcccga atttgttgcc 900
aaacaggttt ttatcaacgt cccatggtgg tatcttgctt tctataccat gatcaatccc 960
ttcttgactt cgaggaccaa aagcaaattt gtctttgctg gaccatccaa gtcccccgat 1020
actcttttca agtatatttc tcctgagcaa gtgcccgttc agtatggtgg cctcagtgta 1080
gatttctgtg actgcaaccc cgatttcact atgtctgatc ctgtcaccga aattcctata 1140
aagcctacca ctaagcaaac tgtggaaatt gctatttatg agaagtgcat tattgtttgg 1200
gagctgcgcg tggtgggctg ggaggttagc tacaatgctg aattcaagcc tgatgttgaa 1260
gatgcatata cggttatcat acagaaggcc acaaagatgt cccccaccga tgaaccagtt 1320
gtttccaata gctttaaagt tgttgaactg ggaaaattgt tgctcaccat agacaatcct 1380
accttgaaaa aaaagaggct tctttacagg ttcaagatca aaccctactc tgattgagag 1440
aaaataacct ctggtggtgg ttttgggtac atgaaggaaa ttgaacttga agagtaagaa 1500
atatgcatgt catcatttgt tcggtccatt tttcatctat agttttggtc gtgagttgtt 1560
tgcagattgt ctttctctgt ctttgttggt tggtggggtg tttgaacact ttattgctag 1620
tctaaattgt tctttaattt tc 1642
<210>24
<211>424
<212>PRT
<213>Glycine max
<400>24
Met Ala Gln Asn Asp Ser Asn Pro Thr Pro Pro Pro Glu Pro His Val
1 5 10 15
Ala Ala Glu Pro Ile Thr Glu Asp Leu Val Gln Asp Lys Glu Glu Glu
20 25 30
Asp Asp Ser Ser Lys Ile Val Ile Pro Val Pro Glu Ser Glu Ser Leu
35 40 45
Ser Leu Lys Glu Asp Ser Asn Arg Val Ser Asp Ser Glu Lys Asn Ala
50 55 60
Ile Asp Glu Leu Lys Lys Leu Leu Lys Glu Glu Leu Glu Asp Glu Glu
65 70 75 80
Val Ser Ile Trp Gly Val Pro Leu Phe Lys Asp Asp Arg Thr Asp Val
85 90 95
Ile Leu Leu Lys Phe Leu Arg Ala Arg Glu Leu Lys Val Lys Asp Ala
100 105 110
Leu Val Met Phe Gln Asn Thr Leu Arg Trp Arg Lys Asp Phe Asn Ile
115 120 125
Asp Ala Leu Leu Asp Glu Asp Leu Gly Asp His Leu Glu Lys Val Val
130 135 140
Phe Met His Gly His Gly Arg Glu Gly His Pro Val Cys Tyr Asn Val
145 150 155 160
Tyr Gly Glu Phe Gln Asn Lys Asp Leu Tyr His Lys Ala Phe Ser Ser
165 170 175
Gln Asp Asn Arg Asn Lys Phe Leu Arg Trp Arg Ile Gln Leu Leu Glu
180 185 190
Arg Ser Ile Arg His Leu Asp Phe Thr Pro Ser Ser Gly Ile Asn Thr
195 200 205
Ile Phe Gln Val Asn Asp Leu Lys Asn Ser Pro Gly Pro Ala Lys Arg
210 215 220
Glu Leu Arg Leu Ala Thr Lys Gln Ala Leu Gln Leu Leu Gln Asp Asn
225 230 235 240
Tyr Pro Glu Phe Val Ala Lys Gln Val Phe Ile Asn Val Pro Trp Trp
245 250 255
Tyr Leu Ala Phe Tyr Thr Met Ile Asn Pro Phe Leu Thr Ser Arg Thr
260 265 270
Lys Ser Lys Phe Val Phe Ala Gly Pro Ser Lys Ser Pro Asp Thr Leu
275 280 285
Phe Lys Tyr Ile Ser Pro Glu Gln Val Pro Val Gln Tyr Gly Gly Leu
290 295 300
Ser Val Asp Phe Cys Asp Cys Asn Pro Asp Phe Thr Met Ser Asp Pro
305 310 315 320
Val Thr Glu Ile Pro Ile Lys Pro Thr Thr Lys Gln Thr Val Glu Ile
325 330 335
Ala Ile Tyr Glu Lys Cys Ile Ile Val Trp Glu Leu Arg Val Val Gly
340 345 350
Trp Glu Val Ser Tyr Asn Ala Glu Phe Lys Pro Asp Val Glu Asp Ala
355 360 365
Tyr Thr Val Ile Ile Gln Lys Ala Thr Lys Met Ser Pro Thr Asp Glu
370 375 380
Pro Val Val Ser Asn Ser Phe Lys Val Val Glu Leu Gly Lys Leu Leu
385 390 395 400
Leu Thr Ile Asp Asn Pro Thr Leu Lys Lys Lys Arg Leu Leu Tyr Arg
405 410 415
Phe Lys Ile Lys Pro Tyr Ser Asp
420
<210>25
<211>2353
<212>DNA
<213>Zea mays
<400>25
tccgccccgc cctgtcgctt tcccttccac acgcgccgtt cgctttgatc gaccaggcag 60
gcatggccga cgagacgaag caagaagccg ccgccccggc ggccgaggtg gtcgtgacgg 120
aggaggagaa gaaggcagaa gagaccgccc cggtggcgga ggaaaaggcc gtggaggcgg 180
ctgtagagaa ggccgcggag gcggaggcgg gggccgagga aaaggccgcg gaagcggact 240
cggaggagga gaagaaggcg gaggaggccg aggaggccgc cgcgggcgat gaggcggccg 300
tgatcgatgg cactgggtcg ttcaaggagg agagtaacct ggtgtccgag ctccccgacc 360
ctgagcgcac agcgctcgcg cagctcaagg agctcgtcgc caccgcgctc gctaacgggg 420
agttcaacct gccgccgccg cctgccaagg aggaggccaa gaaggaggag ccggcaaagg 480
aagaagctcc ggcggacaag gaggacgagc ccaaggcaga ggaggcggct gcccaagagc 540
ccgtcaagga ggaggccaag cctgaggagc ccaaaacgga ggcgccggcg gaagcagcgc 600
ccgaggaggt taaggacgag acacccgtgc cggaagagac caagactgag gctcccgcgc 660
cggaggagcc caaggccgag gagcctgcca aggaggagct caaggcagag gcggcgacgg 720
aagcggtcgc cgaggagacc aaaccagctg agccggtgcc ggaggaggag gagaagacgg 780
tcgttgttgc cgaggaggag gccaccaaaa cggtggaagc catcgaggag acggtcgccg 840
tcgccgtcgc cgccgccgcg tccgaggagc ccgaggcggg cgagccgaag gaggagctga 900
tctggggcgt gccgctggcg ggcgacgacg agcgcacgga cacggtgctg ctcaagttcc 960
tccgcgcgcg cgagttcaag gtgaaggagg cgatggcgat gctcaagtcg gcggtgctgt 1020
ggcgcaagcg gttcggcatc gacgagctcc tcctggacgc cgacctcggc ctgcgggagc 1080
tggagggcgt ggtgttctac cgcggcgccg accgcgaggg ccacccggtc tgctacaacg 1140
tgtacggcga gttccaggac aaggagctgt acgagagggc cttcggcgac gaggagaagc 1200
gggagcgctt cctcaagtgg cgcatccagc tcctggagcg cggcatccgg gagcagctcg 1260
acttctcgcc cagcggcatc tgctccatgg tgcaggtcac cgacctcaag aactcgccgc 1320
ccatgctcgg caagcaccgc gccgtcacgc gccaggctct cgccctgcta caggacaact 1380
acccggagtt cgtggccaag aaggtgttca tcaacgtgcc gtggtggtac ctggcggcaa 1440
acaaggtgat gagcccattc ctgactcagc gcaccaagag caagatcgtc ttctgcagcc 1500
ctggcaagtc ggcggagacc ctcttcagat acatcgcccc ggagcaagtc cccgtccagt 1560
tcggcggcct gtacaaggag gacgacacgg agttctccac ctccgacgcc gtcaccgagc 1620
tcaccgtgaa accgtcctcc aaggagaccg tcgagatccc agccaccgag aactccaccg 1680
tggtgtggga gctccgcgtg ctggggtggg aggtgagcta cggcgccgag ttcacccccg 1740
acgcggaggg cggctacacc gtcatcgtgc agaagacgcg gaaggtcccc gcccacgagg 1800
agcccatcat gaagggcagc ttcaaggcca cggagcccgg caagctggtg ctgggcgtga 1860
acaacccggc gtccaggaag aagaagctgc tgtgccggtt caaggtgagg agcgccgccg 1920
cctgatgagg gtcgttggtc tgggtcccag caggtacagc ctgccagctg cttgaccacc 1980
agcccgacat gtataattcg atcgatcgcc aaccaggtcc ataccgccac catttgaatg 2040
aacatgctgc attgcattac atgataggag agagagagag agagagagag gacaataaag 2100
agtttctgct tcctagctag gcccgttccg tgtcctggtc gttctttgat tatttgttgg 2160
cttgctgttc ttttttttcc ttggggggag ggtcgcgttt gtatgtgttt attacttgaa 2220
tgaaaacaaa agtgagcttg ttaatgaaaa aaaaaaaaaa aaatgttgag gtcaccgtgg 2280
tccgtggtgg ggtgggggaa gaagaacagt aacagttgct gcccggcctg tacatttctt 2340
tcaaaaaaaa aaa 2353
<210>26
<211>620
<212>PRT
<213>Zea mays
<400>26
Met Ala Asp Glu Thr Lys Gln Glu Ala Ala Ala Pro Ala Ala Glu Val
1 5 10 15
Val Val Thr Glu Glu Glu Lys Lys Ala Glu Glu Thr Ala Pro Val Ala
20 25 30
Glu Glu Lys Ala Val Glu Ala Ala Val Glu Lys Ala Ala Glu Ala Glu
35 40 45
Ala Gly Ala Glu Glu Lys Ala Ala Glu Ala Asp Ser Glu Glu Glu Lys
50 55 60
Lys Ala Glu Glu Ala Glu Glu Ala Ala Ala Gly Asp Glu Ala Ala Val
65 70 75 80
Ile Asp Gly Thr Gly Ser Phe Lys Glu Glu Ser Asn Leu Val Ser Glu
85 90 95
Leu Pro Asp Pro Glu Arg Thr Ala Leu Ala Gln Leu Lys Glu Leu Val
100 105 110
Ala Thr Ala Leu Ala Asn Gly Glu Phe Asn Leu Pro Pro Pro Pro Ala
115 120 125
Lys Glu Glu Ala Lys Lys Glu Glu Pro Ala Lys Glu Glu Ala Pro Ala
130 135 140
Asp Lys Glu Asp Glu Pro Lys Ala Glu Glu Ala Ala Ala Gln Glu Pro
145 150 155 160
Val Lys Glu Glu Ala Lys Pro Glu Glu Pro Lys Thr Glu Ala Pro Ala
165 170 175
Glu Ala Ala Pro Glu Glu Val Lys Asp Glu Thr Pro Val Pro Glu Glu
180 185 190
Thr Lys Thr Glu Ala Pro Ala Pro Glu Glu Pro Lys Ala Glu Glu Pro
195 200 205
Ala Lys Glu Glu Leu Lys Ala Glu Ala Ala Thr Glu Ala Val Ala Glu
210 215 220
Glu Thr Lys Pro Ala Glu Pro Val Pro Glu Glu Glu Glu Lys Thr Val
225 230 235 240
Val Val Ala Glu Glu Glu Ala Thr Lys Thr Val Glu Ala Ile Glu Glu
245 250 255
Thr Val Ala Val Ala Val Ala Ala Ala Ala Ser Glu Glu Pro Glu Ala
260 265 270
Gly Glu Pro Lys Glu Glu Leu Ile Trp Gly Val Pro Leu Ala Gly Asp
275 280 285
Asp Glu Arg Thr Asp Thr Val Leu Leu Lys Phe Leu Arg Ala Arg Glu
290 295 300
Phe Lys Val Lys Glu Ala Met Ala Met Leu Lys Ser Ala Val Leu Trp
305 310 315 320
Arg Lys Arg Phe Gly Ile Asp Glu Leu Leu Leu Asp Ala Asp Leu Gly
325 330 335
Leu Arg Glu Leu Glu Gly Val Val Phe Tyr Arg Gly Ala Asp Arg Glu
340 345 350
Gly His Pro Val Cys Tyr Asn Val Tyr Gly Glu Phe Gln Asp Lys Glu
355 360 365
Leu Tyr Glu Arg Ala Phe Gly Asp Glu Glu Lys Arg Glu Arg Phe Leu
370 375 380
Lys Trp Arg Ile Gln Leu Leu Glu Arg Gly Ile Arg Glu Gln Leu Asp
385 390 395 400
Phe Ser Pro Ser Gly Ile Cys Ser Met Val Gln Val Thr Asp Leu Lys
405 410 415
Asn Ser Pro Pro Met Leu Gly Lys His Arg Ala Val Thr Arg Gln Ala
420 425 430
Leu Ala Leu Leu Gln Asp Asn Tyr Pro Glu Phe Val Ala Lys Lys Val
435 440 445
Phe Ile Asn Val Pro Trp Trp Tyr Leu Ala Ala Asn Lys Val Met Ser
450 455 460
Pro Phe Leu Thr Gln Arg Thr Lys Ser Lys Ile Val Phe Cys Ser Pro
465 470 475 480
Gly Lys Ser Ala Glu Thr Leu Phe Arg Tyr Ile Ala Pro Glu Gln Val
485 490 495
Pro Val Gln Phe Gly Gly Leu Tyr Lys Glu Asp Asp Thr Glu Phe Ser
500 505 510
Thr Ser Asp Ala Val Thr Glu Leu Thr Val Lys Pro Ser Ser Lys Glu
515 520 525
Thr Val Glu Ile Pro Ala Thr Glu Asn Ser Thr Val Val Trp Glu Leu
530 535 540
Arg Val Leu Gly Trp Glu Val Ser Tyr Gly Ala Glu Phe Thr Pro Asp
545 550 555 560
Ala Glu Gly Gly Tyr Thr Val Ile Val Gln Lys Thr Arg Lys Val Pro
565 570 575
Ala His Glu Glu Pro Ile Met Lys Gly Ser Phe Lys Ala Thr Glu Pro
580 585 590
Gly Lys Leu Val Leu Gly Val Asn Asn Pro Ala Ser Arg Lys Lys Lys
595 600 605
Leu Leu Cys Arg Phe Lys Val Arg Ser Ala Ala Ala
610 615 620
<210>27
<211>2151
<212>DNA
<213>Zea mays
<400>27
caacaatctc tctcccctca ctccctctct gcagcgcgca gctttcaaag cgttgggaga 60
gatggcagag gagacgcaac cagaggccgc agccgccgcc gcgcccgccg cggccgaggt 120
agtcgtgacc gaagctgcgc cggcggaggc ggaggtgcct gtggcggcgg aagctgaagc 180
cgaggccaag gatgagaaga aaggtgacga ggcggagctc accgccgatg acgcgggggt 240
ggggaccggc tcgttcaagg aggaaagcaa cctggtggaa gacctgcccg acccggagaa 300
gaaggcgctc gacgagttca agcagctgat cgctgccgcc ctcgccgccg gtgagttcaa 360
cctgcctccc ccgccgccgc cgccgaaggc caaggagacg aaggtggagg aagccaaggc 420
cgaggagccc gccaaagaag agcccgcggc cgaggcggag gctacggcgg aggagcccaa 480
ggcccaggtg gctgcggatg ccccggttga ggaggtcaag acggaggtgc cgccggccga 540
ggaggccaag gctgagacac tggctgagga agccaagcct tccgagcccg agccgcagga 600
gaagaccgtc gtggtcactg aggaggagac tgccaccaag acggtggaag caatcgagga 660
aaccgtcgtg tccgcccccg ccgccatccc ggaggaagca gcggcgccag aggcggtggt 720
cgaggctcag gcgaccgcgc ctgaacccgt gctgatttgg ggcgtgccgc tggtcggcga 780
cgacgagcgc acggacacgg ttctgctcaa gttcctgcgt gcgcgggagt tcaaggtgaa 840
ggatgccatg gcgatgctca agtccgcggt gctgtggcgc aagcgcttcg gcatcacctc 900
gctcctcgac gccgacctcg gcctgacgga gctggagaac gtggtgttct accgcggcac 960
ggaccgcgaa ggccaccccg tgtgctacaa cgtctacggt gagttccagg acaaggatct 1020
ctacgagaag gccttcggcg acgatgagaa gagggagcgc ttcctcaagt ggcgcatcca 1080
gctgctggag cgcggcattc tgtcgaagct ggacttctcg cccagcggca tctgctccat 1140
ggttcaggtt accgacctca agaactcgcc gcctatgctc ggcaagcacc gcgccgtcac 1200
ccgccaggct gtcacgctgc tccaggacaa ctaccccgag ttcattgcca agaaggtgtt 1260
catcaatgtg ccgtggtggt atctagccgc caacaagatg atgagcccgt tcctcacaca 1320
gcgcaccaag agcaagttcg tctttgctag cccagccaaa tcagcagcga ctctattcag 1380
atacatcgca ccggaacaag ttcctgtcca atttggaggc ctcttcaagg aggatgatcc 1440
tgagttcacc acctctgaca ctgtctccga gctcactatc aaaccatcct caaaagaaac 1500
cgttgagatc cctgtcaccg agaactccac aattgtatgg gaactccggg tgctgagttg 1560
ggaagtgagc tatggcgccg agttcacccc cgacgcggag ggtgggtaca ccgtcattgt 1620
acagaagaca aggaaggtgc ctgctaacga ggaaccgatc atgaagggaa gcttcaaggc 1680
aggcgagcct ggcaaacttg tgctaactgt gaacaaccct gcatccaaga agaagacgct 1740
cctttacaga tcgaaggtga agagcaccag cgagtgagtg tgaggtcacg ccgctagctg 1800
cctgggccct acagttaaaa cgatctacag catgatagaa gagagaagga accttttggt 1860
ttggttcgtt aatttactgg gttttttttt tgggttcgca ttctacattt tgtttggttg 1920
aaaccaaagt gagcttgttt ttgtgacagt agatgggaga agtataatgg cattgtgtga 1980
tggatggttg ttgatgaggg cagggaggac gaaaatgtgg gggaatgaaa ggttggagaa 2040
tgtctgttcc tctgtagatg tgttctgtac attgcatctc ttggattctc attgatatgt 2100
taaaatttag gagtacttgg tcacatcatt attcaatcca tgttactctg c 2151
<210>28
<211>571
<212>PRT
<213>Zea mays
<400>28
Met Ala Glu Glu Thr Gln Pro Glu Ala Ala Ala Ala Ala Ala Pro Ala
1 5 10 15
Ala Ala Glu Val Val Val Thr Glu Ala Ala Pro Ala Glu Ala Glu Val
20 25 30
Pro Val Ala Ala Glu Ala Glu Ala Glu Ala Lys Asp Glu Lys Lys Gly
35 40 45
Asp Glu Ala Glu Leu Thr Ala Asp Asp Ala Gly Val Gly Thr Gly Ser
50 55 60
Phe Lys Glu Glu Ser Asn Leu Val Glu Asp Leu Pro Asp Pro Glu Lys
65 70 75 80
Lys Ala Leu Asp Glu Phe Lys Gln Leu Ile Ala Ala Ala Leu Ala Ala
85 90 95
Gly Glu Phe Asn Leu Pro Pro Pro Pro Pro Pro Pro Lys Ala Lys Glu
100 105 110
Thr Lys Val Glu Glu Ala Lys Ala Glu Glu Pro Ala Lys Glu Glu Pro
115 120 125
Ala Ala Glu Ala Glu Ala Thr Ala Glu Glu Pro Lys Ala Gln Val Ala
130 135 140
Ala Asp Ala Pro Val Glu Glu Val Lys Thr Glu Val Pro Pro Ala Glu
145 150 155 160
Glu Ala Lys Ala Glu Thr Leu Ala Glu Glu Ala Lys Pro Ser Glu Pro
165 170 175
Glu Pro Gln Glu Lys Thr Val Val Val Thr Glu Glu Glu Thr Ala Thr
180 185 190
Lys Thr Val Glu Ala Ile Glu Glu Thr Val Val Ser Ala Pro Ala Ala
195 200 205
Ile Pro Glu Glu Ala Ala Ala Pro Glu Ala Val Val Glu Ala Gln Ala
210 215 220
Thr Ala Pro Glu Pro Val Leu Ile Trp Gly Val Pro Leu Val Gly Asp
225 230 235 240
Asp Glu Arg Thr Asp Thr Val Leu Leu Lys Phe Leu Arg Ala Arg Glu
245 250 255
Phe Lys Val Lys Asp Ala Met Ala Met Leu Lys Ser Ala Val Leu Trp
260 265 270
Arg Lys Arg Phe Gly Ile Thr Ser Leu Leu Asp Ala Asp Leu Gly Leu
275 280 285
Thr Glu Leu Glu Asn Val Val Phe Tyr Arg Gly Thr Asp Arg Glu Gly
290 295 300
His Pro Val Cys Tyr Asn Val Tyr Gly Glu Phe Gln Asp Lys Asp Leu
305 310 315 320
Tyr Glu Lys Ala Phe Gly Asp Asp Glu Lys Arg Glu Arg Phe Leu Lys
325 330 335
Trp Arg Ile Gln Leu Leu Glu Arg Gly Ile Leu Ser Lys Leu Asp Phe
340 345 350
Ser Pro Ser Gly Ile Cys Ser Met Val Gln Val Thr Asp Leu Lys Asn
355 360 365
Ser Pro Pro Met Leu Gly Lys His Arg Ala Val Thr Arg Gln Ala Val
370 375 380
Thr Leu Leu Gln Asp Asn Tyr Pro Glu Phe Ile Ala Lys Lys Val Phe
385 390 395 400
Ile Asn Val Pro Trp Trp Tyr Leu Ala Ala Asn Lys Met Met Ser Pro
405 410 415
Phe Leu Thr Gln Arg Thr Lys Ser Lys Phe Val Phe Ala Ser Pro Ala
420 425 430
Lys Ser Ala Ala Thr Leu Phe Arg Tyr Ile Ala Pro Glu Gln Val Pro
435 440 445
Val Gln Phe Gly Gly Leu Phe Lys Glu Asp Asp Pro Glu Phe Thr Thr
450 455 460
Ser Asp Thr Val Ser Glu Leu Thr Ile Lys Pro Ser Ser Lys Glu Thr
465 470 475 480
Val Glu Ile Pro Val Thr Glu Asn Ser Thr Ile Val Trp Glu Leu Arg
485 490 495
Val Leu Ser Trp Glu Val Ser Tyr Gly Ala Glu Phe Thr Pro Asp Ala
500 505 510
Glu Gly Gly Tyr Thr Val Ile Val Gln Lys Thr Arg Lys Val Pro Ala
515 520 525
Asn Glu Glu Pro Ile Met Lys Gly Ser Phe Lys Ala Gly Glu Pro Gly
530 535 540
Lys Leu Val Leu Thr Val Asn Asn Pro Ala Ser Lys Lys Lys Thr Leu
545 550 555 560
Leu Tyr Arg Ser Lys Val Lys Ser Thr Ser Glu
565 570
<210>29
<211>682
<212>DNA
<213>Saccharum officinarum
<220>
<221>misc_feature
<222>(674)..(674)
<223>n is a, c, g, or t
<400>29
cggacgcgtg ggcaaaacgg tggaagccat cgaggagacc gctgtcgcct ccgccgtggc 60
cgaacctgag gcggaggccg cgccggcgcc ggcggccgag ccgaaggagg agctgatctg 120
gggcgtgccg ctggtgggcg acgacgagcg cacggacacg gtgctgctca agttcctccg 180
cgcgcgcgag ttcaaggtga aggaggccct ggcgatgctc aagtcggcgg tgctgtggcg 240
caagcgcttc ggcatcgacg agctcctggg cgccgacctc ggcctgccgg agctggagaa 300
cgtggtgttc taccgcggcg ccgaccgcga gggccacccc gtctgctaca acgtctacgg 360
cgagttccag gacaaggagc tctacgagaa ggccttcggc gacgaggaga agcgggagcg 420
cttcctcaag tggcgcatcc agctcctcga gcgcggcatc agggagcagc tcgacttctc 480
gcccagtggc atctgctcca tggtgcaggt caccgacctc aagaactcgc cgcccatgct 540
cggcaagcac cgcgccgtca cacgccaggc tctcgccctg ctccaggaca actaccccga 600
gttcgtggcc caagaggtgt ttatcaatgt gccatggtgg tacctcgcgg cgaaacaagt 660
gatgagccca ttcnctgact ca 682
<210>30
<211>156
<212>PRT
<213>Saccharum officinarum
<220>
<221>UNSURE
<222>(154)..(154)
<223>Xaa can be any naturally occurring amino acid
<400>30
Met Leu Lys Ser Ala Val Leu Trp Arg Lys Arg Phe Gly Ile Asp Glu
1 5 10 15
Leu Leu Gly Ala Asp Leu Gly Leu Pro Glu Leu Glu Asn Val Val Phe
20 25 30
Tyr Arg Gly Ala Asp Arg Glu Gly His Pro Val Cys Tyr Asn Val Tyr
35 40 45
Gly Glu Phe Gln Asp Lys Glu Leu Tyr Glu Lys Ala Phe Gly Asp Glu
50 55 60
Glu Lys Arg Glu Arg Phe Leu Lys Trp Arg Ile Gln Leu Leu Glu Arg
65 70 75 80
Gly Ile Arg Glu Gln Leu Asp Phe Ser Pro Ser Gly Ile Cys Ser Met
85 90 95
Val Gln Val Thr Asp Leu Lys Asn Ser Pro Pro Met Leu Gly Lys His
100 105 110
Arg Ala Val Thr Arg Gln Ala Leu Ala Leu Leu Gln Asp Asn Tyr Pro
115 120 125
Glu Phe Val Ala Gln Glu Val Phe Ile Asn Val Pro Trp Trp Tyr Leu
130 135 140
Ala Ala Lys Gln Val Met Ser Pro Phe Xaa Asp Ser
145 150 155
<210>31
<211>837
<212>DNA
<213>Saccharum officinarum
<220>
<221>misc_feature
<222>(563)..(563)
<223>n is a, c, g, or t
<220>
<221>misc_feature
<222>(588)..(588)
<223>n is a, c, g, or t
<220>
<221>misc_feature
<222>(755)..(755)
<223>n is a, c, g, or t
<220>
<221>misc_feature
<222>(760)..(761)
<223>n is a, c, g, or t
<220>
<221>misc_feature
<222>(771)..(773)
<223>n is a, c, g, or t
<220>
<221>misc_feature
<222>(807)..(807)
<223>n is a, c, g, or t
<400>31
cctggagcag cgtgacagga caactacccc gagttcattg ccaagaaggt gttcatcaat 60
gtgccgtggt ggtatctcgc tgccaacaag atgatgagcc cgttcctcac acagcgcacc 120
aagagcaagt tcgtttttgc tagcccagcc aagtcagcag agactctatt cagatacatc 180
gcagcggagc aagttcctgt ccaatttgga ggcctcttca aggaggacga ccctgagttc 240
accacctccg acactgtcgc tgagctcact atcaaaccat cgtcaaaaga aaccattgag 300
atccctgtca cggagaactc cacaattgta tgggaactcc gggtgctcgg ttgggaggtg 360
agctatggtg ctgagttcac ccctgacgct gagggtgggt acactgtcat tgtacagaaa 420
acaaggaagg tgcccgctaa cgaggaaccg atcatgaagg gaagcttcaa ggtacgcgag 480
cccggcaaac ttgtgctaac tgtgaacaac tcggcatcca agaagaagaa gctccttcac 540
agatcaaagg tgaagagcac cancgagtga gtgtgaggtt gctgctanct gcctgggtcc 600
tacagttaaa cgatctacca cagttaatct cagcatgata gaagagagga aaaacctttt 660
ggtttggttc gttaatttat tgggttttgc ttgttttggt gcacattcta cattttgttt 720
ggttaaacaa aagtgagctt gtttttggtt gattnaaatn natttccact nnnaaaaaaa 780
aaaaaaataa atttaagggg gggccgnttt aaaaaggggg ggccgttcct ataggaa 837
<210>32
<211>159
<212>PRT
<213>Saccharum officinarum
<220>
<221>UNSURE
<222>(158)..(158)
<223>Xaa can be any naturally occurring amino acid
<220>
<221>UNSURE
<222>(158)..(158)
<400>32
Met Met Ser Pro Phe Leu Thr Gln Arg Thr Lys Ser Lys Phe Val Phe
1 5 10 15
Ala Ser Pro Ala Lys Ser Ala Glu Thr Leu Phe Arg Tyr Ile Ala Ala
20 25 30
Glu Gln Val Pro Val Gln Phe Gly Gly Leu Phe Lys Glu Asp Asp Pro
35 40 45
Glu Phe Thr Thr Ser Asp Thr Val Ala Glu Leu Thr Ile Lys Pro Ser
50 55 60
Ser Lys Glu Thr Ile Glu Ile Pro Val Thr Glu Asn Ser Thr Ile Val
65 70 75 80
Trp Glu Leu Arg Val Leu Gly Trp Glu Val Ser Tyr Gly Ala Glu Phe
85 90 95
Thr Pro Asp Ala Glu Gly Gly Tyr Thr Val Ile Val Gln Lys Thr Arg
100 105 110
Lys Val Pro Ala Asn Glu Glu Pro Ile Met Lys Gly Ser Phe Lys Val
115 120 125
Arg Glu Pro Gly Lys Leu Val Leu Thr Val Asn Asn Ser Ala Ser Lys
130 135 140
Lys Lys Lys Leu Leu His Arg Ser Lys Val Lys Ser Thr Xaa Glu
145 150 155
<210>33
<211>1143
<212>DNA
<213>Saccharum officinarum
<220>
<221>misc_feature
<222>(1033)..(1033)
<223>n is a, c, g, or t
<400>33
gcctgagccc gtgctgatct ggggcgtgcc gctggtgggc gacgacgagc gcacggacac 60
ggttctgctt aagttcctgc gagcgcggga gttcaaggtg aaggaggcca ttgcgatgct 120
caagtccgcg gtgctgtggc gcaagcgctt cggcatcacc tcgctcctcg acgccgacct 180
cggcctgccg gagctggaga acgtggtgtt ctaccgcggc gccgaccgcg agggccaccc 240
cgtgtgctac aacgtctacg gcgagttcca ggacaaggat ctctacgaga aggccttcgg 300
cgacgatgag aagcgggagc gcttcctcaa gtggcgcatc cagctgctgg agcgcggcat 360
cctgtcgaag ctggactttt cgcccagcgg catctgctcc atggtccagg ttaccgacct 420
caagaactcg ccgcccatgc tcggcaagca ccgcaccgtc acccgccagg ctgtcacgct 480
gctccaggac aactaccccg agttcattgc caagaaggtg ttcattcaat gtgccgtggt 540
ggtatctcgc tgccaacaag atgatgagcc cgttcctcac acagcgcacc aagagcaagt 600
tcgtttttgc tagcccagcc aagtcagcag agactctatt cagatacatc gcagcggagc 660
aagttcctgt ccaatttgga ggcctcttca aggaggacga ccctgagttc accacctccg 720
acactgtcgc tgagctcact atcaaaccat cgtcaaaaga aaccattgag atccctgtca 780
cggagaactc cacaattgta tgggaactcc gggtgctcgg ttgggaggtg agctatggtg 840
ctgagttcac ccctgacgct gagggtgggt acactgtcat tgtacagaat acaaggaagg 900
tgcccgctaa cgaggaaccg atcatgaagg gaagcttcaa ggtaggcgag cccggcaaac 960
ttgtgctaac tgtgaacaag ctagcatcca ataagaagaa gctccttcac agatcaaagg 1020
tgaagagcac cancgagtga gtgtgaagtt gctgctagct ggctgggtcc tacagttcaa 1080
cgatctacca cagtttatct cagcatgata gaagaaagga aaaacctttt ggcttgggtc 1140
cgt 1143
<210>34
<211>244
<212>PRT
<213>Saccharum officinarum
<220>
<221>UNSURE
<222>(243)..(243)
<223>Xaa can be any naturally occurring amino acid
<400>34
Met Arg Ser Gly Ser Ala Ser Ser Ser Gly Ala Ser Ser Cys Trp Ser
1 5 10 15
Ala Ala Ser Cys Arg Ser Trp Thr Phe Arg Pro Ala Ala Ser Ala Pro
20 25 30
Trp Ser Arg Leu Pro Thr Ser Arg Thr Arg Arg Pro Cys Ser Ala Ser
35 40 45
Thr Ala Pro Ser Pro Ala Arg Leu Ser Arg Cys Ser Arg Thr Thr Thr
50 55 60
Pro Ser Ser Leu Pro Arg Arg Cys Ser Phe Asn Val Pro Trp Trp Tyr
65 70 75 80
Leu Ala Ala Asn Lys Met Met Ser Pro Phe Leu Thr Gln Arg Thr Lys
85 90 95
Ser Lys Phe Val Phe Ala Ser Pro Ala Lys Ser Ala Glu Thr Leu Phe
100 105 110
Arg Tyr Ile Ala Ala Glu Gln Val Pro Val Gln Phe Gly Gly Leu Phe
115 120 125
Lys Glu Asp Asp Pro Glu Phe Thr Thr Ser Asp Thr Val Ala Glu Leu
130 135 140
Thr Ile Lys Pro Ser Ser Lys Glu Thr Ile Glu Ile Pro Val Thr Glu
145 150 155 160
Asn Ser Thr Ile Val Trp Glu Leu Arg Val Leu Gly Trp Glu Val Ser
165 170 175
Tyr Gly Ala Glu Phe Thr Pro Asp Ala Glu Gly Gly Tyr Thr Val Ile
180 185 190
Val Gln Asn Thr Arg Lys Val Pro Ala Asn Glu Glu Pro Ile Met Lys
195 200 205
Gly Ser Phe Lys Val Gly Glu Pro Gly Lys Leu Val Leu Thr Val Asn
210 215 220
Lys Leu Ala Ser Asn Lys Lys Lys Leu Leu His Arg Ser Lys Val Lys
225 230 235 240
Ser Thr Xaa Glu
<210>35
<211>1404
<212>DNA
<213>Saccharum officinarum
<220>
<221>misc_feature
<222>(565)..(565)
<223>n is a, c, g, or t
<220>
<221>misc_feature
<222>(577)..(577)
<223>n is a, c, g, or t
<400>35
ctctctcttg ctcctctcct cgccaccgcc gccgtgaaat ccgcacgcgt gctgcgtcct 60
gccgagttgt gagctgtgat tgtgagtgag cgccatggcc gtggaggctg tgtctggaaa 120
tggcgccgag gcggtggcgc cggcgccggc gaaggaggtg aacgccaagg aggcggttgc 180
ggtgtccaag aacgcgtcgt tcagggagga gagcaacttc ctggacgatc tcaaggagag 240
cgagcgtaag gcgctcgccg agctccgcga caaggtcgag gcggccatcg tggagggcaa 300
gctgttcgac gacggcggca agccggaggc gaaggagaag gagcaggcca agaagaaggc 360
tgagaagacc gtggagaaga aagaggagga gcccgaagcc gaagagaagg gagaggagga 420
cggcaagaag gaggccgacg ccgaggagga gaaaaaggaa ggcgaggagg aaggggagaa 480
gaaggacgac gaggagggtg gaggagaaga caccaaggac gaggccaaga aagatgaagc 540
cggcgagaag gcggcggcga aggangagaa agaggangag aagccggcgg agacggcggc 600
cgtcgtcgtc gtcgacaagg acatcgcgct gtggggcgtg cctctgctcc cgagcaaggg 660
agacgaggcc acggacgtgg tgctcctcaa gttcctccgc gcgcgcgact tcaaggccgg 720
cgccgcgttc gagatgctcc gccgcacgct ccgctggcgc aggggctgga ccggcttcag 780
cgttgacgcc gacgacgacg acgccgacct ccccgaggag ctcgcgggcg cgtgctacct 840
cgacggcgcg gaccgggagg gccacccggt gtgctacaac gcgccgggcg tgttcgcgga 900
cgacgccgtg tacaagaagg cgctgggcac cgaggaaggc aaggccaggt tcctccggtg 960
gcgggtccgc gccatggagc gccacgtggc cgagctggac ctgaggcccg gcggcgccgc 1020
gtcgctgctg caggtgaccg acctgaagaa ctcgccgggc ccggccaaga aggacctccg 1080
cgtcgccgtc aagcaggtgc tcgacctgtt ccaggacaac taccccgagc tcgtcgcaag 1140
aaacatctta atcaacgtgc cgttctggta ctacgcgttc agcgccctgt tctacccgtt 1200
cctgacgcag aggaccaaga gcaagttcgt cgttgctcgc ccgtccaagg tcaccgagac 1260
cctcctcaag tacattccga ttgaggccat ccccgtgaag tacggcggcc tgaaacgcga 1320
cggcgacacc gagttcttcg cggacgacag cgaagtcaca gaaggcaccg tcaaggaaag 1380
ctccacgcag accatcgaga tcga 1404
<210>36
<211>436
<212>PRT
<213>Saccharum officinarum
<220>
<221>UNSURE
<222>(157)..(157)
<223>Xaa can be any naturally occurring amino acid
<220>
<221>UNSURE
<222>(161)..(161)
<223>Xaa can be any naturally occurring amino acid
<400>36
Met Ala Val Glu Ala Val Ser Gly Asn Gly Ala Glu Ala Val Ala Pro
1 5 10 15
Ala Pro Ala Lys Glu Val Asn Ala Lys Glu Ala Val Ala Val Ser Lys
20 25 30
Asn Ala Ser Phe Arg Glu Glu Ser Asn Phe Leu Asp Asp Leu Lys Glu
35 40 45
Ser Glu Arg Lys Ala Leu Ala Glu Leu Arg Asp Lys Val Glu Ala Ala
50 55 60
Ile Val Glu Gly Lys Leu Phe Asp Asp Gly Gly Lys Pro Glu Ala Lys
65 70 75 80
Glu Lys Glu Gln Ala Lys Lys Lys Ala Glu Lys Thr Val Glu Lys Lys
85 90 95
Glu Glu Glu Pro Glu Ala Glu Glu Lys Gly Glu Glu Asp Gly Lys Lys
100 105 110
Glu Ala Asp Ala Glu Glu Glu Lys Lys Glu Gly Glu Glu Glu Gly Glu
115 120 125
Lys Lys Asp Asp Glu Glu Gly Gly Gly Glu Asp Thr Lys Asp Glu Ala
130 135 140
Lys Lys Asp Glu Ala Gly Glu Lys Ala Ala Ala Lys Xaa Glu Lys Glu
145 150 155 160
Xaa Glu Lys Pro Ala Glu Thr Ala Ala Val Val Val Val Asp Lys Asp
165 170 175
Ile Ala Leu Trp Gly Val Pro Leu Leu Pro Ser Lys Gly Asp Glu Ala
180 185 190
Thr Asp Val Val Leu Leu Lys Phe Leu Arg Ala Arg Asp Phe Lys Ala
195 200 205
Gly Ala Ala Phe Glu Met Leu Arg Arg Thr Leu Arg Trp Arg Arg Gly
210 215 220
Trp Thr Gly Phe Ser Val Asp Ala Asp Asp Asp Asp Ala Asp Leu Pro
225 230 235 240
Glu Glu Leu Ala Gly Ala Cys Tyr Leu Asp Gly Ala Asp Arg Glu Gly
245 250 255
His Pro Val Cys Tyr Asn Ala Pro Gly Val Phe Ala Asp Asp Ala Val
260 265 270
Tyr Lys Lys Ala Leu Gly Thr Glu Glu Gly Lys Ala Arg Phe Leu Arg
275 280 285
Trp Arg Val Arg Ala Met Glu Arg His Val Ala Glu Leu Asp Leu Arg
290 295 300
Pro Gly Gly Ala Ala Ser Leu Leu Gln Val Thr Asp Leu Lys Asn Ser
305 310 315 320
Pro Gly Pro Ala Lys Lys Asp Leu Arg Val Ala Val Lys Gln Val Leu
325 330 335
Asp Leu Phe Gln Asp Asn Tyr Pro Glu Leu Val Ala Arg Asn Ile Leu
340 345 350
Ile Asn Val Pro Phe Trp Tyr Tyr Ala Phe Ser Ala Leu Phe Tyr Pro
355 360 365
Phe Leu Thr Gln Arg Thr Lys Ser Lys Phe Val Val Ala Arg Pro Ser
370 375 380
Lys Val Thr Glu Thr Leu Leu Lys Tyr Ile Pro Ile Glu Ala Ile Pro
385 390 395 400
Val Lys Tyr Gly Gly Leu Lys Arg Asp Gly Asp Thr Glu Phe Phe Ala
405 410 415
Asp Asp Ser Glu Val Thr Glu Gly Thr Val Lys Glu Ser Ser Thr Gln
420 425 430
Thr Ile Glu Ile
435
<210>37
<211>1658
<212>DNA
<213>Triticum aestivum
<400>37
ccacgcgtcc gctctccccc tctccctcct gctctatctt tctcaccgca aaagcttgaa 60
acaccccgcg gagatggcag aggagccgca gccacaggcc gccgccgccc ccgccgccgc 120
ggccacggag gtggtcgtcg ccgagaaggc gccggcggag gtggagaaga aggccgagga 180
gcccgcggcg gaggcggagg ccgaggagac ggccgccgtt gccgacgacg ggggcgccgt 240
cgaggccacc ggctctttca aggaggagag caacctcgtc gccgacctgc ctgacccgga 300
gaagaaggcg ctcgacgagt tcaaggagct gatcgtcgcc gcgctcgccg ccggtgagtt 360
caatctgccc cctcccccgc cgccgccgaa ggccaagact gaggccgccg cagaggagac 420
caagacggag gcgccggcca aggaggaggc caagaccgag gagccggcca aggcggaaga 480
accagccaag gaggagccca aggctgaaga gccggccaag gccgaggcgg cagcggcgga 540
gccagcagcc gaggagccca aggccgtggt cgctgccgag gcagcagccg aggagccggc 600
caaggaggaa cccaaggccg aggaggccaa gccggccgag ccaaagaagg aggaggaagc 660
agtcgtggtc gccgaggagg gcaccaagac ggcggaaccg gtcgaggagg ccgccgccgc 720
cgccaccacc acagagcagg cagcggcgcc ggaaccggag gcggaggcag ccgcgcccga 780
gccggtgttc atctggggcg tgccgctggt gggcgacgac gagcgcacgg acgcggtgct 840
gctcaagttc ctgcgcgcgc gggagttcaa ggtgaaggag gcgatggcga tgctccggtc 900
cgccgtgctg tggcggaagc gcttcggcat cgagtcgctc ctggaggccg acctggcctt 960
cccggagctg gagaaggtgg tgttctaccg cggcgccgac cgggagggcc acccggtgtg 1020
ctacaacgtg tacggcgagt tccaggacaa ggaggtgtac gagaaggcgt tcggcgacga 1080
ggagaagcgg gagcggttcc tcaagtggcg catccagctg ctggagcgcg gcatcctgtc 1140
gcagctggac ttcgcgccca gcggcatctg ctccatggtg caggtcaccg acctcaagaa 1200
ctcgccgccc atgctcggca agcaccgcgc cgtcacccgc caggccgtcg ccctgctcca 1260
ggacaactac cccgagttca tcgccaagaa ggtgttcatc aacgtgccat ggtggtatct 1320
cgctgccaac aaaatgatga gccctttcct cacccagcgc accaagagca agttcgtgtt 1380
cgccagccag gccaagtcac ccgagaccct cttcagatac attgcgccgg agcaagttcc 1440
cgtccaattt ggaggcctct tcaaggaaga tgaccctgat ttcaccacct ccgactctgt 1500
caccgagctc accatcaaag cttcatccaa agaaaccatt gagatccctg tcaccgagaa 1560
ctcaacgatt gtatgggagc tccgggtgct cggctgggag gtcagccacg gcgcggagtt 1620
caccccggac gccgaggggg cgtacaccgt catcgtgc 1658
<210>38
<211>528
<212>PRT
<213>Triticum aestivum
<400>38
Met Ala Glu Glu Pro Gln Pro Gln Ala Ala Ala Ala Pro Ala Ala Ala
1 5 10 15
Ala Thr Glu Val Val Val Ala Glu Lys Ala Pro Ala Glu Val Glu Lys
20 25 30
Lys Ala Glu Glu Pro Ala Ala Glu Ala Glu Ala Glu Glu Thr Ala Ala
35 40 45
Val Ala Asp Asp Gly Gly Ala Val Glu Ala Thr Gly Ser Phe Lys Glu
50 55 60
Glu Ser Asn Leu Val Ala Asp Leu Pro Asp Pro Glu Lys Lys Ala Leu
65 70 75 80
Asp Glu Phe Lys Glu Leu Ile Val Ala Ala Leu Ala Ala Gly Glu Phe
85 90 95
Asn Leu Pro Pro Pro Pro Pro Pro Pro Lys Ala Lys Thr Glu Ala Ala
100 105 110
Ala Glu Glu Thr Lys Thr Glu Ala Pro Ala Lys Glu Glu Ala Lys Thr
115 120 125
Glu Glu Pro Ala Lys Ala Glu Glu Pro Ala Lys Glu Glu Pro Lys Ala
130 135 140
Glu Glu Pro Ala Lys Ala Glu Ala Ala Ala Ala Glu Pro Ala Ala Glu
145 150 155 160
Glu Pro Lys Ala Val Val Ala Ala Glu Ala Ala Ala Glu Glu Pro Ala
165 170 175
Lys Glu Glu Pro Lys Ala Glu Glu Ala Lys Pro Ala Glu Pro Lys Lys
180 185 190
Glu Glu Glu Ala Val Val Val Ala Glu Glu Gly Thr Lys Thr Ala Glu
195 200 205
Pro Val Glu Glu Ala Ala Ala Ala Ala Thr Thr Thr Glu Gln Ala Ala
210 215 220
Ala Pro Glu Pro Glu Ala Glu Ala Ala Ala Pro Glu Pro Val Phe Ile
225 230 235 240
Trp Gly Val Pro Leu Val Gly Asp Asp Glu Arg Thr Asp Ala Val Leu
245 250 255
Leu Lys Phe Leu Arg Ala Arg Glu Phe Lys Val Lys Glu Ala Met Ala
260 265 270
Met Leu Arg Ser Ala Val Leu Trp Arg Lys Arg Phe Gly Ile Glu Ser
275 280 285
Leu Leu Glu Ala Asp Leu Ala Phe Pro Glu Leu Glu Lys Val Val Phe
290 295 300
Tyr Arg Gly Ala Asp Arg Glu Gly His Pro Val Cys Tyr Asn Val Tyr
305 310 315 320
Gly Glu Phe Gln Asp Lys Glu Val Tyr Glu Lys Ala Phe Gly Asp Glu
325 330 335
Glu Lys Arg Glu Arg Phe Leu Lys Trp Arg Ile Gln Leu Leu Glu Arg
340 345 350
Gly Ile Leu Ser Gln Leu Asp Phe Ala Pro Ser Gly Ile Cys Ser Met
355 360 365
Val Gln Val Thr Asp Leu Lys Asn Ser Pro Pro Met Leu Gly Lys His
370 375 380
Arg Ala Val Thr Arg Gln Ala Val Ala Leu Leu Gln Asp Asn Tyr Pro
385 390 395 400
Glu Phe Ile Ala Lys Lys Val Phe Ile Asn Val Pro Trp Trp Tyr Leu
405 410 415
Ala Ala Asn Lys Met Met Ser Pro Phe Leu Thr Gln Arg Thr Lys Ser
420 425 430
Lys Phe Val Phe Ala Ser Gln Ala Lys Ser Pro Glu Thr Leu Phe Arg
435 440 445
Tyr Ile Ala Pro Glu Gln Val Pro Val Gln Phe Gly Gly Leu Phe Lys
450 455 460
Glu Asp Asp Pro Asp Phe Thr Thr Ser Asp Ser Val Thr Glu Leu Thr
465 470 475 480
Ile Lys Ala Ser Ser Lys Glu Thr Ile Glu Ile Pro Val Thr Glu Asn
485 490 495
Ser Thr Ile Val Trp Glu Leu Arg Val Leu Gly Trp Glu Val Ser His
500 505 510
Gly Ala Glu Phe Thr Pro Asp Ala Glu Gly Ala Tyr Thr Val Ile Val
515 520 525
<210>39
<211>1722
<212>DNA
<213>Arabidopsis thaliana
<400>39
atggctcaag aggaagtaca gaaatcggct gatgtcgctg ctgctccggt ggtgaaggag 60
aaacctatta ccgataagga ggttactatt cctacccctg tggcagagaa agaggaagtt 120
gctgctcctg tctctgatga gaaggcggtt ccagagaagg aggtgactcc ggagaaggaa 180
gccccagcgg cggaagcgga gaaatctgtt tcggtgaagg aggaagagac ggttgttgta 240
gctgagaagg ttgttgtttt aactgctgag gaagttcaga agaaggcact tgaggagttt 300
aaggagcttg taagggaggc tttgaacaaa cgtgaattca ctgctccggt gacgccggtt 360
aaggaagaga aaacagagga gaagaaaaca gaggaggaaa ctaaagagga agagaaaaca 420
gaggagaaga aagaagagac aacgactgag gttaaggttg aagaagagaa accggcggtt 480
ccagcggcgg aggaggagaa atcatcagag gctgctccgg ttgagaccaa atctgaggag 540
aaacctgaag agaaagcaga ggtaacaacc gagaaagcat ccagtgccga agaagatgga 600
accaagaccg tggaagcaat cgaagaatct atcgtctctg tttcaccacc tgaatccgcc 660
gtagcacctg tcgtggtaga gactgtcgcc gttgctgagg cagagccagt ggagccggaa 720
gaagtctcga tctggggagt tccactactc caagacgaga gatctgacgt gatcctcacg 780
aaattcctcc gtgcaagaga ctttaaggtc aaagaagctt taaccatgct taaaaacacc 840
gtccagtggc gtaaagaaaa caaaatcgac gaactcgttg aatccggaga agaagtgagt 900
gagttcgaga agatggtgtt tgctcacggt gttgacaaag aaggacacgt cgtgatctac 960
agttcttacg gtgagtttca gaacaaggag cttttctccg acaaggagaa gcttaacaag 1020
ttcctcagct ggaggattca gctacaagag aagtgtgtga gagctattga tttcagcaac 1080
cctgaagcga agtcttcgtt tgtgttcgtc agcgacttca ggaacgctcc aggacttggt 1140
aaaagagcct tgtggcaatt catcagacgc gctgttaaac aattcgagga caattatcct 1200
gaattcgccg ctaaagagct attcatcaat gtcccatggt ggtacattcc atactacaaa 1260
acattcggat ctatcatcac atccccaagg actaggagca agatggtcct tgctggtcca 1320
tccaaatctg ccgatactat tttcaaatac atagctcctg aacaagttcc cgttaaatac 1380
ggtggactta gcaaagatac tcctttgacc gaagaaacca taacggaagc catcgttaaa 1440
ccggcagcaa actacactat tgaattgcct gcttctgagg cttgcacgct ttcatgggag 1500
cttagggttt tgggtgctga tgtgagctac ggagctcagt ttgagccaac caccgaagga 1560
agctatgctg tgatcgtctc taagacacgg aagattggat caaccgatga accggtgata 1620
accgattctt ttaaggtggg tgaaccggga aagattgtga tcacaatcga caaccagact 1680
tccaagaaga agaaagtgct ctacaggttc aaaactcaat aa 1722
<210>40
<211>573
<212>PRT
<213>Arabidopsis thaliana
<400>40
Met Ala Gln Glu Glu Val Gln Lys Ser Ala Asp Val Ala Ala Ala Pro
1 5 10 15
Val Val Lys Glu Lys Pro Ile Thr Asp Lys Glu Val Thr Ile Pro Thr
20 25 30
Pro Val Ala Glu Lys Glu Glu Val Ala Ala Pro Val Ser Asp Glu Lys
35 40 45
Ala Val Pro Glu Lys Glu Val Thr Pro Glu Lys Glu Ala Pro Ala Ala
50 55 60
Glu Ala Glu Lys Ser Val Ser Val Lys Glu Glu Glu Thr Val Val Val
65 70 75 80
Ala Glu Lys Val Val Val Leu Thr Ala Glu Glu Val Gln Lys Lys Ala
85 90 95
Leu Glu Glu Phe Lys Glu Leu Val Arg Glu Ala Leu Asn Lys Arg Glu
100 105 110
Phe Thr Ala Pro Val Thr Pro Val Lys Glu Glu Lys Thr Glu Glu Lys
115 120 125
Lys Thr Glu Glu Glu Thr Lys Glu Glu Glu Lys Thr Glu Glu Lys Lys
130 135 140
Glu Glu Thr Thr Thr Glu Val Lys Val Glu Glu Glu Lys Pro Ala Val
145 150 155 160
Pro Ala Ala Glu Glu Glu Lys Ser Ser Glu Ala Ala Pro Val Glu Thr
165 170 175
Lys Ser Glu Glu Lys Pro Glu Glu Lys Ala Glu Val Thr Thr Glu Lys
180 185 190
Ala Ser Ser Ala Glu Glu Asp Gly Thr Lys Thr Val Glu Ala Ile Glu
195 200 205
Glu Ser Ile Val Ser Val Ser Pro Pro Glu Ser Ala Val Ala Pro Val
210 215 220
Val Val Glu Thr Val Ala Val Ala Glu Ala Glu Pro Val Glu Pro Glu
225 230 235 240
Glu Val Ser Ile Trp Gly Val Pro Leu Leu Gln Asp Glu Arg Ser Asp
245 250 255
Val Ile Leu Thr Lys Phe Leu Arg Ala Arg Asp Phe Lys Val Lys Glu
260 265 270
Ala Leu Thr Met Leu Lys Asn Thr Val Gln Trp Arg Lys Glu Asn Lys
275 280 285
Ile Asp Glu Leu Val Glu Ser Gly Glu Glu Val Ser Glu Phe Glu Lys
290 295 300
Met Val Phe Ala His Gly Val Asp Lys Glu Gly His Val Val Ile Tyr
305 310 315 320
Ser Ser Tyr Gly Glu Phe Gln Asn Lys Glu Leu Phe Ser Asp Lys Glu
325 330 335
Lys Leu Asn Lys Phe Leu Ser Trp Arg Ile Gln Leu Gln Glu Lys Cys
340 345 350
Val Arg Ala Ile Asp Phe Ser Asn Pro Glu Ala Lys Ser Ser Phe Val
355 360 365
Phe Val Ser Asp Phe Arg Asn Ala Pro Gly Leu Gly Lys Arg Ala Leu
370 375 380
Trp Gln Phe Ile Arg Arg Ala Val Lys Gln Phe Glu Asp Asn Tyr Pro
385 390 395 400
Glu Phe Ala Ala Lys Glu Leu Phe Ile Asn Val Pro Trp Trp Tyr Ile
405 410 415
Pro Tyr Tyr Lys Thr Phe Gly Ser Ile Ile Thr Ser Pro Arg Thr Arg
420 425 430
Ser Lys Met Val Leu Ala Gly Pro Ser Lys Ser Ala Asp Thr Ile Phe
435 440 445
Lys Tyr Ile Ala Pro Glu Gln Val Pro Val Lys Tyr Gly Gly Leu Ser
450 455 460
Lys Asp Thr Pro Leu Thr Glu Glu Thr Ile Thr Glu Ala Ile Val Lys
465 470 475 480
Pro Ala Ala Asn Tyr Thr Ile Glu Leu Pro Ala Ser Glu Ala Cys Thr
485 490 495
Leu Ser Trp Glu Leu Arg Val Leu Gly Ala Asp Val Ser Tyr Gly Ala
500 505 510
Gln Phe Glu Pro Thr Thr Glu Gly Ser Tyr Ala Val Ile Val Ser Lys
515 520 525
Thr Arg Lys Ile Gly Ser Thr Asp Glu Pro Val Ile Thr Asp Ser Phe
530 535 540
Lys Val Gly Glu Pro Gly Lys Ile Val Ile Thr Ile Asp Asn Gln Thr
545 550 555 560
Ser Lys Lys Lys Lys Val Leu Tyr Arg Phe Lys Thr Gln
565 570
<210>41
<211>1473
<212>DNA
<213>Arabidopsis thaliana
<400>41
atggctgaag aacctactac taccactctc gttacaccgg aaaagctacc ttctccgagc 60
ctcacgcctt ctgaagtatc tgaatctact caagatgccc taccgacaga gacagaaact 120
ctggagaaag tgactgagac taatccaccg gaaactgcag ataccaccac caagccagaa 180
gaagaaaccg cggcagagca tcatccaccg acagtgacgg aaacagaaac tgcatcgacg 240
gagaaacaag aggttaaaga cgaagcatcg cagaaagaag tagctgaaga gaaaaagagt 300
atgattccac agaatcttgg ttcattcaaa gaagaaagca gcaaactttc tgatctatct 360
aattccgaga agaaatcact cgatgaacta aaacatctag ttcgagaagc tctagacaat 420
caccaattca ccaacacacc agaagaagtc aagatttggg ggattccatt acttgaagac 480
gatagaagcg acgtcgtttt gttaaaattc ctaagagcta gggagttcaa ggtgaaagat 540
tcgtttgcta tgctcaagaa cacaatcaag tggagaaagg agttcaagat cgatgaattg 600
gtcgaggaag atcttgtgga tgatcttgac aaggttgtgt ttatgcatgg acatgaccga 660
gaaggtcacc ctgtttgtta caatgtctat ggtgagtttc agaacaagga gctttataat 720
aagacgtttt ctgatgagga aaagaggaaa catttcttga ggactaggat tcagttcttg 780
gagaggagta taaggaagct agattttagc tctggtgggg tttctactat ttttcaggtt 840
aatgatatga agaattctcc ggggttaggg aagaaagagc ttagatcagc tactaagcaa 900
gctgttgagt tgcttcagga caattaccct gagtttgtct tcaaacaggc ttttatcaat 960
gttccttggt ggtaccttgt gttttatact gtgattggtc cgttcatgac accaagatca 1020
aagagcaagc ttgtgtttgc tggtccttcg cgttcagctg aaaccctatt caaatacata 1080
tcacccgagc aagttccagt acaatatggt ggattgagtg ttgatccttg cgactgcaat 1140
ccagactttt cgttggaaga ttcagcctct gagatcactg ttaagcccgg aacaaaacaa 1200
actgttgaga tcataatcta tgagaaatgt gaacttgtgt gggagataag ggtaactgga 1260
tgggaagtga gctacaaggc tgaatttgtg ccggaagaga aagatgctta cacggtggtt 1320
atacaaaaac cgaggaagat gagaccatcc gatgaaccgg tgttaaccca tagcttcaaa 1380
gtgaatgagc ttggcaaggt tttactcaca gtagacaacc caacctctaa gaagaagaag 1440
ctcgtttaca ggttcaatgt caaacctctc taa 1473
<210>42
<211>490
<212>PRT
<213>Arabidopsis thaliana
<400>42
Met Ala Glu Glu Pro Thr Thr Thr Thr Leu Val Thr Pro Glu Lys Leu
1 5 10 15
Pro Ser Pro Ser Leu Thr Pro Ser Glu Val Ser Glu Ser Thr Gln Asp
20 25 30
Ala Leu Pro Thr Glu Thr Glu Thr Leu Glu Lys Val Thr Glu Thr Asn
35 40 45
Pro Pro Glu Thr Ala Asp Thr Thr Thr Lys Pro Glu Glu Glu Thr Ala
50 55 60
Ala Glu His His Pro Pro Thr Val Thr Glu Thr Glu Thr Ala Ser Thr
65 70 75 80
Glu Lys Gln Glu Val Lys Asp Glu Ala Ser Gln Lys Glu Val Ala Glu
85 90 95
Glu Lys Lys Ser Met Ile Pro Gln Asn Leu Gly Ser Phe Lys Glu Glu
100 105 110
Ser Ser Lys Leu Ser Asp Leu Ser Asn Ser Glu Lys Lys Ser Leu Asp
115 120 125
Glu Leu Lys His Leu Val Arg Glu Ala Leu Asp Asn His Gln Phe Thr
130 135 140
Asn Thr Pro Glu Glu Val Lys Ile Trp Gly Ile Pro Leu Leu Glu Asp
145 150 155 160
Asp Arg Ser Asp Val Val Leu Leu Lys Phe Leu Arg Ala Arg Glu Phe
165 170 175
Lys Val Lys Asp Ser Phe Ala Met Leu Lys Asn Thr Ile Lys Trp Arg
180 185 190
Lys Glu Phe Lys Ile Asp Glu Leu Val Glu Glu Asp Leu Val Asp Asp
195 200 205
Leu Asp Lys Val Val Phe Met His Gly His Asp Arg Glu Gly His Pro
210 215 220
Val Cys Tyr Asn Val Tyr Gly Glu Phe Gln Asn Lys Glu Leu Tyr Asn
225 230 235 240
Lys Thr Phe Ser Asp Glu Glu Lys Arg Lys His Phe Leu Arg Thr Arg
245 250 255
Ile Gln Phe Leu Glu Arg Ser Ile Arg Lys Leu Asp Phe Ser Ser Gly
260 265 270
Gly Val Ser Thr Ile Phe Gln Val Asn Asp Met Lys Asn Ser Pro Gly
275 280 285
Leu Gly Lys Lys Glu Leu Arg Ser Ala Thr Lys Gln Ala Val Glu Leu
290 295 300
Leu Gln Asp Asn Tyr Pro Glu Phe Val Phe Lys Gln Ala Phe Ile Asn
305 310 315 320
Val Pro Trp Trp Tyr Leu Val Phe Tyr Thr Val Ile Gly Pro Phe Met
325 330 335
Thr Pro Arg Ser Lys Ser Lys Leu Val Phe Ala Gly Pro Ser Arg Ser
340 345 350
Ala Glu Thr Leu Phe Lys Tyr Ile Ser Pro Glu Gln Val Pro Val Gln
355 360 365
Tyr Gly Gly Leu Ser Val Asp Pro Cys Asp Cys Asn Pro Asp Phe Ser
370 375 380
Leu Glu Asp Ser Ala Ser Glu Ile Thr Val Lys Pro Gly Thr Lys Gln
385 390 395 400
Thr Val Glu Ile Ile Ile Tyr Glu Lys Cys Glu Leu Val Trp Glu Ile
405 410 415
Arg Val Thr Gly Trp Glu Val Ser Tyr Lys Ala Glu Phe Val Pro Glu
420 425 430
Glu Lys Asp Ala Tyr Thr Val Val Ile Gln Lys Pro Arg Lys Met Arg
435 440 445
Pro Ser Asp Glu Pro Val Leu Thr His Ser Phe Lys Val Asn Glu Leu
450 455 460
Gly Lys Val Leu Leu Thr Val Asp Asn Pro Thr Ser Lys Lys Lys Lys
465 470 475 480
Leu Val Tyr Arg Phe Asn Val Lys Pro Leu
485 490
<210>43
<211>2052
<212>DNA
<213>Arabidopsis thaliana
<400>43
atggctcaag aagagataca gaaacctact gcctctgttc cagtggttaa ggaggaaact 60
cctgccccgg ttaaggaggt tgaggtgccg gttactacgg agaaagctgt ggctgcgcct 120
gctccggaag ctacggagga gaaagttgtg tctgaggtgg cggtgcctga aacagaggtg 180
acggcggtga aagaggagga ggttgcgacg ggaaaggaga tcttgcaatc ggagtcgttt 240
aaggaggaag gctatttggc ttctgaatta caggaagctg agaagaatgc tttggctgag 300
ttaaaggagt tggttaggga ggctttgaac aagcgtgaat tcaccgcgcc accgccacca 360
ccagctccgg taaaggaaga gaaagttgag gagaagaaaa cagaggaaac agaggaaaag 420
aaggaagaag ttaaaacaga ggaaaaatct cttgaggctg aaaccaaaga agaggagaaa 480
tctgctgctc cggccaccgt agagaccaag aaagaagaga tcttggccgc tccggctccg 540
atcgtcgcag agaccaagaa ggaagagaca ccagttgctc ctgctccggt agagactaaa 600
ccggctgctc cggtcgttgc agagacaaag aaggaagaaa tattaccagc tgctccggtc 660
accacagaga ccaaggtgga agagaaagtc gttccagtag aaaccacacc ggctgctcca 720
gtcaccacag agaccaagga agaagagaaa gccgctccgg tcaccacaga gaccaaggag 780
gaagagaaag cagctccggg agagaccaag aaagaagaga aagcaaccgc ctctactcag 840
gtcaagaggg cctcaaaatt tattaaagat atatttgtct cagtcaccac tagcgagaag 900
aagaaggaag aagagaaacc agcagtagta acaatcgaga aggctttcgc agctgatcaa 960
gaagaagaaa caaaaaccgt tgaagcagtc gaagaatcaa tcgtctccat cactcttcca 1020
gagacagctg catacgtaga gccagaagaa gtctcaatct ggggaatccc acttctagag 1080
gacgaaagat ccgacgtgat cctcctcaaa ttcctccgtg cacgtgactt caaggtcaaa 1140
gaagccttca cgatgctgaa aaacaccgtc caatggcgca aagagaacaa gatcgacgac 1200
ctagtctcag aagatcttga aggaagcgag tttgagaagt tggtgttcac tcacggtgtc 1260
gacaaacaag gacatgtcgt gatctatagc tcgtacggtg agtttcagaa caaggagatt 1320
ttctcagata aagagaagct tagcaagttt ctcaaatgga ggattcagtt ccaagagaag 1380
tgtgtgaggt ctcttgactt tagccctgag gctaagtcat cgttcgtgtt cgttagtgac 1440
ttcaggaacg ctcctggact tggtcagaga gcattgtggc agttcattaa acgcgccgtt 1500
aagcaattcg aagataacta tccagagttt gtcgctaaag agctgttcat taatgtccca 1560
tggtggtaca ttccttacta caaaacattc ggaagtatca ttacatcgcc aaggacaagg 1620
agcaagatgg tcctttctgg tccatccaaa tccgctgaga ccattttcaa atacgtagct 1680
cctgaagtag tcccggttaa gtatggtgga ctcagcaaag atagtccatt caccgttgaa 1740
gatggagtca ccgaggccgt agttaaatcg acatctaaat ataccattga tttgcctgct 1800
acagagggtt ccacgctctc atgggagctt agggttttgg gtgcggacgt gagctacgga 1860
gctcaatttg agccaagcaa tgaggcaagc tacaccgtga ttgtctctaa gaaccggaag 1920
gtcggtttaa ctgatgaacc ggtgataacc gattctttca aggcaagtga ggcgggaaag 1980
gtcgtgatca cgattgacaa ccaaaccttt aagaagaaga aggtgctcta caggtccaaa 2040
acccaagcat aa 2052
<210>44
<211>683
<212>PRT
<213>Arabidopsis thaliana
<400>44
Met Ala Gln Glu Glu Ile Gln Lys Pro Thr Ala Ser Val Pro Val Val
1 5 10 15
Lys Glu Glu Thr Pro Ala Pro Val Lys Glu Val Glu Val Pro Val Thr
20 25 30
Thr Glu Lys Ala Val Ala Ala Pro Ala Pro Glu Ala Thr Glu Glu Lys
35 40 45
Val Val Ser Glu Val Ala Val Pro Glu Thr Glu Val Thr Ala Val Lys
50 55 60
Glu Glu Glu Val Ala Thr Gly Lys Glu Ile Leu Gln Ser Glu Ser Phe
65 70 75 80
Lys Glu Glu Gly Tyr Leu Ala Ser Glu Leu Gln Glu Ala Glu Lys Asn
85 90 95
Ala Leu Ala Glu Leu Lys Glu Leu Val Arg Glu Ala Leu Asn Lys Arg
100 105 110
Glu Phe Thr Ala Pro Pro Pro Pro Pro Ala Pro Val Lys Glu Glu Lys
115 120 125
Val Glu Glu Lys Lys Thr Glu Glu Thr Glu Glu Lys Lys Glu Glu Val
130 135 140
Lys Thr Glu Glu Lys Ser Leu Glu Ala Glu Thr Lys Glu Glu Glu Lys
145 150 155 160
Ser Ala Ala Pro Ala Thr Val Glu Thr Lys Lys Glu Glu Ile Leu Ala
165 170 175
Ala Pro Ala Pro Ile Val Ala Glu Thr Lys Lys Glu Glu Thr Pro Val
180 185 190
Ala Pro Ala Pro Val Glu Thr Lys Pro Ala Ala Pro Val Val Ala Glu
195 200 205
Thr Lys Lys Glu Glu Ile Leu Pro Ala Ala Pro Val Thr Thr Glu Thr
210 215 220
Lys Val Glu Glu Lys Val Val Pro Val Glu Thr Thr Pro Ala Ala Pro
225 230 235 240
Val Thr Thr Glu Thr Lys Glu Glu Glu Lys Ala Ala Pro Val Thr Thr
245 250 255
Glu Thr Lys Glu Glu Glu Lys Ala Ala Pro Gly Glu Thr Lys Lys Glu
260 265 270
Glu Lys Ala Thr Ala Ser Thr Gln Val Lys Arg Ala Ser Lys Phe Ile
275 280 285
Lys Asp Ile Phe Val Ser Val Thr Thr Ser Glu Lys Lys Lys Glu Glu
290 295 300
Glu Lys Pro Ala Val Val Thr Ile Glu Lys Ala Phe Ala Ala Asp Gln
305 310 315 320
Glu Glu Glu Thr Lys Thr Val Glu Ala Val Glu Glu Ser Ile Val Ser
325 330 335
Ile Thr Leu Pro Glu Thr Ala Ala Tyr Val Glu Pro Glu Glu Val Ser
340 345 350
Ile Trp Gly Ile Pro Leu Leu Glu Asp Glu Arg Ser Asp Val Ile Leu
355 360 365
Leu Lys Phe Leu Arg Ala Arg Asp Phe Lys Val Lys Glu Ala Phe Thr
370 375 380
Met Leu Lys Asn Thr Val Gln Trp Arg Lys Glu Asn Lys Ile Asp Asp
385 390 395 400
Leu Val Ser Glu Asp Leu Glu Gly Ser Glu Phe Glu Lys Leu Val Phe
405 410 415
Thr His Gly Val Asp Lys Gln Gly His Val Val Ile Tyr Ser Ser Tyr
420 425 430
Gly Glu Phe Gln Asn Lys Glu Ile Phe Ser Asp Lys Glu Lys Leu Ser
435 440 445
Lys Phe Leu Lys Trp Arg Ile Gln Phe Gln Glu Lys Cys Val Arg Ser
450 455 460
Leu Asp Phe Ser Pro Glu Ala Lys Ser Ser Phe Val Phe Val Ser Asp
465 470 475 480
Phe Arg Asn Ala Pro Gly Leu Gly Gln Arg Ala Leu Trp Gln Phe Ile
485 490 495
Lys Arg Ala Val Lys Gln Phe Glu Asp Asn Tyr Pro Glu Phe Val Ala
500 505 510
Lys Glu Leu Phe Ile Asn Val Pro Trp Trp Tyr Ile Pro Tyr Tyr Lys
515 520 525
Thr Phe Gly Ser Ile Ile Thr Ser Pro Arg Thr Arg Ser Lys Met Val
530 535 540
Leu Ser Gly Pro Ser Lys Ser Ala Glu Thr Ile Phe Lys Tyr Val Ala
545 550 555 560
Pro Glu Val Val Pro Val Lys Tyr Gly Gly Leu Ser Lys Asp Ser Pro
565 570 575
Phe Thr Val Glu Asp Gly Val Thr Glu Ala Val Val Lys Ser Thr Ser
580 585 590
Lys Tyr Thr Ile Asp Leu Pro Ala Thr Glu Gly Ser Thr Leu Ser Trp
595 600 605
Glu Leu Arg Val Leu Gly Ala Asp Val Ser Tyr Gly Ala Gln Phe Glu
610 615 620
Pro Ser Asn Glu Ala Ser Tyr Thr Val Ile Val Ser Lys Asn Arg Lys
625 630 635 640
Val Gly Leu Thr Asp Glu Pro Val Ile Thr Asp Ser Phe Lys Ala Ser
645 650 655
Glu Ala Gly Lys Val Val Ile Thr Ile Asp Asn Gln Thr Phe Lys Lys
660 665 670
Lys Lys Val Leu Tyr Arg Ser Lys Thr Gln Ala
675 680
<210>45
<211>1230
<212>DNA
<213>Arabidopsis thaliana
<400>45
atggatgctt cattgtctcc attcgatcac caaaaaactc aaaacacaga gccaaagaaa 60
agcttcatta cctcactaat cactctccgt tcaaacaaca tcaaagaaga cacatacttc 120
gtctcagaac tcaaacccac ggagcaaaaa tcacttcaag aactcaaaga aaagctctca 180
gcttcatcct ccaaagcttc ttcaatgtgg ggagtctcac tcctcggtgg agacgacaaa 240
gctgacgtaa tcctcctcaa gttcctcaga gcaagagatt tcaaagtagc agactctttg 300
agaatgcttg agaagtgttt ggagtggaga gaagagttca aagcagagaa attgacagaa 360
gaagatctgg gttttaaaga tttggaaggt aaagttgctt acatgagagg ctacgacaaa 420
gaaggacacc cagtttgtta caatgcttat ggtgtgttta aagagaaaga gatgtatgag 480
agagtgtttg gtgatgaaga gaagcttaac aagtttctga gatggagagt tcaggttttg 540
gagagaggtg ttaaaatgct tcattttaaa cctggtggtg ttaattccat tattcaagtt 600
acagatctta aagatatgcc taagagagag cttagagttg cttctaatca gatcctctct 660
ctttttcaag ataattaccc tgaattggtt gctactaaga tattcataaa cgtgccttgg 720
tacttcagtg tgatctactc aatgttcagc ccattcctga ctcagagaac aaagagcaag 780
tttgtgatgt ccaaagaagg caatgcagca gaaacactct acaagttcat taggccagaa 840
gatattccgg tgcaatacgg tggtcttagc cgtcctactg attcgcaaaa cggaccgcca 900
aaaccggcgt ctgaattctc catcaagggt ggtgagaaag ttaacattca gattgaaggc 960
attgagggtg gagcaaccat aacatgggat atagtagttg gaggatggga tttagagtac 1020
agtgcagagt ttgttccaaa cgctgaagag agttacgcga tcgttgtcga gaaaccgaag 1080
aagatgaaag ctacagatga agctgtttgc aactctttca ctacagtaga agctgggaag 1140
ctcattctct ctgttgacaa tactctctct cgcaagaaga aagttgctgc ttaccgttac 1200
actgtccgga aatctactac aaccgtctaa 1230
<210>46
<211>409
<212>PRT
<213>Arabidopsis thaliana
<400>46
Met Asp Ala Ser Leu Ser Pro Phe Asp His Gln Lys Thr Gln Asn Thr
1 5 10 15
Glu Pro Lys Lys Ser Phe Ile Thr Ser Leu Ile Thr Leu Arg Ser Asn
20 25 30
Asn Ile Lys Glu Asp Thr Tyr Phe Val Ser Glu Leu Lys Pro Thr Glu
35 40 45
Gln Lys Ser Leu Gln Glu Leu Lys Glu Lys Leu Ser Ala Ser Ser Ser
50 55 60
Lys Ala Ser Ser Met Trp Gly Val Ser Leu Leu Gly Gly Asp Asp Lys
65 70 75 80
Ala Asp Val Ile Leu Leu Lys Phe Leu Arg Ala Arg Asp Phe Lys Val
85 90 95
Ala Asp Ser Leu Arg Met Leu Glu Lys Cys Leu Glu Trp Arg Glu Glu
100 105 110
Phe Lys Ala Glu Lys Leu Thr Glu Glu Asp Leu Gly Phe Lys Asp Leu
115 120 125
Glu Gly Lys Val Ala Tyr Met Arg Gly Tyr Asp Lys Glu Gly His Pro
130 135 140
Val Cys Tyr Asn Ala Tyr Gly Val Phe Lys Glu Lys Glu Met Tyr Glu
145 150 155 160
Arg Val Phe Gly Asp Glu Glu Lys Leu Asn Lys Phe Leu Arg Trp Arg
165 170 175
Val Gln Val Leu Glu Arg Gly Val Lys Met Leu His Phe Lys Pro Gly
180 185 190
Gly Val Asn Ser Ile Ile Gln Val Thr Asp Leu Lys Asp Met Pro Lys
195 200 205
Arg Glu Leu Arg Val Ala Ser Asn Gln Ile Leu Ser Leu Phe Gln Asp
210 215 220
Asn Tyr Pro Glu Leu Val Ala Thr Lys Ile Phe Ile Asn Val Pro Trp
225 230 235 240
Tyr Phe Ser Val Ile Tyr Ser Met Phe Ser Pro Phe Leu Thr Gln Arg
245 250 255
Thr Lys Ser Lys Phe Val Met Ser Lys Glu Gly Asn Ala Ala Glu Thr
260 265 270
Leu Tyr Lys Phe Ile Arg Pro Glu Asp Ile Pro Val Gln Tyr Gly Gly
275 280 285
Leu Ser Arg Pro Thr Asp Ser Gln Asn Gly Pro Pro Lys Pro Ala Ser
290 295 300
Glu Phe Ser Ile Lys Gly Gly Glu Lys Val Asn Ile Gln Ile Glu Gly
305 310 315 320
Ile Glu Gly Gly Ala Thr Ile Thr Trp Asp Ile Val Val Gly Gly Trp
325 330 335
Asp Leu Glu Tyr Ser Ala Glu Phe Val Pro Asn Ala Glu Glu Ser Tyr
340 345 350
Ala Ile Val Val Glu Lys Pro Lys Lys Met Lys Ala Thr Asp Glu Ala
355 360 365
Val Cys Asn Ser Phe Thr Thr Val Glu Ala Gly Lys Leu Ile Leu Ser
370 375 380
Val Asp Asn Thr Leu Ser Arg Lys Lys Lys Val Ala Ala Tyr Arg Tyr
385 390 395 400
Thr Val Arg Lys Ser Thr Thr Thr Val
405
<210>47
<211>1623
<212>DNA
<213>Arabidopsis thaliana
<400>47
atgactgctg aagttaaggt tgaggagaaa caggtggagt cagaggttgt tattgctcct 60
gctgttgttc ctgaggagac tactgttaag gctgttgtgg aagagactaa ggttgaagaa 120
gatgagagca agcctgaggg tgtggagaag agtgcttcct tcaaagaaga gagtgatttc 180
tttgctgatt tgaaagaatc tgagaaaaag gcactgagtg atctcaagtc taagcttgag 240
gaagctattg ttgacaacac tctcttaaag acgaagaaga aggagagctc tcctatgaag 300
gagaagaagg aagaggttgt gaaacctgaa gctgaggttg agaagaagaa ggaagaagca 360
gcagaggaga aggttgaaga agagaagaaa tctgaggctg ttgttaccga agaagcaccg 420
aaagctgaga ctgttgaggc tgttgttaca gaggagataa tccccaagga agaagtgact 480
actgttgttg agaaggtaga agaagaaacc aaggaagaag agaagaaaac cgaggatgtt 540
gttactgaag aagtgaaagc tgagactatt gaggtggagg atgaagatga gtcggtggat 600
aaggatatcg agctttgggg agtgccattg cttccaagca aaggagctga aagcacggat 660
gttatcctct tgaagttctt gagagcaaga gactttaaag tcaacgaagc ctttgagatg 720
ctgaagaaaa ccctcaaatg gagaaagcaa aacaagattg attcgatcct tggagaggag 780
tttggggagg atcttgccac tgcagcttac atgaacggtg tggaccgcga gtcccaccca 840
gtttgttaca atgtccacag cgaggagctt taccagacga ttgggtcgga gaagaacaga 900
gagaagttct tgagatggag gtttcagctg atggagaagg gaatccagaa gcttaatctt 960
aaaccaggag gtgttacttc tcttctccag atccacgatc tcaaaaacgc tcctggagtg 1020
tcaagaacag agatttgggt cggaatcaag aaagtaatcg agactttgca ggacaactat 1080
ccggaattcg tgtccagaaa catattcatc aacgttccat tctggttcta cgccatgaga 1140
gctgtcctct cgccattctt aactcaacga accaagagca agtttgttgt ggctcgtccc 1200
gctaaggtca gagagactct tctcaagtac attccagctg atgagctccc agttcagtac 1260
ggtgggttca aaacagtaga cgataccgaa ttctccaacg aaactgtctc tgaagttgtt 1320
gttaagcctg gatcatctga aaccatcgaa atcccagctc ctgagactga aggtacattg 1380
gtatgggaca tagcggtttt gggatgggaa gtgaattaca aggaagagtt tgtgccaaca 1440
gaagaaggag cttacacggt aatagtccaa aaggtgaaga agatgggagc aaatgaagga 1500
ccaatcagga acagtttcaa gaacagtcag gctggtaaga ttgttcttac cgttgacaat 1560
gtctctggca agaagaagaa agttctgtac aggtacagaa ccaagactga atcctcttcc 1620
tga 1623
<210>48
<211>540
<212>PRT
<213>Arabidopsis thaliana
<400>48
Met Thr Ala Glu Val Lys Val Glu Glu Lys Gln Val Glu Ser Glu Val
1 5 10 15
Val Ile Ala Pro Ala Val Val Pro Glu Glu Thr Thr Val Lys Ala Val
20 25 30
Val Glu Glu Thr Lys Val Glu Glu Asp Glu Ser Lys Pro Glu Gly Val
35 40 45
Glu Lys Ser Ala Ser Phe Lys Glu Glu Ser Asp Phe Phe Ala Asp Leu
50 55 60
Lys Glu Ser Glu Lys Lys Ala Leu Ser Asp Leu Lys Ser Lys Leu Glu
65 70 75 80
Glu Ala Ile Val Asp Asn Thr Leu Leu Lys Thr Lys Lys Lys Glu Ser
85 90 95
Ser Pro Met Lys Glu Lys Lys Glu Glu Val Val Lys Pro Glu Ala Glu
100 105 110
Val Glu Lys Lys Lys Glu Glu Ala Ala Glu Glu Lys Val Glu Glu Glu
115 120 125
Lys Lys Ser Glu Ala Val Val Thr Glu Glu Ala Pro Lys Ala Glu Thr
130 135 140
Val Glu Ala Val Val Thr Glu Glu Ile Ile Pro Lys Glu Glu Val Thr
145 150 155 160
Thr Val Val Glu Lys Val Glu Glu Glu Thr Lys Glu Glu Glu Lys Lys
165 170 175
Thr Glu Asp Val Val Thr Glu Glu Val Lys Ala Glu Thr Ile Glu Val
180 185 190
Glu Asp Glu Asp Glu Ser Val Asp Lys Asp Ile Glu Leu Trp Gly Val
195 200 205
Pro Leu Leu Pro Ser Lys Gly Ala Glu Ser Thr Asp Val Ile Leu Leu
210 215 220
Lys Phe Leu Arg Ala Arg Asp Phe Lys Val Asn Glu Ala Phe Glu Met
225 230 235 240
Leu Lys Lys Thr Leu Lys Trp Arg Lys Gln Asn Lys Ile Asp Ser Ile
245 250 255
Leu Gly Glu Glu Phe Gly Glu Asp Leu Ala Thr Ala Ala Tyr Met Asn
260 265 270
Gly Val Asp Arg Glu Ser His Pro Val Cys Tyr Asn Val His Ser Glu
275 280 285
Glu Leu Tyr Gln Thr Ile Gly Ser Glu Lys Asn Arg Glu Lys Phe Leu
290 295 300
Arg Trp Arg Phe Gln Leu Met Glu Lys Gly Ile Gln Lys Leu Asn Leu
305 310 315 320
Lys Pro Gly Gly Val Thr Ser Leu Leu Gln Ile His Asp Leu Lys Asn
325 330 335
Ala Pro Gly Val Ser Arg Thr Glu Ile Trp Val Gly Ile Lys Lys Val
340 345 350
Ile Glu Thr Leu Gln Asp Asn Tyr Pro Glu Phe Val Ser Arg Asn Ile
355 360 365
Phe Ile Asn Val Pro Phe Trp Phe Tyr Ala Met Arg Ala Val Leu Ser
370 375 380
Pro Phe Leu Thr Gln Arg Thr Lys Ser Lys Phe Val Val Ala Arg Pro
385 390 395 400
Ala Lys Val Arg Glu Thr Leu Leu Lys Tyr Ile Pro Ala Asp Glu Leu
405 410 415
Pro Val Gln Tyr Gly Gly Phe Lys Thr Val Asp Asp Thr Glu Phe Ser
420 425 430
Asn Glu Thr Val Ser Glu Val Val Val Lys Pro Gly Ser Ser Glu Thr
435 440 445
Ile Glu Ile Pro Ala Pro Glu Thr Glu Gly Thr Leu Val Trp Asp Ile
450 455 460
Ala Val Leu Gly Trp Glu Val Asn Tyr Lys Glu Glu Phe Val Pro Thr
465 470 475 480
Glu Glu Gly Ala Tyr Thr Val Ile Val Gln Lys Val Lys Lys Met Gly
485 490 495
Ala Asn Glu Gly Pro Ile Arg Asn Ser Phe Lys Asn Ser Gln Ala Gly
500 505 510
Lys Ile Val Leu Thr Val Asp Asn Val Ser Gly Lys Lys Lys Lys Val
515 520 525
Leu Tyr Arg Tyr Arg Thr Lys Thr Glu Ser Ser Ser
530 535 540
<210>49
<211>2007
<212>DNA
<213>Arabidopsis thaliana
<400>49
atgtctcaag attctgcaac tactactccg ccgccacctt taacctccga cgtttcaatg 60
ccttccggtg aagaagatga gccaaagcat gtaacatcgg aagaagaggc accggtgact 120
tcggagacaa atctgaagct acctttgatg ccggagctag aggagtcaaa tcatacggcg 180
gaggttgttt cagagaaggt gacgccggag acgatgactt tggagtcaga aggtctcaac 240
cacgcggcgg aggattcaga gcagacacat gaagtgacgc cggagacaga gactgcgaag 300
ctagaggttc tcaaccacac ggcggaggat tcagagcaga cacatgaagt gacgccggag 360
aaagagactg tgaaatcaga gtttctcaac cacgtggcgg aggattcaga gcagacacat 420
gaagtgacgc cggagacaga gactgtgaag tcagaggttc tcaaccatgc ggcggaggat 480
tcagagcagc cacgtggagt gacaccgacg ccggagacag aaacatcgga ggcagacacg 540
tcattgcttg taacttccga gacagaagag cctaaccatg cggcggagga ttattcagag 600
acagagccat cacagaaact aatgttggag cagaggagaa agtacatgga agtagaagat 660
tggacagagc cagaactacc agatgaagcg gtgttagaag ctgcagcgtc agtccctgag 720
ccaaagcaac cagagcctca gacaccacca ccaccaccat ctactactac ttccactgtt 780
gcatctagat ctttagccga aatgatgaac agagaagaag cagaagtaga agagaaacaa 840
aagattcaga ttcctcgtag tctcggttca ttcaaagaag aaacaaacaa aatctccgat 900
ctttcagaaa cagagttaaa cgcacttcaa gagcttcgtc accttcttca agtatcacaa 960
gattcaagca aaacctctat atggggtgtg ccacttctca aagacgacag aaccgacgtc 1020
gttttgttaa agtttctaag agcaagagac tttaaacctc aagaagctta ctcaatgcta 1080
aacaagacac tccaatggag aatcgacttc aacatcgaag agcttctaga cgaaaacctc 1140
ggtgacgatt tagacaaagt tgtgttcatg caaggacaag acaaggagaa tcatcctgtc 1200
tgttacaatg tctacggtga gtttcagaac aaagatcttt atcagaaaac gttttcagat 1260
gaagagaaga gagaacggtt cttgagatgg aggattcagt ttcttgaaaa gagtatcagg 1320
aatcttgatt ttgtagctgg tggtgtttcc acgatatgtc aagtaaatga tcttaagaat 1380
tctccaggac ctggtaagac tgagcttagg ttagctacta agcaagctct tcatcttctt 1440
caagacaatt accctgagtt tgtctctaaa cagatattca tcaatgttcc atggtggtac 1500
cttgcgttct atagaattat tagtcctttt atgtcacaaa ggtcaaagag caaactagtt 1560
ttcgcaggtc cttcaagatc tgcagaaacc cttctcaagt acatatcacc cgaacatgtc 1620
ccggttcagt atggtggact aagtgtggat aattgcgagt gtaactcgga tttcacacac 1680
gatgatatcg ctaccgagat tactgttaaa ccaactacta aacaaaccgt cgagattatt 1740
gtttacgaga aatgtacaat cgtgtgggag ataagagtag tgggatggga ggtttcgtat 1800
ggagcggagt ttgtgccgga gaacaaagaa gggtatacag tgatcattca gaaaccgagg 1860
aagatgactg cgaaaaatga actggtggtg tctcatagct tcaaagttgg agaagttggc 1920
aggattttgc taactgttga taacccgact tcgaccaaga aaatgcttat ctacaggttc 1980
aaggttaagc ctttagcttg tgagtaa 2007
<210>50
<211>668
<212>PRT
<213>Arabidopsis thaliana
<400>50
Met Ser Gln Asp Ser Ala Thr Thr Thr Pro Pro Pro Pro Leu Thr Ser
1 5 10 15
Asp Val Ser Met Pro Ser Gly Glu Glu Asp Glu Pro Lys His Val Thr
20 25 30
Ser Glu Glu Glu Ala Pro Val Thr Ser Glu Thr Asn Leu Lys Leu Pro
35 40 45
Leu Met Pro Glu Leu Glu Glu Ser Asn His Thr Ala Glu Val Val Ser
50 55 60
Glu Lys Val Thr Pro Glu Thr Met Thr Leu Glu Ser Glu Gly Leu Asn
65 70 75 80
His Ala Ala Glu Asp Ser Glu Gln Thr His Glu Val Thr Pro Glu Thr
85 90 95
Glu Thr Ala Lys Leu Glu Val Leu Asn His Thr Ala Glu Asp Ser Glu
100 105 110
Gln Thr His Glu Val Thr Pro Glu Lys Glu Thr Val Lys Ser Glu Phe
115 120 125
Leu Asn His Val Ala Glu Asp Ser Glu Gln Thr His Glu Val Thr Pro
130 135 140
Glu Thr Glu Thr Val Lys Ser Glu Val Leu Asn His Ala Ala Glu Asp
145 150 155 160
Ser Glu Gln Pro Arg Gly Val Thr Pro Thr Pro Glu Thr Glu Thr Ser
165 170 175
Glu Ala Asp Thr Ser Leu Leu Val Thr Ser Glu Thr Glu Glu Pro Asn
180 185 190
His Ala Ala Glu Asp Tyr Ser Glu Thr Glu Pro Ser Gln Lys Leu Met
195 200 205
Leu Glu Gln Arg Arg Lys Tyr Met Glu Val Glu Asp Trp Thr Glu Pro
210 215 220
Glu Leu Pro Asp Glu Ala Val Leu Glu Ala Ala Ala Ser Val Pro Glu
225 230 235 240
Pro Lys Gln Pro Glu Pro Gln Thr Pro Pro Pro Pro Pro Ser Thr Thr
245 250 255
Thr Ser Thr Val Ala Ser Arg Ser Leu Ala Glu Met Met Asn Arg Glu
260 265 270
Glu Ala Glu Val Glu Glu Lys Gln Lys Ile Gln Ile Pro Arg Ser Leu
275 280 285
Gly Ser Phe Lys Glu Glu Thr Asn Lys Ile Ser Asp Leu Ser Glu Thr
290 295 300
Glu Leu Asn Ala Leu Gln Glu Leu Arg His Leu Leu Gln Val Ser Gln
305 310 315 320
Asp Ser Ser Lys Thr Ser Ile Trp Gly Val Pro Leu Leu Lys Asp Asp
325 330 335
Arg Thr Asp Val Val Leu Leu Lys Phe Leu Arg Ala Arg Asp Phe Lys
340 345 350
Pro Gln Glu Ala Tyr Ser Met Leu Asn Lys Thr Leu Gln Trp Arg Ile
355 360 365
Asp Phe Asn Ile Glu Glu Leu Leu Asp Glu Asn Leu Gly Asp Asp Leu
370 375 380
Asp Lys Val Val Phe Met Gln Gly Gln Asp Lys Glu Asn His Pro Val
385 390 395 400
Cys Tyr Asn Val Tyr Gly Glu Phe Gln Asn Lys Asp Leu Tyr Gln Lys
405 410 415
Thr Phe Ser Asp Glu Glu Lys Arg Glu Arg Phe Leu Arg Trp Arg Ile
420 425 430
Gln Phe Leu Glu Lys Ser Ile Arg Asn Leu Asp Phe Val Ala Gly Gly
435 440 445
Val Ser Thr Ile Cys Gln Val Asn Asp Leu Lys Asn Ser Pro Gly Pro
450 455 460
Gly Lys Thr Glu Leu Arg Leu Ala Thr Lys Gln Ala Leu His Leu Leu
465 470 475 480
Gln Asp Asn Tyr Pro Glu Phe Val Ser Lys Gln Ile Phe Ile Asn Val
485 490 495
Pro Trp Trp Tyr Leu Ala Phe Tyr Arg Ile Ile Ser Pro Phe Met Ser
500 505 510
Gln Arg Ser Lys Ser Lys Leu Val Phe Ala Gly Pro Ser Arg Ser Ala
515 520 525
Glu Thr Leu Leu Lys Tyr Ile Ser Pro Glu His Val Pro Val Gln Tyr
530 535 540
Gly Gly Leu Ser Val Asp Asn Cys Glu Cys Asn Ser Asp Phe Thr His
545 550 555 560
Asp Asp Ile Ala Thr Glu Ile Thr Val Lys Pro Thr Thr Lys Gln Thr
565 570 575
Val Glu Ile Ile Val Tyr Glu Lys Cys Thr Ile Val Trp Glu Ile Arg
580 585 590
Val Val Gly Trp Glu Val Ser Tyr Gly Ala Glu Phe Val Pro Glu Asn
595 600 605
Lys Glu Gly Tyr Thr Val Ile Ile Gln Lys Pro Arg Lys Met Thr Ala
610 615 620
Lys Asn Glu Leu Val Val Ser His Ser Phe Lys Val Gly Glu Val Gly
625 630 635 640
Arg Ile Leu Leu Thr Val Asp Asn Pro Thr Ser Thr Lys Lys Met Leu
645 650 655
Ile Tyr Arg Phe Lys Val Lys Pro Leu Ala Cys Glu
660 665
<210>51
<211>1542
<212>DNA
<213>Populus trichocarpa
<400>51
gactcagtac taacatctga atctgatgtt agtactgaga ttaaaccacc agcagagact 60
caggaatcaa aagttgaaga aaccccagaa aaagaatcac aagaagtggc aaaagaagag 120
caaaaggctg caccttcacc agaggagatc actatatggg ggatccctct tctgaaagat 180
gatagaagtg atgtggttct cttgaagttc ttgagggcaa gggattttaa ggtaagcgat 240
gcatttgtaa tgatcaagaa cacaattcaa tggaggagag actttaaaat tgatgagctt 300
gttgatgaag atctaggtga tgatttggag aaagttgtgt ttatgcatgg ttatgacagg 360
gaagggcatc ctgtgtgtta taatgtgtat ggggagtttc aaaataaaga gttgtatcag 420
aagacattct ctgatgagga gaaaagattg aagtttttga ggtggcggat tcagttcttg 480
gagaggagta ttaggaagct tgattttagt cctagtggta tttccaccgt cttccaggtt 540
aatgatctca agaactctcc aggacccgga aagagagagc ttaggttggc tactaaacag 600
gctctcctat tgcttcagga caattaccct gagtttgtgg ccaaacaggt gttcatcaat 660
gtcccttggt ggtatcttgc attttataca atgatcagtc catttatgac acaaagaacc 720
aaaagcaaat ttgtattcgc aggcccatca aaatctgctg agacactttt caaatatgta 780
tctcctgagc aagttcctat tcagtatggt ggcttgagcg tggatttctg cgactgcaac 840
cccgaattta cttttgctga tcctgctact gagataactg taaaaccagc aaccaagcaa 900
actgtggaaa ttataattta tgagaaatgt ttcattgttt gggagttgcg agttgttgga 960
tgggaggtga gttatagtgc tgaattcgtg cccgattcta aagatgcata cacaattata 1020
atgacaaaac ccacaaaaat gaccccaacc aatgagccag tggtgtctaa cagcttcaaa 1080
gttggtgagc tgggaaaaat attgctcaca gttgacaact ctacctcaaa gaagaagaaa 1140
cttctctaca ggttcaagat aaaccccttc tcagattgag gaactctttg tatttaatgt 1200
ttccatctca gatagttgtg gaaattgaat taattttatg tactctgggt gagccttttg 1260
tctctcgtct tttttctcta ccgaatagca catggtttgc attcggtgtt ttttttgcca 1320
atcatcttgg ggaaaacaaa cagatgtgtg ggcatatgga ggggggtgtg gttgtgaaac 1380
ttatatatac ataaatgtgc catttcattg aggcgttttg cgatgcgctt tctcgtttat 1440
ctggatgtca atatttcagt accatttacc tttatgtaat ctcaaaactc ttatgttatt 1500
gcgacatcga atacttcaat ttgacatcat ctttaaagtt ga 1542
<210>52
<211>309
<212>PRT
<213>Populus trichocarpa
<400>52
Met Ile Lys Asn Thr Ile Gln Trp Arg Arg Asp Phe Lys Ile Asp Glu
1 5 10 15
Leu Val Asp Glu Asp Leu Gly Asp Asp Leu Glu Lys Val Val Phe Met
20 25 30
His Gly Tyr Asp Arg Glu Gly His Pro Val Cys Tyr Asn Val Tyr Gly
35 40 45
Glu Phe Gln Asn Lys Glu Leu Tyr Gln Lys Thr Phe Ser Asp Glu Glu
50 55 60
Lys Arg Leu Lys Phe Leu Arg Trp Arg Ile Gln Phe Leu Glu Arg Ser
65 70 75 80
Ile Arg Lys Leu Asp Phe Ser Pro Ser Gly Ile Ser Thr Val Phe Gln
85 90 95
Val Asn Asp Leu Lys Asn Ser Pro Gly Pro Gly Lys Arg Glu Leu Arg
100 105 110
Leu Ala Thr Lys Gln Ala Leu Leu Leu Leu Gln Asp Asn Tyr Pro Glu
115 120 125
Phe Val Ala Lys Gln Val Phe Ile Asn Val Pro Trp Trp Tyr Leu Ala
130 135 140
Phe Tyr Thr Met Ile Ser Pro Phe Met Thr Gln Arg Thr Lys Ser Lys
145 150 155 160
Phe Val Phe Ala Gly Pro Ser Lys Ser Ala Glu Thr Leu Phe Lys Tyr
165 170 175
Val Ser Pro Glu Gln Val Pro Ile Gln Tyr Gly Gly Leu Ser Val Asp
180 185 190
Phe Cys Asp Cys Asn Pro Glu Phe Thr Phe Ala Asp Pro Ala Thr Glu
195 200 205
Ile Thr Val Lys Pro Ala Thr Lys Gln Thr Val Glu Ile Ile Ile Tyr
210 215 220
Glu Lys Cys Phe Ile Val Trp Glu Leu Arg Val Val Gly Trp Glu Val
225 230 235 240
Ser Tyr Ser Ala Glu Phe Val Pro Asp Ser Lys Asp Ala Tyr Thr Ile
245 250 255
Ile Met Thr Lys Pro Thr Lys Met Thr Pro Thr Asn Glu Pro Val Val
260 265 270
Ser Asn Ser Phe Lys Val Gly Glu Leu Gly Lys Ile Leu Leu Thr Val
275 280 285
Asp Asn Ser Thr Ser Lys Lys Lys Lys Leu Leu Tyr Arg Phe Lys Ile
290 295 300
Asn Pro Phe Ser Asp
305
<210>53
<211>1080
<212>DNA
<213>Populus trichocarpa
<400>53
gggaggattc aattcctgga aaaaagtatc aggacattgg atttcagtcc cggtggaatt 60
tccacaattg ttcaggttaa tgacttgaaa aattctcctg gaccagctaa gagagagctt 120
agacaagcta ctagacaggc acttcaattg cttcaagaca actatccaga atttgtggcc 180
aaacagatct tcatcaatgt tccctggtgg tacctaacag taaatagaat gataagtcca 240
tttttaaccc agaggaccag aagcaagttt gtctttgttg gtccttccaa atctgccgaa 300
acccttatca ggtacatagc cgctgagcaa ataccagtga agtacggagg actaagcaaa 360
gatggtgaat ttggctcagc tgatgttgtt actgagatta ccgtgaagcc agcagcaaag 420
cacactgtag aattcccagt tactgagaca tgccttttaa catgggaagt gagagttgcg 480
ggatgggatg tgagctatag tgcagaattt gtaccaagtg ctgaagatag ctacacagtg 540
atcatccaaa aggctagaaa ggttgctgca actgaagaac cagtggtttg caacagtttc 600
aaaattggtg aacctggtaa agttgttctc accattgaca actccacatc caagaagaag 660
aagaagctcc tctatcgctt gaaaaccaag cccgcttctt ctgattaatt aagggactat 720
atatattgaa acaacaatag aagattttgc ttacattctt gctgctgctg ctgctgccaa 780
ttttatcaac atgatcatat cacagcttga aggtgttctg agggtcttga tcatggagaa 840
gataaagaaa tcttgaagat gtttatttat atgtttattt ataattgaat tttgttttgg 900
tgtggaatgg attaaggatg ttgtgcaatt gaaggctaga agcatgtatg gggataggga 960
agaagctcca ttactagtgc caagaatttt ctttgtaaat tctttatggc tttctttctc 1020
tttccctgta agtatctttt ggacatatta tgatattaat gaagacagta tctttcctat 1080
<210>54
<211>159
<212>PRT
<213>Populus trichocarpa
<400>54
Met Ile Ser Pro Phe Leu Thr Gln Arg Thr Arg Ser Lys Phe Val Phe
1 5 10 15
Val Gly Pro Ser Lys Ser Ala Glu Thr Leu Ile Arg Tyr Ile Ala Ala
20 25 30
Glu Gln Ile Pro Val Lys Tyr Gly Gly Leu Ser Lys Asp Gly Glu Phe
35 40 45
Gly Ser Ala Asp Val Val Thr Glu Ile Thr Val Lys Pro Ala Ala Lys
50 55 60
His Thr Val Glu Phe Pro Val Thr Glu Thr Cys Leu Leu Thr Trp Glu
65 70 75 80
Val Arg Val Ala Gly Trp Asp Val Ser Tyr Ser Ala Glu Phe Val Pro
85 90 95
Ser Ala Glu Asp Ser Tyr Thr Val Ile Ile Gln Lys Ala Arg Lys Val
100 105 110
Ala Ala Thr Glu Glu Pro Val Val Cys Asn Ser Phe Lys Ile Gly Glu
115 120 125
Pro Gly Lys Val Val Leu Thr Ile Asp Asn Ser Thr Ser Lys Lys Lys
130 135 140
Lys Lys Leu Leu Tyr Arg Leu Lys Thr Lys Pro Ala Ser Ser Asp
145 150 155
<210>55
<211>911
<212>DNA
<213>Populus trichocarpa
<400>55
atgtgttcat caatgttcca tggtggtacc taacattcaa taagatgatc agccctttcc 60
tgacacacag gacaaagagc aagtttgttt ttgctggccc atccaagtct gctgaaacac 120
ttttcaaata catagctcct gaagaagtgc cagttcaata tggtggacta agcaaggatg 180
gcgaattcac aggtgccgat accgtcacag atgttactat taagccaaca tcaaagcaca 240
ctgttgagtt cccagtgtct gaggcatgtg ttctcgtttg ggagcttcga gtttttggtt 300
gggatgtgag ctatggagct gaattcgtgc ctagcgctga ggatggttac accgttatcg 360
tatcaaagac caggaagatt atctcatccg atgatcctgt gatctcagac acattcaaaa 420
ttggtgaacc tggcaaggtt gtgcttacca ttgataacca aacctctaag aagaagaagc 480
tcctctatag gtcaaagacc aaaccccttt ctgaatgagc ttcaagaatc aatgccttcg 540
tggggttttg aacttccatt aatttagaag acatggatgg cataacaagg ggtccttggt 600
catgatgcat gaacacgagg tgtggagttt tgttctgttt aaatttaata tattttttgt 660
tttataaagt ctgggtttgg gctgggattt gttgtttggt gtttaattgg atacaatgta 720
ctggtgaaga cagtctggga ttatcatgga ttgacagaca catttgaggt ttaagtgtga 780
agaatatgaa aatgtgtttg ccaaagttga aagactatcc ttttttgttc ttggtactct 840
tacctgtaaa atttgtgtaa atgtgatggt gttctacttc ttccttctta attaatacat 900
gtatttttta c 911
<210>56
<211>157
<212>PRT
<213>Populus trichocarpa
<400>56
Met Ile Ser Pro Phe Leu Thr His Arg Thr Lys Ser Lys Phe Val Phe
1 5 10 15
Ala Gly Pro Ser Lys Ser Ala Glu Thr Leu Phe Lys Tyr Ile Ala Pro
20 25 30
Glu Glu Val Pro Val Gln Tyr Gly Gly Leu Ser Lys Asp Gly Glu Phe
35 40 45
Thr Gly Ala Asp Thr Val Thr Asp Val Thr Ile Lys Pro Thr Ser Lys
50 55 60
His Thr Val Glu Phe Pro Val Ser Glu Ala Cys Val Leu Val Trp Glu
65 70 75 80
Leu Arg Val Phe Gly Trp Asp Val Ser Tyr Gly Ala Glu Phe Val Pro
85 90 95
Ser Ala Glu Asp Gly Tyr Thr Val Ile Val Ser Lys Thr Arg Lys Ile
100 105 110
Ile Ser Ser Asp Asp Pro Val Ile Ser Asp Thr Phe Lys Ile Gly Glu
115 120 125
Pro Gly Lys Val Val Leu Thr Ile Asp Asn Gln Thr Ser Lys Lys Lys
130 135 140
Lys Leu Leu Tyr Arg Ser Lys Thr Lys Pro Leu Ser Glu
145 150 155
<210>57
<211>840
<212>DNA
<213>Populus trichocarpa
<400>57
actagtttaa ttaaattaat cccccccccc gtggtattgc ttctttgctt caaatcagtg 60
atctcaagaa ttctccttcc ccatcaaaga aagagctcag gactgccatg agcaaagctg 120
tcaccctttt gcaggacaat tatccagaat ttgttgcaaa aaatatattc ataaatgttc 180
cattttggta ttatgctttc aacgccctgc tatctccttt cttggctcaa agaaccaaga 240
gcaaatttgt cgttgttcgc cctgccaaga ccaccgagac attgctcaag tatgttcagg 300
ccgaggaaat ccctgtccaa tatggtggct tcaagaggga gaatgatttc gagttctcca 360
gcgaagatgg tgaagtttca gaacttgtaa tcaaagctgg atcaactgaa accattgaga 420
tccctgcagc agaggttgga gccacattgc tttgggacct gacagttgtg ggatgggaag 480
tgaattacaa ggaggaattt gtgccaagtg atgaagcttc ctacaccatc atcatccaaa 540
agggcaagaa aatgagctca aatgaagagc caactcgcaa cactttcagg aacaatgaac 600
ctggaaaggt agttctgacc attcagaatt ggtcaagcaa gaagaagagg gtcctatacc 660
gatacaagac caagaagaat gcttcctatt gagacaagac tccattcttt tatattatca 720
gtcttttatt tgccgtgaca attcttttct agatagaaaa cattgattct tttcaggatt 780
gatatctata tctatatctg atcgtgtgat ttgatttgtt atttctgggg ttcctttttc 840
<210>58
<211>194
<212>PRT
<213>Populus trichocarpa
<400>58
Met Ser Lys Ala Val Thr Leu Leu Gln Asp Asn Tyr Pro Glu Phe Val
1 5 10 15
Ala Lys Asn Ile Phe Ile Asn Val Pro Phe Trp Tyr Tyr Ala Phe Asn
20 25 30
Ala Leu Leu Ser Pro Phe Leu Ala Gln Arg Thr Lys Ser Lys Phe Val
35 40 45
Val Val Arg Pro Ala Lys Thr Thr Glu Thr Leu Leu Lys Tyr Val Gln
50 55 60
Ala Glu Glu Ile Pro Val Gln Tyr Gly Gly Phe Lys Arg Glu Asn Asp
65 70 75 80
Phe Glu Phe Ser Ser Glu Asp Gly Glu Val Ser Glu Leu Val Ile Lys
85 90 95
Ala Gly Ser Thr Glu Thr Ile Glu Ile Pro Ala Ala Glu Val Gly Ala
100 105 110
Thr Leu Leu Trp Asp Leu Thr Val Val Gly Trp Glu Val Asn Tyr Lys
115 120 125
Glu Glu Phe Val Pro Ser Asp Glu Ala Ser Tyr Thr Ile Ile Ile Gln
130 135 140
Lys Gly Lys Lys Met Ser Ser Asn Glu Glu Pro Thr Arg Asn Thr Phe
145 150 155 160
Arg Asn Asn Glu Pro Gly Lys Val Val Leu Thr Ile Gln Asn Trp Ser
165 170 175
Ser Lys Lys Lys Arg Val Leu Tyr Arg Tyr Lys Thr Lys Lys Asn Ala
180 185 190
Ser Tyr
<210>59
<211>1564
<212>DNA
<213>Lycopersicum esculentum
<400>59
ggagaaagta actccaccgg aaactgaagc aactccggcg ccggcagcag agacaccatc 60
tgagccatcg gagacagaga aggtggaggc agtcgaggaa atcaaggaaa ccattgttga 120
agtaccggct gcggttgctg tgatggcctc cacggaggag ccaccggctg cagaggcaga 180
ggaaccgaaa acagagcaaa ccccaccagc agcaccagaa gaagtatcca tatggggaat 240
acccctttta gcagatgaga gaagtgatgt aatccttctc aagtttctgc gagcgagaga 300
tttcaaggtg aaagaagctt tcaccatgtt gaaaagtgtt gtcgcatgga gaaaagaatt 360
caagattgat gaactcttgg atgagaaaga attaggacaa ggacttgaaa aagttgttta 420
caatcacgga gtagacaaag aaggtcaccc tgtatgttac aatgcatttg gtgagttcca 480
agacaaagaa ttgtaccaaa acacttttgc tgatgacaaa gagaaactca ccaaattcct 540
cagatggaga attcaattca tggagaaatc catcaggaat cttgatttta gccctgatgg 600
tatcaacact tttgttcaag ttcttgatct gaagaattca cctggactct tcttttacaa 660
gaaagaactt cgccaagcca ccaatcgtgc ccttctatta ctccaggata attaccctga 720
atttgttgcc aagcaggtgt tcatcaatgt tccatggtgg tacccagctt actacaggat 780
gattaatgca tctttcacta caaggaccaa gagcaagttt gtttttgctg gtccttcaag 840
atctgctgat actctattca aatacatagc acctgaacaa gtaccagcac aatatggtgg 900
acttagcaag gagggtgaag aggaattcac cactgctgaa ccagccactg aggaaatcat 960
taagccagct tctaaacaca ccattgaatt cccagttact gagaaaagca cattggtttg 1020
ggaagcaaga gtgacagggt gggatgtatc atatggagct gaatttgtgc ctagtgctga 1080
aggtggctac accattctca tagagaaatc aagaaaagtt ggggcaaatg aatcagtgat 1140
cagcactagc tacaaggcaa gtgaagcagg caaagtggta atcacaattg acaaccaaac 1200
ttctaaaaag aagaaacttg tttacaggtc caagaacaag atctcagatt gctgagccgc 1260
cttttcttgt ttcatagatt tttcttttgg ttgatatcaa ctagtactat tattttgagc 1320
tttttggggg tatttattta tattttattg gggttatttt ttgagctatt ttgctgttgg 1380
attgtttttt ttattatatg gggaatattt attatattac aaaggggttg agatgtggag 1440
tacaagttat tgatacattt tcttctgaaa tattttgtgt aaatctaatg ctgcctctca 1500
ttttccttac atttttttgg tgatgtgaag cttaacgtta gttaatccta tacttatgtt 1560
tcct 1564
<210>60
<211>370
<212>PRT
<213>Lycopersicum esculentum
<400>60
Met Ala Ser Thr Glu Glu Pro Pro Ala Ala Glu Ala Glu Glu Pro Lys
1 5 10 15
Thr Glu Gln Thr Pro Pro Ala Ala Pro Glu Glu Val Ser Ile Trp Gly
20 25 30
Ile Pro Leu Leu Ala Asp Glu Arg Ser Asp Val Ile Leu Leu Lys Phe
35 40 45
Leu Arg Ala Arg Asp Phe Lys Val Lys Glu Ala Phe Thr Met Leu Lys
50 55 60
Ser Val Val Ala Trp Arg Lys Glu Phe Lys Ile Asp Glu Leu Leu Asp
65 70 75 80
Glu Lys Glu Leu Gly Gln Gly Leu Glu Lys Val Val Tyr Asn His Gly
85 90 95
Val Asp Lys Glu Gly His Pro Val Cys Tyr Asn Ala Phe Gly Glu Phe
100 105 110
Gln Asp Lys Glu Leu Tyr Gln Asn Thr Phe Ala Asp Asp Lys Glu Lys
115 120 125
Leu Thr Lys Phe Leu Arg Trp Arg Ile Gln Phe Met Glu Lys Ser Ile
130 135 140
Arg Asn Leu Asp Phe Ser Pro Asp Gly Ile Asn Thr Phe Val Gln Val
145 150 155 160
Leu Asp Leu Lys Asn Ser Pro Gly Leu Phe Phe Tyr Lys Lys Glu Leu
165 170 175
Arg Gln Ala Thr Asn Arg Ala Leu Leu Leu Leu Gln Asp Asn Tyr Pro
180 185 190
Glu Phe Val Ala Lys Gln Val Phe Ile Asn Val Pro Trp Trp Tyr Pro
195 200 205
Ala Tyr Tyr Arg Met Ile Asn Ala Ser Phe Thr Thr Arg Thr Lys Ser
210 215 220
Lys Phe Val Phe Ala Gly Pro Ser Arg Ser Ala Asp Thr Leu Phe Lys
225 230 235 240
Tyr Ile Ala Pro Glu Gln Val Pro Ala Gln Tyr Gly Gly Leu Ser Lys
245 250 255
Glu Gly Glu Glu Glu Phe Thr Thr Ala Glu Pro Ala Thr Glu Glu Ile
260 265 270
Ile Lys Pro Ala Ser Lys His Thr Ile Glu Phe Pro Val Thr Glu Lys
275 280 285
Ser Thr Leu Val Trp Glu Ala Arg Val Thr Gly Trp Asp Val Ser Tyr
290 295 300
Gly Ala Glu Phe Val Pro Ser Ala Glu Gly Gly Tyr Thr Ile Leu Ile
305 310 315 320
Glu Lys Ser Arg Lys Val Gly Ala Asn Glu Ser Val Ile Ser Thr Ser
325 330 335
Tyr Lys Ala Ser Glu Ala Gly Lys Val Val Ile Thr Ile Asp Asn Gln
340 345 350
Thr Ser Lys Lys Lys Lys Leu Val Tyr Arg Ser Lys Asn Lys Ile Ser
355 360 365
Asp Cys
370
<210>61
<211>2297
<212>DNA
<213>Medicago truncatula
<400>61
ttatacataa aacacatcaa cccaagccca attgtctcca actccaactc caaacccatc 60
atctcactct ctctatatta gtcttttctc aatacccaaa tatcactttt tcttaacctt 120
aaccttaacc ctaatatcat ctcactcttt tttcatatca aatggctgct gaacctcaaa 180
aacctgctga agaagttgcc acaaccacct ctgaaactgt tgttgagaaa gaacaacagg 240
ctgatggagt tgtagctgct gctgttaccg ctgccgctgt taccgctgcc accactgata 300
aggaagctgt tgctgatcct cctcctgctg ttgctgatga ggctgagaag ccggcggaag 360
ttgtggctga taaggtggcg gatgaaactg ttgttgatga aagcaaggtt tctcaatcgg 420
tttcttttaa ggaagaaact aacgtggttt ctgaacttcc tgacgttcag aaaaaagcac 480
ttgatgaact taaacaactt attcaagaag cgcttaacaa acatgaattc accgctcctc 540
cacccgctcc agtcaaagca cctgaacctg aagtagctgt aaaagaagag aaaaagcctg 600
aagaagatga aaagaaaacc gaagaggtgg tagaagagaa gaaagatgaa gcagtagttg 660
aggagaagaa ggttgatgaa gaaaaaggtt caacctctga ggaacctaaa gttgaaactg 720
ctgaacctga aaaggaggag aagaaagtgg aggaaacggt tgtagaagtt gttgagaaaa 780
tagctgcaag taccgaagaa gacggtgcga aaacagttga ggctattcag gaaagtatag 840
tatctgttcc agttactgaa ggtgaacaac ctgttgctga gcctgttgct gaagtggagg 900
ttactcctat tgtaccagaa gaagttgaaa tatggggaat tccattacta gctgatgaaa 960
gaagtgatgt gattcttctc aagttcttga gagctaggga ttttaaggtg aaggaggctt 1020
tcactatgat caaacaaacc gtgctttggc gaaaggaatt cggagtcgaa gcacttcttc 1080
aagaagatct tggaactgac tgggacaaag ttgttttcac tgatggtact gacaaagaag 1140
gtcaccctgt ttattacaat gtttttggtg agtttgagga taaggatttg tatcaaaaaa 1200
cattctctga tgaagagaag agaaccaagt tcgttcgttg gtggattcag tctttggaga 1260
aaagtgttag gaaactcgac tttgctccat ctggtatctc tactcttgtt cagattaatg 1320
atcttaaaaa ttctcctgga cttcttggta agaaagagct tagacaatct attaagcaga 1380
ctcttcagtt gcttcaggat aactatcctg aatttgttgc caaacagatt ttcatcaatg 1440
ttccttggtg gtaccttgcc ttctctagga tgatcagtcc tttcctgaca caaaggacta 1500
agagcaaatt tgtatttgct ggttcctcca aatctgctga aacccttttc aaatatatag 1560
ctcctgagca agtgccagtt aaatatggag gactgagcag agacggtgaa caggaattca 1620
ccactgctga ccctgctaca gaggttacta tcaaaccagc aactaaacat gctgttgagt 1680
tcccagtttc tgagaaaagc actttggttt gggaagtaag agttgtggat tggaatgtga 1740
gctatggagc agaatttgtg cctagtgctg aagatggata cactgtgata atccagaaga 1800
acaggaaaat tgctgcagct gatgaaacag taattagcaa caccttcaaa gttggtgaac 1860
ctggaaaagt tgtactcacc attgataacc aaacatccaa gaaaaagaag ctgctttaca 1920
ggtccaagac catacccatc tctgagtaaa aggaaaaatg tttaaattca ttatatggtc 1980
aataacatta ttgtgaaggt tattgggtgg gttacttata ttcttttata ggaatatttt 2040
cttcatttgg tttgtaaatt attttgtttc aagtatgggg ttgggaaatt ttatttacag 2100
ttattatcac ttatcaatat tcttaaatta atgtggtatt ttgttctttt gtgttggaat 2160
tggatgaaac ctttgctgga tgaagaacac aattgggttc ttttgtaaaa caaaagttta 2220
ttgtttttgt tatcttgtga ctctgtgatt gaaggattta tttgagtaat ttttttattc 2280
caattagtct agtggcc 2297
<210>62
<211>595
<212>PRT
<213>Medicago truncatula
<400>62
Met Ala Ala Glu Pro Gln Lys Pro Ala Glu Glu Val Ala Thr Thr Thr
1 5 10 15
Ser Glu Thr Val Val Glu Lys Glu Gln Gln Ala Asp Gly Val Val Ala
20 25 30
Ala Ala Val Thr Ala Ala Ala Val Thr Ala Ala Thr Thr Asp Lys Glu
35 40 45
Ala Val Ala Asp Pro Pro Pro Ala Val Ala Asp Glu Ala Glu Lys Pro
50 55 60
Ala Glu Val Val Ala Asp Lys Val Ala Asp Glu Thr Val Val Asp Glu
65 70 75 80
Ser Lys Val Ser Gln Ser Val Ser Phe Lys Glu Glu Thr Asn Val Val
85 90 95
Ser Glu Leu Pro Asp Val Gln Lys Lys Ala Leu Asp Glu Leu Lys Gln
100 105 110
Leu Ile Gln Glu Ala Leu Asn Lys His Glu Phe Thr Ala Pro Pro Pro
115 120 125
Ala Pro Val Lys Ala Pro Glu Pro Glu Val Ala Val Lys Glu Glu Lys
130 135 140
Lys Pro Glu Glu Asp Glu Lys Lys Thr Glu Glu Val Val Glu Glu Lys
145 150 155 160
Lys Asp Glu Ala Val Val Glu Glu Lys Lys Val Asp Glu Glu Lys Gly
165 170 175
Ser Thr Ser Glu Glu Pro Lys Val Glu Thr Ala Glu Pro Glu Lys Glu
180 185 190
Glu Lys Lys Val Glu Glu Thr Val Val Glu Val Val Glu Lys Ile Ala
195 200 205
Ala Ser Thr Glu Glu Asp Gly Ala Lys Thr Val Glu Ala Ile Gln Glu
210 215 220
Ser Ile Val Ser Val Pro Val Thr Glu Gly Glu Gln Pro Val Ala Glu
225 230 235 240
Pro Val Ala Glu Val Glu Val Thr Pro Ile Val Pro Glu Glu Val Glu
245 250 255
Ile Trp Gly Ile Pro Leu Leu Ala Asp Glu Arg Ser Asp Val Ile Leu
260 265 270
Leu Lys Phe Leu Arg Ala Arg Asp Phe Lys Val Lys Glu Ala Phe Thr
275 280 285
Met Ile Lys Gln Thr Val Leu Trp Arg Lys Glu Phe Gly Val Glu Ala
290 295 300
Leu Leu Gln Glu Asp Leu Gly Thr Asp Trp Asp Lys Val Val Phe Thr
305 310 315 320
Asp Gly Thr Asp Lys Glu Gly His Pro Val Tyr Tyr Asn Val Phe Gly
325 330 335
Glu Phe Glu Asp Lys Asp Leu Tyr Gln Lys Thr Phe Ser Asp Glu Glu
340 345 350
Lys Arg Thr Lys Phe Val Arg Trp Trp Ile Gln Ser Leu Glu Lys Ser
355 360 365
Val Arg Lys Leu Asp Phe Ala Pro Ser Gly Ile Ser Thr Leu Val Gln
370 375 380
Ile Asn Asp Leu Lys Asn Ser Pro Gly Leu Leu Gly Lys Lys Glu Leu
385 390 395 400
Arg Gln Ser Ile Lys Gln Thr Leu Gln Leu Leu Gln Asp Asn Tyr Pro
405 410 415
Glu Phe Val Ala Lys Gln Ile Phe Ile Asn Val Pro Trp Trp Tyr Leu
420 425 430
Ala Phe Ser Arg Met Ile Ser Pro Phe Leu Thr Gln Arg Thr Lys Ser
435 440 445
Lys Phe Val Phe Ala Gly Ser Ser Lys Ser Ala Glu Thr Leu Phe Lys
450 455 460
Tyr Ile Ala Pro Glu Gln Val Pro Val Lys Tyr Gly Gly Leu Ser Arg
465 470 475 480
Asp Gly Glu Gln Glu Phe Thr Thr Ala Asp Pro Ala Thr Glu Val Thr
485 490 495
Ile Lys Pro Ala Thr Lys His Ala Val Glu Phe Pro Val Ser Glu Lys
500 505 510
Ser Thr Leu Val Trp Glu Val Arg Val Val Asp Trp Asn Val Ser Tyr
515 520 525
Gly Ala Glu Phe Val Pro Ser Ala Glu Asp Gly Tyr Thr Val Ile Ile
530 535 540
Gln Lys Asn Arg Lys Ile Ala Ala Ala Asp Glu Thr Val Ile Ser Asn
545 550 555 560
Thr Phe Lys Val Gly Glu Pro Gly Lys Val Val Leu Thr Ile Asp Asn
565 570 575
Gln Thr Ser Lys Lys Lys Lys Leu Leu Tyr Arg Ser Lys Thr Ile Pro
580 585 590
Ile Ser Glu
595
<210>63
<211>1510
<212>DNA
<213>Beta vulgaris
<220>
<221>misc_feature
<222>(2)..(3)
<223>n is a, c, g, or t
<400>63
tnncccgggc tgcaggaatt cggcacgagc tcatttctct acatcaaaaa cacaacaaag 60
agatcaccca tggcggaaga aacccataag ccagaatcaa cggtggctga agtggtggtt 120
ccagtagccg agaaaccagc tgagaagcca gctgagaagg cagttctacc acctgaagct 180
gagaaactag ctgcagctga atcagctgaa gccgagaagc cagctgattc agccgaggct 240
aagatagctc aacaagtctc attcaaagag gagactaatg ttgcaagtga gctacctgag 300
ctacatagaa aggctctcga ggacttgaag aaacttattc aagaagccct cgagaagcac 360
gagttctctt ctcctcctcc tccgcctccg cctgctccag ctaaagttga ggagaaggcg 420
gaagagaaga aagaggaaca acctccatcc accacctcca ccaccaccac caccaccacc 480
gcggtttcag atgaggttgc tgttgctcct ccatccgaag aggccccgaa aactgacgag 540
gcctctccga aagtggagga ggagcctgca aaaatagttg agcaaccacc tacaacaccg 600
gcagaagaac ctgaaccagc aaaaacacct gaggttgttg ttgctgaaga ggagaaaact 660
ggtgaggata ttaaagaaac tatagtagtc gaggttgcga caactacagc agcaccagta 720
ctaacagaac cagaatctgt tgaggagaca ccaaaagaag ctgaagttgt agtggaagaa 780
tcaccaaagg agccagaaga agtatcaata tggggaattc cacttcttgc tgatgaaaga 840
agtgatgtaa ttctattgaa attcttaaga gcaagagatt atagagtgaa agatgctttc 900
actatgatta gaaatactgc tcgttggaga aaagaatttg aggttgattc tttacttgat 960
gaagatcttg gaaatgatta tgagaaagtt gtttttacac atggagttga taaacaaggt 1020
cgtcctgttt gttataatgt gtttggagag tttcaaaata aggaacttta tcagaatact 1080
ttctctgatg cagaaaaaag gaaaaagttc ttgagatggt tgattcaatt ccttgaaaaa 1140
actattagaa ctcttgattt tagtcctgaa ggaattaatt cttttgttct tgttaatgat 1200
ttgaagaatt ctcctgggta tggtaagaga gatctttaca aagttattga caagtttctt 1260
gagattctcc aggataatta cccagaattt gctgctaaac agttgtgcat caatgtttca 1320
tggtggtctt ggcatacaac tggatctatt tgactgtatt tacaccaagg agcaagagca 1380
agtttgtgtt tgcaagccca tctaaaactg ctgagaccct tttcaagtac atagctcctg 1440
agcaggtgcc tgttcaattt ggtgggcaca gcaagtttgg cgagcatgag ttttcccctg 1500
ctgatactgt 1510
<210>64
<211>427
<212>PRT
<213>Beta vulgaris
<400>64
Met Ala Glu Glu Thr His Lys Pro Glu Ser Thr Val Ala Glu Val Val
1 5 10 15
Val Pro Val Ala Glu Lys Pro Ala Glu Lys Pro Ala Glu Lys Ala Val
20 25 30
Leu Pro Pro Glu Ala Glu Lys Leu Ala Ala Ala Glu Ser Ala Glu Ala
35 40 45
Glu Lys Pro Ala Asp Ser Ala Glu Ala Lys Ile Ala Gln Gln Val Ser
50 55 60
Phe Lys Glu Glu Thr Asn Val Ala Ser Glu Leu Pro Glu Leu His Arg
65 70 75 80
Lys Ala Leu Glu Asp Leu Lys Lys Leu Ile Gln Glu Ala Leu Glu Lys
85 90 95
His Glu Phe Ser Ser Pro Pro Pro Pro Pro Pro Pro Ala Pro Ala Lys
100 105 110
Val Glu Glu Lys Ala Glu Glu Lys Lys Glu Glu Gln Pro Pro Ser Thr
115 120 125
Thr Ser Thr Thr Thr Thr Thr Thr Thr Ala Val Ser Asp Glu Val Ala
130 135 140
Val Ala Pro Pro Ser Glu Glu Ala Pro Lys Thr Asp Glu Ala Ser Pro
145 150 155 160
Lys Val Glu Glu Glu Pro Ala Lys Ile Val Glu Gln Pro Pro Thr Thr
165 170 175
Pro Ala Glu Glu Pro Glu Pro Ala Lys Thr Pro Glu Val Val Val Ala
180 185 190
Glu Glu Glu Lys Thr Gly Glu Asp Ile Lys Glu Thr Ile Val Val Glu
195 200 205
Val Ala Thr Thr Thr Ala Ala Pro Val Leu Thr Glu Pro Glu Ser Val
210 215 220
Glu Glu Thr Pro Lys Glu Ala Glu Val Val Val Glu Glu Ser Pro Lys
225 230 235 240
Glu Pro Glu Glu Val Ser Ile Trp Gly Ile Pro Leu Leu Ala Asp Glu
245 250 255
Arg Ser Asp Val Ile Leu Leu Lys Phe Leu Arg Ala Arg Asp Tyr Arg
260 265 270
Val Lys Asp Ala Phe Thr Met Ile Arg Asn Thr Ala Arg Trp Arg Lys
275 280 285
Glu Phe Glu Val Asp Ser Leu Leu Asp Glu Asp Leu Gly Asn Asp Tyr
290 295 300
Glu Lys Val Val Phe Thr His Gly Val Asp Lys Gln Gly Arg Pro Val
305 310 315 320
Cys Tyr Asn Val Phe Gly Glu Phe Gln Asn Lys Glu Leu Tyr Gln Asn
325 330 335
Thr Phe Ser Asp Ala Glu Lys Arg Lys Lys Phe Leu Arg Trp Leu Ile
340 345 350
Gln Phe Leu Glu Lys Thr Ile Arg Thr Leu Asp Phe Ser Pro Glu Gly
355 360 365
Ile Asn Ser Phe Val Leu Val Asn Asp Leu Lys Asn Ser Pro Gly Tyr
370 375 380
Gly Lys Arg Asp Leu Tyr Lys Val Ile Asp Lys Phe Leu Glu Ile Leu
385 390 395 400
Gln Asp Asn Tyr Pro Glu Phe Ala Ala Lys Gln Leu Cys Ile Asn Val
405 410 415
Ser Trp Trp Ser Trp His Thr Thr Gly Ser Ile
420 425
<210>65
<211>552
<212>DNA
<213>Chlamydomonas reinhardtii
<400>65
gtcgtgctac tcaagttctt gcgcgcacgg cagtggaacg tggcggccgc agttaacatg 60
cttgtcaact gtctgcggtg gcggcgcgac ttcgacgtgg ccggcctcgg cctggagacc 120
ttcccgccgc agctggcggc ggcggggcag ctcaccggac acgaccgggc cggcaaccca 180
gtcacctaca actactacgg cacgggtgtg gacttgaacg cggtgatggg cagcccgggc 240
ggtgtggcca ccttcgtgcg gtggcgggtg cggctgatgg agcaggcgat tgcgcagctg 300
gacttcgagc gcggcgtgga gcacgtcacg cagatccacg actacgccgg cgcctccatg 360
ttccgcatgg acgcgggcat caagtccgcc agccgcgaga tcatacgact gttccaggac 420
aactatcccg aactgctgtc cgccaagctg ttcctcaacg tgccgcgggt catggagttc 480
ctgttcggcg tgttcagcgg cctggcggac gccgccaccc gcgccaagtt caccatggcc 540
tcgcccgccc gc 552
<210>66
<211>184
<212>PRT
<213>Chlamydomonas reinhardtii
<400>66
Val Val Leu Leu Lys Phe Leu Arg Ala Arg Gln Trp Asn Val Ala Ala
1 5 10 15
Ala Val Asn Met Leu Val Asn Cys Leu Arg Trp Arg Arg Asp Phe Asp
20 25 30
Val Ala Gly Leu Gly Leu Glu Thr Phe Pro Pro Gln Leu Ala Ala Ala
35 40 45
Gly Gln Leu Thr Gly His Asp Arg Ala Gly Asn Pro Val Thr Tyr Asn
50 55 60
Tyr Tyr Gly Thr Gly Val Asp Leu Asn Ala Val Met Gly Ser Pro Gly
65 70 75 80
Gly Val Ala Thr Phe Val Arg Trp Arg Val Arg Leu Met Glu Gln Ala
85 90 95
Ile Ala Gln Leu Asp Phe Glu Arg Gly Val Glu His Val Thr Gln Ile
100 105 110
His Asp Tyr Ala Gly Ala Ser Met Phe Arg Met Asp Ala Gly Ile Lys
115 120 125
Ser Ala Ser Arg Glu Ile Ile Arg Leu Phe Gln Asp Asn Tyr Pro Glu
130 135 140
Leu Leu Ser Ala Lys Leu Phe Leu Asn Val Pro Arg Val Met Glu Phe
145 150 155 160
Leu Phe Gly Val Phe Ser Gly Leu Ala Asp Ala Ala Thr Arg Ala Lys
165 170 175
Phe Thr Met Ala Ser Pro Ala Arg
180
<210>67
<211>1335
<212>DNA
<213>Dictyostelium discoideum
<400>67
atgagtggat ttattaaaga tttatctcaa cctcaaagtg aagctttaaa tcaatttaaa 60
gaatatttaa ataaaaaaga aacaattgtg caaattaaat cagatattaa aaataaatta 120
ccaacaacta caacaactac aacaactact aatacagaaa cagaagaatc atcatcatca 180
tcatcaccat catcaaaaga agaagaaaaa cacttaaaaa tttggaatat taatttagaa 240
aatgattcaa aagagcgtga tattatttta ttaaaatttt tacgtgctag agaatttaaa 300
attgaaaatt caaaacaaat gttaattgat tgtttaattt ggagaaaaca aaatcaagta 360
gatgattatg aaaagatagt taatgaagca tttccagact attataaaaa cattggtacc 420
attttcaaga ctgataaaga gggtagacca gtaatgatca atcattacca tgcaattaat 480
ccagatgtta ttttcaaaga tggtgtggac caatttgttc gttggaaagt ccaacaaatg 540
gaaatcgcca ttagggatac actcatccca tcgcaatggg aaattgaaga tttaatagtc 600
attcacgact ataaagattg ctcattcttt agaatggatc cacgtataaa acaagcatct 660
aatcaaacca ttcaaaccct tcaaaataac tacccagaat ttttagctcg taaattcttt 720
attaacatcc catggttaat ggagaagttg ttctcaattt tcacagtatt cacatcagag 780
cgtacaaaaa gcaaattcat aatttgttct ggaaattatc gggaaaaact tttaaaatat 840
attgaagcag attcaatcgc tccaaaatta tctggtttcg aagataacca atcaccaatt 900
ttaaatatta aaatcaaacc tcaaaaatca cattcaattc aattaggtaa acttgatgct 960
gataaaacta ttgaatggga attttgtaca aatgagattg attctgaaat tggtgctaaa 1020
attttaattg aaccaaataa ccaaccaaca acttctaatg atattttata ttttaataat 1080
aatagtaata ataataataa taataataat tcaccaacac caagtaatag taattatcca 1140
ttcaattgtt ttctttcaat tgaaccaaga gaatttaata gtggttcaat tcaaattgaa 1200
gatgattctt attatacttt agtttttaat aatcatttaa ataaacaatg tgatttattt 1260
tatcgtataa ctataaaatc aaaaacaact cattcttcaa ctacaacttc aacaatcgaa 1320
actttaggaa attaa 1335
<210>68
<211>444
<212>PRT
<213>Dictyostelium discoideum
<400>68
Met Ser Gly Phe Ile Lys Asp Leu Ser Gln Pro Gln Ser Glu Ala Leu
1 5 10 15
Asn Gln Phe Lys Glu Tyr Leu Asn Lys Lys Glu Thr Ile Val Gln Ile
20 25 30
Lys Ser Asp Ile Lys Asn Lys Leu Pro Thr Thr Thr Thr Thr Thr Thr
35 40 45
Thr Thr Asn Thr Glu Thr Glu Glu Ser Ser Ser Ser Ser Ser Pro Ser
50 55 60
Ser Lys Glu Glu Glu Lys His Leu Lys Ile Trp Asn Ile Asn Leu Glu
65 70 75 80
Asn Asp Ser Lys Glu Arg Asp Ile Ile Leu Leu Lys Phe Leu Arg Ala
85 90 95
Arg Glu Phe Lys Ile Glu Asn Ser Lys Gln Met Leu Ile Asp Cys Leu
100 105 110
Ile Trp Arg Lys Gln Asn Gln Val Asp Asp Tyr Glu Lys Ile Val Asn
115 120 125
Glu Ala Phe Pro Asp Tyr Tyr Lys Asn Ile Gly Thr Ile Phe Lys Thr
130 135 140
Asp Lys Glu Gly Arg Pro Val Met Ile Asn His Tyr His Ala Ile Asn
145 150 155 160
Pro Asp Val Ile Phe Lys Asp Gly Val Asp Gln Phe Val Arg Trp Lys
165 170 175
Val Gln Gln Met Glu Ile Ala Ile Arg Asp Thr Leu Ile Pro Ser Gln
180 185 190
Trp Glu Ile Glu Asp Leu Ile Val Ile His Asp Tyr Lys Asp Cys Ser
195 200 205
Phe Phe Arg Met Asp Pro Arg Ile Lys Gln Ala Ser Asn Gln Thr Ile
210 215 220
Gln Thr Leu Gln Asn Asn Tyr Pro Glu Phe Leu Ala Arg Lys Phe Phe
225 230 235 240
Ile Asn Ile Pro Trp Leu Met Glu Lys Leu Phe Ser Ile Phe Thr Val
245 250 255
Phe Thr Ser Glu Arg Thr Lys Ser Lys Phe Ile Ile Cys Ser Gly Asn
260 265 270
Tyr Arg Glu Lys Leu Leu Lys Tyr Ile Glu Ala Asp Ser Ile Ala Pro
275 280 285
Lys Leu Ser Gly Phe Glu Asp Asn Gln Ser Pro Ile Leu Asn Ile Lys
290 295 300
Ile Lys Pro Gln Lys Ser His Ser Ile Gln Leu Gly Lys Leu Asp Ala
305 310 315 320
Asp Lys Thr Ile Glu Trp Glu Phe Cys Thr Asn Glu Ile Asp Ser Glu
325 330 335
Ile Gly Ala Lys Ile Leu Ile Glu Pro Asn Asn Gln Pro Thr Thr Ser
340 345 350
Asn Asp Ile Leu Tyr Phe Asn Asn Asn Ser Asn Asn Asn Asn Asn Asn
355 360 365
Asn Asn Ser Pro Thr Pro Ser Asn Ser Asn Tyr Pro Phe Asn Cys Phe
370 375 380
Leu Ser Ile Glu Pro Arg Glu Phe Asn Ser Gly Ser Ile Gln Ile Glu
385 390 395 400
Asp Asp Ser Tyr Tyr Thr Leu Val Phe Asn Asn His Leu Asn Lys Gln
405 410 415
Cys Asp Leu Phe Tyr Arg Ile Thr Ile Lys Ser Lys Thr Thr His Ser
420 425 430
Ser Thr Thr Thr Ser Thr Ile Glu Thr Leu Gly Asn
435 440
<210>69
<211>8
<212>PRT
<213>Artificial sequence
<220>
<223>Motif 1
<220>
<221>VARIANT
<222>(2)..(2)
<223>/replace="Thr"
<400>69
Leu Leu Lys Phe Leu Arg Ala Arg
1 5
<210>70
<211>8
<212>PRT
<213>Artificial sequence
<220>
<223>Motif 2
<220>
<221>VARIANT
<222>(1)..(1)
<223>/replace="Phe"
<220>
<221>VARIANT
<222>(2)..(2)
<223>/replace="Glu"
<400>70
Leu Gln Asp Asn Tyr Pro Glu Phe
1 5
<210>71
<211>169
<212>PRT
<213>Artificial sequence
<220>
<223>SEC domain
<400>71
Leu Pro Glu Leu Asp Ser Val Val Phe Tyr Arg Gly Ala Asp Arg Glu
1 5 10 15
Gly His Pro Val Cys Tyr Asn Val Tyr Gly Glu Phe Gln Asp Lys Asp
20 25 30
Leu Tyr Glu Lys Ala Phe Gly Asp Glu Glu Lys Arg Glu Arg Phe Leu
35 40 45
Lys Trp Arg Ile Gln Leu Leu Glu Arg Gly Ile Leu Ser Gln Leu Asp
50 55 60
Phe Ser Pro Ser Gly Ile Cys Ser Met Val Gln Val Thr Asp Leu Lys
65 70 75 80
Asn Ser Pro Pro Met Leu Gly Lys His Arg Ala Val Thr Arg Gln Ala
85 90 95
Val Ala Leu Leu Gln Asp Asn Tyr Pro Glu Phe Ile Ala Lys Lys Val
100 105 110
Phe Ile Asn Val Pro Trp Trp Tyr Leu Ala Ala Asn Lys Met Met Ser
115 120 125
Pro Phe Leu Thr Gln Arg Thr Lys Ser Lys Phe Ile Phe Ala Ser Pro
130 135 140
Ala Lys Ser Ala Glu Thr Leu Phe Arg Tyr Ile Ala Pro Glu Gln Val
145 150 155 160
Pro Val Gln Phe Gly Gly Leu Phe Lys
165
<210>72
<211>102
<212>PRT
<213>Artificial sequence
<220>
<223>GOLD domain
<400>72
Ser Asp Ala Val Thr Glu Leu Thr Ile Lys Pro Ser Ser Lys Glu Thr
1 5 10 15
Val Glu Ile Pro Val Thr Glu Asn Ser Thr Ile Gly Trp Glu Leu Arg
20 25 30
Val Leu Gly Trp Glu Val Ser Tyr Gly Ala Glu Phe Thr Pro Asp Ala
35 40 45
Glu Gly Gly Tyr Thr Val Ile Val Gln Lys Thr Arg Lys Val Pro Ala
50 55 60
Asn Glu Glu Pro Ile Met Lys Gly Ser Phe Lys Val Gly Glu Pro Gly
65 70 75 80
Lys Ile Val Leu Thr Ile Asn Asn Pro Ala Ser Lys Lys Lys Lys Leu
85 90 95
Leu Tyr Arg Ser Lys Val
100
<210>73
<211>51
<212>DNA
<213>Artificial sequence
<220>
<223>Primer 1
<400>73
ggggacaagt ttgtacaaaa aagcaggctt aaacaatggc ggaggagcca c 51
<210>74
<211>50
<212>DNA
<213>Artificial sequence
<220>
<223>Primer 2
<400>74
ggggaccact ttgtacaaga aagctgggtg tggtgaatct ggtgatcagg 50
<210>75
<211>2194
<212>DNA
<213>Oryza sativa
<400>75
aatccgaaaa gtttctgcac cgttttcacc ccctaactaa caatataggg aacgtgtgct 60
aaatataaaa tgagacctta tatatgtagc gctgataact agaactatgc aagaaaaact 120
catccaccta ctttagtggc aatcgggcta aataaaaaag agtcgctaca ctagtttcgt 180
tttccttagt aattaagtgg gaaaatgaaa tcattattgc ttagaatata cgttcacatc 240
tctgtcatga agttaaatta ttcgaggtag ccataattgt catcaaactc ttcttgaata 300
aaaaaatctt tctagctgaa ctcaatgggt aaagagagag atttttttta aaaaaataga 360
atgaagatat tctgaacgta ttggcaaaga tttaaacata taattatata attttatagt 420
ttgtgcattc gtcatatcgc acatcattaa ggacatgtct tactccatcc caatttttat 480
ttagtaatta aagacaattg acttattttt attatttatc ttttttcgat tagatgcaag 540
gtacttacgc acacactttg tgctcatgtg catgtgtgag tgcacctcct caatacacgt 600
tcaactagca acacatctct aatatcactc gcctatttaa tacatttagg tagcaatatc 660
tgaattcaag cactccacca tcaccagacc acttttaata atatctaaaa tacaaaaaat 720
aattttacag aatagcatga aaagtatgaa acgaactatt taggtttttc acatacaaaa 780
aaaaaaagaa ttttgctcgt gcgcgagcgc caatctccca tattgggcac acaggcaaca 840
acagagtggc tgcccacaga acaacccaca aaaaacgatg atctaacgga ggacagcaag 900
tccgcaacaa ccttttaaca gcaggctttg cggccaggag agaggaggag aggcaaagaa 960
aaccaagcat cctccttctc ccatctataa attcctcccc ccttttcccc tctctatata 1020
ggaggcatcc aagccaagaa gagggagagc accaaggaca cgcgactagc agaagccgag 1080
cgaccgcctt ctcgatccat atcttccggt cgagttcttg gtcgatctct tccctcctcc 1140
acctcctcct cacagggtat gtgcctccct tcggttgttc ttggatttat tgttctaggt 1200
tgtgtagtac gggcgttgat gttaggaaag gggatctgta tctgtgatga ttcctgttct 1260
tggatttggg atagaggggt tcttgatgtt gcatgttatc ggttcggttt gattagtagt 1320
atggttttca atcgtctgga gagctctatg gaaatgaaat ggtttaggga tcggaatctt 1380
gcgattttgt gagtaccttt tgtttgaggt aaaatcagag caccggtgat tttgcttggt 1440
gtaataaagt acggttgttt ggtcctcgat tctggtagtg atgcttctcg atttgacgaa 1500
gctatccttt gtttattccc tattgaacaa aaataatcca actttgaaga cggtcccgtt 1560
gatgagattg aatgattgat tcttaagcct gtccaaaatt tcgcagctgg cttgtttaga 1620
tacagtagtc cccatcacga aattcatgga aacagttata atcctcagga acaggggatt 1680
ccctgttctt ccgatttgct ttagtcccag aatttttttt cccaaatatc ttaaaaagtc 1740
actttctggt tcagttcaat gaattgattg ctacaaataa tgcttttata gcgttatcct 1800
agctgtagtt cagttaatag gtaatacccc tatagtttag tcaggagaag aacttatccg 1860
atttctgatc tccattttta attatatgaa atgaactgta gcataagcag tattcatttg 1920
gattattttt tttattagct ctcacccctt cattattctg agctgaaagt ctggcatgaa 1980
ctgtcctcaa ttttgttttc aaattcacat cgattatcta tgcattatcc tcttgtatct 2040
acctgtagaa gtttcttttt ggttattcct tgactgcttg attacagaaa gaaatttatg 2100
aagctgtaat cgggatagtt atactgcttg ttcttatgat tcatttcctt tgtgcagttc 2160
ttggtgtagc ttgccacttt caccagcaaa gttc 2194
<210>76
<211>1182
<212>DNA
<213>Arabidopsis thaliana
<400>76
atggcggaga tacagtcaaa tggaagggca tatgagtcat tattggaaaa ggttctttca 60
atgaacattc tttcttctga ctattttaaa gagctctatg gtttaaagac ttatcatgag 120
gtaattgatg aaatctacaa ccaagttaat catgtggagc cgtggatggg tgggaattgc 180
cgtggtcctt caacagcgta ttgtcttctc tacaaattct ttaccatgaa acttacagtg 240
aagcagatgc atggactgtt aaagcacaca gattctcctt atattagagc ggttggattc 300
ctatatttaa gatatgttgc agatgcaaag acgttgtgga catggtatga accatacatt 360
aaagatgatg aggagttttc accaggacca aatggacgga tgacgacaat gggtgtttat 420
gtacgtgatt tgctgcttgg actgtactac tttgatactt tgtttcctcg tatacctgtt 480
cctgtcatgc gccagattgt atcaaacctt gagaagatga atttaccaac taaaccttct 540
ggttcaaccg gagacatgac ccgtggctca gaagacactg cccgtcgtcc accatcagta 600
aaagcatccc tctctgcttc atttggtcag cgtgcacctc atcgtgcttc caccagaggc 660
tcttctcctg ttcgccgtcc tccaccgact ggttatgaca gaaatggagg cgatgaagta 720
caacagcggt ccccacgtag aagccagagc cgagactatt attctgacag agactcagat 780
agacaacggg aaagagagag ggagaaagac cgcgaaagag agagggggag ggatagatac 840
agagaaaggg aaagggatta tggtaatgat aggagatcaa ggcgtgacta tgatagtaga 900
agcaggcgca atgattatga ggacgacaga agtagacatg accggagaag caggagcaga 960
agcagaagta ggagcaggag tgtgcagatt gagcgtgaac cgactcctaa aagagatagt 1020
agcaacaaag agaaatcggc ggtgacagtg aacagcaatc tcgcaaagct aaaagatttg 1080
tatggagacg caagtagtca gaaaagggat gaaggatttg gaacaaggaa agattcaagt 1140
tcagaagaag tgataaagct tggtggttcc tcttggaggt ga 1182
<210>77
<211>393
<212>PRT
<213>Arabidopsis thaliana
<400>77
Met Ala Glu Ile Gln Ser Asn Gly Arg Ala Tyr Glu Ser Leu Leu Glu
1 5 10 15
Lys Val Leu Ser Met Asn Ile Leu Ser Ser Asp Tyr Phe Lys Glu Leu
20 25 30
Tyr Gly Leu Lys Thr Tyr His Glu Val Ile Asp Glu Ile Tyr Asn Gln
35 40 45
Val Asn His Val Glu Pro Trp Met Gly Gly Asn Cys Arg Gly Pro Ser
50 55 60
Thr Ala Tyr Cys Leu Leu Tyr Lys Phe Phe Thr Met Lys Leu Thr Val
65 70 75 80
Lys Gln Met His Gly Leu Leu Lys His Thr Asp Ser Pro Tyr Ile Arg
85 90 95
Ala Val Gly Phe Leu Tyr Leu Arg Tyr Val Ala Asp Ala Lys Thr Leu
100 105 110
Trp Thr Trp Tyr Glu Pro Tyr Ile Lys Asp Asp Glu Glu Phe Ser Pro
115 120 125
Gly Pro Asn Gly Arg Met Thr Thr Met Gly Val Tyr Val Arg Asp Leu
130 135 140
Leu Leu Gly Leu Tyr Tyr Phe Asp Thr Leu Phe Pro Arg Ile Pro Val
145 150 155 160
Pro Val Met Arg Gln Ile Val Ser Asn Leu Glu Lys Met Asn Leu Pro
165 170 175
Thr Lys Pro Ser Gly Ser Thr Gly Asp Met Thr Arg Gly Ser Glu Asp
180 185 190
Thr Ala Arg Arg Pro Pro Ser Val Lys Ala Ser Leu Ser Ala Ser Phe
195 200 205
Gly Gln Arg Ala Pro His Arg Ala Ser Thr Arg Gly Ser Ser Pro Val
210 215 220
Arg Arg Pro Pro Pro Thr Gly Tyr Asp Arg Asn Gly Gly Asp Glu Val
225 230 235 240
Gln Gln Arg Ser Pro Arg Arg Ser Gln Ser Arg Asp Tyr Tyr Ser Asp
245 250 255
Arg Asp Ser Asp Arg Gln Arg Glu Arg Glu Arg Glu Lys Asp Arg Glu
260 265 270
Arg Glu Arg Gly Arg Asp Arg Tyr Arg Glu Arg Glu Arg Asp Tyr Gly
275 280 285
Asn Asp Arg Arg Ser Arg Arg Asp Tyr Asp Ser Arg Ser Arg Arg Asn
290 295 300
Asp Tyr Glu Asp Asp Arg Ser Arg His Asp Arg Arg Ser Arg Ser Arg
305 310 315 320
Ser Arg Ser Arg Ser Arg Ser Val Gln Ile Glu Arg Glu Pro Thr Pro
325 330 335
Lys Arg Asp Ser Ser Asn Lys Glu Lys Ser Ala Val Thr Val Asn Ser
340 345 350
Asn Leu Ala Lys Leu Lys Asp Leu Tyr Gly Asp Ala Ser Ser Gln Lys
355 360 365
Arg Asp Glu Gly Phe Gly Thr Arg Lys Asp Ser Ser Ser Glu Glu Val
370 375 380
Ile Lys Leu Gly Gly Ser Ser Trp Arg
385 390
<210>78
<211>1182
<212>DNA
<213>Arabidopsis thaliana
<400>78
atggcggaga tacagtcaaa tggaagggca tatgagtcat tattggaaaa ggttctttca 60
atgaacattc tttcttctga ctattttaaa gagctctatg gtttaaagac ttatcatgag 120
gtaattgatg aaatctacaa ccaagttaat catgtggagc cgtggatggg tgggaattgc 180
cgtggtcctt caacagcgta ttgtcttctc tacaaattct ttaccatgaa acttacagtg 240
aagcagatgc atggactgtt aaagcacaca gattctcctt atattagagc ggttggattc 300
ctatatttaa gatatgttgc agatgcaaag acgttgtgga catggtatga accatacatt 360
aaagatgatg aggagttttc accaggatca aatggacgga tgacgacaat gggtgtttat 420
gtacgtgatt tgctgcttgg actgtactac tttgatactt tgtttcctcg tatacctgtt 480
cctgtcatgc gccagattgt atcaaacctt gagaagatga atttaccaac taaaccttct 540
ggttcaaccg gagacatgac ccgtggctca gaagacactg cccgtcgtcc accatcagta 600
aaagcatccc tctctgtttc atttggtcag cgtgcacctc atcgtgcttc caccagaggc 660
tcttctcctg ttcgccgtcc tccaccgact ggttatgaca gaaatggagg cgatgaagta 720
caacagcggt ccccacgtag aagccagagc cgagactatt attctgacag agactcagat 780
agacaacggg aaagagagag ggagaaagac cgcgaaagag agagggggag ggatagatac 840
agagaaaggg aaagggatta tggtaatgat aggagatcaa ggcgtgacta tgatagtaga 900
agcaggcgca atgattatga ggacgacaga agtagacatg accggagaag caggagcaga 960
agcagaagta ggagcaggag tgtgcagatt gagcgtgaac cgactcctaa aagagatagt 1020
agcaacaaag agaaatcggc ggtgacagtg aacagcaatc tcgcaaagct aaaagatttg 1080
tatggagacg caagtagtca gaaaagggat gaaggatttg gaacaaggaa agattcaagt 1140
tcagaagaag tgataaagct tggtggttcc tcttggaggt ga 1182
<210>79
<211>393
<212>PRT
<213>Arabidopsis thaliana
<400>79
Met Ala Glu Ile Gln Ser Asn Gly Arg Ala Tyr Glu Ser Leu Leu Glu
1 5 10 15
Lys Val Leu Ser Met Asn Ile Leu Ser Ser Asp Tyr Phe Lys Glu Leu
20 25 30
Tyr Gly Leu Lys Thr Tyr His Glu Val Ile Asp Glu Ile Tyr Asn Gln
35 40 45
Val Asn His Val Glu Pro Trp Met Gly Gly Asn Cys Arg Gly Pro Ser
50 55 60
Thr Ala Tyr Cys Leu Leu Tyr Lys Phe Phe Thr Met Lys Leu Thr Val
65 70 75 80
Lys Gln Met His Gly Leu Leu Lys His Thr Asp Ser Pro Tyr Ile Arg
85 90 95
Ala Val Gly Phe Leu Tyr Leu Arg Tyr Val Ala Asp Ala Lys Thr Leu
100 105 110
Trp Thr Trp Tyr Glu Pro Tyr Ile Lys Asp Asp Glu Glu Phe Ser Pro
115 120 125
Gly Ser Asn Gly Arg Met Thr Thr Met Gly Val Tyr Val Arg Asp Leu
130 135 140
Leu Leu Gly Leu Tyr Tyr Phe Asp Thr Leu Phe Pro Arg Ile Pro Val
145 150 155 160
Pro Val Met Arg Gln Ile Val Ser Asn Leu Glu Lys Met Asn Leu Pro
165 170 175
Thr Lys Pro Ser Gly Ser Thr Gly Asp Met Thr Arg Gly Ser Glu Asp
180 185 190
Thr Ala Arg Arg Pro Pro Ser Val Lys Ala Ser Leu Ser Val Ser Phe
195 200 205
Gly Gln Arg Ala Pro His Arg Ala Ser Thr Arg Gly Ser Ser Pro Val
210 215 220
Arg Arg Pro Pro Pro Thr Gly Tyr Asp Arg Asn Gly Gly Asp Glu Val
225 230 235 240
Gln Gln Arg Ser Pro Arg Arg Ser Gln Ser Arg Asp Tyr Tyr Ser Asp
245 250 255
Arg Asp Ser Asp Arg Gln Arg Glu Arg Glu Arg Glu Lys Asp Arg Glu
260 265 270
Arg Glu Arg Gly Arg Asp Arg Tyr Arg Glu Arg Glu Arg Asp Tyr Gly
275 280 285
Asn Asp Arg Arg Ser Arg Arg Asp Tyr Asp Ser Arg Ser Arg Arg Asn
290 295 300
Asp Tyr Glu Asp Asp Arg Ser Arg His Asp Arg Arg Ser Arg Ser Arg
305 310 315 320
Ser Arg Ser Arg Ser Arg Ser Val Gln Ile Glu Arg Glu Pro Thr Pro
325 330 335
Lys Arg Asp Ser Ser Asn Lys Glu Lys Ser Ala Val Thr Val Asn Ser
340 345 350
Asn Leu Ala Lys Leu Lys Asp Leu Tyr Gly Asp Ala Ser Ser Gln Lys
355 360 365
Arg Asp Glu Gly Phe Gly Thr Arg Lys Asp Ser Ser Ser Glu Glu Val
370 375 380
Ile Lys Leu Gly Gly Ser Ser Trp Arg
385 390
<210>80
<211>1158
<212>DNA
<213>Arabidopsis thaliana
<400>80
atggcggaga tacagtcaaa tggaagggca tatgagtcat tattggaaaa ggttctttca 60
atgaacattc tttcttctga ctattttaaa gagctctatg gtttaaagac ttatcatgag 120
gtaattgatg aaatctacaa ccaagttaat catgtggagc cgtggatggg tgggaattgc 180
cgtggtcctt caacagcgta ttgtcttctc tacaaattct ttaccatgaa acttacagtg 240
aagcagatgc atggactgtt aaagcacaca gattctcctt atattagagc ggttggattc 300
ctatatttaa gatatgttgc agatgcaaag acgttgtgga catggtatga accatacatt 360
aaagatgatg aggagttttc accaggatca aatggacgga tgacgacaat gggtgtttat 420
tactactttg atactttgtt tcctcgtata cctgttcctg tcatgcgcca gattgtatca 480
aaccttgaga agatgaattt accaactaaa ccttctggtt caaccggaga catgacccgt 540
ggctcagaag acactgcccg tcgtccacca tcagtaaaag catccctctc tgtttcattt 600
ggtcagcgtg cacctcatcg tgcttccacc agaggctctt ctcctgttcg ccgtcctcca 660
ccgactggtt atgacagaaa tggaggcgat gaagtacaac agcggtcccc acgtagaagc 720
cagagccgag actattattc tgacagagac tcagatagac aacgggaaag agagagggag 780
aaagaccgcg aaagagagag ggggagggat agatacagag aaagggaaag ggattatggt 840
aatgatagga gatcaaggcg tgactatgat agtagaagca ggcgcaatga ttatgaggac 900
gacagaagta gacatgaccg gagaagcagg agcagaagca gaagtaggag caggagtgtg 960
cagattgagc gtgaaccgac tcctaaaaga gatagtagca acaaagagaa atcggcggtg 1020
acagtgaaca gcaatctcgc aaagctaaaa gatttgtatg gagacgcaag tagtcagaaa 1080
agggatgaag gatttggaac aaggaaagat tcaagttcag aagaagtgat aaagcttggt 1140
ggttcctctt ggaggtga 1158
<210>81
<211>385
<212>PRT
<213>Arabidopsis thaliana
<400>81
Met Ala Glu Ile Gln Ser Asn Gly Arg Ala Tyr Glu Ser Leu Leu Glu
1 5 10 15
Lys Val Leu Ser Met Asn Ile Leu Ser Ser Asp Tyr Phe Lys Glu Leu
20 25 30
Tyr Gly Leu Lys Thr Tyr His Glu Val Ile Asp Glu Ile Tyr Asn Gln
35 40 45
Val Asn His Val Glu Pro Trp Met Gly Gly Asn Cys Arg Gly Pro Ser
50 55 60
Thr Ala Tyr Cys Leu Leu Tyr Lys Phe Phe Thr Met Lys Leu Thr Val
65 70 75 80
Lys Gln Met His Gly Leu Leu Lys His Thr Asp Ser Pro Tyr Ile Arg
85 90 95
Ala Val Gly Phe Leu Tyr Leu Arg Tyr Val Ala Asp Ala Lys Thr Leu
100 105 110
Trp Thr Trp Tyr Glu Pro Tyr Ile Lys Asp Asp Glu Glu Phe Ser Pro
115 120 125
Gly Ser Asn Gly Arg Met Thr Thr Met Gly Val Tyr Tyr Tyr Phe Asp
130 135 140
Thr Leu Phe Pro Arg Ile Pro Val Pro Val Met Arg Gln Ile Val Ser
145 150 155 160
Asn Leu Glu Lys Met Asn Leu Pro Thr Lys Pro Ser Gly Ser Thr Gly
165 170 175
Asp Met Thr Arg Gly Ser Glu Asp Thr Ala Arg Arg Pro Pro Ser Val
180 185 190
Lys Ala Ser Leu Ser Val Ser Phe Gly Gln Arg Ala Pro His Arg Ala
195 200 205
Ser Thr Arg Gly Ser Ser Pro Val Arg Arg Pro Pro Pro Thr Gly Tyr
210 215 220
Asp Arg Asn Gly Gly Asp Glu Val Gln Gln Arg Ser Pro Arg Arg Ser
225 230 235 240
Gln Ser Arg Asp Tyr Tyr Ser Asp Arg Asp Ser Asp Arg Gln Arg Glu
245 250 255
Arg Glu Arg Glu Lys Asp Arg Glu Arg Glu Arg Gly Arg Asp Arg Tyr
260 265 270
Arg Glu Arg Glu Arg Asp Tyr Gly Asn Asp Arg Arg Ser Arg Arg Asp
275 280 285
Tyr Asp Ser Arg Ser Arg Arg Asn Asp Tyr Glu Asp Asp Arg Ser Arg
290 295 300
His Asp Arg Arg Ser Arg Ser Arg Ser Arg Ser Arg Ser Arg Ser Val
305 310 315 320
Gln Ile Glu Arg Glu Pro Thr Pro Lys Arg Asp Ser Ser Asn Lys Glu
325 330 335
Lys Ser Ala Val Thr Val Asn Ser Asn Leu Ala Lys Leu Lys Asp Leu
340 345 350
Tyr Gly Asp Ala Ser Ser Gln Lys Arg Asp Glu Gly Phe Gly Thr Arg
355 360 365
Lys Asp Ser Ser Ser Glu Glu Val Ile Lys Leu Gly Gly Ser Ser Trp
370 375 380
Arg
385
<210>82
<211>680
<212>DNA
<213>Arabidopsis thaliana
<400>82
caaaccttga gaagatgaat ttaccaacta aaccttctgg ttcaaccgga gacatgaccc 60
gtggctcaga agacactgcc cgtcgtccac catcagtaaa agcatctctc tctgtttcat 120
ttggtcagcg tgcacctcat cgtgcttcca ccagaggctc ttctcctgtt cgccgtcctc 180
caccgactgg ttatgacaga aatggaggcg atgaagtaca acagcggtcc ccacgtagaa 240
gccagagccg agactattat tctgacagag actcagatag acaacgggaa agagagaggg 300
agaaagaccg cgaaagagag agggggaggg atagatacag agaaagggag agggattatg 360
gtaatgatag gagatcaagg cgcgactatg atagtagaag caggcgcaat gattatgagg 420
acgacagaag tagacatgac cggagaagca ggagcagaag cagaagtagg agcaggagtg 480
tgcagattga gcgtgaaccg actcctaaaa gagatagtag caacaaagag aaatcggcgg 540
tgacagtgaa cagcaatctc gcaaagctaa aagatttgta tggagacgca agtagtcaga 600
aaagggatga aggatttgga acaaggaaag attcaagttc agaagaagtg ataaagcttg 660
gtggttcctc ttggaggtga 680
<210>83
<211>225
<212>PRT
<213>Arabidopsis thaliana
<400>83
Asn Leu Glu Lys Met Asn Leu Pro Thr Lys Pro Ser Gly Ser Thr Gly
1 5 10 15
Asp Met Thr Arg Gly Ser Glu Asp Thr Ala Arg Arg Pro Pro Ser Val
20 25 30
Lys Ala Ser Leu Ser Val Ser Phe Gly Gln Arg Ala Pro His Arg Ala
35 40 45
Ser Thr Arg Gly Ser Ser Pro Val Arg Arg Pro Pro Pro Thr Gly Tyr
50 55 60
Asp Arg Asn Gly Gly Asp Glu Val Gln Gln Arg Ser Pro Arg Arg Ser
65 70 75 80
Gln Ser Arg Asp Tyr Tyr Ser Asp Arg Asp Ser Asp Arg Gln Arg Glu
85 90 95
Arg Glu Arg Glu Lys Asp Arg Glu Arg Glu Arg Gly Arg Asp Arg Tyr
100 105 110
Arg Glu Arg Glu Arg Asp Tyr Gly Asn Asp Arg Arg Ser Arg Arg Asp
115 120 125
Tyr Asp Ser Arg Ser Arg Arg Asn Asp Tyr Glu Asp Asp Arg Ser Arg
130 135 140
His Asp Arg Arg Ser Arg Ser Arg Ser Arg Ser Arg Ser Arg Ser Val
145 150 155 160
Gln Ile Glu Arg Glu Pro Thr Pro Lys Arg Asp Ser Ser Asn Lys Glu
165 170 175
Lys Ser Ala Val Thr Val Asn Ser Asn Leu Ala Lys Leu Lys Asp Leu
180 185 190
Tyr Gly Asp Ala Ser Ser Gln Lys Arg Asp Glu Gly Phe Gly Thr Arg
195 200 205
Lys Asp Ser Ser Ser Glu Glu Val Ile Lys Leu Gly Gly Ser Ser Trp
210 215 220
Arg
225
<210>84
<211>1068
<212>DNA
<213>Arabidopsis thaliana
<400>84
atggcaaaca gaacagatcc gttggcaaag aatataagag gaacgaatcc gcagaatctg 60
gtagagaaga ttgtgcgaac gaagatttat cagcacacct tttggaagga gcagtgcttt 120
ggtctcacgg cggagacatt ggtggacaaa gctatggagc tcgaccatct aggtggtacc 180
tttggtggta gccgcaagcc tactccgttc ctttgcctca tattgaagat gcttcaaatc 240
cagcctgaga aggaaattgt cgtggagttc ataaaaaatg atgactacaa atatgttcgt 300
attcttggtg cgttctatct gcgtctcact gggactgatg ttgatgtcta tcgctacctc 360
gaacctctct acaatgacta ccggaaagtg agacaaaagt tatctgatgg gaagttttcg 420
ctgacacatg tggacgaagt cattgaggaa cttctaacca aggattattc ttgtgatatt 480
gcaatgccac gtttgaagaa aaggtggacg cttgaacaga atggtttatt agagccaagg 540
aaaagtgttt tggaagacga ctttgaagaa gaggaagaaa aggaggagaa tgaagggatt 600
gctgatggat ctgaagatga gatggatcag cgccgtaaga gtcctgaaag agaaagagaa 660
agagacagag acaggagacg cgacagtcat agacacaggg atcgtgatta tgacagagac 720
tatgatatgg atcgagatca tgacagagac tatgaaagag aacgtgggca tggtcgagac 780
cgggataggg agagagacag ggatcactat agagagcgag atagggacag ggaaagaggc 840
agagatagag aacgagacag aagagacagg gcaaggcgca gaagtagaag caggagtagg 900
gatcgtaaga gacatgaaac tgatgatgtg cgggatcggg aagaacctaa gaaaaagaaa 960
gaaaagaagg agaagatgaa ggaagatgga accgatcatc caaatcctga aattgcagag 1020
atgaatagac tgagagcatc actgggaatg aaacccctca gggactga 1068
<210>85
<211>355
<212>PRT
<213>Arabidopsis thaliana
<400>85
Met Ala Asn Arg Thr Asp Pro Leu Ala Lys Asn Ile Arg Gly Thr Asn
1 5 10 15
Pro Gln Asn Leu Val Glu Lys Ile Val Arg Thr Lys Ile Tyr Gln His
20 25 30
Thr Phe Trp Lys Glu Gln Cys Phe Gly Leu Thr Ala Glu Thr Leu Val
35 40 45
Asp Lys Ala Met Glu Leu Asp His Leu Gly Gly Thr Phe Gly Gly Ser
50 55 60
Arg Lys Pro Thr Pro Phe Leu Cys Leu Ile Leu Lys Met Leu Gln Ile
65 70 75 80
Gln Pro Glu Lys Glu Ile Val Val Glu Phe Ile Lys Asn Asp Asp Tyr
85 90 95
Lys Tyr Val Arg Ile Leu Gly Ala Phe Tyr Leu Arg Leu Thr Gly Thr
100 105 110
Asp Val Asp Val Tyr Arg Tyr Leu Glu Pro Leu Tyr Asn Asp Tyr Arg
115 120 125
Lys Val Arg Gln Lys Leu Ser Asp Gly Lys Phe Ser Leu Thr His Val
130 135 140
Asp Glu Val Ile Glu Glu Leu Leu Thr Lys Asp Tyr Ser Cys Asp Ile
145 150 155 160
Ala Met Pro Arg Leu Lys Lys Arg Trp Thr Leu Glu Gln Asn Gly Leu
165 170 175
Leu Glu Pro Arg Lys Ser Val Leu Glu Asp Asp Phe Glu Glu Glu Glu
180 185 190
Glu Lys Glu Glu Asn Glu Gly Ile Ala Asp Gly Ser Glu Asp Glu Met
195 200 205
Asp Gln Arg Arg Lys Ser Pro Glu Arg Glu Arg Glu Arg Asp Arg Asp
210 215 220
Arg Arg Arg Asp Ser His Arg His Arg Asp Arg Asp Tyr Asp Arg Asp
225 230 235 240
Tyr Asp Met Asp Arg Asp His Asp Arg Asp Tyr Glu Arg Glu Arg Gly
245 250 255
His Gly Arg Asp Arg Asp Arg Glu Arg Asp Arg Asp His Tyr Arg Glu
260 265 270
Arg Asp Arg Asp Arg Glu Arg Gly Arg Asp Arg Glu Arg Asp Arg Arg
275 280 285
Asp Arg Ala Arg Arg Arg Ser Arg Ser Arg Ser Arg Asp Arg Lys Arg
290 295 300
His Glu Thr Asp Asp Val Arg Asp Arg Glu Glu Pro Lys Lys Lys Lys
305 310 315 320
Glu Lys Lys Glu Lys Met Lys Glu Asp Gly Thr Asp His Pro Asn Pro
325 330 335
Glu Ile Ala Glu Met Asn Arg Leu Arg Ala Ser Leu Gly Met Lys Pro
340 345 350
Leu Arg Asp
355
<210>86
<211>1185
<212>DNA
<213>Brachypodium sylvaticum
<400>86
atggagatac agtcgtccgg gaggcccatc gaggtgctca tggagaaggt gctgtccatg 60
aacatcgtct cctcggacta cttcaaggag ctctacaaga tcaagacgta ccacgaggtc 120
atcgacgaga tctacaacca ggtcgaccac gtcgagccgt ggatgaccgg caactgccgc 180
ggcccatcca ccgccttctg cctcctctac aagctcttca ccatgaagct caccatgaac 240
cagatgcacg gcctgctcaa gcaccctgat tccccttaca tcagagctat tggatttctc 300
tacctacgat acgttgcgga accaaagacg ctatggactt ggtatgagcc ctacattaaa 360
gatgatgagg agttttcccc tgggtcgaat ggtaaaatga caactatggg cgtttatgtg 420
cgtgatgtcc tccttggcca gtactacttc gacagtcttc ttccgcgagt gcctctccta 480
attttgcgac aggtcagtgc ccatcttgag aagatgaagc tcccaacaaa gcagtcaggg 540
atgactgggg attcaagtcg ccttggttca gatgatactg cccggcgtcc tccttcggtg 600
aaggcctctt tgtctgtctc ttttggtcag cgtgcgccac accgtgcgtc cacaagggac 660
tcgtctccag ttcgaaagac attgccttct atacgggaaa gggaaaggag tcatgacggt 720
gatcgtgcaa aatctccacc caggaagcgc cgaagtgaaa gtcgggagcg taatcgtgaa 780
actgagaggg accgttcgga tcgtgatcgt ggtagatata acgatagaga acaaggtcgg 840
caaagccgtg acagcagaga tcgtgattac catcgttcga gctatgcaga aagagatgtt 900
gaaagacgag gccatgaaag gagggacagg aactctgatc gaaatggacg ttcgagcgcc 960
cacagaagca ggagtaggag caggagtcca agccgtggca gaaccaacgg ggaccaccgt 1020
cgctctagcc catttggtaa agcacccgag tcatccaact tggctaagtt gaaggatctt 1080
tacggcgatg cgtcaaatgc aaaggaagat gcaggcgatg gtagagctcg cagggattcc 1140
ggagctgaag aggtaatcag attgggaggt gcaaggtgga ggtga 1185
<210>87
<211>394
<212>PRT
<213>Brachypodium sylvaticum
<400>87
Met Glu Ile Gln Ser Ser Gly Arg Pro Ile Glu Val Leu Met Glu Lys
1 5 10 15
Val Leu Ser Met Asn Ile Val Ser Ser Asp Tyr Phe Lys Glu Leu Tyr
20 25 30
Lys Ile Lys Thr Tyr His Glu Val Ile Asp Glu Ile Tyr Asn Gln Val
35 40 45
Asp His Val Glu Pro Trp Met Thr Gly Asn Cys Arg Gly Pro Ser Thr
50 55 60
Ala Phe Cys Leu Leu Tyr Lys Leu Phe Thr Met Lys Leu Thr Met Asn
65 70 75 80
Gln Met His Gly Leu Leu Lys His Pro Asp Ser Pro Tyr Ile Arg Ala
85 90 95
Ile Gly Phe Leu Tyr Leu Arg Tyr Val Ala Glu Pro Lys Thr Leu Trp
100 105 110
Thr Trp Tyr Glu Pro Tyr Ile Lys Asp Asp Glu Glu Phe Ser Pro Gly
115 120 125
Ser Asn Gly Lys Met Thr Thr Met Gly Val Tyr Val Arg Asp Val Leu
130 135 140
Leu Gly Gln Tyr Tyr Phe Asp Ser Leu Leu Pro Arg Val Pro Leu Leu
145 150 155 160
Ile Leu Arg Gln Val Ser Ala His Leu Glu Lys Met Lys Leu Pro Thr
165 170 175
Lys Gln Ser Gly Met Thr Gly Asp Ser Ser Arg Leu Gly Ser Asp Asp
180 185 190
Thr Ala Arg Arg Pro Pro Ser Val Lys Ala Ser Leu Ser Val Ser Phe
195 200 205
Gly Gln Arg Ala Pro His Arg Ala Ser Thr Arg Asp Ser Ser Pro Val
210 215 220
Arg Lys Thr Leu Pro Ser Ile Arg Glu Arg Glu Arg Ser His Asp Gly
225 230 235 240
Asp Arg Ala Lys Ser Pro Pro Arg Lys Arg Arg Ser Glu Ser Arg Glu
245 250 255
Arg Asn Arg Glu Thr Glu Arg Asp Arg Ser Asp Arg Asp Arg Gly Arg
260 265 270
Tyr Asn Asp Arg Glu Gln Gly Arg Gln Ser Arg Asp Ser Arg Asp Arg
275 280 285
Asp Tyr His Arg Ser Ser Tyr Ala Glu Arg Asp Val Glu Arg Arg Gly
290 295 300
His Glu Arg Arg Asp Arg Asn Ser Asp Arg Asn Gly Arg Ser Ser Ala
305 310 315 320
His Arg Ser Arg Ser Arg Ser Arg Ser Pro Ser Arg Gly Arg Thr Asn
325 330 335
Gly Asp His Arg Arg Ser Ser Pro Phe Gly Lys Ala Pro Glu Ser Ser
340 345 350
Asn Leu Ala Lys Leu Lys Asp Leu Tyr Gly Asp Ala Ser Asn Ala Lys
355 360 365
Glu Asp Ala Gly Asp Gly Arg Ala Arg Arg Asp Ser Gly Ala Glu Glu
370 375 380
Val Ile Arg Leu Gly Gly Ala Arg Trp Arg
385 390
<210>88
<211>489
<212>DNA
<213>Brachypodium sylvaticum
<400>88
atggagatac agtcgtccgg gaggcccatc gaggtgctca tggagaaggt gctgtccatg 60
aacatcgtct cctcggacta cttcaaggag ctctacaaga tcaagacgta ccacgaggtc 120
atcgacgaga tctacaacca ggtcgaccac gtcgagccgt ggatgaccgg caactgccgc 180
ggcccatcca ccgccttctg cctcctctac aagctcttca ccatgaagct caccatgaac 240
cagatgcacg gcctgctcaa gcaccctgat tccccttaca tcagagctat tggatttctc 300
tacctacgat acgttgcgga accaaagacg ctatggactt ggtatgagcc ctacattaaa 360
gatgatgagg agttttcccc tgggtcgaat ggtaaaatga caactatggg cgtttatgtg 420
cgtgatgtcc tccttggcca ggtatatctc ttaaagtatg ctcccacttc ttccattaaa 480
catatttga 489
<210>89
<211>162
<212>PRT
<213>Brachypodium sylvaticum
<400>89
Met Glu Ile Gln Ser Ser Gly Arg Pro Ile Glu Val Leu Met Glu Lys
1 5 10 15
Val Leu Ser Met Asn Ile Val Ser Ser Asp Tyr Phe Lys Glu Leu Tyr
20 25 30
Lys Ile Lys Thr Tyr His Glu Val Ile Asp Glu Ile Tyr Asn Gln Val
35 40 45
Asp His Val Glu Pro Trp Met Thr Gly Asn Cys Arg Gly Pro Ser Thr
50 55 60
Ala Phe Cys Leu Leu Tyr Lys Leu Phe Thr Met Lys Leu Thr Met Asn
65 70 75 80
Gln Met His Gly Leu Leu Lys His Pro Asp Ser Pro Tyr Ile Arg Ala
85 90 95
Ile Gly Phe Leu Tyr Leu Arg Tyr Val Ala Glu Pro Lys Thr Leu Trp
100 105 110
Thr Trp Tyr Glu Pro Tyr Ile Lys Asp Asp Glu Glu Phe Ser Pro Gly
115 120 125
Ser Asn Gly Lys Met Thr Thr Met Gly Val Tyr Val Arg Asp Val Leu
130 135 140
Leu Gly Gln Val Tyr Leu Leu Lys Tyr Ala Pro Thr Ser Ser Ile Lys
145 150 155 160
His Ile
<210>90
<211>1116
<212>DNA
<213>Chlamydomonas reinhardtii
<400>90
atggaaatcc atggctccaa caccaccttc aacctcgaga acgtgctgcg tcaaaacatc 60
ctgagctcgg actactacaa ggggacctgc tcggaactga gcaactgttc agacattgtg 120
gacgaaatct acgagtctgt cgatcatgtg gaaccttgga tgagcggcaa cgcgcgcggg 180
ccttccacgg ccttctgcct tctgcaccgc ctcttcacgc tcaagctctc agcaaaggag 240
gtgaaaggca tgctggacca caaggactct ccctacatcc gcgcagtggg cttcctgtac 300
ctgcgctacg tgggggaccc gaagacgctg tggagctggg tggcgccgta cgtgaaggat 360
caggagaaat tttcgccgag cgggccgaac gagaaggagg tggccatggg cgactacgta 420
cgtgacctgc tgctctccca gtactacttc gagaccatct tcccgcgcat cccgaagccc 480
gtgcaagacc aaatcaacga cgagctgacg aagcgcagcc tggccacgac ggccaagggc 540
aatggcggcg ccggcggcgc tgaccgccgc ggcatggacg actccggcaa ccgtcgaccc 600
gcctcggtga aggcgtcgct gtcggtcgcg ttcggccagc gcgcgcccaa ccgctccggt 660
gctcgcgagg agggtcgcgg ccgggaccca tcgcttgcgc agcgcgacgg cactgcagcg 720
gctcgcggcg gccgcggctc cgcctcgccg gagccgccac gcgaccggcg ggaggcgccg 780
gcaccgcggc gggatttcga ccgggagcgg gacgtgcgcg gaggcggtgg tggcggcgct 840
gaccgccgtg acaaccggga ctacggccgc gaccgggacc gtggtcgcga ctacgacaag 900
agccgggatt atgacaagag ccgggattac gccaagggca gggactatga ccgggggcgg 960
gactacgggc gcggcggtgg cgctggtggt ggcggccgcg gctacgatga ccgccgggac 1020
gaacggcggc gcagccggag ccggagccgc agccgggacc gcaacggcgg cggtgctcgg 1080
gacgcgcgcg atgtattcaa ggacgcccgg cggtag 1116
<210>91
<211>371
<212>PRT
<213>Chlamydomonas reinhardtii
<400>91
Met Glu Ile His Gly Ser Asn Thr Thr Phe Asn Leu Glu Asn Val Leu
1 5 10 15
Arg Gln Asn Ile Leu Ser Ser Asp Tyr Tyr Lys Gly Thr Cys Ser Glu
20 25 30
Leu Ser Asn Cys Ser Asp Ile Val Asp Glu Ile Tyr Glu Ser Val Asp
35 40 45
His Val Glu Pro Trp Met Ser Gly Asn Ala Arg Gly Pro Ser Thr Ala
50 55 60
Phe Cys Leu Leu His Arg Leu Phe Thr Leu Lys Leu Ser Ala Lys Glu
65 70 75 80
Val Lys Gly Met Leu Asp His Lys Asp Ser Pro Tyr Ile Arg Ala Val
85 90 95
Gly Phe Leu Tyr Leu Arg Tyr Val Gly Asp Pro Lys Thr Leu Trp Ser
100 105 110
Trp Val Ala Pro Tyr Val Lys Asp Gln Glu Lys Phe Ser Pro Ser Gly
115 120 125
Pro Asn Glu Lys Glu Val Ala Met Gly Asp Tyr Val Arg Asp Leu Leu
130 135 140
Leu Ser Gln Tyr Tyr Phe Glu Thr Ile Phe Pro Arg Ile Pro Lys Pro
145 150 155 160
Val Gln Asp Gln Ile Asn Asp Glu Leu Thr Lys Arg Ser Leu Ala Thr
165 170 175
Thr Ala Lys Gly Asn Gly Gly Ala Gly Gly Ala Asp Arg Arg Gly Met
180 185 190
Asp Asp Ser Gly Asn Arg Arg Pro Ala Ser Val Lys Ala Ser Leu Ser
195 200 205
Val Ala Phe Gly Gln Arg Ala Pro Asn Arg Ser Gly Ala Arg Glu Glu
210 215 220
Gly Arg Gly Arg Asp Pro Ser Leu Ala Gln Arg Asp Gly Thr Ala Ala
225 230 235 240
Ala Arg Gly Gly Arg Gly Ser Ala Ser Pro Glu Pro Pro Arg Asp Arg
245 250 255
Arg Glu Ala Pro Ala Pro Arg Arg Asp Phe Asp Arg Glu Arg Asp Val
260 265 270
Arg Gly Gly Gly Gly Gly Gly Ala Asp Arg Arg Asp Asn Arg Asp Tyr
275 280 285
Gly Arg Asp Arg Asp Arg Gly Arg Asp Tyr Asp Lys Ser Arg Asp Tyr
290 295 300
Asp Lys Ser Arg Asp Tyr Ala Lys Gly Arg Asp Tyr Asp Arg Gly Arg
305 310 315 320
Asp Tyr Gly Arg Gly Gly Gly Ala Gly Gly Gly Gly Arg Gly Tyr Asp
325 330 335
Asp Arg Arg Asp Glu Arg Arg Arg Ser Arg Ser Arg Ser Arg Ser Arg
340 345 350
Asp Arg Asn Gly Gly Gly Ala Arg Asp Ala Arg Asp Val Phe Lys Asp
355 360 365
Ala Arg Arg
370
<210>92
<211>1676
<212>DNA
<213>Hordeum vulgare
<400>92
attcggcacg aggcctcccc tttcctcgcc tccttcatcc atactccggc gatctcccag 60
ctaccgtcga atcccaaaca tcacaccctc cgccgtcgcc gaagccgata cgagttcatc 120
ctctacagag agtcgtcatc ccctctggtt tgggggtctc cgcaagccgc cctcgctctc 180
ggaagccaag gaggtcgtag ggtttgcgcg catcctccgt gtttgtgttc gctgaaatat 240
ggagatacag acttcaggaa agcccatcga tatgttgatg gagaaggttc tttgtatgaa 300
tattctttct tctgattact tcaaggagct ctacaggatg aagacctatc atgaggtcat 360
tgacgagatc tataaccaag ttgatcatgt ggagccttgg atgactggca attgcagggg 420
tccttccact gcattttgtc tcctgtacaa gttcttcaca atgaagctta ctgtgaaaca 480
gatgcatggt ttgttgaagc atcctgactc cccgtacatt agagctatag gatttttgta 540
tcttcgatat gttgcagatc caaagatcct atggacatgg tatgagccct acttgaagga 600
tgatgaggaa ttctcccctg gatctaatgg tcgcatgaca accatgggtg tatttgtgcg 660
tgatcttata cttggacagt actactttga tagtatcctt ccaagagttc ctgttccagt 720
agttcgtcaa gtaacagcca atcttgagaa gatgaagctg cctaccaagc tttctggggt 780
gactggagac agtcgccacg gatcagagga tactgcccgt cgcccccctt ctgttaaagc 840
ttctttgtca gtttcttttg gacagcgtgc gccacaccgt gcttccacac gggattcttc 900
cccagtgcga cgaacagtca cccaagatga tcaacggaga tcatcttccc catttcgtcg 960
tagtgcaagt cgggaggggc cttacagtga ccgttcaatt cacgaccgag aaggtaaccg 1020
ttcaagccgt gaccgagata ctgaccattc aagccgtgac cgagatactg tccgttcaag 1080
ccgtgaccga gatactggcc gttcaagccg tgaccgagat actggccttt caagccgtga 1140
cagagagcgt gattatgacc gtgacagcag ggattgtgac tattacaggt tcaggcattc 1200
agaagaaaaa aggaattacc gaagcgaaca tgacaatagt agacacagac gctccagctc 1260
atgtcatagg agcagaagcc ggagtcggag caggagcagg agcaggaatg agcatcgttc 1320
cagtccattt ggggatacaa gcaaagagaa ggctgctgct gcctcgagca acctagctaa 1380
gctgaaagac ctgtacggcg acgtagctga gaagaaggat gatggtgatg ccaggcggct 1440
tcaccatgat tcatgtgccg aagaggttat taggttggga ggccctaggt ggagataaat 1500
atgaaccgcc gactctgtca tccaggtcat gctgttactg caccgttcga tttacctgct 1560
tttgcattga cattagtgag atcactcttt gtaaacactc ggtactttat gtggttcatt 1620
aatccatatc ttttggacgt gcagtcaatc tatcaaattc atcaacgtaa aaaaag 1676
<210>93
<211>365
<212>PRT
<213>Hordeum vulgare
<400>93
Met Glu Ile Gln Thr Ser Gly Lys Pro Ile Asp Met Leu Met Glu Lys
1 5 10 15
Val Leu Cys Met Asn Ile Leu Ser Ser Asp Tyr Phe Lys Glu Leu Tyr
20 25 30
Arg Met Lys Thr Tyr His Glu Val Ile Asp Glu Ile Tyr Asn Gln Val
35 40 45
Asp His Val Glu Pro Trp Met Thr Gly Asn Cys Arg Gly Pro Ser Thr
50 55 60
Ala Phe Cys Leu Leu Tyr Lys Phe Phe Thr Met Lys Leu Thr Val Lys
65 70 75 80
Gln Met His Gly Leu Leu Lys His Pro Asp Ser Pro Tyr Ile Arg Ala
85 90 95
Ile Gly Phe Leu Tyr Leu Arg Tyr Val Ala Asp Pro Lys Ile Leu Trp
100 105 110
Thr Trp Tyr Glu Pro Tyr Leu Lys Asp Asp Glu Glu Phe Ser Pro Gly
115 120 125
Ser Asn Gly Arg Met Thr Thr Met Gly Val Phe Val Arg Asp Leu Ile
130 135 140
Leu Gly Gln Tyr Tyr Phe Asp Ser Ile Leu Pro Arg Val Pro Val Pro
145 150 155 160
Val Val Arg Gln Val Thr Ala Asn Leu Glu Lys Met Lys Leu Pro Thr
165 170 175
Lys Leu Ser Gly Val Thr Gly Asp Ser Arg His Gly Ser Glu Asp Thr
180 185 190
Ala Arg Arg Pro Pro Ser Val Lys Ala Ser Leu Ser Val Ser Phe Gly
195 200 205
Gln Arg Ala Pro His Arg Ala Ser Thr Arg Asp Ser Ser Pro Val Arg
210 215 220
Arg Thr Val Thr Gln Asp Asp Gln Arg Arg Ser Ser Ser Pro Phe Arg
225 230 235 240
Arg Ser Ala Ser Arg Glu Gly Pro Tyr Ser Asp Arg Ser Ile His Asp
245 250 255
Arg Glu Gly Asn Arg Ser Ser Arg Asp Arg Asp Thr Asp His Ser Ser
260 265 270
Arg Asp Arg Asp Thr Val Arg Ser Ser Arg Asp Arg Asp Thr Gly Arg
275 280 285
Ser Ser Arg Asp Arg Asp Thr Gly Leu Ser Ser Arg Asp Arg Glu Arg
290 295 300
Asp Tyr Asp Arg Asp Ser Arg Asp Cys Asp Tyr Tyr Arg Phe Arg His
305 310 315 320
Ser Glu Glu Lys Arg Asn Tyr Arg Ser Glu His Asp Asn Ser Arg His
325 330 335
Arg Arg Ser Ser Ser Cys His Arg Ser Arg Ser Arg Ser Arg Ser Arg
340 345 350
Ser Arg Ser Arg Asn Glu His Arg Ser Ser Pro Phe Gly
355 360 365
<210>94
<211>1824
<212>DNA
<213>Solanum lycopersicum
<400>94
gagcttttgt atcccctcga tctccggttc ctctctcctt cttcttctct tcgtcgtcaa 60
ttttccattt tccctcatcg attctccggc ggaaaaccga aaaaaggtaa ttcataatct 120
gaacctaaac cttttccgat atcattttta ttgttggcca cagaaggtag gaaaagataa 180
cactctggtg tccaagtcac taagggcatg gctgagctta agacttctgg gagacctata 240
gaccagttgt tggagaaggt tctctgcatg aacattctat cttctgatta cttcagagac 300
cttttgcgcc tgaaaactta tcatgaagtg attgatgaaa tctataatca agttgaccat 360
gtggaaccat ggatgactgg caactgtcgt ggtccttcaa cagccttctg ccttctctac 420
aagttcttca caatgaaact tactgtcaag caaatgcatg gcctgttaaa gcatccagat 480
tctccttaca ttagagctat tgggttcctt tatctgagat atcttggtga ttttaagaca 540
ttatggggtt ggtatgagcc ttacctcaaa gatgatgagg aattctctcc tggatccagt 600
gggcaaatga ccacaatggg tgtatatgtg cgtgacttat ttctcgggca gtattatttt 660
gacacactac taccccgcat tcctgttcct gtcgtgcgga cagcagttgc cagtctcgaa 720
aaaatgaatc tgccgaccaa actttctggg tcgattgggg attctagtcg tggatctgag 780
gaaacttctc gccggccacc ttctgtcaaa gcttcccttt cagtgtcctt tggtcagcgg 840
gcacctcatc gtgcatcaac tagagattca tctcccatcc gaagaacaat tgcaccacca 900
tcctatgata aggatggtgc aaatggttca agacgttccc ccagcatgcg ccggagtcaa 960
agccgtgatt tatctgaccg ggaaaattct gaaagggaca ggggccggga ccgggacagg 1020
gacagggaca gggacaggga cagggacagg gaaagaacta gggacagaga acgtgatagg 1080
gatagggata gggataggta tagggaccag gaaagagaaa gggatagggg cagggatcgg 1140
gatagagata gaaggtatga taacgaaaga gatcgtgaaa gggacagaga caggaggcat 1200
gattatgaca gagaccgggg aagggataga gacaggaggt atgactatga tcgaaggtca 1260
attgagagaa gcagaagaga ctatgacagg agcaggagcc gtagtaggag tagaagccac 1320
agccgaagct tgcatgatca aggtacaagg cttgaccagc agcgaactcc acctagggat 1380
gagagcaagg agaagaaggc tgcatctagc aatctggcca agcttaaaga tctatatggc 1440
gacttcggca ataaaaagga gaacataggt gatgacaggg ctccaaatag ggatactagt 1500
actgaggagg ttatcagact tggtggttct acatggaggt agttggttac atttacgttt 1560
tgtaagagtt tggatactct cactgtctca ccttctcaga agagcacatt gacagctagc 1620
cttttgttga aagggaatat gcaaactgat agctgcagag aggagatgct gtttcatgtt 1680
ttctgcagtc ggcagagctg cttgtaaact ggatttcctt tactattatt ttagacttgt 1740
gttacaatat tggatggatt tcaattgtcc ctttgtttgt taaaacatta ttactactta 1800
aagcaattga caaacttata agcc 1824
<210>95
<211>444
<212>PRT
<213>Solanum lycopersicum
<400>95
Met Ala Glu Leu Lys Thr Ser Gly Arg Pro Ile Asp Gln Leu Leu Glu
1 5 10 15
Lys Val Leu Cys Met Asn Ile Leu Ser Ser Asp Tyr Phe Arg Asp Leu
20 25 30
Leu Arg Leu Lys Thr Tyr His Glu Val Ile Asp Glu Ile Tyr Asn Gln
35 40 45
Val Asp His Val Glu Pro Trp Met Thr Gly Asn Cys Arg Gly Pro Ser
50 55 60
Thr Ala Phe Cys Leu Leu Tyr Lys Phe Phe Thr Met Lys Leu Thr Val
65 70 75 80
Lys Gln Met His Gly Leu Leu Lys His Pro Asp Ser Pro Tyr Ile Arg
85 90 95
Ala Ile Gly Phe Leu Tyr Leu Arg Tyr Leu Gly Asp Phe Lys Thr Leu
100 105 110
Trp Gly Trp Tyr Glu Pro Tyr Leu Lys Asp Asp Glu Glu Phe Ser Pro
115 120 125
Gly Ser Ser Gly Gln Met Thr Thr Met Gly Val Tyr Val Arg Asp Leu
130 135 140
Phe Leu Gly Gln Tyr Tyr Phe Asp Thr Leu Leu Pro Arg Ile Pro Val
145 150 155 160
Pro Val Val Arg Thr Ala Val Ala Ser Leu Glu Lys Met Asn Leu Pro
165 170 175
Thr Lys Leu Ser Gly Ser Ile Gly Asp Ser Ser Arg Gly Ser Glu Glu
180 185 190
Thr Ser Arg Arg Pro Pro Ser Val Lys Ala Ser Leu Ser Val Ser Phe
195 200 205
Gly Gln Arg Ala Pro His Arg Ala Ser Thr Arg Asp Ser Ser Pro Ile
210 215 220
Arg Arg Thr Ile Ala Pro Pro Ser Tyr Asp Lys Asp Gly Ala Asn Gly
225 230 235 240
Ser Arg Arg Ser Pro Ser Met Arg Arg Ser Gln Ser Arg Asp Leu Ser
245 250 255
Asp Arg Glu Asn Ser Glu Arg Asp Arg Gly Arg Asp Arg Asp Arg Asp
260 265 270
Arg Asp Arg Asp Arg Asp Arg Asp Arg Glu Arg Thr Arg Asp Arg Glu
275 280 285
Arg Asp Arg Asp Arg Asp Arg Asp Arg Tyr Arg Asp Gln Glu Arg Glu
290 295 300
Arg Asp Arg Gly Arg Asp Arg Asp Arg Asp Arg Arg Tyr Asp Asn Glu
305 310 315 320
Arg Asp Arg Glu Arg Asp Arg Asp Arg Arg His Asp Tyr Asp Arg Asp
325 330 335
Arg Gly Arg Asp Arg Asp Arg Arg Tyr Asp Tyr Asp Arg Arg Ser Ile
340 345 350
Glu Arg Ser Arg Arg Asp Tyr Asp Arg Ser Arg Ser Arg Ser Arg Ser
355 360 365
Arg Ser His Ser Arg Ser Leu His Asp Gln Gly Thr Arg Leu Asp Gln
370 375 380
Gln Arg Thr Pro Pro Arg Asp Glu Ser Lys Glu Lys Lys Ala Ala Ser
385 390 395 400
Ser Asn Leu Ala Lys Leu Lys Asp Leu Tyr Gly Asp Phe Gly Asn Lys
405 410 415
Lys Glu Asn Ile Gly Asp Asp Arg Ala Pro Asn Arg Asp Thr Ser Thr
420 425 430
Glu Glu Val Ile Arg Leu Gly Gly Ser Thr Trp Arg
435 440
<210>96
<211>1157
<212>DNA
<213>Medicago truncatula
<400>96
atggcaaatc gcactgatcc agcagcgaag agtattcgag gcacaaaccc tcaaaacctt 60
gttgaaaaaa ttctccgctc aaagatctat cagcacactt attggaaaga acaatgcttc 120
ggcttaacag cagaaaccct agtcgacaaa gccatggagc tcgaccacct cggcggaact 180
tacggtggca accgcaaacc cactcccttc atgtgcctcg tcatgaaaat gcttcagatt 240
caacccgaga aagaaatcgt catcgaattc atcaaaaacg atgattacaa gtatgtgagg 300
atactgggtg cattttattt gcgtcttact ggatctgata cggatgtgta ccattatctg 360
gagccgttgt ataatgatta taggaaactg cggcggaaat taccggatgg acagtttgct 420
ttgacacatg ttgatgaggt tattgatgaa cttcttacaa ctgattattc ctgtgatatt 480
gctatgcccc gtattaagaa aaggtggact cttgaatctc ttggtgcctt agaacctaga 540
caaagtgcac ttgaagagga ttttgaggag gaagaggaaa atgaggataa tgaacagcct 600
gctgaggagc ctgagaagga ttataatcgt gggcgaagcc ctgcaaggga aagagatagg 660
gatagaagac gtgatagtca tagacacagg gatcgtgact atgacagaga atatgataga 720
gattatgaca gagagcgagg acgtggccga gatagagatc gggacagaga tagggaaaag 780
gaaagggaca gagataggga gagggacaga gaccgatatc gtctgaggga agaaaaggat 840
tatggtcgtg agagagaagg tagggagcgc gagaggagag acagagatcg tgaccgtggt 900
aggaggagga gctactcaag gagtcgaagt agaagcaggg atcgcaagga tcatgatggt 960
ggggactaca gaaagagaca tgctcgaagt agcgtaagtc caagaagaga tggagctgag 1020
gatggtgagc caaagaagaa gaaggaaaag aaagaaaaga aggaaaagaa ggatgacggg 1080
accgaccatc cagatccaga gattgcagaa gcaaacagga tacgagcatc actgggtttg 1140
aaaccactta agatgtg 1157
<210>97
<211>385
<212>PRT
<213>Medicago truncatula
<400>97
Met Ala Asn Arg Thr Asp Pro Ala Ala Lys Ser Ile Arg Gly Thr Asn
1 5 10 15
Pro Gln Asn Leu Val Glu Lys Ile Leu Arg Ser Lys Ile Tyr Gln His
20 25 30
Thr Tyr Trp Lys Glu Gln Cys Phe Gly Leu Thr Ala Glu Thr Leu Val
35 40 45
Asp Lys Ala Met Glu Leu Asp His Leu Gly Gly Thr Tyr Gly Gly Asn
50 55 60
Arg Lys Pro Thr Pro Phe Met Cys Leu Val Met Lys Met Leu Gln Ile
65 70 75 80
Gln Pro Glu Lys Glu Ile Val Ile Glu Phe Ile Lys Asn Asp Asp Tyr
85 90 95
Lys Tyr Val Arg Ile Leu Gly Ala Phe Tyr Leu Arg Leu Thr Gly Ser
100 105 110
Asp Thr Asp Val Tyr His Tyr Leu Glu Pro Leu Tyr Asn Asp Tyr Arg
115 120 125
Lys Leu Arg Arg Lys Leu Pro Asp Gly Gln Phe Ala Leu Thr His Val
130 135 140
Asp Glu Val Ile Asp Glu Leu Leu Thr Thr Asp Tyr Ser Cys Asp Ile
145 150 155 160
Ala Met Pro Arg Ile Lys Lys Arg Trp Thr Leu Glu Ser Leu Gly Ala
165 170 175
Leu Glu Pro Arg Gln Ser Ala Leu Glu Glu Asp Phe Glu Glu Glu Glu
180 185 190
Glu Asn Glu Asp Asn Glu Gln Pro Ala Glu Glu Pro Glu Lys Asp Tyr
195 200 205
Asn Arg Gly Arg Ser Pro Ala Arg Glu Arg Asp Arg Asp Arg Arg Arg
210 215 220
Asp Ser His Arg His Arg Asp Arg Asp Tyr Asp Arg Glu Tyr Asp Arg
225 230 235 240
Asp Tyr Asp Arg Glu Arg Gly Arg Gly Arg Asp Arg Asp Arg Asp Arg
245 250 255
Asp Arg Glu Lys Glu Arg Asp Arg Asp Arg Glu Arg Asp Arg Asp Arg
260 265 270
Tyr Arg Leu Arg Glu Glu Lys Asp Tyr Gly Arg Glu Arg Glu Gly Arg
275 280 285
Glu Arg Glu Arg Arg Asp Arg Asp Arg Asp Arg Gly Arg Arg Arg Ser
290 295 300
Tyr Ser Arg Ser Arg Ser Arg Ser Arg Asp Arg Lys Asp His Asp Gly
305 310 315 320
Gly Asp Tyr Arg Lys Arg His Ala Arg Ser Ser Val Ser Pro Arg Arg
325 330 335
Asp Gly Ala Glu Asp Gly Glu Pro Lys Lys Lys Lys Glu Lys Lys Glu
340 345 350
Lys Lys Glu Lys Lys Asp Asp Gly Thr Asp His Pro Asp Pro Glu Ile
355 360 365
Ala Glu Ala Asn Arg Ile Arg Ala Ser Leu Gly Leu Lys Pro Leu Lys
370 375 380
Met
385
<210>98
<211>1305
<212>DNA
<213>Oryza sativa
<400>98
atggagatac aaacttcagg gaagcccatt gatctgctga tggagaaggt tctttgtatg 60
aacattatgt cttctgatta cttcaaggag ctctacaggc tgaagaccta tcatgaggtc 120
attgatgaaa tatacaatca agttgatcat gtggagcctt ggatgactgg caattgcagg 180
ggcccctcca ctgcattctg tctcctctac aagttcttca ccatgaagct tactgtcaaa 240
cagatgcatg gtttgttgaa gcatcctgat tccccataca ttagagctat agggttcttg 300
tatcttcgat atgttgcaga tccgaagatc ttgtggacgt ggtatgagcc ctacttgaag 360
gatgatgagg aattctcccc tggatctaat ggtcgcatga caaccatggg tgtttatgtg 420
cgtgatctta tacttggaca gtactacttc gatagtcttc ttccaagagt tcctcttcca 480
gtaattcgtc aagtgacatc caatcttgag aagatgaagt tgcccactaa gctttctggg 540
attactggag agtctaatcg tcatggatca gaagatactg cccgccggcc tccttccgtg 600
aaggcttctc tgtcagtttc ctttggacag cgtgctccgc atcgtgcatc cacacgggag 660
tcatctccag ttcggaggac agtcacccat gatggccatc gtaaatcttc ctcaccatct 720
cgccgtagcg gaagccgcga ggttcctgat cgtgatcgat caagccgtga ccgttctagt 780
cgtgactatg accgttcaag ccatgaccgt gatcgtgacc attccagtcg tgactatgac 840
cgtccaagcc atgaccgtga ccgtgatcgt gaccgttcca gtcgtgacta tgaccgttca 900
agtcgtgacc gggatcatga tagagacatc agagactatc atcggcgtga tcgtgacagc 960
agggaccgtg actataggtc taggcattca tccgaaagac aagatgaccg aagggaccgt 1020
gaccgtgagg gtagcaggca cagacggtcc agctctcggc acagaagcag aagccgcagc 1080
cgcagccgca gccgcagcag aagcaggagc cgcagcagaa atgaggagag atccagtcct 1140
tttggcaatg caggcaaaga aaagactgct gccatctcga gcaacctagc aaagctcaag 1200
gacttgtatg gtgatgtaac tgagaagaag gacgacggtg aagcccctcg ccgtgattcg 1260
tgcgcggagg aggttatcag gttgggtggc ccgagatgga gatag 1305
<210>99
<211>434
<212>PRT
<213>Oryza sativa
<400>99
Met Glu Ile Gln Thr Ser Gly Lys Pro Ile Asp Leu Leu Met Glu Lys
1 5 10 15
Val Leu Cys Met Asn Ile Met Ser Ser Asp Tyr Phe Lys Glu Leu Tyr
20 25 30
Arg Leu Lys Thr Tyr His Glu Val Ile Asp Glu Ile Tyr Asn Gln Val
35 40 45
Asp His Val Glu Pro Trp Met Thr Gly Asn Cys Arg Gly Pro Ser Thr
50 55 60
Ala Phe Cys Leu Leu Tyr Lys Phe Phe Thr Met Lys Leu Thr Val Lys
65 70 75 80
Gln Met His Gly Leu Leu Lys His Pro Asp Ser Pro Tyr Ile Arg Ala
85 90 95
Ile Gly Phe Leu Tyr Leu Arg Tyr Val Ala Asp Pro Lys Ile Leu Trp
100 105 110
Thr Trp Tyr Glu Pro Tyr Leu Lys Asp Asp Glu Glu Phe Ser Pro Gly
115 120 125
Ser Asn Gly Arg Met Thr Thr Met Gly Val Tyr Val Arg Asp Leu Ile
130 135 140
Leu Gly Gln Tyr Tyr Phe Asp Ser Leu Leu Pro Arg Val Pro Leu Pro
145 150 155 160
Val Ile Arg Gln Val Thr Ser Asn Leu Glu Lys Met Lys Leu Pro Thr
165 170 175
Lys Leu Ser Gly Ile Thr Gly Glu Ser Asn Arg His Gly Ser Glu Asp
180 185 190
Thr Ala Arg Arg Pro Pro Ser Val Lys Ala Ser Leu Ser Val Ser Phe
195 200 205
Gly Gln Arg Ala Pro His Arg Ala Ser Thr Arg Glu Ser Ser Pro Val
210 215 220
Arg Arg Thr Val Thr His Asp Gly His Arg Lys Ser Ser Ser Pro Ser
225 230 235 240
Arg Arg Ser Gly Ser Arg Glu Val Pro Asp Arg Asp Arg Ser Ser Arg
245 250 255
Asp Arg Ser Ser Arg Asp Tyr Asp Arg Ser Ser His Asp Arg Asp Arg
260 265 270
Asp His Ser Ser Arg Asp Tyr Asp Arg Pro Ser His Asp Arg Asp Arg
275 280 285
Asp Arg Asp Arg Ser Ser Arg Asp Tyr Asp Arg Ser Ser Arg Asp Arg
290 295 300
Asp His Asp Arg Asp Ile Arg Asp Tyr His Arg Arg Asp Arg Asp Ser
305 310 315 320
Arg Asp Arg Asp Tyr Arg Ser Arg His Ser Ser Glu Arg Gln Asp Asp
325 330 335
Arg Arg Asp Arg Asp Arg Glu Gly Ser Arg His Arg Arg Ser Ser Ser
340 345 350
Arg His Arg Ser Arg Ser Arg Ser Arg Ser Arg Ser Arg Ser Arg Ser
355 360 365
Arg Ser Arg Ser Arg Asn Glu Glu Arg Ser Ser Pro Phe Gly Asn Ala
370 375 380
Gly Lys Glu Lys Thr Ala Ala Ile Ser Ser Asn Leu Ala Lys Leu Lys
385 390 395 400
Asp Leu Tyr Gly Asp Val Thr Glu Lys Lys Asp Asp Gly Glu Ala Pro
405 410 415
Arg Arg Asp Ser Cys Ala Glu Glu Val Ile Arg Leu Gly Gly Pro Arg
420 425 430
Trp Arg
<210>100
<211>1179
<212>DNA
<213>Oryza sativa
<400>100
atggcgaacc gcacggaccc cctggcgaag agcatccacg ggacgaaccc tcagaacctg 60
gtggagaaga tcgtccggtc caagatctac cagagcacct actggaagga gcagtgcttt 120
ggcctcaccg ccgagaccct cgtcgacaag gccatggagc tcgaccacac cggcggcacc 180
tacggcggca accgcaagcc cacccccttc ctctgcctcg ccctcaagat gctccagatc 240
cagcccgaca aggacatcgt cgtcgagttc atcaagaacg aggattacaa gtatgtccgt 300
gttcttggtg ccttctacct tcgcctcact gccaccgtcg ccgacgtcta ccaatacctc 360
gagccgctct acaacgacta ccgcaagatc aggcacaagc tcagtgatgg aaagtttacc 420
ctgacccacg tcgacgagtt cattgacgac ctcctcacca aggactactc ctgcgatacg 480
gccctccccc gcatccagaa aagatgggtt cttgaaactt ctggaactct agaaccaaga 540
agaagtgcac ttgaagatga ttttgaggaa gaggaggaag acaaggagga tgaacaacct 600
atggatatag atgagccaaa tggtcgtgaa aagcatgatc attatcgtgg aaggagccct 660
actagagatc gagacaggga gaggaaacat gaaagacacc acagggaccg agattacgac 720
agagatcggg attatggtag gggacgggaa agagaccgag atagagaccg tgaaagagat 780
agagacaggg atagagatcg ggatcgggat cgggatcgag accgtcatcg catacgagat 840
gaggactaca gtcgagatag ggaccgagca agagataggg atggcaggga aagagaacgc 900
tgggacagag accgtgggag gcgcaggagc cgttcaagga gcaggagcag ggatcgacga 960
gaaagagacc gagaagatgg agagtaccgt aggaggcgtg atcggggtag tgccagtcct 1020
cgaggtcatg cggaggatgg tggctcaaga gatgagccga agaagagaaa ggaaaagaaa 1080
gagaagaagg gtgaaggaaa tgcaccagat ccaaatgacc cagagattat agaaatgaac 1140
aagctccgag cctctctagg gttgaaacca ctgaagtag 1179
<210>101
<211>392
<212>PRT
<213>Oryza sativa
<400>101
Met Ala Asn Arg Thr Asp Pro Leu Ala Lys Ser Ile His Gly Thr Asn
1 5 10 15
Pro Gln Asn Leu Val Glu Lys Ile Val Arg Ser Lys Ile Tyr Gln Ser
20 25 30
Thr Tyr Trp Lys Glu Gln Cys Phe Gly Leu Thr Ala Glu Thr Leu Val
35 40 45
Asp Lys Ala Met Glu Leu Asp His Thr Gly Gly Thr Tyr Gly Gly Asn
50 55 60
Arg Lys Pro Thr Pro Phe Leu Cys Leu Ala Leu Lys Met Leu Gln Ile
65 70 75 80
Gln Pro Asp Lys Asp Ile Val Val Glu Phe Ile Lys Asn Glu Asp Tyr
85 90 95
Lys Tyr Val Arg Val Leu Gly Ala Phe Tyr Leu Arg Leu Thr Ala Thr
100 105 110
Val Ala Asp Val Tyr Gln Tyr Leu Glu Pro Leu Tyr Asn Asp Tyr Arg
115 120 125
Lys Ile Arg His Lys Leu Ser Asp Gly Lys Phe Thr Leu Thr His Val
130 135 140
Asp Glu Phe Ile Asp Asp Leu Leu Thr Lys Asp Tyr Ser Cys Asp Thr
145 150 155 160
Ala Leu Pro Arg Ile Gln Lys Arg Trp Val Leu Glu Thr Ser Gly Thr
165 170 175
Leu Glu Pro Arg Arg Ser Ala Leu Glu Asp Asp Phe Glu Glu Glu Glu
180 185 190
Glu Asp Lys Glu Asp Glu Gln Pro Met Asp Ile Asp Glu Pro Asn Gly
195 200 205
Arg Glu Lys His Asp His Tyr Arg Gly Arg Ser Pro Thr Arg Asp Arg
210 215 220
Asp Arg Glu Arg Lys His Glu Arg His His Arg Asp Arg Asp Tyr Asp
225 230 235 240
Arg Asp Arg Asp Tyr Gly Arg Gly Arg Glu Arg Asp Arg Asp Arg Asp
245 250 255
Arg Glu Arg Asp Arg Asp Arg Asp Arg Asp Arg Asp Arg Asp Arg Asp
260 265 270
Arg Asp Arg His Arg Ile Arg Asp Glu Asp Tyr Ser Arg Asp Arg Asp
275 280 285
Arg Ala Arg Asp Arg Asp Gly Arg Glu Arg Glu Arg Trp Asp Arg Asp
290 295 300
Arg Gly Arg Arg Arg Ser Arg Ser Arg Ser Arg Ser Arg Asp Arg Arg
305 310 315 320
Glu Arg Asp Arg Glu Asp Gly Glu Tyr Arg Arg Arg Arg Asp Arg Gly
325 330 335
Ser Ala Ser Pro Arg Gly His Ala Glu Asp Gly Gly Ser Arg Asp Glu
340 345 350
Pro Lys Lys Arg Lys Glu Lys Lys Glu Lys Lys Gly Glu Gly Asn Ala
355 360 365
Pro Asp Pro Asn Asp Pro Glu Ile Ile Glu Met Asn Lys Leu Arg Ala
370 375 380
Ser Leu Gly Leu Lys Pro Leu Lys
385 390
<210>102
<211>915
<212>DNA
<213>Ostreococcus tauri
<400>102
atgacgtact ggaaggagaa gtgcttcggc gtgagcgccg aggcgttggt cgatctcgcg 60
gtcgacctca ggtcggtggg tgggatttac ggcgggaaca acagagcgac ggagtttttg 120
tgcctcacgc tgaagctgtt gcagatacag cctgagaagg agatcgtgtt agagtttatt 180
aagaatgagg atcacaagta cgtcaggttg ctcggcgcgt tttacctacg gttggtgggg 240
aaaccgacgg acgtgtacag atacctcgag ccgctgttga acgactatag aaaggttcgg 300
tatcgcacgc gtgatgggaa gtacgcactg acgcatgtgg atgagtttgt gaacaatttg 360
ttgacgaagg atatgttttg cgacgtgacg ctcccacgcg tgccgcatcg ccaggtgtta 420
gaggccgcgg gagcgcttga gccacgcgta tctgcgcttg aggaggatat cgcagatttg 480
gaggaagagc tcgagagcgc ggtggaggag gcaataggtc aacggatgaa catggatgtc 540
gacgcaggcg aggccgctgc cgcggcgtcc actcgaggtg cgcgcgagga cggcgagatc 600
gttgcgtcgg gatcgaagcg ttcgcgcgag cacgatggcg ttcggtatcg agaatgtgac 660
gatagcgacg gtgacaggta cgtccgaagg cgcgagcggt cgagatctag gagtcgcgac 720
cgcgtgccgg cacgccgcga cgacgctcgg cccggagttt tggctagtgg cgaagagatg 780
gatcacagag agaagaagga gaagaaagag aagaaggaaa aacgagagaa gaaggagaag 840
accgagatgg accccgaaat cgcagaggcg aatgcgatca gggccaagct cgggttgaag 900
ccgcttcgtg gatga 915
<210>103
<211>304
<212>PRT
<213>Ostreococcus tauri
<400>103
Met Thr Tyr Trp Lys Glu Lys Cys Phe Gly Val Ser Ala Glu Ala Leu
1 5 10 15
Val Asp Leu Ala Val Asp Leu Arg Ser Val Gly Gly Ile Tyr Gly Gly
20 25 30
Asn Asn Arg Ala Thr Glu Phe Leu Cys Leu Thr Leu Lys Leu Leu Gln
35 40 45
Ile Gln Pro Glu Lys Glu Ile Val Leu Glu Phe Ile Lys Asn Glu Asp
50 55 60
His Lys Tyr Val Arg Leu Leu Gly Ala Phe Tyr Leu Arg Leu Val Gly
65 70 75 80
Lys Pro Thr Asp Val Tyr Arg Tyr Leu Glu Pro Leu Leu Asn Asp Tyr
85 90 95
Arg Lys Val Arg Tyr Arg Thr Arg Asp Gly Lys Tyr Ala Leu Thr His
100 105 110
Val Asp Glu Phe Val Asn Asn Leu Leu Thr Lys Asp Met Phe Cys Asp
115 120 125
Val Thr Leu Pro Arg Val Pro His Arg Gln Val Leu Glu Ala Ala Gly
130 135 140
Ala Leu Glu Pro Arg Val Ser Ala Leu Glu Glu Asp Ile Ala Asp Leu
145 150 155 160
Glu Glu Glu Leu Glu Ser Ala Val Glu Glu Ala Ile Gly Gln Arg Met
165 170 175
Asn Met Asp Val Asp Ala Gly Glu Ala Ala Ala Ala Ala Ser Thr Arg
180 185 190
Gly Ala Arg Glu Asp Gly Glu Ile Val Ala Ser Gly Ser Lys Arg Ser
195 200 205
Arg Glu His Asp Gly Val Arg Tyr Arg Glu Cys Asp Asp Ser Asp Gly
210 215 220
Asp Arg Tyr Val Arg Arg Arg Glu Arg Ser Arg Ser Arg Ser Arg Asp
225 230 235 240
Arg Val Pro Ala Arg Arg Asp Asp Ala Arg Pro Gly Val Leu Ala Ser
245 250 255
Gly Glu Glu Met Asp His Arg Glu Lys Lys Glu Lys Lys Glu Lys Lys
260 265 270
Glu Lys Arg Glu Lys Lys Glu Lys Thr Glu Met Asp Pro Glu Ile Ala
275 280 285
Glu Ala Asn Ala Ile Arg Ala Lys Leu Gly Leu Lys Pro Leu Arg Gly
290 295 300
<210>104
<211>630
<212>DNA
<213>Ostreococcus tauri
<400>104
atgccgtcgg tgatcgaaaa ccacggacga ccgatctgga ccccgttcgg gaacggcgcg 60
gcgacgagcg ggaagtcgca cggcgtcgag gaggtgcttc gacaaaacat tgcgcactca 120
gaatactttc gaaaacttcg tcgcgcggac gatctggggc gaccggcgta cgatttcatg 180
gcgctcgtgg atgagattta tgaattggtc gatcactgcg aaccgtggat gtgcgggaac 240
gcgcgggggg cgtcgacggg gttttgcatc ttgtttcaat tctgtgagat ggagctcagc 300
gacggcaacg tgtggcattt gttgaggcac ggagactcgc cgtttatccg agctttaggg 360
ttcctgtatg tacggtacgt gaagaacggg cgggagctct tgaagtggtg cgaggagttc 420
ttcggagacg aggaaaagtt taaaccgtcg ccggacggga aggaggtgac gatgggcgcg 480
ttcgttcgcg acttgctgct cgagcagagg tacttcgaaa ccatcctgcc gaggattcct 540
gaggttgcga ggagagagat cataaaggtc tcggttgcgg tggtcagggc gggagccgca 600
ttgctggtgg ttctcagccg cagtctgtga 630
<210>105
<211>209
<212>PRT
<213>Ostreococcus tauri
<400>105
Met Pro Ser Val Ile Glu Asn His Gly Arg Pro Ile Trp Thr Pro Phe
1 5 10 15
Gly Asn Gly Ala Ala Thr Ser Gly Lys Ser His Gly Val Glu Glu Val
20 25 30
Leu Arg Gln Asn Ile Ala His Ser Glu Tyr Phe Arg Lys Leu Arg Arg
35 40 45
Ala Asp Asp Leu Gly Arg Pro Ala Tyr Asp Phe Met Ala Leu Val Asp
50 55 60
Glu Ile Tyr Glu Leu Val Asp His Cys Glu Pro Trp Met Cys Gly Asn
65 70 75 80
Ala Arg Gly Ala Ser Thr Gly Phe Cys Ile Leu Phe Gln Phe Cys Glu
85 90 95
Met Glu Leu Ser Asp Gly Asn Val Trp His Leu Leu Arg His Gly Asp
100 105 110
Ser Pro Phe Ile Arg Ala Leu Gly Phe Leu Tyr Val Arg Tyr Val Lys
115 120 125
Asn Gly Arg Glu Leu Leu Lys Trp Cys Glu Glu Phe Phe Gly Asp Glu
130 135 140
Glu Lys Phe Lys Pro Ser Pro Asp Gly Lys Glu Val Thr Met Gly Ala
145 150 155 160
Phe Val Arg Asp Leu Leu Leu Glu Gln Arg Tyr Phe Glu Thr Ile Leu
165 170 175
Pro Arg Ile Pro Glu Val Ala Arg Arg Glu Ile Ile Lys Val Ser Val
180 185 190
Ala Val Val Arg Ala Gly Ala Ala Leu Leu Val Val Leu Ser Arg Ser
195 200 205
Leu
<210>106
<211>1409
<212>DNA
<213>Populus trichocarpa
<400>106
ctcaggagtg cttggattgt ttagtacagt atggagatac agacaaatgg gaaaccaata 60
gattcgctgt tagagaaggt cctttgtatg aacatactat catcagatta cttcaaggag 120
ctttaccgat taaagacata ccatgaagtg atagatgaaa tatacaatca agttgaccat 180
gttgagccat ggatgactgg caactgtcgt ggtccatcta cttccttttg ccttctatac 240
aagttcttca ccatgaaact cactgtcaaa caaatgcatg gtttgctaaa gcacaaggat 300
tctccttata tcagagcggt tgggttcctc tacctgagat atgctgggga tccaaagaca 360
ctgtggaatt ggtttgaacc atatattaaa gacgatgagg aattttctcc aggaactagt 420
ggaaggaaga cgacaatggg tgtttatgtg cgtgatttgc ttctcggaca gtactacttt 480
gatacccttt tcccccgtat tcctgttcct gtcatgcggc agatcacatc gaatctggag 540
aagttgaagc taccaacaaa aatctctggt tcgacagggg atggaaaccg tcatggatct 600
gatgatacgg cacgccggcc accatctgtg aaggcagcac tttcagtctc ttttggtcag 660
cgtgctcctc atcgtgcatc aaccagggat tcgtctcctg ttcgtcgcac aataccttcc 720
ccctcctatg acagaaccag tgatgattca cgaagtcgac tcggccagag tcgtgaatat 780
tctgataaag aatattcaga tcgggatcat gataggggta gggaaaggga ccaagaccat 840
gatcgggata gagagaggga cagggttcgg gatagggatc aggagagaga aagggaccgg 900
gatcgtgaaa gggattggga tcagagtcgg gacagagaca gggatcggga aagggataga 960
tacagaaggt atgattatga tagaagttcc aggtacactg atagggaaag cagaagggat 1020
tctgaacaga gcagccgtga cagaagtagg cattatagag aaagtagttc ttatagaagc 1080
cgcagtcgaa gcaggagcag gagcaggagc cgaagctcgc aagctggcgc atcaccattt 1140
gatcgccatc caactcctca aagggatgga aacaaggata agacatctgc gcctagcaat 1200
ctggctaagc tcaaagatct ttatggtgat cttagtgatc agaaagggga tgctggcctg 1260
gaaagggttc ctcggaggga taatgatggt gaagaggttt ttagactcgg tggttccact 1320
tggaggtagg tagttcaatc ttaaaacagt agcttcagtg gctgtcacat cagctgtatc 1380
aatgaggtct tcagccaggc caaatttct 1409
<210>107
<211>432
<212>PRT
<213>Populus trichocarpa
<400>107
Met Glu Ile Gln Thr Asn Gly Lys Pro Ile Asp Ser Leu Leu Glu Lys
1 5 10 15
Val Leu Cys Met Asn Ile Leu Ser Ser Asp Tyr Phe Lys Glu Leu Tyr
20 25 30
Arg Leu Lys Thr Tyr His Glu Val Ile Asp Glu Ile Tyr Asn Gln Val
35 40 45
Asp His Val Glu Pro Trp Met Thr Gly Asn Cys Arg Gly Pro Ser Thr
50 55 60
Ser Phe Cys Leu Leu Tyr Lys Phe Phe Thr Met Lys Leu Thr Val Lys
65 70 75 80
Gln Met His Gly Leu Leu Lys His Lys Asp Ser Pro Tyr Ile Arg Ala
85 90 95
Val Gly Phe Leu Tyr Leu Arg Tyr Ala Gly Asp Pro Lys Thr Leu Trp
100 105 110
Asn Trp Phe Glu Pro Tyr Ile Lys Asp Asp Glu Glu Phe Ser Pro Gly
115 120 125
Thr Ser Gly Arg Lys Thr Thr Met Gly Val Tyr Val Arg Asp Leu Leu
130 135 140
Leu Gly Gln Tyr Tyr Phe Asp Thr Leu Phe Pro Arg Ile Pro Val Pro
145 150 155 160
Val Met Arg Gln Ile Thr Ser Asn Leu Glu Lys Leu Lys Leu Pro Thr
165 170 175
Lys Ile Ser Gly Ser Thr Gly Asp Gly Asn Arg His Gly Ser Asp Asp
180 185 190
Thr Ala Arg Arg Pro Pro Ser Val Lys Ala Ala Leu Ser Val Ser Phe
195 200 205
Gly Gln Arg Ala Pro His Arg Ala Ser Thr Arg Asp Ser Ser Pro Val
210 215 220
Arg Arg Thr Ile Pro Ser Pro Ser Tyr Asp Arg Thr Ser Asp Asp Ser
225 230 235 240
Arg Ser Arg Leu Gly Gln Ser Arg Glu Tyr Ser Asp Lys Glu Tyr Ser
245 250 255
Asp Arg Asp His Asp Arg Gly Arg Glu Arg Asp Gln Asp His Asp Arg
260 265 270
Asp Arg Glu Arg Asp Arg Val Arg Asp Arg Asp Gln Glu Arg Glu Arg
275 280 285
Asp Arg Asp Arg Glu Arg Asp Trp Asp Gln Ser Arg Asp Arg Asp Arg
290 295 300
Asp Arg Glu Arg Asp Arg Tyr Arg Arg Tyr Asp Tyr Asp Arg Ser Ser
305 310 315 320
Arg Tyr Thr Asp Arg Glu Ser Arg Arg Asp Ser Glu Gln Ser Ser Arg
325 330 335
Asp Arg Ser Arg His Tyr Arg Glu Ser Ser Ser Tyr Arg Ser Arg Ser
340 345 350
Arg Ser Arg Ser Arg Ser Arg Ser Arg Ser Ser Gln Ala Gly Ala Ser
355 360 365
Pro Phe Asp Arg His Pro Thr Pro Gln Arg Asp Gly Asn Lys Asp Lys
370 375 380
Thr Ser Ala Pro Ser Asn Leu Ala Lys Leu Lys Asp Leu Tyr Gly Asp
385 390 395 400
Leu Ser Asp Gln Lys Gly Asp Ala Gly Leu Glu Arg Val Pro Arg Arg
405 410 415
Asp Asn Asp Gly Glu Glu Val Phe Arg Leu Gly Gly Ser Thr Trp Arg
420 425 430
<210>108
<211>1394
<212>DNA
<213>Populus trichocarpa
<400>108
acaggagtgc ttgcatttgt ttagtacatt atggaggtac agacaaatgg gaaaccgata 60
gattcactct ttgagaaggt cctttgtatg aacattctat catcggatta cttcaaggag 120
ctttaccgat taaagacgta ccatgaagtg attgatgaaa tatacaatca agttgacaat 180
gttgagccat ggatgactgg taactgtcgt ggcccatcta cgtccttttg ccttctgtac 240
aagttcttca ccatgaagct cactgtcaaa caaatgcatg gtctgctaaa gcacaaggat 300
tctccttata tcagagcggt tgggttcctt tacctgagat atgctggtga cccaaagaca 360
ctgtggaatt ggtttgaacc atatatcaaa gatgatgagg aattttctcc tggatctagt 420
ggaaggaaga caacaatagg catatatgtg cgtgatttac ttctcggaca gtactacttt 480
gatacccttt tcccccgtat tcctgttcct gtcttgcggc agatcacagc caatcttgag 540
atgatgaagc tacccacaaa aatttctggt tcaacagggg atggcaaccg tcatggatct 600
gatgatactg cacgtcgacc accatctgtg aaggctgcac tttcagtctc ttttggtcag 660
cgtgctcctc atcgtgcatc aactagggac tcatctcctg ttcgtcgcac gctaccgcca 720
ccctcctatg acagaaccag tgatgatcca cgaagtcatc gcagccagag tcgcgaatat 780
tctgataaag aatattcaga cagggatcgg gatcaagata ggggtagaga gagggaccgt 840
gatagggaca gagagaggga cagggttcgg gatagagatc atgatagaga aagagatcgg 900
gaccgtggca gggacagtga caggaaacag gaacgtgaga ggggtagaga ccgaaggtct 960
gattacgata ggagttccag gtacactgac agggagagca gaagggatta tgaacggagc 1020
agccgtgatg gaagtaggcg tcatagagaa agtaattata gaacccggag tcggagcagg 1080
agcagaagta gaagccaaag cttgcaagct ggcacatcac catttgatca gcatccaact 1140
cctcaaaggg atggaagcaa ggataggaca tctgcatcta gcaatctggc taagctcaaa 1200
gatctttatg gtgatcttgg tgatcagaaa ggggatgctg gcctggaaag gggtcctcgg 1260
agggacaatg atggtgaaga agtttttaga ctgggtggtt ctacttggag gtaggtagtt 1320
caatgaaagc tgtaactgtc tgatgtatca gtaaggtctt cagccatgcc aaatcactgt 1380
ttgagtcaag aaaa 1394
<210>109
<211>427
<212>PRT
<213>Populus trichocarpa
<400>109
Met Glu Val Gln Thr Asn Gly Lys Pro Ile Asp Ser Leu Phe Glu Lys
1 5 10 15
Val Leu Cys Met Asn Ile Leu Ser Ser Asp Tyr Phe Lys Glu Leu Tyr
20 25 30
Arg Leu Lys Thr Tyr His Glu Val Ile Asp Glu Ile Tyr Asn Gln Val
35 40 45
Asp Asn Val Glu Pro Trp Met Thr Gly Asn Cys Arg Gly Pro Ser Thr
50 55 60
Ser Phe Cys Leu Leu Tyr Lys Phe Phe Thr Met Lys Leu Thr Val Lys
65 70 75 80
Gln Met His Gly Leu Leu Lys His Lys Asp Ser Pro Tyr Ile Arg Ala
85 90 95
Val Gly Phe Leu Tyr Leu Arg Tyr Ala Gly Asp Pro Lys Thr Leu Trp
100 105 110
Asn Trp Phe Glu Pro Tyr Ile Lys Asp Asp Glu Glu Phe Ser Pro Gly
115 120 125
Ser Ser Gly Arg Lys Thr Thr Ile Gly Ile Tyr Val Arg Asp Leu Leu
130 135 140
Leu Gly Gln Tyr Tyr Phe Asp Thr Leu Phe Pro Arg Ile Pro Val Pro
145 150 155 160
Val Leu Arg Gln Ile Thr Ala Asn Leu Glu Met Met Lys Leu Pro Thr
165 170 175
Lys Ile Ser Gly Ser Thr Gly Asp Gly Asn Arg His Gly Ser Asp Asp
180 185 190
Thr Ala Arg Arg Pro Pro Ser Val Lys Ala Ala Leu Ser Val Ser Phe
195 200 205
Gly Gln Arg Ala Pro His Arg Ala Ser Thr Arg Asp Ser Ser Pro Val
210 215 220
Arg Arg Thr Leu Pro Pro Pro Ser Tyr Asp Arg Thr Ser Asp Asp Pro
225 230 235 240
Arg Ser His Arg Ser Gln Ser Arg Glu Tyr Ser Asp Lys Glu Tyr Ser
245 250 255
Asp Arg Asp Arg Asp Gln Asp Arg Gly Arg Glu Arg Asp Arg Asp Arg
260 265 270
Asp Arg Glu Arg Asp Arg Val Arg Asp Arg Asp His Asp Arg Glu Arg
275 280 285
Asp Arg Asp Arg Gly Arg Asp Ser Asp Arg Lys Gln Glu Arg Glu Arg
290 295 300
Gly Arg Asp Arg Arg Ser Asp Tyr Asp Arg Ser Ser Arg Tyr Thr Asp
305 310 315 320
Arg Glu Ser Arg Arg Asp Tyr Glu Arg Ser Ser Arg Asp Gly Ser Arg
325 330 335
Arg His Arg Glu Ser Asn Tyr Arg Thr Arg Ser Arg Ser Arg Ser Arg
340 345 350
Ser Arg Ser Gln Ser Leu Gln Ala Gly Thr Ser Pro Phe Asp Gln His
355 360 365
Pro Thr Pro Gln Arg Asp Gly Ser Lys Asp Arg Thr Ser Ala Ser Ser
370 375 380
Asn Leu Ala Lys Leu Lys Asp Leu Tyr Gly Asp Leu Gly Asp Gln Lys
385 390 395 400
Gly Asp Ala Gly Leu Glu Arg Gly Pro Arg Arg Asp Asn Asp Gly Glu
405 410 415
Glu Val Phe Arg Leu Gly Gly Ser Thr Trp Arg
420 425
<210>110
<211>1281
<212>DNA
<213>Saccharum officinarum
<220>
<221>misc_feature
<222>(1140)..(1141)
<223>n is a, c, g, or t
<220>
<221>misc_feature
<222>(1220)..(1220)
<223>n is a, c, g, or t
<400>110
cgggctcctt tcctccgcct ctcgtctagt ccctctccta ccgaaccctc ccaatccaaa 60
gcgattcctc cgggccgccg ccgtcccgat cgcgccgcgc cgccggggcg gcgtctcccc 120
ggctgccctt agctcagctc gcgttccggt ggaacagaga agagaggtgg ggggaatgga 180
gatccagtcg tctggccggc ccatcgaggg gctgatggag aaggtgctgt ccgtgaacat 240
cctctcctcg gactacttca aggagctctt caagtacaag acctaccacg aggtggtcga 300
cgagatctac aaccaggtgg accacgtcga gccctggatg accggcaact gccgcgggcc 360
ctcctccgcc ttctgcctcc tctacaagtt cttcaccatg aagctcaccg tcaagcagat 420
gcacgggctg ctcaagcacc aggactcccc ctacatcaga gctattggat tcctctacct 480
gcgatatgtt gcagaaccga agacgctgtg gacttggtat gaaccctata tcaaggatga 540
cgaggagttt gcccctggat caaatggtaa aatgactaca atgggcgttt atgtgcgtga 600
tctcctcctt ggtcagtact atttcgacag tcttcttcca cgagtgcctc tcccaattct 660
ccgacaggtc actagccatc ttgagaagct gaagcttcca acaaagcagt caggaatgac 720
tggggattcc aataggcttg aatcaaatga tactgccaga aggcctcctt ccgtaaaggc 780
ttctttgtct gtctcttttg gtcagcgtgc tccacaccgt gcatacacaa gggattcttc 840
cccagtccga agaacattac cttccaaaca ggacaaggaa agaagttatg atggtgacca 900
tgcaaaatcg ccaccaagga agcgcagaag tcagagctct gagcgccatc atgactcaga 960
gagggaccgt tcaaatcgtg atcgtggcaa gtacaaggat agggagcatg atcgttatgc 1020
tcgtgatcac agagaccggg atcatcatcg gcagagttat tcagataggg atgacgaaag 1080
gcgaggccgt gaaaagaggg acagggattc tgaccgaaag agatattcaa gctcccgcan 1140
ngagcaggag tccagtccgt ggcagaactg acggcgacaa acatcgctcc agcccaattt 1200
gtagggcacc agaatcattn cacctggcaa agctaaagga tttatacggt gatgcaacaa 1260
acacaaagaa tgatgcttgc g 1281
<210>111
<211>331
<212>PRT
<213>Saccharum officinarum
<220>
<221>misc_feature
<222>(322)..(322)
<223>Xaa can be any naturally occurring amino acid
<400>111
Met Glu Ile Gln Ser Ser Gly Arg Pro Ile Glu Gly Leu Met Glu Lys
1 5 10 15
Val Leu Ser Val Asn Ile Leu Ser Ser Asp Tyr Phe Lys Glu Leu Phe
20 25 30
Lys Tyr Lys Thr Tyr His Glu Val Val Asp Glu Ile Tyr Asn Gln Val
35 40 45
Asp His Val Glu Pro Trp Met Thr Gly Asn Cys Arg Gly Pro Ser Ser
50 55 60
Ala Phe Cys Leu Leu Tyr Lys Phe Phe Thr Met Lys Leu Thr Val Lys
65 70 75 80
Gln Met His Gly Leu Leu Lys His Gln Asp Ser Pro Tyr Ile Arg Ala
85 90 95
Ile Gly Phe Leu Tyr Leu Arg Tyr Val Ala Glu Pro Lys Thr Leu Trp
100 105 110
Thr Trp Tyr Glu Pro Tyr Ile Lys Asp Asp Glu Glu Phe Ala Pro Gly
115 120 125
Ser Asn Gly Lys Met Thr Thr Met Gly Val Tyr Val Arg Asp Leu Leu
130 135 140
Leu Gly Gln Tyr Tyr Phe Asp Ser Leu Leu Pro Arg Val Pro Leu Pro
145 150 155 160
Ile Leu Arg Gln Val Thr Ser His Leu Glu Lys Leu Lys Leu Pro Thr
165 170 175
Lys Gln Ser Gly Met Thr Gly Asp Ser Asn Arg Leu Glu Ser Asn Asp
180 185 190
Thr Ala Arg Arg Pro Pro Ser Val Lys Ala Ser Leu Ser Val Ser Phe
195 200 205
Gly Gln Arg Ala Pro His Arg Ala Tyr Thr Arg Asp Ser Ser Pro Val
210 215 220
Arg Arg Thr Leu Pro Ser Lys Gln Asp Lys Glu Arg Ser Tyr Asp Gly
225 230 235 240
Asp His Ala Lys Ser Pro Pro Arg Lys Arg Arg Ser Gln Ser Ser Glu
245 250 255
Arg His His Asp Ser Glu Arg Asp Arg Ser Asn Arg Asp Arg Gly Lys
260 265 270
Tyr Lys Asp Arg Glu His Asp Arg Tyr Ala Arg Asp His Arg Asp Arg
275 280 285
Asp His His Arg Gln Ser Tyr Ser Asp Arg Asp Asp Glu Arg Arg Gly
290 295 300
Arg Glu Lys Arg Asp Arg Asp Ser Asp Arg Lys Arg Tyr Ser Ser Ser
305 310 315 320
Arg Xaa Glu Gln Glu Ser Ser Pro Trp Gln Asn
325 330
<210>112
<211>1179
<212>DNA
<213>Saccharum officinarum
<400>112
atggagatcc agtcatctgc ccggcccatc gaggggctga tggagaaggt gctgtccgtg 60
aacatcctct cctcggacta cttcaaggag ctcttcaagt acaagaccta ccacgaggtg 120
gtcgacgaga tctacaacca ggtggaccac gtcgagccct ggatgaccgg caactgccgc 180
gggccctcct ccgccttctg cctcctctac aagttcttca ccatgaagct caccgtcaag 240
cagatgcacg ggctgctcaa gcatcaggac tccccctaca tcagagctat tggattcctc 300
tacctgcgat atgttgcaga accgaagacg ctgtggactt ggtatgaacc ctatatcaag 360
gatgacgagg agtttgcccc tggatcaaat ggtaaattga ctacaatggg cgtttatgtg 420
cgtgatctcc tccttggtca gtactatttc gacagtcttc ttccacgagt gcctctccca 480
attctccgac aggtcactag ccatcttgag aagctgaagc ttccaacaaa gcagtcagga 540
atgactgggg attccaatag gcttgaatca aatgatactg ccagaaggcc tccttccgta 600
aaggcttctt tgtctgtctc ttttggtcag cgtgctccac accgtgcatc cacaagggat 660
tcttccccag tccgaagaac attaccttcc aaacaggaca aagaaagaag ttatgatggt 720
gaccatgcaa aatcgccacc aaggaagcgc agaagtcaga gctctgagcg tcatcatgac 780
tctgagaggg accgttcaga tcgtgatcgt ggcaggtaca agggtaggga gcatgatcgt 840
tatgctcgtg atcacagaga ccgggatcat catcggcaga gttattcaga tagggatgac 900
gaaaggcgag gccgtgaaaa gagggacagg gattctgacc acaatagaca ttcaagctcc 960
cgcaggagca ggagcaggag tccagtccgt ggcagaactg acggtgacaa gcatcgctcc 1020
agcccatttg gtagggcacc agaatcatcc aacctggcaa agctaaagga tttatacggt 1080
gatgcaacaa acacaaagaa tgatgctggc gatgatagag ctcacaggga ttctggaacg 1140
gaagaggtaa tccgactggg aggggcaagg tggaggtga 1179
<210>113
<211>392
<212>PRT
<213>Saccharum officinarum
<400>113
Met Glu Ile Gln Ser Ser Ala Arg Pro Ile Glu Gly Leu Met Glu Lys
1 5 10 15
Val Leu Ser Val Asn Ile Leu Ser Ser Asp Tyr Phe Lys Glu Leu Phe
20 25 30
Lys Tyr Lys Thr Tyr His Glu Val Val Asp Glu Ile Tyr Asn Gln Val
35 40 45
Asp His Val Glu Pro Trp Met Thr Gly Asn Cys Arg Gly Pro Ser Ser
50 55 60
Ala Phe Cys Leu Leu Tyr Lys Phe Phe Thr Met Lys Leu Thr Val Lys
65 70 75 80
Gln Met His Gly Leu Leu Lys His Gln Asp Ser Pro Tyr Ile Arg Ala
85 90 95
Ile Gly Phe Leu Tyr Leu Arg Tyr Val Ala Glu Pro Lys Thr Leu Trp
100 105 110
Thr Trp Tyr Glu Pro Tyr Ile Lys Asp Asp Glu Glu Phe Ala Pro Gly
115 120 125
Ser Asn Gly Lys Leu Thr Thr Met Gly Val Tyr Val Arg Asp Leu Leu
130 135 140
Leu Gly Gln Tyr Tyr Phe Asp Ser Leu Leu Pro Arg Val Pro Leu Pro
145 150 155 160
Ile Leu Arg Gln Val Thr Ser His Leu Glu Lys Leu Lys Leu Pro Thr
165 170 175
Lys Gln Ser Gly Met Thr Gly Asp Ser Asn Arg Leu Glu Ser Asn Asp
180 185 190
Thr Ala Arg Arg Pro Pro Ser Val Lys Ala Ser Leu Ser Val Ser Phe
195 200 205
Gly Gln Arg Ala Pro His Arg Ala Ser Thr Arg Asp Ser Ser Pro Val
210 215 220
Arg Arg Thr Leu Pro Ser Lys Gln Asp Lys Glu Arg Ser Tyr Asp Gly
225 230 235 240
Asp His Ala Lys Ser Pro Pro Arg Lys Arg Arg Ser Gln Ser Ser Glu
245 250 255
Arg His His Asp Ser Glu Arg Asp Arg Ser Asp Arg Asp Arg Gly Arg
260 265 270
Tyr Lys Gly Arg Glu His Asp Arg Tyr Ala Arg Asp His Arg Asp Arg
275 280 285
Asp His His Arg Gln Ser Tyr Ser Asp Arg Asp Asp Glu Arg Arg Gly
290 295 300
Arg Glu Lys Arg Asp Arg Asp Ser Asp His Asn Arg His Ser Ser Ser
305 310 315 320
Arg Arg Ser Arg Ser Arg Ser Pro Val Arg Gly Arg Thr Asp Gly Asp
325 330 335
Lys His Arg Ser Ser Pro Phe Gly Arg Ala Pro Glu Ser Ser Asn Leu
340 345 350
Ala Lys Leu Lys Asp Leu Tyr Gly Asp Ala Thr Asn Thr Lys Asn Asp
355 360 365
Ala Gly Asp Asp Arg Ala His Arg Asp Ser Gly Thr Glu Glu Val Ile
370 375 380
Arg Leu Gly Gly Ala Arg Trp Arg
385 390
<210>114
<211>729
<212>DNA
<213>Saccharomyces cerevisiae
<400>114
atggctgtca atgaatttca agtggagtct aacatctctc caaaacaact gaataaccag 60
tcagtgtcac ttgttattcc tcggttgaca agagataaaa ttcataattc aatgtactat 120
aaagtaaatc taagcaacga atctttgaga ggcaatacaa tggtagagct tttgaaagtt 180
atgattggcg catttggtac cataaaaggt caaaatggtc atttacacat gatggttctc 240
ggtggcattg agtttaaatg catcttaatg aagttaatcg aaatcagacc gaatttccag 300
cagttgaact tcttattgaa tgtaaaaaat gagaacggtt ttgactcgaa atatattatt 360
gctttgcttc tggtttatgc gcggttacag tattattatt tgaatggcaa taacaaaaac 420
gatgatgatg aaaatgattt gataaagtta tttaaagtac aattatacaa atattcacag 480
cattatttca aactaaaaag tttcccacta caagtagact gctttgctca ctcctataac 540
gaagaacttt gtataataca cattgatgaa ttagtcgatt ggttggccac acaggaccat 600
atctggggta ttccattagg gaaatgtcaa tggaataaaa tatacaactc tgatgaagag 660
agtagttcta gcgaaagcga aagtaatggt gacagtgaag atgacaacga caccagcagc 720
gaatcatag 729
<210>115
<211>242
<212>PRT
<213>Saccharomyces cerevisiae
<400>115
Met Ala Val Asn Glu Phe Gln Val Glu Ser Asn Ile Ser Pro Lys Gln
1 5 10 15
Leu Asn Asn Gln Ser Val Ser Leu Val Ile Pro Arg Leu Thr Arg Asp
20 25 30
Lys Ile His Asn Ser Met Tyr Tyr Lys Val Asn Leu Ser Asn Glu Ser
35 40 45
Leu Arg Gly Asn Thr Met Val Glu Leu Leu Lys Val Met Ile Gly Ala
50 55 60
Phe Gly Thr Ile Lys Gly Gln Asn Gly His Leu His Met Met Val Leu
65 70 75 80
Gly Gly Ile Glu Phe Lys Cys Ile Leu Met Lys Leu Ile Glu Ile Arg
85 90 95
Pro Asn Phe Gln Gln Leu Asn Phe Leu Leu Asn Val Lys Asn Glu Asn
100 105 110
Gly Phe Asp Ser Lys Tyr Ile Ile Ala Leu Leu Leu Val Tyr Ala Arg
115 120 125
Leu Gln Tyr Tyr Tyr Leu Asn Gly Asn Asn Lys Asn Asp Asp Asp Glu
130 135 140
Asn Asp Leu Ile Lys Leu Phe Lys Val Gln Leu Tyr Lys Tyr Ser Gln
145 150 155 160
His Tyr Phe Lys Leu Lys Ser Phe Pro Leu Gln Val Asp Cys Phe Ala
165 170 175
His Ser Tyr Asn Glu Glu Leu Cys Ile Ile His Ile Asp Glu Leu Val
180 185 190
Asp Trp Leu Ala Thr Gln Asp His Ile Trp Gly Ile Pro Leu Gly Lys
195 200 205
Cys Gln Trp Asn Lys Ile Tyr Asn Ser Asp Glu Glu Ser Ser Ser Ser
210 215 220
Glu Ser Glu Ser Asn Gly Asp Ser Glu Asp Asp Asn Asp Thr Ser Ser
225 230 235 240
Glu Ser
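The coding sequence SEQ ID NO: 114 and the polypeptide SEQ ID NO: 115 can be cross-checked by translation. The following is a minimal sketch, assuming the standard genetic code and reading frame 1; the helper name `translate` and the lookup tables are illustrative and are not part of the listing.

```python
# Standard genetic code, built from the conventional T, C, A, G codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# One-letter to three-letter amino acid codes, as used in the listing.
ONE_TO_THREE = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(dna: str) -> str:
    """Translate a listing-style DNA line (spaces and position numbers are
    ignored) in frame 1, stopping at the first stop codon."""
    seq = "".join(ch for ch in dna.upper() if ch in "ACGT")
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the open reading frame
            break
        residues.append(ONE_TO_THREE[aa])
    return " ".join(residues)


# First line of SEQ ID NO: 114, copied as printed (including the position count).
first_line = "atggctgtca atgaatttca agtggagtct aacatctctc caaaacaact gaataaccag 60"
print(translate(first_line))
```

Under these assumptions the first 60 nucleotides of SEQ ID NO: 114 translate to the first 20 residues of SEQ ID NO: 115, and the final line `gaatcatag` gives `Glu Ser` followed by a stop codon.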
<210>116
<211>1182
<212>DNA
<213>Triticum aestivum
<400>116
atggcgaacc gcacggaccc ccgggcccgg agcatccacg gcaccaaccc tcagaacctg 60
gtggagaaga tcgtgcgggc caagatctac cagagcaact actggaagga gcagtgcttc 120
ggcctcacgg cggagaccct cgtcgacaag gccatggagc tcgactacac cggcggcacc 180
cacggcggca accgcaggcc gacccccttc ctctgcctcg ctctcaagat gctccagatc 240
cagcccgaca aggaaatcgt cgtcgagttc atcaaggacg aggactacaa gtatgtccgg 300
gttcttgggg ccttctacct gcgcctcact ggcaccgtcg ccgacgttta ccagtacctc 360
gagccgctct acaacgacta ccgcaagatt aggcaaaagc tcagcgatgg aaaattcacg 420
ctgacacacg ttgacgaatt cattgacgag ctcctgacca aggactatag ctgcggcact 480
gccctccccc gcattcagaa aagatggatc cttgaagctt ctggaactct agaacctaga 540
agaagtgcac ttgaagacga ttttgaggaa gaggaggaag ataaggagga tggacagcct 600
atggacgtag atgagcctaa cactcatgaa aaggaccatc ttcgtggaag aagccccacc 660
aaagaacgcg acagggaaag ggagagggac agagacagga aacacgaaag gcatcacagg 720
gaccgagatc atgacagaga tcgggatcac gacagggact atggaagagg ccgggaaaga 780
gatcgagaca gagatagagg ccgtgaaaga gatagagaga gggataggga acgagaccgt 840
caccgcatcc gagatgacga ctaccaccga gatcgagacc gggatggcag ggaaagggaa 900
cgccgggaca gagaccgtgg caggcacagg agccgctcag ggagcagaag ccgggatcgg 960
cgtgaaagag accgtgaagt gggagagctc cgtaagaggc gtggccgtgg tagtgccagt 1020
cctcctcggg ggcgtgccga ggatggtccg agggaggagc ctaagaagag aaaggaaaag 1080
aaagagaaga agggcagcgg gaacggtcca gatcctaatg atccagagat tatagagatg 1140
aacaagctgc gtgcatcgat agggttggga ccactgaagt ag 1182
<210>117
<211>393
<212>PRT
<213>Triticum aestivum
<400>117
Met Ala Asn Arg Thr Asp Pro Arg Ala Arg Ser Ile His Gly Thr Asn
1 5 10 15
Pro Gln Asn Leu Val Glu Lys Ile Val Arg Ala Lys Ile Tyr Gln Ser
20 25 30
Asn Tyr Trp Lys Glu Gln Cys Phe Gly Leu Thr Ala Glu Thr Leu Val
35 40 45
Asp Lys Ala Met Glu Leu Asp Tyr Thr Gly Gly Thr His Gly Gly Asn
50 55 60
Arg Arg Pro Thr Pro Phe Leu Cys Leu Ala Leu Lys Met Leu Gln Ile
65 70 75 80
Gln Pro Asp Lys Glu Ile Val Val Glu Phe Ile Lys Asp Glu Asp Tyr
85 90 95
Lys Tyr Val Arg Val Leu Gly Ala Phe Tyr Leu Arg Leu Thr Gly Thr
100 105 110
Val Ala Asp Val Tyr Gln Tyr Leu Glu Pro Leu Tyr Asn Asp Tyr Arg
115 120 125
Lys Ile Arg Gln Lys Leu Ser Asp Gly Lys Phe Thr Leu Thr His Val
130 135 140
Asp Glu Phe Ile Asp Glu Leu Leu Thr Lys Asp Tyr Ser Cys Gly Thr
145 150 155 160
Ala Leu Pro Arg Ile Gln Lys Arg Trp Ile Leu Glu Ala Ser Gly Thr
165 170 175
Leu Glu Pro Arg Arg Ser Ala Leu Glu Asp Asp Phe Glu Glu Glu Glu
180 185 190
Glu Asp Lys Glu Asp Gly Gln Pro Met Asp Val Asp Glu Pro Asn Thr
195 200 205
His Glu Lys Asp His Leu Arg Gly Arg Ser Pro Thr Lys Glu Arg Asp
210 215 220
Arg Glu Arg Glu Arg Asp Arg Asp Arg Lys His Glu Arg His His Arg
225 230 235 240
Asp Arg Asp His Asp Arg Asp Arg Asp His Asp Arg Asp Tyr Gly Arg
245 250 255
Gly Arg Glu Arg Asp Arg Asp Arg Asp Arg Gly Arg Glu Arg Asp Arg
260 265 270
Glu Arg Asp Arg Glu Arg Asp Arg His Arg Ile Arg Asp Asp Asp Tyr
275 280 285
His Arg Asp Arg Asp Arg Asp Gly Arg Glu Arg Glu Arg Arg Asp Arg
290 295 300
Asp Arg Gly Arg His Arg Ser Arg Ser Gly Ser Arg Ser Arg Asp Arg
305 310 315 320
Arg Glu Arg Asp Arg Glu Val Gly Glu Leu Arg Lys Arg Arg Gly Arg
325 330 335
Gly Ser Ala Ser Pro Pro Arg Gly Arg Ala Glu Asp Gly Pro Arg Glu
340 345 350
Glu Pro Lys Lys Arg Lys Glu Lys Lys Glu Lys Lys Gly Ser Gly Asn
355 360 365
Gly Pro Asp Pro Asn Asp Pro Glu Ile Ile Glu Met Asn Lys Leu Arg
370 375 380
Ala Ser Ile Gly Leu Gly Pro Leu Lys
385 390
<210>118
<211>1119
<212>DNA
<213>Vitis vinifera
<400>118
atggcgaacc gtacggaccc agcggcgaag agcatacgag gcacgaatcc gcaaaacttg 60
gtggagaaga ttctgaggtc gaagatttac cagaacacgt actggaagga gcagtgcttt 120
ggattgaccg cggagactct ggttgacaag gccatggagc tcgaccacct cggcggcacc 180
tttggtggta accgcaagcc cacgcccttc atgtgcctcg tcatgaaaat gctccagatc 240
cagcccgaga aggacatcgt cgtcgagttc ataaaaaacg aagagtacaa atatgtccga 300
atacttggtg cattttattt gcgtcttaca gggatagata ctgatgtgta ccaataccta 360
gagcctctat acaatgacta tcggaaattg aggagaaaat tatctgatgg aaattattct 420
ttgacacacg ttgatgaggt tatcgatgaa cttctgacaa aagattattc ctgtgacgtt 480
gccttgcccc gtatcaagaa aagatggact cttgaatccc ttggtacact ggaaccaaga 540
agaagtgctt tggaagatga ttttgaggaa gaggaagaaa aagaagagga tgaccaactc 600
atggatgaat tagatgttgg ggctcatgaa aaggattatt atcgtgggcg aagccctgca 660
agggagagag atagggatag aaagcgtgac agtcacagat atagagatcg agattatgat 720
agggaacgtg gaagaggacg agaaagagat cgggaaaggg aaagggacag agacagctat 780
agagacaggg agagggagag ggacagagac agggaccgct atcgtctgag agatgataaa 840
gaatatggtc gtgacaggga gagggaaagg gagagggaag gcagggagag ggagaggcga 900
gacagggacc gagccattca aggagccgaa gtaggagcaa ggatcaggca tgctcgcagc 960
agcaccaaca tgccagagga tggaaccact cgagaagagc caaggaagaa gaaagaaaag 1020
aaggagaaga aggatgatgg cactgaccac ccagatccag agattgcaga agcaaacagg 1080
ttgcgggcat cccttgggct taaacccttg aaactctaa 1119
<210>119
<211>372
<212>PRT
<213>Vitis vinifera
<400>119
Met Ala Asn Arg Thr Asp Pro Ala Ala Lys Ser Ile Arg Gly Thr Asn
1 5 10 15
Pro Gln Asn Leu Val Glu Lys Ile Leu Arg Ser Lys Ile Tyr Gln Asn
20 25 30
Thr Tyr Trp Lys Glu Gln Cys Phe Gly Leu Thr Ala Glu Thr Leu Val
35 40 45
Asp Lys Ala Met Glu Leu Asp His Leu Gly Gly Thr Phe Gly Gly Asn
50 55 60
Arg Lys Pro Thr Pro Phe Met Cys Leu Val Met Lys Met Leu Gln Ile
65 70 75 80
Gln Pro Glu Lys Asp Ile Val Val Glu Phe Ile Lys Asn Glu Glu Tyr
85 90 95
Lys Tyr Val Arg Ile Leu Gly Ala Phe Tyr Leu Arg Leu Thr Gly Ile
100 105 110
Asp Thr Asp Val Tyr Gln Tyr Leu Glu Pro Leu Tyr Asn Asp Tyr Arg
115 120 125
Lys Leu Arg Arg Lys Leu Ser Asp Gly Asn Tyr Ser Leu Thr His Val
130 135 140
Asp Glu Val Ile Asp Glu Leu Leu Thr Lys Asp Tyr Ser Cys Asp Val
145 150 155 160
Ala Leu Pro Arg Ile Lys Lys Arg Trp Thr Leu Glu Ser Leu Gly Thr
165 170 175
Leu Glu Pro Arg Arg Ser Ala Leu Glu Asp Asp Phe Glu Glu Glu Glu
180 185 190
Glu Lys Glu Glu Asp Asp Gln Leu Met Asp Glu Leu Asp Val Gly Ala
195 200 205
His Glu Lys Asp Tyr Tyr Arg Gly Arg Ser Pro Ala Arg Glu Arg Asp
210 215 220
Arg Asp Arg Lys Arg Asp Ser His Arg Tyr Arg Asp Arg Asp Tyr Asp
225 230 235 240
Arg Glu Arg Gly Arg Gly Arg Glu Arg Asp Arg Glu Arg Glu Arg Asp
245 250 255
Arg Asp Ser Tyr Arg Asp Arg Glu Arg Glu Arg Asp Arg Asp Arg Asp
260 265 270
Arg Tyr Arg Leu Arg Asp Asp Lys Glu Tyr Gly Arg Asp Arg Glu Arg
275 280 285
Glu Arg Glu Arg Glu Gly Arg Glu Arg Glu Arg Arg Asp Arg Asp Arg
290 295 300
Ala Ile Gln Gly Ala Glu Val Gly Ala Arg Ile Arg His Ala Arg Ser
305 310 315 320
Ser Thr Asn Met Pro Glu Asp Gly Thr Thr Arg Glu Glu Pro Arg Lys
325 330 335
Lys Lys Glu Lys Lys Glu Lys Lys Asp Asp Gly Thr Asp His Pro Asp
340 345 350
Pro Glu Ile Ala Glu Ala Asn Arg Leu Arg Ala Ser Leu Gly Leu Lys
355 360 365
Pro Leu Lys Leu
370
<210>120
<211>33
<212>PRT
<213>Artificial sequence
<220>
<223>Motif I
<400>120
Arg Arg Pro Pro Ser Val Lys Ala Ser Leu Ser Val Ser Phe Gly Gln
1 5 10 15
Arg Ala Pro His Arg Ala Ser Thr Arg Asp Ser Ser Pro Val Arg Arg
20 25 30
Thr
<210>121
<211>14
<212>PRT
<213>Artificial sequence
<220>
<223>Motif II
<220>
<221>VARIANT
<222>(7)..(7)
<223>/replace="Val"
<400>121
Ser Pro Tyr Ile Arg Ala Ile Gly Phe Leu Tyr Leu Arg Tyr
1 5 10
<210>122
<211>8
<212>PRT
<213>Artificial sequence
<220>
<223>Motif III
<400>122
Lys Leu Lys Asp Leu Tyr Gly Asp
1 5
<210>123
<211>6
<212>PRT
<213>Artificial sequence
<220>
<223>Motif IV
<220>
<221>VARIANT
<222>(1)..(1)
<223>/replace="Asn"
<220>
<221>UNSURE
<222>(2)..(2)
<223>Xaa can be any naturally occurring amino acid
<220>
<221>VARIANT
<222>(4)..(4)
<223>/replace="Asn"
<400>123
Leu Xaa Glu Lys Val Leu
1 5
<210>124
<211>5
<212>PRT
<213>Artificial sequence
<220>
<223>Motif V
<400>124
Leu Val Glu Lys Ile
1 5
<210>125
<211>55
<212>DNA
<213>Artificial sequence
<220>
<223>Primer 1
<400>125
ggggacaagt ttgtacaaaa aagcaggctt aaacaatggc ggagatacag tcaaa 55
<210>126
<211>48
<212>DNA
<213>Artificial sequence
<220>
<223>Primer 2
<400>126
ggggaccact ttgtacaaga aagctgggtt cacctccaag aggaacca 48
<210>127
<211>2194
<212>DNA
<213>Oryza sativa
<400>127
aatccgaaaa gtttctgcac cgttttcacc ccctaactaa caatataggg aacgtgtgct 60
aaatataaaa tgagacctta tatatgtagc gctgataact agaactatgc aagaaaaact 120
catccaccta ctttagtggc aatcgggcta aataaaaaag agtcgctaca ctagtttcgt 180
tttccttagt aattaagtgg gaaaatgaaa tcattattgc ttagaatata cgttcacatc 240
tctgtcatga agttaaatta ttcgaggtag ccataattgt catcaaactc ttcttgaata 300
aaaaaatctt tctagctgaa ctcaatgggt aaagagagag atttttttta aaaaaataga 360
atgaagatat tctgaacgta ttggcaaaga tttaaacata taattatata attttatagt 420
ttgtgcattc gtcatatcgc acatcattaa ggacatgtct tactccatcc caatttttat 480
ttagtaatta aagacaattg acttattttt attatttatc ttttttcgat tagatgcaag 540
gtacttacgc acacactttg tgctcatgtg catgtgtgag tgcacctcct caatacacgt 600
tcaactagca acacatctct aatatcactc gcctatttaa tacatttagg tagcaatatc 660
tgaattcaag cactccacca tcaccagacc acttttaata atatctaaaa tacaaaaaat 720
aattttacag aatagcatga aaagtatgaa acgaactatt taggtttttc acatacaaaa 780
aaaaaaagaa ttttgctcgt gcgcgagcgc caatctccca tattgggcac acaggcaaca 840
acagagtggc tgcccacaga acaacccaca aaaaacgatg atctaacgga ggacagcaag 900
tccgcaacaa ccttttaaca gcaggctttg cggccaggag agaggaggag aggcaaagaa 960
aaccaagcat cctccttctc ccatctataa attcctcccc ccttttcccc tctctatata 1020
ggaggcatcc aagccaagaa gagggagagc accaaggaca cgcgactagc agaagccgag 1080
cgaccgcctt ctcgatccat atcttccggt cgagttcttg gtcgatctct tccctcctcc 1140
acctcctcct cacagggtat gtgcctccct tcggttgttc ttggatttat tgttctaggt 1200
tgtgtagtac gggcgttgat gttaggaaag gggatctgta tctgtgatga ttcctgttct 1260
tggatttggg atagaggggt tcttgatgtt gcatgttatc ggttcggttt gattagtagt 1320
atggttttca atcgtctgga gagctctatg gaaatgaaat ggtttaggga tcggaatctt 1380
gcgattttgt gagtaccttt tgtttgaggt aaaatcagag caccggtgat tttgcttggt 1440
gtaataaagt acggttgttt ggtcctcgat tctggtagtg atgcttctcg atttgacgaa 1500
gctatccttt gtttattccc tattgaacaa aaataatcca actttgaaga cggtcccgtt 1560
gatgagattg aatgattgat tcttaagcct gtccaaaatt tcgcagctgg cttgtttaga 1620
tacagtagtc cccatcacga aattcatgga aacagttata atcctcagga acaggggatt 1680
ccctgttctt ccgatttgct ttagtcccag aatttttttt cccaaatatc ttaaaaagtc 1740
actttctggt tcagttcaat gaattgattg ctacaaataa tgcttttata gcgttatcct 1800
agctgtagtt cagttaatag gtaatacccc tatagtttag tcaggagaag aacttatccg 1860
atttctgatc tccattttta attatatgaa atgaactgta gcataagcag tattcatttg 1920
gattattttt tttattagct ctcacccctt cattattctg agctgaaagt ctggcatgaa 1980
ctgtcctcaa ttttgttttc aaattcacat cgattatcta tgcattatcc tcttgtatct 2040
acctgtagaa gtttcttttt ggttattcct tgactgcttg attacagaaa gaaatttatg 2100
aagctgtaat cgggatagtt atactgcttg ttcttatgat tcatttcctt tgtgcagttc 2160
ttggtgtagc ttgccacttt caccagcaaa gttc 2194
<210>128
<211>1062
<212>DNA
<213>Oryza sativa
<400>128
atgtctacta tctacatgag ccagctacct gctactctcc ctctaatgga gggggatcag 60
gatcaggggc tctacccagc cttccataga gcaaaggacc ctcctatctt gttccctttc 120
atgatcgaca gcgccgtcga gcaccaaggg caaatctatg gagatcaggg cttgaggagg 180
cagcaggttt tgggtgaatc caatcaacag ttcaatgatc acatgatgat gggcggatca 240
gatgtcttcc tcacaccgtc tccgttccga ccaaccatcc aaagcatcgg cagcgacatg 300
atccagcgat catcttatga tccatacgat atcgagagta acaacaagca gcatgccaat 360
ggatcaacca gcaagtggat gtcgacgccg ccaatgaaga tgaggatcat aaggaagggg 420
gcggcaaccg atcctgaggg cggggcggtg agaaagccaa ggagaagagc acaagcgcac 480
caggatgaga gccagcaaca actgcagcaa gctttgggtg tcgttagagt gtgctcggac 540
tgcaacacca ccaagacccc cttgtggaga agtggtcctt gtggccccaa gtccctttgc 600
aacgcgtgtg gcatcaggca aaggaaggcg cggcgggcga tggccgctgc tgccaacggc 660
ggagcggcgg tggcgccggc aaagagcgtg gccgcggcgc cggtgaacaa taagccggcg 720
gcgaagaagg agaagagggc ggcggacgtc gaccggtcgc tgccgttcaa gaaacggtgc 780
aagatggtcg atcacgttgc tgctgccgtc gctgccacca agcccacggc tgctggagaa 840
gtagtggccg ccgctccgaa ggaccaagat cacgtcatcg tcgtcggtgg cgagaacgcc 900
gccgccacct ccatgccggc acagaacccg atatccaagg cggcggcgac cgccgctgcc 960
gccgccgcct ctccggcgtt cttccacggc ctccctcgcg acgagatcac cgacgccgcc 1020
atgctgctca tgaccctatc ctgtggcctc gtccacagct ag 1062
<210>129
<211>353
<212>PRT
<213>Oryza sativa
<400>129
Met Ser Thr Ile Tyr Met Ser Gln Leu Pro Ala Thr Leu Pro Leu Met
1 5 10 15
Glu Gly Asp Gln Asp Gln Gly Leu Tyr Pro Ala Phe His Arg Ala Lys
20 25 30
Asp Pro Pro Ile Leu Phe Pro Phe Met Ile Asp Ser Ala Val Glu His
35 40 45
Gln Gly Gln Ile Tyr Gly Asp Gln Gly Leu Arg Arg Gln Gln Val Leu
50 55 60
Gly Glu Ser Asn Gln Gln Phe Asn Asp His Met Met Met Gly Gly Ser
65 70 75 80
Asp Val Phe Leu Thr Pro Ser Pro Phe Arg Pro Thr Ile Gln Ser Ile
85 90 95
Gly Ser Asp Met Ile Gln Arg Ser Ser Tyr Asp Pro Tyr Asp Ile Glu
100 105 110
Ser Asn Asn Lys Gln His Ala Asn Gly Ser Thr Ser Lys Trp Met Ser
115 120 125
Thr Pro Pro Met Lys Met Arg Ile Ile Arg Lys Gly Ala Ala Thr Asp
130 135 140
Pro Glu Gly Gly Ala Val Arg Lys Pro Arg Arg Arg Ala Gln Ala His
145 150 155 160
Gln Asp Glu Ser Gln Gln Gln Leu Gln Gln Ala Leu Gly Val Val Arg
165 170 175
Val Cys Ser Asp Cys Asn Thr Thr Lys Thr Pro Leu Trp Arg Ser Gly
180 185 190
Pro Cys Gly Pro Lys Ser Leu Cys Asn Ala Cys Gly Ile Arg Gln Arg
195 200 205
Lys Ala Arg Arg Ala Met Ala Ala Ala Ala Asn Gly Gly Ala Ala Val
210 215 220
Ala Pro Ala Lys Ser Val Ala Ala Ala Pro Val Asn Asn Lys Pro Ala
225 230 235 240
Ala Lys Lys Glu Lys Arg Ala Ala Asp Val Asp Arg Ser Leu Pro Phe
245 250 255
Lys Lys Arg Cys Lys Met Val Asp His Val Ala Ala Ala Val Ala Ala
260 265 270
Thr Lys Pro Thr Ala Ala Gly Glu Val Val Ala Ala Ala Pro Lys Asp
275 280 285
Gln Asp His Val Ile Val Val Gly Gly Glu Asn Ala Ala Ala Thr Ser
290 295 300
Met Pro Ala Gln Asn Pro Ile Ser Lys Ala Ala Ala Thr Ala Ala Ala
305 310 315 320
Ala Ala Ala Ser Pro Ala Phe Phe His Gly Leu Pro Arg Asp Glu Ile
325 330 335
Thr Asp Ala Ala Met Leu Leu Met Thr Leu Ser Cys Gly Leu Val His
340 345 350
Ser
<210>130
<211>16
<212>PRT
<213>Artificial sequence
<220>
<223>Motif 1
<220>
<221>VARIANT
<222>(2)..(2)
<223>/replace="Ala" /replace="Thr"
<220>
<221>VARIANT
<222>(3)..(3)
<223>/replace="Glu" /replace="Asn"
<220>
<221>UNSURE
<222>(5)..(5)
<223>Xaa can be any naturally occurring amino acid, preferably one of Asn, Lys, Gly, His, Asp
<220>
<221>VARIANT
<222>(7)..(7)
<223>/replace="Ser" /replace="Ala"
<220>
<221>VARIANT
<222>(8)..(8)
<223>/replace="Ser"
<220>
<221>VARIANT
<222>(11)..(11)
<223>/replace="Met"
<220>
<221>VARIANT
<222>(14)..(14)
<223>/replace="Gly" /replace="Asn"
<400>130
Cys Ser Asp Cys Xaa Thr Thr Lys Thr Pro Leu Trp Arg Ser Gly Pro
1 5 10 15
<210>131
<211>15
<212>PRT
<213>Artificial sequence
<220>
<223>Motif 2
<220>
<221>UNSURE
<222>(13)..(13)
<223>Xaa can be any naturally occurring amino acid, preferably one of Gln, His, Asn, Ser, Tyr, Phe
<220>
<221>VARIANT
<222>(14)..(14)
<223>/replace="Lys"
<400>131
Gly Pro Lys Ser Leu Cys Asn Ala Cys Gly Ile Arg Xaa Arg Lys
1 5 10 15
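The feature table of SEQ ID NO: 131 (consensus residues, one UNSURE position, one VARIANT position) can be read as a search pattern. The following is a minimal sketch, assuming that Xaa may be treated as a wildcard and that each VARIANT entry adds an allowed alternative at its position; the helper names `motif_to_regex` and `to_one_letter` are hypothetical, and the test fragment is residues 193 to 212 of SEQ ID NO: 129 as printed in this listing.

```python
import re

# Three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(three_letter: str) -> str:
    """Convert a space-separated three-letter sequence to one-letter code."""
    return "".join(THREE_TO_ONE[res] for res in three_letter.split())


def motif_to_regex(consensus: str, variants=None) -> str:
    """Build a regular expression from a consensus motif.

    consensus: space-separated three-letter residues; 'Xaa' means any residue.
    variants:  {1-based position: [alternative three-letter residues]}.
    """
    variants = variants or {}
    parts = []
    for pos, res in enumerate(consensus.split(), start=1):
        if res == "Xaa":
            parts.append(".")  # simplification: any single character
            continue
        letters = "".join(THREE_TO_ONE[r] for r in [res] + variants.get(pos, []))
        parts.append(letters if len(letters) == 1 else "[" + letters + "]")
    return "".join(parts)


# SEQ ID NO: 131 with its VARIANT entry at position 14 (Arg may be Lys).
MOTIF_2 = "Gly Pro Lys Ser Leu Cys Asn Ala Cys Gly Ile Arg Xaa Arg Lys"
pattern = motif_to_regex(MOTIF_2, {14: ["Lys"]})

# Residues 193-212 of SEQ ID NO: 129.
fragment = to_one_letter(
    "Pro Cys Gly Pro Lys Ser Leu Cys Asn Ala Cys Gly Ile Arg Gln Arg "
    "Lys Ala Arg Arg"
)
match = re.search(pattern, fragment)
print(pattern, match.group(), 193 + match.start())
```

In this sketch the motif is found starting at residue 195 of SEQ ID NO: 129. The "preferably one of" list at position 13 is deliberately not enforced, since the listing allows any natural amino acid there.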
<210>132
<211>9
<212>PRT
<213>Artificial sequence
<220>
<223>Motif 3
<220>
<221>VARIANT
<222>(1)..(1)
<223>/replace="Ser"
<220>
<221>VARIANT
<222>(2)..(2)
<223>/replace="Trp"
<220>
<221>UNSURE
<222>(3)..(3)
<223>Xaa can be any naturally occurring amino acid, preferably one of Met, Leu, Val, Ile, Arg
<220>
<221>VARIANT
<222>(4)..(4)
<223>/replace="Cys"
<220>
<221>VARIANT
<222>(5)..(5)
<223>/replace="Asn"
<220>
<221>VARIANT
<222>(6)..(6)
<223>/replace="Leu" /replace="Val"
<220>
<221>VARIANT
<222>(7)..(7)
<223>/replace="Leu" /replace="Ala"
<220>
<221>VARIANT
<222>(8)..(8)
<223>/replace="Asp"
<220>
<221>VARIANT
<222>(9)..(9)
<223>/replace="Arg"
<400>132
Ala Ala Xaa Leu Leu Met Thr Leu Ser
1 5
<210>133
<211>58
<212>DNA
<213>Artificial sequence
<220>
<223>Primer: prm10133
<400>133
ggggacaagt ttgtacaaaa aagcaggctt aaacaatgtc tactatctac atgagcca 58
<210>134
<211>50
<212>DNA
<213>Artificial sequence
<220>
<223>Primer: prm10134
<400>134
ggggaccact ttgtacaaga aagctgggta gctagctagt tttgatcagc 50
<210>135
<211>2194
<212>DNA
<213>Oryza sativa
<400>135
aatccgaaaa gtttctgcac cgttttcacc ccctaactaa caatataggg aacgtgtgct 60
aaatataaaa tgagacctta tatatgtagc gctgataact agaactatgc aagaaaaact 120
catccaccta ctttagtggc aatcgggcta aataaaaaag agtcgctaca ctagtttcgt 180
tttccttagt aattaagtgg gaaaatgaaa tcattattgc ttagaatata cgttcacatc 240
tctgtcatga agttaaatta ttcgaggtag ccataattgt catcaaactc ttcttgaata 300
aaaaaatctt tctagctgaa ctcaatgggt aaagagagag atttttttta aaaaaataga 360
atgaagatat tctgaacgta ttggcaaaga tttaaacata taattatata attttatagt 420
ttgtgcattc gtcatatcgc acatcattaa ggacatgtct tactccatcc caatttttat 480
ttagtaatta aagacaattg acttattttt attatttatc ttttttcgat tagatgcaag 540
gtacttacgc acacactttg tgctcatgtg catgtgtgag tgcacctcct caatacacgt 600
tcaactagca acacatctct aatatcactc gcctatttaa tacatttagg tagcaatatc 660
tgaattcaag cactccacca tcaccagacc acttttaata atatctaaaa tacaaaaaat 720
aattttacag aatagcatga aaagtatgaa acgaactatt taggtttttc acatacaaaa 780
aaaaaaagaa ttttgctcgt gcgcgagcgc caatctccca tattgggcac acaggcaaca 840
acagagtggc tgcccacaga acaacccaca aaaaacgatg atctaacgga ggacagcaag 900
tccgcaacaa ccttttaaca gcaggctttg cggccaggag agaggaggag aggcaaagaa 960
aaccaagcat cctccttctc ccatctataa attcctcccc ccttttcccc tctctatata 1020
ggaggcatcc aagccaagaa gagggagagc accaaggaca cgcgactagc agaagccgag 1080
cgaccgcctt ctcgatccat atcttccggt cgagttcttg gtcgatctct tccctcctcc 1140
acctcctcct cacagggtat gtgcctccct tcggttgttc ttggatttat tgttctaggt 1200
tgtgtagtac gggcgttgat gttaggaaag gggatctgta tctgtgatga ttcctgttct 1260
tggatttggg atagaggggt tcttgatgtt gcatgttatc ggttcggttt gattagtagt 1320
atggttttca atcgtctgga gagctctatg gaaatgaaat ggtttaggga tcggaatctt 1380
gcgattttgt gagtaccttt tgtttgaggt aaaatcagag caccggtgat tttgcttggt 1440
gtaataaagt acggttgttt ggtcctcgat tctggtagtg atgcttctcg atttgacgaa 1500
gctatccttt gtttattccc tattgaacaa aaataatcca actttgaaga cggtcccgtt 1560
gatgagattg aatgattgat tcttaagcct gtccaaaatt tcgcagctgg cttgtttaga 1620
tacagtagtc cccatcacga aattcatgga aacagttata atcctcagga acaggggatt 1680
ccctgttctt ccgatttgct ttagtcccag aatttttttt cccaaatatc ttaaaaagtc 1740
actttctggt tcagttcaat gaattgattg ctacaaataa tgcttttata gcgttatcct 1800
agctgtagtt cagttaatag gtaatacccc tatagtttag tcaggagaag aacttatccg 1860
atttctgatc tccattttta attatatgaa atgaactgta gcataagcag tattcatttg 1920
gattattttt tttattagct ctcacccctt cattattctg agctgaaagt ctggcatgaa 1980
ctgtcctcaa ttttgttttc aaattcacat cgattatcta tgcattatcc tcttgtatct 2040
acctgtagaa gtttcttttt ggttattcct tgactgcttg attacagaaa gaaatttatg 2100
aagctgtaat cgggatagtt atactgcttg ttcttatgat tcatttcctt tgtgcagttc 2160
ttggtgtagc ttgccacttt caccagcaaa gttc 2194
<210>136
<211>3308
<212>DNA
<213>Artificial sequence
<220>
<223>Expression cassette
<400>136
aatccgaaaa gtttctgcac cgttttcacc ccctaactaa caatataggg aacgtgtgct 60
aaatataaaa tgagacctta tatatgtagc gctgataact agaactatgc aagaaaaact 120
catccaccta ctttagtggc aatcgggcta aataaaaaag agtcgctaca ctagtttcgt 180
tttccttagt aattaagtgg gaaaatgaaa tcattattgc ttagaatata cgttcacatc 240
tctgtcatga agttaaatta ttcgaggtag ccataattgt catcaaactc ttcttgaata 300
aaaaaatctt tctagctgaa ctcaatgggt aaagagagag atttttttta aaaaaataga 360
atgaagatat tctgaacgta ttggcaaaga tttaaacata taattatata attttatagt 420
ttgtgcattc gtcatatcgc acatcattaa ggacatgtct tactccatcc caatttttat 480
ttagtaatta aagacaattg acttattttt attatttatc ttttttcgat tagatgcaag 540
gtacttacgc acacactttg tgctcatgtg catgtgtgag tgcacctcct caatacacgt 600
tcaactagca acacatctct aatatcactc gcctatttaa tacatttagg tagcaatatc 660
tgaattcaag cactccacca tcaccagacc acttttaata atatctaaaa tacaaaaaat 720
aattttacag aatagcatga aaagtatgaa acgaactatt taggtttttc acatacaaaa 780
aaaaaaagaa ttttgctcgt gcgcgagcgc caatctccca tattgggcac acaggcaaca 840
acagagtggc tgcccacaga acaacccaca aaaaacgatg atctaacgga ggacagcaag 900
tccgcaacaa ccttttaaca gcaggctttg cggccaggag agaggaggag aggcaaagaa 960
aaccaagcat cctcctcctc ccatctataa attcctcccc ccttttcccc tctctatata 1020
ggaggcatcc aagccaagaa gagggagagc accaaggaca cgcgactagc agaagccgag 1080
cgaccgcctt cttcgatcca tatcttccgg tcgagttctt ggtcgatctc ttccctcctc 1140
cacctcctcc tcacagggta tgtgcccttc ggttgttctt ggatttattg ttctaggttg 1200
tgtagtacgg gcgttgatgt taggaaaggg gatctgtatc tgtgatgatt cctgttcttg 1260
gatttgggat agaggggttc ttgatgttgc atgttatcgg ttcggtttga ttagtagtat 1320
ggttttcaat cgtctggaga gctctatgga aatgaaatgg tttagggtac ggaatcttgc 1380
gattttgtga gtaccttttg tttgaggtaa aatcagagca ccggtgattt tgcttggtgt 1440
aataaaagta cggttgtttg gtcctcgatt ctggtagtga tgcttctcga tttgacgaag 1500
ctatcctttg tttattccct attgaacaaa aataatccaa ctttgaagac ggtcccgttg 1560
atgagattga atgattgatt cttaagcctg tccaaaattt cgcagctggc ttgtttagat 1620
acagtagtcc ccatcacgaa attcatggaa acagttataa tcctcaggaa caggggattc 1680
cctgttcttc cgatttgctt tagtcccaga attttttttc ccaaatatct taaaaagtca 1740
ctttctggtt cagttcaatg aattgattgc tacaaataat gcttttatag cgttatccta 1800
gctgtagttc agttaatagg taatacccct atagtttagt caggagaaga acttatccga 1860
tttctgatct ccatttttaa ttatatgaaa tgaactgtag cataagcagt attcatttgg 1920
attatttttt ttattagctc tcaccccttc attattctga gctgaaagtc tggcatgaac 1980
tgtcctcaat tttgttttca aattcacatc gattatctat gcattatcct cttgtatcta 2040
cctgtagaag tttctttttg gttattcctt gactgcttga ttacagaaag aaatttatga 2100
agctgtaatc gggatagtta tactgcttgt tcttatgatt catttccttt gtgcagttct 2160
tggtgtagct tgccactttc accagcaaag ttcatttaaa tcaactaggg atatcacaag 2220
tttgtacaaa aaagcaggct taaacaatgt ctactatcta catgagccag ctacctgcta 2280
ctctccctct aatggagggg gatcaggatc aggggctcta cccagccttc catagagcaa 2340
aggaccctcc tatcttgttc cctttcatga tcgacagcgc cgtcgagcac caagggcaaa 2400
tctatggaga tcagggcttg aggaggcagc aggttttggg tgaatccaat caacagttca 2460
atgatcacat gatgatgggc ggatcagatg tcttcctcac accgtctccg ttccgaccaa 2520
ccatccaaag catcggcagc gacatgatcc agcgatcatc ttatgatcca tacgatatcg 2580
agagtaacaa caagcagcat gccaatggat caaccagcaa gtggatgtcg acgccgccaa 2640
tgaagatgag gatcataagg aagggggcgg caaccgatcc tgagggcggg gcggtgagaa 2700
agccaaggag aagagcacaa gcgcaccagg atgagagcca gcaacaactg cagcaagctt 2760
tgggtgtcgt tagagtgtgc tcggactgca acaccaccaa gacccccttg tggagaagtg 2820
gtccttgtgg ccccaagtcc ctttgcaacg cgtgtggcat caggcaaagg aaggcgcggc 2880
gggcgatggc cgctgctgcc aacggcggag cggcggtggc gccggcaaag agcgtggccg 2940
cggcgccggt gaacaataag ccggcggcga agaaggagaa gagggcggcg gacgtcgacc 3000
ggtcgctgcc gttcaagaaa cggtgcaaga tggtcgatca cgttgctgct gccgtcgctg 3060
ccaccaagcc cacggctgct ggagaagtag tggccgccgc tccgaaggac caagatcacg 3120
tcatcgtcgt cggtggcgag aacgccgccg ccacctccat gccggcacag aacccgatat 3180
ccaaggcggc ggcgaccgcc gctgccgccg ccgcctctcc ggcgttcttc cacggcctcc 3240
ctcgcgacga gatcaccgac gccgccatgc tgctcatgac cctatcctgt ggcctcgtcc 3300
acagctag 3308
<210>137
<211>1044
<212>DNA
<213>Oryza sativa
<400>137
atgtctacta tctacatgag ccagctacct gctactctcc ctctaatgga gggggatcag 60
gatcaggggc tctacccagc cttccataga gcaaaggacc ctcctatctt gttccctttc 120
atgatcgaca gcgccgtcga gcaccaaggg caaatctatg gagatcaggg cttgaggagg 180
cagcaggttt tgggtgaatc caatcaacag ttcaatgatc acatgatgat gggcggatca 240
gatgtcttcc tcacaccgtc tccgttccga ccaaccatcc aaagcatcgg cagtgacatg 300
atccagcgat catcttatga tccatacgat atcgagagta acaacaagca gcatgccaat 360
ggatcaacca gcaagtggat gtcgacgccg ccaatgaaga tgaggatcat aaggaagggg 420
gcggcaaccg atcctgaggg cggggcggtg agaaagccaa ggagaagagc acaagcgcac 480
caggatgaga gccagcaaca actgcagcaa gctttgggtg tcgttagagt gtgctcggac 540
tgcaacacca ccaagacccc cttgtggaga agtggtcctt gtggccccaa gtccctttgc 600
aacgcgtgtg gcatcaggca aaggaaggcg cggcgggcga tggccgctgc tgccaacggc 660
ggagcggcgg tggcgccggc aaagagcgtg gccgcggcgc cggtgaacaa taagccggcg 720
gcgaagaagg agaagagggc ggcggacgtc gaccggtcgc tgccgttcaa gaaacggtgc 780
aagatggtcg atcacgttgc tgctgccgtc gctgccacca agcccacggc tgctggagaa 840
gtagtggccg ccgctccgaa ggaccaagat cacgtcatcg tcgtcggtgg cgagaacgcc 900
gccgccacct ccatgccggc acagaacccg atatccaagg cggcggcggc ctctccggcg 960
ttcttccacg gcctccctcg cgacgagatc accgacgccg ccatgctgct catgacccta 1020
tcctgtggcc tcgtccacag ctag 1044
<210>138
<211>347
<212>PRT
<213>Oryza sativa
<400>138
Met Ser Thr Ile Tyr Met Ser Gln Leu Pro Ala Thr Leu Pro Leu Met
1 5 10 15
Glu Gly Asp Gln Asp Gln Gly Leu Tyr Pro Ala Phe His Arg Ala Lys
20 25 30
Asp Pro Pro Ile Leu Phe Pro Phe Met Ile Asp Ser Ala Val Glu His
35 40 45
Gln Gly Gln Ile Tyr Gly Asp Gln Gly Leu Arg Arg Gln Gln Val Leu
50 55 60
Gly Glu Ser Asn Gln Gln Phe Asn Asp His Met Met Met Gly Gly Ser
65 70 75 80
Asp Val Phe Leu Thr Pro Ser Pro Phe Arg Pro Thr Ile Gln Ser Ile
85 90 95
Gly Ser Asp Met Ile Gln Arg Ser Ser Tyr Asp Pro Tyr Asp Ile Glu
100 105 110
Ser Asn Asn Lys Gln His Ala Asn Gly Ser Thr Ser Lys Trp Met Ser
115 120 125
Thr Pro Pro Met Lys Met Arg Ile Ile Arg Lys Gly Ala Ala Thr Asp
130 135 140
Pro Glu Gly Gly Ala Val Arg Lys Pro Arg Arg Arg Ala Gln Ala His
145 150 155 160
Gln Asp Glu Ser Gln Gln Gln Leu Gln Gln Ala Leu Gly Val Val Arg
165 170 175
Val Cys Ser Asp Cys Asn Thr Thr Lys Thr Pro Leu Trp Arg Ser Gly
180 185 190
Pro Cys Gly Pro Lys Ser Leu Cys Asn Ala Cys Gly Ile Arg Gln Arg
195 200 205
Lys Ala Arg Arg Ala Met Ala Ala Ala Ala Asn Gly Gly Ala Ala Val
210 215 220
Ala Pro Ala Lys Ser Val Ala Ala Ala Pro Val Asn Asn Lys Pro Ala
225 230 235 240
Ala Lys Lys Glu Lys Arg Ala Ala Asp Val Asp Arg Ser Leu Pro Phe
245 250 255
Lys Lys Arg Cys Lys Met Val Asp His Val Ala Ala Ala Val Ala Ala
260 265 270
Thr Lys Pro Thr Ala Ala Gly Glu Val Val Ala Ala Ala Pro Lys Asp
275 280 285
Gln Asp His Val Ile Val Val Gly Gly Glu Asn Ala Ala Ala Thr Ser
290 295 300
Met Pro Ala Gln Asn Pro Ile Ser Lys Ala Ala Ala Ala Ser Pro Ala
305 310 315 320
Phe Phe His Gly Leu Pro Arg Asp Glu Ile Thr Asp Ala Ala Met Leu
325 330 335
Leu Met Thr Leu Ser Cys Gly Leu Val His Ser
340 345
<210>139
<211>1065
<212>DNA
<213>Oryza sativa
<400>139
atgtctacta tctacatgag ccagctacct gctactctcc ctctaatgga gggggatcag 60
gatcaggggc tctacccagc cttccataga gcaaaggacc ctcctatctt gttccctttc 120
atgatcgaca gcgccgtcga gcaccaaggg caaatctatg gagatcaggg cttgaggagg 180
cagcaggttt tgggtgaatc caatcaacag ttcaatgatc acatgatgat gggcggatca 240
gatgtcttcc tcacaccgtc tccgttccga ccaaccatcc aaagcatcgg cagcgacatg 300
atccagcgat catcttatga tccatacgat atcgagagta acaacaagca gcatgccaat 360
ggatcaacca gcaagtggat gtcgacgccg ccaatgaaga tgaggatcat aaggaagggg 420
gcggcaaccg atcctgaggg cggggcggtg agaaagccaa ggagaagagc acaagcgcac 480
caggatgaga gccagcaaca actgcagcaa gctttgggtg tcgttagagt gtgctcggac 540
tgcaacacca ccaagacccc cttgtggaga agtggtcctt gtggccccaa gtccctttgc 600
aacgcgtgtg gcatcaggca aaggaaggcg cggcgggcga tggccgctgc tgccaacggg 660
cggagcggcc ggtggcgccg gcaaagaggc gtggccgcgg cgccggtgaa caataagccg 720
gcggcgaaga aggagaagag ggcggcggac gtcgaccggt cgctgccgtt caagaaacgg 780
tgcaagatgg tcgatcacgt tgctgctgcc gtcgctgcca ccaagcccac ggctgctgga 840
gaagtagtgg ccgccgctcc gaaggaccaa gatcacgtca tcgtcgtcgg tggcgagaac 900
gccgccgcca cctccatgcc ggcacagaac ccgatatcca aggcggcggc gaccgccgct 960
gccgccgccg cctctccggc gttcttccac ggcctccctc gcgacgagat caccgacgcc 1020
gccatgctgc tcatgaccct atcctgtggc ctcgtccaca gctag 1065
<210>140
<211>354
<212>PRT
<213>Oryza sativa
<400>140
Met Ser Thr Ile Tyr Met Ser Gln Leu Pro Ala Thr Leu Pro Leu Met
1 5 10 15
Glu Gly Asp Gln Asp Gln Gly Leu Tyr Pro Ala Phe His Arg Ala Lys
20 25 30
Asp Pro Pro Ile Leu Phe Pro Phe Met Ile Asp Ser Ala Val Glu His
35 40 45
Gln Gly Gln Ile Tyr Gly Asp Gln Gly Leu Arg Arg Gln Gln Val Leu
50 55 60
Gly Glu Ser Asn Gln Gln Phe Asn Asp His Met Met Met Gly Gly Ser
65 70 75 80
Asp Val Phe Leu Thr Pro Ser Pro Phe Arg Pro Thr Ile Gln Ser Ile
85 90 95
Gly Ser Asp Met Ile Gln Arg Ser Ser Tyr Asp Pro Tyr Asp Ile Glu
100 105 110
Ser Asn Asn Lys Gln His Ala Asn Gly Ser Thr Ser Lys Trp Met Ser
115 120 125
Thr Pro Pro Met Lys Met Arg Ile Ile Arg Lys Gly Ala Ala Thr Asp
130 135 140
Pro Glu Gly Gly Ala Val Arg Lys Pro Arg Arg Arg Ala Gln Ala His
145 150 155 160
Gln Asp Glu Ser Gln Gln Gln Leu Gln Gln Ala Leu Gly Val Val Arg
165 170 175
Val Cys Ser Asp Cys Asn Thr Thr Lys Thr Pro Leu Trp Arg Ser Gly
180 185 190
Pro Cys Gly Pro Lys Ser Leu Cys Asn Ala Cys Gly Ile Arg Gln Arg
195 200 205
Lys Ala Arg Arg Ala Met Ala Ala Ala Ala Asn Gly Arg Ser Gly Arg
210 215 220
Trp Arg Arg Gln Arg Gly Val Ala Ala Ala Pro Val Asn Asn Lys Pro
225 230 235 240
Ala Ala Lys Lys Glu Lys Arg Ala Ala Asp Val Asp Arg Ser Leu Pro
245 250 255
Phe Lys Lys Arg Cys Lys Met Val Asp His Val Ala Ala Ala Val Ala
260 265 270
Ala Thr Lys Pro Thr Ala Ala Gly Glu Val Val Ala Ala Ala Pro Lys
275 280 285
Asp Gln Asp His Val Ile Val Val Gly Gly Glu Asn Ala Ala Ala Thr
290 295 300
Ser Met Pro Ala Gln Asn Pro Ile Ser Lys Ala Ala Ala Thr Ala Ala
305 310 315 320
Ala Ala Ala Ala Ser Pro Ala Phe Phe His Gly Leu Pro Arg Asp Glu
325 330 335
Ile Thr Asp Ala Ala Met Leu Leu Met Thr Leu Ser Cys Gly Leu Val
340 345 350
His Ser
<210>141
<211>1128
<212>DNA
<213>Oryza sativa
<400>141
atggaggggg agcaccacca tcaccaccag gatcatcacc aaggccactt ccaagccttc 60
tccctgcagc ctaaggatcc cccagtctta ttcccctttg tgatcagtag aagaagcagc 120
agcagcagcc ctagcgacag caccactcta agctatggtt cagaccatca cttgacacag 180
cagcagcagc atcagcatca agccatgctt gagccccaaa atatgattgg aggatcatcc 240
gctggcatct ttgcgacgcc gttcccgacc gtcaagagca tccgcgacga catgatcgag 300
cggtcgcagt tcgatccata cgataccgag aagctgcagg cgagctgcgg gttagccaag 360
gtcgtcgccg gcggcaagtg gagcgcggtg ccagcggcca agatgaagat cacgaggaag 420
atgggtgagc cgtcgtccgg tgtcactggc ggggctgcga cgacggtggc gccgaagaag 480
ccgaggagga ggccggcgca ggcgtacgag gatcacggcc atggcggcgc catgggccaa 540
gcttttggcg tgattagggt gtgctccgac tgcaacacca ccaagactcc cttgtggagg 600
agtggcccgt gcggccccaa gtcgctttgc aacgcgtgcg gcatcaggca gaggaaggcg 660
cggcgggcga tgatggcctc cggactacca gcgtccccca acgccgccgg ccccaaggcg 720
gccgcacata gcggcgccgc tgcggtggcg gctgcgcagc cgaaggtgaa gaaggagaag 780
agagccgacg tcgaccggtc gtcgctgccg ttcaagaaac ggtgcaaggt cgtccaggtc 840
gaggatcatc aaacgctgcc cgccgccaca aacgcagccg ccgcagctgc catggaggag 900
acggccgagt ccgccaccgt cgccccgccc ccggcgccga cgacgagggg tggtactctc 960
gtcgacagca tcgggctcag ctggagcaag acccatgccg ccgccaccgc ctcctgcagc 1020
ttccggccgt caccggtggc tcccggcttc gcggcggcgg tgcaggacga gatcactgac 1080
gccgccatgc tgctcatgac gctgtcctgc gggcttgtcc ggagctga 1128
<210>142
<211>375
<212>PRT
<213>Oryza sativa
<400>142
Met Glu Gly Glu His His His His His Gln Asp His His Gln Gly His
1 5 10 15
Phe Gln Ala Phe Ser Leu Gln Pro Lys Asp Pro Pro Val Leu Phe Pro
20 25 30
Phe Val Ile Ser Arg Arg Ser Ser Ser Ser Ser Pro Ser Asp Ser Thr
35 40 45
Thr Leu Ser Tyr Gly Ser Asp His His Leu Thr Gln Gln Gln Gln His
50 55 60
Gln His Gln Ala Met Leu Glu Pro Gln Asn Met Ile Gly Gly Ser Ser
65 70 75 80
Ala Gly Ile Phe Ala Thr Pro Phe Pro Thr Val Lys Ser Ile Arg Asp
85 90 95
Asp Met Ile Glu Arg Ser Gln Phe Asp Pro Tyr Asp Thr Glu Lys Leu
100 105 110
Gln Ala Ser Cys Gly Leu Ala Lys Val Val Ala Gly Gly Lys Trp Ser
115 120 125
Ala Val Pro Ala Ala Lys Met Lys Ile Thr Arg Lys Met Gly Glu Pro
130 135 140
Ser Ser Gly Val Thr Gly Gly Ala Ala Thr Thr Val Ala Pro Lys Lys
145 150 155 160
Pro Arg Arg Arg Pro Ala Gln Ala Tyr Glu Asp His Gly His Gly Gly
165 170 175
Ala Met Gly Gln Ala Phe Gly Val Ile Arg Val Cys Ser Asp Cys Asn
180 185 190
Thr Thr Lys Thr Pro Leu Trp Arg Ser Gly Pro Cys Gly Pro Lys Ser
195 200 205
Leu Cys Asn Ala Cys Gly Ile Arg Gln Arg Lys Ala Arg Arg Ala Met
210 215 220
Met Ala Ser Gly Leu Pro Ala Ser Pro Asn Ala Ala Gly Pro Lys Ala
225 230 235 240
Ala Ala His Ser Gly Ala Ala Ala Val Ala Ala Ala Gln Pro Lys Val
245 250 255
Lys Lys Glu Lys Arg Ala Asp Val Asp Arg Ser Ser Leu Pro Phe Lys
260 265 270
Lys Arg Cys Lys Val Val Gln Val Glu Asp His Gln Thr Leu Pro Ala
275 280 285
Ala Thr Asn Ala Ala Ala Ala Ala Ala Met Glu Glu Thr Ala Glu Ser
290 295 300
Ala Thr Val Ala Pro Pro Pro Ala Pro Thr Thr Arg Gly Gly Thr Leu
305 310 315 320
Val Asp Ser Ile Gly Leu Ser Trp Ser Lys Thr His Ala Ala Ala Thr
325 330 335
Ala Ser Cys Ser Phe Arg Pro Ser Pro Val Ala Pro Gly Phe Ala Ala
340 345 350
Ala Val Gln Asp Glu Ile Thr Asp Ala Ala Met Leu Leu Met Thr Leu
355 360 365
Ser Cys Gly Leu Val Arg Ser
370 375
<210>143
<211>1119
<212>DNA
<213>Oryza sativa
<400>143
atggaggggg agcaccacca ggatcatcac caaggccact tccaagcctt ctccctgcag 60
cctaaggatc ccccagtctt attccccttt gtgatcaata gaagaagcag cagcagcagc 120
cctagcgaca gcaccactct aagctatggt tcagaccatc acttgacaca gcagcagcag 180
catcagcatc aagccatgct tgagccccaa catatgattg gaggatcatc cgctggcatc 240
tttgcgacgc cgttcccgac cgtcaagagc atccgcgacg acatgatcga gcggtcgcag 300
ttcgatccat acgataccga gaagctgcag gcgagctgcg ggttagccaa ggtcgtcgcc 360
ggcggcaagt ggagcgcggt gccagcggcc aagatgaaga tcacgaggaa gatgggtgag 420
ccgtcgtccg gtgtcactgg cggggctgcg acgacggtgg cgccgaagaa gccgaggagg 480
aggctggcgc aggcgtacga ggatcacggc catggcggcg ccatgggcca agcttttggc 540
gtgattaggg tgtgctccga ctgcaacacc accaagactc ccttgtggag gagtggcccg 600
tgcggcccca agtcgctttg caacgcgtgc ggcatcaggc agaggaaggc gcggcgggcg 660
atgatggcct ccggactacc agcgtccccc aacgccgccg gccccaaggc ggccgcacat 720
agcggcgccg ctgcggtggc ggctgcgcag ccgaaggtga agaaggagaa gagagccgac 780
gtcgaccggt cgtcgctgcc gttcaagaaa cggtgcaagg ccgtccaggt cgaggatcat 840
caaacgctgc ccgccgccac aaacgcagcc gccgcagctg ccatggagga gacggccgag 900
tccgccaccg tcgccccgcc cccggcgccg acgacgaggg gttgtactct cgtcgacagc 960
atcgggctca gctggagcaa gacccatgcc gccgccaccg cctcctgcag cttccggccg 1020
tcaccggtgg ctcccggctt cgcggcggcg gtgcaggacg agatcactga cgccgccatg 1080
ctgctcatga cgctgtcctg cgggcttgtc cggagctga 1119
<210>144
<211>372
<212>PRT
<213>Oryza sativa
<400>144
Met Glu Gly Glu His His Gln Asp His His Gln Gly His Phe Gln Ala
1 5 10 15
Phe Ser Leu Gln Pro Lys Asp Pro Pro Val Leu Phe Pro Phe Val Ile
20 25 30
Asn Arg Arg Ser Ser Ser Ser Ser Pro Ser Asp Ser Thr Thr Leu Ser
35 40 45
Tyr Gly Ser Asp His His Leu Thr Gln Gln Gln Gln His Gln His Gln
50 55 60
Ala Met Leu Glu Pro Gln His Met Ile Gly Gly Ser Ser Ala Gly Ile
65 70 75 80
Phe Ala Thr Pro Phe Pro Thr Val Lys Ser Ile Arg Asp Asp Met Ile
85 90 95
Glu Arg Ser Gln Phe Asp Pro Tyr Asp Thr Glu Lys Leu Gln Ala Ser
100 105 110
Cys Gly Leu Ala Lys Val Val Ala Gly Gly Lys Trp Ser Ala Val Pro
115 120 125
Ala Ala Lys Met Lys Ile Thr Arg Lys Met Gly Glu Pro Ser Ser Gly
130 135 140
Val Thr Gly Gly Ala Ala Thr Thr Val Ala Pro Lys Lys Pro Arg Arg
145 150 155 160
Arg Leu Ala Gln Ala Tyr Glu Asp His Gly His Gly Gly Ala Met Gly
165 170 175
Gln Ala Phe Gly Val Ile Arg Val Cys Ser Asp Cys Asn Thr Thr Lys
180 185 190
Thr Pro Leu Trp Arg Ser Gly Pro Cys Gly Pro Lys Ser Leu Cys Asn
195 200 205
Ala Cys Gly Ile Arg Gln Arg Lys Ala Arg Arg Ala Met Met Ala Ser
210 215 220
Gly Leu Pro Ala Ser Pro Asn Ala Ala Gly Pro Lys Ala Ala Ala His
225 230 235 240
Ser Gly Ala Ala Ala Val Ala Ala Ala Gln Pro Lys Val Lys Lys Glu
245 250 255
Lys Arg Ala Asp Val Asp Arg Ser Ser Leu Pro Phe Lys Lys Arg Cys
260 265 270
Lys Ala Val Gln Val Glu Asp His Gln Thr Leu Pro Ala Ala Thr Asn
275 280 285
Ala Ala Ala Ala Ala Ala Met Glu Glu Thr Ala Glu Ser Ala Thr Val
290 295 300
Ala Pro Pro Pro Ala Pro Thr Thr Arg Gly Cys Thr Leu Val Asp Ser
305 310 315 320
Ile Gly Leu Ser Trp Ser Lys Thr His Ala Ala Ala Thr Ala Ser Cys
325 330 335
Ser Phe Arg Pro Ser Pro Val Ala Pro Gly Phe Ala Ala Ala Val Gln
340 345 350
Asp Glu Ile Thr Asp Ala Ala Met Leu Leu Met Thr Leu Ser Cys Gly
355 360 365
Leu Val Arg Ser
370
<210>145
<211>1044
<212>DNA
<213>Oryza sativa
<400>145
atgtctacca tctacatgag tcagctctca gctgctctcc ctctcatgga gggggagcac 60
caccatcacc accaggatca tcaccaaggc cacttccaag ccttctccct gcagcctaag 120
gatcccccag tcttattccc ctttgtgatc agtagaagaa gcagcagcag cagccctagc 180
gacagcacca ctctaagcta tggttcagac catcacttga cacagcagca gcagcatcag 240
catcaagcca tgcttgagcc ccaaaatatg attggaggat catccgctgg catctttgcg 300
acgccgttcc cgaccgtcaa gagcatccgc gacgacatga tcgagcggtc gcagttcgat 360
ccatacgata ccgagaagct gcaggcgagc tgcgggttag ccaaggtcgt cgccggcggc 420
aagtggagcg cggtgccagc ggccaagatg aagatcacga ggaagatggg tgagccgtcg 480
tccggtgtca ctggcggggc tgcgacgacg gtggcgccga agaagccgag gaggaggccg 540
gcgcaggcgt acgaggatca cggccatggc ggcgccatgg gccaagcttt tggcgtgatt 600
agggtgtgct ccgactgcaa caccaccaag actcccttgt ggaggagtgg cccgtgcggc 660
cccaagtcgc tttgcaacgc gtgcggcatc aggcagagga aggcgcggcg ggcgatgatg 720
gcctccggac taccagcgtc ccccaacgcc gccggcccca aggcggccgc acatagcggc 780
gccacaaacg cagccgccgc agctgccatg gaggagacgg ccgagtccgc caccgtcgcc 840
ccgcccccgg cgccgacgac gaggggtggt actctcgtcg acagcatcgg gctcagctgg 900
agcaagaccc atgccgccgc caccgcctcc tgcagcttcc ggccgtcacc ggtggctccc 960
ggcttcgcgg cggcggtgca ggacgagatc actgacgccg ccatgctgct catgacgctg 1020
tcctgcgggc ttgtccggag ctga 1044
<210>146
<211>347
<212>PRT
<213>Oryza sativa
<400>146
Met Ser Thr Ile Tyr Met Ser Gln Leu Ser Ala Ala Leu Pro Leu Met
1 5 10 15
Glu Gly Glu His His His His His Gln Asp His His Gln Gly His Phe
20 25 30
Gln Ala Phe Ser Leu Gln Pro Lys Asp Pro Pro Val Leu Phe Pro Phe
35 40 45
Val Ile Ser Arg Arg Ser Ser Ser Ser Ser Pro Ser Asp Ser Thr Thr
50 55 60
Leu Ser Tyr Gly Ser Asp His His Leu Thr Gln Gln Gln Gln His Gln
65 70 75 80
His Gln Ala Met Leu Glu Pro Gln Asn Met Ile Gly Gly Ser Ser Ala
85 90 95
Gly Ile Phe Ala Thr Pro Phe Pro Thr Val Lys Ser Ile Arg Asp Asp
100 105 110
Met Ile Glu Arg Ser Gln Phe Asp Pro Tyr Asp Thr Glu Lys Leu Gln
115 120 125
Ala Ser Cys Gly Leu Ala Lys Val Val Ala Gly Gly Lys Trp Ser Ala
130 135 140
Val Pro Ala Ala Lys Met Lys Ile Thr Arg Lys Met Gly Glu Pro Ser
145 150 155 160
Ser Gly Val Thr Gly Gly Ala Ala Thr Thr Val Ala Pro Lys Lys Pro
165 170 175
Arg Arg Arg Pro Ala Gln Ala Tyr Glu Asp His Gly His Gly Gly Ala
180 185 190
Met Gly Gln Ala Phe Gly Val Ile Arg Val Cys Ser Asp Cys Asn Thr
195 200 205
Thr Lys Thr Pro Leu Trp Arg Ser Gly Pro Cys Gly Pro Lys Ser Leu
210 215 220
Cys Asn Ala Cys Gly Ile Arg Gln Arg Lys Ala Arg Arg Ala Met Met
225 230 235 240
Ala Ser Gly Leu Pro Ala Ser Pro Asn Ala Ala Gly Pro Lys Ala Ala
245 250 255
Ala His Ser Gly Ala Thr Asn Ala Ala Ala Ala Ala Ala Met Glu Glu
260 265 270
Thr Ala Glu Ser Ala Thr Val Ala Pro Pro Pro Ala Pro Thr Thr Arg
275 280 285
Gly Gly Thr Leu Val Asp Ser Ile Gly Leu Ser Trp Ser Lys Thr His
290 295 300
Ala Ala Ala Thr Ala Ser Cys Ser Phe Arg Pro Ser Pro Val Ala Pro
305 310 315 320
Gly Phe Ala Ala Ala Val Gln Asp Glu Ile Thr Asp Ala Ala Met Leu
325 330 335
Leu Met Thr Leu Ser Cys Gly Leu Val Arg Ser
340 345
<210>147
<211>363
<212>DNA
<213>Arabidopsis thaliana
<400>147
atggatccaa ggaagctact atcttgttca tcctcttacg tgtcagtgag aatgaaagaa 60
gagaagggga caattaggtg ttgcagtgag tgtaagacca ccaagacacc aatgtggaga 120
ggtggaccaa ctggtcctaa gtcactttgc aatgcatgtg gaattagaca cagaaaacag 180
agacgatcag agttattggg tattcatatt attcgcagcc acaaaagctt agcctccaag 240
aagataaacc tattatcatc atcacacggt ggcgtggcgg tgaagaaacg aaggagtcta 300
aaggaggaag aacaagctgc tttgtgtcta ttgttattgt cttgtagctc tgttttggcc 360
tga 363
<210>148
<211>120
<212>PRT
<213>Arabidopsis thaliana
<400>148
Met Asp Pro Arg Lys Leu Leu Ser Cys Ser Ser Ser Tyr Val Ser Val
1 5 10 15
Arg Met Lys Glu Glu Lys Gly Thr Ile Arg Cys Cys Ser Glu Cys Lys
20 25 30
Thr Thr Lys Thr Pro Met Trp Arg Gly Gly Pro Thr Gly Pro Lys Ser
35 40 45
Leu Cys Asn Ala Cys Gly Ile Arg His Arg Lys Gln Arg Arg Ser Glu
50 55 60
Leu Leu Gly Ile His Ile Ile Arg Ser His Lys Ser Leu Ala Ser Lys
65 70 75 80
Lys Ile Asn Leu Leu Ser Ser Ser His Gly Gly Val Ala Val Lys Lys
85 90 95
Arg Arg Ser Leu Lys Glu Glu Glu Gln Ala Ala Leu Cys Leu Leu Leu
100 105 110
Leu Ser Cys Ser Ser Val Leu Ala
115 120
<210>149
<211>1653
<212>DNA
<213>Arabidopsis thaliana
<400>149
atggagcgat ctcgctcaga aacgccatcg tctcgctcac gattgaagct gtgcttcata 60
aactcgccgc catcatcgat attcacgggg tccaagatcg aagctgagga tggttctccg 120
cttgtgatcg agctcgtgga cgccaccaca aacactctag ttagtacggg accgttctcg 180
tcttctcggg tcgagctcgt gccgctgaac gctgatttca cggaagaaag ctggaccgtt 240
gagggattta atcggaatat tctcacgcaa cgtgaaggga aacgtccgtt gctcactgga 300
gacctaacgg tgatgcttaa aaacggtgtt ggagttataa ccggagatat agctttctcg 360
gataactcga gctggactag gagtcggaag ttccggttag gtgctaagtt gaccggagat 420
ggagccgtgg aggcgagaag tgaagctttt ggatgtagag accaacgagg agaatgggtc 480
tcaaagaaaa catggaacac aattgtatca catgccatgg attgcgtttt ggacgaaaca 540
gagtgttaca tttacaatgc aaacactccg ggcgtaacac ttctcttcaa ctctgtttat 600
gagttgataa gagtgtcatt caatggcaac gatatccaaa accttgatca gccaattcta 660
gaccaattaa aggccgaagc ttatcaaaac cttaaccgca ttacagcggt taacgatagg 720
acctttgtgg gtcatccaca aaggtcctta cagtgcccgc aagatcctgg atttgtcgta 780
acatgttctg gatcgcagca catcgacttt caaggaagtt tggatccatc aagctcttcg 840
atggctcttt gccacaaagc ttcaagctca acggtccacc ctgatgtcct gatgagtttt 900
gataactcat caaccgcgag gtttcatatc gacaaaaagt tcttaccgac tttcggaaac 960
agcttcaaag taagtgaact cgatcaagta cacggaaaat cacaaactgt tgtgacaaaa 1020
ggttgtatag agaataacga ggaggatgag aacgcgtttt cttatcatca ccatgatgac 1080
atgacctcaa gctggtcacc tggtacgcac caagccgttg aaacgatgtt tcttaccgtg 1140
tctgagacgg aagaagctgg aatgttcgat gttcattttg caaacgttaa tttgggatct 1200
ccaagagcca ggtggtgtaa ggttaaggca gctttcaagg ttagggcagc ttttaaggaa 1260
gtccggagac acacaactgc cagaaatccg agggaaggct tgaagctact atcttgttca 1320
tcctcttacg tgtcagtgag aatgaaagaa gagaagggga caattaggtg ttgcagtgag 1380
tgtaagacca ccaagacacc aatgtggaga ggtggaccaa ctggtcctaa gtcactttgc 1440
aatgcatgtg gaattagaca cagaaaacag agacgatcag agttattggg tattcatatt 1500
attcgcagcc acaaaagctt agcctccaag aagataaacc tattatcatc atcacacggt 1560
ggcgtggcgg tgaagaaacg aaggagtcta aaggaggaag aacaagctgc tttgtgtcta 1620
ttgttattgt cttgtagctc tgttttggcc taa 1653
<210>150
<211>550
<212>PRT
<213>Arabidopsis thaliana
<400>150
Met Glu Arg Ser Arg Ser Glu Thr Pro Ser Ser Arg Ser Arg Leu Lys
1 5 10 15
Leu Cys Phe Ile Asn Ser Pro Pro Ser Ser Ile Phe Thr Gly Ser Lys
20 25 30
Ile Glu Ala Glu Asp Gly Ser Pro Leu Val Ile Glu Leu Val Asp Ala
35 40 45
Thr Thr Asn Thr Leu Val Ser Thr Gly Pro Phe Ser Ser Ser Arg Val
50 55 60
Glu Leu Val Pro Leu Asn Ala Asp Phe Thr Glu Glu Ser Trp Thr Val
65 70 75 80
Glu Gly Phe Asn Arg Asn Ile Leu Thr Gln Arg Glu Gly Lys Arg Pro
85 90 95
Leu Leu Thr Gly Asp Leu Thr Val Met Leu Lys Asn Gly Val Gly Val
100 105 110
Ile Thr Gly Asp Ile Ala Phe Ser Asp Asn Ser Ser Trp Thr Arg Ser
115 120 125
Arg Lys Phe Arg Leu Gly Ala Lys Leu Thr Gly Asp Gly Ala Val Glu
130 135 140
Ala Arg Ser Glu Ala Phe Gly Cys Arg Asp Gln Arg Gly Glu Trp Val
145 150 155 160
Ser Lys Lys Thr Trp Asn Thr Ile Val Ser His Ala Met Asp Cys Val
165 170 175
Leu Asp Glu Thr Glu Cys Tyr Ile Tyr Asn Ala Asn Thr Pro Gly Val
180 185 190
Thr Leu Leu Phe Asn Ser Val Tyr Glu Leu Ile Arg Val Ser Phe Asn
195 200 205
Gly Asn Asp Ile Gln Asn Leu Asp Gln Pro Ile Leu Asp Gln Leu Lys
210 215 220
Ala Glu Ala Tyr Gln Asn Leu Asn Arg Ile Thr Ala Val Asn Asp Arg
225 230 235 240
Thr Phe Val Gly His Pro Gln Arg Ser Leu Gln Cys Pro Gln Asp Pro
245 250 255
Gly Phe Val Val Thr Cys Ser Gly Ser Gln His Ile Asp Phe Gln Gly
260 265 270
Ser Leu Asp Pro Ser Ser Ser Ser Met Ala Leu Cys His Lys Ala Ser
275 280 285
Ser Ser Thr Val His Pro Asp Val Leu Met Ser Phe Asp Asn Ser Ser
290 295 300
Thr Ala Arg Phe His Ile Asp Lys Lys Phe Leu Pro Thr Phe Gly Asn
305 310 315 320
Ser Phe Lys Val Ser Glu Leu Asp Gln Val His Gly Lys Ser Gln Thr
325 330 335
Val Val Thr Lys Gly Cys Ile Glu Asn Asn Glu Glu Asp Glu Asn Ala
340 345 350
Phe Ser Tyr His His His Asp Asp Met Thr Ser Ser Trp Ser Pro Gly
355 360 365
Thr His Gln Ala Val Glu Thr Met Phe Leu Thr Val Ser Glu Thr Glu
370 375 380
Glu Ala Gly Met Phe Asp Val His Phe Ala Asn Val Asn Leu Gly Ser
385 390 395 400
Pro Arg Ala Arg Trp Cys Lys Val Lys Ala Ala Phe Lys Val Arg Ala
405 410 415
Ala Phe Lys Glu Val Arg Arg His Thr Thr Ala Arg Asn Pro Arg Glu
420 425 430
Gly Leu Lys Leu Leu Ser Cys Ser Ser Ser Tyr Val Ser Val Arg Met
435 440 445
Lys Glu Glu Lys Gly Thr Ile Arg Cys Cys Ser Glu Cys Lys Thr Thr
450 455 460
Lys Thr Pro Met Trp Arg Gly Gly Pro Thr Gly Pro Lys Ser Leu Cys
465 470 475 480
Asn Ala Cys Gly Ile Arg His Arg Lys Gln Arg Arg Ser Glu Leu Leu
485 490 495
Gly Ile His Ile Ile Arg Ser His Lys Ser Leu Ala Ser Lys Lys Ile
500 505 510
Asn Leu Leu Ser Ser Ser His Gly Gly Val Ala Val Lys Lys Arg Arg
515 520 525
Ser Leu Lys Glu Glu Glu Gln Ala Ala Leu Cys Leu Leu Leu Leu Ser
530 535 540
Cys Ser Ser Val Leu Ala
545 550
<210>151
<211>363
<212>DNA
<213>Arabidopsis thaliana
<400>151
atggatccaa ggaagctact atcttgttca tcctcttacg tgtcagtgag aatgaaagaa 60
gagaagggga caattaggtg ttgcagtgag tgtaagacca ccaagacacc aatgtggaga 120
ggtggaccaa ctggtcctaa gtcactttgc aatgcatgtg gaattagaca cagaaaacag 180
agacgatcag agttattggg tattcatatt attcgcagcc acaaaagctt agcctccaag 240
aagataaacc tattatcatc atcacacggt ggcgtggcgg tgaagaaacg aaggagtcta 300
aaggaggaag aacaagctgc tttgtgtcta ttgttattgt cttgtagctc tgttttggcc 360
gga 363
<210>152
<211>121
<212>PRT
<213>Arabidopsis thaliana
<400>152
Met Asp Pro Arg Lys Leu Leu Ser Cys Ser Ser Ser Tyr Val Ser Val
1 5 10 15
Arg Met Lys Glu Glu Lys Gly Thr Ile Arg Cys Cys Ser Glu Cys Lys
20 25 30
Thr Thr Lys Thr Pro Met Trp Arg Gly Gly Pro Thr Gly Pro Lys Ser
35 40 45
Leu Cys Asn Ala Cys Gly Ile Arg His Arg Lys Gln Arg Arg Ser Glu
50 55 60
Leu Leu Gly Ile His Ile Ile Arg Ser His Lys Ser Leu Ala Ser Lys
65 70 75 80
Lys Ile Asn Leu Leu Ser Ser Ser His Gly Gly Val Ala Val Lys Lys
85 90 95
Arg Arg Ser Leu Lys Glu Glu Glu Gln Ala Ala Leu Cys Leu Leu Leu
100 105 110
Leu Ser Cys Ser Ser Val Leu Ala Gly
115 120
<210>153
<211>363
<212>DNA
<213>Arabidopsis thaliana
<400>153
atggatccaa ggaagctact atcttgttca tcctcttacg tgtcaatgag aatgaaagaa 60
gagaagggga caattaggtg ttgcagtgag tgtaagacca ccaagacacc aatgtggaga 120
ggtggaccaa ctggtcctaa gtcactttgc aatgcatgtg gaattagaca cagaaaacag 180
agacgatcag agttattggg tattcatatt attcgcagcc acaaaagctt agcctccaag 240
aagataaacc tattatcatc atcacacggt ggcgtggcgg tgaagaaacg aaggagtcta 300
aaggaggaag aacaagctgc tttgtgtcta ttgttattgt cttgtagctc tgttttggcc 360
taa 363
<210>154
<211>120
<212>PRT
<213>Arabidopsis thaliana
<400>154
Met Asp Pro Arg Lys Leu Leu Ser Cys Ser Ser Ser Tyr Val Ser Met
1 5 10 15
Arg Met Lys Glu Glu Lys Gly Thr Ile Arg Cys Cys Ser Glu Cys Lys
20 25 30
Thr Thr Lys Thr Pro Met Trp Arg Gly Gly Pro Thr Gly Pro Lys Ser
35 40 45
Leu Cys Asn Ala Cys Gly Ile Arg His Arg Lys Gln Arg Arg Ser Glu
50 55 60
Leu Leu Gly Ile His Ile Ile Arg Ser His Lys Ser Leu Ala Ser Lys
65 70 75 80
Lys Ile Asn Leu Leu Ser Ser Ser His Gly Gly Val Ala Val Lys Lys
85 90 95
Arg Arg Ser Leu Lys Glu Glu Glu Gln Ala Ala Leu Cys Leu Leu Leu
100 105 110
Leu Ser Cys Ser Ser Val Leu Ala
115 120
<210>155
<211>1197
<212>DNA
<213>Arabidopsis thaliana
<400>155
atggattcaa attttcatta ctcgatagat cttaacgaag atcaaaacca tcacgaacaa 60
ccctttttct atcctcttgg atcctcttcc tcgcttcatc atcatcatca tcatcatcat 120
catcaagtcc cttctaattc ttcatcttct tcttcgtcca tttcatcgct ctcctcttac 180
ctccctttct tgatcaactc tcaagaagat caacatgttg cctacaacaa cacttatcac 240
gctgatcatc tccatctttc tcaacccctc aaggccaaga tgtttgtggc taacggtgga 300
tcatcagcat gcgatcacat ggtgccaaag aaggagacaa gactgaaact aacgataagg 360
aaaaaagatc acgaagacca accccatcct cttcatcaaa acccgacaaa acccgattca 420
gactccgaca agtggttgat gtccccaaag atgcggttga tcaagaaaac aatcaccaac 480
aataaacagc tcattgatca gactaataat aataatcata aagaaagtga tcactaccct 540
ttgaatcata agactaattt cgacgaggat caccatgaag atcttaattt caagaacgtc 600
ttgaccagga agaccacggc cgcgaccacc gagaatcgct acaatacaat caacgagaac 660
ggttatagta ataacaatgg cgtgattagg gtttgttcgg attgtaacac caccaagact 720
cctctttggc gaagtggacc tcgaggtccc aagtctcttt gtaacgcatg tggtatacgg 780
caaagaaagg caaggcgagc cgctatggcc gcggccgctg cagccggcga ccaagaggtg 840
gcggtagcgc cccgagtgca acaattaccg ctgaaaaaga agttgcaaaa taaaaaaaag 900
agatcaaacg gaggggaaaa atacaatcac tctcctccaa tggtggccaa ggccaaaaag 960
tgcaagatca aagaggaaga ggagaaggaa atggaagcgg aaacggttgc cggagattca 1020
gagatcagca aatctacaac ttcttctaat tcttcgattt cgtcaaacaa attttgtttc 1080
gatgatttga caataatgtt gagcaaaagc tcagcttatc aacaagtgtt cccacaagat 1140
gagaaggagg ctgctgtttt gctcatggct ctgtcgtatg gaatggttca cggttga 1197
<210>156
<211>398
<212>PRT
<213>Arabidopsis thaliana
<400>156
Met Asp Ser Asn Phe His Tyr Ser Ile Asp Leu Asn Glu Asp Gln Asn
1 5 10 15
His His Glu Gln Pro Phe Phe Tyr Pro Leu Gly Ser Ser Ser Ser Leu
20 25 30
His His His His His His His His His Gln Val Pro Ser Asn Ser Ser
35 40 45
Ser Ser Ser Ser Ser Ile Ser Ser Leu Ser Ser Tyr Leu Pro Phe Leu
50 55 60
Ile Asn Ser Gln Glu Asp Gln His Val Ala Tyr Asn Asn Thr Tyr His
65 70 75 80
Ala Asp His Leu His Leu Ser Gln Pro Leu Lys Ala Lys Met Phe Val
85 90 95
Ala Asn Gly Gly Ser Ser Ala Cys Asp His Met Val Pro Lys Lys Glu
100 105 110
Thr Arg Leu Lys Leu Thr Ile Arg Lys Lys Asp His Glu Asp Gln Pro
115 120 125
His Pro Leu His Gln Asn Pro Thr Lys Pro Asp Ser Asp Ser Asp Lys
130 135 140
Trp Leu Met Ser Pro Lys Met Arg Leu Ile Lys Lys Thr Ile Thr Asn
145 150 155 160
Asn Lys Gln Leu Ile Asp Gln Thr Asn Asn Asn Asn His Lys Glu Ser
165 170 175
Asp His Tyr Pro Leu Asn His Lys Thr Asn Phe Asp Glu Asp His His
180 185 190
Glu Asp Leu Asn Phe Lys Asn Val Leu Thr Arg Lys Thr Thr Ala Ala
195 200 205
Thr Thr Glu Asn Arg Tyr Asn Thr Ile Asn Glu Asn Gly Tyr Ser Asn
210 215 220
Asn Asn Gly Val Ile Arg Val Cys Ser Asp Cys Asn Thr Thr Lys Thr
225 230 235 240
Pro Leu Trp Arg Ser Gly Pro Arg Gly Pro Lys Ser Leu Cys Asn Ala
245 250 255
Cys Gly Ile Arg Gln Arg Lys Ala Arg Arg Ala Ala Met Ala Ala Ala
260 265 270
Ala Ala Ala Gly Asp Gln Glu Val Ala Val Ala Pro Arg Val Gln Gln
275 280 285
Leu Pro Leu Lys Lys Lys Leu Gln Asn Lys Lys Lys Arg Ser Asn Gly
290 295 300
Gly Glu Lys Tyr Asn His Ser Pro Pro Met Val Ala Lys Ala Lys Lys
305 310 315 320
Cys Lys Ile Lys Glu Glu Glu Glu Lys Glu Met Glu Ala Glu Thr Val
325 330 335
Ala Gly Asp Ser Glu Ile Ser Lys Ser Thr Thr Ser Ser Asn Ser Ser
340 345 350
Ile Ser Ser Asn Lys Phe Cys Phe Asp Asp Leu Thr Ile Met Leu Ser
355 360 365
Lys Ser Ser Ala Tyr Gln Gln Val Phe Pro Gln Asp Glu Lys Glu Ala
370 375 380
Ala Val Leu Leu Met Ala Leu Ser Tyr Gly Met Val His Gly
385 390 395
<210>157
<211>1197
<212>DNA
<213>Arabidopsis thaliana
<400>157
atggattcaa attttcatta ctcgatagat cttaacgaag atcaaaacca tcacgaacaa 60
ccctttttct atcctcttgg atcctcttcc tcgcttcatc atcatcatca tcatcatcat 120
catcaagtcc cttctaattc ttcatcttct tcttcgtcca tttcatcgct ctcctcttac 180
ctccctttct tgatcaactc tcaagaagat caacatgttg cctacaacaa cacttatcac 240
gctgatcatc tccatctttc tcaacccctc aaggccaaga tgtttgtggc taacggtgga 300
tcatcagcat gcgatcacat ggtgccaaag aaggagacaa gactgaaact aacgataagg 360
aaaaaagatc acgaagacca accccatcct cttcatcaaa acccgacaaa acccgattca 420
gactccgaca agtggttgat gtccccaaag atgcggttga tcaagaaaac aataaccaac 480
aataaacagc tcattgatca gactaataat aataatcata aagaaagtga tcactaccct 540
ttgaatcata agactaattt cgacgaggat caccatgaag atcttaattt caagaacgtc 600
ttgaccagga agaccacggc cgcgaccacc gagaatcgct acaatacaat caacgagaac 660
ggttatagta ataacaatgg cgtgattagg gtttgttcgg attgtaacac caccaagact 720
cctctttggc gaagtggacc tcgaggtccc aagtctcttt gtaacgcatg tggtatacgg 780
caaagaaagg caaggcgagc cgctatggcc gcggccgctg cagccggcga ccaagaggtg 840
gcggtagcgc cccgagtgca acaattaccg ctgaaaaaga atttgcaaaa taaaaaaaag 900
agatcaaacg gaggggaaaa atacaatcac tctcctccaa tggtggccaa ggccaaaaag 960
tgcaagatca aagaggaaga ggagaaggaa atggaagcgg aaacggttgc cggagattca 1020
gagatcagca aatctacaac ttcttctaat tcttcgattt cgtcaaacaa attttgcttc 1080
gatgatttga caataatgtt gagcaaaagc tcagcttatc aacaagtgtt cccacaagat 1140
gagaaggagg ctgctgtttt gctcatggct ctgtcgtatg gaatggttca cggttga 1197
<210>158
<211>398
<212>PRT
<213>Arabidopsis thaliana
<400>158
Met Asp Ser Asn Phe His Tyr Ser Ile Asp Leu Asn Glu Asp Gln Asn
1 5 10 15
His His Glu Gln Pro Phe Phe Tyr Pro Leu Gly Ser Ser Ser Ser Leu
20 25 30
His His His His His His His His His Gln Val Pro Ser Asn Ser Ser
35 40 45
Ser Ser Ser Ser Ser Ile Ser Ser Leu Ser Ser Tyr Leu Pro Phe Leu
50 55 60
Ile Asn Ser Gln Glu Asp Gln His Val Ala Tyr Asn Asn Thr Tyr His
65 70 75 80
Ala Asp His Leu His Leu Ser Gln Pro Leu Lys Ala Lys Met Phe Val
85 90 95
Ala Asn Gly Gly Ser Ser Ala Cys Asp His Met Val Pro Lys Lys Glu
100 105 110
Thr Arg Leu Lys Leu Thr Ile Arg Lys Lys Asp His Glu Asp Gln Pro
115 120 125
His Pro Leu His Gln Asn Pro Thr Lys Pro Asp Ser Asp Ser Asp Lys
130 135 140
Trp Leu Met Ser Pro Lys Met Arg Leu Ile Lys Lys Thr Ile Thr Asn
145 150 155 160
Asn Lys Gln Leu Ile Asp Gln Thr Asn Asn Asn Asn His Lys Glu Ser
165 170 175
Asp His Tyr Pro Leu Asn His Lys Thr Asn Phe Asp Glu Asp His His
180 185 190
Glu Asp Leu Asn Phe Lys Asn Val Leu Thr Arg Lys Thr Thr Ala Ala
195 200 205
Thr Thr Glu Asn Arg Tyr Asn Thr Ile Asn Glu Asn Gly Tyr Ser Asn
210 215 220
Asn Asn Gly Val Ile Arg Val Cys Ser Asp Cys Asn Thr Thr Lys Thr
225 230 235 240
Pro Leu Trp Arg Ser Gly Pro Arg Gly Pro Lys Ser Leu Cys Asn Ala
245 250 255
Cys Gly Ile Arg Gln Arg Lys Ala Arg Arg Ala Ala Met Ala Ala Ala
260 265 270
Ala Ala Ala Gly Asp Gln Glu Val Ala Val Ala Pro Arg Val Gln Gln
275 280 285
Leu Pro Leu Lys Lys Asn Leu Gln Asn Lys Lys Lys Arg Ser Asn Gly
290 295 300
Gly Glu Lys Tyr Asn His Ser Pro Pro Met Val Ala Lys Ala Lys Lys
305 310 315 320
Cys Lys Ile Lys Glu Glu Glu Glu Lys Glu Met Glu Ala Glu Thr Val
325 330 335
Ala Gly Asp Ser Glu Ile Ser Lys Ser Thr Thr Ser Ser Asn Ser Ser
340 345 350
Ile Ser Ser Asn Lys Phe Cys Phe Asp Asp Leu Thr Ile Met Leu Ser
355 360 365
Lys Ser Ser Ala Tyr Gln Gln Val Phe Pro Gln Asp Glu Lys Glu Ala
370 375 380
Ala Val Leu Leu Met Ala Leu Ser Tyr Gly Met Val His Gly
385 390 395
<210>159
<211>1059
<212>DNA
<213>Arabidopsis thaliana
<400>159
atgggttcca attttcatta cacaatagat ctcaatgaag atcaaaacca tcagcctttt 60
ttcgcttctc ttggatcctc tcttcatcat catctacaac aacaacaaca acaacaacaa 120
cattttcatc accaagcttc ttctaatccc tcttctttga tgtcaccgtc tctttcctac 180
tttcctttct tgataaactc tcgccaagat caagtatatg ttgggtacaa caataacact 240
tttcatgatg ttcttgatac ccatatctcc caacctctcg agaccaagaa ctttgtatct 300
gatggtggtt catcatcaag tgatcaaatg gtgcccaaga aggagacacg actaaaattg 360
acgataaaga agaaagataa tcatcaagac caaaccgatc ttcctcaatc cccaataaaa 420
gacatgacag gaactaactc gctcaagtgg atatcttcga aggtgagatt aatgaagaag 480
aaaaaggcga ttattaccac cagcgacagc agcaaacaac acactaataa cgaccaatcc 540
tcaaacctaa gcaattcgga aagacagaat ggttataaca acgattgcgt gattaggatt 600
tgctccgatt gtaacacaac caagactcct ctttggagaa gtggtccgag aggtcccaag 660
tctctttgta acgcttgtgg aataaggcaa aggaaggcca ggcgggccgc tatggccacg 720
gcaaccgcaa ccgcagtctc tggcgtatcc ccaccggtca tgaagaagaa gatgcaaaac 780
aagaacaaga tatcaaatgg agtttataaa atcttatctc ctttgcccct aaaggtaaac 840
acgtgtaaga gaatgatcac actagaggag accgcattag ccgaggattt ggagacccag 900
agcaactcca cgatgttatc atcttcagac aatatctatt tcgatgatct agcattactg 960
ttgagcaaaa gttcagctta tcagcaagtt ttccctcaag atgagaagga ggctgccatt 1020
ttactaatgg ctctatcgca cggaatggtt cacgggtga 1059
<210>160
<211>352
<212>PRT
<213>Arabidopsis thaliana
<400>160
Met Gly Ser Asn Phe His Tyr Thr Ile Asp Leu Asn Glu Asp Gln Asn
1 5 10 15
His Gln Pro Phe Phe Ala Ser Leu Gly Ser Ser Leu His His His Leu
20 25 30
Gln Gln Gln Gln Gln Gln Gln Gln His Phe His His Gln Ala Ser Ser
35 40 45
Asn Pro Ser Ser Leu Met Ser Pro Ser Leu Ser Tyr Phe Pro Phe Leu
50 55 60
Ile Asn Ser Arg Gln Asp Gln Val Tyr Val Gly Tyr Asn Asn Asn Thr
65 70 75 80
Phe His Asp Val Leu Asp Thr His Ile Ser Gln Pro Leu Glu Thr Lys
85 90 95
Asn Phe Val Ser Asp Gly Gly Ser Ser Ser Ser Asp Gln Met Val Pro
100 105 110
Lys Lys Glu Thr Arg Leu Lys Leu Thr Ile Lys Lys Lys Asp Asn His
115 120 125
Gln Asp Gln Thr Asp Leu Pro Gln Ser Pro Ile Lys Asp Met Thr Gly
130 135 140
Thr Asn Ser Leu Lys Trp Ile Ser Ser Lys Val Arg Leu Met Lys Lys
145 150 155 160
Lys Lys Ala Ile Ile Thr Thr Ser Asp Ser Ser Lys Gln His Thr Asn
165 170 175
Asn Asp Gln Ser Ser Asn Leu Ser Asn Ser Glu Arg Gln Asn Gly Tyr
180 185 190
Asn Asn Asp Cys Val Ile Arg Ile Cys Ser Asp Cys Asn Thr Thr Lys
195 200 205
Thr Pro Leu Trp Arg Ser Gly Pro Arg Gly Pro Lys Ser Leu Cys Asn
210 215 220
Ala Cys Gly Ile Arg Gln Arg Lys Ala Arg Arg Ala Ala Met Ala Thr
225 230 235 240
Ala Thr Ala Thr Ala Val Ser Gly Val Ser Pro Pro Val Met Lys Lys
245 250 255
Lys Met Gln Asn Lys Asn Lys Ile Ser Asn Gly Val Tyr Lys Ile Leu
260 265 270
Ser Pro Leu Pro Leu Lys Val Asn Thr Cys Lys Arg Met Ile Thr Leu
275 280 285
Glu Glu Thr Ala Leu Ala Glu Asp Leu Glu Thr Gln Ser Asn Ser Thr
290 295 300
Met Leu Ser Ser Ser Asp Asn Ile Tyr Phe Asp Asp Leu Ala Leu Leu
305 310 315 320
Leu Ser Lys Ser Ser Ala Tyr Gln Gln Val Phe Pro Gln Asp Glu Lys
325 330 335
Glu Ala Ala Ile Leu Leu Met Ala Leu Ser His Gly Met Val His Gly
340 345 350
<210>161
<211>636
<212>DNA
<213>Vitis vinifera
<400>161
atggcggatg acaataagag cagccacaaa ttrtcggttt ttaagaagga agaaggagat 60
gaaggtaata aaagtactga gaaatggatg tcttcaaaga tgaggctgat gagaaaaatg 120
atgaactcgg attgcactac agcgaaaatc gagcagaagg ttgaagatca tcagcagtgg 180
gacaatatta aygagwtcaa ctcttccaac aatactagta atatcccaat tagagtctgc 240
agtgattgta acacaaccaa aacccctctt tggaggagyg gtcctagagg tcccaagtca 300
ctttgcaatg cctgtggaat taggcaaagg aaggcgagac gagccatggc agcagcggca 360
gcagcagcag cgaatggcac agccgttggg accgagatat cgcctatgaa gatgaagctg 420
cccaacaagg aaaagaagat gcatacaagc aatgtagggc aacagaagaa gctctgcaag 480
cccccttgtc ctcctcccac cgagaagaag ctttgcttcg aagatttcac ttcgagtatt 540
tgcaagaact caggttttag acgagtgttc cctcgggatg aagaagaagc cgcgatcctc 600
ctaatggcct tatcttgtga ccttgtttac agttga 636
<210>162
<211>211
<212>PRT
<213>Vitis vinifera
<220>
<221>misc_feature
<222>(66)..(66)
<223>Xaa can be any naturally occurring amino acid
<400>162
Met Ala Asp Asp Asn Lys Ser Ser His Lys Leu Ser Val Phe Lys Lys
1 5 10 15
Glu Glu Gly Asp Glu Gly Asn Lys Ser Thr Glu Lys Trp Met Ser Ser
20 25 30
Lys Met Arg Leu Met Arg Lys Met Met Asn Ser Asp Cys Thr Thr Ala
35 40 45
Lys Ile Glu Gln Lys Val Glu Asp His Gln Gln Trp Asp Asn Ile Asn
50 55 60
Glu Xaa Asn Ser Ser Asn Asn Thr Ser Asn Ile Pro Ile Arg Val Cys
65 70 75 80
Ser Asp Cys Asn Thr Thr Lys Thr Pro Leu Trp Arg Ser Gly Pro Arg
85 90 95
Gly Pro Lys Ser Leu Cys Asn Ala Cys Gly Ile Arg Gln Arg Lys Ala
100 105 110
Arg Arg Ala Met Ala Ala Ala Ala Ala Ala Ala Ala Asn Gly Thr Ala
115 120 125
Val Gly Thr Glu Ile Ser Pro Met Lys Met Lys Leu Pro Asn Lys Glu
130 135 140
Lys Lys Met His Thr Ser Asn Val Gly Gln Gln Lys Lys Leu Cys Lys
145 150 155 160
Pro Pro Cys Pro Pro Pro Thr Glu Lys Lys Leu Cys Phe Glu Asp Phe
165 170 175
Thr Ser Ser Ile Cys Lys Asn Ser Gly Phe Arg Arg Val Phe Pro Arg
180 185 190
Asp Glu Glu Glu Ala Ala Ile Leu Leu Met Ala Leu Ser Cys Asp Leu
195 200 205
Val Tyr Ser
210
<210>163
<211>546
<212>DNA
<213>Vitis vinifera
<400>163
atgatctcgg atcaaactgg tgcccaaaaa ccaagcaaca ctgcactcaa ttttggagat 60
cacaagcagc aatccttgcc ttctgaaacc gattacaata gcatcaattc ctctaatatc 120
aacagcaaca acacaattag ggtttgtgca gattgtaaca caactaagac ccctctgtgg 180
aggagtggcc caagaggccc taagtctctc tgcaacgcct gcggaatcag gcaaaggaag 240
gctagacggg ccatggctgc tgctgctgca actgctaatg gcacaattct tccaaccaac 300
acagcaccca caaagaccaa ggccaagcac aaagacaaga agtcgagcaa tggtcatgtt 360
tcacactaca agaaacggtg caaactggct gcggccccat cttgtgaaac aaagaagctt 420
tgtttcgagg acttcaccat aagcttgagt aagaattccg ctttccaccg agttttcctc 480
caagacgaga tcaaggaagc ggcgatcctg ctaatggctc tatcttgcgg cctcgtccat 540
ggttga 546
<210>164
<211>181
<212>PRT
<213>Vitis vinifera
<400>164
Met Ile Ser Asp Gln Thr Gly Ala Gln Lys Pro Ser Asn Thr Ala Leu
1 5 10 15
Asn Phe Gly Asp His Lys Gln Gln Ser Leu Pro Ser Glu Thr Asp Tyr
20 25 30
Asn Ser Ile Asn Ser Ser Asn Ile Asn Ser Asn Asn Thr Ile Arg Val
35 40 45
Cys Ala Asp Cys Asn Thr Thr Lys Thr Pro Leu Trp Arg Ser Gly Pro
50 55 60
Arg Gly Pro Lys Ser Leu Cys Asn Ala Cys Gly Ile Arg Gln Arg Lys
65 70 75 80
Ala Arg Arg Ala Met Ala Ala Ala Ala Ala Thr Ala Asn Gly Thr Ile
85 90 95
Leu Pro Thr Asn Thr Ala Pro Thr Lys Thr Lys Ala Lys His Lys Asp
100 105 110
Lys Lys Ser Ser Asn Gly His Val Ser His Tyr Lys Lys Arg Cys Lys
115 120 125
Leu Ala Ala Ala Pro Ser Cys Glu Thr Lys Lys Leu Cys Phe Glu Asp
130 135 140
Phe Thr Ile Ser Leu Ser Lys Asn Ser Ala Phe His Arg Val Phe Leu
145 150 155 160
Gln Asp Glu Ile Lys Glu Ala Ala Ile Leu Leu Met Ala Leu Ser Cys
165 170 175
Gly Leu Val His Gly
180
<210>165
<211>420
<212>DNA
<213>Arabidopsis thaliana
<400>165
atgctagatc acagtgaaaa ggtcttattg gttgattcag aaaccatgaa aacaagagct 60
gaagatatga tcgaacagaa caacactagt gttaacgaca agaagaagac ttgtgctgat 120
tgtggaacca gtaaaactcc tctttggcgt ggtggtcctg ttggtccaaa gtcgttgtgt 180
aacgcgtgtg ggatcagaaa cagaaagaag agaagaggag gaacagaaga taataagaaa 240
ttaaagaaat cgagttctgg cggcggaaac cgtaaatttg gtgaatcgtt aaaacagagt 300
ttgatggatt tggggataag gaagagatca acggtggaga agcaacgaca gaagcttggt 360
gaagaagaac aagccgctgt gttactcatg gctctttctt atggctctgt ttacgcttag 420
<210>166
<211>139
<212>PRT
<213>Arabidopsis thaliana
<400>166
Met Leu Asp His Ser Glu Lys Val Leu Leu Val Asp Ser Glu Thr Met
1 5 10 15
Lys Thr Arg Ala Glu Asp Met Ile Glu Gln Asn Asn Thr Ser Val Asn
20 25 30
Asp Lys Lys Lys Thr Cys Ala Asp Cys Gly Thr Ser Lys Thr Pro Leu
35 40 45
Trp Arg Gly Gly Pro Val Gly Pro Lys Ser Leu Cys Asn Ala Cys Gly
50 55 60
Ile Arg Asn Arg Lys Lys Arg Arg Gly Gly Thr Glu Asp Asn Lys Lys
65 70 75 80
Leu Lys Lys Ser Ser Ser Gly Gly Gly Asn Arg Lys Phe Gly Glu Ser
85 90 95
Leu Lys Gln Ser Leu Met Asp Leu Gly Ile Arg Lys Arg Ser Thr Val
100 105 110
Glu Lys Gln Arg Gln Lys Leu Gly Glu Glu Glu Gln Ala Ala Val Leu
115 120 125
Leu Met Ala Leu Ser Tyr Gly Ser Val Tyr Ala
130 135
<210>167
<211>363
<212>DNA
<213>Vitis vinifera
<400>167
atgaacaaca aaaacccaga tgctgtttcg tcggctgaga gccaggttaa cgagccgaag 60
aagacctgcg ctgattgtgg caccaccaaa acccctctct ggagaggcgg tccagctggg 120
cctaagtctc tgtgcaatgc atgtggtatc agaagcagga agaagagaag agccttcctg 180
ggaagcagca accactccca taacaatggt ggcggcaacg ggaacaataa attgggggac 240
tcgctgaaga ggaggctctt cgcattggga agagaggtgt tgttgcagag atcaacactg 300
ggagaagaag agcaagcggc cgtactgtta atggctctgt cttacggcta cgtctatgct 360
taa 363
<210>168
<211>120
<212>PRT
<213>Vitis vinifera
<400>168
Met Asn Asn Lys Asn Pro Asp Ala Val Ser Ser Ala Glu Ser Gln Val
1 5 10 15
Asn Glu Pro Lys Lys Thr Cys Ala Asp Cys Gly Thr Thr Lys Thr Pro
20 25 30
Leu Trp Arg Gly Gly Pro Ala Gly Pro Lys Ser Leu Cys Asn Ala Cys
35 40 45
Gly Ile Arg Ser Arg Lys Lys Arg Arg Ala Phe Leu Gly Ser Ser Asn
50 55 60
His Ser His Asn Asn Gly Gly Gly Asn Gly Asn Asn Lys Leu Gly Asp
65 70 75 80
Ser Leu Lys Arg Arg Leu Phe Ala Leu Gly Arg Glu Val Leu Leu Gln
85 90 95
Arg Ser Thr Leu Gly Glu Glu Glu Gln Ala Ala Val Leu Leu Met Ala
100 105 110
Leu Ser Tyr Gly Tyr Val Tyr Ala
115 120
<210>169
<211>378
<212>DNA
<213>Vitis vinifera
<400>169
atgatggatc tgagcaaaaa ggaatcattg tctgaggaaa tgaatgagat caagaaatgt 60
tgtactgatt gcaagaccac caagacgccc ctgtggagag gtgggccagc tgggcctaag 120
tcactctgca atgcatgtgg gatcagatac aggaagagga ggagttccat ggtgggtgtg 180
aacaaaaaga aagagagaat gaacagtggt agccatgatt tgagtgaaac tttgaagcag 240
tcactcatgg ctttggggaa tgaggtgatg atgcagaggc agagatcttc agtgaagaaa 300
cagaggagga agttggggga agaagaacaa gcagctgtac tgttgatggc actctcatgt 360
ggctctgttt ttgcctag 378
<210>170
<211>125
<212>PRT
<213>Vitis vinifera
<400>170
Met Met Asp Leu Ser Lys Lys Glu Ser Leu Ser Glu Glu Met Asn Glu
1 5 10 15
Ile Lys Lys Cys Cys Thr Asp Cys Lys Thr Thr Lys Thr Pro Leu Trp
20 25 30
Arg Gly Gly Pro Ala Gly Pro Lys Ser Leu Cys Asn Ala Cys Gly Ile
35 40 45
Arg Tyr Arg Lys Arg Arg Ser Ser Met Val Gly Val Asn Lys Lys Lys
50 55 60
Glu Arg Met Asn Ser Gly Ser His Asp Leu Ser Glu Thr Leu Lys Gln
65 70 75 80
Ser Leu Met Ala Leu Gly Asn Glu Val Met Met Gln Arg Gln Arg Ser
85 90 95
Ser Val Lys Lys Gln Arg Arg Lys Leu Gly Glu Glu Glu Gln Ala Ala
100 105 110
Val Leu Leu Met Ala Leu Ser Cys Gly Ser Val Phe Ala
115 120 125
<210>171
<211>411
<212>DNA
<213>Oryza sativa
<400>171
atggattcct cgtcggtcga gaaggggagt gggtcgatag atccggacga gcgcacggcc 60
tccggcgagc ccaaggcgtg caccgactgc cacaccacca agactccgct ctggcgcggc 120
ggcccctccg gccccaagtc gctatgcaac gcgtgcggga tccggtaccg gaagaagaga 180
cgggaggcgc tggggctcga cgccggcgag ggcggcgcgg agcggcagga gaagaagaag 240
agcaagaggg agagagggga ggaggtgacc atggagctcc gcatggtggg gttcgggaag 300
gaggtggtcc tgaagcagcg gcggcggatg cggcggagga gacgcctcgg cgaggaggag 360
aaggcggcca tcctcctcat ggccctctcc tccggagtca tctacgcctg a 411
<210>172
<211>136
<212>PRT
<213>Oryza sativa
<400>172
Met Asp Ser Ser Ser Val Glu Lys Gly Ser Gly Ser Ile Asp Pro Asp
1 5 10 15
Glu Arg Thr Ala Ser Gly Glu Pro Lys Ala Cys Thr Asp Cys His Thr
20 25 30
Thr Lys Thr Pro Leu Trp Arg Gly Gly Pro Ser Gly Pro Lys Ser Leu
35 40 45
Cys Asn Ala Cys Gly Ile Arg Tyr Arg Lys Lys Arg Arg Glu Ala Leu
50 55 60
Gly Leu Asp Ala Gly Glu Gly Gly Ala Glu Arg Gln Glu Lys Lys Lys
65 70 75 80
Ser Lys Arg Glu Arg Gly Glu Glu Val Thr Met Glu Leu Arg Met Val
85 90 95
Gly Phe Gly Lys Glu Val Val Leu Lys Gln Arg Arg Arg Met Arg Arg
100 105 110
Arg Arg Arg Leu Gly Glu Glu Glu Lys Ala Ala Ile Leu Leu Met Ala
115 120 125
Leu Ser Ser Gly Val Ile Tyr Ala
130 135
<210>173
<211>429
<212>DNA
<213>Oryza sativa
<400>173
atggatatgg attcttcctc ctccccggtg gacaaggtgg atcccgacga gtgcaacggc 60
tctaaggctt gcgctgactg ccacactacc aagactccgc tatggcgagg cggacccgga 120
ggacccaagt cgctgtgcaa cgcatgcggg atccggtatc ggaagaggcg gcgggcggcg 180
ctcggcctgg actcttccgc caccgccacc gccaccgacg gagcggagca gcagaagaag 240
actaaggcca agaaggaaaa ggcacaggag gaggaggtca ccatggagct ccacacggtg 300
ggcttccgca gcaaggacgc tgctgtgttc aagcagcgcc ggcggatgcg ccgcaggaaa 360
tgccttggcg aggaggagag ggccgccatc ctgctcatgg cactctcctc gggcgtcatc 420
tacgcctga 429
<210>174
<211>142
<212>PRT
<213>Oryza sativa
<400>174
Met Asp Met Asp Ser Ser Ser Ser Pro Val Asp Lys Val Asp Pro Asp
1 5 10 15
Glu Cys Asn Gly Ser Lys Ala Cys Ala Asp Cys His Thr Thr Lys Thr
20 25 30
Pro Leu Trp Arg Gly Gly Pro Gly Gly Pro Lys Ser Leu Cys Asn Ala
35 40 45
Cys Gly Ile Arg Tyr Arg Lys Arg Arg Arg Ala Ala Leu Gly Leu Asp
50 55 60
Ser Ser Ala Thr Ala Thr Ala Thr Asp Gly Ala Glu Gln Gln Lys Lys
65 70 75 80
Thr Lys Ala Lys Lys Glu Lys Ala Gln Glu Glu Glu Val Thr Met Glu
85 90 95
Leu His Thr Val Gly Phe Arg Ser Lys Asp Ala Ala Val Phe Lys Gln
100 105 110
Arg Arg Arg Met Arg Arg Arg Lys Cys Leu Gly Glu Glu Glu Arg Ala
115 120 125
Ala Ile Leu Leu Met Ala Leu Ser Ser Gly Val Ile Tyr Ala
130 135 140
<210>175
<211>885
<212>DNA
<213>Arabidopsis thaliana
<400>175
atgcagactc cgtacactac ttcaacgcag gggcaatatt gtcattcttg tggaatgttc 60
caccaccata gccaaagctg ctgctacaac aacaacaaca actccaacgc cggttcttac 120
tcgatggtct tctccatgca aaacggtggc gttttcgagc agaacggtga ggactatcat 180
cactcttcct ccctcgttga ctgcactctc tctcttggaa ctccttctac gaggctttgt 240
gaggaagatg agaaacgtag acgctctact tcatctggtg cttcttcttg catctccaac 300
ttttgggact tgattcacac caaaaacaac aactccaaaa cggcaccgta caataacgtt 360
ccttctttct ccgctaacaa gccaagtcgc ggttgttccg gtggtggtga tggcggagga 420
ggcggtggcg gaggtgactc tcttctcgct agacgctgtg ccaactgtga cactacttct 480
actccactat ggaggaatgg tcctagaggc cctaagtccc tatgcaacgc atgcggcatt 540
cgtttcaaga aggaagagag aagaactact gcggcttcag gaaacaccgt cgtcggagct 600
gcaccggttc aaaccgacca gtacgggcat cacaactctg gctacaataa ttaccatgct 660
gccactaata acaacaataa taatggtact ccgtgggctc atcaccactc gacgcagagg 720
gttccgtgta attatccggc aaatgagatc aggttcatgg atgattacgg cagtggagta 780
gcaaacaacg ttgaatccga cggtgctcac ggcggtgttc cgttcctttc ttggaggctt 840
aatgtagcgg atagggcaag tcttgtccat gactttacca gatga 885
<210>176
<211>295
<212>PRT
<213>Arabidopsis thaliana
<400>176
Met Met Gln Thr Pro Tyr Thr Thr Ser Thr Gln Gly Gln Tyr Cys His
1 5 10 15
Ser Cys Gly Met Phe His His His Ser Gln Ser Cys Cys Tyr Asn Asn
20 25 30
Asn Asn Asn Ser Asn Ala Gly Ser Tyr Ser Met Val Phe Ser Met Gln
35 40 45
Asn Gly Gly Val Phe Glu Gln Asn Gly Glu Asp Tyr His His Ser Ser
50 55 60
Ser Leu Val Asp Cys Thr Leu Ser Leu Gly Thr Pro Ser Thr Arg Leu
65 70 75 80
Cys Glu Glu Asp Glu Lys Arg Arg Arg Ser Thr Ser Ser Gly Ala Ser
85 90 95
Ser Cys Ile Ser Asn Phe Trp Asp Leu Ile His Thr Lys Asn Asn Asn
100 105 110
Ser Lys Thr Ala Pro Tyr Asn Asn Val Pro Ser Phe Ser Ala Asn Lys
115 120 125
Pro Ser Arg Gly Cys Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly
130 135 140
Gly Gly Asp Ser Leu Leu Ala Arg Arg Cys Ala Asn Cys Asp Thr Thr
145 150 155 160
Ser Thr Pro Leu Trp Arg Asn Gly Pro Arg Gly Pro Lys Ser Leu Cys
165 170 175
Asn Ala Cys Gly Ile Arg Phe Lys Lys Glu Glu Arg Arg Thr Thr Ala
180 185 190
Ala Thr Gly Asn Thr Val Val Gly Ala Ala Pro Val Gln Thr Asp Gln
195 200 205
Tyr Gly His His Asn Ser Gly Tyr Asn Asn Tyr His Ala Ala Thr Asn
210 215 220
Asn Asn Asn Asn Asn Gly Thr Pro Trp Ala His His His Ser Thr Gln
225 230 235 240
Arg Val Pro Cys Asn Tyr Pro Ala Asn Glu Ile Arg Phe Met Asp Asp
245 250 255
Tyr Gly Ser Gly Val Ala Asn Asn Val Glu Ser Asp Gly Ala His Gly
260 265 270
Gly Val Pro Phe Leu Ser Trp Arg Leu Asn Val Ala Asp Arg Ala Ser
275 280 285
Leu Val His Asp Phe Thr Arg
290 295
<210>177
<211>840
<212>DNA
<213>Oryza sativa
<400>177
atgcttcacc attactacag cggcggcgcc ggccatcatc aggacgtcgc tgcagctggt 60
agccccggcg acatggcttc ctccaccttc tcgctcttct tcccgatgtc caatgggcag 120
tgttggccgc cgtcgacggt ggaggagtcc gcggcctacg acgaccacag caccgtcacc 180
acctctcctt cctcgccttc gtcgtcctcc accggctccg tcgactgcac gctctcgctc 240
ggcacgccgt cgtctcgccg cgccgagccc gtcgcggcgg cggcgccagc ggcaaaccat 300
ggggcgcccg tgccggcgca ttatccgtcg ctgtcagcgg cgaccgtgtc ctgggacgcg 360
actgccgagt cgtactattg tggccagcag gggaggccgg ccaccggcgc cgccaagtgc 420
gccgccggcg ccgggcacga cgcgctcctc gaccgccgct gcgccaactg cggcaccgcg 480
tccacgccgc tctggaggaa cggccctcgc ggacccaagt cgctgtgcaa cgcgtgcggg 540
atcaggtaca agaaggagga gcggcgcgcg gcggcgacga cgacgacggc cgacggcgcc 600
gccggatgcg gcttcatcac cgcgcagcgt ggacgcgggt cgaccgcggc caaggcggcg 660
cccgccgtga cgacgtgcgg cgaggagacg tcaccgtacg tcgtcggcgg cggcggcggc 720
ggcggcgagg tcgcggacgc ggcgtatctc gcctggcggc tcaacgtcgt cccaccggcg 780
gcgacggcga cggcgttctc ggtgtggccg gagcgagcta gcctctacca ctacaactag 840
<210>178
<211>279
<212>PRT
<213>Oryza sativa
<400>178
Met Leu His His Tyr Tyr Ser Gly Gly Ala Gly His His Gln Asp Val
1 5 10 15
Ala Ala Ala Gly Ser Pro Gly Asp Met Ala Ser Ser Thr Phe Ser Leu
20 25 30
Phe Phe Pro Met Ser Asn Gly Gln Cys Trp Pro Pro Ser Thr Val Glu
35 40 45
Glu Ser Ala Ala Tyr Asp Asp His Ser Thr Val Thr Thr Ser Pro Ser
50 55 60
Ser Pro Ser Ser Ser Ser Thr Gly Ser Val Asp Cys Thr Leu Ser Leu
65 70 75 80
Gly Thr Pro Ser Ser Arg Arg Ala Glu Pro Val Ala Ala Ala Ala Pro
85 90 95
Ala Ala Asn His Gly Ala Pro Val Pro Ala His Tyr Pro Ser Leu Ser
100 105 110
Ala Ala Thr Val Ser Trp Asp Ala Thr Ala Glu Ser Tyr Tyr Cys Gly
115 120 125
Gln Gln Gly Arg Pro Ala Thr Gly Ala Ala Lys Cys Ala Ala Gly Ala
130 135 140
Gly His Asp Ala Leu Leu Asp Arg Arg Cys Ala Asn Cys Gly Thr Ala
145 150 155 160
Ser Thr Pro Leu Trp Arg Asn Gly Pro Arg Gly Pro Lys Ser Leu Cys
165 170 175
Asn Ala Cys Gly Ile Arg Tyr Lys Lys Glu Glu Arg Arg Ala Ala Ala
180 185 190
Thr Thr Thr Thr Ala Asp Gly Ala Ala Gly Cys Gly Phe Ile Thr Ala
195 200 205
Gln Arg Gly Arg Gly Ser Thr Ala Ala Lys Ala Ala Pro Ala Val Thr
210 215 220
Thr Cys Gly Glu Glu Thr Ser Pro Tyr Val Val Gly Gly Gly Gly Gly
225 230 235 240
Gly Gly Glu Val Ala Asp Ala Ala Tyr Leu Ala Trp Arg Leu Asn Val
245 250 255
Val Pro Pro Ala Ala Thr Ala Thr Ala Phe Ser Val Trp Pro Glu Arg
260 265 270
Ala Ser Leu Tyr His Tyr Asn
275
<210>179
<211>56
<212>DNA
<213>Artificial sequence
<220>
<223>Primer: prm10106
<400>179
ggggacaagt ttgtacaaaa aagcaggctt aaacaatgct tcaccattac tacagc 56
<210>180
<211>48
<212>DNA
<213>Artificial sequence
<220>
<223>Primer: prm10107
<400>180
ggggaccact ttgtacaaga aagctgggtc caacgctaat gctacact 48
<210>181
<211>1647
<212>DNA
<213>Arabidopsis thaliana
<400>181
atgggtcgtt cgaaactagc ttctcgtcct gctgaggaag acctgaatcc aggaaaatca 60
aaaaggaaaa agatatcatt gggtcctgag aatgctgcgg catcaatttc caccggaatt 120
gaagctggga atgagaggaa acctggcctt tactgttgta actattgcga taaagatctg 180
tctggtttgg ttcgtttcaa atgtgctgtt tgtatggact ttgatctttg tgtggaatgc 240
ttttctgtcg gcgttgaact taatcgtcac aagaacagtc acccatatcg tgttatggac 300
aatttgtctt tttcgcttgt tacttctgat tggaatgccg atgaagagat actccttctt 360
gaggccattg cgacatacgg gtttggcaat tggaaagaag ttgcagacca tgttggtagt 420
aagacaacga cagaatgtat taaacacttc aattctgctt acatgcagtc accatgcttt 480
ccacttccgg acttgtccca tactattgga aagagcaaag atgagctgct tgctatgagt 540
aaagatagtg cagtcaaaac agaaatacct gcatttgtga ggctatctcc aaaagaagag 600
ttacctgtgt cagctgaaat caaacacgaa gcttcaggga aggtcaatga aatagatcca 660
cctttgtctg ccttagctgg agtcaagaag aaaggcaatg taccgcaggc taaggacatc 720
atcaagttgg aagctgcaaa acaacaatct gacaggagtg tcggggagaa gaaactcaga 780
cttcctggag agaaagttcc attagtaaca gagttatatg gttacaatct aaagagggaa 840
gaatttgaga tcgaacatga caacgatgct gagcaactgc ttgctgacat ggaatttaag 900
gattctgaca cagatgctga gcgtgagcag aaactgcagg ttcttcgtat ttactcgaaa 960
aggcttgatg agaggaagcg gaggaaggaa tttgttctgg aaagaaactt gttgtaccct 1020
gatcaatatg agatgagcct ttcagcagag gagagaaaaa tatataaaag ctgtaaagtg 1080
tttgcgcggt tccaatccaa agaagagcac aaggaactga ttaagaaagt cattgaagag 1140
caccaaattc tcagaagaat cgaggatctt caggaagcta gaactgctgg ttgcaggaca 1200
acttcagacg caaatagatt tatagaagag aagagaaaga aggaagctga agaaagtatg 1260
ctgctgcggc ttaaccacgg tgcaccaggc agtatagccg gtaaaacact aaaaagtcca 1320
agagggttac ccagaaattt gcatcccttt ggttctgact cactgccaaa ggtcacacct 1380
ccaagaatat acagcggttt ggacacttgg gatgttgatg gtctccttgg agctgactta 1440
ctctcagaga ccgaaaagaa gatgtgcaat gagaccagaa tactgcctgt acactatttg 1500
aagatgttgg atatcttaac aagagaaata aagaaggggc agataaagaa aaagtctgat 1560
gcttatagct tcttcaaagt agagccgagt aaagtagaca gagtatatga tatgctggtt 1620
cataagggaa taggtgactc aacatga 1647
<210>182
<211>548
<212>PRT
<213>Arabidopsis thaliana
<400>182
Met Gly Arg Ser Lys Leu Ala Ser Arg Pro Ala Glu Glu Asp Leu Asn
1 5 10 15
Pro Gly Lys Ser Lys Arg Lys Lys Ile Ser Leu Gly Pro Glu Asn Ala
20 25 30
Ala Ala Ser Ile Ser Thr Gly Ile Glu Ala Gly Asn Glu Arg Lys Pro
35 40 45
Gly Leu Tyr Cys Cys Asn Tyr Cys Asp Lys Asp Leu Ser Gly Leu Val
50 55 60
Arg Phe Lys Cys Ala Val Cys Met Asp Phe Asp Leu Cys Val Glu Cys
65 70 75 80
Phe Ser Val Gly Val Glu Leu Asn Arg His Lys Asn Ser His Pro Tyr
85 90 95
Arg Val Met Asp Asn Leu Ser Phe Ser Leu Val Thr Ser Asp Trp Asn
100 105 110
Ala Asp Glu Glu Ile Leu Leu Leu Glu Ala Ile Ala Thr Tyr Gly Phe
115 120 125
Gly Asn Trp Lys Glu Val Ala Asp His Val Gly Ser Lys Thr Thr Thr
130 135 140
Glu Cys Ile Lys His Phe Asn Ser Ala Tyr Met Gln Ser Pro Cys Phe
145 150 155 160
Pro Leu Pro Asp Leu Ser His Thr Ile Gly Lys Ser Lys Asp Glu Leu
165 170 175
Leu Ala Met Ser Lys Asp Ser Ala Val Lys Thr Glu Ile Pro Ala Phe
180 185 190
Val Arg Leu Ser Pro Lys Glu Glu Leu Pro Val Ser Ala Glu Ile Lys
195 200 205
His Glu Ala Ser Gly Lys Val Asn Glu Ile Asp Pro Pro Leu Ser Ala
210 215 220
Leu Ala Gly Val Lys Lys Lys Gly Asn Val Pro Gln Ala Lys Asp Ile
225 230 235 240
Ile Lys Leu Glu Ala Ala Lys Gln Gln Ser Asp Arg Ser Val Gly Glu
245 250 255
Lys Lys Leu Arg Leu Pro Gly Glu Lys Val Pro Leu Val Thr Glu Leu
260 265 270
Tyr Gly Tyr Asn Leu Lys Arg Glu Glu Phe Glu Ile Glu His Asp Asn
275 280 285
Asp Ala Glu Gln Leu Leu Ala Asp Met Glu Phe Lys Asp Ser Asp Thr
290 295 300
Asp Ala Glu Arg Glu Gln Lys Leu Gln Val Leu Arg Ile Tyr Ser Lys
305 310 315 320
Arg Leu Asp Glu Arg Lys Arg Arg Lys Glu Phe Val Leu Glu Arg Asn
325 330 335
Leu Leu Tyr Pro Asp Gln Tyr Glu Met Ser Leu Ser Ala Glu Glu Arg
340 345 350
Lys Ile Tyr Lys Ser Cys Lys Val Phe Ala Arg Phe Gln Ser Lys Glu
355 360 365
Glu His Lys Glu Leu Ile Lys Lys Val Ile Glu Glu His Gln Ile Leu
370 375 380
Arg Arg Ile Glu Asp Leu Gln Glu Ala Arg Thr Ala Gly Cys Arg Thr
385 390 395 400
Thr Ser Asp Ala Asn Arg Phe Ile Glu Glu Lys Arg Lys Lys Glu Ala
405 410 415
Glu Glu Ser Met Leu Leu Arg Leu Asn His Gly Ala Pro Gly Ser Ile
420 425 430
Ala Gly Lys Thr Leu Lys Ser Pro Arg Gly Leu Pro Arg Asn Leu His
435 440 445
Pro Phe Gly Ser Asp Ser Leu Pro Lys Val Thr Pro Pro Arg Ile Tyr
450 455 460
Ser Gly Leu Asp Thr Trp Asp Val Asp Gly Leu Leu Gly Ala Asp Leu
465 470 475 480
Leu Ser Glu Thr Glu Lys Lys Met Cys Asn Glu Thr Arg Ile Leu Pro
485 490 495
Val His Tyr Leu Lys Met Leu Asp Ile Leu Thr Arg Glu Ile Lys Lys
500 505 510
Gly Gln Ile Lys Lys Lys Ser Asp Ala Tyr Ser Phe Phe Lys Val Glu
515 520 525
Pro Ser Lys Val Asp Arg Val Tyr Asp Met Leu Val His Lys Gly Ile
530 535 540
Gly Asp Ser Thr
545
<210>183
<211>1464
<212>DNA
<213>Arabidopsis thaliana
<400>183
atgggtcgct ctcgagggaa cttccaaaat ttcgaagacc ctactcagag aacgaggaaa 60
aagaaaaatg cggctaatgt ggagaacttt gagtctactt ctttggtacc aggtgctgag 120
ggaggaggga agtataactg cgattattgc cagaaagaca ttactggaaa aattaggata 180
aagtgtgctg tctgtccaga ttttgatctc tgtatagaat gtatgtctgt tggagcagag 240
atcactcctc acaaatgtga tcacccatac cgagttatgg gaaatctaac tttcccgctt 300
atttgtcctg actggagtgc ggatgatgaa atgcttctcc tggagggact tgaaatttat 360
ggcttgggaa actgggcaga ggttgcggag cacgtgggaa cgaagagtaa agaacagtgt 420
cttgagcact acagaaacat ctatttgaac tcaccatttt tcccacttcc ggatatgtca 480
catgtagcag ggaagaacag aaaagaactt caagccatgg ctaaaggacg cattgatgac 540
aagaaagcag agcagaacat gaaagaagag tacccgttct ctcctcctaa agtcaaagtt 600
gaagacacac aaaaagagtc ttttgtagac agaagttttg gagggaagaa acctgtttcc 660
acctcggtaa acaactcttt ggttgagctg agtaattaca accagaaaag agaagagttt 720
gaccctgaat atgacaatga tgctgagcaa ctcttggcgg agatggagtt caaagagaac 780
gatactcctg aagaacatga actgaagctg cgtgtgttgc gtatctattc aaaaaggctt 840
gatgagagga aacgtagaaa agaattcata atagaaagaa acctgttgta cccaaatccc 900
tttgagaagg acctgtctca ggaggagaaa gtacaatgcc gacgtttgga cgtttttatg 960
cgttttcatt caaaagagga gcacgacgag ctactccgta atgttgtaag cgagtaccgc 1020
atggtgaaac ggctcaaaga tctcaaggaa gctcaagtgg cagggtgtcg ttcaacggct 1080
gaagcagaga ggtatctggg aaggaagagg aagagagaaa acgaagaagg gatgaacaga 1140
gggaaagaga gcggtcaatt tggtcaaatt gcaggggaga tgggctctag accacctgtg 1200
caagcttctt caagctatgt gaatgatttg gacttgattg ggttcacgga gtcgcaactg 1260
ctgtctgaat ccgagaagcg tctctgcagc gaggtcaagt tggttccacc ggtttatcta 1320
cagatgcaac aagtgatgtc acatgagata ttcaaaggga atgtaacgaa gaagtcggat 1380
gcatatagcc ttttcaagat tgatccaacc aaagtggatc gagtttatga tatgcttgtg 1440
aagaagggta ttgctcaact ttaa 1464
<210>184
<211>487
<212>PRT
<213>Arabidopsis thaliana
<400>184
Met Gly Arg Ser Arg Gly Asn Phe Gln Asn Phe Glu Asp Pro Thr Gln
1 5 10 15
Arg Thr Arg Lys Lys Lys Asn Ala Ala Asn Val Glu Asn Phe Glu Ser
20 25 30
Thr Ser Leu Val Pro Gly Ala Glu Gly Gly Gly Lys Tyr Asn Cys Asp
35 40 45
Tyr Cys Gln Lys Asp Ile Thr Gly Lys Ile Arg Ile Lys Cys Ala Val
50 55 60
Cys Pro Asp Phe Asp Leu Cys Ile Glu Cys Met Ser Val Gly Ala Glu
65 70 75 80
Ile Thr Pro His Lys Cys Asp His Pro Tyr Arg Val Met Gly Asn Leu
85 90 95
Thr Phe Pro Leu Ile Cys Pro Asp Trp Ser Ala Asp Asp Glu Met Leu
100 105 110
Leu Leu Glu Gly Leu Glu Ile Tyr Gly Leu Gly Asn Trp Ala Glu Val
115 120 125
Ala Glu His Val Gly Thr Lys Ser Lys Glu Gln Cys Leu Glu His Tyr
130 135 140
Arg Asn Ile Tyr Leu Asn Ser Pro Phe Phe Pro Leu Pro Asp Met Ser
145 150 155 160
His Val Ala Gly Lys Asn Arg Lys Glu Leu Gln Ala Met Ala Lys Gly
165 170 175
Arg Ile Asp Asp Lys Lys Ala Glu Gln Asn Met Lys Glu Glu Tyr Pro
180 185 190
Phe Ser Pro Pro Lys Val Lys Val Glu Asp Thr Gln Lys Glu Ser Phe
195 200 205
Val Asp Arg Ser Phe Gly Gly Lys Lys Pro Val Ser Thr Ser Val Asn
210 215 220
Asn Ser Leu Val Glu Leu Ser Asn Tyr Asn Gln Lys Arg Glu Glu Phe
225 230 235 240
Asp Pro Glu Tyr Asp Asn Asp Ala Glu Gln Leu Leu Ala Glu Met Glu
245 250 255
Phe Lys Glu Asn Asp Thr Pro Glu Glu His Glu Leu Lys Leu Arg Val
260 265 270
Leu Arg Ile Tyr Ser Lys Arg Leu Asp Glu Arg Lys Arg Arg Lys Glu
275 280 285
Phe Ile Ile Glu Arg Asn Leu Leu Tyr Pro Asn Pro Phe Glu Lys Asp
290 295 300
Leu Ser Gln Glu Glu Lys Val Gln Cys Arg Arg Leu Asp Val Phe Met
305 310 315 320
Arg Phe His Ser Lys Glu Glu His Asp Glu Leu Leu Arg Asn Val Val
325 330 335
Ser Glu Tyr Arg Met Val Lys Arg Leu Lys Asp Leu Lys Glu Ala Gln
340 345 350
Val Ala Gly Cys Arg Ser Thr Ala Glu Ala Glu Arg Tyr Leu Gly Arg
355 360 365
Lys Arg Lys Arg Glu Asn Glu Glu Gly Met Asn Arg Gly Lys Glu Ser
370 375 380
Gly Gln Phe Gly Gln Ile Ala Gly Glu Met Gly Ser Arg Pro Pro Val
385 390 395 400
Gln Ala Ser Ser Ser Tyr Val Asn Asp Leu Asp Leu Ile Gly Phe Thr
405 410 415
Glu Ser Gln Leu Leu Ser Glu Ser Glu Lys Arg Leu Cys Ser Glu Val
420 425 430
Lys Leu Val Pro Pro Val Tyr Leu Gln Met Gln Gln Val Met Ser His
435 440 445
Glu Ile Phe Lys Gly Asn Val Thr Lys Lys Ser Asp Ala Tyr Ser Leu
450 455 460
Phe Lys Ile Asp Pro Thr Lys Val Asp Arg Val Tyr Asp Met Leu Val
465 470 475 480
Lys Lys Gly Ile Ala Gln Leu
485
<210>185
<211>2745
<212>DNA
<213>Dictyostelium discoideum
<400>185
atgacttcaa caataaataa agaagaacct acaacacttg taaataaaaa aaggagaaaa 60
gaacatgttg atgatgacga tgataatgat gatgacattg aaatgcaaaa tgtatcaaat 120
gataatatta ataatacaga tgatgaaaat aataatgtaa atacaaatgg aaataataca 180
aacaaaacca acaacaacaa caacaacaac aacaacaaca acaacaataa taatgaagaa 240
gatgacgatg aagaagactt attaattacc aaaagaagga atagtagatc aacaacaatg 300
agtaataata gtagtaataa atcaacacca tctaaaaaga aaagaattaa gaaatcatat 360
gataatgata aagattttgt aggagatgat gaagaggaag atgataatga tgatgaagat 420
gatgatggtg atgttgtaat ctcaaataat aataataata ataataataa taatactaat 480
aataataata ataataataa tggtaatggt aatggtaata acaatacaaa caattcattc 540
gaagaagatg atgatgaaga ggatgatgat gaagaagatg aagaagaaga agaagaaggt 600
atgaaaatat caaaaagtaa aagacaaaca caaataatac caccaacaca aagtgaagtt 660
gatgatcttg taaataaaag ttttaatagt gtagatgatg atgaagaaga taatgaagag 720
gatgataaaa caaaaaatac tactactact actgctacaa atactactaa tgtaccaaca 780
acaacaacaa caactacaac aacatcgaac aatactacat taccaactac taccactact 840
accaccacta ataatacaac aaaaccaaat gttaaaatag aacaaccaaa aacacaacaa 900
ccaccacaac cacaaccaat aaataaacca actcaatcaa atactatcac aacaaataca 960
aatacaagta caaatacaaa cacagctaca aatacaaatt caaatacagc tacagccaca 1020
acaacatcac ctacctcaat gacaaaacaa acaaggaagt caaccaattc accaactaac 1080
agtaatggta ataataataa taataataat aataataatt ttatagagga acaagtgaat 1140
gaaggattat atcattgtga ttattgtcaa aaggatatta gtggtgttgt acgtataaga 1200
tgttcagtat gtactgattt cgatttatgt ttagaatgtt ttagtgttgg tgtagagatt 1260
acaccacatc gtaattttca tgattatcat gtagttgata atatgcattt cccaatgttt 1320
actgatgatt ggggtgctga tgaagaatta ttacttttag aagcaattga attatatggt 1380
ttaggtaatt ggaatgaagt ttcagagaat gtaggcgcac attcaaaatc acctttagaa 1440
tgtaaagcac attatttcgc tcattattta aatagttcaa cttcaccttt accagataca 1500
tcaaaggttt taactacaaa tgaaaatgtt cactttaaaa gagcaaaaac aactgtaaat 1560
ggtaattatt ataatgatta tattattgat aatagtgacg atgatgataa taataataat 1620
aataataatt ataacgataa tagtaataat acaacaccaa caaagtcatt caatagtgtt 1680
aataaaagta aaaaattaaa tcatagaaat agtcatggtg aagaaggacc aagtggacca 1740
gtaacagatt cagttggata tatgaagaat agaggacatt ttgaagtaga atatgataat 1800
gaagcagagt tggttgtaaa ggatttaaca tttgaacctg acgatagtca agcagataga 1860
gatattaaat tgaatgtatt agaatcctat gatcaaagat tggatgaacg tattagaaga 1920
aggaatttca ttgttgagaa aggtttatta gattatagaa aagtagagag aaaacgatac 1980
aaagatgata aagagatatt aaattcattg aaatgtttcc ttcaaacagt taccaaagaa 2040
gagcatgaat caatgataaa tggattaata aatgaaaaga atataaagaa tagaattcta 2100
caattacaag agtatcgtga aaatggtatt aaaacattgg cagatggtca aaattttgat 2160
gaagataaaa gaaaacgtga agttgataaa tctatgaaac gttccaaatc tgaattggca 2220
tcatatagtt taaatagtgg attaagttca tacaatccaa atcataatcc atttggtcat 2280
cattatttag gtggtagtag tagtggatta agtggtggta gtggtggagg tggtggtggt 2340
ggtggtgatc catcatttaa aactcaaaaa caattaacta aagaaaaaga agatatttat 2400
ttaggtattg gtgagaatag gaaacatcat tcaagtaaat tgaaaaagaa tgctaaaatg 2460
gaattggaag gtttaccaaa tgctgatgca ctctcattaa aagagaaaca aatttgtaca 2520
actcataaac ttttaccaca acaataccta attgtcaaac aagcattaat ctctgaatct 2580
ttaaaaactc aaggtgtaat taaattatca accgctttta aactaattaa attaaatcaa 2640
gttaaaattc atagactttt agaattcttt gaaagaaatc attggttaaa atttaatatt 2700
gattgtgaaa ctaacacttc taatactact tcaaattata aataa 2745
<210>186
<211>914
<212>PRT
<213>Dictyostelium discoideum
<400>186
Met Thr Ser Thr Ile Asn Lys Glu Glu Pro Thr Thr Leu Val Asn Lys
1 5 10 15
Lys Arg Arg Lys Glu His Val Asp Asp Asp Asp Asp Asn Asp Asp Asp
20 25 30
Ile Glu Met Gln Asn Val Ser Asn Asp Asn Ile Asn Asn Thr Asp Asp
35 40 45
Glu Asn Asn Asn Val Asn Thr Asn Gly Asn Asn Thr Asn Lys Thr Asn
50 55 60
Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn Glu Glu
65 70 75 80
Asp Asp Asp Glu Glu Asp Leu Leu Ile Thr Lys Arg Arg Asn Ser Arg
85 90 95
Ser Thr Thr Met Ser Asn Asn Ser Ser Asn Lys Ser Thr Pro Ser Lys
100 105 110
Lys Lys Arg Ile Lys Lys Ser Tyr Asp Asn Asp Lys Asp Phe Val Gly
115 120 125
Asp Asp Glu Glu Glu Asp Asp Asn Asp Asp Glu Asp Asp Asp Gly Asp
130 135 140
Val Val Ile Ser Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn Thr Asn
145 150 155 160
Asn Asn Asn Asn Asn Asn Asn Gly Asn Gly Asn Gly Asn Asn Asn Thr
165 170 175
Asn Asn Ser Phe Glu Glu Asp Asp Asp Glu Glu Asp Asp Asp Glu Glu
180 185 190
Asp Glu Glu Glu Glu Glu Glu Gly Met Lys Ile Ser Lys Ser Lys Arg
195 200 205
Gln Thr Gln Ile Ile Pro Pro Thr Gln Ser Glu Val Asp Asp Leu Val
210 215 220
Asn Lys Ser Phe Asn Ser Val Asp Asp Asp Glu Glu Asp Asn Glu Glu
225 230 235 240
Asp Asp Lys Thr Lys Asn Thr Thr Thr Thr Thr Ala Thr Asn Thr Thr
245 250 255
Asn Val Pro Thr Thr Thr Thr Thr Thr Thr Thr Thr Ser Asn Asn Thr
260 265 270
Thr Leu Pro Thr Thr Thr Thr Thr Thr Thr Thr Asn Asn Thr Thr Lys
275 280 285
Pro Asn Val Lys Ile Glu Gln Pro Lys Thr Gln Gln Pro Pro Gln Pro
290 295 300
Gln Pro Ile Asn Lys Pro Thr Gln Ser Asn Thr Ile Thr Thr Asn Thr
305 310 315 320
Asn Thr Ser Thr Asn Thr Asn Thr Ala Thr Asn Thr Asn Ser Asn Thr
325 330 335
Ala Thr Ala Thr Thr Thr Ser Pro Thr Ser Met Thr Lys Gln Thr Arg
340 345 350
Lys Ser Thr Asn Ser Pro Thr Asn Ser Asn Gly Asn Asn Asn Asn Asn
355 360 365
Asn Asn Asn Asn Asn Phe Ile Glu Glu Gln Val Asn Glu Gly Leu Tyr
370 375 380
His Cys Asp Tyr Cys Gln Lys Asp Ile Ser Gly Val Val Arg Ile Arg
385 390 395 400
Cys Ser Val Cys Thr Asp Phe Asp Leu Cys Leu Glu Cys Phe Ser Val
405 410 415
Gly Val Glu Ile Thr Pro His Arg Asn Phe His Asp Tyr His Val Val
420 425 430
Asp Asn Met His Phe Pro Met Phe Thr Asp Asp Trp Gly Ala Asp Glu
435 440 445
Glu Leu Leu Leu Leu Glu Ala Ile Glu Leu Tyr Gly Leu Gly Asn Trp
450 455 460
Asn Glu Val Ser Glu Asn Val Gly Ala His Ser Lys Ser Pro Leu Glu
465 470 475 480
Cys Lys Ala His Tyr Phe Ala His Tyr Leu Asn Ser Ser Thr Ser Pro
485 490 495
Leu Pro Asp Thr Ser Lys Val Leu Thr Thr Asn Glu Asn Val His Phe
500 505 510
Lys Arg Ala Lys Thr Thr Val Asn Gly Asn Tyr Tyr Asn Asp Tyr Ile
515 520 525
Ile Asp Asn Ser Asp Asp Asp Asp Asn Asn Asn Asn Asn Asn Asn Tyr
530 535 540
Asn Asp Asn Ser Asn Asn Thr Thr Pro Thr Lys Ser Phe Asn Ser Val
545 550 555 560
Asn Lys Ser Lys Lys Leu Asn His Arg Asn Ser His Gly Glu Glu Gly
565 570 575
Pro Ser Gly Pro Val Thr Asp Ser Val Gly Tyr Met Lys Asn Arg Gly
580 585 590
His Phe Glu Val Glu Tyr Asp Asn Glu Ala Glu Leu Val Val Lys Asp
595 600 605
Leu Thr Phe Glu Pro Asp Asp Ser Gln Ala Asp Arg Asp Ile Lys Leu
610 615 620
Asn Val Leu Glu Ser Tyr Asp Gln Arg Leu Asp Glu Arg Ile Arg Arg
625 630 635 640
Arg Asn Phe Ile Val Glu Lys Gly Leu Leu Asp Tyr Arg Lys Val Glu
645 650 655
Arg Lys Arg Tyr Lys Asp Asp Lys Glu Ile Leu Asn Ser Leu Lys Cys
660 665 670
Phe Leu Gln Thr Val Thr Lys Glu Glu His Glu Ser Met Ile Asn Gly
675 680 685
Leu Ile Asn Glu Lys Asn Ile Lys Asn Arg Ile Leu Gln Leu Gln Glu
690 695 700
Tyr Arg Glu Asn Gly Ile Lys Thr Leu Ala Asp Gly Gln Asn Phe Asp
705 710 715 720
Glu Asp Lys Arg Lys Arg Glu Val Asp Lys Ser Met Lys Arg Ser Lys
725 730 735
Ser Glu Leu Ala Ser Tyr Ser Leu Asn Ser Gly Leu Ser Ser Tyr Asn
740 745 750
Pro Asn His Asn Pro Phe Gly His His Tyr Leu Gly Gly Ser Ser Ser
755 760 765
Gly Leu Ser Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Asp Pro
770 775 780
Ser Phe Lys Thr Gln Lys Gln Leu Thr Lys Glu Lys Glu Asp Ile Tyr
785 790 795 800
Leu Gly Ile Gly Glu Asn Arg Lys His His Ser Ser Lys Leu Lys Lys
805 810 815
Asn Ala Lys Met Glu Leu Glu Gly Leu Pro Asn Ala Asp Ala Leu Ser
820 825 830
Leu Lys Glu Lys Gln Ile Cys Thr Thr His Lys Leu Leu Pro Gln Gln
835 840 845
Tyr Leu Ile Val Lys Gln Ala Leu Ile Ser Glu Ser Leu Lys Thr Gln
850 855 860
Gly Val Ile Lys Leu Ser Thr Ala Phe Lys Leu Ile Lys Leu Asn Gln
865 870 875 880
Val Lys Ile His Arg Leu Leu Glu Phe Phe Glu Arg Asn His Trp Leu
885 890 895
Lys Phe Asn Ile Asp Cys Glu Thr Asn Thr Ser Asn Thr Thr Ser Asn
900 905 910
Tyr Lys
<210>187
<211>1683
<212>DNA
<213>Solanum lycopersicum
<400>187
atgggtcgtt ctcgggctgt tcatcaatcc actgatgatg atccaagcca gaggtctaag 60
agaaaaaggg cagtgccaaa tgtggagagt tttgatactg cagctactgg ccaaatattg 120
actgaaggga aaaaggcttt gtaccattgc aattattgca ataaagacat atctggaagg 180
attcggatta aatgtgttgt gtgttctgac tttgatcttt gtgtggaatg cttttctgtt 240
ggagcagaag tgcagcctca caaaagcaat catctgtata gggttatgga taacctgtca 300
tttcctctca tatgtgctga ctggaatgct gatgaggaaa tgttacttct agagggtttg 360
gaaatgtacg gattggcaaa ctgggctgaa gttgccgaac atgttggaac gaagagtaaa 420
cagcagtgta ttgaccacta taagtccacc tatattagtt ctccttgttt tccgcttccg 480
gacatgtccc atgttatggg aaagaacaga gaggaacttc ttgccatggc caaggatcaa 540
ggatatgcag ctcccggggg agttaatgtt aaagaagagt ctccattctc tgcaggaatc 600
aagatggaag atcaaaggga agaaaattca actggacttg cctcagttgg aggttctgct 660
tctggtacat tagcaggagc tggaaagagg acatctagct tacttcatag taaggagaat 720
catgatagca tcaaagtgga aggttgtcct gcagacagga gtgtcggaga gaaaaagcct 780
aggtcatcag tggacgaggg gccttccatg acagaattaa gtggttataa ttccaagaga 840
gaggagtttg aaattgaata cgataatgat gctgagcaga tggtggctga tatggaattt 900
aaagagacag ataccaatgc tgagcgtgaa ctgaaacttc gggtattgcg tatatacaat 960
aaaaggcttg atgagaggaa acgtaggaag gattttattt tggaaaggaa actacttcat 1020
cctgatcctt ttgagaaaga cctcaccccg gaggagaagg acatatgccg tcgttacagg 1080
gtgttcatgc gttttagttc taaagaggag catgaggatt tccttaggag cataatcgag 1140
gagcaccgaa tagttaaacg aatacgagat cttcaggatg cccgaattgc tggttgccga 1200
actttagctg aggcagaaag atatgttgaa caaaagagag cgcgggaatc tgaagaaaat 1260
atacgtagac tgaaggagaa cacccagagt ggcccaagtg gaaaatattt gcaaagagca 1320
ggtcacttta aagtggagca tgacagcagc cccagaggag ttggtagggg ccctgaaatg 1380
atggattgtt gcaatgactt atcatcaacc accgcaccac atggtgttgg aagtgctgta 1440
gacatttggg atgtcagtgg gttttcagga gctgagttgc tctcagaagc tgaaaaaaag 1500
ctttgtgatg agatgagaat cctgccggct cattatctaa acatgtcgca aaccatgtcc 1560
atggggatct ttaatggcaa catcaccaag aaatctgatg cgcatggtct attcaatgtt 1620
gatccgaata agattgacaa agtgtatgag atgcttgtca aaaagggcct ggctcaagca 1680
taa 1683
<210>188
<211>560
<212>PRT
<213>Solanum lycopersicum
<400>188
Met Gly Arg Ser Arg Ala Val His Gln Ser Thr Asp Asp Asp Pro Ser
1 5 10 15
Gln Arg Ser Lys Arg Lys Arg Ala Val Pro Asn Val Glu Ser Phe Asp
20 25 30
Thr Ala Ala Thr Gly Gln Ile Leu Thr Glu Gly Lys Lys Ala Leu Tyr
35 40 45
His Cys Asn Tyr Cys Asn Lys Asp Ile Ser Gly Arg Ile Arg Ile Lys
50 55 60
Cys Val Val Cys Ser Asp Phe Asp Leu Cys Val Glu Cys Phe Ser Val
65 70 75 80
Gly Ala Glu Val Gln Pro His Lys Ser Asn His Leu Tyr Arg Val Met
85 90 95
Asp Asn Leu Ser Phe Pro Leu Ile Cys Ala Asp Trp Asn Ala Asp Glu
100 105 110
Glu Met Leu Leu Leu Glu Gly Leu Glu Met Tyr Gly Leu Ala Asn Trp
115 120 125
Ala Glu Val Ala Glu His Val Gly Thr Lys Ser Lys Gln Gln Cys Ile
130 135 140
Asp His Tyr Lys Ser Thr Tyr Ile Ser Ser Pro Cys Phe Pro Leu Pro
145 150 155 160
Asp Met Ser His Val Met Gly Lys Asn Arg Glu Glu Leu Leu Ala Met
165 170 175
Ala Lys Asp Gln Gly Tyr Ala Ala Pro Gly Gly Val Asn Val Lys Glu
180 185 190
Glu Ser Pro Phe Ser Ala Gly Ile Lys Met Glu Asp Gln Arg Glu Glu
195 200 205
Asn Ser Thr Gly Leu Ala Ser Val Gly Gly Ser Ala Ser Gly Thr Leu
210 215 220
Ala Gly Ala Gly Lys Arg Thr Ser Ser Leu Leu His Ser Lys Glu Asn
225 230 235 240
His Asp Ser Ile Lys Val Glu Gly Cys Pro Ala Asp Arg Ser Val Gly
245 250 255
Glu Lys Lys Pro Arg Ser Ser Val Asp Glu Gly Pro Ser Met Thr Glu
260 265 270
Leu Ser Gly Tyr Asn Ser Lys Arg Glu Glu Phe Glu Ile Glu Tyr Asp
275 280 285
Asn Asp Ala Glu Gln Met Val Ala Asp Met Glu Phe Lys Glu Thr Asp
290 295 300
Thr Asn Ala Glu Arg Glu Leu Lys Leu Arg Val Leu Arg Ile Tyr Asn
305 310 315 320
Lys Arg Leu Asp Glu Arg Lys Arg Arg Lys Asp Phe Ile Leu Glu Arg
325 330 335
Lys Leu Leu His Pro Asp Pro Phe Glu Lys Asp Leu Thr Pro Glu Glu
340 345 350
Lys Asp Ile Cys Arg Arg Tyr Arg Val Phe Met Arg Phe Ser Ser Lys
355 360 365
Glu Glu His Glu Asp Phe Leu Arg Ser Ile Ile Glu Glu His Arg Ile
370 375 380
Val Lys Arg Ile Arg Asp Leu Gln Asp Ala Arg Ile Ala Gly Cys Arg
385 390 395 400
Thr Leu Ala Glu Ala Glu Arg Tyr Val Glu Gln Lys Arg Ala Arg Glu
405 410 415
Ser Glu Glu Asn Ile Arg Arg Leu Lys Glu Asn Thr Gln Ser Gly Pro
420 425 430
Ser Gly Lys Tyr Leu Gln Arg Ala Gly His Phe Lys Val Glu His Asp
435 440 445
Ser Ser Pro Arg Gly Val Gly Arg Gly Pro Glu Met Met Asp Cys Cys
450 455 460
Asn Asp Leu Ser Ser Thr Thr Ala Pro His Gly Val Gly Ser Ala Val
465 470 475 480
Asp Ile Trp Asp Val Ser Gly Phe Ser Gly Ala Glu Leu Leu Ser Glu
485 490 495
Ala Glu Lys Lys Leu Cys Asp Glu Met Arg Ile Leu Pro Ala His Tyr
500 505 510
Leu Asn Met Ser Gln Thr Met Ser Met Gly Ile Phe Asn Gly Asn Ile
515 520 525
Thr Lys Lys Ser Asp Ala His Gly Leu Phe Asn Val Asp Pro Asn Lys
530 535 540
Ile Asp Lys Val Tyr Glu Met Leu Val Lys Lys Gly Leu Ala Gln Ala
545 550 555 560
<210>189
<211>1632
<212>DNA
<213>Solanum lycopersicum
<400>189
atgggtcgct ctcgtgggaa ttttcaagct gatgaagatc ccagccaaag atcaaggagg 60
aaaaagaatg cctcaagtgt agacaattta gaatctgcga ccactggtca agggacagct 120
gatggcaaaa gggccttgta tcactgcaat tattgcaaca aagacattag cgggagaact 180
cgtataaaat gtgctgtatg ttacgatttt gacctatgta tagagtgctt ctctgttggt 240
gctgaggtgc atccccacaa aagccatcac cactataggg ttatggatat cttagctttc 300
ccgcttattt gcccagactg gaatgctgat gaagagatgt tgctccttga gggaattgag 360
atgtatggca tgggtaattg ggctgaagta ggtgagcatg tcggaacaaa gacaaaagaa 420
gcctgcattg accattttaa ggatgcgtac ttaaagtcac cttactttcc tctaccagat 480
atgactcacg tcatggggaa aaacagaatg gaactccttg ccatggctaa agggaatttc 540
actgataaga aaggactctc ttcacttggt gatgttgctc ctaaagatga atcgttctct 600
ccgtctcgaa tcaaagttga agacactcat aaaattggtc cttcaggacg tttaacttct 660
gtatccaatg cgggaatcac aggcataaaa aagccatcca gcaaaacgct aatcaaagat 720
caaaatgaac ctgttaaatt tgaagataat tcaggcagaa attttggagg caagaaaccg 780
aaatctttga aggatgatgg atcctcattg atgaaattaa gtggatatat tcccaagagg 840
caagaatttg atcctgaata tgataatgat gcggagcaac tattggctga tatggaattc 900
aaggaaactg aaactgaaga ggagcgcgaa cttaagctgc gtgttctgcg tatctattcc 960
aagaggcttg atgaaagaaa acgccgcaag gtttttattc tagagaggaa tttactccag 1020
ccaagtgaat ttgagaagaa tttgtcacca gaagagaaag gtatatgccg atgttatgat 1080
gccattatgc gctttctctc gaaggaggag catgaagaat tacttaaggc tgtggtctca 1140
gaacatagat atctgaaaag aatacaagaa ctcaaggaag cgaaagctgc aggttgtcgt 1200
tcgtctgctg aagttgatag gtacttagaa aggaaaagga agaaggaagt tgaagaaggt 1260
gttccgagaa agggaagctc tcagattggc ccaatgagcc aggaaagcct gaacatacct 1320
gcttcttctg agtcacttgg aatacattca aatagaaaac cttgtagcca ggcgattttg 1380
agttccgaca ccaatgcagg tgttccagct ttttctgcag gagaactgtt atctgaacct 1440
gagaaacaac tatgtcaaga aatcaggtta tcgccgcatc attatcttag gatgcaggag 1500
gtccttacaa tacaaattta tagtggtaat atcactagaa aatcagatgc ttatcctttg 1560
tttcaaatag aagcaactaa agtagataga gtttatgata tgcttttgaa gaaaggagtt 1620
gcacccttgt aa 1632
<210>190
<211>543
<212>PRT
<213>Solanum lycopersicum
<400>190
Met Gly Arg Ser Arg Gly Asn Phe Gln Ala Asp Glu Asp Pro Ser Gln
1 5 10 15
Arg Ser Arg Arg Lys Lys Asn Ala Ser Ser Val Asp Asn Leu Glu Ser
20 25 30
Ala Thr Thr Gly Gln Gly Thr Ala Asp Gly Lys Arg Ala Leu Tyr His
35 40 45
Cys Asn Tyr Cys Asn Lys Asp Ile Ser Gly Arg Thr Arg Ile Lys Cys
50 55 60
Ala Val Cys Tyr Asp Phe Asp Leu Cys Ile Glu Cys Phe Ser Val Gly
65 70 75 80
Ala Glu Val His Pro His Lys Ser His His His Tyr Arg Val Met Asp
85 90 95
Ile Leu Ala Phe Pro Leu Ile Cys Pro Asp Trp Asn Ala Asp Glu Glu
100 105 110
Met Leu Leu Leu Glu Gly Ile Glu Met Tyr Gly Met Gly Asn Trp Ala
115 120 125
Glu Val Gly Glu His Val Gly Thr Lys Thr Lys Glu Ala Cys Ile Asp
130 135 140
His Phe Lys Asp Ala Tyr Leu Lys Ser Pro Tyr Phe Pro Leu Pro Asp
145 150 155 160
Met Thr His Val Met Gly Lys Asn Arg Met Glu Leu Leu Ala Met Ala
165 170 175
Lys Gly Asn Phe Thr Asp Lys Lys Gly Leu Ser Ser Leu Gly Asp Val
180 185 190
Ala Pro Lys Asp Glu Ser Phe Ser Pro Ser Arg Ile Lys Val Glu Asp
195 200 205
Thr His Lys Ile Gly Pro Ser Gly Arg Leu Thr Ser Val Ser Asn Ala
210 215 220
Gly Ile Thr Gly Ile Lys Lys Pro Ser Ser Lys Thr Leu Ile Lys Asp
225 230 235 240
Gln Asn Glu Pro Val Lys Phe Glu Asp Asn Ser Gly Arg Asn Phe Gly
245 250 255
Gly Lys Lys Pro Lys Ser Leu Lys Asp Asp Gly Ser Ser Leu Met Lys
260 265 270
Leu Ser Gly Tyr Ile Pro Lys Arg Gln Glu Phe Asp Pro Glu Tyr Asp
275 280 285
Asn Asp Ala Glu Gln Leu Leu Ala Asp Met Glu Phe Lys Glu Thr Glu
290 295 300
Thr Glu Glu Glu Arg Glu Leu Lys Leu Arg Val Leu Arg Ile Tyr Ser
305 310 315 320
Lys Arg Leu Asp Glu Arg Lys Arg Arg Lys Val Phe Ile Leu Glu Arg
325 330 335
Asn Leu Leu Gln Pro Ser Glu Phe Glu Lys Asn Leu Ser Pro Glu Glu
340 345 350
Lys Gly Ile Cys Arg Cys Tyr Asp Ala Ile Met Arg Phe Leu Ser Lys
355 360 365
Glu Glu His Glu Glu Leu Leu Lys Ala Val Val Ser Glu His Arg Tyr
370 375 380
Leu Lys Arg Ile Gln Glu Leu Lys Glu Ala Lys Ala Ala Gly Cys Arg
385 390 395 400
Ser Ser Ala Glu Val Asp Arg Tyr Leu Glu Arg Lys Arg Lys Lys Glu
405 410 415
Val Glu Glu Gly Val Pro Arg Lys Gly Ser Ser Gln Ile Gly Pro Met
420 425 430
Ser Gln Glu Ser Leu Asn Ile Pro Ala Ser Ser Glu Ser Leu Gly Ile
435 440 445
His Ser Asn Arg Lys Pro Cys Ser Gln Ala Ile Leu Ser Ser Asp Thr
450 455 460
Asn Ala Gly Val Pro Ala Phe Ser Ala Gly Glu Leu Leu Ser Glu Pro
465 470 475 480
Glu Lys Gln Leu Cys Gln Glu Ile Arg Leu Ser Pro His His Tyr Leu
485 490 495
Arg Met Gln Glu Val Leu Thr Ile Gln Ile Tyr Ser Gly Asn Ile Thr
500 505 510
Arg Lys Ser Asp Ala Tyr Pro Leu Phe Gln Ile Glu Ala Thr Lys Val
515 520 525
Asp Arg Val Tyr Asp Met Leu Leu Lys Lys Gly Val Ala Pro Leu
530 535 540
<210>191
<211>1548
<212>DNA
<213>Ostreococcus lucimarinus
<400>191
atggcgagcg cgctcgtgcc gaaacggcga cgggtggcga cggaaaacgc gatgacgaag 60
ctgagtggga acggggagtc gtgcgcactg tttaactgta actattgcca aaaggacatc 120
tcgaacgtgg tgcgcgtacg gtgcgcggag tgcgcaaacg tggatctgtg cacggagtgc 180
ttcgcggtcg gcgtggagcc gcacccgcac aaggcgtatc atcagtatca cgtcatcgac 240
aacatgtcgt ttccgctgtt cacgcgagat tggggggctg acgaagagtt gttattgctg 300
gaggcagtgg agatgttcgg gttggggaac tggaccgagg tgagcgaaca cgtcgggacg 360
aagacgcgcg cgcagtgtca cgcgcactat tttgaagtct acgtcaagtc tccttgcgcg 420
ccgttaccgg atatgtcgaa gattttagga aaaggcgtcg cgcgtatgac atcagacgag 480
ctcaaagcgg aggcggagca aaaggcgaac gaaaataagg atgtggagga ggaggagaag 540
cttctcgaat cgcttgctaa cccgaacgca gtgaagacgg agggcaacgt gcaggaactc 600
acaggttaca acatcaagcg caatgagttc gatcccgaat acgacatgga tgccgaactt 660
cccctggcgg agatggaatt tcgcgaaaac gacaccgaag aagacgtcca gatgaagctg 720
cgaatgattg aaatctacaa cagccggctt caagaacgag cgagaagaaa acaattcatt 780
ctcgaacgca atctgctgaa cgtgaaaaag caacaaaacg tggaaaagaa gcgttcacaa 840
tacgagcgcg acttacacgg caccatgcgt atatttgcac gctttctcac gagtaccgag 900
tacgacgtct tgctcgaggg tctcgccgcg gagcaccgaa tccgaacccg catcaccgaa 960
ctgaaagagt acagacgcaa tggtattcat accatcgcag agggcgagga ttacgatttg 1020
gagaagcgtc gtcgtgagac ggagttcgct cgtctacacg cgatcgagca tccaactagc 1080
aagaacatag ccagagcgaa caacttaggt cctcgccgtc gaatgtactt gtcacttgat 1140
ctcgccgatc ttccaggcgt agaccttttg aacgacgacg aaaaggagtt gtgcaggagc 1200
tgtcgcttat tgcctgtgca gtatctctcg atgaaggtgg agttgatgcg agagggtctc 1260
aagtccgaaa agccgctcaa cagaaatcac gttcggaata tgttcaaagt agacccactc 1320
aaggctattc gtgtgtatga gttactccta cagcacggct gggtgttgga agacggcttc 1380
gtgaacccag gtgaggatga agactccgaa cctgcgccga aaaagtcagc cagcgcagac 1440
gaggaggaag acgaggagga cgatgaagta gattacgaaa ccgacgataa cgacgaagac 1500
gaggacgagg aagacgacga ggaagaggat agcgaggaag acgattag 1548
<210>192
<211>515
<212>PRT
<213>Ostreococcus lucimarinus
<400>192
Met Ala Ser Ala Leu Val Pro Lys Arg Arg Arg Val Ala Thr Glu Asn
1 5 10 15
Ala Met Thr Lys Leu Ser Gly Asn Gly Glu Ser Cys Ala Leu Phe Asn
20 25 30
Cys Asn Tyr Cys Gln Lys Asp Ile Ser Asn Val Val Arg Val Arg Cys
35 40 45
Ala Glu Cys Ala Asn Val Asp Leu Cys Thr Glu Cys Phe Ala Val Gly
50 55 60
Val Glu Pro His Pro His Lys Ala Tyr His Gln Tyr His Val Ile Asp
65 70 75 80
Asn Met Ser Phe Pro Leu Phe Thr Arg Asp Trp Gly Ala Asp Glu Glu
85 90 95
Leu Leu Leu Leu Glu Ala Val Glu Met Phe Gly Leu Gly Asn Trp Thr
100 105 110
Glu Val Ser Glu His Val Gly Thr Lys Thr Arg Ala Gln Cys His Ala
115 120 125
His Tyr Phe Glu Val Tyr Val Lys Ser Pro Cys Ala Pro Leu Pro Asp
130 135 140
Met Ser Lys Ile Leu Gly Lys Gly Val Ala Arg Met Thr Ser Asp Glu
145 150 155 160
Leu Lys Ala Glu Ala Glu Gln Lys Ala Asn Glu Asn Lys Asp Val Glu
165 170 175
Glu Glu Glu Lys Leu Leu Glu Ser Leu Ala Asn Pro Asn Ala Val Lys
180 185 190
Thr Glu Gly Asn Val Gln Glu Leu Thr Gly Tyr Asn Ile Lys Arg Asn
195 200 205
Glu Phe Asp Pro Glu Tyr Asp Met Asp Ala Glu Leu Pro Leu Ala Glu
210 215 220
Met Glu Phe Arg Glu Asn Asp Thr Glu Glu Asp Val Gln Met Lys Leu
225 230 235 240
Arg Met Ile Glu Ile Tyr Asn Ser Arg Leu Gln Glu Arg Ala Arg Arg
245 250 255
Lys Gln Phe Ile Leu Glu Arg Asn Leu Leu Asn Val Lys Lys Gln Gln
260 265 270
Asn Val Glu Lys Lys Arg Ser Gln Tyr Glu Arg Asp Leu His Gly Thr
275 280 285
Met Arg Ile Phe Ala Arg Phe Leu Thr Ser Thr Glu Tyr Asp Val Leu
290 295 300
Leu Glu Gly Leu Ala Ala Glu His Arg Ile Arg Thr Arg Ile Thr Glu
305 310 315 320
Leu Lys Glu Tyr Arg Arg Asn Gly Ile His Thr Ile Ala Glu Gly Glu
325 330 335
Asp Tyr Asp Leu Glu Lys Arg Arg Arg Glu Thr Glu Phe Ala Arg Leu
340 345 350
His Ala Ile Glu His Pro Thr Ser Lys Asn Ile Ala Arg Ala Asn Asn
355 360 365
Leu Gly Pro Arg Arg Arg Met Tyr Leu Ser Leu Asp Leu Ala Asp Leu
370 375 380
Pro Gly Val Asp Leu Leu Asn Asp Asp Glu Lys Glu Leu Cys Arg Ser
385 390 395 400
Cys Arg Leu Leu Pro Val Gln Tyr Leu Ser Met Lys Val Glu Leu Met
405 410 415
Arg Glu Gly Leu Lys Ser Glu Lys Pro Leu Asn Arg Asn His Val Arg
420 425 430
Asn Met Phe Lys Val Asp Pro Leu Lys Ala Ile Arg Val Tyr Glu Leu
435 440 445
Leu Leu Gln His Gly Trp Val Leu Glu Asp Gly Phe Val Asn Pro Gly
450 455 460
Glu Asp Glu Asp Ser Glu Pro Ala Pro Lys Lys Ser Ala Ser Ala Asp
465 470 475 480
Glu Glu Glu Asp Glu Glu Asp Asp Glu Val Asp Tyr Glu Thr Asp Asp
485 490 495
Asn Asp Glu Asp Glu Asp Glu Glu Asp Asp Glu Glu Glu Asp Ser Glu
500 505 510
Glu Asp Asp
515
<210>193
<211>1704
<212>DNA
<213>Oryza sativa
<400>193
atgggccggt ctcgcggggt gcccaattcc ggcgacgatg aaacgaacca caggtcgaag 60
cggaggaggg tcgcgtcgag cggcgatgcg ccggactcgc tctcggcggc ctgcggggga 120
gccggagagg gtggtgggaa gaaggcgctg taccactgca actactgcaa taaggatatt 180
tccgggaaga tccggatcaa gtgctccaag tgccccgact tcgacctctg cgtcgagtgc 240
ttctcggtcg gcgccgaggt caccccgcac cgcagcaacc atccttacag ggtcatggac 300
aacctgtctt tccctcttat ttgtccagat tggaatgcag acgaggaaat ccttcttcta 360
gagggaattg aaatgtatgg tctgggaaat tgggctgaag ttgcggagca tgttggcacc 420
aagaccaagg cacaatgcat tgatcattat acaactgcat acatgaactc accttgttat 480
ccccttccgg atatgtctca tgttaacggt aagaacagga aggaattgct tgctatggct 540
aaagtacaag gcgagagtaa aaaagtgtta ccaggggatt tgacccctaa ggacgagtct 600
ccattttctc ccccaagggt caaggtggaa gatgcacttg gagaaggttt agctggtcga 660
tcaccttcac acatagctgg gggtgcaaat aagaaagcat caaatgttgg acaattcaaa 720
gatggtgcta atgtagcaaa agttgaagat ggtcatgtgg atagaagtat aggtgtgaaa 780
aaaccccgat attctgcaga tgaagggcct tctttgactg aactgagtgg atacaattca 840
aagagacatg aatttgaccc agagtatgat aacgatgctg aacaggcact cgctgagatg 900
gagtttaaag aaactgattc ggaaactgat cgtgaactga agctaagggt attgcgtatt 960
tacttgtcaa ggcttgatga aagaaaaagg agaaaagagt tcatactgga aagaaactta 1020
ctatttccta atcctttgga gaaggatctc acaaatgaag acaaggaagt ttaccatcgc 1080
tataaggtgt tcatgcgttt cctttctaag gaggaacatg aagcacttgt taggagtgtt 1140
cttgaggaac ggaaaattcg aaggaggatt caagagcttc aggaatgtcg ttctgctgga 1200
tgccgtacat tggctgaagc aaagattcac atagagcaaa agaggaaaaa ggaacatgag 1260
gtgaatgccc aaaaagctaa ggaaagtggt cagctcttat ccaacactaa agtggtgcat 1320
aagacgaatc gacctatgaa aatcgagtca gatggtaatt tggatcagaa gaaaggtggt 1380
gccagcttgg attctactgg cagggattct ccaaaaacca cagggcatgc aggcactaaa 1440
cattgggatg actgggatat tgttggtttt cctggggcag agctattaag caccagtgaa 1500
aaaaatctat gctgtcagaa cagattgcta cccaaccatt acctgaaaat gcaggaggtt 1560
ttgatgcagg aaatattcaa gggtagtgtc gccaagaagg aagatgccca tgtattattt 1620
aaggttgacc ctgccaaagt agataacgtt tatgatatgg tgacgaaaaa gttgggtacc 1680
aatgaggagg ccccgactgt ttag 1704
<210>194
<211>567
<212>PRT
<213>Oryza sativa
<400>194
Met Gly Arg Ser Arg Gly Val Pro Asn Ser Gly Asp Asp Glu Thr Asn
1 5 10 15
His Arg Ser Lys Arg Arg Arg Val Ala Ser Ser Gly Asp Ala Pro Asp
20 25 30
Ser Leu Ser Ala Ala Cys Gly Gly Ala Gly Glu Gly Gly Gly Lys Lys
35 40 45
Ala Leu Tyr His Cys Asn Tyr Cys Asn Lys Asp Ile Ser Gly Lys Ile
50 55 60
Arg Ile Lys Cys Ser Lys Cys Pro Asp Phe Asp Leu Cys Val Glu Cys
65 70 75 80
Phe Ser Val Gly Ala Glu Val Thr Pro His Arg Ser Asn His Pro Tyr
85 90 95
Arg Val Met Asp Asn Leu Ser Phe Pro Leu Ile Cys Pro Asp Trp Asn
100 105 110
Ala Asp Glu Glu Ile Leu Leu Leu Glu Gly Ile Glu Met Tyr Gly Leu
115 120 125
Gly Asn Trp Ala Glu Val Ala Glu His Val Gly Thr Lys Thr Lys Ala
130 135 140
Gln Cys Ile Asp His Tyr Thr Thr Ala Tyr Met Asn Ser Pro Cys Tyr
145 150 155 160
Pro Leu Pro Asp Met Ser His Val Asn Gly Lys Asn Arg Lys Glu Leu
165 170 175
Leu Ala Met Ala Lys Val Gln Gly Glu Ser Lys Lys Val Leu Pro Gly
180 185 190
Asp Leu Thr Pro Lys Asp Glu Ser Pro Phe Ser Pro Pro Arg Val Lys
195 200 205
Val Glu Asp Ala Leu Gly Glu Gly Leu Ala Gly Arg Ser Pro Ser His
210 215 220
Ile Ala Gly Gly Ala Asn Lys Lys Ala Ser Asn Val Gly Gln Phe Lys
225 230 235 240
Asp Gly Ala Asn Val Ala Lys Val Glu Asp Gly His Val Asp Arg Ser
245 250 255
Ile Gly Val Lys Lys Pro Arg Tyr Ser Ala Asp Glu Gly Pro Ser Leu
260 265 270
Thr Glu Leu Ser Gly Tyr Asn Ser Lys Arg His Glu Phe Asp Pro Glu
275 280 285
Tyr Asp Asn Asp Ala Glu Gln Ala Leu Ala Glu Met Glu Phe Lys Glu
290 295 300
Thr Asp Ser Glu Thr Asp Arg Glu Leu Lys Leu Arg Val Leu Arg Ile
305 310 315 320
Tyr Leu Ser Arg Leu Asp Glu Arg Lys Arg Arg Lys Glu Phe Ile Leu
325 330 335
Glu Arg Asn Leu Leu Phe Pro Asn Pro Leu Glu Lys Asp Leu Thr Asn
340 345 350
Glu Asp Lys Glu Val Tyr His Arg Tyr Lys Val Phe Met Arg Phe Leu
355 360 365
Ser Lys Glu Glu His Glu Ala Leu Val Arg Ser Val Leu Glu Glu Arg
370 375 380
Lys Ile Arg Arg Arg Ile Gln Glu Leu Gln Glu Cys Arg Ser Ala Gly
385 390 395 400
Cys Arg Thr Leu Ala Glu Ala Lys Ile His Ile Glu Gln Lys Arg Lys
405 410 415
Lys Glu His Glu Val Asn Ala Gln Lys Ala Lys Glu Ser Gly Gln Leu
420 425 430
Leu Ser Asn Thr Lys Val Val His Lys Thr Asn Arg Pro Met Lys Ile
435 440 445
Glu Ser Asp Gly Asn Leu Asp Gln Lys Lys Gly Gly Ala Ser Leu Asp
450 455 460
Ser Thr Gly Arg Asp Ser Pro Lys Thr Thr Gly His Ala Gly Thr Lys
465 470 475 480
His Trp Asp Asp Trp Asp Ile Val Gly Phe Pro Gly Ala Glu Leu Leu
485 490 495
Ser Thr Ser Glu Lys Asn Leu Cys Cys Gln Asn Arg Leu Leu Pro Asn
500 505 510
His Tyr Leu Lys Met Gln Glu Val Leu Met Gln Glu Ile Phe Lys Gly
515 520 525
Ser Val Ala Lys Lys Glu Asp Ala His Val Leu Phe Lys Val Asp Pro
530 535 540
Ala Lys Val Asp Asn Val Tyr Asp Met Val Thr Lys Lys Leu Gly Thr
545 550 555 560
Asn Glu Glu Ala Pro Thr Val
565
<210>195
<211>1683
<212>DNA
<213>Populus trichocarpa
<400>195
atgggtcgtt ctcgagggaa ttttcactct aatgatgaag accctactca gagatcaaga 60
aggaagaaga atgcggcaag tggagataat tccgaatctt tattggctgg ccaaggaagt 120
ggtgatggga aaagggcatt ataccattgc aattattgca ataaagatat aacagggaag 180
acccgtatca aatgcgctat gtgccctgat tttgacctat gcttagagtg cttctctgta 240
ggagctgagg ttacacctca taaaagcaat cacccttaca gggttatgga taatttatct 300
ttcccgctta tttgccctga ttggaatgca gatgaagaaa tactgcttct agagggaatt 360
gaaatgtatg gattggggaa ctgggcagaa attgctgagc acgtggggac aaagagtaaa 420
gacacatgta ttgaacacta taatagtgtt tacatgcaat cccagtactt ccctctcccg 480
gacatgtcgc ttgttgttgg gaaaaataga aaggaacttc ttgctatggc caagggatat 540
agtgaggaca aaaaaggtgc tgctatgctt ggggatctta ctttgaagga agaatctcca 600
ttttctcctt caagagtgaa agtcgaagaa atgcataaag gaggttcctc tggccgatta 660
tcaacattaa actcagaggt agaatctgct ggccgtccta ctaccacaaa ctctgcagca 720
acagctgcta ataagaaggc atctagcatt gctcgggtta aagatggacc taatgttgtt 780
aaagtggaag atcctcaggt ggaccgaaat gctaaaggga agaaaccgaa ttcctctggg 840
agtgagggtc catctttaat ggagttgagc ggttataacc ccaagaggca ggagtttgat 900
cccgaatatg ataatgatgc tgagcagttg ctagctgaga tggaatttaa agatactgac 960
actgaggaag agcgggagct gaaattgcga gtgctgcata tatattcaaa gcggctcgac 1020
gagagaaagc gcagaaaaga tttcatacta gaaagaaatc tgctgcaacc aagtcctttt 1080
gaaaaggact tgactccaga agagagggca ttatgtcggc gttatgaccc tttcatgcgt 1140
tttcattcca aggaagagca tgaggaattg cttcaggttg ttatcgaaga gcatcggatg 1200
ctgaaaagga tcgaagagct gaaggaagct caagcagctg gttgccgcac agcagctgag 1260
gcggacaggt accttgagca gaagaggaaa aaagaagccg aggaaaattc cagtagactg 1320
aaagacaatg ctctggttgg tcctagcaac catggtgctc ccaatgcatt tattccgtca 1380
gagtctgtta ggaaggattc gagtactaga cctgtaggac agggctctgc tagctatgcc 1440
aatggtttgg acacaacagg cttttatgaa acgcagctac tatctgaaac tgaaaaacgg 1500
ctatgccgtg agattcacct acctcctcct gtctacctca agatgcaaga ggtcatgacc 1560
aaagagatct tcagcggtaa catcactaag aaattggatg ctcacccctt gttcaagatt 1620
gaagcaagca aagttgatag ggtgtatgat atacttgtga agaaggggat tgctcaacct 1680
tga 1683
<210>196
<211>540
<212>PRT
<213>Populus trichocarpa
<400>196
Met Gly Arg Ser Arg Gly Arg Pro Pro Ser Ser Gly Thr Ser Thr Ala
1 5 10 15
Ala Ala Ala Ser Asp Asp Pro Asn Asn Arg Ser Ser Lys Arg Lys Lys
20 25 30
Thr Thr Ser Asn Val Gly Ser Ile Glu Thr Ala Phe Pro Ala Val Tyr
35 40 45
Gln Glu Lys Gly Gln Gly Lys Leu Ala Leu Tyr His Cys Asn Tyr Cys
50 55 60
His Lys Asp Ile Ser Gly Met Val Arg Ile Lys Cys Ala Val Cys Pro
65 70 75 80
Asp Phe Asp Leu Cys Val Glu Cys Phe Ser Val Gly Ala Glu Val Thr
85 90 95
Pro His Lys Ser Asn His Pro Tyr Arg Val Met Asp Asn Leu Ser Phe
100 105 110
Pro Leu Phe His Pro Asp Trp Asn Thr Asp Glu Glu Ile Leu Leu Leu
115 120 125
Glu Gly Ile Glu Met Tyr Gly Phe Gly Asn Trp Thr Glu Val Ser Glu
130 135 140
His Ala Gly Thr Lys Ser Lys Ser Gln Cys Ile Asp His Tyr Asn Ala
145 150 155 160
Val Tyr Met Asp Ser Pro Cys Phe Pro Leu Pro Asp Met Ser His Val
165 170 175
Met Gly Lys Thr Arg Glu Glu Leu Leu Ala Met Ala Arg Gly Asn Val
180 185 190
Glu Met Lys Lys Glu Val Ser Ser His Met Gly Ser Ser Ser Gly Asn
195 200 205
Thr Phe Ser Asp Ala Val Lys Lys Ala Ser Asn Glu Ala Gln Ile Lys
210 215 220
Asp Lys Ile Lys Val Glu Glu Pro Leu Ser Asp Arg Ser Ile Arg Glu
225 230 235 240
Lys Lys Pro Arg Ile Cys Gly Glu Glu Gly Pro Ser Met Thr Glu Leu
245 250 255
Ser Gly Tyr Asn Phe Lys Arg Gln Glu Phe Glu Ile Glu Tyr Asp Asn
260 265 270
Asp Ala Glu Gln Leu Leu Ala Asp Met Glu Phe Lys Asp Thr Asp Thr
275 280 285
Asp Ala Glu Leu Asp Met Lys Leu Gln Val Leu Arg Ile Tyr Ser Lys
290 295 300
Arg Leu Asp Glu Arg Lys Arg Arg Lys Asp Phe Ile Leu Glu Arg Asn
305 310 315 320
Leu Phe Tyr Pro Asp Ala Phe Glu Lys Asn Ile Ser Pro Glu Glu Lys
325 330 335
Glu Ile Tyr Gln Arg Tyr Lys Val Phe Met Arg Phe His Thr Lys Glu
340 345 350
Glu His Glu Glu Leu Met Lys Thr Val Ile Glu Asp His Gln Ile Met
355 360 365
Lys Arg Ile Gln Asp Leu Gln Glu Ala Arg Ala Ala Gly Cys Gln Thr
370 375 380
Ala Gly Glu Ala Gln Gly Phe Ile Glu Gln Lys Arg Lys Lys Glu Ala
385 390 395 400
Glu Glu Ser Ala Gln Arg Ala Lys Glu Ser Met Gln Ala Gly Pro Ala
405 410 415
Gly Lys Leu Leu Pro Lys Pro Asn His Leu Asp Ser Ser Pro Arg Gly
420 425 430
Ala Val Lys Cys Ser Thr Val Phe His Pro Gly Gly Asn Asp Ser Ser
435 440 445
Ser Met Ile Ala Lys Gln Ala Ile Ser Ser Thr Leu Asp Glu Trp Asp
450 455 460
Ile Ala Gly Phe Leu Gly Ala Asp Leu Leu Ser Glu Ser Asp Lys Arg
465 470 475 480
Leu Cys Cys Glu Leu Arg Ile Leu Pro Ala His Tyr Leu Asn Met Leu
485 490 495
His Ile Met Ser Ile Glu Ile Thr Lys Gly Thr Val Thr Asn Lys Thr
500 505 510
Asp Ala His Ser Leu Phe Lys Val Glu Ser Ser Lys Val Asp Arg Val
515 520 525
Tyr Asp Met Leu Val Lys Lys Gly Ile Ala Leu Ala
530 535 540
<210>197
<211>1623
<212>DNA
<213>Populus trichocarpa
<400>197
atgggtcgtt cgcgcggtcg ccctccttct tccggaacct ccactgccgc cgccgcctct 60
gatgatccaa acaatagatc ttcaaaaaga aaaaagacga cttccaatgt agggagtata 120
gagactgcat ttccagcagt atatcaagaa aagggtcaag ggaaactggc actataccac 180
tgcaattact gtcataaaga catctctgga atggttcgca ttaagtgtgc agtgtgtcct 240
gatttcgacc tttgcgttga gtgtttttct gttggagccg aagtgactcc tcataaaagc 300
aatcatccct acagggttat ggacaatctg tcttttccgc tctttcatcc agactggaat 360
acagatgaag agatattact tctagagggc attgaaatgt atggatttgg gaactggact 420
gaagtttcag aacatgctgg aaccaagagc aaatctcaat gcattgatca ctataatgct 480
gtatacatgg actccccatg ctttcctctc ccagacatgt ctcatgttat gggaaaaaca 540
agagaggagc tccttgcaat ggccagagga aatgttgaaa tgaagaaaga agtcagctct 600
catatgggtt caagcagtgg caacacattc tcagatgcag ttaagaaagc atctaacgag 660
gcccagatta aggataagat taaagtggaa gaacctctgt ctgacaggag tattcgagag 720
aaaaaaccta gaatttgcgg agaggaagga ccttcaatga cagagttaag tggctataat 780
ttcaagaggc aggaatttga gattgaatat gataatgatg cagagcaact actggcagat 840
atggaattca aagatactga cactgatgct gagcttgaca tgaaactgca agttctgcgc 900
atttactcaa aaaggcttga tgagaggaaa cggaggaaag attttatttt ggaaagaaat 960
ttgttttacc ctgatgcatt tgagaagaac atttcacctg aagagaagga aatatatcag 1020
cgttacaagg tcttcatgag gttccacaca aaagaagagc atgaagaatt gatgaagact 1080
gttattgaag atcatcagat tatgaaaaga atacaagatc ttcaggaagc tcgagctgct 1140
ggctgtcaaa cagctggtga ggcccaagga tttattgagc agaagagaaa gaaggaagcc 1200
gaagaaagtg cccaaagagc gaaggaaagt atgcaagcag gcccagcagg taaactgttg 1260
ccaaagccaa atcatcttga cagcagccct cgtggagctg tcaagtgttc caccgttttt 1320
catcctggtg gcaacgactc atcttcaatg attgcaaaac aagcaatttc aagcaccctc 1380
gatgagtggg atattgctgg attcctaggg gctgatttgc tctctgaatc tgataagcgt 1440
ctttgttgtg agttgagaat actacctgca cattatctca acatgctgca cataatgtca 1500
atagagataa caaagggtac tgttaccaac aaaaccgatg ctcatagcct gttcaaggtg 1560
gaatcaagca aagtggatag agtatatgat atgttagtga aaaaggggat tgctctagca 1620
tga 1623
<210>198
<211>560
<212>PRT
<213>Populus trichocarpa
<400>198
Met Gly Arg Ser Arg Gly Asn Phe His Ser Asn Asp Glu Asp Pro Thr
1 5 10 15
Gln Arg Ser Arg Arg Lys Lys Asn Ala Ala Ser Gly Asp Asn Ser Glu
20 25 30
Ser Leu Leu Ala Gly Gln Gly Ser Gly Asp Gly Lys Arg Ala Leu Tyr
35 40 45
His Cys Asn Tyr Cys Asn Lys Asp Ile Thr Gly Lys Thr Arg Ile Lys
50 55 60
Cys Ala Met Cys Pro Asp Phe Asp Leu Cys Leu Glu Cys Phe Ser Val
65 70 75 80
Gly Ala Glu Val Thr Pro His Lys Ser Asn His Pro Tyr Arg Val Met
85 90 95
Asp Asn Leu Ser Phe Pro Leu Ile Cys Pro Asp Trp Asn Ala Asp Glu
100 105 110
Glu Ile Leu Leu Leu Glu Gly Ile Glu Met Tyr Gly Leu Gly Asn Trp
115 120 125
Ala Glu Ile Ala Glu His Val Gly Thr Lys Ser Lys Asp Thr Cys Ile
130 135 140
Glu His Tyr Asn Ser Val Tyr Met Gln Ser Gln Tyr Phe Pro Leu Pro
145 150 155 160
Asp Met Ser Leu Val Val Gly Lys Asn Arg Lys Glu Leu Leu Ala Met
165 170 175
Ala Lys Gly Tyr Ser Glu Asp Lys Lys Gly Ala Ala Met Leu Gly Asp
180 185 190
Leu Thr Leu Lys Glu Glu Ser Pro Phe Ser Pro Ser Arg Val Lys Val
195 200 205
Glu Glu Met His Lys Gly Gly Ser Ser Gly Arg Leu Ser Thr Leu Asn
210 215 220
Ser Glu Val Glu Ser Ala Gly Arg Pro Thr Thr Thr Asn Ser Ala Ala
225 230 235 240
Thr Ala Ala Asn Lys Lys Ala Ser Ser Ile Ala Arg Val Lys Asp Gly
245 250 255
Pro Asn Val Val Lys Val Glu Asp Pro Gln Val Asp Arg Asn Ala Lys
260 265 270
Gly Lys Lys Pro Asn Ser Ser Gly Ser Glu Gly Pro Ser Leu Met Glu
275 280 285
Leu Ser Gly Tyr Asn Pro Lys Arg Gln Glu Phe Asp Pro Glu Tyr Asp
290 295 300
Asn Asp Ala Glu Gln Leu Leu Ala Glu Met Glu Phe Lys Asp Thr Asp
305 310 315 320
Thr Glu Glu Glu Arg Glu Leu Lys Leu Arg Val Leu His Ile Tyr Ser
325 330 335
Lys Arg Leu Asp Glu Arg Lys Arg Arg Lys Asp Phe Ile Leu Glu Arg
340 345 350
Asn Leu Leu Gln Pro Ser Pro Phe Glu Lys Asp Leu Thr Pro Glu Glu
355 360 365
Arg Ala Leu Cys Arg Arg Tyr Asp Pro Phe Met Arg Phe His Ser Lys
370 375 380
Glu Glu His Glu Glu Leu Leu Gln Val Val Ile Glu Glu His Arg Met
385 390 395 400
Leu Lys Arg Ile Glu Glu Leu Lys Glu Ala Gln Ala Ala Gly Cys Arg
405 410 415
Thr Ala Ala Glu Ala Asp Arg Tyr Leu Glu Gln Lys Arg Lys Lys Glu
420 425 430
Ala Glu Glu Asn Ser Ser Arg Leu Lys Asp Asn Ala Leu Val Gly Pro
435 440 445
Ser Asn His Gly Ala Pro Asn Ala Phe Ile Pro Ser Glu Ser Val Arg
450 455 460
Lys Asp Ser Ser Thr Arg Pro Val Gly Gln Gly Ser Ala Ser Tyr Ala
465 470 475 480
Asn Gly Leu Asp Thr Thr Gly Phe Tyr Glu Thr Gln Leu Leu Ser Glu
485 490 495
Thr Glu Lys Arg Leu Cys Arg Glu Ile His Leu Pro Pro Pro Val Tyr
500 505 510
Leu Lys Met Gln Glu Val Met Thr Lys Glu Ile Phe Ser Gly Asn Ile
515 520 525
Thr Lys Lys Leu Asp Ala His Pro Leu Phe Lys Ile Glu Ala Ser Lys
530 535 540
Val Asp Arg Val Tyr Asp Ile Leu Val Lys Lys Gly Ile Ala Gln Pro
545 550 555 560
<210>199
<211>1680
<212>DNA
<213>Populus trichocarpa
<400>199
atgggtcgtt ctcgtgggaa ttttcactct actgatgaag accctactca gagatcaaga 60
aggaaaaaga acgctgcaag tggagagaat tcggaatctt catcggcagg ccaaggaagt 120
agtgatggta aaagggcatt ataccattgc aattattgca ataaagacat aacagggaag 180
acccgtatca aatgtgctgt gtgccctgat tttgacctat gtttagagtg cttctctgta 240
ggagctgagg ttacgcctca taaaagcaat cacccttaca gggttatgga taacctatct 300
ttcccactta tttgccctga ctggaatgca gatgaagaaa tactgcttct agagggaatt 360
gaaatgtatg gattggggaa ttgggcagaa gttgctgagc atgtggggac aaagaataaa 420
gaaacatgta tcaaacacta taatagcgta tacttgcaat cccagttctt ccctctcccg 480
gacatgtctc atgttgttgg gaaaaataga aaggagcttc ttgctatggc caagggacat 540
agtgaggaca aaaaaggtac ttctatgctt ggggagcata ctttgaagga agaatctcca 600
ttttctcctt caagagtcaa ggtcgaagaa atgcataaag taggttcctc tggccgatta 660
tcaacattaa attcagagtt agaaactgcg agccgtccta atagcgcaaa ttctgcagca 720
acagctgcta ataagaaggc atcaagcatg gcccgaatta atgatggacc cggtgttaag 780
gtggaagatc ctcaagtgga ccgaaatttc aaggggaaga aaccaagttc ctcagggagt 840
gagggtccat cattaatgga gttgagcggt tataatccca agaggcagga gtttgatcct 900
gaatatgaca atgatgctga gcagttgctg gctgagatgg aatttaaaga taatgacact 960
gaggaagagc gtgagctgaa gttgcgagtg ctgcgtatat attcaaggag gcttgatgag 1020
agaaagcgca gaaaagattt cattcttgaa agaaatctac tacatccaag tcctttcgaa 1080
aaggacttga ctccagaaga gagggcatta tgtcggcgtt ttgacccttt catgcgtttt 1140
cattccaagg aagagcatga agaactgctt cgggctgttg tcaaggagca ctggatgctg 1200
aaaagggtcg aagagctgaa ggatgcccaa gtggctggtt gccgcacagc agttgaggca 1260
gacaggtatc ttgagcacaa gagaaaaata gaagctgagg aaacttccag gagactgaaa 1320
gacaatgctc agattggtcc tagcagtcag ggtgctccca atgcattcat gtctccagac 1380
tctgttggga aggattcaag caccagacct gcaggacagg gctcttctag ctatgccaat 1440
gatttggaca taatgggctt ttatgaaacg cagctactgt ctgaaactga aaaacggcta 1500
tgctgcgaga ttcacctacc tccacctgtc tacctcaaga tgcaggaggt gatgaccaaa 1560
gagatcttca gcggtaacat cactaagaaa tcagatgctc accccttgtt caagattgaa 1620
gcaagcaaag ttgatggggt gtatgatatg cttgtgaaga aggggattgc tcaaccttga 1680
<210>200
<211>559
<212>PRT
<213>Populus trichocarpa
<400>200
Met Gly Arg Ser Arg Gly Asn Phe His Ser Thr Asp Glu Asp Pro Thr
1 5 10 15
Gln Arg Ser Arg Arg Lys Lys Asn Ala Ala Ser Gly Glu Asn Ser Glu
20 25 30
Ser Ser Ser Ala Gly Gln Gly Ser Ser Asp Gly Lys Arg Ala Leu Tyr
35 40 45
His Cys Asn Tyr Cys Asn Lys Asp Ile Thr Gly Lys Thr Arg Ile Lys
50 55 60
Cys Ala Val Cys Pro Asp Phe Asp Leu Cys Leu Glu Cys Phe Ser Val
65 70 75 80
Gly Ala Glu Val Thr Pro His Lys Ser Asn His Pro Tyr Arg Val Met
85 90 95
Asp Asn Leu Ser Phe Pro Leu Ile Cys Pro Asp Trp Asn Ala Asp Glu
100 105 110
Glu Ile Leu Leu Leu Glu Gly Ile Glu Met Tyr Gly Leu Gly Asn Trp
115 120 125
Ala Glu Val Ala Glu His Val Gly Thr Lys Asn Lys Glu Thr Cys Ile
130 135 140
Lys His Tyr Asn Ser Val Tyr Leu Gln Ser Gln Phe Phe Pro Leu Pro
145 150 155 160
Asp Met Ser His Val Val Gly Lys Asn Arg Lys Glu Leu Leu Ala Met
165 170 175
Ala Lys Gly His Ser Glu Asp Lys Lys Gly Thr Ser Met Leu Gly Glu
180 185 190
His Thr Leu Lys Glu Glu Ser Pro Phe Ser Pro Ser Arg Val Lys Val
195 200 205
Glu Glu Met His Lys Val Gly Ser Ser Gly Arg Leu Ser Thr Leu Asn
210 215 220
Ser Glu Leu Glu Thr Ala Ser Arg Pro Asn Ser Ala Asn Ser Ala Ala
225 230 235 240
Thr Ala Ala Asn Lys Lys Ala Ser Ser Met Ala Arg Ile Asn Asp Gly
245 250 255
Pro Gly Val Lys Val Glu Asp Pro Gln Val Asp Arg Asn Phe Lys Gly
260 265 270
Lys Lys Pro Ser Ser Ser Gly Ser Glu Gly Pro Ser Leu Met Glu Leu
275 280 285
Ser Gly Tyr Asn Pro Lys Arg Gln Glu Phe Asp Pro Glu Tyr Asp Asn
290 295 300
Asp Ala Glu Gln Leu Leu Ala Glu Met Glu Phe Lys Asp Asn Asp Thr
305 310 315 320
Glu Glu Glu Arg Glu Leu Lys Leu Arg Val Leu Arg Ile Tyr Ser Arg
325 330 335
Arg Leu Asp Glu Arg Lys Arg Arg Lys Asp Phe Ile Leu Glu Arg Asn
340 345 350
Leu Leu His Pro Ser Pro Phe Glu Lys Asp Leu Thr Pro Glu Glu Arg
355 360 365
Ala Leu Cys Arg Arg Phe Asp Pro Phe Met Arg Phe His Ser Lys Glu
370 375 380
Glu His Glu Glu Leu Leu Arg Ala Val Val Lys Glu His Trp Met Leu
385 390 395 400
Lys Arg Val Glu Glu Leu Lys Asp Ala Gln Val Ala Gly Cys Arg Thr
405 410 415
Ala Val Glu Ala Asp Arg Tyr Leu Glu His Lys Arg Lys Ile Glu Ala
420 425 430
Glu Glu Thr Ser Arg Arg Leu Lys Asp Asn Ala Gln Ile Gly Pro Ser
435 440 445
Ser Gln Gly Ala Pro Asn Ala Phe Met Ser Pro Asp Ser Val Gly Lys
450 455 460
Asp Ser Ser Thr Arg Pro Ala Gly Gln Gly Ser Ser Ser Tyr Ala Asn
465 470 475 480
Asp Leu Asp Ile Met Gly Phe Tyr Glu Thr Gln Leu Leu Ser Glu Thr
485 490 495
Glu Lys Arg Leu Cys Cys Glu Ile His Leu Pro Pro Pro Val Tyr Leu
500 505 510
Lys Met Gln Glu Val Met Thr Lys Glu Ile Phe Ser Gly Asn Ile Thr
515 520 525
Lys Lys Ser Asp Ala His Pro Leu Phe Lys Ile Glu Ala Ser Lys Val
530 535 540
Asp Gly Val Tyr Asp Met Leu Val Lys Lys Gly Ile Ala Gln Pro
545 550 555
<210>201
<211>1722
<212>DNA
<213>Vitis vinifera
<400>201
atgggtcgtt ctcgcgcagt tctgcattct actgacgatg atcaaggttc acacagatcc 60
aagagaagaa agactgcttc aacagcagac aatttagagg gtgcaactgc aggccaagga 120
atgagtgagg ggaagmgagc ttcataccac tgtaattatt gcagcaaaga tatctcagga 180
aagatccgta ccaaatgtgt agtttgtccc gattttgacc tttgcattga atgcttttcc 240
attggagctg aggttacacc tcaygtatgc tttcatccat atcgggtcat ggacaattta 300
tcgttcccac tcatttgtcc tgattggaat gcagatgaag agatgttact tctggaggga 360
attgaaatgt acggactggg gaactggagt gaagtttcag aacatgttgg aaccaaaaga 420
aaatcagaat gtatcgatca ctatgttgct atatacatga attccccatg ctttcctctt 480
cctgacatgt cccatgttct tggaaagact agggctgagc tccttgccat ggccagggga 540
gaagatgaag tcaagaaagg atcccctaca catggggagt taactctgaa agtggaatct 600
cccttatctg caagagtcaa gtacggcaaa tgcatgctaa aatgtgtttc tctcatctgt 660
cagagatcca atccgacatg gatttctagc agcactaaaa catctgcagg tgcagttaag 720
agggcatcta acatggccca ggttaaggat ggtcgtgata acattaaagt ggaagaaact 780
caaacagaca gaagtgttgg agagaaaaag cctaggacct cgggggatga ggggccttct 840
gtgacagagc tgagtggata caatttcaag aggcaagagt ttgatgttga gtatgataat 900
gatgctgagc agttactggc tgatatggaa ttcaaggatg ctgacactga tgctgagcat 960
gaactgaaac tgcaagttct gcatatttat tccaaaaggc ttgatgagag gaaacgcagg 1020
aaggatttca tattggaaag aaatctactt taccctgacc cttttgagaa gaacctctca 1080
cctgaagaga gggacgtaaa tcagcgcttc aaggtcttta tgcggttcca ctcaaaagaa 1140
gaacatgagg aactgcttag ggttgtgctc gaggaacatt ggattcagaa aagaatacaa 1200
gatcttcagg acgcccgagc tgctggctgc cgtacatctg ctgaggcaga gagatatctt 1260
gaagagaaag ggaagaaaga agctgaagaa agtgcccaac aagcaaagga aagtgctgag 1320
gctggtccta gtgggggtaa agtcttacag agggtgaaca ctgccaaagg agaatctgat 1380
ggcagtcctc ggggaggtgg aagaggttct gcaggtctag aacctggaat caaggacact 1440
tcttcaacaa ctgcaggaca tgctatctta agatccctag atgtttggga tatcactgga 1500
tttccggggg aagatttact ctcagaaact gagaaacagc tttgcagtga gatcagaatc 1560
ctcccttcac attatctcaa catgctgcac accatgttga cagagacgtt aaatggaaac 1620
atcaccagga aatcagatgc ccatggcctg ttcaaggttg aaccaagcaa agtagacaaa 1680
gtgtacgata tgtttgtgaa gaagggaatt gttaagtcat ag 1722
<210>202
<211>573
<212>PRT
<213>Vitis vinifera
<400>202
Met Gly Arg Ser Arg Ala Val Leu His Ser Thr Asp Asp Asp Gln Gly
1 5 10 15
Ser His Arg Ser Lys Arg Arg Lys Thr Ala Ser Thr Ala Asp Asn Leu
20 25 30
Glu Gly Ala Thr Ala Gly Gln Gly Met Ser Glu Gly Lys Arg Ala Ser
35 40 45
Tyr His Cys Asn Tyr Cys Ser Lys Asp Ile Ser Gly Lys Ile Arg Thr
50 55 60
Lys Cys Val Val Cys Pro Asp Phe Asp Leu Cys Ile Glu Cys Phe Ser
65 70 75 80
Ile Gly Ala Glu Val Thr Pro His Val Cys Phe His Pro Tyr Arg Val
85 90 95
Met Asp Asn Leu Ser Phe Pro Leu Ile Cys Pro Asp Trp Asn Ala Asp
100 105 110
Glu Glu Met Leu Leu Leu Glu Gly Ile Glu Met Tyr Gly Leu Gly Asn
115 120 125
Trp Ser Glu Val Ser Glu His Val Gly Thr Lys Arg Lys Ser Glu Cys
130 135 140
Ile Asp His Tyr Val Ala Ile Tyr Met Asn Ser Pro Cys Phe Pro Leu
145 150 155 160
Pro Asp Met Ser His Val Leu Gly Lys Thr Arg Ala Glu Leu Leu Ala
165 170 175
Met Ala Arg Gly Glu Asp Glu Val Lys Lys Gly Ser Pro Thr His Gly
180 185 190
Glu Leu Thr Leu Lys Val Glu Ser Pro Leu Ser Ala Arg Val Lys Tyr
195 200 205
Gly Lys Cys Met Leu Lys Cys Val Ser Leu Ile Cys Gln Arg Ser Asn
210 215 220
Pro Thr Trp Ile Ser Ser Ser Thr Lys Thr Ser Ala Gly Ala Val Lys
225 230 235 240
Arg Ala Ser Asn Met Ala Gln Val Lys Asp Gly Arg Asp Asn Ile Lys
245 250 255
Val Glu Glu Thr Gln Thr Asp Arg Ser Val Gly Glu Lys Lys Pro Arg
260 265 270
Thr Ser Gly Asp Glu Gly Pro Ser Val Thr Glu Leu Ser Gly Tyr Asn
275 280 285
Phe Lys Arg Gln Glu Phe Asp Val Glu Tyr Asp Asn Asp Ala Glu Gln
290 295 300
Leu Leu Ala Asp Met Glu Phe Lys Asp Ala Asp Thr Asp Ala Glu His
305 310 315 320
Glu Leu Lys Leu Gln Val Leu His Ile Tyr Ser Lys Arg Leu Asp Glu
325 330 335
Arg Lys Arg Arg Lys Asp Phe Ile Leu Glu Arg Asn Leu Leu Tyr Pro
340 345 350
Asp Pro Phe Glu Lys Asn Leu Ser Pro Glu Glu Arg Asp Val Asn Gln
355 360 365
Arg Phe Lys Val Phe Met Arg Phe His Ser Lys Glu Glu His Glu Glu
370 375 380
Leu Leu Arg Val Val Leu Glu Glu His Trp Ile Gln Lys Arg Ile Gln
385 390 395 400
Asp Leu Gln Asp Ala Arg Ala Ala Gly Cys Arg Thr Ser Ala Glu Ala
405 410 415
Glu Arg Tyr Leu Glu Glu Lys Gly Lys Lys Glu Ala Glu Glu Ser Ala
420 425 430
Gln Gln Ala Lys Glu Ser Ala Glu Ala Gly Pro Ser Gly Gly Lys Val
435 440 445
Leu Gln Arg Val Asn Thr Ala Lys Gly Glu Ser Asp Gly Ser Pro Arg
450 455 460
Gly Gly Gly Arg Gly Ser Ala Gly Leu Glu Pro Gly Ile Lys Asp Thr
465 470 475 480
Ser Ser Thr Thr Ala Gly His Ala Ile Leu Arg Ser Leu Asp Val Trp
485 490 495
Asp Ile Thr Gly Phe Pro Gly Glu Asp Leu Leu Ser Glu Thr Glu Lys
500 505 510
Gln Leu Cys Ser Glu Ile Arg Ile Leu Pro Ser His Tyr Leu Asn Met
515 520 525
Leu His Thr Met Leu Thr Glu Thr Leu Asn Gly Asn Ile Thr Arg Lys
530 535 540
Ser Asp Ala His Gly Leu Phe Lys Val Glu Pro Ser Lys Val Asp Lys
545 550 555 560
Val Tyr Asp Met Phe Val Lys Lys Gly Ile Val Lys Ser
565 570
<210>203
<211>1698
<212>DNA
<213>Zea mays
<400>203
atggggcggt cgcgaggggt gcagaattcc ggcgacgacg acaccgtaca caggtcgaag 60
cggaggaggg tcgcatcggg cggggatgcg acggactccg tttccgctgg catcggggga 120
gctggagaag gagggggcaa gaaagcgctc taccactgca attactgcaa caaggacatc 180
tctgggaaga tacggatcaa atgctccaag tgccctgact tcgacctttg cgtggagtgc 240
ttctctgttg gcgctgaagt caccccacat cgcagcaacc atccttacaa agtcatggac 300
aacctgtctt tcccacttat ttgcccagat tggaatgcag acgaagaaat tctcctcctt 360
gagggaattg aaatgtatgg tctgggaaac tggcttgaag ttgcagagca tgttggtacc 420
aagtctaagt tacagtgtat tgatcattac acaacagcat acatgaactc accttgttat 480
cccctaccgg atatgtctca tgttaatggc aagaacagga aggagcttct agctatggct 540
aaagtgcagg gtgaaagtaa aaaagggact tcactgttgc ctggagagct gactcctaag 600
gctgaatctc cattttctcc ctccagggtc aaggtggaag atgcacttgg agaaggtcta 660
gcaggtcgat caccttcgca catagctgtt ggtgcaaata aaaaagcttc aaatgtggga 720
catattaaag atgggtctaa tgtatcaaaa gttgaagatg gtcatgtcga tagaagtgtt 780
ggtgtgaaga agcccagata ttctgcagat gaaggccctt cgttgactga actgagtgga 840
tacaatgcaa agagacacga gtttgaccca gagtatgata atgatgccga acaagcgctt 900
gctgagatgg aatttaaaga aactgattca gaaactgatc gtgaactgaa actccgtgtg 960
ctgcgtattt atctgtccag gcttgatgaa agaaaaagga gaaaagagtt catattggaa 1020
aggaatttat tgtttcctaa tcccttggag aaggatctta caaatgaaga cagggaagtt 1080
taccatcggt ataaggtctt catgcgtttt ctttccaagg aggaacatga agcccttgtt 1140
aggagtgtca ttgaagagcg aaaaattcgg aggagaattc aagaactcca ggaatgtcgt 1200
tctgctggat gccgcacact tgctgaagca aagatacaca tagagcaaaa gaggaaaaaa 1260
gaatacgagc tgaatgccca aaaagctaag gaaagtaacc accttattgc aaatactaaa 1320
ttggtgcaga agatgaatcg acctatgaag attgagtctg atgggaattt ggatccaaag 1380
aaaggtggtg ttgcgttaga ttctcctaaa actacaggac ttacaagtgt taagcagtgg 1440
gatgactggg atatagttgg tcttcctggg gcgaagctat taagtgctag cgaaaagctt 1500
ctatgttgcc agaacagact gctacccagt cattacctga gaatgcagga ggtgctgatg 1560
caggagatat tcaagggtag tgtcctaaag aaggaagacg cacacgtctt gtttaaggtt 1620
gatcctacca aagtagatag tgtttatgat atggtaacaa aaaagctggg caaccatgtg 1680
gagttgccta cggtctag 1698
<210>204
<211>565
<212>PRT
<213>Zea mays
<400>204
Met Gly Arg Ser Arg Gly Val Gln Asn Ser Gly Asp Asp Asp Thr Val
1 5 10 15
His Arg Ser Lys Arg Arg Arg Val Ala Ser Gly Gly Asp Ala Thr Asp
20 25 30
Ser Val Ser Ala Gly Ile Gly Gly Ala Gly Glu Gly Gly Gly Lys Lys
35 40 45
Ala Leu Tyr His Cys Asn Tyr Cys Asn Lys Asp Ile Ser Gly Lys Ile
50 55 60
Arg Ile Lys Cys Ser Lys Cys Pro Asp Phe Asp Leu Cys Val Glu Cys
65 70 75 80
Phe Ser Val Gly Ala Glu Val Thr Pro His Arg Ser Asn His Pro Tyr
85 90 95
Lys Val Met Asp Asn Leu Ser Phe Pro Leu Ile Cys Pro Asp Trp Asn
100 105 110
Ala Asp Glu Glu Ile Leu Leu Leu Glu Gly Ile Glu Met Tyr Gly Leu
115 120 125
Gly Asn Trp Leu Glu Val Ala Glu His Val Gly Thr Lys Ser Lys Leu
130 135 140
Gln Cys Ile Asp His Tyr Thr Thr Ala Tyr Met Asn Ser Pro Cys Tyr
145 150 155 160
Pro Leu Pro Asp Met Ser His Val Asn Gly Lys Asn Arg Lys Glu Leu
165 170 175
Leu Ala Met Ala Lys Val Gln Gly Glu Ser Lys Lys Gly Thr Ser Leu
180 185 190
Leu Pro Gly Glu Leu Thr Pro Lys Ala Glu Ser Pro Phe Ser Pro Ser
195 200 205
Arg Val Lys Val Glu Asp Ala Leu Gly Glu Gly Leu Ala Gly Arg Ser
210 215 220
Pro Ser His Ile Ala Val Gly Ala Asn Lys Lys Ala Ser Asn Val Gly
225 230 235 240
His Ile Lys Asp Gly Ser Asn Val Ser Lys Val Glu Asp Gly His Val
245 250 255
Asp Arg Ser Val Gly Val Lys Lys Pro Arg Tyr Ser Ala Asp Glu Gly
260 265 270
Pro Ser Leu Thr Glu Leu Ser Gly Tyr Asn Ala Lys Arg His Glu Phe
275 280 285
Asp Pro Glu Tyr Asp Asn Asp Ala Glu Gln Ala Leu Ala Glu Met Glu
290 295 300
Phe Lys Glu Thr Asp Ser Glu Thr Asp Arg Glu Leu Lys Leu Arg Val
305 310 315 320
Leu Arg Ile Tyr Leu Ser Arg Leu Asp Glu Arg Lys Arg Arg Lys Glu
325 330 335
Phe Ile Leu Glu Arg Asn Leu Leu Phe Pro Asn Pro Leu Glu Lys Asp
340 345 350
Leu Thr Asn Glu Asp Arg Glu Val Tyr His Arg Tyr Lys Val Phe Met
355 360 365
Arg Phe Leu Ser Lys Glu Glu His Glu Ala Leu Val Arg Ser Val Ile
370 375 380
Glu Glu Arg Lys Ile Arg Arg Arg Ile Gln Glu Leu Gln Glu Cys Arg
385 390 395 400
Ser Ala Gly Cys Arg Thr Leu Ala Glu Ala Lys Ile His Ile Glu Gln
405 410 415
Lys Arg Lys Lys Glu Tyr Glu Leu Asn Ala Gln Lys Ala Lys Glu Ser
420 425 430
Asn His Leu Ile Ala Asn Thr Lys Leu Val Gln Lys Met Asn Arg Pro
435 440 445
Met Lys Ile Glu Ser Asp Gly Asn Leu Asp Pro Lys Lys Gly Gly Val
450 455 460
Ala Leu Asp Ser Pro Lys Thr Thr Gly Leu Thr Ser Val Lys Gln Trp
465 470 475 480
Asp Asp Trp Asp Ile Val Gly Leu Pro Gly Ala Lys Leu Leu Ser Ala
485 490 495
Ser Glu Lys Leu Leu Cys Cys Gln Asn Arg Leu Leu Pro Ser His Tyr
500 505 510
Leu Arg Met Gln Glu Val Leu Met Gln Glu Ile Phe Lys Gly Ser Val
515 520 525
Leu Lys Lys Glu Asp Ala His Val Leu Phe Lys Val Asp Pro Thr Lys
530 535 540
Val Asp Ser Val Tyr Asp Met Val Thr Lys Lys Leu Gly Asn His Val
545 550 555 560
Glu Leu Pro Thr Val
565
<210>205
<211>1698
<212>DNA
<213>Zea mays
<400>205
atggggcggt cgcgaggggt gctgagttcc ggcgacgacg acaccgggca caggtcgaag 60
cggaggagag tctcgtcggg cggggatgcg acggactcca tttcagcctc catcggggga 120
gctggagagg gagggggcaa gaaggcgctc tatcactgca actactgcaa caaggacatc 180
tccgggaaga tacggatcaa gtgctccaag tgccctgact tcgacctttg cgtggagtgt 240
ttctctgtcg gcgctgaagt caccccgcac cgcagcaacc atccttacaa agtcatggac 300
aacttgtctt tcccacttat ttgcccggat tggaatgcag atgaagaaat tctcctcctt 360
gagggaattg aaatgtatgg tctgggaaac tggcttgaag ttgcagagca tgttggtacc 420
aagtctaagt tacagtgtat tgatcattac acatcagcat acatgaactc accttgttat 480
cccctcccgg atatgtctca tgttaatggc aagaatagga aggaacttct agctatggct 540
aaagtacagg gtgagagtaa aaaagggact ttgctgttac ctggagaact cactcctaag 600
gttgaatctc aattttctcc ctccagggtc aaggtggaag atgcacttgg agaaggtcca 660
gcaggtcgat caccttcaca catggctgtt ggtgcaaata aaaaagcttc aaatgtggga 720
catattaagg atggcgctac tgtatcaaaa gtcgaagatg ttcatgtaga tagaagtgtt 780
ggtgtgaaga agcccagata ttctgcagat gaaggccctt cgttgactga actgagtgga 840
tacaatgcaa agagacatga gtttgaccca gaatacgata atgatgccga acaagctctt 900
gctgagatgg aatttaaaga aactgattcg gaaactgatc gtgaactgaa actccgtgtg 960
ctgcgtattt acctctccag gcttgatgaa agaaaaagaa gaaaggagtt catattggaa 1020
agaaatttgt tgttccctaa tcccttggag aaggatctta cgagtgaaga cagggaactt 1080
taccatcgct ataaagtctt catgcgtttt ctttctaagg aggaacatga agccctcgtt 1140
aggagtgtta ttgaggagcg aaaaattcgg aggagaattc aagaactcca ggaatgccgt 1200
tctgctggat gccgcacact ggctgaagca aagatacaca tagagcaaaa gaggaaaaaa 1260
gaatacgagc tgaatgcgca aaaagctaag gatagcagtc aacttaatgc aaataataaa 1320
tcagtacaaa agatgaatcg acctatgaaa attgagtccg atgggaattt ggatccaaag 1380
aaaggtggtg ctggcttgga ttctcctaag acaacaggac ctacaagtgt taagcagtgg 1440
gatgactggg atatagttgg tcttcctggg gcagagctat taagtgctag cgaaaagctt 1500
ctatgctgtc agaacagatt gctacccagc cattacctga gaatgcagga ggtgctgatg 1560
caggagatat tcaagggtag cgtcctaaag aaggaagacg cccacgtctt atttaaggtc 1620
gatcctacca aagtagatag tgtttatgat atggtatcta aaaaactggg caaccatgag 1680
gaggccccaa ccgtctag 1698
<210>206
<211>565
<212>PRT
<213>Zea mays
<400>206
Met Gly Arg Ser Arg Gly Val Leu Ser Ser Gly Asp Asp Asp Thr Gly
1 5 10 15
His Arg Ser Lys Arg Arg Arg Val Ser Ser Gly Gly Asp Ala Thr Asp
20 25 30
Ser Ile Ser Ala Ser Ile Gly Gly Ala Gly Glu Gly Gly Gly Lys Lys
35 40 45
Ala Leu Tyr His Cys Asn Tyr Cys Asn Lys Asp Ile Ser Gly Lys Ile
50 55 60
Arg Ile Lys Cys Ser Lys Cys Pro Asp Phe Asp Leu Cys Val Glu Cys
65 70 75 80
Phe Ser Val Gly Ala Glu Val Thr Pro His Arg Ser Asn His Pro Tyr
85 90 95
Lys Val Met Asp Asn Leu Ser Phe Pro Leu Ile Cys Pro Asp Trp Asn
100 105 110
Ala Asp Glu Glu Ile Leu Leu Leu Glu Gly Ile Glu Met Tyr Gly Leu
115 120 125
Gly Asn Trp Leu Glu Val Ala Glu His Val Gly Thr Lys Ser Lys Leu
130 135 140
Gln Cys Ile Asp His Tyr Thr Ser Ala Tyr Met Asn Ser Pro Cys Tyr
145 150 155 160
Pro Leu Pro Asp Met Ser His Val Asn Gly Lys Asn Arg Lys Glu Leu
165 170 175
Leu Ala Met Ala Lys Val Gln Gly Glu Ser Lys Lys Gly Thr Leu Leu
180 185 190
Leu Pro Gly Glu Leu Thr Pro Lys Val Glu Ser Gln Phe Ser Pro Ser
195 200 205
Arg Val Lys Val Glu Asp Ala Leu Gly Glu Gly Pro Ala Gly Arg Ser
210 215 220
Pro Ser His Met Ala Val Gly Ala Asn Lys Lys Ala Ser Asn Val Gly
225 230 235 240
His Ile Lys Asp Gly Ala Thr Val Ser Lys Val Glu Asp Val His Val
245 250 255
Asp Arg Ser Val Gly Val Lys Lys Pro Arg Tyr Ser Ala Asp Glu Gly
260 265 270
Pro Ser Leu Thr Glu Leu Ser Gly Tyr Asn Ala Lys Arg His Glu Phe
275 280 285
Asp Pro Glu Tyr Asp Asn Asp Ala Glu Gln Ala Leu Ala Glu Met Glu
290 295 300
Phe Lys Glu Thr Asp Ser Glu Thr Asp Arg Glu Leu Lys Leu Arg Val
305 310 315 320
Leu Arg Ile Tyr Leu Ser Arg Leu Asp Glu Arg Lys Arg Arg Lys Glu
325 330 335
Phe Ile Leu Glu Arg Asn Leu Leu Phe Pro Asn Pro Leu Glu Lys Asp
340 345 350
Leu Thr Ser Glu Asp Arg Glu Leu Tyr His Arg Tyr Lys Val Phe Met
355 360 365
Arg Phe Leu Ser Lys Glu Glu His Glu Ala Leu Val Arg Ser Val Ile
370 375 380
Glu Glu Arg Lys Ile Arg Arg Arg Ile Gln Glu Leu Gln Glu Cys Arg
385 390 395 400
Ser Ala Gly Cys Arg Thr Leu Ala Glu Ala Lys Ile His Ile Glu Gln
405 410 415
Lys Arg Lys Lys Glu Tyr Glu Leu Asn Ala Gln Lys Ala Lys Asp Ser
420 425 430
Ser Gln Leu Asn Ala Asn Asn Lys Ser Val Gln Lys Met Asn Arg Pro
435 440 445
Met Lys Ile Glu Ser Asp Gly Asn Leu Asp Pro Lys Lys Gly Gly Ala
450 455 460
Gly Leu Asp Ser Pro Lys Thr Thr Gly Pro Thr Ser Val Lys Gln Trp
465 470 475 480
Asp Asp Trp Asp Ile Val Gly Leu Pro Gly Ala Glu Leu Leu Ser Ala
485 490 495
Ser Glu Lys Leu Leu Cys Cys Gln Asn Arg Leu Leu Pro Ser His Tyr
500 505 510
Leu Arg Met Gln Glu Val Leu Met Gln Glu Ile Phe Lys Gly Ser Val
515 520 525
Leu Lys Lys Glu Asp Ala His Val Leu Phe Lys Val Asp Pro Thr Lys
530 535 540
Val Asp Ser Val Tyr Asp Met Val Ser Lys Lys Leu Gly Asn His Glu
545 550 555 560
Glu Ala Pro Thr Val
565
<210>207
<211>46
<212>PRT
<213>Artificial sequence
<220>
<223>Zinc finger, ZZ type
<400>207
Lys Pro Gly Leu Tyr Cys Cys Asn Tyr Cys Asp Lys Asp Leu Ser Gly
1 5 10 15
Leu Val Arg Phe Lys Cys Ala Val Cys Met Asp Phe Asp Leu Cys Val
20 25 30
Glu Cys Phe Ser Val Gly Val Glu Leu Asn Arg His Lys Asn
35 40 45
<210>208
<211>49
<212>PRT
<213>Artificial sequence
<220>
<223>SANT DNA-binding domain
<400>208
Val Thr Ser Asp Trp Asn Ala Asp Glu Glu Ile Leu Leu Leu Glu Ala
1 5 10 15
Ile Ala Thr Tyr Gly Phe Gly Asn Trp Lys Glu Val Ala Asp His Val
20 25 30
Gly Ser Lys Thr Thr Thr Glu Cys Ile Lys His Phe Asn Ser Ala Tyr
35 40 45
Met
<210>209
<211>13
<212>PRT
<213>Artificial sequence
<220>
<223>Calcium-binding EF-hand
<400>209
Asp Asn Asp Ala Glu Gln Leu Leu Ala Asp Met Glu Phe
1 5 10
<210>210
<211>88
<212>PRT
<213>Artificial sequence
<220>
<223>SWIRM
<400>210
Pro Arg Ile Tyr Ser Gly Leu Asp Thr Trp Asp Val Asp Gly Leu Leu
1 5 10 15
Gly Ala Asp Leu Leu Ser Glu Thr Glu Lys Lys Met Cys Asn Glu Thr
20 25 30
Arg Ile Leu Pro Val His Tyr Leu Lys Met Leu Asp Ile Leu Thr Arg
35 40 45
Glu Ile Lys Lys Gly Gln Ile Lys Lys Lys Ser Asp Ala Tyr Ser Phe
50 55 60
Phe Lys Val Glu Pro Ser Lys Val Asp Arg Val Tyr Asp Met Leu Val
65 70 75 80
His Lys Gly Ile Gly Asp Ser Thr
85
<210>211
<211>55
<212>DNA
<213>Artificial sequence
<220>
<223>Primer 1
<400>211
ggggacaagt ttgtacaaaa aagcaggctt aaacaatggg tcgttcgaaa ctagc 55
<210>212
<211>52
<212>DNA
<213>Artificial sequence
<220>
<223>Primer 2
<400>212
ggggaccact ttgtacaaga aagctgggtc atgttaggac catgaagcta tg 52
<210>213
<211>1130
<212>DNA
<213>Oryza sativa
<400>213
catgcggcta atgtagatgc tcactgcgct agtagtaagg tactccagta cattatggaa 60
tatacaaagc tgtaatactc gtatcagcaa gagagaggca cacaagttgt agcagtagca 120
caggattaga aaaacgggac gacaaatagt aatggaaaaa caaaaaaaaa caaggaaaca 180
catggcaata taaatggaga aatcacaaga ggaacagaat ccgggcaata cgctgcgaaa 240
gtactcgtac gtaaaaaaaa gaggcgcatt catgtgtgga cagcgtgcag cagaagcagg 300
gatttgaaac cactcaaatc caccactgca aaccttcaaa cgaggccatg gtttgaagca 360
tagaaagcac aggtaagaag cacaacgccc tcgctctcca ccctcccacc caatcgcgac 420
gcacctcgcg gatcggtgac gtggcctcgc cccccaaaaa tatcccgcgg cgtgaagctg 480
acaccccggg cccacccacc tgtcacgttg gcacatgttg gttatggttc ccggccgcac 540
caaaatatca acgcggcgcg gcccaaaatt tccaaaatcc cgcccaagcc cctggcgcgt 600
gccgctcttc cacccaggtc cctctcgtaa tccataatgg cgtgtgtacc ctcggctggt 660
tgtacgtggg cgggttaccc tgggggtgtg ggtggatgac gggtgggccc ggaggaggtc 720
cggccccgcg cgtcatcgcg gggcggggtg tagcgggtgc gaaaaggagg cgatcggtac 780
gaaaattcaa attaggaggt ggggggcggg gcccttggag aataagcgga atcgcagata 840
tgcccctgac ttggcttggc tcctcttctt cttatccctt gtcctcgcaa ccccgcttcc 900
ttctctcctc tcctcttctc ttctcttctc tggtggtgtg ggtgtgtccc tgtctcccct 960
ctccttcctc ctctcctttc ccctcctctc ttcccccctc tcacaagaga gagagcgcca 1020
gactctcccc aggtgaggtg agaccagtct ttttgctcga ttcgacgcgc ctttcacgcc 1080
gcctcgcgcg gatctgaccg cttccctcgg ccttctcgca ggattcagcc 1130
<210>214
<211>1244
<212>DNA
<213>Oryza sativa
<400>214
aaaaccaccg agggacctga tctgcaccgg ttttgatagt tgagggaccc gttgtgtctg 60
gttttccgat cgagggacga aaatcggatt cggtgtaaag ttaagggacc tcagatgaac 120
ttattccgga gcatgattgg gaagggagga cataaggccc atgtcgcatg tgtttggacg 180
gtccagatct ccagatcact cagcaggatc ggccgcgttc gcgtagcacc cgcggtttga 240
ttcggcttcc cgcaaggcgg cggccggtgg ccgtgccgcc gtagcttccg ccggaagcga 300
gcacgccgcc gccgccgacc cggctctgcg tttgcaccgc cttgcacgcg atacatcggg 360
atagatagct actactctct ccgtttcaca atgtaaatca ttctactatt ttccacattc 420
atattgatgt taatgaatat agacatatat atctatttag attcattaac atcaatatga 480
atgtaggaaa tgctagaatg acttacattg tgaattgtga aatggacgaa gtacctacga 540
tggatggatg caggatcatg aaagaattaa tgcaagatcg tatctgccgc atgcaaaatc 600
ttactaattg cgctgcatat atgcatgaca gcctgcatgc gggcgtgtaa gcgtgttcat 660
ccattaggaa gtaaccttgt cattacttat accagtacta catactatat agtattgatt 720
tcatgagcaa atctacaaaa ctggaaagca ataagaaata cgggactgga aaagactcaa 780
cattaatcac caaatatttc gccttctcca gcagaatata tatctctcca tcttgatcac 840
tgtacacact gacagtgtac gcataaacgc agcagccagc ttaactgtcg tctcaccgtc 900
gcacactggc cttccatctc aggctagctt tctcagccac ccatcgtaca tgtcaactcg 960
gcgcgcgcac aggcacaaat tacgtacaaa acgcatgacc aaatcaaaac caccggagaa 1020
gaatcgctcc cgcgcgcggc ggcgacgcgc acgtacgaac gcacgcacgc acgcccaacc 1080
ccacgacacg atcgcgcgcg acgccggcga caccggccgt ccacccgcgc cctcacctcg 1140
ccgactataa atacgtaggc atctgcttga tcttgtcatc catctcacca ccaaaaaaaa 1200
aaggaaaaaa aaacaaaaca caccaagcca aataaaagcg acaa 1244
<210>215
<211>1482
<212>DNA
<213>Arabidopsis thaliana
<400>215
atgttttttg gaccaagtga gtttgatgct gatgaaatgg gttatgcaat gagtagactt 60
gagatagaat ccgatctatg tgatactgga aaagacgttt gtggagttgg tagtagtagt 120
ggtagtcaca gatcaagtga acatttggct gatctagacc atgaaatcag ccaggttact 180
aaattgaaat ctagtcctca tcaacggtat agccgtgaag tccctgggag acatcagtta 240
cctgtgtcta ctgtgaggat gttggcaggt cgagaaagta atttctctgg aagaggaagg 300
ttttcagccg ctgattgttg ccatatgcta agcagatatt tgcctacaaa aggtccttgg 360
cttgtagatc aaatggacag ccgagcatat gtctctcagt tttcaactga tggttctctc 420
tttattgcgg ggtttcaggg tagccgtatt cggatttaca atgtagagaa gggttggaaa 480
gttcaaaagg atattcttgc aaaaagcttg cgttggactg ttactgatac ttctctatcc 540
cctgatcagc gaaatctggt ttacgcaagc atgtcaccta ttgttcacat tgttgatgtt 600
ggatccggtt caaccgagtc tcatgcaaat gttacggaga tccatgatgg cttagacttc 660
tcttctgatg aagatggagg gtactctttt ggaatattct ctgtgaaatt ttcaacagat 720
ggccgagaag ttgttgctgg gagcagtgat gattccattt atgtttatga ccttgaagca 780
aatcgagttt cactccggac tgttgcacac acgtctgatg taaatactgt gtgctttgct 840
gatgaaagtg ggaacctgat tttatctgga agtgatgata atctctgcaa agtgtgggat 900
aggcgttgtt tcattgggag agataagcca gctggtgttt tagtgggaca cctcgaaggt 960
gttaccttta tcgatagccg tggagatggt cgctatttca tatcaaatgg taaagaccaa 1020
actatcaaat tgtgggatat cagaaaaatg tcctcaagcg cacctgcaag gcatgaggtg 1080
ctaagaaact atgaatggga ctacagatgg atggattatc ctactgaagc aagagatcta 1140
aagcacccac tcgatcagtc agtgtcgaca tataaaggtc actcagtttt gcgtactctc 1200
atccgttgtt acttctctcc agcgcatagt actggccaaa agtacatcta cacaggatcg 1260
aacgacagtt ccgtctacat atacgacttg gtaagtggag ataaagtggc agtgctaaag 1320
caccatagct cacctgtaag agactgtaat tggcacccat attacccaac gcttataagc 1380
tcttcgtggg acggagatct tgtgaagtgg gaatttccgg ggagcggtga ggcgccgatt 1440
atgagtaaga agagggttcg aaggagacat ttctactact ga 1482
<210>216
<211>493
<212>PRT
<213>Arabidopsis thaliana
<400>216
Met Phe Phe Gly Pro Ser Glu Phe Asp Ala Asp Glu Met Gly Tyr Ala
1 5 10 15
Met Ser Arg Leu Glu Ile Glu Ser Asp Leu Cys Asp Thr Gly Lys Asp
20 25 30
Val Cys Gly Val Gly Ser Ser Ser Gly Ser His Arg Ser Ser Glu His
35 40 45
Leu Ala Asp Leu Asp His Glu Ile Ser Gln Val Thr Lys Leu Lys Ser
50 55 60
Ser Pro His Gln Arg Tyr Ser Arg Glu Val Pro Gly Arg His Gln Leu
65 70 75 80
Pro Val Ser Thr Val Arg Met Leu Ala Gly Arg Glu Ser Asn Phe Ser
85 90 95
Gly Arg Gly Arg Phe Ser Ala Ala Asp Cys Cys His Met Leu Ser Arg
100 105 110
Tyr Leu Pro Thr Lys Gly Pro Trp Leu Val Asp Gln Met Asp Ser Arg
115 120 125
Ala Tyr Val Ser Gln Phe Ser Thr Asp Gly Ser Leu Phe Ile Ala Gly
130 135 140
Phe Gln Gly Ser Arg Ile Arg Ile Tyr Asn Val Glu Lys Gly Trp Lys
145 150 155 160
Val Gln Lys Asp Ile Leu Ala Lys Ser Leu Arg Trp Thr Val Thr Asp
165 170 175
Thr Ser Leu Ser Pro Asp Gln Arg Asn Leu Val Tyr Ala Ser Met Ser
180 185 190
Pro Ile Val His Ile Val Asp Val Gly Ser Gly Ser Thr Glu Ser His
195 200 205
Ala Asn Val Thr Glu Ile His Asp Gly Leu Asp Phe Ser Ser Asp Glu
210 215 220
Asp Gly Gly Tyr Ser Phe Gly Ile Phe Ser Val Lys Phe Ser Thr Asp
225 230 235 240
Gly Arg Glu Val Val Ala Gly Ser Ser Asp Asp Ser Ile Tyr Val Tyr
245 250 255
Asp Leu Glu Ala Asn Arg Val Ser Leu Arg Thr Val Ala His Thr Ser
260 265 270
Asp Val Asn Thr Val Cys Phe Ala Asp Glu Ser Gly Asn Leu Ile Leu
275 280 285
Ser Gly Ser Asp Asp Asn Leu Cys Lys Val Trp Asp Arg Arg Cys Phe
290 295 300
Ile Gly Arg Asp Lys Pro Ala Gly Val Leu Val Gly His Leu Glu Gly
305 310 315 320
Val Thr Phe Ile Asp Ser Arg Gly Asp Gly Arg Tyr Phe Ile Ser Asn
325 330 335
Gly Lys Asp Gln Thr Ile Lys Leu Trp Asp Ile Arg Lys Met Ser Ser
340 345 350
Ser Ala Pro Ala Arg His Glu Val Leu Arg Asn Tyr Glu Trp Asp Tyr
355 360 365
Arg Trp Met Asp Tyr Pro Thr Glu Ala Arg Asp Leu Lys His Pro Leu
370 375 380
Asp Gln Ser Val Ser Thr Tyr Lys Gly His Ser Val Leu Arg Thr Leu
385 390 395 400
Ile Arg Cys Tyr Phe Ser Pro Ala His Ser Thr Gly Gln Lys Tyr Ile
405 410 415
Tyr Thr Gly Ser Asn Asp Ser Ser Val Tyr Ile Tyr Asp Leu Val Ser
420 425 430
Gly Asp Lys Val Ala Val Leu Lys His His Ser Ser Pro Val Arg Asp
435 440 445
Cys Asn Trp His Pro Tyr Tyr Pro Thr Leu Ile Ser Ser Ser Trp Asp
450 455 460
Gly Asp Leu Val Lys Trp Glu Phe Pro Gly Ser Gly Glu Ala Pro Ile
465 470 475 480
Met Ser Lys Lys Arg Val Arg Arg Arg His Phe Tyr Tyr
485 490
<210>217
<211>1458
<212>DNA
<213>Aquilegia formosa x Aquilegia pubescens
<400>217
atgtttgtta cagccagtgg ggttgacatt gatgaaatgg ggtacgccat gagtaggcta 60
gagatagaat ccgagttgtt cgatggcggt aataccgtcc atgaagccag tagcagtact 120
aggcccggca aactgtttcc taaagtagat gacgagattt cccagcttac aaacctcaga 180
tcagggccta atgatcgact gcatcaactt gtgcccggaa agcagcaatt acctgtttcc 240
ccggtgagga tgttggcagg tcgagaaagt aattattcag gaaagggaag gttctcgtca 300
gcggatcgtt gtcacatgct tagcaggtat ttgcctgtaa atggtccttg gcttgtggac 360
cagacgacca gtcgggccta tgtttctcaa ttttcagctg atggttctct atttgttgcc 420
gggtttcagg gaagcgatat tagaatatac aatgtggata gaggctggaa agttcagaag 480
aacattcttg ccaaaagttt gcgttggact gttactgaca catccctttc ccctgatcag 540
cgccatcttg tttataccag tatgtcaccc atagtccata ttgttaatgt tgggtctgct 600
accacggaat cccttgcaaa tataacagag gtccatgaag gattggactt ctctgctgct 660
gatggagggt attcttttgg aatattctct gtgaaatttt caaccgatgg acgagaactt 720
gttgctggaa gcagtgatga ttcgatatat gtttatgatc ttgaagcaaa caagctttcc 780
cttcgaattg cggcacacat ggctgatgtt aatactgtaa cctttgctga tgaaagtggt 840
aatctgatat attcaggaag tgatgataat ctctgcaagg tgtgggacag gcgttgtctc 900
agagcaaagg gaaaaccagc aggggttttg acggggcatt tagaaggcat tacgtttatt 960
gatagccgtg gagatggtcg atattttata tcaaatggaa aagaccaggc cattaaactt 1020
tgggacatca ggaaaatgtc ggctaatgct agttgcattg caaaatctag aaattacgaa 1080
tgggactaca gatggatgga atacccaact gaggcgagaa agttaaaaca tccttgtgat 1140
cagtctttgg ctacatacaa agggcactct gtcttgcgta ctctcatccg ttgttacttt 1200
tcaccggctt acagcactgg ccagaagtac atctacaccg gatcaaatga tggttgcgtt 1260
tatatatatg atttggtaag tggagcccaa gttgctcgac tagatcatca ttcatccact 1320
gtgagggatt gtagttggca ccctttctac ccaacgcttg tcagcagttc atgggatggg 1380
gtccttgcca gatgtgaatt tcctggcaat ggagaaaaaa ttcggctgaa gagaagcagg 1440
aggagatatg aatcttga 1458
<210>218
<211>485
<212>PRT
<213>Aquilegia formosa x Aquilegia pubescens
<400>218
Met Phe Val Thr Ala Ser Gly Val Asp Ile Asp Glu Met Gly Tyr Ala
1 5 10 15
Met Ser Arg Leu Glu Ile Glu Ser Glu Leu Phe Asp Gly Gly Asn Thr
20 25 30
Val His Glu Ala Ser Ser Ser Thr Arg Pro Gly Lys Leu Phe Pro Lys
35 40 45
Val Asp Asp Glu Ile Ser Gln Leu Thr Asn Leu Arg Ser Gly Pro Asn
50 55 60
Asp Arg Leu His Gln Leu Val Pro Gly Lys Gln Gln Leu Pro Val Ser
65 70 75 80
Pro Val Arg Met Leu Ala Gly Arg Glu Ser Asn Tyr Ser Gly Lys Gly
85 90 95
Arg Phe Ser Ser Ala Asp Arg Cys His Met Leu Ser Arg Tyr Leu Pro
100 105 110
Val Asn Gly Pro Trp Leu Val Asp Gln Thr Thr Ser Arg Ala Tyr Val
115 120 125
Ser Gln Phe Ser Ala Asp Gly Ser Leu Phe Val Ala Gly Phe Gln Gly
130 135 140
Ser Asp Ile Arg Ile Tyr Asn Val Asp Arg Gly Trp Lys Val Gln Lys
145 150 155 160
Asn Ile Leu Ala Lys Ser Leu Arg Trp Thr Val Thr Asp Thr Ser Leu
165 170 175
Ser Pro Asp Gln Arg His Leu Val Tyr Thr Ser Met Ser Pro Ile Val
180 185 190
His Ile Val Asn Val Gly Ser Ala Thr Thr Glu Ser Leu Ala Asn Ile
195 200 205
Thr Glu Val His Glu Gly Leu Asp Phe Ser Ala Ala Asp Gly Gly Tyr
210 215 220
Ser Phe Gly Ile Phe Ser Val Lys Phe Ser Thr Asp Gly Arg Glu Leu
225 230 235 240
Val Ala Gly Ser Ser Asp Asp Ser Ile Tyr Val Tyr Asp Leu Glu Ala
245 250 255
Asn Lys Leu Ser Leu Arg Ile Ala Ala His Met Ala Asp Val Asn Thr
260 265 270
Val Thr Phe Ala Asp Glu Ser Gly Asn Leu Ile Tyr Ser Gly Ser Asp
275 280 285
Asp Asn Leu Cys Lys Val Trp Asp Arg Arg Cys Leu Arg Ala Lys Gly
290 295 300
Lys Pro Ala Gly Val Leu Thr Gly His Leu Glu Gly Ile Thr Phe Ile
305 310 315 320
Asp Ser Arg Gly Asp Gly Arg Tyr Phe Ile Ser Asn Gly Lys Asp Gln
325 330 335
Ala Ile Lys Leu Trp Asp Ile Arg Lys Met Ser Ala Asn Ala Ser Cys
340 345 350
Ile Ala Lys Ser Arg Asn Tyr Glu Trp Asp Tyr Arg Trp Met Glu Tyr
355 360 365
Pro Thr Glu Ala Arg Lys Leu Lys His Pro Cys Asp Gln Ser Leu Ala
370 375 380
Thr Tyr Lys Gly His Ser Val Leu Arg Thr Leu Ile Arg Cys Tyr Phe
385 390 395 400
Ser Pro Ala Tyr Ser Thr Gly Gln Lys Tyr Ile Tyr Thr Gly Ser Asn
405 410 415
Asp Gly Cys Val Tyr Ile Tyr Asp Leu Val Ser Gly Ala Gln Val Ala
420 425 430
Arg Leu Asp His His Ser Ser Thr Val Arg Asp Cys Ser Trp His Pro
435 440 445
Phe Tyr Pro Thr Leu Val Ser Ser Ser Trp Asp Gly Val Leu Ala Arg
450 455 460
Cys Glu Phe Pro Gly Asn Gly Glu Lys Ile Arg Leu Lys Arg Ser Arg
465 470 475 480
Arg Arg Tyr Glu Ser
485
<210>219
<211>1488
<212>DNA
<213>Brassica napus
<400>219
atgttttctg gaccaagtga ttctgatact gatgaaatgg gttatgcaat gagtagactc 60
gagattgaat ccgatctatg cgacgctgga aagggctatt acggtgttgg tagcagcagt 120
ggtagtagtc acagatcgag tgagcgtttg ggtgatttag acaatgagat cagccaagtc 180
actaagctga agtcttgtcc tcatgaacgg tttagccgtc aagtacccgg gaggcatcag 240
ttgcctgttt ccactgtgaa gatgttggct ggtcgtgaga gtaacttctc tggaagagga 300
ggaaggtttt cttcagctga tcgttgtcat atcttgagta gatatttgcc tgttaagggt 360
ccttggcttg tggatcagat ggacagccga gcttatgtct ctcagttttc aactgatggg 420
tctctcttca ttgctgggtt tcagggaagc catattcgga tttacaatgt agagaaaggt 480
tggaaagttc aaaaggatat tcttgcaaag agcttgcgtt ggactgttac tgatacttct 540
ctgtcccctg atcagcgaaa cctggtttat gcaagcatgt cacctattgt tcacatagtc 600
gatgttggat ctggtacaac cgagtctcac gcaaatgtca cggagatcca tgatggatta 660
gacttctctt ctgaagaaga tggaggctac tcttttggga tattctctgt gaaattttca 720
acagatggac gagaactcgt tgctggtagc agtgatgatt ccatttacgt ttatgatctc 780
gaagcaaacc gagtctcact ccggactgtt gcacacacgt ctgatgtgaa cactgtgtgt 840
ttcgccgatg aaagtggaca cctgattctc tctggaggtg atgataatct ctgcaaggtg 900
tgggataggc gttgtttcat tgggagagat aagccagctg gtgttctggt gggacaccta 960
gaaggtgtta catttatcga tagccgcgga gatggtcgct atttcatatc aaatgggaaa 1020
gaccaaacca tcaagctatg ggatattaga aaaatgtcct caactgtacc tgcaaggaat 1080
gaggtgcaca gaaactatga atgggattac agatggatgg attacccttc ggaggcaaga 1140
gatctaaagc acccttatga tcagtctgtg tctacatata agggtcactc agtgttgcgt 1200
actctcatcc gttgctactt ctctccagct catagtactg gtcaaaagta catatacaca 1260
ggatccaacg acagttctgt ctacatatat gacttggaaa gtggagataa agcggcggtg 1320
ttaaagcacc atagctcacc tgtgagagac tgtaactggc atccgcatta tccgacgctt 1380
ataagctcgt cgtgggacgg agatcttgtg aaatgggagt ttcctgggag cggtgaggcg 1440
ccgatcatga gcaagaagag ggtccgaagg agacatttct actactga 1488
<210>220
<211>495
<212>PRT
<213>Brassica napus
<400>220
Met Phe Ser Gly Pro Ser Asp Ser Asp Thr Asp Glu Met Gly Tyr Ala
1 5 10 15
Met Ser Arg Leu Glu Ile Glu Ser Asp Leu Cys Asp Ala Gly Lys Gly
20 25 30
Tyr Tyr Gly Val Gly Ser Ser Ser Gly Ser Ser His Arg Ser Ser Glu
35 40 45
Arg Leu Gly Asp Leu Asp Asn Glu Ile Ser Gln Val Thr Lys Leu Lys
50 55 60
Ser Cys Pro His Glu Arg Phe Ser Arg Gln Val Pro Gly Arg His Gln
65 70 75 80
Leu Pro Val Ser Thr Val Lys Met Leu Ala Gly Arg Glu Ser Asn Phe
85 90 95
Ser Gly Arg Gly Gly Arg Phe Ser Ser Ala Asp Arg Cys His Ile Leu
100 105 110
Ser Arg Tyr Leu Pro Val Lys Gly Pro Trp Leu Val Asp Gln Met Asp
115 120 125
Ser Arg Ala Tyr Val Ser Gln Phe Ser Thr Asp Gly Ser Leu Phe Ile
130 135 140
Ala Gly Phe Gln Gly Ser His Ile Arg Ile Tyr Asn Val Glu Lys Gly
145 150 155 160
Trp Lys Val Gln Lys Asp Ile Leu Ala Lys Ser Leu Arg Trp Thr Val
165 170 175
Thr Asp Thr Ser Leu Ser Pro Asp Gln Arg Asn Leu Val Tyr Ala Ser
180 185 190
Met Ser Pro Ile Val His Ile Val Asp Val Gly Ser Gly Thr Thr Glu
195 200 205
Ser His Ala Asn Val Thr Glu Ile His Asp Gly Leu Asp Phe Ser Ser
210 215 220
Glu Glu Asp Gly Gly Tyr Ser Phe Gly Ile Phe Ser Val Lys Phe Ser
225 230 235 240
Thr Asp Gly Arg Glu Leu Val Ala Gly Ser Ser Asp Asp Ser Ile Tyr
245 250 255
Val Tyr Asp Leu Glu Ala Asn Arg Val Ser Leu Arg Thr Val Ala His
260 265 270
Thr Ser Asp Val Asn Thr Val Cys Phe Ala Asp Glu Ser Gly His Leu
275 280 285
Ile Leu Ser Gly Gly Asp Asp Asn Leu Cys Lys Val Trp Asp Arg Arg
290 295 300
Cys Phe Ile Gly Arg Asp Lys Pro Ala Gly Val Leu Val Gly His Leu
305 310 315 320
Glu Gly Val Thr Phe Ile Asp Ser Arg Gly Asp Gly Arg Tyr Phe Ile
325 330 335
Ser Asn Gly Lys Asp Gln Thr Ile Lys Leu Trp Asp Ile Arg Lys Met
340 345 350
Ser Ser Thr Val Pro Ala Arg Asn Glu Val His Arg Asn Tyr Glu Trp
355 360 365
Asp Tyr Arg Trp Met Asp Tyr Pro Ser Glu Ala Arg Asp Leu Lys His
370 375 380
Pro Tyr Asp Gln Ser Val Ser Thr Tyr Lys Gly His Ser Val Leu Arg
385 390 395 400
Thr Leu Ile Arg Cys Tyr Phe Ser Pro Ala His Ser Thr Gly Gln Lys
405 410 415
Tyr Ile Tyr Thr Gly Ser Asn Asp Ser Ser Val Tyr Ile Tyr Asp Leu
420 425 430
Glu Ser Gly Asp Lys Ala Ala Val Leu Lys His His Ser Ser Pro Val
435 440 445
Arg Asp Cys Asn Trp His Pro His Tyr Pro Thr Leu Ile Ser Ser Ser
450 455 460
Trp Asp Gly Asp Leu Val Lys Trp Glu Phe Pro Gly Ser Gly Glu Ala
465 470 475 480
Pro Ile Met Ser Lys Lys Arg Val Arg Arg Arg His Phe Tyr Tyr
485 490 495
<210>221
<211>1428
<212>DNA
<213>Glycine max
<400>221
atgagttggt tgaataaaaa taaaagtact tgtagtgatg gtagtgccaa taatgaatct 60
tcttcaagtg gaatagttgg agaaagggat aatcatcttg atcatgaaat tgcacagctc 120
acaaaactta ggtcaagtcc tcatgagctt ttgggtcgtg ttgttcctgg taagatgagg 180
ttacctgcat ctactgtgag aatgctggtt ggtagagaag gtaattattc tggaagaggg 240
agattttcat cagcagatgg gtgtcatgtg ttaagccgct atttgcctac caaaggtcct 300
tggattgtgg atcggatgaa aagtcgtgcc tatgtttcac agttttctgc tgatggttct 360
cttttaattg ctggattcca gggaagccac atcaggatct atgatgttga ccagggctgg 420
aaagttaaaa aggacatttc tgctagaaag ttacggtgga cagttactga tacatctctc 480
tcgccagatc aactctatct tgtttatgcc agtatgtcac caattatcca tattgttacc 540
gtgggatctg gcacaacaga atcaatagca aatgttacag aaattcacta tggattaaat 600
ttctcttctg ataatggtga tgatgaattt ggaattttct ctgtcaaatt ttcaacggat 660
gggcgagagc ttgtggctgg aactagtgat tgctcgatat gtgtatatga tcttggagca 720
gataagctga gccttagaat tcctgctcac cagtctgatg ttaacactgt ctgctttgct 780
gatgaatctg gccatctaat attttccggt agtgatgata gttttatcaa ggtgtgggat 840
aggcgttgtt ttgtcgccaa aggacaacca gctggtatct taatgggaca tttagaaggc 900
attacattca ttgatagccg tggggatggt cgatatttaa tttctaatgg aaaagatcaa 960
actaccaaat tatgggatat aaggaagatg tcttctaatg caataaatct tggccttgga 1020
gatgatgagt gggactatcg atggatggac taccctgaat atgcaagaaa tttaaagcat 1080
cctcatgatc agtcattagc aacatataaa ggtcactcag tgttgcgtac tttagtgcgc 1140
tgttatttct ctccttcgta tagcactggt caaaagtaca tttacacagg gtctagtgat 1200
tcatctgttt acatatatga cctggtaagt ggtgcacaag ttgcaaaact tgatcatcat 1260
gaggcacctg taagggattg tagttggcac ccctattatc caatgatgat ctcttcggct 1320
tgggatggtg atgttgtcag gtgggaattt cctgggagtg atgaagcccc tgcttctcca 1380
aataaaagag aaggtcgaat tcgtaggaga aatttgcttt atctatag 1428
<210>222
<211>475
<212>PRT
<213>Glycine max
<400>222
Met Ser Trp Leu Asn Lys Asn Lys Ser Thr Cys Ser Asp Gly Ser Ala
1 5 10 15
Asn Asn Glu Ser Ser Ser Ser Gly Ile Val Gly Glu Arg Asp Asn His
20 25 30
Leu Asp His Glu Ile Ala Gln Leu Thr Lys Leu Arg Ser Ser Pro His
35 40 45
Glu Leu Leu Gly Arg Val Val Pro Gly Lys Met Arg Leu Pro Ala Ser
50 55 60
Thr Val Arg Met Leu Val Gly Arg Glu Gly Asn Tyr Ser Gly Arg Gly
65 70 75 80
Arg Phe Ser Ser Ala Asp Gly Cys His Val Leu Ser Arg Tyr Leu Pro
85 90 95
Thr Lys Gly Pro Trp Ile Val Asp Arg Met Lys Ser Arg Ala Tyr Val
100 105 110
Ser Gln Phe Ser Ala Asp Gly Ser Leu Leu Ile Ala Gly Phe Gln Gly
115 120 125
Ser His Ile Arg Ile Tyr Asp Val Asp Gln Gly Trp Lys Val Lys Lys
130 135 140
Asp Ile Ser Ala Arg Lys Leu Arg Trp Thr Val Thr Asp Thr Ser Leu
145 150 155 160
Ser Pro Asp Gln Leu Tyr Leu Val Tyr Ala Ser Met Ser Pro Ile Ile
165 170 175
His Ile Val Thr Val Gly Ser Gly Thr Thr Glu Ser Ile Ala Asn Val
180 185 190
Thr Glu Ile His Tyr Gly Leu Asn Phe Ser Ser Asp Asn Gly Asp Asp
195 200 205
Glu Phe Gly Ile Phe Ser Val Lys Phe Ser Thr Asp Gly Arg Glu Leu
210 215 220
Val Ala Gly Thr Ser Asp Cys Ser Ile Cys Val Tyr Asp Leu Gly Ala
225 230 235 240
Asp Lys Leu Ser Leu Arg Ile Pro Ala His Gln Ser Asp Val Asn Thr
245 250 255
Val Cys Phe Ala Asp Glu Ser Gly His Leu Ile Phe Ser Gly Ser Asp
260 265 270
Asp Ser Phe Ile Lys Val Trp Asp Arg Arg Cys Phe Val Ala Lys Gly
275 280 285
Gln Pro Ala Gly Ile Leu Met Gly His Leu Glu Gly Ile Thr Phe Ile
290 295 300
Asp Ser Arg Gly Asp Gly Arg Tyr Leu Ile Ser Asn Gly Lys Asp Gln
305 310 315 320
Thr Thr Lys Leu Trp Asp Ile Arg Lys Met Ser Ser Asn Ala Ile Asn
325 330 335
Leu Gly Leu Gly Asp Asp Glu Trp Asp Tyr Arg Trp Met Asp Tyr Pro
340 345 350
Glu Tyr Ala Arg Asn Leu Lys His Pro His Asp Gln Ser Leu Ala Thr
355 360 365
Tyr Lys Gly His Ser Val Leu Arg Thr Leu Val Arg Cys Tyr Phe Ser
370 375 380
Pro Ser Tyr Ser Thr Gly Gln Lys Tyr Ile Tyr Thr Gly Ser Ser Asp
385 390 395 400
Ser Ser Val Tyr Ile Tyr Asp Leu Val Ser Gly Ala Gln Val Ala Lys
405 410 415
Leu Asp His His Glu Ala Pro Val Arg Asp Cys Ser Trp His Pro Tyr
420 425 430
Tyr Pro Met Met Ile Ser Ser Ala Trp Asp Gly Asp Val Val Arg Trp
435 440 445
Glu Phe Pro Gly Ser Asp Glu Ala Pro Ala Ser Pro Asn Lys Arg Glu
450 455 460
Gly Arg Ile Arg Arg Arg Asn Leu Leu Tyr Leu
465 470 475
<210>223
<211>1467
<212>DNA
<213>Gossypium hirsutum
<400>223
atgtttgtta cagccagtgg ggttgacatt gatgaaatgg ggtacgccat gagtaggcta 60
gagatagaat ccgagttgtt cgatggcggt aataccgtcc atgaagccag tagcagtact 120
aggcccggca aactgtttcc taaagtagat gacgagattt cccagcttac aaacctcaga 180
tcagggccta atgatcgact gcatcaactt gtgcccggaa agcagcaatt acctgtttcc 240
ccggtgagga tgttggcagg tcgagaaagt aattattcag gaaagggaag gttctcgtca 300
gcggatcgtt gtcacatgct tagcaggtat ttgcctgtaa atggtccttg gcttgtggac 360
cagacgacca gtcgggccta tgtttctcaa ttttcagctg atggttctct atttgttgcc 420
gggtttcagg gaagcaatat taggatatac aatgtggata gaggttggaa agttcaaaag 480
aacattcttg ctaaaagttt gcgttggaca gttaccgaca catccctttc tccggatcag 540
cggtaccttg tttataccag catgtcgcct gtagttcaca ttgttaatgt tgggtcttct 600
accacggaat cctttgcaaa tgtcacggag atccacgaag gattggactt ttcttctaat 660
gatcgaaggt attcttttgg aatattctcc gtgaaatttt caactgatgg acgagaactt 720
gtggctggaa gcagtgatga ctcgatatat gtttatgatc tggaagcaaa caagctttcc 780
cttcgaatta tggcacacac ggctgatgtt aacacggcaa cctttgccga tgaaagcggc 840
aatttgatat attctgggag tgatgattat ctctgcatgg tgtgggatag gcgttgcttt 900
ggagcaaaag ataagccggc aggagttttg gtgggacacc tggaaggtat tacattcctc 960
gacagtcgtg gggatggtcg ttacttcata tcaaacggta aagatcagac tatcaagctt 1020
tgggatatcc ggaaaatgtc ctccgatacc tcttgcaatt tagggtatcg gaatttcgaa 1080
tgggattaca gatggatgga ctaccctcca caggctagag atttgaaaca cccaagtgac 1140
ggatcagtgg ctacttataa aggtcactca gtgttgcgca ctcttattcg ctgttatttt 1200
tcacccgaat actgcacggg ccaaaagtac atttacaccg gatctcacga ttctcgggtt 1260
tatatttatg atgtggtcac cggagcccaa gttgcggtac tgaagcacca tacatcacca 1320
gtaagagact gtagttggca cccgcattac cctgtgttgg tcagctcctc ttgggacggg 1380
gacgtggtta agtgggaatt ccctggtaaa ggagaagcgc cggtccttgc gaacgagagg 1440
agagtcagga ggcaatatca cgattga 1467
<210>224
<211>488
<212>PRT
<213>Gossypium hirsutum
<400>224
Met Phe Val Thr Ala Ser Gly Val Asp Ile Asp Glu Met Gly Tyr Ala
1 5 10 15
Met Ser Arg Leu Glu Ile Glu Ser Glu Leu Phe Asp Gly Gly Asn Thr
20 25 30
Val His Glu Ala Ser Ser Ser Thr Arg Pro Gly Lys Leu Phe Pro Lys
35 40 45
Val Asp Asp Glu Ile Ser Gln Leu Thr Asn Leu Arg Ser Gly Pro Asn
50 55 60
Asp Arg Leu His Gln Leu Val Pro Gly Lys Gln Gln Leu Pro Val Ser
65 70 75 80
Pro Val Arg Met Leu Ala Gly Arg Glu Ser Asn Tyr Ser Gly Lys Gly
85 90 95
Arg Phe Ser Ser Ala Asp Arg Cys His Met Leu Ser Arg Tyr Leu Pro
100 105 110
Val Asn Gly Pro Trp Leu Val Asp Gln Thr Thr Ser Arg Ala Tyr Val
115 120 125
Ser Gln Phe Ser Ala Asp Gly Ser Leu Phe Val Ala Gly Phe Gln Gly
130 135 140
Ser Asn Ile Arg Ile Tyr Asn Val Asp Arg Gly Trp Lys Val Gln Lys
145 150 155 160
Asn Ile Leu Ala Lys Ser Leu Arg Trp Thr Val Thr Asp Thr Ser Leu
165 170 175
Ser Pro Asp Gln Arg Tyr Leu Val Tyr Thr Ser Met Ser Pro Val Val
180 185 190
His Ile Val Asn Val Gly Ser Ser Thr Thr Glu Ser Phe Ala Asn Val
195 200 205
Thr Glu Ile His Glu Gly Leu Asp Phe Ser Ser Asn Asp Arg Arg Tyr
210 215 220
Ser Phe Gly Ile Phe Ser Val Lys Phe Ser Thr Asp Gly Arg Glu Leu
225 230 235 240
Val Ala Gly Ser Ser Asp Asp Ser Ile Tyr Val Tyr Asp Leu Glu Ala
245 250 255
Asn Lys Leu Ser Leu Arg Ile Met Ala His Thr Ala Asp Val Asn Thr
260 265 270
Ala Thr Phe Ala Asp Glu Ser Gly Asn Leu Ile Tyr Ser Gly Ser Asp
275 280 285
Asp Tyr Leu Cys Met Val Trp Asp Arg Arg Cys Phe Gly Ala Lys Asp
290 295 300
Lys Pro Ala Gly Val Leu Val Gly His Leu Glu Gly Ile Thr Phe Leu
305 310 315 320
Asp Ser Arg Gly Asp Gly Arg Tyr Phe Ile Ser Asn Gly Lys Asp Gln
325 330 335
Thr Ile Lys Leu Trp Asp Ile Arg Lys Met Ser Ser Asp Thr Ser Cys
340 345 350
Asn Leu Gly Tyr Arg Asn Phe Glu Trp Asp Tyr Arg Trp Met Asp Tyr
355 360 365
Pro Pro Gln Ala Arg Asp Leu Lys His Pro Ser Asp Gly Ser Val Ala
370 375 380
Thr Tyr Lys Gly His Ser Val Leu Arg Thr Leu Ile Arg Cys Tyr Phe
385 390 395 400
Ser Pro Glu Tyr Cys Thr Gly Gln Lys Tyr Ile Tyr Thr Gly Ser His
405 410 415
Asp Ser Arg Val Tyr Ile Tyr Asp Val Val Thr Gly Ala Gln Val Ala
420 425 430
Val Leu Lys His His Thr Ser Pro Val Arg Asp Cys Ser Trp His Pro
435 440 445
His Tyr Pro Val Leu Val Ser Ser Ser Trp Asp Gly Asp Val Val Lys
450 455 460
Trp Glu Phe Pro Gly Lys Gly Glu Ala Pro Val Leu Ala Asn Glu Arg
465 470 475 480
Arg Val Arg Arg Gln Tyr His Asp
485
<210>225
<211>1464
<212>DNA
<213>Helianthus annuus
<400>225
atgtactcta ggggttggac cacacttata ggtgacatgg gatatgccct aagtagattg 60
gaaatcgacc cagattattc tgataacgga tctgttggag acgataatga cagccaccag 120
tcttcatcac aaaatgatgt ggatcctgaa gttgctcagt taacaaagct gaaatcagca 180
ccccatgatg gattgaaacg cgttcttcca cgaagggggg aatttgatgt ttcgcctgtg 240
aagatgttag cgggtcgaga agggaattat tcgggtcgtg ggaagttttc tttagcagat 300
cgttgtcata tgctaaacaa atatttacct gttaaaggtc cttctattgt tgaccaattg 360
accacccggg cttatgtctc acagttttca aaagacgggt ccctttttgt tgctgcattt 420
cagggaagtc agattaaaat ttataatgct gaaatggggt ggaaacttca caagaaaatt 480
gttgctgaaa gctttaattg gacggttact gacacatcta tttcaccaga taaacgtttc 540
ctgatttatt caactttgtc tcctatagtc aacattgtaa atattggatc tgctggaaca 600
gagtctcatg caaatgtcac ggacatacac gaagggctag aatttgcagc tgatgatgaa 660
gaaggatatg catttggaat tttttctgta aaattttcta gtgatggtag agaacttgta 720
gccggaagta gtgatgattc aatttatgtt tatgatattg aagcaaaaag attttccctt 780
cgaattcaag cgcatacgtc agatgtaaac agtgtatgct ttgctgatga agccagcaat 840
ctgatatatt ctgggagtga tgataatctc tgtaaggttt gggacagacg ttccatcaga 900
tcaaaaggaa agccagttgg aatcctcacg gggcatctag aaggaattac acatcttgat 960
agccgtaatg atggtcgtta tttcatttca aatggaaaag atcagactat taagctttgg 1020
gatatcagaa aaatgtcctc taatgcggct cgcgctccta tatccaggaa ctatgaatgg 1080
gactacagat ggatggacta tctacttagg gcacgagatg taaagcatcc atctgaccag 1140
tccgttgcta catacaaagg tcattcagta ttgcgtacac tcatacgctg ctatttttca 1200
ccagaatata gcactggcca aaggtatatt tatactggat ctcatgattc ttgtgtgtat 1260
gtttatgatt tggttactgg ggcccgggtt gcaagacttg tgcaccataa gtcaaccgtg 1320
cgggactgca gctggcaccc ttactatcca atgctcgtca gttcttcgtt tgatggagat 1380
atcgcaaagt gggaatttcc tggaaacgga gagaacccga ttcccgtgaa taacagtagg 1440
cctcgacgac aatattatga ttaa 1464
<210>226
<211>487
<212>PRT
<213>Helianthus annuus
<400>226
Met Tyr Ser Arg Gly Trp Thr Thr Leu Ile Gly Asp Met Gly Tyr Ala
1 5 10 15
Leu Ser Arg Leu Glu Ile Asp Pro Asp Tyr Ser Asp Asn Gly Ser Val
20 25 30
Gly Asp Asp Asn Asp Ser His Gln Ser Ser Ser Gln Asn Asp Val Asp
35 40 45
Pro Glu Val Ala Gln Leu Thr Lys Leu Lys Ser Ala Pro His Asp Gly
50 55 60
Leu Lys Arg Val Leu Pro Arg Arg Gly Glu Phe Asp Val Ser Pro Val
65 70 75 80
Lys Met Leu Ala Gly Arg Glu Gly Asn Tyr Ser Gly Arg Gly Lys Phe
85 90 95
Ser Leu Ala Asp Arg Cys His Met Leu Asn Lys Tyr Leu Pro Val Lys
100 105 110
Gly Pro Ser Ile Val Asp Gln Leu Thr Thr Arg Ala Tyr Val Ser Gln
115 120 125
Phe Ser Lys Asp Gly Ser Leu Phe Val Ala Ala Phe Gln Gly Ser Gln
130 135 140
Ile Lys Ile Tyr Asn Ala Glu Met Gly Trp Lys Leu His Lys Lys Ile
145 150 155 160
Val Ala Glu Ser Phe Asn Trp Thr Val Thr Asp Thr Ser Ile Ser Pro
165 170 175
Asp Lys Arg Phe Leu Ile Tyr Ser Thr Leu Ser Pro Ile Val Asn Ile
180 185 190
Val Asn Ile Gly Ser Ala Gly Thr Glu Ser His Ala Asn Val Thr Asp
195 200 205
Ile His Glu Gly Leu Glu Phe Ala Ala Asp Asp Glu Glu Gly Tyr Ala
210 215 220
Phe Gly Ile Phe Ser Val Lys Phe Ser Ser Asp Gly Arg Glu Leu Val
225 230 235 240
Ala Gly Ser Ser Asp Asp Ser Ile Tyr Val Tyr Asp Ile Glu Ala Lys
245 250 255
Arg Phe Ser Leu Arg Ile Gln Ala His Thr Ser Asp Val Asn Ser Val
260 265 270
Cys Phe Ala Asp Glu Ala Ser Asn Leu Ile Tyr Ser Gly Ser Asp Asp
275 280 285
Asn Leu Cys Lys Val Trp Asp Arg Arg Ser Ile Arg Ser Lys Gly Lys
290 295 300
Pro Val Gly Ile Leu Thr Gly His Leu Glu Gly Ile Thr His Leu Asp
305 310 315 320
Ser Arg Asn Asp Gly Arg Tyr Phe Ile Ser Asn Gly Lys Asp Gln Thr
325 330 335
Ile Lys Leu Trp Asp Ile Arg Lys Met Ser Ser Asn Ala Ala Arg Ala
340 345 350
Pro Ile Ser Arg Asn Tyr Glu Trp Asp Tyr Arg Trp Met Asp Tyr Leu
355 360 365
Leu Arg Ala Arg Asp Val Lys His Pro Ser Asp Gln Ser Val Ala Thr
370 375 380
Tyr Lys Gly His Ser Val Leu Arg Thr Leu Ile Arg Cys Tyr Phe Ser
385 390 395 400
Pro Glu Tyr Ser Thr Gly Gln Arg Tyr Ile Tyr Thr Gly Ser His Asp
405 410 415
Ser Cys Val Tyr Val Tyr Asp Leu Val Thr Gly Ala Arg Val Ala Arg
420 425 430
Leu Val His His Lys Ser Thr Val Arg Asp Cys Ser Trp His Pro Tyr
435 440 445
Tyr Pro Met Leu Val Ser Ser Ser Phe Asp Gly Asp Ile Ala Lys Trp
450 455 460
Glu Phe Pro Gly Asn Gly Glu Asn Pro Ile Pro Val Asn Asn Ser Arg
465 470 475 480
Pro Arg Arg Gln Tyr Tyr Asp
485
<210>227
<211>1410
<212>DNA
<213>Hordeum vulgare
<400>227
atggcagcgg cagggaggct gcggggacgg cggcgggcgc agaaggaggt ggagcgcgag 60
ctcgagccgt tcactatcga ggaagaggtg tcccacctca cccgggcttt gtcggagccg 120
tgcccgggca cccgcgccgc cgtccgtggc gccaggcgga agaggggcgt ctcggctttc 180
gacatgctgt cgtcgaggga gtccggccgg tccggtggcg gcgggttctg ctcggccgac 240
cgcgcctacg ccgccgggag gcacctgccc gcggtagggc cgtggtgcgt cgaagacatg 300
gatagcgagg cctatgtttc gcagttctcc agcgatggct cactgctcgt tgctgggttt 360
cggggaagcc gcatcagagt ttacgatgtc gataaagggt ggaaggtgca taagaacata 420
agctgcagaa gtatgaggtg gacggtttca gacattgctc tctcccctga ccagcgatac 480
cttgcttatt ccagtttgtc gcctattgtt cacattgtga atgtgcagaa tgctggaagg 540
gaatcatatg ctaatgttac tgaaattcac gagggtttgg aattctgtga tgatgatgaa 600
tactctttcg ggatattctc tgtgaaattt tcgaaagatg gtagagaagt tgttgttggg 660
aacaatgatt gttcaatata tgtctatgat cttggagcaa ataaagtatc agaccgtatc 720
cgtgctcata tgggtgatgt caacacggtt acctttgctg atgaaagtgg caatttgttg 780
tactctggaa gtgatgataa tctctgtaag gtctgggata ggcgttgcct tgtaagagag 840
aaaccagcag gtgttttgac aggtcattta gatgggatta catttattga tagccgtggt 900
gatgggcgtt atctaatctc gaactgcaag gaccagacta tcaaactttg ggacgtcaga 960
aagatgtccg ccaccgtcaa aggacgacaa ccgagattat atgactggga ctacagatgg 1020
atgtcgctcc catcacacgc tagatattat aagcatccag atgatctgtc tctggcaact 1080
tacaggggtc attcagttct gcggacactt atccgctgct acttctctcc aatgcacagc 1140
acgggccaga ggtacatata cactggatca agtgatgatt cagtgcatat ttacgatgtg 1200
gtaaccgggg cgaccgtcaa gaagctctcg tggcacggtt cgatcatcag agactgcacc 1260
tggcatcctt accgtccaac acttgtcagc tcttcctggg acggctatct ggcccggtgg 1320
gaggcatcag gcgacaacga ggacccctcg gtgctcacgt gcgacgagca gaggaatagc 1380
ccttaccacg agacatacgg gctgttgtaa 1410
<210>228
<211>469
<212>PRT
<213>Hordeum vulgare
<400>228
Met Ala Ala Ala Gly Arg Leu Arg Gly Arg Arg Arg Ala Gln Lys Glu
1 5 10 15
Val Glu Arg Glu Leu Glu Pro Phe Thr Ile Glu Glu Glu Val Ser His
20 25 30
Leu Thr Arg Ala Leu Ser Glu Pro Cys Pro Gly Thr Arg Ala Ala Val
35 40 45
Arg Gly Ala Arg Arg Lys Arg Gly Val Ser Ala Phe Asp Met Leu Ser
50 55 60
Ser Arg Glu Ser Gly Arg Ser Gly Gly Gly Gly Phe Cys Ser Ala Asp
65 70 75 80
Arg Ala Tyr Ala Ala Gly Arg His Leu Pro Ala Val Gly Pro Trp Cys
85 90 95
Val Glu Asp Met Asp Ser Glu Ala Tyr Val Ser Gln Phe Ser Ser Asp
100 105 110
Gly Ser Leu Leu Val Ala Gly Phe Arg Gly Ser Arg Ile Arg Val Tyr
115 120 125
Asp Val Asp Lys Gly Trp Lys Val His Lys Asn Ile Ser Cys Arg Ser
130 135 140
Met Arg Trp Thr Val Ser Asp Ile Ala Leu Ser Pro Asp Gln Arg Tyr
145 150 155 160
Leu Ala Tyr Ser Ser Leu Ser Pro Ile Val His Ile Val Asn Val Gln
165 170 175
Asn Ala Gly Arg Glu Ser Tyr Ala Asn Val Thr Glu Ile His Glu Gly
180 185 190
Leu Glu Phe Cys Asp Asp Asp Glu Tyr Ser Phe Gly Ile Phe Ser Val
195 200 205
Lys Phe Ser Lys Asp Gly Arg Glu Val Val Val Gly Asn Asn Asp Cys
210 215 220
Ser Ile Tyr Val Tyr Asp Leu Gly Ala Asn Lys Val Ser Asp Arg Ile
225 230 235 240
Arg Ala His Met Gly Asp Val Asn Thr Val Thr Phe Ala Asp Glu Ser
245 250 255
Gly Asn Leu Leu Tyr Ser Gly Ser Asp Asp Asn Leu Cys Lys Val Trp
260 265 270
Asp Arg Arg Cys Leu Val Arg Glu Lys Pro Ala Gly Val Leu Thr Gly
275 280 285
His Leu Asp Gly Ile Thr Phe Ile Asp Ser Arg Gly Asp Gly Arg Tyr
290 295 300
Leu Ile Ser Asn Cys Lys Asp Gln Thr Ile Lys Leu Trp Asp Val Arg
305 310 315 320
Lys Met Ser Ala Thr Val Lys Gly Arg Gln Pro Arg Leu Tyr Asp Trp
325 330 335
Asp Tyr Arg Trp Met Ser Leu Pro Ser His Ala Arg Tyr Tyr Lys His
340 345 350
Pro Asp Asp Leu Ser Leu Ala Thr Tyr Arg Gly His Ser Val Leu Arg
355 360 365
Thr Leu Ile Arg Cys Tyr Phe Ser Pro Met His Ser Thr Gly Gln Arg
370 375 380
Tyr Ile Tyr Thr Gly Ser Ser Asp Asp Ser Val His Ile Tyr Asp Val
385 390 395 400
Val Thr Gly Ala Thr Val Lys Lys Leu Ser Trp His Gly Ser Ile Ile
405 410 415
Arg Asp Cys Thr Trp His Pro Tyr Arg Pro Thr Leu Val Ser Ser Ser
420 425 430
Trp Asp Gly Tyr Leu Ala Arg Trp Glu Ala Ser Gly Asp Asn Glu Asp
435 440 445
Pro Ser Val Leu Thr Cys Asp Glu Gln Arg Asn Ser Pro Tyr His Glu
450 455 460
Thr Tyr Gly Leu Leu
465
<210>229
<211>1479
<212>DNA
<213>Linum usitatissimum
<400>229
atgttcttcg tagcttcaac tgacgggatg ggttatgcca tgagtagatt ggagatagaa 60
tctcagctgt gtgacgagga agagactgtc aatgaagttg gtggtggaag ccaacagcac 120
aagtctctta acaaatcggt agagaaattg gaccatgaag ttgcccagga cactaaccta 180
aaatctcagc cccatagacg gctcgagaag gagatacctg ggaagagaca gttcccagtc 240
tctcctgtaa agatgttggc cggtcgagaa ggtaattttt ccggaagggg gaggttctcg 300
cgagctgatc ggtgtcatat gctcagcaga tatttgcctg ctgatggccc atggcttgtc 360
gatcgaatga atagccgagc ttatgtctcg cagttttctt ctgatggtac cttgtttgtt 420
gctggctttc agggaagcca tattaaagta tacaatgtcg agaaagggtg gaaagttcag 480
aaggatatta ttgccagaag tttgcgttgg acagttacgg atacctctct gtctccggat 540
caacggtttc ttgtctatgc cagtatgtgc cctattgtgc acattgttaa tatcgcatca 600
tcaacaacgg aatcagttgc aaatgtaacg gagattcacg atggtttaga cttttctgat 660
gaagacgatg ggggctatgc tttcgggatc ttctcagtaa aattttctac tgatggtcgt 720
gaactagttg ctggaagtag tgatgatgct atatgtgtct atgatctcga aactaataag 780
ctctctctca gaatcctagc acacacatct gatgtgaaca ccgtctgttt tgctgacgag 840
agtgggcatc tgatatactc cgggagtgat gataatctct gcaaggtgtg ggatagacgt 900
tgcttcatag caaaagggaa gcctgcagga gtcctaacgg gacatatcga aggaattaca 960
gatatagaca gccgtggaga tggccgatat ttaatatcaa atggaaaaga tcagacaatc 1020
aaactttggg atatcaggaa aatggccccc aatgctacaa gctctttagg gatcaggaat 1080
tatgaatggg attacagatg gatggactac ccaccaccag ctagagactt gaagcatcca 1140
tgcgatctgt ccgtggctac ttataaaggt cactcggttc tccgaacgct tattcgctgc 1200
tatttctcac caacctatag caccggccag aaatacattt acaccggatc tcacgactct 1260
tctgtttata tttatgatgt ggcgactggg gaggtagctg gagtgctgaa acaccataac 1320
tcgccggtaa gagattgcag ttggcacccg cactatccta tgctggtgag ctcttcatgg 1380
gatggggata tagtgaggtg ggagttcgtc ggcaatggag aagctccgat gcctatggcc 1440
aagaaacgac tacgcagaag gcaatactac gatgtctga 1479
<210>230
<211>490
<212>PRT
<213>Linum usitatissimum
<400>230
Met Phe Phe Val Ala Ser Thr Asp Gly Met Gly Tyr Ala Met Ser Arg
1 5 10 15
Leu Glu Ile Glu Ser Gln Leu Cys Asp Glu Glu Glu Thr Val Asn Glu
20 25 30
Val Gly Gly Gly Ser Gln Gln His Lys Ser Leu Asn Lys Ser Val Glu
35 40 45
Lys Leu Asp His Glu Val Ala Gln Asp Thr Asn Leu Lys Ser Gln Pro
50 55 60
His Arg Arg Leu Glu Lys Glu Ile Pro Gly Lys Arg Gln Phe Pro Val
65 70 75 80
Ser Pro Val Lys Met Leu Ala Gly Arg Glu Gly Asn Phe Ser Gly Arg
85 90 95
Gly Arg Phe Ser Arg Ala Asp Arg Cys His Met Leu Ser Arg Tyr Leu
100 105 110
Pro Ala Asp Gly Pro Trp Leu Val Asp Arg Met Asn Ser Arg Ala Tyr
115 120 125
Val Ser Gln Phe Ser Ser Asp Gly Thr Leu Phe Val Ala Gly Phe Gln
130 135 140
Gly Ser His Ile Lys Val Tyr Asn Val Glu Lys Gly Trp Lys Val Gln
145 150 155 160
Lys Asp Ile Ile Ala Arg Ser Leu Arg Trp Thr Val Thr Asp Thr Ser
165 170 175
Leu Ser Pro Asp Gln Arg Phe Leu Val Tyr Ala Ser Met Cys Pro Ile
180 185 190
Val His Ile Val Asn Ile Ala Ser Ser Thr Thr Glu Ser Val Ala Asn
195 200 205
Val Thr Glu Ile His Asp Gly Leu Asp Phe Ser Asp Glu Asp Asp Gly
210 215 220
Gly Tyr Ala Phe Gly Ile Phe Ser Val Lys Phe Ser Thr Asp Gly Arg
225 230 235 240
Glu Leu Val Ala Gly Ser Ser Asp Asp Ala Ile Cys Val Tyr Asp Leu
245 250 255
Glu Thr Asn Lys Leu Ser Leu Arg Ile Leu Ala His Thr Ser Asp Val
260 265 270
Asn Thr Val Cys Phe Ala Asp Glu Ser Gly His Leu Ile Tyr Ser Gly
275 280 285
Ser Asp Asp Asn Leu Cys Lys Val Trp Asp Arg Arg Cys Phe Ile Ala
290 295 300
Lys Gly Lys Pro Ala Gly Val Leu Thr Gly His Ile Glu Gly Ile Thr
305 310 315 320
Asp Ile Asp Ser Arg Gly Asp Gly Arg Tyr Leu Ile Ser Asn Gly Lys
325 330 335
Asp Gln Thr Ile Lys Leu Trp Asp Ile Arg Lys Met Ala Pro Asn Ala
340 345 350
Thr Ser Ser Leu Gly Ile Arg Asn Tyr Glu Trp Asp Tyr Arg Trp Met
355 360 365
Asp Tyr Pro Pro Pro Ala Arg Asp Leu Lys His Pro Cys Asp Leu Ser
370 375 380
Val Ala Thr Tyr Lys Gly His Ser Val Leu Arg Thr Leu Ile Arg Cys
385 390 395 400
Tyr Phe Ser Pro Thr Tyr Ser Thr Gly Gln Lys Tyr Ile Tyr Thr Gly
405 410 415
Ser His Asp Ser Ser Val Tyr Ile Tyr Asp Val Ala Thr Gly Glu Val
420 425 430
Ala Gly Val Leu Lys His His Asn Ser Pro Val Arg Asp Cys Ser Trp
435 440 445
His Pro His Tyr Pro Met Leu Val Ser Ser Ser Trp Asp Gly Asp Ile
450 455 460
Val Arg Trp Glu Phe Val Gly Asn Gly Glu Ala Pro Met Pro Met Ala
465 470 475 480
Lys Lys Arg Leu Arg Arg Arg Gln Tyr Tyr
485 490
<210>231
<211>1422
<212>DNA
<213>Lithospermum erythrorhizon
<400>231
atggggtatg ctatgagtag atttgaaact gatgtatctg taatctttag ttcaagttct 60
gattctgaga cttctcatga ttctcttatc aataagccag tgaaaaattt ggatcatgaa 120
attgctcagc ttactaggct tagatcagca ccccacgaga atctaagtag agacctacta 180
gttaagaggg tattgccgct ttcgacaatg aaaatgctgg ctggcagaga agctaatgtt 240
tcaggaagag ggaggttttc atctgcagat tgttgtcatg tagtcagtcg acatttgcct 300
gttaacgatc cttgtgttgt cgatcaaatg acatctagag tttatttgtc acagttttcg 360
actgatggtt ctcttttcat tgctggcttt cagggatgcc acatcagaat atacaatgta 420
gataaagggt ggaaagttca aaacgacatt atagcaaaat gtgtgagatg gacaattact 480
gatgcatctc tttctccaga tcaaaagttc cttgcctatg ctagcttgac accaattgca 540
catattgtaa aatttggttc tgctgctacg gaatctcatg caaatgttac ggatatacat 600
gatggattgg atttttcatc taacgatgat gggggatact cttttggggt attttccatc 660
aagttttcaa ccgatggacg ggaaattgta gctggtacca gtgatgaatc aatttgtgtt 720
tatgatctag aagcagatag actttccctt agaatttcag cccacgagtc agatgttaac 780
tctgtatgct ttgctgatga aagcggccat cttatttatt ctggaagtga tgacaatctc 840
tgcaaggttt gggacagacg ttgcttcaat gccaaaggaa aaccagcagg catcttgatg 900
ggacacctcg aaggaattac atttattgat agccgaggag atgggcgata ttttatttca 960
aatggtaaag atcagacaat caaactctgg gatatccgca aaatgtcctc gaatgctggc 1020
ggcacaattc aaagcagaaa tagtgaatgg gactacagat ggatggaata tccacaagag 1080
gcaagagatt tgaagcatcc atctgatcta tcgggtgcta cttacaaagg ccactccgtc 1140
ttgtgtactc ttattcgctg ctacttctcc ccagactata gtactggcca gaaatacatc 1200
tacactggat ctcatgatgc aaatgtttat atctacgact tggtaactgg agatcaagtt 1260
tctacacttc agtaccataa ggcaactgta agggattgta gttggcaccc aaactatcct 1320
atgcttgtta gctcatcgtt tgacggagaa attgtcaaat gggaatatcg tggaaacgat 1380
gaagctcccg tccaaggaaa caatcagcgg cttcaaagat ga 1422
<210>232
<211>473
<212>PRT
<213>Lithospermum erythrorhizon
<400>232
Met Gly Tyr Ala Met Ser Arg Phe Glu Thr Asp Val Ser Val Ile Phe
1 5 10 15
Ser Ser Ser Ser Asp Ser Glu Thr Ser His Asp Ser Leu Ile Asn Lys
20 25 30
Pro Val Lys Asn Leu Asp His Glu Ile Ala Gln Leu Thr Arg Leu Arg
35 40 45
Ser Ala Pro His Glu Asn Leu Ser Arg Asp Leu Leu Val Lys Arg Val
50 55 60
Leu Pro Leu Ser Thr Met Lys Met Leu Ala Gly Arg Glu Ala Asn Val
65 70 75 80
Ser Gly Arg Gly Arg Phe Ser Ser Ala Asp Cys Cys His Val Val Ser
85 90 95
Arg His Leu Pro Val Asn Asp Pro Cys Val Val Asp Gln Met Thr Ser
100 105 110
Arg Val Tyr Leu Ser Gln Phe Ser Thr Asp Gly Ser Leu Phe Ile Ala
115 120 125
Gly Phe Gln Gly Cys His Ile Arg Ile Tyr Asn Val Asp Lys Gly Trp
130 135 140
Lys Val Gln Asn Asp Ile Ile Ala Lys Cys Val Arg Trp Thr Ile Thr
145 150 155 160
Asp Ala Ser Leu Ser Pro Asp Gln Lys Phe Leu Ala Tyr Ala Ser Leu
165 170 175
Thr Pro Ile Ala His Ile Val Lys Phe Gly Ser Ala Ala Thr Glu Ser
180 185 190
His Ala Asn Val Thr Asp Ile His Asp Gly Leu Asp Phe Ser Ser Asn
195 200 205
Asp Asp Gly Gly Tyr Ser Phe Gly Val Phe Ser Ile Lys Phe Ser Thr
210 215 220
Asp Gly Arg Glu Ile Val Ala Gly Thr Ser Asp Glu Ser Ile Cys Val
225 230 235 240
Tyr Asp Leu Glu Ala Asp Arg Leu Ser Leu Arg Ile Ser Ala His Glu
245 250 255
Ser Asp Val Asn Ser Val Cys Phe Ala Asp Glu Ser Gly His Leu Ile
260 265 270
Tyr Ser Gly Ser Asp Asp Asn Leu Cys Lys Val Trp Asp Arg Arg Cys
275 280 285
Phe Asn Ala Lys Gly Lys Pro Ala Gly Ile Leu Met Gly His Leu Glu
290 295 300
Gly Ile Thr Phe Ile Asp Ser Arg Gly Asp Gly Arg Tyr Phe Ile Ser
305 310 315 320
Asn Gly Lys Asp Gln Thr Ile Lys Leu Trp Asp Ile Arg Lys Met Ser
325 330 335
Ser Asn Ala Gly Gly Thr Ile Gln Ser Arg Asn Ser Glu Trp Asp Tyr
340 345 350
Arg Trp Met Glu Tyr Pro Gln Glu Ala Arg Asp Leu Lys His Pro Ser
355 360 365
Asp Leu Ser Gly Ala Thr Tyr Lys Gly His Ser Val Leu Cys Thr Leu
370 375 380
Ile Arg Cys Tyr Phe Ser Pro Asp Tyr Ser Thr Gly Gln Lys Tyr Ile
385 390 395 400
Tyr Thr Gly Ser His Asp Ala Asn Val Tyr Ile Tyr Asp Leu Val Thr
405 410 415
Gly Asp Gln Val Ser Thr Leu Gln Tyr His Lys Ala Thr Val Arg Asp
420 425 430
Cys Ser Trp His Pro Asn Tyr Pro Met Leu Val Ser Ser Ser Phe Asp
435 440 445
Gly Glu Ile Val Lys Trp Glu Tyr Arg Gly Asn Asp Glu Ala Pro Val
450 455 460
Gln Gly Asn Asn Gln Arg Leu Gln Arg
465 470
<210>233
<211>1464
<212>DNA
<213>Lycopersicon esculentum
<400>233
atgtattttg atttcttaca cccagagtcc attgaagaca tggggtattc tctaagtaag 60
ttagaagtag acaccggact ctttgatggt tcaagttcca atcatggggt tgctagcagt 120
gttcatcatg aaagaccaac aaattatttg gaccatgaaa tttctcaact tactaagctt 180
agatcaggac cccatgaaaa tctcagtaga atcctaccag ggaaaaagga agttcctgta 240
tccgcattca agatgttagc tgctcgagaa gccaatattt ccggtagagg aaggttttcg 300
aaggcagatt gttgtcatgt tctaagtaaa tatttgccag ttagtggtcc ttggattgtg 360
gaccagatgg aaaccagagc ttatgtatca caattttcag cagatggttc cctttttgtt 420
gctgcctttc agggaagtca tattagaata tacaatgtgg aaagagggtg gaaagttcac 480
aagaatattc atgcaaaaag tttgagatgg acagttactg atacatctct ttctccggat 540
caacgtcatt tggtctatgc tactatgtca cccatcgtac atattgtaga tgtaggatct 600
gctgcctctg aatctgtagc caacatcaca gaaattcatg atggtttgct tttgtctact 660
gacaatgatg attttggaat tttctctgtg aaattttcta ctgaaggtcg ggaagttgtt 720
gctggaagta gtgatgatgc gatctatgtt tatgatcttg aagcaaacaa actctctctt 780
cgaatatccg cacacaattc tgatgtcaat tctgtatgtt ttgctgacga aagtggccat 840
ctcatttatt ctggaagtga tgacaatctg tgtaaggtct gggatagacg ttgttttagg 900
gccaaagaaa agccagccgg agtcttgatg ggacacctag aaggcgttac gttccttgat 960
agtcgggggg atggtcgtta tttcatttct aacagtaaag atcagtccat caagctctgg 1020
gatatccgca aaatgtcttc tcatgctgct cgcaatatct ggttcaggaa ttatgagtgg 1080
gactatagat ggatggacta ccctgctcaa gctagagacg tgaagcaccc ttatgatcag 1140
tcaatatcca cttataaggg tcattctgtc ttgcgtactc taattcgctg ctacttctca 1200
ccagaatata gcactgggca gagatacatt tacacaggat cccatgatgc ctgcgtatac 1260
atctatgatt tggtaagtgg agagcaagtc gcgaaattgc agcaccaccg gtcgaccatt 1320
agagattgta gctggcaccc tacttatcca atgcttgtta gctcttcttg ggatggagat 1380
gttgtcaaat gggaattccc tggaaatggt gaagcaccac tccctccaaa aaggaagcag 1440
atcagaagaa ggcatttctt ttaa 1464
<210>234
<211>487
<212>PRT
<213>Lycopersicon esculentum
<400>234
Met Tyr Phe Asp Phe Leu His Pro Glu Ser Ile Glu Asp Met Gly Tyr
1 5 10 15
Ser Leu Ser Lys Leu Glu Val Asp Thr Gly Leu Phe Asp Gly Ser Ser
20 25 30
Ser Asn His Gly Val Ala Ser Ser Val His His Glu Arg Pro Thr Asn
35 40 45
Tyr Leu Asp His Glu Ile Ser Gln Leu Thr Lys Leu Arg Ser Gly Pro
50 55 60
His Glu Asn Leu Ser Arg Ile Leu Pro Gly Lys Lys Glu Val Pro Val
65 70 75 80
Ser Ala Phe Lys Met Leu Ala Ala Arg Glu Ala Asn Ile Ser Gly Arg
85 90 95
Gly Arg Phe Ser Lys Ala Asp Cys Cys His Val Leu Ser Lys Tyr Leu
100 105 110
Pro Val Ser Gly Pro Trp Ile Val Asp Gln Met Glu Thr Arg Ala Tyr
115 120 125
Val Ser Gln Phe Ser Ala Asp Gly Ser Leu Phe Val Ala Ala Phe Gln
130 135 140
Gly Ser His Ile Arg Ile Tyr Asn Val Glu Arg Gly Trp Lys Val His
145 150 155 160
Lys Asn Ile His Ala Lys Ser Leu Arg Trp Thr Val Thr Asp Thr Ser
165 170 175
Leu Ser Pro Asp Gln Arg His Leu Val Tyr Ala Thr Met Ser Pro Ile
180 185 190
Val His Ile Val Asp Val Gly Ser Ala Ala Ser Glu Ser Val Ala Asn
195 200 205
Ile Thr Glu Ile His Asp Gly Leu Leu Leu Ser Thr Asp Asn Asp Asp
210 215 220
Phe Gly Ile Phe Ser Val Lys Phe Ser Thr Glu Gly Arg Glu Val Val
225 230 235 240
Ala Gly Ser Ser Asp Asp Ala Ile Tyr Val Tyr Asp Leu Glu Ala Asn
245 250 255
Lys Leu Ser Leu Arg Ile Ser Ala His Asn Ser Asp Val Asn Ser Val
260 265 270
Cys Phe Ala Asp Glu Ser Gly His Leu Ile Tyr Ser Gly Ser Asp Asp
275 280 285
Asn Leu Cys Lys Val Trp Asp Arg Arg Cys Phe Arg Ala Lys Glu Lys
290 295 300
Pro Ala Gly Val Leu Met Gly His Leu Glu Gly Val Thr Phe Leu Asp
305 310 315 320
Ser Arg Gly Asp Gly Arg Tyr Phe Ile Ser Asn Ser Lys Asp Gln Ser
325 330 335
Ile Lys Leu Trp Asp Ile Arg Lys Met Ser Ser His Ala Ala Arg Asn
340 345 350
Ile Trp Phe Arg Asn Tyr Glu Trp Asp Tyr Arg Trp Met Asp Tyr Pro
355 360 365
Ala Gln Ala Arg Asp Val Lys His Pro Tyr Asp Gln Ser Ile Ser Thr
370 375 380
Tyr Lys Gly His Ser Val Leu Arg Thr Leu Ile Arg Cys Tyr Phe Ser
385 390 395 400
Pro Glu Tyr Ser Thr Gly Gln Arg Tyr Ile Tyr Thr Gly Ser His Asp
405 410 415
Ala Cys Val Tyr Ile Tyr Asp Leu Val Ser Gly Glu Gln Val Ala Lys
420 425 430
Leu Gln His His Arg Ser Thr Ile Arg Asp Cys Ser Trp His Pro Thr
435 440 445
Tyr Pro Met Leu Val Ser Ser Ser Trp Asp Gly Asp Val Val Lys Trp
450 455 460
Glu Phe Pro Gly Asn Gly Glu Ala Pro Leu Pro Pro Lys Arg Lys Gln
465 470 475 480
Ile Arg Arg Arg His Phe Phe
485
<210>235
<211>1488
<212>DNA
<213>Medicago truncatula
<400>235
atgtacgcta tatccggtgc actttacgtt gaccaaatgg gctatgctat gagtagatta 60
gacgtggact ctagtgatac tgaagatgga aatgcaatcc ttgaagattc tagtactgga 120
aaagctaaaa aggcatttga aaatttagac aatgaaattg ctcaaataac caagttgaaa 180
tcaacacctc atcaactgct agtacatgat ggatctggaa ggaaagagtt gcctgtttcc 240
ccggtgaaga tgctggcagg ccgcgaatct aattgttcag gacggggaag gttttcttcc 300
gctgatcgct gtcatctttt gagcaggtat ttacctgtaa atggtccttg gcctatcgac 360
caaatgccta gtcgagcata cgtgtctcag ttttcagctg atggttctct ttttgttgct 420
gggttccagg gaaaccacat aaaaatatac aatgtggaga aaggttggaa agttcaaaaa 480
aacattctaa ccaagagttt gagatggaca atcactgata cttctctttc ccctgatcaa 540
agtcatctag tttatgccag catgtcaccc attgtacaca ttgtgaatgt tggatcttct 600
gagacagagt cactagcaaa tgtgacggag atccacgatg ggttggattt ttcatcaaac 660
gacgatggag gatactcttt tggaattttc tctttgaaat tttcaacaga tgggaaggaa 720
ttagttgcag gaactagtgg cgattctata tatgtatacg atcttgaaac aaataaggtt 780
tcacttcgaa ttttagcaca cacggctgat gtaaacactg tatgttttgc tgatgaaact 840
ggccatctta tttactctgg aagtgatgat agtttctgca aggtctggga tcggcgttgc 900
ttaaatgcta aagacaagcc agcaggggtt ttgatgggac accttgaggg cattacgttt 960
attgattccc gtggagatgg acgctatttc atttcaaacg gtaaagatca gaccatcaaa 1020
ctttgggaca tacgtaaaat gtcatccaat gttaccagta accgtgtccg tggatatagg 1080
agttttgaat gggattacag gtggatggat tacccgccac aagcaaaaga cttgaatcat 1140
ccttgtgatc agtcagtggc tacatataga ggccattcag tcttacgcac tcttgtccgc 1200
tgcttttttt ctccagcttt tagcactggc cagaagtaca tctatactgg atcacacaac 1260
gcatgtgttt atgtatatga tttggtgagt ggagcacaag ttgcaacatt gaagcaccat 1320
aaatcacctg taagagattg tagttggcat cccttccacc ctatgcttgt tagctcttct 1380
tgggatggag atgttgtaaa atggcaatct gctggaagct ctgatatggc agcctcgtcg 1440
gttaagaaga gggtaaacaa aagacatttt tatgaagatt acctatga 1488
<210>236
<211>495
<212>PRT
<213>Medicago truncatula
<400>236
Met Tyr Ala Ile Ser Gly Ala Leu Tyr Val Asp Gln Met Gly Tyr Ala
1 5 10 15
Met Ser Arg Leu Asp Val Asp Ser Ser Asp Thr Glu Asp Gly Asn Ala
20 25 30
Ile Leu Glu Asp Ser Ser Thr Gly Lys Ala Lys Lys Ala Phe Glu Asn
35 40 45
Leu Asp Asn Glu Ile Ala Gln Ile Thr Lys Leu Lys Ser Thr Pro His
50 55 60
Gln Leu Leu Val His Asp Gly Ser Gly Arg Lys Glu Leu Pro Val Ser
65 70 75 80
Pro Val Lys Met Leu Ala Gly Arg Glu Ser Asn Cys Ser Gly Arg Gly
85 90 95
Arg Phe Ser Ser Ala Asp Arg Cys His Leu Leu Ser Arg Tyr Leu Pro
100 105 110
Val Asn Gly Pro Trp Pro Ile Asp Gln Met Pro Ser Arg Ala Tyr Val
115 120 125
Ser Gln Phe Ser Ala Asp Gly Ser Leu Phe Val Ala Gly Phe Gln Gly
130 135 140
Asn His Ile Lys Ile Tyr Asn Val Glu Lys Gly Trp Lys Val Gln Lys
145 150 155 160
Asn Ile Leu Thr Lys Ser Leu Arg Trp Thr Ile Thr Asp Thr Ser Leu
165 170 175
Ser Pro Asp Gln Ser His Leu Val Tyr Ala Ser Met Ser Pro Ile Val
180 185 190
His Ile Val Asn Val Gly Ser Ser Glu Thr Glu Ser Leu Ala Asn Val
195 200 205
Thr Glu Ile His Asp Gly Leu Asp Phe Ser Ser Asn Asp Asp Gly Gly
210 215 220
Tyr Ser Phe Gly Ile Phe Ser Leu Lys Phe Ser Thr Asp Gly Lys Glu
225 230 235 240
Leu Val Ala Gly Thr Ser Gly Asp Ser Ile Tyr Val Tyr Asp Leu Glu
245 250 255
Thr Asn Lys Val Ser Leu Arg Ile Leu Ala His Thr Ala Asp Val Asn
260 265 270
Thr Val Cys Phe Ala Asp Glu Thr Gly His Leu Ile Tyr Ser Gly Ser
275 280 285
Asp Asp Ser Phe Cys Lys Val Trp Asp Arg Arg Cys Leu Asn Ala Lys
290 295 300
Asp Lys Pro Ala Gly Val Leu Met Gly His Leu Glu Gly Ile Thr Phe
305 310 315 320
Ile Asp Ser Arg Gly Asp Gly Arg Tyr Phe Ile Ser Asn Gly Lys Asp
325 330 335
Gln Thr Ile Lys Leu Trp Asp Ile Arg Lys Met Ser Ser Asn Val Thr
340 345 350
Ser Asn Arg Val Arg Gly Tyr Arg Ser Phe Glu Trp Asp Tyr Arg Trp
355 360 365
Met Asp Tyr Pro Pro Gln Ala Lys Asp Leu Asn His Pro Cys Asp Gln
370 375 380
Ser Val Ala Thr Tyr Arg Gly His Ser Val Leu Arg Thr Leu Val Arg
385 390 395 400
Cys Phe Phe Ser Pro Ala Phe Ser Thr Gly Gln Lys Tyr Ile Tyr Thr
405 410 415
Gly Ser His Asn Ala Cys Val Tyr Val Tyr Asp Leu Val Ser Gly Ala
420 425 430
Gln Val Ala Thr Leu Lys His His Lys Ser Pro Val Arg Asp Cys Ser
435 440 445
Trp His Pro Phe His Pro Met Leu Val Ser Ser Ser Trp Asp Gly Asp
450 455 460
Val Val Lys Trp Gln Ser Ala Gly Ser Ser Asp Met Ala Ala Ser Ser
465 470 475 480
Val Lys Lys Arg Val Asn Lys Arg His Phe Tyr Glu Asp Tyr Leu
485 490 495
<210>237
<211>1392
<212>DNA
<213>Oryza sativa
<400>237
atgggttatg gcatgagtag gatggaggag gaatacagcg agcatgaaga tcagaataat 60
ggtggatcta attcacaagt gaataatgag ttcttaaaca cacataatga tattttccat 120
atgactcaaa taagatcagg acctagtgaa agtcttcgca agtctattgg tacaagcaaa 180
gatgtgatat cgacaaccag gttattgtct ggaagggaaa ttaattcttc aggaaatggg 240
aagttctctt cagttgatcg tgcgtttctt cttggtcgtt atcttccagt tgatggccct 300
gaaatagtgg acaggatgga ttcccgagct tatgtttcac agttttctgc tgatggatct 360
ctttttgttg ctggttttca gggaagccac ataagaatat atgatgttga taaaggttgg 420
aaagtacata gggacattca tgctagaagt ttgagatgga ccattagtga cgcatcactt 480
tcccctgatc aacagtttct tgtctactcc agtctagcac cgattatcca tatcgtcaat 540
gttgggactg ctgcaaaaca atcatatgct aatatcactg acatccacga tggactagat 600
ttttcacagc atgaagatgt tcgatataca tttggaatat tttctgttaa attctcttct 660
gatggccgag agcttgttgc tggcagtaac gatgattcaa tatatgttta tgaccttgtg 720
gcaaacaaac taacgttgcg tttgcctgct catcattctg atgtcaacac agtagcattt 780
gctgacgaaa ctggccatct catatattct ggaagtgatg ataatttatg caaggtctgg 840
gataggcgat gtttatccac agaagaacct gctggagttt tgactgggca tttgcatggc 900
attactcata ttgatagccg tggagatggt cggtgtttca tatcaaatgg aaaagaccaa 960
gctatcaaaa tgtgggacat caggaaaatg acatccaatg ctgatagtta tgaagacaga 1020
acctcaaatt gggactatag atattcaaga tatccacaac agtataagca actaaagcat 1080
ccccatgatc agtcaatagc tacatactgg ggccattcag ttcttcgtac attgatccgt 1140
tgctattttt ctcctgcata tagcacagga cagaagtaca tatatacagg atcctatgat 1200
tctagtgttt gtatctacga tgtggtgagc ggatcacaag ttgcaaaact caaaggatat 1260
catcagctgg caattcgaga ctgcagttgg catcccttcg accctatgct tgtcagctca 1320
tcctgggacg gccgggttgc caagtggtcc aggtcttcct gtcagcaaga agagactact 1380
gatctcgatt ga 1392
<210>238
<211>463
<212>PRT
<213>Oryza sativa
<400>238
Met Gly Tyr Gly Met Ser Arg Met Glu Glu Glu Tyr Ser Glu His Glu
1 5 10 15
Asp Gln Asn Asn Gly Gly Ser Asn Ser Gln Val Asn Asn Glu Phe Leu
20 25 30
Asn Thr His Asn Asp Ile Phe His Met Thr Gln Ile Arg Ser Gly Pro
35 40 45
Ser Glu Ser Leu Arg Lys Ser Ile Gly Thr Ser Lys Asp Val Ile Ser
50 55 60
Thr Thr Arg Leu Leu Ser Gly Arg Glu Ile Asn Ser Ser Gly Asn Gly
65 70 75 80
Lys Phe Ser Ser Val Asp Arg Ala Phe Leu Leu Gly Arg Tyr Leu Pro
85 90 95
Val Asp Gly Pro Glu Ile Val Asp Arg Met Asp Ser Arg Ala Tyr Val
100 105 110
Ser Gln Phe Ser Ala Asp Gly Ser Leu Phe Val Ala Gly Phe Gln Gly
115 120 125
Ser His Ile Arg Ile Tyr Asp Val Asp Lys Gly Trp Lys Val His Arg
130 135 140
Asp Ile His Ala Arg Ser Leu Arg Trp Thr Ile Ser Asp Ala Ser Leu
145 150 155 160
Ser Pro Asp Gln Gln Phe Leu Val Tyr Ser Ser Leu Ala Pro Ile Ile
165 170 175
His Ile Val Asn Val Gly Thr Ala Ala Lys Gln Ser Tyr Ala Asn Ile
180 185 190
Thr Asp Ile His Asp Gly Leu Asp Phe Ser Gln His Glu Asp Val Arg
195 200 205
Tyr Thr Phe Gly Ile Phe Ser Val Lys Phe Ser Ser Asp Gly Arg Glu
210 215 220
Leu Val Ala Gly Ser Asn Asp Asp Ser Ile Tyr Val Tyr Asp Leu Val
225 230 235 240
Ala Asn Lys Leu Thr Leu Arg Leu Pro Ala His His Ser Asp Val Asn
245 250 255
Thr Val Ala Phe Ala Asp Glu Thr Gly His Leu Ile Tyr Ser Gly Ser
260 265 270
Asp Asp Asn Leu Cys Lys Val Trp Asp Arg Arg Cys Leu Ser Thr Glu
275 280 285
Glu Pro Ala Gly Val Leu Thr Gly His Leu His Gly Ile Thr His Ile
290 295 300
Asp Ser Arg Gly Asp Gly Arg Cys Phe Ile Ser Asn Gly Lys Asp Gln
305 310 315 320
Ala Ile Lys Met Trp Asp Ile Arg Lys Met Thr Ser Asn Ala Asp Ser
325 330 335
Tyr Glu Asp Arg Thr Ser Asn Trp Asp Tyr Arg Tyr Ser Arg Tyr Pro
340 345 350
Gln Gln Tyr Lys Gln Leu Lys His Pro His Asp Gln Ser Ile Ala Thr
355 360 365
Tyr Trp Gly His Ser Val Leu Arg Thr Leu Ile Arg Cys Tyr Phe Ser
370 375 380
Pro Ala Tyr Ser Thr Gly Gln Lys Tyr Ile Tyr Thr Gly Ser Tyr Asp
385 390 395 400
Ser Ser Val Cys Ile Tyr Asp Val Val Ser Gly Ser Gln Val Ala Lys
405 410 415
Leu Lys Gly Tyr His Gln Leu Ala Ile Arg Asp Cys Ser Trp His Pro
420 425 430
Phe Asp Pro Met Leu Val Ser Ser Ser Trp Asp Gly Arg Val Ala Lys
435 440 445
Trp Ser Arg Ser Ser Cys Gln Gln Glu Glu Thr Thr Asp Leu Asp
450 455 460
<210>239
<211>1476
<212>DNA
<213>Pinus radiata
<400>239
atgaatacgg caatgcattt tggtgctggt tggcgatcga ttgctgagat ggggtatacg 60
atgagcagac tagagattga gcctgagtcg tgtgaggacg agaagagctt ggatggggtt 120
ggtaacagcc agggaccgaa tgagttgccg agatgcttgg atcatgagtt ggcgcatttg 180
acgaatctga agtcgaggcc ccatgaacat ttgatccgag atttccctgg gaggcgggct 240
ctgcctgttt ccaccgttaa gatgctggcg ggtcgagagt gtaattattc acgaagaggg 300
aggttctcct ccgctgattg ctgtcacatg ctgagcagat atgtgcctgt taatggtcct 360
tcgcccctgg atcagatgaa tagtcgagct tatgtttcgc aattttcagc tgatggttct 420
ctatttgttg ctggctttca gggtagccac attagaattt ataatgttga taaaggatgg 480
aaatgtcaga agaacattct taccaagagt ttacggtgga cgatcactga tacatctctt 540
tctcctgacc aacgttacct tgtgtatgcc agtatgtcac ccatcgtcca tattgttgac 600
atcggctccg ctgctatgga ttctcttgca aacatcacgg agatccatga gggtttggat 660
ttttccgctg acagtggacc atattctttt ggaatcttct ctgttaaatt ttctaccgat 720
ggacgagaag tcgtcgctgg aagcagcgac gattctatat atgtctatga tcttgtggca 780
aataagcttt ccctcagaat tccagcacat gagtctgatg tgaacacagt atgctttgct 840
gatgaaagtg gtcatataat ttattctggg agtgatgata catactgcaa ggtgtgggat 900
agacgttgcc tgagtgccag aaataaacct gcaggagttc taatgggaca ccttgaaggc 960
attacgttca ttgatagccg tggtgatggt cgttatttca tatcaaatgg caaagatcag 1020
acgatcaaac tttgggatat ccggaaaatg ggctctgata tctgtcgtcg aggctttagg 1080
aatttcgaat gggattacag atggatggac tacccacccc gggctaggga ttcgaaacac 1140
ccttttgatc tgtcagtggc aacatataaa ggccattcgg tgttgcgtac tcttattcgg 1200
tgctacttct ccccagtaca tagcactggt caaaagtata tctacactgg atcccatgat 1260
tcctgtgttt atatctatga tgtggtgact ggagctcaag ttgcggccct caagcaccat 1320
aaatcgccgg tcagagactg cagttggcac ccggagtacc cgatgattgt gagctcttct 1380
tgggatgggg atattgtgaa atgggaattc tttgggaacg gagaaactga gatcccggcg 1440
atgaagaaga ggatccggag gcggcatttg tattaa 1476
<210>240
<211>491
<212>PRT
<213>Pinus radiata
<400>240
Met Asn Thr Ala Met His Phe Gly Ala Gly Trp Arg Ser Ile Ala Glu
1 5 10 15
Met Gly Tyr Thr Met Ser Arg Leu Glu Ile Glu Pro Glu Ser Cys Glu
20 25 30
Asp Glu Lys Ser Leu Asp Gly Val Gly Asn Ser Gln Gly Pro Asn Glu
35 40 45
Leu Pro Arg Cys Leu Asp His Glu Leu Ala His Leu Thr Asn Leu Lys
50 55 60
Ser Arg Pro His Glu His Leu Ile Arg Asp Phe Pro Gly Arg Arg Ala
65 70 75 80
Leu Pro Val Ser Thr Val Lys Met Leu Ala Gly Arg Glu Cys Asn Tyr
85 90 95
Ser Arg Arg Gly Arg Phe Ser Ser Ala Asp Cys Cys His Met Leu Ser
100 105 110
Arg Tyr Val Pro Val Asn Gly Pro Ser Pro Leu Asp Gln Met Asn Ser
115 120 125
Arg Ala Tyr Val Ser Gln Phe Ser Ala Asp Gly Ser Leu Phe Val Ala
130 135 140
Gly Phe Gln Gly Ser His Ile Arg Ile Tyr Asn Val Asp Lys Gly Trp
145 150 155 160
Lys Cys Gln Lys Asn Ile Leu Thr Lys Ser Leu Arg Trp Thr Ile Thr
165 170 175
Asp Thr Ser Leu Ser Pro Asp Gln Arg Tyr Leu Val Tyr Ala Ser Met
180 185 190
Ser Pro Ile Val His Ile Val Asp Ile Gly Ser Ala Ala Met Asp Ser
195 200 205
Leu Ala Asn Ile Thr Glu Ile His Glu Gly Leu Asp Phe Ser Ala Asp
210 215 220
Ser Gly Pro Tyr Ser Phe Gly Ile Phe Ser Val Lys Phe Ser Thr Asp
225 230 235 240
Gly Arg Glu Val Val Ala Gly Ser Ser Asp Asp Ser Ile Tyr Val Tyr
245 250 255
Asp Leu Val Ala Asn Lys Leu Ser Leu Arg Ile Pro Ala His Glu Ser
260 265 270
Asp Val Asn Thr Val Cys Phe Ala Asp Glu Ser Gly His Ile Ile Tyr
275 280 285
Ser Gly Ser Asp Asp Thr Tyr Cys Lys Val Trp Asp Arg Arg Cys Leu
290 295 300
Ser Ala Arg Asn Lys Pro Ala Gly Val Leu Met Gly His Leu Glu Gly
305 310 315 320
Ile Thr Phe Ile Asp Ser Arg Gly Asp Gly Arg Tyr Phe Ile Ser Asn
325 330 335
Gly Lys Asp Gln Thr Ile Lys Leu Trp Asp Ile Arg Lys Met Gly Ser
340 345 350
Asp Ile Cys Arg Arg Gly Phe Arg Asn Phe Glu Trp Asp Tyr Arg Trp
355 360 365
Met Asp Tyr Pro Pro Arg Ala Arg Asp Ser Lys His Pro Phe Asp Leu
370 375 380
Ser Val Ala Thr Tyr Lys Gly His Ser Val Leu Arg Thr Leu Ile Arg
385 390 395 400
Cys Tyr Phe Ser Pro Val His Ser Thr Gly Gln Lys Tyr Ile Tyr Thr
405 410 415
Gly Ser His Asp Ser Cys Val Tyr Ile Tyr Asp Val Val Thr Gly Ala
420 425 430
Gln Val Ala Ala Leu Lys His His Lys Ser Pro Val Arg Asp Cys Ser
435 440 445
Trp His Pro Glu Tyr Pro Met Ile Val Ser Ser Ser Trp Asp Gly Asp
450 455 460
Ile Val Lys Trp Glu Phe Phe Gly Asn Gly Glu Thr Glu Ile Pro Ala
465 470 475 480
Met Lys Lys Arg Ile Arg Arg Arg His Leu Tyr
485 490
<210>241
<211>1467
<212>DNA
<213>Populus tremuloides
<400>241
atgtattttt ttgctagaag aacttcggta gacgaaatgg gttacgctat gagcagactg 60
gagactgaat ctgaactctg tgatggtggc aagaccattc ctgaggctgg tagcagcaag 120
agagccagca attggttgaa taacttagac catgaaattg ctcaggttac gaagttgaaa 180
tctagtccgc ataaacagct ggccgaactt gttcccggca tgcataagtc atctgtttcc 240
actgttaaga tgttggttgg tcgagaagct aattattcag caaggggaag attctctgca 300
gctgatcgtt gtcacatgct tagcagatat ttgcctgtca atggaccttg gcttgttgac 360
cagatgagta cccgagccta tgtctcgcag ttttcagctg atggctctct atttgttgct 420
gggtttcagg gaagctatat tagaatatac aatgtggaga aggggtggaa agttcagaag 480
aacattcttg ccaaaagttt gcgttggact gttaccgata catctctttc cccagatcag 540
cgccatctcg tttatgcaag catgtcacct attgtccata ttgttgatgc tgggtctgcc 600
gaaacagagt cgcttgcaaa cgttacggag tttcatgacg gattggactt ttcttctggt 660
gatgggggct attcttttgg aatcttctct gtgaaatttt ccactgatgg gcgagaactt 720
gttgcaggaa gtaatgatga ctccatatat gtctacgacc ttgaacaaaa taagctctcc 780
ctcagaattt tggcacacac gtctgatgtt aacactgtat gttttgctga tgaaagtggc 840
caccttatct tttctgggag tgatgataat ctttgcaagg tgtgggatag acgctgcttt 900
attgcaaaag ggaagccagc tggagtccta acaggacact tagaaggcat aacatttatt 960
gacagccatg gagatggtcg gtattttatc tcaaatggta aagatcagac tatcaaactt 1020
tgggatattc ggaaaatggc ccctaatgct actagctatt cagggcttag gaattatgaa 1080
tgggactaca ggtggatgga ctacccatat gaggcaagag atttgaaaca cccctgcgat 1140
caatcagtag ccacatataa aggtcattcg gttctgcgca ctcttatccg ctgctacttt 1200
tctcctgtat atagtactgg ccagaagtac atctacactg gatctcatga ttcttgtgtt 1260
tatatttatg acttggtgac tggagaacta gtttcattac tacaacatca taaatcacct 1320
gtaagagatt gtagttggca cccatattat cccatgctcg tcagctcttc ttgggacgga 1380
gatgtcgtaa aatgggagtt tcctggcaat ggagaagctc cagtcccttc aaccaagaag 1440
agaattcgaa ggagacaatt tgattga 1467
<210>242
<211>488
<212>PRT
<213>Populus tremuloides
<400>242
Met Tyr Phe Phe Ala Arg Arg Thr Ser Val Asp Glu Met Gly Tyr Ala
1 5 10 15
Met Ser Arg Leu Glu Thr Glu Ser Glu Leu Cys Asp Gly Gly Lys Thr
20 25 30
Ile Pro Glu Ala Gly Ser Ser Lys Arg Ala Ser Asn Trp Leu Asn Asn
35 40 45
Leu Asp His Glu Ile Ala Gln Val Thr Lys Leu Lys Ser Ser Pro His
50 55 60
Lys Gln Leu Ala Glu Leu Val Pro Gly Met His Lys Ser Ser Val Ser
65 70 75 80
Thr Val Lys Met Leu Val Gly Arg Glu Ala Asn Tyr Ser Ala Arg Gly
85 90 95
Arg Phe Ser Ala Ala Asp Arg Cys His Met Leu Ser Arg Tyr Leu Pro
100 105 110
Val Asn Gly Pro Trp Leu Val Asp Gln Met Ser Thr Arg Ala Tyr Val
115 120 125
Ser Gln Phe Ser Ala Asp Gly Ser Leu Phe Val Ala Gly Phe Gln Gly
130 135 140
Ser Tyr Ile Arg Ile Tyr Asn Val Glu Lys Gly Trp Lys Val Gln Lys
145 150 155 160
Asn Ile Leu Ala Lys Ser Leu Arg Trp Thr Val Thr Asp Thr Ser Leu
165 170 175
Ser Pro Asp Gln Arg His Leu Val Tyr Ala Ser Met Ser Pro Ile Val
180 185 190
His Ile Val Asp Ala Gly Ser Ala Glu Thr Glu Ser Leu Ala Asn Val
195 200 205
Thr Glu Phe His Asp Gly Leu Asp Phe Ser Ser Gly Asp Gly Gly Tyr
210 215 220
Ser Phe Gly Ile Phe Ser Val Lys Phe Ser Thr Asp Gly Arg Glu Leu
225 230 235 240
Val Ala Gly Ser Asn Asp Asp Ser Ile Tyr Val Tyr Asp Leu Glu Gln
245 250 255
Asn Lys Leu Ser Leu Arg Ile Leu Ala His Thr Ser Asp Val Asn Thr
260 265 270
Val Cys Phe Ala Asp Glu Ser Gly His Leu Ile Phe Ser Gly Ser Asp
275 280 285
Asp Asn Leu Cys Lys Val Trp Asp Arg Arg Cys Phe Ile Ala Lys Gly
290 295 300
Lys Pro Ala Gly Val Leu Thr Gly His Leu Glu Gly Ile Thr Phe Ile
305 310 315 320
Asp Ser His Gly Asp Gly Arg Tyr Phe Ile Ser Asn Gly Lys Asp Gln
325 330 335
Thr Ile Lys Leu Trp Asp Ile Arg Lys Met Ala Pro Asn Ala Thr Ser
340 345 350
Tyr Ser Gly Leu Arg Asn Tyr Glu Trp Asp Tyr Arg Trp Met Asp Tyr
355 360 365
Pro Tyr Glu Ala Arg Asp Leu Lys His Pro Cys Asp Gln Ser Val Ala
370 375 380
Thr Tyr Lys Gly His Ser Val Leu Arg Thr Leu Ile Arg Cys Tyr Phe
385 390 395 400
Ser Pro Val Tyr Ser Thr Gly Gln Lys Tyr Ile Tyr Thr Gly Ser His
405 410 415
Asp Ser Cys Val Tyr Ile Tyr Asp Leu Val Thr Gly Glu Leu Val Ser
420 425 430
Leu Leu Gln His His Lys Ser Pro Val Arg Asp Cys Ser Trp His Pro
435 440 445
Tyr Tyr Pro Met Leu Val Ser Ser Ser Trp Asp Gly Asp Val Val Lys
450 455 460
Trp Glu Phe Pro Gly Asn Gly Glu Ala Pro Val Pro Ser Thr Lys Lys
465 470 475 480
Arg Ile Arg Arg Arg Gln Phe Asp
485
<210>243
<211>1428
<212>DNA
<213>Prunus armeniaca
<400>243
atgagttaca gaacaaggtt tggaaaagat aatagcgctt gtgatagtgg aaatgctgtt 60
gaaggttctg gttcaagtaa aggacccaat gaagtatcaa atgattttga tcatgaaatt 120
gctcaactca ctaagcatag atcaagaccc catcagcttt tgagccagga catgcctgga 180
aagtcgaggt tactggtttc aacaatgaaa atgttggttg gtcgtgaaag taatcattca 240
ggacgtggga gattctcgtc tgctgacggt tgccatgttt tgagccggta tctgcccatc 300
aatggtcctt ggggggtgga ccagtcaaca agtcctgctt atgtttctca attttcaaat 360
gatggtttgt tttttgttgc tggatttcag ggcggccata ttagaatata taatgttgat 420
aagggatgga aagttcagaa ggacatccta accaaaagct tgagatggac aattactgat 480
acatctctat ctccagatca acgttatctt gtttatgcta gcatgacacc cattgtcaat 540
attgtcaatg ttggatcttc tatgacagag tcacttgcaa atgttacgga aattcatgaa 600
ggtctggatt tttctgttgg tggtgatgag gacgaatttg gaattttctc agttagattt 660
tcaactgatg ggcgagagct tgtagctgca agtagagatg cttcaatata tgtttatgat 720
ctccaagcaa ataaagttaa cctccgaata ccagcacact cgtctgatgt aaacactgta 780
tgctttgcgg atgagactgg acatctcata tattctggca gtgacgataa tctctgtaag 840
gtttgggata gacgctgctt taatcacaaa ggacagccag ctggggtcct gatgggacat 900
cttgaaggtg ttacatttat tgatagtagg ggagatgggc gttacttcat atcaaatggg 960
aaggaccaga ctacccaact ctgggatata agaaagatgt cctctagagc catgtacagc 1020
ccaaggctta gagatcatga ctgggattac agatggatgg agtacccagc tcatgcaaaa 1080
actttgaaac atccaaatga tcagtcactg gctacatata gaggtcatgg agtcctgcgt 1140
actttaattc gctgttacct ttctccagca tatagtactg gacaaaagta catctacact 1200
ggatctagtg atcattgtgt ctatatatat gatctggtga ccggtgctca agttgcgaga 1260
ctcaaccatc acgaaggacc tgtaagagac tgtagttggc atcctctcta tccgatgttg 1320
gtcagctctt cttgggatgg gacgattgcc agatgggaat ttcctgggga tgaccaagta 1380
cccaccctgg agaggccgag agcgcgccgg aaggagaggc tactataa 1428
<210>244
<211>475
<212>PRT
<213>Prunus armeniaca
<400>244
Met Ser Tyr Arg Thr Arg Phe Gly Lys Asp Asn Ser Ala Cys Asp Ser
1 5 10 15
Gly Asn Ala Val Glu Gly Ser Gly Ser Ser Lys Gly Pro Asn Glu Val
20 25 30
Ser Asn Asp Phe Asp His Glu Ile Ala Gln Leu Thr Lys His Arg Ser
35 40 45
Arg Pro His Gln Leu Leu Ser Gln Asp Met Pro Gly Lys Ser Arg Leu
50 55 60
Leu Val Ser Thr Met Lys Met Leu Val Gly Arg Glu Ser Asn His Ser
65 70 75 80
Gly Arg Gly Arg Phe Ser Ser Ala Asp Gly Cys His Val Leu Ser Arg
85 90 95
Tyr Leu Pro Ile Asn Gly Pro Trp Gly Val Asp Gln Ser Thr Ser Pro
100 105 110
Ala Tyr Val Ser Gln Phe Ser Asn Asp Gly Leu Phe Phe Val Ala Gly
115 120 125
Phe Gln Gly Gly His Ile Arg Ile Tyr Asn Val Asp Lys Gly Trp Lys
130 135 140
Val Gln Lys Asp Ile Leu Thr Lys Ser Leu Arg Trp Thr Ile Thr Asp
145 150 155 160
Thr Ser Leu Ser Pro Asp Gln Arg Tyr Leu Val Tyr Ala Ser Met Thr
165 170 175
Pro Ile Val Asn Ile Val Asn Val Gly Ser Ser Met Thr Glu Ser Leu
180 185 190
Ala Asn Val Thr Glu Ile His Glu Gly Leu Asp Phe Ser Val Gly Gly
195 200 205
Asp Glu Asp Glu Phe Gly Ile Phe Ser Val Arg Phe Ser Thr Asp Gly
210 215 220
Arg Glu Leu Val Ala Ala Ser Arg Asp Ala Ser Ile Tyr Val Tyr Asp
225 230 235 240
Leu Gln Ala Asn Lys Val Asn Leu Arg Ile Pro Ala His Ser Ser Asp
245 250 255
Val Asn Thr Val Cys Phe Ala Asp Glu Thr Gly His Leu Ile Tyr Ser
260 265 270
Gly Ser Asp Asp Asn Leu Cys Lys Val Trp Asp Arg Arg Cys Phe Asn
275 280 285
His Lys Gly Gln Pro Ala Gly Val Leu Met Gly His Leu Glu Gly Val
290 295 300
Thr Phe Ile Asp Ser Arg Gly Asp Gly Arg Tyr Phe Ile Ser Asn Gly
305 310 315 320
Lys Asp Gln Thr Thr Gln Leu Trp Asp Ile Arg Lys Met Ser Ser Arg
325 330 335
Ala Met Tyr Ser Pro Arg Leu Arg Asp His Asp Trp Asp Tyr Arg Trp
340 345 350
Met Glu Tyr Pro Ala His Ala Lys Thr Leu Lys His Pro Asn Asp Gln
355 360 365
Ser Leu Ala Thr Tyr Arg Gly His Gly Val Leu Arg Thr Leu Ile Arg
370 375 380
Cys Tyr Leu Ser Pro Ala Tyr Ser Thr Gly Gln Lys Tyr Ile Tyr Thr
385 390 395 400
Gly Ser Ser Asp His Cys Val Tyr Ile Tyr Asp Leu Val Thr Gly Ala
405 410 415
Gln Val Ala Arg Leu Asn His His Glu Gly Pro Val Arg Asp Cys Ser
420 425 430
Trp His Pro Leu Tyr Pro Met Leu Val Ser Ser Ser Trp Asp Gly Thr
435 440 445
Ile Ala Arg Trp Glu Phe Pro Gly Asp Asp Gln Val Pro Thr Leu Glu
450 455 460
Arg Pro Arg Ala Arg Arg Lys Glu Arg Leu Leu
465 470 475
<210>245
<211>1437
<212>DNA
<213>Saccharum officinarum
<400>245
atgcgcggag tgcggcggag cgcgcgcggg gaatcgtccc ggaaggcggc ggcggaccgg 60
gaccgggagg tggagcggtt cacgctgtgc gccaagatgt cccacctcac caggaccacg 120
tcggagccgt gccgcagggc tcgcggcgcc gctccggcgc tccggaagag gcccttctcg 180
gcgttcgagc tggtgtcggc gagggaggcc ggccgcgcgg gcggcgccgg gttctccgcg 240
gccgaccgag cctacgtcgg caggcagcac atccccacca aggggccctg gggcgtcgac 300
gacgtggaca gcgaggccta cgtctcgcag ttctccgctg atggctcctt gctcatcgct 360
gggtttcggg gaagccgcat cagagtctac gacgccgaga aagggtggaa gattcataag 420
gatataagct gccaaatggt gcactggacg gtttcagaca ttgctctctc acctgaccaa 480
cgattccttg cctatgcaag tttgtcgcct actgttcaca ttgtgaacgt gcagagtgct 540
ggaaaggaat cacatgctaa tattactgaa attcatgagg gactggattt aactggtggt 600
gatgaggatg aggactttgg aatattttct gttaaattct caaaagatgg taaagaaatt 660
gttgttggga acaatgaaag atcaatatat gtttatgacc ttgcaacaaa taaagtgtca 720
gcccgcatcc gtgctcataa agctgatgtc aatgctgtta ccttcgctga tgaaagtgga 780
aatgtgttgt actctggaag tgatgatagt ttctgtaagg tgtgggacag acgttgcctt 840
tcaggggaaa agtcagcagg tactttaaca ggtcatttag atggagttac atttatcgat 900
agccgtggtg atgggcgtta tttcatctcc aattgcaagg atcagagaat caaactttgg 960
gacatcagga aaatgtcttc cgtcgtgaga gctcgcccag tgagtctagt ggactgggac 1020
tataggtggg agctatttcc atcagaagcc cacaatttta agcatccaga tgatcagtct 1080
gtggccacat acagaggcca ttcagttctg cgaacactta tccgttgcta tttctcccct 1140
gtgcacagca cgggtcagag gtacatatac acaggatcca gtgacaagtc tgtacatatt 1200
tatgatgtgg taagcgggaa gactgtcaag aggctttctt ggcatggctc gatcatcaga 1260
gactgcacct ggcatccata ctacccaacg ctcgtcagct cctcctggga cggctatgtt 1320
gcccgctggg aggcatcagg cgacgacgac gacccttcag tgctcgtcca cgacgagaag 1380
agggcaaccc gttactttcg gagatacgcc aatcccttca cagatccctt catgtga 1437
<210>246
<211>478
<212>PRT
<213>Saccharum officinarum
<400>246
Met Arg Gly Val Arg Arg Ser Ala Arg Gly Glu Ser Ser Arg Lys Ala
1 5 10 15
Ala Ala Asp Arg Asp Arg Glu Val Glu Arg Phe Thr Leu Cys Ala Lys
20 25 30
Met Ser His Leu Thr Arg Thr Thr Ser Glu Pro Cys Arg Arg Ala Arg
35 40 45
Gly Ala Ala Pro Ala Leu Arg Lys Arg Pro Phe Ser Ala Phe Glu Leu
50 55 60
Val Ser Ala Arg Glu Ala Gly Arg Ala Gly Gly Ala Gly Phe Ser Ala
65 70 75 80
Ala Asp Arg Ala Tyr Val Gly Arg Gln His Ile Pro Thr Lys Gly Pro
85 90 95
Trp Gly Val Asp Asp Val Asp Ser Glu Ala Tyr Val Ser Gln Phe Ser
100 105 110
Ala Asp Gly Ser Leu Leu Ile Ala Gly Phe Arg Gly Ser Arg Ile Arg
115 120 125
Val Tyr Asp Ala Glu Lys Gly Trp Lys Ile His Lys Asp Ile Ser Cys
130 135 140
Gln Met Val His Trp Thr Val Ser Asp Ile Ala Leu Ser Pro Asp Gln
145 150 155 160
Arg Phe Leu Ala Tyr Ala Ser Leu Ser Pro Thr Val His Ile Val Asn
165 170 175
Val Gln Ser Ala Gly Lys Glu Ser His Ala Asn Ile Thr Glu Ile His
180 185 190
Glu Gly Leu Asp Leu Thr Gly Gly Asp Glu Asp Glu Asp Phe Gly Ile
195 200 205
Phe Ser Val Lys Phe Ser Lys Asp Gly Lys Glu Ile Val Val Gly Asn
210 215 220
Asn Glu Arg Ser Ile Tyr Val Tyr Asp Leu Ala Thr Asn Lys Val Ser
225 230 235 240
Ala Arg Ile Arg Ala His Lys Ala Asp Val Asn Ala Val Thr Phe Ala
245 250 255
Asp Glu Ser Gly Asn Val Leu Tyr Ser Gly Ser Asp Asp Ser Phe Cys
260 265 270
Lys Val Trp Asp Arg Arg Cys Leu Ser Gly Glu Lys Ser Ala Gly Thr
275 280 285
Leu Thr Gly His Leu Asp Gly Val Thr Phe Ile Asp Ser Arg Gly Asp
290 295 300
Gly Arg Tyr Phe Ile Ser Asn Cys Lys Asp Gln Arg Ile Lys Leu Trp
305 310 315 320
Asp Ile Arg Lys Met Ser Ser Val Val Arg Ala Arg Pro Val Ser Leu
325 330 335
Val Asp Trp Asp Tyr Arg Trp Glu Leu Phe Pro Ser Glu Ala His Asn
340 345 350
Phe Lys His Pro Asp Asp Gln Ser Val Ala Thr Tyr Arg Gly His Ser
355 360 365
Val Leu Arg Thr Leu Ile Arg Cys Tyr Phe Ser Pro Val His Ser Thr
370 375 380
Gly Gln Arg Tyr Ile Tyr Thr Gly Ser Ser Asp Lys Ser Val His Ile
385 390 395 400
Tyr Asp Val Val Ser Gly Lys Thr Val Lys Arg Leu Ser Trp His Gly
405 410 415
Ser Ile Ile Arg Asp Cys Thr Trp His Pro Tyr Tyr Pro Thr Leu Val
420 425 430
Ser Ser Ser Trp Asp Gly Tyr Val Ala Arg Trp Glu Ala Ser Gly Asp
435 440 445
Asp Asp Asp Pro Ser Val Leu Val His Asp Glu Lys Arg Ala Thr Arg
450 455 460
Tyr Phe Arg Arg Tyr Ala Asn Pro Phe Thr Asp Pro Phe Met
465 470 475
<210>247
<211>1380
<212>DNA
<213>Triticum aestivum
<400>247
atgggttatg gcatgagtag gctacatgag ggatacagtg agcctgaagg gctgaatagt 60
gatggatcta gttcagtcga agtgaataat gatttctcaa agttacacaa tgatattttc 120
catatgactc gactaagatc aggacctagt gaaagcatcc gcaagtccat ggatagagtc 180
tcagtgacta ggttgttacg tggaagggaa gttaattctt caggaaatgg aaagttctct 240
ccggttgatc gtgcattcgt tcttggtcat tatcttccag tggatggtcc tgaaacggtg 300
gatacaatgg attcacgagc ttatgtttca cagttttctg ccgatggttc tctttttgtt 360
gctggttttc agggaagcca cataagaata tatgatgtcg ataaaggttg ggaaatacac 420
aaggacattc atgctagaag tttgagatgg acaattagtg atgcagcgtt atcacctgat 480
caaaggttcc ttgtctactc tagtctggca cccattatcc atattgtcaa tgttggcact 540
gcttcaagag aatcatatgc taatgtcact gacatccatg atggattaga tttttcagag 600
catgaagatg ttagatattc atttggactc ttttctgtta aattttccac tgatgggcgg 660
gagcttgttg ctggcagtaa tgatgattca atatatgtct atgaccttca gaccaacaaa 720
gtgacattgc gtttgcctgc ccatacatct gatgtcaaca cagtagcatt tgctgatgaa 780
tctggtaacc tactttattc tggaagcgat gataacttgt gcaaggtctg ggacagacgt 840
tgtttgtcca caggggaagc ggctggggtt ttgactggac atctgcatgg cattactcat 900
attgacagcc gtggagatgg tcgatgtttc atatcaaatg gaaaagatca agctattaag 960
atgtgggaca tccggaaaat gacatccaat gctgatggtt ccgaaaacag agtccctgcc 1020
tgggactaca gatattcaag atatccacaa cagtacaagc aacaaaagca tccacatgac 1080
cagtcagtag ctacataccg gggccattca gttctccgta cattgattcg ttgctatttt 1140
tctcccacat atagcacagg acagaagtac atatatacag gatcttatga tgctagtgtc 1200
tgcatctatg atgtggtaag tgggtcgcaa gttgccaaac tgcaaggaca tcatcatttg 1260
gcagttcgag actgcagctg gcatccgtcc gatccaatgc ttgtcagttc atcatgggac 1320
ggccaggttg ccagatggtc caggactcgc tccaagcaag atacttgtga actcgattaa 1380
<210>248
<211>459
<212>PRT
<213>Triticum aestivum
<400>248
Met Gly Tyr Gly Met Ser Arg Leu His Glu Gly Tyr Ser Glu Pro Glu
1 5 10 15
Gly Leu Asn Ser Asp Gly Ser Ser Ser Val Glu Val Asn Asn Asp Phe
20 25 30
Ser Lys Leu His Asn Asp Ile Phe His Met Thr Arg Leu Arg Ser Gly
35 40 45
Pro Ser Glu Ser Ile Arg Lys Ser Met Asp Arg Val Ser Val Thr Arg
50 55 60
Leu Leu Arg Gly Arg Glu Val Asn Ser Ser Gly Asn Gly Lys Phe Ser
65 70 75 80
Pro Val Asp Arg Ala Phe Val Leu Gly His Tyr Leu Pro Val Asp Gly
85 90 95
Pro Glu Thr Val Asp Thr Met Asp Ser Arg Ala Tyr Val Ser Gln Phe
100 105 110
Ser Ala Asp Gly Ser Leu Phe Val Ala Gly Phe Gln Gly Ser His Ile
115 120 125
Arg Ile Tyr Asp Val Asp Lys Gly Trp Glu Ile His Lys Asp Ile His
130 135 140
Ala Arg Ser Leu Arg Trp Thr Ile Ser Asp Ala Ala Leu Ser Pro Asp
145 150 155 160
Gln Arg Phe Leu Val Tyr Ser Ser Leu Ala Pro Ile Ile His Ile Val
165 170 175
Asn Val Gly Thr Ala Ser Arg Glu Ser Tyr Ala Asn Val Thr Asp Ile
180 185 190
His Asp Gly Leu Asp Phe Ser Glu His Glu Asp Val Arg Tyr Ser Phe
195 200 205
Gly Leu Phe Ser Val Lys Phe Ser Thr Asp Gly Arg Glu Leu Val Ala
210 215 220
Gly Ser Asn Asp Asp Ser Ile Tyr Val Tyr Asp Leu Gln Thr Asn Lys
225 230 235 240
Val Thr Leu Arg Leu Pro Ala His Thr Ser Asp Val Asn Thr Val Ala
245 250 255
Phe Ala Asp Glu Ser Gly Asn Leu Leu Tyr Ser Gly Ser Asp Asp Asn
260 265 270
Leu Cys Lys Val Trp Asp Arg Arg Cys Leu Ser Thr Gly Glu Ala Ala
275 280 285
Gly Val Leu Thr Gly His Leu His Gly Ile Thr His Ile Asp Ser Arg
290 295 300
Gly Asp Gly Arg Cys Phe Ile Ser Asn Gly Lys Asp Gln Ala Ile Lys
305 310 315 320
Met Trp Asp Ile Arg Lys Met Thr Ser Asn Ala Asp Gly Ser Glu Asn
325 330 335
Arg Val Pro Ala Trp Asp Tyr Arg Tyr Ser Arg Tyr Pro Gln Gln Tyr
340 345 350
Lys Gln Gln Lys His Pro His Asp Gln Ser Val Ala Thr Tyr Arg Gly
355 360 365
His Ser Val Leu Arg Thr Leu Ile Arg Cys Tyr Phe Ser Pro Thr Tyr
370 375 380
Ser Thr Gly Gln Lys Tyr Ile Tyr Thr Gly Ser Tyr Asp Ala Ser Val
385 390 395 400
Cys Ile Tyr Asp Val Val Ser Gly Ser Gln Val Ala Lys Leu Gln Gly
405 410 415
His His His Leu Ala Val Arg Asp Cys Ser Trp His Pro Ser Asp Pro
420 425 430
Met Leu Val Ser Ser Ser Trp Asp Gly Gln Val Ala Arg Trp Ser Arg
435 440 445
Thr Arg Ser Lys Gln Asp Thr Cys Glu Leu Asp
450 455
<210>249
<211>1416
<212>DNA
<213>Triticum aestivum
<400>249
atggcggcgg cagggagact gcggggacgg cggcggacaa aggaggtgga gcgcgagccc 60
gagccgttca ccatcgagga ggaggtgtcc cacctcaccc gggtccggtc ggagccgtgc 120
cccggcaccc gcgccgccat ccatggcgcc aagcggaaga gggacgtctc ggctttcgag 180
atgctgtcgt cgagggagtc cggcctctcg ggaggcggcg ggttctgttc ggccgaccgc 240
gcctacgccg ccgggaagca cctcccgtcg gaaggaccct ggtgcgtgga agacatggat 300
agcgaggcct atgtctcgca gttctccagc gatggctcga tgctcgttgc cgggtttcgg 360
ggaagccgca tcagagttta cgatgtcgat agagggtgga aggttcataa gaacataagc 420
tgcagaagta tgaggtggac ggtttcagat attgcgctct cccctgacca gcgatatctt 480
gcctattcca gtttgtcgcc tattgttcac attgttaatg tgcagaatgc tggaagggaa 540
tcggatgcta atgttactga aattcacgat ggtttggaat tctgtgatga cgatgaatac 600
tctttcggga tattctctgt gaaattttcg aaagatggta gagaagtagt tgttgggaac 660
aatgattgtt caatatatgt ctatgatctt ggagcaaata aagtgtcaga ccgtatccgt 720
gctcatacgt ctgatgtcaa cacggtcacc tttgctgatg aaagtggcaa tttattgtac 780
tctggaagtg atgataatct ctgtaaggtc tgggataggc gttgccttgt aagagagaaa 840
ccagcaggtg ttttgacagg tcacttagat gggattacat gtattgatag ccgtggtgat 900
gggcgttatc taatctccaa ctgcaaggat cagactatca aactttggga catcagaaag 960
atgtccgcca ccgtaaaagg acgacaacca agattgtatg actgggacta cagatggatg 1020
tcgttcccgt cacacgctag atattataag catccaaatg atctatctct ggcaacatac 1080
aggggtcatt cagttctgcg gacacttatc cgctgctact tctctccaat gcacagcacg 1140
ggccagaggt acatatacac tggatcaagt gacgattcag tgcatattta cgatgtggta 1200
acaggggcga ccgtcaagaa gctctcgtgg cacggttcga tcatcagaga ctgcacctgg 1260
catccttacc gtccaacgct cgtaagctct tcctgggacg gctatctggc ccggtgggag 1320
gcatcaggca acaatgagga cccctcggtg ctcacgtgcg acgagcagag gactagccct 1380
tacgaccaga catacgggct ctcttttgcc ctgtag 1416
<210>250
<211>471
<212>PRT
<213>Triticum aestivum
<400>250
Met Ala Ala Ala Gly Arg Leu Arg Gly Arg Arg Arg Thr Lys Glu Val
1 5 10 15
Glu Arg Glu Pro Glu Pro Phe Thr Ile Glu Glu Glu Val Ser His Leu
20 25 30
Thr Arg Val Arg Ser Glu Pro Cys Pro Gly Thr Arg Ala Ala Ile His
35 40 45
Gly Ala Lys Arg Lys Arg Asp Val Ser Ala Phe Glu Met Leu Ser Ser
50 55 60
Arg Glu Ser Gly Leu Ser Gly Gly Gly Gly Phe Cys Ser Ala Asp Arg
65 70 75 80
Ala Tyr Ala Ala Gly Lys His Leu Pro Ser Glu Gly Pro Trp Cys Val
85 90 95
Glu Asp Met Asp Ser Glu Ala Tyr Val Ser Gln Phe Ser Ser Asp Gly
100 105 110
Ser Met Leu Val Ala Gly Phe Arg Gly Ser Arg Ile Arg Val Tyr Asp
115 120 125
Val Asp Arg Gly Trp Lys Val His Lys Asn Ile Ser Cys Arg Ser Met
130 135 140
Arg Trp Thr Val Ser Asp Ile Ala Leu Ser Pro Asp Gln Arg Tyr Leu
145 150 155 160
Ala Tyr Ser Ser Leu Ser Pro Ile Val His Ile Val Asn Val Gln Asn
165 170 175
Ala Gly Arg Glu Ser Asp Ala Asn Val Thr Glu Ile His Asp Gly Leu
180 185 190
Glu Phe Cys Asp Asp Asp Glu Tyr Ser Phe Gly Ile Phe Ser Val Lys
195 200 205
Phe Ser Lys Asp Gly Arg Glu Val Val Val Gly Asn Asn Asp Cys Ser
210 215 220
Ile Tyr Val Tyr Asp Leu Gly Ala Asn Lys Val Ser Asp Arg Ile Arg
225 230 235 240
Ala His Thr Ser Asp Val Asn Thr Val Thr Phe Ala Asp Glu Ser Gly
245 250 255
Asn Leu Leu Tyr Ser Gly Ser Asp Asp Asn Leu Cys Lys Val Trp Asp
260 265 270
Arg Arg Cys Leu Val Arg Glu Lys Pro Ala Gly Val Leu Thr Gly His
275 280 285
Leu Asp Gly Ile Thr Cys Ile Asp Ser Arg Gly Asp Gly Arg Tyr Leu
290 295 300
Ile Ser Asn Cys Lys Asp Gln Thr Ile Lys Leu Trp Asp Ile Arg Lys
305 310 315 320
Met Ser Ala Thr Val Lys Gly Arg Gln Pro Arg Leu Tyr Asp Trp Asp
325 330 335
Tyr Arg Trp Met Ser Phe Pro Ser His Ala Arg Tyr Tyr Lys His Pro
340 345 350
Asn Asp Leu Ser Leu Ala Thr Tyr Arg Gly His Ser Val Leu Arg Thr
355 360 365
Leu Ile Arg Cys Tyr Phe Ser Pro Met His Ser Thr Gly Gln Arg Tyr
370 375 380
Ile Tyr Thr Gly Ser Ser Asp Asp Ser Val His Ile Tyr Asp Val Val
385 390 395 400
Thr Gly Ala Thr Val Lys Lys Leu Ser Trp His Gly Ser Ile Ile Arg
405 410 415
Asp Cys Thr Trp His Pro Tyr Arg Pro Thr Leu Val Ser Ser Ser Trp
420 425 430
Asp Gly Tyr Leu Ala Arg Trp Glu Ala Ser Gly Asn Asn Glu Asp Pro
435 440 445
Ser Val Leu Thr Cys Asp Glu Gln Arg Thr Ser Pro Tyr Asp Gln Thr
450 455 460
Tyr Gly Leu Ser Phe Ala Leu
465 470
<210>251
<211>1461
<212>DNA
<213>Vitis vinifera
<400>251
atgtatttta cagccagtga gggtgctgct aatgaaatgg ggtatgccat gagtagattg 60
gagctagatt ctgatttctg tgatgctggc aaggacatcc atggaaatga taacactgaa 120
agactcaaca aagaattgaa tcatttagat catgaaattt cccagctcac aaagcttaga 180
tcgggacctc acgaatgtct gagtcagatt attcctggaa agcgggactc acctgtttcg 240
acggttaaga tgctggcggg tcgagaaggg aattattcag gaaggggaag gttctcatca 300
gctgattgtt gtcatatgtt aagtagatat ttgcctgtca atggtccttg gcttgtggac 360
caaatgacaa gtcgagctta tgtgtcgcaa ttttctgctg atggttccct gtttgttgca 420
gggtttcagg gaagccatat tagaatatac aatgtggata gagggtggaa agtgcagaag 480
aatattcttg caaaaagctt gcgatggaca gtcactgata catctctctc ccctgatcaa 540
cgccatcttg tttatgccag catgtcacct attgtccata ttgttaatat tggatctgct 600
gcaacagaat ctcttgcaaa cattacggag attcatgatg gtttggattt ttctgctgct 660
gatgatgagg gttattcttt cggaatcttc tcagtgaaat tttccacaga tgggcgagag 720
cttgtagctg gaagtagtga tgattcaata tatgtttatg atcttgaagc aaataagctt 780
tcccttagaa tttcggcaca cacgtctgat gtcaatactg tatgctttgc tgatgaaagt 840
gggcatctta tttattccgg gagtgatgat agtttgtgca aggtttggga cagacgttgc 900
ttcatatcaa aagggaagcc tgcaggagtc ctgatgggac acctagaagg aattactttt 960
attgatagcc gtagagatgg tcgtcatctc atttcaaata gtaaagacca gtctatcaaa 1020
ctttgggaca tccgaaaaat gtcctccaat gctacttgca ctccagggtt caggaattat 1080
gaatgggatt atagatggat ggattatcca acccaggcaa gagagttgaa acacccatgt 1140
gatcaatcac tttccactta taaaggtcat tcagtcctgc gtactctcat acgctgctac 1200
ttctccccgt cttatagcac tggtcagaaa tacatctact ctggatctag cgattcttgc 1260
atttatattt atgatttgct gaccggagcc caagttgcaa cactggagca ccataaatca 1320
gtcgtaagag attgtaattg gcaccctaat tatccaatcc tggttagctc ttcatgggat 1380
ggagacattg tcaagtggga attccctggg aatggagagc cccctttgat caagaaacga 1440
atccggcgga aatatttata g 1461
<210>252
<211>486
<212>PRT
<213>Vitis vinifera
<400>252
Met Tyr Phe Thr Ala Ser Glu Gly Ala Ala Asn Glu Met Gly Tyr Ala
1 5 10 15
Met Ser Arg Leu Glu Leu Asp Ser Asp Phe Cys Asp Ala Gly Lys Asp
20 25 30
Ile His Gly Asn Asp Asn Thr Glu Arg Leu Asn Lys Glu Leu Asn His
35 40 45
Leu Asp His Glu Ile Ser Gln Leu Thr Lys Leu Arg Ser Gly Pro His
50 55 60
Glu Cys Leu Ser Gln Ile Ile Pro Gly Lys Arg Asp Ser Pro Val Ser
65 70 75 80
Thr Val Lys Met Leu Ala Gly Arg Glu Gly Asn Tyr Ser Gly Arg Gly
85 90 95
Arg Phe Ser Ser Ala Asp Cys Cys His Met Leu Ser Arg Tyr Leu Pro
100 105 110
Val Asn Gly Pro Trp Leu Val Asp Gln Met Thr Ser Arg Ala Tyr Val
115 120 125
Ser Gln Phe Ser Ala Asp Gly Ser Leu Phe Val Ala Gly Phe Gln Gly
130 135 140
Ser His Ile Arg Ile Tyr Asn Val Asp Arg Gly Trp Lys Val Gln Lys
145 150 155 160
Asn Ile Leu Ala Lys Ser Leu Arg Trp Thr Val Thr Asp Thr Ser Leu
165 170 175
Ser Pro Asp Gln Arg His Leu Val Tyr Ala Ser Met Ser Pro Ile Val
180 185 190
His Ile Val Asn Ile Gly Ser Ala Ala Thr Glu Ser Leu Ala Asn Ile
195 200 205
Thr Glu Ile His Asp Gly Leu Asp Phe Ser Ala Ala Asp Asp Glu Gly
210 215 220
Tyr Ser Phe Gly Ile Phe Ser Val Lys Phe Ser Thr Asp Gly Arg Glu
225 230 235 240
Leu Val Ala Gly Ser Ser Asp Asp Ser Ile Tyr Val Tyr Asp Leu Glu
245 250 255
Ala Asn Lys Leu Ser Leu Arg Ile Ser Ala His Thr Ser Asp Val Asn
260 265 270
Thr Val Cys Phe Ala Asp Glu Ser Gly His Leu Ile Tyr Ser Gly Ser
275 280 285
Asp Asp Ser Leu Cys Lys Val Trp Asp Arg Arg Cys Phe Ile Ser Lys
290 295 300
Gly Lys Pro Ala Gly Val Leu Met Gly His Leu Glu Gly Ile Thr Phe
305 310 315 320
Ile Asp Ser Arg Arg Asp Gly Arg His Leu Ile Ser Asn Ser Lys Asp
325 330 335
Gln Ser Ile Lys Leu Trp Asp Ile Arg Lys Met Ser Ser Asn Ala Thr
340 345 350
Cys Thr Pro Gly Phe Arg Asn Tyr Glu Trp Asp Tyr Arg Trp Met Asp
355 360 365
Tyr Pro Thr Gln Ala Arg Glu Leu Lys His Pro Cys Asp Gln Ser Leu
370 375 380
Ser Thr Tyr Lys Gly His Ser Val Leu Arg Thr Leu Ile Arg Cys Tyr
385 390 395 400
Phe Ser Pro Ser Tyr Ser Thr Gly Gln Lys Tyr Ile Tyr Ser Gly Ser
405 410 415
Ser Asp Ser Cys Ile Tyr Ile Tyr Asp Leu Leu Thr Gly Ala Gln Val
420 425 430
Ala Thr Leu Glu His His Lys Ser Val Val Arg Asp Cys Asn Trp His
435 440 445
Pro Asn Tyr Pro Ile Leu Val Ser Ser Ser Trp Asp Gly Asp Ile Val
450 455 460
Lys Trp Glu Phe Pro Gly Asn Gly Glu Pro Pro Leu Ile Lys Lys Arg
465 470 475 480
Ile Arg Arg Lys Tyr Leu
485
<210>253
<211>1431
<212>DNA
<213>Zea mays
<400>253
atgcaaggaa ggatgcgagg cgcgcggcgg agcgcgcgcg gggaatcgtc ccggaaggcg 60
gcgggccgcg aggtggagcc attcacgctg tgcggcgaga tgtcccatct caccagggcc 120
acgtcagagc cgtgccgcag ggctcgcggc gccgccttcg cccgccgggc gaggcccttc 180
tcggcgtacg agctggtgtc ggcgcgggag gccggccgcg cgggcggcgc cgggttctcc 240
gcggccgatc gagcctacct cggcaggcag cacatcccca ccaaggggcc gtggggcgtc 300
gacgacgtgg aaagcgaggc ctacgtctcg cagttctccg ccgatggctc gttgctcatc 360
gcggggtttc ggggaagccg catcagagtc tacgacgccg agaaagggtg gaagatccac 420
aaggatataa gctgcagaag tgtgcactgg acggtttcag atattgctct ctcacctgac 480
caacgattcc ttgcctatgc aagtctgaca cctattgttc acattgtgaa tgttcagaat 540
gctggaaagg aatcacatgc taatattact gaaattcatg agggattgga tttaaccggt 600
ggtgatgagg atgaggactt tggaatattt tctgttaaat tctccaaaga tggtaaagaa 660
gttgttgttg ggaacaatga aaagtcaata tatgtttatg acctttcagc aaataaagtg 720
tcagcccgca tccgtgctca taaagctgat gtcaatgctg ttaccttcgc tgatgaaact 780
ggaaacgtgt tgtactctgg aagtgatgat agtctctgta aggtgtggga caggcgttgc 840
ctttcagggg aaaagtcagc aggtattttg acaggtcatt tagatggagt tacatttatc 900
gatagccgtg gtgatgggcg ttatttcatc tctcattgca aggatcagag aatcaaactt 960
tgggacatca ggaaaatgtc ttccgtcgtg agatctcgcc cagtgagtct agtggactgg 1020
gactataggt ggatgccatt tccatcagaa gctcataatc ttaagcatcc aggtgatcag 1080
tctgtggcca catacagagg ccattcagtt ttgcgaacac ttatccgttg ctatttttcc 1140
cctgtgcaca gcacgggtca gaggtacata tacacgggat ccagtgacaa gtctgtacat 1200
atttatgacg tggtaaccgg ggaggctgtc aagaggcttt cgtggcatgg ctccatcatc 1260
agagactgca cctggcatcc atactaccca acgctcgtca gctcctcctg ggacggcttc 1320
gttgcccggt gggaggcctc gggcgacgac gacgaccatt cggtgctcgt tgccgacgag 1380
atgagaggaa gcccttacta ccggagatac ggcgatccct tggtgatgta g 1431
<210>254
<211>460
<212>PRT
<213>Zea mays
<400>254
Met Gly Tyr Gly Met Ser Arg Leu Asp Asp Glu Tyr Tyr Glu Ala Glu
1 5 10 15
Gly Gln Asn Thr Gly Gly Ser Gly Ser Val Gln Val Asn Asp Glu Phe
20 25 30
Ala Thr Leu His Asn Asp Ile Phe His Met Thr Arg Met Arg Ser Gly
35 40 45
Leu Thr Glu Ser Tyr Lys Ser Met Gly Thr Asn Arg Gly Ile Ile Ser
50 55 60
Thr Ala Lys Leu Leu Ser Arg Arg Glu Ile Asp Cys Ser Gly Lys Gly
65 70 75 80
Met Phe Ser Ser Gly Asp Arg Ala Phe Val Leu Gly Arg His Val Pro
85 90 95
Met Asn Asn Pro Glu Leu Leu Asp Arg Met Asp Ser Arg Ala Tyr Val
100 105 110
Ser Gln Phe Ser Ala Asp Gly Thr Leu Phe Val Ala Gly Phe Gln Gly
115 120 125
Ser His Ile Arg Ile Tyr Asp Val Asp Arg Gly Trp Ser Ile His Lys
130 135 140
Asp Ile His Ala Arg Cys Leu Arg Trp Thr Ile Ser Asp Val Ser Leu
145 150 155 160
Ser Pro Asp Gln Arg Tyr Leu Ala Tyr Ser Ser Leu Ala Pro Ile Ile
165 170 175
His Ile Val Asn Val Gly Asn Ala Ala Arg Glu Ser Tyr Ala Asn Val
180 185 190
Thr Asp Ile His Asp Gly Leu Asp Phe Ser Gln His Glu Asp Val Gln
195 200 205
Tyr Ser Phe Gly Ile Phe Ser Val Lys Phe Ser Ser Asp Gly Arg Glu
210 215 220
Leu Val Ala Gly Ser Asn Asp Asp Ser Ile Tyr Val Tyr Asp Leu His
225 230 235 240
Ala Asn Lys Leu Thr Leu Arg Leu Ser Ala His Thr Ser Asp Val Asn
245 250 255
Thr Val Ala Phe Ala Asp Glu Thr Gly His Leu Ile Tyr Ser Gly Ser
260 265 270
Asp Asp Thr Leu Cys Lys Val Trp Asp Arg Arg Cys Leu Ser Ala Gly
275 280 285
Gln Ala Ala Gly Val Leu Thr Gly His Leu His Gly Ile Thr His Ile
290 295 300
Asp Ser Arg Gly Asp Gly Arg Ser Phe Ile Ser Asn Gly Lys Asp Gln
305 310 315 320
Ala Ile Lys Leu Trp Asp Ile Arg Lys Met Met Ser Asn Ala Asp Ser
325 330 335
Cys Ala Asp Gly Ala Pro Ala Trp Asp Tyr Arg Tyr Ser Arg Tyr Pro
340 345 350
His Gln His Lys Gln Leu Lys His Pro His Asp Gln Ser Ile Ala Thr
355 360 365
Tyr Arg Gly His Thr Val Leu Arg Thr Leu Ile Arg Cys Tyr Phe Ser
370 375 380
Pro Ser Tyr Ser Thr Gly Gln Lys Tyr Ile Tyr Thr Gly Ser Tyr Asp
385 390 395 400
Ser Asn Val Cys Ile Tyr Asp Val Val Ser Gly Ser Gln Val Ala Lys
405 410 415
Leu Lys Trp His Gln Met Ala Ile Arg Asp Cys Ser Trp His Pro Phe
420 425 430
Glu Pro Thr Leu Val Ser Ser Ser Trp Asp Gly Arg Val Val Lys Trp
435 440 445
Thr Ser Ala Arg Asp Glu Gly Ala Ser Asp Val Asp
450 455 460
<210>255
<211>1383
<212>DNA
<213>Zea mays
<400>255
atgggttatg gcatgagtag gctagacgat gaatactatg aggctgaagg gcagaatact 60
ggtggatctg gctcagttca agtgaacgat gagtttgcaa cactacataa tgatattttt 120
catatgaccc gaatgagatc aggacttact gaaagctaca agtccatggg taccaacaga 180
ggcataatat caactgccaa gttattatct cgaagggaaa ttgattgttc tggaaagggg 240
atgttctctt ctggtgaccg tgcatttgtt ctaggtcgtc atgttccgat gaacaatcct 300
gaattattgg ataggatgga ttctcgtgct tacgtttcac agttttctgc tgatggtact 360
ctttttgttg ctggttttca gggaagtcac ataagaatat acgatgttga tagaggctgg 420
agcatacaca aagacattca tgctaggtgt ttaagatgga caatcagtga tgtatcctta 480
tcacctgatc agcggtatct tgcctactct agtctggcac ctattatcca tattgtcaat 540
gttgggaatg cggcaagaga atcctatgct aacgtcactg acatccatga tggattggat 600
ttttcacagc atgaagatgt tcaatattca tttggaatat tttctgtaaa attttcctct 660
gatggacggg aacttgttgc tggcagcaat gatgactcaa tatatgtcta tgatcttcat 720
gcaaacaaat tgacattacg tttatctgct catacatctg atgtcaatac agtagcattt 780
gctgatgaaa ccggccatct catatattct ggaagtgacg acactttatg caaggtctgg 840
gacaggcggt gtttgtccgc aggacaggct gccggagttt tgactggaca tttgcatggg 900
ataacacata ttgatagtcg tggagatggc cgaagtttca tatccaatgg aaaagatcaa 960
gcaatcaaac tgtgggatat caggaaaatg atgtccaatg ctgatagttg tgcagatgga 1020
gccccagcct gggactacag atattcaagg tatccgcatc aacataaaca gttaaagcat 1080
ccgcatgatc agtcaatagc gacgtaccgt ggacatacag tcctccggac attgatccgt 1140
tgctactttt ctccttcgta tagcactggg caaaagtaca tatatacagg gtcgtatgat 1200
tccaatgttt gcatctatga tgtggtaagt ggatcccagg ttgcgaagct gaaatggcat 1260
caaatggcga ttcgggactg cagttggcac ccatttgagc ccacgcttgt gagctcatcc 1320
tgggacggcc gggtagtgaa atggaccagc gcacgtgatg aaggggcttc tgatgtcgat 1380
tga 1383
<210>256
<211>476
<212>PRT
<213>Zea mays
<400>256
Met Gln Gly Arg Met Arg Gly Ala Arg Arg Ser Ala Arg Gly Glu Ser
1 5 10 15
Ser Arg Lys Ala Ala Gly Arg Glu Val Glu Pro Phe Thr Leu Cys Gly
20 25 30
Glu Met Ser His Leu Thr Arg Ala Thr Ser Glu Pro Cys Arg Arg Ala
35 40 45
Arg Gly Ala Ala Phe Ala Arg Arg Ala Arg Pro Phe Ser Ala Tyr Glu
50 55 60
Leu Val Ser Ala Arg Glu Ala Gly Arg Ala Gly Gly Ala Gly Phe Ser
65 70 75 80
Ala Ala Asp Arg Ala Tyr Leu Gly Arg Gln His Ile Pro Thr Lys Gly
85 90 95
Pro Trp Gly Val Asp Asp Val Glu Ser Glu Ala Tyr Val Ser Gln Phe
100 105 110
Ser Ala Asp Gly Ser Leu Leu Ile Ala Gly Phe Arg Gly Ser Arg Ile
115 120 125
Arg Val Tyr Asp Ala Glu Lys Gly Trp Lys Ile His Lys Asp Ile Ser
130 135 140
Cys Arg Ser Val His Trp Thr Val Ser Asp Ile Ala Leu Ser Pro Asp
145 150 155 160
Gln Arg Phe Leu Ala Tyr Ala Ser Leu Thr Pro Ile Val His Ile Val
165 170 175
Asn Val Gln Asn Ala Gly Lys Glu Ser His Ala Asn Ile Thr Glu Ile
180 185 190
His Glu Gly Leu Asp Leu Thr Gly Gly Asp Glu Asp Glu Asp Phe Gly
195 200 205
Ile Phe Ser Val Lys Phe Ser Lys Asp Gly Lys Glu Val Val Val Gly
210 215 220
Asn Asn Glu Lys Ser Ile Tyr Val Tyr Asp Leu Ser Ala Asn Lys Val
225 230 235 240
Ser Ala Arg Ile Arg Ala His Lys Ala Asp Val Asn Ala Val Thr Phe
245 250 255
Ala Asp Glu Thr Gly Asn Val Leu Tyr Ser Gly Ser Asp Asp Ser Leu
260 265 270
Cys Lys Val Trp Asp Arg Arg Cys Leu Ser Gly Glu Lys Ser Ala Gly
275 280 285
Ile Leu Thr Gly His Leu Asp Gly Val Thr Phe Ile Asp Ser Arg Gly
290 295 300
Asp Gly Arg Tyr Phe Ile Ser His Cys Lys Asp Gln Arg Ile Lys Leu
305 310 315 320
Trp Asp Ile Arg Lys Met Ser Ser Val Val Arg Ser Arg Pro Val Ser
325 330 335
Leu Val Asp Trp Asp Tyr Arg Trp Met Pro Phe Pro Ser Glu Ala His
340 345 350
Asn Leu Lys His Pro Gly Asp Gln Ser Val Ala Thr Tyr Arg Gly His
355 360 365
Ser Val Leu Arg Thr Leu Ile Arg Cys Tyr Phe Ser Pro Val His Ser
370 375 380
Thr Gly Gln Arg Tyr Ile Tyr Thr Gly Ser Ser Asp Lys Ser Val His
385 390 395 400
Ile Tyr Asp Val Val Thr Gly Glu Ala Val Lys Arg Leu Ser Trp His
405 410 415
Gly Ser Ile Ile Arg Asp Cys Thr Trp His Pro Tyr Tyr Pro Thr Leu
420 425 430
Val Ser Ser Ser Trp Asp Gly Phe Val Ala Arg Trp Glu Ala Ser Gly
435 440 445
Asp Asp Asp Asp His Ser Val Leu Val Ala Asp Glu Met Arg Gly Ser
450 455 460
Pro Tyr Tyr Arg Arg Tyr Gly Asp Pro Leu Val Met
465 470 475
<210>257
<211>1357
<212>DNA
<213>Citrus sinensis
<220>
<221>unsure
<222>(665)..(665)
<223>n can be a stretch of 105 to 135 nucleotides
<220>
<221>misc_feature
<222>(665)..(665)
<223>n is a, c, g, or t
<400>257
atgttcgtaa cagcaagtgg agttgacttt gacgaaatgg gatatgccat gagtagattc 60
gagatagaat ctgaattcta tgatgctgcc gatactgtca atcaagctag taatagtcgt 120
agcaaattca agaaaccctt gagtgcttta gaccatgaaa ttgcccagct cacgaagctg 180
aaatcggaac ccaaggagca ttttagcaaa gaagtacctg ggaagcggca cttgcctgtt 240
tccactgtga aaatgttggc tggtagggaa ggcaattatt caggaagagg gaggttctca 300
gctgcagatt gttgtcatat gcttagtaga tatttgcctg ttaatgggcc ctggcctgtg 360
gaccagacaa ctagccgagc atatgtctct cagttttcgg ctgatggttc tttatttgtt 420
gctggatttc aggctagtca aattagaatc tatgatgtgg agagaggttg gaaaatacag 480
aaagacattc ttgctaaaag tttgcgttgg acagtcacag atacatctct ttccccggat 540
cagcgccatc ttgtttatgc aagcatgtca cctatagtgc acattgttga tgttggttct 600
gggacaatgg agtctcttgc aaatgttacg gagatacatg atggattgga cttttctgct 660
gcagnctctc agaattttgg cacacacgtc tgatgttaac actgtatgtt ttggtgatga 720
aagtggccat ctaatctatt ctgggagtga tgacaatcta tgcaaggtat gggatagacg 780
ctgtctaaat gtgaaaggga agccagcagg agtcctgatg ggacacctag aagggatcac 840
gttcattgat agccgtggag atggtcgtta tctgatctca aatggtaaag atcaggccat 900
caaactttgg gatattcgga aaatgtcctc taatgcatcc tgcaatttag gatttaggag 960
ttatgaatgg gattacagat ggatggacta cccaccccag gcaagagatt tgaaacaccc 1020
atgtgatcaa tccgttgcta cgtataaagg tcattctgtc ttgcgtactc ttatccggtg 1080
ccacttttcc cctgtataca gcactggcca aaagtatatc tacactggat cccatgattc 1140
ttgtgtttat gtttatgacc tggtgagtgg tgagcaagtt gctgcactca agtaccatac 1200
ttcacccgtt agagactgta gttggcaccc aagtcaacca atgcttgtta gctcttcttg 1260
ggatggagat gttgtcaggt gggagtttcc gggtaatgga gaagctgctc ctcctctaaa 1320
caagaagagg atccggagga gacagtttta cttgtga 1357
<210>258
<211>451
<212>PRT
<213>Sweet orange
<220>
<221>unsure
<222>(222)..(222)
<223>X may be a stretch of 35 to 45 amino acids
<400>258
Met Phe Val Thr Ala Ser Gly Val Asp Phe Asp Glu Met Gly Tyr Ala
1 5 10 15
Met Ser Arg Phe Glu Ile Glu Ser Glu Phe Tyr Asp Ala Ala Asp Thr
20 25 30
Val Asn Gln Ala Ser Asn Ser Arg Ser Lys Phe Lys Lys Pro Leu Ser
35 40 45
Ala Leu Asp His Glu Ile Ala Gln Leu Thr Lys Leu Lys Ser Glu Pro
50 55 60
Lys Glu His Phe Ser Lys Glu Val Pro Gly Lys Arg His Leu Pro Val
65 70 75 80
Ser Thr Val Lys Met Leu Ala Gly Arg Glu Gly Asn Tyr Ser Gly Arg
85 90 95
Gly Arg Phe Ser Ala Ala Asp Cys Cys His Met Leu Ser Arg Tyr Leu
100 105 110
Pro Val Asn Gly Pro Trp Pro Val Asp Gln Thr Thr Ser Arg Ala Tyr
115 120 125
Val Ser Gln Phe Ser Ala Asp Gly Ser Leu Phe Val Ala Gly Phe Gln
130 135 140
Ala Ser Gln Ile Arg Ile Tyr Asp Val Glu Arg Gly Trp Lys Ile Gln
145 150 155 160
Lys Asp Ile Leu Ala Lys Ser Leu Arg Trp Thr Val Thr Asp Thr Ser
165 170 175
Leu Ser Pro Asp Gln Arg His Leu Val Tyr Ala Ser Met Ser Pro Ile
180 185 190
Val His Ile Val Asp Val Gly Ser Gly Thr Met Glu Ser Leu Ala Asn
195 200 205
Val Thr Glu Ile His Asp Gly Leu Asp Phe Ser Ala Ala Xaa Leu Arg
210 215 220
Ile Leu Ala His Thr Ser Asp Val Asn Thr Val Cys Phe Gly Asp Glu
225 230 235 240
Ser Gly His Leu Ile Tyr Ser Gly Ser Asp Asp Asn Leu Cys Lys Val
245 250 255
Trp Asp Arg Arg Cys Leu Asn Val Lys Gly Lys Pro Ala Gly Val Leu
260 265 270
Met Gly His Leu Glu Gly Ile Thr Phe Ile Asp Ser Arg Gly Asp Gly
275 280 285
Arg Tyr Leu Ile Ser Asn Gly Lys Asp Gln Ala Ile Lys Leu Trp Asp
290 295 300
Ile Arg Lys Met Ser Ser Asn Ala Ser Cys Asn Leu Gly Phe Arg Ser
305 310 315 320
Tyr Glu Trp Asp Tyr Arg Trp Met Asp Tyr Pro Pro Gln Ala Arg Asp
325 330 335
Leu Lys His Pro Cys Asp Gln Ser Val Ala Thr Tyr Lys Gly His Ser
340 345 350
Val Leu Arg Thr Leu Ile Arg Cys His Phe Ser Pro Val Tyr Ser Thr
355 360 365
Gly Gln Lys Tyr Ile Tyr Thr Gly Ser His Asp Ser Cys Val Tyr Val
370 375 380
Tyr Asp Leu Val Ser Gly Glu Gln Val Ala Ala Leu Lys Tyr His Thr
385 390 395 400
Ser Pro Val Arg Asp Cys Ser Trp His Pro Ser Gln Pro Met Leu Val
405 410 415
Ser Ser Ser Trp Asp Gly Asp Val Val Arg Trp Glu Phe Pro Gly Asn
420 425 430
Gly Glu Ala Ala Pro Pro Leu Asn Lys Lys Arg Ile Arg Arg Arg Gln
435 440 445
Phe Tyr Leu
450
<210>259
<211>1119
<212>DNA
<213>Soybean
<400>259
atgtctagta gggcttatgt ctcccagttt tcagctgatg gctctctttt tattgctggg 60
ttccagggaa gtcacataag aatatacaac gtggacagag gttggaaagt tcagaagaac 120
attctagcta aaaatttgag atggacaatc actgatacat ctctttcacc tgatcaacgc 180
tatctagttt atgccagtat gtcacctatt gtacacattg taaatgccgg atctgctgaa 240
acggagtccc tagcaaatgt tacagagata catgatggtt tggatttttc atcaaatgat 300
gatggaggat actcctttgg aattttctgt gtgaaattct caaaagatgg gaaagaatta 360
gttgcaggaa gtagtggtga ttctatatat gtatacgatc ttgaagcaaa taagctctca 420
cttcgaattt tagctcacac gtgtgatgtg aacactgtat gttttgctga tgaaactagc 480
catcttattt actctgggag tgatgatagt ttctgcaagg tctgggatcg gcgttgcttg 540
attgctaaag gcaagccagc aggggtttta atgggacacc ttgagggcat tacatttatt 600
gatactcgag gagatggacg ctatttcatt tcaaatggta aagatcaaac cattaaactt 660
tgggacatac gcaaaatgtc atccaatgtt accagcaatc ctgggtatag gagttacgaa 720
tgggattaca ggtggatgga ttatccaccc caagcaaaag acttgactca cccttgtgat 780
cagtcagtgg ctacttatag aggccattcg gtcttacgca ctctcatccg ctgctatttc 840
tccccagcgt ttagcacggg ccagaagtac atctatactg gatcacacaa cgcatgtgtt 900
tatatatatg atttggtaag tggagctcaa gtcgcaacgc tgaagcacca taaatcacct 960
gtaagagatt gtagctggca ccccttccac actacacttg ttagctcttc ttgggatgga 1020
gatgttgtga aatgggaatt tgctgggagt ggtgatacac caggctcttc aactaagaag 1080
agggtatgga caagacattt ttatgaacat tacctatga 1119
<210>260
<211>372
<212>PRT
<213>Soybean
<400>260
Met Ser Ser Arg Ala Tyr Val Ser Gln Phe Ser Ala Asp Gly Ser Leu
1 5 10 15
Phe Ile Ala Gly Phe Gln Gly Ser His Ile Arg Ile Tyr Asn Val Asp
20 25 30
Arg Gly Trp Lys Val Gln Lys Asn Ile Leu Ala Lys Asn Leu Arg Trp
35 40 45
Thr Ile Thr Asp Thr Ser Leu Ser Pro Asp Gln Arg Tyr Leu Val Tyr
50 55 60
Ala Ser Met Ser Pro Ile Val His Ile Val Asn Ala Gly Ser Ala Glu
65 70 75 80
Thr Glu Ser Leu Ala Asn Val Thr Glu Ile His Asp Gly Leu Asp Phe
85 90 95
Ser Ser Asn Asp Asp Gly Gly Tyr Ser Phe Gly Ile Phe Cys Val Lys
100 105 110
Phe Ser Lys Asp Gly Lys Glu Leu Val Ala Gly Ser Ser Gly Asp Ser
115 120 125
Ile Tyr Val Tyr Asp Leu Glu Ala Asn Lys Leu Ser Leu Arg Ile Leu
130 135 140
Ala His Thr Cys Asp Val Asn Thr Val Cys Phe Ala Asp Glu Thr Ser
145 150 155 160
His Leu Ile Tyr Ser Gly Ser Asp Asp Ser Phe Cys Lys Val Trp Asp
165 170 175
Arg Arg Cys Leu Ile Ala Lys Gly Lys Pro Ala Gly Val Leu Met Gly
180 185 190
His Leu Glu Gly Ile Thr Phe Ile Asp Thr Arg Gly Asp Gly Arg Tyr
195 200 205
Phe Ile Ser Asn Gly Lys Asp Gln Thr Ile Lys Leu Trp Asp Ile Arg
210 215 220
Lys Met Ser Ser Asn Val Thr Ser Asn Pro Gly Tyr Arg Ser Tyr Glu
225 230 235 240
Trp Asp Tyr Arg Trp Met Asp Tyr Pro Pro Gln Ala Lys Asp Leu Thr
245 250 255
His Pro Cys Asp Gln Ser Val Ala Thr Tyr Arg Gly His Ser Val Leu
260 265 270
Arg Thr Leu Ile Arg Cys Tyr Phe Ser Pro Ala Phe Ser Thr Gly Gln
275 280 285
Lys Tyr Ile Tyr Thr Gly Ser His Asn Ala Cys Val Tyr Ile Tyr Asp
290 295 300
Leu Val Ser Gly Ala Gln Val Ala Thr Leu Lys His His Lys Ser Pro
305 310 315 320
Val Arg Asp Cys Ser Trp His Pro Phe His Thr Thr Leu Val Ser Ser
325 330 335
Ser Trp Asp Gly Asp Val Val Lys Trp Glu Phe Ala Gly Ser Gly Asp
340 345 350
Thr Pro Gly Ser Ser Thr Lys Lys Arg Val Trp Thr Arg His Phe Tyr
355 360 365
Glu His Tyr Leu
370
<210>261
<211>448
<212>DNA
<213>Barley
<400>261
atgggttatg gcatgagtag gctgcatgag ggatacagtg aacatgaagg gcagaatagt 60
gatggatcta gttcagtcga agtgaataat gacttctcaa aattaaataa tgatattttc 120
cacatgactc gactaagatc aggacctagt gaaagcatcc gcaagtccat ggatagagtc 180
tcagtaacta ggttgttacg tggaagggaa gttaactctt caggaaatgg aaagttttct 240
ccggttgatc gtgcattcgt tcttggtcat tatcttccag tggatggtcc tgaaacagtg 300
gacaggatgg attcacgagc ttatgtttca catttttctg ccgatggttc tctttttgtt 360
gctggttttc agggaagcca cataagaata tatgatgtcg ataaaggctg ggaagtacac 420
aaggatattc atgctagaag tttgagat 448
<210>262
<211>149
<212>PRT
<213>Barley
<400>262
Met Gly Tyr Gly Met Ser Arg Leu His Glu Gly Tyr Ser Glu His Glu
1 5 10 15
Gly Gln Asn Ser Asp Gly Ser Ser Ser Val Glu Val Asn Asn Asp Phe
20 25 30
Ser Lys Leu Asn Asn Asp Ile Phe His Met Thr Arg Leu Arg Ser Gly
35 40 45
Pro Ser Glu Ser Ile Arg Lys Ser Met Asp Arg Val Ser Val Thr Arg
50 55 60
Leu Leu Arg Gly Arg Glu Val Asn Ser Ser Gly Asn Gly Lys Phe Ser
65 70 75 80
Pro Val Asp Arg Ala Phe Val Leu Gly His Tyr Leu Pro Val Asp Gly
85 90 95
Pro Glu Thr Val Asp Arg Met Asp Ser Arg Ala Tyr Val Ser His Phe
100 105 110
Ser Ala Asp Gly Ser Leu Phe Val Ala Gly Phe Gln Gly Ser His Ile
115 120 125
Arg Ile Tyr Asp Val Asp Lys Gly Trp Glu Val His Lys Asp Ile His
130 135 140
Ala Arg Ser Leu Arg
145
<210>263
<211>1029
<212>DNA
<213>Barley
<400>263
aactcgtgca tgccttcatg ctggcaagag ccatatctct acattcaagc tgttgtcatc 60
aagagaatcc aatcgctccg gatttggtag attctcttca gccgattgct cttatgctct 120
tcgcaaacac ctaccagtaa gaggcccgtg gtgtgttgat agcatggatt gcgcagcata 180
catctcacaa ttctctttgg atggttctct actaattggg aggcgtatca gaatctataa 240
cgctgacaaa aaatggaaga tccacaagga tataacctgc aaaagtctgc ggtggacagt 300
atcagatatt gctctctcac ctgatcaaca atacctagca tattccagtc tgtcccctac 360
tgttcacata gtaaatgttc agaatgctcc gaagcagtca catgctaata ttacaatgca 420
ggatgttcat gagggtttga atttttctgc tgctgctgat gaatcctcct ttggaatatt 480
ttcaataaag ttttcaaaag atgggcatga acttgttgtt ggaaacagca atgagtcaat 540
atgtatttat gatcttggag caaacaaagt gacagagcga attcatgctc atgtggctga 600
tgttaatgcg gtcacgttcg ctgatgaatc tggtggtgtc ttgtactccg gaagtgatga 660
tagcctctgt aaggtgtggg ataggcgttg ccacaacaga gcgaaaccag taggtgtttt 720
ggcaggtcat ttagatggag ttacatttat tgatagccat ggagacgggc attatttcat 780
ctccaactgc aaggatcaga ctattaaact atgggatatc agaaaattgt cctcggctac 840
gaaggactgc acaccaaaag catacgaatg ggattacaga tggatgacct atccatcaga 900
agcccgattt ttgaagcatc catatgatca atcgctagcc acattcagag gccattcggt 960
gttgcgcaca cttatccgtt gctacttttc cccaatgcac agcacaggtc agaggtatat 1020
atacacagg 1029
<210>264
<211>342
<212>PRT
<213>Barley
<400>264
Thr Arg Ala Cys Leu His Ala Gly Lys Ser His Ile Ser Thr Phe Lys
1 5 10 15
Leu Leu Ser Ser Arg Glu Ser Asn Arg Ser Gly Phe Gly Arg Phe Ser
20 25 30
Ser Ala Asp Cys Ser Tyr Ala Leu Arg Lys His Leu Pro Val Arg Gly
35 40 45
Pro Trp Cys Val Asp Ser Met Asp Cys Ala Ala Tyr Ile Ser Gln Phe
50 55 60
Ser Leu Asp Gly Ser Leu Leu Ile Gly Arg Arg Ile Arg Ile Tyr Asn
65 70 75 80
Ala Asp Lys Lys Trp Lys Ile His Lys Asp Ile Thr Cys Lys Ser Leu
85 90 95
Arg Trp Thr Val Ser Asp Ile Ala Leu Ser Pro Asp Gln Gln Tyr Leu
100 105 110
Ala Tyr Ser Ser Leu Ser Pro Thr Val His Ile Val Asn Val Gln Asn
115 120 125
Ala Pro Lys Gln Ser His Ala Asn Ile Thr Met Gln Asp Val His Glu
130 135 140
Gly Leu Asn Phe Ser Ala Ala Ala Asp Glu Ser Ser Phe Gly Ile Phe
145 150 155 160
Ser Ile Lys Phe Ser Lys Asp Gly His Glu Leu Val Val Gly Asn Ser
165 170 175
Asn Glu Ser Ile Cys Ile Tyr Asp Leu Gly Ala Asn Lys Val Thr Glu
180 185 190
Arg Ile His Ala His Val Ala Asp Val Asn Ala Val Thr Phe Ala Asp
195 200 205
Glu Ser Gly Gly Val Leu Tyr Ser Gly Ser Asp Asp Ser Leu Cys Lys
210 215 220
Val Trp Asp Arg Arg Cys His Asn Arg Ala Lys Pro Val Gly Val Leu
225 230 235 240
Ala Gly His Leu Asp Gly Val Thr Phe Ile Asp Ser His Gly Asp Gly
245 250 255
His Tyr Phe Ile Ser Asn Cys Lys Asp Gln Thr Ile Lys Leu Trp Asp
260 265 270
Ile Arg Lys Leu Ser Ser Ala Thr Lys Asp Cys Thr Pro Lys Ala Tyr
275 280 285
Glu Trp Asp Tyr Arg Trp Met Thr Tyr Pro Ser Glu Ala Arg Phe Leu
290 295 300
Lys His Pro Tyr Asp Gln Ser Leu Ala Thr Phe Arg Gly His Ser Val
305 310 315 320
Leu Arg Thr Leu Ile Arg Cys Tyr Phe Ser Pro Met His Ser Thr Gly
325 330 335
Gln Arg Tyr Ile Tyr Thr
340
<210>265
<211>1226
<212>DNA
<213>Loblolly pine (Pinus taeda)
<220>
<221>unsure
<222>(730)..(730)
<223>n may be a stretch of 315 to 330 nucleotides
<220>
<221>misc_feature
<222>(730)..(730)
<223>n is a, c, g, or t
<400>265
gctcgtagac attgaaggtc cggaagcttt gccaagcaca gagaacagaa ctgcaagcac 60
aggacaaagg gagagaatgg gaaactcaca gtttcaaaat catcattttg acaaagatga 120
taacagtaac agaaatgatg aagtccaaaa tatggatgat gcttctgaag atcccaacaa 180
acttcaccat gaacttgaac atataacaaa gctgaggtca gcaccaaatg gtatgctgtc 240
caggatgaat ggtaagagcc agaatggcta tgtttcaaca ttatctatgt tggcaggcag 300
agaagccaat atttctggga gagggaaatt tagtgtagct gattgctgtc atgttgcaag 360
tagatatttg ccatcttgtg gcccagatat aatggatatg atggacagta gagcatatat 420
tggacaattt tcagcagatg gctccctttt tgtcacagga tttcaggatc atcggataag 480
aatatacaat gtggagaatg gatggacaat tcaaaaggat gtgcttgcta gaaacttgcg 540
ttggactatc actgacactt ctctttcacc tgatcagcgt taccttgtgt atgctactat 600
cactcctatt gtccacattg tgaatgttgg aagcagtgtc agagaatcct tagcaaacgt 660
cacagaaatt catgaagcgc tggatttctc tatgtacgaa gaacattgct catttggaat 720
tttctgcgtn cattactttc atcgatagtc gtggagatgg acgctatttt atttctaatg 780
gaaaagacca gacgacaaag atgtgggata ttcgcaaaat gactgcagga aatcctagca 840
taaagtcaag gagttcctct ggtaatgaat gggattatcg ttggatggag tatccaaaga 900
ataggaagaa tgctaagcat ccatatgacc agtctctgat gacttacagg ggacatgctg 960
tcttgcgtac tcttgtccgc tgctacttct ctccatcctt cagcactggc cagaaataca 1020
tctacacagg atcacacgat ggctgtgttt atatttatga tgtggtaagt gggaacctca 1080
ttaagaaact cgactatcat aggtccactg tgagggactg cagctggcat ccgttctatc 1140
caactcttgt aagctgttca tgggatggag ttgttgccaa atgggaccat tcaacagcaa 1200
gaagtcgcag aagccgcagc ccgtga 1226
<210>266
<211>361
<212>PRT
<213>Loblolly pine
<220>
<221>unsure
<222>(197)..(197)
<223>X may be a stretch of 105 to 110 amino acids
<400>266
Met Gly Asn Ser Gln Phe Gln Asn His His Phe Asp Lys Asp Asp Asn
1 5 10 15
Ser Asn Arg Asn Asp Glu Val Gln Asn Met Asp Asp Ala Ser Glu Asp
20 25 30
Pro Asn Lys Leu His His Glu Leu Glu His Ile Thr Lys Leu Arg Ser
35 40 45
Ala Pro Asn Gly Met Leu Ser Arg Met Asn Gly Lys Ser Gln Asn Gly
50 55 60
Tyr Val Ser Thr Leu Ser Met Leu Ala Gly Arg Glu Ala Asn Ile Ser
65 70 75 80
Gly Arg Gly Lys Phe Ser Val Ala Asp Cys Cys His Val Ala Ser Arg
85 90 95
Tyr Leu Pro Ser Cys Gly Pro Asp Ile Met Asp Met Met Asp Ser Arg
100 105 110
Ala Tyr Ile Gly Gln Phe Ser Ala Asp Gly Ser Leu Phe Val Thr Gly
115 120 125
Phe Gln Asp His Arg Ile Arg Ile Tyr Asn Val Glu Asn Gly Trp Thr
130 135 140
Ile Gln Lys Asp Val Leu Ala Arg Asn Leu Arg Trp Thr Ile Thr Asp
145 150 155 160
Thr Ser Leu Ser Pro Asp Gln Arg Tyr Leu Val Tyr Ala Thr Ile Thr
165 170 175
Pro Ile Val His Ile Val Asn Val Gly Ser Ser Val Arg Glu Ser Leu
180 185 190
Ala Asn Val Thr Xaa Ile Thr Phe Ile Asp Ser Arg Gly Asp Gly Arg
195 200 205
Tyr Phe Ile Ser Asn Gly Lys Asp Gln Thr Thr Lys Met Trp Asp Ile
210 215 220
Arg Lys Met Thr Ala Gly Asn Pro Ser Ile Lys Ser Arg Ser Ser Ser
225 230 235 240
Gly Asn Glu Trp Asp Tyr Arg Trp Met Glu Tyr Pro Lys Asn Arg Lys
245 250 255
Asn Ala Lys His Pro Tyr Asp Gln Ser Leu Met Thr Tyr Arg Gly His
260 265 270
Ala Val Leu Arg Thr Leu Val Arg Cys Tyr Phe Ser Pro Ser Phe Ser
275 280 285
Thr Gly Gln Lys Tyr Ile Tyr Thr Gly Ser His Asp Gly Cys Val Tyr
290 295 300
Ile Tyr Asp Val Val Ser Gly Asn Leu Ile Lys Lys Leu Asp Tyr His
305 310 315 320
Arg Ser Thr Val Arg Asp Cys Ser Trp His Pro Phe Tyr Pro Thr Leu
325 330 335
Val Ser Cys Ser Trp Asp Gly Val Val Ala Lys Trp Asp His Ser Thr
340 345 350
Ala Arg Ser Arg Arg Ser Arg Ser Pro
355 360
<210>267
<211>1269
<212>DNA
<213>Sugarcane
<220>
<221>unsure
<222>(566)..(566)
<220>
<221>misc_feature
<222>(566)..(566)
<223>n is a, c, g, or t
<400>267
atgggttatg gcatgagtag gctagaagat gaatattatg agcctgaagg gcagaatact 60
gatggatctg gctcagttca agtgaacgat gagtttgcaa aactgcacaa cgatattttt 120
catatgacac gaatgagatc aagacttact gaaaggtaca agtccatgga taccaacaga 180
ggcataatat caacggccaa gttattatct cgaagggaaa ttgattgttc tggaaagggg 240
atgttctctt cttgtgaccg tgcatttgtt ctaggtcgct atgttccaat gaatggccct 300
gaattattgg ataggatgga ttctcgtgct tatgtttcac agttttctgc tgatggtact 360
ctttttggtg ctggttttca gggaagtcac ataagaatat atgatgctga tagaggctgg 420
agcatacaca aagacattca tgctaagtgt ttgagatgga caatcagtga tgtattccta 480
tcacctgatc aacggtatct tgcctactct agtctggcac ctattaatca tattggcaat 540
gtcggaaatg ctggaagaga atcctngtcg acccacgcgt ccggccggca gcaatgatga 600
atcaatatat gtctatgacc ttcaagcaaa caaattgaca ttacgtttac ctgctcatac 660
atctgatgtc aatacagtag catttgctga tgaaaccggc catctcatat attcaggaag 720
tgacgacact ttatgcaagg tctgggacag gcggtgtttg tccacaggac aagctgccgg 780
agttttgact ggacatttgc atggggtaac acacattgat agtcgtggag atggccgaag 840
tttcatatcc aatggaaaag accaagcaat caaactgtgg gatgtcagga aaatgacgtc 900
caatgctgat agttgtgcag acggagcccc aacttgggac tacagatatt caaggtatcc 960
gcagcagcat aaacagttaa agcatccgca tgatcagtca ttagcgacat accgtggaca 1020
ttcagtcctc cggacattga tccgttgcta cttttctcct gcttacagca ctgggcaaaa 1080
gtacatatat acagggtcgt atgattcctg cgttcacatc tatgatgtgg taagtggatc 1140
gcaagttgcg aagctgaaat ggcatcaaat ggcgatacgt gactgcagct ggcacccatt 1200
tgagcccacg cttgtgagct catcctggga cggccatgta gcgaaatgga ccagtgcacg 1260
tgatcaaaa 1269
<210>268
<211>422
<212>PRT
<213>Sugarcane
<220>
<221>unsure
<222>(189)..(189)
<223>Xaa may be a stretch of 30 to 40 amino acids
<400>268
Met Gly Tyr Gly Met Ser Arg Leu Glu Asp Glu Tyr Tyr Glu Pro Glu
1 5 10 15
Gly Gln Asn Thr Asp Gly Ser Gly Ser Val Gln Val Asn Asp Glu Phe
20 25 30
Ala Lys Leu His Asn Asp Ile Phe His Met Thr Arg Met Arg Ser Arg
35 40 45
Leu Thr Glu Arg Tyr Lys Ser Met Asp Thr Asn Arg Gly Ile Ile Ser
50 55 60
Thr Ala Lys Leu Leu Ser Arg Arg Glu Ile Asp Cys Ser Gly Lys Gly
65 70 75 80
Met Phe Ser Ser Cys Asp Arg Ala Phe Val Leu Gly Arg Tyr Val Pro
85 90 95
Met Asn Gly Pro Glu Leu Leu Asp Arg Met Asp Ser Arg Ala Tyr Val
100 105 110
Ser Gln Phe Ser Ala Asp Gly Thr Leu Phe Gly Ala Gly Phe Gln Gly
115 120 125
Ser His Ile Arg Ile Tyr Asp Ala Asp Arg Gly Trp Ser Ile His Lys
130 135 140
Asp Ile His Ala Lys Cys Leu Arg Trp Thr Ile Ser Asp Val Phe Leu
145 150 155 160
Ser Pro Asp Gln Arg Tyr Leu Ala Tyr Ser Ser Leu Ala Pro Ile Asn
165 170 175
His Ile Gly Asn Val Gly Asn Ala Gly Arg Glu Ser Xaa Arg Pro Thr
180 185 190
Arg Pro Ala Gly Ser Asn Asp Glu Ser Ile Tyr Val Tyr Asp Leu Gln
195 200 205
Ala Asn Lys Leu Thr Leu Arg Leu Pro Ala His Thr Ser Asp Val Asn
210 215 220
Thr Val Ala Phe Ala Asp Glu Thr Gly His Leu Ile Tyr Ser Gly Ser
225 230 235 240
Asp Asp Thr Leu Cys Lys Val Trp Asp Arg Arg Cys Leu Ser Thr Gly
245 250 255
Gln Ala Ala Gly Val Leu Thr Gly His Leu His Gly Val Thr His Ile
260 265 270
Asp Ser Arg Gly Asp Gly Arg Ser Phe Ile Ser Asn Gly Lys Asp Gln
275 280 285
Ala Ile Lys Leu Trp Asp Val Arg Lys Met Thr Ser Asn Ala Asp Ser
290 295 300
Cys Ala Asp Gly Ala Pro Thr Trp Asp Tyr Arg Tyr Ser Arg Tyr Pro
305 310 315 320
Gln Gln His Lys Gln Leu Lys His Pro His Asp Gln Ser Leu Ala Thr
325 330 335
Tyr Arg Gly His Ser Val Leu Arg Thr Leu Ile Arg Cys Tyr Phe Ser
340 345 350
Pro Ala Tyr Ser Thr Gly Gln Lys Tyr Ile Tyr Thr Gly Ser Tyr Asp
355 360 365
Ser Cys Val His Ile Tyr Asp Val Val Ser Gly Ser Gln Val Ala Lys
370 375 380
Leu Lys Trp His Gln Met Ala Ile Arg Asp Cys Ser Trp His Pro Phe
385 390 395 400
Glu Pro Thr Leu Val Ser Ser Ser Trp Asp Gly His Val Ala Lys Trp
405 410 415
Thr Ser Ala Arg Asp Gln
420
<210>269
<211>1173
<212>DNA
<213>Sorghum (Sorghum bicolor)
<220>
<221>unsure
<222>(528)..(528)
<223>n may be a stretch of 35 to 165 nucleotides
<220>
<221>misc_feature
<222>(528)..(528)
<223>n is a, c, g, or t
<400>269
atgggttatg gcatgagtag gctagaagat gaatactatg agcctgaagg gcagaatact 60
gatggatctg gctcagttca agtgaacaat gagtttgcag aactgcataa cgatattttt 120
catatgaccc gaatgagatc aagacttact gaaagctaca agtccatggg taccaacaga 180
ggcataatat caacggccaa gttattatct caaagggaaa ttgattgttc tggaaagaag 240
atgttctctt ctggtgaccg tgcatttgtt ctaggtcgct atgttccaat gaacggccct 300
cagttattgg ataggatgga ttctcgtgct tatgtttcgc agttttctgc tgatggcact 360
ctttttgttg ctggttttca gggaagtcac ataagaatat atgatgctga tagaggctgg 420
agcatacaca aagacattca tgctaggtgg ttgagatgga caatcagtga tgtatcctta 480
tcacctgatc agcggtatct tgcctactct agtctggcac ctattgtncg tttacctgct 540
catacatctg atgtcaatac agtagcattt gctgatgaaa ctggccatct catatattct 600
ggaagtgacg acactttatg caaggtctgg gacaggcggt gtttgtccac aggacaagct 660
gccggagttc tgactggaca tttgcatggg gtaacacata ttgatagtcg tggagatggc 720
cgaagtttca tatcgaatgg aaaagaccaa gcaatcaaac tgtgggatgt caggaaaatg 780
acgtccaatg ctgatagttg tgcagacgga gccccaagtt gggactacag atattcgagg 840
tatccgcagc agcataaaca gttaaagcat ccgcatgatc agtcattagc gacataccgt 900
ggacatgcag tcctccggac attgatccgt tgctactttt ctcctgctta cagcactggg 960
caaaagtaca tatatacagg gtcgtatgat tcctgcgttt acatctatga tgtggtaagt 1020
ggatcgcaag ttgcgaagct gaaatggcat caaatggcga ttcgtgactg cagttggcac 1080
ccatttgagc ccacgcttgt gagctcatcc tgggacggcc aggtagcaaa atggaccagt 1140
gcacgtgatc aacaggcttc tgatatagat tga 1173
<210>270
<211>391
<212>PRT
<213>Sorghum
<220>
<221>unsure
<222>(177)..(177)
<223>Xaa may be a stretch of 45 to 55 amino acids
<400>270
Met Gly Tyr Gly Met Ser Arg Leu Glu Asp Glu Tyr Tyr Glu Pro Glu
1 5 10 15
Gly Gln Asn Thr Asp Gly Ser Gly Ser Val Gln Val Asn Asn Glu Phe
20 25 30
Ala Glu Leu His Asn Asp Ile Phe His Met Thr Arg Met Arg Ser Arg
35 40 45
Leu Thr Glu Ser Tyr Lys Ser Met Gly Thr Asn Arg Gly Ile Ile Ser
50 55 60
Thr Ala Lys Leu Leu Ser Gln Arg Glu Ile Asp Cys Ser Gly Lys Lys
65 70 75 80
Met Phe Ser Ser Gly Asp Arg Ala Phe Val Leu Gly Arg Tyr Val Pro
85 90 95
Met Asn Gly Pro Gln Leu Leu Asp Arg Met Asp Ser Arg Ala Tyr Val
100 105 110
Ser Gln Phe Ser Ala Asp Gly Thr Leu Phe Val Ala Gly Phe Gln Gly
115 120 125
Ser His Ile Arg Ile Tyr Asp Ala Asp Arg Gly Trp Ser Ile His Lys
130 135 140
Asp Ile His Ala Arg Trp Leu Arg Trp Thr Ile Ser Asp Val Ser Leu
145 150 155 160
Ser Pro Asp Gln Arg Tyr Leu Ala Tyr Ser Ser Leu Ala Pro Ile Val
165 170 175
Xaa Arg Leu Pro Ala His Thr Ser Asp Val Asn Thr Val Ala Phe Ala
180 185 190
Asp Glu Thr Gly His Leu Ile Tyr Ser Gly Ser Asp Asp Thr Leu Cys
195 200 205
Lys Val Trp Asp Arg Arg Cys Leu Ser Thr Gly Gln Ala Ala Gly Val
210 215 220
Leu Thr Gly His Leu His Gly Val Thr His Ile Asp Ser Arg Gly Asp
225 230 235 240
Gly Arg Ser Phe Ile Ser Asn Gly Lys Asp Gln Ala Ile Lys Leu Trp
245 250 255
Asp Val Arg Lys Met Thr Ser Asn Ala Asp Ser Cys Ala Asp Gly Ala
260 265 270
Pro Ser Trp Asp Tyr Arg Tyr Ser Arg Tyr Pro Gln Gln His Lys Gln
275 280 285
Leu Lys His Pro His Asp Gln Ser Leu Ala Thr Tyr Arg Gly His Ala
290 295 300
Val Leu Arg Thr Leu Ile Arg Cys Tyr Phe Ser Pro Ala Tyr Ser Thr
305 310 315 320
Gly Gln Lys Tyr Ile Tyr Thr Gly Ser Tyr Asp Ser Cys Val Tyr Ile
325 330 335
Tyr Asp Val Val Ser Gly Ser Gln Val Ala Lys Leu Lys Trp His Gln
340 345 350
Met Ala Ile Arg Asp Cys Ser Trp His Pro Phe Glu Pro Thr Leu Val
355 360 365
Ser Ser Ser Trp Asp Gly Gln Val Ala Lys Trp Thr Ser Ala Arg Asp
370 375 380
Gln Gln Ala Ser Asp Ile Asp
385 390
<210>271
<211>336
<212>PRT
<213>Artificial sequence
<220>
<223>Conserved domain comprised in SEQ ID NO:02
<400>271
Tyr Val Ser Gln Phe Ser Thr Asp Gly Ser Leu Phe Ile Ala Gly Phe
1 5 10 15
Gln Gly Ser Arg Ile Arg Ile Tyr Asn Val Glu Lys Gly Trp Lys Val
20 25 30
Gln Lys Asp Ile Leu Ala Lys Ser Leu Arg Trp Thr Val Thr Asp Thr
35 40 45
Ser Leu Ser Pro Asp Gln Arg Asn Leu Val Tyr Ala Ser Met Ser Pro
50 55 60
Ile Val His Ile Val Asp Val Gly Ser Gly Ser Thr Glu Ser His Ala
65 70 75 80
Asn Val Thr Glu Ile His Asp Gly Leu Asp Phe Ser Ser Asp Glu Asp
85 90 95
Gly Gly Tyr Ser Phe Gly Ile Phe Ser Val Lys Phe Ser Thr Asp Gly
100 105 110
Arg Glu Val Val Ala Gly Ser Ser Asp Asp Ser Ile Tyr Val Tyr Asp
115 120 125
Leu Glu Ala Asn Arg Val Ser Leu Arg Thr Val Ala His Thr Ser Asp
130 135 140
Val Asn Thr Val Cys Phe Ala Asp Glu Ser Gly Asn Leu Ile Leu Ser
145 150 155 160
Gly Ser Asp Asp Asn Leu Cys Lys Val Trp Asp Arg Arg Cys Phe Ile
165 170 175
Gly Arg Asp Lys Pro Ala Gly Val Leu Val Gly His Leu Glu Gly Val
180 185 190
Thr Phe Ile Asp Ser Arg Gly Asp Gly Arg Tyr Phe Ile Ser Asn Gly
195 200 205
Lys Asp Gln Thr Ile Lys Leu Trp Asp Ile Arg Lys Met Ser Ser Ser
210 215 220
Ala Pro Ala Arg His Glu Val Leu Arg Asn Tyr Glu Trp Asp Tyr Arg
225 230 235 240
Trp Met Asp Tyr Pro Thr Glu Ala Arg Asp Leu Lys His Pro Leu Asp
245 250 255
Gln Ser Val Ser Thr Tyr Lys Gly His Ser Val Leu Arg Thr Leu Ile
260 265 270
Arg Cys Tyr Phe Ser Pro Ala His Ser Thr Gly Gln Lys Tyr Ile Tyr
275 280 285
Thr Gly Ser Asn Asp Ser Ser Val Tyr Ile Tyr Asp Leu Val Ser Gly
290 295 300
Asp Lys Val Ala Val Leu Lys His His Ser Ser Pro Val Arg Asp Cys
305 310 315 320
Asn Trp His Pro Tyr Tyr Pro Thr Leu Ile Ser Ser Ser Trp Asp Gly
325 330 335
<210>272
<211>2194
<212>DNA
<213>Rice
<400>272
aatccgaaaa gtttctgcac cgttttcacc ccctaactaa caatataggg aacgtgtgct 60
aaatataaaa tgagacctta tatatgtagc gctgataact agaactatgc aagaaaaact 120
catccaccta ctttagtggc aatcgggcta aataaaaaag agtcgctaca ctagtttcgt 180
tttccttagt aattaagtgg gaaaatgaaa tcattattgc ttagaatata cgttcacatc 240
tctgtcatga agttaaatta ttcgaggtag ccataattgt catcaaactc ttcttgaata 300
aaaaaatctt tctagctgaa ctcaatgggt aaagagagag atttttttta aaaaaataga 360
atgaagatat tctgaacgta ttggcaaaga tttaaacata taattatata attttatagt 420
ttgtgcattc gtcatatcgc acatcattaa ggacatgtct tactccatcc caatttttat 480
ttagtaatta aagacaattg acttattttt attatttatc ttttttcgat tagatgcaag 540
gtacttacgc acacactttg tgctcatgtg catgtgtgag tgcacctcct caatacacgt 600
tcaactagca acacatctct aatatcactc gcctatttaa tacatttagg tagcaatatc 660
tgaattcaag cactccacca tcaccagacc acttttaata atatctaaaa tacaaaaaat 720
aattttacag aatagcatga aaagtatgaa acgaactatt taggtttttc acatacaaaa 780
aaaaaaagaa ttttgctcgt gcgcgagcgc caatctccca tattgggcac acaggcaaca 840
acagagtggc tgcccacaga acaacccaca aaaaacgatg atctaacgga ggacagcaag 900
tccgcaacaa ccttttaaca gcaggctttg cggccaggag agaggaggag aggcaaagaa 960
aaccaagcat cctccttctc ccatctataa attcctcccc ccttttcccc tctctatata 1020
ggaggcatcc aagccaagaa gagggagagc accaaggaca cgcgactagc agaagccgag 1080
cgaccgcctt ctcgatccat atcttccggt cgagttcttg gtcgatctct tccctcctcc 1140
acctcctcct cacagggtat gtgcctccct tcggttgttc ttggatttat tgttctaggt 1200
tgtgtagtac gggcgttgat gttaggaaag gggatctgta tctgtgatga ttcctgttct 1260
tggatttggg atagaggggt tcttgatgtt gcatgttatc ggttcggttt gattagtagt 1320
atggttttca atcgtctgga gagctctatg gaaatgaaat ggtttaggga tcggaatctt 1380
gcgattttgt gagtaccttt tgtttgaggt aaaatcagag caccggtgat tttgcttggt 1440
gtaataaagt acggttgttt ggtcctcgat tctggtagtg atgcttctcg atttgacgaa 1500
gctatccttt gtttattccc tattgaacaa aaataatcca actttgaaga cggtcccgtt 1560
gatgagattg aatgattgat tcttaagcct gtccaaaatt tcgcagctgg cttgtttaga 1620
tacagtagtc cccatcacga aattcatgga aacagttata atcctcagga acaggggatt 1680
ccctgttctt ccgatttgct ttagtcccag aatttttttt cccaaatatc ttaaaaagtc 1740
actttctggt tcagttcaat gaattgattg ctacaaataa tgcttttata gcgttatcct 1800
agctgtagtt cagttaatag gtaatacccc tatagtttag tcaggagaag aacttatccg 1860
atttctgatc tccattttta attatatgaa atgaactgta gcataagcag tattcatttg 1920
gattattttt tttattagct ctcacccctt cattattctg agctgaaagt ctggcatgaa 1980
ctgtcctcaa ttttgttttc aaattcacat cgattatcta tgcattatcc tcttgtatct 2040
acctgtagaa gtttcttttt ggttattcct tgactgcttg attacagaaa gaaatttatg 2100
aagctgtaat cgggatagtt atactgcttg ttcttatgat tcatttcctt tgtgcagttc 2160
ttggtgtagc ttgccacttt caccagcaaa gttc 2194
<210>273
<211>1245
<212>DNA
<213>Rice
<400>273
cttgttgttg atctgtgccc ccaagaagaa taacactcta ctcttacttg ttggaaaaaa 60
atagtattag caaccacgca tatgcaaatt ttaatgcagt aataataaga gatggatcga 120
tcgttttcca gctcttgtat atgtgactgg ccctgcttta tgtgtgtagt gttaatttca 180
gctttagcag tacgtgatta gtgatggaca ataattgtcg cagacgtatc tatcaattgc 240
tcctgttgtg tgatgcttta actgttggaa tcaaagttgc gttgcctttg ttgttatgag 300
gaggaatata tatgttgggg caggaaaaga atggaggaga gatcgttctc catatcctta 360
tcatcggcct cgtcactgct cgcagtttaa ctttttggtg atgcgagcga tggtcagcca 420
tatatatact cccatgctgc atgctagtaa tcaatatacg ccttgtaaaa gtaaacgatc 480
gtctagtaat tgcaatatca taggggtagc cattgacaga gatctacata gatagagggg 540
gaacaagaat tgacactcca cagatgctcc actcattcac ctttactaat ttatatcttt 600
tgatgtttga tcgatcgatc gatccgtccg tcggtgtctc gacgaataaa aactgcaaat 660
cgaactgtat gtatataata tagcgtcgta aattaaatta aattaaatcg aactgaatac 720
tacatgtcga agcaagaatt agttcaacta aaagatttag tttttccggt tgcaatatct 780
gtgaaattaa ttgaagaaat taagaagaaa actggagaga tatatatatg gatgagacaa 840
aatgagataa gacgcatgat ggtccctcgg atgatgtcgt ccgttcctta tttccattcc 900
atggcagctg ctatcgctat ctagtgcgcg cggcatctcc aatcccatcc attctagtgg 960
tcgatctagc tactactgag tattgttttt tcttcttttt actactgttg attattctgc 1020
aactgcagtt agatgcttgc tactcctaca tcgatctctc tcgcgcgggc gtatgcattg 1080
cattcactac tgatgatccg tgggtgtagt gtgggtggct ataaataggg cagggtgcgg 1140
ttgccattgc tcctcaggcc agcaactgag aagctccata caagtaagca gcagctagtt 1200
gccgacaagg ccagagaagg aagaagaagc tctcatcatc atcac 1245
<210>274
<211>56
<212>DNA
<213>Artificial sequence
<220>
<223>Primer: prm09100
<400>274
ggggacaagt ttgtacaaaa aagcaggctt aaacaatgtt ttttggacca agtgag 56
<210>275
<211>50
<212>DNA
<213>Artificial sequence
<220>
<223>Primer: prm09101
<400>275
ggggaccact ttgtacaaga aagctgggtt gtgtagagag acgcatcagt 50
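As an illustrative cross-check of the listing (not part of the original filing), the sketch below translates the first ten codons of SEQ ID NO:261 with the standard genetic code and compares the result with the first ten residues of SEQ ID NO:262. The helper names `translate` and `three_to_one` are hypothetical and introduced only for this example; the sketch assumes the standard nuclear genetic code and reading frame 1.

```python
# Standard genetic code, with codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Three-letter residue codes as used in the listing, mapped to one-letter codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
    "Xaa": "X",
}


def translate(dna):
    """Translate DNA in frame 1.

    Whitespace is ignored, a trailing partial codon is dropped, and any
    codon containing a non-acgt symbol (such as 'n') is reported as 'X'.
    """
    seq = "".join(dna.split()).lower()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


def three_to_one(residues):
    """Convert a space-separated three-letter residue string to one-letter codes."""
    return "".join(THREE_TO_ONE[r] for r in residues.split())


# First 30 nucleotides of SEQ ID NO:261 and first 10 residues of SEQ ID NO:262,
# copied from the listing above.
dna_261_start = "atgggttatg gcatgagtag gctgcatgag"
protein_262_start = "Met Gly Tyr Gly Met Ser Arg Leu His Glu"

assert translate(dna_261_start) == three_to_one(protein_262_start)
```

The same arithmetic explains the stated lengths: the 448 nucleotides of SEQ ID NO:261 hold 149 complete codons (447 nucleotides) plus one trailing base, which agrees with the 149 residues of SEQ ID NO:262.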
Claims (20)
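1. A method for enhancing yield-related traits in plants relative to control plants, comprising modulating expression in a plant of a nucleic acid encoding a polypeptide selected from:
a) a PATELLIN polypeptide, and optionally selecting for plants having enhanced yield-related traits, or
b) a PRP38 polypeptide, or
c) an ADA2 polypeptide, and optionally selecting for plants having enhanced yield-related traits, or
d) a GATA-like polypeptide, wherein said GATA-like polypeptide belongs to subfamily II of the GATA transcription factors and comprises a GATA domain, or
e) a WDR23-like polypeptide comprising a domain having at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, 99% or more amino acid sequence identity to the conserved domain (CD) represented by SEQ ID NO:271, and optionally selecting for plants having enhanced yield-related traits.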
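2. The method according to claim 1, wherein said PATELLIN polypeptide comprises at least one of the following domains:
(i) a SEC14 domain as represented by SEQ ID NO:71:
lpeldsvvfyrgadreghpvcynvygefqdkdlyekafgdeekrerflkwriqllergilsqldfspsgicsmvqvtdlknsppmlgkhravtrqavallqdnypefiakkvfinvpwwylaankmmspfltqrtkskfifaspaksaetlfryiapeqvpvqfgglfk,
or a domain having, in increasing order of preference, at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 82%, 85%, 90%, 92%, 95%, 97% or more sequence identity to the domain represented by SEQ ID NO:71 or to any SEC14 domain present in any of the polypeptides of Table A;
(ii) a GOLD domain as represented by SEQ ID NO:72:
sdavteltikpssketveipvtenstigwelrvlgwevsygaeftpdaeggytvivqktrkvpaneepimkgsfkvgepgkivltinnpaskkkkllyrskv,
or a domain having, in increasing order of preference, at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 82%, 85%, 90%, 92%, 95%, 97% or more sequence identity to the domain represented by SEQ ID NO:72 or to any GOLD domain present in any of the polypeptides of Table A.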
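3. The method according to any preceding claim, wherein said modulated expression is effected by introducing and expressing in a plant:
a) a nucleic acid encoding a PATELLIN polypeptide, or
b) a nucleic acid encoding a PRP38 polypeptide, or
c) a nucleic acid encoding a GATA-like polypeptide, or
d) a nucleic acid encoding an ADA2 polypeptide, or
e) a nucleic acid encoding a WDR23-like polypeptide.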
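4. The method according to any preceding claim, wherein said nucleic acid encoding a PATELLIN polypeptide, a PRP38 polypeptide, an ADA2 polypeptide, a GATA-like polypeptide or a WDR23-like polypeptide encodes any one of the proteins listed in Table A, or is a portion of such a nucleic acid, or is a nucleic acid capable of hybridising with such a nucleic acid.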
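5. The method according to any preceding claim, wherein said nucleic acid sequence encodes an orthologue or paralogue of any of the proteins given in Table A.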
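6. The method according to any preceding claim, wherein said enhanced yield-related traits comprise increased yield or thousand kernel weight, preferably increased seed yield, relative to control plants.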
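7. The method according to any one of claims 1 to 6, wherein said enhanced yield-related traits are obtained under conditions of nitrogen deficiency.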
8. The method according to any preceding claim, wherein said nucleic acid encoding the polypeptide is of plant origin.
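9. A plant, or part thereof including seeds, obtained by the method according to any preceding claim, wherein said plant or part thereof comprises a recombinant nucleic acid encoding a PATELLIN polypeptide, a PRP38 polypeptide, an ADA2 polypeptide, a GATA-like polypeptide or a WDR23-like polypeptide.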
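10. An isolated nucleic acid molecule comprising any one of the following features:
(i) a nucleic acid represented by SEQ ID NO:9; SEQ ID NO:11; SEQ ID NO:13; SEQ ID NO:15; SEQ ID NO:17; SEQ ID NO:19; SEQ ID NO:21; SEQ ID NO:23; SEQ ID NO:25 and SEQ ID NO:27;
(ii) a nucleic acid fragment complementary to any one of the SEQ ID NOs given in (i);
(iii) a nucleic acid encoding a PATELLIN polypeptide having, in increasing order of preference, at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or more sequence identity to any one of the amino acid sequences given in SEQ ID NO:10; SEQ ID NO:12; SEQ ID NO:14; SEQ ID NO:16; SEQ ID NO:18; SEQ ID NO:20; SEQ ID NO:22; SEQ ID NO:24; SEQ ID NO:26 and SEQ ID NO:28;
(iv) a nucleic acid capable of hybridising under stringent hybridisation conditions to any one of the nucleic acids given in (i), (ii) or (iii) above.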
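11. An isolated polypeptide comprising:
(i) an amino acid sequence having, in increasing order of preference, at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity to any one of the amino acid sequences given in SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26 and SEQ ID NO:28;
(ii) derivatives of any of the amino acid sequences given in (i).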
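12. A construct comprising:
(i) a nucleic acid encoding a PATELLIN polypeptide, a PRP38 polypeptide, an ADA2 polypeptide, a GATA-like polypeptide or a WDR23-like polypeptide as defined in claim 1, 2 or 11, or a nucleic acid according to claim 11;
(ii) one or more control sequences capable of driving expression of the nucleic acid sequence of (i); and optionally
(iii) a transcription termination sequence.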
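13. Use of a construct according to claim 12 or 13 in a method for making plants having increased yield, particularly increased biomass and/or increased seed yield, relative to control plants.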
14. A plant, plant part or plant cell transformed with a construct according to claim 12 or 13.
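15. A method for the production of a transgenic plant having increased yield, preferably increased seed yield, relative to control plants, comprising:
(i) introducing and expressing in a plant a nucleic acid encoding a PATELLIN polypeptide, a PRP38 polypeptide, an ADA2 polypeptide, a GATA-like polypeptide or a WDR23-like polypeptide as defined in claim 1, 2 or 12, or a nucleic acid according to claim 11; and
(ii) cultivating the plant cell under conditions promoting plant growth and development; and optionally
(iii) selecting for plants having enhanced yield-related traits.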
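16. A transgenic plant having increased yield, particularly increased seed yield, relative to control plants, resulting from modulated expression of a nucleic acid encoding a PATELLIN polypeptide, a PRP38 polypeptide, an ADA2 polypeptide, a GATA-like polypeptide or a WDR23-like polypeptide as defined in claim 1 or 2, or a transgenic plant cell derived from said transgenic plant.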
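17. A transgenic plant according to claim 11, 14 or 16, or a transgenic plant cell derived therefrom, wherein said plant is a crop plant or a monocot or a cereal, such as rice, maize, wheat, barley, millet, rye, triticale, sorghum and oats.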
18. Harvestable parts of a plant according to claim 17, wherein said harvestable parts are preferably shoot biomass and/or seeds.
19. Products derived from a plant according to claim 17 and/or from harvestable parts of a plant according to claim 18.
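20. Use of a nucleic acid encoding a polypeptide in increasing yield, particularly in increasing seed yield and/or shoot biomass, in plants relative to control plants, wherein said polypeptide is a PATELLIN polypeptide, a PRP38 polypeptide, an ADA2 polypeptide, a GATA-like polypeptide or a WDR23-like polypeptide.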
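Several of the claims are expressed as a minimum percentage sequence identity, for example at least 65% to the conserved domain of SEQ ID NO:271 in claim 1(e). A minimal sketch of that arithmetic follows. It assumes the two sequences have already been aligned to equal length, with `-` marking gaps; the alignment step is not shown, and the helper name `percent_identity` is hypothetical. Dividing by the number of columns in which neither sequence has a gap is only one of several conventions in use and is an assumption here, not a definition taken from the claims.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over aligned columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == gap or y == gap:
            continue  # skip columns containing a gap
        compared += 1
        if x.upper() == y.upper():
            matches += 1
    if compared == 0:
        raise ValueError("no gap-free columns to compare")
    return 100.0 * matches / compared


# Residues 1-16 of SEQ ID NO:271 and residues 6-21 of SEQ ID NO:260,
# written in one-letter code; this short window aligns without gaps.
cd_271_window = "YVSQFSTDGSLFIAGF"
soy_260_window = "YVSQFSADGSLFIAGF"

identity = percent_identity(cd_271_window, soy_260_window)
print(f"{identity:.2f}%")  # 15 of 16 positions match: 93.75%
```

The 16-residue window is only an illustration. Whether a given polypeptide meets a claimed threshold would depend on comparing the full 336-residue conserved domain under whatever alignment method and parameters the specification prescribes.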
Applications Claiming Priority (23)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP07121546.1 | 2007-11-26 | ||
EP07121565.1 | 2007-11-26 | ||
EP07121546 | 2007-11-26 | ||
EP07121565 | 2007-11-26 | ||
EP07122488.5 | 2007-12-06 | ||
EP07122488 | 2007-12-06 | ||
EP07122911 | 2007-12-11 | ||
EP07122911.6 | 2007-12-11 | ||
EP07122998.3 | 2007-12-12 | ||
EP07122998 | 2007-12-12 | ||
US1364907P | 2007-12-14 | 2007-12-14 | |
US1364807P | 2007-12-14 | 2007-12-14 | |
US61/013,649 | 2007-12-14 | ||
US61/013,648 | 2007-12-14 | ||
US1461907P | 2007-12-18 | 2007-12-18 | |
US61/014,619 | 2007-12-18 | ||
US1475707P | 2007-12-19 | 2007-12-19 | |
US61/014,757 | 2007-12-19 | ||
US61/027,053 | 2008-02-08 | ||
EP08166636 | 2008-10-15 | ||
EP08166636.4 | 2008-10-15 | ||
US61/106,989 | 2008-10-21 | ||
PCT/EP2008/066237 WO2009068564A1 (en) | 2007-11-26 | 2008-11-26 | Plants having enhanced yield-related traits and a method for making the same |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410223928.6A Division CN103951741A (zh) | Plants having enhanced yield-related traits and a method for making the same | 2007-11-26 | 2008-11-26 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN101883783A (zh) | 2010-11-10 |
Family
ID=40469937
Family Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410223928.6A Pending CN103951741A (zh) | 2007-11-26 | 2008-11-26 | Plants having enhanced yield-related traits and a method for making the same |
CN2008801187499A Pending CN101883783A (zh) | 2007-11-26 | 2008-11-26 | Plants having enhanced yield-related traits and a method for making the same |
Family Applications Before (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410223928.6A Pending CN103951741A (zh) | 2007-11-26 | 2008-11-26 | Plants having enhanced yield-related traits and a method for making the same |
Country Status (10)
Country | Link |
---|---|
US (2) | US8697948B2 (zh) |
EP (5) | EP2574621A1 (zh) |
CN (2) | CN103951741A (zh) |
AR (1) | AR071545A1 (zh) |
AU (1) | AU2008328794B2 (zh) |
CA (1) | CA2706602A1 (zh) |
DE (1) | DE112008003225T5 (zh) |
ES (1) | ES2553652T3 (zh) |
MX (1) | MX2010005697A (zh) |
WO (1) | WO2009068564A1 (zh) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
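CN102586280A (zh) * | 2012-01-13 | 2012-07-18 | Kunming University of Science and Technology | Yunnan red pear ΔPyTTG1 gene, its prokaryotic expression vector, and use thereof |
CN102766201A (zh) * | 2012-06-20 | 2012-11-07 | Institute of Botany, Chinese Academy of Sciences | Use of Arabidopsis thaliana-derived GATA2 protein in regulating rice development |
CN103732055A (zh) * | 2011-06-23 | 2014-04-16 | BASF Plant Science Company GmbH | Plants having enhanced yield-related traits and a method for producing the same |
CN110747202A (zh) * | 2019-11-13 | 2020-02-04 | Kunming University of Science and Technology | Lilium regale WRKY transcription factor gene LrWRKY11 and use thereof |
CN111741969A (zh) * | 2017-11-28 | 2020-10-02 | China Agricultural University | Maize gene KRN2 and use thereof |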
Families Citing this family (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2594228C (en) | 2004-12-21 | 2016-11-01 | Monsanto Technology Llc | Recombinant dna for expression of proteins for imparting enhanced agronomic traits to transgenic plants |
EP2240009A2 (en) * | 2007-12-20 | 2010-10-20 | BASF Plant Science GmbH | Plants having enhanced yield-related traits and a method for making the same |
PE20121693A1 (es) | 2010-01-22 | 2012-12-01 | Bayer Ip Gmbh | Combination of spiromesifen and abamectin as insecticides |
KR101429476B1 (ko) * | 2010-06-25 | 2014-08-22 | BASF Plant Science Company GmbH | Plants having enhanced yield-related traits and a method for making the same |
WO2012153277A1 (en) * | 2011-05-11 | 2012-11-15 | Basf Plant Science Company Gmbh | Plants having enhanced yield-related traits and method for making the same |
CN103717076B (zh) | 2011-08-10 | 2016-04-13 | Bayer Intellectual Property GmbH | Active compound combinations comprising specific tetramic acid derivatives |
AU2013205557B2 (en) | 2012-04-17 | 2016-04-21 | Corteva Agriscience Llc | Synthetic brassica-derived chloroplast transit peptides |
US10392626B1 (en) | 2013-10-09 | 2019-08-27 | Monsanto Technology Llc | Plant regulatory elements and uses thereof |
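CN107526940A (zh) * | 2017-08-07 | 2017-12-29 | Tsinghua University | Correction method and correction device for gene association analysis |
CN109182593A (zh) * | 2018-11-13 | 2019-01-11 | Inner Mongolia University of Science and Technology | Method for determining the role of methylation-site changes in the potato StALDH gene under low-temperature stress |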
US11747018B2 (en) | 2022-01-05 | 2023-09-05 | General Electric Company | Combustor with dilution openings |
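CN114717124B (zh) * | 2022-04-19 | 2024-07-12 | Tianjin University of Science and Technology | Engineered Saccharomyces cerevisiae strain with high ergosterol yield, construction method, and use thereof |
CN116121434B (zh) * | 2022-09-26 | 2023-12-12 | Sichuan Agricultural University | Amino acid site controlling the zinc transport function of wheat ZIP3A, and molecular marker |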
Family Cites Families (47)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4962028A (en) | 1986-07-09 | 1990-10-09 | Dna Plant Technology Corporation | Plant promotors |
US5116742A (en) | 1986-12-03 | 1992-05-26 | University Patents, Inc. | RNA ribozyme restriction endoribonucleases and methods |
US4987071A (en) | 1986-12-03 | 1991-01-22 | University Patents, Inc. | RNA ribozyme polymerases, dephosphorylases, restriction endoribonucleases and methods |
US5004863B2 (en) | 1986-12-03 | 2000-10-17 | Agracetus | Genetic engineering of cotton plants and lines |
WO1989012102A1 (en) | 1988-06-01 | 1989-12-14 | The Texas A&M University System | Method for transforming plants via the shoot apex |
AU4115693A (en) | 1992-04-24 | 1993-11-29 | Sri International | In vivo homologous sequence targeting in eukaryotic cells |
PT652705E (pt) | 1992-06-29 | 2006-05-31 | Gene Shears Pty Ltd | Nucleic acids and methods for their use in combating viral pathogens |
WO1994012015A1 (en) | 1992-11-30 | 1994-06-09 | Chua Nam Hai | Expression motifs that confer tissue- and developmental-specific expression in plants |
WO1995003404A1 (en) | 1993-07-22 | 1995-02-02 | Gene Shears Pty Limited | Dna virus ribozymes |
JP3347736B2 (ja) | 1993-11-19 | 2002-11-20 | Biotechnology Research and Development Corporation | Chimeric regulatory regions and gene cassettes for expressing genes in plants |
NZ278490A (en) | 1993-12-09 | 1998-03-25 | Univ Jefferson | Chimeric polynucleotide with both ribo- and deoxyribonucleotides in one strand and deoxyribonucleotides in a second strand |
US5605793A (en) | 1994-02-17 | 1997-02-25 | Affymax Technologies N.V. | Methods for in vitro recombination |
US6395547B1 (en) | 1994-02-17 | 2002-05-28 | Maxygen, Inc. | Methods for generating polynucleotides having desired characteristics by iterative selection and recombination |
AU718082B2 (en) | 1995-10-06 | 2000-04-06 | Plant Genetic Systems N.V. | Seed shattering |
GB9607517D0 (en) | 1996-04-11 | 1996-06-12 | Gene Shears Pty Ltd | The use of DNA Sequences |
GB9703146D0 (en) | 1997-02-14 | 1997-04-02 | Innes John Centre Innov Ltd | Methods and means for gene silencing in transgenic plants |
GB9710475D0 (en) | 1997-05-21 | 1997-07-16 | Zeneca Ltd | Gene silencing |
GB9720148D0 (en) | 1997-09-22 | 1997-11-26 | Innes John Centre Innov Ltd | Gene silencing materials and methods |
EP2267139B1 (en) | 1998-04-08 | 2017-03-22 | Commonwealth Scientific and Industrial Research Organisation | Methods and means for obtaining modified phenotypes |
FR2779433B1 (fr) * | 1998-06-08 | 2002-08-16 | Centre Nat Rech Scient | Plant protein with repeated WD40 motifs, nucleic acid encoding said protein, and uses thereof |
TR200100705T2 (tr) | 1998-06-26 | 2001-10-22 | Iowa State University Research Foundation, Inc. | Methods for the alteration of enzyme and acetyl in plants. |
US6251668B1 (en) | 1998-07-13 | 2001-06-26 | E. I. Du Pont De Nemours & Company | Transcription coactivators |
US6555732B1 (en) | 1998-09-14 | 2003-04-29 | Pioneer Hi-Bred International, Inc. | Rac-like genes and methods of use |
US7196245B2 (en) * | 2002-09-18 | 2007-03-27 | Mendel Biotechnology, Inc. | Polynucleotides and polypeptides that confer increased biomass and tolerance to cold, water deprivation and low nitrogen to plants |
EP1033405A3 (en) * | 1999-02-25 | 2001-08-01 | Ceres Incorporated | Sequence-determined DNA fragments and corresponding polypeptides encoded thereby |
US20100293669A2 (en) | 1999-05-06 | 2010-11-18 | Jingdong Liu | Nucleic Acid Molecules and Other Molecules Associated with Plants and Uses Thereof for Plant Improvement |
US20040031072A1 (en) * | 1999-05-06 | 2004-02-12 | La Rosa Thomas J. | Soy nucleic acid molecules and other molecules associated with transcription plants and uses thereof for plant improvement |
DE69942750D1 (de) | 1999-07-22 | 2010-10-21 | Nat Inst Agrobio Res | Method for ultra-rapid transformation of monocotyledons |
MXPA02001786A (es) | 1999-08-26 | 2003-07-14 | Basf Plant Science Gmbh | Gene expression in plants under the control of constitutive plant U-ATPase promoters. |
DE60130866T2 (de) * | 2000-01-28 | 2008-07-17 | The Governors Of The University Of Alberta, Edmonton | Root-specific expression of target genes in plants |
MXPA02010404A (es) | 2000-04-19 | 2004-09-06 | Univ Valencia Politecnica | Protection against environmental toxicity through manipulation of the processing of messenger RNA precursors. |
US20110131679A2 (en) * | 2000-04-19 | 2011-06-02 | Thomas La Rosa | Rice Nucleic Acid Molecules and Other Molecules Associated with Plants and Uses Thereof for Plant Improvement |
EP1313867A2 (en) | 2000-08-24 | 2003-05-28 | The Scripps Research Institute | Stress-regulated genes of plants, transgenic plants containing same, and methods of use |
US20050108791A1 (en) * | 2001-12-04 | 2005-05-19 | Edgerton Michael D. | Transgenic plants with improved phenotypes |
US20090100536A1 (en) * | 2001-12-04 | 2009-04-16 | Monsanto Company | Transgenic plants with enhanced agronomic traits |
JP2005185101A (ja) * | 2002-05-30 | 2005-07-14 | National Institute Of Agrobiological Sciences | Full-length plant cDNA and use thereof |
DE602004004070T2 (de) | 2003-01-21 | 2007-08-16 | Cropdesign N.V. | Use of the regulatory sequence of the rice GOS2 gene for gene expression in dicotyledonous plants or plant cells |
CN102586252B (zh) | 2003-02-04 | 2014-09-24 | CropDesign N.V. | Rice promoters |
EP2311964A1 (en) | 2003-04-11 | 2011-04-20 | CropDesign N.V. | Method to increase stress tolerance in plants |
WO2005083094A2 (en) * | 2004-03-01 | 2005-09-09 | Cropdesign N.V. | Plants having increased yield and method for making the same |
ES2403414T3 (es) * | 2004-04-02 | 2013-05-17 | Cropdesign N.V. | Plants overexpressing a NAP1-like protein with increased seed yield, and a method for producing the same |
US20090205063A1 (en) * | 2004-07-14 | 2009-08-13 | Mendel Biotechnology | Plant polynucleotides for improved yield and quality |
EP1841870B1 (en) * | 2005-01-14 | 2013-09-04 | University Of Guelph | Nitrogen-regulated sugar sensing gene and protein and modulation thereof |
US20070250956A1 (en) | 2005-01-14 | 2007-10-25 | University Of Guelph | Nitrogen-Regulated Sugar Sensing Gene and Protein and Modulation Thereof |
ES2390919T3 (es) * | 2006-03-31 | 2012-11-19 | Basf Plant Science Gmbh | Plants having improved yield-related traits and a method for making the same |
AU2007299219A1 (en) * | 2006-04-05 | 2008-03-27 | Metanomics Gmbh | Process for the production of a fine chemical |
WO2009000789A1 (en) | 2007-06-22 | 2008-12-31 | Cropdesign N.V. | Yield enhancement in plants by modulation of maize myb-ada2 gene |
-
2008
- 2008-11-26 CN CN201410223928.6A patent/CN103951741A/zh active Pending
- 2008-11-26 EP EP20120195088 patent/EP2574621A1/en not_active Withdrawn
- 2008-11-26 EP EP12195092A patent/EP2574622A1/en not_active Withdrawn
- 2008-11-26 ES ES12195082.8T patent/ES2553652T3/es active Active
- 2008-11-26 AU AU2008328794A patent/AU2008328794B2/en not_active Ceased
- 2008-11-26 CA CA2706602A patent/CA2706602A1/en not_active Abandoned
- 2008-11-26 CN CN2008801187499A patent/CN101883783A/zh active Pending
- 2008-11-26 EP EP12195097.6A patent/EP2581385A1/en not_active Withdrawn
- 2008-11-26 WO PCT/EP2008/066237 patent/WO2009068564A1/en active Application Filing
- 2008-11-26 MX MX2010005697A patent/MX2010005697A/es active IP Right Grant
- 2008-11-26 US US12/744,503 patent/US8697948B2/en not_active Expired - Fee Related
- 2008-11-26 DE DE112008003225T patent/DE112008003225T5/de not_active Withdrawn
- 2008-11-26 AR ARP080105146A patent/AR071545A1/es not_active Application Discontinuation
- 2008-11-26 EP EP08855025A patent/EP2220111A1/en not_active Withdrawn
- 2008-11-26 EP EP12195082.8A patent/EP2570426B1/en not_active Not-in-force
-
2014
- 2014-02-07 US US14/175,402 patent/US20140237687A1/en not_active Abandoned
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
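CN103732055A (zh) * | 2011-06-23 | 2014-04-16 | BASF Plant Science Company GmbH | Plants having enhanced yield-related traits and a method for producing the same |
CN102586280A (zh) * | 2012-01-13 | 2012-07-18 | Kunming University of Science and Technology | Yunnan red pear ΔPyTTG1 gene, its prokaryotic expression vector, and use thereof |
CN102766201A (zh) * | 2012-06-20 | 2012-11-07 | Institute of Botany, Chinese Academy of Sciences | Use of Arabidopsis thaliana-derived GATA2 protein in regulating rice development |
CN111741969A (zh) * | 2017-11-28 | 2020-10-02 | China Agricultural University | Maize gene KRN2 and use thereof |
CN111741969B (zh) * | 2017-11-28 | 2023-05-12 | China Agricultural University | Maize gene KRN2 and use thereof |
CN110747202A (zh) * | 2019-11-13 | 2020-02-04 | Kunming University of Science and Technology | Lilium regale WRKY transcription factor gene LrWRKY11 and use thereof |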
Also Published As
Publication number | Publication date |
---|---|
EP2574621A1 (en) | 2013-04-03 |
DE112008003225T5 (de) | 2012-06-06 |
EP2220111A1 (en) | 2010-08-25 |
EP2570426A1 (en) | 2013-03-20 |
WO2009068564A8 (en) | 2010-06-24 |
AU2008328794B2 (en) | 2013-11-07 |
US20140237687A1 (en) | 2014-08-21 |
EP2570426B1 (en) | 2015-08-19 |
AR071545A1 (es) | 2010-06-30 |
US20100313299A1 (en) | 2010-12-09 |
CA2706602A1 (en) | 2009-06-04 |
CN103951741A (zh) | 2014-07-30 |
MX2010005697A (es) | 2010-06-11 |
US8697948B2 (en) | 2014-04-15 |
AU2008328794A1 (en) | 2009-06-04 |
EP2574622A1 (en) | 2013-04-03 |
WO2009068564A1 (en) | 2009-06-04 |
EP2581385A1 (en) | 2013-04-17 |
ES2553652T3 (es) | 2015-12-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
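CN101495640B (zh) | | Plants with modulated expression of extensin receptor-like kinase having enhanced yield-related traits and a method for producing the same |
CN101842489B (zh) | | Plants having enhanced yield-related traits and a method for making the same |
KR101662483B1 (ko) | | Plants having enhanced yield-related traits and a method for making the same |
CN101365786B (zh) | | Plants having improved growth characteristics and a method for producing the same |
CN101883783A (zh) | | Plants having enhanced yield-related traits and a method for making the same |
KR101255413B1 (ko) | | Plants having enhanced yield-related traits and a method for making the same |
KR101754083B1 (ko) | | Plants having enhanced yield-related traits and a method for making the same |
CN101952441B (zh) | | Plants having enhanced yield-related traits and a method for making the same |
CN102027120A (zh) | | Plants having enhanced yield-related traits and a method for making the same |
CN101583720A (zh) | | Plants having enhanced yield-related traits and a method for producing the same |
KR20120126061A (ko) | | Plants having enhanced yield-related traits and a method for making the same |
CN101868544A (zh) | | Plants having increased yield-related traits and a method for making the same |
BRPI0718977A2 (pt) | | Method for increasing seed yield in plants relative to control plants, construct, use thereof, plant, plant part or plant cell, method for producing a transgenic plant having increased seed yield relative to control plants, transgenic plant, harvestable parts of a plant, products, and use of a nucleic acid |
CN101605902A (zh) | | Plants having enhanced yield-related traits and/or increased abiotic stress resistance, and a method for making the same |
KR20120034588A (ko) | | Plants having enhanced yield-related traits and a method for making the same |
CN101627125A (zh) | | Plants having enhanced yield-related traits and a method for making the same |
KR101429468B1 (ko) | | Plants having enhanced yield-related traits and a method for making the same |
CN101969759A (zh) | | Plants having enhanced yield-related traits and a method for making the same |
CN101563461A (zh) | | Plants having improved characteristics and a method for making the same |
CN101351556B (zh) | | Plants having improved growth characteristics and a method for making the same |
CN103502456A (zh) | | Plants having enhanced yield-related traits and a method for making the same |
CN101778942A (zh) | | Plants having enhanced yield-related traits and a method for making the same |
CN101595222B (zh) | | Plants having improved seed yield-related traits and a method for making the same |
CN101668859A (zh) | | Plants having enhanced yield-related traits and a method for making the same |
CN101548016A (zh) | | Plants having enhanced yield-related traits and a method for making the same using consensus sequences from the YABBY protein family |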
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| C06 | Publication | |
| PB01 | Publication | |
| C10 | Entry into substantive examination | |
| SE01 | Entry into force of request for substantive examination | |
| C02 | Deemed withdrawal of patent application after publication (patent law 2001) | |
| WD01 | Invention patent application deemed withdrawn after publication | Application publication date: 20101110 |