WO2000075346A1 - Fusion proteins comprising a fragment of a chaperon polypeptide - Google Patents
- Publication number
- WO2000075346A1 (PCT/GB2000/001981)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- fusion protein
- polypeptide
- region
- nucleic acid
- fusion
- Prior art date
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/46—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
- C07K14/47—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
- C07K14/4701—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals not used
- C07K14/4702—Regulators; Modulating activity
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/24—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Enterobacteriaceae (F), e.g. Citrobacter, Serratia, Proteus, Providencia, Morganella, Yersinia
- C07K14/245—Escherichia (G)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P21/00—Preparation of peptides or proteins
- C12P21/02—Preparation of peptides or proteins having a known sequence of two or more amino acids, e.g. glutathione
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
Definitions
- the present invention relates to chaperone polypeptides which are active in the folding and maintenance of structural integrity of other proteins and the use thereof as fusion partners to assist in the expression of polypeptides in expression systems.
- the invention also relates to nucleic acids encoding chaperone polypeptides and fusion proteins as described, vectors comprising these nucleic acids, and host cells modified with the nucleic acids or vectors so as to express the fusion protein(s).
- Chaperones are in general known to be large multisubunit protein assemblies essential in mediating polypeptide chain folding in a variety of cellular compartments. Families of chaperones have been identified; for example, members of the chaperonin hsp60 family, otherwise known as the cpn60 class of proteins, are expressed constitutively, and examples are found in the bacterial cytoplasm (GroEL), in endosymbiotically derived mitochondria (hsp60) and in chloroplasts (Rubisco binding protein). Another chaperone family is designated TF55/TCP1 and is found in the thermophilic archaea and the evolutionarily connected eukaryotic cytosol. A comparison of amino acid sequence data has shown that there is at least 50% sequence identity between chaperones found in prokaryotes, mitochondria and chloroplasts (Ellis R J and Van der Vies S M (1991) Ann Rev Biochem 60: 321-347).
- GroEL is a member of the hsp60 family of heat shock proteins.
- GroEL is a tetradecamer wherein each monomeric subunit (cpn60m) has a molecular weight of approximately 57kD.
- the tetradecamer facilitates the in vitro folding of a number of proteins which would otherwise misfold or aggregate and precipitate.
- the structure of GroEL from E. coli has been established through X-ray crystallographic studies as reported by Braig K et al (1994) Nature 371: 578-586.
- the holo protein is cylindrical, consisting of two seven-membered rings that form a large central cavity which according to Ellis R J and Hartl F U (1996) FASEB Journal 10: 20-26 is generally considered to be essential for activity.
- Some small proteins have been demonstrated to fold from their denatured states when bound to GroEL (Gray T E and Fersht A R (1993) J Mol Biol 232: 1197-1207; Hunt J F et al (1996) Nature 379: 37-45; Weissman J S et al (1996) Cell 84: 481-490; Mayhew M et al (1996) Nature 379: 420-426; Corrales F J and Fersht A R (1995) Proc Nat Acad Sci 92: 5326-5330) and it has been argued that a cage-like structure is necessary to sequester partly folded or assembled proteins (Ellis R J and Hartl F U (1996) supra).
- The entire amino acid sequence of E. coli GroEL is also known (see Braig K et al (1994) supra) and three domains have been ascribed to each cpn60m of the holo chaperonin (tetradecamer). These are the intermediate (amino acid residues 1-5, 134-190, 377-408 and 524-548), equatorial (residues 6-133 and 409-523) and apical (residues 191-376) domains.
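The domain assignment above can be written as a small lookup. This is only an illustrative sketch: the residue ranges are copied from the text as stated, while the names `GROEL_DOMAINS` and `domain_of` are hypothetical and do not come from the specification.

```python
# Domain boundaries exactly as listed in the text (1-based, inclusive).
# The dictionary and function names are illustrative assumptions.
GROEL_DOMAINS = {
    "intermediate": [(1, 5), (134, 190), (377, 408), (524, 548)],
    "equatorial": [(6, 133), (409, 523)],
    "apical": [(191, 376)],
}


def domain_of(residue):
    """Return the domain name for a 1-based GroEL residue number."""
    for name, ranges in GROEL_DOMAINS.items():
        for start, end in ranges:
            if start <= residue <= end:
                return name
    raise ValueError(f"residue {residue} is outside the range 1-548")


if __name__ == "__main__":
    for r in (5, 6, 191, 376, 377):
        print(r, domain_of(r))
```

With these ranges every residue from 1 to 548 falls in exactly one domain, and the preferred fragments discussed later (for example 191-345) lie wholly within the apical range.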
- GroEL facilitates the folding of a number of proteins by two mechanisms: (1) it prevents aggregation by binding to partly folded proteins (Goloubinoff P et al (1989) Nature 342: 884-889; Zahn R and Plückthun A (1992) Biochemistry 31: 3249-3255), which then refold on GroEL to a native-like state (Zahn R and Plückthun A (1992) Biochemistry 31 .
- the equatorial domain has been shown from the 2.4 Å crystal structure of ATPγS-ligated GroEL (Boisvert D C et al (1996) Nature Structural Biology 3: 170-177) and mutagenesis studies (Fenton W A et al (1994) Nature 371: 614-619) to have the nucleotide binding sites. Binding and hydrolysis of ATP is cooperative (Bochkareva E S et al (1992) J Biol j
- the crystal structure of GroEL shows unusually high B-factors for the apical domain compared with the equatorial or intermediate domain, and the B-factors vary considerably within the domain (Braig K et al (1994) Nature 371: 578-586; Braig K et al (1995) Nature Struct Biol 2: 1083-1094; Boisvert D C et al (1996) Nature Structural Biology 3: 170-177).
- the high overall B-factor seems to result from a static disorder within the asymmetric unit and probably throughout the crystals of GroEL, and has been attributed to rigid-body movements generated by hinge-like β-sheets in the intermediate domain.
- Regions of high flexibility have also been observed in the 2.8 Å structure of the co-chaperonin GroES (Hunt J F et al (1996) Nature 379: 37-45).
- a mobile loop has been shown to be directly involved in ADP-dependent binding to the apical domain (Landry S J et al (1993) Nature 364: 255-258).
- although the proteolytic fragment GroEL 150-456 elutes as a monomer during gel filtration, it still comprises the apical domain and significant portions of the intermediate and equatorial domains, the latter of which determine the intersubunit contacts of GroEL (Braig K et al (1994) supra). This would allow transient formation of the central cavity, thereby accounting for the chaperonin activity which is observed.
- EP-A-0 650 975 discloses chaperonin molecules and a method of refolding denatured proteins using GroEL chaperonin 60 monomers (cpn60m) obtained from Thermus thermophilus.
- the holo-chaperonin was first extracted and then purified from the bacterial source according to the method of Taguchi et al (1991) J Biol Chem 266: 22411-22418.
- the cpn60m was then produced by treatment of the holo-chaperonin with trifluoroacetic acid (TFA) followed by reverse phase (rp) HPLC of the resulting denatured protein.
- the refolding activity of the cpn60m was assayed in solution by monitoring the regain in activity of inactivated rhodanese, which in specific activity terms amounted to only about 25% of the specific activity of the rhodanese prior to inactivation. When background spontaneous rhodanese refolding is subtracted, the refolding activity is only approximately 20%.
- EP-A-0 650 975 also discloses the use of an approximately 50kD N-terminal deletion fragment of cpn60m wherein the N-terminal amino acid residues up to (but not including) the Thr residue at position 79 are removed by proteolysis.
- This 50kD fragment showed an approximately 35% (about 30% when background is subtracted) rhodanese refolding activity when in solution.
- Taguchi H et al (1994) J Biol Chem 269: 8529-8534 is a scientific report on which the invention of EP-A-0 650 975 is based.
- a transiently formed GroEL tetradecamer (the holo-chaperonin) was perceived to exist when the chaperonin monomers are present in solution. Consequently, the refolding activity of these preparations can be seen to be caused by the presence of holo chaperonin, not monomers.
- Taguchi et al immobilised cpn60m to a chromatographic resin to exclude the possibility of holo chaperonin formation. When immobilised and therefore when in truly monomeric form, cpn60m exhibited only about 10% rhodanese refolding activity.
- TIBS 18: 81-82 suggested that an "internal fragment" of GroEL may possess a chaperone activity on the basis of amino acid sequence similarity between the altered mRNA stability (ams) gene product (Ams) of E. coli and the central part of GroEL.
- the ams locus is a temperature-sensitive mutation that maps at 23 min on the E. coli chromosome and results in mRNA with an increased half-life.
- the ams gene has been cloned, expressed and shown to complement the ams mutation.
- the gene product is a 149-amino acid protein (Ams) with an apparent molecular weight of 17kD.
- Recombinant DNA technology has allowed industry to produce many proteins of commercial importance. Proteins are produced in a wide variety of expression systems which are based on, for example, bacterial, yeast, insect, plant and mammalian cells. One of the problems associated with the production of proteins by recombinant means is that host cells contain enzymes which degrade proteins, and the presence of such enzymes presents particular difficulties in the production of small polypeptides. Moreover, polypeptides produced by recombinant DNA technology are frequently at least partially incorrectly folded, such that yields of biologically active molecules vary according to the ability of the expression system to promote correct folding. This can also be problematic in the production of polypeptides destined for chemical and physical analysis, for which structural homogeneity is highly relevant.
- DNA encoding the protein of interest is fused in-frame to a fusion partner protein and the resulting fusion is expressed.
- a linker sequence encoding a protease cleavage site between the two parts of the fusion is included to allow cleavage of the fusion after it has been recovered from its host cell.
- the fusion partner protein is often one which may be recovered and purified by some form of highly specific affinity purification means.
- affinity purification means include, for example, glutathione-S-transferase, maltose binding protein and β-lactamase.
- fusion partner proteins are all relatively large and thus have a number of disadvantages. For example, it is essential to remove them before any meaningful procedure may be carried out on the protein of interest, since they are too large to enable it to function with any degree of independence. Many small polypeptides are still thus made by chemical synthesis.
- the present invention provides fusion proteins which incorporate chaperone fragments as fusion partners to promote high yield expression of correctly folded polypeptides in biological expression systems. It has been observed that consistently higher yields of recombinantly expressed polypeptides are obtained if the proteins are expressed as fusions with a chaperone fragment.
- a fusion protein comprising:
- fusion protein is used in accordance with its ordinary meaning in the art and refers to a single protein which is comprised of two or more regions which are derived from different sources. Typically, a fusion protein is two proteins fused together by way of in-frame fusion of their respective nucleic acid coding sequences.
- a “chaperone fragment”, as referred to herein, is any fragment of a molecular chaperone which possesses the ability to promote the folding of a polypeptide in vivo or in vitro. Preferred fragments are described in International patent application WO98/13496, incorporated herein by reference. Especially preferred are fragments 191-375, 191-345 and 193-335 of GroEL.
- the GroEL is E. coli GroEL, as further described below.
- the fusion protein according to the invention in addition to the chaperone fragment, includes a desired polypeptide.
- the desired polypeptide is typically a polypeptide which it is desired to express by recombinant DNA techniques; it is expressed as a fusion with the chaperone fragment in order to increase the yield of correctly folded product, in accordance with the present invention.
- Many polypeptides may be expressed as fusion proteins according to the present invention. However, the expression of smaller polypeptides, up to about 250 amino acids in length, is preferred. Preferably, the polypeptides are between about 5 and about 100 amino acids in length.
- the polypeptide is a eukaryotic polypeptide, such as a mammalian polypeptide.
- the fusion protein according to the invention comprises a cleavable linker between the first and second regions thereof.
- the linker which is typically a polypeptide chain cleavable by a protease, or by other means suitable for effecting polypeptide cleavage, may be cleaved after production of the fusion protein in order to facilitate recovery of the desired polypeptide.
- the cleavable linker may comprise an alternative cleavable site, such as a disulphide bond.
- the chaperone fragment is located N-terminal to the desired polypeptide in the fusion protein.
- the chaperone fragment itself forms the N-terminus of the fusion protein; however, it is envisaged that alternative N-termini may be included, such as to protect the chaperone fragment from degradation.
- polypeptide chains may be held together by non-covalent or covalent means, wherein such covalent means do not include peptide linkages.
- the chains may be held together by disulphide linkages.
- the invention further comprises a nucleic acid encoding the fusion protein of the invention, and preferably the nucleic acid forms part of an expression vector comprising the nucleic acid operably linked to a promoter.
- Nucleic acids, vectors and promoters are further described below.
- the invention further comprises a host cell carrying the expression vector of the invention, and a method of preparing the fusion protein of the invention comprising (i) culturing the host cell under conditions which provide for the expression of the fusion protein from the expression vector within the host cell; and (ii) recovering the fusion protein from the cell.
- the method optionally further comprises cleaving the protein at the protease cleavable linker and recovering the second region.
- Figure 1 is a diagram showing plasmid pHGro, comprising an N-terminal histidine tag, the 191-345 fragment of GroEL, a thrombin cleavage site and a multiple cloning site.
- Figure 2 shows an SDS-PAGE analysis of pHGro expression and purification systems. Molecular weight standards in the range 14,000 - 70,000 Daltons are loaded in lanes 5, 10 and 15 (Sigma #SDS-7 Dalton Mark VII-L™). Analysis of the sonication extracts shows that the Tenascin (lane 1), RNase HI (lane 6) and FKBP12 (lane 11) fusion proteins are all over-expressed to a high level.
- the first region of the fusion protein of the invention may comprise any natural or synthetic chaperone fragment.
- chaperone polypeptide having an amino acid sequence selected from at least amino acid residues 230-271 but no more than residues 150-455 or 151-456 of a GroEL sequence substantially as shown in SEQ. ID. No. 1, or a corresponding sequence of a substantially homologous chaperone polypeptide, or a modified, mutated or variant thereof having chaperone activity.
- the sequence of GroEL is available in the art, as set forth above, and from academic databases; however, GroEL fragments which conform to the database sequence are inoperative.
- the database contains a sequence in which positions 262 and 267 are occupied by Alanine and Isoleucine respectively. Fragments incorporating one or both of these residues at these positions are inoperative and unable to promote the folding of polypeptides.
- the invention instead relates to a GroEL polypeptide in which at least one of positions 262 and 267 is occupied by Leucine and Methionine respectively.
- the amino acid sequence is preferably selected from at least amino acid residues 193-335, preferably 193-337, more preferably 191-345, even more preferably 191-376 but no more than residues 151-455.
- the invention therefore includes polypeptides being GroEL amino acid residues 230-271, 230-272 ...et seq... 230-455 and in like manner residues 230-271, 229-271 ...et seq... 151-271. Also, residues 230-271, 229-272 ...et seq... 151-351, 151-352 ...et seq... 151-455. All amino acid sequences of 42 or more residues comprising at least contiguous residues 230-271 and not exceeding 151-455 are within the scope of this aspect of the invention, e.g. 171-423 or 166-406.
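The scope rule in the preceding paragraph (a contiguous fragment that contains at least residues 230-271 and does not extend beyond 151-455) can be sketched as a simple range test. The constants come from that paragraph; the function names are hypothetical, and the sketch deliberately ignores the alternative outer limits (150-455 or 151-456) mentioned elsewhere in the text.

```python
# Bounds taken from the paragraph above (1-based, inclusive).
CORE = (230, 271)   # residues every fragment must contain
OUTER = (151, 455)  # residues no fragment may extend beyond


def in_scope(start, end):
    """True if fragment start-end contains CORE and stays within OUTER."""
    return OUTER[0] <= start <= CORE[0] and CORE[1] <= end <= OUTER[1]


def count_fragments():
    """Number of distinct contiguous fragments satisfying in_scope."""
    n_starts = CORE[0] - OUTER[0] + 1  # starts 151..230
    n_ends = OUTER[1] - CORE[1] + 1    # ends 271..455
    return n_starts * n_ends


if __name__ == "__main__":
    for frag in [(230, 271), (171, 423), (166, 406), (191, 345), (150, 455)]:
        print(frag, in_scope(*frag))
    print("fragments in scope:", count_fragments())
```

Under these assumptions the shortest fragment in scope is 230-271 (42 residues), matching the "42 or more residues" wording, and the two examples given in the text (171-423 and 166-406) both pass the test.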
- the invention provides fragments selected from the group consisting of residues 191-375, 191-345 and 193-335.
- Chaperone activity may be determined in practice by an ability to refold cyclophilin A but other suitable proteins such as glucosamine-6-phosphate deaminase or a mutant form of indoleglycerol phosphate synthase (IGPS) (amino acid residues 49-252) may be used.
- a rhodanese refolding assay may also be used. Details of suitable refolding assays are described in more detail in the specific examples provided hereinafter.
- the chaperone activity is determined by the refolding of cyclophilin A. More preferably, 8M urea denatured cyclophilin A (100 µM) is diluted into 100 mM potassium phosphate buffer pH 7.0, 10 mM DTT to a final concentration of 1 µM and then contacted with at least 1 µM of said polypeptide at 25°C for at least 5 min, the resultant cyclophilin A activity being assayed by the method of Fischer G et al (1984) Biomed Biochim Acta 43: 1101-1111.
- the polypeptide is preferably an hsp60 polypeptide, preferably a GroEL polypeptide.
- a preferred polypeptide has the amino acid sequence 191-345 or 191-376, more preferably 193-335 or 191-337, of GroEL, or the equivalent residues of substantially homologous chaperonins, or a modified, mutated or variant sequence thereof.
- the polypeptide preferably has a molecular weight of less than 34kDa.
- Modifications include chemically modified polypeptides for example.
- “Variants” include, for example, naturally occurring variants of the kind to be found amongst a population of hsp60 chaperonin harbouring organisms/cells as well as naturally occurring polymorphisms or mutations.
- “Mutations” may also be introduced artificially by processes of mutagenesis well known to a person skilled in the art.
- substantially homologous peptides may have at least 50% amino acid sequence homology with the specified GroEL amino acid sequences, preferably at least 60% homology and more preferably 75% homology. Homology may of course also reside in the nucleotide sequences for the polypeptide which may be at least 50%, preferably at least 60% homologous and more preferably 75% homologous with the nucleotide sequence encoding the specified GroEL amino acid residues.
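As a rough illustration of the thresholds above, percent identity between two sequences can be computed position by position. This sketch assumes the two sequences have already been aligned to equal length without gaps, which is a simplification; the text does not specify an alignment method, and the function names and tier labels are hypothetical.

```python
def percent_identity(seq_a, seq_b):
    """Percent of positions that match between two pre-aligned sequences.

    Assumes equal-length, ungapped input (an illustrative simplification).
    """
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b)
    return 100.0 * matches / len(seq_a)


def homology_tier(pct):
    """Map a percentage onto the 50% / 60% / 75% tiers named in the text."""
    if pct >= 75:
        return "more preferred (at least 75%)"
    if pct >= 60:
        return "preferred (at least 60%)"
    if pct >= 50:
        return "substantially homologous (at least 50%)"
    return "below the 50% threshold"


if __name__ == "__main__":
    pct = percent_identity("LVPRGS", "LVPKGS")  # toy sequences
    print(round(pct, 1), homology_tier(pct))
```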
- Synthetic variants of naturally-occurring chaperone proteins may be made by standard recombinant DNA techniques. For example, site-directed mutagenesis may be used to introduce changes to the coding region of a DNA encoding a naturally-occurring coiled-coil protein. Where insertions are to be made, synthetic DNA encoding the insertion is prepared together with 5' and 3' flanking regions corresponding to the naturally-occurring sequence either side of the insertion site. The flanking regions will contain convenient restriction sites corresponding to sites in the naturally-occurring sequence so that the sequence may be cut with the appropriate enzyme(s) and the synthetic DNA ligated into the cut. The DNA is then expressed in accordance with the invention to make the encoded protein. These methods are only illustrative of the numerous standard techniques known in the art for manipulation of DNA sequences and other known techniques may also be used.
- the hsp60 class of chaperonin proteins are generally homologous in structure and so there are conserved or substantially homologous amino acid sequences between the members of the class.
- GroEL is just an example of an hsp60 chaperonin protein; other suitable proteins having an homologous apical domain may be followed.
- a fusion protein according to the invention will comprise as small a chaperone fragment as is feasible. This can be especially important in structural determination of proteins by NMR, where it is often necessary to carry out isotopic labelling with ¹⁵N or ¹³C. This is expensive, and with a long fusion partner much of the incorporated label is removed if the carrier protein (e.g. GST in many cases) is cleaved off.
- the second region of the fusion protein according to the invention may comprise any polypeptide sequence of interest which is not naturally associated with the first region. Usually this will mean that the sequence of interest will be found in nature encoded by a gene different from the gene encoding the first region. This may be determined easily by examining the sequences of the first and second regions against publicly available sequence databanks.
- the second region may be from the same species as the first region, or from a different species. It is also possible that the first and second regions are derived from portions of the same protein but are present in the fusion protein of the invention in a manner different from the natural protein sequence.
- the fusion protein according to the invention may be of any size although in general the invention is particularly useful when the polypeptide sequence of interest is short, e.g. from 2 to 100 amino acids in length, preferably 2 to 50 or even 2 to 30 or 5 to 10 amino acids in size. However, larger polypeptide sequences of interest, e.g. up to 150, 200, 400 or 1000 amino acids, are also contemplated.
- the invention is particularly advantageous for the preparation of small polypeptides which are currently difficult to manufacture by recombinant means.
- examples of polypeptides include fragments of chaperone proteins, metabolic enzymes, DNA and RNA binding proteins, antibodies, viral proteins, intrinsic membrane proteins (including transport proteins from mitochondria, seven-helix receptor molecules, T-cell receptors), cytoskeletal complexes, antibody binding peptides, peptide hormones (and other biologically active peptides made by ribosomal synthesis), and small subunits from multi-subunit biological structures such as respiratory enzymes and the ATP synthase.
- the invention is suitable for use with peptides of any dimension, but the advantageous properties thereof are best exploited with small polypeptides, for example from 2 to 50 amino acids in length, particularly from 2 to 20 amino acids in length, and preferably from 5 to 10 amino acids in length.
- a particular advantage of the present invention is that peptides may be produced by recombinant DNA technology which are so short that they would previously have been made by oligopeptide synthesis techniques.
- libraries of peptides for example of mutants of biologically active peptides, which may be screened or otherwise analysed, cheaply and efficiently in recombinant expression systems, particularly bacterial expression systems.
- the cleavable linker region is a protease cleavable linker, although other linkers, cleavable for example by small molecules, may be used. These include Met-X sites, cleavable by cyanogen bromide; Asn-Gly, cleavable by hydroxylamine; Asp-Pro, cleavable by weak acid; and Trp-X, cleavable by, inter alia, NBS-skatole.
- Protease cleavage sites are preferred due to the milder cleavage conditions necessary and are found in, for example, factor Xa, thrombin and collagenase. Any of these may be used. The precise sequences are available in the art and the skilled person will have no difficulty in selecting a suitable cleavage site.
- the protease cleavage region targeted by Factor Xa is I E G R.
- the protease cleavage region targeted by Enterokinase is D D D D D K.
- the protease cleavage region targeted by Thrombin is L V P R G.
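One practical use of the recognition sequences listed above is to check whether a desired polypeptide already contains the chosen site internally, since cleavage would then also cut the product. The sketch below uses the three motifs exactly as given in the text; the function name and the example sequences are hypothetical.

```python
# Recognition sequences exactly as stated in the text (one-letter codes).
PROTEASE_SITES = {
    "Factor Xa": "IEGR",
    "Enterokinase": "DDDDDK",
    "Thrombin": "LVPRG",
}


def find_sites(sequence):
    """Return (protease, 1-based start position) for every motif occurrence."""
    seq = sequence.upper()
    hits = []
    for protease, motif in PROTEASE_SITES.items():
        start = seq.find(motif)
        while start != -1:
            hits.append((protease, start + 1))
            start = seq.find(motif, start + 1)
    return sorted(hits, key=lambda hit: hit[1])


if __name__ == "__main__":
    # Toy sequence for illustration only, not taken from the patent.
    print(find_sites("AAALVPRGSIEGRKK"))
```

A linker whose motif is reported only at the intended junction, and nowhere inside the desired polypeptide, would be the natural choice under this check.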
- the invention also provides nucleic acid encoding the fusion proteins of the invention.
- the nucleic acid may be RNA or DNA and is preferably DNA. Where it is RNA, manipulations may be performed via cDNA intermediates. Generally, a nucleic acid sequence encoding the first region will be prepared and suitable restriction sites provided at the 5' and/or 3' ends.
- sequence is manipulated in a standard laboratory vector, such as a plasmid vector based on pBR322 or pUC19 (see below).
- Nucleic acid encoding the second region may likewise be provided in a similar vector system. Sources of nucleic acid may be ascertained by reference to published literature or databanks such as Genbank.
- Nucleic acid encoding the desired first or second region may be obtained from academic or commercial sources where such sources are willing to provide the material or by synthesising or cloning the appropriate sequence where only the sequence data are available. Generally this may be done by reference to literature sources which describe the cloning of the gene in question.
- nucleic acid sequences known in the art can be characterised as those nucleotide sequences which hybridise to the nucleic acid sequences known in the art.
- Stringency of hybridisation refers to conditions under which polynucleic acids hybrids are stable. Such conditions are evident to those of ordinary skill in the field. As known to those of skill in the art, the stability of hybrids is reflected in the melting temperature (Tm) of the hybrid which decreases approximately 1 to 1.5°C with every 1% decrease in sequence homology. In general, the stability of a hybrid is a function of sodium ion concentration and temperature. Typically, the hybridisation reaction is performed under conditions of higher stringency, followed by washes of varying stringency.
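The rule of thumb quoted above (Tm falls by roughly 1 to 1.5°C for each 1% decrease in sequence homology) gives a quick estimate of how far a mismatched hybrid's melting temperature lies below that of a perfect match. This is a minimal sketch of that arithmetic only; the function name is hypothetical and no salt or length correction is attempted.

```python
def tm_drop_range(percent_homology):
    """Estimated (low, high) drop in Tm, in degrees C, relative to a perfect hybrid.

    Uses the 1 to 1.5 degrees C per 1% mismatch rule of thumb from the text.
    """
    if not 0 <= percent_homology <= 100:
        raise ValueError("percent_homology must be between 0 and 100")
    mismatch = 100.0 - percent_homology
    return (1.0 * mismatch, 1.5 * mismatch)


if __name__ == "__main__":
    low, high = tm_drop_range(90)
    print(f"90% homology: Tm lower by about {low:.0f} to {high:.0f} degrees C")
```

For example, a hybrid with 90% homology would be expected to melt some 10 to 15°C below the perfectly matched hybrid, which is why the lower-stringency conditions described below use lower temperatures and higher salt.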
- high stringency refers to conditions that permit hybridisation of only those nucleic acid sequences that form stable hybrids in 1 M Na+ at 65-68 °C.
- High stringency conditions can be provided, for example, by hybridisation in an aqueous solution containing 6x SSC, 5x Denhardt's, 1 % SDS (sodium dodecyl sulphate), 0.1 Na+ pyrophosphate and 0.1 mg/ml denatured salmon sperm DNA as non specific competitor.
- high stringency washing may be done in several steps, with a final wash (about 30 min) at the hybridisation temperature in 0.2 - 0.1x SSC, 0.1% SDS.
- Moderate stringency refers to conditions equivalent to hybridisation in the above described solution but at about 60-62°C. In that case the final wash is performed at the hybridisation temperature in 1x SSC, 0.1% SDS.
- Low stringency refers to conditions equivalent to hybridisation in the above described solution at about 50-52°C. In that case, the final wash is performed at the hybridisation temperature in 2x SSC, 0.1 % SDS.
- nucleic acids suitable for forming the first or second region of a fusion protein according to the invention are obtainable according to methods well known in the art.
- a DNA of the invention is obtainable by chemical synthesis, using polymerase chain reaction (PCR) or by screening a genomic library or a suitable cDNA library prepared from a source believed to possess the desired nucleic acid and to express it at a detectable level.
- Chemical methods for synthesis of a nucleic acid of interest include triester, phosphite, phosphoramidite and H-phosphonate methods, PCR and other autoprimer methods as well as oligonucleotide synthesis on solid supports. These methods may be used if the entire nucleic acid sequence of the nucleic acid is known, or the sequence of the nucleic acid complementary to the coding strand is available. Alternatively, if the target amino acid sequence is known, one may infer potential nucleic acid sequences using known and preferred coding residues for each amino acid residue.
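The last point above, inferring a nucleic acid sequence from a known amino acid sequence using a preferred codon for each residue, can be sketched as a table lookup. The codon choices below are valid codons of the standard genetic code but are an illustrative assumption, not a codon-usage table from the specification; a real design would use measured codon preferences for the chosen host.

```python
# One codon per amino acid. Each is a valid standard-code codon, but the
# choice of "preferred" codon here is a hypothetical example only.
PREFERRED_CODON = {
    "A": "GCG", "R": "CGT", "N": "AAC", "D": "GAT", "C": "TGC",
    "Q": "CAG", "E": "GAA", "G": "GGC", "H": "CAT", "I": "ATT",
    "L": "CTG", "K": "AAA", "M": "ATG", "F": "TTT", "P": "CCG",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAT", "V": "GTG",
}


def back_translate(peptide):
    """Return one possible DNA coding sequence for a peptide (one-letter codes)."""
    try:
        return "".join(PREFERRED_CODON[aa] for aa in peptide.upper())
    except KeyError as exc:
        raise ValueError(f"unknown amino acid code: {exc.args[0]}") from None


if __name__ == "__main__":
    # Thrombin site from the text, back-translated with the example table.
    print(back_translate("LVPRG"))
```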
- An alternative means to isolate the gene encoding the desired region of the fusion protein is to use PCR technology as described e.g. in section 14 of Sambrook et al, 1989. This method requires the use of oligonucleotide probes that will hybridise to the desired nucleic acid. Strategies for selection of oligonucleotides are described below.
- cDNA expression libraries are screened with probes or analytical tools designed to identify the gene of interest or the protein encoded by it.
- suitable means include monoclonal or polyclonal antibodies that recognise and specifically bind to the desired protein; oligonucleotides of about 20 to 80 bases in length that encode known or suspected cDNA encoding the desired protein from the same or different species; and/or complementary or homologous cDNAs or fragments thereof that encode the same or a hybridising gene.
- Appropriate probes for screening genomic DNA libraries include, but are not limited to oligonucleotides, cDNAs or fragments thereof that encode the same or hybridising DNA; and/or homologous genomic DNAs or fragments thereof.
- a nucleic acid encoding the desired protein may be isolated by screening suitable cDNA or genomic libraries under suitable hybridisation conditions with a probe.
- a probe is e.g. a single-stranded DNA or RNA that has a sequence of nucleotides that includes between 10 and 50, preferably between 15 and 30 and most preferably at least about 20 contiguous bases that are the same as (or the complement of) an equivalent or greater number of contiguous bases from a known or desired sequence.
- the nucleic acid sequences selected as probes should be of sufficient length and sufficiently unambiguous so that false positive results are minimised.
- the nucleotide sequences are usually based on conserved or highly homologous nucleotide sequences or regions of the desired protein.
- the nucleic acids used as probes may be degenerate at one or more positions. The use of degenerate oligonucleotides may be of particular importance where a library is screened from a species in which preferential codon usage in that species is not known.
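To give a feel for why degenerate probes matter, the number of distinct coding sequences for a short peptide is the product of the number of codons available for each residue in the standard genetic code. The sketch below computes that product; the function name is hypothetical and the example peptides are simply the cleavage motifs quoted earlier in the text.

```python
from math import prod

# Number of codons per amino acid in the standard genetic code.
CODONS_PER_RESIDUE = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def degeneracy(peptide):
    """Number of distinct DNA sequences that encode the given peptide."""
    return prod(CODONS_PER_RESIDUE[aa] for aa in peptide.upper())


if __name__ == "__main__":
    for pep in ("MW", "IEGR", "LVPRG"):
        print(pep, degeneracy(pep))
```

Peptide stretches rich in Met and Trp give the least degenerate probes, while Leu, Arg and Ser (six codons each) increase the pool size quickly.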
- Preferred regions from which to construct probes include 5' and/or 3' coding sequences, sequences predicted to encode ligand binding sites, and the like.
- nucleic acid probes of the invention are labelled with suitable label means for ready detection upon hybridisation.
- a suitable label means is a radiolabel.
- the preferred method of labelling a DNA fragment is by incorporating α-³²P dATP with the Klenow fragment of DNA polymerase in a random priming reaction, as is well known in the art.
- Oligonucleotides are usually end-labelled with γ-³²P-labelled ATP and polynucleotide kinase.
- other methods e.g. non-radioactive
- positive clones are identified by detecting a hybridisation signal; the identified clones are characterised by restriction enzyme mapping and/or DNA sequence analysis, and then examined to ascertain whether they include DNA encoding a complete polypeptide (i.e., if they include translation initiation and termination codons). If the selected clones are incomplete, they may be used to rescreen the same or a different library to obtain overlapping clones. If the library is genomic, then the overlapping clones may include exons and introns. If the library is a cDNA library, then the overlapping clones will include an open reading frame. In both instances, complete clones may be identified by comparison with the DNAs and deduced amino acid sequences provided herein.
- nucleic acid of the invention can be readily modified by nucleotide substitution, nucleotide deletion, nucleotide insertion or inversion of a nucleotide stretch, and any combination thereof.
- Such mutants can be used e.g. to produce a mutant that has an amino acid sequence differing from the sequences as found in nature. Mutagenesis may be predetermined (site-specific) or random. A mutation which is not a silent mutation must not place sequences out of reading frames and preferably will not create complementary regions that could hybridise to produce secondary mRNA structure such as loops or hairpins.
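The requirement that a non-silent mutation must not place sequences out of reading frame can be sketched as a simple length check: an insertion or deletion preserves the downstream frame only if the net change in length is a multiple of three. The sequences below are hypothetical and the function name is illustrative; substitutions and inversions leave the length unchanged and so pass trivially. The check does not detect newly introduced stop codons or secondary structure.

```python
def keeps_reading_frame(original, mutant):
    """True if the net insertion/deletion length is a multiple of 3."""
    return (len(mutant) - len(original)) % 3 == 0

original = "ATGGCTGAAAAATAA"       # hypothetical ORF: M A E K stop
deletion_3 = "ATGGAAAAATAA"        # one codon (GCT) deleted, frame kept
insertion_1 = "ATGGGCTGAAAAATAA"   # one base inserted, frame shifted

print(keeps_reading_frame(original, deletion_3))
print(keeps_reading_frame(original, insertion_1))
```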
- the foregoing methods may, of course, be applied to the identification and modification or generation of sequences useful in any part of the fusion protein of the invention.
- the sequence of the IF* polypeptide coiled coil provided herein as SEQ. ID. No. 1, or suitable fragments thereof as discussed above, may be used as a probe for the identification of further suitable sequences.
- the first or second region may also be manipulated to introduce an appropriate restriction enzyme site at the terminus which is to be linked to the nucleic acid encoding the first region via a corresponding restriction enzyme site. Desirably the sites will be either the same or at least have matching cohesive ends.
- the first and second regions may be joined by alternative means; for example, the first region may be incorporated into primers used to isolate or replicate the second region.
- where a protease cleavable linker region is required, this may be introduced into the linked first and second regions (e.g. into the restriction site linking the two) or introduced into one or the other prior to their combination.
- vector refers to discrete elements that are used to introduce heterologous DNA into cells for either expression or replication thereof. Selection and use of such vehicles are well within the skill of the artisan. Many vectors are available, and selection of appropriate vector will depend on the intended use of the vector, i.e. whether it is to be used for DNA amplification or for DNA expression, the size of the DNA to be inserted into the vector, and the host cell to be transformed with the vector. Each vector contains various components depending on its function (amplification of DNA or expression of DNA) and the host cell for which it is compatible.
- the vector components generally include, but are not limited to, one or more of the following: an origin of replication, one or more marker genes, an enhancer element, a promoter, a transcription termination sequence and a signal sequence.
- Both expression and cloning vectors generally contain nucleic acid sequences that enable the vector to replicate in one or more selected host cells. Typically in cloning vectors, these sequences enable the vector to replicate independently of the host chromosomal DNA, and include origins of replication or autonomously replicating sequences. Such sequences are well known for a variety of bacteria, yeast and viruses.
- the origin of replication from the plasmid pBR322 is suitable for most Gram-negative bacteria, the 2μ plasmid origin is suitable for yeast, and various viral origins (e.g. SV40, polyoma, adenovirus) are useful for cloning vectors in mammalian cells.
- the origin of replication component is not needed for mammalian expression vectors unless these are used in mammalian cells competent for high level DNA replication, such as COS cells.
- Most expression vectors are shuttle vectors, i.e. they are capable of replication in at least one class of organisms but can be transfected into another organism for expression.
- a vector is cloned in E. coli and then the same vector is transfected into yeast or mammalian cells even though it is not capable of replicating independently of the host cell chromosome.
- DNA may also be replicated by insertion into the host genome.
- the recovery of genomic DNA encoding the fusion protein of the invention is more complex than that of exogenously replicated vector because restriction enzyme digestion is required to excise the DNA.
- DNA can be amplified by PCR and be directly transfected into the host cells without any replication component.
- an expression and cloning vector may contain a selection gene also referred to as selectable marker.
- This gene encodes a protein necessary for the survival or growth of transformed host cells grown in a selective culture medium. Host cells not transformed with the vector containing the selection gene will not survive in the culture medium.
- Typical selection genes encode proteins that confer resistance to antibiotics and other toxins, e.g. ampicillin, neomycin, methotrexate or tetracycline, complement auxotrophic deficiencies, or supply critical nutrients not available from complex media.
- any marker gene can be used which facilitates the selection for transformants due to the phenotypic expression of the marker gene.
- Suitable markers for yeast are, for example, those conferring resistance to antibiotics G418, hygromycin or bleomycin, or provide for prototrophy in an auxotrophic yeast mutant, for example the URA3, LEU2, LYS2, TRP1, or HIS3 gene.
- E. coli genetic marker and an E. coli origin of replication are advantageously included. These can be obtained from E. coli plasmids, such as pBR322, Bluescript vector or a pUC plasmid, e.g. pUC18 or pUC19, which contain both E. coli replication origin and E. coli genetic marker conferring resistance to antibiotics, such as ampicillin.
- Suitable selectable markers for mammalian cells are those that enable the identification of cells competent to take up vector nucleic acid, such as dihydrofolate reductase (DHFR, methotrexate resistance), thymidine kinase, or genes conferring resistance to G418 or hygromycin.
- the mammalian cell transformants are placed under selection pressure under which only those transformants which have taken up and are expressing the marker are uniquely adapted to survive.
- selection pressure can be imposed by culturing the transformants under conditions in which the pressure is progressively increased, thereby leading to amplification (at its chromosomal integration site) of both the selection gene and the linked DNA that encodes the fusion protein.
- Amplification is the process by which genes in greater demand for the production of a protein critical for growth, together with closely associated genes which may encode a desired protein, are reiterated in tandem within the chromosomes of recombinant cells. Increased quantities of desired protein are usually synthesised from thus amplified DNA.
- Expression and cloning vectors usually contain a promoter that is recognised by the host organism and is operably linked to the fusion-protein encoding nucleic acid. Such a promoter may be inducible or constitutive.
- the promoters are operably linked to DNA encoding the fusion protein by removing the promoter from the source DNA by restriction enzyme digestion and inserting the isolated promoter sequence into the vector. Both the native promoter sequence of one of the constituents of the fusion protein and many heterologous promoters may be used to direct amplification and/or expression of the DNA.
- the term "operably linked" refers to a juxtaposition wherein the components described are in a relationship permitting them to function in their intended manner.
- a control sequence "operably linked" to a coding sequence is ligated in such a way that expression of the coding sequence is achieved under conditions compatible with the control sequences.
- Promoters suitable for use with prokaryotic hosts include, for example, the β-lactamase and lactose promoter systems, alkaline phosphatase, the tryptophan (trp) promoter system and hybrid promoters such as the tac promoter. Their nucleotide sequences have been published, thereby enabling the skilled worker operably to ligate them to DNA encoding the fusion protein using linkers or adaptors to supply any required restriction sites. Promoters for use in bacterial systems will also generally contain a Shine-Dalgarno sequence operably linked to the DNA encoding the fusion protein.
- Preferred expression vectors are bacterial expression vectors which comprise a promoter of a bacteriophage such as phage λ or T7 which is capable of functioning in the bacteria.
- the nucleic acid encoding the fusion protein may be transcribed from the vector by T7 RNA polymerase (Studier et al, Methods in Enzymol. 185; 60-89, 1990).
- In the E. coli BL21(DE3) host strain, used in conjunction with pET vectors, the T7 RNA polymerase is produced from the λ-lysogen DE3 in the host bacterium, and its expression is under the control of the IPTG inducible lac UV5 promoter.
- the polymerase gene may be introduced on a lambda phage by infection with an int- phage such as the CE6 phage, which is commercially available (Novagen, Madison, USA). Other vectors include vectors containing the lambda PL promoter such as pLEX (Invitrogen, NL), vectors containing the trc promoters such as pTrcHisXpress™ (Invitrogen) or pTrc99 (Pharmacia Biotech, SE), or vectors containing the tac promoter such as pKK223-3 (Pharmacia Biotech) or pMAL (New England Biolabs, MA, USA).
- the fusion protein gene according to the invention may include a secretion sequence in order to facilitate secretion of the polypeptide from bacterial hosts, such that it will be produced as a soluble native peptide rather than in an inclusion body.
- the peptide may be recovered from the bacterial periplasmic space, or the culture medium, as appropriate.
- Suitable promoting sequences for use with yeast hosts may be regulated or constitutive and are preferably derived from a highly expressed yeast gene, especially a Saccharomyces cerevisiae gene.
- the S. pombe nmt 1 gene or a promoter from the TATA binding protein (TBP) gene can be used.
- hybrid promoters comprising upstream activation sequences (UAS) of one yeast gene and downstream promoter elements including a functional TATA box of another yeast gene, for example a hybrid promoter including the UAS(s) of the yeast PHO5 gene and downstream promoter elements including a functional TATA box of the yeast GAP gene (PHO5-GAP hybrid promoter).
- a suitable constitutive PHO5 promoter is e.g. a shortened acid phosphatase PHO5 promoter devoid of the upstream regulatory elements (UAS) such as the PHO5 (-173) promoter element starting at nucleotide -173 and ending at nucleotide -9 of the PHO5 gene.
- Fusion protein gene transcription from vectors in mammalian hosts may be controlled by promoters derived from the genomes of viruses such as polyoma virus, adenovirus, fowlpox virus, bovine papilloma virus, avian sarcoma virus, cytomegalovirus (CMV), a retrovirus and Simian Virus 40 (SV40), from heterologous mammalian promoters such as the actin promoter or a very strong promoter, e.g. a ribosomal protein promoter, and from the promoter normally associated with the gene encoding a component of the fusion protein, provided such promoters are compatible with the host cell systems.
- Enhancers are relatively orientation and position independent. Many enhancer sequences are known from mammalian genes (e.g. elastase and globin). However, typically one will employ an enhancer from a eukaryotic cell virus. Examples include the SV40 enhancer on the late side of the replication origin (bp 100-270) and the CMV early promoter enhancer. The enhancer may be spliced into the vector at a position 5' or 3' to the coding sequence, but is preferably located at a site 5' from the promoter.
- a eukaryotic expression vector encoding the fusion protein may comprise a locus control region (LCR).
- LCRs are capable of directing high-level integration site independent expression of transgenes integrated into host cell chromatin, which is of importance especially where the fusion protein gene is to be expressed in the context of a permanently-transfected eukaryotic cell line in which chromosomal integration of the vector has occurred, in vectors designed for gene therapy applications or in transgenic animals.
- An expression vector includes any vector capable of expressing nucleic acids that are operatively linked with regulatory sequences, such as promoter regions, that are capable of expression of such DNAs.
- an expression vector refers to a recombinant DNA or RNA construct, such as a plasmid, a phage, recombinant virus or other vector, that upon introduction into an appropriate host cell, results in expression of the cloned DNA.
- Appropriate expression vectors are well known to those with ordinary skill in the art and include those that are replicable in eukaryotic and/or prokaryotic cells and those that remain episomal or those which integrate into the host cell genome.
- DNAs encoding the fusion protein according to the invention may be inserted into a vector suitable for expression of cDNAs in mammalian cells, e.g. a CMV enhancer-based vector such as pEVRF (Matthias, et al, (1989) NAR 17, 6418).
- Transient expression usually involves the use of an expression vector that is able to replicate efficiently in a host cell, such that the host cell accumulates many copies of the expression vector, and, in turn, synthesises high levels of fusion protein.
- transient expression systems are useful e.g. for identifying fusion protein mutants, to identify potential phosphorylation sites, or to characterise functional domains of the protein.
- Construction of plasmids employs conventional ligation techniques. Isolated plasmids or DNA fragments are cleaved, tailored, and religated in the form desired to generate the plasmids required. If desired, analysis to confirm correct sequences in the constructed plasmids is performed in a known fashion. Suitable methods for constructing expression vectors, preparing in vitro transcripts, introducing DNA into host cells, and performing analyses for assessing expression and function are known to those skilled in the art.
- Gene presence, amplification and/or expression may be measured in a sample directly, for example, by conventional Southern blotting, Northern blotting to quantitate the transcription of mRNA, dot blotting (DNA or RNA analysis), or in situ hybridisation, using an appropriately labelled probe based on a sequence provided herein. Those skilled in the art will readily envisage how these methods may be modified, if desired.
- the invention moreover provides an expression vector comprising a first nucleic acid sequence encoding a polypeptide capable of forming a coiled coil structure operably linked to a promoter capable of expressing the first nucleic acid sequence in a host cell, and, linked to the nucleic acid sequence, a cloning site permitting the insertion of a second nucleic acid sequence such that it is capable of being expressed in fusion with the first nucleic acid sequence.
- a vector is a useful vehicle for expressing nucleic acids encoding any desired polypeptide in the form of a fusion protein according to the invention.
- a further embodiment of the invention provides host cells transformed or transfected with the vectors for the replication and expression of polynucleotides of the invention.
- the cells will be chosen to be compatible with the vector and may for example be bacterial, yeast, insect or mammalian.
- Host cells such as prokaryote, yeast and higher eukaryote cells may be used for replicating DNA and producing the fusion protein.
- Suitable prokaryotes include eubacteria, such as Gram-negative or Gram-positive organisms, such as E. coli, e.g. E. coli K-12 strains, DH5α and HB101, or Bacilli.
- Further hosts suitable for fusion protein encoding vectors include eukaryotic microbes such as filamentous fungi or yeast, e.g. Saccharomyces cerevisiae. Higher eukaryotic cells include insect and vertebrate cells, particularly mammalian cells.
- mammalian host cell lines are epithelial or fibroblastic cell lines such as Chinese hamster ovary (CHO) cells, NIH 3T3 cells, HeLa cells or 293T cells.
- the host cells referred to in this disclosure comprise cells in in vitro culture as well as cells that are within a host animal.
- DNA may be stably incorporated into cells or may be transiently expressed using methods known in the art.
- Stably transfected mammalian cells may be prepared by transfecting cells with an expression vector having a selectable marker gene, and growing the transfected cells under conditions selective for cells expressing the marker gene. To prepare transient transfectants, mammalian cells are transfected with a reporter gene to monitor transfection efficiency.
- the cells should be transfected with a sufficient amount of fusion protein-encoding nucleic acid to form the fusion protein.
- the precise amounts of DNA encoding the fusion protein may be empirically determined and optimised for a particular cell and assay.
- Host cells are transfected or, preferably, transformed with the above-captioned expression or cloning vectors of this invention and cultured in conventional nutrient media modified as appropriate for inducing promoters, selecting transformants, or amplifying the genes encoding the desired sequences.
- Heterologous DNA may be introduced into host cells by any method known in the art, such as transfection with a vector encoding a heterologous DNA by the calcium phosphate coprecipitation technique or by electroporation. Numerous methods of transfection are known to the skilled worker in the field. Successful transfection is generally recognised when any indication of the operation of this vector occurs in the host cell. Transformation is achieved using standard techniques appropriate to the particular host cells used.
- Transfected or transformed cells are cultured using media and culturing methods known in the art, preferably under conditions, whereby the fusion protein encoded by the DNA is expressed.
- the composition of suitable media is known to those in the art, so that they can be readily prepared. Suitable culturing media are also commercially available.
- Preferred bacterial hosts which may be used in the method of the invention include B strains of E. coli such as BL21 or a K strain such as JM109. These strains are widely available in the art from academic and/or commercial sources. The B strains are deficient in the lon protease and other strains with this genotype may also be used. Preferably the strain should not be defective in recombination genes.
- strain is BL21(DE3), as disclosed in Studier et al. (1990).
- Bacteria obtainable by selection for improved heterologous polypeptide expression, optionally cured of the original vector, may also be used as host cells in the present invention.
- E. coli C43 (DE3) (deposited at the European Collection of Cell Cultures (ECCC), Salisbury, Wiltshire, UK on 4th July 1996 as B96070445); E. coli C0214(DE3) (deposited at the National Collections of Industrial and Marine Bacteria on 25th June 1997 as NCIMB 40884); E. coli DK8(DE3)S (deposited at the National Collections of Industrial and Marine Bacteria on 25th June 1997 as NCIMB 40885); or E. coli C41(DE3) (deposited at the ECCC on 4th July 1996 as B96070444).
- Host cells of the invention may be cultured under conditions in which expression of the fusion protein occurs.
- the fusion protein may be recovered by any suitable means, for example affinity chromatography or HPLC. Where small fusion proteins are involved HPLC is particularly suitable.
- the fusion protein may be cleaved, e.g. using an appropriate protease, to provide the polypeptide sequence of interest and this sequence may be recovered from the resulting mixture of first and second regions of the fusion protein.
- the fusion protein may find application as such, for example as an immunogen where the coiled-coils form aggregates. This avoids the necessity for preparing immunogenic material from small proteins and peptides by coupling them by separate chemical reaction to a carrier protein such as key-hole limpet hemocyanin (KLH).
- Fusion proteins according to the invention possess an extremely small fusion partner.
- One advantage thereof is that the fusion proteins may be employed directly in an NMR experiment without the fusion partner interfering in the spectrum received.
- NMR analysis may be performed according to techniques and methodology which are known in the art, for example as described in K. Wüthrich, "NMR of Proteins and Nucleic Acids", Wiley, New York, 1986, incorporated herein by reference.
- the polymerase chain reaction is used to generate a DNA fragment containing an N-terminal histidine tag, the 191-345 fragment of GroEL, a thrombin cleavage site and a multiple cloning site.
- the 5'- flanking PCR primer is 5'- AGA CGG ACT GCC ATA TGC ATC ATC ATC ATC ATC ATG AAG GTA TGC AGT TCG ACC - 3'.
- the 3'- flanking primer is 5'- ATT GAC CCC AAG CTT CGA ATT CCA TGG TAC CAG CTG CAG ATG TCG AGC TCG GAT CCA CGC GGA ACC AGA CCA CGG CCC TGG ATT GCA GCT TCT TCA CCC -3'.
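The two flanking primers can be inspected with a short sketch that lists the restriction sites each carries and translates the coding strand. The primer sequences are copied from the text; the enzyme recognition sequences are the standard ones, while the function names and the choice of reading frame for the 3' primer (offset 1 on its reverse complement, chosen here because it yields the thrombin recognition motif LVPRGS) are assumptions made for illustration.

```python
from itertools import product

# Primer sequences as given in the text (spaces removed).
FWD = ("AGA CGG ACT GCC ATA TGC ATC ATC ATC ATC ATC ATG "
       "AAG GTA TGC AGT TCG ACC").replace(" ", "")
REV = ("ATT GAC CCC AAG CTT CGA ATT CCA TGG TAC CAG CTG CAG ATG TCG AGC "
       "TCG GAT CCA CGC GGA ACC AGA CCA CGG CCC TGG ATT GCA GCT TCT TCA CCC"
       ).replace(" ", "")

# Standard (palindromic) recognition sequences.
SITES = {"NdeI": "CATATG", "HindIII": "AAGCTT", "EcoRI": "GAATTC",
         "NcoI": "CCATGG", "KpnI": "GGTACC", "PvuII": "CAGCTG",
         "PstI": "CTGCAG", "SacI": "GAGCTC", "BamHI": "GGATCC"}

# Standard genetic code, codons ordered T, C, A, G at each position.
CODONS = dict(zip(
    ("".join(c) for c in product("TCAG", repeat=3)),
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"))

def revcomp(seq):
    """Reverse complement of a DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

def translate(seq):
    """Translate complete codons from the start of seq."""
    return "".join(CODONS[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))

def sites_in(seq):
    """Names of the listed restriction sites present in seq."""
    return [name for name, site in SITES.items() if site in seq]

fwd_orf = translate(FWD[FWD.index("ATG"):])   # start at the NdeI ATG
rev_orf = translate(revcomp(REV)[1:])         # assumed frame, see above
print(sites_in(FWD), fwd_orf)
print(sites_in(REV), rev_orf)
```

Read this way, the 5' primer carries an Nde I site whose ATG starts a Met-His6 tag, and the 3' primer carries the thrombin motif followed by the BamH I, EcoR I and Hind III sites used for the cloning steps described in the text.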
- the template for the PCR amplification is as described in Zahn et al, (1996) PNAS (USA) 93:15024- 15026.
- the resulting fragment is cloned into Nde I and Hind III digested pRSETA (Invitrogen) to create pHGro (see fig. 1).
- a Fibronectin type III domain of human Tenascin and human FKBP 12 are sub-cloned into pHGro using BamH I and EcoR I.
- Residues 2-62 of S. cerevisiae RNase HI are amplified from genomic DNA by PCR and subcloned via BamHI and EcoRI restriction sites into the pRSETa vector (Invitrogen), which also contains a fragment of the GroEL chaperone protein and a histidine tag.
- the sequences of the primers used for PCR amplification are as follows;
- the vector is used to transform Escherichia coli strain BL21 (DE3)C41 (Novagen; Miroux, B., and Walker, J. E. (1996) J. Mol. Biol. 260, 289 - 298) which is used for the expression tests in 2xTY medium. Transformants are obtained using a polyethylene glycol method (Chung, et al. (1989) Proc. Natl. Acad. Sci. USA 86, 2172 - 2175).
- a 2 litre shake flask containing 0.25 litre of 2xTY medium plus ampicillin at 50 μg per ml is inoculated with 4 C41 colonies containing a pHGro fusion vector.
- the culture is grown at 28 °C in an orbital shaker at 200 rpm.
- expression is induced with isopropyl β-D-thiogalactoside (IPTG), using a 50 μM final concentration.
- the cells are harvested about 20 hours after induction by centrifugation and re-suspended in 200 ml of 20 mM sodium phosphate buffer pH 7.2 + 150 mM NaCl + 10 mM β-mercaptoethanol + PMSF (0.5 mM final concentration).
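The final concentrations quoted for induction and resuspension can be converted to pipetting volumes with the usual dilution relation C1·V1 = C2·V2. The stock concentrations below (1 M IPTG, 100 mM PMSF) are assumptions made for illustration and are not stated in the text; only the final concentrations and volumes are taken from it.

```python
def stock_volume_l(final_molar, final_volume_l, stock_molar):
    """Volume of stock (litres) giving final_molar in final_volume_l."""
    if stock_molar <= final_molar:
        raise ValueError("stock must be more concentrated than the target")
    return final_molar * final_volume_l / stock_molar

# 50 uM IPTG in a 0.25 litre culture, from an assumed 1 M stock.
iptg_l = stock_volume_l(50e-6, 0.25, 1.0)

# 0.5 mM PMSF in 200 ml of buffer, from an assumed 100 mM stock.
pmsf_l = stock_volume_l(0.5e-3, 0.200, 0.100)

print(f"IPTG stock: {iptg_l * 1e6:.1f} microlitres")
print(f"PMSF stock: {pmsf_l * 1e3:.2f} millilitres")
```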
- the suspension is sonicated on power level 8 on a Misonix Inc. Model No. XL2020 sonicator, using 1 second pulses and 3 seconds cooling on ice for a total of 12 minutes and centrifuged at 15 k rpm for 30 minutes.
- the insoluble fraction is re-suspended in 100 ml of sonication buffer, re-sonicated and re-centrifuged.
- Purification is performed in a batch-wise manner.
- the centrifuged protein solutions are combined and 10 ml of Ni²⁺-charged iminodiacetic acid resin (Sigma) is added.
- the solution is stirred for 3 hours at 4 °C and the resin washed 3 times with 50 ml of sonication buffer followed by 2 times with 50 ml of 50 mM Trizma base / Trizma HCl (Sigma) buffer pH 8.4 + 10 mM mercaptoethanol.
- Centrifugation is used to isolate the resin following each 1 minute wash. This process is repeated for each pHGro fusion.
- 50 ml of buffer containing 250 mM Imidazole is used to elute the fusion proteins from the resin.
- Tris buffer pH 8.4 + 10 mM β-mercaptoethanol is used for human FKBP 12; pH 7.4 is used for Tenascin and RNase HI. It is necessary to include 150 mM NaCl during the elution of the RNase HI domain. 600 units of Thrombin (Sigma) is added to FKBP 12 and 50 units to the other two. After about 20 hours at room temperature the purifications are analysed by SDS-PAGE (Schägger, H., and von Jagow, G. (1987) Analytical Biochemistry 166, 368 - 379). Protein concentrations are estimated using Bio-Rad's Protein Assay solution, which is based on the Bradford dye-binding procedure.
- Bovine Serum Albumin is used to produce the calibration curve.
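A calibration curve of this kind is commonly fitted as a straight line through the standards and then inverted to estimate unknown samples. The sketch below uses an ordinary least-squares fit; the BSA concentrations and absorbance readings are invented for illustration and are not data from the specification. In practice the dye response is only approximately linear, so the standards should bracket the samples.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit; returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def concentration(absorbance, slope, intercept):
    """Invert the calibration line to get protein concentration."""
    return (absorbance - intercept) / slope

# Hypothetical BSA standards (mg/ml) and absorbance readings.
bsa_mg_per_ml = [0.0, 0.25, 0.5, 0.75, 1.0]
absorbance = [0.05, 0.30, 0.55, 0.80, 1.05]

slope, intercept = fit_line(bsa_mg_per_ml, absorbance)
sample = concentration(0.45, slope, intercept)
print(f"estimated sample concentration: {sample:.2f} mg/ml")
```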
- the test proteins are produced to an average of 400 mg per litre of culture, which is approximately 30% of the total soluble protein. All three fusion proteins behave in a typical manner during metal affinity chromatography, and thrombin removes the GroEL fragment successfully in each case (see FIG. 2). Tenascin and RNase HI only require a small quantity of thrombin for the complete removal of GroEL. In the case of FKBP 12, a small amount of fusion protein remains after the treatment with thrombin. This has also been experienced with other FKBP 12 fusion proteins where thrombin has been used and is to be expected.
- the 191-345 apical fragment of GroEL with an N-terminal histidine tag satisfies the criteria for a good fusion protein. It can be over-expressed to high levels as soluble fusion proteins in E. coli, it is small and can be purified easily using nickel affinity chromatography. Being monomeric, this expression system does not suffer from the problems associated with the expression of multimeric proteins with dimeric fusion proteins.
- Uniform labelling of proteins with ¹⁵N, or ¹⁵N and ¹³C, is achieved by growing cells in minimal media containing ¹⁵NH₄Cl or ¹³C₆-glucose as nitrogen and carbon sources respectively.
- a 10 % ¹³C-labelled protein is produced by incorporating 10 % ¹³C₆-glucose and 90 % unlabelled glucose into the growth medium.
- Half litre cultures are grown at 28 °C, 250 rpm shaking, to an optical density of 0.2 AUs at 600 nm.
- Protein expression is induced for 16 h with 0.2 mM isopropyl-β-D-thiogalactoside and harvested cells are resuspended in 16 mM Na₂HPO₄, 4 mM NaH₂PO₄·H₂O, 150 mM NaCl and 10 mM β-mercaptoethanol.
- Cells are subject to two rounds of sonication and cell lysates are centrifuged at 17,000 r.p.m. for 30 min. The supernatant is applied to a nickel affinity column (Sigma) and the fusion protein eluted with 50 mM Tris-HCl pH 8.4, 150 mM NaCl, 10 mM β-mercaptoethanol and 250 mM imidazole.
- Thrombin digestion of the fusion protein, using 5 U thrombin per ml protein, releases the RNase HI fragment from the GroEL tag fragment. This is carried out for 2 h at room temperature.
- the RNase HI is purified from the GroEL fragment using a Heparin HyperD column (Sigma) with a gradient of 1 M NaCl (0-100%) in 50 mM Tris-HCl pH 8.4 and 10 mM β-mercaptoethanol.
- RNase HI containing fractions are dialysed overnight against 50 mM acetate buffer pH 3.6 and 5 mM DTT, and concentrated in an Amicon concentrator.
- NMR samples contain approximately 2 mM protein in 50 mM acetate buffer pH 3.6 and 5 mM DTT, in either H₂O with 10 % D₂O or 100 % D₂O.
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Zoology (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- Medicinal Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Gastroenterology & Hepatology (AREA)
- Wood Science & Technology (AREA)
- General Chemical & Material Sciences (AREA)
- Microbiology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Biotechnology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Toxicology (AREA)
- Peptides Or Proteins (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP00935315A EP1181378A1 (en) | 1999-06-09 | 2000-05-23 | Fusion proteins comprising a fragment of a chaperon polypeptide |
JP2001501628A JP2003501064A (en) | 1999-06-09 | 2000-05-23 | Fusion proteins containing fragments of chaperone polypeptides |
AU50868/00A AU5086800A (en) | 1999-06-09 | 2000-05-23 | Fusion proteins comprising a fragment of a chaperon polypeptide |
CA002376062A CA2376062A1 (en) | 1999-06-09 | 2000-05-23 | Fusion proteins comprising a fragment of a chaperon polypeptide |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GBGB9913437.1A GB9913437D0 (en) | 1999-06-09 | 1999-06-09 | Fusion proteins |
GB9913437.1 | 1999-06-09 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2000075346A1 (en) | 2000-12-14 |
Family
ID=10855032
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/GB2000/001981 WO2000075346A1 (en) | 1999-06-09 | 2000-05-23 | Fusion proteins comprising a fragment of a chaperon polypeptide |
Country Status (6)
Country | Link |
---|---|
EP (1) | EP1181378A1 (en) |
JP (1) | JP2003501064A (en) |
AU (1) | AU5086800A (en) |
CA (1) | CA2376062A1 (en) |
GB (1) | GB9913437D0 (en) |
WO (1) | WO2000075346A1 (en) |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1354959A1 (en) * | 2000-12-26 | 2003-10-22 | Sekisui Chemical Co., Ltd. | Process for producing recombinant protein and fused protein |
WO2004001041A1 (en) * | 2002-06-25 | 2003-12-31 | Sekisui Chemical Co., Ltd. | Expression vector, host, fused protein, process for producing fused protein and process for producing protein |
EP1451217A1 (en) * | 2001-11-20 | 2004-09-01 | Atgen Co., Ltd. | Novel peptides conferring environmental stress resistance and fusion proteins including said peptides |
EP1621555A1 (en) * | 2003-04-18 | 2006-02-01 | Chiba, Joe | Immunogen, composition for immunological use, and method of producing antibody using the same |
US8129500B2 (en) | 1997-12-10 | 2012-03-06 | Csl Limited | Porphyromonas gingivalis polypeptides and nucleotides |
US8241611B2 | 2007-07-12 | 2012-08-14 | Oral Health Australia Pty. Ltd. | Biofilm treatment |
US8282933B2 (en) * | 1999-12-24 | 2012-10-09 | Csl Limited | P. gingivalis antigenic composition |
US8349807B2 (en) | 2004-10-15 | 2013-01-08 | Joe Chiba | Method of immunizing animal, composition for immunization, method for producing antibody, method for producing hybridoma and method for producing monoclonal antibody |
US8426167B2 (en) | 2001-06-22 | 2013-04-23 | Roche Diagnostics Operations, Inc. | Methods for producing fusion polypeptides or enhancing expression of fusion polypeptides |
US8431688B2 (en) | 1997-04-30 | 2013-04-30 | The University Of Melbourne | Synthetic peptide constructs for the diagnosis and treatment of Periodontitis associated with Porphyromonas gingivalis |
US8871213B2 (en) | 2008-08-29 | 2014-10-28 | Oral Health Australia Pty Ltd | Prevention, treatment and diagnosis of P. gingivalis infection |
US8911745B2 (en) | 2007-07-12 | 2014-12-16 | Oral Health Australia Pty Ltd. | Immunology treatment for biofilms |
US8916166B2 (en) | 2006-06-27 | 2014-12-23 | Oral Health Australia Pty Ltd | Porphyromonas gingivalis polypeptides useful in the prevention of periodontal disease |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2523034A1 (en) * | 2003-04-28 | 2004-11-11 | Sekisui Chemical Co., Ltd. | Method of producing target protein, fused protein and gene thereof, partial sequence protein of intein and gene thereof, expression vector and transformant |
EP1619208B1 (en) * | 2003-04-28 | 2008-10-29 | Sekisui Chemical Co., Ltd. | Chaperonine-target protein complex, method of producing the same, method of stabilizing target protein, method of immobilizing target protein, method of analyzing the structure of target protein, sustained-release preparation and method of producing antibody against target protein |
CN103760184A (en) * | 2014-01-16 | 2014-04-30 | 江南大学 | Construction method of magnetic resonance imaging sensor for measuring lead ions |
Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0412465A1 (en) * | 1989-08-08 | 1991-02-13 | Hoechst Aktiengesellschaft | Process for correct biocatalytic chain folding of denaturated recombinant fusion proteins |
WO1993013200A1 (en) * | 1991-12-20 | 1993-07-08 | Novo Nordisk A/S | A process for the preparation of lipase |
WO1993025681A1 (en) * | 1992-06-11 | 1993-12-23 | New York University | A cytoplasmic chaperonin and methods of making and using it |
EP0650975A1 (en) * | 1993-08-03 | 1995-05-03 | Nippon Oil Co., Ltd. | Refolding of denatured proteins using chaperonin 60 monomers from Thermus thermophilus |
EP0774512A2 (en) * | 1995-09-14 | 1997-05-21 | Tadayuki Imanaka | A method for production of protein using molecular chaperon |
WO1997018233A1 (en) * | 1995-11-13 | 1997-05-22 | Pharmacia & Upjohn Ab | Method for producing a correctly folded, biological active recombinant protein |
WO1998013496A1 (en) * | 1996-09-26 | 1998-04-02 | Medical Research Council | Chaperone fragments |
WO1998024909A1 (en) * | 1996-12-03 | 1998-06-11 | Medical Research Council | Chaperone fragments |
WO1999002989A1 (en) * | 1997-07-10 | 1999-01-21 | Medical Research Council | Chaperone fragments |
WO1999005163A1 (en) * | 1997-07-24 | 1999-02-04 | Medical Research Council | Refolding method using a foldase and a chaperone |
WO1999050302A1 (en) * | 1998-03-31 | 1999-10-07 | Tonghua Gantech Biotechnology Ltd. | Chimeric protein containing an intramolecular chaperone-like sequence and its application to insulin production |
1999
- 1999-06-09: GB application GBGB9913437.1A (GB9913437D0), not active, Ceased
2000
- 2000-05-23: EP application EP00935315A (EP1181378A1), not active, Withdrawn
- 2000-05-23: CA application CA002376062A (CA2376062A1), not active, Abandoned
- 2000-05-23: WO application PCT/GB2000/001981 (WO2000075346A1), not active, Application Discontinuation
- 2000-05-23: AU application AU50868/00A (AU5086800A), not active, Abandoned
- 2000-05-23: JP application JP2001501628A (JP2003501064A), active, Pending
Non-Patent Citations (2)
Title |
---|
SAMUELSSON E ET AL: "FACILITATED IN VITRO REFOLDING OF HUMAN RECOMBINANT INSULIN-LIKE GROWTH FACTOR I USING A SOLUBILIZING FUSION PARTNER", BIO/TECHNOLOGY,US,NATURE PUBLISHING CO. NEW YORK, vol. 9, no. 4, 1 April 1991 (1991-04-01), pages 363 - 366, XP000572690, ISSN: 0733-222X * |
SAMUELSSON ELISABET ET AL: "Enhanced in vitro refolding of insulin-like growth factor I using a solubilizing fusion partner.", BIOCHEMISTRY, vol. 33, no. 14, 1994, pages 4207 - 4211, XP002147681, ISSN: 0006-2960 * |
Cited By (29)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8431688B2 (en) | 1997-04-30 | 2013-04-30 | The University Of Melbourne | Synthetic peptide constructs for the diagnosis and treatment of Periodontitis associated with Porphyromonas gingivalis |
US8841420B2 (en) | 1997-04-30 | 2014-09-23 | The University Of Melbourne | Synthetic peptide constructs for the diagnosis and treatment of periodontis associated with Porphyromonas gingivalis |
US8642731B2 (en) | 1997-12-10 | 2014-02-04 | Csl Limited | Porphyromonas gingivalis polypeptides and nucleotides |
US8129500B2 (en) | 1997-12-10 | 2012-03-06 | Csl Limited | Porphyromonas gingivalis polypeptides and nucleotides |
US8282933B2 (en) * | 1999-12-24 | 2012-10-09 | Csl Limited | P. gingivalis antigenic composition |
US8784831B2 (en) | 1999-12-24 | 2014-07-22 | Csl Limited | P. gingivalis antigenic composition |
US7276355B2 (en) | 2000-12-26 | 2007-10-02 | Sekisui Chemical Co., Ltd. | Process for production of a recombinant protein and a fusion protein |
EP1354959A1 (en) * | 2000-12-26 | 2003-10-22 | Sekisui Chemical Co., Ltd. | Process for producing recombinant protein and fused protein |
AU2002217505B2 (en) * | 2000-12-26 | 2006-03-02 | Sekisui Chemical Co., Ltd. | Process for producing recombinant protein and fused protein |
KR100892889B1 (en) | 2000-12-26 | 2009-04-15 | 세키스이가가쿠 고교가부시키가이샤 | Process for producing recombinant protein and fused protein |
US7608424B2 (en) | 2000-12-26 | 2009-10-27 | Sekisui Chemical Co., Ltd. | Process for production of a recombinant protein and a fusion protein |
EP1354959A4 (en) * | 2000-12-26 | 2004-09-15 | Sekisui Chemical Co Ltd | Process for producing recombinant protein and fused protein |
US8426167B2 (en) | 2001-06-22 | 2013-04-23 | Roche Diagnostics Operations, Inc. | Methods for producing fusion polypeptides or enhancing expression of fusion polypeptides |
EP1451217A4 (en) * | 2001-11-20 | 2005-10-12 | Atgen Co Ltd | Novel peptides conferring environmental stress resistance and fusion proteins including said peptides |
EP1451217A1 (en) * | 2001-11-20 | 2004-09-01 | Atgen Co., Ltd. | Novel peptides conferring environmental stress resistance and fusion proteins including said peptides |
AU2003243969B2 (en) * | 2002-06-25 | 2008-03-13 | Sekisui Chemical Co., Ltd. | Expression vector, host, fused protein, process for producing fused protein and process for producing protein |
WO2004001041A1 (en) * | 2002-06-25 | 2003-12-31 | Sekisui Chemical Co., Ltd. | Expression vector, host, fused protein, process for producing fused protein and process for producing protein |
EP1621555A4 (en) * | 2003-04-18 | 2006-08-02 | Sekisui Chemical Co Ltd | Immunogen, composition for immunological use, and method of producing antibody using the same |
EP1621555A1 (en) * | 2003-04-18 | 2006-02-01 | Chiba, Joe | Immunogen, composition for immunological use, and method of producing antibody using the same |
US8349807B2 (en) | 2004-10-15 | 2013-01-08 | Joe Chiba | Method of immunizing animal, composition for immunization, method for producing antibody, method for producing hybridoma and method for producing monoclonal antibody |
US8901099B2 (en) | 2004-10-15 | 2014-12-02 | Sekisui Chemical Co., Ltd. | Method for immunizing animal, composition for immunization, method for producing antibody, method for producing hybridoma, and method for producing monoclonal antibody |
US8916166B2 (en) | 2006-06-27 | 2014-12-23 | Oral Health Australia Pty Ltd | Porphyromonas gingivalis polypeptides useful in the prevention of periodontal disease |
US8241611B2 (en) | 2007-07-12 | 2012-08-14 | Oral Health Austrailia Pty. Ltd. | Biofilm treatment |
US8895019B2 (en) | 2007-07-12 | 2014-11-25 | Oral Health Australia Pty Ltd | Biofilm treatment |
US8911745B2 (en) | 2007-07-12 | 2014-12-16 | Oral Health Australia Pty Ltd. | Immunology treatment for biofilms |
US8871213B2 (en) | 2008-08-29 | 2014-10-28 | Oral Health Australia Pty Ltd | Prevention, treatment and diagnosis of P. gingivalis infection |
US9518109B2 (en) | 2008-08-29 | 2016-12-13 | Oral Health Australia Pty Ltd | Prevention, treatment and diagnosis of P. gingivalis infection |
US10851138B2 (en) | 2008-08-29 | 2020-12-01 | Oral Health Australia Pty Ltd | Methods of preparing P. gingivalis antibodies |
US11572391B2 (en) | 2008-08-29 | 2023-02-07 | Oral Health Australia Pty Ltd | Antibodies for prevention, treatment and diagnosis of P. gingivalis infection |
Also Published As
Publication number | Publication date |
---|---|
AU5086800A (en) | 2000-12-28 |
CA2376062A1 (en) | 2000-12-14 |
JP2003501064A (en) | 2003-01-14 |
EP1181378A1 (en) | 2002-02-27 |
GB9913437D0 (en) | 1999-08-11 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
EP1181378A1 (en) | Fusion proteins comprising a fragment of a chaperon polypeptide | |
EP1392717B1 (en) | Rapidly cleavable sumo fusion protein expression system for difficult to express proteins | |
JP6889701B2 (en) | Alpha hemolysin variant | |
US12030913B2 (en) | Bacterial colicin-immunity protein protein purification system | |
US20030078373A1 (en) | Chaperone fragments | |
US20060211087A1 (en) | Compositions and methods for producing recombinant proteins | |
US8859237B2 (en) | Diguanylate cyclase method of producing the same and its use in the manufacture of cyclic-di-GMP and analogues thereof | |
Park et al. | Structural basis of SspB-tail recognition by the zinc binding domain of ClpX | |
KR20170115535A (en) | A protein having affinity for immunoglobulin, an affinity separator using the same, a column for liquid chromatography | |
Ribeiro et al. | Purification of Aminoacyl-tRNA by Affinity Chromatography on Immobilized Thermus thermophilus EF-Tu·GTP | |
JP4749548B2 (en) | Intein-mediated peptide linkage | |
King et al. | A conserved zinc binding domain in the largest subunit of DNA-dependent RNA polymerase modulates intrinsic transcription termination and antitermination but does not stabilize the elongation complex | |
US20030134352A1 (en) | Facilitating protein folding and solubility by use of peptide extensions | |
van den Ent et al. | Crystal structure of the ubiquitin-like protein YukD from Bacillus subtilis | |
Nagamori et al. | Two SecG molecules present in a single protein translocation machinery are functional even after crosslinking | |
US11970520B2 (en) | Alpha-synuclein substrates and methods for making and using the same | |
JP2010063373A (en) | Monomer type streptavidin mutant and method for producing the same | |
US7879578B2 (en) | Self-assembled proteins and related methods and protein structures | |
EP1362120A1 (en) | Method for purification of soluble ssao | |
AU740755B2 (en) | Fusion proteins comprising coiled-coil structures | |
WO2024214767A1 (en) | Polypeptide having peptide ligation activity and use of same | |
CN112391429B (en) | Enzyme-catalyzed C-terminal selective hydrazide modification method for protein | |
WO2024178215A2 (en) | Chimeric biosensor for the detection 2'3'-cyclic gmp-amp (cgamp) and methods of use thereof | |
JP2004024102A (en) | Expression vector, host, fusion protein, protein, method for producing fusion protein and method for producing protein | |
KR20230165919A (en) | Polypeptides interacting with peptide tags at loops or ends and uses thereof |
Legal Events
Code | Title | Description |
---|---|---|
AK | Designated states | Kind code of ref document: A1; Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW |
AL | Designated countries for regional patents | Kind code of ref document: A1; Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
121 | EP: the EPO has been informed by WIPO that EP was designated in this application | |
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (PCT application filed before 20040101) | |
ENP | Entry into the national phase | Ref document number: 2376062; Country of ref document: CA; Ref country code: CA; Ref document number: 2376062; Kind code of ref document: A; Format of ref document f/p: F |
WWE | WIPO information: entry into national phase | Ref document number: 50868/00; Country of ref document: AU |
ENP | Entry into the national phase | Ref country code: JP; Ref document number: 2001 501628; Kind code of ref document: A; Format of ref document f/p: F |
WWE | WIPO information: entry into national phase | Ref document number: 2000935315; Country of ref document: EP |
WWP | WIPO information: published in national office | Ref document number: 2000935315; Country of ref document: EP |
REG | Reference to national code | Ref country code: DE; Ref legal event code: 8642 |
WWW | WIPO information: withdrawn in national office | Ref document number: 2000935315; Country of ref document: EP |