WO2024138425A1 - Novel nanopore protein and use thereof - Google Patents

Novel nanopore protein and use thereof Download PDF

Info

Publication number
WO2024138425A1
WO2024138425A1 PCT/CN2022/142827 CN2022142827W WO2024138425A1 WO 2024138425 A1 WO2024138425 A1 WO 2024138425A1 CN 2022142827 W CN2022142827 W CN 2022142827W WO 2024138425 A1 WO2024138425 A1 WO 2024138425A1
Authority
WO
WIPO (PCT)
Prior art keywords
mutated
porin
protein
nanopore
sequencing
Prior art date
Application number
PCT/CN2022/142827
Other languages
French (fr)
Chinese (zh)
Inventor
董宇亮
罗婉婷
章佳文
郭斐
曾涛
王乐乐
季州翔
黎宇翔
章文蔚
徐讯
Original Assignee
深圳华大生命科学研究院
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳华大生命科学研究院 filed Critical 深圳华大生命科学研究院
Priority to PCT/CN2022/142827 priority Critical patent/WO2024138425A1/en
Publication of WO2024138425A1 publication Critical patent/WO2024138425A1/en

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/195Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
    • C07K14/24Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Enterobacteriaceae (F), e.g. Citrobacter, Serratia, Proteus, Providencia, Morganella, Yersinia
    • C07K14/265Enterobacter (G)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/70Vectors or expression systems specially adapted for E. coli
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/74Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N27/00Investigating or analysing materials by the use of electric, electrochemical, or magnetic means
    • G01N27/26Investigating or analysing materials by the use of electric, electrochemical, or magnetic means by investigating electrochemical variables; by using electrolysis or electrophoresis
    • G01N27/416Systems
    • G01N27/447Systems using electrophoresis
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/68Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids

Definitions

  • the invention belongs to the field of biotechnology and relates to a novel nanopore protein and application thereof.
  • This technology can be applied to DNA, RNA and protein sequencing at the same time. It records the electrical signals generated by the continuous blockage when the analytes pass through the nanopore protein one by one in real time, and converts them into sequence information through analysis to achieve sequencing.
  • This technology has high advantages in sequencing speed, throughput, portability and direct RNA sequencing, and has received widespread attention in recent years.
  • the present invention provides a porin, which is (a), (b) or (c):
  • the first protein shown in b) is a mutant of the protein shown in a), and compared with the amino acid sequence shown in SEQ ID NO.1, the mutant has mutations in the amino acid residues at one or more of the following sites: 66, 67, 71 and 72.
  • the N at position 66 mutates to A, G or S (located at sensor);
  • the D at position 67 is mutated to A, G, S, T, N or Q (located at sensor);
  • the E at position 71 mutates to A, G, S, T, N or Q (located at sensor);
  • the R at position 102 was mutated to A, G, S, T, N, or Q (entry);
  • the K at position 113 was mutated to A, G, S, T, N, or Q (entry point);
  • the R at position 119 was mutated to A, G, S, T, N, or Q (entry);
  • the R at position 162 mutated to A, G, S, T, N, or Q (barrel lining);
  • the D at position 175 mutates to A, G, S, T, N, or Q (barrel lining);
  • the D at position 217 is mutated to A, G, S, T, N, or Q (exit loop);
  • the K at position 218 mutates to D, E, A, G, S, T, N, or Q (exit loop);
  • porins mentioned above exist in the form of polymers
  • the pores formed by the porins mentioned above have a diameter of 0.5-3 nm;
  • the pore size formed by the porin is 1-2 nm.
  • the present invention provides a recombinant bacterium comprising the nucleic acid molecule described in the second aspect.
  • the present invention provides the use of the pore protein described in the first aspect, the nucleic acid molecule described in the second aspect, the expression cassette, recombinant vector or transgenic cell line described in the third aspect, the recombinant bacteria described in the fourth aspect, or the nanobiosensor described in the seventh aspect in nucleic acid nanopore sequencing.
  • the present invention provides a method for preparing and producing the porin described in the first aspect, comprising the steps of: culturing the recombinant bacteria described in the fourth aspect, and then inducing the bacteria to obtain the porin described in the first aspect.
  • the present invention provides a kit comprising the porin described in the first aspect.
  • the kit described above further comprises at least one of the following: a linker containing a helicase recognition site, a helicase, and single-stranded DNA with cholesterol.
  • the present invention provides a method for nanopore sequencing of nucleic acids, comprising the following steps:
  • the nanopore biosensor is prepared according to the following method: inserting the porin described in the first aspect into a lipid bilayer to form a nanopore channel, thereby constructing a nanopore biosensor;
  • the sequencing library containing the helicase, the single-stranded DNA containing cholesterol and the sequencing buffer are mixed and added to the nanopore biosensor, and a voltage is applied to capture the sequencing library.
  • the sequencing library containing the helicase is prepared according to the following method: the nucleic acid to be tested is connected to a connector to prepare a sequencing library, and then the sequencing library is combined with a helicase to obtain a sequencing library containing the helicase; wherein the connector is obtained by annealing two single-stranded DNA molecules, and the single-stranded DNA molecule contains a helicase binding site.
  • FIG5 is a schematic diagram of the key amino acids in the transmembrane region of the predicted structure of BCP22.
  • the plasmid expressing wild-type BCP22 protein is a vector obtained by inserting the coding gene of wild-type BCP22 protein (SEQ ID No.7) between the NdeI and XhoI sites of the pET24a vector.
  • the vector expresses a fusion protein of wild-type BCP22 fused with a StrepII tag at the C-terminus.
  • the StrepII tag is used for protein purification.
  • the plasmid expressing the wild-type BCP22 protein constructed above was transformed into the E. coli expression strain E. coli BL21 (DE3) to obtain the transformed bacterial solution.
  • the transformed bacterial solution was then evenly spread on a plate containing 50 ⁇ g/mL kanamycin and cultured at 37°C overnight. The next day, a single colony was picked and cultured in 5 ml LB medium containing 50 ⁇ g/mL kanamycin at 37°C, 200 rpm, overnight.
  • the obtained bacterial solution was inoculated into 50 ml LB containing 50 ⁇ g/mL kanamycin at a ratio of 1:100, and cultured at 37°C, 200 rpm for 4 hours.
  • the target protein solution obtained after purification was subjected to SDS-PAGE electrophoresis.
  • Lanes 2 and 3 are the gel bands of the target protein solution before and after boiling at 95°C, respectively.
  • the results show that the protein band is near the spotting hole when the target protein is not boiled and has a larger molecular weight. After boiling, the size is about 30KD, which is the same as the expected target protein. This shows that the target protein is in a polymer state before boiling and in a monomer state after boiling.
  • Figure 10 shows the nanopore biosensor current of wild-type BCP22 when voltages of 0.02V, 0.04V, 0.10V, 0.14V and 0.18V are applied (the figures from left to back are voltages of 0.02V, 0.04V, 0.10V, 0.14V and 0.18V, respectively), indicating that the BCP22 protein can be inserted into the membrane, and then the pore in the middle can allow ions to pass through, generating an open current, and the noise of the open current is relatively small.
  • Sequencing buffer 0.47M KCl, 25mM HEPES, 1mM EDTA, 5mM ATP, 25mM MgCl2, balance water, pH 7.6.

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Genetics & Genomics (AREA)
  • Organic Chemistry (AREA)
  • Molecular Biology (AREA)
  • Biochemistry (AREA)
  • Zoology (AREA)
  • Biotechnology (AREA)
  • Wood Science & Technology (AREA)
  • Biomedical Technology (AREA)
  • General Health & Medical Sciences (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Immunology (AREA)
  • Biophysics (AREA)
  • Microbiology (AREA)
  • Analytical Chemistry (AREA)
  • Medicinal Chemistry (AREA)
  • Pathology (AREA)
  • Urology & Nephrology (AREA)
  • General Physics & Mathematics (AREA)
  • Hematology (AREA)
  • Plant Pathology (AREA)
  • Electrochemistry (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Food Science & Technology (AREA)
  • Cell Biology (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Peptides Or Proteins (AREA)

Abstract

Provided is a nanopore protein BCP22, which is (a), (b) or (c): a) a protein having an amino acid sequence as shown in SEQ ID NO: 1; b) a protein having more than 70% identity to SEQ ID NO: 1 and having the function of SEQ ID NO: 1; and c) a fusion protein obtained by adding a tag to the end of the protein sequence as shown in a) or b), and can be applied to nanopore sequencing for detecting small molecules, DNA, RNA, polypeptides and the like.

Description

一种新型纳米孔蛋白及其应用A novel nanoporous protein and its application 技术领域Technical Field
本发明属于生物技术领域,涉及一种新型纳米孔蛋白及其应用。The invention belongs to the field of biotechnology and relates to a novel nanopore protein and application thereof.
背景技术Background technique
纳米孔测序是近年来新兴起的第三代测序技术,由于其长读长、高通量、低成本和便携性等优势,给基因测序行业带来了颠覆性的改变。纳米孔测序技术在生命科学基础理论研究以及生物医学临床实践中具有广泛的应用。Nanopore sequencing is a third-generation sequencing technology that has emerged in recent years. It has brought disruptive changes to the gene sequencing industry due to its advantages such as long read length, high throughput, low cost and portability. Nanopore sequencing technology has a wide range of applications in basic theoretical research in life sciences and clinical practice in biomedicine.
该技术可同时应用于DNA、RNA及蛋白质测序,通过实时记录待测物逐一通过纳米孔蛋白时产生的连续阻滞产生的电信号,经解析转换为序列信息从而实现测序。该技术在测序速度、通量、便携性、直接RNA测序上均有较高优势,近年来获得了广泛关注。This technology can be applied to DNA, RNA and protein sequencing at the same time. It records the electrical signals generated by the continuous blockage when the analytes pass through the nanopore protein one by one in real time, and converts them into sequence information through analysis to achieve sequencing. This technology has high advantages in sequencing speed, throughput, portability and direct RNA sequencing, and has received widespread attention in recent years.
纳米孔测序要求孔道蛋白内部传感区域足够锐利,以在横向与纵向上均有高的空间分辨能力。目前研究的孔道蛋白中,潜在可用于测序的主要有三类:细菌或其它机体产生的破坏细胞膜通透性的成孔毒素(pore-forming toxin)蛋白,作为细胞内外各种生物大分子与小分子物质运输通道的转运(transporter)蛋白,为病毒侵染宿主时提供基因组输运通道的病毒连接体(viral connector)。Nanopore sequencing requires that the sensing region inside the pore protein is sharp enough to have high spatial resolution in both the horizontal and vertical directions. Among the pore proteins currently being studied, there are three main types that can be potentially used for sequencing: pore-forming toxin proteins produced by bacteria or other organisms that destroy the permeability of cell membranes, transporter proteins that serve as transport channels for various biological macromolecules and small molecules inside and outside cells, and viral connectors that provide genome transport channels when viruses infect hosts.
天然的纳米孔蛋白一般具有成孔道的能力,但是在重组蛋白的体外表达纯化系统中,重组纳米孔蛋白的孔道稳定性不一定可以满足单分子检测器相关仪器产品的需求。同时,天然纳米孔蛋白的孔径分布范围较广,不一定满足单分子检测的需求,导致测序准确度不够高。天然纳米孔蛋白孔道内壁氨基酸残基的性质,尤其是带电性质,不一定满足特定待测物的性质。Natural nanopore proteins generally have the ability to form pores, but in the in vitro expression and purification system of recombinant proteins, the pore stability of recombinant nanopore proteins may not meet the needs of single-molecule detector-related instrument products. At the same time, the pore size distribution range of natural nanopore proteins is relatively wide, which may not meet the needs of single-molecule detection, resulting in insufficient sequencing accuracy. The properties of the amino acid residues on the inner wall of the natural nanopore protein pore, especially the charged properties, may not meet the properties of the specific analyte.
通过基因发掘的方法找到更多优异的测序纳米孔蛋白,仍是一个尚待解决的问题。Finding more excellent sequencing nanopore proteins through gene mining methods remains an unresolved problem.
发明公开Invention Disclosure
本发明的目的是提供孔蛋白。An object of the present invention is to provide porins.
第一个方面,本发明提供了一种孔蛋白,是如下(a)、(b)或(c):In a first aspect, the present invention provides a porin, which is (a), (b) or (c):
a)具有如SEQ ID NO:1所示氨基酸序列的蛋白质;a) a protein having the amino acid sequence shown in SEQ ID NO: 1;
b)与SEQ ID NO:1具有70%以上同一性且具有SEQ ID NO:1功能的蛋白质;b) a protein that has more than 70% identity to SEQ ID NO:1 and has the function of SEQ ID NO:1;
c)在a)或b)所示蛋白序列的末端添加标签得到的融合蛋白。c) A fusion protein obtained by adding a tag to the end of the protein sequence shown in a) or b).
上文的孔蛋白中,第一种b)所示的蛋白为a)所示蛋白的突变体,其与SEQ ID NO.1所示的氨基酸序列相比,所述突变体在如下一个或多个位点的氨基酸残基发生突变:66、67、71和72。Among the pore proteins above, the first protein shown in b) is a mutant of the protein shown in a), and compared with the amino acid sequence shown in SEQ ID NO.1, the mutant has mutations in the amino acid residues at one or more of the following sites: 66, 67, 71 and 72.
上文的孔蛋白中,所述各个位点的突变方式如下:In the porin above, the mutations at each site are as follows:
第66位的N突变为A,G或S(位于sensor处);The N at position 66 mutates to A, G or S (located at sensor);
第67位的D突变为A,G,S,T,N或Q(位于sensor处);The D at position 67 is mutated to A, G, S, T, N or Q (located at sensor);
第71位的E突变为A,G,S,T,N或Q(位于sensor处);The E at position 71 mutates to A, G, S, T, N or Q (located at sensor);
第72位的Y突变为A,G,S,T,N或Q(位于sensor处)。The Y at position 72 is mutated to A, G, S, T, N or Q (located at sensor).
上文的孔蛋白中,第二种b)所示的蛋白为a)所示蛋白的突变体,其与SEQ ID NO.1所示的氨基酸序列相比,所述突变体在如下一个或多个位点的氨基酸残基发生突变:102,111,112,113,116,119,120,123,129,162,170,173,175,204,212,217,218,221和224。Among the pore proteins mentioned above, the second protein shown in b) is a mutant of the protein shown in a), and compared with the amino acid sequence shown in SEQ ID NO.1, the mutant has mutations in the amino acid residues at one or more of the following sites: 102, 111, 112, 113, 116, 119, 120, 123, 129, 162, 170, 173, 175, 204, 212, 217, 218, 221 and 224.
或,上文的孔蛋白中,所述孔蛋白为与上述第一种b)所示的蛋白(即与SEQ ID NO.1所示的氨基酸序列相比,所述突变体在如下一个或多个位点的氨基酸残基发生突变:66、67、71和72)相比,进一步在如下一个或多个位点的氨基酸残基发生突变:102,111,112,113,116,119,120,123,129,162,170,173,175,204,212,217,218,221和224。Or, in the porin above, the porin is compared with the protein shown in the first b) above (i.e., compared with the amino acid sequence shown in SEQ ID NO.1, the mutant has mutations in the amino acid residues at one or more of the following sites: 66, 67, 71 and 72), and further has mutations in the amino acid residues at one or more of the following sites: 102, 111, 112, 113, 116, 119, 120, 123, 129, 162, 170, 173, 175, 204, 212, 217, 218, 221 and 224.
上文的孔蛋白中,所述各个位点的突变的方式如下:In the above porin, the mutations at each site are as follows:
第102位的R突变为A,G,S,T,N或Q(入口处);The R at position 102 was mutated to A, G, S, T, N, or Q (entry);
第111位的E突变为R,K,A,G,S,T,N或Q(入口处);E at position 111 mutated to R, K, A, G, S, T, N, or Q (entry);
第112位的R突变为A,G,S,T,N或Q(入口处);The R at position 112 was mutated to A, G, S, T, N, or Q (entry);
第113位的K突变为A,G,S,T,N或Q(入口处);The K at position 113 was mutated to A, G, S, T, N, or Q (entry point);
第116位的R突变为A,G,S,T,N或Q(入口处);The R at position 116 was mutated to A, G, S, T, N, or Q (entry);
第119位的R突变为A,G,S,T,N或Q(入口处);The R at position 119 was mutated to A, G, S, T, N, or Q (entry);
第120位D的突变为A,G,S,T,N或Q(入口处);The mutation of D at position 120 is A, G, S, T, N, or Q (entry);
第123位的K突变为A,G,S,T,N或Q(入口处);The K at position 123 was mutated to A, G, S, T, N, or Q (entry point);
第129位的K突变为A,G,S,T,N或Q(入口处);The K at position 129 was mutated to A, G, S, T, N, or Q (entry);
第162位的R突变为A,G,S,T,N或Q(桶内壁);The R at position 162 mutated to A, G, S, T, N, or Q (barrel lining);
第170位的S突变为A,G,V,L,I,Y,F或W(跨膜区);The S at position 170 was mutated to A, G, V, L, I, Y, F, or W (transmembrane region);
第173位的R突变为A,G,S,T,N或Q(桶内壁);The R at position 173 mutated to A, G, S, T, N, or Q (barrel lining);
第175位的D突变为A,G,S,T,N或Q(桶内壁);The D at position 175 mutates to A, G, S, T, N, or Q (barrel lining);
第204位的S突变为A,G,V,L,I,Y,F或W(跨膜区);The S at position 204 was mutated to A, G, V, L, I, Y, F, or W (transmembrane region);
第212位的K突变为D,E,A,G,S,T,N或Q(出口loop);The K at position 212 mutates to D, E, A, G, S, T, N, or Q (exit loop);
第217位的D突变为A,G,S,T,N或Q(出口loop);The D at position 217 is mutated to A, G, S, T, N, or Q (exit loop);
第218位的K突变为D,E,A,G,S,T,N或Q(出口loop);The K at position 218 mutates to D, E, A, G, S, T, N, or Q (exit loop);
第221位的E突变为A,G,S,T,N或Q(桶内壁);E at position 221 mutated to A, G, S, T, N, or Q (barrel lining);
第224位的T突变为A,G,V,L,I,Y,F或W(跨膜区)。The T at position 224 was mutated to A, G, V, L, I, Y, F or W (transmembrane region).
上文中的孔蛋白以多聚体的形式存在;The porins mentioned above exist in the form of polymers;
进一步地,所述孔蛋白以9聚体的形式存在;Further, the porin exists in the form of 9-mers;
上文中的孔蛋白形成的孔径为0.5-3nm;The pores formed by the porins mentioned above have a diameter of 0.5-3 nm;
进一步地,所述孔蛋白形成的孔径为1-2nm。Furthermore, the pore size formed by the porin is 1-2 nm.
第二个方面,本发明提供了编码第一个方面的所述孔蛋白的核酸分子。In a second aspect, the present invention provides a nucleic acid molecule encoding the porin of the first aspect.
第三个方面,本发明提供了含有第二个方面的所述核酸分子的表达盒、重组载体或转基因细胞系。In a third aspect, the present invention provides an expression cassette, a recombinant vector or a transgenic cell line comprising the nucleic acid molecule of the second aspect.
第四个方面,本发明提供了一种重组菌,其含有第二个方面所述核酸分子。In a fourth aspect, the present invention provides a recombinant bacterium comprising the nucleic acid molecule described in the second aspect.
第五个方面,本发明提供了第二个方面所述核酸分子或第三个方面所述的表达盒、重组载体或转基因细胞系或第四个方面所述重组菌在制备孔蛋白中的应用。In a fifth aspect, the present invention provides use of the nucleic acid molecule described in the second aspect, the expression cassette, recombinant vector or transgenic cell line described in the third aspect, or the recombinant bacteria described in the fourth aspect in preparing porins.
第六个方面,本发明提供了第一个方面所述孔蛋白在制备纳米生物传感器中的应用。In a sixth aspect, the present invention provides the use of the porin described in the first aspect in the preparation of a nanobiosensor.
第七个方面,本发明提供了一种纳米生物传感器,包括第一个方面的所述孔蛋白和脂双分子层。In a seventh aspect, the present invention provides a nanobiosensor comprising the porin and lipid bilayer of the first aspect.
第八个方面,本发明提供了第一个方面所述孔蛋白、第二个方面所述核酸分子或第三个方面所述的表达盒、重组载体或转基因细胞系或第四个方面所述重组菌或第七个方面所述的纳米生物传感器在核酸纳米孔测序 中的应用。In an eighth aspect, the present invention provides the use of the pore protein described in the first aspect, the nucleic acid molecule described in the second aspect, the expression cassette, recombinant vector or transgenic cell line described in the third aspect, the recombinant bacteria described in the fourth aspect, or the nanobiosensor described in the seventh aspect in nucleic acid nanopore sequencing.
第九个方面,本发明提供了一种制备生产第一个方面所述孔蛋白的方法,包括如下步骤:培养第四个方面所述重组菌,再进行诱导处理,得到第一个方面所述孔蛋白。In a ninth aspect, the present invention provides a method for preparing and producing the porin described in the first aspect, comprising the steps of: culturing the recombinant bacteria described in the fourth aspect, and then inducing the bacteria to obtain the porin described in the first aspect.
第十个方面,本发明提供了一种试剂盒,包括第一个方面所述孔蛋白。In a tenth aspect, the present invention provides a kit comprising the porin described in the first aspect.
上文所述试剂盒还包括如下至少一种:含有解旋酶识别位点的接头、解旋酶和带有胆固醇的单链DNA。The kit described above further comprises at least one of the following: a linker containing a helicase recognition site, a helicase, and single-stranded DNA with cholesterol.
第十一个方面,本发明提供了一种纳米孔测序核酸的方法,包括如下步骤:In an eleventh aspect, the present invention provides a method for nanopore sequencing of nucleic acids, comprising the following steps:
1)制备含有解旋酶的测序文库和纳米孔生物传感器;1) Preparation of sequencing library and nanopore biosensor containing helicase;
所述纳米孔生物传感器按照如下方法制备:将第一个方面所述孔蛋白插入脂双分子层中形成纳米孔通道,构建出纳米孔生物传感器;The nanopore biosensor is prepared according to the following method: inserting the porin described in the first aspect into a lipid bilayer to form a nanopore channel, thereby constructing a nanopore biosensor;
2)将所述含有解旋酶的测序文库、带有胆固醇的单链DNA和测序缓冲液混合后加入所述纳米孔生物传感器中,施加电压,实现捕获测序文库。2) The sequencing library containing the helicase, the single-stranded DNA containing cholesterol and the sequencing buffer are mixed and added to the nanopore biosensor, and a voltage is applied to capture the sequencing library.
上文方法中,所述含有解旋酶的测序文库按照如下方法制备:将待测核酸连接接头,制备测序文库,再将所述测序文库与解旋酶结合,得到含有解旋酶的测序文库;其中,所述接头由两条单链DNA分子退火获得,且所述单链DNA分子中含有解旋酶结合位点。In the above method, the sequencing library containing the helicase is prepared according to the following method: the nucleic acid to be tested is connected to a connector to prepare a sequencing library, and then the sequencing library is combined with a helicase to obtain a sequencing library containing the helicase; wherein the connector is obtained by annealing two single-stranded DNA molecules, and the single-stranded DNA molecule contains a helicase binding site.
在本发明的实施例中,上述接头或含有解旋酶识别位点的接头具体由SEQ ID No.2所示的单链DNA分子和SEQ ID No.3所示的单链DNA分子退火后形成;In an embodiment of the present invention, the above-mentioned linker or the linker containing the helicase recognition site is specifically formed by annealing the single-stranded DNA molecule shown in SEQ ID No. 2 and the single-stranded DNA molecule shown in SEQ ID No. 3;
在本发明的实施例中,上述解旋酶具体为SEQ ID No.5所示的BCH105;In an embodiment of the present invention, the above-mentioned helicase is specifically BCH105 shown in SEQ ID No.5;
在本发明的实施例中,上述带有胆固醇的单链DNA的核苷酸序列具体为SEQ ID NO:4,且胆固醇连接在DNA的5'端;In an embodiment of the present invention, the nucleotide sequence of the single-stranded DNA with cholesterol is specifically SEQ ID NO: 4, and the cholesterol is connected to the 5' end of the DNA;
在本发明的实施例中,上述测序缓冲液配方具体为:0.47M KCl、25mM HEPES、1mM EDTA、5mM ATP、25mM MgCl2、余量为水,pH7.6。In an embodiment of the present invention, the above-mentioned sequencing buffer formula is specifically: 0.47M KCl, 25mM HEPES, 1mM EDTA, 5mM ATP, 25mM MgCl2, the balance is water, pH 7.6.
上文方法中,所述纳米孔通道的孔径为0.5-3nm。In the above method, the pore size of the nanopore channel is 0.5-3 nm.
进一步地,所述孔蛋白形成的孔径为1-2nm。Furthermore, the pore size formed by the porin is 1-2 nm.
下面结合实施例对本发明作进一步地说明。The present invention will be further described below in conjunction with embodiments.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
图1为Alphafold2 multimer预测得到的BCP22的三维结构(sideview)。Figure 1 is the three-dimensional structure of BCP22 predicted by Alphafold2 multimer (side view).
图2为Alphafold2 multimer预测得到的BCP22的三维结构(top view)。Figure 2 is the three-dimensional structure of BCP22 predicted by Alphafold2 multimer (top view).
图3为BCP22 sensor区的重点氨基酸的alphafold2 multimer预测结构示意图。Figure 3 is a schematic diagram of the alphafold2 multimer predicted structure of key amino acids in the BCP22 sensor region.
图4为BCP22预测结构中sensor区的重点氨基酸示意图。FIG4 is a schematic diagram of the key amino acids in the sensor region of the predicted structure of BCP22.
图5为BCP22预测结构中跨膜区的重点氨基酸示意图。FIG5 is a schematic diagram of the key amino acids in the transmembrane region of the predicted structure of BCP22.
图6为BCP22预测结构中入口区的重点氨基酸示意图Figure 6 is a schematic diagram of the key amino acids in the entry region of the predicted structure of BCP22
图7为BCP22预测结构中出口区和桶内壁的重点氨基酸示意图。Figure 7 is a schematic diagram of the key amino acids in the exit region and the inner wall of the barrel in the predicted structure of BCP22.
图8为BCP22的纯化得到的蛋白。FIG. 8 shows the purified protein of BCP22.
图9为测序文库结构示意图(a:top strand;b:bottom strand;c:双链目的片段;d:解旋酶BCH105;e:chol-ssDNA);图9A:测序文库示意图;图9B:chol-ssDNA与测序文库的结合模式示意图。Figure 9 is a schematic diagram of the sequencing library structure (a: top strand; b: bottom strand; c: double-stranded target fragment; d: helicase BCH105; e: chol-ssDNA); Figure 9A: Schematic diagram of the sequencing library; Figure 9B: Schematic diagram of the binding mode of chol-ssDNA and sequencing library.
图10为BCP22在磷脂双分子层中的不同电压下的开孔电流。FIG. 10 shows the pore opening current of BCP22 at different voltages in the phospholipid bilayer.
图11为待测DNA穿过纳米孔BCP22的电流trace。FIG. 11 is a current trace of the DNA to be tested passing through the nanopore BCP22.
实施发明的最佳方式Best Mode for Carrying Out the Invention
下述实施例中所使用的实验方法如无特殊说明,均为常规方法。Unless otherwise specified, the experimental methods used in the following examples are conventional methods.
下述实施例中所用的材料、试剂等,如无特殊说明,均可从商业途径得到。Unless otherwise specified, the materials and reagents used in the following examples can be obtained from commercial sources.
实施例1、野生型BCP22蛋白的发现及Alphafold2 multimer的预测结构Example 1. Discovery of wild-type BCP22 protein and predicted structure of Alphafold2 multimer
发明人从深海宏基因组中挖掘得到一种BCP22蛋白,其氨基酸序列为SEQ ID No.1,该蛋白编码的基因的核苷酸序列为SEQ ID No.7。The inventors mined a BCP22 protein from the deep-sea metagenome, whose amino acid sequence is SEQ ID No.1, and the nucleotide sequence of the gene encoding the protein is SEQ ID No.7.
利用alphafold2 multimer对BCP22(SEQ ID No.1)进行九聚体的结构预测。The alphafold2 multimer was used to predict the nonamer structure of BCP22 (SEQ ID No.1).
预测结果如图1和图2所示,图1为BCP22预测结构的sideview,图2为BCP22预测结构的top view。从图1和图2可以看出,BCP22蛋白为九聚体形式,且孔径大小为1-2nm。The prediction results are shown in Figures 1 and 2. Figure 1 is the side view of the predicted structure of BCP22, and Figure 2 is the top view of the predicted structure of BCP22. It can be seen from Figures 1 and 2 that the BCP22 protein is in the form of a nonameric structure with a pore size of 1-2 nm.
基于常识,sensor区的氨基酸的组成对于电流信号的产生起到决定性作用。图3和图4为BCP22预测结构的sensor区的重要氨基酸的侧链结构, 显示出氨基酸侧链的四个氨基酸分别为N66,D67,E71,Y72。中间数字未两个蛋白单体的这三个氨基酸侧链之间的距离。单位为埃。Based on common sense, the composition of the amino acids in the sensor region plays a decisive role in the generation of current signals. Figures 3 and 4 show the side chain structures of the important amino acids in the sensor region of the predicted structure of BCP22, showing that the four amino acids in the amino acid side chains are N66, D67, E71, and Y72. The middle number is the distance between the three amino acid side chains of the two protein monomers. The unit is angstrom.
基于常识,孔蛋白跨膜区朝向膜方向的氨基酸更加偏好于疏水氨基酸,而带电荷氨基酸和极性氨基酸可能会影响插孔效率。图5展示了BCP22跨膜区存在的三个朝向膜方向的带电荷或极性氨基酸,它们分别为S170,S204,T224。Based on common sense, the amino acids in the transmembrane region of porins that face the membrane prefer hydrophobic amino acids, while charged amino acids and polar amino acids may affect the efficiency of pore insertion. Figure 5 shows three charged or polar amino acids facing the membrane in the transmembrane region of BCP22, namely S170, S204, and T224.
基于常识,由于核酸带负电,所以如果孔蛋白入口处的氨基酸为正电的话可以增加纳米孔测序的文库捕获率,反之,则可以降低文库捕获率。同时孔蛋白结合强弱对测序速度和测序电流信号也会有一定影响。图6展示了BCP22单体入口区的一些重要的氨基酸,它们分别为;R102,E111,R112,K113,R116,R119,D120,K123,K129。Based on common sense, since nucleic acids are negatively charged, if the amino acids at the entrance of the porin are positively charged, the library capture rate of nanopore sequencing can be increased, otherwise, the library capture rate can be reduced. At the same time, the strength of porin binding will also have a certain impact on the sequencing speed and sequencing current signal. Figure 6 shows some important amino acids in the entrance region of the BCP22 monomer, which are; R102, E111, R112, K113, R116, R119, D120, K123, K129.
同时,桶内壁和出口的一些氨基酸对待测物顺利穿孔和离开纳米孔具有影响,图7展示了BCP22单体的桶内壁和出口的一些可能影响待测物顺利穿孔的氨基酸,尤其是带电荷的氨基酸,他们分别是R162,R173,D175,K212,D217,K218,E221。At the same time, some amino acids on the inner wall and outlet of the barrel have an impact on the smooth penetration and exit of the analyte from the nanopore. Figure 7 shows some amino acids on the inner wall and outlet of the barrel of the BCP22 monomer that may affect the smooth penetration of the analyte, especially the charged amino acids, which are R162, R173, D175, K212, D217, K218, and E221.
因此,野生型BCP22蛋白可以在上述关键氨基酸残基进行突变得到突变体,预测其具有野生型BCP22蛋白的功能。Therefore, the wild-type BCP22 protein can be mutated at the above-mentioned key amino acid residues to obtain a mutant, which is predicted to have the function of the wild-type BCP22 protein.
上述突变体可以为在SEQ ID No.1所示蛋白基础上进行如下一个或多个位点的氨基酸残基发生突变:66,67,71,72,102,111,112,113,116,119,120,123,129,162,170,173,175,204,212,217,218,221和224;The mutants may be mutated at one or more of the following amino acid residues based on the protein shown in SEQ ID No. 1: 66, 67, 71, 72, 102, 111, 112, 113, 116, 119, 120, 123, 129, 162, 170, 173, 175, 204, 212, 217, 218, 221 and 224;
突变体也可以为在SEQ ID No.1所示蛋白基础上进行如下一个或多个位点的氨基酸残基发生突变得到的突变蛋白:66,67,71,72,再进一步在该突变蛋白的基础上进行如下一个或多个位点的氨基酸残基发生突变:102,111,112,113,116,119,120,123,129,162,170,173,175,204,212,217,218,221和224。The mutant can also be a mutant protein obtained by mutating the amino acid residues at one or more of the following sites based on the protein shown in SEQ ID No.1: 66, 67, 71, 72, and further mutating the amino acid residues at one or more of the following sites based on the mutant protein: 102, 111, 112, 113, 116, 119, 120, 123, 129, 162, 170, 173, 175, 204, 212, 217, 218, 221 and 224.
实施例2、野生型BCP22蛋白的获得Example 2: Obtaining wild-type BCP22 protein
一、表达野生型BCP22蛋白的质粒1. Plasmid expressing wild-type BCP22 protein
表达野生型BCP22蛋白的质粒为将野生型BCP22蛋白的编码基因(SEQ ID No.7)插入pET24a载体的NdeI和XhoI位点间得到的载体,该载体表达C端融合StrepII标签的野生型BCP22的融合蛋白,StrepII标签用于 蛋白纯化。The plasmid expressing wild-type BCP22 protein is a vector obtained by inserting the coding gene of wild-type BCP22 protein (SEQ ID No.7) between the NdeI and XhoI sites of the pET24a vector. The vector expresses a fusion protein of wild-type BCP22 fused with a StrepII tag at the C-terminus. The StrepII tag is used for protein purification.
通过定点突变的方法,采用Agilent定点突变试剂盒,以表达野生型BCP22蛋白的质粒为模板,构建相应的突变体.The corresponding mutants were constructed by site-directed mutagenesis using the Agilent site-directed mutagenesis kit and the plasmid expressing the wild-type BCP22 protein as a template.
二、孔蛋白BCP22菌株的培养和诱导2. Cultivation and induction of porin BCP22 strain
将上述一构建好的表达野生型BCP22蛋白的质粒转化到大肠杆菌表达菌株E.coli BL21(DE3)中,得到转化后菌液。再将转化后菌液均匀涂抹在含50μg/mL卡那霉素的平板上,37℃过夜培养。次日挑取单菌落于含50μg/mL卡那霉素的5ml LB培养基中,37℃,200rpm,过夜培养。将上述所得菌液,按1:100接种于含有50μg/mL卡那霉素的50ml LB中,37℃,200rpm,培养4h。将扩大培养的菌液,按1:100接种于含有50μg/mL卡那霉素的2L LB中,37℃,200rpm培养,待OD600值达0.6-0.8左右,加入终浓度为0.5mM的IPTG,16℃,200rpm,培养约18h。将菌液于8000rpm离心10分钟收集菌体,冻存于-20℃待用,得到孔蛋白BCP22菌株。The plasmid expressing the wild-type BCP22 protein constructed above was transformed into the E. coli expression strain E. coli BL21 (DE3) to obtain the transformed bacterial solution. The transformed bacterial solution was then evenly spread on a plate containing 50 μg/mL kanamycin and cultured at 37°C overnight. The next day, a single colony was picked and cultured in 5 ml LB medium containing 50 μg/mL kanamycin at 37°C, 200 rpm, overnight. The obtained bacterial solution was inoculated into 50 ml LB containing 50 μg/mL kanamycin at a ratio of 1:100, and cultured at 37°C, 200 rpm for 4 hours. The expanded cultured bacterial solution was inoculated into 2L LB containing 50 μg/mL kanamycin at a ratio of 1:100, and cultured at 37°C, 200 rpm. When the OD600 value reached about 0.6-0.8, IPTG was added at a final concentration of 0.5 mM, and cultured at 16°C, 200 rpm for about 18 hours. The bacterial liquid was centrifuged at 8000 rpm for 10 minutes to collect the bacterial cells, and frozen at -20°C until use, thereby obtaining the porin BCP22 strain.
三、重组型孔蛋白BCP22提取及纯化3. Extraction and purification of recombinant porin BCP22
1、纯化Buffer配制1. Preparation of purification buffer
Buffer A:20mM Tris-HCl,250mM NaCl,1%DDM,余量为水,pH 8.0。Buffer A: 20 mM Tris-HCl, 250 mM NaCl, 1% DDM, the remainder is water, pH 8.0.
Buffer B:20mM Tris-HCl,250mM NaCl,0.05%DDM,余量为水,pH 8.0。Buffer B: 20 mM Tris-HCl, 250 mM NaCl, 0.05% DDM, balance water, pH 8.0.
Buffer C:20mM Tris-HCl,250mM NaCl,0.05%DDM,5mM脱硫生物素,余量为水,pH 8.0。Buffer C: 20 mM Tris-HCl, 250 mM NaCl, 0.05% DDM, 5 mM desthiobiotin, balance water, pH 8.0.
2、纯化2. Purification
按1g孔蛋白BCP22菌株菌体加10ml Buffer A的比例充分重悬上述收集的各个菌体,超声破碎细胞至菌体溶液澄清。旋转仪上4℃旋转过夜。次日18000rpm 4℃离心1h,取上清,0.22μm滤膜过滤后于4℃待用。Resuspend the collected cells in the ratio of 1g porin BCP22 strain cells to 10ml Buffer A, and ultrasonically disrupt the cells until the cell solution is clear. Rotate overnight at 4℃ on a rotator. Centrifuge at 18000rpm at 4℃ for 1h the next day, take the supernatant, filter with a 0.22μm filter membrane, and store at 4℃ for later use.
用AKTA pure将Strep-Tactin beads(IBA Lifesciences)层析柱用Buffer A平衡5CV后,2ml/min上样。上样完成后,Buffer B冲洗20CV,使用Buffer C洗脱,收集洗脱液(含有目的蛋白)。After equilibration of the Strep-Tactin beads (IBA Lifesciences) column with Buffer A for 5 CV using AKTA pure, the sample was loaded at 2 ml/min. After loading, it was rinsed with Buffer B for 20 CV, eluted with Buffer C, and the eluate (containing the target protein) was collected.
将上述Strep柱亲和层析后的洗脱液(含有目的蛋白)浓缩至1ml,过经buffer B平衡的Superdex 6 increase 10/300GL(Cytiva)柱子,收集目的蛋白,随后储存于-80℃,得到纯合后的目的蛋白溶液。The eluate (containing the target protein) after the above-mentioned Strep column affinity chromatography was concentrated to 1 ml, passed through a Superdex 6 increase 10/300GL (Cytiva) column equilibrated with buffer B, the target protein was collected, and then stored at -80°C to obtain a homozygous target protein solution.
将纯化后获得的目的蛋白溶液进行SDS-PAGE电泳。The target protein solution obtained after purification was subjected to SDS-PAGE electrophoresis.
结果如图8所示,2、3泳道分别是煮前和95℃煮后目的蛋白溶液上样液的跑胶条带,结果显示目标蛋白未煮的情况下蛋白条带在点样孔附近,具有较大的分子量,煮后为大小为30KD左右,与预期目的蛋白相同,这就表明,目标蛋白未煮的情况下为聚体状态,煮后呈现单体状态。The results are shown in Figure 8. Lanes 2 and 3 are the gel bands of the target protein solution before and after boiling at 95°C, respectively. The results show that the protein band is near the spotting hole when the target protein is not boiled and has a larger molecular weight. After boiling, the size is about 30KD, which is the same as the expected target protein. This shows that the target protein is in a polymer state before boiling and in a monomer state after boiling.
经过检测,目的蛋白溶液的浓度为1.0mg/mL。After testing, the concentration of the target protein solution was 1.0 mg/mL.
实施例3、BCP22蛋白及其突变体作为孔蛋白在纳米孔测序中的应用Example 3: Application of BCP22 protein and its mutants as porins in nanopore sequencing
一、文库构建1. Library Construction
将两条部分区域互补的DNA链(top strand(SEQ ID No.2,其中Y=iSP18)和bottom strand(SEQ ID No.3)退火后形成接头(如图9所示),再与待测双链目的片段(SEQ ID No.6)利用T4DNA连接酶在室温下连接并纯化,制备测序文库。然后该测序文库与解旋酶BCH105(SEQ ID No.5)在25℃孵育1h(测序文库与解旋酶BCH105的摩尔浓度比1:8),形成含有BCH105马达蛋白的测序文库(如图9A所示)。在测序时,该测序文库能够进一步与带有胆固醇的单链DNA(SEQ ID NO:4,胆固醇连接在DNA的5'端)互补配对结合,形成如图9B所示结构。Two partially complementary DNA chains (top strand (SEQ ID No. 2, where Y = iSP18) and bottom strand (SEQ ID No. 3) were annealed to form a linker (as shown in FIG. 9 ), and then connected to the double-stranded target fragment to be tested (SEQ ID No. 6) using T4 DNA ligase at room temperature and purified to prepare a sequencing library. The sequencing library was then incubated with helicase BCH105 (SEQ ID No. 5) at 25°C for 1 h (the molar concentration ratio of the sequencing library to the helicase BCH105 was 1:8) to form a sequencing library containing the BCH105 motor protein (as shown in FIG. 9A ). During sequencing, the sequencing library can further complementarily pair with single-stranded DNA with cholesterol (SEQ ID NO: 4, cholesterol connected to the 5' end of the DNA) to form a structure as shown in FIG. 9B .
二、利用孔蛋白BCP22及其突变体构建纳米孔生物传感器2. Construction of nanopore biosensors using porin BCP22 and its mutants
使用膜片钳放大器采集电流信号。Ag/AgCl电极浸润在测序缓冲液中并且电极分别位于电解槽cis和trans区域。测序文库和纳米孔等试剂加入到cis区域中。The current signal is collected using a patch clamp amplifier. Ag/AgCl electrodes are immersed in sequencing buffer and are located in the cis and trans regions of the electrolytic cell. Sequencing libraries and nanopore reagents are added to the cis region.
使用1xPBS缓冲液将实施例2获得的孔蛋白BCP22稀释一定的倍数后,在外加电场力作用下将单个纳米孔蛋白插入由二脂酰磷脂酰胆碱(DPhPC,1,2-diphytanoyl-sn-glycero-3-phosphocholine)组成的磷脂双分子层中,形成纳米孔生物传感器。After the porin BCP22 obtained in Example 2 was diluted a certain multiple using 1xPBS buffer, a single nanopore protein was inserted into a phospholipid bilayer composed of diacylphosphatidylcholine (DPhPC, 1,2-diphytanoyl-sn-glycero-3-phosphocholine) under the action of an external electric field to form a nanopore biosensor.
施加外加电压,获得单个孔蛋白的电流振幅值。An external voltage was applied and the current amplitude value of a single porin was obtained.
图10为施加0.02V、0.04V、0.10V、0.14V和0.18V电压时野生型BCP22的纳米孔生物传感电流(从左到后的图依次为0.02V、0.04V、0.10V、0.14V和0.18V电压),表明BCP22蛋白可以插到膜上,然后中间的孔道可以允许离子通过,产生开孔电流,并且开孔电流的噪声比较小。Figure 10 shows the nanopore biosensor current of wild-type BCP22 when voltages of 0.02V, 0.04V, 0.10V, 0.14V and 0.18V are applied (the figures from left to back are voltages of 0.02V, 0.04V, 0.10V, 0.14V and 0.18V, respectively), indicating that the BCP22 protein can be inserted into the membrane, and then the pore in the middle can allow ions to pass through, generating an open current, and the noise of the open current is relatively small.
三、利用孔蛋白BCP22用于DNA测序3. Using Porin BCP22 for DNA Sequencing
将上述一制备的含有BCH105马达蛋白的测序文库和带有胆固醇的单 链DNA(SEQ ID No.4)与测序缓冲液混合并加入上述二制备的纳米孔生物传感器中;施加外加电压0.14V或0.18V后,观察到DNA被纳米孔捕获,产生特征的阻滞电流振幅值。并且随着DNA通过纳米孔移动,电流振幅值改变。不同的DNA序列产生不同的阻滞电流振幅值。带有胆固醇的单链DNA可以与磷脂双分子层进行结合,有助于纳米孔捕获测序文库,降低测序文库上样量。The sequencing library containing the BCH105 motor protein prepared in the first step and the single-stranded DNA with cholesterol (SEQ ID No. 4) were mixed with the sequencing buffer and added to the nanopore biosensor prepared in the second step; after applying an external voltage of 0.14V or 0.18V, it was observed that the DNA was captured by the nanopore, generating a characteristic blocking current amplitude value. And as the DNA moves through the nanopore, the current amplitude value changes. Different DNA sequences produce different blocking current amplitude values. The single-stranded DNA with cholesterol can bind to the phospholipid bilayer, which helps the nanopore capture the sequencing library and reduce the amount of sequencing library loading.
测序缓冲液:0.47M KCl、25mM HEPES、1mM EDTA、5mM ATP、25mM MgCl2、余量为水,pH7.6。Sequencing buffer: 0.47M KCl, 25mM HEPES, 1mM EDTA, 5mM ATP, 25mM MgCl2, balance water, pH 7.6.
图11为在外加电压0.14V作用下,文库DNA穿过纳米孔蛋白BCP22时产生的电流变化,表明了野生型蛋白BCP22可以作为纳米孔,用于纳米孔测序。FIG. 11 shows the current change when the library DNA passes through the nanopore protein BCP22 under the action of an applied voltage of 0.14 V, indicating that the wild-type protein BCP22 can be used as a nanopore for nanopore sequencing.
工业应用Industrial Applications
本发明通过计算机辅助结构预测的基因挖掘手段,从深海宏基因组中挖掘得到一种纳米孔蛋白BCP22,通过蛋白制备和纳米孔测序验证,表明其具有应用于纳米孔测序的能力,可以针对小分子、DNA、RNA及多肽等的进行检测。The present invention obtains a nanopore protein BCP22 from the deep-sea metagenome through gene mining with computer-aided structure prediction. Protein preparation and nanopore sequencing verification show that it has the ability to be applied to nanopore sequencing and can detect small molecules, DNA, RNA and polypeptides.

Claims (20)

  1. 一种孔蛋白,是如下(a)、(b)或(c):A porin is (a), (b) or (c):
    a)具有如SEQ ID NO:1所示氨基酸序列的蛋白质;a) a protein having the amino acid sequence shown in SEQ ID NO: 1;
    b)与SEQ ID NO:1具有70%以上同一性且具有SEQ ID NO:1功能的蛋白质;b) a protein that has more than 70% identity to SEQ ID NO:1 and has the function of SEQ ID NO:1;
    c)在a)或b)所示蛋白序列的末端添加标签得到的融合蛋白。c) A fusion protein obtained by adding a tag to the end of the protein sequence shown in a) or b).
  2. 根据权利要求1所述的孔蛋白,其特征在于:The porin according to claim 1, characterized in that:
    b)所示的蛋白为a)所示蛋白的突变体,其与SEQ ID NO.1所示的氨基酸序列相比,所述突变体在如下一个或多个位点的氨基酸残基发生突变:66、67、71和72。The protein shown in b) is a mutant of the protein shown in a), and compared with the amino acid sequence shown in SEQ ID NO.1, the mutant has mutations in the amino acid residues at one or more of the following sites: 66, 67, 71 and 72.
  3. 根据权利要求2所述的孔蛋白,其特征在于:The porin according to claim 2, characterized in that:
    所述各个位点的突变方式如下:The mutation modes of each site are as follows:
    第66位的N突变为A,G或S;The N at position 66 mutated to A, G or S;
    第67位的D突变为A,G,S,T,N或Q;The D at position 67 mutated to A, G, S, T, N, or Q;
    第71位的E突变为A,G,S,T,N或Q;E at position 71 mutated to A, G, S, T, N, or Q;
    第72位的Y突变为A,G,S,T,N或Q。The Y at position 72 is mutated to A, G, S, T, N or Q.
  4. 根据权利要求1所述的孔蛋白,其特征在于:The porin according to claim 1, characterized in that:
    b)所示的蛋白为a)所示蛋白的突变体,其与SEQ ID NO.1所示的氨基酸序列相比,所述突变体在如下一个或多个位点的氨基酸残基发生突变:102,111,112,113,116,119,120,123,129,162,170,173,175,204,212,217,218,221和224。The protein shown in b) is a mutant of the protein shown in a), and compared with the amino acid sequence shown in SEQ ID NO.1, the mutant has mutations in amino acid residues at one or more of the following sites: 102, 111, 112, 113, 116, 119, 120, 123, 129, 162, 170, 173, 175, 204, 212, 217, 218, 221 and 224.
  5. 根据权利要求2或3所述的孔蛋白,其特征在于:The porin according to claim 2 or 3, characterized in that:
    所述孔蛋白为与权利要求2或3所述的孔蛋白相比,在如下一个或多个位点的氨基酸残基发生突变:102,111,112,113,116,119,120,123,129,162,170,173,175,204,212,217,218,221和224。The porin protein is a porin protein according to claim 2 or 3, wherein the amino acid residues at one or more of the following positions are mutated: 102, 111, 112, 113, 116, 119, 120, 123, 129, 162, 170, 173, 175, 204, 212, 217, 218, 221 and 224.
  6. 根据权利要求5所述的孔蛋白,其特征在于:The porin according to claim 5, characterized in that:
    所述各个位点的突变的方式如下:The mutation modes of each site are as follows:
    第102位的R突变为A,G,S,T,N或Q;The R at position 102 mutated to A, G, S, T, N, or Q;
    第111位的E突变为R,K,A,G,S,T,N或Q;E at position 111 mutated to R, K, A, G, S, T, N, or Q;
    第112位的R突变为A,G,S,T,N或Q;The R at position 112 mutated to A, G, S, T, N, or Q;
    第113位的K突变为A,G,S,T,N或Q;The K at position 113 mutated to A, G, S, T, N, or Q;
    第116位的R突变为A,G,S,T,N或Q;The R at position 116 mutated to A, G, S, T, N, or Q;
    第119位的R突变为A,G,S,T,N或Q;The R at position 119 mutated to A, G, S, T, N, or Q;
    第120位D的突变为A,G,S,T,N或Q;The mutation of D at position 120 is A, G, S, T, N or Q;
    第123位的K突变为A,G,S,T,N或Q;The K at position 123 mutated to A, G, S, T, N or Q;
    第129位的K突变为A,G,S,T,N或Q;The K at position 129 mutated to A, G, S, T, N, or Q;
    第162位的R突变为A,G,S,T,N或Q;The R at position 162 mutated to A, G, S, T, N, or Q;
    第170位的S突变为A,G,V,L,I,Y,F或W;The S at position 170 mutated to A, G, V, L, I, Y, F, or W;
    第173位的R突变为A,G,S,T,N或Q;The R at position 173 mutated to A, G, S, T, N, or Q;
    第175位的D突变为A,G,S,T,N或Q;The D at position 175 mutated to A, G, S, T, N, or Q;
    第204位的S突变为A,G,V,L,I,Y,F或W;The S at position 204 mutated to A, G, V, L, I, Y, F, or W;
    第212位的K突变为D,E,A,G,S,T,N或Q;The K at position 212 mutated to D, E, A, G, S, T, N, or Q;
    第217位的D突变为A,G,S,T,N或Q;The D at position 217 mutated to A, G, S, T, N, or Q;
    第218位的K突变为D,E,A,G,S,T,N或Q;The K at position 218 mutated to D, E, A, G, S, T, N, or Q;
    第221位的E突变为A,G,S,T,N或Q;E at position 221 mutated to A, G, S, T, N, or Q;
    第224位的T突变为A,G,V,L,I,Y,F或W。The T at position 224 is mutated to A, G, V, L, I, Y, F or W.
  7. 根据根据权利要求1-6中任一所述的孔蛋白,其特征在于:According to any one of claims 1 to 6, the porin is characterized in that:
    所述孔蛋白以多聚体的形式存在;The porin exists in the form of a polymer;
    进一步地,所述孔蛋白以9聚体的形式存在;Further, the porin exists in the form of 9-mers;
    或,所述孔蛋白形成的孔径为0.5-3nm;Or, the pore size formed by the porin is 0.5-3 nm;
    进一步地,所述孔蛋白形成的孔径为1-2nm。Furthermore, the pore size formed by the porin is 1-2 nm.
  8. 编码权利要求1-7中任一所述孔蛋白的核酸分子。A nucleic acid molecule encoding the porin of any one of claims 1-7.
  9. 含有权利要求8所述核酸分子的表达盒、重组载体或转基因细胞系。An expression cassette, a recombinant vector or a transgenic cell line containing the nucleic acid molecule of claim 8.
  10. 一种重组菌,其含有权利要求8所述核酸分子。A recombinant bacterium comprising the nucleic acid molecule according to claim 8.
  11. 权利要求8所述核酸分子或权利要求9所述的表达盒、重组载体或转基因细胞系或权利要求10所述重组菌在制备孔蛋白中的应用。Use of the nucleic acid molecule according to claim 8, the expression cassette, the recombinant vector or the transgenic cell line according to claim 9, or the recombinant bacteria according to claim 10 in the preparation of porins.
  12. 权利要求1-7中任一所述孔蛋白在制备纳米生物传感器中的应用。Use of the porin according to any one of claims 1 to 7 in the preparation of a nanobiosensor.
  13. 一种纳米生物传感器,包括权利要求1-7中任一所述孔蛋白和脂 双分子层。A nano biosensor comprising the porin according to any one of claims 1 to 7 and a lipid bilayer.
  14. 权利要求1-7中任一所述孔蛋白、权利要求8所述核酸分子或权利要求9所述的表达盒、重组载体或转基因细胞系或权利要求10所述重组菌或权利要求13所述的纳米生物传感器在核酸纳米孔测序中的应用。Use of the porin according to any one of claims 1 to 7, the nucleic acid molecule according to claim 8, the expression cassette, recombinant vector or transgenic cell line according to claim 9, the recombinant bacteria according to claim 10 or the nanobiosensor according to claim 13 in nucleic acid nanopore sequencing.
  15. 一种制备生产权利要求1-7中任一所述孔蛋白的方法,包括如下步骤:培养权利要求10所述重组菌,再进行诱导处理,得到权利要求1-7中任一所述孔蛋白。A method for preparing and producing the porin described in any one of claims 1-7, comprising the following steps: culturing the recombinant bacteria described in claim 10, and then performing an induction treatment to obtain the porin described in any one of claims 1-7.
  16. 一种试剂盒,包括权利要求1-7中任一所述孔蛋白。A kit comprising the porin according to any one of claims 1 to 7.
  17. 根据权利要求16所述的试剂盒,其特征在于:所述试剂盒还包括如下至少一种:含有解旋酶识别位点的接头、解旋酶和带有胆固醇的单链DNA。The kit according to claim 16 is characterized in that: the kit further comprises at least one of the following: a linker containing a helicase recognition site, a helicase, and a single-stranded DNA containing cholesterol.
  18. 一种纳米孔测序核酸的方法,包括如下步骤:A method for nanopore sequencing of nucleic acids, comprising the following steps:
    1)制备含有解旋酶的测序文库和纳米孔生物传感器;1) Preparation of sequencing library and nanopore biosensor containing helicase;
    所述纳米孔生物传感器按照如下方法制备:将权利要求1-7中任一所述孔蛋白插入脂双分子层中形成纳米孔通道,构建出纳米孔生物传感器;The nanopore biosensor is prepared according to the following method: inserting the porin described in any one of claims 1 to 7 into a lipid bilayer to form a nanopore channel, thereby constructing a nanopore biosensor;
    2)将所述含有解旋酶的测序文库、带有胆固醇的单链DNA和测序缓冲液混合后加入所述纳米孔生物传感器中,施加电压,实现捕获测序文库。2) The sequencing library containing the helicase, the single-stranded DNA containing cholesterol and the sequencing buffer are mixed and added to the nanopore biosensor, and a voltage is applied to capture the sequencing library.
  19. 根据权利要求18所述的方法,其特征在于:The method according to claim 18, characterized in that:
    所述含有解旋酶的测序文库按照如下方法制备:将待测核酸连接接头,制备测序文库,再将所述测序文库与解旋酶结合,得到含有解旋酶的测序文库;其中,所述接头由两条单链DNA分子退火获得,且所述单链DNA分子中含有解旋酶结合位点。The sequencing library containing the helicase is prepared according to the following method: the nucleic acid to be tested is connected to a connector to prepare a sequencing library, and then the sequencing library is combined with a helicase to obtain a sequencing library containing the helicase; wherein the connector is obtained by annealing two single-stranded DNA molecules, and the single-stranded DNA molecules contain a helicase binding site.
  20. 根据权利要求18或19所述的方法,其特征在于:The method according to claim 18 or 19, characterized in that:
    所述纳米孔通道的孔径为0.5-3nm。The pore size of the nanopore channel is 0.5-3 nm.
PCT/CN2022/142827 2022-12-28 2022-12-28 Novel nanopore protein and use thereof WO2024138425A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2022/142827 WO2024138425A1 (en) 2022-12-28 2022-12-28 Novel nanopore protein and use thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2022/142827 WO2024138425A1 (en) 2022-12-28 2022-12-28 Novel nanopore protein and use thereof

Publications (1)

Publication Number Publication Date
WO2024138425A1 true WO2024138425A1 (en) 2024-07-04

Family

ID=91716088

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/142827 WO2024138425A1 (en) 2022-12-28 2022-12-28 Novel nanopore protein and use thereof

Country Status (1)

Country Link
WO (1) WO2024138425A1 (en)

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102216783A (en) * 2008-09-22 2011-10-12 华盛顿大学 Msp nanopores and related methods
CN105801676A (en) * 2016-04-13 2016-07-27 东南大学 Mutant MspA protein monomer and expressed gene and application thereof
CN108699138A (en) * 2016-03-02 2018-10-23 牛津纳米孔技术公司 It is mutated hole
CN110621692A (en) * 2017-05-04 2019-12-27 牛津纳米孔技术公司 Transmembrane pore composed of two CsgG pores
CN110914290A (en) * 2017-06-30 2020-03-24 弗拉芒区生物技术研究所 Novel protein pores
CN111499705A (en) * 2019-01-30 2020-08-07 深圳市梅丽纳米孔科技有限公司 Mutant NfpAB nanopore, test system, manufacturing method and application
CN113195736A (en) * 2018-11-08 2021-07-30 牛津纳米孔科技公司 Hole(s)
CN113480620A (en) * 2021-08-18 2021-10-08 成都齐碳科技有限公司 Mutant of porin monomer, protein pore and application thereof
CN113735948A (en) * 2021-09-28 2021-12-03 成都齐碳科技有限公司 Mutant of porin monomer, protein pore and application thereof
CN113754743A (en) * 2021-10-12 2021-12-07 成都齐碳科技有限公司 Mutant of porin monomer, protein pore and application thereof
CN113773373A (en) * 2021-10-12 2021-12-10 成都齐碳科技有限公司 Mutant of porin monomer, protein pore and application thereof
CN113896776A (en) * 2021-10-12 2022-01-07 成都齐碳科技有限公司 Mutant of porin monomer, protein pore and application thereof
CN113912683A (en) * 2021-10-12 2022-01-11 成都齐碳科技有限公司 Mutant of porin monomer, protein pore and application thereof

Patent Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102216783A (en) * 2008-09-22 2011-10-12 华盛顿大学 Msp nanopores and related methods
CN108699138A (en) * 2016-03-02 2018-10-23 牛津纳米孔技术公司 It is mutated hole
CN105801676A (en) * 2016-04-13 2016-07-27 东南大学 Mutant MspA protein monomer and expressed gene and application thereof
CN110621692A (en) * 2017-05-04 2019-12-27 牛津纳米孔技术公司 Transmembrane pore composed of two CsgG pores
CN110914290A (en) * 2017-06-30 2020-03-24 弗拉芒区生物技术研究所 Novel protein pores
CN113195736A (en) * 2018-11-08 2021-07-30 牛津纳米孔科技公司 Hole(s)
CN111499705A (en) * 2019-01-30 2020-08-07 深圳市梅丽纳米孔科技有限公司 Mutant NfpAB nanopore, test system, manufacturing method and application
CN113480620A (en) * 2021-08-18 2021-10-08 成都齐碳科技有限公司 Mutant of porin monomer, protein pore and application thereof
CN113735948A (en) * 2021-09-28 2021-12-03 成都齐碳科技有限公司 Mutant of porin monomer, protein pore and application thereof
CN113754743A (en) * 2021-10-12 2021-12-07 成都齐碳科技有限公司 Mutant of porin monomer, protein pore and application thereof
CN113773373A (en) * 2021-10-12 2021-12-10 成都齐碳科技有限公司 Mutant of porin monomer, protein pore and application thereof
CN113896776A (en) * 2021-10-12 2022-01-07 成都齐碳科技有限公司 Mutant of porin monomer, protein pore and application thereof
CN113912683A (en) * 2021-10-12 2022-01-11 成都齐碳科技有限公司 Mutant of porin monomer, protein pore and application thereof

Similar Documents

Publication Publication Date Title
US20220091096A1 (en) Nanopores with internal protein adaptors
JP7027334B2 (en) Alpha hemolysin variants and their use
JP7499761B2 (en) pore
EP3212810B1 (en) Alpha-hemolysin variants with altered characteristics
US20220242922A1 (en) Biological Nanopores for Biopolymer Sensing and Sequencing Based on FRAC Actinoporin
WO2020155242A1 (en) Mutant nfpab nanopore, test system, manufacturing method therefor and use thereof
JP7157164B2 (en) Alpha-hemolysin variants and their uses
JP2023502658A (en) Engineered nanopores and uses and methods associated therewith
CN114957415B (en) Streptavidin mutant and application and product thereof, gene, recombinant plasmid and genetically engineered bacterium
WO2024138425A1 (en) Novel nanopore protein and use thereof
WO2024138424A1 (en) Nanopore protein and application thereof
WO2024138565A1 (en) Nanopore protein, and mutant and use thereof
WO2024138512A1 (en) Novel porin bcp34, and mutant thereof and use thereof
WO2024138473A1 (en) Porin monomer, porin, mutant thereof, and use thereof
WO2024138472A1 (en) Porin monomer, porin, mutant thereof and use of same
WO2024138382A1 (en) Porin monomer, porin, mutant thereof and use thereof
WO2024138470A1 (en) Nanopore sensor and use thereof in sequencing
CN117721101B (en) Modified CaPif1 helicase and application thereof
CN117417418B (en) Nanopore mutant with ultrahigh thermal stability and application thereof
CN117384260B (en) Novel nanoporous protein mutant and application thereof
WO2023123370A1 (en) Nanopore protein and related use thereof in sequencing
CN116496392A (en) Anti-novel coronavirus N protein single domain antibody, fusion protein, encoding gene and application thereof

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22969577

Country of ref document: EP

Kind code of ref document: A1