JP7212982B1

JP7212982B1 - Cell library and its production method

Info

Publication number: JP7212982B1
Application number: JP2022161024A
Authority: JP
Inventors: 康則相澤; 知幸大野
Original assignee: Logomix Inc Japan
Current assignee: Logomix Inc Japan
Priority date: 2022-10-05
Filing date: 2022-10-05
Publication date: 2023-01-26
Anticipated expiration: 2042-10-05
Also published as: JP2024054652A

Abstract

A method for producing a library containing genetically modified cells is provided.
Kind Code: A1 Disclosed is a cell library containing a plurality of modified cells having a wide variety of types of nucleotide sequences at a specific site of one allele in cells having a plurality of alleles. A cell library can be a combination of aqueous compositions containing one type of modified cell. This library is useful in evaluating the functionality of specific regions of the genome, introducing various types of mutations (eg, SNPs) into specific regions of the genome, and the like.
[Selection drawing] Fig. 2

Description

本発明は、細胞ライブラリおよびその製造方法に関する。 TECHNICAL FIELD The present invention relates to a cell library and its production method.

新規ゲノム編集ツールとしてＣＲＩＳＰＲ／Ｃａｓシステムが報告されてから、ＣＲＩ
ＳＰＲ／Ｃａｓシステムを利用した様々な研究が行われている（例えば、特許文献１）。
ＣＲＩＳＰＲ／Ｃａｓシステムを利用したゲノム編集では、ガイドＲＮＡにより標的化さ
れた標的領域がＣａｓ９ヌクレアーゼにより二本鎖切断される。二本鎖切断されたＤＮＡ
は、相同組み換え修復（ＨｏｍｏｌｏｇｏｕｓＤｉｒｅｃｔｅｄＲｅｐａｉｒ：ＨＤ
Ｒ）又は非相同末端再結合（Ｎｏｎ－ＨｏｍｏｌｏｇｏｕｓＥｎｄ－Ｊｏｉｎｉｎｇ
Ｒｅｐａｉｒ：ＮＨＥＪ）により修復されることが知られている。ＨＤＲでは、標的領域
の周辺領域と相同な配列を有するドナーＤＮＡをＣＲＩＳＰＲ／Ｃａｓシステムと共に細
胞に導入することにより、任意の配列を標的領域に組み込むことができる。 Since the CRISPR/Cas system was reported as a novel genome editing tool, CRI
Various studies using the SPR/Cas system are being conducted (for example, Patent Document 1).
In genome editing using the CRISPR/Cas system, the target region targeted by the guide RNA is double-strand cleaved by Cas9 nuclease. double-strand broken DNA
Homologous Directed Repair: HD
R) or Non-Homologous End-Joining
Repair: NHEJ) is known to be repaired. In HDR, any sequence can be incorporated into the target region by introducing donor DNA having sequences homologous to the regions surrounding the target region into cells with the CRISPR/Cas system.

ゲノム改変技術を用いて、ＨＤＲにより２以上のアレルを同時に効率よく改変する技術
が開発されている（例えば、特許文献２）。特許文献２では、数百ｋｂの大規模欠失を２
以上のアレルを同時に効率よく改変することができたことが開示されている。 Techniques for simultaneously and efficiently modifying two or more alleles by HDR have been developed using genome modification techniques (eg, Patent Document 2). In Patent Document 2, a large-scale deletion of several hundred kb is 2
It is disclosed that the above alleles could be modified simultaneously and efficiently.

国際公開第２０１４／０９３６６１号WO2014/093661 国際公開第２０２１／２０６０５４号WO2021/206054

本開示は、細胞ライブラリーおよびその製造方法を提供する。より具体的には、本開示
は、複数のアレルを有する細胞においてその１アレルの特定部位の塩基配列に豊富な種類
の多様性を有する複数の改変細胞を含む細胞ライブラリーを提供する。細胞ライブラリー
は、１種類の改変細胞を含む水性組成物の組合せであり得る。 The present disclosure provides cell libraries and methods for their production. More specifically, the present disclosure provides a cell library containing a plurality of modified cells having a rich variety of nucleotide sequences at a specific site of one allele in cells having multiple alleles. A cell library can be a combination of aqueous compositions containing one type of modified cell.

（１）改変細胞のライブラリーであって、
ライブラリーは、複数の水性組成物の組合せを含み、
各水性組成物はそれぞれ１種類の改変細胞を含み、
改変細胞はそれぞれ、改変対象である遺伝子座に第一のアレルと第二のアレルを有し、
改変細胞はそれぞれ第一のアレルの同一位置に、水性組成物間で相互に異なるＤＮＡ断
片を含むカセットを有する、
ライブラリー。
（２）各改変細胞は、第二のアレルは、その一部または全部の破壊または欠失を有する、
請求項１または２に記載のライブラリー。
（３）第二のアレルは、シームレスに前記一部または全部を欠失している、上記（２）に
記載のライブラリー。
（４）改変細胞それぞれのＤＮＡ断片を含むカセット以外の配列は、改変前後で実質的に
同一である、上記（１）～（３）のいずれかに記載のライブラリー。
（５）前記カセットの配列のそれぞれは、１以上の改変部分（Ａ）と１以上の非改変部分
（Ｂ）とからなり、前記改変部分（Ａ）はそれぞれ、配列の挿入、欠失、および置換から
なる群から選択される１以上の改変を有し、前記１以上の改変部分の改変は、改変の位置
または内容に関して各水性組成物間で異なり、前記１以上の非改変部分（Ｂ）はそれぞれ
、改変前の対応する部位の配列と同一であり、前記カセット中のセントロメア側の非改変
部分（Ｂ１）は、前記カセットのセントロメア側の隣接配列（Ｃ１）とシームレスに連結
しており、前記カセットのテロメア側の非改変部分（Ｂｔ）は、前記カセットのテロメア
側の隣接配列（Ｃ２）とシームレスに連結しており、記隣接配列（Ｃ１）および非改変部
分（Ｂ１）が連結した領域、並びに非改変部分（Ｂｔ）および上記隣接配列（Ｃ２）が連
結した領域は、改変前の対応する領域の配列と同一の配列を構成している、上記（１）～
（４）のいずれかに記載のライブラリー。
（６）前記改変細胞は、部位特異的組換え酵素の標的配列を含まない、上記（１）～（５
）のいずれかに記載のライブラリー。
（７）ライブラリーに含まれる前記水性組成物の種類は、５０種類以上である、上記（１
）～（６）のいずれかに記載のライブライリー。
（８）改変細胞のライブラリーを製造する方法であって、
（α）改変対象である遺伝子座に第一のアレルと第二のアレルを含むゲノムを有する細胞
において、前記第一のアレルと第二のアレルそれぞれに選択マーカー遺伝子と標的核酸配
列を含むカセットを有する細胞の群を提供することと、
ここで、第一のアレルが有する選択マーカー遺伝子と第二のアレルが有する選択マーカ
ー遺伝子とは区別可能に異なり、前記標的核酸配列は、ゲノム改変システムの標的であり
、ゲノム改変システムにより第一のアレルと第二のアレルとを区別可能に切断できるよう
に設計されており、各選択マーカー遺伝子はネガティブ選択に用いることができるネガテ
ィブ選択用マーカー遺伝子であり、
（β）提供された細胞の群に下記（ｘ）及び（ｙ）を導入する工程と、
（ｘ）第一のアレルに含まれる前記固有の塩基配列を標的とする配列特異的核酸切断分子
、又は前記配列特異的核酸切断分子をコードするポリヌクレオチドを含むゲノム改変シス
テム、
（ｙ）複数種類の第２の組換え用ドナーＤＮＡ｛ここで、複数種類の第２の組換え用ドナ
ーＤＮＡはそれぞれ、上記（ｘ）の前記標的部位の上流側に隣接する塩基配列と相同な塩
基配列を有する上流ホモロジーアームと、前記標的領域の下流側に隣接する塩基配列と相
同な塩基配列を有する下流ホモロジーアームを有し、前記上流ホモロジーアームと前記下
流ホモロジーアームとの間に改変塩基配列を含み、改変塩基配列は、第２の組換え用ドナ
ーＤＮＡ毎に異なり、第２の組換え用ドナーＤＮＡそれぞれに固有である｝、
（γ）前記工程（β）の後、第一のアレルに含まれる選択マーカー遺伝子を発現しない細
胞を選択する工程と、
を含み、
これにより、複数の細胞を含む改変細胞のライブラリー得ることができ、ここで、得ら
れた複数の細胞において、第一のアレルは各細胞に固有の改変塩基配列を有し、第二のア
レルは細胞間で共通の配列を有する、
方法。
（９）上記（８）に記載の方法であって、工程（α）の前に、
改変対象である遺伝子座に第一のアレルと第二のアレルを含むゲノムを有する細胞にお
いて、第一のアレルおよび第二のアレルに含まれる被置換配列を選択マーカー遺伝子と標
的核酸配列を含むカセットにより置換し、これにより被置換配列を第一のアレルおよび第
二のアレルから除去することと、
をさらに含む、方法。
（１０）上記（９）に記載の方法であって、
第一のアレルの改変塩基配列はそれぞれ、第一のアレルの被置換配列に対して、塩基の
付加、挿入、置換、欠失、および削除からなる群から選択される１以上の変異を有する、
方法。
（１１）上記（９）または（１０）に記載の方法であって、
被改変配列は、タンパク質のコード領域であり、
第一のアレルの改変塩基配列は、第一のアレルの被置換配列に対して、塩基の付加、挿
入、置換、欠失、および削除からなる群から選択される１以上の変異を有する、方法
（１２）改変塩基配列は、被改変配列と８０％以上の配列同一性を有する、上記（１０）
または（１１）に記載の方法。
（１３）上記（８）～（１２）のいずれかに記載の方法であって、工程（α）と工程（β
）の間に、
第二のアレルからカセットを除去すること
をさらに含む、方法。
（１４）カセットがシームレスに除去される、上記（１３）に記載の方法。
（１５）上記（８）～（１４）のいずれかに記載の方法により作製される、複数の改変細
胞を含む改変細胞のライブラリー。 (1) a library of modified cells,
the library comprises combinations of a plurality of aqueous compositions;
each aqueous composition comprising one type of modified cell;
Each modified cell has a first allele and a second allele at the locus to be modified,
each of the modified cells has a cassette at the same position of the first allele and containing DNA fragments that differ from each other between the aqueous compositions;
Library.
(2) each modified cell has a second allele that is partially or wholly disrupted or deleted;
3. A library according to claim 1 or 2.
(3) The library according to (2) above, wherein the second allele is seamlessly deleted in part or in whole.
(4) The library according to any one of (1) to (3) above, wherein the sequences other than the cassette containing the DNA fragment of each modified cell are substantially the same before and after modification.
(5) Each of the sequences of the cassette consists of one or more modified portions (A) and one or more unmodified portions (B), and the modified portions (A) are sequence insertions, deletions, and sequences, respectively. having one or more modifications selected from the group consisting of substitutions, the modifications of the one or more modified portions differing between each aqueous composition with respect to the location or content of the modifications, and the one or more unmodified portions (B) is the same as the sequence of the corresponding site before modification, and the unmodified portion (B1) on the centromere side of the cassette is seamlessly linked to the adjacent sequence (C1) on the centromere side of the cassette, The unmodified portion (Bt) on the telomere side of the cassette is seamlessly linked to the flanking sequence (C2) on the telomere side of the cassette, and the region where the flanking sequence (C1) and the unmodified portion (B1) are linked , and the region where the unmodified portion (Bt) and the adjacent sequence (C2) are linked constitutes the same sequence as the sequence of the corresponding region before modification, (1) to
The library according to any one of (4).
(6) The modified cell does not contain the target sequence of the site-specific recombination enzyme, (1) to (5) above
).
(7) The number of types of the aqueous composition contained in the library is 50 or more, the above (1
) to (6).
(8) A method for producing a library of modified cells, comprising:
(α) in a cell having a genome containing a first allele and a second allele at the genetic locus to be modified, a cassette containing a selection marker gene and a target nucleic acid sequence for each of the first allele and the second allele; providing a population of cells having
Here, the selectable marker gene possessed by the first allele and the selectable marker gene possessed by the second allele are distinguishably different, the target nucleic acid sequence is the target of the genome modification system, and the genome modification system selects the first marker gene. each selectable marker gene is a negative selectable marker gene that can be used for negative selection, and is designed so that the allele and the second allele can be distinguished from each other;
(β) introducing the following (x) and (y) into the provided group of cells;
(x) a genome modification system comprising a sequence-specific nucleic acid cleaving molecule targeting the unique base sequence contained in the first allele, or a polynucleotide encoding the sequence-specific nucleic acid cleaving molecule;
(y) a plurality of types of second recombination donor DNA {wherein the plurality of types of second recombination donor DNA are homologous to the base sequence adjacent to the upstream side of the target site in (x) above, respectively; and a downstream homology arm having a nucleotide sequence homologous to the nucleotide sequence adjacent to the target region downstream, and a modified base between the upstream homology arm and the downstream homology arm sequence, wherein the modified nucleotide sequence is different for each second recombination donor DNA and unique to each second recombination donor DNA},
(γ) selecting cells that do not express the selectable marker gene contained in the first allele after the step (β);
including
Thereby, a library of modified cells comprising a plurality of cells can be obtained, wherein in the plurality of cells obtained, the first allele has a modified nucleotide sequence unique to each cell, and the second allele has a common sequence between cells,
Method.
(9) The method according to (8) above, wherein, before step (α),
In a cell having a genome containing a first allele and a second allele at a genetic locus to be modified, a cassette containing a selection marker gene and a target nucleic acid sequence for the substituted sequences contained in the first allele and the second allele thereby removing the substituted sequence from the first allele and the second allele;
The method further comprising:
(10) The method according to (9) above,
Each modified nucleotide sequence of the first allele has one or more mutations selected from the group consisting of addition, insertion, substitution, deletion, and deletion of bases relative to the substituted sequence of the first allele,
Method.
(11) The method according to (9) or (10) above,
The sequence to be modified is a protein coding region,
The modified base sequence of the first allele has one or more mutations selected from the group consisting of base additions, insertions, substitutions, deletions, and deletions with respect to the substituted sequence of the first allele. (12) The modified base sequence has 80% or more sequence identity with the modified sequence (10)
Or the method according to (11).
(13) The method according to any one of (8) to (12) above, wherein step (α) and step (β)
)Between,
The method further comprising removing the cassette from the second allele.
(14) The method according to (13) above, wherein the cassette is seamlessly removed.
(15) A modified cell library containing a plurality of modified cells prepared by the method according to any one of (8) to (14) above.

図１は、本発明のライブラリー作製のスキームの一例の概要を示す。FIG. 1 shows an outline of an example of the scheme for preparing the library of the present invention. 図２は、本発明の第一の改変細胞のライブラリー作製のスキームの一例を示す。FIG. 2 shows an example of a scheme for preparing the first modified cell library of the present invention. 図３は、改変細胞のライブラリーにおいて、改変塩基配列が、改変部分と非改変部分とを含む場合の当該改変部分と非改変部分の位置関係の一例を示す図である。このようにすることで、改変塩基配列内の様々な箇所に様々な変異を有する多様な改変細胞を含むライブラリーを構成することができる。FIG. 3 is a diagram showing an example of the positional relationship between the modified portion and the unmodified portion when the modified base sequence contains the modified portion and the unmodified portion in the modified cell library. By doing so, a library containing various modified cells having various mutations at various sites in the modified base sequence can be constructed. 図４は、部位特異的組換え酵素を用いたゲノム改変方法の特徴（欠点）を示す。FIG. 4 shows features (disadvantages) of genome modification methods using site-specific recombination enzymes.

［定義］
「ポリヌクレオチド」及び「核酸」という用語は、相互に互換的に使用され、ヌクレオ
チドがホスホジエステル結合によって結合したヌクレオチドポリマーを指す。「ポリヌク
レオチド」及び「核酸」は、ＤＮＡであってもよく、ＲＮＡであってもよく、ＤＮＡとＲ
ＮＡとの組み合わせから構成されてもよい。また、「ポリヌクレオチド」及び「核酸」は
、天然ヌクレオチドのポリマーであってもよく、天然ヌクレオチドと非天然ヌクレオチド
（天然ヌクレオチドの類似体、塩基部分、糖部分及びリン酸部分のうち少なくとも一つの
部分が修飾されているヌクレオチド（例えば、ホスホロチオエート骨格）等）とのポリマ
ーであってもよく、非天然ヌクレオチドのポリマーであってもよい。 [definition]
The terms "polynucleotide" and "nucleic acid" are used interchangeably and refer to a nucleotide polymer in which the nucleotides are linked by phosphodiester bonds. "Polynucleotide" and "nucleic acid" may be DNA or RNA, and DNA and R
It may be configured from a combination with NA. "Polynucleotides" and "nucleic acids" may also be polymers of natural nucleotides, including natural nucleotides and non-natural nucleotides (analogs of natural nucleotides, base moieties, sugar moieties and phosphate moieties). may be a polymer with modified nucleotides (eg, phosphorothioate backbone), or may be a polymer of non-natural nucleotides.

「ポリヌクレオチド」又は「核酸」の塩基配列は、特に明示しない限り、一般的に認め
られている１文字コードで記載される。特に明示しない限り、塩基配列は、５’側から３
’側に向かって記載する。「ポリヌクレオチド」又は「核酸」を構成するヌクレオチド残
基は、単に、アデニン、チミン、シトシン、グアニン、又はウラシル等、あるいはそれら
の１文字コードで記載される場合がある。 Base sequences of "polynucleotides" or "nucleic acids" are described in the generally accepted one-letter code unless otherwise specified. Unless otherwise specified, nucleotide sequences are 3
' is written towards the side. Nucleotide residues constituting a “polynucleotide” or “nucleic acid” may simply be described as adenine, thymine, cytosine, guanine, uracil, etc., or their one-letter codes.

「遺伝子」という用語は、特定のタンパク質をコードする少なくとも１つのオープンリ
ーディングフレームを含むポリヌクレオチドを指す。遺伝子は、エクソン及びイントロン
の両方を含み得る。 The term "gene" refers to a polynucleotide containing at least one open reading frame that encodes a particular protein. A gene may contain both exons and introns.

「ポリペプチド」、「ペプチド」及び「タンパク質」という用語は、相互に互換的に使
用され、アミド結合によって結合したアミノ酸のポリマーを指す。「ポリペプチド」、「
ペプチド」又は「タンパク質」は、天然アミノ酸のポリマーであってもよく、天然アミノ
酸と非天然アミノ酸（天然アミノ酸の化学的類似体、修飾誘導体等）とのポリマーであっ
てもよく、非天然アミノ酸のポリマーであってもよい。特に明示しない限り、アミノ酸配
列は、Ｎ末端側からＣ末端側に向かって記載する。 The terms "polypeptide", "peptide" and "protein" are used interchangeably to refer to a polymer of amino acids joined by amide bonds. "polypeptide", "
A "peptide" or "protein" may be a polymer of natural amino acids, a polymer of natural amino acids and non-natural amino acids (such as chemical analogues, modified derivatives of natural amino acids), or non-natural amino acids. It may be a polymer. Unless otherwise specified, amino acid sequences are written from the N-terminal side to the C-terminal side.

「アレル」という用語は、染色体ゲノム上の同一座位に存在する塩基配列のセットを指
す。ある態様では、２倍体の細胞では同一座位に２アレル存在し、３倍体の細胞では同一
座位に３アレル存在する。また、ある態様では、染色体の異常なコピーまたは当該座位の
異常な追加のコピーによって追加のアレルが形成されている場合がある。 The term "allele" refers to a set of base sequences present at the same locus on the chromosomal genome. In some embodiments, diploid cells have 2 alleles at the same locus and triploid cells have 3 alleles at the same locus. Also, in some embodiments, additional alleles may be formed by aberrant copies of the chromosome or aberrant additional copies of the locus.

「ゲノム改変」又は「ゲノム編集」という用語は、相互に互換的に用いられ、ゲノム上
の所望の位置（標的領域）に変異を誘導することを指す。ゲノム改変は、標的領域ＤＮＡ
を切断するように設計された配列特異的核酸切断分子の使用を含み得る。好ましい実施形
態において、ゲノム改変は、標的領域のＤＮＡを切断するように操作されたヌクレアーゼ
の使用、を含み得る。好ましい実施形態において、ゲノム改変は、標的領域中の特定の塩
基配列を有する標的配列を切断するように操作されたヌクレアーゼ（例えば、ＴＡＬＥＮ
やＺＦＮ）の使用を含み得る。好ましい実施形態において、ゲノム改変は、標的領域中の
特定の塩基配列を有する標的配列を切断するように、メガヌクレアーゼなどのゲノムに１
つしか切断部位を有しない制限酵素（例えば、１６ベースの配列特異性を有する制限酵素
（理論上は４¹⁶塩基に１つの割合で存在する）、１７ベースの配列特異性を有する制限酵
素（理論上は４¹⁷塩基に１つの割合で存在する）、および１８ベースの配列特異性を有す
る制限酵素（理論上は４¹⁸塩基に１つの割合で存在する））などの配列特異的エンドヌク
レアーゼを用いることができる場合もある。典型的には、部位特異的ヌクレアーゼの使用
により、標的領域のＤＮＡに二本鎖切断（ＤＳＢ）が誘導され、その後、相同組み換え修
復（ＨｏｍｏｌｏｇｏｕｓＤｉｒｅｃｔｅｄＲｅｐａｉｒ：ＨＤＲ）及び非相同末端
再結合（Ｎｏｎ－ＨｏｍｏｌｏｇｏｕｓＥｎｄ－ＪｏｉｎｉｎｇＲｅｐａｉｒ：ＮＨ
ＥＪ）のような、細胞の内因性プロセスによってゲノムが修復される。ＮＨＥＪは、ドナ
ーＤＮＡを用いずに二本鎖切断された末端を連結する修復方法であり、修復の際に挿入及
び／又は欠失（ｉｎｄｅｌ）が高頻度で誘導される。ＨＤＲは、ドナーＤＮＡを用いた修
復機構であり、標的領域に所望の変異を導入することも可能である。ゲノム改変技術とし
ては、例えば、ＣＲＩＳＰＲ／Ｃａｓシステムが好ましく例示される。メガヌクレアーゼ
としては、例えば、Ｉ－ＳｃｅＩ、Ｉ－ＳｃｅＩＩ、Ｉ－ＳｃｅＩＩＩ、Ｉ－ＳｃｅＩＶ
、Ｉ－ＳｃｅＶ、Ｉ－ＳｃｅＶＩ、Ｉ－ＳｃｅＶＩＩ、Ｉ－ＣｅｕＩ、Ｉ－ＣｅｕＡＩＩ
Ｐ、Ｉ－ＣｒｅＩ、Ｉ－ＣｒｅｐｓｂＩＰ、Ｉ－ＣｒｅｐｓｂＩＩＰ、Ｉ－Ｃｒｅｐｓｂ
ＩＩＩＰ、Ｉ－ＣｒｅｐｓｂＩＶＰ、Ｉ－ＴｌｉＩ、Ｉ－ＰｐｏＩ、ＰＩ－ＰｓｐＩ、Ｆ
－ＳｃｅＩ、Ｆ－ＳｃｅＩＩ、Ｆ－ＳｕｖＩ、Ｆ－ＴｅｖＩ、Ｆ－ＴｅｖＩＩ、Ｉ－Ａｍ
ａＩ、Ｉ－ＡｎｉＩ、Ｉ－ＣｈｕＩ、Ｉ－ＣｍｏｅＩ、Ｉ－ＣｐａＩ、Ｉ－ＣｐａＩＩ、
Ｉ－ＣｓｍＩ、Ｉ－ＣｖｕＩ、Ｉ－ＣｖｕＡＩＰ、Ｉ－ＤｄｉＩ、Ｉ－ＤｄｉＩＩ、Ｉ－
ＤｉｒＩ、Ｉ－ＤｍｏＩ、Ｉ－ＨｍｕＩ、Ｉ－ＨｍｕＩＩ、Ｉ－ＨｓＮＩＰ、Ｉ－Ｌｌａ
Ｉ、Ｉ－ＭｓｏＩ、Ｉ－ＮａａＩ、Ｉ－ＮａｎＩ、Ｉ－ＮｃｌＩＰ、Ｉ－ＮｇｒＩＰ、Ｉ
－ＮｉｔＩ、Ｉ－ＮｊａＩ、Ｉ－Ｎｓｐ２３６ＩＰ、Ｉ－ＰａｋＩ、Ｉ－ＰｂｏＩＰ、Ｉ
－ＰｃｕＩＰ、Ｉ－ＰｃｕＡＩ、Ｉ－ＰｃｕＶＩ、Ｉ－ＰｇｒＩＰ、Ｉ－ＰｏｂＩＰ、Ｉ
－ＰｏｒＩ、Ｉ－ＰｏｒＩＩＰ、Ｉ－ＰｂｐＩＰ、Ｉ－ＳｐＢｅｔａＩＰ、Ｉ－ＳｃａＩ
、Ｉ－ＳｅｘＩＰ、Ｉ－ＳｎｅＩＰ、Ｉ－ＳｐｏｍＩ、Ｉ－ＳｐｏｍＣＰ、Ｉ－Ｓｐｏｍ
ＩＰ、Ｉ－ＳｐｏｍＩＩＰ、Ｉ－ＳｑｕＩＰ、Ｉ－Ｓｓｐ６８０３１、Ｉ－ＳｔｈＰｈｉ
ＪＰ、Ｉ－ＳｔｈＰｈｉＳＴ３Ｐ、Ｉ－ＳｔｈＰｈｉＳＴｅ３ｂＰ、Ｉ－ＴｄｅＩＰ、Ｉ
－ＴｅｖＩ、Ｉ－ＴｅｖＩＩ、Ｉ－ＴｅｖＩＩＩ、Ｉ－ＵａｒＡＰ、Ｉ－ＵａｒＨＧＰＡ
ＩＰ、Ｉ－ＵａｒＨＧＰＡ１３Ｐ、Ｉ－ＶｉｎＩＰ、Ｉ－ＺｂｉＩＰ、ＰＩ－Ｍｔｕｌ、
ＰＩ－ＭｔｕＨＩＰＰＩ－ＭｔｕＨＩＩＰ、ＰＩ－ＰｆｕＩ、ＰＩ－ＰｆｕＩＩ、ＰＩ
－ＰｋｏＩ、ＰＩ－ＰｋｏＩＩ、ＰＩ－Ｒｍａ４３８１２ＩＰ、ＰＩ－ＳｐＢｅｔａＩＰ
、ＰＩ－ＳｃｅＩ、ＰＩ－ＴｆｕＩ、ＰＩ－ＴｆｕＩＩ、ＰＩ－ＴｈｙＩ、ＰＩ－Ｔｌｉ
Ｉ、およびＰＩ－ＴｌｉＩＩ、並びにこれらの機能的な誘導体制限酵素からなる群から選
択されるメガヌクレアーゼおよびその切断部位（または認識部位）、好ましくは、１８塩
基以上の配列特異性を有する制限酵素であるメガヌクレアーゼおよびその切断部位（また
は認識部位）、特に細胞のゲノムを１箇所または複数箇所以上切断しないメガヌクレアー
ゼおよびその切断部位を用いることができる。 The terms "genome modification" or "genome editing" are used interchangeably and refer to the introduction of mutations at desired locations (target regions) on the genome. Genome modification is targeted region DNA
can include the use of sequence-specific nucleic acid cleaving molecules designed to cleave the In preferred embodiments, genomic modification may involve the use of nucleases engineered to cleave the DNA of the target region. In a preferred embodiment, genomic modification involves nucleases (e.g., TALENs) engineered to cleave target sequences with specific base sequences in the target region.
or ZFN). In a preferred embodiment, the genome modification involves the addition of 1 to the genome, such as a meganuclease, to cleave a target sequence having a specific base sequence in the target region.
Restriction enzymes with only one cleavage site (e.g., restriction enzymes with 16-base sequence specificity (theoretically, one in 4 ¹⁶ bases exists), restriction enzymes with 17-base sequence specificity (theoretically using sequence-specific endonucleases such as 1 in 4 ¹⁷ bases above) and restriction enzymes with 18-base sequence specificity (theoretically 1 in 4 ¹⁸ bases) sometimes it is possible. Typically, the use of site-specific nucleases induces double-strand breaks (DSBs) in the DNA of the target region, followed by homologous directed repair (HDR) and non-homologous end recombination (Non- Homologous End-Joining Repair: NH
The genome is repaired by endogenous processes of the cell, such as EJ). NHEJ is a repair method that joins the ends of double-strand breaks without using donor DNA, and insertions and/or deletions (indels) are frequently induced during repair. HDR is a repair mechanism using donor DNA, and it is also possible to introduce desired mutations into target regions. As a genome modification technology, for example, the CRISPR/Cas system is preferably exemplified. Examples of meganucleases include I-SceI, I-SceII, I-SceIII, I-SceIV
, I-SceV, I-SceVI, I-SceVII, I-CeuI, I-CeuAII
P, I-CreI, I-CrepsbIP, I-CrepsbIIP, I-Crepsb
IIP, I-CrepsbIVP, I-TliI, I-PpoI, PI-PspI, F
- SceI, F-SceII, F-SuvI, F-TevI, F-TevII, I-Am
aI, I-AniI, I-ChuI, I-CmoeI, I-CpaI, I-CpaII,
I-CsmI, I-CvuI, I-CvuAIP, I-DdiI, I-DdiII, I-
DirI, I-DmoI, I-HmuI, I-HmuII, I-HsNIP, I-Lla
I, I-MsoI, I-NaaI, I-NanI, I-NclIP, I-NgrIP, I
- NitI, I - NjaI, I - Nsp236IP, I - PakI, I - PboIP, I
-PcuIP, I-PcuAI, I-PcuVI, I-PgrIP, I-PobIP, I
-PorI, I-PorIIP, I-PbpIP, I-SpBetaIP, I-ScaI
, I-SexIP, I-SneIP, I-SpomI, I-SpomCP, I-Spom
IP, I-SpomIIP, I-SquiIP, I-Ssp68031, I-SthPhi
JP, I-SthPhiST3P, I-SthPhiSTe3bP, I-TdeIP, I
-TevI, I-TevII, I-TevIII, I-UarAP, I-UarHGPA
IP, I-UarHGPA13P, I-VinIP, I-ZbiIP, PI-Mtul,
PI-MtuHIP PI-MtuHIIP, PI-PfuI, PI-PfuII, PI
-PkoI, PI-PkoII, PI-Rma43812IP, PI-SpBetaIP
, PI-SceI, PI-TfuI, PI-TfuII, PI-ThyI, PI-Tli
A meganuclease and its cleavage site (or recognition site) selected from the group consisting of I, and PI-TliII, and functional derivative restriction enzymes thereof, preferably a restriction enzyme having a sequence specificity of 18 bases or more Certain meganucleases and their cleavage sites (or recognition sites) can be used, particularly meganucleases and their cleavage sites that do not cut one or more of the genome of the cell.

「標的領域」という用語は、ゲノム改変の対象となるゲノム領域を指す。「欠失」は、
リファレンスゲノムに対する１塩基以上の欠失、および１遺伝子以上の欠失を含む。欠失
は、１００ｂｐ以上の欠失、２００ｂｐ以上の欠失、３００ｂｐ以上の欠失、４００ｂｐ
以上の欠失、５００ｂｐ以上の欠失、６００ｂｐ以上の欠失、７００ｂｐ以上の欠失、８
００ｂｐ以上の欠失、９００ｂｐ以上の欠失、１ｋｂｐ以上の欠失、１０ｋｂｐ以上の欠
失、５０ｋｂｐ以上の欠失、１００ｋｂｐ以上の欠失、２００ｋｂｐ以上の欠失、３００
ｋｂｐ以上の欠失、４００ｋｂｐ以上の欠失、５００ｋｂｐ以上の欠失、または１Ｍｂｐ
以上の欠失またはそれ以下の欠失であり得る。欠失は、１Ｍｂｐ以下の欠失であり得る。
欠失は、７００ｋｂｐ以下の欠失であり得る。欠失は、６００ｋｂｐ以下の欠失であり得
る。欠失は、５００ｋｂｐ以下の欠失であり得る。欠失は、１０ｋｂｐ以上６００ｋｂｐ
以下の欠失であり得る。欠失は、１００ｋｂｐ以上６００ｋｂｐ以下の欠失であり得る。
欠失は、１００ｋｂｐ以上５００ｋｂｐ以下の欠失であり得る。 The term "target region" refers to the genomic region targeted for genomic modification. A "deletion" is
Including deletions of one or more bases relative to the reference genome, and deletions of one or more genes. Deletion is 100 bp or more deletion, 200 bp or more deletion, 300 bp or more deletion, 400 bp
≥ 500 bp deletion ≥ 600 bp deletion ≥ 700 bp deletion 8
00 bp or more deletion, 900 bp or more deletion, 1 kbp or more deletion, 10 kbp or more deletion, 50 kbp or more deletion, 100 kbp or more deletion, 200 kbp or more deletion, 300
kbp or greater deletion, 400 kbp or greater deletion, 500 kbp or greater deletion, or 1 Mbp
There can be more or less deletions. Deletions can be deletions of 1 Mbp or less.
Deletions can be deletions of 700 kbp or less. Deletions can be deletions of 600 kbp or less. Deletions can be deletions of 500 kbp or less. Deletion ≥ 10 kbp and 600 kbp
The deletion can be: Deletions can be deletions of 100 kbp or more and 600 kbp or less.
Deletions can be deletions of 100 kbp or more and 500 kbp or less.

「ドナーＤＮＡ」という用語は、ＤＮＡの二本鎖切断の修復に用いられるＤＮＡであっ
て、標的領域周辺のＤＮＡと相同組換え可能なＤＮＡを指す。ドナーＤＮＡは、ホモロジ
ーアームとして標的領域の上流の塩基配列および下流の塩基配列（例えば、標的領域に隣
接する塩基配列）を含む。本明細書においては、標的領域の上流側の塩基配列（例えば、
上流側に隣接する塩基配列）からなるホモロジーアームを「上流ホモロジーアーム」、標
的配列の下流側の塩基配列（例えば、下流側に隣接する塩基配列）からなるホモロジーア
ームを「下流ホモロジーアーム」と記載する場合がある。ドナーＤＮＡは、上流ホモロジ
ーアームと下流ホモロジーアームとの間に、所望の塩基配列を含むことができる。各ホモ
ロジーアームの長さは、３００ｂｐ以上が好ましく、通常５００～３０００ｂｐ程度であ
る。上流ホモロジーアーム及び下流ホモロジーアームの長さは、互いに同じであってもよ
く、異なっていてもよい。標的領域は、配列依存的な切断後にドナーＤＮＡとの間で相同
組換えの誘発に成功すると、標的領域の上流の塩基配列および下流の塩基配列の間の配列
が、ドナーＤＮＡの上流の塩基配列および下流の塩基配列に挟まれている配列と置き換わ
ることとなる。 The term "donor DNA" refers to DNA used to repair double-strand breaks in DNA that is capable of homologous recombination with the DNA surrounding the target region. The donor DNA contains a base sequence upstream of the target region and a base sequence downstream of the target region (eg, a base sequence flanking the target region) as homology arms. In the present specification, the nucleotide sequence upstream of the target region (e.g.,
A homology arm consisting of a nucleotide sequence adjacent to the upstream side is described as an "upstream homology arm", and a homology arm consisting of a nucleotide sequence downstream of the target sequence (for example, a nucleotide sequence adjacent to the downstream side) is described as a "downstream homology arm". sometimes. The donor DNA can contain the desired nucleotide sequence between the upstream homology arm and the downstream homology arm. The length of each homology arm is preferably 300 bp or more, and usually about 500 to 3000 bp. The lengths of the upstream homology arm and the downstream homology arm may be the same or different. When the target region successfully induces homologous recombination with the donor DNA after sequence-dependent cleavage, the sequence between the upstream and downstream sequences of the target region is the upstream sequence of the donor DNA. and the sequence flanked by the downstream nucleotide sequence.

標的領域の「上流」とは、標的領域の２本鎖ＤＮＡにおいて、基準となるヌクレオチド
鎖の５’側に位置するＤＮＡ領域を意味する。標的領域の「下流」とは、当該基準となる
ヌクレオチド鎖の３’側に位置するＤＮＡを意味する。２本鎖のいずれの鎖を基準となる
ヌクレオチド鎖とするかは任意である。但し、便宜的に、標的領域がタンパク質コード配
列を含む場合、基準となるヌクレオチド鎖は、通常、センス鎖である。一般的に、プロモ
ーターは、タンパク質コード配列の上流に位置する。ターミネーターは、タンパク質コー
ド配列の下流に位置する。 “Upstream” of the target region means the DNA region located on the 5′ side of the reference nucleotide strand in the double-stranded DNA of the target region. "Downstream" of the target region means DNA located 3' to the reference nucleotide strand. It is arbitrary which strand of the double strand is used as the reference nucleotide strand. Conveniently, however, when the target region comprises a protein-coding sequence, the reference nucleotide strand is usually the sense strand. Generally, the promoter is positioned upstream from the protein-coding sequence. A terminator is located downstream of the protein-coding sequence.

「配列特異的核酸切断分子」という用語は、特定の核酸配列を認識し、前記特定の核酸
配列で核酸を切断することができる分子を指す。配列特異的核酸切断分子は、配列特異的
に核酸切断する活性（配列特異的核酸切断活性）を有する分子である。 The term "sequence-specific nucleic acid cleaving molecule" refers to a molecule capable of recognizing a specific nucleic acid sequence and cleaving nucleic acid at said specific nucleic acid sequence. A sequence-specific nucleic acid cleaving molecule is a molecule having a sequence-specific nucleic acid cleaving activity (sequence-specific nucleic acid cleaving activity).

「標的配列」という用語は、配列特異的核酸切断分子による切断の対象となるゲノム中
のＤＮＡ配列を指す。配列特異的核酸切断分子がＣａｓタンパク質である場合、標的配列
は、Ｃａｓタンパク質による切断の対象となるゲノム中のＤＮＡ配列を指す。Ｃａｓタン
パク質としてＣａｓ９タンパク質を用いる場合、標的配列は、プロトスペーサー隣接モチ
ーフ（ＰＡＭ）の５’側に隣接する配列である必要がある。標的配列は、通常、ＰＡＭの
５’側直前に隣接する１７～３０塩基（好ましくは１８～２５塩基、より好ましくは１９
～２２塩基、さらに好ましくは２０塩基）の配列が選択される。標的配列の設計には、Ｃ
ＲＩＳＰＲＤＥＳＩＧＮ（ｃｒｉｓｐｒ．ｍｉｔ．ｅｄｕ／）等の公知のデザインツー
ルを利用することができる。 The term "target sequence" refers to a DNA sequence in the genome that is targeted for cleavage by a sequence-specific nucleolytic molecule. When the sequence-specific nucleic acid cleaving molecule is a Cas protein, the target sequence refers to the DNA sequence in the genome to be cleaved by the Cas protein. When using the Cas9 protein as the Cas protein, the target sequence should be the sequence flanking the 5' side of the protospacer adjacent motif (PAM). The target sequence is usually 17 to 30 bases (preferably 18 to 25 bases, more preferably 19 bases) immediately preceding the 5′ side of PAM.
A sequence of ~22 bases, more preferably 20 bases) is selected. For the design of target sequences, C
A known design tool such as RISPR DESIGN (crispr.mit.edu/) can be used.

「Ｃａｓタンパク質」という用語は、ＣＲＩＳＰＲ関連（ＣＲＩＳＰＲ－ａｓｓｏｃｉ
ａｔｅｄ）タンパク質を指す。好ましい態様において、Ｃａｓタンパク質は、ガイドＲＮ
Ａと複合体を形成し、エンドヌクレアーゼ活性又はニッカーゼ活性を示す。Ｃａｓタンパ
ク質としては、特に限定されないが、例えば、Ｃａｓ９タンパク質、Ｃｐｆ１タンパク質
、Ｃ２ｃ１タンパク質、Ｃ２ｃ２タンパク質、及びＣ２ｃ３タンパク質等が挙げられる。
Ｃａｓタンパク質は、ガイドＲＮＡと協働してエンドヌクレアーゼ活性又はニッカーゼ活
性を示す限り、野生型Ｃａｓタンパク質及びそのホモログ（パラログ及びオーソログ）、
並びにそれらの変異体を包含する。
好ましい態様において、Ｃａｓタンパク質は、クラス２のＣＲＩＳＰＲ／Ｃａｓ系に関
与するものであり、より好ましくはＩＩ型のＣＲＩＳＰＲ／Ｃａｓ系に関与するものであ
る。Ｃａｓタンパク質の好ましい例としては、Ｃａｓ９タンパク質が例示される。Ｃａｓ
タンパク質の好ましい例としては、Ｃａｓ３タンパク質が例示される。 The term "Cas protein" refers to the CRISPR-associated
ated) refers to proteins. In a preferred embodiment, the Cas protein is a guide RN
It forms a complex with A and exhibits endonuclease activity or nickase activity. Examples of Cas proteins include, but are not limited to, Cas9 protein, Cpf1 protein, C2c1 protein, C2c2 protein, and C2c3 protein.
the wild-type Cas protein and its homologs (paralogs and orthologs), as long as the Cas protein exhibits endonuclease activity or nickase activity in cooperation with the guide RNA;
and variants thereof.
In preferred embodiments, the Cas protein is involved in a class 2 CRISPR/Cas system, more preferably a type II CRISPR/Cas system. A preferred example of the Cas protein is the Cas9 protein. Cas
A preferred example of the protein is Cas3 protein.

「Ｃａｓ９タンパク質」という用語は、ＩＩ型のＣＲＩＳＰＲ／Ｃａｓ系に関与するＣ
ａｓタンパク質を指す。Ｃａｓ９タンパク質は、ガイドＲＮＡと複合体を形成し、ガイド
ＲＮＡと協働して標的領域のＤＮＡを切断する活性を示す。Ｃａｓ９タンパク質は、前記
の活性を有する限り、野生型Ｃａｓ９タンパク質及びそのホモログ（パラログ及びオーソ
ログ）、並びにそれらの変異体を包含する。野生型Ｃａｓ９タンパク質は、ヌクレアーゼ
ドメインとしてＲｕｖＣドメイン及びＨＮＨドメインを有するが、本明細書におけるＣａ
ｓ９タンパク質は、ＲｕｖＣドメイン及びＨＮＨドメインのいずれか一方が不活性化され
たものであってもよい。ＲｕｖＣドメイン及びＨＮＨドメインのいずれか一方が不活性化
されたＣａｓ９は、二本鎖ＤＮＡに対して一本鎖切断（ニック）を導入する。そのため、
ＲｕｖＣドメイン及びＨＮＨドメインのいずれか一方が不活性化されたＣａｓ９を二本鎖
ＤＮＡの切断に用いる場合には、センス鎖とアンチセンス鎖それぞれに対してＣａｓ９の
標的配列を設定し、センス鎖およびアンチセンス差のニックが十分に近い位置で生じ、そ
れにより二本鎖切断が誘発されるように、改変システムを構成することができる。
Ｃａｓ９タンパク質が由来する生物種は特に限定されないが、ストレプトコッカス（Ｓ
ｔｒｅｐｔｏｃｏｃｃｕｓ）属、スタフィロコッカス（Ｓｔａｐｈｙｌｏｃｏｃｃｕｓ）
属、ナイセリア（Ｎｅｉｓｓｅｒｉａ）属、又はトレポネーマ（Ｔｒｅｐｏｎｅｍａ）属
に属する細菌等が好ましく例示される。より具体的には、Ｓ．ｐｙｏｇｅｎｅｓ、Ｓ．ｔ
ｈｅｒｍｏｐｈｉｌｕｓ、Ｓ．ａｕｒｅｕｓ、Ｎ．ｍｅｎｉｎｇｉｔｉｄｉｓ、又はＴ．
ｄｅｎｔｉｃｏｌａ等に由来するＣａｓ９タンパク質が好ましく例示される。好ましい態
様において、Ｃａｓ９タンパク質は、Ｓ．ｐｙｏｇｅｎｅｓ由来のＣａｓ９タンパク質で
ある。 The term "Cas9 protein" refers to the C
refers to the as protein. The Cas9 protein forms a complex with the guide RNA and exhibits the activity of cleaving the DNA of the target region in cooperation with the guide RNA. Cas9 proteins include wild-type Cas9 proteins and their homologues (paralogs and orthologs), and variants thereof, as long as they have the above activity. The wild-type Cas9 protein has the RuvC and HNH domains as nuclease domains, whereas the Ca
The s9 protein may have either one of the RuvC domain and the HNH domain inactivated. Cas9 with either one of the RuvC domain and HNH domain inactivated introduces a single-strand break (nick) into the double-stranded DNA. for that reason,
When one of the RuvC domain and HNH domain is inactivated Cas9 is used for cleaving double-stranded DNA, the target sequence of Cas9 is set for each of the sense strand and the antisense strand, and the sense strand and Modified systems can be constructed such that the antisense differential nicks occur in sufficiently close positions to induce a double-strand break.
The biological species from which the Cas9 protein is derived is not particularly limited, but Streptococcus (S
treptococcus) genus, Staphylococcus
Bacteria belonging to the genus Neisseria or Treponema are preferably exemplified. More specifically, S. pyogenes, S.; t
hermophilus, S. aureus, N. meningitidis, or T.
Cas9 protein derived from denticola and the like is preferably exemplified. In preferred embodiments, the Cas9 protein is derived from S. cerevisiae. pyogenes Cas9 protein.

各種Ｃａｓタンパク質のアミノ酸配列、及びそのコード配列の情報は、ＧｅｎＢａｎｋ
、ＵｎｉＰｒｏｔ、Ａｄｄｇｅｎｅ等の各種データベース上で得ることができる。例えば
、Ｓ．ｐｙｏｇｅｎｅｓのＣａｓ９タンパク質のアミノ酸配列は、プラスミド番号４２２
３０としてＡｄｄｇｅｎｅに登録されたもの等を用いることができる。Ｓ．ｐｙｏｇｅｎ
ｅｓのＣａｓ９タンパク質のアミノ酸配列の一例を配列番号１に示す。 Amino acid sequences of various Cas proteins and information on their coding sequences are available at GenBank
, UniProt, and Addgene. For example, S. The amino acid sequence of the Cas9 protein of pyogenes is plasmid number 422
30 registered with Addgene can be used. S. pyogen
An example of the amino acid sequence of the Cas9 protein of es is shown in SEQ ID NO:1.

「ガイドＲＮＡ」及び「ｇＲＮＡ」という用語は、相互に互換的に使用され、Ｃａｓタ
ンパク質と複合体を形成し、Ｃａｓタンパク質を標的領域に誘導することができるＲＮＡ
を指す。好ましい態様において、ガイドＲＮＡは、ＣＲＩＳＰＲＲＮＡ（ｃｒＲＮＡ）
及びトランス活性化型ＣＲＩＳＰＲＲＮＡ（ｔｒａｃｒＲＮＡ）を含む。ｃｒＲＮＡは
、ゲノム上の標的領域への結合に関与し、ｔｒａｃｒＲＮＡは、Ｃａｓタンパク質との結
合に関与する。好ましい態様において、ｃｒＲＮＡは、スペーサー配列とリピート配列と
を含み、スペーサー配列が標的領域において標的配列の相補鎖と結合する。好ましい態様
において、ｔｒａｃｒＲＮＡは、アンチリピート配列と３’テイル配列とを含む。アンチ
リピート配列はｃｒＲＮＡのリピート配列と相補的な配列を有し、リピート配列と塩基対
を形成し、３’テイル配列は通常３つのステムループを形成する。
ガイドＲＮＡは、ｃｒＲＮＡの３’末端にｔｒａｃｒＲＮＡの５’末端を連結した単一
ガイドＲＮＡ（ｓｇＲＮＡ）であってもよく、ｃｒＲＮＡ及びｔｒａｃｒＲＮＡを別々の
ＲＮＡ分子とし、リピート配列及びアンチリピート配列で塩基対を形成させたものであっ
てもよい。好ましい態様において、ガイドＲＮＡはｓｇＲＮＡである。 The terms "guide RNA" and "gRNA" are used interchangeably and refer to an RNA capable of forming a complex with a Cas protein and directing the Cas protein to a target region.
point to In a preferred embodiment, the guide RNA is CRISPR RNA (crRNA)
and transactivating CRISPR RNA (tracrRNA). crRNA is involved in binding to target regions on the genome, and tracrRNA is involved in binding to Cas proteins. In preferred embodiments, the crRNA comprises a spacer sequence and a repeat sequence, and the spacer sequence binds to the complementary strand of the target sequence in the target region. In preferred embodiments, the tracrRNA comprises an anti-repeat sequence and a 3' tail sequence. The anti-repeat sequence has a sequence complementary to the repeat sequence of crRNA, forms a base pair with the repeat sequence, and the 3' tail sequence usually forms three stem loops.
The guide RNA may be a single guide RNA (sgRNA) in which the 5' end of tracrRNA is ligated to the 3' end of crRNA, wherein crRNA and tracrRNA are separate RNA molecules, and the repeat and anti-repeat sequences are base-paired. may be formed. In preferred embodiments, the guide RNA is an sgRNA.

ｃｒＲＮＡのリピート配列及びｔｒａｃｒＲＮＡの配列は、Ｃａｓタンパク質の種類に
応じて適宜選択することができ、Ｃａｓタンパク質と同じ細菌種に由来するものを用いる
ことができる。
例えば、Ｓ．ｐｙｏｇｅｎｅｓ由来のＣａｓ９タンパク質を用いる場合、ｓｇＲＮＡの
長さは、５０～２２０ヌクレオチド（ｎｔ）程度とすることができ、６０～１８０ｎｔ程
度が好ましく、８０～１２０ｎｔ程度がより好ましい。ｃｒＲＮＡの長さは、スペーサー
配列を含めて約２５～７０塩基長とすることができ、２５～５０ｎｔ程度が好ましい。ｔ
ｒａｃｒＲＮＡの長さは１０～１３０ｎｔ程度とすることができ、３０～８０ｎｔ程度が
好ましい。
ｃｒＲＮＡのリピート配列は、Ｃａｓタンパク質が由来する細菌種におけるものと同じ
であってもよく、３’末端の一部を削除したものであってもよい。ｔｒａｃｒＲＮＡは、
Ｃａｓタンパク質が由来する細菌種における成熟ｔｒａｃｒＲＮＡと同じで配列を有して
いてもよく、当該成熟ｔｒａｃｒＲＮＡの５’末端及び／又は３’末端を切断した末端切
断型であってもよい。例えば、ｔｒａｃｒＲＮＡは、成熟ｔｒａｃｒＲＮＡの３’末端か
ら１～４０個程度のヌクレオチド残基を除去した末端切断型であり得る。また、ｔｒａｃ
ｒＲＮＡは、成熟ｔｒａｃｒＲＮＡの５’末端から１～８０個程度のヌクレオチド残基を
除去した末端切断型であり得る。また、ｔｒａｃｒＲＮＡは、例えば、５’末端から１～
２０程度のヌクレオチド残基を除去し、かつ３’末端から１～４０個程度のヌクレオチド
残基を除去した末端切断型であり得る。
ｓｇＲＮＡ設計のためのｃｒＲＮＡリピート配列及びｔｒａｃｒＲＮＡの配列は、種々
提案されており、当業者は、公知技術に基づいてｓｇＲＮＡを設計することができる（例
えば、Ｊｉｎｅｋｅｔａｌ．（２０１２）Ｓｃｉｅｎｃｅ，３３７，８１６
－２１；Ｍａｌｉｅｔａｌ．（２０１３）Ｓｃｉｅｎｃｅ，３３９：６１
２１，８２３－６；Ｃｏｎｇｅｔａｌ．（２０１３）Ｓｃｉｅｎｃｅ，３
３９：６１２１，８１９－２３；Ｈｗａｎｇｅｔａｌ．（２０１３）Ｎａ
ｔ．Ｂｉｏｔｅｃｈｎｏｌ．３１：３，２２７－９；Ｊｉｎｅｋｅｔａｌ
．（２０１３）ｅＬｉｆｅ，２，ｅ００４７１）。 The repeat sequence of crRNA and the sequence of tracrRNA can be appropriately selected according to the type of Cas protein, and those derived from the same bacterial species as the Cas protein can be used.
For example, S. When using a Cas9 protein derived from pyogenes, the length of the sgRNA can be about 50 to 220 nucleotides (nt), preferably about 60 to 180 nt, more preferably about 80 to 120 nt. The length of the crRNA can be about 25-70 bases including the spacer sequence, preferably about 25-50 nt. t
The length of the racrRNA can be about 10-130 nt, preferably about 30-80 nt.
The repeat sequence of the crRNA may be the same as in the bacterial species from which the Cas protein is derived, or may be partially truncated at the 3' end. tracrRNA is
It may have the same sequence as the mature tracrRNA in the bacterial species from which the Cas protein is derived, or may be a truncated form obtained by truncating the 5'-end and/or 3'-end of the mature tracrRNA. For example, the tracrRNA can be truncated by removing as many as 1-40 nucleotide residues from the 3' end of the mature tracrRNA. Also, trac
The rRNA can be truncated by removing as many as 1-80 nucleotide residues from the 5' end of the mature tracrRNA. In addition, tracrRNA is, for example, 1 to
It may be a truncated form with as many as 20 nucleotide residues removed and as many as 1-40 nucleotide residues removed from the 3' end.
Various crRNA repeat sequences and tracrRNA sequences for sgRNA design have been proposed, and those skilled in the art can design sgRNAs based on known techniques (for example, Jinek et al. (2012) Science, 337, 816
-21; Mali et al. (2013) Science, 339: 61
21, 823-6; Cong et al. (2013) Science, 3
39: 6121, 819-23; Hwang et al. (2013) Na
t. Biotechnol. 31: 3, 227-9; Jinek et al.
. (2013) eLife, 2, e00471).

「プロトスペーサー隣接モチーフ」及び「ＰＡＭ」という用語は、相互に互換的に使用
され、Ｃａｓタンパク質によるＤＮＡ切断の際に、Ｃａｓタンパク質に認識される配列を
指す。ＰＡＭの配列及び位置は、Ｃａｓタンパク質の種類によって異なる。例えば、Ｃａ
ｓ９タンパク質の場合、ＰＡＭは標的配列の３’側直後に隣接する必要がある。Ｃａｓ９
タンパク質に対応するＰＡＭの配列は、Ｃａｓ９タンパク質が由来する細菌種によって異
なっている。例えば、Ｓ．ｐｙｏｇｅｎｅｓのＣａｓ９タンパク質に対応するＰＡＭは「
ＮＧＧ」であり、Ｓ．ｔｈｅｒｍｏｐｈｉｌｕｓのＣａｓ９タンパク質に対応するＰＡＭ
は「ＮＮＡＧＡＡ」であり、Ｓ．ａｕｒｅｕｓのＣａｓ９タンパク質に対応するＰＡＭは
「ＮＮＧＲＲＴ」又は「ＮＮＧＲＲ（Ｎ）」であり、Ｎ．ｍｅｎｉｎｇｉｔｉｄｉｓのＣ
ａｓ９タンパク質に対応するＰＡＭは「ＮＮＮＮＧＡＴＴ」であり、Ｔ．ｄｅｎｔｉｃｏ
ｌａのＣａｓ９タンパク質に対応する「ＮＡＡＡＡＣ」である（「Ｒ」はＡ又はＧ；「Ｎ
」は、Ａ、Ｔ、Ｇ又はＣ）。 The terms "protospacer adjacent motif" and "PAM" are used interchangeably and refer to sequences recognized by Cas proteins upon DNA cleavage by the Cas proteins. The sequence and position of PAM differ depending on the type of Cas protein. For example, Ca
For s9 proteins, the PAM should immediately flank the 3' side of the target sequence. Cas9
The PAM sequence corresponding to the protein varies depending on the bacterial species from which the Cas9 protein is derived. For example, S. The PAM corresponding to the Cas9 protein of pyogenes is "
NGG", and S. PAM corresponding to thermophilus Cas9 protein
is "NNAGAA"; The PAM corresponding to the Cas9 protein of N. aureus is "NNGRRT" or "NNGRR(N)"; C for meningitidis
The PAM corresponding to the as9 protein is "NNNNGATT"; dentico
"NAAAC" corresponding to the Cas9 protein of Cas9 ("R" is A or G; "N
' is A, T, G or C).

「スペーサー配列」及び「ガイド配列」という用語は、相互に互換的に使用され、ガイ
ドＲＮＡに含まれる配列であって、標的配列の相補鎖と結合し得る配列を指す。通常、ス
ペーサー配列は、標的配列と同一の配列である（但し、標的配列中のＴは、スペーサー配
列ではＵとなる）。本発明の実施態様において、スペーサー配列は、標的配列に対して１
塩基又は複数塩基のミスマッチを含むことができる。複数塩基のミスマッチを含む場合、
隣接した位置にミスマッチを有していてもよく、離れた位置にミスマッチを有していても
よい。好ましい態様において、スペーサー配列は、標的配列に対して１～５塩基のミスマ
ッチを含み得る。特に好ましい態様において、スペーサー配列は、標的配列に対して１塩
基のミスマッチを含み得る。
ガイドＲＮＡにおいて、スペーサー配列は、ｃｒＲＮＡの５’側に配置される。 The terms "spacer sequence" and "guide sequence" are used interchangeably and refer to a sequence contained in a guide RNA that is capable of binding a complementary strand of a target sequence. The spacer sequence is usually the same sequence as the target sequence (provided that T in the target sequence becomes U in the spacer sequence). In an embodiment of the invention, the spacer sequence is 1
It can contain mismatches of a base or multiple bases. If it contains multiple base mismatches,
There may be mismatches at adjacent positions or mismatches at distant positions. In preferred embodiments, the spacer sequence may contain 1-5 base mismatches to the target sequence. In particularly preferred embodiments, the spacer sequence may contain a single base mismatch to the target sequence.
In the guide RNA, the spacer sequence is placed 5' to the crRNA.

ポリヌクレオチドに関して用いる「機能的に連結」という用語は、第一の塩基配列が第
二の塩基配列に十分に近くに配置され、第一の塩基配列が第二の塩基配列又は第二の塩基
配列の制御下の領域に影響を及ぼしうることを意味する。例えば、ポリヌクレオチドがプ
ロモーターに機能的に連結するとは、当該ポリヌクレオチドが、当該プロモーターの制御
下で発現するように連結されていることを意味する。 The term "operably linked" as used in reference to polynucleotides means that a first base sequence is positioned sufficiently close to a second base sequence such that the first base sequence is either the second base sequence or the second base sequence. means that it can affect areas under the control of For example, that a polynucleotide is operably linked to a promoter means that the polynucleotide is linked so that it is expressed under the control of the promoter.

「発現可能な状態」という用語は、ポリヌクレオチドが導入された細胞内で、該ポリヌ
クレオチドが転写され得る状態にあることを指す。
「発現ベクター」という用語は、対象ポリヌクレオチドを含むベクターであって、該ベ
クターを導入した細胞内で、対象ポリヌクレオチドを発現可能な状態にするシステムを備
えたベクターを指す。例えば、「Ｃａｓタンパク質の発現ベクター」とは、該ベクターを
導入した細胞内で、Ｃａｓタンパク質を発現可能なベクターを意味する。また、例えば、
「ガイドＲＮＡの発現ベクター」とは、該ベクターを導入した細胞内で、ガイドＲＮＡを
発現可能なベクターを意味する。 The term "expressible state" refers to a state in which a polynucleotide can be transcribed in a cell into which the polynucleotide has been introduced.
The term "expression vector" refers to a vector that contains a polynucleotide of interest and is equipped with a system that renders the polynucleotide of interest expressible in a cell into which the vector is introduced. For example, "expression vector for Cas protein" means a vector capable of expressing Cas protein in cells introduced with the vector. Also, for example,
“Guide RNA expression vector” means a vector capable of expressing a guide RNA in a cell into which the vector has been introduced.

本明細明細書において、塩基配列どうし又はアミノ酸配列どうしの配列同一性（又は相
同性）は、２つの塩基配列又はアミノ酸配列を、対応する塩基又はアミノ酸が最も多く一
致するように、挿入及び欠失に当たる部分にギャップを入れながら並置し、得られたアラ
インメント中のギャップを除く、塩基配列全体又はアミノ酸配列全体に対する一致した塩
基又はアミノ酸の割合として求められる。塩基配列又はアミノ酸配列どうしの配列同一性
は、当該技術分野で公知の各種相同性検索ソフトウェアを用いて求めることができる。塩
基配列の配列同一性の値（Ｉｄｅｎｔｉｔｙ値）は、特に限定されないが例えば、公知の
相同性検索ソフトウェアＵＣＳＣＧｅｎｏｍｅＢｒｏｗｓｅｒに搭載されているＢＬ
ＡＴ検索により得ることができる。 As used herein, sequence identity (or homology) between base sequences or between amino acid sequences refers to insertions and deletions of two base sequences or amino acid sequences such that the corresponding bases or amino acids match the most. The ratio of matching bases or amino acids to the entire base sequence or the entire amino acid sequence excluding the gaps in the resulting alignment is calculated. Sequence identity between nucleotide sequences or amino acid sequences can be determined using various homology search software known in the art. The sequence identity value (Identity value) of the base sequence is not particularly limited, but for example, BL installed in the known homology search software UCSC Genome Browser
Can be obtained by AT search.

本明細書では、便宜的にヒトゲノムの位置を参照するときに、リファレンスゲノムとし
てｈｇ３８ゲノム配列における位置を用いる。ｈｇ３８は、カリフォルニア大学サンタク
ルーズ校（ＵＣＳＣ）により２０１３年１２月にリリースされたリファレンスゲノムであ
る。リファレンスゲノムは、様々なゲノムを組み合わせて作成された参照用のゲノムであ
り、このゲノムを有するヒトが存在するというわけではない。しかし、ヒト個体のゲノム
ＤＮＡから解読された断片的な配列情報をリファレンスゲノムに照らすことによって解読
された断片的な配列情報を連結してコンピュータ上で一つながりの配列を構築し、これに
より当該ヒト個体のゲノムＤＮＡの配列を推定することができる。このようにして、リフ
ァレンスゲノムにヒト個体のゲノムＤＮＡの配列を対応付けることによって、ヒト個体等
の個体のゲノムＤＮＡを解読することが通常なされている。そして、ｈｇ３８ゲノム配列
の特定位置または特定領域に対応する位置または領域とは、具体的な配列の異なる別個体
のゲノムにおいて、当該特定位置または特定領域に紐付けられる位置または領域を意味す
る。具体的には、配列の同一性に基づき、当該位置または領域に特徴的な配列を有する位
置または領域が、ｈｇ３８ゲノム配列の特定位置または特定領域に対応する位置または領
域である。対応する位置は、２つのゲノムＤＮＡの部分配列のアラインメントにより決定
することができる。具体的な配列に相違がある場合であっても、オーソログの関係を有し
ていたり、配列同一性を有していてアラインメントをすることによって、２つのゲノムＤ
ＮＡの対応関係を決定することができる。遺伝子重複により生じたパラログが豊富な領域
では、単純な個別の配列に基づく配列の対応関係を決定するだけでは、２つのゲノム間の
真の対応関係を決定するには十分でない場合がある。このことが類似配列が集積した領域
の配列解読の難易度を増加させる。対応する配列の決定に際して、高い配列同一性を求め
ることで２つのゲノム間の対応関係を明らかにすることができる。また、特定領域が、複
数遺伝子を含む大きな領域である場合には、シンテニーを考慮することができる。シンテ
ニーとは、ゲノム上でのオーソログの物理的位置的関係が保存されていることをいう。個
人間および生物間でシンテニーを有し得る。したがって、シンテニーを考慮して特定領域
を決定することができる。 For convenience herein, locations in the hg38 genome sequence are used as reference genomes when referring to locations in the human genome. hg38 is a reference genome released in December 2013 by the University of California, Santa Cruz (UCSC). A reference genome is a reference genome created by combining various genomes, and it does not mean that a human having this genome exists. However, the fragmentary sequence information decoded from the genomic DNA of the human individual is linked to the reference genome, and the fragmentary sequence information decoded is linked to construct a series of sequences on a computer. The sequence of an individual's genomic DNA can be deduced. In this way, it is common practice to decode the genomic DNA of an individual such as a human individual by associating the sequence of the genomic DNA of the human individual with the reference genome. Further, the position or region corresponding to the specific position or specific region of the hg38 genome sequence means the position or region linked to the specific position or specific region in separate genomes with different specific sequences. Specifically, a position or region having a sequence characteristic of that position or region based on sequence identity is a position or region corresponding to a specific position or region of the hg38 genome sequence. Corresponding positions can be determined by alignment of the partial sequences of the two genomic DNAs. Even if there are differences in the specific sequences, the two genomes D can be aligned by having an orthologous relationship or having sequence identity.
NA correspondences can be determined. In regions rich in paralogs generated by gene duplication, determining sequence correspondences based on simple individual sequences may not be sufficient to determine true correspondences between two genomes. This increases the difficulty of sequence decoding in regions where similar sequences accumulate. When determining the corresponding sequences, the correspondence between the two genomes can be clarified by seeking high sequence identity. Synteny can also be considered when the specified region is a large region containing multiple genes. Synteny means preservation of the physical positional relationship of orthologs on the genome. Can have synteny between individuals and between organisms. Therefore, the specific region can be determined in consideration of synteny.

［ライブラリの作製方法］
ライブラリの作製方法は、細胞を用意することを含み得る。用意する細胞は、本開示の
改変前の細胞であり、改変後の細胞と比較する際の参照となり得ることから「リファレン
ス細胞」ということがある。細胞は、好ましくは、クローン化された細胞、株化された細
胞、または不死化細胞であり得る。ある好ましい態様では、細胞は、単一種類を含み得る
。本開示のライブラリーでは、特定遺伝子座の特定領域のみが目的配列に置き換わった細
胞のライブラリーが提供される。目的配列以外の配列を統一すると目的配列に置き換わる
ことの技術的意義を明らかにできるという観点で、細胞は単一種類の細胞からなることが
好ましい。単一種類の細胞は、クローン化された細胞である。 [Method for preparing a library]
A method of making a library can include providing a cell. The cells to be prepared are cells before the modification of the present disclosure, and can be used as a reference for comparison with cells after modification, so they are sometimes referred to as “reference cells”. Cells may preferably be cloned, established or immortalized cells. In some preferred embodiments, cells may comprise a single type. The library of the present disclosure provides a library of cells in which only specific regions of specific loci have been replaced with sequences of interest. From the viewpoint that the technical significance of substituting the target sequence can be clarified by unifying the sequences other than the target sequence, the cells are preferably composed of a single type of cell. A single type of cell is a cloned cell.

第一の作製形態の概要
概要は、図１に示される。
ライブラリの作製方法は、細胞対して下記ゲノム改変方法を用いて、第一の中間体細胞
を得ることを含み得る。第一の中間体細胞は、改変対象である遺伝子座に第一のアレルと
第二のアレルを含むゲノムを有する細胞において、前記第一のアレルと第二のアレルそれ
ぞれに選択マーカー遺伝子と標的核酸配列を含むカセットを有する細胞である。第一の中
間体細胞では、第一のアレルが有する選択マーカー遺伝子と第二のアレルが有する選択マ
ーカー遺伝子とは区別可能に異なり、前記標的核酸配列は、ゲノム改変システムの標的で
あり、ゲノム改変システムにより第一のアレルと第二のアレルとを区別可能に切断できる
ように設計されており、各選択マーカー遺伝子はネガティブ選択に用いることができるネ
ガティブ選択用マーカー遺伝子である。ネガティブ選択用マーカー遺伝子は、ポジティブ
選択用にも用いることができるもの（例えば、可視化マーカー遺伝子等）であってよい。
ライブラリの作製方法は、第一の中間体細胞から、改変細胞のライブラリー（「第一の改
変細胞のライブラリー」または単に「第一のライブラリー」ということがある）を得るこ
とを含み得る。第一のライブラリーに含まれる改変細胞を「第一の改変細胞」ということ
がある。
第一の中間体細胞は、当業者であれば適宜得ることができる。特に限定されないが、以
下に説明するゲノム改変方法を用いることにより、簡便に作製することができる。中間体
細胞からの改変細胞の取得は、改変塩基配列導入用ドナーＤＮＡの存在下で、第一のカセ
ット内またはその近傍を切断することにより達成され得る。 A schematic overview of the first fabrication mode is shown in FIG.
A method of making a library may comprise using the genome modification method described below on a cell to obtain a first intermediate cell. The first intermediate cell is a cell having a genome containing a first allele and a second allele at the genetic locus to be modified, wherein each of the first allele and the second allele contains a selectable marker gene and a target nucleic acid. A cell with a cassette containing the sequence. In the first intermediate cell, the selectable marker gene possessed by the first allele and the selectable marker gene possessed by the second allele are distinguishably different, the target nucleic acid sequence is a target of a genome modification system, and genome modification is performed. Each selection marker gene is a negative selection marker gene that can be used for negative selection, and is designed so that the system can distinguishably cleave the first allele and the second allele. A marker gene for negative selection may be one that can also be used for positive selection (eg, a visualization marker gene, etc.).
A method of making a library can include obtaining a library of modified cells (sometimes referred to as a "first modified cell library" or simply a "first library") from a first intermediate cell. . The modified cells contained in the first library are sometimes referred to as "first modified cells".
The first intermediate cell can be appropriately obtained by those skilled in the art. Although not particularly limited, it can be easily produced by using the genome modification method described below. Obtaining a modified cell from an intermediate cell can be achieved by cleaving within or near the first cassette in the presence of the donor DNA for introducing the modified nucleotide sequence.

第二の作製形態の概要
概要は、図１に示される。
ライブラリの作製方法は、細胞に対して下記ゲノム改変方法を用いて、第一の中間体細
胞を得ることを含み得る。ライブラリの作製方法は、第一の中間体細胞から第二の中間体
細胞を得ることを含みうる。ライブラリの作製方法は、第二の中間体細胞からライブラリ
を得ることを含みうる。ここで、第一の中間体細胞は、上述の通りである。第二の中間体
細胞は、改変対象である遺伝子座に第一のアレルと第二のアレルを含むゲノムを有する細
胞において、前記第一のアレルに選択マーカー遺伝子と標的核酸配列を含むカセットを有
し、第二のアレルには当該カセットを含まない、細胞である。第二の中間体細胞は、第一
の中間体細胞の第二のアレルから前記カセットを除去することにより作製することができ
る。カセットの除去は、ゲノム改変方法により当業者でれあれば適宜実施することができ
る。具体的には、切断時にカセットの上流と相同組換え可能な上流ホモロジーアームと当
該カセットの下流と相同組換え可能な下流ホモロジーアームとを有するドナーＤＮＡの存
在下で、第二のアレルに含まれる標的配列を切断することができるゲノム改変システムを
第一の中間体細胞に適用することにより、第一の中間体細胞から第二の中間体細胞を得る
ことができる。ライブラリの作製方法は、第二の中間体細胞から、改変細胞のライブラリ
ー（「第二の改変細胞のライブラリー」または単に「第二のライブラリー」ということが
ある）を得ることを含み得る。第二のライブラリーに含まれる改変細胞を「第二の改変細
胞」ということがある。第一の中間体細胞からの第二の中間体細胞の取得は、カセット除
去用ドナーＤＮＡの存在下で、第二のカセット内またはその近傍の配列を切断することに
よって達成され得る。第二の中間体細胞からの改変細胞の取得は、改変塩基配列導入用ド
ナーＤＮＡの存在下で、第一のカセットを切断することにより達成され得る。 A schematic overview of the second fabrication mode is shown in FIG.
A method of making a library may comprise using the genome modification method described below on a cell to obtain a first intermediate cell. A method of making a library can include obtaining a second intermediate cell from a first intermediate cell. A method of making a library can include obtaining a library from a second intermediate cell. Here, the first intermediate cell is as described above. The second intermediate cell is a cell having a genome containing a first allele and a second allele at the genetic locus to be modified, and has a cassette containing a selection marker gene and a target nucleic acid sequence at the first allele. However, the second allele is a cell that does not contain the cassette. A second intermediate cell can be made by removing the cassette from the second allele of the first intermediate cell. Removal of the cassette can be appropriately carried out by those skilled in the art by a genome modification method. Specifically, in the presence of a donor DNA having an upstream homology arm capable of homologous recombination upstream of the cassette and a downstream homology arm capable of homologous recombination downstream of the cassette at the time of cleavage, included in the second allele A second intermediate cell can be obtained from the first intermediate cell by applying a genome modification system capable of cleaving the target sequence to the first intermediate cell. A method of making a library can include obtaining a library of modified cells (sometimes referred to as a "second modified cell library" or simply a "second library") from a second intermediate cell. . The modified cells contained in the second library are sometimes referred to as "second modified cells". Obtaining a second intermediate cell from a first intermediate cell can be accomplished by cleaving sequences in or near the second cassette in the presence of cassette removal donor DNA. Obtaining the modified cell from the second intermediate cell can be achieved by cleaving the first cassette in the presence of the donor DNA for introducing the modified nucleotide sequence.

第二の作製形態の変形例
第二の作製形態では、第二の中間体細胞を第一の中間体細胞の第二のアレルから前記カ
セットを除去することにより作製した。第二の作製形態の変形例では、第一の中間体細胞
の第二のアレルにおけるカセットを除去して、第二のアレルを改変前の配列に戻す操作を
行い、前記第一のアレルに選択マーカー遺伝子と標的核酸配列を含むカセットを有し、第
二のアレルは、改変前の配列を有する第三の中間体細胞を得る。第二の作製形態の変形例
では、その後、第三の中間体細胞の第一のアレルに対してライブラリ作製用ドナーＤＮＡ
を適用して改変細胞のライブラリーを得る。この態様では、改変細胞のライブラリーに含
まれる改変細胞は、第一のアレルに改変塩基配列を含み、第二のアレルは改変前の配列を
有する。第一の中間体細胞の第二のアレルを改変前の配列に戻す操作は、第二のカセット
の上流と相同組換え可能な上流ホモロジーアームと当該カセットの下流と相同組換え可能
な下流ホモロジーアームとからなるドナーＤＮＡの存在下で、第二のカセット内またはそ
の近傍を切断することにより達成され得る。 Variations of Second Modification of Construction In a second modification of construction , a second intermediate cell was generated by removing the cassette from the second allele of the first intermediate cell. In a modification of the second production mode, the cassette in the second allele of the first intermediate cell is removed, and the second allele is returned to the sequence before modification, and the first allele is selected. A third intermediate cell is obtained having a cassette containing a marker gene and a target nucleic acid sequence, the second allele having the sequence prior to modification. In a variation of the second production mode, then donor DNA for library production against the first allele of the third intermediate cell
is applied to obtain a library of modified cells. In this embodiment, the modified cells contained in the library of modified cells contain the modified base sequence in the first allele and the second allele has the sequence before modification. The operation of returning the second allele of the first intermediate cell to the sequence before modification includes an upstream homology arm capable of homologous recombination upstream of the second cassette and a downstream homology arm capable of homologous recombination downstream of the cassette can be achieved by cleaving within or near the second cassette in the presence of donor DNA consisting of:

本開示では、第一の中間体細胞、第二の中間体細胞、これらの中間体細胞を含む組成物
、第一の改変細胞、第一のライブラリー、第二の改変細胞、および第二のライブラリーが
提供される。 The present disclosure provides a first intermediate cell, a second intermediate cell, a composition comprising these intermediate cells, a first modified cell, a first library, a second modified cell, and a second A library is provided.

一般的に、ゲノム編集においては、標的領域を有するゲノムに対して、標的領域の上流
と相同組換え可能な上流ホモロジーアーム（例えば、相補的配列を有する）と当該標的領
域の下流と相同組換え可能な下流ホモロジーアーム（例えば、相補的配列を有する）を有
するドナーＤＮＡの存在下で、典型的には当該標的領域内に切断を生じさせることにより
、標的領域がドナーＤＮＡ中の上流ホモロジーアームと下流ホモロジーアームで挟まれた
配列に置き換わる。上流ホモロジーアームと下流ホモロジーアームで挟まれた配列が存在
しない場合には、標的領域が欠失する（またはシームレスに欠失する）。ゲノム編集にお
いては、標的領域内に複数箇所の切断を生じさせてもよい。典型的には、上流ホモロジー
アームに近接した標的領域内と下流ホモロジーアームに近接した標的領域内のそれぞれに
切断を生じさせることが有益である。 Generally, in genome editing, for a genome having a target region, an upstream homology arm (for example, having a complementary sequence) capable of homologous recombination upstream of the target region and a homologous recombination downstream of the target region In the presence of donor DNA that has a possible downstream homology arm (e.g., has a complementary sequence), the target region is isolated from the upstream homology arm in the donor DNA, typically by causing cleavage within the target region. It replaces the sequence flanked by the downstream homology arms. If the sequence flanked by the upstream homology arm and the downstream homology arm is absent, the target region is deleted (or seamlessly deleted). In genome editing, multiple cuts may be generated within the target region. Typically, it is beneficial to have the cleavage occur within the target region adjacent to the upstream homology arm and within the target region adjacent to the downstream homology arm, respectively.

ある好ましい態様では、第一の中間体細胞においては、第一のアレルおよび第二のアレ
ルは、標的領域のカセットによる置換がそれぞれなされ、第一のアレルおよび第二のアレ
ルは、当該標的領域の欠失を有していてもよい。また、ある好ましい態様では、第一のア
レルおよび第二のアレルは、標的領域へのカセットの挿入がなされ、第一のアレルおよび
第二のアレルは、当該標的領域の欠失を有しなくてもよい。ある好ましい態様では、標的
領域のカセットの挿入は、機能を有しない標的領域中になされ、したがって、当該標的領
域の破壊に伴う機能低下または欠損を伴わない。 In one preferred embodiment, in the first intermediate cell, the first allele and the second allele are each replaced by a cassette of the target region, and the first allele and the second allele are It may have deletions. Also, in certain preferred embodiments, the first allele and the second allele have an insertion of the cassette into the target region, and the first allele and the second allele have no deletion of the target region. good too. In certain preferred embodiments, the insertion of the target region cassette is into a non-functional target region, and thus is not accompanied by loss of function or loss associated with disruption of the target region.

本実施形態のゲノム改変方法に用いる細胞は、特に限定されず、１倍体または２倍体以
上の染色体ゲノムを有する細胞であればよい。細胞は、２倍体であってもよく、３倍体で
あってもよく、４倍体以上であってもよい。細胞としては、特に限定されないが、真核生
物の細胞が挙げられる。細胞は、植物細胞であってもよく、動物細胞であってもよく、真
菌細胞であってもよい。動物細胞は、特に限定されないが、ヒト、ヒト以外の哺乳動物（
例えば、サルなどの非ヒト霊長類、イヌ、ネコ、ウシ、ウマ、ヒツジ、ヤギ、ラマ、齧歯
類などの非ヒト哺乳類）、鳥類、爬虫類、両生類、魚類、その他の脊椎動物のいずれの細
胞であってもよい。 Cells used in the genome modification method of the present embodiment are not particularly limited as long as they have a haploid, diploid or higher chromosomal genome. The cells may be diploid, triploid, tetraploid or higher. Cells include, but are not limited to, eukaryotic cells. The cells may be plant cells, animal cells, or fungal cells. Animal cells include, but are not limited to, humans, non-human mammals (
non-human primates such as monkeys, non-human mammals such as dogs, cats, cows, horses, sheep, goats, llamas, rodents), birds, reptiles, amphibians, fish, and other vertebrate cells may be

前記細胞としては、例えば、多能性細胞（例えば、胚性幹細胞（ＥＳ細胞）および誘導
多能性幹細胞（ｉＰＳ細胞）などの多能性幹細胞）、造血幹細胞、造血前駆細胞、骨髄細
胞、脾臓細胞、骨髄系共通前駆細胞、免疫細胞（例えば、Ｔ細胞、Ｂ細胞、ＮＫ細胞、Ｎ
ＫＴ細胞、マクロファージ、単球、好中球、好酸球、好塩基球）、赤血球、巨核球、心臓
細胞、心筋細胞、心臓線維芽細胞、膵β細胞、角膜細胞（例えば、角膜上皮細胞、および
角膜内皮細胞）、表皮細胞、真皮細胞、脂肪細胞、軟骨細胞、骨細胞、破骨細胞、骨芽細
胞、間葉系幹細胞（例えば、脂肪由来、骨髄由来、胎盤由来および臍帯由来）、歯髄細胞
、腱細胞、靭帯細胞、神経細胞（例えば、錐体細胞、星状細胞、および顆粒細胞）、グリ
ア細胞、プルキンエ細胞、網膜神経節細胞、網膜細胞、視神経細胞、および神経幹細胞が
挙げられる。ある好ましい態様では、細胞は初代細胞であり得る。ある好ましい態様では
、細胞は不死化された細胞、または細胞株であり得る。ある好ましい態様では、細胞はヒ
トの細胞である。 Examples of the cells include pluripotent cells (e.g., pluripotent stem cells such as embryonic stem cells (ES cells) and induced pluripotent stem cells (iPS cells)), hematopoietic stem cells, hematopoietic progenitor cells, bone marrow cells, and spleen. cells, common myeloid progenitor cells, immune cells (e.g., T cells, B cells, NK cells, N
KT cells, macrophages, monocytes, neutrophils, eosinophils, basophils), erythrocytes, megakaryocytes, cardiac cells, cardiomyocytes, cardiac fibroblasts, pancreatic β cells, corneal cells (e.g., corneal epithelial cells, and corneal endothelial cells), epidermal cells, dermal cells, adipocytes, chondrocytes, osteocytes, osteoclasts, osteoblasts, mesenchymal stem cells (e.g., adipose-derived, bone marrow-derived, placental- and umbilical cord-derived), dental pulp cells, tendon cells, ligament cells, nerve cells (eg, cone cells, astrocytes, and granule cells), glial cells, Purkinje cells, retinal ganglion cells, retinal cells, optic nerve cells, and neural stem cells. In some preferred embodiments, the cells can be primary cells. In some preferred embodiments, the cells can be immortalized cells, or cell lines. In one preferred embodiment, the cells are human cells.

ある態様では、細胞は、単離された細胞、クローン化された細胞、または細胞株であり
得る。細胞は不死化された細胞であってもよい。ある好ましい態様では、細胞は、クロー
ン化された細胞である。ある好ましい態様では、細胞は、細胞株である。ある好ましい態
様では、細胞は不死化された細胞である。ある態様では、細胞は、初代体細胞である。細
胞は、その使用目的に応じて適宜選択されることを当業者は理解する。 In some aspects, the cells can be isolated cells, cloned cells, or cell lines. The cells may be immortalized cells. In one preferred aspect, the cell is a cloned cell. In one preferred aspect, the cell is a cell line. In one preferred embodiment, the cells are immortalized cells. In one aspect, the cell is a primary somatic cell. Those skilled in the art will understand that cells are appropriately selected according to their intended use.

ある態様では、細胞（またはライブラリー）は、細胞凍結保護液中で凍結されうる。あ
る態様では、前記細胞を含む細胞凍結保護液は、非凍結状態または好ましくは凍結状態で
提供され得る。凍結状態の前記細胞を含む細胞凍結保護液（「フリーズストック」ともい
う）は、リサーチセルバンク（ＲＣＢ）、マスターセルバンク（ＭＣＢ）、またはワーキ
ングセルバンク（ＷＣＢ）として用いることができる。したがって、本発明では、上記フ
リーズストックを含む、リサーチセルバンク（ＲＣＢ）、マスターセルバンク（ＭＣＢ）
、またはワーキングセルバンク（ＷＣＢ）が提供される。 In some embodiments, cells (or libraries) can be frozen in a cell cryoprotectant. In some embodiments, the cell cryoprotectant containing said cells may be provided in an unfrozen or preferably frozen state. A cell cryoprotectant (also referred to as a "freeze stock") containing the cells in a frozen state can be used as a research cell bank (RCB), master cell bank (MCB), or working cell bank (WCB). Therefore, in the present invention, a research cell bank (RCB), a master cell bank (MCB) containing the freeze stock
, or a working cell bank (WCB) is provided.

［ゲノム改変方法］
以下に記載する方法（例えば、ＵＫｉＳ；国際公開第２０２１／２０６０５４号参照）
は、上記細胞の作製に好ましく用いることができる。この方法は、特に図１に示される工
程Ｓ１において、細胞から第一の中間体細胞を作製する際に好ましく用いられる。 [Genome modification method]
Methods described below (e.g. UKiS; see WO2021/206054)
can be preferably used for producing the above cells. This method is preferably used when producing a first intermediate cell from the cell, particularly in step S1 shown in FIG.

１実施態様において、本発明では、第一の中間体細胞の作製に、以下（ａ）及び（ｂ）
を含む方法を用いることができる。
（ａ）下記（ｉ）及び（ｉｉ）を、２つ以上のアレルを含む細胞に導入して、前記２つ以
上のアレルそれぞれに選択マーカー遺伝子を導入する工程と、
（ｉ）前記染色体ゲノムの２つ以上のアレル中の標的領域を標的とし、当該標的領域を切
断することができる配列特異的核酸切断分子、又は前記配列特異的核酸切断分子をコード
するポリヌクレオチドを含むゲノム改変システム、
（ｉｉ）２種以上の選択マーカー用ドナーＤＮＡであって、それぞれが前記標的領域の上
流側の塩基配列と相同組換え可能な塩基配列を有する上流ホモロジーアームと、前記標的
領域の下流側の塩基配列と相同組換え可能な塩基配列を有する下流ホモロジーアームとを
有し、かつ、上流ホモロジーアームと下流ホモロジーアームの間に、選択マーカー遺伝子
の塩基配列を含み、前記２種以上の選択マーカー用ドナーＤＮＡは、相互に区別可能に異
なる選択マーカー遺伝子をそれぞれ有し、前記選択マーカー遺伝子は、選択マーカー用ド
ナーＤＮＡの種類毎に固有であり、前記選択マーカー用ドナーＤＮＡの種類数は、ゲノム
改変の対象とする前記アレルの数と同数以上である、２種以上の選択マーカー用ドナーＤ
ＮＡ、
（ｂ）前記工程（ａ）の後、前記２以上のアレルに対して異なる種類の選択マーカー用ド
ナーＤＮＡがそれぞれ相同組み換えすることによって、当該２以上のアレルにそれぞれ区
別可能に異なる固有の選択マーカー遺伝子が導入され、導入された当該区別可能に異なる
選択マーカー遺伝子のすべてを発現する細胞を選択する工程（ポジティブ選択のための工
程）と、
を含む、方法であり得る。 In one embodiment, in the present invention, the following (a) and (b) are used to prepare the first intermediate cell
can be used.
(a) introducing the following (i) and (ii) into a cell containing two or more alleles to introduce a selectable marker gene into each of the two or more alleles;
(i) a sequence-specific nucleic acid cleaving molecule capable of targeting and cleaving a target region in two or more alleles of said chromosomal genome, or a polynucleotide encoding said sequence-specific nucleic acid cleaving molecule; genome modification system, including
(ii) two or more selection marker donor DNAs, each of which has an upstream homology arm having a base sequence capable of homologous recombination with the base sequence upstream of the target region; and bases downstream of the target region. a sequence and a downstream homology arm having a base sequence capable of homologous recombination, and comprising a base sequence of a selectable marker gene between the upstream homology arm and the downstream homology arm, wherein the two or more types of selectable marker donors Each of the DNAs has a selectable marker gene that is distinguishably different from each other, the selectable marker gene is unique for each type of selectable marker donor DNA, and the number of types of the selectable marker donor DNA is determined by genome modification. Two or more selectable marker donors D, which are the same or more than the number of target alleles
NA,
(b) after the step (a), by homologous recombination of different types of donor DNAs for selection markers with respect to the two or more alleles, unique selection markers that are distinguishably different from each other for the two or more alleles; selecting cells into which the gene has been introduced and expressing all of the introduced distinctly different selectable marker genes (a step for positive selection);
can be a method comprising:

（工程（ａ））
工程（ａ）では、前記（ｉ）及び（ｉｉ）を、染色体を含む細胞に導入する。 (Step (a))
In step (a), the above (i) and (ii) are introduced into cells containing chromosomes.

＜（ｉ）ゲノム改変システム＞
「ゲノム改変システム」とは、所望の標的領域を改変することが可能な分子機構を意味
する。ゲノム改変システムは、染色体ゲノムの標的領域を標的とする配列特異的核酸切断
分子、又は前記配列特異的核酸切断分子をコードするポリヌクレオチドを含む。より具体
的には、ゲノム改変システムは、標的領域中あるいは近傍の少なくとも１つ、好ましくは
２つを切断することができる。 <(i) genome modification system>
By "genomic modification system" is meant a molecular mechanism capable of modifying a desired target region. The genome modification system comprises a sequence-specific nucleic acid cleaving molecule targeted to a target region of the chromosomal genome, or a polynucleotide encoding said sequence-specific nucleic acid cleaving molecule. More specifically, the genome modification system can cut at least one, preferably two, in or near the target region.

ゲノム改変の対象となる標的領域は、１以上のアレルを有するゲノム上の任意の領域と
することができる。標的領域のサイズは、特に限定されない。本実施形態のゲノム改変方
法では、従来よりも大きなサイズの領域を改変することができる。標的領域は、例えば、
１０ｋｂｐ以上であってもよい。標的領域は、例えば、１００ｂｐ以上、２００ｂｐ以上
、４００ｂｐ以上、８００ｂｐ以上、１ｋｂｐ以上、２ｋｂｐ以上、３ｋｂｐ以上、４ｋ
ｂｐ以上、５ｋｂｐ以上、８ｋｂｐ以上、１０ｋｂｐ以上、２０ｋｂｐ以上、４０ｋｂｐ
以上、８０ｋｂｐ以上、１００ｋｂｐ以上、２００ｋｂｐ以上、３００ｋｂｐ、４００ｋ
ｂｐ以上、５００ｋｂｐ以上、６００ｋｂｐ以上、７００ｋｂｐ以上、８００ｋｂｐ以上
、９００ｋｂｐ以上、もしくは１Ｍｂｐ以上、または上記のいずれかの数値以下であって
もよい。ある態様では、改変された細胞において、上記標的領域が欠失している。 A target region for genome modification can be any region on the genome having one or more alleles. The size of the target region is not particularly limited. The genome modification method of this embodiment can modify a region of a larger size than conventionally. The target region may be, for example,
It may be 10 kbp or more. The target region may be, for example, 100 bp or more, 200 bp or more, 400 bp or more, 800 bp or more, 1 kbp or more, 2 kbp or more, 3 kbp or more, 4 kbp or more.
bp or more, 5 kbp or more, 8 kbp or more, 10 kbp or more, 20 kbp or more, 40 kbp
80kbp or more, 100kbp or more, 200kbp or more, 300kbp, 400kbp
bp or more, 500 kbp or more, 600 kbp or more, 700 kbp or more, 800 kbp or more, 900 kbp or more, or 1 Mbp or more, or any of the above numbers or less. In one aspect, the target region is deleted in the modified cell.

配列特異的核酸切断分子は、配列特異的核酸切断活性を有する分子であれば、特に限定
されず、合成有機化合物であってもよく、タンパク質等の生体高分子化合物であってもよ
い。配列特異的部位切断活性を有するタンパク質としては、例えば、配列特異的エンドヌ
クレアーゼが挙げられる。 The sequence-specific nucleic acid cleaving molecule is not particularly limited as long as it is a molecule having sequence-specific nucleic acid cleaving activity, and may be a synthetic organic compound or a biopolymer such as a protein. Proteins having sequence-specific site-cleaving activity include, for example, sequence-specific endonucleases.

配列特異的エンドヌクレアーゼは、所定の配列で核酸を切断することができる酵素であ
る。配列特異的エンドヌクレアーゼは、所定の配列で、２本鎖ＤＮＡを切断することがで
きる。配列特異的エンドヌクレアーゼとしては、特に限定されないが、例えば、ジンクフ
ィンガーヌクレアーゼ（Ｚｉｎｃｆｉｎｇｅｒｎｕｃｌｅａｓｅ（ＺＦＮ））、ＴＡ
ＬＥＮ（Ｔｒａｎｓｃｒｉｐｔｉｏｎａｃｔｉｖａｔｏｒ－ｌｉｋｅｅｆｆｅｃｔｏ
ｒｎｕｃｌｅａｓｅ）、Ｃａｓタンパク質等が挙げられるが、これらに限定されない。 A sequence-specific endonuclease is an enzyme capable of cleaving nucleic acids at a given sequence. A sequence-specific endonuclease can cleave double-stranded DNA at a given sequence. Examples of sequence-specific endonucleases include, but are not limited to, zinc finger nuclease (ZFN), TA
LEN (Transcription activator-like effect)
r nuclease), Cas protein and the like, but are not limited to these.

ＺＦＮは、ジンクフィンガーアレイを含む結合ドメインにコンジュゲートした核酸切断
ドメインを含む人工ヌクレアーゼである。切断ドメインとしては、ＩＩ型制限酵素Ｆｏｋ
Ｉの切断ドメインが挙げられる。標的配列を切断可能なジンクフィンガーヌクレアーゼの
設計は、公知の方法で行うことができる。 ZFNs are artificial nucleases containing a nucleic acid cleaving domain conjugated to a binding domain containing a zinc finger array. As a cleavage domain, the type II restriction enzyme Fok
The cleavage domain of I. A zinc finger nuclease capable of cleaving a target sequence can be designed by a known method.

ＴＡＬＥＮは、ＤＮＡ切断ドメイン（例えば、ＦｏｋＩドメイン）に加えて転写活性化
因子様（ＴＡＬ）エフェクターのＤＮＡ結合ドメインを含む人工ヌクレアーゼである。標
的配列を切断可能なＴＡＬＥ構築物の設計は、公知の方法で行うことができる（例えば、
Ｚｈａｎｇ，Ｆｅｎｇｅｔ．ａｌ．（２０１１）ＮａｔｕｒｅＢｉｏｔｅｃ
ｈｎｏｌｏｇｙ２９（２））。 TALENs are artificial nucleases that contain transcriptional activator-like (TAL) effector DNA-binding domains in addition to a DNA-cleaving domain (eg, a FokI domain). A TALE construct capable of cleaving a target sequence can be designed by a known method (e.g.,
Zhang, Feng et. al. (2011) Nature Biotec
hnology 29 (2)).

配列特異的核酸切断分子として、Ｃａｓタンパク質を用いる場合、ゲノム改変システム
は、ＣＲＩＳＰＲ／Ｃａｓシステムを含む。すなわち、ゲノム改変システムは、Ｃａｓタ
ンパク質と、標的領域内の塩基配列に相同な塩基配列を有するガイドＲＮＡを含むことが
好ましい。ガイドＲＮＡは、スペーサー配列として、標的領域内の配列（標的配列）と相
同な配列を含んでいればよい。ガイドＲＮＡは、標的領域内のＤＮＡに結合できるもので
ればよく、標的配列と完全に同一の配列を有している必要はない。この結合は、細胞核内
の生理的条件下で形成されればよい。ガイドＲＮＡは、例えば、標的配列に対して、例え
ば、０～３塩基のミスマッチを含むことができる。前記ミスマッチの数は、０～２塩基が
好ましく、０～１がより好ましく、ミスマッチを有しないことがさらに好ましい。ガイド
ＲＮＡの設計は、公知の方法に基づいて行うことができる。ゲノム改変システムは、ＣＲ
ＩＳＰＲ／Ｃａｓシステムであることが好ましく、Ｃａｓタンパク質とガイドＲＮＡを含
むことが好ましい。Ｃａｓタンパク質は、Ｃａｓ９タンパク質であることが好ましい。 When using a Cas protein as the sequence-specific nucleic acid cleaving molecule, the genome modification system includes the CRISPR/Cas system. That is, the genome modification system preferably contains a Cas protein and a guide RNA having a nucleotide sequence homologous to the nucleotide sequence in the target region. The guide RNA may contain, as a spacer sequence, a sequence homologous to the sequence in the target region (target sequence). The guide RNA only needs to be able to bind to DNA within the target region, and does not need to have a completely identical sequence to the target sequence. This bond may be formed under physiological conditions within the cell nucleus. The guide RNA can contain, eg, 0-3 base mismatches, eg, to the target sequence. The number of mismatches is preferably 0 to 2 bases, more preferably 0 to 1 bases, and more preferably no mismatches. Guide RNA can be designed based on known methods. The genome modification system is CR
It is preferably an ISPR/Cas system, preferably comprising a Cas protein and a guide RNA. Preferably, the Cas protein is the Cas9 protein.

配列特異的エンドヌクレアーゼは、タンパク質として細胞に導入してもよく、配列特異
的エンドヌクレアーゼをコードするポリヌクレオチドとして細胞に導入してもよい。例え
ば、配列特異的エンドヌクレアーゼのｍＲＮＡを導入してもよく、配列特異的エンドヌク
レアーゼの発現ベクターを導入してもよい。発現ベクターにおいて、配列特異的エンドヌ
クレアーゼのコード配列（配列特異的エンドヌクレアーゼ遺伝子）は、プロモーターに機
能的に連結されている。プロモーターは、特に限定されず、例えば、ｐｏｌＩＩ系プロ
モーターを各種使用することができる。ｐｏｌＩＩ系プロモーターとしては、特に制限
されないが、例えばＣＭＶプロモーター、ＥＦ１プロモーター（ＥＦ１αプロモーター）
、ＳＶ４０プロモーター、ＭＳＣＶプロモーター、ｈＴＥＲＴプロモーター、βアクチン
プロモーター、ＣＡＧプロモーター、ＣＢｈプロモーター等が挙げられる。 A sequence-specific endonuclease may be introduced into the cell as a protein or as a polynucleotide encoding the sequence-specific endonuclease. For example, a sequence-specific endonuclease mRNA may be introduced, or a sequence-specific endonuclease expression vector may be introduced. In expression vectors, a sequence-specific endonuclease coding sequence (sequence-specific endonuclease gene) is operably linked to a promoter. The promoter is not particularly limited, and for example, various pol II promoters can be used. Examples of pol II promoters include, but are not limited to, CMV promoter, EF1 promoter (EF1α promoter)
, SV40 promoter, MSCV promoter, hTERT promoter, β actin promoter, CAG promoter, CBh promoter and the like.

プロモーターは、誘導性プロモーターであってもよい。誘導性プロモーターは、プロモ
ーターを駆動する誘導因子の存在下でのみ、当該プロモーターに機能的に連結されたポリ
ヌクレオチドの発現を誘導することができるプロモーターである。誘導性プロモーターと
しては、ヒートショックプロモーターなどの加熱により遺伝子発現を誘導するプロモータ
ーが挙げられる。また、誘導性プロモーターには、プロモーターを駆動する誘導因子が薬
剤であるプロモーターが挙げられる。このような薬剤誘導性プロモーターとしては、例え
ば、Ｃｕｍａｔｅオペレーター配列、λオペレーター配列（例えば、１２×λＯｐ）、テ
トラサイクリン系誘導性プロモーター等が挙げられる。テトラサイクリン系誘導性プロモ
ーターとしては、例えば、テトラサイクリンもしくはその誘導体（例えば、ドキシサイク
リン）、またはリバーステトラサイクリン制御性トランス活性化因子（ｒｔＴＡ）の存在
下で遺伝子発現を駆動するプロモーターが挙げられる。テトラサイクリン系誘導性プロモ
ーターとしては、例えば、ＴＲＥ３Ｇプロモーターが挙げられる。 The promoter may be an inducible promoter. An inducible promoter is a promoter that can induce expression of a polynucleotide operably linked to it only in the presence of an inducer driving the promoter. Inducible promoters include promoters that induce gene expression by heating, such as heat shock promoters. Inducible promoters also include promoters in which the inducer driving the promoter is a drug. Such drug-inducible promoters include, for example, Cumate operator sequences, λ operator sequences (eg, 12×λOp), tetracycline system-inducible promoters, and the like. Tetracycline-based inducible promoters include, for example, promoters that drive gene expression in the presence of tetracycline or its derivatives (eg, doxycycline), or reverse tetracycline-regulated transactivator (rtTA). Examples of tetracycline-inducible promoters include the TRE3G promoter.

発現ベクターは、公知のものを特に制限なく用いることができる。発現ベクターとして
は、例えば、プラスミドベクター、ウイルスベクターが挙げられる。配列特異的エンドヌ
クレアーゼがＣａｓタンパク質である場合、発現ベクターは、Ｃａｓタンパク質のコード
配列（Ｃａｓタンパク質遺伝子）に加えて、ガイドＲＮＡコード配列（ガイドＲＮＡ遺伝
子）及びを含んでいてもよい。この場合、ガイドＲＮＡコード配列（ガイドＲＮＡ遺伝子
）は、ｐｏｌＩＩＩ系プロモーターに機能的にされていることが好ましい。ｐｏｌＩ
ＩＩ系プロモーターとしては、例えば、マウス及びヒトのＵ６－ｓｎＲＮＡプロモーター
、ヒトＨ１－ＲＮａｓｅＰＲＮＡプロモーター、ヒトバリン－ｔＲＮＡプロモーター
等が挙げられる。 Any known expression vector can be used without particular limitation. Examples of expression vectors include plasmid vectors and virus vectors. When the sequence-specific endonuclease is a Cas protein, the expression vector may contain, in addition to the Cas protein coding sequence (Cas protein gene), a guide RNA coding sequence (guide RNA gene). In this case, the guide RNA coding sequence (guide RNA gene) is preferably functionalized to a pol III-based promoter. pol I
II system promoters include, for example, mouse and human U6-snRNA promoters, human H1-RNase P RNA promoters, human valine-tRNA promoters and the like.

＜（ｉｉ）選択マーカー用ドナーＤＮＡ＞
選択マーカー用ドナーＤＮＡは、選択マーカーを標的領域にノックインするためのドナ
ーＤＮＡである。選択マーカー用ドナーＤＮＡは、標的領域の上流側に隣接する塩基配列
と相同な塩基配列を有する上流ホモロジーアームと、標的領域の下流側に隣接する塩基配
列と相同な塩基配列を有する下流ホモロジーアームとの間に、１以上の選択マーカー遺伝
子の塩基配列を含む。 <(ii) Donor DNA for selection marker>
Selectable marker donor DNA is donor DNA for knocking in a selectable marker into a target region. The donor DNA for selection marker has an upstream homology arm having a nucleotide sequence homologous to the nucleotide sequence adjacent to the upstream side of the target region, and a downstream homology arm having a nucleotide sequence homologous to the nucleotide sequence adjacent to the downstream side of the target region. contains the base sequences of one or more selectable marker genes.

選択マーカー用ドナーＤＮＡは、特に限定されないが、例えば、１ｋｂ以上、２ｋｂ以
上、３ｋｂ以上、４ｋｂ以上、５ｋｂ以上、６ｋｂ以上、７ｋｂ以上、８ｋｂ以上、９ｋ
ｂ以上、９．５ｋｂ以上、または１０ｋｂ以上の長さを有し得る。選択マーカー用ドナー
ＤＮＡは、特に限定されないが、例えば、５０ｋｂ以下、４５ｋｂ以下、４０ｋｂ以下、
３５ｋｂ以下、３０ｋｂ以下、２５ｋｂ以下、２０ｋｂ以下、１５ｋｂ以下、１４ｋｂ以
下、１３ｋｂ以下、１２ｋｂ以下、１１ｋｂ以下、１０ｋｂ以下、９ｋｂ以下、８ｋｂ以
下、７ｋｂ以下、６ｋｂ以下、５ｋｂ以下、または４ｋｂ以下の長さを有し得る。 The donor DNA for the selection marker is not particularly limited, but is, for example, 1 kb or more, 2 kb or more, 3 kb or more, 4 kb or more, 5 kb or more, 6 kb or more, 7 kb or more, 8 kb or more, 9 kb or more.
b or longer, 9.5 kb or longer, or 10 kb or longer. The donor DNA for the selection marker is not particularly limited, but is, for example, 50 kb or less, 45 kb or less, 40 kb or less,
35 kb or less, 30 kb or less, 25 kb or less, 20 kb or less, 15 kb or less, 14 kb or less, 13 kb or less, 12 kb or less, 11 kb or less, 10 kb or less, 9 kb or less, 8 kb or less, 7 kb or less, 6 kb or less, 5 kb or less, or 4 kb or less in length can have

「選択マーカー」とは、その発現の有無に基づいて、細胞を選択することができるタン
パク質を意味する。選択マーカー遺伝子は、選択マーカーをコードする遺伝子である。選
択マーカー発現細胞及び非発現細胞が混在する細胞集団において、選択マーカー発現細胞
を選択する場合、当該選択マーカーを「ポジティブ選択マーカー」または「ポジティブ選
択用の選択マーカー」という。選択マーカー発現細胞及び非発現細胞が混在する細胞集団
において、選択マーカー非発現細胞を選択する場合、当該選択マーカーを「ネガティブ選
択マーカー」または「ネガティブ選択用の選択マーカー」という。選択マーカーが相互に
異なるとは、相互に区別できること（例えば、区別可能に異なること）を意味し、例えば
、選択マーカーが導入された細胞に付与する薬剤耐性の性質などの生理学的性質またはそ
の他の物理化学的性質において少なくとも相互に区別できることを意味する。すなわち、
選択マーカーが相互に異なるとは、異なる複数の選択マーカーが他の選択マーカーと区別
可能に検出できること、または他の選択マーカーとは区別可能に薬剤選択できることを意
味する。また、前記選択マーカー遺伝子が選択マーカー用ドナーＤＮＡの種類毎に固有で
あるとは、選択マーカー用ドナーＤＮＡの１種が有する選択マーカー遺伝子が、上記以外
の種類の選択マーカー用ドナーＤＮＡには含まれないこと、または、複数種類のドナーＤ
ＮＡに含まれている場合には、同時に２種類以上のドナーＤＮＡから発現しないように構
成されていることを意味する。このとき、２種類以上のドナーＤＮＡは、選択マーカー以
外は同一であってもよく、選択マーカー以外の配列および／または構成において相違があ
ってもよい。 By "selectable marker" is meant a protein that allows cells to be selected based on the presence or absence of its expression. A selectable marker gene is a gene that encodes a selectable marker. In a cell population in which selectable marker-expressing cells and non-expressing cells coexist, when selectable marker-expressing cells are selected, the selectable marker is referred to as a "positive selectable marker" or "selective marker for positive selection". In a cell population in which selectable marker-expressing cells and non-expressing cells coexist, when selectable marker-non-expressing cells are selected, the selectable marker is referred to as a "negative selectable marker" or "selective marker for negative selection". The selectable markers differ from each other means that they are distinguishable from each other (e.g., distinguishably different), and may have physiological or other properties such as drug resistance properties that confer on cells into which the selectable markers have been introduced. It means at least distinguishable from each other in physicochemical properties. i.e.
The selection markers being different from each other means that multiple selection markers that are different can be detected in a distinguishable manner from other selection markers, or that a drug can be selected in a manner distinguishable from other selection markers. In addition, the expression that the selectable marker gene is unique to each type of selectable marker donor DNA means that the selectable marker gene possessed by one type of selectable marker donor DNA is not included in the types of selectable marker donor DNAs other than the above. or multiple types of donor D
When it is contained in NA, it means that it is constructed so as not to be expressed from two or more donor DNAs at the same time. At this time, the two or more donor DNAs may be identical except for the selectable marker, or may differ in sequence and/or composition other than the selectable marker.

ポジティブ選択マーカーは、それを発現する細胞を選択可能なものであれば、特に限定
されない。ポジティブ選択マーカー遺伝子としては、例えば、薬剤耐性遺伝子、蛍光タン
パク質遺伝子、発光酵素遺伝子、発色酵素遺伝子等が挙げられる。 The positive selectable marker is not particularly limited as long as it can select cells expressing it. Positive selectable marker genes include, for example, drug resistance genes, fluorescent protein genes, luciferase genes, chromogenic enzyme genes and the like.

ネガティブ選択マーカーは、それを発現しない細胞を選択可能なものであれば、特に限
定されない。ネガティブ選択マーカー遺伝子としては、例えば、自殺遺伝子（チミジンカ
イネース等）、蛍光タンパク質遺伝子、発光酵素遺伝子、発色酵素遺伝子等が挙げられる
。ネガティブ選択マーカー遺伝子が、細胞の生存に負の影響を与える遺伝子（例えば、自
殺遺伝子）である場合、当該ネガティブ選択マーカー遺伝子は、誘導性プロモーターに機
能的に連結され得る。誘導性プロモーターに機能的に連結することで、ネガティブ選択マ
ーカー遺伝子を有する細胞を除去したいときにのみ、ネガティブ選択マーカー遺伝子を発
現させることができる。ネガティブ選択マーカー遺伝子が、蛍光、発光、および発色等の
光学的に検出可能なマーカー遺伝子（可視化マーカー遺伝子；ｖｉｓｉｂｌｅｍａｒｋ
ｅｒｇｅｎｅ）である場合など、細胞の生存に負の影響が少ない場合には、恒常的に発
現させてもよい。 The negative selectable marker is not particularly limited as long as it can select cells that do not express it. Negative selectable marker genes include, for example, suicide genes (thymidine kinase, etc.), fluorescent protein genes, luciferase genes, chromogenic enzyme genes, and the like. When the negative selectable marker gene is a gene that negatively affects cell survival (eg, a suicide gene), the negative selectable marker gene can be operably linked to an inducible promoter. By being operably linked to an inducible promoter, the negative selectable marker gene can be expressed only when it is desired to eliminate cells bearing the negative selectable marker gene. Optically detectable marker genes (visible marker genes; visible mark
If the negative effect on cell survival is small, such as when the gene is an er gene), it may be expressed constitutively.

薬剤耐性遺伝子としては、例えば、ピューロマイシン耐性遺伝子、ブラスティサイディ
ン耐性遺伝子、ジェネティシン耐性遺伝子、ネオマイシン耐性遺伝子、テトラサイクリン
耐性遺伝子、カナマイシン耐性遺伝子、ゼオシン耐性遺伝子、ハイグロマイシン耐性遺伝
子、クロラムフェニコール耐性遺伝子等が挙げられるが、これらに限定されない。
蛍光タンパク質遺伝子としては、例えば、緑色蛍光タンパク質（ＧＦＰ）遺伝子、黄色
蛍光タンパク質（ＹＦＰ）遺伝子、赤色蛍光タンパク質（ＲＦＰ）遺伝子等が挙げられる
が、これらに限定されない。
発光酵素遺伝子としては、例えば、ルシフェラーゼ遺伝子などが挙げられるが、これら
に限定されない。
発色酵素遺伝子としては、例えば、βガラクトシターゼ遺伝子、βグルクロニダーゼ遺
伝子、アルカリフォスファターゼ遺伝子等が挙げられるが、これらに限定されない。
自殺遺伝子としては、例えば、単純ヘルペスウイルスのチミジンキナーゼ（ＨＳＶ－Ｔ
Ｋ）、誘導性カスパーゼ９（ｉｎｄｕｃｉｂｌｅｃａｓｐａｓｅ９）等が挙げられる
が、これらに限定されない。 Examples of drug resistance genes include puromycin resistance gene, blasticidin resistance gene, geneticin resistance gene, neomycin resistance gene, tetracycline resistance gene, kanamycin resistance gene, zeocin resistance gene, hygromycin resistance gene, chloramphenicol resistance gene, and the like. include, but are not limited to:
Examples of fluorescent protein genes include, but are not limited to, green fluorescent protein (GFP) gene, yellow fluorescent protein (YFP) gene, red fluorescent protein (RFP) gene, and the like.
Luciferase genes include, but are not limited to, luciferase genes.
Examples of the chromogenic enzyme gene include, but are not limited to, β-galactosidase gene, β-glucuronidase gene, alkaline phosphatase gene and the like.
Suicide genes include, for example, herpes simplex virus thymidine kinase (HSV-T
K), inducible caspase 9, etc., but are not limited thereto.

選択マーカー用ドナーＤＮＡが有する選択マーカー遺伝子は、ポジティブ選択マーカー
遺伝子であることが好ましい。すなわち、選択マーカーを発現する細胞を、選択マーカー
遺伝子がノックインされた細胞として、選択することができる。 The selectable marker gene possessed by the selectable marker donor DNA is preferably a positive selectable marker gene. That is, cells expressing the selectable marker can be selected as cells in which the selectable marker gene has been knocked in.

上流ホモロジーアームは、改変対象のゲノムにおいて、標的領域の上流側の塩基配列と
相同組換え可能な塩基配列を有し、例えば、標的配列の上流側に隣接する塩基配列と相同
な塩基配列を有する。下流ホモロジーアームは、改変対象のゲノムにおいて、標的領域の
下流側の塩基配列と相同組換え可能な塩基配列を有し、例えば、標的配列の下流側に隣接
する塩基配列と相同な塩基配列を有する。上流ホモロジーアーム及び下流ホモロジーアー
ムは、標的領域の周辺領域と相同組換可能であれば、その長さ及び配列は特に限定されな
い。上流ホモロジーアーム及び下流ホモロジーアームは、相同組換え可能な限り、標的領
域の上流側配列若しくは下流側配列と必ずしも完全一致する必要はない。例えば、上流ホ
モロジーアームは、標的領域の上流側に隣接する塩基配列と９０％以上の配列同一性（相
同性）を有する配列であることができ、９２％以上、９３％以上、９４％以上、９５％以
上、９６％以上、９７％以上、９８％以上、又は９９％以上の配列同一性を有することが
好ましい。例えば、下流ホモロジーアームは、標的領域の下流側に隣接する塩基配列と９
０％以上の配列同一性（相同性）を有する配列であることができ、９２％以上、９３％以
上、９４％以上、９５％以上、９６％以上、９７％以上、９８％以上、又は９９％以上の
配列同一性を有することが好ましい。また、上流ホモロジーアームと下流ホモロジーアー
ムとは、少なくともそのいずれかが標的領域中の切断箇所により近いとアレルの改変効率
をより高め得る。ここで、「近い」とは、２つの配列の距離が１００ｂｐ以下、５０ｂｐ
以下、４０ｂｐ以下、３０ｂｐ以下、２０ｂｐ以下、または１０ｂｐ以下であることを意
味し得る。 The upstream homology arm has a base sequence capable of homologous recombination with the base sequence upstream of the target region in the genome to be modified, for example, has a base sequence homologous to the base sequence adjacent to the upstream side of the target sequence. . The downstream homology arm has a base sequence capable of homologous recombination with the base sequence downstream of the target region in the genome to be modified, for example, has a base sequence homologous to the base sequence adjacent to the downstream side of the target sequence. . The length and sequence of the upstream homology arm and the downstream homology arm are not particularly limited as long as they are capable of homologous recombination with the region surrounding the target region. The upstream homology arm and downstream homology arm do not necessarily have to completely match the upstream or downstream sequence of the target region as long as homologous recombination is possible. For example, the upstream homology arm can be a sequence having 90% or more sequence identity (homology) with the nucleotide sequence adjacent to the upstream side of the target region, such as 92% or more, 93% or more, 94% or more, Preferably, they have a sequence identity of 95% or greater, 96% or greater, 97% or greater, 98% or greater, or 99% or greater. For example, the downstream homology arm includes the base sequence flanking the downstream side of the target region and 9
It can be a sequence having a sequence identity (homology) of 0% or more, 92% or more, 93% or more, 94% or more, 95% or more, 96% or more, 97% or more, 98% or more, or 99% % or greater sequence identity is preferred. In addition, when at least one of the upstream homology arm and the downstream homology arm is closer to the cleavage site in the target region, the efficiency of allele modification can be increased. Here, "close" means that the distance between the two sequences is 100 bp or less, 50 bp
Hereafter can mean 40 bp or less, 30 bp or less, 20 bp or less, or 10 bp or less.

選択マーカー用ドナーＤＮＡにおいて、選択マーカー遺伝子は、上流ホモロジーアーム
と下流ホモロジーアームとの間に位置する。これにより、上記（ｉ）のゲノム改変システ
ムと共に選択マーカー用ドナーＤＮＡを細胞に導入した場合に、ＨＤＲにより、選択マー
カー遺伝子が標的領域に導入される（これにより遺伝子が破壊される場合、遺伝子ノック
アウトといい、これにより所望の遺伝子が導入される場合、遺伝子ノックインという、遺
伝子をノックアウトしつつ、別の遺伝子をノックインすることもできる）。 In the selectable marker donor DNA, the selectable marker gene is located between the upstream homology arm and the downstream homology arm. As a result, when donor DNA for a selection marker is introduced into cells together with the genome modification system of (i) above, a selection marker gene is introduced into the target region by HDR (when the gene is disrupted by this, gene knockout When a desired gene is introduced by this, it is possible to knock in another gene while knocking out a gene, called gene knock-in).

選択マーカー遺伝子は、適切なプロモーターの制御下で発現されるように、プロモータ
ーに機能的に連結されていることが好ましい。プロモーターは、ドナーＤＮＡを導入する
細胞の種類に応じて適宜選択することができる。プロモーターとしては、例えば、ＳＲα
プロモーター、ＳＶ４０初期プロモーター、レトロウイルスのＬＴＲ、ＣＭＶ（サイトメ
ガロウイルス）プロモーター、ＲＳＶ（ラウス肉腫ウイルス）プロモーター、ＨＳＶ－Ｔ
Ｋ（単純ヘルペスウイルスチミジンキナーゼ）プロモーター、ＥＦ１αプロモーター、メ
タロチオネインプロモーター、ヒートショックプロモーター等が挙げられる。選択マーカ
ー用ドナーＤＮＡは、エンハンサー、ポリＡ付加シグナル、ターミネーター等の任意の制
御配列等を有していてもよい。 A selectable marker gene is preferably operably linked to a promoter so that it is expressed under the control of a suitable promoter. A promoter can be appropriately selected according to the type of cell into which the donor DNA is introduced. Examples of promoters include SRα
promoter, SV40 early promoter, retroviral LTR, CMV (cytomegalovirus) promoter, RSV (Rous sarcoma virus) promoter, HSV-T
K (herpes simplex virus thymidine kinase) promoter, EF1α promoter, metallothionein promoter, heat shock promoter and the like. The selectable marker donor DNA may have any regulatory sequences such as enhancers, poly(A) addition signals, terminators, and the like.

選択マーカー用ドナーＤＮＡは、インスレーター配列を有していてもよい。「インスレ
ーター」とは、隣接する染色体環境の影響を遮断または緩和し、その領域に挟まれたＤＮ
Ａの転写調節の独立性を保証するまたは高める配列をいう。インスレーターは、エンハン
サー遮断効果（エンハンサーとプロモーターの間に挿入することにより、エンハンサーに
よるプロモーター活性への影響を遮断する効果）、及び位置効果の抑制作用（導入遺伝子
の両側をインスレーターで挟むことにより、導入遺伝子の発現を挿入されたゲノム上の位
置に影響されないようにする作用）により定義される。選択マーカー用ドナーＤＮＡは、
上流アームと選択マーカー遺伝子との間（又は上流アームと選択マーカー遺伝子を制御す
るプロモーターとの間）に、インスレーター配列を有していてもよい。選択マーカー用ド
ナーＤＮＡは、下流アームと選択マーカー遺伝子との間に、インスレーター配列を有して
いてもよい。 The selectable marker donor DNA may have an insulator sequence. An "insulator" is a DN sandwiched between regions that blocks or mitigates the influence of the adjacent chromosomal environment.
A sequence that ensures or enhances the independence of A transcriptional regulation. Insulators have both an enhancer-blocking effect (the effect of blocking the effect of the enhancer on the promoter activity by inserting it between the enhancer and the promoter) and a position-effect suppressing effect (the effect of sandwiching the transgene with insulators on both sides). , the action of rendering the expression of the transgene unaffected by the position on the genome where it was inserted). The donor DNA for the selectable marker is
An insulator sequence may be present between the upstream arm and the selectable marker gene (or between the upstream arm and the promoter controlling the selectable marker gene). The selectable marker donor DNA may have an insulator sequence between the downstream arm and the selectable marker gene.

選択マーカー用ドナーＤＮＡは、直鎖状であってもよく、環状であってもよいが、環状
であることが好ましい。好ましくは、選択マーカー用ドナーＤＮＡは、プラスミドである
。選択マーカー用ドナーＤＮＡは、上記の配列に加えて、任意の配列を含み得る。例えば
、上流ホモロジーアーム、インスレーター、選択マーカー遺伝子、及び下流ホモロジーア
ームの各配列間の全て又は一部に、スペーサー配列を含んでいてもよい。 The selectable marker donor DNA may be linear or circular, but preferably circular. Preferably, the selectable marker donor DNA is a plasmid. Donor DNA for selectable markers can contain any sequence in addition to the above sequences. For example, all or part of each sequence between the upstream homology arm, the insulator, the selectable marker gene, and the downstream homology arm may contain a spacer sequence.

工程（ａ）では、ゲノム改変の対象とするアレルの数と同数以上の種類の選択マーカー
用ドナーＤＮＡを細胞に導入する。異なる種類の選択マーカー用ドナーＤＮＡは、相互に
異なる（区別可能な）種類の選択マーカー遺伝子を有する。ある態様では、異なる種類の
選択マーカー用ドナーＤＮＡは、完全に同一の選択マーカー遺伝子またはそのセットを有
しない。つまり、第１の種類の選択マーカー用ドナーＤＮＡは、第１の種類の選択マーカ
ー遺伝子を有し、第２の種類の選択マーカー用ドナーＤＮＡは、第２の種類の選択マーカ
ー遺伝子を有する。第３の種類の選択マーカー用ドナーＤＮＡは、第３の種類の選択マー
カー遺伝子を有し、それ以降の種類の選択マーカー用ドナーＤＮＡについても同様である
。ゲノム改変の対象とするアレルが２個である場合、選択マーカー用ドナーＤＮＡの種類
は２種類以上である。ゲノム改変の対象とするアレルが３個である場合、選択マーカー用
ドナーＤＮＡの種類は３種類以上である。ある態様では、１つの選択マーカー用ドナーＤ
ＮＡが、２種類以上の相互に異なる（区別可能な）選択マーカーを有していてもよい（こ
の場合であっても、異なる種類の選択マーカー用ドナーＤＮＡは、相互に異なる（区別可
能な）種類（例えば、固有）の選択マーカー遺伝子を有していなければならない）。ある
態様では、選択マーカー用ドナーＤＮＡは、部位特異的組換え酵素の組換え配列（例えば
、Ｃｒｅリコンビナーゼにより組換えられるｌｏｘＰ配列およびその変種）を有しない。
また、ある態様では、本発明の方法は、部位特異的組換え酵素及びその組換え配列（例え
ば、Ｃｒｅリコンビナーゼにより組換えられるｌｏｘＰ配列およびその変種）を用いない
。部位特異的組換え酵素を用いると、通常は、編集後のゲノムに部位特異的組換え酵素の
組換え配列が１つ残存する。これに対して、ある態様では、本発明の方法で得られる細胞
の改変ゲノムは、部位特異的組換え酵素の組換え配列（外来である）を有しない。 In step (a), the same or more types of donor DNAs for selection markers as the number of target alleles for genome modification are introduced into cells. Different types of selectable marker donor DNAs have mutually different (distinguishable) types of selectable marker genes. In certain aspects, the different types of selectable marker donor DNAs do not have the exact same selectable marker gene or set thereof. That is, the first type of selectable marker donor DNA has the first type of selectable marker gene, and the second type of selectable marker donor DNA has the second type of selectable marker gene. The third type of selectable marker donor DNA has the third type of selectable marker gene, and the same applies to subsequent types of selectable marker donor DNAs. When there are two target alleles for genome modification, there are two or more types of donor DNAs for selection markers. When there are three target alleles for genome modification, there are three or more types of selectable marker donor DNAs. In one aspect, donor D for one selectable marker
NA may have two or more mutually different (distinguishable) selection markers (even in this case, different types of donor DNA for selection markers are mutually different (distinguishable) must have a species (eg, unique) selectable marker gene). In some embodiments, the selectable marker donor DNA does not have recombination sequences for site-specific recombinase (eg, loxP sequences and variants thereof that are recombined by Cre recombinase).
Also, in certain aspects, the methods of the invention do not employ site-specific recombinases and recombination sequences thereof (eg, loxP sequences and variants thereof that are recombined by Cre recombinase). When a site-directed recombinase is used, usually one recombination sequence of the site-directed recombinase remains in the edited genome. In contrast, in one aspect, the modified genome of the cell obtained by the method of the invention does not contain the recombination sequence (which is foreign) of the site-specific recombinase.

選択マーカー用ドナーＤＮＡの種類数は、ゲノム改変の対象とするアレルの数と同数以
上であればよく、上限は特に限定されない。ゲノム改変の対象とするアレルの数と同数以
上の種類の選択マーカー用ドナーＤＮＡを用いることで、２つ以上のアレルを安定的に改
変することができる。選択マーカー用ドナーＤＮＡの種類数は、後述の工程（ｂ）での選
択操作の観点から、ゲノム改変の対象とするアレルの数と同数又は１～２多い程度が好ま
しく、ゲノム改変の対象とするアレルの数と同数であることがより好ましい。 The number of types of selectable marker donor DNAs may be equal to or greater than the number of target alleles for genome modification, and the upper limit is not particularly limited. Two or more alleles can be stably modified by using the same or more types of selective marker donor DNAs as the number of target alleles for genome modification. From the viewpoint of the selection operation in the step (b) described later, the number of types of donor DNA for selection markers is preferably the same as the number of alleles to be modified or about 1 to 2 more than the number of alleles to be modified. More preferably, the number is the same as the number of alleles.

上記（ｉ）と（ｉｉ）を細胞に導入する方法は特に限定されず、公知の方法を特に制限
なく用いることができる。（ｉ）及び（ｉｉ）を細胞に導入する方法としては、例えば、
ウイルス感染法、リポフェクション法、マイクロインジェクション法、カルシウムリン酸
法、ＤＥＡＥ－デキストラン法、エレクトロポーレーション法、パーティクルガン法等が
挙げられるが、これらに限定されない。上記（ｉ）と（ｉｉ）とを細胞に導入することに
より、上記（ｉ）の配列特異的核酸切断分子により、標的領域のＤＮＡが切断された後、
ＨＤＲにより（ｉｉ）の選択マーカー用ドナーＤＮＡ中の選択マーカーが標的領域にノッ
クインされる。この際、２種以上の選択マーカー用ドナーＤＮＡは、それぞれの上流ホモ
ロジーアームおよび下流ホモロジーアームが同一である場合には、ランダムに標的領域の
２つ以上のアレルにノックインされ得る。但し、２種以上の選択マーカー用ドナーＤＮＡ
は、２以上のアレルそれぞれの標的領域の上流配列および下流配列と相同組換え可能な塩
基配列をホモロジーアームの塩基配列をそれぞれ有している限り２以上のアレルそれぞれ
を改変できるので、完全に同一のホモロジーアームの塩基配列を有する必要はない。ある
態様では、２種以上の選択マーカー用ドナーＤＮＡにおいては、その上流および下流ホモ
ロジーアームの塩基配列が、それぞれのアレルの標的領域の上流配列および下流配列とよ
り同一性の高い塩基配列を有していてもよい（例えば、そのように最適化されていてもよ
い）。 The method for introducing the above (i) and (ii) into cells is not particularly limited, and known methods can be used without particular limitations. Methods for introducing (i) and (ii) into cells include, for example,
Examples include, but are not limited to, virus infection method, lipofection method, microinjection method, calcium phosphate method, DEAE-dextran method, electroporation method, particle gun method and the like. By introducing (i) and (ii) above into a cell, the DNA in the target region is cleaved by the sequence-specific nucleic acid cleaving molecule of (i) above,
HDR knocks in the selectable marker in the selectable marker donor DNA of (ii) into the target region. At this time, two or more selection marker donor DNAs can be randomly knocked into two or more alleles of the target region when their respective upstream homology arms and downstream homology arms are the same. However, donor DNA for two or more selection markers
can modify each of two or more alleles as long as they have base sequences of homology arms that allow homologous recombination with the upstream and downstream sequences of the target regions of each of the two or more alleles. need not have the nucleotide sequence of the homology arms of In one aspect, in two or more types of selectable marker donor DNAs, the nucleotide sequences of the upstream and downstream homology arms have nucleotide sequences that are more identical to the upstream and downstream sequences of the target regions of the respective alleles. (eg, optimized to do so).

選択マーカー用ドナーＤＮＡは、ある態様では、上流ホモロジーアームおよび下流ホモ
ロジーアームを有し、上流ホモロジーアームおよび下流ホモロジーアームの間に、選択マ
ーカー遺伝子を有し、好ましくは、メガヌクレアーゼの切断部位などのエンドヌクレアー
ゼ（塩基配列特異的核酸切断分子）の標的配列をさらに有し得る。この態様において、あ
る好ましい態様では、選択マーカーは、ポジティブ選択用の選択マーカー遺伝子およびネ
ガティブ選択用のマーカー遺伝子を含む。別の好ましい態様では、選択マーカーは、ポジ
ティブ選択用の選択マーカーを含むが、これとは別にネガティブ選択マーカー遺伝子を含
まなくてもよい。ある好ましい態様では、ポジティブ選択用の選択マーカー遺伝子は、ネ
ガティブ選択用にも用いられ得、そのようなマーカー遺伝子としては、可視化マーカー遺
伝子が挙げられる。
２以上の選択マーカー用ドナーＤＮＡのセットは、上記の選択マーカー用ドナーＤＮＡ
の組合せであり、かつ、それぞれが互いに区別可能なポジティブ選択用の選択マーカー遺
伝子を有している。上記セットにおいては、メガヌクレアーゼの切断部位などのエンドヌ
クレアーゼ（塩基配列特異的核酸切断分子）の標的配列をさらに有し得、当該標的配列は
互いに異なっていてもよいが、同一である（または同一の塩基配列特異的核酸切断分子で
切断できる）ことが好ましい。選択マーカー用ドナーＤＮＡの長さは上記の通りであるが
、例えば、５ｋｂｐ以上、８ｋｂｐ以上、または１０ｋｂｐ以上であり得る。 In one aspect, the donor DNA for the selectable marker has an upstream homology arm and a downstream homology arm, has a selectable marker gene between the upstream homology arm and the downstream homology arm, and preferably has a meganuclease cleavage site or the like. It may further have a target sequence for an endonuclease (sequence-specific nucleic acid cleaving molecule). In this aspect, in one preferred aspect, the selectable marker comprises a selectable marker gene for positive selection and a marker gene for negative selection. In another preferred embodiment, the selectable marker includes a selectable marker for positive selection, but may not separately include a negative selectable marker gene. In certain preferred embodiments, selectable marker genes for positive selection can also be used for negative selection, and such marker genes include visualization marker genes.
The set of two or more selectable marker donor DNAs includes the above selectable marker donor DNAs
and each have selectable marker genes for positive selection that are distinguishable from each other. The set may further have a target sequence for an endonuclease (base sequence-specific nucleic acid cleaving molecule), such as a cleavage site for a meganuclease, and the target sequences may be different from each other, but are identical (or is cleavable with a base sequence-specific nucleic acid cleaving molecule). The length of the donor DNA for the selectable marker is as described above, but can be, for example, 5 kbp or longer, 8 kbp or longer, or 10 kbp or longer.

（工程（ｂ））
前記工程（ａ）の後、工程（ｂ）を行う。工程（ｂ）では、２以上のアレルにそれぞれ
区別可能に異なる選択マーカー遺伝子またはその組合せが導入された細胞を、当該区別可
能に異なる選択マーカー遺伝子の発現に基づいて、選択する。より具体的には、工程（ｂ
）では、前記２以上のアレルに対して異なる種類の選択マーカー用ドナーＤＮＡがそれぞ
れ相同組み換えすることによって、当該２以上のアレルにそれぞれ区別可能に異なる固有
の選択マーカー遺伝子が導入され、導入された当該区別可能に異なる選択マーカー遺伝子
のすべてを発現する細胞を選択する。ある態様では、工程（ｂ）では、前記２種以上の選
択マーカー用ドナーＤＮＡが有する選択マーカー遺伝子であって染色体ゲノムに組込まれ
たすべての選択マーカー遺伝子の発現に基づいて、異なる選択マーカー用ドナーＤＮＡが
導入されたことによりそれぞれのアレルが改変された細胞を選択する。ある態様では、工
程（ｂ）では、前記２種以上の選択マーカー用ドナーＤＮＡが有する全ての選択マーカー
遺伝子に基づいて、細胞を選択する。ある態様では、工程（ｂ）では、前記２種以上の選
択マーカー用ドナーＤＮＡが有する選択マーカー遺伝子であって染色体ゲノムに組込まれ
たすべての選択マーカー遺伝子（ポジティブ選択用のマーカー遺伝子）の発現に基づいて
、区別可能な選択マーカー用ドナーＤＮＡが導入されたことによりそれぞれのアレルが改
変された細胞を選択する。ある態様では、工程（ｂ）で得られる細胞は、各アレルが異な
るポジティブ選択用のマーカー遺伝子を有する。ある態様では、工程（ｂ）で得られる細
胞は、各アレルが、共通したポジティブ選択用のマーカー遺伝子をここで、ある態様では
、工程（ｂ）では、単一細胞クローニングを行わない｛但し、工程（ｂ）で２つ以上のア
レルが改変された細胞を選択した後に単一細胞クローニングを行うことを含んでいても、
含まなくてもよい｝。ある態様では、工程（ｂ）では、細胞の選択は、各アレルに導入さ
れた区別可能なポジティブ選択用のマーカー遺伝子の複数の発現に基づいて行われる。あ
る態様では、工程（ｂ）は、単一の選択マーカー遺伝子の発現強度（例えば、蛍光タンパ
ク質の発現強度または蛍光強度）に基づいて改変アレル数の多さを推定する方式では行わ
れない。単一の選択マーカー遺伝子の発現強度の強弱に基づいて改変アレル数の多さを推
定する方式で細胞を選択する場合、細胞毎の遺伝子発現量に変動が生じ、２以上のアレル
が改変された細胞を１つのアレルが改変された細胞から完全に分離することが困難となり
、したがって、工程（ｂ）において単一細胞クローニングが必要となるためである。 (Step (b))
After the step (a), the step (b) is performed. In step (b), cells into which two or more alleles have been introduced with distinguishably different selectable marker genes or a combination thereof are selected based on the expression of the distinguishably different selectable marker genes. More specifically, step (b
), by homologous recombination of different types of selectable marker donor DNAs with respect to the two or more alleles, unique selectable marker genes that are distinguishably different from each other are introduced into the two or more alleles. Cells are selected that express all of the distinguishably distinct selectable marker genes. In one aspect, in step (b), different donors for selection markers are selected based on the expression of all the selection marker genes that are included in the two or more types of donor DNAs for selection markers and that are integrated into the chromosomal genome. Cells in which each allele has been modified by introduction of the DNA are selected. In one embodiment, in step (b), cells are selected based on all the selectable marker genes possessed by the two or more selectable marker donor DNAs. In one aspect, in step (b), expression of all selectable marker genes (positive selection marker genes) integrated into the chromosomal genome that are selectable marker genes possessed by the two or more selectable marker donor DNAs Based on this, cells in which each allele has been modified by introduction of a distinguishable selection marker donor DNA are selected. In one aspect, the cells obtained in step (b) have a marker gene for positive selection with each allele being different. In one embodiment, the cells obtained in step (b) each allele carries a common marker gene for positive selection, and in one embodiment, in step (b), single cell cloning is not performed {however, performing single-cell cloning after selecting cells in which two or more alleles have been modified in step (b);
may not be included}. In one aspect, in step (b), the selection of cells is based on multiple expression of the introduced distinct positive selection marker gene at each allele. In some embodiments, step (b) is not performed in a manner that estimates the abundance of modified alleles based on the expression intensity of a single selectable marker gene (eg, fluorescent protein expression intensity or fluorescence intensity). When cells are selected by a method of estimating the number of modified alleles based on the intensity of expression of a single selectable marker gene, the gene expression level of each cell fluctuates, and two or more alleles are modified. This is because it becomes difficult to completely separate the cells from the cells modified in one allele, thus necessitating single cell cloning in step (b).

工程（ｂ）は、工程（ａ）で用いた選択マーカー遺伝子の種類に応じて、適宜、細胞の
選択を行えばよい。この際、工程（ａ）で用いた選択マーカー遺伝子の全ての発現に基づ
いて細胞を選択する。 In step (b), cells may be appropriately selected according to the type of selectable marker gene used in step (a). At this time, cells are selected based on the expression of all of the selectable marker genes used in step (a).

例えば、選択マーカー遺伝子がポジティブ選択マーカー遺伝子である場合、改変対象の
染色体ゲノムに組込まれる（または組込まれた）すべての選択マーカー遺伝子を発現する
細胞を選択することができ、例えば、改変対象のアレルの数と同数のポジティブ選択マー
カーを発現する細胞を選択することができる。ポジティブ選択マーカー遺伝子が薬剤耐性
遺伝子である場合、当該薬剤を含む培地で細胞を培養することにより、前記ポジティブ選
択マーカーを発現する細胞を選択することができる。ポジティブ選択マーカー遺伝子が蛍
光タンパク質遺伝子、発光酵素遺伝子、又は発色酵素遺伝子である場合、蛍光タンパク質
、発光酵素、又は発色酵素による蛍光、発光、又は発色を呈する細胞を選択することによ
り、前記ポジティブ選択マーカーを発現する細胞を選択することができる。本工程では、
改変されるべきアレルの数と同数の選択マーカー用ドナーＤＮＡがゲノムに取り込まれた
場合、当該数のアレルが改変されている。ｎ倍体の細胞においては、改変されるべきアレ
ルの数はｎまたはそれ以下であり、当該数以上ｎ以下の種類の選択マーカー用ドナーＤＮ
Ａがゲノムに取り込まれた場合には、少なくとも改変されるべきアレル（２以上のアレル
である）が改変されている。ある態様では、改変されるべきアレルの数がｎであり、当該
数の種類の選択マーカー用ドナーＤＮＡが染色体ゲノムに取り込まれ、すべてのアレルが
改変されている。ある態様では、本工程では、改変対象とするアレルの数と同数以上の種
類の選択マーカー用ドナーＤＮＡを用いているため、細胞が発現するポジティブ選択マー
カーの数は、当該数のアレルが確実に改変されていることを意味する。工程（ｂ）におけ
る細胞の選択効率を高める観点では、改変対象とするアレルの数は、選択マーカー用ドナ
ーＤＮＡの種類数と同一であることが好ましい。 For example, when the selectable marker gene is a positive selectable marker gene, cells expressing all selectable marker genes integrated (or integrated) into the chromosomal genome to be modified can be selected. Cells can be selected that express as many positive selectable markers as there are. When the positive selectable marker gene is a drug resistance gene, cells expressing the positive selectable marker can be selected by culturing cells in a medium containing the drug. When the positive selectable marker gene is a fluorescent protein gene, a luciferase gene, or a chromogenic enzyme gene, the positive selectable marker is selected by selecting cells exhibiting fluorescence, luminescence, or color development by the fluorescent protein, luciferase, or chromogenic enzyme. can be selected for cells that express In this process,
When the same number of selectable marker donor DNAs as the number of alleles to be modified are incorporated into the genome, the number of alleles is modified. In nploid cells, the number of alleles to be modified is n or less, and the donor DN for selection marker of the number or more and n or less types
When A is incorporated into the genome, at least the alleles to be modified (which are two or more alleles) have been modified. In one aspect, the number of alleles to be modified is n, the number of types of donor DNAs for selection markers are incorporated into the chromosomal genome, and all alleles are modified. In one aspect, in this step, the number of types of donor DNA for selection markers equal to or more than the number of alleles to be modified is used. means modified. From the viewpoint of enhancing the cell selection efficiency in step (b), the number of alleles to be modified is preferably the same as the number of types of selectable marker donor DNAs.

上記の通り、本実施形態のゲノム改変方法では、ｎ倍体の細胞においてｎ個のアレルを
改変するためにｎ種類の選択マーカー用ドナーＤＮＡを用いてＨＤＲを誘発させることに
よって、細胞が有する全てのアレルが改変された細胞を効率よく取得することができる。
また、全てのアレルが改変された細胞を確実に取得することができるため、標的領域が大
きいサイズ（例えば、１０ｋｂｐ以上）であっても、標的領域が改変された細胞を効率よ
く取得することができる。そのため、大規模なゲノム改変も可能となる。 As described above, in the genome modification method of the present embodiment, by inducing HDR using n types of selection marker donor DNAs to modify n alleles in n-ploid cells, all the cells have Allele-modified cells can be obtained efficiently.
In addition, since cells in which all alleles have been modified can be reliably obtained, cells in which the target region has been modified can be efficiently obtained even if the target region has a large size (eg, 10 kbp or more). can. Therefore, large-scale genome modification becomes possible.

ある好ましい態様では、選択マーカー用ドナーＤＮＡは、上流ホモロジーアームと下流
ホモロジーアームとの間にポジティブ選択用マーカー遺伝子の他にネガティブ選択用マー
カー遺伝子を含んでいてもよい。ある好ましい態様では、選択マーカー用ドナーＤＮＡに
おいては、前記ポジティブ選択用マーカー遺伝子は、ネガティブ選択用にも兼用できるマ
ーカー遺伝子（ポジティブ選択およびネガティブ選択に兼用可能なマーカー遺伝子）であ
ってもよい。ある好ましい態様では、ポジティブ選択用マーカー遺伝子は、薬剤耐性遺伝
子であり得る。ある好ましい態様では、ポジティブ選択用マーカー遺伝子は、ネガティブ
選択用にも兼用できる可視化マーカー遺伝子等であり得る。 In a preferred embodiment, the selectable marker donor DNA may contain a negative selectable marker gene in addition to the positive selectable marker gene between the upstream homology arm and the downstream homology arm. In a preferred embodiment, in the selection marker donor DNA, the positive selection marker gene may be a marker gene that can also be used for negative selection (a marker gene that can be used for both positive selection and negative selection). In one preferred embodiment, the positive selectable marker gene can be a drug resistance gene. In a preferred embodiment, the positive selection marker gene can be a visualization marker gene or the like that can also be used for negative selection.

選択マーカー用ドナーＤＮＡは、上流ホモロジーアームと下流ホモロジーアームとの間
にポジティブ選択用マーカー遺伝子の他にネガティブ選択用マーカー遺伝子を含むが、標
的核酸配列をさらに含んでいてもよい。標的核酸配列は、上記ゲノム改変システムにより
切断可能な配列である。標的核酸配列は、アレル特異的な配列であることが好ましく、こ
れにより、第一のアレルのカセット（または第一のカセット）および第二のアレルのカセ
ット（第二のカセット）のうち、第一のアレルのみを切断すること、または第二のアレル
のみを切断することができる。このように、切断をアレル特異的に誘発させることにより
、片アレルのみの選択的な編集が可能である。ある好ましい態様では、選択マーカー用ド
ナーＤＮＡは、上流ホモロジーアームと下流ホモロジーアームとの間に１つの標的核酸配
列を含む。ある好ましい態様では、選択マーカー用ドナーＤＮＡは、上流ホモロジーアー
ムと下流ホモロジーアームとの間に第一の標的核酸配列と第二の標的核酸配列を含み、第
一の標的核酸配列と第二の標的核酸配列の間に選択マーカー遺伝子を含む。別の選択マー
カー用ドナーＤＮＡは、上流ホモロジーアームと下流ホモロジーアームとの間に第三の標
的核酸配列と第四の標的核酸配列を含み、第三の標的核酸配列と第四の標的核酸配列の間
に選択マーカー遺伝子を含む。第一の標的核酸配列と第二の標的核酸配列は、同一配列で
も異なっていてもよく、第三の標的核酸配列と第四の標的核酸配列は、同一配列でも異な
っていてもよい。例えば、第一の標的核酸配列と第二の標的核酸配列を切断するときに、
第三の標的核酸配列と第四の標的核酸配列が切断されないように設計され、および／また
は、第三の標的核酸配列と第四の標的核酸配列を切断するときに、第一の標的核酸配列と
第二の標的核酸配列が切断されないように設計されていれば、第一のカセットおよび第二
のカセットのいずれかのみを特異的に切断し、当該カセットのうちの一方の編集を選択的
に行うことができる。第一のカセットと第二のカセットを同時に編集する場合には、第一
～第四の標的核酸配列は同一でもよいことは明らかであろう。 The selectable marker donor DNA contains a positive selectable marker gene and a negative selectable marker gene between the upstream homology arm and the downstream homology arm, but may further contain a target nucleic acid sequence. A target nucleic acid sequence is a sequence cleavable by the genome modification system described above. The target nucleic acid sequence is preferably an allele-specific sequence whereby the first of the first allele cassette (or first cassette) and the second allele cassette (second cassette) one allele or only the second allele. In this way, selective editing of only one allele is possible by allele-specific induction of cleavage. In one preferred embodiment, the selectable marker donor DNA comprises a single target nucleic acid sequence between the upstream homology arm and the downstream homology arm. In one preferred embodiment, the selectable marker donor DNA comprises a first target nucleic acid sequence and a second target nucleic acid sequence between an upstream homology arm and a downstream homology arm, wherein the first target nucleic acid sequence and the second target nucleic acid sequence are A selectable marker gene is included between the nucleic acid sequences. Another selectable marker donor DNA comprises a third target nucleic acid sequence and a fourth target nucleic acid sequence between the upstream homology arm and the downstream homology arm, wherein the third target nucleic acid sequence and the fourth target nucleic acid sequence are Including a selectable marker gene in between. The first target nucleic acid sequence and the second target nucleic acid sequence may be the same or different, and the third and fourth target nucleic acid sequences may be the same or different. For example, when cleaving a first target nucleic acid sequence and a second target nucleic acid sequence,
designed such that the third target nucleic acid sequence and the fourth target nucleic acid sequence are not cleaved and/or the first target nucleic acid sequence is cleaved when the third target nucleic acid sequence and the fourth target nucleic acid sequence are cleaved and the second target nucleic acid sequence is designed not to be cleaved, specifically cleaves only one of the first cassette and the second cassette, and selectively edits one of the cassettes It can be carried out. It will be appreciated that the first to fourth target nucleic acid sequences may be identical if the first and second cassettes are edited simultaneously.

ある態様では、工程（ｂ）では、工程（ａ）によって得られた細胞を含むプールから、
細胞をクローニングすることなく、改変された細胞を選択することができる。クローニン
グの工程を省くことによって工程に必要な時間が短縮され得る。 In one aspect, in step (b), from the pool comprising the cells obtained by step (a),
Modified cells can be selected without cloning the cells. Omitting the cloning step can reduce the time required for the process.

第一の中間体細胞は、改変前細胞から上記工程（ａ）および（ｂ）によって得ることが
できる（図１の工程Ｓ１参照）。工程（ａ）においては、選択マーカー用ドナーＤＮＡと
して、標的配列に対する上流ホモロジーアームと下流ホモロジーアームとの間にポジティ
ブ選択用マーカー遺伝子とネガティブ選択用マーカー遺伝子、またはポジティブ選択用お
よびネガティブ選択用の両方に兼用可能なマーカー遺伝子を含む。選択マーカー用ドナー
ＤＮＡは好ましくは、標的配列に対する上流ホモロジーアームと下流ホモロジーアームと
の間に２つの標的核酸配列を含む。そして、ポジティブ選択用マーカー遺伝子とネガティ
ブ選択用マーカー遺伝子、またはポジティブ選択用およびネガティブ選択用の両方に兼用
可能なマーカー遺伝子は、好ましくは、当該２つの標的核酸配列の間に存在する。このよ
うにすることで、ポジティブ選択により第一の中間体細胞を得ることができる。 A first intermediate cell can be obtained from the pre-modification cell by steps (a) and (b) above (see step S1 in FIG. 1). In step (a), the selectable marker donor DNA is a positive selectable marker gene and a negative selectable marker gene, or both positive and negative selectable marker genes between the upstream homology arm and the downstream homology arm for the target sequence. contains a marker gene that can be used for both. Donor DNAs for selectable markers preferably contain two target nucleic acid sequences between upstream and downstream homology arms to the target sequences. A positive selection marker gene and a negative selection marker gene, or a marker gene that can be used for both positive selection and negative selection, are preferably present between the two target nucleic acid sequences. By doing so, a first intermediate cell can be obtained by positive selection.

第一の中間体細胞は、改変対象である遺伝子座に第一のアレルと第二のアレルを含むゲ
ノムを有する細胞において、前記第一のアレルと第二のアレルそれぞれに選択マーカー遺
伝子と標的核酸配列を含むカセットを有する細胞である。ある好ましい態様では、第一の
中間体細胞では、第一のアレルが有する選択マーカー遺伝子と第二のアレルが有する選択
マーカー遺伝子とは区別可能に異なる。ある好ましい態様では、第一の中間体細胞では、
前記標的核酸配列は、ゲノム改変システムの標的であり、ゲノム改変システムにより第一
のアレルと第二のアレルとを区別可能に切断できるように設計されている。ある好ましい
態様では、第一の中間体細胞では、各選択マーカー遺伝子はネガティブ選択に用いること
ができるネガティブ選択用マーカー遺伝子である。第一の中間体細胞を取得する際にはポ
ジティブ選択用マーカー遺伝子が有用であるが、第一の中間体細胞を得た後の工程におい
ては、ポジティブ選択用マーカー遺伝子は必要ではない。したがって、ポジティブ選択用
マーカー遺伝子は、除去されてもよい。除去は、例えば、ゲノム編集技術というにより実
施することができる。このように、第一の中間体細胞は、ポジティブ選択用を有していな
くてもよい。 The first intermediate cell is a cell having a genome containing a first allele and a second allele at the genetic locus to be modified, wherein each of the first allele and the second allele contains a selectable marker gene and a target nucleic acid. A cell with a cassette containing the sequence. In a preferred embodiment, in the first intermediate cell, the selectable marker gene possessed by the first allele is distinguishably different from the selectable marker gene possessed by the second allele. In one preferred embodiment, the first intermediate cell comprises
The target nucleic acid sequence is a target of a genome modification system and is designed so that the genome modification system can cleave the first allele and the second allele in a distinguishable manner. In one preferred embodiment, in the first intermediate cell, each selectable marker gene is a negative selectable marker gene that can be used for negative selection. A positive selectable marker gene is useful when obtaining the first intermediate cell, but the positive selectable marker gene is not required in the steps after obtaining the first intermediate cell. Thus, positive selectable marker genes may be removed. Removal can be performed, for example, by what is referred to as genome editing technology. Thus, the first intermediate cell may not have a positive selection potential.

第一の中間体細胞からは、第二の中間体細胞を得てもよい（参考：図１の工程Ｓ３）。
第二の中間体細胞は、第一の中間体細胞の第二のカセットを除去して作製することができ
る。カセットの除去は、好ましくは前記第二のカセットの上流と相同組換え可能な上流ホ
モロジーアームと前記カセットの下流と相同組換え可能な下流ホモロジーアームを含むド
ナーＤＮＡ（カセット除去用ドナーＤＮＡ）存在下で、第二のカセット内部の標的核酸配
列を特異的に切断することにより行うことができる。前記カセットの上流と相同組換え可
能な上流ホモロジーアームと前記カセットの下流と相同組換え可能な下流ホモロジーアー
ムからなるドナーＤＮＡ存在下で標的核酸配列を切断することでカセットの除去の際にカ
セットの上流と下流をシームレスに連結することもできる。このようにして得られる第二
の中間体細胞は、第一のアレルには、選択マーカー遺伝子と標的核酸配列を含むカセット
を有するが、第二のカセットを含まない。 A second intermediate cell may be obtained from the first intermediate cell (see step S3 in FIG. 1).
A second intermediate cell can be made by removing the second cassette of the first intermediate cell. Cassette removal is preferably performed in the presence of a donor DNA (donor DNA for cassette removal) containing an upstream homology arm capable of homologous recombination with the upstream of the second cassette and a downstream homology arm capable of homologous recombination with the downstream of the cassette. by specifically cleaving the target nucleic acid sequence within the second cassette. By cleaving the target nucleic acid sequence in the presence of a donor DNA consisting of an upstream homology arm capable of homologous recombination with the upstream of the cassette and a downstream homology arm capable of homologous recombination with the downstream of the cassette, the cassette is removed when the cassette is removed. It is also possible to connect upstream and downstream seamlessly. The second intermediate cell thus obtained has, in the first allele, the cassette containing the selectable marker gene and the target nucleic acid sequence, but does not contain the second cassette.

第一の中間体細胞および第二の中間体細胞から本開示のライブラリーを作製することが
できる（参考：図１の工程Ｓ２およびＳ４）。第一の中間体細胞および第二の中間体細胞
（合わせて「中間体細胞」という）は、第一のアレルに、選択マーカー遺伝子と標的核酸
配列を含むカセットを有する。ライブラリ作製工程においては、第一のカセットを除去し
、代わりに改変塩基配列を導入することができる。すなわち、第一のカセットを改変塩基
配列で置き換えることができる。この置き換えは、ゲノム改変システムにより実施するこ
とができる。第一のカセットの上流と相同組換え可能な上流ホモロジーアームと前記カセ
ットの下流と相同組換え可能な下流ホモロジーアームを含む改変塩基配列導入用ドナーＤ
ＮＡ（またはライブラリ作製用ドナーＤＮＡ）存在下で、第一のカセット内部の標的核酸
配列を特異的に切断することにより行うことができる。ライブラリ作製用ドナーＤＮＡは
、上流ホモロジーアームと下流ホモロジーアームとの間に改変塩基配列を有する。基本的
には、ライブラリ作製用ドナーＤＮＡの上流ホモロジーアームと下流ホモロジーアームの
間に挟まれた配列へと、ゲノム上の上流ホモロジーアームが相同組換えする領域と下流ホ
モロジーアームが相同組換えする領域に挟まれた領域が置き換わるので、置き換え後の配
列を上流ホモロジーアームと下流ホモロジーアームの間に有するライブラリ作製用ドナー
ＤＮＡが好ましく用いられ得る。したがって、上記操作により、カセットを改変塩基配列
に置き換えることができる。ライブラリ作製用ドナーＤＮＡは、種々の改変塩基配列を含
むＤＮＡ群であり得る。このようにすることで、上記操作により、各中間体細胞のカセッ
トそれぞれを様々な改変塩基配列に置き換えることができる。カセットが置き換えられる
と、カセット内のネガティブ選択用マーカー遺伝子が除去されることとなるので、ネガテ
ィブ選択用マーカー遺伝子が発現しないことを指標としてカセットが改変塩基配列に置き
換わった細胞を取得することができる。ある態様では、ライブラリ作製用ドナーＤＮＡは
、線状である（環状ではない）。このようにすることで、ライブラリ作製用ドナーＤＮＡ
の調整の簡便性において利点を有し得る（例えば、図４の欠点２参照）。カセットの改変
塩基配列の置き換えは、当業者であれは周知慣用技術により確認でき、例えば、制限酵素
による切断の有無、ＰＣＲの増幅の有無（例えば、ジャンクションＰＣＲ）やシークエン
シングにより確認することができる。改変塩基配列は、０塩基長、すなわち存在しないで
もよいが、好ましくは、１塩基長以上であり得る。改変塩基配列は、特に限定されないが
例えば、１～１００万塩基長、１～５０万塩基長、１～１０万塩基長、１～２万塩基長、
１～１．５万塩基長、または１～１万塩基長であり得る。改変塩基配列は、特に限定され
ないが例えば、３塩基長以上、１０塩基長以上、３０塩基長以上、５０塩基長以上、１０
０塩基長以上の長さを有し得る。各ライブラリ作製用ドナーＤＮＡの改変塩基配列は、同
じでも異なっていてもよい。各ライブラリ作製用ドナーＤＮＡの改変塩基配列は、それぞ
れ独立して上記の長さのいずれかを有し得る。 A library of the present disclosure can be generated from a first intermediate cell and a second intermediate cell (see steps S2 and S4 in Figure 1). The first intermediate cell and the second intermediate cell (collectively, "intermediate cells") have at the first allele a cassette comprising a selectable marker gene and a target nucleic acid sequence. In the library preparation process, the first cassette can be removed and a modified nucleotide sequence can be introduced instead. That is, the first cassette can be replaced with a modified base sequence. This replacement can be performed by a genome modification system. A donor D for introducing a modified base sequence comprising an upstream homology arm capable of homologous recombination with the upstream of the first cassette and a downstream homology arm capable of homologous recombination with the downstream of the cassette
It can be performed by specifically cleaving the target nucleic acid sequence inside the first cassette in the presence of NA (or donor DNA for library construction). The donor DNA for library construction has a modified base sequence between the upstream homology arm and the downstream homology arm. Basically, the region where the upstream homology arm on the genome undergoes homologous recombination and the region where the downstream homology arm homologously recombines into the sequence sandwiched between the upstream homology arm and the downstream homology arm of the donor DNA for library construction. Since the region sandwiched between the two is replaced, a donor DNA for library construction having the sequence after replacement between the upstream homology arm and the downstream homology arm can be preferably used. Therefore, the cassette can be replaced with the modified nucleotide sequence by the above operation. The donor DNA for library construction can be a group of DNAs containing various modified base sequences. By doing so, the cassette of each intermediate cell can be replaced with various modified base sequences by the above operation. When the cassette is replaced, the negative selectable marker gene in the cassette is removed, so that non-expression of the negative selectable marker gene can be used as an index to obtain cells in which the cassette has been replaced with the modified base sequence. . In one aspect, the donor DNA for library construction is linear (not circular). By doing so, the donor DNA for library preparation
(see, for example, drawback 2 in FIG. 4). Replacement of the modified nucleotide sequence of the cassette can be confirmed by a person skilled in the art by well-known and commonly used techniques, for example, the presence or absence of restriction enzyme cleavage, the presence or absence of PCR amplification (e.g., junction PCR), and sequencing. . The modified base sequence may have a length of 0 bases, that is, may be absent, but preferably has a length of 1 base or more. Modified base sequences are not particularly limited, but may be, for example, 1 to 1,000,000 bases long, 1 to 500,000 bases long, 10,000 to 100,000 bases long, 10,000 to 20,000 bases long,
It can be 10,000 to 15,000 bases long, or 10,000 to 10,000 bases long. The modified base sequence is not particularly limited, but may be, for example, 3 bases or longer, 10 bases or longer, 30 bases or longer, 50 bases or longer, 10 bases or longer,
It can have a length of 0 bases or longer. The modified nucleotide sequences of the donor DNAs for library construction may be the same or different. The modified base sequence of each library-construction donor DNA can independently have any of the above lengths.

中間体細胞が、標的領域において３以上のアレルを有する場合には、少なくとも第一の
アレルが区別可能な固有のネガティブマーカー遺伝子をあればよい。そのようにすること
で、第一のアレル以外のカセットを除去することができる、または、第一のアレル以外の
カセットを維持したまま、第一のアレルのカセットを改変塩基配列に置き換える操作を行
うことができる。したがって、中間体細胞として、第一のアレルのみが少なくとも第一の
アレルが区別可能な固有のネガティブマーカー遺伝子を有する細胞であり、中間体細胞と
してはそのような細胞を選択して用いることができる。 If the intermediate cell has three or more alleles in the target region, at least the first allele needs to have a distinct negative marker gene. By doing so, the cassette other than the first allele can be removed, or an operation is performed to replace the cassette of the first allele with the modified base sequence while maintaining the cassette other than the first allele. be able to. Therefore, as the intermediate cells, only the first allele is a cell having a unique negative marker gene in which at least the first allele can be distinguished, and such cells can be selected and used as the intermediate cells. .

本開示の方法は、図４に示される欠点１～３のいずれも解決しなくてもよいが、好まし
くは、図４に示される欠点１～３のいずれか１以上を解決し得る。具体的には、本開示の
方法では、ＧａｔｅｗｅｙＴＭ法、およびＬｏｘＰ／Ｃｒｅ法と比較して、標的ゲノムへ
の外来遺伝子の組換え効率が高い。本開示のある好ましい態様では、改変塩基配列導入用
ドナーＤＮＡは、線状であり、環状ではない。本開示の好ましい態様では、改変細胞は、
部位特異的組換え酵素の認識部位を有しない。本開示の方法のある好ましい態様では、Ｇ
ａｔｅｗｅｙＴＭ法、およびＬｏｘＰ／Ｃｒｅ法と比較して、標的ゲノムへの外来遺伝子
の組換え効率が高く、改変塩基配列導入用ドナーＤＮＡは、線状であり、改変細胞は、部
位特異的組換え酵素の認識部位を有しない。部位特異的組換え酵素の認識部位を有しない
という特徴は、例えば、導入カセットとゲノムとをシームレスに連結する際に有益である
。 Although the method of the present disclosure may not solve any of the drawbacks 1-3 shown in FIG. 4, it may preferably solve any one or more of the drawbacks 1-3 shown in FIG. Specifically, the method of the present disclosure has a higher efficiency of recombination of a foreign gene into the target genome than the GatewayTM method and the LoxP/Cre method. In a preferred embodiment of the present disclosure, the donor DNA for introducing modified nucleotide sequences is linear and not circular. In preferred aspects of the present disclosure, the modified cell is
It does not have a site-specific recombination enzyme recognition site. In certain preferred aspects of the methods of the present disclosure, G
Compared to the atweyTM method and the LoxP/Cre method, the recombination efficiency of the foreign gene into the target genome is high, the donor DNA for introducing the modified base sequence is linear, and the modified cell is a site-specific recombination enzyme does not have a recognition site for The feature of not having a recognition site for a site-specific recombination enzyme is useful, for example, when seamlessly ligating an introduction cassette and a genome.

本開示のライブラリーは、
複数の水性組成物の組合せを含み、
各水性組成物はそれぞれ１種類の改変細胞を含み、
改変細胞はそれぞれ、改変対象である遺伝子座（標的遺伝子座または目的遺伝子座）に
第一のアレルと第二のアレルを有し、
改変細胞はそれぞれ第一のアレルの同一位置に、水性組成物間で相互に異なるＤＮＡ断
片を含むカセットを有する。このようなライブラリーは、以下のようにして得ることがで
きる。例えば、様々な改変塩基配列を含むライブラリー作製用ドナーＤＮＡの存在下で前
記中間体細胞の第一のカセットの標的核酸配列を切断する。そうすると細胞に備わったＤ
ＮＡ損傷の修復機構により、中間体細胞の第一のカセットが改変塩基配列に置き換わる。
改変塩基配列を有する細胞をシングルセルクローニングに供することができる。シングル
セルクローニングは、細胞１つを含む液滴または水性組成物を多数形成し、培養に供して
、液滴または水性組成物中で、１つの細胞に由来する細胞クローンを作製することを含み
得る。このようにすると細胞クローンを含む水性組成物が複数（または多数）得られるこ
ととなる。このような複数（または多数）の水性組成物の組合せを改変細胞のライブラリ
ーとして用いることができる。 The library of the present disclosure is
comprising a combination of multiple aqueous compositions;
each aqueous composition comprising one type of modified cell;
Each modified cell has a first allele and a second allele at the locus to be modified (target locus or target locus),
Each modified cell has a cassette containing DNA fragments at the same position of the first allele that differ from each other among the aqueous compositions. Such libraries can be obtained as follows. For example, the target nucleic acid sequence of the first cassette of the intermediate cell is cleaved in the presence of library construction donor DNA containing various modified base sequences. Then the D in the cell
The NA damage repair mechanism replaces the first cassette of the intermediate cell with the modified nucleotide sequence.
Cells with altered nucleotide sequences can be subjected to single cell cloning. Single-cell cloning can involve forming multiple droplets or aqueous compositions containing a single cell and subjecting them to culture to generate cell clones derived from a single cell in the droplets or aqueous compositions. . In this way, a plurality (or a large number) of aqueous compositions containing cell clones are obtained. A combination of multiple (or multiple) such aqueous compositions can be used as a library of modified cells.

中間体細胞または改変細胞のある態様では、第一のアレルは、（当初ゲノム上の）標的
領域の一部又は全体を欠失している。中間体細胞または改変細胞のある態様では、第二の
アレルは、標的領域の一部又は全部を欠失している。中間体細胞または改変細胞のある好
ましい態様では、第一のアレルおよび第二のアレルは、標的領域の一部又は全体、より好
ましくは全部を欠失している。 In some embodiments of intermediate cells or modified cells, the first allele lacks part or all of the target region (originally genomic). In certain aspects of intermediate or modified cells, the second allele lacks part or all of the target region. In certain preferred embodiments of intermediate cells or modified cells, the first allele and the second allele lack part or all, more preferably all, of the target region.

第一の中間体細胞のある態様では、第一のアレルでは、標的領域の全体が第一のカセッ
トにより置換されている。第一の中間体細胞のある態様では、第二のアレルでは、標的領
域の全体が第二のカセットにより置換されている。第一の中間体細胞のある好ましい態様
では、第一のアレルおよび第二のアレルは、標的領域の全体がそれぞれ第一のカセットお
よび第二のカセットにより置換されている。 In some embodiments of the first intermediate cell, the first allele has the entire target region replaced by the first cassette. In some embodiments of the first intermediate cell, the second allele has the entire target region replaced by the second cassette. In one preferred embodiment of the first intermediate cell, the first and second alleles are replaced by the first and second cassettes respectively over the entire target region.

第二の中間体細胞では、第二のカセットは除去されている。ある好ましい態様では、第
二の中間体細胞では、（当初ゲノム上の）第二のアレルの標的領域の上流と下流とがシー
ムレスに連結している。シームレスに連結しているとは、新しい塩基の追加なく、上流と
下流とが連結していることを意味する。上流と下流のシームレスな連結においては、塩基
の追加の脱落なく、上流と下流とが連結していることが好ましい。 In the second intermediate cell the second cassette has been removed. In a preferred embodiment, the second intermediate cell is seamlessly linked upstream and downstream of the target region of the second allele (originally on the genome). "Seamlessly linked" means that upstream and downstream are linked without the addition of new bases. In seamless connection of upstream and downstream, it is preferable that upstream and downstream are connected without additional omission of bases.

改変細胞のある態様では、第一のアレルは、改変塩基配列を含み、改変塩基配列は、（
当初ゲノム上の）標的領域（被置換配列ともいう）に対して塩基の付加、挿入、置換、欠
失、および削除からなる群から選択される１以上の変異を有する。改変細胞は、当初の細
胞（リファレンス細胞または参照細胞）と比較をし、変異の影響を評価することに有利に
用いることができる。また、ライブラリーが、変異において異なる様々な細胞を含むこと
により、細胞間の比較により、標的領域のそれぞれの変異部位の機能を評価することに有
利である。 In some embodiments of the modified cell, the first allele comprises a modified sequence, and the modified sequence comprises (
It has one or more mutations selected from the group consisting of base additions, insertions, substitutions, deletions, and deletions in the target region (also referred to as the substituted sequence) originally on the genome. Modified cells can be advantageously used to compare with the original cells (reference cells or reference cells) and to assess the effects of mutations. Also, by including a variety of cells that differ in their mutations, it is advantageous to assess the function of each mutation site in the target region through cell-to-cell comparisons.

第一の改変細胞のある態様では、第二のアレルは、選択マーカーを含むカセットを含む
。選択マーカーは、ポジティブ選択用マーカー遺伝子を含み得る。第二の改変細胞のある
態様では、第二のアレルは、被置換配列の上流と下流とがシームレスに連結している。改
変細胞の好ましいある態様では、第一のアレルの被置換配列と第二のアレルの被置換配列
とは、対応する配列を有する。対応する配列を有するとは、配列の起点および終点がゲノ
ム上の同一位置に存在することを意味する。対応する配列は、典型的には、高い同一性（
例えば、８０％以上、９０％以上、または９５％以上）を有し、同一の長さを有する。 In some embodiments of the first modified cell, the second allele comprises a cassette containing a selectable marker. Selectable markers can include positive selectable marker genes. In some embodiments of the second modified cell, the second allele is seamlessly linked upstream and downstream of the substituted sequence. In certain preferred embodiments of the modified cell, the substituted sequence of the first allele and the substituted sequence of the second allele have corresponding sequences. Having corresponding sequences means that the start and end of the sequences are located at the same position on the genome. Corresponding sequences typically have high identity (
for example, 80% or more, 90% or more, or 95% or more) and have the same length.

ある態様において、改変細胞におけるカセットの配列のそれぞれは、１以上の改変部分
（Ａ）と１以上の非改変部分（Ｂ）とからなっている（例えば、図３参照）。前記改変部
分（Ａ）はそれぞれ、配列の挿入、欠失、および置換からなる群から選択される１以上の
改変を有し、前記１以上の改変部分の改変（Ａ）は、改変の位置または内容に関して各水
性組成物間で異なる。前記１以上の非改変部分（Ｂ）はそれぞれ、改変前の対応する部位
の配列と同一である。ここで、挿入カセットで置換される改変前の配列と、挿入カセット
内の配列とをアラインメントしたときに、同じ位置に整列される場合に、２つの核酸配列
は対応する部位の配列となる。前記カセット中のセントロメア側の非改変部分（Ｂ１）は
、前記カセットのセントロメア側の隣接配列（Ｃ１）とシームレスに連結しており、前記
カセットのテロメア側の非改変部分（Ｂｔ）は、前記カセットのテロメア側の隣接配列（
Ｃ２）とシームレスに連結しており、記隣接配列（Ｃ１）および非改変部分（Ｂ１）が連
結した領域、並びに非改変部分（Ｂｔ）および上記隣接配列（Ｃ２）が連結した領域は、
改変前の対応する領域の配列と同一の配列を構成していてもよい。このようなカセットを
ＤＮＡに組込むには、上記カセットの構造を上流ホモロジーアームと下流ホモロジーアー
ムの間に有するライブラリ作製用ドナーＤＮＡを用いて中間体細胞を改変すればよい。あ
る好ましい態様では、改変部分（Ａ）の長さの合計は、挿入カセットの長さの９０％以下
、８０％以下、７０％以下、６０％以下、５０％以下、４０％以下、３０％以下、２０％
以下、１０％以下、または５％以下であり得る。この態様の改変細胞を含むライブラリー
は、同一領域内の異なる位置に異なる変異を有するライブラリーであり得、例えば、どの
位置のどの変異が細胞活性を変化させるかを調べることや、所望の特性を有する細胞のス
クリーニングなどに好ましく用いられ得る。改変塩基配列導入用ドナーＤＮＡは、上流ホ
モロジーアームと下流ホモロジーアームの間に上記カセットの構造を有し得る。改変塩基
配列導入用ドナーＤＮＡは、異なるカセットの構造を有する改変塩基配列導入用ドナーＤ
ＮＡのライブラリーに含まれ得る。 In some embodiments, each sequence of the cassette in the modified cell consists of one or more modified portions (A) and one or more unmodified portions (B) (see, eg, Figure 3). Each of the modified portions (A) has one or more modifications selected from the group consisting of sequence insertions, deletions, and substitutions, and the modifications (A) of the one or more modified portions are modified positions or Content differs between each aqueous composition. Each of the one or more unmodified portions (B) is identical to the sequence of the corresponding site before modification. Here, when the sequence before modification to be replaced by the insertion cassette and the sequence in the insertion cassette are aligned and aligned at the same position, the two nucleic acid sequences are sequences at corresponding sites. The unmodified portion (B1) on the centromere side of the cassette is seamlessly linked to the adjacent sequence (C1) on the centromere side of the cassette, and the unmodified portion (Bt) on the telomere side of the cassette is the cassette. flanking sequences on the telomeric side of (
C2), and the region where the adjacent sequence (C1) and the unmodified portion (B1) are linked, and the region where the unmodified portion (Bt) and the adjacent sequence (C2) are linked,
It may constitute the same sequence as the sequence of the corresponding region before modification. In order to incorporate such a cassette into DNA, an intermediate cell may be modified using a donor DNA for library construction having the structure of the cassette between the upstream homology arm and the downstream homology arm. In one preferred embodiment, the total length of the modified portion (A) is 90% or less, 80% or less, 70% or less, 60% or less, 50% or less, 40% or less, 30% or less of the length of the insertion cassette. , 20%
or less, 10% or less, or 5% or less. A library containing modified cells of this embodiment can be a library having different mutations at different positions within the same region, for example, to investigate which mutation at which position changes cell activity, or to determine the desired property. can be preferably used for screening of cells having The donor DNA for introducing a modified nucleotide sequence may have the above cassette structure between the upstream homology arm and the downstream homology arm. The modified nucleotide sequence-introducing donor DNA is a modified nucleotide sequence-introducing donor D having a different cassette structure.
It can be included in a library of NA.

ある態様において、改変細胞における改変塩基配列は、その全体が変異からなる。 In some embodiments, the modified base sequence in the modified cell consists entirely of mutations.

ある態様において、挿入カセットの内部または外部（近辺）に部位特異的組換え酵素の
組換え配列（認識部位）は存在しない。ある態様において、挿入カセット以外は、改変が
なされていない、または、改変前の細胞と同一の塩基配列を有する。このようにすること
で、目的の改変以外の改変を含まない改変細胞を得ることができ、目的の改変以外の改変
の予期せぬ影響を除去することができる（例えば、図４の欠点３参照）。 In some embodiments, there are no recombination sequences (recognition sites) for the site-specific recombination enzyme inside or outside (near) the insertion cassette. In some embodiments, the cell is unmodified or has the same base sequence as the cell before modification, except for the insertion cassette. By doing so, it is possible to obtain a modified cell that does not contain modifications other than the desired modification, and it is possible to eliminate unexpected effects of modifications other than the desired modification (for example, see defect 3 in FIG. 4) ).

ライブラリーは、特に限定されないが好ましくは、４種類以上、５種類以上、６種類以
上、７種類以上、８種類以上、９種類以上、１０種類以上、１１種類以上、１２種類以上
、１３種類以上、１４種類以上、１５種類以上、１６種類以上、１７種類以上、１８種類
以上、１９種類以上、２０種類以上、２５種類以上、３０種類以上、３５種類以上、４０
種類以上、４５種類以上、５０種類以上、６０種類以上、７０種類以上、８０種類以上、
９０種類以上、１００種類以上、１５０種類以上、２００種類以上、３００種類以上、４
００種類以上、５００種類以上、６００種類以上、７００種類以上、８００種類以上、９
００種類以上、１０００種類以上、２０００種類以上、３０００種類以上、４０００種類
以上、５０００種類以上、６０００種類以上、７０００種類以上、８０００種類以上、９
０００種類以上、または１００００種類以上の改変細胞（又は改変細胞を含む水性組成物
）を含んでいてもよい。 The library is not particularly limited, but is preferably 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, 9 or more, 10 or more, 11 or more, 12 or more, 13 or more. , 14 or more, 15 or more, 16 or more, 17 or more, 18 or more, 19 or more, 20 or more, 25 or more, 30 or more, 35 or more, 40
types or more, 45 types or more, 50 types or more, 60 types or more, 70 types or more, 80 types or more,
90 types or more, 100 types or more, 150 types or more, 200 types or more, 300 types or more, 4
00 types or more, 500 types or more, 600 types or more, 700 types or more, 800 types or more, 9
00 types or more, 1000 types or more, 2000 types or more, 3000 types or more, 4000 types or more, 5000 types or more, 6000 types or more, 7000 types or more, 8000 types or more, 9
000 types or more, or 10000 or more types of modified cells (or aqueous compositions containing modified cells) may be included.

ライブラリーは、典型的には、複数の水性組成物の組合せを含む。各水性組成物は、１
種類の細胞を含む。ある態様では、ライブラリーは、複数の水性組成物の組合せを別々に
含む。ある態様では、ライブラリーは、目的によっては、複数の水性組成物の組合せを混
合物として含んでいてもよい。例えば、細胞の増殖力や生存力の高い細胞をスクリーニン
グする場合には、ライブラリーは、複数の水性組成物の組合せを混合物として含んでいて
もよく、このような場合であっても、培養後の細胞を分析することにより、もっとも増殖
力や生存力の高い細胞が富化され、増殖力や生存力の高い細胞を取得することができる。 A library typically contains a combination of a plurality of aqueous compositions. Each aqueous composition contains 1
Contains different types of cells. In some embodiments, the library separately comprises a plurality of combinations of aqueous compositions. In some embodiments, the library may contain combinations of multiple aqueous compositions as a mixture for some purposes. For example, when screening cells with high proliferative ability or viability of cells, the library may contain a combination of a plurality of aqueous compositions as a mixture. By analyzing the cells, cells with the highest proliferative capacity and viability are enriched, and cells with high proliferative capacity and viability can be obtained.

ある態様では、１つのライブラリーにおいて、当該ライブラリーに含まれる改変細胞そ
れぞれの改変塩基配列（またはＤＮＡ断片）以外のゲノム配列は、同一となるように設計
されている。ある態様では、１つのライブラリーにおいて、当該ライブラリーに含まれる
改変細胞それぞれの改変塩基配列（またはＤＮＡ断片）以外のゲノム配列は、実質的に同
一である。実質的に同一であるとは、細胞培養に適した環境下（通常の環境下）で、クロ
ーニングされた細胞を単に１０回継代した後に細胞間に生じ得る変異に基づく相違の存在
を許容する。 In one aspect, in one library, genomic sequences other than the modified nucleotide sequence (or DNA fragment) of each modified cell contained in the library are designed to be identical. In one aspect, in one library, the genomic sequences other than the modified base sequence (or DNA fragment) of each modified cell contained in the library are substantially identical. Substantially identical allows for the existence of mutational differences that can occur between cells after only 10 passages of the cloned cells under conditions suitable for cell culture (under normal conditions). .

改変塩基配列以外が共通していることにより、例えば、改変塩基配列の影響を評価する
ことに適する。第二のアレルにおけるカセットが除去されることで、カセットによる影響
の可能性を最小に抑えることができる。第二のアレルにおけるカセットがシームレスに除
去されることで、追加の新しい塩基が第二のアレルに導入されることによる影響の可能性
を最小に抑えることができる。 Since they are common except for the modified base sequence, they are suitable for evaluating the influence of the modified base sequence, for example. Elimination of the cassette in the second allele minimizes the potential impact of the cassette. Seamless removal of the cassette in the second allele minimizes the potential impact of introducing additional new bases into the second allele.

別のある態様では、中間体細胞における第二のアレルにおけるカセット（第二のカセッ
ト）も、第二の改変塩基配列により置き換えられていてもよい。第二の改変配列は、改変
細胞に共通した配列（つまり、改変細胞にわたって同一配列）であってもよいし、水性組
成物毎に異なる配列であってもよい。場合によっては、第二の改変配列は、同一水性組成
物中においてさえ細胞毎に異なっていてもよい。また、第二の改変配列は、第一のアレル
における改変塩基配列（第一の改変塩基配列）と同一であってもよいし、異なっていても
よい。第二の改変塩基配列については、第一のアレルにおける改変塩基配列と同様に設計
し、第一のアレルへの改変塩基配列の導入と同様に導入することができる。第二のカセッ
トの第二の改変塩基配列への置換は、第二のライブラリ作製用ドナーＤＮＡの存在下で、
第二のカセット付近（好ましくは、第二のカセット内の標的核酸配列）を特異的に切断す
ればよい。本明細書では、第二の改変塩基配列の内容やその導入方法については、第一の
改変塩基配列の説明と同様であるので、この説明を参照し、ここでは説明を省略する。 In another aspect, the cassette in the second allele in the intermediate cell (second cassette) may also be replaced by the second modified nucleotide sequence. The second modified sequence may be a sequence common to the modified cells (ie, the same sequence across the modified cells) or may be a sequence that differs from aqueous composition to aqueous composition. In some cases, the second modified sequence may differ from cell to cell even within the same aqueous composition. Also, the second modified sequence may be the same as or different from the modified base sequence (first modified base sequence) in the first allele. The second modified base sequence can be designed in the same manner as the modified base sequence in the first allele and introduced in the same manner as the modified base sequence in the first allele. Substitution of the second modified base sequence of the second cassette in the presence of the second library preparation donor DNA,
The vicinity of the second cassette (preferably, the target nucleic acid sequence within the second cassette) may be specifically cleaved. In the present specification, the content of the second modified base sequence and the method of introducing it are the same as those described for the first modified base sequence, so this description will be referred to and the description will be omitted here.

応用例１
応用例１は、ゲノムの特定領域の解析への応用例である。本開示によれば、ゲノムの特
定領域の塩基配列に様々な変異をそれぞれ導入して、変異による当該特定領域の機能の獲
得または喪失を観察することによって、当該ゲノムの特定領域における重要塩基を特定す
ることが可能である。特定領域の例としては、機能が不明な領域、プロモーター領域、エ
ンハンサー領域、イントロンに該当する領域、５’非翻訳領域（ＵＴＲ）に該当する領域
、３’非翻訳領域（ＵＴＲ）に該当する領域、ノンコーディングＲＮＡをコードする領域
などが挙げられる。 Application example 1
Application 1 is an application to analysis of a specific region of the genome. According to the present disclosure, by introducing various mutations into the base sequence of a specific region of the genome and observing the gain or loss of function of the specific region due to mutation, important bases in the specific region of the genome are identified. It is possible to Examples of specific regions include regions of unknown function, promoter regions, enhancer regions, regions corresponding to introns, regions corresponding to 5′ untranslated regions (UTR), and regions corresponding to 3′ untranslated regions (UTR). , regions encoding non-coding RNAs, and the like.

応用例２
応用例２は、タンパク質またはＲＮＡの発現レベルの調節への応用例である。この応用
例では、タンパク質またはＲＮＡの転写制御領域（転写制御に関与していると疑われる領
域を含む）、および前記タンパク質の翻訳制御領域（翻訳制御に関与していると疑われる
領域を含む）等の前記タンパク質またはＲＮＡの発現レベルの制御に関与する領域または
関与すると疑われる領域を改変し、当該タンパク質の発現レベルを調節する。これにより
、応用例２では、タンパク質またはＲＮＡの発現量が調節された改変細胞を得ることがで
きる。調節は、発現量の増加または減少であり得る。ＲＮＡは、ｍＲＮＡ、ｔＲＮＡ、ｒ
ＲＮＡ、その他のノンコーディングＲＮＡ（例えば、マイクロＲＮＡ）であり得る。 Application example 2
Application 2 is an application to regulation of protein or RNA expression levels. In this application, the transcriptional control region of a protein or RNA (including a region suspected of being involved in transcriptional control) and the translational control region of said protein (including a region suspected of being involved in translational control) A region involved or suspected of being involved in regulation of the expression level of said protein or RNA is modified to regulate the expression level of said protein. Thus, in Application Example 2, a modified cell in which the expression level of protein or RNA is regulated can be obtained. Modulation can be an increase or decrease in expression levels. RNA is mRNA, tRNA, r
RNA, other non-coding RNA (eg, microRNA).

応用例３
応用例３は、タンパク質またはＲＮＡのコード領域への応用例である。本開示によれば
、タンパク質またはＲＮＡをコードする領域に様々な変異をそれぞれ導入して、変異によ
るタンパク質またはＲＮＡの機能改変（例えば、機能の獲得または喪失）を観察すること
によって、当該タンパク質またはＲＮＡの機能における重要アミノ酸または重要配列を特
定することが可能である。また、例えば、変異によるタンパク質またはＲＮＡの機能の獲
得または喪失を観察することにより、向上もしくは低減した機能または新しい機能を有す
る変異タンパク質またはＲＮＡおよび当該変異タンパク質またはＲＮＡを発現する改変細
胞を取得することができる。タンパク質またはＲＮＡの一部または全部に対して多様な改
変をする場合には、改変された部位を含む一部または全部をタンパク質またはＲＮＡをコ
ードする領域にインフレームでシームレスに連結することが望ましい。ＲＮＡは、ｍＲＮ
Ａ、ｔＲＮＡ、ｒＲＮＡ、その他のノンコーディングＲＮＡ（例えば、マイクロＲＮＡ）
であり得る。応用例３によれば、上記応用例２と組み合わせて、当該変異タンパク質また
はＲＮＡを発現する改変細胞であって、その発現レベルが調節された改変細胞を得ること
もできる。 Application example 3
Application Example 3 is an application example to a protein or RNA coding region. According to the present disclosure, by introducing various mutations into a region encoding a protein or RNA, respectively, and observing functional alteration (e.g., gain or loss of function) of the protein or RNA due to mutation, the protein or RNA It is possible to identify key amino acids or key sequences in the function of . Also, for example, by observing gain or loss of function of protein or RNA due to mutation, obtaining mutant protein or RNA having improved or reduced function or new function and modified cells expressing the mutant protein or RNA can be done. When various modifications are made to part or all of a protein or RNA, it is desirable to seamlessly ligate the part or all containing the modified site in-frame to the region encoding the protein or RNA. RNA is mRNA
A, tRNA, rRNA, other non-coding RNAs (e.g., microRNAs)
can be According to Application Example 3, in combination with Application Example 2, it is possible to obtain a modified cell that expresses the mutant protein or RNA and whose expression level is regulated.

応用例４
応用例５は、増殖力または生存力の高い細胞のスクリーニングへの応用である。本開示
によれば、細胞の増殖力または生存力に関与する可能性のあるゲノム領域について、異な
る変異を有する多様な改変細胞を含むライブラリーを得ることができる。そのようなライ
ブラリーは、各種細胞を含む水性組成物を別々に含むものであってもよいが、各種細胞を
含む水性組成物の混合物であってもよい。混合物中では、それぞれの改変細胞が同等量含
まれていることが好ましい。混合物を、細胞の培養に適した環境下または細胞への淘汰圧
存在下で培養することによって、増殖力または生存力の高い細胞は相対濃度を高め、増殖
力または生存力の低い細胞は相対濃度を低める。したがって、培養後、増殖力または生存
力の高い細胞が濃縮され、これらの細胞を取得できることに有利である。スクリーニング
は、特定の選択圧を負荷した条件下で行うこともできる。このようにすることで、当該特
定の選択圧に対して増殖力または生存力の高い細胞をスクリーニングすることができる。
選択圧としては、特に限定されないが例えば、貧栄養、高塩濃度、低塩濃度、高温、低温
、低酸素、薬物（例えば、毒物、抗生物質等の生理活性物質）の存在などが挙げられる。
ゲノム上の既存の遺伝子の改変は、当該遺伝子の改変遺伝子への置き換えにより行うこと
ができる。あるいは、セーフハーバー領域（例えば、ＡＡＶＳ１遺伝子座、ＲＯＳＡ２６
遺伝子座、ＣＬＢＹＬ遺伝子座、ＣＸＣＲ４遺伝子座、およびＣＣＲ５遺伝子座など）な
どに、単純に改変塩基配列を挿入することもできる。 Application example 4
Application Example 5 is an application to screening for cells with high proliferative ability or viability. According to the present disclosure, libraries can be obtained that contain diverse modified cells with different mutations in genomic regions that may be involved in cell proliferation or viability. Such a library may contain separate aqueous compositions containing each type of cell, but may also be a mixture of aqueous compositions containing each type of cell. Preferably, the mixture contains equal amounts of each modified cell. By culturing the mixture in an environment suitable for culturing cells or in the presence of selective pressure on the cells, the relative concentration of cells with high proliferation or viability is increased, and the relative concentration of cells with low proliferation or viability is reduced. lower. Therefore, after culturing, cells with high proliferation or viability are concentrated, and it is advantageous to be able to obtain these cells. Screening can also be performed under conditions that impose a certain selective pressure. In this way, cells with high proliferative or viable viability can be screened against the particular selective pressure.
Examples of selective pressure include, but are not limited to, poor nutrition, high salt concentration, low salt concentration, high temperature, low temperature, hypoxia, and the presence of drugs (eg, bioactive substances such as toxic substances and antibiotics).
An existing gene on the genome can be modified by replacing the gene with a modified gene. Alternatively, safe harbor regions (e.g., AAVS1 locus, ROSA26
It is also possible to simply insert the modified nucleotide sequence into the gene locus, CLBYL locus, CXCR4 locus, CCR5 locus, etc.).

１実施形態において、本発明は、染色体ゲノムの２つ以上のアレルが改変された細胞で
あって、当該２つ以上のアレルそれぞれにおいて相互に異なる（区別可能な）選択マーカ
ー遺伝子を有する、細胞を提供する。ある態様では、細胞は、単細胞生物の細胞であり得
る。ある態様では、細胞は、単離された細胞であり得る。ある態様では、細胞は、多能性
細胞、および多能性幹細胞（胚性幹細胞および誘導多能性幹細胞など）からなる群から選
択される細胞であり得る。ある態様では、細胞は、組織幹細胞であり得る。ある態様では
、細胞は、体細胞であり得る。ある態様では、細胞は、生殖系列細胞（例えば、生殖細胞
）であり得る。ある態様では、細胞は、細胞株であり得る。ある態様では、細胞は不死化
細胞であり得る。ある態様では、細胞はがん細胞であり得る。ある態様では、細胞は、非
がん細胞であり得る。ある態様では、細胞は、疾患患者の細胞であり得る。ある態様では
、細胞は健常者の細胞であり得る。ある態様では、細胞は、動物細胞（例えば、ヒト細胞
）、例えば、昆虫細胞（例えば、カイコ細胞）、ＨＥＫ２９３細胞、ＨＥＫ２９３Ｔ細胞
、Ｅｘｐｉ２９３Ｆ（商標）細胞、ＦｒｅｅＳｔｙｌｅ（商標）２９３Ｆ細胞、チャイニ
ーズハムスター卵巣細胞（ＣＨＯ細胞）、ＣＨＯ－Ｓ細胞、ＣＨＯ－Ｋ１細胞、およびＥ
ｘｐｉＣＨＯ細胞、ならびにこれらの細胞からの派生細胞からなる群から選択される細胞
であり得る。ある好ましい態様では、上記細胞においては、染色体ゲノムの標的領域のす
べてのアレルが改変され、改変後の領域は、それぞれ相互に異なる（区別可能な）選択マ
ーカー遺伝子を有する。 In one embodiment, the present invention provides a cell in which two or more alleles of the chromosomal genome are modified, the cell having a mutually different (distinguishable) selectable marker gene in each of the two or more alleles. offer. In some aspects, the cell can be a cell of a unicellular organism. In some aspects, the cells can be isolated cells. In some embodiments, the cells can be cells selected from the group consisting of pluripotent cells and pluripotent stem cells, such as embryonic stem cells and induced pluripotent stem cells. In some aspects, the cells can be tissue stem cells. In some aspects, the cell can be a somatic cell. In some aspects, the cells can be germline cells (eg, germ cells). In some aspects, the cell can be a cell line. In some aspects, the cells can be immortalized cells. In some embodiments, the cells can be cancer cells. In some embodiments, the cells can be non-cancer cells. In some aspects, the cells can be cells of a diseased patient. In some embodiments, the cells can be cells from healthy individuals. In some aspects, the cells are animal cells (e.g., human cells), e.g., insect cells (e.g., silkworm cells), HEK293 cells, HEK293T cells, Expi293F™ cells, FreeStyle™ 293F cells, Chinese hamster ovary cells. (CHO cells), CHO-S cells, CHO-K1 cells, and E
xpiCHO cells, as well as cells derived from these cells. In a preferred embodiment, in the cell, all alleles of the target region of the chromosomal genome are modified, and the modified regions have mutually different (distinguishable) selectable marker genes.

１実施形態において、染色体ゲノムの２つ以上のアレルが改変された細胞であって、当
該２つ以上のアレルそれぞれにおいて相互に異なる（区別可能な）選択マーカー遺伝子を
有する、細胞の培養方法が提供される。選択マーカー遺伝子が、薬剤耐性マーカー遺伝子
である場合には、培養はそれぞれの薬剤耐性マーカー遺伝子に対する薬剤の存在下で培養
され得る。培養は、細胞の維持または増殖に適した条件下で行われ得る。 In one embodiment, a method for culturing cells is provided in which two or more alleles of the chromosomal genome are modified, and the two or more alleles have mutually different (distinguishable) selectable marker genes. be done. When the selectable marker gene is a drug resistance marker gene, the culture can be grown in the presence of drugs directed against the respective drug resistance marker gene. Culturing may be performed under conditions suitable for cell maintenance or growth.

１実施形態において、本発明は、２つ以上のアレルが改変された染色体ゲノムを有する
非ヒト生物であって、当該２つ以上のアレルそれぞれにおいて相互に異なる（区別可能な
）選択マーカー遺伝子を有する、非ヒト生物が提供される。ある態様では、細胞は、単細
胞生物の細胞であり得る。ある態様では、非ヒト生物は、酵母（例えば、分裂酵母または
出芽酵母、例えば、サッカロミセス・セレビシエ（Ｓａｃｃｈａｒｏｍｙｃｅｓｃｅｒ
ｅｖｉｓｉａｅ）、サッカロミセス・カールスベルゲンシス（Ｓａｃｃｈａｒｏｍｙｃｅ
ｓｃａｒｌｓｂｅｒｇｅｎｓｉｓ）、サッカロミセス・フラギリス（Ｓａｃｃｈａｒｏ
ｍｙｃｅｓｆｒａｇｉｌｉｓ）、サッカロミセス・ルーキシー（Ｓａｃｃｈａｒｏｍｙ
ｃｅｓｒｏｕｘｉｉ）などのサッカロミセス属、キャンディダ・ユーティリス（Ｃａｎ
ｄｉｄａｕｔｉｌｉｓ）、キャンディダ・トロピカリス（Ｃａｎｄｉｄａｔｒｏｐｉ
ｃａｌｉｓ）などのキャンディダ属、ピキア（Ｐｉｃｈｉａ）属、クリベロマイセス（Ｋ
ｌｕｙｖｅｒｏｍｙｃｅｓ）属、ヤロイワ（Ｙａｒｒｏｗｉａ）属、ハンゼニュラ（Ｈａ
ｎｓｅｎｕｌａ）属、エンドマイセス（Ｅｎｄｏｍｙｃｅｓ）属などの酵母からなる群か
ら選択される生物であり得る。ある態様では、非ヒト生物は、糸状菌（例えば、Ａｓｐｅ
ｒｇｉｌｌｕｓ，Ｔｒｉｃｈｏｄｅｒｍａ，Ｈｕｍｉｃｏｌａ，Ａｃｒｅｍｏｎｉｕｍ，
Ｆｕｓａｒｉｕｍ，及びＰｅｎｉｃｉｌｌｉｕｍ種）であり得る。ある態様では、非ヒト
生物は、は、多細胞生物であり得る。ある態様では、非ヒト生物は、非ヒト動物であり得
る。ある態様では、非ヒト生物は、植物であり得る。ある好ましい態様では、上記非ヒト
生物では、染色体ゲノムの標的領域のすべてのアレルが改変され、改変後の領域は、それ
ぞれ相互に異なる（区別可能な）選択マーカー遺伝子を有する。 In one embodiment, the present invention provides a non-human organism having a chromosomal genome in which two or more alleles are modified, wherein the two or more alleles each have a different (distinguishable) selectable marker gene. , a non-human organism is provided. In some aspects, the cell can be a cell of a unicellular organism. In certain aspects, the non-human organism is a yeast (e.g., fission yeast or budding yeast, e.g., Saccharomyces cerevisiae).
evisiae), Saccharomyces carlsbergensis
scarlsbergensis), Saccharomyces fragilis (Saccharo
myces fragilis), Saccharomyces rouxii (Saccharomyces)
ces rouxii), Candida utilis (Can
dida utilis, Candida tropicalis
calis), Pichia, Kluyveromyces (K
luyveromyces, Yarrowia, Hansenula (Ha
nsenula), Endomyces, etc., selected from the group consisting of yeast. In some aspects, the non-human organism is a filamentous fungus (e.g., Aspe
rgillus, Trichoderma, Humicola, Acremonium,
Fusarium, and Penicillium species). In some aspects, a non-human organism can be a multicellular organism. In some aspects, the non-human organism can be a non-human animal. In some aspects, the non-human organism can be a plant. In a preferred embodiment, in the non-human organism, all alleles of the target region of the chromosomal genome are modified, and the modified regions each have mutually different (distinguishable) selectable marker genes.

上記細胞では、細胞の生存や増殖に必要な遺伝子などの所望の遺伝子の１以上が、染色
体ゲノムの他の領域に含まれ、または集められていてもよい。他の領域は、例えば、セー
フハーバー領域（例えば、ＡＡＶＳ１遺伝子座、ＲＯＳＡ２６遺伝子座、ＣＬＢＹＬ遺伝
子座、ＣＸＣＲ４遺伝子座、およびＣＣＲ５遺伝子座など）であり得る。他の領域は、例
えば、欠失を有する上記（ｉｉ）の領域であり得る。 In such cells, one or more desired genes, such as genes required for cell survival or proliferation, may be contained or clustered in other regions of the chromosomal genome. Other regions can be, for example, safe harbor regions such as the AAVS1 locus, ROSA26 locus, CLBYL locus, CXCR4 locus, and CCR5 locus. The other region can be, for example, the region of (ii) above that has a deletion.

図１に示されるように、改変前細胞（リファレンス細胞という）から第一の中間体細胞
を得る。改変前細胞から第一の中間体細胞を得る工程を工程Ｓ１と呼ぶ。第一の中間体細
胞から第一の改変細胞のライブラリー（以下単に「第一のライブラリー」という）を作製
することができる。第一の中間体細胞から第一のライブラリーを得る工程を工程Ｓ２と呼
ぶ。第一の中間体細胞から第二の中間体細胞を得ることができる。第一の中間体細胞から
第二の中間体細胞を得る工程を工程Ｓ３と呼ぶ。第二の中間体細胞から第二の改変細胞の
ライブラリー（以下単に「第二のライブラリー」という）を作製することができる。第二
の中間体細胞から第二のライブラリーを得る工程を工程Ｓ４と呼ぶ。 As shown in Figure 1, a first intermediate cell is obtained from a pre-modified cell (referred to as a reference cell). The step of obtaining the first intermediate cell from the pre-modification cell is called step S1. A library of first modified cells (hereinafter simply referred to as the "first library") can be generated from the first intermediate cells. The step of obtaining the first library from the first intermediate cell is called step S2. A second intermediate cell can be obtained from the first intermediate cell. The step of obtaining the second intermediate cell from the first intermediate cell is called step S3. A second library of modified cells (hereinafter simply referred to as the "second library") can be generated from the second intermediate cells. The step of obtaining the second library from the second intermediate cells is called step S4.

リファレンス細胞を用意する。リファレンス細胞は、ライブラリー化したい細胞である
。リファレンス細胞は、例えば、真核細胞であり、本開示のライブラリーの作製に用いる
ことができる。リファレンス細胞は、天然の細胞であってもよいが、改変を有する細胞で
あってもよい。リファレンス細胞は、典型的には２倍体であるが、３倍体以上であっても
よい。 Prepare reference cells. A reference cell is a cell to be made into a library. Reference cells are eukaryotic cells, for example, and can be used to generate the libraries of the present disclosure. A reference cell may be a naturally occurring cell, but may also be a cell with modifications. Reference cells are typically diploid, but may be triploid or higher.

例えば、図２に示されるように第一の中間体細胞を作製することができる。具体的には
、リファレンス細胞のゲノム上の標的領域を、選択マーカー遺伝子を含むカセットで置換
する。図２中では、２種類のドナーＤＮＡと標的領域とを相同組換えに供する。ドナーＤ
ＮＡは、それぞれ区別可能に異なる薬剤選択マーカー遺伝子（ポジティブ選択用）と区別
可能に異なる可視化マーカー遺伝子（ネガティブ選択用）を含み、それぞれ両端にＣＲＩ
ＳＰＲ／Ｃａｓ９による標的配列（ｇＲＮＡ１～４）を有する。相同組換え後、２種類の
ドナーＤＮＡに由来する断片がそれぞれ父由来および母由来の染色体上の標的配列を置き
換わった細胞を選択するために、２種類の薬剤により薬剤選択を行う（図２の工程（１）
参照）。選択後生存する細胞は、２アレルの標的領域にそれぞれ区別可能に異なる薬剤耐
性遺伝子を有する細胞（第一の中間体細胞）である。第一の中間体細胞は、シングルセル
クローニングに供することができる。目的の位置に選択マーカー遺伝子を含むカセットが
挿入されているかを確認しておくことができる。 For example, a first intermediate cell can be made as shown in FIG. Specifically, a target region on the genome of a reference cell is replaced with a cassette containing a selectable marker gene. In FIG. 2, two donor DNAs and a target region are subjected to homologous recombination. Donor D
NAs each contain a distinguishably different drug selectable marker gene (for positive selection) and a distinguishably different visualization marker gene (for negative selection), each flanked by a CRI
It has a target sequence (gRNA1-4) by SPR/Cas9. After homologous recombination, drug selection is performed with two drugs in order to select cells in which fragments derived from the two donor DNAs have replaced the target sequences on the paternal and maternal chromosomes, respectively (Fig. 2). Step (1)
reference). Cells that survive the selection are cells (first intermediate cells) that have distinguishably different drug resistance genes in the two alleles of the target region. A first intermediate cell can be subjected to single cell cloning. It is possible to confirm whether a cassette containing a selection marker gene has been inserted at the intended position.

次に、父由来または母由来のカセットいずれか一方を除去する。例えば、図２に示され
るように、父由来の標的領域を置き換えたカセットの両端に位置する標的配列（ｇＲＮＡ
１およびｇＲＮＡ２）を、カセット除去用ドナーＤＮＡの存在下で、ＣＲＩＳＰＲ／Ｃａ
ｓ９システムにより切断することができる。カセット除去用ドナーＤＮＡは、父由来アレ
ル上の標的領域の上流と相同組換え可能な上流ホモロジーアームと当該標的領域の下流と
組換え可能な下流ホモロジーアームからなる。その後、ＧＦＰシグナルが消失した細胞を
セルソーターで選別することで、父由来アレルにおいて標的領域の上流と下流とがシーム
レスに連結した（すなわち、標的領域の全体が脱落した）ゲノムを有する第二の中間体細
胞を得ることができる。 Either the paternal or maternal cassette is then removed. For example, as shown in FIG. 2, target sequences (gRNA
1 and gRNA 2) were subjected to CRISPR/Ca in the presence of donor DNA for cassette removal.
It can be cut by the s9 system. The donor DNA for cassette removal consists of an upstream homology arm capable of homologous recombination with the upstream of the target region on the paternal allele and a downstream homology arm capable of recombination with the downstream of the target region. Then, by sorting cells in which the GFP signal has disappeared, a second intermediate having a genome in which the upstream and downstream of the target region are seamlessly linked in the paternal allele (that is, the entire target region has been dropped) is selected by using a cell sorter. Somatic cells can be obtained.

第二の中間体細胞から第二の改変細胞のライブラリー（第二のライブラリー）を作製す
ることができる。第二の中間体細胞の母由来カセットの両端に位置する標的配列（ｇＲＮ
Ａ３およびｇＲＮＡ４）を、改変塩基配列導入用ドナーＤＮＡ（ライブラリー作製用ドナ
ーＤＮＡ）の存在下で、ＣＲＩＳＰＲ／Ｃａｓ９システムにより切断することができる。
改変塩基配列導入用ドナーＤＮＡは、母由来アレル上の標的領域の上流と相同組換え可能
な上流ホモロジーアームと当該標的領域の下流と組換え可能な下流ホモロジーアームを有
し、上流ホモロジーアームと下流ホモロジーアームの間に、改変塩基配列を含む。このよ
うにすることで、母由来アレルにおいて、標的配列の上流と下流との間に改変塩基配列を
含むゲノムを有する改変細胞が得られる。異なる改変塩基配列を有する改変塩基配列導入
用ドナーＤＮＡを用意しておくことで、異なる改変塩基配列を有する改変細胞を得ること
ができ、これにより第二のライブラリーを得ることができる。 A second library of modified cells (a second library) can be generated from the second intermediate cells. The target sequences (gRN
A3 and gRNA4) can be cleaved by the CRISPR/Cas9 system in the presence of donor DNA for introducing modified nucleotide sequences (donor DNA for library preparation).
The donor DNA for introducing the modified base sequence has an upstream homology arm capable of homologous recombination with the upstream of the target region on the maternal allele and a downstream homology arm capable of recombination with the downstream of the target region, and the upstream homology arm and the downstream A modified base sequence is included between the homology arms. In this way, a modified cell is obtained that has a genome containing a modified base sequence between the upstream and downstream of the target sequence in the maternal allele. By preparing modified nucleotide sequence-introducing donor DNAs having different modified nucleotide sequences, modified cells having different modified nucleotide sequences can be obtained, thereby obtaining a second library.

改変塩基配列導入用ドナーＤＮＡは、上述のように１以上の改変部分（Ａ）と１以上の
非改変部分（Ｂ）とからなっており、改変部分（Ａ）以外は、改変前のゲノムの当該領域
の配列と同じ配列を有していることができる。 The donor DNA for introducing the modified nucleotide sequence consists of one or more modified portions (A) and one or more unmodified portions (B) as described above. It can have the same sequence as the sequence of the region.

Claims

A library of modified cells,
the library comprises combinations of a plurality of aqueous compositions;
each aqueous composition comprising one type of modified cell;
Each modified cell has a first allele and a second allele at the locus to be modified,
Each of the modified cells has at the same position of the first allele a cassette containing DNA fragments that differ from each other among the aqueous compositions , said locus of the modified cell being the recognition site for the site-specific recombination enzyme or the recombination sequence. does not contain
Library.

2. The library of claim 1 , wherein each modified cell has a disruption or deletion of part or all of the target region in the second allele.

3. The library of claim 2, wherein the second allele seamlessly lacks said part or all.

2. The library of claim 1, wherein the sequences other than the cassette containing the DNA fragment of each modified cell are substantially the same before and after modification.

Each of the sequences of said cassette consists of one or more modified portions (A) and one or more unmodified portions (B), each of said modified portions (A) consisting of sequence insertions, deletions and substitutions. having one or more modifications selected from the group, the modifications of the one or more modified portions differing between each aqueous composition with respect to the location or content of the modifications, and the one or more unmodified portions (B) each It is identical to the sequence of the corresponding site before modification, and the unmodified portion (B1) on the centromere side of the cassette is seamlessly linked to the adjacent sequence (C1) on the centromere side of the cassette, and the cassette The unmodified portion (Bt) on the telomere side is seamlessly linked to the flanking sequence (C2) on the telomere side of the cassette, and the region where the flanking sequence (C1) and the unmodified portion (B1) are linked, and the non- The library according to claim 1, wherein the region where the modified portion (Bt) and the flanking sequence (C2) are linked constitutes the same sequence as the sequence of the corresponding region before modification.

2. The library according to claim 1, wherein the library contains 50 or more aqueous compositions .

A method of producing a library of modified cells, comprising:
(α) in a cell having a genome containing a first allele and a second allele at the genetic locus to be modified, a cassette containing a selection marker gene and a target nucleic acid sequence for each of the first allele and the second allele; providing a population of cells having
Here, the selectable marker gene possessed by the first allele and the selectable marker gene possessed by the second allele are distinguishably different, the target nucleic acid sequence is the target of the genome modification system, and the genome modification system selects the first marker gene. each selectable marker gene is a negative selectable marker gene that can be used for negative selection, and is designed so that the allele and the second allele can be distinguished from each other;
(β) introducing the following (x) and (y) into the provided group of cells;
(x) a genome modification system comprising a sequence-specific nucleic acid cleaving molecule targeting the unique base sequence contained in the first allele, or a polynucleotide encoding the sequence-specific nucleic acid cleaving molecule;
(y) a plurality of types of second recombination donor DNA {wherein the plurality of types of second recombination donor DNA are homologous to the base sequence adjacent to the upstream side of the target site in (x) above, respectively; and a downstream homology arm having a nucleotide sequence homologous to the nucleotide sequence adjacent to the target region downstream, and a modified base between the upstream homology arm and the downstream homology arm sequence, wherein the modified nucleotide sequence is different for each second recombination donor DNA and unique to each second recombination donor DNA},
(γ) selecting cells that do not express the selectable marker gene contained in the first allele after the step (β);
including
Thereby, a library of modified cells comprising a plurality of cells can be obtained, wherein in the plurality of cells obtained, the first allele has a modified nucleotide sequence unique to each cell, and the second allele has a common sequence between cells,
Method.

8. The method of claim 7 , wherein prior to step (a),
In a cell having a genome containing a first allele and a second allele at a genetic locus to be modified, a cassette containing a selection marker gene and a target nucleic acid sequence for the substituted sequences contained in the first allele and the second allele thereby removing the substituted sequence from the first allele and the second allele;
The method further comprising:

9. The method of claim 8 , wherein
Each modified nucleotide sequence of the first allele has one or more mutations selected from the group consisting of addition, insertion, substitution, deletion, and deletion of bases relative to the substituted sequence of the first allele,
Method.

9. The method of claim 8 , wherein
The sequence to be modified is a protein coding region,
The modified base sequence of the first allele has one or more mutations selected from the group consisting of base additions, insertions, substitutions, deletions, and deletions with respect to the substituted sequence of the first allele. .

11. The method according to claim 9 or 10 , wherein the modified base sequence has 80% or more sequence identity with the modified sequence.

8. The method of claim 7 , wherein between steps (α) and (β),
The method further comprising removing the cassette from the second allele.

13. The method of claim 12 , wherein the cassette is seamlessly removed.

8. A modified cell library comprising a plurality of modified cells produced by the method of claim 7 .

9. A modified cell library comprising a plurality of modified cells produced by the method of claim 8.

10. A modified cell library comprising a plurality of modified cells produced by the method of claim 9.

11. A modified cell library comprising a plurality of modified cells produced by the method of claim 10.

12. A modified cell library comprising a plurality of modified cells produced by the method of claim 11.

13. A modified cell library comprising a plurality of modified cells produced by the method of claim 12.

14. A modified cell library comprising a plurality of modified cells produced by the method of claim 13.