CN118284434A

CN118284434A - Compositions and methods for skipping exon 45 in duchenne muscular dystrophy

Info

Publication number: CN118284434A
Application number: CN202280066123.8A
Authority: CN
Inventors: 李翔; 钱自清; 马赫布贝·海拉瓦迪; 周明; 马斯韦塔·吉尔根拉特; 内尔萨·埃斯特雷拉; 苏雷什·佩德迪加里; 阿努什里·帕塔克; 纳塔拉詹·塞瑟拉曼; 连文龙; 刘楠君
Original assignee: Ant Rada Therapeutics Ltd By Share Ltd
Current assignee: Ant Rada Therapeutics Ltd By Share Ltd
Priority date: 2021-09-01
Filing date: 2022-08-30
Publication date: 2024-07-02

Abstract

Described herein in various embodiments are compositions comprising (a) a Cell Penetrating Peptide (CPP) sequence; and (b) an antisense compound, wherein the antisense compound targets exon 45 of the DMD gene.

Description

Compositions and methods for skipping exon 45 in duchenne muscular dystrophy

The disclosure claims priority from U.S. provisional application Ser. No. 63/244,915, U.S. provisional application Ser. No. 63/337,574, U.S. provisional application Ser. No. 63/354,454, U.S. provisional application Ser. No. 63/239,671, U.S. provisional application Ser. No. 63/290,960, U.S. provisional application Ser. No. 63/298,565, U.S. provisional application Ser. No. 63/268,577, U.S. provisional application Ser. No. 63/298,565, and U.S. provisional application Ser. No. 63/268,577, U.S. provisional application Ser. No. 63/268,565, and U.S. provisional application Ser. No. 63/268,577, U.S. 25, 2022, and 2022, respectively.

Background

Duchenne muscular dystrophy (Duchenne Muscular Dystrophy, DMD) is a genetic disorder characterized by progressive muscle degeneration and weakness due to alterations in the protein dystrophin. Genetic modification in DMD, the gene encoding dystrophin, results in DMD. These genetic modifications shift the reading frame of DMD resulting in a non-functionally truncated DMD protein. One method of treating DMD patients entails delivering to the patient a compound that restores the reading frame of the DMD. Antisense compounds can restore the reading frame of DMD by skipping internal exons associated with the shift in the reading frame of DMD, which results in a non-functional truncated DMD protein. Exon skipping produces dystrophin, which retains lost function in the disease state.

DMD is caused by mutations in one or more of several exons. For example, 8.1% of DMD patients have a mutation in DMD exon 45. Aartsma-Rus. Human Mutation, volume 30, no.3, 293-299 (2009). Antisense oligonucleotide cassie Mo Sen (casimersen) has been approved for exon 45 skipping, but this drug is less effective, probably due to lower intracellular delivery of the therapeutic agent.

The limited ability to access intracellular compartments is an important issue for antisense oligonucleotide therapeutics such as cassi Mo Sen. Attempts have been made to improve the intracellular delivery of antisense compounds, for example using carrier systems such as polymers, cationic liposomes or by chemical modification of the construct, for example by covalent attachment of cholesterol molecules. However, intracellular delivery efficiency remains low, and improved delivery systems are still needed to increase the efficacy of these antisense compounds.

The need for an effective composition for delivering antisense compounds to intracellular compartments to treat DMD has not been met.

Disclosure of Invention

Described herein are compositions for delivering nucleic acids. In embodiments, the nucleic acid is an Antisense Compound (AC). In embodiments, the antisense compound targets exon 45 in a subject with Duchenne Muscular Dystrophy (DMD).

In an embodiment, there is provided a compound comprising:

(a) Cyclic peptides (also referred to herein as cell penetrating peptides or "CPPs"); and

(B) An Antisense Compound (AC) complementary to a target sequence of a DMD gene in a pre-mRNA sequence, wherein the target sequence comprises at least a portion of a 5 'flanking intron of exon 45, at least a portion of a 3' flanking intron of exon 45, or a combination thereof.

In embodiments, the AC comprises at least one modified nucleotide or nucleic acid selected from Phosphorothioate (PS) nucleotides, phosphorodiamidate morpholino (phosphorodiamidate morpholino, PMO) nucleotides, locked Nucleic Acids (LNA), peptide Nucleic Acids (PNA), nucleotides comprising a 2' -O-methyl (2 ' -OMe) modified backbone, 2' O-methoxy-ethyl (2 ' -MOE) nucleotides, 2',4' constrained ethyl (cEt) nucleotides, and 2' -deoxy-2 ' -fluoro- β -D-arabinonucleic acid (2 ' f-ANA). In embodiments, the AC comprises at least one PMO (e.g., ,1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40、41、42、43、44、45、46、47、48、49 or 50 PMOs, including all ranges therein). In embodiments, each nucleotide in AC is PMO.

In embodiments, the AC comprises the following sequence: 5'-ATGCCATCCTGGAGTTCCTGTA-3'. In embodiments, the AC comprises the following sequence: 5'-CCCAATGCCATCCTGGAGTTCCT-3'.

In embodiments, the cyclic peptide is FGFGRGRQ. In embodiments, the cyclic peptide is GfFGrGrQ. In an embodiment, the cyclic peptide is Ff Φ GRGRQ.

In embodiments, the EEV is: ac-PKKKRKV-AEEA-Lys- (cyclo [ FGFGRGRQ ]) -PEG12-OH.

In embodiments, the present disclosure provides pharmaceutical compositions comprising the compounds described herein.

In embodiments, the present disclosure provides cells comprising a compound described herein.

In embodiments, the present disclosure provides a method of treating DMD comprising administering a compound described herein to a patient in need thereof.

Drawings

FIGS. 1A and 1B show conjugation chemistry for linking an Antisense Compound (AC) to a Cell Penetrating Peptide (CPP).

FIGS. 2A-2B show a method for linking Cell Penetrating Peptides (CPPs) (e.g.)Shown) with an Antisense Compound (AC), wherein the CPP comprises a PEG4 linker and shows an AC without a linker containing a polyethylene glycol (PEG 2 or miniPEG) moiety (fig. 2A) and an AC with a linker containing a polyethylene glycol (PEG 2 or miniPEG) moiety (fig. 2B). In the figure, "R" represents palmitoyl.

FIG. 3 shows an example of an Endosomal Escape Vector (EEV) design using a representative CPP. It should be understood that a CPP may include any of the CPPs disclosed herein.

FIG. 4 shows exon 45 skipping efficiencies at 5. Mu.M and 10. Mu.M for the selected ACs in Table 8.

Fig. 5A-5D show exon skipping assessment in triceps (fig. 5A), tibialis anterior (fig. 5B), diaphragm (fig. 5C) and heart (fig. 5D) 1 week after administration of (intravenous) EEV-PMO-MDX23-1, as assessed by two-step RT-PCR.

Fig. 6A-6D show duration of action (1 week, 2 weeks, 4 weeks, and 8 weeks) of triceps (fig. 6A), tibialis anterior (fig. 6B), diaphragm (fig. 6C), and heart (fig. 6D) following intravenous administration of 80mpk EEV-PMO-MDX 23-1.

FIG. 7 depicts the results after intravenous administration of 40mpk EEV-PMO-MDX23-1 (4 weekly doses).

Fig. 8 shows d2.Mdx line hanging data (WIRE HANG DATA). After 12 weeks of treatment, the line-up time of animals treated with EEV-PMO-MDX23-180mpk Q2W was statistically indistinguishable from WT animals. DBA WT vehicle (saline); d2.mdx vehicle (saline); d2.mdxEEV-PMO-MDX 23-1; d2.mdxEEV-PMO-MDX 23-2; D2.mdxPMO-MDX 23 (5'-GGCCAAACCTCGGCTTACCTGAAAT-3')

Fig. 9A to 9B show creatine kinase activity in D2MDX mice before (fig. 9A) and 4 weeks after (fig. 9B) administration.

Fig. 10A-10B show creatine kinase activity in serum of D2MDX mice 8 weeks (fig. 10A) and 12 weeks (fig. 10B) after dosing.

FIGS. 11A-11B, grip of D2MDX treated with EEV-PMO-MDX23-3 before (FIG. 11A) and 12 weeks after (FIG. 11B).

Fig. 12A-12D show exon skipping efficiencies in the diaphragm (fig. 12A), heart (fig. 12B), biceps (fig. 12C), and tibialis anterior (fig. 12D) of hmds mice injected with positive controls of 40, 60, or 80 mpk.

Fig. 13A-13C show exon skipping in tibialis anterior (fig. 13A), diaphragm (fig. 13B) and heart (fig. 13C) as detected by 1-STEP RT-PCR.

Figures 14A-14C show exon skipping in tibialis anterior (figure 14A), diaphragm (figure 14B) and heart (figure 14C) of hmd mice 1 week after injection of 60mpk positive control (EEV-PMO-DMD 45-1) or candidate PMO conjugated to EEV-2.

FIGS. 15A-15B show an assessment of DMD45 jump in myoblasts of DMD Δ46-48 iPSC origin treated with 30. Mu.M of Kaxi Mo Sen conjugated with EEV-2 (EEV-PMO-DMD 45-1) or one of 10 EEV-PMO compounds.

Fig. 15C-15E show the assessment of DMD45 jump in human heart cells treated with three EEV-PMO compounds.

FIG. 16 shows the result of CTGlo cell viability measurements of EEVs PMO EEV-PMO-DMD45-2, EEV-PMO-DMD45-3, EEV-PMO-DMD45-4, EEV-PMO-DMD 45-5. Normalized to melittin positive control.

FIG. 17 shows CTGlo cell viability assays for EEVs PMO EEV-PMO-DMD45-6, EEV-PMO-DMD45-7, EEV-PMO-DMD45-8, EEV-PMO-DMD 45-9. Normalized to melittin positive control.

FIG. 18 shows the results of the cell viability assays for EEVs PMO EEV-PMO-DMD45-10, EEV-PMO-DMD45-11 and positive controls CTGlo. Normalized to melittin positive control.

FIG. 19 shows the structure of EEV-PMO-DMD-45-5.

FIG. 20 shows the structure of EEV-PMO-DMD-45-7.

FIGS. 21A-21D illustrate an exemplary EEV-PMO synthesis scheme.

FIGS. 22A-22C show localization of PMO and EEV-NLS-PMO in THP cells as determined by LC-MS/MS: whole cell uptake (fig. 22A); subcellular localization (fig. 22B); and nuclear uptake (fig. 22C).

Detailed Description

Compounds of formula (I)

Disclosed herein in various embodiments are compounds for use in the treatment of Duchenne Muscular Dystrophy (DMD). In embodiments, DMD is caused by a mutation in exon 45. In embodiments, the compound is designed to deliver an Antisense Compound (AC) complementary to a target sequence of a DMD gene in a pre-mRNA sequence, wherein the target sequence comprises at least a portion of a 5 'flanking intron of exon 45, at least a portion of a 3' flanking intron of exon 45, or a combination thereof. In embodiments, the compound is delivered to a subject in need thereof. In embodiments, the compound delivers AC complementary to a target sequence comprising an intron-exon junction of exon 45 of the DMD gene. In embodiments, the compound delivers AC complementary to a target sequence comprising an intron nucleotide sequence upstream (or 5') of exon 45 of the DMD gene.

In embodiments, the compound alters the splicing pattern of the target pre-mRNA that binds to AC, resulting in the formation of a re-spliced target protein. In one embodiment, the re-spliced target protein has increased function as compared to a target protein produced by splicing target pre-mRNA in the absence of AC. In embodiments, re-splicing the target protein increases the function of the target protein by about 1%, about 5%, about 10%, about 15%, about 20%, about 25%, about 30%, about 35%, about 40%, about 45%, about 50%, about 55%, about 60%, about 65%, about 70%, about 75%, about 80%, about 85%, about 90%, about 95%, about 99%, about 100%, about 150%, about 200%, about 250%, about 300%, about 350%, about 400%, about 450%, about 500% or more, including all values and ranges therebetween, as compared to the function of the target protein produced by splicing. In embodiments, the re-spliced target protein restores function to about 1%, about 5%, about 10%, about 15%, about 20%, about 25%, about 30%, about 35%, about 40%, about 45%, about 50%, about 55%, about 60%, about 65%, about 70%, about 75%, about 80%, about 85%, about 90%, about 95%, about 99%, or about 100% of the function of the wild-type target protein, including all values and ranges therebetween.

In embodiments, the compounds disclosed herein comprise an AC moiety and a cyclic peptide moiety (also referred to as a Cell Penetrating Peptide (CPP) moiety) that facilitate intracellular delivery of AC. In embodiments, the compound is capable of penetrating the cell membrane and binding to the target pre-mRNA in vivo. In embodiments, the compound comprises: a) At least one cyclic peptide; and b) at least one AC, wherein the cyclic peptide is directly or indirectly coupled to the AC. In embodiments, the compound comprises 1,2, 3, 4, 5, 6, 7, 8, 9, 10 or more AC moieties. In embodiments, the compound comprises 1,2, 3, 4, 5, 6, 7, 8, 9, 10 or more cyclic peptides. In embodiments, the compound comprises one AC moiety. In embodiments, the compound comprises two AC moieties. As used herein, "coupling" may refer to covalent or non-covalent association between a cyclic peptide and AC, including fusion of the cyclic peptide to AC, and chemical conjugation of the cyclic peptide to AC. A non-limiting example of a method of non-covalently attaching a cyclic peptide to AC is by streptavidin/biotin interaction, for example by conjugating biotin to the cyclic peptide and fusing streptavidin to AC. In the resulting compounds, the CPP is coupled to the AC via non-covalent association between biotin and streptavidin.

In embodiments, the cyclic peptide is conjugated directly or indirectly to AC, thereby forming a cyclic peptide-AC conjugate. Conjugation of AC to CPP may occur at any suitable site on these moieties. For example, the 5 'or 3' end of the AC may be conjugated to the C-terminus, N-terminus, or side chain of an amino acid in the CPP.

In embodiments, AC is covalently linked to a cyclic peptide. As used herein, covalent linkage refers to a construct in which the CPP moiety is covalently linked to the 5 'and/or 3' end of the AC moiety. Such conjugates may alternatively be described as having a cell penetrating moiety and an oligonucleotide moiety. Methods of covalent bonding are well known in the art. According to certain embodiments of the present disclosure, a covalently linked AC-cyclopeptide conjugate includes an AC component and a cyclopeptide component associated with each other through a linker.

In embodiments, AC may be chemically conjugated to the cyclic peptide through a moiety on the 5 'or 3' end of AC. In yet other embodiments, AC may be conjugated to the cyclic peptide through a side chain of an amino acid on the cyclic peptide. Any amino acid side chain on the cyclic peptide that is capable of forming a covalent bond or that can be so modified can be used to attach AC to the cyclic peptide. The amino acid on the cyclic peptide may be a natural or unnatural amino acid. In embodiments, the amino acid on the cyclic peptide used to conjugate AC is aspartic acid, glutamic acid, glutamine, asparagine, lysine, ornithine, 2, 3-diaminopropionic acid or an analog thereof, wherein the side chain is substituted with a bond to AC or a linker. In embodiments, the amino acid is lysine or an analog thereof. In embodiments, the amino acid is glutamic acid or an analog thereof. In embodiments, the amino acid is aspartic acid or an analog thereof.

Endosome escape carrier (EEV)

Provided herein are Endosomal Escape Vectors (EEVs) that can be used to transport AC across a cell membrane, e.g., to deliver AC to the cytoplasm or nucleus of a cell. EEV may comprise a Cell Penetrating Peptide (CPP), for example, a cyclic cell penetrating peptide (cCPP) conjugated to an Exocyclic Peptide (EP). EP is interchangeably referred to as a regulatory peptide (MP). The EP may comprise a sequence of Nuclear Localization Signals (NLS). The EP may be coupled with AC. EP's may be coupled to cCPP. EP's can be coupled to AC's and cCPP's. EP, AC, cCPP or combinations thereof may be non-covalent or covalent. The EP may be attached to the N-terminus of cCPP by a peptide bond. The EP may be attached to the C-terminal end of cCPP by a peptide bond. EP may be attached to cCPP by a side chain of an amino acid in cCPP. EP may be attached to cCPP through the side chain of lysine, which may be conjugated to the side chain of glutamine in cCPP. The EP may be conjugated to the 5 'or 3' end of the AC. The EP may be coupled to a linker. The exocyclic peptide may be conjugated to the amino group of the linker. The EP may be coupled to the linker via the C-terminal ends of EP and cCPP via cCPP and/or a side chain on the EP. For example, an EP may comprise a terminal lysine, which may then be coupled to a glutamine-containing cCPP via an amide linkage. When EP contains a terminal lysine and the side chain of lysine is available for attachment cCPP, the C-terminus or N-terminus may be attached to a linker on AC.

Cyclic exopeptides

The Exocyclic Peptide (EP) may comprise 2 to 10 amino acid residues, for example 2, 3, 4, 5, 6, 7, 8, 9 or 10 amino acid residues, including all ranges and values therebetween. An EP may comprise 6 to 9 amino acid residues. An EP may comprise 4 to 8 amino acid residues.

Each amino acid in the exocyclic peptide may be a natural or unnatural amino acid. The term "unnatural amino acid" refers to an organic compound that is a homolog of a natural amino acid in that it has a structure similar to that of a natural amino acid, thereby mimicking the structure and reactivity of the natural amino acid. The unnatural amino acid can be a modified amino acid and/or amino acid analog that is not one of the 20 common naturally occurring amino acids, nor the rare natural amino acid selenocysteine or pyrrolysine. The unnatural amino acid can also be a D-isomer of the natural amino acid. Examples of suitable amino acids include, but are not limited to, alanine, alloleucine (allosoleucine), arginine, citrulline, asparagine, aspartic acid, cysteine, glutamine, glutamic acid, glycine, histidine, isoleucine, leucine, lysine, methionine, naphthylalanine, phenylalanine, proline, pyroglutamic acid, serine, threonine, tryptophan, tyrosine, valine, derivatives thereof, or combinations thereof. These and other amino acids are listed in table 1 along with their abbreviations used herein. For example, the amino acid may be A, G, P, K, R, V, F, H, nal or citrulline.

The EP may comprise at least one positively charged amino acid residue, e.g. at least one lysine residue and/or at least one amino acid residue comprising a side chain comprising a guanidine group or a protonated form thereof. An EP may comprise 1 or 2 amino acid residues comprising a side chain comprising a guanidine group or a protonated form thereof. The amino acid residue comprising a guanidine-containing side chain may be an arginine residue. Protonated form may mean salts thereof throughout the disclosure.

The EP may comprise at least two, at least three or at least four or more lysine residues. An EP may comprise 2, 3 or 4 lysine residues. The amino group on the side chain of each lysine residue may be substituted with a protecting group including, for example, a trifluoroacetyl (-COCF ₃), allyloxycarbonyl (Alloc), 1- (4, 4-dimethyl-2, 6-dioxocyclohexylidene) ethyl (Dde) or (4, 4-dimethyl-2, 6-dioxocyclohex-1-ylidene-3) -methylbutyl (ivDde) group. The amino group on the side chain of each lysine residue may be substituted with a trifluoroacetyl group (-COCF ₃). Protecting groups may be included to effect amide conjugation. The protecting group may be removed after conjugation of EP to cCPP.

An EP may comprise at least 2 amino acid residues with hydrophobic side chains. Amino acid residues having hydrophobic side chains may be selected from valine, proline, alanine, leucine, isoleucine and methionine. The amino acid residue having a hydrophobic side chain may be valine or proline.

The EP may comprise at least one positively charged amino acid residue, e.g. at least one lysine residue and/or at least one arginine residue. The EP may comprise at least two, at least three or at least four or more lysine residues and/or arginine residues.

EP's may comprise KK、KR、RR、HH、HK、HR、RH、KKK、KGK、KBK、KBR、KRK、KRR、RKK、RRR、KKH、KHK、HKK、HRR、HRH、HHR、HBH、HHH、HHHH、KHKK、KKHK、KKKH、KHKH、HKHK、KKKK、KKRK、KRKK、KRRK、RKKR、RRRR、KGKK、KKGK、HBHBH、HBKBH、RRRRR、KKKKK、KKKRK、RKKKK、KRKKK、KKRKK、KKKKR、KBKBK、RKKKKG、KRKKKG、KKRKKG、KKKKRG、RKKKKB、KRKKKB、KKRKKB、KKKKRB、KKKRKV、RRRRRR、HHHHHH、RHRHRH、HRHRHR、KRKRKR、RKRKRK、RBRBRB、KBKBKB、PKKKRKV、PGKKRKV、PKGKRKV、PKKGRKV、PKKKGKV、PKKKRGV or PKKKRKG, where B is beta-alanine. The amino acids in EP may have D or L stereochemistry.

The EP may comprise KK、KR、RR、KKK、KGK、KBK、KBR、KRK、KRR、RKK、RRR、KKKK、KKRK、KRKK、KRRK、RKKR、RRRR、KGKK、KKGK、KKKKK、KKKRK、KBKBK、KKKRKV、PKKKRKV、PGKKRKV、PKGKRKV、PKKGRKV、PKKKGKV、PKKKRGV or PKKKRKG. EP's may comprise PKKKRKV, RR, RRR, RHR, RBR, RBRBR, RBHBR or HBRBH, where B is beta-alanine. The amino acids in EP may have D or L stereochemistry.

The EP may consist of KK、KR、RR、KKK、KGK、KBK、KBR、KRK、KRR、RKK、RRR、KKKK、KKRK、KRKK、KRRK、RKKR、RRRR、KGKK、KKGK、KKKKK、KKKRK、KBKBK、KKKRKV、PKKKRKV、PGKKRKV、PKGKRKV、PKKGRKV、PKKKGKV、PKKKRGV or PKKKRKG. EP may consist of PKKKRKV, RR, RRR, RHR, RBR, RBRBR, RBHBR or HBRBH, where B is beta-alanine. The amino acids in EP may have D or L stereochemistry.

An EP may comprise an amino acid sequence identified in the art as a Nuclear Localization Sequence (NLS). An EP may consist of an amino acid sequence identified in the art as a Nuclear Localization Sequence (NLS). The EP may comprise an NLS comprising the amino acid sequence PKKKRKV. EP may consist of NLS comprising the amino acid sequence PKKKRKV. The EP may comprise an NLS comprising an amino acid sequence selected from NLSKRPAAIKKAGQAKKKK、PAAKRVKLD、RQRRNELKRSF、RMRKFKNKGKDTAELRRRRVEVSVELR、KAKKDEQILKRRNV、VSRKRPRP、PPKKARED、PQPKKKPL、SALIKKKKKMAP、DRLRR、PKQKKRK、RKLKKKIKKL、REKKKFLKRR、KRKGDEVDGVDEVAKKKSKK and RKCLQAGMNLEARKTKK. EP may consist of NLS comprising an amino acid sequence selected from NLSKRPAAIKKAGQAKKKK、PAAKRVKLD、RQRRNELKRSF、RMRKFKNKGKDTAELRRRRVEVSVELR、KAKKDEQILKRRNV、VSRKRPRP、PPKKARED、PQPKKKPL、SALIKKKKKMAP、DRLRR、PKQKKRK、RKLKKKIKKL、REKKKFLKRR、KRKGDEVDGVDEVAKKKSKK and RKCLQAGMNLEARKTKK

All exocyclic sequences may also contain N-terminal acetyl groups. Thus, for example, an EP may have the following structure: ac-PKKKRKV.

Cell Penetrating Peptide (CPP)

The Cell Penetrating Peptide (CPP) may comprise from 6 to 20 amino acid residues. The cell penetrating peptide may be a cyclic cell penetrating peptide (cCPP). cCPP are capable of penetrating cell membranes. The Exocyclic Peptide (EP) may be conjugated to cCPP and the resulting construct may be referred to as an Endosomal Escape Vector (EEV). cCPP can direct AC to penetrate the cell membrane. cCPP can deliver AC to the cytoplasm of the cell. cCPP can deliver AC to a cell site where a target (e.g., pre-mRNA) is located. To conjugate cCPP to AC, at least one bond or lone pair of electrons on cCPP may be replaced.

The total number of amino acid residues in cCPP is in the range of 6 to 20 amino acid residues, e.g., 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 amino acid residues, including all ranges and subranges therebetween. cCPP may comprise from 6 to 13 amino acid residues. cCPP disclosed herein may comprise from 6 to 10 amino acids. By way of example, cCPP comprising 6-10 amino acid residues may have a structure according to any one of formulas I-a to I-E:

Wherein AA₁、AA₂、AA₃、AA₄、AA₅、AA₆、AA₇、AA₈、AA₉ and AA ₁₀ are amino acid residues.

CCPP may comprise from 6 to 8 amino acids. cCPP may comprise 8 amino acids.

CCPP may be natural or unnatural amino acids. The term "unnatural amino acid" refers to an organic compound that is a homolog of a natural amino acid in that it has a structure similar to that of a natural amino acid, thereby mimicking the structure and reactivity of the natural amino acid. The unnatural amino acid can be a modified amino acid and/or amino acid analog that is not one of the 20 common naturally occurring amino acids, nor the rare natural amino acid selenocysteine or pyrrolysine. The unnatural amino acid can also be a D-isomer of the natural amino acid. Examples of suitable amino acids include, but are not limited to, alanine, alloleucine, arginine, citrulline, asparagine, aspartic acid, cysteine, glutamine, glutamic acid, glycine, histidine, isoleucine, leucine, lysine, methionine, naphthylalanine, phenylalanine, proline, pyroglutamic acid, serine, threonine, tryptophan, tyrosine, valine, derivatives thereof, or combinations thereof. These and other amino acids are listed in table 1 along with their abbreviations used herein.

TABLE 1 amino acid abbreviations

Amino acids	Abbreviation L-amino acid	Abbreviation D-amino acid

Alanine (Ala)	Ala(A)	ala(a)
			Alloisoleucine (ALL)	Aile	Aile
Arginine (Arg)	Arg(R)	arg(r)
			Asparagine derivatives	Asn(N)	asn(n)
Aspartic acid	Asp(D)	asp(d)
			Cysteine (S)	Cys(C)	cys(c)
Citrulline	Cit	Cit
			Cyclohexylalanine	Cha	cha
2, 3-Diaminopropionic acid	Dap	dap
			4-Fluorophenylalanine	Fpa(∑)	pfa
Glutamic acid	Glu(E)	glu(e)
			Glutamine	Gln(Q)	gln(q)
Glycine (Gly)	Gly(G)	gly(g)
			Histidine	His(H)	his(h)
High proline (also known as pipecolic acid)	Pip(Θ)	pip(θ)
			Isoleucine (Ile)	Ile(I)	ile(i)
Leucine (leucine)	Leu(L)	leu(l)
			Lysine	Lys(K)	lys(k)
Methionine	Met(M)	met(m)
			3- (2-Naphthyl) -alanine	Nal(Ф)	nal(φ)
3- (1-Naphthyl) -alanine	1-Nal	1-nal
			Norleucine (N-leucine)	Nle(Ω)	nle
Phenylalanine (Phe)	Phe(F)	phe(f)
			Phenylglycine	Phg(Ψ)	phg
4- (Phosphonodifluoromethyl) phenylalanine	F₂Pmp(Λ)	f₂pmp
			Proline (proline)	Pro(P)	pro(p)
Sarcosine	Sar(Ξ)	sar
			Selenocysteine	Sec(U)	sec(u)
Serine (serine)	Ser(S)	ser(s)
			Threonine (Thr)	Thr(T)	thr(y)
Tyrosine	Tyr(Y)	tyr(y)
			Tryptophan	Trp(W)	trp(w)
Valine (valine)	Val(V)	val(v)
			T-butyl-alanine	Tle	tle
Penicillamine	Pen	Pen
			Homoarginine	HomoArg	homoarg
Nicotinyl-lysine	Lys(NIC)	lys(NIC)
			Trifluoroacetyl-lysine	Lys(TFA)	lys(TFA)
Methyl-leucine	MeLeu	meLeu
			3- (3-Benzothienyl) -alanine	Bta	bta

* Single letter abbreviations: when shown in uppercase letters herein, it means an L-amino acid form, and when shown in lowercase letters herein, it means a D-amino acid form.

CCPP may comprise from 4 to 20 amino acids, wherein: (i) At least one amino acid has a side chain comprising a guanidino group or a protonated form thereof; (ii) At least one amino acid having no side chain or having a chain comprising Or a side chain of a protonated form thereof; and (iii) at least two amino acids independently have side chains comprising aromatic or heteroaromatic groups.

At least two amino acids may have no side chains or have a chain comprising Or a side chain of a protonated form thereof. As used herein, when a side chain is not present, the amino acid has two hydrogen atoms (e.g., -CH ₂ -) on the carbon atoms linking the amine and the carboxylic acid.

The amino acid without a side chain may be glycine or β -alanine.

CCPP may comprise 6 to 20 amino acid residues that form cCPP, wherein: (i) At least one amino acid may be glycine, beta-alanine or 4-aminobutyric acid residues; (ii) At least one amino acid may have a side chain comprising an aryl or heteroaryl group; and (iii) at least one amino acid has a guanidine group, Or a side chain of a protonated form thereof.

CCPP may comprise 6 to 20 amino acid residues that form cCPP, wherein: (i) At least two amino acids may independently be glycine, beta-alanine or 4-aminobutyric acid residues; (ii) At least one amino acid may have a side chain comprising an aryl or heteroaryl group; and (iii) at least one amino acid has a guanidine group, Or a side chain of a protonated form thereof.

CCPP may comprise 6 to 20 amino acid residues that form cCPP, wherein: (i) At least three amino acids may independently be glycine, beta-alanine or 4-aminobutyric acid residues; (ii) At least one amino acid may have a side chain comprising an aromatic or heteroaromatic group; and (iii) at least one amino acid may have a guanidine group, Or a side chain of a protonated form thereof.

Glycine and related amino acid residues

CCPP may comprise (i) 1,2,3,4, 5, or 6 glycine, β -alanine, 4-aminobutyric acid residues, or a combination thereof. cCPP may comprise (i) 2 glycine, β -alanine, 4-aminobutyric acid residues, or a combination thereof. cCPP may comprise (i) 3 glycine, β -alanine, 4-aminobutyric acid residues, or a combination thereof. cCPP may comprise (i) 4 glycine, β -alanine, 4-aminobutyric acid residues, or a combination thereof. cCPP may comprise (i) 5 glycine, β -alanine, 4-aminobutyric acid residues, or a combination thereof. cCPP may comprise (i) 6 glycine, β -alanine, 4-aminobutyric acid residues, or a combination thereof. cCPP may comprise (i) 3,4, or 5 glycine, beta-alanine, 4-aminobutyric acid residues, or a combination thereof. cCPP may comprise (i) 3 or 4 glycine, beta-alanine, 4-aminobutyric acid residues, or a combination thereof.

CCPP may comprise (i) 1, 2, 3, 4, 5 or 6 glycine residues. cCPP may comprise (i) 2 glycine residues. cCPP may comprise (i) 3 glycine residues. cCPP may comprise (i) 4 glycine residues. cCPP may comprise (i) 5 glycine residues. cCPP may comprise (i) 6 glycine residues. cCPP may comprise (i) 3, 4 or 5 glycine residues. cCPP may comprise (i) 3 or 4 glycine residues. cCPP may comprise (i) 2 or 3 glycine residues. cCPP may comprise (i) 1 or 2 glycine residues.

CCPP may comprise (i) 3, 4,5, or 6 glycine, β -alanine, 4-aminobutyric acid residues, or a combination thereof. cCPP may comprise (i) 3 glycine, β -alanine, 4-aminobutyric acid residues, or a combination thereof. cCPP may comprise (i) 4 glycine, β -alanine, 4-aminobutyric acid residues, or a combination thereof. cCPP may comprise (i) 5 glycine, β -alanine, 4-aminobutyric acid residues, or a combination thereof. cCPP may comprise (i) 6 glycine, β -alanine, 4-aminobutyric acid residues, or a combination thereof. cCPP may comprise (i) 3, 4, or 5 glycine, beta-alanine, 4-aminobutyric acid residues, or a combination thereof. cCPP may comprise (i) 3 or 4 glycine, beta-alanine, 4-aminobutyric acid residues, or a combination thereof.

CCPP may comprise at least three glycine residues. cCPP may comprise (i) 3, 4,5 or 6 glycine residues. cCPP may comprise (i) 3 glycine residues. cCPP may comprise (i) 4 glycine residues. cCPP may comprise (i) 5 glycine residues. cCPP may comprise (i) 6 glycine residues. cCPP may comprise (i) 3, 4 or 5 glycine residues. cCPP can comprise (i) 3 or 4 glycine residues

In embodiments, none of the glycine, β -alanine, or 4-aminobutyric acid residues in cCPP are contiguous. Two or three glycine, beta-alanine or 4-aminobutyric acid residues may be contiguous. The two glycine, β -alanine or 4-aminobutyric acid residues may be contiguous.

In embodiments, none of the glycine residues cCPP are contiguous. Each glycine residue in cCPP may be separated by an amino acid residue that is not glycine. Two or three glycine residues may be contiguous. The two glycine residues may be contiguous.

Amino acid side chains with aromatic or heteroaromatic groups

CCPP may comprise (ii) 2,3,4,5 or 6 amino acid residues independently having a side chain comprising an aromatic or heteroaromatic group. cCPP may comprise (ii) 2 amino acid residues independently having a side chain comprising an aromatic or heteroaromatic group. cCPP may comprise (ii) 3 amino acid residues independently having a side chain comprising an aromatic or heteroaromatic group. cCPP may comprise (ii) 4 amino acid residues independently having a side chain comprising an aromatic or heteroaromatic group. cCPP may comprise (ii) 5 amino acid residues independently having a side chain comprising an aromatic or heteroaromatic group. cCPP may comprise (ii) 6 amino acid residues independently having a side chain comprising an aromatic or heteroaromatic group. cCPP may comprise (ii) 2,3 or 4 amino acid residues independently having a side chain comprising an aromatic or heteroaromatic group. cCPP may comprise (ii) 2 or 3 amino acid residues independently having a side chain comprising an aromatic or heteroaromatic group.

CCPP may comprise (ii) 2, 3, 4, 5 or 6 amino acid residues independently having a side chain comprising an aromatic group. cCPP may comprise (ii) 2 amino acid residues independently having a side chain comprising an aromatic group. cCPP may comprise (ii) 3 amino acid residues independently having a side chain comprising an aromatic group. cCPP may comprise (ii) 4 amino acid residues independently having a side chain comprising an aromatic group. cCPP may comprise (ii) 5 amino acid residues independently having a side chain comprising an aromatic group. cCPP may comprise (ii) 6 amino acid residues independently having a side chain comprising an aromatic group. cCPP may comprise (ii) 2, 3 or 4 amino acid residues independently having a side chain comprising an aromatic group. cCPP may comprise (ii) 2 or 3 amino acid residues independently having a side chain comprising an aromatic group.

The aromatic group may be a 6 to 14 membered aryl group. Aryl groups may be phenyl, naphthyl or anthracenyl, each of which is optionally substituted. Aryl groups may be phenyl or naphthyl, each of which is optionally substituted. The heteroaromatic group may be a 6 to 14 membered heteroaryl group having 1, 2 or 3 heteroatoms selected from N, O and S. Heteroaryl may be pyridinyl, quinolinyl or isoquinolinyl.

Amino acid residues having a side chain comprising an aromatic or heteroaromatic group may each independently be bis (Gao Naiji alanine), gao Naiji alanine, naphthylalanine, phenylglycine, bis (homophenylalanine), homophenylalanine, phenylalanine, tryptophan, 3- (3-benzothienyl) -alanine, 3- (2-quinolinyl) -alanine, O-benzylserine, 3- (4- (benzyloxy) phenyl) -alanine, S- (4-methylbenzyl) cysteine, N- (naphthalen-2-yl) glutamine, 3- (1, 1' -biphenyl-4-yl) -alanine, 3- (3-benzothienyl) -alanine or tyrosine, each of which is optionally substituted with one or more substituents. Amino acids having side chains comprising aromatic or heteroaromatic groups may each be independently selected from:

Wherein the H at the N-terminal and/or H at the C-terminal is replaced by a peptide bond.

The amino acid residues having a side chain comprising an aromatic or heteroaromatic group may each independently be a residue of phenylalanine, naphthylalanine, phenylglycine, homophenylalanine, gao Naiji alanine, bis (homophenylalanine), bis- (Gao Naiji alanine), tryptophan or tyrosine, each of which is optionally substituted with one or more substituents. The amino acid residues having a side chain containing an aromatic group may each independently be a residue of tyrosine, phenylalanine, 1-naphthylalanine, 2-naphthylalanine, tryptophan, 3-benzothienyl alanine, 4-phenylphenylalanine, 3, 4-difluorophenylalanine, 4-trifluoromethylphenylalanine, 2,3,4,5, 6-pentafluorophenylalanine, homophenylalanine, β -homophenylalanine, 4-tert-butyl-phenylalanine, 4-pyridylalanine, 3-pyridylalanine, 4-methylphenylalanine, 4-fluorophenylalanine, 4-chlorophenylalanine, 3- (9-anthryl) -alanine. The amino acid residues having a side chain comprising an aromatic group may each independently be a residue of phenylalanine, naphthylalanine, phenylglycine, homophenylalanine or homonaphthylalanine, each of which is optionally substituted with one or more substituents. The amino acid residues having a side chain comprising an aromatic group may each independently be a residue of phenylalanine, naphthylalanine, homophenylalanine, gao Naiji alanine, bis (Gao Naiji alanine) or bis (Gao Naiji alanine), each of which is optionally substituted with one or more substituents. Amino acid residues having a side chain comprising an aromatic group may each independently be residues of phenylalanine or naphthylalanine, each of which is optionally substituted with one or more substituents. At least one amino acid residue having a side chain comprising an aromatic group may be a residue of phenylalanine. The at least two amino acid residues having a side chain comprising an aromatic group may be residues of phenylalanine. Each amino acid residue having a side chain comprising an aromatic group may be a residue of phenylalanine.

In embodiments, none of the amino acids having side chains comprising aromatic or heteroaromatic groups are contiguous. The two amino acids having side chains comprising aromatic or heteroaromatic groups may be contiguous. The two contiguous amino acids may have opposite stereochemistry. Two contiguous amino acids may have the same stereochemistry. Three amino acids having side chains containing aromatic or heteroaromatic groups may be contiguous. The three contiguous amino acids may have the same stereochemistry. The three contiguous amino acids may have alternating stereochemistry.

The amino acid residue comprising an aromatic or heteroaromatic group may be an L-amino acid. The amino acid residue comprising an aromatic or heteroaromatic group may be a D-amino acid. The amino acid residue comprising an aromatic or heteroaromatic group may be a mixture of D-amino acids and L-amino acids.

The optional substituents may be any atom or group that does not significantly reduce (e.g., more than 50%) the cytoplasmic delivery efficiency of cCPP, e.g., as compared to an otherwise identical sequence without the substituents. The optional substituents may be hydrophobic or hydrophilic substituents. The optional substituents may be hydrophobic substituents. Substituents may increase the solvent accessible surface area (as defined herein) of the hydrophobic amino acid. The substituents may be halogen, alkyl, alkenyl, alkynyl, cycloalkyl, cycloalkenyl, cycloalkynyl, heterocyclyl, aryl, heteroaryl, alkoxy, aryloxy, acyl, alkylcarbamoyl, alkylcarboxamido (alkylcarboxamidyl), alkoxycarbonyl, alkylthio or arylthio. The substituent may be halogen.

While not wishing to be bound by theory, it is believed that amino acids having aromatic or heteroaromatic groups with higher hydrophobicity values (i.e., amino acids having side chains comprising aromatic or heteroaromatic groups) can improve the cytoplasmic delivery efficiency of cCPP relative to amino acids having lower hydrophobicity values. Each hydrophobic amino acid may independently have a hydrophobicity value that is greater than glycine. Each hydrophobic amino acid can independently be a hydrophobic amino acid having a hydrophobicity value greater than alanine. Each hydrophobic amino acid independently can have a hydrophobicity value greater than or equal to phenylalanine. Hydrophobicity can be measured using a hydrophobicity scale known in the art. Table 2 lists the hydrophobicity values for the various amino acids reported by Eisenberg and Weiss (Proc. Natl. A cad. U.S. A.1984;81 (1): 140-144), engleman et al (Ann. Rev. Of B iophys. Biophys. Chem.1986;1986 (15): 321-53), kyte and Doolittle (J. Mo l. Biol.1982;157 (1): 105-132), hoop and Woods (Proc. Natl. Acad. Sci. U.S. A.1981;78 (6): 3824-3828) and Janin (Nature.1979; 277 (5696): 491-492), each of which is incorporated herein by reference in its entirety. Hydrophobicity can be measured using the hydrophobicity scale reported by Engleman et al.

TABLE 2 amino acid hydrophobicity

The size of the aromatic or heteroaromatic groups may be selected to improve the cytoplasmic delivery efficiency of cCPP. While not wishing to be bound by theory, it is believed that larger aromatic or heteroaromatic groups on the amino acid side chains may improve cytoplasmic delivery efficiency compared to otherwise identical sequences with smaller hydrophobic amino acids. The size of the hydrophobic amino acid may be measured in terms of the molecular weight of the hydrophobic amino acid, the steric effect of the hydrophobic amino acid, the solvent accessible surface area of the side chain (SASA), or a combination thereof. The size of the hydrophobic amino acid can be measured in terms of the molecular weight of the hydrophobic amino acid, and the larger hydrophobic amino acid has side chains with a molecular weight of at least about 90g/mol, or at least about 130g/mol, or at least about 141 g/mol. The size of an amino acid can be measured in terms of the SASA of the hydrophobic side chain. The hydrophobic amino acid may have a side chain with SASA greater than or equal to alanine, or greater than or equal to glycine. The larger hydrophobic amino acid may have a side chain with a SASA greater than alanine or greater than glycine. The hydrophobic amino acid may have an aromatic or heteroaromatic group with a SASA greater than or equal to about piperidine-2-carboxylic acid, greater than or equal to about tryptophan, greater than or equal to about phenylalanine, or greater than or equal to about naphthylalanine. The first hydrophobic amino acid (AA _H1) may have a SASA of at least aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutOr at least aboutIs a side chain of (c). The second hydrophobic amino acid (AA _H2) may have a SASA of at least aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutOr at least aboutIs a side chain of (c). The side chains of AA _H1 and AA _H2 may have at least aboutAt least aboutAt least aboutAt least about At least aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutGreater than aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutAt least aboutGreater than aboutAt least aboutAt least aboutAt least aboutAt least aboutOr at least aboutIs a combination of SASA. AA _H2 can be a hydrophobic amino acid residue whose side chain SASA is less than or equal to the SASA of the hydrophobic side chain of AA _H1. By way of example and not limitation, cCPP having a Nal-Arg motif may exhibit improved cytoplasmic delivery efficiency compared to cCPP which is otherwise identical to that having a Phe-Arg motif; cCPP having a Phe-Nal-Arg motif may exhibit improved cytoplasmic delivery efficiency compared to otherwise identical cCPP having a Nal-Phe-Arg motif; and the Phe-Nal-Arg motif may exhibit improved cytoplasmic delivery efficiency compared to otherwise identical cCPP with the Nal-Phe-Arg motif.

As used herein, "hydrophobic surface area" or "SASA" refers to the solvent accessible surface area of an amino acid side chain (reported in square angstroms;). SASA can be calculated using the 'rolling ball' algorithm developed by Shrake & Rupley (J Mol biol.79 (2): 351-71), which is incorporated herein by reference in its entirety for all purposes. This algorithm uses solvent "spheres" of specific radius to probe the molecular surface. Typical values for spheres are Which approximates the radius of a water molecule.

The SASA values for some of the side chains are shown in table 3 below. The SASA values described herein are based on the theoretical values set forth in Table 3 below, as reported by Tien et al (PLOS ONE (11): e80635.https:// doi.org/10.1371/journ.fine.fine.0080635), which is incorporated herein by reference in its entirety for all purposes.

TABLE 3 amino acid SASA values

Residues	Theoretical value	Empirical values	Miller et al (1987)	Rose et al (1985)
					Alanine (Ala)	129.0	121.0	113.0	118.1
Arginine (Arg)	274.0	265.0	241.0	256.0
					Asparagine derivatives	195.0	187.0	158.0	165.5
Aspartic acid	193.0	187.0	151.0	158.7
					Cysteine (S)	167.0	148.0	140.0	146.1
Glutamic acid	223.0	214.0	183.0	186.2
					Glutamine	225.0	214.0	189.0	193.2
Glycine (Gly)	104.0	97.0	85.0	88.1
					Histidine	224.0	216.0	194.0	202.5
Isoleucine (Ile)	197.0	195.0	182.0	181.0
					Leucine (leucine)	201.0	191.0	180.0	193.1
Lysine	236.0	230.0	211.0	225.8
					Methionine	224.0	203.0	204.0	203.4
Phenylalanine (Phe)	240.0	228.0	218.0	222.8
					Proline (proline)	159.0	154.0	143.0	146.8
Serine (serine)	155.0	143.0	122.0	129.8
					Threonine (Thr)	172.0	163.0	146.0	152.5
Tryptophan	285.0	264.0	259.0	266.3
					Tyrosine	263.0	255.0	229.0	236.8
Valine (valine)	174.0	165.0	160.0	164.5

Amino acid residues having a side chain comprising a guanidino group, a guanidine replacement group or a protonated form thereof

Guanidine, as used herein, refers to the following structure:

as used herein, the protonated form of guanidine refers to the following structure:

Guanidine replacement group refers to a functional group on the side chain of an amino acid that will be positively charged at or above physiological pH, or that reproduces the hydrogen bonding donating and accepting activity of a guanidinium (guanidinium) group.

The guanidine replacement group facilitates cell permeation and delivery of therapeutic agents while reducing toxicity associated with the guanidine group or protonated form thereof. cCPP may comprise at least one amino acid having a side chain comprising a guanidine or guanidinium substitution group. cCPP may comprise at least two amino acids having side chains comprising guanidine or guanidinium substitution groups. cCPP can comprise at least three amino acids having side chains comprising guanidine or guanidinium substitution groups

The guanidine or guanidinium group can be an isostere of guanidine or guanidinium. The guanidine or guanidinium replacement group can be less basic than guanidine.

As used herein, guanidine replacement group refers to Or a protonated form thereof.

The present disclosure relates to cCPP comprising 4 to 20 amino acid residues, wherein: (i) At least one amino acid has a side chain comprising a guanidino group or a protonated form thereof; (ii) At least one amino acid residue having no side chain or having a chain comprising Or a side chain of a protonated form thereof; and (iii) at least two amino acid residues independently have a side chain comprising an aromatic or heteroaromatic group.

At least two amino acid residues may have no side chains or have a chain comprising Or a side chain of a protonated form thereof. As used herein, an amino acid residue has two hydrogen atoms (e.g., -CH ₂ -) on the carbon atom connecting the amine and the carboxylic acid when no side chains are present.

CCPP may comprise at least one amino acid having a side chain comprising one of the following moieties: Or a protonated form thereof.

CCPP may comprise at least two amino acids, each independently having one of the following moieties: or a protonated form thereof. At least two amino acids may have side chains comprising the same moiety selected from the group consisting of: Or a protonated form thereof. At least one amino acid may have a polypeptide comprising Or a side chain of a protonated form thereof. At least two amino acids may have a peptide comprisingOr a side chain of a protonated form thereof. One, two, three or four amino acids may have a sequence comprisingOr a side chain of a protonated form thereof. One amino acid may have a polypeptide comprisingOr a side chain of a protonated form thereof. The two amino acids may have a sequence comprisingOr a side chain of a protonated form thereof. Or a protonated form thereof may be attached to the end of the amino acid side chain.May be attached to the end of the amino acid side chain.

CCPP may comprise (iii) 2, 3, 4, 5 or 6 amino acid residues independently having a side chain comprising a guanidino group, a guanidine replacement group or a protonated form thereof. cCPP may comprise (iii) 2 amino acid residues independently having a side chain comprising a guanidino group, a guanidine replacement group or a protonated form thereof. cCPP may comprise (iii) 3 amino acid residues independently having a side chain comprising a guanidino group, a guanidine replacement group or a protonated form thereof. cCPP may comprise (iii) 4 amino acid residues independently having a side chain comprising a guanidino group, a guanidine replacement group or a protonated form thereof. cCPP may comprise (iii) 5 amino acid residues independently having a side chain comprising a guanidino group, a guanidine replacement group or a protonated form thereof. cCPP may comprise (iii) 6 amino acid residues independently having a side chain comprising a guanidino group, a guanidine replacement group or a protonated form thereof. cCPP may comprise (iii) 2, 3, 4, or 5 amino acid residues independently having a side chain comprising a guanidino group, a guanidine replacement group, or a protonated form thereof. cCPP may comprise (iii) 2, 3 or 4 amino acid residues independently having a side chain comprising a guanidino group, a guanidine replacement group or a protonated form thereof. cCPP may comprise (iii) 2 or 3 amino acid residues independently having a side chain comprising a guanidino group, a guanidine replacement group or a protonated form thereof. cCPP may comprise (iii) at least one amino acid residue having a side chain comprising a guanidino group or a protonated form thereof. cCPP may comprise (iii) two amino acid residues having a side chain comprising a guanidino group or a protonated form thereof. cCPP may comprise (iii) three amino acid residues having a side chain comprising a guanidino group or a protonated form thereof.

The amino acid residues may independently have side chains comprising non-contiguous guanidine groups, guanidine replacement groups, or protonated forms thereof. The two amino acid residues may independently have side chains comprising guanidine groups, guanidine replacement groups, or protonated forms thereof, which may be contiguous. The three amino acid residues may independently have side chains comprising guanidine groups, guanidine replacement groups, or protonated forms thereof, which may be contiguous. The four amino acid residues may independently have side chains comprising guanidine groups, guanidine replacement groups, or protonated forms thereof, which may be contiguous. Contiguous amino acid residues may have the same stereochemistry. Contiguous amino acids may have alternating stereochemistry.

The amino acid residue independently having a side chain comprising a guanidino group, a guanidine replacement group or a protonated form thereof may be an L-amino acid. The amino acid residue independently having a side chain comprising a guanidino group, a guanidine replacement group or a protonated form thereof may be a D-amino acid. The amino acid residues independently having a side chain comprising a guanidino group, a guanidine replacement group or a protonated form thereof may be an L-amino acid or a mixture of D-amino acids.

Each amino acid residue having a side chain comprising a guanidino group or a protonated form thereof may independently be an arginine, homoarginine, 2-amino-3-propionic acid, 2-amino-4-guanidino butyric acid, or residue of a protonated form thereof. Each amino acid residue having a side chain comprising a guanidine group or a protonated form thereof can independently be an arginine residue or a residue of a protonated form thereof.

Each amino acid having a side chain comprising a guanidine replacement group or protonated form thereof can independently be Or a protonated form thereof.

Without being bound by theory, it is hypothesized that the guanidine replacement group has a reduced basicity relative to arginine, and in some cases is uncharged at physiological pH (e.g., -N (H) C (O)), and is capable of sustaining bidentate hydrogen bond interactions with phospholipids on the plasma membrane, which is believed to promote efficient membrane binding and subsequent internalization. Removal of the positive charge is also believed to reduce cCPP's toxicity.

Those skilled in the art will appreciate that the N-terminus and/or C-terminus of the above-described unnatural aromatic hydrophobic amino acids form amide linkages upon incorporation into the peptides disclosed herein.

CCPP may comprise a first amino acid having a side chain comprising an aromatic or heteroaromatic group and a second amino acid having a side chain comprising an aromatic or heteroaromatic group, wherein the N-terminus of the first glycine forms a peptide bond with the first amino acid having a side chain comprising an aromatic or heteroaromatic group and the C-terminus of the first glycine forms a peptide bond with the second amino acid having a side chain comprising an aromatic or heteroaromatic group. Although the term "first amino acid" generally refers to the N-terminal amino acid of a peptide sequence by convention, as used herein, a "first amino acid" is used to distinguish the referred amino acid from another amino acid (e.g., "second amino acid") in cCPP, such that the term "first amino acid" may refer to or may refer to an amino acid located at the N-terminal end of a peptide sequence.

CCPP may comprise: the N-terminus of the second glycine forms a peptide bond with an amino acid having a side chain comprising an aromatic or heteroaromatic group, and the C-terminus of the second glycine forms a peptide bond with an amino acid having a side chain comprising a guanidino group or protonated form thereof.

CCPP may comprise a first amino acid having a side chain comprising a guanidino group or a protonated form thereof, and a second amino acid having a side chain comprising a guanidino group or a protonated form thereof, wherein the N-terminus of the third glycine forms a peptide bond with the first amino acid having a side chain comprising a guanidino group or a protonated form thereof, and the C-terminus of the third glycine forms a peptide bond with the second amino acid having a side chain comprising a guanidino group or a protonated form thereof.

CCPP may comprise residues of asparagine, aspartic acid, glutamine, glutamic acid or homoglutamine. cCPP may comprise residues of asparagine. cCPP may comprise residues of glutamine.

CCPP may comprise residues of tyrosine, phenylalanine, 1-naphthylalanine, 2-naphthylalanine, tryptophan, 3-benzothienylalanine, 4-phenylphenylalanine, 3, 4-difluorophenylalanine, 4-trifluoromethylphenylalanine, 2,3,4,5, 6-pentafluorophenylalanine, homophenylalanine, β -homophenylalanine, 4-tert-butyl-phenylalanine, 4-pyridylalanine, 3-pyridylalanine, 4-methylphenylalanine, 4-fluorophenylalanine, 4-chlorophenylalanine, 3- (9-anthracenyl) -alanine.

While not wishing to be bound by theory, it is believed that the chirality of the amino acids in cCPP can affect cytoplasmic uptake efficiency. cCPP may comprise at least one D amino acid. cCPP may comprise one to fifteen D amino acids. cCPP may comprise one to ten D amino acids. cCPP may comprise 1, 2, 3 or 4D amino acids. cCPP may comprise 2, 3, 4, 5, 6, 7 or 8 contiguous amino acids with alternating D and L chiralities. cCPP may comprise three contiguous amino acids with the same chirality. cCPP may comprise two contiguous amino acids having the same chirality. At least two amino acids may have opposite chiralities. At least two amino acids having opposite chiralities may be adjacent to each other. At least three amino acids may have alternating stereochemistry with respect to each other. At least three amino acids having alternating chiralities relative to each other may be adjacent to each other. At least four amino acids have alternating stereochemistry with respect to each other. At least four amino acids having alternating chiralities relative to each other may be adjacent to each other. At least two amino acids may have the same chirality. At least two amino acids having the same chirality may be adjacent to each other. At least two amino acids have the same chirality and at least two amino acids have opposite chiralities. At least two amino acids having opposite chiralities may be adjacent to at least two amino acids having the same chirality. Thus, adjacent amino acids in cCPP may have any of the following sequences: D-L; L-D; D-L-L-D; L-D-D-L; L-D-L-L-D; D-L-D-D-L; D-L-L-D-L; or L-D-D-L-D. The amino acid residues forming cCPP may all be L-amino acids. The amino acid residues forming cCPP may all be D-amino acids.

At least two amino acids may have different chiralities. At least two amino acids having different chiralities may be adjacent to each other. At least three amino acids may have different chiralities relative to adjacent amino acids. At least four amino acids may have different chiralities relative to adjacent amino acids. At least two amino acids have the same chirality and at least two amino acids have different chiralities. One or more of the amino acid residues forming cCPP may be achiral. cCPP may comprise a3, 4 or 5 amino acid motif, wherein two amino acids having the same chirality may be separated by an achiral amino acid. cCPP may comprise the following sequence: D-X-D; D-X-D-X; D-X-D-X-D; L-X-L; L-X-L-X; or L-X-L-X-L, wherein X is an achiral amino acid. The achiral amino acid may be glycine.

An amino acid having a side chain comprising:

or a protonated form thereof, may be adjacent to an amino acid having a side chain comprising an aromatic or heteroaromatic group. An amino acid having a side chain comprising: Or a protonated form thereof, may be adjacent to at least one amino acid having a side chain comprising guanidine, or a protonated form thereof. An amino acid having a side chain comprising guanidine or a protonated form thereof can be adjacent to an amino acid having a side chain comprising an aromatic or heteroaromatic group. Two amino acids having side chains comprising: Or protonated forms thereof, may be adjacent to each other. Two amino acids having side chains comprising guanidine or a protonated form thereof are adjacent to each other. cCPP may comprise at least two contiguous amino acids having a side chain which may comprise an aromatic or heteroaromatic group, and at least two non-contiguous amino acids having a side chain comprising: or a protonated form thereof. cCPP can comprise at least two contiguous amino acids having a side chain comprising an aromatic or heteroaromatic group and at least two amino acids having a chain comprising Or a side chain of a protonated form thereof. Adjacent amino acids may have the same chirality. Adjacent amino acids may have opposite chirality. Other combinations of amino acids may have any arrangement of D and L amino acids, e.g., any of the sequences described in the preceding paragraphs.

At least two amino acids having side chains comprising:

Or a protonated form thereof, with at least two amino acids having side chains comprising a guanidino group or a protonated form thereof.

CCPP may comprise the structure of formula (a):

Or a protonated form thereof,

Wherein:

R ₁、R₂ and R ₃ are each independently H or an aromatic or heteroaromatic side chain of an amino acid;

at least one of R ₁、R₂ and R ₃ is an aromatic or heteroaromatic side chain of an amino acid;

R ₄、R₅、R₆、R₇ is independently H or an amino acid side chain;

At least one of R ₄、R₅、R₆、R₇ is a side chain of 3-guanidino-2-aminopropionic acid, 4-guanidino-2-aminobutyric acid, arginine, homoarginine, N-methylarginine, N, N-dimethylarginine, 2, 3-diaminopropionic acid, 2, 4-diaminobutyric acid, lysine, N-methyllysine, N, N-dimethyllysine, N-ethyllysine, N, N, N-trimethyllysine, 4-guanidinophenylalanine, citrulline, N, N-dimethyllysine, β -homoarginine, 3- (1-piperidinyl) alanine;

AA _SC is an amino acid side chain; and

Q is 1,2,3 or 4.

In embodiments, at least one of R ₄、R₅、R₆、R₇ is independently an uncharged non-aromatic side chain of an amino acid. In an embodiment, at least one of R ₄、R₅、R₆、R₇ is independently H or a side chain of citrulline.

In an embodiment, compounds are provided comprising a cyclic peptide having from 6 to 12 amino acids, wherein at least two amino acids of the cyclic peptide are charged amino acids, at least two amino acids of the cyclic peptide are aromatic hydrophobic amino acids, and at least two amino acids of the cyclic peptide are uncharged non-aromatic amino acids. In embodiments, the at least two charged amino acids of the cyclic peptide are arginine. In embodiments, the at least two aromatic hydrophobic amino acids of the cyclic peptide are phenylalanine, naphthalanine (3-naphthalen-2-yl-alanine), or a combination thereof. In embodiments, the at least two uncharged non-aromatic amino acids of the cyclic peptide are citrulline, glycine, or a combination thereof. In embodiments, the compound is a cyclic peptide having from 6 to 12 amino acids, wherein two amino acids of the cyclic peptide are arginine, at least two amino acids are aromatic hydrophobic amino acids selected from phenylalanine, naphthylalanine, and combinations thereof, and at least two amino acids are uncharged non-aromatic amino acids selected from citrulline, glycine, and combinations thereof.

In embodiments, the cyclic peptide of formula (a) is not a cyclic peptide having the sequence:

Wherein F is L-phenylalanine, F is D-phenylalanine, Φ is L-3- (2-naphthyl) -alanine, Φ is D-3- (2-naphthyl) -alanine, R is L-arginine, R is D-arginine, Q is L-glutamine, Q is D-glutamine, C is L-cysteine, U is L-selenocysteine, W is L-tryptophan, K is L-lysine, D is L-aspartic acid, and Ω is L-norleucine.

CCPP may comprise the structure of formula (I):

Or a protonated form thereof,

Wherein:

R ₁、R₂ and R ₃ may each independently be H or an amino acid residue having a side chain comprising an aromatic group;

r ₄ and R ₆ are independently H or an amino acid side chain;

AA _SC is an amino acid side chain;

q is 1, 2, 3 or 4; and

Each m is independently an integer of 0,1, 2 or 3.

R ₁, R2, and R ₃ may each independently be H, -alkylene-aryl, or-alkylene-heteroaryl. R ₁、R₂ and R ₃ may each independently be H, -C _1-3 alkylene-aryl or-C _1-3 alkylene-heteroaryl. R ₁、R₂ and R ₃ may each independently be H or-alkylene-aryl. R ₁、R₂ and R ₃ may each independently be H or-C _1-3 alkylene-aryl. The C _1-3 alkylene group may be methylene. The aryl group may be a6 to 14 membered aryl group. The heteroaryl group may be a6 to 14 membered heteroaryl group having one or more heteroatoms selected from N, O and S. The aryl group may be selected from phenyl, naphthyl or anthracenyl. Aryl may be phenyl or naphthyl. The aryl group may be phenyl. Heteroaryl groups may be pyridinyl, quinolinyl and isoquinolinyl. R ₁、R₂ and R ₃ may each independently be H, -C _1-3 alkylene-Ph or-C _1-3 alkylene-naphthyl. R ₁、R₂ and R ₃ may each independently be H, -CH ₂ Ph or-CH ₂ naphthyl. R ₁、R₂ and R ₃ may each independently be H or-CH ₂ Ph.

R ₁、R₂ and R ₃ may each independently be a side chain of tyrosine, phenylalanine, 1-naphthylalanine, 2-naphthylalanine, tryptophan, 3-benzothienylalanine, 4-phenylphenylalanine, 3, 4-difluorophenylalanine, 4-trifluoromethylphenylalanine, 2,3,4,5, 6-pentafluorophenylalanine, homophenylalanine, β -homophenylalanine, 4-tert-butyl-phenylalanine, 4-pyridylalanine, 3-pyridylalanine, 4-methylphenylalanine, 4-fluorophenylalanine, 4-chlorophenylalanine, 3- (9-anthracenyl) -alanine.

R ₁ can be a side chain of tyrosine. R ₁ can be the side chain of phenylalanine. R ₁ can be the side chain of 1-naphthylalanine. R ₁ can be the side chain of 2-naphthylalanine. R ₁ can be the side chain of tryptophan. R ₁ can be the side chain of 3-benzothiophenylalanine. R ₁ can be the side chain of 4-phenylphenylalanine. R ₁ can be the side chain of 3, 4-difluorophenylalanine. R ₁ can be the side chain of 4-trifluoromethylphenylalanine. R ₁ can be the side chain of 2,3,4,5, 6-pentafluorophenylalanine. R ₁ can be the side chain of homophenylalanine. R ₁ can be the side chain of beta-homophenylalanine. R ₁ may be the side chain of 4-tert-butyl-phenylalanine. R ₁ can be the side chain of 4-pyridylalanine. R ₁ can be the side chain of 3-pyridylalanine. R ₁ can be the side chain of 4-methylphenylalanine. R ₁ can be the side chain of 4-fluorophenylalanine. R ₁ can be the side chain of 4-chlorophenylalanine. R ₁ can be the side chain of 3- (9-anthryl) -alanine.

R ₂ can be a side chain of tyrosine. R ₂ can be the side chain of phenylalanine. R ₂ can be the side chain of 1-naphthylalanine. R ₁ can be the side chain of 2-naphthylalanine. R ₂ can be the side chain of tryptophan. R ₂ can be the side chain of 3-benzothiophenylalanine. R ₂ can be the side chain of 4-phenylphenylalanine. R ₂ can be the side chain of 3, 4-difluorophenylalanine. R ₂ can be the side chain of 4-trifluoromethylphenylalanine. R ₂ can be the side chain of 2,3,4,5, 6-pentafluorophenylalanine. R ₂ can be the side chain of homophenylalanine. R ₂ can be the side chain of beta-homophenylalanine. R ₂ may be the side chain of 4-tert-butyl-phenylalanine. _R 2 may be the side chain of 4-pyridylalanine. R ₂ can be the side chain of 3-pyridylalanine. R ₂ can be the side chain of 4-methylphenylalanine. R ₂ can be the side chain of 4-fluorophenylalanine. R ₂ can be the side chain of 4-chlorophenylalanine. R ₂ can be the side chain of 3- (9-anthryl) -alanine.

R ₃ can be a side chain of tyrosine. R ₃ can be the side chain of phenylalanine. R ₃ can be the side chain of 1-naphthylalanine. R ₃ can be the side chain of 2-naphthylalanine. R ₃ can be the side chain of tryptophan. R ₃ can be the side chain of 3-benzothiophenylalanine. R ₃ can be the side chain of 4-phenylphenylalanine. R ₃ can be the side chain of 3, 4-difluorophenylalanine. R ₃ can be the side chain of 4-trifluoromethylphenylalanine. R ₃ can be the side chain of 2,3,4,5, 6-pentafluorophenylalanine. R ₃ can be the side chain of homophenylalanine. R ₃ can be the side chain of beta-homophenylalanine. R ₃ may be the side chain of 4-tert-butyl-phenylalanine. R ₃ can be the side chain of 4-pyridylalanine. R ₃ can be the side chain of 3-pyridylalanine. R ₃ can be the side chain of 4-methylphenylalanine. R ₃ can be the side chain of 4-fluorophenylalanine. R ₃ can be the side chain of 4-chlorophenylalanine. R ₃ can be the side chain of 3- (9-anthryl) -alanine.

R ₄ can be H, -alkylene-aryl, -alkylene-heteroaryl. R ₄ can be H, -C _1-3 alkylene-aryl, or-C _1-3 alkylene-heteroaryl. r ₄ can be H or-alkylene-aryl. R ₄ can be H or-C _1-3 alkylene-aryl. The C _1-3 alkylene group may be methylene. The aryl group may be a 6 to 14 membered aryl group. The heteroaryl group may be a 6 to 14 membered heteroaryl group having one or more heteroatoms selected from N, O and S. The aryl group may be selected from phenyl, naphthyl or anthracenyl. Aryl may be phenyl or naphthyl. The aryl group may be phenyl. Heteroaryl groups may be pyridinyl, quinolinyl and isoquinolinyl. R ₄ can be H, -C _1-3 alkylene-Ph or-C _1-3 alkylene-naphthyl. R ₄ can be H or the side chain of an amino acid in Table 1 or Table 3. R ₄ can be H or an amino acid residue having a side chain comprising an aromatic group. R ₄ can be H, -CH ₂ Ph or-CH ₂ naphthyl. R ₄ can be H or-CH ₂ Ph.

R ₅ can be H, -alkylene-aryl, -alkylene-heteroaryl. R ₅ can be H, -C _1-3 alkylene-aryl, or-C _1-3 alkylene-heteroaryl. R ₅ can be H or-alkylene-aryl. R ₅ can be H or-C _1-3 alkylene-aryl. The C _1-3 alkylene group may be methylene. The aryl group may be a 6 to 14 membered aryl group. The heteroaryl group may be a 6 to 14 membered heteroaryl group having one or more heteroatoms selected from N, O and S. The aryl group may be selected from phenyl, naphthyl or anthracenyl. Aryl may be phenyl or naphthyl. The aryl group may be phenyl. Heteroaryl groups may be pyridinyl, quinolinyl and isoquinolinyl. R ₅ can be H, -C _1-3 alkylene-Ph or-C _1-3 alkylene-naphthyl. R ₅ can be H or the side chain of an amino acid in Table 1 or Table 3. R ₄ can be H or an amino acid residue having a side chain comprising an aromatic group. R ₅ can be H, -CH ₂ Ph or-CH ₂ naphthyl. R ₄ can be H or-CH ₂ Ph.

R ₆ can be H, -alkylene-aryl, -alkylene-heteroaryl. R ₆ can be H, -C _1-3 alkylene-aryl, or-C _1-3 alkylene-heteroaryl. R ₆ can be H or-alkylene-aryl. R ₆ can be H or-C _1-3 alkylene-aryl. The C _1-3 alkylene group may be methylene. The aryl group may be a 6 to 14 membered aryl group. The heteroaryl group may be a 6 to 14 membered heteroaryl group having one or more heteroatoms selected from N, O and S. The aryl group may be selected from phenyl, naphthyl or anthracenyl. Aryl may be phenyl or naphthyl. The aryl group may be phenyl. Heteroaryl groups may be pyridinyl, quinolinyl and isoquinolinyl. R ₆ can be H, -C _1-3 alkylene-Ph or-C _1-3 alkylene-naphthyl. r ₆ can be H or the side chain of an amino acid in Table 1 or Table 3. R ₆ can be H or an amino acid residue having a side chain comprising an aromatic group. R ₆ can be H, -CH ₂ Ph or-CH ₂ naphthyl. R ₆ can be H or-CH ₂ Ph.

R ₇ can be H, -alkylene-aryl, -alkylene-heteroaryl. R ₇ can be H, -C _1-3 alkylene-aryl, or-C _1-3 alkylene-heteroaryl. r ₇ can be H or-alkylene-aryl. R ₇ can be H or-C _1-3 alkylene-aryl. The C _1-3 alkylene group may be methylene. The aryl group may be a 6 to 14 membered aryl group. The heteroaryl group may be a 6 to 14 membered heteroaryl group having one or more heteroatoms selected from N, O and S. The aryl group may be selected from phenyl, naphthyl or anthracenyl. Aryl may be phenyl or naphthyl. The aryl group may be phenyl. Heteroaryl groups may be pyridinyl, quinolinyl and isoquinolinyl. R ₇ can be H, -C _1-3 alkylene-Ph or-C _1-3 alkylene-naphthyl. R ₇ can be H or the side chain of an amino acid in Table 1 or Table 3. R ₇ can be H or an amino acid residue having a side chain comprising an aromatic group. R ₇ can be H, -CH ₂ Ph or-CH ₂ naphthyl. R ₇ can be H or-CH ₂ Ph.

One, two or three of R ₁、R₂、R₃、R₄、R₅、R₆ and R ₇ may be-CH ₂Ph.R₁、R₂、R₃、R₄、R₅、R₆ and one of R ₇ may be-CH ₂Ph.R₁、R₂、R₃、R₄、R₅、R₆ and two of R ₇ may be-CH ₂Ph.R₁、R₂、R₃、R₄、R₅、R₆ and three of R ₇ may be-CH ₂Ph.R₁、R₂、R₃、R₄、R₅、R₆ and at least one of R ₇ may be-CH ₂Ph.R₁、R₂、R₃、R₄、R₅、R₆ and no more than four of R ₇ may be-CH ₂ Ph.

One, two or three of R ₁、R₂、R₃ and R ₄ are-CH ₂Ph.R₁、R₂、R₃ and one of R ₄ are-CH ₂Ph.R₁、R₂、R₃ and two of R ₄ are-CH ₂Ph.R₁、R₂、R₃ and three of R ₄ are-CH ₂Ph.R₁、R₂、R₃ and at least one of R ₄ are-CH ₂ Ph.

One, two or three of R ₁、R₂、R₃、R₄、R₅、R₆ and R ₇ may be H. One of R ₁、R₂、R₃、R₄、R₅、R₆ and R ₇ may be H. Two of R ₁、R₂、R₃、R₄、R₅、R₆ and R ₇ may be H. Three of R ₁、R₂、R₃、R₄、R₅、R₆ and R ₇ may be H. At least one of R ₁、R₂、R₃、R₄、R₅、R₆ and R ₇ may be H. No more than three of R ₁、R₂、R₃、R₄、R₅、R₆ and R ₇ may be-CH ₂ Ph.

One, two or three of R ₁、R₂、R₃ and R ₄ are H. One of R ₁、R₂、R₃ and R ₄ is H. Two of R ₁、R₂、R₃ and R ₄ are H. Three of R ₁、R₂、R₃ and R ₄ are H. At least one of R ₁、R₂、R₃ and R ₄ is H.

At least one of R ₄、R₅、R₆ and R ₇ may be a side chain of 3-guanidino-2-aminopropionic acid. At least one of R ₄、R₅、R₆ and R ₇ may be a side chain of 4-guanidino-2-aminopropionic acid. At least one of R ₄、R₅、R₆ and R ₇ may be a side chain of arginine. At least one of R ₄、R₅、R₆ and R ₇ may be a side chain of homoarginine. At least one of R ₄、R₅、R₆ and R ₇ may be a side chain of N-methyl arginine. At least one of R ₄、R₅、R₆ and R ₇ may be a side chain of N, N-dimethylarginine. At least one of R ₄、R₅、R₆ and R ₇ may be a side chain of 2, 3-diaminopropionic acid. At least one of R ₄、R₅、R₆ and R ₇ may be a side chain of 2, 4-diaminobutyric acid, lysine. At least one of R ₄、R₅、R₆ and R ₇ may be a side chain of N-methyl lysine. At least one of R ₄、R₅、R₆ and R ₇ may be a side chain of N, N-dimethyl lysine. At least one of R ₄、R₅、R₆ and R ₇ may be a side chain of N-ethyl lysine. At least one of R ₄、R₅、R₆ and R ₇ may be a side chain of N, N-trimethyllysine, 4-guanidinophenylalanine. at least one of R ₄、R₅、R₆ and R ₇ may be a side chain of citrulline. At least one of R ₄、R₅、R₆ and R ₇ may be a side chain of N, N-dimethyl lysine, β -homoarginine. At least one of R ₄、R₅、R₆ and R ₇ may be the side chain of 3- (1-piperidinyl) alanine.

At least two of R ₄、R₅、R₆ and R ₇ may be side chains of 3-guanidino-2-aminopropionic acid. At least two of R ₄、R₅、R₆ and R ₇ may be side chains of 4-guanidino-2-aminobutyric acid. At least two of R ₄、R₅、R₆ and R ₇ may be side chains of arginine. At least two of R ₄、R₅、R₆ and R ₇ may be side chains of homoarginine. at least two of R ₄、R₅、R₆ and R ₇ may be side chains of N-methyl arginine. At least two of R ₄、R₅、R₆ and R ₇ may be side chains of N, N-dimethylarginine. At least two of R ₄、R₅、R₆ and R ₇ may be side chains of 2, 3-diaminopropionic acid. At least two of R ₄、R₅、R₆ and R ₇ may be side chains of 2, 4-diaminobutyric acid, lysine. at least two of R ₄、R₅、R₆ and R ₇ may be side chains of N-methyl lysine. At least two of R ₄、R₅、R₆ and R ₇ may be side chains of N, N-dimethyl lysine. At least two of R ₄、R₅、R₆ and R ₇ may be side chains of N-ethyl lysine. At least two of R ₄、R₅、R₆ and R ₇ may be side chains of N, N, N-trimethyllysine, 4-guanidinophenylalanine. At least two of R ₄、R₅、R₆ and R ₇ may be side chains of citrulline. At least two of R ₄、R₅、R₆ and R ₇ may be side chains of N, N-dimethyl lysine, β -homoarginine. at least two of R ₄、R₅、R₆ and R ₇ may be side chains of 3- (1-piperidinyl) alanine.

At least three of R ₄、R₅、R₆ and R ₇ may be side chains of 3-guanidino-2-aminopropionic acid. At least three of R ₄、R₅、R₆ and R ₇ may be side chains of 4-guanidino-2-aminobutyric acid. At least three of R ₄、R₅、R₆ and R ₇ may be side chains of arginine. At least three of R ₄、R₅、R₆ and R ₇ may be side chains of homoarginine. At least three of R ₄、R₅、R₆ and R ₇ may be side chains of N-methyl arginine. At least three of R ₄、R₅、R₆ and R ₇ may be side chains of N, N-dimethylarginine. At least three of R ₄、R₅、R₆ and R ₇ may be side chains of 2, 3-diaminopropionic acid. At least three of R ₄、R₅、R₆ and R ₇ may be side chains of 2, 4-diaminobutyric acid and lysine. At least three of R ₄、R₅、R₆ and R ₇ may be side chains of N-methyllysine. At least three of R ₄、R₅、R₆ and R ₇ may be side chains of N, N-dimethyl lysine. At least three of R ₄、R₅、R₆ and R ₇ may be side chains of N-ethyl lysine. At least three of R ₄、R₅、R₆ and R ₇ may be the side chain of N, N, N-trimethyllysine, 4-guanidinophenylalanine. At least three of R ₄、R₅、R₆ and R ₇ may be side chains of citrulline. At least three of R ₄、R₅、R₆ and R ₇ may be side chains of N, N-dimethyl lysine, beta-homoarginine. At least three of R ₄、R₅、R₆ and R ₇ may be side chains of 3- (1-piperidinyl) alanine.

AA _SC can be a side chain of an asparagine, glutamine or homoglutamine residue. AA _SC can be a side chain of a glutamine residue. cCPP can also comprise a linker conjugated to AA _SC (e.g., residues of asparagine, glutamine, or homoglutamine). Thus cCPP may also comprise linkers conjugated to asparagine, glutamine or homoglutamine residues. cCPP may also comprise a linker conjugated to the glutamine residue.

Q may be 1, 2 or 3.q may be 1 or 2.q may be 1.q may be 2.q may be 3.q may be 4.

M may be 1-3.m may be 1 or 2.m may be 0 and m may be 1.m may be 2.m may be 3.

CCPP of formula (a) may comprise the structure of formula (I): Or a protonated form thereof, wherein AA _SC、R₁、R₂、R₃、R₄、R₇, m, and q are as defined herein.

CCPP of formula (A) may comprise a structure of formula (I-a) or formula (I-b):

Or a protonated form thereof, wherein AA _SC、R₁、R₂、R₃、R₄ and m are as defined herein.

CCPP of formula (A) may comprise the structure of formula (I-1), (I-2), (I-3) or (I-4):

Or a protonated form thereof, wherein AA _SC and m are as defined herein.

CCPP of formula (A) may comprise the structure of formula (I-5) or (I-6):

Or a protonated form thereof, wherein AA _SC is as defined herein.

CCPP of formula (A) may comprise the structure of formula (I-1):

Or a protonated form thereof, wherein AA _SC and m are as defined herein.

CCPP of formula (A) may comprise the structure of formula (I-2): Or a protonated form thereof, wherein AA _SC and m are as defined herein.

CCPP of formula (A) may comprise the structure of formula (I-3):

Or a protonated form thereof, wherein AA _SC and m are as defined herein.

CCPP of formula (A) may comprise the structure of formula (I-4):

Or a protonated form thereof, wherein AA _SC and m are as defined herein.

CCPP of formula (a) may comprise the structure of formula (I-5):

Or a protonated form thereof, wherein AA _SC and m are as defined herein.

CCPP of formula (A) may comprise the structure of formula (I-6):

Or a protonated form thereof, wherein AA _SC and m are as defined herein.

CCPP may comprise one of the following sequences: FGFGRGR; gfFGrGr; ff phi GRGR; ffFGRGR; or Ff phi GrGr. cCPP may have one of the following sequences: FGFGRGRQ; gfFGrGrQ; ff phi GRGRQ; ffFGRGRQ; or Ff phi GrGrQ.

The present disclosure also relates to cCPP having the structure of formula (II):

Wherein:

AA _SC is an amino acid side chain;

R ^1a、R^1b and R ^1c are each independently 6 to 14 membered aryl or 6 to 14 membered heteroaryl;

r ^2a、R^2b、R^2c and R ^2d are independently amino acid side chains;

At least one of R ^2a、R^2b、R^2c and R ^2d is Or a protonated form thereof;

At least one of R ^2a、R^2b、R^2c and R ^2d is guanidine or a protonated form thereof;

each n "is independently an integer of 0,1, 2,3,4, or 5;

each n' is independently an integer of 0, 1, 2 or 3; and

If n' is 0, then R ^2a、R^2b、R^2b or R2 ^d are absent.

At least two of R ^2a、R^2b、R^2c and R ^2d may be Or a protonated form thereof. Two or three of R ^2a、R^2b、R^2c and R ^2d may be Or a protonated form thereof. One of R ^2a、R^2b、R^2c and R ^2d may be Or a protonated form thereof. At least one of R ^2a、R^2b、R^2c and R ^2d may beOr a protonated form thereof, and the remainder of R ^2a、R^2b、R^2c and R ^2d may be guanidine or a protonated form thereof. At least two of R ^2a、R^2b、R^2c and R ^2d may beOr a protonated form thereof, and the remainder of R ^2a、R^2b、R^2c and R ^2d may be guanidine or a protonated form thereof.

All R ^2a、R^2b、R^2c and R ² d may be Or a protonated form thereof. At least one of R ^2a、R^2b、R^2c and R ^2d may beOr a protonated form thereof, and the remainder of R ^2a、R^2b、R^2c and R ^2d may be guanidine or a protonated form thereof. At least two of the R ^2a、R^2b、R^2c and R ^2d groups may beOr a protonated form thereof, and the remainder of R ^2a、R^2b、R^2c and R ^2d may be guanidine or a protonated form thereof.

Each of R ^2a、R^2b、R^2c and R ^2d may independently be a side chain of 2, 3-diaminopropionic acid, 2, 4-diaminobutyric acid, the following acids: ornithine, lysine, methyllysine, dimethyllysine, trimethyllysine, homolysine, serine, homoserine, threonine, allothreonine, histidine, 1-methylhistidine, 2-aminobutyric acid, aspartic acid, glutamic acid or homoglutamic acid.

AA _SC can beWherein t may be an integer from 0 to 5. AA _SC can beWherein t may be an integer from 0 to 5.t may be 1 to 5.t is 2 or 3.t may be 2.t may be 3.

The AC described herein may be coupled to AA _SC. In embodiments, the linker (L) couples AC to AA _SC. In embodiments, the linker (L) is covalently bound to the backbone of the AC.

AA _SC can be a side chain of an asparagine, glutamine or homoglutamine residue. AA _SC can be a side chain of a glutamine residue. The cyclic peptide may comprise a linker conjugated to AA _SC (e.g., residues of asparagine, glutamine, or homoglutamine).

R ^1a、R^1b and R ^1c may each independently be a 6 to 14 membered aryl group. R ^1a、R^1b and R ^1c may each independently be a 6 to 14 membered heteroaryl group having one or more heteroatoms selected from N, O or S. R ^1a、R^1b and R ^1c may each be independently selected from phenyl, naphthyl, anthracenyl, pyridinyl, quinolinyl or isoquinolinyl. R ^1a、R^1b and R ^1c may each be independently selected from phenyl, naphthyl or anthracenyl. R ^1a、R^1b and R ^1c may each independently be phenyl or naphthyl. R ^1a、R^1b and R ^1c may each be independently selected pyridinyl, quinolinyl or isoquinolinyl.

Each n' may independently be 1 or 2. Each n' may be 1. Each n' may be 2. At least one n' may be 0. At least one n' may be 1. At least one n' may be 2. At least one n' may be 3. At least one n' may be 4. At least one n' may be 5.

Each n "may independently be an integer from 1 to 3. Each n "may independently be 2 or 3. Each n "may be 2. Each n "may be 3. At least one n "may be 0. At least one n "may be 1. At least one n "may be 2. At least one n "may be 3.

Each n "may independently be 1 or 2, and each n' may independently be 2 or 3. Each n "may be 1 and each n' may independently be 2 or 3. Each n "may be 1 and each n' may be 2. Each n "is 1 and each n' is 3.

CCPP of formula (II) may have the structure of formula (II-1):

Wherein R ^1a、R^1b、R^1c、R^2a、R^2b、R^2c、R^2d、AA_SC, n 'and n' are as defined herein.

CCPP of formula (II) may have the structure of formula (IIa):

Wherein R ^1a、R^1b、R^1c、R^2a、R^2b、R^2c、R^2d、AA_SC and n' are as defined herein.

CCPP of formula (II) may have the structure of formula (IIb):

Wherein R ^2a、R^2b、AA_SC and n' are as defined herein.

CCPP can have the structure of formula (IIb):

Or a protonated form thereof, wherein:

AA _SC and n' are as defined herein.

CCPP of formula (IIa) has one of the following structures:

Wherein AA _SC and n are as defined herein.

CCPP of formula (IIa) has one of the following structures:

Wherein AA _SC and n are as defined herein

CCPP of formula (IIa) has one of the following structures:

Wherein AA _SC and n are as defined herein.

CCPP of formula (II) may have the following structure:

cCPP of formula (II) may have the following structure:

cCPP can have the structure of formula (III):

Wherein:

AA _SC is an amino acid side chain;

R ^2a and R ^2c are each independently H, Or a protonated form thereof;

R ^2b and R ^2d are each independently guanidine or a protonated form thereof;

each n "is independently an integer from 1 to 3;

Each n' is independently an integer from 1 to 5; and

Each p' is independently an integer from 0 to 5.

The AC described herein may be coupled to AA _SC. The linker may couple AC to AA _SC. The linker may be covalently bound to the backbone of AC, the 5 'end of AC or the 3' end of AC.

CCPP of formula (III) may have the structure of formula (III-1):

Wherein:

AA _SC、R^1a、R^1a、R^1c、R^2a、R^2c、R^2b、R^2d, n ', n ", and p' are as defined herein.

CCPP of formula (III) may have the structure of formula (IIIa):

Wherein:

AA _SC、R^2a、R^2c、R^2b、R^2d, n ', n ", and p' are as defined herein.

In formulas (III), (III-1) and (IIIa), R ^a and R ^c may be H. R ^a and R ^c may be H and R ^b and R ^d may each independently be guanidine or a protonated form thereof. R ^a may be H. R ^b may be H. p' may be 0.R ^a and R ^c may be H and each p' may be 0.

In formulas (III), (III-1) and (IIIa), R ^a and R ^c may be H, R ^b and R ^d may each independently be guanidine or protonated form thereof, n "may be 2 or 3, and each p' may be 0.

P' may be 0.p' may be 1.p' may be 2.p' may be 3.p' may be 4.p' may be 5.

CCPP can have the following structure:

cCPP of formula (a) may be selected from:

CPP sequence
	(FfФRrRrQ)
(FfФCit-r-Cit-rQ)
	(FfФGrGrQ)
(FfFGRGRQ)
	(FGFGRGRQ)
(GfFGrGrQ)
	(FGFGRRRQ)
(FGFRRRRQ)

CCPP of formula (a) may be selected from:

CPP sequence
	FФRRRRQ
fФRrRrQ
	FfФRrRrQ
FfФCit-r-Cit-rQ
	FfФGrGrQ
FfФRGRGQ
	FfFGRGRQ
FGFGRGRQ
	GfFGrGrQ
FGFGRRRQ
	FGFRRRRQ

In embodiments cCPP is selected from:

CPP sequence	CPP sequence	CPP sequence
			FФRRRQ	RRFRФRQ	FФRRRRQK
FФRRRC	FRRRRФQ	FФRRRRQC
			FФRRRU	rRFRФRQ	fФRrRrRQ
RRRФFQ	RRФFRRQ	FФRRRRRQ
			RRRRФF	CRRRRFWQ	RRRRФFDΩC
FФRRRR	FfФRrRrQ	FФRRR
			FφrRrRq	FFФRRRRQ	FWRRR
FφrRrRQ	RFRFRФRQ	RRRФF
			FФRRRRQ	URRRRFWQ	RRRWF
fФRrRrQ	CRRRRFWQ

Φ=l-naphthylalanine; =d-naphthylalanine; Ω=l-norleucine

In embodiments, cCPP is not selected from:

Φ=l-naphthylalanine; =d-naphthylalanine; Ω=l-norleucine

CCPP may comprise the structure of formula (D):

Or a protonated form thereof,

Wherein:

r ₄ and R ₆ are independently H or an amino acid side chain;

AA _SC is an amino acid side chain;

Y is

Q is 1,2,3 or 4;

each m is independently an integer of 0, 1, 2 or 3, and

Each n is independently an integer of 0,1, 2 or 3.

CCPP of formula (D) may have the structure of formula (D-I):

Or a protonated form thereof,

Wherein:

r ₄ and R ₆ are independently H or an amino acid side chain;

AA _SC is an amino acid side chain;

q is 1,2,3 or 4;

each m is independently an integer of 0, 1, 2 or 3, and

Y is

CCPP of formula (D) may have the structure of formula (D-II):

Or a protonated form thereof,

Wherein:

r ₄ and R ₆ are independently H or an amino acid side chain;

AA _SC is an amino acid side chain;

q is 1,2,3 or 4;

each m is independently an integer of 0,1, 2 or 3,

Each n is independently an integer of 0, 1, 2 or 3, and

Y is

CCPP of formula (D) may have the structure of formula (D-III):

Or a protonated form thereof,

Wherein:

r ₄ and R ₆ are independently H or an amino acid side chain;

AA _SC is an amino acid side chain;

q is 1,2,3 or 4;

each m is independently an integer of 0,1, 2 or 3,

Each n is independently an integer of 0, 1, 2 or 3, and

Y is

CCPP of formula (D) may have the structure of formula (D-IV):

Or a protonated form thereof,

Wherein:

r ₄ and R ₆ are independently H or an amino acid side chain;

AA _SC is an amino acid side chain;

q is 1,2,3 or 4;

each m is independently an integer of 0, 1, 2 or 3, and

Y is

CCPP of formula (D) may have the structure of formula (D-V):

Or a protonated form thereof,

Wherein:

r ₄ and R ₆ are independently H or an amino acid side chain;

AA _SC is an amino acid side chain;

q is 1,2,3 or 4;

each m is independently an integer of 0, 1, 2 or 3, and

Y is

AA _SC may be conjugated to a linker.

Joint

CCPP of the present disclosure may be conjugated to a linker. The connector may connect the AC to cCPP. The linker may be attached to the side chain of the amino acid of cCPP and the AC may be attached at a suitable position on the linker.

The linker may be any suitable moiety that can conjugate cCPP to one or more additional moieties, such as an Exocyclic Peptide (EP) and/or AC. Prior to conjugation with cCPP and one or more additional moieties, the linker has two or more functional groups, each of which is capable of independently forming a covalent bond with cCPP and one or more additional moieties. The linker may be covalently bound to the 5 'end of the AC or the 3' end of the AC. The linker may be covalently bound to the 5' end of the AC. The linker may be covalently bound to the 3' end of the AC. The linker may be any suitable moiety that conjugates cCPP described herein with AC.

The linker may comprise a hydrocarbon linker.

The linker may comprise a cleavage site. The cleavage site may be a disulfide or caspase cleavage site (e.g., val-Cit-PABC).

The joint may comprise: (i) One or more D or L amino acids, each of which is optionally substituted; (ii) optionally substituted alkylene; (iii) optionally substituted alkenylene; (iv) optionally substituted alkynylene; (v) optionally substituted carbocyclyl; (vi) optionally substituted heterocyclyl; (vii) One or more- (R ^1-J-R²) z "-subunits, wherein R ¹ and R ² are each independently selected from alkylene, alkenylene, alkynylene, carbocyclyl, and heterocyclyl, each J is independently C, NR ³、-NR³ C (O) -, S, and O, wherein R ³ is independently selected from H, alkyl, alkenyl, alkynyl, carbocyclyl, and heterocyclyl, each of which is optionally substituted, and z" is an integer from 1 to 50; (viii) - (R ^1- J) z "-or- (J-R ¹) z" -, wherein R ¹ is each independently alkylene, alkenylene, alkynylene, carbocyclyl or heterocyclyl, each J is independently C, NR ³、-NR³ C (O) -, S or O, wherein R ³ is H, alkyl, alkenyl, alkynyl, carbocyclyl or heterocyclyl, each of which is optionally substituted, and z "is an integer from 1 to 50; or (ix) the linker may comprise one or more of (i) to (x).

The linker may comprise one or more D or L amino acids and/or- (R ^1-J-R²) z "-, wherein R ¹ and R ² are each independently alkylene, each J is independently C, NR ³、-NR³ C (O) -, S and O, wherein R ⁴ is independently selected from H and alkyl, and z" is an integer from 1 to 50; or a combination thereof.

The linker may comprise- (OCH ₂CH₂)_z ' - (e.g., as a spacer), where z ' is an integer from 1 to 23, e.g., 2, 3,4, 5, 6, 7, 8,9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, or 23, "- (OCH ₂CH₂) z '" may also be referred to as polyethylene glycol (PEG).

The linker may comprise one or more amino acids. The linker may comprise a peptide. The linker may comprise- (OCH ₂CH₂)_z '-and a peptide, wherein z' is an integer from 1 to 23, the peptide may comprise from 2 to 10 amino acids, the linker may further comprise a Functional Group (FG) capable of click chemistry reaction, FG may be an azide or alkyne, and triazole is formed when AC is conjugated to the linker.

The linker may comprise (i) a beta alanine residue and a lysine residue; (ii) - (J-R ¹) z "; or (iii) combinations thereof. Each R ¹ can independently be alkylene, alkenylene, alkynylene, carbocyclyl, or heterocyclyl, each J is independently C, NR ³、-NR³ C (O) -, S, or O, wherein R ³ is H, alkyl, alkenyl, alkynyl, carbocyclyl, or heterocyclyl, each of which is optionally substituted, and z "can be an integer from 1 to 50. Each R ¹ can be alkylene and each J can be O.

The linker may comprise residues of (i) beta-alanine, glycine, lysine, 4-aminobutyric acid, 5-aminopentanoic acid, 6-aminocaproic acid, or a combination thereof; and (ii) - (R ^1- J) z '-or- (J-R ¹) z'. Each R ¹ can independently be alkylene, alkenylene, alkynylene, carbocyclyl, or heterocyclyl, each J is independently C, NR ³、-NR³ C (O) -, S, or O, wherein R ³ is H, alkyl, alkenyl, alkynyl, carbocyclyl, or heterocyclyl, each of which is optionally substituted, and z "can be an integer from 1 to 50. Each R ¹ can be alkylene and each J can be O. The linker may comprise glycine, beta-alanine, 4-aminobutyric acid, 5-aminopentanoic acid, 6-aminocaproic acid, or a combination thereof.

The linker may be a trivalent linker. The joint may have the following structure: Wherein A ₁、B₁ and C ₁ can independently be a hydrocarbon linker (e.g., NRH- (CH ₂)_n -COOH), a PEG linker (e.g., NRH- (CH ₂O)_n -COOH, wherein R is H, methyl, or ethyl), or one or more amino acid residues, and Z independently is a protecting group.

The hydrocarbon may be a glycine or beta-alanine residue.

The linker may be divalent and connects cCPP to AC. The linker may be bivalent and connects cCPP to the Exocyclic Peptide (EP).

The linker may be trivalent and connects cCPP to AC and EP.

The linker may be a divalent or trivalent C ₁-C₅₀ alkylene group, wherein 1-25 methylene groups are optionally and independently replaced by-N (H) -, -N (C ₁-C₄ alkyl) -, -N (cycloalkyl) -, -O-, -C (O) O-, -S (O) ₂-、-S(O)₂N(C₁-C₄ alkyl) -, -S (O) ₂ N (cycloalkyl) -, -N (H) C (O) -, -N (C ₁-C₄ alkyl) C (O) -, -N (cycloalkyl) C (O) -, -C (O) N (H) -, -C (O) N (C ₁-C₄ alkyl), -C (O) N (cycloalkyl), aryl, heterocyclyl, heteroaryl, cycloalkyl or cycloalkenyl. The linker may be a divalent or trivalent C ₁-C₅₀ alkylene group in which 1-25 methylene groups are optionally and independently replaced by-N (H) -, -O-, -C (O) N (H) -or a combination thereof.

AC may be coupled to the glutamic acid of the cyclic peptide, which converts the glutamic acid to glutamine. The linker (L) may couple AC to glutamine/glutamate of the cyclic peptide. In embodiments, the linker (L) is covalently bound to the backbone of the AC.

The joint may have the following structure:

Wherein: each AA is independently an amino acid residue; * Is an attachment point to AA _SC, and AA _SC is a side chain of the amino acid residue of cCPP; x is an integer from 1 to 10; y is an integer from 1 to 5; and z is an integer from 1 to 10. x may be an integer from 1 to 5. x may be an integer from 1 to 3. x may be 1.y may be an integer from 2 to 4. y may be 4.z may be an integer from 1 to 5. z may be an integer from 1 to 3. z may be 1. Each AA may be independently selected from glycine, beta-alanine, 4-aminobutyric acid, 5-aminopentanoic acid, and 6-aminocaproic acid.

CCPP may be attached to the AC by a joint ("L"). The linker may be conjugated to AC through a binding group ("M").

The joint may have the following structure:

Wherein; x is an integer from 1 to 10; y is an integer from 1 to 5; z is an integer from 1 to 10; each AA is independently an amino acid residue; * Is an attachment point to AA _SC, and AA _SC is a side chain of the amino acid residue of cCPP; and M is a binding group as defined herein.

The joint may have the following structure:

wherein; x' is an integer from 1 to 23; y is an integer from 1 to 5; z' is an integer from 1 to 23; * Is an attachment point to AA _SC, and AA _SC is a side chain of the amino acid residue of cCPP; and M is a binding group as defined herein.

The joint may have the following structure:

wherein: x' is an integer from 1 to 23; y is an integer from 1 to 5; and z' is an integer from 1 to 23; * Is an attachment point to AA _SC, and AA _SC is a side chain of the amino acid residue of cCPP.

X may be an integer from 1 to 10, such as 1,2,3,4, 5, 6, 7, 8, 9, or 10, including all ranges and subranges therebetween.

X' may be an integer from 1 to 23, such as 1,2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, or 23, including all ranges and subranges therebetween. x' may be an integer from 5 to 15. x' may be an integer from 9 to 13. x' may be an integer from 1 to 5. x' may be 1.

Y may be an integer from 1 to 5, such as 1,2, 3, 4 or 5, including all ranges and subranges therebetween. y may be an integer from 2 to 5.y may be an integer from 3 to 5.y may be 3 or 4.y may be 4 or 5.y may be 3.y may be 4.y may be 5.

Z may be an integer from 1 to 10, such as 1,2,3,4, 5, 6, 7, 8, 9, or 10, including all ranges and subranges therebetween.

Z' may be an integer from 1 to 23, such as 1,2,3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, or 23, including all ranges and subranges therebetween. z' may be an integer from 5 to 15. z' may be an integer from 9 to 13. z' may be 11.

As discussed above, the linker or M (where M is part of the linker) may be covalently bound to AC (any suitable position on AC). The linker or M (where M is part of the linker) may be covalently bound to the 3 'end of the AC or the 5' end of the AC. The linker or M (where M is part of the linker) may be covalently bound to the backbone of the AC.

The linker may be bound to a side chain of aspartic acid, glutamic acid, glutamine, asparagine, or lysine on cCPP, or a modified side chain of glutamine or asparagine (e.g., a reduced side chain having an amino group). The linker may be attached to the side chain of the lysine on cCPP.

The joint may have the following structure:

Wherein the method comprises the steps of

M is a group that conjugates L with AC;

AA _s is the side chain or terminal of the amino acid on cCPP;

Each AA _x is independently an amino acid residue;

o is an integer from 0 to 10; and

P is an integer from 0 to 5.

The joint may have the following structure:

Wherein the method comprises the steps of

M is a group that conjugates L with AC;

AA _s is the side chain or terminal of the amino acid on cCPP;

Each AA _x is independently an amino acid residue;

o is an integer from 0 to 10; and

P is an integer from 0 to 5.

M may include alkylene, alkenylene, alkynylene, carbocyclyl, or heterocyclyl, each of which is optionally substituted. M may be selected from:

wherein R is alkyl, alkenyl, alkynyl, carbocyclyl or heterocyclyl.

M may be selected from:

Wherein: r ¹⁰ is alkylene, cycloalkyl or Wherein a is 0 to 10.

M may beR ¹⁰ can beAnd a is 0 to 10.M may be

M may be a heterobifunctional crosslinker, e.gIt is disclosed in Williams et al Curr. Protoc Nucleic Acid chem.2010, 42,4.41.1-4.41.20, which is incorporated herein by reference in its entirety.

M may be-C (O) -.

AA _s may be a side chain or a terminal end of an amino acid on cCPP. Non-limiting examples of AA _s include aspartic acid, glutamic acid, glutamine, asparagine, or lysine, or modified side chains of glutamine or asparagine (e.g., reduced side chains having an amino group). AA _s may be AA _SC as defined herein.

Each AA _X is independently a natural or unnatural amino acid. One or more AA _X may be a natural amino acid. One or more AA _X may be an unnatural amino acid. One or more AA _X may be a β -amino acid. The beta-amino acid may be beta-alanine.

O may be an integer from 0 to 10, such as 0,1, 2,3, 4,5,6, 7, 8, 9, and 10.o may be 0,1, 2 or 3.o may be 0.o may be 1.o may be 2.o may be 3.

P may be 0 to 5, for example 0,1, 2,3, 4 or 5.p may be 0.p may be 1.p may be 2.p may be 3.p may be 4.p may be 5.

The joint may have the following structure:

Wherein M, AA _s, each- (R ^1-J-R²) z "-, o, and z" are as defined herein; r may be 0 or 1.

R may be 0.r may be 1.

The joint may have the following structure:

Wherein M, AA _s, o, p, q, r, and z "each may be as defined herein.

Z "may be an integer from 1 to 50, such as 1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40、41、42、43、44、45、46、47、48、49 and 50, including all ranges and values therebetween. z "may be an integer from 5 to 20. z "may be an integer from 10 to 15.

The joint may have the following structure:

Wherein:

m, AA _s and o are as defined herein.

Other non-limiting examples of suitable linkers include:

Wherein M and AA _s are as defined herein.

Provided herein are compounds comprising cCPP and AC complementary to a target in a pre-mRNA sequence, the compounds further comprising L, wherein the linker is conjugated to AC through a binding group (M), wherein M is

Provided herein are compounds comprising cCPP and an Antisense Compound (AC), such as an antisense oligonucleotide, which is complementary to a target in a pre-mRNA sequence, wherein the compound further comprises L, wherein the linker is conjugated to AC through a binding group (M), wherein M is selected from the group consisting of: Wherein: r ¹ is alkylene, cycloalkyl or Wherein t' is 0 to 10, wherein each R is independently alkyl, alkenyl, alkynyl, carbocyclyl, or heterocyclyl, wherein R ¹ isAnd t' is 2.

The joint may have the following structure:

wherein AA _s is as defined herein, and m' is 0-10.

The linker may have the formula:

The linker may have the formula: Wherein "base" corresponds to the nucleobase at the 3' end of the phosphorodiamidate morpholino oligomer.

The linker may have the formula:

Wherein "base" corresponds to the nucleobase at the 3' end of the phosphorodiamidate morpholino oligomer.

The linker may have the formula:

The linker may be covalently bound at any suitable position on the AC. The linker may be covalently bound to the 3 'end of the AC or the 5' end of the AC. The linker may be covalently bound to the backbone of the AC.

CCPP-linker conjugates

CCPP may be conjugated to a linker as defined herein. The linker may be conjugated to AA _SC of cCPP as defined herein.

The linker may comprise a- (OCH ₂CH₂)_z ' -subunit (e.g., as a spacer), wherein z ' is an integer from 1 to 23, e.g., 1, 2,3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, or 23 "- (OCH ₂CH₂)_z '" also known as peg.cpp-linker conjugate may have a structure selected from table 4:

Table 4: cCPP-linker conjugates

Ring (FfΦ -4gp-r-4 gp-rQ) -PEG ₄-K-NH₂
	Ring (FfΦ -Cit-r-Cit-rQ) -PEG ₄-K-NH₂
Ring (FfPhi-Pia-r-Pia-rQ) -PEG ₄-K-NH₂
	Ring (FfΦ -Dml-r-Dml-rQ) -PEG ₄-K-NH₂
Ring (Ffphi-Cit-r-Cit-rQ) -PEG ₁₂ -OH
	Ring (fPhi R-Cit-R-Cit-Q) -PEG ₁₂ -OH

The linker may comprise- (OCH ₂CH₂)_z '-subunit and a peptide subunit, wherein z' is an integer from 1 to 23, the peptide subunit may comprise from 2 to 10 amino acids cCPP-linker conjugate may have a structure selected from table 5:

table 5: endosomal escape vehicle (cCPP-linker conjugate)

Ac-PKKKKRKV-Lys (cyclo [ FfΦ -R-R-Cit-rQ ]) -PEG ₁₂-K(N₃)-NH₂
	Ac-PKKKKRKV-Lys (cyclo [ FfΦ -Cit-R-R-rQ ]) -PEG ₁₂-K(N₃)-NH₂
Ac-PKKKKRKV-K (Ring (FfΦR-cit-R-cit-Q)) -PEG ₁₂-K(N₃)-NH₂
	Ac-PKKKKRKV-PEG 2-Lys (cyclo [ FfΦ -Cit-r-Cit-rQ ]) -B-k (N) ₃)-NH₂
Ac-PKKKKRKV-PEG 2-Lys (cyclo [ FfΦ -Cit-r-Cit-rQ ]) -PEG2-k (N) ₃)-NH₂
	Ac-PKKKKRKV-PEG 2-Lys (cyclo [ FfΦ -Cit-r-Cit-rQ ]) -PEG4-k (N) ₃)-NH₂
Ac-PKKKKRKV-Lys (cyclo [ FfΦ -Cit-r-Cit-rQ ]) -PEG12-k (N ₃)-NH₂
	Ac-pkkkrkv-PEG2-Lys (cyclo [ FfΦ -Cit-r-Cit-rQ ]) -PEG12-k (N ₃)-NH₂
Ac-rrv-PEG2-Lys (cyclo [ FfΦ -Cit-r-Cit-rQ ]) -PEG12-OH
	Ac-PKKKKRKV-PEG 2-Lys (cyclo [ FfΦ -Cit-r-Cit-r-Q ]) -PEG12-k (N) ₃)-NH₂
Ac-PKKK-Cit-KV-PEG2-Lys (cyclo [ FfΦ -Cit-r-Cit-r-Q ]) -PEG12-k (N ₃)-NH₂
	Ac-PKKKKRKV-PEG 2-Lys (cyclo [ FfΦ -Cit-r-Cit-r-Q ] -PEG12-K (N) ₃)-NH₂

CCPP-linker conjugates can be Ac-PKKKRKVK (cyclo [ Ff. Phi. GrGrQ ]) PEG12-K (N ₃)-NH₂).

EEVs comprising a cyclic cell penetrating peptide (cCPP), a linker, and an Exocyclic Peptide (EP) are provided. EEV may comprise a structure of formula (B):

Or a protonated form thereof,

Wherein:

r ₄ and R ₆ are independently H or an amino acid side chain;

EP is a cyclic exopeptide as defined herein;

Each m is independently an integer from 0 to 3;

n is an integer from 0 to 2;

x' is an integer from 1 to 20;

y is an integer from 1 to 5;

q is 1-4; and

Z' is an integer from 1 to 23.

R ₁、R₂、R₃、R₄、R₆, EP, m, q, y, x ', z' are as described herein.

N may be 0.n may be 1.n may be 2.

EEVs may comprise structures of formula (B-a) or (B-B):

Or a protonated form thereof, wherein EP, R ¹、R²、R³、R⁴, m and z' are as defined above in formula (B).

EEVs may comprise structures of formula (B-c):

Or a protonated form thereof, wherein EP, R ¹、R²、R³、R⁴ and m are as defined above in formula (B); AA is an amino acid as defined herein; m is as defined herein; n is an integer from 0 to 2; x is an integer from 1 to 10; y is an integer from 1 to 5; and z is an integer from 1 to 10.

EEVs may have the structure of formula (B-1), (B-2), (B-3) or (B-4):

or a protonated form thereof, wherein EP is as defined above in formula (B).

The EEV may comprise formula (B) and may have the following structure: ac-PKKKRKV-AEEA-K (cyclo [ FGFGRGRQ ]) -PEG ₁₂ -OH or Ac-PKKRKV-AEEA-K (cyclo [ GfFGrGrQ ]) -PEG ₁₂ -OH.

The EEV may comprise cCPP of the formula:

The EEV may comprise the formula: ac-PKKKRKV-niniPEG-Lys (loop (FfFGRGRQ) -miniPEG2-K (N3).

The EEV may be:

The EEV may be: ac-PKKKKRKV-K (cyclo (Ff-Nal-GrGrQ) -PEG ₁₂-K(N₃)-NH_s.

EEVs may be

EEVs may be Ac-P-K (Tfa) -K (Tfa) -K (Tfa) -R-K (Tfa) -V-AEEA-K (cyclo (Ff-Nal-GrGrQ) -PEG ₁₂ -OH or Ac-P-K (Tfa) -K (Tfa) -K (Tfa) -R-K (Tfa) -V-AEEA-K (cyclo (FGFGRGRQ) -PEG ₁₂ -OH).

EEVs may be

EEV may be Ac-PKKKRKV-miniPEG-K (ring (Ff-Nal-GrGrQ) -PEG12-OH.

EEVs may be

The EEV may be:

EEVs may be

EEVs may be selected from

Ac-rr-miniPEG-Dap [ cyclo (FfΦ -Cit-r-Cit-rQ) ] -PEG12-OH

Ac-frr-PEG2-Dap (Ring (FfΦ -Cit-r-Cit-rQ)) -PEG12-OH

Ac-rfr-PEG2-Dap (Ring (FfΦ -Cit-r-Cit-rQ)) -PEG12-OH

Ac-rbfbr-PEG2-Dap (Ring (FfΦ -Cit-r-Cit-rQ)) -PEG12-OH

Ac-rrr-PEG2-Dap (Ring (FfΦ -Cit-r-Cit-rQ)) -PEG12-OH

Ac-rbr-PEG2-Dap (Ring (FfΦ -Cit-r-Cit-rQ)) -PEG12-OH

Ac-rbrbr-PEG2-Dap (Ring (FfΦ -Cit-r-Cit-rQ)) -PEG12-OH

Ac-hh-PEG2-Dap (Ring (FfΦ -Cit-r-Cit-rQ)) -PEG12-OH

Ac-hbh-PEG2-Dap (Ring (FfΦ -Cit-r-Cit-rQ)) -PEG12-OH

Ac-hbhbh-PEG2-Dap (Ring (FfΦ -Cit-r-Cit-rQ)) -PEG12-OH

Ac-rbhbh-PEG2-Dap (Ring (FfΦ -Cit-r-Cit-rQ)) -PEG12-OH

Ac-hbrbh-PEG2-Dap (Ring (FfΦ -Cit-r-Cit-rQ)) -PEG12-OH

Ac-rr-Dap (Ring (FfΦ -Cit-r-Cit-rQ)) -b-OH

Ac-frr-Dap (Ring (FfΦ -Cit-r-Cit-rQ)) -b-OH

Ac-rfr-Dap (Ring (FfΦ -Cit-r-Cit-rQ)) -b-OH

Ac-rbfbr-Dap (Ring (FfΦ -Cit-r-Cit-rQ)) -b-OH

Ac-rrr-Dap (Ring (FfΦ -Cit-r-Cit-rQ)) -b-OH

Ac-rbr-Dap (Ring (FfΦ -Cit-r-Cit-rQ)) -b-OH

Ac-rbrbr-Dap (Ring (FfΦ -Cit-r-Cit-rQ)) -b-OH

Ac-hh-Dap (Ring (FfΦ -Cit-r-Cit-rQ)) -b-OH

Ac-hbh-Dap (Ring (FfΦ -Cit-r-Cit-rQ)) -b-OH

Ac-hbhbh-Dap (Ring (FfΦ -Cit-r-Cit-rQ)) -b-OH

Ac-rbhbh-Dap (Ring (FfΦ -Cit-r-Cit-rQ)) -b-OH

Ac-hbrbh-Dap (Ring (FfΦ -Cit-r-Cit-rQ)) -b-OH

Ac-KKKK-miniPEG-Lys (cyclo (Ff-Nal-GrGrQ)) -miniPEG-K (N3) -NH2

Ac-KGKK-miniPEG-Lys (cyclo (Ff-Nal-GrGrQ)) -miniPEG-K (N3) -NH2

Ac-KKGK-miniPEG-Lys (cyclo (Ff-Nal-GrGrQ)) -miniPEG-K (N3) -NH2

Ac-KKK-miniPEG-Lys (Ring (Ff-Nal-GrGrQ)) -miniPEG-K (N3) -NH2

Ac-KK-miniPEG-Lys (Ring (Ff-Nal-GrGrQ)) -miniPEG2-K (N3) -NH2

Ac-KGK-miniPEG-Lys (Ring (Ff-Nal-GrGrQ)) -miniPEG-K (N3) -NH2

Ac-KBK-miniPEG-Lys (Ring (Ff-Nal-GrGrQ)) -miniPEG-K (N3) -NH2

Ac-KBKBK-miniPEG-Lys (cyclo (Ff-Nal-GrGrQ)) -miniPEG-K (N3) -NH2

Ac-KR-miniPEG-Lys (Ring (Ff-Nal-GrGrQ)) -miniPEG-K (N3) -NH2

Ac-KBR-miniPEG-Lys (Ring (Ff-Nal-GrGrQ)) -miniPEG-K (N3) -NH2

Ac-PKKKRKV-miniPEG-Lys (Ring (Ff-Nal-GrGrQ)) -miniPEG-K (N3) -NH2

Ac-PGKKRKV-miniPEG-Lys (cyclo (Ff-Nal-GrGrQ)) -miniPEG-K (N3) -NH2

Ac-PKGKRKV-miniPEG-Lys (cyclo (Ff-Nal-GrGrQ)) -miniPEG-K (N3) -NH2

Ac-PKKGRKV-miniPEG-Lys (cyclo (Ff-Nal-GrGrQ)) -miniPEG-K (N3) -NH2

Ac-PKKKGKV-miniPEG-Lys (cyclo (Ff-Nal-GrGrQ)) -miniPEG-K (N3) -NH2

Ac-PKKKRGV-miniPEG-Lys (cyclo (Ff-Nal-GrGrQ)) -miniPEG-K (N3) -NH2

Ac-PKKKRKG-miniPEG-Lys (cyclo (Ff-Nal-GrGrQ)) -miniPEG-K (N3) -NH2

Ac-KKKRK-miniPEG-Lys (cyclo (Ff-Nal-GrGrQ)) -miniPEG-K (N3) -NH2

Ac-KKRK-miniPEG-Lys (cyclo (Ff-Nal-GrGrQ)) -miniPEG-K (N3) -NH2 and

Ac-KRK-miniPEG-Lys (Ring (Ff-Nal-GrGrQ)) -miniPEG-K (N3) -NH2.

EEV may be selected from:

Ac-PKKKKRKV-Lys (cyclo [ FfΦ GrGrQ ]) -PEG ₁₂-K(N₃)-NH₂

Ac-PKKKKRKV-miniPEG ₂ -Lys (cyclo [ FfΦ GrGrQ ]) -miniPEG ₂-K(N₃)-NH₂

Ac-PKKKKRKV-miniPEG ₂ -Lys (cyclo [ FGFGRGRQ ]) -miniPEG ₂-K(N₃)-NH₂

Ac-KR-PEG ₂ -K (cyclo [ FGFGRGRQ ]) -PEG ₂-K(N₃)-NH₂

Ac-PKKKGKV-PEG ₂ -K (cyclo [ FGFGRGRQ ]) -PEG ₂-K(N₃)-NH₂

Ac-PKKKRKG-PEG ₂ -K (cyclo [ FGFGRGRQ ]) -PEG ₂-K(N₃)-NH₂

Ac-KKKRK-PEG ₂ -K (cyclo [ FGFGRGRQ ]) -PEG ₂-K(N₃)-NH₂

Ac-PKKKKRKV-miniPEG ₂ -Lys (cyclo [ FF. Phi. GRGRQ ]) -miniPEG ₂-K(N₃)-NH₂

Ac-PKKKKRKV-miniPEG ₂ -Lys (cyclo [ beta hFf. Phi. GrGrQ ]) -miniPEG ₂-K(N₃)-NH₂ and

Ac-PKKKRKV-miniPEG ₂ -Lys (cyclo [ FfΦ SrSrQ ]) -miniPEG ₂-K(N₃)-NH₂.

EEV may be selected from:

Ac-PKKKKRKV-miniPEG ₂ -Lys (loop (GfFGrGrQ)) PEG ₁₂ -OH

Ac-PKKKKRKV-miniPEG ₂ -Lys (cyclo [ FGFKRKRQ ]) -PEG ₁₂ -OH

Ac-PKKKKRKV-miniPEG ₂ -Lys (cyclo [ FGFRGRGQ ]) -PEG ₁₂ -OH

Ac-PKKKKRKV-miniPEG ₂ -Lys (cyclo [ FGFGRGRGRQ ]) -PEG ₁₂ -OH

Ac-PKKKKRKV-miniPEG ₂ -Lys (cyclo [ FGFGRrRQ ]) -PEG ₁₂ -OH

Ac-PKKKKRKV-miniPEG ₂ -Lys (cyclo [ FGFGRRRQ ]) -PEG ₁₂ -OH and

Ac-PKKKRKV-miniPEG ₂ -Lys (cyclo [ FGFRRRRQ ]) -PEG ₁₂ -OH.

EEV may be selected from:

Ac-K-K-K-R-K-G-miniPEG ₂ -K (cyclo [ FGFGRGRQ ]) -PEG ₁₂ -OH

Ac-K-K-K-R-K-miniPEG ₂ -K (cyclo [ FGFGRGRQ ]) -PEG ₁₂ -OH

Ac-K-K-R-K-K-PEG ₄ -K (cyclo [ FGFGRGRQ ]) -PEG ₁₂ -OH

Ac-K-R-K-K-K-PEG ₄ -K (cyclo [ FGFGRGRQ ]) -PEG ₁₂ -OH

Ac-K-K-K-K-R-PEG ₄ -K (cyclo [ FGFGRGRQ ]) -PEG ₁₂ -OH

Ac-R-K-K-K-K-PEG ₄ -K (cyclo [ FGFGRGRQ ]) -PEG ₁₂ -OH and

Ac-K-K-K-R-K-PEG ₄ -K (cyclo [ FGFGRGRQ ]) -PEG ₁₂ -OH.

EEV may be selected from:

Ac-PKKKRKV-PEG ₂ -K (cyclo [ FGFGRGRQ ]) -PEG ₂-K(N₃)-NH₂

Ac-PKKKRKV-PEG ₂ -K (cyclo [ FGFGRGRQ ]) -PEG ₁₂ -OH

Ac-PKKKKRKV-PEG ₂ -K (cyclo [ GfFGrGrQ ]) -PEG ₂-K(N₃)-NH₂ and

Ac-PKKKRKV-PEG ₂ -K (cyclo [ GfFGrGrQ ]) -PEG ₁₂ -OH.

The cargo may be AC and the EEV may be selected from:

Ac-PKKKRKV-PEG ₂ -K (cyclo [ FfPhi GrGrQ ]) -PEG ₁₂ -OH

Ac-PKKKKRKV-PEG ₂ -K (cyclo [ FfΦCit-r-Cit-rQ ]) -PEG ₁₂ -OH

Ac-PKKKRKV-PEG ₂ -K (cyclo [ FfFGRGRQ ]) -PEG ₁₂ -OH

Ac-PKKKRKV-PEG ₂ -K (cyclo [ FGFGRGRQ ]) -PEG ₁₂ -OH

Ac-PKKKRKV-PEG ₂ -K (cyclo [ GfFGrGrQ ]) -PEG ₁₂ -OH

Ac-PKKKRKV-PEG ₂ -K (cyclo [ FGFGRRRQ ]) -PEG ₁₂ -OH

Ac-PKKKRKV-PEG ₂ -K (cyclo [ FGFRRRRQ ]) -PEG ₁₂ -OH

Ac-rr-PEG ₂ -K (cyclo [ FfPhi GrGrQ ]) -PEG ₁₂ -OH

Ac-rr-PEG ₂ -K (cyclo [ FfΦCit-r-Cit-rQ ]) -PEG ₁₂ -OH

Ac-rr-PEG ₂ -K (cyclo [ FfF-GRGRQ ]) -PEG ₁₂ -OH

Ac-rr-PEG ₂ -K (cyclo [ FGFGRGRQ ]) -PEG ₁₂ -OH

Ac-rr-PEG ₂ -K (cyclo [ GfFGrGrQ ]) -PEG ₁₂ -OH

Ac-rr-PEG ₂ -K (cyclo [ FGFGRRRQ ]) -PEG ₁₂ -OH

Ac-rr-PEG ₂ -K (cyclo [ FGFRRRRQ ]) -PEG ₁₂ -OH

Ac-rrr-PEG ₂ -K (cyclo [ FfPhi GrGrQ ]) -PEG ₁₂ -OH

Ac-rrr-PEG ₂ -K (cyclo [ FfΦCit-r-Cit-rQ ]) -PEG ₁₂ -OH

Ac-rrr-PEG ₂ -K (cyclo [ FfFGRGRQ ]) -PEG ₁₂ -OH

Ac-rrr-PEG ₂ -K (cyclo [ FGFGRGRQ ]) -PEG ₁₂ -OH

Ac-rrr-PEG ₂ -K (cyclo [ GfFGrGrQ ]) -PEG ₁₂ -OH

Ac-rrr-PEG ₂ -K (cyclo [ FGFGRRRQ ]) -PEG ₁₂ -OH

Ac-rrr-PEG ₂ -K (cyclo [ FGFRRRRQ ]) -PEG ₁₂ -OH

Ac-rhr-PEG ₂ -K (cyclo [ FfΦ GrGrQ ]) -PEG ₁₂ -OH

Ac-rhr-PEG ₂ -K (cyclo [ FfΦCit-r-Cit-rQ ]) -PEG ₁₂ -OH

Ac-rhr-PEG ₂ -K (cyclo [ FfFGRGRQ ]) -PEG ₁₂ -OH

Ac-rhr-PEG ₂ -K (cyclo [ FGFGRGRQ ]) -PEG ₁₂ -OH

Ac-rhr-PEG ₂ -K (cyclo [ GfFGrGrQ ]) -PEG ₁₂ -OH

Ac-rhr-PEG ₂ -K (cyclo [ FGFGRRRQ ]) -PEG ₁₂ -OH

Ac-rhr-PEG ₂ -K (cyclo [ FGFRRRRQ ]) -PEG ₁₂ -OH

Ac-rbr-PEG ₂ -K (cyclo [ FfΦ GrGrQ ]) -PEG ₁₂ -OH

Ac-rbr-PEG ₂ -K (cyclo [ FfΦCit-r-Cit-rQ ]) -PEG ₁₂ -OH

Ac-rbr-PEG ₂ -K (cyclo [ FfFGRGRQ ]) -PEG ₁₂ -OH

Ac-rbr-PEG ₂ -K (cyclo [ FGFGRGRQ ]) -PEG ₁₂ -OH

Ac-rbr-PEG ₂ -K (cyclo [ GfFGrGrQ ]) -PEG ₁₂ -OH

Ac-rbr-PEG ₂ -K (cyclo [ FGFGRRRQ ]) -PEG ₁₂ -OH

Ac-rbr-PEG ₂ -K (cyclo [ FGFRRRRQ ]) -PEG ₁₂ -OH

Ac-rbrbr-PEG ₂ -K (cyclo [ FfΦ GrGrQ ]) -PEG ₁₂ -OH

Ac-rbrbr-PEG ₂ -K (cyclo [ FfΦCit-r-Cit-rQ ]) -PEG ₁₂ -OH

Ac-rbrbr-PEG ₂ -K (cyclo [ FfFGRGRQ ]) -PEG ₁₂ -OH

Ac-rbrbr-PEG ₂ -K (cyclo [ FGFGRGRQ ]) -PEG ₁₂ -OH

Ac-rbrbr-PEG ₂ -K (cyclo [ GfFGrGrQ ]) -PEG ₁₂ -OH

Ac-rbrbr-PEG ₂ -K (cyclo [ FGFGRRRQ ]) -PEG ₁₂ -OH

Ac-rbrbr-PEG ₂ -K (cyclo [ FGFRRRRQ ]) -PEG ₁₂ -OH

Ac-rbhbr-PEG ₂ -K (cyclo [ FfΦ GrGrQ ]) -PEG ₁₂ -OH

Ac-rbhbr-PEG ₂ -K (cyclo [ FfΦCit-r-Cit-rQ ]) -PEG ₁₂ -OH

Ac-rbhbr-PEG ₂ -K (cyclo [ FfFGRGRQ ]) -PEG ₁₂ -OH

Ac-rbhbr-PEG ₂ -K (cyclo [ FGFGRGRQ ]) -PEG ₁₂ -OH

Ac-rbhbr-PEG ₂ -K (cyclo [ GfFGrGrQ ]) -PEG ₁₂ -OH

Ac-rbhbr-PEG ₂ -K (cyclo [ FGFGRRRQ ]) -PEG ₁₂ -OH

Ac-rbhbr-PEG ₂ -K (cyclo [ FGFRRRRQ ]) -PEG ₁₂ -OH

Ac-hbrbh-PEG ₂ -K (cyclo [ FfΦ GrGrQ ]) -PEG ₁₂ -OH

Ac-hbrbh-PEG ₂ -K (cyclo [ FfΦCit-r-Cit-rQ ]) -PEG ₁₂ -OH

Ac-hbrbh-PEG ₂ -K (cyclo [ FfFGRGRQ ]) -PEG ₁₂ -OH

Ac-hbrbh-PEG ₂ -K (cyclo [ FGFGRGRQ ]) -PEG ₁₂ -OH

Ac-hbrbh-PEG ₂ -K (cyclo [ GfFGrGrQ ]) -PEG ₁₂ -OH

Ac-hbrbh-PEG ₂ -K (cyclo [ FGFGRRRQ ]) -PEG ₁₂ -OH and

Ac-hbrbh-PEG ₂ -K (cyclo [ FGFRRRRQ ]) -PEG ₁₂ -OH,

Wherein b is beta-alanine and the exocyclic sequence may be D or L stereochemistry.

In embodiments, the compound comprising the cyclic peptide and AC has improved cytoplasmic uptake efficiency compared to the compound comprising AC alone. Cytoplasmic uptake efficiency can be measured by comparing the cytoplasmic delivery efficiency of a compound comprising a cyclic peptide and AC with the cytoplasmic delivery efficiency of AC alone.

Antisense compounds

In various embodiments, the compounds disclosed herein comprise a CPP (e.g., a cyclic peptide) conjugated to an Antisense Compound (AC). In embodiments, the AC comprises an antisense oligonucleotide directed against the target polynucleotide. The term "antisense oligonucleotide" or simply "antisense" is intended to include oligonucleotides complementary to a target polynucleotide sequence. Antisense oligonucleotides are single strands of DNA or RNA complementary to a selected sequence (e.g., target gene mRNA).

Antisense oligonucleotides can modulate one or more aspects of protein transcription, translation, and expression. In embodiments, the antisense oligonucleotide is directed against a target sequence within a target pre-mRNA to modulate one or more aspects of pre-mRNA splicing. As used herein, modulation of splicing refers to altering the processing of a pre-mRNA transcript such that a spliced mRNA molecule contains a combination of different exons due to exon skipping or exon inclusion, deletion of one or more exons, or deletion or addition of sequences (e.g., intron sequences) that are not normally present in a spliced mRNA. In embodiments, hybridization of the antisense oligonucleotide to a target sequence in a pre-mRNA molecule restores native splicing to the mutated pre-mRNA sequence. In embodiments, antisense oligonucleotide hybridization results in alternative splicing of the target pre-mRNA. In embodiments, antisense oligonucleotide hybridization results in exon inclusion or exon skipping of one or more exons. In embodiments, the skipped exon sequence comprises a frameshift mutation, a nonsense mutation, or a missense mutation. In embodiments, the skipped exon sequence comprises a nucleic acid deletion, substitution, or insertion. In embodiments, the skipped exons do not themselves contain sequence mutations, but adjacent exons contain mutations that result in frame shift mutations or nonsense mutations. In embodiments, hybridization of the antisense oligonucleotide to a target sequence within the target pre-mRNA prevents inclusion of an exon sequence in the mature mRNA molecule. In embodiments, hybridization of the antisense oligonucleotide to a target sequence within the target pre-mRNA results in preferential expression of the wild-type target protein isoform. In embodiments, hybridization of the antisense oligonucleotide to a target sequence within the target pre-mRNA results in expression of a re-spliced target protein comprising an active fragment of the wild-type target protein.

The antisense mechanism functions via hybridization of the antisense oligonucleotide compound to the target nucleic acid. In embodiments, hybridization of an antisense oligonucleotide to its target sequence inhibits expression of a target protein. In embodiments, hybridization of an antisense oligonucleotide to its target sequence inhibits expression of one or more wild-type target protein isoforms. In embodiments, hybridization of an antisense oligonucleotide to its target sequence upregulates expression of the target protein. In embodiments, hybridization of an antisense oligonucleotide to its target sequence increases expression of one or more wild-type target protein isoforms.

In embodiments, antisense compounds can inhibit gene expression by binding to complementary mRNA. Binding to the target mRNA can result in inhibition of gene expression by preventing translation of the complementary mRNA strand (by sterically blocking RNA-binding proteins involved in translation) or by causing degradation of the target mRNA. Antisense DNA can be used to target specific complementary (coding or non-coding) RNAs. If binding occurs, the DNA/RNA hybrid can be degraded by RNase H. In embodiments, the antisense oligonucleotide contains from about 10 to about 50 nucleotides or from about 15 to about 30 nucleotides. In embodiments, the antisense oligonucleotide may not be fully complementary to the target nucleotide sequence.

Antisense oligonucleotides have proven to be effective targeted inhibitors of protein synthesis and are therefore useful for specifically inhibiting protein synthesis by targeting genes. The efficacy of antisense oligonucleotides in inhibiting protein synthesis has been well established. For example, synthesis of polygalacturonase (polygalactauronase) and muscarinic type 2 acetylcholine receptors is inhibited by antisense oligonucleotides directed against their corresponding mRNA sequences (U.S. patent 5,739,119 and U.S. patent 5,759,829). In addition, examples of antisense inhibition have been demonstrated with nucleoprotein cyclin, multiple drug resistance genes (MDG 1), ICAM-1, E-selectin, STK-1, striatal GABAA receptor and human EGF (Jaskulski et al, science 6, 10, 1988; 240 (4858): 1544-6; vasanthakumar and Ahmed, cancer Commun.1989;1 (4): 225-32; peris et al, brain Res Mol Brain Res.1998, 15, 57 (2): 310-20; U.S. patent 5,801,154; U.S. Pat. No. 5,789,573; U.S. Pat. No. 5,718,709 and U.S. Pat. No. 5,610,288). In addition, antisense constructs have also been described to inhibit and are useful in the treatment of various abnormal cell proliferation, such as cancer (U.S. Pat. No. 5,747,470; U.S. Pat. No. 5,591,317 and U.S. Pat. No. 5,783,683).

Methods of generating antisense oligonucleotides are known in the art and can be readily adapted to generate antisense oligonucleotides targeted to any polynucleotide sequence. The selection of antisense oligonucleotide sequences specific for a given target sequence is based on analysis of the selected target sequence and determination of secondary structure, tm, binding energy and relative stability. Antisense oligonucleotides can be selected based on their relative inability to form dimers, hairpins, or other secondary structures that will reduce or inhibit specific binding to target mRNA in a host cell. Target regions of the mRNA may include those at or near the AUG translation initiation codon and those sequences that are substantially complementary to the 5' region of the mRNA. These secondary structural analysis and target site selection considerations may be performed, for example, using the 4 th edition of OLIGO primer analysis software (Molecular Biology Insights) and/or BLASTN 2.0.5 algorithm software (Altschul et al, nucleic Acids res.1997, 25 (17): 3389-402).

According to the present disclosure, antisense Compounds (ACs) alter one or more aspects of splicing, translation, or expression of a target gene, for example, by altering splicing of eukaryotic target pre-mRNA. An AC according to the present disclosure comprises a nucleic acid sequence that is complementary to a sequence found within a target pre-mRNA sequence (e.g., at a sequence that includes at least a portion of an exon, at least a portion of an intron, or both). The use of these ACs provides a straightforward genetic approach that can regulate splicing of specific pathogenic genes. The principle behind antisense technology is that antisense compounds hybridized to a target nucleic acid regulate gene expression events such as splicing or translation by one of a variety of antisense mechanisms. The sequence specificity of AC makes the technology attractive as a therapeutic approach to selectively modulate splicing of pre-mRNA involved in the pathogenesis of any of a variety of diseases. Antisense technology is an effective means for altering the expression of one or more specific gene products and may therefore prove useful in many therapeutic, diagnostic and research applications.

The compounds described herein may contain one or more asymmetric centers and thus give rise to enantiomers, diastereomers, and other stereoisomeric configurations, which may be defined as (R) or (S), α or β, or (D) or (L) depending on the absolute stereochemistry. Antisense compounds provided herein include all such possible isomers, as well as their racemic and optically pure forms.

Antisense compound hybridization sites

The antisense mechanism relies on hybridization of an antisense compound to a target nucleic acid. In embodiments, the present disclosure provides antisense compounds complementary to a target nucleic acid. In embodiments, the target nucleic acid sequence is present in a pre-mRNA molecule. In embodiments, the target nucleic acid sequence is present in an exon of a pre-mRNA molecule. In embodiments, the target nucleic acid sequence is present in an intron of a pre-mRNA molecule.

The pre-mRNA molecules are produced in the nucleus and processed before or during transport to the cytoplasm for translation. The processing of pre-mRNA involves the addition of a 5 'methylated cap and a poly (A) tail of about 200-250 bases at the 3' end of the transcript. The next step in mRNA processing is splicing of pre-mRNA, which occurs during the maturation of 90-95% of mammalian mRNA. Introns (or intervening sequences) are regions of the primary transcript (or DNA encoding it) that are not included in the coding sequence of the mature mRNA. Exons are regions of the primary transcript in which mature mRNAs remain when they reach the cytoplasm. Exons splice together to form the mature mRNA sequence. The splice junction is also referred to as a splice site, wherein the 5 'side of the junction is commonly referred to as the "5' splice site" or "splice donor site" and the 3 'side is referred to as the "3' splice site" or "splice acceptor site". In splicing, the 3 'end of the upstream exon is joined to the 5' end of the downstream exon. Thus, the non-spliced RNA (pre-mRNA) has an exon/intron junction at the 5 'end of the intron and an intron/exon junction at the 3' end of the intron. After introns are removed, exons are contiguous in the mature mRNA at what is sometimes referred to as an exon/exon junction or boundary. Cryptic splice sites are those splice sites that are not frequently used but can be used when the common splice site is blocked or otherwise unavailable. Alternative splicing, defined as splicing together different combinations of exons, typically results in the production of multiple mRNA transcripts from a single gene.

In embodiments, AC hybridizes to a sequence in a splice site. In embodiments, AC hybridizes to a sequence comprising a portion of a splice site. In embodiments, AC hybridizes to a sequence comprising part or all of the splice site. In embodiments, AC hybridizes to a sequence comprising part or all of the splice donor site. In embodiments, AC hybridizes to a sequence comprising part or all of a splice acceptor site. In embodiments, AC hybridizes to a sequence comprising a portion or all of a cryptic splice site. In embodiments, the AC hybridizes to a sequence comprising an exon/intron junction.

Pre-mRNA splicing involves two consecutive biochemical reactions. Both reactions involve a spliceosome transesterification reaction between RNA nucleotides. In the first reaction, the 2'-OH of a particular branch point nucleotide (defined during assembly of the spliceosome) within the intron nucleophilic attack the first nucleotide of the intron at the 5' splice site, thereby forming a lasso intermediate (LARIAT INTERMEDIATE). In the second reaction, the 3' -OH of the released 5' exon nucleophilic attack on the last nucleotide of the intron at the 3' splice site, thereby splicing the exon and releasing the intronic lasso. pre-mRNA splicing is regulated by Intronic Silencing Sequences (ISS) and Terminal Stem Loop (TSL) sequences. As used herein, the terms "Intronic Silencing Sequence (ISS)" and "Terminal Stem Loop (TSL)" refer to sequence elements within introns and exons, respectively, that control alternative splicing through the binding of trans-acting protein factors within pre-mRNA, resulting in the different use of splice sites. Typically, intronic silencing sequences are between 8 and 16 nucleotides and are less conserved than splice sites at exon-intron junctions. Terminal stem-loop sequences are typically between 12 and 24 nucleotides and form secondary loop structures due to complementarity and thus binding within the 12-24 nucleotide sequences.

In embodiments, the AC hybridizes to a sequence comprising a portion or all of an intron silencing sequence. In embodiments, AC hybridizes to a sequence comprising part or all of the terminal stem loop.

Up to 50% of human genetic diseases caused by point mutations are caused by aberrant splicing. Such point mutations may disrupt the current splice site or create new splice sites, resulting in mRNA transcripts containing different combinations of exons or exon deletions. Point mutations may also result in activation of cryptic splice sites or disruption of cis-regulatory elements (i.e., splice enhancers or silencers).

In embodiments, AC hybridizes to a sequence comprising part or all of the aberrant splice site created by a mutation in the target gene. In embodiments, the AC hybridizes to a sequence comprising part or all of the regulatory elements. Antisense compounds targeting cis regulatory elements are also provided. In embodiments, the regulatory element is in an exon. In embodiments, the regulatory element is in an intron.

In embodiments, the AC can specifically hybridize to sequences in the translational initiator region, the 5' cap region, the intron/exon junction, the coding sequence, the translational stop codon region, or the 5' -untranslated region or the 3' -untranslated region. In embodiments, AC may hybridize to part or all of the pre-mRNA splice site, exon-exon junction or intron-exon junction. In embodiments, the AC may hybridize to an aberrant fusion junction due to rearrangement or deletion. In embodiments, AC may hybridize to a particular exon in an alternatively spliced mRNA.

In embodiments, AC hybridizes to a sequence between 5 and 50 nucleotides in length, which may also be referred to as the length of AC. In embodiments, the AC is between 5 and 50 nucleotides in length, such as between 5 and 10, 10 and 15, 15 and 20, 20 and 25, 25 and 30, 30 and 35, 35 and 40, 40 and 45, or 45 and 50 nucleotides in length. In embodiments, the AC is about 5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40、41、42、43、44、45、46、47、48、49 or 50 nucleotides in length. In embodiments, the length of the AC is at least about 5, about 6, about 7, about 8, about 9, about 10, about 11, about 12, about 13, about 14, about 15, about 16, about 17, about 18, about 19, or about 20, and at most about 21, about 22, about 23, about 24, or about 25, and at most about 26, about 27, about 28, about 29, about 30, about 31, about 32, about 33, about 34, about 35, about 36, about 37, about 38, about 39, or about 40, and at most about 41, about 42, about 43, about 44, about 45, about 46, about 47, about 48, about 49, or about 50 nucleotides. In some embodiments, the AC is about 10 nucleotides in length. In some embodiments, the AC is about 15 nucleotides in length. In some embodiments, the AC is about 16 nucleotides in length. In some embodiments, the AC is about 17 nucleotides in length. In some embodiments, the AC is about 18 nucleotides in length. In some embodiments, the AC is about 19 nucleotides in length. In some embodiments, the AC is about 20 nucleotides in length. In some embodiments, the AC is about 21 nucleotides in length. In some embodiments, the AC is about 22 nucleotides in length. In some embodiments, the AC is about 23 nucleotides in length. In some embodiments, the AC is about 24 nucleotides in length. In some embodiments, the AC is about 25 nucleotides in length. In some embodiments, the AC is about 26 nucleotides in length. In some embodiments, the AC is about 27 nucleotides in length. In some embodiments, the AC is about 28 nucleotides in length. In some embodiments, the AC is about 29 nucleotides in length. In some embodiments, the AC is about 30 nucleotides in length.

In embodiments, AC can be less than 100% complementary to the target nucleic acid sequence. As used herein, the term "percent complementarity" refers to the number of nucleobases of AC that have nucleobase complementarity to the corresponding nucleobase of an oligomeric compound or nucleic acid divided by the total length of AC (the number of nucleobases). One skilled in the art recognizes that it is possible to include mismatches without eliminating the activity of the antisense compound. In embodiments, the AC may contain up to about 20% nucleotides that disrupt base pairing of the AC with the target nucleic acid. In embodiments, AC contains no more than about 15%, no more than about 10%, no more than 5% mismatch or no mismatch. In embodiments, the AC contains no more than 1, 2, 3, 4, or 5 mismatches. In embodiments, AC is at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% complementary to the target nucleic acid. The percent complementarity of an oligonucleotide is calculated by dividing the number of complementary nucleobases by the total number of nucleobases of the oligonucleotide. The percent complementarity of an oligonucleotide region is calculated by dividing the number of complementary nucleobases in the region by the total number of nucleobase bases regions.

In embodiments, incorporating nucleotide affinity modifications allows for a greater number of mismatches than unmodified compounds. Similarly, certain oligonucleotide sequences may be more tolerant of mismatches than other oligonucleotide sequences. One of ordinary skill in the art can determine the appropriate number of mismatches between oligonucleotides or between an oligonucleotide and a target nucleic acid, such as by determining the melting temperature (Tm). Tm or ATm can be calculated by techniques familiar to those of ordinary skill in the art. For example, freier et al (Nucleic ACIDS RESEARCH,1997, 25, 22:4429-4443) describe techniques that allow one of ordinary skill in the art to evaluate nucleotide modification-enhancing RNAs: ability of DNA duplex melting temperature.

Antisense mechanism

AC according to the present disclosure may regulate one or more aspects of protein transcription, translation, and expression. In embodiments, AC hybridized to a target sequence within a target pre-mRNA modulates one or more aspects of pre-mRNA splicing. As used herein, modulation of splicing refers to altering the processing of a pre-mRNA transcript such that a spliced mRNA molecule contains a combination of different exons due to exon skipping or exon inclusion, deletion of one or more exons, or deletion or addition of sequences (e.g., intron sequences) that are not normally present in a spliced mRNA. In embodiments, AC hybridization to a target sequence within a pre-mRNA molecule restores native splicing to the mutated pre-mRNA sequence. In embodiments, AC hybridization results in alternative splicing of the target pre-mRNA. In embodiments, AC hybridization results in exon inclusion or exon skipping of one or more exons. In embodiments, the skipped exon sequence comprises a frameshift mutation, a nonsense mutation, or a missense mutation. In embodiments, the skipped exon sequence comprises a nucleic acid deletion, substitution, or insertion. In embodiments, the skipped exons do not themselves contain sequence mutations, but adjacent exons contain mutations that result in frame shift mutations or nonsense mutations. In embodiments, the deletion of an exon that does not contain a sequence mutation restores the reading frame of the mature mRNA. In embodiments, hybridization of AC to a target sequence within a target pre-mRNA results in preferential expression of a wild-type target protein isoform. In embodiments, hybridization of AC to a target sequence within a target pre-mRNA results in expression of a re-spliced target protein comprising an active fragment of a wild-type target protein.

The antisense mechanism functions via hybridization of the antisense compound to the target nucleic acid. In embodiments, hybridization of AC to its target sequence inhibits expression of the target protein. In embodiments, hybridization of AC to its target sequence inhibits expression of one or more wild-type target protein isoforms. In embodiments, hybridization of AC with its target sequence up-regulates expression of the target protein. In embodiments, hybridization of AC to its target sequence increases expression of one or more wild-type target protein isoforms.

The efficacy of the AC of the present disclosure can be assessed by evaluating the antisense activity affected by its administration. As used herein, the term "antisense activity" refers to any detectable and/or measurable activity attributable to hybridization of an antisense compound to its target nucleic acid. Such detection and/or measurement may be direct or indirect. In embodiments, antisense activity is assessed by detecting and/or measuring the amount of a target protein. In embodiments, antisense activity is assessed by detecting and/or measuring the amount of the re-spliced target protein. In embodiments, antisense activity is assessed by detecting and/or measuring the amount of target nucleic acid and/or cleaved target nucleic acid and/or alternatively spliced target nucleic acid.

Antisense compound design

The design of an AC according to the present disclosure will depend on the sequence targeted. Targeting AC to a specific target nucleic acid molecule can be a multi-step process. The process typically begins with the identification of a target nucleic acid whose expression is to be modulated. As used herein, the terms "target nucleic acid" and "nucleic acid encoding a target gene" encompass DNA encoding a selected target gene, RNA transcribed from such DNA (including pre-mRNA and mRNA), and eDNA derived from such RNA. For example, the target nucleic acid may be a nucleic acid molecule that expresses a cellular gene (or mRNA transcribed from the gene) associated with a particular disorder or disease condition, or an infectious agent.

Those skilled in the art will be able to design, synthesize and screen antisense compounds of different nucleobase sequences to identify sequences that produce antisense activity. For example, antisense compounds can be designed that alter splicing of target pre-mRNA or inhibit expression of target protein. Methods for designing, synthesizing, and screening antisense compounds for antisense activity against a preselected target nucleic acid can be found, for example, in "ANTISENSE DRUG TECHNOLOGY, principles, strategies, and Applications," CRC Press, boca Raton, florida, edited by Stanley T.Crooke, which is incorporated by reference in its entirety for any purpose.

In embodiments, antisense compounds comprise modified nucleosides, modified internucleoside linkages, and/or conjugate groups.

In embodiments, the antisense compound is a "tricyclo-DNA (tc-DNA)", which refers to a class of constrained DNA analogs in which each nucleotide is modified by the introduction of a cyclopropane loop to limit conformational flexibility of the backbone and optimize backbone geometry at torsion angle γ. tc-DNA containing the homobases adenine and thymine forms very stable A-T base pairs with complementary RNA.

Nucleoside

In embodiments, antisense compounds comprising linked nucleosides are provided. In embodiments, some or all of the nucleosides are modified nucleosides. In embodiments, one or more nucleosides comprise a modified nucleobase. In embodiments, one or more nucleosides comprise a modified sugar. Chemically modified nucleosides are typically used for incorporation into antisense compounds to enhance one or more properties, such as nuclease resistance, pharmacokinetics, or affinity for a target RNA.

In general, a nucleobase is any group containing one or more atoms or groups of atoms capable of hydrogen bonding with the base of another nucleic acid. In addition to "unmodified" or "natural" nucleobases such as the purine nucleobases adenine (a) and guanine (G) and the pyrimidine nucleobases thymine (T), cytosine (C) and uracil (U), many modified nucleobases or nucleobase mimics known to those skilled in the art are also suitable for use in the compounds described herein. The terms modified nucleobase and nucleobase mimetic may overlap, but typically modified nucleobases refer to nucleobases that are similar in structure to the parent nucleobase, such as, for example, 7-deazapurine, 5-methylcytosine, or G-clamp, whereas nucleobase mimetics will include more complex structures, such as, for example, tricyclic phenoxazine nucleobase mimetics. Methods for preparing the modified nucleobases described above are well known to those skilled in the art.

In embodiments, the AC provided herein comprises one or more nucleosides having a modified sugar moiety. In embodiments, the furanosyl sugar ring of a natural nucleoside can be modified in a variety of ways, including but not limited to adding substituents, bridging two non-geminal ring atoms to form a Bicyclic Nucleic Acid (BNA), and substituting the epoxy at the 4' -position with an atom or group such as-S-, -N (R) -C (R ₁)(R₂). Modified sugar moieties are well known and can be used to alter (typically increase) the affinity of antisense compounds for their targets and/or increase nuclease resistance. Representative lists of modified sugars include, but are not limited to, non-bicyclic substituted sugars, particularly non-bicyclic 2' -substituted sugars having a 2' -F, 2' -OCH ₃, or 2' -O (CH ₂)₂-OCH₃ substituent; and 4' -thio modified sugars, sugar moieties may also be substituted with sugar mimetic groups, and the like, e.g., furanose rings may be substituted with morpholino rings.

In embodiments, the nucleoside comprises a bicyclic modified sugar (BNA) including LNA (4 '- (CH ₂) -O-2' bridge), 2 '-thio-LNA (4' - (CH ₂) -S-2 'bridge), 2' -amino-LNA (4 '- (CH ₂) -NR-2' bridge), ENA (4 '- (CH ₂) 2-O-2' bridge), 4'- (CH ₂)₃ -2' bridged BNA, 4'- (CH ₂CH(CH₃)) -2' bridged BNA "cEt (4 '- (CH ₃) -O-2' bridge) and cMOE BNA (4 '- (CH ₂OCH₃) -O-2' bridge). Certain such BNA have been prepared and disclosed in the patent literature as well as in the scientific literature (see, e.g., srivastava et al J.Am.Chem.Soc.2007, ACS Advanced online publication,10.1021/ja071106y, A1baek et al J.org.chem., 71, 7731-7740, fluidier et al Chembiochem 2005,6, 1104-1109, singh et al chem.Commun., 1998,4, 455-456; koshkin et al Tetrahedron,1998, 54, 3607-3630; wahlstedt et al, proc.Natl.Acad.Sci.U.S.A.,2000, 97, 5633-5638; kumar et al, biorg. Med. Chem. Lett.,1998,8, 2219-2222; WO 94/14226; WO 2005/021570; singh et al, j.org.chem.,1998, 63, 10035-10039, wo 2007/090071; Examples of issued U.S. patents and published applications disclosing BNA include, for example, U.S. patent No. 7,053,207;6,268,490;6,770,748;6,794,499;7,034,133; and 6,525,191; U.S. pre-grant publication No. 2004-0171570;2004-0219565;2004-0014959;2003-0207841;2004-0143114; and 20030082807.

Also provided herein is a "locked nucleic acid" (LNA) in which the 2 '-hydroxyl group of the ribosyl sugar ring is attached to the 4' carbon atom of the sugar ring, thereby forming a 2'-C,4' -C-oxymethylene linkage to form a bicyclic sugar moiety (reviewed in Elayadi et al, curr. Opinion Invens. Drugs,2001,2, 558-561; braasch et al, chem. Biol.,2001, 81-7, and Orum et al, curr. Opinion mol. Ther.,2001,3, 239-243; see also U.S. Pat. Nos. 6,268,490 and 6,670,461). The linkage may be a methylene (-CH ₂ -) group bridging the 2 'oxygen atom and the 4' carbon atom, for which the term LNA is used for the bicyclic moiety; in the case of ethylene in said position, the term ENA ^TM (Singh et al, chem. Commun.,1998,4, 455-456; ENA ^TM: morita et al, bioorganic MEDICINAL CHEMISTRY,2003, 11, 2211-2226) is used. LNAs and other bicyclic sugar analogs exhibit very high duplex thermal stability (tm= +3 to +10 ℃) to complementary DNA and RNA, stability to 3' -exonucleolytic degradation, and good solubility. Effective and nontoxic antisense oligonucleotides containing LNA have been described (Wahlestedt et al, proc. Natl. Acad. Sci. U.S.A.,2000, 97, 5633-5638).

The LNA isomer that has also been investigated is α -L-LNA, which has been shown to have improved stability to 3' -exonucleases. alpha-L-LNA is incorporated into antisense spacers and chimeras that exhibit potent antisense activity (Frieden et al, nucleic ACIDS RESEARCH,2003, 21, 6365-6372).

The synthesis and preparation of the LNA monomers adenine, cytosine, guanine, 5-methyl-cytosine, thymine and uracil and their oligomerization and nucleic acid recognition properties have been described (Koshkin et al, tetrahedron,1998, 54, 3607-3630). LNA and its preparation are also described in WO 98/39352 and WO 99/14226.

Analogs of LNA, phosphorothioate-LNA and 2' -thio-LNA (Kumar et al, biorg. Med. Chem. Lett.,1998,8, 2219-2222) were also prepared. The preparation of locked nucleoside analogues containing oligodeoxyribonucleotide duplex as substrates for nucleic acid polymerase has also been described (Wengel et al, WO 99/14226). The synthesis of a novel conformationally constrained high affinity oligonucleotide analogue, 2' -amino-LNA, has been described in the art (Singh et al, j. Org. Chem.,1998, 63, 10035-10039). In addition, 2 '-amino-LNAs and 2' -methylamino-LNAs have been prepared and their thermal stability with duplex of complementary RNA and DNA strands has been previously reported.

Internucleoside linkage

Described herein are internucleoside linking groups that link together nucleosides or otherwise modified monomer units to form antisense compounds. Two main classes of internucleoside linkages are defined by the presence or absence of phosphorus atoms. Representative phosphorus-containing internucleoside linkages include, but are not limited to, phosphodiester, phosphotriester, methylphosphonate, phosphoramidate (including phosphorodiamidate), and phosphorothioate. Representative phosphorus-free internucleoside linkages include, but are not limited to, methyleneimino (-CH ₂-N(CH₃)-O-CH₂ -), thiodiester (-O-C (O) -S-), thiocarbamate (-O-C (O) (NH) -S-); siloxane (-O-Si (H) ₂ -O-); and N, N' -dimethylhydrazine (-CH ₂-N(CH₃)-N(CH₃) -). Antisense compounds having non-phosphorus internucleoside linkages are referred to as oligonucleotides. Modified internucleoside linkages can be used to alter (typically increase) nuclease resistance of antisense compounds compared to native phosphodiester linkages. Internucleoside linkages having chiral atoms can be prepared as racemic, chiral or as mixtures. Representative chiral internucleoside linkages include, but are not limited to, alkyl phosphonates and phosphorothioates. Methods for preparing phosphorus-containing and phosphorus-free linkages are well known to those skilled in the art.

In embodiments, the phosphate group may be attached to the 2', 3', or 5' hydroxyl moiety of the sugar. In forming the oligonucleotide, phosphate groups covalently link nucleosides adjacent to one another to form a linear polymeric compound. Within an oligonucleotide, phosphate groups are commonly referred to as forming the internucleoside backbone of the oligonucleotide. The normal linkage or backbone of RNA and DNA is a 3 'to 5' phosphodiester linkage.

Conjugation group

In embodiments, AC is modified by covalent attachment of one or more conjugate groups. Typically, the conjugate group modifies one or more properties of the attached AC, including, but not limited to, pharmacodynamics, pharmacokinetics, binding, absorption, cellular distribution, cellular uptake, charge and clearance. The conjugate groups are conventionally used in the chemical arts and are attached to the parent compound, such as AC, either directly or via an optional linking moiety or linking group. Conjugation groups include, but are not limited to, intercalators, reporter molecules, polyamines, polyamides, polyethylene glycols, thioethers, polyethers, cholesterol, thiocholesterols, cholic acid moieties, folic acid, lipids, phospholipids, biotin, phenazine, phenanthridine, anthraquinone, adamantane, acridine, fluorescein, rhodamine, coumarin, and dyes. In embodiments, the conjugation group is polyethylene glycol (PEG), and the PEG is conjugated to AC or a cyclic peptide.

The conjugate group includes a lipid moiety such as a cholesterol moiety (Letsinger et al, proc. Natl. Acad. Sci. USA,1989, 86, 6553); cholic acid (Manoharan et al, biorg. Med. Chem. Lett.,1994,4, 1053); thioethers, for example hexyl-S-tritylthiol (Manoharan et al, ann.N. Y. Acad. Sci.,1992, 660, 306; manoharan et al, biorg. Med. Chem. Let.,1993,3, 2765); sulphur cholesterol (Oberhauser et al, nucleic acids res.,1992, 20, 533); aliphatic chains such as dodecanediol or undecyl residues (Saison-Behmoaras et al, EMBO J.,1991, 10, 111; kabanov et al, FEBS Lett.,1990, 259, 327; svinarchuk et al, biochimie,1993, 75, 49); phospholipids, such as di-hexadecyl racemic glycerol or triethylammonium-1, 2-di-O-hexadecyl racemic glycerol-3-H-phosphonate (Manoharan et al, tetrahedron lett.,1995, 36, 3651; shea et al, nucleic acids res.,1990, 18, 3777); polyamine or polyethylene glycol chains (Manoharan et al, nucleic & nucleic acids, 1995, 14, 969); adamantaneacetic acid (Manoharan et al, tetrahedron lett.,1995, 36, 3651); palm-based moieties (Mishra et al, biochim. Biophys. Acta,1995, 1264, 229); or octadecylamine or hexylamino-carbonyl-oxy cholesterol moiety (Crooke et al, j. Pharmacol. Exp. Ter., 1996, 277, 923).

Linking groups or difunctional linking moieties such as those known in the art may be included in the compounds provided herein. The linking groups can be used to attach chemical functional groups, conjugation groups, reporter groups, and other groups to selective sites of parent compounds (such as AC, for example). In embodiments, the difunctional linking moiety comprises a hydrocarbyl moiety having two functional groups. In embodiments, one of the functional groups is selected to bind to a parent molecule or compound of interest, and the other is selected to bind to substantially any selected group, such as a chemical functional group or a conjugate group. Any of the linkers described herein may be used. In embodiments, the linker comprises a chain structure or oligomer of repeating units such as ethylene glycol or amino acid units. Examples of functional groups used in the difunctional linking moiety include, but are not limited to, electrophiles for reacting with nucleophilic groups and nucleophiles for reacting with electrophilic groups. In embodiments, the difunctional linking moiety may include amino groups, hydroxyl groups, carboxylic acids, thiols, unsaturation (e.g., double or triple bonds), and the like. Some non-limiting examples of difunctional linking moieties include 8-amino-3, 6-dioxaoctanoic Acid (ADO), succinimidyl 4- (N-maleimidomethyl) cyclohexane-1-carboxylate (SMCC), and 6-aminocaproic acid (AHEX or AHA). Other linking groups include, but are not limited to, substituted C ₁-C₁₀ alkyl, substituted or unsubstituted C ₂-C₁₀ alkenyl, or substituted or unsubstituted C ₂-C₁₀ alkynyl, wherein a non-limiting list of substituent groups includes hydroxy, amino, alkoxy, carboxyl, benzyl, phenyl, nitro, thiol, thioalkoxy, halogen, alkyl, aryl, alkenyl, and alkynyl.

In embodiments, AC may be linked to a 10 arginine-serine dipeptide repeat sequence. AC linked to a 10 arginine-serine dipeptide repeat for artificial recruitment of splice enhancers has been used in vitro to induce BRCA1 and SMN2 exons containing mutations that would otherwise be skipped. See Cartegni and Krainer 2003, incorporated herein by reference.

In embodiments, the length of AC may be 5 to 50 nucleotides (e.g., ,5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40、41、42、43、44、45、46、47、48、49 or 50, including all values and ranges therein). In embodiments, the length of AC may be 5-10 nucleotides. In embodiments, the length of AC may be 10-15 nucleotides. In embodiments, the length of AC may be 15-20 nucleotides. In embodiments, the length of AC may be 20-25 nucleotides. In embodiments, the length of AC may be 25-30 nucleotides. In embodiments, the length of AC may be 30-35 nucleotides. In embodiments, the length of AC may be 35-40 nucleotides. In embodiments, the length of AC may be 40-45 nucleotides. In embodiments, the length of AC may be 45-50 nucleotides.

In embodiments, AC hybridizes to a nucleic acid sequence of the human DMD gene encoding dystrophin protein. In an embodiment, AC binds exon 45 of DMD. In embodiments, the AC bound to exon 45 of DMD is about 18 to about 30 nucleic acids in length, e.g., about 18, about 19, about 20, about 21, about 22, about 23, about 24, about 25, about 26, about 27, about 28, about 29, or about 30 nucleic acids in length.

In embodiments, the antisense compound hybridizes to a nucleic acid sequence within an intron of DMD exon 45. In embodiments, the antisense compound hybridizes to a nucleic acid sequence within DMD exon 45. In embodiments, the antisense compound hybridizes to a nucleic acid sequence spanning the intron-exon or exon-intron junction of DMD exon 45.

The antisense nomenclature system presented by Mann et al ,(2002)"Improved antisense oligonucleotide induced exon skipping in the mdx mouse model of muscular dystrophy,"J Gen Med.4：644-654 may be used to describe the target region of a gene sequence to which an antisense compound may hybridize. According to this nomenclature system, a negative sign ("-") indicates an intron sequence, and a positive sign ("+") indicates an exon sequence. The letter "A" indicates that the antisense compound binds to the acceptor splice site at the beginning of the exon, while the letter "D" indicates that the antisense compound binds to the donor splice site at the end of the exon. For example, A (-5+15) represents an antisense oligonucleotide that hybridizes to the last 5 bases of an intron preceding the target exon (e.g., exon 45) and the first 15 bases of the target exon. Similarly, D (+15-5) represents an antisense oligonucleotide that hybridizes to the last 5 exon bases of the target exon (e.g., exon 45) and the first 15 intron bases after the target exon, the positions corresponding to annealing sites of the antisense molecule. An antisense oligonucleotide that hybridizes to a nucleic acid sequence entirely within an exon can be represented by a (+5+25), e.g., the antisense oligonucleotide hybridizes to a nucleic acid sequence beginning at nucleotide 5 at the start of an exon and hybridizes to a nucleic acid sequence beginning at nucleotide 25 at the start of the same exon. Unless otherwise indicated, the absence of a "+" or "-" symbol generally means that the antisense oligonucleotide binds to a nucleic acid sequence within an exon of a target nucleic acid. Lower case nucleotides are used to indicate an intron sequence, while upper case nucleotides are used to indicate an exon sequence.

In an embodiment, the nucleic acid sequence of exon 45 of DMD is as set forth in SEQ ID NO:1 (from 5 'to 3', including flanking upstream (5 ') and downstream (3') introns):

The upstream (5 ') intron sequences (residues-50 to-1) and the downstream (3') intron sequences (residues-1 to-44) are shown in lowercase and italics; the exon sequences (residues +1 to +176) are shown in uppercase bold and underlined. In an embodiment, the nucleic acid sequence of the human Duchenne Muscular Dystrophy (DMD) gene of dystrophin exon 45 comprises 176 nucleotides.

In embodiments, the AC that binds to exon 45 of DMD is selected from any one of the nucleic acid sequences set forth in tables 6A-6P, tables 7A-7O, or tables 8A-8C, the reverse complement thereof, or a sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% nucleic acid sequence identity thereto.

In embodiments, AC comprises a nucleotide sequence that hybridizes to SEQ ID NO:1, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 consecutive nucleotides (e.g., AC is 15-mer、16-mer、17-mer、18-mer、19-mer、20-mer、21-mer、22-mer、23-mer、24-mer、25-mer、26-mer、27-mer、28-mer、29-mer or 30-mer), wherein the first nucleotide of AC is set forth in SEQ ID NO:1 to the exon 45 of DMD at position +1、+2、+3、+4、+5、+6、+7、+8、+9、+10、+11、+12、+13、+14、+15、+16、+17、+18、+19、+20、+21、+22、+23、+24、+25、+26、+27、+28、+29、+30、+31、+32、+33、+34、+35、+36、+37、+38、+39、+40、+41、+42、+43、+44、+45、+46、+47、+48、+49、+50、+51、+52、+53、+54、+55、+56、+57、+58、+59、+60、+61、+62、+63、+64、+65、+66、+67、+68、+69、+70、+71、+72、+73、+74、+75、+76、+77、+78、+79、+80、+81、+82、+83、+84、+85、+86、+87、+88、+89、+90、+91、+92、+93、+94、+95、+96、+97、+98、+99、+100、+101、+102、+103、+104、+105、+106、+107、+108、+109、+110、+111、+112、+113、+114、+115、+116、+117、+118、+119、+120、+121、+122、+123、+124、+125、+126、+127、+128、+129、+130、+131、+132、+133、+134、+135、+136、+137、+138、+139、+140、+141、+142、+143、+144、+145、+146、+147、+148、+149、+150、+151、+152、+153、+154、+155、+156、+157、+158、+159、+160、+161、+162、+163、w+164、+165、+1616、+167、+168、+169、+170、+171、+172、+173、+174、+175 or +176. In embodiments, the AC comprises nucleotides complementary to consecutive nucleotides of the 3 'intron sequence following exon 45 (3' intron sequence not shown). As used herein, "first nucleotide" refers to the 5' nucleotide of AC.

In embodiments, AC binds to a sequence of DMD exon 45 selected from the nucleic acid sequences consisting of the sequences set forth in tables 6A-6P, tables 7A-7O, or tables 8A-8C. In embodiments, the AC of exon 45 that binds to DMD is selected from any one of the nucleic acid sequences within tables 6A-6P, tables 7A-7O, and tables 8A-8C, or the reverse complement thereof. In embodiments, the AC that binds to exon 45 of DMD is selected from any one of the nucleic acid sequences shown in tables 6A-6P, tables 7A-7O, and tables 8A-8C, the reverse complement thereof, or a sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% nucleic acid sequence identity thereto. In embodiments, the AC that binds to exon 45 of DMD comprises one or more modified nucleic acids, one or more modified nucleotide linkages, or a combination thereof. In embodiments, the AC that binds to exon 45 comprises one or more morpholine rings, one or more phosphorodiamidate linkages, or a combination thereof. In embodiments, the AC that binds to exon 45 of DMD is an antisense Phosphorodiamidate Morpholino Oligomer (PMO) having a sequence selected from any one of the nucleic acid sequences within tables 6A-6P, its reverse complement, or a sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% nucleic acid sequence identity thereto.

TABLE 6A 15-mer AC binding to exon 45 of DMD

TABLE 6B 16-merAC binding to exon 45 of DMD

TABLE 6C 17-merAC binding to exon 45 of DMD

TABLE 6D 18-mer AC binding to exon 45 of DMD

TABLE 6E 19-mer AC binding to exon 45 of DMD

TABLE 6F 20-mer AC binding to exon 45 of DMD

Nucleic acid sequence (5 '-3')
	ACTCCAGGATGGCATTGGGC
GATGGCATTGGGCAGCGGCA
	GATGGCATTGGGCAGCGGCA
CAGATGCCAGTATTCTACAG
	CAAAAACTCCTAACGACTTA
AGGAAAAATTGGGAAGCCTG
	ACATCTTATGACCGTAGACA
TCTGCGGTGGCAGGAGGTCT
	TAAGTCCGAAGGGTTAAAAA
GCGGTGGCAGGAGGTCTGCA
	GTCTAAGTCCGAAGGGTTAA
GTGGCAGGAGGTCTGCAAAC
	GGCGTCTAAGTCCGAAGGGT
CAGACTGTCGACAAACGTCT
	AGAAAAAAGACAGACTGTCG
AAAACTCCTAACGACTTAAT
	AATGCAACTGGGGAAGAAAT
ATGACCGTAGACAAAAACTC
	AAGAAATAATTCAGCAATCC
TAAAAAGGACATCTTATGAC
	AATCCTCAAAAACAGATGCC
GGGTTAAAAAGGACATCTTA
	CTCAAAAACAGATGCCAGTA
AGGGTTAAAAAGGACATCTT
	TCAAAAACAGATGCCAGTAT
CGTCTGGAGGACGGTGGCGT
	AAATTGGGAAGCCTGAATCT
GAAAAAAGACAGACTGTCGA
	TGGCAGGAGGTCTGCAAACA
AGAAAAAAGACAGACTGTCG
	GGCAGGAGGTCTGCAAACAG
AGGAGGTCTGCAAACAGCTG
	GAGGTCTGCAAACAGCTGTC
AGGTCTGCAAACAGCTGTCA
	TCTGCAAACAGCTGTCAGAC
ACAGCTGTCAGACAGAAAAA

TABLE 6G 21-mer AC binding to exon 45 of DMD

Nucleic acid sequence (5 '-3')
	ACTCCAGGATGGCATTGGGCA
GATGGCATTGGGCAGCGGCAA
	GATGGCATTGGGCAGCGGCAA
CAGATGCCAGTATTCTACAGG
	CAAAAACTCCTAACGACTTAA
AGGAAAAATTGGGAAGCCTGA
	ACATCTTATGACCGTAGACAA
TCTGCGGTGGCAGGAGGTCTG
	TAAGTCCGAAGGGTTAAAAAG
GCGGTGGCAGGAGGTCTGCAA
	GTCTAAGTCCGAAGGGTTAAA
GTGGCAGGAGGTCTGCAAACA
	GGCGTCTAAGTCCGAAGGGTT
CAGACTGTCGACAAACGTCTG
	AGAAAAAAGACAGACTGTCGA
AAAAACTCCTAACGACTTAAT
	GAATGCAACTGGGGAAGAAAT
TATGACCGTAGACAAAAACTC
	GAAGAAATAATTCAGCAATCC
TTAAAAAGGACATCTTATGAC
	CAATCCTCAAAAACAGATGCC
AGGGTTAAAAAGGACATCTTA
	CCTCAAAAACAGATGCCAGTA
AAGGGTTAAAAAGGACATCTT
	CTCAAAAACAGATGCCAGTAT
ACGTCTGGAGGACGGTGGCGT
	AAAATTGGGAAGCCTGAATCT
AGAAAAAAGACAGACTGTCGA
	GTGGCAGGAGGTCTGCAAACA
TGGCAGGAGGTCTGCAAACAG
	CAGGAGGTCTGCAAACAGCTG
GGAGGTCTGCAAACAGCTGTC
	GAGGTCTGCAAACAGCTGTCA
GTCTGCAAACAGCTGTCAGAC
	AACAGCTGTCAGACAGAAAAA

TABLE 6H 22-mer AC binding to exon 45 of DMD

TABLE 6I 23 mer AC binding to exon 45 of DMD

TABLE 6J 24-mer AC binding to exon 45 of DMD

TABLE 6K 25-mer AC binding to exon 45 of DMD

TABLE 6L 26-mer AC binding to exon 45 of DMD

Nucleic acid sequence (5 '-3')
	ACTCCAGGATGGCATTGGGCAGCGGC
GATGGCATTGGGCAGCGGCAAACTGT
	GATGGCATTGGGCAGCGGCAAACTGT
CAGATGCCAGTATTCTACAGGAAAAA
	CAAAAACTCCTAACGACTTAATAAAG
AGGAAAAATTGGGAAGCCTGAATCTG
	ACATCTTATGACCGTAGACAAAAACT
TCTGCGGTGGCAGGAGGTCTGCAAAC
	TAAGTCCGAAGGGTTAAAAAGGACAT
GCGGTGGCAGGAGGTCTGCAAACAGC
	GTCTAAGTCCGAAGGGTTAAAAAGGA
GTGGCAGGAGGTCTGCAAACAGCTGT
	GGCGTCTAAGTCCGAAGGGTTAAAAA
CAGACTGTCGACAAACGTCTGGAGGA
	AGAAAAAAGACAGACTGTCGACAAAC
TAGACAAAAACTCCTAACGACTTAAT
	ACATTGAATGCAACTGGGGAAGAAAT
CATCTTATGACCGTAGACAAAAACTC
	CTGGGGAAGAAATAATTCAGCAATCC
AAGGGTTAAAAAGGACATCTTATGAC
	TTCAGCAATCCTCAAAAACAGATGCC
TCCGAAGGGTTAAAAAGGACATCTTA
	GCAATCCTCAAAAACAGATGCCAGTA
GTCCGAAGGGTTAAAAAGGACATCTT
	CAATCCTCAAAAACAGATGCCAGTAT
GACAAACGTCTGGAGGACGGTGGCGT
	CAGGAAAAATTGGGAAGCCTGAATCT
CTGCGGTGGCAGGAGGTCTGCAAACA
	TGCGGTGGCAGGAGGTCTGCAAACAG
GGTGGCAGGAGGTCTGCAAACAGCTG
	TGGCAGGAGGTCTGCAAACAGCTGTC
GGCAGGAGGTCTGCAAACAGCTGTCA
	AGGAGGTCTGCAAACAGCTGTCAGAC
CTGCAAACAGCTGTCAGACAGAAAAA

TABLE 6M 27 mer AC binding to exon 45 of DMD

TABLE 6N 28-mer AC binding to exon 45 of DMD

TABLE 6O 29 mer AC binding to exon 45 of DMD

TABLE 6P 30-mer AC binding to exon 45 of DMD

In embodiments, AC hybridizes to a nucleic acid sequence spanning the intron-exon or exon-intron junction of DMD exon 45. In embodiments, AC is complementary to a target nucleic acid sequence comprising at least 1 nucleotide of an upstream (5') intron preceding exon 45 (i.e., beginning at position-1). In embodiments, AC is complementary to a target nucleic acid sequence comprising at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, and up to 20 consecutive nucleotides (i.e., beginning at positions-20, -19, -18, -17, -16, -15, -14, -13, -12, -11, -10, -9, -8, -7, -6, -5, -4, -3, -2, or-1) of an upstream (5') intron preceding exon 45. In embodiments, AC is complementary to a target nucleic acid sequence comprising at least the first nucleotide (i.e., position +1) of the 5' end of exon 45. In embodiments, AC is complementary to a target nucleic acid sequence comprising at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21, at least 22, at least 23, at least 24, and at most 25 consecutive nucleotides, beginning at the first nucleotide at the 5' end of exon 45 (i.e., beginning at position +1).

In embodiments, AC and the amino acid sequence spanning SEQ ID NO: 1-20 to +25, and the intron-exon junction of exon 45 of DMD. The intron-exon junction comprising DMD exon 45 at positions-20 to +25 consists of SEQ ID NO:2 represents:

SEQ ID NO:2 (residues-20 to-1) and the sequence of the intron upstream (5') of SEQ ID NO:2 are shown in bold and underlined (residues +1 to +25).

In embodiments, AC comprises a nucleotide sequence that hybridizes to SEQ ID NO:2, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 consecutive nucleotides (e.g., AC is 15-mer、16-mer、17-mer、18-mer、19-mer、20-mer、21-mer、22-mer、23-mer、24-mer、25-mer、26-mer、27-mer、28-mer、29-mer or 30-mer), wherein the first nucleotide of AC is at SEQ ID NO:2 to the nucleotide of exon 45 of DMD or the 5' flanking intron of exon 45 at position -20、-19、-18、-17、-16、-15、-14、-13、-12、-11、-10、-9、-8、-7、-6、-5、-4、-3、-2、-1、+1、+2、+3、+4、+5、+6、+7、+8、+9、+10、+11、+12、+13、+14、+15、+16、+17、+18、+19、+20、+21、+22、+23、+24 or +25.

In an embodiment, the AC that binds to exon 45 of DMD is selected from any one of the nucleic acid sequences shown in tables 7A-7O. In embodiments, the AC that binds to exon 45 of DMD comprises one or more modified nucleic acids, one or more modified nucleotide linkages, or a combination thereof. In embodiments, the AC that binds to exon 45 comprises one or more morpholine rings, one or more phosphorodiamidate linkages, or a combination thereof. In embodiments, the AC that binds to exon 45 of DMD is an antisense Phosphorodiamidate Morpholino Oligomer (PMO) having a sequence selected from any one of the nucleic acid sequences within tables 7A-7O, its reverse complement, or a sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% nucleic acid sequence identity thereto.

TABLE 7A 20-mer to 30-mer AC binding to the intron-exon junction of exon 45 starting at position-15

TABLE 7 20-mer to 30-mer AC binding to the intron-exon junction of exon 45 starting at position-14

TABLE 7 20-mer to 30-mer AC binding to the intron-exon junction of exon 45 starting at position-13

TABLE 7D 20-mer to 30-mer AC binding to the intron-exon junction of exon 45 starting at position-12

TABLE 7E 20-mer to 30-mer AC binding to the intron-exon junction of exon 45 starting at position-11

25-Mer to 30-mer, starting from-11:
	G TATCT TACAG GAACT CCAGG ATGGC ATTG
G TATCT TACAG GAACT CCAGG ATGGC ATT
	G TATCT TACAG GAACT CCAGG ATGGC AT
G TATCT TACAG GAACT CCAGG ATGGC A
	G TATCT TACAG GAACT CCAGG ATGGC
G TATCT TACAG GAACT CCAGG ATGG
	G TATCT TACAG GAACT CCAGG ATG
G TATCT TACAG GAACT CCAGG AT
	G TATCT TACAG GAACT CCAGG A
G TATCT TACAG GAACT CCAGG
	G TATCT TACAG GAACT CCAG

TABLE 7F 20-mer to 30-mer AC binding to the intron-exon junction of exon 45 starting at position-10

20-Mer to 30-mer, starting from-10:
	TATCT TACAG GAACT CCAGG ATGGC ATTGG
TATCT TACAG GAACT CCAGG ATGGC ATTG
	TATCT TACAG GAACT CCAGG ATGGC ATT
TATCT TACAG GAACT CCAGG ATGGC AT
	TATCT TACAG GAACT CCAGG ATGGC A
TATCT TACAG GAACT CCAGG ATGGC
	TATCT TACAG GAACT CCAGG ATGG
TATCT TACAG GAACT CCAGG ATG
	TATCT TACAG GAACT CCAGG AT
TATCT TACAG GAACT CCAGG A
	TATCT TACAG GAACT CCAGG

TABLE 7G 20-mer to 30-mer AC binding to the intron-exon junction of exon 45 starting at position-9

20-Mer to 30-mer, starting from-9:
	ATCT TACAG GAACT CCAGG ATGGC ATTGG G
ATCT TACAG GAACT CCAGG ATGGC ATTGG
	ATCT TACAG GAACT CCAGG ATGGC ATTG
ATCT TACAG GAACT CCAGG ATGGC ATT
	ATCT TACAG GAACT CCAGG ATGGC AT
ATCT TACAG GAACT CCAGG ATGGC A
	ATCT TACAG GAACT CCAGG ATGGC
ATCT TACAG GAACT CCAGG ATGG
	ATCT TACAG GAACT CCAGG ATG
ATCT TACAG GAACT CCAGG AT
	ATCT TACAG GAACT CCAGG A

TABLE 7H 20-mer to 30-mer AC binding to the intron-exon junction of exon 45 starting at position-8

20-Mer to 30-mer, starting from-8:
	TCT TACAG GAACT CCAGG ATGGC ATTGG GC
TCT TACAG GAACT CCAGG ATGGC ATTGG G
	TCT TACAG GAACT CCAGG ATGGC ATTGG
TCT TACAG GAACT CCAGG ATGGC ATTG
	TCT TACAG GAACT CCAGG ATGGC ATT
TCT TACAG GAACT CCAGG ATGGC AT
	TCT TACAG GAACT CCAGG ATGGC A
TCT TACAG GAACT CCAGG ATGGC
	TCT TACAG GAACT CCAGG ATGG
TCT TACAG GAACT CCAGG ATG
	TCT TACAG GAACT CCAGG AT

TABLE 7I 20-mer to 30-mer AC binding to the intron-exon junction of exon 45 starting at position-7

TABLE 7J 20-mer to 30-mer AC binding to the intron-exon junction of exon 45 starting at position-6

20-Mer to 30-mer, starting from-6:
	T TACAG GAACT CCAGG ATGGC ATTGG GCAG
T TACAG GAACT CCAGG ATGGC ATTGG GCA
	T TACAG GAACT CCAGG ATGGC ATTGG GC
T TACAG GAACT CCAGG ATGGC ATTGG G
	T TACAG GAACT CCAGG ATGGC ATTGG
T TACAG GAACT CCAGG ATGGC ATTG
	T TACAG GAACT CCAGG ATGGC ATT
T TACAG GAACT CCAGG ATGGC AT
	T TACAG GAACT CCAGG ATGGC A
T TACAG GAACT CCAGG ATGGC
	T TACAG GAACT CCAGG ATGG

TABLE 7K 20-mer to 30-mer AC binding to the intron-exon junction of exon 45 starting at position-5

20-Mer to 30-mer, starting from-5:
	TACAG GAACT CCAGG ATGGC ATTGG GCAGC
TACAG GAACT CCAGG ATGGC ATTGG GCAG
	TACAG GAACT CCAGG ATGGC ATTGG GCA
TACAG GAACT CCAGG ATGGC ATTGG GC
	TACAG GAACT CCAGG ATGGC ATTGG G
TACAG GAACT CCAGG ATGGC ATTGG
	TACAG GAACT CCAGG ATGGC ATTG
TACAG GAACT CCAGG ATGGC ATT
	TACAG GAACT CCAGG ATGGC AT
TACAG GAACT CCAGG ATGGC A
	TACAG GAACT CCAGG ATGGC

TABLE 7L 20-mer to 30-mer AC binding to the intron-exon junction of exon 45 starting at position-4

TABLE 7M 20-mer to 30-mer AC binding to the intron-exon junction of exon 45 starting at position-3

20-Mer to 28-mer, starting from-3:
	CAG GAACT CCAGG ATGGC ATTGG GCAGC
CAG GAACT CCAGG ATGGC ATTGG GCAG
	CAG GAACT CCAGG ATGGC ATTGG GCA
CAG GAACT CCAGG ATGGC ATTGG GC
	CAG GAACT CCAGG ATGGC ATTGG G
CAG GAACT CCAGG ATGGC ATTGG
	CAG GAACT CCAGG ATGGC ATTG
CAG GAACT CCAGG ATGGC ATT
	CAG GAACT CCAGG ATGGC ATT
CAG GAACT CCAGG ATGGC AT

TABLE 7N 20-mer to 30-mer AC binding to the intron-exon junction of exon 45 starting at position-2

20-Mer to 27-mer, starting from-2:
	AG GAACT CCAGG ATGGC ATTGG GCAGC
AG GAACT CCAGG ATGGC ATTGG GCAG
	AG GAACT CCAGG ATGGC ATTGG GCA
AG GAACT CCAGG ATGGC ATTGG GC
	AG GAACT CCAGG ATGGC ATTGG G
AG GAACT CCAGG ATGGC ATTGG
	AG GAACT CCAGG ATGGC ATTG
AG GAACT CCAGG ATGGC ATT

TABLE 7O 20-mer to 30-mer AC binding to the intron-exon junction of exon 45 starting at position-1

20-Mer to 26-mer, starting from-1:
	G GAACT CCAGG ATGGC ATTGG GCAGC
G GAACT CCAGG ATGGC ATTGG GCAG
	G GAACT CCAGG ATGGC ATTGG GCA
G GAACT CCAGG ATGGC ATTGG GC
	G GAACT CCAGG ATGGC ATTGG G
G GAACT CCAGG ATGGC ATTGG
	G GAACT CCAGG ATGGC ATTG

In an embodiment, the AC that binds to exon 45 of DMD is selected from any one of the nucleic acid sequences shown in tables 8A-8C. In embodiments, the AC that binds to exon 45 of DMD comprises one or more modified nucleic acids, one or more modified nucleotide linkages, or a combination thereof. In embodiments, the AC that binds to exon 45 comprises one or more morpholine rings, one or more phosphorodiamidate linkages, or a combination thereof. In embodiments, the AC that binds to exon 45 of DMD is an antisense Phosphorodiamidate Morpholino Oligomer (PMO) having a sequence selected from any one of the nucleic acid sequences within tables 8A-8C, its reverse complement, or a sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% nucleic acid sequence identity thereto.

TABLE 8A additional ACs with exon 45 skipping

TABLE 8B additional ACs with exon 45 skipping

Sequence(s)
	5’-AATGCCATCCTGGAGTTCCTG-3’
5’-ATGCCATCCTGGAGTTCCTGT-3’
	5’-CCCAATGCCATCCTGGAGTTCC-3’
5’-ATGCCATCCTGGAGTTCCTGTA-3’
	5’-GCCCAATGCCATCCTGGAGTTCC-3’
5’-CCCAATGCCATCCTGGAGTTCCT-3’
	5’-CCCAATGCCATCCTGGAGTTCCTG-3’
5’-TGCCCAATGCCATCCTGGAGTTCCT-3’
	5’-CCCAATGCCATCCTGGAGTTCCTGT-3’
5’-CAATGCCATCCTGGAGTTCCTGT-3’

TABLE 8C additional ACs with exon 45 skipping

In embodiments, any of the ACs in tables 6A-6P or tables 7A-7O or tables 8A-8C, the reverse complement thereof, or a sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% nucleic acid sequence identity thereto, comprises at least one modified nucleotide or nucleic acid selected from the group consisting of: phosphorothioate (PS) nucleotides, phosphorodiamidate Morpholino (PMO) nucleotides, locked Nucleic Acids (LNAs), peptide Nucleic Acids (PNAs), nucleotides comprising a 2' -O-methyl (2 ' -OMe) modified backbone, 2' O-methoxy-ethyl (2 ' -MOE) nucleotides, 2',4' constrained ethyl (cEt) nucleotides and 2' -deoxy-2 ' -fluoro- β -D-arabinonucleic acid (2 ' f-ANA). In embodiments, hybridization of AC to the target sequence reduces or prevents splicing of exon 45. In embodiments, the AC comprises at least one Phosphorodiamidate Morpholino (PMO) nucleotide. In embodiments, each nucleotide in AC is a Phosphorodiamidate Morpholino (PMO) nucleotide.

In embodiments, the compounds have the following structure:

Wherein:

CPP is a cell penetrating peptide;

L is a linker;

b are each independently nucleobases complementary to bases in the target sequence; and

N is an integer from 1 to 50.

In embodiments, the sum of B and n corresponds to a sequence shown in tables 6A-6P or tables 7A-7O or tables 8A-8C, a reverse complement thereof, or a sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% nucleic acid sequence identity thereto.

Cyclic cell penetrating peptide conjugated to AC (cCPP)

The cyclic cell penetrating peptide (cCPP) may be conjugated to AC.

AC may be conjugated to cCPP via a linker. The AC may contain a therapeutic moiety. The therapeutic moiety may comprise an oligonucleotide, a peptide or a small molecule. The oligonucleotide may comprise an antisense oligonucleotide. AC may be conjugated to a linker at the terminal carbonyl group to provide the following structure:

Wherein:

EP is a cyclic exopeptide and M, AA _SC, AC, x ', y and z' are as defined above, are attachment points to AA _SC. x' may be 1.y may be 4.z' may be 11.- (OCH ₂CH-₂)_x '-and/or- (OCH ₂CH-₂)_z' -may independently be replaced by one or more amino acids including, for example, glycine, beta-alanine, 4-aminobutyric acid, 5-aminopentanoic acid, 6-aminocaproic acid, or combinations thereof.

An Endosomal Escape Vector (EEV) may comprise a cyclic cell penetrating peptide (cCPP), an Exocyclic Peptide (EP), and a linker, and may be conjugated with AC to form an EEV-conjugate comprising a structure of formula (C):

Or a protonated form thereof,

Wherein:

r ₄ and R ₆ are independently H or an amino acid side chain;

EP is a cyclic exopeptide as defined herein;

AC is as defined herein;

Each m is independently an integer from 0 to 3;

n is an integer from 0 to 2;

x' is an integer from 2 to 20;

y is an integer from 1 to 5;

q is an integer from 1 to 4; and

Z' is an integer from 2 to 20.

R ₁、R₂、R₃、R₄, EP, AC, m, n, x ', y, q and z' are as defined herein.

EEV can be conjugated to AC, and EEV-conjugates can comprise a structure of formula (C-a) or (C-b):

or a protonated form thereof, wherein EP, m and z are as defined above in formula (C).

EEV can be conjugated to AC, and EEV-conjugates can comprise a structure of formula (C-C):

or a protonated form thereof, wherein EP, R ¹、R²、R³、R⁴ and m are as defined above in formula (III); AA may be an amino acid as defined herein; n may be an integer from 0 to 2; x may be an integer from 1 to 10; y may be an integer from 1 to 5; and z may be an integer from 1 to 10.

EEV may be conjugated to AC, and EEV-oligonucleotide conjugates may comprise structures of formula (C-1), (C-2), (C-3), or (C-4):

in the above formula, EP is a cyclic exopeptide and AC may have a sequence of 15-30 nucleic acids that is complementary to a target sequence comprising at least a portion of exon 44 of the DMD gene in the pre-mRNA sequence. In embodiments, the AC may be selected from the oligonucleotides shown in tables 6A-6P, tables 7A-7O, and tables 8A-8C, reverse complement thereof, or a sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% nucleic acid sequence identity thereto.

In embodiments, the compounds described herein form multimers. In embodiments, multimerization occurs via non-covalent interactions, such as by hydrophobic interactions, ionic interactions, hydrogen bonding, or dipole-dipole interactions. In embodiments, the compounds form dimers, trimers, tetramers, pentamers, hexamers, heptamers, octamers, or nonamers. In embodiments, a compound comprises 1,2, 3, 4, 5, 6, 7, 8, 9, or 10 cyclic peptides. In embodiments, the compound comprises 1,2, 3, 4, 5, 6, 7, 8, 9, or 10 ACs. In embodiments, the compound comprises 1,2, 3, 4, 5, 6, 7, 8, 9, or 10 EPs. In embodiments, the compound comprises 1 to 10 cyclic peptides and 1 to 10 ACs. In embodiments, the compound comprises 1 to 10 cyclic peptides, 1 to 10 ACs, or 1 to 10 EPs.

In embodiments, the compounds of the present disclosure comprise any one of the following structures. The following compounds are merely illustrative, and any of the cyclic peptides, linkers, and AC in any of the following structures may be replaced with any of the cyclic peptides, linkers, or AC described herein.

Cytoplasmic delivery efficiency

Modification of the cyclic cell penetrating peptide (cCPP) may increase cytoplasmic delivery efficiency. By comparing the cytoplasmic delivery efficiency of cCPP with the modified sequence to a control sequence, improved cytoplasmic uptake efficiency can be measured. The control sequence does not include specific replacement amino acid residues in the modified sequence (including, but not limited to, arginine, phenylalanine, and/or glycine), but is otherwise identical.

As used herein, cytoplasmic delivery efficiency refers to the ability of cCPP to cross the cell membrane and enter the cytoplasm of the cell. cCPP are not necessarily dependent on the receptor or cell type. Cytoplasmic delivery efficiency may refer to absolute cytoplasmic delivery efficiency or relative cytoplasmic delivery efficiency.

Absolute cytosolic delivery efficiency is the ratio of the cytosolic concentration of cCPP (or cCPP-AC conjugate) to the concentration of cCPP (or cCPP-AC conjugate) in the growth medium. Relative cytoplasmic delivery efficiency refers to the concentration of cCPP in the cytoplasm as compared to the concentration of control cCPP in the cytoplasm. Quantification may be achieved by fluorescent labeling cCPP (e.g., with FITC dye) and measuring the fluorescence intensity using techniques well known in the art.

The relative cytoplasmic delivery efficiency is determined by comparing the amount of the invention cCPP that is internalized by a cell type (e.g., a HeLa cell) to the amount of the control cCPP that is internalized by the same cell type. To measure relative cytoplasmic delivery efficiency, the cell type can be incubated in the presence of cCPP for a specified period of time (e.g., 30 minutes, 1 hour, 2 hours, etc.), after which the amount of cCPP internalized by the cell can be quantified using methods known in the art, such as fluorescence microscopy. Separately, the same concentration of control cCPP was incubated in the presence of the cell type for the same period of time and the amount of control cCPP internalized by the cells was quantified.

Relative cytoplasmic delivery efficiency can be determined by measuring cCPP with modified sequence versus IC ₅₀ of the intracellular target and comparing cCPP with modified sequence versus IC ₅₀ of the control sequence (as described herein).

The relative cytosolic delivery efficiency of cCPP may be in the range of about 50% to about 450%, such as about 60%, about 70%, about 80%, about 90%, about 100%, about 110%, about 120%, about 130%, about 140%, about 150%, about 160%, about 170%, about 180%, about 190%, about 200%, about 210%, about 220%, about 230%, about 240%, about 250%, about 260%, about 270%, about 280%, about 290%, about 300%, about 310%, about 320%, about 330%, about 340%, about 350%, about 360%, about 370%, about 380%, about 390%, about 400%, about 410%, about 420%, about 430%, about 440%, about 450%, about 460%, about 470%, about 480%, about 490%, about 500%, about 510%, about 520%, about 530%, about 540%, about 550%, about 560%, about 570%, about 580%, or about 590%, as compared to ring (Ff Φ RrRrQ), including all values and subranges therebetween. The relative cytoplasmic delivery efficiency of cCPP can be improved by greater than about 600% as compared to a cyclic peptide comprising a loop (Ff Φ RrRrQ).

The absolute cytoplasmic delivery potency is about 40% to about 100%, for example about 45%, about 50%, about 55%, about 60%, about 65%, about 70%, about 75%, about 80%, about 85%, about 90%, about 91%, about 92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 98%, about 99%, including all values and subranges therebetween.

The cCPP of the present disclosure can increase cytoplasmic delivery efficiency by a factor of about 1.1 to about 30 times, e.g., about 1.2, about 1.3, about 1.4, about 1.5, about 1.6, about 1.7, about 1.8, about 1.9, about 2.0, about 2.5, about 3.0, about 3.5, about 4.0, about 4.5, about 5.0, about 5.5, about 6.0, about 6.5, about 7.0, about 7.5, about 8.0, about 8.5, about 9.0, about 10, about 10.5, about 11.0, about 11.5, about 12.0, about 12.5, about 13.0, about 13.5, about 14.0, about 14.5, about 15.0, about 15.5, about 16.0, about 16.5, about 17.0, about 17.5, about 18.0, about 18.5, about 19.0, about 19.5, about 21.5, about 20.5, about 25.0, about 22.5, about 21.5, about 25.0, about 25.5, about 22.0, about 25.5, about 21.5, about 26.0, about 25.5, about 26.0, about 26.5, about 25.0, about 26.0, about 26.5, about 0.

Resplicing target proteins

A "target protein" is an amino acid sequence that results from transcription and translation of a target gene. As used herein, "re-spliced target protein" refers to a protein encoded by AC binding to target pre-mRNA transcribed from a target gene. "wild-type target protein" refers to a naturally occurring, correctly translated protein isoform resulting from the correct splicing of a target pre-mRNA encoded by a wild-type target gene. The compounds and methods of the invention can produce an alternative splicing target protein that contains one or more amino acid substitutions, deletions, and/or insertions compared to the wild-type target protein. In embodiments, the re-spliced target protein retains some wild-type target protein activity. In embodiments, the re-spliced target protein produced by administration of a compound of the invention is homologous to a wild-type target protein. In embodiments, the re-spliced target protein has an amino acid sequence that is at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99% and up to 100% identical to the wild-type target protein. In embodiments, the re-spliced target protein is substantially identical to the wild-type target protein. In embodiments, the amino acid sequence of the re-spliced target protein is at least 50% identical to the amino acid sequence of the wild-type target protein. In embodiments, the amino acid sequence of the re-spliced target protein is at least 75% identical to the amino acid sequence of the wild-type target protein. In embodiments, the amino acid sequence of the re-spliced target protein is at least 90% identical to the amino acid sequence of the wild-type target protein. In embodiments, the re-spliced target protein is a shortened form of the wild-type target protein.

In embodiments, the re-splicing of the target protein can rescue one or more phenotypes or symptoms of the disease associated with transcription and translation of the target gene. In embodiments, the re-splicing of the target protein may rescue one or more phenotypes or symptoms of the disease associated with the expression of the target protein. In embodiments, the re-spliced target protein is an active fragment of a wild-type target protein. In embodiments, the re-spliced target protein functions in a substantially similar manner as the wild-type target protein. In embodiments, the re-splicing of the target protein allows the cell to function substantially similarly to a similar cell expressing the wild-type target protein. In embodiments, the re-splicing of the target protein does not cure the disease associated with the target gene or target protein, but ameliorates one or more symptoms of the disease. In embodiments, re-splicing the target protein results in an increase in target protein function of at least about 1%, about 2%, about 3%, about 4%, about 5%, about 6%, about 7%, about 8%, about 9%, about 10%, about 15%, about 205, about 25%, about 30%, about 35%, about 40%, about 45%, about 50%, about 55%, about 60%, about 65%, about 70%, about 75%, about 80%, about 85%, about 90% or about 95%, and up to about 100%.

In embodiments, the alternative splicing target protein can have an amino acid sequence that is reduced by about 1 or more amino acids from the size of the wild-type target protein, for example by about 5, about 10, about 15, about 20, about 25, about 30, about 35, about 40, about 45, about 50, about 55, about 60, about 65, about 70, about 75, about 80, about 90, about 95, about 100, about 105, about 110, about 115, about 120, about 125, about 130, about 135, about 140, about 145, about 150, about 155, about 160, about 165, about 170, about 175, or about 180 or more amino acids.

In embodiments, the re-spliced target protein may have one or more properties that are improved relative to the target protein. In embodiments, the re-spliced target protein may have one or more properties that are improved relative to a wild-type target protein. In embodiments, enzymatic activity or stability may be enhanced by promoting different splicing of target pre-mRNA. In embodiments, the re-spliced target protein may have a sequence identical or substantially similar to a wild-type target protein isomer, with improved properties compared to another wild-type target protein isomer.

In embodiments, one or more properties of the target protein are absent (eliminated) or reduced in the re-spliced target protein. In embodiments, one or more properties of the wild-type target protein are absent (eliminated) or reduced in the re-spliced target protein. Non-limiting examples of properties that may be reduced or eliminated include immunogenicity, angiogenesis, thrombosis, aggregation, and ligand binding activity.

In embodiments, the re-spliced target protein contains one or more amino acid substitutions as compared to the wild-type target protein. In embodiments, the substitution may be a conservative substitution or a non-conservative substitution. Examples of conservative amino acid substitutions include substitution of one amino acid for another amino acid in one of the following groups: basic amino acids (arginine, lysine and histidine), acidic amino acids (glutamic acid and aspartic acid), polar amino acids (glutamine and asparagine), hydrophobic amino acids (leucine, isoleucine and valine), aromatic amino acids (phenylalanine, tryptophan and tyrosine) and small amino acids (glycine, alanine, serine, threonine and methionine). In embodiments, structurally similar amino acids are substituted to reverse the charge of the residue (e.g., glutamine for glutamic acid, or vice versa, aspartic acid for asparagine, or vice versa). In embodiments, tyrosine replaces phenylalanine, or vice versa. Other non-limiting examples of amino acid substitutions are described, for example, by H.Neurath and R.L.Hill,1979 in The Proteins, ACADEMIC PRESS, new York. Common substitutions are Ala/Ser、Val/Ile、Asp/Glu、Thr/Ser、Ala/Gly、Ala/Thr、Ser/Asn、AlaNal、Ser/Gly、Tyr/Phe、Ala/Pro、Lys/Arg、Asp/Asn、Leu/Ile、Leu/Val、Ala/Glu and Asp/Gly.

In embodiments, the re-spliced target protein may comprise substitutions, deletions, and/or insertions at one or more (e.g., several) positions as compared to the wild-type target protein. In embodiments, the number of amino acid substitutions, deletions and/or insertions in the amino acid sequence of the re-spliced target protein is no more than 200, no more than 150, no more than 100, no more than 50, no more than 40, no more than 30, no more than 20, or no more than 10, e.g., 1, 2,3, 4, 5, 6, 7, 8, 9, or 10.

Therapeutic method

In embodiments, the amount of the active ingredient is about 0.1mg/kg to about 1000mg/kg, e.g., about 0.1mg/kg, about 0.2mg/kg, about 0.3mg/kg, about 0.4mg/kg, about 0.5mg/kg, about 0.6mg/kg, about 0.7mg/kg, about 0.8mg/kg, about 0.9mg/kg, about 1mg/kg, about 2mg/kg, about 3mg/kg, about 4mg/kg, about 5mg/kg, about 6mg/kg, about 7mg/kg, about 8mg/kg, about 9mg/kg, about 10mg/kg, About 11mg/kg, about 12mg/kg, about 13mg/kg, about 14mg/kg, about 15mg/kg, about 16mg/kg, about 17mg/kg, about 18mg/kg, about 19mg/kg, about 20mg/kg, about 21mg/kg, about 22mg/kg, about 23mg/kg, about 24mg/kg, about 25mg/kg, about 26mg/kg, about 27mg/kg, about 28mg/kg, about 29mg/kg, about 30mg/kg, about 31mg/kg, about 32mg/kg, about 33mg/kg, about, About 34mg/kg, about 35mg/kg, about 36mg/kg, about 37mg/kg, about 38mg/kg, about 39mg/kg, about 40mg/kg, about 41mg/kg, about 42mg/kg, about 43mg/kg, about 44mg/kg, about 45mg/kg, about 46mg/kg, about 47mg/kg, about 48mg/kg, about 49mg/kg, about 50mg/kg, about 51mg/kg, about 52mg/kg, about 53mg/kg, about 54mg/kg, about 55mg/kg, about 56mg/kg, about, About 57mg/kg, about 58mg/kg, about 59mg/kg, about 60mg/kg, about 61mg/kg, about 62mg/kg, about 63mg/kg, about 64mg/kg, about 65mg/kg, about 66mg/kg, about 67mg/kg, about 68mg/kg, about 69mg/kg, about 70mg/kg, about 71mg/kg, about 72mg/kg, about 73mg/kg, about 74mg/kg, about 75mg/kg, about 76mg/kg, about 77mg/kg, about 78mg/kg, about 79mg/kg, about, about 80mg/kg, about 81mg/kg, about 82mg/kg, about 83mg/kg, about 84mg/kg, about 85mg/kg, about 86mg/kg, about 87mg/kg, about 88mg/kg, about 89mg/kg, about 90mg/kg, about 91mg/kg, about 92mg/kg, about 93mg/kg, about 94mg/kg, about 95mg/kg, about 96mg/kg, about 97mg/kg, about 98mg/kg, about 99mg/kg, about 100mg/kg, about 110mg/kg, about 120mg/kg, about, About 130mg/kg, about 140mg/kg, about 150mg/kg, about 160mg/kg, about 170mg/kg, about 180mg/kg, about 190mg/kg, about 200mg/kg, about 210mg/kg, about 220mg/kg, about 230mg/kg, about 240mg/kg, about 250mg/kg, about 260mg/kg, about 270mg/kg, about 280mg/kg, about 290mg/kg, about 300mg/kg, about 310mg/kg, about 320mg/kg, about 330mg/kg, about 340mg/kg, about 350mg/kg, about 360mg/kg, about 370mg/kg, about 380mg/kg, about 390mg/kg, about 400mg/kg, about 410mg/kg, about 420mg/kg, about 430mg/kg, about 440mg/kg, about 450mg/kg, about 460mg/kg, about 470mg/kg, about 480mg/kg, about 490mg/kg, about 500mg/kg, about 510mg/kg, about 520mg/kg, about 530mg/kg, about 540mg/kg, about, about 550mg/kg, about 560mg/kg, about 570mg/kg, about 580mg/kg, about 590mg/kg, about 600mg/kg, about 610mg/kg, about 620mg/kg, about 630mg/kg, about 640mg/kg, about 650mg/kg, about 660mg/kg, about 670mg/kg, about 680mg/kg, about 690mg/kg, about 700mg/kg, about 710mg/kg, about 720mg/kg, about 730mg/kg, about 740mg/kg, about 750mg/kg, About 760mg/kg, about 770mg/kg, about 780mg/kg, about 790mg/kg, about 800mg/kg, about 810mg/kg, about 820mg/kg, about 830mg/kg, about 840mg/kg, about 850mg/kg, about 860mg/kg, about 870mg/kg, about 880mg/kg, about 890mg/kg, about 900mg/kg, about 910mg/kg, about 920mg/kg, about 930mg/kg, about 940mg/kg, about 950mg/kg, about 960mg/kg, A dose of about 970mg/kg, about 980mg/kg, about 990mg/kg, or about 1000mg/kg (including all values and ranges therebetween) is administered to a patient diagnosed with Duchenne Muscular Dystrophy (DMD) AC of the present disclosure.

The present disclosure provides a method of treating Duchenne Muscular Dystrophy (DMD) in a subject in need thereof comprising administering a compound disclosed herein. In some embodiments, the target gene is DMD. In embodiments, the target sequence comprises at least a portion of exon 44 of the DMD, at least a portion of exon 44 flanking the 3 'intron of the DMD, at least a portion of exon 44 flanking the 5' intron of the DMD, or a combination thereof.

In various embodiments, treatment refers to a partial or complete alleviation, improvement, alleviation, inhibition, delay of onset, severity, and/or reduction of incidence of one or more symptoms in a subject.

In embodiments, a method for altering expression of a target gene in a subject in need thereof is provided, the method comprising administering a compound disclosed herein. In embodiments, the treatment results in reduced expression of the target protein. In embodiments, the treatment results in expression of the alternatively spliced target protein. In embodiments, the treatment results in preferential expression of the wild-type target protein isomer.

In embodiments, treatment according to the present disclosure results in a reduction in expression of a target protein in a subject by more than about 5%, e.g., about 5%, about 10%, about 15%, about 20%, about 25%, about 30%, about 35%, about 40%, about 45%, about 50%, about 55%, about 60%, about 65%, about 70%, about 75%, about 80%, about 85%, about 90%, about 95%, and about 100%, compared to the average level of the target protein in a pre-treatment subject or in one or more control subjects with a similar disease that are untreated. In embodiments, treatment according to the present disclosure results in an increase in expression of the re-spliced target protein in the subject of more than about 5%, e.g., about 5%, about 10%, about 15%, about 20%, about 25%, about 30%, about 35%, about 40%, about 45%, about 50%, about 55%, about 60%, about 65%, about 70%, about 75%, about 80%, about 85%, about 90%, about 95%, and about 100%, as compared to the average level of the target protein in the subject prior to treatment or in one or more control subjects without treatment having a similar disease. In embodiments, treatment according to the present disclosure results in an increase or decrease in expression of a wild-type target protein isomer in a subject by more than about 5%, e.g., about 5%, about 10%, about 15%, about 20%, about 25%, about 30%, about 35%, about 40%, about 45%, about 50%, about 55%, about 60%, about 65%, about 70%, about 75%, about 80%, about 85%, about 90%, about 95%, and about 100%, compared to the average level of the target protein in a pre-treatment subject or in one or more control subjects with no treatment for a similar disease.

As used herein, the terms "improve," "increase," "decrease," and the like indicate values relative to a control. In embodiments, a suitable control is a baseline measurement, such as a measurement in the same individual prior to initiation of the treatment described herein, or a measurement in a control individual (or multiple control individuals) in the absence of the treatment described herein. A "control individual" is an individual suffering from the same disease that is about the same age and/or sex as the individual being treated (to ensure that the disease stages of the treated individual and the control individual are comparable).

The treated individual (also referred to as a "patient" or "subject") is an individual (fetus, infant, child, adolescent or adult) who has a disease or has the potential to develop a disease. An individual may have a disease mediated by aberrant gene expression or aberrant gene splicing. In various embodiments, an individual with a disease may have a wild-type target protein expression or activity level that is about 1% to 99% of the normal protein expression or activity level in an individual not having the disease. In embodiments, the ranges include, but are not limited to, about 80-99%, about 65-80%, about 50-65%, about 30-50%, about 25-30%, about 20-25%, about 15-20%, about 10-15%, about 5-10%, or about 1-5% of the normal thymidine phosphorylase expression or activity level. In embodiments, the individual may have a target protein expression or activity level that is about 1% to about 500% higher than the normal wild-type target protein expression or activity level. In embodiments, the ranges include, but are not limited to, about 1-10%, about 10-50%, about 50-100%, about 100-200%, about 200-300%, about 300-400%, about 400-500%, or about 500-1000% higher target protein expression or activity levels.

In embodiments, the individual is the most recently diagnosed with the disease. Typically, early treatment (starting treatment as soon as possible after diagnosis) is important to minimize the impact of the disease and maximize therapeutic benefit.

In embodiments, the efficacy of the compounds of the present disclosure and AC on DMD is evaluated in an animal model of DMD. Animal models are a valuable resource for studying disease pathogenesis and provide a means for testing dystrophin-related activity. In embodiments, mdx mice and dystrophy gold beagle dogs (GRMD) that are both dystrophin negative (see, e.g., collins & Morgan, int J Exp pathl 84:165-172, 2003) are used to evaluate compounds of the present disclosure. In embodiments, compounds of the present disclosure are evaluated using a C57BL/10ScSn-Dmdmdx/J (Bl 10/mdx) or D2.B10-Dmdmdx/J (D2/mdx) mouse model. In embodiments, transgenic mice carrying the human DMD gene and lacking the mouse Dmd gene (hmdmd/Dmd-free mice) are used to evaluate compounds of the present disclosure. Such mice can be generated by crossing male hDMD mice (available from Jackson Laboratory, bar Harbor, ME) with female DMD-free mice. Each of the following references describes these models and is incorporated by reference herein in its entirety as international publication No. WO2019014772 to ：J Neuromuscul Dis.2018;5(4)：407-417.;Proc Natl Acad Sci U S A.1984;81(4)：1189-92.;Am J Pathol.2010;176(5)：2414-24.;J Clin Invest.2009;119(12)：3703-12;. These and other animal models can be used to measure the functional activity of various dystrophins.

In embodiments, in vitro models are used to evaluate the efficacy of the compositions of the present disclosure. In embodiments, the in vitro model is an immortalized muscle cell model. The model is described in the following articles, which are incorporated herein by reference in their entirety: nguyen et al J Pers Med.2017, month 12; 7 (4): 13.

Preparation method

The compounds described herein may be prepared in a number of ways known to those skilled in the art of organic synthesis or in variations thereof as understood by those skilled in the art. The compounds described herein can be prepared from readily available starting materials. The optimal reaction conditions may vary with the particular reactants or solvents used, but such conditions may be determined by one skilled in the art.

Variations of the compounds described herein include addition, subtraction, or movement of the various components as described for each compound. Similarly, the chirality of a molecule may change when one or more chiral centers are present in the molecule. In addition, compound synthesis may involve protection and deprotection of various chemical groups. The use of protection and deprotection and the selection of appropriate protecting groups can be determined by one skilled in the art. The chemical nature of the protecting groups can be found, for example, in Wuts and Greene, protective Groups in Organic Synthesis, 4 th edition, wiley & Sons,2006, which is incorporated herein by reference in its entirety.

Starting materials and reagents for preparing the disclosed compounds and compositions are available from commercial suppliers such as Aldrich Chemical Co.,(Milwaukee,WI)、Acros Organics(Morris Plains,NJ)、Fisher Scientific(Pittsburgh,PA)、Sigma(St.Louis,MO)、Pfizer(New York,NY)、GlaxoSmithKline(Raleigh,NC)、Merck(Whitehouse Station,NJ)、Johnson&Johnson(New Brunswick,NJ)、Aventis(Bridgewater,NJ)、AstraZeneca(Wilmington,DE)、Novartis(Basel,Switzerland)、Wyeth(Madison,NJ)、Bristol-Myers-Squibb(New York,NY)、Roche(Basel,Switzerland)、Lilly(Indianapolis,IN)、Abbott(Abbott Park,IL)、Schering Plough(Kenilworth,NJ) or Boehringer Ingelheim (Ingelheim, germany) or are prepared by methods known to those skilled in the art following procedures set forth in the references, such as FIESER AND FIESER' S REAGENTS for Organic Synthesis, volumes 1-17 (John Wiley and Sons, 1991); rodd' S CHEMISTRY of Carbon Compounds, volumes 1-5 and supplements (ELSEVIER SCIENCE Publishers, 1989); organic Reactions, volumes 1-40 (John Wiley and Sons, 1991); march' S ADVANCED Organic Chemistry, (John Wiley and Sons, 4 th edition); larock's Comprehensive Organic Transformations (VCH Publishers Inc., 1989). Other materials, such as the pharmaceutical carriers disclosed herein, are available from commercial sources.

The reactions to produce the compounds described herein may be carried out in solvents that may be selected by one skilled in the art of organic synthesis. The solvent may be substantially non-reactive with the starting materials (reactants), intermediates, or products under the conditions (i.e., temperature and pressure) under which the reaction is carried out. The reaction may be carried out in one solvent or a mixture of more than one solvent. Product or intermediate formation may be monitored according to any suitable method known in the art. For example, product formation may be monitored by spectroscopic means such as nuclear magnetic resonance spectroscopy (e.g., ¹ H or ¹³ C), infrared spectroscopy, spectrophotometry (e.g., UV-visible light) or mass spectrometry, or by chromatography such as High Performance Liquid Chromatography (HPLC) or thin layer chromatography.

The disclosed compounds can be prepared by solid phase peptide synthesis in which the amino acid α -N-terminus is protected by an acid or base protecting group. Such protecting groups should have properties that are stable to the conditions of peptide bond formation while being easily removable without disrupting the growing peptide chain or racemizing any chiral centers contained therein. Suitable protecting groups are 9-fluorenylmethoxycarbonyl (Fmoc), t-butoxycarbonyl (Boc), benzyloxycarbonyl (Cbz), biphenyl isopropoxycarbonyl, t-pentyloxycarbonyl, isobornyloxycarbonyl, α -dimethyl-3, 5-dimethoxybenzyloxycarbonyl, o-nitrophenylsulfinyl, 2-cyano-t-butoxycarbonyl and the like. A9-fluorenylmethoxycarbonyl (Fmoc) protecting group is particularly preferred for the synthesis of the disclosed compounds. For side chain amino groups such as lysine and arginine, other preferred side chain protecting groups are 2,5,7, 8-pentamethylchroman-6-sulfonyl (pmc), nitro, p-toluenesulfonyl, 4-methoxybenzene-sulfonyl, cbz, boc and adamantyloxy carbonyl; for tyrosine are benzyl, o-bromobenzyloxy-carbonyl, 2, 6-dichlorobenzyl, isopropyl, t-butyl (t-Bu), cyclohexyl, cyclopentyl and acetyl (Ac); for serine are tert-butyl, benzyl and tetrahydropyranyl; For histidine are trityl, benzyl, cbz, p-toluenesulfonyl and 2, 4-dinitrophenyl; for tryptophan is formyl; benzyl and tert-butyl for aspartic acid and glutamic acid, and triphenylmethyl (trityl) for cysteine. In the solid phase peptide synthesis method, the α -C-terminal amino acid is attached to a suitable solid support or resin. Suitable solid supports useful in the above synthesis are those materials which are inert to the reagents and reaction conditions of the progressive condensation-deprotection reaction and insoluble in the medium used. The solid support used for the synthesis of the α -C-terminal carboxy peptide is a 4-hydroxymethylphenoxymethyl-co (styrene-1% divinylbenzene) or 4- (2 ',4' -dimethoxyphenyl-Fmoc-aminomethyl) phenoxyacetamido ethyl resin available from Applied Biosystems (Foster City, calif.). the α -C-terminal amino acid is coupled to the resin via coupling with or without 4-Dimethylaminopyridine (DMAP), 1-Hydroxybenzotriazole (HOBT), benzotriazole-1-yloxy-tris (dimethylamino) phosphonium hexafluorophosphate (BOP) or bis (2-oxo-3-oxazolidinyl) phosphine chloride (BOPCl) in a solvent such as dichloromethane or DMF at a temperature between 10 ℃ and 50 ℃ for about 1 to about 24 hours via N, N ' -Dicyclohexylcarbodiimide (DCC), N ' -Diisopropylcarbodiimide (DIC) or O-benzotriazol-1-yl-N, N ' -tetramethyluronium Hexafluorophosphate (HBTU). When the solid support is a 4- (2 ',4' -dimethoxyphenyl-Fmoc-aminomethyl) phenoxy-acetamidoethyl resin, the Fmoc group is cleaved with a secondary amine (preferably piperidine) prior to coupling with the α -C-terminal amino acid as described above. One method for coupling with the deprotected 4- (2 ',4' -dimethoxyphenyl-Fmoc-aminomethyl) phenoxy-acetamidoethyl resin is O-benzotriazol-1-yl-N, N, N ', N' -tetramethyluronium hexafluorophosphate (HBTU, 1 eq.) and 1-hydroxybenzotriazole (HOBT, 1 eq.) in DMF. The coupling of the consecutive protected amino acids can be performed in an automated polypeptide synthesizer. In one example, fmoc is used to protect the alpha-N-terminus in the amino acid of the growing peptide chain. Removal of the Fmoc protecting group from the α -N-terminal side of the growing peptide is accomplished by treatment with a secondary amine, preferably piperidine. Each protected amino acid is then introduced in about a 3-fold molar excess and preferably coupled in DMF. The coupling agent may be O-benzotriazol-1-yl-N, N, N ', N' -tetramethyluronium hexafluorophosphate (HBTU, 1 eq.) and 1-hydroxybenzotriazole (HOBT, 1 eq.). at the end of the solid phase synthesis, the polypeptide is removed from the resin and deprotected, either continuously or in a single operation. Removal and deprotection of the polypeptide can be accomplished in a single operation by treating the resin-bound polypeptide with a cleavage reagent comprising anisole, water, ethylene dithiol and trifluoroacetic acid. In the case where the α -C-terminus of the polypeptide is an alkylamide, the resin is cleaved by ammonolysis with the alkylamine. Alternatively, the peptide may be removed by transesterification (e.g., with methanol), followed by ammonolysis, or by direct transamidation. The protected peptide may be purified at this point or used directly in the next step. The removal of the side chain protecting groups can be accomplished using the cleavage mixtures described above. The fully deprotected peptide may be purified by a series of chromatographic steps using any or all of the following types: ion exchange on a weakly basic resin (acetate form); hydrophobic adsorption chromatography on underivatized polystyrene-divinylbenzene (e.g., amberlite XAD); silica gel adsorption chromatography; ion exchange chromatography on carboxymethyl cellulose; partition chromatography (e.g., on Sephadex G-25, LH-20 or counter-current distribution); high Performance Liquid Chromatography (HPLC), particularly reverse phase HPLC on octyl-silica or octadecylsilyl-silica bonded phase column packing.

The above polymers, such as PEG groups, may be attached to AC under any suitable conditions for reacting the protein with the activated polymer molecules. Any method known in the art may be used, including other chemoselective conjugation/attachment methods via acylation, reductive alkylation, michael addition, thiol alkylation, or by reactive groups on the PEG moiety (e.g., aldehyde, amino, ester, thiol, α -haloacetyl, maleimide, or hydrazino) to reactive groups on AC (e.g., aldehyde, amino, ester, thiol, α -haloacetyl, maleimide, or hydrazino). Activating groups useful for attaching the water-soluble polymer to one or more proteins include, but are not limited to, sulfones, maleimides, thiols, triflates (tresylates), aziridines (azidirine), oxiranes, 5-pyridinyl, and alpha-halo acyl groups (e.g., alpha-iodoacetic acid, alpha-bromoacetic acid, alpha-chloroacetic acid). If attached to AC by reductive alkylation, the polymer selected should have a single reactive aldehyde in order to control the degree of polymerization. See, for example, kinstler et al, adv. Drug. Delivery Rev.54:477-485 (2002); roberts et al, adv. Drug Delivery Rev.54:459-476 (2002); and Zalipsky et al, adv. Drug Delivery Rev.16:157-182 (1995).

To covalently attach AC directly to a CPP, the appropriate amino acid residue of the CPP can be reacted with an organic derivatizing agent capable of reacting with selected side chains or N-or C-termini of the amino acid. Reactive groups on the peptide or conjugate moiety include, for example, aldehyde, amino, ester, thiol, α -haloacetyl, maleimide, or hydrazine groups. Derivatizing agents include, for example, maleimidobenzoyl sulfosuccinimidyl ester (conjugated via a cysteine residue), N-hydroxysuccinimide (conjugated via a lysine residue), glutaraldehyde, succinic anhydride, or other agents known in the art.

Methods of synthesizing oligomeric antisense compounds are known in the art. The present disclosure is not limited to methods of synthesizing AC. In embodiments, provided herein are compounds having reactive phosphorus groups that can be used to form internucleoside linkages (including, for example, phosphodiester and phosphorothioate internucleoside linkages). The methods of preparation and/or purification of the precursor or antisense compound are not limited to the compositions or methods provided herein. Methods for the synthesis and purification of DNA, RNA and antisense compounds are well known to those skilled in the art.

The oligomerization of modified and unmodified nucleosides can be carried out routinely according to literature procedures for DNA (Protocols for Oligonucleotides and Analogs, agrawal et al (1993), humana Press) and/or RNA (Scaringe, methods (2001), 23, 206-217.Gait et al Applications of Chemically synthesized RNA in RNA: protein Interactions, smith et al (1998), 1-36.Gallo et al, tetrahedron (2001), 57, 5707-5713).

Antisense compounds provided herein can be conveniently and routinely prepared by well known solid phase synthesis techniques. The equipment used for such synthesis is sold by several suppliers including, for example, applied Biosystems (Foster City, calif.). Any other method known in the art for such synthesis may additionally or alternatively be used. The use of similar techniques to prepare oligonucleotides such as phosphorothioates and alkylated derivatives is well known. The invention is not limited by the synthetic method of antisense compounds.

Methods of oligonucleotide purification and analysis are known to those skilled in the art. Analytical methods include Capillary Electrophoresis (CE) and electrospray mass spectrometry. Such synthetic and analytical methods can be performed in multiwell plates. The method of the present invention is not limited to oligomer purification methods.

Application method

In vivo application of the disclosed compounds and compositions containing them may be accomplished by any suitable method and technique currently or contemplated to be known to those skilled in the art. For example, the disclosed compounds may be formulated in a physiologically or pharmaceutically acceptable form and administered by any suitable route known in the art, including, for example, oral and parenteral routes of administration. As used herein, the term parenteral includes subcutaneous, intradermal, intravenous, intramuscular, intraperitoneal, intrasternal and intrathecal administration, such as by injection. The administration of the disclosed compounds or compositions may be a single administration, or at successive or different intervals, as readily determinable by one of skill in the art.

The compounds disclosed herein and compositions comprising them may also be administered using liposome technology, slow release capsules, implantable pumps, and biodegradable containers. These delivery methods can advantageously provide uniform doses over an extended period of time. The compounds may also be administered in the form of their salt derivatives or in crystalline form.

The compounds disclosed herein may be formulated according to known methods for preparing pharmaceutically acceptable compositions. Formulations are described in detail in many sources well known and readily available to those skilled in the art. For example, remington's Pharmaceutical Science, e.w. martin (1995) describes formulations that can be used in conjunction with the disclosed methods. In general, the compounds disclosed herein can be formulated such that an effective amount of the compound is combined with a suitable carrier in order to facilitate effective administration of the compound. The compositions used may also take various forms. These forms include, for example, solid, semi-solid, and liquid dosage forms such as tablets, pills, powders, liquid solutions or suspensions, suppositories, injectable and infusible solutions, and sprays. The preferred form depends on the intended mode of administration and therapeutic application. The composition also preferably comprises conventional pharmaceutically acceptable carriers and diluents known to those skilled in the art. Examples of carriers or diluents for use with the compounds include ethanol, dimethylsulfoxide, glycerol, alumina, starch, saline and equivalent carriers and diluents. To provide for administration of such doses for the desired therapeutic treatment, the compositions disclosed herein may advantageously comprise between about 0.1% and 100% by weight of one or more of the subject compounds, in total, based on the weight of the total composition comprising the carrier or diluent.

Formulations suitable for administration include, for example, sterile injectable aqueous solutions which may contain antioxidants, buffers, bacteriostats and solutes which render the formulation isotonic with the blood of the intended recipient; and aqueous and non-aqueous sterile suspensions which may contain suspending agents and thickening agents. The formulations may be presented in unit-dose or multi-dose containers, for example sealed ampoules and vials, and may be stored in a freeze-dried (lyophilized) condition requiring only a sterile liquid carrier, for example water for injections, immediately prior to use. Extemporaneous injection solutions and suspensions may be prepared from sterile powders, granules, tablets and the like. It should be understood that the compositions disclosed herein may contain other conventional agents in the art regarding the type of formulation in question, in addition to the ingredients specifically mentioned above.

The compounds disclosed herein and compositions comprising them may be delivered to cells by direct contact with the cells or via carrier means. Carrier means for delivering the compounds and compositions to cells are known in the art and include, for example, encapsulation of the compositions in a liposomal fraction. Another means for delivering the compounds and compositions disclosed herein to a cell includes attaching the compounds to a protein or nucleic acid that is targeted for delivery to the target cell. U.S. Pat. No. 6,960,648 and U.S. application publication nos. 20030032594 and 20020120100 disclose amino acid sequences that can be coupled to another composition and allow translocation of the composition across a biological membrane. U.S. application publication number 20020035243 also describes compositions for transporting biological moieties across cell membranes for intracellular delivery. The compounds may also be incorporated into polymers, examples of which include poly (D-L lactide-co-glycolide) polymers for intracranial tumors; poly [ bis (p-carboxyphenoxy) propane: sebacic acid ] in a molar ratio of 20:80 (as used in GLIADEL); chondroitin; chitin; and chitosan.

The compounds and compositions disclosed herein, including pharmaceutically acceptable salts or prodrugs thereof, may be administered intravenously, intramuscularly, or intraperitoneally by infusion or injection. Solutions of the active agent or salt thereof may be prepared in water, optionally mixed with a non-toxic surfactant. Dispersions can also be prepared in glycerol, liquid polyethylene glycols, triacetin and mixtures thereof, and in oils. Under ordinary conditions of storage and use, these formulations may contain preservatives to prevent microbial growth.

Pharmaceutical dosage forms suitable for injection or infusion may comprise sterile aqueous solutions or dispersions or sterile powders containing the active ingredient which are suitable for the extemporaneous preparation of sterile injectable or infusible solutions or dispersions optionally encapsulated in liposomes. The final dosage form should be sterile, fluid and stable under the conditions of manufacture and storage. The liquid carrier or vehicle may be a solvent or liquid dispersion medium including, for example, water, ethanol, polyols (e.g., glycerol, propylene glycol, liquid polyethylene glycol, and the like), vegetable oils, non-toxic glycerides, and suitable mixtures thereof. Proper fluidity can be maintained, for example, by the formation of liposomes, by the maintenance of the required particle size in the case of dispersions, or by the use of surfactants. Optionally, the action of microorganisms may be prevented by various other antibacterial and antifungal agents, such as parabens, chlorobutanol, phenol, sorbic acid, thimerosal, and the like. In many cases, it will be preferable to include isotonic agents, for example, sugars, buffers, or sodium chloride. Prolonged absorption of the injectable compositions can be brought about by the inclusion of agents which delay absorption, for example, aluminum monostearate and gelatin.

Sterile injectable solutions are prepared by incorporating the compounds and/or agents disclosed herein in the required amounts with various other ingredients enumerated above, as required, followed by filtered sterilization. In the case of sterile powders for the preparation of sterile injectable solutions, the preferred methods of preparation are vacuum-drying and freeze-drying techniques which yield a powder of the active ingredient plus any additional desired ingredient present in the previously sterile-filtered solution thereof.

Useful dosages of the compounds and agents disclosed herein and pharmaceutical compositions can be determined by comparing their in vitro and in vivo activity in animal models. Methods for extrapolating effective dosages in mice and other animals to humans are known in the art.

The dosage range in which the composition is administered is a dosage range large enough to produce the desired effect affecting the symptom or condition. The dosage should not be so large as to cause adverse side effects such as undesired cross-reactions, allergic reactions, etc. Generally, the dosage will vary with the age, condition, sex and degree of disease of the patient and can be determined by one skilled in the art. In the case of any contraindications, the dosage can be adjusted by the individual physician. The dosage may vary, and may be administered in one or more doses per day for one or more days.

Also disclosed are pharmaceutical compositions comprising a combination of a compound disclosed herein and a pharmaceutically acceptable carrier. Pharmaceutical compositions suitable for oral, topical or parenteral administration comprising an amount of a compound constitute a preferred aspect. The dose administered to a patient, particularly a human, should be sufficient to achieve a therapeutic response in the patient within a reasonable time frame without lethal toxicity, and preferably cause no more than an acceptable level of side effects or morbidity. Those skilled in the art will recognize that the dosage will depend on a variety of factors including the condition (health) of the subject, the weight of the subject, the type of concurrent therapy (if any), the frequency of treatment, the rate of treatment, and the severity and stage of the pathological condition.

Also disclosed are kits comprising compounds disclosed herein in one or more containers. The disclosed kits may optionally include a pharmaceutically acceptable carrier and/or diluent. In one embodiment, the kit comprises one or more other components, adjuvants or adjuvants as described herein. In another embodiment, the kit includes one or more anti-cancer agents, such as those described herein. In one embodiment, the kit includes instructions or packaging materials describing how to administer the compounds or compositions of the kit. The container of the kit may be of any suitable material, such as glass, plastic, metal, etc., and may be of any suitable size, shape or configuration. In one embodiment, the compounds and/or agents disclosed herein are provided in the kit as a solid (such as in tablet, pill, or powder form). In another embodiment, the compounds and/or agents disclosed herein are provided in a kit as a liquid or solution. In one embodiment, the kit comprises an ampoule or syringe containing a compound and/or agent disclosed herein in liquid or solution form.

Various embodiments of the present invention have been described. Nevertheless, it will be understood that various modifications may be made without departing from the spirit and scope of the invention. Accordingly, other embodiments are within the scope of the following claims.

Certain definitions

As used in the specification and the appended claims, the singular forms "a", "an", and "the" include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to "a composition" includes a mixture of two or more such compositions, reference to "an agent" includes a mixture of two or more such agents, reference to "the component" includes a mixture of two or more such components, and the like.

The term "about" when immediately preceding a numerical value means a range (e.g., plus or minus 10% of the stated value). For example, "about 50" may mean 45 to 55, "about 25,000" may mean 22,500 to 27,500, etc., unless the context of the present disclosure indicates otherwise, or is inconsistent with such interpretation. For example, in a list of values such as "about 49, about 50, about 55,", about 50 "means a range extending to less than half the interval between the front and back values, e.g., greater than 49.5 to less than 52.5. Furthermore, the phrase "less than about" value or "greater than about" value should be understood in accordance with the definition of the term "about" provided herein. Similarly, the term "about" when preceding a series of values or ranges of values (e.g., "about 10, 20, 30" or "about 10-30") refers to all values in the series or endpoints of the range, respectively.

The terms "miniPEG", "PEG2" and "AEEA" are used interchangeably herein to refer to 2- [2- [ 2-aminoethoxy ] ethoxy ] acetic acid.

As used herein, the term "cyclic cell penetrating peptide" or "cCPP" refers to a peptide that facilitates the delivery of AC into a cell.

As used herein, the term "endosomal escape vector" (EEV) refers to cCPP conjugated to a linker and/or an Exocyclic Peptide (EP) by chemical linkage (i.e., covalent or non-covalent interactions). The EEV may be an EEV of formula (B).

As used herein, the term "EEV-conjugate" refers to an endosomal escape carrier as defined herein conjugated to AC by chemical linkage (i.e., covalent or non-covalent interactions). AC may be delivered into cells via EEV. The EEV-conjugate may be an EEV-conjugate of formula (C).

As used herein, the terms "exocyclic peptide" (EP) and "modulator peptide" (MP) are used interchangeably to refer to two or more amino acid residues joined by peptide bonds that can be conjugated to a cyclic cell penetrating peptide (cCPP) as disclosed herein. When conjugated to the cyclic peptides disclosed herein, EP may alter the tissue distribution and/or retention of the compound. Typically, an EP comprises at least one positively charged amino acid residue, e.g. at least one lysine residue and/or at least one arginine residue. Non-limiting examples of EPs are described herein. An EP may be a peptide identified in the art as a "nuclear localization sequence" (NLS). Non-limiting examples of nuclear localization sequences include the nuclear localization sequence of the SV40 viral large T antigen, whose smallest functional units are the seven amino acid sequence PKKKRKV, the double-typed nucleoplasmin NLS with sequence NLSKRPAAIKKAGQAKKKK, the c-myc nuclear localization sequence with amino acid sequence PAAKRVKLD or RQRRNELKRSF, the sequence RMRKFKNKGKDTAELRRRRVEVSVELRKAKKDEQILKRRNV from the IBB domain of the input protein-alpha, the sequences VSRKRPRP and PPKKARED of the myoma T protein, the sequence PQPKKKPL of human p53, the sequence SALIKKKKKMAP of mouse c-abl IV, the sequences DRLRR and PKQKKRK of influenza virus NS1, the sequence RKLKKKIKKL of hepatitis virus delta antigen and the sequence REKKKFLKRR of mouse Mxl protein, the sequence KRKGDEVDGVDEVAKKKSKK of human poly (ADP-ribose) polymerase and the sequence RKCLQAGMNLEARKTKK of steroid hormone receptor (human) glucocorticoid. Additional examples of NLS are described in International publication No. 2001/038547 and incorporated herein by reference in its entirety.

As used herein, "linker" or "L" refers to a moiety that covalently binds one or more moieties (e.g., exocyclic Peptides (EP) and AC) to a cyclic cell penetrating peptide (cCPP). The linker may comprise a natural or unnatural amino acid or polypeptide. The linker may be a synthetic compound containing two or more suitable functional groups suitable for binding cCPP to AC to form the compounds disclosed herein. The linker may comprise a polyethylene glycol (PEG) moiety. The linker may comprise one or more amino acids. cCPP can be covalently bound to AC via a linker.

As used herein, the term "oligonucleotide" refers to an oligomeric compound comprising a plurality of linked nucleotides or nucleosides. One or more nucleotides of the oligonucleotide may be modified. The oligonucleotides may include ribonucleic acid (RNA) or deoxyribonucleic acid (DNA). Oligonucleotides may be composed of natural and/or modified nucleobases, sugars, and covalent internucleoside linkages, and may further include non-nucleic acid conjugates.

The terms "peptide," "protein," and "polypeptide" are used interchangeably to refer to a natural or synthetic molecule comprising two or more amino acids linked to an alpha amino group of one amino acid through a carboxyl group of another amino acid. Two or more amino acid residues may be linked to an alpha amino group through the carboxyl group of one amino acid. Two or more amino acids of a polypeptide may be joined by peptide bonds. A polypeptide may include peptide backbone modifications in which two or more amino acids are covalently attached by bonds other than peptide bonds. The polypeptide may include one or more unnatural amino acids, amino acid analogs, or other synthetic molecules that are capable of being integrated into the polypeptide. The term polypeptide includes naturally occurring and artificially occurring amino acids. The term polypeptide includes, for example, peptides comprising about 2 to about 100 amino acid residues as well as proteins comprising more than about 100 amino acid residues or more than about 1000 amino acid residues, including but not limited to therapeutic proteins such as antibodies, enzymes, receptors, soluble proteins, and the like.

The term "therapeutic polypeptide" refers to a polypeptide that has therapeutic, prophylactic or other biological activity. The therapeutic polypeptide may be produced in any suitable manner. For example, the therapeutic polypeptide may be isolated or purified from a naturally occurring environment, may be chemically synthesized, may be recombinantly produced, or a combination thereof.

The term "small molecule" refers to an organic compound that is pharmacologically active and has a molecular weight of less than about 2000 daltons, or less than about 1000 daltons, or less than about 500 daltons. Small molecule therapeutics are typically manufactured by chemical synthesis.

As used herein, the term "contiguous" refers to two amino acids joined by a covalent bond. For example, in a representative cyclic cell penetrating peptide (cCPP) such asContiguous pairs of amino acids are illustrated in the case of AA ₁/AA₂、AA₂/AA₃、AA₃/AA₄ and AA ₅/AA₁.

As used herein, a residue of a chemical refers to a derivative of a chemical that is present in a particular product. To form a product, at least one atom of the substance is replaced with a bond to another moiety, such that the product contains a derivative or residue of the chemical substance. For example, the cyclic cell penetrating peptides (cCPP) described herein have amino acids (e.g., arginine) incorporated therein by formation of one or more peptide bonds. The amino acid incorporated into cCPP may be referred to as a residue, or simply as an amino acid. Thus, arginine or arginine residues refer to

The term "protonated form thereof" refers to a protonated form of an amino acid. For example, the guanidinium group on the arginine side chain may be protonated to form a guanidinium group. The structure of the protonated form of arginine is

As used herein, the term "chiral" refers to a molecule having more than one stereoisomer that differs in the three-dimensional arrangement of atoms, wherein one stereoisomer is a non-superimposable mirror image of the other stereoisomer. In addition to glycine, amino acids have a chiral carbon atom adjacent to a carboxyl group. The term "enantiomer" refers to a chiral stereoisomer. Chiral molecules may be amino acid residues having the "D" and "L" enantiomers. Molecules without chiral centers, such as glycine, may be referred to as "achiral".

As used herein, the term "hydrophobic" refers to a moiety that is insoluble or has minimal solubility in water. Typically, the neutral and/or non-polar moiety, or predominantly neutral and/or non-polar moiety, is hydrophobic. Hydrophobicity can be measured by one of the methods disclosed herein below.

As used herein, "aromatic" refers to an unsaturated ring molecule having 4n+2 pi electrons, wherein n is any integer. The term "non-aromatic" refers to any unsaturated ring molecule that does not fall within the definition of aromatic.

"Alkyl", "alkyl chain" or "alkyl group" refers to a fully saturated, straight or branched hydrocarbon chain group having one to forty carbon atoms and attached to the remainder of the molecule by a single bond. Including alkyl groups containing any number of carbon atoms from 1 to 40. The alkyl group containing up to 40 carbon atoms is a C ₁-C₄₀ alkyl group, the alkyl group containing up to 10 carbon atoms is a C ₁-C₁₀ alkyl group, the alkyl group containing up to 6 carbon atoms is a C ₁-C₆ alkyl group, and the alkyl group containing up to 5 carbon atoms is a C ₁-C₅ alkyl group. C ₁-C₅ alkyl includes C ₅ alkyl, C ₄ alkyl, C ₃ alkyl, C ₂ alkyl and C ₁ alkyl (i.e., methyl). C ₁-C₆ alkyl includes all of the moieties described above for C ₁-C₅ alkyl, but also includes C ₆ alkyl. C ₁-C₁₀ alkyl includes all of the moieties described above for C ₁-C₅ alkyl and C ₁-C₆ alkyl, but also includes C ₇、C₈、C₉ and C ₁₀ alkyl. similarly, C ₁-C₁₂ alkyl includes all of the foregoing moieties, but also includes C ₁₁ and C ₁₂ alkyl. Non-limiting examples of C ₁-C₁₂ alkyl groups include methyl, ethyl, n-propyl, isopropyl, sec-propyl, n-butyl, isobutyl, sec-butyl, tert-butyl, n-pentyl, tert-pentyl, n-hexyl, n-heptyl, n-octyl, n-nonyl, n-decyl, n-undecyl and n-dodecyl. unless specifically stated otherwise in the specification, an alkyl group may be optionally substituted.

"Alkylene", "alkylene chain" or "alkylene group" refers to a fully saturated straight or branched divalent hydrocarbon chain group having one to forty carbon atoms. Non-limiting examples of C ₂-C₄₀ alkylene include ethylene, propylene, n-butylene, vinylene, propenylene, n-butenylene, propynylene, n-butynylene, and the like. Unless specifically stated otherwise in the specification, the alkylene chain may be optionally substituted.

"Alkenyl", "alkenyl chain" or "alkenyl group" refers to a straight or branched hydrocarbon chain group having two to forty carbon atoms and having one or more carbon-carbon double bonds. Each alkenyl group is attached to the remainder of the molecule by a single bond. Including alkenyl groups containing any number of carbon atoms from 2 to 40. Alkenyl containing up to 40 carbon atoms is C ₂-C₄₀ alkenyl, alkenyl containing up to 10 carbon atoms is C ₂-C₁₀ alkenyl, alkenyl containing up to 6 carbon atoms is C ₂-C₆ alkenyl, and alkenyl containing up to 5 carbon atoms is C ₂-C₅ alkenyl. C ₂-C₅ alkenyl includes C ₅ alkenyl, C ₄ alkenyl, C ₃ alkenyl and C ₂ alkenyl. c ₂-C₆ alkenyl includes all of the moieties described above with respect to C2-C5 alkenyl, but also includes C ₆ alkenyl. C ₂-C₁₀ alkenyl includes all of the moieties described above for C ₂-C₅ alkenyl and C ₂-C₆ alkenyl, but also includes C ₇、C₈、C₉ and C ₁₀ alkenyl. Similarly, C ₂-C₁₂ alkenyl includes all of the foregoing moieties, but also includes C ₁₁ and C ₁₂ alkenyl. Non-limiting examples of C ₂-C₁₂ alkenyl include vinyl (ethenyl/vinyl), 1-propenyl, 2-propenyl (allyl), isopropenyl, 2-methyl-1-propenyl, 1-butenyl, 2-butenyl, 3-butenyl, 1-pentenyl, 2-pentenyl, 3-pentenyl, 4-pentenyl, 1-hexenyl, 2-hexenyl, 3-hexenyl, 4-hexenyl, 5-hexenyl, 1-heptenyl, 2-heptenyl, 3-heptenyl, 4-heptenyl, 5-heptenyl, 6-heptenyl, 1-octenyl, 2-octenyl, 3-octenyl, 4-octenyl, 5-octenyl, 6-octenyl, 7-octenyl, 1-nonenyl, 2-nonenyl, 3-nonenyl, 4-nonenyl, 5-nonenyl, 6-nonenyl, 7-nonenyl, 8-nonenyl, 1-decenyl, 2-decenyl, 3-decenyl, 4-decenyl, 5-decenyl, 6-decenyl, 7-decenyl, 8-decenyl, 9-decenyl, 1-undecenyl, 2-undecenyl, 3-undecenyl, 4-undecenyl, 5-undecenyl, 6-undecenyl, 7-undecenyl, 8-undecenyl, 9-undecenyl, 10-undecenyl, 1-dodecenyl, 2-dodecenyl, 3-dodecenyl, 4-dodecenyl, 5-dodecenyl, 6-dodecenyl, 7-dodecenyl, 8-dodecenyl, 9-dodecenyl, 10-dodecenyl and 11-dodecenyl. unless specifically stated otherwise in the specification, an alkyl group may be optionally substituted.

"Alkenylene", "alkenylene" or "alkenylene group" refers to a straight or branched divalent hydrocarbon chain radical having two to forty carbon atoms and having one or more carbon-carbon double bonds. Non-limiting examples of C ₂-C₄₀ alkenylene groups include ethylene, propylene, butene, and the like. Unless specifically stated otherwise in the specification, alkenylene chains may be optional.

"Alkoxy" OR "alkoxy group" refers to the group-OR, wherein R is alkyl, alkenyl, alkynyl, cycloalkyl, OR heterocyclyl as defined herein. Unless specifically stated otherwise in the specification, an alkoxy group may be optionally substituted.

"Acyl" or "acyl group" refers to the group-C (O) R, wherein R is hydrogen, alkyl, alkenyl, alkynyl, carbocyclyl, or heterocyclyl, as defined herein. Unless specifically stated otherwise in the specification, an acyl group may be optionally substituted.

"Alkylcarbamoyl" or "alkylcarbamoyl group" refers to the group-O-C (O) -NR _aR_b, wherein R _a and R _b are the same or different and are independently alkyl, alkenyl, alkynyl, aryl, heteroaryl as defined herein, or R _aR_b may together form a cycloalkyl group or a heterocyclyl group as defined herein. Unless specifically stated otherwise in the specification, alkylcarbamoyl groups may be optionally substituted.

"Alkylcarboxamide" or "alkylcarboxamide group" refers to the group-C (O) -NR _aR_b, wherein Ra and Rb are the same or different and are independently an alkyl, alkenyl, alkynyl, aryl, heteroaryl, cycloalkyl, cycloalkenyl, cycloalkynyl or heterocyclyl group as defined herein, or R _aR_b may be taken together to form a cycloalkyl group as defined herein. Unless specifically stated otherwise in the specification, the alkylcarboxamido groups may be optionally substituted.

"Aryl" refers to a hydrocarbon ring system group comprising hydrogen, 6 to 18 carbon atoms, and at least one aromatic ring. For the purposes of the present invention, aryl groups may be monocyclic, bicyclic, tricyclic or tetracyclic ring systems, which may include fused or bridged ring systems. Aryl groups include, but are not limited to, aryl groups derived from acetate (ACEANTHRYLENE), acenaphthylene (ACENAPHTHYLENE), acetenyl (acetenyl), anthracene, azulene (azulene), benzene, chrysene (chrysene), fluoranthene (fluoranthene), fluorene, asymmetric indacene (as-indacene), symmetric indacene (s-indacene), indane, indene, naphthalene, phenalene (phenalene), phenanthrene, heptaidiene (pleiadiene), pyrene, and benzophenanthrene. Unless specifically stated otherwise in the specification, the term "aryl" is intended to include optionally substituted aryl groups.

"Heteroaryl" refers to a group of a5 to 20 membered ring system comprising a hydrogen atom, one to thirteen carbon atoms, one to six heteroatoms selected from nitrogen, oxygen and sulfur, and at least one aromatic ring. For the purposes of the present invention, heteroaryl groups may be monocyclic, bicyclic, tricyclic or tetracyclic ring systems, which may include fused or bridged ring systems; and the nitrogen, carbon or sulfur atoms in the heteroaryl group may optionally be oxidized; the nitrogen atom may optionally be quaternized. Examples include, but are not limited to, azepinyl, acridinyl, benzimidazolyl, benzothiazolyl, benzindolyl, benzodioxolyl, benzofuranyl, benzoxazolyl, benzothiazolyl, benzothiadiazolyl, benzo [ b ] [1,4] dioxacycloheptenyl, 1, 4-benzodioxanyl, benzonaphtofuranyl, benzoxazolyl, benzodioxolyl, benzodioxanyl, benzopyranyl, benzopyranonyl, benzofuranyl, benzofuranonyl, benzothienyl (benzothiophenyl), benzotriazolyl, benzo [4,6] imidazo [1,2-a ] pyridinyl, carbazolyl, cinnolinyl, dibenzofuranyl, dibenzothienyl, isothiazolyl, imidazolyl, furanyl, furanonyl, isothiazolyl, imidazolyl indazolyl, indolyl, indazolyl, isoindolyl, indolinyl, isoindolinyl, isoquinolinyl, indolizinyl, isoxazolyl, naphthyridinyl, oxadiazolyl, 2-oxoazepinyl, oxazolyl, oxiranyl, 1-oxopyridyl, 1-oxopyrimidinyl, 1-oxopyrazinyl, 1-oxopyridazinyl, 1-phenyl-1H-pyrrolyl, phenazinyl, phenothiazinyl, phenoxazinyl, phthalazinyl, pteridinyl, purinyl, pyrrolyl, pyrazolyl, pyridyl, pyrazinyl, pyrimidinyl, pyridazinyl, quinazolinyl, quinoxalinyl, quinolinyl, quinuclidinyl, isoquinolinyl, tetrahydroquinolinyl, thiazolyl, thiadiazolyl, triazolyl, tetrazolyl, triazinyl, and thiophenyl (i.e., thienyl). Unless specifically stated otherwise in the specification, heteroaryl groups may be optionally substituted.

The term "substituted" as used herein means any of the above groups (i.e., alkyl, alkenyl, alkynyl, cycloalkyl, cycloalkenyl, cycloalkynyl, heterocyclyl, aryl, heteroaryl, alkoxy, aryloxy, acyl, alkylcarbamoyl, alkylcarboxamide, alkoxycarbonyl, alkylthio, or arylthio) in which at least one atom is replaced by a non-hydrogen atom such as, but not limited to: halogen atoms such as F, cl, br, and I; oxygen atoms in groups such as hydroxyl groups, alkoxy groups, and ester groups; a sulfur atom in a group such as a thiol group, a thioalkyl group, a sulfone group, a sulfonyl group, and a sulfoxide group; Nitrogen atoms in groups such as amines, amides, alkylamines, dialkylamines, arylamines, alkylaryl amines, diarylamines, N-oxides, imides, and enamines; silicon atoms in groups such as trialkylsilyl groups, dialkylarylsilyl groups, alkyldiarylsilyl groups and triarylsilyl groups; and other heteroatoms in various other groups. "substituted" also means any of the above groups in which one or more atoms are replaced with Gao Jiejian (e.g., double or triple bonds) to heteroatoms (such as oxygen in oxo, carbonyl, carboxyl, and ester groups; and nitrogen in groups such as imines, oximes, hydrazones, and nitriles). For example, "substituted" includes any of the above groups in which one or more atoms are replaced by -NR_gR_h、-NR_gC(＝O)R_h、-NR_gC(＝O)NR_gR_h、-NR_gC(＝O)OR_h、-NR_gSO₂R_h、-OC(＝O)NR_gR_h、-OR_g、-SR_g、-SOR_g、-SO₂R_g、-OSO₂R_g、-SO₂OR_g、＝NSO₂R_g and-SO ₂NR_gR_h. "substituted" also means any of the above groups in which one or more hydrogen atoms are replaced by -C(＝O)R_g、-C(＝O)OR_g、-C(＝O)NR_gR_h、-CH₂SO₂R_g、-CH₂SO₂NR_gR_h. In the above, R _g and R _h are the same or different and are independently hydrogen, alkyl, alkenyl, alkynyl, alkoxy, alkylamino, thioalkyl, aryl, aralkyl, cycloalkyl, cycloalkenyl, cycloalkynyl, cycloalkylalkyl, haloalkyl, haloalkenyl, haloalkynyl, heterocyclyl, N-heterocyclyl, heterocyclylalkyl, heteroaryl, N-heteroaryl and/or heteroarylalkyl. "substituted" also means any of the foregoing groups in which one or more atoms is replaced with an amino, cyano, hydroxy, imino, nitro, oxo, thio, halo, alkyl, alkenyl, alkynyl, alkoxy, alkylamino, thioalkyl, aryl, aralkyl, cycloalkyl, cycloalkenyl, cycloalkynyl, cycloalkylalkyl, haloalkyl, haloalkenyl, haloalkynyl, heterocyclyl, N-heterocyclyl, heterocyclylalkyl, heteroaryl, N-heteroaryl, and/or heteroarylalkyl group. "substituted" may also mean an amino acid in which one or more atoms in the side chain are replaced by alkyl, alkenyl, alkynyl, acyl, alkylcarboxamide, alkoxycarbonyl, carbocyclyl, heterocyclyl, aryl or heteroaryl groups. in addition, each of the foregoing substituents may also be optionally substituted with one or more of the foregoing substituents.

As used herein, "subject" means an individual. Thus, a "subject" may include domestic animals (e.g., cats, dogs, etc.), farm animals (e.g., cattle, horses, pigs, sheep, goats, etc.), laboratory animals (e.g., mice, rabbits, rats, guinea pigs, etc.), and birds. "subject" may also include mammals, such as primates or humans. Thus, the subject may be a human or veterinary patient. The term "patient" refers to a subject under treatment by a clinician (e.g., physician).

The term "inhibition" refers to a decrease in activity, response, illness, disease or other biological parameter. This may include, but is not limited to, complete elimination of an activity, response, disorder or disease. This may also include, for example, a 10% reduction in activity, response, illness or disease compared to a natural or control level. Thus, the reduction may be a 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100% or any reduction therebetween, as compared to a native or control level.

"Reduction" or other forms of the word, such as "reduction", means a decrease in an event or feature (e.g., tumor growth). It will be appreciated that this is typically associated with some standard or expected value, in other words it is relative, but reference to a standard or relative value is not always required. For example, "reducing tumor growth" means reducing the growth rate of a tumor relative to a standard or control (e.g., untreated tumor).

The term "treatment" refers to the medical management of a patient with the aim of curing, ameliorating, stabilizing or preventing a disease, pathological condition or disorder. The term includes active therapies, i.e. therapies directed specifically to ameliorating a disease, pathological condition or disorder, and also includes causal therapies, i.e. therapies directed to abrogating the etiology of the associated disease, pathological condition or disorder. In addition, the term includes palliative treatment, i.e., treatment intended to alleviate symptoms rather than cure a disease, pathological condition, or disorder; prophylactic treatment, i.e., treatment intended to minimize or partially or completely inhibit the development of a related disease, pathological condition, or disorder; and supportive treatment, i.e., treatment for supplementing another specific therapy aimed at ameliorating the associated disease, pathological condition or disorder.

The term "therapeutically effective" means that the amount of the composition used is sufficient to ameliorate one or more causes or symptoms of the disease or disorder. Such improvements need only be reduced or altered and need not be eliminated.

The term "pharmaceutically acceptable" refers to those compounds, materials, compositions, and/or dosage forms which are, within the scope of sound medical judgment, suitable for use in contact with the tissues of human beings and animals without excessive toxicity, irritation, allergic response, or other problem or complication commensurate with a reasonable benefit/risk ratio.

The term "carrier" means a compound, composition, substance or structure that, when combined with a compound or composition, facilitates or facilitates the preparation, storage, administration, delivery, availability, selectivity or any other characteristic of the compound or composition for its intended use or purpose. For example, the carrier may be selected to minimize any degradation of the active ingredient and to minimize any adverse side effects in the subject.

As used herein, the term "pharmaceutically acceptable carrier" refers to sterile aqueous or nonaqueous solutions, dispersions, suspensions or emulsions, as well as sterile powders for reconstitution into sterile injectable solutions or dispersions just prior to use. Examples of suitable aqueous and nonaqueous carriers, diluents, solvents or vehicles include water, ethanol, polyols (such as glycerol, propylene glycol, polyethylene glycol, and the like), carboxymethyl cellulose and suitable mixtures thereof, vegetable oils (such as olive oil), and injectable organic esters such as ethyl oleate. Proper fluidity can be maintained, for example, by the use of a coating material such as lecithin, by the maintenance of the required particle size in the case of dispersions and by the use of surfactants. These compositions may also contain adjuvants such as preserving, wetting, emulsifying and dispersing agents. Prevention of the action of microorganisms can be ensured by the inclusion of various antibacterial and antifungal agents such as parabens, chlorobutanol, phenol, sorbic acid, and the like. It may also be desirable to include isotonic agents, such as sugars, sodium chloride, and the like. The injectable formulation may be sterilized, for example, by filtration through a bacterial-retaining filter, or by incorporating sterilizing agents in the form of sterile solid compositions which may be dissolved or dispersed in sterile water or other sterile injectable medium immediately prior to use. Suitable inert carriers may include sugars such as lactose.

As used herein, the term "sequence identity" refers to the percentage of amino acids that are identical between two polypeptide sequences and in the same relative position. Thus, a polypeptide sequence has a certain percentage of sequence identity compared to another polypeptide sequence. For sequence comparison, one sequence is typically used as a reference sequence, which is compared to a test sequence. One of ordinary skill in the art will appreciate that two sequences are generally considered "substantially identical" if they contain the same residues at the corresponding positions. In embodiments, sequence identity between two amino acid sequences is determined using the Needleman-Wunsch algorithm (Needleman and Wunsch,1970, J.mol. Biol. 48:443-453) (version present by the date of filing) implemented in the Needle program of the EMBOSS package (EMBOSS: the European Molecular Biology Open Software Suite, rice et al, 2000,Trends Genet.16:276-277). The parameters used are gap opening penalty of 10, gap extension penalty of 0.5 and EBLOSUM62 (the emoss version of BLOSUM 62) substitution matrix. The Needle output labeled "longest identity" (obtained using the-nobrief option) was used as the percent identity and calculated as follows: (identical residue. Times.100)/(alignment Length-total number of gaps in the alignment)

In embodiments, sequence identity may be determined using the Smith-Waterman algorithm with versions that exist by the date of submission.

As used herein, "sequence homology" refers to the percentage of amino acid homology between two polypeptide sequences and in the same relative position. Thus, a polypeptide sequence has a certain percentage of sequence homology as compared to another polypeptide sequence. As will be appreciated by one of ordinary skill in the art, two sequences are generally considered "substantially homologous" if they contain homologous residues at corresponding positions. Homologous residues may be identical residues. Or homologous residues may be different residues having suitably similar structural and/or functional characteristics. For example, as is well known to those of ordinary skill in the art, certain amino acids are typically classified as "hydrophobic" or "hydrophilic" amino acids, and/or as having "polar" or "non-polar" side chains, and amino acid substitutions of one amino acid for another amino acid of the same type can generally be considered "homologous" substitutions.

As is well known in the art, any of a variety of algorithms may be used to compare amino acid sequences, including those available in commercial computer programs, such as BLASTP, gapped BLAST, and PSI-BLAST, which exist by the date of filing. Exemplary such procedures are described below: altschul et al, basic local ALIGNMENT SEARCH tool, J.mol.biol.,215 (3): 403-410, 1990; altschul et al Methods in Enzymology; altschul et al ,"Gapped BLAST and PSI-BLAST：a new generation of protein database search programs",Nucleic Acids Res.25：3389-3402,1997;Baxevanis et al, bioinformation A PRACTICAL Guide to THE ANALYSIS of Genes and Proteins, wiley,1998; and Misener et al, (ed), bioinformatics Methods and Protocols (Methods in Molecular Biology, vol.132), humana Press,1999. In addition to identifying homologous sequences, the above procedure typically provides an indication of the degree of homology.

As used herein, the terms "antisense compound" and "AC" are used interchangeably to refer to a polymeric nucleic acid structure (also referred to as an oligonucleotide or polynucleotide) that is at least partially complementary to a target nucleic acid molecule to which it (AC) hybridizes. AC may be a short (in embodiments, less than 50 base pairs) polynucleotide or a polynucleotide homolog comprising a sequence complementary to a target sequence in a target pre-mRNA strand. AC may be formed from natural nucleic acids, synthetic nucleic acids, nucleic acid homologs, or any combination thereof. In embodiments, the AC comprises an oligonucleotide. In embodiments, the AC comprises an antisense oligonucleotide. In embodiments, AC comprises a conjugate group. Non-limiting examples of AC include, but are not limited to, primers, probes, antisense oligonucleotides, external Guide Sequence (EGS) oligonucleotides, alternative splicers, siRNA, oligonucleotides, oligonucleotide analogs, oligonucleotide mimics, and chimeric combinations of these. Thus, these compounds may be introduced in single-stranded, double-stranded, cyclic, branched or hairpin form, and may contain structural elements such as internal or terminal bulges or loops. The oligomeric double-stranded compound may be two strands that hybridize to form a double-stranded compound, or a single strand that has sufficient self-complementarity to allow hybridization and formation of a complete or partial double-stranded compound. In embodiments, AC modulates (increases, decreases, or alters) expression of a target nucleic acid. Various modifications may be made to the polymeric nucleic acid structure, such as Phosphorodiamidate Morpholino (PMO). Thus, AC as used herein encompasses any modification described herein, such as PMO.

As used herein, the terms "pre-mRNA" and "primary transcript" refer to eukaryotic mRNA molecules that are newly synthesized directly after transcription of DNA. The pre-mRNA must be capped with a 5 'cap, modified with a 3' poly A tail, and spliced to produce the mature mRNA sequence.

As used herein, the term "targeting" or "targeting" refers to the association of an Antisense Compound (AC) with a target nucleic acid molecule or target nucleic acid molecule region. In embodiments, the AC is capable of hybridizing to the target nucleic acid under physiological conditions. In embodiments, the AC targets a particular portion or site within the target nucleic acid, e.g., a portion of the target nucleic acid having at least one identifiable structure, function, or feature, such as a particular exon or intron, or a selected nucleobase or motif within an exon and/or intron. In embodiments, the AC targets a region comprising an intron-exon junction of a gene associated with a disease or disorder. In embodiments, AC targets exon 45 of the dystrophin gene. In embodiments, the AC targets a region comprising an intron-exon junction of exon 45 of the dystrophin gene. In embodiments, the AC targets a region comprising an intron nucleotide sequence upstream (or 5') of exon 45 of the dystrophin gene. In embodiments, the AC targets a region comprising an intron nucleotide sequence upstream (or 5') of exon 45 of the dystrophin gene.

As used herein, the terms "target nucleic acid" and "target sequence" refer to a nucleic acid molecule comprising a nucleic acid sequence that binds or hybridizes to an antisense compound. Target nucleic acids include, but are not limited to, RNAs (including but not limited to pre-mrnas and mrnas or portions thereof), cdnas derived from such RNAs, and untranslated RNAs such as mirnas. For example, in embodiments, a target nucleic acid may be a nucleic acid molecule that expresses a cellular gene (or mRNA transcribed from such gene) associated with a particular disorder or disease condition, or an infectious agent. In embodiments, the target nucleic acid is a target RNA. In embodiments, the target nucleic acid is a target mRNA. In embodiments, the target nucleic acid is a target pre-mRNA. In an embodiment, the target nucleic acid comprises the nucleotide sequence of exon 45 of the dystrophin gene. In embodiments, the target nucleic acid comprises a nucleotide sequence comprising an intron-exon junction of exon 45 of the dystrophin gene. In embodiments, the target nucleic acid comprises a region of the intron nucleotide sequence upstream (or 5') of exon 45 of the dystrophin gene.

As used herein, the term "mRNA" refers to an RNA molecule that encodes a protein and includes pre-mRNA and mature mRNA. "Pre-mRNA" refers to a eukaryotic mRNA molecule that is newly synthesized directly after transcription of DNA. In embodiments, the pre-mRNA is capped with a 5 'cap, modified with a 3' poly a tail, and/or spliced to produce a mature mRNA sequence. In embodiments, the pre-mRNA comprises one or more introns. In embodiments, the pre-mRNA undergoes a process known as splicing to remove introns and join exons. In embodiments, the pre-mRNA comprises a polyadenylation site.

As used herein, the terms "splicing" and "processing" refer to the modification of post-transcriptional pre-mRNA in which introns are removed and exons are joined. Splicing occurs in a series of reactions catalyzed by a large RNA-protein complex (called a spliceosome) composed of five small nuclear ribonucleoproteins (snrnps). Within the intron, splicing requires a3 'splice site, a 5' splice site, and a branching site. The RNA component of snRNP interacts with introns and may be involved in catalysis

As used herein, the term "exon" refers to a portion of pre-mRNA that is typically contained in mature mRNA after splicing.

As used herein, the term "intron" refers to a portion of a pre-mRNA that is typically not included in the mature mRNA after splicing.

As used herein, the term "flanking" refers to an intron immediately upstream (5 ') or downstream (3') of the associated exon. For example, a 5 'flanking intron of exon 44 refers to an intron immediately upstream of exon 44 (i.e., directly coupled to the 5' end of exon 44). For example, a3 'flanking intron of exon 44 refers to an intron immediately downstream of exon 44 (i.e., directly coupled to the 5' end of exon 44).

A "target pre-mRNA" is a pre-mRNA comprising a target sequence that hybridizes to AC.

A "target mRNA" is an mRNA sequence resulting from splicing of a target pre-mRNA sequence. In embodiments, the target mRNA does not encode a functional protein. In embodiments, the target mRNA retains one or more intron sequences.

As used herein, the term "gene" refers to a nucleic acid molecule having a nucleic acid sequence that encompasses the 5 'promoter region, as well as any introns and exons, associated with expression of a gene product, and the 3' untranslated region ("UTR") associated with expression of a gene product.

The "target gene" of the present disclosure refers to a gene encoding a target pre-mRNA.

"Target protein" refers to the amino acid sequence encoded by the target mRNA. In embodiments, the target protein may not be a functional protein.

"Wild-type target protein" refers to a natural functional protein isoform produced by a wild-type, normal or unmutated form of a target gene. Wild-type target protein also refers to a protein produced from a target pre-mRNA that has been spliced correctly.

As used herein, the term "transcript" refers to an RNA molecule transcribed from DNA and includes, but is not limited to, mRNA, mature mRNA, pre-mRNA, and partially processed RNA.

As used herein, "re-spliced target protein" refers to a protein encoded by mRNA produced by splicing of target pre-mRNA hybridized to AC. The re-spliced target protein may be identical to the wild-type target protein, may be homologous to the wild-type target protein, may be a functional variant of the wild-type target protein, or may be an active fragment of the wild-type target protein.

As used herein, "functional fragment" or "active fragment" refers to a portion of a eukaryotic wild-type target protein that exhibits activity, such as one or more activities of a full-length wild-type target protein, or has another activity. In embodiments, a re-spliced target protein sharing at least one biological activity of a wild-type target protein is considered an active fragment of the wild-type target protein. The activity may be any percentage (i.e., more or less) of the activity of the full-length wild-type target protein, including but not limited to about 1% of the activity compared to the wild-type target protein, about 2%, about 3%, about 4%, about 5%, about 10%, about 20%, about 30%, about 40%, about 50%, about 60%, about 70%, about 80%, about 90%, about 95%, about 96%, about 97%, about 98%, about 99%, about 100%, about 200%, about 300%, about 400%, about 500% or more (including all values and ranges between these values). Thus, in embodiments, the active fragment may retain at least a portion of one or more biological activities of the wild-type target protein. In embodiments, the active fragment may enhance one or more biological activities of the wild-type target protein.

As used herein, the term "nucleoside" means a glycosylamine comprising a nucleobase and a sugar. Nucleosides include, but are not limited to, natural nucleosides, abasic nucleosides, modified nucleosides and nucleosides having mimicking bases and/or sugar groups. A "natural nucleoside" or "unmodified nucleoside" is a nucleoside comprising a natural nucleobase and a natural sugar. Natural nucleosides include RNA nucleosides and DNA nucleosides.

As used herein, the term "natural sugar" refers to a sugar of a nucleoside that has not been modified for its naturally occurring form in RNA (2 '-OH) or DNA (2' -H).

As used herein, the term "nucleotide" refers to a nucleoside having a phosphate group covalently linked to a sugar. The nucleotide may be modified with any of a variety of substituents.

As used herein, the term "nucleobase" refers to a nucleoside or a base portion of a nucleotide. A nucleobase may comprise any atom or group of atoms capable of hydrogen bonding with a base of another nucleic acid. A natural nucleobase is a nucleobase that has not been modified for its natural form of presence in RNA or DNA.

As used herein, the term "heterocyclic base moiety" refers to a nucleobase comprising a heterocycle.

As used herein, "oligonucleotide" refers to an oligonucleotide in which the internucleoside linkages do not contain a phosphorus atom.

As used herein, the term "oligonucleotide" refers to an oligomeric compound comprising a plurality of linked nucleotides or nucleosides. In certain embodiments, one or more nucleotides of the oligonucleotide are modified. In embodiments, the oligonucleotide comprises ribonucleic acid (RNA) or deoxyribonucleic acid (DNA). In embodiments, the oligonucleotides are comprised of natural and/or modified nucleobases, sugars, and covalent internucleoside linkages, and may also include non-nucleic acid conjugates.

As used herein, "internucleoside linkage" refers to covalent linkage between adjacent nucleosides.

As used herein, "natural internucleoside linkage" refers to 3 'to 5' phosphodiester linkage.

As used herein, the term "modified internucleoside linkage" refers to any linkage between nucleosides or nucleotides other than naturally occurring internucleoside linkages.

As used herein, the term "chimeric antisense compound" or "chimeric AC" refers to an antisense compound having at least one sugar, nucleobase, and/or internucleoside linkage that is differentially modified compared to other sugar, nucleobase, and internucleoside linkages within the same oligomeric compound. The remaining sugar, nucleobase and internucleoside linkages can be independently modified or unmodified. In general, chimeric oligomeric compounds will have modified nucleosides that can be located at separate positions or combined together in regions that will define a particular motif. Any combination of modifying and/or mimicking groups may comprise the chimeric oligomeric compounds as described herein.

As used herein, the term "mixed backbone antisense oligonucleotide" refers to an antisense oligonucleotide in which at least one internucleoside linkage of the antisense oligonucleotide is different from at least one other internucleoside linkage of the antisense oligonucleotide.

As used herein, the term "nucleobase complementarity" refers to a nucleobase that is capable of base pairing with another nucleobase. For example, in DNA, adenine (a) is complementary to thymine (T). For example, in RNA adenine (A) is complementary to uracil (U). In embodiments, complementary nucleobases refer to nucleobases in an antisense compound that are capable of base pairing with nucleobases of their target nucleic acids. For example, if a nucleobase at a position of an antisense compound is capable of hydrogen bonding with a nucleobase at a position of a target nucleic acid, then the position of hydrogen bonding between the oligonucleotide and the target nucleic acid is considered to be complementary at that nucleobase pair.

As used herein, the term "non-complementary nucleobases" refers to a pair of nucleobases that do not form hydrogen bonds with each other or otherwise support hybridization.

As used herein, the term "complementary" refers to the ability of an oligomeric compound to hybridize to another oligomeric compound or nucleic acid by nucleobase complementarity. In embodiments, an antisense compound and its target are complementary to each other when a sufficient number of corresponding positions in each molecule are occupied by nucleobases that can bond to each other to allow stable association between the antisense compound and the target. One skilled in the art recognizes that it is possible to include mismatches without eliminating the ability of the oligomeric compounds to remain associated. Thus, described herein are antisense compounds that can comprise up to about 20% mismatched nucleotides (i.e., not nucleobase complementary to the corresponding nucleotide of the target). Preferably, the antisense compound contains no more than about 15%, more preferably no more than about 10%, most preferably no more than 5% mismatch or no mismatch. The remaining nucleotides are nucleobase complementary or do not disrupt hybridization (e.g., universal bases). One of ordinary skill in the art will recognize that the compounds provided herein are at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% nucleobases complementary to a target nucleic acid.

As used herein, "hybridization" means pairing of complementary oligomeric compounds (e.g., antisense compounds and their target nucleic acids). Although not limited to a particular mechanism, the most common mechanism of pairing involves hydrogen bonding, which may be Watson-Crick, huo Siting, or reverse Huo Siting hydrogen bonding between complementary nucleoside or nucleotide bases (nucleobases). For example, the natural base adenine is a nucleobase complementary to the natural nucleobases thymidine and uracil, which pair by forming hydrogen bonds. The natural base guanine is a nucleobase complementary to the natural bases cytosine and 5-methylcytosine. Hybridization may occur under different conditions.

As used herein, the term "specific hybridization" refers to the ability of an oligomeric compound to hybridize to one nucleic acid site with greater affinity than to another nucleic acid site. In embodiments, the antisense oligonucleotide specifically hybridizes to more than one target site. In embodiments, the oligomeric compounds specifically hybridize to their targets under stringent hybridization conditions.

The term "modulation" refers to a perturbation of expression, function or activity when compared to the level of expression, function or activity prior to modulation. Modulation may include an increase (stimulation or induction) or decrease (inhibition or decrease) in expression, function or activity. In embodiments, modulation may include perturbing splice site selection during pre-mRNA processing.

The term "inhibit/inhibiting/inhibition" refers to a decrease in activity, expression, function, or other biological parameter, and may include, but does not require, complete elimination of the activity, expression, function, or other biological parameter. Inhibition may include, for example, at least about a 10% reduction in activity, response, disorder or disease as compared to a control. In embodiments, expression, activity, or function of the gene or protein is reduced by a statistically significant amount.

As used herein, the term "expression" refers to all functions and steps that convert the coding information of a gene into the structure that is present and manipulated in a cell. Such structures include, but are not limited to, products of transcription and translation.

As used herein, the term "2' -modified" or "2' -substituted" means a sugar comprising a substituent other than H or OH at the 2' position. 2 '-modified monomers include, but are not limited to, BNA and monomers (e.g., nucleosides and nucleotides) having a2' -substituent such as allyl, amino, azido, thio, O-allyl, O-C ₁-C₁₀ alkyl 、-OCF₃、O-(CH₂)₂-O-CH₃、2'-O(CH₂)₂SCH₃、O-(CH₂)₂-O-N(R_m)(R_n), or O-CH ₂-C(＝O)-N(R_m)(R_n, wherein each Rm and Rn is independently H or a substituted or unsubstituted C ₁-C₁₀ alkyl.

As used herein, the term "MOE" refers to a 2' -O-methoxyethyl substituent.

As used herein, the term "high affinity modified nucleotide" refers to a nucleotide having at least one modified nucleobase, internucleoside linkage, or sugar moiety such that the modification increases the affinity of an antisense compound comprising the modified nucleotide to a target nucleic acid. High affinity modifications include, but are not limited to, BNA, LNA, and 2' -MOE.

As used herein, the term "mimetic" refers to a group that replaces sugar, nucleobase, and/or internucleoside linkages in AC. In general, mimetics are used in place of sugar or sugar-internucleoside linkage combinations and maintain nucleobases for hybridization to a selected target. Representative examples of glycomimetics include, but are not limited to, cyclohexenyl or morpholino. Representative examples of mimetics of sugar-internucleoside linkage combinations include, but are not limited to, peptide Nucleic Acids (PNAs) and morpholino groups linked by uncharged achiral linkages. In some cases, a mimetic is used instead of a nucleobase. Representative nucleobase mimics are well known in the art and include, but are not limited to, tricyclic phenoxazine analogs and universal bases (Berger et al, nuc Acid res.2000, 28:2911-14, incorporated herein by reference). Methods of synthesis of sugar, nucleoside and nucleobase mimetics are well known to those skilled in the art.

As used herein, the term "bicyclic nucleoside" or "BNA" refers to a nucleoside in which the furanose portion of the nucleoside includes a bridge connecting two atoms on the furanose ring, thereby forming a bicyclic ring system. BNA includes, but is not limited to, alpha-L-LNA, beta-D-LNA, ENA, oxyBNA (2 '-O-N (CH ₃)-CH₂ -4') and aminooxyBNA (2 '-N (CH ₃)-O-CH₂ -4').

As used herein, the term "4 'to 2' bicyclic nucleoside" refers to BNA in which the bridge connecting two atoms of the furanose ring bridges the 4 'carbon atom and the 2' carbon atom of the furanose ring, thereby forming a bicyclic ring system.

As used herein, "locked nucleic acid" or "LNA" refers to a nucleotide that is modified such that the 2 '-hydroxyl group of the ribosyl sugar ring is attached to the 4' carbon atom of the sugar ring via a methylene group, thereby forming a 2'-C,4' -C-oxymethylene linkage. LNAs include, but are not limited to, alpha-L-LNA and beta-D-LNA.

As used herein, the term "cap structure" or "terminal cap portion" refers to a chemical modification that has been incorporated into either end of an AC.

As used herein, the term "dosage unit" refers to a form that provides a pharmaceutical agent. In embodiments, the dosage unit is a vial comprising a lyophilized antisense oligonucleotide. In embodiments, the dosage unit is a vial comprising a reconstituted antisense oligonucleotide.

Various embodiments of the present invention have been described. Nevertheless, it will be understood that various modifications may be made without departing from the spirit and scope of the invention. Other embodiments are within the scope of the following claims.

All publications, patents and patent applications mentioned in this specification are indicative of the level of skill of those skilled in the art to which this invention pertains. All publications, patents and patent applications are herein incorporated by reference to the same extent as if each individual publication or patent application was specifically and individually indicated to be incorporated by reference.

Examples

Example 1 conjugation of oligonucleotides to cell penetrating peptides.

Conjugation of AC to CPP. As shown in fig. 1A, an AC oligonucleotide having a (NH ₂-(CH₂)₅-CH₂ -) linker at the 5' phosphorothioate terminus is conjugated to the CPPs disclosed herein via a carboxylate or N-hydroxysuccinimide ester (NHs) functional group on the peptide. As shown in fig. 1B, the AC oligonucleotide was conjugated to a Cell Penetrating Peptide (CPP) via amide bond formation (left) or click chemistry. The linker/CPP is mounted on either the 5 'or 3' end of the oligonucleotide.

As shown in fig. 2A and 2B, oligonucleotide-peptide conjugates were synthesized that did not have (fig. 2A) and that had (fig. 2B) a PEG (polyethylene glycol) linker inserted between the oligonucleotide moiety and the peptide. In the figure, "R" represents palmitoyl.

Exemplary antisense compounds are combined with or consist of the sequences found in tables 6A-6P and tables 7A-7O and tables 8A-8C. Exemplary CPPs and EEVs can be found throughout this disclosure.

Example 2. Splicing of correct DMD exon 45 in an in vitro model using cell penetrating peptide conjugated to an oligonucleotide.

The present study used an in vitro model to study the effect of the following substances on dystrophin expression: an antisense compound of table 6A-6P, table 7A-7O or table 8A-8C alone, a reverse complement thereof, or a sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% nucleic acid sequence identity thereto, or an AC of table 6A-6P, table 7A-7O or table 8A-8C, a reverse complement thereof, or a conjugate thereof having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% nucleic acid sequence identity thereto. The AC of tables 6A-6P, tables 7A-7O, or tables 8A-8C, or the reverse complement thereof, or a sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% nucleic acid sequence identity thereto restores the reading frame of the DMD gene.

The present study uses an in vitro model to study the effect of the following substances on dystrophin expression: antisense compounds that bind or comprise the sequences of tables 6A-6P or tables 7A-7O or tables 8A-8C, or their reverse complements, alone or in combination with cell penetrating peptide conjugates of AC and cell penetrating peptide that bind or comprise the sequences of tables 6A-6P or tables 7A-7O or tables 8A-8C, or their reverse complements. The reading frame of the DMD gene is restored by AC in combination with or consisting of the sequences of tables 6A-6P or tables 7A-7O or tables 8A-8C or their reverse complement.

In vitro model: the study used primary DMD myocytes and DMD-free myocytes. The study also used an immortalized muscle cell model of DMD. Human telomerase reverse transcriptase (hTert) and cyclin-dependent kinase 4 (CDK 4) expression vectors are used to transduce muscle cells derived from DMD patients to produce muscle stem cell lines with enhanced proliferation capacity. These models are described in the following publications: thorley et al Skelet muscle.2016;6:43. the study also used the CRL-2061 ^TM muscle cell line derived from rhabdomyosarcoma patients.

Study design: the compound comprising AC or its reverse complement of Table 6A-6P or Table 7A-7O or Table 8A-8C and the cyclic peptide is administered to an immortalized muscle cell, primary DMD muscle cell or muscle cell line (e.g., CRL-2061 ^TM). Total RNA was extracted from cells and analyzed by RT-PCR and Western blotting to visualize the efficiency of splice correction and to detect dystrophin products. The percentage of exon 45 correction product was evaluated.

Example 3 splicing of correct DMD exon 45 in animal models using cell penetrating peptide conjugated to oligonucleotide

The present study used a mouse model to study the effect of compositions comprising antisense compounds of tables 6A-6P or 7A-7O or tables 8A-8C, or their reverse complement, alone or conjugated to cell penetrating peptides, on dystrophin expression. The reading frame of the DMD gene was restored with the AC or its reverse complement of tables 6A-6P or tables 7A-7O or Table 8C.

Mouse model: veltrop et al PLoS one.2018 was used in this study; 13 (2): del52hDMD/mdx mice described in e 0193289. This document is incorporated by reference in its entirety. del52hDMD/mdx mice carry both the murine and human DMD genes. The model contained a termination mutation in exon 23 to prevent the expression of murine dystrophin. The model contains a deletion of exon 52 to prevent expression of human dystrophin. Mdx52 mice were also used in this study. mdx52 mice lack exon 52 of murine dystrophin. This mouse model is described in the following documents, which are incorporated herein by reference: aoki et al (2012), PNAS usa.109 (34): 1376-13768 and Araki et al Biochem Biophys Res Commun.1997, 9, 18 days; 238 (2): 492-7. Humanized DMD (hDMD) mice were also used. The hDMD mice contain the complete human dystrophin gene. This model is described in U.S. patent No. 9,078,911, incorporated herein by reference in its entirety.

The present study uses CD1 mice to evaluate the safety and tolerability of the compounds described herein. The test concentration range of the compound is 1mg/kg mouse body weight-1 g/kg mouse body weight.

The present study uses non-human primate (NHP) to evaluate the efficacy and safety of the compounds described herein.

Study design compositions comprising AC or its reverse complement and CPP of tables 6A-6P or tables 7A-7O or tables 8A-8C were applied to the mouse model described above to evaluate the ability of the compounds and AC to jump exon 45 to treat DMD. Compound and AC were administered to mice via Intramuscular (IM) or Intravenous (IV) injections at the following doses: 1mg/kg, 3mg/kg, 5mg/kg, 10mg/kg, 15mg/kg, 20mg/kg, 25mg/kg and 30mg/kg.

Total RNA was extracted from tissue samples and analyzed by RT-PCR and Western blotting to visualize the efficiency of splice correction and to detect dystrophin products. The percentage of exon 45 correction product was evaluated.

Compound synthesis and purification: compounds were synthesized according to the following procedure. The TFA-lysine protected cCPP was reacted with the AC of tables 8A-8C, followed by deprotection to provide cCPP-AC conjugates. Briefly, cCPP was pre-activated by reacting it with HATU (2.0 eq) and DIPEA (2.0 eq) in DMSO (10 mm,1.8 ml). After 10min at room temperature, the pre-activated solution was combined with a solution of AC in DMSO (10 mm,1.8 ml) and thoroughly mixed. The reaction was incubated at room temperature for 2 hours. The reaction was monitored by LCMS (Q-TOF) using BEH C18 column @1.7 Μm,2.1 mm. Times.50 mm), buffer A: water (0.1% fa), buffer B: acetonitrile (0.1% fa), flow rate: 0.4mL/min, starting from 2% buffer B and gradually increasing to 98% over 3.4 min. Upon completion, in situ deprotection of TFA protected lysine was initiated by dilution of the reaction mixture with 0.2M KCl (aqueous) pH 12 (36 mL). Using the analytical method described above, the reaction was monitored by LCMS (Q-TOF). The crude mixture was loaded directly onto a C18 reverse phase column (Oligo purity column, 150mm x 21.2 mm). The crude product was then purified using a gradient of 5% -20% over 60min using water containing 0.1% fa and acetonitrile as solvent and a flow rate of 20 mL/min. The fractions containing the desired product were combined and the pH of the solution was adjusted to 7 using 0.5M NaOH. The solution was frozen and lyophilized to give a white powder. Formate was exchanged with chloride by reconstitution of cCPP-AC conjugate in 1M aqueous NaCl solution and repeated washing (centrifugation at 3500rpm for 20-40 min) through 3-kD MW-cut-off amicon tube. The procedure was performed three times with 1M NaCl and three times with saline (0.9% NaCl, sterile, endotoxin free). The conductivity of the final filtrate was evaluated to confirm the appropriate salt concentration. The solution was further diluted with saline to the desired formulation concentration and sterile filtered in a biosafety cabinet. The concentration of each formulation was re-measured after filtration.

EEV-PMO-DMD45-5 was obtained (see table 10A) and the purity and identity of each formulation was assessed by liquid chromatography-mass spectrometry quadrupole time of flight mass spectrometry (QTOF-LCMS). EEV-PMO-DMD45-5 was 99% pure as determined by RP-FA and 73.3% pure as determined by CEX. MW calculated for C ₃₇₈H₆₀₉N₁₅₂O₁₂₄P₂₁ is 9917.42. MW identified by QTOF-LCMS is 9917.29. The endotoxin amount, residual free peptide, FA content and pH of the formulation were further determined.

EEV-PMO-DMD45-7 was obtained (see Table 10A) and purity and identity of each formulation was assessed by QTOF-LCMS. EEV-PMO-DMD45-7 was 99% pure as determined by RP-FA and 79.7% pure as determined by CEX. MW calculated for C ₃₈₇H₆₂₆N₁₅₆O₁₂₇P₂₂ is 10177.65. MW identified by QTOF-LCMS is 10177.60.

Example 4 in vitro screening of exon 45-targeted ACs

The purpose of this study was to evaluate the AC targeting exon 45.

Cell culture and treatment: human rhabdomyosarcoma cells (RMS cells, ATCC CRL-2061) were used in this study. RMS cells were cultured in RPMI1640 (ATCC) medium in T75 flasks at 37 ℃ and 5% co ₂, with 10% fbs (VWR) and lx penicillin-streptomycin (VWR) added. 1X 10 ⁵ RMS cells per well were inoculated overnight into RPMI1640 whole medium in 24-well plates. The next day, the medium was replaced with 1mL of fresh medium. 5. Mu.M or 10. Mu.M AC in Table 8 was added to the cell culture and mixed by rotation. Subsequently, 6. Mu.L of Endo-Porter (Gene-tools) was added to each well of cell culture (1 mL of medium) and immediately mixed by rotation. After 24 hours of incubation, cells were washed twice with PBS (VWR) and then 350 μl RLT lysis buffer (QIAGEN) was added to each well. The cells were allowed to stand at room temperature for 5 minutes for cell lysis and were blown (pipetted) several times with a pipette for thorough mixing. Lysates were collected in sample tube RB (QIAGEN).

RNA extraction: RNA was extracted using QIAcube according to the manufacturer's protocol. The concentration of total RNA was determined using a NanoDrop 8000 (Thermo FISHER SCIENTIFIC).

Nested PCR: nested PCR was performed using 200ng of total RNA extracted and QIAGEN OneStep RT-PCR kit for primary amplification. 50. Mu.L of reaction solution was prepared using DMD-specific primers according to the manufacturer's protocol.

Forward primer: 5'-CAATGCTCCTGACCTCTGTGC-3' A

Reverse primer: 5'-GCTCTTTTCCAGGTTCAAGTGG-3').

The procedure for RT-PCR was as follows:

reverse transcription is carried out at 50℃for 30min.

Initial PCR was performed at 95℃for 15min.

20 Cycles: denaturation at 95℃for 1min; annealing at 55 ℃ for 1min; extending at 72deg.C for 1min.

Finally, the temperature is 72 ℃ and the time is 5min.

Overnight at 4 ℃.

The secondary amplification PCR reaction was performed according to the manufacturer's protocolHot Start 2X Master Mix.

Forward primer: 5'-GTCTACAACAAAGCTCAGGTCG-3' the process of the preparation of the pharmaceutical composition,

Reverse primer: 5'-GCAATGTTATCTGCTTCCTCCAACC-3').

The procedure for RT-PCR was as follows:

95℃，1min。

25 cycles: denaturation at 95℃for 30s; annealing at 55 ℃ for 1min; extending at 72deg.C for 1min.

Finally, the temperature is 72 ℃ and the time is 5min.

Overnight at 4 ℃.

Gel electrophoresis analysis of exon skipping efficiency: 4. Mu.L of the above PCR product was analyzed by 2% E-Gel (Invitrogen ^TM) and imaged using E-Gel ^TM Power Snap electrophoresis system (Thermo FISHER SCIENTIFIC). The intensity of the skip and full length bands was analyzed. Polynucleotide level "a" for the band with exon 45 skipping and polynucleotide level "B" for the band without exon 45 skipping were measured using ImageJ. Based on these measurements of "a" and "B", the jump efficiency is determined by the following equation: jump efficiency (%) =a/(a+b) ×100.

Results: FIG. 4 shows exon 45 skipping efficiencies of the ACs of Table 9A.

TABLE 9A AC sequence of skip exon 45

The AC of table 9B was more efficient at skipping exon 45 than the approved exon 45 antisense oligonucleotide cassie Mo Sen.

TABLE 9A AC sequence of skip exon 45

Example 5: sustained and repeat dose effects on D2MDX mice following administration of EEV-PMO-MDX23-1

The method comprises the following steps: mice dosed intravenously with 20, 40 or 80mpk of the D2/MDX mice were substituted with PMO-EEV (EEV-PMO-MDX 23-1: eev=ac-PKKKRKV-miniPEG-K (loop (GfFGrGrQ)) -PEG ₁₂ -OH; pmo= 5'-GGCCAAACCTCGGCTTACCTGAAAT-3') to induce exon 23 skipping in the D2/MDX mouse model. Tissues were harvested at 1, 2, 4 and 8 weeks to test single dose duration effects and single dose range findings. D2/MDX mice were dosed weekly with 40mpk for 4 weeks and mice were sacrificed 1 week after the final dose to test for repeat dose effects.

Results: exon skipping was observed in all 4 tissues after a single dose of 20, 40 or 80mpk 1 week post injection (fig. 5). Exon skipping peaked 2 weeks after injection and was maintained in skeletal muscle (triceps and tibialis anterior) for at least 8 weeks (fig. 6). After 4 and 8 weeks, exon skipping was observed in the diaphragm and heart (fig. 6). Exon skipping was observed in all 4 tissues after 4 weekly administrations at 40mpk (fig. 7) (tissues were collected 1 week after the last administration).

Example 6: functional assay in D2DMX

The method comprises the following steps: 6 groups of male D2/MDX and DBA/2J (wild type) mice (n=8 per group) were given intravenously every 2 weeks for a total of 6 doses, including: WT vehicle (saline), d2.Mdx vehicle (saline); PMO alone (PMO-MDX 23:5'-GGCCAAACCTCGGCT TACCTGAAAT-3') or one of two EEV-PMO constructs targeting exon 23: (EEV-PMO-MDX 23-1: EEV=Ac-PKKKRKV-miniPEG-K (Ring (GfFGrGrQ)) -PEG ₁₂ -OH; PMO= 5'-GGCCAAACCTCGGCTTACCTGAAAT-3'; and EEV-PMO-MDX23-2(EEV＝Ac-PKKKRKV-Lys(FfΦ-GrGrQ)-PEG12-K(N3)-NH2;PMO＝5'-GGCCAAACCTCGGCTTACCTGAAAT-3'-C4COT).C4COT＝ cycloocta-2-alkyne-1-O- (CH ₂)₄ -O-C (O)) doses are listed above. Creatine kinase levels, grip strength and line hang time were measured once every 4 weeks for a total of 4 measurements using known methods.

Results: EEV-PMO-MDX23-1 80mpk q2w treatment produced longer suspension time than the other groups after 2 weeks after the first injection and continued to show statistically significant improvement, both 4 weeks and 8 weeks after the first injection, as compared to vehicle d2.MDX group (fig. 8). After 12 weeks of treatment, the line-up time of animals treated with EEV-PMO-MDX23-1 80mpk Q2W was statistically indistinguishable from WT animals (FIG. 8). Treatment with the loading doses of EEV-PMO-MDX23-1 40mpk q2w and EEV-PMO-MDX23-2 15mpk q2w showed significantly higher line-on-time from 8 weeks after the first treatment compared to vehicle d2.MDX group, stabilizing until 12 weeks of treatment, at which point signs of phenotype improvement became apparent for the first time (fig. 8). PMO alone treatment appeared to follow the same trend as vehicle d2.Mdx group and vehicle control group, lower at 2 weeks, but they were consistent with published 4, 8 and 12 week data, indicating that the test was performing correctly (fig. 8).

Serum CK was measured at 4 time points: pre-dose, and at 4, 8 and 12 weeks. The EEV-PMO treated group showed a significant decrease in serum CK, approaching the wild type. The PMO-only treated group showed no significant drop at all time points after treatment (fig. 9A-9B and fig. 10A-10B).

Grip strength was measured before dosing (fig. 11A) and at 12 weeks (fig. 11B). For PMO-EEV treated mice, a dose-dependent increase in grip strength was observed. Vehicle and PMO treated mice did not show significant improvement.

Example 7: hDMD and exon 45 skipping

The method comprises the following steps: casi Mo Sen, a commercial exon 45-skipping PMO (5'-CAATGCCATCCTGGAGTTCCTG-3'), was conjugated to EEV (Ac-PKKKRKV-AEEA-Lys- (cyclo [ FGFGRGRQ ]) -PEG12-OH; EEV-PMO-DMD 45-1) and used as a positive control for testing in vivo systems (hDMD).

DMD mice of 8-9 weeks old were intravenously injected with 40, 60, or 80mpk EEV-PMO-DMD45-1 positive controls. After 1 week, tissues were harvested and exon skipping was determined by PCR (Labchip quantification).

Results: dose-dependent exon 45 skipping in muscle tissue was observed: diaphragm (fig. 12A); heart (fig. 12B); biceps (fig. 12C); tibialis anterior (fig. 12D).

Example 8: library screening

The method comprises the following steps: the hDMD mice of 8-9 weeks of age were injected intravenously with 60mpk of EEV conjugated positive control (EEV conjugated Carxi Mo Sen; EEV-PMO-DMD 45-1) or EEV conjugated candidate PMO. The PMO sequence of each is shown in table 10A. EEV has the sequence Ac-PKKKRKV-AEEA-Lys- (cyclo [ FGFGRGRQ ]) -PEG12-OH.

After 1 week, tissues were harvested and exon skipping of Tia, TA, diaphragm and heart was determined by PCR.

Results: exon 45 skipping at 60mpk was greater for each exon 45EEV-PMO than for the positive control (EEV-PMO-DMD 45-1) (FIG. 13). Exon 45 skipping was tested at 30mpk to further differentiate the efficacy of candidate exon 45 EEV-PMO. The candidate EEV-PMO-DMD45-10, EEV-PMO-DMD45-11 and EEV-PMO-DMD45-3 exhibited lower efficacy in TA and diaphragm tissues than the other EEV-PMOs (FIG. 14). Low exon skipping abnormalities are consistent in all tissues and are female.

Example 9: patient-derived cell data

The method comprises the following steps: myoblasts from DMD delta 46-48 iPSC with mutations prone to exon 45 skipping were treated with EEV-PMO-DMD45-1 (positive control) and 10 EEV-PMO compounds (see table 10A) at 30 μm for 24 hours followed by 7 days of differentiation. Exon skipping was determined by RT-PCR. Dystrophin expression was determined by western blotting.

Results: all 10 EEV-PMOs showed excellent exon skipping and dystrophin expression compared to the positive control (fig. 15A-B).

Cardiomyocytes derived from DMD.DELTA.46-48 iPSC were treated with 30. Mu.M positive control (EEV-PMO-DMD 45-1), EEV-PMO-DMD45-5 or EEV-PMO-DMD45-7 for 24 hours and analyzed after 72 hours. Robust exon 45 skipping and dystrophin production were observed for all three constructs (fig. 15C-D).

Cardiomyocytes derived from DMDA46-48 iPSC were treated with 20, 10, 5 or 1. Mu.M positive control (EEV-PMO-DMD 45-1), EEV-PMO-DMD45-5 or EEV-PMO-DMD45-7 for 24 hours and analyzed after 72 hours. Robust exon 45 skipping was observed for all three constructs (fig. 15E).

Table 10A: EEV-PMO

* EEV sequence = Ac-PKKKRKV-AEEA-Lys- (cyclo [ FGFGRGRQ ]) -PEG12-OH.

Example 10: cell viability of human primary kidney proximal tubular epithelial cells (Renal Proximal Tubular EPITHELIAL CELL, RPTEC)

The method comprises the following steps: activity of RPTEC (kidney proximal tubular epithelial cells) of 10 exons 45PMO-EEV (see Table 10B below) was selected. The EEVs of the 10 PMO-EEVs have the sequence Ac-PKKKPKV-AEEA-Lys- (cyclo [ FGFGRGRQ ]) -PEG12-OH. The PMO sequences and concentration ranges tested for each construct are shown in table 10B.

Two positive controls were used: (1) Positive control (EEV-PMO-control: eev=ac-PKKKRKV-miniPEG-Lys (loop (Ff Φ GrGrQ) -AEEA-K (N ₃)), pmo= 5'-TGAAAACGCCGCCATTTCTCAACAG-3' -PEG4COT (PEG 4 cot=cycloocta-2-alkyne-1-O- (PEG 4) -O-C (O))), and (2) melittin (used at 16.6 uM.) PBS was used as vehicle (negative control).

EEV-PMO was resuspended in saline (1:2 serial dilutions) to achieve the concentration ranges shown in Table 10B. Data were normalized to melittin control.

Results: EEV-PMO-DMD45-2 showed some toxicity at only the two highest concentrations. EEV-PMO-DMD45-3, EEV-PMO-DMD45-4, and EEV-PMO-DMD45-5 were substantially non-toxic at the concentrations tested, but showed a decreasing trend of viability at the highest concentrations tested (FIG. 16). The EEV-PMO-DMD45-6, EEV-PMO-DMD45-7, EEV-PMO-DMD45-8 and EEV-PMO-DMD45-9 were substantially non-toxic at the tested concentrations (FIG. 17). EEV-PMO-DMD45-10 showed toxicity at the highest two concentrations tested; EEV-PMO-DMD45-11 is not significantly toxic at the concentrations tested; EEV-PMO-control was used as a positive control for toxicity (FIG. 18).

Subcellular localization of PMO alone (PMO), PMO conjugated to EEV (EEV-PMO), and PMO conjugated to EEV and nuclear localization signals (EEV-NLS-PMO) was determined.

Briefly, THP-1 monocytes were contacted with 3. Mu.M PMO, EEV-PMO or EEV-NLS-PMO and incubated for 24 hours and examined by LC-MS.

FIG. 22A shows whole cell uptake of PMO with EEV-PMO and EEV-NLS-PMO. Both EEV-PMO and EEV-NLS-PMO showed a significant increase in cellular uptake compared to PMO alone.

EEV-PMO and PMO: about 3 times

EEV-NLS-PMO and PMO: about 58 times

EEV-NLS-PMO and EEV-PMO: about 19 times

FIG. 22B shows subcellular localization of PMO and EEV-NLS-PMO in THP cells assayed using LC-MS/MS. As shown in FIG. 22B, EEV-PMO exhibits improved cell permeability compared to PMO alone. The addition of NLS further increases cell permeability. Fig. 22C shows nuclear uptake of the three constructs.

Claim (modification according to treaty 19)

1. A compound comprising an Endosomal Escape Vector (EEV) comprising a cyclic cell penetrating peptide (cCPP), an Exocyclic Peptide (EP), and a linker, the endosomal escape vector being conjugated to AC to form an EEV-conjugate of formula (C):

Or a protonated form thereof,

Wherein:

(a) In the cyclic cell penetrating peptide (cCPP):

r ₄ and R ₆ are independently H or an amino acid side chain;

Both R ₁、R₂、R₃ and R ₄ comprise phenylalanine;

Each m is independently an integer from 0 to 3;

n is an integer from 0 to 2;

x' is an integer from 2 to 20;

y is an integer from 1 to 5;

q is an integer from 1 to 4;

z' is an integer from 2 to 20;

(b) EP is a cyclic exopeptide comprising 4 to 8 amino acid residues comprising 1 or 2 amino acid residues comprising a side chain comprising a guanidine group or protonated form thereof and 2, 3 or 4 lysine residues; and

(C) AC is an oligonucleotide from any of tables 6A-6P or tables 7A-7O or tables 8A-8C, or a reverse complement thereof, or a sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% nucleic acid sequence identity thereto.

2. The compound of claim 1, wherein q is 1.

3. The compound of claim 1 or 2, wherein the EP has the structure: ac-PKKKRKV.

4. The compound of any one of claims 1-3, wherein the cyclic cell penetrating peptide comprises FGFGRGRQ.

5. The compound of any one of claims 1-4, wherein the EEV conjugated to the AC to form the compound of formula (C) comprises: ac-PKKKRKV-AEEA-Lys- (cyclo [ FGFGRGRQ ]) -PEG ₁₂ -OH.

6. The compound of any one of claims 1-5, wherein the AC comprises the sequence: 5'-ATGCCATCCTGGAGTTCCTGTA-3'.

7. The compound of any one of claims 1-5, wherein the AC comprises the sequence: 5'-ATGCCATCCTGGAGTTCCTGT-3'.

8. The compound of any one of claims 1-5, wherein the AC comprises the sequence: 5'-CCCAATGCCATCCTGGAGTTCCT-3'.

9. The compound of any one of claims 1-5, wherein the AC comprises the sequence: 5 'CCCAATGCCCATCCTGGGAGTTCCTG-3'.

10. The compound of any one of claims 1-5, wherein the AC comprises the sequence: 5'-CCCAATGCCATCCTGGAGTTCC-3'.

11. The compound of any one of claims 1-5, wherein the AC comprises the sequence: 5'-GCCCAATGCCATCCTGGAGTTCC-3'.

12. The compound of any one of claims 1-5, wherein the AC comprises the sequence: 5'-TGCCCAATGCCATCCTGGAGTTCCT-3'.

13. The compound of claim 1, wherein the AC comprises at least one modified nucleotide or nucleic acid selected from Phosphorothioate (PS) nucleotides, phosphorodiamidate morpholino nucleotides (PMO), locked Nucleic Acids (LNA), peptide Nucleic Acids (PNA), nucleotides comprising a 2' -O-methyl (2 ' -OMe) modified backbone, 2' O-methoxy-ethyl (2 ' -MOE) nucleotides, 2',4' constrained ethyl (cEt) nucleotides, and 2' -deoxy-2 ' -fluoro- β -D-arabinonucleic acid (2 ' f-ANA).

14. The compound of claim 1, wherein the cyclic peptide is conjugated to:

(a) The 3' end of the AC;

(b) The 5' end of the AC; or (b)

(C) The backbone of the AC.

15. The compound of claim 14, comprising a linker that conjugates the cyclic peptide to the AC, wherein the linker is covalently bound to:

(a) The 5' end of the AC;

(b) The 3' end of the AC; or (b)

(C) The backbone of the AC.

16. The compound of claim 15, wherein the linker is covalently bound to a side chain of an amino acid residue on the cyclic peptide.

17. The compound of claim 14 or 15, wherein the linker comprises:

(i) One or more D or L amino acid residues, each of which is optionally substituted;

(ii) Optionally substituted alkylene;

(iii) Optionally substituted alkenylene;

(iv) Optionally substituted alkynylene;

(v) Optionally substituted carbocyclyl;

(vi) Optionally substituted heterocyclyl;

(vii) One or more- (R ^1-J-R²) z "-subunits, wherein each of R ¹ and R ² is independently selected from alkylene, alkenylene, alkynylene, carbocyclyl, and heterocyclyl, each J is independently C, NR ³、-NR³ C (O) -, S, and O, wherein R ³ is independently selected from H, alkyl, alkenyl, alkynyl, carbocyclyl, and heterocyclyl, each of which is optionally substituted, and z" is an integer from 1 to 50;

(viii) - (R ^1- J) z "-or- (J-R ¹) z" -, wherein each of R ¹ is independently in each occurrence alkylene, alkenylene, alkynylene, carbocyclyl, or heterocyclyl, each J is independently C, NR ³、-NR³ C (O) -, S, or O, wherein R ³ is H, alkyl, alkenyl, alkynyl, carbocyclyl, or heterocyclyl, each of which is optionally substituted, and z "is an integer from 1 to 50; or (ix) combinations thereof.

18. The compound of claim 17, wherein the linker comprises:

(i) Lysine, glycine, beta-alanine, 4-aminobutyric acid, 5-aminopentanoic acid, 6-aminopentanoic acid or combinations thereof;

(ii) One or more- (R ^1-J-R²) z "-subunits,

(Iii) - (R ^1- J) z '-or- (J-R ¹) z' -, or

(Iv) A combination thereof.

19. The compound of claim 17 or 18, wherein each R ¹ and R ² is independently alkylene, each J is O, each x is independently an integer from 1 to 20, and each z is independently an integer from 1 to 20.

20. The compound of claim 15, wherein the linker comprises:

(i) - (OCH ₂CH₂)_z -subunit wherein z is an integer from 2 to 20;

(ii) One or more amino acid residues comprising residues selected from the group consisting of glycine, beta-alanine, 4-aminobutyric acid, 5-aminopentanoic acid, 6-aminopentanoic acid, and combinations thereof; or (b)

(Iii) A combination of (i) and (ii).

21. The compound of any one of claims 15-20, wherein the linker has the structure:

Wherein:

each AA is independently glycine, β -alanine, 4-aminobutyric acid, 5-aminopentanoic acid, or 6-aminopentanoic acid;

AA _SC is an amino acid side chain;

x is an integer from 1 to 10;

y is an integer from 1 to 5; and

Z is an integer from 1 to 10.

22. The compound of any one of claims 15-20, wherein the linker has the structure:

Wherein:

x is an integer from 2 to 20;

y is an integer from 1 to 5;

z is an integer from 2 to 20;

M is a binding moiety; and

AA _SC is an amino acid residue of the cyclic peptide.

23. The compound of claim 22, wherein M is selected from:

-C(O)、

Wherein: r ¹ is alkylene, cycloalkyl or Wherein m is an integer from 0 to 10, wherein each R is independently alkyl, alkenyl, alkynyl, carbocyclyl, or heterocyclyl, and wherein each B is independently selected from nucleobases.

24. The compound of claim 22, wherein M is-C (O).

25. The compound of any one of claims 21-24, wherein z is 11.

26. The compound of any one of claims 21-25, wherein x is 1.

27. The compound of claim 1, wherein the amino group on the side chain of each lysine residue is substituted with a trifluoroacetyl (-COCF ₃) group, an allyloxycarbonyl (Alloc), a 1- (4, 4-dimethyl-2, 6-dioxocyclohexylidene) ethyl (Dde) or a (4, 4-dimethyl-2, 6-dioxocyclohex-1-ylidene-3) -methylbutyl (ivDde) group.

28. The compound of any one of claims 1-27, wherein the EP comprises at least 2 amino acid residues having a hydrophobic side chain.

29. The compound of claim 28, wherein the amino acid residue having a hydrophobic side chain is selected from valine, proline, alanine, leucine, isoleucine and methionine.

30. The compound of any one of claims 1-29, wherein the exocyclic peptide comprises one of the following sequences ：PKKKRKV;KR;RR;KKK;KGK;KBK;KBR;KRK;KRR;RKK;RRR;KKKK;KKRK;KRKK;KRRK;RKKR;RRRR;KGKK;KKGK;KKKKK;KKKRK;KBKBK;KKKRKV;PGKKRKV;PKGKRKV;PKKGRKV;PKKKGKV;PKKKRGV; or PKKKRKG.

31. The compound of any one of claims 1-30, comprising the structure:

Wherein:

x is an integer from 2 to 20;

y is an integer from 1 to 5;

z is an integer from 2 to 20;

EP is a cyclic exopeptide;

m is a binding moiety;

AC is an antisense compound complementary to a target sequence comprising exon 45 of the DMD gene in the pre-mRNA sequence; and

AA _SC is an amino acid residue of the cyclic peptide.

32. The compound of any one of claims 1-31, wherein the cyclic peptide comprises 4 to 20 amino acid residues in the cyclic peptide, wherein at least two amino acid residues comprise a guanidino group or protonated form thereof, and at least two amino acid residues independently comprise a hydrophobic side chain.

33. The compound of any one of claims 1-32, wherein the cyclic peptide comprises 2,3, or 4 acid residues containing a guanidino group or protonated form thereof.

34. The compound of claim 32 or 33, wherein the cyclic peptide comprises 2, 3 or 4 amino acid residues comprising a hydrophobic side chain.

35. The compound of any one of claims 32-34, wherein the cyclic peptide comprises at least one amino acid comprising a side chain selected from the group consisting of Or a protonated form thereof.

36. The compound of any one of claims 32-35, wherein the cyclic peptide comprises 1, 2,3, or 4 amino acids comprising a side chain selected from the group consisting of Or a protonated form thereof.

37. The compound of any one of claims 32-36, wherein the cyclic peptide comprises at least one glycine residue.

38. The compound of any one of claims 32-37, wherein the cyclic peptide comprises 1,2,3, or 4 glycine residues.

39. The compound of any one of claims 1-38, wherein the cyclic peptide has the structure of formula (I):

Or a protonated form thereof,

Wherein:

r ₄ and R ₆ are independently H or an amino acid side chain;

AA _SC is an amino acid side chain;

q is 1, 2, 3 or 4; and

Each m is independently an integer of 0,1, 2 or 3.

40. The compound of claim 39, wherein R ₄ is H or an amino acid residue comprising a side chain containing an aromatic group.

41. The compound of claim 39 or 40, wherein the amino acid comprising a side chain comprising an aromatic group is phenylalanine.

42. The compound of any one of claims 39-41, wherein both of R ₁、R₂、R₃ and R ₄ are H.

43. The compound of any one of claims 39-43, wherein the cyclic peptide has the structure of formula (I-1) or (I-2):

Or a protonated form thereof,

Wherein:

AA _SC is an amino acid side chain; and

Each m is independently an integer from 0 to 3.

44. The compound of any one of claims 39-43, wherein the cyclic peptide has the structure of formula (V-1):

Or a protonated form thereof,

Wherein:

AA _SC is an amino acid side chain; and

Each m is independently an integer from 0 to 3.

45. The compound of any one of claims 39-44, wherein AA _SC comprises a side chain of an asparagine residue, an aspartic acid residue, a glutamine residue, a glutamic acid residue, a homoglutamic acid residue, or a homoglutamine residue.

46. The compound of any one of claims 39-45, wherein AA _SC comprises a side chain of a glutamine residue.

47. The compound of any one of claims 1-46, wherein the AC comprises a sequence shown in any one of tables 6A-6P or 7A-7O or 8A-8C, or a reverse complement thereof, or a sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% nucleic acid sequence identity thereto.

48. The compound of any one of claims 1-47, having the structure of formula (C-1) or formula (C-2):

or a protonated form thereof;

Wherein the AC is selected from the group consisting of the oligonucleotides shown in tables 6A-6P or tables 7A-7O or any of tables 8A-8C, or reverse complement thereof, or a sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% nucleic acid sequence identity thereto.

49. The compound of any one of claims 1-48, wherein the AC is complementary to a portion of the 5' flanking intron of exon 45 and a portion of exon 45.

50. A pharmaceutical composition comprising a compound of any one of claims 1-49.

51. A cell comprising a compound of any one of claims 1-49.

52. A method of treating DMD, the method comprising administering to a patient in need thereof a compound of any one of claims 1-50 or a pharmaceutical composition of claim 50.

Claims

1. A compound, the compound comprising:

(a) A cyclic peptide; and

2. The compound of claim 1, wherein the AC comprises at least one modified nucleotide or nucleic acid selected from Phosphorothioate (PS) nucleotides, phosphorodiamidate morpholino nucleotides (PMO), locked Nucleic Acids (LNA), peptide Nucleic Acids (PNA), nucleotides comprising a 2' -O-methyl (2 ' -OMe) modified backbone, 2' O-methoxy-ethyl (2 ' -MOE) nucleotides, 2',4' constrained ethyl (cEt) nucleotides, and 2' -deoxy-2 ' -fluoro- β -D-arabinonucleic acid (2 ' f-ANA).

3. The compound of claim 1, wherein the AC comprises a nucleic acid sequence set forth in tables 6A-6P or tables 7A-7O or tables 8A-8C, or a reverse complement thereof, or a sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% nucleic acid sequence identity thereto.

4. The compound of claim 1, wherein the cyclic peptide is conjugated to:

(a) The 3' end of the AC;

(b) The 5' end of the AC; or (b)

(C) The backbone of the AC.

5. The compound of any one of claims 4-6, comprising a linker that conjugates the cyclic peptide to the AC, wherein the linker is covalently bound to:

(a) The 5' end of the AC;

(b) The 3' end of the AC; or (b)

(C) The backbone of the AC.

6. The compound of claim 5, wherein the linker is covalently bound to a side chain of an amino acid residue on the cyclic peptide.

7. The compound of claim 5 or 6, wherein the linker comprises:

(ii) Optionally substituted alkylene;

(iii) Optionally substituted alkenylene;

(iv) Optionally substituted alkynylene;

(v) Optionally substituted carbocyclyl;

(vi) Optionally substituted heterocyclyl;

8. The compound of claim 7, wherein the linker comprises:

(ii) One or more- (R ^1-J-R²) z "-subunits,

(Iii) - (R ^1- J) z '-or- (J-R ¹) z' -, or

(Iv) A combination thereof.

9. The compound of claim 8 or 9, wherein each R ¹ and R ² is independently alkylene, each J is O, each x is independently an integer from 1 to 20, and each z is independently an integer from 1 to 20.

10. The compound of claim 5, wherein the linker comprises:

(i) - (OCH ₂CH₂)_z -subunit wherein z is an integer from 2 to 20;

(Iii) A combination of (i) and (ii).

11. The compound of any one of claims 5-10, wherein the linker has the structure:

Wherein:

AA _SC is an amino acid side chain;

x is an integer from 1 to 10;

y is an integer from 1 to 5; and

Z is an integer from 1 to 10.

12. The compound of any one of claims 5-10, wherein the linker has the structure:

Wherein:

x is an integer from 2 to 20;

y is an integer from 1 to 5;

z is an integer from 2 to 20;

M is a binding moiety; and

AA _SC is an amino acid residue of the cyclic peptide.

13. The compound of claim 12, wherein M is selected from:

14. The compound of claim 12, wherein M is-C (O).

15. The compound of any one of claims 11-14, wherein z is 11.

16. The compound of any one of claims 11-15, wherein x is 1.

17. The compound of any one of the preceding claims, comprising an Exocyclic Peptide (EP) conjugated to the cyclic peptide or linker.

18. The compound of any one of claims 5-17, comprising an Exocyclic Peptide (EP) conjugated to an amino group of the linker.

19. The compound of claim 17 or 18, wherein the EP comprises 2 to 10 amino acid residues.

20. The compound of any one of claims 17-19, wherein the EP comprises 4 to 8 amino acid residues.

21. The compound of any one of claims 17-20, wherein the EP comprises 1 or 2 amino acids comprising a side chain comprising a guanidino group or protonated form thereof.

22. The compound of any one of claims 17-21, wherein the EP comprises 1,2, 3 or 4 lysine residues.

23. The compound of claim 22, wherein the amino group on the side chain of each lysine residue is substituted with a trifluoroacetyl (-COCF ₃) group, an allyloxycarbonyl (Alloc), a 1- (4, 4-dimethyl-2, 6-dioxocyclohexylidene) ethyl (Dde) or a (4, 4-dimethyl-2, 6-dioxocyclohex-1-ylidene-3) -methylbutyl (ivDde) group.

24. The compound of any one of claims 17-23, wherein the EP comprises at least 2 amino acid residues having a hydrophobic side chain.

25. The compound of claim 24, wherein the amino acid residue having a hydrophobic side chain is selected from valine, proline, alanine, leucine, isoleucine and methionine.

26. The compound of any one of claims 17-25, wherein the exocyclic peptide comprises one of the following sequences ：PKKKRKV;KR;RR;KKK;KGK;KBK;KBR;KRK;KRR;RKK;RRR;KKKK;KKRK;KRKK;KRRK;RKKR;RRRR;KGKK;KKGK;KKKKK;KKKRK;KBKBK;KKKRKV;PGKKRKV;PKGKRKV;PKKGRKV;PKKKGKV;PKKKRGV; or PKKKRKG.

27. The compound of any one of claims 17-25, wherein the exocyclic peptide has the structure: ac-PKKKRKV-.

28. The compound of any one of the preceding claims, comprising the structure:

Wherein:

x is an integer from 2 to 20;

y is an integer from 1 to 5;

z is an integer from 2 to 20;

EP is a cyclic exopeptide;

m is a binding moiety;

AA _SC is an amino acid residue of the cyclic peptide.

29. The compound of any one of claims 1-28, wherein the cyclic peptide comprises 4 to 20 amino acid residues in the cyclic peptide, wherein at least two amino acid residues comprise a guanidino group or protonated form thereof, and at least two amino acid residues independently comprise a hydrophobic side chain.

30. The compound of any one of claims 1-29, wherein the cyclic peptide comprises 2, 3, or 4 acid residues containing a guanidino group or protonated form thereof.

31. The compound of claim 29 or 30, wherein the cyclic peptide comprises 2,3 or 4 amino acid residues comprising a hydrophobic side chain.

32. The compound of any one of claims 29-31, wherein the cyclic peptide comprises at least one amino acid comprising a side chain selected from the group consisting of Or a protonated form thereof.

33. The compound of any one of claims 29-32, wherein the cyclic peptide comprises 1, 2, 3, or 4 amino acids comprising a side chain selected from the group consisting of Or a protonated form thereof.

34. The compound of any one of claims 29-33, wherein the cyclic peptide comprises at least one glycine residue.

35. The compound of any one of claims 29-34, wherein the cyclic peptide comprises 1,2, 3, or 4 glycine residues.

36. The compound of any one of the preceding claims, wherein the cyclic peptide has the structure of formula (I):

Or a protonated form thereof,

Wherein:

r ₄ and R ₆ are independently H or an amino acid side chain;

AA _SC is an amino acid side chain;

q is 1, 2, 3 or 4; and

Each m is independently an integer of 0,1, 2 or 3.

37. The compound of claim 36, wherein R ₄ is H or an amino acid residue comprising a side chain comprising an aromatic group.

38. The compound of claim 36 or 37, wherein the amino acid comprising a side chain comprising an aromatic group is phenylalanine.

39. The compound of any one of claims 36-38, wherein both of R ₁、R₂、R₃ and R ₄ comprise phenylalanine.

40. The compound of any one of claims 36-39, wherein both of R ₁、R₂、R₃ and R ₄ are H.

41. The compound of any one of claims 36-40, wherein the cyclic peptide has the structure of formula (I-1) or (I-2):

Or a protonated form thereof,

Wherein:

AA _SC is an amino acid side chain; and

Each m is independently an integer from 0 to 3.

42. The compound of any one of claims 36-41, wherein the cyclic peptide has the structure of formula (V-1):

Or a protonated form thereof,

Wherein:

AA _SC is an amino acid side chain; and

Each m is independently an integer from 0 to 3.

43. The compound of any one of claims 36-42, wherein AA _SC comprises a side chain of an asparagine residue, an aspartic acid residue, a glutamine residue, a glutamic acid residue, a homoglutamic acid residue, or a homoglutamine residue.

44. The compound of any one of claims 36-42, wherein AA _SC comprises a side chain of a glutamine residue.

45. The compound of any one of claims 1-44, wherein the AC comprises a sequence shown in any one of tables 6A-6P or 7A-7O or 8A-8C, or a reverse complement thereof, or a sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% nucleic acid sequence identity thereto.

46. The compound of any one of claims 1-45, wherein the cyclic peptide comprises FGFGRGR.

47. The compound of any one of claims 1-45, comprising a structure of formula (C):

Or a protonated form thereof,

Wherein:

r ₄ and R ₆ are independently H or an amino acid side chain;

EP is a cyclic exopeptide as defined herein;

Each m is independently an integer from 0 to 3;

n is an integer from 0 to 2;

x' is an integer from 2 to 20;

y is an integer from 1 to 5;

q is an integer from 1 to 4;

z' is an integer from 2 to 20;

Wherein the AC is an oligonucleotide from any of tables 6A-6P or tables 7A-7O or tables 8A-8C, or a reverse complement thereof, or a sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% nucleic acid sequence identity thereto.

or a protonated form thereof;

51. A cell comprising a compound of any one of claims 1-49.

52. A method of treating DMD, the method comprising administering to a patient in need thereof a compound or pharmaceutical composition of any one of claims 1-49.