JP7065928B2

JP7065928B2 - Storage system and its control method

Info

Publication number: JP7065928B2
Application number: JP2020185556A
Authority: JP
Inventors: 一樹松上; 義裕吉井; 伸光高岡; 智大川口
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2020-11-06
Filing date: 2020-11-06
Publication date: 2022-05-12
Anticipated expiration: 2038-03-27
Also published as: JP2021039771A

Description

The present invention relates to a storage system.

A storage system generally includes one or more storage devices. Each storage device typically contains, for example, HDDs (Hard Disk Drives) or SSDs (Solid State Drives) as its underlying storage drives. The storage system is accessed by one or more higher-level devices (for example, host computers) over a network such as a SAN (Storage Area Network) or a LAN (Local Area Network). Storage devices generally improve their reliability by using a high-reliability method based on RAID (Redundant Array of Independent (or Inexpensive) Disks) technology.

Patent Document 1 discloses an information system that can compress data while maintaining the data write speed seen by the host computer. According to Patent Document 1, the storage device provides a first volume that accepts data writes from the host computer and a second volume that compresses and manages the data on the first volume. When a data write from the host computer to the first volume finishes, the storage device returns a response to the host computer indicating that the write processing is complete. The storage device then compresses the data and stores it in the second volume at a time that is asynchronous with the data write from the host computer.

Non-Patent Document 1 discloses a method for deduplication processing, which consolidates duplicate data written from the host computer into a single copy, that achieves both good response time and good throughput by switching the timing of the processing according to the utilization of the storage device.

For example, Non-Patent Document 1 states: "The two methods differ in their IOPS and latency characteristics, and the hybrid method proposed in this paper uses each where appropriate to obtain the low latency of the dedup-back method and the high IOPS of the dedup-through method." It also states: "In this paper, we compared the conventional dedup-through method, which performs deduplication synchronously, with the dedup-back method, which performs deduplication asynchronously. We clarified the high IOPS performance of the dedup-through method and its high latency due to the overhead of synchronous deduplication, as well as the low latency of the dedup-back method and its IOPS degradation as tail latency increases, and we proposed a hybrid method that combines the two to achieve both high IOPS and low latency."

That is, according to Non-Patent Document 1, when the utilization of the storage device is low, deduplication is performed after the data write from the host computer has finished, which shortens the response time; when the utilization is high, deduplication is performed at the same time as the data write.
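The switching rule described above can be illustrated with a minimal Python sketch. The function name, the utilization scale of 0.0 to 1.0, and the threshold of 0.5 are assumptions made for illustration; the text does not state how Non-Patent Document 1 measures utilization or where it switches.

```python
def choose_dedup_mode(utilization: float, threshold: float = 0.5) -> str:
    """Pick a deduplication timing from the storage device utilization.

    utilization: fraction of capacity in use, 0.0 to 1.0 (hypothetical scale).
    threshold:   switching point (hypothetical value, not from the source).
    """
    if not 0.0 <= utilization <= 1.0:
        raise ValueError("utilization must be between 0.0 and 1.0")
    if utilization < threshold:
        # Low load: acknowledge the write first, deduplicate afterwards.
        return "dedup-back"
    # High load: deduplicate at the same time as the write.
    return "dedup-through"
```

For example, `choose_dedup_mode(0.2)` selects the asynchronous dedup-back mode and `choose_dedup_mode(0.9)` selects the synchronous dedup-through mode.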

US Patent Application Publication No. 2009/0144496

Jun Kato, Hiroki Otsuji, Kosuke Suzuki, Mitsuru Sato, Eiji Yoshida: "Speeding up writing in in-memory deduplication", Research Report Computer System Symposium, November 28, 2016, pp. 51-59

To protect written data in accordance with RAID technology, the amount of data required for redundancy (a parity cycle) must be collected. Because the data must be protected on the cache memory until a parity cycle's worth has been collected, the data on the cache memory is duplicated. This applies both to data written from the host computer and to compressed data. In this case, the maximum data write speed is limited by the amount of cache access caused by reading and duplicating the data.

One conceivable way to reduce the amount of cache access is to compress the data synchronously with the write and thereby omit duplication of the uncompressed data. However, the compressed data must be duplicated before a completion response can be returned to the host computer, so the response is delayed by the time taken for compression.

This problem is not limited to storage systems with a compression function. It can also arise in storage systems with other data reduction functions such as deduplication, and in storage systems that perform encryption, redundancy, or similar processing.

A representative example of the present invention that solves at least one of the above problems is as follows. Namely, it is a storage system comprising a first storage control unit, a second storage control unit, and a storage drive that is connected to at least the first storage control unit and has a non-volatile storage medium, wherein: the first storage control unit has a first cache area for storing data and a first buffer area for storing data; the second storage control unit has a second cache area for storing data and a second buffer area for storing data; the first storage control unit duplicates data stored in the first cache area by also storing it in the second cache area; upon receiving a data write command from a host computer, the first storage control unit stores the data targeted by the write command in the first cache area of the first storage control unit, duplicates it by storing the data stored in the first cache area in the second cache area of the second storage control unit, and, once the duplication is complete, sends the host computer a response indicating that the write of the data has finished; and, for data to be stored in the storage drive among the duplicated data stored in either of the cache areas as the target of the write command, the first storage control unit compresses the write-target data read from the cache area and stores it in the first buffer area without storing it in the first cache area, generates parity based on the compressed data stored in the first buffer area and stores the parity in the first buffer area, and reads the data and parity stored in the first buffer area and sends them to the storage drive for storage.

According to one aspect of the present invention, performing everything from compression through storage in the storage device as a single operation makes it possible to omit duplication of the compressed data. Because the compressed data no longer needs to be duplicated, the amount of cache access is reduced and the maximum data write speed can be improved. In addition, because the uncompressed data is kept in the cache memory until storage of the compressed data in the storage device is complete, the data is protected even if a device failure occurs during compression, storage in the storage device, or other processing.

Problems, configurations, and effects other than those described above will become clear from the following description of the embodiments.

An explanatory diagram showing a data write procedure with data compression processing executed by the storage system of Example 1 of the present invention.
A block diagram showing the configuration of the storage device of Example 1 of the present invention.
An explanatory diagram showing a configuration example of the VOL management table held by the storage device of Example 1 of the present invention.
An explanatory diagram showing a configuration example of the pool configuration management table held by the storage device of Example 1 of the present invention.
An explanatory diagram showing a configuration example of the RAID configuration management table held by the storage device of Example 1 of the present invention.
An explanatory diagram showing a configuration example of the pool allocation management table held by the storage device of Example 1 of the present invention.
An explanatory diagram showing a configuration example of the drive allocation management table held by the storage device of Example 1 of the present invention.
An explanatory diagram showing a configuration example of the logical storage hierarchy managed by the storage device of Example 1 of the present invention.
An explanatory diagram showing a configuration example of the memory allocation management table held by the storage device of Example 1 of the present invention.
A diagram showing a configuration example of memory allocation in the storage device of Example 1 of the present invention.
A flowchart showing the read processing executed by the storage device of Example 1 of the present invention.
A flowchart showing the write processing executed by the storage device of Example 1 of the present invention.
A flowchart showing the destage processing executed by the storage device of Example 1 of the present invention.
A flowchart showing destage processing with a modified exclusion procedure, executed by the storage device of Example 1 of the present invention.

In the following description, an "interface unit" may include at least one of a user interface unit and a communication interface unit. The user interface unit may include at least one I/O device among one or more I/O devices (for example, an input device such as a keyboard and a pointing device, and an output device such as a display device) and a display computer. The communication interface unit may include one or more communication interface devices. The one or more communication interface devices may be one or more devices of the same type (for example, one or more NICs (Network Interface Cards)) or two or more devices of different types (for example, a NIC and an HBA (Host Bus Adapter)).

In the following description, a "memory unit" includes one or more memories. At least one memory may be a volatile memory or a non-volatile memory. The memory unit is used mainly during processing by the processor unit.

In the following description, a "processor unit" includes one or more processors. At least one processor is typically a CPU (Central Processing Unit). A processor may include a hardware circuit that performs part or all of the processing.

In the following description, information may be described using expressions such as "xxx table", but the information may be represented by any data structure. That is, to show that the information does not depend on the data structure, an "xxx table" can also be called "xxx information". The configuration of each table in the following description is an example: one table may be divided into two or more tables, and all or part of two or more tables may be combined into one table.

In the following description, when elements of the same type are described without distinction, the common part of their reference signs is used; when they are distinguished, the full reference sign (or the element's ID, for example an identification number) may be used. For example, when the storage controllers are not distinguished, they are written as "storage controller 22"; when they are distinguished, they are written as "storage controller 1_22A" and "storage controller 2_22B". The same applies to other elements (for example, cache area 203, buffer area 202, and addresses 1100, 1101, and 1104).

In the following description, a "storage system" includes one or more storage devices. At least one storage device may be a general-purpose physical computer. At least one storage device may also be a virtual storage device, and may execute SDx (Software-Defined anything). As SDx, for example, SDS (Software Defined Storage), which is an example of a virtual storage device, or SDDC (Software-defined Datacenter) can be adopted.

Embodiments of the present invention are described below with reference to the drawings.

Example 1 of the present invention is described below.

<Procedure for storing compressed data in a storage device>
FIG. 1 is an explanatory diagram showing a data write procedure with data compression processing executed by the storage system 100 of Example 1 of the present invention.

The storage system 100 consists of a host computer 30 and a storage device 11. The host computer 30 is connected to the storage device 11 via a network 31 and is managed by a management computer (not shown). The storage device 11 has one or more volumes (logical storage areas). The host computer 30 may be a physical computer or a virtual computer running on a physical computer. The host computer 30 may also be a virtual computer running in the storage system.

The host computer 30 writes data to the storage controller 1_22A or the storage controller 2_22B of the storage device 11. The following describes data write processing with compression, initiated by the host computer 30, in this storage system 100.

This example describes the case in which the storage controller 1_22A receives the write command from the host computer 30.

A specific example is as follows.

(S1) The storage device 11 receives a write command from the host computer 30 via the network 31. The write command includes the data and the allocation destination address 1100 of the data. When the write command is received, the write processing from S2 onward starts.

(S2) In response to the write command, the storage device 11 secures exclusion of the slot indicated by the allocation destination address 1100. This prevents the data in that slot from being updated by another write command. A "slot" is an area in a volume (VOL). Specifically, a slot in this example is the unit of management for matters such as whether the data has been written to the drive 29 and whether it has been transferred to the buffer area 202. This area is called a "slot" in this example, but it may be called by another name.

"Securing exclusion of a slot" is an operation that prevents reads and writes to the slot indicated by the address specified in a read command or write command from the host computer 30; information that allows the host computer 30 to recognize that exclusion has been secured is managed. This information may be of any type, such as a bitmap or time information, as long as it allows identification. In this example, a "slot" is an area in a VOL (for example, a TP-VOL, which is a VOL based on thin provisioning), whereas a "data area" is an area allocated to a slot (for example, a pool area, which is an area in a pool).

(S3) In the cache area 203A within the storage controller 1_22A of the storage device 11, the data is stored at the address 1100A corresponding to the allocation destination address 1100 of the data.

(S4) The storage controller 1_22A transfers the data stored in the cache area 203A to the storage controller 2_22B. The storage controller 2_22B stores the received data at the address 1100B in the cache area 203B corresponding to the allocation destination address 1100 and returns a response to the storage controller 1_22A, which completes the duplication within the storage device 11.

(S5) After the duplication is complete, the storage device 11 returns a write completion response to the host computer 30 via the network 31. At this point, the host computer 30 recognizes that the write is complete.

(S6) The storage controller 1_22A selects the data to be written from the cache area 203A to the drive, compresses the selected data, and stores it at the address 1101A in the buffer area 202A. This processing is repeated until a parity cycle's worth of data has accumulated in the buffer area 202A.

As described later, the storage controller 1_22A may store the selected data at the address 1101A as-is without compressing it, or may perform processing other than compression (for example, deduplication or encryption) and store the processed data at the address 1101A.

(S7) When the amount of data in the buffer area 202A reaches a parity cycle's worth, the storage controller 1_22A generates parity data from the stored data and stores it at the address 1104A in the buffer area 202A.

(S8) The storage controller 1_22A writes the compressed data and parity data in the buffer area 202A to the drive 29 (destage processing).

(S9) When the destage processing is complete, the storage controller 1_22A releases the exclusion of the slot secured in (S2).

The above is an example of the write processing.
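The flow of steps S1 to S9 can be sketched in Python as follows. This is an illustrative model only, not the patented implementation: the class and attribute names, the parity cycle of three blocks, the use of `zlib` for compression, zero-padding of compressed blocks, and XOR parity are all assumptions introduced here. The point it illustrates is that only the uncompressed data in the cache areas is duplicated, while the compressed data and parity pass through a single, non-duplicated buffer on their way to the drive.

```python
import zlib
from functools import reduce

PARITY_CYCLE = 3  # hypothetical: data blocks per stripe before parity is generated


def xor_parity(blocks):
    """XOR equal-length blocks column by column (assumed RAID 5 style parity)."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


class StorageDevice:
    def __init__(self):
        self.cache_a = {}    # cache area 203A on storage controller 1
        self.cache_b = {}    # cache area 203B on storage controller 2
        self.buffer_a = []   # buffer area 202A (never duplicated)
        self.locked = set()  # slots whose exclusion is currently held
        self.drive = []      # stands in for drive 29

    def write(self, address, data):
        if address in self.locked:        # S2: secure exclusion of the slot
            raise RuntimeError("slot is locked")
        self.locked.add(address)
        self.cache_a[address] = data      # S3: store in cache area 203A
        self.cache_b[address] = data      # S4: duplicate into cache area 203B
        return "write complete"           # S5: respond to the host

    def destage(self):
        pending = sorted(self.locked)[:PARITY_CYCLE]
        if len(pending) < PARITY_CYCLE:
            return False                  # a full parity cycle is not yet available
        # S6: compress cached data into the buffer; pad blocks to equal length.
        compressed = [zlib.compress(self.cache_a[a]) for a in pending]
        width = max(len(c) for c in compressed)
        self.buffer_a = [c.ljust(width, b"\0") for c in compressed]
        # S7: generate parity from the buffered data and keep it in the buffer.
        self.buffer_a.append(xor_parity(self.buffer_a))
        # S8: write compressed data and parity to the drive.
        self.drive.append({
            "addresses": pending,
            "lengths": [len(c) for c in compressed],
            "blocks": list(self.buffer_a),
        })
        self.buffer_a.clear()
        # S9: release the exclusion secured in S2.
        self.locked.difference_update(pending)
        return True
```

In this sketch the uncompressed copies stay in both cache areas throughout `destage`, so a failure between S6 and S8 would lose only the buffer contents, which can be regenerated from the cache.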

<Storage device>
FIG. 2 is a block diagram showing the configuration of the storage device 11 of Example 1 of the present invention.

The storage device 11 has one or more storage controllers 22 and a plurality of drives 29 connected to the one or more storage controllers 22.

The storage controller 22 includes an FE_I/F (front-end interface device) 23 that communicates with the host computer 30, a storage I/F (storage interface device) 28 for communication between storage devices, a processor 24 that controls the entire device, a memory 25 that stores programs and information used by the processor 24, a BE_I/F (back-end interface device) 27 that communicates with the drives 29, and an internal network 26 that connects them.

The memory 25 has a program area 201 that manages programs, a buffer area 202 that is a temporary storage area used when transferring and copying data, a cache area 203 that temporarily stores write data from the host computer 30 (data written in response to a write command) and read data from the drives 29 (data read in response to a read command), and a table management area 206 that stores various tables.

The cache area 203 has an uncompressed data storage area 204 that temporarily stores write data from the host computer 30, and a compressed data storage area 205 that stores compressed data. The table management area 206 stores a VOL management table 207 that holds information about VOLs, a pool configuration management table 208 that holds information about pools, a RAID configuration management table 209 that holds information about the RAID configuration, a pool allocation management table 210 that holds information about pool allocation, a drive allocation management table 211 that holds information about drive allocation, and a memory allocation management table 212 that holds information about memory allocation.

The drive 29 is a device having a non-volatile data storage medium, and may be, for example, an SSD (Solid State Drive) or an HDD (Hard Disk Drive). The plurality of drives 29 may form a plurality of RAID groups (also called parity groups). Each RAID group consists of one or more drives 29.

The FE_I/F 23, the BE_I/F 27, and the storage I/F 28 are examples of the interface unit. The memory 25 is an example of the memory unit. The processor 24 is an example of the processor unit.

<VOL management table>
FIG. 3 is an explanatory diagram showing a configuration example of the VOL management table 207 held by the storage device 11 of Example 1 of the present invention.

The VOL management table 207 has an entry for each VOL. Each entry stores information such as a VOL_ID 41, a VOL attribute 42, a VOL capacity 43, and a pool ID 44. The following takes one VOL (the "target VOL" in the description of FIG. 3) as an example.

The VOL_ID 41 is the ID of the target VOL. The VOL attribute 42 indicates the attributes of the target VOL (for example, whether the target VOL is a VOL to which thin provisioning is applied or a normal VOL, and whether compression is enabled). The VOL capacity 43 indicates the capacity of the target VOL. The pool ID 44 is the ID of the pool associated with the target VOL.

In destage processing, the processor 24 can determine whether a VOL requires data compression by referring to the VOL attribute 42 in the VOL management table 207. For example, if the VOL attribute 42 is "compression enabled", the processor 24 performs data compression processing.
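As a rough illustration of this lookup, the following Python sketch models a VOL management table entry and the compression check made at destage time. The field types, the split of the VOL attribute 42 into two booleans, and the function name `needs_compression` are assumptions for illustration; the source specifies only which pieces of information an entry holds.

```python
from dataclasses import dataclass


@dataclass
class VolEntry:
    vol_id: int                # VOL_ID 41
    thin_provisioned: bool     # part of VOL attribute 42 (assumed encoding)
    compression_enabled: bool  # part of VOL attribute 42 (assumed encoding)
    capacity_bytes: int        # VOL capacity 43
    pool_id: int               # pool ID 44


def needs_compression(table: dict, vol_id: int) -> bool:
    """Return True when the VOL's attribute says compression is enabled."""
    return table[vol_id].compression_enabled


# Hypothetical table contents, keyed by VOL_ID.
vol_table = {
    0: VolEntry(0, True, True, 10 * 2**30, 0),
    1: VolEntry(1, False, False, 20 * 2**30, 0),
}
```

With this table, a destage routine would compress data for VOL 0 and write data for VOL 1 uncompressed.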

<Configuration management table>
FIG. 4 is an explanatory diagram showing a configuration example of the pool configuration management table 208 held by the storage device 11 of Example 1 of the present invention.

A pool is a logical storage area built on one or more RAID groups. The pool configuration management table 208 has an entry for each pool. Each entry stores information such as a pool ID 51, a RAID group ID 52, a pool capacity 53, and a pool used capacity 54. The following takes one pool (the "target pool" in the description of FIG. 4) as an example.

The pool ID 51 is the ID of the target pool. The RAID group ID 52 is the ID of each of the one or more RAID groups on which the target pool is based. The pool capacity 53 indicates the capacity of the target pool. The pool used capacity 54 indicates the total size of the areas within the target pool's capacity that are allocated to VOLs.

FIG. 5 is an explanatory diagram showing a configuration example of the RAID configuration management table 209 held by the storage device 11 of Example 1 of the present invention.

The RAID configuration management table 209 has an entry for each RAID group. Each entry stores information such as a RAID group ID 61, a RAID level 62, a drive ID 63, a drive type 64, a capacity 65, and a used capacity 66. The following takes one RAID group (the "target RAID group" in the description of FIG. 5) as an example.

The RAID group ID 61 is the ID of the target RAID group. The RAID level 62 indicates the type of RAID algorithm applied to the target RAID group. The drive ID 63 is the ID of each of the one or more drives that make up the target RAID group. The drive type 64 indicates the type of the drives that make up the target RAID group (for example, HDD or SSD). The capacity 65 indicates the capacity of the target RAID group. The used capacity 66 indicates how much of the target RAID group's capacity is in use.

<Allocation management table>
FIG. 6 is an explanatory diagram showing a configuration example of the pool allocation management table 210 held by the storage device 11 of Example 1 of the present invention.

プール割当管理テーブル２１０は、ＶＯＬアドレス（ＶＯＬ内のスロットを示すアドレス）毎にエントリを有する。各エントリは、ＶＯＬ＿ＩＤ７１、ＶＯＬアドレス７２、プールＩＤ７３、プールアドレス７４、圧縮前サイズ７５、圧縮後サイズ７６、及び圧縮率７７といった情報を格納する。以下、１つのＶＯＬアドレス（図６の説明において「対象ＶＯＬアドレス」）を例に取る。 The pool allocation management table 210 has an entry for each VOL address (an address indicating a slot in the VOL). Each entry stores information such as VOL_ID71, VOL address 72, pool ID73, pool address 74, pre-compression size 75, post-compression size 76, and compression ratio 77. Hereinafter, one VOL address (“target VOL address” in the description of FIG. 6) will be taken as an example.

VOL_ID 71 is the ID of the VOL to which the slot identified by the target VOL address belongs. The VOL address 72 is the target VOL address. The pool ID 73 is the ID of the pool that contains the data area allocated to the target VOL address. The pool address 74 is the address (an address belonging to the pool) of the data area allocated to the target VOL address. The pre-compression size 75 indicates the size, before compression, of the data written by a write command specifying the target pool address. The post-compression size 76 indicates the size of that data after compression. The compression ratio 77 is the value of post-compression size 76 / pre-compression size 75.
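One entry of the pool allocation management table 210 can be pictured as a small record. The following is a minimal sketch only: the field names, the byte unit, and the sample values are assumptions made for illustration, and only the relation "compression ratio 77 = post-compression size 76 / pre-compression size 75" comes from the description.

```python
from dataclasses import dataclass


@dataclass
class PoolAllocationEntry:
    vol_id: int                 # VOL_ID 71
    vol_address: int            # VOL address 72
    pool_id: int                # pool ID 73
    pool_address: int           # pool address 74
    pre_compression_size: int   # pre-compression size 75 (bytes assumed)
    post_compression_size: int  # post-compression size 76 (bytes assumed)

    @property
    def compression_ratio(self) -> float:
        # compression ratio 77 = post-compression size 76 / pre-compression size 75
        return self.post_compression_size / self.pre_compression_size


# Hypothetical sample entry: 8 KiB of data compressed to 2 KiB.
pool_entry = PoolAllocationEntry(
    vol_id=0, vol_address=0x100, pool_id=1, pool_address=0x2000,
    pre_compression_size=8192, post_compression_size=2048,
)
```

With this definition a smaller ratio means a larger reduction, so the sample entry above has a ratio of 0.25.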

図７は、本発明の実施例１のストレージ装置１１が保持するドライブ割当管理テーブル２１１の構成例を示す説明図である。 FIG. 7 is an explanatory diagram showing a configuration example of the drive allocation management table 211 held by the storage device 11 of the first embodiment of the present invention.

ドライブ割当管理テーブル２１１は、プールアドレス毎にエントリを有する。各エントリは、プールＩＤ８１、プールアドレス８２、ＲＡＩＤグループＩＤ８３、ドライブＩＤ８４及びドライブアドレス８５といった情報を格納する。以下、１つのプールアドレス（図７の説明において「対象プールアドレス」）を例に取る。 The drive allocation management table 211 has an entry for each pool address. Each entry stores information such as pool ID 81, pool address 82, RAID group ID 83, drive ID 84 and drive address 85. Hereinafter, one pool address (“target pool address” in the description of FIG. 7) will be taken as an example.

プールＩＤ８１は、対象プールアドレスが属するプールのＩＤである。プールアドレス８２は、対象プールアドレスである。ＲＡＩＤグループＩＤ８３は、対象プールアドレスが示すデータ領域の基になっているＲＡＩＤグループのＩＤである。ドライブＩＤ８４は、対象プールアドレスが示すデータ領域の基になっているドライブのＩＤである。ドライブアドレス８５は、対象プールアドレスに対応したドライブアドレスである。 The pool ID 81 is the ID of the pool to which the target pool address belongs. The pool address 82 is a target pool address. The RAID group ID 83 is the ID of the RAID group that is the basis of the data area indicated by the target pool address. The drive ID 84 is the ID of the drive that is the basis of the data area indicated by the target pool address. The drive address 85 is a drive address corresponding to the target pool address.

＜Logical storage hierarchy＞
FIG. 8 is an explanatory diagram showing a configuration example of a logical storage hierarchy managed by the storage device 11 of the first embodiment of the present invention.

The VOL 1000 is provided to the host computer 30. As a result of copy processing or deduplication processing, a plurality of slots in the VOL 1000 may point to one pool address, and slots of a plurality of VOLs may also point to one pool address. In the example of FIG. 8, two different slots (VOL addresses) 1100 and 1103 point to the same pool address 1101. The allocation from the VOL 1000 to the pool 1001 is managed based on the pool allocation management table 210. The allocation from the pool 1001 to the drive address space 1003 (that is, the plurality of drive address spaces provided by the plurality of drives 29 constituting the RAID group 1002) is managed based on the drive allocation management table 211.
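The two-level mapping described here can be sketched with two dictionaries. This is an illustration under assumptions: the reference numerals 1100, 1103, and 1101 from FIG. 8 are reused as if they were address values, and the RAID group ID, drive ID, and drive address are hypothetical.

```python
# Pool allocation management table 210:
# (VOL_ID, VOL address) -> (pool ID, pool address)
pool_allocation = {
    (0, 1100): (1, 1101),
    (0, 1103): (1, 1101),  # deduplicated: same pool address as slot 1100
}

# Drive allocation management table 211:
# (pool ID, pool address) -> (RAID group ID, drive ID, drive address)
drive_allocation = {
    (1, 1101): (0, 3, 0x7F000),  # hypothetical values
}


def resolve(vol_id, vol_address):
    """Follow VOL address -> pool address -> drive address."""
    pool_key = pool_allocation[(vol_id, vol_address)]
    return drive_allocation[pool_key]
```

Because both slots map to the same pool address, `resolve(0, 1100)` and `resolve(0, 1103)` return the same drive location.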

＜Memory allocation management table＞
FIG. 9 is an explanatory diagram showing a configuration example of the memory allocation management table 212 held by the storage device 11 of the first embodiment of the present invention.

メモリ割当管理テーブル２１２は、ＶＯＬアドレス（スロットを示すアドレス）毎にエントリを有する。各エントリは、ＶＯＬ＿ＩＤ９１、ＶＯＬアドレス９２、バッファ（ＢＦ）アドレス９３、圧縮後ＶＯＬアドレス９４、キュー状態９５及びＢＦ転送状態９６といった情報を格納する。以下、１つのＶＯＬアドレス（図９の説明において「対象ＶＯＬアドレス」）を例に取る。 The memory allocation management table 212 has an entry for each VOL address (address indicating a slot). Each entry stores information such as VOL_ID91, VOL address 92, buffer (BF) address 93, compressed VOL address 94, queue state 95, and BF transfer state 96. Hereinafter, one VOL address (“target VOL address” in the description of FIG. 9) will be taken as an example.

VOL_ID 91 is the ID of the VOL to which the slot identified by the target VOL address belongs. The VOL address 92 is the target VOL address. The BF address 93 indicates the transfer-destination BF address of the data written with the target VOL address specified. The compressed VOL address 94 indicates the transfer-destination VOL address of the data that, among the data written with the target VOL address specified, was excluded from transfer to the BF. The queue state 95 indicates whether the data written with the target VOL address specified has been stored in the drive 29. In FIG. 9, the queue state 95 value "Dirty" means that the data has not yet been stored in the drive 29, and "Clean" means that it has. The BF transfer state 96 indicates whether the data written with the target VOL address specified has been compressed and transferred to the BF. When the transfer to the BF is complete, the value of the BF transfer state 96 is "transferred"; when no transfer has been performed, it is "none".
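As a rough illustration, one entry of the memory allocation management table 212 might be modeled as follows. The field names, the use of `None` for "no value", and the boolean encoding of the BF transfer state 96 are assumptions; the defaults mirror an entry just after an uncompressed write.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class QueueState(Enum):
    DIRTY = "Dirty"  # data not yet stored in the drive 29
    CLEAN = "Clean"  # data already stored in the drive 29


@dataclass
class MemoryAllocationEntry:
    vol_id: int                                   # VOL_ID 91
    vol_address: int                              # VOL address 92
    bf_address: Optional[int] = None              # BF address 93
    compressed_vol_address: Optional[int] = None  # compressed VOL address 94
    queue_state: QueueState = QueueState.DIRTY    # queue state 95
    bf_transferred: bool = False                  # BF transfer state 96


# Hypothetical entry for a slot that has just been written without compression.
mem_entry = MemoryAllocationEntry(vol_id=0, vol_address=0x100)
```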

図１０は、本発明の実施例１のストレージ装置１１におけるメモリ割当の構成例を示す図である。 FIG. 10 is a diagram showing a configuration example of memory allocation in the storage device 11 of the first embodiment of the present invention.

The cache area 203 provides the storage controller 22 with the uncompressed data storage area 204, which is a virtual address space corresponding to the VOL, and the compressed data storage area 205, which corresponds to pool addresses. A write command from the host computer 30 to the storage controller 22 causes an uncompressed data storage area 204 corresponding to the VOL address to be allocated. When the storage controller 22 compresses data asynchronously with the write command, it stores the compressed data in the buffer area 202 or in the compressed data storage area 205 within the cache area 203, in association with a pool address.

図１０の例では、ライトされたデータが格納されているＶＯＬ内のスロット１１００が、プールアドレスに対応したバッファ領域２０２上の領域１１０１を指している。ＶＯＬアドレスとプールアドレスの割当ては、プール割当管理テーブル２１０で管理される。また、バッファ領域２０２への割当てはメモリ割当管理テーブル２１２のＢＦアドレス９３で、圧縮データ格納領域への割当てはメモリ割当管理テーブル２１２の圧縮後ＶＯＬアドレス９４で、それぞれ管理される。 In the example of FIG. 10, the slot 1100 in the VOL in which the written data is stored points to the area 1101 on the buffer area 202 corresponding to the pool address. The allocation of the VOL address and the pool address is managed by the pool allocation management table 210. The allocation to the buffer area 202 is managed by the BF address 93 of the memory allocation management table 212, and the allocation to the compressed data storage area is managed by the compressed VOL address 94 of the memory allocation management table 212.

バッファ領域２０２では、バッファ領域内のデータ量がパリティサイクルのサイズに達すると、プロセッサ２４を介して非圧縮データ格納領域２０４とは対応しないパリティ１１０４が生成される。 In the buffer area 202, when the amount of data in the buffer area reaches the size of the parity cycle, a parity 1104 that does not correspond to the uncompressed data storage area 204 is generated via the processor 24.
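The accumulate-then-generate-parity behavior can be sketched as below. The description does not state how the parity 1104 is computed, so byte-wise XOR (as in RAID 5) is an assumption here, and `StripeBuffer` with its fixed-size blocks is a hypothetical simplification.

```python
from functools import reduce


def xor_parity(blocks):
    """XOR equal-length blocks byte by byte (RAID-5-style parity, assumed)."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


class StripeBuffer:
    """Collects blocks until one parity cycle's worth has been buffered."""

    def __init__(self, blocks_per_cycle, block_size):
        self.blocks_per_cycle = blocks_per_cycle
        self.block_size = block_size
        self.blocks = []

    def append(self, block):
        """Add one block; return (data_blocks, parity) once a cycle is full."""
        if len(block) != self.block_size:
            raise ValueError("block size mismatch")
        self.blocks.append(block)
        if len(self.blocks) < self.blocks_per_cycle:
            return None
        stripe, self.blocks = self.blocks, []
        return stripe, xor_parity(stripe)
```

The parity is derived only from what sits in the buffer, which matches the statement that it has no counterpart in the uncompressed data storage area 204.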

以下、本実施例で行われる処理の例を説明する。 Hereinafter, an example of the processing performed in this embodiment will be described.

＜Read processing＞
FIG. 11 is a flowchart showing a read process executed by the storage device 11 of the first embodiment of the present invention.

リード処理は、ホスト計算機３０からネットワーク３１を介してストレージ装置１１がリード命令を受けた場合に開始する。リード命令では、例えば、仮想ＩＤ（例えば、仮想ＶＯＬ＿ＩＤ）、アドレス、及びデータサイズが指定される。 The read process starts when the storage device 11 receives a read command from the host computer 30 via the network 31. In the read instruction, for example, a virtual ID (for example, virtual VOL_ID), an address, and a data size are specified.

Ｓ１２０１で、プロセッサ２４は、リード命令から特定されるスロットの排他を確保する。なお、スロット排他確保時に他の処理がスロットの排他を確保している場合、プロセッサ２４は、一定の時間待ってから、Ｓ１２０１を行う。 In S1201, the processor 24 secures the exclusion of the slot specified from the read instruction. When the slot exclusion is secured by another process, the processor 24 waits for a certain period of time before performing S1201.

In S1202, the processor 24 determines whether or not the read data exists in the cache area 203. If the determination result of S1202 is true, the process proceeds to S1204. If the determination result of S1202 is false, the processor 24, in S1203, transfers the read data from the RAID group to the buffer area 202. At this time, the processor 24 identifies the pool ID 73, the pool address 74, and the post-compression size 76 in the pool allocation management table 210 from the VOL_ID and the VOL address specified by the host computer 30, and then refers to the drive ID 84 and the drive address 85 in the drive allocation management table 211 to identify the storage location and size of the data.

In S1204, the processor 24 determines from the post-compression size 76 whether the read data in the buffer area 202 is compressed. If the data is compressed, the processor 24 decompresses it in S1205; otherwise, S1205 is skipped.

Ｓ１２０６で、プロセッサ２４はバッファ領域２０２上のリードデータをホスト計算機３０に転送する。ホスト計算機３０は、Ｓ１２０６のデータ転送が完了した時点でリード処理が終了したと認識する。 In S1206, the processor 24 transfers the read data on the buffer area 202 to the host computer 30. The host computer 30 recognizes that the read process is completed when the data transfer of S1206 is completed.

その後、プロセッサ２４は、Ｓ１２０５で、確保していたスロット排他を解除する。 After that, the processor 24 releases the reserved slot exclusion in S1205.
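The read flow can be summarized in a short sketch. Everything concrete here is an assumption for illustration: `ReadPath` and its dictionaries are hypothetical, a per-slot `threading.Lock` stands in for slot exclusion, a boolean flag replaces the check against the post-compression size 76, and `zlib` stands in for the unspecified compression algorithm.

```python
import threading
import zlib


class ReadPath:
    def __init__(self):
        self.slot_locks = {}  # slot -> threading.Lock (slot exclusion)
        self.cache = {}       # slot -> (payload, is_compressed)
        self.drive = {}       # slot -> (payload, is_compressed); stands in for the RAID group

    def read(self, slot):
        lock = self.slot_locks.setdefault(slot, threading.Lock())
        with lock:  # S1201: acquire slot exclusion; released when the block exits
            if slot in self.cache:                       # S1202: cache hit?
                payload, compressed = self.cache[slot]
            else:
                payload, compressed = self.drive[slot]   # S1203: stage from the drives
            if compressed:                               # S1204: compressed?
                payload = zlib.decompress(payload)       # S1205: decompress
            return payload                               # S1206: return data to the host
```

A blocking lock replaces the "wait for a certain time and retry" behavior described for S1201; a real controller would likely use a different exclusion mechanism.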

＜Write processing＞
FIG. 12 is a flowchart showing a write process executed by the storage device 11 of the first embodiment of the present invention.

The write process starts when the storage device 11 receives a write command from the host computer 30. In the following description, elements belonging to the storage controller 2_22A and the storage controller 2_22B are distinguished by the suffixes "A" and "B" attached to their reference numerals; for example, the processor 24 of the storage controller 2_22A is written as the processor 24A.

ホスト計算機３０からのライト命令には、割当て先アドレスが付随している。ストレージ装置１１は、Ｓ１３０１において割当て先アドレスが示すスロットの排他を確保する。なお、スロット排他確保と同時に、プロセッサ２４Ａは、データのライト先とするキャッシュ領域２０３Ａのスロット領域を割当てる。 An allocation destination address is attached to the write instruction from the host computer 30. The storage device 11 secures the exclusion of the slot indicated by the allocation destination address in S1301. At the same time as the slot exclusion is secured, the processor 24A allocates the slot area of the cache area 203A to which the data is written.

In S1302, the processor 24A responds to the host computer 30 with "Ready", indicating that it is ready for the write process. The processor 24A then receives the write data from the host computer 30 that received "Ready". After that, in S1303, the processor 24 determines whether the compression process needs to be executed synchronously with the write command. Depending on the load of the processor 24A, the amount of data written to the storage device 11, and the data length of the write data, the process branches to either case 1, in which the storage system 100 prioritizes response performance, or case 2, in which it prioritizes throughput performance. For example, the storage device 11 may hold conditions such as the following, and the processor 24A, upon receiving a write command, may determine based on the held conditions whether to prioritize response performance or throughput performance.

＜Case 1＞ Response priority
The conditions for prioritizing response performance include the following. For example, whether to prioritize response performance may be determined based on any one of the following conditions alone or on a combination of several of them. The same applies to the conditions for throughput performance described later.

（１）ストレージコントローラ２２の（すなわちプロセッサ２４の）負荷が所定の基準より低い (1) The load of the storage controller 22 (that is, the processor 24) is lower than the predetermined reference.

（２）ライトデータを圧縮した場合の圧縮率が所定の基準より低くなることが予想される (2) It is expected that the compression rate when the write data is compressed will be lower than the predetermined standard.

（３）書き込み先のボリュームに圧縮データを格納できない (3) Compressed data cannot be stored in the write destination volume

Regarding (1) above, if the determination result switches frequently near the predetermined reference, the load fluctuates unstably; to prevent this, the reference may be varied in multiple stages. Condition (1) may also be determined based on, for example, the amount of IO commands issued to the storage device 11. For example, the load may be determined to be low when the number of IO commands per unit time, or the amount of data written or read by IO commands, is less than a predetermined reference.

For (2) above, for example, when the size of the write data is smaller than a predetermined reference, it may be determined that the compression rate of the write data is low, that is, that compression cannot be expected to reduce the data. For (3) above, for example, when the VOL attribute 42 in the VOL management table 207 for the write-destination VOL is not "compression enabled", it may be determined that compressed data cannot be stored in the write-destination volume.
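One way to keep the load determination for condition (1) from flipping back and forth near a single reference is a two-threshold (hysteresis) check. This is only one possible reading of varying the reference in multiple stages; the class name and both threshold values are hypothetical.

```python
class LoadClassifier:
    """Two-threshold (hysteresis) load check; thresholds are hypothetical."""

    def __init__(self, low=0.4, high=0.6):
        self.low = low
        self.high = high
        self.is_high = False

    def update(self, load):
        """Feed the current load (0.0 to 1.0) and return True while load counts as high."""
        if self.is_high and load < self.low:
            self.is_high = False   # leave "high" only after dropping below the low mark
        elif not self.is_high and load > self.high:
            self.is_high = True    # enter "high" only after exceeding the high mark
        return self.is_high
```

A load that hovers around 0.5 then keeps whatever classification it last had, instead of toggling on every sample.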

For example, when the processor 24A is lightly loaded and response performance is prioritized, the determination in S1303 is false. In this case, in S1306, the processor 24A stores the received write data in the allocated cache area 203A. In S1307, the write data stored in the cache area 203A is transferred from the storage controller 1_22A to the storage controller 2_22B and stored in the cache area 203B, thereby duplicating the data.

Ｓ１３０８において、プロセッサ２４Ａは、メモリ割当管理テーブル２１２を更新する。なお、本ケースにおいてライトデータは未だ圧縮されていない。このため、データのライト先として割当てられたスロットのＶＯＬアドレスに対応するＢＦアドレス９３及び圧縮後ＶＯＬアドレス９４の値は無く、プロセッサ２４Ａは、キュー状態９５を“Dirty”に更新する。 In S1308, the processor 24A updates the memory allocation management table 212. In this case, the write data has not been compressed yet. Therefore, there is no value of the BF address 93 and the compressed VOL address 94 corresponding to the VOL address of the slot assigned as the data write destination, and the processor 24A updates the queue state 95 to “Dirty”.

次に、Ｓ１３０９において、ストレージ装置１１から、ネットワーク３１を介してホスト計算機３０に対してライト処理が完了したとして完了応答を返却する。完了応答を返却すると、Ｓ１３１０においてストレージ装置１１は確保していたスロットの排他を解放してライト処理を終了する。 Next, in S1309, the storage device 11 returns a completion response to the host computer 30 via the network 31 assuming that the write process is completed. When the completion response is returned, the storage device 11 releases the exclusion of the slot reserved in S1310 and ends the write process.

＜Case 2＞ Throughput priority
The conditions for prioritizing throughput performance include the following.

（４）ストレージコントローラ２２の（すなわちプロセッサ２４の）負荷が所定の基準より高い (4) The load of the storage controller 22 (that is, the processor 24) is higher than the predetermined reference.

（５）ライトデータを圧縮した場合の圧縮率が所定の基準より高くなることが予想される (5) It is expected that the compression rate when the write data is compressed will be higher than the predetermined standard.

ここで、上記（４）は、上記（１）と同様に、例えばストレージ装置１１に対するＩＯ命令の量に基づいて判定することができる。例えば、単位時間当たりのＩＯ命令の回数等が所定の基準より多い場合に、負荷が高いと判定されてもよい。 Here, the above (4) can be determined based on, for example, the amount of IO instructions for the storage device 11 as in the above (1). For example, when the number of IO instructions per unit time is larger than a predetermined standard, it may be determined that the load is high.

上記（５）は、例えば、ライトデータのサイズが所定の基準より大きい場合に、ライトデータの圧縮率が高い、すなわち圧縮によるデータ削減が見込まれると判定されてもよい。 In the above (5), for example, when the size of the write data is larger than a predetermined reference, it may be determined that the compression rate of the write data is high, that is, data reduction by compression is expected.
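The S1303 determination could combine conditions (1) to (5) in many ways. The sketch below shows one hypothetical combination; the function name, the parameter names, and the 64 KiB size threshold are assumptions, not values from the description.

```python
def compress_synchronously(load_is_high, write_size, compression_enabled,
                           size_threshold=64 * 1024):
    """Return True for case 2 (throughput priority), False for case 1 (response priority)."""
    if not compression_enabled:
        # Condition (3): the destination volume cannot hold compressed data.
        return False
    if write_size < size_threshold:
        # Conditions (2)/(5): small writes are assumed to compress poorly.
        return False
    # Conditions (1)/(4): compress in line with the write only under high load.
    return load_is_high
```

For instance, a 128 KiB write to a compression-enabled volume under high load would take the case 2 path, while the same write under low load would take case 1.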

For example, when the processor 24 is heavily loaded and throughput performance is prioritized, the determination in S1303 is true. In this case, in S1304, the processor 24A transfers the received write data to the buffer area 202A. Next, in S1305, the processor 24A compresses the data in the buffer.

In S1304 and S1305, the compression may be performed when the write data is stored in the buffer area 202A (that is, the data may be compressed before storage and the compressed data then stored in the buffer area 202A), or it may be performed within the buffer area 202A after the data is stored there. In either case, the compressed data ultimately resides in the buffer area 202A.

また、この圧縮は、バッファ領域２０２Ａ以外の記憶領域（例えばプロセッサ２４Ａ内のメモリ）において行われてもよい。 Further, this compression may be performed in a storage area other than the buffer area 202A (for example, a memory in the processor 24A).

ここで、圧縮は、ライトデータに対して行われる所定の処理の一例である。プロセッサ２４は、圧縮以外の処理、例えば、重複排除、暗号化又は冗長化等を行い、処理後のデータをバッファ領域２０２Ａに格納してもよい。後述する図１４のＳ１４１１についても同様である。 Here, compression is an example of predetermined processing performed on write data. The processor 24 may perform processing other than compression, for example, deduplication, encryption, redundancy, etc., and store the processed data in the buffer area 202A. The same applies to S1411 in FIG. 14, which will be described later.

次に、Ｓ１３０６において、プロセッサ２４Ａは、バッファ領域２０２Ａ内の圧縮データを、割当てたキャッシュ領域２０３Ａへ格納する。Ｓ１３０７において、ストレージコントローラ１＿２２Ａからストレージコントローラ２＿２２Ｂに対してキャッシュ領域２０３Ａに格納したライトデータを転送し、キャッシュ領域２０３Ｂに格納することで圧縮データの二重化を行う。 Next, in S1306, the processor 24A stores the compressed data in the buffer area 202A in the allocated cache area 203A. In S1307, the write data stored in the cache area 203A is transferred from the storage controller 1_22A to the storage controller 2_22B, and the compressed data is duplicated by storing the write data in the cache area 203B.

Ｓ１３０８において、プロセッサ２４Ａは、メモリ割当管理テーブル２１２を更新する。なお、本ケースにおいてライトデータは圧縮されており、圧縮データに対してアドレスが割当てられる。このため、データのライト先として割当てられたスロットのＶＯＬアドレスに対応する圧縮後ＶＯＬアドレス９４が更新される。また、ＢＦアドレス９３の値は無く、プロセッサ２４Ａは、キュー状態９５を“Dirty”に更新する。 In S1308, the processor 24A updates the memory allocation management table 212. In this case, the write data is compressed, and an address is assigned to the compressed data. Therefore, the compressed VOL address 94 corresponding to the VOL address of the slot assigned as the data write destination is updated. Further, there is no value of the BF address 93, and the processor 24A updates the queue state 95 to "Dirty".
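Both write cases share steps S1306 to S1309 and differ only in whether the data is compressed first. The following is a minimal sketch under assumptions: the two caches and the table are plain dictionaries, `zlib` stands in for the unspecified compression, and slot exclusion (S1301, S1310) is omitted.

```python
import zlib


def handle_write(data, sync_compress, cache_a, cache_b, table, slot):
    """Sketch of S1303 to S1309 for one slot."""
    if sync_compress:                      # S1303 true: case 2 (throughput priority)
        buffered = bytes(data)             # S1304: transfer to the buffer area
        payload = zlib.compress(buffered)  # S1305: compress in the buffer
    else:                                  # S1303 false: case 1 (response priority)
        payload = bytes(data)
    cache_a[slot] = payload                # S1306: store in cache area 203A
    cache_b[slot] = payload                # S1307: duplicate into cache area 203B
    table[slot] = {                        # S1308: update the memory allocation table
        "compressed": sync_compress,
        "queue_state": "Dirty",
    }
    return "completed"                     # S1309: completion response to the host
```

In both cases the queue state becomes "Dirty", because nothing has reached the drives yet; the data is written out later by the destage processing.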

＜Destage processing＞
FIG. 13 is a flowchart showing a destage process executed by the storage device 11 of the first embodiment of the present invention.

The destage processing is performed asynchronously after the write command from the host computer 30 to the storage device 11 has completed. The destage may be started when a write command completes, may be started periodically, or may be selected by judging the write amount from, for example, the consumption of the cache area 203.

When the destage processing starts, the storage device 11 determines in S1401 whether the target area of the destage processing belongs to the compressed data storage area 205 in the cache area. If the determination is true (that is, the target area belongs to the compressed data storage area 205), the processing of case 2-1 is performed; if it is false (that is, the target area belongs to the uncompressed data storage area 204), the processing of case 1-1 is performed.

＜Case 2-1＞ Destage of compressed data
When the determination of S1401 is true, destage processing (S1402 to S1406) is performed on the compressed data storage area 205 in the cache area 203. In S1402, the processor 24A selects the data to be destaged from the compressed data storage area 205. Normally, a data string (stripe row) containing one parity cycle's worth of data is selected and destaged.

Ｓ１４０３で、プロセッサ２４は、デステージするデータが属するスロットの排他を確保する。排他を確保した後、プロセッサ２４Ａは、Ｓ１４０４で対象のデータ列からパリティデータを生成する。Ｓ１４０５で、プロセッサ２４Ａは、対象のデータ列及び生成したパリティデータをドライブに書き出す。Ｓ１４０６において、プロセッサ２４Ａは、メモリ割当管理テーブル２１２を更新する。なお、本ケースにおいて、キュー状態９５が“Clean”に更新される。Ｓ１４０７で、プロセッサ２４Ａは、デステージされた範囲のスロットの排他を解放し、処理を終了する。 In S1403, the processor 24 ensures the exclusion of the slot to which the destaged data belongs. After ensuring the exclusion, the processor 24A generates parity data from the target data string in S1404. In S1405, the processor 24A writes the target data string and the generated parity data to the drive. In S1406, the processor 24A updates the memory allocation management table 212. In this case, the queue state 95 is updated to "Clean". In S1407, the processor 24A releases the exclusion of the destaged range of slots and ends the process.

＜Case 1-1＞ Batch compression and destage (exclusion held during destage)
If the determination in S1401 is false, destage processing (S1408 to S1415) is performed on the uncompressed data storage area 204 in the cache area 203. In S1408, the processor 24A selects the data to be destaged from among the data stored in the uncompressed data storage area 204 that belongs to slots whose queue state 95 is "Dirty". Normally, a data string (stripe row) containing one parity cycle's worth of data is selected and destaged.

Ｓ１４０９で、プロセッサ２４は、デステージするデータが属するスロットの排他を確保する。なお、図１３に示すデステージ処理が、図１２に示したライト処理の終了を契機として（すなわちライト処理の直後に）行われる場合には、Ｓ１３１０及びＳ１４０９を省略してもよい。 In S1409, the processor 24 ensures the exclusion of the slot to which the destaged data belongs. When the destage process shown in FIG. 13 is performed with the end of the write process shown in FIG. 12 as a trigger (that is, immediately after the write process), S1310 and S1409 may be omitted.

排他を確保した後、プロセッサ２４Ａは、Ｓ１４１０で対象のデータを読み出して、バッファ領域２０２へ転送する。なお転送の際、プロセッサ２４は、メモリ割当管理テーブル２１２のＢＦアドレス９３及び圧縮後ＶＯＬアドレス９４を割当てる。また、プロセッサ２４Ａは、バッファ領域２０２への転送完了後、ＢＦ転送状態９６を“転送済”に更新する。なお、圧縮後ＶＯＬアドレス９４の割当ては、パリティサイクル分を割当てることが明らかなため、あらかじめパリティサイクル分の領域を割当てることで、マッピング情報の更新回数を削減できる。 After ensuring the exclusion, the processor 24A reads the target data in S1410 and transfers it to the buffer area 202. At the time of transfer, the processor 24 allocates the BF address 93 of the memory allocation management table 212 and the compressed VOL address 94. Further, the processor 24A updates the BF transfer state 96 to "transferred" after the transfer to the buffer area 202 is completed. Since it is clear that the VOL address 94 after compression is allocated for the parity cycle, the number of times the mapping information is updated can be reduced by allocating the area for the parity cycle in advance.

In S1411, the processor 24A compresses the transferred data. The compression may be performed during the transfer to the buffer (that is, the data may be compressed before storage and the compressed data then stored in the buffer area 202), or it may be performed in the buffer after the transfer.

Ｓ１４１２において、プロセッサ２４Ａは、バッファ内の圧縮データの量を判定する。圧縮データ量がパリティサイクル分よりも小さい場合、プロセッサ２４は、Ｓ１４０８に戻ってデステージするデータを追加で選択する。パリティサイクル分のデータがバッファ領域２０２内に溜まった場合、Ｓ１４１２の判定を真としてＳ１４１３に進む。なお、圧縮データサイズは可変長であるため、バッファ領域２０２内のデータが必ずしもパリティサイクル分揃うとは限らないことから、パリティサイクルを超える前にＳ１４１３へ処理を進めることもありえる。 In S1412, the processor 24A determines the amount of compressed data in the buffer. If the amount of compressed data is less than the parity cycle, the processor 24 returns to S1408 to additionally select data to be destaged. When the data for the parity cycle is accumulated in the buffer area 202, the determination in S1412 is regarded as true and the process proceeds to S1413. Since the compressed data size has a variable length, the data in the buffer area 202 is not always aligned for the parity cycle, so that the processing may proceed to S1413 before the parity cycle is exceeded.

Ｓ１４１３において、プロセッサ２４Ａは、バッファ領域２０２内の圧縮データからパリティデータを生成する。Ｓ１４１４で、プロセッサ２４Ａは、対象のデータ列及び生成したパリティデータを、ＲＡＩＤグループを構成するドライブ２９に書き出す。Ｓ１４１５において、プロセッサ２４Ａは、メモリ割当管理テーブル２１２の更新を確定する。なお、本ケースにおいて、キュー状態９５が“Clean”に更新される。Ｓ１４０７で、プロセッサ２４Ａは、デステージされた範囲のスロットの排他を解放し、処理を終了する。 In S1413, the processor 24A generates parity data from the compressed data in the buffer area 202. In S1414, the processor 24A writes the target data string and the generated parity data to the drive 29 constituting the RAID group. In S1415, the processor 24A confirms the update of the memory allocation management table 212. In this case, the queue state 95 is updated to "Clean". In S1407, the processor 24A releases the exclusion of the destaged range of slots and ends the process.

上記の例では、Ｓ１４１２において、バッファ内の圧縮データの量がパリティサイクルのデータ量に達したか否かが判定されている。しかし、ドライブ２９がＲＡＩＤを構成するか否かにかかわらず、所定の量のデータをまとめてドライブ２９に格納する場合には、プロセッサ２４Ａは、Ｓ１４１２においてバッファ内の圧縮データの量が当該所定の量に達したか否かを判定する。本実施例のＳ１４１２におけるパリティサイクルのデータ量は、上記の所定のデータ量の一例である。 In the above example, in S1412, it is determined whether or not the amount of compressed data in the buffer has reached the data amount of the parity cycle. However, when a predetermined amount of data is collectively stored in the drive 29 regardless of whether or not the drive 29 constitutes RAID, the processor 24A determines that the amount of compressed data in the buffer in S1412 is the predetermined amount. Determine if the quantity has been reached. The data amount of the parity cycle in S1412 of this embodiment is an example of the above-mentioned predetermined data amount.
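The S1408 to S1415 loop of case 1-1 can be sketched as follows. This is an illustration under assumptions: the cache and table are plain dictionaries, `zlib` stands in for the compression, `cycle_bytes` represents the predetermined amount checked in S1412, and slot exclusion, parity generation, and the actual drive write are left out.

```python
import zlib


def destage_uncompressed(cache, table, cycle_bytes):
    """Compress Dirty slots until about cycle_bytes are buffered, then mark them Clean.

    Returns the (slot, compressed_payload) pairs that would be written to the drives.
    """
    buffer, buffered = [], 0
    dirty = sorted(s for s, e in table.items() if e["queue_state"] == "Dirty")
    for slot in dirty:                            # S1408: select data to destage
        payload = zlib.compress(cache[slot])      # S1410 to S1411: transfer and compress
        buffer.append((slot, payload))
        buffered += len(payload)
        if buffered >= cycle_bytes:               # S1412: predetermined amount reached?
            break
    # S1413 to S1414 (parity generation and drive write) are omitted in this sketch.
    for slot, _ in buffer:
        table[slot]["queue_state"] = "Clean"      # S1415: finalize the table update
    return buffer
```

If the Dirty slots run out before the threshold is reached, the sketch destages whatever has been buffered, which corresponds to proceeding to S1413 before a full parity cycle has accumulated.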

Note that even when the determination in S1401 is false, the processor 24A may execute S1402 to S1406 instead of S1408 to S1415. For example, if the determination in S1303 of FIG. 12 was false because the VOL attribute 42 of the write destination is not compression-enabled, uncompressed data is stored in the cache area 203A. In this case, the determination in S1401 is false, but because the data is not to be compressed, S1402 to S1406 are executed.
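The choice between the two destage paths, including the exception for volumes that are not compression-enabled, can be written as a small selector. The function and parameter names are hypothetical; the returned strings simply name the step ranges from FIG. 13.

```python
def select_destage_path(in_compressed_area: bool, compression_enabled: bool) -> str:
    """Pick the FIG. 13 step range for a destage target."""
    if in_compressed_area:       # S1401 true: data is already compressed
        return "S1402-S1406"
    if not compression_enabled:  # S1401 false, but the data must stay uncompressed
        return "S1402-S1406"
    return "S1408-S1415"         # S1401 false: compress, then destage
```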

In the above example, when throughput performance is prioritized, the write process returns a response to the host computer 30 once the compressed data has been duplicated in the cache area 203, and the destage processing no longer needs to compress the data. This lowers response performance, but improves throughput performance because cache accesses during destage processing are reduced. This processing is one example; when throughput performance is prioritized, even more processing may be performed during the write process.

例えば、プロセッサ２４Ａは、Ｓ１３０３（図１２）の判定が真である場合に、Ｓ１３０４～Ｓ１３０８を実行し、続いて、Ｓ１４１２、Ｓ１４０４～Ｓ１４０６（図１３）と同様の処理を実行し、その後にＳ１３０９、Ｓ１３１０を実行してもよい。すなわち、ライト命令に対して圧縮処理及びデステージまで一括して行われるため、レスポンス性能はさらに低下するが、スループット性能は向上する。 For example, the processor 24A executes S1304 to S1308 when the determination of S1303 (FIG. 12) is true, and subsequently executes the same processing as S1412 and S1404 to S1406 (FIG. 13), and then S1309. , S1310 may be executed. That is, since the compression process and the destage are collectively performed for the write instruction, the response performance is further lowered, but the throughput performance is improved.

In this case as well, the processing performed when the determination in S1303 (FIG. 12) is false is as described above with reference to FIGS. 12 and 13. That is, the processor 24A executes S1306 to S1310 without executing S1304 and S1305. The processor 24A then executes S1408 to S1415 and S1407.
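The two write-path orderings discussed above can be sketched as follows. This is a minimal illustration, not the patented implementation: the `Controller` class, the `handle_write` function, and the use of `zlib` as the compressor are all assumptions introduced here, and the step numbers of FIG. 12 are not modeled individually.

```python
from dataclasses import dataclass, field
import zlib


@dataclass
class Controller:
    """Hypothetical stand-in for the cache area of one storage controller."""
    uncompressed: dict = field(default_factory=dict)  # plays the role of area 204
    compressed: dict = field(default_factory=dict)    # plays the role of area 205


def handle_write(ctl_a, ctl_b, slot, data, throughput_priority):
    """Return the ordered steps taken before the host is acknowledged."""
    steps = []
    if throughput_priority:
        # Compress first, then duplicate the compressed data in both caches.
        payload = zlib.compress(data)
        steps.append("compress")
        ctl_a.compressed[slot] = payload
        ctl_b.compressed[slot] = payload
    else:
        # Duplicate the uncompressed data; compression is deferred to destage.
        ctl_a.uncompressed[slot] = data
        ctl_b.uncompressed[slot] = data
    steps.append("duplicate")
    steps.append("ack_host")
    return steps
```

In the response-priority path the host is acknowledged after duplication alone, whereas the throughput-priority path adds the compression step before the acknowledgement, which is the trade-off the text describes.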

According to the above example, slot exclusion is secured when destaging starts (S1409) and then remains held (S1407) until the transfer of the data to the drive 29 has finished (S1414) and the mapping information has been updated (S1415). Holding exclusion for such a long time can cause problems such as a required IO command not being executable. Case 1-2 below is presented as an embodiment in which the exclusion procedure of Case 1-1 is changed to avoid such problems.

FIG. 14 is a flowchart showing destage processing with a changed exclusion procedure, executed by the storage device 11 of the first embodiment of the present invention.

<Case 1-2> Batched compression and destage processing (exclusion released during destage)
In S1501, the storage device 11 makes the same determination as in S1401 of FIG. 13. If the determination in S1501 is true, destage processing (S1502 to S1507) is performed on the compressed data storage area 205 in the cache area 203. This processing is the same as S1402 to S1407 of FIG. 13, so its description is omitted.

If the determination in S1501 is false, destage processing is performed on the uncompressed data storage area 204 in the cache area 203 (S1508 to S1519). In S1508, the processor 24 selects the data to be destaged from the data stored in the uncompressed data storage area 204 that belongs to slots whose queue state 95 is “Dirty”. Normally, a data string in which one parity cycle's worth of data is lined up (a stripe column) is selected and destaged.

In Case 1-1 described above, the slot range to be destaged is held under exclusion until destage processing completes. However, if exclusion continues to be held over a range wide enough for the compressed data to fill a parity cycle, a write command from the host computer 30 becomes more likely to fall within the exclusion range and therefore to wait for the destage. The processor 24 therefore secures exclusion of the slots containing the data to be destaged in S1509, and then performs the buffer transfer in S1510 and the compression processing in S1511. After the compression processing completes, the processor 24 updates the BF transfer state 96 in the memory allocation management table 212 to “transferred” in S1512. When this update is complete, the processor 24 releases the slot exclusion in S1513.

Thereafter, the processor 24 determines whether the drive transfer is possible (S1514), generates parity (S1515), and performs the drive transfer (S1516) in the same manner as S1412, S1413, and S1414 of Case 1-1, respectively.

In S1517, the processor 24 secures exclusion of the slots in the destage range again, and in S1518 it updates the queue state 95 in the memory allocation management table 212 to “Clean”.

If an update write from the host computer 30 to a slot in the destage range occurs before S1517, the processor 24 updates the BF transfer state 96 in the memory allocation management table 212 to “none” in S1308. The processor 24 can then notice that an update write occurred by checking, when it updates the queue state 95 in S1518, whether the BF transfer state 96 has changed.

When the processor 24 notices that an update write occurred (that is, the BF transfer state 96 that was updated to “transferred” in S1512 has become “none” by S1517), it either redoes the processing or skips the mapping information update for the affected location. Specifically, the processor 24 may return to S1508 without proceeding to S1518 and redo the destage processing for the slot that received the update write. Alternatively, the processor 24 may proceed to S1518 as is and, without updating the queue state 95 of the slot that received the update write to “Clean”, proceed to S1519. In that case, the slot becomes a target of a subsequent destage pass.

Finally, in S1519, the processor 24 releases the exclusion of the slots in the destaged range and ends the processing.
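The exclusion handling of Case 1-2 can be sketched for a single slot as follows. This is a simplified illustration under stated assumptions: the `Slot` class, `host_update_write`, `destage`, the `during_transfer` hook, and the use of `threading.Lock` and `zlib` are hypothetical stand-ins, and the drive-transfer check (S1514) and parity generation (S1515) are omitted. Only the state names “Dirty”, “Clean”, “transferred”, and “none” come from the description.

```python
import threading
import zlib
from dataclasses import dataclass, field


@dataclass
class Slot:
    """Hypothetical slot record; the state fields mirror those in the text."""
    data: bytes
    queue_state: str = "Dirty"        # "Dirty" or "Clean"
    bf_transfer_state: str = "none"   # "none" or "transferred"
    lock: threading.Lock = field(default_factory=threading.Lock)


def host_update_write(slot, data):
    """Update write: resets the BF transfer state, as S1308 is described to do."""
    with slot.lock:
        slot.data = data
        slot.bf_transfer_state = "none"


def destage(slot, drive, during_transfer=None):
    """Destage one slot, releasing its exclusion while the drive transfer runs."""
    with slot.lock:                              # S1509: secure exclusion
        buffered = zlib.compress(slot.data)      # S1510, S1511: buffer and compress
        slot.bf_transfer_state = "transferred"   # S1512
    # S1513: exclusion is released here, so host writes can reach the slot.
    if during_transfer is not None:
        during_transfer()                        # hook that simulates a racing write
    drive.append(buffered)                       # S1516: drive transfer
    with slot.lock:                              # S1517: secure exclusion again
        clean = slot.bf_transfer_state == "transferred"
        if clean:
            slot.queue_state = "Clean"           # S1518
        # Otherwise the slot stays "Dirty" and is picked up by a later pass.
    return clean                                 # S1519: exclusion released above
```

The sketch takes the second of the two recovery options in the text: when an update write is detected, the queue state is left as “Dirty” rather than restarting the destage immediately.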

According to the embodiment of the present invention described above, when data stored in the cache area is destaged, everything from the compression processing to storage in the storage device (drive) is performed in one pass, so duplication of the compressed data is omitted. Because the compressed data no longer needs to be duplicated in the cache area, the amount of cache access is reduced and the maximum data write speed can be improved.

In addition, because the uncompressed data is held in duplicate in the cache memory until storage of the compressed data in the storage device is complete, the data is protected even if a device failure occurs during processing such as compression or storage in the storage device. The same effect is obtained when the storage device performs processing other than compression (for example, deduplication, encryption, or redundancy processing).

Further, when compression is performed at destage time, an area of a predetermined size, such as a parity cycle, can be allocated in advance, which reduces the number of mapping information updates.

Further, according to the embodiment of the present invention, the storage device determines, based on a predetermined condition, whether to prioritize response performance or throughput performance. When response performance is prioritized, the storage device responds to the host once the uncompressed data is held in duplicate in the cache memory, which improves response performance. When throughput performance is prioritized, the storage device compresses the data and responds to the host once the compressed data is held in duplicate. This lowers response performance, but improves throughput performance because the amount of cache access during destage is reduced.

For example, by determining whether to prioritize response performance or throughput performance based on the volume of IO commands, the expected compression ratio, the attributes of the write destination volume, or the like, the optimum performance for the situation can be achieved.
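One way such a determination might be expressed is sketched below. The `WriteContext` fields, the `prioritize_response` function, and both threshold values are hypothetical; the description names the kinds of inputs but gives no concrete formula, and "expected compression ratio" is interpreted here, as an assumption, as the expected fraction of space saved.

```python
from dataclasses import dataclass


@dataclass
class WriteContext:
    """Hypothetical inputs to the priority decision."""
    controller_load: float             # 0.0 to 1.0, e.g. derived from IO command volume
    expected_compression_gain: float   # expected fraction of space saved, 0.0 to 1.0
    volume_compression_enabled: bool   # attribute of the write destination volume


def prioritize_response(ctx, load_threshold=0.5, gain_threshold=0.1):
    """Return True to prioritize response performance, False for throughput."""
    if not ctx.volume_compression_enabled:
        # Compressed data cannot be stored in this volume.
        return True
    if ctx.expected_compression_gain < gain_threshold:
        # Compressing before the acknowledgement would gain little.
        return True
    # A lightly loaded controller can afford to favor response time.
    return ctx.controller_load < load_threshold
```

With these illustrative thresholds, a heavily loaded controller writing well-compressible data to a compression-enabled volume is the only combination that selects throughput priority.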

Further, when uncompressed data stored in the cache area is destaged, exclusion of the data's area may be held from the time the data is read from the cache area until storage of the compressed data in the storage device is complete and the queue state is changed to “Clean” (S1409 to S1415, S1407). This prevents data that has not yet been destaged from being erroneously determined to have been destaged.

Alternatively, the exclusion may be temporarily released once the data has been read, compressed, and transferred to the buffer area (S1513). This shortens the time during which exclusion is held and reduces the problem of required IO not being executable. In this case, if a new write is performed between the temporary release of exclusion (S1513) and the completion of the transfer of the data to the storage device (S1516), that fact is recorded (that is, the BF transfer state is updated from “transferred” to “none”). This prevents data that has not yet been destaged from being erroneously determined to have been destaged.

In addition to those described in the claims, the following are representative aspects of the present invention.
(1) A storage system comprising a first storage control unit, a second storage control unit, and a storage drive that is connected to at least the first storage control unit and has a non-volatile storage medium, wherein
the first storage control unit and the second storage control unit each have a cache area for storing data and a buffer area for storing data, and duplicate data by also storing the data stored in one cache area in the other's cache area, and
the first storage control unit, upon receiving a data write command from a host computer, stores the data targeted by the write command in a first cache area, which is the cache area of the first storage control unit, and also stores the data in a second cache area, which is the cache area of the second storage control unit, to duplicate it, and when the duplication is complete, transmits to the host computer a response indicating completion of the writing of the data,
performs predetermined processing on the data targeted by the write command and stores the result in the buffer area, and
reads the data stored in the buffer area and transmits it to the storage drive.
(2) The storage system according to (1) above.
The storage system is characterized in that the first storage control unit performs the predetermined processing on the data read from the first cache area, and stores the data after the predetermined processing in the buffer area.
(3) The storage system according to (1) above.
The storage system, wherein the predetermined process is any one of compression, deduplication, encryption, and redundancy of data read from the first cache area.
(4) The storage system according to (1) above.
The storage system has a plurality of the storage drives connected to the first storage control unit.
When the amount of data stored in the buffer area reaches a predetermined data amount for generating parity, the first storage control unit creates parity based on the data read from the buffer area and transmits the data read from the buffer area and the parity to the plurality of storage drives.
(5) The storage system according to (2) above.
The first storage control unit holds a predetermined condition for determining whether response performance or throughput performance is prioritized in the storage system and, based on the predetermined condition, determines which of response performance and throughput performance is prioritized.
When response performance is prioritized, the first storage control unit stores the data in the first and second cache areas and then performs the predetermined processing.
When throughput performance is prioritized, the first storage control unit stores the data that has undergone the predetermined processing in the first and second cache areas.
(6) The storage system according to (5) above.
The storage system is characterized in that the first storage control unit determines that response performance is prioritized when the processing load of the first storage control unit is lower than a predetermined reference.
(7) The storage system according to (5) above.
The predetermined processing is compression of the data.
The first storage control unit determines that response performance is prioritized when the compression ratio of the data is predicted to be lower than a predetermined reference, or when compressed data cannot be stored in the volume designated as the write target of the data.
(8) The storage system according to (5) above.
The first storage control unit holds, for each management unit area of the volume to which the data is written, a queue state indicating whether the data written to that management unit area has been stored in the storage drive.
Upon receiving the write command for the data from the host computer, the first storage control unit secures exclusion of the management unit area to which the data is to be written and then stores the data in the first cache area.
After transmitting to the host computer the response indicating completion of the writing of the data, the first storage control unit releases the exclusion of the management unit area to which the data was written.
The first storage control unit secures exclusion of a management unit area whose queue state indicates that the written data has not been stored in the storage drive, then reads the data written to that management unit area from the first cache area and stores the data resulting from the predetermined processing in the buffer area.
When storage in the storage drive of the processed data read from the buffer area is complete, the first storage control unit updates the queue state to a value indicating that the written data has been stored in the storage drive and then releases the exclusion of the management unit area.
(9) The storage system according to (5) above.
The first storage control unit holds, for each management unit area of the volume to which the data is written, a queue state indicating whether the data written to that management unit area has been stored in the storage drive, and a buffer transfer state indicating whether the data written to that management unit area has been stored in the buffer area.
Upon receiving the write command for the data from the host computer, the first storage control unit secures exclusion of the management unit area to which the data is to be written and then stores the data in the first cache area.
After transmitting to the host computer the response indicating completion of the writing of the data, the first storage control unit releases the exclusion of the management unit area to which the data was written.
The first storage control unit secures exclusion of a management unit area whose queue state indicates that the written data has not been stored in the storage drive, then reads the data written to that management unit area from the first cache area and stores the data resulting from the predetermined processing in the buffer area.
After updating the buffer transfer state of that management unit area to a value indicating that the data has been stored in the buffer area, the first storage control unit releases the exclusion of the management unit area.
If data is written to the management unit area while its exclusion is released, the first storage control unit updates the buffer transfer state of the management unit area to a value indicating that the written data has not been stored in the buffer area.
After the processed data read from the buffer area has been stored in the storage drive, the first storage control unit secures exclusion of the management unit area.
If the buffer transfer state of the management unit area indicates that the written data has been stored in the buffer area, the first storage control unit updates the queue state to a value indicating that the written data has been stored in the storage drive and then releases the exclusion of the management unit area.
(10) The storage system according to (2) above.
The first storage control unit further has a third cache area for storing data.
The second storage control unit further has a fourth cache area for storing data.
When the first storage control unit determines, based on the predetermined condition, that throughput performance is prioritized, it performs the predetermined processing on the data, stores the processed data in the third cache area, and transmits the processed data to the second storage control unit.
The second storage control unit stores the processed data received from the first storage control unit in the fourth cache area to duplicate it.
When the second storage control unit finishes storing the processed data in the fourth cache area, the first storage control unit transmits to the host computer a response indicating completion of the writing of the data, and reads the data stored in the third cache area and transmits it to the storage drive.
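The parity trigger described in aspect (4) can be illustrated with the sketch below. It assumes single XOR parity in the style of RAID 5, which the text does not specify; the `DestageBuffer` class, its parameters, and the `xor_parity` helper are hypothetical names introduced for illustration.

```python
from functools import reduce


def xor_parity(chunks):
    """Byte-wise XOR of equally sized chunks (single parity, assumed RAID 5 style)."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*chunks))


class DestageBuffer:
    """Hypothetical buffer area that emits a stripe once a parity cycle is full."""

    def __init__(self, chunk_size, data_chunks_per_cycle):
        self.chunk_size = chunk_size
        self.cycle_bytes = chunk_size * data_chunks_per_cycle
        self.pending = bytearray()

    def append(self, processed):
        """Add processed data; return (data_chunks, parity) for each full cycle."""
        self.pending += processed
        stripes = []
        while len(self.pending) >= self.cycle_bytes:
            cycle = bytes(self.pending[:self.cycle_bytes])
            del self.pending[:self.cycle_bytes]
            chunks = [cycle[i:i + self.chunk_size]
                      for i in range(0, self.cycle_bytes, self.chunk_size)]
            stripes.append((chunks, xor_parity(chunks)))
        return stripes
```

Each returned tuple holds the data chunks and the parity chunk that would be sent to separate drives; data short of a full cycle simply waits in `pending`.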

The present invention is not limited to the embodiments described above and includes various modifications. For example, the embodiments above have been described in detail for a better understanding of the present invention, and the invention is not necessarily limited to configurations that include every element described.

Further, some or all of the configurations, functions, processing units, processing means, and the like described above may be realized in hardware, for example by designing them as an integrated circuit. The configurations, functions, and the like described above may also be realized in software, with a processor interpreting and executing programs that implement the respective functions. Information such as the programs, tables, and files that implement each function can be stored in a storage device such as a non-volatile semiconductor memory, a hard disk drive, or an SSD (Solid State Drive), or in a computer-readable non-transitory data storage medium such as an IC card, an SD card, or a DVD.

In addition, the control lines and information lines shown are those considered necessary for the explanation, and not all control lines and information lines in a product are necessarily shown. In practice, almost all of the components may be considered to be interconnected.

100 Storage system
11 Storage device
22, 22A, 22B Storage controller
202 Buffer area
203, 203A, 203B Cache area
204 Uncompressed data storage area
205 Compressed data storage area
29 Drive
30 Host computer
31 Network

Claims

A storage system comprising a first storage control unit, a second storage control unit, and a storage drive that is connected to at least the first storage control unit and has a non-volatile storage medium, wherein
the first storage control unit has a first cache area for storing data and a first buffer area for storing data,
the second storage control unit has a second cache area for storing data and a second buffer area for storing data,
the first storage control unit duplicates data by also storing the data stored in the first cache area in the second cache area,
the first storage control unit, upon receiving a data write command from a host computer, stores the data targeted by the write command in the first cache area of the first storage control unit, also stores the data stored in the first cache area in the second cache area of the second storage control unit to duplicate it, and when the duplication is complete, transmits to the host computer a response indicating completion of the writing of the data, and
the first storage control unit reads from the cache area the data to be stored in the storage drive, among the duplicated data that is the target of the write command and is stored in the cache area, compresses the data and stores it in the first buffer area without storing it in the first cache area, generates parity based on the data stored in the first buffer area by the compression and stores the parity in the first buffer area, and reads the data and the parity stored in the first buffer area and transmits them to the storage drive for storage.

The storage system according to claim 1, wherein
the storage system has a plurality of the storage drives connected to the first storage control unit, and
when the amount of data stored in the first buffer area by the compression reaches a predetermined data amount for generating parity, the first storage control unit creates parity based on the data read from the first buffer area and transmits the data read from the first buffer area and the parity to the plurality of storage drives.

A control method for a storage system, wherein
the storage system comprises a first storage control unit, a second storage control unit, and a storage drive that is connected to at least the first storage control unit and has a non-volatile storage medium,
the first storage control unit has a first cache area for storing data and a first buffer area for storing data, the second storage control unit has a second cache area for storing data and a second buffer area for storing data, and the first storage control unit is configured to duplicate data by also storing the data stored in the first cache area in the second cache area, and
the control method comprises:
a step in which the first storage control unit, upon receiving a data write command from a host computer, stores the data targeted by the write command in the first cache area of the first storage control unit, also stores the data stored in the first cache area in the second cache area of the second storage control unit to duplicate it, and when the duplication is complete, transmits to the host computer a response indicating completion of the writing of the data; and
a step in which the first storage control unit reads from the cache area the data to be stored in the storage drive, among the duplicated data that is the target of the write command and is stored in the cache area, compresses the data and stores it in the first buffer area without storing it in the first cache area, generates parity based on the data stored in the first buffer area by the compression and stores the parity in the first buffer area, and reads the data and the parity stored in the first buffer area and transmits them to the storage drive for storage.