JP2009217768A

JP2009217768A - Method and system for managing partition failure in database

Info

Publication number: JP2009217768A
Application number: JP2008063601A
Authority: JP
Inventors: Yuki Ubukata; 裕貴生形
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2008-03-13
Filing date: 2008-03-13
Publication date: 2009-09-24

Abstract

<P>PROBLEM TO BE SOLVED: To permit or prohibit access per partition by managing failure information for each divided partition with respect to a database divided into a plurality of partitions by partitioning. <P>SOLUTION: The method for controlling access to a storage area divided into the plurality of partitions includes: a state detection step of accessing each partition to store a state of each partition in a state holding part; a state confirmation step of confirming, when a data update request to the storage area is received, a state of a partition concerning the request by referring to the state holding part; and a data update step of executing processing concerning the request only when the partition is in a normal state. <P>COPYRIGHT: (C)2009,JPO&INPIT

Description

本発明は、データベースにおけるパーティション障害管理方法およびシステムに関する。特に、パーティショニングにより複数のパーティションに分割されたデータベースにおけるパーティション障害管理方法およびシステムに関する。 The present invention relates to a partition failure management method and system in a database. In particular, the present invention relates to a partition failure management method and system in a database divided into a plurality of partitions by partitioning.

データベースの可用性、管理性、および問い合わせのレスポンスを向上させるために、１のデータベーステーブルのデータを複数の論理的なデータの集合であるパーティションに分割するパーティショニングというデータ分割手法が知られている。このパーティショニングにより分割された複数のパーティションは、それぞれ別のディスクあるいは論理ボリュームに割り当てることができ、特定のパーティションを破損した場合でも、他の正常なパーティションへのアクセスやデータの更新等が継続して行うことができる。 In order to improve database availability, manageability, and query response, a data partitioning method called partitioning that partitions data of one database table into partitions that are sets of a plurality of logical data is known. Multiple partitions divided by this partitioning can be assigned to different disks or logical volumes, and even when a specific partition is damaged, access to other normal partitions and data update continue. Can be done.

しかしながら、テーブルがパーティショニングにより複数のパーティションに分割されている場合でも、このテーブルは、クライアントアプリケーションからは１のテーブルとして見えているだけであり、クライアントアプリケーションは、どのようなパーティションが存在するのか認識することができない。このため、クライアントアプリケーションは、パーティションの状態を個別に認識することができず、一部のパーティションに障害が生じても、テーブル全体が正常であるか、あるいはテーブル全体がエラーであるかしか認識できない。パーティショニングの利点は、特定のパーティションに障害が発生しても、他の正常なパーティションが存在する限り、他のパーティションに対しては処理を継続できることである。したがって、この利点を生かすためには、特定のパーティションに障害が発生している場合であっても、他の正常なパーティションへのアクセスを可能にするために、クライアントアプリケーションは、テーブル全体が正常であるとして処理を継続する必要がある。しかしながら、特定のパーティションに障害が発生しているにも拘わらずテーブル全体が正常であるとしてしまうと、障害が発生しているパーティションに複数のトランザクションによる多重アクセスが生じる恐れがある。この場合には、エラーメッセージが繰り返し発生してしまい、他の重要なメッセージが埋もれてしまうという問題がある。 However, even if the table is divided into multiple partitions by partitioning, this table is only seen as one table by the client application, and the client application recognizes what kind of partition exists. Can not do it. For this reason, the client application cannot recognize the partition status individually, and even if a failure occurs in some partitions, it can only recognize whether the entire table is normal or whether the entire table is in error. . The advantage of partitioning is that even if a failure occurs in a specific partition, as long as another normal partition exists, the processing can be continued for the other partition. Therefore, in order to take advantage of this advantage, the client application can ensure that the entire table is healthy in order to allow access to other healthy partitions even if a particular partition fails. It is necessary to continue processing as if there is. However, if the entire table is normal even though a failure has occurred in a specific partition, multiple accesses by multiple transactions may occur in the partition in which the failure has occurred. In this case, there is a problem that error messages are repeatedly generated and other important messages are buried.

このような問題に対応すべく、障害の発生したデータを含むブロック領域へのアクセスを禁止するファイル閉塞制御方式が開示されている（例えば、特許文献１）。このファイル閉塞制御方式は、障害が発生したデータのアドレスをメモリに記憶しておき、データへアクセス要求があった場合に、アクセスを要求されているデータのアドレスが前記メモリに格納されているか確認し、このアドレスが格納されているときは、当該アドレスへのアクセス要求を拒否する。
特開昭６０−２５８６５９号公報 In order to cope with such a problem, a file blocking control method that prohibits access to a block area including a failed data is disclosed (for example, Patent Document 1). In this file block control method, the address of the data where the failure has occurred is stored in the memory, and when there is a request for access to the data, it is confirmed whether the address of the data requested to be accessed is stored in the memory. If this address is stored, the access request to the address is rejected.
JP-A-60-258659

しかしながら、この方式では、障害のあるデータアドレスへのアクセスを禁止するのみであり、パーティション単位でアクセスを許可あるいは禁止することはできない。また、この方式は、データ領域の障害情報を監視していないため、クライアントアプリケーションが、いずれのメモリへのアクセスが禁止されているのか認識できず、また、データ領域の障害が解消されたことを認識することできない。 However, this method only prohibits access to a faulty data address, and cannot permit or prohibit access on a partition basis. In addition, since this method does not monitor the failure information of the data area, the client application cannot recognize which memory access is prohibited, and that the failure of the data area has been resolved. I can't recognize it.

したがって、本発明は、パーティショニングにより複数のパーティションに分割されたデータベースにおいて、分割されたパーティション毎の障害情報を管理することにより、パーティション単位のアクセス許可あるいは禁止することを目的とする。 Accordingly, an object of the present invention is to permit or prohibit access in units of partitions by managing failure information for each divided partition in a database divided into a plurality of partitions by partitioning.

本発明は、複数のパーティションに分割された記憶領域へのアクセス制御方法であって、各パーティションにアクセスして正常か否かを状態保有部に保存する状態検出ステップと、前記記憶領域へのデータ更新要求を受けた場合に、前記状態保有部を参照して要求にかかるパーティションの状態を確認する状態確認ステップと、前記パーティションの状態が正常である場合にのみ前記要求にかかる処理を実行するデータ更新ステップとを含むことを特徴とする。 The present invention relates to a method for controlling access to a storage area divided into a plurality of partitions, a state detection step of accessing each partition and storing whether or not it is normal in a state holding unit, and data to the storage area Data for executing a status check step for checking the status of the partition related to the request with reference to the status holding unit when the update request is received, and processing for the request only when the status of the partition is normal And an update step.

また、前記状態検出ステップは定期的に実行され、前記状態保有部には最新の検出状態が保存されることを特徴とする。 Further, the state detection step is periodically executed, and the latest detection state is stored in the state holding unit.

さらに、前記状態確認ステップは、前記記憶領域に格納されるデータがどのパーティションに存在するかを示す対応テーブルを参照して前記要求にかかるデータに対応するパーティションを導出し、当該パーティションの状態を確認することを特徴とする。 Further, the state confirmation step refers to a correspondence table indicating in which partition the data stored in the storage area exists, derives a partition corresponding to the data related to the request, and confirms the state of the partition It is characterized by doing.

本発明は、複数のパーティションに分割された記憶領域へのアクセス制御方法であり、各パーティションにアクセスして正常か否かを状態保有部に保存し、前記記憶領域へのデータ更新要求を受けた場合に、前記記憶領域に格納されるデータがどのパーティションに存在するかを示す対応テーブルを参照して前記要求にかかるデータに対応するパーティションを導出し、当該パーティションの状態を確認し、前記パーティションの状態が正常である場合にのみ前記要求にかかる処理を実行するため、パーティション単位でアクセス許可あるいは禁止することができる。 The present invention is a method of controlling access to a storage area divided into a plurality of partitions, and stores each partition in a state holding unit to determine whether it is normal or not, and receives a data update request to the storage area In this case, a partition corresponding to the data related to the request is derived with reference to a correspondence table indicating in which partition the data stored in the storage area exists, the state of the partition is confirmed, and the partition Since the processing related to the request is executed only when the state is normal, access can be permitted or prohibited in units of partitions.

また、本発明は、前記状態検出が定期的に実行され、前記状態保有部には最新の検出状態が保存されるため、ほぼリアルタイムで記憶領域の状態を確認することができ、障害の発生および復旧に迅速に対応できる。 Further, according to the present invention, since the state detection is periodically performed and the latest detection state is stored in the state holding unit, the state of the storage area can be confirmed almost in real time, Respond quickly to recovery.

以下、本発明を実施するための最良の形態について、添付の図面を参照しながら詳細に説明する。 The best mode for carrying out the present invention will be described below in detail with reference to the accompanying drawings.

図１は、本発明にかかるデータベースの障害管理システムの全体構成を示す概略図である。本図には、パーティション状態監視装置１０と、クライアントアプリケーション２０と、データベースノード３０とが示されている。このシステムは、例えば、ネットワークに接続されたデータサーバであり、クライアントアプリケーション２０が、クライアントＰＣからデータの処理要求を受け付け、データベースノード３０のディスク３３に対し処理を行うものである。パーティション状態監視装置１０は、データベースノード３０から監視情報を収集する監視情報登録部１１と、監視情報登録部１１が収集した情報を保持する情報保有部１２と、予め定義されたパーティション情報を保持するパーティション定義保有部１３と、クライアントアプリケーション２０がパーティションの状態を参照するための状態情報参照部１４とを備えている。 FIG. 1 is a schematic diagram showing the overall configuration of a database failure management system according to the present invention. In this figure, a partition state monitoring apparatus 10, a client application 20, and a database node 30 are shown. This system is, for example, a data server connected to a network, and the client application 20 receives a data processing request from a client PC and performs processing on the disk 33 of the database node 30. The partition status monitoring device 10 holds a monitoring information registration unit 11 that collects monitoring information from the database node 30, an information holding unit 12 that holds information collected by the monitoring information registration unit 11, and a predetermined partition information. The partition definition holding unit 13 and the status information reference unit 14 for the client application 20 to refer to the partition status are provided.

図２は、データベースノード３０の監視部３１をより詳細に示した図である。データベースノード３０は、監視部３１と、データベース３２とを備えており、このデータベース３２は、障害監視用テーブル３２１と、業務用テーブル３２２とを備えている。監視部３１は、データベース３２の障害監視用テーブル３２１を参照してパーティション障害が発生しているか判断する判断手段３１２と、当該判断手段３１２の判断結果を、パーティション状態監視装置１０の監視情報登録部１１に送信する通信手段３１１とを備えている。 FIG. 2 is a diagram showing the monitoring unit 31 of the database node 30 in more detail. The database node 30 includes a monitoring unit 31 and a database 32, and the database 32 includes a failure monitoring table 321 and a business table 322. The monitoring unit 31 refers to the failure monitoring table 321 of the database 32 to determine whether a partition failure has occurred, and the determination result of the determination unit 312 is used as the monitoring information registration unit of the partition state monitoring device 10. 11 and a communication means 311 for transmitting to the terminal 11.

図３は、データベースノード３０のデータベース３２が備える障害監視用テーブル３２１および業務用テーブル３２２と、ディスク３３との関係を示す図である。本図に示すように、本実施例では５つのディスクを使用しているため、これらのディスクに対応するように障害監視用テーブル３２１を５つのパーティション３２１１乃至３２１５に分割する。本実施例では、パーティションを物理的なディスク単位で割り当てているが、代替的に論理ボリューム単位で割り当ててもよい。また、業務用テーブル（Ａ）３２２ａは、４つのディスク３３１乃至３３４に割り当てられ、業務用テーブル（Ｂ）３２２ｂは、２つのディスク３３４および３３５に割り当てられている。 FIG. 3 is a diagram illustrating the relationship between the failure monitoring table 321 and the business table 322 included in the database 32 of the database node 30 and the disk 33. As shown in this figure, since five disks are used in this embodiment, the failure monitoring table 321 is divided into five partitions 3211 to 3215 so as to correspond to these disks. In this embodiment, the partitions are allocated in units of physical disks, but may be allocated in units of logical volumes instead. The business table (A) 322a is assigned to the four disks 331 to 334, and the business table (B) 322b is assigned to the two disks 334 and 335.

図４は、障害監視用テーブル３２１の定義例を示しており、図５は、この定義例による障害監視用テーブル３２１の初期状態を示している。この定義例では、ＩＤ列を主キーに設定し、ＩＤが重複しないようにしている。また、リストパーティションにより、ＩＤ１乃至５の各レコードが、障害監視用テーブル３２１のパーティション３２１１乃至３２１５と一意に対応するように定義されている。これらのパーティションはそれぞれ、ディスク３３１乃至３３５に対応している。これにより、ＩＤ１乃至５の各レコードにＳＱＬＵＰＤＡＴＡ文等を発行し、その結果を確認することにより、パーティション単位で障害の有無を確認することができる。 FIG. 4 shows a definition example of the failure monitoring table 321, and FIG. 5 shows an initial state of the failure monitoring table 321 according to this definition example. In this definition example, an ID column is set as a primary key so that IDs do not overlap. Further, the records of ID 1 to 5 are defined by the list partition so as to uniquely correspond to the partitions 3211 to 3215 of the failure monitoring table 321. These partitions correspond to the disks 331 to 335, respectively. As a result, it is possible to confirm the presence or absence of a failure in units of partitions by issuing a SQL UPDATA statement or the like to each record of ID1 to ID5 and confirming the result.

次に、図６を参照して、本発明の処理動作について説明する。初めに、データベースノード３０内の監視部３１の判断手段３１２が、障害監視用テーブル３２１にＳＱＬＳＥＬＥＣＴ文を発行し（Ａ１）、障害監視用テーブル３２１に格納されたＩＤのリスト（ＩＤ１乃至５）を取得する（Ａ２）。これらのＩＤが障害監視用テーブル３２１の各パーティションおよびディスクに対応する。次に、監視部３１の判断手段３１２は、パーティションの障害の有無を確認すべく、各ＩＤに対応するレコードにＳＱＬＵＰＤＡＴＡ文を発行し（Ａ３）、更新結果を受け取る（Ａ４）。ここで、ＳＱＬ文が正常終了した場合、該当するパーティションは正常であると判断し、ＳＱＬ文が異常終了した場合、該当するパーティションは異常であると判断する。次に、監視部３１の通信手段３１１が、この更新結果をパーティション状態監視装置１０の監視情報登録部１１に送信し（Ａ５）、当該監視情報登録部１１は、この更新結果を状態保有部１２に登録する（Ａ６）。図７は、この状態保有部１２の一例を示す図である。状態保有部１２には、これらの更新結果がＩＤおよびディスク毎に保持される。データベースノード３０の監視部３１は、これらの処理を定期的に、例えば１秒おきに繰り返す。これにより、パーティションの状態が定期的に更新され、パーティション毎の状態をほぼリアルタイムで確認することができる。 Next, the processing operation of the present invention will be described with reference to FIG. First, the determination unit 312 of the monitoring unit 31 in the database node 30 issues an SQL SELECT statement to the failure monitoring table 321 (A1), and a list of IDs (ID1 to ID5) stored in the failure monitoring table 321. Is acquired (A2). These IDs correspond to the partitions and disks in the failure monitoring table 321. Next, the determination unit 312 of the monitoring unit 31 issues an SQL UPDATA statement to a record corresponding to each ID (A3) and receives an update result (A4) in order to confirm whether there is a partition failure. Here, if the SQL statement ends normally, it is determined that the corresponding partition is normal, and if the SQL statement ends abnormally, it is determined that the corresponding partition is abnormal. Next, the communication unit 311 of the monitoring unit 31 transmits the update result to the monitoring information registration unit 11 of the partition state monitoring apparatus 10 (A5), and the monitoring information registration unit 11 transmits the update result to the state holding unit 12. (A6). FIG. 7 is a diagram illustrating an example of the state holding unit 12. The state holding unit 12 holds these update results for each ID and disk. The monitoring unit 31 of the database node 30 repeats these processes regularly, for example, every 1 second. Thereby, the state of a partition is regularly updated, and the state of each partition can be confirmed almost in real time.

次に、図８乃至１０を参照して、パーティション定義保有部１３について説明する。パーティション定義保有部１３は、これらの図に示すように、テーブル／パーティションキー対応テーブル１３１と、業務テーブルＡ用のキー値／ディスク名対応テーブル１３２ａと、業務テーブルＢ用のキー値／ディスク名対応テーブル１３２ｂとを備えている。テーブル／パーティションキー対応テーブル１３１は、図８に示すように、業務テーブルＡおよびＢのパーティションキーが登録されている。業務テーブルＡ用のキー値／ディスク名対応テーブル１３２ａは、図９に示すように、パーティションキー「ｄａｔｅ」の範囲と、これに関連するディスク名とが登録されている。業務テーブルＢ用のキー値／ディスク名対応テーブル１３２ｂは、図１０に示すように、パーティションキー「ＩＤ」の範囲と、これに関連するディスク名とが登録されている。 Next, the partition definition holding unit 13 will be described with reference to FIGS. As shown in these drawings, the partition definition holding unit 13 corresponds to the table / partition key correspondence table 131, the key value / disk name correspondence table 132a for the business table A, and the key value / disk name correspondence for the business table B. And a table 132b. In the table / partition key correspondence table 131, the partition keys of the business tables A and B are registered as shown in FIG. In the key value / disk name correspondence table 132a for the business table A, as shown in FIG. 9, the range of the partition key “date” and the disk name related thereto are registered. In the key value / disk name correspondence table 132b for the business table B, as shown in FIG. 10, the range of the partition key “ID” and the disk name related thereto are registered.

次に図１１を参照して、クライアントアプリケーション２０が、業務テーブルを更新する処理について説明する。以下の説明では、更新の対象の業務テーブルを「業務テーブルＡ」とし、キー値を「２００７．１．１」とする。 Next, a process in which the client application 20 updates the business table will be described with reference to FIG. In the following description, the business table to be updated is “business table A”, and the key value is “2007.1.1”.

初めに、クライアントアプリケーション２０が、クライアントＰＣ（図示せず）から業務用テーブルの更新要求を受信すると（Ｂ１）、パーティション状態監視装置１０の状態情報参照部１４に、更新の対象であるテーブル名「業務用テーブルＡ」および更新する情報を送信する（Ｂ２）。次に、状態情報参照部１４は、パーティション定義保有部１３のテーブル／パーティションキー対応テーブル１３１を参照して、受信したテーブル名「業務用テーブルＡ」に関連するパーティションキー「ｄａｔｅ」を取得する（Ｂ３）。次に、状態情報参照部１４は、取得したパーティションキー「ｄａｔｅ」と当該キーの値「２００７．１．１」を基に、パーティション保有部１３の業務テーブルＡ用のキー値／ディスク名対応テーブル１３２ａを参照して、該当するディスク名「ディスク３３３」を取得する（Ｂ４）。状態情報参照部１４は、図７に示す状態保有部１２を参照して、「ディスク３３３」の状態情報を取得し（Ｂ５）、この状態情報をクライアントアプリケーション２０に送信する（Ｂ６）。クライアントアプリケーション２０は、状態情報が正常である場合にのみデータベースの更新処理を実施する（Ｂ７）。このように、クライアントアプリケーション２０は、データベース更新処理を行う前にパーティションの状態を確認するため、不必要なデータベースの更新を回避することができる。 First, when the client application 20 receives a business table update request from a client PC (not shown) (B1), the status information reference unit 14 of the partition status monitoring device 10 receives the table name “ "Business table A" and information to be updated are transmitted (B2). Next, the status information reference unit 14 refers to the table / partition key correspondence table 131 of the partition definition holding unit 13 and acquires the partition key “date” related to the received table name “business table A” ( B3). Next, based on the acquired partition key “date” and the value “2007.1.1” of the state, the status information reference unit 14 uses the key value / disk name correspondence table for the business table A of the partition holding unit 13. Referring to 132a, the corresponding disk name “disk 333” is acquired (B4). The status information reference unit 14 refers to the status holding unit 12 shown in FIG. 7 to acquire the status information of “disk 333” (B5), and transmits this status information to the client application 20 (B6). The client application 20 performs the database update process only when the state information is normal (B7). As described above, since the client application 20 checks the state of the partition before performing the database update process, unnecessary update of the database can be avoided.

ここでは、更新対象のテーブルが１つの場合の処理についてのみ説明しているが、更新対象のテーブルが複数ある場合にはこれらの処理を繰り返し行う。この場合も同様に、更新対象であるパーティションが正常である場合にのみ更新処理を行う。 Here, only the process when there is one update target table has been described, but when there are a plurality of update target tables, these processes are repeated. Similarly in this case, the update process is performed only when the partition to be updated is normal.

本発明の好適な実施例を示し説明したが、本発明の目的または特徴から逸脱しない範囲で様々な変更を加えることができる。 While the preferred embodiment of the invention has been illustrated and described, various modifications can be made without departing from the scope of the invention.

本発明は、データベースのパーティショニングを利用しているシステム全般に用いることができる。特に、トランザクション処理件数の多い大規模データベースを有するオンラインシステムに有用である。 The present invention can be used for all systems using database partitioning. In particular, it is useful for an online system having a large-scale database with a large number of transaction processes.

図１は、本発明にかかるデータベースの障害管理システムの全体構成を示す概略図である。FIG. 1 is a schematic diagram showing the overall configuration of a database failure management system according to the present invention. 図２は、データベースノード３０の監視部３１の詳細図である。FIG. 2 is a detailed diagram of the monitoring unit 31 of the database node 30. 図３は、データベース３２の障害監視用テーブル３２１および業務用テーブル３２２と、ディスク３３とを示す図である。FIG. 3 is a diagram showing the failure monitoring table 321 and business table 322 of the database 32 and the disk 33. 図４は、障害監視用テーブル３２１の定義例を示す図である。FIG. 4 is a diagram illustrating a definition example of the failure monitoring table 321. 図５は、図４の定義例による障害監視用テーブル３２１の初期状態を示している。FIG. 5 shows an initial state of the failure monitoring table 321 according to the definition example of FIG. 図６は、本発明の処理動作を示す図である。FIG. 6 is a diagram showing the processing operation of the present invention. 図７は、パーティション状態監視装置１０の状態保有部１２を示す図である。FIG. 7 is a diagram illustrating the state holding unit 12 of the partition state monitoring apparatus 10. 図８は、業務テーブルＡおよびＢのパーティションキーを管理するテーブル／パーティションキー対応テーブル１３１を示す図である。FIG. 8 is a diagram showing a table / partition key correspondence table 131 for managing the partition keys of the business tables A and B. 図９は、パーティションキー値「ｄａｔｅ」の範囲と、これに関連するディスク名を管理する業務テーブルＡ用のキー値／ディスク名対応テーブルを示す図である。FIG. 9 is a diagram showing a key value / disk name correspondence table for the business table A that manages the range of the partition key value “date” and the disk names related thereto. 図１０は、パーティションキー値「ＩＤ」の範囲と、これに関連するディスク名を管理する業務テーブルＢ用のキー値／ディスク名対応テーブルを示す図である。FIG. 10 is a diagram showing a key value / disk name correspondence table for the business table B that manages the range of the partition key value “ID” and the disk names related thereto. 図１１は、クライアントアプリケーション２０が、業務テーブルを更新する処理を示す図である。FIG. 11 is a diagram illustrating processing in which the client application 20 updates the business table.

Explanation of symbols

１０パーティション状態監視装置
２０クライアントアプリケーション
３０データベースノード 10 partition status monitoring device 20 client application 30 database node

Claims

A method of controlling access to a storage area divided into a plurality of partitions,
A state detection step for accessing each partition to store whether it is normal or not in the state holding unit, and when receiving a data update request to the storage area, the state of the partition related to the request with reference to the state holding unit An access control method, comprising: a state confirmation step for confirming the data; and a data update step for executing the processing related to the request only when the state of the partition is normal.

The access control method according to claim 1, wherein the state detection step is periodically executed, and a latest detection state is stored in the state holding unit.

3. The method according to claim 1, wherein the status check step refers to a correspondence table indicating in which partition the data stored in the storage area exists, and selects a partition corresponding to the data related to the request. An access control method characterized by deriving and confirming the state of the partition.

In a storage device comprising a storage area divided into a plurality of partitions,
A monitoring unit that accesses each partition and stores whether it is normal or not in the state holding unit, and a state information reference unit that checks the state of the partition related to the request by referring to the state holding unit when a data update request is received And
An apparatus characterized by executing the process related to the request only when the state of the partition is normal.

5. The apparatus according to claim 4, wherein the monitoring unit periodically accesses each partition to update the state holding unit.

6. The apparatus according to claim 4, further comprising a correspondence table indicating in which partition the data stored in the storage area exists, and the state information reference means refers to the correspondence table. And identifying a partition corresponding to the data related to the update request and checking the state.