JPWO2018047250A1

JPWO2018047250A1 - Database transition support apparatus and method

Info

Publication number: JPWO2018047250A1
Application number: JP2018537916A
Authority: JP
Inventors: 高橋　正和; 正和高橋; 友隆塩野谷
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2016-09-07
Filing date: 2016-09-07
Publication date: 2019-07-04
Anticipated expiration: 2036-09-07
Also published as: WO2018047250A1; JP6695985B2

Abstract

A database migration support apparatus supports the creation of a distributed key-value store to which a relational database is migrated, and has a memory and a processor. The memory holds a key evaluation program and a key evaluation presentation program. The key evaluation program generates at least one key candidate, that is, a candidate for a key to be used in the distributed key-value store, based on at least one column defined in the relational database, and evaluates the key candidate based on the data and the usage log of the relational database. The key evaluation presentation program presents the evaluation results of the key candidates. The processor executes the key evaluation program and the key evaluation presentation program to present the evaluation results of the key candidates.

Description

The present invention relates to a technology for supporting database migration from a relational database to a key-value store.

Against the background of the rapid growth of the IoT market and similar trends, there is a movement to migrate databases from relational databases (RDB) to key-value stores (KVS) in order to improve data capacity and processing performance. In a database migration, applications that were running before the migration are required to run in the same way after it. However, realizing a KVS from which the same data can be obtained as from the pre-migration RDB has required manual key design at the cost of many man-hours. To address this, Patent Document 1 discloses a technology that supports migration design in order to facilitate migration from an RDB to a KVS.

Patent Document 1: JP 2014-211790 A

When supporting migration from an RDB to a KVS, the technology of Patent Document 1 evaluates data items using only the schema definition and the usage log of the RDB. Consequently, when the migrated KVS is applied to actual data, it does not necessarily return the same data as the pre-migration RDB.

An object of the present invention is to provide a technology capable of supporting KVS key design such that, when the actual data of the pre-migration RDB is applied, the same data can be obtained as from the RDB.

A database migration support apparatus according to one embodiment of the present invention supports the creation of a distributed key-value store to which a relational database is migrated, and has a memory and a processor. The memory holds a key evaluation program and a key evaluation presentation program. The key evaluation program generates at least one key candidate, that is, a candidate for a key to be used in the distributed key-value store, based on at least one column defined in the relational database, and evaluates the key candidate based on the data and the usage log of the relational database. The key evaluation presentation program presents the evaluation results of the key candidates. The processor executes the key evaluation program and the key evaluation presentation program to present the evaluation results of the key candidates.

According to the present invention, the results of evaluating each key candidate using the pre-migration data and usage log are presented, so the effect of the migration can be assessed with actual data and an appropriate key for the distributed key-value store can be created.

Brief description of the drawings, in order of appearance:
A block diagram of the database migration system according to the present embodiment.
A diagram showing the information that the programs of the key design support apparatus 100 reference or input and output.
A sequence diagram showing the overall operation of database migration in the database migration system according to the present embodiment.
A diagram showing an example of the vehicle information table schema definition 204.
A diagram showing an example of the vehicle information table 203.
A diagram showing an example of the usage log 202.
A diagram showing an example of the KVS information 302.
A flowchart of the migration client determination processing S103.
A flowchart of the key evaluation processing S107.
A flowchart of the key candidate set creation processing.
A flowchart of the simulation table creation processing.
A table showing an example of key candidates used to explain the simulation table creation processing.
A table showing an example of an RDB table used to explain the simulation table creation processing.
A flowchart of the query simulation processing.
A flowchart of the score calculation processing.
A flowchart of the key evaluation presentation processing.
A diagram showing an example of a screen display produced by the key evaluation presentation processing.
A diagram showing an example of a screen display produced by the key evaluation presentation processing.
A diagram showing an example of a screen display produced by the key evaluation presentation processing.
A diagram showing an example of a screen display produced by the key evaluation presentation processing.
A sequence diagram of the data migration processing.
A diagram showing an example of a screen display produced by the data migration processing.
A diagram showing an example of a screen display produced by the data migration processing.
A table showing a calculation example of the sequentiality and the dispersion.

Embodiments of the present invention are described below with reference to the drawings.

FIG. 1 is a block diagram of the database migration system according to the present embodiment and mainly shows the hardware configuration. FIG. 2 is a diagram showing the information that the programs of the key design support apparatus 100 reference or input and output.

Referring to FIG. 1, the database migration system includes a key design support apparatus 100, a relational database (RDB) 200, a distributed key-value store (KVS) 300, and clients 401, 402, 403, and 404.

The key design support apparatus 100 supports key design for the migration from the RDB 200 to the distributed KVS 300. It has an output device 110, an arithmetic device 120, an input device 130, a communication device 140, and a storage unit 150.

The storage unit 150 has a program (PG) area 151 and a data area 152. The PG area 151 stores a migration client determination PG 1511, a key evaluation PG 1512, a key evaluation presentation PG 1513, a data migration PG 1514, a key candidate set creation module 1515, a KVS table creation module 1516, a query simulation module 1517, and a score calculation module 1518.

The data area 152 stores RDB connection information 1521, migration target client information 1522, an RDB usage log 1523, a key candidate set 1524, a query set 1525, an RDB table 1526, a simulation table 1527, a key evaluation result 1528, and migration key information 1529.

The migration client determination PG 1511 determines the clients whose database is to be migrated from the RDB to the distributed KVS. As shown in FIG. 2, it can refer to the usage log 202 of the RDB 200. For example, the migration client determination PG 1511 extracts the clients that use the RDB 200 from the usage log 202 and presents the client list to the designer through the output device 110. It may then determine the clients to be migrated from the RDB to the distributed KVS based on the information that the designer, having viewed the client list, enters from the input device 130.

The key evaluation PG 1512 evaluates each key candidate in the key candidate set 1524 and assigns it a score. The score calculation module 1518 is used to calculate the score. The evaluation method is described later. As shown in FIG. 2, the key evaluation PG 1512 can refer to the usage log 202, the vehicle information table 203, and the vehicle information table schema definition 204 of the RDB 200, and to the KVS information 302 of the distributed KVS 300. It calculates the score of each key candidate using the referenced information. The key evaluation results are accumulated in the key evaluation result 1528. As also shown in FIG. 2, the key evaluation results are presented to the designer from the output device 110 via the key evaluation presentation PG 1513.

The key evaluation presentation PG 1513 presents the results evaluated by the key evaluation PG 1512 to the designer, for example via the output device 110. The designer reviews the key evaluation results, decides the migration key to be used for the distributed KVS after migration, and records it in the migration key information 1529. As shown in FIG. 2, the key evaluation presentation PG 1513 can obtain the key selection information that the designer enters from the input device 130 and notify the data migration PG 1514 of it.

In addition, because the key evaluation PG 1512 creates a simulation table for each key candidate during the evaluation, once the migration key is decided it notifies the data migration PG 1514 of the simulation table of the migration key.

The data migration PG 1514 executes the migration of data from the RDB to the distributed KVS. Specifically, it generates migration data by associating a key, based on the information in predetermined columns of the RDB data stored in the RDB table 1526, with the data value and recording the pair in a KVS table (not shown). The key (Key) and the value (Value) of each data item can be obtained from the simulation table of the migration key. The data migration PG 1514 notifies the distributed KVS 300 of the generated migration data.
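The key/value pairing described above can be sketched as follows. This is a minimal illustration rather than the apparatus's actual implementation: the function name `build_kvs_records`, the `|` separator, and the column names (`vehicle_id`, `timestamp`, `speed`) are assumptions introduced here, since the text does not specify how the key string is composed.

```python
from typing import Dict, List, Sequence, Tuple

Row = Dict[str, object]


def build_kvs_records(
    rows: List[Row], key_columns: Sequence[str], sep: str = "|"
) -> List[Tuple[str, Row]]:
    """Turn RDB rows into (key, value) pairs for a KVS table.

    The key is the concatenation of the chosen key columns (hypothetical
    format); the value holds the remaining columns of the row.
    """
    records = []
    for row in rows:
        key = sep.join(str(row[c]) for c in key_columns)
        value = {c: v for c, v in row.items() if c not in key_columns}
        records.append((key, value))
    return records


# Hypothetical rows standing in for the RDB table 1526.
rows = [
    {"vehicle_id": "A01", "timestamp": "2016-09-07T10:00", "speed": 42},
    {"vehicle_id": "A02", "timestamp": "2016-09-07T10:00", "speed": 37},
]
records = build_kvs_records(rows, ["vehicle_id", "timestamp"])
for key, value in records:
    print(key, value)
```

A composite key of this kind keeps rows of the same vehicle adjacent when the store orders data by key, which is one reason the choice of key columns matters for scan behavior.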

The key candidate set creation module 1515 generates a plurality of candidates for the key of the post-migration distributed KVS 300, based on the information in the columns of the pre-migration RDB 200, and stores them in the key candidate set 1524.

The KVS table creation module 1516 creates the KVS table that stores data and keys in the distributed KVS 300 after migration.

The query simulation module 1517 runs a simulation of the distributed KVS 300 for each key candidate, using the queries obtained from the usage log of the RDB 200.

The score calculation module 1518 calculates the score of each key candidate. The score calculation method is described later.

The RDB connection information 1521 is the information a client needs to connect to the RDB, for example a URL for connecting to the RDB.

The migration target client information 1522 indicates the clients whose database is migrated from the RDB 200 to the distributed KVS 300.

The RDB usage log 1523 is the accumulated history of the use of the pre-migration RDB 200. It includes the queries that accessed the RDB 200 and the search results of the RDB 200.

The key candidate set 1524 is information containing the plurality of key candidates created by the key candidate set creation module 1515.

The query set 1525 is the accumulated set of queries issued when clients used the RDB. The queries are extracted from the RDB usage log 1523 and stored in the query set 1525.

The simulation table 1527 is a table for simulating, based on the queries in the query set 1525, the distributed KVS that would result from a key candidate or from the migration key.

The key evaluation result 1528 is information on the results of the key evaluation PG 1512 evaluating the key candidates.

The migration key information 1529 is information on the key used when migrating from the RDB 200 to the distributed KVS 300. For example, the designer selects the migration key based on the evaluation results of the key candidates, and information on the selected migration key is stored in the migration key information 1529.

The RDB 200 has a communication device 201, a usage log 202, a vehicle information table 203, a vehicle information table schema definition 204, a table 2 (205), and a schema definition 2 (206).

The communication device 201 allows the RDB 200 to communicate with the key design support apparatus 100 and the clients 401 and 402 via a communication network.

The usage log 202 is history information accumulated through the use of the RDB 200 by the clients 401 and 402. It is acquired by the RDB 200 and sent to the key design support apparatus 100, where it is stored in the data area 152 as the RDB usage log 1523.

The RDB 200 is provided with a relational database that manages information on the current position and traveling speed of vehicles in real time. The vehicle information table 203 is the database table in which the vehicle information is accumulated.

The vehicle information table schema definition 204 is information that defines the structure of the vehicle information table 203.

The table 2 (205) is a table of another database, and the schema definition 2 (206) is information that defines the structure of that other database.

The distributed KVS 300 has a communication device 301 and stores KVS information 302. The migration data sent from the data migration PG 1514 is distributed across a plurality of nodes 303, 304, and 305. The KVS information 302 is setting information on the configuration and operation of the distributed KVS 300. In this example, as shown in FIG. 1, the database of the distributed KVS 300 is distributed across three nodes 303, 304, and 305.

The communication device 301 allows the distributed KVS 300 to communicate with the key design support apparatus 100, the RDB 200, and the clients 403 and 404 via a communication network.

The KVS information 302 is setting information on the distributed KVS. It includes information such as the number of nodes, the Region size, and whether automatic sharding is enabled.

The clients 401 and 402 use the database of the RDB 200. Application 1 runs on the client 401, and applications 2a and 2b run on the client 402.

The clients 403 and 404 use the database of the distributed KVS 300.

An outline of the configuration and operation of the key design support apparatus 100 according to the present embodiment is given below.

The key design support apparatus 100 is a database migration support apparatus that supports the creation of a distributed key-value store (the distributed KVS 300) to which a relational database (the RDB 200) is migrated. It has a memory (the storage unit 150) and a processor (the arithmetic device 120). The memory holds a key evaluation program (the key evaluation PG 1512) and a key evaluation presentation program (the key evaluation presentation PG 1513).

The key evaluation program generates at least one key candidate, that is, a candidate for a key to be used in the distributed key-value store, based on at least one column defined in the relational database, and evaluates each key candidate based on the data and the usage log of the relational database. The key evaluation presentation program presents the evaluation results of the key candidates to the designer, for example on a screen.

Because the results of evaluating each key candidate with the pre-migration data and usage log are presented, the effect of the migration is assessed with actual data, and an appropriate key for the key-value store can be created easily.

The key evaluation program also generates at least one key candidate based on at least one column defined in the relational database and, for each key candidate, calculates a fitness and an access performance. The fitness indicates the degree to which the columns underlying the key candidate appear in the usage log of the relational database. The access performance is the performance of accessing data distributed across the plurality of nodes of the distributed key-value store. The program evaluates each key candidate based on the fitness and the access performance.
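As an illustration of the fitness idea, the sketch below counts how often the columns behind a key candidate appear in logged queries. The exact formula is not given in this passage, so the definition used here (the fraction of queries that mention every column of the candidate) is an assumption, as are the function name `fitness`, the table name `vehicle`, and the column names.

```python
import re
from typing import Iterable, Sequence


def fitness(candidate_columns: Sequence[str], queries: Iterable[str]) -> float:
    """Fraction of logged queries that mention every column of the candidate.

    This is one possible reading of "degree of appearance in the usage log",
    not a formula taken from the source.
    """
    queries = list(queries)
    if not queries:
        return 0.0
    hits = 0
    for query in queries:
        # Split the query into identifier-like tokens so that "id" does not
        # match inside "vehicle_id".
        tokens = set(re.findall(r"[A-Za-z_][A-Za-z0-9_]*", query.lower()))
        if all(col.lower() in tokens for col in candidate_columns):
            hits += 1
    return hits / len(queries)


# Hypothetical queries standing in for the query set 1525.
queries = [
    "SELECT speed FROM vehicle WHERE vehicle_id = 'A01'",
    "SELECT * FROM vehicle WHERE vehicle_id = 'A01' AND ts > 100",
    "SELECT * FROM vehicle WHERE area = 'north'",
]
print(fitness(["vehicle_id"], queries))
print(fitness(["vehicle_id", "ts"], queries))
```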

Accordingly, a score is presented that is based on the fitness, which is the degree to which the same data can be obtained after migration as before, and on the access performance for the migrated data. An appropriate key-value store can therefore be created easily, based on an evaluation that takes both into account.

The key evaluation program also calculates, for each key candidate, a score that includes the product of the fitness and the access performance, and the key evaluation presentation program presents the score of each key candidate to the designer. Because a single score reflects both the fitness and the access performance, a plurality of key candidates can be compared easily.

The access performance includes a sequentiality, which takes a higher value the fewer scans are made on the nodes when the migrated database is accessed in a simulation based on the usage log. Because key candidates with fewer scans are rated higher, this supports the selection of key candidates that are likely to be accessed efficiently.

The access performance further includes a dispersion, which takes a higher value the more evenly the scans are distributed across the nodes when the migrated database is accessed in a simulation based on the usage log. Because key candidates whose scans are evenly distributed across the nodes receive a higher score, this supports the selection of key candidates that are likely to give efficient access to the distributed KVS.
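The two access-performance measures can be illustrated with a short sketch. The formulas below are assumptions chosen only to satisfy the stated properties (fewer scans give a higher sequentiality; a more even spread gives a higher dispersion); they are not the calculation used in the source, and the function names are hypothetical.

```python
import math
from typing import Sequence


def sequentiality(scan_count: int) -> float:
    """Assumed measure: 1.0 for a single scan, decreasing as scans increase."""
    if scan_count < 1:
        raise ValueError("scan_count must be at least 1")
    return 1.0 / scan_count


def dispersion(scans_per_node: Sequence[int]) -> float:
    """Assumed measure: normalized entropy of the per-node scan counts.

    Close to 1.0 when scans are spread evenly, 0.0 when all scans hit one node.
    """
    total = sum(scans_per_node)
    node_count = len(scans_per_node)
    if total == 0 or node_count < 2:
        return 0.0
    entropy = -sum(
        (s / total) * math.log(s / total) for s in scans_per_node if s > 0
    )
    return entropy / math.log(node_count)


# Hypothetical simulation results for three nodes.
print(sequentiality(1), sequentiality(4))
print(dispersion([10, 10, 10]), dispersion([30, 0, 0]))
```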

The key evaluation program also allows the weights of the sequentiality and the dispersion in the score to be changed by an operation of the user (designer). Because the user can change the weights of the fitness and the access performance, the user can choose as appropriate which to emphasize in key design.
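One way to combine these quantities into a single score is sketched below. The text states only that the score includes the product of the fitness and the access performance and that the weights are adjustable; treating the access performance as a weighted sum of the sequentiality and the dispersion is an assumption made here, as are the function name and the default weights.

```python
def score(
    fitness: float,
    sequentiality: float,
    dispersion: float,
    w_seq: float = 0.5,
    w_disp: float = 0.5,
) -> float:
    """Score = fitness x access performance (assumed weighted-sum form)."""
    access_performance = w_seq * sequentiality + w_disp * dispersion
    return fitness * access_performance


# The designer can shift the emphasis by changing the weights.
balanced = score(0.8, 0.5, 1.0)
scan_focused = score(0.8, 0.5, 1.0, w_seq=0.9, w_disp=0.1)
print(balanced, scan_focused)
```

With the product form, a candidate whose columns rarely appear in the usage log scores low no matter how well its scans are distributed.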

The key evaluation program also selects, from the columns defined in the relational database, a predetermined number of columns in descending order of the number of appearances in the queries extracted from the usage log. From those columns and their combinations, it extracts those that uniquely identify the data, together with the primary key, as key candidates. Because unique key candidates are created from columns, or combinations of columns, that appear frequently in the usage log, key candidates are obtained that are likely to return the same data as before migration.
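The candidate-generation steps above can be sketched as follows. This is a simplified illustration under stated assumptions: the function name `key_candidates`, the token-based column counting, the tie-breaking by column order, and the sample table are all hypothetical and not taken from the source.

```python
import re
from collections import Counter
from itertools import combinations
from typing import Dict, List, Sequence, Tuple


def key_candidates(
    rows: List[Dict[str, object]],
    columns: List[str],
    primary_key: Sequence[str],
    queries: List[str],
    top_n: int = 2,
) -> List[Tuple[str, ...]]:
    """Return the primary key plus unique combinations of frequently queried columns."""
    # Count how often each column name appears in the logged queries.
    counts: Counter = Counter()
    for query in queries:
        tokens = re.findall(r"[A-Za-z_][A-Za-z0-9_]*", query.lower())
        for col in columns:
            counts[col] += tokens.count(col.lower())

    # Keep the top_n columns; ties are broken by column order (an assumption).
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], columns.index(kv[0])))
    top = [col for col, _ in ranked[:top_n]]

    candidates: List[Tuple[str, ...]] = [tuple(primary_key)]
    for size in range(1, len(top) + 1):
        for combo in combinations(top, size):
            values = [tuple(row[c] for c in combo) for row in rows]
            # A combination qualifies only if it identifies every row uniquely.
            if len(set(values)) == len(rows) and combo not in candidates:
                candidates.append(combo)
    return candidates


rows = [
    {"id": 1, "vehicle_id": "A01", "ts": 100, "area": "north"},
    {"id": 2, "vehicle_id": "A01", "ts": 200, "area": "north"},
    {"id": 3, "vehicle_id": "A02", "ts": 100, "area": "south"},
]
columns = ["id", "vehicle_id", "ts", "area"]
queries = [
    "SELECT * FROM vehicle WHERE vehicle_id = 'A01' AND ts > 100",
    "SELECT * FROM vehicle WHERE vehicle_id = 'A02'",
    "SELECT * FROM vehicle WHERE area = 'north' AND ts < 300",
]
candidates = key_candidates(rows, columns, ["id"], queries)
print(candidates)
```

In this sample, neither `vehicle_id` nor `ts` is unique on its own, so only their combination is added alongside the primary key.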

The key evaluation program also narrows down the key candidates based on the fitness, runs the simulation on the remaining key candidates, and calculates the access performance. Because the simulation is run only after the key candidates have been narrowed down, the processing load of the simulation is reduced.

The key evaluation program also lets the user select the clients to be migrated from the relational database to the distributed key-value store, and uses the usage logs of the selected clients to evaluate the key candidates. The user interface for selecting clients allows the desired clients to be chosen and migrated from the relational database to the distributed key-value store.

The key evaluation presentation program also displays on the screen the evaluation result of any one key candidate together with a button for deciding that key candidate as the migration key. When the button is operated, the data migration program creates the distributed key-value store using that key candidate as the migration key. Because the key candidates and evaluation results are presented and the user decides the migration key, the apparatus helps the user decide on a key candidate that achieves the desired performance as the migration key.

A more detailed description follows.

FIG. 3 is a sequence diagram showing the overall operation of database migration in the database migration system according to the present embodiment.

Referring to FIG. 3, the designer first enters the RDB connection information (step S101). Next, the migration client determination PG 1511 acquires the client information in the usage log from the RDB 200 (step S102). The migration client determination PG 1511 then executes the migration client determination processing (step S103). The details of the migration client determination processing are described later.

In the migration client determination processing, the migration client determination PG 1511 presents the client list to the designer (step S104). When the designer enters the information on the migration clients (step S105), the migration client determination PG 1511 determines the migration clients and notifies the key evaluation PG 1512 of the migration client information (step S106).

On receiving the migration client information, the key evaluation PG 1512 executes the key evaluation processing (step S107). The details of the key evaluation processing are described later. In the key evaluation processing, the key evaluation PG 1512 acquires the usage log and the schema definition table from the RDB 200 (step S108), generates key candidates based on the acquired information, evaluates each key candidate, and notifies the key evaluation presentation PG 1513 of the evaluation results (step S109).

On receiving the key evaluation results, the key evaluation presentation PG 1513 executes the key evaluation presentation processing (step S110). The details of the key evaluation presentation processing are described later. In the key evaluation presentation processing, the key evaluation presentation PG 1513 presents the key evaluation results to the designer (step S111). When the designer, having reviewed the evaluation results, decides the migration key (step S112), the key evaluation PG 1512 notifies the data migration PG 1514 of key information indicating the migration key (step S113).

On receiving the key information, the data migration PG 1514 executes the data migration processing (step S114). The details of the data migration processing are described later. In the data migration processing, the data migration PG 1514 acquires setting information from the designer (step S115), has the distributed KVS 300 create a KVS table using the migration key and the setting information (step S115), acquires the simulation table of the migration key from the key evaluation PG 1512 (step S116), and has the distributed KVS 300 execute the data migration (step S117). When the data migration is complete, it presents the completion of the data migration to the designer (step S118).

以下、移行クライアント決定処理Ｓ１０３、キー評価処理Ｓ１０７、キー評価提示処理Ｓ１１０、およびデータ移行処理Ｓ１１４について詳細に説明する。 The migration client determination processing S103, the key evaluation processing S107, the key evaluation presentation processing S110, and the data migration processing S114 will be described in detail below.

FIG. 4A shows an example of the vehicle information table schema definition 204. FIG. 4B shows an example of the vehicle information table 203. FIG. 4C shows an example of the usage log 202. As an example, the RDB 200 is assumed to store the vehicle information table schema definition 204 shown in FIG. 4A, the vehicle information table 203 shown in FIG. 4B, and the usage log 202 shown in FIG. 4C.

FIG. 5 shows an example of the KVS information 302. The KVS information 302 shown in FIG. 5 is assumed to be set in the distributed KVS 300.

FIG. 6 is a flowchart of the migration client determination process S103.

In the migration client determination process S103, the migration client determination PG 1511 first receives RDB connection information entered by the designer and stores it in the storage unit 150 (step S201).

Next, the migration client determination PG 1511 acquires client information from the usage log 202 of the RDB 200 (step S202). As illustrated in FIG. 4C, the usage log 202 accumulates a history of the queries that clients have issued against the RDB 200. For each query, the log stores, in association with one another, the client 2021 that issued the query, the query ID 2022, the query 2023, the search result 2024 returned by the RDB 200, and the number of data items (result count) 2025 extracted from the RDB 200 as the search result. The migration client determination PG 1511 acquires from the usage log 202 the clients that appear in the client 2021 field and creates a list of client information (step S202). It then displays the list of client information on the screen of the output device 110 (step S203).
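The client list of step S202 can be pictured with a short Python sketch. The dictionary field names and the sample log entries below are hypothetical and chosen only for illustration; the text does not specify how the usage log 202 is stored.

```python
def list_clients(usage_log):
    """Return the distinct clients in the usage log, in order of first appearance."""
    clients = []
    for entry in usage_log:
        if entry["client"] not in clients:
            clients.append(entry["client"])
    return clients


# Hypothetical usage log entries (field names are assumptions).
usage_log = [
    {"client": "Client1", "query_id": "Q1.1", "query": "SELECT * FROM car WHERE Time > 5"},
    {"client": "Client2", "query_id": "Q2.1", "query": "SELECT * FROM car WHERE CarID = 1"},
    {"client": "Client1", "query_id": "Q1.2", "query": "SELECT * FROM car WHERE Speed > 60"},
    {"client": "Client3", "query_id": "Q3.1", "query": "SELECT * FROM car"},
]

clients = list_clients(usage_log)
```

The resulting list is what a check-box screen would iterate over when offering each client as a migration target.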

FIG. 6 shows the client information screen display D203. On the screen display D203, each client name is displayed together with a check box for setting whether that client is a migration target.

Next, when the designer performs a click operation, the migration client determination PG 1511 displays a check mark in the clicked check box (step S204). The clients whose check boxes the designer has checked become migration targets. In the example of FIG. 6, client 1 and client 2 are migration targets.

Subsequently, upon detecting that the execute button on the screen display D203 has been clicked (step S205), the migration client determination PG 1511 removes the client information screen display D203 from the screen of the output device 110 (step S206) and stores information on the clients to be migrated (migration clients) in the storage unit 150 (step S207).

FIG. 7 is a flowchart of the key evaluation process S107.

First, the key evaluation PG 1512 executes the key candidate set creation process (step S301) to generate a plurality of key candidates. Details of the key candidate set creation process S301 are described later. When step S301 ends, the data area 152 shown in FIG. 7 holds the usage log, the RDB query set, and the key candidate set.

Next, the key evaluation PG 1512 repeats steps S302 to S304 for every key candidate. First, it executes the simulation table creation process (step S302) to create the simulation table 1527. Details of the simulation table creation process are described later. When step S302 ends, the data area 152 shown in FIG. 7 holds the RDB table, the simulation table, and the KVS information.

Next, the key evaluation PG 1512 executes the query simulation process (step S303) and calculates the query simulation result, which is used to evaluate the key candidate. Details of the query simulation process are described later. When step S303 ends, the data area 152 shown in FIG. 7 holds the simulation result, which includes the number of acquired records and the number of scans.

Next, the key evaluation PG 1512 executes the score calculation process (step S304), which yields a score for each key candidate as its evaluation result. The scores are used to decide which key to use in the distributed KVS after migration. Details of the score calculation process are described later. When step S304 ends, the data area 152 shown in FIG. 7 holds the key evaluation result.

Finally, the key evaluation PG 1512 outputs the key evaluation result (step S305). FIG. 7 shows an example D305 of the key evaluation result, which contains the evaluation results for KeyA and KeyB. For example, KeyA has an average sequential degree of 1.5 and an average dispersion degree of 38.

FIG. 8 is a flowchart of the key candidate set creation process.

First, the key evaluation PG 1512 acquires information on the migration clients, that is, the clients selected for migration (step S401). Next, it receives the usage log 202 covering the migration clients' use of the RDB 200 and stores it in the storage unit 150 (step S402). It then extracts the queries from the usage log 202 in the storage unit 150, creates a query set, and stores the query set in the storage unit 150 (step S403).

Next, the key evaluation PG 1512 counts how many times each column of the RDB 200 (RDB column) appears in the WHERE clauses of the queries in the query set (step S404). FIG. 8 shows a table D404 of the occurrence counts. For each RDB column, table D404 lists the occurrence count and a flag indicating whether the column is a primary key. For example, the RDB column "Time" appears most often, 20 times.

Next, the key evaluation PG 1512 selects a predetermined number of RDB columns in descending order of occurrence count (step S405). Three RDB columns are selected here, but the number is not limited to three. FIG. 8 shows a table D405 of the three RDB columns with the highest occurrence counts: "Time", "CarID", and "Speed".

Next, the key evaluation PG 1512 extracts from the RDB 200 the schema definition related to the tables that appear in the FROM clauses of the queries, and adds the primary key of the schema definition as an RDB column to the table D405 created in step S405 (step S406). The primary key may not appear in any WHERE clause; for convenience, the average of the occurrence counts of the other RDB columns (the top three) is set as the occurrence count of the primary key in WHERE clauses. This produces table D406. In the example of FIG. 8, Time has an occurrence count of 20, CarID 10, and Speed 5, so (20 + 10 + 5) / 3 ≈ 12 is set. Using the average occurrence count of the other RDB columns is only one example, and the invention is not limited to it.
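Steps S404 to S406 can be sketched in Python as follows. This is a minimal illustration, not the described implementation: the regex-based WHERE-clause extraction is a simplifying assumption (no subqueries or quoted strings are handled), and the function names and the column name "PK" are hypothetical.

```python
import re
from collections import Counter


def count_where_columns(queries, columns):
    """Count how often each known column name appears after WHERE (step S404)."""
    counts = Counter({c: 0 for c in columns})
    for q in queries:
        m = re.search(r"\bWHERE\b(.*)", q, flags=re.IGNORECASE | re.DOTALL)
        if not m:
            continue  # query has no WHERE clause
        where = m.group(1)
        for c in columns:
            counts[c] += len(re.findall(rf"\b{re.escape(c)}\b", where))
    return counts


def select_with_primary_key(counts, primary_key, n=3):
    """Keep the n most frequent columns (S405), then add the primary key with
    the rounded average of their counts if it is not already present (S406)."""
    ranked = sorted(counts.items(), key=lambda kv: kv[1], reverse=True)[:n]
    table = dict(ranked)
    if primary_key not in table:
        table[primary_key] = round(sum(table.values()) / len(table))
    return table


sample = count_where_columns(
    ["SELECT * FROM car WHERE Time > 5 AND CarID = 1",
     "select Speed from car where Time < 3"],
    ["Time", "CarID", "Speed"],
)
# Occurrence counts taken from the FIG. 8 example; "Latitude" is added for illustration.
table_d406 = select_with_primary_key(
    {"Time": 20, "CarID": 10, "Speed": 5, "Latitude": 1}, "PK")
```

With the FIG. 8 counts, the primary key receives round(35 / 3) = 12, matching the value given in the text.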

Next, the key evaluation PG 1512 creates a unique RDB column list, which registers RDB columns or combinations of RDB columns that uniquely identify data, and a non-unique RDB column list, which registers RDB columns or combinations of RDB columns that do not uniquely identify data, and adds the primary key in table D406 to the unique RDB column list (steps S407 and S415).

Furthermore, the key evaluation PG 1512 performs a series of steps (steps S408 to S415) that extract, from the RDB columns, combinations of RDB columns that uniquely identify data.

First, the key evaluation PG 1512 adds the RDB columns of table D406 other than the primary key to the non-unique RDB column list (step S408). It then repeats the following steps (steps S410 to S412) for each RDB column in the non-unique RDB column list.

First, the key evaluation PG 1512 deletes the RDB column under consideration from the non-unique RDB column list (step S410). Next, it determines whether exactly one record is obtained when a single value of that RDB column is specified in a WHERE clause (step S411). If exactly one record is obtained, it adds the RDB column to the unique RDB column list (step S415). Otherwise, it adds the RDB column back to the non-unique RDB column list (step S412).

When steps S410 to S412 have been completed for the RDB columns in the non-unique RDB column list, the key evaluation PG 1512 combines the non-unique RDB columns C at a time, where C is a variable, to form a new non-unique RDB column list (step S414), and returns to step S409. In step S409 the variable is updated as C = C + 1, after which steps S410 to S413 are repeated.

After steps S409 to S414 have been executed up to the maximum value of C, the key evaluation PG 1512 combines the RDB columns in the unique RDB column list to create KVS key candidates (step S416). FIG. 8 shows an example in which seven key candidates, KeyA through KeyG, are created as key candidates D416. For example, KeyA is a key candidate that combines the RDB column "CarID" and the RDB column "Time".
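The uniqueness search of steps S408 to S415 can be approximated by the following Python sketch. It is a simplification under stated assumptions: uniqueness is checked against an in-memory copy of the rows rather than by issuing WHERE queries, combinations that already contain a unique combination are skipped, and the function name and sample rows are hypothetical.

```python
from itertools import combinations


def find_unique_column_sets(rows, columns, max_size):
    """Return column combinations (up to max_size columns) whose values
    identify every row uniquely, trying smaller combinations first."""
    unique = []
    for size in range(1, max_size + 1):
        for combo in combinations(columns, size):
            # Only still-non-unique columns are combined further.
            if any(set(u) <= set(combo) for u in unique):
                continue
            values = [tuple(row[c] for c in combo) for row in rows]
            if len(set(values)) == len(values):
                unique.append(combo)
    return unique


# Hypothetical rows of the vehicle information table.
rows = [
    {"CarID": 1, "Time": 1, "Speed": 10},
    {"CarID": 1, "Time": 2, "Speed": 10},
    {"CarID": 2, "Time": 1, "Speed": 10},
]
unique_sets = find_unique_column_sets(rows, ["CarID", "Time", "Speed"], max_size=2)
```

In this sample no single column is unique, but the pair (CarID, Time) is, which mirrors how KeyA combines those two columns.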

Next, for each key candidate created, the key evaluation PG 1512 calculates the average occurrence count of the RDB columns contained in that key candidate and holds the value as the fitness of the key candidate (step S417). FIG. 8 shows the fitness calculation result D417. For example, the fitness of KeyA and KeyB is 15, the average of 10 and 20. Although the average occurrence count of the RDB columns is used as the fitness here, the invention is not limited to this. As another example, the total of the occurrence counts may be used instead of the average.

Next, the key evaluation PG 1512 sorts the key candidates in descending order of fitness, takes a predetermined number from the top (two in this example), and stores them in the storage unit 150 as the key candidate set 1524 (step S418). FIG. 8 shows an example of the key candidate set 1524, which contains the two key candidates KeyA and KeyB.
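The fitness calculation and top-N selection of steps S417 and S418 amount to a mean followed by a sort. In the sketch below, the column composition of KeyB and KeyC is hypothetical (the text specifies only KeyA's columns and the fitness value 15), and the function names are illustrative.

```python
def fitness(candidate_columns, occurrences):
    """Average WHERE-clause occurrence count of the columns in a key candidate."""
    return sum(occurrences[c] for c in candidate_columns) / len(candidate_columns)


def select_candidates(candidates, occurrences, keep=2):
    """Rank key candidates by fitness (highest first) and keep the top ones."""
    scored = [(name, fitness(cols, occurrences)) for name, cols in candidates.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)[:keep]


occurrences = {"Time": 20, "CarID": 10, "Speed": 5, "PK": 12}
candidates = {
    "KeyA": ["CarID", "Time"],
    "KeyB": ["Time", "CarID"],   # hypothetical composition
    "KeyC": ["CarID", "Speed"],  # hypothetical composition
}
key_candidate_set = select_candidates(candidates, occurrences)
```

Switching to the total occurrence count, which the text mentions as an alternative, would only require replacing the division in `fitness`.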

FIG. 9A is a flowchart of the simulation table creation process. FIG. 9B is a table showing an example of a key candidate used to explain the simulation table creation process. FIG. 9C is a table showing an example of an RDB table used to explain the simulation table creation process.

Referring to FIG. 9A, the key evaluation PG 1512 first receives the RDB table and stores it in the storage unit 150 (step S501). It then acquires from the storage unit 150 the key candidate to be simulated (step S502). Here, as an example, the information on KeyA shown in FIG. 9B is acquired as the key candidate D502.

Next, the key evaluation PG 1512 creates a pre-division simulation table D506 containing Key, which indicates the KVS key; Value, which indicates the KVS value; Region, which indicates the divided region in which the data is stored; and Node, which indicates the node in which the data is stored. It registers the RDB columns contained in the key candidate as the KVS Key, and the column names and values of the other RDB columns as Value, in the pre-division simulation table D506 (step S503). The top row of the RDB table shown in FIG. 9C holds data in which PK is 1 and CarID is 1, together with a Time column, a Latitude column, a Longitude column, and a Speed column. From that row, three data items are created, with the Latitude column, the Longitude column, and the Speed column respectively as Value, and are registered in the pre-division simulation table D506.

Next, the key evaluation PG 1512 receives the KVS information 302 used for the distributed KVS 300 and stores it in the storage unit 150 (step S504). As shown in FIG. 5, the KVS information 302 includes the number of nodes of the distributed KVS 300, the Region size, and so on.

Next, based on the Region size contained in the KVS information 302, the key evaluation PG 1512 determines the Region number indicating the region in which each data item is stored and registers it in the pre-division simulation table D506 (step S505). For example, the data items may be assigned to regions sequentially and the Region number of the assigned region registered. Although Region numbers are assigned sequentially in this example, the invention is not limited to this; as another example, Region numbers may be assigned to the data items at random.

Next, the key evaluation PG 1512 determines the node that stores each region in the distributed KVS 300 and registers the Node number in the Node column of the pre-division simulation table D506 (step S506). Here, as an example, the regions are assigned to the nodes sequentially.

Next, the key evaluation PG 1512 divides the pre-division simulation table D506 by Region number to create the simulation table D508 (step S507), and stores the created simulation table D508 in the storage unit 150 (step S508).
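Steps S503 to S507 can be sketched in Python as below. Several points are assumptions made for the sketch rather than statements from the text: the Region size is treated as a number of data items per region, rows are ordered by key before regions are assigned, regions are assigned to nodes round-robin, and the function name, parameter names, and sample values are hypothetical.

```python
def build_simulation_tables(records, key_columns, value_columns, region_size, node_count):
    """Turn RDB rows into Key/Value items, number the regions sequentially,
    assign regions to nodes in turn, and split the table by region."""
    rows = []
    ordered = sorted(records, key=lambda r: tuple(r[c] for c in key_columns))
    for rec in ordered:
        key = tuple(rec[c] for c in key_columns)
        for col in value_columns:  # one data item per non-key column
            rows.append({"Key": key, "Value": (col, rec[col])})
    tables = {}
    for i, row in enumerate(rows):
        region = i // region_size + 1                # sequential Region number
        row["Region"] = region
        row["Node"] = (region - 1) % node_count + 1  # sequential Node number
        tables.setdefault(region, []).append(row)
    return tables


records = [
    {"PK": 1, "CarID": 1, "Time": 1, "Latitude": 35.0, "Longitude": 139.0, "Speed": 40},
    {"PK": 2, "CarID": 1, "Time": 2, "Latitude": 35.1, "Longitude": 139.1, "Speed": 42},
    {"PK": 3, "CarID": 2, "Time": 1, "Latitude": 36.0, "Longitude": 140.0, "Speed": 55},
]
tables = build_simulation_tables(
    records, ["CarID", "Time"], ["Latitude", "Longitude", "Speed"],
    region_size=3, node_count=2)
```

Each RDB row yields three data items here, consistent with the Latitude, Longitude, and Speed items described for the top row of FIG. 9C.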

FIG. 10 is a flowchart of the query simulation process.

First, the key evaluation PG 1512 acquires from the storage unit 150 the simulation table to be used for the simulation (step S601). It then repeats steps S602 to S605 for every query in the usage log.

First, from the query execution results in the RDB usage log 1523 in the storage unit 150, the key evaluation PG 1512 identifies the data acquired by the query and registers in the simulation table 1527 whether each data item is acquired by the query (step S603). FIG. 10 shows an example of the simulation table 1527. In the "acquired by query" column of the simulation table D603 shown in FIG. 10, Y (Yes) is registered if the data item is acquired and N (No) if it is not.

Next, for the simulation table of each region, the key evaluation PG 1512 appends N to the end of the "acquired by query" column, reads the column in order from the top, counts the number of times the registered value changes from Y to N, and takes that count as the number of scans (step S604). FIG. 10 shows a table D604 of the number of scans for each region. For example, the region Region1 stored in Node1 has 2 acquired records and 1 scan. The key evaluation PG 1512 then totals the number of scans for each node (step S605).
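The scan counting rule of step S604 and the per-node totals of step S605 can be sketched in a few lines of Python. The function names and the sample flag sequences are hypothetical; the Y-to-N counting rule itself follows the text.

```python
def count_scans(flags):
    """Count contiguous runs of acquired rows: append N, then count Y-to-N changes."""
    padded = list(flags) + ["N"]
    return sum(1 for a, b in zip(padded, padded[1:]) if a == "Y" and b == "N")


def scans_per_node(region_flags, region_to_node):
    """Total the per-region scan counts for each node."""
    totals = {}
    for region, flags in region_flags.items():
        node = region_to_node[region]
        totals[node] = totals.get(node, 0) + count_scans(flags)
    return totals


# Hypothetical "acquired by query" columns for three regions on two nodes.
region_flags = {1: ["Y", "Y", "N", "N"], 2: ["Y", "N", "Y"], 3: ["N", "N"]}
node_totals = scans_per_node(region_flags, {1: 1, 2: 2, 3: 1})
```

Appending N guarantees that a run of Y values reaching the end of the column is still counted as one scan.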

When the series of steps S602 to S606 has been completed for all queries, the key evaluation PG 1512 registers the query number, the number of records, and the number of scans in the table of the simulation result D605 (step S607). FIG. 10 shows a table D605 of the query simulation results. For example, for query Q1.1, the number of acquired records is 4 and the total number of scans is 3.

FIG. 11 is a flowchart of the score calculation process.

First, the key evaluation PG 1512 acquires the query simulation result from the storage unit 150 (step S701).

Next, the key evaluation PG 1512 calculates the sequential degree for each query in the query simulation result (step S702). The sequential degree can be calculated by the following equation.

Let r_n denote the number of scans of each region; regions whose number of scans is 0 are excluded. Let S denote the theoretical minimum number of scans. S can be calculated by the following equation.

Subsequently, the key evaluation PG 1512 calculates the dispersion degree for each query (step S703). The dispersion degree can be calculated by the following equation.

σ denotes the standard deviation. With N as the number of nodes, S_n as the number of scans of each node, and μ as the mean of S_n, the following equation holds.
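A plausible reading of this relation, assuming the population form of the standard deviation over the per-node scan counts (the choice between the population and sample forms is an assumption, as is the index range), is:

```latex
\mu = \frac{1}{N} \sum_{n=1}^{N} S_n,
\qquad
\sigma = \sqrt{\frac{1}{N} \sum_{n=1}^{N} \left( S_n - \mu \right)^2}
```

Under this reading, σ is 0 when every node performs the same number of scans and grows as the scans concentrate on fewer nodes.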

FIG. 11 shows the query simulation result D703, to which the sequential degree and the dispersion degree have been added.

Furthermore, the key evaluation PG 1512 takes the mean of the per-query sequential degrees as the average sequential degree and the mean of the per-query dispersion degrees as the average dispersion degree, and stores them in the storage unit 150 as the key evaluation result 1528 (step S704). FIG. 11 shows a key evaluation result D704 as an example. In the example of FIG. 11, the key candidate KeyA has a fitness of 15, an average sequential degree of 30, and an average dispersion degree of 38.
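The averaging in step S704 is a plain arithmetic mean over queries. In the Python sketch below, the per-query values are hypothetical and were chosen only so that the means match the KeyA figures of 30 and 38; the field and function names are likewise illustrative.

```python
def summarize(per_query):
    """Average the per-query sequential and dispersion degrees for one key candidate."""
    n = len(per_query)
    return {
        "avg_sequential": sum(q["sequential"] for q in per_query) / n,
        "avg_dispersion": sum(q["dispersion"] for q in per_query) / n,
    }


# Hypothetical per-query results for one key candidate.
per_query = [
    {"query": "Q1.1", "sequential": 20, "dispersion": 30},
    {"query": "Q1.2", "sequential": 40, "dispersion": 46},
]
key_result = summarize(per_query)
```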

FIG. 12A is a flowchart of the key evaluation presentation process. FIGS. 12B, 12C, 12D, and 12E show examples of screens displayed by the key evaluation presentation process.

First, the key evaluation presentation PG 1513 acquires the key evaluation result 1528 from the storage unit 150 (step S801). Next, for each key candidate, it calculates the product of the fitness, the average sequential degree, and the average dispersion degree contained in the key evaluation result 1528 (step S802). It then displays, for each key candidate, the key evaluation result, the fitness, the average sequential degree, the average dispersion degree, the score, and a dispersion weight input section on the output device 110 as screen 1 (step S803). FIG. 12B shows screen 1. Moving the slide bar of the dispersion weight input section changes the ratio between the weights of the sequential degree and the dispersion degree in the score calculation.
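The product in step S802 can be checked against the KeyA figures that appear elsewhere in the description (fitness 15, average sequential degree 30, average dispersion degree 38, score 17100). The Python sketch below covers only the unweighted product; how the slide-bar weight enters the score is not specified in this section, so it is not modeled. The function name is hypothetical.

```python
def score(fitness, avg_sequential, avg_dispersion):
    """Unweighted score of a key candidate: the product of its three metrics."""
    return fitness * avg_sequential * avg_dispersion


key_a_score = score(15, 30, 38)  # KeyA values from the example
```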

Next, when the key evaluation presentation PG 1513 detects a click on the screen by the user (designer) (step S804), it recalculates the score with the changed weight if the click relates to the weight input (step S805). If the click instead selects a key candidate, the key evaluation presentation PG 1513 first closes screen 1 showing the key evaluation result (step S806) and then displays the key design detail screen (screen 2) for the selected key (step S807).

FIG. 12C shows screen 2. In the example of screen 2 in FIG. 12C, the RDB columns contained in KeyA are CarID and Time, the score is 17100, the fitness is 15, the average sequential degree is 30, and the average dispersion degree is 38. The screen also shows a decide button for confirming the selected key as the migration key and a back button for returning to the previous screen.

Next, when the key evaluation presentation PG 1513 detects a user click (step S808) and the click is on the back button, it closes the key design detail screen (screen 2) (step S809) and returns to step S803.

If the user click detected in step S808 selects the average sequential degree, the key evaluation presentation PG 1513 displays the sequential degree detail screen (screen 3) (step S810). FIG. 12D shows screen 3. In the example of screen 3 shown in FIG. 12D, the sequential degree of KeyA for each query is displayed as a graph, together with a back button. When the key evaluation presentation PG 1513 detects a user click here (step S811), it closes the sequential degree detail screen (screen 3) (step S812) and returns to step S807.

If the user click detected in step S808 selects the average dispersion degree, the key evaluation presentation PG 1513 displays the dispersion degree detail screen (screen 4) (step S813). FIG. 12E shows screen 4. In the example of screen 4 shown in FIG. 12E, the dispersion degree of KeyA for each query is displayed as a graph, together with a back button. When the key evaluation presentation PG 1513 detects a user click here (step S814), it closes the dispersion degree detail screen (screen 4) (step S815) and returns to step S807.

If the user click detected in step S808 is on the decide button, the key evaluation presentation PG 1513 closes the key design detail screen (screen 2) (step S816), sets the selected key displayed on the key design detail screen (KeyA in this example) as the migration key, stores the migration key information in the storage unit 150 (step S817), and ends the process.

FIG. 13A is a sequence diagram of the data migration process. FIGS. 13B and 13C show examples of screens displayed by the data migration process.

The data migration PG 1514 first acquires from the storage unit 150 the vehicle information table schema definition 204 obtained from the RDB 200, the KVS information 302 obtained from the distributed KVS 300, the migration key information 1529, and the simulation table 1527 for the migration key (step S901). It then displays the data migration setting screen so that column family names, that is, the names of the column families (ColumnFamily) in the KVS, can be set (step S902). FIG. 13B shows the data migration setting screen (screen 5a). The data migration setting screen shows a correspondence table between RDB column names and KVS column family names, together with a message display section and an execute button. Next, the data migration PG 1514 accepts column family names entered by user operation (step S903). When it detects a user click on the execute button, the data migration PG 1514 clears any message currently shown in the message display section (step S905).

Next, the data migration PG 1514 determines whether all column family names have been entered (step S906). If not, it displays the message "Please enter a column family" in the message display section (step S907) and returns to step S903. FIG. 13C shows the data migration setting screen with the message displayed (screen 5b). On screen 5b, no column family name is set for the RDB column "Longitude", and the message "Please enter a column family" is displayed.
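The completeness check of step S906 can be sketched as a simple validation over the correspondence table. In the Python sketch below, the mapping structure, the function name, and the column family names "position" and "drive" are hypothetical; only the RDB column names and the message text come from the example.

```python
def missing_column_families(mapping):
    """Return the RDB columns whose column family name is empty or blank."""
    return [col for col, family in mapping.items() if not family or not family.strip()]


# Hypothetical state of the correspondence table on screen 5b.
mapping = {"Latitude": "position", "Longitude": "", "Speed": "drive"}
missing = missing_column_families(mapping)
message = "Please enter a column family" if missing else ""
```

An empty result means every column has a column family name, so the process can continue to table creation.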

If all column family names have been entered in step S906, the data migration PG 1514 creates the table definition of the KVS table (an empty table) in the migration-destination distributed KVS 300 (step S908). The KVS table can store a Key and a Value in association with each other for each column family.

Next, the data migration PG 1514 migrates the data to the distributed KVS 300 by registering the Keys and Values of the migration-key simulation table 1527 in the KVS table (step S909). At this time, the divided simulation table illustrated in FIG. 9A is stored in the KVS tables of the nodes 303, 304, and 305 of the migration-destination distributed KVS 300.
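Steps S908 and S909 can be sketched as follows, assuming the divided simulation table is available as one list of (Key, Value) rows per node. The in-memory dictionaries stand in for the distributed KVS 300, and the `family:column` naming of stored values is an assumption borrowed from common wide-column stores, not something the description specifies. The keys and values in the usage example are made up.

```python
from typing import Dict, Iterable, List, Tuple

# One row of the simulation table: (Key, {RDB column name: value}).
Row = Tuple[str, Dict[str, str]]
# Stand-in for the KVS: node id -> {Key: {"family:column": value}}.
KvsTable = Dict[int, Dict[str, Dict[str, str]]]


def create_kvs_table(node_ids: Iterable[int]) -> KvsTable:
    """S908 (sketch): create an empty KVS table on every node."""
    return {node: {} for node in node_ids}


def migrate(split_table: Dict[int, List[Row]],
            column_family: Dict[str, str],
            kvs_table: KvsTable) -> KvsTable:
    """S909 (sketch): register each Key and Value on its node."""
    for node, rows in split_table.items():
        for key, values in rows:
            kvs_table[node][key] = {
                f"{column_family[col]}:{col}": value
                for col, value in values.items()
            }
    return kvs_table


# Nodes 303, 304, 305 come from the description; the rows are hypothetical.
table = create_kvs_table([303, 304, 305])
split = {
    303: [("k1", {"Longitude": "139.7"})],
    304: [],
    305: [("k9", {"Longitude": "135.5"})],
}
migrate(split, {"Longitude": "pos"}, table)
print(table[303])
```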

FIG. 14 is a table showing calculation examples of the sequential degree and the dispersion degree. Here, the number of nodes is four, and each node stores two Regions. Node1 stores Regions R1 and R5. Node2 stores Regions R2 and R6. Node3 stores Regions R3 and R7. Node4 stores Regions R4 and R8. The logical minimum number of scans S is 3.
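The Region layout above can be written down as a lookup table, which makes it easy to count how many scans fall on each node. The following Python sketch is illustrative only: the function name and the scanned Region sequence in the usage example are hypothetical and are not taken from FIG. 14.

```python
from typing import Dict, Iterable

# Region placement as stated for FIG. 14 (two Regions per node).
REGION_TO_NODE: Dict[str, str] = {
    "R1": "Node1", "R5": "Node1",
    "R2": "Node2", "R6": "Node2",
    "R3": "Node3", "R7": "Node3",
    "R4": "Node4", "R8": "Node4",
}


def scans_per_node(scanned_regions: Iterable[str]) -> Dict[str, int]:
    """Count how many Region scans land on each node."""
    counts = {node: 0 for node in sorted(set(REGION_TO_NODE.values()))}
    for region in scanned_regions:
        counts[REGION_TO_NODE[region]] += 1
    return counts


# Hypothetical access that scans three Regions.
print(scans_per_node(["R1", "R2", "R3"]))
```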

Example 1 has a sequential degree of 100 and a dispersion degree of 0.83. Example 2 has a sequential degree of 50 and a dispersion degree of 1.30. Example 3 has a sequential degree of 37.5 and a dispersion degree of 100. Example 4 has a sequential degree of 2.3 and a dispersion degree of 33.3.

The sequential degree is highest in Example 1, where the number of scans equals the logical minimum number of scans. The dispersion degree is highest in Example 3, where scans occur the same number of times (twice) on every node.
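This passage does not restate the formulas, so the following Python sketch rests on an assumption: that the sequential degree is 100 × S / (actual number of scans). That assumption reproduces the value 100 for Example 1 (3 scans, S = 3) and 37.5 for Example 3 (two scans on each of four nodes, 8 in total). The numeric formula for the dispersion degree is not inferred here; the sketch only checks the stated condition for its maximum, namely that every node receives the same number of scans. Both function names are hypothetical.

```python
from typing import Sequence


def sequential_degree(logical_min_scans: int, actual_scans: int) -> float:
    """Assumed formula: 100 * S / actual scans (higher is better)."""
    if logical_min_scans <= 0 or actual_scans <= 0:
        raise ValueError("scan counts must be positive")
    return 100.0 * logical_min_scans / actual_scans


def is_evenly_distributed(scans_by_node: Sequence[int]) -> bool:
    """True when every node receives the same number of scans."""
    return len(set(scans_by_node)) == 1


S = 3  # logical minimum number of scans given for FIG. 14
print(sequential_degree(S, 3))               # Example 1: 100.0
print(sequential_degree(S, 8))               # Example 3: 37.5
print(is_evenly_distributed([2, 2, 2, 2]))   # Example 3: True
```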

The embodiments of the present invention described above are examples given to explain the present invention and are not intended to limit the scope of the present invention to those embodiments alone. Those skilled in the art can practice the present invention in various other forms without departing from the gist of the present invention.

100 ... key design support device, 110 ... output device, 120 ... arithmetic device, 130 ... input device, 140 ... communication device, 150 ... storage unit, 151 ... PG area, 1512 ... migration key information, 1513 ... key evaluation presentation PG, 1514 ... data migration PG, 1515 ... key candidate set creation module, 1516 ... KVS table creation module, 1517 ... query simulation module, 1518 ... score calculation module, 152 ... data area, 1521 ... RDB connection information, 1522 ... migration target client information, 1523 ... RDB usage log, 1524 ... key candidate set, 1525 ... query set, 1526 ... RDB table, 1527 ... simulation table, 1528 ... key evaluation result, 1529 ... migration key information, 200 ... RDB, 201 ... communication device, 202 ... usage log, 2021 ... client, 2023 ... query, 2024 ... result, 203 ... vehicle information table, 204 ... vehicle information table schema definition, 300 ... distributed KVS, 301 ... communication device, 302 ... KVS information, 303 ... node, 304 ... node, 305 ... node, 401 ... client, 402 ... client, 403 ... client

Claims

A database migration support device that supports creation of a distributed key-value store to which a relational database is migrated, comprising:
a memory and a processor,
wherein the memory stores a key evaluation program that generates, based on at least one column set in the relational database, at least one key candidate that is a candidate for a key to be used in the distributed key-value store, and that evaluates the key candidate based on data of the relational database and a usage log, and a key evaluation presentation program that presents an evaluation result of the key candidate, and
the processor executes the key evaluation program and the key evaluation presentation program to present the evaluation result of the key candidate.

The database migration support device according to claim 1, wherein the key evaluation program generates, based on at least one column set in the relational database, at least one key candidate that is a candidate for a key to be used in the distributed key-value store, calculates, for each of the key candidates, a matching degree indicating the degree to which the columns on which the key candidate is based appear in the usage log of the relational database and an access performance for data distributed across a plurality of nodes of the distributed key-value store, and evaluates each of the key candidates based on the access performance.

The database migration support device according to claim 2, wherein the key evaluation program calculates, for each of the key candidates, a score that includes the product of the matching degree and the access performance, and
the key evaluation presentation program presents the score of each of the key candidates.

The database migration support device according to claim 1, wherein the access performance includes a sequential degree that takes a higher value as the number of scans of the nodes becomes smaller in simulated access, based on the usage log, to the post-migration database.

The database migration support device according to claim 4, wherein the access performance further includes a dispersion degree that takes a higher value as the scans are distributed more evenly across the nodes in simulated access, based on the usage log, to the post-migration database.

The database migration support device according to claim 5, wherein the key evaluation program allows the weights of the sequential degree and the dispersion degree in the score to be changed by user operation.

The database migration support device according to claim 1, wherein the key evaluation program selects, from among the columns set in the relational database, a predetermined number of columns in descending order of the number of times they are applied by queries extracted from the usage log, extracts as a key candidate any one of the selected columns, or any combination of the selected columns, that uniquely identifies data, and further uses the primary key as a key candidate.

The database migration support device according to claim 7, wherein the key evaluation program narrows down the key candidates based on the matching degree, performs a simulation on the narrowed-down key candidates, and calculates the access performance.

The database migration support device according to claim 1, wherein the key evaluation program has the user select a client to be migrated from the relational database to the distributed key-value store, and uses the usage log of the selected client to evaluate the key candidate.

The database migration support device according to claim 1, wherein the memory further stores a data migration program,
the key evaluation presentation program displays, on a screen, the evaluation result of any one key candidate and a button for confirming that key candidate as a migration key, and
the data migration program creates the distributed key-value store using the key candidate as the migration key when the button is operated.

A database migration support method for supporting creation of a distributed key-value store to which a relational database is migrated, wherein:
key evaluation means generates, based on at least one column set in the relational database, at least one key candidate that is a candidate for a key to be used in the distributed key-value store;
the key evaluation means evaluates each of the key candidates based on data of the relational database and a usage log; and
key evaluation presentation means presents an evaluation result of the key candidate.