JP2011076139A - Document management device, information processing apparatus, system and program for managing document

Info

Publication number: JP2011076139A
Application number: JP2009223729A
Authority: JP
Inventors: Kenji Fukutome; 憲治福留
Original assignee: Konica Minolta Business Technologies Inc
Current assignee: Konica Minolta Business Technologies Inc
Priority date: 2009-09-29
Filing date: 2009-09-29
Publication date: 2011-04-14

Abstract

PROBLEM TO BE SOLVED: To avoid deterioration of performance, in a document management system including a document management device and an information processing apparatus mutually connected through a network in a data-communicable manner, by enhancing the efficiency of document data identity determination processing performed to determine the necessity of update when document data retained by the information processing apparatus are updated.

SOLUTION: The document management device 1 is connected to the information processing apparatus 11 through the network in a data-communicable manner, and manages document data composed of a plurality of document components while sharing it with the information processing apparatus. The document management device generates a hash value corresponding to the content of each document component that is a transmission object, further integrates the hash values to generate an integrated hash value, and performs, when document data retained in the information processing apparatus 11 is used, identity determination for the document data by comparison of the hash value and integrated hash value.

COPYRIGHT: (C)2011, JPO&INPIT

Description

The present invention relates to a document management apparatus, an information processing apparatus, a document management system, and a document management program, and more particularly to a technique for managing document data that is shared.

Data management systems have conventionally included client-server systems made up of a server that manages data and clients connected to that server over a network so that data communication is possible. Some of these systems keep a copy of the server-managed data on the client as well, and update the client's copy when the data managed by the server is updated. Because each such update involves sending and receiving data between the server and the client, a growing number of updates places a heavy load on network traffic and degrades the performance of the document management system. To address this, there is a technique that determines in advance whether the data contents are identical and skips the update when the contents of the data to be updated are the same, thereby preventing useless data transfers and avoiding a drop in the performance of the data management system (for example, Patent Document 1).

Patent Document 1: JP 2003-122618 A

For example, Patent Document 1 discloses a database download system including a server that can create digest data from the column data of a master database it holds. The system determines whether the column data of the master database has been updated since the most recent download by comparing the latest digest data with the digest data from the time of the client's most recent column data download.

Incidentally, client-server data management systems of this kind include document management systems in which the server is a document management apparatus, the client is an information processing apparatus, and the managed objects are document data. In such document management systems, the number of information processing apparatuses acting as clients tends to increase, owing to the recent rise in personal computer ownership and the spread of highly portable personal computers. Consequently, in a document management system in which an information processing apparatus can request the document management apparatus to update the data it holds, a very large number of update requests are sent from the information processing apparatuses to the document management apparatus.

In such an environment, a technique for determining the identity of data contents, as described above, is often applied so that document data held by the information processing apparatus is not updated when the contents of the document data to be updated are the same. Applying such a technique avoids some of the performance degradation of the document management system. Even so, at least for every update request, the document management apparatus and the information processing apparatus must exchange the data to be updated and analyze the contents of the data each holds in order to determine identity. In a document management system like the one described above, the identity determination processing itself therefore becomes a heavy burden.

For this kind of determination processing, there is a conventional technique that judges data contents by using the update date of the data. Specifically, the update date of the data held by the document management apparatus is compared with the update date of the data held by the information processing apparatus. If the update dates are the same, the contents are judged to be the same and no data is sent from the document management apparatus to the information processing apparatus; if the update dates differ, the contents are judged to differ and the data is sent. With this technique, only the update date data, rather than the data to be updated itself, needs to be exchanged between the document management apparatus and the information processing apparatus for the determination, so the load on the document management system is greatly reduced.
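The update-date comparison described above can be sketched as follows. This is only an illustration: the function name `needs_transfer` and the timestamps are hypothetical, and the source does not say how update dates are represented.

```python
from datetime import datetime

def needs_transfer(server_updated: datetime, client_updated: datetime) -> bool:
    """Return True when the update dates differ, so the data would be sent."""
    return server_updated != client_updated

# Hypothetical timestamps for one item held on both sides.
same_date = needs_transfer(datetime(2009, 9, 29, 10, 0), datetime(2009, 9, 29, 10, 0))
other_date = needs_transfer(datetime(2009, 9, 29, 10, 0), datetime(2009, 9, 28, 9, 0))
print(same_date, other_date)
```

Only the two dates are compared, so the content itself never has to cross the network for the decision. The decision is, however, only as reliable as the dates.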

However, this technique also has the following two problems. First, when, for example, a plurality of document data is stored in one folder, determining identity for all of them requires comparing update dates for each individual document data. The more document data the folder contains, the more update dates must be exchanged and compared, and the greater the processing load on the document management system. The same applies when one document data is composed of a plurality of document components: determining the identity of that one document data requires comparing update dates for each of the document components, so the number of update-date comparisons, and with it the processing load, grows with the number of document components. Second, data contents may be the same even when the update dates differ. In that case useless data is exchanged between the document management apparatus and the information processing apparatus, increasing the processing load on the document management system.

The second problem can be solved by a technique, such as that of Patent Document 1, that determines the identity of data contents by comparing digest data corresponding to those contents. Even such a technique, however, cannot solve the first problem. When document data is composed of a plurality of document components and the number of such components is large, the number of digest data comparisons also becomes large. Determining whether the document data as a whole is identical therefore cannot be done efficiently, and a burden is placed on the document management system.
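A minimal sketch of per-component digest comparison shows why it addresses the second problem but not the first. SHA-256, the helper names, and the sample component names are assumptions made for illustration; the text only speaks of "digest data".

```python
import hashlib

def digest(content: bytes) -> str:
    # SHA-256 is an assumption; the source does not name a digest algorithm.
    return hashlib.sha256(content).hexdigest()

def changed_components(server: dict[str, bytes], client: dict[str, bytes]) -> tuple[list[str], int]:
    """Compare digests one component at a time.

    Returns the names of components that differ and the number of comparisons made.
    """
    changed, comparisons = [], 0
    for name, content in server.items():
        comparisons += 1
        local = client.get(name)
        if local is None or digest(content) != digest(local):
            changed.append(name)
    return changed, comparisons

# Hypothetical document with three components, identical on both sides.
server = {"body": b"text v1", "thumbnail": b"png", "security": b"sig"}
client = dict(server)
result = changed_components(server, client)
```

Identical content is recognised as unchanged regardless of update dates, yet the comparison count still equals the number of components even when nothing has changed.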

The present invention has been made to solve the above two problems. Its object is to provide a document management apparatus, an information processing apparatus, a document management system, a document management program, and an information processing program that avoid performance degradation of the document management system by preventing useless data from being exchanged between the document management apparatus and the information processing apparatus, and by reducing the number of data comparisons, in the processing that determines the identity of the contents of the document data to be transmitted, which is performed when document data held by the information processing apparatus is updated.

To achieve the above object, the invention according to claim 1 is a document management apparatus that has document data storage means for storing a plurality of document data in a predetermined storage unit, and that performs data communication with an information processing apparatus connected via a network so as to keep the plurality of document data stored in the storage unit identical to a plurality of document data stored in a predetermined storage area of the information processing apparatus, the document management apparatus comprising: integrated digest data generating means for generating document digest data corresponding to the contents of each document data stored in the storage unit and for generating, based on the plurality of document digest data generated from the respective document data stored in the storage unit, composite digest data corresponding to the contents of the plurality of document digest data; and data discriminating means for determining, by comparing the composite digest data generated by the integrated digest data generating means, whether any of the plurality of document data stored in the storage unit differs in content from the plurality of document data stored in the predetermined storage area.

The invention according to claim 2 is the document management apparatus according to claim 1, wherein the data discriminating means identifies, by comparing the document digest data generated by the integrated digest data generating means, which of the plurality of document data stored in the storage unit differ in content from the document data stored in the predetermined storage area.

The invention according to claim 3 is the document management apparatus according to claim 1 or 2, wherein the document data stored in the storage unit is composed of a plurality of document components, the apparatus further comprises digest data generating means for generating digest data corresponding to the contents of each document component, and the integrated digest data generating means generates the document digest data corresponding to the contents of each document data based on the plurality of digest data generated from the respective document components constituting the document data stored in the storage unit.

The invention according to claim 4 is the document management apparatus according to claim 3, wherein the data discriminating means identifies, by comparing the digest data generated by the digest data generating means, which of the document components constituting the document data stored in the storage unit differ in content from the document components constituting the document data stored in the predetermined storage area.

The invention according to claim 5 is a document management apparatus that has document data storage means for storing a plurality of document data in a predetermined storage unit, and that performs data communication with an information processing apparatus connected via a network so as to keep the plurality of document data stored in the storage unit identical to a plurality of document data stored in a predetermined storage area of the information processing apparatus, the document management apparatus comprising: integrated digest data generating means for generating document digest data corresponding to the contents of each document data stored in the storage unit and for generating, based on the plurality of document digest data generated from the respective document data stored in the storage unit, composite digest data corresponding to the contents of the plurality of document digest data; and data transmitting means for transmitting the composite digest data to the information processing apparatus so that the information processing apparatus determines, by comparing the composite digest data generated by the integrated digest data generating means, whether any of the plurality of document data stored in the storage unit differs in content from the plurality of document data stored in the predetermined storage area.

The invention according to claim 6 is an information processing apparatus connected to the document management apparatus according to claim 5 via a network so that data communication is possible, the information processing apparatus comprising: data storage means for storing a plurality of document data in a predetermined storage area; and data discriminating means for determining, by comparing the composite digest data generated in the document management apparatus, whether any of the plurality of document data stored in a predetermined storage unit of the document management apparatus differs in content from the plurality of document data stored in the predetermined storage area.

The invention according to claim 7 is a document management system in which an information processing apparatus and a document management apparatus are connected via a network so that they can communicate data with each other, and which keeps the plurality of document data held by the information processing apparatus and by the document management apparatus identical, wherein: the document management apparatus comprises document data storage means for storing a plurality of document data in a predetermined storage unit, and integrated digest data generating means for generating document digest data corresponding to the contents of each document data stored in the storage unit and for generating, based on the plurality of document digest data generated from the respective document data stored in the storage unit, composite digest data corresponding to the contents of the plurality of document digest data; the information processing apparatus comprises data storage means for storing a plurality of document data in a predetermined storage area; and at least one of the document management apparatus and the information processing apparatus comprises data discriminating means for determining, by comparing the composite digest data generated by the integrated digest data generating means, whether any of the plurality of document data stored in the storage unit differs in content from the plurality of document data stored in the storage area.

The invention according to claim 8 is a document management program executed by a document management apparatus having document data storage means for storing a plurality of document data in a predetermined storage unit, the program serving to keep the plurality of document data stored in the storage unit identical to a plurality of document data stored in a predetermined storage area of an information processing apparatus connected to the document management apparatus via a network, by performing data communication with the information processing apparatus, the program causing the document management apparatus to execute the steps of: generating document digest data corresponding to the contents of each document data stored in the storage unit; generating, based on the plurality of document digest data generated from the respective document data stored in the storage unit, composite digest data corresponding to the contents of the plurality of document digest data; and determining, by comparing the composite digest data, whether any of the plurality of document data stored in the storage unit differs in content from the plurality of document data stored in the predetermined storage area.

The invention according to claim 9 is a document management program executed by a document management apparatus having document data storage means for storing a plurality of document data in a predetermined storage unit, the program serving to keep the plurality of document data stored in the storage unit identical to a plurality of document data stored in a predetermined storage area of an information processing apparatus connected to the document management apparatus via a network, by performing data communication with the information processing apparatus, the program causing the document management apparatus to execute the steps of: generating document digest data corresponding to the contents of each document data stored in the storage unit; generating, based on the plurality of document digest data generated from the respective document data stored in the storage unit, composite digest data corresponding to the contents of the plurality of document digest data; and transmitting the composite digest data to the information processing apparatus so that the information processing apparatus determines, by comparing the composite digest data, whether any of the plurality of document data stored in the storage unit differs in content from the plurality of document data stored in the predetermined storage area.

According to the present invention, the processing that determines the identity of the contents of the data to be transmitted, performed when data held by the information processing apparatus is updated, uses document digest data and composite digest data generated by integrating the digest data corresponding to the contents of the data to be transmitted. This prevents useless data from being exchanged between the document management apparatus and the information processing apparatus and reduces the number of data comparisons. Performance degradation of the document management system can therefore be avoided.
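The layered digests can be pictured with the following sketch, which is not taken from the specification. It assumes SHA-256 and assumes that digests are integrated by hashing a sorted, newline-joined list of child digests; the passage above does not specify either choice. All function names and the sample folder are hypothetical, and both sides' digests are computed locally here, whereas in the described system they would be exchanged over the network.

```python
import hashlib

def digest(data: bytes) -> str:
    # Digest of one document component (algorithm is an assumption).
    return hashlib.sha256(data).hexdigest()

def combine(digests: dict[str, str]) -> str:
    """Integrate named child digests into one digest (sorted for a stable order)."""
    joined = "\n".join(f"{name}:{value}" for name, value in sorted(digests.items()))
    return digest(joined.encode("utf-8"))

def document_digest(components: dict[str, bytes]) -> str:
    # Document digest data: built from the digests of the document's components.
    return combine({name: digest(content) for name, content in components.items()})

def folder_digest(documents: dict[str, dict[str, bytes]]) -> str:
    # Composite digest data: built from the document digests of all documents.
    return combine({name: document_digest(doc) for name, doc in documents.items()})

def documents_to_update(server: dict[str, dict[str, bytes]],
                        client: dict[str, dict[str, bytes]]) -> list[str]:
    """One top-level comparison first; per-document comparisons only on mismatch."""
    if folder_digest(server) == folder_digest(client):
        return []
    return [name for name, doc in server.items()
            if name not in client or document_digest(doc) != document_digest(client[name])]

server = {"a.txt": {"body": b"hello", "thumbnail": b"t1"}, "b.txt": {"body": b"world"}}
client = {"a.txt": {"body": b"hello", "thumbnail": b"t1"}, "b.txt": {"body": b"old"}}
stale = documents_to_update(server, client)
```

When nothing has changed, a single comparison of the top-level digest settles the question for every document and component. The finer-grained digests are consulted only after that comparison fails.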

A diagram showing a configuration example of the document management system.
A diagram showing an example of document data.
A block diagram showing an example of the hardware configuration of the document management apparatus.
A block diagram showing an example of the hardware configuration of the information processing apparatus.
A block diagram showing the detailed functional configuration realized when the control unit of the document management apparatus executes a program.
A conceptual diagram of generating the hash values of document components and the document hash value of document data.
A conceptual diagram of generating the folder hash value of a folder.
A block diagram showing the detailed functional configuration realized when the control unit of the information processing apparatus executes a system program and an application program.
A flowchart showing an example of the processing procedure when the document management apparatus receives a document component from the information processing apparatus.
A flowchart showing an example of the processing procedure when the document management apparatus receives a hash value / integrated hash value request from the information processing apparatus.
A flowchart showing an example of the processing procedure when a user clicks a folder on the information processing apparatus.
A flowchart showing an example of the detailed processing procedure of the data determination processing.
A diagram for explaining the data discrimination processing using a specific example.
A diagram for explaining the data discrimination processing using a specific example.
A diagram for explaining the data discrimination processing using a specific example.

Preferred embodiments of the present invention are described in detail below with reference to the drawings. In the embodiments described below, members common to each other are given the same reference numerals, and redundant descriptions of them are omitted.

FIG. 1 shows a configuration example of the document management system in this embodiment. This document management system is a so-called client-server document management system and includes a document management apparatus 1 that functions as a server and a plurality of information processing apparatuses 11 (11a, 11b, ...) that function as clients. The document management apparatus 1 and each of the information processing apparatuses 11 are connected via a network 10, such as a LAN or WAN, so that they can communicate data with one another.

In the document management system, the document management apparatus 1 and the information processing apparatus 11 hold the same document data. When document data held by the document management apparatus 1 is updated, the information processing apparatus 11 downloads the updated document data from the document management apparatus 1, which keeps the document data held by the document management apparatus 1 and by the information processing apparatus 11 consistent. The document management apparatus 1 therefore holds document data corresponding to the document data held in the information processing apparatus 11 connected via the network 10. The correspondence between the document data held in the document management apparatus 1 and the document data held in the information processing apparatus 11 is determined by a predetermined property of the document data, for example the file name. That is, if the property of document data held in the document management apparatus 1 matches the property of document data held in the information processing apparatus 11, those document data correspond to each other.
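As a rough illustration of this property-based correspondence, the sketch below pairs documents whose chosen property matches. The dictionary layout, the property key `file_name`, and the function name are assumptions; the text only says that a predetermined property such as the file name is used.

```python
def match_by_property(server_docs: list[dict], client_docs: list[dict],
                      prop: str = "file_name") -> list[tuple[dict, dict]]:
    """Pair each server-side document with the client-side document sharing the property."""
    client_index = {doc[prop]: doc for doc in client_docs}
    return [(doc, client_index[doc[prop]])
            for doc in server_docs if doc[prop] in client_index]

# Hypothetical holdings on the server and on one client.
server_docs = [{"file_name": "report.txt"}, {"file_name": "minutes.txt"}]
client_docs = [{"file_name": "report.txt"}]
pairs = match_by_property(server_docs, client_docs)
```

Here only `report.txt` is paired, because `minutes.txt` has no counterpart with the same property on the client.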

In the example of FIG. 1, the document management apparatus 1 holds document data 100a. The information processing apparatus 11a holds document data 100b having the same predetermined property as the document data 100a, and the information processing apparatus 11b holds document data 100c having the same predetermined property as the document data 100a. In this case, the document data 100a held in the document management apparatus 1 and the document data 100b and 100c held in the information processing apparatuses 11a and 11b, respectively, correspond to one another.

FIG. 2 shows an example of the document data 100 in this embodiment. Here, document data 100 is a collective term for the document data 100a held by the document management apparatus 1 and the document data 100b and 100c held by the information processing apparatuses 11a and 11b, respectively. The document data 100 is composed of a plurality of document components 190. Examples of the document components 190 include document body data 101, a thumbnail 102, a database 103, and security information 104. The document body data 101 is the body of a document file, such as a text file. The thumbnail 102 is thumbnail data of the document body data 101. The database 103 is database data that manages property information of the document body data 101, such as its owner information and access rights. The security information 104 is data relating to the security of the document body data 101, such as an electronic signature.

The document data 100 includes at least the document body data 101 as a document component 190. Whether the document data 100 includes any of the three document components other than the document body data 101 is optional. The document data 100 may thus consist of the document body data 101 alone, or of three document components 190, namely the document body data 101, the database 103, and the security information 104. The database 103 is not necessarily limited to managing property information of the document body data 101 and may manage other information. The document data 100 may also include other data as document components 190.
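One way to picture this structure is a record with a mandatory body and optional further components. The class and field names below are hypothetical, and representing each component as raw bytes is an assumption made for the sketch.

```python
from dataclasses import dataclass, field
from typing import Optional

@dataclass
class DocumentData:
    body: bytes                          # document body data 101 (always present)
    thumbnail: Optional[bytes] = None    # thumbnail 102 (optional)
    database: Optional[bytes] = None     # database 103 (optional)
    security: Optional[bytes] = None     # security information 104 (optional)
    extras: dict[str, bytes] = field(default_factory=dict)  # any other components

    def components(self) -> dict[str, bytes]:
        """Return only the components this document actually contains."""
        named = {"body": self.body, "thumbnail": self.thumbnail,
                 "database": self.database, "security": self.security}
        present = {name: data for name, data in named.items() if data is not None}
        present.update(self.extras)
        return present

body_only = DocumentData(body=b"text")
three_parts = DocumentData(body=b"text", database=b"owner=a", security=b"signature")
```

`body_only` and `three_parts` mirror the two cases mentioned in the paragraph above: a document made of the body alone, and one made of the body, the database, and the security information.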

Returning to FIG. 1, the document management apparatus 1 holds document components 190a as the data constituting the document data 100a. Each document component 190a corresponds to a document component 190b constituting the document data 100b held in the information processing apparatus 11a, and likewise to a document component 190c constituting the document data 100c held in the information processing apparatus 11b. Whether a document component 190a held in the document management apparatus 1 corresponds to a document component 190b or 190c held in the information processing apparatuses 11a and 11b is determined by whether they are document components of the same type, which is judged, for example, from the file extension of the data. If a document component 190a of the document data 100a and a document component 190b or 190c of the document data 100b or 100c are of the same type, they are mutually corresponding document components.
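The type-based correspondence described above can be illustrated with a minimal sketch. It assumes that each document component is stored as a file and that the file extension alone identifies its type; the function names and file names are hypothetical and do not appear in the source.

```python
from pathlib import PurePath
from typing import Dict, List, Optional


def component_kind(filename: str) -> str:
    """Return the component type, taken here to be the lower-cased extension."""
    return PurePath(filename).suffix.lower()


def match_components(server_files: List[str],
                     client_files: List[str]) -> Dict[str, Optional[str]]:
    """Map each server-side component to the client-side component of the
    same type, or to None when the client holds no such component."""
    client_by_kind = {component_kind(name): name for name in client_files}
    return {name: client_by_kind.get(component_kind(name))
            for name in server_files}


pairs = match_components(["report.txt", "report.png"],
                         ["copy.PNG", "copy.txt"])
```

In this sketch `report.txt` pairs with `copy.txt` and `report.png` with `copy.PNG`; a real system could equally use another property to decide the type.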

In the example of FIG. 1, the document data 100a held by the document management apparatus 1 includes a document component 190a. In this case, the document data 100b held by the information processing apparatus 11a includes a document component 190b of the same type as the document component 190a, and the document data 100c held by the information processing apparatus 11b includes a document component 190c of the same type as the document component 190a. Specifically, if the document data 100a held by the document management apparatus 1 includes the thumbnail 102, the document data 100b held by the information processing apparatus 11a also includes the corresponding thumbnail 102. Similarly, if the document data 100a includes the database 103, the document data 100c held by the information processing apparatus 11b also includes the corresponding database 103.

The document management system of the present embodiment is configured to transmit and receive data between the document management apparatus 1 and the information processing apparatuses 11 via the network 10 so that the content of the document data 100a held by the document management apparatus 1 and the content of the document data 100b and 100c held by the respective information processing apparatuses 11 become identical. Here, the content of the document data 100a and that of the document data 100b being identical means that the content of every document component 190a included in the document data 100a is identical to the content of the corresponding document component 190b included in the document data 100b. For example, suppose that the document data 100a consists of document body data 101a, a thumbnail 102a, a database 103a, and security information 104a, and that the document data 100b consists of document body data 101b, a thumbnail 102b, a database 103b, and security information 104b. If the document body data 101a and 101b have the same content, the thumbnails 102a and 102b have the same content, the databases 103a and 103b have the same content, and the security information 104a and 104b have the same content, then the content of the document data 100a and the content of the document data 100b are identical. Similarly, the content of the document data 100a and that of the document data 100c being identical means that the content of every document component 190a is identical to the content of the corresponding document component 190c.
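The definition of identity given above can be written as a short sketch. It assumes, purely for illustration, that a document is represented as a mapping from component type to raw bytes; the type alias and function name are hypothetical.

```python
from typing import Dict

# Hypothetical representation: component type -> component content.
Document = Dict[str, bytes]


def documents_identical(doc_a: Document, doc_b: Document) -> bool:
    """True only if both documents hold the same component types and every
    pair of corresponding components has identical content."""
    if doc_a.keys() != doc_b.keys():
        return False
    return all(doc_a[kind] == doc_b[kind] for kind in doc_a)


doc_100a = {"body": b"text", "thumbnail": b"\x89PNG", "database": b"owner=x"}
doc_100b = dict(doc_100a)
same = documents_identical(doc_100a, doc_100b)
```

Comparing full contents in this way is the baseline that the hash-based determination described later in the embodiment is meant to avoid.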

The document management apparatus 1 stores and manages the document data 100a in a predetermined folder 200a. The information processing apparatus 11a stores and manages the document data 100b in a folder 200b corresponding to the folder 200a in the document management apparatus 1. Similarly, the information processing apparatus 11b stores and manages the document data 100c in a folder 200c corresponding to the folder 200a in the document management apparatus 1. Here, the correspondence between the folder 200a in the document management apparatus 1 and the folders 200b and 200c in the information processing apparatuses 11 is determined by a predetermined property of each folder, such as the folder name. That is, if the property of the folder 200a of the document management apparatus 1 matches the property of the folder 200b or 200c of an information processing apparatus 11, those folders correspond to each other. In the example of FIG. 1, the document management apparatus 1 stores the document data 100a in the folder 200a. In this case, the information processing apparatus 11a stores the document data 100b in the folder 200b, which has the same folder name as the folder 200a, and the information processing apparatus 11b stores the document data 100c in the folder 200c, which also has the same folder name as the folder 200a.

Accordingly, in the document management system of the present embodiment, the data holding structure of the document data 100a in the document management apparatus 1 is the same as the data holding structure of the document data 100b and 100c in the information processing apparatuses 11, and the document management apparatus 1 and each information processing apparatus 11 share and hold the same document data.

Each information processing apparatus 11 (11a, 11b, ...) is, for example, a commercially available personal computer (PC). The information processing apparatus 11a can update the document data 100a held by the document management apparatus 1 via the network 10. That is, by transmitting the document data 100b to the document management apparatus 1, it can cause the document management apparatus 1 to rewrite the content of the document data 100a that the document management apparatus 1 holds. The information processing apparatus 11a can also update the content of the document data 100b that it holds with the document data 100a received from the document management apparatus 1 via the network 10. Similarly, the information processing apparatus 11b can update the document data 100a held by the document management apparatus 1 with the document data 100c that it holds, and can update the document data 100c that it holds with the document data 100a held by the document management apparatus 1.

In the document management system of the present embodiment, when a user of an information processing apparatus 11 uses the document data 100, the document data 100 that the information processing apparatus 11 refers to is the document data 100 held in that information processing apparatus 11. In other words, the user views the document data 100 held by the information processing apparatus 11. Suppose, for example, that the user of the information processing apparatus 11a edits the document data 100b and thereby updates it. The information processing apparatus 11a then transmits the updated document data 100b to the document management apparatus 1, and the content of the document data 100a is updated to the content of the updated document data 100b. As a result, the information processing apparatus 11b now holds document data 100c whose content differs from the document data 100a held by the document management apparatus 1. If, in this state, the information processing apparatus 11b simply displays the document data 100c that it holds, its user ends up using document data 100c whose content differs from the document data 100a held by the document management apparatus 1, and the documents are no longer consistent across the system.

For this reason, in the client-server document management system of the present embodiment, when the document data 100b or 100c held in an information processing apparatus 11 is to be used, it is determined whether the document data 100a held in the document management apparatus 1 has been updated and now differs. If it has been updated to different data, the information processing apparatus 11 downloads the updated document data 100a from the document management apparatus 1 and uses it to update the document data 100b or 100c that it holds. As a result, each information processing apparatus 11 can display information to the user based on the updated document data 100b or 100c, and the documents used throughout the system are kept consistent.

This document management system is described in more detail below. Since each of the plurality of information processing apparatuses 11 provided in the document management system has the same functions and configuration as the information processing apparatus 11a, the following description takes the information processing apparatus 11a as an example of the information processing apparatuses 11.

FIG. 3 is a block diagram showing an example of the hardware configuration of the document management apparatus 1. As shown in FIG. 3, the document management apparatus 1 includes a control unit 20, a network interface 23, and a storage device 30, which are connected via a data bus 24. The control unit 20 includes a CPU 21 and a memory 22. The CPU 21 controls the operation of each unit by reading out and executing a program 33 stored in the storage device 30. The memory 22 stores temporary data and the like while the CPU 21 executes the program 33.

The network interface 23 performs data communication with the information processing apparatuses 11 via the network 10.

The storage device 30 is a nonvolatile storage device such as a hard disk drive. The storage device 30 stores the program 33, which is installed in advance in the document management apparatus 1. The storage device 30 is also provided with a document data storage unit 31 that stores the document data 100a, and a hash value / integrated hash value storage unit 32 that stores the hash values and integrated hash values generated in the document management apparatus 1. The hash values and integrated hash values are fixed-length data corresponding to the content of the document data and the like, and are described later. As described above, the document data storage unit 31 is provided with the folder 200a, which corresponds to the folder 200b of the information processing apparatus 11, as a storage area (storage unit) for storing the document data 100a.

FIG. 4 is a block diagram showing an example of the hardware configuration of the information processing apparatus 11a. As shown in FIG. 4, the information processing apparatus 11a includes a control unit 40, a network interface 43, a display unit 44, an input unit 45, and a storage device 50, which are connected via a data bus 46.

The control unit 40 includes a CPU 41 and a memory 42. The CPU 41 controls each unit and performs various arithmetic processes by reading out and executing programs 53 and 54 stored in the storage device 50. The memory 42 stores data temporarily generated while the programs 53 and 54 are executed.

The network interface 43 performs data communication with the document management apparatus 1 via the network 10.

The display unit 44 is display means that displays document information based on the document data as well as other information, and is, for example, a liquid crystal display. The input unit 45 is input means through which the user enters various signals by operating it, and includes, for example, a keyboard and a mouse.

The storage device 50 is a nonvolatile storage device such as a hard disk drive. The storage device 50 stores the programs 53 and 54, which are installed in advance in the information processing apparatus 11. The system program 53 is an operating system. The application program 54 is a program for, among other things, viewing the document data 100b held by the apparatus itself and updating the document data 100a held by the document management apparatus 1. The storage device 50 is also provided with a document data storage unit 51 that stores the document data 100b, and a hash value / integrated hash value storage unit 52 that stores the hash values and integrated hash values received from the document management apparatus 1. As described above, the document data storage unit 51 is provided with the folder 200b, which corresponds to the folder 200a of the document management apparatus 1, as a storage area for storing the document data 100b.

As described above, the information processing apparatus 11a can update the document data 100a held by the document management apparatus 1 with the document data 100b that it holds. Similarly, the other information processing apparatus 11b may also update the document data 100a held by the document management apparatus 1. When the other information processing apparatus 11b has updated the document data 100a held by the document management apparatus 1, the document data 100b held by the information processing apparatus 11a differs in content from the document data 100a of the document management apparatus 1. The information processing apparatus 11a therefore updates the document data 100b that it holds with the document data 100a held by the document management apparatus 1, so that the document data 100b has the same content as the document data 100a. In the present embodiment, the process in which the information processing apparatus 11a downloads data from the document management apparatus 1 and updates the document data 100b is performed when, for example, the user instructs the information processing apparatus 11a to view the document data 100b.

At this time, the present embodiment determines whether any of the document components 190a included in the document data 100a held by the document management apparatus 1 differs in content from the document components 190b included in the document data 100b selected for viewing or the like on the information processing apparatus 11a, and, if any such component exists, identifies that document component 190a. This determination process may be performed in the information processing apparatus 11a or in the document management apparatus 1. Once a document component 190a whose content differs from the document component 190b has been identified, the document management apparatus 1 transmits only the identified document component 190a to the information processing apparatus 11a. The document data 100b held by the information processing apparatus 11a can thereby be updated. The present embodiment illustrates the case where the determination process is performed in the information processing apparatus 11a. Performing the determination process in the information processing apparatus 11a greatly reduces the processing load on the document management apparatus 1, even when, for example, a large number of information processing apparatuses 11 are connected to the document management apparatus 1 via the network 10.

When the information processing apparatus 11a performs the determination process, it does so by referring to hash values (digest data) generated by the document management apparatus 1. The details are as follows. The present embodiment uses three kinds of hash values to determine whether there is a document component 190a whose content differs from the document component 190b: a hash value (digest data), which is data corresponding to the content of a document component 190a or 190b; a document hash value (document digest data), which is data corresponding to the content of the document data 100a or 100b; and a folder hash value (composite digest data), which is data corresponding to the content of the folder 200a or 200b. The folder content to which the folder hash value corresponds is described later.

First, the processing performed in the document management apparatus 1 is described. The document management apparatus 1 generates, for each of the plurality of document components 190a, a hash value corresponding to the content of that document component 190a. It then generates a document hash value corresponding to the content of the document data 100a and a folder hash value corresponding to the content of the folder 200a. The document management apparatus 1 stores these hash values in association with the data holding structure of the document data 100a. When the information processing apparatus 11a requests transmission of a document component 190a, the document management apparatus 1 transmits these hash values to the information processing apparatus 11a together with the document component 190a. The information processing apparatus 11a thus holds hash values corresponding to the data holding structure of the document data 100b.
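The three levels of hash values generated by the document management apparatus 1 can be sketched as follows. This is only an illustration under stated assumptions: SHA-256 is used as the hash function, a document hash value is computed by hashing the sorted component hash values, and a folder hash value is computed by hashing the sorted document hash values. The passage above does not specify any of these choices, and all function and key names are hypothetical.

```python
import hashlib
from typing import Dict


def component_hash(content: bytes) -> str:
    """Hash value for one document component (SHA-256 is an assumption)."""
    return hashlib.sha256(content).hexdigest()


def integrate(named_hashes: Dict[str, str]) -> str:
    """Assumed integration rule: hash the (name, hash) pairs in sorted order."""
    h = hashlib.sha256()
    for name in sorted(named_hashes):
        h.update(name.encode("utf-8") + b"\0")
        h.update(named_hashes[name].encode("ascii") + b"\n")
    return h.hexdigest()


def build_hash_tree(folder: Dict[str, Dict[str, bytes]]) -> dict:
    """Build hash values mirroring the folder -> document -> component layout."""
    documents = {}
    for doc_name, components in folder.items():
        hashes = {kind: component_hash(data)
                  for kind, data in components.items()}
        documents[doc_name] = {"components": hashes,
                               "document_hash": integrate(hashes)}
    folder_hash = integrate({name: doc["document_hash"]
                             for name, doc in documents.items()})
    return {"folder_hash": folder_hash, "documents": documents}


folder_200a = {"doc1": {"body": b"hello", "thumbnail": b"img"},
               "doc2": {"body": b"other"}}
tree = build_hash_tree(folder_200a)
```

With this layout, a change to a single component changes that component's hash value, the document hash value of its document, and the folder hash value, while the hash values of untouched documents stay the same.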

If the document data 100a or its document components 190a held by the document management apparatus 1 have the same content as the document data 100b or its document components 190b held by the information processing apparatus 11a, the hash values held by the document management apparatus 1 match the hash values held by the information processing apparatus 11a. Conversely, if the document data 100a or its document components 190a held by the document management apparatus 1 differ in content from the document data 100b or its document components 190b held by the information processing apparatus 11a, the hash values held by the document management apparatus 1 differ from the hash values held by the information processing apparatus 11a.

Therefore, in the present embodiment, when the information processing apparatus 11a performs the determination process, it acquires the hash values held by the document management apparatus 1 at that time and compares them with the hash values that it holds itself. Specifically, it first compares the document hash values and folder hash values to determine whether the content of the document data 100a differs from that of the document data 100b, and then uses the hash value of each document component to identify any document component 190a whose content differs from the document component 190b. Thus, for example, when the document data 100a and the document data 100b have the same content, comparing their document hash values is enough to establish that all of the document components 190a and 190b have the same data content, without comparing the document components 190a and 190b individually. Likewise, comparing the folder hash value of the folder 200a with the folder hash value of the folder 200b can establish that all of the document data have the same content, without comparing them individually, even when each of the folders 200a and 200b stores a plurality of document data. In that case it is clear that all of the document components 190a and 190b have the same content, without individually comparing each of the document components 190 included in each document data.

Thus, in the present embodiment, when determining whether any of the document components 190a held by the document management apparatus 1 differs in content from the document components 190b held by the information processing apparatus 11a, the hash values are compared in the order of folder hash value, document hash value, and then hash value. This determination method keeps down the number of data comparisons needed to identify all document components with differing content, and thereby avoids a decline in the performance of the document management system.

In the present embodiment, the document hash value and the folder hash value are collectively referred to as integrated hash values. In the determination method described above, the integrated hash values are therefore used first to determine whether the content of the folder 200a differs from the content of the folder 200b and, if it does, then to determine whether the content of the document data 100a differs from the content of the document data 100b. If that also differs, the hash values are then used to identify the document component 190a whose content differs from the document component 190b.
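The comparison order described above (folder hash value, then document hash value, then per-component hash value) can be sketched as follows. The nested dictionary layout and the function name are assumptions made for illustration, and the hash strings in the example are placeholders rather than real digests.

```python
from typing import List, Tuple


def find_changed_components(server: dict, client: dict) -> List[Tuple[str, str]]:
    """Return (document, component) pairs whose server-side hash value differs
    from the client-side one, descending only where a higher-level hash differs."""
    # Level 1: matching folder hash values mean nothing below needs checking.
    if server["folder_hash"] == client["folder_hash"]:
        return []
    changed = []
    for name, s_doc in server["documents"].items():
        c_doc = client["documents"].get(name)
        # Level 2: skip documents whose document hash values match.
        if c_doc is not None and s_doc["document_hash"] == c_doc["document_hash"]:
            continue
        c_components = c_doc["components"] if c_doc is not None else {}
        # Level 3: compare component hash values only inside changed documents.
        for kind, s_hash in s_doc["components"].items():
            if c_components.get(kind) != s_hash:
                changed.append((name, kind))
    return changed


server_hashes = {
    "folder_hash": "F2",
    "documents": {
        "doc1": {"document_hash": "D1b",
                 "components": {"body": "h1", "thumbnail": "h2b"}},
        "doc2": {"document_hash": "D2",
                 "components": {"body": "h3"}},
    },
}
client_hashes = {
    "folder_hash": "F1",
    "documents": {
        "doc1": {"document_hash": "D1a",
                 "components": {"body": "h1", "thumbnail": "h2a"}},
        "doc2": {"document_hash": "D2",
                 "components": {"body": "h3"}},
    },
}
diff = find_changed_components(server_hashes, client_hashes)
```

Here only the thumbnail of `doc1` is reported; the components of `doc2` are never compared individually because its document hash values match.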

In the document management system of the present embodiment, when the user attempts to use the document data 100b on the information processing apparatus 11a, the document management apparatus 1 first transmits the hash values and integrated hash values that it has generated to the information processing apparatus 11a so that the data determination can be performed. The information processing apparatus 11a performs the data determination using the hash values and integrated hash values received from the document management apparatus 1 and, if there is a document component 190a whose content differs from the document component 190b, requests the document management apparatus 1 to transmit that document component 190a. In response, the document management apparatus 1 transmits only the requested document component 190a to the information processing apparatus 11a. The information processing apparatus 11a then updates the document component 190b that it holds with the document component 190a received from the document management apparatus 1. Through this series of processes, the document data 100a and the document data 100b come to have the same data content. In other words, data identity is maintained between the document management apparatus 1 and the information processing apparatus 11a.
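The exchange described above can be sketched with two in-memory objects standing in for the document management apparatus 1 and the information processing apparatus 11a. This is a simplified illustration for a single document: the class and method names are hypothetical, SHA-256 and the way the document hash value is derived are assumptions, and direct method calls replace the network 10.

```python
import hashlib
from typing import Dict, List


def digest(data: bytes) -> str:
    return hashlib.sha256(data).hexdigest()


def document_hash(component_hashes: Dict[str, str]) -> str:
    # Assumed integration rule: hash the sorted "type:hash" pairs.
    joined = "".join(f"{k}:{component_hashes[k]};" for k in sorted(component_hashes))
    return digest(joined.encode("utf-8"))


class Server:
    """Stands in for the document management apparatus 1."""

    def __init__(self, components: Dict[str, bytes]) -> None:
        self.components = dict(components)
        self.sent: List[str] = []  # records which components were transmitted

    def get_hashes(self) -> dict:
        hashes = {k: digest(v) for k, v in self.components.items()}
        return {"document_hash": document_hash(hashes), "components": hashes}

    def get_component(self, kind: str) -> bytes:
        self.sent.append(kind)
        return self.components[kind]


class Client:
    """Stands in for the information processing apparatus 11a."""

    def __init__(self, components: Dict[str, bytes], hashes: dict) -> None:
        self.components = dict(components)
        self.hashes = hashes  # hash values previously received from the server

    def sync(self, server: Server) -> None:
        remote = server.get_hashes()
        if remote["document_hash"] != self.hashes["document_hash"]:
            for kind, h in remote["components"].items():
                if self.hashes["components"].get(kind) != h:
                    # Request only the component whose hash value differs.
                    self.components[kind] = server.get_component(kind)
        self.hashes = remote


server = Server({"body": b"v1", "thumbnail": b"thumb", "database": b"owner=a"})
client = Client(server.components, server.get_hashes())
server.components["body"] = b"v2"  # another apparatus updated the body
client.sync(server)
```

After `sync`, the client holds the same components as the server, and `server.sent` shows that only the changed body was transferred.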

Because the present embodiment performs the above determination process using hash values and integrated hash values, the data processing load is lower than with conventional methods, and a decline in the performance of the document management system as a whole can be avoided. Although the above example mainly describes the relationship between the information processing apparatus 11a and the document management apparatus 1, the same applies to the other information processing apparatus 11b.

Next, the specific internal functions of the document management apparatus 1 that realize the above operations are described. FIG. 5 is a block diagram showing the detailed functional configuration realized when the control unit 20 of the document management apparatus 1 executes the program 33. FIG. 5 shows only the blocks relating to two functions: the function of generating new hash values and integrated hash values for the updated document data 100a after the document data 100a held by the document management apparatus 1 has been updated, and the function by which the document management apparatus 1 transmits document components 190a to the information processing apparatus 11a, so that the document data 100a and the document data 100b have the same content, when the user operates the information processing apparatus 11a to view the document data 100b. Other functions are omitted from the figure.

As shown in FIG. 5, by executing the program 33 the control unit 20 functions as a document data update unit 60, a hash value generation unit 61, an integrated hash value generation unit 62, a document hash value generation unit 63, a folder hash value generation unit 64, a hash value / integrated hash value transmission unit 65, and a document component transmission unit 66.

The document data update unit 60 updates the document data 100a held by the document management apparatus 1. Specifically, when a document component 190b is transmitted from the information processing apparatus 11a, the document data update unit 60 overwrites the document component 190a stored in the document data storage unit 31 with the transmitted document component 190b. As a result, the content of the document component 190a becomes identical to the content of the transmitted document component 190b.

The hash value generation unit 61 operates when the document data update unit 60 has updated the document data 100a, and generates a hash value from each document component 190a rewritten by the document data update unit 60. Specifically, it inputs the rewritten document component 190a to a predetermined hash function stored in the storage device 30 and obtains a hash value, which is fixed-length data corresponding to the content of the document component 190a. In other words, the hash value generation unit 61 generates a hash value corresponding to the rewritten content of the document component 190a. Here, a hash function is a function that outputs fixed-length data corresponding to the content of its input, and it outputs different data as the hash value when the content of the input differs. Comparing hash values therefore makes it possible to identify a document component 190a whose content differs from that of the document component 190b. The hash value generation unit 61 then uses the generated hash value to update the hash value stored in the hash value storage unit 34, which is provided in the hash value / integrated hash value storage unit 32. Specifically, the hash value of the document component 190a before rewriting, stored in the hash value storage unit 34, is overwritten with the newly generated hash value of the document component 190a after rewriting.

FIG. 6(a) is a conceptual diagram of document hash value generation for the document data 110a updated by the document data update unit 60. Here, the document data 110a is an example of the document data 100a. That is, it is assumed that the document management apparatus 1 holds the document data 110a and that the information processing apparatus 11a holds document data 110b having the same properties as the document data 110a. In FIG. 6, the document components 190a are the document body data 111a, the thumbnail 112a, the database 113a, and the security information 114a, and all four are assumed to have been rewritten by the document data update unit 60. In this case, the hash value generation unit 61 inputs the document body data 111a to the hash function 80 and obtains a hash value 311a corresponding to its content. Likewise, it inputs the thumbnail 112a to obtain a hash value 312a, the database 113a to obtain a hash value 313a, and the security information 114a to obtain a hash value 314a, each corresponding to the content of its input. In this way, the hash value generation unit 61 generates a hash value for each document component 190a included in the document data 110a updated by the document data update unit 60.
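The per-component hashing of FIG. 6(a) can be sketched as follows. This is only an illustration under stated assumptions: the description does not name the hash function 80, so SHA-256 from Python's standard library stands in for it, and the component names, contents, and the `hash_store` dictionary (playing the role of the hash value storage unit 34) are hypothetical.

```python
import hashlib

def component_hash(content: bytes) -> str:
    """Return a fixed-length digest for one document component."""
    # Assumption: SHA-256 stands in for the unspecified hash function 80.
    return hashlib.sha256(content).hexdigest()

# Hypothetical contents for the four components shown in FIG. 6(a).
components = {
    "body_111a": b"document body, revision 2",
    "thumbnail_112a": b"thumbnail bytes",
    "database_113a": b"index rows",
    "security_114a": b"acl: read-only",
}

# Plays the role of the hash value storage unit 34.
hash_store = {name: component_hash(data) for name, data in components.items()}

# Rewriting one component replaces only that component's stored hash.
previous_body_hash = hash_store["body_111a"]
components["body_111a"] = b"document body, revision 3"
hash_store["body_111a"] = component_hash(components["body_111a"])
```

Each stored value has the same length regardless of the size of the component, which is what makes the later comparisons cheap.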

The integrated hash value generation unit 62 includes the document hash value generation unit 63 and the folder hash value generation unit 64. The document hash value generation unit 63 generates document hash values, and the folder hash value generation unit 64 generates folder hash values. The document hash value generation unit 63 operates when the hash value generation unit 61 has generated a new hash value, and the folder hash value generation unit 64 operates after the document hash value generation unit 63 has finished generating the document hash value.

For document data 100a updated by the document data update unit 60, the document hash value generation unit 63 generates a document hash value corresponding to the content of the document data 100a, based on each of the document components 190a that make up the document data 100a. Specifically, it inputs the hash values corresponding to the contents of those document components 190a to the hash function and obtains a document hash value, which is fixed-length data corresponding to the content of the document data 100a. For a document component 190a rewritten by the document data update unit 60, it uses the hash value generated by the hash value generation unit 61. For a document component 190a that has not been rewritten, it uses the hash value read from the hash value storage unit 34. In other words, the document hash value generation unit 63 generates a document hash value corresponding to the updated content of the document data 100a. As with an ordinary hash value, the document hash value takes a different value if at least one of the hash values input to the hash function is different. Comparing document hash values therefore makes it possible to determine whether the contents of the document components 190a that make up the document data 100a differ from the contents of the document components 190b that make up the document data 100b.

FIG. 6(b) is a conceptual diagram of document hash value generation for the document data 110a. The document hash value generation unit 63 inputs to the hash function 80 the four hash values that the hash value generation unit 61 generated in FIG. 6(a): the hash value 311a from the document body data 111a, the hash value 312a from the thumbnail 112a, the hash value 313a from the database 113a, and the hash value 314a from the security information 114a. It thereby obtains a document hash value 410a corresponding to the content of the document data 110a.
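A minimal sketch of the integration step in FIG. 6(b) follows. The description does not say how the component hashes are fed to the hash function 80, so joining them in a fixed order with newlines is an assumption, as is the use of SHA-256; the variable names mirror the reference numerals only for readability.

```python
import hashlib

def sha256_hex(data: bytes) -> str:
    """Stand-in for hash function 80 (assumption: SHA-256)."""
    return hashlib.sha256(data).hexdigest()

def document_hash(component_hashes):
    """Combine component hashes, in a fixed order, into one document hash."""
    # Assumption: the hashes are joined with newlines and hashed again.
    return sha256_hex("\n".join(component_hashes).encode("ascii"))

# Hypothetical component hashes corresponding to 311a-314a.
h311a = sha256_hex(b"document body")
h312a = sha256_hex(b"thumbnail")
h313a = sha256_hex(b"database")
h314a = sha256_hex(b"security information")
doc_410a = document_hash([h311a, h312a, h313a, h314a])

# When only the body is rewritten, its hash is regenerated and the
# other three stored hashes are reused unchanged.
h311a_new = sha256_hex(b"document body, edited")
doc_410a_new = document_hash([h311a_new, h312a, h313a, h314a])
```

A change in any single component hash changes the document hash, so one comparison per document is enough to decide whether its components need to be examined individually.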

After the document hash value generation unit 63 has generated the document hash value as described above, the folder hash value generation unit 64 generates a folder hash value for the folder 200a that stores the document data 100a updated by the document data update unit 60. Specifically, one of the following two generation methods is used, depending on whether the folder 200a has a lower folder.

(1) For a folder 200a that has no lower folder, a folder hash value is generated from the document hash values of all the document data 100a stored in that folder. Specifically, the document hash values corresponding to the contents of the respective document data 100a stored in the folder are input to the hash function stored in the storage device 30, and a folder hash value, which is fixed-length data corresponding to those document hash values, is obtained. For document data 100a updated by the document data update unit 60, the document hash value generated by the document hash value generation unit 63 is used. For document data 100a that has not been updated, the document hash value read from the integrated hash value storage unit 35 is used.

FIG. 7(a) is a conceptual diagram of folder hash value generation for a folder 210a that has no lower folder. Here, the folder 210a is an example of the folder 200a. That is, it is assumed that the document management apparatus 1 has the folder 210a and that the information processing apparatus 11a has a folder 210b having the same properties as the folder 210a. In FIG. 7, all the document data 100a are assumed to have been updated by the document data update unit 60. The folder 210a stores document data 110a, document data 120a, and document data 130a, each an example of the document data 100a. After the document hash value generation unit 63 has generated the document hash value 410a of the document data 110a, the document hash value 420a of the document data 120a, and the document hash value 430a of the document data 130a by the document hash value generation method described above, the folder hash value generation unit 64 inputs the document hash values 410a, 420a, and 430a to the hash function 80. It thereby obtains a folder hash value 510a of the folder 210a corresponding to the contents of the document hash values 410a, 420a, and 430a.

(2) For a folder 200a that has a lower folder, a folder hash value is generated from the document hash values of all the document data 100a stored in the folder 200a together with the folder hash value of the folder one level below. Specifically, the document hash values corresponding to the contents of the respective document data 100a stored in the folder 200a and the folder hash value of the folder one level below are input to the hash function stored in the storage device 30, and a folder hash value, which is fixed-length data corresponding to those inputs, is obtained. For document data 100a updated by the document data update unit 60, the document hash value generated by the document hash value generation unit 63 is used. For document data 100a that has not been updated, the document hash value read from the integrated hash value storage unit 35 is used. Likewise, if the folder one level below stores document data updated by the document data update unit 60, the folder hash value generated by the folder hash value generation unit 64 is used. If the folder one level below stores only document data that has not been updated, the folder hash value read from the integrated hash value storage unit 35 is used.

FIG. 7(b) is a conceptual diagram of folder hash value generation for a folder 220a that has a lower folder. Here, the folder 220a is an example of the folder 200a. The folder 220a stores document data 140a and document data 150a, each an example of the document data 100a, and has the folder 210a as the folder one level below it. After the document hash value generation unit 63 has generated the document hash value 440a of the document data 140a and the document hash value 450a of the document data 150a by the document hash value generation method described above, the folder hash value generation unit 64 generates the folder hash value 510a of the folder 210a by the method described above for a folder with no lower folder. The folder hash value generation unit 64 then inputs the document hash value 440a, the document hash value 450a, and the folder hash value 510a to the hash function 80, and obtains a folder hash value 520a of the folder 220a corresponding to the contents of those three values.

In other words, when the document data update unit 60 has updated document data 100a, the folder hash value generation unit 64 generates, for the folder storing the updated document data 100a and for every folder above it, a hash value corresponding to the document hash values of all the document data 100a stored in that folder and to the folder hash value of the folder one level below. The folder hash value of a folder with no lower folder corresponds to the document hash values of all the document data 100a stored in that folder. The folder hash value of a folder with a lower folder corresponds to the contents of all the document data 100a stored in that folder and of all the document data 100a stored in every folder below it. Comparing folder hash values therefore makes it possible to determine whether any of the document data 100a stored in that folder, or in any folder below it, differs in content from the document data 100b. The folder hash value is thus digest data that reflects the contents of all the document data 100a stored in the folder and of all the document components 190a that make up those document data 100a.
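The two folder cases can be expressed as one recursive function, shown here as a hedged sketch rather than the patented implementation. The `Folder` class, the `doc:` and `dir:` prefixes, the sorted ordering, and SHA-256 are all assumptions, and the document hashes are placeholder strings. For brevity the sketch recomputes every lower folder, whereas the description reuses stored hashes for folders that contain no updated document.

```python
import hashlib
from dataclasses import dataclass, field

@dataclass
class Folder:
    name: str
    document_hashes: dict = field(default_factory=dict)  # document name -> document hash
    subfolders: list = field(default_factory=list)       # folders one level below

def folder_hash(folder: Folder) -> str:
    """Hash the folder's document hashes plus each direct subfolder's folder hash."""
    parts = ["doc:" + folder.document_hashes[k] for k in sorted(folder.document_hashes)]
    # Case (1): no subfolders, so only document hashes contribute.
    # Case (2): each subfolder contributes its own folder hash.
    parts += ["dir:" + folder_hash(child)
              for child in sorted(folder.subfolders, key=lambda f: f.name)]
    return hashlib.sha256("\n".join(parts).encode("ascii")).hexdigest()

# Layout of FIG. 7: folder 220a holds two documents and the lower folder 210a.
f210a = Folder("210a", {"110a": "h410a", "120a": "h420a", "130a": "h430a"})
f220a = Folder("220a", {"140a": "h440a", "150a": "h450a"}, [f210a])

before = folder_hash(f220a)
f210a.document_hashes["120a"] = "h420a-updated"  # a document in the lower folder changes
after = folder_hash(f220a)
```

Because the change in folder 210a propagates into the hash of folder 220a, a single comparison at the top folder reveals whether anything beneath it differs.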

The integrated hash value generation unit 62 then stores the document hash values and folder hash values generated as described above in the integrated hash value storage unit 35, which is provided in the hash value / integrated hash value storage unit 32. In doing so, it updates the document hash values and folder hash values already stored in the integrated hash value storage unit 35 by overwriting them with the newly generated values.

The hash value / integrated hash value transmission unit 65 operates when it receives a hash value / integrated hash value request, which the information processing apparatus 11a transmits when the user tries to view or otherwise use the document data 100b on the information processing apparatus 11a. It then transmits the hash values and integrated hash values to the information processing apparatus 11a. Specifically, it reads the folder hash value of the folder 200a, the document hash value of the document data 100a, and the hash values of the document components 190a from the hash value / integrated hash value storage unit 32 and transmits them to the information processing apparatus 11a.

When the document component transmission unit 66 receives a document component transmission request from the information processing apparatus 11a, which sends it after finishing the data discrimination process, it transmits the requested document component 190a. Specifically, it identifies the document component 190a to be transmitted from the request, reads the identified document component 190a from the document data storage unit 31, and transmits it to the information processing apparatus 11a.

Next, the specific internal functions of the information processing apparatus 11a are explained. FIG. 8 is a block diagram showing the detailed functional configuration realized when the control unit 40 of the information processing apparatus 11a executes the system program 53 and the application program 54. FIG. 8 shows only the blocks related to the following function, which is triggered, for example, when the user selects and clicks the folder 200b storing the document data 100b. To make the document data 100a and the document data 100b identical in content, the apparatus uses the hash values and integrated hash values sent from the document management apparatus 1 to determine whether any document component 190a differs in content from the document component 190b. If one exists, the apparatus requests transmission of that document component 190a and uses it to update the document component 190b that it holds. Other functions are omitted from the figure.

As shown in FIG. 8, by executing the system program 53 and the application program 54 the control unit 40 functions as a document data management unit 70, a hash value / integrated hash value request unit 71, a data discrimination unit 72, a document component request unit 73, and a data update unit 74.

The document data management unit 70 manages the document data so that the content of the document data 100b can be changed, viewed, and so on. Specifically, when a user who has read a document component 190b stored in the document data storage unit 51 changes its content using the application program 54, the document data management unit 70 transmits the changed document component 190b to the document management apparatus 1 and overwrites the unchanged document component 190b stored in the document data storage unit 51 with the changed content. The changed document component 190b transmitted to the document management apparatus 1 is processed by the document data update unit 60. In addition, when the user clicks the folder 200b via the input unit 45, for example to view the document data 100b, the hash value / integrated hash value request unit 71 provided in the document data management unit 70 operates and requests the document management apparatus 1 to transmit the folder hash value of the folder 200a, the document hash value of the document data 100a, and the hash values of the document components 190a.

The data discrimination unit 72 operates when the hash value / integrated hash value request unit 71 has requested the hash values and integrated hash values from the document management apparatus 1, and identifies any document component 190a held by the document management apparatus 1 that differs from the document component 190b. Specifically, the data discrimination unit 72 first receives the hash values and integrated hash values from the document management apparatus 1 and uses them to determine whether there is any document data 100a whose content differs from that of the document data 100b. If there is, it then identifies, among the document components 190a included in that document data 100a, every document component 190a whose content differs from that of the corresponding document component 190b. The data discrimination unit 72 also temporarily stores the hash values and integrated hash values received from the document management apparatus 1 in the memory 42.
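One way to picture this discrimination is a top-down walk that stops at any folder or document whose hash already matches. The sketch below makes several assumptions not found in the description: the nested-dictionary layout, the path strings, and the function name `find_changed` are hypothetical, and the hash values are short placeholder strings because only equality matters here.

```python
def find_changed(remote, local, path=""):
    """Return paths of components whose remote hash differs from the local one."""
    # Equal folder hashes mean nothing below this folder differs, so stop here.
    if local is not None and remote["hash"] == local["hash"]:
        return []
    changed = []
    local_docs = local["docs"] if local else {}
    for doc_name, rdoc in remote["docs"].items():
        ldoc = local_docs.get(doc_name)
        if ldoc is not None and rdoc["hash"] == ldoc["hash"]:
            continue  # document hash matches: skip its components
        lcomps = ldoc["components"] if ldoc else {}
        for comp_name, rhash in rdoc["components"].items():
            if lcomps.get(comp_name) != rhash:
                changed.append(f"{path}/{doc_name}#{comp_name}")
    local_folders = local["folders"] if local else {}
    for folder_name, rsub in remote["folders"].items():
        changed += find_changed(rsub, local_folders.get(folder_name),
                                f"{path}/{folder_name}")
    return changed

# Hashes received from the document management apparatus (placeholders).
remote = {"hash": "R-new", "docs": {"140": {"hash": "d1", "components": {"body": "b1"}}},
          "folders": {"210": {"hash": "F-new", "folders": {}, "docs": {
              "110": {"hash": "d2-new", "components": {"body": "b2-new", "thumbnail": "t2"}}}}}}
# Hashes already held by the information processing apparatus.
local = {"hash": "R-old", "docs": {"140": {"hash": "d1", "components": {"body": "b1"}}},
         "folders": {"210": {"hash": "F-old", "folders": {}, "docs": {
             "110": {"hash": "d2", "components": {"body": "b2", "thumbnail": "t2"}}}}}}

to_request = find_changed(remote, local)
```

In this example only the body of document 110 in folder 210 is reported, so only that component would be requested from the document management apparatus.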

The document component request unit 73 requests the document management apparatus 1 to transmit the document components 190a identified by the data discrimination performed by the data discrimination unit 72.

The data update unit 74 operates when the document component request unit 73 has made a transmission request, and updates the document component 190b together with the hash values and integrated hash values. Specifically, it first receives the document component 190a that the document component request unit 73 requested from the document management apparatus 1, and overwrites the content of the document component 190b stored in the document data storage unit 51 with the content of the received document component 190a. It then takes the following values stored in the hash value / integrated hash value storage unit 52: the hash value of the document component 190b before the overwrite, the document hash value of the document data 100b made up of that document component 190b, the folder hash value of the folder 200b storing that document data 100b, and the folder hash values of the folders above the folder 200b. It overwrites these with the hash values and integrated hash values that the data discrimination unit 72 temporarily stored in the memory 42.

Once the data update unit 74 has updated the document component 190b, the document data 100a in the document management apparatus 1 and the document data 100b in the information processing apparatus 11a are identical in content. The user can therefore use the information processing apparatus 11a to view document data 100b that has the same content as the document data 100a held by the document management apparatus 1. Immediately after the document component 190b is updated, the information processing apparatus 11a holds a hash value that does not correspond to the content of its document component 190b, a document hash value that does not correspond to the content of its document data 100b, and a folder hash value that does not correspond to the content of its folder 200b. Once the data update unit 74 has updated the hash values and integrated hash values, however, the information processing apparatus 11a holds hash values and integrated hash values that correspond to the contents of its document component 190b, document data 100b, and folder 200b. Consequently, even if the user clicks the folder 200b again and the data discrimination unit 72 performs data discrimination again, the hash values and integrated hash values stored in the hash value / integrated hash value storage unit 52 can be used as they are. The information processing apparatus 11a does not need to generate the hash value of its document component 190b, the document hash value of its document data 100b, or the folder hash value of its folder 200b, so a drop in the performance of the document management system can be avoided.
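The point that the client adopts the received hashes instead of recomputing them can be sketched as below. The flat dictionaries keyed by path, the key format, and the function name `apply_update` are assumptions made for illustration; SHA-256 is used only to show that the adopted component hash matches the new content.

```python
import hashlib

def apply_update(local_content, local_hashes, received_content, received_hashes):
    """Overwrite changed components and adopt the server's hashes without rehashing."""
    # Replace each stale component with the content received from the server.
    local_content.update(received_content)
    # Reuse the hashes kept in memory during discrimination (no local hashing).
    local_hashes.update(received_hashes)

key = "210/110#body"  # hypothetical path: folder / document # component
local_content = {key: b"old body"}
local_hashes = {key: "old-component-hash", "210/110": "old-doc-hash", "210": "old-folder-hash"}

# Values received from the document management apparatus.
received_content = {key: b"new body"}
received_hashes = {
    key: hashlib.sha256(b"new body").hexdigest(),
    "210/110": "new-doc-hash",   # placeholder document hash
    "210": "new-folder-hash",    # placeholder folder hash
}

apply_update(local_content, local_hashes, received_content, received_hashes)
```

After the call, the locally stored hashes describe the locally stored content again, so the next discrimination can compare stored values directly.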

In this embodiment, data discrimination is performed by the information processing apparatus 11 rather than by the document management apparatus 1, which reduces the processing load on the document management apparatus 1. The more information processing apparatuses 11 are connected to the document management apparatus 1 via the network 10 for data communication, the more data processing requests the document management apparatus 1 receives from them and the heavier its processing load becomes. The benefit of having the information processing apparatuses 11 perform data discrimination, in terms of avoiding a drop in the performance of the document management system, therefore grows with the number of connected apparatuses.

FIG. 9 is a flowchart showing an example of the processing procedure when the document management apparatus 1 receives a document component 190b from the information processing apparatus 11a. The document management apparatus 1 monitors whether it has received a document component 190b from the information processing apparatus 11a (step S101). When it has received one (YES in step S101), the document data update unit 60 operates and updates the document data 100a held by the document management apparatus 1 with the received document component 190b (step S102). The hash value generation unit 61 then operates, generates a hash value of the document component 190b received in step S101 (step S103), and uses it to update the hash value stored in the hash value storage unit 34 (step S104). Next, the integrated hash value generation unit 62 operates and generates integrated hash values from the hash value generated in step S103 (step S105). Finally, the integrated hash value generation unit 62 uses the integrated hash values generated in step S105 to update those stored in the integrated hash value storage unit 35 of the hash value / integrated hash value storage unit 32 (step S106), and the process ends.
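Steps S102 to S106 can be sketched as a single method that rehashes only the received component and then walks up the folder chain, reusing stored hashes for everything else. The `Store` class, its field names, the joining scheme, and SHA-256 are assumptions, not details taken from the description.

```python
import hashlib

def digest(parts):
    """Hash a list of hex strings into one value (assumption: SHA-256, newline-joined)."""
    return hashlib.sha256("\n".join(parts).encode("ascii")).hexdigest()

class Store:
    def __init__(self, parent):
        self.parent = parent    # folder -> parent folder, or None at the top
        self.content = {}       # (document, component) -> bytes
        self.comp_hash = {}     # (document, component) -> hash
        self.doc_hash = {}      # document -> document hash
        self.doc_folder = {}    # document -> folder that stores it
        self.folder_hash = {}   # folder -> folder hash

    def receive(self, folder, doc, component, data):
        self.doc_folder[doc] = folder
        self.content[(doc, component)] = data                                # S102
        self.comp_hash[(doc, component)] = hashlib.sha256(data).hexdigest()  # S103, S104
        # S105: document hash from stored component hashes of this document.
        self.doc_hash[doc] = digest(
            [h for (d, c), h in sorted(self.comp_hash.items()) if d == doc])
        # S105, S106: refresh this folder and every folder above it.
        while folder is not None:
            docs = sorted(d for d, f in self.doc_folder.items() if f == folder)
            kids = sorted(k for k, p in self.parent.items()
                          if p == folder and k in self.folder_hash)
            self.folder_hash[folder] = digest(
                [self.doc_hash[d] for d in docs] + [self.folder_hash[k] for k in kids])
            folder = self.parent[folder]

store = Store(parent={"210a": "220a", "220a": None})
store.receive("210a", "110a", "body", b"version 1")
store.receive("220a", "140a", "body", b"other document")
top_before, doc140_before = store.folder_hash["220a"], store.doc_hash["140a"]
store.receive("210a", "110a", "body", b"version 2")  # update arrives from a client
```

Only the hashes on the path from the changed component to the top folder are regenerated; the hash of the untouched document 140a is read back from storage.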

FIG. 10 is a flowchart illustrating an example of the processing procedure performed when the document management apparatus 1 receives a hash value / integrated hash value request from the information processing apparatus 11a. The document management apparatus 1 monitors whether a hash value / integrated hash value request has been received from the information processing apparatus 11a (step S201). When such a request is received (YES in step S201), the hash value / integrated hash value transmission unit 65 transmits the hash values and integrated hash values of the folder 200a, the document data 100a, and the document components 190a to the information processing apparatus 11a (step S202). The document management apparatus 1 then monitors whether a document component transmission request is received from the information processing apparatus 11a within a predetermined time (step S203). If a document component transmission request is received (YES in step S203), the document component transmission unit 66 identifies the document component 190a to be transmitted from the received request (step S204), transmits the document component 190a identified in step S204 to the information processing apparatus 11a (step S205), and the process ends. If no document component transmission request is received within the predetermined time (NO in step S203), the process ends without transmitting any document component 190a.

FIG. 11 is a flowchart illustrating an example of the processing procedure performed when the user clicks the folder 200b on the information processing apparatus 11a. The document data management unit 70 monitors whether the folder 200b is clicked via the input unit 45 (step S301). When the shared folder 200b is clicked (YES in step S301), the hash value / integrated hash value request unit 71 of the document data management unit 70 requests the document management apparatus 1 to transmit the hash values and integrated hash values of the folder 200a, the document data 100a, and the document components 190a (step S302). The data discrimination unit 72 then monitors whether the hash values and integrated hash values have been received from the document management apparatus 1 (step S303). When they are received (YES in step S303), the data discrimination unit 72 temporarily stores them in the memory 42 (step S304) and then performs the data discrimination process (step S305). If the data discrimination process determines that there is document data 100a whose content differs from the document data 100b (YES in step S306), the document component request unit 73 requests the document management apparatus 1 to transmit the document components 190a identified in the data discrimination process (step S307). The data update unit 74 then monitors whether the document components 190a have been received from the document management apparatus 1 (step S308). When a document component 190a is received (YES in step S308), the content of the document component 190b stored in the document data storage unit 51 is overwritten with the content of the received document component 190a (step S309). The hash values and integrated hash values stored in the hash value / integrated hash value storage unit 52 are also overwritten with the values that the data discrimination unit 72 temporarily stored in the memory 42 in step S304 (step S310), and the process ends. If the data discrimination process in step S305 determines that there is no document data 100a whose content differs from the document data 100b (NO in step S306), the process ends without updating the document components or the hash values and integrated hash values.
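The client-side sequence of FIG. 11 might look roughly like the sketch below. The names `StubServer`, `get_hashes`, `get_component`, and `sync_folder` are hypothetical, SHA-256 is an assumed hash function, and the data discrimination step is replaced here by a simple flat comparison of component hash values rather than the hierarchical procedure of FIG. 12.

```python
import hashlib


class StubServer:
    """Hypothetical in-memory stand-in for document management apparatus 1."""

    def __init__(self, components):
        self.components = dict(components)

    def get_hashes(self):
        # Flat map of component key -> hash value (SHA-256 is an assumption).
        return {k: hashlib.sha256(v).hexdigest() for k, v in self.components.items()}

    def get_component(self, key):
        return self.components[key]


def sync_folder(server, local_components, local_hashes):
    """Bring local components in line with the server; return the keys updated."""
    received = server.get_hashes()                       # S302 to S304: request and hold hashes
    changed = [k for k, h in received.items()
               if local_hashes.get(k) != h]              # S305: discrimination (flat stand-in)
    if not changed:                                      # NO in S306: nothing to update
        return []
    for key in changed:                                  # S307 to S309: fetch and overwrite
        local_components[key] = server.get_component(key)
    local_hashes.clear()                                 # S310: adopt the received hashes
    local_hashes.update(received)
    return changed
```

Only the components whose hash values differ are fetched, so an unchanged folder costs one hash exchange and no document transfer.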

FIG. 12 is a flowchart illustrating an example of the detailed procedure of the data discrimination process (step S305 in FIG. 11). First, the data discrimination unit 72 reads the hash values and integrated hash values of the folder 200a, the document data 100a, and the document components 190a stored in the memory 42, and also reads the hash values and integrated hash values of the folder 200b, the document data 100b, and the document components 190b from the hash value / integrated hash value storage unit 52 (step S401). The values stored in the memory 42 are those stored in step S304 of FIG. 11, and the values stored in the hash value / integrated hash value storage unit 52 include those updated in step S310 of FIG. 11. Next, the top-level folder among the folders 200a and the top-level folder among the folders 200b are identified (step S402), and the folder hash value of the top-level folder 200a is compared with that of the top-level folder 200b (step S403). If the folder hash values are equal (YES in step S404), the data discrimination process ends without further action. If the folder hash values differ (NO in step S404), it is determined whether the document data 100b stored in the folder 200b and the document data 100a stored in the folder 200a include any with different document hash values (step S405). If so (YES in step S405), the document hash values of the document data 100a and the document data 100b are compared (step S406). If there are document data 100a and 100b with different document hash values (YES in step S407), the hash values of the document components 190a in that document data 100a are compared with the hash values of the document components 190b in that document data 100b (step S408). The document components 190a whose hash values differ from those of the document components 190b are identified (step S409), and it is then determined whether a lower-level folder exists (step S410). If no lower-level folder exists (NO in step S410), the data discrimination process ends. On the other hand, if no document data 100a exists in the folder 200a (NO in step S405), if document data exists in the folder (YES in step S405) but none has a different document hash value (NO in step S407), or if document data with different document hash values exists and a lower-level folder also exists (YES in step S410), the folder one level below the current folder is identified (step S411), and the process from step S403 onward is repeated.
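The top-down comparison of FIG. 12 can be sketched as a recursive walk over two mirrored hash trees. This is an illustrative sketch only: the types `DocHashes` and `FolderHashes` and the function `find_changed` are hypothetical names, the two trees are assumed to have identical structure, and the recursion over several subfolders is a generalization of the single chain of lower-level folders described in the flowchart.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class DocHashes:
    doc_hash: str                       # document hash value (an integrated hash)
    components: dict[str, str]          # component name -> hash value


@dataclass
class FolderHashes:
    folder_hash: str                    # folder hash value (an integrated hash)
    documents: dict[str, DocHashes] = field(default_factory=dict)
    subfolders: dict[str, FolderHashes] = field(default_factory=dict)


def find_changed(remote: FolderHashes, local: FolderHashes, path: str = "") -> list[str]:
    """Return paths of components whose remote hash differs from the local hash."""
    if remote.folder_hash == local.folder_hash:          # S403, S404: subtree is identical
        return []
    changed: list[str] = []
    for name, r_doc in remote.documents.items():         # S405 to S407: compare document hashes
        l_doc = local.documents[name]
        if r_doc.doc_hash == l_doc.doc_hash:
            continue
        for comp, r_hash in r_doc.components.items():    # S408, S409: compare component hashes
            if l_doc.components[comp] != r_hash:
                changed.append(f"{path}{name}/{comp}")
    for name, r_sub in remote.subfolders.items():        # S410, S411: descend one level
        changed += find_changed(r_sub, local.subfolders[name], f"{path}{name}/")
    return changed
```

Because a matching folder hash value ends the walk for that folder immediately, component hash values are only examined inside documents whose document hash value already differs.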

FIG. 13, FIG. 14, and FIG. 15 are diagrams that explain the data discrimination process of FIG. 12 using specific examples. In these examples, the data holding structure of the document data storage unit 31 in the document management apparatus 1 has a top-level folder 220a, which stores document data 140a and 150a and a lower-level folder 210a. The folder 210a stores document data 110a, 120a, and 130a. The document data storage unit 51 of the information processing apparatus 11a mirrors this structure: it has a top-level folder 220b, which stores document data 140b and 150b and a lower-level folder 210b, and the folder 210b stores document data 110b, 120b, and 130b. FIG. 13, FIG. 14, and FIG. 15 each illustrate a comparison between the hash values and integrated hash values corresponding to the data holding structure of the document management apparatus 1 and those corresponding to the data holding structure of the information processing apparatus 11a.

FIG. 13 shows a case in which the content of the document data 110a differs from the content of the document data 110b. In this example, the document body data 111a differs from the document body data 111b, and the security information 114a differs from the security information 114b. The hash value of the document body data 111a therefore differs from that of the document body data 111b, and the hash value of the security information 114a differs from that of the security information 114b. Consequently, the document hash value of the document data 110a differs from that of the document data 110b, and the folder hash value of the folder 210a differs from that of the folder 210b. As a result, the folder hash value of the top-level folder 220a also differs from that of the folder 220b.

In this case, the data discrimination process first compares, based on the data holding structures of the document management apparatus 1 and the information processing apparatus 11a, the folder hash value of the top-level folder 220a with that of the folder 220b. Because these folder hash values differ, the document hash values of the document data 140a and 150a are next compared individually with those of the document data 140b and 150b. These are all equal, so the lower-level folders 210a and 210b are identified and their folder hash values are compared. These folder hash values differ, so the document hash values of the document data 110a, 120a, and 130a are compared individually with those of the document data 110b, 120b, and 130b. As a result, the document data 110a and 110b are identified as having different document hash values. The hash values of the document components 111a, 112a, 113a, and 114a constituting the document data 110a are then compared with those of the document components 111b, 112b, 113b, and 114b constituting the document data 110b. This identifies two document components held by the document management apparatus 1, the document body data 111a and the security information 114a, as differing in content from the document components held by the information processing apparatus 11a.

In this example, the document management apparatus 1 and the information processing apparatus 11a each hold five document data, each composed of four document components, so each apparatus holds 20 document components. With a conventional comparison, 20 comparisons would be needed to identify all document components with differing content. With the comparison using hash values and integrated hash values according to the present embodiment, it suffices to compare the hash value or integrated hash value of the folder 220a, the document data 140a, the document data 150a, the folder 210a, the document data 130a, the document data 120a, the document data 110a, the document body data 111a, the thumbnail 112a, the database 113a, and the security information 114a. All document components whose content differs between the document management apparatus 1 and the information processing apparatus 11a can therefore be identified with a total of 11 comparisons.
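The two counts for the FIG. 13 example can be checked with simple arithmetic. The symbols $N_{\text{flat}}$ and $N_{\text{hier}}$ are introduced here only for illustration: the first counts one comparison per document component, and the second counts the folders, documents, and components actually visited by the hierarchical comparison.

```latex
\begin{align*}
N_{\text{flat}} &= 5 \text{ documents} \times 4 \text{ components} = 20,\\
N_{\text{hier}} &= \underbrace{2}_{\text{folders } 220,\ 210}
                 + \underbrace{5}_{\text{documents } 140,\ 150,\ 130,\ 120,\ 110}
                 + \underbrace{4}_{\text{components } 111 \text{ to } 114}
                 = 11.
\end{align*}
```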

Next, FIG. 14 shows a case in which the content of the thumbnail 107a, a document component of the document data 140a, differs from that of the thumbnail 107b. In this example, the hash value of the thumbnail 107a differs from that of the thumbnail 107b, and accordingly the document hash value of the document data 140a also differs from that of the document data 140b.

In this case, the data discrimination process first compares, based on the data holding structures of the document management apparatus 1 and the information processing apparatus 11a, the folder hash value of the top-level folder 220a with that of the folder 220b. Because these folder hash values differ, the document hash values of the document data 140a and 150a are next compared individually with those of the document data 140b and 150b. The document hash value of the document data 140a is found to differ from that of the document data 140b. The hash values of the document components 106a, 107a, 108a, and 109a constituting the document data 140a are then compared with those of the document components 106b, 107b, 108b, and 109b constituting the document data 140b. This identifies one document component held by the document management apparatus 1, the thumbnail 107a, as differing in content from the document component held by the information processing apparatus 11a.

In this example, the comparison using hash values and integrated hash values only needs to cover, as described above, the folder 220a, the document data 140a, the document data 150a, the folder 210a, the document body data 106a, the thumbnail 107a, the database 108a, and the security information 109a. All document components whose content differs between the document management apparatus 1 and the information processing apparatus 11a can therefore be identified with a total of 8 comparisons.

Next, FIG. 15 shows a case in which the contents of all document data are equal. In this case, the hash values and integrated hash values of the folders, document data, and document components held by the document management apparatus 1 and the information processing apparatus 11a are all equal. When the folder hash value of the folder 220a is compared with that of the folder 220b, the two are equal, so it can be determined at that point that no document component with differing content exists. The number of comparisons in this case is therefore one, and the data discrimination process finishes efficiently.

As described above, in the present embodiment, hash values and integrated hash values are used both to determine whether the document management apparatus 1 and the information processing apparatus 11 hold document data with differing content and to identify the document components whose content differs. This allows the determination and identification to be performed with fewer comparisons than before, so a decline in the performance of the document management system can be avoided. After the comparison, the document components with differing content are transmitted from the document management apparatus 1 to the information processing apparatus 11 based on the comparison result, and the information processing apparatus 11 updates those document components. The user can then use, on the information processing apparatus 11, document data with the same content as the document data held by the document management apparatus 1.

(Modifications)
Several embodiments of the present invention have been described above, but the present invention is not limited to the content described, and various modifications are applicable. Some modifications are given below.

For example, in the above embodiment, as shown in particular in FIG. 11, the information processing apparatus 11 requests the document management apparatus 1 to transmit the hash values and integrated hash values when the user selects and clicks a folder. The request is not limited to this timing: it may be made periodically, or when the information processing apparatus 11 is powered on. Nor is the trigger limited to a folder click; the request may be made when, for example, document data is selected, or in response to some other input operation.

In the above embodiment, the hash function that generates the hash values, the hash function that generates the document hash values, and the hash function that generates the folder hash values are the same hash function, but the present invention is not limited to this.

In the above embodiment, the hash values and integrated hash values are generated by the document management apparatus 1, but they may instead be generated by the information processing apparatus 11, or by both the document management apparatus 1 and the information processing apparatus 11.

In the above embodiment, the data discrimination process that determines whether the document management apparatus 1 and the information processing apparatus 11 hold document data with differing content is performed by the information processing apparatus 11, but the present invention is not limited to this. For example, the data discrimination process described above (FIG. 12) may be performed by the document management apparatus 1. In that case, however, the document management apparatus 1 must request the information processing apparatus 11 to transmit its hash values and integrated hash values, so as to obtain from the information processing apparatus 11 the hash values and integrated hash values corresponding to its data holding structure.

In the above embodiment, both the hash values and the integrated hash values are transmitted together from the document management apparatus 1 to the information processing apparatus 11 when the data discrimination process is performed, but the present invention is not limited to this. The integrated hash values and hash values may instead be transmitted in stages: the folder hash value of the top-level folder is transmitted first, and only if it is determined to differ are the document hash values, the folder hash values of lower-level folders, and so on transmitted next. This has the advantage of reducing the traffic on the network 10 when hash values are transmitted and received.
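The traffic saving of this staged transmission can be illustrated with a small sketch. Everything named here is hypothetical: `Node` models a folder, document, or component as a tree node carrying one hash value, `count_all` counts the hash values sent in a bulk transfer, and `stepwise_sync` counts the hash values sent when each level is requested only after its parent's hash value has been found to differ. The two trees are assumed to have identical structure.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Node:
    hash: str                                     # hash value or integrated hash value
    children: dict[str, Node] = field(default_factory=dict)


def count_all(node: Node) -> int:
    """Number of hash values sent when the whole tree is transmitted at once."""
    return 1 + sum(count_all(child) for child in node.children.values())


def stepwise_sync(remote: Node, local: Node) -> tuple[list[str], int]:
    """Return (paths of differing leaves, number of hash values transmitted)."""
    sent = 1                                      # the top-level folder hash is sent first
    changed: list[str] = []

    def descend(r: Node, l: Node, path: str) -> None:
        nonlocal sent
        if r.hash == l.hash:                      # equal: nothing below needs to be sent
            return
        if not r.children:                        # differing leaf: a document component
            changed.append(path)
            return
        sent += len(r.children)                   # request the next level only now
        for name, r_child in r.children.items():
            descend(r_child, l.children[name], f"{path}/{name}" if path else name)

    descend(remote, local, "")
    return changed, sent
```

With a structure like that of FIG. 14 (two folders, five documents, twenty components, one differing thumbnail), this model sends 8 hash values instead of all 27, and only 1 when nothing differs.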

DESCRIPTION OF SYMBOLS
1 Document management apparatus
11 Information processing apparatus
20, 40 Control unit
30 Storage device (document data storage means)
50 Storage device (data storage means)
61 Hash value generation unit (digest data generation means)
62 Integrated hash value generation unit (integrated digest data generation means)
72 Data discrimination unit (data discrimination means)
100, 100a, 100b Document data
101 Document body data (document component)
102 Thumbnail (document component)
103 Database (document component)
104 Security information (document component)
190 Document component
200, 200a, 200b Folder (storage area, storage unit)

Claims

A document management apparatus comprising document data storage means for storing a plurality of document data in a predetermined storage unit, the document management apparatus performing data communication with an information processing apparatus connected via a network and managing the plurality of document data stored in the storage unit so as to maintain identity with a plurality of document data stored in a predetermined storage area of the information processing apparatus, the document management apparatus further comprising:
integrated digest data generation means for generating document digest data corresponding to the content of each document data stored in the storage unit, and for generating, based on the plurality of document digest data generated from the plurality of document data stored in the storage unit, composite digest data corresponding to the contents of the plurality of document digest data; and
data discrimination means for determining, by comparing the composite digest data generated by the integrated digest data generation means, whether the plurality of document data stored in the storage unit and the plurality of document data stored in the predetermined storage area include document data whose contents differ.

The document management apparatus according to claim 1, wherein the data discrimination means identifies, by comparing the document digest data generated by the integrated digest data generation means, document data among the plurality of document data stored in the storage unit whose content differs from the document data stored in the predetermined storage area.

The document management apparatus according to claim 1 or 2, wherein each document data stored in the storage unit is composed of a plurality of document components,
the document management apparatus further comprises digest data generation means for generating digest data corresponding to the content of each document component, and
the integrated digest data generation means generates the document digest data corresponding to the content of each document data based on the plurality of digest data generated from the plurality of document components constituting that document data stored in the storage unit.

The document management apparatus according to claim 3, wherein the data discrimination means identifies, by comparing the digest data generated by the digest data generation means, document components among the plurality of document components constituting the document data stored in the storage unit whose content differs from the document components constituting the document data stored in the predetermined storage area.

A document management apparatus comprising document data storage means for storing a plurality of document data in a predetermined storage unit, the document management apparatus performing data communication with an information processing apparatus connected via a network and managing the plurality of document data stored in the storage unit so as to maintain identity with a plurality of document data stored in a predetermined storage area of the information processing apparatus, the document management apparatus further comprising:
integrated digest data generation means for generating document digest data corresponding to the content of each document data stored in the storage unit, and for generating, based on the plurality of document digest data generated from the plurality of document data stored in the storage unit, composite digest data corresponding to the contents of the plurality of document digest data; and
data transmission means for transmitting the composite digest data generated by the integrated digest data generation means to the information processing apparatus, so that whether the plurality of document data stored in the storage unit and the plurality of document data stored in the predetermined storage area include document data whose contents differ is determined by comparing the composite digest data.

An information processing apparatus connected via a network to the document management apparatus according to claim 5 so as to be capable of data communication, the information processing apparatus comprising:
data storage means for storing a plurality of document data in a predetermined storage area; and
data discrimination means for determining, by comparing the composite digest data generated in the document management apparatus, whether the plurality of document data stored in the predetermined storage unit of the document management apparatus and the plurality of document data stored in the predetermined storage area include document data whose contents differ.

A document management system in which an information processing apparatus and a document management apparatus are connected via a network so as to be capable of data communication with each other, and which manages a plurality of document data held by the information processing apparatus and the document management apparatus so that their identity is maintained, wherein
The document management apparatus includes:
Document data storage means for storing a plurality of document data in a predetermined storage unit; and
Integrated digest data generation means for generating document digest data corresponding to the content of each document data stored in the storage unit and, based on the plurality of document digest data generated from the respective document data stored in the storage unit, generating composite digest data corresponding to the contents of the plurality of document digest data,
The information processing apparatus includes:
Data storage means for storing a plurality of document data in a predetermined storage area, and
At least one of the document management apparatus and the information processing apparatus includes:
Data discriminating means for determining, by comparing the composite digest data generated by the integrated digest data generation means, whether any content differs between the plurality of document data stored in the storage unit and the plurality of document data stored in the storage area.
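
The determination performed by the data discriminating means can be sketched as follows. This is an illustrative sketch under stated assumptions, not the claimed implementation: SHA-256 is assumed as the digest function, both sides are assumed to use the same document identifiers, and the fallback to per-document digests when the composite digests differ is an assumption suggested by the abstract. The names `digests_of`, `composite_of`, and `find_differences` are hypothetical. For brevity the function receives both document sets directly; in a networked system only the digests would be exchanged.

```python
import hashlib


def digests_of(documents: dict[str, bytes]) -> dict[str, str]:
    """Per-document digests (assumption: SHA-256, hex encoded)."""
    return {doc_id: hashlib.sha256(data).hexdigest()
            for doc_id, data in documents.items()}


def composite_of(digests: dict[str, str]) -> str:
    """Composite digest over the per-document digests, in identifier order."""
    joined = "".join(digests[doc_id] for doc_id in sorted(digests))
    return hashlib.sha256(joined.encode("ascii")).hexdigest()


def find_differences(storage_unit: dict[str, bytes],
                     storage_area: dict[str, bytes]) -> list[str]:
    """Return identifiers of documents whose content differs.

    The composite digests are compared first; per-document digests are
    examined only when the composite digests do not match.
    """
    unit_digests = digests_of(storage_unit)
    area_digests = digests_of(storage_area)
    if composite_of(unit_digests) == composite_of(area_digests):
        return []  # one comparison is enough when nothing has changed
    all_ids = set(unit_digests) | set(area_digests)
    return sorted(doc_id for doc_id in all_ids
                  if unit_digests.get(doc_id) != area_digests.get(doc_id))
```

In the common case where nothing has changed, the check costs a single comparison of composite digests, which matches the efficiency goal stated in the abstract.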

A document management program executed by a document management apparatus that has document data storage means for storing a plurality of document data in a predetermined storage unit and that performs data communication with an information processing apparatus connected via a network, the program serving to maintain identity between the plurality of document data stored in the storage unit and a plurality of document data stored in a predetermined storage area of the information processing apparatus, and causing the document management apparatus to execute the steps of:
Generating document digest data corresponding to the content of each document data stored in the storage unit;
Generating composite digest data corresponding to the contents of the plurality of document digest data, based on the plurality of document digest data generated from the respective document data stored in the storage unit; and
Determining, by comparing the composite digest data, whether any content differs between the plurality of document data stored in the storage unit and the plurality of document data stored in the predetermined storage area.

A document management program executed by a document management apparatus that has document data storage means for storing a plurality of document data in a predetermined storage unit and that performs data communication with an information processing apparatus connected via a network, the program serving to maintain identity between the plurality of document data stored in the storage unit and a plurality of document data stored in a predetermined storage area of the information processing apparatus, and causing the document management apparatus to execute the steps of:
Generating document digest data corresponding to the content of each document data stored in the storage unit;
Generating composite digest data corresponding to the contents of the plurality of document digest data, based on the plurality of document digest data generated from the respective document data stored in the storage unit; and
Transmitting the composite digest data to the information processing apparatus, so that whether any content differs between the plurality of document data stored in the storage unit and the plurality of document data stored in the predetermined storage area can be determined by comparing the composite digest data.