JP2006074514A

JP2006074514A - Image editing device, image reproducing device, file database, file distributing server, image editing method, image editing program, image reproducing method, image reproducing program, and computer readable storage medium

Info

Publication number: JP2006074514A
Application number: JP2004256191A
Authority: JP
Inventors: Mitsuo Ishii; 光雄石井
Original assignee: Hiroshima University NUC
Current assignee: Hiroshima University NUC
Priority date: 2004-09-02
Filing date: 2004-09-02
Publication date: 2006-03-16

Abstract

<P>PROBLEM TO BE SOLVED: To effectively transmit content of a lecture etc. recorded in a video file to a user. <P>SOLUTION: A device comprises a speech recognition unit 5 for converting audio data to a text file after recognition, and a file producing unit 10 for reproducing LIVE textbook relating the text file to a video file based on time information indicating which time content of the text file corresponding to the video file. <P>COPYRIGHT: (C)2006,JPO&NCIPI

Description

本発明は、講義等を録画したビデオファイルの内容を効率的にユーザに伝えるための映像編集装置等に関する。 The present invention relates to a video editing apparatus or the like for efficiently transmitting to a user the contents of a video file that records a lecture or the like.

大学の講義を録画してそれを学生にネットで配信したりメディアで貸し出したり、社会人に有料で提供したりするという活動は国内外の多くの大学ではるか以前から行われている。そして、遠隔講義やＥ−ラーニングの導入が進む中、情報メディア機器やネットワーク環境が整備されてくるにつれ、大学の講義の録画や配信が低コストで容易に行えるようになったため、講義録画の有効利用は大学のみが持つ知的財産を活用したサービスとしてますます重要になってくると考えられる（非特許文献１〜４参照）。 Activities such as recording university lectures and distributing them online to students, renting them through the media, and providing them to society for a fee have been conducted at many universities in Japan and abroad for a long time. As remote lectures and e-learning have been introduced, the recording and distribution of university lectures has become easier and cheaper as information media equipment and network environments have been improved. Use is considered to become increasingly important as a service that utilizes intellectual property possessed only by universities (see Non-Patent Documents 1 to 4).

一般的に９０分の講義は、やはり９０分かけてその講義の録画物（ビデオ、ＤＶＤ等の媒体）を観ないと講義全体を理解することはできない。しかしこれでは、観たい所を探すための労力が大きく、試験の前に講義を復習しようという学生にとっては魅力的ではない。復習が魅力的であるためには、すくなくとも講義の全体構成をすばやく把握して、観たい場面を直接再生できるようなコンテンツの構成と効率的な操作機能が必要である。 In general, a 90-minute lecture cannot be understood without taking a 90-minute video recording (video, DVD, etc.). However, this is not very attractive for students who want to review the lecture before the exam because of the great effort to find the places they want to see. In order for review to be attractive, it is necessary to have a content structure and an efficient operation function that can quickly grasp the overall structure of the lecture and play back the scene you want to watch directly.

そのためには、講義の録画内容の構造化を行うことが必須である。録画内容の構造化の技術では多くの提案がなされているが、大きく分けると映像認識と音声認識とによるインデキシング技術を挙げることができる。このインデキシング技術では、たとえば、シーンが変化する、顔がある、赤い色が多い、資料の画面であるとかを講義の録画内容から認識して、検索キーを作成することができる。
岩沢和男、石井光雄、岩田則和、小西克巳、「デジタルコンテンツ研究基盤としての公開講座アーカイブ」、コンピュータと教育研究報告, No.73, p47-54,2004 「講義映像のデジタル化と配信を行う教育支援システム−導入を想定したテストから見る問題と展望−」、コンピュータと教育研究報告, No.71, p35-42,2003 藤倉寛史他、「ビデオアーカイブを利用した学習を支援するシステムの提案」、音声言語情報処理研究報告、No040-001, 2001 梶原正宏、向日良夫、日野文男、高取和彦、「板書と音声を電子化した簡便な薬学授業アーカイブスの構築」、[online］、2003、［平成１６年７月２７日検索］、インターネット＜URL：https://www.shijokyo.or.jp/LINK/journal/0304/06_01.html＞ For that purpose, it is essential to structure the content of lecture recordings. Many proposals have been made on the technology for structuring the contents of recording, but broadly speaking, an indexing technology based on video recognition and voice recognition can be mentioned. With this indexing technology, for example, it is possible to create a search key by recognizing from a recorded content of a lecture that the scene changes, there is a face, there are many red colors, and it is a document screen.
Kazuo Iwasawa, Mitsuo Ishii, Norikazu Iwata, Katsumi Konishi, “Public Lecture Archive as Digital Content Research Base”, Computer and Educational Research Report, No.73, p47-54,2004 "Education support system that digitizes and distributes lecture videos-Problems and prospects seen from the test assuming introduction", Computer and Educational Research Report, No.71, p35-42,2003 Hirofumi Fujikura et al., "Proposal of a learning support system using video archives", Spoken Language Information Processing Research Report, No040-001, 2001 Masahiro Sakakibara, Yoshio Himawari, Fumio Hino, Kazuhiko Takatori, “Construction of simple pharmacy class archives by digitizing board and audio”, [online], 2003, [searched July 27, 2004], Internet <URL : Http:https://www.shijokyo.or.jp/LINK/journal/0304/06_01.html>

しかしながら、従来の検索キーは、講義が録画された画像の特徴を用いて作成されたものであり、講義の内容を密接に表すものではないので、講義の内容を録画物を用いて復習するための検索キーとしては有効ではない。すなわち、画像特徴による検索キーは、講義の内容とは関係がないので、講義の内容を理解するための補助としては使えない。 However, the conventional search key was created using the characteristics of the recorded video of the lecture, and does not closely represent the content of the lecture, so that the content of the lecture can be reviewed using recorded materials. Is not a valid search key. That is, the search key based on the image feature is not related to the content of the lecture, and cannot be used as an aid for understanding the content of the lecture.

本発明は、上記従来の問題点に鑑みなされたものであって、ビデオファイルに記録された講義等の内容を効率的にユーザに伝え得る映像編集装置、映像再生装置、ファイルデータベース、ファイル配信サーバ、映像編集方法、映像編集プログラム、映像再生方法、映像再生プログラム、およびコンピュータ読み取り可能な記録媒体を提供することを目的とする。 The present invention has been made in view of the above-described conventional problems, and is a video editing device, a video playback device, a file database, and a file distribution server that can efficiently transmit the contents of a lecture or the like recorded in a video file to a user. An object is to provide a video editing method, a video editing program, a video playback method, a video playback program, and a computer-readable recording medium.

本発明の映像編集装置は、上記課題を解決するために、動画像データおよび音声データを記録したビデオファイルを編集するための映像編集装置であって、上記音声データを認識してテキストファイルに変換する音声認識手段と、上記テキストファイルと上記ビデオファイルとを、そのテキストファイルが上記ビデオファイルにおいてどの時刻の内容であるかを示す時刻情報に基づき関連づけるファイル関連付け手段と、を備えていることを特徴としている。 In order to solve the above problems, the video editing apparatus of the present invention is a video editing apparatus for editing a video file in which moving image data and audio data are recorded, and recognizes the audio data and converts it into a text file. Voice recognizing means, and a file associating means for associating the text file and the video file based on time information indicating the time of the text file in the video file. It is said.

また、本発明の映像編集方法は、動画像データおよび音声データを記録したビデオファイルを編集するための映像編集方法であって、上記音声データを認識してテキストファイルに変換する第１ステップと、上記テキストファイルと上記ビデオファイルとを、そのテキストファイルが上記ビデオファイルにおいてどの時刻の内容であるかを示す時刻情報に基づき関連づける第２ステップと、を備えていることを特徴としている。 The video editing method of the present invention is a video editing method for editing a video file in which moving image data and audio data are recorded, the first step of recognizing the audio data and converting it into a text file; A second step of associating the text file with the video file based on time information indicating at what time the text file is in the video file.

上記構成によれば、音声認識手段または第１ステップにより変換されたテキストファイルにより、ビデオファイルの内容を外部機器で表示することができるので、ビデオファイルの内容を文章化して効率的にユーザに伝えることができる。 According to the above configuration, the content of the video file can be displayed on the external device by the voice recognition means or the text file converted by the first step, so the content of the video file is converted into text and efficiently transmitted to the user. be able to.

さらに、ファイル関連付け手段または第２ステップによりビデオファイルとテキストファイルとが関連付けられているので、ビデオファイルとテキストファイルとを同期させて外部機器で表示することができる。つまり、ビデオファイルとテキストファイルとを互いに関連させた状態で外部機器に表示することができ、ビデオファイルの内容をより効率的にユーザに伝えることができる。 Furthermore, since the video file and the text file are associated by the file association means or the second step, the video file and the text file can be synchronized and displayed on the external device. That is, the video file and the text file can be displayed on the external device in a state of being associated with each other, and the contents of the video file can be transmitted to the user more efficiently.

さらに、上記構成の映像編集装置は、上記ビデオファイルにおいて場面が変化する箇所であるシーンチェンジを検出するシーンチェンジ検出手段と、上記シーンチェンジの時刻に基づき、上記テキストファイルの内容を項目分けするためのタイトル名を、各項目について作成するタイトル作成手段と、を備えていることが好ましい。 Furthermore, the video editing apparatus having the above-described configuration is adapted to categorize the contents of the text file based on scene change detection means for detecting a scene change, which is a location where a scene changes in the video file, and the time of the scene change. It is preferable to include a title creating means for creating a title name for each item.

上記構成によれば、シーンチェンジ検出手段により検出されたシーンチェンジの時刻に基づいて、タイトル作成手段によりテキストファイルに関するタイトル名が作成される。したがって、このタイトル名を外部機器にて表示することにより、ユーザはテキストファイルの内容を大まかに把握することができる。それゆえ、ビデオファイルの内容をより効率的にユーザに伝えることができる。 According to the above configuration, the title name for the text file is created by the title creation unit based on the scene change time detected by the scene change detection unit. Therefore, by displaying the title name on the external device, the user can roughly grasp the contents of the text file. Therefore, the contents of the video file can be transmitted to the user more efficiently.

さらに、上記構成の映像編集装置は、上記シーンチェンジ検出手段により複数のシーンチェンジが上記ビデオファイルから検出される場合、上記シーンチェンジ同士の時間間隔に基づき、上記テキストファイルを分ける項目の数を調整する項目数調整手段をさらに備えていることが好ましい。 Furthermore, the video editing apparatus having the above configuration adjusts the number of items for dividing the text file based on the time interval between the scene changes when a plurality of scene changes are detected from the video file by the scene change detecting means. It is preferable to further include an item number adjusting means.

すなわち、テキストファイルに関して作成されるタイトル名が多くなってしまうと、かえってユーザにビデオファイルの内容が伝わりにくくなってしまう場合がある。そこで、上記構成の映像編集装置では、特に項目数調整手段を備えている。 That is, if the title names created for the text file increase, the content of the video file may be difficult to convey to the user. In view of this, the video editing apparatus having the above-described configuration particularly includes item number adjusting means.

この項目数調整手段によれば、シーンチェンジ同士の時間間隔が長いものを、テキストファイルの項目数として妥当な数だけ選び出し、そのシーンチェンジの時刻に基づきタイトル名を作成することができる。これにより、項目数を適宜調整して、ビデオファイルの内容がより効率的にユーザに伝わるようにすることができる。 According to this item number adjusting means, it is possible to select an appropriate number of items in the text file that have a long time interval between scene changes and create a title name based on the time of the scene change. Thereby, the number of items can be adjusted as appropriate so that the content of the video file can be transmitted to the user more efficiently.

さらに、上記構成の映像編集装置は、上記ファイル関連付け手段が、さらに、上記ビデオファイルに記録された動画像内で配布された資料のデータと、上記テキストファイルおよび上記ビデオファイルの少なくとも一方とを関連付けるものであることが好ましい。 Further, in the video editing apparatus having the above-described configuration, the file association unit further associates the data of the material distributed in the moving image recorded in the video file with at least one of the text file and the video file. It is preferable.

上記構成によれば、ファイル関連付け手段により、テキストファイルおよびビデオファイルの少なくとも一方と配布資料のデータとが関連付けられる。この関連付けは、たとえばその資料が配布された時刻を示す時刻情報に基づき行えばよい。これにより、テキストファイルおよびビデオファイルを外部機器にて表示する場合に、これらのファイルに配布資料のデータを関連付けて表示することができる。 According to the above configuration, at least one of the text file and the video file is associated with the data of the distribution material by the file association unit. This association may be performed based on time information indicating the time when the material is distributed, for example. As a result, when the text file and the video file are displayed on the external device, the data of the distribution material can be displayed in association with these files.

よって、ユーザは、ビデオファイルを見る際に配布資料のデータを確認することができるので、ビデオファイルの内容をより効率的に理解することができる。 Therefore, since the user can check the data of the distribution material when viewing the video file, the user can understand the contents of the video file more efficiently.

また、本発明の映像再生装置は、上記課題を解決するため、動画像データおよび音声データを記録したビデオファイルを再生する映像再生装置であって、上記音声データが変換されたテキストファイルであって、そのテキストファイルが上記ビデオファイルにおいてどの時刻の内容であるかを示す時刻情報に基づき関連付けられたテキストファイルを、上記ビデオファイルと同時に表示することを特徴としている。 In order to solve the above problems, the video playback device of the present invention is a video playback device that plays back a video file in which moving image data and audio data are recorded, and is a text file into which the audio data has been converted. The text file associated with the time information indicating the time content in the video file is displayed simultaneously with the video file.

また、本発明の映像再生方法は、上記課題を解決するために、動画像データおよび音声データを記録したビデオファイルを再生する映像再生方法であって、上記音声データが変換されたテキストファイルであって、そのテキストファイルが上記ビデオファイルにおいてどの時刻の内容であるかを示す時刻情報に基づき関連付けられたテキストファイルを、上記ビデオファイルと同時に表示することを特徴としている。 In addition, the video playback method of the present invention is a video playback method for playing back a video file in which moving image data and audio data are recorded in order to solve the above problems, and is a text file in which the audio data is converted. The text file associated with the time information indicating the time content in the video file is displayed simultaneously with the video file.

上記構成によれば、ビデオファイルの音声データが変換されたテキストファイルと、ビデオファイルとが同時に表示される。したがって、ユーザは、テキストファイルにより文章化されたビデオファイルの内容と、ビデオファイルの動画像および音声とを同時に得ることができるので、ビデオファイルの内容をより効率的に理解することができる。 According to the above configuration, the text file obtained by converting the audio data of the video file and the video file are displayed simultaneously. Therefore, the user can simultaneously obtain the content of the video file documented by the text file and the moving image and sound of the video file, so that the content of the video file can be understood more efficiently.

さらに、上記構成の映像再生装置は、上記ビデオファイルに記録された動画像内で現在話されている内容に対応する上記テキストファイルの一部を、強調表示することが好ましい。 Furthermore, it is preferable that the video reproducing apparatus having the above configuration highlights a part of the text file corresponding to the content currently spoken in the moving image recorded in the video file.

上記構成によれば、ユーザは、ビデオファイルに記録された動画像で現在話されている内容を、テキストファイルの強調表示された部分を視認することにより文章として理解することができるので、より効率的にビデオファイルの内容を理解することができる。 According to the above configuration, the user can understand the content currently spoken in the moving image recorded in the video file as a sentence by visually recognizing the highlighted portion of the text file. Can understand the contents of video files.

さらに、上記構成の映像再生装置は、上記表示されているテキストファイルのうち、ユーザに選択された部分に対応する箇所から上記ビデオファイルを再生することが好ましい。 Furthermore, it is preferable that the video reproducing apparatus having the above configuration reproduces the video file from a portion corresponding to a portion selected by the user in the displayed text file.

さらに、上記構成の映像再生装置は、上記テキストファイルの内容を項目分けするためのタイトル名を複数表示するとともに、上記複数のタイトル名のうち、ユーザに選択されたタイトル名に対応する箇所から上記ビデオファイルを再生することが好ましい。 Furthermore, the video playback device having the above configuration displays a plurality of title names for categorizing the contents of the text file, and from the locations corresponding to the title name selected by the user among the plurality of title names. It is preferable to play a video file.

さらに、上記構成の映像再生装置は、上記ビデオファイルにおいて場面が変化する箇所に対応するサムネイル画像を複数並べて表示するとともに、上記複数のサムネイル画像のうち、ユーザに選択されたサムネイル画像に対応する箇所から上記ビデオファイルを再生することが好ましい。 Furthermore, the video playback device having the above configuration displays a plurality of thumbnail images corresponding to the location where the scene changes in the video file, and also includes a location corresponding to the thumbnail image selected by the user among the plurality of thumbnail images. It is preferable to play the video file from.

上記構成によれば、ユーザは、テキストファイルの一部、タイトル名、またはサムネイル画像を選択すれば、その部分に対応する箇所からビデオファイルを再生することができる。さらに、ユーザは、テキストファイルの一部、タイトル名、またはサムネイル画像を選択する際に、テキストファイルの内容に基づきこれから再生されるビデオファイルの内容を確認できるので、ビデオファイルにおいて自分が見たいと思う箇所を的確に選択できる。 According to the above configuration, if a user selects a part of a text file, a title name, or a thumbnail image, the user can play a video file from a location corresponding to that part. Furthermore, when a user selects a part of a text file, a title name, or a thumbnail image, the user can check the content of the video file to be played based on the content of the text file. You can select exactly where you want.

さらに、上記構成の映像再生装置は、上記ビデオファイルに記録された動画像内で配布された資料のデータへのリンクを、上記テキストファイルまたは上記ビデオファイルの少なくとも一方と関連付けて表示することが好ましい。 Furthermore, it is preferable that the video reproduction apparatus having the above configuration displays a link to data of a material distributed in a moving image recorded in the video file in association with at least one of the text file or the video file. .

上記構成によれば、配布資料のデータへのリンクが、テキストファイルまたはビデオファイルと関連付けて表示されるので、ユーザは、これらのファイルが表示されている際に上記リンクを適宜選択することにより、配布資料を参照することができる。これにより、ユーザは、より効率的にビデオファイルの内容を理解することができる。 According to the above configuration, since the link to the data of the distribution material is displayed in association with the text file or the video file, the user appropriately selects the link when these files are displayed, You can refer to handouts. Thereby, the user can understand the content of the video file more efficiently.

さらに、本発明のファイルデータベースは、上記課題を解決するために、動画像データおよび音声データを記録したビデオファイルの音声データが変換されたテキストファイルと、そのビデオファイルを再生するためのビデオファイル再生用ファイルとが格納されているファイルデータベースであって、上記ビデオファイル再生用ファイルは、上記テキストファイルを、そのテキストファイルが上記ビデオファイルにおいてどの時刻の内容であるかを示す時刻情報に基づき上記ビデオファイルと関連付けるものであり、上記ビデオファイル再生用ファイルを外部機器が取得することにより、上記テキストファイルを上記ビデオファイルと関連付けて上記外部機器で再生するためのものであることを特徴としている。 Furthermore, in order to solve the above problems, the file database of the present invention is a text file obtained by converting audio data of a video file in which moving image data and audio data are recorded, and video file reproduction for reproducing the video file. The video file playback file includes the text file and the video file based on time information indicating at what time the text file is in the video file. The video file playback file is associated with a file, and when the external device obtains the video file playback file, the text file is associated with the video file and played back on the external device.

上記構成によれば、ビデオファイル再生用ファイルによりテキストファイルとビデオファイルとが関連付けられているので、このビデオファイル再生用ファイルをファイル配信サーバを用いて外部に配信することにより、外部機器にてビデオファイルとテキストファイルとを関連付けて再生することができる。したがって、ユーザは、テキストファイルにより文章化されたビデオファイルの内容と、ビデオファイルの動画像および音声とを同時に得ることができるので、ビデオファイルの内容をより効率的に理解することができる。 According to the above configuration, since the text file and the video file are associated with each other by the video file playback file, by distributing the video file playback file to the outside using the file distribution server, the video is played on the external device. A file and a text file can be linked and played. Therefore, the user can simultaneously obtain the content of the video file documented by the text file and the moving image and sound of the video file, so that the content of the video file can be understood more efficiently.

なお、コンピュータに上記映像編集方法を実行させる映像編集プログラムにより、コンピュータを用いて本発明の映像編集方法と同様の作用効果を得ることができる。さらに、上記映像編集プログラムをコンピュータ読み取り可能な記録媒体に記憶させることにより、任意のコンピュータ上で上記映像編集プログラムを実行させることができる。 Note that the same effects as those of the video editing method of the present invention can be obtained using a computer by a video editing program that causes a computer to execute the video editing method. Furthermore, the video editing program can be executed on an arbitrary computer by storing the video editing program in a computer-readable recording medium.

また、コンピュータに上記映像再生方法を実行させる映像再生プログラムにより、コンピュータを用いて本発明の映像再生方法と同様の作用効果を得ることができる。さらに、上記映像再生プログラムをコンピュータ読み取り可能な記録媒体に記憶させることにより、任意のコンピュータ上で上記映像再生プログラムを実行させることができる。 Further, the same effect as the video playback method of the present invention can be obtained using a computer by a video playback program that causes a computer to execute the video playback method. Furthermore, the video reproduction program can be executed on an arbitrary computer by storing the video reproduction program in a computer-readable recording medium.

本発明によれば、テキストファイルによりビデオファイルの内容を文章化してユーザに伝えることができ、さらにビデオファイルとテキストファイルとを互いに関連させた状態で表示することができるので、ビデオファイルの内容をより効率的にユーザに伝えることができる。 According to the present invention, the content of the video file can be documented and transmitted to the user by the text file, and the video file and the text file can be displayed in a state of being associated with each other. It can be communicated to the user more efficiently.

〔１．システム全体の構成〕
本発明の映像編集装置の一実施形態を用いるシステムの構成について、図２を用いて説明する。 [1. Overall system configuration)
The configuration of a system that uses an embodiment of the video editing apparatus of the present invention will be described with reference to FIG.

図２に示すように、本実施形態のＬＩＶＥ教科書データファイル配信システムでは、ＬＩＶＥ教科書作成装置（映像編集装置）１が、ネットワークを介して、ビデオファイル配信サーバ２、および複数の学習端末（映像再生装置）３…に接続されている。なお、ＬＩＶＥ教科書データファイルとは、講義内容を録画したビデオファイルが、後述する手順にて構造化されることにより、ユーザが講義を復習しやすくなるように構成されたファイルを意味する。 As shown in FIG. 2, in the LIVE textbook data file distribution system of this embodiment, a LIVE textbook creation device (video editing device) 1 is connected to a video file distribution server 2 and a plurality of learning terminals (video playback) via a network. Device) 3... The LIVE textbook data file means a file configured so that a user can easily review a lecture by structuring a video file in which the contents of the lecture are recorded according to a procedure described later.

ビデオファイル配信サーバ２は、ビデオファイルデータベース２ａに格納されたＭＰＥＧ形式やＷＭＶ形式のビデオファイルを、ネットワーク経由で外部に配信するものである。このビデオファイルの配信に用いられるネットワークは、インターネットであってもよいし、ＬＡＮ（Local Area Network）であってもよい。なお、ビデオファイルデータベースは、図２において単に「ビデオファイルＤＢ」と記載されている。 The video file distribution server 2 distributes video files in MPEG format or WMV format stored in the video file database 2a to the outside via a network. The network used for distributing the video file may be the Internet or a LAN (Local Area Network). The video file database is simply described as “video file DB” in FIG.

ＬＩＶＥ教科書作成装置１は、ビデオファイル配信サーバ２から、ビデオファイル（たとえば講義を録画したファイル）の配信を受け、ＬＩＶＥ教科書データファイルを作成するものである。ＬＩＶＥ教科書作成装置１がＬＩＶＥ教科書データファイルを作成する手順に関しては、後述する。 The LIVE textbook creation device 1 receives a video file (for example, a file recording a lecture) from the video file delivery server 2 and creates a LIVE textbook data file. The procedure in which the LIVE textbook creation device 1 creates the LIVE textbook data file will be described later.

そして、ＬＩＶＥ教科書作成装置１は、自身が作成したＬＩＶＥ教科書データファイルを、ＬＩＶＥ教科書データベース（ファイルデータベース）１ａに格納する。なお、図２においては、ＬＩＶＥ教科書データベースは、図２において単に「ＬＩＶＥ教科書ＤＢ」と記載されている。 The LIVE textbook creation device 1 stores the LIVE textbook data file created by itself in the LIVE textbook database (file database) 1a. In FIG. 2, the LIVE textbook database is simply described as “Live textbook DB” in FIG.

学習端末３は、ＬＩＶＥ教科書データベース１ａから、ネットワーク経由でＬＩＶＥ教科書データファイルを取得し、該ファイルを再生して表示するためのものである。学習端末３におけるＬＩＶＥ教科書データファイルの表示態様については後述する。 The learning terminal 3 acquires a LIVE textbook data file from the LIVE textbook database 1a via a network, and reproduces and displays the file. The display mode of the LIVE textbook data file on the learning terminal 3 will be described later.

このようにして、本実施形態のＬＩＶＥ教科書データファイル配信システムでは、ＬＩＶＥ教科書作成装置１が、ビデオファイルデータベース２ａから取得したビデオファイルからＬＩＶＥ教科書データファイルを作成し、そのＬＩＶＥ教科書データファイルを学習端末３においてネットワーク経由で再生できるように構成されている。 In this way, in the LIVE textbook data file distribution system of the present embodiment, the LIVE textbook creation device 1 creates a LIVE textbook data file from the video file acquired from the video file database 2a, and learns the LIVE textbook data file as a learning terminal. 3 can be played back via the network.

〔２．ＬＩＶＥ教科書作成装置１の構成〕
次に、ＬＩＶＥ教科書作成装置１の構成について、図１を用いてより具体的に説明する。図１に示すように、ＬＩＶＥ教科書作成装置１は、ビデオファイル取得部４と、音声認識部（音声認識手段）５と、テキストファイル加工部６と、シーンチェンジ検出部（シーンチェンジ検出手段）７と、インデックス画像ファイル作成部８と、全体構成調整部（タイトル作成手段、項目数調整手段）９と、ＬＩＶＥ教科書再生用ファイル作成部（ファイル関連付け手段）１０とを備えている。 [2. Configuration of LIVE textbook creation device 1]
Next, the configuration of the LIVE textbook creation apparatus 1 will be described more specifically with reference to FIG. As shown in FIG. 1, the LIVE textbook creation apparatus 1 includes a video file acquisition unit 4, a voice recognition unit (voice recognition unit) 5, a text file processing unit 6, and a scene change detection unit (scene change detection unit) 7. And an index image file creation unit 8, an overall configuration adjustment unit (title creation unit, item number adjustment unit) 9, and a LIVE textbook reproduction file creation unit (file association unit) 10.

ビデオファイル取得部４は、ビデオファイル配信サーバ２（図２）からネットワーク経由で講義内容を録画したビデオファイルを取得するものである。さらに、ビデオファイル取得部４では、ビデオファイル配信サーバ２から取得したビデオファイルを、編集が容易な形式に変換してもよい。たとえば、Adobe Premiere（Adobe社のソフトウェア製品名）を用いてビデオファイルをMPEG2形式に変換すると、種々の編集が容易となる。 The video file acquisition unit 4 acquires a video file recording the lecture contents from the video file distribution server 2 (FIG. 2) via the network. Further, the video file acquisition unit 4 may convert the video file acquired from the video file distribution server 2 into a format that can be easily edited. For example, if video files are converted to MPEG2 format using Adobe Premiere (Adobe software product name), various editing operations are facilitated.

音声認識部５は、ビデオファイル取得部４が取得したビデオファイルそのもの、または該ビデオファイルが編集容易な形式に変換されたファイルに記録された音声を認識し、その音声を、単語単位でテキストファイルに変換するものである。この音声認識は、市販されている音声認識ソフトを用いて自動で行われる。また、音声認識部５が変換するテキストファイルは、そのテキストファイルを構成する各データが、ビデオファイルにおいてどの時刻に再生される音声に対応するものであるかを示す情報（タイムスタンプ）が付与されている。 The voice recognition unit 5 recognizes the voice recorded in the video file itself acquired by the video file acquisition unit 4 or a file in which the video file is converted into an easily editable format, and the voice is converted into a text file in units of words. It is to convert to. This voice recognition is automatically performed using commercially available voice recognition software. The text file converted by the voice recognition unit 5 is given information (time stamp) indicating at which time each piece of data constituting the text file corresponds to the voice to be reproduced in the video file. ing.

テキストファイル加工部６は、音声認識部５が変換したテキストファイルを、文節単位でまとめるものである。すなわち、テキストファイル加工部６は、音声認識部５において認識されている音声の無音期間が、所定の長さ（たとえば２秒）以上連続するならば、その無音期間を文節の区切りとする。さらに、テキストファイル加工部６は、文節単位でまとめたテキストファイルをＬＩＶＥ教科書データベース１ａに格納する。 The text file processing unit 6 collects the text files converted by the speech recognition unit 5 in phrase units. That is, the text file processing unit 6 sets the silent period as a segment break if the silent period of the voice recognized by the voice recognition unit 5 continues for a predetermined length (for example, 2 seconds) or longer. Furthermore, the text file processing unit 6 stores the text file collected in units of clauses in the LIVE textbook database 1a.

シーンチェンジ検出部７は、ビデオファイルにおいてシーンが変化する部分（シーンチェンジ）を検出するものである。シーンチェンジは、たとえば以下のようにして検出することができる。 The scene change detection unit 7 detects a portion (scene change) where the scene changes in the video file. A scene change can be detected as follows, for example.

すなわち、シーンチェンジ検出部７は、ビデオファイルからたとえば１秒間隔で静止画像を抽出する。ビデオファイルが９０分の長さを有しているならば５４００枚の静止画像が抽出される。そして、シーンチェンジ検出部７は、１枚の静止画像を、たとえば９６×７２画素のサイズを有するサムネイル画像に縮小する。さらに、シーンチェンジ検出部７は、そのサムネイル画像を図３に示すように９分割して９個の分割画像を形成し、各分割画像に含まれる画素の画素値の平均を算出する。 That is, the scene change detection unit 7 extracts still images from the video file at intervals of 1 second, for example. If the video file has a length of 90 minutes, 5400 still images are extracted. Then, the scene change detection unit 7 reduces one still image to a thumbnail image having a size of 96 × 72 pixels, for example. Further, the scene change detection unit 7 divides the thumbnail image into nine as shown in FIG. 3 to form nine divided images, and calculates the average of the pixel values of the pixels included in each divided image.

そして、シーンチェンジ検出部７は、１秒毎に並べられたサムネイル画像間で対応する部分画像同士に関して、上記画素値の平均の差分を計算する。つまり、図３に示すように、時刻ｔにおける分割画像Ａに関する画素値の平均をＡｖ、時刻ｔ＋１における分割画像Ｂに関する画素値の平均をＢｖとした場合、Ｒ＝Ａｖ−Ｂｖを計算する。 Then, the scene change detection unit 7 calculates the average difference of the pixel values for the partial images corresponding to each other between the thumbnail images arranged every second. That is, as shown in FIG. 3, R = Av−Bv is calculated when the average of the pixel values related to the divided image A at time t is Av and the average of the pixel values related to the divided image B at time t + 1 is Bv.

さらに、シーンチェンジ検出部７は、上記Ｒをｒ色、ｇ色、およびｂ色のそれぞれについて計算し、各色についてのＲのうち最大のＲをＲｍとする。また、シーンチェンジ検出部７は、このようにして計算されるＲｍを、サムネイル画像内に含まれる全ての分割画像について計算する。 Furthermore, the scene change detection unit 7 calculates R for each of the r color, the g color, and the b color, and sets the maximum R among the Rs for each color as Rm. In addition, the scene change detection unit 7 calculates Rm calculated in this way for all the divided images included in the thumbnail image.

そして、シーンチェンジ検出部７は、Ｒｍが所定の閾値（Ｐ）を超える分割画像の個数が、所定数（ｑ個）よりも多いとき、時刻ｔ＋１においてシーンが変更したものと認識する。なお、Ｐ，ｑの値に関しては、妥当なシーンチェンジを検出できるように任意に設定することができる。 Then, the scene change detection unit 7 recognizes that the scene has changed at time t + 1 when the number of divided images in which Rm exceeds a predetermined threshold (P) is greater than the predetermined number (q). It should be noted that the values of P and q can be arbitrarily set so that an appropriate scene change can be detected.

インデックス画像ファイル作成部８は、シーンチェンジ検出部７においてシーンが変更したと認識されたサムネイル画像を時系列に沿って複数並べることにより、たとえばｗｍｖ形式にてインデックス画像ファイルを作成するものである。このようにして作成されたインデックスファイル画像ファイルを参照することにより、ビデオファイル取得部４にて取得されたビデオファイルの概要を把握することができる。さらに、インデックス画像ファイル作成部８は、ＬＩＶＥ教科書データベース１ａにインデックス画像ファイルを格納する。 The index image file creation unit 8 creates an index image file in the wmv format, for example, by arranging a plurality of thumbnail images recognized by the scene change detection unit 7 as having changed scenes in time series. By referring to the index file image file created in this way, the outline of the video file acquired by the video file acquisition unit 4 can be grasped. Further, the index image file creation unit 8 stores the index image file in the LIVE textbook database 1a.

なお、ビデオファイル取得部４により取得されるビデオファイル、シーンチェンジ検出部７によりシーンがチェンジしたと検出されるサムネイル画像ファイル、およびインデックス画像ファイル作成部８により作成されるインデックス画像ファイルには、全てタイムスタンプが付与されている。 The video file acquired by the video file acquisition unit 4, the thumbnail image file detected by the scene change detection unit 7 that the scene has been changed, and the index image file created by the index image file creation unit 8 are all included. A time stamp is given.

全体構成調整部９は、上記インデックス画像ファイルに含まれるサムネイル画像のタイムスタンプを参照して、テキストファイル加工部６が加工したテキストファイルを章単位で構成するものである。つまり、全体構成調整部９は、あるシーンが始まる時刻およびその次のシーンが始まる時刻を、シーンチェンジしたサムネイル画像ファイルのタイムスタンプに基づき判断し、これらの時刻の範囲に含まれるテキストファイルを、１つの章を構成するテキストファイルとする。 The overall configuration adjusting unit 9 refers to the time stamp of the thumbnail image included in the index image file, and configures the text file processed by the text file processing unit 6 in units of chapters. That is, the overall configuration adjusting unit 9 determines the time when a certain scene starts and the time when the next scene starts based on the time stamp of the thumbnail image file after the scene change, and the text file included in the range of these times is A text file that constitutes one chapter.

さらに、全体構成調整部９は、章の名前を示すテキストファイルを作成する。たとえば、シーンがチェンジしたサムネイル画像に付与されたタイムスタンプから、章が始まる時刻を示すテキストファイルを作成する一方、その章にて最初に発せられた文節を示すテキストファイルを抽出し、これら２つのテキストファイルを合わせたテキストファイルを章の名前を示すテキストファイルとすることができる。なお、テキストファイル加工部６が加工したテキストファイルから重要な単語を示すテキストファイルを抽出し、そのテキストファイルを章の名前を示すものとしてもよい。 Furthermore, the whole structure adjustment part 9 produces the text file which shows the name of a chapter. For example, while creating a text file indicating the time at which a chapter starts from a time stamp given to a thumbnail image whose scene has been changed, a text file indicating the first phrase issued in that chapter is extracted. A text file combined with a text file can be used as a text file indicating a chapter name. Note that a text file indicating an important word may be extracted from the text file processed by the text file processing unit 6, and the text file may indicate a chapter name.

さらに、全体構成調整部９において、章の数を調整することが可能である。たとえば、９０分のビデオファイルならばシーンチェンジ検出部７において数十個のサムネイル画像が検出されるが、章の数としては１０程度が好ましい。そこで、たとえば以下のようにして章の数を削減する。 Furthermore, the overall configuration adjusting unit 9 can adjust the number of chapters. For example, in the case of a 90-minute video file, dozens of thumbnail images are detected by the scene change detection unit 7, but the number of chapters is preferably about 10. Therefore, for example, the number of chapters is reduced as follows.

すなわち、あるシーンチェンジから次のシーンチェンジまでの時間をサムネイル画像の提示継続時間とした場合において、インデックス画像に含まれるサムネイル画像を、提示継続時間の長いものから順番に１０画像ほど抽出する。そして、このように抽出されたサムネイル画像に基づき、全体構成調整部９に章の名前を示すテキストファイルを作成させる。 That is, when the time from a certain scene change to the next scene change is set as the thumbnail image presentation continuation time, about 10 thumbnail images included in the index image are extracted in order from the longest presentation continuation time. Then, based on the thumbnail image extracted in this way, the overall configuration adjusting unit 9 is made to create a text file indicating the name of the chapter.

このサムネイル画像の抽出方法は、まとまった話をしているシーンの提示継続時間は長く、参考程度のそれほど重要でない話をしているシーンの提示継続時間は短いという傾向に基づいている。このようにして、提示継続時間に基づき抽出されるサムネイル画像の数を調整することにより、章の数を適宜調整することができる。また、講義の中で商用放送ビデオが用いられている場合も、これらは数秒単位でシーンが切り替わる。したがって、上述の提示継続時間に基づくサムネイル画像の抽出処理を行えば、商用放送ビデオが１つの章として認識されることを除外できる。 This thumbnail image extraction method is based on the tendency that the presentation duration of a scene that is talking together is long and the presentation duration of a scene that is not so important for reference is short. In this way, the number of chapters can be appropriately adjusted by adjusting the number of thumbnail images extracted based on the presentation duration time. Also, when commercial broadcast videos are used in lectures, these scenes are switched every few seconds. Therefore, if the thumbnail image extraction process based on the above-described presentation duration time is performed, it can be excluded that the commercial broadcast video is recognized as one chapter.

このようにして、全体構成調整部９は、章の名前を示すテキストファイルを作成するとともに、そのテキストファイルを連続させて、ビデオファイルの目次ページを示すテキストファイルを作成する。この目次ページを示すテキストファイルは、ＬＩＶＥ教科書データベース１ａに格納される。 In this way, the overall configuration adjustment unit 9 creates a text file indicating the name of the chapter, and creates a text file indicating the table of contents page of the video file by continuing the text files. The text file indicating this table of contents page is stored in the LIVE textbook database 1a.

ＬＩＶＥ教科書再生用ファイル作成部１０は、テキストファイル加工部６が加工したテキストファイルのタイムスタンプ、およびインデックス画像ファイル作成部８が作成したインデックス画像ファイルのタイムスタンプに基づき、テキストファイルとビデオファイルとを関連付けて再生するためのＬＩＶＥ教科書再生用ファイルを、たとえばＨＴＭＬ形式にて作成するものである。このＬＩＶＥ教科書再生用ファイルは、講義中に配布した資料のファイルをビデオファイルに関連付けてもよい。このようにして作成されたＬＩＶＥ教科書再生用ファイルは、ＬＩＶＥ教科書データベースに格納される。 The LIVE textbook reproduction file creation unit 10 generates a text file and a video file based on the time stamp of the text file processed by the text file processing unit 6 and the time stamp of the index image file created by the index image file creation unit 8. A LIVE textbook reproduction file for reproduction in association is created, for example, in the HTML format. In the LIVE textbook reproduction file, the material file distributed during the lecture may be associated with the video file. The LIVE textbook reproduction file created in this way is stored in the LIVE textbook database.

上記構成により、ＬＩＶＥ教科書作成装置１は、インデックス画像ファイル、テキストファイル、およびＬＩＶＥ教科書再生用ファイルを、ＬＩＶＥ教科書データベース１ａに格納する。なお、上記テキストファイルには、テキストファイル加工部６により文節単位で構成されたテキストファイル、ならびに全体構成調整部９にて作成された章の名前を示すテキストファイルおよび目次ページを示すテキストファイルが含まれる。 With the above configuration, the LIVE textbook creation apparatus 1 stores the index image file, the text file, and the LIVE textbook reproduction file in the LIVE textbook database 1a. Note that the text file includes a text file configured in units of clauses by the text file processing unit 6, a text file indicating the name of the chapter created by the overall configuration adjusting unit 9, and a text file indicating a table of contents page. It is.

また、ＬＩＶＥ教科書データベース１ａには、講義中で配布した資料のデータファイルを格納しておいてもよい。この場合には、ＬＩＶＥ教科書再生用ファイル作成部１０にて、資料のデータファイルと、インデックス画像ファイルおよびテキストファイルとの関連付けを行う必要がある。 The LIVE textbook database 1a may store data files of materials distributed during the lecture. In this case, the LIVE textbook reproduction file creation unit 10 needs to associate the data file of the material with the index image file and the text file.

つまり、資料のデータファイルがどの時刻での講義内容に対応するものであるかを示すタイムスタンプを、該データファイルに付与し、そのタイムスタンプと、インデックス画像ファイルおよびテキストファイルのそれぞれのタイムスタンプとを関連付けする必要がある。 That is, a time stamp indicating at which time the data file of the material corresponds to the lecture content is given to the data file, and the time stamp and each time stamp of the index image file and the text file Need to be associated.

〔３．処理フロー〕
次に、上記構成のＬＩＶＥ教科書作成装置１により実行されるＬＩＶＥ教科書データファイルの作成手順について、図４のフローチャートを用いて説明する。 [3. Processing flow
Next, a procedure for creating a LIVE textbook data file executed by the LIVE textbook creating apparatus 1 having the above configuration will be described with reference to the flowchart of FIG.

図４に示すように、先ず、ＬＩＶＥ教科書作成装置１のビデオファイル取得部４により、ビデオファイルが取得される（ステップ１、以下ステップを単にＳと記載する）。その後、Ｓ１にて取得されたビデオファイルの音声認識が音声認識部５により実行される（Ｓ２）。さらに、音声認識部５が認識したテキストファイルが、テキストファイル加工部６により文節単位にまとめられる（Ｓ３）。 As shown in FIG. 4, first, a video file is acquired by the video file acquisition unit 4 of the LIVE textbook creation apparatus 1 (step 1, hereinafter, the step is simply referred to as S). Thereafter, voice recognition of the video file acquired in S1 is executed by the voice recognition unit 5 (S2). Further, the text files recognized by the voice recognition unit 5 are grouped into phrase units by the text file processing unit 6 (S3).

また、ビデオファイル取得部４にて取得されたビデオファイルに基づき、シーンチェンジ検出部７にてシーンチェンジが検出される（Ｓ４）。さらに、Ｓ５においては、シーンがチェンジしたと認識されるサムネイル画像が、インデックス画像ファイル作成部８により時系列で並べられてインデックス画像ファイルが作成される。 Further, based on the video file acquired by the video file acquisition unit 4, the scene change detection unit 7 detects a scene change (S4). Furthermore, in S5, thumbnail images that are recognized to have changed scenes are arranged in time series by the index image file creation unit 8 to create an index image file.

その後、全体構成調整部９は、テキストファイル加工部６が加工したテキストファイルを、Ｓ５にて作成されたインデックス画像ファイルに基づき、章単位で構成する（Ｓ６）。 Thereafter, the overall configuration adjusting unit 9 configures the text file processed by the text file processing unit 6 in units of chapters based on the index image file created in S5 (S6).

さらに、全体構成調整部９において、上述した提示継続時間に基づく章の数の調整、および目次ページの作成が行われる（Ｓ７）。なお、章の数の調整は必ずしも行われる必要はない。 Further, the overall configuration adjustment unit 9 adjusts the number of chapters based on the above-described presentation duration and creates a table of contents page (S7). Note that adjustment of the number of chapters is not necessarily performed.

そして、上述のステップにより作成されたテキストファイルおよびインデックス画像ファイルのそれぞれのタイムスタンプにより、ＬＩＶＥ教科書再生用ファイル作成部１０によりＬＩＶＥ教科書再生用ファイルが作成される（Ｓ８）。その後、テキストファイル、インデックス画像ファイル、およびＬＩＶＥ教科書再生用ファイルがＬＩＶＥ教科書データベース１ａに格納されることにより、ＬＩＶＥ教科書データファイルの作成が終了する（Ｓ９）。 Then, a LIVE textbook reproduction file is created by the LIVE textbook reproduction file creation unit 10 based on the respective time stamps of the text file and the index image file created in the above steps (S8). Thereafter, the text file, the index image file, and the LIVE textbook reproduction file are stored in the LIVE textbook database 1a, thereby completing the creation of the LIVE textbook data file (S9).

〔４．再生画面例〕
次に、上述のように作成されたＬＩＶＥ教科書データファイルの再生画面の一例について、図５および図６を用いて説明する。なお、以下に説明する再生画面は、学習端末３が、ネットワーク経由でＬＩＶＥ教科書データベース１ａからＬＩＶＥ教科書再生用ファイル、インデックス画像ファイル、およびテキストファイルを取得することにより、学習端末３側で表示されるものである。なお、ビデオファイルそのものは、学習端末３がビデオファイルデータベース２ａから取得するようにしてもよいし、ＬＩＶＥ教科書データベース１ａから取得するようにしてもよい。 [4. (Example of playback screen)
Next, an example of the reproduction screen of the LIVE textbook data file created as described above will be described with reference to FIGS. The reproduction screen described below is displayed on the learning terminal 3 side when the learning terminal 3 obtains a LIVE textbook reproduction file, an index image file, and a text file from the LIVE textbook database 1a via the network. Is. Note that the video file itself may be acquired from the video file database 2a by the learning terminal 3, or may be acquired from the LIVE textbook database 1a.

図５は、ＬＩＶＥ教科書データファイルを再生する前に表示される、該ファイルの概要を示す画面（以下、単にホーム画面という）を示す図である。このホーム画面には、図５に示すように、講義名および講師名を表示するウインドウ、タイトルウインドウ、テキストウインドウ、インデックス画像表示用ウインドウ、章選択ボタン、および配布資料ファイルへのリンク等が表示される。 FIG. 5 is a diagram showing a screen (hereinafter simply referred to as a home screen) showing an outline of the LIVE textbook data file displayed before reproduction. As shown in FIG. 5, the home screen displays a lecture name and a lecturer name window, a title window, a text window, an index image display window, a chapter selection button, a link to a distribution material file, and the like. The

ホーム画面におけるタイトルウインドウには、講義名を表示することが好ましい。なお、タイトルウインドウには後述するように章の名前を表示してもよい。 It is preferable to display the lecture name in the title window on the home screen. The title window may display the name of the chapter as will be described later.

また、ホーム画面におけるテキストウインドウには、講義の目次が表示される。このテキストウインドウに表示される目次は、ＬＩＶＥ教科書作成装置１の全体構成調整部９（図１参照）により作成される目次ページを示すテキストファイルに基づき表示されるものである。 In addition, a lecture table of contents is displayed in a text window on the home screen. The table of contents displayed in the text window is displayed based on a text file indicating a table of contents page created by the overall configuration adjusting unit 9 (see FIG. 1) of the LIVE textbook creating apparatus 1.

インデックス画像表示用ウインドウには、インデックス画像ファイル作成部８にて作成されたインデックス画像ファイルが表示される。このインデックス画像ファイルは、図５に示すように、複数のサムネイル画像が時系列で並ぶように表示される。また、インデックス画像表示用ウインドウには、スクロールバーが設けられている。これにより、画面サイズとの関係から表示されていないサムネイル画像を、スクロールバーでインデックス画像をスライドさせることにより表示することができる。 In the index image display window, the index image file created by the index image file creation unit 8 is displayed. The index image file is displayed so that a plurality of thumbnail images are arranged in time series as shown in FIG. The index image display window is provided with a scroll bar. Thereby, thumbnail images that are not displayed due to the relationship with the screen size can be displayed by sliding the index image with the scroll bar.

章選択ボタンは、所望の章の画像をユーザが見れるように、ユーザが章番号を入力するためのものである。章選択ボタンは、図５に示すようにテンキー形式で表示してもよいし、その他の形式、たとえばスクロールバー形式により表示してもよい。 The chapter selection button is used by the user to input a chapter number so that the user can see a desired chapter image. The chapter selection button may be displayed in a numeric keypad format as shown in FIG. 5, or may be displayed in another format, for example, a scroll bar format.

配布資料ファイルへのリンクは、講義で配布された資料を適宜参照できるようにするためのリンクである。講義で配布された資料が無い場合は、もちろんこのリンク表示を省略してもよい。また、講義に関連する教科書、参考書、ＨＰアドレスへのリンクを表示しても構わない。 The link to the distribution material file is a link for appropriately referring to the material distributed in the lecture. Of course, if there is no material distributed in the lecture, this link display may be omitted. In addition, textbooks related to lectures, reference books, and links to HP addresses may be displayed.

また、ホーム画面においては、インデックス画像に含まれる代表的なサムネイル画像を適当な数抽出し、そのサムネイル画像を、インデックス画像表示用ウインドウにおけるサイズよりも大きなサイズで表示してもよい。これにより、ユーザが講師の顔を確認し、講義の雰囲気を大まかに把握することが可能となる。 On the home screen, an appropriate number of representative thumbnail images included in the index image may be extracted, and the thumbnail images may be displayed in a size larger than the size in the index image display window. As a result, the user can confirm the face of the instructor and roughly understand the atmosphere of the lecture.

図６は、ＬＩＶＥ教科書データファイルを再生する際に表示される画面（以下、ストリーミング画面という）を示す図である。このストリーミング画面には、図６に示すように、ビデオファイル再生用ウインドウ、タイトルウインドウ、テキストウインドウ、インデックス画像表示用ウインドウ、章選択ボタン、および配布資料のリンク等が表示されている。 FIG. 6 is a view showing a screen (hereinafter referred to as a streaming screen) displayed when a LIVE textbook data file is reproduced. As shown in FIG. 6, the streaming screen displays a video file playback window, a title window, a text window, an index image display window, chapter selection buttons, distribution material links, and the like.

ビデオファイル再生用ウインドウは、ビデオファイル取得部４（図１参照）が取得したビデオファイルを動画像で表示するものである。また、ストリーミング画面におけるタイトルウインドウには、ビデオファイル再生ウインドウにて現在再生されているビデオファイルに相当する章の名前が表示される。 The video file playback window displays the video file acquired by the video file acquisition unit 4 (see FIG. 1) as a moving image. In the title window on the streaming screen, the name of the chapter corresponding to the video file currently played back in the video file playback window is displayed.

また、ストリーミング画面におけるテキストウインドウには、ビデオファイル再生ウインドウにて現在再生されているビデオファイルに対応するテキストファイルが表示される。特に、このテキストウインドウにおいては、ビデオファイル再生ウインドウにて現在再生されている講義内容に対応するテキストが、たとえば反転表示等により強調表示されている。これにより、ユーザは、現在再生中の講義においてどのような内容が話されているかを瞬時に理解することができる。 In the text window on the streaming screen, a text file corresponding to the video file currently reproduced in the video file reproduction window is displayed. In particular, in this text window, the text corresponding to the lecture content currently reproduced in the video file reproduction window is highlighted by, for example, reverse display. As a result, the user can instantly understand what is being spoken in the currently reproduced lecture.

また、強調表示の手法としては、図８に示すように、現在再生されている講義内容に対応するテキストを、拡大表示用ウインドウ内に拡大表示してもよい、これにより、弱視のユーザが講義内容を容易に理解できるようになる。さらに、現在再生されている講義内容に対応するテキストを英語に訳し、留学生のために英文にて講義内容を表示してもよい。 As a highlighting method, as shown in FIG. 8, text corresponding to the currently played lecture content may be enlarged and displayed in the enlarged display window. You will be able to understand the contents easily. Furthermore, the text corresponding to the currently replayed lecture content may be translated into English, and the lecture content may be displayed in English for international students.

さらに、ストリーミング画面におけるインデックス画像表示ウインドウでは、ビデオファイル再生ウインドウにて現在再生されているビデオファイルに対応するサムネイル画像が、太枠で囲まれる等の手法により強調して表示されている。 Further, in the index image display window on the streaming screen, the thumbnail image corresponding to the video file currently played back in the video file playback window is displayed with emphasis by a method such as being surrounded by a thick frame.

このように表示される図５のホーム画面および図６のストリーミング画面では、インデックス画像表示用ウインドウに表示されるサムネイル画像ファイル、テキストウインドウに表示される目次または講義の内容を示すテキストファイル、およびビデオファイル再生用ウインドウに表示されるビデオファイルが、タイムスタンプにより関連づけられている。そして、これらのファイルの関連づけは、ＬＩＶＥ教科書再生用ファイルをＬＩＶＥ教科書データベース１ａから学習端末３が取得することにより、学習端末３にて把握することができる。 In the home screen of FIG. 5 and the streaming screen of FIG. 6 displayed in this way, a thumbnail image file displayed in the index image display window, a text file indicating the contents of the table of contents or lecture displayed in the text window, and a video Video files displayed in the file playback window are related by time stamps. The association of these files can be grasped by the learning terminal 3 when the learning terminal 3 acquires the LIVE textbook reproduction file from the LIVE textbook database 1a.

したがって、インデックス画像表示用ウインドウにて、特定のサムネイル画像をクリックすることにより、そのサムネイル画像のタイムスタンプの時刻からビデオファイルを再生することができる。また、テキストウインドウにて、特定の目次または文節をクリックすることにより、クリックされたテキストのタイムスタンプの時刻からビデオファイルが再生されるようにもできる。 Therefore, by clicking a specific thumbnail image in the index image display window, the video file can be reproduced from the time stamp of the thumbnail image. In addition, by clicking a specific table of contents or clause in the text window, the video file can be played from the time stamp of the clicked text.

参考のため、図５のホーム画面および図６のストリーミング画面を表示するためのＬＩＶＥ教科書データファイルの構成を、図７に示す。図７に示すように、ＬＩＶＥ教科書データファイルでは、講義のビデオファイル、サムネイル画像ファイル、テキストファイル、およびこれらのタイムスタンプが構造化されている。また、配布資料のファイルも、ＬＩＶＥ教科書データファイル内に含まれていることがわかる。 For reference, FIG. 7 shows the structure of a LIVE textbook data file for displaying the home screen of FIG. 5 and the streaming screen of FIG. As shown in FIG. 7, in the LIVE textbook data file, a lecture video file, a thumbnail image file, a text file, and time stamps thereof are structured. It can also be seen that the distribution material file is also included in the LIVE textbook data file.

〔５．適用例〕
本実施形態のＬＩＶＥ教科書作成装置１によれば、上述のように講義内容のビデオファイルと、その講義で話されている内容を示すテキストファイルとが同期して再生されるので、種々の場面において好適である。以下、本実施形態の適用例をいくつか紹介する。 [5. Application example)
According to the LIVE textbook creation apparatus 1 of the present embodiment, the video file of the lecture content and the text file indicating the content spoken in the lecture are reproduced in synchronization as described above, so that in various scenes Is preferred. Hereinafter, some application examples of this embodiment will be introduced.

先ず、本実施形態のＬＩＶＥ教科書作成装置１は、大学・高校等の学校や研究機関における講義を、学生が復習するのに適している。つまり、講義の内容を理解するためには、講義で話されている内容が文字にて表されたものを視認することが直接的でありなおかつ効率的である。この点、本実施形態のＬＩＶＥ教科書作成装置１によれば、学習端末３側において講義内容がテキストファイルとして表示されるので（図６のテキストウインドウ参照）、講義の内容を効率的に理解し復習できる。 First, the LIVE textbook creation apparatus 1 according to the present embodiment is suitable for a student to review a lecture at a school or research institution such as a university or high school. That is, in order to understand the content of the lecture, it is straightforward and efficient to visually recognize what is spoken in the lecture as characters. In this regard, according to the LIVE textbook creation apparatus 1 of the present embodiment, the lecture content is displayed as a text file on the learning terminal 3 side (see the text window in FIG. 6), so that the content of the lecture is efficiently understood and reviewed. it can.

また、講義を担当する先生が、自分の講義をより良いものにするという観点からも、本実施形態のＬＩＶＥ教科書作成装置１は好適である。つまり、学生だけでなく先生にしてみても、ＬＩＶＥ教科書作成装置１により作成されたＬＩＶＥ教科書データファイルを学習端末３で再生すれば、効率的に自分の講義を復習し、次回の講義への改善点を見出すことができる。 Moreover, the LIVE textbook creation apparatus 1 of this embodiment is suitable also from a viewpoint that the teacher in charge of a lecture makes his lecture better. In other words, even if it is not only a student but also a teacher, if the LIVE textbook data file created by the LIVE textbook creation device 1 is played back on the learning terminal 3, it is possible to review your lecture efficiently and improve it to the next lecture Can find a point.

さらに、本実施形態のＬＩＶＥ教科書作成装置１のユーザは、学生や先生だけに限られるものではない。たとえば、本実施形態のＬＩＶＥ教科書作成装置１によればビデオファイルとテキストファイルとを同期させて再生することができるので、難聴者が講義の内容をより容易に理解できるようになる。また、本実施形態のＬＩＶＥ教科書作成装置１の使用目的は、講義の復習に限定されるものではなく、たとえば、講演会、ドキュメンタリー、ドラマ、ニュース番組、英会話番組を見る目的で使用されてもよい。 Furthermore, the user of the LIVE textbook creation apparatus 1 of the present embodiment is not limited to students and teachers. For example, according to the LIVE textbook creation apparatus 1 of the present embodiment, a video file and a text file can be reproduced in synchronization with each other, so that the hearing impaired person can understand the contents of the lecture more easily. Further, the purpose of use of the LIVE textbook creation apparatus 1 of the present embodiment is not limited to review of lectures, and may be used for the purpose of viewing lectures, documentaries, dramas, news programs, English conversation programs, for example. .

〔６．補足〕
本実施形態のＬＩＶＥ教科書作成装置１の処理手順は、ＣＰＵなどの演算手段が、ＲＯＭ（Read Only Memory）やＲＡＭなどの記憶手段に記憶されたプログラムを実行し、キーボードなどの入力手段、ディスプレイなどの出力手段、あるいは、インターフェース回路などの通信手段を制御することにより実現することができる。 [6. Supplement)
The processing procedure of the LIVE textbook creation apparatus 1 of the present embodiment is such that a calculation unit such as a CPU executes a program stored in a storage unit such as a ROM (Read Only Memory) or a RAM, an input unit such as a keyboard, a display, or the like This can be realized by controlling the output means or the communication means such as an interface circuit.

したがって、これらの手段を有するコンピュータが、上記プログラムを記録した記録媒体を読み取り、当該プログラムを実行するだけで、本実施形態のＬＩＶＥ教科書作成装置１の各種処理を実現することができる。また、上記プログラムをリムーバブルな記録媒体に記録することにより、任意のコンピュータ上で上記の各種機能および各種処理を実現することができる。 Therefore, various processes of the LIVE textbook creation apparatus 1 of the present embodiment can be realized simply by a computer having these means reading the recording medium on which the program is recorded and executing the program. In addition, by recording the program on a removable recording medium, the various functions and various processes described above can be realized on an arbitrary computer.

この記録媒体としては、マイクロコンピュータで処理を行うために図示しないメモリ、例えばＲＯＭのようなものがプログラムメディアであっても良いし、また、図示していないが外部記憶装置としてプログラム読み取り装置が設けられ、そこに記録媒体を挿入することにより読み取り可能なプログラムメディアであっても良い。 As the recording medium, a program medium such as a memory (not shown) such as a ROM may be used for processing by the microcomputer, and a program reading device is provided as an external storage device (not shown). It may be a program medium that can be read by inserting a recording medium there.

また、何れの場合でも、格納されているプログラムは、マイクロプロセッサがアクセスして実行される構成であることが好ましい。さらに、プログラムを読み出し、読み出されたプログラムは、マイクロコンピュータのプログラム記憶エリアにダウンロードされて、そのプログラムが実行される方式であることが好ましい。なお、このダウンロード用のプログラムは予め本体装置に格納されているものとする。 In any case, the stored program is preferably configured to be accessed and executed by the microprocessor. Furthermore, it is preferable that the program is read out, and the read program is downloaded to the program storage area of the microcomputer and the program is executed. It is assumed that the download program is stored in the main device in advance.

また、上記プログラムメディアとしては、本体と分離可能に構成される記録媒体であり、磁気テープやカセットテープ等のテープ系、フレキシブルディスクやハードディスク等の磁気ディスクやＣＤ／ＭＯ／ＭＤ／ＤＶＤ等のディスクのディスク系、ＩＣカード（メモリカードを含む）等のカード系、あるいはマスクＲＯＭ、ＥＰＲＯＭ（Erasable Programmable Read Only Memory）、ＥＥＰＲＯＭ（Electrically Erasable Programmable Read Only Memory）、フラッシュＲＯＭ等による半導体メモリを含めた固定的にプログラムを担持する記録媒体等がある。 The program medium is a recording medium configured to be separable from the main body, such as a tape system such as a magnetic tape or a cassette tape, a magnetic disk such as a flexible disk or a hard disk, or a disk such as a CD / MO / MD / DVD. Fixed disk system, card system such as IC card (including memory card), or semiconductor memory such as mask ROM, EPROM (Erasable Programmable Read Only Memory), EEPROM (Electrically Erasable Programmable Read Only Memory), flash ROM, etc. In particular, there are recording media that carry programs.

また、インターネットを含む通信ネットワークを接続可能なシステム構成であれば、通信ネットワークからプログラムをダウンロードするように流動的にプログラムを担持する記録媒体であることが好ましい。 In addition, if the system configuration is capable of connecting to a communication network including the Internet, the recording medium is preferably a recording medium that fluidly carries the program so as to download the program from the communication network.

さらに、このように通信ネットワークからプログラムをダウンロードする場合には、そのダウンロード用のプログラムは予め本体装置に格納しておくか、あるいは別な記録媒体からインストールされるものであることが好ましい。 Further, when the program is downloaded from the communication network as described above, it is preferable that the download program is stored in the main device in advance or installed from another recording medium.

このように、本実施形態のＬＩＶＥ教科書作成装置１は、動画像データおよび音声データを記録したビデオファイルを編集するものであって、上記音声データを認識してテキストファイルに変換する音声認識部５と、上記テキストファイルと上記ビデオファイルとを、そのテキストファイルが上記ビデオファイルにおいてどの時刻の内容であるかを示す時刻情報に基づき関連づけるＬＩＶＥ教科書再生用ファイル作成部１０と、を備えている。 As described above, the LIVE textbook creation device 1 of the present embodiment edits a video file in which moving image data and audio data are recorded, and recognizes the audio data and converts it into a text file. And a LIVE textbook reproduction file creation unit 10 for associating the text file and the video file on the basis of time information indicating what time the text file is in the video file.

上記構成によれば、音声認識部５により変換されたテキストファイルにより、ビデオファイルの内容を学習端末３で表示することができるので、ビデオファイルの内容を文章化して効率的にユーザに伝えることができる。 According to the above configuration, since the content of the video file can be displayed on the learning terminal 3 by the text file converted by the voice recognition unit 5, the content of the video file can be converted into text and efficiently transmitted to the user. it can.

さらに、ＬＩＶＥ教科書再生用ファイル作成部１０によりビデオファイルとテキストファイルとが関連付けられているので、ビデオファイルとテキストファイルとを同期させて学習端末３で表示することができる。つまり、ビデオファイルとテキストファイルとを互いに関連させた状態で学習端末３に表示することができ、ビデオファイルの内容をより効率的にユーザに伝えることができる。 Further, since the video file and the text file are associated with each other by the LIVE textbook reproduction file creation unit 10, the video file and the text file can be synchronized and displayed on the learning terminal 3. That is, the video file and the text file can be displayed on the learning terminal 3 while being associated with each other, and the content of the video file can be transmitted to the user more efficiently.

さらに、上記構成のＬＩＶＥ教科書作成装置１は、上記ビデオファイルにおいて場面が変化する箇所であるシーンチェンジを検出するシーンチェンジ検出部７と、上記シーンチェンジの時刻に基づき、上記テキストファイルの内容を項目分けするためのタイトル名を、各項目について作成する全体構成調整部９と、を備えている。 Furthermore, the LIVE textbook creation apparatus 1 having the above-described configuration includes a scene change detection unit 7 that detects a scene change that is a location where a scene changes in the video file, and items of the contents of the text file based on the time of the scene change. And an overall configuration adjustment unit 9 that creates title names for division for each item.

上記構成によれば、シーンチェンジ検出部７により検出されたシーンチェンジの時刻に基づいて、全体構成調整部９によりテキストファイルに関するタイトル名が作成される。したがって、このタイトル名を学習端末３にて表示することにより、ユーザはテキストファイルの内容を大まかに把握することができる。それゆえ、ビデオファイルの内容をより効率的にユーザに伝えることができる。 According to the above configuration, based on the scene change time detected by the scene change detection unit 7, the overall configuration adjustment unit 9 creates a title name for the text file. Accordingly, by displaying the title name on the learning terminal 3, the user can roughly grasp the contents of the text file. Therefore, the contents of the video file can be transmitted to the user more efficiently.

さらに、全体構成調整部９は、シーンチェンジ検出部７により複数のシーンチェンジが上記ビデオファイルから検出される場合、上記シーンチェンジ同士の時間間隔（提示継続時間）に基づき、上記テキストファイルを分ける項目の数を調整することが好ましい。 Further, when the scene change detecting unit 7 detects a plurality of scene changes from the video file, the overall configuration adjusting unit 9 separates the text file based on the time interval (presentation duration) between the scene changes. It is preferable to adjust the number.

これにより、シーンチェンジ同士の時間間隔が長いものを、テキストファイルの項目数として妥当な数だけ選び出し、そのシーンチェンジの時刻に基づきタイトル名を作成することができる。これにより、項目数を適宜調整して、ビデオファイルの内容がより効率的にユーザに伝わるようにすることができる。 As a result, an item with a long time interval between scene changes can be selected as an appropriate number of items in the text file, and a title name can be created based on the time of the scene change. Thereby, the number of items can be adjusted as appropriate so that the content of the video file can be transmitted to the user more efficiently.

さらに、ＬＩＶＥ教科書再生用ファイル作成部１０は、上記ビデオファイルに記録された動画像内で配布された資料のデータと、上記テキストファイルおよび上記ビデオファイルの少なくとも一方とを関連付けるものである。 Further, the LIVE textbook reproduction file creation unit 10 associates the data of the material distributed in the moving image recorded in the video file with at least one of the text file and the video file.

上記構成によれば、ＬＩＶＥ教科書再生用ファイル作成部１０により、テキストファイルおよびビデオファイルの少なくとも一方と配布資料のデータとが関連付けられる。この関連付けは、たとえばその資料が配布された時刻を示すタイムスタンプに基づき行えばよい。これにより、テキストファイルおよびビデオファイルを外部機器にて表示する場合に、これらのファイルに配布資料のデータを関連付けて表示することができる。 According to the above configuration, the LIVE textbook reproduction file creation unit 10 associates at least one of the text file and the video file with the data of the distribution material. This association may be performed based on, for example, a time stamp indicating the time when the material is distributed. As a result, when the text file and the video file are displayed on the external device, the data of the distribution material can be displayed in association with these files.

また、本実施形態の学習端末３は、動画像データおよび音声データを記録したビデオファイルを再生するものであって、上記音声データが変換されたテキストファイルであって、そのテキストファイルが上記ビデオファイルにおいてどの時刻の内容であるかを示す時刻情報に基づき関連付けられたテキストファイルを、上記ビデオファイルと同時に表示する。 Further, the learning terminal 3 of the present embodiment reproduces a video file in which moving image data and audio data are recorded, and is a text file obtained by converting the audio data, and the text file is the video file. The text file associated based on the time information indicating at which time the content of the video is displayed simultaneously with the video file.

さらに、上記構成の学習端末３は、上記ビデオファイルに記録された動画像内で現在話されている内容に対応する上記テキストファイルの一部を、強調表示する。 Further, the learning terminal 3 having the above configuration highlights a part of the text file corresponding to the content currently spoken in the moving image recorded in the video file.

さらに、上記構成の学習端末３は、上記表示されているテキストファイルのうち、ユーザに選択された部分に対応する箇所から上記ビデオファイルを再生することが好ましい。 Furthermore, it is preferable that the learning terminal 3 having the above configuration reproduces the video file from a portion corresponding to a portion selected by the user in the displayed text file.

または、上記テキストファイルの内容を項目分けするためのタイトル名を複数表示するとともに、上記複数のタイトル名のうち、ユーザに選択されたタイトル名に対応する箇所から上記ビデオファイルを再生するものであってもよい。 Alternatively, a plurality of title names for categorizing the contents of the text file are displayed, and the video file is reproduced from a location corresponding to the title name selected by the user among the plurality of title names. May be.

さらに、上記構成の学習端末３は、上記ビデオファイルにおいて場面が変化する箇所に対応するサムネイル画像を複数並べて表示するとともに、上記複数のサムネイル画像のうち、ユーザに選択されたサムネイル画像に対応する箇所から上記ビデオファイルを再生するものであってもよい。 Further, the learning terminal 3 configured as described above displays a plurality of thumbnail images corresponding to the location where the scene changes in the video file, and also includes a location corresponding to the thumbnail image selected by the user among the plurality of thumbnail images. The above video file may be reproduced.

さらに、上記構成の学習端末３は、上記ビデオファイルに記録された動画像内で配布された資料のデータへのリンクを、上記テキストファイルまたは上記ビデオファイルの少なくとも一方と関連付けて表示することが好ましい。 Furthermore, it is preferable that the learning terminal 3 configured as described above displays a link to the data of the material distributed in the moving image recorded in the video file in association with at least one of the text file or the video file. .

さらに、本実施形態のＬＩＶＥ教科書データベース１ａは、動画像データおよび音声データを記録したビデオファイルを再生するためのＬＩＶＥ教科書再生用ファイルを格納するものであって、上記ＬＩＶＥ教科書再生用ファイルは、上記音声データが変換されたテキストファイルを、そのテキストファイルが上記ビデオファイルにおいてどの時刻の内容であるかを示す時刻情報（タイムスタンプ）に基づき関連付けるものである。 Furthermore, the LIVE textbook database 1a of the present embodiment stores a LIVE textbook reproduction file for reproducing a video file in which moving image data and audio data are recorded, and the LIVE textbook reproduction file is the above-mentioned file. The text file into which the audio data is converted is associated on the basis of time information (time stamp) indicating what time the text file is in the video file.

上記構成によれば、ＬＩＶＥ教科書再生用ファイルによりテキストファイルとビデオファイルとが関連付けられているので、このＬＩＶＥ教科書再生用ファイルをファイル配信サーバを用いて外部に配信することにより、学習端末３にてビデオファイルとテキストファイルとを関連付けて再生することができる。したがって、ユーザは、テキストファイルにより文章化されたビデオファイルの内容と、ビデオファイルの動画像および音声とを同時に得ることができるので、ビデオファイルの内容をより効率的に理解することができる。 According to the above configuration, since the text file and the video file are associated with each other by the LIVE textbook reproduction file, the learning terminal 3 distributes the LIVE textbook reproduction file to the outside using the file distribution server. A video file and a text file can be associated and played. Therefore, the user can simultaneously obtain the content of the video file documented by the text file and the moving image and sound of the video file, so that the content of the video file can be understood more efficiently.

なお、本発明は上述した実施形態に限定されるものではなく、請求項に示した範囲で種々の変更が可能であり、実施形態にそれぞれ開示された技術的手段を適宜組み合わせて得られる実施形態についても本発明の技術的範囲に含まれる。 The present invention is not limited to the above-described embodiment, and various modifications are possible within the scope shown in the claims, and the embodiment can be obtained by appropriately combining technical means disclosed in the embodiment. Is also included in the technical scope of the present invention.

本発明によれば、テキストファイルによりビデオファイルの内容を文章化してユーザに伝えることができ、さらにビデオファイルとテキストファイルとを互いに関連させた状態で表示することができるので、ビデオファイルの内容をより効率的にユーザに伝えることができる。したがって、ビデオファイルに記録された講義等を学生や先生が復習するのに適している。 According to the present invention, the contents of the video file can be documented and transmitted to the user by the text file, and the video file and the text file can be displayed in a state of being associated with each other. It can be communicated to the user more efficiently. Therefore, it is suitable for students and teachers to review lectures recorded in video files.

本発明の映像編集装置の一実施形態に係るＬＩＶＥ教科書作成装置の構成を示すブロック図である。It is a block diagram which shows the structure of the LIVE textbook preparation apparatus which concerns on one Embodiment of the video editing apparatus of this invention. 図１のＬＩＶＥ教科書作成装置を用いるシステムの構成を示す図である。It is a figure which shows the structure of the system using the LIVE textbook preparation apparatus of FIG. 図１のＬＩＶＥ教科書作成装置により行われるシーンチェンジ検出を説明するための図である。It is a figure for demonstrating the scene change detection performed by the LIVE textbook preparation apparatus of FIG. 本発明の映像編集方法の一実施形態に係る処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the process which concerns on one Embodiment of the video editing method of this invention. 図２の学習端末において表示される画面の一例を示す図である。It is a figure which shows an example of the screen displayed in the learning terminal of FIG. 図２の学習端末において表示される画面の一例を示す図である。It is a figure which shows an example of the screen displayed in the learning terminal of FIG. 図１のＬＩＶＥ教科書作成装置により作成されるＬＩＶＥ教科書データファイルの構成を示す図である。It is a figure which shows the structure of the LIVE textbook data file produced by the LIVE textbook production apparatus of FIG. 図２の学習端末において表示される画面の一例を示す図である。It is a figure which shows an example of the screen displayed in the learning terminal of FIG.

Explanation of symbols

１ＬＩＶＥ教科書作成装置（映像編集装置）
１ａＬＩＶＥ教科書データベース（ファイルデータベース）
３学習端末（映像再生装置）
５音声認識部（音声認識手段）
７シーンチェンジ検出部（シーンチェンジ検出手段）
８インデックス画像ファイル作成部
９全体構成調整部（タイトル作成手段、項目数調整手段）
１０ＬＩＶＥ教科書再生用ファイル作成部（ファイル関連付け手段） 1 LIVE textbook creation device (video editing device)
1a LIVE textbook database (file database)
3 Learning terminal (video playback device)
5 Voice recognition unit (voice recognition means)
7 Scene change detection unit (scene change detection means)
8 Index image file creation unit 9 Overall configuration adjustment unit (title creation means, item number adjustment means)
10 LIVE textbook playback file creation unit (file association means)

Claims

A video editing device for editing a video file recording moving image data and audio data,
Voice recognition means for recognizing the voice data and converting it into a text file;
A video editing apparatus, comprising: file association means for associating the text file with the video file based on time information indicating a time at which the text file is in the video file.

Scene change detection means for detecting a scene change that is a location where a scene changes in the video file;
2. The video editing according to claim 1, further comprising: a title creating means for creating a title name for dividing the contents of the text file into items based on the time of the scene change. apparatus.

When a plurality of scene changes are detected from the video file by the scene change detection means,
3. The video editing apparatus according to claim 2, further comprising item number adjusting means for adjusting the number of items for dividing the text file based on a time interval between the scene changes.

4. The file associating means further associates data of a material distributed in a moving image recorded in the video file with at least one of the text file and the video file. The video editing apparatus according to any one of the above.

5. The video editing apparatus according to claim 4, wherein the file association unit associates the data of the material based on time information indicating a time when the material is distributed.

A video playback device for playing back a video file in which moving image data and audio data are recorded,
A text file in which the audio data is converted, and the text file associated with the time information indicating at what time in the video file the text file is displayed simultaneously with the video file. A featured video playback device.

7. The video reproduction apparatus according to claim 6, wherein a part of the text file corresponding to the content currently spoken in the moving image recorded in the video file is highlighted.

8. The video reproduction apparatus according to claim 6, wherein the video file is reproduced from a portion corresponding to a portion selected by the user in the displayed text file.

In addition to displaying multiple title names to categorize the contents of the text file,
9. The video reproduction apparatus according to claim 6, wherein the video file is reproduced from a portion corresponding to the title name selected by the user among the plurality of title names.

In addition to displaying a plurality of thumbnail images corresponding to the location where the scene changes in the video file,
10. The video reproduction apparatus according to claim 6, wherein the video file is reproduced from a portion corresponding to the thumbnail image selected by the user among the plurality of thumbnail images.

11. The link to data of material distributed in a moving image recorded in the video file is displayed in association with at least one of the text file or the video file. The video reproduction apparatus according to item 1.

A file database in which a text file in which audio data of a video file in which moving image data and audio data are recorded is converted, and a video file playback file for playing back the video file is stored.
The video file playback file associates the text file with the video file based on time information indicating at which time the text file is in the video file.
A file database, wherein an external device acquires the video file playback file so that the text file is associated with the video file and played on the external device.

A file delivery server for delivering the video file playback file stored in the file database according to claim 12 to an external device.

A video editing method for editing a video file recording moving image data and audio data,
A first step of recognizing the voice data and converting it into a text file;
A video editing method comprising: a second step of associating the text file with the video file based on time information indicating at what time the text file is in the video file.

A video editing program for causing a computer to execute each step of the video editing method according to claim 14.

A computer-readable recording medium on which the video editing program according to claim 15 is recorded.

A video playback method for playing back a video file recording moving image data and audio data,
A text file in which the audio data is converted, and the text file associated with the time information indicating at what time in the video file the text file is displayed simultaneously with the video file. Characteristic video playback method.

A video playback program for causing a computer to execute the video playback method according to claim 17.

A computer-readable recording medium on which the video reproduction program according to claim 18 is recorded.