summary of the invention
The invention provides a kind of video data search method and device thereof and system, in order to realize the quick-searching of video data based on video pictures feature.
Video data search method provided by the invention, be applied to the video monitoring system with two-stage index structure, the method comprises:
If find by intellectual analysis the pictorial feature occurring needing mark in frame of video, then in the one-level index and secondary index of this frame of video, be provided with the mark of this pictorial feature; Wherein, the mark of described pictorial feature comprises overall video and analyzes code word and GOP video analysis code word, the pictorial feature arranged in one-level index be designated overall video analyze code word, the pictorial feature arranged in secondary index be designated GOP video analysis code word;
After receiving the video data retrieval request of carrying the target picture signature identification information of asking the video data retrieved to some extent, inquiry institute asks the one-level index of video data of retrieval, judge in this one-level index, whether to be provided with described target picture signature identification information, and when being judged as YES, according to the secondary index of all correspondences of this one-level search index, and therefrom find the secondary index being provided with described target picture signature identification information; Determine the memory address of corresponding picture group GOP according to the secondary index being provided with described target picture signature identification information found, and return video data result for retrieval according to the memory address of the GOP determined.
In said method, video data result for retrieval is returned according to the memory address of the GOP determined, comprise: the video data sequences formed for each continuous print GOP determined, respectively to front and extend a GOP backward, the video data sequences that forms of all continuous GOP after extending process is returned to as video frequency searching result the broadcasting that player carries out corresponding video data.
In said method, in secondary index, also comprise meta data block address information, described meta data block is for storing the descriptor of the pictorial feature of corresponding GOP;
After finding the secondary index being provided with target picture signature identification information, also comprise: find corresponding meta data block according to the meta data block address information in the secondary index found, and read pictorial feature descriptor wherein;
Return video data result for retrieval according to the memory address of the GOP determined, be specially: the memory address according to the GOP determined plays corresponding frame of video, and shows corresponding pictorial feature descriptor when playing frame of video.
In said method, in one-level index and secondary index, the process of setting screen signature identification information, comprising: before coding video frequency data, or after decoding to the video data after coding, detects frame of video; If detect that the pictorial feature of mark appears needing in described frame of video picture, then the identification information of this pictorial feature is set in the one-level index and secondary index of described frame of video.
In said method, pictorial feature identification information is arranged at the head of one-level index and secondary index.
In said method, described pictorial feature identification information comprises one of following or combination in any:
Object of which movement classification logotype, for identifying the attribute containing moving object and this moving object in frame of video picture;
Picture quality classification logotype, for identifying video frame images loss situation or image displaying quality;
Sound alarm classification logotype, has sound alarm and sound alarm type for identifying in frame of video picture.
Server provided by the invention, is applied to the video monitoring system with two-stage index structure, and wherein, described server comprises video frequency searching processing unit, and described video frequency searching processing unit comprises:
Interface module, for receiving video data retrieval request, and returns video data result for retrieval;
Determination module, for according to the video data retrieval request that receives, determines that the institute of carrying in this request asks the target picture signature identification information of the video data of retrieval;
Retrieval module, for inquire about ask the one-level index of video data retrieved, judge in this one-level index, whether to be provided with described target picture signature identification information, and when being judged as YES, according to the secondary index of all correspondences of this one-level search index, therefrom find the secondary index being provided with described target picture signature identification information; And, determine the memory address of corresponding picture group GOP according to the secondary index being provided with described target picture signature identification information found, according to the memory address generating video data research result according to the GOP determined; Wherein, the mark of the pictorial feature in described one-level index and secondary index finds to occur in frame of video arranging when needing the pictorial feature of mark by intellectual analysis, the mark of described pictorial feature comprises overall video and analyzes code word and GOP video analysis code word, the pictorial feature arranged in one-level index be designated overall video analyze code word, the pictorial feature arranged in secondary index be designated GOP video analysis code word.
In above-mentioned server, described retrieval module specifically for, for the video data sequences that each continuous print GOP determined forms, respectively to front and extend a GOP backward, the video data sequences that forms of all continuous GOP after extending process is returned to by described interface module the broadcasting that player carries out corresponding video data as video frequency searching result.
In above-mentioned server, in secondary index, also comprise meta data block address information, described meta data block is for storing the descriptor of the pictorial feature of corresponding GOP;
Described retrieval module also for, after finding the secondary index being provided with target picture signature identification information, corresponding meta data block is found according to the meta data block address information in the secondary index found, and the pictorial feature descriptor read wherein, and this pictorial feature descriptor is returned as result for retrieval.
In above-mentioned server, also comprise:
Demoder, for decoding to the video data after coding;
Image analysis module, for detecting decoded frame of video, and when detecting that the pictorial feature needing mark appears in described frame of video picture, arranges the identification information of this pictorial feature in the one-level index and secondary index of described frame of video.
Video monitoring system provided by the invention, has two-stage index structure, comprising:
Scrambler, for detecting this frame of video before to coding video frames, and when being found by intellectual analysis to occur the pictorial feature needing mark in this frame of video, the identification information of this pictorial feature is set in the one-level index and secondary index of this frame of video, wherein, the mark of described pictorial feature comprises overall video and analyzes code word and GOP video analysis code word, the pictorial feature arranged in one-level index be designated overall video analyze code word, the pictorial feature arranged in secondary index be designated GOP video analysis code word;
Server, for receiving video data retrieval request, wherein carries the target picture signature identification information of asking the video data retrieved to some extent; Inquiry institute asks the one-level index of video data of retrieval, judge in this one-level index, whether to be provided with described target picture signature identification information, and when being judged as YES, according to the secondary index of all correspondences of this one-level search index, and therefrom find the secondary index being provided with described target picture signature identification information; And, determine the memory address of corresponding picture group GOP according to the secondary index being provided with described target picture signature identification information found, and return video data result for retrieval according to the memory address of the GOP determined.
In said system, described scrambler specifically for, frame of video is detected, if detect described frame of video picture occur need mark pictorial feature, then the identification information of this pictorial feature is set in the one-level index and secondary index of described frame of video.
In said system, described server comprises:
Interface module, for receiving video data retrieval request, and returns video data result for retrieval;
Determination module, for according to the video data retrieval request that receives, determines that the institute of carrying in this request asks the target picture signature identification information of the video data of retrieval;
Retrieval module, for inquire about ask the one-level index of video data retrieved, judge in this one-level index, whether to be provided with described target picture signature identification information, and when being judged as YES, according to the secondary index of all correspondences of this one-level search index, therefrom find the secondary index being provided with described target picture signature identification information; And, determine the memory address of corresponding picture group GOP according to the secondary index being provided with described target picture signature identification information found, according to the memory address generating video data research result according to the GOP determined.
In said system, described retrieval module specifically for, for the video data sequences that each continuous print GOP determined forms, respectively to front and extend a GOP backward, the video data sequences that forms of all continuous GOP after extending process is returned to by described interface module the broadcasting that player carries out corresponding video data as video frequency searching result.
In said system, in secondary index, also comprise meta data block address information, described meta data block is for storing the descriptor of the pictorial feature of corresponding GOP;
Described retrieval module also for, after finding the secondary index being provided with target picture signature identification information, corresponding meta data block is found according to the meta data block address information in the secondary index found, and the pictorial feature descriptor read wherein, and this pictorial feature descriptor is returned as result for retrieval.
Advantageous Effects of the present invention comprises:
When there is the pictorial feature needing mark in frame of video, by being provided with the mark of this pictorial feature in the one-level index and secondary index of this frame of video, so that when carrying out video data retrieval, can using the pictorial feature of video data as one of search condition, retrieve the video data including respective picture feature according to one-level index and secondary index, achieve the quick-searching of the video data based on pictorial feature.
Embodiment
Because existing scheme does not consider the application of video analysis, namely do not comprise the metadata store that video analysis produces, only as search condition, quick-searching is carried out to media data by temporal information, thus cannot realize the quick indexing to video content.
For the problems referred to above of prior art, the embodiment of the present invention utilizes the video content Intellectual Analysis Technology based on video detection technology, to meet video monitoring customer demand.Consider the application of video content intellectual analysis result in data store organisation and Index process, the embodiment of the present invention is expanded existing video data index structure, the metadata that video analysis produces is added in one-level index and secondary index, accordingly, when carrying out data retrieval, the metadata that video analysis can be used to produce, as index condition, carries out quick-searching to media data, and only carries out great-jump-forward playback to the media data retrieved.
Below in conjunction with accompanying drawing, the embodiment of the present invention is described in detail.
As shown in Figure 2, the two-stage index structure of the embodiment of the present invention to existing video data is improved, for the frame of video having occurred the scene that needs mark or event in picture, in corresponding one-level index and secondary index, add the identification field of pictorial feature, this pictorial feature detects by video content Intellectual Analysis Technology.Further, GOP (the Group ofPictures of the scene that needs mark or event has been there is in picture, picture group, GOP is the block of video data that secondary index identifies) secondary index in add the index information (as address and length) of meta data block, to index corresponding meta data block (as the meta data block 1 in Fig. 2, meta data block 2...... meta data block N), record scene or the event description information of corresponding GOP video pictures in this meta data block.
Based on above-mentioned two-stage index framework, the flag by one-level index metadata is retrieved secondary index, and then goes out concrete content metadata (meta data block as in Fig. 2) by secondary index precise search.Secondary index head is fixed, and can realize quick position.The definition of two-stage index head construction body can as shown in Fig. 3 and Fig. 4.
See Fig. 3, the schematic diagram of the one-level index head construction of the video data provided for the embodiment of the present invention.As shown in the figure, the one-level index head of a video channel, mainly comprise the meta-data identifier of (video sequence in the time period of a data channel is stored as a video data file usually) in the time period of this video channel, comprising version number, (this byte number does not comprise the byte number that version number takies to one-level index head byte number, area of bed corresponding in figure), video channel ID (ID in corresponding diagram), the video sequence beginning and ending time in a period of time (start time in corresponding diagram and end time) of this video channel, first secondary index regional address, the pictorial feature identifier (analyzing code word corresponding to the overall video in figure) of the video sequence of this video channel.Wherein:
Version number: 2 bytes, first character joint represents major version number, second byte representation minor version number.
Area of bed: 2 bytes, value is 2 bytes that whole one-level index head byte number deducts version number and takies.
Start time/end time: 8 bytes, corresponding whole storage space (being generally one piece of hard disk) video sequence start time/end time.Time encoding can as shown in figure 3-1, wherein, before two bytes be that year number deducts 2000 numerals obtained, such as 2012 is 2012-2000=12; Then the Digital ID month of a byte, scope is [1,12]; Then a byte-identifier date is digital again, and scope is [1,31]; 4 bytes are below moment coding, and the chronomere in this moment can be System Clock Reference frequency 90KHz, gets the highest 32 bits.
Such as, the moment encodes and can be expressed as:
TS(i)=(system_clock_frequency×t(i))%2
32
Wherein, TS (i) represents that the moment encodes; System_clock_frequency is system clock frequency, 90KHz; T (i) is the actual moment, in units of millisecond.
First secondary index regional address: 8 bytes, value is the first address of first secondary index district at storage space, measure-alike due to each second-level index block, therefore the address of follow-up all second-level index blocks of this video channel can be obtained according to the first address of second-level index block.
Overall video analyzes code word: 8 bytes, and value is the video analysis result queue of the video sequence of this video channel, i.e. the pictorial feature mark of video sequence.
See Fig. 4, the schematic diagram of the secondary index head construction of the video data provided for the embodiment of the present invention.
2-level search head mainly comprises the metadata produced in time period corresponding to GOP and (wherein records video analysis information, descriptor as pictured scene or event), comprising version number, secondary index head takies byte number, and (this byte number does not comprise the byte number that version number occupies, area of bed as in figure), the initial time of GOP, the address (the meta data block address in corresponding diagram) of GOP corresponding element data space, GOP length, frame per second, resolution, the byte number (the block size in corresponding diagram) of the metadata of GOP, the pictorial feature identifier (the video analysis code word in corresponding diagram) of GOP.Wherein:
Version number: 2 bytes, first character joint represents major version number, second byte representation width version number;
Secondary index area of bed: 2 bytes, value is 2 bytes that whole secondary index header byte number deducts version number and takies;
4 bytes: retain;
Initial time: 8 bytes, value is the first frame acquisition time of GOP, and in units of millisecond, timing definition is with the beginning and ending time definition in one-level index;
Meta data block address: 8 bytes, value is the start address of metadata in stored memory of GOP video sequence;
GOP size: 2 bytes, value is GOP number of video frames;
Frame per second: 2 bytes, value is coding frame per second and the video analysis frame per second of GOP, and wherein, high byte is coding frame per second, and low byte is video analysis frame per second;
Resolution: 2 bytes, value is the resolution of video analysis image, and wherein, high byte is that pixel is wide, and low byte is that pixel is high;
Block size: 2 bytes, value is the byte number that the meta data block of GOP takies;
GOP video analysis code word: 8 bytes, for identifying the pictorial feature of GOP video sequence, the overall video defined in same one-level index analyzes code word.
The pictorial feature of the video sequence in above-mentioned one-level index and secondary index can define as required, such as can according to frame of video image content, as comprise moving object, the motion state of moving object, the attribute of moving object, define; Also according to frame of video image quality, as whether having video-losing, camera whether to be blocked, can define; Whether can also define containing sound alarm according in frame of video.
Pictorial feature mark (i.e. video analysis code word) that the embodiment of the present invention provides is defined as follows:
In 8 bytes of video analysis code word, high 4 bytes are carry out large category division to video analysis result, and low 4 bytes are segmented each classification, as shown in figure 3-2.Concrete:
High 4 bytes may be defined as:
0x00000001: object of which movement class;
0x00000002: picture quality class;
0x00000003: sound alarm class;
Other numerical value retains, and is convenient to expansion in the future.
The large class that each high 4 byte is corresponding, its low 4 bytes define the more fine-grained type under corresponding large class, concrete:
When high 4 bytes are object of which movement class (0x00000001), low 4 bytes are defined as:
Byte 3 is 0x01: represent containing moving target, after two bytes definition moving object attributes, wherein:
Byte 2 is 0x01: behave containing moving object;
Byte 2 is 0x02: be car containing moving object.
Byte 3 is 0x02: represent containing abnormal behaviour, after two bytes definition abnormal behaviour attributes, wherein:
Byte 2 is 0x01: containing legacy;
Byte 2 is 0x02: containing removing objects;
Byte 2 is 0x04: containing Wander behavior;
Byte 2 is 0x08: containing the behavior of fighting;
Byte 2 is 0x10: containing the behavior of running;
Byte 2 is 0x20: containing the excessive behavior of density;
Byte 2 is 0x40: containing the excessive behavior of flow.
When high 4 bytes are picture quality class (0x00000002), low 4 bytes are defined as:
0x00000001: image/video is lost;
0x00000002: camera is blocked;
0x00000004: image exists snowflake interference;
0x00000008: image exists colour cast interference;
0x00000010: brightness of image is excessively dark;
0x00000020: brightness of image is excessively bright;
0x00000040: image frame is shaken;
0x00000080: image frame freezes.
High 4 bytes are sound alarm class (0x00000004), and low 4 bytes are defined as:
0x00000001: containing high-frequency sound alarm;
0x00000002: containing treble sounds alarm;
Other undefined fields are reserved field, are convenient to expansion in the future.
Pictorial feature mark (i.e. video analysis code word) in above-mentioned one-level index and secondary index, is arranged by the video analysis function newly increased in video monitoring system.
Video analysis function can be integrated in head end encoder, such Dui mono-road video carries out (before the coding) in the process of encoding can carry out video analysis, and according to analysis result, corresponding video analysis code word is set in corresponding one-level index and secondary index, and corresponding meta data block is set.Video analysis function also can be carried out in rear end (as by carrying out in the server of videoconference client in video monitoring system), namely after decoding to the video data after coding in rear end, video analysis is carried out again, and according to analysis result, corresponding video analysis code word is set in corresponding one-level index and secondary index, and corresponding meta data block is set.Wherein, existing various video detecting method can be adopted to analyze video image, if according to the predefined pictorial feature needing mark, detect that the picture of current video frame has individual features, then the video analysis code word corresponding with this pictorial feature is set in the one-level index corresponding to current video frame and secondary index.
No matter which kind of set-up mode, object is all obtain the video analysis metadata with video data time synchronized, and fills in video analysis code word according to afore mentioned rules form.Such as, according to the definition of above pictorial feature, be integrated into the analysis module of front segment scrambler when carrying out video analysis, detect at a time (as during 5 days 22 May in 2010 30 points 6.350 seconds) have the behavior that people fights in video pictures, then high 4 bytes and 0x00000001 in the video analysis code word of the secondary index head of current time and one-level index head are carried out inclusive-OR operation by bit, low 4 bytes and 0x02040000 are carried out inclusive-OR operation by bit, thus video analysis code word is set to the pictorial feature mark corresponding with the behavior of fighting.
In video analysis process, no matter which frame of the GOP inside that secondary index is corresponding contains the pictorial feature that needs mark, and carries out respective markers always in this GOP video analysis code word.Equally, when the pictorial feature that in all video sequences that one-level index comprises, no matter which section GOP appearance needs mark, analyze in code word at the overall video of one-level index always and carry out respective markers.
If the different GOP video sequences of a video file have different pictorial feature, then corresponding video analysis code word is set in the secondary index of corresponding GOP; Accordingly, in the one-level index of this video file, its overall video analysis code word comprises all video analysis code words in secondary index.
If do not need the pictorial feature marked in the GOP video sequence that secondary index is corresponding, then the video analysis code word of this secondary index is without the need to carrying out respective markers.Equally, if all do not need the pictorial feature marked in all GOP video sequences of a video file, then the overall video in corresponding one-level index analyzes code word without the need to carrying out respective markers.
One-level index after improving based on pictorial feature one-level defined above and level indexing structure, the video data retrieval flow that the embodiment of the present invention provides, can as shown in Figure 5, comprise:
Step 501, user sends video data retrieval request by place monitor client, wherein carries the pictorial feature information of request retrieval, as pictorial feature mark.
In this step, user submits to the information such as video channel ID, beginning and ending time, pictorial feature as search condition by place monitor client, the server to video monitoring system initiates video data retrieval and the request of playback.
Step 502, after the server of video monitoring system receives this video data retrieval request, according to information inquiries such as the video channel ID wherein carried, initial times to corresponding one-level index, if include the pictorial feature mark of carrying in retrieval request in this one-level index, then perform step 503, otherwise perform step 505.
In this step, server is after the one-level index of video metadata corresponding to the whole video data storage space retrieving corresponding video passage, if judge, this one-level index includes the video analysis code word of the pictorial feature mark correspondence that user submits to, then represent certain section or the video sequence of a few sections of GOP video sequences for asking that this one-level index is corresponding, in this case, step 503 is proceeded to retrieve the video sequence that will ask further.
Step 503, the server of video monitoring system is according to this one-level search index secondary index, and from all secondary indexs inquired, find out the secondary index including this pictorial feature mark, and determine the corresponding address of GOP video sequence and the address of meta data block according to these secondary indexs.
In this step, the GOP video analysis code word of server to all secondary index heads corresponding to this one-level index travels through, and because secondary index head is etc. large, secondary index traversal can complete very soon.If the video analysis code word of secondary index head contains respective picture signature identification, then according to the initial time of these secondary index heads, time on the video playback time fences of correspondence marks, and according to the meta data block index information (as meta data block address and block size) of these secondary index heads, from corresponding meta data block, extract scene or the event description information of video pictures.After retrieving the secondary index including respective picture signature identification, just can determine the memory address of corresponding video sequence according to these secondary indexs.
Step 504, the server of video monitoring system, according to the address of the video sequence retrieved, plays the frame of video that these include specific pictured scene or event.
In this step, the video segment retrieved can send to player to carry out jump and play by server, and the scene of video pictures that can mark in video pictures further in the meta data block corresponding with played video sequence or the descriptor of event, the result as video analysis shows.Retrieval is carried out to video data and playback time, if the video sequence retrieved is multiple discontinuous video segments, then only great-jump-forward broadcasting is carried out to the video segment retrieved.If the darker regions in Fig. 6 is the video segment retrieved, only carry out jump to these dark fragments and play, after namely playing previous video fragment, the broadcast point jumping to next video segment is at once play.
For improving the continuity of video, further, the beginning broadcast point of the video sequence of each video segment is previous GOP, and end point will extend a GOP.As shown in Figure 6, the video sequence retrieved be a kth GOP to m GOP, but this section of video sequence will be play from kth-1 GOP, is played to m+1 GOP always.
Step 505, the server of video monitoring system returns the failed response of retrieval.
In this step, owing to failing to retrieve the video data (namely failing to retrieve the video data including the pictorial feature that user asks) meeting search condition, therefore the server of video monitoring system returns the failed response of retrieval, also can return the reason of failure further.
With an instantiation, above-mentioned flow process is described in detail below.
Such as, when behavior of whether fighting in the video record needing to retrieve a certain passage occurs, first the video analysis code word in the one-level index of this PASS VIDEO metadata storage space is found, high 4 bytes of this video analysis code word and 0x00000001 are carried out by bit AND operation, if result is 1, low 4 bytes of this video analysis code word and 0x02040000 are carried out by bit AND operation, if result is also 1, then show to fight in corresponding video sequence the video of behavior picture, namely represent in time period that this one-level index is corresponding (such as during 1 day 0 March in 2010 during 0 point of 0.000 second to 2010 on March 31,23 59 points and 59.999 seconds) containing the event of fighting, according to this one-level index, all secondary index heads are traveled through, video analysis codeword detection mode is with the detection mode in one-level index, if testing result is 1, to represent in the corresponding time period (as during 2 days 8 March in 2010 during 10 points of 5.000 seconds to 2010 on March 2,8 10 points 5.999 seconds) containing the behavior of fighting.After having searched this storage space, other storage spaces (also containing corresponding two-stage index) continued this passage carry out same retrieval, the video clip that behavior of the fighting event that so just can obtain all video records in a certain passage storage matrix occurs.
Carry out in the above embodiment of the present invention, on the basis of improving, can another embodiment of the present invention being obtained.In this embodiment, after video data is analyzed, if find the scene or the event that include needs mark in frame of video picture, only video analysis code word can be set in the secondary index of corresponding GOP and in corresponding one-level index, and the meta data block that need not arrange for describing pictured scene or event, thus also just not be used in secondary index the address that meta data block is set and length.When carrying out retrieval playback to video data, substantially carry out according to previous embodiment, unlike, in the present embodiment owing to not arranging the meta data block for describing pictured scene or event, and in secondary index, there is not the index information of this meta data block, therefore, after retrieving at the pictorial feature provided according to user the video data including respective picture feature, this video data is carried out playback, and the not scene of additional video picture or the descriptor of event.
By describing above and can finding out, introduce pictorial feature mark in the two-stage index of video data, wherein, different pictorial feature marks is used for identifying different video pictures scenes or event; When carrying out video playback, according to pictorial feature mark, going out scene of interest or event from long video record content fast searching, and playing fast, improve the retrieval of video content interested and the efficiency of playback.
Based on identical technical conceive, the embodiment of the present invention additionally provides server in a kind of video monitoring system and a kind of video monitoring system.
As shown in Figure 7, the structural representation of the server in the video monitoring system provided for the embodiment of the present invention.This server is applied to the video monitoring system with two-stage index structure, wherein, needs the pictorial feature of mark, then in the one-level index and secondary index of this frame of video, be provided with the mark of this pictorial feature if occur in frame of video.This server comprises video frequency searching processing unit 71, and video frequency searching processing unit 71 can comprise:
Interface module 711, for receiving video data retrieval request, and returns video data result for retrieval;
Determination module 712, for according to the video data retrieval request that receives, determines that the institute of carrying in this request asks the target picture signature identification information of the video data of retrieval;
Retrieval module 713, for inquire about ask the one-level index of video data retrieved, judge in this one-level index, whether to be provided with described target picture signature identification information, and when being judged as YES, according to the secondary index of all correspondences of this one-level search index, therefrom find the secondary index being provided with described target picture signature identification information; And, determine the memory address of corresponding picture group GOP according to the secondary index being provided with described target picture signature identification information found, according to the memory address generating video data research result according to the GOP determined.
In above-mentioned server, the video data sequences that retrieval module 713 can form for each continuous print GOP determined, respectively to front and extend a GOP backward, the video data sequences that forms of all continuous GOP after extending process is returned to by interface module 711 broadcasting that player carries out corresponding video data as video frequency searching result.
In secondary index, also comprise meta data block address information, described meta data block is for storing the descriptor of the pictorial feature of corresponding GOP; Accordingly, retrieval module 713 can after finding the secondary index being provided with target picture signature identification information, corresponding meta data block is found according to the meta data block address information in the secondary index found, and the pictorial feature descriptor read wherein, and this pictorial feature descriptor is returned as result for retrieval.
In above-mentioned server, also can comprise: demoder 72 and image analysis module 73.Wherein, demoder 72 is for decoding to the video data after coding; Image analysis module 73, for detecting decoded frame of video, and when detecting that the pictorial feature needing mark appears in described frame of video picture, arranges the identification information of this pictorial feature in the one-level index and secondary index of described frame of video.Concrete, pictorial feature identification information can be arranged at the head of one-level index and secondary index.
See Fig. 8, be the video monitoring system that the embodiment of the present invention provides, there is two-stage index structure, can be applicable to above-mentioned flow process.As shown in the figure, this system can comprise: the scrambler 81 that can be connected with one or more video cameras 80, server 82, one or more monitor clients 83 etc.Wherein:
Scrambler 81, for detecting this frame of video before to coding video frames, and when there is the pictorial feature needing mark in this frame of video, arranges the identification information of this pictorial feature in the one-level index and secondary index of this frame of video;
Server 82, for receiving the video data retrieval request that monitor client 83 sends, wherein carries the target picture signature identification information of asking the video data retrieved to some extent; Inquiry institute asks the one-level index of video data of retrieval, judge in this one-level index, whether to be provided with described target picture signature identification information, and when being judged as YES, according to the secondary index of all correspondences of this one-level search index, and therefrom find the secondary index being provided with described target picture signature identification information; And, determine the memory address of corresponding picture group GOP according to the secondary index being provided with described target picture signature identification information found, and return video data result for retrieval according to the memory address of the GOP determined.
In said system, scrambler 81 can detect frame of video, if detect that the pictorial feature of mark appears needing in described frame of video picture, then in the one-level index and secondary index of described frame of video, arranges the identification information of this pictorial feature.Concrete, pictorial feature identification information can be arranged at the head of one-level index and secondary index.
In said system, server 82 can comprise:
Interface module 821, for receiving video data retrieval request, and returns video data result for retrieval;
Determination module 822, for according to the video data retrieval request that receives, determines that the institute of carrying in this request asks the target picture signature identification information of the video data of retrieval;
Retrieval module 823, for inquire about ask the one-level index of video data retrieved, judge in this one-level index, whether to be provided with described target picture signature identification information, and when being judged as YES, according to the secondary index of all correspondences of this one-level search index, therefrom find the secondary index being provided with described target picture signature identification information; And, determine the memory address of corresponding picture group GOP according to the secondary index being provided with described target picture signature identification information found, according to the memory address generating video data research result according to the GOP determined.
Wherein, the video data sequences that retrieval module 823 can form for each continuous print GOP determined, respectively to front and extend a GOP backward, the video data sequences that forms of all continuous GOP after extending process is returned to by interface module 821 broadcasting that player carries out corresponding video data as video frequency searching result.
Wherein, in secondary index, also comprise meta data block address information, described meta data block is for storing the descriptor of the pictorial feature of corresponding GOP.Accordingly, retrieval module 823 also can after finding the secondary index being provided with target picture signature identification information, corresponding meta data block is found according to the meta data block address information in the secondary index found, and the pictorial feature descriptor read wherein, and this pictorial feature descriptor is returned as result for retrieval.
Through the above description of the embodiments, those skilled in the art can be well understood to the mode that the present invention can add required general hardware platform by software and realize, and can certainly pass through hardware, but in a lot of situation, the former is better embodiment.Based on such understanding, technical scheme of the present invention can embody with the form of software product the part that prior art contributes in essence in other words, this computer software product is stored in a storage medium, comprising some instructions in order to make a station terminal equipment (can be mobile phone, personal computer, server, or the network equipment etc.) perform method described in each embodiment of the present invention.
The above is only the preferred embodiment of the present invention; it should be pointed out that for those skilled in the art, under the premise without departing from the principles of the invention; can also make some improvements and modifications, these improvements and modifications also should look protection scope of the present invention.