JP2007300557A

JP2007300557A - Image encoding device and image encoding method

Info

Publication number: JP2007300557A
Application number: JP2006128627A
Authority: JP
Inventors: Fumitaka Nakayama; 文貴中山
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2006-05-02
Filing date: 2006-05-02
Publication date: 2007-11-15

Abstract

<P>PROBLEM TO BE SOLVED: To set a code amount corresponding to human sight characteristics. <P>SOLUTION: An image encoding device comprises a target code amount determining means for determining a target code amount of an image to be encoded, a calculation means for calculating a block distortion degree using a decoded image signal outputted from a decoding means, and a target code amount control means for judging a magnitude of the block distortion degree calculated by the calculation means and controlling the target code amount in accordance with the magnitude of the block distortion degree. When it is judged that the block distortion degree is great, the target code amount of the image to be encoded is increased and when it is judged that the block distortion degree is small, on the other hand, the target code amount of the image to be encoded is decreased, so that the code amount is set while taking into account quantitative deterioration conditions of image quality or human sight characteristics. <P>COPYRIGHT: (C)2008,JPO&INPIT

Description

本発明は画像符号化装置及び画像符号化方法に関し、特に、符号化手段で符号化された符号化データを局所的に復号する復号化手段を有する画像符号化装置に係り、人間の視覚特性を考慮した符号量制御を行うために用いて好適な技術に関する。 The present invention relates to an image encoding apparatus and an image encoding method, and more particularly to an image encoding apparatus having a decoding unit that locally decodes encoded data encoded by an encoding unit. The present invention relates to a technique suitable for use in performing code amount control in consideration.

従来、マルティメディアの発展に伴い様々な動画像圧縮符号化方式が提案されている。その代表的なものに、ＭＰＥＧ−１，２，４やＨ．２６Ｌといったものがある。これらの圧縮符号化の処理は、動画像に含まれる原画像（画像）をブロックと呼ばれる所定の領域に分割し、この分割したブロックを単位にして動き補償予測やＤＣＴ変換処理を施すものである。 Conventionally, various moving image compression encoding systems have been proposed with the development of multimedia. Typical examples are MPEG-1, 2, 4 and H.264. There is something like 26L. In these compression encoding processes, an original image (image) included in a moving image is divided into predetermined regions called blocks, and motion compensation prediction and DCT conversion processing are performed in units of the divided blocks. .

また、動き補償予測を行う場合、既に符号化済みの画像データを局所復号化して得られた画像を参照画像としていることで、符号化を行う際にも復号化処理が必要となる。
また、ＭＰＥＧ方式に準拠して画像の圧縮、符号化を行う場合、その符号量は、画像自体の特性である空間周波数特性やシーン及び量子化スケール値に応じて大きく異なる場合が多い。このような符号化特性を有する符号化装置を実現する上で良好な画質の復号画像を得ることができるようにするための重要な技術が符号量制御である。 In addition, when performing motion compensation prediction, an image obtained by local decoding of already encoded image data is used as a reference image, so that a decoding process is required even when encoding is performed.
In addition, when compressing and encoding an image in accordance with the MPEG system, the amount of code often varies greatly depending on the spatial frequency characteristics, which are characteristics of the image itself, the scene, and the quantization scale value. An important technique for realizing a decoded image with good image quality in realizing an encoding apparatus having such encoding characteristics is code amount control.

この符号量制御アルゴリズムの１つとして、ＴＭ５（Test Model 5）が一般的に使用されている。このＴＭ５による符号量制御アルゴリズムは、以下に説明する３つのステップから構成され、ＧＯＰ（Group Of Picture）毎にビットレートが一定になるように以下に挙げる３ステップで符号量が制御される。 TM5 (Test Model 5) is generally used as one of the code amount control algorithms. The code amount control algorithm according to TM5 is composed of the following three steps, and the code amount is controlled in the following three steps so that the bit rate is constant for each GOP (Group Of Picture).

第１ステップ（ＳＴＥＰ１）
今から符号化を行うピクチャの目標符号量を決定する。現在のＧＯＰにおいて利用可能な符号量であるＲgopが以下の（１）式により演算される。
Ｒgop = (ni+np+nb)*(bits＿rate/picture＿rate) ・・・（１）式
ここで、ni,np,nbはそれぞれI、P、Bピクチャの現GOPにおける残りのピクチャ数であり、bits＿rateは目標ビットレート、picture＿rateはピクチャレートを表す。 First step (STEP1)
The target code amount of the picture to be encoded from now is determined. Rgop, which is a code amount that can be used in the current GOP, is calculated by the following equation (1).
Rgop = (ni + np + nb) * (bits_rate / picture_rate) (1) where ni, np, nb are the number of remaining pictures in the current GOP of I, P, and B pictures, respectively, bits_rate Represents the target bit rate, and picture_rate represents the picture rate.

更に、I、P、Bピクチャ毎に符号化結果からピクチャの複雑度を以下の（２―１）〜（２−３）式で求めている。
Ｘi = Ｒi*Ｑi （２−１）式
Ｘp = Ｒp*Ｑp ・・・（２−２）式
Ｘb = Ｒb*Ｑb （２−３）式 Further, the complexity of the picture is obtained from the encoding result for each of the I, P, and B pictures by the following equations (2-1) to (2-3).
Xi = Ri * Qi (2-1) Formula Xp = Rp * Qp (2-2) Formula Xb = Rb * Qb (2-3) Formula

ここで、Ｘi、Ｘｐ、Ｘｂはコンプレキシティ（Complexity）とも呼ばれ、Ｒi、Ｒp及びＲbはそれぞれＩ、Ｐ、Ｂピクチャを符号化した結果得られる符号量である。Ｑi、Ｑp及びＱbはそれぞれI、P、Bピクチャ内のすべてのマクロブロックにおけるＱスケールの平均値である。式(1)及び（２―１）〜（２−３）式から、I、P、Bピクチャそれぞれについての目標符号量Ｔi、Ｔp及びＴbは、以下の（３―１）〜（３−３）式で求めることができる。 Here, Xi, Xp, and Xb are also called complexity, and Ri, Rp, and Rb are code amounts obtained as a result of encoding I, P, and B pictures, respectively. Qi, Qp and Qb are average values of the Q scale in all macroblocks in the I, P and B pictures, respectively. From the equations (1) and (2-1) to (2-3), the target code amounts Ti, Tp, and Tb for the I, P, and B pictures are expressed by the following (3-1) to (3-3). ).

Ｔi= max{(Ｒgop/(1+ ((Ｎp*Ｘp)/(Ｘi*Ｋp)) + ((Ｎb*Ｘb)/(Ｘi*Ｋb)))) , (bit＿rate/(8*picture＿rate))}・・・・・（３―１）式
Ｔp=max{(Ｒgop/(Ｎp+ (Ｎb*Ｋp*Ｘb)/(Ｋb*Ｘp))) , (bit＿rate/(8*picture＿rate))}・・・・・・（３―２）式
Ｔb=max{(Ｒgop/(Ｎb+ (Ｎp*Ｋb*Ｘp)/(Ｋp*Ｘb))) , (bit＿rate/(8*picture＿rate))}・・・・（３―３）式
ただし、Ｎp及びＮbは現GOP内のそれぞれP及びBピクチャの残りの枚数、また定数Ｋp=1.0及びＫb=1.4である。 Ti = max {(Rgop / (1 + ((Np * Xp) / (Xi * Kp)) + ((Nb * Xb) / (Xi * Kb)))), (bit_rate / (8 * picture_rate))} (3-1) Formula Tp = max {(Rgop / (Np + (Nb * Kp * Xb) / (Kb * Xp))), (bit_rate / (8 * picture_rate))} (3-2) Equation Tb = max {(Rgop / (Nb + (Np * Kb * Xp) / (Kp * Xb))), (bit_rate / (8 * picture_rate))} (3- 3) where Np and Nb are the remaining number of P and B pictures in the current GOP, respectively, and constants Kp = 1.0 and Kb = 1.4.

第２ステップ（ＳＴＥＰ２）
P及びBピクチャ毎に３つの仮想バッファを使用し、前述した（３―１）〜（３−３）式で求めた目標符号量と発生符号量との差分を管理する。仮想バッファのデータ蓄積量をフィードバックし、そのデータ蓄積量に基づいて実際の発生符号量が目標符号量に近づくように、次にエンコードするマクロブロックについて、Ｑスケールの参照値が設定される。 Second step (STEP2)
Three virtual buffers are used for each P and B picture, and the difference between the target code amount and the generated code amount obtained by the above equations (3-1) to (3-3) is managed. The data accumulation amount of the virtual buffer is fed back, and the reference value of the Q scale is set for the macroblock to be encoded next so that the actual generated code amount approaches the target code amount based on the data accumulation amount.

例えば、現在のピクチャタイプがPピクチャの場合には、目標符号量と発生符号量との差分は、次の（４）式に従う演算処理により求めることができる。
ｄp,j = ｄp,0 + Ｂp,j-1−((Ｔp*(j-1))/ＭＢ＿cnt) ・・・（４）式 For example, when the current picture type is a P picture, the difference between the target code amount and the generated code amount can be obtained by arithmetic processing according to the following equation (4).
dp, j = dp, 0 + Bp, j−1 − ((Tp * (j−1)) / MB_cnt) (4)

ここで、添字jはピクチャ内のマクロブロックの番号であり、ｄp,0は仮想バッファの初期フルネスを示し、Ｂp，jはj番目のマクロブロックまでの総符号量、ＭＢ＿cntはピクチャ内のマクロブロック数を示す。 Here, the subscript j is the number of the macroblock in the picture, dp, 0 indicates the initial fullness of the virtual buffer, Bp, j is the total code amount up to the jth macroblock, and MB_cnt is the macroblock in the picture Indicates a number.

次に、dp,j(以後、「dj」と記載する。) を用いて、j番目のマクロブロックにおけるＱスケールの参照値を求めると、（５）式のようになる。
Ｑj = (dj*31) / r ・・・（５）式
ここで、r = 2*bits＿rate/picture＿rate ・・・（６）式
である。 Next, when the reference value of the Q scale in the j-th macroblock is obtained using dp, j (hereinafter referred to as “dj”), equation (5) is obtained.
Qj = (dj * 31) / r (5) where r = 2 * bits_rate / picture_rate (6).

第３ステップ（ＳＴＥＰ３）
視覚特性、すなわち、復号画像の画質が良好になるように、エンコード対象のマクロブロックの空間アクティビティに基づいて、量子化スケールを最終的に決定する処理を実行する。
ACTj = 1+ min(vblk1, vblk2,……,vblk8) ・・・（７）式 Third step (STEP3)
A process of finally determining the quantization scale is executed based on the spatial activity of the macroblock to be encoded so as to improve the visual characteristics, that is, the image quality of the decoded image.
ACTj = 1+ min (vblk1, vblk2, ..., vblk8) (7)

前記（７）式中において、vblk1〜vblk4はフレーム構造のマクロブロックにおける8ｘ8のサブブロックにおける空間アクティビティを示す。vblk5〜vblk8はフィールド構造のマクロブロックにおける8x8サブブロックの空間アクティビティを示す。ここで、空間アクティビチィの演算は次の（８）式、（９）式により求めることが可能である。
vblk = Σ(Ｐi−Ｐbar)² ・・・（８）式
Pbar = (1/64 )* ΣＰi ・・・（９）式 In the equation (7), vblk1 to vblk4 indicate spatial activities in 8 × 8 sub-blocks in the macroblock of the frame structure. vblk5 to vblk8 indicate 8x8 sub-block spatial activities in the field-structured macroblock. Here, the calculation of the space activity can be obtained by the following equations (8) and (9).
vblk = Σ (Pi−Pbar) ² (8)
Pbar = (1/64) * ΣPi (9)

ここで、Ｐiはｉ番目のマクロブロックにおける画素値であり、（８）式、（９）式中のΣはi=1〜64の演算である。次に（７）式で求めたACTjを以下の（１０）式によって正規化を行う。 Here, Pi is a pixel value in the i-th macroblock, and Σ in equations (8) and (9) is an operation of i = 1 to 64. Next, ACTj obtained by the equation (7) is normalized by the following equation (10).

N＿ACTj = (2*ACTj +AVG＿ACT)/ (ACTj + AVG＿ACT) ・・・（１０）式
ここで、AVG＿ACTは以前に符号化したピクチャにおけるACTjの参照値であり、最終的に量子化スケール（Ｑスケール値）MQUANTjは以下の（１１）式により求められる。
MQUANTj = Qj * N＿ACTj ・・・（１１）式 N_ACTj = (2 * ACTj + AVG_ACT) / (ACTj + AVG_ACT) (10) where AVG_ACT is the reference value of ACTj in the previously encoded picture, and finally the quantization scale (Q scale) Value) MQUANTj is obtained by the following equation (11).
MQUANTj = Qj * N_ACTj (11) formula

以上のTM5のアルゴリズムによれば、STEP1の処理によりIピクチャに対して多くの符号量を割り当てており、更にピクチャ内においては視覚的に劣化の目立ちやすい平坦部(空間アクティビティが低い)に符号量が多く配分されるようになる。 According to the above TM5 algorithm, a large amount of code is allocated to the I picture by the processing of STEP1, and furthermore, the amount of code is in a flat part (low spatial activity) that is visually noticeable in the picture. Will be distributed more.

このようなＴＭ５方式を応用した符号化方式として、画像信号と局所復号画像のＳＮ比がある一定の値となるように目標符号量を決定する方式が特許文献１に紹介されている。この手法はＳＮ比を一定に保つような目標符号量を設定することですべてのピクチャに対する画質を安定させる効果がある。 As a coding method applying the TM5 method, Patent Document 1 introduces a method for determining a target code amount so that an SN ratio between an image signal and a locally decoded image becomes a certain value. This method has an effect of stabilizing the image quality for all the pictures by setting a target code amount that keeps the SN ratio constant.

また、特許文献１を改良した手法で、Ｉピクチャ，Ｐピクチャ，Ｂピクチャ各ピクチャの符号量を最適な値に設定する方式が特許文献２に紹介されている。この手法は、ＩピクチャのＳＮ比がＢピクチャのＳＮ比より大きくなるように、各フレーム（Ｉピクチャ・Ｐピクチャ・Ｂピクチャ）の符号量を配分制御する。つまり、Ｉピクチャの符号化誤差が、Ｂピクチャの符号化誤差より、小さくなる様に、各フレーム（Ｉピクチャ・Ｐピクチャ・Ｂピクチャ）の符号量を制御することでＧＯＰの原点となるＩピクチャの画質を向上させる効果がある。 Further, Patent Document 2 introduces a method of setting the code amount of each picture of I picture, P picture, and B picture to an optimum value by a method improved from Patent Document 1. In this method, the code amount of each frame (I picture, P picture, B picture) is distributed and controlled so that the SN ratio of the I picture is larger than the SN ratio of the B picture. That is, the I picture that is the origin of the GOP by controlling the code amount of each frame (I picture, P picture, B picture) so that the coding error of the I picture becomes smaller than the coding error of the B picture. It has the effect of improving the image quality.

特開平０２−２１９３８８号公報Japanese Patent Laid-Open No. 02-219388 特開平０８−０７０４５８号公報Japanese Patent Laid-Open No. 08-070458

しかしながら、前述のＴＭ５のアルゴリズムにおいては次のような課題があった。すなわち、最終的なMQUANTjを得るための判断情報として、（４）式における目標符号量と符号量との差分以外、式（５）における前ピクチャの符号化結果のＱスケール参照値（Ｑj）と、ＳＴＥＰ３の処理における空間アクティビティ（ACTj ）のみを使用している。 However, the above-described TM5 algorithm has the following problems. That is, as determination information for obtaining the final MQUANTj, except for the difference between the target code amount and the code amount in equation (4), the Q scale reference value (Qj) of the previous picture encoding result in equation (5) and , Only the spatial activity (ACTj) in the processing of STEP3 is used.

このため、画質の定量的な劣化具合や人間の視覚特性が、ＴＭ５における符号量制御では十分考慮されておらず、符号化状況に応じ人間の視覚特性に合致した符号量制御を行うことが困難であった。 For this reason, the degree of quantitative deterioration of image quality and human visual characteristics are not sufficiently considered in the code amount control in TM5, and it is difficult to perform code amount control that matches human visual characteristics according to the coding situation. Met.

また、特許文献１ならびに特許文献２の方式においては、以下の課題が存在していた。すなわち、特許文献１では各ピクチャでＳＮ比を一定に保つことで一定の画質を保つことは可能である。また、特許文献２ではＳＮ比と各ピクチャの符号配分を考慮することで特許文献１と同様に一定の画質を保つことは可能である。 In addition, the methods described in Patent Document 1 and Patent Document 2 have the following problems. That is, in Patent Document 1, it is possible to maintain a constant image quality by keeping the SN ratio constant for each picture. Further, in Patent Document 2, it is possible to maintain a constant image quality as in Patent Document 1 by considering the SN ratio and code distribution of each picture.

しかしながら、これらの手法は何れもＳＮ比を目標符号量決定の判断情報としているため、画質の定量的な劣化具合や人間の視覚特性が十分に考慮されていない。つまり、ＳＮ比と画質の関係が必ずしも比例関係にならないことが考えられる。 However, since all of these methods use the SN ratio as the determination information for determining the target code amount, the degree of quantitative deterioration in image quality and human visual characteristics are not fully considered. That is, it is conceivable that the relationship between the SN ratio and the image quality is not necessarily a proportional relationship.

例えば、芝生や空のような小振幅信号で構成される画像は画質劣化を受けてもＳＮ比の大幅な低下は起こりにくいが、ブロックノイズのような視覚的に目立ちやすいノイズは発生しやすい。 For example, an image composed of small-amplitude signals such as lawn and sky is less likely to cause a significant decrease in the signal-to-noise ratio even when the image quality is degraded, but visually noticeable noise such as block noise is likely to occur.

逆に、木やテクスチャ模様のような大振幅信号で構成される画像は少しの画質劣化でもＳＮ比が大きく現象することがあるが、ブロックノイズは目立ちにくい傾向にある。よって、ＳＮ比を基準に目標符号量を決定していた従来の手法では、人間の視覚特性に合致した符号量を設定することが困難である問題点があった。 Conversely, an image composed of a large amplitude signal such as a tree or a texture pattern may have a large SN ratio even if the image quality is slightly deteriorated, but the block noise tends to be inconspicuous. Therefore, the conventional method in which the target code amount is determined based on the SN ratio has a problem that it is difficult to set a code amount that matches human visual characteristics.

本発明は前述の問題点にかんがみ、人間の視覚特性に応じた符号量の設定を行うことができるようにすることを目的としている。 The present invention has been made in view of the above-described problems, and an object of the present invention is to make it possible to set a code amount in accordance with human visual characteristics.

本発明の画像符号化装置は、符号化対象画像を符号化する符号化手段と、前記符号化手段で符号化された符号化データを局所的に復号化する復号化手段とを有する画像符号化装置であって、前記符号化対象画像の目標符号量を決定する目標符号量決定手段と、前記復号化手段から出力される復号画像信号を用いてブロック歪度合いを算出する算出手段と、前記算出手段により算出されたブロック歪度合いの大きさを判断し、前記ブロック歪度合いの大きさに応じて前記目標符号量決定手段で決定する目標符号量を制御する目標符号量制御手段とを有することを特徴とする。 An image encoding apparatus according to the present invention includes an encoding unit that encodes an encoding target image, and a decoding unit that locally decodes encoded data encoded by the encoding unit. A device for determining a target code amount for determining a target code amount of the encoding target image; a calculating unit for calculating a block distortion degree using a decoded image signal output from the decoding unit; And a target code amount control means for determining a block distortion degree calculated by the means and controlling a target code amount determined by the target code amount determination means according to the block distortion degree. Features.

本発明の画像符号化方法は、符号化対象画像を符号化する符号化工程と、前記符号化工程で符号化された符号化データを局所的に復号化する復号化工程とを有する画像符号化方法であって、前記符号化対象画像の目標符号量を決定する目標符号量決定工程と、前記復号化工程から出力される復号画像信号を用いてブロック歪度合いを算出する算出工程と、前記算出工程により算出されたブロック歪度合いの大きさを判断し、前記ブロック歪度合いの大きさに応じて前記目標符号量決定手段で決定する目標符号量を制御する目標符号量制御工程とを有することを特徴とする。 An image encoding method according to the present invention includes an encoding process for encoding an image to be encoded, and a decoding process for locally decoding the encoded data encoded in the encoding process. A method for determining a target code amount of the encoding target image; a calculation step for calculating a block distortion degree using a decoded image signal output from the decoding step; and the calculation A target code amount control step of determining a block distortion degree calculated in the process and controlling a target code amount determined by the target code amount determining means according to the block distortion degree. Features.

本発明のプログラムは、符号化対象画像を符号化する符号化工程と、前記符号化工程で符号化された符号化データを局所的に復号化する復号化工程とを有する画像符号化方法をコンピュータに実行させるプログラムであって、前記符号化対象画像の目標符号量を決定する目標符号量決定工程と、前記復号化工程から出力される復号画像信号を用いてブロック歪度合いを算出する算出工程と、前記算出工程により算出されたブロック歪度合いの大きさを判断し、前記ブロック歪度合いの大きさに応じて前記目標符号量決定手段で決定する目標符号量を制御する目標符号量制御工程とを有する画像符号化方法をコンピュータに実行させることを特徴とする。 The program according to the present invention is a computer that performs an image encoding method including an encoding step for encoding an image to be encoded and a decoding step for locally decoding the encoded data encoded in the encoding step. A target code amount determination step of determining a target code amount of the encoding target image, and a calculation step of calculating a block distortion degree using a decoded image signal output from the decoding step; A target code amount control step of determining a block distortion degree calculated by the calculation step and controlling a target code amount determined by the target code amount determining means according to the block distortion degree. A computer is caused to execute the image coding method.

本発明によれば、算出されたブロック歪度合いに応じて目標符号量を制御するようにしたので、人間の視覚特性に応じた符号量の設定を行うことができ、良好な画質の復号画像を得ることができるようにすることができる。 According to the present invention, since the target code amount is controlled according to the calculated degree of block distortion, it is possible to set the code amount according to human visual characteristics, and to obtain a decoded image with good image quality. Can be obtained.

（第１の実施形態）
以下、本発明の第１の実施形態を図１〜図５に基づき説明する。
図１は、本発明にかかる第１の符号化方法を実現する符号化装置の構成を示すブロック図であり、前述のＴＭ５アルゴリズムを実行することが可能な構成になっている。 (First embodiment)
A first embodiment of the present invention will be described below with reference to FIGS.
FIG. 1 is a block diagram showing a configuration of an encoding apparatus that realizes the first encoding method according to the present invention, and is configured to be able to execute the above-described TM5 algorithm.

具体的には、ＭＰＥＧ-１、ＭＰＥＧ-２或いはＭＰＥＧ-４規格に対応するものである。図２は、図１の符号化装置の処理を説明するフローチャートである。図３はマクロブロックとブロック境界を明示化したものである。図４はブロック歪度合いに応じた目標符号量のオフセット量を示す図であり、図５はブロック歪度合いに応じた目標符号量の設定処理を説明するフローチャートである。 Specifically, it corresponds to the MPEG-1, MPEG-2 or MPEG-4 standards. FIG. 2 is a flowchart for explaining processing of the encoding apparatus of FIG. FIG. 3 clarifies macroblocks and block boundaries. FIG. 4 is a diagram showing the offset amount of the target code amount according to the block distortion degree, and FIG. 5 is a flowchart for explaining the target code amount setting process according to the block distortion degree.

以下、図１〜図５を参照して、符号化処理ならびにブロック歪度合いに応じた目標符号量の設定方法の流れを具体的に説明する。
図１において、１００は例えばディジタルビデオカメラに具備されているＣＣＤならびにライン入力端子から送られる入力信号であり、１０１は直交変換回路であり、１０２は量子化回路であり、１０３は可変長符号化回路である。１０４は逆直交変換回路であり、１０５は逆量子化回路であり、１０６は逆可変長復号化回路であり、１０７は図示しない動き補償回路で使用する参照フレーム１０８、ならびに逆量子化回路ならびに逆直交変換回路１０４で局所復号化した画像信号であるローカルデコードフレーム１０９を保存するメモリである。 Hereinafter, the flow of the setting method of the target code amount corresponding to the encoding process and the degree of block distortion will be specifically described with reference to FIGS.
In FIG. 1, 100 is an input signal sent from, for example, a CCD and a line input terminal provided in a digital video camera, 101 is an orthogonal transformation circuit, 102 is a quantization circuit, and 103 is variable length coding. Circuit. Reference numeral 104 denotes an inverse orthogonal transform circuit, 105 denotes an inverse quantization circuit, 106 denotes an inverse variable length decoding circuit, 107 denotes a reference frame 108 used in a motion compensation circuit (not shown), an inverse quantization circuit, and an inverse quantization circuit. This is a memory for storing a local decode frame 109 which is an image signal locally decoded by the orthogonal transform circuit 104.

１１０はビットレートに応じて今から符号化するＧＯＰの目標ビットレートならびに、ピクチャの目標符号量を決定する符号量制御部であり、１１１は符号量制御部１１０で決定されたピクチャの目標符号量を基に、マクロブロックの量子化係数を決定する量子化制御部であり、量子化制御部１１１で決定した量子化パラメータ（量子化スケールとも呼ぶ）を用いて量子化回路１０２は量子化を行う。 110 is a code amount control unit for determining the target bit rate of the GOP to be encoded according to the bit rate and the target code amount of the picture, and 111 is the target code amount of the picture determined by the code amount control unit 110 Is a quantization control unit that determines the quantization coefficient of the macroblock, and the quantization circuit 102 performs quantization using the quantization parameter (also referred to as a quantization scale) determined by the quantization control unit 111. .

量子化された画像信号は、可変長符号化回路１０３において符号化されて図示しない記録媒体や出力端子に出力信号１１２として出力されるのと同時に、逆量子化回路１０５ならびに逆直交変換回路１０４を経由して次ピクチャの符号化に利用する参照フレーム１０８を生成する。１１３はブロック歪度合い算出回路であり、入力信号ならびに局所復号画像であるローカルデコードフレームからブロック歪度合いを算出する。 The quantized image signal is encoded by the variable length encoding circuit 103 and output as an output signal 112 to a recording medium (not shown) or an output terminal. At the same time, the inverse quantization circuit 105 and the inverse orthogonal transform circuit 104 are used. A reference frame 108 to be used for encoding the next picture is generated. A block distortion degree calculation circuit 113 calculates a block distortion degree from an input signal and a local decoded frame that is a locally decoded image.

次に、図２のフローチャートを参照しながら符号化装置の処理手順を説明する
まず、ステップＳ２０１において、前述のＴＭ５アルゴリズムのＳＴＥＰ１を利用するために、（３−１）〜（３−３）式に従ってピクチャタイプ（Ｉ、Ｐ、Ｂピクチャ）のそれぞれについての目標符号量Ｔｉ、Ｔｐ、Ｔｂの初期値を決定する。この際、初期値を決定するためのＸｉ、Ｘｐ、Ｘｂは存在しないので任意の値か、符号化前の画像情報から算出した値を用いて前記目標符号量Ｔｉ、Ｔｐ、Ｔｂの初期値を設定する。 Next, the processing procedure of the encoding device will be described with reference to the flowchart of FIG. 2. First, in order to use STEP1 of the above-described TM5 algorithm in step S201, equations (3-1) to (3-3) Accordingly, the initial values of the target code amounts Ti, Tp, and Tb for each of the picture types (I, P, and B pictures) are determined. At this time, since Xi, Xp, and Xb for determining the initial values do not exist, the initial values of the target code amounts Ti, Tp, and Tb are set using arbitrary values or values calculated from the image information before encoding. Set.

次に、処理をステップＳ２０２に進め、符号化対象ピクチャの符号化を開始する。必要ならば、このステップＳ２０２で符号化装置内部のレジスタセットを行う。
次に、処理をステップ２０３に進め、ピクチャタイプがＩピクチャの場合図示しないブロック分割部でマクロブロックに分割された入力信号は直交変換回路１０１において直交変換を行う。また、量子化制御部１１１で決定された量子化スケールを用いて直交変換出力係数の量子化を行う。 Next, the process proceeds to step S202, and encoding of the encoding target picture is started. If necessary, register setting in the encoding apparatus is performed in step S202.
Next, the process proceeds to step 203. When the picture type is I picture, the orthogonal transform circuit 101 performs orthogonal transform on the input signal divided into macro blocks by a block division unit (not shown). Also, the orthogonal transform output coefficient is quantized using the quantization scale determined by the quantization control unit 111.

また、ピクチャタイプがＰピクチャならびにＢピクチャの場合は、ブロック分割部でマクロブロックに分割された入力信号と参照フレーム１０８（マクロブロックデータ）を用いて図示しない動き予測ならびに動き補償を行う。そして、その結果の出力と入力信号の差分値に対して直交変換回路１０１において直交変換を行い、量子化制御部１１１で決定された量子化スケールを用いて直交変換出力係数の量子化を行う。なお、量子化パラメータである量子化スケールの算出はＴＭ５のステップ２に相当する処理を行うことで算出されるので、ここでの説明は省略する。 When the picture type is a P picture or a B picture, motion prediction and motion compensation (not shown) are performed using the input signal and the reference frame 108 (macroblock data) divided into macroblocks by the block division unit. Then, the orthogonal transformation circuit 101 performs orthogonal transformation on the difference value between the resulting output and the input signal, and the orthogonal transformation output coefficient is quantized using the quantization scale determined by the quantization control unit 111. The calculation of the quantization scale, which is a quantization parameter, is calculated by performing a process corresponding to step 2 of TM5, and thus the description thereof is omitted here.

ここで、入力される符号化前のマクロブロックは、後に説明するブロック歪度合いの算出に利用するために、ブロック歪度合い算出回路１１３に入力される。ブロック歪度合い算出回路１１３における処理は、後に詳述する。 Here, the input macroblock before encoding is input to the block distortion degree calculation circuit 113 in order to be used for calculation of the block distortion degree described later. The processing in the block distortion degree calculation circuit 113 will be described in detail later.

次に、ステップＳ２０４において、ステップＳ２０３の処理により生成された符号化データを逆量子化回路１０５、逆直交変換回路１０４により逆変換処理を行って復号データであるローカルデコードフレーム１０９を生成する。 Next, in step S204, the encoded data generated by the process of step S203 is subjected to inverse transform processing by the inverse quantization circuit 105 and the inverse orthogonal transform circuit 104 to generate a local decode frame 109 which is decoded data.

次に、ステップＳ２０５において、ブロック歪度合い算出回路１１３により、符号化前のマクロブロックと、ローカルデコードフレーム１０９のデータのうち、符号化前のマクロブロックと同じ座標にあるマクロブロックとを比較する。次に、ＭＰＥＧ処理により生じた画像の歪みを評価するパラメータとして、ブロック歪度合いを演算する。ブロック歪度合いの計算方法としては、以下の方法が挙げられる。 Next, in step S205, the block distortion degree calculation circuit 113 compares the macroblock before encoding with the macroblock at the same coordinates as the macroblock before encoding in the data of the local decode frame 109. Next, the degree of block distortion is calculated as a parameter for evaluating the distortion of the image generated by the MPEG processing. The following method is mentioned as a calculation method of a block distortion degree.

符号化前及び復号化後の２つの画像に関し、それぞれを８×８のブロックとし、この８×８ブロックの境界の各画素について、（２１）式による差分和の計算を実行する。
図３は、マクロブロック（図３（a））と、これを構成する符号化前の８×８の画素ブロック（図３（b））と、符号化後の８×８の画素ブロック（図３（c））とを示す図である。 With respect to the two images before encoding and after decoding, each of them is an 8 × 8 block, and the difference sum calculation according to the equation (21) is executed for each pixel at the boundary of the 8 × 8 block.
FIG. 3 shows a macro block (FIG. 3A), an 8 × 8 pixel block before encoding (FIG. 3B) constituting the macro block, and an 8 × 8 pixel block after encoding (FIG. 3B). 3 (c)).

マクロブロックを構成する４つの８×８ブロックの境界画素各々について、入力画像の輝度成分（Ｐ0，Ｐ1、Ｐ2、Ｐ3・・・）と、ローカルデコードデータの輝度成分（Ｒ0、Ｒ1、Ｒ2、Ｒ3・・・）、として両者の差分絶対値和（diff＿sum）は、下記の（２１）式により求めることができる。
diff＿sum=Σabsolute(Ｐj−Ｒj) ・・・（２１）式
ブロック歪み度度合い算出回路１１３は、前述の方法によりブロック歪度合いを演算することができる。 For each of the boundary pixels of the four 8 × 8 blocks constituting the macroblock, the luminance component of the input image (P0, P1, P2, P3...) And the luminance component of the local decode data (R0, R1, R2, R3). )), The difference absolute value sum (diff_sum) of the two can be obtained by the following equation (21).
diff_sum = Σabsolute (Pj−Rj) (21) The block distortion degree calculation circuit 113 can calculate the block distortion degree by the method described above.

説明を図２のフローチャートに戻す。
ステップＳ２０５においてブロック歪度合いを演算したら、次に、ステップＳ２０６に進み、ステップＳ２０３からステップＳ２０５までの処理をピクチャ内の全てのマクロブロックに対して行ったか否かを判断する。その際に、各マクロブロックで使用する量子化パラメータは可変長符号化回路１０３で可変長符号化されたマクロブロックの符号量を用いて量子化制御部１１１で決定される。この判断の結果、全てのマクロブロックに対して行っていない場合にはステップＳ２０２に戻って前述した処理を繰り返し行う。また、全て終了した場合にはステップＳ２０７に進む。 The description returns to the flowchart of FIG.
If the block distortion degree is calculated in step S205, the process proceeds to step S206, and it is determined whether or not the processing from step S203 to step S205 has been performed for all macroblocks in the picture. At this time, the quantization parameter used in each macroblock is determined by the quantization control unit 111 using the code amount of the macroblock that is variable-length encoded by the variable-length encoding circuit 103. If the result of this determination is that it has not been performed for all macroblocks, the process returns to step S202 and the above-described processing is repeated. If all the processes are completed, the process proceeds to step S207.

次に、処理をステップＳ２０７においては、符号化対象ピクチャのすべてのマクロブロックに対して符号化を行った後に、符号量制御部１１０において、前述した（１）式を用いてＸｉ、Ｘｐ、Ｘｂの算出を行う。なお、Ｘｉ、Ｘｐ、Ｘｂは各ピクチャを符号化した結果得られる符号量とピクチャ内のすべてのマクロブロックにおけるＱスケールの平均値を乗算したものである。 Next, in step S207, after encoding all macroblocks of the encoding target picture, the code amount control unit 110 uses the above-described equation (1) to express Xi, Xp, Xb. Is calculated. Xi, Xp, and Xb are obtained by multiplying the code amount obtained as a result of encoding each picture by the average value of the Q scale in all macroblocks in the picture.

次に、処理をステップＳ２０８に進み、符号量制御部１１０において、Ｘｉ、Ｘｐ、Ｘｂとブロック歪度合い算出回路１１３で算出されたブロック歪度合いとを用いて次の符号化対象ピクチャの目標符号量Ｔｉ、Ｔｐ、Ｔｂを決定する。
その後、ステップＳ２０９において、最終ピクチャか否かを判断し、最終ピクチャであった場合には処理を終了する。また、最終ピクチャではなかった場合にはステップＳ２０２に戻って前述した処理を繰り返し実行する。 Next, the process proceeds to step S208, and the code amount control unit 110 uses the target code amount of the next encoding target picture using Xi, Xp, and Xb and the block distortion degree calculated by the block distortion degree calculation circuit 113. Ti, Tp, and Tb are determined.
Thereafter, in step S209, it is determined whether or not it is the last picture. If it is the last picture, the process is terminated. If it is not the final picture, the process returns to step S202 and the above-described processing is repeatedly executed.

この目標符号量算出過程を図４ならびに図５のフローチャート用いて説明する。
図４は、ブロック歪度合いに応じた目標符号量のオフセット量を示しており、このオフセット量はブロック歪度合いの関数になっている。α、β（α＞０、β＜０）は任意の定数であり符号化装置で自由に設定可能である。 This target code amount calculation process will be described with reference to the flowcharts of FIGS.
FIG. 4 shows the offset amount of the target code amount according to the block distortion degree, and this offset amount is a function of the block distortion degree. α and β (α> 0, β <0) are arbitrary constants and can be freely set by the encoding device.

まず、ステップＳ５０１において、（１）式を用いてＴpictype＿baseを算出する。ここで、Ｔpictype＿baseとは、Ｉ、Ｐ、Ｂ各ピクチャに対して（１）式を用いて暫定的に算出した仮の目標符号量である。そして、次フレームの符号化対象ピクチャがＩピクチャであればＴi＿baseを求めることになる（１枚のピクチャ符号化において毎回各ピクチャの目標符号量を算出しているわけではない）。これ以降は、説明の混乱を避けるために、次符号化対象ピクチャがＩピクチャの場合を想定して説明を行う。 First, in step S501, Tpictype_base is calculated using equation (1). Here, Tpictype_base is a tentative target code amount that is tentatively calculated using equation (1) for each of the I, P, and B pictures. Then, if the picture to be encoded in the next frame is an I picture, Ti_base is obtained (the target code amount of each picture is not calculated every time one picture is encoded). In the following, in order to avoid confusion in the description, the description will be made assuming that the next encoding target picture is an I picture.

次に、処理をステップＳ５０２に進め、ブロック歪度合い算出回路１１３で算出されたブロック歪度合いと閾値th1との比較を行う。この比較の結果、ブロック歪度合いが小さければステップＳ５０４へ処理を進めて次符号化対象ピクチャの目標符号量Ｔiを
Ti = Ｔi＿base＋β （β＜０）・・・（２２）式
とする処理を行う。 Next, the process proceeds to step S502, and the block distortion degree calculated by the block distortion degree calculation circuit 113 is compared with the threshold value th1. If the degree of block distortion is small as a result of this comparison, the process proceeds to step S504 to set the target code amount Ti of the next encoding target picture.
Ti = Ti_base + β (β <0) (Processing represented by equation (22) is performed.)

一方、ステップＳ５０２の比較の結果において、ブロック歪度合いが閾値th1よりも大きければ、ステップＳ５０３へ処理を進める。
ステップＳ５０３において、ブロック歪度合いと閾値th2の比較を行う。この比較の結果、ブロック歪度合いが小さければステップＳ５０５へ処理を進めて、次符号化対象ピクチャの目標符号量Ｔiを
Ti = Ｔi＿base ((α-β)*(diff＿sum-th1)/(th2-th1))+β ・・・（２３）式
とする処理を行う。 On the other hand, if the block distortion degree is larger than the threshold th1 in the comparison result in step S502, the process proceeds to step S503.
In step S503, the block distortion degree is compared with the threshold value th2. As a result of this comparison, if the degree of block distortion is small, the process proceeds to step S505, and the target code amount Ti of the next coding target picture is set.
Ti = Ti_base ((α−β) * (diff_sum-th1) / (th2−th1)) + β (23) Processing is performed.

一方、ステップＳ５０３の比較の結果、ブロック歪度合いが閾値th2よりも大きければ、ステップＳ５０６へ処理を進めて、次符号化対象ピクチャの目標符号量Ｔiを
Ti = Ｔi＿base＋α （α＞０）・・・（２４）式
とする処理を行う。 On the other hand, as a result of the comparison in step S503, if the block distortion degree is larger than the threshold th2, the process proceeds to step S506, and the target code amount Ti of the next encoding target picture is set.
Ti = Ti_base + α (α> 0) (24) The processing is performed.

以上の処理を行うことにより、ＴＭ５アルゴリズムのステップ１に相当する処理では人間の視覚特性を考慮した目標符号量の設定が不可能であったが、このようにブロック歪度合いを考慮した目標符号量の設定を行うことで画質の向上を行うことができる。なお、ブロック歪度合い算出計算式は前記方法に限ったものではなく、いずれの形態でもブロック歪度合いが算出可能であれば構わない。 By performing the above processing, the processing corresponding to step 1 of the TM5 algorithm cannot set the target code amount in consideration of human visual characteristics. The image quality can be improved by performing the setting. Note that the block distortion degree calculation formula is not limited to the above method, and any block distortion degree can be calculated in any form.

また、目標符号量の設定方法も同様に前記方法に限ったものではなく、ブロック歪度合いが目標符号量の設定に作用する形であれば構わない。さらに、ブロック歪度合いで目標符号量の制御を行わない場合はＴＭ５のステップ１相当の処理のみを行って目標符号量を決定することも可能である。 Similarly, the method for setting the target code amount is not limited to the above method, and any method may be used as long as the degree of block distortion affects the setting of the target code amount. Furthermore, when the target code amount is not controlled based on the degree of block distortion, it is also possible to determine the target code amount by performing only the process corresponding to step 1 of TM5.

（第２の実施形態）
次に、本発明の第２の実施形態を図６〜図９に基づき説明する。
図６は、本発明にかかる第２の符号化方法を実現する装置の構成を示す符号化装置のブロック図であり、前述のＴＭ５アルゴリズムを実行することが可能な構成になっている。具体的には、ＭＰＥＧ-１、ＭＰＥＧ-２或いはＭＰＥＧ-４規格に対応するものである。図７〜図９は後述するデブロッキングフィルタを説明する図である。 (Second Embodiment)
Next, a second embodiment of the present invention will be described with reference to FIGS.
FIG. 6 is a block diagram of an encoding apparatus showing a configuration of an apparatus that implements the second encoding method according to the present invention, and is configured to be able to execute the above-described TM5 algorithm. Specifically, it corresponds to the MPEG-1, MPEG-2 or MPEG-4 standards. 7 to 9 are diagrams illustrating a deblocking filter to be described later.

また、基本的な処理の流れ、ならびにブロック歪度合いに応じた目標符号量の設定手法は第１の実施形態と同じであるので、図６ならびに図７〜図９を参照して符号化処理ならびにブロック歪度合いに応じた目標符号量の設定方法の流れを具体的に説明する。 Further, since the basic process flow and the target code amount setting method according to the block distortion degree are the same as those in the first embodiment, the encoding process and the process shown in FIGS. A flow of a method for setting a target code amount according to the degree of block distortion will be specifically described.

図６において、６００は例えばディジタルビデオカメラに具備されているＣＣＤならびにライン入力端子からの映像信号であり、６０１は直交変換回路であり、６０２は量子化回路であり、６０３は可変長符号化回路である。 In FIG. 6, 600 is a video signal from a CCD and line input terminal provided in a digital video camera, for example, 601 is an orthogonal transformation circuit, 602 is a quantization circuit, and 603 is a variable length coding circuit. It is.

また、６０４は逆直交変換回路であり、６０５は逆量子化回路であり、６０６は逆可変長復号化回路である。６０７は図示しない動き補償回路で使用する参照フレーム６０８、ならびに逆量子化回路６０５及び逆直交変換回路６０４で局所復号化した画像信号であるローカルデコードフレーム６０９を保存するメモリである。 Reference numeral 604 denotes an inverse orthogonal transform circuit, reference numeral 605 denotes an inverse quantization circuit, and reference numeral 606 denotes an inverse variable length decoding circuit. A memory 607 stores a reference frame 608 used in a motion compensation circuit (not shown) and a local decode frame 609 that is an image signal locally decoded by the inverse quantization circuit 605 and the inverse orthogonal transform circuit 604.

６１０はビットレートに応じて今から符号化するＧＯＰの目標ビットレートならびに、ピクチャの目標符号量を決定する符号量制御部である。６１１は符号量制御部６１０で決定されたピクチャの目標符号量を基に、マクロブロックの量子化係数を決定する量子化制御部であり、量子化制御部６１１で決定した量子化パラメータ（量子化スケールとも呼ぶ）を用いて量子化回路６０２は量子化を行う。 A code amount control unit 610 determines the target bit rate of the GOP to be encoded from now on and the target code amount of the picture according to the bit rate. Reference numeral 611 denotes a quantization control unit that determines the quantization coefficient of the macroblock based on the target code amount of the picture determined by the code amount control unit 610. The quantization parameter (quantization parameter) determined by the quantization control unit 611 The quantization circuit 602 performs quantization using a scale).

量子化された画像信号は、可変長符号化回路６０３において符号化されて図示しない記録媒体や出力端子に出力信号６１２として出力される。また、それと同時に、逆量子化回路６０５ならびに逆直交変換回路６０４を経由して次ピクチャの符号化に利用する参照フレーム６０８を生成する。 The quantized image signal is encoded by the variable-length encoding circuit 603 and output as an output signal 612 to a recording medium (not shown) or an output terminal. At the same time, a reference frame 608 used for encoding the next picture is generated via the inverse quantization circuit 605 and the inverse orthogonal transform circuit 604.

６１３はデブロッキングフィルタであり、復号時に出力信号のブロック境界を目立たなくさせるためのポストフィルタである。このデブロッキングフィルタ６１３は、符号化対象映像のビットレートが低い時や複雑な映像を符号化した場合に８×８画素ブロック（DCTブロック）、ならびにマクロブロック境界に発生するブロックノイズを除去するものである。これは、高画質な映像を提供するためにブロック単位で直交変換を伴う符号化復号化回路に搭載されていることが多い。 Reference numeral 613 denotes a deblocking filter, which is a post filter for making the block boundary of the output signal inconspicuous at the time of decoding. The deblocking filter 613 removes 8 × 8 pixel block (DCT block) and block noise generated at the macroblock boundary when the bit rate of the video to be encoded is low or when complex video is encoded. It is. This is often mounted on an encoding / decoding circuit that involves orthogonal transform in units of blocks in order to provide high-quality video.

デブロッキングフィルタはMPEGの規格には特に規定されていないが、一例として図７〜図９を参照として以下の方法を挙げる。
図７は、マクロブロックを構成する４つの８×８画素ブロック（図７（a）、７０１〜７０４）と、８×８画素ブロック７０１と７０２の境界におけるデブロッキングフィルタ処理を示す図（図７（b））である。 The deblocking filter is not particularly defined in the MPEG standard, but the following method is given as an example with reference to FIGS.
FIG. 7 is a diagram showing deblocking filter processing at the boundary between four 8 × 8 pixel blocks (FIG. 7A and 701 to 704) constituting the macroblock and the 8 × 8 pixel blocks 701 and 702 (FIG. 7). (B)).

７０１の輝度成分（Ｐ0，Ｐ1、Ｐ2、Ｐ3・・・）と、７０２の輝度成分（Ｒ0、Ｒ1、Ｒ2、Ｒ3・・・）として、各輝度値の差分絶対値和（diff＿sum）は（３１）式により求めることができる。
diff＿sum=Σabsolute(Ｐj−Ｒj) ・・・（３１）式 As the luminance component (P0, P1, P2, P3...) Of 701 and the luminance component (R0, R1, R2, R3...) Of 702, the difference absolute value sum (diff_sum) of each luminance value is (31 ).
diff_sum = Σabsolute (Pj−Rj) (31)

また、マクロブロックは符号化時に量子化制御部で決定した量子化パラメータ（量子化スケール）で量子化を行っている。その量子化パラメータは復号時には逆可変長復号化回路６０６より各マクロブロックで算出されている。 The macroblock is quantized with a quantization parameter (quantization scale) determined by the quantization control unit at the time of encoding. The quantization parameter is calculated for each macroblock by the inverse variable length decoding circuit 606 at the time of decoding.

次に、デブロッキングフィルタ６１３の処理について、図８のフローチャートと図９のフィルタ特性を用いて説明する。
まず、ステップＳ８０１において、（３１）式を用いて算出されたdiff＿sumが閾値th＿valよりも大きく量子化パラメータQjが閾値th＿qよりも大きいか否かを判断する。この判断の結果、大きい場合、ブロックノイズが発生していると判断して、処理をステップＳ８０２に進め、ブロック境界に対してフィルタ処理を行う。 Next, the process of the deblocking filter 613 is demonstrated using the flowchart of FIG. 8, and the filter characteristic of FIG.
First, in step S801, it is determined whether or not diff_sum calculated using equation (31) is greater than threshold th_val and quantization parameter Qj is greater than threshold th_q. If the result of this determination is large, it is determined that block noise has occurred, and the process advances to step S802 to perform filter processing on the block boundary.

この時、図９に示したフィルタ特性のうち９０２〜９０４のような帯域を落とすようなフィルタが用いられる。また、９０２〜９０４の選択はdiff＿sum値ならびにQjによって変動するものとする。一方、ステップＳ８０１の条件を満たさなかった場合、ブロックノイズが発生していないと判断して、処理をステップＳ８０３に進めフィルタ処理を行わない。この時のフィルタ特性は、図９に示した９０１のようになる。 At this time, a filter that drops a band such as 902 to 904 among the filter characteristics shown in FIG. 9 is used. In addition, the selection of 902 to 904 varies depending on the diff_sum value and Qj. On the other hand, if the condition in step S801 is not satisfied, it is determined that no block noise has occurred, the process proceeds to step S803, and the filter process is not performed. The filter characteristics at this time are as indicated by 901 shown in FIG.

ここで、デブロッキングフィルタ６１３内でフィルタ処理を行うか行わないか、ならびにフィルタ強度を選択する際のパラメータとして算出したdiff＿sum値は、第１の実施形態におけるブロック歪度合いに相当する。 Here, the diff_sum value calculated as a parameter for selecting whether or not to perform the filtering process in the deblocking filter 613 and the filter strength is equivalent to the block distortion degree in the first embodiment.

デブロッキングフィルタ６１３はポストフィルタの役割を担うため、通常復号時には動作させて符号化時には動作させないことが一般的である。しかしながら、本実施形態では符号化時にもデブロッキングフィルタ６１３を動作させて第１の実施形態におけるブロック歪度合い算出回路１１３で算出するブロック歪度合いを算出するようにする。 Since the deblocking filter 613 plays the role of a post filter, it is generally operated during normal decoding but not during encoding. However, in the present embodiment, the deblocking filter 613 is operated even during encoding, and the block distortion degree calculated by the block distortion degree calculation circuit 113 in the first embodiment is calculated.

ブロック歪度合いを用いて目標符号量を設定する方法は、第１の実施形態に述べた方法と全く同様の方法（図２のフローチャート）で行うことが可能であるため、ここでの説明は省略する。また、前記でブロッキングフィルタにおけるブロック歪度合い算出ならびにフィルタ特性に関しては前記方法に限ったものではない。 The method of setting the target code amount using the block distortion degree can be performed by the same method (the flowchart of FIG. 2) as the method described in the first embodiment, and thus the description thereof is omitted here. To do. The block distortion degree calculation and filter characteristics in the blocking filter are not limited to the above method.

図６に示すように、デブロッキングフィルタへ入力信号６００とローカルデコードフレーム６０９を入力することにより、第１の実施形態と全く同じ処理を行うことも可能であり、ブロック歪度合いが算出可能であれば構わない。 As shown in FIG. 6, by inputting the input signal 600 and the local decode frame 609 to the deblocking filter, it is possible to perform exactly the same processing as in the first embodiment, and the block distortion degree can be calculated. It doesn't matter.

また、目標符号量の設定方法も同様に前記方法に限ったものではなく、ブロック歪度合いが目標符号量の設定に作用する形であれば構わない。さらに、ブロック歪度合いで目標符号量の制御を行わない場合は、ＴＭ５のステップ１相当の処理のみを行って目標符号量を決定することも可能である。 Similarly, the method for setting the target code amount is not limited to the above method, and any method may be used as long as the degree of block distortion affects the setting of the target code amount. Further, when the target code amount is not controlled based on the degree of block distortion, it is possible to determine the target code amount by performing only the process corresponding to step 1 of TM5.

（第３の実施形態）
次に、図１０を参照しながら第３の実施形態を説明する。
図１０は、本発明にかかる第３の符号化方法を実現する装置の構成を示す符号化装置のブロック図であり、前述のＴＭ５アルゴリズム相当の処理を実行することが可能な構成になっている。具体的には、Ｈ．２６４規格に対応するものであり、符号化処理内にループフィルタ（所謂デブロッキングフィルタ）を含む規格に対応するものである。 (Third embodiment)
Next, a third embodiment will be described with reference to FIG.
FIG. 10 is a block diagram of an encoding apparatus showing the configuration of an apparatus that implements the third encoding method according to the present invention, and is configured to be able to execute processing equivalent to the above-described TM5 algorithm. . Specifically, H.C. The H.264 standard corresponds to a standard including a loop filter (so-called deblocking filter) in the encoding process.

以下、図１０を参照して、符号化処理ならびにブロック歪度合いに応じた目標符号量の設定方法の流れを具体的に説明する。なお、本符号化装置ではＨ．２６４／ＭＰＥＧ４−ＡＶＣ（以下、Ｈ．２６４と称す）を例にしている。そして、Ｈ．２６４の符号化方式で使用されるイントラ予測やループフィルタの仕様を始めとした本発明の本質とは関係のないブロックの表示は行っておらず、本文中での説明も省略する。 Hereinafter, with reference to FIG. 10, the flow of a method for setting a target code amount in accordance with the encoding process and the degree of block distortion will be specifically described. In this encoding apparatus, H.264 is used. H.264 / MPEG4-AVC (hereinafter referred to as H.264) is taken as an example. And H. Blocks not related to the essence of the present invention such as intra prediction and loop filter specifications used in the H.264 coding system are not displayed, and description in the text is also omitted.

図１０において、１０００は例えばディジタルビデオカメラに具備されているＣＣＤならびにライン入力端子からの映像信号であり、１００１は直交変換回路であり、１００２は量子化回路であり、１００３はエントロピー符号化回路である。１００４は逆量子化回路であり、１００５はエントロピー復号化回路であり、１００６は逆直交変換回路である。１００７は図示しない動き補償回路で使用する参照フレーム１００８、ならびに逆量子化回路ならびに逆直交変換回路１００４で局所復号化した画像信号であるローカルデコードフレーム１００９を保存するメモリである。 In FIG. 10, 1000 is a video signal from a CCD and line input terminal provided in a digital video camera, for example, 1001 is an orthogonal transformation circuit, 1002 is a quantization circuit, and 1003 is an entropy coding circuit. is there. Reference numeral 1004 denotes an inverse quantization circuit, reference numeral 1005 denotes an entropy decoding circuit, and reference numeral 1006 denotes an inverse orthogonal transform circuit. Reference numeral 1007 denotes a memory for storing a reference frame 1008 used in a motion compensation circuit (not shown) and a local decode frame 1009 that is an image signal locally decoded by an inverse quantization circuit and an inverse orthogonal transform circuit 1004.

１０１０はビットレートに応じて今から符号化するＧＯＰの目標ビットレートならびに、ピクチャの目標符号量を決定する符号量制御部である。１０１１は符号量制御部１０１０で決定されたピクチャの目標符号量を基に、マクロブロックの量子化係数を決定する量子化制御部である。量子化回路１００２は、量子化制御部１０１１で決定した量子化パラメータ（量子化スケールとも呼ぶ）を用いて量子化を行う。 A code amount control unit 1010 determines the target bit rate of the GOP to be encoded from now on and the target code amount of the picture according to the bit rate. Reference numeral 1011 denotes a quantization control unit that determines the quantization coefficient of the macroblock based on the target code amount of the picture determined by the code amount control unit 1010. The quantization circuit 1002 performs quantization using the quantization parameter (also referred to as a quantization scale) determined by the quantization control unit 1011.

量子化された画像信号は、エントロピー符号化回路１００３において符号化されて図示しない記録媒体や出力端子に出力信号１０１２として出力される。これと同時に、逆量子化回路１００５ならびに逆直交変換回路１００４を経由して次ピクチャの符号化に利用する参照フレーム１００８を生成する。 The quantized image signal is encoded by the entropy encoding circuit 1003 and output as an output signal 1012 to a recording medium or an output terminal (not shown). At the same time, a reference frame 1008 used for encoding the next picture is generated via the inverse quantization circuit 1005 and the inverse orthogonal transform circuit 1004.

１０１３はループフィルタであり、第２の実施形態に示したデブロッキングフィルタと同様の機能を果たす。Ｈ．２６４では、このループフィルタを符号化内部に取り込んで動き予測を行う際に、ループフィルタでブロックノイズを除去した参照フレームを用いることにより、符号化効率ならびに画質の向上を行っている。 A loop filter 1013 performs the same function as the deblocking filter described in the second embodiment. H. H.264 improves coding efficiency and image quality by using a reference frame from which block noise has been removed by the loop filter when motion prediction is performed by incorporating the loop filter into the coding.

このループフィルタ１０１３はブロック歪度合いの検出を行い、歪度合いに応じてフィルタ処理の係数を変化させる構成となっている。このブロック歪検出度合いをループフィルタ１０１３で算出した後は、第１の実施形態ならびに第２の実施形態と同様の処理で目標符号量の設定を行うことが可能であるため、ここでの説明は省略する。さらに、ブロック歪度合いで目標符号量の制御を行わない場合はＴＭ５のステップ１相当の処理のみを行って目標符号量を決定することも可能である。 The loop filter 1013 is configured to detect the degree of block distortion and change the filter processing coefficient in accordance with the degree of distortion. After the block distortion detection degree is calculated by the loop filter 1013, the target code amount can be set by the same processing as in the first embodiment and the second embodiment. Omitted. Furthermore, when the target code amount is not controlled based on the degree of block distortion, it is also possible to determine the target code amount by performing only the process corresponding to step 1 of TM5.

（本発明に係る他の実施形態）
前述した本発明の実施形態における画像符号化装置を構成する各手段、並びに画像符号化方法の各ステップは、コンピュータのＲＡＭやＲＯＭなどに記憶されたプログラムが動作することによって実現できる。このプログラム及び前記プログラムを記録したコンピュータ読み取り可能な記録媒体は本発明に含まれる。 (Other embodiments according to the present invention)
Each means constituting the image coding apparatus and each step of the image coding method in the embodiment of the present invention described above can be realized by operating a program stored in a RAM or ROM of a computer. This program and a computer-readable recording medium recording the program are included in the present invention.

また、本発明は、例えば、システム、装置、方法、プログラムもしくは記憶媒体等としての実施形態も可能であり、具体的には、複数の機器から構成されるシステムに適用してもよいし、また、一つの機器からなる装置に適用してもよい。 In addition, the present invention can be implemented as, for example, a system, apparatus, method, program, storage medium, or the like. Specifically, the present invention may be applied to a system including a plurality of devices. The present invention may be applied to an apparatus composed of a single device.

なお、本発明は、前述した実施形態の機能を実現するソフトウェアのプログラム（実施形態では図２、５、８に示すフローチャートに対応したプログラム）を、システムあるいは装置に直接、あるいは遠隔から供給する。そして、そのシステムあるいは装置のコンピュータが前記供給されたプログラムコードを読み出して実行することによっても達成される場合を含む。 In the present invention, a software program (in the embodiment, a program corresponding to the flowcharts shown in FIGS. 2, 5, and 8) for realizing the functions of the above-described embodiments is directly or remotely supplied to the system or apparatus. In addition, this includes a case where the system or the computer of the apparatus is also achieved by reading and executing the supplied program code.

したがって、本発明の機能処理をコンピュータで実現するために、前記コンピュータにインストールされるプログラムコード自体も本発明を実現するものである。つまり、本発明は、本発明の機能処理を実現するためのコンピュータプログラム自体も含まれる。 Accordingly, since the functions of the present invention are implemented by computer, the program code installed in the computer also implements the present invention. In other words, the present invention includes a computer program itself for realizing the functional processing of the present invention.

その場合、プログラムの機能を有していれば、オブジェクトコード、インタプリタにより実行されるプログラム、ＯＳに供給するスクリプトデータ等の形態であってもよい。 In that case, as long as it has the function of a program, it may be in the form of object code, a program executed by an interpreter, script data supplied to the OS, and the like.

プログラムを供給するための記録媒体としては、例えば、フロッピー（登録商標）ディスク、ハードディスク、光ディスク、光磁気ディスク、ＭＯ、ＣＤ−ＲＯＭ、ＣＤ−Ｒ、ＣＤ−ＲＷなどがある。また、磁気テープ、不揮発性のメモリカード、ＲＯＭ、ＤＶＤ（ＤＶＤ−ＲＯＭ，ＤＶＤ−Ｒ）などもある。 Examples of the recording medium for supplying the program include a floppy (registered trademark) disk, hard disk, optical disk, magneto-optical disk, MO, CD-ROM, CD-R, and CD-RW. In addition, there are magnetic tape, nonvolatile memory card, ROM, DVD (DVD-ROM, DVD-R), and the like.

その他、プログラムの供給方法としては、クライアントコンピュータのブラウザを用いてインターネットのホームページに接続する。そして、前記ホームページから本発明のコンピュータプログラムそのもの、もしくは圧縮され自動インストール機能を含むファイルをハードディスク等の記録媒体にダウンロードすることによっても供給できる。 As another program supply method, a browser on a client computer is used to connect to an Internet home page. The computer program itself of the present invention or a compressed file including an automatic installation function can be downloaded from the homepage by downloading it to a recording medium such as a hard disk.

また、本発明のプログラムを構成するプログラムコードを複数のファイルに分割し、それぞれのファイルを異なるホームページからダウンロードすることによっても実現可能である。つまり、本発明の機能処理をコンピュータで実現するためのプログラムファイルを複数のユーザに対してダウンロードさせるＷＷＷサーバも、本発明に含まれるものである。 It can also be realized by dividing the program code constituting the program of the present invention into a plurality of files and downloading each file from a different homepage. That is, a WWW server that allows a plurality of users to download a program file for realizing the functional processing of the present invention on a computer is also included in the present invention.

また、本発明のプログラムを暗号化してＣＤ−ＲＯＭ等の記憶媒体に格納してユーザに配布し、所定の条件をクリアしたユーザに対し、インターネットを介してホームページから暗号化を解く鍵情報をダウンロードさせる。そして、ダウンロードした鍵情報を使用することにより暗号化されたプログラムを実行してコンピュータにインストールさせて実現することも可能である。 In addition, the program of the present invention is encrypted, stored in a storage medium such as a CD-ROM, distributed to users, and key information for decryption is downloaded from a homepage via the Internet to users who have cleared predetermined conditions. Let It is also possible to execute the encrypted program by using the downloaded key information and install the program on a computer.

また、コンピュータが、読み出したプログラムを実行することによって、前述した実施形態の機能が実現される。その他、そのプログラムの指示に基づき、コンピュータ上で稼動しているＯＳなどが、実際の処理の一部または全部を行い、その処理によっても前述した実施形態の機能が実現され得る。 Further, the functions of the above-described embodiments are realized by the computer executing the read program. In addition, based on the instructions of the program, an OS or the like running on the computer performs part or all of the actual processing, and the functions of the above-described embodiments can also be realized by the processing.

さらに、記録媒体から読み出されたプログラムが、コンピュータに挿入された機能拡張ボードやコンピュータに接続された機能拡張ユニットに備わるメモリに書き込まれる。その後、そのプログラムの指示に基づき、その機能拡張ボードや機能拡張ユニットに備わるＣＰＵなどが実際の処理の一部または全部を行い、その処理によっても前述した実施形態の機能が実現される。 Further, the program read from the recording medium is written in a memory provided in a function expansion board inserted into the computer or a function expansion unit connected to the computer. Thereafter, the CPU of the function expansion board or function expansion unit performs part or all of the actual processing based on the instructions of the program, and the functions of the above-described embodiments are realized by the processing.

本発明の第１の実施形態を示し、符号化装置の構成例を示すブロック図である。1 is a block diagram illustrating a configuration example of an encoding device according to a first embodiment of this invention. FIG. 本発明の第１の実施形態を示し、符号化処理の手順の一例を説明するためのフローチャートである。It is a flowchart for demonstrating the 1st Embodiment of this invention and demonstrating an example of the procedure of an encoding process. 本発明の第１の実施形態を示し、（ａ）はマクロブロック、（ｂ）は符号化前の８×８の画素ブロック、（ｃ）は符号化後の８×８の画素ブロックを示す図である。1 shows a first embodiment of the present invention, (a) is a macroblock, (b) is an 8 × 8 pixel block before encoding, and (c) is an 8 × 8 pixel block after encoding. It is. 本発明の第１の実施形態を示し、ブロック歪度合いに応じた目標符号量のオフセット量を示す図である。It is a figure which shows the 1st Embodiment of this invention and shows the offset amount of the target code amount according to the block distortion degree. 本発明の第１の実施形態を示し、ブロック歪度合いに応じた目標符号量の設定処理を説明するフローチャートである。5 is a flowchart illustrating a target code amount setting process according to a block distortion degree according to the first embodiment of this invention. 本発明の第２の実施形態を示し、符号化装置の構成例を示すブロック図である。It is a block diagram which shows the 2nd Embodiment of this invention and shows the structural example of an encoding apparatus. 本発明の第２の実施形態を示し、（ａ）はマクロブロックを構成する４つの８×８画素ブロックを示す図、（ｂ）は８×８画素ブロック７０１と７０２の境界におけるデブロッキングフィルタ処理を示す図である。FIG. 5A shows a second embodiment of the present invention, where FIG. 8A is a diagram showing four 8 × 8 pixel blocks constituting a macroblock, and FIG. FIG. 本発明の第２の実施形態を示し、デブロッキングフィルタの処理手順の一例を説明するフローチャートである。It is a flowchart which shows the 2nd Embodiment of this invention and demonstrates an example of the process sequence of a deblocking filter. 本発明の第２の実施形態を示し、デブロッキングフィルタのフィルタ強度を説明する図である。It is a figure which shows the 2nd Embodiment of this invention and demonstrates the filter strength of a deblocking filter. 本発明の第３の実施形態を示し、符号化装置の構成例を示すブロック図である。It is a block diagram which shows the 3rd Embodiment of this invention and shows the structural example of an encoding apparatus.

Explanation of symbols

１００入力信号
１０１直交変換回路
１０２量子化回路
１０３可変長符号化回路
１０４逆直交変換回路
１０５逆量子化回路
１０６逆可変長復号化回路
１０７メモリ
１０８参照フレーム
１０９ローカルデコードフレーム
１１０符号量制御部
１１１量子化制御部
１１２出力信号
１１３ブロック歪度合い算出回路 DESCRIPTION OF SYMBOLS 100 Input signal 101 Orthogonal transformation circuit 102 Quantization circuit 103 Variable length encoding circuit 104 Inverse orthogonal transformation circuit 105 Inverse quantization circuit 106 Inverse variable length decoding circuit 107 Memory 108 Reference frame 109 Local decoding frame 110 Code amount control part 111 Quantum Control unit 112 output signal 113 block distortion degree calculation circuit

Claims

An image encoding apparatus comprising: encoding means for encoding an encoding target image; and decoding means for locally decoding encoded data encoded by the encoding means,
Target code amount determining means for determining a target code amount of the encoding target image;
Calculating means for calculating a degree of block distortion using a decoded image signal output from the decoding means;
Target code amount control means for determining the degree of block distortion calculated by the calculation means and controlling the target code amount determined by the target code amount determination means in accordance with the magnitude of the block distortion degree. An image encoding apparatus characterized by that.

The image coding apparatus according to claim 1, wherein the target code amount control unit increases the target code amount of the encoding target image when it is determined that the degree of block distortion is large.

The image coding apparatus according to claim 1, wherein the target code amount control unit decreases the target code amount of the encoding target image when it is determined that the degree of block distortion is small.

The target code amount control means has a deblocking filter that changes the filter strength in accordance with the block distortion degree, and controls the target code amount according to the block distortion degree calculation means in the deblocking filter. The image encoding device according to claim 1.

The encoding means and the decoding means may be MPEG or H.264. The image coding apparatus according to any one of claims 1 to 4, wherein the image coding apparatus conforms to a H.264 coding system.

The deblocking filter is H.264. The image coding apparatus according to claim 4, wherein the image coding apparatus is a loop filter in a H.264 coding system.

An image encoding method comprising: an encoding step for encoding an encoding target image; and a decoding step for locally decoding the encoded data encoded in the encoding step,
A target code amount determining step for determining a target code amount of the encoding target image;
A calculation step of calculating a block distortion degree using the decoded image signal output from the decoding step;
A target code amount control step of determining the degree of block distortion calculated by the calculation step and controlling a target code amount determined by the target code amount determining means according to the block distortion degree. An image encoding method characterized by the above.

A program for causing a computer to execute an image encoding method including an encoding process for encoding an image to be encoded and a decoding process for locally decoding the encoded data encoded in the encoding process. And
A target code amount determining step for determining a target code amount of the encoding target image;
A calculation step of calculating a block distortion degree using the decoded image signal output from the decoding step;
A target code amount control step of determining the degree of block distortion calculated by the calculation step and controlling a target code amount determined by the target code amount determining means according to the block distortion degree. A program for causing a computer to execute an image encoding method.