JPH08125990A

JPH08125990A - Encoding device and decoding device

Info

Publication number: JPH08125990A
Application number: JP6254895A
Authority: JP
Inventors: Toru Katsumata; 徹勝又; Jo Matsui; 丈松井
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1994-10-20
Filing date: 1994-10-20
Publication date: 1996-05-17

Abstract

PURPOSE: To provide an encoding device and decoding device with which cost can be reduced by reducing hardware scale and signal degradation can be reduced. CONSTITUTION: At an encoding system, a signal SA is converted into a linear PCM signal by an A/D converter 11 and parallelly supplied to a telephone voice encoder 12 and low bit rate voice encoder 15 later. At a selector 13, either the output of the encoder 12 or the output of the encoder 15 is selected and defined as transmitting data DA based on the information of the decoding ability of an opposite side terminal or the like. At a decoding system, the data DA are parallelly supplied to a telephone voice decoder 24 and a low bit rate voice decoder 22. At a selector 21, either the output of the decoder 24 or the output of the decoder 22 is selected and the signal SA is provided based on the information of the encoding system of the data DA. When selecting any one encoding system at the encoding system or the decoding system, the processing of encoding or decoding related to the other encoding system is not required, cost can be reduced by reducing the hardware scale by reducing the arithmetic amount of a DSP and sound quality degradation can be reduced.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】この発明は、例えばテレビ会議装
置の複数の符号化方式に対応したオーディオデータ処理
部（音声コーディック）に適用して好適な符号化装置お
よび復号化装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an encoding device and a decoding device suitable for application to, for example, an audio data processing unit (voice codec) corresponding to a plurality of encoding methods of a video conference apparatus.

【０００２】[0002]

【従来の技術】図６は、従来のテレビ会議装置の構成例
を示している。図において、撮像装置１より出力される
ビデオ信号（輝度信号、色差信号）ＳＶは画像データ処
理部２に供給される。この画像データ処理部２の動作
は、システムコントローラを構成するＣＰＵ３によって
制御される。画像データ処理部２では、アナログ信号か
らデジタル信号に変換された後に、ＣＣＩＴＴ（Comite
Consultatif International Telegraphique et Teleph
onque）、Ｈ．２６１で規定されたフォーマットでデー
タ圧縮されると共に、誤り訂正符号が付加されて画像デ
ータＤＶが形成される。2. Description of the Related Art FIG. 6 shows a configuration example of a conventional video conference apparatus. In the figure, a video signal (luminance signal, color difference signal) SV output from the image pickup apparatus 1 is supplied to the image data processing unit 2. The operation of the image data processing unit 2 is controlled by the CPU 3 which constitutes the system controller. In the image data processing unit 2, after conversion from an analog signal to a digital signal, CCITT (Comite
Consultatif International Telegraphique et Teleph
onque), H. The data is compressed in the format defined by H.261 and the error correction code is added to form the image data DV.

【０００３】マイク４からのオーディオ信号ＳＡはオー
ディオデータ処理部（音声コーディック）５に供給され
る。このオーディオデータ処理部５の動作は、ＣＰＵ３
によって制御される。オーディオデータ処理部５では、
アナログ信号からデジタル信号（線形ＰＣＭ信号）に変
換された後に、ＣＣＩＴＴ、Ｇ．７１１等に規定のフォ
ーマットでデータ圧縮されてオーディオデータＤＡが形
成される。The audio signal SA from the microphone 4 is supplied to an audio data processing section (voice codec) 5. The operation of the audio data processing unit 5 is performed by the CPU 3
Controlled by. In the audio data processing unit 5,
After being converted from an analog signal to a digital signal (linear PCM signal), CCITT, G.G. 711 and the like are data-compressed in a specified format to form audio data DA.

【０００４】画像データ処理部２より出力される画像デ
ータＤＶおよびオーディオデータ処理部５より出力され
るオーディオデータは多重分離部６に供給される。この
多重分離部６の動作はＣＰＵ３によって制御される。多
重分離部６では、画像データＤＶおよびオーディオデー
タＤＡがＣＣＩＴＴ、Ｈ．２２１に規定されたフォーマ
ットで多重化されて多重化データＤＭＵが形成される。
この多重化データＤＭＵは回線インタフェース７および
回線８を介して通話対象たる相手端末に送出される。The image data DV output from the image data processing unit 2 and the audio data output from the audio data processing unit 5 are supplied to the demultiplexing unit 6. The operation of the demultiplexing unit 6 is controlled by the CPU 3. In the demultiplexing unit 6, the image data DV and the audio data DA are CCITT, H.264 and H.264. Multiplexed data DMU is formed by being multiplexed in the format defined by H.221.
This multiplexed data DMU is sent to the other terminal which is the communication target through the line interface 7 and the line 8.

【０００５】回線８は、例えばＩＳＤＮ（サービス統合
デジタル網）の１つであるＩＮＳネット６４（Informat
ion Network System Net 64）の６４［ｋｂｐｓ］の回
線、あるいはＩＳＤＮの３８４［ｋｂｐｓ］、１５３６
［ｋｂｐｓ］、１９２０［ｋｂｐｓ］の回線である。The line 8 is, for example, an INS net 64 (Informat) which is one of ISDN (Service Integrated Digital Network).
64 [kbps] line of ion Network System Net 64), or ISDN 384 [kbps], 1536
The lines are [kbps] and 1920 [kbps].

【０００６】ここで、ＣＣＩＴＴ、Ｈ．２２１に規定さ
れたフォーマットの詳細を説明する。このフォーマット
は伝送速度に応じて規定されている。伝送速度が６４
［ｋｂｐｓ］の回線を複数使用する場合には各回線のチ
ャネルはＢチャネルと規定される。伝送速度が３８４
［ｋｂｐｓ］の回線を複数使用する場合には各回線のチ
ャネルはＨ₀と規定され、さらに伝送速度が１５３６
［ｋｂｐｓ］および１９２０［ｋｂｐｓ］の回線を使用
する場合にはそれぞれＨ₁₁チャネルおよびＨ₁₂チャネル
と規定される。Here, CCITT, H.M. The details of the format defined in 221 will be described. This format is specified according to the transmission speed. Transmission speed is 64
When a plurality of [kbps] lines are used, the channel of each line is defined as the B channel. Transmission speed is 384
When a plurality of [kbps] lines are used, the channel of each line is defined as H ₀ , and the transmission rate is 1536.
When the lines of [kbps] and 1920 [kbps] are used, they are defined as H ₁₁ channel and H ₁₂ channel, respectively.

【０００７】このフォーマットにおいて、各チャネルで
は、連続する２フレームを集めて１サブマルチフレーム
が構成され、連続する８サブマルチフレーム（１６フレ
ーム）を集めて１マルチフレームが構成される。図７は
Ｂチャネルのフレーム構成とビット列の関係を示してお
り、図８はＢチャネルのフレーム構成を示している。こ
のＢチャネルにおいては、８ビット（１２５μｓ）のシ
リアルデータが１０ｍｓ連続して１フレームのデータ
（６４０ビット）が形成される。In this format, in each channel, two consecutive frames are collected to form one sub-multiframe, and eight consecutive subframes (16 frames) are collected to form one multiframe. FIG. 7 shows the relationship between the frame structure of the B channel and the bit string, and FIG. 8 shows the frame structure of the B channel. In this B channel, 8-bit (125 μs) serial data is continuously formed for 10 ms to form one frame of data (640 bits).

【０００８】この場合、８ビット単位のデータがオクテ
ット番号で表され、この８ビットの各ビット列がサブチ
ャネルで表される。第８サブチャネルはサービスチャネ
ル（ＳＣ）と呼ばれ、第１〜第８ビットにフレーム同期
信号ＦＡＳ（Frame Alignment Signal）が配置されると
共に、第９〜第１６ビットにビットレート割当信号ＢＡ
Ｓ（Bitrate Allocation Signal）が配置される。In this case, 8-bit unit data is represented by an octet number, and each 8-bit bit string is represented by a sub-channel. The eighth sub-channel is called a service channel (SC), and a frame synchronization signal FAS (Frame Alignment Signal) is arranged in the first to eighth bits and a bit rate allocation signal BA in the ninth to sixteenth bits.
S (Bitrate Allocation Signal) is arranged.

【０００９】図９はフレーム同期信号ＦＡＳの構造を示
している。同期符号は、偶数フレームの７ビット“００
１１０１１”と奇数フレームの１ビット“１”との合計
８ビットで構成される。偶数フレームの最初の同期ビッ
トと奇数フレームの同期ビットが反転しているのは、他
の信号による同期符号のエミュレーションを避けるため
である。FIG. 9 shows the structure of the frame synchronization signal FAS. The synchronization code is 7 bits "00" of the even frame.
It is composed of a total of 8 bits, 11011 "and 1 bit of the odd frame. The first sync bit of the even frame and the sync bit of the odd frame are inverted to emulate the sync code by another signal. This is to avoid

【００１０】同期符号が以上のように構成されているこ
とから、連続するデータ列を８ビット周期でサンプリン
グして“００１１０１１”のビットパターンを検出する
ことで、偶数フレームの同期信号のタイミングを検出で
き、この検出結果に基づいて各フレーム内データのバイ
トバウンダリの検出が可能となる。また、偶数フレーム
の同期信号のタイミング検出結果に基づき、回線間の位
相ずれを最大１０ｍｓまで補正し得る。Since the sync code is configured as described above, the timing of the sync signal of the even frame is detected by sampling the continuous data string at the 8-bit cycle and detecting the bit pattern of "0011011". It is possible to detect the byte boundary of the data in each frame based on the detection result. Further, the phase shift between the lines can be corrected up to 10 ms based on the timing detection result of the synchronization signal of the even frames.

【００１１】なお、Ａビットは、自己の受信するマルチ
フレーム同期が確率しているか否かを相手側に知らせる
ためのもので、通信手順に活用される。Ｃ₁〜Ｃ₄ビット
は、ＣＲＣ（Cyclic Redundancy Check）のためのビッ
トで、回線品質のモニタに使用される他、通信信号の中
にフレーム同期と同じパターンが存在してその位置にフ
レーム同期が誤って引き込んだ状態から抜け出すために
も使用される。ＥビットはＣ₁〜Ｃ₄ビットの使用法の表
示に使用される。The A-bit is used to inform the other party of whether or not there is a probability of synchronization of multi-frames received by itself, and is used in the communication procedure. C ₁ -C ₄ bits, a bit for CRC (Cyclic Redundancy Check), in addition to being used to monitor line quality, frame synchronization to the position the same pattern as the frame synchronization in a communication signal is present It is also used to get out of the accidentally retracted state. E bit is used to display the C ₁ -C ₄ bit usage.

【００１２】図１０はマルチフレームにおけるフレーム
同期信号ＦＡＳの構造を示している。奇数フレームの第
１ビットは、第１のサブマルチフレームから“００１０
１１”となる。そのため、奇数フレームのサービスチャ
ネルＳＣの第１ビットをサンプリングして“００１０１
１”のビットパターンを検出することで、マルチフレー
ム内における各サブマルチフレームのタイミングを検出
できる。この検出結果に基づき、回線間の位相ずれを最
大８０ｍｓまで補正し得る。FIG. 10 shows the structure of the frame synchronization signal FAS in multi-frame. The first bit of the odd frame is "0010" from the first sub-multiframe.
11 ". Therefore, the first bit of the service channel SC of the odd-numbered frame is sampled and" 00101 "is sampled.
The timing of each sub-multiframe in the multiframe can be detected by detecting the bit pattern of 1 ". Based on this detection result, the phase shift between the lines can be corrected up to 80 ms.

【００１３】なお、Ｌ₁〜Ｌ₃ビットは複数チャネル時の
チャネル番号表示に用いられる。Ｎ ₁〜Ｎ₄ビットは、複
数チャネル構成の際、各マルチフレームに４ビットの番
号（１１１１〜００００）をつけるためのもので、これ
に基づいて最大１．２８秒（＝１６０×１６／２ｍｓ）
までの相対遅延を補正し得る。Ｎ₅ビットはマルチフレ
ームが番号づけされているか否かを示すために使用され
る。Ｒビットは将来使用のためのビットであり、ＴＥＡ
ビットは端末装置の内部に故障があるか否かを表示する
ために使用される。Incidentally, L₁~ L₃Bit is for multiple channels
It is used to display the channel number. N ₁~ N_FourBit is compound
When configuring several channels, each multiframe has a 4-bit number.
This is for attaching the number (1111-0000).
Maximum 1.28 seconds (= 160 × 16 / 2ms)
Can be corrected for relative delays up to. N_FiveBit is multi-frame
Used to indicate whether the system is numbered or not.
You. The R bit is a bit for future use.
The bit indicates whether there is a failure inside the terminal
Used for.

【００１４】図１１はビットレート割当信号ＢＡＳの構
成を示している。ビットレート割当信号ＢＡＳの物理的
な符号は８ビットで構成され、最初の３ビット（ｂ₀〜
ｂ₂）は属性を表現し、残りの５ビット（ｂ₃〜ｂ₇）を
表現するようにされる。属性としては、オーディオ符号
化コマンド、転送レートコマンド、ビデオとその他のコ
マンド、データコマンド、端末能力１，２などが定義さ
れている。例えば、オーディオ符号化コマンドで送信デ
ータの符号化方式の情報が送信され、端末能力によって
どの符号化方式の復号が可能であるかという復号能力の
情報が送信される。ＢＡＳ符号コードが誤るとシステム
全体の動作が送受で不整合となるため、８ビット符号に
８ビットのパリティをつけて２ビットの誤り訂正を行う
ことで保護される。この場合、情報は偶数フレームでパ
リティは奇数フレームで送信される。FIG. 11 shows the structure of the bit rate allocation signal BAS. The physical code of the bit rate allocation signal BAS is composed of 8 bits, and the first 3 bits (b ₀ ~
b ₂ ) represents an attribute, and the remaining 5 bits (b _{3 to} b ₇ ) are expressed. Audio encoding commands, transfer rate commands, video and other commands, data commands, terminal capabilities 1 and 2 are defined as the attributes. For example, the audio encoding command transmits the information about the encoding method of the transmission data, and the information about the decoding ability indicating which encoding method can be decoded depending on the terminal ability. If the BAS code is erroneous, the operation of the entire system becomes inconsistent in transmission and reception. Therefore, protection is performed by adding an 8-bit parity to the 8-bit code and performing 2-bit error correction. In this case, information is sent in even frames and parity in odd frames.

【００１５】Ｂチャネルを使用してデータ伝送する場
合、テレビ会議装置は最大６回線の範囲で所望の回線数
だけ回線を接続し、この接続された回線に同時並列的に
６４［ｋｂｐｓ］のデータが送出される。図１２は、Ｂ
チャネルを２回線（１Ｂ，２Ｂ）使用する場合の伝送フ
ォーマット例を示している。この場合、Ｈ．２２１で規
定されたフォーマットに従ってオーディオデータおよび
画像データ（ビデオデータ）の領域が割り当てられる。
なお、「ＣＰＵ」の記号は、自端末のシステムコントロ
ーラであるＣＰＵ３と相手端末のシステムコントローラ
（ＣＰＵ）との間で送受するデータを表している。In the case of data transmission using the B channel, the video conferencing apparatus connects as many lines as the desired number of lines within the maximum of 6 lines, and simultaneously connects the connected lines in parallel with 64 [kbps] data. Is sent. FIG. 12 shows B
An example of a transmission format when two channels (1B, 2B) are used is shown. In this case, H.264. Areas of audio data and image data (video data) are allocated according to the format defined by 221.
The symbol “CPU” represents data transmitted and received between the CPU 3 which is the system controller of the own terminal and the system controller (CPU) of the other terminal.

【００１６】図１３はＨ₀チャネルのフレーム構成を示
している。Ｈ₀チャネルは、１フレームがＢチャネルの
６フレームに相当するように構成され、４８ビット（１
２５μｓ）のシリアルデータが１０ｍｓ連続して１フレ
ームのデータが形成される。４８ビットの各ビット列が
サブチャネルで表される。そして、第８サブチャネルが
サービスチャネル（ＳＣ）に割り当てられる。第１〜第
８ビットにフレーム同期信号ＦＡＳが配置され、第９〜
第１６ビットにビットレート割当信号ＢＡＳが配置され
る。このＨ₀チャネルは、Ｂチャネルの場合と同様に複
数回線を接続して所望のデータ伝送が行われる。FIG. 13 shows the frame structure of the H ₀ channel. The H ₀ channel is configured such that one frame corresponds to 6 frames of the B channel and has 48 bits (1
25 μs) of serial data is continuously formed for 10 ms to form one frame of data. Each 48-bit bit string is represented by a sub-channel. Then, the eighth sub-channel is assigned to the service channel (SC). The frame synchronization signal FAS is arranged in the first to eighth bits, and the ninth to
The bit rate allocation signal BAS is arranged in the 16th bit. In the H ₀ channel, desired data transmission is performed by connecting a plurality of lines as in the case of the B channel.

【００１７】また、Ｈ₁₁チャネル、Ｈ₁₂チャネルは、Ｈ
₀チャネルの場合と同様に、それぞれ１フレームがＢチ
ャネルの２４フレーム、３０フレームに相当するように
構成され、１２５μｓの１９２ビット、２４０ビットの
シリアルデータが１０ｍｓ連続して１フレームのデータ
が形成される。これにより、それぞれ１５３６［ｋｂｐ
ｓ］，１９２０［ｋｂｐｓ］の伝送速度でデータ伝送し
得るようになされている。The H ₁₁ channel and the H ₁₂ channel are H
Similar to the case of ₀ channel, one frame is configured to correspond to 24 frames and 30 frames of B channel, respectively, and 125 μs of 192 bits and 240 bits of serial data are continuously formed for 1 ms to form one frame of data. It This results in 1536 [kbp
[s], 1920 [kbps].

【００１８】図６に戻って、相手端末から回線８で伝送
されてくる多重化データＤＭＵは回線インタフェース７
を介して多重分離部６に供給される。多重分離部６で
は、上述したフレーム同期信号ＦＡＳによって各フレー
ム内のビットバウンダリが正しく検出され、それに基づ
いて多重化データＤＭＵより画像データＤＶ、オーディ
オデータＤＡが分離される。また、多重化データＤＭＵ
よりビットレート割当信号ＢＡＳも分離され、このビッ
トレート割当信号ＢＡＳはＣＰＵ３に供給され、画像デ
ータ処理部２、オーディオデータ処理部５の制御等に使
用される。Returning to FIG. 6, the multiplexed data DMU transmitted from the partner terminal on the line 8 is the line interface 7
Is supplied to the demultiplexing unit 6 via. The demultiplexing unit 6 correctly detects the bit boundary in each frame by the frame synchronization signal FAS described above, and based on this, the image data DV and the audio data DA are separated from the multiplexed data DMU. Also, multiplexed data DMU
The bit rate allocation signal BAS is also separated, and this bit rate allocation signal BAS is supplied to the CPU 3 and used for controlling the image data processing unit 2, the audio data processing unit 5, and the like.

【００１９】多重分離部６で分離された画像データＤＶ
は画像データ処理部２に供給されて復号化処理や誤り訂
正処理等が行われてビデオ信号ＳＶが形成される。そし
て、このビデオ信号ＳＶがモニタテレビ９に供給されて
相手端末側の画像が表示される。また、多重分離部６で
分離されたオーディオデータＤＡはオーディオデータ処
理部５に供給されて復号化処理等が行われてオーディオ
信号ＳＡが形成される。そして、このオーディオ信号Ｓ
Ａがスピーカ１０に供給されて相手端末側の音声が出力
される。Image data DV separated by the demultiplexer 6
Is supplied to the image data processing unit 2 to be subjected to decoding processing, error correction processing and the like to form a video signal SV. Then, the video signal SV is supplied to the monitor television 9 and the image on the partner terminal side is displayed. The audio data DA separated by the demultiplexing unit 6 is supplied to the audio data processing unit 5 and subjected to decoding processing or the like to form an audio signal SA. And this audio signal S
A is supplied to the speaker 10 and the voice of the partner terminal is output.

【００２０】図１４は、オーディオデータ処理部５の構
成例を示している。まず、符号化系の構成を説明する。
マイク４からのオーディオ信号ＳＡはＡ／Ｄ変換器１１
で線形ＰＣＭ信号（デジタル信号）に変換された後に電
話音声符号器１２で符号化される。電話音声符号化方式
として、例えばビットレートが最大６４［ｋｂｐｓ］、
再生周波数上限が３．４［ｋＨｚ］であるＴＴＣ標準Ｊ
Ｔ−Ｇ７１１（以下、「Ｇ．７１１」という）が採用さ
れる。そして、符号器１２より出力されるオーディオデ
ータ（符号コード）はセレクタ１３に供給されると共
に、電話音声復号器（Ｇ．７１１復号器）１４に供給さ
れて復号化される。FIG. 14 shows an example of the configuration of the audio data processing section 5. First, the configuration of the coding system will be described.
The audio signal SA from the microphone 4 is A / D converter 11
Is converted to a linear PCM signal (digital signal) by the telephone voice encoder 12. As a telephone voice encoding method, for example, a maximum bit rate is 64 [kbps],
TTC standard J whose reproduction frequency upper limit is 3.4 [kHz]
T-G711 (hereinafter referred to as "G.711") is adopted. The audio data (code code) output from the encoder 12 is supplied to the selector 13 and also to the telephone voice decoder (G.711 decoder) 14 to be decoded.

【００２１】復号器１４より出力されるで線形ＰＣＭ信
号は低ビットレート音声符号器１５に供給されて符号化
される。低ビットレート音声符号化方式として、例えば
ビットレートが１６［ｋｂｐｓ］、再生周波数上限が
３．４［ｋＨｚ］であるＴＴＣ標準ＪＴ−Ｇ７２８（以
下、「Ｇ．７２８」という）が採用される。そして、符
号器１５より出力されるオーディオデータ（符号コー
ド）はセレクタ１３に供給される。セレクタ１３にはＣ
ＰＵ３より制御信号ＳＣａが供給され、相手端末（通話
対象）のビットレート割当信号ＢＡＳに含まれる復号能
力の情報等に基づき、符号器１２，１５の出力オーディ
オデータのいずれかが選択されてオーディオデータＤＡ
として出力される。The linear PCM signal output from the decoder 14 is supplied to the low bit rate speech encoder 15 and encoded. As the low bit rate audio encoding method, for example, TTC standard JT-G728 (hereinafter referred to as “G.728”) having a bit rate of 16 [kbps] and a reproduction frequency upper limit of 3.4 [kHz] is adopted. The audio data (code code) output from the encoder 15 is supplied to the selector 13. C for selector 13
The control signal SCa is supplied from the PU3, and one of the output audio data of the encoders 12 and 15 is selected based on the decoding capability information included in the bit rate allocation signal BAS of the partner terminal (call target) to select the audio data. DA
Is output as

【００２２】次に、復号化系の構成を説明する。多重分
離部６からのオーディオデータＤＡはセレクタ２１に供
給される。また、多重分離部６からのオーディオデータ
ＤＡは低ビットレート音声復号器（Ｇ．７２８復号器）
２２に供給され、この復号器２２の出力信号は電話音声
符号器（Ｇ．７１１符号器）２３に供給される。この場
合、オーディオデータＤＡが低ビットレート音声符号器
（Ｇ．７２８符号器）で符号化されたものであるときは
復号器２２で復号化されて線形ＰＣＭ信号が得られ、こ
の線形ＰＣＭ信号が符号器２３で符号化された後にセレ
クタ２１に供給される。Next, the structure of the decoding system will be described. The audio data DA from the demultiplexer 6 is supplied to the selector 21. The audio data DA from the demultiplexing unit 6 is a low bit rate audio decoder (G.728 decoder).
22 and the output signal of the decoder 22 is supplied to a telephone speech coder (G.711 coder) 23. In this case, when the audio data DA is encoded by the low bit rate speech encoder (G.728 encoder), it is decoded by the decoder 22 to obtain a linear PCM signal, and this linear PCM signal is obtained. It is supplied to the selector 21 after being encoded by the encoder 23.

【００２３】セレクタ２１にはＣＰＵ３より制御信号Ｓ
Ｃｂが供給され、相手端末のビットレート割当信号ＢＡ
Ｓに含まれるオーディオデータＤＡの符号化方式の情報
に基づき、多重分離部６からのオーディオデータＤＡお
よび符号器２３の出力オーディオデータのいずれかが選
択される。すなわち、オーディオデータＤＡが電話音声
符号器（Ｇ．７１１符号器）で符号化されたものである
ときは、オーディオデータＤＡそのものが選択され、一
方オーディオデータＤＡが低ビットレート音声符号器
（Ｇ．７２８符号器）で符号化されたものであるとき
は、符号器２３の出力オーディオデータが選択される。
セレクタ２１で選択されるオーディオデータは電話音声
復号器（Ｇ．７１１復号器）２４に供給されて復号化さ
れた後にＤ／Ａ変換器２５でアナログ信号に変換されて
オーディオ信号ＳＡとして出力される。A control signal S is sent from the CPU 3 to the selector 21.
Cb is supplied and the bit rate allocation signal BA of the partner terminal
Either the audio data DA from the demultiplexing unit 6 or the output audio data of the encoder 23 is selected based on the information on the encoding system of the audio data DA included in S. That is, when the audio data DA is encoded by the telephone voice encoder (G.711 encoder), the audio data DA itself is selected, while the audio data DA is the low bit rate voice encoder (G.711). 728 encoder), the output audio data of the encoder 23 is selected.
The audio data selected by the selector 21 is supplied to the telephone voice decoder (G.711 decoder) 24 and decoded, and then converted into an analog signal by the D / A converter 25 and output as an audio signal SA. .

【００２４】図１５は、電話音声符号器１２としての
Ｇ．７１１符号器の構成例を示している。図において、
線形ＰＣＭ信号はＡ則対数量子化器１２１およびμ則対
数量子化器１２２にそれぞれ供給されて対数量子化され
た後にセレクタ１２３に供給される。セレクタ１２３に
はＣＰＵ３より制御信号ＳＣｃが供給され（図１４には
制御ラインは図示せず）、相手端末のビットレート割当
信号ＢＡＳに含まれる復号化能力の情報等に基づき、量
子化器１２１，１２２の出力データのいずれかが選択さ
れてＧ．７１１符号として出力される。FIG. 15 is a diagram illustrating a G.264 as a telephone voice encoder 12. The structural example of a 711 encoder is shown. In the figure,
The linear PCM signal is supplied to the A-law logarithmic quantizer 121 and the μ-law logarithmic quantizer 122, respectively, logarithmically quantized, and then supplied to the selector 123. A control signal SCc is supplied from the CPU 3 to the selector 123 (a control line is not shown in FIG. 14), and the quantizer 121, based on the decoding capability information included in the bit rate allocation signal BAS of the other terminal. One of the output data of G.122 is selected, and G.122. It is output as a 711 code.

【００２５】また、図１６は、電話音声復号器１４とし
てのＧ．７１１復号器の構成例を示している。図におい
て、Ａ則またはμ則のＧ．７１１符号はＡ則逆対数量子
化器１４１およびμ則逆対数量子化器１４２にそれぞれ
供給されて逆対数量子化（線形再現）された後にセレク
タ１４３に供給される。セレクタ１４３にはＣＰＵ３よ
り制御信号ＳＣｄが供給され（図１４には制御ラインは
図示せず）、量子化器１４１，１４２の出力データのい
ずれかが選択されて線形ＰＣＭ信号として出力される。
すなわち、セレクタ１４３では、Ｇ．７１１符号がＡ則
のものであるときは量子化器１４１の出力データが選択
され、一方Ｇ．７１１符号がμ則のものであるときは量
子化器１４２の出力データが選択される。Further, FIG. 11 shows a configuration example of a 711 decoder. In the figure, the G. The 711 code is supplied to the A-law antilogarithmic quantizer 141 and the μ-law antilogarithmic quantizer 142, respectively, and subjected to antilogarithmic quantization (linear reproduction) and then supplied to the selector 143. A control signal SCd is supplied from the CPU 3 to the selector 143 (a control line is not shown in FIG. 14), one of the output data of the quantizers 141 and 142 is selected and output as a linear PCM signal.
That is, in the selector 143, the G. When the 711 code is according to the A-law, the output data of the quantizer 141 is selected, while the G.711. When the 711 code is μ-law, the output data of the quantizer 142 is selected.

【００２６】なお、詳細説明は省略するが、復号化系の
電話音声符号器（Ｇ．７１１符号器）２３は図１５に示
した電話音声符号器１２と同様に構成され、電話音声復
号器（Ｇ．７１１復号器）２４は図１６に示した電話音
声復号器１４と同様に構成される。Although not described in detail, the decoding system telephone voice encoder (G.711 encoder) 23 has the same configuration as the telephone voice encoder 12 shown in FIG. G.711 decoder) 24 is configured similarly to the telephone voice decoder 14 shown in FIG.

【００２７】次に、図１７は、低ビットレート音声符号
器１５としてのＧ．７２８符号器（ＬＤ−ＣＥＬＰ符号
器）の構成例を示している。図において、１５１は線形
ＰＣＭ信号が供給されるベクトルバッファであり、この
ベクトルバッファ１５１では５つの連続する音声サンプ
ルがバッファリングされ、５次元の音声ベクトルが構成
される。１５２は入力信号ベクトルおよび後述する合成
フィルタの出力ベクトルに対して周波数の重み付けをす
る聴覚重み付けフィルタである。Next, FIG. 17 shows the G.264 as the low bit rate speech encoder 15. The structural example of the 728 encoder (LD-CELP encoder) is shown. In the figure, 151 is a vector buffer to which a linear PCM signal is supplied. In this vector buffer 151, five consecutive audio samples are buffered to form a five-dimensional audio vector. Reference numeral 152 is a perceptual weighting filter for weighting the frequency of the input signal vector and the output vector of the synthesis filter described later.

【００２８】１５３は聴覚重み付けされた入力信号ベク
トルと合成フィルタの出力ベクトルとの差分に対して、
１０２４個（１０ビット）の量子化誤差信号ベクトルの
候補の中から自乗平均誤差が最小となるものを選択し、
Ｇ．７２８符号としてのＶＱインデックス（１６［ｋｂ
ｐｓ］）を得るための最小自乗平均誤差ユニットであ
る。１５４は１０２４個の量子化誤差信号ベクトルのテ
ーブルとしての励振ＶＱコードブックである。１５５は
選択された量子化誤差信号ベクトルの利得を調整する利
得調整ユニットである。153 is the difference between the perceptually weighted input signal vector and the output vector of the synthesis filter,
Of the 1024 (10-bit) quantized error signal vector candidates, the one having the smallest root mean square error is selected,
G. VQ index (16 [kb
ps]) is the least mean square error unit. Reference numeral 154 is an excitation VQ codebook as a table of 1024 quantized error signal vectors. Reference numeral 155 is a gain adjustment unit that adjusts the gain of the selected quantization error signal vector.

【００２９】１５６は利得調整ユニット１５５の利得係
数をバックワード適応により更新するバックワード利得
適応器である。１５７は利得調整された量子化誤差信号
ベクトルから量子化音声ベクトルを得るための合成フィ
ルタである。１５８は合成フィルタ１５７により得られ
る量子化音声ベクトルからバックワード適応により合成
フィルタ１５７の係数を更新するバックワード合成フィ
ルタ適応器である。１５９は入力ベクトルと合成フィル
タ１５７の出力ベクトルとの差をとるための減算器であ
る。Reference numeral 156 is a backward gain adaptor for updating the gain coefficient of the gain adjusting unit 155 by backward adaptation. Reference numeral 157 is a synthesis filter for obtaining a quantized speech vector from the quantized error signal vector whose gain has been adjusted. Reference numeral 158 denotes a backward synthesis filter adaptive unit that updates the coefficient of the synthesis filter 157 by backward adaptation from the quantized speech vector obtained by the synthesis filter 157. Reference numeral 159 is a subtractor for calculating the difference between the input vector and the output vector of the synthesis filter 157.

【００３０】図１８は、低ビットレート音声復号器２２
としてのＧ．７２８復号器（ＬＤ−ＣＥＬＰ復号器）の
構成例を示している。図において、２２１は上述した符
号器１５の励振ＶＱコードブック１５４と同様の励振Ｖ
Ｑコードブック２２１である。このコードブック２２１
にＧ．７２８符号としてのＶＱインデックス（１６［ｋ
ｂｐｓ］）が供給されるとき、それに対応する量子化誤
差信号ベクトルが出力される。利得調整ユニット２２
２、バックワード利得適応器２２３、合成フィルタ２２
４およびバックワード合成フィルタ適応器２２５は、そ
れぞれ上述した符号器１５の利得調整ユニット１５５、
バックワード利得適応器１５６、合成フィルタ１５７お
よびバックワード合成フィルタ適応器１５８と同様のも
のである。FIG. 18 shows a low bit rate speech decoder 22.
As G. The structural example of the 728 decoder (LD-CELP decoder) is shown. In the figure, reference numeral 221 denotes an excitation VQ similar to the excitation VQ codebook 154 of the encoder 15 described above.
This is the Q code book 221. This codebook 221
To G. VQ index (16 [k
bps]) is supplied, the corresponding quantization error signal vector is output. Gain adjustment unit 22
2, backward gain adaptor 223, synthesis filter 22
4 and the backward synthesis filter adaptor 225 respectively include the gain adjustment unit 155 of the encoder 15 described above,
It is similar to the backward gain adaptor 156, the synthesis filter 157, and the backward synthesis filter adaptor 158.

【００３１】２２６は聴感上の品質を向上させるために
合成フィルタ２２４より出力される量子化音声ベクトル
に対してフィルタ処理をするポストフィルタである。２
２７はポストフィルタ２２６でフィルタ処理された５次
元の量子化音声ベクトルをそれぞれ対応する５サンプル
の線形ＰＣＭ信号に変換して出力するＰＣＭ変換ユニッ
トである。A post filter 226 filters the quantized voice vector output from the synthesis filter 224 in order to improve the perceptual quality. Two
Reference numeral 27 denotes a PCM conversion unit that converts the five-dimensional quantized speech vector filtered by the post filter 226 into a corresponding five-sample linear PCM signal and outputs the linear PCM signal.

【００３２】[0032]

【発明が解決しようとする課題】従来のオーディオデー
タ処理部５は図１４に示すように構成されており、音声
符号化方式として低ビットレート音声符号化方式が選択
されるときは、符号化系および復号化系のそれぞれにお
いて電話音声符号化方式に係る符号化および復号化の処
理が必要となるため、符号化や復号化の処理をするＤＳ
Ｐ（デジタル・シグナル・プロセッサ）の演算能力を浪
費するという問題点があった。そして、ＤＳＰによって
は、符号化に要する演算量が決定的な要因となり、実現
に必要なハードウェアの規模が大きくなり、コストアッ
プを招くという問題点があった。また、低ビットレート
音声符号化方式が選択されるときにも、電話音声符号化
方式に係る符号化、復号化による音声歪みが発生し、音
質劣化を招く等の問題点があった。The conventional audio data processing unit 5 is constructed as shown in FIG. 14, and when a low bit rate audio encoding system is selected as an audio encoding system, an encoding system is used. Since the encoding and decoding processes relating to the telephone voice encoding system are required in each of the audio and decoding systems, the DS for performing the encoding and decoding processes is required.
There has been a problem that the computing power of P (digital signal processor) is wasted. Then, depending on the DSP, the amount of calculation required for encoding becomes a decisive factor, the scale of hardware required for implementation becomes large, and there is a problem that the cost increases. Further, even when the low bit rate audio encoding method is selected, there is a problem that audio distortion occurs due to encoding and decoding according to the telephone audio encoding method, resulting in deterioration of sound quality.

【００３３】そこで、この発明では、符号化や復号化の
ためのＤＳＰの演算量を軽減してハードウェア規模の縮
小によるコストダウンを図ると共に、信号劣化を軽減し
得る符号化装置および復号化装置を提供するものであ
る。Therefore, according to the present invention, the amount of operations of the DSP for encoding and decoding is reduced to reduce the cost by reducing the hardware scale, and the encoding device and the decoding device capable of reducing the signal deterioration. Is provided.

【００３４】[0034]

【課題を解決するための手段】請求項１の発明に係る符
号化装置は、入力信号が並列的に供給される符号能力を
異にする複数の符号化手段と、この複数の符号化手段よ
り出力される符号コードより一の符号コードを選択して
出力するセレクタとを備えるものである。According to a first aspect of the present invention, there is provided a plurality of coding means having different coding capabilities for supplying input signals in parallel, and a plurality of coding means. And a selector for selecting and outputting one code code from the code codes to be output.

【００３５】請求項２の発明に係る符号化装置は、請求
項１の発明において、相手端末の復号能力を判別し、そ
の復号能力に応じてセレクタの動作を制御するセレクタ
制御手段を有するものである。An encoding apparatus according to a second aspect of the present invention is the encoding apparatus according to the first aspect of the present invention, which has selector control means for determining the decoding capability of the partner terminal and controlling the operation of the selector according to the decoding capability. is there.

【００３６】請求項３の発明に係る符号化装置は、請求
項１の発明において、複数の符号化手段は電話音声符号
器および低ビットレート音声符号器であり、これら電話
音声符号器および低ビットレート音声符号器には入力信
号として線形ＰＣＭ信号が並列的に供給されるものであ
る。According to a third aspect of the present invention, in the first aspect of the present invention, the plurality of encoding means are a telephone voice coder and a low bit rate voice coder. A linear PCM signal is supplied in parallel to the rate speech coder as an input signal.

【００３７】請求項４の発明に係る符号化装置は、請求
項１の発明において、複数の符号化手段は電話音声符号
器、低ビットレート音声符号器および高品位音声符号器
であり、これら電話音声符号器、低ビットレート音声符
号器および高品位音声符号器には入力信号として線形Ｐ
ＣＭ信号が並列的に供給されるものである。According to a fourth aspect of the present invention, in the first aspect of the present invention, the plurality of encoding means are a telephone voice coder, a low bit rate voice coder and a high quality voice coder. The speech encoder, the low bit rate speech encoder, and the high-quality speech encoder have a linear P as an input signal.
CM signals are supplied in parallel.

【００３８】請求項５の発明に係る復号化装置は、入力
符号コードが並列的に供給される復号能力を異にする複
数の復号化手段と、この複数の復号化手段より出力され
る復号化信号より一の復号化信号を選択して出力するセ
レクタと、入力符号コードの符号化方式に応じてセレク
タの動作を制御するセレクタ制御手段とを備えるもので
ある。According to a fifth aspect of the present invention, there is provided a decoding device in which a plurality of decoding means having different decoding capabilities are supplied with the input code code in parallel, and the decoding means outputs the decoding means. A selector for selecting and outputting one decoded signal from the signals and a selector control means for controlling the operation of the selector according to the encoding system of the input code are provided.

【００３９】請求項６の発明に係る復号化装置は、請求
項５の発明において、複数の復号化手段は電話音声復号
器および低ビットレート音声復号器であり、これら電話
音声復号器および低ビットレート音声復号器には入力符
号コードが並列的に供給されるものである。A decoding device according to a sixth aspect of the present invention is the decoding device according to the fifth aspect, wherein the plurality of decoding means are a telephone voice decoder and a low bit rate voice decoder. An input code code is supplied in parallel to the rate speech decoder.

【００４０】請求項７の発明に係る復号化装置は、請求
項６の発明において、複数の復号化手段は電話音声復号
器、低ビットレート音声復号器および高品位音声復号器
であり、これら電話音声復号器、低ビットレート音声復
号器および高品位音声復号器には入力符号コードが並列
的に供給されるものである。In the decoding device according to the invention of claim 7, in the invention of claim 6, the plurality of decoding means are a telephone voice decoder, a low bit rate voice decoder and a high quality voice decoder. The input code code is supplied in parallel to the speech decoder, the low bit rate speech decoder, and the high-quality speech decoder.

【００４１】[0041]

【作用】請求項１の発明においては、入力信号が符号能
力を異にする複数の符号化手段に並列的に供給され、そ
の複数の符号化手段の出力符号コードよりセレクタによ
って一の符号コードを選択して出力するものである。そ
のため、ある符号化方式を選択する場合、それに伴って
他の符号化方式に係る符号化や復号化の処理をする必要
がなく、ＤＳＰの符号化や復号化の演算量を軽減してハ
ードウェア規模の縮小によるコストダウンを図ると共
に、信号劣化を軽減することが可能となる。According to the present invention, an input signal is supplied in parallel to a plurality of coding means having different coding capabilities, and one code code is selected by a selector from output code codes of the plurality of coding means. It is selected and output. Therefore, when a certain encoding method is selected, it is not necessary to perform the encoding and decoding processing according to the other encoding method accordingly, and the operation amount of the DSP encoding and decoding is reduced to reduce the hardware. It is possible to reduce the cost by reducing the scale and reduce the signal deterioration.

【００４２】請求項２の発明においては、相手端末の復
号能力を判別し、その復号能力に応じてセレクタの動作
を制御するため、相手端末の復号能力に応じた適切な符
号化方式を自動的に選択することが可能となる。According to the second aspect of the present invention, the decoding capability of the partner terminal is discriminated and the operation of the selector is controlled in accordance with the decoding capability. Therefore, an appropriate encoding method according to the decoding capability of the partner terminal is automatically selected. Can be selected.

【００４３】請求項３の発明においては、電話音声符号
化方式および低ビットレート音声符号化方式に係る符号
化のいずれかを選択でき、その際に他の符号化方式に係
る符号化や復号化の処理をする必要がなく、ＤＳＰの演
算量を軽減してハードウェア規模の縮小によるコストダ
ウンを図ると共に、音質劣化を軽減することが可能とな
る。According to the third aspect of the present invention, it is possible to select either the telephone voice encoding system or the low bit rate voice encoding system, and at that time, the encoding or decoding of the other encoding system is selected. It is possible to reduce the amount of calculation of the DSP to reduce the cost by reducing the hardware scale and reduce the deterioration of the sound quality.

【００４４】請求項４の発明においては、電話音声符号
化方式、低ビットレート音声符号化方式および高品位音
声符号化方式に係る符号化のいずれかを選択でき、その
際に他の符号化方式に係る符号化や復号化の処理をする
必要がなく、ＤＳＰの符号化や復号化の演算量を軽減し
てハードウェア規模の縮小によるコストダウンを図ると
共に、音質劣化を軽減することが可能となる。In the invention of claim 4, it is possible to select one of the telephone voice coding system, the low bit rate voice coding system, and the coding relating to the high-quality voice coding system, and at that time, another coding system is selected. It is possible to reduce the amount of calculation for DSP encoding and decoding, reduce the cost by reducing the hardware scale, and reduce the deterioration of sound quality. Become.

【００４５】請求項５の発明においては、入力符号コー
ドが復号能力を異にする複数の復号化手段に並列的に供
給され、その複数の復号化手段の出力信号より入力符号
コードの符号化方式に応じて一の出力信号を選択して出
力するものである。そのため、ある符号化方式に係る復
号化を選択する場合、それに伴って他の符号化方式に係
る符号化や復号化の処理をする必要がなく、ＤＳＰの演
算量を軽減してハードウェア規模の縮小によるコストダ
ウンを図ると共に、信号劣化を軽減することが可能とな
る。According to the invention of claim 5, the input code code is supplied in parallel to a plurality of decoding means having different decoding capabilities, and the input code code is encoded from the output signals of the plurality of decoding means. One output signal is selected and output according to the above. Therefore, when the decoding according to a certain encoding method is selected, it is not necessary to perform the processing of encoding or decoding according to the other encoding method accordingly, which reduces the operation amount of the DSP and reduces the hardware scale. It is possible to reduce the cost due to the reduction and reduce the signal deterioration.

【００４６】請求項６の発明においては、電話音声符号
化方式および低ビットレート音声符号化方式に係る復号
化のいずれかを選択でき、その際に他の符号化方式に係
る符号化や復号化の処理をする必要がなく、ＤＳＰの演
算量を軽減してハードウェア規模の縮小によるコストダ
ウンを図ると共に、音質劣化を軽減することが可能とな
る。According to the sixth aspect of the present invention, it is possible to select either of the telephone voice encoding system and the low bit rate voice encoding system, and at that time, the encoding and decoding of other encoding systems are selected. It is possible to reduce the amount of calculation of the DSP to reduce the cost by reducing the hardware scale and reduce the deterioration of the sound quality.

【００４７】請求項７の発明においては、電話音声符号
化方式、低ビットレート音声符号化方式および高品位音
声符号化方式に係る復号化のいずれかを選択でき、その
際に他の符号化方式に係る符号化や復号化の処理をする
必要がなく、ＤＳＰの演算量を軽減してハードウェア規
模の縮小によるコストダウンを図ると共に、音質劣化を
軽減することが可能となる。In the invention of claim 7, it is possible to select any one of the telephone speech coding method, the low bit rate speech coding method and the decoding relating to the high-quality speech coding method, and at that time, another coding method is selected. Since it is not necessary to perform the encoding and decoding processing according to the above, it is possible to reduce the DSP calculation amount, reduce the cost by reducing the hardware scale, and reduce the sound quality deterioration.

【００４８】[0048]

【実施例】以下、図１を参照しながら、この発明の第１
実施例について説明する。本例は、テレビ会議装置のオ
ーディオデータ処理部５（図６参照）に適用した例であ
る。この図１において、図１４と対応する部分には同一
符号を付し、その詳細説明は省略する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS The first embodiment of the present invention will be described below with reference to FIG.
Examples will be described. This example is an example applied to the audio data processing unit 5 (see FIG. 6) of the video conference apparatus. In FIG. 1, parts corresponding to those in FIG. 14 are designated by the same reference numerals, and detailed description thereof will be omitted.

【００４９】まず、符号化系（送信側）の構成について
説明する。マイク４からのオーディオ信号ＳＡはＡ／Ｄ
変換器１１で線形ＰＣＭ信号（デジタル信号）に変換さ
れた後に電話音声符号器（Ｇ．７１１符号器）１２およ
び低ビットレート音声符号器（Ｇ．７２８符号器）１５
に並列的に供給されて符号化される。そして、符号器１
２，１５より出力されるオーディオデータ（符号コー
ド）はそれぞれセレクタ１３に供給される。セレクタ１
３にはＣＰＵ３より制御信号ＳＣａが供給され、相手端
末（通話対象）のビットレート割当信号ＢＡＳに含まれ
る復号能力の情報等で決定された送信符号化方式に基づ
き、符号器１２，１５の出力オーディオデータのいずれ
かが選択されてオーディオデータＤＡとして出力され
る。First, the configuration of the coding system (transmission side) will be described. The audio signal SA from the microphone 4 is A / D
After being converted into a linear PCM signal (digital signal) by the converter 11, a telephone voice encoder (G.711 encoder) 12 and a low bit rate voice encoder (G.728 encoder) 15
Are supplied in parallel to and encoded. And encoder 1
The audio data (code code) output from 2 and 15 are respectively supplied to the selector 13. Selector 1
The control signal SCa is supplied from the CPU 3 to the CPU 3, and the outputs of the encoders 12 and 15 are based on the transmission encoding method determined by the decoding capability information included in the bit rate allocation signal BAS of the partner terminal (call target). One of the audio data is selected and output as audio data DA.

【００５０】次に、復号化系（受信側）の構成を説明す
る。多重分離部６からのオーディオデータＤＡは電話音
声復号器（Ｇ．７１１復号器）２４および低ビットレー
ト音声復号器（Ｇ．７２８復号器）２２に並列的に供給
される。そして、復号器２４，２２の出力信号はそれぞ
れセレクタ２１に供給される。ここで、オーディオデー
タＤＡが電話音声符号器（Ｇ．７１１符号器）で符号化
されたものであるときは、復号器２４で復号化されて線
形ＰＣＭ信号が得られる。また、オーディオデータＤＡ
が低ビットレート音声符号器（Ｇ．７２８符号器）で符
号化されたものであるときは、復号器２２で復号化され
て線形ＰＣＭ信号が得られる。Next, the configuration of the decoding system (reception side) will be described. The audio data DA from the demultiplexing unit 6 is supplied in parallel to a telephone voice decoder (G.711 decoder) 24 and a low bit rate voice decoder (G.728 decoder) 22. The output signals of the decoders 24 and 22 are supplied to the selector 21, respectively. Here, when the audio data DA is encoded by the telephone voice encoder (G.711 encoder), it is decoded by the decoder 24 to obtain a linear PCM signal. Also, audio data DA
Is encoded by a low bit rate speech encoder (G.728 encoder), it is decoded by the decoder 22 to obtain a linear PCM signal.

【００５１】セレクタ２１にはＣＰＵ３より制御信号Ｓ
Ｃｂが供給され、相手端末のビットレート割当信号ＢＡ
Ｓに含まれるオーディオデータＤＡの符号化方式の情報
で決定された受信符号化方式に基づき、復号器２４，２
２の出力信号のいずれかが選択される。すなわち、オー
ディオデータＤＡが電話音声符号化方式で符号化された
ものであるときは、復号器２４より出力される線形ＰＣ
Ｍ信号が選択され、一方オーディオデータＤＡが低ビッ
トレート音声符号化方式で符号化されたものであるとき
は、復号器２２より出力される線形ＰＣＭ信号が選択さ
れる。セレクタ２１より出力される線形ＰＣＭ信号はＤ
／Ａ変換器２５でアナログ信号に変換されてオーディオ
信号ＳＡとして出力される。A control signal S from the CPU 3 is sent to the selector 21.
Cb is supplied and the bit rate allocation signal BA of the partner terminal
Decoders 24, 2 based on the reception encoding system determined by the encoding system information of audio data DA included in S
One of the two output signals is selected. That is, when the audio data DA is encoded by the telephone voice encoding system, the linear PC output from the decoder 24 is output.
When the M signal is selected, on the other hand, when the audio data DA is encoded by the low bit rate audio encoding system, the linear PCM signal output from the decoder 22 is selected. The linear PCM signal output from the selector 21 is D
The signal is converted into an analog signal by the / A converter 25 and output as an audio signal SA.

【００５２】ここで、送信符号化方式（自端末の符号器
側＝相手端末の復号器側）の決定方法について説明す
る。Here, a method of determining the transmission coding method (the encoder side of the own terminal = the decoder side of the partner terminal) will be described.

【００５３】１．相手端末からのビットレート割当信号
ＢＡＳの受信により、相手端末の復号能力を得る。例え
ば、次のように表現する。ここで、Ｇ．７２２−４８の
復号能力とはＧ．７２２の全てのモードを復号する能
力、Ｇ．７２２−６４の復号能力とはＧ．７２２の６４
［ｋｂｐｓ］モードのみを復号する能力、Ｇ．７１１−
Ａの復号能力とはＧ．７１１のＡ則を復号する能力、
Ｇ．７１１−μの復号能力とはＧ．７１１のμ則を復号
する能力である。Ｇ．７２８の復号能力：あり（１）なし（０）Ｇ．７２２−４８の復号能力：あり（１）なし（０）Ｇ．７２２−６４の復号能力：あり（１）なし（０）Ｇ．７１１−Ａの復号能力：あり（１）なし（０）Ｇ．７１１−μ の復号能力：あり（１）なし（０）1. By receiving the bit rate allocation signal BAS from the partner terminal, the decoding capability of the partner terminal is obtained. For example, it is expressed as follows. Here, G. What is the decoding capability of 722-48? Ability to decode all modes of G.722. What is the decoding capability of 722-64. 722 of 64
Ability to decode only [kbps] mode, G. 711-
What is the decoding capability of A.G. Ability to decode A-rule of 711,
G. What is the decoding capability of 711-μ? The ability to decode 711 μ-law. G. Decoding capability of 728: Yes (1) No (0) 722-48 decoding ability: Yes (1) No (0) 722-64 decoding ability: Yes (1) No (0) Decoding ability of 711-A: Yes (1) No (0) Decoding ability of 711-μ: Yes (1) No (0)

【００５４】２．自端末の符号能力についても、同様に
表現する。ここで、符号器についてはＧ．７２２のモー
ドの区別は存在しない。すなわち、Ｇ．７２２−６４が
１ならば、Ｇ．７２２−４８も１とする。Ｇ．７２８の符号能力：あり（１）なし（０）Ｇ．７２２−４８の符号能力：あり（１）なし（０）Ｇ．７２２−６４の符号能力：あり（１）なし（０）Ｇ．７１１−Ａの符号能力：あり（１）なし（０）Ｇ．７１１−μ の符号能力：あり（１）なし（０）2. The same applies to the coding capability of the terminal itself. Here, regarding the encoder, the G. There is no 722 mode distinction. That is, G. If 722-64 is 1, G. 722-48 is also set to 1. G. 728 code capability: Yes (1) No (0) 722-48 coding capability: Yes (1) No (0) 722-64 coding capability: Yes (1) No (0) 711-A coding capability: Yes (1) No (0) 711-μ coding ability: Yes (1) No (0)

【００５５】３．両端末で能力交換をする。すなわち、
対応する符号化方式どうしで論理積をとる。3. Exchange capabilities on both terminals. That is,
Performs a logical product between the corresponding encoding methods.

【００５６】４．Ｇ．７２８、Ｇ．７２２−４８、Ｇ．
７２２−６４、Ｇ．７１１−ＡおよびＧ．７１１−μを
それぞれビット４、ビット３、ビット２、ビット１およ
びビット０に対応させると、論理積結果に基づいて、図
２に示すように符号化方式が決定される。ここで、Ｇ．
７２８とＧ．７２２の能力を両端末が共有するとき、す
なわち１＊１＊＊のとき、本来は復号器側に選択権をも
たせることが望ましい。なぜならば、符号復号による品
質は受信側にのみ影響を及ぼすからでる。復号器側は、
画質を優先するならばＧ．７２８を、音質を優先するな
らばＧ．７２２を選択することになる。これを実現する
ためには、復号器側で選択したい能力でない方の符号化
方式を能力としてもたないものとして、再度能力交換を
行えばよい。例えば、画質を優先するならば、復号能力
を０＊１＊＊として再度能力交換を符号器側に要求す
る。4. G. 728, G.I. 722-48, G.I.
722-64, G.I. 711-A and G.G. When 711-μ is associated with bit 4, bit 3, bit 2, bit 1 and bit 0, respectively, the encoding method is determined based on the logical product result as shown in FIG. Here, G.
728 and G.H. When both terminals share the capability of 722, that is, in the case of 1 * 1 **, it is originally desirable that the decoder side has the selection right. This is because the quality of code decoding affects only the receiving side. On the decoder side,
If priority is given to image quality, G.I. If the sound quality is prioritized, G. 722 will be selected. In order to realize this, it is sufficient to perform the capability exchange again, assuming that the encoding method which is not the capability desired to be selected on the decoder side does not have the capability. For example, if the image quality is prioritized, the decoding capability is set to 0 * 1 ** and the capability exchange is requested again to the encoder side.

【００５７】５．一意に決定した符号化方式を、ビット
レート割当信号ＢＡＳ（オーディオ符号化コマンド）を
用いて相手端末（復号器側）に送信し、その符号化方式
でオーディオデータＤＡの送信を開始する。5. The encoding method uniquely determined is transmitted to the partner terminal (decoder side) using the bit rate allocation signal BAS (audio encoding command), and the transmission of the audio data DA is started by the encoding method.

【００５８】なお、Ｇ．７２２は高品位音声符号化方式
として採用される符号化方式であって、ビットレートが
最大６４［ｋｂｐｓ］、再生周波数上限が７．０［ｋＨ
ｚ］であるＴＴＣ標準ＪＴ−Ｇ７２２を表している。G. 722 is an encoding method adopted as a high-quality audio encoding method, in which the maximum bit rate is 64 [kbps] and the reproduction frequency upper limit is 7.0 [kHz.
z], which is the TTC standard JT-G722.

【００５９】次に、受信符号化方式（自端末の復号器側
＝相手端末の符号器側）の決定方法について説明する。１．自端末の復号能力を、ビットレート割当信号ＢＡＳ
（端末能力２）を用いて相手端末に送信する。２．相手端末により、自端末の送信符号化方式を決定し
た方法と同様の手順で、送信符号化方式が一意に決定さ
れる。３．一意に決定した符号化方式を、相手端末によるビッ
トレート割当信号ＢＡＳを用いて自端末（復号器側）に
送信され、その符号化方式を用いて、相手端末からのオ
ーディオデータＤＡを復号する。Next, a method of determining the reception coding method (decoder side of own terminal = coder side of partner terminal) will be described. 1. The decoding capability of the own terminal is set to the bit rate allocation signal BAS.
(Terminal capability 2) is used to transmit to the partner terminal. 2. The partner terminal uniquely determines the transmission coding scheme in the same procedure as the method of determining the transmission coding scheme of the own terminal. 3. The uniquely determined encoding method is transmitted to the terminal itself (decoder side) using the bit rate allocation signal BAS from the partner terminal, and the encoding data is used to decode the audio data DA from the partner terminal.

【００６０】図１の例においては、符号化系において、
オーディオ信号ＳＡがデジタル信号に変換された後に電
話音声符号器１２および低ビットレート音声符号器１５
に並列的に供給され、その符号器１２，１５の出力符号
コードのいずれかがセレクタ１３によって選択されてオ
ーディオデータＤＡとして出力される。また、復号化系
において、オーディオデータＤＡが電話音声復号器２４
および低ビットレート音声復号器２２に並列的に供給さ
れ、オーディオデータＤＡの符号化方式に応じて復号器
２４，２２の出力信号のいずれかがセレクタ２１で選択
されてオーディオ信号ＳＡとして出力される。したがっ
て、符号化系や復号化系で電話音声符号化方式および低
ビットレート音声符号化方式のいずれかの符号化方式が
選択される場合に、従来のように他の符号化方式に係る
符号化や復号化の処理をする必要がなく、ＤＳＰの符号
化や復号化の演算量を軽減してハードウェア規模の縮小
によるコストダウンを図ると共に、音質劣化を軽減する
ことができる。In the example of FIG. 1, in the coding system,
Telephone audio encoder 12 and low bit rate audio encoder 15 after audio signal SA has been converted to a digital signal.
Are supplied in parallel, and one of the output code codes of the encoders 12 and 15 is selected by the selector 13 and output as audio data DA. Also, in the decoding system, the audio data DA is the telephone voice decoder 24.
And the low bit rate audio decoder 22 are supplied in parallel, and one of the output signals of the decoders 24 and 22 is selected by the selector 21 according to the encoding system of the audio data DA and output as the audio signal SA. . Therefore, when any one of the telephone speech coding scheme and the low bit rate speech coding scheme is selected in the coding system and the decoding system, the coding relating to the other coding system as in the past is performed. Therefore, it is possible to reduce the amount of calculation for DSP encoding and decoding, to reduce the cost due to the reduction in the hardware scale, and to reduce the sound quality deterioration.

【００６１】また、符号化系のセレクタ１３の選択動作
は相手端末の復号能力等の情報に基づいて制御されるた
め、符号器１２，１５より出力される符号コードのうち
相手端末の復号能力に応じた適切な符号コードが自動的
に選択されてオーディオデータＤＡとして出力される利
益がある。Since the selecting operation of the selector 13 of the coding system is controlled based on the information such as the decoding capability of the partner terminal, the decoding capability of the partner terminal among the code codes output from the encoders 12 and 15 is determined. There is a benefit that an appropriate code code corresponding to the selected data is automatically selected and output as the audio data DA.

【００６２】次に、図３を参照しながら、この発明の第
２実施例について説明する。本例も、テレビ会議装置の
オーディオデータ処理部５（図６参照）に適用した例で
ある。この図３において、図１と対応する部分には同一
符号を付し、その詳細説明は省略する。Next, a second embodiment of the present invention will be described with reference to FIG. This example is also an example applied to the audio data processing unit 5 (see FIG. 6) of the video conference apparatus. 3, parts corresponding to those in FIG. 1 are designated by the same reference numerals, and detailed description thereof will be omitted.

【００６３】まず、符号化系（送信側）の構成について
説明する。マイク４からのオーディオ信号ＳＡはＡ／Ｄ
変換器１１Ａで線形ＰＣＭ信号（デジタル信号）に変換
された後に高品位音声符号器１６に供給されて符号化さ
れる。そして、符号器１６より出力されるオーディオデ
ータ（符号コード）はセレクタ１７に供給される。高品
位音声符号化方式として、例えばビットレートが最大６
４［ｋｂｐｓ］、再生周波数上限が７．０［ｋＨｚ］で
あるＴＴＣ標準ＪＴ−Ｇ７２２（以下、「Ｇ．７２２」
という）が採用される。Ａ／Ｄ変換器１１Ａでは、この
Ｇ．７２２の符号化方式の再生上限周波数に対応するサ
ンプリング周波数でもってオーディオ信号ＳＡのサンプ
リングが行われる。First, the configuration of the coding system (transmission side) will be described. The audio signal SA from the microphone 4 is A / D
After being converted into a linear PCM signal (digital signal) by the converter 11A, the linear PCM signal is supplied to the high-quality speech encoder 16 and encoded. Then, the audio data (code code) output from the encoder 16 is supplied to the selector 17. As a high-quality audio encoding method, for example, a maximum bit rate of 6
TTC standard JT-G722 (hereinafter, referred to as "G.722") having 4 [kbps] and a reproduction frequency upper limit of 7.0 [kHz].
Will be adopted. In the A / D converter 11A, the G. The audio signal SA is sampled at the sampling frequency corresponding to the reproduction upper limit frequency of the coding method 722.

【００６４】また、Ａ／Ｄ変換器１１Ａより出力される
線形ＰＣＭ信号はローパスフィルタ１８で帯域制限され
た後にダウンコンバータ１９に供給され、電話音声符号
化方式としてのＧ．７１１の符号化方式および低ビット
レート音声符号化方式としてのＧ．７２８の符号化方式
の再生上限周波数に対応するサンプルレートに変換され
た後に、電話音声符号器（Ｇ．７１１符号器）１２およ
び低ビットレート音声符号器（Ｇ．７２８符号器）１５
に並列的に供給されて符号化される。そして、符号器１
２，１５より出力されるオーディオデータ（符号コー
ド）はそれぞれセレクタ１７に供給される。Further, the linear PCM signal output from the A / D converter 11A is band-limited by the low-pass filter 18 and then supplied to the down converter 19, and G. G.711 as the encoding system and the low bit rate audio encoding system. After being converted to a sample rate corresponding to the reproduction upper limit frequency of the encoding system of G.728, the telephone voice encoder (G.711 encoder) 12 and the low bit rate voice encoder (G.728 encoder) 15
Are supplied in parallel to and encoded. And encoder 1
The audio data (code code) output from 2 and 15 are supplied to the selector 17, respectively.

【００６５】セレクタ１７にはＣＰＵ３より制御信号Ｓ
Ｃｅが供給され、相手端末（通話対象）のビットレート
割当信号ＢＡＳに含まれる復号能力の情報等で決定され
た送信符号化方式に基づき、符号器１６，１２，１５の
出力オーディオデータのいずれかが選択されてオーディ
オデータＤＡとして出力される。送信符号化方式の決定
方法は図１の例で述べたと同様にして行われる。A control signal S is sent from the CPU 3 to the selector 17.
One of the audio data output from the encoders 16, 12, and 15 based on the transmission encoding method that is supplied with Ce and is determined by the decoding capability information included in the bit rate allocation signal BAS of the other terminal (call target). Is selected and output as audio data DA. The method of determining the transmission coding method is performed in the same manner as described in the example of FIG.

【００６６】次に、復号化系（受信側）の構成を説明す
る。多重分離部６からのオーディオデータＤＡは、高品
位音声復号器（Ｇ．７２２復号器）２６、電話音声復号
器（Ｇ．７１１復号器）２４および低ビットレート音声
復号器（Ｇ．７２８復号器）２２に並列的に供給され
る。ここで、オーディオデータＤＡが高品位音声符号器
（Ｇ．７２２符号器）、電話音声符号器（Ｇ．７１１符
号器）および低ビットレート音声符号器（Ｇ．７２８符
号器）で符号化されたものであるときは、それぞれ復号
器２６，２４および２２で復号化されて線形ＰＣＭ信号
が得られる。Next, the configuration of the decoding system (reception side) will be described. The audio data DA from the demultiplexing unit 6 includes a high-quality speech decoder (G.722 decoder) 26, a telephone speech decoder (G.711 decoder) 24, and a low bit rate speech decoder (G.728 decoder). ) 22 in parallel. Here, the audio data DA has been encoded by a high-quality speech encoder (G.722 encoder), a telephone speech encoder (G.711 encoder) and a low bit rate speech encoder (G.728 encoder). If so, it is decoded by decoders 26, 24 and 22 respectively to obtain a linear PCM signal.

【００６７】復号器２６の出力信号はセレクタ２７に供
給される。また、復号器２４，２２の出力信号はそれぞ
れセレクタ２１に供給される。セレクタ２１にはＣＰＵ
３より制御信号ＳＣｆが供給され、相手端末のビットレ
ート割当信号ＢＡＳに含まれるオーディオデータＤＡの
符号化方式の情報で決定された受信符号化方式に基づ
き、オーディオデータＤＡが電話音声符号化方式で符号
化されたものであるときは復号器２４の出力線形ＰＣＭ
信号が選択され、一方オーディオデータＤＡが低ビット
レート音声符号化方式で符号化されたものであるときは
復号器２２の出力線形ＰＣＭ信号が選択される。受信符
号化方式の決定方法は図１の例で述べたと同様にして行
われる。The output signal of the decoder 26 is supplied to the selector 27. The output signals of the decoders 24 and 22 are supplied to the selector 21, respectively. CPU for the selector 21
3, the control signal SCf is supplied, and the audio data DA is transmitted by the telephone voice encoding system based on the reception encoding system determined by the encoding system information of the audio data DA included in the bit rate allocation signal BAS of the partner terminal. Output linear PCM of decoder 24 when encoded
The signal is selected, while the output linear PCM signal of the decoder 22 is selected when the audio data DA has been encoded by a low bit rate speech coding scheme. The method of determining the reception coding method is performed in the same manner as described in the example of FIG.

【００６８】セレクタ２１より出力される線形ＰＣＭ信
号はアップコンバータ２８に供給され、高品位音声符号
化方式としてのＧ．７２２の符号化方式の再生上限周波
数に対応するサンプルレートに変換された後に、ローパ
スフィルタ２９で帯域制限されてセレクタ２７に供給さ
れる。セレクタ２７にはＣＰＵ３より制御信号ＳＣｇが
供給され、相手端末のビットレート割当信号ＢＡＳに含
まれるオーディオデータＤＡの符号化方式の情報で決定
される受信符号化方式に基づいて選択動作が制御され
る。The linear PCM signal output from the selector 21 is supplied to the up-converter 28, and the G.264 standard as a high-quality voice encoding system is used. After being converted to a sample rate corresponding to the reproduction upper limit frequency of the encoding method 722, the band is limited by the low-pass filter 29 and supplied to the selector 27. The control signal SCg is supplied from the CPU 3 to the selector 27, and the selection operation is controlled based on the reception coding system determined by the coding system information of the audio data DA included in the bit rate allocation signal BAS of the partner terminal. .

【００６９】すなわち、オーディオデータＤＡが高品位
音声符号化方式で符号化されたものであるときは復号器
２６より出力される線形ＰＣＭ信号が選択される。一
方、オーディオデータＤＡが電話音声符号器化方式や低
ビットレート音声符号化方式で符号化されたものである
ときはローパスフィルタフィルタ２９より出力される線
形ＰＣＭ信号が選択される。そして、セレクタ２７より
出力される線形ＰＣＭ信号はＤ／Ａ変換器２５Ａでアナ
ログ信号に変換されてオーディオ信号ＳＡとして出力さ
れる。That is, when the audio data DA is encoded by the high-quality voice encoding system, the linear PCM signal output from the decoder 26 is selected. On the other hand, when the audio data DA is encoded by the telephone voice encoder system or the low bit rate voice encoding system, the linear PCM signal output from the low pass filter filter 29 is selected. The linear PCM signal output from the selector 27 is converted into an analog signal by the D / A converter 25A and output as the audio signal SA.

【００７０】図４は、高品位音声符号器１６としての
Ｇ．７２２符号器（ＳＢ−ＡＤＰＣＭ符号器）の構成例
を示している。図において、１６１は入力信号（線形Ｐ
ＣＭ信号）Ｘinを４［ｋＨｚ］以上の信号（高域）ＸH
と４［ｋＨｚ］以下の信号（低域）ＸLに分割し、それ
ぞれダウンサンプリングするための送信直交ミラーフィ
ルタである。１６２は高域ＸHに対してＡＤＰＣＭ（適
応差分ＰＣＭ）符号化を行って１６［ｋｂｐｓ］の信号
ＩHを得るための高域ＡＤＰＣＭ符号器である。１６３
は低域ＸLに対してＡＤＰＣＭ（適応差分ＰＣＭ）符号
化を行って４８［ｋｂｐｓ］の信号ＩLを得るための低
域ＡＤＰＣＭ符号器である。１６４は信号ＩHおよびＩL
を時分割多重して６４［ｋｂｐｓ］の送信信号（オーデ
ィオデータＤＡ）Ｉを得るための多重化部である。FIG. 4 is a diagram showing an example of the G.264 as high-quality speech encoder 16. The example of a structure of the 722 encoder (SB-ADPCM encoder) is shown. In the figure, 161 is an input signal (linear P
CM signal) Xin is a signal of 4 [kHz] or higher (high range) XH
And a signal (low band) XL of 4 [kHz] or less, and a transmission quadrature mirror filter for down-sampling each. A high band ADPCM encoder 162 performs ADPCM (adaptive differential PCM) coding on the high band XH to obtain a signal IH of 16 [kbps]. 163
Is a low band ADPCM encoder for performing ADPCM (adaptive differential PCM) coding on the low band XL to obtain a signal IL of 48 [kbps]. 164 is the signals IH and IL
Are time-division multiplexed to obtain a transmission signal (audio data DA) I of 64 [kbps].

【００７１】図５は、高品位音声復号器２６としての
Ｇ．７２２復号器（ＳＢ−ＡＤＰＣＭ復号器）の構成例
を示している。図において、２６１は受信した６４［ｋ
ｂｐｓ］以下の信号Ｉｒを１６［ｋｂｐｓ］の信号ＩH
と４８［ｋｂｐｓ］以下の信号ＩLrとに分離するための
分離部である。２６２は信号ＩHに対してＡＤＰＣＭ
（適応差分ＰＣＭ）復号化を行って高域のオーディオ信
号ｒHを得るための高域ＡＤＰＣＭ復号器である。２６
３は信号ＩLrに対してＡＤＰＣＭ（適応差分ＰＣＭ）復
号化を、回線のオーディオレート（６４ｋｂｐｓ／５６
ｋｂｐｓ／４８ｋｂｐｓ）に応じたモードで行って、低
域のオーディオ信号ｒLを得るための低域ＡＤＰＣＭ復
号器である。２６４は高域、低域それぞれのオーディオ
信号ｒH，ｒLを補間し、アップサンプリングして全帯域
のオーディオ信号（線形ＰＣＭ信号）Ｘoutを得るため
の受信直交ミラーフィルタである。FIG. 5 is a diagram showing an example of the G.35 as the high-quality speech decoder 26. The example of a structure of a 722 decoder (SB-ADPCM decoder) is shown. In the figure, 261 is the received 64 [k
bps] or lower signal Ir and 16 [kbps] signal IH
And a signal ILr of 48 [kbps] or less. 262 is ADPCM for the signal IH
It is a high frequency ADPCM decoder for performing (adaptive differential PCM) decoding to obtain a high frequency audio signal rH. 26
3 performs ADPCM (adaptive differential PCM) decoding on the signal ILr and the line audio rate (64 kbps / 56).
It is a low band ADPCM decoder for obtaining a low band audio signal rL by performing a mode corresponding to kbps / 48 kbps). Reference numeral 264 is a reception quadrature mirror filter for interpolating the high-frequency and low-frequency audio signals rH and rL and up-sampling to obtain the audio signal (linear PCM signal) Xout of the entire band.

【００７２】図３の例においては、符号化系において、
オーディオ信号ＳＡがデジタル信号に変換された後に高
品位音声符号器１６、電話音声符号器１２および低ビッ
トレート音声符号器１５に並列的に供給され、その符号
器１６，１２，１５の出力符号コードのいずれかがセレ
クタ１３によって選択されてオーディオデータＤＡとし
て出力される。また、復号化系において、オーディオデ
ータＤＡが高品位音声復号器２６、電話音声復号器２４
および低ビットレート音声復号器２２に並列的に供給さ
れ、オーディオデータＤＡの符号化方式に応じて復号器
２６，２４，２２の出力信号のいずれかがセレクタ２
１，２７で選択されてオーディオ信号ＳＡとして出力さ
れる。したがって、符号化系や復号化系で高品位音声符
号化方式、電話音声符号化方式および低ビットレート音
声符号化方式のいずれかの符号化方式が選択される場合
に、従来のように他の符号化方式に係る符号化や復号化
の処理をする必要がなく、ＤＳＰの符号化や復号化の演
算量を軽減してハードウェア規模の縮小によるコストダ
ウンを図ると共に、音質劣化を軽減することができる。In the example of FIG. 3, in the encoding system,
After the audio signal SA is converted into a digital signal, the audio signal SA is supplied in parallel to the high-definition audio encoder 16, the telephone audio encoder 12 and the low bit rate audio encoder 15, and output code codes of the encoders 16, 12, 15 Is selected by the selector 13 and output as the audio data DA. In the decoding system, the audio data DA is the high-definition audio decoder 26 and the telephone audio decoder 24.
And the low bit rate audio decoder 22 are supplied in parallel, and one of the output signals of the decoders 26, 24 and 22 is supplied to the selector 2 according to the encoding system of the audio data DA.
It is selected by 1 and 27 and output as the audio signal SA. Therefore, when any one of the high-definition speech coding method, the telephone speech coding method, and the low bit rate speech coding method is selected in the coding system and the decoding system, the other coding method is selected as in the conventional method. It is not necessary to perform encoding or decoding processing related to the encoding method, the amount of calculation for DSP encoding or decoding is reduced, the cost is reduced by reducing the hardware scale, and the sound quality deterioration is reduced. You can

【００７３】また、符号化系のセレクタ１７の選択動作
は相手端末の復号能力等の情報に基づいて制御されるた
め、符号器１６，１２，１５より出力される符号コード
のうち相手端末の復号能力に応じた適切な符号コードが
自動的に選択されてオーディオデータＤＡとして出力さ
れる利益がある。Since the selecting operation of the selector 17 of the encoding system is controlled based on the information such as the decoding capability of the partner terminal, the decoding of the partner terminal among the code codes output from the encoders 16, 12, 15 is performed. There is a benefit that an appropriate code code according to the capability is automatically selected and output as the audio data DA.

【００７４】なお、上述実施例においては、テレビ会議
装置のオーディオデータ処理部５に適用したものである
が、この発明はオーディオ信号の符号化、復号化に限定
されず、ビデオ信号等のその他の信号を複数の符号化方
式でもって符号化、復号化する場合にも適用して効果的
なものとなる。Although the above-described embodiment is applied to the audio data processing unit 5 of the video conference apparatus, the present invention is not limited to the encoding and decoding of audio signals, and other types of video signals and the like. It is also effective when applied to the case where a signal is encoded and decoded by a plurality of encoding methods.

【００７５】[0075]

【発明の効果】請求項１の発明によれば、入力信号が符
号能力を異にする複数の符号化手段に並列的に供給さ
れ、その複数の符号化手段の出力符号コードよりセレク
タによって一の符号コードを選択して出力するものであ
るため、ある符号化方式を選択する場合、それに伴って
他の符号化方式に係る符号化や復号化の処理をする必要
がなく、ＤＳＰの符号化や復号化の演算量を軽減してハ
ードウェア規模の縮小によるコストダウンを図ると共
に、信号劣化を軽減することができる。According to the first aspect of the present invention, the input signal is supplied in parallel to a plurality of coding means having different coding capabilities, and one of the output code codes of the plurality of coding means is selected by the selector. Since the code code is selected and output, when a certain coding method is selected, it is not necessary to perform the coding or decoding processing related to the other coding method, and the coding of the DSP or It is possible to reduce the calculation amount of decoding and reduce the cost due to the reduction of the hardware scale, and reduce the signal deterioration.

【００７６】請求項２の発明によれば、相手端末の復号
能力を判別し、その復号能力に応じてセレクタの動作を
制御するため、相手端末の復号能力に応じた適切な符号
化方式を自動的に選択することができる。According to the second aspect of the present invention, the decoding capability of the partner terminal is discriminated and the operation of the selector is controlled in accordance with the decoding capability. Therefore, an appropriate encoding method according to the decoding capability of the partner terminal is automatically selected. Can be selected as desired.

【００７７】請求項３の発明によれば、電話音声符号化
方式および低ビットレート音声符号化方式に係る符号化
のいずれかを選択でき、その際に他の符号化方式に係る
符号化や復号化の処理をする必要がなく、ＤＳＰの演算
量を軽減してハードウェア規模の縮小によるコストダウ
ンを図ると共に、音質劣化を軽減することができる。According to the third aspect of the present invention, it is possible to select either the telephone voice encoding system or the low bit rate voice encoding system, and at that time, the encoding and decoding of other encoding systems are performed. It is possible to reduce the amount of calculation of the DSP to reduce the cost due to the reduction of the hardware scale and to reduce the deterioration of the sound quality.

【００７８】請求項４の発明によれば、電話音声符号化
方式、低ビットレート音声符号化方式および高品位音声
符号化方式に係る符号化のいずれかを選択でき、その際
に他の符号化方式に係る符号化や復号化の処理をする必
要がなく、ＤＳＰの演算量を軽減してハードウェア規模
の縮小によるコストダウンを図ると共に、音質劣化を軽
減することができる。According to the fourth aspect of the present invention, it is possible to select any one of the telephone speech coding method, the low bit rate speech coding method and the high-quality speech coding method, and at that time, another coding method is selected. It is not necessary to perform encoding and decoding processing according to the method, and it is possible to reduce the DSP calculation amount, reduce the cost by reducing the hardware scale, and reduce the sound quality deterioration.

【００７９】請求項５の発明によれば、入力符号コード
が復号能力を異にする複数の復号化手段に並列的に供給
され、その複数の復号化手段の出力信号より入力符号コ
ードの符号化方式に応じて一の出力信号を選択して出力
するものであるため、ある符号化方式に係る復号化を選
択する場合、それに伴って他の符号化方式に係る符号化
や復号化の処理をする必要がなく、ＤＳＰの符号化や復
号化の演算量を軽減してハードウェア規模の縮小による
コストダウンを図ると共に、信号劣化を軽減することが
できる。According to the invention of claim 5, the input code code is supplied in parallel to a plurality of decoding means having different decoding capabilities, and the input code code is encoded from the output signals of the plurality of decoding means. Since one output signal is selected and output according to the system, when the decoding according to a certain coding system is selected, the encoding or decoding processing according to another coding system is performed accordingly. It is not necessary to reduce the amount of calculation for DSP encoding and decoding to reduce the cost by reducing the hardware scale and reduce the signal deterioration.

【００８０】請求項６の発明によれば、電話音声符号化
方式および低ビットレート音声符号化方式に係る復号化
のいずれかを選択でき、その際に他の符号化方式に係る
符号化や復号化の処理をする必要がなく、ＤＳＰの演算
量を軽減してハードウェア規模の縮小によるコストダウ
ンを図ると共に、音質劣化を軽減することができる。According to the sixth aspect of the present invention, it is possible to select either the decoding according to the telephone speech coding system or the low bit rate speech coding system, and at that time, the coding or decoding according to the other coding system is performed. It is possible to reduce the amount of calculation of the DSP to reduce the cost due to the reduction of the hardware scale and to reduce the deterioration of the sound quality.

【００８１】請求項７の発明によれば、電話音声符号化
方式、低ビットレート音声符号化方式および高品位音声
符号化方式に係る復号化のいずれかを選択でき、その際
に他の符号化方式に係る符号化や復号化の処理をする必
要がなく、ＤＳＰの演算量を軽減してハードウェア規模
の縮小によるコストダウンを図ると共に、音質劣化を軽
減することができる。According to the invention of claim 7, it is possible to select any one of the decoding according to the telephone voice encoding system, the low bit rate voice encoding system and the high-definition voice encoding system, and at that time, another encoding is selected. It is not necessary to perform encoding and decoding processing according to the method, and it is possible to reduce the DSP calculation amount, reduce the cost by reducing the hardware scale, and reduce the sound quality deterioration.

[Brief description of drawings]

【図１】この発明の第１実施例を示す構成図である。FIG. 1 is a configuration diagram showing a first embodiment of the present invention.

【図２】送信符号化方式の決定方法の説明に供する図で
ある。FIG. 2 is a diagram for explaining a method of determining a transmission coding method.

【図３】この発明の第２実施例を示す構成図である。FIG. 3 is a configuration diagram showing a second embodiment of the present invention.

【図４】Ｇ．７２２符号器（ＳＢ−ＡＤＰＣＭ符号器）
の構成例を示す図である。FIG. 4 G. 722 encoder (SB-ADPCM encoder)
It is a figure which shows the structural example.

【図５】Ｇ．７２２復号器（ＳＢ−ＡＤＰＣＭ復号器）
の構成例を示す図である。FIG. 5 G. 722 decoder (SB-ADPCM decoder)
It is a figure which shows the structural example.

【図６】テレビ会議装置の構成例を示す図である。FIG. 6 is a diagram illustrating a configuration example of a video conference device.

【図７】Ｂチャネルのフレーム構成とビット列の関係を
示す図である。FIG. 7 is a diagram showing a relationship between a frame structure of a B channel and a bit string.

【図８】Ｂチャネルのフレーム構成を示す図である。FIG. 8 is a diagram showing a frame structure of a B channel.

【図９】フレーム同期信号ＦＡＳの構造を示す図であ
る。FIG. 9 is a diagram showing a structure of a frame synchronization signal FAS.

【図１０】マルチフレームにおけるフレーム同期信号Ｆ
ＡＳの構造を示す図である。FIG. 10 is a frame synchronization signal F in multi-frame.
It is a figure which shows the structure of AS.

【図１１】ビットレート割当信号ＢＡＳの構成を示す図
である。FIG. 11 is a diagram showing a configuration of a bit rate allocation signal BAS.

【図１２】Ｂチャネルを２回線使用する場合の伝送フォ
ーマット例を示す図である。FIG. 12 is a diagram showing an example of a transmission format when two channels of B channel are used.

【図１３】Ｈ₀チャネルのフレーム構成を示す図であ
る。FIG. 13 is a diagram showing a frame structure of an H ₀ channel.

【図１４】従来のオーディオデータ処理部（音声コーデ
ィック）の構成例を示す図である。FIG. 14 is a diagram showing a configuration example of a conventional audio data processing unit (voice codec).

【図１５】Ｇ．７１１符号器の構成例を示す図である。FIG. 15 G. It is a figure which shows the structural example of a 711 encoder.

【図１６】Ｇ．７１１復号器の構成例を示す図である。FIG. 16 G. It is a figure which shows the structural example of a 711 decoder.

【図１７】Ｇ．７２８符号器（ＬＤ−ＣＥＬＰ符号器）
の構成例を示す図である。FIG. 17: G. 728 encoder (LD-CELP encoder)
It is a figure which shows the structural example.

【図１８】Ｇ．７２８復号器（ＬＤ−ＣＥＬＰ復号器）
の構成例を示す図である。FIG. 18 G. 728 decoder (LD-CELP decoder)
It is a figure which shows the structural example.

[Explanation of symbols]

１撮像装置２画像データ処理部３ＣＰＵ４マイク５オーディオデータ処理部６多重分離部７回線インタフェース８回線９モニタテレビ１０スピーカ１１，１１ＡＡ／Ｄ変換器１２電話音声符号器（Ｇ．７１１符号器）１３，１７，２１，２７セレクタ１５低ビットレート音声符号器（Ｇ．７２８符号器）１６高品位音声符号器（Ｇ．７２２符号器）１８，２９ローパスフィルタ１９ダウンコンバータ２２低ビットレート音声復号器（Ｇ．７２８復号器）２４電話音声復号器（Ｇ．７１１復号器）２５，２５ＡＤ／Ａ変換器２６高品位音声復号器（Ｇ．７２２復号器）２８アップコンバータ 1 Imaging Device 2 Image Data Processing Unit 3 CPU 4 Microphone 5 Audio Data Processing Unit 6 Demultiplexing Unit 7 Line Interface 8 Line 9 Monitor Television 10 Speaker 11, 11A A / D Converter 12 Telephone Voice Encoder (G.711 Encoder) ) 13, 17, 21, 27 selector 15 low bit rate speech encoder (G.728 encoder) 16 high-quality speech encoder (G.722 encoder) 18, 29 low-pass filter 19 down converter 22 low bit rate speech decoding Device (G.728 decoder) 24 telephone speech decoder (G.711 decoder) 25,25 A D / A converter 26 high-definition speech decoder (G.722 decoder) 28 up-converter

Claims

[Claims]

1. A plurality of encoding means for supplying input signals in parallel and having different encoding capabilities, and one code code selected from the code codes output from the plurality of encoding means and output. An encoding device comprising: a selector.

2. The encoding apparatus according to claim 1, further comprising selector control means for determining the decoding capability of the partner terminal and controlling the operation of the selector according to the decoding capability.

3. The plurality of coding means are a telephone speech coder and a low bit rate speech coder, and a linear PCM signal is parallel to the telephone speech coder and the low bit rate speech coder as the input signal. The encoding device according to claim 1, wherein the encoding device is supplied to

4. The plurality of encoding means are a telephone speech coder, a low bit rate speech coder and a high quality speech coder, and these telephone speech coder, low bit rate speech coder and high quality speech coder. The coding apparatus according to claim 1, wherein a linear PCM signal is supplied in parallel to the input terminal as the input signal.

5. A decoding signal selected from a plurality of decoding means which are supplied with input code codes in parallel and have different decoding capabilities, and decoded signals output from the plurality of decoding means. And a selector for controlling the operation of the selector according to the encoding method of the input code code.

6. The plurality of decoding means are a telephone voice decoder and a low bit rate voice decoder, and the input code code is supplied in parallel to the telephone voice decoder and the low bit rate voice decoder. The decoding device according to claim 5, wherein:

7. The plurality of decoding means are a telephone speech decoder, a low bit rate speech decoder and a high quality speech decoder, and the telephone speech decoder, the low bit rate speech decoder and the high quality speech decoder. The decoding device according to claim 5, wherein the input code code is supplied in parallel to.