JPH09172413A

JPH09172413A - Variable rate voice coding system

Info

Publication number: JPH09172413A
Application number: JP7348279A
Authority: JP
Inventors: Seiji Sasaki; 誠司佐々木; Kenzo Urabe; 健三占部
Original assignee: Kokusai Electric Corp
Current assignee: Kokusai Electric Corp
Priority date: 1995-12-19
Filing date: 1995-12-19
Publication date: 1997-06-30
Anticipated expiration: 2015-12-19
Also published as: JP3071388B2

Abstract

PROBLEM TO BE SOLVED: To obtain a speech quality without receiving the influence of transmission rate by transmitting coding information of all the class when the transmission rate is the highest and transmitting coding information by canceling coding information of the class of small timewise fluctuation along with the lowering of the transmission rate. SOLUTION: A variable rate transmission frame buffer 2 temporarily stores coding voice information c4 and outputs this to the side of an adaptive modulation means at a rate based on a transmission rate control signal b4. A variable rate voice decoder 4 decode-processes coding voice information g4 to output a reproduced voice signal h4. At this time, coding voice information of the class of small timewise fluctuation, which is not transmitted in a frame showing that a transmission rate control signal f4 is the frame of low rate transmission, is interpolated by information of a frame transmitted before/after it, otherwise after reproducing information of a frame just before as it is, and is voice-decode- processed.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、マルチメディア応
用システムに用いられるディジタル移動通信に係わり、
特に、トラヒック量や伝搬路の状況の変化に応じて変調
方式を適応させることにより伝送品質の向上を図った適
応変調方式を用いたディジタル移動通信方式に適用する
ための可変レート音声符号化方式に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to digital mobile communication used in multimedia application systems,
In particular, the present invention relates to a variable rate speech coding method for application to a digital mobile communication method using an adaptive modulation method in which transmission quality is improved by adapting the modulation method according to changes in traffic volume and propagation path conditions. .

【０００２】[0002]

【従来の技術】従来、フェージングによる受信レベルの
変動が激しい無線伝送路を利用した移動通信システムの
設計では、伝送路の平均的特性により適用する変調方式
を決めている。そのため、伝送路の状態が良好なときに
は、本来の伝送可能な容量以下の伝送となり効率が悪く
なる。また、伝送路の状態が悪いときは、設計値以上の
誤り率となり再生音声の品質が劣化する。この問題を解
決するため、瞬時の伝送路変動に応じて、割り当てられ
た帯域内で最適な変調方式と伝送レートを切替え選択
し、高い伝送品質で高スループットの伝送を実現するこ
とを目的とした適応変調方式が提案されている（大槻信
也他：“ＱＡＭを用いた適応変調方式の伝送特性解析”
信学技報、ＲＣＳ９４−６６（１９９９４−０９）参
照）。この適応変調方式は、伝搬路の状況に応じて伝送
レートを変化させる方式であるため、伝送レートの切替
えに対応して音声の符号化レートを適応させることので
きる可変レート音声符号化方式とその装置が必要とな
る。2. Description of the Related Art Conventionally, in the design of a mobile communication system using a radio transmission line in which the reception level fluctuates significantly due to fading, the modulation method to be applied is determined by the average characteristics of the transmission line. Therefore, when the state of the transmission path is good, the transmission becomes less than the original transmittable capacity and the efficiency becomes poor. Further, when the state of the transmission path is bad, the error rate becomes equal to or higher than the design value, and the quality of reproduced voice deteriorates. In order to solve this problem, the objective was to realize the high throughput transmission with high transmission quality by switching and selecting the optimal modulation method and transmission rate within the allocated band according to the instantaneous transmission path fluctuation. Adaptive modulation schemes have been proposed (Shinya Otsuki et al .: "Transmission characteristics analysis of adaptive modulation schemes using QAM")
See IEICE Technical Report, RCS94-66 (19994-09)). Since this adaptive modulation system is a system that changes the transmission rate according to the state of the propagation path, it is possible to adapt the coding rate of the voice corresponding to the switching of the transmission rate, and Equipment is required.

【０００３】従来の主な可変レート音声符号化方式とし
ては、可変レートＡＤＰＣＭ（４０／３２／２４／１６
kbps）（ＩＴＵ−Ｔ勧告Ｇ．７２６）、エンベデッド
ＡＤＰＣＭ（４０／３２／２４／１６／kbps）（ＩＴＵ
−Ｔ勧告Ｇ．７２７）や、北米のＩＳ−９５システム
においてQualcom 社が提案しているＱＣＥＬＰ（８／４
／２／０．８kbps）などがある。A variable-rate ADPCM (40/32/24/16) is the main conventional variable-rate speech coding system.
kbps) (ITU-T Recommendation G.726), embedded ADPCM (40/32/24/16 / kbps) (ITU
-T Recommendation G. 727) and QCELP (8/4) proposed by Qualcom in the North American IS-95 system.
/2/0.8kbps) etc.

【０００４】上記の可変レートＡＤＰＣＭは、任意のサ
ンプル単位で音声符号化レートを、４０／３２／２４／
１６kbpsの内から選択する方式である。また、エンベデ
ッドＡＤＰＣＭは、任意のサンプル単位で音声符号化レ
ートを、４０／３２／２４／１６kbpsの内から選択し、
その結果得られた符号化音声情報を、コア情報（再生音
声の生成に不可欠な情報）とエンハンスメント情報（再
生音声の品質を向上させるための情報であり、この情報
を廃棄しても再生音声は生成できる）とに分割し、必要
に応じてエンハンスメント情報の全部または一部を廃棄
し、コア情報と残ったエンハンスメント情報のみから再
生音声を得ることができる方式である。さらに、ＱＣＥ
ＬＰは、フレーム毎に入力音声信号を音響学的に分類し
（有声／無声／過渡部／雑音に分類し）、それぞれの音
響学的性質に最適な音声符号化レートを、８／４／２／
０．８kbpsの内から選択して符号化処理する方式であ
り、平均の符号化レートは４．８kbps程度である。The above variable rate ADPCM has a voice coding rate of 40/32/24 / in arbitrary sample units.
This is a method of selecting from 16 kbps. In addition, the embedded ADPCM selects a voice coding rate from 40/32/24/16 kbps in arbitrary sample units,
The coded audio information obtained as a result is core information (information indispensable for generating the reproduced sound) and enhancement information (information for improving the quality of the reproduced sound. Even if this information is discarded, the reproduced sound is not reproduced. It can be generated), and all or part of the enhancement information is discarded as necessary, and the reproduced voice can be obtained only from the core information and the remaining enhancement information. Furthermore, QCE
The LP classifies the input speech signal acoustically for each frame (classifies voiced / unvoiced / transient / noise), and determines the optimal speech coding rate for each acoustic property as 8/4/2. /
This is a method of performing a coding process by selecting from 0.8 kbps, and an average coding rate is about 4.8 kbps.

【０００５】[0005]

【発明が解決しようとする課題】これらの従来の可変レ
ート音声符号化方式を、適応変調方式による通信システ
ムに適用した場合の問題点を以下に示す。適応変調方式
では、伝搬路の状態の良し悪しを受信レベル等を監視す
ることにより調べ、その結果に基づいて伝送レートまた
は変調多値数を変化させるため、伝送容量が時々刻々と
変化する。これに伴い音声符号化器の符号化レートも、
伝搬路状態が良好な程高く、劣悪な程低くなる。ここで
問題なのは、上記の従来の符号化方式では、音声符号化
器の符号化レートが変化すると再生音声の品質も同様に
変化してしまうため、通話品質は時々刻々と変化し、受
聴者が不快さを感じるという点である。また、伝搬路状
態がさらに悪くなると情報伝送が不可能になり、再生音
声が瞬時的に途切れてしまう恐れもある。現状では、適
応変調方式を用いたシステムに適用する際、伝送レート
の変化に伴い通話品質が影響を受けにくい可変レート音
声符号化方式は存在しない。Problems to be solved when these conventional variable rate speech coding systems are applied to a communication system based on an adaptive modulation system will be described below. In the adaptive modulation method, the quality of the propagation path is checked by monitoring the reception level and the like, and the transmission rate or the modulation multilevel number is changed based on the result, so the transmission capacity changes momentarily. Along with this, the coding rate of the speech coder is also
The higher the propagation path condition, the higher the value, and the poorer the value, the lower the value. The problem here is that in the above-mentioned conventional coding method, the quality of the reproduced voice also changes when the coding rate of the voice coder changes, so the call quality changes from moment to moment, and The point is that they feel uncomfortable. Further, if the state of the propagation path is further deteriorated, information transmission becomes impossible, and the reproduced voice may be momentarily interrupted. At present, when applied to a system using an adaptive modulation method, there is no variable rate speech coding method in which speech quality is not easily affected by a change in transmission rate.

【０００６】本発明の目的は、この問題を解決するた
め、伝搬路状態の変化に対応して伝送レートを変化させ
る適応変調方式による通信システムに適用でき、かつ、
通話品質が適応変調方式の伝送レートの変化に起因する
影響を受けにくい可変レート音声符号化方式を提供する
ことにある。In order to solve this problem, the object of the present invention can be applied to a communication system using an adaptive modulation system in which the transmission rate is changed in response to changes in the propagation path state, and
It is an object of the present invention to provide a variable rate speech coding system in which speech quality is hardly affected by a change in transmission rate of an adaptive modulation system.

【０００７】[0007]

【課題を解決するための手段】本発明の可変レート音声
符号化方式は、受信信号を監視して伝搬路の状態をフレ
ーム単位に検出し伝搬路の状態が良好なとき高速の伝送
レートを指定し伝搬路の状態が悪くなるに伴って伝送レ
ートをＮ段階に低速にする伝送レート制御信号を出力す
る伝搬路状態推定手段と、該伝送レート制御信号によっ
て伝送レートをＮ段階に変化させる適応変調手段と適応
復調手段とが設けられた適応変調方式の通信システムに
適用するための可変レート音声符号化方式であって、送
信側は、入力音声信号をフレーム毎に音声符号化処理し
て符号化音声情報を抽出したのち前記伝送レート制御信
号に対応させて該符号化音声情報を時間的変動の大きい
情報から変動の小さい情報にわたってＮ段階にクラス分
けし、該伝送レート制御信号が最高レートを示している
フレームに対しては全クラスの符号化音声情報を該伝送
レート制御信号が指示するレートで前記適応変調手段側
に出力し、該伝送レート制御信号が示すレートが低くな
るに伴ってその程度に応じて時間的変動のより小さいク
ラスから符号化音声情報を破棄し、残された時間的変動
の大きいクラスの符号化音声情報のみを、該伝送レート
制御信号が指示するレートで前記適応変調手段側に出力
する可変レート音声符号化手段を備え、受信側は、前記
適応復調手段から符号化音声情報を受け取り、前記伝送
レート制御信号に基づき、高レート伝送のフレームにお
いては全クラスの符号化音声情報を再生し、低レート伝
送のフレームにおいては伝送されてこなかった時間的変
動の小さいクラスの符号化音声情報を当該フレームの前
後のフレームの伝送された情報により補間した後、また
は、直前レームの伝送されたフレームの情報をそのまま
用いて再生した後、伝送されてきた符号化音声情報およ
び該再生された符号化音声情報を音声復号処理すること
により再生音声信号を生成する可変レート音声復号手段
を備えたことを特徴とするものである。According to the variable rate speech coding system of the present invention, a received signal is monitored to detect a state of a propagation path in frame units, and a high transmission rate is designated when the condition of the propagation path is good. Then, a propagation path state estimating means for outputting a transmission rate control signal for decreasing the transmission rate to N stages as the state of the propagation path deteriorates, and an adaptive modulation for changing the transmission rate to N stages by the transmission rate control signal. A variable-rate voice encoding system for application to a communication system of an adaptive modulation system provided with a means and an adaptive demodulating means, wherein a transmitting side encodes an input voice signal by performing a voice encoding process for each frame. After extracting the voice information, the coded voice information is classified into N stages from information having a large temporal variation to information having a small variation in correspondence with the transmission rate control signal. For the frame in which the control signal indicates the highest rate, the encoded voice information of all classes is output to the adaptive modulation means side at the rate indicated by the transmission rate control signal, and the rate indicated by the transmission rate control signal is As the audio data becomes lower, the coded audio information is discarded from the class having the smaller temporal fluctuation according to the degree, and the transmission rate control signal indicates only the remaining coded audio information of the class having the large temporal fluctuation. A variable rate audio encoding means for outputting to the adaptive modulating means side at a rate that the receiving side receives the encoded audio information from the adaptive demodulating means, and based on the transmission rate control signal, in a frame of high rate transmission. Reproduces all classes of coded audio information and has not been transmitted in low-rate transmission frames. After being interpolated by the transmitted information of the frames before and after the frame, or after being reproduced by using the information of the transmitted frame of the immediately preceding frame as it is, transmitted encoded audio information and the reproduced encoding It is characterized by comprising variable rate audio decoding means for generating a reproduced audio signal by performing audio decoding processing of audio information.

【０００８】さらに、伝送誤りに対する耐性を向上する
ため、上記の可変レート音声符号化方式において、符号
化音声情報に誤り訂正・検出機能を付加したことを特徴
とするものである。Further, in order to improve resistance to transmission errors, the above-mentioned variable rate audio encoding system is characterized by adding an error correction / detection function to the encoded audio information.

【０００９】[0009]

BEST MODE FOR CARRYING OUT THE INVENTION

（作用）本発明は、送信側では、入力音声信号をフレー
ム毎に符号化し、その結果である符号化音声情報を時間
的変動の大小によりクラス分けし、伝搬路状態が良好な
時は全クラスの情報を適用変調手段側に出力し、伝搬路
の状態が悪くなるにともない時間的変動のより小さいク
ラスから符号化音声情報を廃棄し、残された時間的変動
の大きいクラスの情報のみを適応変調手段側に出力する
ように制御し、受信側では、適応復調手段を通して符号
化音声情報を受信し、伝搬路の状態が悪いフレームで伝
送されてこなかった時間的変動の小さいクラスの符号化
音声情報に対しては、その前後に伝送されたフレームの
情報により補間して、または、直前に伝送されたフレー
ムの情報をそのまま用いて再生した後、伝送されてきた
符号化音声情報および該再生された符号化音声情報を用
いて音声復号処理し再生音声信号を生成する。これによ
り、伝搬路の状態が悪い時点での再生音声品質の落ち込
みを軽減することができ、伝搬路状態に影響されにくい
通話品質を実現することができる。また、伝送レートに
応じて符号化音声情報に可変レートで誤り訂正・検出を
施す機能を付加することにより、伝送誤りに対する耐性
を向上させることができる。(Operation) In the present invention, on the transmission side, the input speech signal is coded for each frame, and the resulting coded speech information is classified into classes according to the magnitude of temporal fluctuation. When the propagation path state is good, all classes are classified. Is output to the modulation means side, the coded voice information is discarded from the class with the smaller temporal fluctuation as the propagation path condition worsens, and only the information of the remaining class with large temporal fluctuation is applied. The receiving side receives the coded voice information through the adaptive demodulating means, and the receiving side receives the coded voice information through the adaptive demodulating means, and the coded voice of a class with small temporal fluctuation that was not transmitted in a frame with a bad propagation path state. For information, it is interpolated by the information of the frames transmitted before and after it or reproduced using the information of the frame transmitted immediately before, and then the encoded audio information transmitted. Generates audio decoding processing reproduced audio signal by using the finely the reproduced coded speech information. As a result, it is possible to reduce the drop in the reproduced voice quality when the state of the propagation path is poor, and it is possible to realize speech quality that is not easily affected by the state of the propagation path. Further, by adding a function of performing error correction / detection at a variable rate to the encoded voice information according to the transmission rate, it is possible to improve resistance to transmission errors.

【００１０】[0010]

【実施例】本発明の可変レート音声符号化方式とその装
置の実施例について、図１から図５を参照しつつ説明す
る。図１は請求項１に記載した本発明の第１の実施例を
示す構成図である。図２は請求項６に記載した本発明の
第２の実施例を示す構成図であり、移動無線通信におい
て発生する伝送誤りに対する耐性を向上させるため、通
信路符号化・復号機能（誤り訂正・検出機能）を含んだ
構成を示す。また、図３，４は図１及び図２の音声符号
化器及び音声復号器の詳細具体例であり、符号励振線形
予測（ＣＥＬＰ：Codec Excited Linear Prediction ）
を用いた場合（請求項３）の構成例図である。さらに、
請求項７記載の発明は、図１，図２の本発明の第１，第
２の実施例に適用され、伝搬路状況に適応して変調多値
数をフレーム毎にＮ段階に変化させることにより、伝送
レートをＮ段階に変化させる適応変調システムに適用す
るものである。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT An embodiment of the variable rate speech coding system and its apparatus of the present invention will be described with reference to FIGS. FIG. 1 is a configuration diagram showing a first embodiment of the present invention described in claim 1. FIG. 2 is a block diagram showing a second embodiment of the present invention as set forth in claim 6, in order to improve resistance to a transmission error occurring in mobile radio communication, a channel coding / decoding function (error correction. A configuration including a detection function) is shown. 3 and 4 are detailed specific examples of the speech encoder and the speech decoder shown in FIGS. 1 and 2, which are coded linear prediction (CELP: Codec Excited Linear Prediction).
It is a structural example figure at the time of using (claim 3). further,
The invention according to claim 7 is applied to the first and second embodiments of the present invention shown in FIGS. 1 and 2, and the modulation multilevel number is changed to N stages for each frame in accordance with the propagation path condition. The present invention is applied to an adaptive modulation system that changes the transmission rate in N steps.

【００１１】まず、第１の実施例について説明する。図
１は、本発明の請求項１に記載した可変レート音声符号
化方式とその装置の構成例図である。同図（Ａ）は送信
側で使用される可変レート音声符号化装置であり、同図
（Ｂ）は受信側の復号装置である。図において、１は送
信側の可変レート音声符号化器、２は可変レート送信フ
レームバッファ、３は受信側の可変レート受信フレーム
バッファ、４は可変レート音声復号器である。First, a first embodiment will be described. FIG. 1 is a diagram showing an example of the configuration of a variable rate speech coding system and its apparatus described in claim 1 of the present invention. The figure (A) shows a variable rate speech coding apparatus used on the transmitting side, and the figure (B) shows a decoding apparatus on the receiving side. In the figure, 1 is a variable rate speech encoder on the transmitting side, 2 is a variable rate transmitting frame buffer, 3 is a variable rate receiving frame buffer on the receiving side, and 4 is a variable rate speech decoder.

【００１２】図１（Ａ）の送信側の符号化装置の可変レ
ート音声符号化器１は、８kHz でサンプリングされ、１
２bit 以上の精度で量子化された入力音声信号ａ４を符
号化処理し、その結果である符号化音声情報を時間的変
動の大小によりクラス分けし、伝送レート制御信号ｂ４
に基づき選定されたクラスの符号化音声情報ｃ４を出力
する。The variable rate speech coder 1 of the transmission side coding apparatus of FIG. 1A is sampled at 8 kHz and
The input voice signal a4 quantized with a precision of 2 bits or more is encoded, and the resulting encoded voice information is classified into classes according to the magnitude of temporal fluctuation, and the transmission rate control signal b4
The encoded audio information c4 of the class selected based on the above is output.

【００１３】ここで、伝送レート制御信号ｂ４とは、受
信側で受信レベル等により伝搬路の状態を推定し、最適
な伝送容量を決定する伝搬路状態推定手段（適応変調シ
ステムの１構成要素）から出力される制御信号であり、
所定の単位時間（フレーム長に対応する：例えば５mse
c）毎に本発明の可変レート音声符号化装置及び復号装
置に対して伝送レートを指定する信号である。可変レー
ト音声符号化器１の詳細な構成については後で説明す
る。Here, the transmission rate control signal b4 is a channel state estimation means (one component of the adaptive modulation system) for estimating the state of the channel based on the reception level and the like on the receiving side and determining the optimum transmission capacity. Is a control signal output from
Predetermined unit time (corresponding to frame length: eg 5 mse
It is a signal that specifies the transmission rate to the variable rate speech encoding device and the decoding device of the present invention for each c). The detailed configuration of the variable rate speech encoder 1 will be described later.

【００１４】可変レート送信フレームバッファ２は、符
号化音声情報ｃ４を一時的に蓄積し、伝送レート制御信
号ｂ４に基づくレートで適応変調手段側に符号化音声情
報ｄ４を出力する。The variable rate transmission frame buffer 2 temporarily stores the coded voice information c4 and outputs the coded voice information d4 to the adaptive modulation means side at a rate based on the transmission rate control signal b4.

【００１５】図１（Ｂ）の受信側で使用される可変レー
ト音声復号装置の可変レート受信フレームバッファ３
は、送信側の可変レート音声符号化装置より伝送され誤
り保護された符号化音声情報ｅ４が適応復調手段側から
入力され、一時的に蓄積し、伝送レート制御信号ｆ４が
指定するレートに基づき符号化音声情報ｇ４を出力す
る。ここで、伝送レート制御信号ｆ４は、前述の可変レ
ート音声符号化装置で参照される伝送レート制御信号ｂ
４と同じものである。可変レート音声復号器４は、符号
化音声情報ｇ４を復号処理し再生音声信号ｈ４を出力す
る。この時、伝送レート制御信号ｆ４が低レート伝送の
フレームであることを示しているフレームにおいて伝送
されてこなかった時間的変動の小さいクラスの符号化音
声情報に対しては、その前後に伝送されたフレームの情
報により補間して、または、直前に伝送されたフレーム
の情報をそのまま用いて再生した後、伝送されてきた符
号化音声情報および該再生された符号化音声情報を用い
て音声復号処理する。可変レート音声復号器４の詳細な
構成については後で説明する。The variable rate reception frame buffer 3 of the variable rate audio decoding device used on the receiving side in FIG.
Is coded voice information e4 that is transmitted from the variable rate voice coding device on the transmission side and is error-protected, is input from the adaptive demodulation means side, is temporarily stored, and is coded based on the rate specified by the transmission rate control signal f4. The converted voice information g4 is output. Here, the transmission rate control signal f4 is the transmission rate control signal b referred to by the above-mentioned variable rate speech coder.
Same as 4. The variable rate audio decoder 4 decodes the encoded audio information g4 and outputs a reproduced audio signal h4. At this time, the encoded voice information of the class with small temporal fluctuation that was not transmitted in the frame indicating that the transmission rate control signal f4 is a low-rate transmission frame was transmitted before and after that. After being interpolated by frame information or reproduced using the information of the frame transmitted immediately before, audio decoding processing is performed using the transmitted encoded audio information and the reproduced encoded audio information. . The detailed configuration of the variable rate speech decoder 4 will be described later.

【００１６】次に、本発明の第２の実施例について説明
する。図２は、請求項６に係わる本発明の通信路符号化
機能を付加した可変レート音声符号化方式とその装置の
構成例図である。同図（Ａ）は、送信側で使用される可
変レート音声符号化装置であり、（Ｂ）は受信側の音声
復号装置である。図において、１〜４は図１の同じ部分
の符号と同一である。５は通信路符号化器、６は通信路
復号器である。Next, a second embodiment of the present invention will be described. FIG. 2 is a block diagram showing the configuration of a variable rate speech coding system to which the channel coding function of the present invention according to claim 6 is added and its apparatus. FIG. 1A shows a variable rate audio encoding device used on the transmitting side, and FIG. 1B shows an audio decoding device on the receiving side. In the figure, 1 to 4 are the same as the same reference numerals in FIG. Reference numeral 5 is a channel encoder, and 6 is a channel decoder.

【００１７】可変レート音声符号化器１は、８kHz でサ
ンプリングされ、１２bit 以上の精度で量子化された入
力音声信号ａ３を符号化処理し、その結果である符号化
音声情報を時間的変動の大小によりクラス分けし、伝送
レート制御信号ｂ３に基づき選定されたクラスの符号化
音声情報ｃ３を出力する。The variable-rate speech encoder 1 encodes the input speech signal a3 sampled at 8 kHz and quantized with an accuracy of 12 bits or more, and the resulting encoded speech information has a large temporal variation. The coded voice information c3 of the class selected based on the transmission rate control signal b3 is output.

【００１８】ここで、伝送レート制御信号とは、受信側
で受信レベル等により伝搬路状態を推定し、最適な伝送
容量を決定する伝搬路状態推定手段（適応変調システム
の１構成要素）から出力される制御信号であり、所定の
単位時間（フレーム長に対応する：例えば５msec）毎に
本発明の可変レート音声符号化装置及び復号装置に対し
て伝送レートを指定する信号である。可変レート音声符
号化器１の詳細な構成については後で説明する。Here, the transmission rate control signal is output from the channel state estimating means (one component of the adaptive modulation system) for estimating the channel state on the receiving side according to the reception level and the like and determining the optimum transmission capacity. The control signal is a signal for designating a transmission rate to the variable rate speech encoding device and the decoding device of the present invention every predetermined unit time (corresponding to the frame length: for example, 5 msec). The detailed configuration of the variable rate speech encoder 1 will be described later.

【００１９】通信路符号化器５は、符号化音声情報ｃ３
を伝送レート制御信号ｂ３に基づくレートで通信路符号
化処理し、誤り保護された符号化音声情報ｄ３を出力す
る。通信路符号化の方法としては、データ伝送の誤り検
出訂正として一般的に用いられている冗長度符号チェッ
ク方式（ＣＲＣ：Cyclic Redundancy Check ）、または
畳込み符号化を用いるものとする。可変レート送信フレ
ームバッファ２は、誤り保護された符号化音声情報ｄ３
を一時的に蓄積し、伝送レート制御信号ｂ３に基づくレ
ートで適応変調手段側に誤り保護された符号化音声情報
ｅ３を出力する。The channel encoder 5 is for encoding encoded voice information c3.
Is subjected to channel coding processing at a rate based on the transmission rate control signal b3, and error-protected coded speech information d3 is output. As a communication channel coding method, a redundancy code check method (CRC: Cyclic Redundancy Check) generally used for error detection and correction of data transmission, or convolutional coding is used. The variable rate transmission frame buffer 2 is provided with error-protected encoded voice information d3.
Is temporarily stored and error-protected coded voice information e3 is output to the adaptive modulation means side at a rate based on the transmission rate control signal b3.

【００２０】図２（Ｂ）の受信側で使用される可変レー
ト音声復号装置の可変レート受信フレームバッファ３
は、送信側の可変レート音声符号化装置からの誤り保護
された符号化音声情報ｆ３が適応復調手段側から入力さ
れ、一時的に蓄積し、伝送レート制御信号ｇ３が指定す
るレートに基づき誤り保護された符号化音声情報ｈ３を
出力する。ここで、伝送レート制御信号ｇ３は、送信側
の可変レート音声符号化装置で参照される伝送レート制
御信号ｂ３と同じものである。The variable rate reception frame buffer 3 of the variable rate audio decoding device used on the receiving side in FIG.
The error-protected encoded voice information f3 from the variable-rate voice encoder on the transmission side is input from the adaptive demodulation means side, temporarily accumulated, and error-protected based on the rate specified by the transmission rate control signal g3. The encoded encoded audio information h3 is output. Here, the transmission rate control signal g3 is the same as the transmission rate control signal b3 referred to by the variable rate audio encoding device on the transmission side.

【００２１】通信路復号器６は、誤り保護された符号化
音声情報ｈ３が入力され、伝送レート制御信号ｇ３によ
って指定されたレートに基づき、一般的に用いられてい
る方法であるＣＲＣまたはビタビ復号により誤り検出，
訂正し、通信路復号した符号化音声情報ｉ３を出力す
る。The channel decoder 6 receives the encoded voice information h3 which is error-protected, and is based on the rate designated by the transmission rate control signal g3, which is a generally used method such as CRC or Viterbi decoding. Error detection,
The corrected and channel-decoded encoded voice information i3 is output.

【００２２】可変レート音声復号器４は、符号化音声情
報ｉ３を復号処理し再生音声信号ｊ３を出力する。この
時、伝送レート制御信号ｇ３が低レート伝送のフレーム
であることを示しているフレームにおいて伝送されてこ
なかった時間的変動の小さいクラスの符号化音声情報に
対しては、その前後に伝送されたフレームの情報により
補間して、または、直前に伝送されたフレームの情報そ
のまま用いて再生した後、伝送されてきた符号化音声情
報および該再生された符号化音声情報を用いて音声復号
処理する。可変レート音声復号器４の詳細な構成につい
ては後で説明する。The variable rate speech decoder 4 decodes the encoded speech information i3 and outputs the reproduced speech signal j3. At this time, the coded voice information of the class with small temporal fluctuation, which was not transmitted in the frame indicating that the transmission rate control signal g3 is a frame of low rate transmission, was transmitted before and after that. After the frame information is interpolated or the frame information transmitted immediately before is used as it is for reproduction, a voice decoding process is performed using the transmitted coded voice information and the reproduced coded voice information. The detailed configuration of the variable rate speech decoder 4 will be described later.

【００２３】図３は、本発明の第１の実施例および第２
の実施例の可変レート符号化器１の詳細ブロック図であ
り、図１（Ａ）の可変レート音声符号化器１および図２
（Ａ）の可変レート音声符号化器１の具体的構成例であ
る。この実施例では、公知の技術である符号励振線形予
測（ＣＥＬＰ：Codec Excited Linear Predicition）音
声符号化方式を基本としている。図において、１１は線
形予測分析器、１２はフレームパワー計算器、１３は適
応符号帳、１４は雑音符号帳、１５，１６は利得調整
器、１７は加算器、１８は合成フィルタ、１９は加算
器、２０は聴覚重み付き波形歪最小化制御器、２１はパ
ラメータ分割・間引き器である。FIG. 3 shows a first embodiment and a second embodiment of the present invention.
2 is a detailed block diagram of the variable rate coder 1 of the embodiment of FIG.
It is a specific configuration example of the variable rate speech encoder 1 of (A). This embodiment is based on a well-known code excitation linear prediction (CELP) speech coding system. In the figure, 11 is a linear prediction analyzer, 12 is a frame power calculator, 13 is an adaptive codebook, 14 is a noise codebook, 15 and 16 are gain adjusters, 17 is an adder, 18 is a synthesis filter, and 19 is addition. 20 is a perceptual weighted waveform distortion minimization controller, and 21 is a parameter divider / thinner.

【００２４】同図において、線形予測分析器１１は入力
音声信号ａ１を入力し、フレーム長（５msec：適応変調
システムでの伝送レートの可変時間間隔に対応）毎にＬ
ＳＰ（線スペクトル対：次元は１０次とする）等のスペ
クトル包絡情報ｂ１を抽出し、合成フィルタ１８とパラ
メータ分割・間引き器２１に出力する。ＬＳＰを求める
際の分析窓長は、現フレームを中心として例えば５フレ
ーム長に設定する。フレームパワー計算器１２は、入力
音声信号ａ１を入力しフレーム毎のパワー情報ｃ１を計
算しパラメータ分割・間引き器２１と聴覚重み付き波形
歪み制御器２０に出力する。In the figure, the linear prediction analyzer 11 receives the input speech signal a1 and outputs L for each frame length (5 msec: corresponding to a variable time interval of the transmission rate in the adaptive modulation system).
Spectral envelope information b1 such as SP (line spectrum pair: dimension is 10th order) is extracted and output to the synthesis filter 18 and the parameter divider / thinner 21. The analysis window length when obtaining the LSP is set to, for example, 5 frame lengths centering on the current frame. The frame power calculator 12 receives the input audio signal a1, calculates power information c1 for each frame, and outputs the power information c1 to the parameter divider / thinner 21 and the auditory weighted waveform distortion controller 20.

【００２５】適応符号帳１３は有声の音源波形を表現す
るための符号帳であり、例えば１２８種類の波形パター
ンを有している。雑音符号帳１４は無声の音源波形を表
現するための符号帳であり、例えば５１２種類の波形パ
ターンを有している。適応符号帳１３および雑音符号帳
１４は聴覚重み付けき波形歪み制御器２０からの制御信
号ｎ１に従い符号語ｄ１，ｅ１をそれぞれ出力する。The adaptive codebook 13 is a codebook for expressing a voiced sound source waveform, and has, for example, 128 types of waveform patterns. The noise codebook 14 is a codebook for expressing unvoiced sound source waveforms, and has, for example, 512 types of waveform patterns. The adaptive codebook 13 and the noise codebook 14 output codewords d1 and e1 in accordance with the control signal n1 from the auditory weighting waveform distortion controller 20, respectively.

【００２６】利得調整器１５および利得調整器１６は、
聴覚重み付波形歪み制御器２０からの制御信号ｎ１に従
い、符号語ｄ１，ｅ１の利得調整を行い、それぞれ利得
調整された信号ｆ１，ｇ１を出力する。加算器１７は利
得調整された２つの符号語ｆ１とｇ１を加算し、再生音
源信号ｈ１を作って出力するとともに、適応符号帳１３
に再生音源信号ｈ１を入力し、適応符号帳１３の中身を
更新する。The gain adjuster 15 and the gain adjuster 16 are
According to the control signal n1 from the perceptual weighted waveform distortion controller 20, the gain adjustment of the code words d1 and e1 is performed, and the gain adjusted signals f1 and g1 are output. The adder 17 adds the two gain-adjusted codewords f1 and g1 to produce and output a reproduced excitation signal h1, and at the same time, the adaptive codebook 13
The reproduced sound source signal h1 is input to and the contents of the adaptive codebook 13 are updated.

【００２７】合成フィルタ１８は、スペクトル包絡情報
ｂ１により再生音源信号ｈ１にスペクトル包絡成分ｂ１
を付加し、再生音声信号ｉ１を合成して出力する。加算
器１９は、再生音声信号ｉ１から入力音声信号ａ１を減
算し、予測誤差信号ｊ１を作り、それを聴覚重み付き波
形歪み制御器２０に入力する。The synthesis filter 18 uses the spectrum envelope information b1 to add the spectrum envelope component b1 to the reproduced sound source signal h1.
Is added and the reproduced voice signal i1 is synthesized and output. The adder 19 subtracts the input audio signal a1 from the reproduced audio signal i1 to create a prediction error signal j1 and inputs it to the auditory weighted waveform distortion controller 20.

【００２８】聴覚重み付き波形歪み制御器２０は、フレ
ーム毎に予測誤差信号ｊ１の聴覚重み付き２乗平均誤差
が最小となるように適応符号帳１３および雑音符号帳１
４のインデックスを制御信号ｎ１により制御し、選択
し、それぞれインデックスｋ１とｌ１をパラメータ分割
・間引き器２１に出力する。また、聴覚重み付き波形歪
み制御器２０は、フレームパワー計算器１２からの出力
ｃ１を使用して予測誤差信号ｊ１の聴覚重み付き２乗平
均誤差が最小となるように利得調整器１５および利得調
整器１６の利得を制御信号ｎ１により制御し、調整し、
利得インデックスｍ１をパラメータ分割・間引き器２１
に出力する。The perceptual weighted waveform distortion controller 20 adapts the adaptive codebook 13 and the noise codebook 1 so that the perceptual weighted mean squared error of the prediction error signal j1 is minimized for each frame.
The index of 4 is controlled and selected by the control signal n1, and the indexes k1 and 11 are output to the parameter divider / thinner 21, respectively. Further, the auditory weighted waveform distortion controller 20 uses the output c1 from the frame power calculator 12 so that the auditory weighted mean squared error of the prediction error signal j1 is minimized. The gain of the device 16 is controlled and adjusted by the control signal n1,
The gain index m1 is converted into a parameter divider / decimator 21.
Output to

【００２９】パラメータ分割・間引き器２１は、符号化
音声情報を時間的変動の大小によりクラス分けし（クラ
ス分け方法の詳細については後述する）、伝送レート制
御信号ｏ１（図１ではｂ４，図２ではｂ３）に基づき選
定されたクラスの符号化音声情報ｐ１を、可変レート送
信フレームバッファ２または通信路符号化器５に対して
可変レートで出力する。ここで、音声符号化情報のクラ
ス分けの例を表１に示す。The parameter divider / decimator 21 classifies the coded speech information into classes according to the magnitude of the temporal fluctuation (details of the class classification method will be described later), and the transmission rate control signal o1 (b4 in FIG. 1 and FIG. 2). Then, the encoded voice information p1 of the class selected based on b3) is output to the variable rate transmission frame buffer 2 or the channel encoder 5 at a variable rate. Here, Table 1 shows an example of classifying the audio coded information.

【００３０】[0030]

【表１】 [Table 1]

【００３１】一般的に、符号化音声情報となる音声の特
徴パラメータを時間的変動が大きい順に並べると、ま
ず、音源情報である雑音符号帳インデックスｌ１および
利得インデックスｍ１，フレームパワー情報ｃｌ、次
に、ピッチ情報である適応符号帳インデックスｋｌ、最
後に、スペクトル包絡情報の順となる。従って、この例
では、表１に示すように時間的変動の大きい順にクラス
Ａ，Ｂ，Ｃと各パラメータをクラス分けしている。他の
クラス分け方法としては、表１において、スペクトル包
絡情報ｂ１（１〜３次）とスペクトル包絡情報ｂ１（４
〜１０次）を入れ替えてもよい。Generally, when the characteristic parameters of the speech that is the encoded speech information are arranged in the order of large temporal variation, first, the noise codebook index l1 and the gain index m1, which are the excitation information, the frame power information cl, and then the , Adaptive codebook index kl, which is pitch information, and finally spectral envelope information. Therefore, in this example, as shown in Table 1, the parameters are classified into classes A, B, and C in descending order of temporal variation. As another classification method, in Table 1, spectrum envelope information b1 (first to third order) and spectrum envelope information b1 (4
10th order) may be replaced.

【００３２】伝送レート制御信号に基づいた音声情報の
可変ビットレート（フレーム長は５msec）の設定を表２
に示す。ここでは、適応変調手段から指定される伝送レ
ートは、高レート，中レート，低レートの３段階に変化
するものとする。また、表２では、本発明の第２の実施
例の通信路符号化器５により冗長ビットが付加された時
のビットレートも示す。Table 2 shows the setting of the variable bit rate (frame length is 5 msec) of audio information based on the transmission rate control signal.
Shown in Here, it is assumed that the transmission rate designated by the adaptive modulation means changes in three stages of high rate, medium rate, and low rate. Table 2 also shows the bit rate when redundant bits are added by the channel encoder 5 according to the second embodiment of the present invention.

【００３３】[0033]

【表２】 [Table 2]

【００３４】図４は、本発明の第１の実施例および第２
の実施例の受信側の可変レート音声復号器４の詳細ブロ
ック図であり、図１（Ｂ）の可変レート音声復号器４、
図２（Ｂ）の可変レート音声復号器４の具体的構成例で
ある。図において、３１はパラメータ分離・補間器、３
２は適応符号帳、３３は雑音符号帳、３４は利得制御
器、３５，３６は利得調整器、３７は加算器、３８は合
成フィルタ、３９はポストフィルタである。FIG. 4 shows a first embodiment and a second embodiment of the present invention.
2 is a detailed block diagram of the variable rate audio decoder 4 on the receiving side of the embodiment of FIG.
3 is a specific configuration example of the variable rate speech decoder 4 of FIG. 2 (B). In the figure, 31 is a parameter separator / interpolator, 3
2 is an adaptive codebook, 33 is a noise codebook, 34 is a gain controller, 35 and 36 are gain adjusters, 37 is an adder, 38 is a synthesis filter, and 39 is a post filter.

【００３５】同図で、パラメータ分離・補間器３１は、
可変レート受信フレームバッファ３または通信路復号器
６からの出力である符号化音声情報ａ２を入力し、各パ
ラメータを分離した後、伝送レート制御信号ｂ２（図１
ではｆ４、図２ではｇ３）を参照し、それが低レート伝
送のフレームであることを示しているとき、伝送されて
こなかった時間的変動の小さいクラス（クラスＣ、また
はクラスＢおよびＣ）の符号化音声情報に対しては、そ
の前後の伝送されたフレームの情報により補間して、ま
たは、直前に伝送されたフレームの情報をそのまま使用
することにより再生する。この詳細については後で説明
する。In the figure, the parameter separation / interpolator 31 is
The encoded voice information a2, which is the output from the variable rate reception frame buffer 3 or the channel decoder 6, is input, and after separating each parameter, the transmission rate control signal b2 (see FIG. 1).
F4, g3) in FIG. 2, and when it indicates that the frame is a low-rate transmission, it indicates that a class with a small time fluctuation (class C, or classes B and C) that has not been transmitted. The encoded audio information is reproduced by interpolating the information of the transmitted frames before and after it or by using the information of the frame transmitted immediately before as it is. The details will be described later.

【００３６】適応符号帳３２，雑音符号帳３３は送信側
の符号化器の符号帳１３，１４とそれぞれ同じ内容を有
しており、それぞれ伝送されてきたインデックスｃ２，
ｆ２が示す符号語ｋ２，ｊ２を出力する。利得制御器３
４は、送信側の利得調整器１５および１６（図３）の利
得インデックスｄ２，フレームパワー情報ｅ２を用いて
利得ｈ２，ｉ２を生成する。利得調整器３５，３６は利
得ｈ２，ｉ２を用いて符号語ｋ２，ｊ２の利得調整を行
う。加算器３７は利得調整された符号語ｌ２，ｍ２を加
算し、再生音源信号ｎ２を作って出力するとともに、適
応符号帳３２に再生音源信号ｎ２を入力し、符号帳３２
の中身を更新する。The adaptive codebook 32 and the noise codebook 33 have the same contents as the codebooks 13 and 14 of the encoder on the transmission side, and the transmitted indexes c2 and c2, respectively.
The code words k2 and j2 indicated by f2 are output. Gain controller 3
4 uses the gain indexes d2 and frame power information e2 of the gain adjusters 15 and 16 (FIG. 3) on the transmission side to generate gains h2 and i2. The gain adjusters 35 and 36 adjust the gain of the codewords k2 and j2 using the gains h2 and i2. The adder 37 adds the gain-adjusted codewords 12 and m2 to generate and output a reproduced excitation signal n2, and inputs the reproduced excitation signal n2 to the adaptive codebook 32, and the codebook 32.
Update the contents of.

【００３７】合成フィルタ３８は、スペクトル包絡情報
ｇ２により再生音源信号ｎ２にスペクトル包絡成分を付
加し、再生音声信号ｏ２を合成する。ポストフィルタ３
９は、聴感上の品質を向上させる処理を再生音声信号ｏ
２に施し、再生音声信号ｐ２を出力する。ｐ２は図１で
はｈ４、図２ではｊ３に対応する。The synthesis filter 38 adds a spectrum envelope component to the reproduced sound source signal n2 based on the spectrum envelope information g2 and synthesizes the reproduced sound signal o2. Post filter 3
Reference numeral 9 denotes a process of improving the perceptual quality of the reproduced voice signal o.
2 and outputs the reproduced audio signal p2. p2 corresponds to h4 in FIG. 1 and j3 in FIG.

【００３８】次に、図５を用いて本発明の可変レート音
声符号化方式の動作について説明する。同図（ａ）はフ
レーム番号Ｆn を示し、フレーム長は５msecとする。同
図（ｂ）はフレーム毎に抽出される符号化音声情報を表
１に従いクラス分けした結果を示し、各フレーム毎にク
ラスＡ，Ｂ，Ｃの符号化音声情報が得られる。同図
（ｃ）は伝送レート制御信号を示し、ここでは、高レー
ト，中レート，低レートの３段階の太線で示した伝送レ
ートをとるものとする。同図（ｄ）は表２に従って伝送
レート制御信号が示す伝送レートに対応して伝送する情
報をクラスＡ，Ｂ，Ｃのうちから選択する様子を示す。
ここで、斜線を施してある部分に位置する情報は伝送さ
れずに廃棄されることを示し、中速ではクラスＣが、低
速ではクラスＢ，Ｃが伝送されずに廃棄される。ここま
でが、図１（Ａ）の可変レート音声符号化器１および、
図２（Ａ）の可変レート音声符号化器１内での動作であ
る。Next, the operation of the variable rate speech coding system of the present invention will be described with reference to FIG. The figure (a) shows the frame number Fn, and the frame length is 5 msec. FIG. 11B shows the result of classifying the coded voice information extracted for each frame according to Table 1, and the coded voice information of classes A, B, and C is obtained for each frame. FIG. 7C shows a transmission rate control signal, and here, it is assumed that the transmission rate shown by the three-stage thick line of high rate, medium rate, and low rate is taken. FIG. 3D shows a state in which information to be transmitted corresponding to the transmission rate indicated by the transmission rate control signal is selected from classes A, B, and C according to Table 2.
Here, it is shown that the information located in the shaded area is discarded without being transmitted, and class C at medium speed and classes B and C are discarded at low speed without being transmitted. Up to this point, the variable rate speech encoder 1 of FIG.
This is the operation in the variable rate speech encoder 1 of FIG.

【００３９】次に、図５（ｅ），（ｆ）に、図１（Ｂ）
の可変レート音声復号器４、図２（Ｂ）の可変レート音
声復号器４内で実行される伝送されてこなかったクラス
ＢとＣの符号化音声情報の再生方法を示す。（ｅ）は矢
印のように伝送された前後のフレームにより補間する方
法を示す。補間は線形補間を用いる。例えば、フレーム
Ｆ２，Ｆ３，Ｆ４においてクラスＣの各パラメータは伝
送されないが、各パラメータ毎にフレームＦ１とＦ５の
情報により線形補間される。フレームＦ３のクラスＢ，
フレームＦ７のクラスＣについても同様にその前後のフ
レームのパラメータにより線形補間される。（ｆ）は矢
印のように伝送された直前のフレームの情報をそのまま
使用して補間する方法を示す。例えば、フレームＦ２，
Ｆ３，Ｆ４においてクラスＣの各パラメータは伝送され
ないが、各パラメータ毎にフレームＦ１の情報をそのま
ま用いる。フレームＦ３のクラスＢ，フレームＦ７のク
ラスＣについても同様に直前のフレームの情報を用いて
情報が再生される。Next, referring to FIGS. 5 (e) and 5 (f), FIG.
The variable rate audio decoder 4 of FIG. 2 and the method of reproducing coded audio information of classes B and C that have not been transmitted, which is executed in the variable rate audio decoder 4 of FIG. (E) shows a method of interpolating by the preceding and following frames transmitted as shown by the arrow. The interpolation uses linear interpolation. For example, each parameter of class C is not transmitted in the frames F2, F3, F4, but linear interpolation is performed for each parameter by the information of the frames F1 and F5. Class B of frame F3,
Similarly, the class C of the frame F7 is also linearly interpolated by the parameters of the frames before and after that. (F) shows a method of interpolation using the information of the immediately preceding frame transmitted as indicated by the arrow as it is. For example, the frame F2
Although the parameters of class C are not transmitted in F3 and F4, the information of frame F1 is used for each parameter as it is. For the class B of the frame F3 and the class C of the frame F7, information is similarly reproduced using the information of the immediately preceding frame.

【００４０】以上より、伝搬路状態が劣悪になり、適応
変調により伝送レートが低レートになったフレームにお
ける再生音声の品質は改善され、伝搬路状態に影響され
にくい通話品質を提供することができる。As described above, the quality of the reproduced voice in the frame in which the propagation path condition becomes poor and the transmission rate becomes low due to the adaptive modulation is improved, and it is possible to provide the speech quality which is hardly influenced by the propagation path condition. .

【００４１】次に、上述の方式を実際に使用する際の異
なる改良について説明する。例えば、伝搬状態の悪い状
態が長く続いた時、及び初めから悪い時の動作が問題と
なる。この場合、伝送レートとして中速または低速しか
選択されなくなるため、上記の方法では、クラスＣまた
はクラスＢの情報が伝送されなくなり、再生音声の生成
が不可能となる。この問題は、伝搬路状態の悪い状態が
長く続く場合、及び初めから悪い場合を検出した時、次
に高レートのフレームが来るまでの期間については、Ｎ
段階の伝送レートのうち最高レート以外においても全ク
ラスの符号化音声情報を伝送可能な音声符号化手段及び
復号化手段を用いることにより解決することができる。Next, different improvements in actual use of the above method will be described. For example, an operation becomes a problem when a bad propagation state continues for a long time and when the propagation state is bad from the beginning. In this case, since only medium speed or low speed is selected as the transmission rate, in the above method, the information of class C or class B is not transmitted, and it becomes impossible to generate the reproduced sound. The problem is that when the bad state of the propagation path continues for a long time, or when the bad state is detected from the beginning, the period until the next high-rate frame comes is N.
The problem can be solved by using the voice coding means and the decoding means capable of transmitting the coded voice information of all classes even at a rate other than the highest rate among the transmission rates of the stages.

【００４２】実施方法としては、可変レート音声符号化
器（図１の１）内のパラメータ分割・間引き器（図３の
２１）が伝送レート制御信号（図３のｏ１）を監視する
ことにより、伝搬状態の悪い状態が長く続く場合、及び
初めから悪い場合を検出する。この時、該パラメータ分
割・間引き器２１は、中速または低速でも全クラスの符
号化音声情報を伝送可能なように、中レートまたは低レ
ートで全パラメータを量子化し、全クラスの符号化音声
情報を伝送する。この低速度化の方法としては、次の２
つが考えられる。（１）各フレームにおける各パラメータに対するビット
割当て数を削減する。（２）各パラメータに対するビット割当て数は削減せず
にフレーム長を長くする。このうち、装置構成が比較的
簡単に実現可能な（１）のビット割当ての実施例を表３
に示す。As a method of implementation, the parameter divider / decimator (21 in FIG. 3) in the variable rate speech encoder (1 in FIG. 1) monitors the transmission rate control signal (o1 in FIG. 3), The case where the bad propagation state continues for a long time and the case where the propagation state is bad from the beginning are detected. At this time, the parameter divider / decimator 21 quantizes all the parameters at the medium rate or the low rate so that the encoded speech information of all classes can be transmitted even at medium speed or low speed, and the encoded speech information of all classes is quantized. To transmit. As a method for reducing the speed, the following 2
One can be considered. (1) The number of bit allocations for each parameter in each frame is reduced. (2) The frame length is increased without reducing the bit allocation number for each parameter. Among these, the embodiment of the bit allocation of (1) that can realize the device configuration relatively easily is shown in Table 3.
Shown in

【００４３】[0043]

【表３】 [Table 3]

【００４４】表３の、低レートにおいて、＊印を付した
パラメータに関しては、最初のフレームのみ通常の量子
化を行い、次のフレームからは前フレームとの差分を量
子化する。次に高レートのフレームが検出されるまでこ
の音声符号化手法を繰り返す。このようにすることによ
り、通話品質は多少低下するものの、伝搬状態の悪い状
態が長く続いた時、及び初めから悪い時でも再生音声の
生成が可能となる。At low rates in Table 3, for parameters marked with *, normal quantization is performed only in the first frame, and the difference from the previous frame is quantized from the next frame. This speech coding technique is then repeated until a high rate frame is detected. By doing so, although the call quality is somewhat deteriorated, it is possible to generate the reproduced voice even when the bad propagation state continues for a long time or when the propagation state is bad from the beginning.

【００４５】本発明の装置はＤＳＰ（デジタルシグナル
プロセッサ）１チップおよび簡単な周辺回路により、容
易に実現可能である。The device of the present invention can be easily realized by one DSP (digital signal processor) chip and a simple peripheral circuit.

【００４６】[0046]

【発明の効果】以上詳細に説明したように、本発明を実
施することにより、適応変調方式を用いた無線通信シス
テムにおいて、再生音声が伝搬路状態の影響を受けにく
くなって受聴者の不快感が軽減され、優れた通話品質を
保つことができるため、実用上の効果が大きい。As described in detail above, by carrying out the present invention, in a wireless communication system using an adaptive modulation system, reproduced voice is less affected by the state of the propagation path and the listener feels uncomfortable. Is reduced, and excellent call quality can be maintained, which has a great practical effect.

[Brief description of the drawings]

【図１】本発明の可変レート音声符号化方式の第１の実
施例を示す構成図である。FIG. 1 is a configuration diagram showing a first embodiment of a variable rate speech coding system according to the present invention.

【図２】本発明の可変レート音声符号化方式の第２の実
施例を示す構成図である。[Fig. 2] Fig. 2 is a configuration diagram showing a second embodiment of the variable rate speech coding system of the present invention.

【図３】本発明の図１，図２の可変レート音声符号化器
の構成例図である。FIG. 3 is a diagram showing a configuration example of the variable rate speech encoder of FIGS. 1 and 2 of the present invention.

【図４】本発明の図１，図２の可変レート音声符復号器
の構成例図である。FIG. 4 is a diagram showing a configuration example of the variable rate speech codec shown in FIGS. 1 and 2 of the present invention.

【図５】本発明の可変レート音声符号化方式の動作説明
図である。[Fig. 5] Fig. 5 is an operation explanatory diagram of the variable rate audio encoding system of the present invention.

[Explanation of symbols]

１可変レート音声符号化器２可変レート送信フレームバッファ３可変レート受信フレームバッファ４可変レート音声復号器５通信路符号化器６通信路復号器１１線形予測分析器１２フレームパワー計算器１３，３２適応符号帳１４，３３雑音符号帳１５，１６，３５，３６利得調整器１７，１９，３７加算器１８，３８合成フィルタ２０聴覚重み付き波形歪最小化制御器２１パラメータ分割・間引き器３１パラメータ分離・補間器３４利得制御器３９ポストフィルタ 1 variable rate speech encoder 2 variable rate transmission frame buffer 3 variable rate reception frame buffer 4 variable rate speech decoder 5 channel encoder 6 channel decoder 11 linear prediction analyzer 12 frame power calculator 13, 32 adaptive Codebook 14,33 Noise codebook 15,16,35,36 Gain adjuster 17,19,37 Adder 18,38 Synthesis filter 20 Auditory weighted waveform distortion minimization controller 21 Parameter division / decimation device 31 Parameter separation / Interpolator 34 Gain controller 39 Post filter

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁶ 識別記号庁内整理番号ＦＩ技術表示箇所Ｈ０４Ｌ 27/00 Ｈ０４Ｌ 27/00 Ｚ ─────────────────────────────────────────────────── ─── Continuation of the front page (51) Int.Cl. ⁶ Identification code Internal reference number FI Technical display location H04L 27/00 H04L 27/00 Z

Claims

[Claims]

1. A received signal is monitored to detect the state of a channel for each frame, a high-speed transmission rate is designated when the state of the channel is good, and the transmission rate is set to N as the state of the channel deteriorates. Communication of an adaptive modulation system provided with a propagation path state estimating means for outputting a transmission rate control signal for reducing the transmission speed in stages, and an adaptive modulation means and an adaptive demodulation means for changing the transmission rate to N stages by the transmission rate control signal. A variable-rate audio encoding system to be applied to a system, in which a transmitting side performs audio encoding processing on an input audio signal for each frame to extract encoded audio information, and then associates it with the transmission rate control signal. The coded voice information is classified into N stages from information with large temporal fluctuation to information with small fluctuation, and for the frame in which the transmission rate control signal shows the maximum rate. The encoded voice information of all classes is output to the adaptive modulation means side at a rate instructed by the transmission rate control signal, and as the rate indicated by the transmission rate control signal becomes lower, the temporal variation of Variable-rate audio that discards encoded audio information from a smaller class and outputs only the remaining encoded audio information of a class with large temporal fluctuation to the adaptive modulation means side at a rate indicated by the transmission rate control signal. The receiving side receives the coded voice information from the adaptive demodulation means, reproduces the coded voice information of all classes in the frame of high rate transmission based on the transmission rate control signal, and receives the low rate. The coded audio information of the class with small temporal fluctuation that was not transmitted in the transmission frame is converted into the transmitted information of the frames before and after the frame. After being interpolated or reproduced using the information of the frame transmitted immediately before, as it is, reproduced audio signal by performing audio decoding processing of the transmitted encoded audio information and the reproduced encoded audio information. A variable-rate speech encoding system comprising variable-rate speech decoding means for generating

2. The variable-rate speech coding means, when the transmission rate control signal of the N-step transmission rate output from the propagation path state estimating means is in a bad propagation path state for a long time or is bad from the beginning. When the transmission rate is other than the highest rate, which indicates the state, until the next high-rate frame is transmitted, all classes of encoded audio information are transmitted as in the case of the highest transmission rate. The variable rate speech coding system according to claim 1, wherein

3. The variable-rate speech coding means according to claim 1, wherein the variable-rate speech coding means and the variable-rate speech decoding means are constituted by a code-excited linear predictive coding method. method.

4. The N stages according to claim 1, wherein the transmission rate is set to three stages of high speed, medium speed, and low speed, and the coded voice information is classified into classes according to the transmission rate. In order of increasing dynamic variation, class 1 is classified into three stages: high-order component of spectrum envelope information, class 2 is low-order component of spectrum envelope information and pitch information, and class 3 is frame power information and sound source signal information. The variable rate speech coding system according to claim 1, which is characterized in that.

5. The N stages according to claim 1, wherein the transmission rate is set to three stages of high speed, medium speed, and low speed, and the coded voice information is classified into time by classifying the coded voice information corresponding to the transmission rate. In order of increasing dynamic variation, class 1 is classified into three stages: low-order component of spectrum envelope information, class 2 is high-order component of spectrum envelope information and pitch information, class 3 is frame power information and sound source signal information. The variable rate speech coding system according to claim 1, which is characterized in that.

6. The variable rate audio coding system according to claim 1, wherein an error correction / detection function is added to the coded audio information.

7. The transmission rate of the channel is changed by monitoring the received signal, detecting the state of the channel for each frame, and changing the modulation multi-value number in N stages for each frame in accordance with the state of the channel. A propagation path state estimating means for outputting a transmission rate control signal for designating a high transmission rate when the state is good and for reducing the transmission rate to N stages as the state of the propagation path deteriorates, and the transmission rate control signal. A variable-rate voice coding system for application to a communication system of an adaptive modulation system provided with adaptive modulation means and adaptive demodulation means for changing the transmission rate to N stages by The encoded audio information is extracted for each frame by extracting the encoded audio information, and then the encoded audio information is distributed from information having a large temporal variation to information having a small variation corresponding to the transmission rate control signal. Classify into N stages and output the coded voice information of all classes to the adaptive modulation means side at the rate indicated by the transmission rate control signal for the frame in which the transmission rate control signal shows the highest rate. However, as the rate indicated by the transmission rate control signal becomes lower, the coded speech information is discarded from the class having the smaller temporal variation according to the degree, and the remaining coded speech of the class having the large temporal variation. A variable rate audio encoding means for outputting only information to the adaptive modulation means side at a rate indicated by the transmission rate control signal is provided, and the receiving side receives the encoded audio information from the adaptive demodulation means, and outputs the transmission rate. Based on the control signal, the coded audio information of all classes is reproduced in the frame of high rate transmission, and the time that is not transmitted in the frame of low rate transmission. Codes that have been transmitted after interpolating the coded voice information of a class with small fluctuations with the information transmitted in the frames before and after the relevant frame, or after reproducing using the information in the frame transmitted immediately before the frame as it is A variable-rate audio encoding method comprising variable-rate audio decoding means for producing a reproduced audio signal by audio-decoding the encoded audio information and the reproduced encoded audio information.