CA2513842A1 - Apparatus and method for speech coding - Google Patents
Apparatus and method for speech coding Download PDFInfo
- Publication number
- CA2513842A1 CA2513842A1 CA002513842A CA2513842A CA2513842A1 CA 2513842 A1 CA2513842 A1 CA 2513842A1 CA 002513842 A CA002513842 A CA 002513842A CA 2513842 A CA2513842 A CA 2513842A CA 2513842 A1 CA2513842 A1 CA 2513842A1
- Authority
- CA
- Canada
- Prior art keywords
- excitation vector
- stochastic
- adaptive
- speech
- codebook
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Landscapes
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
A speech encoder comprising an LPC synthesizing means, a gain calculating means and a parameter coding means for performing predictive coding of gains of an adaptive excitation vector and a stochastic excitation vector associated with indexes searched in the gain calculating means is provided. The parameter coding means comprises prediction coefficient adjusting means for adjusting one or more prediction coefficients used in the predictive coding according to one or more past subframe states. The speech encoder automatically adjusts prediction coefficients when the state value in a preceding subframe is an extremely large value or an extremely small value in predictive quantization with less including local abnormal sounds.
Claims (8)
1. A speech encoder comprising:
LPC synthesizing means for obtaining a synthesized speech by filtering adaptive excitation vector and stochastic excitation vector stored in an adaptive codebook and stochastic codebook using an LPC coefficients obtained from an input speech;
gain calculating means for calculating gains of said adaptive excitation vector and said stochastic excitation vector and searching code of the adaptive excitation vector and searching code of the stochastic excitation vector using coding distortion between said input speech and said synthesized speech obtained using said gains; and parameter coding means for performing predictive coding of gains using the adaptive excitation vector and stochastic excitation vector corresponding to the codes obtained, wherein said parameter coding means comprises prediction coefficient adjusting means for adjusting one or more prediction coefficients used for said predictive coding according to one or more states of a previous subframe.
LPC synthesizing means for obtaining a synthesized speech by filtering adaptive excitation vector and stochastic excitation vector stored in an adaptive codebook and stochastic codebook using an LPC coefficients obtained from an input speech;
gain calculating means for calculating gains of said adaptive excitation vector and said stochastic excitation vector and searching code of the adaptive excitation vector and searching code of the stochastic excitation vector using coding distortion between said input speech and said synthesized speech obtained using said gains; and parameter coding means for performing predictive coding of gains using the adaptive excitation vector and stochastic excitation vector corresponding to the codes obtained, wherein said parameter coding means comprises prediction coefficient adjusting means for adjusting one or more prediction coefficients used for said predictive coding according to one or more states of a previous subframe.
2. The speech encoder according to claim 1, wherein when one or more states of a previous subframe are an extremely large value or an extremely small value, said prediction coefficient adjusting means adjusts said prediction coefficients so as to reduce the influence thereof.
3. The speech encoder according to claim 1, wherein said parameter coding means has a codebook including gain vectors of the adaptive excitation vectors, logarithmic gain vectors of the stochastic excitation vectors and coefficients for adjusting the prediction coefficient.
4. The speech encoder according to claim 3, wherein in predicting coding when a product sum between states and prediction coefficients are calculated, prediction coefficient adjustment coefficients corresponding to the states are multiplied.
5. The speech encoder according to claim 1, further comprising storing means for storing said adaptive excitation vector, said stochastic excitation vector and prediction coefficient adjustment coefficient in accordance with each state.
6. The speech encoder according to claim 5, wherein when said adaptive excitation vector and said stochastic excitation vector stored in said storing means are updated, said prediction coefficient adjustment coefficients are also updated.
7. A CELP-based speech encoder that performs encoding by decomposing one frame into a plurality of subframes, comprising:
LPC synthesizing means for obtaining a synthesized speech by filtering adaptive excitation vector and stochastic excitation vector stored in an adaptive codebook and stochastic codebook using LPC coefficients obtained from an input speech;
gain calculating means for calculating gains of said adaptive excitation vector and said stochastic excitation vector; and parameter coding means for performing vector quantization of the adaptive excitation vector and stochastic excitation vector obtained using coding distortion between said input speech and said synthesized speech and said gains, and further comprising:
pitch analyzing means for performing pitch analyses of a plurality of subframes in the frame respectively, before performing an adaptive codebook search for the first subframe, finding a correlation value and calculating a value most approximate to the pitch period using said correlation values.
LPC synthesizing means for obtaining a synthesized speech by filtering adaptive excitation vector and stochastic excitation vector stored in an adaptive codebook and stochastic codebook using LPC coefficients obtained from an input speech;
gain calculating means for calculating gains of said adaptive excitation vector and said stochastic excitation vector; and parameter coding means for performing vector quantization of the adaptive excitation vector and stochastic excitation vector obtained using coding distortion between said input speech and said synthesized speech and said gains, and further comprising:
pitch analyzing means for performing pitch analyses of a plurality of subframes in the frame respectively, before performing an adaptive codebook search for the first subframe, finding a correlation value and calculating a value most approximate to the pitch period using said correlation values.
8. The speech encoder according to claim 7, further comprising search range setting means for determining a lag search range of a plurality of subframes based on the said adaptive codebook and said stochastic codebook using decoded LPC coefficients obtained from an input speech;
calculating gains of said adaptive excitation vector and said stochastic excitation vector;
performing vector quantization on the adaptive excitation vector and stochastic excitation vector determined using coding distortion between said input speech and said synthesized speech; and calculating correlation values by performing pitch analyses of a plurality of subframes in the processing frame before performing an adaptive codebook search of the first subframe and calculating a value most approximate to the pitch period using said correlation values.
calculating gains of said adaptive excitation vector and said stochastic excitation vector;
performing vector quantization on the adaptive excitation vector and stochastic excitation vector determined using coding distortion between said input speech and said synthesized speech; and calculating correlation values by performing pitch analyses of a plurality of subframes in the processing frame before performing an adaptive codebook search of the first subframe and calculating a value most approximate to the pitch period using said correlation values.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA2722110A CA2722110C (en) | 1999-08-23 | 2000-08-23 | Apparatus and method for speech coding |
Applications Claiming Priority (7)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP23505099 | 1999-08-23 | ||
JP11-235050 | 1999-08-23 | ||
JP23672899 | 1999-08-24 | ||
JP11-236728 | 1999-08-24 | ||
JP11-248363 | 1999-09-02 | ||
JP24836399 | 1999-09-02 | ||
CA002348659A CA2348659C (en) | 1999-08-23 | 2000-08-23 | Apparatus and method for speech coding |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002348659A Division CA2348659C (en) | 1999-08-23 | 2000-08-23 | Apparatus and method for speech coding |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA2722110A Division CA2722110C (en) | 1999-08-23 | 2000-08-23 | Apparatus and method for speech coding |
Publications (2)
Publication Number | Publication Date |
---|---|
CA2513842A1 true CA2513842A1 (en) | 2001-03-01 |
CA2513842C CA2513842C (en) | 2011-06-21 |
Family
ID=35206791
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA2513842A Expired - Fee Related CA2513842C (en) | 1999-08-23 | 2000-08-23 | Apparatus and method for speech coding |
CA2514249A Expired - Fee Related CA2514249C (en) | 1999-08-23 | 2000-08-23 | A speech coding system using a dispersed-pulse codebook |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA2514249A Expired - Fee Related CA2514249C (en) | 1999-08-23 | 2000-08-23 | A speech coding system using a dispersed-pulse codebook |
Country Status (1)
Country | Link |
---|---|
CA (2) | CA2513842C (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111554316A (en) * | 2019-01-24 | 2020-08-18 | 富士通株式会社 | Speech processing apparatus, method and medium |
-
2000
- 2000-08-23 CA CA2513842A patent/CA2513842C/en not_active Expired - Fee Related
- 2000-08-23 CA CA2514249A patent/CA2514249C/en not_active Expired - Fee Related
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111554316A (en) * | 2019-01-24 | 2020-08-18 | 富士通株式会社 | Speech processing apparatus, method and medium |
Also Published As
Publication number | Publication date |
---|---|
CA2514249C (en) | 2010-11-23 |
CA2514249A1 (en) | 2001-03-01 |
CA2513842C (en) | 2011-06-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US5208862A (en) | Speech coder | |
US7359855B2 (en) | LPAS speech coder using vector quantized, multi-codebook, multi-tap pitch predictor | |
CA2722110A1 (en) | Apparatus and method for speech coding | |
KR100304682B1 (en) | Fast Excitation Coding for Speech Coders | |
CA2061830C (en) | Speech coding system | |
EP0751496A3 (en) | Speech coding method and apparatus for the same | |
JP4059350B2 (en) | Gain quantization method in analytic synthesis linear predictive speech coding | |
JPH056199A (en) | Voice parameter coding system | |
JP3628268B2 (en) | Acoustic signal encoding method, decoding method and apparatus, program, and recording medium | |
JPH08272395A (en) | Voice encoding device | |
KR100421648B1 (en) | An adaptive criterion for speech coding | |
JPH05216500A (en) | Speech encoding device | |
US5797119A (en) | Comb filter speech coding with preselected excitation code vectors | |
US8825475B2 (en) | Transform-domain codebook in a CELP coder and decoder | |
EP0899720B1 (en) | Quantization of linear prediction coefficients | |
US6470312B1 (en) | Speech coding apparatus, speech processing apparatus, and speech processing method | |
CA2513842A1 (en) | Apparatus and method for speech coding | |
Gerson et al. | A 5600 bps VSELP speech coder candidate for half-rate GSM | |
JPH06282298A (en) | Voice coding method | |
JPH08292797A (en) | Voice encoding device | |
JP2613503B2 (en) | Speech excitation signal encoding / decoding method | |
EP0658877A2 (en) | Speech coding apparatus | |
EP1355298B1 (en) | Code Excitation linear prediction encoder and decoder | |
US7716045B2 (en) | Method for quantifying an ultra low-rate speech coder | |
JP3194930B2 (en) | Audio coding device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
EEER | Examination request | ||
MKLA | Lapsed |
Effective date: 20190823 |