WO2010016270A1 - 量子化装置、符号化装置、量子化方法及び符号化方法 - Google Patents
量子化装置、符号化装置、量子化方法及び符号化方法 Download PDFInfo
- Publication number
- WO2010016270A1 WO2010016270A1 PCT/JP2009/003798 JP2009003798W WO2010016270A1 WO 2010016270 A1 WO2010016270 A1 WO 2010016270A1 JP 2009003798 W JP2009003798 W JP 2009003798W WO 2010016270 A1 WO2010016270 A1 WO 2010016270A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- signal
- power
- value
- correlation
- correlation value
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims description 38
- 238000013139 quantization Methods 0.000 claims abstract description 63
- 238000004364 calculation method Methods 0.000 claims description 68
- 230000014509 gene expression Effects 0.000 claims description 8
- 238000009499 grossing Methods 0.000 description 23
- 230000006870 function Effects 0.000 description 20
- 238000010586 diagram Methods 0.000 description 7
- 238000006243 chemical reaction Methods 0.000 description 6
- 230000000694 effects Effects 0.000 description 5
- 238000004091 panning Methods 0.000 description 5
- 230000005236 sound signal Effects 0.000 description 5
- 238000001228 spectrum Methods 0.000 description 5
- 238000005516 engineering process Methods 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- 238000004891 communication Methods 0.000 description 3
- 230000010354 integration Effects 0.000 description 3
- 238000010295 mobile communication Methods 0.000 description 3
- 238000003860 storage Methods 0.000 description 3
- 230000002123 temporal effect Effects 0.000 description 3
- 230000005540 biological transmission Effects 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000006866 deterioration Effects 0.000 description 1
- 210000005069 ears Anatomy 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 230000010365 information processing Effects 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
Definitions
- the present invention relates to a quantization apparatus, an encoding apparatus, a quantization method, and an encoding method, for example, a quantization apparatus and an encoding method using an intensity stereo method, which is a method for encoding a stereo sound signal at a low bit rate.
- the present invention relates to a quantization apparatus and a quantization method.
- the intensity stereo system is known as a system for encoding stereo sound signals at a low bit rate.
- a monaural signal hereinafter referred to as “M signal”
- L signal left channel signal
- R signal right channel signal
- Such a method is also called amplitude panning.
- the most basic method of amplitude panning is to obtain an L signal and an R signal by multiplying an M signal in the time domain by an amplitude panning gain coefficient (balance weight coefficient) (for example, Non-Patent Document 1).
- Non-Patent Document 2 there is a method of obtaining an L signal and an R signal by multiplying an M signal by a balance weight coefficient for each frequency component or frequency group in the frequency domain (for example, Non-Patent Document 2).
- the encoding of the stereo signal can be realized by encoding the balance weight coefficient as the parametric stereo encoding parameter (for example, Patent Document 1 and Patent Document 2).
- the balance weight coefficient is described as a balance parameter in Patent Document 1 and as an ILD (level difference) in Patent Document 2.
- Non-Patent Documents 1 and 2 and Patent Documents 1 and 2 Conventionally, as in Non-Patent Documents 1 and 2 and Patent Documents 1 and 2, efficient encoding of stereo signals of L and R signals has been performed.
- Patent Document 1 discloses that the ratio of left and right volume, which is a balance weighting coefficient in intensity stereo, is obtained and then the ratio is encoded.
- Patent Document 1 discloses that the ratio of the left and right volume is obtained and then the ratio is encoded.
- a complicated arithmetic “divide” is used in order to obtain the volume ratio. The amount of calculation is increasing.
- An object of the present invention is to provide a quantization device, an encoding device, a quantization method, and an encoding method capable of performing more efficient quantization by reducing the amount of calculation in the quantization of the balance weight coefficient. It is.
- the quantization apparatus quantizes two coefficients that adjust the balance of the amplitude of the third signal obtained by using the result of downmixing the first signal and the second signal.
- An apparatus wherein three signals of the first signal, the second signal, and the third signal are inputted, a first correlation value between the first signal and the third signal, and the A power / correlation calculation means for calculating a second correlation value between the second signal and the third signal and calculating a first power of the third signal, and a first intermediate using the first power
- Intermediate value calculating means for calculating a second intermediate value using at least one correlation value of the first correlation value and the second correlation value and the first power, and a plurality of scalar values Is stored on the basis of the code book in which the first intermediate value and the second intermediate value are stored.
- a code corresponding to the obtained scalar value obtained by searching a balance weight coefficient for adjusting the balance of the amplitude of the third signal with respect to the signal from the plurality of scalar values stored in the codebook. And a search means for obtaining the above.
- the encoding apparatus includes a downmix unit that generates a third signal using a result of downmixing a first signal and a second signal, the first signal, and the second signal. And the third signal, and a quantization means for outputting a code obtained by performing quantization on two coefficients for adjusting the amplitude balance of the third signal, and the first signal
- a first balance weight coefficient for adjusting the balance of the amplitude of the third signal with respect to the signal is determined using the sign, and the balance of the amplitude of the third signal with respect to the second signal is determined.
- a first target signal is generated and the first target signal is generated.
- Encoding means for encoding a signal, generating a second target signal using the second signal, the third signal, and the second balance weight coefficient, and encoding the second target signal
- the quantization means calculates a first correlation value between the first signal and the third signal and a second correlation value between the second signal and the third signal.
- a power / correlation calculating means for calculating a first power of the third signal, a first intermediate value using the first power, and at least one of the first correlation value and the second correlation value.
- An intermediate value calculation means for calculating a second intermediate value using one correlation value and the first power, a codebook storing a plurality of scalar values, the first intermediate value and the second intermediate value Based on the first balance weighting factor of the plurality of scalar values.
- search from the adopts a configuration comprising a search means for obtaining the code corresponding to the obtained scalar value, a.
- the quantization method of the present invention quantizes two coefficients that adjust the balance of the amplitude of the third signal obtained by using the result of downmixing the first signal and the second signal.
- the method includes inputting three signals, the first signal, the second signal, and the third signal, and a first correlation value between the first signal and the third signal; Calculating a second correlation value between the second signal and the third signal and calculating a first power of the third signal; and a first intermediate using the first power
- the balance weight coefficient for balancing width by searching among a plurality of scalar values stored in codebook, a search step to obtain a code corresponding to the obtained scalar value, and to have.
- the encoding method of the present invention includes a downmix step of generating a third signal by using a result of downmixing a first signal and a second signal, the first signal, and the second signal. And the third signal, and a quantization step for outputting a code obtained by performing quantization on two coefficients for adjusting the amplitude balance of the third signal;
- a first balance weight coefficient for adjusting the balance of the amplitude of the third signal with respect to the signal is determined using the sign, and the balance of the amplitude of the third signal with respect to the second signal is determined.
- a coefficient determining step for calculating a second balance weight coefficient to be adjusted using the first balance weight coefficient, and using the first signal, the third signal, and the first balance weight coefficient.
- the quantization step includes calculating a first correlation value between the first signal and the third signal and a second correlation value between the second signal and the third signal. Calculating a power / correlation calculating step of calculating a first power of the third signal; calculating a first intermediate value using the first power; and calculating the first correlation value and the second correlation value.
- An intermediate value calculating step of calculating a second intermediate value using at least one correlation value of the first power and the first power, and the first balance weight coefficient based on the first intermediate value and the second intermediate value Multiple scans stored in the codebook. By searching from the La value, a search step to obtain the code corresponding to the obtained scalar value, and to have.
- the block diagram which shows the structure of the encoding apparatus which concerns on Embodiment 1 and 2 of this invention The block diagram which shows the structure of the quantization apparatus which concerns on Embodiment 1 and 2 of this invention
- Embodiment 1 a configuration for performing encoding and decoding using panning (hereinafter referred to as “balance adjustment”) will be described using the following configuration. That is, in ISO / IEC 14496-3: 1999 (E) “MPEG-2”, p.232, FIG.B.13 (hereinafter referred to as Non-Patent Document 3), Part of the configuration of the encoder widely used as AAC (Advanced Audio Codec), which is the standard system of MPEG-2 and MPEG-4 (the part that generates the side signal from the configuration of the left half of FIG.
- AAC Advanced Audio Codec
- the stereo signal allows the listener to enjoy realistic sound by putting different acoustic signals into the listener's left and right ears. Therefore, the simplest stereo signal in the audio signal that is the content is the case of two channels of the L signal and the R signal, and in this embodiment, the case where the input signal is two channels will be described.
- FIG. 1 is a block diagram showing a configuration of encoding apparatus 100 according to the present embodiment.
- FIG. 1 is a diagram for encoding a stereo signal in a scalable (multi-layer structure), and using a decoded signal generated by encoding an M signal with a core encoder and then decoding with a core decoder, A configuration is adopted in which a stereo signal is encoded in a region.
- the encoding apparatus 100 includes a downmix unit 101, a core encoder 102, a core decoder 103, a modified discrete cosine transform (hereinafter referred to as “MDCT (Modified (Discrete Cosine Transform)”) unit 104, an MDCT unit 105, and an MDCT unit.
- MDCT Modified (Discrete Cosine Transform)
- 106 a downmix unit 107, an adder unit 108, a quantizer 109, a multiplier unit 110, a multiplier unit 111, an adder unit 112, an adder unit 113, an encoder 114, an encoder 115, and an encoder 116. Is done.
- the downmix unit 101 inputs an L signal (first signal) and an R signal (second signal), which are vectors of a predetermined length, and downmixes the input L signal and R signal. Thus, the M signal (third signal) is obtained. Then, the downmix unit 101 outputs the obtained M signal to the core encoder 102.
- Expression (1) shows an example of a downmix calculation method in the downmix unit 101. In the present embodiment, the simplest downmix method of adding the L signal and the R signal and multiplying by 0.5 is used.
- the core encoder 102 encodes the M signal input from the downmix unit 101 to obtain a code, and outputs the obtained code to the core decoder 103 and the multiplexing unit 117.
- Core decoder 103 decodes the code input from core encoder 102 to generate a decoded signal, and outputs the generated decoded signal to MDCT section 105.
- the MDCT unit 104 receives an L signal, performs discrete cosine transform on the input L signal, and converts the signal in the time domain (time domain) to a signal in the frequency domain (frequency domain) (frequency spectrum). MDCT section 104 then outputs the converted signal to downmix section 107, addition section 112, and quantization apparatus 109.
- the MDCT unit 105 performs discrete cosine transform on the decoded signal input from the core decoder 103, and converts the signal in the time domain (time domain) into a signal in the frequency domain (frequency domain) (frequency spectrum). Then, MDCT unit 105 outputs the converted signal to addition unit 108.
- the MDCT unit 106 receives an R signal, performs discrete cosine transform on the input R signal, and converts the signal in the time domain (time domain) into a signal in the frequency domain (frequency domain) (frequency spectrum). MDCT section 106 then outputs the converted signal to downmix section 107, addition section 113, and quantization apparatus 109.
- the downmix unit 107 downmixes the L signal input from the MDCT unit 104 and the R signal input from the MDCT unit 106 to obtain an M signal. Then, the downmix unit 107 outputs the obtained M signal to the adder unit 108.
- the downmix unit 107 is different from the downmix unit 101 in that it does not downmix the time domain signal but downmix the frequency domain signal. Note that the downmix calculation method is the same as that in Equation (1), and thus the description thereof is omitted.
- the adding unit 108 subtracts the signal input from the MDCT unit 105 from the M signal input from the downmix unit 107 to calculate a target M signal (hereinafter referred to as “target M signal”). Then, the adding unit 108 outputs the calculated target M signal to the multiplying unit 110, the multiplying unit 111, the encoder 115, and the quantizing device 109.
- target M signal a target M signal
- the quantizer 109 encodes a balance weight coefficient used for balance adjustment using the L signal input from the MDCT unit 104, the target M signal input from the adder unit 108, and the R signal input from the MDCT unit 106. Find the sign of the weighting factor. Further, the quantization device 109 outputs the obtained code to the multiplexing unit 117.
- the quantizing device 109 uses the acquired L signal balance weight coefficient w L to adjust the balance of the amplitude of the target M signal with respect to the R signal, the balance weight coefficient w R (hereinafter referred to as “R signal balance weight coefficient w R ”) and the balance weight coefficient w R of the obtained R signal is set in the multiplier 111.
- R signal balance weight coefficient w R the balance weight coefficient w R of the obtained R signal is set in the multiplier 111.
- the multiplier 110 multiplies the target M signal input from the adder 108 by the balance weight coefficient w L of the L signal input from the quantizer 109 and outputs the result to the adder 112.
- Multiplier 111 the target M signal input from the adder 108, and outputs the multiplied balance weight coefficient w R of the R signal input from the quantizer 109 to the adder 113.
- the adder 112 subtracts the target M signal, which is input from the multiplier 110 and multiplied by the balance weight coefficient w L of the L signal, from the L signal input from the MDCT unit 104 to obtain a target L signal (hereinafter referred to as “target L”). Signal)). Then, the adding unit 112 outputs the obtained target L signal to the encoder 114.
- target L a target L signal
- the adder 113 subtracts the target M signal input from the multiplier 111 and multiplied by the balance weight coefficient w R of the R signal from the R signal input from the MDCT unit 106 to obtain a target R signal (hereinafter referred to as “target R”). Signal)). Then, the adding unit 113 outputs the obtained target R signal to the encoder 116. Calculations in the adding unit 112 and the adding unit 113 are shown in Equation (2).
- the above algorithm corresponds to the conversion between L signal and R signal using balance adjustment.
- the balance weight coefficient represents the similarity between the target M signal and the L signal or R signal. Therefore, the target L signal and the target R signal obtained by subtracting the target M signal multiplied by the balance weight coefficient from the L signal and the R signal are signals in which redundant portions are omitted by the target M signal, and the power as the signal is increased. Since both are reduced, both can be efficiently encoded.
- the encoder 114 encodes the target L signal input from the adding unit 112 and outputs the code obtained by the encoding to the multiplexing unit 117.
- the encoder 115 encodes the target M signal input from the adding unit 108 and outputs a code obtained by encoding to the multiplexing unit 117.
- the encoder 116 encodes the target R signal input from the adder 113 and outputs a code obtained by encoding to the multiplexer 117.
- the multiplexing unit 117 multiplexes the codes input from the core encoder 102, the quantization device 109, the encoder 114, the encoder 115, and the encoder 116, and outputs a multiplexed bit stream.
- FIG. 2 is a block diagram showing the configuration of the quantization device 109.
- the quantizing device 109 mainly includes a power / correlation calculation unit 201, an intermediate value calculation unit 202, a code book 203, a search unit 204, and a decoding unit 205.
- the power / correlation calculation unit 201 uses the L signal input from the MDCT unit 104, the target M signal input from the addition unit 108, and the R signal input from the MDCT unit 106 to perform power calculation and correlation value calculation. Do. Then, the power / correlation calculation unit 201 outputs the calculated power and the correlation value to the intermediate value calculation unit 202.
- the power and the correlation value can be obtained by equation (3).
- the intermediate value calculation unit 202 obtains two intermediate values using the power and the correlation value input from the power / correlation calculation unit 201. Then, intermediate value calculation section 202 outputs the obtained intermediate value to search section 204.
- the intermediate value can be obtained by equation (4).
- the code book 203 is information stored in a storage means such as a ROM (Read Only Memory), and is composed of a plurality of scalar values selected as weighting factors for the L signal.
- FIG. 3 is a diagram showing an example of scalar values stored in the code book 203 numbered in the present embodiment.
- the scalar value stored in the codebook 203 is a value only on the L side of the balance weight coefficient.
- the search unit 204 searches for an optimum one from a plurality of scalar values stored in the codebook 203, and encodes a balance weight coefficient by selecting a number corresponding to the optimum scalar value found by the search. To do. As a specific example, the search unit 204 searches for a number N that minimizes the cost function shown in Equation (5). Then, the search unit 204 outputs the selected number N as a code to the multiplexing unit 117. In addition, the search unit 204 outputs the code output to the multiplexing unit 117 to the decoding unit 205.
- the scalar value stored in the codebook 203 is squared. In this case, by storing the squared value in the codebook 203 in advance, the amount of calculation is further reduced. Can be searched.
- N is the sign of the balance weight coefficient of the L signal
- w L and w R are the decoded balance weight coefficients.
- the constant 2.0 is a value set according to the quantitative relationship between the amplitudes of the signals during the downmix in the downmix unit 101. The reason why the balance weight coefficient of the R signal is obtained by subtracting the balance weight coefficient of the L signal from the constant 2.0 will be described later.
- the decoding unit 205 sets the balance weight coefficient of the L signal in the multiplication unit 110 and sets the balance weight coefficient of the R signal in the multiplication unit 111.
- the M signal is an average value of the L signal and the R signal.
- equation (8) is obtained.
- equation (6) the balance weight coefficient that minimizes the power of the equation on the R signal side is as in equation (9).
- the M signal has the relationship of the expression (1)
- the addition result of the balance weight coefficient of the L signal and the balance weight coefficient of the R signal is expressed by the expression (10) from the expressions (1) and (3). become that way.
- the target M signal is not a simple relationship as shown in equation (1), but is quantized in a scalable manner as shown in FIG. Assuming that it is dominant, the balance weight coefficient is quantized in the relationship of equation (10). With this assumption, the number of parameters to be quantized (encoded) can be reduced to one, so that encoding at a low bit rate is possible.
- the third term is irrelevant to the balance weight coefficient w L of the L signal and is omitted, and only the sum of the first term and the second term is used as the cost function.
- Each value multiplied by each balance weight coefficient becomes two intermediate values shown in the equation (4). Further, the smaller this cost function is, the smaller the total power of the target L signal and the target R signal can be, and the search for the balance weight coefficient w L of such L signal is the optimal balance weight coefficient. Is quantized (encoded).
- the power of the target L signal and the power of the target R signal can be reduced, and good quality speech is transmitted at a low bit rate. can do.
- the encoder used is a codec simulator that performs scalable spectrum quantization of a stereo signal (16 kHz sampling) similar to Non-Patent Document 3.
- the evaluation data is data (24 seconds) appended with 6 voices uttered from various sound source positions.
- the number of quantization bits of the balance weight coefficient is 4 bits.
- the balance weighting coefficient itself is not calculated, and calculation that increases the amount of calculation such as division that is a complex arithmetic as in Patent Document 1 is not performed.
- the number of numbers and scalar values stored in the codebook 203 are relatively small, such as 16 types that can specify a number with 4 bits.
- the present invention by not calculating the balance weight coefficient itself, the amount of calculation in quantization is reduced, and more efficient quantization can be performed.
- the present embodiment is characterized in that, when encoding and decoding are performed using balance adjustment, the quantization apparatus performs calculations different from those in the first embodiment.
- the configuration of the encoding apparatus is the same as that in FIG.
- the configuration of the quantization device is the same as that in FIG. In the following description, description will be made using the reference numerals in FIGS.
- the power / correlation calculation unit 201 uses the L signal input from the MDCT unit 104, the target M signal input from the addition unit 108, and the R signal input from the MDCT unit 106 to perform power calculation and correlation value calculation. Do. Then, the power / correlation calculation unit 201 outputs the calculated power and the correlation value to the intermediate value calculation unit 202. The power / correlation calculation unit 201 obtains the power and the correlation value by the equation (12).
- ⁇ , ⁇ , and ⁇ indicating the ratio of adding power components may be variables, constants, or different numerical values.
- ⁇ , ⁇ , and ⁇ are set to constants, it is confirmed by experiment that good performance can be obtained by setting the three ⁇ , ⁇ , and ⁇ to about 0.25 in advance. Yes.
- the adjustment power of the target M signal, the adjustment correlation value between the target M signal and the L signal, and the adjustment correlation value between the target M signal and the R signal are the correlation between the power of the target M signal and the target M signal and the L signal.
- the correlation value with the signal is redefined, and the adjustment correlation value between the target M signal and the R signal is redefined as the correlation value between the target M signal and the R signal.
- the power / correlation calculation unit 201 performs smoothing to suppress temporal variation of the variables.
- the power / correlation calculation unit 201 performs the calculation according to the equation (13), and performs smoothing by applying the result of the equation (13) to the equation (14) and updating each state.
- each state is a variable stored in a static memory area during the encoding process. Therefore, when starting the encoding process, it is necessary to initialize the three states to “0”. Further, ⁇ indicating the smoothing ratio may be a variable or a constant. As an example, it has been experimentally confirmed that good performance can be obtained when ⁇ is set to 0.5 to 0.7. Note that the power / correlation calculation unit 201 does not perform smoothing when ⁇ is 1.0.
- the smoothing power of the target M signal, the smoothing correlation value of the target M signal and the L signal, and the smoothing correlation value of the target M signal and the R signal are the power of the target M signal, the target M signal and the L signal.
- the correlation value between the target M signal and the R signal, the power state of the target M signal, the state of the correlation value between the target M signal and the L signal, and the correlation value between the target M signal and the R signal are the power of the target M signal, the target M signal and the L signal.
- the smoothing power of the target M signal is redefined as the power of the target M signal
- the target M signal and the L signal are
- the smoothing correlation value is redefined as the correlation value between the target M signal and the L signal
- the smoothing correlation value between the target M signal and the R signal is redefined as the correlation value between the target M signal and the R signal.
- intermediate value calculation section 202 the processing in intermediate value calculation section 202, code book 203, search section 204, and decoding section 205 is the same as that in the first embodiment, and the description thereof is omitted. .
- the present embodiment is different from the first embodiment in that the power of the L signal or the power of the R signal in the equation (12) is added.
- the power of the L signal or the power of the R signal in the equation (12) is added.
- equation (12) can be derived. Experiments have verified that good sound quality can be obtained particularly when the transmission rate is low (when the coding distortion is large).
- the addition of the values of the power terms other than the cross term C LR is the addition of the power of the existing signal, so that it is necessary to greatly increase the amount of calculation required for weight quantization. Don't be. Therefore, a large effect can be obtained with a small increase in calculation amount.
- the influence of the cross term between a plurality of signals is reduced, so that the quantization error is relatively large. It is possible to avoid an uncomfortable sound quality in which the pressure changes extremely, and to suppress an increase in the amount of calculation and obtain a good sound quality.
- the present embodiment is characterized in that when encoding and decoding are performed using balance adjustment, the quantization device performs calculations different from those in the first and second embodiments.
- the configuration of the encoding apparatus is the same as that in FIG.
- the configuration of the quantization device is the same as that in FIG. In the following description of the quantization apparatus, description will be made using the reference numerals in FIGS. 1 and 2.
- the power / correlation calculation unit 201 uses the L signal input from the MDCT unit 104, the target M signal input from the addition unit 108, and the R signal input from the MDCT unit 106 to perform power calculation and correlation value calculation. Do. Then, the power / correlation calculation unit 201 outputs the calculated power and the correlation value to the intermediate value calculation unit 202.
- the power / correlation calculation unit 201 obtains the power and the correlation value by the equation (12) or the following equation (17).
- the equation (17) is an algorithm corresponding to the first embodiment
- the equation (12) is an algorithm corresponding to the second embodiment.
- the power / correlation calculation unit 201 when the power / correlation calculation unit 201 obtains the power and the correlation value by the equation (12), the power / correlation calculation unit 201 suppresses the temporal variation of the variable in the equation (12). Smoothing is performed as shown in the equation. Further, when the power / correlation calculation unit 201 obtains the power and the correlation value by the equation (17), the power / correlation calculation unit 201 performs the calculation by the equation (18) in order to suppress the temporal variation of the variable in the equation (17). , (18) is applied to the equation (19), and each state is updated to perform smoothing.
- the smoothing power of the target M signal, the smoothing correlation value of the target M signal and the L signal, the smoothing correlation value of the target M signal and the R signal, the smoothing power of the L signal, and the smoothing power of the R signal Is the target M signal power, the correlation value between the target M signal and the L signal, the correlation value between the target M signal and the R signal, the power of the L signal, and the power of the R signal, the power state of the target M signal, Smoothed using target M signal and L signal correlation value state, target M signal and R signal correlation value state, L signal power state, R signal power state, and smoothing ratio Therefore, in the following description, the smoothing power of the target M signal is redefined as the power of the target M signal, and the smoothing correlation value between the target M signal and the L signal is the correlation between the target M signal and the L signal.
- the smoothing correlation value between the target M signal and the R signal is redefined as the correlation value between the target M signal and the R signal
- the smoothing power of the L signal is redefined as the power of the L signal
- the smoothing of the R signal is performed. The description will be made by redefining the power to be the power of the R signal.
- the intermediate value calculation unit 202 obtains five intermediate values using the power and the correlation value input from the power / correlation calculation unit 201. Then, intermediate value calculation section 202 outputs the obtained intermediate value to search section 204.
- An intermediate value can be calculated
- the code book 203 is information stored in a storage means such as a ROM, and includes a plurality of scalar values selected as balance weighting factors of the L signal, weighting factors, and calculated values obtained from the weighting factors. The contents of the information stored in the code book 203 will be described later.
- the search unit 204 searches for an optimum one from a plurality of scalar values stored in the codebook 203, and encodes a balance weight coefficient by selecting a number corresponding to the optimum scalar value found by the search. To do. As a specific example, the search unit 204 searches for a number N that minimizes the cost function shown in Equation (21). Then, the search unit 204 outputs the selected number N as a code to the multiplexing unit 117. In addition, the search unit 204 outputs the code output to the multiplexing unit 117 to the decoding unit 205. In the present embodiment, the processing in decoding section 205 is the same as that in the first embodiment, and a description thereof will be omitted.
- the cost function is different from that in the first and second embodiments.
- the cost function of the expression (11) is used.
- the cost function of the expression (11) there is not much between the power of the signal L f and the power of the signal R f.
- FIG. 4 is a diagram showing a part of information stored in the code book 203 in the present embodiment.
- the size of the codebook 203 is 16 (4 bits).
- the calculated values w n 0 , w n 1 , and w n 2 necessary for the calculation of the equation (21) are obtained in advance by the following equation (24) and stored in the codebook 203.
- the intermediate value is obtained by the equation (20), the scalar value is efficiently obtained by the codebook 203 and the equation (21) designed by the above procedure, and the balance weight coefficient is calculated. Quantization is possible. As a result, in the case where there is a large difference between the values of the two terms on the L signal side and the R signal side constituting the cost function, the signal with the smaller value is generated because the term with the larger value becomes dominant. Deterioration can be avoided, and synthetic sound with better sound quality can be obtained comprehensively.
- the codebook has 16 types (4 bits).
- the present embodiment is not limited to this, and it is obvious that other sizes can be used. This is because the present invention does not depend on the size of the codebook.
- the present invention is not limited to this, and can also be applied to encoding of stereo signals without a core encoder. This is because the present invention efficiently encodes the balance weight coefficient using the fact that the M signal is obtained by downmixing, and therefore does not depend on the presence or absence of the core encoder.
- the quantizing device 109 may handle the decoded signal or the downmixed M signal. This is because the present invention efficiently encodes the balance weight coefficient using the fact that the M signal is obtained by downmixing, and therefore does not depend on the quality of the M signal used.
- the case where the sum of the balance weight coefficients of the L signal and the R signal is fixed to 2.0 is disclosed.
- the present invention is not limited to this, and the L signal
- the sum of the balance weight coefficients of the R signal and the R signal may have a value other than 2.0 such as 1.9 or 1.85 because the optimum value may differ depending on the nature of the M signal.
- a value slightly smaller than 2.0 is set. It may be possible to obtain good coding performance.
- the encoding performance is evaluated while changing the number of sums little by little, and the peak value is fixed as the sum of the balance weight coefficients of the L signal and the R signal and used for encoding. The method is mentioned.
- downmixing is performed after conversion to the frequency domain.
- the present invention is not limited to this, and a signal downmixed in the time domain is converted to the frequency domain.
- the effectiveness of the present invention is clear. This is because the present invention does not depend on a region where downmixing is performed.
- MDCT is used as a method for conversion to the frequency domain.
- the present invention is not limited to this, and similar to MDCT such as “DCT” or “FFT”. Any method may be used as long as it is a digital conversion method. This is because the present invention does not depend on the frequency conversion method.
- the three signals may be time domain signals, frequency domain signals, or partial sections thereof. This is because the present invention does not depend on the nature of the vector.
- the codes obtained in the first to third embodiments are transmitted when used for communication and stored in a recording medium (memory, disk, print code, etc.) when used for storage.
- a recording medium memory, disk, print code, etc.
- the present invention is not limited to this, and can be applied to the case of multi-channels such as 5.1 ch.
- the L signal, the R signal, and the M signal are encoded.
- the present invention is not limited to this, and the frequency spectrum obtained from the L signal, the R signal, and the M signal is not limited thereto.
- the partial sections may be encoded as the first signal, the second signal, and the third signal, respectively.
- the target M signal is subjected to balance adjustment before encoding.
- the present invention is not limited to this, and may be encoded before balance adjustment. That is, the encoder 115 may be present at a position closer to the input than the adding unit 108. This is because in the present invention, the balance adjustment of the target M signal does not depend on before and after encoding.
- the quantization device and the coding device according to the present invention can be mounted on a communication terminal device and a base station device in a mobile communication system, and thereby a communication terminal device having the same operational effects as described above, A base station apparatus and a mobile communication system can be provided.
- the present invention can also be realized by software.
- the function according to the present invention can be realized by describing the algorithm according to the present invention in a programming language, storing the program in a memory, and causing the information processing means to execute the same function as the encoding apparatus according to the present invention. it can.
- each functional block used in the description of the above embodiment is typically realized as an LSI which is an integrated circuit. These may be individually made into one chip, or may be made into one chip so as to include a part or all of them.
- LSI LSI
- IC system LSI
- super LSI ultra LSI
- the method of circuit integration is not limited to LSI, and implementation with a dedicated circuit or a general-purpose processor is also possible.
- An FPGA Field Programmable Gate Array
- a reconfigurable processor that can reconfigure the connection or setting of circuit cells inside the LSI may be used.
- the quantization apparatus, encoding apparatus, quantization method, and encoding method according to the present invention are suitable for encoding, for example, a stereo sound signal at a low bit rate.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
本実施の形態においては、パニング(以後「バランス調整」と呼ぶ)を利用して符号化及び復号を行う構成を、以下の構成を用いて説明する。すなわち、「ISO/IEC 14496-3:1999(E) “MPEG-2”, p.232, FIG.B.13」(以後、非特許文献3と呼ぶ)に示されている、ISO/IECにおけるMPEG-2及びMPEG-4の標準方式であるAAC(Advanced Audio Codec)として広く使用されている符号化器の構成の一部(FIG.B.13の左半分の構成からサイド信号を生成する部分を除いた構成)を用い、この構成の右側に特許文献1記載のインテンシティステレオの構成要素を加え、それぞれの信号の出力先に符号器を追加することにより、全体の情報を符号化して伝送する、という構成を全体構成として挙げる。
本実施の形態は、バランス調整を利用して符号化及び復号を行う際に、量子化装置において、上記の実施の形態1とは異なる計算を行うことを特徴とする。なお、本実施の形態において、符号化装置の構成は図1と同一構成であるので、その説明を省略する。また、本実施の形態において、量子化装置の構成は図2と同一構成である。以下の説明では、図1及び図2の符号を用いて説明する。
本実施の形態は、バランス調整を利用して符号化及び復号を行う際に、量子化装置において、上記の実施の形態1及び実施の形態2とは異なる計算を行うことを特徴とする。なお、本実施の形態において、符号化装置の構成は図1と同一構成であるので、その説明を省略する。また、本実施の形態において、量子化装置の構成は図2と同一構成である。以下の量子化装置の説明では、図1及び図2の符号を用いて説明する。
Claims (10)
- 第1の信号と第2の信号とをダウンミックスした結果を利用して得られた第3の信号の振幅のバランスを調整する2つの係数を量子化する量子化装置であって、
前記第1の信号、前記第2の信号、及び前記第3の信号の3つの信号を入力し、前記第1の信号と前記第3の信号との第1相関値及び前記第2の信号と前記第3の信号との第2相関値を算出するとともに、前記第3の信号の第1パワを算出するパワ・相関計算手段と、
前記第1パワを用いて第1中間値を算出するとともに、前記第1相関値及び前記第2相関値の少なくとも1つの相関値と前記第1パワとを用いて、第2中間値を算出する中間値計算手段と、
複数のスカラ値が格納された符号帳と、
前記第1中間値及び前記第2中間値に基づいて、前記第1の信号に対して前記第3の信号の振幅のバランスを調整するバランス重み係数を、前記符号帳に格納されている前記複数のスカラ値の中から探索して、求められたスカラ値に対応する符号を得る探索手段と、
を具備する量子化装置。 - 前記中間値計算手段は、
前記第2中間値を、前記第1パワ、前記第1相関値、及び前記第2相関値を用いて算出する、
請求項1記載の量子化装置。 - 前記探索手段は、
前記第1中間値と前記第2中間値とを用いて設定される式に、前記複数のスカラ値を適用して得られる値が最小となるスカラ値に対応する前記符号を求める、
請求項1記載の量子化装置。 - 前記パワ・相関計算手段は、
更に、前記第1の信号の第2パワと前記第2の信号の第3パワとをそれぞれ算出し、前記第2パワと前記第3パワとを用いて第4の信号を算出するとともに、前記第4の信号と予め設定された第1パワ調整係数とを用いて前記第1パワを調整した第1調整パワを算出し、前記第2パワと予め設定された第2パワ調整係数とを用いて前記第1相関値を調整した第1調整相関値、及び、前記第3パワと予め設定された第3パワ調整係数とを用いて前記第2相関値を調整した第2調整相関値を算出し、
前記中間値計算手段は、
前記第1パワの代わりに前記第1調整パワを用いて前記第1中間値を算出するとともに、前記第1相関値及び前記第2相関値の少なくとも1つの相関値の代わりに前記第1調整相関値及び前記第2調整相関値の少なくとも1つの調整相関値と、前記第1パワの代わりに前記第1調整パワと、を用いて前記第2中間値を算出する、
請求項1記載の量子化装置。 - 前記パワ・相関計算手段は、
更に、前記第1の信号の第2パワと前記第2の信号の第3パワとをそれぞれ算出し、
前記中間値計算手段は、
前記第2相関値と前記第1パワとを用いて前記第2中間値を算出するとともに、更に、前記第1相関値を用いて第3中間値を算出し、前記第2パワを用いて第4中間値を算出し、前記第1パワと前記第2相関値と前記第3パワとを用いて第5中間値を算出し、
前記探索手段は、
前記第1中間値、前記第2中間値、前記第3中間値、前記第4中間値、及び前記第5中間値に基づいて、前記バランス重み係数を探索し、対応する前記符号を得る、
請求項1記載の量子化装置。 - 前記符号帳は、
前記複数のスカラ値のそれぞれに対応付けられた、前記第1の信号に関する第1重み係数及び前記第2の信号に関する第2重み係数を更に格納し、
前記探索手段は、
前記第1中間値、前記第2中間値、前記第3中間値、前記第4中間値、及び前記第5中間値を用いて設定される式に、前記複数のスカラ値と、前記複数のスカラ値に対応する前記第1重み係数及び前記第2重み係数と、を適用して得られる値が最小となるスカラ値に対応する前記符号を得る、
請求項5記載の量子化装置。 - 前記パワ・相関計算手段は、
更に、前記第2パワと前記第3パワとを用いて第4の信号を算出するとともに、前記第4の信号と予め設定された第1パワ調整係数とを用いて前記第1パワを調整した第1調整パワを算出し、前記第2パワと予め設定された第2パワ調整係数とを用いて前記第1相関値を調整した第1調整相関値、及び、前記第3パワと予め設定された第3パワ調整係数とを用いて前記第2相関値を調整した第2調整相関値を算出し、
前記中間値計算手段は、
前記第1パワの代わりに前記第1調整パワを用いて前記第1中間値を算出するとともに、前記第2相関値及び前記第1パワの代わりに前記第2調整相関値及び前記第1調整パワを用いて前記第2中間値を算出し、前記第1相関値の代わりに前記第1調整相関値を用いて前記第3中間値を算出し、前記第1パワ及び前記第2相関値の代わりに前記第1調整パワと前記第2調整相関値とを用いて前記第5中間値を算出する、
請求項5記載の量子化装置。 - 第1の信号と第2の信号とを入力してダウンミックスした結果を用いて第3の信号を生成するダウンミックス手段と、
前記第1の信号と前記第2の信号と前記第3の信号とを入力し、前記第3の信号の振幅のバランスを調整する2つの係数に関する量子化を行って得られた符号を出力する量子化手段と、
前記第1の信号に対して前記第3の信号の振幅のバランスを調整する第1のバランス重み係数を、前記符号を用いて決定し、前記第2の信号に対して前記第3の信号の振幅のバランスを調整する第2のバランス重み係数を、前記第1のバランス重み係数を用いて算出する係数決定手段と、
前記第1の信号と前記第3の信号と前記第1のバランス重み係数とを用いて第1ターゲット信号を生成し、前記第1ターゲット信号を符号化するとともに、前記第2の信号と前記第3の信号と前記第2のバランス重み係数とを用いて第2ターゲット信号を生成し、前記第2ターゲット信号を符号化する符号化手段と、を具備し、
前記量子化手段は、
前記第1の信号と前記第3の信号との第1相関値及び前記第2の信号と前記第3の信号との第2相関値を算出するとともに、前記第3の信号の第1パワを算出するパワ・相関計算手段と、
前記第1パワを用いて第1中間値を算出するとともに、前記第1相関値及び前記第2相関値の少なくとも1つの相関値と前記第1パワとを用いて第2中間値を算出する中間値計算手段と、
複数のスカラ値が格納された符号帳と、
前記第1中間値及び前記第2中間値に基づいて、前記第1のバランス重み係数を、前記複数のスカラ値の中から探索して、求められたスカラ値に対応する前記符号を得る探索手段と、
を具備する符号化装置。 - 第1の信号と第2の信号とをダウンミックスした結果を利用して得られた第3の信号の振幅のバランスを調整する2つの係数を量子化する量子化方法であって、
前記第1の信号、前記第2の信号、及び前記第3の信号の3つの信号を入力し、前記第1の信号と前記第3の信号との第1相関値及び前記第2の信号と前記第3の信号との第2相関値を算出するとともに、前記第3の信号の第1パワを算出するパワ・相関計算ステップと、
前記第1パワを用いて第1中間値を算出するとともに、前記第1相関値及び前記第2相関値の少なくとも1つの相関値と前記第1パワとを用いて第2中間値を算出する中間値計算ステップと、
前記第1中間値及び前記第2中間値に基づいて、前記第1の信号に対して前記第3の信号の振幅のバランスを調整するバランス重み係数を、符号帳に格納されている複数のスカラ値の中から探索して、求められたスカラ値に対応する符号を得る探索ステップと、
を有する量子化方法。 - 第1の信号と第2の信号とを入力してダウンミックスした結果を用いて第3の信号を生成するダウンミックスステップと、
前記第1の信号と前記第2の信号と前記第3の信号とを入力し、前記第3の信号の振幅のバランスを調整する2つの係数に関する量子化を行って得られた符号を出力する量子化ステップと、
前記第1の信号に対して前記第3の信号の振幅のバランスを調整する第1のバランス重み係数を、前記符号を用いて決定し、前記第2の信号に対して前記第3の信号の振幅のバランスを調整する第2のバランス重み係数を、前記第1のバランス重み係数を用いて算出する係数決定ステップと、
前記第1の信号と前記第3の信号と前記第1のバランス重み係数とを用いて第1ターゲット信号を生成し、前記第1ターゲット信号を符号化するとともに、前記第2の信号と前記第3の信号と前記第2のバランス重み係数とを用いて第2ターゲット信号を生成し、前記第2ターゲット信号を符号化する符号化ステップと、を有し、
前記量子化ステップは、
前記第1の信号と前記第3の信号との第1相関値及び前記第2の信号と前記第3の信号との第2相関値を算出するとともに、前記第3の信号の第1パワを算出するパワ・相関計算ステップと、
前記第1パワを用いて第1中間値を算出するとともに、前記第1相関値及び前記第2相関値の少なくとも1つの相関値と前記第1パワとを用いて第2中間値を算出する中間値計算ステップと、
前記第1中間値及び前記第2中間値に基づいて、前記第1のバランス重み係数を、符号帳に格納されている複数のスカラ値の中から探索して、求められたスカラ値に対応する前記符号を得る探索ステップと、
を有する符号化方法。
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/057,162 US20110137661A1 (en) | 2008-08-08 | 2009-08-07 | Quantizing device, encoding device, quantizing method, and encoding method |
JP2010523771A JPWO2010016270A1 (ja) | 2008-08-08 | 2009-08-07 | 量子化装置、符号化装置、量子化方法及び符号化方法 |
Applications Claiming Priority (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2008-205643 | 2008-08-08 | ||
JP2008205643 | 2008-08-08 | ||
JP2009-059502 | 2009-03-12 | ||
JP2009059502 | 2009-03-12 | ||
JP2009095260 | 2009-04-09 | ||
JP2009-095260 | 2009-04-09 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2010016270A1 true WO2010016270A1 (ja) | 2010-02-11 |
Family
ID=41663497
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2009/003798 WO2010016270A1 (ja) | 2008-08-08 | 2009-08-07 | 量子化装置、符号化装置、量子化方法及び符号化方法 |
Country Status (3)
Country | Link |
---|---|
US (1) | US20110137661A1 (ja) |
JP (1) | JPWO2010016270A1 (ja) |
WO (1) | WO2010016270A1 (ja) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108427268A (zh) * | 2018-02-26 | 2018-08-21 | 河南理工大学 | 一种基于知识与数据信息决策的污水处理优化控制方法 |
WO2024142358A1 (ja) * | 2022-12-28 | 2024-07-04 | 日本電信電話株式会社 | 音信号処理装置、音信号処理方法、プログラム |
WO2024142357A1 (ja) * | 2022-12-28 | 2024-07-04 | 日本電信電話株式会社 | 音信号処理装置、音信号処理方法、プログラム |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR101419151B1 (ko) | 2009-10-20 | 2014-07-11 | 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. | 영역-의존 산술 코딩 매핑 규칙을 이용하는 오디오 인코더, 오디오 디코더, 오디오 정보를 인코딩하기 위한 방법, 오디오 정보를 디코딩하기 위한 방법 및 컴퓨터 프로그램 |
PT2524371T (pt) | 2010-01-12 | 2017-03-15 | Fraunhofer Ges Forschung | Codificador de áudio, descodificador de áudio, método de codificação de uma informação de áudio, método de descodificação de uma informação de áudio e programa de computador que utiliza uma tabela hash que descreve tanto valores de estado significativos como limites de intervalo |
EP2740222B1 (en) | 2011-08-04 | 2015-04-22 | Dolby International AB | Improved fm stereo radio receiver by using parametric stereo |
US9972325B2 (en) * | 2012-02-17 | 2018-05-15 | Huawei Technologies Co., Ltd. | System and method for mixed codebook excitation for speech coding |
WO2013135819A1 (en) * | 2012-03-14 | 2013-09-19 | Bang & Olufsen A/S | A method of applying a combined or hybrid sound -field control strategy |
CN113450846B (zh) * | 2020-03-27 | 2024-01-23 | 上海汽车集团股份有限公司 | 一种声压级标定方法及装置 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2004535145A (ja) * | 2001-07-10 | 2004-11-18 | コーディング テクノロジーズ アクチボラゲット | 低ビットレートオーディオ符号化用の効率的かつスケーラブルなパラメトリックステレオ符号化 |
WO2006070757A1 (ja) * | 2004-12-28 | 2006-07-06 | Matsushita Electric Industrial Co., Ltd. | 音声符号化装置および音声符号化方法 |
JP2007529021A (ja) * | 2003-12-19 | 2007-10-18 | テレフオンアクチーボラゲット エル エム エリクソン(パブル) | 忠実度最適化可変フレーム長符号化 |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB0025413D0 (en) * | 2000-10-17 | 2000-11-29 | Emp Technologies Ltd | Improvements in and relating to furnaces and methods of melting |
US7809579B2 (en) * | 2003-12-19 | 2010-10-05 | Telefonaktiebolaget Lm Ericsson (Publ) | Fidelity-optimized variable frame length encoding |
US8843378B2 (en) * | 2004-06-30 | 2014-09-23 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Multi-channel synthesizer and method for generating a multi-channel output signal |
WO2007114290A1 (ja) * | 2006-03-31 | 2007-10-11 | Matsushita Electric Industrial Co., Ltd. | ベクトル量子化装置、ベクトル逆量子化装置、ベクトル量子化方法及びベクトル逆量子化方法 |
ITMI20061360A1 (it) * | 2006-07-13 | 2008-01-14 | Valentino Fossati | Struttura di molleggio particolarmente per la realizzazione di materassi e simili |
WO2008013875A2 (en) * | 2006-07-26 | 2008-01-31 | Hercules Incorporated | Hydrophobically modified poly(ethylene glycol) for use in pitch and stickies control in pulp and papermaking processes |
US20100185442A1 (en) * | 2007-06-21 | 2010-07-22 | Panasonic Corporation | Adaptive sound source vector quantizing device and adaptive sound source vector quantizing method |
US8249883B2 (en) * | 2007-10-26 | 2012-08-21 | Microsoft Corporation | Channel extension coding for multi-channel source |
-
2009
- 2009-08-07 WO PCT/JP2009/003798 patent/WO2010016270A1/ja active Application Filing
- 2009-08-07 JP JP2010523771A patent/JPWO2010016270A1/ja active Pending
- 2009-08-07 US US13/057,162 patent/US20110137661A1/en not_active Abandoned
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2004535145A (ja) * | 2001-07-10 | 2004-11-18 | コーディング テクノロジーズ アクチボラゲット | 低ビットレートオーディオ符号化用の効率的かつスケーラブルなパラメトリックステレオ符号化 |
JP2007529021A (ja) * | 2003-12-19 | 2007-10-18 | テレフオンアクチーボラゲット エル エム エリクソン(パブル) | 忠実度最適化可変フレーム長符号化 |
WO2006070757A1 (ja) * | 2004-12-28 | 2006-07-06 | Matsushita Electric Industrial Co., Ltd. | 音声符号化装置および音声符号化方法 |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108427268A (zh) * | 2018-02-26 | 2018-08-21 | 河南理工大学 | 一种基于知识与数据信息决策的污水处理优化控制方法 |
CN108427268B (zh) * | 2018-02-26 | 2023-05-23 | 河南理工大学 | 一种基于知识与数据信息决策的污水处理优化控制方法 |
WO2024142358A1 (ja) * | 2022-12-28 | 2024-07-04 | 日本電信電話株式会社 | 音信号処理装置、音信号処理方法、プログラム |
WO2024142357A1 (ja) * | 2022-12-28 | 2024-07-04 | 日本電信電話株式会社 | 音信号処理装置、音信号処理方法、プログラム |
Also Published As
Publication number | Publication date |
---|---|
US20110137661A1 (en) | 2011-06-09 |
JPWO2010016270A1 (ja) | 2012-01-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
RU2764287C1 (ru) | Способ и система для кодирования левого и правого каналов стереофонического звукового сигнала с выбором между моделями двух и четырех подкадров в зависимости от битового бюджета | |
WO2010016270A1 (ja) | 量子化装置、符号化装置、量子化方法及び符号化方法 | |
JP5171256B2 (ja) | ステレオ符号化装置、ステレオ復号装置、及びステレオ符号化方法 | |
RU2439718C1 (ru) | Способ и устройство для обработки звукового сигнала | |
KR101391110B1 (ko) | 오디오 신호 디코더, 오디오 신호 인코더, 업믹스 신호 표현을 제공하는 방법, 다운믹스 신호 표현을 제공하는 방법, 공통 객체 간의 상관 파라미터 값을 이용한 컴퓨터 프로그램 및 비트스트림 | |
AU2016234987B2 (en) | Decoder and method for a generalized spatial-audio-object-coding parametric concept for multichannel downmix/upmix cases | |
EP2209114B1 (en) | Speech coding/decoding apparatus/method | |
JP5737077B2 (ja) | オーディオ符号化装置、オーディオ符号化方法及びオーディオ符号化用コンピュータプログラム | |
JP4963965B2 (ja) | スケーラブル符号化装置、スケーラブル復号装置、及びこれらの方法 | |
US8619999B2 (en) | Audio decoding method and apparatus | |
WO2012066727A1 (ja) | ステレオ信号符号化装置、ステレオ信号復号装置、ステレオ信号符号化方法及びステレオ信号復号方法 | |
CN108369810A (zh) | 用于对多声道音频信号进行编码的自适应声道缩减处理 | |
WO2010140350A1 (ja) | ダウンミックス装置、符号化装置、及びこれらの方法 | |
WO2006041055A1 (ja) | スケーラブル符号化装置、スケーラブル復号装置及びスケーラブル符号化方法 | |
CN106663432A (zh) | 对压缩的hoa表示解码的方法和装置以及对压缩的hoa表示编码的方法和装置 | |
US20050160126A1 (en) | Constrained filter encoding of polyphonic signals | |
JP2010139671A (ja) | オーディオ復号装置、方法、及びプログラム | |
EP1639580B1 (en) | Coding of multi-channel signals | |
JP2008026372A (ja) | 符号化データの符号化則変換方法および装置 | |
WO2023172865A1 (en) | Methods, apparatus and systems for directional audio coding-spatial reconstruction audio processing | |
KR20140037118A (ko) | 오디오 신호 처리방법, 오디오 부호화장치, 오디오 복호화장치, 및 이를 채용하는 단말기 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 09804757 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2010523771 Country of ref document: JP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 13057162 Country of ref document: US |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 09804757 Country of ref document: EP Kind code of ref document: A1 |