CN101583032B - Adaptive down-sampling and lapped transform-based image compression method - Google Patents

Adaptive down-sampling and lapped transform-based image compression method Download PDF

Info

Publication number
CN101583032B
CN101583032B CN 200910023085 CN200910023085A CN101583032B CN 101583032 B CN101583032 B CN 101583032B CN 200910023085 CN200910023085 CN 200910023085 CN 200910023085 A CN200910023085 A CN 200910023085A CN 101583032 B CN101583032 B CN 101583032B
Authority
CN
China
Prior art keywords
image
transform
low
block
transformation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN 200910023085
Other languages
Chinese (zh)
Other versions
CN101583032A (en
Inventor
吴家骥
焦李成
邢艳
石光明
张向荣
王爽
公茂果
马文萍
姜昆
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xidian University
Original Assignee
Xidian University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xidian University filed Critical Xidian University
Priority to CN 200910023085 priority Critical patent/CN101583032B/en
Publication of CN101583032A publication Critical patent/CN101583032A/en
Application granted granted Critical
Publication of CN101583032B publication Critical patent/CN101583032B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

The invention provides an adaptive down-sampling and lapped transform-based image compression method, and mainly solves the problems of low performance and high complexity existing in the prior down-sampling based compression method. The method comprises the following steps of: 1, adaptively down-sampling the prior image; 2, firstly performing lapped transform on the image after adaptive down-sampling between DCT blocks which are not subjected to the down-sampling, and then performing the DCT transformation on the whole image; 3, interweaving factors after transformation into a wavelet tree structure to obtain a low frequency subband DC and a high frequency subband AC; 4, performing the shape-adaptive DCT transformation on the low frequency subband DC, and interweaving the factors again; 5, by an object-oriented SPECK coding method, coding the factors with the wavelet tree structure to obtain a compressed bit stream; and 6, decompressing the bit stream transmitted to a decoding end to obtain a final re-constructed image. The method can obtain the performance higher than that of the prior image compression method under the condition of low bit rate, is low in complexity, and can be used for the low-bit-rate image coding which has strict requirements on complexity and real-time.

Description

Image compression method based on self-adaptive down-sampling and overlapping transformation
Technical Field
The invention belongs to the technical field of image processing, and particularly relates to an image compression method which can be used for realizing low-complexity and low-power-consumption image compression under a low code rate.
Background
Images, which are the most abundant information carriers, are one of the indispensable elements in the information age. In recent years, with the wide application of technologies such as video conferencing, video telephone, high definition television, remote monitoring and remote sensing imaging, images have become the main carrier of information communication in people's life, and high-resolution images are also required by different industries. With the development of imaging technology, many devices have been able to provide high-resolution digital images to meet people's requirements, however, the improvement of resolution makes the images contain a larger amount of information, which puts higher demands on image compression. It is therefore necessary to represent an image with as little data as possible while ensuring a certain image quality.
In recent years, Wuchang forest, Wufeng et al have tried to improve image compression at low code rate by interpolation and achieved good results in this respect. Their approach essentially follows a pattern: the whole image is downsampled at a coding end by utilizing filtering or lifting technology to obtain a pair of low-resolution images, and the low-resolution images are subjected to transform coding to obtain a compressed file; at the decoding end, the compressed file is decoded first, the low-resolution image is reconstructed, and then the resolution is improved by utilizing interpolation to obtain a decoded image. Although such methods have been able to improve coding performance at relatively low code rates, the following disadvantages remain: firstly, because the method performs down-sampling on the whole image regardless of a smooth region or an edge region, and in the edge region, the correlation between neighborhoods is difficult to estimate, it is difficult to up-sample a clear high-resolution image by using a general interpolation method, thereby affecting the quality of a decoded image; secondly, because the method uses an interpolation method with quite high complexity, the method is not suitable for being applied to some devices with strict requirements on real-time performance and complexity.
Disclosure of Invention
The invention aims to overcome the defects of the prior art and provides an image compression method based on adaptive down-sampling and lapped transform so as to improve the compression performance of images at low code rate, reduce the computational complexity and meet the requirements of a plurality of devices on instantaneity, low complexity and low power consumption.
In order to achieve the above purpose, the implementation steps of the invention comprise the following steps:
(1) DCT transformation is carried out on the original image, and the transformed image is pre-coded under the current code rate by using SPECK, so as to obtain a threshold value of a cut-off bit plane, which is marked as MT;
(2) the original image is divided into 32 × 32 blocks, and the following judgment is made for each block:
2a) performing DCT on the current block, performing 5/3 wavelet transform on the current block if the number of transform coefficients larger than the threshold MT does not exceed 1.6% of the total number of coefficients in the current block, otherwise, not performing the transform, and marking the current block as 0;
2b) performing a DCT transform on 5/3 the low frequency subbands of the wavelet transformed block, if the number of transform coefficients greater than the threshold MT still does not exceed 1.6% of the total number of coefficients in the low frequency subbands, marking the 32 x 32 block as 1, otherwise, marking as 0;
2c) regarding the block marked as 1 as a smooth block, performing 5/3 wavelet transformation on the smooth block, discarding high-frequency sub-bands, and reserving low-frequency sub-bands as the down-sampling result;
(3) firstly, performing overlap transform on the down-sampled image between DCT blocks, and then performing discrete cosine transform on the image after the overlap transform to obtain a transform coefficient;
(4) coefficient interweaving is carried out on the transformed coefficients to obtain a low-frequency sub-band DC and a high-frequency sub-band AC;
(5) performing shape adaptive discrete cosine transform on the low-frequency sub-band DC, performing coefficient interleaving on the transformed coefficient again, and combining the interleaved low-frequency sub-band DC and the high-frequency sub-band AC to obtain a final transform coefficient;
(6) carrying out object-oriented SPECK coding on the final transformation coefficient, and omitting unimportant information according to the required code rate to obtain a compressed bit stream;
(7) at the decoding end, decompression is performed according to the transmitted bit stream to obtain a final decoded image.
Compared with the prior art, the invention has the following advantages:
the invention adopts the self-adaptive down-sampling method, only down-samples the smooth area of the image, and solves the problem that the reconstruction quality of the traditional down-sampling-based image compression method to the edge area is not ideal. Meanwhile, because the invention carries out the crossover transform between DCT blocks, the invention can effectively eliminate the block effect caused by the DCT transform and obtain better subjective visual effect. In addition, the invention adopts an interpolation method with small complexity, thereby overcoming the defect of poor real-time performance of the traditional image compression method based on downsampling.
Drawings
FIG. 1 is a compression flow diagram of the present invention;
FIG. 2 is a decompression flow diagram of the present invention;
FIG. 3 is a diagram of the results of the operation of the present invention prior to encoding; wherein,
FIG. 3(a) is a diagram after adaptive downsampling,
figure 3(b) is a diagram after coefficient interleaving,
figure 3(c) is an enlarged view of the low frequency subband DC of figure 3(b),
FIG. 3(d) is the DC sub-band after SA-DCT transformation;
FIG. 4 is a schematic representation of the present invention utilizing 5/3 wavelet down-sampling;
FIG. 5 is a schematic diagram of coefficient interleaving according to the present invention; wherein,
figure 5(a) is a graph of the original coefficient distribution,
FIG. 5(b) is a coefficient distribution diagram after interleaving;
fig. 6 is a comparison of the subjective visual effect of the reconstructed image of the present invention and JPEG 2000.
Detailed Description
Referring to fig. 1, the image compression process of the present invention is as follows:
step 1, performing DCT (discrete cosine transformation) on an original image, and pre-coding the transformed image at the current code rate by using SPECK (discrete cosine transformation), so as to obtain a threshold of a cut-off bit plane, which is marked as MT;
step 2, dividing the original image into blocks with the size of 32 x 32, and then judging each block as follows:
(2a) performing DCT on the current block, performing 5/3 wavelet transform on the current block if the number of coefficients larger than the threshold MT does not exceed 1.6% of the total number of coefficients in the current block, otherwise, not performing the transform, and marking the current block as 0;
(2b) the 5/3 low-frequency sub-band of the wavelet-transformed block is then DCT-transformed, if the number of coefficients greater than the threshold MT does not exceed 1.6% of the total number of coefficients in the low-frequency sub-band, the 32 x 32 block is marked as 1, otherwise, it is marked as 0;
(2c) considering the block labeled 1 as a smooth block, 5/3 wavelet transforming the smooth block, discarding the high frequency sub-bands, and reserving the low frequency sub-bands as the result of downsampling, as shown in fig. 4, wherein, fig. 4(a) is the original image block,
fig. 4(b) is an 5/3 image block after wavelet transform, and fig. 4(c) is an image block after downsampling; the downsampling result chart shown in fig. 3(a) can be obtained by the steps (2a) to (2 c).
And 3, sequentially performing overlapping transformation and DCT transformation on the down-sampled graph.
The specific process is as follows:
(3a) prior to DCT transformation, an overlap transform is chosen to be performed at DCT block boundaries that are not down-sampled to remove inter-block correlation. The lapped transform matrix P is defined as:
P = 1 2 I J J - I I 0 0 V I J J - I - - - ( 1 )
wherein I is an identity matrix, J is an inverse identity matrix,
Figure G2009100230854D00042
v is the free control matrix and is,
V = J ( C M / 2 II ) T C M / 2 IV J - - - ( 3 )
CM/2 IIand CN/2 IVA second type of discrete cosine transform DCT-II and a fourth type of discrete cosine transform DCT-IV of 8 points, respectively.
(3b) The overlap-transformed image is subjected to a 16-point DCT transform.
And 4, performing coefficient interleaving on the transformed image to obtain a low-frequency subband DC and a high-frequency subband AC, as shown in fig. 3 (b).
Coefficient interleaving is to interleave the transform coefficients into a wavelet tree structure, which is composed of a low frequency subband DC and a high frequency subband AC. The specific interleaving process is as follows:
(4a) initializing a cycle time variable z to be 4, and a sub-band stage number k to be 1;
(4b) taking out the DC coefficient of each block and putting the DC coefficients together to form a low-frequency sub-band DC;
(4c) taking out the sub-system number corresponding to the DC coefficient of each block and placing the sub-system number in the corresponding position to form a high-frequency sub-band ACk
(4d) Let z be z-1, k be k + 1; taking out the sub-coefficient corresponding to the sub-coefficient of the previous stage and placing the sub-coefficient in the corresponding position to form a high-frequency sub-band ACk
(4e) Judging whether z is 0, if not, returning to (4d), if so, completing coefficient interleaving, and all high-frequency sub-bandsACkConstituting a subband high frequency AC.
The coefficient interleaving result shown in fig. 5(b) is obtained from steps (4a) - (4e), as can be seen from fig. 5(a), the original coefficients are distributed in block before the coefficient interleaving, and the DC coefficient distribution is not concentrated, wherein 1, 2, 3, 4, 5, 6, 7, 8, 9 represent DC coefficients, gray symbols represent the number of subsystems corresponding to the DC coefficients, and other symbols represent high-frequency coefficients, as can be seen from fig. 5(b), the interleaved coefficients are distributed in tree form, the DC coefficients are concentrated at the upper left corner to form a low-frequency subband DC, and the remaining high-frequency coefficients form a high-frequency subband AC.
And step 5, performing SA-DCT on the DC sub-band shown in the figure 3(c), performing coefficient interleaving on the obtained transformation coefficient again to obtain a DC sub-band after secondary transformation, and combining the DC sub-band with the AC sub-band to obtain a final transformation image. The result of the SA-DCT transformation is shown in fig. 3(d), and the specific process is as follows:
(5a) marking the down-sampled block in the adaptive down-sampling, and determining the position of the discarded coefficient in the DC sub-band according to the marking result and the principle of coefficient interleaving;
(5b) dividing the low-frequency sub-band into 16-by-16 blocks, and performing SA-DCT on each block: the retained coefficients of each block are arranged in rows, one-dimensional DCT transformation of corresponding points is carried out on each row, the coefficients are arranged close to the left row, one-dimensional DCT transformation is carried out on each row, and the DCT of the points is carried out when the coefficients are retained in each row, so that the SA-DCT transformation is completed.
And 6, carrying out OB-SPECK coding on the final transform coefficient, and specifically comprising the following steps:
(6a) coding and initializing the transformed coefficients to obtain initial information, i.e. according to the highest bit plane
Figure G2009100230854D00051
Obtaining initial bit plane, and making bit plane n equal to nmaxWherein
ci,jis a transform coefficient; then, the whole image is taken as a block and put into an unimportant block set LIS to obtain an initial LIS; then set the significant coefficient set
Figure G2009100230854D00052
(6b) Sorting the blocks in the LIS and sequentially testing the reordered sub-blocks, wherein the sub-blocks discarded due to downsampling are not tested, and if the transform coefficients satisfy 2n≤|ci,j|<2n+1If so, the coefficient is considered as an important coefficient, or the block where the transform coefficient is located is an important block;
(6c) and thinning the important coefficient set LSP, and circularly updating the bit plane to obtain a compressed bit stream file.
Referring to fig. 2, the image decompression process of the present invention is as follows:
and step A, carrying out the reverse process of the step 6 on the bit stream transmitted by the encoding end to obtain a decoded image.
B, performing coefficient reverse interleaving on the low-frequency sub-band DC of the decoded image, namely rearranging the coefficients distributed in a wavelet tree structure into block distribution, and performing SA-DCT inverse transformation on the low-frequency sub-band DC subjected to reverse interleaving to obtain a reconstructed low-frequency sub-band DC;
and step C, combining the reconstructed low-frequency sub-band DC and the high-frequency sub-band AC of the decoded image, and sequentially performing coefficient reverse interleaving, DCT (discrete cosine transformation) inverse transformation and overlapping inverse transformation on the combined image to obtain a reconstructed image after down sampling.
And D, restoring the downsampled area by utilizing Cubic interpolation to obtain a final reconstructed image.
The effects of the present invention can be further illustrated by the following specific experimental data.
1. Conditions and contents of the experiment
The experiment of the invention is to take an image with the size of 512 multiplied by 512 and the gray scale of 8 bits: the 4 images of Barbara, Lena, Goldhill and Baboon were encoded and decoded according to the above encoding and decoding steps, respectively, and compared with the peak signal to noise ratio PSNR performance in the quality progressing mode under the conditions of compression ratios bpp of 0.125bpp, 0.25bpp and 0.5bpp, respectively. Wherein peak signal to noise ratio PSNR performance of SPECK, SPIHT method and the method of the invention are compared without arithmetic coding; the peak signal-to-noise ratio PSNR performance of SPECK-AC, SPIHT-AC, JPEG2000, SPECK-AC with lapped transform and the inventive method with arithmetic coding are compared in the presence of arithmetic coding.
2. Visual effect comparison results
As shown in FIG. 6, the subjective visual effect of the reconstructed images obtained by the method of the present invention and JPEG2000 at a code rate of 0.125bpp for two images, Lena and Barbara, respectively, is compared. Wherein fig. 6(a) is the original image of Lena, fig. 6(b) is the Lena reconfiguration image under JPEG2000, fig. 6(c) is the Lena reconfiguration image under the present invention, fig. 6(d) is the original image of Barbara, fig. 6(e) is the barbarbarbara reconfiguration image under JPEG2000, and fig. 6(f) is the barbarbara reconfiguration image under the present invention, as can be seen from fig. 6, due to the effect of the overlap transform, the method of the present invention has better retention of the texture and edge information of the image, and the visual effect is significantly better than JPEG 2000.
3. The peak signal-to-noise ratio PSNR comparison results are shown in table 1.
TABLE 1 comparison of peak SNR PSNR performance for different methods
Figure G2009100230854D00071
As can be seen from Table 1, the peak signal-to-noise ratio PSNR obtained by the method of the present invention is significantly better than the existing SPECK and SPIHT methods in the absence of arithmetic coding, and in most cases the peak signal-to-noise ratio PSNR obtained by the method of the present invention is up to or even better than the SPECK-AC and SPIHT-AC methods using arithmetic coding, which is close to JPEG 2000. After arithmetic coding is added, the peak signal-to-noise ratio PSNR obtained by the method with arithmetic coding is still obviously superior to SPECK-AC, SPIHT-AC and JPEG 2000. Meanwhile, the arithmetic coding added in the method is the same arithmetic coding as the SPHIT and the SPECK, but not the high-order arithmetic coding adopted in the JPEG2000, so the method not only has higher peak signal-to-noise ratio (PSNR) performance than the SPHIT-AC, the SPIHT-AC and the JPEG2000, but also has low complexity, and is easy to realize by hardware. In addition, the method of the present invention also has significant advantages over SPECK-AC with lapped transform, which means that the adaptive down-sampling in the present invention plays a role.

Claims (3)

1. An image compression method based on adaptive down-sampling and lapped transform, comprising the steps of:
(1) DCT transformation is carried out on the original image, and the transformed image is pre-coded under the current code rate by using SPECK, so as to obtain a threshold value of a cut-off bit plane, which is marked as MT;
(2) dividing the original image into 32 × 32 blocks, and then judging each block as follows:
2a) performing DCT on the current block, performing 5/3 wavelet transform on the current block if the number of transform coefficients larger than the threshold MT does not exceed 1.6% of the total number of coefficients in the current block, otherwise, not performing 5/3 wavelet transform and marking the current block as 0;
2b) performing a DCT transform on 5/3 the low frequency subbands of the wavelet transformed block, if the number of transform coefficients greater than the threshold MT still does not exceed 1.6% of the total number of coefficients in the low frequency subbands, marking the 32 x 32 block as 1, otherwise, marking as 0;
2c) regarding the block marked as 1 as a smooth block, performing 5/3 wavelet transformation on the smooth block, discarding high-frequency sub-bands, and reserving low-frequency sub-bands as the down-sampling result;
(3) and sequentially performing the following overlapped transformation and discrete cosine transformation on the down-sampled image to obtain a transformation coefficient:
(3a) selecting, prior to the DCT transform, an lapped transform at DCT block boundaries that have not been downsampled to remove inter-block correlation;
(3b) performing 16-point DCT on the image after the overlapped transformation;
(4) coefficient interweaving is carried out on the transformed coefficients to obtain a low-frequency sub-band DC and a high-frequency sub-band AC;
(5) performing shape adaptive discrete cosine transform on the low-frequency sub-band DC, performing coefficient interleaving on the transformed coefficient again, and combining the interleaved low-frequency sub-band DC and the high-frequency sub-band AC to obtain a final transform coefficient;
(6) carrying out object-oriented SPECK coding on the final transformation coefficient, and omitting unimportant information according to the required code rate to obtain a compressed bit stream;
(7) at the decoding end, decompression is performed according to the transmitted bit stream to obtain a final decoded image.
2. The image compression method as claimed in claim 1, wherein the coefficient interleaving of step (4) is to interleave the transform coefficients into a wavelet tree structure, which is composed of a low frequency subband DC and a high frequency subband AC.
3. The image compression method as claimed in claim 1, wherein the step (6) of performing object-oriented SPECK coding on the final transform coefficients means that only the coefficients remaining after downsampling are coded.
CN 200910023085 2009-06-26 2009-06-26 Adaptive down-sampling and lapped transform-based image compression method Expired - Fee Related CN101583032B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 200910023085 CN101583032B (en) 2009-06-26 2009-06-26 Adaptive down-sampling and lapped transform-based image compression method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 200910023085 CN101583032B (en) 2009-06-26 2009-06-26 Adaptive down-sampling and lapped transform-based image compression method

Publications (2)

Publication Number Publication Date
CN101583032A CN101583032A (en) 2009-11-18
CN101583032B true CN101583032B (en) 2011-04-27

Family

ID=41364946

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 200910023085 Expired - Fee Related CN101583032B (en) 2009-06-26 2009-06-26 Adaptive down-sampling and lapped transform-based image compression method

Country Status (1)

Country Link
CN (1) CN101583032B (en)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101840577B (en) * 2010-06-11 2012-07-25 西安电子科技大学 Image automatic segmentation method based on graph cut
KR20120035096A (en) * 2010-10-04 2012-04-13 한국전자통신연구원 A method and apparatus of side information signaling for quadtree transform
CN103347185B (en) * 2013-06-28 2016-08-10 北京航空航天大学 The comprehensive compaction coding method of unmanned plane reconnaissance image based on the conversion of selectivity block
CN103517082B (en) * 2013-10-17 2017-03-29 广东威创视讯科技股份有限公司 The conversion method and system of jpeg image and wavelet compressed images
CN103618910B (en) * 2013-12-02 2017-01-18 广东威创视讯科技股份有限公司 Image conversion method and device
US9311342B1 (en) * 2015-02-09 2016-04-12 Sony Corporation Tree based image storage system
CN104867165B (en) * 2015-05-19 2017-10-17 电子科技大学 A kind of method for compressing image based on transform domain down-sampling technology
CN104935928B (en) * 2015-06-01 2017-10-17 电子科技大学 A kind of efficient image compression method based on spatial domain down-sampling pattern
CN109874020B (en) * 2019-01-17 2021-03-30 辽宁师范大学 Inseparable lifting wavelet transformation method with gradable quality and complexity
CN113554722B (en) * 2021-07-22 2023-08-15 辽宁科技大学 Image compression method for crown word number of RMB paper currency based on improved EZW

Also Published As

Publication number Publication date
CN101583032A (en) 2009-11-18

Similar Documents

Publication Publication Date Title
CN101583032B (en) Adaptive down-sampling and lapped transform-based image compression method
Li et al. Shape-adaptive discrete wavelet transforms for arbitrarily shaped visual object coding
EP1110180A1 (en) Embedded quadtree wavelets in image compression
EP2406958A2 (en) Adaptive entropy coding for images and videos using set partitioning in generalized hierarchical trees
Pan et al. A fast and low memory image coding algorithm based on lifting wavelet transform and modified SPIHT
Kumar et al. A review: DWT-DCT technique and arithmetic-Huffman coding based image compression
CN109361919A (en) A kind of image coding efficiency method for improving combined super-resolution and remove pinch effect
Li et al. Improvements of embedded zerotree wavelet (EZW) coding
CN110097490A (en) The insertion of area-of-interest reversible water mark and extracting method based on IWT-Schur
CN115150628B (en) Method for coding coarse-to-fine depth video with super-priori guided mode prediction
IL288888B1 (en) Encoder and method of encoding a sequence of frames
Zhang et al. Image coding on quincunx lattice with adaptive lifting and interpolation
CN104935928B (en) A kind of efficient image compression method based on spatial domain down-sampling pattern
CN100450181C (en) An embedded image compression technique based on wavelet transformation
Malviya et al. 2D-discrete walsh wavelet transform for image compression with arithmetic coding
CN100426866C (en) Full-phase Walsh biorthogonal conversion and its improved process for JPEG
CN103402090A (en) Improving method based on Huffman lossless compression algorithm
Abdul-Wahed et al. Compression of image using multi-wavelet techniques
Chai et al. Statistically adaptive wavelet image coding
RU2429541C2 (en) Method of encoding digital images using discrete wavelet transformation of adaptively defined basis
Bai et al. Lifting-based directionlet transform for image coding
CN101431678A (en) Image processing equipment and image processing method
CN106530195A (en) Progressive watermarking method adapting to JPEG2000 scalable image coding
Taujuddin et al. Consolidating Literature for Images Compression and Its Techniques
Lu et al. A wavelet-based multiple-description coding combining pairwise correlating transform with quincunx sub-sampling

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20110427

Termination date: 20170626