KR20130078569A - Region of interest based screen contents quality improving video encoding/decoding method and apparatus thereof - Google Patents
Region of interest based screen contents quality improving video encoding/decoding method and apparatus thereof Download PDFInfo
- Publication number
- KR20130078569A KR20130078569A KR1020110147588A KR20110147588A KR20130078569A KR 20130078569 A KR20130078569 A KR 20130078569A KR 1020110147588 A KR1020110147588 A KR 1020110147588A KR 20110147588 A KR20110147588 A KR 20110147588A KR 20130078569 A KR20130078569 A KR 20130078569A
- Authority
- KR
- South Korea
- Prior art keywords
- region
- interest
- image
- screen content
- roi
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 54
- 238000000605 extraction Methods 0.000 claims abstract description 5
- 238000013139 quantization Methods 0.000 claims description 26
- 230000033001 locomotion Effects 0.000 claims description 21
- 230000003044 adaptive effect Effects 0.000 claims description 12
- 239000013598 vector Substances 0.000 claims description 10
- 238000007906 compression Methods 0.000 claims description 8
- 230000006835 compression Effects 0.000 claims description 7
- 239000000284 extract Substances 0.000 abstract description 3
- 238000010586 diagram Methods 0.000 description 15
- 238000001914 filtration Methods 0.000 description 2
- 238000012805 post-processing Methods 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 239000003086 colorant Substances 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000006866 deterioration Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T9/00—Image coding
- G06T9/004—Predictors, e.g. intraframe, interframe coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/105—Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/167—Position within a video image, e.g. region of interest [ROI]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/59—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial sub-sampling or interpolation, e.g. alteration of picture size or resolution
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/80—Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
- H04N19/82—Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation involving filtering within a prediction loop
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/234—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
Description
The present invention proposes a method and apparatus for improving the image quality of a corresponding region by discriminating a main region of interest of the screen content video.
The screen content video may be an artificially produced image, or may exist in a mixed form with a general natural image. Unlike general natural video, screen content video has different characteristics from natural video such as limited color difference signal, relatively low noise and high chroma. Due to these characteristics, the ROI of the screen content video may be different from the ROI extraction result for the natural image. Therefore, to improve the image quality based on the region of interest of screen content video, it is necessary to consider the characteristics different from the natural image. Such screen content video compression is also considered in High Efficiency Video Coding (HEVC), which is currently being standardized.
Existing video encoding / decoding techniques have been developed for compression of natural video, which is generally acquired through a camera. However, Joint Collaborative Team on Video Coding (JCT-VC), jointly established by Moving Picture Experts Group (MPEG) from ISO / IEC and Video Coding Experts Group (VCEG) from ITU-T, is the next generation of video that is being standardized. In HEVC, the compression standard, CG (Computer Generated) image, mixed image of natural image and CG image, etc. are used as standard experimental image in order to consider high efficiency compression technology of screen content video. This may be considered to apply the next generation video codec to the field of screen contents such as animation, game, etc., and to apply video compression technology to images of various characteristics including natural video. Accordingly, according to the technical trend, the present invention proposes a method and apparatus for analyzing a region of interest based on characteristics of screen content video and for improving image quality of the region of interest.
The present invention proposes a method and apparatus for efficient compression of screen content video in an existing codec for efficiently compressing general natural video.
Screen content video has characteristics such as sharp edge change as in the text area, no noise in a specific area, and monotonous increase in pixel value. Therefore, blurring is performed at the edge of the edge by the conventional encoding / decoding method. blur) Noise may occur. In addition, a step phenomenon may occur in a monotonically increasing region or a ringing phenomenon may occur at a boundary of the region. In the case of an image mixed with a natural image and a screen content image, a method of minimizing noise in the screen content region may be applied by detecting the screen content region. However, the existing method of extracting the region of interest is based on characteristics such as contrast of the natural image and geometrical shape of the edge, and thus it is difficult to divide the screen content region into regions of interest. Accordingly, the present invention proposes a method of determining the screen content area and improving the image quality in consideration of the characteristics of the screen content for the screen content area.
In the present invention, the proposed method compresses the screen content video to determine the region of interest in consideration of the characteristics of the screen content, and allocates more bits to the major region of interest in consideration of the rate-distortion optimization aspect. By reducing the amount of information allocated to, it can be used to improve the subjective picture quality of screen content video while maintaining the amount of information similar to the existing compression process. In the proposed method, the main region of interest may be determined by referring to the input screen content video. In the encoding of the determined main ROI, the image quality of the main ROI of the screen content video may be improved by adaptively changing the encoding parameter of the main ROI, and may be decoded without transmitting additional information. In addition, by transmitting additional information on the ROI of the input screen content video and referencing it in the decoding process, a reconstructed image identical to the encoding process may be generated.
1 is a block diagram of the highest level of a decoding apparatus according to the present invention.
2 is a block diagram of the highest level of the encoding apparatus according to the present invention.
3 is a simplified block diagram of a decoding apparatus according to the present invention.
4 is a simplified block diagram of an encoder according to the present invention.
5 is a simplified block diagram of the ROI extractor according to the present invention.
6 is a simplified block diagram of a method and apparatus for performing intra prediction in consideration of characteristics of an input image.
7 is a schematic block diagram and an adaptive interpolation method for an inter-screen prediction method and apparatus based on a region of interest considering characteristics of screen content.
FIG. 8 is an embodiment of a quantization parameter table considering an interest map of a block unit extracted in consideration of characteristics of a screen content video with respect to an input image and a corresponding interest map.
9 is a diagram illustrating an example of a reference region and a filtering method of an in-loop post-processing filter considering characteristics of a region of interest and screen content video.
The present invention improves the quality of the main region of interest by extracting the main region of interest of the screen content video or the mixed image in which the screen content and the natural image exist together, and adjusting the bit amount of the region of interest and the other region. A method and apparatus for improving the overall subjective picture quality of input screen content by maintaining the overall bit amount similar to the existing method. The proposed method, in decoding the compressed bitstream, refers to a parsing module of main region of interest information and a parameter controller for adaptively adjusting parameters by referring to region of interest information generated through the module, and to a decoded screen content video slice. It may include an in-loop filter that corrects an error close to the original image by filtering the region of interest. In the following proposed method and apparatus and all processes describing the same, 'screen content' means that a part of the image is a screen content or the whole is composed of screen content.
The above objects, features and methods will become more apparent from the following detailed description taken in conjunction with the accompanying drawings, whereby those skilled in the art to which the present invention pertains may easily implement the technical idea of the present invention. Could be. Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings.
1 is an embodiment of the present invention.
A block diagram of the highest level of a decoding apparatus according to the present invention. In the proposed method, in the decoding of the
2 is an embodiment of the present invention.
It is a block diagram of the highest stage of the encoding apparatus which concerns on this invention. In the proposed method, in encoding the
3 is an embodiment of the present invention.
Brief block diagram of a decoding apparatus according to the present invention. Referring to the simple decoding step, in decoding the compressed bitstream of the input screen content video, the ROI information generated in the encoding process may be decoded by the
4 is an embodiment of the present invention.
Brief block diagram of an encoder according to the present invention. According to the present invention, the ROI information is extracted from the
5 is an embodiment of the present invention.
A simplified block diagram of a region of interest extractor 500 in accordance with the present invention. In extracting a region of interest with respect to the
Also, unlike still images, video may affect a region of interest by movement. Accordingly, the
6 is an embodiment of the present invention.
A simplified block diagram of a method and a device for performing intra prediction in consideration of characteristics of an input image. By using the proposed method and apparatus, an intra prediction may be performed on the
7 is an embodiment of the present invention.
A brief block diagram of a method and apparatus for inter-screen prediction based on a region of interest considering characteristics of screen content. In performing inter-screen prediction based on the ROI in consideration of characteristics of screen content, adaptive reference image interpolation may be applied according to the
8 is an embodiment of the present invention.
An
9 is an embodiment of the present invention.
An in-loop post-processing filter considering the region of interest and screen content video may be applied. Applying a filter to improve the image quality of the screen content video decoded with reference to the
Claims (6)
Region of interest extraction unit for extracting characteristics and interests of the screen content with reference to the input image, Compressor for adaptively adjusting parameters and codewords during image compression using extracted region of interest information, extracted region of interest information And an optional transmitter of the RO, a decoder of the ROI information, and the module.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020110147588A KR20130078569A (en) | 2011-12-30 | 2011-12-30 | Region of interest based screen contents quality improving video encoding/decoding method and apparatus thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020110147588A KR20130078569A (en) | 2011-12-30 | 2011-12-30 | Region of interest based screen contents quality improving video encoding/decoding method and apparatus thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20130078569A true KR20130078569A (en) | 2013-07-10 |
Family
ID=48991488
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020110147588A KR20130078569A (en) | 2011-12-30 | 2011-12-30 | Region of interest based screen contents quality improving video encoding/decoding method and apparatus thereof |
Country Status (1)
Country | Link |
---|---|
KR (1) | KR20130078569A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10419762B2 (en) | 2015-03-02 | 2019-09-17 | Dolby Laboratories Licensing Corporation | Content-adaptive perceptual quantizer for high dynamic range images |
EP3468186A4 (en) * | 2016-06-08 | 2019-09-18 | Sony Corporation | Image processing device and method |
CN116886923A (en) * | 2023-06-19 | 2023-10-13 | 广州开得联软件技术有限公司 | Classroom video coding method, device, storage medium and equipment |
-
2011
- 2011-12-30 KR KR1020110147588A patent/KR20130078569A/en not_active Application Discontinuation
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10419762B2 (en) | 2015-03-02 | 2019-09-17 | Dolby Laboratories Licensing Corporation | Content-adaptive perceptual quantizer for high dynamic range images |
EP3468186A4 (en) * | 2016-06-08 | 2019-09-18 | Sony Corporation | Image processing device and method |
US10893269B2 (en) | 2016-06-08 | 2021-01-12 | Sony Corporation | Image processing device and method |
CN116886923A (en) * | 2023-06-19 | 2023-10-13 | 广州开得联软件技术有限公司 | Classroom video coding method, device, storage medium and equipment |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN114071165B (en) | Video encoder, video decoder and corresponding methods | |
KR101752401B1 (en) | Sample adaptive offset control | |
JP7521057B2 (en) | Corresponding methods of boundary strength derivation for the encoder, decoder, and deblocking filter | |
CN113545064A (en) | Method and system for processing video content | |
US11146829B2 (en) | Quantization parameter signaling in video processing | |
JP7343669B2 (en) | Method and apparatus for color conversion in VVC | |
CN115665408B (en) | Filtering method and apparatus for cross-component linear model prediction | |
KR20210044765A (en) | Method and Apparatus for image encoding | |
CN113785573A (en) | Encoder, decoder and corresponding methods using an adaptive loop filter | |
CN110650337B (en) | Image encoding method, decoding method, encoder, decoder and storage medium | |
CN113170202B (en) | Encoder, decoder and corresponding methods for constructing MPM list of block applying multi-hypothesis prediction | |
KR20130098122A (en) | Device and method for encoding/decoding | |
CN118509609A (en) | Method and device for decoding code stream, coding method and device and equipment for transmitting code stream | |
CN113330743A (en) | Encoder, decoder and corresponding method for deblocking filter adaptation | |
CN116233470A (en) | Encoder, decoder and corresponding methods for indicating high level syntax | |
KR20210075201A (en) | Method and apparatus for intra prediction | |
KR101646072B1 (en) | Encryption apparatus and method for moving picture data | |
KR20130078569A (en) | Region of interest based screen contents quality improving video encoding/decoding method and apparatus thereof | |
CN115665407B (en) | Inter-component linear modeling method and device for intra-frame prediction | |
CN114679583B (en) | Video encoder, video decoder and corresponding methods | |
CN114902670A (en) | Method and apparatus for signaling sub-picture division information | |
KR20220065880A (en) | Use of DCT-based interpolation filters and enhanced bilinear interpolation filters in affine motion compensation | |
KR20160125704A (en) | Apparatus and method for processing hybrid moving picture | |
US20240283927A1 (en) | Adaptive in-loop filtering in video encoding | |
EP4409911A1 (en) | Video coding with selectable neural-network-based coding tools |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WITN | Withdrawal due to no request for examination |