WO2015051533A1 - Improved advanced residual prediction in 3dvc - Google Patents
Improved advanced residual prediction in 3dvc Download PDFInfo
- Publication number
- WO2015051533A1 WO2015051533A1 PCT/CN2013/085049 CN2013085049W WO2015051533A1 WO 2015051533 A1 WO2015051533 A1 WO 2015051533A1 CN 2013085049 W CN2013085049 W CN 2013085049W WO 2015051533 A1 WO2015051533 A1 WO 2015051533A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- derived
- block
- candidate
- aligned
- candidates
- Prior art date
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/597—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding specially adapted for multi-view video sequence encoding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/146—Data rate or code amount at the encoder output
- H04N19/147—Data rate or code amount at the encoder output according to rate distortion criteria
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/46—Embedding additional information in the video signal during the compression process
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/513—Processing of motion vectors
- H04N19/517—Processing of motion vectors by encoding
- H04N19/52—Processing of motion vectors by encoding by predictive encoding
Definitions
- the invention relates generally to Three-Dimensional (3D) video processing.
- the presented invention relates to advanced residual prediction.
- ARP advanced residual prediction
- the base view is considered as a mirror of the current view.
- a mirror block is located with an estimated disparity vector (DV) and a mirror reference block for the mirror block is located using the same motion vector (MV) as the current block.
- DV disparity vector
- MV motion vector
- Pseudo residues are generated by subtracting the reconstructed samples in the mirror block and in the mirror reference block. Then pseudo residues are used to predict the real residues of the current block.
- Fig. 1 depicts a sketch of ARP.
- a coding unit CU can decide to use the pseudo residual prediction or not.
- the estimated DV is derived by a neighbouring block disparity vector (NBDV) procedure.
- NBDV neighbouring block disparity vector
- the first encountered DV from a temporal or spatial neighbouring block is utilized by ARP as the estimated DV for the current block. Since such an estimated DV is not accurate enough, pseudo residual prediction may be inefficient in some cases.
- Fig. 2 is a diagram illustrating the order to fill the DV candidate list
- FIG. 3 is a diagram illustrating the aligned temporal DV and the temporal DV
- Fig. 5 is a diagram illustrating the derivation of DMV
- Fig. 7 is a diagram illustrating exemplary reference blocks.
- ADVD adaptive disparity vector derivation
- three DV candidates are derived from temporal and spatial neighbouring blocks.
- the candidate list is filled according to the order depicted in Fig. 2.
- the aligned temporal DV is obtained from the aligned block, which is located by a scaled MV to the collocated picture, as depicted in Fig. 3.
- the aligned block can be obtained as follows.
- the left-top coordinate of the aligned block is equal to the left-top coordinate of a reference block inside or outside of the current block, added by the MV of the current block scaled to the collocated picture.
- Fig. 7 demonstrates exemplary reference blocks.
- a new DV candidate is appended into the list only if it is not equal to any DV candidate already in the list.
- An encoder can determine the best DV candidate used in ARP according to the RDO criterion, and signal the index to the decoder.
- An advanced temporal residual prediction (ATRP) method is proposed for the disparity compensated prediction.
- ATRP the disparity parameters of the current PU are applied to the corresponding block in a temporal reference picture in the same view to generate the reference residual in temporal direction.
- the corresponding block in the temporal reference picture is located by a derived motion vector (DMV), which is the motion vector (MV) of the reference block which the current DV is pointing to in the reference view.
- DMV derived motion vector
- the reference block can be obtained as follows.
- the left-top coordinate of the reference block is equal to the left-top coordinate of a reference block inside or outside of the current block, added by the DV of the current block.
- Fig. 7 demonstrates exemplary reference blocks.
- the concept of proposed ATRP is illustrated in Fig. 4, and the derivation of DMV is illustrated in Fig. 5.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
This contribution proposes to move syntax elements related to 3D-HEVC from slice segment header to slice segment header extension. In this way, slice segment header in 3D-HEVC can be compatible with that in HEVC.
Description
IMPROVED ADVANCED RESIDUAL PREDICTION IN
3DVC
TECHNICAL FIELD
[0001] The invention relates generally to Three-Dimensional (3D) video processing. In particular, the presented invention relates to advanced residual prediction.
BACKGROUND
[0002] In the current 3D-HEVC, advanced residual prediction (ARP) was adopted to replace the simple inter-view residual prediction for its better coding efficiency. In ARP, the base view is considered as a mirror of the current view. A mirror block is located with an estimated disparity vector (DV) and a mirror reference block for the mirror block is located using the same motion vector (MV) as the current block. Pseudo residues are generated by subtracting the reconstructed samples in the mirror block and in the mirror reference block. Then pseudo residues are used to predict the real residues of the current block. Fig. 1 depicts a sketch of ARP. In ARP, a coding unit (CU) can decide to use the pseudo residual prediction or not. A flag is transmitted to signal the on/off control including the weighting factor for the pseudo residual prediction (ARP). The weighting factor could be 0 (ARP disabled), 1 (ARP enabled) or ½ (The ARP is enabled but the pseudo residues will be divided by 2). For each CU, the encoder determines to enable or disable the ARP according to the rate- distortion optimization (RDO) criterion, and signals the control flag in the bit-stream.
[0003] It plays a critical role to locate the mirror block in the base-view accurately since pseudo residues are similar to the real residues only if they are generated by similar samples. In ARP, the estimated DV is derived by a neighbouring block disparity vector (NBDV) procedure. The first encountered DV from a temporal or spatial neighbouring block is utilized by ARP as the estimated DV for the current block. Since such an estimated DV is not accurate enough, pseudo residual prediction
may be inefficient in some cases.
SUMMARY
[0004] In light of the previously described problems, an adaptive disparity vector derivation (ADVD) method for advanced residual prediction (ARP) and an advanced temporal residual prediction (ATRP) method are proposed. In ADVD, three DV candidates can be chosen to provide more accurate pseudo residues. In ATRP, the concept of ARP is also applied to the disparity compensated prediction. In the combination of ADVD and ATRP, three MV candidates can be chosen in ATRP.
[0005] Other aspects and features of the invention will become apparent to those with ordinary skill in the art upon review of the following descriptions of specific embodiments.
BRIEF DESCRIPTION OF DRAWINGS
The invention can be more fully understood by reading the subsequent detailed description and examples with references made to the accompanying drawings, wherein:
[0006] Fig. 1 is a diagram illustrating a sketch of ARP, where PR represents pseudo residue;
[0007] Fig. 2 is a diagram illustrating the order to fill the DV candidate list;
[0008] Fig. 3 is a diagram illustrating the aligned temporal DV and the temporal DV;
[0009] Fig. 4 is a diagram illustrating the concept of the proposed ATRP;
[0010] Fig. 5 is a diagram illustrating the derivation of DMV;
[0011] Fig. 6 is a diagram illustrating the order to fill the MV candidate list in
ATRP;
[0012] Fig. 7 is a diagram illustrating exemplary reference blocks.
DETAILED DESCRIPTION
[0013] The following description is of the best-contemplated mode of carrying out
the invention. This description is made for the purpose of illustrating the general principles of the invention and should not be taken in a limiting sense. The scope of the invention is best determined by reference to the appended claims.
[0014] An adaptive disparity vector derivation (ADVD) method is proposed to improve the ARP coding efficiency. In ADVD, three DV candidates are derived from temporal and spatial neighbouring blocks. The candidate list is filled according to the order depicted in Fig. 2. The aligned temporal DV is obtained from the aligned block, which is located by a scaled MV to the collocated picture, as depicted in Fig. 3. The aligned block can be obtained as follows. The left-top coordinate of the aligned block is equal to the left-top coordinate of a reference block inside or outside of the current block, added by the MV of the current block scaled to the collocated picture. Fig. 7 demonstrates exemplary reference blocks. A new DV candidate is appended into the list only if it is not equal to any DV candidate already in the list. An encoder can determine the best DV candidate used in ARP according to the RDO criterion, and signal the index to the decoder.
[0015] An advanced temporal residual prediction (ATRP) method is proposed for the disparity compensated prediction. In ATRP, the disparity parameters of the current PU are applied to the corresponding block in a temporal reference picture in the same view to generate the reference residual in temporal direction. The corresponding block in the temporal reference picture is located by a derived motion vector (DMV), which is the motion vector (MV) of the reference block which the current DV is pointing to in the reference view. The reference block can be obtained as follows. The left-top coordinate of the reference block is equal to the left-top coordinate of a reference block inside or outside of the current block, added by the DV of the current block. Fig. 7 demonstrates exemplary reference blocks. The concept of proposed ATRP is illustrated in Fig. 4, and the derivation of DMV is illustrated in Fig. 5.
[0016] When ADVD and ATRP are combined, three MV candidates are derived for ATRP. The order to fill the three MV candidates is depicted in Fig. 6. The spatial MVs and temporal MVs are derived from the spatial and temporal merging candidate list and the first two available spatial and/or temporal MVs will be included into the MV candidate list after the DMV. An encoder can determine the best MV candidate used in ATRP according to RDO criterion, and signal the index to the decoder, similar to what is done in ADVD for ARP.
Claims
1. A method of generating a derived disparity vector (DV), wherein the derived DV of a current block is selected from multiple DV candidates.
2. The method as claimed in claim 1, wherein a predefined order is used to select and include multiple DV candidates in a DV candidate list.
3. The method as claimed in claim 2, the derived DV is selected from the DV candidate list.
4. The method as claimed in claim 2, wherein an encoder selects a best DV candidate from the DV candidate list as the derived DV for a current block according to a criterion.
5. The method as claimed in claim 4, a rate-distortion optimization criterion is applied to select the best DV candidate in a given DV candidate list.
6. The method as claimed in claim 2, wherein the derived DV is signaled explicitly to a decoder or derived implicitly by the decoder.
7. The method as claimed in claim 1, wherein the derived DV is used in procedures requiring a derived DV, including but not limited to advanced residual prediction (ARP), inter-view motion prediction, and inter-view motion/disparity merging scheme.
8. The method as claimed in claim 1, wherein an aligned temporal DV is used as a DV candidate, the aligned temporal DV is a DV of an aligned block in a collocated picture, and the aligned block is located using a motion vector (MV) of the current block scaled to a collocated picture.
9. The method as claimed in claim 8, wherein there are two collocated pictures at most, which are signaled to a decoder explicitly or derived by the decoder implicitly
at slice level.
10. The method as claimed in claim 8, wherein there are two collocated pictures, the first one is signaled to a decoder explicitly and the second one is derived implicitly.
11. The method as claimed in claim 8, wherein the aligned block is obtained as follows: a left-top coordinate of the aligned block is equal to a left-top coordinate of a reference block inside or outside of the current block, added by a MV of the current block scaled to the collocated picture.
12. The method as claimed in claim 8, wherein the aligned temporal DV is a DV in an aligned block for reference list 0 or for reference list 1.
13. The method as claimed in claim 8, wherein there is more than one aligned temporal DVs used as the DV candidates.
14. The method as claimed in claim 8, wherein N aligned temporal DVs are treated as first N DV candidates in the candidate list, and N is equal to or large than 1.
15. A method of generating a derived motion vector (MV), wherein the derived
MV of a current block is chosen from several MV candidates.
16. The method as claimed in claim 15, wherein a predefined order is used to select and include multiple MV candidates in a MV candidate list, and the derived MV is selected from the MV candidate list.
17. The method as claimed in claim 16, wherein an encoder selects a best MV candidate from the MV candidate list as the derived MV for the current block according to a criterion, and the criterion includes rate-distortion optimization.
18. The method as claimed in claim 15, wherein a best MV candidate is signaled explicitly to a decoder or derived implicitly by the decoder.
19. The method as claimed in claim 15, wherein the derived MV is used in
procedures requiring a derived MV, including but not limited to advanced temporal residual prediction (ATRP).
20. The method as claimed in claim 15, wherein the candidate MV comprises one or more MV in a reference block, for reference list 0 or for reference list 1 , and the reference block is located with the DV of the current block to an inter-view picture.
21. The method as claimed in claim 15, wherein N derived MVs are treated as first N MV candidates in a candidate list, and N is equal to or large than 1.
22. The method as claimed in claim 15, wherein the MV in a neighboring block of the current block is a candidate MV for the current block, the neighboring block is a temporal neighboring block or a spatial neighboring block, and a zero-MV=(0,0) is included as a MV candidate.
23. The method as claimed in claim 15, wherein the number of MV candidates denoted as N is greater than 1 , N is transmitted in a bit-stream by explicitly signaled in a sequence, view, picture, slice, CU or PU level, or N is implicitly derived.
24. The method as claimed in claim 23, wherein N is the same for all blocks in the same picture.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2013/085049 WO2015051533A1 (en) | 2013-10-11 | 2013-10-11 | Improved advanced residual prediction in 3dvc |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2013/085049 WO2015051533A1 (en) | 2013-10-11 | 2013-10-11 | Improved advanced residual prediction in 3dvc |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2015051533A1 true WO2015051533A1 (en) | 2015-04-16 |
Family
ID=52812453
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2013/085049 WO2015051533A1 (en) | 2013-10-11 | 2013-10-11 | Improved advanced residual prediction in 3dvc |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2015051533A1 (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101785317A (en) * | 2007-08-15 | 2010-07-21 | 汤姆逊许可证公司 | Methods and apparatus for motion skip mode in multi-view coded video using regional disparity vectors |
WO2013053309A1 (en) * | 2011-10-11 | 2013-04-18 | Mediatek Inc. | Method and apparatus of motion and disparity vector derivation for 3d video coding and hevc |
KR20130039429A (en) * | 2011-10-12 | 2013-04-22 | 주식회사 케이티 | Mvd bi-predictive temporal motion vector derivation |
CN103152563A (en) * | 2011-11-07 | 2013-06-12 | 吴秀美 | Apparatus of decoding video data |
-
2013
- 2013-10-11 WO PCT/CN2013/085049 patent/WO2015051533A1/en active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101785317A (en) * | 2007-08-15 | 2010-07-21 | 汤姆逊许可证公司 | Methods and apparatus for motion skip mode in multi-view coded video using regional disparity vectors |
WO2013053309A1 (en) * | 2011-10-11 | 2013-04-18 | Mediatek Inc. | Method and apparatus of motion and disparity vector derivation for 3d video coding and hevc |
KR20130039429A (en) * | 2011-10-12 | 2013-04-22 | 주식회사 케이티 | Mvd bi-predictive temporal motion vector derivation |
CN103152563A (en) * | 2011-11-07 | 2013-06-12 | 吴秀美 | Apparatus of decoding video data |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10230937B2 (en) | Method of deriving default disparity vector in 3D and multiview video coding | |
US20180115764A1 (en) | Method and apparatus of motion and disparity vector prediction and compensation for 3d video coding | |
US9621920B2 (en) | Method of three-dimensional and multiview video coding using a disparity vector | |
US10264281B2 (en) | Method and apparatus of inter-view candidate derivation in 3D video coding | |
CA2896905C (en) | Method and apparatus of view synthesis prediction in 3d video coding | |
WO2015006920A1 (en) | An adaptive disparity vector derivation method | |
JP2015514342A5 (en) | ||
US20150249838A1 (en) | Method and apparatus of virtual depth values in 3d video coding | |
KR101861497B1 (en) | Method and apparatus of camera parameter signaling in 3d video coding | |
WO2013159643A1 (en) | Method and apparatus of motion vector derivation for 3d video coding | |
EP2839664A1 (en) | Method and apparatus of inter-view sub-partition prediction in 3d video coding | |
AU2013284038A1 (en) | Method and apparatus of disparity vector derivation in 3D video coding | |
CA2896805A1 (en) | Method and apparatus of disparity vector derivation in 3d video coding | |
CA2909561C (en) | Method and apparatus for advanced temporal residual prediction in three-dimensional video coding | |
KR20130044189A (en) | Method for multi-view video encoding/decoding and apparatus thereof | |
WO2015051533A1 (en) | Improved advanced residual prediction in 3dvc | |
WO2015089721A1 (en) | Improved advanced residual prediction in 3dvc | |
US10075690B2 (en) | Method of motion information prediction and inheritance in multi-view and three-dimensional video coding | |
KR101763083B1 (en) | Method and apparatus for advanced temporal residual prediction in three-dimensional video coding | |
WO2014106327A1 (en) | Method and apparatus for inter-view residual prediction in multiview video coding | |
Jäger | Segment-wise prediction in 3D video coding | |
KR20150126261A (en) | A method and an apparatus for processing a multi-view video signal | |
NZ778416A (en) | Selection and signaling of motion vector (mv) precisions |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 13895378 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 13895378 Country of ref document: EP Kind code of ref document: A1 |