KR-102963504-B1 - Inter-intra prediction mode for video data
Abstract
An exemplary device for coding (encoding or decoding) video data comprises one or more processors implemented in a memory and circuit portion for storing video data, wherein the one or more processors form an inter-prediction block for a current chrominance block of video data; form an intra-prediction block for a current chrominance block of video data; determine the number of neighbor blocks for a luminance block corresponding to a current chrominance block that is intra-prediction coded; determine a first weight and a second weight according to the number of neighbor blocks that are intra-prediction coded; apply the first weight to the inter-prediction block and apply the second weight to the intra-prediction block; combine the first weighted inter-prediction block and the second weighted intra-prediction block to form a prediction block for a current chrominance block; and are configured to code the current chrominance block using the prediction block.
Inventors
- 팜 반 루옹
- 판 더 아우베라 게르트
- 라마수브라모니안 아다르쉬 크리쉬난
- 카르체비츠 마르타
Assignees
- 퀄컴 인코포레이티드
Dates
- Publication Date
- 20260511
- Application Date
- 20200205
- Priority Date
- 20200204
Claims (20)
- As a method for decoding video data, A step of forming an inter-prediction block for the current block of video data; A step of forming an intra-prediction block for the current block of the above video data; A step of determining whether a first neighbor block for the current block is intra-predicted coded, wherein the step of determining whether the first neighbor block is intra-predicted coded includes a step of determining whether the first neighbor block is coded according to a first intra-predicted mode excluding intra-block copy or combined intra-inter-predicted; A step of determining whether a second neighbor block for the current block is intra-predicted coded, wherein the step of determining whether the second neighbor block is intra-predicted coded includes a step of determining whether the second neighbor block is coded using a second intra-predicted mode excluding intra-block copy or combined intra-inter-predicted; A step of determining a first weight and a second weight according to the number of the first and second neighbor blocks determined to be intra-predicted coded; A step of applying the first weight to the inter prediction block and applying the second weight to the intra prediction block; A step of combining a first weighted inter-prediction block and a second weighted intra-prediction block to form a prediction block for the current block; and A method for decoding video data, comprising the step of decoding the current block using the prediction block.
- In Article 1, A method for decoding video data, wherein the first and second neighbor blocks include an upper neighbor block and a left neighbor block.
- In Article 1, A method for decoding video data, further comprising the step of decoding data of a bitstream representing the first weight and the second weight.
- In Paragraph 3, A method for decoding video data, wherein the data of the bitstream representing the first weight and the second weight comprises at least one of a sequence parameter set, a picture parameter set, an adaptation parameter set, a video parameter set, a picture header, a slice header, a tile header, or a block header.
- In Article 1, The step of decoding the current block above is, A step of decoding a residual block for the above current block; and The step of combining the samples of the above residual block with the samples of the above prediction block A method for decoding video data including
- In Article 1, A method for decoding video data, the above current block being a luminance block.
- In Article 1, The step of determining the first weight and the second weight is, A method for decoding video data, comprising the step of selecting a second weight higher than the first weight when it is determined that neither of the first and second neighbor blocks is intra-predicted coded.
- In Article 7, The step of selecting the second weight to be higher than the first weight when it is determined that neither of the first and second neighbor blocks is intra-predicted coded is, A step of selecting the second weight as 3 and selecting the first weight as 1, or A method for decoding video data, comprising the step of selecting the second weight as 5 and selecting the first weight as 3.
- As a method of encoding video data, A step of forming an inter-prediction block for the current block of video data; A step of forming an intra-prediction block for the current block of the above video data; A step of determining whether a first neighbor block for the current block is intra-predicted coded, wherein the step of determining whether the first neighbor block is intra-predicted coded includes a step of determining whether the first neighbor block is coded according to a first intra-predicted mode excluding intra-block copy or combined intra-inter-predicted; A step of determining whether a second neighbor block for the current block is intra-predicted coded, wherein the step of determining whether the second neighbor block is intra-predicted coded includes a step of determining whether the second neighbor block is coded using a second intra-predicted mode excluding intra-block copy or combined intra-inter-predicted; A step of determining a first weight and a second weight according to the number of the first and second neighbor blocks determined to be intra-predicted coded; A step of applying the first weight to the inter prediction block and applying the second weight to the intra prediction block; A step of combining a first weighted inter-prediction block and a second weighted intra-prediction block to form a prediction block for the current block; and A method for encoding video data, comprising the step of encoding the current block using the prediction block.
- In Article 9, The method further includes the step of encoding data of a bitstream representing the first weight and the second weight, and A method for encoding video data, wherein the data of the bitstream representing the first weight and the second weight comprises at least one of a sequence parameter set, a picture parameter set, an adaptation parameter set, a video parameter set, a picture header, a slice header, a tile header, or a block header.
- As a device for decoding video data, Memory for storing video data; and It includes one or more processors implemented in the circuit, and The above one or more processors are, Forming an inter-prediction block for the current block of the above video data; Forming an intra-prediction block for the current block of the above video data; Determining whether a first neighbor block for the current block is intra-predicted coded, wherein determining whether the first neighbor block is intra-predicted coded includes determining whether the first neighbor block is coded according to a first intra-predicted mode excluding intra-block copy or combined intra-inter-predicted; and determining whether the first neighbor block is intra-predicted coded; Determining whether a second neighbor block for the current block is intra-predicted coded, wherein determining whether the second neighbor block is intra-predicted coded includes determining whether the second neighbor block is coded using a second intra-predicted mode excluding an intra-block copy or combined intra-inter-predicted; and determining whether the second neighbor block is intra-predicted coded; Determining a first weight and a second weight according to the number of the first and second neighbor blocks determined to be intra-predicted coded; Applying the first weight to the inter prediction block and applying the second weight to the intra prediction block; A first weighted inter-prediction block and a second weighted intra-prediction block are combined to form a prediction block for the current block; and To decode the current block using the above prediction block A device for decoding video data configured.
- In Article 11, A device for decoding video data, wherein the one or more processors are further configured to decode data of a bitstream representing the first weight and the second weight.
- In Article 12, A device for decoding video data, wherein the data of the bitstream representing the first weight and the second weight comprises at least one of a sequence parameter set, a picture parameter set, an adaptive parameter set, a video parameter set, a picture header, a slice header, a tile header, or a block header.
- In Article 11, The above current block is a luminance block, a device for decoding video data.
- As a computer-readable storage medium storing instructions, When the above commands are executed by a processor, the processor, A computer-readable storage medium that enables the execution of a method according to any one of claims 1 to 10.
- As a device for decoding video data, Means for forming an inter-prediction block for a current block of video data; Means for forming an intra-prediction block for the current block of the above video data; Means for determining whether a first neighbor block for the current block is intra-predicted coded, wherein determining whether the first neighbor block is intra-predicted coded includes determining whether the first neighbor block is coded according to a first intra-predicted mode excluding intra-block copy or combined intra-inter-predicted; Means for determining whether a second neighbor block for the current block is intra-predicted coded, wherein determining whether the second neighbor block is intra-predicted coded includes determining whether the second neighbor block is coded using a second intra-predicted mode excluding an intra-block copy or combined intra-inter-predicted; Means for determining a first weight and a second weight according to the number of the first and second neighbor blocks determined to be intra-predicted coded; Means for applying the first weight to the inter prediction block and applying the second weight to the intra prediction block; Means for forming a prediction block for the current block by combining a first weighted inter-prediction block and a second weighted intra-prediction block; and A device for decoding video data, comprising means for decoding the current block using the prediction block.
- In Article 16, A device for decoding video data, wherein the first and second neighbor blocks include an upper neighbor block and a left neighbor block.
- In Article 16, A device for decoding video data, further comprising means for decoding data of a bitstream representing the first weight and the second weight.
- In Article 18, A device for decoding video data, wherein the data of the bitstream representing the first weight and the second weight comprises at least one of a sequence parameter set, a picture parameter set, an adaptive parameter set, a video parameter set, a picture header, a slice header, a tile header, or a block header.
- In Article 16, The means for decoding the above-mentioned current block is, Means for decoding a residual block for the above current block; and Means for combining samples of the above residual block with samples of the above prediction block A device for decoding video data, including
Description
Inter-intra prediction mode for video data This application claims priority to U.S. Application No. 16/781,751 filed on February 4, 2020 and U.S. Provisional Application No. 62/802,515 filed on February 7, 2019, the full contents of each of which are incorporated herein by reference. The present disclosure relates to video coding including video encoding and video decoding. Digital video capabilities can be integrated into a wide range of devices, including digital televisions, digital direct broadcast systems, wireless broadcast systems, personal digital assistants (PDAs), laptop or desktop computers, tablet computers, e-book readers, digital cameras, digital recording devices, digital media players, video gaming devices, video game consoles, cellular or satellite wireless phones, so-called “smartphones,” video teleconferencing devices, video streaming devices, etc. Digital video devices implement video coding techniques as described in standards defined by MPEG-2, MPEG-4, ITU-T H.263, ITU-T H.264/MPEG-4, Part 10, AVC (Advanced Video Coding), ITU-T H.265, HEVC (High Efficiency Video Coding), and extensions of these standards. By implementing such video coding techniques, video devices may more efficiently transmit, receive, encode, decode, and/or store digital video information. Video coding techniques include spatial (intra-picture) prediction and/or temporal (inter-picture) prediction to reduce or eliminate redundancy inherent in video sequences. For block-based video coding, a video slice (e.g., a video picture or a part of a video picture) may be partitioned into video blocks, which may also be referred to as coding tree units (CTUs), coding units (CUs), and/or coding nodes. Video blocks in an intra-coded (I) slice of a picture are encoded using spatial prediction for reference samples in neighboring blocks of the same picture. Video blocks in an inter-coded (P or B) slice of a picture may use spatial prediction for reference samples in neighboring blocks of the same picture, or temporal prediction for reference samples in other reference pictures. Pictures may be referred to as frames, and reference pictures may be referred to as reference frames. FIG. 1 is a block diagram illustrating an exemplary video encoding and decoding system capable of performing the techniques of the present disclosure. FIGS. 2a and 2b are conceptual diagrams illustrating an exemplary quadtree binary tree (QTBT) structure and a corresponding coding tree unit (CTU). FIGS. 3a through 3f are conceptual diagrams illustrating exemplary locations of neighboring blocks for the current block. FIG. 4 is a block diagram illustrating an exemplary video encoder capable of performing the techniques of the present disclosure. FIG. 5 is a block diagram illustrating an exemplary video decoder capable of performing the techniques of the present disclosure. FIG. 6 is a flowchart illustrating an exemplary method for encoding a current block according to the techniques of the present disclosure. FIG. 7 is a flowchart illustrating an exemplary method for decoding a current block according to the techniques of the present disclosure. FIG. 8 is a flowchart illustrating an exemplary method of coding (encoding or decoding) video data according to the techniques of the present disclosure. Video coding standards include ITU-T H.261, ISO/IEC MPEG-1 Visual, ITU-T H.262 or ISO/IEC MPEG-2 Visual, ITU-T H.263, ISO/IEC MPEG-4 Visual, and ITU-T H.264 (also known as ISO/IEC MPEG-4 AVC), including its Scalable Video Coding (SVC) and Multi-View Video Coding (MVC) extensions. HEVC (High-Efficiency Video Coding) was finalized in April 2013 by the Joint Collaboration Team on Video Coding (JCT-VC) of the ITU-T Video Coding Expert Group (VCEG) and the ISO/IEC Motion Picture Expert Group (MPEG). The Joint Video Experts Team (JVET), a collaborative team formed by MPEG and ITU-T Study Group 16's VCEG, has recently been working on a new video coding standard known as Versatile Video Coding (VVC). The main objective of VVC is to provide significant improvements in compression performance compared to the existing HEVC standard, while supporting higher quality video services and the deployment of emerging applications such as 360° immersive multimedia and HDR (high-dynamic-range) video. This disclosure describes techniques that may improve combined inter-intra prediction modes for video data prediction. Multi-hypothesis intra-mode (MHI) has been described as improving the merge mode by combining one intra prediction and one merged indexed prediction, and was adopted at the 12th JVET Macao meeting: M.-S. Chiang, C.-W. Hsu, Y.-W. Huang, S.-M. Lei “CE10.1.1: Multi-hypothesis prediction for improving AMVP mode, skip or merge mode, and intra mode,” in JVET-L0100, October 2018. According to MHI, candidates in a set of four intra-modes, including planar, DC, horizontal, and vertical predictions, are evaluated to select the optimal one to be combined with the merged indexed predict