EP-4736439-A1 - SYSTEMS AND METHODS FOR EXTENDED MULTI-RESIDUE BLOCK CODING

EP4736439A1EP 4736439 A1EP4736439 A1EP 4736439A1EP-4736439-A1

Abstract

The various implementations described herein include methods and systems for coding video. In one aspect, a method includes receiving a video bitstream comprising a plurality of coding blocks including a current block. The method includes deriving a first residual block for the current block using a first set of parameters. The method includes deriving a second residual block for the current block using a second set of parameters, where the second set of parameters is independent of the first set of parameters. The method includes deriving a combined residual block using the first residual block and the second residual block. The method also includes reconstructing the current block using the combined residual block.

Inventors

LIU, TIANQI
ZHAO, LIANG
ZHAO, XIN
YE, JING
GAO, Han
LIU, SHAN

Assignees

Tencent America LLC

Dates

Publication Date: 20260506
Application Date: 20240531

Claims (20)

1. A method of video decoding performed at a computing system having memory and one or more processors, the method comprising: receiving a video bitstream comprising a plurality of coding blocks including a current block; deriving a first residual block for the current block using a first set of parameters; deriving a second residual block for the current block using a second set of parameters, wherein the second set of parameters is independent of the first set of parameters; deriving a combined residual block using the first residual block and the second residual block; and reconstructing the current block using the combined residual block.
2. The method of claim 1, wherein the first residual block corresponds to a first transform block size, and wherein the second residual block corresponds to a second transform block size that is different than the first transform block size.
3. The method of claim 2, wherein the first transform block size and the second transform block size are selected from a group consisting of: 4x4, 8x8, 16x16, and 32x32.
4. The method of claim 2, wherein the first transform block size is a fixed size and wherein a transform type for the first residual block is fixed.
5. The method of claim 2, wherein the first transform block size is a fixed size and wherein the second transform block size is selected from a predefined group of transform sizes.
6. The method of claim 1, wherein: the second residual block corresponds to a lossless coding mode; and in accordance with the second residual block having an identity transform type, the second residual block corresponds to a transform block size selected from a group consisting of: 4x4, 8x8, 16x16, and 32x32.
7. The method of claim 1, wherein the first residual block corresponds to a first transform kernel type, and wherein the second residual block corresponds to a second transform kernel type that is different than the first transform kernel type.
8. The method of claim 7, wherein the first transform kernel type is selected from a first transform kernel set, and wherein the second transform kernel type is selected from a second transform kernel set that is different than the first transform kernel set.
9. The method of claim 7, wherein a respective primary transform kernel is predefined or signaled for each of the first residual block and the second residual block.
10. The method of claim 7, wherein the second residual block corresponds to a lossless coding mode, and wherein the second transform kernel type is restricted to a set of lossless transforms.
11. The method of claim 1, wherein the first residual block corresponds to a first quantization step size, and wherein the second residual block corresponds to a second quantization step size that is different than the first quantization step size.
12. The method of claim 1, wherein the first residual block corresponds to a first secondary transform type, and wherein the second residual block corresponds to a second secondary transform type that is different than the first secondary transform type.
13. The method of claim 1, wherein the first residual block corresponds to application of a secondary transform, and wherein the second residual block corresponds to no secondary transform being applied.
14. The method of claim 1, wherein the first residual block corresponds to a first quantization index, and wherein the second residual block corresponds to a second quantization index that is different than the first quantization index.
15. The method of claim 14, wherein the first quantization index is determined from a base quantization value signaled at a first coding level and an index delta signaled at a second coding level that is less than the first coding level.
16. The method of claim 14, wherein at least one of the first quantization index and the second quantization index is not signaled in the video bitstream.
17. The method of claim 1, further comprising: determining a context for entropy coding the first set of parameters using coded information that includes one or more of: a residue mode, a transform size, a transform type, a quantization index, and a secondary transform mode; and entropy encoding the first set of parameters using the determined context.
18. A computing system, comprising: control circuitry; memory; and one or more sets of instructions stored in the memory and configured for execution by the control circuitry, the one or more sets of instructions comprising instructions for: receiving video data comprising a plurality of blocks including a current block; deriving a first residual block for the current block using a first set of parameters; deriving a second residual block for the current block using a second set of parameters, wherein the second set of parameters is independent of the first set of parameters; and signaling the first residual block and the second residual block in a video bitstream.
19. The computing system of claim 18, further comprising signaling, via the video bitstream, at least one of the first set of parameters and the second set of parameters.
20. A non-transitory computer-readable storage medium storing one or more sets of instructions configured for execution by a computing device having control circuitry and memory, the one or more sets of instructions comprising instructions for: obtaining a source video sequence that comprises a plurality of frames; and performing a conversion between the source video sequence and a video bitstream of visual media data, wherein the video bitstream comprises: a plurality of encoded blocks corresponding to the plurality of frames, including a current block; a first residual block for the current block, wherein the first residual block is generated using a first set of parameters; and a second residual block for the current block, wherein the second residual block is generated using a second set of parameters.

Description

Systems and Methods for Extended Multi-Residue Block Coding RELATED APPLICATIONS [0001] This application claims priority to (i) U.S. Provisional Patent Application No. 63/524,549, entitled “Multiple Residual Block Coding Mode,” filed June 30, 2023, and (ii) U.S. Provisional Patent Application No. 63/611,089, entitled “Extended Multi -Residue Block Coding,” filed December 15, 2023, and is a continuation of and claims priority to U.S. Patent Application No. 18/677,729, entitled “Systems and Methods for Extended Multi -Residue Block Coding” filed on May 29, 2024, each of which is hereby incorporated by reference in its entirety. TECHNICAL FIELD [0002] The disclosed embodiments relate generally to video coding, including but not limited to systems and methods for deriving combined residual blocks for use with encoding and decoding coding blocks. BACKGROUND [0003] Digital video is supported by a variety of electronic devices, such as digital televisions, laptop or desktop computers, tablet computers, digital cameras, digital recording devices, digital media players, video gaming consoles, smart phones, video teleconferencing devices, video streaming devices, etc. The electronic devices transmit and receive or otherwise communicate digital video data across a communication network, and/or store the digital video data on a storage device. Due to a limited bandwidth capacity of the communication network and limited memory resources of the storage device, video coding may be used to compress the video data according to one or more video coding standards before it is communicated or stored. The video coding can be performed by hardware and/or software on an electronic/client device or a server providing a cloud service. [0004] Video coding generally utilizes prediction methods (e.g., inter-prediction, intraprediction, or the like) that take advantage of redundancy inherent in the video data. Video coding aims to compress video data into a form that uses a lower bit rate, while avoiding or minimizing degradations to video quality. Multiple video codec standards have been developed. For example, High-Efficiency Video Coding (HEVC/H.265) is a video compression standard designed as part of the MPEG-H project. ITU-T and ISO/IEC published the HEVC/H.265 standard in 2013 (version 1), 2014 (version 2), 2015 (version 3), and 2016 (version 4). Versatile Video Coding (VVC/H.266) is a video compression standard intended as a successor to HEVC. ITU-T and ISO/IEC published the VVC/H.266 standard in 2020 (version 1) and 2022 (version 2). AOMedia Video 1 (AVI) is an open video coding format designed as an alternative to HEVC. On January 8, 2019, a validated version 1.0.0 with Errata 1 of the specification was released. SUMMARY [0005] The present disclosure describes a set of methods for video (image) compression, including residue coding techniques for coding blocks of a video. The residue coding techniques include a multi-residual block coding mode. In some embodiments, to reconstruct a current block, multiple residual blocks (e.g., N partial residual blocks) are coded and used to derive a combined residual block. For example, values of the residual blocks can be added to generate the combined residual block, which is then added to a prediction block to derive the reconstructed block. In some embodiments, the residual blocks have different parameters, such as transform sizes, transform types, and/or coding modes (e.g., lossless or lossy coding modes). An advantage of using multiple residue coding techniques and combined residual blocks is that the quality of the corresponding reconstructed (decoded) video may be improved. Additionally, using multiple residue coding techniques and combined residual blocks can decrease coding overhead (e.g., more efficiently encoding and/or decoding the residuals for a video). [0006] In accordance with some embodiments, a method of video decoding includes (i) receiving a video bitstream comprising a plurality of coding blocks including a current block; (ii) deriving a first residual block (also sometimes referred to as a residue block) for the current block using a first set of parameters; (iii) deriving a second residual block for the current block using a second set of parameters, wherein the second set of parameters is independent of the first set of parameters; (iv) deriving a combined residual block using the first residual block and the second residual block; and (v) reconstructing the current block using the combined residual block. [0007] In accordance with some embodiments, a method of video encoding includes (i) receiving video data comprising a plurality of blocks including a current block; (ii) deriving a first residual block for the current block using a first set of parameters; (iii) deriving a second residual block for the current block using a second set of parameters, wherein the second set of parameters is independent of the first set of parameters; and (iv) signaling the first re