CN-122002045-A - Method and apparatus for video encoding and decoding

CN122002045ACN 122002045 ACN122002045 ACN 122002045ACN-122002045-A

Abstract

An electronic device performs a method of decoding video data, including receiving video data corresponding to an encoding unit from a bitstream, wherein the encoding unit is encoded in an inter prediction mode or an intra block copy mode, receiving a first syntax element from the video data, wherein the first syntax element indicates whether the encoding unit has any non-zero residuals, receiving a second syntax element from the video data based on a determination that the first syntax element has non-zero values, wherein the second syntax element indicates whether the encoding unit has been encoded using an adaptive color space transform (ACT), assigning a zero value to the second syntax element based on a determination that the first syntax element has a zero value, and determining whether to perform an inverse ACT on the video data of the encoding unit based on the value of the second syntax element.

Inventors

XIU XIAOYU
CHEN YIWEN
MA ZONGQUAN
ZHU HONGZHENG
WANG XIANGLIN
YU BING

Assignees

北京达佳互联信息技术有限公司

Dates

Publication Date: 20260508
Application Date: 20200923
Priority Date: 20190923

Claims (16)

1. A method of decoding video data, comprising: receiving a first syntax element from a bitstream, the first syntax element indicating whether a coding unit in the video data is encoded by a single tree partition or a dual tree partition, wherein each coding unit includes both a luma component and a chroma component; based on a determination that the coding unit is coding using a single tree partition: determining a second syntax element from the video data, wherein the second syntax element indicates whether the coding unit has been applied with an adaptive color space transform ACT, and Performing an inverse ACT on residual data of the coding unit according to non-zero values of the second syntax element.
2. The method of claim 1, further comprising: A zero value is assigned to the second syntax element according to a determination that the coding unit is coded using a dual-tree partition, wherein the coding units in a luma partition tree of the dual-tree partition only include coding information related to luma components of the coding unit, and wherein the coding units in a chroma partition tree of the dual-tree partition only include coding information related to chroma components of the coding unit.
3. The method of claim 1, further comprising: in accordance with a determination that the second syntax element has a zero value, the inverse ACT is aborted from being performed on the coding unit.
4. The method of claim 1, wherein switching between the single tree partition structure and the dual tree partition structure occurs at a sequence level, a picture level, a slice level, or a coding unit level.
5. The method of claim 1, wherein the coding unit is encoded in a 4:4:4 chroma format.
6. A method of encoding an encoding unit within a video frame, comprising: Determining a value of a first syntax element indicating whether the coding unit is encoded by a single tree partition or a dual tree partition, wherein each coding unit includes both a luma component and a chroma component; determining a value of a second syntax element, wherein the second syntax element indicates whether the coding unit has been applied with an adaptive color space transform ACT, and The method further includes signaling a value of a determined second syntax element based on determining that the value of the first syntax element indicates that the coding unit is coded using a single tree partition.
7. The method of claim 6, further comprising: The method further includes determining, based on determining that the value of the first syntax element indicates that the coding unit is coded using a dual-tree partition, that the value of the second syntax element is zero, wherein coding units in a luma partition tree of the dual-tree partition include only coding information related to luma components of the coding units, and wherein coding units in a chroma partition tree of the dual-tree partition include only coding information related to chroma components of the coding units.
8. The method of claim 6, wherein the second syntax element having a non-zero value indicates that ACT is performed on residual data of the coding unit.
9. The method of claim 6, wherein the second syntax element having a value of zero indicates that ACT is relinquished to be performed on the coding unit.
10. The method of claim 6, wherein switching between the single tree partition structure and the dual tree partition structure occurs at a sequence level, a picture level, a slice level, or a coding unit level.
11. The method of claim 6, wherein the coding unit is encoded in a 4:4:4 chroma format.
12. An electronic device, comprising: one or more processing units; A memory coupled to the one or more processing units, and A plurality of programs stored in the memory, which when executed by the one or more processing units, cause the electronic device to perform the method of any of claims 1-11.
13. A computer-readable storage medium storing a bitstream formed of instructions that, when executed by a computing device having one or more processors, cause the one or more processors to perform the encoding method of any of claims 6-10.
14. The computer-readable storage medium of claim 13, wherein the encoding unit is encoded in a 4:4:4 chroma format.
15. A method for storing a bitstream, comprising: Performing the encoding method according to any of claims 6-10 to generate a bitstream, and The bit stream is stored.
16. The method of claim 15, wherein the coding unit is encoded in a 4:4:4 chroma format.

Description

Method and apparatus for video encoding and decoding The application relates to a method and a device for encoding and decoding video in a 4:4:4 chroma format, which is applied for the patent application with the application number 202080050545.7 and is divided into patent applications with the application number 2020 and 23. RELATED APPLICATIONS The present application claims priority from U.S. provisional patent application No. 62/904,539 entitled "METHODS AND APPARATUS OF VIDEO CODING IN 4:4:4 CHROMA FORMAT," filed on date 23 at 9 in 2019, the entire contents of which are incorporated by reference. Technical Field The present application relates generally to video data codec and compression, and in particular, to a method and system for improving the codec efficiency of video. Background Digital video is supported by a variety of electronic devices such as digital televisions, laptop or desktop computers, tablet computers, digital cameras, digital recording devices, digital media players, video game consoles, smart phones, video teleconferencing devices, video streaming devices, and the like. The electronic device transmits, receives, encodes, decodes and/or stores digital video data by implementing video compression/decompression standards as defined by the MPEG-4, ITU-T H.263, ITU-T H.264/MPEG-4 (part 10, advanced Video Codec (AVC)), high Efficiency Video Codec (HEVC), and common video codec (VCC) standards. Video compression typically includes performing spatial (intra) prediction and/or temporal (inter) prediction to reduce or remove redundancy inherent in video data. For block-based video coding, a video frame is partitioned into one or more slices, each slice having a plurality of video blocks, which may also be referred to as Coding Tree Units (CTUs). Each CTU may contain one Coding Unit (CU), or be split recursively into smaller CUs until a predefined minimum CU size is reached. Each CU (also referred to as a leaf CU) contains one or more Transform Units (TUs), and each CU also contains one or more Prediction Units (PUs). Each CU may be encoded and decoded in intra, inter or IBC mode. Video blocks in an intra-coded (I) slice of a video frame are coded using spatial prediction with respect to reference samples in neighboring blocks within the same video frame. Video blocks in inter-coded (P or B) slices of a video frame may use spatial prediction with respect to reference samples in neighboring blocks within the same video frame, or use temporal prediction with respect to reference samples in other previous and/or future reference video frames. A prediction block for a current video block to be encoded is generated based on spatial or temporal prediction of a reference block (e.g., a neighboring block) that has been previously encoded. The process of finding the reference block may be accomplished by a block matching algorithm. Residual data representing pixel differences between a current block to be encoded and a prediction block is referred to as a residual block or prediction error. The inter-coded block is coded based on a residual block and a motion vector pointing to a reference block in a reference frame forming the prediction block. The process of determining motion vectors is commonly referred to as motion estimation. The intra-coded block is coded according to an intra-prediction mode and a residual block. For further compression, the residual block is transformed from the pixel domain to a transform domain (e.g., frequency domain), resulting in residual transform coefficients, which may then be quantized. The quantized transform coefficients, initially arranged in a two-dimensional array, may be scanned to produce a one-dimensional vector of transform coefficients, and then entropy encoded into the video bitstream to achieve even more compression. The encoded video bitstream is then stored in a computer readable storage medium (e.g., flash memory) for access by another electronic device having digital video capabilities or directly transmitted to the electronic device in a wired or wireless manner. The electronic device then performs video decompression (which is the reverse of the video compression described above) by, for example, parsing the encoded video bitstream to obtain syntax elements from the bitstream and reconstructing the digital video data from the encoded video bitstream into its original format based at least in part on the syntax elements obtained from the bitstream, and rendering the reconstructed digital video data on a display of the electronic device. As digital video quality goes from high definition to 4Kx2K or even 8Kx4K, the amount of video data to be encoded/decoded grows exponentially. This is a continuing challenge in terms of how video data can be encoded/decoded more efficiently while preserving the image quality of the decoded video data. Some video content (e.g., screen content video) is encoded in a 4:4:4 chroma format in which all three components (lum