Search

US-20260129217-A1 - METHOD, APPARATUS, AND MEDIUM FOR VIDEO PROCESSING

US20260129217A1US 20260129217 A1US20260129217 A1US 20260129217A1US-20260129217-A1

Abstract

Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. In the method, for a conversion between a current video block of a video and a bitstream of the video, usage information of a plurality of template-based approaches is determined for the current video block based on at least one syntax element at a first level. The first level is higher than a coding unit level. The conversion is performed based on the usage information. The plurality of template-based approaches comprises at least two coding tools associated with at least one of: intra coding, inter coding, or intra block copy (IBC) coding.

Inventors

  • Zhipin Deng
  • Kai Zhang
  • Li Zhang

Assignees

  • Douyin Vision Co., Ltd.
  • BYTEDANCE INC.

Dates

Publication Date
20260507
Application Date
20251229
Priority Date
20230628

Claims (20)

  1. 1 . A method for video processing, comprising: determining, for a conversion between a current video block of a video and a bitstream of the video, usage information of a plurality of template-based approaches for the current video block based on at least one syntax element at a first level, the first level being higher than a coding unit level; and performing the conversion based on the usage information, wherein the plurality of template-based approaches comprises at least two coding tools associated with at least one of: intra coding, inter coding, or intra block copy (IBC) coding.
  2. 2 . The method of claim 1 , wherein the at least one syntax element comprises at least one of: a first syntax element at the first level in the bitstream, the first syntax element indicating one of: enabling, disabling, allowance or disallowance of a template-based determination, without considering whether the template-based determination is used to an intra tool, an IBC tool, a palette (PLT) tool, or an inter tool, a second syntax element at the first level in the bitstream, the second syntax element indicating one of: enabling, disabling, allowance or disallowance of a template-based determination for an intra tool, a third syntax element at the first level in the bitstream, the third syntax element indicating at least one of: enabling, disabling, allowance or disallowance of a template-based determination for an IBC tool, a fourth syntax element at the first level in the bitstream, the fourth syntax element indicating one of: enabling, disabling, allowance or disallowance of a template-based determination for an inter tool, or a fifth syntax element at the first level in the bitstream, the fifth syntax element indicating one of: enabling, disabling, allowance or disallowance of a template-based determination for a non-inter tool.
  3. 3 . The method of claim 2 , wherein the non-inter tool comprises at least one of: an intra tool, an IBC tool or a palette (PLT) tool, or wherein whether at least one of the second, the third, the fourth or the fifth syntax element is included in the bitstream is based on the first syntax element.
  4. 4 . The method of claim 3 , wherein the first syntax element in the bitstream indicates enabling or allowance of the template-based determination, and the fourth and fifth syntax elements are included in the bitstream, or wherein the fourth and fifth syntax elements are excluded from the bitstream and determined to be a predetermined number, the predetermined number indicative of disabling or disallowance of the template-based determination, or wherein the first syntax element in the bitstream indicates enabling or allowance of the template-based determination, and the second, the third and the fourth syntax elements are included in the bitstream, or wherein the second, the third and the fourth syntax elements are excluded from the bitstream and determined to be a predetermined number, the predetermined number indicative of disabling or disallowance of the template-based determination.
  5. 5 . The method of claim 2 , wherein the second, the third and the fourth syntax elements are included in the bitstream independently.
  6. 6 . The method of claim 2 , wherein the fourth syntax element and the fifth syntax element are included in the bitstream independently.
  7. 7 . The method of claim 2 , wherein the at least one syntax element comprises a syntax element controlling at least one of: enabling, disabling, allowance or disallowance of a template-based determination for at least one of: an intra tool, or an IBC tool.
  8. 8 . The method of claim 7 , wherein the syntax element comprises a sequence parameter set (SPS) flag at a sequence level controlling at least one of: enabling, disabling, allowance or disallowance of the template-based determination for the intra tool and the IBC tool.
  9. 9 . The method of claim 7 , wherein the syntax element comprises a sequence parameter set (SPS) flag at a sequence level controlling at least one of: enabling, disabling, allowance or disallowance of the template-based determination for the intra tool or the IBC tool.
  10. 10 . The method of claim 8 , wherein the SPS flag comprises an SPS template matching non-inter flag.
  11. 11 . The method of claim 1 , wherein the plurality of template-based approaches comprises at least one of: a coding tool involving a template with search or a coding tool involving a template without search.
  12. 12 . The method of claim 11 , wherein the coding tool involving the template with search comprises at least one of: a template cost calculation, or a template matching.
  13. 13 . The method of claim 11 , wherein the coding tool involving the template without search comprises at least one of: a decoder side intra mode derivation (DIMD), a convolutional cross-component model (CCCM), a linear model (LM), a cross-component linear model (CCLM), a gradient linear model (GLM), or a local illumination compensation (LIC).
  14. 14 . The method of claim 1 , wherein the template-based determination indicates whether to use the plurality of template-based approaches for the current video block.
  15. 15 . The method of claim 1 , wherein the first level comprises one of: a sequence level, a group of pictures level, a picture header level, a picture level, a slice header level, a slice level, or a tile group level, or wherein an indication of whether to and/or how to determine the usage information based on the at least one syntax element at the first level is included in the bitstream at one of: a sequence level, a group of pictures level, a picture level, a slice level, or a tile group level, or wherein an indication of whether to and/or how to determine the usage information based on the at least one syntax element at the first level is included in the bitstream in one of: a sequence header, a picture header, a sequence parameter set (SPS), a video parameter set (VPS), a dependency parameter set (DPS), a decoding capability information (DCI), a picture parameter set (PPS), an adaptation parameter set (APS), a slice header, or a tile group header, or wherein an indication of whether to and/or how to determine the usage information based on the at least one syntax element at the first level is included in the bitstream in one of: a prediction block (PB), a transform block (TB), a coding block (CB), a prediction unit (PU), a transform unit (TU), a coding unit (CU), a virtual pipeline data unit (VPDU), a coding tree unit (CTU), a CTU row, a slice, a tile, a sub-picture, or a region containing more than one sample or pixel.
  16. 16 . The method of claim 1 , further comprising: determining, based on coded information of the current video block, whether to and/or how to determine the usage information based on the at least one syntax element at the first level, the coded information including at least one of: a block size, a colour format, a single and/or dual tree partitioning, a colour component, a slice type, or a picture type.
  17. 17 . The method of claim 1 , wherein the conversion comprising encoding the current video block into the bitstream, or wherein the conversion comprises decoding the current video block from the bitstream.
  18. 18 . An apparatus for video processing comprising a processor and a non-transitory memory with instructions thereon, wherein the instructions upon execution by the processor, cause the processor to: determine, for a conversion between a current video block of a video and a bitstream of the video, usage information of a plurality of template-based approaches for the current video block based on at least one syntax element at a first level, the first level being higher than a coding unit level; and perform the conversion based on the usage information, wherein the plurality of template-based approaches comprises at least two coding tools associated with at least one of: intra coding, inter coding, or intra block copy (IBC) coding.
  19. 19 . A non-transitory computer-readable storage medium storing instructions that cause a processor to perform a method performed by an apparatus for video processing, wherein the method comprises: determining, for a conversion between a current video block of a video and a bitstream of the video, usage information of a plurality of template-based approaches for the current video block based on at least one syntax element at a first level, the first level being higher than a coding unit level; and performing the conversion based on the usage information, wherein the plurality of template-based approaches comprises at least two coding tools associated with at least one of: intra coding, inter coding, or intra block copy (IBC) coding.
  20. 20 . A non-transitory computer-readable recording medium storing a bitstream of a video which is generated by a method performed by an apparatus for video processing, wherein the method comprises: determining usage information of a plurality of template-based approaches for a current video block of the video based on at least one syntax element at a first level, the first level being higher than a coding unit level; and generating the bitstream based on the usage information, wherein the plurality of template-based approaches comprises at least two coding tools associated with at least one of: intra coding, inter coding, or intra block copy (IBC) coding.

Description

CROSS-REFERENCE TO RELATED APPLICATIONS This application is a continuation of International Application No. PCT/CN2024/102148, filed on Jun. 27, 2024, which claims the benefit of International Application No. PCT/CN2023/103341, filed on Jun. 28, 2023. The entire contents of these applications are hereby incorporated by reference in their entireties. FIELDS Embodiments of the present disclosure relates generally to video processing techniques, and more particularly, to syntax element control of coding tool. BACKGROUND In nowadays, digital video capabilities are being applied in various aspects of peoples' lives. Multiple types of video compression technologies, such as MPEG-2, MPEG-4, ITU-TH.263, ITU-TH.264/MPEG-4 Part 10 Advanced Video Coding (AVC), ITU-TH.265 high efficiency video coding (HEVC) standard, versatile video coding (VVC) standard, have been proposed for video encoding/decoding. However, coding efficiency of video coding techniques is generally expected to be further improved. SUMMARY Embodiments of the present disclosure provide a solution for video processing. In a first aspect, a method for video processing is proposed. The method comprises: determining, for a conversion between a current video block of a video and a bitstream of the video, usage information of a plurality of template-based approaches for the current video block based on at least one syntax element at a first level, the first level being higher than a coding unit level; and performing the conversion based on the usage information, wherein the plurality of template-based approaches comprises at least two coding tools associated with at least one of: intra coding, inter coding, or intra block copy (IBC) coding. The method in accordance with the first aspect of the present disclosure enables a high-level syntax element control for the template-based approaches, and thus can improve the coding effectiveness and/or coding efficiency. In a second aspect, another method for video processing is proposed. The method comprises: determining, for a conversion between a current video block of a video and a bitstream of the video, a first syntax element for controlling a template matching, the template matching being used for at least one of: an intra coding, or an intra block copy (IBC) coding; determining usage information of a template-based coding tool for the current video block based on the first syntax element, the template-based coding tool comprising at least one of: a template-based intra coding tool, or a template-based IBC coding tool; and performing the conversion based on the usage information. The method in accordance with the second aspect of the present disclosure control the template-based coding tool based on the template matching control, and thus can improve the coding effectiveness and/or coding efficiency. In a third aspect, another method for video processing is proposed. The method comprises: determining, for a conversion between a current video block of a video and a bitstream of the video, a first syntax element for controlling a template matching, the template matching being used for at least one of: an intra coding, or an intra block copy (IBC) coding; determining usage information of a local-boosting cross-component prediction for the current video block based on the first syntax element; and performing the conversion based on the usage information. The method in accordance with the third aspect of the present disclosure controls the local-boosting cross-component prediction based on high level syntax element, and thus can improve the coding effectiveness and/or coding efficiency. In a fourth aspect, another method for video processing is proposed. The method comprises: determining, for a conversion between a current video block of a video and a bitstream of the video, usage information of an adaptive reordering of merge candidates (ARMC) for the current video block based on at least one syntax element; and performing the conversion based on the usage information. The method in accordance with the fourth aspect of the present disclosure controls the usage of the ARMC based on the syntax element, and thus can improve the coding effectiveness and/or coding efficiency. In a fifth aspect, an apparatus for video processing is proposed. The apparatus comprises a processor and a non-transitory memory with instructions thereon. The instructions upon execution by the processor, cause the processor to perform a method in accordance with the first, second, third, or fourth aspect of the present disclosure. In a sixth aspect, a non-transitory computer-readable storage medium is proposed. The non-transitory computer-readable storage medium stores instructions that cause a processor to perform a method in accordance with the first, second, third, or fourth aspect of the present disclosure. In a seventh aspect, another non-transitory computer-readable recording medium is proposed. The non-transitory computer-readable recordi