US-20260129204-A1 - METHOD, APPARATUS, AND MEDIUM FOR VIDEO PROCESSING
Abstract
Embodiments of the disclosure provide a solution for video processing. A method for video processing is proposed. The method includes: deriving, for a conversion between a video unit of a video and a bitstream of the video, a refined prediction sample of a prediction sample in the video unit by applying a function used in local illumination compensation (LIC) to the prediction sample, in response to that the video unit is coded with a target coding mode, wherein one or more parameters of the function are modified and the target coding mode is applied to at least one of: a prediction mode or a coding tool; and performing the conversion based on the refined prediction sample.
Inventors
- Yang Wang
- Kai Zhang
- Yuwen He
- Hongbin Liu
- Li Zhang
Assignees
- Douyin Vision Co., Ltd.
- BYTEDANCE INC.
Dates
- Publication Date: 20260507
- Application Date: 20251229
- Priority Date: 20230629
Claims (20)
- 1. A method for video processing, comprising: deriving, for a conversion between a video unit of a video and a bitstream of the video, a refined prediction sample of a prediction sample in the video unit by applying a function used in local illumination compensation (LIC) to the prediction sample, in response to that the video unit is coded with a target coding mode, wherein one or more parameters of the function are modified and the target coding mode is applied to at least one of: a prediction mode or a coding tool; and performing the conversion based on the refined prediction sample.
- 2. The method of claim 1, wherein the prediction mode comprises at least one of: an intra prediction mode, an inter prediction mode, an intra block copy (IBC) prediction mode, or other prediction mode.
- 3. The method of claim 1, wherein if the target coding mode is used, one or more coding tools are not used.
- 4. The method of claim 3, wherein overlapped block motion compensation (OBMC) is not used.
- 5. The method of claim 4, wherein an indication of OBMC is not signaled, if the target coding mode is used.
- 6. The method of claim 1, wherein the target coding mode is not allowed to be used for bi-prediction.
- 7. The method of claim 1, wherein the target coding mode is applied to one or more inter coding tools, or wherein the target coding mode is not applied to one or more inter coding tools.
- 8. The method of claim 7, wherein the target coding mode is not used for RPR, and/or wherein if at least one of: BCW, adaptive motion vector resolution (AMVR), or symmetric motion vector differences (SMVD) is used, a syntax element indicating whether the target coding mode is used is not signaled.
- 9. The method of claim 7, wherein the target coding mode is not used for a cross-component coding tool which uses cross-component information to get the prediction or reconstruction.
- 10. The method of claim 9, wherein the cross-component coding tool comprises at least one of: a cross-component linear model (CCLM), a variant of CCLM, a multi-model CCLM (MMLM), a variant of MMLM, a gradient linear model (GLM), a variant of GLM, an inter convolutional cross-component model (CCCM), a variant of inter CCCM, a CCP merge, or a variant of CCP merge.
- 11. The method of claim 9, wherein the target coding mode is not used with the cross-component coding tool.
- 12. The method of claim 1, wherein the target coding mode is applied to AMVP mode, and/or wherein the target coding mode is applied to merge mode.
- 13. The method of claim 12, wherein the AMVP mode comprises at least one of: a regular AMVP mode, an affine AMVP mode, a TM AMVP mode, or an IBC AMVP mode.
- 14. The method of claim 12, wherein LIC parameters after being modified by a slope adjustment parameter are used with other LIC parameters.
- 15. The method of claim 14, wherein the other LIC parameters are inherited LIC parameters.
- 16. The method of claim 12, wherein reordering is used for slope adjustment parameters.
- 17. The method of claim 1, wherein the conversion includes encoding the video unit into the bitstream, or wherein the conversion includes decoding the video unit from the bitstream.
- 18. An apparatus for video processing comprising a processor and a non-transitory memory with instructions thereon, wherein the instructions upon execution by the processor, cause the processor to perform acts comprising: deriving, for a conversion between a video unit of a video and a bitstream of the video, a refined prediction sample of a prediction sample in the video unit by applying a function used in local illumination compensation (LIC) to the prediction sample, in response to that the video unit is coded with a target coding mode, wherein one or more parameters of the function are modified and the target coding mode is applied to at least one of: a prediction mode or a coding tool; and performing the conversion based on the refined prediction sample.
- 19. A non-transitory computer-readable storage medium storing instructions that cause a processor to perform acts comprising: deriving, for a conversion between a video unit of a video and a bitstream of the video, a refined prediction sample of a prediction sample in the video unit by applying a function used in local illumination compensation (LIC) to the prediction sample, in response to that the video unit is coded with a target coding mode, wherein one or more parameters of the function are modified and the target coding mode is applied to at least one of: a prediction mode or a coding tool; and performing the conversion based on the refined prediction sample.
- 20. A non-transitory computer-readable recording medium storing a bitstream of a video which is generated by a method performed by an apparatus for video processing, wherein the method comprises: deriving a refined prediction sample of a prediction sample in a video unit of the video by applying a function used in local illumination compensation (LIC) to the prediction sample, in response to that the video unit is coded with a target coding mode, wherein one or more parameters of the function are modified and the target coding mode is applied to at least one of: a prediction mode or a coding tool; and generating the bitstream of the video unit based on the refined prediction sample.
Description
CROSS-REFERENCE TO RELATED APPLICATIONS
This application is a continuation of International Application No. PCT/CN2024/102046, filed on Jun. 27, 2024, which claims the benefit of International Application No. PCT/CN2023/103830, filed on Jun. 29, 2023, International Application No. PCT/CN2023/124102, filed on Oct. 11, 2023, and International Application No. PCT/CN2023/138564, filed on Dec. 13, 2023. The entire contents of these applications are hereby incorporated by reference.

FIELDS
Embodiments of the present disclosure relate generally to video processing techniques and, more particularly, to local illumination compensation with slope adjustment.

BACKGROUND
Nowadays, digital video capabilities are applied in many aspects of people's lives. Multiple video compression technologies, such as MPEG-2, MPEG-4, ITU-T H.263, ITU-T H.264/MPEG-4 Part 10 Advanced Video Coding (AVC), the ITU-T H.265 High Efficiency Video Coding (HEVC) standard, and the Versatile Video Coding (VVC) standard, have been proposed for video encoding and decoding. However, the coding efficiency of video coding techniques is generally expected to be further improved.

SUMMARY
Embodiments of the present disclosure provide a solution for video processing.

In a first aspect, a method for video processing is proposed. The method comprises: deriving, for a conversion between a video unit of a video and a bitstream of the video, a refined prediction sample of a prediction sample in the video unit by applying a function used in local illumination compensation (LIC) to the prediction sample, in response to the video unit being coded with a target coding mode, wherein one or more parameters of the function are modified and the target coding mode is applied to at least one of: a prediction mode or a coding tool; and performing the conversion based on the refined prediction sample. In this way, coding performance can be improved by adjusting the parameters of the LIC function.
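As an illustration only, the first aspect can be sketched as a linear LIC function whose slope is modified before it is applied to the prediction samples. The disclosure states only that one or more parameters of the function are modified; the fixed-point layout, the pivot rule that recomputes the offset, and the names `LicParams`, `adjust_slope`, and `refine_prediction` are assumptions made for this sketch and are not taken from the disclosure.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class LicParams:
    """Hypothetical fixed-point LIC model: refined = ((scale * p) >> shift) + offset."""
    scale: int
    offset: int
    shift: int


def adjust_slope(params: LicParams, delta: int, ref_mean: int) -> LicParams:
    """Return a copy of params with the slope changed by delta (in fixed-point units).

    Assumption: the offset is recomputed so that the adjusted line pivots around
    ref_mean, a hypothetical mean of the reference samples.
    """
    new_scale = params.scale + delta
    new_offset = params.offset - ((delta * ref_mean) >> params.shift)
    return LicParams(new_scale, new_offset, params.shift)


def refine_prediction(pred: List[int], params: LicParams, bit_depth: int = 10) -> List[int]:
    """Apply the linear function to each prediction sample and clip to the sample range."""
    max_val = (1 << bit_depth) - 1
    return [
        min(max(((params.scale * p) >> params.shift) + params.offset, 0), max_val)
        for p in pred
    ]


if __name__ == "__main__":
    base = LicParams(scale=32, offset=4, shift=5)      # slope 1.0, offset 4
    steeper = adjust_slope(base, delta=4, ref_mean=512)  # slope 1.125, pivot at 512
    samples = [100, 512, 1023]
    print(refine_prediction(samples, base))
    print(refine_prediction(samples, steeper))
```

In this example the sample at the assumed pivot value (512) maps to the same refined value before and after the adjustment, while samples away from the pivot move with the new slope. How the adjustment value is chosen and signaled is outside the scope of this sketch.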
In a second aspect, an apparatus for video processing is proposed. The apparatus comprises a processor and a non-transitory memory with instructions thereon. The instructions, upon execution by the processor, cause the processor to perform a method in accordance with the first aspect of the present disclosure.

In a third aspect, a non-transitory computer-readable storage medium is proposed. The non-transitory computer-readable storage medium stores instructions that cause a processor to perform a method in accordance with the first aspect of the present disclosure.

In a fourth aspect, another non-transitory computer-readable recording medium is proposed. The non-transitory computer-readable recording medium stores a bitstream of a video which is generated by a method performed by an apparatus for video processing. The method comprises: deriving a refined prediction sample of a prediction sample in a video unit of the video by applying a function used in local illumination compensation (LIC) to the prediction sample, in response to the video unit being coded with a target coding mode, wherein one or more parameters of the function are modified and the target coding mode is applied to at least one of: a prediction mode or a coding tool; and generating the bitstream of the video unit based on the refined prediction sample.

In a fifth aspect, a method for storing a bitstream of a video is proposed.
The method comprises: deriving a refined prediction sample of a prediction sample in a video unit of the video by applying a function used in local illumination compensation (LIC) to the prediction sample, in response to the video unit being coded with a target coding mode, wherein one or more parameters of the function are modified and the target coding mode is applied to at least one of: a prediction mode or a coding tool; generating the bitstream of the video unit based on the refined prediction sample; and storing the bitstream in a non-transitory computer-readable recording medium.

This Summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used to limit the scope of the claimed subject matter.

BRIEF DESCRIPTION OF THE DRAWINGS
Through the following detailed description with reference to the accompanying drawings, the above and other objectives, features, and advantages of example embodiments of the present disclosure will become more apparent. In the example embodiments of the present disclosure, the same reference numerals usually refer to the same components.

FIG. 1 illustrates a block diagram of an example video coding system, in accordance with some embodiments of the present disclosure;

FIG. 2 illustrates a block diagram of a first example video encoder, in accordance with some embodiments of the present disclosure;

FIG. 3 illustrates