CN-122029821-A - Method, apparatus and medium for video processing
Abstract
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is presented. The method includes applying a combined filtering method to samples associated with a video unit for conversion between the video unit and a bitstream of the video, wherein the combined filtering method includes a filtering method and a Bilateral Filtering (BF) method for reference samples, and performing conversion based on the filtered samples.
Inventors
- YIN WENBIN
- ZHANG KAI
- WANG YANG
- ZHANG LI
Assignees
- 抖音视界有限公司
- 字节跳动有限公司
Dates
- Publication Date
- 20260512
- Application Date
- 20241014
- Priority Date
- 20231015
Claims (20)
- 1. A method for video processing, comprising: Applying a combined filtering method to samples associated with a video unit of the video for conversion between the video and a bitstream of the video, wherein the combined filtering method comprises a filtering method and a Bilateral Filtering (BF) method for reference samples, and The conversion is performed based on the filtered samples.
- 2. The method of claim 1, wherein the filtering method is applied to prediction samples associated with the video unit, or Wherein the filtering method is applied to intra-prediction samples before they are used to generate an intra-frame reconstruction, or Wherein the filtering method is applied to Intra Block Copy (IBC) prediction samples before they are used to generate an IBC reconstruction, or Wherein the filtering method is applied to intra-template matching predicted (intra-TMP) samples before they are used to generate intra-TMP reconstructions, or Wherein the filtering method is applied to inter-prediction samples before they are used to generate inter-frame reconstructions, or Wherein the filtering method is applied to the color components of the predicted samples, or Wherein the filtering method is applied to a predicted codec region, or Wherein the filtering method is applied to the predicted samples if the condition is met, or Wherein the filtering method is applied to the predicted samples after the filling method is applied to the predicted samples.
- 3. The method of claim 1, wherein the BF method is applied to a reconstructed sample before the reconstructed sample is used for intra-prediction intra-reference samples, or Wherein the BF method is applied to reference points of a conventional intra mode, or Wherein the BF method is applied to reference points of non-conventional intra modes, or Wherein the BF method is applied to color components, or Wherein the BF method is applied to a codec region, or Wherein if the condition is met, the BF method is applied to an intra reference sample point, or Wherein the BF method is applied to intra reference samples based on the location of the reference samples.
- 4. The method of claim 1, wherein the reference samples comprise reference samples of inter-related modes.
- 5. The method of claim 4, wherein the inter-related modes comprise modes applied to non-I-slices.
- 6. The method of claim 1, wherein the combined filtering method is enabled or disabled based on characteristic information of the reference samples.
- 7. The method of claim 6, wherein the characteristic information comprises at least one of: the texture intensity information is used to determine the texture intensity information, The information of the gradient is provided to the user, With the information of the object, Variance information, or Texture related information.
- 8. The method of claim 1, wherein the BF method of the combined filtering method for the reference samples is applied to at least one of intra-related modes, IBC modes, intra-TMP modes, or inter-related modes.
- 9. The method of claim 8, wherein the BF method in the combined filtering method for the reference samples is applied to an angle-based pattern.
- 10. The method of claim 8, wherein the BF method in the combined filtering method for the reference samples is applied to non-angle based modes.
- 11. The method of claim 8, wherein bilateral filtered reference samples are used to replace unfiltered reference samples.
- 12. The method of claim 8, wherein the bilateral filtered reference samples are used to replace reference samples filtered by another filtering method.
- 13. The method of claim 12, wherein the reference samples filtered by the other filtering method comprise 1-2-1 filtered reference samples.
- 14. The method of claim 8, wherein the BF method is applied to intra reference samples prior to another filtering method.
- 15. The method of claim 14, wherein the other filtering method comprises a 1-2-1 filter.
- 16. The method of claim 8, wherein bilateral filtered reference samples are used as reference sample candidates in the process of intra prediction.
- 17. The method of claim 8, wherein the BF method is applied to intra reference samples after another filtering method, wherein the other filtering method comprises a 1-2-1 filter.
- 18. The method of claim 8, wherein the BF method is applied to intra reference points.
- 19. The method of claim 1, wherein the reference sample point comprises at least one of: reference points for intra-related modes, Reference points for IBC-related modes, or Reference points for modes associated with intra-TMP.
- 20. The method of claim 19, wherein the related modes comprising at least one of intra-related modes, IBC-related modes, or intra-TMP-related modes comprise modes applied to at least one of I-stripes, B-stripes, or P-stripes.
Description
Method, apparatus and medium for video processing Technical Field Embodiments of the present disclosure relate generally to video processing technology and, more particularly, to a bilateral filter of prediction samples in video codec. Background Today, digital video capabilities are being applied to various aspects of a person's life. Various types of video compression techniques have been proposed for video encoding/decoding, such as the MPEG-2, MPEG-4, ITU-T H.263, ITU-T H.264/MPEG-4 part 10 Advanced Video Codec (AVC), ITU-T H.265 High Efficiency Video Codec (HEVC) standard, the multifunctional video codec (VVC) standard. However, the codec efficiency of video codec technology is generally expected to be further improved. Disclosure of Invention Embodiments of the present disclosure provide a solution for video processing. In a first aspect, a method for video processing is presented. The method includes applying a combined filtering method to samples associated with a video unit for conversion between the video unit and a bitstream of the video, wherein the combined filtering method includes a filtering method and a Bilateral Filtering (BF) method for reference samples, and performing conversion based on the filtered samples. The method according to the first aspect of the present disclosure advantageously improves codec efficiency and performance compared to conventional solutions by applying a combined filtering method. In a second aspect, an apparatus for video processing is presented. The apparatus includes a processor and a non-transitory memory having instructions thereon. The instructions, when executed by a processor, cause the processor to perform a method according to the first aspect of the present disclosure. In a third aspect, a non-transitory computer readable storage medium is presented. The non-transitory computer readable storage medium stores instructions that cause a processor to perform a method according to the first aspect of the present disclosure. In a fourth aspect, another non-transitory computer readable recording medium is presented. The non-transitory computer readable recording medium stores a bitstream of video generated by a method performed by an apparatus for video processing. The method includes applying a combined filtering method to samples associated with a video unit of a video, wherein the combined filtering method includes a filtering method and a Bilateral Filtering (BF) method for reference samples, and generating a bitstream based on the filtered samples. In a fifth aspect, a method for storing a bitstream of video is presented. The method includes applying a combined filtering method to samples associated with a video unit of a video, wherein the combined filtering method includes a filtering method and a Bilateral Filtering (BF) method for reference samples, generating a bitstream based on the filtered samples, and storing the bitstream in a non-transitory computer readable recording medium. This summary is intended to introduce a selection of concepts in a simplified form that are further described below in the detailed description. This summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used to limit the scope of the claimed subject matter. Drawings The above and other objects, features and advantages of the exemplary embodiments of the present disclosure will become more apparent by the following detailed description with reference to the accompanying drawings. In example embodiments of the present disclosure, like reference numerals generally refer to like elements. FIG. 1 illustrates a block diagram of an example video codec system according to some embodiments of the present disclosure; fig. 2 illustrates a block diagram of a first example video encoder, according to some embodiments of the present disclosure; fig. 3 illustrates a block diagram of an example video decoder, according to some embodiments of the present disclosure; FIG. 4 illustrates nominal vertical and horizontal positions of 4:2:2 luminance and chrominance samples in a picture; FIG. 5 shows an example of a block diagram of an encoder; Fig. 6 shows a picture with 18 x 12 luma Codec Tree Units (CTUs) partitioned into 12 slices and 3 raster scan stripes; FIG. 7 shows a picture with 18×12 luma CTUs partitioned into 24 tiles and 9 rectangular stripes; fig. 8 shows a picture divided into 4 pieces, 11 bricks and 4 rectangular strips; fig. 9A shows CTS across the bottom boundary of a picture; Fig. 9B shows CTS across the right boundary of the picture; Fig. 9C shows CTBs across the lower right boundary of a picture; Fig. 10 shows 67 intra prediction modes; FIG. 11 shows a schematic of picture samples and horizontal and vertical block boundaries on an 8X 8 grid, and non-overlapping blocks of 8X 8 samples; fig. 12 shows pixels related to filter on/off decision and strong/weak filter switching; Fig. 13A to 13C show filter