CN-120128699-B - Computer-implemented encoding and decoding method and non-transitory computer-readable storage medium

Abstract

The present disclosure provides computer-implemented methods of encoding and decoding and non-transitory computer-readable storage media. The method includes receiving a bitstream of video data, enabling or disabling an encoding mode for a video sequence based on a first flag in the bitstream, and determining whether to enable or disable control of the encoding mode at a level below a sequence level based on a second flag in the bitstream.

Inventors

  • CHEN JIE
  • Liao Ruling
  • LUO JIANCONG
  • YE YAN

Assignees

  • Alibaba Group Holding Limited

Dates

Publication Date: 2026-05-12
Application Date: 2020-08-20
Priority Date: 2019-09-12

Claims (12)

  1. A computer-implemented decoding method, comprising: receiving a bitstream associated with a video sequence; decoding a first flag and a second flag in a Sequence Parameter Set (SPS) of the bitstream, wherein the first flag indicates at least one of enabling or disabling of a plurality of coding modes at a sequence level; and determining whether a third flag is present in the bitstream based on a value of the second flag, wherein the third flag indicates whether the coding mode enabled at the sequence level is enabled or disabled at a lower level below the sequence level; wherein the plurality of coding modes include a decoder-side motion vector refinement (DMVR) mode, and the bitstream is decoded based on a value of the third flag.
  2. The computer-implemented decoding method of claim 1, wherein the lower level below the sequence level is a slice level or a picture level.
  3. The computer-implemented decoding method of claim 1, the method further comprising: in response to the second flag having a first value, determining that the third flag is present in a slice header or a picture header of the bitstream.
  4. The computer-implemented decoding method of claim 3, wherein the first value is 1.
  5. A computer-implemented encoding method, comprising: encoding a first flag and a second flag in a Sequence Parameter Set (SPS) of a bitstream associated with a video sequence, wherein the first flag indicates that at least one of a plurality of coding modes is enabled or disabled at a sequence level; and determining whether to signal a third flag in the bitstream based on a value of the second flag, wherein the third flag indicates whether the coding mode enabled at the sequence level is enabled or disabled at a lower level below the sequence level; wherein the plurality of coding modes include a decoder-side motion vector refinement (DMVR) mode, and the bitstream is encoded based on a value of the third flag.
  6. The computer-implemented encoding method of claim 5, wherein the lower level below the sequence level is a slice level or a picture level.
  7. The computer-implemented encoding method of claim 5, the method further comprising: in response to the second flag having a first value, encoding the third flag in a slice header or a picture header of the bitstream.
  8. The computer-implemented encoding method of claim 7, wherein the first value is 1.
  9. A non-transitory computer-readable storage medium storing computer instructions and a bitstream associated with a video sequence, wherein the computer instructions, when executed by a processor, generate the bitstream associated with the video sequence according to a method comprising: encoding a first flag and a second flag in a Sequence Parameter Set (SPS) of the bitstream, wherein the first flag indicates that at least one of a plurality of coding modes is enabled or disabled at a sequence level; and determining whether to signal a third flag in the bitstream based on a value of the second flag, wherein the third flag indicates whether the coding mode enabled at the sequence level is enabled or disabled at a lower level below the sequence level; wherein the plurality of coding modes include a decoder-side motion vector refinement (DMVR) mode, and the bitstream is encoded based on a value of the third flag.
  10. The non-transitory computer-readable storage medium of claim 9, wherein the lower level below the sequence level is a slice level or a picture level.
  11. The non-transitory computer-readable storage medium of claim 9, the method further comprising: in response to the second flag having a first value, encoding the third flag in a slice header or a picture header of the bitstream.
  12. The non-transitory computer-readable storage medium of claim 11, wherein the first value is 1.
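
The decoding flow recited in claims 1 to 4 can be illustrated with a minimal Python sketch. It is not the claimed implementation. The toy bitstream holds only the three flags, whereas a real SPS and slice header carry many other syntax elements. The names `BitReader`, `Sps`, `modes_enabled`, `lower_level_control`, `parse_sps`, and `parse_slice_header` are hypothetical. The sketch further assumes that a third-flag value of 1 means "enabled at the slice level" and that an absent third flag leaves the sequence-level setting in force; the claims do not fix either point.

```python
from dataclasses import dataclass


class BitReader:
    """Reads single bits, most significant bit first, from a bytes object."""

    def __init__(self, data: bytes) -> None:
        self.data = data
        self.pos = 0  # current position, counted in bits

    def read_flag(self) -> bool:
        byte = self.data[self.pos // 8]
        bit = (byte >> (7 - self.pos % 8)) & 1
        self.pos += 1
        return bool(bit)


@dataclass
class Sps:
    modes_enabled: bool        # "first flag" (hypothetical field name)
    lower_level_control: bool  # "second flag" (hypothetical field name)


def parse_sps(reader: BitReader) -> Sps:
    """Decode the first and second flags from the toy SPS."""
    first = reader.read_flag()
    second = reader.read_flag()
    return Sps(modes_enabled=first, lower_level_control=second)


def parse_slice_header(reader: BitReader, sps: Sps) -> bool:
    """Return whether the coding modes are active for this slice.

    The third flag is read only when the second flag says it is present.
    Assumption: third flag == 1 means the modes stay enabled for the slice.
    """
    if sps.lower_level_control:
        third = reader.read_flag()
        return sps.modes_enabled and third
    # Third flag absent: the sequence-level decision applies unchanged.
    return sps.modes_enabled
```

Under these assumptions, the bits `1 1 0` decode to an SPS that enables the modes and allows slice-level control, followed by a slice header that switches the modes off for that slice. With the bits `1 0`, no third flag is read and the modes remain enabled.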

Description

Computer-implemented encoding and decoding method and non-transitory computer-readable storage medium

Cross Reference to Related Applications

The present disclosure claims priority from U.S. Provisional Application No. 62/899,169, filed on September 12, 2019, which is incorporated herein by reference in its entirety.

Background

Video is a set of still images (or "frames") that capture visual information. To reduce storage memory and transmission bandwidth, video may be compressed prior to storage or transmission and then decompressed prior to display. The compression process is often referred to as encoding and the decompression process is often referred to as decoding. There are a variety of video coding formats that use standardized video coding techniques, most commonly based on prediction, transform, quantization, entropy coding, and in-loop filtering. Video coding standards, such as the High Efficiency Video Coding (HEVC/H.265) standard, the Versatile Video Coding (VVC/H.266) standard, and the AVS standards, specify particular video coding formats and are developed by standardization organizations. As more and more advanced video coding techniques are adopted in video standards, the coding efficiency of new video coding standards continues to increase.

Disclosure of Invention

Embodiments of the present disclosure provide methods and apparatuses for controlling a coding mode for video data.

In one example embodiment, a method includes receiving a bitstream of video data, enabling or disabling a coding mode for a video sequence based on a first flag in the bitstream, and determining whether to enable or disable control of the coding mode at a level below a sequence level based on a second flag in the bitstream.

In another example embodiment, a method includes receiving a bitstream of video data, enabling or disabling a first coding mode for a video sequence based on a first flag in the bitstream, enabling or disabling a second coding mode for the video sequence based on a second flag in the bitstream, and determining whether to enable control of at least one of the first coding mode or the second coding mode at a level below a sequence level based on a third flag in the bitstream.

In another example embodiment, a method includes receiving a video sequence, a first flag, and a second flag, enabling or disabling a coding mode for a video bitstream based on the first flag, and enabling or disabling control of the coding mode at a level below a sequence level based on the second flag.

In another example embodiment, a method includes receiving a video sequence, a first flag, a second flag, and a third flag, enabling or disabling a first coding mode for a video bitstream based on the first flag, enabling or disabling a second coding mode for the video bitstream based on the second flag, and enabling or disabling control of at least one of the first coding mode or the second coding mode at a level below a sequence level based on the third flag.

In another example embodiment, a non-transitory computer-readable medium stores a set of instructions executable by at least one processor of an apparatus to cause the apparatus to perform a method comprising receiving a bitstream of video data, enabling or disabling a coding mode for a video sequence based on a first flag in the bitstream, and determining whether to enable or disable control of the coding mode at a level below a sequence level based on a second flag in the bitstream.
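
The encoder-side embodiment with two coding modes and a shared lower-level control flag can be sketched as follows. This is an illustration under stated assumptions, not the disclosed encoder. The toy bitstream contains only the flags. `BitWriter`, `write_sps_flags`, and `write_slice_flag` are hypothetical names. A single slice-level flag is assumed to govern both modes together, and it is assumed to be written whenever the control flag is set.

```python
class BitWriter:
    """Collects single-bit flags and packs them MSB-first into bytes."""

    def __init__(self) -> None:
        self.bits = []

    def write_flag(self, value: bool) -> None:
        self.bits.append(1 if value else 0)

    def to_bytes(self) -> bytes:
        # Pad with zero bits up to the next byte boundary.
        padded = self.bits + [0] * (-len(self.bits) % 8)
        out = bytearray()
        for i in range(0, len(padded), 8):
            byte = 0
            for bit in padded[i:i + 8]:
                byte = (byte << 1) | bit
            out.append(byte)
        return bytes(out)


def write_sps_flags(writer: BitWriter, mode_a_enabled: bool,
                    mode_b_enabled: bool, lower_level_control: bool) -> None:
    """Write the first, second, and third flags at the sequence level."""
    writer.write_flag(mode_a_enabled)       # first flag: first coding mode
    writer.write_flag(mode_b_enabled)       # second flag: second coding mode
    writer.write_flag(lower_level_control)  # third flag: control below sequence level


def write_slice_flag(writer: BitWriter, lower_level_control: bool,
                     slice_modes_enabled: bool) -> bool:
    """Write the slice-level flag only when lower-level control is on.

    Returns True if a flag was written (assumed behaviour, see lead-in).
    """
    if lower_level_control:
        writer.write_flag(slice_modes_enabled)
        return True
    return False
```

When lower-level control is off, the slice header carries no extra bit, so a sequence that never needs per-slice switching pays no signaling overhead for it.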

In another example embodiment, a non-transitory computer-readable medium stores a set of instructions executable by at least one processor of an apparatus to cause the apparatus to perform a method comprising receiving a bitstream of video data, enabling or disabling a first coding mode for a video sequence based on a first flag in the bitstream, enabling or disabling a second coding mode for the video sequence based on a second flag in the bitstream, and determining whether control of at least one of the first coding mode or the second coding mode is enabled at a level below a sequence level based on a third flag in the bitstream.

In another embodiment, an apparatus includes a memory configured to store a set of instructions and one or more processors communicatively coupled to the memory, the one or more processors configured to execute the set of instructions to cause the apparatus to receive a bitstream of video data, enable or disable a coding mode for a video sequence based on a first flag in the bitstream, and determine whether to enable or disable control of the coding mode at a level below a sequence level based on a second flag in the bitstream.

In another embodiment, an apparatus includes a memory configured to store a set of instructions and one or more processors communicatively coupled to the memory, the one or more processors configured to execute the set of instructions to cause the apparatus to receive a bitstream of video data, enable or disable a first encoding mode