Search

BR-122025008972-A2 - SYSTEM, METHOD FOR DECODING A COMPUTER-READABLE VIDEO AND MEDIA

BR122025008972A2BR 122025008972 A2BR122025008972 A2BR 122025008972A2BR-122025008972-A2

Abstract

In some embodiments, a video decoder decodes a video from a video bitstream. The video decoder extracts, from the video bitstream, an additional bit count M indicating an amount of additional bits of general constraint information (GCI) included in the video bitstream. The additional bits include designaling bits that indicate the respective additional encoding tools to be restricted for the video, and an expected value of the additional bit count is 0 or 6. The decoder extracts M-6 bits that follow six signaling bits in the bitstream in response to the determination that the extracted additional bit count M is greater than 6. The decoder further decodes the remaining portion of the bitstream into images independent of the extracted M-6 bits and based, at least in part, on the constraints specified for the respective additional encoding tools by the six signaling bits.

Inventors

  • JONATHAN GAN
  • Yue Yu
  • Haoping Yu

Assignees

  • GUANGDONG OPPO MOBILE TELECOMMUNICATIONS CORP., LTD

Dates

Publication Date
20260310
Application Date
20221108
Priority Date
20220110

Claims (11)

  1. 1. A SYSTEM characterized by comprising: a memory for storing instructions; and a processor configured to execute instructions to perform operations of: decoding a general constraints information (GCI) flag from a video bitstream; in response to the determination that the value of the GCI flag is equal to 1, decoding an additional bit count M indicating a quantity of additional GCI bits comprised in the video bitstream, where an expected value of the additional bit count M is 0 or 6; in response to the determination that a value of the additional bit count M is equal to 6, decoding, from the video bitstream, six flag bits representing six respective flags indicating six respective additional encoding tools to be restricted for the video; and decoding a remaining portion of the video bitstream into images based, at least in part, on the restrictions for the six additional encoding tools indicated by the six flags.
  2. 2. SYSTEM, according to claim 1, characterized in that in response to the determination that the value of the GCI flag is equal to 1 and the determination that the value of the additional bit count M is equal to 0, the values of the six flags are inferred as equal to a specific value.
  3. 3. SYSTEM, according to claim 2, characterized in that in response to the determination that the value of the GCI flag is equal to 1 and the determination that the value of the additional bit count M is equal to 0, the values of the six flags are inferred as equal to 0.
  4. 4. SYSTEM, according to claim 1, characterized by the six flags comprising: gci_all_rap_pictures_constraint_flag, wherein gci_all_rap_pictures_constraint_flag being equal to 1 specifies that all pictures in a set of output layers in the OlsInScope scope are Gradual Refresh Decoder (GDR) pictures with ph_recovery_poc_cnt equal to 0, or Intra-Random Access Point (IRAP) pictures; gci_no_extended_precision_processing_constraint_flag, wherein gci_no_extended_precision_processing_constraint_flag being equal to 1 specifies that sps_extended_precision_flag for all pictures in OlsInScope is equal to 0; gci_no_ts_residual_coding_rice_constraint_flag, wherein gci_no_ts_residual_coding_rice_constraint_flag being equal to 1 specifies that sps_ts_residual_coding_rice_present_in_sh_flag for all figures in OlsInScope is equal to 0; gci_no_rrc_rice_extension_constraint_flag, where gci_no_rrc_rice_extension_constraint_flag being equal to 1 specifies that sps_rrc_rice_extension_flag for all figures in OlsInScope is equal to 0; gci_no_persistent_rice_adaptation_constraint_flag, where gci_no_persistent_rice_adaptation_constraint_flag being equal to 1 specifies that sps_persistent_rice_adaptation_enabled_flag for all figures in OlsInScope is equal to 0; egci_no_reverse_last_sig_coeff_constraint_flag, where gci_no_reverse_last_sig_coeff_constraint_flag being equal to 1 specifies that sps_reverse_last_sig_coeff_enabled_flag for all configurations in OlsInScope is equal to 0.
  5. 5. SYSTEM, according to claim 4, characterized by decoding the remaining portion of the video bitstream into images based, at least in part, on the constraints for the six additional encoding tools indicated by the six flags, comprising one or more of the following: determining that all figurations in one or more sets of output layers are GDR figurations with ph_recovery_poc_cnt equal to 0, or IRAP figurations, based on gci_all_rap_pictures_constraint_flag being one, and decoding the GDR figurations or IRAP figurations in one or more sets of output layers; determining that an extended transform precision is constrained based on gci_no_extended_precision_processing_constraint_flag being one and decoding the remaining portion of the video bitstream by setting sps_extended_precision_flag for figurations in OlsInScope equal to 0 so that the extended dynamic range is not used; determine that an explicit Rice parameter signaling is constrained based on the fact that gci_no_ts_residual_coding_rice_constraint_flag is one, and decode the remaining portion of the video bitstream by disabling the alternative Rice parameter signaling for configurations in OlsInScope; determine that an alternative Rice parameter derivation for binarization of video quantization residues is constrained based on the fact that degci_no_rrc_rice_extension_constraint_flag is one, and decode the remaining portion of the video bitstream by disabling the alternative Rice parameter signaling for configurations in OlsInScope; determine that an initialization of the Rice parameter derivation for binarization based on the previous transform unit state is constrained based on the determination that degci_no_persistent_rice_adaptation_constraint_flag is one, and decode the remaining portion of the stream of video bits without initializing the Rice parameter based on the previous transform unit state for configurations in OlsInScope; or determine that the coordinates of a last significant coefficient are encoded relative to the upper left corner for each transform block of a slice based on the gci_no_reverse_last_sig_coeff_constraint_flag being one and decode the remaining portion of the video bitstream by interpreting the decoded coordinates of the last significant coefficient as being relative to the upper left corner for each transform block of the slice.
  6. 6. METHOD FOR DECODING A VIDEO, wherein the method is characterized by comprising: decoding a general constraint information (GCI) flag from a video bitstream; in response to the determination that the value of the GCI flag is equal to 1, decoding an additional bit count M indicating a quantity of additional GCI bits comprised in the video bitstream, wherein an expected value of the additional bit count M is 0 or 6; in response to the determination that a value of the additional bit count M is equal to 6, decoding, from the video bitstream, six flag bits representing six respective flags indicating six respective additional coding tools to be restricted for the video; and decoding a remaining portion of the video bitstream into images based, at least in part, on the constraints for the six additional coding tools indicated by the six flags.
  7. 7. METHOD, according to claim 6, characterized in that in response to the determination that the value of the GCI flag is equal to 1 and the determination that the value of the additional bit count M is equal to 0, the values of the six flags are inferred as equal to a specific value.
  8. 8. METHOD, according to claim 7, characterized in that in response to the determination that the value of the GCI flag is equal to 1 and the determination that the value of the additional bit count M is equal to 0, the values of the six flags are inferred as equal to 0.
  9. 9. METHOD, according to claim 6, characterized in that the six flags comprise: gci_all_rap_pictures_constraint_flag, wherein gci_all_rap_pictures_constraint_flag being equal to 1 specifies that all pictures in a set of output layers in the OlsInScope scope are either Gradual Refresh Decoder (GDR) pictures with ph_recovery_poc_cnt equal to 0, or Intra-Random Access Point (IRAP) pictures; gci_no_extended_precision_processing_constraint_flag, wherein gci_no_extended_precision_processing_constraint_flag being equal to 1 specifies that sps_extended_precision_flag for all pictures in OlsInScope is equal to 0; gci_no_ts_residual_coding_rice_constraint_flag, wherein gci_no_ts_residual_coding_rice_constraint_flag being equal to 1 specifies that sps_ts_residual_coding_rice_present_in_sh_flag for all figures in OlsInScope is equal to 0; gci_no_rrc_rice_extension_constraint_flag being equal to 1 specifies that sps_rrc_rice_extension_flag for all figures in OlsInScope is equal to 0; gci_no_persistent_rice_adaptation_constraint_flag being equal to 1 specifies that sps_persistent_rice_adaptation_constraint_flag for all figures in OlsInScope is equal to 0; and gci_no_reverse_last_sig_coeff_constraint_flag, where gci_no_reverse_last_sig_coeff_constraint_flag being equal to 1 specifies that sps_reverse_last_sig_coeff_enabled_flag for all configurations in OlsInScope is equal to 0.
  10. 10. METHOD, according to claim 9, characterized by decoding the remaining portion of the video bitstream into the images based, at least in part, on the constraints for the six additional encoding tools indicated by the six flags, comprising one or more of the following: determining that all figurings in one or more sets of output layers are GDR figurings with ph_recovery_poc_cnt equal to 0, or IRAP figurings, based on gci_all_rap_pictures_constraint_flag being one, and decoding the GDR figurings or IRAP figurings in one or more sets of output layers; determining that an extended transform precision is constrained based on gci_no_extended_precision_processing_constraint_flag being one and decoding the remaining portion of the video bitstream by setting sps_extended_precision_flag for figurings in OlsInScope equal to 0 so that the extended dynamic range is not used; determine that an explicit Rice parameter flag is constrained based on the fact that gci_no_ts_residual_coding_rice_constraint_flag is one, and decode the remaining portion of the video bitstream by disabling the alternative Rice parameter flag for configurations in OlsInScope; determine that an alternative Rice parameter derivation for binarizing video quantization residues is constrained based on the fact that degci_no_rrc_rice_extension_constraint_flag is one, and decode the remaining portion of the video bitstream by disabling the alternative Rice parameter flag for figures in OlsInScope; determine that an initialization of the Rice parameter derivation for binarization based on the previous transform unit state is constrained based on the determination that degci_no_persistent_rice_adaptation_constraint_flag is one, and decode the remaining portion of the video bitstream without initializing the Rice parameter based on the previous transform unit state for figures in OlsInScope; To determine that the coordinates of a last significant coefficient are encoded relative to the upper left corner for each transform block of a slice based on the gci_no_reverse_last_sig_coeff_constraint_flag being one, and to decode the remaining portion of the video bitstream by interpreting the decoded coordinates of the last significant coefficient as being relative to the upper left corner for each transform block of the slice.
  11. 11. COMPUTER-READABLE STORAGE MEDIA characterized by storing a bit stream, wherein the bit stream is decoded according to the method as defined in any one of claims 6 to 10.

Description

This patent application consists of a split application of the invention patent application BR 112024013663-9, dated 08/11/2022. CROSS-REFERENCE ON RELATED REQUEST [001] This application claims priority over Provisional Application No. U.S. 63/266,615, entitled "Signaling Methods for General Constraints Information for Video Coding," filed January 10, 2022, Provisional Application No. U.S. 63/266,616, entitled "Initialization Method for General Constraint Information Flags for Video Coding," filed January 10, 2022, and Provisional Application No. U.S. 63/266,765, entitled "Signaling and Initialization Methods for General Constraints Information for Video Coding," filed January 13, 2022, all of which are incorporated herein by reference. FIELD OF TECHNIQUE [002] This disclosure relates generally to video processing. Specifically, this disclosure involves signaling and initialization for general restrictions information for video encoding. BACKGROUND [003] The ubiquitous camera-enabled devices, such as smartphones, tablets, and computers, have made capturing videos or images easier than ever. However, the amount of data, even for a short video, can be substantially large. Video encoding technology (including video encoding and decoding) allows video data to be compressed into smaller sizes, thus enabling multiple videos to be stored and transmitted. Video encoding has been used in a wide range of applications, such as digital TV broadcasting, video streaming over the Internet and mobile networks, real-time applications (e.g., video chat, video conferencing), DVD and Blu-ray discs, and so on. To reduce the storage space required to store a video and/or the network bandwidth consumption for transmitting a video, it is desirable to improve the efficiency of the video encoding scheme. SUMMARY [004] Some embodiments involve signaling and initialization for general constraint information for video encoding. In one example, a method for decoding a video includes extracting, from a video bitstream, an additional bit count M indicating a quantity of bits of additional general constraint information (GCI) included in the video bitstream, wherein the additional bits comprise signaling bits indicating the respective additional encoding tools to be constrained for the video, and wherein an expected value of the additional bit count is 0 or 6; in response to the determination that the extracted additional bit count M is greater than 6, extracting M-6 bits that follow six signaling bits in the bitstream; and decoding a remaining portion of the video bitstream into images independent of the extracted M-6 bits and based, at least in part, on the constraints specified for the respective additional encoding tools by the six signaling bits. [005] In another example, a non-transient computer-readable medium has program code stored on it, and the program code is executable by one or more processing devices to perform operations. The operations include extracting, from a video bitstream, an additional bit count M indicating a quantity of additional bits of general constraint information (GCI) included in the video bitstream, wherein the additional bits comprise signaling bits indicating the respective additional encoding tools to be restricted for the video, and wherein an expected value of the additional bit count is 0 or 6; in response to the determination that the extracted additional bit count M is greater than 6, extracting M-6 bits that follow six signaling bits in the bitstream; and decoding a remaining portion of the video bitstream into images independent of the extracted M-6 bits and based, at least in part, on the constraints specified for the respective additional encoding tools by the six signaling bits. [006] In yet another example, a system included a processing device and a computer-readable non-transient medium communicatively coupled to the processing device. The processing device is configured to execute program code stored on the computer-readable non-transient medium and thus perform operations. The operations include extracting, from a bitstream of a video, an additional bit count M indicating a quantity of additional general constraint information (GCI) bits included in the video bitstream, wherein the additional bits comprise signaling bits indicating the respective additional encoding tools to be restricted for the video, and wherein an expected value of the additional bit count is 0 or 6; in response to the determination that the extracted additional bit count M is greater than 6, extracting M-6 bits that follow six signaling bits in the bitstream; and decoding a remaining portion of the video bitstream into images independent of the extracted M-6 bits and based, at least in part, on the constraints specified for the respective additional encoding tools by the six signaling bits. [007] These illustrative forms are mentioned not to limit or define the disclosure, but to provide examples to aid understanding