US-12621491-B2 - Decoding parameter sets in video coding

Abstract

Devices, systems and methods for specifying the syntax and semantics of the decoding parameter set (DPS) are described. One example method of video processing includes performing a conversion between a video and a bitstream of the video, wherein the bitstream conforms to a format rule specifying a data structure is in a first access unit (AU) of the bitstream, wherein the first AU is located at a start of the bitstream, wherein the data structure is a type of network abstraction layer (NAL) unit.

Inventors

Ye-Kui Wang

Assignees

BYTEDANCE INC.

Dates

Publication Date: 2026-05-05
Application Date: 2022-06-24

Claims (16)

1. A method of video processing, comprising: performing a conversion between a video and a bitstream of the video, wherein the bitstream conforms to a format rule, wherein the format rule specifies (a) that a plurality of network abstraction layer (NAL) units related to decoding parameters that indicate decoding capability information are included in the bitstream, and all of the plurality of NAL units related to the decoding parameters that indicate the decoding capability information have a same content, and (b) that a NAL unit related to the decoding parameters that indicate the decoding capability information is in a first access unit (AU) of the bitstream, and wherein the first AU is located at a start of the bitstream, wherein the format rule further specifies that a plurality of profile-tier-level (PTL) syntax structures is included in a raw byte sequence payload of a NAL unit comprising the decoding capability information, and wherein a syntax element profile_tier_level, representing a PTL syntax structure, has an input parameter, and wherein the input parameter is specified equal to zero indicative of an exclusion of sublayer level information.
2. The method of claim 1, wherein the NAL unit related to the decoding parameters that indicate the decoding capability information is a decoding capability information NAL unit.
3. The method of claim 1, wherein the NAL unit related to the decoding parameters that indicate the decoding capability information is a supplemental enhancement information (SEI) NAL unit comprising a decoding capability information SEI message.
4. The method of claim 1, wherein the format rule further specifies that the decoding capability information excludes a first syntax element indicative of an identifier of the decoding capability information due to the decoding capability information being included in the NAL unit comprising the decoding capability information.
5. The method of claim 1, wherein the format rule further specifies that a sequence parameter set (SPS) does not reference the decoding capability information through inclusion of a second syntax element indicative of an identifier of the decoding capability information referenced by the SPS due to the decoding capability information being included in the NAL unit comprising the decoding capability information.
6. The method of claim 1, wherein the conversion comprises decoding the video from the bitstream.
7. The method of claim 1, wherein the conversion comprises encoding the video into the bitstream.
8. An apparatus for processing video data, comprising: a processor; and a non-transitory memory with instructions thereon, wherein the instructions, upon execution by the processor, cause the processor to: perform a conversion between a video and a bitstream of the video, wherein the bitstream conforms to a format rule, wherein the format rule specifies (a) that a plurality of network abstraction layer (NAL) units related to decoding parameters that indicate decoding capability information are included in the bitstream, and all of the plurality of NAL units related to the decoding parameters that indicate the decoding capability information have a same content, and (b) that a NAL unit related to the decoding parameters that indicate the decoding capability information is in a first access unit (AU) of the bitstream, and wherein the first AU is located at a start of the bitstream, wherein the format rule further specifies that a plurality of profile-tier-level (PTL) syntax structures is included in a raw byte sequence payload of a NAL unit comprising the decoding capability information, and wherein a syntax element profile_tier_level, representing a PTL syntax structure, has an input parameter, and wherein the input parameter is specified equal to zero indicative of an exclusion of sublayer level information.
9. The apparatus of claim 8, wherein the format rule further specifies that the decoding capability information excludes a first syntax element indicative of an identifier of the decoding capability information due to the decoding capability information is included in a NAL unit.
10. The apparatus of claim 8, wherein the format rule further specifies that a sequence parameter set (SPS) does not reference the decoding capability information through inclusion of a second syntax element indicative of an identifier of the decoding capability information referenced by the SPS due to the decoding capability information is included in a NAL unit.
11. A non-transitory computer-readable storage medium storing instructions that cause a processor to: perform a conversion between a video and a bitstream of the video, wherein the bitstream conforms to a format rule, wherein the format rule specifies (a) that a plurality of network abstraction layer (NAL) units related to decoding parameters that indicate decoding capability information are included in the bitstream, and all of the plurality of NAL units related to the decoding parameters that indicate the decoding capability information have a same content, and (b) that a NAL unit related to the decoding parameters that indicate the decoding capability information is in a first access unit (AU) of the bitstream, and wherein the first AU is located at a start of the bitstream, wherein the format rule further specifies that a plurality of profile-tier-level (PTL) syntax structures is included in a raw byte sequence payload of a NAL unit comprising the decoding capability information, and wherein a syntax element profile_tier_level, representing a PTL syntax structure, has an input parameter, and wherein the input parameter is specified equal to zero indicative of an exclusion of sublayer level information.
12. The non-transitory computer-readable storage medium of claim 11, wherein the format rule further specifies that the decoding capability information excludes a first syntax element indicative of an identifier of the decoding capability information due to the decoding capability information is included in a NAL unit.
13. The non-transitory computer-readable storage medium of claim 11, wherein the format rule further specifies that a sequence parameter set (SPS) does not reference the decoding capability information through inclusion of a second syntax element indicative of an identifier of the decoding capability information referenced by the SPS due to the decoding capability information is included in a NAL unit.
14. A method for storing bitstream of a video, comprising: generating the bitstream of the video; and storing the bitstream in a non-transitory computer-readable recording medium, wherein the bitstream conforms to a format rule, wherein the format rule specifies (a) that a plurality of network abstraction layer (NAL) units related to decoding parameters that indicate decoding capability information are included in the bitstream, and all of the plurality of NAL units related to the decoding parameters that indicate the decoding capability information have a same content, and (b) that a NAL unit related to the decoding parameters that indicate the decoding capability information is in a first access unit (AU) of the bitstream, and wherein the first AU is located at a start of the bitstream, wherein the format rule further specifies that a plurality of profile-tier-level (PTL) syntax structures is included in a raw byte sequence payload of a NAL unit comprising the decoding capability information, and wherein a syntax element profile_tier_level, representing a PTL syntax structure, has an input parameter, and wherein the input parameter is specified equal to zero indicative of an exclusion of sublayer level information.
15. The method of claim 14, wherein the format rule further specifies that the decoding capability information excludes a first syntax element indicative of an identifier of the decoding capability information due to the decoding capability information is included in a NAL unit.
16. The method of claim 14, wherein the format rule further specifies that a sequence parameter set (SPS) does not reference the decoding capability information through inclusion of a second syntax element indicative of an identifier of the decoding capability information referenced by the SPS due to the decoding capability information is included in a NAL unit.
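
Parts (a) and (b) of the format rule recited in the independent claims can be pictured as a simple conformance check over a parsed bitstream. The following is a minimal illustrative sketch, not an implementation taken from the patent or from any codec library. The `NalUnit` type, the `"DCI"` type label, and the function `check_dci_format_rule` are hypothetical names introduced here. The sketch also assumes that the bitstream has already been split into access units and that the rule is only evaluated when the bitstream carries at least one NAL unit with decoding capability information.

```python
from dataclasses import dataclass
from typing import List

# Hypothetical label for a NAL unit carrying decoding capability information.
DCI_NAL_TYPE = "DCI"


@dataclass(frozen=True)
class NalUnit:
    """Hypothetical parsed NAL unit: a type label plus its raw payload bytes."""
    nal_type: str
    payload: bytes


def check_dci_format_rule(access_units: List[List[NalUnit]]) -> List[str]:
    """Return a list of rule violations; an empty list means no violation found.

    access_units is the bitstream in decoding order, one list of NAL units
    per access unit (AU); access_units[0] is the AU at the start of the bitstream.
    """
    violations: List[str] = []

    # Collect the payloads of all DCI NAL units in the whole bitstream.
    dci_payloads = [
        nal.payload
        for au in access_units
        for nal in au
        if nal.nal_type == DCI_NAL_TYPE
    ]
    if not dci_payloads:
        # Assumption of this sketch: nothing to check without DCI NAL units.
        return violations

    # Part (a): every DCI NAL unit in the bitstream has the same content.
    if len(set(dci_payloads)) > 1:
        violations.append("DCI NAL units do not all have the same content")

    # Part (b): a DCI NAL unit is present in the first AU of the bitstream.
    if not any(nal.nal_type == DCI_NAL_TYPE for nal in access_units[0]):
        violations.append("no DCI NAL unit in the first access unit")

    return violations


if __name__ == "__main__":
    dci = NalUnit(DCI_NAL_TYPE, b"\x01\x02")
    bitstream = [[dci, NalUnit("SPS", b"a")], [dci, NalUnit("SLICE", b"b")]]
    print(check_dci_format_rule(bitstream))
```

Comparing raw payload bytes is one simple way to model "a same content"; a real checker would operate on the actual NAL unit syntax defined by the applicable coding standard.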

Description

CROSS-REFERENCE TO RELATED APPLICATION

This application is a continuation of International Patent Application No. PCT/US2020/067077, filed on Dec. 26, 2020, which claims the priority to and benefits of U.S. Provisional Patent Application No. 62/953,862 filed on Dec. 26, 2019. All the aforementioned patent applications are hereby incorporated by reference in their entireties.

TECHNICAL FIELD

This document is related to video coding techniques, systems and devices.

BACKGROUND

Digital video accounts for the largest bandwidth use on the internet and other digital communication networks. As the number of connected user devices capable of receiving and displaying video increases, it is expected that the bandwidth demand for digital video usage will continue to grow.

SUMMARY

Devices, systems and methods related to digital video coding, which include specifying the syntax and semantics of the decoding parameter set (DPS), are described. The described methods may be applied to both the existing video coding standards (e.g., High Efficiency Video Coding (HEVC) and/or Versatile Video Coding (VVC)) and future video coding standards or video codecs.

In one representative aspect, the disclosed technology may be used to provide a method for video processing. This method includes performing a conversion between a video and a bitstream of the video, wherein the bitstream conforms to a format rule, wherein the format rule specifies that each of a plurality of decoding parameter set network abstraction layer (NAL) units in the bitstream has a same content, and wherein the decoding parameter set comprises decoding capability information.

In another representative aspect, the disclosed technology may be used to provide a method for video processing. This method includes performing a conversion between a video and a bitstream of the video, wherein the bitstream conforms to a format rule specifying that a decoding parameter set excludes a first syntax element indicative of an identifier of the decoding parameter set, and wherein the decoding parameter set comprises decoding capability information.

In yet another representative aspect, the disclosed technology may be used to provide a method for video processing. This method includes performing a conversion between a video and a bitstream of the video, wherein the bitstream conforms to a format rule specifying a data structure is in a first access unit (AU) of the bitstream, wherein the first AU is located at a start of the bitstream, wherein the data structure is a type of network abstraction layer (NAL) unit.

In yet another representative aspect, the above-described method is embodied in the form of processor-executable code and stored in a computer-readable program medium.

In yet another representative aspect, a device that is configured or operable to perform the above-described method is disclosed. The device may include a processor that is programmed to implement this method.

In yet another representative aspect, a video decoder apparatus may implement a method as described herein.

The above and other aspects and features of the disclosed technology are described in greater detail in the drawings, the description and the claims.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram showing an example video processing system in which various techniques disclosed herein may be implemented.

FIG. 2 is a block diagram of an example hardware platform used for video processing.

FIG. 3 is a block diagram that illustrates an example video coding system that can implement some embodiments of the present disclosure.

FIG. 4 is a block diagram that illustrates an example of an encoder that can implement some embodiments of the present disclosure.

FIG. 5 is a block diagram that illustrates an example of a decoder that can implement some embodiments of the present disclosure.

FIGS. 6-8 show flowcharts for example methods of video processing.

DETAILED DESCRIPTION

Due to the increasing demand of higher resolution video, video coding methods and techniques are ubiquitous in modern technology. Video codecs typically include an electronic circuit or software that compresses or decompresses digital video, and are continually being improved to provide higher coding efficiency. A video codec converts uncompressed video to a compressed format or vice versa. There are complex relationships between the video quality, the amount of data used to represent the video (determined by the bit rate), the complexity of the encoding and decoding algorithms, sensitivity to data losses and errors, ease of editing, random access, and end-to-end delay (latency). The compressed format usually conforms to a standard video compression specification, e.g., the High Efficiency Video Coding (HEVC) standard (also known as H.265 or MPEG-H Part 2), the Versatile Video Coding standard to be finalized, or other current and/or future video coding standards. Embodiments of the disclosed technology may be applied to exist