US-12621493-B2 - Video coding using parameter sets

US12621493B2US 12621493 B2US12621493 B2US 12621493B2US-12621493-B2

Abstract

Methods and apparatus for video processing are described. The processing may include video encoding, video decoding or video transcoding. An example video processing method includes performing a conversion between a video comprising one or more pictures and a bitstream of the video according to a format rule, and wherein the format rule specifies that, in response to satisfying one or more conditions, a syntax element indicating whether a first syntax structure providing a profile, a tier and a level information and a second syntax structure providing decoded picture buffer information are present in a sequence parameter set is set to be equal to 1 to indicate that the first syntax structure and the second syntax structure are present in the sequence parameter set.

Inventors

Ye-Kui Wang
Li Zhang
Kai Zhang

Assignees

BYTEDANCE INC.

Dates

Publication Date: 20260505
Application Date: 20231107

Claims (16)

1 . A method of video processing, comprising: performing a conversion between a video comprising one or more pictures and a bitstream of the video according to a format rule, and wherein the format rule specifies that, in response to satisfying one or more conditions, a syntax element indicating whether a first syntax structure providing level information and a second syntax structure providing decoded picture buffer information are present in a sequence parameter set is set to be equal to 1 to indicate that the first syntax structure and the second syntax structure are present in the sequence parameter set, wherein the one or more conditions are related with a video parameter set identifier referenced by the sequence parameter set.
2 . The method of claim 1 , wherein the one or more conditions are further related with whether there exists an output layer set that contains only one layer having a NAL (network abstraction layer) unit header layer identifier equal to a particular value.
3 . The method of claim 1 , wherein the one or more conditions include 1) the video parameter set identifier referenced by the sequence parameter set is greater than 0 and there exists an output layer set that contains only one layer having a NAL (network abstraction layer) unit header layer identifier equal to a particular value, or 2) the video parameter set identifier is equal to 0.
4 . The method of claim 1 , wherein the syntax element corresponds to sps_ptl_dpb_hrd_params_present_flag, the first syntax structure corresponds to profile_tier_level ( ) syntax structure, and the second syntax structure corresponds to dpb_parameters ( ) syntax structure.
5 . The method of claim 1 , wherein the conversion includes encoding the video into the bitstream.
6 . The method of claim 1 , wherein the conversion includes decoding the video from the bitstream.
7 . An apparatus for processing video data comprising a processor and a non-transitory memory with instructions thereon, wherein the instructions upon execution by the processor, cause the processor to: perform a conversion between a video comprising one or more pictures and a bitstream of the video according to a format rule, and wherein the format rule specifies that, in response to satisfying one or more conditions, a syntax element indicating whether a first syntax structure providing level information and a second syntax structure providing decoded picture buffer information are present in a sequence parameter set is set to be equal to 1 to indicate that the first syntax structure and the second syntax structure are present in the sequence parameter set, wherein the one or more conditions are related with a video parameter set identifier referenced by the sequence parameter set.
8 . The apparatus of claim 7 , wherein the one or more conditions are further related with whether there exists an output layer set that contains only one layer having a NAL (network abstraction layer) unit header layer identifier equal to a particular value.
9 . The apparatus of claim 7 , wherein the one or more conditions include 1) the video parameter set identifier referenced by the sequence parameter set is greater than 0 and there exists an output layer set that contains only one layer having a NAL (network abstraction layer) unit header layer identifier equal to a particular value, or 2) the video parameter set identifier is equal to 0.
10 . The apparatus of claim 7 , wherein the syntax element corresponds to sps_ptl_dpb_hrd_params_present_flag, the first syntax structure corresponds to profile_tier_level ( ) syntax structure, and the second syntax structure corresponds to dpb_parameters ( ) syntax structure.
11 . A non-transitory computer-readable storage medium storing instructions that cause a processor to: perform a conversion between a video comprising one or more pictures and a bitstream of the video according to a format rule, and wherein the format rule specifies that, in response to satisfying one or more conditions, a syntax element indicating whether a first syntax structure providing level information and a second syntax structure providing decoded picture buffer information are present in a sequence parameter set is set to be equal to 1 to indicate that the first syntax structure and the second syntax structure are present in the sequence parameter set, wherein the one or more conditions are related with a video parameter set identifier referenced by the sequence parameter set.
12 . The medium of claim 11 , wherein the one or more conditions include 1) the video parameter set identifier referenced by the sequence parameter set is greater than 0 and there exists an output layer set that contains only one layer having a NAL (network abstraction layer) unit header layer identifier equal to a particular value, or 2) the video parameter set identifier is equal to 0.
13 . The medium of claim 11 , wherein the syntax element corresponds to sps_ptl_dpb_hrd_params_present_flag, the first syntax structure corresponds to profile_tier_level ( ) syntax structure, and the second syntax structure corresponds to dpb_parameters ( ) syntax structure.
14 . A non-transitory computer-readable recording medium storing a bitstream of a video which is generated by a method performed by a video processing apparatus, wherein the method comprises: generating the bitstream of the video comprising one or more pictures and a bitstream of the video according to a format rule, and wherein the format rule specifies that, in response to satisfying one or more conditions, a syntax element indicating whether a first syntax structure providing level information and a second syntax structure providing decoded picture buffer information are present in a sequence parameter set is set to be equal to 1 to indicate that the first syntax structure and the second syntax structure are present in the sequence parameter set, wherein the one or more conditions are related with a video parameter set identifier referenced by the sequence parameter set.
15 . The medium of claim 14 , wherein the one or more conditions include 1) the video parameter set identifier referenced by the sequence parameter set is greater than 0 and there exists an output layer set that contains only one layer having a NAL (network abstraction layer) unit header layer identifier equal to a particular value, or 2) the video parameter set identifier is equal to 0.
16 . The medium of claim 14 , wherein the syntax element corresponds to sps_ptl_dpb_hrd_params_present_flag, the first syntax structure corresponds to profile_tier_level ( ) syntax structure, and the second syntax structure corresponds to dpb_parameters ( ) syntax structure.

Description

CROSS REFERENCE TO RELATED APPLICATIONS This application is a continuation of U.S. patent application Ser. No. 17/957,139 filed on Sep. 30, 2022, which is a continuation of International Patent Application No. PCT/US2021/025351, filed on Apr. 1, 2021 which claims the priority to and benefits of U.S. Application No. 63/006,054, filed on Apr. 6, 2020. All the aforementioned patent applications are hereby incorporated by reference in their entireties. TECHNICAL FIELD This patent document relates to image and video coding and decoding. BACKGROUND Digital video accounts for the largest bandwidth use on the internet and other digital communication networks. As the number of connected user devices capable of receiving and displaying video increases, it is expected that the bandwidth demand for digital video usage will continue to grow. SUMMARY The present document discloses techniques that can be used by video encoders and decoders for processing coded representation of video using control information useful for decoding of the coded representation. In one example aspect, a video processing method is disclosed. The method includes performing a conversion between a video comprising one or more layers comprising one or more video regions and a coded representation of the video according to a format rule, wherein the format rule specifies that one or more syntax elements are included in the coded representation at the one or more video region level corresponding to allowed slice types for a corresponding video region. In another example aspect, another video processing method is disclosed. The method includes performing a conversion between a video comprising one or more layers comprising one or more video pictures comprising one or more video slices and a coded representation of the video according to a format rule, wherein the format rule specifies that syntax elements related to enablement or use of a coding mode at a slice level are included at most once between either a picture header or a slice header according to a second rule. In another example aspect, another video processing method is disclosed. The method includes performing a conversion between a video comprising one or more video pictures comprising one or more video slices and a coded representation of the video according to a format rule, wherein the format rule specifies that allowed slice types in a video picture controls whether a reference picture list is signaled in or generatable from the coded representation. In another example aspect, another video processing method is disclosed. The method includes performing a conversion between a video comprising one or more video pictures comprising one or more subpictures wherein the coded representation conforms to a format rule, wherein the format rule specifies a processing of a noncoded subpicture of a video picture. In another example aspect, another video processing method is disclosed. The method includes performing a conversion between a video comprising one or more pictures and a bitstream of the video according to a format rule, and wherein the format rule specifies that, in response to satisfying one or more conditions, a syntax element indicating whether a first syntax structure providing a profile, a tier and a level information and a second syntax structure providing decoded picture buffer information are present in a sequence parameter set is set to be equal to 1 to indicate that the first syntax structure and the second syntax structure are present in the sequence parameter set. In another example aspect, another video processing method is disclosed. The method includes performing a conversion between a video and a bitstream of the video including one or more coded layers according to a format rule, and wherein the format rule specifies that one or more syntax elements that indicate allowed slice types in a picture of a coded layer video sequence are included in one or more parameter sets and/or a general constraint information syntax structure. In another example aspect, another video processing method is disclosed. The method includes performing a conversion between a video comprising one or more layers comprising one or more pictures comprising one or more slices and a bitstream of the video according to a format rule, and wherein the format rule specifies that a syntax element is included in a picture header or a slice header to indicate whether bidirectionally predicted (B) slices are allowed or used for a corresponding picture or slice of the video. In another example aspect, another video processing method is disclosed. The method includes performing a conversion between a video comprising one or more layers comprising one or more pictures comprising one or more slices and a bitstream of the video according to a format rule, and wherein the format rule specifies that one or more syntax elements related to enablement or use of a coding mode at a slice level are included at most once