JP-7856700-B2 - Video decoding method, video encoding method, apparatus, device, and storage medium

JP7856700B2JP 7856700 B2JP7856700 B2JP 7856700B2JP-7856700-B2

Inventors

▲張▼ 洪彬
李翔
李 ▲貴▼春
▲劉▼ 杉

Assignees

▲騰▼▲訊▼科技（深▲セン▼）有限公司

Dates

Publication Date: 20260511
Application Date: 20240725
Priority Date: 20191218

Claims (15)

A video decoding method, wherein the method is A step of obtaining a first parameter set corresponding to a video frame to be decoded, wherein the first parameter set includes syntax elements related to a quantization matrix QM, the QM has 28 identifier variables, and the 28 identifier variables are associated with different QM sizes and different QM types . A step of determining a valid QM based on syntax elements included in the first parameter set, wherein the syntax elements are used to indicate whether or not it is necessary to decode a color difference QM, the valid QM refers to the QM actually used when performing dequantization on quantized conversion coefficients in the decoding process of the video frame to be decoded , the step of determining a valid QM based on syntax elements included in the first parameter set, the range of the identifier variable of the QM, and the QM belonging to the range of the identifier variable of the QM as the valid QM , The step includes decoding the valid QM, The value of the syntax element is determined based on the sampling rate of the color difference component relative to the luminance component. A video decoding method in which, when the sampling rate of the chromatic difference component with respect to the luminance component is 4:0:0, the value of the syntax element is 0, indicating that there is no need to decode the chromatic difference QM, and all elements of the chromatic difference QM are set to 16 .
The step of determining a valid QM based on the syntax elements included in the first parameter set is: The steps include reading the value of the flag syntax element corresponding to the first QM from the first parameter set, If the value of the flag syntax element corresponding to the first QM is a first numerical value, then it is determined that the first QM belongs to the valid QM. The method according to claim 1, comprising the step of determining that if the value of the flag syntax element corresponding to the first QM is a second numerical value, then the first QM does not belong to the valid QM.
The method according to claim 2, wherein the first and second chromatic difference QMs having the same prediction mode and the same size share the same flag syntax elements.
The method according to claim 1, wherein the first parameter set is a self-adaptive parameter set APS.
The method according to claim 1, wherein all elements of other QMs that do not belong to the valid QMs are defined in advance as default values.
The method according to claim 5, wherein the default value is 16.
A video encoding method, wherein the method is A step of generating an encoded video bitstream corresponding to a first parameter set corresponding to a video frame to be encoded, wherein the first parameter set includes syntax elements associated with a quantization matrix QM, the QM has 28 identifier variables, and the 28 identifier variables are associated with different QM sizes and different QM types. A step of determining a valid quantization matrix QM based on the syntax elements included in the first parameter set, wherein the syntax elements are used to indicate whether or not a color difference QM needs to be encoded, the valid QM refers to the QM actually used when performing dequantization in encoding the video frame to be encoded with respect to the quantization conversion coefficients, the range of the identifier variable of the QM is determined based on the syntax elements included in the first parameter set, and the QM within the range of the identifier variable of the QM is determined as the valid QM. The step of encoding the valid QM is included, The value of the syntax element is determined based on the sampling rate of the color difference component relative to the luminance component. A video encoding method in which, when the sampling rate of the chromatic difference component with respect to the luminance component is 4:0:0, the value of the syntax element is 0, indicating that there is no need to decode the chromatic difference QM, and all elements of the chromatic difference QM are set to 16.
A method for generating and storing an encoded video bitstream, wherein the method is: A step of generating an encoded video bitstream corresponding to a first parameter set corresponding to a video frame to be encoded, wherein the first parameter set includes syntax elements associated with a quantization matrix QM, the QM has 28 identifier variables, and the 28 identifier variables are associated with different QM sizes and different QM types. A step of determining a valid quantization matrix QM based on the syntax elements included in the first parameter set, wherein the syntax elements are used to indicate whether or not a color difference QM needs to be encoded, the valid QM refers to the QM actually used when performing dequantization in encoding the video frame to be encoded with respect to the quantization conversion coefficients, the range of the identifier variable of the QM is determined based on the syntax elements included in the first parameter set, and the QM within the range of the identifier variable of the QM is determined as the valid QM. The steps include: encoding into the valid QM, The step includes storing the generated encoded video bitstream, The value of the syntax element is determined based on the sampling rate of the color difference component relative to the luminance component. A method for setting all elements of the color difference QM to 16, where the sampling rate of the color difference component with respect to the luminance component is 4:0:0, the value of the syntax element is 0, indicating that there is no need to decode the color difference QM.
A video decoding device, the device including a parameter acquisition module, a QM determination module, and a QM decoding module, The parameter acquisition module is used to acquire a first parameter set corresponding to the video frame to be decoded, the first parameter set includes syntax elements related to the quantization matrix QM, the QM has 28 identifier variables, the 28 identifier variables are associated with different QM sizes and different QM types, The QM determination module is used to determine a valid QM based on syntax elements included in the first parameter set, the syntax elements are used to indicate whether or not it is necessary to decode the color difference QM, the valid QM refers to the QM actually used when performing inverse quantization on the quantized conversion coefficients in the decoding process of the video frame to be decoded, the range of the QM identifier variable is determined based on the elements included in the first parameter set, and the QM belonging to the range of the QM identifier variable is determined as the valid QM. The aforementioned QM decoding module is used to perform decoding on the valid QM. The value of the syntax element is determined based on the sampling rate of the color difference component relative to the luminance component. A video decoding device in which, when the sampling rate of the chromatic difference component with respect to the luminance component is 4:0:0, the value of the syntax element is 0, indicating that there is no need to decode the chromatic difference QM, and all elements of the chromatic difference QM are set to 16.
A computer device comprising a processor and memory, wherein at least one program, code set or instruction set is stored in the memory, and the at least one program, code set or instruction set is loaded and executed by the processor to realize the method according to any one of claims 1 to 6.
A computer device comprising a processor and memory, wherein at least one program, code set or instruction set is stored in the memory, and the at least one program, code set or instruction set is loaded and executed by the processor to realize the method according to claim 7.
A computer device comprising a processor and memory, wherein at least one program, code set or instruction set is stored in the memory, and the at least one program, code set or instruction set is loaded and executed by the processor to realize the method according to claim 8.
A computer program, when executed by a processor, causes the processor to perform the method according to any one of claims 1 to 6.
A computer program, when executed by a processor, causes the processor to perform the method according to claim 7.
A computer program, when executed by a processor, causes the processor to perform the method according to claim 8.

Description

The embodiments of this application relate to the technical field of video encoding and decoding, and more particularly to video decoding methods, video encoding methods, apparatus, devices, and storage media. This application claims priority from the Chinese patent application filed on December 18, 2019, with application number 201911309768.6, whose title is "Video Decoding Method, Apparatus, Device, and Storage Medium," and whose entire contents are incorporated into this application by reference. H.266 is a next-generation video coding technology that is an improvement on H.265/HEVC (High-Efficient Video Coding). It has already been officially named VVC (Versatile Video Coding) and is continuously being updated and improved under the guidance of the JVET (Joint Video Experts Team) organization. At the 14th JVET meeting, it was decided that frequency-related scaling can be supported in VVC by using two types of quantization matrices: a default quantization matrix and a user-defined quantization matrix. When a quantization matrix is enabled, individual quantization can be performed on the transformation coefficients in the Transform Block (TB) based on the quantization coefficients (i.e., integer weighting values) contained in the quantization matrix. Currently, the decoding method for the quantization matrix used in VVC involves a relatively high level of computational complexity on the decoder side. This is a schematic diagram of video coding as illustrated in the present invention.This is a simplified block diagram of a communication system provided by one embodiment of the present invention.This is a schematic diagram illustrating the arrangement of a video encoder and video decoder in a streaming environment, as exemplified in this application.This is a schematic diagram of coding under interframe prediction mode provided by one embodiment of the present invention.This is a schematic diagram of coding under an in-frame prediction mode provided by one embodiment of the present invention.This is a schematic diagram of a functional module of a video encoder provided in one embodiment of the present invention.This is a schematic diagram of a video decoder function module provided in one embodiment of the present invention.This is a schematic diagram illustrating the generation of a QM by downsampling copy provided in one embodiment of the present invention.This is a schematic diagram of the diagonal scanning sequence provided by one embodiment of the present application.This is a flowchart of a video decoding method provided in one embodiment of the present invention.This is a flowchart of a video encoding method provided in one embodiment of the present invention.This is a block diagram of a video decoding device provided in one embodiment of the present invention.This is a block diagram of a video decoding device provided in another embodiment of the present application.This is a block diagram of a video encoding device provided in one embodiment of the present invention.This is a structural block diagram of a computer device provided by one embodiment of the present invention. To further clarify the purpose, technical means, and advantages of this application, embodiments of this application will be described in more detail below, with reference to the drawings. As shown in Figure 1, the current block 101 contains samples already detected by the encoder during the motion detection process, and these samples can be predicted based on previous blocks of the same size where a spatial offset has occurred. Furthermore, the Motion Vector (MV) can be derived from metadata associated with one or more reference pictures, rather than directly encoding the MV. For example, the MV associated with one of the five surrounding samples A0, A1, B0, B1, and B2 (corresponding to 102-106, respectively) is used, and the MV is derived from the metadata of the nearest reference picture (depending on the decoding order). As shown in Figure 2, it illustrates a simplified block diagram of a communication system provided by one embodiment of the present invention. The communication system 200 includes a plurality of devices that can communicate with each other, for example, via a network 250. For example, the communication system 200 includes a first device 210 and a second device 220 interconnected by the network 250. In the embodiment of Figure 2, the first device 210 and the second device 220 perform one-way data transmission. For example, the first device 210 can encode video data, for example, a video picture stream collected by the first device 210, and transmit it to the second device 220 via the network 250. The encoded video data is transmitted in the form of one or more encoded video code streams. The second device 220 receives the encoded video data from the network 250, decodes the encoded video data to recover the video data, and can display a video picture based on the recovered video data. One-way data transmissio