EP-4736434-A1 - TEMPORAL PREDICTION FOR PARTITIONING PARAMETERS

Abstract

Video coding may be performed using enhanced temporal prediction for partitioning parameters. Coding information (e.g., the motion vector difference (MVD) of the current coding unit (CU)) and/or more temporal areas from previously coded frames may be used. For example, a device for video decoding and/or encoding may obtain a plurality of collocated pictures of a current picture. The current picture may include a current video block. The device may identify, for the current video block, a plurality of collocated coding blocks in the plurality of collocated pictures. The device may predict a partition parameter associated with the current video block based on partition information associated with the plurality of collocated coding blocks. The device may decode and/or encode the current video block based on the predicted partition parameter.
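The flow summarized in the abstract (obtain collocated pictures, look up the collocated blocks, derive a partition parameter from them) can be illustrated with a minimal sketch. Everything named here is hypothetical and not taken from the application: the `Picture` container, the choice of a per-block split depth as the "partition information", and the rule of allowing at most `margin` levels beyond the deepest collocated block are all assumptions made only for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass(frozen=True)
class Picture:
    """Hypothetical picture record: split depth stored per block origin."""
    poc: int                                # picture order count
    depth_at: Dict[Tuple[int, int], int]    # (x, y) block origin -> split depth


def collocated_depths(pictures: List[Picture], x: int, y: int) -> List[int]:
    """Collect the depth of the block at (x, y) in each collocated picture."""
    return [p.depth_at[(x, y)] for p in pictures if (x, y) in p.depth_at]


def predict_max_depth(pictures: List[Picture], x: int, y: int,
                      margin: int = 1) -> Optional[int]:
    """Assumed rule: cap the current block's split depth at the deepest
    collocated depth plus `margin`. Returns None when no collocated block
    exists, meaning no temporal prediction is made."""
    depths = collocated_depths(pictures, x, y)
    if not depths:
        return None
    return max(depths) + margin


# Usage: two collocated pictures, as might be available for a bi-predicted picture.
past = Picture(poc=0, depth_at={(0, 0): 2})
future = Picture(poc=8, depth_at={(0, 0): 3})
limit = predict_max_depth([past, future], 0, 0)
```

A codec could use such a predicted limit to skip testing or signalling deeper splits; whether the application combines the collocated blocks this way is not stated in the abstract.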

Inventors

  • CHEN, Ya
  • LE LEANNEC, Fabrice
  • NASER, Karam
  • RADOSAVLJEVIC, Milos

Assignees

  • InterDigital CE Patent Holdings, SAS

Dates

Publication Date
2026-05-06
Application Date
2024-06-07

Claims (1)

  IDVC_ 2023P00543WO PATENT

  1. A device for video decoding comprising a processor, wherein the processor is configured to: obtain a plurality of collocated pictures of a current picture, the current picture comprising a current video block; identify, for the current video block, a plurality of collocated coding blocks in the plurality of collocated pictures; predict a partition parameter associated with the current video block based on partition information associated with the plurality of collocated coding blocks; and decode the current video block based on the predicted partition parameter.

  2. A device for video encoding comprising a processor, wherein the processor is configured to: obtain a plurality of collocated pictures of a current picture, the current picture comprising a current video block; identify, for the current video block, a plurality of collocated coding blocks in the plurality of collocated pictures; predict a partition parameter associated with the current video block based on partition information associated with the plurality of collocated coding blocks; and encode the current video block based on the predicted partition parameter.

  3. The device of claim 1 or 2, wherein the plurality of collocated pictures are obtained based on the current picture being a bi-predicted picture.

  4. The device of any one of claims 1-3, wherein the plurality of collocated pictures includes at least one reference picture of the current picture.

  5. The device of any one of claims 1-4, wherein the partition information comprises a temporal depth associated with a first collocated coding block and a temporal depth associated with a second collocated coding block in the plurality of collocated coding blocks, and wherein the processor is further configured to: determine a difference between the temporal depth associated with the first collocated coding block and the temporal depth associated with the second collocated coding block; compare the difference between the temporal depth associated with the first collocated coding block and the temporal depth associated with the second collocated coding block to a threshold; and based on the comparison, determine whether to refrain from predicting the partition parameter, wherein prediction of the partition parameter associated with the current video block is performed based on the difference being below the threshold.

  6. The device of any one of claims 1-4, wherein the processor is further configured to: obtain a motion vector difference for the current video block; compare the motion vector difference to a threshold; and based on the comparison, determine whether to refrain from predicting the partition parameter, wherein prediction of the partition parameter associated with the current video block is performed based on the motion vector difference being below the threshold.

  7. The device of any one of claims 1-4, wherein the processor is further configured to: obtain a motion vector difference for the current video block, wherein prediction of the partition parameter associated with the current video block is performed based on the partition information associated with the plurality of collocated coding blocks and the motion vector difference for the current video block.

  8. The device of any one of claims 1-7, wherein prediction of the partition parameter associated with the current video block is performed further based on at least one of a block partition, an intra sub-partition, a subblock transform, or a geometric partition mode.

  9. The device of any one of claims 1-8, wherein prediction of the partition parameter associated with the current video block is performed based on the partition information associated with the plurality of collocated coding blocks and at least one of a quantization parameter, a block size, or a color component of the current video block.

  10. A method for video decoding, the method comprising: obtaining a plurality of collocated pictures of a current picture, the current picture comprising a current video block; identifying, for the current video block, a plurality of collocated coding blocks in the plurality of collocated pictures; predicting a partition parameter associated with the current video block based on partition information associated with the plurality of collocated coding blocks; and decoding the current video block based on the predicted partition parameter.

  11. A method for video encoding, the method comprising: obtaining a plurality of collocated pictures of a current picture, the current picture comprising a current video block; identifying, for the current video block, a plurality of collocated coding blocks in the plurality of collocated pictures; predicting a partition parameter associated with the current video block based on partition information associated with the plurality of collocated coding blocks; and encoding the current video block based on the predicted partition parameter.

  12. The method of claim 10 or 11, wherein the plurality of collocated pictures are obtained based on the current picture being a bi-predicted picture.

  13. The method of any one of claims 10-12, wherein the plurality of collocated pictures includes at least one reference picture of the current picture.

  14. The method of any one of claims 10-13, wherein the partition information comprises a temporal depth associated with a first collocated coding block and a temporal depth associated with a second collocated coding block in the plurality of collocated coding blocks, and wherein the method further comprises: determining a difference between the temporal depth associated with the first collocated coding block and the temporal depth associated with the second collocated coding block; comparing the difference between the temporal depth associated with the first collocated coding block and the temporal depth associated with the second collocated coding block to a threshold; and based on the comparison, determining whether to refrain from predicting the partition parameter, wherein prediction of the partition parameter associated with the current video block is performed based on the difference being below the threshold.

  15. The method of any one of claims 10-13, wherein the method further comprises: obtaining a motion vector difference for the current video block; comparing the motion vector difference to a threshold; and based on the comparison, determining whether to refrain from predicting the partition parameter, wherein prediction of the partition parameter associated with the current video block is performed based on the motion vector difference being below the threshold.

  16. The method of any one of claims 10-13, wherein the method further comprises: obtaining a motion vector difference for the current video block, wherein prediction of the partition parameter associated with the current video block is performed based on the partition information associated with the plurality of collocated coding blocks and the motion vector difference for the current video block.

  17. The method of any one of claims 10-16, wherein prediction of the partition parameter associated with the current video block is performed further based on at least one of a block partition, an intra sub-partition, a subblock transform, or a geometric partition mode.

  18. The method of any one of claims 10-17, wherein prediction of the partition parameter associated with the current video block is performed based on the partition information associated with the plurality of collocated coding blocks and at least one of a quantization parameter, a block size, or a color component of the current video block.

  19. A computer program product which is stored on a non-transitory computer readable medium and comprises program code instructions for implementing one or more steps according to at least one of claims 10-18 when executed by a processor.

  20. Video data comprising information representative of the current video block encoded in accordance with any one of claims 11-18.
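The two threshold checks recited in claims 5/14 (temporal depth difference) and claims 6/15 (motion vector difference) can be sketched as simple predicates. This is only an illustration under stated assumptions: the claims do not say how the "difference" between depths is signed or how a two-component motion vector difference is reduced to a single value for comparison, so the absolute difference and the sum of absolute components used here are hypothetical choices, as are the function names.

```python
from typing import Tuple


def depth_gate(depth_first: int, depth_second: int, threshold: int) -> bool:
    """Return True when prediction is performed under the claim 5/14 check.

    Assumption: the "difference" is the absolute difference between the
    temporal depths of the first and second collocated coding blocks.
    """
    return abs(depth_first - depth_second) < threshold


def mvd_gate(mvd: Tuple[int, int], threshold: int) -> bool:
    """Return True when prediction is performed under the claim 6/15 check.

    Assumption: the MVD magnitude is |dx| + |dy|; the claims only say the
    motion vector difference is compared to a threshold.
    """
    dx, dy = mvd
    return abs(dx) + abs(dy) < threshold


# Usage: collocated blocks that roughly agree pass the depth check, while a
# large motion vector difference causes the device to refrain from predicting.
use_depth_prediction = depth_gate(2, 3, threshold=2)
use_mvd_prediction = mvd_gate((3, -2), threshold=4)
```

In both predicates, a False result corresponds to the device refraining from predicting the partition parameter.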

Description

TEMPORAL PREDICTION FOR PARTITIONING PARAMETERS

CROSS-REFERENCE TO RELATED APPLICATIONS

[0001] This application claims the benefit of European Provisional Patent Application No. 23306114.2, filed June 30, 2023, the contents of which are hereby incorporated by reference herein.

BACKGROUND

[0002] Video coding systems may be used to compress digital video signals, e.g., to reduce the storage and/or transmission bandwidth needed for such signals. Video coding systems may include, for example, block-based, wavelet-based, and/or object-based systems.

SUMMARY

[0003] Systems, methods, and instrumentalities are disclosed for performing video coding using temporal prediction for partitioning parameters. The temporal prediction of partitioning parameters may be enhanced by utilizing coding information (e.g., the motion vector difference (MVD) of the current coding unit (CU)) and/or more temporal areas from several previously coded frames. For example, multiple (e.g., two) collocated reference pictures may be derived and/or used to perform the bi-predictive temporal prediction of the partitioning parameters for a (e.g., each) B slice. The temporal prediction may be utilized to derive other partitioning parameters, such as one or more of the following: the allowance and the direction of intra sub-partitions (ISP) split; the allowance, the direction, and/or the position of subblock transform (SBT) split; and/or the allowances and/or the partitioning modes of geometric partition mode (GPM) Intra/spatial GPM (SGPM)/intra block copy with geometry partitioning (IBC-GPM).

[0004] A coding device (e.g., decoder) may obtain a current coding block. The decoder may determine whether to temporally predict a partition parameter associated with the current coding block, for example, based on coding information associated with the current coding block. The coding information associated with the current coding block may comprise at least one of a quantization parameter, a block size, or a color component of the current coding block. The coding information associated with the current coding block may comprise an MVD. The decoder may decode the current coding block based on the determining.

[0005] The decoder may determine whether to temporally predict a partition parameter associated with the current coding block, for example, based on the MVD. For example, based on the MVD being zero, the decoder may determine to temporally predict the partition parameter(s) associated with the current coding block. For example, based on the MVD being less than a value, the decoder may determine to temporally predict the partition parameter(s) associated with the current coding block. For example, the decoder may (e.g., determine to) refrain from temporally predicting the partition parameter associated with the current coding block, for example, based on the MVD being above a value.

[0006] Upon determining to temporally predict the partition parameters, the decoder may obtain a collocated picture and may identify, in the collocated picture, a collocated coding block of the current block. The decoder may determine the partition parameter associated with the current coding block, for example, based on partition information associated with the collocated coding block.

[0007] A coding device (e.g., an encoder) may obtain a current coding block. The encoder may determine whether to temporally predict a partition parameter associated with the current coding block, for example, based on coding information associated with the current coding block. The coding information associated with the current coding block may comprise at least one of a quantization parameter, a block size, or a color component of the current coding block. The coding information associated with the current coding block may comprise an MVD. The encoder may encode the current coding block based on the determining.

[0008] For example, based on the MVD being zero, the encoder may determine to temporally predict the partition parameter(s) associated with the current coding block. For example, based on the MVD being less than a value, the encoder may determine to temporally predict the partition parameter(s) associated with the current coding block. For example, based on the MVD being above a value, the encoder may (e.g., determine to) refrain from temporally predicting the partition parameter associated with the current coding block.

[0009] Upon determining to temporally predict the partition parameters, the encoder may obtain a collocated picture and may identify, in the collocated picture, a collocated coding block of the current block. The encoder may determine the partition parameter associated with the current coding block, for example, based on partition information associated with the collocated coding block.

[0010] For example, a device for video encoding and/or decoding may include a processor. The device may obtain a plurality of colloca