CN-122029570-A - Method for defining profiles for a dynamic mesh codec and subcomponents thereof

Abstract

A method according to some embodiments includes: obtaining a volumetric video bitstream, wherein the volumetric video bitstream includes at least one sub-bitstream and further includes a parameter set providing profile information for the at least one sub-bitstream; selecting a decoder configuration for decoding the at least one sub-bitstream based on the profile information; and decoding the at least one sub-bitstream using the selected decoder configuration.

Inventors

G. Brad
G. Martin Kocher
O. Mokokuade

Assignees

InterDigital CE Patent Holdings

Dates

Publication Date: 20260512
Application Date: 20241009
Priority Date: 20231010

Claims (15)

1. An encoding method, comprising: encoding a first set of parameters of a volumetric video bitstream, comprising: performing a first iteration through one or more indices associated with respective atlases, and, for each index in the first iteration, encoding an associated atlas identifier of the respective atlas; and encoding a second set of parameters for one or more sub-bitstreams of the volumetric video bitstream, comprising: performing a second iteration through the one or more indices, and, for each index in the second iteration, determining the associated atlas identifier and encoding a plurality of parameters of the atlas identified by the associated atlas identifier.
2. An encoding device comprising one or more processors, the one or more processors being configured to perform at least the following: encoding a first set of parameters of a volumetric video bitstream, comprising: performing a first iteration through one or more indices associated with respective atlases, and, for each index in the first iteration, encoding an associated atlas identifier of the respective atlas; and encoding a second set of parameters for one or more sub-bitstreams of the volumetric video bitstream, comprising: performing a second iteration through the one or more indices, and, for each index in the second iteration, determining the associated atlas identifier and encoding a plurality of parameters of the atlas identified by the associated atlas identifier.
3. The method of claim 1 or the apparatus of claim 2, further comprising encoding, in a first parameter set, a parameter indicative of a number of atlases in the volumetric video bitstream, wherein the number of iterations performed in the first iteration and in the second iteration is determined by the indicated number.
4. A method according to claim 1, or a method according to claim 3 when dependent on claim 1, or an apparatus according to claim 2, or an apparatus according to claim 3 when dependent on claim 2, wherein the first parameter set is a V3C parameter set.
5. The method of claim 1, or the method of any of claims 3-4 when dependent on claim 1, or the apparatus of claim 2, or the apparatus of any of claims 3-4 when dependent on claim 2, wherein the second parameter set is a video parameter set (VPS) extension for the video-based dynamic mesh codec (V-DMC).
6. The method of claim 1, or the method of any of claims 3-4 when dependent on claim 1, or the apparatus of claim 2, or the apparatus of any of claims 3-4 when dependent on claim 2, wherein the plurality of parameters includes codec identifiers of the respective atlases.
7. The method of claim 1, or the method of any of claims 3-4 when dependent on claim 1, or the apparatus of claim 2, or the apparatus of any of claims 3-4 when dependent on claim 2, wherein the plurality of parameters includes a frame width and a frame height of the respective atlas.
8. A decoding method, comprising: decoding a first set of parameters of a volumetric video bitstream, comprising: performing a first iteration through one or more indices associated with respective atlases, and, for each index in the first iteration, decoding an associated atlas identifier of the respective atlas; and decoding a second set of parameters of one or more sub-bitstreams of the volumetric video bitstream, comprising: performing a second iteration through the one or more indices, and, for each index in the second iteration, determining the associated atlas identifier and decoding a plurality of parameters of the atlas identified by the associated atlas identifier.
9. A decoding device comprising one or more processors, the one or more processors being configured to perform at least the following: decoding a first set of parameters of a volumetric video bitstream, comprising: performing a first iteration through one or more indices associated with respective atlases, and, for each index in the first iteration, decoding an associated atlas identifier of the respective atlas; and decoding a second set of parameters of one or more sub-bitstreams of the volumetric video bitstream, comprising: performing a second iteration through the one or more indices, and, for each index in the second iteration, determining the associated atlas identifier and decoding a plurality of parameters of the atlas identified by the associated atlas identifier.
10. The method of claim 8 or the apparatus of claim 9, further comprising decoding, from a first parameter set, a parameter indicating a number of atlases in the volumetric video bitstream, wherein the number of iterations performed in the first iteration and in the second iteration is determined by the indicated number.
11. The method of claim 8, or the method of claim 10 when dependent on claim 8, or the apparatus of claim 9, or the apparatus of claim 10 when dependent on claim 9, wherein the first parameter set is a V3C parameter set.
12. The method of claim 8, or the method of any of claims 10-11 when dependent on claim 8, or the apparatus of claim 9, or the apparatus of any of claims 10-11 when dependent on claim 9, wherein the second parameter set is a video parameter set (VPS) extension for the video-based dynamic mesh codec (V-DMC).
13. The method of claim 8, or the method of any of claims 10-12 when dependent on claim 8, or the apparatus of claim 9, or the apparatus of any of claims 10-12 when dependent on claim 9, wherein the plurality of parameters includes codec identifiers of the respective atlases.
14. The method of claim 8, or the method of any of claims 10-13 when dependent on claim 8, or the apparatus of claim 9, or the apparatus of any of claims 10-13 when dependent on claim 9, wherein the plurality of parameters includes bit depths of the respective atlases.
15. The method of claim 8, or the method of any of claims 10-14 when dependent on claim 8, or the apparatus of claim 9, or the apparatus of any of claims 10-14 when dependent on claim 9, wherein the plurality of parameters includes a frame width and a frame height of the respective atlas.
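
The two-pass structure recited in claims 1 and 8 can be illustrated with a minimal Python sketch. It is not the V3C or V-DMC syntax: the fixed-width byte layout, the `AtlasParams` fields, and the function names are assumptions chosen only to show the control flow. The first pass writes the atlas count and one atlas identifier per index, and the second pass walks the same indices, looks up the identifier, and writes that atlas's parameters (here a codec identifier, a frame width, and a frame height).

```python
import struct
from dataclasses import dataclass
from typing import Dict, Tuple


@dataclass
class AtlasParams:
    """Hypothetical per-atlas parameters (names are illustrative only)."""
    codec_id: int
    frame_width: int
    frame_height: int


# Assumed layout of one second-pass record: 1-byte codec id, 2-byte width, 2-byte height.
RECORD = struct.Struct(">BHH")


def encode_parameter_sets(atlases: Dict[int, AtlasParams]) -> Tuple[bytes, bytes]:
    atlas_ids = list(atlases)
    # First parameter set: atlas count, then one atlas identifier per index.
    first = struct.pack(">B", len(atlas_ids))
    for idx in range(len(atlas_ids)):
        first += struct.pack(">B", atlas_ids[idx])
    # Second parameter set: same indices, parameters of the identified atlas.
    second = b""
    for idx in range(len(atlas_ids)):
        atlas_id = atlas_ids[idx]
        p = atlases[atlas_id]
        second += RECORD.pack(p.codec_id, p.frame_width, p.frame_height)
    return first, second


def decode_parameter_sets(first: bytes, second: bytes) -> Dict[int, AtlasParams]:
    # First iteration: read the count, then the atlas identifier for each index.
    (count,) = struct.unpack_from(">B", first, 0)
    atlas_ids = [struct.unpack_from(">B", first, 1 + idx)[0] for idx in range(count)]
    # Second iteration: map each index back to its atlas identifier.
    decoded: Dict[int, AtlasParams] = {}
    for idx in range(count):
        atlas_id = atlas_ids[idx]
        codec_id, width, height = RECORD.unpack_from(second, idx * RECORD.size)
        decoded[atlas_id] = AtlasParams(codec_id, width, height)
    return decoded
```

Because the second pass reuses the identifiers recovered in the first pass, the per-atlas parameters do not need to repeat the atlas identifier, which is the dependency the claims describe between the two parameter sets.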

Description

Method for defining profiles for a dynamic mesh codec and subcomponents thereof

Cross reference

The present application claims priority from European patent application No. 23306750.3, entitled "Methods to Define Profiles for Dynamic Mesh Codec and its Sub-Components", filed on October 10, 2023, which is incorporated herein by reference in its entirety.

Background

Visual volumetric video-based coding (V3C) is described in ISO/IEC 23090-5:2021, Information technology — Coded representation of immersive media — Part 5: Visual volumetric video-based coding (V3C) and video-based point cloud compression (V-PCC). Video-based dynamic mesh coding (V-DMC) is a framework developed by extending V3C to compress dynamic meshes.

The V-DMC framework may involve different attributes specified for the dynamic mesh sequence. In V-DMC, the underlying static mesh codec may decode mesh attributes from faces or from vertices. These attributes may provide additional information about the static mesh, such as color, texture coordinates, normals, reflectivity information, transparency information, and/or user-defined attributes. Attributes from the static mesh codec may also be used by the base mesh codec. The base mesh bitstream is further encapsulated in a V3C bitstream. The hierarchy of the V-DMC bitstream is shown in Fig. 1.

The extension mechanism in the V3C bitstream may provide information about the types of attributes that the V3C bitstream/framework will use from the underlying base mesh codec. Additionally, the underlying mesh bitstream may provide a mechanism for indicating which types of mesh attributes are of interest from the underlying static mesh codec.

Fig. 1 illustrates encapsulation of different sub-bitstreams in a V-DMC bitstream.

Disclosure of Invention

A method according to some embodiments includes obtaining a volumetric video bitstream, wherein the volumetric video bitstream includes at least one sub-bitstream, and wherein the volumetric video bitstream further includes a parameter set providing profile information for the at least one sub-bitstream, selecting a decoder configuration for decoding the at least one sub-bitstream based on the profile information, and decoding the at least one sub-bitstream using the selected decoder configuration.

In some embodiments, the volumetric video bitstream is a V3C bitstream, and the parameter set is provided at the V3C level. In some embodiments, the at least one sub-bitstream is a base mesh bitstream, and the parameter set includes information indicating whether the base mesh bitstream encodes a static mesh or a dynamic mesh. In some embodiments, the sub-bitstreams include a static mesh bitstream and a displacement bitstream, and the profile information indicates whether the displacement bitstream is encoded using an arithmetic codec. In some embodiments, the profile information includes information identifying a video codec to be used for decoding the at least one sub-bitstream. Some embodiments include signaling a flag at the bitstream level, wherein the flag indicates whether the information identifying the video codec is signaled at the bitstream level or at the sub-bitstream level. In some embodiments, the sub-bitstreams include at least a displacement bitstream and a texture attribute bitstream, and the profile information identifies at least a first codec for decoding the displacement bitstream and a second codec for decoding the texture attribute bitstream. In some embodiments, the information identifying the first codec and the second codec is an integer value.
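
The selection step in the embodiments above can be sketched in Python as follows. This is only an illustration under stated assumptions: the integer-to-codec table `CODEC_NAMES`, the `ProfileInfo` fields, and the sub-bitstream keys are hypothetical and do not reproduce any syntax element of the application or of V3C. The sketch shows a flag deciding whether the codec identifier is read at the bitstream level or per sub-bitstream, and an integer identifier being mapped to a decoder.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional

# Hypothetical mapping from integer codec identifiers to decoder names.
CODEC_NAMES = {0: "arithmetic", 1: "hevc", 2: "vvc"}


@dataclass
class ProfileInfo:
    """Illustrative profile information carried in a parameter set."""
    # Flag: True if one codec id is signaled for the whole bitstream,
    # False if a codec id is signaled for each sub-bitstream.
    codec_at_bitstream_level: bool
    bitstream_codec_id: Optional[int] = None
    sub_bitstream_codec_ids: Dict[str, int] = field(default_factory=dict)


def select_decoder(profile: ProfileInfo, sub_bitstream: str) -> str:
    """Return the decoder name to use for the given sub-bitstream."""
    if profile.codec_at_bitstream_level:
        codec_id = profile.bitstream_codec_id
    else:
        codec_id = profile.sub_bitstream_codec_ids.get(sub_bitstream)
    if codec_id not in CODEC_NAMES:
        raise ValueError(f"no known codec signaled for {sub_bitstream!r}")
    return CODEC_NAMES[codec_id]


# Example: displacement decoded with an arithmetic codec, texture with a video codec.
per_stream = ProfileInfo(
    codec_at_bitstream_level=False,
    sub_bitstream_codec_ids={"displacement": 0, "texture": 1},
)
displacement_decoder = select_decoder(per_stream, "displacement")
texture_decoder = select_decoder(per_stream, "texture")
```

A decoder that supports only some of the identifiers could use the same lookup to reject a bitstream before attempting to decode any sub-bitstream.
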
In some embodiments, the first codec and the second codec are the same video codec. In other embodiments, the first codec is an arithmetic codec and the second codec is a video codec. In some embodiments, the sub-bitstream comprises a displacement bitstream, the parameter set comprises a video parameter set extension, and the video parameter set extension identifies a codec used to decode the displacement bitstream.

An apparatus according to some embodiments includes one or more processors, the apparatus being configured to perform any of the methods described herein. An apparatus according to some embodiments includes at least one processor and a computer-readable medium storing instructions for performing any of the methods described herein. A computer-readable medium according to some embodiments stores instructions for performing any of the methods described herein.

A signal according to some embodiments comprises a volumetric video bitstream, wherein the volumetric video bitstream comprises at least one sub-bitstream, and wherein the