US-12621495-B2 - Method, apparatus, and medium for point cloud coding
Abstract
Embodiments of the present disclosure provide a solution for point cloud coding. A method for point cloud coding is proposed. The method comprises: performing a conversion between a point cloud sequence and a bitstream of the point cloud sequence based on a tile inventory, the tile inventory being valid starting from a first coded point cloud frame in the bitstream, the first coded point cloud frame immediately following a data unit for the tile inventory in a bitstream order, or the first coded point cloud frame following the data unit in the bitstream order and having a frame counter partially indicated in the data unit. Thereby, the proposed method can advantageously better support the application of the tile inventory.
Inventors
- Ye-Kui Wang
Assignees
- BYTEDANCE INC.
Dates
- Publication Date
- 20260505
- Application Date
- 20240620
Claims (20)
- 1 . A method for point cloud coding, comprising: performing a conversion between a point cloud sequence and a bitstream of the point cloud sequence based on a tile inventory, the tile inventory being valid starting from the first coded point cloud frame following a data unit for the tile inventory.
- 2 . The method of claim 1 , wherein a frame counter of the first coded point cloud frame is partially indicated in the data unit.
- 3 . The method of claim 2 , wherein the data unit comprises a first syntax element indicating a first number of least significant bits of the frame counter and a second syntax element indicating the first number.
- 4 . The method of claim 3 , wherein the bitstream comprises a plurality of coded point cloud frames following the data unit, each of the plurality of coded point cloud frames has a frame counter comprising the first number of least significant bits, and the first coded point cloud frame is closest to the data unit among the plurality of coded point cloud frames.
- 5 . The method of claim 3 , wherein the first syntax element is syntax element ti_frame_ctr_lsb, and the second syntax element is syntax element ti_frame_ctr_lsb bits.
- 6 . The method of claim 1 , wherein the tile inventory remains valid for at least one coded point cloud frame following the first coded point cloud frame, until the tile inventory is replaced by a further tile inventory for a further coded point cloud frame in the bitstream.
- 7 . The method of claim 1 , wherein the tile inventory remains valid for at least one coded point cloud frame following the first coded point cloud frame, until a further data unit for a further tile inventory is coded, the further data unit following the data unit and the further tile inventory being valid starting from a further coded point cloud frame in the bitstream.
- 8 . The method of claim 1 , wherein the tile inventory remains valid for at least one coded point cloud frame following the first coded point cloud frame until a time instant, the time instant being an earlier one of following: a time instant when the tile inventory is replaced by a further tile inventory for a further coded point cloud frame in the bitstream, or an ending time of a coded point cloud sequence associated with the point cloud sequence.
- 9 . The method of claim 1 , wherein the tile inventory remains valid for at least one coded point cloud frame following the first coded point cloud frame until a time instant, the time instant being an earlier one of following: a time instant when a further data unit for a further tile inventory is coded, the further data unit following the data unit and the further tile inventory being valid starting from a further coded point cloud frame in the bitstream, or an ending time of a coded point cloud sequence associated with the point cloud sequence.
- 10 . The method of claim 1 , wherein the conversion includes encoding the point cloud sequence into the bitstream.
- 11 . The method of claim 1 , wherein the conversion includes decoding the point cloud sequence from the bitstream.
- 12 . An apparatus for processing point cloud data comprising a processor and a non-transitory memory with instructions thereon, wherein the instructions upon execution by the processor, cause the processor to perform acts comprising: performing a conversion between a point cloud sequence and a bitstream of the point cloud sequence based on a tile inventory, the tile inventory being valid starting from the first coded point cloud frame following a data unit for the tile inventory.
- 13 . The apparatus of claim 12 , wherein a frame counter of the first coded point cloud frame is partially indicated in the data unit.
- 14 . The apparatus of claim 13 , wherein the data unit comprises a first syntax element indicating a first number of least significant bits of the frame counter and a second syntax element indicating the first number.
- 15 . The apparatus of claim 14 , wherein the first syntax element is syntax element ti_frame_ctr_lsb, and the second syntax element is syntax element ti_frame_ctr_lsb_bits.
- 16 . A non-transitory computer-readable storage medium storing instructions that cause a processor to perform acts comprising: performing a conversion between a point cloud sequence and a bitstream of the point cloud sequence based on a tile inventory, the tile inventory being valid starting from the first coded point cloud frame following a data unit for the tile inventory.
- 17 . The non-transitory computer-readable storage medium of claim 16 , wherein a frame counter of the first coded point cloud frame is partially indicated in the data unit.
- 18 . The non-transitory computer-readable storage medium of claim 17 , wherein the data unit comprises a first syntax element indicating a first number of least significant bits of the frame counter and a second syntax element indicating the first number.
- 19 . The non-transitory computer-readable storage medium of claim 18 , wherein the first syntax element is syntax element ti_frame_ctr_lsb, and the second syntax element is syntax element ti_frame_ctr_lsb bits.
- 20 . A method for storing a bitstream of a point cloud sequence, comprising: performing a conversion between the point cloud sequence and the bitstream based on a tile inventory, the tile inventory being valid starting from the first coded point cloud frame following a data unit for the tile inventory; and storing the bitstream in a non-transitory computer-readable recording medium.
Description
CROSS REFERENCE TO RELATED APPLICATIONS This application is a continuation of International Application No. PCT/US2022/081953, filed on Dec. 19, 2022, which claims the benefit of the U.S. Provisional Application No. 63/265,758, filed Dec. 20, 2021. The entire contents of these applications are hereby incorporated by reference in their entireties. FIELD Embodiments of the present disclosure relates generally to point cloud coding techniques, and more particularly, to signaling of tiles in geometry based point cloud compression. BACKGROUND A point cloud is a collection of individual data points in a three-dimensional (3D) plane with each point having a set coordinate on the X, Y, and Z axes. Thus, a point cloud may be used to represent the physical content of the three-dimensional space. Point clouds have shown to be a promising way to represent 3D visual data for a wide range of immersive applications, from augmented reality to autonomous cars. Point cloud coding standards have evolved primarily through the development of the well-known MPEG organization. MPEG, short for Moving Picture Experts Group, is one of the main standardization groups dealing with multimedia. In 2017, the MPEG 3D Graphics Coding group (3DG) published a call for proposals (CFP) document to start to develop point cloud coding standard. The final standard will consist in two classes of solutions. Video-based Point Cloud Compression (V-PCC or VPCC) is appropriate for point sets with a relatively uniform distribution of points. Geometry-based Point Cloud Compression (G-PCC or GPCC) is appropriate for more sparse distributions. However, coding efficiency of conventional point cloud coding techniques is generally expected to be further improved. SUMMARY Embodiments of the present disclosure provide a solution for point cloud coding. In a first aspect, a method for point cloud coding is proposed. The method comprises: performing a conversion between a point cloud sequence and a bitstream of the point cloud sequence based on a tile inventory, the tile inventory being valid starting from a first coded point cloud frame in the bitstream, the first coded point cloud frame immediately following a data unit for the tile inventory in a bitstream order, or the first coded point cloud frame following the data unit in the bitstream order and having a frame counter partially indicated in the data unit. Based on the method in accordance with the first aspect of the present disclosure, the persistency scope of the tile inventory starts from a first coded point cloud frame following the data unit for the tile inventory. Compared with the conventional solution where the persistency scope of the tile inventory is not clearly specified, the persistency scope of the tile inventory is clearly specified in the proposed method and thus the proposed method can advantageously better support the application of the tile inventory and improve the coding efficiency. In a second aspect, an apparatus for processing point cloud data is proposed. The apparatus for processing point cloud data comprises a processor and a non-transitory memory with instructions thereon. The instructions, upon execution by the processor, cause the processor to perform a method in accordance with the first aspect of the present disclosure. In a third aspect, a non-transitory computer-readable storage medium is proposed. The non-transitory computer-readable storage medium stores instructions that cause a processor to perform a method in accordance with the first aspect of the present disclosure. In a fourth aspect, another non-transitory computer-readable recording medium is proposed. The non-transitory computer-readable recording medium stores a bitstream of a point cloud sequence which is generated by a method performed by a point cloud processing apparatus. The method comprises: performing a conversion between the point cloud sequence and the bitstream based on a tile inventory, the tile inventory being valid starting from a first coded point cloud frame in the bitstream, the first coded point cloud frame immediately following a data unit for the tile inventory in a bitstream order, or the first coded point cloud frame following the data unit in the bitstream order and having a frame counter partially indicated in the data unit. In a fifth aspect, a method for storing a bitstream of a point cloud sequence is proposed. The method comprises: performing a conversion between the point cloud sequence and the bitstream based on a tile inventory, the tile inventory being valid starting from a first coded point cloud frame in the bitstream, the first coded point cloud frame immediately following a data unit for the tile inventory in a bitstream order, or the first coded point cloud frame following the data unit in the bitstream order and having a frame counter partially indicated in the data unit; and storing the bitstream in a non-transitory computer-readable recording medium. This Summary is provid