CN-121986491-A - Basic grid information design in dynamic grid coding and decoding
Abstract
A mechanism for processing video data is disclosed. The mechanism includes determining that a value of an estimated syntax element is equal to 0 when the syntax element is not present in the bitstream. The value of the syntax element is incremented by one to specify the number of sub-meshes in each mesh frame. Conversion is performed between the visual media data and the bitstream based on the syntax element.
Inventors
- XU JIZHENG
Assignees
- 字节跳动有限公司
Dates
- Publication Date
- 20260505
- Application Date
- 20241008
- Priority Date
- 20231009
Claims (20)
- 1. A method for processing media data, comprising: determining that a value of a syntax element is estimated to be equal to 0 when the syntax element is not present in the bitstream, wherein the value of the syntax element is incremented by one to specify a number of sub-grids in each grid frame, and Conversion between visual media data and the bitstream is performed based on the value of the syntax element.
- 2. The method of claim 1, wherein the syntax element is designated bmsi _num_ submeshes _minus1.
- 3. The method of any of claims 1-2, wherein the syntax element has a value in a range of 0 to 255 (including a boundary value).
- 4. The method of any of claims 1-2, wherein the value of the syntax element is encoded in the bitstream as a 6-bit unsigned integer.
- 5. The method of any of claims 1-2, wherein the value of the syntax element is encoded in the bitstream using N bits, where N is an integer greater than 8.
- 6. The method of any of claims 1-2, wherein the value of the syntax element is encoded in the bitstream as an unsigned integer of N bits, where N is an integer greater than 8.
- 7. The method of any of claims 1-2, wherein the value of the syntax element is encoded in the bitstream as an unsigned integer of N bits, where N is one of 12, 16, 24, and 32.
- 8. The method according to any of claims 1-7, wherein when there is only one sub-grid, the value of the Identifier (ID) of the sub-grid is not present in the bitstream.
- 9. The method according to any of claims 1-7, wherein when there is only one sub-grid, the value of the Identifier (ID) of the sub-grid is assumed to be 0.
- 10. The method according to any of claims 1-9, wherein an increment value between a number of bits representing a sub-grid Identifier (ID) syntax element and a fixed value is included in the bitstream, and the fixed value is greater than 1.
- 11. The method of claim 10, wherein the fixed value comprises Ceil (Log 2 (bmsi _num_ submeshes _minus1+1)), where Ceil represents an upper bound function and Log2 represents a base 2 logarithmic function.
- 12. The method of any of claims 10-11, wherein the delta value is presumed to be equal to 0 when the delta value is not present in the bitstream.
- 13. The method of any of claims 10-12, wherein the delta value ranges from 0 to N, where N is an integer.
- 14. The method of claim 13, wherein N is equal to 15.
- 15. The method of any of claims 1-14, wherein the converting comprises encoding the media data into the bitstream.
- 16. The method of any of claims 1-14, wherein the converting comprises decoding the media data from the bitstream.
- 17. An apparatus for processing video or image data comprising a processor and a non-transitory memory having instructions thereon, wherein the instructions, when executed by the processor, cause the processor to perform the method of any of claims 1-16.
- 18. A non-transitory computer readable medium comprising a computer program product for use by a video codec device, the computer program product comprising computer executable instructions stored on the non-transitory computer readable medium such that when executed by a processor cause the video codec device to perform the method of any one of claims 1-16.
- 19. A non-transitory computer readable recording medium storing a bitstream of video generated by a method performed by a video processing apparatus, wherein the method comprises: determining that a value of a syntax element is estimated to be equal to 0 when the syntax element is not present in the bitstream, wherein the value of the syntax element is incremented by one to specify a number of sub-grids in each grid frame, and Conversion between visual media data and the bitstream is performed based on the value of the syntax element.
- 20. A method for storing a bitstream of video, comprising: determining to infer that a value of a syntax element is equal to 0 when the syntax element is not present in the bitstream, wherein the value of the syntax element is incremented by one to specify a number of sub-meshes in each mesh frame; generating the bit stream based on the syntax element, and The bit stream is stored in a non-transitory computer readable recording medium.
Description
Basic grid information design in dynamic grid coding and decoding Cross Reference to Related Applications This patent application claims the benefit of U.S. provisional patent application No. 63/588,788, filed on day 10/9 of 2023, which is incorporated herein by reference. Technical Field The present disclosure relates to the generation, storage, and use of digital audio video media information in a file format. Background Digital video occupies the maximum bandwidth used on the internet and other digital communication networks. As the number of connected user devices capable of receiving and displaying video increases, the bandwidth requirements for digital video usage may continue to increase. Disclosure of Invention The first aspect relates to a method for processing media data, comprising determining that a value of a putative syntax element is equal to 0 when the syntax element is not present in a bitstream, wherein the value of the syntax element plus one specifies a number of sub-grids in each grid frame, and performing a conversion between visual media data and the bitstream based on the value of the syntax element. Optionally, in any one of the above aspects, another embodiment of this aspect provides that the syntax element is designated bmsi _num_ submeshes _minus1. Alternatively, in any of the above aspects, another embodiment of the aspect provides that the value of the syntax element is in a range of 0 to 255 (including the boundary value). Optionally, in any of the above aspects, another embodiment of this aspect provides that the value of the syntax element is encoded in the bitstream as an unsigned integer of 6 bits. Optionally, in any of the above aspects, another embodiment of this aspect provides for encoding and decoding the value of the syntax element in the bitstream using N bits, where N is an integer greater than 8. Optionally, in any of the above aspects, another embodiment of this aspect provides that the value of the syntax element is encoded in the bitstream as an unsigned integer of N bits, where N is an integer greater than 8. Optionally, in any of the above aspects, another embodiment of this aspect provides that the value of the syntax element is encoded in the bitstream as an unsigned integer of N bits, where N is one of 12, 16, 24 and 32. Alternatively, in any of the above aspects, another implementation of the aspect provides that when there is only one sub-grid, a value of an Identifier (ID) of the sub-grid does not exist in the bitstream. Alternatively, in any one of the above aspects, another embodiment of the aspect provides that when there is only one sub-grid, a value of an Identifier (ID) of the sub-grid is estimated to be equal to 0. Optionally, in any one of the above aspects, another embodiment of the aspect provides that an increment value between the number of bits representing the sub-grid Identifier (ID) syntax element and a fixed value is included in the bitstream, and wherein the fixed value is greater than 1. Alternatively, in any one of the above aspects, another embodiment of this aspect provides that the fixed value includes Ceil (Log 2 (bmsi _num_ submeshes _minus1+1)), where Ceil represents an upper limit function and Log2 represents a logarithmic function that bases on 2. Alternatively, in any of the above aspects, another embodiment of the aspect provides that when the delta value is not present in the bitstream, the delta value is estimated to be equal to 0. Alternatively, in any of the above aspects, another embodiment of this aspect provides that the increment value ranges from 0 to N, where N is an integer. Alternatively, in any of the above aspects, another embodiment of the aspect provides that N is equal to 15. Optionally, in any of the above aspects, another embodiment of the aspect provides that the converting comprises encoding the media data as a bitstream. Optionally, in any of the above aspects, another embodiment of the aspect provides that the converting comprises decoding the media data from a bitstream. A second aspect relates to an apparatus for processing video or image data, comprising a processor, and a non-transitory memory having instructions thereon, wherein the instructions, when executed by the processor, cause the processor to perform the method of any of the disclosed embodiments. A third aspect relates to a non-transitory computer readable medium comprising a computer program product for use by a video codec device, the computer program product comprising computer executable instructions stored on the non-transitory computer readable medium such that when executed by a processor cause the video codec device to perform the method of any of the disclosed embodiments. A fourth aspect relates to a non-transitory computer-readable recording medium storing a bitstream of video generated by a method performed by a video processing apparatus, wherein the method includes determining that a value of a putative syntax element is e