Search

CN-121986482-A - Support of texture layer and depth layer in MV-HEVC bit stream

CN121986482ACN 121986482 ACN121986482 ACN 121986482ACN-121986482-A

Abstract

A mechanism for processing video data is disclosed herein. The mechanism includes determining to apply a multiview ten-bit (multiview 10) level to the visual media data. The conversion between visual media data and the bitstream may then be performed based on the multi-view 10 level. Various constraints may also be applied to bitstreams that conform to the multi-view 10 level or layers thereof.

Inventors

  • WANG YEKUI
  • ZHANG LI

Assignees

  • 字节跳动有限公司

Dates

Publication Date
20260505
Application Date
20240927
Priority Date
20230929

Claims (19)

  1. 1. A method of processing media data, comprising: determining to apply multi-view ten-bit (multi-view 10) levels to visual media data, and The conversion between the visual media data and the bitstream is performed based on the multi-view 10 level.
  2. 2. The method of claim 2, wherein an Output Layer Set (OLS) containing layers conforming to the multi-view 10 level is allowed to have both texture and depth layers.
  3. 3. The method according to claim 1 or 2, wherein the chroma format identification code (chroma format idc) of all active Sequence Parameter Sets (SPS) of the current layer and of all active SPS of the reference layer of the current layer has to be equal to only 1.
  4. 4. A method as claimed in any one of claims 1-3, wherein, for a Network Abstraction Layer (NAL) unit header layer identifier (nuh_layer_id) equal to the current layer (layerA) of i and another layer (layerB) with nuh_layer_id equal to j, when layerB is the reference layer of layerA, the value of the luma bit depth reduction (bit_depth_luma_minus8) of layerA and the value of the bit_depth_luma_minus8 of layerB must be the same and the value of the chroma bit depth reduction (bit_depth_chroma_minus8) of layerA and the value of the bit_depth_chroma_minus8 of layerB must be the same.
  5. 5. The method of any of claims 1-4, wherein a sub-bitstream (subBitstream) is the bitstream of the OLS, and when a layer in the OLS meets the multiview 10 level, chroma_format_idc of all active SPS of a texture layer in subBitstream has to be equal to only 1, and chroma_format_idc of all active SPS of a depth layer in subBitstream has to be equal to only 0.
  6. 6. The method of any of claims 1-5, wherein, for layerra with nuh_layerjd equal to i and layerB with nuh_layerjd equal to j in subBitstream when the layers in OLS conform to the multiview 10 level, the value of the chroma_format_idc of layerra and the value of the chroma_format_idc of layerrb must be the same when layerB is the reference layer of layerra, and the depth layer flag (DEPTHLAYERFLAG [ i ]) of i must be equal to the depth layer flag (DEPTHLAYERFLAG [ j ]).
  7. 7. The method of any of claims 1-6, wherein when a layer in OLS meets the multiview 10 level, any j value of a scalability identifier and scalability map index (scalabilityId [ j ] [ smIdx ]), for any scalability map index (smIdx) value not equal to 0 or 1, and for a layer identifier (layer_id_in_nuh [ j ]) in a NAL unit header j in a layer identifier list target (LAYERIDLISTTARGET) for deriving subBitstream, needs to be equal to 0, derived from any active Video Parameter Set (VPS).
  8. 8. The method of any of claims 1-7, wherein bit_depth_luma_minus8 of all active SPS of texture layers in subBitstream must be within a range of 0 to 2 (including 0 and 2) when layers in OLS conform to the multiview 10 level.
  9. 9. The method of any of claims 1-8, wherein bit_depth_luma_minus8 of all active SPS of a depth layer in subBitstream must be within a range of 0 to 8 (including 0 and 8) when a layer in OLS meets the multiview 10 level.
  10. 10. The method of any of claims 1-9, wherein bit_depth_chroma_minus8 of all active SPS for layers in subBitstream with nuh_layer_id equal to i and DEPTHLAYERFLAG [ i ] equal to 0 must be within a range of 0to 2 (including 0 and 2) when layers in OLS conform to the multiview 10 level.
  11. 11. The method of any of claims 1-10, wherein bit_depth_chroma_minus8 of all active SPS for layers in subBitstream with nuh_layer_id equal to i and DEPTHLAYERFLAG [ i ] equal to 1 must be within a range of 0 to 8 (including 0 and 8) when layers in OLS conform to the multiview 10 level.
  12. 12. The method of any of claims 1-11, wherein when a layer in OLS conforms to the multiview 10 level, for any layer in subBitstream, the values of picture width (pic_width_in_luma_samples) in units of luma samples and picture height (pic_height_in_luma_samples) in units of luma samples in each active SPS of all reference layers of the layer must be equal to the values of pic_width_in_luma_samples and pic_height_in_luma_samples, respectively, in each active SPS of the layers.
  13. 13. The method of any of claims 1-12, wherein values of pic_width_in_luma_samples and pic_height_in_luma_samples in each active SPS of the depth layers are required to be less than or equal to values of pic_width_in_luma_samples and pic_height_in_samples in each active SPS of the texture layers, respectively, when a layer in OLS conforms to the multiview 10 level, and when a texture layer with nuh_layer_id equal to i and a depth layer with nuh_layer_id equal to j are in subBitstream, and when view order index i (ViewOrderIdx [ i ]) is equal to view order index j (ViewOrderIdx [ j ]).
  14. 14. The method of any of claims 1-13, wherein the converting comprises encoding the visual media data into the bitstream.
  15. 15. The method of any of claims 1-13, wherein the converting comprises decoding the visual media data from the bitstream.
  16. 16. An apparatus for processing video data comprising a processor and a non-transitory memory having instructions thereon, wherein the instructions, when executed by the processor, cause the processor to perform the method of any of claims 1-15.
  17. 17. A non-transitory computer readable medium comprising a computer program product for use by a video codec device, wherein the computer program product comprises computer executable instructions stored on the non-transitory computer readable medium such that when the computer executable instructions are executed by a processor cause the video codec device to perform the method of any one of claims 1-15.
  18. 18. A non-transitory computer-readable recording medium storing a bitstream of video generated by a method performed by a video processing apparatus, wherein the method comprises: determining to apply multi-view ten-bit (multi-view 10) levels to visual media data, and A bitstream is generated based on the determination.
  19. 19. A method of storing a bitstream of video, comprising: Determining to apply a multi-view ten-bit (multi-view 10) level to the visual media data; Generating a bit stream based on the determination, and The bit stream is stored in a non-transitory computer readable recording medium.

Description

Support of texture layer and depth layer in MV-HEVC bit stream Cross Reference to Related Applications The present application claims priority and benefit from U.S. provisional patent application 63/586,750 filed on 29 of 2023, U.S. provisional patent application 63/591,033 filed on 17 of 2023, and U.S. provisional patent application 63/591,219 filed on 18 of 2023. All of the foregoing patent applications are incorporated herein by reference in their entirety. Technical Field This patent document relates to the generation, storage and use of digital audio video media information in a file format. Background Digital video occupies the maximum bandwidth used on the internet and other digital communication networks. As the number of connected user devices capable of receiving and displaying video increases, the bandwidth requirements for digital video usage may continue to increase. Disclosure of Invention A first aspect relates to a method of processing video data comprising determining to apply a multiview ten-bit (multiview 10) level to visual media data and performing a transition between the visual media data and a bitstream based on the multiview 10 level. A second aspect relates to an apparatus for processing video data, comprising a processor and a non-transitory memory having instructions thereon, wherein the instructions, when executed by the processor, cause the processor to perform any of the preceding aspects. A third aspect relates to a non-transitory computer readable medium comprising a computer program product for use by a video codec device, the computer program product comprising computer executable instructions stored on the non-transitory computer readable medium such that when the computer executable instructions are executed by a processor cause the video codec device to perform the method of any of the preceding aspects. A fourth aspect relates to a non-transitory computer readable recording medium storing a bitstream of video generated by a method performed by a video processing apparatus, wherein the method includes determining to apply a multiview ten-bit (multiview 10) level to visual media data, and generating the bitstream based on the determination. A fifth aspect relates to a method of storing a bitstream of video, comprising determining to apply a multiview ten-bit (multiview 10) level to visual media data, generating a bitstream based on the determination, and storing the bitstream in a non-transitory computer-readable recording medium. For clarity, any one of the foregoing embodiments may be combined with any one or more of the other foregoing embodiments to create new embodiments within the scope of the present disclosure. These and other features will be more clearly understood from the following detailed description taken in conjunction with the accompanying drawings and claims. Drawings For a more complete understanding of the present disclosure, reference is now made to the following brief description, taken in connection with the accompanying drawings and detailed description, wherein like reference numerals represent like parts. Fig. 1 is a block diagram illustrating an example video processing system. Fig. 2 is a block diagram of an example video processing apparatus. Fig. 3 is a flow chart of an example method of video processing. Fig. 4 is a block diagram illustrating an example video codec system. Fig. 5 is a block diagram illustrating an example encoder. Fig. 6 is a block diagram illustrating an example decoder. Fig. 7 is a schematic diagram of an example encoder. Detailed Description It should be understood at the outset that although an illustrative implementation of one or more embodiments are provided below, the disclosed systems and/or methods may be implemented using any number of techniques, whether currently known or in existence of development. The disclosure should in no way be limited to the illustrative embodiments, drawings, and techniques illustrated below, including the exemplary designs and implementations shown and described herein, but may be modified within the scope of the appended claims along with their full scope of equivalents. 1. Preliminary discussion This document relates to video codec technology. In particular, the present disclosure relates to support of both texture and depth layers in a multi-view efficient video codec (MV-HEVC) bitstream. The concept can be applied to any video codec standard or non-standard video codec, such as modifications of the High Efficiency Video Codec (HEVC) standard, alone or in various combinations. 2. Further discussion 2.1 Video coding and decoding standard Video codec standards have evolved primarily through the development of the International Telecommunications Union (ITU) telecommunication standardization sector (ITU-T) and the international organization for standardization (ISO)/International Electrotechnical Commission (IEC) standards. The ITU-T specifies the h.261 and h.263 standards, the ISO/IE