Search

US-12627495-B2 - Picture partitioning in video coding

US12627495B2US 12627495 B2US12627495 B2US 12627495B2US-12627495-B2

Abstract

Methods, systems, and devices for media processing are described. One example method of processing digital media includes performing a conversion between a media segment and a bitstream of the media segment, the conversion conforming to a format rule and an encryption rule, and the format rule specifying that an indication of an integrity of a portion of the media segment is signaled in the bitstream. In an example, the media segment is a video segment, an audio segment, or an image.

Inventors

  • Jizheng Xu
  • Ye-Kui Wang
  • Li Zhang
  • Kai Zhang

Assignees

  • BYTEDANCE INC.

Dates

Publication Date
20260512
Application Date
20230329

Claims (18)

  1. 1 . A method of processing video data, comprising: performing a conversion between a media segment and a bitstream of the media segment, wherein the conversion conforms to a format rule and an encryption rule, wherein the format rule specifies an indication of an integrity of a portion of the media segment is signaled in the bitstream, wherein the encryption rule specifies applying at least one of message digest algorithm 5 (MD5), secure hash algorithm 0 (SHA-0), or secure hash algorithm 1 (SHA-1) to the portion of the media segment, and wherein the format rule further specifies that a region of the media segment is subject to a constraint, and wherein the format rule further specifies that the region is measured, based on a color format of the media segment, using a chroma unit or a luma unit.
  2. 2 . The method of claim 1 , wherein the region of measured using the luma unit due to the color format of the media segment being 4:0:0.
  3. 3 . The method of claim 1 , wherein the region of measured using the chroma unit due to the color format of the media segment being different from 4:0:0.
  4. 4 . The method of claim 1 , wherein the encryption rule specifies applying a post-quantum cryptography method to the portion of the media segment.
  5. 5 . The method of claim 4 , wherein the post-quantum cryptography method is an eXtended Merkle Signature Scheme (XMSS) or a Leighton-Micali Signature (LMS).
  6. 6 . The method of claim 4 , wherein the post-quantum cryptography method comprises code-based cryptography, and wherein the code-based cryptography comprises a McEliece public key encryption system or a Niederreiter encryption algorithm.
  7. 7 . The method of claim 1 , wherein the bitstream further comprises a public key fingerprint or a shorter term public key fingerprint.
  8. 8 . The method of claim 7 , wherein the public key fingerprint comprises a 128-bit message digest algorithm 5 (MD5) fingerprint or a 12-bit secure hash algorithm 1 (SHA-1) fingerprint, or the public key fingerprint comprises a 160-bit message digest algorithm 5 (MD5) fingerprint or a 12-bit secure hash algorithm 1 (SHA-1) fingerprint.
  9. 9 . The method of claim 7 , wherein information associated with the public key fingerprint comprises user authentication information.
  10. 10 . The method of claim 1 , wherein the bitstream further comprises a public key or a shorter term public key.
  11. 11 . The method of claim 10 , wherein the public key is distributed using one or more trusted servers.
  12. 12 . The method of claim 10 , wherein the public key is sent in a supplemental enhancement information (SEI) message, and wherein an indication of a color component, the color format, a monochrome picture, or a usage of the SEI message is signaled in the SEI message.
  13. 13 . The method of claim 12 , wherein the indication is a 1-bit indication.
  14. 14 . The method of claim 12 , wherein the indication is equal to (ChromaFormatIdc==0) or (sps_chroma_format_idc==0).
  15. 15 . The method of claim 1 , wherein the conversion comprises decoding the media segment from the bitstream.
  16. 16 . The method of claim 1 , wherein the conversion comprises encoding the media segment into the bitstream.
  17. 17 . An apparatus for processing video data comprising a processor and a non-transitory memory with instructions thereon, wherein the instructions upon execution by the processor cause the processor to: perform a conversion between a media segment and a bitstream of the media segment, wherein the conversion conforms to a format rule and an encryption rule, wherein the format rule specifies an indication of an integrity of a portion of the media segment is signaled in the bitstream, wherein the encryption rule specifies applying at least one of message digest algorithm 5 (MD5), secure hash algorithm 0 (SHA-0), or secure hash algorithm 1 (SHA-1) to the portion of the media segment, and wherein the format rule further specifies that a region of the media segment is subject to a constraint, and wherein the format rule further specifies that the region is measured, based on a color format of the media segment, using a chroma unit or a luma unit.
  18. 18 . A non-transitory computer-readable recording medium storing a bitstream of a video which is generated by a method performed by a video processing apparatus, wherein the method comprises: generating the bitstream of a media segment, wherein the generating conforms to a format rule and an encryption rule, wherein the format rule specifies an indication of an integrity of a portion of the media segment is signaled in the bitstream, wherein the encryption rule specifies applying at least one of message digest algorithm 5 (MD5), secure hash algorithm 0 (SHA-0), or secure hash algorithm 1 (SHA-1) to the portion of the media segment, and wherein the format rule further specifies that a region of the media segment is subject to a constraint, and wherein the format rule further specifies that the region is measured, based on a color format of the media segment, using a chroma unit or a luma unit.

Description

CROSS-REFERENCE TO RELATED APPLICATION This application is a continuation of International Patent Application No. PCT/US2021/053008, filed on Sep. 30, 2021, which claims the priority to and benefits of U.S. Provisional Patent Application No. U.S. 63/085,906, filed on Sep. 30, 2020. All the aforementioned patent applications are hereby incorporated by reference in their entireties. TECHNICAL FIELD The present disclosure relates to digital media coding and decoding. BACKGROUND Digital video accounts for the largest bandwidth use on the internet and other digital communication networks. As the number of connected user devices capable of receiving and displaying video increases, it is expected that the bandwidth demand for digital video usage will continue to grow. SUMMARY The present disclosure discloses techniques that can be used by image, audio or video encoders and decoders for ensuring integrity of encoding operations, decoding operations, and encoded digital media segments. In one example aspect, a method of processing digital media is disclosed. The method includes performing a conversion between a media segment and a bitstream of the media segment, the conversion conforming to a format rule and an encryption rule, and the format rule specifying that an indication of an integrity of a portion of the media segment is signaled in the bitstream. In yet another example aspect, a media processing apparatus is disclosed. The apparatus comprises a processor configured to implement above-described methods. In yet another example aspect, a computer readable medium having code stored thereon is disclose. The code embodies one of the methods described herein in the form of processor-executable code. These, and other, features are described throughout the present disclosure. BRIEF DESCRIPTION OF DRAWINGS FIG. 1 shows an example of region-based image verification. FIG. 2 shows an example of a digital signature of the secure hash value. FIG. 3 shows an example of trusted timestamping. FIG. 4 is a block diagram of an example video processing system. FIG. 5 is a block diagram of a video processing apparatus. FIG. 6 is a block diagram that illustrates a video coding system in accordance with some embodiments of the present disclosure. FIG. 7 is a block diagram that illustrates an encoder in accordance with some embodiments of the present disclosure. FIG. 8 is a block diagram that illustrates a decoder in accordance with some embodiments of the present disclosure. FIG. 9 is a flowchart for an example method of digital media processing. DETAILED DESCRIPTION Section headings are used in the present disclosure for ease of understanding and do not limit the applicability of techniques and embodiments disclosed in each section only to that section. Furthermore, H.266 terminology is used in some description only for ease of understanding and not for limiting scope of the disclosed techniques. As such, the techniques described herein are applicable to other video codec protocols and designs also. 1. Summary The present disclosure is related to image/video system and coding technologies. Specifically, it provides a method to verify the image/video integrity, i.e., if the image/video has been modified from its certified source. It may be applicable to future image/video coding and/or system standards. It can also be applied as a core technology in general service for trustworthy image/video applications, e.g., telemedicine, source certified broadcasting. 2. Initial Discussion Video coding standards have evolved primarily through the development of the well-known International Telecommunication Union-Telecommunication Standardization Sector (ITU-T) and International Organization for Standardization (ISO)/International Electrotechnical Commission (IEC) standards. The ITU-T produced H.261 and H.263, ISO/IEC produced Moving Picture Experts Group (MPEG)-1 and MPEG-4 Visual, and the two organizations jointly produced the H.262/MPEG-2 Video and H.264/MPEG-4 Advanced Video Coding (AVC) and H.265/High Efficiency Video Coding (HEVC) standards. Since H.262, video coding standards are based on the hybrid video coding structure wherein temporal prediction plus transform coding are utilized. To explore future video coding technologies beyond HEVC, Joint Video Exploration Team (JVET) was founded by Video Coding Experts Group (VCEG) and MPEG jointly in 2015. Since then, many new methods have been adopted by JVET and put into the reference software named Joint Exploration Model (JEM). In April 2018, the Joint Video Expert Team (JVET) between VCEG (Q6/16) and ISO/IEC Joint Technical Committee (JTC 1) SC29/WG11 (MPEG) was created to work on the Versatile Video Coding (VVC) standard targeting at 50% bitrate reduction compared to HEVC. The latest version of VVC draft, i.e., Versatile Video Coding (Draft 8) can be found at: http://phenix.int-evry.fr/jvet/doc_end_user/documents/17_Brussels/wg11/JVET-Q2001-v13.zip And the latest test model software can