EP-4742684-A2 - SIGNALING THE PURPOSE OF PRESELECTION

Abstract

A method of processing media data. The method includes determining, for a conversion between the media data and a media data file, that a preselection element includes an indicator, wherein the indicator indicates that a purpose of the preselection element is for providing a picture-in-picture experience where a supplementary video appears to be overlaid on a target picture-in-picture region in a main video; and performing a conversion between the media data and the media data file based on the indicator. A corresponding video coding apparatus and non-transitory computer-readable recording medium are also disclosed.

Inventors

  • WANG, YE-KUI

Assignees

  • Lemon Inc.

Dates

Publication Date: 2026-05-13
Application Date: 2022-06-30

Claims (15)

  1. A method of processing media data, comprising: determining, for a conversion between the media data and a media data file, that a preselection element includes an indicator, wherein the indicator indicates that a purpose of the preselection element is for providing a picture-in-picture experience where a supplementary video appears to be overlaid on a target picture-in-picture region in a main video; and performing the conversion between the media data and the media data file based on the indicator.
  2. The method of claim 1, wherein the indicator comprises an @tag attribute.
  3. The method of claim 1, wherein the indicator comprises an @value attribute.
  4. The method of any one of claims 1-3, wherein the indicator has a value of "PicInPic" to indicate that the purpose of the preselection element is for providing the picture-in-picture experience.
  5. The method of any one of claims 1, 2 and 4, wherein the indicator comprises an @tag attribute having a value of "PicInPic" to indicate that the purpose of the preselection element is for providing the picture-in-picture experience.
  6. The method of any one of claims 1, 3 and 4, wherein the indicator comprises an @value attribute having a value of "PicInPic" to indicate that the purpose of the preselection element is for providing the picture-in-picture experience.
  7. The method of any one of claims 1, 2, 4 and 5, wherein the indicator comprises an @tag attribute within a CommonAttributesElements element in the preselection element, or wherein the indicator comprises an @tag attribute that also specifies a representation, adaption set, or the preselection element that may be used for selection purposes toward a decoder.
  8. The method of any one of claims 1, 3, 4 and 6, wherein the indicator comprises an @value attribute of a Role element in the preselection element, or, wherein the indicator comprises an @value attribute of a Role element that specifies information on a role annotation scheme.
  9. The method of any one of claims 1-8, wherein the preselection element includes an @dataUnitsReplacable attribute that specifies whether coded video data units representing the target picture-in-picture region in the main video can be replaced by corresponding video data units of the supplementary video; or wherein the preselection element includes an @regionsIds attribute that specifies an identifier (ID) for each coded video data unit representing the target picture-in-picture region in the main video.
  10. The method of any one of claims 1, 2, 4, 5 and 7, wherein the preselection element only includes an @dataUnitsReplacable attribute when the indicator is an @tag attribute having a value of "PicInPic," or wherein the preselection element only includes an @regionsIds attribute when the indicator is an @tag attribute having a value of "PicInPic."
  11. The method of any one of claims 1-10, wherein the preselection element is disposed in a Media Presentation Description (MPD) file, or wherein the preselection element is a Dynamic Adaptive Streaming over Hypertext Transfer Protocol (DASH) preselection element.
  12. The method of any one of claims 1-11, wherein the conversion includes encoding the media data into a bitstream.
  13. The method of any one of claims 1-11, wherein the conversion includes decoding the media data from a bitstream.
  14. An apparatus for processing media data comprising a processor and a non-transitory memory with instructions thereon, wherein the instructions upon execution by the processor, cause the processor to: determine, for a conversion between the media data and a media data file, that a preselection element includes an indicator, wherein the indicator indicates that a purpose of the preselection element is for providing a picture-in-picture experience where a supplementary video appears to be overlaid on a target picture-in-picture region in a main video; and perform the conversion between the media data and the media data file based on the indicator.
  15. A non-transitory computer-readable recording medium storing a Media Presentation Description (MPD) of a video which is generated by a method performed by a video processing apparatus, wherein the method comprises: determining that a preselection element includes an indicator, wherein the indicator indicates that a purpose of the preselection element is for providing a picture-in-picture experience where a supplementary video appears to be overlaid on a target picture-in-picture region in a main video; and generating the MPD based on the indicator.

Description

CROSS-REFERENCE TO RELATED APPLICATIONS

Under the applicable patent law and/or rules pursuant to the Paris Convention, this application is made to timely claim the priority to and benefits of U.S. Provisional Patent Application No. 63/216,975, filed on June 30, 2021. For all purposes under the law, the entire disclosure of the aforementioned application is incorporated by reference as part of the disclosure of this application.

TECHNICAL FIELD

The present disclosure is generally related to video streaming and, in particular, to the support of picture-in-picture services in Dynamic Adaptive Streaming over Hypertext Transfer Protocol (DASH).

BACKGROUND

Digital video accounts for the largest bandwidth use on the internet and other digital communication networks. As the number of connected user devices capable of receiving and displaying video increases, it is expected that the bandwidth demand for digital video usage will continue to grow.

SUMMARY

The disclosed aspects/embodiments provide techniques that incorporate an indicator in a preselection element. In an embodiment, the indicator indicates that the purpose of the preselection comprises picture-in-picture (a.k.a., a picture-in-picture experience or picture-in-picture services). Thus, the video coding process is improved relative to conventional video coding techniques.

A first aspect relates to a method of processing media data. The method includes determining, for a conversion between the media data and a media data file, that a preselection element includes an indicator, wherein the indicator indicates that a purpose of the preselection element is for providing a picture-in-picture experience where a supplementary video appears to be overlaid on a target picture-in-picture region in a main video; and performing the conversion between the media data and the media data file based on the indicator.
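
For illustration only, the writer side of the first aspect could be sketched as follows in Python. The sketch assumes the indicator is carried as an @tag attribute with the value "PicInPic" (as in claim 5) on a DASH Preselection element in the standard MPD namespace. The function name build_pip_preselection, the id "pip1", and the component ids "main" and "supp" are hypothetical and are not taken from the disclosure.

```python
import xml.etree.ElementTree as ET

# Standard DASH MPD namespace; registering it with an empty prefix makes
# ElementTree serialize it as the default namespace.
DASH_NS = "urn:mpeg:dash:schema:mpd:2011"
ET.register_namespace("", DASH_NS)


def build_pip_preselection(preselection_id, component_ids):
    """Return a Preselection element whose @tag signals picture-in-picture.

    Assumption: the indicator is the @tag attribute with value "PicInPic".
    """
    elem = ET.Element(f"{{{DASH_NS}}}Preselection")
    elem.set("id", preselection_id)
    # @preselectionComponents lists the ids of the components that make up
    # the preselection (here: a main video and a supplementary video).
    elem.set("preselectionComponents", " ".join(component_ids))
    elem.set("tag", "PicInPic")
    return elem


# Hypothetical ids, used only for this example.
preselection = build_pip_preselection("pip1", ["main", "supp"])
xml_text = ET.tostring(preselection, encoding="unicode")
print(xml_text)
```

A real packager would place this element inside a Period of a complete MPD; the sketch only shows where the indicator would sit.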

Optionally, in any of the preceding aspects, another implementation of the aspect provides that the indicator comprises an @tag attribute.

Optionally, in any of the preceding aspects, another implementation of the aspect provides that the indicator comprises an @value attribute.

Optionally, in any of the preceding aspects, another implementation of the aspect provides that the indicator has a value of "PicInPic" to indicate that the purpose of the preselection element is for providing the picture-in-picture experience.

Optionally, in any of the preceding aspects, another implementation of the aspect provides that the indicator comprises an @tag attribute having a value of "PicInPic" to indicate that the purpose of the preselection element is for providing the picture-in-picture experience.

Optionally, in any of the preceding aspects, another implementation of the aspect provides that the indicator comprises an @value attribute having a value of "PicInPic" to indicate that the purpose of the preselection element is for providing the picture-in-picture experience.

Optionally, in any of the preceding aspects, another implementation of the aspect provides that the indicator comprises an @tag attribute within a CommonAttributesElements element in the preselection element.

Optionally, in any of the preceding aspects, another implementation of the aspect provides that the indicator comprises an @value attribute of a Role element in the preselection element.

Optionally, in any of the preceding aspects, another implementation of the aspect provides that the indicator comprises an @tag attribute that also specifies a representation, adaption set, or the preselection element that may be used for selection purposes toward a decoder.

Optionally, in any of the preceding aspects, another implementation of the aspect provides that the indicator comprises an @value attribute of a Role element that specifies information on a role annotation scheme.
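
As a rough sketch of the reader side, a client could check each Preselection element for either form of the indicator described above: an @tag attribute equal to "PicInPic", or a Role child whose @value equals "PicInPic". The Python below is illustrative only. The sample MPD, its ids, the helper names is_picture_in_picture and pip_preselection_ids, and the choice of "urn:mpeg:dash:role:2011" as the Role scheme are assumptions made for the example, not part of the disclosure.

```python
import xml.etree.ElementTree as ET

# Prefix map for the standard DASH MPD namespace.
NS = {"dash": "urn:mpeg:dash:schema:mpd:2011"}

# Hypothetical MPD fragment: "a" uses @tag, "b" uses Role@value,
# and "c" carries no picture-in-picture indicator.
SAMPLE_MPD = """<MPD xmlns="urn:mpeg:dash:schema:mpd:2011">
  <Period>
    <Preselection id="a" preselectionComponents="1 2" tag="PicInPic"/>
    <Preselection id="b" preselectionComponents="1 3">
      <Role schemeIdUri="urn:mpeg:dash:role:2011" value="PicInPic"/>
    </Preselection>
    <Preselection id="c" preselectionComponents="1 4"/>
  </Period>
</MPD>"""


def is_picture_in_picture(preselection):
    """Return True if the element carries either form of the indicator."""
    if preselection.get("tag") == "PicInPic":
        return True
    return any(
        role.get("value") == "PicInPic"
        for role in preselection.findall("dash:Role", NS)
    )


def pip_preselection_ids(mpd_text):
    """Return the ids of all Preselection elements marked picture-in-picture."""
    root = ET.fromstring(mpd_text)
    return [
        p.get("id")
        for p in root.iterfind(".//dash:Preselection", NS)
        if is_picture_in_picture(p)
    ]


print(pip_preselection_ids(SAMPLE_MPD))  # ['a', 'b']
```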

Optionally, in any of the preceding aspects, another implementation of the aspect provides that the preselection element includes an @dataUnitsReplacable attribute that specifies whether coded video data units representing the target picture-in-picture region in the main video can be replaced by corresponding video data units of the supplementary video.

Optionally, in any of the preceding aspects, another implementation of the aspect provides that the preselection element only includes an @dataUnitsReplacable attribute when the indicator is an @tag attribute having a value of "PicInPic."

Optionally, in any of the preceding aspects, another implementation of the aspect provides that the preselection element includes an @regionsIds attribute that specifies an identifier (ID) for each coded video data unit representing the target picture-in-picture region in the main video.

Optionally, in any of the preceding aspects, another implementation of the aspect provides that the preselection element only includes an @regionsIds attribute when the indicator is an @tag attribute having a value of "PicInPic."

Optionally, in any of the preceding aspects