US-20260129199-A1 - IMAGE DATA ENCODING/DECODING METHOD AND APPARATUS

US20260129199A1US 20260129199 A1US20260129199 A1US 20260129199A1US-20260129199-A1

Abstract

A method for decoding a 360-degree image includes: receiving a bitstream obtained by encoding a 360-degree image; generating a prediction image by making reference to syntax information obtained from the received bitstream; combining the generated prediction image with a residual image obtained by dequantizing and inverse-transforming the bitstream, so as to obtain a decoded image; and reconstructing the decoded image into a 360-degree image according to a projection format. Here, generating the prediction image includes: checking, from the syntax information, prediction mode accuracy for a current block to be decoded; determining whether the checked prediction mode accuracy corresponds to most probable mode (MPM) information obtained from the syntax information; and when the checked prediction mode accuracy does not correspond to the MPM information, reconfiguring the MPM information according to the prediction mode accuracy for the current block.

Inventors

Ki Baek Kim

Assignees

B1 INSTITUTE OF IMAGE TECHNOLOGY, INC.

Dates

Publication Date: 20260507
Application Date: 20260105
Priority Date: 20161004

Claims (6)

1 . A method of decoding an image, comprising: predicting blocks in the image to generate prediction blocks of the blocks; reconstructing the image based on the prediction blocks; filtering the reconstructed image; and determining regions corresponding to objects included in the filtered image, wherein the regions are determined based on information indicating a number of the regions.
2 . The method of claim 1 , wherein a size of the filtered image is smaller than a size of an image indicated by encoding information.
3 . The method of claim 1 , wherein the information is obtained from a supplemental enhancement information (SEI) message of a bitstream.
4 . The method of claim 1 , based on a value of flag obtained from a bitstream, the information is determined depending on information included in a previous SEI message.
5 . A method of encoding an image, comprising: predicting blocks in the image to generate prediction blocks of the blocks; reconstructing the image based on the prediction blocks; filtering the reconstructed image; determining regions corresponding to objects included in the filtered image; and generating information indicating a number of the identified region.
6 . A method of transmitting a bitstream, comprising: predicting blocks in the image to generate prediction blocks of the blocks; reconstructing the image based on the prediction blocks; filtering the reconstructed image; determining regions corresponding to objects included in the filtered image; generating information indicating a number of the identified region, generating the bitstream including the generated information; and transmitting the bitstream.

Description

RELATED APPLICATIONS This application is a continuation application of U.S. patent application Ser. No. 18/949,072, filed Nov. 15, 2024, which is a continuation application of U.S. patent application Ser. No. 18/498,597, filed Oct. 31, 2023, which is now U.S. Pat. No. 12,231,610, which is a continuation application of U.S. patent application Ser. No. 18/314,965, filed May 10, 2023, which is now U.S. Pat. No. 11,902,668, which is a continuation application of U.S. patent application Ser. No. 17/579,225, filed Jan. 19, 2022, which is now U.S. Pat. No. 11,696,035, which is a continuation application of U.S. patent application Ser. No. 17/027,154, filed Sep. 21, 2020, which is now U.S. Pat. No. 11,483,476, which is a continuation application of U.S. patent application Ser. No. 16/372,287, filed Apr. 1, 2019, which is a continuation application of the International Patent Application Serial No. PCT/KR2017/011138, filed Oct. 10, 2017, which claims priority to the Korean Patent Application Serial No. 10-2016-0127887, filed Oct. 4, 2016; the Korean Patent Application Serial No. 10-2016-0129386, filed Oct. 6, 2016; and the Korean Patent Application Serial No. 10-2017-0090616, filed Jul. 17, 2017. All of these applications are incorporated by reference herein in their entireties. TECHNICAL FIELD The present invention relates to image data encoding and decoding technology, and more particularly, to a method and apparatus for encoding and decoding a 360-degree image for realistic media service. BACKGROUND With the spread of the Internet and mobile terminals and the development of information and communication technology, the use of multimedia data is increasing rapidly. Recently, demand for high-resolution images and high-quality images such as a high definition (HD) image and an ultra high definition (UHD) image is emerging in various fields, and demand for realistic media service such as virtual reality, augmented reality, and the like is increasing rapidly. In particular, since multi-view images captured with a plurality of cameras are processed for 360-degree images for virtual reality and augmented reality, the amount of data generated for the processing increases massively, but the performance of an image processing system for processing a large amount of data is insufficient. As described above, in an image encoding and decoding method and apparatus of the related art, there is a demand for improvement of performance in image processing, particularly, image encoding/decoding. SUMMARY It is an object of the present invention to provide a method for improving an image setting process in initial steps for encoding and decoding. More particularly, the present invention is directed to providing an encoding and decoding method and apparatus for improving an image setting process in consideration of the characteristics of a 360-degree image. According to an aspect of the present invention, there is provided a method of decoding a 360-degree image. Here, the method of decoding a 360-degree image may include receiving a bitstream including an encoded 360-degree image, generating a predicted image with reference to syntax information acquired from the received bitstream, acquiring a decoded image by combining the generated predicted image with a residual image acquired by inversely quantizing and inversely transforming the bitstream, and reconstructing the decoded image into the 360-degree image according to a projection format. Here, the syntax information may include projection format information for the 360-degree image. Here, the projection format information may be information indicating at least one of an Equi-Rectangular Projection (ERP) format in which the 360-degree image is projected into a 2D plane, a CubeMap Projection (CMP) format in which the 360-degree image is projected to a cube, an OctaHedron Projection (OHP) format in which the 360-degree image is projected to an octahedron, and an IcoSahedral Projection (ISP) format in which the 360-degree image is projected to a polyhedron. Here, the reconstructing may include acquiring arrangement information according to region-wise packing with reference to the syntax information and rearranging blocks of the decoded image according to the arrangement information. Here, the generating of the predicted image may include performing image expansion on a reference picture acquired by restoring the bitstream, and generating a predicted image with reference to the reference picture on which the image expansion is performed. Here, the performing of the image expansion may include performing image expansion on the basis of partitioning units of the reference picture. Here, the performing of the image expansion on the basis of the partitioning units may include generating an expanded region individually for each partitioning unit by using the reference pixel of the partitioning unit. Here, the expanded region may be generated using a boundary pixel of a partitioning unit sp