EP-4742687-A1 - INFORMATION PROCESSING DEVICE, REPRODUCTION DEVICE, INFORMATION PROCESSING METHOD, AND PROGRAM
Abstract
An obtaining means obtains first data of a first three-dimensional object and second data of a second three-dimensional object different from the first three-dimensional object. A first generating means generates arrangement information for arranging the first three-dimensional object and the second three-dimensional object in a same coordinate system. A second generating means generates metadata that is common for the first three-dimensional object and the second three-dimensional object. A third generating means generates a first track that manages the first data, a second track that manages the second data, and a third track that manages the metadata. A fourth generating means generates a single file that stores the first track, the second track, the third track, the first data, the second data, the arrangement information, and the metadata.
Inventors
- Suneya, Toru
Assignees
- Canon Kabushiki Kaisha
Dates
- Publication Date
- 20260513
- Application Date
- 20240701
Claims (20)
- An information processing apparatus comprising: an obtaining means for obtaining first data of a first three-dimensional object and second data of a second three-dimensional object different from the first three-dimensional object; a first generating means for generating arrangement information for arranging the first three-dimensional object and the second three-dimensional object in a same coordinate system; a second generating means for generating metadata that is common for the first three-dimensional object and the second three-dimensional object; a third generating means for generating a first track that manages the first data, a second track that manages the second data, and a third track that manages the metadata; and a fourth generating means for generating a single file that stores the first track, the second track, the third track, the first data, the second data, the arrangement information, and the metadata.
- The information processing apparatus according to claim 1, characterized in that the arrangement information includes information indicating a position in the same coordinate system, information indicating an orientation in the same coordinate system, and information indicating a size in the same coordinate system, for each of the first and second three-dimensional objects.
- The information processing apparatus according to claim 2, characterized in that the information indicating the position of each of the first and second three-dimensional objects in the same coordinate system is a vector between coordinates of an origin of a local coordinate system of each of the first and second three-dimensional objects and coordinates of an origin of the same coordinate system, evaluated by a three-dimensional orthogonal coordinate system.
- The information processing apparatus according to claim 2 or 3, characterized in that the information indicating the orientation of each of the first and second three-dimensional objects in the same coordinate system is a rotation angle between a coordinate axis of a local coordinate system of each of the first and second three-dimensional objects and a coordinate axis of the same coordinate system.
- The information processing apparatus according to any one of claims 2 to 4, characterized in that the information indicating the size of the first and second three-dimensional objects in the same coordinate system is a magnification or a reduction for each of X axis, Y axis, and Z axis values in a local coordinate system of each of the first and second three-dimensional objects and the same coordinate system, evaluated by a three-dimensional orthogonal coordinate system.
- The information processing apparatus according to any one of claims 1 to 5, characterized in that an origin of the same coordinate system is a reference position of a space managed by a space ID.
- The information processing apparatus according to any one of claims 1 to 6, characterized in that the metadata is a parameter set referred to when the first and second three-dimensional objects are decoded.
- The information processing apparatus according to any one of claims 1 to 7, characterized in that : the third generating means generates, as each of the first, second, and third tracks, a box that stores index information indicating a reference location of data, and the fourth generating means generates the single file such that the arrangement information and the metadata are managed by an index using a same box among the boxes.
- The information processing apparatus according to claim 8, characterized in that : the second generating means further generates field of view information indicating a viewpoint in the same coordinate system, and the fourth generating means generates the single file such that the metadata and the field of view information are managed by an index using a same box among the boxes.
- The information processing apparatus according to claim 9, characterized in that the field of view information is information indicating a viewpoint position in the same coordinate system.
- The information processing apparatus according to any one of claims 8 to 10, characterized in that : the second generating means further generates light source information indicating a light source in the same coordinate system, and the fourth generating means generates the single file such that the metadata and the light source information are managed by an index using a same box among the boxes.
- The information processing apparatus according to claim 11, characterized in that the light source information includes at least one of a position, a direction, a light intensity, and a light color of the light source in the same coordinate system.
- The information processing apparatus according to claim 11 or 12, characterized in that in a case where the light source includes a spot light, the light source information includes a light distribution angle of the spot light.
- The information processing apparatus according to any one of claims 11 to 13, characterized in that the light source information is information indicating a plurality of light sources in the same coordinate system.
- The information processing apparatus according to any one of claims 11 to 14, characterized in that in a case where the light source includes ambient light incident uniformly from all directions, the light source information includes information indicating a ratio of the ambient light to an entirety of the light source.
- A playback apparatus comprising: an obtaining means for obtaining a single file in which data of a first three-dimensional object and data of a second three-dimensional object are stored; a first obtaining means for obtaining, from the file, arrangement information for arranging the first three-dimensional object and the second three-dimensional object in a same coordinate system; a second obtaining means for obtaining, from the file, metadata common to the first three-dimensional object and the second three-dimensional object; a decoding means for, in a case where the first three-dimensional object and the second three-dimensional object can be decoded, decoding the data of the first three-dimensional object and the data of the second three-dimensional object; and a rendering means for rendering the first three-dimensional object and the second three-dimensional object in a space in the same coordinate system, based on the arrangement information and the first three-dimensional object and the second three-dimensional object that were decoded by the decoding means.
- The playback apparatus according to claim 16, characterized in that the arrangement information includes information indicating a position in the same coordinate system, information indicating an orientation in the same coordinate system, and information indicating a size in the same coordinate system, for each of the first and second three-dimensional objects.
- The playback apparatus according to claim 16 or 17, characterized in that an origin of the same coordinate system is a reference position of a space managed by a space ID.
- The playback apparatus according to any one of claims 16 to 18, characterized in that the metadata is profile information referred to when decoding the first and second three-dimensional objects.
- The playback apparatus according to any one of claims 16 to 19, characterized in that : the second obtaining means further obtains, from the file, field of view information indicating a viewpoint in the same coordinate system, and the rendering means renders the first three-dimensional object and the second three-dimensional object based on the field of view information.
Description
TECHNICAL FIELD The present invention relates to an information processing apparatus, a playback apparatus, an information processing method, and a program. BACKGROUND ART Methods that use computer graphics are known as conventional methods for generating 3D object data. However, in recent years, methods for obtaining 3D object data by scanning the shapes real objects, people, and the like using dedicated devices, studios, and the like are being used more often. Efforts are also underway for data utilization in which multiple items of 3D object data generated or obtained in such a manner are arranged within the same three-dimensional space. For example, in the field of autonomous driving or driver assistance, progress is being made in developing systems for updating road information in real time by obtaining 3D object data of objects around roads, by in-vehicle remote sensing devices such as LiDAR (Light Detection And Ranging or Laser Imaging Detection and Ranging), and displaying the 3D object data superimposed onto a dynamic map. Meanwhile, MPEG (Moving Picture Experts Group), which is a group under the ISO (International Organization for Standardization) and the IEC (International Electrotechnical Commission), is advancing the standardization of specifications for encoding 3D object data such as point clouds or meshes, and file format standards for storing encoded 3D object data. Patent Literature (PTL) 1 discloses a technique that makes it possible to, for example, change the quality of a portion of 3D object data by spatially dividing 3D object data encoded through G-PCC (Geometry based Point Cloud Compression), which is being standardized by MPEG, and then generating position information on the positions of the divided portions of the 3D object data in a three-dimensional space, and grouping information indicating that the divided portions belong to the same group. CITATION LIST PATENT LITERATURE PTL 1: International Publication No. 2020/137642 SUMMARY OF INVENTION TECHNICAL PROBLEM However, although the technique described in PTL 1 discloses an aspect for dividing a single item of 3D object data, an aspect for storing multiple different items of 3D object data in a single file is not considered. Furthermore, glTF, which is a format for expressing 3D models, writes the structure and configuration of a three-dimensional space in JSON format, and makes it possible to arrange a plurality of items of 3D object data in the same three-dimensional space. However, glTF is a specification in which the three-dimensional space and object data are associated with each other using a URI, and it has been necessary to read out and analyze individual items of object data in order to determine whether that object data is in a data format that can be displayed. An object of the present invention is to utilize a single file that stores information for displaying a plurality of items of three-dimensional object data in the same coordinate system. SOLUTION TO PROBLEM To achieve the object of the present invention, an information processing apparatus according to one embodiment has the following configuration, for example. In other words, An information processing apparatus comprises: an obtaining means for obtaining first data of a first three-dimensional object and second data of a second three-dimensional object different from the first three-dimensional object; a first generating means for generating arrangement information for arranging the first three-dimensional object and the second three-dimensional object in a same coordinate system; a second generating means for generating metadata that is common for the first three-dimensional object and the second three-dimensional object; a third generating means for generating a first track that manages the first data, a second track that manages the second data, and a third track that manages the metadata; and a fourth generating means for generating a single file that stores the first track, the second track, the third track, the first data, the second data, the arrangement information, and the metadata. ADVANTAGEOUS EFFECTS OF INVENTION A single file that stores information for displaying a plurality of items of three-dimensional object data in the same coordinate system is utilized. Other features and advantages of the present invention will be apparent from the following description taken in conjunction with the accompanying drawings. Note that the same reference numerals denote the same or like components throughout the accompanying drawings. BRIEF DESCRIPTION OF DRAWINGS The accompanying drawings, which are incorporated in and constitute a part of the specification, illustrate embodiments of the invention and, together with the description, serve to explain principles of the invention. FIG. 1 is a block diagram illustrating an example of the functional configuration of an information processing apparatus according to a first embodiment.FIG. 2 is a flowchart illustrati