Search

US-20260127841-A1 - INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND NON-TRANSITORY COMPUTER-READABLE STORAGE MEDIUM

US20260127841A1US 20260127841 A1US20260127841 A1US 20260127841A1US-20260127841-A1

Abstract

An obtaining unit obtains first data of a first three-dimensional object and second data of a second three-dimensional object different from the first three-dimensional object. A first generating unit generates arrangement information for arranging the first three-dimensional object and the second three-dimensional object in a same coordinate system. A second generating unit generates metadata that is common for the first three-dimensional object and the second three-dimensional object. A third generating unit generates a first track that manages the first data, a second track that manages the second data, and a third track that manages the metadata. A fourth generating unit generates a single file that stores the first track, the second track, the third track, the first data, the second data, the arrangement information, and the metadata.

Inventors

  • Toru Suneya

Assignees

  • CANON KABUSHIKI KAISHA

Dates

Publication Date
20260507
Application Date
20251230
Priority Date
20230705

Claims (20)

  1. 1 . An information processing apparatus comprising: an obtaining unit configured to obtain first data of a first three-dimensional object and second data of a second three-dimensional object different from the first three-dimensional object; a first generating unit configured to generate arrangement information for arranging the first three-dimensional object and the second three-dimensional object in a same coordinate system; a second generating unit configured to generate metadata that is common for the first three-dimensional object and the second three-dimensional object; a third generating unit configured to generate a first track that manages the first data, a second track that manages the second data, and a third track that manages the metadata; and a fourth generating unit configured to generate a single file that stores the first track, the second track, the third track, the first data, the second data, the arrangement information, and the metadata.
  2. 2 . The information processing apparatus according to claim 1 , wherein the arrangement information includes information indicating a position in the same coordinate system, information indicating an orientation in the same coordinate system, and information indicating a size in the same coordinate system, for each of the first and second three-dimensional objects.
  3. 3 . The information processing apparatus according to claim 2 , wherein the information indicating the position of each of the first and second three-dimensional objects in the same coordinate system is a vector between coordinates of an origin of a local coordinate system of each of the first and second three-dimensional objects and coordinates of an origin of the same coordinate system, evaluated by a three-dimensional orthogonal coordinate system.
  4. 4 . The information processing apparatus according to claim 2 , wherein the information indicating the orientation of each of the first and second three-dimensional objects in the same coordinate system is a rotation angle between a coordinate axis of a local coordinate system of each of the first and second three-dimensional objects and a coordinate axis of the same coordinate system.
  5. 5 . The information processing apparatus according to claim 2 , wherein the information indicating the size of the first and second three-dimensional objects in the same coordinate system is a magnification or a reduction for each of X axis, Y axis, and Z axis values in a local coordinate system of each of the first and second three-dimensional objects and the same coordinate system, evaluated by a three-dimensional orthogonal coordinate system.
  6. 6 . The information processing apparatus according to claim 1 , wherein an origin of the same coordinate system is a reference position of a space managed by a space ID.
  7. 7 . The information processing apparatus according to claim 1 , wherein the metadata is a parameter set referred to when the first and second three-dimensional objects are decoded.
  8. 8 . The information processing apparatus according to claim 1 , wherein: the third generating unit generates, as each of the first, second, and third tracks, a box that stores index information indicating a reference location of data, and the fourth generating unit generates the single file such that the arrangement information and the metadata are managed by an index using a same box among the boxes.
  9. 9 . The information processing apparatus according to claim 8 , wherein: the second generating unit further generates field of view information indicating a viewpoint in the same coordinate system, and the fourth generating unit generates the single file such that the metadata and the field of view information are managed by an index using a same box among the boxes.
  10. 10 . The information processing apparatus according to claim 9 , wherein the field of view information is information indicating a viewpoint position in the same coordinate system.
  11. 11 . The information processing apparatus according to claim 8 , wherein: the second generating unit further generates light source information indicating a light source in the same coordinate system, and the fourth generating unit generates the single file such that the metadata and the light source information are managed by an index using a same box among the boxes.
  12. 12 . The information processing apparatus according to claim 11 , wherein the light source information includes at least one of a position, a direction, a light intensity, and a light color of the light source in the same coordinate system.
  13. 13 . The information processing apparatus according to claim 11 , wherein in a case where the light source includes a spot light, the light source information includes a light distribution angle of the spot light.
  14. 14 . The information processing apparatus according to claim 11 , wherein the light source information is information indicating a plurality of light sources in the same coordinate system.
  15. 15 . The information processing apparatus according to claim 11 , wherein in a case where the light source includes ambient light incident uniformly from all directions, the light source information includes information indicating a ratio of the ambient light to an entirety of the light source.
  16. 16 . A information processing apparatus comprising: an obtaining unit configured to obtain a single file in which data of a first three-dimensional object and data of a second three-dimensional object are stored; a first obtaining unit configured to obtain, from the file, arrangement information for arranging the first three-dimensional object and the second three-dimensional object in a same coordinate system; a second obtaining unit configured to obtain, from the file, metadata common to the first three-dimensional object and the second three-dimensional object; a decoding unit configured to, in a case where the first three-dimensional object and the second three-dimensional object can be decoded, decode the data of the first three-dimensional object and the data of the second three-dimensional object; and a rendering unit configured to render the first three-dimensional object and the second three-dimensional object in a space in the same coordinate system, based on the arrangement information and the first three-dimensional object and the second three-dimensional object that were decoded by the decoding unit.
  17. 17 . The information processing apparatus according to claim 16 , wherein the arrangement information includes information indicating a position in the same coordinate system, information indicating an orientation in the same coordinate system, and information indicating a size in the same coordinate system, for each of the first and second three-dimensional objects.
  18. 18 . The information processing apparatus according to claim 16 , wherein an origin of the same coordinate system is a reference position of a space managed by a space ID.
  19. 19 . The information processing apparatus according to claim 16 , wherein the metadata is profile information referred to when decoding the first and second three-dimensional objects.
  20. 20 . The information processing apparatus according to claim 16 , wherein: the second obtaining unit further obtains, from the file, field of view information indicating a viewpoint in the same coordinate system, and the rendering unit renders the first three-dimensional object and the second three-dimensional object based on the field of view information.

Description

CROSS-REFERENCE TO RELATED APPLICATIONS This application is a Continuation of International Patent Application No. PCT/JP 2024/023737, filed Jul. 1, 2024, which claims the benefit of Japanese Patent Application No. 2023-110826, filed Jul. 5, 2023, both of which are hereby incorporated by reference herein in their entirety. BACKGROUND Field of the Technology The present disclosure relates to an information processing apparatus, an information processing method, and a non-transitory computer-readable storage medium. Description of the Related Art Methods that use computer graphics are known as conventional methods for generating 3D object data. However, in recent years, methods for obtaining 3D object data by scanning the shapes real objects, people, and the like using dedicated devices, studios, and the like are being used more often. Efforts are also underway for data utilization in which multiple items of 3D object data generated or obtained in such a manner are arranged within the same three-dimensional space. For example, in the field of autonomous driving or driver assistance, progress is being made in developing systems for updating road information in real time by obtaining 3D object data of objects around roads, by in-vehicle remote sensing devices such as LiDAR (Light Detection And Ranging or Laser Imaging Detection and Ranging), and displaying the 3D object data superimposed onto a dynamic map. Meanwhile, MPEG (Moving Picture Experts Group), which is a group under the ISO (International Organization for Standardization) and the IEC (International Electrotechnical Commission), is advancing the standardization of specifications for encoding 3D object data such as point clouds or meshes, and file format standards for storing encoded 3D object data. International Publication No. 2020/137642 discloses a technique that makes it possible to, for example, change the quality of a portion of 3D object data by spatially dividing 3D object data encoded through G-PCC (Geometry based Point Cloud Compression), which is being standardized by MPEG, and then generating position information on the positions of the divided portions of the 3D object data in a three-dimensional space, and grouping information indicating that the divided portions belong to the same group. However, although the technique described in International Publication No. 2020/137642 discloses an aspect for dividing a single item of 3D object data, an aspect for storing multiple different items of 3D object data in a single file is not considered. Furthermore, glTF, which is a format for expressing 3D models, writes the structure and configuration of a three-dimensional space in JSON format, and makes it possible to arrange a plurality of items of 3D object data in the same three-dimensional space. However, glTF is a specification in which the three-dimensional space and object data are associated with each other using a URI, and it has been necessary to read out and analyze individual items of object data in order to determine whether that object data is in a data format that can be displayed. SUMMARY According to an embodiment of the present disclosure, a information processing apparatus is provided that utilizes a single file that stores information for displaying a plurality of items of three-dimensional object data in the same coordinate system. According to one embodiment of the present disclosure, an information processing apparatus comprises: an obtaining unit configured to obtain first data of a first three-dimensional object and second data of a second three-dimensional object different from the first three-dimensional object; a first generating unit configured to generate arrangement information for arranging the first three-dimensional object and the second three-dimensional object in a same coordinate system; a second generating unit configured to generate metadata that is common for the first three-dimensional object and the second three-dimensional object; a third generating unit configured to generate a first track that manages the first data, a second track that manages the second data, and a third track that manages the metadata; and a fourth generating unit configured to generate a single file that stores the first track, the second track, the third track, the first data, the second data, the arrangement information, and the metadata. Features of the present disclosure will become apparent from the following description of embodiments with reference to the attached drawings. BRIEF DESCRIPTION OF THE DRAWINGS The accompanying drawings, which are incorporated in and constitute a part of the specification, illustrate embodiments of the present disclosure, and together with the description, serve to explain the principles of the embodiments. FIG. 1 is a block diagram illustrating an example of the functional configuration of an information processing apparatus according to a first embodiment. FIG. 2 is a flowchart illustrating an ex