Search

JP-2026075545-A - Information processing device and program

JP2026075545AJP 2026075545 AJP2026075545 AJP 2026075545AJP-2026075545-A

Abstract

[Problem] The purpose of this application is to facilitate the expression of the data to be edited as a string of natural language characters and to provide instructions to the generating AI for data editing. [Solution] The information processing device in this application comprises: a receiving means for receiving a selection of an object contained in data and a natural language string from a user; a identifying means for identifying a natural language string corresponding to the object once the object has been selected; and an input means for inputting the data, the identified string, and the received natural language string to a generating AI, wherein, upon input of the identified string and the received natural language string to the generating AI, the generating AI performs processing on the selected object based on the received natural language string. [Selection Diagram] Figure 7

Inventors

  • 戸田 航平
  • 望月 宏史

Assignees

  • キヤノン株式会社

Dates

Publication Date
20260508
Application Date
20241022

Claims (19)

  1. A means of receiving the selection of objects included in the data and natural language strings from the user, A means for identifying a natural language string corresponding to the object, upon selection of the object, The system includes input means for inputting the aforementioned data, the identified string, and the received natural language string into a generating AI. An information processing device characterized in that, when the identified string and the received natural language string are input to the generating AI, the generating AI performs processing on the selected object based on the received natural language string.
  2. It further has a means of display, The information processing apparatus according to claim 1, characterized in that the display means displays an object on which the processing has been performed by the generating AI.
  3. It further has a receiving means, The receiving means receives a natural language string obtained by inputting the data into the generating AI, which corresponds to the object contained in the data. The information processing apparatus according to claim 1, characterized in that the identifying means identifies a natural language string corresponding to the object contained in the data based on the received natural language string.
  4. It further has a means of recognition, The recognition means recognizes the object contained in the data and associates the natural language string with the recognized object. The information processing apparatus according to claim 1, characterized in that the identifying means identifies a natural language string corresponding to the object contained in the data based on the natural language string associated by the recognition means.
  5. The information processing apparatus according to claim 1, characterized in that the object is an image, and the data is image data including the image.
  6. The information processing apparatus according to claim 1, characterized in that the object is a string, and the data is image data containing the string.
  7. The information processing device according to claim 1, characterized in that the natural language string corresponding to the object is the name of the object.
  8. A means for receiving data from the user, including the selection of a data area and a natural language string, A means for identifying a natural language string corresponding to the selected region, The system includes input means for inputting the aforementioned data, the identified string, and the received natural language string into a generating AI. An information processing device characterized in that, when the identified string and the received natural language string are input to the generating AI, the generating AI performs processing based on the received natural language string in the selected region.
  9. It further has a means of display, The information processing apparatus according to claim 8, characterized in that the display means displays objects included in the region where the processing was performed by the generating AI.
  10. It further has a receiving means, The receiving means receives a natural language string obtained by inputting the data and the selected region into the generating AI, the natural language string corresponding to the selected region included in the data, The information processing apparatus according to claim 8, characterized in that the identifying means identifies a natural language string corresponding to the selected region contained in the data based on the received natural language string.
  11. It further has a means of recognition, The recognition means recognizes the region selected by the user and associates the natural language string with the recognized region. The information processing apparatus according to claim 8, characterized in that the identifying means identifies a natural language string corresponding to the selected region included in the data based on the natural language string associated by the recognition means.
  12. The information processing apparatus according to claim 8, characterized in that the region is a region in the image indicated by the data.
  13. A process for receiving data, including the selection of objects included in the data and natural language strings from the user, A program that causes a computer to perform the following steps: an identification step of identifying a natural language string corresponding to an object upon selection of the object; and an input step of inputting the data, the identified string, and the received natural language string to a generating AI, A program characterized in that, when the identified string and the received natural language string are input to the generating AI, the generating AI performs processing on the selected object based on the received natural language string.
  14. The process further includes a display step, The program according to claim 13, wherein the display step displays the object on which the processing was performed by the generating AI.
  15. It further has a receiving process, The receiving step receives a natural language string obtained by inputting the data into the generating AI in the input step, which is a natural language string corresponding to the object contained in the data. The program according to claim 13, wherein the specified step identifies a natural language string corresponding to the object contained in the data based on the received natural language string.
  16. It further includes a recognition process, The recognition step recognizes the object contained in the data and associates the natural language string with the recognized object. The program according to claim 13, wherein the identification step identifies a natural language string corresponding to the object included in the data based on the natural language string associated by the recognition step.
  17. The program according to claim 13, characterized in that the object is an image, and the data is image data including the image.
  18. The program according to claim 13, characterized in that the object is a string, and the data is image data containing the string.
  19. The program according to claim 13, characterized in that the natural language string corresponding to the object is the name of the object.

Description

This invention relates to an information processing device and a program. Conversational AI (AI), such as chatbots and Generative Artificial Intelligence (Generative Artificial Intelligence), has been developed, and a variety of AI-based services are being offered. Patent Document 1 discloses a system that displays a preview image of a car on a display, and by inputting natural language instructions (prompts) to change the body color via chat to the Generative AI, the edited preview image of the car is displayed. Japanese Patent Publication No. 2024-25293 This diagram shows an example of the overall configuration of this system.This diagram shows an example of the computer hardware configuration of this system.This diagram shows an example of the hardware configuration of the AI server that generates this system.This diagram shows an example of the printer hardware configuration in this system.This diagram shows an example of the software configuration of this system.This diagram shows an example of the print application screen in this system.This diagram shows an example of the print application screen when an image is selected in the preview area of the print application screen.This diagram shows an example of the print application screen after selecting an image in the preview area of the print application and executing the conversion process.This diagram shows an example of the print application screen when an area is manually selected in the preview area of the print application screen.This diagram shows an example of the print application screen when a region is manually selected in the preview area of the print application screen and the conversion process is executed.This sequence diagram shows an example of the process of converting and printing images via chat in this system.A flowchart illustrating an example of object recognition processing in this system.This figure shows an example of the results of object recognition processing in this system.This diagram shows an example of the structure of historical data in this system. The embodiments will be described in detail below with reference to the attached drawings. Note that the following embodiments do not limit the invention as defined in the claims. While multiple features are described in the embodiments, not all of these features are essential to the invention, and the features may be combined in any way. Furthermore, in the attached drawings, identical or similar configurations are given the same reference numerals, and redundant descriptions are omitted. <Implementation> <System Configuration> The following describes a first embodiment of the present invention. First, the network configuration of the printing system according to this embodiment will be described with reference to Figure 1. As shown in Figure 1, this printing system comprises a terminal device, a computer 1000, a printer 2000 configured for printing, and a generation AI server 3000. The computer 1000 and printer 2000 are located, for example, within an office and are connected to each other via a network 4000. The network 4000 is connected to the external internet 5000 via a router (not shown). As a result, the generation AI server 3000, the computer 1000, and the printer 2000 are all connected to the internet 5000 and are able to communicate with each other. Here, computer 1000 is an example of an information processing device, user terminal, or terminal device; printer 2000 is an example of an image processing device, image forming apparatus, or MFP; and generation AI server 3000 is an example of an information processing device. MFP stands for Multi Function Peripheral. The generation AI server 3000 provides the generation AI service 3100. Furthermore, the printing application 1100, described later, is executed and provided by computer 1000 or printer 2000. <Hardware Configuration> Referring to Figures 2, 3, and 4, an example of the hardware configuration of each device constituting the printing system according to this embodiment will be described. Figure 2 shows an example of the hardware configuration of the computer 1000. Figure 3 shows an example of the hardware configuration of the generation AI server 3000. Figure 4 shows an example of the hardware configuration of the printer 2000. As shown in Figure 2, the computer 1000 is composed of a CPU 111, ROM 112, RAM 113, HDD 114, and network I/F 115. Note that CPU stands for Central Processing Unit, ROM for Read Only Memory, RAM for Random Access Memory, HDD for Hard Disk Drive, and I/F for Interface. The CPU 111 controls the overall operation by reading control programs stored in the ROM 112 and HDD 114 and executing various processes. The RAM 113 is used as the CPU 111's main memory and temporary storage area, such as the work area. The HDD 114 is a large-capacity storage unit that stores image data and various programs. The network interface 115 is an interface for connecting to the internet. It receives processing