JP-7854925-B2 - Generation apparatus, generation method, and generation program
Inventors
- 土屋 裕子
- 那須 弘明
- 内海 幸治
Assignees
- 株式会社日立製作所
Dates
- Publication Date
- 20260507
- Application Date
- 20221130
Claims (12)
- A generating apparatus comprising a processor that executes a program and a storage device that stores the program, wherein the processor executes: a search process that retrieves document data from an information source based on a search keyword; a generation process that generates a co-occurrence network by connecting words that co-occur in each sentence of the document data retrieved by the search process, the co-occurrence network being generated based on a condition for increasing or decreasing the number of words in the co-occurrence network; a modification process that changes the condition; an update process that updates the co-occurrence network generated by the generation process based on the condition changed by the modification process; a calculation process that calculates the similarity between the search keyword and each word in the co-occurrence network; and an output process that outputs the co-occurrence network in a displayable format based on the similarity of the words in the co-occurrence network calculated by the calculation process.
- A generating apparatus comprising a processor that executes a program and a storage device that stores the program, wherein the processor executes: a search process that retrieves document data from an information source based on a search keyword; a generation process that generates a co-occurrence network by connecting words that co-occur in each sentence of the document data retrieved by the search process, the co-occurrence network being generated based on a condition for increasing or decreasing the number of words in the co-occurrence network; a modification process that changes the condition; an update process that updates the co-occurrence network generated by the generation process based on the condition changed by the modification process; a comparison process that compares a first word in the co-occurrence network with a second word in another co-occurrence network; and an output process that associates the first word with the second word based on the comparison result of the comparison process and outputs the co-occurrence network and the other co-occurrence network in a displayable format.
- The generating apparatus according to claim 2, wherein, in the comparison process, the processor detects a specific second word that is similar to the first word, and, in the output process, the processor associates the first word with the specific second word and outputs the co-occurrence network and the other co-occurrence network in a displayable format.
- The generating apparatus according to claim 2, wherein, in the comparison process, the processor detects a specific second word that is similar to the first word and dissimilar to the title of the other co-occurrence network, and, in the output process, the processor associates the first word with the specific second word and outputs the co-occurrence network and the other co-occurrence network in a displayable format.
- The generating apparatus according to claim 2, wherein, in the comparison process, the processor identifies a first word pair consisting of the first word and a third word in the co-occurrence network that co-occurs with the first word and has a grammatical connection to the first word, identifies a second word pair consisting of the second word and a fourth word in the other co-occurrence network that co-occurs with the second word and has a grammatical connection to the second word, and detects a relationship between the expressions of the first word pair and the second word pair, and, in the output process, the processor associates the first word pair with the second word pair and outputs the co-occurrence network and the other co-occurrence network in a displayable format.
- The generating apparatus according to claim 1 or 2, wherein the condition defines a range for the number of occurrences of words in the co-occurrence network.
- The generating apparatus according to claim 1 or 2, wherein the condition is that the co-occurrence network is generated using words whose co-occurrence probability falls within a predetermined probability range.
- The generating apparatus according to claim 1 or 2, wherein the conditions include a condition that defines a range for the number of occurrences of words in the co-occurrence network and a condition that the co-occurrence network is generated using words for which the co-occurrence probability of co-occurring word pairs falls within a predetermined probability range.
- A generation method performed by a generation apparatus comprising a processor that executes a program and a storage device that stores the program, wherein the processor executes: a search process that retrieves document data from an information source based on a search keyword; a generation process that generates a co-occurrence network by connecting words that co-occur in each sentence of the document data retrieved by the search process, the co-occurrence network being generated based on a condition for increasing or decreasing the number of words in the co-occurrence network; a modification process that changes the condition; an update process that updates the co-occurrence network generated by the generation process based on the condition changed by the modification process; a calculation process that calculates the similarity between the search keyword and each word in the co-occurrence network; and an output process that outputs the co-occurrence network in a displayable format based on the similarity of the words in the co-occurrence network calculated by the calculation process.
- A generation method performed by a generation apparatus comprising a processor that executes a program and a storage device that stores the program, wherein the processor executes: a search process that retrieves document data from an information source based on a search keyword; a generation process that generates a co-occurrence network by connecting words that co-occur in each sentence of the document data retrieved by the search process, the co-occurrence network being generated based on a condition for increasing or decreasing the number of words in the co-occurrence network; a modification process that changes the condition; an update process that updates the co-occurrence network generated by the generation process based on the condition changed by the modification process; a comparison process that compares a first word in the co-occurrence network with a second word in another co-occurrence network; and an output process that associates the first word with the second word based on the comparison result of the comparison process and outputs the co-occurrence network and the other co-occurrence network in a displayable format.
- A generation program that causes a processor to execute: a search process that retrieves document data from an information source based on a search keyword; a generation process that generates a co-occurrence network by connecting words that co-occur in each sentence of the document data retrieved by the search process, the co-occurrence network being generated based on a condition for increasing or decreasing the number of words in the co-occurrence network; a modification process that changes the condition; an update process that updates the co-occurrence network generated by the generation process based on the condition changed by the modification process; a calculation process that calculates the similarity between the search keyword and each word in the co-occurrence network; and an output process that outputs the co-occurrence network in a displayable format based on the similarity of the words in the co-occurrence network calculated by the calculation process.
- A generation program that causes a processor to execute: a search process that retrieves document data from an information source based on a search keyword; a generation process that generates a co-occurrence network by connecting words that co-occur in each sentence of the document data retrieved by the search process, the co-occurrence network being generated based on a condition for increasing or decreasing the number of words in the co-occurrence network; a modification process that changes the condition; an update process that updates the co-occurrence network generated by the generation process based on the condition changed by the modification process; a comparison process that compares a first word in the co-occurrence network with a second word in another co-occurrence network; and an output process that associates the first word with the second word based on the comparison result of the comparison process and outputs the co-occurrence network and the other co-occurrence network in a displayable format.
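The generation, modification, and update processes recited in the claims can be illustrated with a minimal Python sketch. It is not the patented implementation. The function name `build_cooccurrence_network`, its parameters, and the sample sentences are hypothetical, and the claims do not define how co-occurrence probability is computed; here it is assumed to be the fraction of sentences that contain both words. Input sentences are assumed to be tokenized already (for Japanese text, by a morphological analyzer).

```python
from collections import Counter
from itertools import combinations


def build_cooccurrence_network(sentences, min_count=1, max_count=None,
                               min_prob=0.0, max_prob=1.0):
    """Build an undirected co-occurrence network from tokenized sentences.

    sentences: list of word lists, one list per sentence.
    min_count / max_count: assumed bounds on how many sentences a word
        may appear in (the "range of the number of occurrences").
    min_prob / max_prob: assumed bounds on co-occurrence probability,
        defined here as (sentences containing both words) / (all sentences).
    Returns (nodes, edges); edges maps a sorted word pair to its probability.
    """
    total = len(sentences)
    word_counts = Counter()
    pair_counts = Counter()
    for sentence in sentences:
        unique = sorted(set(sentence))  # count each word once per sentence
        word_counts.update(unique)
        pair_counts.update(combinations(unique, 2))

    def in_count_range(word):
        n = word_counts[word]
        return n >= min_count and (max_count is None or n <= max_count)

    edges = {}
    for (a, b), n in pair_counts.items():
        prob = n / total
        if in_count_range(a) and in_count_range(b) and min_prob <= prob <= max_prob:
            edges[(a, b)] = prob
    # Only words that still have at least one edge remain in the network.
    nodes = {word for pair in edges for word in pair}
    return nodes, edges


if __name__ == "__main__":
    sample = [
        ["battery", "cost", "reduction"],
        ["battery", "cost"],
        ["battery", "safety"],
        ["motor", "noise"],
    ]
    # Initial generation with permissive conditions.
    nodes, edges = build_cooccurrence_network(sample)
    print(len(nodes), len(edges))
    # "Modification" then "update": tighten the condition and regenerate.
    nodes, edges = build_cooccurrence_network(sample, min_count=2)
    print(edges)  # {('battery', 'cost'): 0.5}
```

Raising `min_count` or `min_prob` shrinks the network and lowering them grows it, which is the sense in which the conditions increase or decrease the number of words. In this sketch the update is a full regeneration; an incremental update would be an equally valid reading of the claims.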
Description
This invention relates to a generation device, a generation method, and a generation program for generating information.
Digital transformation (DX) in the engineering chain (EC) of B2B (business-to-business) manufacturing is progressing, centered on managing the product lifecycle and product data from design through mass-production preparation. The upstream of the EC (the market research, product planning, research and development, and design departments) collects external information, including customer challenges and needs (referred to here as corporate Voice of Customer, or corporate VoC), and incorporates it into product and service development, yet DX in the upstream of the EC is still in its early stages. As a result, external information, including corporate VoC from customers and potential customers, is currently collected and analyzed manually from external patent and academic paper databases and from trade show information.
Patent Document 1 discloses a self-generating information processing system that continuously provides new information leading to user insights and discoveries. This system collects and outputs information and comprises: means for inputting first information; means for collecting second information related to the first information; means for selecting third information from the second information; means for outputting the second or third information; means for collecting second information using the third information as new first information; means for merging existing second information and new second information in a predetermined ratio; means for selecting new third information from the merged second information; and means for outputting the merged second information or the new third information. The system operates recursively.
Patent Document 2 discloses an idea generation support program.
This program performs the following steps: morphologically analyzing first information data consisting of multiple natural-language sentences limited to a specific subject and extracting multiple first terms; extracting the multiple first terms in the first information data according to their frequency of occurrence in each of multiple topics, using latent Dirichlet allocation; and morphologically analyzing second information data consisting of multiple natural-language sentences not limited to a specific subject and extracting second terms that co-occur with the multiple first terms in each of the multiple topics.
Patent Document 1: International Publication No. 2016/027372
Patent Document 2: Japanese Patent Publication No. 2022-117931
Figure 1 is an explanatory diagram showing an example of the configuration of the generation system.
Figure 2 is a block diagram showing an example of the hardware configuration of the generation device.
Figure 3 is an explanatory diagram showing an example of a concern database.
Figure 4 is a block diagram showing an example of the functional configuration of the generation device.
Figure 5 is a flowchart showing an example of the visualization information generation procedure performed by the generation device.
Figure 6 is a flowchart showing a detailed example of the co-occurrence network generation process (step S502).
Figure 7 is an explanatory diagram showing the search results of step S602.
Figure 8 is an explanatory diagram showing the morphological analysis results.
Figure 9 is an explanatory diagram showing an example of a co-occurrence network display (Example 1).
Figure 10 is an explanatory diagram showing an example of a co-occurrence network display (Example 2).
Figure 11 is a flowchart showing a detailed example of the procedure for the similarity visualization process (step S504).
Figure 12 is an explanatory diagram showing an example of similarity visualization.
Figure 13 is a flowchart showing a detailed example of the procedure for the cross-field comparison process (step S505).
Figure 14 is an explanatory diagram showing an example of a comparison between different fields.
<Example of a generation system configuration>
Figure 1 is an explanatory diagram showing an example configuration of the generation system. The generation system 100 includes a generation device 101 and a terminal 102. The generation device 101 and the terminal 102 are connected to each other via a network 103 such as the Internet, a LAN (Local Area Network), or a WAN (Wide Area Network).
The generation device 101 is connected to a search results DB (database) 104. The search results DB 104 is a database in which the multiple terminals 102 store the results of searches of the internal databases 110 and the external databases 120.
The terminal 102 has a browser function and, on receiving visualization information from the generation device 101, displays that information on its screen. The users of the multiple terminals 102 are, for example, employees of the same company. Furthermore, the generation device 101 and terminal 102 are connected to the internal database group