JP-7856841-B1 - Information processing method and program to support conversational learning
Abstract
[Problem] To provide information processing that supports more effective conversational learning. [Solution] Using an information processing device, data of the user's voice in a language conversation with the user using artificial intelligence is acquired, and an evaluation result is output that evaluates the language expressions spoken by the user in the conversation based on the acquired voice data. The language expressions selected by the user from the output evaluation result are stored in the memory as stock information for later learning. After the stock information is stored in the memory, data of the user's voice in a language conversation with the user using artificial intelligence is acquired, and using the acquired voice data and the stored stock information, it is determined whether the expression from the stock information was used in the language conversation with the user using artificial intelligence. If it is determined that the expression from the stock information was used, evaluation information that the expression from the stock information was used is output. [Selection Diagram] Figure 3
Inventors
- 矢部 大輔
Assignees
- 株式会社プログリット
Dates
- Publication Date
- 20260511
- Application Date
- 20251110
Claims (10)
- In an information processing method for supporting a user's learning of conversation in a certain language, Using an information processing device, The AI acquires the user's voice data in a conversation with the user in the aforementioned language, Based on the acquired audio data, the system outputs an evaluation result that assesses the linguistic expressions spoken by the user during the conversation. The language expressions selected by the user from the output evaluation results are stored in the memory unit as stock information for later learning. After storing the stock information in the storage unit, the system acquires the user's voice data from the language conversation with the user using artificial intelligence, and uses the acquired voice data and the stored stock information to determine whether the expression in the stock information was used in the language conversation with the user using artificial intelligence. If the determination determines that the representation of the stock information was used, the system outputs information evaluating that the representation of the stock information was used . The method is characterized by outputting a similar alternative expression or a correct expression for the linguistic expression uttered by the user in the conversation, as the linguistic expression in the evaluation result .
- The method according to claim 1, characterized in that the information processing device further acquires information on expressions entered as text by the user as stock information and stores it in the storage unit.
- The information processing device displays the linguistic expression of the evaluation result on the screen of the display device. The method according to claim 1, characterized in that the language expressions are displayed such that the user can select the expression to be used as stock information from among the language expressions in the evaluation results displayed on the screen.
- The method according to claim 3, characterized in that the linguistic expressions in the evaluation results displayed on the screen are displayed on the screen so that the user can select them word by word.
- The method according to claim 1, characterized in that the information processing device outputs new stock information candidates based on the results of analyzing the stock information previously selected or learned by the user .
- The memory unit stores multiple different representations as the stock information, The method according to claim 1, characterized in that it determines whether any of the expressions in the stock information were used in the user's conversation by comparing the audio data obtained in the user's conversation with each of the multiple different expressions in the stock information.
- The method according to claim 1, characterized in that the information processing device sets a target expression for the conversation from among the stored stock information expressions, performs a process to evaluate whether the set target expression was used in the user's conversation, and outputs the evaluation result that the set target expression was used in the user's conversation.
- The method according to claim 1, characterized in that the information processing device sets the topic of the conversation based on the representation of the stored stock information.
- A program for causing a computer to perform the method described in any one of claims 1 to 8.
- In an information processing device for supporting a user's learning of conversation in a certain language, The processing unit, The AI acquires the user's voice data in a conversation with the user in the aforementioned language, Based on the acquired audio data, the system outputs an evaluation result that assesses the linguistic expressions spoken by the user during the conversation. The language expressions selected by the user from the output evaluation results are stored in the memory unit as stock information for later learning. After storing the stock information in the storage unit, the system acquires the user's voice data from the language conversation with the user using artificial intelligence, and uses the acquired voice data and the stored stock information to determine whether the expression in the stock information was used in the language conversation with the user using artificial intelligence. If the determination determines that the representation of the stock information was used, the system outputs information evaluating that the representation of the stock information was used . The device is characterized in that, as a linguistic expression in the evaluation results, it outputs a similar alternative expression to the linguistic expression uttered by the user in conversation, or the correct expression .
Description
This invention relates to an information processing method and program for supporting a user's learning of conversation in a particular language. Conventionally, artificial intelligence-based English conversation support technologies have been proposed. These technologies acquire the user's speech content as text data through speech recognition, and analyze the meaning and emotion of the sentences using natural language processing. Based on the analysis results, they generate appropriate English expressions and response sentences using large-scale language models, and output them with natural pronunciation using speech synthesis technology. Patent Document 1 discloses a technology that analyzes users' pronunciation data and reports on individual speaking habits, weaknesses, and alternative expressions. Japanese Patent Publication No. 2025-048940 This is a diagram showing the configuration of an information processing system.This diagram shows the configuration of an information processing terminal.This is a diagram illustrating a flowchart of an information processing method.This is a diagram of the GUI screen.This is a diagram of the GUI screen.This diagram shows the screen for saving expressions you want to remember.This figure shows a screen displaying the evaluation results regarding the use of stock information representations.This figure shows a screen displaying a list of stock information. <Details of the implementation> This embodiment describes an information processing system for providing an application (software) to support conversational learning in a predetermined language. As an example of use, it is used to support a user whose native language is a first language in learning conversational second language, which is different from the first language. The predetermined language is not particularly limited and may be any language such as Japanese, English, Chinese, Korean, German, or French. Figure 1 shows the configuration diagram of the information processing system in this embodiment. This system comprises an information processing terminal (information processing device) 1, a control server (information processing device) 2, and an AI server (information processing device) 3. These are connected by a network 4. Information processing terminal 1 is a terminal used by a particular user. In addition to the information processing terminal used by user A, multiple information processing terminals used by other users, such as user B, can be connected to network 4. In this system, the server controls and manages the data, while the application (client) on the information processing terminal displays that data. The server-side (backend) manages data in a database, provides API endpoints, performs data updates and calculations, sends data to the client, controls the display of the application screen, and handles security control. The client-side (application) sends requests to the server's API, displays the data received from the server on the screen, provides a user interface (UI), and performs temporary data caching. Information processing terminal 1 (hereinafter referred to as terminal 1) has software and application programs installed to support conversational learning in a specified language. These programs are installed by downloading them from a server connected to terminal 1 via network 4. Terminal 1 transmits information entered by the user and acquired audio data to control server 2 via network 4. For example, terminal 1 acquires audio data spoken by the user using a microphone, etc., and transmits this information to control server 2 via network 4. Terminal 1 also receives (acquires) information output by control server 2 via network 4 and displays information or images based on the received information. Next, we will explain the configuration of terminal 1. Terminal 1 has a computer, such as a personal computer (PC), smartphone, or tablet. Terminal 1 displays various screens on a display device (display screen) using installed software and applications (programs). Figure 2 shows the configuration diagram of terminal 1. Terminal 1 includes a processing unit 11, memory 12, storage 13, communication unit 14, input unit 15, and display unit 16. These are electrically connected to each other via a bus 17. The bus 17 is commonly connected to each of the above components and transmits, for example, address signals, data signals, and various control signals. The processing unit 11 is a computing device that controls the operation of the entire terminal 1, controls the transmission and reception of data between each part, and performs information processing necessary for program execution and authentication. The processing unit 11 includes a computing device such as a CPU, GPU, or FPGA, and executes programs stored in the storage 13 and loaded into the memory 12 to perform various information processing tasks described later. The memory 12 (storage unit) includes a main memory composed of a vola