KR-20260067940-A - ELECTRONIC APPARATUS FOR PROVIDING RESPONSE TO USER COMMAND AND CONTROL METHOD THEREOF

KR20260067940AKR 20260067940 AKR20260067940 AKR 20260067940AKR-20260067940-A

Abstract

An electronic device is disclosed. The electronic device includes one or more processors, each comprising a memory for storing a plurality of sounds and instructions, a speaker, and processing circuitry. When the instructions are executed individually or collectively by one or more processors, when a user command is obtained, at least one sound among a plurality of sounds is identified, and a first neural network model is requested to perform an operation on the user command. While the operation of the first neural network model is being performed, at least one sound is output through the speaker, and when an answer based on the operation of the first neural network model is obtained, a sound corresponding to the answer is output through the speaker.

Inventors

우광택
안진완

Assignees

삼성전자주식회사

Dates

Publication Date: 20260513
Application Date: 20241206
Priority Date: 20241106

Claims (20)

In electronic devices, Memory for storing multiple sounds and instructions; Speaker; and One or more processors including processing circuitry; and When the above instructions are executed individually or collectively by the one or more processors, When a user command is obtained, at least one sound among the plurality of sounds is identified, and Requesting the execution of a first neural network model operation for the above user command, and While the operation of the first neural network model is being performed, the at least one sound is output through the speaker, and An electronic device that outputs a sound corresponding to the answer through the speaker when an answer based on the operation of the first neural network model is obtained.
In paragraph 1, It further includes a communication interface; and When the above instructions are executed individually or collectively by the one or more processors, Control the communication interface to transmit the user command to a server that performs operations on the first neural network model, and After identifying at least one sound, output the at least one sound through the speaker until outputting a sound corresponding to the answer, and An electronic device that outputs a sound corresponding to the answer through the speaker when the answer is received from the server through the communication interface.
In paragraph 1, The above memory is, The above-mentioned first neural network model is further stored, When the above instructions are executed individually or collectively by the one or more processors, Performing operations of the first neural network model for the above user command, While the operation of the first neural network model is being performed, the at least one sound is output through the speaker, and An electronic device that outputs a sound corresponding to the above answer through the speaker when the above answer is obtained.
In paragraph 1, The above memory is, Save the second neural network model, When the above instructions are executed individually or collectively by the one or more processors, Performing operations of the second neural network model on the above user command to identify the estimated operation time until the answer based on the operation of the first neural network model is provided, and An electronic device that identifies at least one sound among the plurality of sounds based on the above-mentioned expected operation time.
In paragraph 4, The above plurality of sounds are, It includes sounds with different playback times, When the above instructions are executed individually or collectively by the one or more processors, An electronic device that identifies the sound having the playback time closest to the estimated computation time among the plurality of sounds.
In paragraph 1, When the above instructions are executed individually or collectively by the one or more processors, An electronic device that, when the above user command is obtained, identifies a sound corresponding to a part of the above user command as the at least one sound.
In paragraph 6, When the above instructions are executed individually or collectively by the one or more processors, While the operation of the first neural network model is being performed, the at least one sound is output through the speaker, and An electronic device that, when the above answer is obtained, outputs a sound through the speaker corresponding to the remainder of the above answer, excluding a part of the above user command.
In paragraph 6, When the above instructions are executed individually or collectively by the one or more processors, A prompt to output a part of the above user command as sound and a request to perform an operation of the above first neural network model on the above user command, and While the operation of the first neural network model is being performed, the at least one sound is output through the speaker, and When the above answer is obtained, a sound corresponding to the above answer is output through the speaker, and The above answer is, An electronic device that is an answer with part of the above user command excluded.
In paragraph 1, Includes a microphone; and When the above instructions are executed individually or collectively by the one or more processors, When the user utterance is received through the microphone, the user command is obtained based on the user utterance, and An electronic device that identifies at least one sound based on at least one of the content or tone of the above user command.
In Paragraph 9, When the above instructions are executed individually or collectively by the one or more processors, An electronic device that updates the plurality of sounds based on the tone of the above user command.
In a method for controlling an electronic device, When a user command is obtained, a step of identifying at least one sound among a plurality of sounds; A step of requesting the execution of a first neural network model operation for the above user command; A step of outputting at least one sound through a speaker of the electronic device while the operation of the first neural network model is being performed; and A control method comprising the step of outputting a sound corresponding to the answer through the speaker when an answer based on the operation of the first neural network model is obtained.
In Paragraph 11, The above-mentioned requested step is, The user command is transmitted to a server that performs operations of the first neural network model, and The step of outputting at least one sound through the speaker is, After identifying at least one sound, output the at least one sound through the speaker until outputting a sound corresponding to the answer, and The step of outputting a sound corresponding to the above answer through the speaker is, A control method that outputs a sound corresponding to the answer through the speaker when the answer is received from the server.
In Paragraph 11, The above-mentioned requested step is, A control method for performing operations of the first neural network model on the above user command.
In Paragraph 11, The above identification step is, Performing operations of a second neural network model on the above user command to identify the estimated operation time until the answer based on the operations of the first neural network model is provided, and A control method for identifying at least one sound among a plurality of sounds based on the above-mentioned expected operation time.
In Paragraph 14, The above plurality of sounds are, It includes sounds with different playback times, The above identification step is, A control method for identifying the sound having the playback time closest to the expected calculation time among the plurality of sounds.
In Paragraph 11, The above identification step is, A control method that, when the above user command is obtained, identifies a sound corresponding to a part of the above user command as the at least one sound.
In Paragraph 16, The step of outputting a sound corresponding to the above answer through the speaker is, A control method that, when the above answer is obtained, outputs a sound through the speaker corresponding to the remainder of the above answer, excluding a part of the above user command.
In Paragraph 16, The above-mentioned requested step is, A prompt to output a part of the above user command as sound and a request to perform an operation of the above first neural network model on the above user command, and The step of outputting a sound corresponding to the above answer through the speaker is, When the above answer is obtained, a sound corresponding to the above answer is output through the speaker, and The above answer is, A control method that is an answer excluding part of the above user command.
In Paragraph 11, The above identification step is, When the user utterance is received through the microphone included in the electronic device, the user command is obtained based on the user utterance, and A control method for identifying at least one sound based on at least one of the content or tone of the above user command.
In Paragraph 19, A control method further comprising the step of updating the plurality of sounds based on the tone of the user command.

Description

ELECTRONIC APPARATUS FOR PROVIDING RESPONSE TO USER COMMAND AND CONTROL METHOD THEREOF } The present disclosure relates to an electronic device and a method for controlling the same, and more specifically, to an electronic device and a method for controlling the same that provides a response to a user command. Thanks to advancements in electronic technology, various types of electronic devices are being developed. In particular, user convenience is improving as electronic devices that provide responses to user utterances through neural network models are becoming more widespread. However, electronic devices that provide responses to user utterances through neural network models merely provide mechanical answers, so they could not provide the user with the feeling of conversing with a person. FIG. 1 is a block diagram showing an electronic system according to one embodiment of the present disclosure. FIG. 2 is a block diagram showing the configuration of an electronic device according to one embodiment of the present disclosure. FIG. 3 is a block diagram showing the detailed configuration of an electronic device according to one embodiment of the present disclosure. FIG. 4 is a drawing for explaining a generative artificial intelligence system (400) according to one embodiment of the present disclosure. FIG. 5 is a flowchart illustrating an operation to identify at least one sound using an estimated computation time according to one embodiment of the present disclosure. FIG. 6 is a flowchart illustrating an operation for identifying a sound corresponding to a part of a user command as at least one sound according to one embodiment of the present disclosure. FIG. 7 is a drawing for explaining a second neural network model according to one embodiment of the present disclosure. FIG. 8 is a drawing for explaining a plurality of sounds according to one embodiment of the present disclosure. FIG. 9 is a drawing for explaining the operation of outputting a sound corresponding to a part of a user command as at least one sound according to one embodiment of the present disclosure. FIG. 10 is a flowchart illustrating a method for controlling an electronic device according to one embodiment of the present disclosure. The purpose of the present disclosure is to provide an electronic device and a method for controlling the same for providing a user with the sensation of conversing with a person in the process of providing a response to a user's utterance through a neural network model. The present disclosure will be described in detail below with reference to the attached drawings. The terms used in the embodiments of this disclosure have been selected to be as widely used as possible, taking into account their functions within this disclosure; however, these terms may vary depending on the intent of those skilled in the art, case law, the emergence of new technologies, etc. Additionally, in specific cases, terms have been arbitrarily selected by the applicant, and in such cases, their meanings will be described in detail in the relevant explanatory section of this disclosure. Therefore, terms used in this disclosure should be defined not merely by their names, but based on their meanings and the overall content of this disclosure. In this specification, expressions such as “have,” “may have,” “include,” or “may include” indicate the presence of such features (e.g., numerical values, functions, operations, or components such as parts) and do not exclude the presence of additional features. The expression "at least one of A or/and B" should be understood as representing either "A" or "B" or "A and B". Expressions such as "first," "second," "first," or "second" used in this specification may modify various components regardless of order and/or importance, and are used only to distinguish one component from another and do not limit said components. The singular expression includes the plural expression unless the context clearly indicates otherwise. In this application, terms such as "comprising" or "consisting of" are intended to specify the existence of the features, numbers, steps, actions, components, parts, or combinations thereof described in the specification, and should be understood as not precluding the existence or addition of one or more other features, numbers, steps, actions, components, parts, or combinations thereof. In this specification, the term "user" may refer to a person using an electronic device or a device using an electronic device (e.g., an artificial intelligence electronic device). Various embodiments of the present disclosure will be described in more detail below with reference to the attached drawings. FIG. 1 is a block diagram showing an electronic system (1000) according to one embodiment of the present disclosure. As shown in FIG. 1, the electronic system (1000) may include an electronic device (100) and a server (200). The electronic device (100) is a device that provides a response to a u