CN-115917642-B - Sound processing system, sound processing device, and sound processing method

CN115917642BCN 115917642 BCN115917642 BCN 115917642BCN-115917642-B

Abstract

The sound processing system according to the present disclosure includes an input unit, a determination unit, and a sound recognition unit. The input unit receives the 1 st sound, which is the sound uttered by the 1 st speaker. The determination unit determines whether or not the position of the 1 st speaker can be determined. The voice recognition unit outputs a voice command, which is a signal for controlling the target device, to the target device, the voice command being determined based on a voice, and the voice recognition unit restricts the output of a speaking position command, which is a command relating to the position of the speaker, among the voice commands, when the determination unit determines that the position of the 1 st speaker cannot be determined.

Inventors

YAMANASHI TOMOFUMI
Holding Wood, Nanshengye

Assignees

松下知识产权经营株式会社

Dates

Publication Date: 20260505
Application Date: 20210420
Priority Date: 20200918

Claims (13)

1. A sound processing system is provided with: an input unit that receives a1 st sound, the 1 st sound being a sound generated by a1 st speaker; a determination unit for determining whether the 1 st speaker position can be determined, and A voice recognition unit that outputs a voice command, which is a signal for controlling the target device, to the target device, the voice command being determined based on a voice, wherein the voice recognition unit restricts output of a speaking position command, which is a command relating to a position of a speaker, among the voice commands when the determination unit determines that the position of the 1 st speaker cannot be determined, The input unit receives a sound including the 1 st sound output from a plurality of microphones including the 1 st microphone and a 2 nd microphone associated with the 1 st microphone, The determination section detects the presence or absence of a failure of the plurality of microphones, determines whether or not the position of the 1 st speaker can be determined based on the result of the detection, When the determination unit detects a failure of the 1 st microphone and determines that the 1 st speaker cannot be specified, the voice recognition unit does not output the speech position command specified from the voice input from the 2 nd microphone, among the speech position commands.
2. The sound processing system of claim 1, wherein, When the determination unit determines that the position of the 1 st speaker cannot be specified, the voice recognition unit does not output the speaking position command.
3. The sound processing system of claim 1 or 2, wherein, When the determination unit determines that the position of the 1 st speaker cannot be specified, the voice recognition unit changes the priority of the output of the driver seat command associated with the driver seat among the speaking position commands.
4. The sound processing system of claim 3, wherein, When the determination unit determines that the position of the 1 st speaker cannot be specified, the voice recognition unit increases the priority of the output of the driver's seat command.
5. The sound processing system of claim 1 or 2, wherein, Further comprising a speaker determination unit that determines whether or not the 1 st sound is a sound of a registered person registered in advance, The voice recognition unit outputs the speaking position command on the condition that the speaker determination unit determines that the 1 st voice is the voice of the registrant.
6. The sound processing system of claim 1, wherein, The speaker 1 system further includes a determination unit that determines a position of the speaker 1 from which the speaker 1 is to be emitted by comparing sound signals of the plurality of microphones.
7. The sound processing system of claim 1, wherein, The determination unit determines that a failure has occurred when a difference between a level of the audio signal output from the 1 st microphone and a level of the audio signal output from the 2 nd microphone is equal to or greater than a threshold value.
8. The sound processing system of claim 5, wherein, The voice recognition unit outputs the speaking position command for determining the processing to be executed, based on the position of the speaker.
9. The sound processing system of claim 5, wherein, When the determination unit determines that the speaker position cannot be specified due to the detection of the microphone failure, the voice recognition unit restricts the output of the speaking position command of the voice input from the microphone associated with the microphone having the failure.
10. A sound processing system is provided with: an input unit that receives a1 st sound, the 1 st sound being a sound generated by a1 st speaker; A determination unit that determines whether or not the position of the 1 st speaker can be determined; a voice recognition unit that outputs a voice command, which is a signal for controlling the target device, to the target device, the voice command being determined based on a voice, the voice recognition unit limiting output of a speaking position command, which is a command related to a position of a speaker, among the voice commands, in a case where the determination unit determines that the position of the 1 st speaker cannot be determined, and And a1 st cancellation processing unit that cancels sound components other than the sound uttered by the speaker based on the reference signal indicating the specific sound component.
11. A sound processing system is provided with: an input unit that receives a1 st sound, the 1 st sound being a sound generated by a1 st speaker; A determination unit that determines whether or not the position of the 1 st speaker can be determined; a voice recognition unit that outputs a voice command, which is a signal for controlling the target device, to the target device, the voice command being determined based on a voice, the voice recognition unit limiting output of a speaking position command, which is a command related to a position of a speaker, among the voice commands, in a case where the determination unit determines that the position of the 1 st speaker cannot be determined, and And a2 nd cancellation processing unit that performs cancellation processing for canceling sounds from directions other than the target direction.
12. A sound processing device is provided with: a receiving unit that receives a1 st signal indicating whether or not the 1 st speaker who uttered the 1 st sound can be specified; A voice recognition unit that outputs a voice command determined from a voice as a signal for controlling a target device to the target device, the voice recognition unit restricting output of a speaking position command as a command related to a position of a speaker among the voice commands determined from the voice in a case where the 1 st signal indicates that the 1 st speaker position cannot be determined, and An enhancement processing section that performs directivity control for enhancing sound in the target direction.
13. A sound processing method comprising: An input step of receiving a1 st sound from a1 st speaker; A determination step of determining whether the position of the 1 st speaker can be determined, and An output step of outputting a sound command determined from sound as a signal for controlling the object device to the object device, Wherein, when it is determined in the determining step that the position of the 1 st speaker cannot be determined, the priority of the output of the driver seat command associated with the driver seat in the voice command is changed in the outputting step.

Description

Sound processing system, sound processing device, and sound processing method Technical Field The present disclosure relates to a sound processing system, a sound processing apparatus, and a sound processing method. Background A sound processing system is known which processes a sound recognition command based on a sound uttered by a speaker. Patent document 1 discloses a sound processing system that processes a sound recognition command based on a position where a speaker utters a sound. Prior art literature Patent literature Patent document 1 Japanese patent application laid-open No. 2017-90611 Disclosure of Invention However, patent document 1 does not disclose control in the case where the position of the speaker cannot be specified. If the speaker position cannot be specified, the sound processing system may perform unintended processing. The present disclosure aims to perform appropriate processing in a sound processing system even in a case where the position of a speaker cannot be determined. The sound processing system according to the present disclosure includes an input unit, a determination unit, and a sound recognition unit. The input unit receives the 1 st sound, which is the sound uttered by the 1 st speaker. The determination unit determines whether or not the position of the 1 st speaker can be determined. The voice recognition unit outputs a voice command, which is a signal for controlling the target device, to the target device, the voice command being determined based on a voice, and the voice recognition unit restricts the output of a speaking position command, which is a command relating to the position of the speaker, among the voice commands, when the determination unit determines that the position of the 1 st speaker cannot be determined. According to the present disclosure, in the sound processing system, an appropriate process can be performed even in a case where the position of the speaker cannot be determined. Drawings Fig. 1 is a diagram showing an example of a schematic configuration of the in-vehicle sound processing system according to embodiment 1. Fig. 2 is a diagram showing an example of a hardware configuration of the audio processing system according to embodiment 1. Fig. 3 is a block diagram showing an example of the configuration of the sound processing system according to embodiment 1. Fig. 4 is a flowchart showing an example of the operation of the audio processing system according to embodiment 1. Fig. 5 is a block diagram showing an example of the configuration of the sound processing system according to embodiment 2. Fig. 6 is a flowchart showing an example of the operation of the audio processing system according to embodiment 2. Detailed Description Embodiments of the present disclosure will be described in detail below with reference to the drawings as appropriate. Here, too detailed explanation may be omitted. Furthermore, the drawings and the following description are provided to enable those skilled in the art to fully understand the present disclosure, and are not intended to limit the subject matter recited in the claims thereto. (Embodiment 1) Fig. 1 is a diagram showing an example of the outline configuration of the audio system 5 in embodiment 1. The sound system 5 is mounted on the vehicle 10, for example. Next, an example in which the sound system 5 is mounted on the vehicle 10 will be described. A plurality of seats are provided in the cabin of the vehicle 10. The plurality of seats are, for example, 4 seats, that is, a driver's seat, a passenger seat, and left and right rear seats. Further, the number of seats is not limited thereto. Hereinafter, a person sitting in the driver's seat is referred to as a passenger hm1, a person sitting in the passenger seat is referred to as a passenger hm2, a person sitting on the right side of the rear seat is referred to as a passenger hm3, and a person sitting on the left side of the rear seat is referred to as a passenger hm4. The sound system 5 has a microphone MC1, a microphone MC2, a microphone MC3, a microphone MC4, a sound processing system 20, and an electronic device 30. The sound system 5 shown in fig. 1 has a number equal to the number of seats, i.e. 4 microphones, but the number of microphones may not be equal to the number of seats. Microphone MC1, microphone MC2, microphone MC3, and microphone MC4 output sound signals to sound processing system 20. Then, the sound processing system 20 outputs the sound recognition result to the electronic device 30. The electronic device 30 performs a process specified according to the voice recognition result based on the inputted voice recognition result. The microphone MC1 is a microphone that collects sounds emitted from the occupant hm 1. In other words, the microphone MC1 acquires a sound signal containing a sound component emitted by the occupant hm 1. The microphone MC1 is disposed, for example, on the right side of an overhead console (overhead console). T