CN-121983076-A - Audio processing method, device, equipment and storage medium

CN121983076ACN 121983076 ACN121983076 ACN 121983076ACN-121983076-A

Abstract

The application provides an audio processing method, device, equipment and storage medium, wherein the audio processing method comprises the steps of obtaining a preset audio signal to be processed; the preset audio signals comprise first audio signals and second audio signals, the first audio signals are different from the second audio signals, echo return loss estimation is conducted on the first audio signals and the second audio signals to obtain corresponding echo return loss, double-talk detection is conducted on the preset audio signals according to the first audio signals and the second audio signals to obtain double-talk detection results of the preset audio signals, echo cancellation is conducted on the preset audio signals according to at least one of the echo return loss and the double-talk detection results to obtain target audio signals corresponding to the preset audio signals, so that echo cancellation flexibility of the preset audio signals is improved, and echo cancellation effects on the preset audio signals are improved.

Inventors

YANG PENG
HU LITIAN
WANG CHUANSHENG
XIE XUN

Assignees

珠海莫界科技有限公司

Dates

Publication Date: 20260505
Application Date: 20260205

Claims (10)

1. An audio processing method, characterized in that the audio processing method comprises: the method comprises the steps of obtaining a preset audio signal to be processed, wherein the preset audio signal comprises a first audio signal and a second audio signal, and the first audio signal is different from the second audio signal; performing echo return loss estimation on the first audio signal and the second audio signal to obtain corresponding echo return loss; According to the first audio signal and the second audio signal, carrying out double-talk detection on the preset audio signal to obtain a double-talk detection result of the preset audio signal; And carrying out echo cancellation on the preset audio signal according to at least one of the echo return loss and the double-talk detection result to obtain a target audio signal corresponding to the preset audio signal.
2. The audio processing method according to claim 1, wherein the performing echo cancellation on the preset audio signal according to at least one of the echo return loss and the double talk detection result to obtain a target audio signal corresponding to the preset audio signal includes: When the echo return loss is greater than a first echo return loss threshold and the preset audio signal is determined to be in a single-talk state according to the double-talk detection result, performing echo cancellation on the preset audio signal through a first filter to obtain a target audio signal, or When the echo return loss is smaller than or equal to a first echo return loss threshold value and the preset audio signal is determined to be in a single-talk state according to the double-talk detection result, performing echo cancellation on the preset audio signal through a second filter to obtain a target audio signal, or And when the preset audio signal is determined to be in the double-talk state according to the double-talk detection result, performing echo cancellation on the preset audio signal through a first filter and a second filter to obtain a target audio signal.
3. The audio processing method according to claim 2, wherein the performing echo cancellation on the preset audio signal by the first filter to obtain a target audio signal includes: performing echo cancellation on the preset audio signal based on the first filter to obtain a first error signal corresponding to the preset audio signal; determining the target audio signal according to the first error signal; the echo cancellation is performed on the preset audio signal through a second filter to obtain a second error signal corresponding to the preset audio signal, including: performing echo cancellation on the preset audio signal based on the second filter to obtain a second error signal corresponding to the preset audio signal; determining the target audio signal from the second error signal; The echo cancellation is performed on the preset audio signal through the first filter and the second filter to obtain a target audio signal, including: performing echo cancellation on the preset audio signal based on the first filter to obtain a first error signal corresponding to the preset audio signal; performing echo cancellation on the preset audio signal based on the second filter to obtain a second error signal corresponding to the preset audio signal; Determining a third error signal corresponding to the preset audio signal according to the first error signal, the second error signal, a first weight corresponding to the first error signal and a second weight corresponding to the second error signal; and determining the target audio signal according to the third error signal.
4. The audio processing method according to claim 3, wherein before the determining the third error signal corresponding to the preset audio signal according to the first error signal, the second error signal, the first weight corresponding to the first error signal, and the second weight corresponding to the second error signal, the audio processing method further comprises: and determining a first weight corresponding to the first error signal and a second weight corresponding to the second error signal according to the first error signal and the second error signal.
5. The image processing method according to claim 4, wherein the determining the first weight and the second weight from the first error signal and the second error signal includes: Determining a first error variance corresponding to the first error signal according to the first error signal; Determining a second error variance corresponding to the second error signal according to the second error signal; determining the first weight according to the duty ratio of the second error variance in the sum of the first error variance and the second error variance; the second weight is determined according to the duty ratio of the first error variance in the sum of the first error variance and the second error variance.
6. The audio processing method according to claim 3, wherein before the determining the third error signal corresponding to the preset audio signal according to the first error signal, the second error signal, the first weight corresponding to the first error signal, and the second weight corresponding to the second error signal, the audio processing method further comprises: When the echo return loss is smaller than a second echo return loss threshold value, and the double-talk intensity parameter corresponding to the preset audio signal is larger than or equal to a preset intensity threshold value, determining that the first weight and the second weight are both preset weights, wherein the second echo return loss threshold value is smaller than the first echo return loss threshold value.
7. The audio processing method according to claim 3, wherein the determining the third error signal corresponding to the preset audio signal according to the first error signal, the second error signal, the first weight corresponding to the first error signal, and the second weight corresponding to the second error signal includes: determining a first product between the first error signal and the first weight; Determining a second product between the second error signal and the second weight; The third error signal is determined based on a sum of the first product and the second product.
8. An audio processing apparatus, characterized in that the audio processing apparatus comprises: The device comprises an audio acquisition module, a processing module and a processing module, wherein the audio acquisition module is used for acquiring a preset audio signal to be processed, the preset audio signal comprises a first audio signal and a second audio signal, and the first audio signal is different from the second audio signal; The echo return loss estimation module is used for carrying out echo return loss estimation on the first audio signal and the second audio signal to obtain corresponding echo return loss; The double-talk detection module is used for carrying out double-talk detection on the preset audio signal according to the first audio signal and the second audio signal to obtain a double-talk detection result of the preset audio signal; And the echo cancellation module is used for performing echo cancellation on the preset audio signal according to at least one of the echo return loss and the double-talk detection result to obtain a target audio signal corresponding to the preset audio signal.
9. An electronic device comprising a memory and a processor; The memory is used for storing a computer program; The processor for executing the computer program and for implementing the steps of the audio processing method according to any one of claims 1 to 7 when the computer program is executed.
10. A computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program which, when executed by a processor, causes the processor to implement the steps of the audio processing method according to any one of claims 1 to 7.

Description

Audio processing method, device, equipment and storage medium Technical Field The present application relates to the field of echo cancellation technologies, and in particular, to an audio processing method, apparatus, device, and storage medium. Background Echo cancellation is a key technique in audio processing. For example, echo cancellation has an important role in communication scenarios of voice communication, video conferencing, and so forth. However, in the related art, when the electronic device performs echo cancellation on the audio, the situation that the echo cancellation flexibility on the audio is poor easily occurs, which results in poor echo cancellation effect on the audio. Disclosure of Invention The application provides an audio processing method, an audio processing device, audio processing equipment and a storage medium, and aims to solve the technical problem of an echo cancellation effect on audio caused by poor flexibility of echo cancellation on audio. In a first aspect, the present application provides an audio processing method, the audio processing method comprising: the method comprises the steps of obtaining a preset audio signal to be processed, wherein the preset audio signal comprises a first audio signal and a second audio signal, and the first audio signal is different from the second audio signal; performing echo return loss estimation on the first audio signal and the second audio signal to obtain corresponding echo return loss; According to the first audio signal and the second audio signal, carrying out double-talk detection on the preset audio signal to obtain a double-talk detection result of the preset audio signal; And carrying out echo cancellation on the preset audio signal according to at least one of the echo return loss and the double-talk detection result to obtain a target audio signal corresponding to the preset audio signal. In a second aspect, the present application provides an audio processing apparatus comprising: The device comprises an audio acquisition module, a processing module and a processing module, wherein the audio acquisition module is used for acquiring a preset audio signal to be processed, the preset audio signal comprises a first audio signal and a second audio signal, and the first audio signal is different from the second audio signal; The echo return loss estimation module is used for carrying out echo return loss estimation on the first audio signal and the second audio signal to obtain corresponding echo return loss; The double-talk detection module is used for carrying out double-talk detection on the preset audio signal according to the first audio signal and the second audio signal to obtain a double-talk detection result of the preset audio signal; And the echo cancellation module is used for performing echo cancellation on the preset audio signal according to at least one of the echo return loss and the double-talk detection result to obtain a target audio signal corresponding to the preset audio signal. In a third aspect, the present application provides an electronic device comprising a memory and a processor; The memory is used for storing a computer program; The processor is configured to execute the computer program and implement the steps of the audio processing method as described above when the computer program is executed. In a fourth aspect, the present application is a computer-readable storage medium storing a computer program which, when executed by a processor, causes the processor to implement the steps of the audio processing method as described above. The application provides an audio processing method, a device, equipment and a storage medium, when a preset audio signal to be processed is acquired, the preset audio signal comprises a first audio signal and a second audio signal, and under the condition that the first audio signal is different from the second audio signal, the first audio signal and the second audio signal can perform echo return loss estimation on the electronic equipment so as to determine the corresponding echo return loss. Correspondingly, the first audio signal and the second audio signal can also carry out double-talk detection on the preset audio signal by the electronic equipment so as to determine a double-talk detection result of the preset audio signal. Based on the above, the electronic device can flexibly perform echo cancellation on the preset audio signal according to at least one of echo return loss and double-talk detection result to obtain a target audio signal corresponding to the preset audio signal, which is beneficial to improving the echo cancellation flexibility of the preset audio signal and further beneficial to improving the echo cancellation effect of the preset audio signal. Drawings In order to more clearly illustrate the technical solutions of the embodiments of the present application, the drawings required for the description of the embodiments will be briefly