CN-122027948-A - Audio signal processing method, device, equipment, storage medium and program product

CN122027948ACN 122027948 ACN122027948 ACN 122027948ACN-122027948-A

Abstract

The embodiment of the application provides a processing method, a device, equipment, a storage medium and a program product of an audio signal, relating to the technical field of audio processing; the method comprises the steps of performing first analog gain on an initial signal acquired by a microphone to obtain a first analog gain signal, performing second analog gain on the initial signal to obtain a second analog gain signal, performing analog-to-digital conversion on the first analog gain signal and the second analog gain signal to obtain a first analog gain digital signal and a second analog gain digital signal, performing time domain framing processing on the first analog gain digital signal and the second analog gain digital signal to obtain a plurality of frames of first audio signals and a plurality of frames of second audio signals respectively, performing digital gain negative compensation on the first audio signals to obtain a third audio signal, and performing signal fusion on the second audio signals and the third audio signal to obtain a processed target audio signal. The method can improve the recording effect of the recording equipment.

Inventors

LI JING
HE YANXING

Assignees

展讯通信（上海）有限公司

Dates

Publication Date: 20260512
Application Date: 20260319

Claims (10)

1. A method of processing an audio signal, the method comprising: Performing first analog gain on an initial signal acquired by a microphone to obtain a first analog gain signal, and performing second analog gain on the initial signal to obtain a second analog gain signal, wherein the first analog gain is used for increasing the initial signal by a first preset decibel value, the second analog gain is used for increasing the initial signal by a second preset decibel value, and the first preset decibel value is larger than the second preset decibel value; performing analog-to-digital conversion on the first analog gain signal and the second analog gain signal to obtain a first analog gain digital signal and a second analog gain digital signal; Respectively carrying out time domain framing treatment on the first analog gain digital signal and the second analog gain digital signal to obtain a plurality of frames of first audio signals and a plurality of frames of second audio signals; Performing digital gain negative compensation on the first audio signal to obtain a third audio signal, so that the third audio signal and the second audio signal are aligned under the same total gain, wherein the total gain comprises an analog gain and a digital gain; and carrying out signal fusion on the second audio signal and the third audio signal to obtain a processed target audio signal.
2. The processing method according to claim 1, wherein performing digital gain negative compensation on the first audio signal to obtain a third audio signal comprises: Determining a difference between the first preset decibel value and the second preset decibel value; And taking the difference value as a negative compensation value of the digital gain, and carrying out digital gain negative compensation on the first audio signal to obtain a third audio signal.
3. The processing method according to claim 2, wherein the difference is taken as a negative compensation value of a digital gain, and the first audio signal is subjected to negative compensation of the digital gain, and before the third audio signal is obtained, the method further comprises: And adjusting the first audio signal from a bit representation of a first preset value to a bit representation of a second preset value, wherein the second preset value is greater than the first preset value.
4. The processing method according to claim 1, wherein performing signal fusion on the second audio signal and the third audio signal to obtain a processed target audio signal includes: Determining a fusion weight corresponding to the third audio signal and a fusion weight corresponding to the second audio signal; And carrying out signal fusion on the second audio signal and the third audio signal according to the fusion weight corresponding to the third audio signal and the fusion weight corresponding to the second audio signal to obtain a processed target audio signal.
5. The processing method of claim 4, wherein the determining the fusion weight corresponding to the third audio signal and the fusion weight corresponding to the second audio signal comprises: and determining whether the peak value of the third audio signal is larger than a preset amplitude threshold value, if so, determining that the first fusion weight corresponding to the third audio signal is 1 and the fusion weight corresponding to the second audio signal is 0, and if not, determining that the fusion weight corresponding to the third audio signal is 0 and the fusion weight corresponding to the second audio signal is 1.
6. The method of processing according to claim 5, further comprising: Acquiring a third fusion weight of a third audio signal corresponding to the first audio signal of the previous frame; Determining whether the third fusion weight is the same as a first fusion weight of a third audio signal corresponding to a first audio signal of a current frame; if the third fusion weights are different, determining weight change gradients of the third fusion weights to the first fusion weights, switching the third fusion weights to the first fusion weights in a delay smoothing mode according to the weight change gradients, and if the third fusion weights are the same, keeping the third fusion weights unchanged.
7. An audio signal processing apparatus, comprising: the analog gain module is used for carrying out first analog gain on the initial signal acquired by the microphone to obtain a first analog gain signal, and carrying out second analog gain on the initial signal to obtain a second analog gain signal; the first analog gain is used for increasing the initial signal by a first preset decibel value, and the second analog gain is used for increasing the initial signal by a second preset decibel value, and the first preset decibel value is larger than the second preset decibel value; the analog-to-digital conversion module is used for carrying out analog-to-digital conversion on the first analog gain signal and the second analog gain signal to obtain a first analog gain digital signal and a second analog gain digital signal; The time domain framing module is used for performing time domain framing processing on the first analog gain digital signal and the second analog gain digital signal respectively to obtain a plurality of frames of first audio signals and a plurality of frames of second audio signals; The digital signal compensation module is used for carrying out digital gain negative compensation on the first audio signal to obtain a third audio signal so as to enable the third audio signal to be aligned with the second audio signal under the same total gain, wherein the total gain comprises an analog gain and a digital gain; and the digital signal fusion module is used for carrying out signal fusion on the second audio signal and the third audio signal to obtain a processed target audio signal.
8. An electronic device is characterized by comprising a memory and a processor; The memory stores computer-executable instructions; the processor executing computer-executable instructions stored in the memory, causing the processor to perform the method of processing an audio signal as claimed in any one of claims 1-6.
9. A computer-readable storage medium, in which computer-executable instructions are stored, which computer-executable instructions, when executed by a processor, are for implementing the method of processing an audio signal according to any one of claims 1-6.
10. A computer program product comprising a computer program which, when executed by a processor, implements the method of processing an audio signal according to any one of claims 1-6.

Description

Audio signal processing method, device, equipment, storage medium and program product Technical Field The present application relates to the field of audio processing technologies, and in particular, to a method, an apparatus, a device, a storage medium, and a program product for processing an audio signal. Background In audio processing scenes such as audio collection, voice processing, music production, video recording and the like, the requirements of users on recording quality are increasingly improved. In the prior art, the recording device may include a microphone, a PGA (Programmable gain amplifier) and an ADC (Analog to Digital Converter, analog-to-digital converter), where an analog signal collected by the microphone is amplified by the PGA and then sent to the ADC for digital processing. However, since ADC noise floor and quantization accuracy result in limited dynamic range (ratio between minimum signal and maximum signal that can be processed) of ADC, quality of capturing high sound pressure level audio signal and low sound pressure level audio signal by the recording device is affected, which in turn results in poor recording effect of the recording device. Disclosure of Invention The embodiment of the application provides a processing method, a device, equipment, a storage medium and a program product for audio signals, which can improve the recording effect of recording equipment. In a first aspect, an embodiment of the present application provides a method for processing an audio signal, where the method includes: Performing first analog gain on an initial signal acquired by a microphone to obtain a first analog gain signal, and performing second analog gain on the initial signal to obtain a second analog gain signal, wherein the first analog gain is used for increasing the initial signal by a first preset decibel value, the second analog gain is used for increasing the initial signal by a second preset decibel value, and the first preset decibel value is larger than the second preset decibel value; performing analog-to-digital conversion on the first analog gain signal and the second analog gain signal to obtain a first analog gain digital signal and a second analog gain digital signal; Respectively carrying out time domain framing treatment on the first analog gain digital signal and the second analog gain digital signal to obtain a plurality of frames of first audio signals and a plurality of frames of second audio signals; Performing digital gain negative compensation on the first audio signal to obtain a third audio signal, so that the third audio signal and the second audio signal are aligned under the same total gain, wherein the total gain comprises an analog gain and a digital gain; and carrying out signal fusion on the second audio signal and the third audio signal to obtain a processed target audio signal. In one possible implementation, performing negative digital gain compensation on the first audio signal to obtain a third audio signal includes determining a difference between a first preset decibel value and a second preset decibel value, and performing negative digital gain compensation on the first audio signal by taking the difference as a negative compensation value of the digital gain to obtain the third audio signal. In a possible implementation, before the difference is used as the negative compensation value of the digital gain to perform the negative compensation of the digital gain on the first audio signal to obtain the third audio signal, the method further comprises adjusting the bit representation of the first audio signal from the bit representation of the first preset value to the bit representation of the second preset value, wherein the second preset value is larger than the first preset value. In one possible implementation manner, the second audio signal and the third audio signal are subjected to signal fusion to obtain a processed target audio signal, wherein the method comprises the steps of determining fusion weights corresponding to the third audio signal and fusion weights corresponding to the second audio signal, and carrying out signal fusion on the second audio signal and the third audio signal according to the fusion weights corresponding to the third audio signal and the fusion weights corresponding to the second audio signal to obtain the processed target audio signal. In one possible implementation manner, determining the fusion weight corresponding to the third audio signal and the fusion weight corresponding to the second audio signal includes determining whether a peak value of the third audio signal is greater than a preset amplitude threshold, if so, determining that the first fusion weight corresponding to the third audio signal is 1 and the fusion weight corresponding to the second audio signal is 0, and if not, determining that the fusion weight corresponding to the third audio signal is 0 and the fusion weight corresponding to the seco