CN-121999788-A - Audio processing method, electronic device and computer program product

Abstract

The application provides an audio processing method, an electronic device and a storage medium. The method comprises: acquiring a background sound in original audio data; identifying the sound type of the background sound to determine the target sound type to which the background sound belongs; performing enhancement processing on the background sound based on a sample background sound of the target sound type; and outputting the background sound after the enhancement processing. The application can enhance the background sound in audio data in a targeted manner.

Inventors

MA JIAN
YANG ZHENHUI
LV DONGZE

Assignees

ZTE Corporation (中兴通讯股份有限公司)

Dates

Publication Date: 20260508
Application Date: 20241105

Claims (10)

1. An audio processing method, comprising: acquiring a background sound in original audio data; identifying a sound type of the background sound to determine a target sound type to which the background sound belongs; performing enhancement processing on the background sound based on a sample background sound of the target sound type; and outputting the background sound after the enhancement processing.
2. The method according to claim 1, wherein identifying the sound type of the background sound to determine the target sound type to which the background sound belongs comprises: performing feature extraction on the background sound to obtain feature data corresponding to the background sound; and determining, based on the feature data corresponding to the background sound, the target sound type to which the background sound belongs.
3. The method according to claim 2, wherein determining, based on the feature data corresponding to the background sound, the target sound type to which the background sound belongs comprises: inputting the feature data corresponding to the background sound into a target deep learning model to determine the target sound type to which the background sound belongs, wherein the target deep learning model is trained based on feature data of sample background sounds and sound type labels of the sample background sounds, each sound type label marking the sound type to which the corresponding sample background sound belongs; and/or determining, from a plurality of sample background sounds whose feature data has been determined, a target sample background sound whose feature data matches that of the background sound, and determining the sound type of the target sample background sound as the target sound type of the background sound.
4. The method according to claim 2, wherein performing feature extraction on the background sound to obtain the feature data corresponding to the background sound comprises: performing a fast Fourier transform on the background sound to obtain spectrum data corresponding to the background sound; and performing feature extraction on the spectrum data in the time domain and/or the frequency domain to obtain the feature data corresponding to the background sound.
5. The method according to any one of claims 1 to 4, wherein performing enhancement processing on the background sound based on the sample background sound of the target sound type comprises: determining a distribution structure of the sample background sound of the target sound type in at least one of frequency, energy, and harmonics; and performing gain processing on signals in the background sound that conform to the distribution structure, and/or performing attenuation processing on signals in the background sound that do not conform to the distribution structure.
6. The method according to any one of claims 1 to 4, wherein outputting the background sound after the enhancement processing comprises: mixing the background sound after the enhancement processing with the original audio data, and outputting target audio data obtained by the mixing; or mixing the background sound after the enhancement processing with residual audio data that remains after the background sound is removed from the original audio data, and outputting target audio data obtained by the mixing.
7. The method according to any one of claims 1 to 4, wherein the original audio data is audio data collected by a local call device in a call scenario; and the method further comprises: performing emergency abnormal event recognition based on the background sound after the enhancement processing, and, when an emergency abnormal event is recognized, sending prompt information corresponding to the emergency abnormal event to a peer call device in the call scenario.
8. The method according to any one of claims 1 to 4, wherein the original audio data is data collected by a virtual reality device; and outputting the background sound after the enhancement processing comprises: controlling the virtual reality device to output audio in a virtual reality scene based on the background sound after the enhancement processing.
9. An electronic device comprising a processor, and a memory configured to store computer-executable instructions that, when executed, cause the processor to perform the method of any of claims 1-8.
10. A computer program product comprising a computer readable storage medium storing a computer program operable to cause a computer to perform the method of any one of claims 1 to 8.
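
As an illustration of claims 2 to 4, the following is a minimal Python sketch, not the applicant's implementation. It runs a radix-2 fast Fourier transform on one frame, derives two simple features from the spectrum data (a normalized spectral centroid and an energy value), and picks the labelled sample background sound with the nearest features, which corresponds to the matching branch of claim 3. The choice of features, the Euclidean distance, all function names, and the labels "siren" and "birdsong" are assumptions made for illustration; the source does not specify them.

```python
import cmath
import math


def fft(frame):
    """Recursive radix-2 FFT; the frame length must be a power of two."""
    n = len(frame)
    if n == 0 or n & (n - 1):
        raise ValueError("frame length must be a power of two")
    if n == 1:
        return [complex(frame[0])]
    even, odd = fft(frame[0::2]), fft(frame[1::2])
    out = [0j] * n
    for k in range(n // 2):
        t = cmath.exp(-2j * cmath.pi * k / n) * odd[k]
        out[k], out[k + n // 2] = even[k] + t, even[k] - t
    return out


def extract_features(frame):
    """Return (normalized spectral centroid, RMS energy) from the spectrum."""
    n = len(frame)
    spectrum = fft(frame)
    half = n // 2
    magnitudes = [abs(c) for c in spectrum[:half]]
    total = sum(magnitudes)
    # Centroid expressed as a fraction of the Nyquist frequency (0.0 to 1.0).
    centroid = sum(k * m for k, m in enumerate(magnitudes)) / (total * half) if total else 0.0
    # RMS energy computed from the spectrum via Parseval's relation.
    rms = math.sqrt(sum(abs(c) ** 2 for c in spectrum)) / n
    return (centroid, rms)


def match_sound_type(features, labelled_features):
    """Return the label whose sample features are nearest to `features`."""
    return min(labelled_features,
               key=lambda label: math.dist(features, labelled_features[label]))


def make_tone(freq_hz, sample_rate, n, amplitude=1.0):
    """Synthetic test signal standing in for a recorded background sound."""
    return [amplitude * math.sin(2 * math.pi * freq_hz * i / sample_rate)
            for i in range(n)]


# Hypothetical sample library: one pure tone per sound type.
samples = {
    "siren": extract_features(make_tone(1000, 8000, 256)),
    "birdsong": extract_features(make_tone(3000, 8000, 256)),
}
query = extract_features(make_tone(1000, 8000, 256, amplitude=0.8))
detected = match_sound_type(query, samples)
```

In this toy setup the quieter 1000 Hz query is matched to the "siren" sample because its centroid is identical and only its energy differs. A real system would use richer features and recorded samples, or the trained model of the first branch of claim 3.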

Description

Audio processing method, electronic device and computer program product

Technical Field

The present application relates to the field of audio processing, and in particular, to an audio processing method, an electronic device, and a computer program product.

Background

Existing intelligent terminals, such as mobile phones, tablet computers and in-vehicle central control units, are generally equipped with microphones that have a noise reduction function, so that background sound is treated as noise and suppressed to some extent when sound is collected. In practice, however, many terminal applications, and even users themselves, need the background sound in some situations, which is the opposite of treating it as noise.

Disclosure of Invention

The application aims to provide an audio processing method, an electronic device and a storage medium, which can at least enhance the background sound in audio data in a targeted manner.

In order to achieve the above object, embodiments of the present application are realized as follows:

In a first aspect, an audio processing method is provided, which includes acquiring a background sound in original audio data, identifying a sound type of the background sound to determine a target sound type to which the background sound belongs, performing enhancement processing on the background sound based on a sample background sound of the target sound type, and outputting the background sound after the enhancement processing.

In a second aspect, an embodiment of the application provides an electronic device comprising a processor and a memory configured to store computer-executable instructions that, when executed, cause the processor to perform the method of the first aspect.

In a third aspect, there is provided a computer program product comprising a computer readable storage medium storing a computer program that, when executed, is operable to cause a computer to perform the method of the first aspect.
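
The enhancement step of the first aspect, as detailed in claim 5, can be illustrated with a short Python sketch that works on a magnitude spectrum. It assumes that the "distribution structure" of the sample background sound is approximated by the set of frequency bins in which the sample carries significant energy; bins of the captured background sound inside that set receive gain and the remaining bins are attenuated. The threshold, the decibel values and the function names are hypothetical, and the source does not prescribe how conformity with the distribution structure is decided.

```python
def distribution_mask(sample_magnitudes, threshold=0.1):
    """Mark bins where the sample reaches at least `threshold` of its peak."""
    peak = max(sample_magnitudes, default=0.0)
    if peak == 0:
        return [False] * len(sample_magnitudes)
    return [m / peak >= threshold for m in sample_magnitudes]


def enhance_spectrum(magnitudes, mask, boost_db=6.0, cut_db=-6.0):
    """Apply gain to conforming bins and attenuation to the others."""
    if len(magnitudes) != len(mask):
        raise ValueError("spectrum and mask must have the same length")
    boost = 10 ** (boost_db / 20)  # decibels to linear amplitude factor
    cut = 10 ** (cut_db / 20)
    return [m * (boost if conforms else cut)
            for m, conforms in zip(magnitudes, mask)]


# Hypothetical four-bin spectra: the sample is concentrated in bins 1 and 2.
sample_spectrum = [0.0, 1.0, 0.5, 0.05]
captured_spectrum = [1.0, 2.0, 1.0, 1.0]
mask = distribution_mask(sample_spectrum)
enhanced = enhance_spectrum(captured_spectrum, mask, boost_db=20.0, cut_db=-20.0)
```

With a 20 dB boost and a 20 dB cut, bins 1 and 2 are scaled by a factor of 10 and bins 0 and 3 by a factor of 0.1. Using only the boost or only the cut corresponds to the "and/or" wording of claim 5.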
The embodiments of the application extract the background sound from original audio data collected by a terminal device, identify its sound type to determine the target sound type, and then enhance and output the background sound of the original audio data based on a sample background sound of the same target sound type.

Drawings

In order to illustrate the embodiments of the present application or the technical solutions in the prior art more clearly, the drawings required for describing the embodiments or the prior art are briefly introduced below. The drawings described below show only some of the embodiments of the present application, and a person of ordinary skill in the art may obtain other drawings from them without inventive effort.

Fig. 1 is a flow chart of an audio processing method according to an embodiment of the application.
Fig. 2 is a schematic diagram of providing a background sound option in the audio processing method according to an embodiment of the application.
Fig. 3 is a schematic diagram of a first application scenario of an audio processing method according to an embodiment of the present application.
Fig. 4 is a schematic diagram of a second application scenario of an audio processing method according to an embodiment of the present application.
Fig. 5 is a schematic structural diagram of an audio processing device according to an embodiment of the application.
Fig. 6 is a schematic structural diagram of an electronic device according to an embodiment of the present application.

Detailed Description

As described above, many intelligent terminals on the market have a microphone with a noise reduction function, and when sound is collected, background sound is treated as noise and reduced to some extent.
For example, the dual-microphone configuration commonly found on mobile phones is designed to reduce background noise. In practical use, however, background sound is a key element in conveying the environment and can provide a great deal of valuable information, so it should not be filtered out indiscriminately. The application therefore aims to provide an audio processing scheme that can enhance the background sound in audio data in a targeted manner, so that the background sound can serve its purpose.

In order that those skilled in the art may better understand the technical solutions in this specification, the technical solutions in the embodiments of the present application are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some, not all, of the embodiments of this specification. All other embodiments, which can be made by one of ordinary skill in the art without undue burden