EP-4192021-B1 - AUDIO DATA PROCESSING METHOD AND APPARATUS, AND DEVICE AND STORAGE MEDIUM

EP4192021B1EP 4192021 B1EP4192021 B1EP 4192021B1EP-4192021-B1

Inventors

LI, CHENG
HUANG, HAO

Dates

Publication Date: 20260506
Application Date: 20210826

Claims (9)

A method for processing audio data, comprising: acquiring (101) a first playback position on first audio data and an audition instruction of a user for at least one first sound effect; adding (102) the at least one first sound effect to a first audio segment in the first audio data, to generate sound effect audition data, and playing the sound effect audition data to audition, wherein the first audio segment starts from the first playback position; selecting a sound effect from the auditioned at least one first sound effect as a second sound effect; receiving (103) a first adding instruction of the user for the second sound effect, wherein the first adding instruction comprises information on a first adding length of the second sound effect to be added in the first audio data; and adding (104) the second sound effect to a second audio segment in the first audio data to obtain second audio data, wherein the second audio segment starts from the first playback position, and has a length of the first adding length, wherein adding (102) the first sound effect to the first audio segment in the first audio data comprises: displaying one or more sound options, and determining a sound selected by the user from the at least one sound option, as the target sound; recognizing the target sound from the first audio segment through a preset sound recognition model; and adding the at least one first sound effect to the target sound in the first audio segment, to generate audition data of the first sound effect on the first audio segment.
The method according to claim 1, wherein the first audio data is audio data of a to-be-edited video in a video editing interface.
The method according to claim 2, wherein acquiring the first playback position on the first audio data and the audition instruction of the user for the first sound effect comprising: displaying the video editing interface, wherein the video editing interface comprises a playback progress control for a video and a sound effect control for the first sound effect; and acquiring the first playback position selected by the user through the playback progress control, and the audition instruction triggered by the user through the sound effect control.
The method according to any one of claims 1 to 3, further comprising: returning a playback position of the first audio data to the first playback position, after playing of the sound effect audition data is finished.
The method according to claim 1, wherein adding (104) the second sound effect to the second audio segment in the first audio data comprises: adding the second sound effect to a target sound in the second audio segment.
The method according to claim 1, wherein after adding (104) the second sound effect to the second audio segment in the first audio data to obtain the second audio data, the method further comprises: obtaining a second playback position on the second audio data, and a second adding instruction of the user for a third sound effect, wherein the second adding instruction comprises information on a second adding length of the third sound effect to be added to the second audio data; and adding the third sound effect to a third audio segment in the second audio data to obtain third audio data, wherein the third audio segment starts from the second playback position and has a length of the second adding length.
The method according to claim 6, further comprising: applying a fade-out effect on the second sound effect and a fade-in effect on the third sound effect, in a case that an end position of the second sound effect and the second playback position are two consecutive playback positions on the third audio data.
A terminal device, comprising: a memory (1008); and a processor (1001), wherein the memory (1008) stores a computer program; and the computer program, when executed by the processor (1001), causes the processor (1001) to implement the method according to any one of claims 1 to 7.
A computer-readable storage medium storing a computer program, wherein the computer program, when executed by a processor (1001), causes the processor to implement the method according to any one of claims 1 to 7.

Description

FIELD Embodiment of the present disclosure relates to the technical field of audio data processing, in particular to a method and device for processing audio data, an apparatus, and a storage medium. BACKGROUND Video applications provided by relevant technologies have a voice changing function, through which users can add a preferred voice changing effect to a video. However, the method for adding the voice changing effect to the video provided in the relevant technologies cannot meet user requirements. The published patent application CN 109 346 111 A, GUANGZHOU KUGOU TECH CO LTD, 15 February 2019, discloses adding sound effects to audio-video content, after trying them. The published patent application CN 110 377 212 A, SHANGHAI EDAY SOFTWARE CO LTD, 25 October 2019 discloses addition of sound effects to a sound effect area determined according to a rhythm recognition rule. SUMMARY The invention is set out in the appended set of claims. In order to solve or at least partially solve the above technical problems, a method and a device for processing audio data, an apparatus, and a storage medium are provided according to embodiments of the present disclosure. Advantages of the technical solution provided in the embodiments of the present disclosure, compared with the conventional technologies, are described below. In the embodiments of the present disclsoure, the first playback position on the first audio data and the audition instruction of the user for the first sound effect are acquired, the first sound effect is added to the first audio segment in the first audio data to obtain the sound effect audition data for audition, and then the sound effect audition data is played; and in response to a reception of the first adding instruction of the user for the second sound effect, the second sound effect is added to the second audio segment in the first audio data based on the first adding length carried in the first adding instruction, so as to obtain the second audio data, where the second audio segment starts from the first playback position. Based on the solution in the embodiments, the user can select any position on the audio data for auditioning the sound effect, and the satisfying sound effect may be added to a certain audio segment of the audio data based on the in the audition result. Hence, compared to a condition where the adding effect of the sound effect cannot be auditioned, the solution provided in the present disclosure enables the user to select a satisfying sound effect through audition and add the satisfying sound effect to the audio data. Thereby, it is ensured that the sound effect added to the audio data is satisfying for the user, and a situation in which the user is not satisfied with an added sound effect and has to add another sound effect is avoided. Hence, user operation is simplified, and user experience is improved. In addition, with the solution according to the embodiments of the present disclosure, the user can add a certain sound effect on a certain audio segment in the audio data, and can add multiple sound effects correspondingly to multiple audio segments in the audio data. In this way, the adding effect of sound effect is enriched, an interest of adding the sound effects is improved, and user experience is improved. BRIEF DESCRIPTION OF THE DRAWINGS The accompanying drawings are incorporated herein and constitute a part of this specification. The drawings illustrate embodiments consistent with the present disclosure and, together with the specification, serve to explain the principles of the present disclosure. In order to more clearly explain the embodiments of the present disclosure or the technical solutions in the conventional technology, the drawings used in the description of the embodiments or the conventional technology are briefly introduced below. Apparently, those skilled in the art can obtain other drawings based on the provided drawings without any creative effort. Figure 1 shows a flowchart of a method for processing audio data according to an embodiment of the present disclosure.Figure 2A and Figure 2B are schematic diagrams showing operations on an operation interface according to an embodiment of the present disclosure.Figure 3A is a schematic diagram of a method for adding a sound effect according to an embodiment of the present disclosure.Figure 3B is a schematic diagram of a method for adding a sound effect according to another embodiment of the present disclosure.Figure 4A and Figure 4B are schematic diagrams of a method for adding a sound effect according to yet another embodiment of the present disclosure.Figure 5A and Figure 5B are schematic diagrams of a method for adding a sound effect according to yet another embodiment of the present disclosure.Figure 6 is a schematic diagram of a method for smoothing an audio according to an embodiment of the present disclosure.Figure 7 is a flowchart of a method for processing audio data according to another