EP-4283617-B1 - AUDIO DATA PROCESSING METHOD AND APPARATUS, DEVICE, STORAGE MEDIUM, AND PROGRAM PRODUCT

EP4283617B1EP 4283617 B1EP4283617 B1EP 4283617B1EP-4283617-B1

Inventors

CAO, Muyong

Dates

Publication Date: 20260513
Application Date: 20211118

Claims (10)

An audio data processing method, performed by a computer device, the method comprising: obtaining (S101), in a game voice mode, a signal processing result of a first pre-signal processing policy in an application layer of a service application, the first pre-signal processing policy comprising at least one first optimization component; adaptively selecting (S102), in the application layer according to the signal processing result, a state of a second optimization component in a second pre-signal processing policy in a terminal system layer, or a state of a first optimization component in the first pre-signal processing policy, the second pre-signal processing policy comprising at least one second optimization component, wherein a first optimization component turned on in the first pre-signal processing policy and a second optimization component turned off in the second pre-signal processing policy have the same optimization function, a second optimization component turned on in the second pre-signal processing policy and a first optimization component turned off in the first pre-signal processing policy have the same optimization function, and the first optimization component turned on in the first pre-signal processing policy is different from the second optimization component turned on in the second pre-signal processing policy; and obtaining (S103) uplink voice data of a first user of the service application in the game voice mode, and performing (S103), based on the first optimization component turned on in the first pre-signal processing policy and the second optimization component turned on in the second pre-signal processing policy, voice optimization on the uplink voice data in the game voice mode.
The method according to claim 1, wherein the adaptively selecting (S102), in the application layer according to the signal processing result, a state of a second optimization component in a second pre-signal processing policy in a terminal system layer comprises: determining, according to the signal processing result, the second optimization component turned on in the second pre-signal processing policy, and a second optimization component turned off in the second pre-signal processing policy; using, in the application layer, the second optimization component turned off in the second pre-signal processing policy as a first collaboration component, and turning on, in the first pre-signal processing policy, a first optimization component having the same optimization function as the first collaboration component; and using, in the application layer, the second optimization component turned on in the second pre-signal processing policy as a second collaboration component, and turning off, in the first pre-signal processing policy, a first optimization component having the same optimization function as the second collaboration component.
The method according to claim 1, wherein the obtaining (S101), in a game voice mode, a signal processing result of a first pre-signal processing policy in an application layer of a service application comprises: obtaining a terminal type of a terminal to which the service application belongs, and searching a test list associated with the service application for a test type matching the terminal type; obtaining, from the test list and based on the sound quality parameter in a case that a test type matching the terminal type is found in the test list, a first test processing result obtained by using the first pre-signal processing policy and a second test processing result obtained by using the second pre-signal processing policy; and determining, based on the first test processing result and the second test processing result, an optimal signal processing policy associated with the sound quality parameter from the first pre-signal processing policy and the second pre-signal processing policy, and using the optimal signal processing policy as the signal processing result of the first pre-signal processing policy.
The method according to claim 3, wherein the first optimization component in the first pre-signal processing policy comprises at least one of the following voice optimization algorithms: a first acoustic echo cancellation algorithm used for performing acoustic echo cancellation in the application layer, a first noise suppression algorithm used for performing noise suppression in the application layer, and a first gain control algorithm used for performing gain adjustment in the application layer; and the second optimization component in the second pre-signal processing policy comprises at least one of the following voice optimization algorithms: a second acoustic echo cancellation algorithm used for performing acoustic echo cancellation in the terminal system layer, a second noise suppression algorithm used for performing noise suppression in the terminal system layer, and a second gain control algorithm used for performing gain adjustment in the terminal system layer.
The method according to claim 1, wherein the adaptively selecting (S102), in the application layer according to the signal processing result, a state of a second optimization component in a second pre-signal processing policy in a terminal system layer, or a state of a first optimization component in the first pre-signal processing policy comprises: determining, according to the signal processing result, a second optimization component to be turned on in the second pre-signal processing policy, and a second optimization component to be turned off in the second pre-signal processing policy; turning off the second optimization component to be turned off in the second pre-signal processing policy, and turning on, in the first pre-signal processing policy, a first optimization component having the same optimization function as the turned-off second optimization component; and turning on the second optimization component to be turned on in the second pre-signal processing policy, and turning off, in the first pre-signal processing policy, a first optimization component having the same optimization function as the turned-on second optimization component.
The method according to claim 1, wherein the obtaining (S101), in a game voice mode, a signal processing result of a first pre-signal processing policy in an application layer of a service application comprises: obtaining a terminal type of a terminal to which the service application belongs, and searching a test list associated with the service application for a test type matching the terminal type; obtaining, from the test list and based on the sound quality parameter in a case that a test type matching the terminal type is found in the test list, a first test processing result obtained by using the first pre-signal processing policy and a second test processing result obtained by using the second pre-signal processing policy; and determining, based on the first test processing result and the second test processing result, an optimal signal processing policy associated with the sound quality parameter from the first pre-signal processing policy and the second pre-signal processing policy, and using the optimal signal processing policy as the signal processing result of the first pre-signal processing policy.
The method according to claim 6, wherein the first optimization component in the first pre-signal processing policy comprises at least one of the following voice optimization algorithms: a first acoustic echo cancellation algorithm used for performing acoustic echo cancellation in the application layer, a first noise suppression algorithm used for performing noise suppression in the application layer, and a first gain control algorithm used for performing gain adjustment in the application layer; and the second optimization component in the second pre-signal processing policy comprises at least one of the following voice optimization algorithms: a second acoustic echo cancellation algorithm used for performing acoustic echo cancellation in the terminal system layer, a second noise suppression algorithm used for performing noise suppression in the terminal system layer, and a second gain control algorithm used for performing gain adjustment in the terminal system layer.
An audio data processing apparatus, comprising: a processing result obtaining module (12), configured to obtain, in a game voice mode, a signal processing result of a first pre-signal processing policy in an application layer of a service application, the first pre-signal processing policy comprising at least one first optimization component; a component control module (13), configured to adaptively select, in the application layer according to the signal processing result, a state of a second optimization component in a second pre-signal processing policy in a terminal system layer, or a state of a first optimization component in the first pre-signal processing policy, the second pre-signal processing policy comprising at least one second optimization component, wherein a first optimization component turned on in the first pre-signal processing policy and a second optimization component turned off in the second pre-signal processing policy have the same optimization function, a second optimization component turned on in the second pre-signal processing policy and a first optimization component turned off in the first pre-signal processing policy have the same optimization function, and the first optimization component turned on in the first pre-signal processing policy is different from the second optimization component turned on in the second pre-signal processing policy; and a voice optimization module (14), configured to: obtain uplink voice data of a first user of the service application in the game voice mode, and perform, based on the first optimization component turned on in the first pre-signal processing policy and the second optimization component turned on in the second pre-signal processing policy, voice optimization on the uplink voice data in the game voice mode.
A computer-readable storage medium, storing a computer program, the computer program being configured to be loaded and executed by a processor, to cause a computer device having the processor to carry out the method according to any one of claims 1 to 7.
A computer program, the computer program comprising computer instructions, the computer instructions being stored in a computer-readable storage medium, a processor reading and executing the computer instructions from the computer-readable storage medium to carry out the method according to any one of claims 1 to 7.

Description

FIELD This application relates to the field of computer technologies, and in particular, to an audio data processing method and apparatus, a device, a storage medium, and a program product. BACKGROUND Currently, a user (for example, a user A) of a mobile terminal may make a system call to another user (for example, a user B) in a system call mode. For example, the user A may make a system call (that is, make a phone call) to the user B in the foregoing system call mode in a telephone call scenario. Therefore, when the user A runs a game application (for example, a game application X) on the mobile terminal and makes a system call (that is, make a phone call) to the user B, an application layer of the game application X usually needs to use a system call mode of a terminal system layer of the mobile terminal. Based on this, when the mobile terminal identically turns on signal processing units (that is, voice optimization components) in a voice pre-signal processing solution in the system call mode, both the application layer and the terminal system layer will perform voice optimization on a collected sound of the user A through signal processing units of the same function type (that is, voice optimization components with the same function). As a result, the voice optimization components with the same function are operated repeatedly, resulting in increased overheads of the system. Further, the multiple number of times of repeated processing can cause problems such as sound quality loss, which will reduce the voice optimization effect. SUMMARY An audio data processing method and apparatus, a device, a storage medium, and a program product are provided according to embodiments of the present disclosure, to improve a voice optimization effect in a game scenario. The invention is set out in the appended claims. In the embodiments of the present disclosure, the computer device (for example, a mobile terminal) may control, in the application layer according to the signal processing result in the game voice mode, to turn on and turn off of the second optimization component in the second pre-signal processing policy in the terminal system layer (that is, the voice optimization component in the second pre-signal processing policy), or to turn on and turn off of the first optimization component in the first pre-signal processing policy (that is, the voice optimization component in the first pre-signal processing policy). It can be understood that, a first optimization component turned on in the first pre-signal processing policy is different from a second optimization component turned on in the second pre-signal processing policy. Therefore, it can be seen that, in the embodiments of the present disclosure, one or more voice optimization components in the terminal system layer may be controlled to be turned on or turned off in the application layer according to the foregoing signal processing result (that is, algorithm comparison results respectively of voice optimization components having the same function), so that the voice optimization components having the same optimization function may be either run in the application layer or the terminal system layer. In this way, sound quality loss of the uplink voice data may be reduced effectively. It can be understood that, the quantity and type of second optimization components turned on or turned off in the terminal system layer are not limited herein. Further, when obtaining the uplink voice data of the first user in the game voice mode, the computer device may quickly perform voice optimization on the uplink voice data in the game voice mode based on the first optimization component and the second optimization component having different functions, which can improve the voice optimization effect in the game scenario while reducing the sound quality loss. BRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 is a schematic structural diagram of a network architecture according to an embodiment of the present disclosure.FIG. 2 is a schematic diagram of service modes according to an embodiment of the present disclosure.FIG. 3 is a schematic diagram of a voice data processing process according to an embodiment of the present disclosure.FIG. 4 is a schematic diagram of a scenario of performing voice interaction in a game scenario according to an embodiment of the present disclosure.FIG. 5 is a schematic flowchart of an audio data processing method according to an embodiment of the present disclosure.FIG. 6 is a schematic diagram of a scenario of a test list according to an embodiment of the present disclosure.FIG. 7 is a schematic diagram of a scenario of determining an optimal signal processing policy associated with a sound quality parameter according to an embodiment of the present disclosure.FIG. 8 is a schematic diagram of a scenario of controlling each voice optimization component to be turned on or turned off in a voice pre-signal processing solution according to an embodiment of