Search

CN-122021668-A - Voice processing method, electronic equipment and program product

CN122021668ACN 122021668 ACN122021668 ACN 122021668ACN-122021668-A

Abstract

The embodiment of the disclosure provides a voice processing method, electronic equipment and a program product. The voice processing method comprises the steps of obtaining voices collected from a plurality of Bluetooth audio devices to obtain a plurality of different voices, wherein the voices of each Bluetooth audio device correspond to one language type, translating the plurality of different voices based on the language type of each voice and the target language type to obtain corresponding target voices respectively, and playing the target voices according to a preset playing mode. The embodiment scheme realizes playing the translated voice, can enable the user to quickly listen to the translated content, and provides convenience for the user.

Inventors

  • XU MEI
  • HUANG TUBIN

Assignees

  • 中兴通讯股份有限公司

Dates

Publication Date
20260512
Application Date
20241111

Claims (10)

  1. 1. A method of speech processing, comprising: Acquiring voices acquired from a plurality of Bluetooth audio devices to obtain a plurality of different voices, wherein the voices of each Bluetooth audio device correspond to one language type; Translating the plurality of different voices based on the language type of each voice and the target language type to respectively obtain corresponding target voices; and playing the target voice according to a preset playing mode.
  2. 2. The method for processing speech according to claim 1, wherein said translating a plurality of different voices based on the language type of each voice and the target language type to obtain corresponding target voices, respectively, comprises: starting a target number of threads according to the number of the different voices; respectively adopting a thread to carry out tone recognition and voice translation on voice streams corresponding to each voice; And combining the voice obtained after voice translation with the corresponding voice to obtain the target voice corresponding to each voice.
  3. 3. The method for processing speech according to claim 1, wherein said translating a plurality of different voices based on the language type of each voice and the target language type to obtain corresponding target voices, respectively, comprises: mixing the different voices to obtain a voice stream; splitting the voice stream according to the waveform of the voice stream to obtain a plurality of sub-voice streams; sequentially carrying out voice translation on each split sub-voice stream and identifying the tone of each sub-voice stream; And combining the voice obtained after the voice translation of each sub-voice stream with the corresponding voice to obtain the target voice corresponding to each voice.
  4. 4. The speech processing method of claim 1 wherein the method further comprises: and acquiring target information of each target voice, and outputting and displaying the target information, wherein the target information comprises identifications and/or translation contents corresponding to the target voice.
  5. 5. The voice processing method according to claim 4, wherein the play mode includes a manual selection play mode or an automatic play mode; The manual selection playing mode comprises playing corresponding target voice according to the selection result of the displayed identification and/or the translation content, or The automatic playing mode comprises the step of automatically playing target voice corresponding to the specified mark according to the pre-stored specified mark or automatically playing the target voice according to the sequence of the completion of the translation of the target voice.
  6. 6. The voice processing method according to claim 1, wherein each of the bluetooth audio device, the voice collected by the bluetooth audio device, and the target voice obtained after the voice translation corresponds to the same identifier.
  7. 7. The speech processing method of claim 6 wherein the method further comprises: after each target voice is obtained, the target voice is sent to Bluetooth audio equipment corresponding to the identification except the identification corresponding to the target voice to be played.
  8. 8. The voice processing method of claim 1, wherein after the acquiring the voices collected from the plurality of bluetooth audio devices, the method further comprises: Segmenting each voice with the voice time length being greater than or equal to the time length threshold according to a preset time length threshold, so that the voice time length of each voice obtained after segmentation is smaller than the time length threshold.
  9. 9. An electronic device, comprising: one or more processors; A memory having one or more programs stored thereon, which when executed by the one or more processors, cause the one or more processors to implement the speech processing method of any of claims 1-8; One or more input/output I/O interfaces coupled between the processor and the memory configured to enable information interaction of the processor with the memory.
  10. 10. A computer program product comprising a computer program which, when executed by a processor, implements the speech processing method of any of claims 1-8.

Description

Voice processing method, electronic equipment and program product Technical Field Embodiments of the present disclosure relate to the field of wireless communications, and in particular, to a voice processing method, an electronic device, and a program product. Background At present, after receiving the sound, the earphone translates the audio through the corresponding application, and only displays the translated content on the interface, so that the translated sound cannot be played from the earphone. In addition, the radio device in the existing earphone is inconvenient for collecting a plurality of voices and performing unified translation. In addition, only translation content is provided in the current speech translation application scene, the playing function of translation speech is not supported, and inconvenience is brought to users. Disclosure of Invention The embodiment of the disclosure provides a voice processing method, electronic equipment and a program product. In a first aspect, an embodiment of the present disclosure provides a voice processing method, including: Acquiring voices acquired from a plurality of Bluetooth audio devices to obtain a plurality of different voices, wherein the voices of each Bluetooth audio device correspond to one language type; Translating the plurality of different voices based on the language type of each voice and the target language type to respectively obtain corresponding target voices; and playing the target voice according to a preset playing mode. In a second aspect, embodiments of the present disclosure further provide an electronic device, including: one or more processors; a memory having one or more programs stored thereon, which when executed by the one or more processors, cause the one or more processors to implement the speech processing method; One or more input/output I/O interfaces coupled between the processor and the memory configured to enable information interaction of the processor with the memory. In a third aspect, the disclosed embodiments also provide a computer program product comprising a computer program which, when executed by a processor, implements the speech processing method. According to the embodiment of the disclosure, voices collected from a plurality of Bluetooth audio devices are obtained to obtain a plurality of different voices, wherein the voices of each Bluetooth audio device correspond to one language type, the plurality of different voices are translated based on the language type of each voice and the target language type to obtain corresponding target voices respectively, and the target voices are played according to a preset playing mode. The embodiment scheme realizes playing the translated voice, can enable the user to quickly listen to the translated content, and provides convenience for the user. Drawings In the drawings of the embodiments of the present disclosure: Fig. 1 is a schematic flow chart of a voice processing method according to an embodiment of the disclosure; Fig. 2 is a schematic diagram of audio communication between a speaker and a listener based on bluetooth in a conference scenario provided by an embodiment of the present disclosure; FIG. 3 is a schematic diagram of a parallel translation flow provided in an embodiment of the disclosure; FIG. 4 is a schematic diagram of a serial translation flow provided in an embodiment of the disclosure; fig. 5 is a schematic diagram of a first target information display method according to an embodiment of the disclosure; Fig. 6 is a schematic diagram of a second method for displaying target information according to an embodiment of the disclosure; fig. 7 is a schematic diagram of an electronic device according to an embodiment of the disclosure. Detailed Description In order to better understand the technical solutions of the present disclosure, the following describes in detail a communication-aware data processing method and a computer-readable storage medium provided by embodiments of the present disclosure with reference to the accompanying drawings. The present disclosure will be described more fully hereinafter with reference to the accompanying drawings, but the embodiments shown may be embodied in different forms and should not be construed as limited to the embodiments set forth below. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art. The accompanying drawings, which are included to provide a further understanding of embodiments of the disclosure and are incorporated in and constitute a part of this specification, illustrate the disclosure and together with the detailed embodiment, do not limit the disclosure. The above and other features and advantages will become more readily apparent to those skilled in the art from the description of the detailed embodiments with reference to the accompanying drawings. The present disc