US-20260124918-A1 - Voice Interaction Method, Device, and Storage Medium

Abstract

A voice interaction method includes, when a first voice command of a first user is detected, determining a screen corresponding to the first user, for example, a first screen, and then determining the first screen as a primary voice screen. The voice interaction method further includes displaying a first voice interaction interface through the first screen. While a vehicle-mounted device is receiving the first voice command, when a second voice command issued by a second user is detected and a screen corresponding to the second user is determined to be another screen, for example, a second screen, the method further includes determining the second screen as a secondary voice screen, displaying a second voice interaction interface through the second screen, and displaying, through the first screen, an interaction identifier indicating that there is another interacting user.

Inventors

  • JIE GENG
  • Ping Xu
  • Wei Zhao
  • Hongbin Jin
  • Sicong Sun

Assignees

  • HUAWEI TECHNOLOGIES CO., LTD.

Dates

Publication Date: 2026-05-07
Application Date: 2025-12-30
Priority Date: 2023-08-22

Claims (20)

  1. A method, comprising: detecting a first voice command of a first user; displaying, through a primary screen, a first voice interaction interface corresponding to the first voice command, wherein the primary screen corresponds to a first location of the first user in a vehicle; detecting a second voice command of a second user; displaying, through a secondary screen, a second voice interaction interface corresponding to the second voice command, wherein the secondary screen corresponds to a second location of the second user in the vehicle; and displaying a first interaction identifier on the first voice interaction interface, wherein the first interaction identifier indicates that the second user is currently interacting with the secondary screen.
  2. The method of claim 1, wherein the first interaction identifier comprises at least one of an interaction icon, a quantity of interacting persons, or an interaction location.
  3. The method of claim 1, wherein the first voice interaction interface comprises first interaction content corresponding to the first voice command, and wherein the second voice interaction interface comprises second interaction content corresponding to the second voice command.
  4. The method of claim 1, wherein detecting the second voice command comprises detecting the second voice command while receiving the first voice command.
  5. The method of claim 1, further comprising: detecting a third voice command of a third user while receiving the first voice command, wherein the third user is at a third location in the vehicle; and displaying, in the first voice interaction interface, interaction content corresponding to the third voice command in response to the primary screen further corresponding to the third location.
  6. The method of claim 1, further comprising: detecting a third voice command of a third user while receiving the second voice command, wherein the third user is at a third location in the vehicle; and displaying, in the second voice interaction interface, a second interaction identifier indicating a quantity of persons interacting with the secondary screen.
  7. The method of claim 6, further comprising further displaying, in the second voice interaction interface, interaction content corresponding to the third voice command.
  8. The method of claim 1, further comprising displaying, in the second voice interaction interface, a second interaction identifier indicating the first user is currently interacting with the primary screen.
  9. The method of claim 1, further comprising: determining the first user is at the first location based on a first sound source location of the first voice command; determining the second user is at the second location based on a second sound source location of the second voice command; determining the primary screen corresponds to the first location based on a first preset relationship between the first location and the primary screen; and determining the secondary screen corresponds to the second location based on a second preset relationship between the second location and the secondary screen.
  10. The method of claim 1, further comprising determining a third screen corresponds to the first location, wherein displaying, through the primary screen, the first voice interaction interface is based on the third screen being in an exception state and based on a preset screen replacement rule specifying that the primary screen is a replacement for the third screen.
  11. The method of claim 1, wherein first command processing for the first voice command takes precedence over second command processing for the second voice command, and wherein the first and second command processing each comprise at least one of broadcast processing, response processing, or execution processing.
  12. An electronic device, comprising: a memory configured to store program code; and one or more processors coupled to the memory and configured to execute the program code to cause the electronic device to: detect a first voice command of a first user; display, through a primary screen, a first voice interaction interface corresponding to the first voice command, wherein the primary screen corresponds to a first location of the first user in a vehicle; detect a second voice command of a second user; display, through a secondary screen, a second voice interaction interface corresponding to the second voice command, wherein the secondary screen corresponds to a second location of the second user in the vehicle; and display a first interaction identifier on the first voice interaction interface, wherein the first interaction identifier indicates that the second user is currently interacting with the secondary screen.
  13. The electronic device of claim 12, wherein the first interaction identifier comprises at least one of an interaction icon, a quantity of interacting persons, or an interaction location.
  14. The electronic device of claim 12, wherein the first voice interaction interface comprises first interaction content corresponding to the first voice command, and wherein the second voice interaction interface comprises second interaction content corresponding to the second voice command.
  15. The electronic device of claim 12, wherein the one or more processors are further configured to execute the program code to further cause the electronic device to detect the second voice command by detecting the second voice command while receiving the first voice command.
  16. The electronic device of claim 12, wherein the one or more processors are further configured to execute the program code to further cause the electronic device to: detect a third voice command of a third user while receiving the first voice command, wherein the third user is at a third location in the vehicle; and display, in the first voice interaction interface, interaction content corresponding to the third voice command in response to the primary screen further corresponding to the third location.
  17. The electronic device of claim 12, wherein the one or more processors are further configured to execute the program code to further cause the electronic device to: detect a third voice command of a third user while receiving the second voice command, wherein the third user is at a third location in the vehicle; and display, in the second voice interaction interface, a second interaction identifier indicating a quantity of persons interacting with the secondary screen.
  18. The electronic device of claim 17, wherein the one or more processors are further configured to execute the program code to further cause the electronic device to further display, in the second voice interaction interface, interaction content corresponding to the third voice command.
  19. The electronic device of claim 12, wherein the one or more processors are further configured to execute the program code to further cause the electronic device to display, in the second voice interaction interface, a second interaction identifier indicating the first user is currently interacting with the primary screen.
  20. A computer program product comprising a computer program that, when executed by one or more processors, causes an electronic device to: detect a first voice command of a first user; display, through a primary screen, a first voice interaction interface corresponding to the first voice command, wherein the primary screen corresponds to a first location of the first user in a vehicle; detect a second voice command of a second user; display, through a secondary screen, a second voice interaction interface corresponding to the second voice command, wherein the secondary screen corresponds to a second location of the second user in the vehicle; and display a first interaction identifier on the first voice interaction interface, wherein the first interaction identifier indicates that the second user is currently interacting with the secondary screen.

Description

CROSS-REFERENCE TO RELATED APPLICATIONS

This is a continuation of International Patent Application No. PCT/CN2024/109799, filed on Aug. 5, 2024, which claims priority to Chinese Patent Application No. 202311065233.5, filed on Aug. 22, 2023, which are both incorporated by reference.

TECHNICAL FIELD

This specification relates to the field of voice interaction technologies, and in particular, to a voice interaction method, a device, and a storage medium.

BACKGROUND

Voice assistants may be used in in-vehicle intelligent cockpits, and may implement a plurality of voice interaction functions such as voice conversation, voice wakeup, and voice navigation. A user may perform human-machine interaction through a primary screen at a driver seat. With development of vehicles, in-vehicle systems may develop from a single primary screen disposed at a driver seat to a plurality of screens. For example, in a five-seater vehicle, in addition to a primary screen disposed at a driver seat, screens are also disposed in front of a front passenger seat and a rear-row seat. This provides a hardware configuration for users to perform voice interaction through the plurality of screens in the in-vehicle system. However, how to design a manner of voice interaction between the users and the plurality of screens in the in-vehicle system to meet a user requirement and improve user experience is still an urgent problem to be resolved currently.

SUMMARY

To resolve the foregoing problem, this specification provides a voice interaction method, a device, and a storage medium. According to a first aspect, this specification provides a voice interaction method.
The method includes: detecting a first voice command of a first user; determining a screen corresponding to a first location of the first user in a vehicle as a first screen, determining the first screen as a primary voice screen, and displaying, through the first screen, a first voice interaction interface corresponding to the first voice command; detecting a second voice command issued by a second user; and determining a screen corresponding to a second location of the second user in the vehicle as a second screen, determining the second screen as a secondary voice screen, displaying, through the second screen, a second voice interaction interface corresponding to the second voice command, and displaying a first interaction identifier on the first voice interaction interface, where the first interaction identifier indicates that the second user is currently interacting with the second screen.
In this specification, the foregoing method may be applied to a vehicle-mounted device. The vehicle-mounted device may be a vehicle or an intelligent vehicle, or may be an electronic device loaded on a vehicle or an intelligent vehicle. The first voice command may be a wakeup word, for example, "Celia", or may be a specific voice command, for example, "open the vehicle window". The first user may be a first speaking user mentioned below, and the second user may be a second speaking user mentioned below.
The first location may be the seat of the first user in the vehicle, as determined by the vehicle-mounted device based on a sound source location of the first voice command. The first screen may be the screen corresponding to the first location, as determined by the vehicle-mounted device based on a preset relationship between a seat and a screen. The first voice interaction interface may be a voice assistant interface.
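The flow described above can be sketched in a few lines of Python. This is an illustration under stated assumptions only, not the implementation of this specification: the seat names, the screen names, the SEAT_TO_SCREEN table, and the Screen and VoiceSession classes are all hypothetical, and sound source localization is assumed to have already resolved each command to a seat.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple

# Hypothetical preset relationship between seats and screens.
# The specification only states that such a relationship exists.
SEAT_TO_SCREEN: Dict[str, str] = {
    "driver": "center_screen",
    "front_passenger": "passenger_screen",
    "rear_left": "rear_screen",
    "rear_right": "rear_screen",
}


@dataclass
class Screen:
    name: str
    role: Optional[str] = None  # "primary", "secondary", or None when idle
    contents: List[Tuple[str, str]] = field(default_factory=list)  # (seat, command text)
    identifier: Optional[dict] = None  # interaction identifier, if any


class VoiceSession:
    """Tracks which screen is the primary voice screen for one wake-up session."""

    def __init__(self, seat_to_screen: Dict[str, str]) -> None:
        self.seat_to_screen = seat_to_screen
        self.screens = {n: Screen(n) for n in set(seat_to_screen.values())}
        self.primary: Optional[Screen] = None

    def on_voice_command(self, seat: str, text: str) -> Screen:
        # Resolve the speaker's seat to its screen via the preset relationship.
        screen = self.screens[self.seat_to_screen[seat]]
        if self.primary is None:
            # The first speaker's screen becomes the primary voice screen.
            self.primary = screen
            screen.role = "primary"
        elif screen is not self.primary:
            # Any other screen that receives a command is a secondary voice screen.
            screen.role = "secondary"
        screen.contents.append((seat, text))
        self._refresh_identifier()
        return screen

    def _refresh_identifier(self) -> None:
        # Collect the seats currently interacting with secondary screens.
        seats = sorted(
            {
                seat
                for s in self.screens.values()
                if s.role == "secondary"
                for seat, _ in s.contents
            }
        )
        if self.primary is not None and seats:
            # Show on the primary screen how many other users interact, and where.
            self.primary.identifier = {"count": len(seats), "locations": seats}
```

For example, if a user at the hypothetical "driver" seat speaks first and a user at "rear_left" speaks second, the center screen takes the "primary" role, the rear screen takes the "secondary" role, and the center screen carries an identifier with a count of 1 and the location "rear_left". The identifier fields chosen here (a quantity and a location) are one possible combination; an icon-only identifier would fit the description equally well.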
It may be understood that, in this specification, when a user issues a voice interaction command, the vehicle-mounted device may detect, based on a sound source location, a seat of each user issuing the voice command in the vehicle, activate a corresponding bearer screen based on the distribution relationship between the screen and the seat in the vehicle, and display voice interaction content on the bearer screen.
In some embodiments, the first user may be a user who first issues a voice command in a wake-up process, for example, a user who first wakes up a voice assistant of a vehicle. In this case, a corresponding first screen activated by the first user may be determined as a primary voice screen, another screen is used as a secondary voice screen, and a first voice interaction interface may be displayed on the primary voice screen. If it is detected that there is a second user issuing a voice interaction command at the same time, a screen corresponding to the second user is a second screen, that is, a secondary voice screen, and a second voice interaction interface may be displayed on the second screen.
It may be understood that, in this specification, after a user activates a corresponding bearer screen, a voice interaction interface may be displayed, and voice command content of the corresponding user may be displayed on the voice interaction interface. Voice command content of a user interacting with the primary voice screen and an interaction identifier