Search

EP-4395305-B1 - COMMUNICATION METHOD, APPARATUS AND SYSTEM

EP4395305B1EP 4395305 B1EP4395305 B1EP 4395305B1EP-4395305-B1

Inventors

  • FENG, Junhui
  • ZHUANG, NAIFENG
  • ZHANG, CHUNHE
  • WANG, KUN
  • XU, Changyue

Dates

Publication Date
20260506
Application Date
20230323

Claims (14)

  1. A communication method, wherein the method is performed by a first call terminal, and the method comprises: establishing (S601) a video call media transmission channel, wherein the video call media transmission channel is used to transmit a call video stream between the first call terminal and a second call terminal in a video call service, and the call video stream comprises video content photographed by the first call terminal or the second call terminal; receiving (S603) a session initiation protocol SIP message from a media server, wherein the SIP message comprises a face recognition request identifier, and the face recognition request identifier is for requesting to perform face recognition on a user corresponding to the first call terminal; sending (S604) a response message of the SIP message to the media server, wherein the response message of the SIP message indicates that the user corresponding to the first call terminal agrees to perform face recognition; sending (S606) a face video stream to the media server through the video call media transmission channel, wherein the face video stream comprises a face image of the user corresponding to the first call terminal; and receiving (S608) a face recognition result from the media server.
  2. The method according to claim 1, wherein the face video stream is a video stream photographed by using a camera apparatus of the first call terminal; or the face video stream is a video stream obtained from a storage apparatus of the first call terminal.
  3. The method according to claim 2, wherein the method further comprises: receiving source indication information from the media server, wherein the source indication information indicates that the first call terminal obtains the face video stream by using the camera apparatus of the first call terminal or obtains the face video stream from the storage apparatus of the first call terminal.
  4. The method according to claim 3, wherein the face video stream is the video stream obtained from the storage apparatus of the first call terminal, and before the sending a face video stream to the media server through the video call media transmission channel, the method further comprises: receiving (S605) transmission channel indication information from the media server, wherein the transmission channel indication information indicates the first call terminal to transmit the face video stream through the video call media transmission channel.
  5. A communication method, wherein the method is performed by a second call terminal, and the method comprises: establishing (S601) a video call media transmission channel, wherein the video call media transmission channel is used to transmit a call video stream between the second call terminal and a first call terminal in a video call service, and the call video stream comprises video content photographed by the second call terminal or the first call terminal; sending (S602) a face recognition application to a media server, wherein the face recognition application comprises a face recognition application identifier, and the face recognition application identifier is used to apply for performing face recognition on a user corresponding to the first call terminal communicating with the second call terminal; receiving (S610) a face recognition result from the media server through the video call media transmission channel, wherein the face recognition result is a result of performing, based on a face video stream, face recognition on the user corresponding to the first call terminal; and processing (S611) a service request of the first call terminal based on the face recognition result.
  6. The method according to claim 5, wherein before the receiving a face recognition result from the media server through the video call media transmission channel, the method further comprises: stopping (S613) transmitting the call video stream through the video call media transmission channel.
  7. A communication method, wherein the method is performed by a media server, and the method comprises: establishing (S601) a first video call media transmission channel and a second video call media transmission channel, wherein the first video call media transmission channel is a video call media transmission channel between a first call terminal and the media server, and the second video call media transmission channel is a video call media transmission channel between the media server and a second call terminal; and the first video call media transmission channel and the second video call media transmission channel are used to transmit a call video stream between the first call terminal and the second call terminal in a video call service, and the call video stream comprises video content photographed by the first call terminal or the second call terminal; receiving (S602) a face recognition application from the second call terminal, wherein the face recognition application comprises a face recognition application identifier, and the face recognition application identifier is used to apply for performing face recognition on a user corresponding to the first call terminal communicating with the second call terminal; sending (S603) a session initiation protocol SIP message to the first call terminal, wherein the SIP message comprises a face recognition request identifier, and the face recognition request identifier is for requesting to perform face recognition on a user corresponding to the first call terminal; receiving (S604) a response message of the SIP message to the media server, wherein the response message of the SIP message indicates that the user corresponding to the first call terminal agrees to perform face recognition; receiving (S606) a face video stream from the first call terminal through the first video call media transmission channel, wherein the face video stream comprises a face image of the user corresponding to the first call terminal; obtaining (S607) a face recognition result, wherein the face recognition result is a result of performing, based on the face video stream, face recognition on the user corresponding to the first call terminal; sending (S608) the face recognition result to the first call terminal; and sending (S610) the face recognition result to the second call terminal through the second video call media transmission channel, to trigger the second call terminal to process (S611) a service request of the first call terminal based on the face recognition result.
  8. The method according to claim 7, wherein the face video stream is a video stream photographed by using a camera apparatus of the first call terminal; or the face video stream is a video stream obtained from a storage apparatus of the first call terminal.
  9. The method according to claim 8, wherein the method further comprises: sending source indication information to the first call terminal, wherein the source indication information indicates that the first call terminal obtains the face video stream by using the camera apparatus of the first call terminal or obtains the face video stream from the storage apparatus of the first call terminal.
  10. A call terminal, comprising a memory and at least one processor connected to the memory, wherein the memory is configured to store computer program code, the computer program code comprises computer instructions, and when the computer instructions are executed by the at least one processor, the call terminal is enabled to perform the method according to any one of claims 1 to 4, or 5-6.
  11. A media server, comprising a memory and at least one processor connected to the memory, wherein the memory is configured to store computer program code, the computer program code comprises computer instructions, and when the computer instructions are executed by the at least one processor, the media server is enabled to perform the method according to any one of claims 7 to 9.
  12. A computer-readable storage medium, comprising computer instructions, wherein when the computer instructions are run on a call terminal, the call terminal is enabled to perform the method according to any one of claims 1 to 4, or 5-6.
  13. A computer-readable storage medium, comprising computer instructions, wherein when the computer instructions are run on a server, the server is enabled to perform the method according to any one of claims 7 to 9.
  14. A communication system, comprising a first call terminal, a second call terminal, and a media server, wherein the first call terminal performs the method according to any one of claims 1 to 4, the second call terminal performs the method according to any one of claims 5 to 6, and the media server performs the method according to any one of claims 7 to 9.

Description

TECHNICAL FIELD Embodiments of this application relate to the field of communication technologies, and in particular, to a communication method, apparatus, and system. BACKGROUND Based on a carrier network, in a process in which a user makes a voice/video call with a customer service center by using a terminal, face recognition is performed on the user to implement online user identity authentication, so that a secure and convenient service can be provided for the user. Currently, when face recognition is performed on a user online, an application (application, APP) for face recognition needs to be installed on a terminal (referred to as a user terminal for short) held by the user. The APP is an APP that is specified by the customer service center and used to perform face recognition on the user. Next, the user terminal acquires a face image of the user and uploads the face image to the APP. The APP completes the face recognition, or the APP sends the face image to a recognition server and the recognition server then completes the face recognition. In the foregoing face recognition method, the APP needs to be installed on the user terminal and performs complex operations, and high skills are required for a staff. In addition, a transmission channel dedicated to transmitting the face image needs to be established between the user terminal and the APP. If face recognition is performed in a call process, additional bandwidth resources need to be occupied to establish the transmission channel dedicated to transmitting the face image, and additional port resources of the user terminal also need to be occupied to transmit the face image based on the transmission channel. US2020/342084A1 discloses a method for recognizing an identity in a video conference and related device. US2020/259845A1 discloses methods for providing access control and identity verification for communications. SUMMARY Embodiments of this application provide a communication method, apparatus, and system, to reduce bandwidth resources and port resources of a terminal that are occupied by performing face recognition in a call process. To achieve the foregoing objective, the following technical solutions are used in embodiments of this application. According to a first aspect, an embodiment of this application provides a communication method, where the method is performed by a call terminal, and the method includes: establishing a video call media transmission channel, where the video call media transmission channel is used to transmit a call video stream between the call terminal and a peer call terminal in a video call service, and the call video stream includes video content photographed by the call terminal or the peer call terminal; next receiving a SIP message from a media server, where the SIP message includes a face recognition request identifier, and the face recognition request identifier is for requesting to perform face recognition on a user corresponding to the call terminal; sending a response message of the SIP message to the media server, where the response message of the SIP message indicates that the user corresponding to the call terminal agrees to perform face recognition; further sending a face video stream to the media server through the video call media transmission channel, where the face video stream includes a face image of the user corresponding to the call terminal; and then receiving a face recognition result from the media server. In the communication method provided in this embodiment of this application, when face recognition needs to be performed in a call process, the call terminal may transmit the face video stream through the video call media transmission channel originally used to transmit the call video stream. In this way, no additional bandwidth resources need to be consumed to establish a transmission channel dedicated to transmitting the face video stream, and no additional port resources of the call terminal need to be occupied. Further, compared with an existing communication method, in the technical solution provided in this embodiment of this application, a face recognition APP does not need to be installed on the terminal. In this way, an operator does not need to perform complex related operations, high operation skills are not required for the operator, and a call does not need to be interrupted. In a possible implementation, the face video stream is a video stream photographed by using a camera apparatus of the call terminal. Alternatively, the face video stream is a video stream obtained from a storage apparatus of the call terminal. In a possible implementation, the communication method provided in this embodiment of this application further includes: receiving source indication information from the media server, where the source indication information indicates that the call terminal obtains the face video stream by using the camera apparatus of the call terminal or obtains the face video strea