Search

CN-122027518-A - Network monitoring method, device, electronic equipment, storage medium and program product

CN122027518ACN 122027518 ACN122027518 ACN 122027518ACN-122027518-A

Abstract

Some cases relate to the technical field of Internet and disclose a network monitoring method, a device, electronic equipment, a storage medium and a program product. The method comprises the steps of obtaining network traffic between a client and a cloud end, detecting a bidirectional reachable state between the client and the cloud end based on the flow direction of the network traffic, monitoring a first network path between the client and the cloud end if the bidirectional reachable state represents that the client and the cloud end are not bidirectional reachable, switching the network traffic from the first network path to a second network path, wherein the second network path is a standby path corresponding to the first network path, and switching the network traffic back to the first network path if the network failure event is repaired. Therefore, comprehensive monitoring of the accessibility of the fine-grained bidirectional path between the cloud and the client is realized, repair detection aiming at network fault events is realized, and rapid recovery of the path is ensured.

Inventors

  • DONG YONGBIN
  • GUO SHIXIAN
  • LI HANG
  • FU WEIYI
  • DUAN YANDONG
  • ZHANG CHAOYANG
  • CHEN ANJIAN
  • CHEN YUAN
  • BAI YANGYANG
  • ZHANG QI
  • LIU SONGLIN
  • LV YANG
  • HU JIANWEI
  • LI GEN
  • XU SHUAI

Assignees

  • 北京字跳网络技术有限公司

Dates

Publication Date
20260512
Application Date
20260211

Claims (15)

  1. 1. A network monitoring method, comprising: acquiring network traffic between a client and a cloud; detecting a bidirectional reachable state between the client and the cloud based on the flow direction of the network traffic; monitoring a first network path between the client and the cloud end in response to the bidirectional reachability state to represent non-bidirectional reachability between the client and the cloud end, wherein the first network path is a path for generating a network failure event; Switching the network traffic from the first network path to a second network path, wherein the second network path is a standby path corresponding to the first network path; switching the network traffic back to the first network path in response to the network failure event being repaired.
  2. 2. The method of claim 1, the detecting a bidirectional reachability status between the client and the cloud based on the flow direction of the network traffic, comprising: acquiring a content distribution network node between the client and the cloud; Based on the flow direction of the network traffic, initiating an incoming detection request to the cloud by using the content distribution network node to obtain an incoming path reachable state corresponding to the network traffic; Detecting the content distribution network node in use by the client based on the flow direction of the network flow, and obtaining an outbound path reachable state corresponding to the network flow; Wherein the bidirectional reachable states include the inbound path reachable state and the outbound path reachable state.
  3. 3. The method of claim 2, wherein the initiating, based on the flow direction of the network traffic, an incoming probe request to the cloud end by using the content distribution network node, to obtain an incoming path reachable state corresponding to the network traffic, includes: Acquiring a plurality of subnets corresponding to the cloud and first network addresses corresponding to the subnets; transmitting a probe request to each of the first network addresses using the content delivery network node; Acquiring an Internet gateway through which the network flow passes; Intercepting the detection request by using the Internet gateway, generating a detection response corresponding to the detection request, and returning the detection response to the content distribution network node; the inbound path reachable state is determined based on the probe request and the probe response.
  4. 4. The method of claim 3, wherein intercepting the probe request with the internet gateway, and generating the probe response corresponding to the probe request, comprises: responding to the Internet gateway to successfully intercept the detection request, modifying the data packet header of the detection request, and changing a request data packet corresponding to the detection request into a first response data packet; and exchanging the source address and the target address in the first response data packet to obtain a second response data packet, and representing the detection response by the second response data packet.
  5. 5. The method of claim 2, wherein detecting the content distribution network node being used by the client based on the flow direction of the network traffic, and obtaining the outbound path reachable state corresponding to the network traffic, comprises: Based on the flow direction of the network traffic, all candidate content distribution network nodes corresponding to the client are periodically obtained; Obtaining at least one target content distribution network node corresponding to the client based on the active state of each candidate content distribution network node, wherein the target content distribution network node meets a preset active condition; And obtaining the outbound path reachable state based on the flow reachability corresponding to each target content distribution network node.
  6. 6. The method of claim 5, obtaining at least one target content distribution network node corresponding to the client based on the active status of each of the candidate content distribution network nodes, comprising: Obtaining an active content distribution network node set based on the active state of each candidate content distribution network node; based on a predictive barreling strategy, barreling the active content distribution network node set to obtain a plurality of content distribution network node barrels; and uniformly selecting at least one target content distribution network node from each content distribution network node barrel.
  7. 7. The method of claim 1, monitoring a first network path between the client and the cloud, comprising: acquiring a fault detection data packet for the first network path; Directing the transmission of the fault detection data packet in the first network path based on a multi-stage path control strategy; And monitoring transmission indexes of the fault detection data packet in the transmission process of the first network path.
  8. 8. The method of claim 7, the directing the transmission of the failure detection data packet in the first network path based on the multi-stage path control policy, comprising: based on a first-stage path control strategy, guiding the fault detection data packet from a data center of the cloud to a target outlet; Directing the fault detection data packet from the target egress to a designated internet service access link based on a second stage path control policy, the internet service access link determined based on a segment route; The failure detection data packet is directed from the internet service access link to the designated client based on a third stage path control policy.
  9. 9. The method of claim 8, further comprising: Copying second network traffic of the network link where each target exit is located; Based on a load balancing mechanism, the second network traffic is sent to a preset analysis cluster, so that the preset analysis cluster identifies tenant-level indexes in the second network traffic; and positioning a network fault event of the tenant level based on the tenant level index.
  10. 10. The method of claim 9, the locating a tenant-level network failure event based on the tenant-level metrics, comprising: comparing the real-time tenant-level index with a historical reference index to obtain an index comparison result; And positioning a first abnormal position corresponding to the network fault event according to the index comparison result, and adding the association path of the network fault event to a detection list corresponding to the bidirectional reachable state.
  11. 11. The method of claim 9, the locating a tenant-level network failure event based on the tenant-level metrics, comprising: responding to the network fault event as tenant service interruption, and inquiring a network detection snapshot corresponding to the tenant level index; and positioning a second abnormal position corresponding to the network fault event based on the network detection snapshot.
  12. 12. A network monitoring device, the device comprising: the acquisition module is used for acquiring network traffic between the client and the cloud; the state detection module is used for detecting the bidirectional reachable state between the client and the cloud based on the flow direction of the network traffic; the path monitoring module is used for responding to the bidirectional reachable state to represent the non-bidirectional reachable state between the client and the cloud, and monitoring a first network path between the client and the cloud, wherein the first network path is a path for generating a network fault event; a first switching module, configured to switch the network traffic from the first network path to a second network path, where the second network path is a standby path corresponding to the first network path; And a second switching module configured to switch the network traffic back to the first network path in response to the failure event being repaired.
  13. 13. An electronic device, comprising: A memory and a processor in communication with each other, the memory having stored therein computer instructions, the processor executing the computer instructions to perform the network monitoring method of any of claims 1 to 11.
  14. 14. A computer-readable storage medium having stored thereon computer instructions for causing a computer to perform the network monitoring method of any one of claims 1 to 11.
  15. 15. A computer program product comprising computer instructions for causing a computer to perform the network monitoring method of any one of claims 1 to 11.

Description

Network monitoring method, device, electronic equipment, storage medium and program product Technical Field Some cases relate to the technical field of internet, and in particular relate to a network monitoring method, a device, an electronic device, a storage medium and a program product. Background The complexity, uncontrollability and dynamic changes of the internet make it a major source of network failures, which poses serious challenges to the stability of cloud services. Because cloud manufacturers cannot directly control the internal infrastructure of the Internet and lack visibility of the running state of the cloud manufacturers, the Internet is effectively monitored, faults are timely found and diagnosed, and the cloud manufacturers are vital to guaranteeing service level agreements and user experience. At present, monitoring for the Internet is mainly realized based on a public Internet monitoring system of the cloud, but the mode cannot cover the two-way path accessibility between the cloud and the client, mainly focuses on fault detection, and lacks the detection capability for fault repair. Disclosure of Invention In view of the above, a network monitoring method, apparatus, electronic device, storage medium and program product are provided to solve the problem that it is difficult to cover the bidirectional path reachability detection and the failure repair detection. The first aspect provides a network monitoring method, which comprises the steps of obtaining network traffic between a client and a cloud end, detecting a bidirectional reachable state between the client and the cloud end based on the flow direction of the network traffic, monitoring a first network path between the client and the cloud end if the bidirectional reachable state represents that the client and the cloud end are not bidirectional reachable, switching the network traffic from the first network path to a second network path, wherein the second network path is a standby path corresponding to the first network path, and switching the network traffic back to the first network path if the network failure event is repaired. In a second aspect, a network monitoring device is provided, which comprises an acquisition module, a state detection module, a path monitoring module, a first switching module and a second switching module, wherein the acquisition module is used for acquiring network traffic between a client and a cloud end, the state detection module is used for detecting a bidirectional reachable state between the client and the cloud end based on the flow direction of the network traffic, the path monitoring module is used for responding to the bidirectional reachable state to represent non-bidirectional reachable state between the client and the cloud end, then a first network path between the client and the cloud end is monitored, the first network path is a path for generating a network fault event, the first switching module is used for switching the network traffic from the first network path to the second network path, the second network path is a standby path corresponding to the first network path, and the second switching module is used for responding to the fault event to be repaired and switching the network traffic back to the first network path. In a third aspect, an electronic device is provided, including a memory and a processor, where the memory and the processor are communicatively connected to each other, and the memory stores computer instructions, and the processor executes the computer instructions, thereby executing the network monitoring method according to the first aspect or any implementation manner corresponding to the first aspect. In a fourth aspect, there is provided a computer readable storage medium having stored thereon computer instructions for causing a computer to perform the network monitoring method of the first aspect or any of its corresponding embodiments. In a fifth aspect, a computer program product is provided, comprising computer instructions for causing a computer to perform the network monitoring method of the first aspect or any of its corresponding embodiments. According to the network monitoring method, the device, the electronic equipment, the storage medium and the program product, the network flow between the client and the cloud is obtained, the bidirectional reachable state between the client and the cloud is detected according to the flow direction of the network flow, and the fine-granularity bidirectional path reachability between the cloud and the client is comprehensively monitored. When the bidirectional reachable state represents that the communication service between the cloud end and the client end is not bidirectional reachable, the fact that the path reachable is influenced by the generation of the network fault event can be determined, and at the moment, the network traffic is switched from the first network path to the second network path, so tha