CN-122027454-A - Disaster recovery management method and device, storage medium and electronic equipment

CN122027454ACN 122027454 ACN122027454 ACN 122027454ACN-122027454-A

Abstract

The application discloses a disaster recovery management method, a device, a storage medium and electronic equipment, which relate to the technical field of cloud computing and distributed storage and comprise the steps of creating an extended storage bucket resource in a target cluster, configuring a main storage bucket corresponding to a main station point contained in the target cluster and a secondary storage bucket corresponding to a secondary station point contained in the target cluster in the extended storage bucket resource, and relating the cross-site storage buckets (the main storage bucket and the secondary storage bucket) through the extended storage bucket resource. The application can solve the problem that disaster recovery management between the cross-site storage buckets can not be carried out in the related technology.

Inventors

LI FENXIANG
LIU SHILUN
ZHANG ZHONGWEN
ZHANG DASHUAI

Assignees

济南浪潮数据技术有限公司

Dates

Publication Date: 20260512
Application Date: 20260130

Claims (10)

1. A disaster recovery management method, comprising: creating an extended storage bucket resource in a target cluster, and configuring a main storage bucket, a secondary storage bucket and a topological relation between the main storage bucket and the secondary storage bucket in the extended storage bucket resource, wherein the main storage bucket is a storage bucket corresponding to a main station point contained in the target cluster, and the secondary storage bucket is a storage bucket corresponding to a secondary station point contained in the target cluster; And monitoring the state of a target storage bucket in the target cluster according to the extended storage bucket resources, and carrying out disaster recovery management on the target storage bucket according to a monitoring result.
2. The disaster recovery management method of claim 1, wherein monitoring a state of a target bucket in the target cluster according to the extended bucket resources comprises: Performing N-layer state monitoring on the target storage bucket according to the extended storage bucket resource, wherein N is a positive integer, the N-layer state monitoring comprises first-layer state monitoring, second-layer state monitoring and third-layer state monitoring, wherein the first-layer state monitoring is used for monitoring the service state of storage service corresponding to the target storage bucket, the second-layer state monitoring is used for monitoring the read-write permission of the target storage bucket, and the third-layer state monitoring is used for monitoring the data consistency between the main storage bucket and the auxiliary storage bucket; Determining that the target storage barrels are in a health state under the condition that the target storage barrels are determined to pass through the first layer state monitoring, the second layer state monitoring and the third layer state monitoring; And determining that the target storage bucket is in a fault state under the condition that the target storage bucket is not monitored by any one of the N-layer state monitoring.
3. The disaster recovery management method of claim 2, wherein performing N-layer monitoring on the target bucket according to the extended bucket resources comprises: invoking a target interface corresponding to the storage service, and determining a network access point in the storage service according to the extended storage bucket resources; sending a first request to the network access point according to the target interface, and determining whether first reply information sent by the storage service based on the first request is received in a first time period, wherein the first request is used for detecting the service state of the storage service; determining that the target storage bucket is monitored through the first layer state under the condition that the first reply message is received in the first time period; And under the condition that the first reply message is not received in the first time period, determining that the target storage bucket is not monitored by the first layer.
4. The disaster recovery management method of claim 2, wherein the N-level monitoring of the target bucket is performed according to the extended bucket resources, further comprising: Under the condition that the target storage bucket is monitored through the first layer state, determining the main storage bucket and the auxiliary storage bucket according to the extended storage bucket resources, and writing test files into the main storage bucket and the auxiliary storage bucket respectively to determine a first writing state of the test files in the main storage bucket and a second writing state of the test files in the auxiliary storage bucket; Determining that the target storage bucket is not monitored by the second layer state under the condition that the first writing state and/or the second writing state are determined to be writing failure; under the condition that the first writing state and the second writing state are both writing success, respectively sending a second request to the main storage barrel and the auxiliary storage barrel, determining whether second reply information sent by the main storage barrel based on the second request is received in a second time period, and determining whether third reply information sent by the auxiliary storage barrel based on the second request is received in the second time period, wherein the second request is used for reading the test file; determining that the target bucket is not monitored by the second layer state if the second reply request and/or the third reply request is not received within the second time period; In the event that it is determined that the second reply request and the third reply request have been received for the second period of time, determining that the target bucket is monitored by the second tier.
5. The disaster recovery management method of claim 2, wherein the N-level monitoring of the target bucket is performed according to the extended bucket resources, further comprising: Under the condition that the target storage bucket passes through the first layer monitoring and the second layer monitoring, acquiring first vector metadata corresponding to a target data object in the main storage bucket according to the extended storage bucket resource, and acquiring second vector metadata corresponding to the target data object in the auxiliary storage bucket according to the extended storage bucket resource, wherein the first vector metadata is used for indicating version information of the target data object recorded in the main storage bucket, and the second vector metadata is used for indicating version information of the target data object recorded in the auxiliary storage bucket; Determining a first version time corresponding to a first version of the target data object recorded in the main storage bucket according to the first vector metadata, and determining a second version time corresponding to a second version of the target data object recorded in the auxiliary storage bucket according to the second vector metadata; determining whether the first version time and the second version time are consistent; determining that the target storage bucket is monitored through the third layer state under the condition that the first version time is consistent with the second version time; And under the condition that the first version time is inconsistent with the second version time, determining that the target storage bucket is not monitored by the third layer state.
6. The disaster recovery management method according to claim 5, wherein in case it is determined that the first version time is inconsistent with the second version time, the method further comprises: Determining a context relationship between the first version time and the second version time; Determining incremental data in the main storage barrel and synchronizing the incremental data into the auxiliary storage barrel under the condition that the context indicates that the first version time is later than the second version time, wherein the incremental data is used for indicating data which is recorded in the main storage barrel and is not synchronized into the auxiliary storage barrel; And marking the auxiliary storage bucket through a target mark under the condition that the front-back relation indicates that the first version time is earlier than the second version time, wherein the target mark is used for indicating that the data recorded in the auxiliary storage bucket is inconsistent with the data recorded in the main storage bucket.
7. The disaster recovery management method according to claim 1, wherein performing disaster recovery management on the target storage bucket according to a monitoring result comprises: Under the condition that the main storage barrel is in a fault state according to the monitoring result, acquiring a performance index of each auxiliary storage barrel in the target cluster, and determining the priority of each auxiliary storage barrel according to the performance index, wherein the performance index at least comprises one of the following steps of the distance between each auxiliary station corresponding to each auxiliary storage barrel and the main station, the network bandwidth of each auxiliary station and the protocol level corresponding to each auxiliary station; And determining a target auxiliary storage bucket with the highest priority among the auxiliary storage buckets, and redirecting the application flow corresponding to the main storage bucket into the target auxiliary storage bucket so as to switch the main storage bucket into the target auxiliary storage bucket.
8. A disaster recovery management device, comprising: The system comprises a creation module, a storage management module and a storage management module, wherein the creation module is used for creating an extended storage bucket resource in a target cluster, and configuring a main storage bucket, a secondary storage bucket and a topological relation between the main storage bucket and the secondary storage bucket in the extended storage bucket resource, wherein the main storage bucket is a storage bucket corresponding to a main station point contained in the target cluster, and the secondary storage bucket is a storage bucket corresponding to a secondary station point contained in the target cluster; And the monitoring module is used for monitoring the state of the target storage bucket in the target cluster according to the extended storage bucket resources and carrying out disaster recovery management on the target storage bucket according to the monitoring result.
9. An electronic device, comprising: A memory for storing a computer program; A processor for implementing the steps of the disaster recovery management method as claimed in any one of claims 1 to 7 when executing said computer program.
10. A computer readable storage medium, wherein a computer program is stored in the computer readable storage medium, wherein the computer program, when executed by a processor, implements the steps of the disaster recovery management method according to any one of claims 1 to 7.

Description

Disaster recovery management method and device, storage medium and electronic equipment Technical Field The present application relates to the field of cloud computing and distributed storage technologies, and in particular, to a disaster recovery management method, a disaster recovery management device, a storage medium, and an electronic device. Background With the popularity of cloud computing and microservice architecture, containerization technology has become a core means of scalable applications. However, the increasingly dependent storage systems of containerized applications, especially object storage services, are assuming critical functions of data persistence, log management, configuration storage, etc. in a container environment. In a distributed storage system, high availability and disaster recovery capability of data are key to ensuring service continuity. Conventional single site object storage schemes, when faced with natural disasters, hardware failures, or network outages, present a single point of failure risk that may lead to outage, data loss, or unrecoverable performance degradation. The container object storage interface (Cloud Object Storage Interface, abbreviated as COSI) is used as a container object storage interface standard in the open source container orchestration platform (Kubernetes, abbreviated as K8 s) ecology, and abstract management of storage resources is realized by defining uniform custom resource definitions (Custom Resource Definition, abbreviated as CRD, for example, a bucket class (BucketClass) and the like). However, COSI interface capabilities have the limitation that standard container object store interfaces only provide single bucket base operations, lacking disaster recovery management capabilities across site buckets. Therefore, the related art has a problem that disaster recovery management between the cross-site buckets cannot be performed. Aiming at the problem that disaster recovery management between cross-site storage buckets cannot be carried out in the related art, no effective solution has been proposed yet. Accordingly, there is a need for improvements in the related art to overcome the drawbacks of the related art. Disclosure of Invention The application provides a disaster recovery management method, a disaster recovery management device, a storage medium and electronic equipment, which at least solve the problem that disaster recovery management between cross-site storage barrels cannot be carried out in the related technology. The application provides a disaster recovery management method, which comprises the steps of creating an extended storage bucket resource in a target cluster, configuring a main storage bucket, a secondary storage bucket and a topological relation between the main storage bucket and the secondary storage bucket in the extended storage bucket resource, wherein the main storage bucket is a storage bucket corresponding to a main station point contained in the target cluster, the secondary storage bucket is a storage bucket corresponding to a secondary station point contained in the target cluster, monitoring the state of the target storage bucket in the target cluster according to the extended storage bucket resource, and carrying out disaster recovery management on the target storage bucket according to a monitoring result. The application also provides a disaster recovery management device, which comprises a creation module, a monitoring module and a disaster recovery management module, wherein the creation module is used for creating an extended storage bucket resource in a target cluster, configuring a main storage bucket, a secondary storage bucket and a topological relation between the main storage bucket and the secondary storage bucket in the extended storage bucket resource, wherein the main storage bucket is a storage bucket corresponding to a main station contained in the target cluster, the secondary storage bucket is a storage bucket corresponding to a secondary station contained in the target cluster, and the monitoring module is used for monitoring the state of the target storage bucket in the target cluster according to the extended storage bucket resource and carrying out disaster recovery management on the target storage bucket according to a monitoring result. The application also provides electronic equipment, which comprises a memory for storing a computer program and a processor for realizing the steps of any disaster recovery management method when executing the computer program. The application also provides a computer readable storage medium, wherein the computer readable storage medium stores a computer program, and the computer program realizes the steps of any disaster recovery management method when being executed by a processor. The application also provides a computer program product, which comprises a computer program, and the computer program realizes the steps of any disaster recovery man