CN-122018949-A - Interrupt-free management method and system for intelligent hot patch
Abstract
The invention provides an uninterrupted management method and system of an intelligent thermal patch in the technical field of communication system software maintenance, wherein the method comprises the steps of S1, a host computer receiving a patch operation request, S2, checking patch data carried by the patch operation request and then forwarding the patch data to a standby computer, S3, automatically executing the patch operation by the standby computer through a patch management module, controlling the patch operation by the patch management module through a state machine mechanism, ensuring no interruption of service in the patch operation process, S4, automatically triggering a rollback mechanism when the patch operation fails, sending patch operation success feedback to the host computer when the patch operation is successful, S5, executing the main-standby switching operation with the standby computer based on the patch operation success feedback, and executing the patch operation of the host computer after the switching is completed. The invention has the advantages of realizing automatic execution, uninterrupted updating and reliable rollback of patch operation, thereby comprehensively improving the efficiency and the safety of patch management.
Inventors
- CHEN JINHUI
- CHEN YULONG
- CHEN HAO
Assignees
- 福建星网物联信息系统有限公司
Dates
- Publication Date
- 20260512
- Application Date
- 20251222
Claims (10)
- 1. The interrupt-free management method of the intelligent hot patch is characterized by comprising the following steps of: Step S1, a host receives a patch operation request carrying patch operation content and patch data through a communication sub-module, wherein the patch operation content is at least one of uploading, installing, activating, deactivating and rolling back; S2, after checking the patch data carried by the patch operation request, the host computer forwards the patch operation request to the slave computer; Step S3, the standby machine automatically executes patch operation through a patch management module based on the received patch operation request, wherein the patch management module controls the patch operation by adopting a state machine mechanism to ensure no interruption of service in the patch operation process, the state machine mechanism defines and maintains global states including an idle state, a preparation state, an application state, a verification state and a failure state, and triggers migration of all the global states based on the patch operation; Step S4, when the patch operation fails, the standby machine automatically triggers a rollback mechanism, and when the patch operation is successful, the standby machine sends patch operation success feedback to the host machine; And S5, the host machine executes the main-standby switching operation with the standby machine based on the received successful feedback of the patch operation, and then executes the patch operation of the host machine after the switching is completed, so that the consistency of the patch states between the host machine and the standby machine is ensured.
- 2. The method for uninterrupted administration of an intelligent thermal patch according to claim 1, wherein in step S1, said communication sub-module supports domain socket communication and TCP short connection communication: The domain nested word communication is used for interacting with a patch operation client of the same equipment, and a JSON message format is adopted; the TCP short connection communication is used for main and standby synchronization among different devices, and adopts a structure binary message format.
- 3. The method for uninterrupted managing an intelligent thermal patch according to claim 1, wherein in step S3, the step of automatically performing the patch operation by the patch management module comprises: loading system configuration parameters through a configuration management submodule to provide an operating environment for patch operation; The method comprises the steps of storing and inquiring patch information comprising patch numbers, patch types, dependency relationships and activation states through a database management submodule; The method comprises the steps of packaging specific patch operation through a patch operation submodule, and checking the dependency relationship, version compatibility and patch number of patch data through analyzing a standardized control file of the patch data before the operation, wherein the standardized control file comprises the patch number, an adaptation version, the dependency relationship, the patch type, an activation mode and producer information; The global state migration is driven by a state machine sub-module, and the patch operation is subjected to flow restriction; recording a patch operation process, state migration and abnormal events through a log sub-module; and performing JSON analysis, file verification and path analysis through the tool submodule.
- 4. The method for uninterrupted managing an intelligent hot patch according to claim 1, wherein in step S3, the state machine mechanism executes automatic recovery logic after the device is powered down and restarted, and the method specifically comprises the following steps: Reading the last global state stored in the database; If the global state is an idle state, directly recovering to a normal state; If the global state is a preparation state, an application state or a verification state, automatically triggering a rollback operation, cleaning incomplete patch operation, and recovering to a stable version before power failure; If the global state is a failure state, the failure identification is maintained.
- 5. The method for uninterrupted managing an intelligent hot patch according to claim 1, wherein in step S4, the rollback mechanism is specifically: rolling back to the latest stable version according to the patch information recorded by the patch management module; Recording a rollback process through a log submodule and providing error prompt and recovery options; And the rollback operation is triggered only once, and if the continuous rollback fails, the operation automatically returns to the no-supplement Ding Zhuangtai.
- 6. The interrupt-free management system of the intelligent hot patch is characterized by comprising the following modules: The system comprises a patch operation request receiving module, a communication submodule and a control module, wherein the patch operation request receiving module is used for receiving a patch operation request carrying patch operation content and patch data by a host, and the patch operation content is at least one of uploading, installing, activating, deactivating and rolling back; The patch operation request forwarding module is used for forwarding the patch operation request to the standby machine after the host checks the patch data carried by the patch operation request; the patch operation module is used for automatically executing patch operation by the standby machine through the patch management module based on the received patch operation request, wherein the patch management module controls the patch operation by adopting a state machine mechanism to ensure no interruption of service in the patch operation process; The operation feedback module is used for automatically triggering a rollback mechanism when the standby machine fails in patch operation, and sending patch operation success feedback to the host machine when the patch operation is successful; And the main and standby synchronization module is used for the host to execute main and standby switching operation with the standby machine based on the received successful feedback of the patch operation, and then execute the patch operation of the host after the switching is completed, so as to ensure the consistency of patch states between the host and the standby machine.
- 7. The system for intelligent hot-patch interrupt-free management as recited in claim 6, wherein in said patch operation request receiving module, said communication sub-module supports domain socket communications and TCP short connection communications: The domain nested word communication is used for interacting with a patch operation client of the same equipment, and a JSON message format is adopted; the TCP short connection communication is used for main and standby synchronization among different devices, and adopts a structure binary message format.
- 8. The system for uninterrupted managing an intelligent thermal patch of claim 6, wherein said patch management module automatically performs the patch operation comprises: loading system configuration parameters through a configuration management submodule to provide an operating environment for patch operation; The method comprises the steps of storing and inquiring patch information comprising patch numbers, patch types, dependency relationships and activation states through a database management submodule; The method comprises the steps of packaging specific patch operation through a patch operation submodule, and checking the dependency relationship, version compatibility and patch number of patch data through analyzing a standardized control file of the patch data before the operation, wherein the standardized control file comprises the patch number, an adaptation version, the dependency relationship, the patch type, an activation mode and producer information; The global state migration is driven by a state machine sub-module, and the patch operation is subjected to flow restriction; recording a patch operation process, state migration and abnormal events through a log sub-module; and performing JSON analysis, file verification and path analysis through the tool submodule.
- 9. The system for non-interrupt management of an intelligent thermal patch of claim 6 wherein, in said patch operation module, said state machine mechanism performs automatic recovery logic after a device power-down restart, comprising: Reading the last global state stored in the database; If the global state is an idle state, directly recovering to a normal state; If the global state is a preparation state, an application state or a verification state, automatically triggering a rollback operation, cleaning incomplete patch operation, and recovering to a stable version before power failure; If the global state is a failure state, the failure identification is maintained.
- 10. The system for managing intelligent thermal patches without interruption of claim 6 wherein said rollback mechanism is embodied in said operational feedback module as: rolling back to the latest stable version according to the patch information recorded by the patch management module; Recording a rollback process through a log submodule and providing error prompt and recovery options; And the rollback operation is triggered only once, and if the continuous rollback fails, the operation automatically returns to the no-supplement Ding Zhuangtai.
Description
Interrupt-free management method and system for intelligent hot patch Technical Field The invention relates to the technical field of communication system software maintenance, in particular to an interrupt-free management method and system of an intelligent hot patch. Background Under the background of increasingly higher requirements for high availability of communication systems, the updating and maintenance of software patches becomes a key for ensuring stable operation of equipment and system safety. However, the current common patch management techniques still have significant limitations, particularly in terms of implementing business-uninterrupted updates. At present, patch management modes in the industry mainly comprise the following three types, and the requirements of intelligent and interrupt-free operation are difficult to meet at the same time: (1) Whole package replacement type updating, namely, patch updating is realized through reinstallation or whole coverage of the software package. The method usually needs to stop and restart the party to be effective, resulting in longer service interruption time. In equipment with high availability requirements at the carrier level, such interruption can directly affect the user experience, and the applicable scene is limited. (2) Single patch superposition type updating, namely maintaining in a mode of sequentially superposing patches one by one, but lacking a unified version management and dependency relationship control mechanism. In actual operation, problems such as disordered patch application sequence, difficult rollback and the like are easy to occur, and instability of the system is often caused by patch conflict, such as circular dependence or version incompatibility which may occur when dependency relationship is not checked. (3) And the updating file is added with independent script type updating, and the operation and maintenance personnel are relied on to manually execute the scattered updating script, so that the operation process is easy to operate by mistake and difficult to unify. The method has low efficiency, equipment abnormality or service interruption is easy to cause by human error, and automatic and controllable updating guarantee is lacking. In summary, the main problems of the prior art can be summarized as follows: (1) The patch management mechanism is imperfect, the patch version management is irregular, the dependency relationship lacks automatic verification, and the installation failure or the system instability is easy to be caused. (2) And the operation recoverability is poor, when the patch is failed to install, the system cannot realize quick automatic rollback, and the recovery process depends on manual intervention to influence the operation reliability of the equipment. (3) The method lacks the capability of updating without interruption, and the existing scheme is required to be stopped and restarted, does not support a hot patch mechanism, and is difficult to meet the requirement of a high-availability system on service continuity. (4) And the main equipment and the standby equipment are not synchronous enough, namely, in a main and standby deployment environment, the patch operation lacks an efficient synchronous mechanism, the main and standby states are inconsistent easily, and the running risk of the system is increased. Therefore, how to provide an interrupt-free management method and system for intelligent thermal patches, which realize automatic execution, interrupt-free update and reliable rollback of patch operation, thereby comprehensively improving the efficiency and safety of patch management, is a technical problem to be solved. Disclosure of Invention The invention aims to solve the technical problem of providing an uninterrupted management method and system for intelligent thermal patches, which realize automatic execution, uninterrupted updating and reliable rollback of patch operation, thereby comprehensively improving the efficiency and safety of patch management. In a first aspect, the present invention provides an interrupt-free management method for an intelligent hot patch, including the following steps: Step S1, a host receives a patch operation request carrying patch operation content and patch data through a communication sub-module, wherein the patch operation content is at least one of uploading, installing, activating, deactivating and rolling back; S2, after checking the patch data carried by the patch operation request, the host computer forwards the patch operation request to the slave computer; Step S3, the standby machine automatically executes patch operation through a patch management module based on the received patch operation request, wherein the patch management module controls the patch operation by adopting a state machine mechanism to ensure no interruption of service in the patch operation process, the state machine mechanism defines and maintains global states includi