Search

CN-121996472-A - Method, device, equipment and medium for recovering after hanging and dying substrate management controller

CN121996472ACN 121996472 ACN121996472 ACN 121996472ACN-121996472-A

Abstract

The invention discloses a method, a device, equipment and a medium for recovering a substrate management controller after hanging, which relate to the technical field of servers and comprise the steps of controlling a first logic switch to conduct circuit connection on a target chip selection signal in the substrate management controller and a second Flash through the second logic switch if a first Flash fails in a non-hardware mode in the operation of the substrate management controller, so that the substrate management controller can recover starting based on the second Flash, storing configuration information of a basic output and input system and the substrate management controller into an electrified erasable programmable read-only memory connected with the substrate management controller, and starting the substrate management controller according to the configuration information in the electrified erasable programmable read-only memory when restarting, wherein the first logic switch is a logic switch positioned among the chip selection signals connected with the first Flash, the second Flash and the substrate management controller. The invention reduces the influence of faults on the service.

Inventors

  • GAO PENG
  • LV DONGBO

Assignees

  • 浪潮计算机科技有限公司

Dates

Publication Date
20260508
Application Date
20260129

Claims (10)

  1. 1. A post-hang recovery method for a baseboard management controller, comprising: In the operation of the baseboard management controller, if a first Flash has a non-hardware fault, a second logic switch is used for controlling the first logic switch to conduct circuit connection between a target chip selection signal in the baseboard management controller and a second Flash, so that the baseboard management controller is started and recovered based on the second Flash; Storing configuration information of a basic input and output system and the baseboard management controller into a charged erasable programmable read-only memory connected with the baseboard management controller, so that when the baseboard management controller is restarted, corresponding starting is performed according to the configuration information in the charged erasable programmable read-only memory; The first logic switch is a logic switch positioned in the middle of a chip selection signal connected with the first Flash, the second Flash and the baseboard management controller.
  2. 2. The post-hang recovery method of a baseboard management controller of claim 1, further comprising: And if the second logic switch is disconnected, starting the baseboard management controller based on the first Flash.
  3. 3. The post-suspension recovery method of a baseboard management controller according to claim 1, wherein before storing configuration information of a basic input-output system and the baseboard management controller in a live erasable programmable read-only memory connected with the baseboard management controller, the method further comprises: The charged erasable programmable read-only memory is connected with the baseboard management controller through a serial communication bus.
  4. 4. The post-suspension recovery method of a baseboard management controller according to claim 1, wherein storing configuration information of a basic input and output system and the baseboard management controller to a charged erasable programmable read-only memory connected with the baseboard management controller comprises: The method comprises the steps of adding an operable option on a Web page of a baseboard management controller, wherein the operable option comprises an option of synchronizing configuration information of a basic input-output system to an electrified erasable programmable read-only memory, an option of synchronizing the configuration information of the baseboard management controller to the electrified erasable programmable read-only memory, an option of starting according to the configuration information of the basic input-output system in the electrified erasable programmable read-only memory and an option of importing the configuration information of the baseboard management controller; After the customer service normally runs, the configuration information of the current basic input and output system is saved to the charged erasable programmable read-only memory connected with the baseboard management controller through the option of synchronizing the configuration information of the basic input and output system to the charged erasable programmable read-only memory; And after the baseboard management controller is stabilized, storing the configuration information of the baseboard management controller to a charged erasable programmable read-only memory connected with the baseboard management controller based on the option of synchronizing the configuration information of the baseboard management controller to the charged erasable programmable read-only memory.
  5. 5. The method for post-suspension recovery of a baseboard management controller according to claim 4, wherein when the baseboard management controller is restarted, the baseboard management controller is correspondingly started according to configuration information in the charged erasable programmable read-only memory, comprising: When the baseboard management controller is restarted, the basic output input system is started according to the configuration information of the basic output input system in the electrified erasable programmable read-only memory, and the starting is performed according to the configuration information of the current baseboard management controller in the electrified erasable programmable read-only memory.
  6. 6. The post-hang recovery method of a baseboard management controller of any one of claims 1-5, further comprising: Comparing the configuration information stored in the baseboard management controller with the configuration information stored in the electrified erasable programmable read-only memory based on a target period to obtain a corresponding judgment result; and carrying out corresponding operation on the charged erasable programmable read-only memory according to the judging result.
  7. 7. The method for post-suspension recovery of a baseboard management controller according to claim 6, wherein the performing the corresponding operation on the live eeprom according to the determination result comprises: If the configuration information stored in the baseboard management controller is the same as the configuration information stored in the electrified erasable programmable read-only memory, no operation is executed; And if the configuration information stored in the baseboard management controller is different from the configuration information stored in the electrified erasable programmable read-only memory, triggering alarm information so as to manually determine whether to synchronize the configuration information stored in the baseboard management controller to the electrified erasable programmable read-only memory according to the alarm information.
  8. 8. A post-hang recovery device for a baseboard management controller, comprising: The switch control module is used for controlling the first logic switch to conduct circuit connection between a target chip selection signal in the baseboard management controller and the second Flash through the second logic switch if the first Flash has a non-hardware fault in the operation of the baseboard management controller, so that the baseboard management controller can be started and restored based on the second Flash; The configuration information storage module is used for storing the configuration information of the basic input and output system and the baseboard management controller to an electrified erasable programmable read-only memory connected with the baseboard management controller, so that when the baseboard management controller is restarted, corresponding starting is carried out according to the configuration information in the electrified erasable programmable read-only memory; The first logic switch is a logic switch positioned in the middle of a chip selection signal connected with the first Flash, the second Flash and the baseboard management controller.
  9. 9. An electronic device, comprising: A memory for storing a computer program; a processor for executing the computer program to implement the steps of the post-hang recovery method of a baseboard management controller as claimed in any one of claims 1 to 7.
  10. 10. A computer readable storage medium, wherein a computer program is stored on the computer readable storage medium, which when executed by a processor, implements the steps of the post-hang recovery method of the baseboard management controller of any one of claims 1 to 7.

Description

Method, device, equipment and medium for recovering after hanging and dying substrate management controller Technical Field The present invention relates to the field of server technologies, and in particular, to a method, an apparatus, a device, and a medium for recovering a substrate management controller after suspension. Background With the wide-spread use of OPEN BMC (Baseboard Management Controller ), development is convenient and quick for each server manufacturer, and meanwhile, a certain risk is brought, and a plurality of unknown unstable factors exist inside the OPEN source project. Recently, the problem of multi-BMC hanging is solved at the market end, so that the fan management of the server is out of control, the power of the cabinet is out of standard after the fan rotates fully, the tripping risk exists in the whole cabinet, and customer complaints and worry are caused. In addition, the condition that the BMC is hung up is complicated, a large number of forced restarting and unrecoverable conditions exist, BMC configuration and BIOS (Basic Input Output System ) configuration inside the BMC are lost, great trouble is brought to the use of a client, particularly, the service cannot be pulled up after the BIOS configuration is lost, because the configuration on a plurality of services needs BIOS special settings to support, particularly, service scenes such as virtualization and the like and scenes with extremely high performance requirements are optimized, and the service cannot be successfully pulled up on the client after the service is reconfigured by the client, so that the service is a disastrous event. Most manufacturers in the prior art also adopt a main flash chip and a standby flash chip to ensure the stable operation of the BMC as much as possible, but the design can only ensure that the main flash chip is switched to the standby flash operation when the hardware failure of the main flash chip can not communicate, and the problems of the hanging up of the BMC, the configuration loss and the like can not be solved. Therefore, how to ensure that the BMC is hung up, and can also perform extreme operation to ensure that the customer service can run steadily when the situation such as configuration loss occurs is a problem to be solved by the person skilled in the art. Disclosure of Invention The embodiment of the invention aims to provide a method, a device, equipment and a medium for recovering a substrate management controller after hanging, which can ensure that the BMC is hung up, and can also perform extreme operation when the conditions such as configuration loss occur so as to ensure that customer business can run stably and eliminate hidden danger of the customer in time. The specific scheme is as follows: In a first aspect, the present invention provides a method for recovering a substrate management controller after suspension, including: in the operation of the baseboard management controller, if the first Flash has a non-hardware fault, the first logic switch is controlled by the second logic switch to conduct circuit connection between a target chip selection signal in the baseboard management controller and the second Flash, so that the baseboard management controller is started and recovered based on the second Flash; the configuration information of the basic input and output system and the baseboard management controller is stored in an electrified erasable programmable read-only memory connected with the baseboard management controller, so that when the baseboard management controller is restarted, corresponding starting is carried out according to the configuration information in the electrified erasable programmable read-only memory; The first logic switch is positioned in the middle of a chip selection signal connected with the first Flash, the second Flash and the baseboard management controller. Optionally, the method for recovering the substrate management controller after hanging up further includes: And if the second logic switch is disconnected, starting the baseboard management controller based on the first Flash. Optionally, before storing the configuration information of the bios and the baseboard management controller in the live eeprom connected to the baseboard management controller, the method further includes: The charged erasable programmable read-only memory is connected with the baseboard management controller through a serial communication bus. Optionally, storing configuration information of the bios and the baseboard management controller in a charged eeprom connected to the baseboard management controller, including: The method comprises the steps of adding an operable option on a Web page of a baseboard management controller, wherein the operable option comprises an option of synchronizing configuration information of a basic input and output system to a charged erasable programmable read-only memory, an option of synchronizing the configuration