CN-121996539-A - Method, device, and program product for testing a fault resilient boot watchdog function
Abstract
The application provides a method, a device, and a program product for testing a fault resilient boot watchdog function, which can be used in the technical field of servers. The method is applied to an execution machine and comprises: generating a test configuration file based on test information and importing the test configuration file into a test machine, wherein the test information comprises a timeout period and a timeout policy corresponding to the timeout period; sending a warm reset instruction to the test machine; during the initialization stage of the basic input/output system (BIOS), sending a debug instruction to the test machine and starting a timer, wherein the debug instruction instructs a CPU of the test machine to enter a debug mode; when the timed duration reaches the timeout period, acquiring an event log of a baseboard management controller (BMC) of the test machine; and determining, based on the event log, whether the fault resilient boot watchdog function of the test machine is normal. The method significantly improves the efficiency and repeatability of fault resilient boot watchdog function testing.
Inventors
- WEI DONG
- WU LING
- Hou Jiaang
- HONG HAO
Assignees
- 上海远图未来信息技术有限公司
Dates
- Publication Date
- 20260508
- Application Date
- 20251217
Claims (10)
- 1. A method for testing a fault resilient boot watchdog function, applied to an execution machine, the execution machine being connected to at least one test machine under test, the method comprising: generating a test configuration file based on test information and importing the test configuration file into the test machine, wherein the test information comprises a timeout period and a timeout policy corresponding to the timeout period; sending a warm reset instruction to the test machine, wherein the warm reset instruction instructs the test machine to enter an initialization stage of a basic input/output system; during the initialization stage of the basic input/output system, sending a debug instruction to the test machine and starting a timer, wherein the debug instruction instructs a CPU of the test machine to enter a debug mode; when the timed duration reaches the timeout period, acquiring an event log of a baseboard management controller of the test machine; and determining, based on the event log, whether the fault resilient boot watchdog function of the test machine is normal.
- 2. The method of claim 1, wherein determining, based on the event log, whether the fault resilient boot watchdog function of the test machine is normal comprises: acquiring, based on the event log, the operation event of the baseboard management controller corresponding to the timeout period; and if the operation event is consistent with the timeout policy, determining that the fault resilient boot watchdog function is normal.
- 3. The method of claim 2, wherein the method further comprises: if the operation event is inconsistent with the timeout policy, determining that the fault resilient boot watchdog function is abnormal.
- 4. The method of claim 1, wherein before sending the warm reset instruction to the test machine, the method further comprises: sending a restart instruction to the test machine, wherein the restart instruction instructs the test machine to perform a restart operation; after the test machine has restarted, acquiring the current configuration file of the fault resilient boot watchdog; and if the current configuration file matches the test configuration file, sending the warm reset instruction to the test machine.
- 5. The method of claim 4, wherein the method further comprises: if the current configuration file does not match the test configuration file, returning to the step of generating the test configuration file based on the test information and importing the test configuration file into the test machine.
- 6. The method of any of claims 1-5, wherein before sending the debug instruction to the test machine, the method further comprises: sending an unlock instruction to the test machine, wherein the unlock instruction instructs the CPU of the test machine to unlock; and if the CPU is successfully unlocked, sending the debug instruction to the test machine.
- 7. The method of any of claims 1-5, wherein the test configuration file further comprises a function state, the function state being configured as enabled.
- 8. An electronic device comprising a processor and a memory communicatively coupled to the processor; the memory stores computer-executable instructions; and the processor executes the computer-executable instructions stored in the memory to implement the method of any one of claims 1 to 7.
- 9. A computer readable storage medium having stored therein computer executable instructions which when executed are adapted to implement the method of any of claims 1 to 7.
- 10. A computer program product comprising a computer program for implementing the method of any one of claims 1 to 7 when the computer program is executed.
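The pass/fail determination in claims 2 and 3 can be illustrated with a minimal Python sketch. It assumes the event log of the baseboard management controller has already been parsed into simple records; the `WatchdogEvent` shape, the `"frb_watchdog"` source label, and the action names are hypothetical and not taken from the application, since real BMC log formats vary by vendor.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class WatchdogEvent:
    """Hypothetical parsed BMC event-log record (not a real log format)."""
    source: str  # assumed label of the timer that expired
    action: str  # assumed label of the action the BMC took on expiry


def find_timeout_action(events: Iterable[WatchdogEvent],
                        source: str = "frb_watchdog") -> Optional[str]:
    """Return the action of the last expiry event logged for `source`, if any."""
    action = None
    for event in events:
        if event.source == source:
            action = event.action
    return action


def watchdog_is_normal(events: Iterable[WatchdogEvent], timeout_policy: str) -> bool:
    """Normal only if the logged action matches the configured timeout policy."""
    return find_timeout_action(events) == timeout_policy
```

With this shape, a log that contains no expiry event for the watchdog is treated as abnormal, because no operation event can be matched against the timeout policy.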
Description
Method, device, and program product for testing a fault resilient boot watchdog function
Technical Field
The present application relates to the field of server technologies, and in particular to a method, a device, and a program product for testing a fault resilient boot watchdog function.
Background
The fault resilient boot watchdog is a watchdog timer that covers the fault resilient boot stage of a server. During this stage a firmware-level timer is started to monitor for a timeout: if the server fails to complete the stage in time, then after the timer expires the server firmware triggers a preset recovery mechanism in an attempt to restore normal operation of the server. To ensure that the fault resilient boot watchdog works properly, its function must therefore be tested.
The existing test method mainly installs faulty hardware on the server motherboard to create an abnormal initialization state of the basic input/output system, so that the server hangs in the hardware initialization stage during startup. This triggers the fault resilient boot watchdog timer, making it possible to check whether the watchdog function operates normally. This scheme has two drawbacks. First, test personnel must collect the faulty hardware needed for the fault test. Second, the test environment must be rebuilt with the faulty hardware for every test. Repeated tests are therefore difficult to run quickly and efficiently, and both test efficiency and test repeatability are low.
Disclosure of Invention
The application provides a method, a device, and a program product for testing a fault resilient boot watchdog function, which address the low test efficiency and low test repeatability of existing fault resilient boot watchdog test schemes.
According to a first aspect, the present application provides a method for testing a fault resilient boot watchdog function, applied to an execution machine that is connected to at least one test machine under test. The method includes:
- generating a test configuration file based on test information and importing the test configuration file into the test machine, wherein the test information comprises a timeout period and a timeout policy corresponding to the timeout period;
- sending a warm reset instruction to the test machine, wherein the warm reset instruction instructs the test machine to enter an initialization stage of a basic input/output system;
- during the initialization stage of the basic input/output system, sending a debug instruction to the test machine and starting a timer, wherein the debug instruction instructs a CPU of the test machine to enter a debug mode;
- when the timed duration reaches the timeout period, acquiring an event log of a baseboard management controller of the test machine; and
- determining, based on the event log, whether the fault resilient boot watchdog function of the test machine is normal.
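The sequence of steps in the first aspect can be sketched in Python as follows. This is only an illustration under stated assumptions: `MachineHooks`, its four callables, the JSON layout produced by `build_config`, and the simplified verdict are all hypothetical, and in practice each hook would wrap whatever BMC or debug interface the test machine actually exposes.

```python
import json
import time
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class MachineHooks:
    """Hypothetical hooks into one test machine (none are named in the application)."""
    import_config: Callable[[str], None]       # import the test configuration file
    warm_reset: Callable[[], None]             # send the warm reset instruction
    halt_cpu: Callable[[], None]               # debug instruction: CPU enters debug mode
    read_bmc_actions: Callable[[], List[str]]  # actions found in the BMC event log


def build_config(timeout_s: int, policy: str) -> str:
    """Serialize the test information; this JSON layout is an assumption."""
    return json.dumps({"enabled": True, "timeout_s": timeout_s, "policy": policy})


def run_test(machine: MachineHooks, timeout_s: int, policy: str,
             sleep: Callable[[float], None] = time.sleep) -> bool:
    """Run one test pass and return True if the watchdog behaved as configured."""
    machine.import_config(build_config(timeout_s, policy))
    machine.warm_reset()   # test machine enters the BIOS initialization stage
    machine.halt_cpu()     # CPU enters debug mode during BIOS initialization
    sleep(timeout_s)       # timing starts when the debug instruction is sent
    # Simplified verdict: the configured policy must appear among the logged actions.
    return policy in machine.read_bmc_actions()
```

Passing `sleep` as a parameter is a design choice for this sketch: it lets the flow be exercised against stub hooks without waiting for a real timeout period.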
In a possible implementation, determining, based on the event log, whether the fault resilient boot watchdog function of the test machine is normal includes: acquiring, based on the event log, the operation event of the baseboard management controller corresponding to the timeout period; and if the operation event is consistent with the timeout policy, determining that the fault resilient boot watchdog function is normal.
In a possible implementation, the method further includes: if the operation event is inconsistent with the timeout policy, determining that the fault resilient boot watchdog function is abnormal.
In a possible implementation, before sending the warm reset instruction to the test machine, the method further includes: sending a restart instruction to the test machine, wherein the restart instruction instructs the test machine to perform a restart operation; after the test machine has restarted, acquiring the current configuration file of the fault resilient boot watchdog; and if the current configuration file matches the test configuration file, sending the warm reset instruction to the test machine.
In a possible implementation, the method further includes: if the current configuration file does not match the test configuration file, returning to the step of generating the test configuration file based on the test information and importing the test configuration file into the test machine.
In a possible implementation, before sending the debug instruction to the test machine, the method further includes: sending an unlock instruction to the test machine, where