EP-4740380-A1 - METHOD AND SYSTEM FOR MANAGING LOAD OF ALARMS IN A NETWORK MANAGEMENT SYSTEM
Abstract
The present disclosure relates to a system (125) and a method (600) for managing load of alarms in a Network Management Systems (NMS) (120). The method (600) includes the step of fetching, by one or more processors (205), data pertaining to a total count of alarms stored in a database (130). The method (600) includes the step of comparing, by the one or more processors (205), the total count of alarms with a first predefined threshold and a second predefined threshold, respectively. The method (600) includes the step of performing, by the one or more processors (205), one or more tasks to manage the load of alarms in the NMS (120) based on comparison of the total count of alarms with the first predefined threshold and the second predefined threshold, respectively.
Inventors
- BHATNAGAR, AAYUSH
- BISHT, SANDEEP
- MISHRA, RAHUL
- Sekar, Pavithra
- SHARMA, Smridhi
- E, Elanchezhiyan
- TIWARI, SUMIT
Assignees
- Jio Platforms Limited
Dates
- Publication Date
- 20260513
- Application Date
- 20240702
Claims (20)
- 1. A method (600) for managing load of alarms in a Network Management System (NMS) (120), the method (600) comprises the steps of: fetching (605), by one or more processors (205), data pertaining to a total count of alarms stored in a database (130); comparing (610), by the one or more processors (205), the total count of alarms with a first predefined threshold and a second predefined threshold, respectively; and performing (615), by the one or more processors (205), one or more tasks to manage the load of alarms in the NMS (120) based on comparison of the total count of alarms with the first predefined threshold and the second predefined threshold, respectively.
- 2. The method (600) as claimed in claim 1 , wherein the one or more tasks performed by the one or more processors (205), includes at least one of: disabling, consumption of subsequent one or more alarms until a pre-defined time period is completed, in response to detecting, the total count of alarms being greater than the first predefined threshold; enabling, consumption of subsequent one or more alarms until the pre-defined time period is completed, in response to detecting, the total count of alarms being lower than the second predefined threshold; enabling, consumption of subsequent one or more alarms until the pre-defined time period is completed, in response to detecting, the total count of alarms being lower than the first predefined threshold and greater than the second predefined threshold, and the total count of the alarms increases until the total count of alarms reaches the first pre-defined threshold; and disabling, consumption of subsequent one or more alarms until the pre-defined time period is completed, in response to detecting, the total count of alarms being greater than the first predefined threshold, and the total count of the alarms at least one of, remains constant and decreases.
- 3. The method (600) as claimed in claim 1 , wherein the one or more processors (205) fetches details of the alarm by: retrieving, an alarm identifier from a message stream; and fetching, a relevant alarm from the database (130) using the retrieved alarm identifier.
- 4. The method (600) as claimed in claim 1 , wherein the alarms are generated by the one or more network elements and stored in the database (130) with the alarm identifier.
- 5. The method (600) as claimed in claim 1, wherein the one or more processors (205) communicates with the database (130) to facilitate in maintaining a timestamp array for each alarm, thereby enabling accurate tracking of alarm occurrences.
- 6. The method (600) as claimed in claim 1, wherein the first predefined threshold and the second predefined threshold are configured by a user.
- 7. The method (600) as claimed in claim 1, wherein the method (600) further comprises the step of: transmitting, by the one or more processors (205), at least one of, a burst critical alert via Short Message Service (SMS) or email to the user when the total count of alarms is greater than the first predefined threshold for a pre-defined number of consecutive time intervals.
- 8. The method (600) as claimed in claim 1, wherein subsequent to the total count of alarms in the database (130) is greater than the first predefined threshold, the one or more processors, delays in consumption of the one or more alarms in the NMS (120) to manage the load of alarms, wherein the delay is at least one of, a predefined and dynamic.
- 9. The method (600) as claimed in claim 1, wherein the method (600) further comprises the step of: managing the load of the alarms in the NMS (120) by introducing a first counter (235) that stores count of consumed alarm records from the message stream and a second counter (240) that stores the count of alarm records inserted to the database (130).
- 10. The method (600) as claimed in claim 9, wherein the method (600) further comprises the step of: comparing, by the one or more processors (205), an absolute difference between the second counter (240) and the first counter (235) with the first predefined threshold; in response to the comparison, enabling, by the one or more processors (205), a flag to indicate a threshold breach of the database (130) when the absolute difference between the second counter (240) and the first counter (235) is greater than the first predefined threshold; and disabling, by the one or more processors (205), the flag when the absolute difference between the second counter (240) and the first counter (235) is less than the first predefined threshold.
- 11. The method (600) as claimed in claim 1, wherein the method (600) further comprises the steps of: retrieving, by the one or more processors (205), load of the alarms from the message stream; communicating, by the one or more processors (205), with the database (130) pertaining to the load of the alarms; and segregating, by the one or more processors (205), the load as at least one of, a Raise, a Clear, a Retry and a Call-back.
- 12. A system (125) for managing load of alarms in a Network Management System (NMS) (120), the system (125) comprising: a fetching unit (220) configured to fetch, data pertaining to a total count of alarms stored in a database (130); a comparator (225) configured to compare, the total count of alarms with a first predefined threshold and a second predefined threshold, respectively; and a load manager (230) configured to perform, one or more tasks to manage the load of alarms in the NMS (120) based on comparison of the total count of alarms with the first predefined threshold and the second predefined threshold, respectively.
- 13. The system (125) as claimed in claim 12, wherein the one or more tasks performed by the load manager (230), includes at least one of: disabling, consumption of subsequent one or more alarms until a pre-defined time period is completed, in response to detecting, the total count of alarms being greater than the first predefined threshold; enabling, consumption of subsequent one or more alarms until the pre-defined time period is completed, in response to detecting, the total count of alarms being lower than the second predefined threshold; enabling, consumption of subsequent one or more alarms until the pre-defined time period is completed, in response to detecting, the total count of alarms being lower than the first predefined threshold and greater than the second predefined threshold, and the total count of the alarms increases until the total count of alarms reaches the first pre-defined threshold; and disabling, consumption of subsequent one or more alarms until the pre-defined time period is completed, in response to detecting, the total count of alarms being greater than the first predefined threshold, and the total count of the alarms at least one of, remains constant and decreases.
- 14. The system (125) as claimed in claim 12, wherein the fetching unit (220) fetches details of the alarm by: retrieving, an alarm identifier from a message stream; and fetching, a relevant alarm from the database (130) using the retrieved alarm identifier.
- 15. The system (125) as claimed in claim 12, wherein the alarms are generated by the one or more network elements and stored in the database (130) with an alarm identifier.
- 16. The system (125) as claimed in claim 12, wherein the load manager (230) communicates with the database (130) to facilitate in maintaining a timestamp array for each alarm, thereby enabling accurate tracking of alarm occurrences.
- 17. The system (125) as claimed in claim 12, wherein the first predefined threshold and the second predefined threshold are configured by a user.
- 18. The system (125) as claimed in claim 12, wherein the load manager (230) is further configured to: transmit, at least one of, a burst critical alert via Short Message Service (SMS) or email to the user when the total count of alarms is greater than the first predefined threshold for a pre-defined number of consecutive time intervals.
- 19. The system (125) as claimed in claim 12, wherein subsequent to the total count of alarms in the database (130) is greater than the first predefined threshold, the load manager (230), delays in consumption of the one or more alarms in the NMS (120) to manage the load of alarms, wherein the delay is at least one of, a predefined and dynamic.
- 20. The system (125) as claimed in claim 12, wherein the load manager (230) is further configured to: manage load of the alarms in the NMS (120) by introducing a first counter (235) that stores count of consumed alarm records from the message stream and a second counter (240) that stores the count of alarm records inserted to the database.
Description
METHOD AND SYSTEM FOR MANAGING LOAD OF ALARMS IN A NETWORK MANAGEMENT SYSTEM FIELD OF THE INVENTION [0001] The present invention generally relates to wireless communication networks, and more particularly relates to a method and system for managing load of alarms in the networks. BACKGROUND OF THE INVENTION [0002] Network Management Systems (NMS) are software applications or platforms designed to monitor, control, and manage computer networks. NMS solutions play a crucial role in maintaining network performance, reliability, and security. NMS provides a centralized interface for network administrators to efficiently manage and troubleshoot network devices, services, and resources. [0003] NMS tools monitor network devices and services for fault conditions and generate alarms or notifications when issues are detected. Alarms can include events such as device failures, connectivity problems, threshold violations, security breaches, or performance degradation. [0004] By effectively managing network traffic and alarms, NMS tools help network administrators monitor network health, troubleshoot issues, and maintain a stable and reliable network infrastructure. These functionalities contribute to efficient network operations and better end-user experiences. [0005] In the Network Management System, there are several possible events of sudden and significant increase in the number of alarms or events being generated within a short period, that is known as burst case. It could be due to various factors such as network failures, equipment malfunctions, security incidents, or sudden spikes in network traffic. During a burst case, the NMS experiences a surge in the volume of alarms, which can potentially overwhelm the system and its components responsible for processing and managing those alarms. [0006] The existing burst mechanism may not have been designed to handle sudden and significant increases in alarm volumes. As a result, the NMS may struggle to scale up its processing capabilities to accommodate the burst, leading to performance degradation or unresponsiveness. [0007] Additionally, if the existing burst mechanism heavily relies on manual intervention for managing the burst, it can introduce delays and increase the risk of errors. Manual processes may not be able to keep up with the rapid influx of alarms, potentially leading to extended resolution times and suboptimal handling of the burst situation. [0008] Therefore, there is a need for an advancement of a system and method that can overcome at least one of the above shortcomings, particularly for managing load of alarms in the network management system. BRIEF SUMMARY OF THE INVENTION [0009] One or more embodiments of the present disclosure provide a method and system for managing load of alarms in a Network Management Systems (NMS). [0010] In one aspect of the present invention, a method for managing load of alarms in a Network Management Systems (NMS) is disclosed. The method includes the step of fetching, by one or more processors, data pertaining to a total count of alarms stored in a database. The method includes the step of comparing, by the one or more processors, the total count of alarms with a first predefined threshold and a second predefined threshold, respectively. The method includes the step of performing, by the one or more processors, one or more tasks to manage the load of alarms in the NMS based on comparison of the total count of alarms with the first predefined threshold and the second predefined threshold, respectively. [0011] In one embodiment, the one or more tasks performed by the one or more processors, includes at least one of disabling, consumption of subsequent one or more alarms until a pre-defined time period is completed, in response to detecting, the total count of alarms being greater than the first predefined threshold. The one or more tasks includes enabling, consumption of subsequent one or more alarms until the pre-defined time period is completed, in response to detecting, the total count of alarms being lower than the second predefined threshold. The one or more tasks includes enabling, consumption of subsequent one or more alarms until the pre-defined time period is completed, in response to detecting, the total count of alarms being lower than the first predefined threshold and greater than the second predefined threshold, and the total count of the alarms increases until the total count of alarms reaches the first predefined threshold. The one or more tasks includes disabling, consumption of subsequent one or more alarms until the pre-defined time period is completed, in response to detecting, the total count of alarms being lower than the first predefined threshold and greater than the second predefined threshold, and the total count of the alarms at least one of, remains constant and decreases. [0012] In another embodiment, the one or more processors fetches details of the alarm by retrieving an alarm identifier