IT Event Management
IT monitoring tools such as Datadog, Grafana, New Relic, or AWS Cloudwatch are used to monitor IT applications and infrastructure. Using these tools, you can configure monitors to send out an email when a certain metric threshold exceeds the acceptable value.
Example A threshold can be set so that an email notifies you if the disk space usage exceeds 80% of the allocated disk space.
These alerts can be classified as errors, warnings, fatal messages, or recovered messages. On a day-to-day basis, these tools generate thousands of messages, where 95% of them are warning messages and only 5% of them need attention, such as for fatal messages.
Even in the event of fatal messages, most modern applications or IT infrastructure is self-recoverable. When an application or infrastructure recovers automatically, monitoring sends a follow-up recovery message to inform you that service has returned to normal.
These messages create excessive load for on-call support personnel, who must review each message to determine when to act on them or when an issue becomes an incident.
Juvare ARC's IT event management provides an event intelligent solution to minimize unrelated interference and excessive messaging, and activate support teams only when needed.