In the overnight/early hours of March 25, 2023, Omnilert’s systems experienced issues with the transmission of alerts on all channels (SMS, Email).
Customers sending alerts experienced delayed delivery across all endpoints.
The cause was investigated by Omnilert’s engineers with the highest priority. It was determined that an issue impacting logging caused the ability of files to be written to be affected, leading to the inability of delivery services to run.
Omnilert’s system status warning and recovery automation did not properly detect this specific kind of escalating issue, which led to the service outage and delay in delivery experienced by recipients.
Once the root cause issue was discovered, Omnilert engineers were able to correct the logging issue and restart the systems.
This alleviated the immediate problem and all of Omnilert’s service was returned to normal functionality.
Naturally, this kind of incident is being studied to prevent any recurrence and further harden Omnilert’s systems against problems of this nature.
Omnilert’s team is taking the following steps to mitigate any recurrence of this issue: