Sunday 3rd June 2012

Network 03/06/2012 downtime explaination

Our report from the incident on 03/06/2012 is as follows.
 
Issue

sms-akuma restarted

Underlying cause

Watchdog triggered a reboot

Symptoms

Complete loss of service on sms-akuma.

Resolution

The automatic watchdog monitoring service restarted the server after detecting a non-recoverable error.

This is the second incident of this nature within 60 days - so the chassis will be taken down for fault testing and sms-akuma will be migrated to another physical server.

This maintenance window is scheduled for 03/06/2012 11:00pm to minimise disruption. Downtime should be a maximum of 20 minutes.