Downtime on 5/8/10
We were down for about eight hours today. The downtime occurred while we were upgrading system software trying to address the downtime and related issues from the past week. This was caused when a system component got upgraded before a planned kernel upgrade was completed. Unfortunately, this left the system in a state where we needed to be on-site to work on it.
After getting to the unit, we were able to diagnose and fix the issue, and while doing so completed some additional upgrades to the machine.
We are continuing to investigate the cause of our downtime; the server has not been performing as it should and we are investigating how to best and most effectively fix it.
