How do you measure system availability
Posted on 2014-08-20
Hi EE -
We use SolarWinds to monitor services, ram, cpu, disk space, and ping availability for our critical systems. However, this number usually has an up-time of 99.999 percent each month. We feel this isn’t really an accurate number, because something out of our hands or not currently monitored could be causing slow performance for staff thus it isn’t captured that moment.
Examples: influx of spam email happens globally, then many people receive a spam message; 3rd party hosted app is running slow but works; system nightly backups complete at 100%, but some take much longer due to influx of new data added.
How do you capture system availability information and deliver it to senior management?