• Status: Solved

How do you measure system availability?

Hi EE -

We use SolarWinds to monitor services, RAM, CPU, disk space, and ping availability for our critical systems. However, the reported uptime is usually 99.999 percent each month. We feel this isn’t really an accurate number, because something outside our control or not currently monitored could be causing slow performance for staff, and the monitoring doesn’t capture it when it happens.

Examples: a global influx of spam email reaches many staff inboxes; a third-party hosted app runs slowly but still works; nightly system backups complete at 100%, but some take much longer because of an influx of new data.

How do you capture system availability information and deliver it to senior management?
First Last
1 Solution
Well, you first need to define clearly what "availability" means to you. The cases you describe don't mean that the system is unavailable, so 99.999% sounds correct to me. You should define SLAs that specify how quickly your applications must respond. Whenever an SLA is not met, you count that time as "unavailable" and measure it.
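
To make the difference concrete, here is a minimal sketch in Python of how an SLA-based figure can diverge from a plain up/down figure. Everything in it is an assumption for illustration: the `Sample` record, the two function names, the 500 ms threshold, and the sample data are hypothetical and not something SolarWinds produces in this form. In practice you would feed in response-time data exported from whatever monitoring tool you use.

```python
from dataclasses import dataclass


@dataclass
class Sample:
    """One monitoring probe (hypothetical record): success flag and response time."""
    ok: bool
    response_ms: float


def ping_availability(samples):
    """Fraction of probes that succeeded at all (classic up/down availability)."""
    if not samples:
        raise ValueError("no samples")
    return sum(1 for s in samples if s.ok) / len(samples)


def sla_availability(samples, sla_ms):
    """Fraction of probes that succeeded AND responded within the SLA threshold."""
    if not samples:
        raise ValueError("no samples")
    good = sum(1 for s in samples if s.ok and s.response_ms <= sla_ms)
    return good / len(samples)


# Illustrative data: every probe succeeded, but one response was very slow.
samples = [
    Sample(ok=True, response_ms=120),
    Sample(ok=True, response_ms=150),
    Sample(ok=True, response_ms=2400),  # "slow but works"
    Sample(ok=True, response_ms=90),
]

print(f"up/down availability: {ping_availability(samples):.1%}")
print(f"SLA availability:     {sla_availability(samples, sla_ms=500):.1%}")
```

With this made-up data the up/down figure is 100% while the SLA figure is 75%, which is the kind of gap the "slow but works" cases in the question would show up as. The threshold is the part you would agree on with management per application.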
