[Webinar] Streamline your web hosting managementRegister Today

x
  • Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 363
  • Last Modified:

MOM Question about Alerting for Offline/Down Systems

Hi guys, we are running MOM 2005 in an AD 2003 Environment.
What we would like to know is the following:

a) Is it true that MOM cannot detect systems that are offline or down?
The reason i ask is that it appears that MOM will only alert when systems have had unexpected shutdowns...that is...after the case the system is offline.

b) What we want is for MOM to alert us when the system is down/offline, not alert us when the system is back from being offline/down as in alerting us to an unexpected shutdown event id.

If anyone can provide any info as to how you configure MOM to alert when a system is down (eg.cannot be pinged), while that system is down, instead of alerting once the system is back up, that would be great. Thank you  :>)
0
Simon336697
Asked:
Simon336697
  • 2
  • 2
1 Solution
 
wwwallyCommented:
Hello Simon,
MOM can detect when a system is online/offline, you will receive a heart beat failure for that system when a becomes unavailable. In a lot of MOM environment this will give a lot of overhead and the rule is tuned down to receive less false positives.
But when you want some addition check to see if a system is online try the following article:
http://www.myitforum.com/articles/2/view.asp?track=NL-390&ad=515186&id=8615 
This will create a ping to the specific system and when you create the alert rule you will receive an alert.
Regards,
Walter
http://weblogwally.spaces.live.com
0
 
Simon336697Author Commented:
HI wwwally,
Thanks so much for responding :>)
wwwally, i thought that by default, MOM could only report/alert when a system has come back up from being offline, for example, alerting that there was an unexpected shutdown, and not be able to report when that system is actually offline.
So our understanding is, is that MOM cannot (out of the box) tell you WHEN a system is unreachable....only when that system comes back online does it report an alert that it WAS offline. If this is the case, then we could have systems offline for an unknown amount of time without even knowing it.....only when they come back online and report an alert that they were offline would we know. If this is incorrect what im saying (and I hope it makes sense), then where in MOM would we configure a job to poll each device and alert us when that system is offline?
Appreciate your kind help :>)
0
 
wwwallyCommented:
Your right you should get an error when a system get offline but heartbeat failure must be configured and a lot of people have complaint about how it works.
This article could probably help you with your understanding and fixing of the heartbeat failures for off line machines: http://blogs.msdn.com/steverac/archive/2006/02/11/530292.aspx
Regards,
Walter
http://weblogwally.spaces.live.com
0
 
Simon336697Author Commented:
Hi www,
Mate youve been a champion. Really appreciate it. Have a great easter.
0

Featured Post

Free Tool: ZipGrep

ZipGrep is a utility that can list and search zip (.war, .ear, .jar, etc) archives for text patterns, without the need to extract the archive's contents.

One of a set of tools we're offering as a way to say thank you for being a part of the community.

  • 2
  • 2
Tackle projects and never again get stuck behind a technical roadblock.
Join Now