systemnet


HP Insight Manager Email Alerts

Hi Guys

Something which I hoped would be fairly simple to implement has turned out to be a source of incredible frustration, and I hope to get your help.

I have installed and configured HP Insight Manager in the hope that it could provide me with email alerts if any hardware fails on ProLiant servers, e.g. a disk drive. Too many times the failed disks are only noticed when walking past the physical hardware. With more and more servers located in data centres, this is obviously an issue.

I have followed the installation guide and installed HP SIM on a server and it performed a full discovery of the local network and added the local servers. I created a new Automatic Event Handling Task and chose the Important Proliant Events collection. I selected the "All Servers" system collection and then in the following step configured the email addresses for sending and receiving the email and confirmed the correct SMTP server settings. I then confirmed that the following categories are part of the monitored events:

Event category is Proliant Storage Events and subcategory is Logical Drive status
Event category is Proliant Storage Events and subcategory is Physical Drive Status

Now, I know that SMTP sending is working ok because I am receiving email notifications when systems are turned on or turned off. I even get email notifications when I log on or off HP SIM. I can take or leave those alerts, but ironically the really important messages relating to hardware failures do not seem to work.

I have simulated disk failures by pulling out disks in some of our servers and nothing happens. The HP agents on the server itself know about the event and this is reflected on the HP Systems Management Homepage on the server itself, but HP SIM sees nothing: no Critical or Major event is logged and subsequently no email is sent. The only Critical events logged are ones relating to "Systems being unreachable". I have confirmed that the "Hardware Status Polling for Servers" task is enabled and configured to run every five minutes.

Can somebody please explain the exact steps to follow to get this kind of alerting going or whether I am missing something really obvious?

Many thanks
JM
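The symptom described above (the local agents see the failed disk, but HP SIM logs no event) can be narrowed down by checking whether any SNMP trap reaches a management host at all when the disk is pulled. The following is only a minimal sketch of such a check, not an HP tool. It assumes the agents are configured to send traps to the machine running it, and that nothing else on that machine is already bound to UDP port 162 (the standard SNMP trap port), which will normally not be the case on the HP SIM server itself. The function names `open_listener` and `wait_for_datagram` are made up for this illustration.

```python
import socket


def open_listener(host="0.0.0.0", port=162):
    """Open a UDP socket on the SNMP trap port (binding to 162 usually needs admin rights)."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    sock.bind((host, port))
    return sock


def wait_for_datagram(sock, timeout=60.0):
    """Wait for one UDP datagram.

    Returns (sender_ip, size_in_bytes), or None if nothing arrives in time.
    The payload is not decoded; this only shows that *something* was sent.
    """
    sock.settimeout(timeout)
    try:
        data, (sender_ip, _sender_port) = sock.recvfrom(65535)
    except socket.timeout:
        return None
    return sender_ip, len(data)
```

To use it, call `open_listener()`, pull the disk, then call `wait_for_datagram(listener, timeout=300.0)`. A result of `None` suggests that no trap left the server (or that it was sent elsewhere or filtered on the way), which would point at the trap destination on the agent side rather than at the event handling task in HP SIM.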
Andrew Hancock (VMware vExpert PRO / EE Fellow/British Beekeeper)

We have seen this issue, and we ended up configuring notification alerts on each individual server to a group email address, using the installed application. Once configured, we captured the settings from the registry, imported them into all our servers, and restarted the service.
systemnet

ASKER

Hi Hanccoka

Thanks for the suggestion but unfortunately that will not work for us, as the servers we need to receive the alerts from do not run Windows. They are running either VMware or Citrix XenServer with the HP SNMP Agents installed.

Cheers
JM
Yes, that's probably the issue!

ESXi?

Are you using the HP version of ESXi, or have you added the CIM providers? You would be better off monitoring with Nagios or Splunk (or SolarWinds) via SNMP.
On both ESX and XenServer the SIM / SNMP agents are installed and functioning. I had a look and unfortunately I do not think either Nagios or Splunk can easily tap into the HP SNMP agents, specifically the parts related to the disk / storage system.

This is a bit frustrating because the whole point of HP SIM is to provide you with hardware monitoring / alerting capabilities and I am just curious if anybody with HP servers out there is actually using this product?

FYI, I have again done a drive failure simulation and have some screenshots to share which may provide a clue as to what may be wrong.

The following screenshot clearly shows the failed disk:

User generated image
Yet in HP SIM these are the only events listed for the server in question:

User generated image
As far as alerts are concerned, as you can see the following screenshot seems to indicate that the automatic alerts have not worked since earlier in the month. Not that it would have made any difference because HP SIM did not detect any critical events anyway.

User generated image
So I am sure I am missing something obvious somewhere but just can't put my finger on it. Hence I would really appreciate it if anybody out there could provide me with just a few simple bullet point steps on how to configure alerting for disk failures. I am not even bothered about other hardware failures for now; once the disk alerts are happening I think the rest will follow.

Thanks again
JM
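On the point raised above about third-party tools tapping into the storage data of the HP SNMP agents: one possible approach is to poll the drive status table and evaluate the result in a small script that a scheduler or a Nagios-style check could call. The sketch below only parses numeric `snmpwalk` output. Everything specific in it is an assumption to verify against the CPQIDA MIB file shipped with your agents: the OID `1.3.6.1.4.1.232.3.2.5.1.1.6` for the physical drive status column and the meaning of values 1 to 4 are quoted from memory, and the helper names and sample indexes are invented for illustration.

```python
import re

# ASSUMPTION: physical drive status column of the HP/Compaq drive array MIB
# (CPQIDA-MIB). Verify the OID and the value meanings against your MIB file.
PHYSICAL_DRIVE_STATUS_OID = "1.3.6.1.4.1.232.3.2.5.1.1.6"
STATUS_NAMES = {1: "other", 2: "ok", 3: "failed", 4: "predictiveFailure"}

# Matches lines such as ".1.3.6.1.4.1.232.3.2.5.1.1.6.0.1 = INTEGER: 3"
LINE_RE = re.compile(r"^\.?(?P<oid>[0-9.]+)\s*=\s*INTEGER:\s*(?P<value>\d+)")


def parse_drive_status(walk_output):
    """Return {index_suffix: status_name} for every drive status line found."""
    prefix = PHYSICAL_DRIVE_STATUS_OID + "."
    drives = {}
    for line in walk_output.splitlines():
        match = LINE_RE.match(line.strip())
        if not match or not match.group("oid").startswith(prefix):
            continue  # not a drive status row, ignore it
        index = match.group("oid")[len(prefix):]
        value = int(match.group("value"))
        drives[index] = STATUS_NAMES.get(value, f"unknown({value})")
    return drives


def drives_needing_attention(drives):
    """Return the sorted indexes of all drives whose status is not 'ok'."""
    return sorted(index for index, status in drives.items() if status != "ok")
```

The input would come from something like `snmpwalk -v2c -c <community> -On <host> 1.3.6.1.4.1.232.3.2.5.1.1.6`, with the community string and host filled in for your environment. Any non-empty result from `drives_needing_attention` is what the script would turn into an email or a failing check.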
@systemNet, so how did you go with this case, man? Has it been resolved?
No, unfortunately we never got this resolved. In the end we resorted to third-party monitoring tools, as HP Insight Manager could not be trusted.
lol, I thought so. HP SIM is hard to configure and so confusing.

I've tried to read this page: http://h20628.www2.hp.com/km-ext/kmcsdirect/emr_na-c04471776-1.pdf

Just finding the menu to set up an email alert takes more than 15 minutes.