I have read a little about isolation response and I was hoping to get some suggestions on a few things.
First, I have a cluster with two hosts in it With HA turned on. Second, one of the hosts will stop responding and the only thing I can do is to physically reboot the host to get it back. It will not respond to a ping or anything else once this has happened.
I have read about putting in a secondary isolation ip address for the cluster and also increasing the failure detection time. Are these two things pretty safe to implement without any bad side effects?
Also, could there be a false positive that is causing an isolation of one of my VMware hosts?
I have had this happen in the past and it seemed to be related to our shared storage devices. Please help!! I have been struggling with this same problem for a while now and nothing seems to fix. Thanks experts!!!