Cluster Servers continue to lose connection to Storage on machine reboot.
Posted on 2008-06-25
I have a two node cluster setup and connected to a promise vtrak m300i storage utility by iSCSI. I really have two problems, but am more concerned with my first problem right now.
Several times during the day, the servers will lose connection between them and then drop connection to the storage unit and cause a delayed/write/fail error. This is starting to happen with considerable more frequency. I have to shut down both servers and then turn the storage utility off, turn it back on, and bring the servers up one at time for them to reconnect to the storage unit. This is becoming a hassle. I don't know why their dropping unless it has something to do with the cluster continuing to lose connection to each node all the time. I thought that if one server went down the other should take up the slack, but on ours, when one goes down the other pretty much goes down as well. The only time I can tell it actually works is when I initiate failover or move groups from one node to the other.
The second problem is that if one machine goes down on its own, when it comes back up, it does not connect to the storage utility automatically. Do I always have to take everything down and then bring up the storage unit first before the servers, so that they will connect? That doesn't seem very user friendly to me.