We have recently installed an 8-node stretch cluster CGS (HP OEM PolyServe) file serving environment.
There are four "live" nodes, one backup node and three additional standby nodes.
We have used mount points for the SAN-attached disks to get around the limit on the number of disks available.
Users access their shares via login scripts and are mapped to their business unit share.
What we are seeing is intermittent but fairly regular. The client PCs run XP Home, and it can happen during any of the following:
Browsing using Explorer
Browsing from within the app
Opening a file
Saving a file
The client just hangs with the egg timer. It can sometimes take up to a minute to open a file, and simply opening a folder to go to the next level can take 20 seconds.
We have our network guys monitoring a connection from client to host using VitalSuite, which gives a breakdown of client/network/server activity during a transaction. When it hangs, roughly 98% of the time is reported as server activity.
I have run Ethereal, and during a hang nothing seems to stand out in the trace.
The performance at all other times is extremely fast.
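To pin down exactly when the hangs happen, something like the following could be run on a client to timestamp slow folder listings and file opens, so they can be lined up against the VitalSuite and Ethereal data. This is only a rough sketch: the function names and the 5-second threshold are arbitrary choices, and the folder and file passed in would be any test paths on one of the shares.

```python
import os
import time
from datetime import datetime


def timed(action, *args):
    """Run action(*args) and return (elapsed_seconds, result)."""
    start = time.perf_counter()
    result = action(*args)
    return time.perf_counter() - start, result


def read_first_block(path, size=4096):
    """Open a file and read its first block, like a client opening a document."""
    with open(path, "rb") as handle:
        return handle.read(size)


def probe(folder, sample_file, threshold=5.0):
    """Time one folder listing and one file open.

    Returns a list of log lines for operations that took at least
    `threshold` seconds (an arbitrary cut-off, adjust as needed).
    """
    slow = []
    checks = (
        ("listdir", os.listdir, folder),
        ("open", read_first_block, sample_file),
    )
    for label, action, target in checks:
        elapsed, _ = timed(action, target)
        if elapsed >= threshold:
            stamp = datetime.now().isoformat(timespec="seconds")
            slow.append(f"{stamp} {label} {target} took {elapsed:.1f}s")
    return slow
```

Calling probe() every 30 seconds or so against a folder and a small file on one of the mount points, and writing the returned lines to a local log, should give a list of hang times to compare against the server-side counters.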
The hosts are all dual fiber SAN-attached Windows Storage Server 2003 machines accessing an EVA8000 (synchronous CA).
This is proving to be an extremely annoying and difficult problem to pin down - if I had any hair I would be tearing it out!
The user base is 5000, with 53 mount points across those four servers, equating to around 8TB.
Here's hoping someone can help.