Have been meaning to join this forum for some years now and have finally got stumpped enough to jump on-board!!
I have a problem with a Mezzanine HBA Fibre Controller operating in a BL460 c-Class blade that is connected to an EMC CX3-20 (baby SAN). There are 8 blades in the c-7000 enclosure that all have the same physical configuration. 4 of them are running ESX 3.0 with various WIn2003 server configs and they are happily communicating with the SAN switches and SP's. 3 of the 4 remaining Blade servers are configured with WIn2003 as primary OS doing various tasks. The Blade server that is having the communication error is running Win2003 with SQL and IIS installed. The OS is on the local Blade SAS drives and the SQL database is on a SAN attached LUN. Every couple of days I am getting the following error and the server all but completely shuts down. The System Event log is riddle with the error and remains the same until I reboot the system :-
The driver detected a controller error on \Device\RaidPort1.
0000: 0010000f 00660001 00000000 c004000b
0010: f0200000 00000000 00000000 00000000
0020: 00000000 00000000 00000000 00000000
0030: 00000000 c004000b
I have upgrade the Qlogic Fibre Channel drive to the latest version (188.8.131.52 3/27/09) and I am still seeing the problem. What baffels me is the irregularity of the problem. It may go a couple of days no problem with a fair bit of load on the SQL database before needing a reboot and then it might last 1 day.
I am suspicious of the Fibre Cabling and potentially bad connectors, however before I go moving anything around I just wanted to check if there were any know issues. The fact that none of the other servers are having issues says to me that it has to be the HBA card its self (or firmware maybe) or cabling or something I am completely over looking!! ;-)
Any comments would be much appreciated!
Thanks in advance