Adaptec 2100s SCSI RAID 1 won’t let me replace “failed” drive.
Posted on 2006-05-10
Last week I noticed an alarm in the server room. I looked around and it was coming from our SMTP server. I knew this was the SCSI controller, so I shut down and ran the SMOR utility (Ctrl-A during boot). In SMOR I saw that one of the drives (ID 1) was showing “failed” and the other drive (ID 0) was showing “optimal”. “OK, no big deal, I’ll just swap out the drive,” I thought.

I came back at 2 AM to do the swap. I have a Seagate ST336607LW, and it’s replacing an ST336607LC. The only difference between the drives is that the LC is 80-pin and the LW is 68-pin. They aren’t in a hot-swap rack or anything; the drives are internal, with 68 -> 80 pin converters on the cable.

When I swapped the drive and ran the SMOR utility, this time it showed 2 RAIDs. This is odd, since there was only 1 RAID on the system. It was 2:30 AM and I don’t recall exactly what was on the screen, but it was reporting that on the 1st RAID one of the drives was “missing information” and the other was “optimal”. The 2nd RAID was showing the mirror image, with one drive as “failed” and the other as “missing information”. I attempted to run repair on the array that was showing the “optimal” drive. No dice: the screen blinked and nothing happened, not even an error.

I ended up reinstalling the old drive and attempted a repair with it. This time it showed only 1 RAID and I was able to run repair on it. The really annoying alarm is silenced and the system is chugging along nicely, but I can only assume the array can’t be trusted, and I need to revisit this issue. Has anyone ever seen this before?