Solved

3ware 9500S RAID Controller

Posted on 2014-03-06
Last Modified: 2016-06-26
I have a file server running Server 2003 SP2 with a RAID 10 array on a 3ware 9500S-12 card. The drives are Western Digital WD1002FBYS 1 TB.
We have begun having severe disk performance issues with this machine and I cannot determine why. We have very bad fragmentation and we cannot defrag (it hangs at about 3% no matter what software is used).
I would assume that I have at least one failing disk in the array, but the 3DM2 site shows healthy disks. The only indication I saw that there might be a bad disk was last night in the Drive section... one of the drives was highlighted in yellow. There were no alarms and I could not find any documentation on what that means (I assume it's bad!). This morning, the highlight is gone.
The controller will give me a SMART readout, but it is all in hex and I cannot find any info online on how to interpret it.

All that said, has anyone encountered the yellow highlight before?
Does anyone know how to read the Western Digital Hex readout?
Is there anything else I can do that I am not already doing? At this point, I am tempted to start swapping out drives. 48% fragmentation is killing us!
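On the hex readout question, here is a minimal sketch of how such a dump might be decoded. It assumes the controller shows the raw 512-byte ATA SMART data sector, laid out the way most drives use it: a 2-byte revision number followed by up to thirty 12-byte attribute entries. Whether 3DM2 prints exactly this layout is not confirmed anywhere in this thread, and the function name and attribute table below are illustrative only.

```python
import struct

# Commonly used attribute IDs. Names are de facto conventions, not guaranteed
# for every vendor or model.
ATTRIBUTE_NAMES = {
    1: "Raw Read Error Rate",
    5: "Reallocated Sectors Count",
    9: "Power-On Hours",
    194: "Temperature",
    197: "Current Pending Sector Count",
    198: "Offline Uncorrectable",
    199: "UDMA CRC Error Count",
}


def parse_smart_hex(hex_dump):
    """Decode a hex dump of a SMART data sector into a list of attribute dicts.

    Assumes the conventional layout: bytes 0-1 are a revision number, then
    up to 30 entries of 12 bytes each.
    """
    data = bytes.fromhex("".join(hex_dump.split()))
    attributes = []
    for offset in range(2, 2 + 30 * 12, 12):
        entry = data[offset:offset + 12]
        if len(entry) < 12 or entry[0] == 0:
            continue  # truncated dump or unused attribute slot
        # Byte 0: ID, bytes 1-2: flags, byte 3: current value, byte 4: worst value.
        attr_id, flags, value, worst = struct.unpack_from("<BHBB", entry, 0)
        # Bytes 5-10: 48-bit raw value, little-endian. Byte 11 is reserved.
        raw = int.from_bytes(entry[5:11], "little")
        attributes.append({
            "id": attr_id,
            "name": ATTRIBUTE_NAMES.get(attr_id, "Unknown"),
            "flags": flags,
            "value": value,
            "worst": worst,
            "raw": raw,
        })
    return attributes
```

If the layout assumption holds, non-zero raw values for attributes 5, 197, or 198 are the ones usually associated with a drive that is remapping or failing to read sectors.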
Question by:JP_TechGroup

Accepted Solution

by:
dlethe earned 500 total points
You most likely have unrecoverable read errors. Run Windows chkdsk with the flag that checks for and repairs bad blocks across the entire volume (chkdsk /r), which means scheduling it for the next reboot.

This is most likely a combination of unrecoverable read errors and filesystem corruption. If you are lucky, the check disk will be able to repair it. If not, then you'll have to create a full bootable backup, blow the RAID away, reinitialize it, then do a restore.

Author Comment

by:JP_TechGroup
That is so completely not the answer I wanted to hear!

Author Comment

by:JP_TechGroup
Checkdisk found and repaired a few errors. There were no bad sectors and no unrecoverable errors. Using Diskeeper and a lot of patience, I have finally managed to defrag the drive, more or less, but the read/write performance is still terrible.
Now the 3ware driver has begun spitting out errors every hour into the application log.
Typical of 3ware, there is no reference to what they mean to be found anywhere online.
They are all event 3 and have entries like:
Packet: Id=80 Opcode=0x10
Sense:  Stat=0x2 Err=0x10D Slen=18
MODE_SENSE6 Unit=0 Len=64
FW: INVALID_OPCODE (opcode=0x4D)
Packet: Id=114 Opcode=0x10
Sense:  Stat=0x2 Err=0x101 Slen=18
0x4D Unit=0 Len=4
etc
They arrive in one big block, once per hour, so clearly it is the result of some kind of scan the controller is doing. I have downloaded the error log from the RAID controller but it too is just cryptic enough to be useless to me.

So, to recap... a previously operational RAID 10 array on ports 4-7,
Western Digital WD1002FBYS 1 TB drives. Very poor disk performance, and now hourly error messages from the RAID controller in the system log. The RAID controller shows the drives to be healthy.
Stumped.
errorlog-0.txt

Expert Comment

by:dlethe
This message is the result of some 3rd party software trying to send a LOG SENSE command (SCSI opcode 0x4D) that the array does not support. The log does not reveal the full command, but it is typical of 3rd party S.M.A.R.T. software. That program is querying the RAID array itself, not the individual drives, about the problem.

This message is NOT an error. This is what is supposed to happen when a SCSI emulated device gets a command it does not support. It is returning the proper data to whatever program sent the request to get log page info.

Are you running some S.M.A.R.T. software not written by 3ware? If so, uninstall it. The software won't work anyway, and it is causing this thing to happen in the first place.
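To illustrate the point about the rejected command, the following is a rough sketch that pulls the opcode out of the `FW: INVALID_OPCODE` lines quoted earlier and looks it up in a small table of standard SCSI operation codes. It assumes the value in that log line is a SCSI opcode, as the explanation above implies. The function name and regular expression are illustrative and not part of any 3ware tool.

```python
import re

# A few standard SCSI operation codes (from the SCSI command set standards).
SCSI_OPCODES = {
    0x00: "TEST UNIT READY",
    0x12: "INQUIRY",
    0x1A: "MODE SENSE(6)",
    0x4C: "LOG SELECT",
    0x4D: "LOG SENSE",
    0x5A: "MODE SENSE(10)",
}

# Matches lines such as: FW: INVALID_OPCODE (opcode=0x4D)
REJECTED = re.compile(r"INVALID_OPCODE \(opcode=(0x[0-9A-Fa-f]+)\)")


def rejected_commands(log_text):
    """Return the names of commands the firmware reported as unsupported."""
    names = []
    for match in REJECTED.finditer(log_text):
        opcode = int(match.group(1), 16)
        names.append(SCSI_OPCODES.get(opcode, "unknown (0x%02X)" % opcode))
    return names
```

Fed the event text from the application log, this would report which unsupported commands the monitoring software keeps sending, which helps confirm that the hourly entries come from a health poll rather than from a drive fault.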

Author Comment

by:JP_TechGroup
Aha, brilliant and so simple. Diskeeper has been trying to check the drive health.
So that mystery is solved, but it doesn't resolve my performance issues.
It took Diskeeper a week of plugging away to finally defrag the drive, and that has helped a great deal, but the reads and writes are still too slow.

Once again, checkdisk came up clean (although it took an hour to run)
and sfc /scannow only made a few adjustments.
I know I am virus free, so what's left? I've never known a RAID card to die a lingering death, and with all of the drive usage I'm sure that if a drive was going to fail it would have done so... what's left?