Solved

What does it mean "One or more logical drives contain a bad stripe: controller 1" (server RAIDmanager)

Posted on 2009-05-07
Last Modified: 2012-05-06
Hi,

I have a RAID 5 array with 4 disks.
The server's RAID manager sent a warning message about a bad stripe:
"One or more logical drives contain a bad stripe: controller 1"

What does it mean?
Question by:Diskal
9 Comments
 
LVL 3

Expert Comment

by:Cherubim
ID: 24326221
It means that your logical drive has an error.
A logical drive is a drive combined out of X physical drives.

In your case it means that one of your 4 disks seems to be bad.
0
 
LVL 2

Expert Comment

by:potva03
ID: 24327313
If you are using an HP server, try checking the SML or ADU so that we can identify whether all the disks have an issue or a single drive is faulty; this depends on the RAID level being implemented.

We can also look in the IML to confirm whether the controller is fine.
0
 

Author Comment

by:Diskal
ID: 24342553
Thank you,

I'm using an IBM server, an "IBM Server xSeries 226".

How do I know which of the disks is damaged?
0
 
LVL 56

Expert Comment

by:andyalder
ID: 24344055
You cannot tell which disk is the problem. A bad stripe on a logical drive generally means that all disks have readable data but the parity doesn't match the data. This can occur with RAID 5 if you have a power outage and the cache battery goes flat before the controller can write the cache contents to disk. The error is most likely found while the controller is doing background parity checking, which most of them do when they are idle. Since the data and parity are inconsistent, the controller doesn't know how to repair the problem. The only way out is to back up what you can and restore after re-initialising the array.
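
To illustrate what "parity doesn't match data" means, here is a minimal Python sketch. It assumes a simplified RAID 5 stripe of three data blocks plus one XOR parity block; the function names and sample bytes are made up for illustration and are not the controller's actual firmware logic.

```python
from functools import reduce

def xor_blocks(blocks):
    """XOR equal-length byte blocks together, byte by byte."""
    return bytes(reduce(lambda a, b: a ^ b, col) for col in zip(*blocks))

def stripe_is_consistent(data_blocks, parity_block):
    """A stripe is consistent when XOR of the data blocks equals the parity."""
    return xor_blocks(data_blocks) == parity_block

# Hypothetical stripe: three data blocks and the parity computed from them.
data = [b"\x01\x02", b"\x10\x20", b"\x0f\x0f"]
parity = xor_blocks(data)
good = stripe_is_consistent(data, parity)

# Simulate an interrupted write: one data block changed, parity left stale.
stale = [b"\xff\x02", data[1], data[2]]
bad = stripe_is_consistent(stale, parity)

print(good, bad)
```

Every block in the stale stripe is still perfectly readable, which is why no single disk gets flagged as failed.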
0
 
LVL 3

Expert Comment

by:Cherubim
ID: 24344931
Some controllers have a re-sync function; I don't know whether yours has it, though.
0
 

Author Comment

by:Diskal
ID: 24345083
I tried to back up with Acronis and got "failed to read data from the disk".

I think that one of the disks has a bad sector.
;(
0
 
LVL 56

Expert Comment

by:andyalder
ID: 24345289
If one of the disks had a bad sector the controller would get the data off the other disks using parity so that can not cause the problem. You wouldn't have a problem if one of the disks had a bad sector. The controller can't reconstruct the data from parity with a bad stripe because it doesn't know which drive has faulty data. It therefore returns a read error to the OS. It is not a disk fault, it is a data mismatch.

Acronis is an imaging product, so it will stop as soon as it reaches the read error. You have to do a file-by-file backup. You should be able to get all but one or two files off it, and you might even get the whole lot, since the bad stripe may not have any data on it. If the file-by-file backup stops when it reaches a bad file rather than carrying on with the rest, you have to delete that file and then try again.
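
As a rough sketch of the file-by-file idea, the Python below copies a directory tree one file at a time and records any file that cannot be read instead of stopping. The function name `copy_tree_skip_errors` and the paths in the usage note are hypothetical; a proper backup tool is preferable where one is available.

```python
import os
import shutil

def copy_tree_skip_errors(src_root, dst_root):
    """Copy files one at a time; collect failures instead of aborting."""
    failed = []
    for dirpath, _dirnames, filenames in os.walk(src_root):
        rel = os.path.relpath(dirpath, src_root)
        target_dir = os.path.normpath(os.path.join(dst_root, rel))
        os.makedirs(target_dir, exist_ok=True)
        for name in filenames:
            src = os.path.join(dirpath, name)
            try:
                shutil.copy2(src, os.path.join(target_dir, name))
            except OSError as exc:
                # A read error on the bad stripe surfaces here as an OSError.
                failed.append((src, str(exc)))
    return failed
```

Calling `copy_tree_skip_errors("D:/data", "E:/rescue")` (example paths only) returns the list of files that could not be copied, which shows which files sit on the bad stripe.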

There may be an option to re-sync but it would require manual intervention to tell which of the disks had the mis-matched data on it. RAID reconstructor from runtime.org may be able to deal with the problem but you still would have no way of knowing which version of the file is bad.

If you imaged all 4 disks separately with RAID reconstructor (you would have to take them off the RAID controller and put them on a non-RAID one for this), you could generate 4 recoveries, each with a disk missing, and compare them.
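
The "4 recoveries, each with a disk missing" idea can be sketched in Python as follows. This is a simplified model that assumes one stripe of three data blocks plus XOR parity held in a list; the names and sample bytes are hypothetical, and it does not reproduce how RAID reconstructor works internally.

```python
from functools import reduce

def xor_blocks(blocks):
    """XOR equal-length byte blocks together, byte by byte."""
    return bytes(reduce(lambda a, b: a ^ b, col) for col in zip(*blocks))

def candidate_recoveries(stripe):
    """For each disk in turn, treat it as missing and rebuild it from the rest."""
    candidates = []
    for missing in range(len(stripe)):
        others = [blk for i, blk in enumerate(stripe) if i != missing]
        rebuilt = list(stripe)
        rebuilt[missing] = xor_blocks(others)
        candidates.append(rebuilt)
    return candidates

# Hypothetical stripe whose first data block no longer matches the parity.
original = [b"AAAA", b"BBBB", b"CCCC"]
parity = xor_blocks(original)
damaged = [b"AAzA", original[1], original[2], parity]

cands = candidate_recoveries(damaged)
for i, cand in enumerate(cands):
    print("disk", i, "treated as missing:", cand[:3])
```

All four candidates are internally parity-consistent, so parity alone cannot say which one is right; only an outside check on the recovered files can.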

Taking one of the disks out and shoving it back in again will probably cure the bad stripe *but* you would not know if the data was valid or not. You would be randomly selecting which disk had the mismatching stripe element on it.

RAID level 2 would cope with this but nobody uses RAID 2 any more.

You could even ignore the error and continue to use it, the controller will give a read error whenever the bad stripe is encountered and the OS may automatically map it into its bad sector list (it's not a physical bad sector but to the OS it looks like one). chkdsk, fsck etc can probably map the blocks out for you but you may not be able to do an image backup even then.
0
 

Author Comment

by:Diskal
ID: 24347249
Thank you,

I tried to copy file by file and, unfortunately, the mail server's database would not copy.
0
 
LVL 56

Accepted Solution

by:
andyalder earned 2000 total points
ID: 24347384
Oh dear, if the error is right in your main database file, a backup isn't going to help you. Is the database still running? You could run an individual mailbox backup and again hopefully get most of the mail out.

I think you're going to have to do the RAID reconstructor trick. You'll end up with 4 versions of the file, and you may be able to pick the correct version by seeing which one passes a database consistency check. RAID reconstructor may take hours and you need a load of spare disk space for the disk images and restores. Either that, or send the disks off to a data recovery agency, which will cost you a packet. How long ago was your last good backup?
0
