Solved

bad drive issues

Posted on 2014-11-10
190 Views
Last Modified: 2016-11-23
I have a client running SBS 2011 on a Power R720. They are running a Dell PowerEdge R710 with 64GB (a waste), with a RAID 1 and a RAID 5 (3 drives). Recently we had a drive fail and everything has gone downhill.

Now Dell support is telling me the issue is related to bad blocks on drive 4, but that the bad blocks have spread to 4 of the 5 drives (by the way, OpenManage is not reporting errors other than on one drive), and that the only way to fix the issue is to copy the data, delete the RAID, replace all the drives, and reinstall Windows from scratch.

Has anybody ever heard of bad blocks spreading between drives on a RAID?

Thanks
Rudy
Question by:rudym88
6 Comments
 
LVL 33

Expert Comment

by:PowerEdgeTech
ID: 40435016
It's called a puncture. RAID replicates data/parity across the disks, so if one block is bad or unreadable, what does the controller do? It either replicates bad data or fails a drive or even the array. This is why RAID is not a backup strategy.
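To make the puncture idea concrete, here is a minimal Python sketch of RAID 5 style XOR parity on a toy 3-drive stripe. It is only an illustration under simplifying assumptions (tiny in-memory blocks, hypothetical helper names `xor_blocks` and `rebuild_block`), not how a PERC controller is actually implemented. It shows that a rebuild needs every surviving block in the stripe to be readable; if one of them is bad, that stripe cannot be reconstructed.

```python
from functools import reduce


def xor_blocks(blocks):
    """XOR equal-length byte blocks together (RAID 5 parity arithmetic)."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


def rebuild_block(surviving):
    """Rebuild the failed member's block for one stripe.

    `surviving` holds the blocks read from the remaining drives; None stands
    for a block that came back unreadable (a bad block). Returns the rebuilt
    bytes, or None when the stripe cannot be reconstructed (a puncture).
    """
    if any(block is None for block in surviving):
        return None
    return xor_blocks(surviving)


# Toy stripe: two data blocks plus their parity block.
d0 = b"\x11\x22\x33\x44"
d1 = b"\xa0\xb0\xc0\xd0"
parity = xor_blocks([d0, d1])

# The drive holding d1 fails and the other members read cleanly.
healthy = rebuild_block([d0, parity])

# Same failure, but d0 turns out to have a latent bad block.
punctured = rebuild_block([None, parity])

print(healthy == d1)   # the lost block is recovered
print(punctured)       # nothing to recover from
```

In the second case there is simply no information left from which to recompute the lost block, which is why the bad area ends up recorded in the rebuilt array rather than repaired.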
 

Author Comment

by:rudym88
ID: 40435071
Can I ask what is the best way to resolve the issue, other than redoing everything?
 
LVL 33

Accepted Solution

by:
PowerEdgeTech earned 500 total points
ID: 40435132
Wiping out the array and recreating (and initializing) it is the only way to create a healthy array to hold your data. The array - the logical arrangement of storage units managed by the controller - is irreparably broken - missing a piece(s). The controller cannot guess at the missing/corrupt area's contents, and there is no software capable of restoring specific pieces of the array like with file backups.

I would also recommend you update all system firmware (iDRAC/LCC, BIOS, PERC, etc.) and drivers, run regular Consistency Checks (at least monthly), and promptly replace faulty drives to help prevent it from happening in the future.
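As a rough illustration of why regular Consistency Checks help, the Python sketch below walks a list of toy stripes and flags any stripe whose parity does not match its data or that contains an unreadable block. The function names and the in-memory layout are assumptions made for this example; a real controller does this in firmware and can also repair what it finds while the array is still redundant.

```python
from functools import reduce


def stripe_is_consistent(data_blocks, parity_block):
    """True when the XOR of the data blocks equals the stored parity."""
    expected = bytes(reduce(lambda a, b: a ^ b, col) for col in zip(*data_blocks))
    return expected == parity_block


def check_consistency(stripes):
    """Return the indexes of stripes that need attention.

    Each stripe is (data_blocks, parity_block); None marks an unreadable block.
    """
    problems = []
    for index, (data_blocks, parity_block) in enumerate(stripes):
        unreadable = parity_block is None or any(b is None for b in data_blocks)
        if unreadable or not stripe_is_consistent(data_blocks, parity_block):
            problems.append(index)
    return problems


stripes = [
    ([b"\x01\x02", b"\x10\x20"], b"\x11\x22"),  # parity matches
    ([b"\x01\x02", None], b"\x11\x22"),         # latent bad block
    ([b"\x01\x02", b"\x10\x20"], b"\xff\xff"),  # parity mismatch
]

problems = check_consistency(stripes)
print(problems)
```

Finding the second stripe's bad block during a scheduled check, while every drive is still online, leaves the data recoverable from the other members. Finding it for the first time in the middle of a rebuild is what produces a puncture.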
 
LVL 55

Expert Comment

by:andyalder
ID: 40435586
The only other way to repair a punctured stripe is to overwrite the bad blocks, and that's virtually impossible since your OS may have already written a $err entry into the file allocation table. I'm not sure where you can get to the disk error stats with Dells; with HP, the drive errors are listed in the ADU report. Blasted disk manufacturers don't consider disk read errors a reason to set the pre-failure alert, so it's possible to run with a flaky disk or two without knowing it; then a disk fails and the rebuild doesn't complete when it is replaced.
 

Author Comment

by:rudym88
ID: 40446416
That's what I was afraid of. Let me ask: how can I prevent this from happening?
 
LVL 33

Expert Comment

by:PowerEdgeTech
ID: 40446614
I would also recommend you update all system firmware (iDRAC/LCC, BIOS, PERC, etc.) and drivers, run regular Consistency Checks (at least monthly), and promptly replace faulty drives to help prevent it from happening in the future.

Question has a verified solution.