Solved

How many drives can fail in a RAID-DP netapp filer aggregate

Posted on 2014-11-01
15
Medium Priority
581 Views
Last Modified: 2016-12-08
Hi, we recently had 2 drives fail.

The RAID still held - and still appeared to have 2 parity drives.

Question: could a 3rd drive have failed without the RAID collapsing or losing data?
Question by:philb19
15 Comments
 
LVL 7

Expert Comment

by:Stampel
ID: 40417021
To answer correctly, I need to know:
How many disks did you have, and how many do you have now?
What RAID level are you using? RAID5/RAID50/RAID6/RAID60/RAID10?
Did you have hot spares?

For example, in a RAID60 with 12 disks (a single RAID0 stripe across three RAID6 sets of 4 disks each), you could lose at most six of the 12 disks (two in each set, if you are lucky), but the array could also fail with only 3 bad disks (all 3 in the same set) if you are very unlucky.
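As a rough sketch of the arithmetic in the 12-disk example above (the function name and parameters are made up for illustration, and hot spares and rebuild timing are ignored), the guaranteed and best-case failure counts for a stripe over identical double-parity sets could be worked out like this:

```python
def failure_tolerance(num_sets, parity_per_set):
    """Return (guaranteed, best_case) disk failures for a stripe over parity sets.

    guaranteed: failures that are always survivable, wherever they land
    best_case:  failures survivable only if they spread evenly across the sets
    """
    if num_sets < 1 or parity_per_set < 0:
        raise ValueError("need at least one set and a non-negative parity count")
    # Worst case: every failure lands in the same set.
    guaranteed = parity_per_set
    # Best case: each set loses exactly as many disks as it has parity.
    best_case = num_sets * parity_per_set
    return guaranteed, best_case


# Three double-parity sets, as in the 12-disk example.
guaranteed, best_case = failure_tolerance(num_sets=3, parity_per_set=2)
print(f"always survivable: {guaranteed}, best case: {best_case}, "
      f"smallest fatal count: {guaranteed + 1}")
```

The gap between the two numbers is the "lucky versus unlucky" range described above: one more failure than the guaranteed count can already be fatal if it hits the wrong set.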

You may want to have a look at these links:
http://www.techrepublic.com/blog/the-enterprise-cloud/raid-50-offers-a-balance-of-performance-storage-capacity-and-data-integrity/
http://www.techrepublic.com/blog/the-enterprise-cloud/understand-when-raid-60-is-overkill/
0
 
LVL 37

Expert Comment

by:Neil Russell
ID: 40417023
Did a hot spare kick in, and did the RAID rebuild onto it?
A triple disk failure in any single RAID group is terminal. You would then need to spend big bucks to get your data recovered or, of course, go back to that very reliable backup that you keep :D
0
 
LVL 37

Expert Comment

by:Neil Russell
ID: 40417024
@Stampel
Please read the question.

This is a NetApp RAID-DP implementation, so what you say is not relevant. There is no "RAID5/RAID50/RAID6/RAID60/RAID10".
0

 
LVL 7

Expert Comment

by:Stampel
ID: 40417027
RAID-DP uses RAID6 and only protects against the loss of 2 disks.
A 3rd failed drive would have collapsed the volume.
0
 
LVL 37

Expert Comment

by:Neil Russell
ID: 40417030
RAID DP is a proprietary implementation and not a standard RAID 6 implementation. I already stated that it tolerates two disk failures.

If you wish to read the technical explanations, see here:

http://community.netapp.com/t5/Tech-OnTap-Articles/Back-to-Basics-RAID-DP/ta-p/86123
0
 
LVL 7

Expert Comment

by:Stampel
ID: 40417038
Read again: you did not state it's RAID6, I did :)
0
 
LVL 37

Expert Comment

by:Neil Russell
ID: 40417041
No, I did not say it was RAID 6, as it is not. It is RAID DP, as stated by the questioner. I did state that a triple disk failure would be terminal.

The objective here is not to argue about specifications; it is to give the questioner clear, concise, accurate information based on reading his/her question. My first reply did so.
0
 
LVL 7

Expert Comment

by:Stampel
ID: 40417053
You are wrong, RAID DP uses RAID6. Check the documentation!
http://www.netapp.com/us/products/platform-os/raid-dp.aspx
0
 
LVL 88

Expert Comment

by:rindi
ID: 40417056
As stated already it is RAID DP, which is a special form or RAID 6. RAID 6 uses 2 parity blocks per stripe, so 2 disks can fail. But the parity of RAID 6 isn't dedicated to certain disks; rather, it is distributed across all of them. With RAID DP, the parity blocks are on two disks dedicated to parity. That is why you still had 2 parity disks running OK when you had the issue: the 2 disks that failed happened to be data disks, not the ones dedicated to parity.

But again as has been mentioned already, if a 3rd disk had failed, you'd have lost the data.
0
 
LVL 37

Expert Comment

by:Neil Russell
ID: 40417060
Typo rindi "As stated already it is RAID DP, which is a special form or RAID 6"

should have said "As stated already it is RAID DP, which is a special form OF RAID 6"
0
 
LVL 88

Expert Comment

by:rindi
ID: 40417084
Of course, I didn't notice... I tend have many typo's.
0
 
LVL 37

Expert Comment

by:Neil Russell
ID: 40417085
Really? I never make Mistikes when i type ;)
0
 
LVL 47

Expert Comment

by:David
ID: 40417094
Technically you COULD have had data loss with even a single HDD failure, but the odds are quite small. HDD parity protection data loss scenarios are based on the premise that 100% of the blocks on the surviving drives are still readable and that the parity is consistent and current.

To make it easy, let's say you just have 2 disks in a RAID1 (mirrored) config. Block #3 just failed on disk A, so it is unreadable. Then disk B dies. Disk A is now the only copy, so you have partial data loss: the data in block #3 is unavailable.

See? Even RAID DP, RAID6, and RAIDZ2 levels don't protect against data loss if the surviving disks have bad blocks in the wrong places.
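The mirror scenario above can be played through in a few lines. This is only a toy model with made-up names (each disk is a dict with an `alive` flag and a set of `bad_blocks`), not how any real array tracks media errors:

```python
def lost_blocks(mirror, num_blocks):
    """Return the block numbers that no surviving mirror member can read."""
    lost = []
    for block in range(num_blocks):
        # A block is readable if at least one live disk holds a good copy.
        readable = any(
            disk["alive"] and block not in disk["bad_blocks"] for disk in mirror
        )
        if not readable:
            lost.append(block)
    return lost


disk_a = {"alive": True, "bad_blocks": {3}}     # survives, but block 3 is bad
disk_b = {"alive": False, "bad_blocks": set()}  # whole disk has died

print(lost_blocks([disk_a, disk_b], num_blocks=8))  # [3]
```

With both disks alive the same call returns an empty list, because disk B still holds a good copy of block 3. The loss only appears once the redundancy is gone.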
0
 
LVL 56

Accepted Solution

by:
andyalder earned 2000 total points
ID: 40417195
Please read the question again, gentlemen: it says "How many drives can fail in a RAID-DP netapp filer **aggregate**", not how many can fail in a RAID-DP disk group. If there are 5 disk groups in the aggregate, then 10 drives could fail before data loss, provided you were very lucky, lost exactly two from each group, and didn't have any bad blocks.
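To make the per-group rule concrete, here is a small sketch that checks whether an aggregate would still hold together for a given spread of failed drives. The function name and the list-of-counts input are illustrative assumptions; it ignores hot spares, rebuilds in progress, and bad blocks on surviving disks:

```python
def aggregate_survives(failed_per_group, max_failures_per_group=2):
    """True if no RAID group has lost more disks than its parity can cover.

    failed_per_group: one failed-disk count per RAID group in the aggregate.
    max_failures_per_group: 2 for double parity, as with RAID-DP.
    """
    return all(failed <= max_failures_per_group for failed in failed_per_group)


# 5 groups, two failures in each: 10 drives gone, aggregate still up.
print(aggregate_survives([2, 2, 2, 2, 2]))  # True

# Only 3 drives gone, but all in the same group: aggregate is lost.
print(aggregate_survives([3, 0, 0, 0, 0]))  # False
```

So the answer depends on which RAID group each failed drive belonged to, not just on how many drives failed in total.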
0
 
LVL 1

Author Comment

by:philb19
ID: 40417241
That's a great answer. I couldn't tell which RAID group they had come from. sysconfig -r just listed 2 broken disks.
0
