Want to protect your cyber security and still get fast solutions? Ask a secure question today.Go Premium


NetApp predictive disk failure

Posted on 2014-08-30
Medium Priority
Last Modified: 2014-08-31

I have a 2246 SAS shelf and I'm tired of waiting for disks with predictive failures to fail before hot swapping so my question is how would I properly fail a good disk that is only predicting to fail? Here are some of the commands but I'm not sure which ones to use.

Disk Fail command forces a file system disk to fail and will be selected for Rapid RAID Recovery and copied to a spare. You must use the disk swap command afterwords when using SCSI disks. (we use SAS, does this not apply?)


Disk Replace command can be used to replace a file system disk with a more appropriate spare disk.
Followed by a disk swap?

It's not clear in the man disk pages on which to use in a predictive failure situation.

Thanks in advance

Question by:snyderkv
  • 2
  • 2
  • 2
LVL 42

Assisted Solution

by:Paul Solovyovsky
Paul Solovyovsky earned 1000 total points
ID: 40294747
You can use either or.  Disk replace is less intrusive as it copies valid data before taking the original offline.  Disk fail causes degrading in the raid dp as it has to rebuild the data, not a big deal but just a little more work for the system.

Expert Comment

ID: 40295026
"disk replace" is definitely better. If you are manually replacing disks to avid them failing, the last thing you want to do is to deliberately degrade RAID to replace the disk - the effect of this would be the same as waiting for the disk to fail.

Having said that, ONTAP is very good at managing disks and disk failures. It's Maintenance Center feature can predictively replace disks, take them offline for testing and return them if needed. It is unlikely that you can do a better job manually. Also, it is unlikely NetApp Support will replace disks you remove manually. So my recommendation is to let ONTAP manage disk failures.
LVL 42

Expert Comment

by:Paul Solovyovsky
ID: 40295030
If you call Netapp Support and send them the error with predictive failure on the drives they will send a replacement. Sort of like having a slow leak in a tire, you don't want to wait until it goes flat even though you have a spare.
Technology Partners: We Want Your Opinion!

We value your feedback.

Take our survey and automatically be enter to win anyone of the following:
Yeti Cooler, Amazon eGift Card, and Movie eGift Card!


Author Comment

ID: 40295154
The replace command does not work. It says it's not a file system disk. It's listed as predictive failure however, it shows up under broken disks. For some reason it's predictive but no longer a file system disk or part of an aggr or rdgrp therefore I cannot use the replace command.

I guess I would just blink the drive and hot swap it as if it were a failed drive? Since it did not fail, why did it take it out of the raid group and essentially fial the drive?

Accepted Solution

FilipZahradnik earned 1000 total points
ID: 40295825
If the disk is listed under broken disks, it has already been failed and replaced. It seems that ONTAP used the predictive mechanism to proactively replace the drive before it failed. If you are interested in exactly what happened with that disk, search the messages files for the name of the disk and you will see events describing the exact sequence.

Also, chances are the orange light is on on that disk, so no need to blink.

Author Comment

ID: 40296072

Featured Post


Modern healthcare requires a modern cloud. View this brief video to understand how the Concerto Cloud for Healthcare can help your organization.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

When we purchase storage, we typically are advertised storage of 500GB, 1TB, 2TB and so on. However, when you actually install it into your computer, your 500GB HDD will actually show up as 465GB. Why? It has to do with the way people and computers…
The business world is becoming increasingly integrated with tech. It’s not just for a select few anymore — but what about if you have a small business? It may be easier than you think to integrate technology into your small business, and it’s likely…
This video Micro Tutorial explains how to clone a hard drive using a commercial software product for Windows systems called Casper from Future Systems Solutions (FSS). Cloning makes an exact, complete copy of one hard disk drive (HDD) onto another d…
This Micro Tutorial will teach you how to reformat your flash drive. Sometimes your flash drive may have issues carrying files so this will completely restore it to manufacturing settings. Make sure to backup all files before reformatting. This w…
Suggested Courses
Course of the Month14 days, 20 hours left to enroll

577 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question