Solved

question about replacing bad disk on dell powervault 220s raid 5

Posted on 2010-09-22
11
1,646 Views
Last Modified: 2013-11-14
need to replace a "bad" disk on a powervault 220s, currently setup in raid 5
the disk is blinking amber on the array, we have sql data being written to this array, and want to make sure there arent any errors when rebuilding, its an extremely important production system.
any ideas on how to move forward? no support from dell, this device is EOL as of last year.
not the first time im replacing a bad disk in an array, but i need to be 110% positive i dont run into any problems.
in the dell openmanage array manager, this is what shows:
ARRAY DISK 0:0 MAXTOR
ARRAY DISK 0:1 MAXTOR
ARRAY DISK 0:2 MAXTOR
ARRAY DISK 0:3 MAXTOR (BAD DISK)
ARRAY DISK 0:4 MAXTOR
ARRAY DISK 0:5 SEAGATE
ARRAY DISK 0:8 SEAGATE

the ARRAY DISK 0:0 is unallocated, and i can right click on it and select "ASSIGN GLOBAL HOT SPARE", would this be sufficient right now?
also, why are disks 6 and 7 missing?
one more thing, these are all 36gb 15k drives, i also have a 36gb 15k drive from dell but its a hitatchi, this is the one i was going to use to replace the bad drive, would this work? i was told by dell at first, that it wouldnt, then i was told that if they were the ones that shipped out the drives, it would work. i need to make sure
thanks!
0
Comment
Question by:jsctechy
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 4
  • 4
  • 3
11 Comments
 
LVL 11

Expert Comment

by:JoeNuvo
ID: 33733123
make disk 0:0 to become hotspare would help rebuild process to take that as replacement and help any performance impact which may occur.

regarding different brand, I'm not so sure, but in my environment, I've few brand of HDD working together in same unit
Seagate, Maxtor, WD
So far, I never face any problem.
0
 
LVL 1

Author Comment

by:jsctechy
ID: 33733229
so if i make 0:0 the global hot spare, it would start to rebuild right away, right?
im sure it would slow performance, so i would want to do this after hours,
should i do it straight from the array manager? is there another way to do it by taking the unit offline?
0
 
LVL 11

Expert Comment

by:SemperWiFi
ID: 33733395
Yes, make the hotspare change and it begins rebuild straight away.
0
Why Off-Site Backups Are The Only Way To Go

You are probably backing up your data—but how and where? Ransomware is on the rise and there are variants that specifically target backups. Read on to discover why off-site is the way to go.

 
LVL 11

Expert Comment

by:SemperWiFi
ID: 33733427
Start it now, if you lose another disk on a RAID 5 you're in trouble, it's going to take a while anyway so chances of you completing the task completely outside working hours is slim to none.

As far as the multiple brands.. often times various brands of HDDs can be used in one array without much issue noticed by the user because of the controller's and the drive's error correction abilities. For an array to be optimal and in best form it should not only be of the same brand but should be the same firmware version across all disks as well. Sure you can get away without doing this while using the array, recovery though will prove to be a different story. So lets say another drive should fail before you fix this array, with different brands and I assume various firmwares across same branded disks, when you send them off to a recovery company I think you will find you should have been slightly more diligent about these little details being correct.
0
 
LVL 1

Author Comment

by:jsctechy
ID: 33733522
I think we are going to wait til after hours to start this, i was told by our development team that we need to get a full backup (86gb) of what is on the drive before we start the rebuild process with the hot spare. probably do it over the weekend, start friday night, or possibly start sunday morning after a full backup. what do you guys think?
0
 
LVL 11

Expert Comment

by:SemperWiFi
ID: 33733771
I think:

A - This is a RAID 5 which means that with a disk failed you no longer have any protection. If another drive fails while you wait you lose everything and since you have a mixed soup of disks in the array recovery will be difficult if possible at all.
B - 85GB isn't much and it isn't a bad idea at all to grab a quick backup prior to rebuild.
C - This is a RAID 5 and with a disk failed you no longer have any protection.

Today is Wednesday - you want to wait till Friday to rebuild this array? Really?

0
 
LVL 11

Expert Comment

by:JoeNuvo
ID: 33734162
I understand your situation (about have to wait).
some system, is likely not allow to have unschedule downtime.

so, the best you could do for right now is,
change recovery model be full  and do database full backup & very often transaction log backup
or
if your recovery model is anything but full, then do database backup as often as you could

however, as many comment mention above, rebuild should take a while, but with 36g 15k, I'm sure it'll take less than a day

good luck!
0
 
LVL 11

Expert Comment

by:SemperWiFi
ID: 33738188
It's RAID 5, why would there be down time for this rebuild?
0
 
LVL 1

Author Comment

by:jsctechy
ID: 33768477
hey guys
i set the spare disk in the array to GLOBAL HOT SWAP and it started to rebuild with the new drive
when it rebuild was done (only took about 1.5 hours) i checked the status, and the drive that was previously bad/failed, was now showing as "READY" and not part of the array. could this drive still be good? why was it showing as bad in the first place?
0
 
LVL 11

Accepted Solution

by:
JoeNuvo earned 500 total points
ID: 33768570
I don't have technical information to share.
just my experience.

the "bad" drive, sometimes I just remove and put it back into the same slot.  then it start to work again.
but then soon or later, it will turn bad (again).

so, my recommend is not to trust the "ever mark as bad" disk as your primary storage.
0
 
LVL 1

Author Closing Comment

by:jsctechy
ID: 33768579
thanks for your help dude
0

Featured Post

Independent Software Vendors: We Want Your Opinion

We value your feedback.

Take our survey and automatically be enter to win anyone of the following:
Yeti Cooler, Amazon eGift Card, and Movie eGift Card!

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Concerto Cloud Services, a provider of fully managed private, public and hybrid cloud solutions, announced today it was named to the 20 Coolest Cloud Infrastructure Vendors Of The 2017 Cloud  (http://www.concertocloud.com/about/in-the-news/2017/02/0…
In this article we will learn how to backup a VMware farm using Nakivo Backup & Replication. In this tutorial we will install the software on a Windows 2012 R2 Server.
This tutorial will walk an individual through the steps necessary to configure their installation of BackupExec 2012 to use network shared disk space. Verify that the path to the shared storage is valid and that data can be written to that location:…
This tutorial will walk an individual through locating and launching the BEUtility application to properly change the service account username and\or password in situation where it may be necessary or where the password has been inadvertently change…

707 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question