• Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 827
  • Last Modified:

Proliant ML310 G1 Debian Squeeze (RAID 1+0)

Hi

I have been setting up an old Prtoliant ML310 G1 with Debian Squeeze using a four disk RAID 1+0 array as storage

We had a few hiccups in the beginning, but seem to be nearing what resembles a stable system: every time we rebooted the system, the smart array 6402 controller would warn us that one of the drives (#4) required a rebuild, and requested that we allow automatic recovery process to kick in. Which we did, each and every time.

Since all hardware was second hand, we assumed that one or more components wasn't up to par. So we had them replaced : the hotplug backplane, the smart array controller, and of course the incriminated disk

There isn't much in terms of data on the drives since all we've done so far is install Debian, and a basic email system composed of Postfix, Dovecot, and Webmin. The server is still idle in our workshop

In the beginning I wasn't too observant of the logical drive's rebuild status before rebooting. Then I began to monitor it using HPACUCLI

The rebuiild process doesn't seem clear to me. Sometimes the progress seems linear and at other times it looks as if it keeps restarting from the beginning at different moments throughout the day.

I finally loaded the controller's Option Rom Confiuration Utility (F8) at boot and allowed the rebuild process to complete. Which it it did. The system rebooted normally, without any error or warning. Great !

Twelve hours later I decided to reboot the system (shutdown -r now) and lo and behold the smart array warning is back, same drive, same error.

So my question is: must I do anything special before rebooting the server in order to allow the logical drive to remain in proper shape during a reboot ?

I can obviously replace the 4th drive once again, who knows, maybe I still have a duff drive in the system.

Thanks

yann
0
Yann Shukor
Asked:
Yann Shukor
  • 4
1 Solution
 
DrDamnitCommented:
You did everything right. it's not supposed to be that complicated. Something isn't quite working right, and you've already replaced the backplane and controller. I would Replace the drive next. I but them cheap off eBay in instances like this. Usually from the natecds store if he had them, otherwise, from whoever had what I need.
0
 
Yann ShukorAuthor Commented:
We broke the RAID, swapped the drives around and recreated a new (RAID 1+0) logical drive
We reinstalled Debian Squeeze with LVM ext4 partitions (except for boot)
Upon rebooting the system requests an automatic recovery for the same 4th drive in slot 3.
0
 
Yann ShukorAuthor Commented:
Does anyone think this situation could be linked to something other than hardware ?

An if not, I'm not sure what else to try.
Replace a potentially weak PSU for instance ?

Maybe RAID10 is too much to ask for,
maybe we should settle for RAID5 on three drives
0
 
Yann ShukorAuthor Commented:
I unplugged the DAT drive that was connected on the same scsi cable
that was connecting the hotplug back-plane
and all seems to be well now...I feel a little silly if this was actually
the source of our error.
0
 
Yann ShukorAuthor Commented:
the situation is fixed
0

Featured Post

Free Tool: SSL Checker

Scans your site and returns information about your SSL implementation and certificate. Helpful for debugging and validating your SSL configuration.

One of a set of tools we are providing to everyone as a way of saying thank you for being a part of the community.

  • 4
Tackle projects and never again get stuck behind a technical roadblock.
Join Now