Link to home
Start Free TrialLog in
Avatar of ivyfranklin
ivyfranklin

asked on

How to monitor drive health for a HW RAID 1 Array under linux

I recently setup a Linux server running RH9.  The server includes a SATA RAID controller.  I setup a RAID 1 array with two drives in the BIOS and I used 'linux dd' with an aarich driver disk to install the OS.  Everything seems to be working OK, but I would like to know if there is any way to monitor the health of the array.  For example, if one of the drives fail, how will I know that it failed.  I assume that the computer will continue to function normally (with the RAID array in degraded mode), but I would like to be notified by the machine that one of the drives has failed so that I can replace and rebuild the failed drive.  It would also be nice if the log contained enough information to tell me which of the two drives had failed.  Does anyone have any experience with this setup.  I have included a copy of the output I receive at boot time for the RAID controller.

scsi0 : Vendor: ADAPTEC  Model: AAR-ICHx  Version: 2.01.016
  Vendor: ADAPTEC   Model: RAID 1            Rev: 1.0
  Type:   Direct-Access                      ANSI SCSI revision: 00
Attached scsi disk sda at scsi0, channel 0, id 0, lun 0

Thanks in advance.
Avatar of jlevie
jlevie

Check with Adaptec and see if they have any Linux tools for this. I know that there are Linux tools for some of the SCSI & IDE raid controllers than can  be used for RAID array monitoring and management.
Alright well this is quit simple but you can do it

make a cron job to run every morning before you come in

and in the script write this

mail  -s array_drive_status1 < fsck -t file_system      drive one location  such as /dev/sd0
mail  -s array_drive_status2 < fsck -t file_system      drive one location  such as /dev/sd1

it will check the file system for you and give you the report on the drives in the form of an email every morning
better then another process taking cpu time.
I don't believe this RAID controller will allow the system direct access to the drives. Only the RAID volume(s) will be accessible. To find out if the drives that comprise the RAID volume are healthy requires interrogating the RAID controller.
Avatar of ivyfranklin

ASKER

So is there any way to iterrogate the controller without rebooting the system?
What model RAID controller is it?
The motherboard is a Supermicro P4sCi (http://www.supermicro.com/products/motherboard/P4/E7210/P4SCi.cfm).  I believe the controller is an Adaptec ICH5-R / 6300ESB controller.
Hmm, the 6300ESB is an Intel SATA controller. And a search of Adaptec's site doesn't run up anything for ICH5-R or AAR-ICH. All of the Adaptec RAID products are ID'd with a four-five digit number (e.g., 2410SA, 21610SA, etc). So I don't know what you have.
I think it is an Intel ICH5 controller, that uses Adaptec technology.  Perhaps the Intel chipset talks to an Adaptec RAID chip on the mother board.  Any more ideas?
ASKER CERTIFIED SOLUTION
Avatar of jlevie
jlevie

Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial
ok thanks.  I am also looking into smartmontools to see if I can get any help there.