Solved

RH EL4 RAID BOOT ISSUE

Posted on 2007-04-01
4
1,232 Views
Last Modified: 2013-12-15
System has two drives (250GB) set as RAID-0 mirror.

Partitions are formed as:
/dev/hda1 Boot Linux
/dev/hda2 Linux swap
/dev/hda3 Linux raid autodetect

/dev/hdb1 Linux swap
/dev/hdb2 Linux raid autodetect

This is the console screen at boot time:
Decompressing Linux...done.
Booting the kernel.
Red Hat nash version 4.2.1.6 starting
EXT3-fs error (device md0): ext3_find_entry: reading directory #2 offset 0
mount: error 2 mounting none
EXT3-fs error (device md0): ext3_find_entry: reading directory #2 offset 0
EXT3-fs error (device md0): ext3_find_entry: reading directory #2 offset 0
EXT3-fs error (device md0): ext3_find_entry: reading directory #2 offset 0
WARNING: can't access (null)
exec of init ((null)) failed!!!: 14
EXT3-fs error (device md0): ext3_find_entry: reading directory #2 offset 0
unmount /initrd/dev failed: 2
Kernel panic - not syncing: Attempted to kill init!

Here the cursor just blinks indefinately and the Caps and Scroll Lock lights on the keyboard flash.

I need serious help... Thanks in advance...
-greg
0
Comment
Question by:Technodweeb
  • 2
  • 2
4 Comments
 
LVL 27

Accepted Solution

by:
Nopius earned 500 total points
ID: 18833855
Was everything working and just recently has been broken or you work with installation of Linux?
It might be helpful of you copy-paste entire boot screen here.
That can be easy done with a serial console connectio to your server. Just connect to COM1 with terminal cable, and add in a GRUB kernel flags 'console=tty0 console=ttyS0,38400n8' (edit before boot). Full screen dump might be more informative.
What about your problem RAID-0 is NOT a mirror, it's a striped device that, once corrupted, cant be restored. If you use 'md0' I guess it's a software RAID, because 'md0' is a driver.
If problem is 'just happened' and everithing was OK before, I can guess that you either have dead mount labels on your devices (EL4 uses labels instead of device names for finding appropriate partitions), or you have corrupted 'md' superbloks at the end of each partition from RAID, or you have corrupted filesystem in a working RAID0, or you have changed 'partition type' on device /dev/hda3 or /dev/hdb2.

You may read about more kernel flags in 'md' manual here: http://www.squarebox.co.uk/cgi-squarebox/manServer/md.4

Then you may try to use kernel flags to manualy define RAID: 'ro raid=noautodetect md=0,/dev/hda3,/dev/hdb2'

Your problem is really serious and it whould be nice to have a backup copy of all your data...

0
 
LVL 11

Author Comment

by:Technodweeb
ID: 18835646
I have gotten it solved. As you mentioned, the problem was VERY serious and I ended up hiring a fellow out of Michigan (not found through EE) to assist over the phone to resolve. Unfortunately, the entire file structure of the RAID was sent to Lost+Found, due to some corruption in the directory, and I now get the job of picking through the scraps to find specific data that is important.

The resolution was to boot to rescue mode and manually assign the RAID and then fsck the RAID.

boot from RH install CD
at prompt: "linux rescue"

This command reconnected the two drives into the RAID. The partitions definition was still intact luckily...
mdadm -Ac partitions -m 0 /dev/md0

This command fixed the issue:
fsck /dev/md0        or it may have been    fsck /dev/hda3   (sorry, it was really late)

About an hour later, voila... All done!
0
 
LVL 11

Author Comment

by:Technodweeb
ID: 18835664
Noplus,

I saw in your post some detail that the fellow I paid was covering as well. Even though the question was answered yesterday, I will give you the points because of the similarities and I think between us, it would have gotten figured out.

thanks,
-greg
0
 
LVL 27

Expert Comment

by:Nopius
ID: 18840333
Technodweeb, thank you.
Really live assistance via phone is much more helpfull (onsite visit is better).
I am glad that problem is partially resolved (you need to find useful data in lost+found). Now you may use 'file' utility to do a fast check of data type of unknown file.
0

Featured Post

Complete VMware vSphere® ESX(i) & Hyper-V Backup

Capture your entire system, including the host, with patented disk imaging integrated with VMware VADP / Microsoft VSS and RCT. RTOs is as low as 15 seconds with Acronis Active Restore™. You can enjoy unlimited P2V/V2V migrations from any source (even from a different hypervisor)

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Suggested Solutions

Title # Comments Views Activity
Linux : taking backup different mount points under the same directory 8 67
Best way to virtualise a remote Linux server 2 66
wipe a usb using python 5 44
Linux VM 6 86
Network Interface Card (NIC) bonding, also known as link aggregation, NIC teaming and trunking, is an important concept to understand and implement in any environment where high availability is of concern. Using this feature, a server administrator …
Introduction We as admins face situation where we need to redirect websites to another. This may be required as a part of an upgrade keeping the old URL but website should be served from new URL. This document would brief you on different ways ca…
Learn how to get help with Linux/Unix bash shell commands. Use help to read help documents for built in bash shell commands.: Use man to interface with the online reference manuals for shell commands.: Use man to search man pages for unknown command…
Learn how to find files with the shell using the find and locate commands. Use locate to find a needle in a haystack.: With locate, check if the file still exists.: Use find to get the actual location of the file.:

813 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

11 Experts available now in Live!

Get 1:1 Help Now