Still celebrating National IT Professionals Day with 3 months of free Premium Membership. Use Code ITDAY17

x
?
Solved

RH EL4 RAID BOOT ISSUE

Posted on 2007-04-01
4
Medium Priority
?
1,242 Views
Last Modified: 2013-12-15
System has two drives (250GB) set as RAID-0 mirror.

Partitions are formed as:
/dev/hda1 Boot Linux
/dev/hda2 Linux swap
/dev/hda3 Linux raid autodetect

/dev/hdb1 Linux swap
/dev/hdb2 Linux raid autodetect

This is the console screen at boot time:
Decompressing Linux...done.
Booting the kernel.
Red Hat nash version 4.2.1.6 starting
EXT3-fs error (device md0): ext3_find_entry: reading directory #2 offset 0
mount: error 2 mounting none
EXT3-fs error (device md0): ext3_find_entry: reading directory #2 offset 0
EXT3-fs error (device md0): ext3_find_entry: reading directory #2 offset 0
EXT3-fs error (device md0): ext3_find_entry: reading directory #2 offset 0
WARNING: can't access (null)
exec of init ((null)) failed!!!: 14
EXT3-fs error (device md0): ext3_find_entry: reading directory #2 offset 0
unmount /initrd/dev failed: 2
Kernel panic - not syncing: Attempted to kill init!

Here the cursor just blinks indefinately and the Caps and Scroll Lock lights on the keyboard flash.

I need serious help... Thanks in advance...
-greg
0
Comment
Question by:Gregory Miller
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 2
  • 2
4 Comments
 
LVL 27

Accepted Solution

by:
Nopius earned 1500 total points
ID: 18833855
Was everything working and just recently has been broken or you work with installation of Linux?
It might be helpful of you copy-paste entire boot screen here.
That can be easy done with a serial console connectio to your server. Just connect to COM1 with terminal cable, and add in a GRUB kernel flags 'console=tty0 console=ttyS0,38400n8' (edit before boot). Full screen dump might be more informative.
What about your problem RAID-0 is NOT a mirror, it's a striped device that, once corrupted, cant be restored. If you use 'md0' I guess it's a software RAID, because 'md0' is a driver.
If problem is 'just happened' and everithing was OK before, I can guess that you either have dead mount labels on your devices (EL4 uses labels instead of device names for finding appropriate partitions), or you have corrupted 'md' superbloks at the end of each partition from RAID, or you have corrupted filesystem in a working RAID0, or you have changed 'partition type' on device /dev/hda3 or /dev/hdb2.

You may read about more kernel flags in 'md' manual here: http://www.squarebox.co.uk/cgi-squarebox/manServer/md.4

Then you may try to use kernel flags to manualy define RAID: 'ro raid=noautodetect md=0,/dev/hda3,/dev/hdb2'

Your problem is really serious and it whould be nice to have a backup copy of all your data...

0
 
LVL 11

Author Comment

by:Gregory Miller
ID: 18835646
I have gotten it solved. As you mentioned, the problem was VERY serious and I ended up hiring a fellow out of Michigan (not found through EE) to assist over the phone to resolve. Unfortunately, the entire file structure of the RAID was sent to Lost+Found, due to some corruption in the directory, and I now get the job of picking through the scraps to find specific data that is important.

The resolution was to boot to rescue mode and manually assign the RAID and then fsck the RAID.

boot from RH install CD
at prompt: "linux rescue"

This command reconnected the two drives into the RAID. The partitions definition was still intact luckily...
mdadm -Ac partitions -m 0 /dev/md0

This command fixed the issue:
fsck /dev/md0        or it may have been    fsck /dev/hda3   (sorry, it was really late)

About an hour later, voila... All done!
0
 
LVL 11

Author Comment

by:Gregory Miller
ID: 18835664
Noplus,

I saw in your post some detail that the fellow I paid was covering as well. Even though the question was answered yesterday, I will give you the points because of the similarities and I think between us, it would have gotten figured out.

thanks,
-greg
0
 
LVL 27

Expert Comment

by:Nopius
ID: 18840333
Technodweeb, thank you.
Really live assistance via phone is much more helpfull (onsite visit is better).
I am glad that problem is partially resolved (you need to find useful data in lost+found). Now you may use 'file' utility to do a fast check of data type of unknown file.
0

Featured Post

Give Your Engineering Team a Productivity Boost

Learn why container technology is so powerful and how it can provide your team with productivity gains and other benefits.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Little introduction about CP: CP is a command on linux that use to copy files and folder from one location to another location. Example usage of CP as follow: cp /myfoder /pathto/destination/folder/ cp abc.tar.gz /pathto/destination/folder/ab…
Linux users are sometimes dumbfounded by the severe lack of documentation on a topic. Sometimes, the documentation is copious, but other times, you end up with some obscure "it varies depending on your distribution" over and over when searching for …
Learn how to find files with the shell using the find and locate commands. Use locate to find a needle in a haystack.: With locate, check if the file still exists.: Use find to get the actual location of the file.:
Learn how to navigate the file tree with the shell. Use pwd to print the current working directory: Use ls to list a directory's contents: Use cd to change to a new directory: Use wildcards instead of typing out long directory names: Use ../ to move…
Suggested Courses

715 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question