Hi
I have a Dell PowerEdge 2650 that recently had a failed disk and a predicted failed disk. Dell came out and replaced the failed disk and once the Raid 5 had rebuilt this disk we replaced the predicted failed disk. The server has three disks in a raid 5 configuration.
After all the hard drives were rebuilt everything seemed fine until I came into work on the day after the server had a full backup taken. The server had crashed with a BSOD with the error page saying Kernel_stack_inpage_error.
I ran a few full backup tests and during each full backup the server crashed. Incremental backups of the server didn't cause the server to crash.
All the hardware is reporting that it is fine so I tried to run a checkdisk of the operating system to see if there were any problems. It got 1% into the file data check section when it BSOD again.
I looked at the backup logs and saw that the server was able to backup until somewhere in the Windows directory. I scheduled another full backup of the server that exluded the Windows directory and the System State directory and this was able to run successfully without crashing the server.
The server is also becoming unstable in running programs. If I launch IE it will cause the server to lock up and stop responding. It also did the same once when I was trying to use the Control Panel.
It seems to me that there are some corrupt files in the Windows directory that were probably corrupted with the failing disks. The server is about to come into its most busy period of the year so rebuilding the server isn't an option for the immediate future.
I'm hoping that someone can come up with a way to identify the corrupt files and replace them with clean versions without doing any major damage to the operating system or the applications running on them.
Please let me know if your require more information.
Regards
Mathew Archibald
Start Free Trial