Solved

linux machine shutdown by itself

Posted on 2013-05-13
6
338 Views
Last Modified: 2013-05-21
Hi

We got an older linux running SLES9.  Sometimes the machine got shut down during the night by itself.  I don't quite see useful info from /var/log/messages and /var/log/messages-yyyymmdd.gz.  How do I find out the reason?  Where can I check for useful log info?

Thanks.
0
Comment
Question by:asugri
  • 3
  • 2
6 Comments
 
LVL 88

Accepted Solution

by:
rindi earned 400 total points
ID: 39163800
Uncontrolled shutdowns like that are usually caused by hardware problems and overheating.

Clean out all the dust from your server and make sure all the fans run smoothly. Test the RAM using memtest86+. On most Linux distro's it is included with it's boot menu, if it isn't in yours, boot using the UBCD:

http://ultimatebootcd.com
http://pharry.org/data/ubcd523.iso

Also test the HD's. If you are using a RAID controller, some have built-in options to test them, if not, the manufacturer's diagnostics are also included on the CD above. If the RAID controller doesn't have built-in diagnostics available, it should at least tell you what state the disks are in, and if it tells you a disk is bad, replace it.
0
 
LVL 1

Assisted Solution

by:ganesh4282
ganesh4282 earned 100 total points
ID: 39165599
You can configure Disk dump.. But you need to send the coredump to Novell to find the root cause.
0
 

Author Comment

by:asugri
ID: 39170253
Rindi,

The machine was purchased about 8 years ago.  We just got limited info now.  I believe it has a RAID.  How do I find out the RAID and test if any drive is bad?  More specific steps are very much appreciated.  

Ganesh4282,

OS is outside of support period, too.  I don't thin Novell will handle this case.
Thanks.
0
Master Your Team's Linux and Cloud Stack

Come see why top tech companies like Mailchimp and Media Temple use Linux Academy to build their employee training programs.

 
LVL 88

Assisted Solution

by:rindi
rindi earned 400 total points
ID: 39170483
You'll first have to found out what hardware you have, what RAID controller your server has builtin. RAID controllers generally have utilities or options included that tell you a general status of the disks connected to them. If it tells you a disk is bad, replace it. Also how you should do that depends on the hardware. Many servers have the disks in hot-swap caddies, and those should be removed while the server is running and replaced with the new disk, and then it should automatically rebuild the array...
0
 

Author Comment

by:asugri
ID: 39186326
Rindi,

I was hoping you (or somebody) can provide some kind of linux command to find out more about the RAID.  Perhaps I will post another question.

Thanks.
0
 
LVL 88

Expert Comment

by:rindi
ID: 39186619
It is different from hardware and RAID controller manufacturer to manufacturer, they provide you with the utilities or tools to diagnose their hardware, or they don't, it depends on them.

To properly check the state of the disks, you also need to run their diagnostics out of the RAID system. There's no Linux command that can do that.

Only if you are using Linux built-in Software RAID do you have some commands to check the state of the array, but also that won't tell you the reason for it failing, or whether the hardware / disks are actually good or not. For that you again have to run the manufacturer's diagnostics.
0

Featured Post

VMware Disaster Recovery and Data Protection

In this expert guide, you’ll learn about the components of a Modern Data Center. You will use cases for the value-added capabilities of Veeam®, including combining backup and replication for VMware disaster recovery and using replication for data center migration.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Suggested Solutions

Title # Comments Views Activity
Using SSH Through A Bastion Host Transparently (Is the topic) 1 53
gdb doesn't stop on breakpoint 2 67
Choosing CentOS 16 79
error log using ftp 7 38
If you have a server on collocation with the super-fast CPU, that doesn't mean that you get it running at full power. Here is a preamble. When doing inventory of Linux servers, that I'm administering, I've found that some of them are running on l…
I. Introduction There's an interesting discussion going on now in an Experts Exchange Group — Attachments with no extension (http://www.experts-exchange.com/discussions/210281/Attachments-with-no-extension.html). This reminded me of questions tha…
Learn how to find files with the shell using the find and locate commands. Use locate to find a needle in a haystack.: With locate, check if the file still exists.: Use find to get the actual location of the file.:
This demo shows you how to set up the containerized NetScaler CPX with NetScaler Management and Analytics System in a non-routable Mesos/Marathon environment for use with Micro-Services applications.

813 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

20 Experts available now in Live!

Get 1:1 Help Now