Solved

Linux machine shuts down by itself

Posted on 2013-05-13
Last Modified: 2013-05-21
Hi

We have an older Linux machine running SLES 9.  Sometimes the machine shuts down by itself during the night.  I don't see any useful info in /var/log/messages or /var/log/messages-yyyymmdd.gz.  How do I find out the reason?  Where can I check for useful log info?

Thanks.
Question by:asugri
6 Comments
 
LVL 87

Accepted Solution

by:rindi
rindi earned 400 total points
ID: 39163800
Uncontrolled shutdowns like that are usually caused by hardware problems or overheating.

Clean out all the dust from your server and make sure all the fans run smoothly. Test the RAM using memtest86+. Most Linux distros include it in their boot menu; if yours doesn't, boot using the UBCD:

http://ultimatebootcd.com
http://pharry.org/data/ubcd523.iso

Also test the hard disks. If you are using a RAID controller, some have built-in options to test them; if not, the manufacturer's diagnostics are also included on the CD above. Even if the RAID controller has no built-in diagnostics, it should at least tell you what state the disks are in, and if it tells you a disk is bad, replace it.
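
Before opening the case, it can help to confirm from the logs that the shutdowns really are uncontrolled. The following is a minimal sketch, not a definitive tool: it walks through syslog lines and labels each boot as "clean" if a shutdown marker was logged before it, or "abrupt" if the log simply stops and then starts again. The marker strings in CLEAN_STOP and BOOT_START are assumptions about what the syslog daemon writes and will probably need adjusting for your system; read_log and classify_boots are hypothetical helper names.

```python
import gzip
import re

# Assumed marker strings; adjust them to what your syslog daemon really logs.
CLEAN_STOP = re.compile(r"exiting on signal 15|shutting down", re.IGNORECASE)
BOOT_START = re.compile(r"syslogd .*restart|syslog-ng .*starting up", re.IGNORECASE)


def read_log(path):
    """Return the lines of a plain or gzip-compressed log file."""
    opener = gzip.open if str(path).endswith(".gz") else open
    with opener(path, "rt", errors="replace") as handle:
        return handle.read().splitlines()


def classify_boots(lines):
    """Label each boot marker by whether a clean-stop marker preceded it.

    Returns a list of (kind, last_line_before_boot, boot_line) tuples where
    kind is "clean", "abrupt", or "unknown" (boot marker is the first line).
    """
    results = []
    saw_clean_stop = False
    previous = None  # last line seen before the current one
    for line in lines:
        if BOOT_START.search(line):
            if previous is None:
                kind = "unknown"
            elif saw_clean_stop:
                kind = "clean"
            else:
                kind = "abrupt"
            results.append((kind, previous, line))
            saw_clean_stop = False
        elif CLEAN_STOP.search(line):
            saw_clean_stop = True
        previous = line
    return results
```

Calling classify_boots(read_log("/var/log/messages")) and looking at the "abrupt" entries shows the last line logged before each unexplained restart. If nothing unusual precedes them, that points toward power, heat, or other hardware causes rather than something the OS did.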
 
LVL 1

Assisted Solution

by:ganesh4282
ganesh4282 earned 100 total points
ID: 39165599
You can configure diskdump to capture a core dump, but you would need to send the core dump to Novell to find the root cause.
 

Author Comment

by:asugri
ID: 39170253
Rindi,

The machine was purchased about 8 years ago.  We have only limited information about it now.  I believe it has a RAID.  How do I find out which RAID it has and test whether any drive is bad?  More specific steps are very much appreciated.

Ganesh4282,

The OS is outside of its support period, too.  I don't think Novell will handle this case.
Thanks.

 
LVL 87

Assisted Solution

by:rindi
rindi earned 400 total points
ID: 39170483
You'll first have to find out what hardware you have and what RAID controller your server has built in. RAID controllers generally have utilities or options included that tell you the general status of the disks connected to them. If it tells you a disk is bad, replace it. How you should do that also depends on the hardware. Many servers have the disks in hot-swap caddies, and those should be removed while the server is running and replaced with the new disk, and then it should automatically rebuild the array...
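
One way to start identifying the controller from within Linux is to look at the PCI device list. This is a rough sketch, assuming the lspci utility (from pciutils) is installed; it only helps identify the hardware and does not test any disks. The keyword filter is a heuristic, since a hardware RAID controller may be listed as a "RAID bus controller" or as some kind of SCSI controller, and the function names are made up for illustration.

```python
import shutil
import subprocess

# Heuristic keywords; a controller may be listed under either class.
KEYWORDS = ("raid", "scsi")


def storage_controllers(lspci_output):
    """Return the lspci lines that look like RAID or SCSI controllers."""
    return [
        line.strip()
        for line in lspci_output.splitlines()
        if any(word in line.lower() for word in KEYWORDS)
    ]


def run_lspci():
    """Return the output of lspci, or an empty string if it is not installed."""
    if shutil.which("lspci") is None:
        return ""
    done = subprocess.run(["lspci"], capture_output=True, text=True, check=False)
    return done.stdout
```

Printing storage_controllers(run_lspci()) gives the vendor and model strings to search for on the manufacturer's site, which is where the matching management utility or diagnostics would come from.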
 

Author Comment

by:asugri
ID: 39186326
Rindi,

I was hoping you (or somebody) could provide some kind of Linux command to find out more about the RAID.  Perhaps I will post another question.

Thanks.
 
LVL 87

Expert Comment

by:rindi
ID: 39186619
It differs from one hardware and RAID controller manufacturer to another. Either they provide utilities or tools to diagnose their hardware or they don't; it depends on them.

To properly check the state of the disks, you also need to run the diagnostics on them outside of the RAID system. There's no Linux command that can do that.

Only if you are using Linux's built-in software RAID do you have commands to check the state of the array, but even those won't tell you why it failed, or whether the hardware and disks are actually good. For that you again have to run the manufacturer's diagnostics.