Sabrin

dedicated server goes down for no reason

Hello,
I have a dedicated server at my hosting company. It runs CentOS 4.3 with all the updates installed, but it sometimes goes down for no apparent reason, so I have to send a reboot request. Can someone please help or tell me why this is happening?
I have already checked all the cron jobs.
Ibrahim Bazarwala

What does your /var/log/message contain?
Sabrin

ASKER

It's empty.
Sabrin

ASKER

My /var/log/messages shows a lot!
What am I looking for?
What does "sometimes goes down for no reason" mean? Does it hang, or something else?
Can you post the contents here?
Look at what errors it shows before you restart the server.
As usual in such cases: have you checked your RAM already? http://www.memtest.org/
Sabrin

ASKER

I only have SSH access as root. How would I test the RAM?
Ask the admin to reboot and test with memtest86, if it's free...

You may try scripts like http://people.redhat.com/dledford/memtest.html
Another test, simple and efficient, is to spawn a kernel compilation with an unlimited number of jobs. http://linuxmafia.com/faq/VALinux-kb/ram-testing.html
But note: if the test fails, you know the RAM is broken; if the test passes, you know nothing, in fact.
On the other hand, if memtest86 reports no errors for, say, 1 hour, you can be pretty sure it's fine.
Sabrin

ASKER

It's not the memory!
Hi,
Can you open /var/log/messages with nano, press Ctrl+W, type "restart" and press Enter? This should take you to a line saying the system has restarted, with a load of kernel lines below it. If you scroll up a bit and check the entries before the restart, you may be able to find some clues as to why the network cut out.
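
If scrolling through the file by hand gets tedious, the same search can be scripted. This is only a rough sketch, not a standard tool: it assumes a reasonably recent Python 3 (a stock CentOS 4 box ships a much older Python, so you may need to copy the log to another machine first), and the function names `lines_before_restart` and `scan_file` are made up for this example.

```python
from collections import deque


def lines_before_restart(lines, marker="restart", context=20):
    """Yield (matching_line, preceding_lines) for each line containing marker.

    Only the last `context` lines are kept in memory, so large logs are fine.
    """
    recent = deque(maxlen=context)
    for line in lines:
        line = line.rstrip("\n")
        if marker in line:
            yield line, list(recent)
        recent.append(line)


def scan_file(path="/var/log/messages", marker="restart", context=20):
    """Print every line containing marker, followed by the lines before it."""
    with open(path, errors="replace") as fh:
        for hit, before in lines_before_restart(fh, marker, context):
            print("=== " + hit)
            for prev in before:
                print("    " + prev)
```

Calling `scan_file()` as root prints each "restart" line together with the 20 lines logged before it, which is the part worth reading. Bear in mind that a plain substring match also catches routine entries such as the syslogd restart that log rotation produces, so not every hit is a real reboot.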

Thanks.
Sabrin

ASKER

This is what I see:

Nov 21 04:03:33 dedicated syslogd 1.4.1: restart.

Just lines like that, nothing else about the system; only syslogd has "restart".
Hi,
Could you put up a link where I can download your messages file from your server, so I can take a proper look at it for you?

Thanks.
Sabrin

ASKER

Yes, you can download them from here:
members.lycos.co.uk/eehost/messages/
Sabrin

ASKER

There was a reboot request on Nov 20.
Sabrin

ASKER

I gave you the logs from Nov 19 to Nov 21.
Hi,
It looks like there may be an issue with ACPI, which I have known to cause problems on a server I ran before. It may be worth recompiling the kernel without ACPI support and getting ACPI disabled in the BIOS.

Failing that, the only other time I have seen something like what you describe was when all the memory was in use and the swap was too small, causing the server to freeze until it was rebooted. This may not be the case here, but it may be worth checking.
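
A quick way to check the memory and swap situation over SSH is to read /proc/meminfo. The snippet below is just a sketch assuming Python 3 is available; the helper names `parse_meminfo`, `swap_report` and `read_meminfo` are hypothetical, and it only relies on the standard `MemFree`, `SwapTotal` and `SwapFree` fields, which the kernel reports in kB.

```python
def parse_meminfo(text):
    """Parse /proc/meminfo-style text into a {field_name: kilobytes} dict."""
    values = {}
    for line in text.splitlines():
        name, sep, rest = line.partition(":")
        parts = rest.split()
        if sep and parts and parts[0].isdigit():
            values[name.strip()] = int(parts[0])
    return values


def swap_report(info):
    """Summarise free memory and swap usage from a parsed meminfo dict."""
    total = info.get("SwapTotal", 0)
    free = info.get("SwapFree", 0)
    # Avoid dividing by zero on machines with no swap configured.
    used_pct = 100.0 * (total - free) / total if total else 0.0
    return {
        "mem_free_kb": info.get("MemFree", 0),
        "swap_total_kb": total,
        "swap_free_kb": free,
        "swap_used_pct": used_pct,
    }


def read_meminfo(path="/proc/meminfo"):
    """Read and parse the live /proc/meminfo (Linux only)."""
    with open(path) as fh:
        return parse_meminfo(fh.read())
```

Running `print(swap_report(read_meminfo()))` shortly after a reboot and again a few hours later gives a rough idea of whether swap is filling up. A single reading proves little, since the interesting moment is just before the freeze.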

Thanks.
Sabrin

ASKER

Hello talkster5,
Today, Nov 23, at 3am the server stopped responding, so I sent a reboot
request so they could manually reboot it. When the server came
back up I copied the messages file and uploaded it to the site. Could you
please check it one more time to make sure it's the ACPI?
Here: http://members.lycos.co.uk/eehost/messages/
Thanks
Hi,
It looks like there is something wrong with ACPI, but without seeing what actually comes before the restart it is hard to tell whether that is definitely the problem. Could you send me the part of the messages file leading up to the restart as well, please?

Thanks.
Sabrin

ASKER

Is this due to the hardware (and/or bios) combination, or is it a bug in the kernel?
ASKER CERTIFIED SOLUTION
talkster5
Sabrin

ASKER

Is there any way to log everything?
Pretty much everything is already logged, either in messages or in the application's own log file.
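
If you want a bit more detail leading up to a freeze than syslog gives you, one option is to record the load and free memory yourself at regular intervals. The following is a minimal sketch, assuming Python 3; the function names and the log path in the usage note are made up for illustration. It forces every line to disk with `os.fsync` so the last entries have a better chance of surviving a hard hang.

```python
import os
import time


def snapshot(loadavg_text, meminfo_text, now=None):
    """Build one log line from the contents of /proc/loadavg and /proc/meminfo."""
    wanted = ("MemFree", "SwapFree")
    fields = []
    for line in meminfo_text.splitlines():
        name, _, rest = line.partition(":")
        if name in wanted:
            # "   100 kB" becomes "100kB"
            fields.append("%s=%s" % (name, "".join(rest.split())))
    stamp = time.strftime("%Y-%m-%d %H:%M:%S", time.localtime(now))
    load = " ".join(loadavg_text.split()[:3])  # 1, 5 and 15 minute averages
    return "%s load=%s %s" % (stamp, load, " ".join(fields))


def read(path):
    with open(path) as fh:
        return fh.read()


def run(log_path, interval=60):
    """Append a snapshot every `interval` seconds until interrupted."""
    with open(log_path, "a") as out:
        while True:
            out.write(snapshot(read("/proc/loadavg"), read("/proc/meminfo")) + "\n")
            out.flush()
            os.fsync(out.fileno())  # push the line to disk before sleeping
            time.sleep(interval)
```

Started with something like `run("/root/health.log")` (a hypothetical path) in a background session, it leaves a timestamped trail; after the next freeze, the last few lines show whether load or memory was climbing beforehand.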

If you are renting this server from someone, then it really should not be you having to fix the problem, as by the looks of things it has nothing to do with a configuration change you have made.
Sabrin

ASKER

OK, I have disabled ACPI now. Let's see if it freezes again in the next 24 hours!
Sabrin

ASKER

It's not the ACPI; the server keeps freezing.
Man, this sucks.