Sabrin
asked on
dedicated server goes down for no reason
hello
I have a dedicated server running on my hosting company it has Centos v4.3 installed with all
the updates an everything but the server sometimes goes down for no reason so i have to
send a reboot request! can someone please help or tell me why is this happening?
all the cron jobs are checked already!
I have a dedicated server running on my hosting company it has Centos v4.3 installed with all
the updates an everything but the server sometimes goes down for no reason so i have to
send a reboot request! can someone please help or tell me why is this happening?
all the cron jobs are checked already!
What does your /var/log/message contains
ASKER
its empty
ASKER
my /var/log/messages shows me a lot!
what im looking for?
what im looking for?
->sometimes goes down for no reason means what it hangs or what
Can u post contents here?
look what error it shows before u restart the server
Can u post contents here?
look what error it shows before u restart the server
As usually in such cases - have You check Your RAM already? http://www.memtest.org/
ASKER
I only have ssh access to the root how would I test the ram ?
Ask the admin to reboot and test with memtest86 - if it's free...
You may try scripts like http://people.redhat.com/dledford/memtest.html
Another test - simple and efficien is to spawn kernel compilation with unlimited job number. http://linuxmafia.com/faq/VALinux-kb/ram-testing.html
But note: if the test fails, You know the RAM is broken, it the test passes - You know nothing in fact.
On the other hand if memtest86 reports no error for say 1hour You pretty sure it's fine.
You may try scripts like http://people.redhat.com/dledford/memtest.html
Another test - simple and efficien is to spawn kernel compilation with unlimited job number. http://linuxmafia.com/faq/VALinux-kb/ram-testing.html
But note: if the test fails, You know the RAM is broken, it the test passes - You know nothing in fact.
On the other hand if memtest86 reports no error for say 1hour You pretty sure it's fine.
ASKER
its not memory!
Hi,
Can you open /var/log/messages with nano and then press "ctrl+w", type in restart and enter. This should take you to a line saying the system has restarted and it will have a load of kernel lines below it. If you scroll up a bit and check before the restart you may be able to find some issues as to why the network has cut out.
Thanks.
Can you open /var/log/messages with nano and then press "ctrl+w", type in restart and enter. This should take you to a line saying the system has restarted and it will have a load of kernel lines below it. If you scroll up a bit and check before the restart you may be able to find some issues as to why the network has cut out.
Thanks.
ASKER
this is what i see
Nov 21 04:03:33 dedicated syslogd 1.4.1: restart.
just lines like that, nothing else about system only syslogd has "restart"
Nov 21 04:03:33 dedicated syslogd 1.4.1: restart.
just lines like that, nothing else about system only syslogd has "restart"
Hi,
Could you put a link up for me to download your messages file or something from your server so I can take a proper look at it for you?
Thanks.
Could you put a link up for me to download your messages file or something from your server so I can take a proper look at it for you?
Thanks.
ASKER
yes you can download them from here
members.lycos.co.uk/eehost /messages/
members.lycos.co.uk/eehost
ASKER
there was a reboot request in nov 20
ASKER
I gave you the logs from nov19 to nov21
Hi,
It looks like there may be an issue with ACPI which I have known to cause problems on a server I had running before. It may be worth recompiling the kernel without support for ACPI and getting it disabled in the bios.
Failing that the only over time I have seen something like you describe is when the memory was all be used and the SWAP was to small causing the server to freeze until the server was rebooted. This may not be the case but it may be something worth checking.
Thanks.
It looks like there may be an issue with ACPI which I have known to cause problems on a server I had running before. It may be worth recompiling the kernel without support for ACPI and getting it disabled in the bios.
Failing that the only over time I have seen something like you describe is when the memory was all be used and the SWAP was to small causing the server to freeze until the server was rebooted. This may not be the case but it may be something worth checking.
Thanks.
ASKER
hello talkster5,
today nov 23 at 3am the server stoped responding so I sent a reboot
request so they can manually reboot the server. when the server came
back up I copied the file messages and uploaded to the site so you
can please check it one more time to make sure its the ACPI
here: http://members.lycos.co.uk/eehost/messages/
thanks
today nov 23 at 3am the server stoped responding so I sent a reboot
request so they can manually reboot the server. when the server came
back up I copied the file messages and uploaded to the site so you
can please check it one more time to make sure its the ACPI
here: http://members.lycos.co.uk/eehost/messages/
thanks
Hi,
It looks like there is something wrong with ACPI but without seeing what is actually before the restart it is hard to tell if that is definatley what the problem is. Could you send me the messages file leading up to the restart as well please.
Thanks.
It looks like there is something wrong with ACPI but without seeing what is actually before the restart it is hard to tell if that is definatley what the problem is. Could you send me the messages file leading up to the restart as well please.
Thanks.
ASKER
Is this due to the hardware (and/or bios) combination, or is it a bug in the kernel?
ASKER CERTIFIED SOLUTION
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
ASKER
is there any way to log everything ?
Pretty much everything is already logged either in messages or the applications own log file.
If you are renting this server from someone then it should not really be you that is having to fix the problem though as it has got nothing to do with a configuration change you have made by the looks of things.
If you are renting this server from someone then it should not really be you that is having to fix the problem though as it has got nothing to do with a configuration change you have made by the looks of things.
ASKER
ok, I have disabled ACPI now lets see if it gets frozen again in this the last 24 hours!
ASKER
its not the ACPI the server keeps getting frozen..
man this sucks
man this sucks