Server room Airconditioning failed
Posted on 2007-11-29
The A/C in our server room failed and we were unaware of it for about two hours. Actually I don't worry about it because there is a stand alone unit as well as the central cooling so if one is off then the other is cooling the room . This morning the Central system was shut down for maintenance( of which i was informed very late) and at the same time( or earlier) the stand alone unit decided to quit. Dont know what caused it. I notice a couple of hours later that the server room(3.5 by 1.6 mts with two racks/ 6 servers /10KVA ups/ switches no ventilation except for the A/C's, Glass sliding doors) is very hot .The room was cooler after opening the doors but there seems to be some damage done on one of the servers. One HDD on this machine ( RAid 5 with 5 SCSI HDD's) is showing a red indication with a cross.The other 4 HDD's are ok with a green light. I checked server and see no problems .Everything is working fine. I checked the Disk MAnagemnet and it says the Logical drive is healthy. I need to know where I can find if something is wrong. Also what does the red cross on the HDD mean( obviously it is still working )Is this caused by the temprature rise or could it be something that was already there and I missed it.
Also the other servers ... what possibilitise are ther that there is damage caused due to the temprature. Is it treu that tempratures upto 60C are bearable by the equipments.Is there anything that I can setup that will monitor the temperature on servers / Server room and notify me.
I also need to know if the servers will shut down if the temperatures rise above limits( before getting damaged) . Is there a way to do this .Would my servers have shutdown if it had continued for another two hours in the same condition. The thing is I do not know what damage the servers took and need some idea of what maight have hapenned.