Hello Experts
Having a hard time trying to track down the source of one of our servers intermittent freezing
What happens on the odd occasion (3 times last month, 1 time this month so far) is that the server completely 'locks up'
We have a 2 server site setup, so when it happens, we can connect to serverB and ping serverA
We cannot however connect to any resources on ServerA (ServerA runs exchange and some other file shares)
Its a Dell poweredge R710 server, so when I connect to the DRAC - the console responds to mouse, but not keyboard. The only remedy at this point is to restart the server
Once the server starts back up, there is a pysical 'gap' in the event logs. As in - when the server crashes until when the server is back up, there is a gap in all event logs (system, application, security etc)
The only thing that seems to be happening is a VSS start command
Log Name: System
Source: Service Control Manager
Date: 16/03/2012 7:15:01 AM
Event ID: 7036
Task Category: None
Level: Information
Keywords: Classic
User: N/A
Computer: ServerA
Description:
The Volume Shadow Copy service entered the running state.
Shadow copies are disabled via 'My computer' - however we do run shadow protect as a backup solution that runs on the hour (15 minutes past the hour) that backs up the server volumes to a NAS over gigabit network
It's only the odd occasion when the server locks up, but the symptoms are exactly the same
Vss admin list writers show all writers as stable, all system volumes are 0% fragmented, Dell drivers for the RAID controller are up to date
Any other ideas?
This is a tricky one.
I'm assuming the exchange server is freezing intermittently when you try to back it up.
Its all a process of ilimination.
You mention the drivers for the controllers are upto date but have you checked for any backup software updates & service packs.
I would change to job to 2 seperate ones.
Backup the server without backing up the exchange databases then backup the databases as a seperate job using the exchange mailbox backup and not just backup the edb files like you would the other files.
See how you get on then, this could be a problem with the database being scanned by A/V as the snapshot is taken.
initially I would be performing a manual backup daily rather than leaving it automated untill we've tracked down the problem.
if the backup fails, you will have a good idea what part is failing.
while its manual, you have the opertunity to disable the A/V from scanning while you carry out the backup, thats what I would do, but that up to you.
if you find it works ok as 2 seperate jobs then try it automated as 2 jobs for a while.
it very important that your backup solution isnt actually 'file' backing up the exchange database file itself as it should be backing up using exchange backup.
let me know how you get on with that and lets take it from there.
Good luck
Regards
Dave