Possible Disk Errors on Server 2012 R2 Virtual Domain Controller?
Posted on 2016-11-07
I have a PDC running on Windows Server 2012 R2 as a VM on ESXi, on a PERC 700 series RAID 1 array. I am using Windows Server Backup to run a nightly system state backup of it. It looks like the backup hung on Friday and stayed hung until today. I have restarted the server and I am hoping the backup will simply run tonight without any issues.
However, I am also seeing some more ominous and unfamiliar errors in the Windows Event Viewer.
THESE ERRORS OCCURRED AROUND 2 AM, WHEN THE SERVER BACKUP RUNS
1. The backup seemed to fail and get stuck on the following VSS writer: "Dhcp Jet Writer..."
2. The backup operation that started at '2016-11-07T07:00:01.018843100Z' has failed because another backup or recovery operation is in progress. Please stop the conflicting operation, and then rerun the backup operation.
3. The volume \\?\Volume*\ was not optimized because an error was encountered: Neither Slab Consolidation nor Slab Analysis will run if slabs are less than 8 MB. (0x8900002D)
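The timestamp in error 2 is in UTC (the trailing Z), so it only lines up with the 2 AM backup window after conversion to local time. Below is a minimal Python sketch of that conversion. The UTC-5 offset is purely an assumption for illustration (the server's actual time zone is not stated here), and `parse_event_timestamp` is a hypothetical helper name.

```python
from datetime import datetime, timedelta, timezone

def parse_event_timestamp(stamp: str) -> datetime:
    """Parse an event timestamp such as '2016-11-07T07:00:01.018843100Z'.

    The fraction has 9 digits, which is more than datetime stores,
    so it is truncated to microseconds (6 digits).
    """
    body = stamp.rstrip("Z")
    date_part, _, fraction = body.partition(".")
    micro = int((fraction + "000000")[:6])
    parsed = datetime.strptime(date_part, "%Y-%m-%dT%H:%M:%S")
    return parsed.replace(microsecond=micro, tzinfo=timezone.utc)

# ASSUMPTION: the server's local zone is UTC-5; adjust to the real offset.
ASSUMED_LOCAL = timezone(timedelta(hours=-5))

utc_time = parse_event_timestamp("2016-11-07T07:00:01.018843100Z")
local_time = utc_time.astimezone(ASSUMED_LOCAL)
print(local_time.isoformat())
```

Under that assumed offset, the failed backup attempt falls at the start of the 2 AM hour, which would be consistent with the scheduled job colliding with the one that was still hung.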
THESE ERRORS OCCURRED RIGHT AFTER THE FIRST TIME I GRACEFULLY RESTARTED THE SERVER
4. An error was detected on device \Device\Harddisk1\DR37 during a paging operation.
5. The system failed to flush data to the transaction log. Corruption may occur in VolumeId: \\?\Volume*, DeviceName: \Device\HarddiskVolume76.
(A device which does not exist was specified.)
6. The default transaction resource manager on volume \\?\Volume* encountered a non-retryable error and could not start. The data contains the error code.
7. Errors 4 to 6 repeat for a while until this last error occurs, after which they stop:
Volume Shadow Copy Service error: Unexpected error calling routine RegOpenKeyExW(-2147483646,SYSTEM\CurrentControlSet\Services\VSS\Diag,...). hr = 0x80070005, Access is denied.
8. Some new errors started to occur. From what I found by searching, they could indicate volume resizing errors (I didn't resize anything) or mouse pointer or touch display issues. Event ID 265: A pointer device did not report a valid unit of angular measurement.
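For reference, the raw numbers in the VSS error in item 7 can be decoded by hand. The sketch below is a minimal Python illustration, not a diagnostic tool: it reinterprets the signed value passed to RegOpenKeyExW as an unsigned 32-bit registry root handle and splits the HRESULT into its standard fields. The helper names `to_unsigned32` and `decode_hresult` are hypothetical; the handle values are the predefined root keys from the Windows SDK headers.

```python
def to_unsigned32(value: int) -> int:
    """Reinterpret a (possibly negative) integer as an unsigned 32-bit value."""
    return value & 0xFFFFFFFF

def decode_hresult(hr: int) -> dict:
    """Split an HRESULT into severity bit, facility (11 bits), and code (16 bits)."""
    hr = to_unsigned32(hr)
    return {
        "failure": bool(hr >> 31),       # top bit set means failure
        "facility": (hr >> 16) & 0x7FF,  # 7 is FACILITY_WIN32
        "code": hr & 0xFFFF,             # 5 is ERROR_ACCESS_DENIED
    }

# Predefined registry root handles as defined in the Windows SDK.
ROOT_KEYS = {
    0x80000000: "HKEY_CLASSES_ROOT",
    0x80000001: "HKEY_CURRENT_USER",
    0x80000002: "HKEY_LOCAL_MACHINE",
    0x80000003: "HKEY_USERS",
}

hive = ROOT_KEYS.get(to_unsigned32(-2147483646), "unknown")
fields = decode_hresult(0x80070005)
print(hive, fields)
```

So the message says VSS was denied access when opening SYSTEM\CurrentControlSet\Services\VSS\Diag under HKEY_LOCAL_MACHINE. On its face that is a registry permission failure rather than a disk read or write failure.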
I restarted the server a second time after this and these frightening errors did not occur again, with the exception of error 8, which stops once the computer has finished booting.
What's going on here? I have not changed anything on this server, other than the day-to-day updating of user accounts and DNS. It is running on a VMDK which sits on a hardware-backed RAID 1 array. I checked the hardware status for that ESXi host in vCenter Server and everything is green. I also ran a check disk on the C: drive and no problems were found.
My working theory is that all these disk errors are related to the hung Server Backup job not being able to write to the Microsoft virtual hard disk file that it uses for its backup, and that after the restart it simply gives up. Or should I migrate the VM to another host and datastore?
Thanks for the info :-)