Link to home
Start Free TrialLog in
Avatar of thomaschalmers
thomaschalmers

asked on

Exchange 2013 VM on Hyper-V BSOD

Hi there,

I have 4 VM's 2012 on Hyper-V and Exchange 2013 on them (2 CAS + 2 MBX) i get a BSOD from time to time on all 4 servers. Event log shows only "The computer has rebooted from a bugcheck." The bugcheck was: 0x000000ef
Event ID 1001.

Memory dump can be found here
https://www.dropbox.com/sh/q46vifj4tpyp1vg/AAC93s1lQR-Jvx6cKvL5sw6da?dl=0 

Thanks!
Untitled.png
Avatar of VB ITS
VB ITS
Flag of Australia image

Have you installed at least Cumulative Update 3 on your Exchange servers? Have a look at this article and see if it applies to you: http://support.microsoft.com/kb/2883203

If it does apply to you, look at upgrading to CU3 at the very least. Ideally though your environment would be up to date (I believe the most current is CU7)
Avatar of thomaschalmers
thomaschalmers

ASKER

Yes, exchange has the most current update CU7 so i don't think article applies...
Have you checked to see if it's the Managed Availability feature that's forcing the server reboots though?

The Managed Availability feature detects issues with your server and in the event of an issue attempts to fix the problematic component. Fixes can range from simple restarts of the component (e.g. restarting the service) to doing a bugcheck.  Bugchecks occur when the earlier simple fixes don't work.

To confirm if Managed Availability is the cause of the bugchecks, run the below command in PowerShell:
(Get-WinEvent -LogName Microsoft-Exchange-ManagedAvailability/* | % {[XML]$_.toXml()}).event.userData.eventXml| ?{$_.ActionID -like "*ForceReboot*"} | ft RequesterName

Open in new window


If you do find that the server is being bugchecked by the ActiveDirectoryConnectivityConfigDCServerReboot responder that's part of the Managed Availability service then run the below command to delay the responder for 60 days:
Add-GlobalMonitoringOverride -Identity Exchange\ActiveDirectoryConnectivityConfigDCServerReboot -ItemType Responder -PropertyName Enabled -PropertyValue 0 -Duration 60.00:00:00

Open in new window

If you would like to do some more reading on the Managed Availability feature you can do so here: http://technet.microsoft.com/en-us/library/dn482056%28v=exchg.150%29.aspx
Thank you for your reply. I have tried that and it look like the ActiveDirectoryConnectivityServerReboot is causing the bugcheck and restart however the property value is not changing with override and stays on 1...
ASKER CERTIFIED SOLUTION
Avatar of VB ITS
VB ITS
Flag of Australia image

Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial
Sorry, forgot to mention that if you have multiple domain controllers then it's best to force replication, then wait about 10-15 minutes for the Microsoft Exchange Health Manager Service to pick up the override setting.
Thanks,

Yes, i tried that command to confirm and it's still on 1. Tried version override as well and same still on 1... :(
override.png
Try forcing AD replication then wait about 10-15 minutes. Re-run the command then to check if the override has taken effect.
Are you seeing clock drift warnings in the logs (blog post on the matter) as well?

If yes, is your PDCe physical or virtual as well?
Download finished.

Mbx and Cas DMP: WinInit.exe
A critical system process died
Arguments:
Arg1: ffffe00191cf2080, Process object or thread object

Both are the same.
I had some problems with the access list on the core switch and they are now fixed. Since then there was no reboot or crash of any kind. Could that be the issue from the beginning? No i do not have any clock drifts and all my DC are virtual.
Is the ActiveDirectoryConnectivityServerReboot responder disabled though?
No, ActiveDirectoryConnectivityServerReboot is not disabled.