  • Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 577

Exchange 2007 Cluster - Outage Causing "Split Brain" - Lost Emails

We have a two-node Exchange 2007 cluster (CCR) that experienced an uncontrolled outage on one of the nodes today (at 1pm). We have five Information Stores, and four of them came up on the passive node with no problem. The fifth looked at first like it had come up correctly, but then we realized that it was missing emails from the last week.

We then found that Node B (originally passive, now active) showed that Information Store as "Initializing", and that the Information Store's logs had not been replicated from Node A for over a week.

We dismounted the broken Information Store, suspended replication, and moved back to Node A (using the -IgnoreDismount switch). That worked, but we then had all emails from before the outage and none from between 1pm and 5pm, when we took the server down again for maintenance. We then realized that new log files were being created on Node A and conflicted with the log files on Node B. At this point I think we have a "split brain" divergence of the Information Store. We have a backup and snapshot of the .edb and log files from both Node A and Node B, taken before we started troubleshooting, so we can roll back.


Are there any options? Is the best option to get Node A working up to 1pm, re-seed Node B, restore the backup of Node B's .edb and log files into a DR environment, and then export the mail changed since 1pm to a PST (and give it to the users)? Are there better options? Could we use a Recovery Storage Group?
Asked by: rsp_it
1 Solution
 
rsp_it (Author) commented:
We don't think an eseutil /r will work on Node A, since the log file names are the same as on Node B. It feels like a split brain, and our only option is to choose one version of the database and then try to recover as much data as we can from between 1pm and 5pm.
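For anyone who wants to confirm this kind of divergence before picking a side, here is a rough Python sketch that compares two copies of a log folder (for example, the backups taken from each node) and lists the log generations that exist on both sides under the same file name but with different contents. It assumes the log files are named with a three-character prefix followed by eight hex digits (such as E0000000011.log); the function names and the folder paths in the usage note are made up for illustration. It only compares files byte for byte and does not read the ESE log headers, so treat it as a sanity check, not a recovery tool.

```python
import hashlib
import re
from pathlib import Path

# Assumption: closed log files are named <3-char prefix><8 hex digits>.log,
# e.g. E0000000011.log. The current log (E00.log) does not match and is skipped.
LOG_NAME = re.compile(r"^[A-Za-z0-9]{3}([0-9A-Fa-f]{8})\.log$")


def log_generations(folder):
    """Return {generation number: SHA-256 hex digest} for log files in folder."""
    result = {}
    for path in Path(folder).iterdir():
        match = LOG_NAME.match(path.name)
        if match and path.is_file():
            generation = int(match.group(1), 16)
            result[generation] = hashlib.sha256(path.read_bytes()).hexdigest()
    return result


def compare_log_sets(folder_a, folder_b):
    """Compare two log folders by generation number and file contents."""
    a = log_generations(folder_a)
    b = log_generations(folder_b)
    shared = set(a) & set(b)
    return {
        # Same generation number on both sides, different bytes: divergence.
        "conflicting": sorted(g for g in shared if a[g] != b[g]),
        # Generations present on only one side.
        "only_a": sorted(set(a) - set(b)),
        "only_b": sorted(set(b) - set(a)),
    }
```

Called as compare_log_sets(r"D:\backup\NodeA\logs", r"D:\backup\NodeB\logs") (hypothetical paths), a non-empty "conflicting" list would mean both nodes wrote their own version of the same log generation, which is the situation described above.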
rsp_it (Author) commented:
We ended up resolving this ourselves by moving back to Node A, which lost the information from between 1pm and 5pm. We then mounted Node B's version of the database in a Recovery Storage Group and used the "merge" functionality to get the emails from between 1pm and 5pm back.