Avatar of ShailendraJadhav
ShailendraJadhav asked on

Replication Issue on Additional Domain Controller Windows 2008 R2

We have 4 Domain Controllers (ABCDCQ1,ABCDCQ2,ABCDCQ3 and ABCDCQ4). In that ABCDCQ1 having all 5 fsmo roles. It’s single domain environment. We are getting replication error on ABCDCQ2 , this domain controller having Additional Domain Controller role only. We want to resolve replication issue on ABCDCQ2.Result of repadmin /showrepl which we ran on ABCDCQ2 is attached and PFA. We are getting replication error on DomainDnsZones partition only rest shows successful.

Please assist us to resolve replication issue on ABCDCQ2.
Replication-issue-on-Additional-.doc
Microsoft Legacy OSMicrosoft Server OSActive Directory

Avatar of undefined
Last Comment
Sandesh Dubey

8/22/2022 - Mon
ASKER
ShailendraJadhav

Would you please look in to this at earliest. We want to provide solution as early as possible to our client.
ASKER CERTIFIED SOLUTION
footech

Log in or sign up to see answer
Become an EE member today7-DAY FREE TRIAL
Members can start a 7-Day Free trial then enjoy unlimited access to the platform
Sign up - Free for 7 days
or
Learn why we charge membership fees
We get it - no one likes a content blocker. Take one extra minute and find out why we block content.
See how we're fighting big data
Not exactly the question you had in mind?
Sign up for an EE membership and get your own personalized solution. With an EE membership, you can ask unlimited troubleshooting, research, or opinion questions.
ask a question
Will Szymkowski

Take a look at the Directory Service Event Logs on the DC in question. Also run dcdiag /c: (Comprehensive Runs all tests) on the DC as well and verify the out-put.

You may also want to reference the link below for more details on troubleshooting AD replicaiton issues...
http://support.microsoft.com/kb/2645996

Hope this helps
stu29

I am with Spec01 .. start with the basics. DCDIAG /TEST:DNS to see if anything is misconfigured in DNS.  If it is try dcdiag /fix
I started with Experts Exchange in 2004 and it's been a mainstay of my professional computing life since. It helped me launch a career as a programmer / Oracle data analyst
William Peck
Sarang Tinguria

Please refer below URL it could have enough information to drive you towards resolution

http://support.microsoft.com/kb/2645996/en-gb
footech

Oh my gosh, how many people are going to provide the same link that I did in the first reply? :)

BTW, although running DCDIAG is always a good step when suspecting a replication problem, in this case it should only echo the problem as shown by repadmin /showrepl.  A problem with DNS wouldn't manifest itself as only a single AD partition failing to replicate and showing corruption.
stu29

ShailendraJadhav ... I think we need some more info to help you.  

Did you check the DS logs?

Did you try to up the logging level? (http://technet.microsoft.com/en-us/library/cc961809.aspx)
Get an unlimited membership to EE for less than $4 a week.
Unlimited question asking, solutions, articles and more.
Sarang Tinguria

@Footech...appologies..-):-):-):-) haven't compared prior posting
Sandesh Dubey

It seems that AD database is corrupted and hence replication failure occuring.this could de due to errors on drive,drivers and firmware not update,AD database corrupt.For possible cause and resolution sse this:http://technet.microsoft.com/en-us/library/replication-error-8451-the-replication-operation-encountered-a-database-error(v=ws.10).aspx

http://social.technet.microsoft.com/Forums/windowsserver/en-US/d05d1174-3193-416f-a1b3-4dd61919f763/repadmin-syncall-error-the-replication-operation-encountered-a-database-error

Check the integrity of AD database if error is reported defrag the AD database:http://technet.microsoft.com/en-us/library/cc816754(v=ws.10).aspx http://support.microsoft.com/kb/232122

Run chkdsk in read only mode to check for drive errors.If error is reported run chkdsk/f to fix the same.Exclude ntds/sysvol/ntfrs folder from AV scan.

Alternately if the issue is not getting fix you can demote the dc forcefully followed by metadata cleanup and promote the server back as DC.

Reference link
Forcefull removal of DC: http://support.microsoft.com/kb/332199
Metadata cleanup: http://www.petri.co.il/delete_failed_dcs_from_ad.htm

Hope this helps
ASKER
ShailendraJadhav

Hello All,
Thank you very much for your immediate response.
1.      Have performed file integrity check, which completed with database error as “Operation terminated with error -1206( JET_errDatabaseCorrupted, Non database file or corrupted db )”.

2.      Have performed defragmentation, which terminated with error as “Operation terminated with error -1605( JET_errKeyDuplicate, Illegal duplicate key )”.

3.      Have performed Semantic Database Analysis with Fixup, errors are reported during Semantic Database Analysis with Fixup. So have ran file maintenance: recover, which completed with output as database recovery is successful.

However when we run repadmin /showrepl, it shows replication fails for DomainDnsZones partition. Please see result only for DomainDnsZones partition as below.

DC=DomainDnsZones,DC=AEESINC,DC=COM
    CLT\ABCDCQ3 via RPC
        DSA object GUID: 7c1e8bc2-8dcf-4ea6-80a3-d5bf6311dd7f
        Last attempt @ 2013-09-10 06:13:41 failed, result 8451 (0x2103):
            The replication operation encountered a database error.
        3260 consecutive failure(s).
        Last success @ 2013-08-07 09:14:51.
    CLT\ABCDCQ4 via RPC
        DSA object GUID: b9ce7848-b161-4882-a797-a0f9a03c2c6b
        Last attempt @ 2013-09-10 06:13:41 failed, result 8451 (0x2103):
            The replication operation encountered a database error.
        3259 consecutive failure(s).
        Last success @ 2013-08-07 09:14:51.
    NAS2\ABCDCQ1 via RPC
        DSA object GUID: 6b81deee-fa46-4f14-ae09-70f4f148be80
        Last attempt @ 2013-09-10 06:15:14 failed, result 8451 (0x2103):
            The replication operation encountered a database error.
        43370 consecutive failure(s).
        Last success @ 2013-08-07 09:09:54.

Please let me know if you have any suggestion.

Thanks.
This is the best money I have ever spent. I cannot not tell you how many times these folks have saved my bacon. I learn so much from the contributors.
rwheeler23
footech

After your step 3 did you perform an offline defrag?
Sandeshdubey posted a link for the procedure, but here it is again.
http://support.microsoft.com/kb/232122

If after the reboot the event logs are still reporting errors and replication still isn't working, I would proceed with the demote/promote.
compdigit44

IS your client running a Windows 2003 AD domain?
http://support.microsoft.com/kb/832851


ALso please review the following link: http://eniackb.blogspot.com/2009/06/active-directory-database.html
compdigit44

Have you checked the problem DC for hardware errors?
Get an unlimited membership to EE for less than $4 a week.
Unlimited question asking, solutions, articles and more.
ASKER
ShailendraJadhav

Thank you all for your kind support.

1. Have already performed offline defragmentation after step3 and also rebooted the server, however still replication is not working.

2. We are using Windows 2008 r2 AD domain.

3. Have already performed hardware check, no any hardware issue we observe on our Dell model server.

Thanks.
stu29

At this point I would have to agree with Footech ... I would be demoting the box >>>> http://www.smallbusinesstech.net/more-complicated-instructions/windows/adding-and-removing-windows-server-2008-r2-domain-controllers
Sandesh Dubey

The log indicates that defrag and integrity failed.As you have multiple DC the best way to deal as suggested by others is to demote the faulty DC and promote the server back as DC.

You cannot demote the faulty DC gracefully you need to do forcefull removal.You need to ran dcpromo/force removal and then run matadata cleanup on other DC(healthy) to remove the instance of faulty DC from AD database and DNS.If faulty DC is fsmo role holder server the you need to seize the FSMO role on other DC.

Once done you can promote the Server back as ADC.Also configure authorative time server role on PDC role holder server.

Reference link
Forcefull removal of DC: http://support.microsoft.com/kb/332199
Metadata cleanup: http://www.petri.co.il/delete_failed_dcs_from_ad.htm
Seize FSMO role: http://www.petri.co.il/seizing_fsmo_roles.htm
Authorative time server: http://support.microsoft.com/kb/816042
Configuring the time service on the PDC Emulator FSMO role holder

Hope this helps
Experts Exchange is like having an extremely knowledgeable team sitting and waiting for your call. Couldn't do my job half as well as I do without it!
James Murphy
compdigit44

This maybe a little of topic but this posting made me think for the following...

If you have a multi-master domain as the user has posted and you start to have issues on NTDS issues on one server would this automatically affect all servers since the NTDS DB is replicated to all servers. There for a corrupt DB on one server would corrupt all of them???

With this in mind I am a bit confused as to how demoting the problem server would correct a corrupt NTDS DB issue on one server in a multi-master environment
footech

@compdigit44 - I'm not really in a position to speak authoritatively on this, but here's my take.  Although I believe it's possible, typically this won't be the case (I'm wondering myself about percentages).  When corruption is detected you will usually see replication stopped which will prevent the spread.  I've heard of people having to manually stop replication to/from a particular server because of corruption, but I've never witnessed that situation myself.
ASKER
ShailendraJadhav

Hi,

When we run chkdsk command on affected server it gives error as "Windows found problems with the file system, run CHKDSK with the /F (fix) option to correct these".

Is this could be the reason for database corruption issue?

Thanks.
Get an unlimited membership to EE for less than $4 a week.
Unlimited question asking, solutions, articles and more.
ASKER
ShailendraJadhav

Demote and Re-promote resolved issue. Thanks to all for yours assistance.
footech

If you haven't already, run the chkdsk.  If it's a physical server with only a single drive (not RAID) run it with the /r switch.  I wouldn't think this would be related since if the file itself was corrupt (and not just something with its data) then I would expect more than just the DomainDNSZones replication being stopped, but I suppose it's possible and the disk errors could cause other problems.
Sandesh Dubey

If ther are errors on disk you need to fix the same.Kindly take backup of server and then proceed with chkdsk/f.
Experts Exchange has (a) saved my job multiple times, (b) saved me hours, days, and even weeks of work, and often (c) makes me look like a superhero! This place is MAGIC!
Walt Forbes
compdigit44

I would also run a fully system diagnostic on this server as well at this point. Vendors like IBM, HP, Dell etc.. provide their own bootable diagnostic tools to scan your server.

Just a suggestion
compdigit44

How did you make out with this?
ASKER
ShailendraJadhav

Demoted and then did metadatacleanup , removed DNS entries , removed entries from sites and services. Repromoted it again
Get an unlimited membership to EE for less than $4 a week.
Unlimited question asking, solutions, articles and more.
compdigit44

Are so still having replication issues post demote / repromote???
Sandesh Dubey

After the DC is promoted enusre to check the health of new dc with dcdiag /q and repadmin /replsum and post the log if error is reported.