Link to home
Start Free TrialLog in
Avatar of Tekati68
Tekati68Flag for United States of America

asked on

Windows Server 2003 PDC not answering domain requests.

I have a severly sick server.  We had two servers on the network but what was the backup server died and the primary server is in bad shape.  I am not sure what to do about it as it runs SQL Server which is no big deal but it also runs Exchange 2003 which is the issue at hand.  I realize fully that it is a bad idea to run pretty much anything else on a domain server especially Exchange so please dont comment on how bad it is.  The damage is done and what I need now is some help on solving these issues.  I have another server that is ready to go if need be if that will help at all.  Below is the message I am getting from dcdiag on the sick server.  If you do not think it will be salvagable without a rebuild then please suggest some tools I can use to help salvage the data from Exchange and SQL I can handle but if you have any better solution for that then please let me know as well.  

Big thank you to anyone who can help with this.  I have to have this solved by Monday morning so what ever you can do is greatly appreciated.

DCDiag output...

Domain Controller Diagnosis

Performing initial setup:
   Done gathering initial info.

Doing initial required tests

   Testing server: Default-First-Site-Name\BIGBOX
      Starting test: Connectivity
         ......................... BIGBOX passed test Connectivity

Doing primary tests

   Testing server: Default-First-Site-Name\BIGBOX
      Starting test: Replications
         [Replications Check,BIGBOX] A recent replication attempt failed:
            From SERVER1 to BIGBOX
            Naming Context: DC=ForestDnsZones,DC=weeksroses,DC=local
            The replication generated an error (1256):
            The remote system is not available. For information about network tr
oubleshooting, see Windows Help.
            The failure occurred at 2008-07-19 20:56:49.
            The last success occurred at 2008-07-18 11:45:48.
            34 failures have occurred since the last success.
         [SERVER1] DsBindWithSpnEx() failed with error 1722,
         The RPC server is unavailable..
         [Replications Check,BIGBOX] A recent replication attempt failed:
            From SERVER1 to BIGBOX
            Naming Context: DC=DomainDnsZones,DC=weeksroses,DC=local
            The replication generated an error (1256):
            The remote system is not available. For information about network tr
oubleshooting, see Windows Help.
            The failure occurred at 2008-07-19 20:56:49.
            The last success occurred at 2008-07-18 11:45:48.
            34 failures have occurred since the last success.
         [Replications Check,BIGBOX] A recent replication attempt failed:
            From SERVER1 to BIGBOX
            Naming Context: CN=Schema,CN=Configuration,DC=weeksroses,DC=local
            The replication generated an error (1753):
            There are no more endpoints available from the endpoint mapper.
            The failure occurred at 2008-07-19 20:56:49.
            The last success occurred at 2008-07-18 11:47:55.
            34 failures have occurred since the last success.
            The directory on SERVER1 is in the process.
            of starting up or shutting down, and is not available.
            Verify machine is not hung during boot.
         [Replications Check,BIGBOX] A recent replication attempt failed:
            From SERVER1 to BIGBOX
            Naming Context: CN=Configuration,DC=weeksroses,DC=local
            The replication generated an error (1753):
            There are no more endpoints available from the endpoint mapper.
            The failure occurred at 2008-07-19 20:56:49.
            The last success occurred at 2008-07-18 11:45:48.
            34 failures have occurred since the last success.
            The directory on SERVER1 is in the process.
            of starting up or shutting down, and is not available.
            Verify machine is not hung during boot.
         [Replications Check,BIGBOX] A recent replication attempt failed:
            From SERVER1 to BIGBOX
            Naming Context: DC=weeksroses,DC=local
            The replication generated an error (1753):
            There are no more endpoints available from the endpoint mapper.
            The failure occurred at 2008-07-19 20:56:49.
            The last success occurred at 2008-07-18 11:58:21.
            34 failures have occurred since the last success.
            The directory on SERVER1 is in the process.
            of starting up or shutting down, and is not available.
            Verify machine is not hung during boot.
         REPLICATION-RECEIVED LATENCY WARNING
         BIGBOX:  Current time is 2008-07-19 20:57:30.
            DC=ForestDnsZones,DC=weeksroses,DC=local
               Last replication recieved from SERVER1 at 2008-07-18 11:50:22.
            DC=DomainDnsZones,DC=weeksroses,DC=local
               Last replication recieved from SERVER1 at 2008-07-18 11:50:22.
            CN=Schema,CN=Configuration,DC=weeksroses,DC=local
               Last replication recieved from SERVER1 at 2008-07-18 11:52:29.
            CN=Configuration,DC=weeksroses,DC=local
               Last replication recieved from SERVER1 at 2008-07-18 11:50:22.
            DC=weeksroses,DC=local
               Last replication recieved from SERVER1 at 2008-07-18 12:02:55.
         ......................... BIGBOX passed test Replications
      Starting test: NCSecDesc
         ......................... BIGBOX passed test NCSecDesc
      Starting test: NetLogons
         ......................... BIGBOX passed test NetLogons
      Starting test: Advertising
         ......................... BIGBOX passed test Advertising
      Starting test: KnowsOfRoleHolders
         ......................... BIGBOX passed test KnowsOfRoleHolders
      Starting test: RidManager
         ......................... BIGBOX passed test RidManager
      Starting test: MachineAccount
         ......................... BIGBOX passed test MachineAccount
      Starting test: Services
         ......................... BIGBOX passed test Services
      Starting test: ObjectsReplicated
         ......................... BIGBOX passed test ObjectsReplicated
      Starting test: frssysvol
         ......................... BIGBOX passed test frssysvol
      Starting test: frsevent
         There are warning or error events within the last 24 hours after the
         SYSVOL has been shared.  Failing SYSVOL replication problems may cause
         Group Policy problems.
         ......................... BIGBOX failed test frsevent
      Starting test: kccevent
         ......................... BIGBOX passed test kccevent
      Starting test: systemlog
         An Error Event occured.  EventID: 0x00000457
            Time Generated: 07/19/2008   20:55:32
            (Event String could not be retrieved)
         An Error Event occured.  EventID: 0x00000457
            Time Generated: 07/19/2008   20:55:33
            (Event String could not be retrieved)
         An Error Event occured.  EventID: 0x00000457
            Time Generated: 07/19/2008   20:55:33
            (Event String could not be retrieved)
         An Error Event occured.  EventID: 0x00000457
            Time Generated: 07/19/2008   20:55:34
            (Event String could not be retrieved)
         An Error Event occured.  EventID: 0x00000457
            Time Generated: 07/19/2008   20:55:34
            (Event String could not be retrieved)
         ......................... BIGBOX failed test systemlog
      Starting test: VerifyReferences
         ......................... BIGBOX passed test VerifyReferences

   Running partition tests on : ForestDnsZones
      Starting test: CrossRefValidation
         ......................... ForestDnsZones passed test CrossRefValidation

      Starting test: CheckSDRefDom
         ......................... ForestDnsZones passed test CheckSDRefDom

   Running partition tests on : DomainDnsZones
      Starting test: CrossRefValidation
         ......................... DomainDnsZones passed test CrossRefValidation

      Starting test: CheckSDRefDom
         ......................... DomainDnsZones passed test CheckSDRefDom

   Running partition tests on : Schema
      Starting test: CrossRefValidation
         ......................... Schema passed test CrossRefValidation
      Starting test: CheckSDRefDom
         ......................... Schema passed test CheckSDRefDom

   Running partition tests on : Configuration
      Starting test: CrossRefValidation
         ......................... Configuration passed test CrossRefValidation
      Starting test: CheckSDRefDom
         ......................... Configuration passed test CheckSDRefDom

   Running partition tests on : weeksroses
      Starting test: CrossRefValidation
         ......................... weeksroses passed test CrossRefValidation
      Starting test: CheckSDRefDom
         ......................... weeksroses passed test CheckSDRefDom

   Running enterprise tests on : weeksroses.local
      Starting test: Intersite
         ......................... weeksroses.local passed test Intersite
      Starting test: FsmoCheck
         Warning: DcGetDcName(GC_SERVER_REQUIRED) call failed, error 1355
         A Global Catalog Server could not be located - All GC's are down.
         Warning: DcGetDcName(TIME_SERVER) call failed, error 1355
         A Time Server could not be located.
         The server holding the PDC role is down.
         Warning: DcGetDcName(GOOD_TIME_SERVER_PREFERRED) call failed, error 135
5
         A Good Time Server could not be located.
         Warning: DcGetDcName(KDC_REQUIRED) call failed, error 1355
         A KDC could not be located - All the KDCs are down.
         ......................... weeksroses.local failed test FsmoCheck


Avatar of ryansoto
ryansoto
Flag of United States of America image

OK server 2 that went down appears to be the only global catalog server.
Make your remaining DC a global catalog immediately
http://support.microsoft.com/kb/313994
This should solve some of the issue.

Also did DC 2 hold the fsmo roles?
If so you need to seize the roles back from server 2 to server 1
http://support.microsoft.com/kb/324801

Now since DC2 died is it coming back up?  If not you need to remove it from Active Directory...
Use meta data clean up - this will remove it from active directory and no longer try and replicate with it
http://technet2.microsoft.com/windowsserver/en/library/012793ee-5e8c-4a5c-9f66-4a486a7114fd1033.mspx?mfr=true

The second server can always be handy.
Personally I would move exchange to it asap.  No worries its not that hard use the migration method
This will also be your chance to move exchange to a member server
http://www.amset.info/exchange/migration.asp


Finally I would get your old server going or something that way you can bring up a second domain controller.  Then also make this server a domain controller.



OOPS  last sentence - I meant then make this server a global catalog sevrer
Avatar of tigermatt
Yep... the reason you're getting those issues is predominantly because the other server is still present in Active Directory. Do as ryan has already said and run a metadata cleanup (provided you won't be bringing it back online) and it should sort a lot of the replication problems.

-tigermatt
Avatar of Tekati68

ASKER

This server does have the catalog server check box checked already.

The second server did hold the fsmo roles at one time but the remaining server has already been assigned all of the fsmo roles.

The second server will not be coming back in it's current form. I will be reinstalling it as a fresh server. It will be running SQL and Exchange but no DC roles. I have one more 2k3 server used as a terminal server for a few remote employees used to access Navision. With that would it or box two be able to handle a backup DC role or would we be better off purchasing a new copy and licenses to build a second stand alone 2k3 DC for backup?  If the last statement were true would it be possible to use 2k8 for that new server or would that not work or introduce too many new challenges?

With all that said and with the exception of running meta cleanup for the second serrver users still cannot log on to computers on the domain as they get the "Domain not reachable" message and yes the remaining server has been rebooted.
ASKER CERTIFIED SOLUTION
Avatar of Tekati68
Tekati68
Flag of United States of America image

Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial
>> The solution was actually pretty simple
It's identifying and finding the solution which takes the time!
Glad you solved it.
Nice work.  For yoru question about bringing the other DC online a 2k8 would be fine to bring up as backup.  Dont make the terminal server a DC in anyway shape or form.