LearningToProgram


Network went down completely. Now can't access mapped drives

Hi, I'm tearing my hair out over this one. Up until yesterday we had been having occasional network problems, where the network would go down for a few minutes and then come back up. Then yesterday everyone lost their connection to the server and the internet, and it stayed down.
First I tried pinging the router from the server and got no response, so I bypassed the switch and plugged the server directly into the router. Pings to the router then succeeded only sporadically, so I assumed the router was bad and replaced it with a known good one. The pings were still unreliable, so I disconnected the cable between the switch and the router, leaving only the server plugged into the router, and then it worked consistently. At that point the server could access the internet, but as soon as I plugged the switch back in, it went down again.
So I methodically unplugged the cables going into the switch one at a time and finally isolated the one that was apparently stopping the server from working while the switch was connected. Now all workstations except the one I had to unplug can ping the router and access the internet, BUT only some of them can access their mapped drives on the server. On the others, clicking a mapped drive brings up a prompt to re-enter the user name and password, and even when the correct credentials are entered the workstation still cannot access the drive.
There are a lot of 1054 errors in the event log.
If I look at the DNS on the server, it still has the old IP addresses of the workstations, from BEFORE the new router was installed and they were assigned new IP addresses.
I have no real experience with DNS and Active Directory, but it seems that the problem lies somewhere in that area.
Some of the workstations are set up with the two server IP addresses as their DNS addresses, and some are set up with one server IP address as the primary DNS and our ISP's DNS as the secondary. I don't know if that is relevant.
The server has two NICs, each with a static IP. I don't know if these are set up correctly, as I get an error about having duplicate names.
Any help you can give me in understanding what's causing this problem and how to fix it would be much appreciated!
We use Windows Server 2003 (Domain set up). About 35 workstations with Win XP Pro.
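The "sometimes a response, sometimes not" ping behaviour described above is easier to compare between cabling changes if it is turned into a loss percentage. Below is a minimal sketch, assuming Python 3 is available on some machine on the LAN and that the router answers at 192.168.1.1 (that address is an assumption; the post does not give it). The function names are made up for illustration.

```python
from __future__ import annotations

import platform
import subprocess


def ping_once(host: str, timeout_s: int = 1) -> bool:
    """Send one echo request with the system ping command; True if it exits with 0.

    Caveat: some Windows versions exit with 0 even for "Destination host
    unreachable" replies, so treat the result as approximate.
    """
    if platform.system() == "Windows":
        cmd = ["ping", "-n", "1", "-w", str(timeout_s * 1000), host]
    else:
        # Linux-style flags; other systems may interpret -W differently.
        cmd = ["ping", "-c", "1", "-W", str(timeout_s), host]
    result = subprocess.run(cmd, stdout=subprocess.DEVNULL, stderr=subprocess.DEVNULL)
    return result.returncode == 0


def loss_percent(results: list[bool]) -> float:
    """Percentage of failed probes in a list of ping outcomes."""
    if not results:
        raise ValueError("no ping results to summarise")
    failed = sum(1 for ok in results if not ok)
    return 100.0 * failed / len(results)


def probe(host: str, count: int = 20) -> float:
    """Ping the host count times and return the observed loss percentage."""
    return loss_percent([ping_once(host) for _ in range(count)])


if __name__ == "__main__":
    # 192.168.1.1 is a placeholder for the router's address.
    print(f"loss to router: {probe('192.168.1.1'):.0f}%")
```

Running it once with the switch connected and once with only the server plugged into the router gives two numbers to compare, rather than an impression.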
Miguel Angel Perez Muñoz

Do the two NICs have IPs on the same network segment?
You mentioned that you swapped the router... The old router may have been doing DHCP for the network, and since you switched it the machines may not be getting the proper settings.

The machines need to use the DNS server, which is most likely running on the server. If they do not, you may see authentication failures and "cannot find" errors.

Run ipconfig /all on the client PCs and check that they are on the same subnet as the server and that their settings match what you would expect.
LearningToProgram

ASKER

Drashiel: The server has two NICs, and each has a different static IP: 192.168.1.10 and 192.168.1.20.
mmicha: The new router has the exact same settings as the old router. The workstations are now all set as follows:
Obtain an IP address automatically
DNS server1: 192.168.1.20
DNS server2: our ISP's DNS
This is how they've been set in the past and it worked.
All are on the same subnet
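For anyone wanting to confirm the "same subnet" question for the two server NICs rather than eyeball it, here is a minimal sketch using Python's standard ipaddress module. The 255.255.255.0 mask is an assumption (the thread never states the mask), and same_subnet is just an illustrative name. Two NICs of one server sitting in the same subnet is a commonly cited cause of duplicate-name errors, so this may be worth ruling in or out.

```python
import ipaddress


def same_subnet(addresses, netmask="255.255.255.0"):
    """Return True if every address falls in the same network under netmask."""
    networks = {
        ipaddress.ip_interface(f"{addr}/{netmask}").network for addr in addresses
    }
    return len(networks) == 1


# The two static IPs quoted above; the netmask default is an assumption.
server_nics = ["192.168.1.10", "192.168.1.20"]
print(same_subnet(server_nics))  # True
```

The same function can be fed the addresses from ipconfig /all on a workstation together with the server's address.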
ASKER CERTIFIED SOLUTION
Justin Owens

This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Here are some screenshots from a workstation that can't get onto the network shares: ping-and-ipconfig.pdf, NetworkErrors.pdf
Maybe this will help to narrow it down. I did an nslookup for a workstation and the IP address returned was incorrect. It was the IP address that computer had yesterday, so it seems like this would be a problem. How can I get the DNS on the server to update all of the IP addresses?
I ran ipconfig /registerdns and it supposedly succeeded, but it didn't fix the problem.
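Checking every workstation by hand with nslookup gets tedious with 35 machines. A rough sketch of the same comparison in Python follows; the host names and addresses are made up for illustration, and find_stale_records is a hypothetical helper. Note that socket.gethostbyname goes through the operating system's resolver (hosts file and local cache included), so it is not a pure query against the server's DNS the way nslookup is.

```python
import socket


def find_stale_records(expected, resolve=socket.gethostbyname):
    """Compare resolved addresses with the addresses the hosts actually hold.

    expected: dict of hostname -> IP currently shown by ipconfig on that host.
    Returns a dict of hostname -> (resolved, actual) for every mismatch.
    """
    stale = {}
    for host, actual_ip in expected.items():
        try:
            resolved = resolve(host)
        except OSError:
            resolved = None  # the name did not resolve at all
        if resolved != actual_ip:
            stale[host] = (resolved, actual_ip)
    return stale


if __name__ == "__main__":
    # Hypothetical inventory: fill in real names and current addresses.
    current = {"ws01": "192.168.1.101", "ws02": "192.168.1.102"}
    for host, (resolved, actual) in find_stale_records(current).items():
        print(f"{host}: DNS says {resolved}, machine has {actual}")
```

The resolve parameter is there so the comparison logic can be tried out with a fake lookup table before pointing it at the live network.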
SOLUTION
Hi DrUltima,
I'm not very familiar with the details of AD and the DNS.
Since the router has always been the source of DHCP, I would like to leave that as-is for now, get the network back up and running, and then experiment with making it better. It did work most of the time up until yesterday (it would go for a week or so between outages). Now nobody can access the server (except one person, for some reason).
1. Can you tell me how I would find out if the DNS allows dynamic updates? I did just try manually updating one of the entries to the correct IP address and rebooted that computer, so that it shows up correctly with nslookup. That computer still can't get to the network shares, so it looks like there is a different problem.
2. The router (DHCP server) is not handing out DNS addresses. These are both entered manually in each workstation's network connection settings: the primary is set to the server's IP and the secondary to the ISP's DNS. I don't know why; that's how it was when I took over from the person who previously handled the network.
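On the manually entered DNS addresses: in an Active Directory domain the clients generally need to resolve the domain through the internal DNS server, and an ISP resolver knows nothing about the internal zone. A small sketch for flagging which workstations list an outside DNS server is below. The workstation names and the 203.0.113.53 address are placeholders, not values from this network, and audit_dns_settings is a hypothetical helper.

```python
def audit_dns_settings(workstations, internal_dns):
    """Return {workstation: [outside servers]} for clients listing non-internal DNS."""
    internal = set(internal_dns)
    problems = {}
    for name, servers in workstations.items():
        outside = [s for s in servers if s not in internal]
        if outside:
            problems[name] = outside
    return problems


if __name__ == "__main__":
    # Server addresses come from the thread; everything else is made up.
    internal = ["192.168.1.10", "192.168.1.20"]
    settings = {
        "ws01": ["192.168.1.20", "192.168.1.10"],
        "ws02": ["192.168.1.20", "203.0.113.53"],  # placeholder ISP resolver
    }
    print(audit_dns_settings(settings, internal))
```

The settings dictionary would have to be filled in from ipconfig /all output collected on each machine.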

SOLUTION
They are both set to 'secure only' for dynamic updates.
I removed the ISP DNS addresses from the router and entered the server's DNS address.
And I've removed the ISP DNS addresses from the workstations.
Then I put the ISP's DNS addresses into the server's DNS Zone Transfers settings. I assume this is where they were supposed to go.
Still the same situation: 3 computers have access to the server's shared folders, and the remaining 30+ have no access, just internet access.
Okay, here's some more information. When I removed the cached login credentials on a workstation by setting the number of cached logons to 0, I could no longer log in to the domain, getting the error "the system cannot log you on now because the domain xxx is not available."
I then ran the DCDIAG diagnostic tool and this is the log file. Can you help me figure out how to fix these?


Domain Controller Diagnosis

Performing initial setup:
   Done gathering initial info.

Doing initial required tests
   
   Testing server: Default-First-Site-Name\ULAN-SERVER
      Starting test: Connectivity
         ......................... ULAN-SERVER passed test Connectivity

Doing primary tests
   
   Testing server: Default-First-Site-Name\ULAN-SERVER
      Starting test: Replications
         ......................... ULAN-SERVER passed test Replications
      Starting test: NCSecDesc
         ......................... ULAN-SERVER passed test NCSecDesc
      Starting test: NetLogons
         Unable to connect to the NETLOGON share! (\\ULAN-SERVER\netlogon)
         [ULAN-SERVER] An net use or LsaPolicy operation failed with error 1203, No network provider accepted the given network path..
         ......................... ULAN-SERVER failed test NetLogons
      Starting test: Advertising
         Fatal Error:DsGetDcName (ULAN-SERVER) call failed, error 1355
         The Locator could not find the server.
         ......................... ULAN-SERVER failed test Advertising
      Starting test: KnowsOfRoleHolders
         ......................... ULAN-SERVER passed test KnowsOfRoleHolders
      Starting test: RidManager
         ......................... ULAN-SERVER passed test RidManager
      Starting test: MachineAccount
         ......................... ULAN-SERVER passed test MachineAccount
      Starting test: Services
         ......................... ULAN-SERVER passed test Services
      Starting test: ObjectsReplicated
         ......................... ULAN-SERVER passed test ObjectsReplicated
      Starting test: frssysvol
         ......................... ULAN-SERVER passed test frssysvol
      Starting test: frsevent
         There are warning or error events within the last 24 hours after the

         SYSVOL has been shared.  Failing SYSVOL replication problems may cause

         Group Policy problems.
         ......................... ULAN-SERVER failed test frsevent
      Starting test: kccevent
         ......................... ULAN-SERVER passed test kccevent
      Starting test: systemlog
         An Error Event occured.  EventID: 0xC00010DF
            Time Generated: 03/15/2011   20:46:39
            (Event String could not be retrieved)
         An Error Event occured.  EventID: 0xC00010DF
            Time Generated: 03/15/2011   20:46:42
            (Event String could not be retrieved)
         An Error Event occured.  EventID: 0xC00010DF
            Time Generated: 03/15/2011   21:09:57
            (Event String could not be retrieved)
         An Error Event occured.  EventID: 0xC00010DF
            Time Generated: 03/15/2011   21:16:27
            (Event String could not be retrieved)
         An Error Event occured.  EventID: 0x40000004
            Time Generated: 03/15/2011   21:21:32
            Event String: The kerberos client received a

         An Error Event occured.  EventID: 0x40000004
            Time Generated: 03/15/2011   21:21:33
            Event String: The kerberos client received a

         An Error Event occured.  EventID: 0x40000004
            Time Generated: 03/15/2011   21:21:37
            Event String: The kerberos client received a

         An Error Event occured.  EventID: 0x40000004
            Time Generated: 03/15/2011   21:21:38
            Event String: The kerberos client received a

         An Error Event occured.  EventID: 0x40000004
            Time Generated: 03/15/2011   21:21:38
            Event String: The kerberos client received a

         An Error Event occured.  EventID: 0x40000004
            Time Generated: 03/15/2011   21:21:38
            Event String: The kerberos client received a

         An Error Event occured.  EventID: 0x40000004
            Time Generated: 03/15/2011   21:21:39
            Event String: The kerberos client received a

         An Error Event occured.  EventID: 0xC00010DF
            Time Generated: 03/15/2011   21:26:34
            (Event String could not be retrieved)
         ......................... ULAN-SERVER failed test systemlog
      Starting test: VerifyReferences
         ......................... ULAN-SERVER passed test VerifyReferences
   
   Running partition tests on : DomainDnsZones
      Starting test: CrossRefValidation
         ......................... DomainDnsZones passed test CrossRefValidation
      Starting test: CheckSDRefDom
         ......................... DomainDnsZones passed test CheckSDRefDom
   
   Running partition tests on : ForestDnsZones
      Starting test: CrossRefValidation
         ......................... ForestDnsZones passed test CrossRefValidation
      Starting test: CheckSDRefDom
         ......................... ForestDnsZones passed test CheckSDRefDom
   
   Running partition tests on : Schema
      Starting test: CrossRefValidation
         ......................... Schema passed test CrossRefValidation
      Starting test: CheckSDRefDom
         ......................... Schema passed test CheckSDRefDom
   
   Running partition tests on : Configuration
      Starting test: CrossRefValidation
         ......................... Configuration passed test CrossRefValidation
      Starting test: CheckSDRefDom
         ......................... Configuration passed test CheckSDRefDom
   
   Running partition tests on : UNS
      Starting test: CrossRefValidation
         ......................... UNS passed test CrossRefValidation
      Starting test: CheckSDRefDom
         ......................... UNS passed test CheckSDRefDom
   
   Running enterprise tests on : UNS.lan
      Starting test: Intersite
         ......................... UNS.lan passed test Intersite
      Starting test: FsmoCheck
         Warning: DcGetDcName(GC_SERVER_REQUIRED) call failed, error 1355
         A Global Catalog Server could not be located - All GC's are down.
         Warning: DcGetDcName(TIME_SERVER) call failed, error 1355
         A Time Server could not be located.
         The server holding the PDC role is down.
         Warning: DcGetDcName(GOOD_TIME_SERVER_PREFERRED) call failed, error 1355
         A Good Time Server could not be located.
         Warning: DcGetDcName(KDC_REQUIRED) call failed, error 1355
         A KDC could not be located - All the KDCs are down.
         ......................... UNS.lan failed test FsmoCheck
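A report this long is easier to read once the failures are pulled out. The sketch below, which assumes the report has been saved to a text file (dcdiag.txt is a made-up name), lists the "failed test" lines and decodes each EventID to its low 16 bits, which is the number Event Viewer normally shows. On the report above it should list NetLogons, Advertising, frsevent, systemlog and FsmoCheck. For reference, 0xC00010DF decodes to 4319, which is commonly logged by NetBT for a duplicate name on the network and would fit the duplicate-name error mentioned earlier, though that link is a likely reading rather than something confirmed here. Error 1355 is the Windows code for "the specified domain either does not exist or could not be contacted".

```python
import re

# Matches lines such as "......... ULAN-SERVER failed test NetLogons".
RESULT_LINE = re.compile(r"\.{5,}\s+(\S+)\s+(passed|failed)\s+test\s+(\S+)")
EVENT_LINE = re.compile(r"EventID:\s+(0x[0-9A-Fa-f]+)")


def failed_tests(report):
    """Return (target, test) pairs for every 'failed test' line, in order."""
    return [
        (m.group(1), m.group(3))
        for m in RESULT_LINE.finditer(report)
        if m.group(2) == "failed"
    ]


def event_codes(report):
    """Count events by the low 16 bits of EventID."""
    counts = {}
    for m in EVENT_LINE.finditer(report):
        code = int(m.group(1), 16) & 0xFFFF
        counts[code] = counts.get(code, 0) + 1
    return counts


if __name__ == "__main__":
    # dcdiag.txt is a hypothetical file holding the saved report.
    with open("dcdiag.txt", encoding="utf-8", errors="replace") as fh:
        text = fh.read()
    for target, test in failed_tests(text):
        print(f"FAILED: {test} on {target}")
    for code, count in sorted(event_codes(text).items()):
        print(f"event {code}: {count} occurrence(s)")
```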
SOLUTION
SOLUTION
One more thought.  If you haven't already, change out your key network cables, particularly between the server and the switch, just to eliminate a possible cable problem.
How can I tell if my DNS server is also my AD domain controller?  And is DNS AD-integrated or not?
The IP address you are using for DNS... Is it the same IP address as your Domain Controller?  As far as integrated or not, just look at the properties for the DNS zone.
Hi Dosdet2, thanks for your input. I tried your suggestion about DHCP on the router, and there is no other DHCP server active, so that rules it out.
On the 2 NICs on the server: if I only use one, then what do I use for the secondary DNS address in the router and on each workstation?  Thanks.
Thanks for your help so far.  I've been able to narrow the problem down, so I'm going to close this thread and open a new one as this one is getting very long.