Mark
asked on
Lost connection with ALL workstations on LAN
I am running Slackware64 14.2 and BIND 9.10.4-P6 in an office LAN. This morning a bit after 8:00AM one user called and said he'd lost Internet access and access to the local shared drive. About 2 hours later another user called, same thing. By shortly after noon 3 more users were having the same problem, but another 3 were accessing Internet and LAN resources OK. I rebooted the DNS server and now nobody has any access at all.
I suspect DNS. I did make a change a couple of days ago to add an 'A' record, but I can't see how that would have done this. I will experiment with restoring the zone file from a couple of days ago.
Here is the interesting wrinkle. Upon restart of named I get the following logged message:
/hprs.loca l.zone has serial # 2014134046. I made 2 separate changes to this file and changed the serial number each time. However, the named log output indicates it is getting a zone file with serial 2014137944. How can that be? I suspect this is part of the problem.
From the DNS server, I can get the IP of any host, but cannot ping:
No workstation on the LAN can connect to external domains, nor can they ping the DNS server or any other host in the LAN.
This is super urgent! Any ideas would be greatly appreciated!!!
More Info: ...
Part of issue resolved. The serial numbers on the zone file were not correct because named was looking in /etc/samba/private/dns instead of the ones I modified in /var/lib/samba/private/dns . This was a result of Slackware changing the location of samba files from Samba version 4.2.14 to 4.4.8. This bit me before. Now when I restart named I do get the correct serial number in the log.
However, I still cannot ping LAN hosts and LAN workstation still cannot connect to the Internet.
I suspect DNS. I did make a change a couple of days ago to add an 'A' record, but I can't see how that would have done this. I will experiment with restoring the zone file from a couple of days ago.
Here is the interesting wrinkle. Upon restart of named I get the following logged message:
May 1 12:49:08 mail named[1666]: managed-keys-zone: loaded serial 0
May 1 12:49:08 mail named[1666]: zone 127.in-addr.arpa/IN: loaded serial 1
May 1 12:49:08 mail named[1666]: zone 0.168.192.in-addr.arpa/IN: loaded serial 379
May 1 12:49:08 mail named[1666]: zone hprs.local/IN: loaded serial 2014137944
May 1 12:49:09 mail named[1666]: zone localhost/IN: loaded serial 2
May 1 12:49:09 mail named[1666]: all zones loaded
The zone for hprs.local has serial # 2014137944, yet the zone file I modified: /var/lib/samba/private/dnsFrom the DNS server, I can get the IP of any host, but cannot ping:
$ host dbserver
dbserver.hprs.local has address 192.168.0.4
$ ping dbserver
PING dbserver.hprs.local (192.168.0.4) 56(84) bytes of data.
From mail.hprs.local (192.168.0.2) icmp_seq=1 Destination Host Unreachable
From mail.hprs.local (192.168.0.2) icmp_seq=2 Destination Host Unreachable
From mail.hprs.local (192.168.0.2) icmp_seq=3 Destination Host Unreachable
From mail.hprs.local (192.168.0.2) icmp_seq=4 Destination Host Unreachable
The DNS server can resolve IPs for external domains, e.g. yahoo.com.No workstation on the LAN can connect to external domains, nor can they ping the DNS server or any other host in the LAN.
This is super urgent! Any ideas would be greatly appreciated!!!
More Info: ...
Part of issue resolved. The serial numbers on the zone file were not correct because named was looking in /etc/samba/private/dns instead of the ones I modified in /var/lib/samba/private/dns
However, I still cannot ping LAN hosts and LAN workstation still cannot connect to the Internet.
Going on your description it does sound to me that the DNS problems are either minor, or a red herring altogether.
I wondering if you have a faulting switch on your network???
I wondering if you have a faulting switch on your network???
ASKER
I'm getting the following for all workstations:
As to your switch/router idea, I'll try powering down all network devices and see what happens.
May 1 15:02:50 mail named[10299]: client 169.254.84.196#56816 (MIKE.hprs.local): query 'MIKE.hprs.local/A/IN' denied
A whois on 169.254.84.196 tells me, "Computers use addresses starting with "169.254." when they do not have a manually configured address or when they are not told which address to use by a service on the network. They are commonly called the "link local" addresses."As to your switch/router idea, I'll try powering down all network devices and see what happens.
ASKER CERTIFIED SOLUTION
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
I second the DCHP comment... Either your DHCP server is not running, or you lost connectivity from the DHCP server to the lan... The last system running still has a valid lease...
ASKER
YES!!!! That was it!! I modified the dhcpd.conf file several days ago to comment out a static IP assignment, but I missed putting a '#' in front of one of the lines. Probably as the leases expired, those workstations lost their connection. If I had looked at /var/log/syslog I would have seen the error. I looked at every relevant log file EXCEPT that one.
Thanks for the help
Thanks for the help
I'm glad that you have the problem resolved! :)
ASKER
However, I still cannot ping LAN hosts and LAN workstation still cannot connect to the Internet.
Note that there is still one workstation (WIN7) that can be pinged:
Open in new window
Very confused! Need help!