Solved

Macs Losing Network Connection - Mixed Environment

Posted on 2012-08-21
Medium Priority
Last Modified: 2013-06-02
Hello,

I am pulling my hair out. We have a network of approximately 30 Windows machines and 12 Macs.

We are having an issue where periodically all of the Macs lose their network connection. They cannot connect to each other, our Mac file server, or the internet. Restarting all of the Macs, including the file server, temporarily resolves the issue. Windows machines are not affected.

The Macs were on DHCP, but I changed them all to static IP addresses. That didn't seem to make a difference.

It seems like the problems are heavily related to the Mac file server (current-gen Mac Mini on Lion Server) because restarting it is key to getting everything going again. If we restart all of the clients but not the server, the network connection does not return. Restarting the server and the clients gets everything going again.

The Mac server is running only AFP file sharing (using two brand new FireWire external hard drives). We are not using directory accounts, nor are any of the Macs joined to the server. No other services are running. The only other thing that runs on the Mac server is Extensis Universal Type Server, but the problem happens even when that is disabled.

We changed to another Mac mini and did a clean install of Lion Server. Everything was set up from scratch, but that did not make a difference.

We also have a Windows domain, but none of the Macs are joined to it.

Our Sonicwall serves IP addresses and handles the routing. We have a Windows server on the network that handles internal DNS.

This issue has been occurring for a few weeks now and just came out of nowhere. I don't think we have changed anything on our network that would affect this.

It is weird because it sounds like a higher-level issue than what an individual machine could cause. If something were off with the Mac file server, I could see us losing connection to it, but how does it also completely hose our clients' network connections to the point that we cannot even get out to the internet? Plus, the issue is Mac only. I would think that if there were network infrastructure problems, the Windows machines would be affected too.

I'm sure I am missing some things that you will need to help troubleshoot the problem, but I will start here. Please let me know what further information you need.

Thanks in advance for any help!
Question by:bnewkirk
2 Comments
 

Accepted Solution

by:wyliecoyoteuk earned 1500 total points
ID: 38319543
The first place to look is the logs on a client and on the server. See if you can find anything happening just prior to the disconnection.


http://www.tech-recipes.com/rx/3521/os-x-how-to-view-log-files/

You may find the same error repeated many times in system.log, for example.
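
To spot repeated errors without reading the whole file, something along these lines could help. This is only a rough sketch: the function name `count_repeated_messages` and the regular expression are illustrative assumptions, and the pattern only handles plain `Mon DD HH:MM:SS host process[pid]: message` lines. Anything in another shape (for example launchd lines with a parenthesised job name) is skipped rather than parsed.

```python
import re
from collections import Counter

# Simplified pattern for classic syslog lines such as:
#   Aug 23 09:51:15 client-mac KernelEventAgent[75]: message text
# Lines that do not fit this shape are ignored.
LINE_RE = re.compile(
    r"^(?P<stamp>\w{3}\s+\d+\s+\d\d:\d\d:\d\d)\s+(?P<host>\S+)\s+"
    r"(?P<proc>[^\[:]+)(?:\[\d+\])?:\s*(?P<msg>.*)$"
)


def count_repeated_messages(lines):
    """Count how often each (process, message) pair appears."""
    counts = Counter()
    for line in lines:
        match = LINE_RE.match(line.rstrip("\n"))
        if match:
            counts[(match.group("proc"), match.group("msg"))] += 1
    return counts
```

Passing the lines of a saved copy of system.log to the function and looking at `counts.most_common(10)` should surface the messages that repeat most often around the time of a dropout.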

It may also be worthwhile checking the Event Viewer on a Windows client as well.

Author Comment

by:bnewkirk
ID: 38325206
I have not checked a Windows client yet, but here is an excerpt from a Mac client's log, from shortly before the problem happened until the machine was shut down:

Aug 23 09:50:16 client-mac auditd[497]: Auditing enabled
Aug 23 09:50:16 client-mac auditd[497]: Got low space trigger
Aug 23 09:50:16 client-mac auditd[497]: auditd_read_dirs(): all audit log directories over soft limit
Aug 23 09:50:16 client-mac auditd[497]: renamed /var/audit/20120823124921.not_terminated to /var/audit/20120823124921.20120823135016
Aug 23 09:50:16 client-mac auditd[497]: New audit file is /var/audit/20120823135016.not_terminated
Aug 23 09:50:16 client-mac _spotlight[501]: audit warning: allsoft
Aug 23 09:50:16 client-mac _spotlight[502]: audit warning: soft /var/audit
Aug 23 09:50:16 client-mac _spotlight[503]: audit warning: closefile /var/audit/20120823124921.20120823135016
Aug 23 09:50:16 client-mac [0x0-0x1d01d].com.apple.mail[281]: Xerox_700.ppd
Aug 23 09:50:16: --- last message repeated 1 time ---
Aug 23 09:50:16 client-mac Mail[281]: PropertyListFile Exists
Aug 23 09:50:16 client-mac [0x0-0x1d01d].com.apple.mail[281]: cp: /private/etc/cups/ppd/Xerox_700.ppd and /private/etc/cups/ppd/Xerox_700.ppd are identical (not copied).
Aug 23 09:50:21 client-mac com.apple.printtool.agent[494]: No log handling enabled - using stderr logging
Aug 23 09:50:21 client-mac com.apple.printtool.agent[494]: snmpget: Failure in sendto (Sub-id not found: mib-2 -> -1) (Host is down)
Aug 23 09:50:21 client-mac com.apple.printtool.agent[494]: snmpget: Failure in sendto (Host is down)
Aug 23 09:50:28 client-mac Mail[281]: Document is now sent for printing
Aug 23 09:50:28 client-mac Mail[281]: Print window is set to close itself
Aug 23 09:50:28 client-mac Mail[281]: kCGErrorIllegalArgument: _CGSFindSharedWindow: WID -1
Aug 23 09:50:28 client-mac Mail[281]: kCGErrorFailure: Set a breakpoint @ CGErrorBreakpoint() to catch errors as they are logged.
Aug 23 09:50:28 client-mac Mail[281]: kCGErrorIllegalArgument: CGSSetWindowShadowAndRimParametersWithStretch: Invalid window 0xffffffff
Aug 23 09:51:11 client-mac LogMeInGUI[240]: String:WEBSVC|OFFLINE
Aug 23 09:51:15 client-mac KernelEventAgent[75]: tid 00000000 received event(s) VQ_NOTRESP (1)
Aug 23 09:51:15 client-mac KernelEventAgent[75]: tid 00000000 type 'afpfs', mounted on '/Volumes/Art Files', from 'afp_3yZbVG2g3dbh0ocO2T0bzIsm-1.2e000003', not responding
Aug 23 09:51:15 client-mac KernelEventAgent[75]: tid 00000000 type 'afpfs', mounted on '/Volumes/Orders', from 'afp_3yZbVG2g3dbh0ocO2T0bzIsm-2.2e000004', not responding
Aug 23 09:51:15 client-mac KernelEventAgent[75]: tid 00000000 found 2 filesystem(s) with problem(s)
Aug 23 09:51:21 client-mac KernelEventAgent[75]: tid 00000000 received event(s) VQ_NOTRESP (1)
Aug 23 09:51:21 client-mac KernelEventAgent[75]: tid 00000000 type 'afpfs', mounted on '/Volumes/Art Files', from 'afp_3yZbVG2g3dbh0ocO2T0bzIsm-1.2e000003', not responding
Aug 23 09:51:21 client-mac KernelEventAgent[75]: tid 00000000 type 'afpfs', mounted on '/Volumes/Orders', from 'afp_3yZbVG2g3dbh0ocO2T0bzIsm-2.2e000004', not responding
Aug 23 09:51:21 client-mac KernelEventAgent[75]: tid 00000000 found 2 filesystem(s) with problem(s)
Aug 23 09:52:00 client-mac KernelEventAgent[75]: tid 00000000 received event(s) VQ_DEAD (32)
Aug 23 09:52:00 client-mac KernelEventAgent[75]: tid 00000000 type 'afpfs', mounted on '/Volumes/Art Files', from 'afp_3yZbVG2g3dbh0ocO2T0bzIsm-1.2e000003', dead
Aug 23 09:52:00 client-mac KernelEventAgent[75]: tid 00000000 force unmount afp_3yZbVG2g3dbh0ocO2T0bzIsm-1.2e000003 from /Volumes/Art Files
Aug 23 09:52:00 client-mac KernelEventAgent[75]: tid 00000000 type 'afpfs', mounted on '/Volumes/Orders', from 'afp_3yZbVG2g3dbh0ocO2T0bzIsm-2.2e000004', dead
Aug 23 09:52:00 client-mac KernelEventAgent[75]: tid 00000000 force unmount afp_3yZbVG2g3dbh0ocO2T0bzIsm-2.2e000004 from /Volumes/Orders
Aug 23 09:52:00 client-mac KernelEventAgent[75]: tid 00000000 found 2 filesystem(s) with problem(s)
Aug 23 09:52:01 client-mac KernelEventAgent[75]: tid 00000000 received event(s) VQ_DEAD (32)
Aug 23 09:53:54 client-mac com.apple.launchd.peruser.501[206] ([0x0-0x47047].com.apple.print.PrinterProxy[519]): Exited: Killed: 9
Aug 23 09:53:54 client-mac com.apple.launchd.peruser.501[206] ([0x0-0x12012].com.apple.iChat[251]): Exited: Killed: 9
Aug 23 09:53:54 client-mac com.logmein.logmeinguiagent[239]: 2012-08-23 09:53:54.838 - Debug     - LMIGUIAgent - SessionManager - Application exiting
Aug 23 09:53:54 client-mac com.logmein.logmeinguiagent[239]: Session - Error occured:read errno: (9) Bad file descriptor
Aug 23 09:53:54 client-mac com.logmein.logmeinguiagent[239]: 2012-08-23 09:53:54.839 - Debug     - LMIGUIAgent - SessionManager - Connection closed
Aug 23 09:53:54 client-mac com.logmein.logmeinguiagent[239]: 2012-08-23 09:53:54.839 - Debug     - LMIGUIAgent - SessionManager - Closing DisplayServer
Aug 23 09:53:54 client-mac com.logmein.logmeinguiagent[239]: 2012-08-23 09:53:54.839 - Debug     - LMIGUIAgent - SessionManager - DisplayServer close
Aug 23 09:53:54 client-mac com.apple.launchd.peruser.501[206] (com.apple.talagent[217]): Exited: Killed: 9
Aug 23 09:53:54 client-mac com.apple.launchd.peruser.501[206] ([0x0-0xf00f].com.apple.iTunesHelper[248]): Exited: Killed: 9
Aug 23 09:53:54 client-mac com.apple.launchd.peruser.501[206] ([0x0-0x18018].com.apple.AppleSpell[264]): Exited: Killed: 9
Aug 23 09:53:54 client-mac com.apple.launchd.peruser.501[206] (com.apple.quicklook[256]): Exited: Killed: 9
Aug 23 09:53:54 client-mac com.apple.dock.extra[535]: Could not connect the action buttonPressed: to target of class NSApplication
Aug 23 09:53:54 client-mac com.apple.dock.extra[535]: 2012-08-23 09:53:54.883 com.apple.dock.extra[535:1707] Could not connect the action buttonPressed: to target of class NSApplication
Aug 23 09:53:54 client-mac com.apple.dock.extra[535]: Could not connect the action buttonPressed: to target of class NSApplication
Aug 23 09:53:54 client-mac com.apple.dock.extra[535]: 2012-08-23 09:53:54.884 com.apple.dock.extra[535:1707] Could not connect the action buttonPressed: to target of class NSApplication
Aug 23 09:53:54 client-mac com.apple.dock.extra[535]: Could not connect the action buttonPressed: to target of class NSApplication
Aug 23 09:53:54 client-mac com.apple.dock.extra[535]: 2012-08-23 09:53:54.885 com.apple.dock.extra[535:1707] Could not connect the action buttonPressed: to target of class NSApplication
Aug 23 09:53:54 client-mac com.apple.dock.extra[535]: Could not connect the action buttonPressed: to target of class NSApplication
Aug 23 09:53:54 client-mac com.apple.dock.extra[535]: 2012-08-23 09:53:54.885 com.apple.dock.extra[535:1707] Could not connect the action buttonPressed: to target of class NSApplication
Aug 23 09:53:55 client-mac loginwindow[73]: sendQuitEventToApp (FMCore): AESendMessage returned error -1712
Aug 23 09:53:55 client-mac com.apple.launchd.peruser.501[206] (com.apple.mdworker.i386.0[473]): Exited: Terminated: 15
Aug 23 09:53:55 client-mac com.apple.launchd.peruser.501[206] (com.apple.mdworker.pool.0[269]): Exited: Terminated: 15
Aug 23 09:53:58 client-mac loginwindow[73]: DEAD_PROCESS: 73 console
Aug 23 09:53:58 client-mac auditd[536]: Auditing enabled
Aug 23 09:53:58 client-mac auditd[536]: Got low space trigger
Aug 23 09:53:58 client-mac auditd[536]: auditd_read_dirs(): all audit log directories over soft limit
Aug 23 09:53:58 client-mac auditd[536]: renamed /var/audit/20120823135016.not_terminated to /var/audit/20120823135016.20120823135358
Aug 23 09:53:58 client-mac auditd[536]: New audit file is /var/audit/20120823135358.not_terminated
Aug 23 09:53:58 client-mac _lp[540]: audit warning: allsoft
Aug 23 09:53:58 client-mac _lp[541]: audit warning: soft /var/audit
Aug 23 09:53:58 client-mac _lp[543]: audit warning: closefile /var/audit/20120823135016.20120823135358
Aug 23 09:53:58 client-mac loginwindow[73]: Application hardKill returned -600
Aug 23 09:53:58: --- last message repeated 2 times ---
Aug 23 09:53:58 client-mac shutdown[544]: reboot by client-mac: 
Aug 23 09:53:58 client-mac shutdown[544]: SHUTDOWN_TIME: 1345730038 388776
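
The KernelEventAgent lines in the excerpt above show each AFP mount going from "not responding" to "dead". A small script could pull out just that timeline so it can be compared across several clients. This is a minimal sketch, assuming the log lines look exactly like the ones pasted here; `afp_timeline` and the pattern are hypothetical names written for illustration, not part of any Apple tool.

```python
import re

# Matches KernelEventAgent lines of the form seen in the excerpt, e.g.
#   ... type 'afpfs', mounted on '/Volumes/Orders', from '...', not responding
AFP_RE = re.compile(
    r"^(?P<stamp>\w{3}\s+\d+\s+\d\d:\d\d:\d\d)\s+\S+\s+KernelEventAgent\[\d+\]:"
    r".*type 'afpfs', mounted on '(?P<mount>[^']+)', from '[^']+', "
    r"(?P<state>not responding|dead)\s*$"
)


def afp_timeline(lines):
    """Map each mount point to {state: first timestamp that state was logged}."""
    timeline = {}
    for line in lines:
        match = AFP_RE.match(line)
        if not match:
            continue
        states = timeline.setdefault(match.group("mount"), {})
        # Keep only the first time each state was reported for this mount.
        states.setdefault(match.group("state"), match.group("stamp"))
    return timeline
```

For the excerpt above, this would report both volumes as first not responding at 09:51:15 and dead at 09:52:00.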



And here is a capture from the system log of our Mac file server, from before the incident until the reboot:

8/23/12 8:14:39.207 AM com.apple.launchd.peruser.501: (com.apple.ReportCrash) Falling back to default Mach exception handler. Could not find: com.apple.ReportCrash.Self
8/23/12 8:14:39.223 AM com.apple.launchctl.Aqua: load: option requires an argument -- D
8/23/12 8:14:39.223 AM com.apple.launchctl.Aqua: usage: launchctl load [-wF] [-D <user|local|network|system|all>] paths...
8/23/12 8:14:40.234 AM UserEventAgent: CaptiveNetworkSupport:CNSServerRegisterUserAgent:187 new user agent port: 22587
8/23/12 8:14:40.338 AM com.apple.launchd.peruser.501: (com.apple.launchctl.Aqua[287]) Exited with code: 1
8/23/12 8:14:43.017 AM com.apple.dock.extra: Could not connect the action buttonPressed: to target of class NSApplication
8/23/12 8:14:43.018 AM com.apple.dock.extra: 2012-08-23 08:14:42.974 com.apple.dock.extra[316:1707] Could not connect the action buttonPressed: to target of class NSApplication
8/23/12 8:14:43.018 AM com.apple.dock.extra: Could not connect the action buttonPressed: to target of class NSApplication
8/23/12 8:14:43.019 AM com.apple.dock.extra: 2012-08-23 08:14:43.017 com.apple.dock.extra[316:1707] Could not connect the action buttonPressed: to target of class NSApplication
8/23/12 8:14:43.019 AM com.apple.dock.extra: Could not connect the action buttonPressed: to target of class NSApplication
8/23/12 8:14:43.019 AM com.apple.dock.extra: 2012-08-23 08:14:43.018 com.apple.dock.extra[316:1707] Could not connect the action buttonPressed: to target of class NSApplication
8/23/12 8:14:43.020 AM com.apple.dock.extra: Could not connect the action buttonPressed: to target of class NSApplication
8/23/12 8:14:43.020 AM com.apple.dock.extra: 2012-08-23 08:14:43.019 com.apple.dock.extra[316:1707] Could not connect the action buttonPressed: to target of class NSApplication
8/23/12 8:14:57.945 AM KeyboardSetupAssistant: writeKeyboardType <8196-1149-0> 40
8/23/12 8:15:18.335 AM com.apple.pbs: 2012-08-23 12:15 pbs[341] (CarbonCore.framework) FSEventStreamCreate: _FSEventStreamCreate: ERROR: watch_path() failed for '/Volumes/System BU/Applications/Utilities'
8/23/12 8:15:18.336 AM com.apple.pbs: 2012-08-23 12:15 pbs[341] (CarbonCore.framework) FSEventStreamSetDispatchQueue(): failed assertion 'streamRef != NULL'
8/23/12 8:15:18.336 AM com.apple.pbs: 2012-08-23 12:15 pbs[341] (CarbonCore.framework) FSEventStreamStart(): failed assertion 'streamRef != NULL'
8/23/12 8:15:18.336 AM com.apple.pbs: 2012-08-23 12:15 pbs[341] (CarbonCore.framework) FSEventStreamStop(): failed assertion 'streamRef != NULL'
8/23/12 8:15:18.336 AM com.apple.pbs: 2012-08-23 12:15 pbs[341] (CarbonCore.framework) FSEventStreamInvalidate(): failed assertion 'streamRef != NULL'
8/23/12 8:15:18.336 AM com.apple.pbs: 2012-08-23 12:15 pbs[341] (CarbonCore.framework) 
8/23/12 8:15:18.336 AM com.apple.pbs: FSEventStreamRelease(): failed assertion 'streamRef != NULL'
8/23/12 8:53:36.897 AM AppleFileServer: received message with invalid client_id 31
8/23/12 9:16:58.610 AM ScreensharingAgent: [CL_INVALID_DEVICE] : OpenCL Error : Failed to create context! Invalid device
8/23/12 9:20:30.538 AM ScreensharingAgent: [CL_INVALID_DEVICE] : OpenCL Error : Failed to create context! Invalid device
8/23/12 9:23:51.942 AM ScreensharingAgent: [CL_INVALID_DEVICE] : OpenCL Error : Failed to create context! Invalid device
8/23/12 9:27:12.575 AM ScreensharingAgent: [CL_INVALID_DEVICE] : OpenCL Error : Failed to create context! Invalid device
8/23/12 9:28:20.827 AM AppleFileServer: received message with invalid client_id 36
8/23/12 9:30:32.323 AM ScreensharingAgent: [CL_INVALID_DEVICE] : OpenCL Error : Failed to create context! Invalid device
8/23/12 9:33:51.324 AM ScreensharingAgent: [CL_INVALID_DEVICE] : OpenCL Error : Failed to create context! Invalid device
8/23/12 9:37:11.716 AM ScreensharingAgent: [CL_INVALID_DEVICE] : OpenCL Error : Failed to create context! Invalid device
8/23/12 9:40:33.113 AM ScreensharingAgent: [CL_INVALID_DEVICE] : OpenCL Error : Failed to create context! Invalid device
8/23/12 9:43:55.048 AM ScreensharingAgent: [CL_INVALID_DEVICE] : OpenCL Error : Failed to create context! Invalid device
8/23/12 9:47:19.136 AM ScreensharingAgent: [CL_INVALID_DEVICE] : OpenCL Error : Failed to create context! Invalid device
8/23/12 9:52:00.127 AM loginwindow: NSAlert is being used from a background thread, which is not safe.  This is probably going to crash sometimes. Break on _NSAlertWarnUnsafeBackgroundThreadUsage to debug.  This will be logged only once.  This may break in the future.
8/23/12 9:52:04.011 AM com.apple.launchd.peruser.501: ([0x0-0x17017].com.panic.TransmitMenu[331]) Exited: Killed: 9
8/23/12 9:52:04.011 AM com.apple.launchd.peruser.501: (com.apple.talagent[298]) Exited: Killed: 9
8/23/12 9:52:04.013 AM com.apple.launchd.peruser.501: ([0x0-0x1b01b].com.apple.AppleSpell[347]) Exited: Killed: 9
8/23/12 9:52:04.781 AM loginwindow: DEAD_PROCESS: 97 console
8/23/12 9:52:10.580 AM shutdown: reboot by admin: 
8/23/12 9:52:10.580 AM shutdown: SHUTDOWN_TIME: 1345729930 580252
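
To line the server log up against the client log, it may help to keep only the server entries from the half hour or so before the client first reported the AFP volumes as not responding (09:51:15 in the client excerpt). The sketch below assumes the server lines use the `M/D/YY H:MM:SS.mmm AM` prefix shown above, that the two machines' clocks roughly agree, and that the system locale spells the markers as AM/PM. The function names are made up for illustration.

```python
from datetime import datetime, timedelta


def parse_server_line(line):
    """Split '8/23/12 9:28:20.827 AM proc: msg' into (datetime, rest), or None."""
    parts = line.rstrip("\n").split(" ", 3)
    if len(parts) < 4:
        return None
    try:
        when = datetime.strptime(" ".join(parts[:3]), "%m/%d/%y %I:%M:%S.%f %p")
    except ValueError:
        # Continuation lines and other formats are skipped.
        return None
    return when, parts[3]


def entries_before(lines, incident, minutes=30):
    """Return parsed entries logged in the `minutes` leading up to `incident`."""
    start = incident - timedelta(minutes=minutes)
    selected = []
    for line in lines:
        parsed = parse_server_line(line)
        if parsed and start <= parsed[0] <= incident:
            selected.append(parsed)
    return selected
```

Calling `entries_before(lines, datetime(2012, 8, 23, 9, 51, 15))` on the capture above would keep the 9:28 AppleFileServer "invalid client_id" message and the later ScreensharingAgent entries, and drop everything from the 8:14 startup.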
