Jay_Ratansi
asked on
SBS2008 server problem every monday 2am
I've SBS2008 server unresponsive every monday morning for the past 3 weeks: users can't logon to domain or Exchange store is offline etc. Server console appears to have crashed although mouse moves... Remote access software Logmein free shows server as online but unable to logon. Can't use RDP as well.The only option is to power down & restart which resolves.
I recently installed GFImax monitoring agent on this server. This refreshes server status every 15 minutes. Last monday it lost contact around 2am. I've checked windows event logs: server has continued to record events beyond 2am.
GFImax fault report shows following for period 1st April -15th April:
"CLIENT","Site","Server"," Check","Er ror Count","From","To","Failed _hr","Fail ed_percent "
"ACME","Main","SERVER1","W indows Service Check - Group Policy Client","1","2010-04-04"," 2010-04-04 ","0.53"," 0.15"
"ACME","Main","SERVER1","W indows Service Check - Distributed Transaction Coordinator","3","2010-04- 04","2010- 04-12","0. 75","0.21"
"ACME","Main","SERVER1","W indows Service Check - Microsoft Exchange Anti-spam Update","1","2010-04-04"," 2010-04-04 ","0.25"," 0.07"
"ACME","Main","SERVER1","W indows Service Check - Microsoft Exchange File Distribution","1","2010-04 -04","2010 -04-04","0 .25","0.07 "
"ACME","Main","SERVER1","W indows Service Check - Microsoft Exchange Search Indexer","1","2010-04-04", "2010-04-0 4","0.25", "0.07"
"ACME","Main","SERVER1","W indows Service Check - Routing and Remote Access","1","2010-04-04"," 2010-04-04 ","0.25"," 0.07"
"ACME","Main","SERVER1","W indows Service Check - World Wide Web Publishing Service","1","2010-04-04", "2010-04-0 4","0.25", "0.07"
"ACME","Main","SERVER1","W indows Service Check - Windows Management Instrumentation","1","2010 -04-04","2 010-04-04" ,"0.25","0 .07"
"ACME","Main","SERVER1","W indows Service Check - Update Services","3","2010-04-04" ,"2010-04- 12","0.75" ,"0.21"
"ACME","Main","SERVER1","W indows Service Check - Windows Update","3","2010-04-04"," 2010-04-12 ","1.28"," 0.36"
"ACME","Main","SERVER1","A nti-virus Update Check - Grisoft AVG","2","2010-04-05","201 0-04-09"," 41.89","11 .64"
"ACME","Main","SERVER1","B ackup Check - Microsoft Windows Backup (Vista/Server 2008)","1","2010-04-09","2 010-04-10" ,"24","6.6 7"
(I do see lots of events in the sbs2008 logs relating to Distributed Transaction Coordinator (DTC) for 3 workstations in the network.)
Any ideas/suggestions?
I recently installed GFImax monitoring agent on this server. This refreshes server status every 15 minutes. Last monday it lost contact around 2am. I've checked windows event logs: server has continued to record events beyond 2am.
GFImax fault report shows following for period 1st April -15th April:
"CLIENT","Site","Server","
"ACME","Main","SERVER1","W
"ACME","Main","SERVER1","W
"ACME","Main","SERVER1","W
"ACME","Main","SERVER1","W
"ACME","Main","SERVER1","W
"ACME","Main","SERVER1","W
"ACME","Main","SERVER1","W
"ACME","Main","SERVER1","W
"ACME","Main","SERVER1","W
"ACME","Main","SERVER1","W
"ACME","Main","SERVER1","A
"ACME","Main","SERVER1","B
(I do see lots of events in the sbs2008 logs relating to Distributed Transaction Coordinator (DTC) for 3 workstations in the network.)
Any ideas/suggestions?
How much RAM in the box?
How many nics enabled?
Is IPv6 disabled?
Have you run the SBS BPA and fixed everything there.
How many nics enabled?
Is IPv6 disabled?
Have you run the SBS BPA and fixed everything there.
ASKER
The problem re-occurred yesterday (friday) 4am approx ie. no longer only a monday morning issue!
To answer both sets of questions: (a) Windows Updates : server shows updates applied last on 30th Dec. & there are 16 important updates pending. = I will apply these today; (b) RAM: 4GB = I am upgrading this to 8GB; (c) NICs= one enabled/ only one seen under adapters; (d) IPV6 = enabled
I've run SBS BPA now & can see 1 critical error & 18 items under 'all issues'. The critical error: ,"DNS A resource record points to incorrect IP address","","","The host (A) resource record points to the incorrect IP address 192.168.0.2192.168.0.23. The record should point to 192.168.0.2.","http://invalidurlnamegiveninconfigen6". I'll correct this & go through rest.
System Event Log for issues around 4am friday include:
03.53am: error 7011: A timeout (90000 milliseconds) was reached while waiting for a transaction response from the SrvCtrl service.
04.02am: Group Policy: error 1058 : The processing of Group Policy failed. Windows attempted to read the file \\defensorfds.local\SysVol \defensorf ds.local\P olicies\{8 D161517-D1 12-4371-92 7A-8BC9DC3 11321}\gpt .ini from a domain controller and was not successful. Group Policy settings may not be applied until this event is resolved. This issue may be transient and could be caused by one or more of the following:
a) Name Resolution/Network Connectivity to the current domain controller.
b) File Replication Service Latency (a file created on another domain controller has not replicated to the current domain controller).
To answer both sets of questions: (a) Windows Updates : server shows updates applied last on 30th Dec. & there are 16 important updates pending. = I will apply these today; (b) RAM: 4GB = I am upgrading this to 8GB; (c) NICs= one enabled/ only one seen under adapters; (d) IPV6 = enabled
I've run SBS BPA now & can see 1 critical error & 18 items under 'all issues'. The critical error: ,"DNS A resource record points to incorrect IP address","","","The host (A) resource record points to the incorrect IP address 192.168.0.2192.168.0.23. The record should point to 192.168.0.2.","http://invalidurlnamegiveninconfigen6". I'll correct this & go through rest.
System Event Log for issues around 4am friday include:
03.53am: error 7011: A timeout (90000 milliseconds) was reached while waiting for a transaction response from the SrvCtrl service.
04.02am: Group Policy: error 1058 : The processing of Group Policy failed. Windows attempted to read the file \\defensorfds.local\SysVol
a) Name Resolution/Network Connectivity to the current domain controller.
b) File Replication Service Latency (a file created on another domain controller has not replicated to the current domain controller).
ASKER CERTIFIED SOLUTION
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
ASKER
How do I resolve the sbsBPA critical error: ,"DNS A resource record points to incorrect IP address","","","The host (A) resource record points to the incorrect IP address 192.168.0.2192.168.0.23. The record should point to 192.168.0.2.","http://invalidurlnamegiveninconfigen6". I
I've opened DNS but can't see any entry with this. BTW 192.168.0.23 is used by the server for RAS
I've opened DNS but can't see any entry with this. BTW 192.168.0.23 is used by the server for RAS
SOLUTION
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
ASKER
I've upgraded RAM to 8Gb & applied all available updates. On re-running sbsBPA the DNS critical error has vanished.
Before I work through sbsBPA, SP etc issues, I noticed that sqlservr.exe *32 is grabbing 1.4-1.5GB of RAM. The server has only the sbs2008 default installation incl. sharepoint ie. no other sql application at all. With this the physical memory usage shows at 75%. This seems excessive in comparison say with Exchange store.exe at under 0.6GB
Windows 2008 (Fujitsu-locked oem) is at SP1. I'm not offered windows sp2 via windows update. Is that normal?
Thanks
Jay
Before I work through sbsBPA, SP etc issues, I noticed that sqlservr.exe *32 is grabbing 1.4-1.5GB of RAM. The server has only the sbs2008 default installation incl. sharepoint ie. no other sql application at all. With this the physical memory usage shows at 75%. This seems excessive in comparison say with Exchange store.exe at under 0.6GB
Windows 2008 (Fujitsu-locked oem) is at SP1. I'm not offered windows sp2 via windows update. Is that normal?
Thanks
Jay
There have been some instances where it doesn't get offered...you can go get the 64 bit from the Microsoft Download site
ASKER
Hi
where can I download SBS 2008 Update Rollup 3 from?
Thanks
Jay
where can I download SBS 2008 Update Rollup 3 from?
Thanks
Jay
ASKER
After upgrading from 4GB to 8GB RAM, I'm noticing that the server is grabbing large amount of physical memory. During working hrs right now, the physical memory usage is 93% ie. 7.45GB in use. Of this sqlservr.exe *32 is grabbing 1.4-1.5GB of RAM. (The server has only the sbs2008 default installation incl. sharepoint).
sqlservr.exe *32 process appears to be the SBSMONITORING service.
I wonder if the problems encountered by the server are memory usage related ie. running out of memory & services start shutting down?
sqlservr.exe *32 process appears to be the SBSMONITORING service.
I wonder if the problems encountered by the server are memory usage related ie. running out of memory & services start shutting down?
It's normal. Exchange will use as much memory as you can throw at it. the memory will be released when other applications require it. It is a design change with Exchange 2007.
Shaun
Shaun
I agree with Shaun...this is normal. I don't recall how many workstations you have, but it may be that you need to up the RAM to 12GB
Are your processors being maxed?
Are your processors being maxed?
ASKER
using an excellent article http://msmvps.com/blogs/bradley/archive/2005/02/04/34984.aspx I reconfigured sqlservr.exe *33 ram from 15.GB to 400MB approx. This is working fine since Tuesday. Total server physical memory in use is 81-85% with about 1GB free. Exchange store.exe is using 600MB approx
Server is noticeably responsive especially RWW & OWA websites.
Between 5.30am-6am, server carries out several Exchange maintenance tasks. During this time, GFAmax monitoring issues alerts that server contact has been lost; During this time RWW & OWA & RDP can't be used. This happening for the past 3 days (I have checked that this is not due to simply losing internet connection)
Server is noticeably responsive especially RWW & OWA websites.
Between 5.30am-6am, server carries out several Exchange maintenance tasks. During this time, GFAmax monitoring issues alerts that server contact has been lost; During this time RWW & OWA & RDP can't be used. This happening for the past 3 days (I have checked that this is not due to simply losing internet connection)
how much ram in the server now? how many nics? have you installed sp2 and UR3?
ASKER
Hi
The server RAM is 8GB (upgraded from 4GB) + 1NIC. The server isn't updated to UR3 or Exch.SP2.
Server is stable now. The changes that have been made (1) ram upgrade to 8GB; (2) reconfig sqlserver memory usage (see my 23rd April comment) & (3) change Exchange 2007 mailbox maintenance from daily 2-6am to weekly Sat/Sun midnight-6am, have resulted in physical memory usage of 70-75%, very responsive server, OWA & remote are fast, Group Policy: error 1058 have vanished, GFI max outages no longer reported.
I'll give it another week. If it crashes then I'll proceed to carry out all SBS BPA recommendations.
Thanks
Jay Ratansi
The server RAM is 8GB (upgraded from 4GB) + 1NIC. The server isn't updated to UR3 or Exch.SP2.
Server is stable now. The changes that have been made (1) ram upgrade to 8GB; (2) reconfig sqlserver memory usage (see my 23rd April comment) & (3) change Exchange 2007 mailbox maintenance from daily 2-6am to weekly Sat/Sun midnight-6am, have resulted in physical memory usage of 70-75%, very responsive server, OWA & remote are fast, Group Policy: error 1058 have vanished, GFI max outages no longer reported.
I'll give it another week. If it crashes then I'll proceed to carry out all SBS BPA recommendations.
Thanks
Jay Ratansi
There are 2 or 3 UR for Exchange SP2 and SBS 2008 is now up to UR 4
I strongly suggest getting up to speed on the rest of the updates ASAP as many issues are resolved.
I strongly suggest getting up to speed on the rest of the updates ASAP as many issues are resolved.
ASKER
The server is stable now.
What do the event logs show?
Shaun