Link to home
Start Free TrialLog in
Avatar of sc456a
sc456a

asked on

Win2k8 Server Connectivity Issues

I have a network consisting of various types of Windows XP Pro machines connecting to a domain hosted on a Dell PowerEdge 2950 Windows 2008 Standard SP2 server.  They are all connect to a gigabit Cat6 network.  The server has two gigabit connections which are teamed (paired).

The server shows no hardware errors.  The network itself seems fine.  However, numerous users are randomly receiving the following errors when they log in or out, and their Outlook PST files are being corrupted:

"Windows was unable to save all the data for the file \\server\path\outlook\pf6DD.tmp.  The data has been lost.  This error may be caused by a failure of your computer hardware or network connection.  Please try to save the file elsewhere."

This is seen during the "Saving your settings" portion of the logout process, and it basically means that their Outlook PST file is now corrupted and will have to be restored from a backup because repairing it means data will be lost.

Sometimes the following roaming profile error occurs:

"Windows cannot update your roaming profile.  Possible causes of this error include network problems of insufficient security rights.  If this problem persists, contact your network administrator."


When the PST file is corrupted, this message may appear:

"Cannot start Microsoft Office Outlook.  Cannot open the Outlook window.  The set of folders cannot be opened.  The file ...\outlook.pst is not a personal files folder."

There are a few more errors, all pointing in the same directions - no where.  Sometimes a roaming profile error or network connectivity error will occur and Outlook will become corrupted, and sometimes it won't.  Most of the time on each workstation it just works normally.  There is no correlation between the model or workstation, the SP level applied, or the version of Office (using 2003 and 2007).

I'm looking for any input at all here.  The only thing I can think of is some kind of elusive network connectivity issue, but even that doesn't make sense because it works most of the time.  Switches usually work or don't.  If it's the server, it's strange because it doesn't really have any errors on it that indicate these sorts of issues.

Avatar of amdaxiom
amdaxiom

This really does sound like network connectivity issues.  I assume you have managed switches to handle the teaming?  Do your switches show any errors on any ports?

Is this something that was working fine for a while and then recently stopped working?  Or is this a brand new install?

Have you tried removing the team (maybe it's the teaming) and using a single network port for now?

Can you test transferring a few hundred gigs of data to/from the server and see if you get any errors?
Avatar of sc456a

ASKER

The switches and teaming have been in place for years without issue.  The switches have the latest firmware applied, and the NICs have the latest drivers and management software.

I have not had any issues transferring 4GB+ files, however the issue would only occur if a dozen or so other machines were doing the same at that moment, such as during morning/evening login/logout.
Avatar of sc456a

ASKER

Another error I'm getting is Event ID 2012 in the System Log:

Log Name:      System
Source:        srv
Date:          4/14/2010 2:07:59 PM
Event ID:      2012
Task Category: None
Level:         Warning
Keywords:      Classic
User:          N/A
Computer:      server
Description:
While transmitting or receiving data, the server encountered a network error. Occassional errors are expected, but large amounts of these indicate a possible error in your network configuration.  The error status code is contained within the returned data (formatted as Words) and may point you towards the problem.
Event Xml:
<Event xmlns="http://schemas.microsoft.com/win/2004/08/events/event">
  <System>
    <Provider Name="srv" />
    <EventID Qualifiers="32768">2012</EventID>
    <Level>3</Level>
    <Task>0</Task>
    <Keywords>0x80000000000000</Keywords>
    <TimeCreated SystemTime="2010-04-14T18:07:59.364Z" />
    <EventRecordID>24221</EventRecordID>
    <Channel>System</Channel>
    <Computer>server</Computer>
    <Security />
  </System>
  <EventData>
    <Data>\Device\LanmanServer</Data>
    <Binary>0000040001002C0000000000DC07008000000000840100C00000000000000000000000000000000087050000</Binary>
  </EventData>
</Event>
Avatar of Darius Ghassem
This could be multiple issues.

What AV are you using?

Look over link below.

http://support.microsoft.com/kb/968991

http://support.microsoft.com/kb/951037

There have been some problems with Teaming on Dell 2950s even with the most updated drivers there have been some conflicts with certain updates from MS.
Avatar of sc456a

ASKER


I'm running AVG v9 Business Edition on the server and workstations.

With the first link, I'm not using Intel NICs (Broadcom NetXTreme II), so I don't think that applies.

I don't have chimney offload enabled, so I don't think that's an issue.

Do you know which specific updates conflict with teaming, so I can see if I have them installed?

First thing remove AVG for testing. Break the team for testing.
Avatar of sc456a

ASKER

I removed the AVG Anti-Virus software and for about a week there were no issues, but today there have been numerous reports of the same login errors that occurred before, such as "Windows cannot locate the server copy of your roaming profile."  Thoughts?
Avatar of sc456a

ASKER

Exact Error - Event ID 1521:

Windows cannot locate the server copy of your roaming profile and is attempting to log you on with your local profile. Changes to the profile will not be copied to the server when you logoff. Possible causes of this error include network problems or insufficient security rights. If this problem persists, contact your network administrator.  

 DETAIL - The network location cannot be reached. For information about network troubleshooting, see Windows Help.
Avatar of sc456a

ASKER

Other errors that show up in the event viewer just before Event ID 1521:

Event ID 15:

Automatic certificate enrollment for local system failed to contact the active directory (0x8007054b).  The specified domain either does not exist or could not be contacted.
  Enrollment will not be performed.

Event ID 1054:

Windows cannot obtain the domain controller name for your computer network. (The specified domain either does not exist or could not be contacted. ). Group Policy processing aborted.
Post ipconfig /all. Are you having network switch problems?
Avatar of sc456a

ASKER

No switch issues that I'm aware of.  Latest firmware is applied.  Here is the ipconfig:

Microsoft Windows XP [Version 5.1.2600]
(C) Copyright 1985-2001 Microsoft Corp.

Windows IP Configuration

        Host Name . . . . . . . . . . . . : PKG1
        Primary Dns Suffix  . . . . . . . : domain.local
        Node Type . . . . . . . . . . . . : Hybrid
        IP Routing Enabled. . . . . . . . : No
        WINS Proxy Enabled. . . . . . . . : No
        DNS Suffix Search List. . . . . . : domain.local

Ethernet adapter Local Area Connection:

        Connection-specific DNS Suffix  . :
        Description . . . . . . . . . . . : Broadcom NetXtreme 57xx Gigabit Cont
roller
        Physical Address. . . . . . . . . : 00-0F-1F-E8-3A-55
        Dhcp Enabled. . . . . . . . . . . : Yes
        Autoconfiguration Enabled . . . . : Yes
        IP Address. . . . . . . . . . . . : 192.168.1.150
        Subnet Mask . . . . . . . . . . . : 255.255.255.0
        Default Gateway . . . . . . . . . : 192.168.1.1
        DHCP Server . . . . . . . . . . . : 192.168.1.1
        DNS Servers . . . . . . . . . . . : 192.168.1.15
        Lease Obtained. . . . . . . . . . : Tuesday, April 27, 2010 10:27:20 AM
        Lease Expires . . . . . . . . . . : Friday, April 30, 2010 10:27:20 AM
ASKER CERTIFIED SOLUTION
Avatar of Darius Ghassem
Darius Ghassem
Flag of United States of America image

Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial
Avatar of sc456a

ASKER

Yes, I did consider that.  I will make that change tonight and see what happens, but I cannot see why the DHCP on the router would have worked properly for a year and suddenly start having issues.
Well it is just a best practice.
Interesting that you said it's been working for years and can transfer large amounts of data with no issues.  Any licensing issues?  Maybe running out of license?  Of course, you would see that in the event logs on the server.

You posted IPconfig of your workstation.  I'm assuming the error event logs are from the server and not your workstation, correct?  Server is 192.168.1.15 or something else?

When did these errors start occurring?
Avatar of sc456a

ASKER

I have rebooted the router that is the DHCP server, and so far so good.  It could have just needed a reboot, although I always cringe at that being the "solution".

amdaxiom:  The errors are on the workstations, not the server.  Server is .15.  Errors started a few weeks before I began this post.
Again I would move DHCP off onto a server so you can actually troubleshoot issues like this without having to reboot your router.
Avatar of sc456a

ASKER

I have disabled DHCP on the router and set it up on a Windows server.  Still awaiting feedback from the customer, but so far, so good.
Good let me know.
Avatar of sc456a

ASKER

Amazingly there was some mysterious problem with the router's DHCP server - although it had been fully functional for a year and nothing had been changed on the router for months.  Thanks for the suggestion.