Solved

Server losing network connection randomly - reboot required to reactivate.

Posted on 2009-05-13
Last Modified: 2012-05-06
Ok, we are having an issue with one of our new servers.

It is a DL380 G5 with one quad-core CPU and 4GB of memory, running Windows Server 2003 SP2. We built this machine specifically for one application within our company, which requires SQL Server 2005. The server has 2 NICs, which we teamed and connected to one subnet.

After I had installed Windows Server 2003 + SP2 we left the machine sitting there for a month or so and had no problems. Then SQL 2005 SP2 was installed and again we had a month with no problems.

Since then, we have installed the application it was designed to support, which involves an amount of data being imported into the SQL database. Either during the actual import of data, the SQL verification or the SQL DB backup (as far as we can tell) we lose network connection to the machine completely and I have to connect via iLO 2 and reboot to fix it. This generally occurs once every day or so.

When I connect to the machine via iLO the NIC appears connected, but the server cannot contact anything on the domain other than its own addresses (10.22.20.x & 127.0.0.1). I tried restarting the Network service, but this fails.

To diagnose this, we have changed the IP, the port on the router, and the network cables. We then broke the team, disabled one NIC and ran off the other, and the same occurred; we reversed that and the issue still occurred. We had HP replace the motherboard, as the NICs are onboard, and the issue still occurs. There is nothing in the event log or in the SQL Server log that points to anything other than the dropped network connection disconnecting a number of users. We have also upgraded the NIC drivers and firmware with no results as yet, and we are really starting to run out of ideas.
Question by:1645Y
6 Comments
 
LVL 14

Expert Comment

by:igor-1965
ID: 24382562
Have you tried resetting the TCP/IP stack?
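For reference, on Windows Server 2003 a TCP/IP stack reset is normally done with `netsh int ip reset <logfile>`, optionally followed by `netsh winsock reset`. The minimal Python sketch below assembles those two commands and runs them only when asked to. The log path and the function names are assumptions for illustration, not something from this thread. It would have to run from an administrator session on the server, and a reboot is normally needed afterwards.

```python
import subprocess


def build_reset_commands(log_path):
    """Return the netsh commands that reset the TCP/IP stack and the Winsock catalog."""
    return [
        ["netsh", "int", "ip", "reset", log_path],  # log file path (hypothetical value)
        ["netsh", "winsock", "reset"],
    ]


def run_reset(log_path, dry_run=True):
    """Print each command and execute it only when dry_run is False.

    Returns the exit codes of the commands that were actually run.
    """
    exit_codes = []
    for command in build_reset_commands(log_path):
        print(" ".join(command))
        if not dry_run:
            exit_codes.append(subprocess.run(command).returncode)
    return exit_codes
```

Calling `run_reset(r"C:\resetlog.txt")` only prints the commands; passing `dry_run=False` would execute them.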
 
LVL 7

Expert Comment

by:ManicD
ID: 24382640
What firewalls do you have installed on the server?
Can you ping the server when the card stops communicating?
Where can and can't you ping from when it stops communicating?
Do you have physical access to the server when it's down?
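The ping questions above can also be answered by a small script left on the server. The sketch below pings a list of targets in order from nearest to farthest and reports the first one that fails, which shows where reachability stops. The addresses are placeholders from documentation ranges (hypothetical, not the poster's network), and `ping_command`, `ping_once` and `first_unreachable` are illustrative names.

```python
import platform
import subprocess

# Hypothetical targets, ordered from nearest to farthest from the server.
TARGETS = [
    ("loopback", "127.0.0.1"),
    ("own address", "192.0.2.10"),
    ("default gateway", "192.0.2.1"),
    ("domain controller", "198.51.100.5"),
]


def ping_command(host, system=None):
    """Build a single-echo ping command; Windows uses -n, most other systems -c."""
    system = system or platform.system()
    count_flag = "-n" if system == "Windows" else "-c"
    return ["ping", count_flag, "1", host]


def ping_once(host):
    """Return True if ping exits with code 0.

    Caveat: on Windows the exit code can be 0 even when the reply is
    "Destination host unreachable", so treat a pass as a hint only.
    """
    result = subprocess.run(
        ping_command(host),
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    return result.returncode == 0


def first_unreachable(targets, ping=ping_once):
    """Return the label of the first target that does not answer, or None if all do."""
    for label, host in targets:
        if not ping(host):
            return label
    return None
```

Running `first_unreachable(TARGETS)` from the server during an outage, with real addresses substituted, would indicate whether the failure starts at the server's own address, the gateway, or further out.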

 

Author Comment

by:1645Y
ID: 24390062
No internal firewalls.
No, you can't ping the server from anywhere outside the server itself.
The only place you can actually ping its address is from the server itself.
When on the server you cannot ping anything outside. Yes, we have physical access to the server when it's down.

There do seem to be network activity lights on the machine when it's down.
 

Author Comment

by:1645Y
ID: 24469490
Can't confirm the exact cause, but I have found something that I am fairly sure is responsible.

We had a contractor come in to copy a DB from our old accounting system to the new system, for which he needed local admin access. It seems he installed Citrix Secure Access Client on the server to connect to his work network from the server itself.

He then used his laptop / the Wyse box we gave him to connect to our server.

From what I understand there are network compatibility issues that could cause a clash and bring down the specific IP. I am trying to find definitive proof so I can go to the company with the data, but other than knowing that he installed the software on the date the errors started, and that the problem hasn't occurred since he left, I can't find the exact cause.
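One possible way to gather that kind of evidence, assuming the problem can be reproduced, is to save the output of commands such as `route print` or `ipconfig /all` while the server is healthy and again during an outage, then compare the two. The sketch below is only an illustration of that idea: `snapshot` and `changed_lines` are hypothetical helper names, and whether a VPN client actually changes the routing table on this server is an assumption, not something established in this thread.

```python
import difflib
import subprocess


def snapshot(command):
    """Run a command such as ["route", "print"] and return its text output."""
    return subprocess.run(command, capture_output=True, text=True).stdout


def changed_lines(before, after):
    """Return only the lines added ("+ ") or removed ("- ") between two snapshots."""
    diff = difflib.ndiff(before.splitlines(), after.splitlines())
    return [line for line in diff if line.startswith(("+ ", "- "))]
```

For example, `changed_lines(healthy_routes, outage_routes)` would list any routes that appeared or disappeared between the two captures, which could then be matched against the date the software was installed.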
 

Accepted Solution

by:
ee_auto earned 0 total points
ID: 24977624
Question PAQ'd, 500 points refunded, and stored in the solution database.