Solved

Both network connections on Win 7 64bit dual NIC system lose connectivity until reboot

Posted on 2011-09-24
16
359 Views
Last Modified: 2012-05-12
I have Windows 7 Pro 64bit running on an ASUS KGPE-D16 board with 2 Intel 82574L on-board Gigabit adapters (firmware v11.11.43.0 6/21/2011) that are configured with static IP addresses (address info below) connected to 2 separate networks via 2 separate switches and they are completely dropping both connections until I reboot the system. I cannot ping or access the system in any way from either networks, while all other equipment on both networks are fully accessible and respond to pings.

The connections seem to be dropping at random times, but on the hour mark of a clock. The 1st was 5 days ago at 8:00am, then the following day at 9:00am, then today at 2:00pm. I have updated the adapter firmware from Intel, gone through the Scheduled Tasks to make sure nothing is running and disabled most of the non-essential stuff. Looking through the Windows Event Logs, I can't find anything. The system was up and running for about a week with zero issues prior to this.

I've setup multiple NICs in in other systems running Windows 7 32bit with no problems what so ever using the same network setup and other ASUS boards. I'm stumped. Any help or ides would be greatly appreciated.


Windows Update - Off
Windows Firewall - Off
Windows UAC - Off
Windows Defender - Off
Installed Firewall - COMODO (setup to trust all LAN 1 traffic and LAN 2 traffic to/from the VMS software and it's associated components)

Here are the IP configurations for the adapters and where they're connected.

LAN 1
IP - 192.168.5.200
Subnet - 255.255.255.0
Default Gate - <blank>
DNS 1 - <blank>
DNS 2 - <blank>
QoS Packet Scheduler - Unchecked after the 2nd event
File & Printer Sharing - Unchecked after the 2nd event
IPv6 - Unchecked
Link-Layer... Responder - Unchecked after the 2nd event
Connected to - Internal IP Video LAN via 48-port Gigabit PoE switch, model GS748TP

LAN 2
IP - 192.168.10.250
Subnet - 255.255.255.0
Default Gate - 192.168.10.1
DNS 1 - 192.168.10.1
DNS 2 - <blank>
QoS Packet Scheduler - Unchecked after the 2nd event
File & Printer Sharing - Unchecked after the 2nd event
IPv6 - Unchecked
Link-Layer... Responder - Unchecked after the 2nd event
Connected to - Client LAN via 8-port Gigabit switch, model GS108

This is a dedicated IP Video system running VMS software.
0
Comment
Question by:thma94
  • 10
  • 6
16 Comments
 
LVL 66

Expert Comment

by:johnb6767
ID: 36594297
Check your Network related services.....

Network Connections
Network Location Awareness
Network Store Interface Service

Any other events in the Event Logs?
0
 

Author Comment

by:thma94
ID: 36594585
Thank you for the response.

Network Connections - Started - set to Automatic
Network Location Awareness - Started - set to Manual, I switched it to Automatic
Network Store Interface Service - Started - set to Manual, I switched it to Automatic

So far, I can't find anything in the logs. I will do more log digging later this morning.
0
 

Author Comment

by:thma94
ID: 36595829
Good afternoon,

I have been digging through the Event Logs and I can't find anything in the logs leading up to when the loss took place. The only things I see are things like the system was rebooted, which was me, and the loss of network connection on LAN 1 and LAN 2 and when the connection comes back online.

Thank you so much for the assistance.
0
 
LVL 66

Expert Comment

by:johnb6767
ID: 36597047
Those are three of the major services that would kill the NICs....

Just out of curiousity....

"Unchecked after the 2nd event"

What about "TCPIP (Internet Protocol" and "Client for MS Networks"?

Hit the Start button, and type "Reliability Monitor", and see if anything is shown in the events.....

0
 

Author Comment

by:thma94
ID: 36597199
Ah. OK.

"Unchecked after the 2nd event"
Those items were unchecked the 2nd time everything locked. I was trying to eliminate possible causes or box in the issue. In the past, disabling QoS had cleared up other networking issues I had with other clients.

The TCP/IPv4 and Client for Microsoft Networks are checked and haven't been changed since the initial setup.

The Reliability Monitor doesn't show anything other than the Shut Downs I had performed to bring everything back online.
0
 
LVL 66

Expert Comment

by:johnb6767
ID: 36597274
I usually start looking at security software (COMODO in this case)....

This might be hard to run at full productivity, but might be about the only thing I can see to do to troubleshoot.....

Perform a clean startup to determine whether background programs are interfering with your game or program
http://support.microsoft.com/kb/331796

Start with JUST the Startup Tab Items (if on the web, leave Security apps enabled).. If it stays up for a longer period of time, than usual, enable some more items (or leave them disabled if not needed as a backgorund "startup" item)....
0
 

Author Comment

by:thma94
ID: 36597287
Do you think this could possibly be a BIOS issue? There is a BIOS update on the ASUS website.

I typically don't flash BIOS unless something is really jacked up.
0
 

Author Comment

by:thma94
ID: 36597362
I took a look at the Startup Items under msconfig and the only items are COMODO and the video recording software.

I hesitate knocking off COMODO on Startup because there has in recent history been issues with employees at Corporate bringing something in to the network a just hammering everything at all the branch locations, like this one. I will do it if I absolutely have to, but a last resort.

I have been looking through the COMODO logs at the traffic because I have it set to record everything in and out that is allowed or blocked. The only traffic are video streams coming in, connections coming in to the video software for playback or live, and your typical port 53, 1947, and 6117. COMODO itself is set to not update automatically. During the troubleshooting part of the 2nd instance, I had set all aspects of it in to training mode.

Thanks again for helping me try and figure this out.
0
Top 6 Sources for Identifying Threat Actor TTPs

Understanding your enemy is essential. These six sources will help you identify the most popular threat actor tactics, techniques, and procedures (TTPs).

 

Author Comment

by:thma94
ID: 36712757
UPDATE.

The system locked up again yesterday, on the hour of a clock again, this time at 3pm.

Because it's a secured installation, I do not always have physical access to the system. So, until this point I have only been able to remote reboot and troubleshoot remotely.

I was able to get physical access to the system this time, so I connected my monitor and keyboard to the system and it and all I had was a blank screen with a blinking cursor. No Windows, no messages, nothing. I went ahead and power cycled the system to look through the Windows logs again and the only thing I could find was that Windows was not properly shut down.

Now, I am looking to this possibly being a hardware issue. I ran memory tests, and everything passed with no errors. Both of the CPUs and each memory slot also has an LED indicating error or failure. All LEDs are off, indicating everything is good. Since there was a BIOS update, I went ahead and flashed the BIOS with the update. Something I noticed after the flash was that my HDD Activity LED started working properly, so now I am in a wait and see pattern. I watched the system for a few hours after the flash, I never had more than 7-10% CPU usage and no more than 7% memory usage.

At this point should I still be looking at Win 7 or close out the issue and open a new one in hardware?

Thank you again for the assistance.
0
 
LVL 66

Accepted Solution

by:
johnb6767 earned 500 total points
ID: 36714434
I am thinking hardware.....

Is there any DMP files in C:\windows\minidump?

The fact that it was at a flashing cursor, indicates maybe it wasnt able to read the disk properly. Maybe an issue with the MBR?

Do a read only 'chkdsk' and see what is reported, if it says it is unable to continue in Read Only mode....
0
 
LVL 66

Expert Comment

by:johnb6767
ID: 36714437
I have gotten to the point on ANY odd problem, I run this.... Seen too many odd problems from these buggers....
Especially the ones (rootkits) that failed to install properly....

How to remove malware belonging to the family Rootkit.Win32.TDSS (aka Tidserv, TDSServ, Alureon)?
http://support.kaspersky.com/faq/?qid=208280684
0
 

Assisted Solution

by:thma94
thma94 earned 0 total points
ID: 36714478
Well, no mini dumps.

I'm with you on the hardware. I've been digging this afternoon on the black screen with the blinking cursor and the information is all over the map. I found some info on BCDEDIT and I compared the files on my "troubled system" and 3 other systems that are running like a champ and it matches up like it's supposed to.

Last night/this morning after flashing the BIOS, the system didn't boot at all because it was looking at a video storage drive instead of the OS SSD and I had found some information that some BIOS can loose the boot drive order causing the black screen. Right now, I'm going to hang back and see if it fails again in the near future (couple of days), then next look at the SSD. I downloaded a new firmware version for that drive as well.

I will post updates for the thread. Thanks again for assisting me with this issue. It's greatly appreciated.
0
 
LVL 66

Expert Comment

by:johnb6767
ID: 36714525
Anytime... Good luck....
0
 

Assisted Solution

by:thma94
thma94 earned 0 total points
ID: 36897610
UPDATE.

On the 28th the system was down again. Same thing, black screen with a blinking cursor.

This time, I updated the firmware in my OCZ Vertex 3 SSD from v2.08 to v2.13. Since then it has been stable.

I have ruled out any root kit or virus'. I have also been closely monitoring the internal chassis temps as well as CPU and drive temps to rule out heat issues. Everything well within spec. I have been digging on the Vertex 3 SSD and it seems the odds of working versus total headache are 50/50, but the 50% that are having the issues look just like what I am dealing with.

I am going to pick up another SSD from another manufacture and stand by with that unit, in the event it happens again in the near future.
0
 

Author Comment

by:thma94
ID: 36944379
UPDATE.

Since the firmware update to the OCZ Vertex 3 SSD, the system has been solid. Running with no hiccups since.

Thank you johnm6767 for the assistance on this issue.
0
 

Author Closing Comment

by:thma94
ID: 36972718
Without johnb6767 assisting me with this issue, I would have wasted more time than necessary looking through Win 7.

He took me right where I needed to be, within Win 7, to eliminate that as the issue, which in turn led me to looking at the hardware at an earlier stage.

Thanks johnb6767.
0

Featured Post

How to improve team productivity

Quip adds documents, spreadsheets, and tasklists to your Slack experience
- Elevate ideas to Quip docs
- Share Quip docs in Slack
- Get notified of changes to your docs
- Available on iOS/Android/Desktop/Web
- Online/Offline

Join & Write a Comment

Let’s list some of the technologies that enable smooth teleworking. 
Meet the world's only “Transparent Cloud™” from Superb Internet Corporation. Now, you can experience firsthand a cloud platform that consistently outperforms Amazon Web Services (AWS), IBM’s Softlayer, and Microsoft’s Azure when it comes to CPU and …
This Micro Tutorial will give you a basic overview of Windows Live Photo Gallery and show you various editing filters and touches to photos you can apply. This will be demonstrated using Windows Live Photo Gallery on Windows 7 operating system.
Get a first impression of how PRTG looks and learn how it works.   This video is a short introduction to PRTG, as an initial overview or as a quick start for new PRTG users.

747 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

13 Experts available now in Live!

Get 1:1 Help Now