?
Solved

WS 2008 R2 VM eventually loses network connectivity then becomes completely unresponsive

Posted on 2013-02-03
15
Medium Priority
?
448 Views
Last Modified: 2013-10-12
Hello EE,

We're having reliability issues with just one VM among the 25 we're hosting on this host server.

About this VM:

OS: Windows Server 2008 R2 SP1
Role: RDS
Anti-virus: none
Software: Office 2010 and a line-of-business application called AVImark. Actually, the users start the executable with a shortcut that points to the EXE on another VM. Other VMs start the same EXE residing on the same server but these VMs don't have these symptoms.

After a few weeks, users will report the following and in this order:

- an inability to logon via RDS
- no Desktop icons (folder redirection fails)
- eventually existing RDS sessions become unresponsive.

Other symptoms from the console session:

- numerous service fail:
     - ttkEvents: "Insufficient system resources exist to complete the requested service"
     - WinLogon: "The Windows logon process has terminated unexpectedly"
     - Folder Redirection: "Failed to apply policy and redirect folder..."
     - Group Policy Drive Maps: "not enough storage is available to process this command"

- everything slows down. What slows down? Everything. Clicking anything might result in a 10s, 20s, 60s or longer delay before getting a response. Eventually, no input gets a response and the VM has to be shutdown which it will do gracefully if the shutdown is initiated by Hyper-V Manager. However, I found that if I catch the VM in a semi-usable state (before it completely locks up), I can resuscitate the VM by disabling then enabling the NIC from within the VM. Things aren't perfect - there's still some delay in clicking things, Resource Manager shows no data, etc. - but it'll get them through the day until I can reboot the VM at night.

Questions? Ideas?

Nathan
0
Comment
Question by:nathanwc
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 9
  • 3
  • 3
15 Comments
 
LVL 30

Expert Comment

by:IanTh
ID: 38849117
is there any resources left for the host as thats a classic error you need resources for the host to work properly and the more vm's make that requirement go up.
0
 
LVL 1

Author Comment

by:nathanwc
ID: 38849129
Good point - I should have pointed out that neither the VM nor the host are out of disk space. Here's a screenshot from the VM:

Disk space in VM
Here's a screenshot of Task Manager in it's typical state also from the VM:

Task Manager performance tab from VM
Nathan
0
 
LVL 122
ID: 38849202
do yu reboot the VM server daily?
0
Has Powershell sent you back into the Stone Age?

If managing Active Directory using Windows Powershell® is making you feel like you stepped back in time, you are not alone.  For nearly 20 years, AD admins around the world have used one tool for day-to-day AD management: Hyena. Discover why.

 
LVL 30

Expert Comment

by:IanTh
ID: 38849206
vm resources no host resources
0
 
LVL 1

Author Comment

by:nathanwc
ID: 38849432
Hanccocka: no
ianth: what? :-)
0
 
LVL 122
ID: 38849448
how many users logon and logoff your RDS host daily?

we reboot our hosts daily to avoid memory fragmentation, and application memory leaks
0
 
LVL 1

Author Comment

by:nathanwc
ID: 38849544
This is in fact one of the smaller sites. I'd guess 10-15 users login per day. Other servers have 30+ per day. We don't routinely reboot this host or any other host since every other VM on this server and others are fine.
0
 
LVL 122
ID: 38849551
also check and replace the e1000 interface in the VM with the VMXNET3 interface.
0
 
LVL 1

Author Comment

by:nathanwc
ID: 38849663
This is a Hyper-V environment.
0
 
LVL 30

Expert Comment

by:IanTh
ID: 38850267
you used pictures from the virtual machine I am saying does the hyper-v HOST have resources left ?
0
 
LVL 1

Author Comment

by:nathanwc
ID: 38850989
Yes, it does:

Host resources
0
 
LVL 1

Author Comment

by:nathanwc
ID: 38861919
I might have solved this one. Looks like RDS isn't too happy when USB devices are redirected to the server. I found that people were plugging in flash drives which I wouldn't think would cause a problem, but I noticed Disk Management hanging and when it finally responded, there was a removable drive in the disk view. I had my client visit each computer and look for any flash drives. Once they were found and removed, the server became responsive.

I've used Group Policy to prohibit redirection, and while only time will tell, things have been good for 48 hours.

http://blogs.technet.com/b/perfguru/archive/2008/03/10/terminal-server-group-policy-guide-in-server-2008.aspx

Nathan
0
 
LVL 1

Author Comment

by:nathanwc
ID: 38981454
Update: we still had lock-up issues on these remote desktop servers after I prohibited USB redirection. I noticed that these three remote desktop servers all lived on one server and all the other unaffected remote desktop servers lived on another. While I can't find anything wrong with that server, I've moved them to the server on which other RDS aren't locking up.
0
 
LVL 1

Accepted Solution

by:
nathanwc earned 0 total points
ID: 39553430
Turned out to be a problem with Dynamic Memory. The apps running on these VMs weren't returning freed memory to the VM. Assigning a static amount of memory to these VMs put this issue to bed.
0
 
LVL 1

Author Closing Comment

by:nathanwc
ID: 39567864
Gave it an A because it works :-)
0

Featured Post

 [eBook] Windows Nano Server

Download this FREE eBook and learn all you need to get started with Windows Nano Server, including deployment options, remote management
and troubleshooting tips and tricks

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Veeam Backup & Replication has added a new integration – Veeam Backup for Microsoft Office 365.  In this blog, we will discuss how you can benefit from Office 365 email backup with the Veeam’s new product and try to shed some light on the needs and …
After seeing many questions for JRNL_WRAP_ERROR for replication failure, I thought it would be useful to write this article.
This tutorial will walk an individual through the steps necessary to join and promote the first Windows Server 2012 domain controller into an Active Directory environment running on Windows Server 2008. Determine the location of the FSMO roles by lo…
How to install and configure Citrix XenApp 6.5 - Part 1. In this video tutorial we have explained step by step installation of Citrix XenApp 6.5 Server on Windows Server 2008 R2 is explained in this video. We have explained the difference between…
Suggested Courses

801 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question