Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people, just like you, are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
Solved

VSS causing random lockups, Server 2008

Posted on 2012-03-15
6
1,649 Views
Last Modified: 2016-11-23
Hello Experts

Having a hard time trying to track down the source of one of our servers intermittent freezing

What happens on the odd occasion (3 times last month, 1 time this month so far) is that the server completely 'locks up'

We have a 2 server site setup, so when it happens, we can connect to serverB and ping serverA

We cannot however connect to any resources on ServerA (ServerA runs exchange and some other file shares)

Its a Dell poweredge R710 server, so when I connect to the DRAC - the console responds to mouse, but not keyboard. The only remedy at this point is to restart the server

Once the server starts back up, there is a pysical 'gap' in the event logs. As in - when the server crashes until when the server is back up, there is a gap in all event logs (system, application, security etc)

The only thing that seems to be happening is a VSS start command

Log Name:      System
Source:        Service Control Manager
Date:          16/03/2012 7:15:01 AM
Event ID:      7036
Task Category: None
Level:         Information
Keywords:      Classic
User:          N/A
Computer:      ServerA
Description:
The Volume Shadow Copy service entered the running state.

Shadow copies are disabled via 'My computer' - however we do run shadow protect as a backup solution that runs on the hour (15 minutes past the hour) that backs up the server volumes to a NAS over gigabit network

It's only the odd occasion when the server locks up, but the symptoms are exactly the same

Vss admin list writers show all writers as stable, all system volumes are 0% fragmented, Dell drivers for the RAID controller are up to date

Any other ideas?
0
Comment
Question by:HeronTech
  • 4
6 Comments
 
LVL 2

Expert Comment

by:brammer90
ID: 37728735
Hi
This is a tricky one.

I'm assuming the exchange server is freezing intermittently when you try to back it up.
Its all a process of ilimination.

You mention the drivers for the controllers are upto date but have you checked for any backup software updates & service packs.

I would change to job to 2 seperate ones.

Backup the server without backing up the exchange databases then backup the databases as a seperate job using the exchange mailbox backup and not just backup the edb files like you would the other files.

See how you get on then, this could be a problem with the database being scanned by A/V as the snapshot is taken.

initially I would be performing a manual backup daily rather than leaving it automated untill we've tracked down the problem.

if the backup fails, you will have a good idea what part is failing.

while its manual, you have the opertunity to disable the A/V from scanning while you carry out the backup, thats what I would do, but that up to you.

if you find it works ok as 2 seperate jobs then try it automated as 2 jobs for a while.

it very important that your backup solution isnt actually 'file' backing up the exchange database file itself as it should be backing up using exchange backup.

let me know how you get on with that and lets take it from there.

Good luck

Regards
Dave
0
 
LVL 1

Author Comment

by:HeronTech
ID: 37735875
Hello

The backup isnt being ran whilst any a/v 'scheduled' scans are being ran - and the exchange EDB files are already excluded by the 'real time scan'

The version of shadow protect is as up to date as we'd care it to be - its version 4.05 - I know version 4.2 is out now - but that in itself has issues (as did 4.1.5) causing SQL VDI 'errors' to be logged every backup.

There is already 2 backup jobs being ran - a full backup (each Saturday) then incrementals every hour to the NAS - and full backups (each night) to an external USB drive

It seems to be the network backup that its falling over on - the server has not crashed once during the normal full backup to the external drives

The backup jobs are identical - backing up both volumes. The only difference is the location

For now, I have a job open with shadow protect as well as here - for now I have disabled the backup to the NAS as a preventative measure
0
 
LVL 1

Author Comment

by:HeronTech
ID: 37740633
Hello

Going right way back to basics - if a server has a pagefile configured that is too small and runs out of virtual memory trying to execute a vss - will the whole server lock up as in the behaviour described?
0
Best Practices: Disaster Recovery Testing

Besides backup, any IT division should have a disaster recovery plan. You will find a few tips below relating to the development of such a plan and to what issues one should pay special attention in the course of backup planning.

 
LVL 1

Accepted Solution

by:
HeronTech earned 0 total points
ID: 37922917
I have fixed this issue myself. Please close
0
 
LVL 1

Author Closing Comment

by:HeronTech
ID: 37936196
Solved externally by 3rd party software provider
0
 

Expert Comment

by:SupermanTB
ID: 39089244
I'm having this exact same issue.  Would you mind sharing what 3rd party software you used?
0

Featured Post

PRTG Network Monitor: Intuitive Network Monitoring

Network Monitoring is essential to ensure that computer systems and network devices are running. Use PRTG to monitor LANs, servers, websites, applications and devices, bandwidth, virtual environments, remote systems, IoT, and many more. PRTG is easy to set up & use.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

VM backup deduplication is a method of reducing the amount of storage space needed to save VM backups. In most organizations, VMs contain many duplicate copies of data, such as VMs deployed from the same template, VMs with the same OS, or VMs that h…
You might have come across a situation when you have Exchange 2013 server in two different sites (Production and DR). After adding the Database copy in ECP console it displays Database copy status unknown for the DR exchange server. Issue is strange…
This tutorial will walk an individual through the steps necessary to configure their installation of BackupExec 2012 to use network shared disk space. Verify that the path to the shared storage is valid and that data can be written to that location:…
This tutorial will show how to configure a single USB drive with a separate folder for each day of the week. This will allow each of the backups to be kept separate preventing the previous day’s backup from being overwritten. The USB drive must be s…

808 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question