Solved

Backup throughput degrades from remote AIX 5.3 using Backup Exec 12.5 remote agent

Posted on 2009-07-13
5
962 Views
Last Modified: 2013-12-01
I have Backup Exec 12.5 on Windows 2003 Enterprise trying to backup an AIX 5.3 partition to LTO4 drives via a BE AIX remote agent installed on the AIX Server. Up until a week ago backup of 890 GB took a little over 6 hours with a throughput of 2.3GBpm. Now I have the same job running about 2MBpm. It originally started out at 8MBpm but slowly degrades to 2MBpm.
Cleaned drives, restarted AIX agent & BE server services. Finding very little online that addresses this issue. Any ideas where to look or how to fix?
Thanks, Joe
0
Comment
Question by:jdones
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
5 Comments
 
LVL 68

Expert Comment

by:woolmilkporc
ID: 24838033

Hi,

this sounds quite like a network problem. Any changes in your network environment?
Particularly have a look at speed/auto-negotiate settings, which shoud be consistent on both sides (server/switch).
Keep in mind that you'll have to check this on your VIO server, if you're using VIO.
Might also be related to MTU sizes.
Another thing to examine is of course the overall load of the network and the network interfaces, and CPU.
 
wmp
0
 

Expert Comment

by:SalfordsFinest
ID: 24838073
Sometimes Backup Exec needs to renegotiate the connection between the drive and the server.  A server reboot will sort that out in the interim.  Long term check that you have the latest veritas service packs and updates installed and that you're using the veritas drivers for the tape drive.

One other thing, check the NIC settings they may need changing from AutoDetect to Full Duplex.
0
 
LVL 32

Expert Comment

by:Rodney Barnhardt
ID: 24843380
We had a similar problem, where the time to backup the server jumped and the rate went down. After some trouble shooting, it ended up being a bad switch. All of our servers for that office were segmented from the rest of the network on a single switch. When we replaced that switch, our performance came back. It was an unmanaged switch in a small office.
0
 

Author Comment

by:jdones
ID: 24896865
I checked the network configurations on both servers & the switch. All set for auto-negotiate & all flowing at 1Gbps/ Full Duplex speed/config. I performed a test copy to the files using WinSCP to the problem partition & another partition using a different LUN. The throughput speeds were consistent & did not degrade in speed.
However, the problem partition contains the running applications for the server, Oracle, Peoplesoft, Sybase, Siebel, JDE, Etc. Also, the directory is 94% full. I think it may be a combination of disk saturation causing disk thrashing + all of the small files contained in each application subdirectory that is causing  an issue with Backup Exec's ability to manage the backup job. A lot of little files tend to slow down the backup process.
What do you think?
0
 
LVL 68

Accepted Solution

by:
woolmilkporc earned 500 total points
ID: 24897193
Yes,

>>  A lot of little files tend to slow down the backup process << - that's more than true, unfortunately.

You wrote that the degradation began a week ago. Does this correlate with the increase in number of small files, or have they been there before?

Did you try to tune block size, buffer size and buffer count in drive properties?

I mostly use TSM and thus can't say much about BE tuning, but blocking/buffering is always a good idea.

>> the problem partition contains the running applications << - do you see high CPU load during the backup process? If yes, any chance to move it off-shift?

How about BE's network settings? Particularly TCP_NODELAY is useful with many small files. Does this setting exist in BE? Are there tunable buffer settings at the BE client side?

Additionally, you could set AIX's tcp_nodelayack to 1 and see if it helps (no -o tcp_nodelayack=1). This setting is useful to overcome the weak implementation of Nagle's algorithm in Windows. Attention - in a highly CPU constrained environment it could cause too much overhead (well, not very likely, but who knows ...)

wmp





0

Featured Post

The Ultimate Checklist to Optimize Your Website

Websites are getting bigger and complicated by the day. Video, images, custom fonts are all great for showcasing your product/service. But the price to pay in terms of reduced page load times and ultimately, decreased sales, can lead to some difficult decisions about what to cut.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

The Delta outage: 650 cancelled flights, more than 1200 delayed flights, thousands of frustrated customers, tens of millions of dollars in damages – plus untold reputational damage to one of the world’s most trusted airlines. All due to a catastroph…
Each year, investment in cloud platforms grows more than 20% (https://www.immun.io/hubfs/Immunio_2016/Content/Marketing/Cloud-Security-Report-2016.pdf?submissionGuid=a8d80a00-6fee-4b85-81db-a4e28f681762) as an increasing number of companies begin to…
To efficiently enable the rotation of USB drives for backups, storage pools need to be created. This way no matter which USB drive is installed, the backups will successfully write without any administrative intervention. Multiple USB devices need t…
Two types of users will appreciate AOMEI Backupper Pro: 1 - Those with PCIe drives (and haven't found cloning software that works on them). 2 - Those who want a fast clone of their boot drive (no re-boots needed) and it can clone your drive wh…

724 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question