Slow NFS Performance
Posted on 2006-04-07
We moved our office a few weeks ago, and as part of that move, re-ip'd the whole environment from a 10.0.0.x range to a 192.168.0.x range.
Since then, we've had performance issues on Solaris, AIX, and HP-UX NFS clients. Our linux clients are as well-behaved as they ever were.
We get a lot of RPC timeouts, and very slow performance on the NFS mounted directories. Have tried playing with vers=2/vers=3 and tweaking the wsize/rsize params, but so far, no luck.
Looking for any suggestions or ideas... Since the linux environments remain fast, I think we can safely rule out a network or hardware issue, so I'm thinking it's on the nfs clients themselves.
netstat -i shows no collisions on the interfaces themselves.
nfsstat from the client shows a lot of badcalls and badxids:
# nfsstat -rc
calls badcalls badxids timeouts newcreds badverfs timers
27347 2599 537 2563 0 0 0
cantconn nomem interrupts
0 0 17
calls badcalls retrans badxids timeouts newcreds badverfs
9 1 0 0 0 0 0
timers nomem cantsend
6 0 0
Finally, a truss of a cp from an NFS mount to local disk on the client hangs on the following line:
write(4, 0xFE800000, 8388608) (sleeping...)
We have a fairly heterogenous environment, and the NFS servers in question are a linux-based Snap Server, and a stock Solaris 8 box with clients running linux, solaris, HP-UX, and AIX. None of these issues were present in the old environment, and the linux clients are all still happy.
Looking for any ideas...