Netware 6.5 SP2 Server Freezes Cold w/o Indication of Errors

I'm a novice at Novell, but I've been given the task of taking an old PowerEdge 1400 (dual PIII 866MHz, 1GB RAM, PERC2/DC RAID, dual Intel NIC, and 4x36GB 10K SCSI drives in RAID 5) and setting it up as a Novell server.

Fair enough.

I installed the OS and PROD CDs.  Got the SP2 files and loaded them.  So I got everything working.  Then I started noticing a disturbing trend.  

My server freezes cold without recourse other than to powercycle.  This server is pre-production, so I'm not concerned about uptime.  But the problem is completely baffling.  My only ABEND log entries point to SERVER.NLM and AFPTCP.NLM (the apple talk config driver.)  But I updated the AFPTCP and lockups still occur.  

If I reboot the server and just let it be, it will not lock up for upwards of 24 hours, but it will eventually happen.  If I am using the server, it locks up much quicker.  If I try and copy about 1GB of files to it, it freezes almost immediately and without fail.

I thought it might be the PERC Drivers, so I've gotten the latest from Dell (PEDGE3.HAM).  I thought it might be the LAN drivers, but I updated those too.  

Any help would be greatly appreciated.  Keep in mind I'm no expert on the OS.
OK, NetWare v6.5 with SP2. Does it have any Post-SP2 patches? In particular, there is one which is critical if the server hosts an NDS replica.

Generally, the symptoms you describe point to a hardware problem. Have you tried running the Dell diagnostics?

Novell TID #10017179 provides a general guide to troubleshooting ABENDs, altho I'm not sure if it really applies, since you're getting a hard lockup with no ABENDs, right? Are you able to even activate the NetWare Debugger (hold down the Left Shift and Right Shift keys and then, at the same time, press the ALT and ESC keys)?
TID 10017179 is a great place to start, as PsiCop pointed out.

Can you post your AUTOEXEC and STARTUP.NCF files here? We'll take a look to get a better idea of your setup.


The first thing I would check is your Intel LAN driver.

Get the most current one from the Intel web site.

NetWare 6.5 ships with bad Intel LAN drivers.

I had a very similar problem; loading new drivers cleared it right up.
Is all the firmware flashed and up to date on the server? Backplane, PERC, BIOS?
ColebertAuthor Commented:
first, thanks for the outpouring of responses.

on firmware:

PERC: Firmware, BIOS, and drivers all latest from Dell
BIOS: Latest from Dell
BACKPLANE: no backplane present.


on  Intel Pro/100+ Dual Port Server Adapter:

I downloaded the latest driver in hdetect from Intel and restarted the server.


ran a consistency check of the logical drive in the PERC2/DC BIOS and it claimed nothing was wrong.

Will now try PsiCop and WBM's advice.

and tried to copy a huge grouping of files (Groupwise install files) and it still froze.


ColebertAuthor Commented:
should have been:

on  Intel Pro/100+ Dual Port Server Adapter:

I updated the latest driver in hdetect from Intel and restarted the server.

and tried to copy a huge grouping of files (Groupwise install files) and it still froze.
ColebertAuthor Commented:
ok... onto the eDirectory update.  that's out.

that didn't help anything either.   after i RESET SERVER, it restarted and within 2 minutes was frozen.  ...and I didn't even touch it or do any file copies.
Have you tried backing out SP2 to see if the server still freezes? If it does not, then I would think something in SP2 is having issues with the hardware. I will agree with PsiCop that it is a hardware issue. He has never steered me wrong, and has helped me numerous times. Also, I am sorry if I missed it, but did you apply any post-SP2 patches?
To follow up on eberhardt2329's advice, how did you install SP2? Did you install from the original NetWare v6.5 media and then apply SP2 separately, or did you get the NetWare v6.5 SP2 Overlay CDs from the Novell Support site and install from those?

If the former, I would suggest re-doing the installation using the Overlay CDs.
ColebertAuthor Commented:
thanks for the advice guys.

however, the problem began almost immediately after installing the entire OS.  it was a clean install.  i thought installing SP2 might help, but it didn't.  
Could we possibly have the old client side caching issue that NW is infamous for ?
ColebertAuthor Commented:
btw, right now there are only two users configured to use this server.   admin and myself.  no one connects to it except me.  

i'm in a mixed environment.  i have two Windows 2k3 and 2k DCs.  The Novell doesn't use DNS or DHCP.  It has a static address and is registered on the MS DNS servers.  

ColebertAuthor Commented:
by "doesn't use DNS" i mean "isn't a DNS Server."
I dunno - generally the client-side caching resulted in corrupt or inaccessible files, not server lockups.

I'd suggest wiping the server and performing an install from the Overlay CDs.

I recently installed NetWare v6.5 SP2 (from Overlay CDs) on an old ProLiant 3000 (dual 450 MHz CPUs, 1.6 GB RAM) and ran into ABENDs and lockups. I eventually traced them down to a bad SDRAM DIMM.

Are you enabling Native File Access support? Starting with NetWare v6.5 SP2, NetWare using NFAP can participate in the Windoze Domain as a DC. I'm wondering if you have that turned on.
ColebertAuthor Commented:
lol, if I did I didn't consciously do it.  like i said, i'm no expert.  i just got told to do it so i started learning.  

how do you think doing a total reinstall would solve the problem?  
"how do you think doing a total reinstall would solve the problem?  "

Very good question. And it's one of those mysteries of complex operating systems. I've had a system where I installed a base version of NetWare and then tried to apply the latest SP and it just didn't work out right and my system was unstable. But when I installed, on the same hardware, using the OS with the SP already overlaid, it went great - no problems at all.

What I'm thinking is that the base install made some mistake, during the install, that's since been fixed by an SP, but the fix didn't propagate thru your system because it was applied after the install had already done its thing. Exactly what that fix is or where it is I'm not sure, but this is my supposition.

And I've just generally gotten better results doing installs using the OS with the latest SP already overlaid. Just seems to work smoother.

Whether or not the Native File Access support was turned on is, in part, driven by what "personality" or server "template" you chose during your setup. You may recall shortly after the setup program switched to the GUI interface, you were given a set of pre-configured server packages, selectable by radio button. I dunno which one you chose, but NFAP support would be affected by that choice.

I still think a hardware problem is on the table as a possible cause. We have NetWare v6.5 running on a variety of DELL boxes using PERC boards (which I don't like, I think they're a second-rate RAID board when compared to something like the Compaq SmartSCSI Array controllers) and we don't have any issues like this (and we're almost entirely NetWare v6.5)

How 'bout the PSM?  Are you running the current MultiProcessor Support Module for that model server?
My first encounter with the CSC issue caused high server util and server lockups. That was the infamous SP2 for 5.1 and, as a lot of us know, Novell has been "fixing" this one for a very long time. (It was also present in SP1, just not as pronounced.)

If I had a nickel for every post that I have seen concerning that issue ...

As for a possible fix, try this:

Client - Novell Client Configuration, Advanced Settings: File Caching=off
Server - Set parameters, NCP: Client file caching enabled=off
Server - Set parameters, NCP: Level 2 oplocks enabled=off
BTW, using the installs with the latest SPs already applied is simple - all ya gotta have is a CD-ROM burner. I gave the links above, you download the files, unpack them and then burn the resultant ISO images to CDs. There are two, OS and Products. The OS CD is bootable (assuming the server's hardware supports that, and I'm fairly sure the Dells do) and boots to the install.
ColebertAuthor Commented:
i understand the theory behind just doing a reinstall.   i'm gonna hold off on that for a few more days since i can try some of the other solutions, etc.

keep in mind, though, that I also used the trial OS and PROD discs from the website.  my site is on an open educational license and we're supposed to use that smartcert application to license everything, but i couldn't figure out how to use it to download Netware, so I just got the trial and figured I could license it after the fact.  

am i missing something here?
ColebertAuthor Commented:
i think i just selected the most generic server role and said i'd pick and choose the specific parts later.  
All the modern NetWare products work the same way insofar as licensing goes. You can download the trial product or the latest overlay images and install them in trial mode (no license) and install a license later.

Frankly, I think it's downright stupid that Novell does not provide the latest overlay images as their "trial" version downloads. Best foot forward, only one chance to make a good first impression, that sort of thing. But they've always been a little slow when it came to marketing themselves. Just drives me up a wall to see amateur hour in the marketing department when the rest of the org is anything but.

pgm554, you could be right, but I'd honestly suspect a buggy driver or failing hardware first. In my experience, the OpLocks bit (which HAS been an ongoing problem, you're right about that) generally manifests itself with client-side problems as opposed to server-side problems. I'm not saying it can't be the OpLocks, but I think that's a zebra solution right now, given the info at hand (altho if that did turn out to be the issue, I'll be the first to stand in awe of you :-)

As for ShineOn's comment regarding which Platform Support Module (PSM) to use, we use MPS14.PSM on our Dell servers. As ShineOn suggests, be sure that the install did not erroneously choose ACPI.PSM. At the NetWare console prompt, enter --> MODULES *.PSM

The result should look like:

   Loaded from [C:\NWSERVER]
   Address Space=OS
   blah blah blah

If it says "ACPI.PSM", you need to change that in C:\NWSERVER\STARTUP.NCF. Easiest way to do that is use NWCONFIG to edit the file. At the console prompt, enter --> NWCONFIG

Choose NCF Files Options and Edit STARTUP.NCF and replace LOAD ACPI.PSM with LOAD MPS14.PSM
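
The edit itself is a one-line substitution. As a minimal sketch only, here is that substitution in Python, run against a copy of the file on a workstation rather than on the server (the sample file contents and the function name `swap_psm` are hypothetical, not taken from this server):

```python
import re

def swap_psm(ncf_text: str, old: str = "ACPI.PSM", new: str = "MPS14.PSM") -> str:
    """Return ncf_text with any 'LOAD <old>' line changed to 'LOAD <new>'."""
    # Match a whole line of the form "LOAD ACPI.PSM", ignoring case and
    # surrounding spaces/tabs, and leave every other line untouched.
    pattern = re.compile(
        r"^([ \t]*LOAD[ \t]+)" + re.escape(old) + r"([ \t]*)$",
        re.IGNORECASE | re.MULTILINE,
    )
    return pattern.sub(lambda m: m.group(1) + new + m.group(2), ncf_text)

# Hypothetical STARTUP.NCF contents, for illustration only.
sample = "LOAD ACPI.PSM\nLOAD PEDGE3.HAM\n"
fixed = swap_psm(sample)
print(fixed)
```

A file that already loads MPS14.PSM passes through unchanged, so the function is safe to run twice.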
ColebertAuthor Commented:
it says i'm using MPS14.PSM.  

i ran a test on the ram and it's fine.   this server used to be a Server 2000 box, but we retired it to move our Netware 5 server over to it.   But instead we decided to make a new tree and go with a whole new server then transfer the eDirectory info.  (Mostly because I had no confidence I wouldn't completely screw up that production server.)

The reason I'm saying this is that that box worked as a 2k server for over 3 years without hardware issues.  Maybe that's why I think it's still a driver issue.  The only thing different I did with that box is pull the 18G SCSIs out and put in 36G SCSIs that were working.   Furthermore, i've run the consistency test off the PERC drivers and it passed.  

I'm not entirely clear on how to slipstream the SP2 into the OS disk.  How I originally installed the OS was to make the array, format the OS to DOS 6.22, bust the ISO and burn it to CD.  Copy the contents onto the drive.  Then run the install from the disk.  Hacked through getting it set up and understanding ConsoleOne and all the web-based controls.  Started seeing lockups, upped the FWs on everything, latest drivers for the PERC, and then tried SP2.  
Well, yes, I tend to agree with you, with that history, failed hardware is somewhat unlikely. I still think it's worth your time to open it up and make sure everything is properly seated and wasn't jostled loose when you switched drives out. But you're probably right in that it's not an outright hardware failure.

Well, you certainly came up with an interesting install procedure. Nominally, once you build the array, you can do the entire install directly from the CD - it'll even build the DOS partition and put a DR-DOS install on it.

That said, I like to build my own MS-DOS v6.22 (Redmond's best-ever product, once they stripped out the stuff they stole from Stac) DOS partition. But after that, I boot to the CD and let the NetWare installer do everything else (as opposed to booting to DOS, loading the CD-ROM drivers, and running the install from the CD).

I'm not sure why you copied stuff from the CD to the HDD and did the install from there. It's all designed to work from those CDs.

I really think that, once you make sure everything is seated and plugged properly, you should boot up into MS-DOS, make sure the CONFIG.SYS has nothing more in it than a FILES= statement, and that the AUTOEXEC.BAT does nothing more than set a PATH, perhaps a PROMPT, and then CDs to C:\NWSERVER and executes SERVER.

Once I made sure the DOS startup files did just those things, I'd blow away C:\NWSERVER (and all its subdirs - DELTREE works nicely) and any other NetWare-related subdirs (like C:\NWINST), and then run FDISK to get rid of the NetWare partition. I'd reboot (make sure the CD-ROM is set as the first boot device) with the NetWare v6.5 SP2 Overlay OS CD in the CD-ROM drive and re-do the installation. If you wanted, you could even skip killing the NetWare partition and just do a new server install and re-create just the SYS: Volume (leaving any other Volumes intact).

Since you're new to the NetWare world, some important tips:

1) NetWare v6.5 prefers NetWare Storage Services (NSS) to the older Traditional (FAT) Volumes. This is fine, NSS is MUCH faster when mounting Volumes and delivers good file service. However, inexperienced admins may make the mistake of not giving the SYS: Volume its own NSS Pool. Devices have Free Space, Free Space can be allocated to NSS Pools, Volumes are carved out of NSS Pools. The issue is that, by default, when a Volume is created in an NSS Pool, it is allowed to grow to the size of the pool. If SYS: is created in the same NSS pool as another Volume (say, DATA:) and the other Volume is not capped in terms of its growth, it is possible for SYS: to run out of space (if the other Volume takes up all the free space in the pool). This is a Very Bad Thing (tm) in the NetWare environment. If SYS: runs out of space, the Transaction Tracking System (TTS) will shut down, and TTS is crucial to maintaining integrity of the NDS database. At best, the server will choke, and you'll have to bring it up and clean out some space on the other Volume to give SYS: room to grow. At worst, you'll corrupt the NDS database on the server and give yourself a mess to clean up (this is what happened to us where I work, where a previous admin - who no longer works here - had built a NetWare server as an NDS replication server, and where SYS: shared an NSS pool with other Volumes, and the other Volumes were not limited in growth, and when they grew too large, SYS: was starved for space - fortunately, we are an NDS shop, and recovery of the corrupted NDS database was a little tedious, but the user population was never affected).

2) Do NOT allow users Write permissions on the SYS: Volume. Reserve the SYS: Volume for system software and administration tools. Create other Volumes and put user home directories, application software, databases, E-Mail, etc. etc. on those Volumes. You do not want users to have the ability to run the SYS: Volume out of space (see #1 above).

3) Since you're coming to NetWare from a Windoze/AD environment, you're in for some shocks:

a) You can repair the directory services databases ON THE FLY. There is no rebooting to a special "directory repair" mode. Simply load DSREPAIR at the console and away you go. At worst, you'll temporarily lock the NDS database, preventing users from authenticating during that window. Logged-in users will be unaffected.

b) You can have any server host an NDS database, or remove an NDS database from any server hosting one, pretty much at will. You do not have to rebuild the server to accomplish this.

c) You can assign both Directory Service rights AND File/Directory rights using objects other than Users and Groups. For example, instead of creating a Group called "Faculty" and using that to assign rights, if you have an OU in your NDS Tree called "Faculty", you can assign rights (rights to the NDS tree, or rights to a Volume/Directory/File) to that OU. EVERY USER OBJECT IN THE OU gets those rights. And if you move BobSmith's existing user object into that OU, Bob will get those rights AUTOMATICALLY (no intervention on your part, aside from moving his user object into the OU). And if later Bob's user object is moved out of the OU, he loses those rights, again AUTOMATICALLY.

d) Rights changes are dynamic. At most, the delay is as long as it takes to replicate thru the NDS tree. Thus, users do NOT have to logout and log back in to get their new rights. If Sally needs permissions to read files in FACULTYSERVER/DATA:GRANTS\FEDERAL and she calls you, then when you grant them to her user object, the change is pretty much effective immediately. Sally will be able to access the files/directory without having to logout and log back in. The same applies to removing rights.

e) If a user has no rights in a sub-directory structure, then the user will be unable to even see that subdirectory structure. Going back to Sally, if she had read permissions in FACULTYSERVER/DATA:GRANTS\STATE but had no rights in FACULTYSERVER/DATA:GRANTS\FEDERAL, Sally would not even see that the "FEDERAL" subdirectory existed. As far as she would see, the only subdir in FACULTYSERVER/DATA:GRANTS would be "STATE" - the other one would be invisible. This is good for making sure Harold the Hacker in your freshman class has one less thing to stumble across at 4am while his dorm buddies are sleeping off their drunken stupor.

f) The filesystem rights you can assign are MUCH more granular than you're used to. For example, you can give Bob Smith, the professor, the ability to grant rights in a subdirectory to other users, even though he doesn't necessarily have total control of that subdirectory. Let's say that Bob puts class assignment resources in STUDENTSERVER/DATA:CLASSES\BOBSMITH\CSC111. If you give Bob the "Access Control" right in that directory in addition to the other "normal" rights he gets, Bob can now, without your intervention, assign rights in that directory to other users, like Mary, who just added his class late.

4) You're going to find that, generally speaking, NetWare will consume less hardware, and need less maintenance, than a Windoze environment serving the same number of users and providing the same services. Expect that your hardware purchases will have a longer useful lifespan than you've been getting out of them, and that you spend less time providing care and feeding to the server environment. Once properly set up, It Just Runs.
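
Tips 3c and 3e above can be sketched as a toy model. This is only an illustration of the idea, not how NDS or the NetWare file system is implemented: the `Tree` class, its method names, and the two-letter rights ("R" for Read, "F" for File Scan) are assumptions made for the example, and it ignores inherited rights filters, nested containers, and the fact that real NetWare also shows the parent directories leading down to a place where you hold rights.

```python
from dataclasses import dataclass, field

@dataclass
class Tree:
    # user name -> name of the OU the user object currently lives in
    user_ou: dict = field(default_factory=dict)
    # trustee (user name or OU name) -> {directory path: set of rights}
    grants: dict = field(default_factory=dict)

    def grant(self, trustee, path, rights):
        """Assign rights on a directory to a user or to an OU."""
        self.grants.setdefault(trustee, {}).setdefault(path, set()).update(rights)

    def effective_rights(self, user, path):
        """Union of rights held by the user and by the user's OU.

        A grant on a directory also applies to everything beneath it.
        """
        rights = set()
        for trustee in (user, self.user_ou.get(user)):
            for granted_path, granted in self.grants.get(trustee, {}).items():
                if path == granted_path or path.startswith(granted_path + "\\"):
                    rights |= granted
        return rights

    def visible_subdirs(self, user, parent, subdirs):
        """Subdirectories the user can see: those where some right applies."""
        return [d for d in subdirs if self.effective_rights(user, parent + "\\" + d)]

tree = Tree()

# 3c: rights assigned to the OU follow the user object in and out of it.
tree.user_ou["BobSmith"] = "Faculty"
tree.grant("Faculty", r"DATA:CLASSES", {"R", "F"})
bob_in_ou = tree.effective_rights("BobSmith", r"DATA:CLASSES\BOBSMITH")
tree.user_ou["BobSmith"] = "Staff"          # move the user object out of the OU
bob_moved = tree.effective_rights("BobSmith", r"DATA:CLASSES\BOBSMITH")

# 3e: Sally has rights in STATE only, so FEDERAL is invisible to her.
tree.grant("Sally", r"DATA:GRANTS\STATE", {"R", "F"})
sally_sees = tree.visible_subdirs("Sally", "DATA:GRANTS", ["STATE", "FEDERAL"])
print(bob_in_ou, bob_moved, sally_sees)
```

Note that nothing is stored per user when Bob moves: his rights are recomputed from where his object sits, which is the point of assigning rights to the container.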
Back to your ABEND, I note that your AFPTCP.NLM file is somewhat backdated. Have you looked at Novell TID #10091485?
Novell TID #10094911 also offers similar advice, but looking at the ABEND details I'm not sure it matches yours. But updating the NLM is nevertheless a very good idea.
Those are about the only two time-apropos ABEND in AFPTCP.NLM TIDs that I'm finding. Anyone else seeing anything?
ColebertAuthor Commented:
just a quick note on the AFPTCP deal.  I found that too before i ever posted here.  i thought I updated it to the latest version yesterday.   haven't seen anymore abends from that, but the lockups still come.  i thought I'd post my last ABEND and see if anyone saw anything interesting.  

only other thing I can think of that I might have done is this:

before, the array was all on one channel.  i got the brilliant idea to move two of the drives from channel one to channel two in order to help throughput.  obviously, i did this BEFORE creating the array, but now i'm beginning to wonder if maybe there's some problem with that channel two cable.  

when i blow the current OS away and start from scratch, i'll consolidate back to one channel again.
ColebertAuthor Commented:
i'm pretty sure i have all the hardware in right.  i've had the server on the floor guts all spilled out tinkering with it, changing card slots, reseating stuff, etc.  i started out with a GigE card, but thought it was bad, so i went to the intel dual port.  i wish i had a better raid card than the PERC, but it's all I have now.  I'm honestly considering re-doing the whole thing, starting from the array level and working up.  

i'm still not entirely clear on how to slipstream SP2 into the OS files.  i get the initial install procedures, but I don't think i'm gonna be able to integrate the two without a little more guidance.

Hmmm....I think we have a fundamental disconnect on SP2.

If you download the files that I gave you pointers to above, the .ISO images are for the COMPLETE NetWare v6.5 with SP2 ALREADY OVERLAID. There is no need to "slipstream" anything. You do the install from those CDs and SP2 is already applied when you get done.
Just a thought, as I am not that familiar with Dell server hardware, but with Compaq there is a BIOS configuration option where you set the NOS and version.  This has caused us some issues in the past.  Might be worth checking if you previously had W2K installed on this hardware.

Also (if this hasn't already been posted) you should make sure your autoexec.bat file is not configured with a himem statement.
ColebertAuthor Commented:
ok, heres an update on my situation.

as i said before, i was running a RAID5 array of 4x36GB drives.  In order to increase throughput, I built the array across both channels of my PERC2/DC controller.  2 on one channel, 2 on the other.

as a last step before i was going to rebuild and reformat, i consolidated all the drives back onto one chain.  

i've been almost 18 hours with uptime since the consolidation.  i've been copying the SP2 and GroupWise source files (unzipped) back and forth across the network to the NetWare server without incident.  

Things are looking promising.  However, I have noticed my transfer rate has declined since the consolidation.  Before, I would get an estimate of 8-12 minutes to move the entire GroupWise source directory from my computer to the NetWare server.  Now i'm getting 18-24.   I'm guessing this is due to the throughput loss resulting from my consolidation onto one chain.

Anyone got any thoughts about this?  Also, got any good recommendations for a possible replacement RAID adapter?  My PowerEdge 1400 is 66MHz/32Bit PCI, so that rules out PCI-X.  My drives are Ultra160.   My budget is probably 150-200.  It doesn't have to be new.
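
For anyone weighing the numbers in this post, here is a rough back-of-envelope sketch of the theoretical peak rates involved. It uses only figures quoted in the thread (66MHz/32-bit PCI, Ultra160 drives, an Intel Pro/100+ NIC, and the 8-12 vs. 18-24 minute estimates); the function and variable names are made up for the example, and real throughput depends on the drives, RAID 5 parity overhead, the controller, and the network, none of which this models.

```python
def pci_bandwidth_mb_s(clock_mhz: float, width_bits: int) -> float:
    """Theoretical peak PCI bandwidth in MB/s: one bus-width transfer per clock."""
    return clock_mhz * width_bits / 8

ULTRA160_MB_S = 160.0     # peak of one Ultra160 SCSI channel
NIC_MBIT_S = 100.0        # a Pro/100 port is a 100 Mbit/s Fast Ethernet link

bus = pci_bandwidth_mb_s(66, 32)                 # 264.0 MB/s
one_channel = min(ULTRA160_MB_S, bus)            # 160.0 MB/s ceiling
two_channels = min(2 * ULTRA160_MB_S, bus)       # 264.0 MB/s, capped by the bus
nic = NIC_MBIT_S / 8                             # 12.5 MB/s

# Reported copy-time estimates in minutes, before and after consolidation.
before, after = (8, 12), (18, 24)
slowdown_low = after[0] / before[1]              # 1.5x in the mildest case
slowdown_high = after[1] / before[0]             # 3.0x in the worst case

print(bus, one_channel, two_channels, nic, slowdown_low, slowdown_high)
```

On paper a single Ultra160 channel still has far more headroom than a 100 Mbit/s link can use, so these peak figures alone do not account for a 1.5x to 3x slowdown on a network copy; they only bound what each component could do at best.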


ColebertAuthor Commented:
ok.  i'm closing this topic and awarding points.
Thanks for the points. I don't have any recommendations concerning RAID adapters. My main experiences have been with the Compaq SmartSCSI line and the Dell PERC, and, well, you already know what I think of the Dell PERC. And the Compaq SmartSCSI is not an option for you.
All the PERC is, is just an OEM Adaptec RAID controller.
There are other decent brands; the only one that comes to mind is LSI Logic, for the simple reason that they tend to have better drivers for Novell.
Most of the newer Adaptecs (2100 series) use something called an I2O driver, which Novell broke in some of their service packs. There was a workaround that involved creating a shutdown.ncf file to unload the drivers so that the system would not hang on a down command.

Adaptec Novell drivers over the years have ranged from usable to s**t.

ColebertAuthor Commented:
this PERC is an LSI, i believe.  it's a PERC2.  PERC3 and PERC4 went to Adaptec.
