Question

NT4.0 Server Kicks out users from network!

Asked by: Allan_Shiels

Hi there,

We seem to be experiencing a real nasty problem with one of our customers NT4 servers.

Site info:
The site has 2 NT4 servers, one a PDC and the other a BDC.  The email system is kept on the PDC and the
data is all kept on the BDC.  The workstations are a mix of windows 98, XP and 2k.

The network is CAT5 Throughout 10/100  standard except for a gigabit link between the two servers which
are situated in different buildings from each other.

Problem info:
What happens is, at random intervals, the BDC decides to just throw everyone off the network.  Anyone in any network docs or apps which are situated on the BDC  are thrown out.  Any mapped network drivers to the BDC become unavailable.

Double clicking on the BDC in network neighborhood from a workstation during this time gives the message "the server is not accepting this type of request at this time, please try later"

However, the plot thickens.  The BDC is still pingable!  and also we can still VNC into the server desktop, so the IP layer of the network on it still works ok.  It just seems to kick the users out, its like the browser service just decides to stop working or something!

The only way we are resolving this issue is by rebooting the BDC.  However, on reboot the server always hangs at the windows NT4 server splash screen.  It then needs to be rebooted again before it all works again.

NT Event logs show no anomalies whatsoever!

Additional info:
Mostly happens through the night, sometimes in the afternoon though.
Set up replication service between BDC and PDC to run one night, BDC crashed out that night and needed rebooting in the morning (could be co-incidence, could be linked)


Server spec:
Windows NT4.0 Server with SP6a
Intel pentium DUAL XEON 2.8
512MB DDR
3x 36GB SCSI Raid 5 array HOT SWAP
Intel on board Gigabit Lan card
HP 20/40 Tape drive using veritas 8.6 running nightly backup

This is an urgent problem, hence the 500 pints.  I mean points :)


This Question has been solved and asker verified All Experts Exchange premium technology solutions are available to subscription members.

Subscribe now for full access to Experts Exchange and get

Instant Access to this Solution

  • Plus...
  • 30 Day FREE access, no risk, no obligation
  • Collaborate with the world's top tech experts
  • Unlimited access to our exclusive solution database
  • Never be left without tech help again

Subscribe Now

Asked On
2004-10-27 at 02:51:29ID21184238
Tags

bdc

Topics

Windows NT Operating System

,

Tivoli Storage Manager

Participating Experts
4
Points
500
Comments
30

Trusted by hundreds of thousands everyday for fast, accurate and reliable tech support.

  • "The time we save is the biggest benefit of Experts Exchange to Warner Bros. What could take multiple guys 2 hours or more each to find is accessed in around 15 minutes on Experts Exchange." Mike Kapnisakis, Warner Bros.
  • "Our team likes having a resource that is more secure than just using Google and most experts using this service really know their stuff. It's nice to look here first versus using Google." Dayna Sellner, Lockheed Martin
  • "Anytime that I've been stumped with a problem, 9 out of 10 times Experts Exchange has either the accepted solution or an open discussion of the potential solution to the problem." Kenny Red, eBay Inc.

See what Experts Exchange can do for you.

Got a question?

We've got the answer.

Experts Exchange has been collecting answers to technology questions since 1996…3 million and counting! If you have a question, chances are we already have your answer.

Screenshot of Experts Exchange Knowledgebase

Need individual assistance?

Our experts are ready to help.

If you can't find the exact answer you're looking for, ask our exclusive community of 50,000 experts. You’ll get a personalized answer from a trusted professional.

Screenshot of Experts Exchange Knowledgebase

Want to learn from the best?

Read articles from industry experts.

Thousands of free tech tips, tricks, how-to’s and tutorials are available in our peer reviewed articles section. See for yourself how smart our experts are, no login required.

Screenshot of an Article

Working on a long term project?

Store your work and research.

Save solutions to your questions, answers you’ve discovered through searching plus helpful articles in your personal knowledgebase for easy future access.

Screenshot of Experts Exchange Knowledgebase

Access the answers to your technology questions today.

Subscribe Now

30-day free trial. Register in 60 seconds.

What Makes Experts Exchange Unique?

Members of the expert community talk about why the experience at Experts Exchange is different than what you will find anywhere else.

Trusted by the world's most respected brands.

image of each brand's logo

Faithfully serving IT professionals since 1996.

Experts Exchange Logo

Try it out and discover for yourself.

Subscribe Now

30-day free trial. Register in 60 seconds.

Related Solutions

  1. Veritas
    this is a real simpl question (but not for me - doah!) one of our offices has just sent over some data on a dds4 tape but how do i open up veritas to see whats on this tape? version 8.6
  2. Tape capacity
    I have Veritas Backupexec 9.1 running on a NT4.0 SP6. I just acquired a new SDLT320 drive and a new set of SDLT1 tapes. When i mounted the tape, Veritas would show that the tape capacity is only 149GB instead of 160GB. I tried all my tapes and all of them are reporting the wr...
  3. Tape Catalog with Veritas
    I'm running Windows 2000 AS with a 2 terabyte ataboy in a closed network with about 10 clients. I'm running Veritas 9 with a Dell 132 (20 tapes) LTO changer as a backup solution. Here's what happened, A user asked me to restore a file from a few days ago. I lit off a r...
  4. Veritas tape library
    I have a Dell Power vault 132T that is running Veritas 9.1. There are currently 2 cartrigages that hold 7 tapes each for a total of 14 tapes. The issue that I am having is that, there seems to be no order when the backup job runs. It will pick random tapes from both cartridga...

Free Tech Articles

  1. WARNING: 5 Reasons why you should NEVER fix a computer for free.
    It is in our nature to love the puzzle. We are obsessed. The lot of us. We love puzzles. We love the challenge. We thrive on finding the answer. We hate disarray. It bothers us deep in our soul. W...
  2. SCCM OSD Basic troubleshooting
    SCCM 2007 OSD is a fantastic way to deploy operating systems, however, like most things SCCM issues can sometimes be difficult to resolve due to the sheer volume of logs to sift through and the dispe...
  3. Migrate Small Business Server 2003 to Exchange 2010 and Windows 2008 R2
    This guide is intended to provide step by step instructions on how to migrate from Small Business Server 2003 to Windows 2008 R2 with Exchange 2010. For this migration to work you will need the fo...
  4. Create a Win7 Gadget
    This article shows you how to create a simple "Gadget" -- a sort of mini-application supported by Windows 7 and Vista. Gadgets can be dropped anywhere on the desktop to provide instant information, ...
  5. Outlook continually prompting for username and password
    There have been a lot of questions recently regarding Outlook prompting for a username and password whilst using Exchange 2007. There are a few reasons why this would happen and I will try to cover t...
  6. Backup Exchange 2010 Information Store using Windows Backup
    There seems to be quite a lot of confusion around the ability to backup Exchange 2010 using the built in Windows Backup feature. This stems from the omission of this feature prior to Exchange 2007 s...

Cloud Class Webinars

  1. Avoiding Bugs in Microsoft Access
    Alison Balter takes and in-depth look at avoiding bugs in Access. In this webinar you will learn about using the immediate window to debug your applications, invoking the debugger, using breakpoints to troubleshoot, stepping through code, setting the next statement to execute, ...
  2. Top 10 Best New Features in Visio 2010
    Scott Helmers gives live demonstrations of the top 10 new features in Visio 2010. This webinar will teach you how to create compelling diagrams by adding shapes to the page with a single click, linking the shapes in a diagram to data in Excel (or SQL Server, or SharePoint), ...
  3. IT Consultant Business Secrets Revealed
    Michael Munger, Experts Exchange tech pro and IT consultant, pulls back the curtain on his very successful businesses and answers question on every IT consultant and business owner should know about. He shares secrets on what he did to solve the 5 most common problems in IT, ...
  4. Disaster Recovery and Business Continuity
    Quest CTO, Mike Billon, gives an overview of the steps involved in building a dunamic disaster recovery plan. Through case studies and an examination of software/hardware tooles for monitoring and testing, you'll gain a better understandin of where you are, where you want ...
  5. Organize Your Visio Diagrams with Containers and Lists
    Scott Helmers uses cross functional flowcharts, wireframe diagrams, data graphic legends and seating charts to teach you: how to ustilize all three new structured diagram components in Visio 2010, the best practices for organizeing shapes in previous version of Visio, how to organize ...
  6. How to Us Objects, Properties, Events and Methods in Microsoft Access
    Alison Dalter gives an in-depbth look at objects, properties, events and methods in Microsoft Access. In this webinar you will learn about using the object browser, referring to objects, working with properties and methods, working with object variables, understanding the ...

Join the Community

Give a Little. Get a Lot.

Join the community of experts here and help other tech pros by answering question in your area of expertise. You can earn FREE access to all Experts Exchange's premium features and resources.

Join the Community

Answers

 

by: kz20flPosted on 2004-10-27 at 04:35:47ID: 12420545

Have you tried resetting the domain secure channels to the PDC when this happens, e.g. run the command

nltest /server:bdcname /sc_reset:domain\pdcname

try reapplying SP6a also

 

by: Allan_ShielsPosted on 2004-10-27 at 05:33:04ID: 12421109

I Have not tried either of these.

What does resetting the domain secure channels do exactly?

 

by: kz20flPosted on 2004-10-27 at 05:46:39ID: 12421253

Secure channels are the mode of communication between servers and domain controllers. they can drop out and give you access problems, however I would probably expect to see these more on a member server than a DC, but anyway, worth a try. Resetting them will re-establish the secure link between the DCs (you will need to get nltest.exe from the NT4 Resource kit though)

I would also look at your paged-pool and non-paged pool memory usage, memory handle count, and the status of the Netlogon service, amongst other things. Does the server return an error code (a number) when all the connections drop? Are there a large amount of open connections (check the Server applet in Control Panel)?

 

by: kz20flPosted on 2004-10-27 at 05:47:59ID: 12421269

I would also disable the replication and use robocopy (from the ResKit again) as a scheduled job instead. It may help narrow things down.

 

by: Allan_ShielsPosted on 2004-10-27 at 05:53:42ID: 12421330

I have disabled the replication service.

The main problem is that when it occurs ithroughout the day the customer just goes and restarts the server themselves, thus giving us no options to connect in remotly and check services and things out!

I have told them to let us know straight away when it happens so i can try and obtain new information, which i will post back here.  could be days though.

 

by: vivekparaPosted on 2004-10-27 at 17:16:12ID: 12428869

It sounds like the RPC service (port 493, I think) is taking a hit of some sort.  Have you done a virus sweep recently?  Some viruses will attempt to attack this port...and with the Microsoft patches, it denies access...but still keeps the service responding to the attempts.

What does the event log say in the System area regarding events during these "blackouts"?  You will normally see errors against the RPC service at this time.

Besides viruses, if you are using the BDC without a server WINS enabled, you will often get browser elections that can foul things up.  Normally the BDC will become the Master in these elections and when overloaded, it may not be able to service requests.

The reason I highly believe its the RPC is because you can still Ping.  Pinging is just at the Network layer while the RPC is at the application layer...you would be able to ping but not get RPC requests.  I have a feeling if you can use Network analyzer when this occurs, you should be able to see "where" the problem is coming from.

 

by: vivekparaPosted on 2004-10-27 at 17:29:41ID: 12428967

Port 135 is RPC endpoint mapper and 139 is NETBIOS.  I think these are the ones to check out.  I don't know HOW I got 493...

 

by: Allan_ShielsPosted on 2004-10-28 at 02:24:30ID: 12431665

I will connect in and have a look at the event log from the last time it happened.

 

by: Allan_ShielsPosted on 2004-10-28 at 02:42:32ID: 12431731

There is just absolutely no logging in the event log whatsoever at the times of this.  The only ones are 'event log is being shut down' (presumably when they shut the server down) then after that its the good old 'windows NT4 sp6 multiprocessor free' message when it starts up.

However, a few weeks back there is two red warnings from 'aacdisk' during bootup stating that the file system is corrupt and unusable and to run chkdsk.  

I just ran chkdsk and it said it found some minor inconsistisies, but i just dont think this is causing the problem.  Unless the network card is faulty and its taking corrupt packets from the network and putting it on the hard disks?

 

by: TannerManPosted on 2004-10-28 at 04:14:25ID: 12432340

I to had this problem with a BDC on a multi-spoked WAN and never could resolve, even with the suggestions metioned above.  It was, for me, most deffinitely, the loss of the secure channel but I could never restore it. A reboot would sometimes allow the BDC to pull a Browse list from the PDC and it would work ok for a while and shutdown again.

Even though I don't advice it, it was suggesed to me to remove the BDC from the domain and re-add, but that didn't help either.

I had to bring up a separate BDC for that network, move the data needed over, and shutdown the original.

DO NOT do what I did on my say so. Exhaust everything you can find including the very good advice already mentioned by the above posters.

 

by: Allan_ShielsPosted on 2004-10-28 at 04:17:43ID: 12432359

Tannerman, did you try replacing the network card?

I am going to try that, then reinstall SP6 after that.

Also, please can you tell me the EXACT spec of the server that you had this problem with?  Software installed and all.  I know its alot to ask but we could compare specs and see if there are any consistent apps/hardware!

 

by: TannerManPosted on 2004-10-28 at 04:23:38ID: 12432399

I did replace the NIC
I did re-apply SP 6a

NT4.0 BDC running SP 6a
No software outside of OS, just data storage.
The machine did have exchange 5.5 at one time, but instead of uninstalling (usually leaves a lot of junk behind) I just killed all exchange services.

I wish you luck on this. The site I had the problem with was part of a 13 node network with BDC's at each node  and this was the only one that gave the problem.

 

by: Allan_ShielsPosted on 2004-10-28 at 04:37:19ID: 12432475

If you dont mind me asking, what what the hardware spec.

were you running adaptec raid sottware?

 

by: TannerManPosted on 2004-10-28 at 04:57:40ID: 12432627

OH, sorry I didn't include that.
The box was an old proliant 800 Ppro200mhz 64 meg ram (well overdue for updating anyway), but no raid on the scsi drives other than software based mirroring.

 

by: Allan_ShielsPosted on 2004-10-28 at 06:25:34ID: 12433406

Well considering that your rig is substantially older and less powerful than mine, and not even running similar software this problem must surely be down to a software problem with NT SERVER.

Its the only consistency between the two incidents.

When it happens again im going to check the secure channels.

Tannerman, may i ask you how often it happened with you, and if there was any specific time?

 

by: vivekparaPosted on 2004-10-28 at 07:39:45ID: 12434291

kz2ofl may be right, then.  But normally you lose secure channels when you promote or demote DC incorrectly (or should I say PDC and BDC in NT 4.0 cases).  Here are the articles to back him up:

http://support.microsoft.com/default.aspx?scid=kb;en-us;150518

This also resets the NETDOM password.   Expect to spend an hour doing this...but it can take as little as 10 minutes if you get everything to work right.
http://support.microsoft.com/default.aspx?scid=kb;EN-US;260575

A more detailed explanation of netdom
http://support.microsoft.com/default.aspx?scid=kb;en-us;329721

I had a similar problem in a Mixed-Mode domain...and I think kz2ofl is on the money on this one.  The question is "why is it losing secure channels?"  I'm wondering about the switch or router you have between these two devices.  Is it using cut-through or spanning tree algorithms?  Sometimes, if the router builds a routing table with certain parameters, it will "misdirect" packets to the wrong address.

You may be better served moving the two servers in to the same area and letting the clients on the other end log in through the fibre channel route.  It would be a simple way to see if that is your problem.  If the problem still persists when the two computers are on the same subnet and "routing space", then you need to dig deeper.

 

by: Allan_ShielsPosted on 2004-10-28 at 07:55:15ID: 12434507

The servers are on the same subnet, the buildings are only separated by a road and a factory.

There is a CAT5 cable that runs over the road and through the factory which connects two netgear 10/100/1000 switches together.  The PDC and the BDC each hang off one of these switches, here is a crude diagram -

BDC <> 2M Patch cable <> Netgear gigabit port on switch <> CAT5 daisy chain <> Netgear gigabit port on switch <> 2M Patch cable <> PDC

I hope that makes some sense.

There are several workstations that hang off both the netgear switches utilising the 10/100 ports.

Before all this happened, the PDC was the only server there, and all the data and email was accessed on that server alone, with no problems.  Its only since we put the BDC in there and got everyone mapping network drives to that, that the problem has risen.

 

by: vivekparaPosted on 2004-10-28 at 08:23:37ID: 12434885

If it is a switch, you could still be experiencing the problem there.  Switches can direct traffic at the Physical/Network layer of the IP protocol stack.  Most modern "routers" are just dumb devices that basically just broadcast address packets across the ports...but the switch actually looks at the header and tries to "interpret" where a packet should go.

I agree that it may not be LIKELY considering a reboot seems to solve the problem...but I would consider it.

Is there a WINS server in place?  Are you using P or B node type for resolution in WINS?  Or are you using NETBIOS over TCP/IP?  If NETBIOS, it could be a broadcast storm causing your problems.  WINS reduces this significantly if used in P mode.  All computers will be resolving to the BDC...and if a broadcast storm occured, it would overwhelm its interface.

 

by: Allan_ShielsPosted on 2004-10-28 at 08:30:01ID: 12434971

We are using NETBIOS over IP.  I dig what you are saying about broadcast storms - cant faulty network cards cause these?

If so, could be tricky to track down.

What i want to do is knock the BDC back onto a 100MB Card instead of a 1000MB card.  I just dont trust NT4 and gigabit LAN.  Too old an OS with too new a tech if you know what i mean.  It cant have been rigorously tested enough in all environments.

I would also like to add at this point, that this BDC has already had SATA mirroring disks replaced with RAID5 SCSI Because we were getting corrupted data in our accounts package.

 

by: vivekparaPosted on 2004-10-28 at 08:48:53ID: 12435289

Faulty cards on your network can cause broadcast storms.  The most likely candidates are computers that have a tendency to slow down periodically.  The best way that I've found is to wait for a storm and just turn on Network Analyzer immediately and start sifting through packets against the interface.

It could be as simple as it is the card on the server...and the fact that rebooting seems to help does push things in that direction.

I would also implement a WINS server and have your DHCP assign the node (P prefereably) and WINS information dynamically to your clients.  This can reduce broadcast traffic up to 50-70 percent on some networks (on average, I say about 20-30 percent, though).  I've done a test and seen dramatic improvements in throughput.  With Win2K, this is NOT as much of a problem with name resolution being performed in DNS...but all old Win98 and NT depend on the NETBIOS for name resolution if there are no WINS services deployed...and even 2K and XP workstations are dependent on it if in a NT server domain.

 

by: ahmedbahgatPosted on 2004-11-02 at 18:53:30ID: 12479692

you may check the muber of licences using the license manager,


cheers

 

by: Allan_ShielsPosted on 2004-11-03 at 01:26:31ID: 12481247

ahmed,

Initially, i thought it was something to do with the licencing also, as the message you get when you click on the server in network neighborhood is the exact same as the message you get when you exceed client licences, and you try to map a drive!

When i first set up the server i forgot to put it up to 50 concurrent connection licences, and left it at 5.  This resulted in people not being able to connect it with the same message as they get now in network neighborhood.

We have since upped the concurrent connections to 50.

I wonder if the lisence system is working ok...

 

by: ahmedbahgatPosted on 2004-11-03 at 02:34:52ID: 12481635

if you have a look at license manager you will know if it has a problem with the yellow mark next to the installed product, it is not just adding licenses you will also need to confirm purchasing it using licenses manager again untill all marks are blue next to the ms products, not to say that you reall need to purchase it to get around that


cheers

 

by: Allan_ShielsPosted on 2004-11-03 at 02:46:33ID: 12481694

I have nice blue boxes beside the server in licence manager.

 

by: vivekparaPosted on 2004-11-03 at 07:08:11ID: 12483778

Have you tried the network monitor to see what the bandwith looks like?  Just curious.

 

by: Allan_ShielsPosted on 2004-11-03 at 07:32:52ID: 12484018

Where is the network monitor located?

 

by: vivekparaPosted on 2004-11-03 at 07:49:50ID: 12484209

It is normally located on your server if you have the Networking Tools (adminpak.msi) installed.  Its basically a discrete network sniffer which can monitor traffic.  If you want something more robust and, Network Probe is one you can buy to evaluate traffic.

A really neat system monitor, which may be more useful for you, is Visual Server Monitor.  Its by 2morrow software and they have a 30-day trial.  It will be able to monitor ALL your servers using a very visually appealing and easy-to-use interface.  I recommend trying that....its free and you won't have to deal with trying to filter your results.
Its not as comprehensive, but I bet it will help eliminate a lot of possibilities.

 

by: Allan_ShielsPosted on 2004-11-03 at 07:52:14ID: 12484229

I am going to dowload the 30 day trial then for visaual server monitor :)

20120131-EE-VQP-002

3 Ways to Join

30-Day Free Trial

The Experts

98% positive feedback on 31,087 answers since March 2000. angeliii is a Microsoft Most Valuable Professional for his work with MS SQL Server & Develoment.

He has also proven his knowledge of Visual Basic Programming, PHP Scripting and Oracle Databases.

The Experts

97% positive feedback on 10,752 answers since July 2000. lrmoore has more than 18 years experience in the networking industry.

The six-time Mircosoft MVPs specialties include firewalls, virtual private networking, and network management.

Testimonials

"...and excellent source for support... Kind of like having your very own IT dept." Electriciansnet

Testimonials

"I was apprehensive at signing up at first. However... it has already made my life as an IT administrator much easier." JaCrews

Testimonials

"WOW! You guys have great, active, and knowledgeable people on here." moore50

Business Clients

Business Clients

In the Press

"If you’ve got a question... Experts Exchange can supply an answer.”

In the Press

"...an invaluable aid for both IT professionals and those who require tech support."

In the Press

"where IT professionals provide quick answers on just about any topic"

Business Account Plans

Loading Advertisement...