Windows 2003 Server DFS and SANs

Posted on 2009-06-29
Last Modified: 2012-05-07
I have a couple of questions here.  I have been in my current position for about 5 months and there was and still is much to clean up here.  First and foremost, I have many single points of failure that I need to make redundant.  I am hoping that DFS can help me with this.

Here is my first challenge.  Both of my AD servers are running on W2K3 server - not R2!  I don't have any experience with non-R2 W2K3 server, and DFS/FRS seems to work differently or at least has a different GUI.  I have increased storage on both servers so that I can set up replication for our 'public', 'users' and 'dept' shares.  However, when I start the FRS process, it quickly fails (event 13508, 13544, 13552) after replicating only a small percentage of the files, and then never recovers.  I have googled this until there is nothing new to read on the errors I am getting.  So, here is my question.  Will upgrading these two servers to R2 resolve this issue, or is there something else screwing me up here?

Second question: If I have two servers that both have access to two SAN drives, with both of them sharing those drives under the same share names, can I configure a DFS share that has them both as root shares, but not enable FRS (no need to)?  That way, if one server is down, the other will automatically service any requests to the DFS share.

Question by:david_griswold
LVL 11

Expert Comment

ID: 24738154
In relation to your second question, the answer I think is 'no'.  The reason is that only one server may access any single disk at one time.  If you want to present a disk to 2 servers, you need to set up a cluster.  Of course you could set up the disk to be available to both (wiring), but then when there is a server failure, you would have to manually intervene to actually connect the disk so the data could be shared, AND there is the worry that Windows may try to 'help' and load the disk anyway, causing disk and data failures.
LVL 11

Expert Comment

ID: 24738178
As for the first, there are no guarantees in life, only death and taxes.  My experience has shown that upgrading in hopes of fixing an issue has failed 9 out of 10 times for me.  From your description, I would be looking at the network connections.  If FRS starts and is working and fails halfway through, I am thinking that network lag or latency may be causing too many errors, and Windows is saying 'enough'.

There are some specific FRS logs, but I would have to dig around to find out where they are hiding.

Author Comment

ID: 24738620
The IBM Storage Manager software allows for configuring a group of hosts to access the same logical drives.  There is nothing in the documentation on how the system deals with any possible contention for resources on the logical drives.  I assume, since the OS accesses the resources through a driver that talks to the IBM Storage System, that the driver would handle this issue.

As for network issues, they are both connected to the same gigabit switch.  I would assume that isn't the issue.

LVL 11

Expert Comment

ID: 24738652
It has been my experience that drivers do not take care of the disk issue.  But that is just my opinion, and it would not be the first time I was wrong.

Have you checked the speed and duplex on the NICs?  Are the switch ports reporting any errors?  Perfmon will also tell you if there are errors (and monitor lots more), but run it during a test of FRS.
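
To make the "run it during a test of FRS" idea concrete, here is a rough Python sketch that compares two snapshots of per-NIC error counters, one taken before the FRS test and one after, and reports only the interfaces whose counters went up. The field names, the NIC names and the sample numbers are all made up for illustration; you would fill the snapshots from whatever you actually collect (a Perfmon export, the switch port statistics, and so on).

```python
from typing import Dict, NamedTuple


class NicCounters(NamedTuple):
    """One snapshot of error counters for a NIC (field names are illustrative)."""
    rx_errors: int
    tx_errors: int
    rx_discards: int
    tx_discards: int


def counter_deltas(before: Dict[str, NicCounters],
                   after: Dict[str, NicCounters]) -> Dict[str, Dict[str, int]]:
    """Return the per-NIC counter increases between two snapshots.

    Only NICs present in both snapshots and with at least one counter
    that increased are included in the result.
    """
    deltas = {}
    for nic, end in after.items():
        start = before.get(nic)
        if start is None:
            continue  # NIC was not in the first snapshot, nothing to compare
        change = {field: getattr(end, field) - getattr(start, field)
                  for field in NicCounters._fields}
        if any(value > 0 for value in change.values()):
            deltas[nic] = change
    return deltas


# Hypothetical snapshots taken before and after an FRS replication test.
before = {"LAN1": NicCounters(0, 0, 0, 0), "LAN2": NicCounters(5, 0, 1, 0)}
after = {"LAN1": NicCounters(0, 0, 0, 0), "LAN2": NicCounters(47, 0, 1, 0)}

suspect = counter_deltas(before, after)
for nic, change in suspect.items():
    print(nic, change)
```

A NIC that shows up in the result while FRS is running would be worth a closer look at its speed and duplex settings; an empty result suggests the problem is somewhere other than the link itself.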

If you want to look at the FRS logs, try this

Author Comment

ID: 24739008
I just found the info I was looking for concerning hosts sharing LUNs in the SAN via a Host Group.  This function is only for clustered servers - which would eliminate the need for DFS.

I am currently rerunning the FRS initialization and gathering logs and performance data.

Accepted Solution

david_griswold earned 0 total points
ID: 25033819
I restarted the DFS process again and just left it alone.  There were some errors (event 13508, 13544, 13552), but after a while they cleared up and the replication process started again.  I had some additional issues with clients getting connected to one server or another when I wanted only one server to be the primary, and I was able to resolve this by using 'dfsutil.exe' to set the primary server.


Question has a verified solution.
