Solved

Dell/EMC FC4500 & Powerpath path failures

Posted on 2006-07-13
10
489 Views
Last Modified: 2013-11-15
Hi,
 
We have clustered PE2600's with a FC4500 with 5 DAE's direct attached.
 
We experienced multiple disk failures in our last DAE and after replacing the disks and rebinding the LUN, we are unable to access the disk within the OS (2000 Server) or Powerpath.
 
Everything looks fine within Navisphere, with no faults.
We have OpenManage Array Manager on one node which fails to find the disk when a scan is done.
On the other node we don't have array manager and when doing a scan within Windows Disk Management it also fails to find the disk.
 
Within power path we see the disk (probably from before we encountered multiple disk failures), but it says that the storage device has a failed status. It also shows the two paths c3t0d8 & c4t1d8 with a status of failed and a state of closed.
I've tried running powermt restore, with and without the force option, but it just tells me: Clariion device path c3t0d8 is currently dead, and the same for c4t1d8.
Anyone know how we can regain access to this enclosure??
Any help would be very much appreciated.
Thanks,
mark
0
Comment
Question by:markmall
  • 5
  • 2
10 Comments
 
LVL 30

Expert Comment

by:Duncan Meyers
ID: 17106221
My immediate suspicision is that you may not have added the new LUN into the Storage Group that the server is in.

You'll need to start Navisphere Manager (from the fibre channel connected host)and log in.

In the Storage Tree, expand out Storage Groups. You'll see one or more storage groups. If you open up the storage groups, you'll see that they have a host and one or more LUNs in each one. Right-click on the storage group with the troublesome host in it and click on 'Select LUNs'. Click on the new LUN, click on the arrow pointing to the right and click on OK. Now reboot the host and it will see the LUN.

 
0
 
LVL 30

Expert Comment

by:Duncan Meyers
ID: 17106229
If the above solution doesn't fix your problem, then we'll need more information about your environment:

How many hosts are attached?
Are the hosts direct-connect or via a FC switch?
What is the FC4500 software revision?(click on the array serial number in Navisphere Manager, select properties and select the software tab. The FLARE code revision will be either in the format 02.06.0.x00.xx or (more likely) xx.xxxx. Could you also click on the Access Control tab and see if the Access Control Enabled tick box is ticked an greyed out?
0
 

Author Comment

by:markmall
ID: 17106683
Hi Meyersd,

Thanks for you replies.
Firstly, we have two hosts attached and they're direct connected.
Secondly, we don't use storage groups.

When I get the properties of the array, I don't have a software tab or an access control tab, I just have the following tabs:
General
Cache
Memory
Storage Access.
Am I looking in the right place? I don't see anywhere else where there's a software tab!?
0
Free Trending Threat Insights Every Day

Enhance your security with threat intelligence from the web. Get trending threat insights on hackers, exploits, and suspicious IP addresses delivered to your inbox with our free Cyber Daily.

 
LVL 30

Expert Comment

by:Duncan Meyers
ID: 17107879
Ummmm...

You're on Navisphere 5.3 or earlier I guess. Which means some pretty serious remembering on my part :-)

The Storage Access tab will have a tick box that says something like "enable access control" but if you only have two hosts and no storage groups, then you're unlikely to have access control enabled anyway - it was an extra-cost option on those earlier arrays.

Do you have two FC cables from each host to the array or one cable only from each host to the array? If you have a single cable only, or both cables to the same storage processor, then the most liekly cause of your problem is that the new LUN is assigned to the other storag eprocessor. To trespass the LUN, right click on it and select "trespass". If that fixes your problem you'll then need to open the properties of that LUN and set the default SP to that storage processor.

...

A penny has just dropped. It may be that clusdisk.sys is masking the new LUN. You can check that by disabling clusdisk.sys. Go into Computer Management - Device Manager. Enable "Show Hidden Devices" on the View menu, then Expand Non-Plug and Play Drivers, right-click Cluster Disk Driver, click Properties, click the Driver tab, in the Startup type list click Disabled, and then click OK, the restart the computer. If you can then see the new LUN, create a partition on it, the reenable clusdisk.sys and reboot. ***Note that you should only have one node running in the cluster when you do this.****

Another thing to try depends on your FC HBAs and how old they are. You could try going into the HBA BIOS, enable the HBA BIOS and then check to see if the card can see any FC devices. Make sure that you disable the BIOS once again after you've finished the test.
0
 

Author Comment

by:markmall
ID: 17112264
Thanks again for these suggestions.
I had a look in "About" on Navisphere and it says our version is 6.2.0.6.0.
On the Storage Access tab there's just one option that says "Enforce Fair Access", but as you suggested it's greyed out.

I've tried the trespass scenario, but that didn't fix the problem.

I'm keen to try the clusdisk scenario as well, but I'm obviously going to have to schedule some down time to try that. It's the weekend here now and so I won't get a chance to organise that for a few days. I'll let you know how I get on when I've gotten around to it.
Thanks again.
mark
0
 
LVL 30

Accepted Solution

by:
Duncan Meyers earned 500 total points
ID: 17113455
You can probably/almost certainly get away with bringing up the second box with clusdisk.sys disabled - but there's always a risk that Windows will try to write to the disks that it can see - and that results in terminal corruption. I've sucessfully done the clusdisk thing on a quite a few occasions and got away with it, but it's far better to be safe than sorry...
0
 
LVL 30

Expert Comment

by:Duncan Meyers
ID: 17597714
Points please!
0

Featured Post

IT, Stop Being Called Into Every Meeting

Highfive is so simple that setting up every meeting room takes just minutes and every employee will be able to start or join a call from any room with ease. Never be called into a meeting just to get it started again. This is how video conferencing should work!

Join & Write a Comment

Suggested Solutions

Title # Comments Views Activity
restoring image to a different pc 20 92
Server 2012 Standard OS Mirroring 1 52
exchange, outlook 1 35
Adding JBOD to a VMware host 2 33
Ever notice how you can't use a new drive in Windows without having Windows assigning a Disk Signature?  Ever have a signature collision problem (especially with Virtual Machines?)  This article is intended to help you understand what's going on and…
Create your own, high-performance VM backup appliance by installing NAKIVO Backup & Replication directly onto a Synology NAS!
To efficiently enable the rotation of USB drives for backups, storage pools need to be created. This way no matter which USB drive is installed, the backups will successfully write without any administrative intervention. Multiple USB devices need t…
This tutorial will walk an individual through the steps necessary to install and configure the Windows Server Backup Utility. Directly connect an external storage device such as a USB drive, or CD\DVD burner: If the device is a USB drive, ensure i…

707 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

19 Experts available now in Live!

Get 1:1 Help Now