Dell/EMC FC4500 & Powerpath path failures

Hi,
 
We have clustered PE2600's with a FC4500 with 5 DAE's direct attached.
 
We experienced multiple disk failures in our last DAE and after replacing the disks and rebinding the LUN, we are unable to access the disk within the OS (2000 Server) or Powerpath.
 
Everything looks fine within Navisphere, with no faults.
We have OpenManage Array Manager on one node which fails to find the disk when a scan is done.
On the other node we don't have array manager and when doing a scan within Windows Disk Management it also fails to find the disk.
 
Within power path we see the disk (probably from before we encountered multiple disk failures), but it says that the storage device has a failed status. It also shows the two paths c3t0d8 & c4t1d8 with a status of failed and a state of closed.
I've tried running powermt restore, with and without the force option, but it just tells me: Clariion device path c3t0d8 is currently dead, and the same for c4t1d8.
Anyone know how we can regain access to this enclosure??
Any help would be very much appreciated.
Thanks,
mark
markmallAsked:
Who is Participating?

[Webinar] Streamline your web hosting managementRegister Today

x
 
Duncan MeyersConnect With a Mentor Commented:
You can probably/almost certainly get away with bringing up the second box with clusdisk.sys disabled - but there's always a risk that Windows will try to write to the disks that it can see - and that results in terminal corruption. I've sucessfully done the clusdisk thing on a quite a few occasions and got away with it, but it's far better to be safe than sorry...
0
 
Duncan MeyersCommented:
My immediate suspicision is that you may not have added the new LUN into the Storage Group that the server is in.

You'll need to start Navisphere Manager (from the fibre channel connected host)and log in.

In the Storage Tree, expand out Storage Groups. You'll see one or more storage groups. If you open up the storage groups, you'll see that they have a host and one or more LUNs in each one. Right-click on the storage group with the troublesome host in it and click on 'Select LUNs'. Click on the new LUN, click on the arrow pointing to the right and click on OK. Now reboot the host and it will see the LUN.

 
0
 
Duncan MeyersCommented:
If the above solution doesn't fix your problem, then we'll need more information about your environment:

How many hosts are attached?
Are the hosts direct-connect or via a FC switch?
What is the FC4500 software revision?(click on the array serial number in Navisphere Manager, select properties and select the software tab. The FLARE code revision will be either in the format 02.06.0.x00.xx or (more likely) xx.xxxx. Could you also click on the Access Control tab and see if the Access Control Enabled tick box is ticked an greyed out?
0
Never miss a deadline with monday.com

The revolutionary project management tool is here!   Plan visually with a single glance and make sure your projects get done.

 
markmallAuthor Commented:
Hi Meyersd,

Thanks for you replies.
Firstly, we have two hosts attached and they're direct connected.
Secondly, we don't use storage groups.

When I get the properties of the array, I don't have a software tab or an access control tab, I just have the following tabs:
General
Cache
Memory
Storage Access.
Am I looking in the right place? I don't see anywhere else where there's a software tab!?
0
 
Duncan MeyersCommented:
Ummmm...

You're on Navisphere 5.3 or earlier I guess. Which means some pretty serious remembering on my part :-)

The Storage Access tab will have a tick box that says something like "enable access control" but if you only have two hosts and no storage groups, then you're unlikely to have access control enabled anyway - it was an extra-cost option on those earlier arrays.

Do you have two FC cables from each host to the array or one cable only from each host to the array? If you have a single cable only, or both cables to the same storage processor, then the most liekly cause of your problem is that the new LUN is assigned to the other storag eprocessor. To trespass the LUN, right click on it and select "trespass". If that fixes your problem you'll then need to open the properties of that LUN and set the default SP to that storage processor.

...

A penny has just dropped. It may be that clusdisk.sys is masking the new LUN. You can check that by disabling clusdisk.sys. Go into Computer Management - Device Manager. Enable "Show Hidden Devices" on the View menu, then Expand Non-Plug and Play Drivers, right-click Cluster Disk Driver, click Properties, click the Driver tab, in the Startup type list click Disabled, and then click OK, the restart the computer. If you can then see the new LUN, create a partition on it, the reenable clusdisk.sys and reboot. ***Note that you should only have one node running in the cluster when you do this.****

Another thing to try depends on your FC HBAs and how old they are. You could try going into the HBA BIOS, enable the HBA BIOS and then check to see if the card can see any FC devices. Make sure that you disable the BIOS once again after you've finished the test.
0
 
markmallAuthor Commented:
Thanks again for these suggestions.
I had a look in "About" on Navisphere and it says our version is 6.2.0.6.0.
On the Storage Access tab there's just one option that says "Enforce Fair Access", but as you suggested it's greyed out.

I've tried the trespass scenario, but that didn't fix the problem.

I'm keen to try the clusdisk scenario as well, but I'm obviously going to have to schedule some down time to try that. It's the weekend here now and so I won't get a chance to organise that for a few days. I'll let you know how I get on when I've gotten around to it.
Thanks again.
mark
0
 
Duncan MeyersCommented:
Points please!
0
All Courses

From novice to tech pro — start learning today.