• Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 1730
  • Last Modified:

MD3000i - Problems with Thermal Sensor & Host Board

We recently shipped our MD3000i from one office to another, and after hooking it up at the new office, there is a problem with the system.

When connecting with MD Storage Manager, you get an alert saying the storage array needs attention. Upon closer inspection, there are two errors, which I have listed below.

Storage array:  StorageMH Component reporting problem:     Thermal sensor     Status:     Not available   Location:  Expansion enclosure 0   Component requiring service:  Temperature sensor

Storage array:  StorageMH Component reporting problem:     Host Board Left   Status:     Not available   RAID Controller Module:  Slot 1   Service action (removal) allowed:  No        Service action LED on component:  No

I cannot ping one of the management IPs. The weird thing is that when we shutdown one of the management ports from the switch, the other one starts responding. It seems to be in some kind of stand-by mode, so I dont think so this is a hardware issue.

Thank you !
0
maxihost
Asked:
maxihost
  • 9
  • 8
1 Solution
 
Greg HejlPrincipal ConsultantCommented:
is this a dual controller?
0
 
Greg HejlPrincipal ConsultantCommented:
0
 
maxihostAuthor Commented:
Greg_Hejl,

Yes, is a dual controller.
0
Free Tool: Path Explorer

An intuitive utility to help find the CSS path to UI elements on a webpage. These paths are used frequently in a variety of front-end development and QA automation tasks.

One of a set of tools we're offering as a way of saying thank you for being a part of the community.

 
Greg HejlPrincipal ConsultantCommented:
Sometimes one of the controllers will go into a lock state and prevent the other controller from entering operational state.  starting up the MD with the errored controller unplugged will bring up the SAN.

Do you have data that needs to be preserved?  if so, call Dell and pay for service.
0
 
maxihostAuthor Commented:
Hello Greg,

Our MD is currently in operational state, the alert is only for one of the controllers.

I can see that at Virtual Disk/Operational Status all the volumes are set to Controller 0.

Is it safe to remove Controller 1 and connect it back ? What should I do in order to recover this controller ?
0
 
Greg HejlPrincipal ConsultantCommented:
yes it is - power it down, pull it out, wait for voltage drain (1 min), plug it back in.

if thats all it needs happiness will occur!

if not, try this:

      NOTE: When you reset a RAID controller module, the RAID controller module is not available for I/O operations until the reset is complete. If a host is using virtual disks owned by the RAID controller module being reset, the I/O directed to the RAID controller module is rejected. Before resetting the RAID controller module, either verify that the virtual disks owned by the RAID controller module are not in use or ensure a multipath driver is installed on all hosts using these virtual disks.
Syntax

reset controller [(0 | 1)]

http://support.dell.com/support/systemsinfo/document.aspx?~file=/systems/md3000/en/cli/html/scriptcm.htm
0
 
maxihostAuthor Commented:
Hi Greg,

I will try the reset controller first, what do you think ?

How do I login to CLI ? I dont see that option through the MDSM.

Thank you.
0
 
Greg HejlPrincipal ConsultantCommented:
ftp://ftp.dell.com/Manuals/all-products/esuprt_ser_stor_net/esuprt_powervault/powervault-md3000i_Reference%20Guide2_en-us.pdf

open a command window, path to were you find the SMcli.exe,  follow the instructions in the link.
0
 
Greg HejlPrincipal ConsultantCommented:
I would physically remove it and plug it back in since it's current condition is due to a physical move.
0
 
maxihostAuthor Commented:
Hello Greg,

I just made the procedure. Removed the controller, wait for some minutes and plugged it back. The management IP is now pinging, but its flapping (stops pinging then start back). And when I try to click "Configure iSCSI Ports" I get "Unable to obtain current iSCSI network settings for RAID controller module 1 port 1.

What should I do ?
0
 
Greg HejlPrincipal ConsultantCommented:
do the cli reset
0
 
maxihostAuthor Commented:
Here is what I am running,

C:\Program Files (x86)\Dell\MD Storage Manager\client>SMcli -c 10.80.0.XX "reset
 controller [1];"

I get,

Script file null not found.

Do you know the exact syntax ?

Thank you.
0
 
maxihostAuthor Commented:
Greg,

I fixed the syntax, but now I am getting this message,
Executing script...

The reset RAID controller module 1 operation failed because of an exception. Exc
eption message:

Error 1009 - A management connection to all RAID controller modules in the stora
ge array must be accessible to complete this operation.

If you are managing this storage array directly through the Ethernet (out-of-ban
d), a management connection to at least one RAID controller module still needs t
o be defined. Use the Add Storage Array option to define a management connection
 (IP address or DNS/Network name).

If you are managing this storage array through the host agent (in-band), verify
that all physical paths to the storage array are connected and operational. Then
 run the hot_add utility on the affected host, and then Refresh or Rescan the Ho
st.
The command at line 1 that caused the error is:

reset controller [1];

Script execution halted due to error.
0
 
maxihostAuthor Commented:
When I try to send the same command to the failed controller I get,

C:\Program Files (x86)\Dell\MD Storage Manager\client>SMcli 10.80.0.31 -p cw4786
8777$$  -c "reset controller [1];"
Network errors were detected while connecting to storage array 10.80.0.31.
Please check for any network problems and then try again.

SMcli failed.
0
 
Greg HejlPrincipal ConsultantCommented:
that corresponds with your flapping issue - try removing and reinserting the controller again (sometimes the bits don't line up right...:-)

Also - try this out: http://rtumaykin-it.blogspot.com/2012/04/fixing-unresponsive-management-ports-on.html

This may also apply to the iSCSI ports too - what does the MDSM tell you about the controller 1? anything?
0
 
maxihostAuthor Commented:
I tried this as well. Maybe the controller is dead ? I have another one to replace. How to safely replace the controller without harming the current RAID configurations ? The new controller comes from another MD3000i chassis. Thank you.
0
 
Greg HejlPrincipal ConsultantCommented:
Controller replacement is automagic - I've never swapped controllers from one MD to another, so I do not know what the result would be.  they are supposed to query the time configured on the controller they initiate contact with.  The newer time-stamped configuration wins.

It may be time to put in a call to Dell - they are very helpful on the MD's.
0
Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.

Join & Write a Comment

Featured Post

The 14th Annual Expert Award Winners

The results are in! Meet the top members of our 2017 Expert Awards. Congratulations to all who qualified!

  • 9
  • 8
Tackle projects and never again get stuck behind a technical roadblock.
Join Now