Solved

MD3000i - Problems with Thermal Sensor & Host Board

Posted on 2014-03-24
17
1,412 Views
Last Modified: 2014-05-19
We recently shipped our MD3000i from one office to another, and after hooking it up at the new office, there is a problem with the system.

When connecting with MD Storage Manager, you get an alert saying the storage array needs attention. Upon closer inspection, there are two errors, which I have listed below.

Storage array:  StorageMH Component reporting problem:     Thermal sensor     Status:     Not available   Location:  Expansion enclosure 0   Component requiring service:  Temperature sensor

Storage array:  StorageMH Component reporting problem:     Host Board Left   Status:     Not available   RAID Controller Module:  Slot 1   Service action (removal) allowed:  No        Service action LED on component:  No

I cannot ping one of the management IPs. The weird thing is that when we shutdown one of the management ports from the switch, the other one starts responding. It seems to be in some kind of stand-by mode, so I dont think so this is a hardware issue.

Thank you !
0
Comment
Question by:maxihost
  • 9
  • 8
17 Comments
 
LVL 13

Expert Comment

by:Greg Hejl
ID: 39952081
is this a dual controller?
0
 
LVL 13

Expert Comment

by:Greg Hejl
ID: 39952105
0
 

Author Comment

by:maxihost
ID: 39953315
Greg_Hejl,

Yes, is a dual controller.
0
 
LVL 13

Expert Comment

by:Greg Hejl
ID: 39957461
Sometimes one of the controllers will go into a lock state and prevent the other controller from entering operational state.  starting up the MD with the errored controller unplugged will bring up the SAN.

Do you have data that needs to be preserved?  if so, call Dell and pay for service.
0
 

Author Comment

by:maxihost
ID: 39957473
Hello Greg,

Our MD is currently in operational state, the alert is only for one of the controllers.

I can see that at Virtual Disk/Operational Status all the volumes are set to Controller 0.

Is it safe to remove Controller 1 and connect it back ? What should I do in order to recover this controller ?
0
 
LVL 13

Expert Comment

by:Greg Hejl
ID: 39957642
yes it is - power it down, pull it out, wait for voltage drain (1 min), plug it back in.

if thats all it needs happiness will occur!

if not, try this:

      NOTE: When you reset a RAID controller module, the RAID controller module is not available for I/O operations until the reset is complete. If a host is using virtual disks owned by the RAID controller module being reset, the I/O directed to the RAID controller module is rejected. Before resetting the RAID controller module, either verify that the virtual disks owned by the RAID controller module are not in use or ensure a multipath driver is installed on all hosts using these virtual disks.
Syntax

reset controller [(0 | 1)]

http://support.dell.com/support/systemsinfo/document.aspx?~file=/systems/md3000/en/cli/html/scriptcm.htm
0
 

Author Comment

by:maxihost
ID: 39957654
Hi Greg,

I will try the reset controller first, what do you think ?

How do I login to CLI ? I dont see that option through the MDSM.

Thank you.
0
 
LVL 13

Expert Comment

by:Greg Hejl
ID: 39957882
ftp://ftp.dell.com/Manuals/all-products/esuprt_ser_stor_net/esuprt_powervault/powervault-md3000i_Reference%20Guide2_en-us.pdf

open a command window, path to were you find the SMcli.exe,  follow the instructions in the link.
0
Better Security Awareness With Threat Intelligence

See how one of the leading financial services organizations uses Recorded Future as part of a holistic threat intelligence program to promote security awareness and proactively and efficiently identify threats.

 
LVL 13

Expert Comment

by:Greg Hejl
ID: 39958588
I would physically remove it and plug it back in since it's current condition is due to a physical move.
0
 

Author Comment

by:maxihost
ID: 39960225
Hello Greg,

I just made the procedure. Removed the controller, wait for some minutes and plugged it back. The management IP is now pinging, but its flapping (stops pinging then start back). And when I try to click "Configure iSCSI Ports" I get "Unable to obtain current iSCSI network settings for RAID controller module 1 port 1.

What should I do ?
0
 
LVL 13

Expert Comment

by:Greg Hejl
ID: 39960317
do the cli reset
0
 

Author Comment

by:maxihost
ID: 39960386
Here is what I am running,

C:\Program Files (x86)\Dell\MD Storage Manager\client>SMcli -c 10.80.0.XX "reset
 controller [1];"

I get,

Script file null not found.

Do you know the exact syntax ?

Thank you.
0
 

Author Comment

by:maxihost
ID: 39960397
Greg,

I fixed the syntax, but now I am getting this message,
Executing script...

The reset RAID controller module 1 operation failed because of an exception. Exc
eption message:

Error 1009 - A management connection to all RAID controller modules in the stora
ge array must be accessible to complete this operation.

If you are managing this storage array directly through the Ethernet (out-of-ban
d), a management connection to at least one RAID controller module still needs t
o be defined. Use the Add Storage Array option to define a management connection
 (IP address or DNS/Network name).

If you are managing this storage array through the host agent (in-band), verify
that all physical paths to the storage array are connected and operational. Then
 run the hot_add utility on the affected host, and then Refresh or Rescan the Ho
st.
The command at line 1 that caused the error is:

reset controller [1];

Script execution halted due to error.
0
 

Author Comment

by:maxihost
ID: 39960428
When I try to send the same command to the failed controller I get,

C:\Program Files (x86)\Dell\MD Storage Manager\client>SMcli 10.80.0.31 -p cw4786
8777$$  -c "reset controller [1];"
Network errors were detected while connecting to storage array 10.80.0.31.
Please check for any network problems and then try again.

SMcli failed.
0
 
LVL 13

Expert Comment

by:Greg Hejl
ID: 39960537
that corresponds with your flapping issue - try removing and reinserting the controller again (sometimes the bits don't line up right...:-)

Also - try this out: http://rtumaykin-it.blogspot.com/2012/04/fixing-unresponsive-management-ports-on.html

This may also apply to the iSCSI ports too - what does the MDSM tell you about the controller 1? anything?
0
 

Author Comment

by:maxihost
ID: 39961686
I tried this as well. Maybe the controller is dead ? I have another one to replace. How to safely replace the controller without harming the current RAID configurations ? The new controller comes from another MD3000i chassis. Thank you.
0
 
LVL 13

Accepted Solution

by:
Greg Hejl earned 500 total points
ID: 39961904
Controller replacement is automagic - I've never swapped controllers from one MD to another, so I do not know what the result would be.  they are supposed to query the time configured on the controller they initiate contact with.  The newer time-stamped configuration wins.

It may be time to put in a call to Dell - they are very helpful on the MD's.
0

Featured Post

Find Ransomware Secrets With All-Source Analysis

Ransomware has become a major concern for organizations; its prevalence has grown due to past successes achieved by threat actors. While each ransomware variant is different, we’ve seen some common tactics and trends used among the authors of the malware.

Join & Write a Comment

Suggested Solutions

If you have a USB Drive that is not recognized by Windows the problem is usually that you have too many network drives or other drives that occupy all the drive letters D: E: or F: which is the normal drive letter of a usb drive. The way to correct …
AWS Glacier is Amazons cheapest storage option and is their answer to a ‘Cold’ storage service.  Customers primarily use this service for archival purposes and storage of infrastructure backups.  Its unlimited storage potential and low storage cost …
This video teaches viewers how to encrypt an external drive that requires a password to read and edit the drive. All tasks are done in Disk Utility. Plug in the external drive you wish to encrypt: Make sure all previous data on the drive has been …
This Micro Tutorial will teach you how to reformat your flash drive. Sometimes your flash drive may have issues carrying files so this will completely restore it to manufacturing settings. Make sure to backup all files before reformatting. This w…

758 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

17 Experts available now in Live!

Get 1:1 Help Now