Blade S SAS RAID Module - Unable to see SCM

Posted on 2012-08-21
Last Modified: 2012-11-05
Hello experts,

I have a Blade S chassis that worked fine for over 2 years, but it was moved to another location last week and has not fully worked since.
It has:
2 SAS controllers
3 blades (VMware) zoned to boot from SAN: these do not boot, as they cannot see any boot device
2 blades booting from local disk: these work fine but cannot see any disks apart from the local ones
IP of AMM -
RAID controller subsystem
RAID controller subsystem

All share the same mask of and same gateway
Both controllers display the same error
I/O Module 3 requires user attention
I/O Module 4 requires user attention
Controller I/O 3 can ping its subsystem on
Controller I/O 4 cannot ping its subsystem on
Zoning is using Predefined Config 10, and displays Normal, then No Cable in the status for Blades 1, 2 and 3 as they continually reboot.
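Controller I/O 3 answering a ping while Controller I/O 4 does not could have several causes; one cheap thing to rule out after a move is an addressing mismatch. The following Python sketch checks whether a set of management addresses all fall in one subnet under a given mask. The function name `same_subnet`, the sample addresses, and the 255.255.255.0 mask are illustrative assumptions, not values taken from this chassis.

```python
import ipaddress


def same_subnet(addresses, netmask):
    """Return True if every address falls in one network under netmask."""
    networks = {
        ipaddress.ip_interface(f"{addr}/{netmask}").network
        for addr in addresses
    }
    return len(networks) == 1


if __name__ == "__main__":
    # Hypothetical addresses and mask, for illustration only.
    # Substitute the AMM, I/O module and RAID subsystem addresses.
    addrs = ["192.168.96.233", "192.168.96.234", "192.168.96.10"]
    print(same_subnet(addrs, "255.255.255.0"))
```

Sharing the same mask is not by itself proof that the devices share a subnet, so a check like this only confirms or rules out one possible cause of the failed ping.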

I have tried to update the firmware through the AMM, through the web portal, and using the Python method.
Both AMM methods fail with either a bad image or an invalid header error. Running via the CLI (C:\Windows\System32>C:\Temp\ibm_fw_bcsw_s0cl- -i 192.168.96.233 -n) first displayed the output below, which I took to mean the password was wrong, so I put it back to the default PASSW0RD.
Image unpacked.

Package name  : rssm.
Package level :
Product       : rssm
Image created : Oct25201107:42:11(GMT)

Raid ctlr uBoot version : H-
Raid ctlr code version  : H-
Raid ctlr Linux version : H-
BMC version : S0BT10A
FPGA version : 01.07
SES version : 0107
BBU version : 58.0
DSM version : 1.08
SAS switch version : R1.07

Initializing firmware update - please wait.
MSG: ./ failed in function telnetRemoteHost, rc = 13.

If I try to run it again, it just returns Unpacking image C:\Temp\ibm_fw_bcsw_s0cl- then goes back to the Windows prompt and does nothing else.
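The `telnetRemoteHost` failure suggests the updater could not open (or log in to) a telnet session on the controller, though the meaning of rc = 13 is not documented in this thread. A minimal Python sketch, assuming the updater uses the standard telnet port 23, tests whether that port accepts a TCP connection at all. The helper name `tcp_port_open` is hypothetical; the address is the one passed with -i in the command above.

```python
import socket


def tcp_port_open(host, port=23, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and unreachable hosts.
        return False


if __name__ == "__main__":
    # Address taken from the -i argument in the post; port 23 is an assumption.
    host = "192.168.96.233"
    state = "reachable" if tcp_port_open(host) else "not reachable"
    print(f"telnet port 23 on {host} is {state}")
```

If the port turns out to be closed, the problem is more likely connectivity than the password; if it is open, the credentials or the state of the controller itself become the more likely suspects.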

Any ideas would be great.
Question by: deanwilsons

    Expert Comment

    Hardware has been known to break from time to time ... have you tried any diagnostics?

    Author Comment

    Not yet, as I didn't expect both controllers to fail at the same time, but it's certainly possible. What's the best way to run these diags?

    Expert Comment

    Hi deanwilsons - I have no idea. I don't have one of these. I'm just falling back on tried-and-true techniques. If you can't easily figure out how to fix what seems to be a software problem, confirm whether or not the hardware is good.

    Expert Comment

    Why the list of IP addresses? They're only used for management, so they aren't really relevant to SAS connectivity.

    You didn't, by any chance, remove the I/O switches to lighten it when unracking and accidentally put the SAS switches in bays 1 and 2 rather than 3 and 4, did you? No, pretty sure not, as it says bays 3 and 4 in the text above. What about swapping them around? That would break connectivity, since they would both have the wrong zones on them.

    Author Comment

    Thought I'd cover all bases and supply as much info as possible, hence the IP addresses.
    I will try swapping the controllers around, but during the rebuild, everything was labeled.
    The controllers are in sync, so would it really matter if they were in the wrong bays?

    I did notice the time is over 2 hours out on the controllers, but I am unable to connect to the controllers via Storage Manager to change it.

    Author Comment

    The controllers were in the correct bays, but I swapped them around anyway. Nothing changed, so I put them back in their original bays.

    I have checked the event logs in the AMM and the controller module, but nothing points to an issue.

    The controller just displays a warning, and that user intervention is required.

    Author Comment

    Still getting no further with this issue.

    Does anyone else have any further suggestions?

    Accepted Solution

    IBM Support was called in, and they diagnosed the problem as corrupt firmware. They supplied a pre-release firmware, which resolved the issue.

    Author Closing Comment

    No other resolution was offered.
