Brand new T5220, ILOM says MCU0 enabled, OBP says MCU0 disabled

I've got a brand new T5220 with four 2GB FB-DIMMs in it.  Initially it had eight 2GB FB-DIMMs in it, but they weren't all the same part number and ILOM would report that the box was working in a degraded memory condition.  It would not even allow an OS to be installed.  Sun documentation states that all FB-DIMMS in a branch must have the same identical part number, although I am not certain what a 'branch' refers to in this server.

So, I removed the second set of different FB-DIMMs that had been added after the box was purchased.  This left the original four 2GB FB-DIMMs with identical part numbers (M395T5663QZ4 by Samsung).  

The above change removed the error message about the box working in a degraded memory condition.  And, from the ILOM prompt, /SYS/MB/CMP0/MCU0 says it is enabled.  In fact, everything shows enabled.  But, when the box is powered on, just before you get the OK prompt, there is a message from the OBP that says that /SYS/MB/CMP0/MCU0 is disabled.  At this point you can go back into the ILOM and you will see that MCU0 still says enabled.

Why are these reporting oppositely?  It isn't as thought I can go into the ILOM and mark MCU0 as enabled--it already IS enabled.  There is a SUN patch for the firmware that talks about this problem (136932-10) but the version of OBP that the patch brings the OBP to is lower than the OBP is right now.  

If you answer, keep in mind that there is no OS installed on the box, yet.
cbecker001Asked:
Who is Participating?

[Webinar] Streamline your web hosting managementRegister Today

x
 
cbecker001Connect With a Mentor Author Commented:
I put the additional memory in that matched what was in the box and powered it up and all of the errors are gone.

However, I don't think that the additional memory is necessarily what solved the problem.  I read somewhere (perhaps in the reading one of you pointed me to) that you have to cycle the box once you use some of these 'set' or 'enable' commands.  I'm not sure that was done previously.  So, by shutting the server down over the weekend, and then bringing it up again today, those previous commands probably took effect--whether or not I put in additional memory.  The errors might have been gone whether or not I added more memory.  

I thank you both for responding quickly with helpful, thoughtful suggestions.  I'm going to split the points between you.  
0
 
arnoldConnect With a Mentor Commented:
You need to look at how the memory is installed.
i.e. do you have all four in the same bank or do you have them alternating?
http://blogs.sun.com/deniss/entry/t5120_and_t5220_system_overview
Did you then install the second type in the first open spot?
0
 
RowleyConnect With a Mentor Commented:
Did the system ship from Sun as you've described here? If so, i'd get on the phone to them. You should get 30 days of hardware support on the box. If not, i'd still chase your vendor. If you've no other recourse however, then take a look at the service manual if you haven't already:

http://dlc.sun.com/pdf/820-2181-13/820-2181-13.pdf

Page 41 describes how to clear POST-detected faults on the box. Give that a try...
0
Free Tool: SSL Checker

Scans your site and returns information about your SSL implementation and certificate. Helpful for debugging and validating your SSL configuration.

One of a set of tools we are providing to everyone as a way of saying thank you for being a part of the community.

 
cbecker001Author Commented:
Arnold:

There are four banks:  0,1,2,3    At present, there are only four FB-DIMMs with the part number I listed in the opening paragraph in bank 0.  The memory added to the server later has been removed while troubleshooting this problem.  There is more memory identical to the memory in the server now being delivered this morning.  I won't have access to the server until about 10:30 am central time.
0
 
cbecker001Author Commented:
Rowley:

I have some output from the box now.  When you cycle the box you get the following:

============================================================
{0} ok reset-all

SPARC Enterprise T5220, No Keyboard
Copyright 2009 Sun Microsystems, Inc.  All rights reserved.
OpenBoot 4.30.2.b, 3968 MB memory available, Serial #1234567.
Ethernet address 0:12:34:56:78:90, Host ID: 12345678.

ERROR: The following devices are disabled:
    MB/CMP0/MCU0

{0} ok
============================================================


But then, if you switch over to the ILOM prompt,  every component shows 'enabled':


============================================================

-> show -level all -o table component_state
Target              | Property               | Value                          
--------------------+------------------------+---------------------------------
/SYS/MB/RISER0/     | component_state        | Enabled                        
 PCIE0              |                        |                                
/SYS/MB/RISER0/     | component_state        | Enabled                        
 PCIE3              |                        |                                
/SYS/MB/RISER1/     | component_state        | Enabled                        
 PCIE1              |                        |                                
/SYS/MB/RISER1/     | component_state        | Enabled                        
 PCIE4              |                        |                                
/SYS/MB/RISER2/     | component_state        | Enabled                        
 PCIE2              |                        |                                
/SYS/MB/RISER2/     | component_state        | Enabled                        
 PCIE5              |                        |                                
/SYS/MB/GBE0        | component_state        | Enabled                        
/SYS/MB/GBE1        | component_state        | Enabled                        
/SYS/MB/PCIE        | component_state        | Enabled                        
/SYS/MB/PCIE-IO/    | component_state        | Enabled                        
 USB                |                        |                                
/SYS/MB/SASHBA      | component_state        | Enabled                        
/SYS/MB/CMP0/NIU0   | component_state        | Enabled                        
/SYS/MB/CMP0/NIU1   | component_state        | Enabled                        
/SYS/MB/CMP0/MCU0   | component_state        | Enabled                        
/SYS/MB/CMP0/MCU1   | component_state        | Enabled                        
/SYS/MB/CMP0/MCU2   | component_state        | Enabled                        
/SYS/MB/CMP0/MCU3   | component_state        | Enabled                        
/SYS/MB/CMP0/       | component_state        | Enabled                        
 L2_BANK0           |                        |                                
/SYS/MB/CMP0/       | component_state        | Enabled                        
 L2_BANK1           |                        |                                
/SYS/MB/CMP0/       | component_state        | Enabled                        
 L2_BANK2           |                        |                                
/SYS/MB/CMP0/       | component_state        | Enabled                        
 L2_BANK3           |                        |                                
/SYS/MB/CMP0/       | component_state        | Enabled                        
 L2_BANK4           |                        |                                
/SYS/MB/CMP0/       | component_state        | Enabled                        
 L2_BANK5           |                        |                                
/SYS/MB/CMP0/       | component_state        | Enabled                        
 L2_BANK6           |                        |                                
/SYS/MB/CMP0/       | component_state        | Enabled                        
 L2_BANK7           |                        |                                
/SYS/MB/CMP0/BR0/   | component_state        | Enabled                        
 CH0/D0             |                        |                                
/SYS/MB/CMP0/BR1/   | component_state        | Enabled                        
 CH0/D0             |                        |                                
/SYS/MB/CMP0/BR2/   | component_state        | Enabled                        
 CH0/D0             |                        |                                
/SYS/MB/CMP0/BR3/   | component_state        | Enabled                        
 CH0/D0             |                        |                                
/SYS/MB/CMP0/P0     | component_state        | Enabled                        
/SYS/MB/CMP0/P1     | component_state        | Enabled                        
/SYS/MB/CMP0/P2     | component_state        | Enabled                        
/SYS/MB/CMP0/P3     | component_state        | Enabled                        
/SYS/MB/CMP0/P4     | component_state        | Enabled                        
/SYS/MB/CMP0/P5     | component_state        | Enabled                        
/SYS/MB/CMP0/P6     | component_state        | Enabled                        
/SYS/MB/CMP0/P7     | component_state        | Enabled                        
/SYS/MB/CMP0/P8     | component_state        | Enabled                        
/SYS/MB/CMP0/P9     | component_state        | Enabled                        
/SYS/MB/CMP0/P10    | component_state        | Enabled                        
/SYS/MB/CMP0/P11    | component_state        | Enabled                        
/SYS/MB/CMP0/P12    | component_state        | Enabled                        
/SYS/MB/CMP0/P13    | component_state        | Enabled                        
/SYS/MB/CMP0/P14    | component_state        | Enabled                        
/SYS/MB/CMP0/P15    | component_state        | Enabled                        
/SYS/MB/CMP0/P16    | component_state        | Enabled                        
/SYS/MB/CMP0/P17    | component_state        | Enabled                        
/SYS/MB/CMP0/P18    | component_state        | Enabled                        
/SYS/MB/CMP0/P19    | component_state        | Enabled                        
/SYS/MB/CMP0/P20    | component_state        | Enabled                        
/SYS/MB/CMP0/P21    | component_state        | Enabled                        
/SYS/MB/CMP0/P22    | component_state        | Enabled                        
/SYS/MB/CMP0/P23    | component_state        | Enabled                        
/SYS/MB/CMP0/P32    | component_state        | Enabled                        
/SYS/MB/CMP0/P33    | component_state        | Enabled                        
/SYS/MB/CMP0/P34    | component_state        | Enabled                        
/SYS/MB/CMP0/P35    | component_state        | Enabled                        
/SYS/MB/CMP0/P36    | component_state        | Enabled                        
/SYS/MB/CMP0/P37    | component_state        | Enabled                        
/SYS/MB/CMP0/P38    | component_state        | Enabled                        
/SYS/MB/CMP0/P39    | component_state        | Enabled                        
/SYS/TTYA           | component_state        | Enabled                        

============================================================

So, I don't see that your suggestion would work unless I'm missing something.
0
 
RowleyCommented:
OK. If you try and clear the fault, do you get a message back saying there is no fault on that device or something else? Does show faulty return anything? What about trying to disable and enable the component from the SC?

sc> enablecomponent /SYS/MB/CMP0/MCU0

Again, if this is brand new don't waste your own and your companies time, get straight onto your vendor for support.
0
 
cbecker001Author Commented:
Rowley,

I don't have access to the box right now, but I'll try that and post the results after I try that.  
0
 
arnoldCommented:
The ILOM may indicate the setting while the OBP is indicating the state of the device.

The memory installation seems to be to use alternating memory slots.
as others asked, is the installed memory came in this form?

In which bank did you add the other four among the four from a different part number.  It could be that the four are designed for a different model and are incompatible with this hardware.  I.e. if you swap the four that you have with the additional four that you bough, is the error still there?
The FBDIMMS your system needs match in the slot design, but are not compatible with the DDR2 memory chips that are available for the earlier model, T5120.
Check the part number with SUN/Oracle to see whether it is suitable for your system.

0
 
cbecker001Author Commented:
I think I used the wrong option to close the question.  I want to accept my answer as the solution but give points to the Experts for their assistance.  I need this question opened back up so I can close it the way I intended to.  
0
All Courses

From novice to tech pro — start learning today.