Sun SPARC T5440 system failure with a peculiar set of diagnostics; I am at a loss and need help

There is a Sun SPARC T5440 that supports a legacy POS system and was first brought online in 2010.  In that time it ran almost without flaw.  Little if any maintenance has been performed, and nothing has been added to the original hardware.

The system failed the day after Thanksgiving.  Thus far little progress has been made in communicating with the device.  Each attempt has met with failure.  There isn't a monitor on the unit.  Does this unit require a proprietary monitor configuration?

On the back, the amber exclamation light is lit.  The power light is lit solid green.  We are hoping that we are dealing with a power issue.  The power supplies, of which there are four, display a green light that rolls from 1 to 4.  These lights come on one after another in rapid sequence.  See video.
IMG_1706.3gp
JohnnyIT Consultant!Asked:
 
andyalderCommented:
If I remember correctly, the blue Cisco rollover cable worked with Sun/Oracle/Fujitsu SPARCs, but it was a few years ago that I used them. The plug end Sun provided was used with a straight Ethernet cable. Make sure there isn't a keyboard connected to the SPARC; although that won't stop output from appearing on the PC, the server may not accept keyboard input over serial.
 
andyalderCommented:
SPARCs don't need a display; they are managed using ILOM via the management processor's serial port, so you need a laptop and a serial dongle. The port is normally set to 9600 baud, 8 data bits, no parity, 1 stop bit. They're often not set to power on automatically after a power cycle, so you might just need to issue start /SYS. auto-boot is normally off too, so after start /SYS you may need to type "boot disk0" at the ok prompt.

https://docs.oracle.com/cd/E19910-01/E21500-01/z40002fe1298584.html will get you started
https://docs.oracle.com/cd/E19910-01/E21500-01/z40001341020700.html
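As a rough illustration of the 9600 baud, 8 data bits, no parity, 1 stop bit settings above, here is a minimal Python sketch that builds a PuTTY command line for a serial session. It only assembles the arguments and does not launch anything. The choice of PuTTY, the port name COM3, and "no flow control" are assumptions for the example; check which COM port the USB serial adapter actually shows up on.

```python
def putty_serial_args(port="COM3", baud=9600, data_bits=8, parity="n",
                      stop_bits=1, flow="N"):
    """Build a PuTTY command line for a serial session (not executed here).

    port and flow are assumptions for illustration; the baud rate,
    data bits, parity and stop bits follow the usual 9600 8N1 setting.
    """
    # PuTTY's -sercfg option takes: baud,data-bits,parity,stop-bits,flow-control
    sercfg = f"{baud},{data_bits},{parity},{stop_bits},{flow}"
    return ["putty", "-serial", port, "-sercfg", sercfg]


if __name__ == "__main__":
    # Print the command so it can be pasted into a command prompt.
    print(" ".join(putty_serial_args()))
```

The same four values go into the serial setup dialog of any other terminal program if PuTTY gives trouble.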
 
andyalderCommented:
Oh, BTW just looked at your video, https://docs.oracle.com/cd/E19488-01/E22632-01/z40003f61106869.html - blinking green just means AC present/system in standby. The rolling you see is just coincidence.
 
JohnnyIT Consultant!Author Commented:
Andyalder, thank you for your response.  The problem I have is that I am charged with correcting this problem without being on site.  Someone has to be my hands.  Thus far he tells me he is doing everything correctly to gain access to the command line through the serial port.  He is using a Gigaware USB COM port emulator, which he says sits on COM3.  He is using PuTTY for his connection.  I have had lots of problems using PuTTY with serial communications, but perhaps the problem is with the Gigaware, as I too am using this emulator.  The problems I had went away with Tera Term.  I have advised my onsite person to download Tera Term.

Basically, he has received nothing but a blank screen when connecting to the communication port on the SPARC.  Is there any way I can tell that ILOM is not accessible?

Further, I have read your statement that the power supply lights constantly lighting in rotating 1-4 fashion are nothing but a red herring.  I find it hard to accept that engineering has left the lights to blink in this manner randomly and without meaning.  However, I cannot refute it.  Thus far, nothing I have read about the LEDs on the power supply mentions this pattern.

I will examine the documents you have supplied.  Everything revolves around ILOM access.

In summary, if ILOM access is broken, how can I determine that?
 
andyalderCommented:
If there's no output from ILOM after power-cycling it, I would suspect the serial cable/dongle. Do they have a router you can prove the laptop serial port with? They're probably not using the right serial plug that came with the server.

You can always get them to pull a PSU out if you want the pattern to change, there's always one or two redundant ones.
 
JohnnyIT Consultant!Author Commented:
Ahh, ok, the serial port adapter is proprietary like a Cisco?
That would do it all day long.  They are using a GIGAWARE USB serial port emulator
Thanks

Please let me know.
 
JohnnyIT Consultant!Author Commented:
Yes, that was the correct solution.  The Cisco rollover cable got him in.  I also had him use Tera Term, and the two pieces got him into ILOM.
Once there, will the other doc you sent me get me to the OS?  Will it force the server to boot?


Thanks
 
andyalderCommented:
If it's at the OK prompt it might start if you type "boot"

What does
ok> printenv boot-device
show? It should list the boot device (if set) amongst other things.

https://docs.oracle.com/cd/E19455-01/805-7228/6j6q7uepp/index.html would also help.
 
JohnnyIT Consultant!Author Commented:
Can't get to the ok prompt. I issued stop /s and it just hung for about an hour.  I should have TeamViewer access now.
 
JohnnyIT Consultant!Author Commented:
Andy, when I go through the document to try to get to the ok prompt, it simply fails.  It either just hangs or gives me "starting /SYS" but then just blinks.
 
andyalderCommented:
I'm not going to be much use to you for the rest of today, just about going to bed as I'm in the UK.

He has taken the lid off and checked things like the DIMM board LEDs, I hope?
https://docs.oracle.com/cd/E19488-01/E22632-01/z400008e1160690.html
 
andyalderCommented:
Set it to boot into diagnostic mode:

https://docs.oracle.com/cd/E19350-01/E21356/6_config-booting.html#50634778_75366

-> set /SYS keyswitch_state=diag
-> start /SYS
-> start /SP/console

Of course it's possible that start /SYS changes the baud rate or other serial port parameter but I don't think so.
 
JohnnyIT Consultant!Author Commented:
Take a look at the system properties.
So far everything just hangs and does nothing, but I got this back:
  1. System properties
 
JohnnyIT Consultant!Author Commented:
See. It says faulted powerstate off
 
JohnnyIT Consultant!Author Commented:
Andy, hope all is well.  I wanted to follow up with you on a couple of things.  1.  Why on earth does Oracle not allow users access to their website?  I really need access to documentation.  Even the service manual for the T5440 server pushes me back to the Sun.com site, which is now part of the Oracle site.
From what I'm gathering, I have a problem with a field replaceable unit, or FRU.  If I had access to the Oracle site, it seems it would be short work to locate the commands to zero in on which component needs replacement.

Any thoughts would be appreciated.

Thanks
 
andyalderCommented:
I hate that too. It's a way to force you to use their costly field maintenance contracts and to push third party maintainers out of business. Others are following suit; tried to get HPE firmware lately?
 
JohnnyIT Consultant!Author Commented:
Andy, here is the specific fault:

Target              | Property               | Value
--------------------+------------------------+---------------------------------
/SP/faultmgmt/0     | fru                    | /SYS
/SP/faultmgmt/0/    | timestamp              | Nov 24 23:10:20
 faults/0           |                        |
/SP/faultmgmt/0/    | sp_detected_fault      | Poweron failed: suspect
 faults/0           |                        | components: MB PS0 PS1 PS2 PS3
                    |                        | CPU0 CPU1 MEM0 MEM1

When I say show PS1, PS0 and so on, I get a response from the system with no faults.  When I try these other components,

show MEM0 or MEM1 and on and on, I get "Property not found".

What gives?  How do I find out which one of these is bad?


Thanks
 
andyalderCommented:
With all those FRUs listed as possibly faulty, I would suspect the MB, as it connects to everything. Initially I thought it would cost a lot, but there's one on eBay for $399: https://www.ebay.co.uk/itm/Sun-SPARC-Enterprise-T5440-Motherboard-541-2749-/221634260483

You can do the obvious and see what happens with just one CPU and memory board, just like it was a Wintel box, and of course examine the capacitors visually. Since power-on fails, it can't get the properties from the memory and CPUs, as they aren't powered up. The PSUs are redundant, so they're very unlikely to be the cause, and anyway you can try with individual ones removed in case there's a dead short across the output.

Note I'm not really an expert, and there may be more than one version of the MB. Probably best to contact the seller of that MB on eBay.
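
For anyone who wants to pull the suspect list out of a captured fault listing like the one posted above, here is a minimal Python sketch. It assumes the listing has been pasted in as plain text in the same three-column, pipe-separated layout with wrapped cells; the function names are made up for the example and nothing here talks to ILOM itself.

```python
import re

# Sample text copied from the fault listing earlier in this thread.
SAMPLE = """\
Target              | Property               | Value
--------------------+------------------------+---------------------------------
/SP/faultmgmt/0     | fru                    | /SYS
/SP/faultmgmt/0/    | timestamp              | Nov 24 23:10:20
 faults/0           |                        |
/SP/faultmgmt/0/    | sp_detected_fault      | Poweron failed: suspect
 faults/0           |                        | components: MB PS0 PS1 PS2 PS3
                    |                        | CPU0 CPU1 MEM0 MEM1
"""


def parse_fault_table(text):
    """Return a list of rows, joining wrapped continuation lines."""
    rows = []
    for line in text.splitlines():
        if "|" not in line:
            continue  # blank lines and the ---+--- separator
        cells = [c.strip() for c in line.split("|")]
        if len(cells) != 3:
            continue
        target, prop, value = cells
        if prop == "Property":
            continue  # header row
        if prop:
            rows.append({"target": target, "property": prop, "value": value})
        elif rows:
            # An empty Property cell means this line continues the previous row.
            rows[-1]["target"] += target
            if value:
                rows[-1]["value"] = (rows[-1]["value"] + " " + value).strip()
    return rows


def suspect_components(rows):
    """Extract the component names listed after 'components:'."""
    for row in rows:
        if row["property"] == "sp_detected_fault":
            match = re.search(r"components:\s*(.*)", row["value"])
            if match:
                return match.group(1).split()
    return []


if __name__ == "__main__":
    print(suspect_components(parse_fault_table(SAMPLE)))
```

This only tidies up the list of suspects; it can't say which one is actually bad, which still comes down to swapping or removing parts as described above.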
 
JohnnyIT Consultant!Author Commented:
Andy helped me several times.
Question has a verified solution.