solaris error.

Posted on 2004-10-04
Last Modified: 2013-12-21
/var/adm/messages show the following. uname -r shows 5.7
"unix: [AFT2] errID 0x002e7772.72894422 PA=0x00000000.0c927230
E$tag 0x00000000.08400192 E$State: Shared E$parity 0x04
unix: [AFT2] E$Data (0x00): 0x0014a174.0014a17c
 [AFT2] E$Data (0x08): 0x0014a184.0014a18c
 unix: [AFT2] E$Data (0x10): 0x0014a190.0014a198
 unix: [AFT2] E$Data (0x18): 0x0014a19c.00138f10
 unix: [AFT2] E$Data (0x20): 0x00112b00.00112c00
 unix: [AFT2] E$Data (0x28): 0x00138fd4.00112c50
 unix: [AFT2] E$Data (0x30): 0x00112c5f.00112c6e *Bad* PSYND=0xff00
 unix: [AFT2] E$Data (0x38): 0x00112c80.00138fd8
 unix: [AFT2] errID 0x002e7772.72894422 PA=0x00000000.0c927230
 E$tag 0x00000000.08400192 E$State: Shared E$parity 0x04
 unix: [AFT2] E$Data (0x00): 0x0014a174.0014a17c
 unix: [AFT2] E$Data (0x08): 0x0014a184.0014a18c
 unix: [AFT2] E$Data (0x10): 0x0014a190.0014a198
 unix: [AFT2] E$Data (0x18): 0x0014a19c.00138f10
 unix: [AFT2] E$Data (0x20): 0x00112b00.00112c00
 unix: [AFT2] E$Data (0x28): 0x00138fd4.00112c50
 unix: [AFT2] E$Data (0x30): 0x00112c5f.00112c6e *Bad* PSYND=0x0400
 unix: [AFT2] E$Data (0x38): 0x00112c80.00138fd8
 unix: [AFT3] errID 0x002e7772.72894422 Above Error is in User Mode
 and is fatal: will reboot......
plz.explain what the error stands for and how to debug it.
Question by:byerabati
LVL 18

Accepted Solution

liddler earned 125 total points
ID: 12216165
It looks like you have a bad memory module or CPU, assuming you have a service contract with sun, you need to get them to check the explorer output and change the hardware

Author Comment

ID: 12216214
Thanx liddler,
how can i know the error is due to bad memory or to read the error.

LVL 18

Expert Comment

ID: 12216538
I don't know, any error like this I pass to Sun, as they support the hardware on all of my systems.

You could have a look at the output of
/usr/platform/`uname -i`/sbin/prtdiag -v
to see  if that reports any hardware problems

Also have a look at:
Ransomware-A Revenue Bonanza for Service Providers

Ransomware – malware that gets on your customers’ computers, encrypts their data, and extorts a hefty ransom for the decryption keys – is a surging new threat.  The purpose of this eBook is to educate the reader about ransomware attacks.


Expert Comment

ID: 12247892
liddler is correct. It's an E-Cache ("E$") error, which means that there is some kind of hardware problem with the CPU cache memory.

If you send that data to Sun, they will most likely accept it as proof of a hardware failure and fix it for you if you have a support contract or are covered under warranty.


Assisted Solution

SumeshDaftary earned 125 total points
ID: 12280824

Its sure that its E-cache problem.

This following link will help you to understand conversion as well. 

you may need to replace CPU on board 0


Expert Comment

ID: 12294408
Just one point to clarify regarding Sun's support policy on the E-cache errors having spent the last 6 years in a Sun Solution Centre (before I was laid off and my job got out-sourced to low-cost engineers in India):
If this error has just been a one-off event then Sun Support is likely to say "call us next time it happens again on that CPU *then* we'll replace it."

If this has happened more than once already then Sun should replace the CPU module.

Believe it or not, transient e-cache errors can be caused by cosmic rays, however the chances of naturally occurring causes of the error hitting the same CPU twice are almost zero. Hence the "replace after 2 hits" policy.

The newer UltraSPARC III CPUs now in all new Sun h/w have ECC functions to allow single bit error correction on transient e-cache errors and the latest policy on replacing those CPUs is >24 errors in 24 hours, or immediately on any double bit errors.

A very detailed scientific document on the subject of naturally occurring causes of memory faults can be found at the following URL: 

Don't ask me how I came to have this URL, I bookmarked it years ago and it's thankfully still there, otherwise people wouldn't believe me when I mention cosmic rays.  HTH

- CB

Expert Comment

ID: 12542713
Hey liddler: what about some points for the info about cosmic rays?  :-)

Featured Post

DevOps Toolchain Recommendations

Read this Gartner Research Note and discover how your IT organization can automate and optimize DevOps processes using a toolchain architecture.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Suggested Solutions

When you do backups in the Solaris Operating System, the file system must be inactive. Otherwise, the output may be inconsistent. A file system is inactive when it's unmounted or it's write-locked by the operating system. Although the fssnap utility…
Introduction Regular patching is part of a system administrator's tasks. However, many patches require that the system be in single-user mode before they can be installed. A cluster patch in particular can take quite a while to apply if the machine…
Learn how to find files with the shell using the find and locate commands. Use locate to find a needle in a haystack.: With locate, check if the file still exists.: Use find to get the actual location of the file.:
In a previous video, we went over how to export a DynamoDB table into Amazon S3.  In this video, we show how to load the export from S3 into a DynamoDB table.

803 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question