solaris error.

Posted on 2004-10-04
Medium Priority
Last Modified: 2013-12-21
/var/adm/messages show the following. uname -r shows 5.7
"unix: [AFT2] errID 0x002e7772.72894422 PA=0x00000000.0c927230
E$tag 0x00000000.08400192 E$State: Shared E$parity 0x04
unix: [AFT2] E$Data (0x00): 0x0014a174.0014a17c
 [AFT2] E$Data (0x08): 0x0014a184.0014a18c
 unix: [AFT2] E$Data (0x10): 0x0014a190.0014a198
 unix: [AFT2] E$Data (0x18): 0x0014a19c.00138f10
 unix: [AFT2] E$Data (0x20): 0x00112b00.00112c00
 unix: [AFT2] E$Data (0x28): 0x00138fd4.00112c50
 unix: [AFT2] E$Data (0x30): 0x00112c5f.00112c6e *Bad* PSYND=0xff00
 unix: [AFT2] E$Data (0x38): 0x00112c80.00138fd8
 unix: [AFT2] errID 0x002e7772.72894422 PA=0x00000000.0c927230
 E$tag 0x00000000.08400192 E$State: Shared E$parity 0x04
 unix: [AFT2] E$Data (0x00): 0x0014a174.0014a17c
 unix: [AFT2] E$Data (0x08): 0x0014a184.0014a18c
 unix: [AFT2] E$Data (0x10): 0x0014a190.0014a198
 unix: [AFT2] E$Data (0x18): 0x0014a19c.00138f10
 unix: [AFT2] E$Data (0x20): 0x00112b00.00112c00
 unix: [AFT2] E$Data (0x28): 0x00138fd4.00112c50
 unix: [AFT2] E$Data (0x30): 0x00112c5f.00112c6e *Bad* PSYND=0x0400
 unix: [AFT2] E$Data (0x38): 0x00112c80.00138fd8
 unix: [AFT3] errID 0x002e7772.72894422 Above Error is in User Mode
 and is fatal: will reboot......
plz.explain what the error stands for and how to debug it.
Question by:byerabati
LVL 18

Accepted Solution

liddler earned 500 total points
ID: 12216165
It looks like you have a bad memory module or CPU, assuming you have a service contract with sun, you need to get them to check the explorer output and change the hardware

Author Comment

ID: 12216214
Thanx liddler,
how can i know the error is due to bad memory or cpu.how to read the error.

LVL 18

Expert Comment

ID: 12216538
I don't know, any error like this I pass to Sun, as they support the hardware on all of my systems.

You could have a look at the output of
/usr/platform/`uname -i`/sbin/prtdiag -v
to see  if that reports any hardware problems

Also have a look at:
Train for your Pen Testing Engineer Certification

Enroll today in this bundle of courses to gain experience in the logistics of pen testing, Linux fundamentals, vulnerability assessments, detecting live systems, and more! This series, valued at $3,000, is free for Premium members, Team Accounts, and Qualified Experts.


Expert Comment

ID: 12247892
liddler is correct. It's an E-Cache ("E$") error, which means that there is some kind of hardware problem with the CPU cache memory.

If you send that data to Sun, they will most likely accept it as proof of a hardware failure and fix it for you if you have a support contract or are covered under warranty.


Assisted Solution

SumeshDaftary earned 500 total points
ID: 12280824

Its sure that its E-cache problem.

This following link will help you to understand conversion as well.


you may need to replace CPU on board 0


Expert Comment

ID: 12294408
Just one point to clarify regarding Sun's support policy on the E-cache errors having spent the last 6 years in a Sun Solution Centre (before I was laid off and my job got out-sourced to low-cost engineers in India):
If this error has just been a one-off event then Sun Support is likely to say "call us next time it happens again on that CPU *then* we'll replace it."

If this has happened more than once already then Sun should replace the CPU module.

Believe it or not, transient e-cache errors can be caused by cosmic rays, however the chances of naturally occurring causes of the error hitting the same CPU twice are almost zero. Hence the "replace after 2 hits" policy.

The newer UltraSPARC III CPUs now in all new Sun h/w have ECC functions to allow single bit error correction on transient e-cache errors and the latest policy on replacing those CPUs is >24 errors in 24 hours, or immediately on any double bit errors.

A very detailed scientific document on the subject of naturally occurring causes of memory faults can be found at the following URL:

Don't ask me how I came to have this URL, I bookmarked it years ago and it's thankfully still there, otherwise people wouldn't believe me when I mention cosmic rays.  HTH

- CB

Expert Comment

ID: 12542713
Hey liddler: what about some points for the info about cosmic rays?  :-)

Featured Post

Free Tool: Site Down Detector

Helpful to verify reports of your own downtime, or to double check a downed website you are trying to access.

One of a set of tools we are providing to everyone as a way of saying thank you for being a part of the community.

Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.

Join & Write a Comment

When you do backups in the Solaris Operating System, the file system must be inactive. Otherwise, the output may be inconsistent. A file system is inactive when it's unmounted or it's write-locked by the operating system. Although the fssnap utility…
Installing FreeBSD… FreeBSD is a darling of an operating system. The stability and usability make it a clear choice for servers and desktops (for the cunning). Savvy?  The Ports collection makes available every popular FOSS application and packag…
This video shows how to set up a shell script to accept a positional parameter when called, pass that to a SQL script, accept the output from the statement back and then manipulate it in the Shell.
In a previous video, we went over how to export a DynamoDB table into Amazon S3.  In this video, we show how to load the export from S3 into a DynamoDB table.

624 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question