solaris error.

Posted on 2004-10-04
Last Modified: 2013-12-21
/var/adm/messages show the following. uname -r shows 5.7
"unix: [AFT2] errID 0x002e7772.72894422 PA=0x00000000.0c927230
E$tag 0x00000000.08400192 E$State: Shared E$parity 0x04
unix: [AFT2] E$Data (0x00): 0x0014a174.0014a17c
 [AFT2] E$Data (0x08): 0x0014a184.0014a18c
 unix: [AFT2] E$Data (0x10): 0x0014a190.0014a198
 unix: [AFT2] E$Data (0x18): 0x0014a19c.00138f10
 unix: [AFT2] E$Data (0x20): 0x00112b00.00112c00
 unix: [AFT2] E$Data (0x28): 0x00138fd4.00112c50
 unix: [AFT2] E$Data (0x30): 0x00112c5f.00112c6e *Bad* PSYND=0xff00
 unix: [AFT2] E$Data (0x38): 0x00112c80.00138fd8
 unix: [AFT2] errID 0x002e7772.72894422 PA=0x00000000.0c927230
 E$tag 0x00000000.08400192 E$State: Shared E$parity 0x04
 unix: [AFT2] E$Data (0x00): 0x0014a174.0014a17c
 unix: [AFT2] E$Data (0x08): 0x0014a184.0014a18c
 unix: [AFT2] E$Data (0x10): 0x0014a190.0014a198
 unix: [AFT2] E$Data (0x18): 0x0014a19c.00138f10
 unix: [AFT2] E$Data (0x20): 0x00112b00.00112c00
 unix: [AFT2] E$Data (0x28): 0x00138fd4.00112c50
 unix: [AFT2] E$Data (0x30): 0x00112c5f.00112c6e *Bad* PSYND=0x0400
 unix: [AFT2] E$Data (0x38): 0x00112c80.00138fd8
 unix: [AFT3] errID 0x002e7772.72894422 Above Error is in User Mode
 and is fatal: will reboot......
plz.explain what the error stands for and how to debug it.
Question by:byerabati
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
LVL 18

Accepted Solution

liddler earned 125 total points
ID: 12216165
It looks like you have a bad memory module or CPU, assuming you have a service contract with sun, you need to get them to check the explorer output and change the hardware

Author Comment

ID: 12216214
Thanx liddler,
how can i know the error is due to bad memory or to read the error.

LVL 18

Expert Comment

ID: 12216538
I don't know, any error like this I pass to Sun, as they support the hardware on all of my systems.

You could have a look at the output of
/usr/platform/`uname -i`/sbin/prtdiag -v
to see  if that reports any hardware problems

Also have a look at:
Technology Partners: We Want Your Opinion!

We value your feedback.

Take our survey and automatically be enter to win anyone of the following:
Yeti Cooler, Amazon eGift Card, and Movie eGift Card!


Expert Comment

ID: 12247892
liddler is correct. It's an E-Cache ("E$") error, which means that there is some kind of hardware problem with the CPU cache memory.

If you send that data to Sun, they will most likely accept it as proof of a hardware failure and fix it for you if you have a support contract or are covered under warranty.


Assisted Solution

SumeshDaftary earned 125 total points
ID: 12280824

Its sure that its E-cache problem.

This following link will help you to understand conversion as well. 

you may need to replace CPU on board 0


Expert Comment

ID: 12294408
Just one point to clarify regarding Sun's support policy on the E-cache errors having spent the last 6 years in a Sun Solution Centre (before I was laid off and my job got out-sourced to low-cost engineers in India):
If this error has just been a one-off event then Sun Support is likely to say "call us next time it happens again on that CPU *then* we'll replace it."

If this has happened more than once already then Sun should replace the CPU module.

Believe it or not, transient e-cache errors can be caused by cosmic rays, however the chances of naturally occurring causes of the error hitting the same CPU twice are almost zero. Hence the "replace after 2 hits" policy.

The newer UltraSPARC III CPUs now in all new Sun h/w have ECC functions to allow single bit error correction on transient e-cache errors and the latest policy on replacing those CPUs is >24 errors in 24 hours, or immediately on any double bit errors.

A very detailed scientific document on the subject of naturally occurring causes of memory faults can be found at the following URL: 

Don't ask me how I came to have this URL, I bookmarked it years ago and it's thankfully still there, otherwise people wouldn't believe me when I mention cosmic rays.  HTH

- CB

Expert Comment

ID: 12542713
Hey liddler: what about some points for the info about cosmic rays?  :-)

Featured Post

Free Tool: SSL Checker

Scans your site and returns information about your SSL implementation and certificate. Helpful for debugging and validating your SSL configuration.

One of a set of tools we are providing to everyone as a way of saying thank you for being a part of the community.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

When you do backups in the Solaris Operating System, the file system must be inactive. Otherwise, the output may be inconsistent. A file system is inactive when it's unmounted or it's write-locked by the operating system. Although the fssnap utility…
Java performance on Solaris - Managing CPUs There are various resource controls in operating system which directly/indirectly influence the performance of application. one of the most important resource controls is "CPU".   In a multithreaded…
Learn several ways to interact with files and get file information from the bash shell. ls lists the contents of a directory: Using the -a flag displays hidden files: Using the -l flag formats the output in a long list: The file command gives us mor…
In a previous video, we went over how to export a DynamoDB table into Amazon S3.  In this video, we show how to load the export from S3 into a DynamoDB table.

696 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question