Solved

solaris error.

Posted on 2004-10-04
9
688 Views
Last Modified: 2013-12-21
Hallow,
/var/adm/messages show the following. uname -r shows 5.7
"unix: [AFT2] errID 0x002e7772.72894422 PA=0x00000000.0c927230
E$tag 0x00000000.08400192 E$State: Shared E$parity 0x04
unix: [AFT2] E$Data (0x00): 0x0014a174.0014a17c
 [AFT2] E$Data (0x08): 0x0014a184.0014a18c
 unix: [AFT2] E$Data (0x10): 0x0014a190.0014a198
 unix: [AFT2] E$Data (0x18): 0x0014a19c.00138f10
 unix: [AFT2] E$Data (0x20): 0x00112b00.00112c00
 unix: [AFT2] E$Data (0x28): 0x00138fd4.00112c50
 unix: [AFT2] E$Data (0x30): 0x00112c5f.00112c6e *Bad* PSYND=0xff00
 unix: [AFT2] E$Data (0x38): 0x00112c80.00138fd8
 unix: [AFT2] errID 0x002e7772.72894422 PA=0x00000000.0c927230
 E$tag 0x00000000.08400192 E$State: Shared E$parity 0x04
 unix: [AFT2] E$Data (0x00): 0x0014a174.0014a17c
 unix: [AFT2] E$Data (0x08): 0x0014a184.0014a18c
 unix: [AFT2] E$Data (0x10): 0x0014a190.0014a198
 unix: [AFT2] E$Data (0x18): 0x0014a19c.00138f10
 unix: [AFT2] E$Data (0x20): 0x00112b00.00112c00
 unix: [AFT2] E$Data (0x28): 0x00138fd4.00112c50
 unix: [AFT2] E$Data (0x30): 0x00112c5f.00112c6e *Bad* PSYND=0x0400
 unix: [AFT2] E$Data (0x38): 0x00112c80.00138fd8
 unix: [AFT3] errID 0x002e7772.72894422 Above Error is in User Mode
 and is fatal: will reboot......
plz.explain what the error stands for and how to debug it.
Thanx,
BY
0
Comment
Question by:byerabati
9 Comments
 
LVL 18

Accepted Solution

by:
liddler earned 125 total points
ID: 12216165
It looks like you have a bad memory module or CPU, assuming you have a service contract with sun, you need to get them to check the explorer output and change the hardware
0
 

Author Comment

by:byerabati
ID: 12216214
Thanx liddler,
how can i know the error is due to bad memory or cpu.how to read the error.

Thanx
0
 
LVL 18

Expert Comment

by:liddler
ID: 12216538
I don't know, any error like this I pass to Sun, as they support the hardware on all of my systems.

You could have a look at the output of
/usr/platform/`uname -i`/sbin/prtdiag -v
to see  if that reports any hardware problems

Also have a look at:
http://forums.sunhelp.org/archive/index.php/t-372
http://sunportal.sunmanagers.org/pipermail/summaries/2004-January/004790.html
http://sunportal.sunmanagers.org/pipermail/summaries/2002-January/000662.html
0
Is Your Active Directory as Secure as You Think?

More than 75% of all records are compromised because of the loss or theft of a privileged credential. Experts have been exploring Active Directory infrastructure to identify key threats and establish best practices for keeping data safe. Attend this month’s webinar to learn more.

 
LVL 2

Expert Comment

by:Matt_Avery
ID: 12247892
liddler is correct. It's an E-Cache ("E$") error, which means that there is some kind of hardware problem with the CPU cache memory.

If you send that data to Sun, they will most likely accept it as proof of a hardware failure and fix it for you if you have a support contract or are covered under warranty.

0
 
LVL 1

Assisted Solution

by:SumeshDaftary
SumeshDaftary earned 125 total points
ID: 12280824
Bayerbati

Its sure that its E-cache problem.

This following link will help you to understand conversion as well.

http://forums.sunhelp.org/archive/index.php/t-372 

you may need to replace CPU on board 0

0
 

Expert Comment

by:colinbo
ID: 12294408
Just one point to clarify regarding Sun's support policy on the E-cache errors having spent the last 6 years in a Sun Solution Centre (before I was laid off and my job got out-sourced to low-cost engineers in India):
If this error has just been a one-off event then Sun Support is likely to say "call us next time it happens again on that CPU *then* we'll replace it."

If this has happened more than once already then Sun should replace the CPU module.

Believe it or not, transient e-cache errors can be caused by cosmic rays, however the chances of naturally occurring causes of the error hitting the same CPU twice are almost zero. Hence the "replace after 2 hits" policy.

The newer UltraSPARC III CPUs now in all new Sun h/w have ECC functions to allow single bit error correction on transient e-cache errors and the latest policy on replacing those CPUs is >24 errors in 24 hours, or immediately on any double bit errors.

A very detailed scientific document on the subject of naturally occurring causes of memory faults can be found at the following URL:
http://www.research.ibm.com/journal/rd/421/ziegler.html 

Don't ask me how I came to have this URL, I bookmarked it years ago and it's thankfully still there, otherwise people wouldn't believe me when I mention cosmic rays.  HTH

- CB
0
 

Expert Comment

by:colinbo
ID: 12542713
Hey liddler: what about some points for the info about cosmic rays?  :-)
0

Featured Post

Is Your Active Directory as Secure as You Think?

More than 75% of all records are compromised because of the loss or theft of a privileged credential. Experts have been exploring Active Directory infrastructure to identify key threats and establish best practices for keeping data safe. Attend this month’s webinar to learn more.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Suggested Solutions

Title # Comments Views Activity
nodeip 9 75
UNiX Script filesystem space usage 19 67
Python Assistance 7 72
Solaris 4.1.3 cloning and booting 13 71
I have been running these systems for a few years now and I am just very happy with them.   I just wanted to share the manual that I have created for upgrades and other things.  Oooh yes! FreeBSD makes me happy (as a server), no maintenance and I al…
Java performance on Solaris - Managing CPUs There are various resource controls in operating system which directly/indirectly influence the performance of application. one of the most important resource controls is "CPU".   In a multithreaded…
Learn how to get help with Linux/Unix bash shell commands. Use help to read help documents for built in bash shell commands.: Use man to interface with the online reference manuals for shell commands.: Use man to search man pages for unknown command…
Learn how to navigate the file tree with the shell. Use pwd to print the current working directory: Use ls to list a directory's contents: Use cd to change to a new directory: Use wildcards instead of typing out long directory names: Use ../ to move…

914 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

14 Experts available now in Live!

Get 1:1 Help Now