Link to home
Start Free TrialLog in
Avatar of HappyEngineer
HappyEngineer

asked on

"Single-Bit ECC Errors in Memory Bank"

At bootup I see the message "Single-Bit ECC Errors in Memory Bank", but I am able to proceed with booting and nothing seems wrong.

I assume that at least one of the banks of ram is bad (I think I have either 4 or 8 banks in there).

Should I pull one out at a time until I find the broken one (and then replace it) or is it safe to just ignore it? If I can ignore it, is the speed of the ram impacted by this at all?

I don't know if my type of memory corrects single bit errors or simply detects them.
SOLUTION
Avatar of jhance
jhance

Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial
ASKER CERTIFIED SOLUTION
Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial
Avatar of engineer_dell
engineer_dell

ECC has the ability to correct a detected single-bit error in a 64-bit block of memory. When this happens, the computer will continue without a hiccup; it will have no idea that anything even happened. However, if you have a corrected error, it is useful to know this; a pattern of errors can indicate a hardware problem that needs to be addressed. Chipsets allowing ECC normally include a way to report corrected errors to the operating system, but it is up to the operating system to support this.

If a minor (one-bit) memory error occurs, the ECC logic will handle it. If a two-bit or larger error occurs in ECC memory, your system will be halted--similar to what happens with parity memory when any error is encountered.

refer this.
http://www.pcguide.com/ref/ram/errECC-c.html

Or you can download a third party memory testing software to find the fault
http://www.memtest.org/#downiso

Hope you find this helpful,
Engineer_Dell
SOLUTION
Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial
SOLUTION
Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial
The inportance of a good CAS match to the MB chipset is one of THE most important issues for stability.  Single bit ECC errors in this case are more likely related to the MB chipset not liking that ECC Ram, but the CAS issue is still supremely relevant.  With the wrong CAS, this is what you can get with ECC.
SOLUTION
Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial
Avatar of HappyEngineer

ASKER

This is all useful information, but for some reason the message stopped coming up during booting. I've been waiting to see if it reoccurs, but so far it hasn't. I obviously can't go in and change things in order to fix it if it isn't giving me any messages to indicate that it's still happening, so I guess I'll close the question.

I tried memtest86, but it didn't find any problems and I haven't seen the error during bootup since I ran it.
It may have been heat or any condition.  One bits are flakey, but it will return one day, count on it.