Memory Error On Dell PowerEdge 2950 Server

I am getting the following error on my Dell 2950 DRAC log:

Mem Fatal SB CRC: memory sensor, uncorrectable ECC was asserted

Also, I see an amber warning on the bezel and the following message is displayed:


We purchased an installed new memory modules (32GB) for this machine about 6 months ago and these messages just appeared last month.  How can I troubleshoot the issue to determine:

1.  If there really is a physical memory issue
2.  What modules are the source

This is a production database server, so I'd like to be able to perform the memory diagnostics without shutting the machine down.
Who is Participating?
PCBONEZConnect With a Mentor Commented:
You should check here first.
That error is specifically addressed under E2119.
The Sensor is generally correct when it say memory errorl.

Here a few thing to try to ensure it correct.
Have you tried changing the memory with another one like switching around the memory and seeing if it goes away.
I would also take the memory out and test in different system with memtest86+

The solution link is no longer valid.  I am getting the same error so I would like to see the solution.
artisitAuthor Commented:

We purchased all new memory modules.  Our memory vendor replaced them as they were defective.  The error indicated a hardware problem while led us to the RMA problem with the vendor.
Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.

All Courses

From novice to tech pro — start learning today.