Solved

Hardware Malfunction - NMI Parity Check / Memory Parity Error

Posted on 2011-03-17
16
1,885 Views
Last Modified: 2012-06-27
I have a Dell Precision 210 (2-PIII processors, 600mhz, 4x128MB ECC RAM) Rrunning Windows Server 2003.

Almost every day, it blue screens with the message:

Hardware Malfunction
Call you hardware vendor for support
NMI: Parity Check / Memory Parity Error

Memory passes the recommended Windows Memory Diagnostic found here: http://oca.microsoft.com/en/windiag.asp

I've replaced the memory (they are all matching sticks), run the memory tests, and I still get the blue screen.

I've replaced the box (moved drives and memory to a new Precision 210 box), run the memory tests, and I still get the blue screen.

Any ideas?
0
Comment
Question by:nasupport1
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 7
  • 4
  • 3
  • +1
16 Comments
 
LVL 95

Expert Comment

by:John Hurst
ID: 35156315
So you replaced the memory (and continue to use the new memory) and moved the new memory and existing hard drive to a new box (different motherboard and peripherals) and get the same error.

So it must be the operating system throwing up this error. Try downloading and installing all new drivers for this OS (video, audio, network cards, chipset and so on). ... Thinkpads_User
0
 
LVL 4

Expert Comment

by:cavp76
ID: 35156324
Have you used memtest86: http://www.memtest86.com/? It has many options for configuring memory tests... definitely there's a bad module, if there's no error detected, try booting the machine taking one stick (or two, up on if it supports an odd number of RAM sticks) and seeing if it gives again the BSOD.
0
 
LVL 5

Expert Comment

by:Phoenixke
ID: 35156344
Might be a long shot but have you tried looking for upgraded BIOS firmware and drivers?
I only suggest it because you've already done the heavy lifting and changed motherboard/ram...
0
Resolve Critical IT Incidents Fast

If your data, services or processes become compromised, your organization can suffer damage in just minutes and how fast you communicate during a major IT incident is everything. Learn how to immediately identify incidents & best practices to resolve them quickly and effectively.

 
LVL 95

Expert Comment

by:John Hurst
ID: 35156370
I am not sure about the overall fit for System File Checker and this situation, but in addition to drivers (my earlier post), try running (from a command prompt) SFC /SCANNOW. Let it complete and restart.

... Thinkpads_User
0
 

Author Comment

by:nasupport1
ID: 35156622
@thinkpads_user - It could be the OS, or an application that's causing it.  I'll explore updating drivers for the hardware, but despite it's age, it's up-to-date.  I'll try the SFC, as well.

@cavp76 - I have a copy of Memtest86, and I guess I'll try it.  It doesn't seem to be a memory issue despite the memory error.  I've systematically removed and replaced RAM, and I still get the message.  In fact, I can remove and reseat the RAM, and it will boot up.  Then, maybe a day or two later, it blue screens again.

@phoenixke - I'll try the driver updates, but the BIOS may be tough - it's an older server.
0
 
LVL 4

Expert Comment

by:cavp76
ID: 35157186
That's different... if you say one or two days and it BSODs again, I'll go for a power supply / energy problem; we're talking about machines older than 10 years, leaky capacitors could be throwing a fit, try replacing the PSU. Is this server behind a UPS?
0
 

Author Comment

by:nasupport1
ID: 35157583
Running Memtest-86 v3.5 now - no issues through the first 5 tests.

All capacitors look fine in both of the boxes (old and new), and it crashes on both PSUs.  I can see if I have a newer PSU to try.  It is plugged into an older APC Back-UPS Pro 420.  I'll see if swapping that out makes a difference.
0
 

Author Comment

by:nasupport1
ID: 35157996
Running Memtest-86 v3.5 now - no issues through the first 5 tests.

All capacitors look fine in both of the boxes (old and new), and it crashes on both PSUs.  I can see if I have a newer PSU to try.  It is plugged into an older APC Back-UPS Pro 420.  I'll see if swapping that out makes a difference.
0
 
LVL 95

Expert Comment

by:John Hurst
ID: 35158454
I struggle a little bit that this is a hardware issue. Why? You replaced the memory and changed machines. You would have to have the same hardware error in both machines.

More likely, I think, is an operating system corruption.  ... Thinkpads_User
0
 

Author Comment

by:nasupport1
ID: 35159433
@thinkpads_user - I'm not ruling a corrupt OS out, or an application causing the error.  I started with hardware troubleshooting because of the nature of the error message: "Harware Malfunction".  But I do see what you're saying - the hard drive is a common denominator.  Event logs don't indicate anything out of the ordinary for this behavior.
0
 
LVL 4

Expert Comment

by:cavp76
ID: 35159571
Agreed... try smartmontools; it will give you all the information it can extract from the disk about S.M.A.RT. status; usually the examples are enough to get a quick view of the status of the HDD
0
 

Author Comment

by:nasupport1
ID: 35182900
The server has a RealTek Gigabit ethernet card that does not include Server 2003 as a supported OS, despite it installing drivers for it.  It appeared to install correctly, and has worked normally.  I have disabled it to rule it out as a culprit for this issue.  I will keep the ticket updated.
0
 
LVL 95

Expert Comment

by:John Hurst
ID: 35409123
I think we gave a decent answer here (corrupt OS).  ... Thinkpads_User
0
 

Accepted Solution

by:
nasupport1 earned 0 total points
ID: 35437922
My apologies....I have been away.

The server in question has been functional since the removal of the NIC, which it turns out is not supported by the OS.  I will close this question.

Thanks to everyone for their suggestions.
0
 

Author Closing Comment

by:nasupport1
ID: 35458592
I investigated the system requirements of the Realtek Gigabit NIC, and it does not support the currently installed Operating System.

Since removing the incompatible NIC, the system has remained up and functional, with no more blue screens.
0

Featured Post

Industry Leaders: We Want Your Opinion!

We value your feedback.

Take our survey and automatically be enter to win anyone of the following:
Yeti Cooler, Amazon eGift Card, and Movie eGift Card!

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

More or less everybody in the IT market understands the basics of Networking, however when we start talking about Storage Networks, things get a bit dizzier, and this is where I would like to help.
Data center, now-a-days, is referred as the home of all the advanced technologies. In-fact, most of the businesses are now establishing their entire organizational structure around the IT capabilities.
There are cases when e.g. an IT administrator wants to have full access and view into selected mailboxes on Exchange server, directly from his own email account in Outlook or Outlook Web Access. This proves useful when for example administrator want…
Do you want to know how to make a graph with Microsoft Access? First, create a query with the data for the chart. Then make a blank form and add a chart control. This video also shows how to change what data is displayed on the graph as well as form…

687 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question