Event ID 129 Source HPCISSs2

Hello,

I have a Proliant DL180G5 E5420 with a RAID configuration that has a Smart Array E200 with a RAID 1 with 2 Hard drives for my OS and a RAID 5  with 5 hard drives for my files. This is a File server running Windows 2003 R2 SP2.  Everything has been running great until just a couple of days ago when I noticed some lag when accessing files. I logged into my server and checked the event log, I noticed Event ID 129 with the following description:

Event Type:      Warning
Event Source:      HpCISSs2
Event Category:      None
Event ID:      129
Date:            8/28/2015
Time:            1:59:15 AM
User:            N/A
Computer:      ADVANCE_SERVER
Description:
The description for Event ID ( 129 ) in Source ( HpCISSs2 ) cannot be found. The local computer may not have the necessary registry information or message DLL files to display messages from a remote computer. You may be able to use the /AUXSOURCE= flag to retrieve this description; see Help and Support for details. The following information is part of the event: \Device\RaidPort0.


I have contacted HP and they had me update the Driver and Firmware to the latest version of my RAID. This did not resolve the issue.

I have noticed that this message pops up when there is a lot of activity on the server or when there are files being copied to and from the server. It does not seem to happen all of the time but does happen about every hour more. When I don't have users logged into the server like now at 2:44am I do not see the event being posted on event viewer.

Has anyone seen this type of message?

Thank you in advance
cisgoro76Asked:
Who is Participating?
I wear a lot of hats...

"The solutions and answers provided on Experts Exchange have been extremely helpful to me over the last few years. I wear a lot of hats - Developer, Database Administrator, Help Desk, etc., so I know a lot of things but not a lot about one thing. Experts Exchange gives me answers from people who do know a lot about one thing, in a easy to use platform." -Todd S.

TimorosCommented:
Did you check your HDDs for errors ?
You can try this tool http://www8.hp.com/us/en/products/server-software/product-detail.html?oid=489496 (HP Websim) and look at the stats for each hard disk.
0
cisgoro76Author Commented:
hello,

i have not used tgis tool but i do have one installed on the server called hp array configuration utility which checks the health or your array and harddrive, no errors. i have also called HP and they ran a diagnostic on the server that also came back with no errors on hardware.
0
cisgoro76Author Commented:
Has anyone else seen this error message and know of a solution?
0
Cloud Class® Course: Python 3 Fundamentals

This course will teach participants about installing and configuring Python, syntax, importing, statements, types, strings, booleans, files, lists, tuples, comprehensions, functions, and classes.

andyalderCommented:
It appears to be a bug - http://h30499.www3.hp.com/t5/ProLiant-Servers-ML-DL-SL/HPCISSs2-Event-ID-129-appears-after-applying-HP-Service-Pack-for/td-p/6746271#.VeB6zJf7Ph4

May be a dodgy disk though, if you upload an ADU report I can look through it for you (either start-hp system tools-ADU or the newer ACU has an option to generate ADU report). Note that the ACU may show everything as good even if there is a disk with hundreds of bad blocks on it.

HP SIM isn't any use for this, it's for monitoring large herds of servers via SNMP/WBEM.
0
cisgoro76Author Commented:
Hi AndyAlder

I am attaching a ACU report . Thank you for your help.
report-a1352c8f-00000080-00000000.zip
0
andyalderCommented:
Disk 7 has bad sectors and should be replaced, it has a bit error rate of 1 in 1000. It's throwing about 15 errors per hour on average. Was your last reboot 3 weeks ago?


Smart Array E200 in Embedded Slot : Internal Drive Cage at Port 2I : Box 1 : Drive Cage on Port 2I : Physical Drive 2I:1:7 : Monitor and Performance Statistics (Since Reset)

   Serial Number                        WMAYP4952032        
   Firmware Revision                    HPG2
   Product Revision                     ATA     MB0500EBNCR    
   Reference Time                       0x000001fd
   Sectors Read                         0x0000000000245330
   Read Errors Hard                     0x00000a9c                       <-A9C(hex) errors since boot
   Read Errors Retry Recovered          0x00000000
   Read Errors ECC Corrected            0x0000000000000000
   Sectors Written                      0x00000000002adba2
0
cisgoro76Author Commented:
my last reboot was lastnight.
0
andyalderCommented:
How did I get that wrong? 1FD(hex) minutes is 8.5 hours so it's throwing 10 errors per minute, not 10 per hour. Did HP really say there was nothing wrong with the array?
1

Experts Exchange Solution brought to you by

Your issues matter to us.

Facing a tech roadblock? Get the help and guidance you need from experienced professionals who care. Ask your question anytime, anywhere, with no hassle.

Start your 7-day free trial
cisgoro76Author Commented:
Thank you very much, I was able to replace drive 7 and server is working normal again. I had to practically argue with HP support to have them send me a spare drive. We have support with them but they insisted that the issue was the OS. They wanted me to reinstall the OS. I removed drive 7, restarted the server and it came back up super fast obviously my RAID advised me that there was a drive missing when I did that. I received the spare drive a couple of hours later and after installing the drive it rebuilt itself overnight. Im getting a couple of event ID 55 which means that some of my files may be corrupted but I may just need to run chkdisk to resolve this. The server is running much better and Im not getting the event id 129 that I was getting before.

Thanks for your help with this. HP support has serious issues. I really feel like calling them back and tell them what the issue really was.
0
Senior IT System EngineerIT ProfessionalCommented:
So what was the fix in the issue above ?

I also getting the smae problem here every hours:

Log Name:      System
Source:        HpCISSs2
Date:          7/04/2016 11:39:11 AM
Event ID:      129
Task Category: None
Level:         Warning
Keywords:      Classic
User:          N/A
Computer:      PRODBU01.domain.com
Description:
Reset to device, \Device\RaidPort1, was issued.

Open in new window

0
andyalderCommented:
ITSystemEngineer, post a new question and include an ADU report with it so we can see if you have a bad disk.
1
It's more than this solution.Get answers and train to solve all your tech problems - anytime, anywhere.Try it for free Edge Out The Competitionfor your dream job with proven skills and certifications.Get started today Stand Outas the employee with proven skills.Start learning today for free Move Your Career Forwardwith certification training in the latest technologies.Start your trial today
Windows Server 2003

From novice to tech pro — start learning today.

Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.