Raid Controller Errormessage

Hello,
on a new machine we discovered performance issues. In the RaidControllers
Logfile we found the following Errors:
SEQUENCE NUMBER = 206460
TIME = 05-09-2013 17:01:01
LOCALIZED MESSAGE = Controller ID:  0   Link restored on SAS wide port:   0  PHY       =   2

ID = 3795
SEQUENCE NUMBER = 206459
TIME = 05-09-2013 17:01:01
LOCALIZED MESSAGE = Controller ID:  0   Link lost on SAS wide port:   0  PHY       =   2

ID = 3794
SEQUENCE NUMBER = 206458
TIME = 05-09-2013 17:00:59
LOCALIZED MESSAGE = Controller ID:  0   Link restored on SAS wide port:   0  PHY       =   2

ID = 3793
SEQUENCE NUMBER = 206457
TIME = 05-09-2013 17:00:58
LOCALIZED MESSAGE = Controller ID:  0   Link lost on SAS wide port:   0  PHY       =   2

ID = 3792
SEQUENCE NUMBER = 206456
TIME = 05-09-2013 17:00:50
LOCALIZED MESSAGE = Controller ID:  0   Unexpected sense:   PD       =   Port 0 - 3:1:1Unknown Sense Code,   CDB   =    0x2a 0x00 0x23 0xf4 0xea 0x00 0x00 0x02 0x00 0x00    ,   Sense   =    0x70 0x00 0x0b 0x00 0x00 0x00 0x00 0x18 0x00 0x00 0x00 0x00 0x4b 0x03 0x00 0x00 0x00 0x00 0x00 0x00 0xf5 0x51 0x00 0x00 0x00 0x00 0x00 0x00 0x00 0x00 0x00 0x00

ID = 3791
SEQUENCE NUMBER = 206455
TIME = 05-09-2013 17:00:49
LOCALIZED MESSAGE = Controller ID:  0   Unexpected sense:   PD       =   Port 0 - 3:1:0Unknown Sense Code,   CDB   =    0x2a 0x00 0x23 0xf3 0x32 0x00 0x00 0x02 0x00 0x00    ,   Sense   =    0x70 0x00 0x0b 0x00 0x00 0x00 0x00 0x18 0x00 0x00 0x00 0x00 0x4b 0x03 0x00 0x00 0x00 0x00 0x00 0x00 0xf5 0x51 0x00 0x00 0x00 0x00 0x00 0x00 0x00 0x00 0x00 0x00

So I replaced the Disc on Port 0, now I don't get that much errors but still have 8 Lines with after the rebiuld was finished:

Controller ID:  0   Link restored on SAS wide port:   0  PHY       =   2
Controller ID:  0   Link lost on SAS wide port:   0  PHY       =   2

My question: Are these messages normal or do I have to go deeper and change a cable ord the raidcontroller?

The Raid Controller is a LSI MegaRAID SAS 9271-8i.

- Bernhard
Bernhard KanduthAsked:
Who is Participating?
I wear a lot of hats...

"The solutions and answers provided on Experts Exchange have been extremely helpful to me over the last few years. I wear a lot of hats - Developer, Database Administrator, Help Desk, etc., so I know a lot of things but not a lot about one thing. Experts Exchange gives me answers from people who do know a lot about one thing, in a easy to use platform." -Todd S.

Daniel HelgenbergerCommented:
Hello,
you should investigate further! This is not normal, esp. link losses/restores in rapid succession. Do you have multiple disks on the controller or just one?

There are of course a number of reasons, I try to list a few of them from the most to the least probable:
- If the connector is not directly on the back plane, check the cable connections or swap the SAS cable on Port 0 of the controller
- Swap the disk
- Swap the raid controller, if you only used one disk.
0
Bernhard KanduthAuthor Commented:
Hello,

thank you for your help. what do you mean with "multiple disks"?
I have raid 10 with 4 disks on this controller.
I'll try to change the cables first and see what happens, then I can try to change the raid controller, I hope that's it.
Or could the mainboard also make these issues?

- Bernhard
0
Daniel HelgenbergerCommented:
This is what I meant with multiple disks, you have four, I asked since you can of course operate any RAID controller with one disk only (these LSI's are often on board on better workstations). So the controller seems to be fine, the cause may be the connection of the disk at Port 0 of the controller or that particular disk itself is failing.

If this problem is new and it worked before (so no errors in log files) the failing disk is me likely.

And no, it is highly unlikely that this has something to do with the main board.
0
Powerful Yet Easy-to-Use Network Monitoring

Identify excessive bandwidth utilization or unexpected application traffic with SolarWinds Bandwidth Analyzer Pack.

Bernhard KanduthAuthor Commented:
OK, since this is a new server that hase come from the store I will check the cables.
I did change the disk yesterday.
hope this works out

- Bernhard
0
Bernhard KanduthAuthor Commented:
OK, I have changed both cables of the controller, now it seems to work, but I have
to monitor the server for a while to be shure that the problem is solved.
the thin now is, that from withing the LSI MegaRAID Storage Manager the logs don't show up?
So I'm not able to see in these logs if the problem still occurs?
What I see now is from the windows event logs the problem seems to be gone and from the front plane of the server al lights on the disk are in sync, where before they where not!

- Bernhard
0
Daniel HelgenbergerCommented:
Hello Bernhard,
if you only saw these errors in the error log form windows, you will be most certainly fine since you only see logs of higher priority there (info messages are not logged, error and critical are). Keep a close look at the logs for a week of normal operation though.

If you really want to be sure look at the LSI site to turn on debug logging for some time (but turn it off again for performance!) or use the cli tools form LSI, see this how to:
http://erikimh.com/megacli-cheatsheet/
0

Experts Exchange Solution brought to you by

Your issues matter to us.

Facing a tech roadblock? Get the help and guidance you need from experienced professionals who care. Ask your question anytime, anywhere, with no hassle.

Start your 7-day free trial
It's more than this solution.Get answers and train to solve all your tech problems - anytime, anywhere.Try it for free Edge Out The Competitionfor your dream job with proven skills and certifications.Get started today Stand Outas the employee with proven skills.Start learning today for free Move Your Career Forwardwith certification training in the latest technologies.Start your trial today
Server Hardware

From novice to tech pro — start learning today.