Still celebrating National IT Professionals Day with 3 months of free Premium Membership. Use Code ITDAY17

x
?
Solved

iscsi-target and Highpoint RAID occasionally resets

Posted on 2011-09-07
14
Medium Priority
?
523 Views
Last Modified: 2016-09-26
I am running a couple of iSCSI target boxes.  Occasionally, we see resets on the RAID controller that cause the storage system to turn into read only and all my LUN's drop.  We have to restart the iscsi-target server, and then restart all the servers with attached LUN's.  We have no notice that this will take place, and the RAID controller event log shows nothing at all.

The servers attaching to the LUN's are Windows 2003/2008, and CentOS 5.x servers.

Here is the error message we see in the /var/log/messages file when this happens.
kernel: hptiop_reset(4/0/0) scp=ffff8101eb41ab00

Basic Configuration:
Intel Server, dual 5400 processors, 8GB RAM
6 Intel Ehternet (2 on board, 2 duals)
Highpoint RAID 4320
0
Comment
Question by:thetechgroup
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 8
  • 5
14 Comments
 
LVL 2

Accepted Solution

by:
McRonis earned 1500 total points
ID: 36499266
Looks like you got faulty raid adapter. Update firmware for raid card, if it doesn't help, then you should RMA it.
0
 

Author Comment

by:thetechgroup
ID: 36499594
We tried 3 of the same card, and they have all, eventually, produced a similar result.
0
 

Author Comment

by:thetechgroup
ID: 36499618
I forgot to mention that the Highpoint RAID controller 4320, also has a backup battery.
0
Survive A High-Traffic Event with Percona

Your application or website rely on your database to deliver information about products and services to your customers. You can’t afford to have your database lose performance, lose availability or become unresponsive – even for just a few minutes.

 
LVL 2

Expert Comment

by:McRonis
ID: 36501036
Tell more about your iscsi server, what kind software you are using ?
0
 

Author Comment

by:thetechgroup
ID: 36504663
I am using CentOS 5.4 very vanilla installation.  Kernel 2.6.18-164 #1 SMP X86_64

There are 5 subnets on the server.  1 of the 2 onboards used for Management, the other 4 for iSCSI data.  We use the basic intel drivers that came with CentOS.

You know about the Raid Controller, and we are using the driver provided on the web site for CentIOS 5.4

iscsi-target is the latest version 1.4.20-2

Did I answer your question?

0
 
LVL 2

Expert Comment

by:McRonis
ID: 36504837
You could install openfiler for testing purpose, do the same setup ?
IF it doesn't help, try to setup newer CentOS, like version 5.6, or even 6
0
 
LVL 2

Expert Comment

by:McRonis
ID: 36504943
Maybe it is caused by heat ? I mean, RAID card processor is getting too hot, and it's not working right.
It can be possible ?
0
 

Author Comment

by:thetechgroup
ID: 36505370
I have not thought about heat...  The box is definitely quite tight and it sits in a tight cabinet.  I need to put a heat sensor on it to keep an eye on any possible intake problems.  I'll watch for some time and report back.

As far as openfiler, we had issues with it and it was behaving worse than the clean install.  We thought about upgrading, but the HighPoint drivers are specific to 5.4
0
 
LVL 2

Expert Comment

by:McRonis
ID: 36505460
Anyway you can try these drivers with newer CentOS .
It is specific to 5.4 because, when the Highpoint RAID 4320 was released, there wasn't RedHat/Centos version 6 , nor CentOS 5.6 or 5.7
0
 

Author Comment

by:thetechgroup
ID: 36505890
Thanks.  Let us try a couple of these and report back.
0
 

Author Comment

by:thetechgroup
ID: 36584683
Last weekend we loaded the latest firmware onto the RAID controller.  It had an older version.  We moved it into a more data intensive environment.  It has been running for 4 days.  We'll see in 6 weeks.
0
 

Author Comment

by:thetechgroup
ID: 37698820
It looks like the updated RAID Controller firmware and Updated Intel e1000 drivers did the trick.
0
 

Author Closing Comment

by:thetechgroup
ID: 37698829
It worked in combination with network driver update
0
 

Expert Comment

by:john doe
ID: 41816397
i've had several of these cards exhibit the same behavior and finally nailed down the fix
the vcore regulators on the back of the card do not get enough airflow to really cool them if there is a solid backplate in the card slot next to them (125-140 deg C)
the regulators get too hot and vcore droops under load causing the iop to reset
shoved a few heatsinks on them and problem hasn't returned for weeks the tantalum capacitors may also be at fault but i haven't gotten around to replacing them on any of my cards
0

Featured Post

Free Tool: IP Lookup

Get more info about an IP address or domain name, such as organization, abuse contacts and geolocation.

One of a set of tools we are providing to everyone as a way of saying thank you for being a part of the community.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Google Drive is extremely cheap offsite storage, and it's even possible to get extra storage for free for two years.  You can use the free account 15GB, and if you have an Android device..when you install Google Drive for the first time it will give…
Many businesses neglect disaster recovery and treat it as an after-thought. I can tell you first hand that data will be lost, hard drives die, servers will be hacked, and careless (or malicious) employees can ruin your data.
Learn how to get help with Linux/Unix bash shell commands. Use help to read help documents for built in bash shell commands.: Use man to interface with the online reference manuals for shell commands.: Use man to search man pages for unknown command…
Learn how to navigate the file tree with the shell. Use pwd to print the current working directory: Use ls to list a directory's contents: Use cd to change to a new directory: Use wildcards instead of typing out long directory names: Use ../ to move…
Suggested Courses

722 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question