• Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 195
  • Last Modified:

I have a zfs storage box and two xen servers that are having errors

a recent network interruption has now caused errors with my xen servers connection to an ifs share over iSCSI.

If I reboot the Xen server then everything comes backup, but within 20 mins the /var directory on the linux containers goes into read only because of journal errors

Here is an out put of the errors from the zfs storage box

Jan  8 00:06:42 dzfs01 istgt[1211]: Login from iqn.1994-05.com.redhat:d8a183801935 (172.16.30.220) on iqn.dzfs01:alameda LU18 (172.16.30.221:3260,1), ISID=23d0a0000, TSIH=299, CID=0, HeaderDigest=off, DataDigest=off
Jan  8 00:06:43 dzfs01 istgt[1211]: Login from iqn.1994-05.com.redhat:d8a183801935 (172.16.30.220) on iqn.dzfs01:malden LU13 (172.16.30.221:3260,1), ISID=23d100000, TSIH=286, CID=0, HeaderDigest=off, DataDigest=off
Jan  8 00:06:44 dzfs01 istgt[1211]: istgt.c:1411:istgt_acceptor: ***ERROR*** accept error: -1
Jan  8 00:06:44 dzfs01 istgt[1211]: Login from iqn.1994-05.com.redhat:d8a183801935 (172.16.30.220) on iqn.dzfs01:dogeared01 LU14 (172.16.30.221:3260,1), ISID=23d010000, TSIH=297, CID=0, HeaderDigest=off, DataDigest=off
Jan  8 00:06:44 dzfs01 istgt[1211]: istgt.c:1411:istgt_acceptor: ***ERROR*** accept error: -1
Jan  8 00:06:44 dzfs01 istgt[1211]: istgt.c:1411:istgt_acceptor: ***ERROR*** accept error: -1

Jan  8 00:06:45 dzfs01 istgt[1211]: Login from iqn.1994-05.com.redhat:d8a183801935 (172.16.30.220) on iqn.dzfs01:timor LU5 (172.16.30.221:3260,1), ISID=23d050000, TSIH=285, CID=0, HeaderDigest=off, DataDigest=off
Jan  8 00:06:46 dzfs01 istgt[1211]: Login from iqn.1994-05.com.redhat:d8a183801935 (172.16.30.220) on iqn.dzfs01:gargoyle LU2 (172.16.30.221:3260,1), ISID=23d060000, TSIH=293, CID=0, HeaderDigest=off, DataDigest=off
Jan  8 00:06:58 dzfs01 istgt[1211]: istgt.c:1411:istgt_acceptor: ***ERROR*** accept error: -1
Jan  8 00:06:59 dzfs01 last message repeated 5 times
Jan  8 00:06:59 dzfs01 istgt[1211]: Login from iqn.1994-05.com.redhat:d8a183801935 (172.16.30.220) on iqn.dzfs01:coronado LU19 (172.16.30.221:3260,1), ISID=23d0f0000, TSIH=295, CID=0, HeaderDigest=off, DataDigest=off
Jan  8 00:06:59 dzfs01 istgt[1211]: istgt_iscsi.c: 777:istgt_iscsi_write_pdu_internal: ***ERROR*** iscsi_write() failed (errno=32)
Jan  8 00:06:59 dzfs01 istgt[1211]: istgt_iscsi.c:4984:sender: ***ERROR*** iscsi_write_pdu() failed on iqn.dzfs01:coronado,t,0x0001(iqn.1994-05.com.redhat:d8a183801935,i,0x00023d0f0000)
0
Gribble
Asked:
Gribble
  • 2
  • 2
1 Solution
 
gheistCommented:
According to errno.h -1 means insufficient permission.
-32 broken pipe (quite logical once permission is denied)
0
 
GribbleAuthor Commented:
We resolved the problem.  The network switch was having a problem.  We replaced the switch and all issues have been resolved.  Thanks for you reply.
0
 
gheistCommented:
Just take care to monitor the new switch o it never again comes to data loss.
0
 
GribbleAuthor Commented:
Resolved the issue by replacing the network switch
0
Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.

Join & Write a Comment

Featured Post

Improve Your Query Performance Tuning

In this FREE six-day email course, you'll learn from Janis Griffin, Database Performance Evangelist. She'll teach 12 steps that you can use to optimize your queries as much as possible and see measurable results in your work. Get started today!

  • 2
  • 2
Tackle projects and never again get stuck behind a technical roadblock.
Join Now