• Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 175
  • Last Modified:

I have a zfs storage box and two xen servers that are having errors

a recent network interruption has now caused errors with my xen servers connection to an ifs share over iSCSI.

If I reboot the Xen server then everything comes backup, but within 20 mins the /var directory on the linux containers goes into read only because of journal errors

Here is an out put of the errors from the zfs storage box

Jan  8 00:06:42 dzfs01 istgt[1211]: Login from iqn.1994-05.com.redhat:d8a183801935 (172.16.30.220) on iqn.dzfs01:alameda LU18 (172.16.30.221:3260,1), ISID=23d0a0000, TSIH=299, CID=0, HeaderDigest=off, DataDigest=off
Jan  8 00:06:43 dzfs01 istgt[1211]: Login from iqn.1994-05.com.redhat:d8a183801935 (172.16.30.220) on iqn.dzfs01:malden LU13 (172.16.30.221:3260,1), ISID=23d100000, TSIH=286, CID=0, HeaderDigest=off, DataDigest=off
Jan  8 00:06:44 dzfs01 istgt[1211]: istgt.c:1411:istgt_acceptor: ***ERROR*** accept error: -1
Jan  8 00:06:44 dzfs01 istgt[1211]: Login from iqn.1994-05.com.redhat:d8a183801935 (172.16.30.220) on iqn.dzfs01:dogeared01 LU14 (172.16.30.221:3260,1), ISID=23d010000, TSIH=297, CID=0, HeaderDigest=off, DataDigest=off
Jan  8 00:06:44 dzfs01 istgt[1211]: istgt.c:1411:istgt_acceptor: ***ERROR*** accept error: -1
Jan  8 00:06:44 dzfs01 istgt[1211]: istgt.c:1411:istgt_acceptor: ***ERROR*** accept error: -1

Jan  8 00:06:45 dzfs01 istgt[1211]: Login from iqn.1994-05.com.redhat:d8a183801935 (172.16.30.220) on iqn.dzfs01:timor LU5 (172.16.30.221:3260,1), ISID=23d050000, TSIH=285, CID=0, HeaderDigest=off, DataDigest=off
Jan  8 00:06:46 dzfs01 istgt[1211]: Login from iqn.1994-05.com.redhat:d8a183801935 (172.16.30.220) on iqn.dzfs01:gargoyle LU2 (172.16.30.221:3260,1), ISID=23d060000, TSIH=293, CID=0, HeaderDigest=off, DataDigest=off
Jan  8 00:06:58 dzfs01 istgt[1211]: istgt.c:1411:istgt_acceptor: ***ERROR*** accept error: -1
Jan  8 00:06:59 dzfs01 last message repeated 5 times
Jan  8 00:06:59 dzfs01 istgt[1211]: Login from iqn.1994-05.com.redhat:d8a183801935 (172.16.30.220) on iqn.dzfs01:coronado LU19 (172.16.30.221:3260,1), ISID=23d0f0000, TSIH=295, CID=0, HeaderDigest=off, DataDigest=off
Jan  8 00:06:59 dzfs01 istgt[1211]: istgt_iscsi.c: 777:istgt_iscsi_write_pdu_internal: ***ERROR*** iscsi_write() failed (errno=32)
Jan  8 00:06:59 dzfs01 istgt[1211]: istgt_iscsi.c:4984:sender: ***ERROR*** iscsi_write_pdu() failed on iqn.dzfs01:coronado,t,0x0001(iqn.1994-05.com.redhat:d8a183801935,i,0x00023d0f0000)
0
Gribble
Asked:
Gribble
  • 2
  • 2
1 Solution
 
gheistCommented:
According to errno.h -1 means insufficient permission.
-32 broken pipe (quite logical once permission is denied)
0
 
GribbleAuthor Commented:
We resolved the problem.  The network switch was having a problem.  We replaced the switch and all issues have been resolved.  Thanks for you reply.
0
 
gheistCommented:
Just take care to monitor the new switch o it never again comes to data loss.
0
 
GribbleAuthor Commented:
Resolved the issue by replacing the network switch
0

Featured Post

Prep for the ITIL® Foundation Certification Exam

December’s Course of the Month is now available! Enroll to learn ITIL® Foundation best practices for delivering IT services effectively and efficiently.

  • 2
  • 2
Tackle projects and never again get stuck behind a technical roadblock.
Join Now