[2378] in linux-scsi channel archive

home help back first fref pref prev next nref lref last post

BIG-SCSI-Problems

daemon@ATHENA.MIT.EDU (Klaus Dombrofsky)
Fri Aug 29 07:18:05 1997

Date: 	Fri, 29 Aug 1997 06:41:59 -0400
From: Klaus Dombrofsky <KDombrofsky@compuserve.com>
Cc: "'linux-scsi@vger.rutgers.edu'" <linux-scsi@vger.rutgers.edu>
To: ;@unlisted-recipients (no To-header on input)

Hi there,
i have a very big problem with my linux-machines.

linux1:
P100
32 MB
Adaptec 2940UW with external ASS3000-Raid
Symbios 53C81xx with external CDROM, TAPE, JAZ
Kernel: 2.0.30

linux2
P200
128 MB
Adaptec 2940UW with external ASS3000-Raid
Symbios 53C81xx with internal CDROM, TAPE, JAZ
Kernel 2.0.30

Problem:
When i make a very frequent access on a device on the symbios then it comes
often to a scsi-reset and after this reset on the symbios-controller the
system hangs, but theres no scsi-reset on the adaptec with the raid and the
operating-system.
As long as i make no access on the symbios-controller the system works
well. But when i want 
to copy a lot of data onto the jaz or drive, then it cames to a scsi-reset
often, but not always.

Here is /var/log/messages

Aug 29 11:42:23 linux1 kernel: scsidisk I/O error: dev 08:11, sector 279056
Aug 29 11:42:26 linux1 kernel: SCSI disk error : host 1 channel 0 id 1 lun
0 return code = 28000002
Aug 29 11:42:26 linux1 kernel: Current error sd08:11: sns = f0  4
Aug 29 11:42:26 linux1 kernel: ASC=15 ASCQ=be
Aug 29 11:42:26 linux1 kernel: Raw sense data:0xf0 0x00 0x04 0x00 0x04 0x42
0x32 0x11 0x00 0x00 0x00 0x00 0x15 0xbe 0x00 0x00

"The upper messages comes very often"

Aug 29 11:42:26 linux1 kernel: scsidisk I/O error: dev 08:11, sector 279058
Aug 29 11:42:26 linux1 kernel: EXT2-fs error (device 08:11):
ext2_find_entry: bad entry in directory #34817: rec_len is too
 small for name_len - offset=0, inode=3974950124, rec_len=60652,
name_len=60652
Aug 29 11:42:26 linux1 kernel: EXT2-fs error (device 08:11): ext2_readdir:
bad entry in directory #34817: rec_len is too small for name_len -
offset=0, inode=3974950124, rec_len=60652, name_len=60652
Aug 29 11:42:26 linux1 kernel: EXT2-fs error (device 08:11):
ext2_add_entry: bad entry in directory #34817: rec_len is too small for
name_len - offset=0, inode=3974950124, rec_len=60652, name_len=60652
Aug 29 11:43:11 linux1 kernel: scsi : aborting command due to timeout : pid
14046618, scsi1, channel 0, id 1, lun 0 0x0a 00
 00 30 02 00
Aug 29 11:43:11 linux1 kernel: ncr53c8xx_abort: pid=14046618
serial_number=14046632 serial_number_at_timeout=14046632
Aug 29 11:43:11 linux1 kernel: ncr53c815-0: abort ccb=00296020 (skip)
Aug 29 11:43:14 linux1 kernel: SCSI host 1 abort (pid 14046618) timed out -
resetting
Aug 29 11:43:14 linux1 kernel: SCSI bus is being reset for host 1 channel
0.
Aug 29 11:43:14 linux1 kernel: ncr53c8xx_reset: pid=14046618 reset_flags=2
serial_number=14046632 serial_number_at_timeout=14046632
Aug 29 11:43:14 linux1 kernel: ncr53c815-0: restart (scsi reset).
Aug 29 11:43:14 linux1 kernel: ncr53c815-0-<1,0>: 5.0 MB/s (200 ns, offset
8)
Aug 29 11:43:30 linux1 login[1015]: ROOT LOGIN on `tty1'
Aug 29 11:43:33 linux1 kernel: SCSI host 1 abort (pid 14046618) timed out -
resetting
Aug 29 11:43:33 linux1 kernel: SCSI bus is being reset for host 1 channel
0.
Aug 29 11:43:33 linux1 kernel: ncr53c8xx_reset: pid=14046618 reset_flags=2
serial_number=14046651 serial_number_at_timeout=14046651
Aug 29 11:43:33 linux1 kernel: ncr53c815-0: restart (scsi reset).
Aug 29 11:43:33 linux1 kernel: ncr53c815-0-<1,0>: 5.0 MB/s (200 ns, offset
8)
Aug 29 11:43:51 linux1 kernel: SCSI host 1 abort (pid 14046618) timed out -
resetting
Aug 29 11:43:52 linux1 kernel: SCSI bus is being reset for host 1 channel
0.
Aug 29 11:43:52 linux1 kernel: ncr53c8xx_reset: pid=14046618 reset_flags=2
serial_number=14046812 serial_number_at_timeout=14046812
Aug 29 11:43:52 linux1 kernel: ncr53c815-0: restart (scsi reset).
Aug 29 11:43:52 linux1 kernel: ncr53c815-0-<1,0>: 5.0 MB/s (200 ns, offset
8)
Aug 29 11:44:11 linux1 kernel: SCSI host 1 abort (pid 14046618) timed out -
resetting
Aug 29 11:44:11 linux1 kernel: SCSI bus is being reset for host 1 channel
0.
Aug 29 11:44:11 linux1 kernel: ncr53c8xx_reset: pid=14046618 reset_flags=2
serial_number=14046840 serial_number_at_timeout=14046840
Aug 29 11:44:11 linux1 kernel: ncr53c815-0: restart (scsi reset).
Aug 29 11:44:11 linux1 kernel: ncr53c815-0-<1,0>: 5.0 MB/s (200 ns, offset
8)
Aug 29 12:13:31 linux1 syslogd 1.3-0: restart.

I already checked the cables and the terminators.

Is it a problem to disable DISCONNECT on the symbios ?

Why does the reset on symbios hang the whole system ?


An additional Problem on the linux2:
Sometimes, when there are many read/write-accesses on the adaptec with the
raid-system theres also a scsi-reset and then the ystem hangs, it must hang
because the OS is running on the raid.
But sometimes the raid-system is not recognized any more and the system
hangs without any kernel-messages. Is it a problem, when i have on a 2940UW
a SCSI-II-Raid ?


Klaus-Peter Dombrofsky
kdombrofsky@compuserve.com

###################
Don't let your friends boot NT
###################

home help back first fref pref prev next nref lref last post