[430] in linux-scsi channel archive
Problem with aic7xxx driver or hardware problem?
daemon@ATHENA.MIT.EDU (Farzad FARID)
Wed Jul 26 20:05:51 1995
Date: Wed, 26 Jul 1995 13:50:57 +0200 (GMT+0200)
From: Farzad FARID <farzy@publicis.fr>
To: linux-scsi <linux-scsi@vger.rutgers.edu>
I've got a Pentium with PCI bus and an Adaptec AHA-2940 PCI SCSI
adapter, it's running Linux 1.2.9 with the aic7xxx driver. An HP 1Gb hard
driver and a quad speed Pioneer CD-ROM are connected to the adapter.
During the last month we've had troubles with the adapter/hd. The first
2 problems just hung the machine for 10 secondes, left an ugly message in
/var/adm/messages but the last 2 problems did hang the machine completely.
Here is the content of the log file with some comments at the end, I'd
appreciate any help in order to know if this could be a driver problem/bug
or if this is a hardware problem.
Here is the driver version:
aic7xxx: aic7870 hcntrl=0x2
aha294x: extended translation disabled
AHA294X (PCI-bus):
irq 15
bus release time ??? bclks
data fifo threshold 100
SCSI CHANNEL A:
scsi id 7
scsi bus parity check disabled
scsi selection timeout 256 ms
scsi bus reset at power-on enabled
aic7xxx: Downloading sequencer code..done.
NeedSdtr = 0xff, 0xff
NeedWdtr = 0x0, 0x0
Resetting the SCSI bus...
scsi0 : Adaptec AHA274x/284x/294x (EISA/VL-bus/PCI -> Fast SCSI)
1.8/1.14/1.34
scsi : 1 host.
aic7xxx: Scanning channel A for devices.
aic7xxx: target 0 now synchronous at 10.0Mb/s
Vendor: HP Model: C3323-300 Rev: 4269
Type: Direct-Access ANSI SCSI revision: 02
Detected scsi disk sda at scsi0, id 0, lun 0
aic7xxx: target 2 now synchronous at 4.0Mb/s
Vendor: PIONEER Model: CD-ROM DR-104X Rev: 0022
Type: CD-ROM ANSI SCSI revision: 02
Detected scsi CD-ROM sr0 at scsi0, id 2, lun 0
scsi : detected 1 SCSI cdrom 1 SCSI disk total.
SCSI Hardware sector size is 512 bytes on device sda
And here are the error messages:
...
Jun 30 18:23:13 sphinx kernel: scsi : aborting command due to timeout :
pid 75438, scsi0, id 0, lun 0 Read (6) 0f b0 40 02 00
Jun 30 18:23:13 sphinx kernel: SCSI disk error : host 0 id 0 lun 0 return
code = 28000002
Jun 30 18:23:13 sphinx kernel: extra data not valid Current error sd803:
sense key Unit Attention
Jun 30 18:23:13 sphinx kernel: Additional sense indicates Power on,
reset, or bus device reset occurred
Jun 30 18:23:13 sphinx kernel: scsidisk I/O error: dev 0803, sector 2156
Jun 30 18:23:13 sphinx kernel: aic7xxx: target 0 now synchronous at 10.0Mb/s
Jun 30 18:23:13 sphinx kernel: offset=2048, inode=12, rec_len=4, name_len=0
Jun 30 18:23:28 sphinx last message repeated 17 times
...
...
Jul 13 18:25:48 sphinx kernel: scsi : aborting command due to timeout :
pid 378783, scsi0, id 0, lun 0 Read (6) 09 db f2 02 00
Jul 13 18:25:48 sphinx kernel: SCSI disk error : host 0 id 0 lun 0 return
code = 28000002
Jul 13 18:25:48 sphinx kernel: extra data not valid Current error sd802:
sense key Unit Attention
Jul 13 18:25:48 sphinx kernel: Additional sense indicates Power on,
reset, or bus device reset occurred
Jul 13 18:25:48 sphinx kernel: scsidisk I/O error: dev 0802, sector 147694
Jul 13 18:25:48 sphinx kernel: aic7xxx: target 0 now synchronous at 10.0Mb/s
...
...
Jul 24 12:22:53 sphinx kernel: scsi : aborting command due to timeout :
pid 16503, scsi0, id 0, lun 0 Read (6) 06 65 64 02 00
Jul 24 12:22:53 sphinx kernel: SCSI disk error : host 0 id 0 lun 0 return
code = 28000002
Jul 24 12:22:53 sphinx kernel: extra data not valid Current error sd802:
sense key Unit Attention
Jul 24 12:22:53 sphinx kernel: Additional sense indicates Power on,
reset, or bus device reset occurred
Jul 24 12:22:53 sphinx kernel: scsidisk I/O error: dev 0802, sector 9576
Jul 24 12:22:53 sphinx kernel: aic7xxx: target 0 now synchronous at 10.0Mb/s
...
...
Jul 24 18:00:15 sphinx kernel: scsi : aborting command due to timeout :
pid 64624, scsi0, id 0, lun 0 Read (6) 17 ec b2 0a 00
Jul 24 18:00:15 sphinx kernel: SCSI disk error : host 0 id 0 lun 0 return
code = 28000002
Jul 24 18:00:15 sphinx kernel: extra data not valid Current error sd803:
sense key Unit Attention
Jul 24 18:00:15 sphinx kernel: Additional sense indicates Power on,
reset, or bus device reset occurred
Jul 24 18:00:15 sphinx kernel: scsidisk I/O error: dev 0803, sector 1310
Jul 24 18:00:15 sphinx kernel: aic7xxx: target 0 now synchronous at 10.0Mb/s
...
We can see that each error occured at a different location (different
devices or different sectors).
All these message are related to the hard drive, but the last message I
got is really strange:
Jul 25 17:03:08 sphinx kernel: aic7xxx: target 2 now synchronous at 4.0Mb/s
Jul 25 17:03:08 sphinx kernel: CD-ROM I/O error: dev 0b00, sector 304
Target 2 is the CD-ROM, I wasn't using it when this message occured.. And
the machine had not been rebooted since the last scsi problem.
The last scsi hd problem occured when I was compiling a program, the
machine froze 10 secondes and then gcc stopped, saying there were errors
in a source file. I looked at the C file and there was indeed garbage in
it. That was yesterday, today I looked at the file again and it looks fine..
I am really puzzled. Do these error message tell me who between the
driver, the adapter and the hard driver is faulty?
Thanks for any help.
Farzad