[2300] in linux-scsi channel archive


ncr disables disk

daemon@ATHENA.MIT.EDU (Edwin Glaser)
Sun Aug 17 17:40:25 1997

Date: 	Sun, 17 Aug 1997 23:31:37 +0200
From: Edwin Glaser <glaser@inter-nett.de>
To: linux-scsi@vger.rutgers.edu

Hello,

From time to time the SCSI driver disables my disk. Sometimes it happens
3 times a day, then the problem disappears for weeks.

I have been trying to resolve it for 8 months and have no idea what to try
next.

The ncr53c8xx driver locks up without any message. The ncr53c{7,8}xx driver
writes several screens of output to the console, including "can't reconnect,
disabling target". A few times it managed to reconnect and saved a log to
the disk.

No problems with NT 4.0 ;-/

My configuration is:

- Linux 2.0.29 (S.u.S.E 4.4.1)
- scsi-ncr53c7,8xx 
  or ncr53c8xx
- No-name NCR card
  or Asus SC-200
- IBM DORS-32160
- Chaintech 5IFM 
- a Miro card with a Trio64

------------------------------------------------------------
 
the boot.log:

Loaded 3224 symbols from /usr/src/linux/System.map.
Symbols match kernel version.
klogd 1.3-0, log source = /proc/kmsg started.
<4>Console: 16 point font, 400 scans
<4>Console: colour VGA+ 80x25, 1 virtual console (max 63)
<4>pcibios_init : BIOS32 Service Directory structure at 0x000faf60
<4>pcibios_init : BIOS32 Service Directory entry at 0xfb400
<4>pcibios_init : PCI BIOS revision 2.10 entry at 0xfb430
<4>Probing PCI hardware.
<4>Calibrating delay loop.. ok - 39.73 BogoMIPS
<4>Memory: 31028k/32768k available (636k kernel code, 384k reserved, 720k data)
<6>Swansea University Computer Society NET3.035 for Linux 2.0
<6>NET3: Unix domain sockets 0.13 for Linux NET3.035.
<4>Swansea University Computer Society TCP/IP for NET3.034
<4>IP Protocols: ICMP, UDP, TCP
<4>Checking 386/387 coupling... Ok, fpu using exception 16 error reporting.
<4>Checking 'hlt' instruction... Ok.
<4>Linux version 2.0.29 (root@rio) (gcc version 2.7.2.1) #2 Thu Jun 5 08:59:26 MEST 1997
<6>Serial driver version 4.13 with no serial options enabled
<6>tty00 at 0x03f8 (irq = 4) is a 16550A
<6>tty01 at 0x02f8 (irq = 3) is a 16550A
<4>hda: CDA66801I, ATAPI CDROM drive
<4>ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
<6>Floppy drive(s): fd0 is 1.44M
<4>Started kswapd v 1.4.2.2
<6>FDC 0 is a post-1991 82077
<4>scsi-ncr53c7,8xx : at PCI bus 0, device 12,  function 0
<4>scsi-ncr53c7,8xx : warning : revision of 18 is greater than 2.
<4>scsi-ncr53c7,8xx : NCR53c810 at memory 0xf4000000, io 0x6000, irq 9
<4>scsi0 : burst length 8
<4>scsi0 : reset ccf to 3 from 0
<4>scsi0 : NCR code relocated to 0x88600 (virt 0x00088600)
<4>scsi0 : test 1 started
<4>scsi0 : NCR53c{7,8}xx (rel 17)
<4>scsi : 1 host.
<4>scsi0 : target 0 accepting asynchronous SCSI
<4>scsi0 : setting target 0 to asynchronous SCSI
<4>  Vendor: IBM       Model: DORS-32160        Rev: WA6A
<4>  Type:   Direct-Access                      ANSI SCSI revision: 02
<4>Detected scsi disk sda at scsi0, channel 0, id 0, lun 0
<4>scsi : detected 1 SCSI disk total.
<4>SCSI device sda: hdwr sector= 512 bytes. Sectors= 4226725 [2063 MB] [2.1 GB]
<4>Partition check:
<4> sda: sda1 sda2 < sda5 sda6 sda7 sda8 sda9 sda10 >
<4>VFS: Mounted root (ext2 filesystem) readonly.
<4>Adding Swap: 103912k swap-space
Kernel logging (proc) stopped.
Kernel log daemon terminating.        

------------------------------------------------------------

the saved log:

Jun  4 23:34:46 rio kernel: scsi0 : WARNING : INTFLY with no completed commands.
Jun  4 23:35:01 rio kernel: scsi : aborting command due to timeout : pid 5826, scsi0, channel 0, id 0, lun 0 Read (6) 1c 7d 3b 02 00
Jun  4 23:35:01 rio kernel: scsi0 : DANGER : command running, can not abort.
Jun  4 23:35:01 rio kernel: scsi : aborting command due to timeout : pid 5827, scsi0, channel 0, id 0, lun 0 Read (10) 00 00 20 23 e3 00 00 02 00
Jun  4 23:35:01 rio kernel: scsi0 : found command 5827 in Linux issue queue
Jun  4 23:35:17 rio kernel: scsi : aborting command due to timeout : pid 5826, scsi0, channel 0, id 0, lun 0 Read (6) 1c 7d 3b 02 00
Jun  4 23:35:17 rio kernel: scsi0 : DANGER : command running, can not abort.
Jun  4 23:35:17 rio kernel: SCSI host 0 abort (pid 5826) timed out - resetting
Jun  4 23:35:17 rio kernel: SCSI bus is being reset for host 0 channel 0.
Jun  4 23:35:17 rio kernel: scsi0 : DCMD|DBC=0x50000000, DNAD=0x88c5c (virt 0x00088c5c)
Jun  4 23:35:17 rio kernel:          DSA=0x821e8 (virt 0x000821e8)
Jun  4 23:35:17 rio kernel:          DSPS=0x88d6c, TEMP=0x888ec (virt 0x000888ec), DMODE=0x88
Jun  4 23:35:17 rio kernel:          SXFER=0x0, SCNTL3=0x3
Jun  4 23:35:17 rio kernel:          phase=MSGIN, 0 bytes in SCSI FIFO
Jun  4 23:35:17 rio kernel:          STEST0=0x7
Jun  4 23:35:17 rio kernel: scsi0 : DSP 0x88c54 (virt 0x00088c54) ->
Jun  4 23:35:17 rio kernel: 0x88c54 (virt 0x00088c54) : 0x50000000 0x00088d6c (virt 0x00088d6c)
Jun  4 23:35:17 rio kernel: 0x88c5c (virt 0x00088c5c) : 0x60000200 0x00000000 (virt 0x00000000)
Jun  4 23:35:17 rio kernel: 0x88c64 (virt 0x00088c64) : 0x78388800 0x00000000 (virt 0x00000000)
Jun  4 23:35:17 rio kernel: 0x88c6c (virt 0x00088c6c) : 0x0f000001 0x000885d8 (virt 0x000885d8)
Jun  4 23:35:17 rio kernel: 0x88c74 (virt 0x00088c74) : 0x78389800 0x00000000 (virt 0x00000000)
Jun  4 23:35:17 rio kernel: 0x88c7c (virt 0x00088c7c) : 0xc0000004 0x000885d4 (virt 0x000885d4) 0x00006034 (virt 0x00006034)
Jun  4 23:35:17 rio kernel: scsi0 : issue queue
Jun  4 23:35:17 rio kernel: scsi0 : dsa at phys 0x1e080 (virt 0x0001e080)
Jun  4 23:35:17 rio kernel:         + 64 : dsa_msgout length = 558564, data = 0x0 (virt 0x00000000)
Jun  4 23:35:17 rio kernel:         + 60 : select_indirect = 0xc0000004
Jun  4 23:35:17 rio kernel:         + 56 : dsa_cmnd = 0x0
Jun  4 23:35:17 rio kernel:         + 48 : dsa_next = 0x0
Jun  4 23:35:17 rio kernel: scsi0 : schedule dsa array :
Jun  4 23:35:17 rio kernel: scsi0 : end schedule dsa array
Jun  4 23:35:17 rio kernel: scsi0 : reconnect_dsa_head :
Jun  4 23:35:18 rio kernel: scsi0 : end reconnect_dsa_head
Jun  4 23:35:18 rio kernel: The sti() implicit in a printk() prevents hangs
Jun  4 23:35:18 rio kernel: scsi : aborting command due to timeout : pid 5827, scsi0, channel 0, id 0, lun 0 Read (10) 00 00 20 23 e3 00 00 02 00
Jun  4 23:35:18 rio kernel: scsi0 : did this command ever run?
Jun  4 23:35:19 rio kernel: scsi0 : target 0 accepting asynchronous SCSI
Jun  4 23:35:19 rio kernel: scsi0 : setting target 0 to asynchronous SCSI

Any hints are welcome. 
Thanks in advance.

-- 
Edwin Glaser -- <mailto:glaser@inter-nett.de>
