[2078] in linux-scsi channel archive

home help back first fref pref prev next nref lref last post

2940 and known bad sectors

daemon@ATHENA.MIT.EDU (Ed Kujawski)
Sun Jun 29 13:09:22 1997

Date: 	Sun, 29 Jun 1997 12:04:05 -0500 (CDT)
From: Ed Kujawski <kujawski@etsam2.eng.uab.edu>
Reply-To: Ed Kujawski <kujawski@etsam2.eng.uab.edu>
To: linux-scsi@vger.rutgers.edu

-----BEGIN PGP SIGNED MESSAGE-----

first, this will be long, so in a few words: i had a 2940 connected to
two disks, everything works fine. i added an old 1G disk with known bad
sectors, and the kernel stalls in a timeout, and i'm unable to really use
the disk.

1) my setup: normal computer really, its purpose in life is to be a
file/web/print server. the 2940uw is connected to a quantum atlas on the
wide scsi internal bus. a conner 1G (this is the new one) and a micropolus
are on the narrow internal scsi bus. the scsi chain looks like: quantum,
2940, conner, micropolus. the quantum and micropolus are terminated,

2) operation without conner: the computer had no errors before the
installation of the conner (of course). and now that the conner is in, if
its not mounted, or if its mounted but not used, the computer operates
properly.

3) point of failure: under dos there were bad sectors, so i reformated the
disk under ext2 with bad sector check: mke2fs -v -t /dev/sdc1 this caused
all sourts of delays in the kernel. i was confused so i did a surface scan
in the scsi bios. the scan found many sectors errors and i asked to remap
the bad sectors. i then tried the format command for ext2 again, but i got
the same errors and kernel delays.

4) kernel patches: i patched the 2.0.30 kernel to pre-2.0.31-2 and tried
to reformat without any sucess, i tried to patch the kernel with the new
ext2, but it seems that my directory structure is different, so i'm not
sure if the patch was sucessfull or not, but i did upgrade to the
ext2tools 1.1.

5) error messages: here is the message from the syslogd 

Jun 27 16:26:34 bmewww kernel: scsi0: MEDIUM ERROR on channel 0, id 2, lun 0, CDB: 0x0a 15 40 30 f4 00 
Jun 27 16:26:34 bmewww kernel: Current error sd08:21: sns = f0  3
Jun 27 16:26:34 bmewww kernel: ASC=10 ASCQ= 0
Jun 27 16:26:34 bmewww kernel: Raw sense data:0xf0 0x00 0x03 0x00 0x15 0x40 0x7d 0x0c 0x06 0x7d 0x00 0x36 0x10 0x00 0x15 0x00 
Jun 27 16:26:34 bmewww kernel: scsidisk I/O error: dev 08:21, sector 1392732
Jun 27 16:26:35 bmewww kernel: scsi0: MEDIUM ERROR on channel 0, id 2, lun 0, CDB: 0x0a 15 40 7e a6 00 
Jun 27 16:26:35 bmewww kernel: Current error sd08:21: sns = f0  3
Jun 27 16:26:35 bmewww kernel: ASC=10 ASCQ= 0
Jun 27 16:26:35 bmewww kernel: Raw sense data:0xf0 0x00 0x03 0x00 0x15 0x40 0xc4 0x0c 0x06 0x7e 0x00 0x24 0x10 0x00 0x15 0x00 
Jun 27 16:26:35 bmewww kernel: scsidisk I/O error: dev 08:21, sector 1392804
Jun 27 16:47:54 bmewww kernel: scsi0: MEDIUM ERROR on channel 0, id 2, lun 0, CDB: 0x0a 15 34 da f4 00 
Jun 27 16:47:54 bmewww kernel: Current error sd08:21: sns = f0  3
Jun 27 16:47:54 bmewww kernel: ASC=10 ASCQ= 0
Jun 27 16:47:54 bmewww kernel: Raw sense data:0xf0 0x00 0x03 0x00 0x15 0x35 0xbf 0x0c 0x06 0x7a 0x00 0x3f 0x10 0x00 0x15 0x00 
Jun 27 16:47:54 bmewww kernel: scsidisk I/O error: dev 08:21, sector 1389982
Jun 27 16:47:55 bmewww kernel: scsi0: MEDIUM ERROR on channel 0, id 2, lun 0, CDB: 0x0a 15 3a 92 f4 00 
Jun 27 16:47:55 bmewww kernel: Current error sd08:21: sns = f0  3
Jun 27 16:47:55 bmewww kernel: ASC=10 ASCQ= 0
Jun 27 16:47:55 bmewww kernel: Raw sense data:0xf0 0x00 0x03 0x00 0x15 0x3a 0xfb 0x0c 0x06 0x7b 0x00 0x44 0x10 0x00 0x15 0x00 
Jun 27 16:47:55 bmewww kernel: scsidisk I/O error: dev 08:21, sector 1391322
Jun 27 16:47:56 bmewww kernel: scsi0: MEDIUM ERROR on channel 0, id 2, lun 0, CDB: 0x0a 15 3a fc 8a 00 
Jun 27 16:47:56 bmewww kernel: Current error sd08:21: sns = f0  3
Jun 27 16:47:56 bmewww kernel: ASC=10 ASCQ= 0
Jun 27 16:47:56 bmewww kernel: Raw sense data:0xf0 0x00 0x03 0x00 0x15 0x3b 0x42 0x0c 0x06 0x7c 0x00 0x32 0x10 0x00 0x15 0x00 
Jun 27 16:47:56 bmewww kernel: scsidisk I/O error: dev 08:21, sector 1391394
Jun 27 17:00:51 bmewww kernel: scsi0: Target 2 underflow - Wanted at least 35840, got 28160, residual SG count 2.
Jun 27 17:00:54 bmewww kernel: scsi0: Target 2 underflow - Wanted at least 35840, got 28160, residual SG count 2.
Jun 27 17:00:54 bmewww kernel: scsi0 channel 0 : resetting for second half of retries.
Jun 27 17:00:54 bmewww kernel: SCSI bus is being reset for host 0 channel 0.
Jun 27 17:00:54 bmewww kernel: aic7xxx: (reset) target/channel 2/0
Jun 27 17:00:54 bmewww kernel: aic7xxx: (reset_device) target/channel -1/A, active_scb 4
Jun 27 17:00:54 bmewww kernel: aic7xxx: (match_scb) comparing target/channel -1/A to scb 0/A
Jun 27 17:00:54 bmewww kernel: aic7xxx: (match_scb) comparing target/channel -1/A to scb 2/A
Jun 27 17:00:54 bmewww kernel: aic7xxx: (reset_channel) Resetting current channel A
Jun 27 17:00:54 bmewww kernel: aic7xxx: (reset_channel) Channel reset, sequencer restarted
Jun 27 17:00:54 bmewww kernel: aic7xxx: (done_aborted_scbs) Aborting scb 1, TCL=0/0/0
Jun 27 17:00:54 bmewww kernel: aic7xxx: (done_aborted_scbs) Aborting scb 4, TCL=2/0/0
Jun 27 17:00:54 bmewww kernel: scsi0: Received MSG_WDTR, Target 0, channel A needwdtr(0xfffb).
Jun 27 17:00:54 bmewww kernel: scsi0: Target 0, channel A, using 16 bit transfers.
Jun 27 17:00:54 bmewww kernel: scsi0: Target 0, channel A, now synchronous at 10.0MHz, offset 8.
Jun 27 17:01:12 bmewww kernel: scsi0: Target 2 underflow - Wanted at least 35840, got 28160, residual SG count 2.
Jun 27 17:01:13 bmewww last message repeated 2 times
Jun 27 17:01:13 bmewww kernel: SCSI disk error : host 0 channel 0 id 2 lun 0 return code = 27070002
Jun 27 17:01:13 bmewww kernel: scsidisk I/O error: dev 08:21, sector 1235340
Jun 27 17:01:14 bmewww kernel: scsi0: Target 2 underflow - Wanted at least 34816, got 27136, residual SG count 2.
Jun 27 17:01:17 bmewww last message repeated 4 times
Jun 27 17:01:17 bmewww kernel: SCSI disk error : host 0 channel 0 id 2 lun 0 return code = 27070002
Jun 27 17:01:17 bmewww kernel: scsidisk I/O error: dev 08:21, sector 1235342
Jun 27 17:01:18 bmewww kernel: scsi0: Target 2 underflow - Wanted at least 33792, got 26112, residual SG count 2.
Jun 27 17:01:19 bmewww kernel: scsi0: Target 2 underflow - Wanted at least 33792, got 26112, residual SG count 2.
Jun 27 17:01:20 bmewww init: Switching to runlevel: 6
Jun 27 17:01:20 bmewww kernel: scsi0: Target 2 underflow - Wanted at least 33792, got 26112, residual SG count 2.
Jun 27 17:01:21 bmewww last message repeated 2 times
Jun 27 17:01:21 bmewww kernel: SCSI disk error : host 0 channel 0 id 2 lun 0 return code = 27070002
Jun 27 17:01:21 bmewww kernel: scsidisk I/O error: dev 08:21, sector 1235344
Jun 27 17:01:22 bmewww kernel: scsi0: Target 2 underflow - Wanted at least 32768, got 25088, residual SG count 2.
Jun 27 17:01:23 bmewww kernel: scsi0: Target 2 underflow - Wanted at least 32768, got 25088, residual SG count 2.
Jun 27 17:01:24 bmewww kernel: scsi0: MEDIUM ERROR on channel 0, id 2, lun 0, CDB: 0x08 12 d9 b2 40 00 
Jun 27 17:01:24 bmewww kernel: Current error sd08:21: sns = f0  3
Jun 27 17:01:24 bmewww kernel: ASC=11 ASCQ= 0
Jun 27 17:01:24 bmewww kernel: Raw sense data:0xf0 0x00 0x03 0x00 0x12 0xd9 0xb2 0x0c 0x05 0xa8 0x07 0x59 0x11 0x00 0x15 0x00 
Jun 27 17:01:24 bmewww kernel: scsidisk I/O error: dev 08:21, sector 1235346
Jun 27 17:03:34 bmewww syslogd 1.3-3: restart.


in case you interested, the boot log:

Jun 27 17:03:35 bmewww kernel: aic7xxx: BurstLen = 0 DWDs, Latency Timer = 32 PCLKS
Jun 27 17:03:35 bmewww kernel: aic7xxx: AHA-2940 Ultra Rev B.
Jun 27 17:03:35 bmewww kernel: aic7xxx: devconfig = 0x1580.
Jun 27 17:03:35 bmewww kernel: aic7xxx: Reading SEEPROM...done.
Jun 27 17:03:35 bmewww kernel: aic7xxx: Enabling support for Ultra SCSI speed.
Jun 27 17:03:35 bmewww kernel: aic7xxx: Extended translation enabled.
Jun 27 17:03:35 bmewww kernel: aic7xxx: Memory check yields 16 SCBs, paging not enabled.
Jun 27 17:03:35 bmewww kernel: aic7xxx: Enabling wide channel of AHA-2940 Ultra-Wide.
Jun 27 17:03:35 bmewww kernel: AHA-2940 Ultra-WIDE (PCI-bus), I/O 0x6100, Mem 0xf2000000:
Jun 27 17:03:35 bmewww kernel:     irq 10
Jun 27 17:03:35 bmewww kernel:     bus release time 40 bclks
Jun 27 17:03:35 bmewww kernel:     data fifo threshold 100
Jun 27 17:03:35 bmewww kernel:     SCSI CHANNEL A:
Jun 27 17:03:35 bmewww kernel:         scsi id 7
Jun 27 17:03:35 bmewww kernel:         scsi selection timeout 256 ms
Jun 27 17:03:35 bmewww kernel:         scsi bus reset at power-on enabled
Jun 27 17:03:35 bmewww kernel:         scsi bus parity enabled
Jun 27 17:03:35 bmewww kernel:         scsi bus termination (low byte) disabled
Jun 27 17:03:35 bmewww kernel:         scsi bus termination (high byte) enabled
Jun 27 17:03:35 bmewww kernel: aic7xxx: Downloading sequencer code...done.
Jun 27 17:03:35 bmewww kernel: aic7xxx: Resetting the SCSI bus...done.
Jun 27 17:03:35 bmewww kernel: scsi0 : Adaptec AHA274x/284x/294x (EISA/VLB/PCI-Fast SCSI) 4.0/3.2/4.0
Jun 27 17:03:35 bmewww kernel: scsi : 1 host.
Jun 27 17:03:35 bmewww kernel: scsi0: Scanning channel A for devices.
Jun 27 17:03:35 bmewww kernel: scsi0: Received MSG_WDTR, Target 0, channel A needwdtr(0xfffb).
Jun 27 17:03:35 bmewww kernel: scsi0: Target 0, channel A, using 16 bit transfers.
Jun 27 17:03:35 bmewww kernel: scsi0: Target 0, channel A, now synchronous at 10.0MHz, offset 8.
Jun 27 17:03:35 bmewww kernel:   Vendor: Quantum   Model: XP34300W          Rev: 81HB
Jun 27 17:03:35 bmewww kernel:   Type:   Direct-Access                      ANSI SCSI revision: 02
Jun 27 17:03:35 bmewww kernel: Detected scsi disk sda at scsi0, channel 0, id 0, lun 0
Jun 27 17:03:35 bmewww kernel: scsi0: Target 1, channel A, now synchronous at 10.0MHz, offset 15.
Jun 27 17:03:35 bmewww kernel:   Vendor: MICROP    Model: 1936-21MW1002004  Rev: HW0A
Jun 27 17:03:35 bmewww kernel:   Type:   Direct-Access                      ANSI SCSI revision: 02
Jun 27 17:03:35 bmewww kernel: Detected scsi disk sdb at scsi0, channel 0, id 1, lun 0
Jun 27 17:03:35 bmewww kernel:   Vendor: CONNER    Model: CFP1060S 1.05GB   Rev: 2135
Jun 27 17:03:35 bmewww kernel:   Type:   Direct-Access                      ANSI SCSI revision: 02
Jun 27 17:03:35 bmewww kernel: Detected scsi disk sdc at scsi0, channel 0, id 2, lun 0
Jun 27 17:03:35 bmewww kernel: scsi : detected 3 SCSI disks total.
Jun 27 17:03:35 bmewww kernel: SCSI device sda: hdwr sector= 512 bytes. Sectors= 8399520 [4101 MB] [4.1 GB]
Jun 27 17:03:35 bmewww kernel: SCSI device sdb: hdwr sector= 512 bytes. Sectors= 5903825 [2882 MB] [2.9 GB]
Jun 27 17:03:35 bmewww kernel: SCSI device sdc: hdwr sector= 512 bytes. Sectors= 2074880 [1013 MB] [1.0 GB]
Jun 27 17:03:35 bmewww kernel:   Overriding PCI latency timer (CFLT) setting of 32, new value is 255.


6) so is there any hope? is this expected from bad sectors? should i do
more with the bios? did i apply the ext2 bad sector fix right? are there
more patches i've missed?

ed kujawski

-----BEGIN PGP SIGNATURE-----
Version: 2.6.3a
Charset: noconv
Comment: PGP Signed with PineSign 2.0

iQCVAwUBM7YFIjMSOJR28/+BAQGFvgQAiZG3SOykOlY/N51aqnTOWFUd7Gepcbr0
rIP5pAFJM3kI+SFUM2tr5OBJ6cSuFtOkPbytgoLX9Mbfhbq+f3QCPXgG9HIKYZxW
adiYml9lEqHabz7oU5HJT3sG6gZhukhVgqGZ0tOXFCignkeCV/9lpKZR7HAfLEBo
CunJlWwHs8M=
=PZcS
-----END PGP SIGNATURE-----
 



home help back first fref pref prev next nref lref last post