[2357] in linux-scsi channel archive

Repeated system crash. Long. Help badly needed.

daemon@ATHENA.MIT.EDU (Boris Tobotras)
Wed Aug 27 08:14:39 1997

To: linux-scsi@vger.rutgers.edu
Date: 	Wed, 27 Aug 1997 16:04:19 +0400
From: Boris Tobotras <boris@xtalk.msk.su>

	The only solution I can see now is to install Solaris :( I really
don't want to, but this is our production server. Can anybody advise
what the cause might be?

	I've built two servers with identical hardware:

	PPro-200 w/256k cache, 128M ram, AHA2940UW, Quantum Atlas II, S3 
Trio 64/V+, 3C905.

	One of the servers runs Linux. It crashes every few hours: the
system simply hangs, with no messages in the log (syslog is told to log
*.debug) or on the console. The last time it hung, though, the console was
full of scrolling messages:

	scsi0: queue full

	I have tried swapping every piece of hardware with the second
server, so I'm pretty sure it is not a hardware fault. Below is as much
information as I could think of. Any ideas, please?

	Kernel 2.0.29 (was 2.0.30, downgraded just in case), Boomerang (3c59x)
driver v0.42o.

	Here's /proc/scsi/scsi:

Attached devices: 
Host: scsi0 Channel: 00 Id: 06 Lun: 00
  Vendor: QUANTUM  Model: XP32275W         Rev: LXY4
  Type:   Direct-Access                    ANSI SCSI revision: 02

	Here's /proc/scsi/aic7xxx/0:

Adaptec AIC7xxx driver version: 4.0/3.2/4.0

Compile Options:
  AIC7XXX_RESET_DELAY    : 15
  AIC7XXX_CMDS_PER_LUN   : 8
  AIC7XXX_TWIN_SUPPORT   : Enabled
  AIC7XXX_TAGGED_QUEUEING: Enabled
  AIC7XXX_PAGE_ENABLE    : Enabled
  AIC7XXX_PROC_STATS     : Enabled

Adapter Configuration:
          SCSI Adapter: AHA-2940 Ultra
                        (AIC-788x chipset)
              Host Bus: Wide
               Base IO: 0xec00
                   IRQ: 10
                  SCBs: Used 8, HW 16, Page 255
            Interrupts: 4998
         Serial EEPROM: True
  Extended Translation: Enabled
        SCSI Bus Reset: Enabled
            Ultra SCSI: Enabled
     Target Disconnect: Enabled

Statistics:
CHAN#A (TGT 6 LUN 0):
nxfers 4966 (2452 read;2514 written)
blks(512) rd=18075; blks(512) wr=7356
        < 512 512-1K   1-2K   2-4K   4-8K  8-16K 16-32K 32-64K 64-128K >128K
 Reads:     3      1   1264     96    542    527      7     10      2      0
Writes:     0      0   2263    221     16      6      1      0      7      0
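
	As a quick sanity check on the statistics above, here is a minimal
Python sketch that sums the two histogram rows and compares them with the
"nxfers" line. It is not part of the aic7xxx driver or of any existing tool;
the function name check_histogram is made up for illustration, and the input
is simply the text pasted from the output above.

```python
import re

# Text copied from the /proc/scsi/aic7xxx/0 statistics above.
STATS = """\
nxfers 4966 (2452 read;2514 written)
        < 512 512-1K   1-2K   2-4K   4-8K  8-16K 16-32K 32-64K 64-128K >128K
 Reads:     3      1   1264     96    542    527      7     10      2      0
Writes:     0      0   2263    221     16      6      1      0      7      0
"""


def check_histogram(text):
    """Return (reads_ok, writes_ok, total_ok) for one per-target block.

    Hypothetical helper: compares the sum of each histogram row with the
    read/written counts reported on the "nxfers" line.
    """
    m = re.search(r"nxfers (\d+) \((\d+) read;(\d+) written\)", text)
    total, n_read, n_written = (int(g) for g in m.groups())

    rows = {}
    for line in text.splitlines():
        label, sep, rest = line.partition(":")
        # Only the "Reads:" and "Writes:" rows carry a colon-terminated label.
        if sep and label.strip() in ("Reads", "Writes"):
            rows[label.strip()] = sum(int(x) for x in rest.split())

    return (
        rows["Reads"] == n_read,
        rows["Writes"] == n_written,
        n_read + n_written == total,
    )


print(check_histogram(STATS))
```

	For the numbers above all three checks hold, so the per-size
counters are at least consistent with the transfer totals at the time of
the snapshot.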

	Here's dmesg:

Intel MultiProcessor Specification v1.1
    Virtual Wire compatibility mode.
OEM ID: INTEL    Product ID: 440FX        APIC at: 0xFEE00000
Processor #0 Pentium(tm) Pro APIC version 17
I/O APIC #2 Version 17 at 0xFEC00000.
Processors: 1
Console: 16 point font, 400 scans
Console: colour VGA+ 80x25, 1 virtual console (max 63)
pcibios_init : BIOS32 Service Directory structure at 0x000fdb70
pcibios_init : BIOS32 Service Directory entry at 0xfdb80
pcibios_init : PCI BIOS revision 2.10 entry at 0xfdba1
Probing PCI hardware.
Warning : Unknown PCI device (10b7:9050).  Please read include/linux/pci.h 
Calibrating delay loop.. ok - 199.07 BogoMIPS
Memory: 127952k/131072k available (660k kernel code, 384k reserved, 2076k data)
Swansea University Computer Society NET3.035 for Linux 2.0
NET3: Unix domain sockets 0.13 for Linux NET3.035.
Swansea University Computer Society TCP/IP for NET3.034
IP Protocols: ICMP, UDP, TCP
VFS: Diskquotas version dquot_5.6.0 initialized
Checking 386/387 coupling... Ok, fpu using exception 16 error reporting.
Checking 'hlt' instruction... Ok.
Linux version 2.0.29 (root@goliath.jet.msk.su) (gcc version 2.7.2.1) #4 Tue Aug 26 17:49:08 MSD 1997
Error: only one processor found.
Serial driver version 4.13 with no serial options enabled
tty00 at 0x03f8 (irq = 4) is a 16550A
tty01 at 0x02f8 (irq = 3) is a 16550A
Real Time Clock Driver v1.07
ide: i82371 PIIX (Triton) on PCI bus 0 function 57
    ide0: BM-DMA at 0xffa0-0xffa7
    ide1: BM-DMA at 0xffa8-0xffaf
hda: Pioneer CD-ROM ATAPI Model DR-A12X 0100, ATAPI CDROM drive
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
md driver 0.35 MAX_MD_DEV=4, MAX_REAL=8
aic7xxx: BurstLen = 8 DWDs, Latency Timer = 64 PCLKS
aic7xxx: AHA-2940 Ultra Rev B.
aic7xxx: devconfig = 0x1580.
aic7xxx: Reading SEEPROM...done.
aic7xxx: Enabling support for Ultra SCSI speed.
aic7xxx: Extended translation enabled.
aic7xxx: Memory check yields 16 SCBs, 255 page-enabled SCBs.
aic7xxx: Enabling wide channel of AHA-2940 Ultra-Wide.
AHA-2940 Ultra-WIDE (PCI-bus), I/O 0xec00, Mem 0xfebff000:
    irq 10
    bus release time 40 bclks
    data fifo threshold 100%
    SCSI CHANNEL A:
        scsi id 7
        scsi selection timeout 256 ms
        scsi bus reset at power-on enabled
        scsi bus parity enabled
        scsi bus termination (low byte) enabled
        scsi bus termination (high byte) enabled
aic7xxx: Downloading sequencer code...done.
aic7xxx: Resetting the SCSI bus...done.
scsi0 : Adaptec AHA274x/284x/294x (EISA/VLB/PCI-Fast SCSI) 4.0/3.2/4.0
scsi : 1 host.
scsi0: Scanning channel A for devices.
Started kswapd v 1.4.2.2
scsi0: Received MSG_WDTR, Target 6, channel A needwdtr(0xfffd).
scsi0: Target 6, channel A, using 16 bit transfers.
scsi0: Target 6, channel A, now synchronous at 20.0MHz, offset 8.
  Vendor: QUANTUM   Model: XP32275W          Rev: LXY4
  Type:   Direct-Access                      ANSI SCSI revision: 02
Detected scsi disk sda at scsi0, channel 0, id 6, lun 0
scsi0: Enabled tagged queuing for target 6, channel 0, LUN 0, queue depth 8.
scsi : detected 1 SCSI disk total.
SCSI device sda: hdwr sector= 512 bytes. Sectors= 4445380 [2170 MB] [2.2 GB]
Partition check:
 sda: sda1 sda2
VFS: Mounted root (ext2 filesystem) readonly.
Adding Swap: 102396k swap-space

	Here's "scsiinfo -a /dev/sda":

Inquiry command
---------------
Relative Address                   0
Wide bus 32                        0
Wide bus 16                        1
Synchronous neg.                   1
Linked Commands                    1
Command Queueing                   1
SftRe                              0
Device Type                        0
Peripheral Qualifier               0
Removable?                         0
Device Type Modifier               0
ISO Version                        0
ECMA Version                       0
ANSI Version                       2
AENC                               0
TrmIOP                             0
Response Data Format               2
Vendor:                    QUANTUM 
Product:                   XP32275W        
Revision level:            LXY4182706552275

Data from Rigid Disk Drive Geometry Page
----------------------------------------
Number of cylinders                5899
Number of heads                    5
Starting write precomp             5899
Starting reduced current           0
Drive step rate                    0
Landing Zone Cylinder              5900
RPL                                0
Rotational Offset                  0
Rotational Rate                    7200

Data from Caching Page
----------------------
Write Cache                        1
Read Cache                         1
Prefetch units                     0
Demand Read Retention Priority     0
Demand Write Retention Priority    0
Disable Pre-fetch Transfer Length  65535
Minimum Pre-fetch                  0
Maximum Pre-fetch                  357
Maximum Pre-fetch Ceiling          357

Data from Format Device Page
----------------------------
Removable Medium                   0
Supports Hard Sectoring            1
Supports Soft Sectoring            0
Addresses assigned by surface      0
Tracks per Zone                    5
Alternate sectors per zone         5
Alternate tracks per zone          0
Alternate tracks per lun           0
Sectors per track                  152
Bytes per sector                   512
Interleave                         1
Track skew factor                  9
Cylinder skew factor               54

Data from Error Recovery Page
-----------------------------
AWRE                               1
ARRE                               1
TB                                 0
RC                                 0
EER                                0
PER                                0
DTE                                0
DCR                                0
Read Retry Count                   8
Correction Span                    80
Head Offset Count                  0
Data Strobe Offset Count           0
Write Retry Count                  8
Recovery Time Limit                0

Data from Control Page
----------------------
RLEC                               0
QErr                               0
DQue                               0
EECA                               0
RAENP                              0
UUAENP                             0
EAENP                              0
Queue Algorithm Modifier           1
Ready AEN Holdoff Period           0

Data from Disconnect-Reconnect Page
-----------------------------------
Buffer full ratio                  0
Buffer empty ratio                 0
Bus Inactivity Limit               0
Disconnect Time Limit              0
Connect Time Limit                 0
Maximum Burst Size                 0
DTDC                               0x0

Data from Defect Lists
----------------------
21 entries in manufacturer table.

	[skipped]

0 entries in grown table.

Data from Notch Parameters Page
-------------------------------
Notched Drive                      1
Logical or Physical Notch          0
Max # of notches                   16
Active Notch                       0
Starting Boundary                  0x2c00
Ending Boundary                    0x16df04
Pages Notched                      00000000 00001008

Data from Verify Error Recovery Page
------------------------------------
EER                                0
PER                                0
DTE                                0
DCR                                0
Verify Retry Count                 8
Verify Correction Span (bits)      80
Verify Recovery Time Limit (ms)    0

	Here's /proc/pci:

PCI devices found:
  Bus  0, device  19, function  0:
    SCSI storage controller: Adaptec AIC-7881U (rev 0).
      Medium devsel.  Fast back-to-back capable.  IRQ 10.  Master Capable.  Latency=64.  Min Gnt=8.Max Lat=8.
      I/O at 0xec00.
      Non-prefetchable 32 bit memory at 0xfebff000.
  Bus  0, device  18, function  0:
    Ethernet controller: 3Com Unknown device (rev 0).
      Vendor id=10b7. Device id=9050.
      Medium devsel.  IRQ 9.  Master Capable.  Latency=248.  Min Gnt=3.Max Lat=8.
      I/O at 0xef00.
  Bus  0, device  17, function  0:
    VGA compatible controller: S3 Inc. Trio32/Trio64 (rev 84).
      Medium devsel.  IRQ 11.  
      Non-prefetchable 32 bit memory at 0x80000000.
  Bus  0, device   7, function  1:
    IDE interface: Intel 82371SB Natoma/Triton II PIIX3 (rev 0).
      Medium devsel.  Fast back-to-back capable.  Master Capable.  Latency=32.  
      I/O at 0xffa0.
  Bus  0, device   7, function  0:
    ISA bridge: Intel 82371SB Natoma/Triton II PIIX3 (rev 1).
      Medium devsel.  Fast back-to-back capable.  Master Capable.  No bursts.  
  Bus  0, device   0, function  0:
    Host bridge: Intel 82441FX Natoma (rev 2).
      Medium devsel.  Fast back-to-back capable.  Master Capable.  Latency=32.  

	Any ideas what I can check?
-- 
	Best regards, -- Boris.


