[2357] in linux-scsi channel archive
Repeated system crash. Long. Help badly needed.
daemon@ATHENA.MIT.EDU (Boris Tobotras)
Wed Aug 27 08:14:39 1997
To: linux-scsi@vger.rutgers.edu
Date: Wed, 27 Aug 1997 16:04:19 +0400
From: Boris Tobotras <boris@xtalk.msk.su>
I see the only solution now: installation of Solaris :( Don't
want to, really, but it is our production server. Can anybody give an
advice, what can be a cause?
I've built two servers with identical hardware:
PPro-200 w/256k cache, 128M ram, AHA2940UW, Quantum Atlas II, S3
Trio 64/V+, 3C905.
One of servers is Linux one. It crashes every several hours. Just
system hang, with no messages in log (syslog is told to log *.debug) or
console. But last time it hang, console was full of scrolled messages
scsi0: queue full
I tried exchange every hardware with the second server, so I'm
pretty sure it is not hardware fault. Following is as much information as
I can imagine. Any ideas,
please?
Kernel 2.0.29 (was 2.0.30, downgraded just in case), Boomerang (3c59x)
driver v0.42o.
Here's /proc/scsi/scsi:
Attached devices:
Host: scsi0 Channel: 00 Id: 06 Lun: 00
Vendor: QUANTUM Model: XP32275W Rev: LXY4
Type: Direct-Access ANSI SCSI revision: 02
Here's /proc/scsi/aic7xxx/0:
Adaptec AIC7xxx driver version: 4.0/3.2/4.0
Compile Options:
AIC7XXX_RESET_DELAY : 15
AIC7XXX_CMDS_PER_LUN : 8
AIC7XXX_TWIN_SUPPORT : Enabled
AIC7XXX_TAGGED_QUEUEING: Enabled
AIC7XXX_PAGE_ENABLE : Enabled
AIC7XXX_PROC_STATS : Enabled
Adapter Configuration:
SCSI Adapter: AHA-2940 Ultra
(AIC-788x chipset)
Host Bus: Wide
Base IO: 0xec00
IRQ: 10
SCBs: Used 8, HW 16, Page 255
Interrupts: 4998
Serial EEPROM: True
Extended Translation: Enabled
SCSI Bus Reset: Enabled
Ultra SCSI: Enabled
Target Disconnect: Enabled
Statistics:
CHAN#A (TGT 6 LUN 0):
nxfers 4966 (2452 read;2514 written)
blks(512) rd=18075; blks(512) wr=7356
< 512 512-1K 1-2K 2-4K 4-8K 8-16K 16-32K 32-64K 64-128K >128K
Reads: 3 1 1264 96 542 527 7 10 2 0
Writes: 0 0 2263 221 16 6 1 0 7 0
Here's dmesg:
Intel MultiProcessor Specification v1.1
Virtual Wire compatibility mode.
OEM ID: INTEL Product ID: 440FX APIC at: 0xFEE00000
Processor #0 Pentium(tm) Pro APIC version 17
I/O APIC #2 Version 17 at 0xFEC00000.
Processors: 1
Console: 16 point font, 400 scans
Console: colour VGA+ 80x25, 1 virtual console (max 63)
pcibios_init : BIOS32 Service Directory structure at 0x000fdb70
pcibios_init : BIOS32 Service Directory entry at 0xfdb80
pcibios_init : PCI BIOS revision 2.10 entry at 0xfdba1
Probing PCI hardware.
Warning : Unknown PCI device (10b7:9050). Please read include/linux/pci.h
Calibrating delay loop.. ok - 199.07 BogoMIPS
Memory: 127952k/131072k available (660k kernel code, 384k reserved, 2076k data)
Swansea University Computer Society NET3.035 for Linux 2.0
NET3: Unix domain sockets 0.13 for Linux NET3.035.
Swansea University Computer Society TCP/IP for NET3.034
IP Protocols: ICMP, UDP, TCP
VFS: Diskquotas version dquot_5.6.0 initialized
Checking 386/387 coupling... Ok, fpu using exception 16 error reporting.
Checking 'hlt' instruction... Ok.
Linux version 2.0.29 (root@goliath.jet.msk.su) (gcc version 2.7.2.1) #4 Tue Aug 26 17:49:08 MSD 1997
Error: only one processor found.
Serial driver version 4.13 with no serial options enabled
tty00 at 0x03f8 (irq = 4) is a 16550A
tty01 at 0x02f8 (irq = 3) is a 16550A
Real Time Clock Driver v1.07
ide: i82371 PIIX (Triton) on PCI bus 0 function 57
ide0: BM-DMA at 0xffa0-0xffa7
ide1: BM-DMA at 0xffa8-0xffaf
hda: Pioneer CD-ROM ATAPI Model DR-A12X 0100, ATAPI CDROM drive
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
md driver 0.35 MAX_MD_DEV=4, MAX_REAL=8
aic7xxx: BurstLen = 8 DWDs, Latency Timer = 64 PCLKS
aic7xxx: AHA-2940 Ultra Rev B.
aic7xxx: devconfig = 0x1580.
aic7xxx: Reading SEEPROM...done.
aic7xxx: Enabling support for Ultra SCSI speed.
aic7xxx: Extended translation enabled.
aic7xxx: Memory check yields 16 SCBs, 255 page-enabled SCBs.
aic7xxx: Enabling wide channel of AHA-2940 Ultra-Wide.
AHA-2940 Ultra-WIDE (PCI-bus), I/O 0xec00, Mem 0xfebff000:
irq 10
bus release time 40 bclks
data fifo threshold 100%
SCSI CHANNEL A:
scsi id 7
scsi selection timeout 256 ms
scsi bus reset at power-on enabled
scsi bus parity enabled
scsi bus termination (low byte) enabled
scsi bus termination (high byte) enabled
aic7xxx: Downloading sequencer code...done.
aic7xxx: Resetting the SCSI bus...done.
scsi0 : Adaptec AHA274x/284x/294x (EISA/VLB/PCI-Fast SCSI) 4.0/3.2/4.0
scsi : 1 host.
scsi0: Scanning channel A for devices.
Started kswapd v 1.4.2.2
scsi0: Received MSG_WDTR, Target 6, channel A needwdtr(0xfffd).
scsi0: Target 6, channel A, using 16 bit transfers.
scsi0: Target 6, channel A, now synchronous at 20.0MHz, offset 8.
Vendor: QUANTUM Model: XP32275W Rev: LXY4
Type: Direct-Access ANSI SCSI revision: 02
Detected scsi disk sda at scsi0, channel 0, id 6, lun 0
scsi0: Enabled tagged queuing for target 6, channel 0, LUN 0, queue depth 8.
scsi : detected 1 SCSI disk total.
SCSI device sda: hdwr sector= 512 bytes. Sectors= 4445380 [2170 MB] [2.2 GB]
Partition check:
sda: sda1 sda2
VFS: Mounted root (ext2 filesystem) readonly.
Adding Swap: 102396k swap-space
Here's "scsiinfo -a /dev/sda":
Inquiry command
---------------
Relative Address 0
Wide bus 32 0
Wide bus 16 1
Synchronous neg. 1
Linked Commands 1
Command Queueing 1
SftRe 0
Device Type 0
Peripheral Qualifier 0
Removable? 0
Device Type Modifier 0
ISO Version 0
ECMA Version 0
ANSI Version 2
AENC 0
TrmIOP 0
Response Data Format 2
Vendor: QUANTUM
Product: XP32275W
Revision level: LXY4182706552275
Data from Rigid Disk Drive Geometry Page
----------------------------------------
Number of cylinders 5899
Number of heads 5
Starting write precomp 5899
Starting reduced current 0
Drive step rate 0
Landing Zone Cylinder 5900
RPL 0
Rotational Offset 0
Rotational Rate 7200
Data from Caching Page
----------------------
Write Cache 1
Read Cache 1
Prefetch units 0
Demand Read Retention Priority 0
Demand Write Retention Priority 0
Disable Pre-fetch Transfer Length 65535
Minimum Pre-fetch 0
Maximum Pre-fetch 357
Maximum Pre-fetch Ceiling 357
Data from Format Device Page
----------------------------
Removable Medium 0
Supports Hard Sectoring 1
Supports Soft Sectoring 0
Addresses assigned by surface 0
Tracks per Zone 5
Alternate sectors per zone 5
Alternate tracks per zone 0
Alternate tracks per lun 0
Sectors per track 152
Bytes per sector 512
Interleave 1
Track skew factor 9
Cylinder skew factor 54
Data from Error Recovery Page
-----------------------------
AWRE 1
ARRE 1
TB 0
RC 0
EER 0
PER 0
DTE 0
DCR 0
Read Retry Count 8
Correction Span 80
Head Offset Count 0
Data Strobe Offset Count 0
Write Retry Count 8
Recovery Time Limit 0
Data from Control Page
----------------------
RLEC 0
QErr 0
DQue 0
EECA 0
RAENP 0
UUAENP 0
EAENP 0
Queue Algorithm Modifier 1
Ready AEN Holdoff Period 0
Data from Disconnect-Reconnect Page
-----------------------------------
Buffer full ratio 0
Buffer empty ratio 0
Bus Inactivity Limit 0
Disconnect Time Limit 0
Connect Time Limit 0
Maximum Burst Size 0
DTDC 0x0
Data from Defect Lists
----------------------
21 entries in manufacturer table.
[skipped]
0 entries in grown table.
Data from Notch Parameters Page
-------------------------------
Notched Drive 1
Logical or Physical Notch 0
Max # of notches 16
Active Notch 0
Starting Boundary 0x2c00
Ending Boundary 0x16df04
Pages Notched 00000000 00001008
Data from Verify Error Recovery Page
------------------------------------
EER 0
PER 0
DTE 0
DCR 0
Verify Retry Count 8
Verify Correction Span (bits) 80
Verify Recovery Time Limit (ms) 0
Here's /proc/pci:
PCI devices found:
Bus 0, device 19, function 0:
SCSI storage controller: Adaptec AIC-7881U (rev 0).
Medium devsel. Fast back-to-back capable. IRQ 10. Master Capable. Latency=64. Min Gnt=8.Max Lat=8.
I/O at 0xec00.
Non-prefetchable 32 bit memory at 0xfebff000.
Bus 0, device 18, function 0:
Ethernet controller: 3Com Unknown device (rev 0).
Vendor id=10b7. Device id=9050.
Medium devsel. IRQ 9. Master Capable. Latency=248. Min Gnt=3.Max Lat=8.
I/O at 0xef00.
Bus 0, device 17, function 0:
VGA compatible controller: S3 Inc. Trio32/Trio64 (rev 84).
Medium devsel. IRQ 11.
Non-prefetchable 32 bit memory at 0x80000000.
Bus 0, device 7, function 1:
IDE interface: Intel 82371SB Natoma/Triton II PIIX3 (rev 0).
Medium devsel. Fast back-to-back capable. Master Capable. Latency=32.
I/O at 0xffa0.
Bus 0, device 7, function 0:
ISA bridge: Intel 82371SB Natoma/Triton II PIIX3 (rev 1).
Medium devsel. Fast back-to-back capable. Master Capable. No bursts.
Bus 0, device 0, function 0:
Host bridge: Intel 82441FX Natoma (rev 2).
Medium devsel. Fast back-to-back capable. Master Capable. Latency=32.
Any ideas what can I check?
--
Best regards, -- Boris.