[465] in SIPB_Linux_Development

home help back first fref pref prev next nref lref last post

Resnet causes net card transmit timeouts

daemon@ATHENA.MIT.EDU (Erik Nygren)
Mon Feb 21 15:32:42 1994

To: resnet-help@MIT.EDU
Cc: linux-dev@MIT.EDU
Date: Mon, 21 Feb 94 15:32:19 EST
From: Erik Nygren <nygren@MIT.EDU>


With the 3Com 3C509 under Linux, the following error is generated
during large transmissions:

Feb 21 14:35:57 foundation kernel: eth0: transmit timed out, tx_status 00 status 2000.

A quick poll of 6 users of Linux and 3c509's showed that 5 of them
have this problem.  No data gets lost, but a delay is introduced as
the card recovers from the error.  One AT-1700 user reports getting an
equivalent error, although this has not been verified.  During an ftp
of a few megabytes from the Linux machine to another machine, transmit
timeouts occur a few times.

Thinking it was a driver problem, I talked to Donald Becker (an MIT graduate
and the author of many of the Linux networking drivers).  He suggested
that the timeout might be set to low in the kernel.  I tried numerous
different values.  I even increased it to ridiculously high amounts of time
and the error persisted.  Writing back to him with this, he responded:

> The timeout check is there because most ethernet controllers can hang. The
> 3c509 is a recent design by people that have a lot of experience, but the
> people doing the BSD 3c509 driver ran into a similar problem, so I don't
> think it's the driver.  It don't happen on all networks, so it must be
> timing related.
>
>The AT-1700 has a similar problem, but it can't always be reset :-<.

From this, it sounds like the problem might be a timing problem
with how the dorm networks are configured.  Is there anything
you know of that might be causing this problem?

Thanks,
Erik Nygren

home help back first fref pref prev next nref lref last post