[263] in linux-net channel archive

home help back first fref pref prev next nref lref last post

Re: lcp-echo-failure doesn't work?

daemon@ATHENA.MIT.EDU (Alexander Terczka)
Wed May 3 06:35:01 1995

From: alex@terz.via.at (Alexander Terczka)
To: linux-net@vger.rutgers.edu
Date: Wed, 3 May 1995 09:48:33 +0100 (MET DST)

Hello,
> 
> Hello all!
> 
[cut]
> 
> .... 	This does NOT fix the problem with the PPP link dropping dead
> periodically!  I have seen this complained about by several individuals
> over the last several months.  The usual remark they get is "oh you must
> be having a modem problem".  Wrong again.  At this point, I have been able
> to confirm that the modem is in an operational state.  Flow control is
> NOT in a blocking state, the modem continues to receive data fine (implying
> that the modem has not gone brain-dead and the error correcting link is
> still fully functional) and that interrupts are proceeding properly.
> The higher level code seems to have lost track of what it is doing and is
> no longer able to send data!
> 
> 	I've mentioned this to a couple of others, including Alan Cox
> when I was looking at some things in the latest network snapshots.  So
> far, the best I can figure out is that it is some sort of timing problem.
> It seems to primarily occur, on my system, when there are a high number of
> connections being established and broken (you guessed it - WWW and Netscape).
> 
> 	As has been reported by others, this problem has seemed to come
> and go over a large number of patch levels.  Some have indicated that
> it got much worse around 1.1.6[345] or there abouts.  I can confirm that.
> It also seemed to get somewhat better with the 1.2.4 version but I can't
> find anything in those patches that would acount for it.  It seems much
> worse on a 28.8 link than a 14.4 link (I have one of each), but that may
> be a bogus observation since I don't have the same level of Web traffic
> over the 14.4 link; just telnet, ftp, and smtp and at a lower frequency.
> 
...
> 	Right now, it's just a minor annoyance, since pppd can now detect
> the problem and recover from it.  It would be nice to figure out where
> the problem is and fix it though.  I've been pawing through serial code,
> ppp code, and network code but have come up empty handed.  Anyone with some
> more suggestions as to where to look, let me know and I check'm out.
> 

   I had lockups on ppp-connections for a long time. The error was, that
TCP in the Linux kernel and TCP in a Cisco router (V ~9.1) didn't like
each other. It has to do with the windowing mechanism in the TCP layer. 
I know, this would not lock up the whole connection, but only one TCP 
connection. But perhaps it looks like a total modem lock in your 
situation. To fix the problem I simply said "PC/TCP compatibility mode : YES" 

  Alan told me, that there is a real patch that linux tcp can handle ciscos, 
and that I should try it, but as my IP-provider upgrade the cisco software 
to 10.2 I can't reproduce the bug even without the patch. 

btw: linux is the best OS I ever used. :)

AlexT
(alext@cslab.tuwien.ac.at)

home help back first fref pref prev next nref lref last post