[988] in linux-net channel archive

home help back first fref pref prev next nref lref last post

Re: heavily-accessed site w/ random hangs on specific port... suggestions?

daemon@ATHENA.MIT.EDU (Rob Janssen reading Linux mailingl)
Fri Aug 25 00:41:42 1995

From: linux@pe1chl.ampr.org (Rob Janssen reading Linux mailinglist)
To: andrewm@connect.com.au (Andrew McNamara)
Date: Thu, 24 Aug 1995 08:22:51 +0200 (MET DST)
Cc: mea@mea.cc.utu.fi, linux-net@vger.rutgers.edu
In-Reply-To: <199508230547.PAA21867@warrane.connect.com.au> from "Andrew McNamara" at Aug 23, 95 03:47:55 pm
Reply-To: linux-vger@wab-tis.rabobank.nl

According to Andrew McNamara:
> >	When the jam happens, network monitors (persons with equipment
> >	on right spots on our ATM and FDDI nets) say that  ftp.funet.fi
> >	gets  TCP-SYNs (so, requests to create connection) into FTP
> >	control port, however it does not send TCP-ACK for it -> connection
> >	will not get estabilished.   Simultaneous system call trace on
> >	that machine also shows that  accept() does not happen.
> >[...]
> >	I am beginning to wonder, if Linux network code does something
> >	wrong in the same way as BSD one ?  (The OSF/1 networking code
> >	is clearly BSD code.)   Or is it something more sinister ...
> >	... the TCP/IP protocol is at fault ?
> 
> We have a similar experience here - but it usually effects SMTP (although
> we are starting to see it on our web server as well).
> 
> I think it's a problem with the sockets interface model rather than TCP
> itself - the listen(2) call only supports a finite number of
> outstanding connection requests.  In spite of what the manual entries
> appear to say, connections requests in excess of the maxium supported
> are simply ignored as you have seen.

Couldn't it be a problem with TIME WAIT state?
When many connections are made and released, a lot of TCP controlblocks
can be in TIME WAIT state, and thus unavailable for use for some time.
When the controlblocks are from a fixed array (rather than dynamically
allocated) you can be out of controlblocks even while only a few connections
are active.  You just need a lot of connections/releases over a short
interval.  WWW could cause that (also when doing FTP from WWW browsers).

I don't know why it would affect only connects to a single port number.
Maybe the TCP code counts connections to the same port in an attempt to
fairly distribute the finite resource over different uses?

Rob

-- 
+------------------------------------+--------------------------------------+
| Rob Janssen         rob@knoware.nl | AMPRnet:   rob@pe1chl.ampr.org       |
| e-mail: pe1chl@wab-tis.rabobank.nl | AX.25 BBS: PE1CHL@PI8WNO.#UTR.NLD.EU |
+------------------------------------+--------------------------------------+

home help back first fref pref prev next nref lref last post