[6137] in www-talk@info.cern.ch

home help back first fref pref prev next nref lref last post

Re: Full-text indexing for WWW conference avail.

daemon@ATHENA.MIT.EDU (Roy T. Fielding)
Thu Oct 13 00:56:57 1994

Date: Thu, 13 Oct 1994 05:55:04 +0100
Errors-To: listmaster@www0.cern.ch
Errors-To: listmaster@www0.cern.ch
Reply-To: fielding@avron.ICS.UCI.EDU
From: "Roy T. Fielding" <fielding@avron.ICS.UCI.EDU>
To: Multiple recipients of list <www-talk@www0.cern.ch>

Nick Arnett wrote:

> The spider will hit your server fairly hard.  We have a real-time indexing
> engine and a T-1...

This is just plain irresponsible.  You are not only affecting their server,
you will also effect every network connection between your site and theirs.
People pay good money for that bandwidth -- you should not attempt to hog it.

Your spider should be running on their local net -- running at your site
provides no added value.  At a minimum, the spider should be forced to
delay between consecutive requests (about 15-30 seconds, depending on the
network throughput and speed of the server).


.....Roy Fielding   ICS Grad Student, University of California, Irvine  USA
                                     <fielding@ics.uci.edu>
                     <URL:http://www.ics.uci.edu/dir/grad/Software/fielding>

home help back first fref pref prev next nref lref last post