[91313] in North American Network Operators' Group
RE: www.gigablast.com
daemon@ATHENA.MIT.EDU (Bill Woodcock)
Thu Jul 13 15:19:10 2006
Date: Thu, 13 Jul 2006 12:11:30 -0700 (PDT)
From: Bill Woodcock <woody@pch.net>
To: David Schwartz <davids@webmaster.com>
Cc: nanog <nanog@merit.edu>
In-Reply-To: <MDEHLPKNGKAHNMBLJOLKGEFINCAB.davids@webmaster.com>
Errors-To: owner-nanog@merit.edu
> What gigablast seems to be doing, on the other hand, is trying to open
> every window in a house in the hopes that it will find one that's open.
Just looking at the text strings in the URLs, my off-the-top-of-my-head
guess was that those were URLs it saw in email spam. They looked very
similar to a lot of the ascii-garbage that gets generated by spammers
trying to get through bayesian filters. It seemed plausible to me (not a
good idea, of course, but the sort of thing that happens) that they might
have been grepping web pages for URLs, and run across an archive of spam.
-Bill