[6959] in cryptography@c2.net mail archive
Re: "Harmonized Packet Data Intercept Standards"
daemon@ATHENA.MIT.EDU (Declan McCullagh)
Fri Apr 28 00:48:23 2000
Message-Id: <4.3.0.20000428002605.00b549a0@pop.webcom.com>
Date: Fri, 28 Apr 2000 00:34:31 -0400
To: John Gilmore <gnu@toad.com>, jya@pipeline.com, cypherpunks@toad.com, cryptography@c2.net
From: Declan McCullagh <declan@well.com>
In-Reply-To: <200004280331.UAA27114@toad.com>
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"; format=flowed
At 20:31 4/27/2000 -0700, John Gilmore wrote:
>[I think we need software for automatically extracting the words from PDF
>and MS-Word documents so they can be found in web searches. It looks like
>the bad guys are deliberately putting lots of interesting stuff in PDF
>to make it hard to find and read. --gnu]
I don't have the time to write such a thing, at least right now, but I'd be
happy to host a targeted spider on my server. I've got the space and
bandwidth and we could aim it at certain potentially-interesting sites.
(There must be plenty of text converters for MS-Word and PDF files, right?)
-Declan
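
As a rough illustration of the "targeted spider" idea above, here is a minimal
sketch of the link-selection step only: given one fetched page, keep the links
that point at PDF or MS-Word files on a chosen set of hosts. The function name,
the extension list, and the example hosts are all hypothetical and come from no
tool mentioned in this thread. Fetching pages and converting the documents to
text are left out.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse

# Assumption: these extensions stand in for "PDF and MS-Word documents".
DOC_EXTENSIONS = (".pdf", ".doc")


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def find_document_links(html, base_url, allowed_hosts):
    """Return absolute URLs of document links on allowed hosts, in page order.

    allowed_hosts is a set of lowercase host names (hypothetical targets).
    """
    parser = LinkCollector()
    parser.feed(html)
    found = []
    for href in parser.hrefs:
        url = urljoin(base_url, href)      # resolve relative links
        parts = urlparse(url)
        if parts.hostname not in allowed_hosts:
            continue                       # stay on the targeted sites
        if parts.path.lower().endswith(DOC_EXTENSIONS) and url not in found:
            found.append(url)
    return found
```

Each URL returned would then be handed to whatever text converter is
available, and the extracted words indexed alongside the original link.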