[7892] in cryptography@c2.net mail archive
Re: Lowercase compresses better?
daemon@ATHENA.MIT.EDU (Ted Lemon)
Fri Sep 29 18:54:12 2000
Message-Id: <200009292055.e8TKt4604037@grosse.bisbee.fugue.com>
To: rsalz@CaveoSystems.com
Cc: cryptography@c2.net
In-Reply-To: Message from rsalz@CaveoSystems.com
of "Fri, 29 Sep 2000 14:29:50 -0400." <200009291829.OAA22268@os390.caveosystems.com>
Date: Fri, 29 Sep 2000 13:55:04 -0700
From: Ted Lemon <mellon@nominum.com>
If your compression algorithm is tuned for normal ASCII text, then
<UC letter><lc letter> may be considered more frequent than
<UC letter><UC letter> for all combinations of values of <UC letter>,
and thus pairs of uppercased letters may result in longer bit streams
than pairs of lowercase letters or one uppercase letter followed by
one lowercase letter. In practice I have some trouble believing that
this matters, but I don't even play a data compression expert on the
net, so my lack of belief doesn't mean it doesn't make sense.
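
A rough sketch of the effect described above, for anyone who wants to poke
at it. It assumes a static order-1 (pair) model with add-one smoothing,
trained on a small made-up sample of mixed-case text. The sample, the
256-symbol alphabet, and the function names are all illustrative
assumptions and do not come from any real compressor. The ideal code
length for a pair is taken as -log2 of its estimated probability.

```python
import math
from collections import Counter

# Hypothetical training sample standing in for "normal ASCII text".
SAMPLE = (
    "The quick brown fox jumps over the lazy dog. "
    "Alice sent Bob a message, and Bob replied to Alice the next day. "
    "Compression works best when the input looks like the training text."
)

ALPHABET_SIZE = 256  # assumption: every byte value is a possible symbol


def train_pair_counts(text):
    """Count adjacent character pairs and how often each character leads a pair."""
    pairs = Counter(zip(text, text[1:]))
    firsts = Counter(text[:-1])
    return pairs, firsts


def pair_cost_bits(pairs, firsts, a, b):
    """Ideal code length in bits for b following a, with add-one smoothing."""
    p = (pairs[(a, b)] + 1) / (firsts[a] + ALPHABET_SIZE)
    return -math.log2(p)


def text_cost_bits(pairs, firsts, text):
    """Sum of pair costs over the text (the first character is not charged)."""
    return sum(pair_cost_bits(pairs, firsts, a, b) for a, b in zip(text, text[1:]))


pairs, firsts = train_pair_counts(SAMPLE)

mixed = "The dog"
upper = mixed.upper()
lower = mixed.lower()

for label, s in (("mixed", mixed), ("lower", lower), ("upper", upper)):
    print(f"{label:5s} {s!r}: {text_cost_bits(pairs, firsts, s):.2f} bits")
```

With this toy model the all-uppercase string costs more bits than either
the mixed-case or the lowercase one, because the model never saw those
uppercase pairs in training. An adaptive compressor that learns its
statistics from the input itself would not necessarily behave this way.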
_MelloN_