[4159] in linux-announce channel archive
Linux-Announce Digest #451
daemon@ATHENA.MIT.EDU (Digestifier)
Fri Jun 20 21:13:06 2003
From: Digestifier <Linux-Announce-Request@senator-bedfellow.mit.edu>
To: Linux-Announce@senator-bedfellow.mit.edu
Reply-To: Linux-Announce@senator-bedfellow.mit.edu
Date: Fri, 20 Jun 2003 21:13:01 EDT
Linux-Announce Digest #451, Volume #4 Fri, 20 Jun 2003 21:13:01 EDT
Contents:
bogofilter-0.13.7 (David Relson)
----------------------------------------------------------------------------
Date: Fri, 20 Jun 2003 11:12:31 CST
From: David Relson <relson@osagesoftware.com>
Subject: bogofilter-0.13.7
Bogofilter is a mail filter that classifies mail as spam or ham (non-spam)
by a statistical analysis of the message's header and content (body). The
program is able to learn from the user's classifications and corrections.
The statistical technique is known as the Bayesian technique and its use
for spam was described by Paul Graham in his article "A Plan For
Spam". Gary Robinson, in his weblog Rants, suggests some refinements for
improved discrimination between spam and ham. Bogofilter's primary
algorithm uses the f(w) parameter and the Fisher inverse chi-square
technique that he describes.
Bogofilter is run by an MDA script to classify an incoming message as spam
or ham (using wordlists stored by BerkeleyDB). Bogofilter provides
processing for plain text and html, supports multi-part mime message with
decoding of base64, quoted-printable, and uuencoded text and ignores
attachments, such as images.
Bogofilter is written in C. Supported platforms: Linux, FreeBSD, Solaris,
OS X, HP-UX, AIX, ...
******* ******* ******* ******* *******
Bogofilter-0.13.7 has been released on SourceForge.Net as the "new current
release". The tuning/tuning.sh script has been replaced by a new
tuning/bogotune script (and related files). bogotune does a comprehensive
job of testing bogofilter's s, x, min_dev, and spam_cutoff parameters in
order to determine the best values for a bogofilter site.
Project page: http://bogofilter.sourceforge.net/
Download page: http://sourceforge.net/project/showfiles.php?group_id=62265
=================
BOGOFILTER NEWS
=================
0.13.7 2003-06-20
* Replaced tuning/tuning.sh with tuning/bogotune (and related files).
0.13.6.3 2003-06-18
* Minor code rewrites to speed up processing messages, mboxes,
and msg-count files. In particular, tuning/tuning.sh runs
are approx 47% faster than before.
* Fixed several errors in tuning/tuning.sh and reformatted
"Top 10 Results" output.
* Minor changes to bogoutil to support bogotune script.
* Added newlines to correct usage messages.
0.13.6.2 2003-06-05
* Don't allow square brackets in tokens. Do allow dollar signs in
tokens in msg-count files.
* Bogolexer now discards first 'From' token to match scoring behavior
of bogofilter.
0.13.6.1 2003-06-05
* Updated file tuning/README and script tuning/tuning.sh.
0.13.6 2003-06-04
* Fix check for "^From " lines to work properly during base64 decoding.
* End html comment processing when a message header is found.
* Improve README for the tuning scripts and simplify them.
0.13.5 2003-06-03
* Allow terminal exclamation points on tokens.
* Diagnose invalid values of robx.
* Fixed bogofilter's non-use of message counts in msg-count files.
* Updated contrib/mime.get.rfc822
* Modified rstats_print_histogram() so it doesn't print 'nan's.
* Modified t.frame to find version of grep on Solaris so t.bulkmode
can run successfully.
0.13.4.1 2003-05-31
* Modified t.parsing test so it works with OSX's default file system.
0.13.4 2003-05-30
* Changed default value of ROBS from 0.001 to 0.01
* Fixed options '-M' (mailbox mode) and '-p' (passthrough mode) so
they work properly together.
* Minor cleanups in bogofilter.cf.example
* Added db-3.2 and db-3.1 to list for AC_CHECK_DB in configure.ac
0.13.3 2003-05-28
* Tweaked long line check used to prevent scanner buffer overflow.
* Added -V (version) option to bogolexer.
* Minor code tweaks to quiet gcc-3.3 warnings.
* Added doc/programmer/README.osx to distribution.
* Corrected FAQ's procmail recipe for training with SpamAssassin.
0.13.2.1 2003-05-24
* In bulkmode, output filenames to stdout.
* Further fixes for static-build system.
0.13.2 2003-05-24
* Build procedure for statically linked binaries has been improved.
* Autoconfiguration of BerkeleyDB library has been improved.
* Fixed defect in replace_nonascii_characters that was
superseding ignore_case option.
* Portability fix for efence usage in t.frame.
0.13.1 2003-05-23
* Modified "make rpm" to also build statically linked binaries.
They're packaged in bogofilter-static-x.y.z-1.i586.rpm
* Fixed bogofilter.spec.in to include files CHANGES-0.13 and
RELEASE.NOTES-0.13 which had been left out.
0.13.0 2003-05-21
* Added file RELEASE.NOTES-0.13. Read it!!!
* Changed parsing defaults to:
-PI ignore_case (default is disabled)
-Ph header_line_markup (default is enabled)
-Pt tokenize_html_tags (default is enabled)
* contrib/randomtrain and contrib/scramble can now process
both mbox and maildir formats.
* Added perl script contrib/mime.get.rfc822 to extract
forwarded messages from with a message.
* Added basic support for emacs RMAIL mailboxes.
* Remove incomplete RMAIL/Babyl-5 support.
* Registration code modified to count unique tokens for each
message and display the total of the counts.
* Added 'bogo-what?' to FAQ.
##########################################################################
# Send submissions for comp.os.linux.announce to: cola@stump.algebra.com #
# PLEASE remember a short description of the software and the LOCATION. #
# This group is archived at http://stump.algebra.com/~cola/ #
##########################################################################
------------------------------
** FOR YOUR REFERENCE **
The service address, to which questions about the list itself and requests
to be added to or deleted from it should be directed, is:
Internet: Linux-Announce-Request@NEWS-DIGESTS.MIT.EDU
You can submit announcements to be moderated via:
Internet: linux-announce@NEWS.ORNL.GOV
Linux may be obtained via one of these FTP sites:
ftp.funet.fi pub/Linux
tsx-11.mit.edu pub/linux
sunsite.unc.edu pub/Linux
End of Linux-Announce Digest
******************************