[4159] in linux-announce channel archive


Linux-Announce Digest #451

daemon@ATHENA.MIT.EDU (Digestifier)
Fri Jun 20 21:13:06 2003

From: Digestifier <Linux-Announce-Request@senator-bedfellow.mit.edu>
To: Linux-Announce@senator-bedfellow.mit.edu
Reply-To: Linux-Announce@senator-bedfellow.mit.edu
Date:     Fri, 20 Jun 2003 21:13:01 EDT

Linux-Announce Digest #451, Volume #4          Fri, 20 Jun 2003 21:13:01 EDT

Contents:
  bogofilter-0.13.7 (David Relson)

----------------------------------------------------------------------------

Date: Fri, 20 Jun 2003 11:12:31 CST
From: David Relson <relson@osagesoftware.com>
Subject: bogofilter-0.13.7

Bogofilter is a mail filter that classifies mail as spam or ham (non-spam) 
by a statistical analysis of the message's header and content (body).  The 
program is able to learn from the user's classifications and corrections.

The statistical technique is known as Bayesian filtering.  Its use against
spam was described by Paul Graham in his article "A Plan For Spam".  Gary
Robinson, in his weblog Rants, suggests some refinements for improved
discrimination between spam and ham.  Bogofilter's primary algorithm uses
the f(w) parameter and the Fisher inverse chi-square technique that
Robinson describes.
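
As a rough illustration of the calculation named above, the following
Python sketch computes Robinson's f(w) for a token and combines token
values with the Fisher inverse chi-square method.  It is not bogofilter's
C code: the function names, the count-based estimate of p(w), and the
default x=0.5 are assumptions made for this example, and s=0.01 merely
echoes the ROBS default listed in the NEWS section, on the assumption that
ROBS corresponds to s.

```python
import math

def f_w(spam_hits, ham_hits, n_spam, n_ham, s=0.01, x=0.5):
    """Robinson's smoothed token value f(w) = (s*x + n*p(w)) / (s + n).

    spam_hits/ham_hits: messages of each class containing the token.
    n_spam/n_ham: total messages of each class (hypothetical inputs).
    """
    n = spam_hits + ham_hits
    if n == 0:
        return x  # unseen token: fall back to the prior x
    b = spam_hits / n_spam if n_spam else 0.0
    g = ham_hits / n_ham if n_ham else 0.0
    if b + g == 0.0:
        return x  # inconsistent counts: no usable evidence
    p = b / (b + g)
    return (s * x + n * p) / (s + n)

def chi2q(x2, dof):
    """P(chi-square with even `dof` degrees of freedom >= x2).

    Underflows to 0 for very large x2, which is acceptable in a sketch.
    """
    m = x2 / 2.0
    term = total = math.exp(-m)
    for i in range(1, dof // 2):
        term *= m / i
        total += term
    return min(total, 1.0)

def fisher_score(values):
    """Combine token values into one score in [0, 1]; near 1 means spam."""
    if not values:
        return 0.5
    # Keep values strictly inside (0, 1) so both logarithms are defined.
    clipped = [min(max(v, 1e-9), 1 - 1e-9) for v in values]
    dof = 2 * len(clipped)
    h = chi2q(-2.0 * sum(math.log(v) for v in clipped), dof)
    s = chi2q(-2.0 * sum(math.log(1.0 - v) for v in clipped), dof)
    return (1.0 + h - s) / 2.0
```

With these definitions, a message whose tokens all have values near 1
scores close to 1, a message whose tokens are all near 0 scores close to
0, and a message of uninformative tokens (all 0.5) scores 0.5.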

Bogofilter is run by an MDA script to classify an incoming message as spam
or ham (using wordlists stored by BerkeleyDB).  Bogofilter processes plain
text and HTML, supports multipart MIME messages with decoding of base64,
quoted-printable, and uuencoded text, and ignores attachments such as
images.
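
The MIME handling described above can be sketched with Python's standard
email package.  This only illustrates the idea (walk the parts, decode the
text, skip images and other non-text parts); it is not how bogofilter,
which is written in C, implements its parser, and the function name
text_parts is made up for the example.

```python
from email import message_from_string

def text_parts(raw_message):
    """Yield the decoded text of every text/* part of a message."""
    msg = message_from_string(raw_message)
    for part in msg.walk():
        if part.get_content_maintype() != "text":
            continue  # multipart containers, images, other attachments
        # decode=True undoes the Content-Transfer-Encoding
        # (base64, quoted-printable, uuencode) and returns bytes.
        payload = part.get_payload(decode=True)
        if payload is None:
            continue
        charset = part.get_content_charset() or "latin-1"
        try:
            yield payload.decode(charset, errors="replace")
        except LookupError:  # unknown charset label
            yield payload.decode("latin-1")
```

A tokenizer would then be run over each yielded string, so that an image
attached to a spam message contributes nothing to the score.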

Bogofilter is written in C.  Supported platforms: Linux, FreeBSD, Solaris, 
OS X, HP-UX, AIX, ...

******* ******* ******* ******* *******

Bogofilter-0.13.7 has been released on SourceForge.Net as the "new current 
release".  The tuning/tuning.sh script has been replaced by a new 
tuning/bogotune script (and related files).  bogotune does a comprehensive 
job of testing bogofilter's s, x, min_dev, and spam_cutoff parameters in 
order to determine the best values for a bogofilter site.
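
To give a feel for what testing these four parameters involves, here is a
small Python sketch of an exhaustive parameter sweep.  It is not
bogotune's actual procedure, which this announcement does not describe:
the selection rule (fewest missed spam among settings that stay within an
allowed number of false positives), the function names, and the toy
scoring function are all assumptions made for the example.

```python
from itertools import product

def toy_score(msg, s, x, min_dev):
    """Stand-in scorer: mean smoothed token value, ignoring weak tokens.

    msg is a list of (spam_count, ham_count) pairs, one per token.
    Requires s > 0.
    """
    kept = []
    for spam_count, ham_count in msg:
        n = spam_count + ham_count
        p = spam_count / n if n else x
        f = (s * x + n * p) / (s + n)
        if abs(f - 0.5) >= min_dev:  # min_dev drops near-neutral tokens
            kept.append(f)
    return sum(kept) / len(kept) if kept else 0.5

def tune(score, spam, ham, grid, max_false_pos=0):
    """Return (false_negatives, params) for the best grid point, or None."""
    best = None
    for s, x, min_dev, cutoff in product(
            grid["s"], grid["x"], grid["min_dev"], grid["spam_cutoff"]):
        false_pos = sum(score(m, s, x, min_dev) >= cutoff for m in ham)
        if false_pos > max_false_pos:
            continue  # too many good messages would be flagged
        false_neg = sum(score(m, s, x, min_dev) < cutoff for m in spam)
        if best is None or false_neg < best[0]:
            best = (false_neg, {"s": s, "x": x, "min_dev": min_dev,
                                "spam_cutoff": cutoff})
    return best
```

A call such as tune(toy_score, spam_msgs, ham_msgs, grid), where grid maps
each of "s", "x", "min_dev", and "spam_cutoff" to a list of candidate
values, tries every combination.  The cost grows with the product of the
list lengths, which is one reason a tuning run over a large corpus can
take a long time.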

Project page:  http://bogofilter.sourceforge.net/
Download page: http://sourceforge.net/project/showfiles.php?group_id=62265

                               =================
                                BOGOFILTER NEWS
                               =================

0.13.7  2003-06-20

* Replaced tuning/tuning.sh with tuning/bogotune (and related files).

0.13.6.3 2003-06-18

* Minor code rewrites to speed up processing messages, mboxes,
   and msg-count files.  In particular, tuning/tuning.sh runs
   are approx 47% faster than before.
* Fixed several errors in tuning/tuning.sh and reformatted
   "Top 10 Results" output.
* Minor changes to bogoutil to support bogotune script.
* Added newlines to correct usage messages.

0.13.6.2 2003-06-05

* Don't allow square brackets in tokens.  Do allow dollar signs in
   tokens in msg-count files.
* Bogolexer now discards first 'From' token to match scoring behavior
   of bogofilter.

0.13.6.1 2003-06-05

* Updated file tuning/README and script tuning/tuning.sh.

0.13.6  2003-06-04

* Fix check for "^From " lines to work properly during base64 decoding.
* End html comment processing when a message header is found.
* Improve README for the tuning scripts and simplify them.

0.13.5  2003-06-03

* Allow terminal exclamation points on tokens.
* Diagnose invalid values of robx.
* Fixed bogofilter's non-use of message counts in msg-count files.
* Updated contrib/mime.get.rfc822
* Modified rstats_print_histogram() so it doesn't print 'nan's.
* Modified t.frame to find version of grep on Solaris so t.bulkmode
   can run successfully.

0.13.4.1 2003-05-31

* Modified t.parsing test so it works with OSX's default file system.

0.13.4  2003-05-30

* Changed default value of ROBS from 0.001 to 0.01
* Fixed options '-M' (mailbox mode) and '-p' (passthrough mode) so
   they work properly together.
* Minor cleanups in bogofilter.cf.example
* Added db-3.2 and db-3.1 to list for AC_CHECK_DB in configure.ac

0.13.3  2003-05-28

* Tweaked long line check used to prevent scanner buffer overflow.
* Added -V (version) option to bogolexer.
* Minor code tweaks to quiet gcc-3.3 warnings.
* Added doc/programmer/README.osx to distribution.
* Corrected FAQ's procmail recipe for training with SpamAssassin.

0.13.2.1 2003-05-24

* In bulkmode, output filenames to stdout.
* Further fixes for static-build system.

0.13.2  2003-05-24

* Build procedure for statically linked binaries has been improved.
* Autoconfiguration of BerkeleyDB library has been improved.
* Fixed defect in replace_nonascii_characters that was
   superseding ignore_case option.
* Portability fix for efence usage in t.frame.

0.13.1  2003-05-23

* Modified "make rpm" to also build statically linked binaries.
   They're packaged in bogofilter-static-x.y.z-1.i586.rpm
* Fixed bogofilter.spec.in to include files CHANGES-0.13 and
   RELEASE.NOTES-0.13 which had been left out.

0.13.0  2003-05-21

* Added file RELEASE.NOTES-0.13.  Read it!!!
* Changed parsing defaults to:
   -PI  ignore_case             (default is disabled)
   -Ph  header_line_markup      (default is enabled)
   -Pt  tokenize_html_tags      (default is enabled)
* contrib/randomtrain and contrib/scramble can now process
   both mbox and maildir formats.
* Added perl script contrib/mime.get.rfc822 to extract
   forwarded messages from within a message.
* Added basic support for emacs RMAIL mailboxes.
* Remove incomplete RMAIL/Babyl-5 support.
* Registration code modified to count unique tokens for each
   message and display the total of the counts.
* Added 'bogo-what?' to FAQ.

##########################################################################
# Send submissions for comp.os.linux.announce to: cola@stump.algebra.com #
# PLEASE remember a short description of the software and the LOCATION.  #
# This group is archived at http://stump.algebra.com/~cola/              #
##########################################################################


------------------------------


** FOR YOUR REFERENCE **

The service address, to which questions about the list itself and requests
to be added to or deleted from it should be directed, is:

    Internet: Linux-Announce-Request@NEWS-DIGESTS.MIT.EDU

You can submit announcements to be moderated via:

    Internet: linux-announce@NEWS.ORNL.GOV

Linux may be obtained via one of these FTP sites:
    ftp.funet.fi				pub/Linux
    tsx-11.mit.edu				pub/linux
    sunsite.unc.edu				pub/Linux

End of Linux-Announce Digest
******************************
