bogofilter 1.1.1 - New Stable Release

David Relson relson at osagesoftware.com
Fri Sep 1 05:00:46 CEST 2006


Bogofilter v1.1.1 has new parsing options for setting the minimum
token length, maximum token length, max number of words per token
(pairs, triples, etc), and max length of a multi word token.  These
options are:

      --min-token-len=N           min len for single tokens
      --max-token-len=N           max len for single tokens
      --max-multi-token-len=N     max len for multi-word tokens
      --multi-token-count=N       number of tokens per multi-word token

These options provide additional flexibility in parsing an email
message.  When used with a count greater than one, the
multi-token-count option does seem to increase bogofilter's accuracy.
However it also increase the number of tokens processed per message,
will _significantly_ increase the size of your wordlist (database),
and _will_ make bogofilter run slower.

Version 1.1.1 improved on 1.1.0 with a minor token parsing fix, a new
Italian FAQ, and cleaned up formatting for the English and French
FAQs.


########################################################################

Additional information is in my 8 posting titled "multi-word token
results" made on Aug 8 to the bogofilter-dev mailing list.  Copies can
be found at

    http://www.bogofilter.org/pipermail/bogofilter-dev/2006-August/003357.html

    http://article.gmane.org/gmane.mail.bogofilter.devel/4226


########################################################################

Files are available at http://sourceforge.net/projects/bogofilter for
download.

Here are the md5sums for the release:

2a8f3f2d249126f4d07d2b35c103ec7f  bogofilter-1.1.1-1.src.rpm
e7f236002daf22d732cad0433adac2ff  bogofilter-1.1.1.tar.bz2
8ef277996a0bcfcbacb8b5dcfd789e84  bogofilter-1.1.1.tar.gz
6c176808ee4c92eee46812e77a1c08f7  bogofilter-db42-1.1.1-1.i586.rpm
9ab833491fe924b2a31c26fb2cdfd12e  bogofilter-db42-debug-1.1.1-1.i586.rpm
cf6423e41f8b59fe45f4b0b3321a2edb  bogofilter-db42-static-1.1.1-1.i586.rpm
cb9a0695c6ac2997761c53a890b1fb0c  bogofilter-db42-static-debug-1.1.1-1.i586.rpm
fefab2e867445309300c16e14fde0def  bogofilter-sqlite3-1.1.1-1.i586.rpm
77a4a840659594771167afbc7ef69c3e  bogofilter-sqlite3-debug-1.1.1-1.i586.rpm
21c39098bae1ede1c21486ac3e7ac631  bogofilter-sqlite3-static-1.1.1-1.i586.rpm
f1643e892c12cade3938d210b925f6a0  bogofilter-sqlite3-static-debug-1.1.1-1.i586.rpm

########################################################################

Revision history for multi-word token support:

1.1.1	2006-08-23 (released)

	2006-08-22

	* Added bogofilter-faq-it.html, an Italian translation of the
          FAQ  (thanks to Marco Bozzolan).

	2006-08-10

	* Fixed minor header/body multi-word token defect.

1.1.0	2006-08-09 (released)

	* Revised FAQ's mailbox conversion example.

	2006-07-24

	* Forward port GNU make compatibility fix for doc/Makefile* from
	  1.0 branch.

	2006-07-08

	* Add large file support for 32-bit systems.
	  (_FILE_OFFSET_BITS/_LARGE_FILE).
	* Fix lexer_v3.l format string mismatch that broke debugging
code on 64-bit systems.

	2006-07-04

	* Add multi-word token support to bogoutil & bogotune.

	2006-07-03

	* Clean up token prefixing.
	* Clean up queue-id processing.
	* Add max-multi-token-len checks.
	* Revised function names.  get_token() uses parse_new_token(),
          add_token_to_array(), build_token_from_array(), and
          build_prefixed_token(). 

	2006-07-02

	* Add min-token-len check (with exemption for 2 character
          money amounts which bogofilter has long accepted).
	* Add "short token" pattern to lexer

	2006-07-01

	* Refactor get_token.  Function get_single_token is the original
          get_token function.  Function get_multi_token calls
          get_single_token when another token must be parsed, else it
          constructs multi-part tokens using w_token_array (an array
          of word_t structs).

	2006-06-20

	* Add options for min/max token length, multi-token count, and
          max multi-token length.

	* Modify get_token() to return multi-word tokens.

1.0.3	2006-07-26 (promoted to stable)
1.0.3	2006-07-10 (released)


-- 
David Relson                   Osage Software Systems, Inc.
relson at osagesoftware.com       Ann Arbor, MI 48103
www.osagesoftware.com          tel:  734.821.8800



More information about the bogofilter-dev mailing list