bogofilter 1.1.1 - New Stable Release
David Relson
relson at osagesoftware.com
Fri Sep 1 05:00:46 CEST 2006
Bogofilter v1.1.1 has new parsing options for setting the minimum
token length, maximum token length, max number of words per token
(pairs, triples, etc), and max length of a multi word token. These
options are:
--min-token-len=N min len for single tokens
--max-token-len=N max len for single tokens
--max-multi-token-len=N max len for multi-word tokens
--multi-token-count=N number of tokens per multi-word token
These options provide additional flexibility in parsing an email
message. When used with a count greater than one, the
multi-token-count option does seem to increase bogofilter's accuracy.
However it also increase the number of tokens processed per message,
will _significantly_ increase the size of your wordlist (database),
and _will_ make bogofilter run slower.
Version 1.1.1 improved on 1.1.0 with a minor token parsing fix, a new
Italian FAQ, and cleaned up formatting for the English and French
FAQs.
########################################################################
Additional information is in my 8 posting titled "multi-word token
results" made on Aug 8 to the bogofilter-dev mailing list. Copies can
be found at
http://www.bogofilter.org/pipermail/bogofilter-dev/2006-August/003357.html
http://article.gmane.org/gmane.mail.bogofilter.devel/4226
########################################################################
Files are available at http://sourceforge.net/projects/bogofilter for
download.
Here are the md5sums for the release:
2a8f3f2d249126f4d07d2b35c103ec7f bogofilter-1.1.1-1.src.rpm
e7f236002daf22d732cad0433adac2ff bogofilter-1.1.1.tar.bz2
8ef277996a0bcfcbacb8b5dcfd789e84 bogofilter-1.1.1.tar.gz
6c176808ee4c92eee46812e77a1c08f7 bogofilter-db42-1.1.1-1.i586.rpm
9ab833491fe924b2a31c26fb2cdfd12e bogofilter-db42-debug-1.1.1-1.i586.rpm
cf6423e41f8b59fe45f4b0b3321a2edb bogofilter-db42-static-1.1.1-1.i586.rpm
cb9a0695c6ac2997761c53a890b1fb0c bogofilter-db42-static-debug-1.1.1-1.i586.rpm
fefab2e867445309300c16e14fde0def bogofilter-sqlite3-1.1.1-1.i586.rpm
77a4a840659594771167afbc7ef69c3e bogofilter-sqlite3-debug-1.1.1-1.i586.rpm
21c39098bae1ede1c21486ac3e7ac631 bogofilter-sqlite3-static-1.1.1-1.i586.rpm
f1643e892c12cade3938d210b925f6a0 bogofilter-sqlite3-static-debug-1.1.1-1.i586.rpm
########################################################################
Revision history for multi-word token support:
1.1.1 2006-08-23 (released)
2006-08-22
* Added bogofilter-faq-it.html, an Italian translation of the
FAQ (thanks to Marco Bozzolan).
2006-08-10
* Fixed minor header/body multi-word token defect.
1.1.0 2006-08-09 (released)
* Revised FAQ's mailbox conversion example.
2006-07-24
* Forward port GNU make compatibility fix for doc/Makefile* from
1.0 branch.
2006-07-08
* Add large file support for 32-bit systems.
(_FILE_OFFSET_BITS/_LARGE_FILE).
* Fix lexer_v3.l format string mismatch that broke debugging
code on 64-bit systems.
2006-07-04
* Add multi-word token support to bogoutil & bogotune.
2006-07-03
* Clean up token prefixing.
* Clean up queue-id processing.
* Add max-multi-token-len checks.
* Revised function names. get_token() uses parse_new_token(),
add_token_to_array(), build_token_from_array(), and
build_prefixed_token().
2006-07-02
* Add min-token-len check (with exemption for 2 character
money amounts which bogofilter has long accepted).
* Add "short token" pattern to lexer
2006-07-01
* Refactor get_token. Function get_single_token is the original
get_token function. Function get_multi_token calls
get_single_token when another token must be parsed, else it
constructs multi-part tokens using w_token_array (an array
of word_t structs).
2006-06-20
* Add options for min/max token length, multi-token count, and
max multi-token length.
* Modify get_token() to return multi-word tokens.
1.0.3 2006-07-26 (promoted to stable)
1.0.3 2006-07-10 (released)
--
David Relson Osage Software Systems, Inc.
relson at osagesoftware.com Ann Arbor, MI 48103
www.osagesoftware.com tel: 734.821.8800
More information about the bogofilter-dev
mailing list