Bogofilter seems to not be working

Michael D Richards michael at emdee.net
Wed Mar 26 06:29:26 CET 2003


David Relson wrote:

> As you say, this process is learning and extending bogofilter's 
> vocabulary.  It also assumes bogofilter is doing a good job (which it 
> can do).  However, bogofilter cannot be all-knowing, so there will be 
> messages that are incorrectly classified.  When bogofilter is first 
> being used, the wordlists are small, and bogofilter's accuracy is at 
> its worst.  When using "-u", the sysadmin _must_ monitor what 
> bogofilter is doing.  When bogofilter makes a mistake, the sysadmin 
> needs to notify bogofilter and have the message removed from one 
> wordlist and added to the other.


Just to chime in with a little personal experience...

I have set up a system where -u is used starting with zero corpora. 
Bogofilter assumes everything is "good" at first and as users start 
fixing the false negatives, accuracy builds *very* rapidly. Under this 
system I have only seen one false positive, and chances are that one 
would have been classified that way even under a highly trained and 
mature instance.

In other words, even when bogofilter's accuracy is "at its worst", it 
works very well and -u can be very useful if the surrounding system is 
well thought out and you make it very easy for users to fix errors on 
their own.
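
As a rough illustration of the "easy for users to fix errors" part, a 
correction hook might look something like the sketch below. It is not the 
setup described above: the function names and the lookup table are made up 
for the example, and the -Ns / -Sn flag pairs are the ones given in the 
bogofilter man page for moving a message between wordlists, so check them 
against your installed version before relying on them.

```python
import subprocess

# Assumed flag pairs (per the bogofilter man page; verify for your version):
#   -Ns  unregister from the good list, register as spam
#   -Sn  unregister from the spam list, register as good
CORRECTION_FLAGS = {
    "false_negative": "-Ns",  # spam that -u filed as good
    "false_positive": "-Sn",  # good mail that -u filed as spam
}


def correction_command(kind, bogofilter="bogofilter"):
    """Build the argv for correcting one misclassified message."""
    if kind not in CORRECTION_FLAGS:
        raise ValueError("unknown correction kind: %r" % (kind,))
    return [bogofilter, CORRECTION_FLAGS[kind]]


def correct(kind, message_bytes):
    """Feed the raw message to bogofilter on stdin (hypothetical helper)."""
    subprocess.run(correction_command(kind), input=message_bytes, check=True)
```

A "this is spam" button in a mail client or webmail front end would then 
only need to call correct("false_negative", raw_message) for the selected 
message.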

Michael~