GETTING.STARTED (rev 2)

David Relson relson at osagesoftware.com
Wed Oct 27 03:12:08 CEST 2004


On Tue, 26 Oct 2004 23:41:21 +0200 (CEST)
Boris 'pi' Piwinger wrote:

> David Relson said:
> 
> >     2b. Training bogofilter
> >     -----------------------
> 
> At this section it would be important to point at the various training
> methods as described in the FAQ.

Good suggestion.  Thanks.

> > 6. Tuning bogofilter
> > --------------------
> >
> >     Once you've use bogofilter for a while, you may wish to optimize
> >     its classification parameters.  The bogotune utility uses your
> >     wordlist and additional ham and spam messages to check a large
> >     variety of possible parameter values and find what'll work best
> >     for your environment.  For more info, read the bogotune man page
> >     and file bogofilter-tuning.HOWTO.html.
> 
> This then applies only to some training methods.

Eh??  Bogotune uses the wordist, and the ham and spam corpora you
specify, and then does a rather exhaustive scan of possible scoring
parameters to find what gives the best results.  As you know, bogotune
has minimum requirements for number of messages registered in
wordlist.db and minimum numbers of messages for the ham and spam corpora
used in the tuning process.  Experience has shown that a wordlist with
too little info doesn't allow bogotune to do a good job, hence there are
_required_ minimum message counts.  Bogotune doesn't care what training
method you've used in building the wordlist, so long as there are enough
messages.

The only effect that training method has is whether or not enough
messages are present in the wordlist for bogotune to work successfully.

David

cc: bogofilter mailing list




More information about the Bogofilter mailing list