GETTING.STARTED (rev 2)

Boris 'pi' Piwinger 3.14 at piology.org
Wed Oct 27 17:21:05 CEST 2004


David Relson said:

>> > Eh??  Bogotune uses the wordist, and the ham and spam corpora you
>> > specify, and then does a rather exhaustive scan of possible scoring
>> > parameters to find what gives the best results.  As you know,
>> > bogotune has minimum requirements for number of messages registered
>> > in wordlist.db and minimum numbers of messages for the ham and spam
>> > corpora used in the tuning process.
>>
>> Right, so it is not usable for pure train-on-error approaches.
>
> Usually when I run bogotune, I start with an empty wordlist and 10K-15K
> ham and 10K-15K spam.

This is one approach. Others are listed in the FAQ. Also some people
don't have huge mail archives.

pi



More information about the Bogofilter mailing list