how to bogotune?

tallison at tacocat.net tallison at tacocat.net
Wed Sep 29 21:05:01 CEST 2004


> On September 29, 2004 3:05 pm, tallison at tacocat.net wrote:
>> The inconsistency comes when you have real email that scores 0.9999 and
>> real spam that scores 0.0001.  What bogofilter/bogotune would like to see
>> is a greater separation between ham/spam such that your lowest spam might
>> be 0.5 and your highest ham might be 0.6.  But when you have maybe 50 spam
>> at 0.0001 and 50 ham at 0.9999 then it will come back with numbers for
>> cutoff like:
>> ham_cutoff = 0.435
>> spam_cutoff = 0.000
>> duh?
>
> I don't know. When I tried bogotune with messages that I had already
> trained with, I got results like this:
>
>  ham_cutoff = 0.000
>  spam_cutoff = 0.000
>
> Not too useful. :-)
>

The first question that needs to be answered is, "How confident are you
that your messages are filed correctly?"  I've sometimes made mistakes
and have had to spend some time digging out the misfilings.

You might write a script to find the low-scoring spam and high-scoring ham,
and verify that you have classified those messages correctly.
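Something along these lines might do as a starting point.  It is only a
rough sketch in Python, and it assumes each message has already been passed
through bogofilter so that it carries an X-Bogosity header containing a
"spamicity=..." field.  The function names and the 0.5 threshold are my own
inventions for illustration, not anything bogofilter or bogotune provides.

```python
import email
import re
from pathlib import Path

# Assumption: the header looks roughly like
#   X-Bogosity: Spam, tests=bogofilter, spamicity=0.999900, version=...
SPAMICITY_RE = re.compile(r"spamicity=([0-9.]+)")


def spamicity(raw_message):
    """Return the score from the X-Bogosity header, or None if it is absent."""
    msg = email.message_from_string(raw_message)
    match = SPAMICITY_RE.search(msg.get("X-Bogosity", ""))
    return float(match.group(1)) if match else None


def suspects(messages, is_spam_folder, threshold=0.5):
    """List (name, score) pairs whose score disagrees with the folder.

    messages is an iterable of (name, raw_text) pairs.  In a spam folder a
    low score is suspicious; in a ham folder a high score is suspicious.
    The threshold is an arbitrary illustrative value.
    """
    found = []
    for name, raw in messages:
        score = spamicity(raw)
        if score is None:
            continue  # message was never scored; nothing to compare
        misfiled = score < threshold if is_spam_folder else score > threshold
        if misfiled:
            found.append((name, score))
    # Most suspicious first: lowest scores for spam, highest for ham.
    found.sort(key=lambda item: item[1], reverse=not is_spam_folder)
    return found


def read_folder(directory):
    """Yield (filename, text) for each file in a one-message-per-file folder."""
    for path in sorted(Path(directory).iterdir()):
        if path.is_file():
            yield path.name, path.read_text(errors="replace")
```

With one message per file, something like
suspects(read_folder("spam"), is_spam_folder=True) would list the messages
worth a second look (the directory name "spam" is hypothetical).  Anything
it reports should be checked by hand before you feed the corpus to bogotune.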



