What is a spamicity of exactly 0.5?

Boris 'pi' Piwinger 3.14 at logic.univie.ac.at
Mon Jan 26 08:49:17 CET 2004


"Jason A. Smith" <jazbo at jazbo.dyndns.org> wrote:

>> You ask about improving bogofilter's detection of spam with random
>> words.  If you have an archive with several thousand ham and spam
>> messages, you can run bogotune to compute a set of parameters customized
>> for _your_ environment and for _your_ mix of ham and spam.
>
>I can't use bogotune yet since I just started using bogofilter and
>haven't saved enough spam yet to reach the min 2k threshold.  It would
>be nice if bogotune included a flag to disable this enforced minimum. 

That would be interesting. I never get that many messages in my
database.

>New users could then at least start with some numbers besides the built
>in defaults, even though they may not be as accurate as if they had
>waited till the 2k limit.  They can always re-run bogotune later once
>they build up enough spam.  Depending on how much spam someone receives
>daily, it could take weeks or months to reach this minimum and during
>that time the user can only guess at the parameters or stick with the
>built in defaults.

If you want to experiment a bit, have a look at the FAQ and
try training-to-exhaustion. It would be nice to know how this
performs in your case, with only a few messages to train with.
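
Roughly, the idea is to go over the same already-classified messages
again and again, training only on the ones the filter still gets
wrong, until a full pass produces no errors. A minimal sketch in
Python, just to illustrate the loop: ToyFilter, its scoring rule, the
0.5 cutoff and train_to_exhaustion are all made up for this example and
are neither bogofilter's algorithm nor the script described in the FAQ.

```python
from collections import Counter


class ToyFilter:
    """Hypothetical stand-in for a real filter (NOT bogofilter's scoring)."""

    def __init__(self):
        self.ham = Counter()
        self.spam = Counter()

    def score(self, text):
        # Share of known token counts that came from spam.
        # A message with no known tokens scores exactly 0.5.
        s = h = 0
        for tok in text.lower().split():
            s += self.spam[tok]
            h += self.ham[tok]
        return 0.5 if s + h == 0 else s / (s + h)

    def train(self, text, is_spam):
        # Register the message's tokens as spam or ham.
        (self.spam if is_spam else self.ham).update(text.lower().split())


def train_to_exhaustion(filt, corpus, cutoff=0.5, max_passes=20):
    """Train on errors only, repeating until one pass has no errors.

    corpus is a list of (text, is_spam) pairs. Returns the number of
    passes needed, or None if max_passes was reached first (the loop is
    not guaranteed to converge on every corpus).
    """
    for passes in range(1, max_passes + 1):
        errors = 0
        for text, is_spam in corpus:
            predicted_spam = filt.score(text) > cutoff
            if predicted_spam != is_spam:
                filt.train(text, is_spam)
                errors += 1
        if errors == 0:
            return passes
    return None


if __name__ == "__main__":
    corpus = [
        ("cheap pills now", True),
        ("meeting at noon", False),
        ("cheap meeting pills", True),
        ("lunch at noon", False),
    ]
    print(train_to_exhaustion(ToyFilter(), corpus))
```

With a real filter you would replace ToyFilter by calls to the filter
itself; the loop stays the same.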

pi



