What is a spamicity of exactly 0.5?
Boris 'pi' Piwinger
3.14 at logic.univie.ac.at
Mon Jan 26 08:49:17 CET 2004
"Jason A. Smith" <jazbo at jazbo.dyndns.org> wrote:
>> You ask about improving bogofilter's detection of spam with random
>> words. If you have an archive with several thousand ham and spam
>> messages, you can run bogotune to compute a set of parameters customized
>> for _your_ environment and for _your_ mix of ham and spam.
>
>I can't use bogotune yet since I just started using bogofilter and
>haven't saved enough spam yet to reach the min 2k threshold. It would
>be nice if bogotune included a flag to disable this enforced minimum.
Would be interesting. I never get that many messages in my
database.
>New users could then at least start with some numbers besides the built
>in defaults, even though they may not be as accurate as if they had
>waited till the 2k limit. They can always re-run bogotune later once
>they build up enough spam. Depending on how much spam someone receives
>daily, it could take weeks or months to reach this minimum and during
>that time the user can only guess at the parameters or stick with the
>built in defaults.
If you want to experiment a bit, have a look at the FAQ and
try training-to-exhaustion. Would be nice to know how this
performs in your case with only few messages to train with.
pi
More information about the Bogofilter
mailing list