When is spam_cutoff too low?
David Relson
relson at osagesoftware.com
Mon Dec 13 03:16:11 CET 2004
On Sun, 12 Dec 2004 18:38:10 -0500
Matej Cepl wrote:
> Tom Anderson wrote:
> > Look at all of your ham and find the highest scoring one over the past 1-3
> > months. You can set your spam cutoff to just above that value and not
> > fear getting false positives. It's still possible of course, but highly
> > unlikely. I find that by using -u to register all of my hams
> > automatically, my highest ham score is around 0.01.
>
> Would you have some tool to get these statistics from the email corpora,
> or should I make myself some combination of grep, procmail, and other shell
> tools (or Python)?
>
> Matej
Hi Matej,
"grep '^X-Bogosity:.Ham' * | sort -k4 | tail" (or similar), run in a
directory of saved messages, lists the highest-scoring ham and so provides
part of the answer :-)
My use of -u hasn't been as successful as Tom's. My ham and spam don't
cooperate by scoring at the extremes of 0 and 1; rather, they cover a
wide range of values. I've got ham as high as 0.65 and spam as low as
0.22. I suspect we have a different mix of incoming mail.
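Since Python was mentioned, here is a minimal sketch of the same idea as
the grep pipeline: it reports the highest ham score and the lowest spam
score in one pass. It assumes one message per file (MH or maildir style)
and an unfolded header of the form
"X-Bogosity: Ham, tests=bogofilter, spamicity=0.012345, version=...".
The function names are made up for illustration, so adjust the pattern
if your headers look different.

```python
import re
import sys

# Assumed header shape (not checked against every bogofilter version):
#   X-Bogosity: Ham, tests=bogofilter, spamicity=0.012345, version=...
BOGOSITY = re.compile(
    r"^X-Bogosity:\s*(\w+),.*?spamicity=([0-9]*\.?[0-9]+)", re.IGNORECASE
)


def read_score(lines):
    """Return (label, spamicity) from the first X-Bogosity header, or None."""
    for line in lines:
        if line.strip() == "":
            break  # a blank line ends the headers; ignore the body
        m = BOGOSITY.match(line)
        if m:
            return m.group(1).capitalize(), float(m.group(2))
    return None


def summarize(scores):
    """Given (label, spamicity) pairs, return (highest ham, lowest spam)."""
    hams = [s for label, s in scores if label == "Ham"]
    spams = [s for label, s in scores if label == "Spam"]
    return (max(hams) if hams else None, min(spams) if spams else None)


def main(paths):
    scores = []
    for path in paths:
        # errors="replace" keeps odd encodings in mail from aborting the scan
        with open(path, errors="replace") as f:
            result = read_score(f)
        if result:
            scores.append(result)
    top_ham, low_spam = summarize(scores)
    print("highest ham score:", top_ham)
    print("lowest spam score:", low_spam)


if __name__ == "__main__":
    main(sys.argv[1:])
```

Saved under a name of your choosing and run over the message files in a
folder, it prints the two numbers. A spam cutoff just above the highest
ham score only makes sense if the lowest spam score lies above it too.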
Regards,
David