When is spam_cutoff too low?

David Relson relson at osagesoftware.com
Mon Dec 13 03:16:11 CET 2004


On Sun, 12 Dec 2004 18:38:10 -0500
Matej Cepl wrote:

> Tom Anderson wrote:
> > Look at all of your ham and find the highest scoring one over the past 1-3
> > months.  You can set your spam cutoff to just above that value and not
> > fear getting false positives.  It's still possible of course, but highly
> > unlikely.  I find that by using -u to register all of my hams
> > automatically, my highest ham score is around 0.01.
> 
> Would you have some tool to get these statistics from the email corpora,
> or should I make myself some combination of grep, procmail, and other shell
> tools (or Python)?
> 
> Matej

Hi Matej,

"grep ^X-Bogosity:.Ham * | sort -k4 | tail" (or similar) provides part of
the answer :-)
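
For anyone who prefers the Python route, here is a minimal sketch of the same idea. It assumes the header looks like "X-Bogosity: Ham, tests=bogofilter, spamicity=0.010000, version=..."; adjust the pattern if your header format differs. The names parse_bogosity and top_scores are made up for this illustration and are not part of bogofilter.

```python
import re

# Assumed header shape (check it against your own mail):
#   X-Bogosity: Ham, tests=bogofilter, spamicity=0.010000, version=...
HEADER_RE = re.compile(
    r"^X-Bogosity:\s*(\w+),.*?spamicity=(\d+(?:\.\d+)?)",
    re.IGNORECASE,
)


def parse_bogosity(line):
    """Return (classification, spamicity) for an X-Bogosity line, else None."""
    match = HEADER_RE.match(line)
    if match is None:
        return None
    return match.group(1), float(match.group(2))


def top_scores(header_lines, classification="Ham", count=10):
    """Return up to `count` spamicity values for one class, highest first."""
    scores = []
    for line in header_lines:
        parsed = parse_bogosity(line)
        if parsed and parsed[0].lower() == classification.lower():
            scores.append(parsed[1])
    return sorted(scores, reverse=True)[:count]
```

Feeding it the lines of every message in a folder (for example through fileinput.input() from the standard library) should list the highest-scoring hams, much like the grep pipeline does.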

My use of -u hasn't been as successful as Tom's.  My ham and spam don't
cooperate by scoring at the extremes of 0 and 1.  Instead they cover a
wide range of values.  I've got ham as high as 0.65 and spam as low as
0.22.  I suspect we have a different mix of incoming mail.
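
When the ranges overlap like this, no single cutoff separates the two classes, and the choice becomes a trade-off. The sketch below counts the errors a given cutoff would produce. It assumes a message is treated as spam when its score is at or above the cutoff. The function name misclassified and the score lists are invented for illustration; only 0.65 and 0.22 come from my mail.

```python
def misclassified(ham_scores, spam_scores, cutoff):
    """Return (false_positives, false_negatives) for a given cutoff.

    Assumption: a score >= cutoff is classified as spam.
    """
    false_positives = sum(1 for score in ham_scores if score >= cutoff)
    false_negatives = sum(1 for score in spam_scores if score < cutoff)
    return false_positives, false_negatives


# Illustrative scores only; 0.65 (ham) and 0.22 (spam) are the real extremes.
ham = [0.0, 0.01, 0.65]
spam = [0.22, 0.9, 1.0]

# A cutoff just above the highest ham keeps all ham, but lets the 0.22 spam through.
print(misclassified(ham, spam, 0.66))

# A cutoff low enough to catch the 0.22 spam also flags the 0.65 ham.
print(misclassified(ham, spam, 0.2))
```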

Regards,

David


