Minimum usable counts [was: Question]
David Relson
relson at osagesoftware.com
Thu May 21 05:16:05 CEST 2009
On Thu, 21 May 2009 12:01:41 +0930
Stephen Davies wrote:
> I understand.
>
> My initial issue is with the obvious spams not being detected first
> time round.
> The first I see of them is in my inbox as ham - despite being so
> obviously spam.
>
> If I save the email and run it through bogofilter -vvv, I get the
> results I posted.
>
> I then use bogofilter -Ns to "fix" the database and this seems to
> work - until the next spam with the same pattern but from a different
> source arrives. (bogofilter -vvv at this stage gives bogosity of 1.0).
>
> I have changed my min-dev, robx and robs to 0.35, 0.7, 0.1 but first
> indications are that this is not enough.
...[snip]...
Hi Stephen,
'Tis an interesting idea to allow not scoring tokens whose spam and ham
counts are low. As an experiment, the attached patch for src/score.c
will ignore tokens for which good_count+bad_count<3. Give it a try and
let me know what you think of it.
Regards,
David
P.S. If the patch works for you, we'll need a good name for the
option. Any suggestions?
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: patch.0520.score.txt
URL: <http://www.bogofilter.org/pipermail/bogofilter/attachments/20090520/ed3d1153/attachment.txt>
More information about the Bogofilter
mailing list