Data base "maintenance" (removing tokens) and MSG_COUNT?

David Relson relson at osagesoftware.com
Tue Jul 15 17:17:15 CEST 2003


At 11:09 AM 7/15/03, Matthias Andree wrote:
>Hi,
>
>what I am currently wondering about is:
>
>we register the token count and a message count, to obtain a certain
>"spamicity" of an individual token.
>
>However, what happens if tokens are removed? We don't adjust MSG_COUNT
>AFAIR, so I fear that in the long run, all individual token spamicity
>values will be too low because the MSG_COUNT is too high, and the ROBX
>might also be bogus.
>
>Does this need to be taken into account? If so, do we need to store more
>information or adjust the .MSG_COUNT?

Matthias,

.MSG_COUNT is decremented when messages are removed via -N and -S.

Try the following:

rm -f /tmp/????list.db
echo this is a test | bogofilter -s -d /tmp
bogoutil -w /tmp .MSG_COUNT
echo this is a test | bogofilter -S -d /tmp
bogoutil -w /tmp .MSG_COUNT


David





More information about the Bogofilter mailing list