db maintenance "delete oldest least used tokens, but maintain count of x"

Matthias Andree matthias.andree at gmx.de
Fri Mar 5 17:16:39 CET 2004


David Relson <relson at osagesoftware.com> writes:

>> Won't work in the long run. One half of your tokens have expired,
>> .MSG_COUNT is way too large.
>
> Is it?  Or does it reflect shorter messages?  When "xyz" expires, it's
> comparable to saying that the message was 1 token shorter than
> bogofilter originally thought it to be.

I've taken a closer look. The message counts are indeed used when
calculating the probability of a token. If the token is gone from the
database, we assume robx for it instead, roughly speaking.
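
To make that concrete, here is a minimal sketch of a Robinson-style
token probability, written from memory rather than copied from the
bogofilter sources. The function name, the parameter names, and the
default robx/robs values are assumptions for illustration only. It shows
where the message counts enter and why a token with no counts left
simply scores robx.

```python
def token_probability(spam_count, ham_count, spam_msgs, ham_msgs,
                      robx=0.5, robs=1.0):
    """Robinson-style estimate of how spammy a single token is.

    spam_count / ham_count: how often the token was seen in spam / ham.
    spam_msgs / ham_msgs:   the message counts (.MSG_COUNT in the thread).
    robx: score assumed for a token we know nothing about (illustrative).
    robs: weight given to that assumption (illustrative).
    """
    n = spam_count + ham_count
    if n == 0:
        # Token was never seen, or has been pruned: fall back to robx.
        return robx

    # Per-message frequencies; this is where the message counts matter.
    spam_rate = spam_count / spam_msgs if spam_msgs else 0.0
    ham_rate = ham_count / ham_msgs if ham_msgs else 0.0
    if spam_rate + ham_rate == 0.0:
        return robx

    p = spam_rate / (spam_rate + ham_rate)
    # Blend the observed value with the robx prior, weighted by robs.
    return (robs * robx + n * p) / (robs + n)


if __name__ == "__main__":
    print(token_probability(0, 0, 100, 100))   # pruned token
    print(token_probability(10, 0, 100, 100))  # token seen only in spam
```

Note that only the ratio of the two message counts affects p, so
pruning a token changes that token's score (back to robx) without
changing the scores of the tokens that remain.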

> Many tokens will continue to exist.  The counts for "relson" and
> "osagesoftware.com" will not be affected by age or date pruning, hence
> the number of messages isn't affected.

Right.

-- 
Matthias Andree

Encrypt your mail: my GnuPG key ID is 0x052E7D95



