db maintenance "delete oldest least used tokens, but maintain count of x"
Matthias Andree
matthias.andree at gmx.de
Fri Mar 5 17:16:39 CET 2004
David Relson <relson at osagesoftware.com> writes:
>> Won't work in the long run. One half of your tokens have expired,
>> .MSG_COUNT is way too large.
>
> Is it? Or does it reflect shorter messages. When "xyz" expires, it's
> comparable to saying that the message was 1 token shorter than
> bogofilter originally thought it to be.
I've looked closer. Indeed the message counts are used for calculating
the probability of a token. If it's gone, then we assume robx instead,
roughly speaking.
> Many tokens will continue to exist. The counts for "relson" and
> "osagesoftware.com" will not be affected by age or date pruning, hence
> the number of messages isn't affected.
Right.
--
Matthias Andree
Encrypt your mail: my GnuPG key ID is 0x052E7D95
More information about the Bogofilter
mailing list