Should I glob tokens?
Thomas Anderson
tanderson at orderamidchaos.com
Fri Apr 17 17:27:44 CEST 2009
If they have barely occurred in years, they should not be affecting
classifications anyway, so you can just delete all of them.
Tom
Charles Hewson wrote:
> Hi all,
> I am planning upgrade from 0.94.12 to 1.2.0 and have been looking
> at wordilst accumulated over 5 years. About 10% of tokens start with
> currency sing "$" followed by digits. Most of these occured once in spam.
> The highest count occured 17 times in spam in 2005. Only 4 of these tokens
> ever occure in non-spam. It would seem that a combined token could be more
> efficient?
>
> Any thoughts,
> Charles
>
> ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
> pub 1024D/F88852DE 2008-06-25 Charles Hewson <cahewson at eskimo.com>
> Key fingerprint = 0779 BBA4 CF82 0707 288B 3B37 BDB7 3DC3 F888 52DE
> sub 2048g/71B13048 2008-06-25 [expires: 2009-06-25]
>
> (For info see http://www.gnupg.org)
>
> Public key at - HTTP://WWW.ESKIMO.COM/~cahewson/pubkey.asc
>
> _______________________________________________
> Bogofilter mailing list
> Bogofilter at bogofilter.org
> http://www.bogofilter.org/mailman/listinfo/bogofilter
>
>
More information about the Bogofilter
mailing list