charset implementtion progress
Matt Armstrong
matt at lickey.com
Tue Nov 26 20:15:30 CET 2002
David Relson <relson at osagesoftware.com> writes:
> With these routines in place, the regression test results have changed
> a little bit. Since"iso-8859-1", "us-ascii", etc are now processed by
> the got_charset() routine and are not passed on as tokens...
Is it possible to pass them on as tokens too? The actual charset of
the message is a reliable SPAM indicator for me.
E.g. charset="ks_c_5601-1987", charset=euc-kr. They often showed up
as the tokens chosen for calculation in the original Graham method.
I'd hate to lose them.
More information about the bogofilter-dev
mailing list