replace-nonascii-characters stopped working?

Bill McClain wmcclain at salamander.com
Tue Mar 22 02:05:31 CET 2005


On Mon, 21 Mar 2005 19:39:43 -0500
David Relson <relson at osagesoftware.com> wrote:

> The file looks like all the accented european characters.  The
> relevant line is:
> 
>    Content-Type: text/plain; charset=Windows-1251
> 
> Running the 0.92.8 and 0.94.1 versions of bogolexer give the same
> results.  The design purpose of replace-nonascii-characters was to
> limit the tokens generated by asian spam. Since windows-1251 is a
> 'normal' charset, what bogofilter's doing is appropriate.

Ok, I believe it. It is a new type of spam for me. The first one arrived
just a few days ago the same time I upgraded bogofilter. Or more
exactly: 8-bit tokens first appeared in my wordlist then. I see that
messages with that Content-Type have arrived in the past, but weren't
registered because of thresh-update.

Thanks for investigating this.

-Bill
-- 
Sattre Press                                      Tales of War
http://sattre-press.com/                       by Lord Dunsany
info at sattre-press.com         http://sattre-press.com/tow.html
_______________________________________________
Bogofilter mailing list
Bogofilter at bogofilter.org
http://www.bogofilter.org/mailman/listinfo/bogofilter



More information about the Bogofilter mailing list