Bogofilter FAQ corrections (Asian spam)
David Relson
relson at osagesoftware.com
Tue Apr 1 14:27:22 CEST 2003
pi,
I've made two charset changes in the UNREADABLE macro. The FAQ section is
about asian languages, specifically chinese, japanese, and korean. Since
windows-1251 is cyrillic and windows-1256 is arabic, they don't belong in
this section. For some people (like me) any charset not basd on the roman
alphabet is unreadable, so these could reasonably be included. However,
the FAQ is specifically for asian languages, so these two don't belong.
David
At 03:27 AM 4/1/03, Boris 'pi' Piwinger wrote:
>David Relson wrote:
>
> >>* 1^0 $ ^Subject:.*=\?($UNREADABLE)
> >>* 1^0 $ ^Content-Type:.*charset=.*($UNREADABLE)
>
>charset= should be followed by "? -- anyhow here is my
>version (more secure) of UNREADABLE:
>UNREADABLE='[^?"]*big5|iso-2022-jp|ISO-2022-KR|euc-kr|gb2312|ks_c_5601-1987|windows-1251|windows-1256'
>
>Note: I have actually found charset of the form '[^?"]*big5'
>when I created that rule which previously had '.*big5'.
>
>Theoretically, that will do it. I had some very similar
>recipe as above which did not work. I don't know why and
>gonna test the above version.
>
>pi
>
>
>---------------------------------------------------------------------
>FAQ: http://bogofilter.sourceforge.net/bogofilter-faq.html
>To unsubscribe, e-mail: bogofilter-unsubscribe at aotto.com
>For summary digest subscription: bogofilter-digest-subscribe at aotto.com
>For more commands, e-mail: bogofilter-help at aotto.com
More information about the Bogofilter
mailing list