Bogofilter FAQ corrections (Asian spam)

David Relson relson at osagesoftware.com
Tue Apr 1 14:27:22 CEST 2003


pi,

I've made two charset changes in the UNREADABLE macro.  The FAQ section is 
about asian languages, specifically chinese, japanese, and korean.  Since 
windows-1251 is cyrillic and windows-1256 is arabic, they don't belong in 
this section.  For some people (like me) any charset not basd on the roman 
alphabet is unreadable, so these could reasonably be included.  However, 
the FAQ is specifically for asian languages, so these two don't belong.

David

At 03:27 AM 4/1/03, Boris 'pi' Piwinger wrote:

>David Relson wrote:
>
> >>* 1^0 $ ^Subject:.*=\?($UNREADABLE)
> >>* 1^0 $ ^Content-Type:.*charset=.*($UNREADABLE)
>
>charset= should be followed by "? -- anyhow here is my
>version (more secure) of UNREADABLE:
>UNREADABLE='[^?"]*big5|iso-2022-jp|ISO-2022-KR|euc-kr|gb2312|ks_c_5601-1987|windows-1251|windows-1256'
>
>Note: I have actually found charset of the form '[^?"]*big5'
>when I created that rule which previously had '.*big5'.
>
>Theoretically, that will do it. I had some very similar
>recipe as above which did not work. I don't know why and
>gonna test the above version.
>
>pi
>
>
>---------------------------------------------------------------------
>FAQ: http://bogofilter.sourceforge.net/bogofilter-faq.html
>To unsubscribe, e-mail: bogofilter-unsubscribe at aotto.com
>For summary digest subscription: bogofilter-digest-subscribe at aotto.com
>For more commands, e-mail: bogofilter-help at aotto.com





More information about the Bogofilter mailing list