korean spam

Boris 'pi' Piwinger 3.14 at logic.univie.ac.at
Thu Oct 10 13:36:15 CEST 2002


David Relson wrote:

>>I do on http://piology.org/.procmailrc.html:
>>
>> > ## Silently drop all completely unreadable spam
>> > :0E
>> > * 1^0 
>> ^\/Subject:.*=\?(.*big5|iso-2022-jp|ISO-2022-KR|euc-kr|gb2312|ks_c_5601-1987|windows-1251|windows-1256)\?
>> > * 1^0 
>> ^Content-Type:.*charset="?(.*big5|iso-2022-jp|ISO-2022-KR|euc-kr|gb2312|ks_c_5601-1987|windows-1251|windows-1256)
>> > /dev/null
>>
>>Works nicely. Some undeclared messages come through to bogofilter, but
>>that is not a problem.
> 
> I like this!!!
> 
> Adrian, How about a FAQ entry on "How to drop all completely unreadable 
> messages".
> 
> Boris, would you write up the entry and submit it?

Q: There is lots of spam which is in charsets which I cannot read or
not even display. Should I let bogofilter work with it? What else can
I do?

A: As it stands now these messages don't work properly with bogofilter
due to a limitation with 8bit characters which are used heavily in
those languages. A solution using Procmail would be to drop that spam
before bogofilter is called. The following does the trick:

[Remove the quote symbols which are just to avoid linewrapping]
> ## Silently drop all completely unreadable spam
> :0E
> * 1^0 ^\/Subject:.*=\?(.*big5|iso-2022-jp|ISO-2022-KR|euc-kr|gb2312|ks_c_5601-1987|windows-1251|windows-1256)\?
> * 1^0 ^Content-Type:.*charset="?(.*big5|iso-2022-jp|ISO-2022-KR|euc-kr|gb2312|ks_c_5601-1987|windows-1251|windows-1256)
> /dev/null
> 
> :0HB:
> * ? bogofilter -u
> spam-bogofilter

pi


For summay digest subscription: bogofilter-digest-subscribe at aotto.com



More information about the Bogofilter mailing list