Russian charsets and functions

Clint Adams schizo at debian.org
Tue Jan 4 17:00:16 CET 2005


> Evgeny Kotsuba <evgen at shatura.laser.ru> writes:
> 
> > one problem is that charset may be set impropelly - by mail client 
> > and/or spammer, second problem will be doubling data base. Really  
> > english/americans don't  need russian or asian spam or mail,   russian 
> > don't need asian spam/mail and all english letterrs are placed to 0-127 
> > and russian - to 128-255. All really multy-lang documents I see was sent 
> > in .doc or .pdf  and so on.
> 
> Some mails earlier you documented how the same Cyrillic characters were
> encoded differently in the different character sets, so I presume some
> spammer actually exploiting this (we saw a time when spammers massively
> used ISO-8859-* accented Latin characters) will have to specify the
> proper character set lest he wants to produce garbage.

Plus, if someone sends Evgeny a message like this one, bogofilter
probably will not behave the manner he wants.

шапку с дурака не снимают

人文



More information about the bogofilter-dev mailing list