Problems with default charset and map_xlate_characters

Evgeny Kotsuba evgen at shatura.laser.ru
Thu Sep 25 13:10:16 CEST 2003


Hi,

It seems that with default charset wrong things are doing for any mail's 
charset exept well knowng  to  bogofilter, Even if  
allow_nonascii_replacement = 0. Problem is with map_xlate_characters 
wich has nothing common with ascii.   Say I  have letter in russian 
koi-8R coding wich should be standart for russian and used in unix and 
in "right" mailers. There also may be a number of codings for other 
ex-ussr rebublics like Ukrainian and more, we have now codings for 
russia's national republics (something like states in US or provinces in 
Canada)

Also next comment to: map_nonascii_characters - this is very bad 
function for any statictics etc.  I have made some russian's codepage 
decoder for decoding mails with wrong double and triple recodings and 
have name such coding as "Debillnaia" (de-billy's) because in case if 
you have message like ???? ??? ?? ???? any  decoding will false.
So if you have a lot of messages in foreing coding as  spam that map to 
???? ????? etc. and than have any short letter with some foreing words 
(say signature, user's name etc.) - than what will be ?

SY,
EK





More information about the bogofilter-dev mailing list