Problems with default charset and map_xlate_characters
Evgeny Kotsuba
evgen at shatura.laser.ru
Thu Sep 25 13:10:16 CEST 2003
Hi,
It seems that with default charset wrong things are doing for any mail's
charset exept well knowng to bogofilter, Even if
allow_nonascii_replacement = 0. Problem is with map_xlate_characters
wich has nothing common with ascii. Say I have letter in russian
koi-8R coding wich should be standart for russian and used in unix and
in "right" mailers. There also may be a number of codings for other
ex-ussr rebublics like Ukrainian and more, we have now codings for
russia's national republics (something like states in US or provinces in
Canada)
Also next comment to: map_nonascii_characters - this is very bad
function for any statictics etc. I have made some russian's codepage
decoder for decoding mails with wrong double and triple recodings and
have name such coding as "Debillnaia" (de-billy's) because in case if
you have message like ???? ??? ?? ???? any decoding will false.
So if you have a lot of messages in foreing coding as spam that map to
???? ????? etc. and than have any short letter with some foreing words
(say signature, user's name etc.) - than what will be ?
SY,
EK
More information about the bogofilter-dev
mailing list