... convert_unicode.c ...

Clint Adams schizo at debian.org
Sat Jun 25 01:05:13 CEST 2005


> Either we run iconv, then the output charset is always UTF-8 and not
> user-configurable for consistency, or we don't, in which case a
> configuration option doesn't make sense, as we're storing raw data.

At some point, when the spammers are sending lots of UTF-8 mail, it may
be useful to normalize the UTF-8 output.

http://www.unicode.org/faq/normalization.html



More information about the bogofilter-dev mailing list