[cvs] bogofilter/src convert_unicode.c,1.5,1.6
David Relson
relson at osagesoftware.com
Mon Jun 20 02:20:28 CEST 2005
On Mon, 20 Jun 2005 00:54:06 +0200
Matthias Andree wrote:
> David Relson <relson at users.sourceforge.net> writes:
>
> > Update of /cvsroot/bogofilter/bogofilter/src
> > In directory sc8-pr-cvs1.sourceforge.net:/tmp/cvs-serv2292
> >
> > Modified Files:
> > convert_unicode.c
> > Log Message:
> > Use identity map when invalid conversion is specified.
> >
> > Index: convert_unicode.c
> > ===================================================================
> > RCS file: /cvsroot/bogofilter/bogofilter/src/convert_unicode.c,v
> > retrieving revision 1.5
> > retrieving revision 1.6
> > diff -u -d -r1.5 -r1.6
> > --- convert_unicode.c 18 Jun 2005 13:48:10 -0000 1.5
> > +++ convert_unicode.c 18 Jun 2005 13:51:28 -0000 1.6
> > @@ -128,9 +128,11 @@
> > cd = iconv_open( to_charset, from_charset );
> > if (cd == (iconv_t)(-1)) {
> > int err = errno;
> > - if (err != EINVAL)
> > - fprintf( stderr, "Invalid charset '%s'\n", to_charset );
> > - cd = iconv_open( from_charset, from_charset );
> > + if (err == EINVAL) {
> > + /* error - use identity mapping */
> > + fprintf( stderr, "Conversion from '%s' to '%s' is not supported.\n", from_charset, to_charset );
> > + cd = iconv_open( "iso-8859-1", "iso-8859-1" );
>
> Are you sure this is transparent for characters in the 0x80 - 0x9f and
> perhaps the 0x00 - 0x1f ranges? I'm not. I'm not even sure bogofilter
> will then find the same tokens if they happen to be seen again in a
> different character set - we might rather want to either reject the
> whole registration.
Perhaps we'd do better if we disable translation when iconv_open()
rejects the character set ???
More information about the bogofilter-dev
mailing list