Serious problem with non-ASCII words

Matthias Andree matthias.andree at gmx.de
Sat Sep 21 00:04:39 CEST 2002


On Fri, 20 Sep 2002, David Relson wrote:

> At 12:48 PM 9/20/02, Paul Tomblin wrote:
> >Quoting Jørgen Thomsen (jth at jth.net):
> >> Den Fri, 20 Sep 2002 17:24:34 +0200, skrev du :
> >>
> >> >Looks like the parser is broken. Since I know German, I shall have a
> >> >look.
> >>
> >> I already observed this problem. It is quite simple. These are the
> >> alphanumeric characters in Latin-1, which should be used in the parser 
> >for the
> >> Latin-1 character set.
> >>
> >> [0-9A-Z_a-zªµºÀ-ÖØ-öø-ÿ]
> >
> >Doesn't [::isalpha::] work?
> 
> I'll be the first to admit I don't know special "names" like 
> "::isalpha::".  However, I can type it and I can test it :-)

[[:alpha:]] is probably the right name for flex, but it does not
comprise digits, so [[:alnum:]] would be it. However, my flex is too
dumb and sticks with ASCII (POSIX locale FWIW).

For summay digest subscription: bogofilter-digest-subscribe at aotto.com



More information about the Bogofilter mailing list