RFC-2047 & encoded QP text

Boris 'pi' Piwinger 3.14 at logic.univie.ac.at
Mon Jul 28 10:22:48 CEST 2003


Matthias Andree wrote:

>> The "\?" is not only standards compliant but necessary.  Junior has
>> email that kills bogofilter without it because the lexer keeps on
>> reading an encoded attachment and runs out of buffer space.  With the
>> "\?", the lexer works much better.
> 
>> Regarding spaces, if necessary we could try something like "[^\t\r\n\?]"
>> and see what happens.  Let's wait until there's a clear need.

No, there MUST NOT be any non-ASCII character.

> An encoded word does not continue past the end of the line; this must be
> accounted for.

Exactly. I think this would save us from any danger.
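
To make the point concrete, here is a minimal sketch in Python (not
bogofilter's actual flex rule). The name ENCODED_WORD and the helper
find_encoded_words are made up for illustration. The character class
admits only printable ASCII other than "?", so whitespace, line breaks and
non-ASCII bytes all end the match and an unterminated encoded word cannot
swallow the lines that follow it.

```python
import re

# Hypothetical pattern for illustration, not bogofilter's lexer rule.
# [!->@-~] is printable ASCII (0x21-0x7E) without "?" (0x3F): no spaces,
# no line breaks, no non-ASCII characters.
ENCODED_WORD = re.compile(r"=\?[!->@-~]+\?[bBqQ]\?[!->@-~]*\?=")

def find_encoded_words(text):
    """Return every well-formed encoded word found in text, in order."""
    return ENCODED_WORD.findall(text)

# A well-formed encoded word on a single line is recognized.
good = "Summary: =?ISO-8859-1?Q?w=FCrmer=3F?="

# An unterminated one is not matched, and the scan never crosses the
# line break into the data that follows.
broken = "Summary: =?ISO-8859-1?Q?no terminator\nAAAA?=\n"

print(find_encoded_words(good))
print(find_encoded_words(broken))
```

With these inputs the first call finds the single encoded word and the
second finds nothing, which is the behaviour we want from the lexer.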

> We will also need to take care that we remove linear
> white space between two encoded words, so that:
> 
> Summary: =?ISO-8859-1?Q?Regen?=
>   =?ISO-8859-1?Q?w=FCrmer=3F?=
> 
> yields {Summary; Regenwürmer} rather than {Summary; Regen; würmer}.

Yep.
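
The joining step could look roughly like the following Python sketch. It
is an assumption about one way to do it, not what bogofilter does; GAP,
decode_word and decode_header_value are illustrative names. White space is
dropped only when it sits directly between the "?=" that closes one encoded
word and the "=?" that opens the next, so ordinary text next to an encoded
word keeps its spacing.

```python
import base64
import quopri
import re

# Hypothetical helpers for illustration only.
ENCODED_WORD = re.compile(r"=\?([!->@-~]+)\?([bBqQ])\?([!->@-~]*)\?=")
# Linear white space (including a folded line break) between "?=" and "=?".
GAP = re.compile(r"(?<=\?=)[ \t\r\n]+(?==\?)")

def decode_word(match):
    """Decode one encoded word; return it unchanged if decoding fails."""
    charset, encoding, payload = match.groups()
    raw = payload.encode("ascii")
    try:
        if encoding.upper() == "B":
            data = base64.b64decode(raw)
        else:
            # header=True makes "_" decode to a space, as the Q encoding requires.
            data = quopri.decodestring(raw, header=True)
        return data.decode(charset, errors="replace")
    except (LookupError, ValueError):
        # Unknown charset or malformed payload: leave the text as it was.
        return match.group(0)

def decode_header_value(value):
    """Join adjacent encoded words, then decode each of them."""
    joined = GAP.sub("", value)
    return ENCODED_WORD.sub(decode_word, joined)

header = "Summary: =?ISO-8859-1?Q?Regen?=\n  =?ISO-8859-1?Q?w=FCrmer=3F?="
print(decode_header_value(header))
```

The decoded text still carries the trailing "?" from "=3F"; dropping it is
left to the tokenizer, which then sees one word instead of two.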

> This is necessary so spammers can't split up their tokens at will to
> hide them from bogofilter's view.

Actually, many normal mail readers also split words like this.
Sometimes this is necessary.

pi




