BASE64 [was: various]
Boris 'pi' Piwinger
3.14 at logic.univie.ac.at
Wed Oct 23 09:16:53 CEST 2002
Matthias Andree <matthias.andree at gmx.de> wrote:
>I've given some thought to whether lex would be smart enough to
>parse the MIME structure while still emitting delimiters as tokens, and
>to decode base64 and qp on the fly. I believe that
>Content-Transfer-Encoding should be taken into account because there are
>words (particularly with 4, 8, 12 characters) that make up valid base64
>words;
Sounds good.
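
To illustrate the ambiguity Matthias describes, here is a minimal Python sketch (not bogofilter code; the helper name looks_like_base64 is made up for this example). It shows that ordinary words whose length is a multiple of four pass a strict base64 check, so the token alone cannot tell us whether to decode it; only the Content-Transfer-Encoding header can.

```python
import base64
import binascii


def looks_like_base64(word: str) -> bool:
    """Return True if `word` decodes cleanly under strict base64 rules.

    Hypothetical helper for illustration only.
    """
    try:
        # validate=True rejects characters outside the base64 alphabet;
        # a length that is not a multiple of 4 fails with a padding error.
        base64.b64decode(word, validate=True)
        return True
    except (binascii.Error, ValueError):
        return False


# Plain English words of length 4, 8 and 12 are accepted,
# while words of other lengths or with punctuation are rejected.
for word in ["spam", "password", "unsubscribed", "filter", "Viagra!!"]:
    print(word, looks_like_base64(word))
```

A tokenizer that guessed from the text alone would therefore risk "decoding" normal words into garbage, which is the argument for consulting the header first.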
>OTOH, we should decide if we're actually interested in more than
>the headers of MIME parts that have Content-Type other than text/*.
I am not sure what you think about multipart/*. It may
well contain text parts, but you can find them anyway.
>I believe we should only parse text/*, convert things to UTF-8 (which will
>bring another dependency, iconv, jconv, whatever), and only read headers
>of other content types. As someone (I believe Boris) has pointed out,
>spammers won't use base64 to deceive spam traps because that will render
>the mail unreadable.
I once posted some statistics about the use of base64. To my
great surprise, spammers used it quite significantly.
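
The policy Matthias proposes above (decode text/* parts, keep only the headers of everything else, and look inside multipart/* containers for text parts) could be sketched roughly as follows. This uses Python's standard email package purely as an illustration; it is not how bogofilter's lexer works, and the function name split_message and the latin-1 fallback are assumptions of this sketch.

```python
from email import message_from_bytes, policy


def split_message(raw: bytes):
    """Return (texts, other_headers) for a raw RFC 2822 message.

    texts:         decoded bodies of all text/* leaf parts, as str
    other_headers: header lists of all non-text leaf parts
    Hypothetical helper for illustration only.
    """
    msg = message_from_bytes(raw, policy=policy.default)
    texts, other_headers = [], []
    for part in msg.walk():
        if part.is_multipart():
            # Containers carry no body of their own; walk() visits
            # their children, so nested text parts are still found.
            continue
        if part.get_content_maintype() == "text":
            try:
                # Undoes base64/qp and applies the declared charset.
                texts.append(part.get_content())
            except (LookupError, UnicodeError):
                # Unknown charset: fall back to latin-1 (an assumption).
                payload = part.get_payload(decode=True) or b""
                texts.append(payload.decode("latin-1"))
        else:
            # Non-text part: keep the headers, ignore the body.
            other_headers.append([(k, str(v)) for k, v in part.items()])
    return texts, other_headers
```

The returned strings are Unicode; a caller that wants UTF-8 bytes can call .encode("utf-8") on each of them. Because the transfer encoding is undone before tokenizing, base64-encoded spam bodies are seen in the clear.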
pi