defining empty lines.

David Relson relson at osagesoftware.com
Wed May 28 13:44:48 CEST 2003


At 07:15 AM 5/28/03, Matthias Andree wrote:
>David Relson <relson at osagesoftware.com> writes:
>
> > RFC2822 specifies "The body is simply a sequence of characters that
> > follows the header and is separated from the header by an empty line
> > (i.e., a line with nothing preceding the CRLF).
> >
> > Jeremy Blosser has encountered many spam messages where "\b\r\n" appears
> > in this position.  Bogofilter is looking for the truly empty lines for
> > writing out the "X-Bogosity" line (in passthrough mode) and gets it
> > wrong for these messages.
>
>Well, \b isn't exactly whitespace, it's a control, and so, you're
>looking at a message without body. However, as spam is trimmed for the
>Winbloze mailers, what does Outlook Express 6.X display on such
>messages? What does Netscape 7.x display?

Matthias,

As I recall, the '\b' was the message _after_ procmail,etc had processed 
it.  Additional monitoring found that some message had a separator line 
with a single blank character, 0x20, on it.  I found several of them in my 
incoming mail (from early April).  All were from the same source.

David






More information about the Bogofilter mailing list