DOCTYPE [was: [PATCH] ... the evasive message discussed on the list]

David Relson relson at osagesoftware.com
Sat Nov 8 13:50:22 CET 2003


On Sat, 08 Nov 2003 12:47:50 +0100
Matthias Andree <matthias.andree at gmx.de> wrote:

> David Relson <relson at osagesoftware.com> writes:
> 
> > 	Content-Type: text/html
> >
> > 	<!DOCTYPE ...>
> >
> > Are other people seeing this same behavior?  If so, please let us
> > know.
> 
> That <!DOCTYPE ...> is regular SGML, what would be interesting enough
> to have it reported? It is the canoncial way to specify which HTML
> standard the document is supposed to conform to.

Hi Matthias,

My test results don't change at all if <!DOCTYPE...> is used as an html
indicator.  The DOCTYPE directive is pretty common in my incoming mail,
specifically in my "unsures".  Recognizing it has minimal impact on
bogofilter's size and speed and follows the principle of least surprise.
 

Anyhow, for the above reasons, I'm inclined to make the lexer change.

What do you think?

David






More information about the Bogofilter mailing list