Minimal rules [was: Test with different lexers]

David Relson relson at osagesoftware.com
Tue Dec 2 18:15:28 CET 2003


Tom and pi,

If you're really intent on simplifying the lexer, reduce the rules to a
single rule that uses only whitespace as the delimiter.  That would
show what happens without any of the current special rules for
processing HTML, ignoring binary attachments, and tagging header lines.
I'd be interested in hearing how much smaller it is, how much faster it
is, and what its scoring performance is.
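
To make the idea concrete, here is a rough sketch of what "whitespace only"
tokenizing means.  It is written in Python for illustration and is not
bogofilter's flex lexer; the function name whitespace_tokens is
hypothetical.  Note that HTML tags and header names come through as
ordinary tokens, since nothing treats them specially.

```python
def whitespace_tokens(text):
    """Split text into tokens, treating runs of whitespace as the only delimiter.

    Hypothetical helper for illustration; no HTML handling, no attachment
    skipping, and no header tagging is performed.
    """
    # str.split() with no arguments splits on any run of whitespace
    # (spaces, tabs, newlines) and drops empty strings.
    return text.split()


if __name__ == "__main__":
    sample = "Subject: cheap meds\n\n<b>Buy</b>   now!\n"
    for token in whitespace_tokens(sample):
        print(token)
```

Punctuation stays attached to the words here ("Subject:", "now!"), which is
exactly the behavior a single whitespace rule would give.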

David



