lexer change

David Relson relson at osagesoftware.com
Tue Nov 4 16:26:41 CET 2003


On Tue, 04 Nov 2003 15:44:29 +0100
Boris 'pi' Piwinger <3.14 at logic.univie.ac.at> wrote:

> David Relson wrote:

...[snip]...

> > 2 - acceptance of digits at the beginning of tokens and acceptance
> > of numbers as tokens
> > 
> >     Rejected.  I don't see value in this change.
> >
> > 3 - acceptance of two character tokens.
> > 
> >     Rejected pending further evaluation.
> 
> The was the idea. People should test it. Attached is a patch
> for those who want to play to be applied *after* David's patch.
> 
> > 4 - Removal of the {1,70} repetition count in the TOKEN pattern.
> > 
> >     Accepted.  This is the biggie!  
> > 
> >     With this change the generated lexer_v3.c file shrinks from 1.8M
> >     to
> > 1.2M and a stripped bogofilter executable shrinks from 1.8M to 1.4M.
> 
> Great, I did not expect my changes would be that useful:-))

Nor did I, although this isn't the first time I've seen big changes in
lexer_v3.c caused by seemingly insignificant changes in lexer_v3.l. 
Anyhow, I'm pleased with the effect.




More information about the Bogofilter mailing list