boundary tokens
David Relson
relson at osagesoftware.com
Fri Dec 20 14:53:38 CET 2002
Matthias,
I was looking at my wordlists and noticed that, right at the beginning of
each (as dumped by bogoutil) there are a lot of "boundary" symbols (as
returned from lexer.l).
Approx. 99% of them have counts of 1. The longest is 94 characters. My
spamlist has 1050 of them and goodlist has 2200 of them.
I don't think we need them. What would happen if lexer stopped returning them?
David
More information about the bogofilter-dev
mailing list