small lexer change

David Relson relson at osagesoftware.com
Tue Oct 22 13:34:58 CEST 2002


Just a heads up...

In 0.7.5, the token pattern uses only two control characters, '\n' and '\r' 
as token delimiters.  I have changed the three occurrences of \n\r to 
[:cntrl:] so that all control characters will be token delimiters.  For 
example, my word list used to contain "^T^T^T^T.co.kr" as one of its 
tokens.  Having rebuilt the word lists, I now have "co.kr" which is better.

David
--------------------------------------------------------
David Relson                   Osage Software Systems, Inc.
relson at osagesoftware.com       Ann Arbor, MI 48103
www.osagesoftware.com          tel:  734.821.8800





More information about the Bogofilter mailing list