bogofilter (-o) & bogoutil (-c) bugs?

Nick Simicich njs at scifi.squawk.com
Tue Feb 18 12:14:17 CET 2003


At 12:25 PM 2003-02-17 -0500, David Relson wrote:

>We're using flex to generate the parsing tables.  AFAIK, it doesn't 
>support runtime specified options.  Trimming replicated special characters 
>could be implemented pretty easily.  Would you care to write the code?

The key to that is "states", which is the same scheme I used to track in 
and out of tags and tokens.  You can have multiple schemes that reference 
multiple regular repressions in the parsing, and control which ones you use 
through the use of states.  States can be used to control what the options 
do.  In fact, if it were an absolute requirement not to reorder the html 
world, I would handle it with states.

--
SPAM: Trademark for spiced, chopped ham manufactured by Hormel.
spam: Unsolicited, Bulk E-mail, where e-mail can be interpreted generally 
to mean electronic messages designed to be read by an individual, and it 
can include Usenet, SMS, AIM, etc.  But if it is not all three of 
Unsolicited, Bulk, and E-mail, it simply is not spam. Misusing the term 
plays into the hands of the spammers, since it causes confusion, and 
spammers thrive on  confusion. Spam is not speech, it is an action, like 
theft, or vandalism. If you were not confused, would you patronize a spammer?
Nick Simicich - njs at scifi.squawk.com - http://scifi.squawk.com/njs.html
Stop by and light up the world!



More information about the Bogofilter mailing list