bogofilter (-o) & bogoutil (-c) bugs?

David Relson relson at osagesoftware.com
Mon Feb 17 18:25:17 CET 2003


At 12:05 PM 2/17/03, W M Brelsford wrote:

>On Mon Feb 17 2003 at 10:53 AM -0500, David Relson wrote:
> > There's _no_ need for keeping 0-count tokens.  They just take up
> > space.  They'll go away.
>
>Good.
>
> > Have you observed any other details that need attending to?
>
>No, that was my list.  But you didn't answer my last question
>regarding -o without -C: don't command-line arguments override
>configuration file settings?

Bill,

I overlooked that one.  Guess I better get my eyeglasses checked.

Since bogofilter doesn't track whether values come from the command line or 
from the config file, it uses the last value it sees.  Thus if spam_cutoff 
is specified in bogo.cf, "bogofilter -c bogo.cf -o .44 -qv" will give 
different results than "bogofilter -o .44 -c bogo.cf -qv".  When the code 
was written, I couldn't think of a good way (read "simple, clean, and 
elegant") to code a precedence like you suggest.  Have you any ideas on 
coding it?

>Has any thought been given to an option to specify additional
>regular expressions for delimiting tokens?  My wife tends to use a
>random number of periods to separate clauses, so replies that quote
>her message leave lots of tokens like "soon.....but".  Not a big
>deal -- I'll probably remove them periodically with dump-grep-load
>(or remove her account on my machine..).

We're using flex to generate the parsing tables.  AFAIK, it doesn't support 
runtime specified options.  Trimming replicated special characters could be 
implemented pretty easily.  Would you care to write the code?

David





More information about the Bogofilter mailing list