Ignore lists [was: Keeping the cruft out ...]

Pavel Kankovsky peak at argo.troja.mff.cuni.cz
Fri Mar 5 01:24:23 CET 2004


On Thu, 4 Mar 2004, David Relson wrote:

> It _could_ be done that way, but it seems messy.  The old code required
> the existance of an ignore.db file (that could be built from a simple
> text file using bogoutil).  An alternate approach would be to simply
> read the simple text file.

You'd still have to do two lookups: one in the list of ignored tokens (or
perhaps in some in-memory structure built from the text file), the other
in the main db. It depends on the expected size of the list. If it is
supposed to be really tiny (say < 1 page) then the extra overhead needed
to read a file and build a hash table (?) would be negligible.

> Having "ignored" tokens in the regular database calls for adding an
> "ignore" flag or using an unlikely value (such as 0xFFFFFFFF).  I expect
> special checks would quickly spread throughout bogofilter, which would
> be bad.

I think it might be possible to hide the implementation details in the
datastore layer (maybe in the datastore "middleware", e.g. datastore.c).
But I am aware you know the innards of Bogofilter better than I. :)

--Pavel Kankovsky aka Peak  [ Boycott Microsoft--http://www.vcnet.com/bms ]
"Resistance is futile. Open your source code and prepare for assimilation."





More information about the Bogofilter mailing list