repetitive training

Tom Anderson tanderso at oac-design.com
Tue Mar 9 14:45:33 CET 2004


On Tue, 2004-03-09 at 06:59, Greg Louis wrote:
> Great!  A procedure that is easy to reproduce "in the lab" for testing.
> I may very likely run such an experiment.

I'd like to see the results of that.

> Is this per user?  If so, the results aren't as good as I get ("no" fn
> or fp and an average of about 5 unsures/day), but pretty close, and
> with what seems to be significantly less effort.  (By "no" I mean
> unmeasurably few, like one in 120,000 or so; for me that translates to
> one in about six weeks.)

Well, the FN/unsure numbers are mine alone.  Milage varies from user to
user, but overall I've seen a decline in FN and unsures.  However, the
main reason they are still relatively high is because I'm very resistant
to tuning.  I only use small increments at a time and no automatic
tuning.  Since I never get any hams in my unsures, I can lower both the
spam_cutoff and ham_cutoff a bit more to reduce both FN and unsures.  I
go at least a week or two between changes in these numbers, usually
longer, and only change them 0.1-0.5.  Of the FN and unsures I receive,
they are all of two types: really long messages with many paragraphs of
plain english (as I've described here before), and virii with no body
and only an attachment.  If I killed the virii in procmail, my FN and
unsure numbers would decrease significantly.  I'm still holding out to
see if I can get bogofilter to filter these though.  I'm also in the
process of writing a prefilter that will apply some heuristics to the
headers to help polarize them, as too many hammy tokens are contained in
the x-headers, dates, etc., of spam.

Tom

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 189 bytes
Desc: This is a digitally signed message part
URL: <http://www.bogofilter.org/pipermail/bogofilter/attachments/20040309/824746bb/attachment.sig>


More information about the Bogofilter mailing list