How to deal with extremely high spam levels

Bob Vincent bobvin at pillars.net
Fri Jul 9 18:29:48 EDT 2004


On Wed, Jun 23, 2004 at 08:41:41AM -0400, Tom Anderson wrote:
> Doing exhaustive training should move hams and spams out away from
> 0.5.

Promised I'd report back once I got things back under control, so here goes.

I'm running bogominitrain once a day to update my database.

If training doesn't converge within 3 or 4 runs, the cause is usually
a misclassified message.  So when that happens, I restore the database
from before the run, correct the error, and re-run bogominitrain.
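
The post doesn't say how the database gets saved and restored, so the
following is only a minimal sketch of one way to do it: copy the whole
database directory aside before a training run and copy it back if the
run goes wrong.  The function names and both directory arguments are
hypothetical, not part of bogofilter.

```python
import shutil
from pathlib import Path


def copy_tree(src, dst):
    """Replace dst with a fresh copy of the directory src."""
    src, dst = Path(src), Path(dst)
    if not src.is_dir():
        raise FileNotFoundError(f"no such directory: {src}")
    if dst.exists():
        shutil.rmtree(dst)  # drop the stale copy first
    shutil.copytree(src, dst)


def snapshot(db_dir, backup_dir):
    """Save the database directory before a training run."""
    copy_tree(db_dir, backup_dir)


def restore(db_dir, backup_dir):
    """Roll the database directory back to the last snapshot."""
    copy_tree(backup_dir, db_dir)
```

Call snapshot() before the daily run, and restore() before fixing the
misfiled message and training again.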

Once a week, I delete my database, register all of my hams (I still
don't have over 1000 of them), and run bogominitrain again.  Then I
cat all the bogominitrain.spam.* files together, sort by date, and
overwrite my spam folder with them.
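
The merge step could be sketched in Python roughly as below.  This
assumes the bogominitrain.spam.* files and the spam folder are all in
mbox format and that "sort by date" means sorting on each message's
Date: header; merge_sorted and message_date are made-up names.

```python
import glob
import mailbox
import os
from datetime import datetime, timezone
from email.utils import parsedate_to_datetime

# Messages with a missing or unparsable Date: header sort first.
EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def message_date(msg):
    """Return the Date: header as a timezone-aware datetime."""
    header = msg["Date"]
    if header is None:
        return EPOCH
    try:
        parsed = parsedate_to_datetime(str(header))
    except (TypeError, ValueError):
        return EPOCH
    if parsed.tzinfo is None:  # e.g. a "-0000" zone
        parsed = parsed.replace(tzinfo=timezone.utc)
    return parsed


def merge_sorted(pattern, dest_path):
    """Merge all mboxes matching pattern into dest_path, oldest first."""
    messages = []
    for path in sorted(glob.glob(pattern)):
        box = mailbox.mbox(path, create=False)
        try:
            messages.extend(box)  # iterating an mbox yields its messages
        finally:
            box.close()
    if not messages:
        return 0  # nothing matched; leave the existing folder alone
    messages.sort(key=message_date)

    # Write to a temporary file, then swap it in, so a failure
    # partway through cannot leave a half-written spam folder.
    tmp_path = os.fspath(dest_path) + ".tmp"
    if os.path.exists(tmp_path):
        os.remove(tmp_path)
    out = mailbox.mbox(tmp_path)
    try:
        for msg in messages:
            out.add(msg)
    finally:
        out.close()
    os.replace(tmp_path, dest_path)
    return len(messages)
```

For example, merge_sorted("bogominitrain.spam.*", "spam") would
overwrite a folder named "spam" in the current directory; that folder
name is a placeholder.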

This keeps my ham:spam ratio pretty close to 1:1 and also lets me
throw away over 90% of my spams without losing training accuracy.
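
A quick way to check the ratio after the weekly rebuild, again assuming
both folders are mbox files (the function names are hypothetical):

```python
import mailbox


def count_messages(mbox_path):
    """Count the messages in one mbox file."""
    box = mailbox.mbox(mbox_path, create=False)
    try:
        return len(box)
    finally:
        box.close()


def ham_spam_ratio(ham_path, spam_path):
    """Return hams divided by spams; 1.0 means a 1:1 corpus."""
    hams = count_messages(ham_path)
    spams = count_messages(spam_path)
    if spams == 0:
        raise ValueError("spam mailbox is empty")
    return hams / spams
```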

Now I just gotta write some scripts to automate this process.

Thanks for all the help, all!

