How to deal with extremely high spam levels
tanderso at oac-design.com
Sun Jul 11 10:30:10 EDT 2004
On Fri, 2004-07-09 at 18:29, Bob Vincent wrote:
> On Wed, Jun 23, 2004 at 08:41:41AM -0400, Tom Anderson wrote:
> > Doing exhaustive training should move hams and spams out away from
> > 0.5.
> Promised I'd report back once I got things back under control, so here goes.
> I'm running bogominitrain once a day to update my database.
> If it doesn't close off in 3 or 4 runs, it's usually a misclassified
> message. So when that happens, I restore the database, correct the
> error, and re-run bogominitrain.
> Once a week, I delete my database, register all of my hams (I still
> don't have over 1000 of them), and run bogominitrain again. Then I
> cat all the bogominitrain.spam.* files together, sort by date, and
> overwrite my spam folder with them.
> This keeps my ham::spam ratio pretty close to 1::1 and also lets me
> throw away over 90% of my spams without losing training accuracy.
> Now I just gotta write some scripts to automate this process.
Even when automated, that sounds like a complex process. Do you think
regular users will be able to do this?
More information about the Bogofilter