Automatic site training

Steffen Nissen lukesky at diku.dk
Mon Feb 7 19:55:44 CET 2005


Hi,

I have run bogofilter for a while now and have had very good results
with it. Unfortunately the hard disk on my server crashed and took my
bogofilter scripts with it (I know I should have had a backup, but ...).

Now I have installed bogofilter again and I thought that it might be a
good idea to use all the bells and whistles in my new setup.

My general idea is to have each user create a ham and a spam maildir on
the server and move any wrongly classified mail into the appropriate
one. I will then run a script each week or so that trains on these
files, so that the users do not have to do this themselves.
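To make the idea concrete, here is a rough sketch of what such a weekly
script could look like. The folder names train-ham and train-spam, the
helper names, and the assumption that the script runs with access to the
right wordlist are all mine and purely illustrative. The only bogofilter
options used are -s (register as spam), -n (register as non-spam) and
-I (read the message from a file).

```python
import subprocess
from pathlib import Path

# Hypothetical folder names inside each user's home; adjust to the real layout.
HAM_DIR = "train-ham"
SPAM_DIR = "train-spam"


def maildir_messages(maildir):
    """Return the message files in a maildir's cur/ and new/ subdirectories."""
    maildir = Path(maildir)
    files = []
    for sub in ("cur", "new"):
        directory = maildir / sub
        if directory.is_dir():
            files.extend(p for p in directory.iterdir() if p.is_file())
    return sorted(files)


def register_command(path, is_spam):
    """Build the bogofilter call that registers one message."""
    # -s registers the message as spam, -n as non-spam; -I names the input file.
    return ["bogofilter", "-s" if is_spam else "-n", "-I", str(path)]


def train_user(home, run=subprocess.run):
    """Register every message in one user's correction folders; return counts."""
    counts = {"ham": 0, "spam": 0}
    for label, folder, is_spam in (("ham", HAM_DIR, False),
                                   ("spam", SPAM_DIR, True)):
        for message in maildir_messages(Path(home) / folder):
            run(register_command(message, is_spam), check=True)
            counts[label] += 1
    return counts
```

A cron job could call train_user() once per home directory. Moving or
deleting the messages afterwards, so they are not registered again the
following week, is left out of this sketch.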

I thought of training with one full training iteration and then a few
iterations with train-on-error.
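For the train-on-error part, the sketch below shows roughly what I have
in mind: classify each message, and register it only if bogofilter gets
it wrong. It relies on bogofilter's exit codes (0 spam, 1 non-spam,
2 unsure, 3 error). Counting "unsure" as an error, the limit of three
passes, and the function names are my own assumptions and not anything
bogofilter prescribes. The initial full pass would simply register every
message once before this loop is run.

```python
import subprocess

# bogofilter exit codes: 0 = spam, 1 = non-spam, 2 = unsure, 3 = error.
SPAM, HAM, UNSURE, ERROR = 0, 1, 2, 3


def classify(path):
    """Classify one message file and return bogofilter's exit code."""
    return subprocess.run(["bogofilter", "-I", str(path)]).returncode


def register(path, is_spam):
    """Register one message as spam (-s) or non-spam (-n)."""
    flag = "-s" if is_spam else "-n"
    subprocess.run(["bogofilter", flag, "-I", str(path)], check=True)


def train_on_error(messages, max_passes=3, classify=classify, register=register):
    """Run train-on-error passes over (path, is_spam) pairs.

    Returns the number of registrations made in each pass and stops
    early once a pass needs no corrections.
    """
    history = []
    for _ in range(max_passes):
        corrected = 0
        for path, is_spam in messages:
            expected = SPAM if is_spam else HAM
            code = classify(path)
            if code == ERROR:
                raise RuntimeError(f"bogofilter failed on {path}")
            if code != expected:  # wrong or unsure: train on this message
                register(path, is_spam)
                corrected += 1
        history.append(corrected)
        if corrected == 0:
            break
    return history
```

The returned list gives a quick feel for convergence: if the number of
corrections does not drop from one pass to the next, more passes are
unlikely to help.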

My question is then: are any of you running a similar setup, what are
your experiences with it, and do you think it is a good idea at all?
Also, does anyone have scripts that do something like this?

On a side note, there will only be a few users on the server, and they
all have a pretty high level of tech knowledge, so there will be no
problems with people not being able to copy wrongly classified mails to
the appropriate folders, nor will there be any problems with training
taking too long.

All comments and suggestions are welcome.

-- 
Steffen Nissen
Project Administrator - Fast Artificial Neural Network Library (fann)
http://fann.sf.net




