FAQ: How to train
David Relson
relson at osagesoftware.com
Wed Jul 30 21:38:54 CEST 2003
At 02:47 PM 7/30/03, Peter Bishop wrote:
>Could you train on "near to error"
>i.e. if spam is near to the spam cut-off point, add it to the database
>even though the classification is correct. (ditto for ham)
>This might make the database less sensitive to extra messages
Seem like making small changes to the values of spam_cutoff and ham_cutoff
would achieve the same effect. No?
>Might be easier to do this woth the Robinson algorithm which has a more
>linear spamicity range.(e.g. spam cutoff=0.54, add spam if <0.60)
The linear quality of the Robinson algorithm _would_ be valuable for
this. This is a bit awkward for someone who's run bogotune and has Fisher
values for the cutoffs -- it would be necessary to rerun bogotune to get
Robinson values :-(
More information about the Bogofilter
mailing list