training to exhaustion and the risk of overvaluing irrelevant tokens
Matthias Andree
matthias.andree at gmx.de
Fri Aug 15 13:43:28 CEST 2003
Boris 'pi' Piwinger <3.14 at logic.univie.ac.at> writes:
> But the problem described above is independent of this
> double-training. It (overvaluing irrelevant tokens) can
> happen with or without double training, just by the choice
> of messages to train with. But as I argue above, fewer
> problems are to be expected with repeated training than
> with just one run.
Well, if you received a spam message, i.e. a bag of tokens, twice, then
registering it twice is the right thing to do, isn't it?
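
To make the point concrete, here is a minimal sketch of what "registering a
bag of tokens twice" does to per-token counts. It is an illustration only:
the TokenDB class, its method names, and the sample messages are hypothetical
and are not bogofilter's actual database layout or scoring code.

```python
from collections import Counter


class TokenDB:
    """Toy token store (hypothetical, not bogofilter's real database)."""

    def __init__(self):
        self.spam_counts = Counter()  # token -> number of spam messages containing it
        self.spam_msgs = 0            # number of spam messages registered

    def register_spam(self, tokens):
        # Treat the message as a bag of distinct tokens: each token is
        # counted once per registered message.
        self.spam_counts.update(set(tokens))
        self.spam_msgs += 1

    def spam_fraction(self, token):
        # Fraction of registered spam messages that contained the token.
        if self.spam_msgs == 0:
            return 0.0
        return self.spam_counts[token] / self.spam_msgs


db = TokenDB()
duplicate = "cheap pills online".split()

# The same spam arrives twice, so it is registered twice.
db.register_spam(duplicate)
db.register_spam(duplicate)

# A different spam arrives once.
db.register_spam("cheap watches".split())

print(db.spam_counts["pills"], db.spam_counts["cheap"], db.spam_msgs)
```

In this sketch the counts simply reflect how often each token was seen in the
mail that actually arrived, duplicates included.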
--
Matthias Andree