bogotrain
Dave Lovelace
dave at firstcomp.biz
Fri Dec 19 14:05:16 CET 2003
I ran bogotrain, and it ran for many hours. Then, when it finally was
done, here's what it produced:
>
> The wordlist contains 17589 non-spam and 1500 spam messages.
> Bogotune must be run with at least 2000 of each.
> The wordlist has a ratio of spam to non-spam of 0.1 to 1.0.
> Bogotune requires the ratio be in the range of 0.2 to 5.
>
I don't know how to check how many messages of each kind there are,
so as to know in advance whether bogotune will be happy with any wordlist
I build. This is the wordlist I built from mail I had on hand when I
uprev'd bogofilter.
But the big question is: why should it take over 12 hours for bogotune
to find out that the wordlist is unacceptable? Surely this is something
it can check quickly at the outset?
--
- Dave Lovelace
dave at firstcomp.biz
davel at cyberspace.org
More information about the Bogofilter
mailing list