how to bogotune?
Trevor Smith
trevor at haligonian.com
Wed Sep 29 17:34:15 CEST 2004
Man, bogotune is very difficult to figure out. The man page is confusing since
it does not clearly state what it means by "wordlist" and "message files".
After numerous readings I think I have figured out what it wants: I'm
assuming that the "wordlist" is just my wordlist.db that I've built over
months of using bogofilter and that the "message files" are some group of
emails that I have separated into spam and nonspam categories. Is this close
to correct?
Next question:
If I have already trained bogofilter with the messages in question, can
bogotune work on them? Or does that screw it up?
Final question:
The man page appears to say it wants 500+ messages each of spam and ham, with
2000+ each of ham and spam in the wordlist. I certainly have more than 2000
of each in my wordlist, and I fed it ~1000 messages of each, but it complained
about "low number / uniformity" of messages and produced no useful results
(as far as I can tell).
My guess is that the messages I feed in must NOT be already trained, or else
they're all going to score 1.0000 or 0.0000 (and that would make sense). Is
this correct?
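If that guess is right, I would presumably have to set some messages aside before training, so that the tuning set is made of messages the wordlist has never seen. A minimal sketch of what I mean, in Python. The function name split_holdout, the 50/50 split, and the fixed seed are my own assumptions for illustration; nothing here comes from the bogofilter or bogotune documentation, and it does not call either tool.

```python
import random


def split_holdout(messages, holdout_fraction=0.5, seed=0):
    """Shuffle messages and split them into (train, holdout) lists.

    Hypothetical helper: 'train' would be registered with the filter,
    'holdout' would be kept untrained and used only for tuning.
    """
    if not 0.0 < holdout_fraction < 1.0:
        raise ValueError("holdout_fraction must be strictly between 0 and 1")
    shuffled = list(messages)              # copy, so the caller's list is untouched
    random.Random(seed).shuffle(shuffled)  # seeded, so the split is repeatable
    cut = int(len(shuffled) * (1.0 - holdout_fraction))
    return shuffled[:cut], shuffled[cut:]


# Example with stand-in message identifiers instead of real emails.
spam_ids = ["spam-%04d" % i for i in range(1000)]
spam_train, spam_holdout = split_holdout(spam_ids)
print(len(spam_train), len(spam_holdout))
```

The same split would be done separately for spam and for ham, so that each held-out set stays above whatever minimum count the tuning step needs.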
--
Trevor Smith // trevor at haligonian.com